malhajar commited on
Commit
296e89b
·
1 Parent(s): 5884758

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -4,7 +4,8 @@ datasets:
4
  ---
5
  # Platypus2-70B-instruct-4bit-gptq
6
 
7
- Platypus2-70B-instruct-4bit-gptq is a qunatnized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ Quantnization
 
8
 
9
  ### Benchmark Metrics
10
 
 
4
  ---
5
  # Platypus2-70B-instruct-4bit-gptq
6
 
7
+ Platypus2-70B-instruct-4bit-gptq is a qunatnized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ Quantnization.
8
+ The model is only 35 GIB in size in comparision with the original garage-bAInd/Platypus2-70B-instruct 127 GIB in size and can run on a single GPU
9
 
10
  ### Benchmark Metrics
11