Update README.md
README.md
CHANGED
@@ -4,7 +4,8 @@ datasets:
 ---
 # Platypus2-70B-instruct-4bit-gptq
 
-Platypus2-70B-instruct-4bit-gptq is a quantized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ quantization
+Platypus2-70B-instruct-4bit-gptq is a quantized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ quantization.
+The model is only 35 GiB in size, compared with 127 GiB for the original garage-bAInd/Platypus2-70B-instruct, and can run on a single GPU.
 
 ### Benchmark Metrics
 