TheBloke/CodeLlama-13B-GPTQ

TheBloke committed on Aug 26, 2023

Commit 3a41a13 · 1 Parent(s): dfbe832

Initial GPTQ model commit

Files changed (1)

README.md +3 -3

README.md CHANGED

@@ -47,10 +47,10 @@ Multiple GPTQ parameter permutations are provided; see Provided Files below for
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference (deprecated)](https://huggingface.co/TheBloke/CodeLlama-13B-GGML)
 * [Meta's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/codellama/CodeLlama-13b-hf)
-## Prompt template: TBC
+## Prompt template: None
 ```
-Info on prompt template will be added shortly.
+{prompt}
 ```
 ## Provided files and GPTQ parameters
@@ -159,7 +159,7 @@ model = AutoGPTQForCausalLM.from_quantized(model_name_or_path,
 """
 prompt = "Tell me about AI"
-prompt_template=f'''Info on prompt template will be added shortly.
+prompt_template=f'''{prompt}
 '''
 print("\n\n*** Generate:")
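
A minimal sketch of what the second hunk means in practice, assuming only the two variables shown in the diff: with the "None" template the f-string contains nothing but `{prompt}`, so the text passed on for generation is the raw prompt followed by a single newline. This is an illustration of the string formatting only, not of the model-loading code elsewhere in the README.

```python
# Illustrative only: reproduces the two README lines touched by this commit.
prompt = "Tell me about AI"

# After the commit, the template is just the prompt itself plus a trailing newline.
prompt_template = f'''{prompt}
'''

# repr() makes the trailing newline visible.
print(repr(prompt_template))
```

Because no wrapper text is added, anything such as instructions or code context has to be written into `prompt` itself.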