Sagarkrishna commited on
Commit
f60c100
·
verified ·
1 Parent(s): be92f50

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +82 -0
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-sa-4.0
3
+ metrics:
4
+ - accuracy
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - code
8
+ ---
9
+
10
+ A capable language model for text to SQL generation for Postgres, Redshift and Snowflake that is on-par with the most capable generalist frontier models.
11
+
12
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/603bbad3fd770a9997b57cb6/h52Z_OKYBaDDQMFZyU5pF.png)
13
+
14
+ ## Model Description
15
+
16
+ Developed by: Defog, Inc
17
+ Model type: [Text to SQL]
18
+ License: [CC-by-SA-4.0]
19
+ Finetuned from model: [Meta-Llama-3-8B-Instruct]
20
+
21
+ ## meta-llama/Meta-Llama-3-8B-Instruct for CTranslate2
22
+
23
+ **The model is quantized version of the [defog/llama-3-sqlcoder-8b](https://huggingface.co/defog/llama-3-sqlcoder-8b) with int8_float16 quantization and can be used in [CTranslate2](https://github.com/OpenNMT/CTranslate2).**
24
+
25
+ ## Conversion details
26
+
27
+ The original model was converted on 2024-4 with the following command:
28
+ ```
29
+ ct2-transformers-converter --model Path\To\Local\meta-llama\Meta-Llama-3-8B-Instruct \
30
+ --quantization int8_float16 --output_dir Meta-Llama-3-8B-Instruct-ct2-int8_float16
31
+ ```
32
+
33
+ ## How to use
34
+
35
+ This repository for use with [CTranslate2](https://github.com/OpenNMT/CTranslate2).
36
+
37
+ ### Use with CTranslate2
38
+
39
+ This example code is obtained from [CTranslate2_transformers](https://opennmt.net/CTranslate2/guides/transformers.html#mpt) and [tokenizer AutoTokenizer](https://huggingface.co/docs/transformers/main_classes/tokenizer).
40
+ More detailed information about the `generate_batch` methon can be found at [CTranslate2_Generator.generate_batch](https://opennmt.net/CTranslate2/python/ctranslate2.Generator.html#ctranslate2.Generator.generate_batch).
41
+
42
+ ```python
43
+ import ctranslate2
44
+ import transformers
45
+
46
+ from huggingface_hub import snapshot_download
47
+ model_id = "SagarKrishna/Llama-3-8B-Text2SQL_Instruct-ct2-int8_float16"
48
+ model_path = snapshot_download(model_id)
49
+ model = ctranslate2.Generator(model_path)
50
+ tokenizer = transformers.AutoTokenizer.from_pretrained(model_id)
51
+
52
+ messages = [
53
+ {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
54
+ {"role": "user", "content": "Who are you?"},
55
+ ]
56
+
57
+ input_ids = tokenizer.apply_chat_template(
58
+ messages,
59
+ tokenize=False,
60
+ add_generation_prompt=True
61
+ )
62
+
63
+ terminators = [
64
+ tokenizer.eos_token_id,
65
+ tokenizer.convert_tokens_to_ids("<|eot_id|>")
66
+ ]
67
+
68
+ input_tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(input_ids))
69
+
70
+ results = model.generate_batch([input_tokens], include_prompt_in_result=False, max_length=256, sampling_temperature=0.6, sampling_topp=0.9, end_token=terminators)
71
+ output = tokenizer.decode(results[0].sequences_ids[0])
72
+
73
+ print(output)
74
+ ```
75
+
76
+ ## Ideal prompt and inference parameters
77
+ Set temperature to 0, and do not do sampling.
78
+
79
+ ## Evaluation
80
+ This model was evaluated on SQL-Eval, a PostgreSQL based evaluation framework developed by Defog for testing and alignment of model capabilities.
81
+
82
+ You can read more about the methodology behind SQLEval [here](https://defog.ai/blog/open-sourcing-sqleval/).