CHZY-1
/

sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

CHZY-1 commited on Oct 23, 2024

Commit

5a184fc

·

verified ·

1 Parent(s): f204fbe

Update README.md

Files changed (1) hide show

README.md +19 -11

README.md CHANGED Viewed

@@ -6,31 +6,39 @@ tags:
 - trl
 - sft
 - generated_from_trainer
 model-index:
 - name: sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
-This model is a fine-tuned version of [defog/sqlcoder-7b-2](https://huggingface.co/defog/sqlcoder-7b-2) on an unknown dataset.
-## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 - trl
 - sft
 - generated_from_trainer
+- QLora
+- peft
+- SQL
+- causal-lm
 model-index:
 - name: sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
   results: []
+language:
+- en
 ---
 # sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
+This model is a fine-tuned version of [defog/sqlcoder-7b-2](https://huggingface.co/defog/sqlcoder-7b-2) on 260 MS SQL examples (Task, Schema and Answer triplets) related to financial/banking domain.
 ## Intended uses & limitations
+MS SQL Server - SQL Query Generation
+## Training
+This model was trained using the QLoRA method with the following configurations:
+- r = 64,
+- lora_alpha = 32
+- lora_dropout = 0.05
+- bias='none'
+- task_type='CAUSAL_LM'
+Quantization parameters:
+- load_in_4bit=True
+- bnb_4bit_quant_type="nf4"
+- bnb_4bit_compute_dtype=torch.bfloat16
 ### Training hyperparameters