nbeerbower
/

Dumpling-Qwen2.5-7B-1k-r16

Text Generation

text-generation-inference

Model card Files Files and versions Community

nbeerbower commited on Feb 1

Commit

efe9220

·

verified ·

1 Parent(s): e5a0cfd

Update README.md

Files changed (1) hide show

README.md +21 -1

README.md CHANGED Viewed

@@ -42,4 +42,24 @@ base_model:
 ### Method
-[QLoRA ORPO tune](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html) with 2x RTX 3090 for 2 epochs.

 ### Method
+[QLoRA ORPO tune](https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html) with 2x RTX 3090 for 2 epochs.
+```
+# QLoRA config
+bnb_config = BitsAndBytesConfig(
+    load_in_4bit=True,
+    bnb_4bit_quant_type="nf4",
+    bnb_4bit_compute_dtype=torch_dtype,
+    bnb_4bit_use_double_quant=True,
+)
+# LoRA config
+peft_config = LoraConfig(
+    r=16,
+    lora_alpha=32,
+    lora_dropout=0.05,
+    bias="none",
+    task_type="CAUSAL_LM",
+    target_modules=['up_proj', 'down_proj', 'gate_proj', 'k_proj', 'q_proj', 'v_proj', 'o_proj']
+)
+```