UsernameJustAnother commited on
Commit
3bbb11b
·
verified ·
1 Parent(s): 2d5f367

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -17,6 +17,33 @@ tags:
17
  - **License:** apache-2.0
18
  - **Finetuned from model :** unsloth/Mistral-Nemo-Instruct-2407
19
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
 
22
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
17
  - **License:** apache-2.0
18
  - **Finetuned from model :** unsloth/Mistral-Nemo-Instruct-2407
19
 
20
+ Finetune with secret sauce dataset, rsLoRA, r = 64, on an Colab A100 instance. 30GB vRAM used, 2 epochs ~ 3hrs of training.
21
+
22
+ ```
23
+ r = 64,
24
+ target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
25
+ "gate_proj", "up_proj", "down_proj",],
26
+ lora_alpha = 64,
27
+ lora_dropout = 0, # Supports any, but = 0 is optimized
28
+ bias = "none", # Supports any, but = "none" is optimized
29
+ use_gradient_checkpointing = "unsloth", # True or "unsloth" for very long context
30
+ random_state = 3407,
31
+ use_rslora = True, # lora_alpha --> 8
32
+ loftq_config = None,
33
+
34
+ per_device_train_batch_size = 2,
35
+ gradient_accumulation_steps = 4,
36
+ warmup_steps = 5,
37
+ num_train_epochs = 2,
38
+ learning_rate = 2e-5, # down from 2e-4, could go down to (5e-5 then 1e-5)
39
+ fp16 = not is_bfloat16_supported(),
40
+ bf16 = is_bfloat16_supported(),
41
+ logging_steps = 1,
42
+ optim = "adamw_8bit",
43
+ weight_decay = 0.01,
44
+ lr_scheduler_type = "linear",
45
+ seed = 3407,
46
+ ```
47
  This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
48
 
49
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)