End of training
Files changed:
- README.md (+21 -1)
- adapter_model.bin (+1 -1)
README.md CHANGED
@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -111,7 +112,9 @@ fsdp_config:
 
 # mixtral-lora
 
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1746
 
 ## Model description
 
@@ -144,6 +147,23 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 2
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 3.397         | 0.0   | 1    | 3.2822          |
+| 0.1294        | 0.2   | 67   | 0.2029          |
+| 0.1664        | 0.4   | 134  | 0.1918          |
+| 0.1742        | 0.6   | 201  | 0.1853          |
+| 0.163         | 0.8   | 268  | 0.1827          |
+| 0.1537        | 1.0   | 335  | 0.1798          |
+| 0.1056        | 1.19  | 402  | 0.1781          |
+| 0.1688        | 1.39  | 469  | 0.1765          |
+| 0.1187        | 1.59  | 536  | 0.1752          |
+| 0.1823        | 1.79  | 603  | 0.1748          |
+| 0.1022        | 1.99  | 670  | 0.1746          |
+
+
 ### Framework versions
 
 - PEFT 0.8.2
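The model card above references PEFT 0.8.2 and the Mixtral base model. As a minimal sketch (not part of the commit itself), the resulting LoRA adapter could be loaded roughly as follows; the adapter repo id `your-username/mixtral-lora` is a placeholder, not the actual repository name:

```python
# Minimal sketch: apply the LoRA adapter (adapter_model.bin) on top of the base model.
# The adapter repo id below is hypothetical -- substitute the real repository.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"
adapter_id = "your-username/mixtral-lora"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base_model, adapter_id)  # loads the LoRA weights
```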
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:e26eed223bbbbaefc98dd7881d82cf2636f97ef84b71fee3376696995f7eb6ce
 size 969596450
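For reference, the `oid sha256:` line in this Git LFS pointer is the SHA-256 digest of the actual adapter_model.bin payload. A quick sketch, assuming the binary has been downloaded to the current directory, to check a local copy against it:

```python
# Sketch: verify a downloaded adapter_model.bin against the sha256 oid
# recorded in its git-lfs pointer. Adjust the path as needed.
import hashlib

EXPECTED = "e26eed223bbbbaefc98dd7881d82cf2636f97ef84b71fee3376696995f7eb6ce"

h = hashlib.sha256()
with open("adapter_model.bin", "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # read in 1 MiB chunks
        h.update(chunk)

print("match" if h.hexdigest() == EXPECTED else "mismatch")
```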