Update README.md
README.md CHANGED
@@ -83,7 +83,7 @@ For ReFT, the nodes in the last 8 layers were unfrozen with attention to allow t
 
 After 3 to 4 epochs, the model began to overfit regardless of the strategies employed. Increasing both batch size and the number of epochs resulted in higher final training and evaluation cross-entropy.
 
-Following an extensive grid search, supervised fine-tuning of Llama 3.1-8B with LoRA+ and the parameters mentioned
+Following an extensive grid search, supervised fine-tuning of [Llama 3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) with LoRA+ and the parameters mentioned below yielded the best training and evaluation cross-entropy.
 
 #### Preprocessing [optional]
 
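The changed line refers to supervised fine-tuning with LoRA+. As a rough illustration only (the repository's actual training script and the "parameters mentioned below" are not shown in this diff), the sketch below sets up standard PEFT LoRA adapters on Llama 3.1-8B and then applies the defining LoRA+ trick: the `lora_B` matrices are trained with a higher learning rate than the `lora_A` matrices. The rank, learning rate, ratio, and target modules are placeholder values, not the ones used in this project.

```python
# Minimal LoRA+ sketch: PEFT LoRA adapters plus a two-group optimizer in which
# the lora_B matrices use a higher learning rate than lora_A (the core of LoRA+).
# All hyperparameters below are illustrative placeholders.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

lora_config = LoraConfig(
    r=16,                          # adapter rank (placeholder)
    lora_alpha=32,                 # scaling factor (placeholder)
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# LoRA+: split trainable parameters into A- and B-side groups,
# and train the B matrices at base_lr * ratio.
base_lr, loraplus_ratio = 2e-4, 16.0   # placeholders
a_params = [p for n, p in model.named_parameters()
            if p.requires_grad and "lora_B" not in n]
b_params = [p for n, p in model.named_parameters()
            if p.requires_grad and "lora_B" in n]
optimizer = torch.optim.AdamW([
    {"params": a_params, "lr": base_lr},
    {"params": b_params, "lr": base_lr * loraplus_ratio},
])
```

The optimizer can then be passed to whatever training loop or `Trainer` the project uses; the only LoRA+-specific piece is the unequal learning rates between the two parameter groups.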