Update README.md
README.md CHANGED
@@ -35,7 +35,7 @@ This model was built via parameter-efficient finetuning of the [meta-llama/Llama
 
 ## Model Sources
 
-- **Repository:** [here](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/llama/
+- **Repository:** [here](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/llama/sft_Llama_2_13B_Instruct_v0_2_peft.ipynb)
 
 ## Evaluation Results
 
@@ -202,7 +202,7 @@ print(tokenizer.decode(input_ids[0]))
 
 We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune LLMs on instruction-following datasets.
 
-See [here](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/
+See [here](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/llama/sft_Llama_2_13B_Instruct_v0_2_peft.ipynb) for the finetuning code, which contains an exhaustive view of the hyperparameters employed.
 
 The following `TrainingArguments` config was used:
 
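The hunk above references `trl`'s `SFTTrainer` and a `TrainingArguments` config. For orientation, here is a minimal sketch of that workflow, assuming the `SFTTrainer` API as it existed in `trl` at the time (string model ids, `dataset_text_field`, `max_seq_length`); the dataset name, base model id, LoRA settings, and `TrainingArguments` values are hypothetical placeholders, not the configuration from the linked notebook.

```python
# Minimal sketch of instruction finetuning with trl's SFTTrainer.
# All concrete values (dataset, LoRA config, hyperparameters) are
# illustrative placeholders; the real ones live in the linked notebook.
from datasets import load_dataset
from peft import LoraConfig
from transformers import TrainingArguments
from trl import SFTTrainer

# Hypothetical instruction-following dataset with a "text" column.
train_dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

# Parameter-efficient finetuning via LoRA adapters (placeholder values).
peft_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.1,
    task_type="CAUSAL_LM",
)

# Placeholder TrainingArguments; the README lists the actual config used.
training_args = TrainingArguments(
    output_dir="./llama-2-13b-instruct-peft",
    num_train_epochs=1,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    logging_steps=10,
)

# SFTTrainer wires the base model, dataset, and PEFT config together.
trainer = SFTTrainer(
    model="meta-llama/Llama-2-13b-hf",  # assumed base model for this card
    args=training_args,
    train_dataset=train_dataset,
    peft_config=peft_config,
    dataset_text_field="text",
    max_seq_length=1024,
)
trainer.train()
```

Passing a model id string lets `SFTTrainer` load the base weights itself, and supplying `peft_config` makes it wrap the model in LoRA adapters so that only the adapter parameters are trained.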