Update README.md
README.md
CHANGED
@@ -29,7 +29,6 @@ We used a mixture of the following datasets
 
 ### luxia-21.4b-alignment model
 We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).
-After DPO training, we linearly merged models to boost performance.
 
 We used a mixture of the following datasets
 - jondurbin/truthy-dpo-v0.1
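
For context, DPO trains the policy to rank chosen responses above rejected ones relative to a frozen reference model. Below is a minimal PyTorch sketch of the standard DPO objective the README refers to; it is an illustration, not the authors' training code, and the log-probability inputs and the `beta` default are assumptions:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO loss over per-example summed log-probs.

    beta=0.1 is a common default, assumed here; the README
    does not state the value actually used.
    """
    chosen_ratio = policy_chosen_logps - ref_chosen_logps
    rejected_ratio = policy_rejected_logps - ref_rejected_logps
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(x)) == softplus(-x), numerically stable
    return F.softplus(-logits).mean()
```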
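The removed line mentioned linear model merging. As a generic sketch of what linear weight merging usually means (uniform parameter averaging across checkpoints; the actual checkpoints and mixing weights are not stated in the README):

```python
import torch

def linear_merge(state_dicts, weights=None):
    """Linearly interpolate parameters across checkpoints.

    A generic sketch under assumed uniform weights; the
    README does not specify which models were merged.
    """
    if weights is None:
        weights = [1.0 / len(state_dicts)] * len(state_dicts)
    merged = {}
    for key in state_dicts[0]:
        merged[key] = sum(w * sd[key].float()
                          for w, sd in zip(weights, state_dicts))
    return merged
```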