Lambent
/

qwen2.5-14B-alternate-instruct-slerp

Text Generation

text-generation-inference

Model card Files Files and versions

Lambent commited on Sep 23, 2024

Commit

dcf52e6

·

verified ·

1 Parent(s): 4f91585

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -13,6 +13,11 @@ tags:
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
 ### Merge Method
 This model was merged using the SLERP merge method.

 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
+Same idea as Lambent/qwen2.5-14B-selfmerge-A, but training the base model on an ~20M token instruct and continued pretraining dataset first.
+Hope is the lightweight instruction tuning might add some synergy with the original instruct.
 ### Merge Method
 This model was merged using the SLERP merge method.