OpenMOSE
/

PRWKV-7-Phi-4-Instruct-Preview-v0.1

Model card Files Files and versions Community

OpenMOSE commited on Mar 14

Commit

77abc16

·

1 Parent(s): aa79e79

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -45,6 +45,7 @@ The training process consisted of three distinct stages:
 ### Stage 3: Supervised Fine-Tuning (Using RWKV-LM-RLHF)
 - Utilized a distillation dataset of 900K samples
 - Implemented Variable Rank PEFT to enhance training efficiency
 - Bone(Block Affine Transformation) r=512+

 ### Stage 3: Supervised Fine-Tuning (Using RWKV-LM-RLHF)
 - Utilized a distillation dataset of 900K samples
+- Smoothed Loss for faster convergence
 - Implemented Variable Rank PEFT to enhance training efficiency
 - Bone(Block Affine Transformation) r=512+