The training process consisted of three distinct stages:

- Only the attention mechanism was trained; all other components (MLP layers, embeddings, heads) were frozen
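Attention-only training like this is usually done by toggling `requires_grad` per parameter name. A minimal PyTorch sketch follows; `TinyBlock` and the `.att`/`.ffn` names are illustrative stand-ins for RWKV's time-mixing and channel-mixing sub-modules, not the project's actual code.

```python
import torch
import torch.nn as nn

class TinyBlock(nn.Module):
    """Toy block with an attention-like path and an MLP-like path."""
    def __init__(self, dim=8):
        super().__init__()
        self.att = nn.Linear(dim, dim)  # stand-in for the attention (time-mixing) path
        self.ffn = nn.Linear(dim, dim)  # stand-in for the MLP (channel-mixing) path

model = nn.Sequential(TinyBlock(), TinyBlock())

# Unfreeze only parameters that live under an `.att.` sub-module;
# MLP layers (and, in a full model, embeddings and heads) stay frozen.
for name, p in model.named_parameters():
    p.requires_grad = ".att." in name

# Hand the optimizer only the trainable subset.
optimizer = torch.optim.Adam(p for p in model.parameters() if p.requires_grad)

trainable = [n for n, p in model.named_parameters() if p.requires_grad]
```

Filtering by parameter name keeps the recipe robust to model depth: adding more blocks changes nothing in the freezing logic.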

### Stage 3: Supervised Fine-Tuning (Using RWKV-LM-RLHF)

- Utilized a distillation dataset of 900K samples (Chinese, Japanese, English)
- Smoothed Loss for faster convergence
- Implemented Variable Rank PEFT to enhance training efficiency
- Bone (Block Affine Transformation), r=512+
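The variable-rank idea above can be sketched with a LoRA-style adapter whose rank differs per layer. Everything here is illustrative, not the RWKV-LM-RLHF implementation: the `LoRALinear` class, the rank schedule, and the dimensions are assumptions, and Bone itself replaces the low-rank product with block-affine updates rather than this `A`/`B` factorization.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base weight plus a trainable low-rank update of configurable rank r."""
    def __init__(self, dim, r):
        super().__init__()
        self.base = nn.Linear(dim, dim, bias=False)
        self.base.weight.requires_grad = False           # base weight stays frozen
        self.A = nn.Parameter(torch.randn(r, dim) * 0.01)
        self.B = nn.Parameter(torch.zeros(dim, r))       # zero-init: update starts at zero

    def forward(self, x):
        # Base projection plus the low-rank correction x A^T B^T.
        return self.base(x) + x @ self.A.t() @ self.B.t()

# Hypothetical per-layer rank schedule: spend more adapter capacity on later layers.
ranks = [64, 128, 256, 512]
layers = nn.ModuleList(LoRALinear(1024, r) for r in ranks)

# Trainable adapter parameters: 2 * dim * r per layer.
adapter_params = sum(p.numel() for l in layers for p in (l.A, l.B))
```

Varying the rank per layer lets the parameter budget concentrate where adaptation helps most, instead of paying a uniform r everywhere.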