Orpo finetuned models
Muhammad Bin Usman
Muhammad2003
AI & ML interests
- Model Alignment (SFT / DPO / ORPO )
- Model Merging / Pruning / MoE + latest tecniques
- Instruction tuning and Preference datasets curation
- Evaluation
Recent Activity
updated
a model
about 1 month ago
Muhammad2003/Llama3-LegalLM
published
a model
about 1 month ago
Muhammad2003/Llama3-LegalLM
upvoted
a
collection
8 months ago
haiku
Organizations
models
21

Muhammad2003/Llama3-LegalLM
Updated
•
1

Muhammad2003/router-classifier
Text Classification
•
Updated
•
10

Muhammad2003/router-embedding
Sentence Similarity
•
Updated
•
1
•
1

Muhammad2003/TriMistral-7B-TIES
Text Generation
•
Updated
•
2

Muhammad2003/TriMistral-7B-SLERP
Text Generation
•
Updated
•
7

Muhammad2003/TriMistral-7B-MODELSTOCK
Text Generation
•
Updated
•
4

Muhammad2003/TriMistral-7B-DARETIES
Text Generation
•
Updated

Muhammad2003/Llama-3-8B-DPO-500
Text Generation
•
Updated

Muhammad2003/Llama-3-8B-DPO-1500
Text Generation
•
Updated
•
2

Muhammad2003/Llama-3-8B-DPO-1000
Text Generation
•
Updated
•
3
datasets
7
Muhammad2003/routing-dataset
Viewer
•
Updated
•
14.3k
•
25
Muhammad2003/OpenMed_11k_train
Viewer
•
Updated
•
11.3k
•
25
Muhammad2003/OpenMed_11k
Viewer
•
Updated
•
11.7k
•
18
Muhammad2003/GrandMed_364k
Viewer
•
Updated
•
364k
•
20
Muhammad2003/Nectar-DPO-50k
Viewer
•
Updated
•
50k
•
33
Muhammad2003/Big_Pretrain_11K
Viewer
•
Updated
•
11.7k
•
25
Muhammad2003/Toxic_PreTrain_8k
Viewer
•
Updated
•
8.41k
•
22