Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
SURESHBEEKHANI
/
Gemma_2B_Medical_ORPO_RLHF_Fine_Tuning
like
0
Question Answering
GGUF
SURESHBEEKHANI/medical-reasoning-orpo
English
gemma
conversational
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Gemma_2B_Medical_ORPO_RLHF_Fine_Tuning
1 contributor
History:
7 commits
SURESHBEEKHANI
Update README.md
3a43993
verified
about 2 months ago
.gitattributes
Safe
1.69 kB
(Trained with Unsloth)
about 2 months ago
README.md
Safe
3.18 kB
Update README.md
about 2 months ago
config.json
Safe
29 Bytes
(Trained with Unsloth)
about 2 months ago
unsloth.Q4_K_M.gguf
1.63 GB
LFS
(Trained with Unsloth)
about 2 months ago
unsloth.Q5_K_M.gguf
1.84 GB
LFS
(Trained with Unsloth)
about 2 months ago
unsloth.Q8_0.gguf
2.67 GB
LFS
(Trained with Unsloth)
about 2 months ago