Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

SURESHBEEKHANI
/

llama_3_2_3B-dpo-rlhf-fine-tuning

Question Answering

Inference Endpoints

Model card Files Files and versions Community

llama_3_2_3B-dpo-rlhf-fine-tuning

1 contributor

History: 9 commits

SURESHBEEKHANI's picture

Update README.md

5e90b32 verified 22 days ago

.gitattributes

1.58 kB

(Trained with Unsloth) 22 days ago
README.md

4.63 kB

Update README.md 22 days ago
config.json

29 Bytes

(Trained with Unsloth) 22 days ago
unsloth.Q4_K_M.gguf

2.02 GB
LFS

(Trained with Unsloth) 22 days ago