freewheelin
/

free-llama3-dpo-v0.2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Model Card for free-llama-dpo-v0.2

Developed by : Freewheelin AI Technical Team

Hardware and Software

Training Factors: We fine-tuned this model using the HuggingFace TRL Trainer

Method

This model was trained using the learning method introduced in the SOLAR paper.

Downloads last month: 1,630

Safetensors

Model size

8.03B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for freewheelin/free-llama3-dpo-v0.2

Quantizations

Spaces using freewheelin/free-llama3-dpo-v0.2 7