Uploaded finetuned model

Developed by: CompassioninMachineLearning
License: apache-2.0
Finetuned from model: CompassioninMachineLearning/negaiplateau_plusgrpo

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Safetensors

Model size

8B params

Tensor type

BF16

Model tree for CompassioninMachineLearning/Instruct6knegaiplateau_plusAIgrpo

Unable to build the model tree, the base model loops to the model itself. Learn more.