Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ucalyptus
/
prem-1B-grpo
like
3
Text Generation
Transformers
Safetensors
openai/gsm8k
English
llama
math
reasoning
grpo
gsm8k
reinforcement-learning
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
prem-1B-grpo
/
generation_config.json
ucalyptus
Upload LlamaForCausalLM
4d4870d
verified
7 days ago
raw
Copy download link
history
blame
contribute
delete
Safe
111 Bytes
{
"_from_model_config"
:
true
,
"bos_token_id"
:
1
,
"eos_token_id"
:
2
,
"transformers_version"
:
"4.48.2"
}