GRPO reasoning embedded in a custom Prem-1B model
ucalyptus/prem-663ff8769efa4d3700ba14e5
ucalyptus/prem-1B-grpo