Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
***
free126
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
about 18 hours ago
free126/Qwen2-0.5B-GRPO-test
published
a model
about 18 hours ago
free126/Qwen2-0.5B-GRPO-test
commented
on
an
article
1 day ago
From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning
View all activity
Organizations
None yet
models
3
Sort: Recently updated
free126/Qwen2-0.5B-GRPO-test
Updated
about 18 hours ago
free126/OrpoLlama-3-8B
Updated
May 7, 2024
free126/bert-finetuned-squad
Question Answering
•
Updated
Feb 21, 2024
•
127
datasets
None public yet