Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sravanthi pulijala
sravanthib
Follow
AI & ML interests
None yet
Recent Activity
published
a model
about 17 hours ago
sravanthib/Qwen-math-open-RL
published
a model
about 18 hours ago
sravanthib/Qwen-math-Simple-RL
updated
a model
1 day ago
sravanthib/qwen-32b-multinode-try
View all activity
Organizations
None yet
sravanthib
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
a model
about 17 hours ago
sravanthib/Qwen-math-open-RL
Updated
about 17 hours ago
published
a model
about 18 hours ago
sravanthib/Qwen-math-Simple-RL
Updated
about 18 hours ago
updated
a model
1 day ago
sravanthib/qwen-32b-multinode-try
Updated
1 day ago
published
2 models
1 day ago
sravanthib/qwen-32b-multinode-try
Updated
1 day ago
sravanthib/new-multinode-try
Updated
1 day ago
updated
a model
2 days ago
sravanthib/multinode-try
Updated
2 days ago
published
a model
2 days ago
sravanthib/multinode-try
Updated
2 days ago
updated
2 models
2 days ago
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test
Updated
2 days ago
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL
Updated
2 days ago
published
a model
2 days ago
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL
Updated
2 days ago
published
a model
3 days ago
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test
Updated
2 days ago
updated
a model
3 days ago
sravanthib/with_accelarate_output_Qwen2-0.5B-GRPO-test
Updated
3 days ago
published
a model
3 days ago
sravanthib/single_node_llama_custom-code-test
Updated
3 days ago
updated
a model
4 days ago
sravanthib/Final-try-Llama3.1-8b-instruct-RL
Text Generation
•
Updated
4 days ago
•
63
published
6 models
4 days ago
sravanthib/grpo-output
Updated
4 days ago
sravanthib/Final-try-Llama3.1-8b-instruct-RL
Text Generation
•
Updated
4 days ago
•
63
sravanthib/Llama-Simple-RL
Updated
4 days ago
sravanthib/Simple-RL
Updated
4 days ago
sravanthib/SFT_and_RL_final-Simple-RL
Updated
4 days ago
sravanthib/llama-3b-Simple-RL
Updated
4 days ago
Load more