2 132 63

Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

Recent Activity

upvoted a paper about 14 hours ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

upvoted a paper about 14 hours ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

upvoted a paper about 14 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

View all activity

Organizations

rbiswasfc's activity

upvoted 4 papers about 14 hours ago

upvoted 2 collections about 14 hours ago

OpenR1-Math

Collection

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 2 items • Updated about 20 hours ago • 2

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 20 hours ago • 49

upvoted a paper about 14 hours ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published 3 days ago • 9

upvoted an article 1 day ago

Article

Open R1: Update #2

and 6 others •

1 day ago

• 125

upvoted a paper 2 days ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 7 days ago • 15

upvoted an article 2 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

15 days ago

• 710

upvoted 3 papers 2 days ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 6 days ago • 49

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 6 days ago • 44

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

upvoted a paper 4 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 83

upvoted an article 8 days ago

Article

Open-R1: Update #1

and 7 others •

10 days ago

• 270

liked a dataset 11 days ago

cais/hle

Viewer • Updated about 7 hours ago • 2.7k • 4.43k • 215

liked a model 12 days ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated 12 days ago • 9.83k • 208

liked a model 13 days ago

Qwen/Qwen2.5-32B-Instruct

Text Generation • Updated Sep 25, 2024 • 384k • 193

upvoted 2 papers 16 days ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published 21 days ago • 39

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 28 days ago • 54