Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

upvoted an article about 9 hours ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

updated a collection about 12 hours ago

AV LLMs

liked a model about 12 hours ago

Zyphra/Zonos-v0.1-transformer

sugatoray's activity

upvoted an article about 9 hours ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Article • 7 days ago • 6

upvoted 2 articles about 12 hours ago

Open R1: Update #2

Article • and 6 others • 1 day ago • 120

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • 1 day ago • 21

upvoted an article 2 days ago

How to deploy and fine-tune DeepSeek models on AWS

Article • 13 days ago • 40

upvoted a paper 2 days ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 7 days ago • 15

upvoted an article 3 days ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • 8 days ago • 41

upvoted 3 papers 3 days ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 6 days ago • 49

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 6 days ago • 11

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Paper • 2502.01081 • Published 9 days ago • 12

upvoted a collection 4 days ago

Hibiki fr-en

Collection • Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 5 days ago • 45

upvoted a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

upvoted an article 6 days ago

Fine-tune ModernBERT for RAG with Synthetic Data

Article • and 2 others • 22 days ago • 35

upvoted 2 papers 7 days ago

Explaining Large Language Models Decisions Using Shapley Values

Paper • 2404.01332 • Published Mar 29, 2024 • 1

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 8 days ago • 34

upvoted an article 7 days ago

Open-source DeepResearch – Freeing our search agents

Article • 8 days ago • 919

upvoted a paper 8 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 99

upvoted 4 collections 8 days ago