Prithiv Sakthi's picture

Prithiv Sakthi

prithivMLmods

·

https://huggingface.co/strangerzonehf

AI & ML interests

computer vision, multimodality, adapters @starngerzonehf @strangerguardhf

Recent Activity

updated a model about 6 hours ago

prithivMLmods/Open-R1-Mini-Experimental

liked a model about 6 hours ago

prithivMLmods/Open-R1-Math-7B-Instruct

liked a dataset about 6 hours ago

prithivMLmods/Deepthink-Reasoning-Ins

View all activity

Organizations

prithivMLmods's activity

upvoted a collection about 7 hours ago

Reasoning is all you need 🌠

The garage for 14B and 7B reasoning models, listed based on benchmarks. • 24 items • Updated about 6 hours ago • 2

upvoted 2 articles 1 day ago

Article

Open R1: Update #2

By

and 6 others •

1 day ago

• 115

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

By

•

1 day ago

• 19

upvoted 2 collections 1 day ago

Vision Infer Custom

VLM - Multimodality • 2 items • Updated 17 days ago • 7

Reasoning Exp Domain Models 🧠

Based on Qwen's 2 vL • 2 items • Updated 1 day ago • 7

upvoted a paper 4 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 153

upvoted a paper 5 days ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 6 days ago • 44

upvoted 2 articles 6 days ago

Article

Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers

By

•

6 days ago

• 6

Article

Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB

By

•

15 days ago

• 15

upvoted an article 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 912

upvoted a collection 7 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 12 hours ago • 48

upvoted 2 collections 8 days ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 9 days ago • 50

Optimus Reasoning

14b, 7b reasoning models • 11 items • Updated 3 days ago • 7

upvoted 2 articles 9 days ago

Article

o3-mini & Deepseek-R1

By

•

9 days ago

• 27

Article

Open-R1: Update #1

By

and 7 others •

10 days ago

• 268

upvoted 5 papers 10 days ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 12 days ago • 22

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Paper • 2501.16609 • Published 15 days ago • 6

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published 12 days ago • 16

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 12 days ago • 17

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 13 days ago • 22