Shikhar Singh

AxAI

axe--

AI & ML interests

Commonsense & Language Grounding

Recent Activity

upvoted a collection 1 day ago

Gemma 3 Release

liked a model 14 days ago

microsoft/Magma-8B

liked a model 16 days ago

allenai/olmOCR-7B-0225-preview

Organizations

None yet

AxAI's activity

upvoted a collection 1 day ago

Gemma 3 Release

Collection

9 items • Updated 1 day ago • 197

upvoted 4 papers 19 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 21 days ago • 85

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 21 days ago • 179

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97

upvoted a paper 20 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 22 days ago • 161

upvoted an article 21 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • 23 days ago • 65

upvoted 2 articles 28 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • Jan 23 • 152

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.16k

upvoted an article about 1 month ago

Open R1: Update #2

Article • and 6 others • about 1 month ago • 202

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 345

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 262

upvoted an article 2 months ago

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

Article • Aug 26, 2024 • 44

upvoted 7 papers 3 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

YOLO-World: Real-Time Open-Vocabulary Object Detection

Paper • 2401.17270 • Published Jan 30, 2024 • 36