Drishti Sharma PRO

DrishtiSharma

DrishtiShrrrma

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

DrishtiSharma/phi-gradio-logs

updated a Space 2 days ago

DrishtiSharma/patent-generator-v1

published a Space 2 days ago

DrishtiSharma/patent-generator-v1

DrishtiSharma's activity

upvoted a paper 2 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a paper 6 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 7 days ago • 84

upvoted an article 22 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • 23 days ago • 65

upvoted a paper 22 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 23 days ago • 77

upvoted 3 papers 23 days ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published 29 days ago • 18

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published 25 days ago • 29

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published 25 days ago • 22

upvoted a paper 26 days ago

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published 29 days ago • 22

upvoted 4 papers 27 days ago

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published 29 days ago • 30

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 28 days ago • 33

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 29 days ago • 41

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 29 days ago • 143

upvoted a collection 28 days ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 101

upvoted 3 papers 28 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 86

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 126

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published about 1 month ago • 51

upvoted a paper about 1 month ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 142

upvoted an article about 1 month ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • Feb 10 • 48

upvoted 2 papers about 1 month ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 60

Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights

Paper • 2403.03506 • Published Mar 6, 2024 • 1