Yury Panikov

panikov

AI & ML interests

None yet

Recent Activity

commented on a paper 2 days ago

Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

upvoted a paper 2 days ago

Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

upvoted a paper 3 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Organizations

None yet

panikov's activity

commented on a paper 2 days ago

Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries

Paper • 2502.20475 • Published 14 days ago • 2

commented on a paper 3 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 7 days ago • 8

commented on 2 papers 6 days ago

Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer

Paper • 2503.02495 • Published 9 days ago • 8

FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Paper • 2503.04222 • Published 7 days ago • 13

commented on a paper 19 days ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 21 days ago • 85

commented on 2 papers 21 days ago

AIDE: AI-Driven Exploration in the Space of Code

Paper • 2502.13138 • Published 23 days ago • 7

ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation

Paper • 2502.13581 • Published 22 days ago • 5

commented on a paper 23 days ago

Large Language Models and Mathematical Reasoning Failures

Paper • 2502.11574 • Published 24 days ago • 3

commented on a paper 24 days ago

We Can't Understand AI Using our Existing Vocabulary

Paper • 2502.07586 • Published about 1 month ago • 10

commented on a paper 30 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9 • 34

commented on 8 papers about 1 month ago

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published Feb 7 • 43

Value-Based Deep RL Scales Predictably

Paper • 2502.04327 • Published Feb 6 • 6

Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression

Paper • 2502.04296 • Published Feb 6 • 6

Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach

Paper • 2502.03639 • Published Feb 5 • 9

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published Feb 3 • 9

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published Jan 30 • 25

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published Jan 31 • 10

Unraveling the Capabilities of Language Models in News Summarization

Paper • 2501.18128 • Published Jan 30 • 4

New activity in LinguaLift/IndicMMLU-Pro about 1 month ago

Sanskrit support

#4 opened about 1 month ago by panikov

commented on a paper about 1 month ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published Jan 27 • 18