忍者

byteprobe

AI & ML interests

RL | NLP | LLM | multimodal | agent

Recent Activity

upvoted a paper about 5 hours ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

upvoted a paper about 18 hours ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

liked a Space about 18 hours ago

nanotron/ultrascale-playbook

View all activity

Organizations

byteprobe's activity

upvoted a paper about 5 hours ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 21 days ago • 85

upvoted a paper about 18 hours ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97

liked a Space about 18 hours ago

2.23k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 datasets about 19 hours ago

TIGER-Lab/MMLU-Pro

Viewer • Updated Nov 27, 2024 • 12.1k • 43.8k • 330

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated 7 days ago • 251k • 5.36k • 148

upvoted a paper about 19 hours ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 29 days ago • 184

liked a model about 19 hours ago

tencent/HunyuanVideo-I2V

Image-to-Video • Updated about 6 hours ago • 2.1k • 243

liked a dataset about 19 hours ago

gaia-benchmark/GAIA

Updated 28 days ago • 8.94k • 257

liked a model about 19 hours ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 1 day ago • 38.5k • 442

upvoted 3 papers about 19 hours ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 27 days ago • 103

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 6 days ago • 103

upvoted 5 papers 1 day ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 21 days ago • 162

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted a paper 2 days ago

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published 28 days ago • 39

upvoted an article 2 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

22 days ago

• 205

updated a collection 2 days ago

AI-Generated Text Detection

Collection

Paper collections about AI-generated text detection. • 13 items • Updated 2 days ago • 2