lhl PRO

leonardlin

·

https://randomfoo.net/

Recent Activity

updated a model about 21 hours ago

shisa-ai/gemma2Brewardmodel

published a model about 21 hours ago

shisa-ai/gemma2Brewardmodel

leonardlin's activity

upvoted a paper 3 days ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 29

upvoted 4 papers 5 days ago

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published 8 days ago • 18

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 6 days ago • 49

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 6 days ago • 44

upvoted a paper 7 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

upvoted a paper 8 days ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 12 days ago • 17

upvoted a paper 11 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 12 days ago • 25

upvoted 7 papers about 2 months ago

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 31

OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain

Paper • 2412.13018 • Published Dec 17, 2024 • 41

Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models

Paper • 2412.12606 • Published Dec 17, 2024 • 41

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published Dec 18, 2024 • 24

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 128

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 41

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 5 papers 6 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

Paper • 2408.08291 • Published Aug 15, 2024 • 11

Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields

Paper • 2408.03822 • Published Aug 7, 2024 • 14

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 54

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Paper • 2408.02085 • Published Aug 4, 2024 • 18