87 60 282

Lee Junbum PRO

beomi

https://junbuml.ee

AI & ML interests

AI/ML GDE. Advancing Low-Resource Language Open Access LLM

Recent Activity

liked a dataset about 13 hours ago

DigitalLearningGmbH/MATH-lighteval

liked a dataset 2 days ago

exp-models/korean-reasoning-mixture-20250203-preview

liked a dataset 2 days ago

open-thoughts/OpenThoughts-114k

View all activity

Organizations

beomi's activity

upvoted a paper 5 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 6 days ago • 11

upvoted a paper 6 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

upvoted a paper 8 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 99

upvoted an article 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

15 days ago

• 322

upvoted a paper 12 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 12 days ago • 51

upvoted a paper 13 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 13 days ago • 51

upvoted a paper 20 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 20 days ago • 315

upvoted a paper 25 days ago

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published 28 days ago • 32

upvoted a paper 28 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 28 days ago • 273

upvoted a paper 2 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 77

upvoted 2 collections 2 months ago

EXAONE-3.5

Collection

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 91

Llama 3.3 (All Versions)

Collection

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 8 days ago • 35

upvoted 4 papers 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

upvoted a collection 2 months ago

OLMo 2

Collection

Artifacts for the second set of OLMo models. • 22 items • Updated 1 day ago • 82

upvoted 3 papers 3 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 11

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 37

Style-Friendly SNR Sampler for Style-Driven Generation

Paper • 2411.14793 • Published Nov 22, 2024 • 36