Sean McLeish's picture

6 9 3

Sean McLeish PRO

smcleish

·

https://mcleish7.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

upvoted a collection 2 days ago

Recurrent Models

upvoted a paper 3 days ago

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

View all activity

Organizations

smcleish's activity

upvoted a paper 1 day ago

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Paper • 2502.06533 • Published 5 days ago • 13

upvoted a collection 2 days ago

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 14 items • Updated 5 days ago • 5

upvoted a paper 3 days ago

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Paper • 2502.06857 • Published 7 days ago • 21

upvoted a paper 5 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 7 days ago • 92

upvoted a paper 7 days ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published 8 days ago • 25

upvoted a collection 7 days ago

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 59 items • Updated 3 days ago • 4

upvoted 2 papers 8 months ago

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18

Transformers meet Neural Algorithmic Reasoners

Paper • 2406.09308 • Published Jun 13, 2024 • 44

upvoted a paper 9 months ago

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27, 2024 • 52