Yi Cui

onekq

AI & ML interests

Benchmark, Code Generation Model

Recent Activity

updated a model 1 day ago
onekq/outputs
published a model 1 day ago
onekq/outputs
updated a collection 2 days ago
R1 Reproduction Works
View all activity

Organizations

MLX Community's profile picture ONEKQ AI's profile picture

Posts 13

view post
Post
1656
o3-mini is slightly better than R1, but lags behind Claude. Sorry folks, no new SOTA ๐Ÿ˜•

But OAI definitely owns the fashion of API. temperature and top_p are history now, reasoning_effort will be copied by other vendors.

onekq-ai/WebApp1K-models-leaderboard

Articles 2

Article
4

Does Daily Software Engineering Work Need Reasoning Models?

datasets

None public yet