Motoki Wu's picture

Motoki Wu

tokestermw

·

https://motoki.co

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

sesame/csm-1b

upvoted a collection about 20 hours ago

Gemma 3 Release

upvoted an article about 20 hours ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

View all activity

Organizations

tokestermw's activity

upvoted a collection about 20 hours ago

Gemma 3 Release

9 items • Updated about 3 hours ago • 217

upvoted an article about 20 hours ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

2 days ago

• 206

upvoted a collection 8 days ago

Light-R1

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated about 12 hours ago • 9

upvoted a collection 9 days ago

Hallucination detection

Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated 8 days ago • 15

upvoted a paper 13 days ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published 16 days ago • 25

upvoted a paper 15 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 18 days ago • 27

upvoted a paper 16 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 16 days ago • 68

upvoted 3 papers 17 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 126

InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback

Paper • 2502.15027 • Published 21 days ago • 7

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published 22 days ago • 30

upvoted a collection 18 days ago

Sky-T1-7B

A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated 28 days ago • 6

upvoted a collection 22 days ago

Process Reward Models

Model and Datasets for Qwen 2.5 Math PRM 7B • 6 items • Updated 23 days ago • 2

upvoted a paper 24 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 27 days ago • 32

upvoted a paper 28 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 29 days ago • 46

upvoted 2 papers about 1 month ago

Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 22

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Paper • 2502.04689 • Published Feb 7 • 7

upvoted an article about 1 month ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 202

upvoted 3 papers about 1 month ago

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published Feb 6 • 23

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3 • 24

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 25