Andrew's picture

5 4 8

Andrew

WpythonW

·

AI & ML interests

Love LLMs

Recent Activity

published a model 16 days ago

WpythonW/gemma-text-to-sql

new activity about 1 month ago

nanonets/Nanonets-OCR-s:vLLM compatibility issue with nanonets/Nanonets-OCR-s: Processor initialization conflict

liked a model about 1 month ago

Qwen/Qwen3-Embedding-0.6B

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 137

upvoted 2 papers 6 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 146

upvoted a collection 6 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 15 days ago • 522