The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 11 days ago • 112
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 11 days ago • 9 • 6
Post: New Research Alert: Making Language Models Smaller & Smarter! Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes, we brought a method from computer vision to the transformers arena.
Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.
Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
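The savings from grouping can be sketched with simple parameter arithmetic (a minimal illustration; the channel width and group count below are made up for the example, not taken from the report, and the report's overall 77% figure depends on which layers are replaced):

```python
def pointwise_conv_params(c_in, c_out, groups=1, bias=True):
    """Parameter count of a pointwise (1x1) convolution with channel groups.

    With `groups` groups, each output channel mixes only c_in // groups
    input channels, so the weight count shrinks by a factor of `groups`.
    """
    assert c_in % groups == 0 and c_out % groups == 0
    weights = c_out * (c_in // groups)
    return weights + (c_out if bias else 0)

# Illustrative 1024-channel projection, split into 4 groups:
dense = pointwise_conv_params(1024, 1024, groups=1)    # 1,049,600 params
grouped = pointwise_conv_params(1024, 1024, groups=4)  # 263,168 params
print(f"saved: {1 - grouped / dense:.0%}")  # prints: saved: 75%
```

Larger group counts save more per layer, at the cost of less cross-channel mixing, which is why such designs typically interleave grouped projections with some channel-mixing step.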
Article: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • 7 days ago • 25
Open LLM Leaderboard Results PR Opener • Add results to model card from Open LLM Leaderboard • Runtime error • 50
Article: Distributed SFT with trl and DeepSpeed Part 2: Scaling Locally By jlzhou • 7 days ago • 1
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation • Updated 12 days ago • 576k • 739
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 16 days ago • 53
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 255
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30, 2024 • 76
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 78
Enhancing Training Efficiency Using Packing with Flash Attention Paper • 2407.09105 • Published Jul 12, 2024 • 15