2 2 19

Zhicheng Wang

Dicer

https://blog.dicer.fun

Dicer-Zz

AI & ML interests

NLP

Recent Activity

updated a model 16 days ago

Dicer/ppo-Huggy

published a model 16 days ago

Dicer/ppo-Huggy

updated a model 16 days ago

Dicer/ppo-LunarLander-v2

View all activity

Organizations

Dicer's activity

updated a model 16 days ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated 16 days ago • 130

published a model 16 days ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated 16 days ago • 130

updated a model 16 days ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated 16 days ago • 5

published a model 16 days ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated 16 days ago • 5

upvoted 2 articles 21 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 70

Article

Vision Language Models Explained

Apr 11, 2024

• 285

liked 5 datasets 5 months ago

liked a model 6 months ago

XLabs-AI/flux-controlnet-collections

Text-to-Image • Updated Aug 30, 2024 • 43.5k • 453

liked a Space 12 months ago

5.06k

MTEB Leaderboard

🥇

Select benchmarks and languages for text embeddings evaluation

liked a model about 1 year ago

openbmb/MiniCPM-2B-sft-fp32

Text Generation • Updated Sep 7, 2024 • 506 • 296

liked a dataset about 1 year ago

bigscience/P3

Viewer • Updated Mar 4, 2024 • 122M • 98.3k • 214

liked a model about 1 year ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • Updated Sep 27, 2024 • 3.89M • • 2.68k

liked a dataset over 1 year ago

Muennighoff/natural-instructions

Viewer • Updated Dec 23, 2022 • 7.15M • 1.78k • 61

liked 2 models almost 2 years ago

databricks/dolly-v2-12b

Text Generation • Updated Jun 30, 2023 • 5.01k • 1.96k

huggyllama/llama-13b

Text Generation • Updated Apr 7, 2023 • 5.95k • 139

liked a dataset almost 2 years ago

RyokoAI/ShareGPT52K

Preview • Updated Apr 2, 2023 • 331 • 315