Shang Hong Sim

shanghong

https://shanghongsim.github.io/

AI & ML interests

Neural decoding, neuroengineering, signal processing

shanghong's activity

upvoted an article 1 day ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 70

updated a collection 27 days ago

RAG

3 items • Updated 27 days ago

upvoted a paper 27 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 28 days ago • 33

upvoted 3 collections 27 days ago

DeepSeek-R1

8 items • Updated Jan 21 • 571

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 21 days ago • 49

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 100

updated a model about 1 month ago

declare-lab/trustalign_llama2_7b

Updated about 1 month ago • 21

updated a collection about 1 month ago

Trust-Align

12 items • Updated about 1 month ago • 3

published a model about 1 month ago

declare-lab/trustalign_llama2_7b

Updated about 1 month ago • 21

updated a model about 1 month ago

declare-lab/trustalign_llama3_8b

Updated about 1 month ago • 57 • 1

updated a collection about 1 month ago

Trust-Align

12 items • Updated about 1 month ago • 3

published a model about 1 month ago

declare-lab/trustalign_llama3_8b

Updated about 1 month ago • 57 • 1

updated a model about 1 month ago

declare-lab/trustalign_qwen2.5_0.5b

Updated about 1 month ago • 24

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28 • 802

upvoted a collection about 1 month ago

Trust-Align

12 items • Updated about 1 month ago • 3

updated 2 models about 2 months ago

declare-lab/trustalign_llama3.2_3b

Updated Jan 25 • 17

declare-lab/trustalign_llama3.2_1b

Updated Jan 25 • 24 • 1