paulo's picture

1 2 6

paulo

paulofinardi

·

finard

AI & ML interests

chatbots and recommendation systems

Recent Activity

upvoted an article 4 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

upvoted a collection 4 days ago

Deepseek Papers

commented on a paper about 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

View all activity

Organizations

paulofinardi's activity

upvoted an article 4 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

5 days ago

• 20

upvoted a collection 4 days ago

Deepseek Papers

Deepseek papers collection • 15 items • Updated 8 days ago • 60