Aurélien-Morgan CLAUDON

Aurelien-Morgan

https://huggingface.co/retrain-pipelines

Aurelien-Morgan's activity

New activity in huggingface/HuggingDiscussions about 9 hours ago

[FEEDBACK] Daily Papers

#32 opened 8 months ago by kramp

upvoted an article 2 days ago

Article

Getting started with Hugging Face Inference Endpoints

Oct 14, 2022

• 1

updated a Space 2 days ago

README

📈

upvoted a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

upvoted a paper 6 days ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published 14 days ago • 23

liked a Space 6 days ago

Chat with DeepSeek-VL2-small

🌍

Generate text based on images and prompts

upvoted a paper 9 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 98

reacted to Kseniase's post with ❤️ 9 days ago

Post

8 Free Sources on Reinforcement Learning

DeepSeek-R1's strong reasoning capabilities showed us all the true power of RL. At its core, RL is a type of machine learning in which a model/agent learns to make decisions by interacting with an environment to maximize a reward. It learns through trial and error, receiving feedback in the form of rewards or penalties.
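
To make the trial-and-error idea concrete, here is a minimal sketch of an epsilon-greedy agent on a multi-armed bandit, one of the simplest RL settings. It is only an illustration, not something from the post: the function name `run_bandit`, the reward probabilities, and the `epsilon` value are all assumptions chosen for the example.

```python
import random


def run_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent on a Bernoulli bandit (illustrative sketch).

    true_means: hypothetical reward probability of each arm, hidden from the agent.
    Returns the agent's value estimates, pull counts, and total reward.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was tried
    estimates = [0.0] * n_arms   # running average reward per arm
    total_reward = 0.0

    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment responds with a reward of 1 or 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental mean update: move the estimate toward the observed reward.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = run_bandit([0.2, 0.5, 0.8])
    print("estimates:", [round(e, 2) for e in estimates])
    print("pull counts:", counts)
    print("total reward:", total)
```

The agent never sees `true_means`; it only observes rewards and gradually shifts its pulls toward the arm that pays off most often, which is the feedback loop the paragraph above describes.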

Here's a list of free sources that will help you dive into RL and how to use it:

1. "Reinforcement Learning: An Introduction" book by Richard S. Sutton and Andrew G. Barto -> https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf

2. Hugging Face Deep Reinforcement Learning Course -> https://huggingface.co/learn/deep-rl-course/unit0/introduction
You'll learn how to train agents in unique environments using the best libraries, share your results, compete in challenges, and earn a certificate.

3. OpenAI Spinning Up in Deep RL -> https://spinningup.openai.com/en/latest/index.html
A comprehensive overview of RL with many useful resources.

4. "Reinforcement Learning and Optimal Control" books, video lectures and course material by Dimitri P. Bertsekas from ASU -> https://web.mit.edu/dimitrib/www/RLbook.html
Covers approximate dynamic programming (DP) and RL, including key concepts and methods such as rollout, tree search, and neural network training.

5. RL Course by David Silver (Google DeepMind) -> https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PLqYmG7hTraZDM-OYHWgPeb
Many recommend these video lectures as a good foundation.

6. RL theory seminars -> https://sites.google.com/view/rltheoryseminars/home?authuser=0
Virtual seminars in which different experts present RL advancements.

7. "Reinforcement Learning Specialization" (a 4-course series on Coursera) -> https://www.coursera.org/learn/fundament

8. Concepts: RLHF, RLAIF, RLEF, RLCF -> https://www.turingpost.com/p/rl-f
Our flashcards explain these four RL approaches, which differ in the kind of feedback they use.