huggingface-projects/Deep-RL-Course-Certification Viewer • Updated about 10 hours ago • 1.54k • 1.51k • 15
view post Post 266 Latest TRL release brings major upgrades for multimodal alignment!We dive into 3 new techniques to improve VLM post-training in our new blog:🌋 GRPO🎞️ GSPO🐙 MPO➕ vLLM integration for online training w/ transformers backend\🐡 Blog: https://huggingface.co/blog/trl-vlm-alignment See translation 🤗 1 1 + Reply
view post Post 2161 GPT-4.1-mini level model right in your iPhone 🤯 openbmb/MiniCPM-V-4 is only 4B while surpassing GPT-4.1-mini in vision benchmarks 🔥allows commercial use as well! See translation 🚀 4 4 + Reply
view post Post 2531 The next generation of AI-powered websites is going to be WILD! 🤯In-browser tool calling & MCP is finally here, allowing LLMs to interact with websites programmatically.To show what's possible, I built a demo using Liquid AI's new LFM2 model, powered by 🤗 Transformers.js: LiquidAI/LFM2-WebGPUAs always, the demo is open source (which you can find under the "Files" tab), so I'm excited to see how the community builds upon this! 🚀 See translation 1 reply · 🔥 3 3 👍 1 1 + Reply
view post Post 2059 OpenAI's open models are out! 💃Try: https://www.gpt-oss.com/Learn: https://huggingface.co/blog/welcome-openai-gpt-oss See translation 1 reply · 🔥 5 5 👍 1 1 + Reply
view post Post 910 we're all sleeping on this OCR model rednote-hilab/dots.ocr 🔥dots.ocr is a new 3B model with sota performance, support for 100 languages & allowing commercial use! 🤯single e2e model to extract image, convert tables, formula, and more into markdown 📝try it MohamedRashad/Dots-OCR See translation 🔥 2 2 + Reply