There is no such thing as a tokenizer-free lunch
RexBERT: Encoders for a brave new world of E-Commerce
Nemotron-Personas-Japan: Synthesized Data for Sovereign AI
SyGra: The One-Stop Framework for Building Data for LLMs and SLMs
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL
Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips
Model Quality: Hugging Face Is All You Need
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Small Language Models (SLM): A Comprehensive Overview
Uncensor any LLM with abliteration
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
PP-OCRv5 on Hugging Face: A Specialized Approach to OCR
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi
Code a simple RAG from scratch
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
How to Choose the Best Open Source LLM for Your Project in 2025
AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models
Unleashing the Full Potential of ERNIE4.5 using FastDeploy
PrediBench: Testing AI models on prediction markets
Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm