view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 3 days ago • 3
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 3 days ago • 9
view article Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows By Kseniase • 12 days ago • 11
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 23 days ago • 318
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 25 days ago • 36
view article Article Exploring Synthetic Data Generation with DataDreamer By asoria • 24 days ago • 6
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins • 5 items • Updated 10 days ago • 21
view article Article A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model By dvgodoy • 25 days ago • 5
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 6 days ago • 176
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 26 days ago • 13
Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese Paper • 2408.12480 • Published Aug 22, 2024 • 23
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 153