Bamba-9B-v2 - Fast and powerful!
PipelineRL
I trained a Language Model to schedule events with GRPO!
Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios
Uncensor any LLM with abliteration
🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?
DeepWiki: Best AI Documentation Generator for Any Github Repo
Creating your custom Ghibli Text-to-Image model
Introduction to State Space Models (SSM)
Code a simple RAG from scratch
Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time
What is test-time compute and how to scale it?
Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs
ColPali: Efficient Document Retrieval with Vision Language Models 👀
Efficient LLM Pretraining: Packed Sequences and Masked Attention
OpenManus: The Open Source Alternative to Manus AI
ChatGPT-4o's Image Generation Capabilities and Its Wild Examples
How to Use FastAPI MCP Server: A Complete Guide
What is The Agent2Agent Protocol (A2A) and Why You Must Learn It Now
What is MoE 2.0? Update Your Knowledge about Mixture-of-experts