Blog, Articles, and discussions

Community Articles

OVHcloud on Hugging Face Inference Providers 🔥

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Norm-Preserving Biprojected Abliteration

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Building Deep Research: How we Achieved State of the Art

KV Caching Explained: Optimizing Transformer Inference Efficiency

We’re open-sourcing our text-to-image model and the process behind it

Announcing the LLM Open Finance models

Text-to-image Architectural Experiments

A Guide to Hugging Face’s Papers Page

From GRPO to DAPO and GSPO: What, Why, and How

Projected Abliteration

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Introduction to State Space Models (SSM)

Uncensor any LLM with abliteration

Code a simple RAG from scratch

Mastering Tensor Dimensions in Transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

How much did N-ATLaS-LLM move the needle? A Focused Evaluation of N-ATLaS on Yoruba, Igbo, and Hausa with AfroBench

timm, cv, community

Timm ❤️ Transformers: Use any timm model with transformers

January 16, 2025

multimodal, llamaindex, open-source-collab

Visual Document Retrieval Goes Multilingual

January 10, 2025

community, datasets, synthetic-data

Docmatix - a huge dataset for Document Visual Question Answering

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

🤗 PEFT welcomes new merging methods

February 19, 2024

community, guide, cv

Introduction to 3D Gaussian Splatting

September 18, 2023

community, guide, cv

Object Detection Leaderboard

September 18, 2023

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

August 22, 2023

community, guide, cv

Practical 3D Asset Generation: A Step-by-Step Guide

partnerships, multimodal, nlp

Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2

multi-modal, cv, guide

A Dive into Text-to-Video Models

partnerships, aws, nlp

Accelerating Hugging Face Transformers with AWS Inferentia2

cv, federated-learning, fl

Creating Privacy Preserving AI with Substra
