Luigi's picture

Luigi

Luigi

·

AI & ML interests

NLP / ASR / Chat LLM / VLMs

Recent Activity

liked a model about 8 hours ago

0xSero/MiniMax-M2.1-REAP-50

liked a model 1 day ago

0xSero/MiniMax-M2.1-REAP-50-W4A16

liked a model 13 days ago

Supertone/supertonic

View all activity

Organizations

upvoted a paper 3 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 106

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted an article 4 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

Sep 4, 2025

•

267

upvoted a changelog 4 months ago

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30, 2025

• 200

upvoted 6 papers 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 72

MobileCLIP2: Improving Multi-Modal Reinforced Training

Paper • 2508.20691 • Published Aug 28, 2025 • 5

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 56

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 58

MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

Paper • 2501.06713 • Published Jan 12, 2025 • 4

upvoted 4 articles 8 months ago

Article

Deriving DPO's Loss

Dec 24, 2024

•

29

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

Jan 18, 2024

•

75

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Feb 17, 2025

•

28

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

•

71

upvoted 2 collections 9 months ago

high-quality Chinese training datasets

a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 13 items • Updated May 22, 2025 • 23

Chinese Tiny LLM

9 items • Updated Apr 5, 2024 • 8

upvoted an article over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

Jul 16, 2024

•

437

upvoted a paper almost 2 years ago

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Paper • 2402.04291 • Published Feb 6, 2024 • 50

upvoted 2 collections about 2 years ago

Function Calling v3

Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 21

Mixtral HQQ Quantized Models

4-bit and 2-bit Mixtral models quantized using https://github.com/mobiusml/hqq • 9 items • Updated Mar 29, 2024 • 14