Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models Paper • 2502.12892 • Published 23 days ago • 1
Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 3 days ago • 115
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 6 days ago • 71
QE4PE: Word-level Quality Estimation for Human Post-Editing Paper • 2503.03044 • Published 9 days ago • 6
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models Paper • 2502.15886 • Published 20 days ago • 1
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published about 1 month ago • 10
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 24
Building Bridges, Not Walls -- Advancing Interpretability by Unifying Feature, Data, and Model Component Attribution Paper • 2501.18887 • Published Jan 31 • 1
Sparse Autoencoders Trained on the Same Data Learn Different Features Paper • 2501.16615 • Published Jan 28 • 1
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders Paper • 2501.17148 • Published Jan 28 • 1
Gemma Neogenesis 💎🌍🇮🇹 Collection Datasets and models for Neogenesis: Post-training recipe for improving Gemma 2 for a specific language. Notebook: https://t.ly/iuKdy • 12 items • Updated 3 days ago • 5
Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published Jan 14 • 10