Yoann Poupart
Xmaster6y
AI & ML interests
AI Safety | Interpretability | LLM | RL
Recent Activity
updated
a dataset
4 days ago
LuxWorld/trajectories
upvoted
a
paper
5 days ago
Analyze Feature Flow to Enhance Interpretation and Steering in Language
Models
upvoted
a
paper
5 days ago
Mechanistic Permutability: Match Features Across Layers