MixEval
community
https://mixeval.github.io/
NiJinjie
Psycoy
Followers: 10
AI & ML interests
LLM & LMM evaluation
Recent Activity
yuexiang96 authored a paper 6 days ago: Demystifying Long Chain-of-Thought Reasoning in LLMs
yuexiang96 authored a paper 13 days ago: Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
yuexiang96 authored a paper 19 days ago: Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Team members: 7
Models: none public yet
Datasets: 2
MixEval/MixEval-X • Updated Dec 10, 2024 • 7.68k • 433 • 10
MixEval/MixEval • Updated Sep 27, 2024 • 5k • 196 • 21