Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS
Toloka
company
Verified
AI & ML interests
Human In The Loop - data labeling, model training and hosting, human verification, and more
Organization Card
Hey, this is Toloka!
Collections: 1
Spaces: 2
Models: 4
Datasets: 9
toloka/beemo • 2.19k • 220 • 14
toloka/u-math • 1.1k • 206 • 17
toloka/mu-math • 1.08k • 123 • 20
toloka/CLESC • 500 • 49 • 2
toloka/VoxDIY-RusNews • 113 • 3
toloka/CrowdSpeech • 116 • 5
toloka/crowdkit-datasets • 180
toloka/WSDMCup2023 • 46.2k • 226 • 4
toloka/TolokerGraph • 49