|
# Dynamic Highscores - Todo List |
|
|
|
## Analysis and Planning |
|
- [x] Extract and analyze uploaded framework files |
|
- [x] Examine leaderboard component structure and functionality |
|
- [x] Examine dashboard component structure and functionality |
|
- [x] Analyze requirements.txt files for dependencies |
|
|
|
## Database Schema Design |
|
- [x] Design schema for user authentication and tracking |
|
- [x] Design schema for benchmark datasets |
|
- [x] Design schema for model submissions and evaluations |
|
- [x] Design schema for tagging system (Merge, Agent, Reasoning, Coding, etc.) |
|
- [x] Design schema for daily submission limits |
|
|
|
## User Authentication System |
|
- [x] Implement HuggingFace login integration |
|
- [x] Create user profile management |
|
- [x] Implement special privileges for admin account |
|
|
|
## Benchmark Selection Interface |
|
- [x] Create interface for browsing HuggingFace datasets |
|
- [x] Implement dataset loading functionality |
|
- [x] Create dataset preview and selection UI |
|
|
|
## Model Evaluation Queue System |
|
- [x] Implement CPU-only evaluation system |
|
- [x] Create queue management for benchmark submissions |
|
- [x] Implement daily submission limit (1 per day per user) |
|
- [x] Add admin override for submission limits |
|
|
|
## Leaderboard with Filtering |
|
- [x] Implement unified leaderboard for all models |
|
- [x] Add tag-based filtering (Merge, Agent, Reasoning, Coding) |
|
- [x] Implement sorting and searching functionality |
|
- [x] Create visualization components for benchmark results |
|
|
|
## Integration |
|
- [x] Combine dashboard and leaderboard components |
|
- [x] Create unified UI with consistent styling |
|
- [x] Implement navigation between different sections |
|
- [x] Ensure proper data flow between components |
|
|
|
## Testing and Deployment |
|
- [x] Test user authentication flow |
|
- [x] Test benchmark selection and submission |
|
- [x] Test leaderboard filtering and visualization |
|
- [x] Prepare for deployment on HuggingFace Spaces |
|
|