Upload app.py
app.py CHANGED
@@ -13,15 +13,13 @@ from auth import HuggingFaceAuth
 from benchmark_selection import BenchmarkSelector, create_benchmark_selection_ui
 from evaluation_queue import EvaluationQueue, create_model_submission_ui
 from leaderboard import Leaderboard, create_leaderboard_ui
-from model_config import ModelConfigManager, create_community_framework_ui
 from sample_benchmarks import add_sample_benchmarks
 
 # Initialize components in main thread
 db = DynamicHighscoresDB()
 auth_manager = HuggingFaceAuth(db)
 benchmark_selector = BenchmarkSelector(db, auth_manager)
-
-evaluation_queue = EvaluationQueue(db, auth_manager, model_config_manager)
+evaluation_queue = EvaluationQueue(db, auth_manager)
 leaderboard = Leaderboard(db)
 
 # Initialize sample benchmarks if none exist
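Note on the hunk above: with the `model_config` import gone, nothing in `app.py` can construct a `ModelConfigManager`, so the old call `EvaluationQueue(db, auth_manager, model_config_manager)` would no longer resolve; the call site has to drop the third argument to match. For orientation, the constructor this implies might look like the sketch below. `evaluation_queue.py` is not part of this commit, so the parameter names and attributes here are assumptions read off the call site, not the module's actual code.

```python
# Hypothetical sketch of the EvaluationQueue constructor implied by the new
# call site EvaluationQueue(db, auth_manager). evaluation_queue.py is not in
# this commit, so the names and attributes below are assumptions only.
class EvaluationQueue:
    def __init__(self, db, auth_manager):
        self.db = db                      # DynamicHighscoresDB instance
        self.auth_manager = auth_manager  # HuggingFaceAuth instance
        self.queue = []                   # pending evaluation submissions
```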
@@ -322,7 +320,60 @@ with gr.Blocks(css=css, title="Dynamic Highscores") as app:
         benchmark_ui = create_benchmark_selection_ui(benchmark_selector, auth_manager)
 
     with gr.TabItem("🌐 Community Framework", id=3):
-
+        # Create a simple placeholder for the Community Framework tab
+        gr.Markdown("""
+        # 🌐 Dynamic Highscores Community Framework
+
+        ## About Dynamic Highscores
+
+        Dynamic Highscores is an open-source community benchmark system for evaluating language models on any dataset. This project was created to fill the gap left by the retirement of HuggingFace's "Open LLM Leaderboards," which were discontinued due to outdated benchmarks.
+
+        ### Key Features
+
+        - **Flexible Benchmarking**: Test models against any HuggingFace dataset, not just predefined benchmarks
+        - **Community-Driven**: Anyone can add new benchmarks and submit models for evaluation
+        - **Modern Evaluation**: Focus on contemporary benchmarks that better reflect current model capabilities
+        - **CPU-Only Evaluation**: Ensures fair comparisons across different models
+        - **Daily Submission Limits**: Prevents system abuse (one benchmark per day per user)
+        - **Model Tagging**: Categorize models as Merge, Agent, Reasoning, Coding, etc.
+        - **Unified Leaderboard**: View all models, with filtering by tags
+
+        ### Why This Project Matters
+
+        When HuggingFace retired their "Open LLM Leaderboards," the community lost a valuable resource for comparing model performance. The benchmarks used had become outdated and didn't reflect the rapid advances in language model capabilities.
+
+        Dynamic Highscores addresses this issue by allowing users to select from any benchmark on HuggingFace, including the most recent and relevant datasets. This ensures that models are evaluated on tasks that matter for current applications.
+
+        ## Model Configuration System (Coming Soon)
+
+        We're working on a modular system for model configurations that will allow users to:
+
+        - Create and apply predefined configurations for different model types
+        - Define parameters such as Temperature, Top-K, Min-P, Top-P, and Repetition Penalty
+        - Share optimal configurations with the community
+
+        ### Example Configuration (Gemma)
+
+        ```
+        Temperature: 1.0
+        Top_K: 64
+        Min_P: 0.01
+        Top_P: 0.95
+        Repetition Penalty: 1.0
+        ```
+
+        ## Contributing to the Project
+
+        We welcome contributions from the community! If you'd like to improve Dynamic Highscores, here are some ways to get involved:
+
+        - **Add New Features**: Enhance the platform with additional functionality
+        - **Improve Evaluation Methods**: Help make model evaluations more accurate and efficient
+        - **Fix Bugs**: Address issues in the codebase
+        - **Enhance Documentation**: Make the project more accessible to new users
+        - **Add Model Configurations**: Contribute optimal configurations for different model types
+
+        To contribute, fork the repository, make your changes, and submit a pull request. We appreciate all contributions, big or small!
+        """)
 
     gr.Markdown("""
     ### About Dynamic Highscores