Spaces: microsoft/MageBench-Leaderboard (Running)

  • 2 contributors
History: 39 commits
daiqi
Add {'Score': '867', 'Name': 'gbhd', 'BaseModel': 'bdfb', 'Env': 'Sokoban', 'Target-research': 'Model-Eval-Online', 'Subset': 'mini', 'Link': 'fdns', 'State': 'Checking'} to checking queue
5b280bc verified 6 months ago
  • src
    Update src/envs.py 6 months ago
  • .gitattributes
    1.58 kB
    Upload demo.mp4 6 months ago
  • .gitignore
    136 Bytes
    Duplicate from demo-leaderboard-backend/leaderboard 6 months ago
  • .pre-commit-config.yaml
    1.53 kB
    Duplicate from demo-leaderboard-backend/leaderboard 6 months ago
  • Makefile
    208 Bytes
    Duplicate from demo-leaderboard-backend/leaderboard 6 months ago
  • README.md
    1.44 kB
    initial commit 6 months ago
  • app.py
    12.2 kB
    Update app.py 6 months ago
  • commit_results.jsonl
    32.7 kB
    Upload commit_results.jsonl 6 months ago
  • demo.mp4
    335 MB
    LFS
    Upload demo.mp4 6 months ago
  • pyproject.toml
    548 Bytes
    Duplicate from demo-leaderboard-backend/leaderboard 6 months ago
  • requirements.txt
    214 Bytes
    Update requirements.txt 6 months ago
  • test-output.json
    166 Bytes
    Add {'Score': '867', 'Name': 'gbhd', 'BaseModel': 'bdfb', 'Env': 'Sokoban', 'Target-research': 'Model-Eval-Online', 'Subset': 'mini', 'Link': 'fdns', 'State': 'Checking'} to checking queue 6 months ago