# Phonemic Transcription Leaderboard
Welcome to the Phonemic Transcription Leaderboard! This simple leaderboard helps you track and compare the performance of different speech-to-phoneme models. Feel free to use it for your own Hugging Face leaderboards!
![leaderboard](img/leaderboard.png)
## Features
* Interactive leaderboard with real-time sorting
* Easy model submission system
* Automatic evaluation of submitted models
* Responsive design that works on all devices
## What This Project Does
This leaderboard tracks two key metrics for phonemic transcription models:
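* **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes, measured as the edit distance between the predicted and reference phoneme sequences relative to the reference length
* **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features, so confusing two similar phonemes costs less than confusing two very different ones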
Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.
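To make the PER definition concrete, here is a minimal pure-Python sketch. It assumes PER is the Levenshtein distance between the reference and predicted phoneme sequences divided by the reference length, which is the usual definition; the leaderboard's own evaluation code may differ in details such as phoneme normalization. The function names `edit_distance` and `phoneme_error_rate` and the toy phoneme sequences are illustrative, not taken from this repository.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (unit costs)."""
    # prev[j] holds the distance between the first i-1 reference
    # phonemes and the first j hypothesis phonemes.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion
                    curr[j - 1] + 1,     # insertion
                    prev[j - 1] + cost,  # substitution or match
                )
            )
        prev = curr
    return prev[-1]


def phoneme_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Edit distance normalized by the number of reference phonemes."""
    if not ref:
        raise ValueError("reference must contain at least one phoneme")
    return edit_distance(ref, hyp) / len(ref)


# Toy example (made-up sequences): one substitution plus one insertion
# against three reference phonemes.
reference = ["k", "æ", "t"]
hypothesis = ["k", "ɛ", "t", "s"]
print(f"{phoneme_error_rate(reference, hypothesis):.3f}")  # 0.667
```

Because the score is normalized by the reference length, a hypothesis with many insertions can push PER above 1.0.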
## Getting Started
### Prerequisites
* Python 3.10
* Git
* A love for speech recognition!
### Quick Installation
1. Clone this repository:
```bash
git clone [your-repo-url]
cd phonemic-leaderboard
```
2. Set up your environment:
```bash
# Create a virtual environment with Python 3.10
python3.10 -m venv venv
# Activate the virtual environment
source venv/bin/activate
# Install the required dependencies
pip install -r requirements.txt
```
3. Launch the leaderboard:
```bash
# Run the application
uvicorn app:app --host 0.0.0.0 --port 7860
```
4. Visit `http://localhost:7860` in your browser and see the magic!
## Using the Leaderboard
### Submitting a Model
1. Go to the "Submit Model" tab
2. Enter your model details:
* Model name (e.g., "wav2vec2-phoneme-wizard")
* Submission name (e.g., "MyAwesomeModel v1.0")
* GitHub/Kaggle/HuggingFace URL (optional)
3. Click Submit and watch your model climb the ranks!
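If you prepare submissions in a script before pasting them into the form, a quick sanity check can catch typos early. The sketch below is purely illustrative: the `Submission` class, its field names, and the `validate` helper are hypothetical and do not describe how the leaderboard itself checks input. It assumes only what the form above asks for: a model name, a submission name, and an optional URL.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass
class Submission:
    """Hypothetical record mirroring the fields of the Submit Model form."""
    model_name: str
    submission_name: str
    url: Optional[str] = None  # GitHub/Kaggle/HuggingFace link, optional


def validate(sub: Submission) -> list[str]:
    """Return a list of human-readable problems (empty if none found)."""
    problems = []
    if not sub.model_name.strip():
        problems.append("model name is required")
    if not sub.submission_name.strip():
        problems.append("submission name is required")
    if sub.url:
        parsed = urlparse(sub.url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append("URL must be an http(s) link")
    return problems


ok = Submission("wav2vec2-phoneme-wizard", "MyAwesomeModel v1.0")
print(validate(ok))  # []
```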
### Checking Model Status
1. Navigate to the "Model Status" tab
2. Enter your model name or task ID
3. Get real-time updates on your model's evaluation progress
## Understanding the Results
The leaderboard shows:
* Model names and submission details
* PER and PWED scores (lower is better!)
* Links to model repositories
* Submission dates
Sort by either metric to see who's leading the pack!
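Since lower is better for both metrics, ranking is an ascending sort on the chosen column. Here is a small sketch of that idea. The `Entry` class, the `rank` helper, and the scores are placeholders invented for illustration; they are not real leaderboard results or the app's actual data model.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    """Hypothetical leaderboard row; field names are illustrative."""
    model: str
    per: float
    pwed: float


def rank(entries: list[Entry], metric: str = "per") -> list[Entry]:
    """Return entries sorted ascending by the metric (lower is better)."""
    if metric not in ("per", "pwed"):
        raise ValueError(f"unknown metric: {metric!r}")
    return sorted(entries, key=lambda entry: getattr(entry, metric))


# Placeholder scores, not measured results.
entries = [
    Entry("model-a", per=0.30, pwed=0.12),
    Entry("model-b", per=0.25, pwed=0.15),
    Entry("model-c", per=0.40, pwed=0.10),
]

print([e.model for e in rank(entries, "per")])   # ['model-b', 'model-a', 'model-c']
print([e.model for e in rank(entries, "pwed")])  # ['model-c', 'model-a', 'model-b']
```

Note that the two metrics can disagree on the ordering, which is why the leaderboard lets you sort by either one.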
## Technical Details
* Built with Gradio for a smooth UI experience
* Runs on a basic compute plan (16 GB RAM, 2 vCPUs) for easy reproducibility
* Evaluation can take several hours - perfect time to grab a coffee
## Contributing
Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:
* Add new evaluation metrics
* Improve the UI design
* Enhance documentation
* Submit bug fixes
* Add new features
## License
This project is licensed under the MIT License - see the LICENSE file for details.
## Acknowledgments
* Thanks to the TIMIT speech corpus for providing evaluation data
* Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
* Built with love by Koel Labs
## Need Help?
Got questions? Found a bug? Want to contribute? Open an issue or reach out to us! We're here to help make speech recognition evaluation fun and accessible for everyone!
Remember: Every great model deserves its moment to shine!
---
Happy Transcribing!