metadata
title: README
emoji: π
colorFrom: yellow
colorTo: green
sdk: static
pinned: false
pyannote.audio is an open-source toolkit for speaker diarization.
Pretrained pipelines reach state-of-the-art performance on most academic benchmarks and are used in production by dozens of companies.
Benchmark | v1.1 | v2.0 | v2.1 | v3.0 | Premium |
---|---|---|---|---|---|
AISHELL-4 | - | 14.6 | 14.1 | 12.3 | 12.3 |
AliMeeting (channel 1) | - | - | 27.4 | 24.3 | 19.4 |
AMI (IHM) | 29.7 | 18.2 | 18.9 | 19.0 | 16.7 |
AMI (SDM) | - | 29.0 | 27.1 | 22.2 | 20.1 |
AVA-AVD | - | - | - | 49.1 | 42.7 |
DIHARD 3 (full) | 29.2 | 21.0 | 26.9 | 21.7 | 17.0 |
MSDWild | - | - | - | 24.6 | 20.4 |
REPERE (phase2) | - | 12.6 | 8.2 | 7.8 | 7.8 |
VoxConverse (v0.3) | 21.5 | 12.6 | 11.2 | 11.3 | 9.5 |
Diarization error rate (in %) |