Spaces:
Configuration error
Configuration error
File size: 2,432 Bytes
4b9d55e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 |
TODO: break this down into: transcription/translation, streaming transcription/translation, live transcription, audio generation, model listing
TODO: add video demos for all
TODO: add a note about OPENAI_API_KEY
## Curl
```bash
curl http://localhost:8000/v1/audio/transcriptions -F "[email protected]"
```
## Python
=== "httpx"
```python
import httpx
with open('audio.wav', 'rb') as f:
files = {'file': ('audio.wav', f)}
response = httpx.post('http://localhost:8000/v1/audio/transcriptions', files=files)
print(response.text)
```
## OpenAI SDKs
=== "Python"
```python
import httpx
with open('audio.wav', 'rb') as f:
files = {'file': ('audio.wav', f)}
response = httpx.post('http://localhost:8000/v1/audio/transcriptions', files=files)
print(response.text)
```
=== "CLI"
```bash
export OPENAI_BASE_URL=http://localhost:8000/v1/
export OPENAI_API_KEY="cant-be-empty"
openai api audio.transcriptions.create -m Systran/faster-whisper-small -f audio.wav --response-format text
```
=== "Other"
See [OpenAI libraries](https://platform.openai.com/docs/libraries) and [OpenAI speech-to-text usage](https://platform.openai.com/docs/guides/speech-to-text).
## Open WebUI
### Using the UI
1. Go to the [Admin Settings](http://localhost:8080/admin/settings) page
2. Click on the "Audio" tab
3. Update settings
- Speech-to-Text Engine: OpenAI
- API Base URL: http://faster-whisper-server:8000/v1
- API Key: does-not-matter-what-you-put-but-should-not-be-empty
- Model: Systran/faster-distil-whisper-large-v3
4. Click "Save"
### Using environment variables (Docker Compose)
!!! warning
This doesn't seem to work when you've previously used the UI to set the STT engine.
```yaml
# NOTE: Some parts of the file are omitted for brevity.
services:
open-webui:
image: ghcr.io/open-webui/open-webui:main
...
environment:
...
# Environment variables are documented here https://docs.openwebui.com/getting-started/env-configuration#speech-to-text
AUDIO_STT_ENGINE: "openai"
AUDIO_STT_OPENAI_API_BASE_URL: "http://faster-whisper-server:8000/v1"
AUDIO_STT_OPENAI_API_KEY: "does-not-matter-what-you-put-but-should-not-be-empty"
AUDIO_STT_MODEL: "Systran/faster-distil-whisper-large-v3"
faster-whisper-server:
image: fedirz/faster-whisper-server:latest-cuda
...
```
|