Spaces:
Configuration error
Configuration error
| TODO: break this down into: transcription/translation, streaming transcription/translation, live transcription, audio generation, model listing | |
| TODO: add video demos for all | |
| TODO: add a note about OPENAI_API_KEY | |
| ## Curl | |
| ```bash | |
| curl http://localhost:8000/v1/audio/transcriptions -F "[email protected]" | |
| ``` | |
| ## Python | |
| === "httpx" | |
| ```python | |
| import httpx | |
| with open('audio.wav', 'rb') as f: | |
| files = {'file': ('audio.wav', f)} | |
| response = httpx.post('http://localhost:8000/v1/audio/transcriptions', files=files) | |
| print(response.text) | |
| ``` | |
| ## OpenAI SDKs | |
| === "Python" | |
| ```python | |
| import httpx | |
| with open('audio.wav', 'rb') as f: | |
| files = {'file': ('audio.wav', f)} | |
| response = httpx.post('http://localhost:8000/v1/audio/transcriptions', files=files) | |
| print(response.text) | |
| ``` | |
| === "CLI" | |
| ```bash | |
| export OPENAI_BASE_URL=http://localhost:8000/v1/ | |
| export OPENAI_API_KEY="cant-be-empty" | |
| openai api audio.transcriptions.create -m Systran/faster-whisper-small -f audio.wav --response-format text | |
| ``` | |
| === "Other" | |
| See [OpenAI libraries](https://platform.openai.com/docs/libraries) and [OpenAI speech-to-text usage](https://platform.openai.com/docs/guides/speech-to-text). | |
| ## Open WebUI | |
| ### Using the UI | |
| 1. Go to the [Admin Settings](http://localhost:8080/admin/settings) page | |
| 2. Click on the "Audio" tab | |
| 3. Update settings | |
| - Speech-to-Text Engine: OpenAI | |
| - API Base URL: http://faster-whisper-server:8000/v1 | |
| - API Key: does-not-matter-what-you-put-but-should-not-be-empty | |
| - Model: Systran/faster-distil-whisper-large-v3 | |
| 4. Click "Save" | |
| ### Using environment variables (Docker Compose) | |
| !!! warning | |
| This doesn't seem to work when you've previously used the UI to set the STT engine. | |
| ```yaml | |
| # NOTE: Some parts of the file are omitted for brevity. | |
| services: | |
| open-webui: | |
| image: ghcr.io/open-webui/open-webui:main | |
| ... | |
| environment: | |
| ... | |
| # Environment variables are documented here https://docs.openwebui.com/getting-started/env-configuration#speech-to-text | |
| AUDIO_STT_ENGINE: "openai" | |
| AUDIO_STT_OPENAI_API_BASE_URL: "http://faster-whisper-server:8000/v1" | |
| AUDIO_STT_OPENAI_API_KEY: "does-not-matter-what-you-put-but-should-not-be-empty" | |
| AUDIO_STT_MODEL: "Systran/faster-distil-whisper-large-v3" | |
| faster-whisper-server: | |
| image: fedirz/faster-whisper-server:latest-cuda | |
| ... | |
| ``` | |