Merge pull request #527 from Thaniel94/add-whisper-language-constraint

feat: Documentation for WHISPER_LANGUAGE
Tim Jaeryang Baek 2025-05-05 23:36:23 +04:00 committed by GitHub
commit 7e1f8c9929
2 changed files with 7 additions and 0 deletions


@@ -1871,6 +1871,12 @@ Using a remote Playwright browser via `PLAYWRIGHT_WS_URL` can be beneficial for:
- Default: `False`
- Description: Toggles automatic update of the Whisper model.
#### `WHISPER_LANGUAGE`
- Type: `str`
- Default: `None`
- Description: Specifies the ISO 639-1 language Whisper uses for STT (ISO 639-2 for Hawaiian and Cantonese). Whisper predicts the language by default.
### Speech-to-Text (OpenAI)
#### `AUDIO_STT_ENGINE`


@@ -19,6 +19,7 @@ The following is a summary of the environment variables for speech to text (STT)
|----------|-------------|
| `WHISPER_MODEL` | Sets the Whisper model to use for local Speech-to-Text |
| `WHISPER_MODEL_DIR` | Specifies the directory to store Whisper model files |
| `WHISPER_LANGUAGE` | Specifies the ISO 639-1 (ISO 639-2 for Hawaiian and Cantonese) Speech-to-Text language to use for Whisper (language is predicted unless set) |
| `AUDIO_STT_ENGINE` | Specifies the Speech-to-Text engine to use (empty for local Whisper, or `openai`) |
| `AUDIO_STT_MODEL` | Specifies the Speech-to-Text model for OpenAI-compatible endpoints |
| `AUDIO_STT_OPENAI_API_BASE_URL` | Sets the OpenAI-compatible base URL for Speech-to-Text |
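
The documented behaviour of `WHISPER_LANGUAGE` (a language code when set, automatic prediction when unset) could be sketched roughly as below. This is an illustration only, not the project's actual implementation: the helper name `resolve_whisper_language` is hypothetical, and trimming and lowercasing the value are assumptions made for the example.

```python
import os
from typing import Mapping, Optional


def resolve_whisper_language(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the configured language code, or None to leave detection to Whisper.

    Hypothetical helper: the name and the normalization steps are assumptions.
    """
    # Treat an unset or blank variable as "no language constraint".
    raw = env.get("WHISPER_LANGUAGE", "").strip().lower()
    return raw or None


if __name__ == "__main__":
    # Example values: "en" (ISO 639-1) and "haw" (Hawaiian, ISO 639-2).
    for sample in ({"WHISPER_LANGUAGE": "en"}, {"WHISPER_LANGUAGE": "haw"}, {}):
        print(sample, "->", resolve_whisper_language(sample))
```

Returning `None` for the unset case mirrors the documented default of `None`, so a caller could pass the result straight through as an optional language argument.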