diff --git a/docs/tutorials/integrations/openai-edge-tts-integration.md b/docs/tutorials/integrations/openai-edge-tts-integration.md index 1db790e..2a81b8f 100644 --- a/docs/tutorials/integrations/openai-edge-tts-integration.md +++ b/docs/tutorials/integrations/openai-edge-tts-integration.md @@ -24,7 +24,7 @@ Similar to [openedai-speech](https://github.com/matatonic/openedai-speech), [ope - Docker installed on your system - Open WebUI running -- ffmpeg installed (required for audio format conversion and playback speed adjustments) +- ffmpeg (Optional - Only required if opting to not use `mp3` format) ## ⚡️ Quick start @@ -42,7 +42,7 @@ This will run the service at port 5050 with all the default configs - Set your TTS Settings to match the screenshot below - _Note: you can specify the TTS Voice here_ -![Screenshot of Open WebUI Admin Settings for Audio adding the correct endpoints for this project](https://utfs.io/f/MMMHiQ1TQaBoQ2AnPhUlTDGtR4B2v7E9JZN1PU5nAseoaXIc) +![Screenshot of Open WebUI Admin Settings for Audio adding the correct endpoints for this project](https://utfs.io/f/MMMHiQ1TQaBo9GgL4WcUbjSRlqi86sV3TXh47KYBJCkdQ20M) :::info The default API key is the string `your_api_key_here`. You do not have to change that value if you do not need the added security. @@ -67,7 +67,7 @@ If you prefer to run this project directly with Python, follow these steps to se #### 1. Clone the Repository ```bash -git clone https://github.com/your-username/openai-edge-tts.git +git clone https://github.com/travisvn/openai-edge-tts.git cd openai-edge-tts ``` @@ -103,7 +103,7 @@ PORT=5050 DEFAULT_VOICE=en-US-AndrewNeural DEFAULT_RESPONSE_FORMAT=mp3 -DEFAULT_SPEED=1.0 +DEFAULT_SPEED=1.2 DEFAULT_LANGUAGE=en-US @@ -140,7 +140,7 @@ Generates audio from the input text. Available parameters: - **model** (string): Set to "tts-1" or "tts-1-hd" (default: `"tts-1"`). - **voice** (string): One of the OpenAI-compatible voices (alloy, echo, fable, onyx, nova, shimmer) or any valid `edge-tts` voice (default: `"en-US-AndrewNeural"`). - **response_format** (string): Audio format. Options: `mp3`, `opus`, `aac`, `flac`, `wav`, `pcm` (default: `mp3`). -- **speed** (number): Playback speed (0.25 to 4.0). Default is `1.0`. +- **speed** (number): Playback speed (0.25 to 4.0). Default is `1.2`. :::tip You can browse available voices and listen to sample previews at [tts.travisvn.com](https://tts.travisvn.com) @@ -205,7 +205,7 @@ docker run -d -p 5050:5050 \ -e PORT=5050 \ -e DEFAULT_VOICE=en-US-AndrewNeural \ -e DEFAULT_RESPONSE_FORMAT=mp3 \ - -e DEFAULT_SPEED=1.0 \ + -e DEFAULT_SPEED=1.2 \ -e DEFAULT_LANGUAGE=en-US \ -e REQUIRE_API_KEY=True \ travisvn/openai-edge-tts:latest @@ -215,6 +215,8 @@ docker run -d -p 5050:5050 \ For more information on `openai-edge-tts`, you can visit the [GitHub repo](https://github.com/travisvn/openai-edge-tts) +For direct support, you can visit the [Voice AI & TTS Discord](https://discord.gg/GkFbBCBqJ6) + ## 🎙️ Voice Samples [Play voice samples and see all available Edge TTS voices](https://tts.travisvn.com/)