Docker for multiple TTS Engines with a GRadio interface
-
Updated
Aug 29, 2024 - Jupyter Notebook
Docker for multiple TTS Engines with a GRadio interface
Voice-controlled robotic assistant with natural language processing, command validation, and speech synthesis. Built with a microservices architecture.
This case study uses Multimodal Generative AI (text, image, audio, video) to create a complete, professional digital marketing campaign for the small bakery, demonstrating a cost-effective content creation process.
Fine-tuned Parler-TTS (600M) for Hinglish language, Indian accent, and emotion-conditioned speech synthesis. Published at arXiv:2506.16310.
Add a description, image, and links to the parler-tts topic page so that developers can more easily learn about it.
To associate your repository with the parler-tts topic, visit your repo's landing page and select "manage topics."