Multimodal pipeline for turning PPTs or documents into dubbed videos with voice cloning, timeline alignment, and retimed video composition.
python ffmpeg tts ppt multimodal video-generation voice-cloning audio-video-sync timeline-alignment slide-processing
-
Updated
Apr 23, 2026 - Python