This project is a video captioning system that converts speech to text and overlays the text as captions on the video. The captions are synchronized to the video. It uses Whisper for transcription and OpenCV to overlay the captions on the video frames.
Speech-to-Text with Whisper
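As a rough illustration of this step, the sketch below transcribes a file with the open-source `openai-whisper` package and keeps the start time, end time, and text of each segment. The helper names (`transcribe`, `clean_segments`, `caption_at`), the `"base"` model size, and the idea of looking up captions by timestamp are assumptions made for this example, not necessarily how this project is implemented. Whisper relies on ffmpeg being installed to read audio from a video file.

```python
def clean_segments(result):
    """Reduce a Whisper result dict to a list of timed captions.

    Whisper's transcribe() returns a dict whose "segments" entries
    carry "start" and "end" times in seconds plus the "text".
    """
    return [
        {"start": seg["start"], "end": seg["end"], "text": seg["text"].strip()}
        for seg in result["segments"]
    ]


def transcribe(video_path, model_name="base"):
    """Hypothetical helper: run Whisper on a media file, return timed captions."""
    import whisper  # imported lazily so the helpers above work without it

    model = whisper.load_model(model_name)
    result = model.transcribe(video_path)
    return clean_segments(result)


def caption_at(segments, t):
    """Return the caption text active at time t (seconds), or "" if none."""
    for seg in segments:
        if seg["start"] <= t < seg["end"]:
            return seg["text"]
    return ""
```

With segments in this form, `caption_at(segments, 1.5)` gives the text that should be visible 1.5 seconds into the video.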
Caption Overlay with OpenCV
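A minimal sketch of the overlay step follows, assuming captions are available as a list of dicts with `start`, `end` (seconds), and `text` keys. The function names, the `mp4v` codec, the font settings, and the bottom-centered placement are illustrative choices, not details taken from this project. Each frame index is converted to a timestamp using the video's frame rate, and the matching caption is drawn with `cv2.putText`. Note that OpenCV's `VideoWriter` writes video only, so the original audio would need to be added back with a separate tool.

```python
def caption_for_frame(segments, frame_index, fps):
    """Return the caption active at the given frame, or "" if none."""
    t = frame_index / fps  # timestamp of this frame in seconds
    for seg in segments:
        if seg["start"] <= t < seg["end"]:
            return seg["text"]
    return ""


def overlay_captions(src_path, dst_path, segments):
    """Hypothetical helper: copy a video, drawing captions on each frame."""
    import cv2  # imported lazily so caption_for_frame works without OpenCV

    cap = cv2.VideoCapture(src_path)
    if not cap.isOpened():
        raise IOError(f"could not open {src_path}")

    fps = cap.get(cv2.CAP_PROP_FPS) or 30.0  # fall back if FPS is unreported
    width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    fourcc = cv2.VideoWriter_fourcc(*"mp4v")
    writer = cv2.VideoWriter(dst_path, fourcc, fps, (width, height))

    font = cv2.FONT_HERSHEY_SIMPLEX
    frame_index = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        text = caption_for_frame(segments, frame_index, fps)
        if text:
            (text_w, _), _ = cv2.getTextSize(text, font, 1.0, 2)
            org = (max((width - text_w) // 2, 0), height - 30)
            # Thick black pass first, then white on top, for readability.
            cv2.putText(frame, text, org, font, 1.0, (0, 0, 0), 6, cv2.LINE_AA)
            cv2.putText(frame, text, org, font, 1.0, (255, 255, 255), 2, cv2.LINE_AA)
        writer.write(frame)
        frame_index += 1

    cap.release()
    writer.release()
    return frame_index  # number of frames written
```

Long captions are not wrapped in this sketch; text wider than the frame would be clipped at the edges.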