ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
-
Updated
Feb 10, 2026 - Python
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio Denoising, and Enhancement, Support models such as paraformer, sensevoice, fireredasr, zipformer, moonshine, wenet, whisper, fsmn-vad, silero-vad, CT Transformer punc, Spleeter, Uvr5, etc, apply ONNX models in various scenarios.
An upgrade framework for train and validate compare with icefall using Lightning.
A template for serving zipformer on Triton Inference Server.
Offline Speech-to-Text for React Native using sherpa-onnx Supports Zipformer, Paraformer, NeMo CTC, Whisper & more.
React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing (STT/TTS/Diarization/VAD) completely offline on the device. Support for Android & iOS
Streaming piano-transcription system
🎤 Enable offline speech recognition in React Native using sherpa-onnx, supporting various model architectures for reliable performance.
Add a description, image, and links to the zipformer topic page so that developers can more easily learn about it.
To associate your repository with the zipformer topic, visit your repo's landing page and select "manage topics."