Feature request: Add FunASR/SenseVoice as alternative ASR backend

## Feature Request

Add FunASR/SenseVoice as an alternative to Whisper for transcription.

### Why

- **SenseVoice** (234M params): 50+ languages, 5x faster than Whisper-small, non-autoregressive (no hallucination)
- **Built-in speaker diarization** (cam++): No need for separate diarization pipeline
- **Complete pipeline in one package**: VAD + ASR + punctuation + timestamps + speaker diarization
- **OpenAI-compatible API**: `funasr-server` serves `POST /v1/audio/transcriptions` — drop-in replacement

### Quick start

```bash
pip install funasr
from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")
```

### References

- [FunASR](https://github.com/modelscope/FunASR) — 16K+ stars
- [SenseVoice](https://github.com/FunAudioLLM/SenseVoice) — 8K+ stars

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature request: Add FunASR/SenseVoice as alternative ASR backend #459

Feature Request

Why

Quick start

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Feature request: Add FunASR/SenseVoice as alternative ASR backend #459

Description

Feature Request

Why

Quick start

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions