Skip to content

Feature request: Add FunASR/SenseVoice as alternative ASR backend #459

Description

@LauraGPT

Feature Request

Add FunASR/SenseVoice as an alternative to Whisper for transcription.

Why

  • SenseVoice (234M params): 50+ languages, 5x faster than Whisper-small, non-autoregressive (no hallucination)
  • Built-in speaker diarization (cam++): No need for separate diarization pipeline
  • Complete pipeline in one package: VAD + ASR + punctuation + timestamps + speaker diarization
  • OpenAI-compatible API: funasr-server serves POST /v1/audio/transcriptions — drop-in replacement

Quick start

pip install funasr
from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall")
result = model.generate(input="audio.wav")

References

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions