Skip to content

Incompatible File Types and Audio Channels Causing Azure Transcription Service Failures #17

@mitchellrevill

Description

@mitchellrevill

We have encountered an issue where files passed onto the Azure transcription service often fail due to incompatibilities with the file type and audio channels. due to issues with Azure TTS supporting a limited amount of audio formats/Only mono audio files

Source: https://learn.microsoft.com/en-us/dotnet/api/microsoft.cognitiveservices.speech.speechsynthesisoutputformat?view=azure-dotnet

Steps to Reproduce:

  • Pass an audio file with incompatible file type or audio channels to the Azure transcription service.
  • Observe the failure in the Azure Functions trigger "Invocations"

Expected Behavior: The transcription service should successfully process the audio files without failures.

Actual Behavior: The transcription service fails to process the audio files due to incompatibilities with the file type and audio channels. For users, it feels like a 50/50 chance that the file will work when uploaded.

Proposed Solution: Implement a local conversion of the audio files before they are passed to the Azure transcription service. This conversion should ensure that the file type and audio channels are compatible with the transcription service requirements. Note that Azure TTS only supports mono audio channels.

Additional Context: This issue has been affecting our ability to efficiently transcribe audio files, and a local conversion solution would help mitigate these failures.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions