Microsoft's MAI-Transcribe-1 runs 2.5x faster than its predecessor at $0.36 per audio hour

2026-04-03

Summary

Microsoft has launched MAI-Transcribe-1, a new speech-to-text model that processes audio 2.5 times faster than its predecessor and costs $0.36 per audio hour. It supports 25 languages and excels in challenging conditions like background noise. The model is available in Microsoft Teams and through public previews, and it outperforms several other models in accuracy tests.

Why This Matters

The introduction of MAI-Transcribe-1 is significant because it offers faster and more accurate transcription services at a lower cost, which can benefit businesses relying on audio data processing. By improving performance under difficult conditions, it addresses common challenges in fields that require precise transcription, such as legal and medical sectors.

How You Can Use This Info

Professionals can leverage MAI-Transcribe-1 to enhance productivity in meetings or customer interactions by integrating it into platforms like Microsoft Teams. Developers and businesses can explore this technology through Microsoft Foundry and AI Playground for applications requiring reliable and cost-effective transcription services.

Read the full article