Audio language
Speakers Count
Vocabulary

Helps Whisper recognise technical terms. Not included in the transcript.

Advanced options
Task
VAD filter Improves accuracy on recordings with long pauses.
Whisper model Used when Azure/GCP unavailable. large-v3 is the default.
AI cleanup Fixes errors, punctuation, and domain terms after transcription.

Drop audio file(s) here, or

MP3, WAV, OGG, M4A, FLAC, WEBMmax 200 MB per file

Ready

Select a tool, run a request, and the result appears here.