Speakers Identify speakers Count

Vocabulary

Helps Whisper recognise technical terms. Not included in the transcript.

Advanced options

Task Transcribe Translate to English

VAD filter Remove silence / noise Improves accuracy on recordings with long pauses.

Whisper model Used when Azure/GCP unavailable. large-v3 is the default.

AI cleanup None GPT-4o Mini GPT-4o Fixes errors, punctuation, and domain terms after transcription.

Drop audio file(s) here, or browse

MP3, WAV, OGG, M4A, FLAC, WEBM — max 200 MB per file

+ Add files

Select a tool, run a request, and the result appears here.