Engine
Model
Audio language
Speakers Count
Vocabulary

Helps Whisper recognise technical terms. Not included in the transcript.

Drop audio file(s) here, or

MP3, WAV, OGG, M4A, FLAC, WEBMmax 200 MB per file

Advanced settings
Beam size

Controls search breadth — higher values improve accuracy but take longer. 5 is recommended for legal recordings.

VAD filter

Voice Activity Detection — skips silent passages before transcribing. Speeds up processing and prevents the model hallucinating on silence.

Ready

Select a tool, run a request, and the result appears here.