Question 1

Why use WAV instead of MP3 for transcription?

Accepted Answer

WAV is lossless — the audio is stored exactly as recorded with no compression. For transcription, this only matters at the edges: very quiet voices, heavily reverberant rooms, or speakers with subtle accents. On clean voice recordings, WAV and MP3 transcribe at nearly identical accuracy. WAV is more relevant if you want to keep a high-fidelity master and have the disk space to spare.

Question 2

What's the maximum WAV file size I can upload?

Accepted Answer

25MB on the free tier. WAV files are big — a 45-minute 16-bit 44.1kHz mono recording is about 230MB, which exceeds the free tier. Either compress to MP3 at 64 kbps mono first (drops to ~20MB with no accuracy loss), or upgrade to Pro for 500MB uploads.

Question 3

Does it support 24-bit and 96kHz WAV?

Accepted Answer

Yes. We accept 8/16/24/32-bit WAV at any sample rate from 8kHz to 96kHz, mono or stereo. The transcription engine downsamples internally to 16kHz mono before processing — the higher fidelity isn't useful for speech-to-text but doesn't break anything.

Question 4

Will compressing my WAV to MP3 hurt accuracy?

Accepted Answer

No. We tested side-by-side: identical sermon audio at 24-bit/48kHz WAV (~750MB) and 64 kbps mono MP3 (~22MB) produce transcripts that differ by less than 0.1%. The Whisper model is trained on compressed audio and is essentially blind to the difference. Compress freely.

Question 5

Can I upload a WAV from my church board mixer?

Accepted Answer

Yes — most digital mixers (Behringer X32, Allen & Heath, Yamaha) export multi-track WAV. Use the mixed-down stereo bus or the dedicated preacher mic track for best accuracy. Avoid using a room mic track as the source; it will pick up congregation noise and lower accuracy.

Question 6

What about multi-channel WAV files?

Accepted Answer

Stereo WAV is fine — we sum to mono before transcription. True multi-channel WAV (3+ channels, like 5.1 surround) is accepted but only the first two channels are read. If you have an isolated preacher mic on channel 3, mix it to a stereo or mono file in any DAW first.

Format	Free tier max	Pro max	Notes
WAV (16-bit / 44.1kHz)	25 MB ≈ 2.5 min	500 MB ≈ 50 min	Native — works at full fidelity
WAV (24-bit / 48kHz)	25 MB ≈ 1.5 min	500 MB ≈ 30 min	Broadcast standard — works native
WAV (24-bit / 96kHz)	25 MB ≈ 45 sec	500 MB ≈ 15 min	Studio quality — works native
AIFF (Apple)	25 MB	500 MB	Same as WAV — accepted
FLAC (lossless)	25 MB ≈ 5–8 min	500 MB	~50% smaller than WAV, same fidelity
MP3 / M4A / OGG (recommended for size)	25 MB ≈ 45 min	500 MB	Best fit if file size matters

Service	Cost / 45-min WAV	Accuracy	Max WAV size	Free tier
Sermon Transcription (Std)	$0.27	99.0–99.5%	500 MB (Pro)	10 min free
Sermon Transcription (Premium)	$0.90	99.5%+ with diarization	500 MB (Pro)	10 min free
Rev AI	$11.25	90–95%	2 GB	5 hours free
Rev human	$67.50	99%+	2 GB	None
Otter Pro	~$0.64 effective	90–95%	3 GB	300 min / mo (30-min file cap)
HappyScribe AI	~$9.00	85–92%	2 GB	None

Transcribe WAV to Text — Free + Accurate

What is a WAV file?

Step-by-step: WAV to text

Open the transcription tool

Drag your .wav file into the upload zone

Over 25MB? Compress first or upgrade

Pick Standard or Premium tier

Wait about a tenth of the audio length

Download .txt, .srt, .vtt, or .docx

Audio format & size compatibility

WAV-specific tips

The WAV transcription workflow

WAV transcription pricing vs alternatives

WAV transcription FAQ

Upload your WAV. Get text back in 5 minutes.

Related