If you record podcasts, you’ve probably wondered what bitrate to publish at. Higher feels safer, but it costs your listeners (and your storage bill) bandwidth. Let’s settle the question.
The short answer
- Mono spoken word: 64 kbps is enough.
- Stereo conversational: 96 kbps sounds great.
- High-production music podcasts: 128 kbps stereo.
Most podcasters publish at 128 kbps stereo by default. That’s higher than necessary for voice but provides headroom for music interludes and intros.
Why not 320 kbps?
Voice has a much narrower frequency range than music. Once you’re above ~64 kbps for mono speech, you’re spending bits on inaudible nuances. A 60-minute episode at 320 kbps is ~144 MB; at 96 kbps it’s only ~43 MB. Listeners on mobile data will thank you.
Real-world examples
- NPR Fresh Air: 64 kbps mono.
- The Daily (NYT): 64 kbps mono.
- The Joe Rogan Experience: 96 kbps stereo (used to be higher).
- Music podcasts (e.g., Song Exploder): 128 kbps stereo.
Make your podcast sound consistent
Bitrate isn’t the whole story — perceived loudness matters more. Apply EBU R 128 normalization to make every episode hit the same target loudness (around −16 LUFS for podcasts). Our converter does this for you with the Normalize loudness toggle.
Workflow with MP4intoMP3
Many podcasters record with a camera that outputs MP4 (especially for video podcasts). Here’s the quickest workflow:
- Drop the MP4 onto mp4intomp3.com.
- Choose MP3 at 96 kbps, mono, 44.1 kHz.
- Toggle Normalize loudness.
- Add ID3 tags — at minimum title, artist, album, and year.
- Export and upload to your podcast host.
That’s it — broadcast-ready audio in two minutes.
Don’t forget mono
If your podcast is talking-heads only, mix to mono (set Channels to 1). You’ll halve the file size with zero perceptible quality loss for voice content.
Final word
Bigger is not always better. The right bitrate is the smallest one your audience can’t distinguish from the original. For voice, that’s 64–96 kbps. For music, 128 kbps. Use those defaults and your podcast will load faster, cost less to host, and sound exactly the same.