The Podcast Journalism Landscape

Podcast journalism has grown from a niche format to a primary news consumption mechanism for significant audience segments, particularly in the 18–34 demographic. Edison Research's 2024 Infinite Dial study found that 47 percent of US adults aged 12+ had listened to a podcast in the past month, with news and current affairs representing approximately 28 percent of podcast listening. The New York Times' The Daily, BBC Global News Podcast, and The Guardian's Today in Focus are among the most-listened-to podcasts globally.

AI tools have transformed the economics of podcast journalism production, reducing post-production time by 60–80% and enabling single journalists to produce broadcast-quality audio content without audio engineering expertise.

AI Transcription: The Foundation

The most universally adopted AI tool in podcast journalism is automatic transcription. OpenAI Whisper and its commercial implementations (Otter.ai, Descript, Riverside) reduce the 4:1 time ratio of manual transcription (four hours of transcription per one hour of recording) to a 1:10 ratio (six minutes of review per one hour of recording). For podcast journalists, this unlocks both practical efficiency and editorial value: transcripts enable searchable episode archives, accessibility for deaf/hard-of-hearing audiences, show note generation, and content repurposing for newsletters and articles.

AI-Powered Audio Editing

Descript's "Studio Sound" feature uses AI to remove background noise, room echo, and audio inconsistency from recordings — enabling professional sound quality from recordings made in non-studio environments. Its "Remove Filler Words" feature automatically identifies and removes "um," "uh," and similar hesitations from transcripts, enabling precise audio editing through text deletion. These tools have eliminated the need for dedicated audio engineers in many podcast journalism productions.

AI Voice and Show Production

AI voice synthesis (ElevenLabs, Play.ht, Resemble AI) enables podcast publishers to generate audio versions of written articles in a consistent editorial voice — potentially at the cost of genuine human vocal presence that audience connection depends on. Several publications including The Economist and Bloomberg have deployed AI-generated audio versions of written content, with careful disclosure to audiences. The ethical line between accessibility tool (synthesised audio for print content) and deceptive practice (presenting synthetic voice as real human recording) requires explicit editorial policy.