Podcast Editing Workflow: From Raw Recording to Final Cut
Podcast production looks different at every level of scale, but the core workflow is consistent. Whether you're editing a solo show in GarageBand or producing a narrative series in Reaper, the same stages apply — and understanding each one makes you a faster, more deliberate editor.
This guide covers the full end-to-end workflow, with a focus on how transcript-based editing fits into the process.
Stage 1: Recording During session
Good audio starts at the recording stage. No amount of editing fixes fundamentally bad audio. The basics: close-microphone technique (3–6 inches from the mic), headphones for all speakers to avoid bleed, a quiet room, and recording each speaker on a separate track if possible.
For remote interviews, tools like Riverside.fm and Zencastr record local audio from each participant, avoiding the compression and artifacts of Zoom. This matters more than most people realize — Zoom audio that's been re-encoded sounds noticeably worse when edited, especially after noise reduction.
Key output: Raw WAV files for each speaker, recorded at 48kHz/24-bit or better. Keep backups immediately.
Stage 2: Intake and organization 15–30 min
Before you start editing, get organized. Create a project folder with a consistent structure:
raw/— original, unedited audio files (never touch these)edit/— your DAW project filestranscript/— transcript filesexport/— final rendered audio
Label your files consistently. "EP047-interview-jane-raw.wav" is much better than "recording (1).wav" six months later when you're looking for a clip.
Do a quick listen to check audio quality. Catch problems early: excessive background noise, technical failures, one speaker much louder than the other.
Stage 3: AI transcription 1–5 min
Upload your main audio file to a transcription service and get a word-level transcript. With modern AI transcription tools like Deepgram, this takes 1–2 minutes for an hour of audio. In Paper Edit, it's automatic on upload.
The transcript is the raw material for the next stage. Don't proofread or correct it yet — you're about to read through it as an editor, not a proofreader.
Stage 4: Structural edit (paper editing) 45–90 min
This is the heart of the workflow. Read through the transcript with editorial eyes: What's the story? What's the best material? What can go?
The paper editing process has four sub-stages:
- First read: Get the lay of the land. No marking yet.
- Markup: Highlight keepers, cross out cuts, flag maybes.
- Script building: Arrange selected passages into narrative order.
- Review: Read through the script as a whole. Does it hold together?
At the end of this stage you have a script — a precise list of audio segments, in order, with start and end times. This is the "paper edit."
Stage 5: Rough assembly 15–30 min
Convert the paper edit into an audio timeline. If you're using Paper Edit, this means exporting the Reaper RPP project file — all your clips are automatically placed on the timeline. If you're working manually, import your cuts into your DAW in order.
Do a quick listen-through of the assembled rough cut. Is the overall shape right? Are there gaps that need bridging narration? Make notes but don't make micro-edits yet.
Stage 6: Fine edit 30–60 min
Now you're in the DAW doing waveform work. This is where you:
- Tighten cut points — trim clips so they start and end cleanly
- Add crossfades between clips to smooth transitions
- Remove individual breaths, clicks, and mouth noises
- Fix any problems with the assembly: abrupt cuts, missing context, timing issues
This pass goes fast because all the structural decisions are already made. You're polishing, not deciding.
Stage 7: Music, sound design, and production 15–45 min
Add the production elements that define your show's identity: intro music, outro, transition music, ads/sponsorship reads, chapter markers if your feed supports them. This is where the episode starts to feel like a finished product rather than an edited conversation.
Keep music levels roughly 20–25 dB below speech. Music that competes with speech is one of the most common mistakes in podcast production.
Stage 8: Mix and master 20–40 min
The mix is about balancing everything — relative levels of all elements, EQ to remove problematic frequencies, dynamic control. The master is about the final loudness and loudness consistency. For podcasts:
- Target loudness: -16 LUFS integrated (Spotify, Apple Podcasts standard)
- True peak ceiling: -1 dBTP
- Noise floor: Below -60 dB is fine for most content
Tools like Auphonic can automate a lot of the loudness and noise reduction work. For a manually-mixed episode, a simple chain is: noise reduction → EQ → compression → limiter.
Stage 9: Export and publish 10–20 min
Render the final MP3 (128kbps stereo or 64kbps mono are standard). Add ID3 metadata: title, episode number, artwork, description. Upload to your podcast hosting platform and schedule. Done.
Keep your project files. You'll want them if you need to re-export, fix an error, or re-use a clip.
How Long Does the Full Workflow Take?
For a 60-minute raw interview, rough estimates with a paper editing workflow:
- Intake + transcription: 20–35 min
- Structural edit (paper editing): 45–90 min
- Rough assembly + fine edit: 45–90 min
- Production + mix + export: 45–90 min
- Total: ~2.5 to 5 hours
Without the paper editing step, structural editing alone in a DAW typically adds 1–2 hours. The transcript-based approach pays back that investment every episode.
Also see: how AI podcast editing works — a deeper look at where AI tools fit in this workflow.
Speed up your editing workflow today
Paper Edit handles transcription, structural editing, and Reaper export — the three most time-consuming steps — in one place.
Try Paper Edit free →