
The fastest way to edit a documentary is to make your story decisions on the transcript first, then hand the timeline a list of exact cuts to assemble. Read the words, mark the moments that matter, arrange them into a spine, and only then open your NLE. The picture follows the words. That is the whole trick, and it has been the documentary editor's trick for decades.
Most people get this backwards. They import forty hours of footage, start scrubbing, and three days later they have a vague feeling about a few good bits and no structure. Reading is faster than watching. You can skim a 60-minute interview transcript in eight minutes and know exactly where the gold is.
Documentary is a writing problem disguised as an editing problem. The story lives in what people say and the order you put it in. When you work from a transcript, you are working with text you can search, highlight, reorder, and cut without rendering anything. You find the arc on the page, then the timeline just executes it.
This is the paper edit, codified by Michael Rabiger in Directing the Documentary (Focal Press), with chapters literally titled From Transcript to Assembly and From Paper Edit to First Assembly. It is not a hack. It is the craft.
The catch is the handoff. A paper edit on sticky notes or in a Google Doc is a list of decisions that someone still has to find and cut by hand in the timeline. That re-finding is where the time goes. Solve the handoff and you have solved documentary editing speed.
You need word-level timecode, not just a clean script. A transcript that says a sentence happened somewhere in the file is useless for cutting. A transcript where every word is stamped to a frame is a cut list waiting to happen. If you are not sure how to get there, start with how to transcribe an interview.
Read each interview once, fast, and highlight the lines that earn their place: the surprising admission, the clean explanation, the emotional beat, the line that makes you lean in. Do not overthink it on the first pass. You are looking for the best soundbites, the load-bearing sentences.
Inside each select, kill the filler and the false starts. The 'um,' the 'you know,' the three-times-restarted sentence. A tight select reads clean and cuts clean. See how to remove filler words for the fast way to do this.
Now sequence your selects into a story. Setup, tension, turn, payoff. Reorder freely, because nothing is rendered yet. This is the cheapest place in the entire pipeline to be wrong, so be wrong here several times until it sings.
Hand your NLE the decisions. Export an XML, FCPXML, or EDL that drops every selected clip onto the timeline in your order, trimmed to the words you kept. Open it in DaVinci Resolve, Premiere Pro, Final Cut, or Avid and you are looking at a rough assembly in minutes instead of days. This is exactly what paper-edit software is for.
Say you shot a 90-minute founder interview plus 40 minutes of B-roll for a brand documentary. Old way: import everything, scrub, log, and you are a day and a half in before you have a structure. Transcript-first: read the 90 minutes in about twelve, highlight roughly 14 minutes of strong material, tighten it to 9, arrange those selects into a five-beat spine on the page, export, and conform. You walk into your NLE with a 9-minute assembly already on the timeline, every clip trimmed to the line you kept. Then you do the real work: pacing, B-roll, music, color.
That is the point. Transcript-first does not replace editing. It deletes the most tedious, least creative part so you spend your hours on the part that shows up on screen.
A transcript flattens performance. It cannot tell you that the line landed flat, that the pause before it was the whole point, or that the laugh underneath changed the meaning. Errol Morris is blunt about this. In a conversation with Transom he said he never edits from transcripts: 'Paper cuts give you a very false idea... I edit from the film, never from the transcripts.'
He is right that text alone is a false friend. The answer is not to abandon transcript-first, it is to verify. When your selects carry word-level timecode, you can play any line straight from the transcript and hear the take before you commit it. You read for structure, you listen for tone, and you only lock a select once both agree. That closes the gap Morris is warning about while keeping the speed.
Reordering and tightening a subject's words is normal documentary practice. Cutting a rambling answer down to its clear core is honest editing. Stitching words the person never said into a sentence they never spoke, the so-called Frankenbite, is not. Manfred Becker draws this line clearly in Creating Reality in Factual Television (Routledge). Tighten and rearrange, yes. Fabricate meaning, never.
ScriptCut is the pre-edit. You get your footage transcribed with word-level timecode, read and highlight your selects, remove fillers, arrange the spine on the page, play any clip to check the take, and export a timeline (XML, FCPXML, EDL, plus subtitles and a clean audio cut) straight into Resolve, Premiere, Final Cut, or Avid. It does the decision layer and hands the NLE a ready assembly. It does not replace your editor, it gets them to the creative work faster. Start free at ScriptCut.
Make your story decisions on the transcript first, then verify each line on the footage before you lock it. Reading is faster for finding structure; watching is essential for judging tone. Word-level timecode lets you do both without choosing one.
Far less than scrubbing. You can read a 90-minute interview in roughly twelve minutes, mark selects, and export a timeline that conforms in minutes, versus a day or more of logging tape by hand.
Tightening and reordering what someone actually said is standard, honest documentary practice. Stitching together words to make them say something they never said (a Frankenbite) is not. Keep the meaning true.
An XML, FCPXML, or EDL timeline with every selected clip trimmed to the words you kept, plus subtitles and a clean audio cut, ready to open in DaVinci Resolve, Premiere Pro, Final Cut Pro, or Avid.