
To make a highlight reel, transcribe your footage, mark the strongest moments in the transcript, arrange them into a short beat that earns each cut, and export a timeline to your editor instead of scrubbing clips on a timeline for hours. The reel itself is usually 60 to 120 seconds. The work that makes it good happens before you ever touch the timeline.
I have built reels from conference panels, founder interviews, event recaps, and sports days. The footage is always too long and the deadline is always too short. The mistake almost everyone makes is opening the NLE first and start trimming. You end up married to the first decent clip you find, and the reel becomes a list of okay moments instead of a built thing.
A highlight reel is a selection problem before it is an editing problem. You are choosing maybe two minutes out of two hours. Reading is far faster than watching for that kind of triage. The meta-analysis by Marc Brysbaert puts average adult silent reading around 238 words per minute for non-fiction, while conversational speech runs closer to 150. You can scan a 90-minute transcript in the time it takes to half-watch the first ten minutes of footage.
So the first move is to transcribe the footage with word-level timecodes, then read it like a producer marking a script. This is the paper edit approach, the same method documentary editors have used for decades. You are not editing video yet. You are deciding what belongs.
Be ruthless about what counts as a highlight. A moment makes the reel only if it does one of these:
Everything else is context, and context belongs in the full edit, not the reel. If you find yourself keeping a clip because it is informative, that is a sign it is not a highlight. Highlights hit, they do not explain.
Say you shot a four-person panel for a product launch and the client wants a 90-second sizzle for social. Here is the order I work in.
First, I read the whole transcript and tag selects. Out of 75 minutes I usually find 15 to 20 candidate moments. That is already too many for 90 seconds, which holds roughly six to nine real beats, so the next pass is cutting selects against each other.
Second, I arrange. I open with the single strongest line, the one that makes someone stop scrolling. I do not save the best for last on a reel, because most people never reach the end. This is the same logic as a cold open: lead with the payoff.
Third, I trim inside each select. People ramble into and out of a good point. I cut the lead-in ("so, you know, I think what we found was") and land on the clean version of the line. Removing filler words alone often shaves 15 percent off a select without touching meaning.
Fourth, I export the arranged selection as a timeline to DaVinci Resolve or Premiere. Now the b-roll, music, and graphics work happens against a structure that already holds together, instead of being the thing I am using to paper over a weak edit.
A reel is not a random clip dump in descending order of quality. The ones that perform have a shape:
Vary energy on purpose. Two intense moments back to back cancel each other out. A loud line, then a quiet honest one, then a funny one reads as a story. The same three moments shuffled into loud-loud-loud reads as noise.
The reel is too long. Ninety seconds feels short when you have great footage, but a tight 60 beats a baggy 120 every time. If you cannot decide between two good moments, the reel is the wrong place to keep both.
You buried the hook. If your best moment is at 0:40, the people who would have loved it left at 0:20. Move it to the front.
You confused a reel with a selects reel. A selects reel is an internal working document, every usable take in order. A highlight reel is a finished, audience-facing cut. Do not hand a client a selects reel and call it a highlight reel.
Music drives the cut and the words become decoration. On a soundbite-led reel, the words carry the story and the music supports them. Cut to the line, not to the beat.
The slow part of a reel was never the cutting. It was the scrubbing, finding the moments, second-guessing, re-watching. Doing the selection in the transcript collapses that. You read, you mark, you arrange, and the timeline you open in your editor is already the reel in rough form. The polish, the b-roll, the grade, that stays in your NLE where it belongs.
ScriptCut is built for exactly this pre-edit step: transcribe, mark your selects, trim fillers, arrange the order, then export an XML, EDL, or subtitle file straight into Resolve, Premiere, Final Cut, or Avid. You spend your time choosing the moments, not hunting for them.
A highlight reel is won in the selection. Read the transcript, keep only moments that hit, lead with the best one, and bring a finished structure into your editor. The reel almost makes itself once the right moments are sitting in the right order.
Most highlight reels land between 60 and 120 seconds. For social, lean toward 60. A tight reel that holds attention beats a longer one that loses people halfway. Length should follow the strength of the footage, not the amount of it.
A selects reel is an internal working document containing every usable moment, in order, for the team to review. A highlight reel is a finished, polished cut meant for an audience. Use the selects reel to choose what goes in the highlight reel.
Read the transcript and keep only moments that land a quotable line, show real emotion, pay off something the viewer cares about, or carry a strong visual. If a clip mainly explains or informs, it belongs in the full edit, not the reel.
Yes. Transcribe the footage with word-level timecodes and read it to find your moments, which is far faster than watching everything. You only need to watch the specific moments you have already chosen, to confirm the visuals work.