Key takeaways
- A hook is the first 1–2 seconds of a short-form ad; on paid social it decides most of the watch-through, so it deserves more test variants than any other part of the creative.
- Most working UGC hooks fall into five families: problem call-out, pattern interrupt, social proof, curiosity gap, and direct claim or demo.
- With an AI pipeline, one validated ad body can be re-hooked cheaply — same 25–35 seconds of body, a new first shot — so a single concept becomes 5–10 testable ads.
- Hooks fatigue in days on paid social, not weeks, which is why a library of patterns plus cheap variant production beats polishing one opening shot.
- Each hook variant needs its own brief: spoken line, on-screen text, first visual, framing, and emotional register — not just a new sentence pasted over the same footage.
A hook is the first one to two seconds of a short-form ad — the single shot and line that decide whether a viewer keeps watching or scrolls. After producing AI UGC ads daily, we keep returning to roughly 20 hook patterns across five families: problem call-out, pattern interrupt, social proof, curiosity gap, and direct claim or demo. Below is the full library — each pattern with a one-line template and the situation where it earns its keep — plus the part most articles skip: how to turn one validated ad into many hook variants cheaply, because every hook dies within days anyway.
Why does the first 2 seconds matter more than the other 28?
On feed placements, the platform makes a keep-or-skip decision for the viewer almost instantly, and the viewer makes one for themselves right after. If the opening shot doesn’t stop the scroll, nothing downstream — the demo, the offer, the edit — ever gets seen. That’s why we treat the hook as a separate creative unit with its own iteration budget. The body of an ad can survive weeks of rotation; the hook is the part that burns out first and the part where a small change moves results the most.
This asymmetry is also what makes AI production a good fit for hooks specifically. In our keyframe-first pipeline, a hook variant is one new keyframe, one new 4–6 second i2v clip, and a re-render of the assembly — not a reshoot. The body stays untouched. Cheap variants change how you write hooks: you stop hunting for the one perfect opener and start testing a spread of patterns.
Family 1: problem call-out hooks
These hooks name the viewer’s pain before naming the product. They self-select hard: people without the problem scroll past, people with it stop. That filtering is a feature — watch-through and intent both rise even if raw hook rate drops.
- Direct pain call-out — “If your [X] still does [annoying thing], watch this.” Works when the pain is common, concrete, and slightly embarrassing to admit.
- The “stop doing this” opener — “Stop [common behavior] — it’s why your [X] isn’t working.” Works when the audience already tries to solve the problem the wrong way.
- Cost-of-inaction — “Every week you keep [doing X], you’re losing [time/money/result].” Works for B2B-ish and utility products where the waste is quantifiable.
- The mirror — actor restates the viewer’s exact inner monologue: “I knew I needed [X], I just kept putting it off.” Works for considered purchases with guilt or procrastination attached.
Family 2: pattern interrupt hooks
Pattern interrupts win the first half-second visually, before a single word lands. They’re the most placement-dependent family — what interrupts a polished feed looks normal in a chaotic one — and the most prone to clickbait decay if the body doesn’t pay off the weirdness.
- Mid-action open — the clip starts in the middle of something already happening (pouring, dropping, unboxing half-done). Works because there’s no “intro” to skip; the brain wants to resolve the action.
- The wrong-place product — the product appears somewhere it shouldn’t be (a coffee cup on a gym bench, skincare in a car). Works for visually distinctive products with strong brand color.
- Whisper or silence open — the actor leans in and whispers, or there’s a beat of dead silence before the line. Works in sound-on placements where every other ad opens loud.
- Visual glitch or freeze — a deliberate freeze-frame, rewind, or jump cut in the first second. Works for younger, edit-literate audiences; reads as broken to older ones.
Family 3: social proof hooks
Social proof hooks borrow trust the brand hasn’t earned yet. The honest rule we hold ourselves to: never fabricate numbers, reviews, or named customers. The pattern works fine with real, verifiable claims — and an AI actor delivering a true claim is still a true claim.
- The reluctant convert — “I genuinely didn’t believe this would work.” Works for skeptical categories (supplements, productivity tools) where doubt is the default.
- Crowd reference — “Everyone keeps asking me about [X], so here it is.” Works when the product is visible in public — bags, drinks, gadgets, anything strangers comment on.
- The recommendation relay — “My [sister/trainer/dentist] told me to try this.” Works because borrowed authority feels less like an ad than a first-person pitch.
- Honest-review framing — “Real talk: here’s what’s good and what’s not.” Works mid-funnel and for retargeting, where the viewer has seen the polished version already and wants the catch.
Family 4: curiosity gap hooks
Curiosity hooks open a loop the viewer has to keep watching to close. They tend to post the highest hook rates in our tests and the steepest drop-offs when the payoff lands late — the gap buys you seconds, not the full ad. Put the payoff inside the first 10 seconds or the loop snaps.
- The withheld object — “I can’t believe nobody talks about this” while the product stays just out of frame. Works when the reveal itself is visually satisfying.
- Before-the-after — open on the “after” state with “this took me 12 days” framing, then rewind. Works for transformation products: skin, fitness, home, organization.
- The forbidden angle — “My [industry] friends will hate me for sharing this.” Works for products that undercut an expensive incumbent (salon, agency, gym).
- The unfinished sentence — the spoken line cuts off at the most loaded word and the next shot answers it. Works as a pure editing trick; pairs with almost any body.
Family 5: direct claim and demo hooks
The unfashionable family that quietly performs. No misdirection — just the product, the claim, and proof, immediately. These hooks have the lowest hook rates and often the best cost per action, because everyone who stays past second two is already qualified.
- Claim plus countdown — “This removes [problem] in under 30 seconds — watch.” Works when the demo is genuinely fast and visual; never use it when it isn’t.
- The side-by-side — split screen of with/without from frame one. Works for anything with a visible delta: cleaning, color, texture, speed.
- Price anchor open — “This costs less than your [daily coffee/one gym visit].” Works for impulse price points where the objection is purely “is it worth it”.
- The blunt demo — no spoken hook at all: hands, product, action, result, in the first two seconds. Works in sound-off placements and as the control variant every test should include.
How do you turn one concept into ten hook variants with AI?
Traditional UGC makes hook testing expensive: every variant means re-briefing a creator, waiting for a new take, and hoping the energy matches the original. In an AI pipeline the body of the ad — the demo, the proof, the offer — is a fixed, already-validated asset. Re-hooking it is a small, contained production job, and because we anchor every shot on a graded keyframe, the new hook actually matches the world of the body: same actor identity, same location, same outfit, same light. The mechanics (and costs) of that workflow are covered in our AI UGC cost breakdown; the hook-swap process itself looks like this:
- Pick the validated body: a 25–35 second ad where the middle and end already hold retention — only the opening underperforms or has fatigued.
- Choose 3–5 hook patterns from different families above. Same-family variants tell you which line is better; cross-family variants tell you which psychology works, which is the more valuable answer early on.
- Write each hook as a full micro-brief (spoken line, on-screen text, first visual, framing — see the template below), not just a new sentence.
- Generate keyframes for each hook using the same Scene Bible as the body — one locked location, one outfit, the same palette and lens feel — so the cut into the body is invisible.
- Grade the stills and animate only the keepers via image-to-video. Expect 2–4 keyframe candidates per hook to keep one; rejecting a bad still costs cents, rejecting a bad clip costs minutes.
- Re-render the assembly per variant on the same beat grid so the first cut still lands on a music onset, then ship all variants into the same ad set and let spend decide.
How fast do hooks fatigue — and what does that mean for volume?
On paid social, a winning hook fatigues in days, not weeks. The audience that responded to it gets reached, frequency climbs, and the scroll-stop effect of the opening — especially for pattern interrupts and curiosity gaps — decays fastest precisely because it depends on surprise. The body of the ad ages much more slowly; the proof and the offer don’t stop being true.
That math is the whole argument for volume. If a hook lives roughly a week and you want an always-on account, you need a continuous supply of fresh openings — which is brutal when each one is a shoot, and routine when each one is a keyframe plus a short i2v clip. This is also why we’d rather ship five honest 7-out-of-10 hooks this week than one polished 9 next month: the polished one dies on the same schedule. For how this fits into a full production cadence, see our complete AI UGC guide.
What to specify when briefing a hook variant
A vague brief (“make it punchier”) produces interchangeable variants. Each hook variant we generate is specified down to the frame. Per variant, write down:
- Pattern and family — name the pattern from the library (e.g. “reluctant convert”, social proof family) so the intent is explicit and results can be grouped by family later.
- Spoken line — the exact words for seconds 0–2, written for speech, not print. Read it aloud once; if it takes longer than two seconds, cut it.
- On-screen text — the burned-in overlay, max 5–7 words, and where it sits so it survives platform UI (avoid the bottom third).
- First visual — what is literally in frame at 0:00: actor, product, action already in progress, and the camera framing (close-up, medium, POV).
- Emotional register — one word: skeptical, excited, conspiratorial, deadpan. This drives the actor’s expression in the keyframe and the motion prompt in i2v.
- Continuity constants — which Scene Bible elements must carry over from the body (location, outfit, palette) so the cut from hook to body doesn’t read as two different ads.
- The cut point — the exact frame of the body this hook lands into, so the editor (or the assembly script) doesn’t guess.
What we’d do in your place
Don’t start by testing all 20 patterns. Take your best existing ad — or one validated concept — and build a first batch of five hooks, one from each family: one problem call-out, one pattern interrupt, one social proof, one curiosity gap, one blunt demo as the control. Run them in one ad set for a few days, note which family wins, then spend the next batch exploring inside that family. Two cycles of this usually tells you more about your audience than a quarter of single-ad testing.
If you’d rather not build the pipeline yourself, this loop — validated body, fresh hooks weekly, machine-graded keyframes, beat-synced assembly — is exactly what we run for clients as AI UGC ads at SHOT.IS. Bring one concept; leave with a hook library that refreshes itself.