DR.BROR Creator tools store
Instant email delivery·Secure Stripe checkout
← All guides
Guide · Updated 2026-05-25

Writing TikTok hooks for music — what the first three seconds need to do

TikTok hook patterns for independent artists, by emotion, with the structural rules that decide whether your clip survives the swipe.

The first frame decides everything

On TikTok, the median view length on a music post is under three seconds. The swipe doesn't punish the chorus — it punishes whatever you put before it. The first frame is doing one job: opening a tension loop the viewer can't close without watching the next two seconds. Anything else (logo, slate, intro card, 'wait for it' captions) is a tax on retention.

Five hook frames that consistently retain

Curiosity-gap ('the moment a lyric stopped making sense'), prediction ('most people guess wrong'), low-stakes confession ('I wrote this in a fight'), counterfactual ('this song almost didn't get released'), and concrete specificity ('the second line is about my dad'). Pick one frame per post — mixing them collapses the loop. The hook in the caption and the hook on the screen don't have to match, but they can't fight each other.

On-screen text is the hook, not decoration

Sound-off viewing is the default mode. The on-screen text within the first second is the hook, regardless of what the audio is doing. The text should be one short line, high-contrast, in a position the algorithm doesn't crop in the For You feed (avoid the bottom 20% of the frame, the username overlay zone). Caption text and on-screen text are two separate hooks — write both.

Match the hook to the song's emotional shape

Heartbreak songs do not retain on hype hooks. Hype songs die on confession hooks. The closer the hook's emotional register sits to the song's, the longer the average view length, because the viewer's expectation is met instead of bait-and-switched. Curiosity-gap hooks work on almost anything because they don't commit to a register; that's also why they can feel generic and need a strong second beat.

The second beat — what you put at 0:03

If the first frame is the hook, 0:03 is the proof. Most music posts that retain past 0:03 do one of three things: drop into the chorus, reveal the line the hook teased, or smash-cut to a visual that confirms the emotional register. Posts that just keep showing more of the same frame don't make it to 0:05.

Frequently asked

Do I need to use a 'wait for it' style caption?

Sometimes. The 'wait for it' frame is a curiosity-gap hook and it works, but it's so common now that it feels generic on most music content. If your reveal is genuinely surprising, use it. If it's just the chorus, find a sharper hook.

Does the algorithm reward longer videos or shorter ones?

Average view time is what matters, not length. A 10-second video with 90% average view time outperforms a 60-second video with 25% average view time. Make the video as long as the hook can hold.

How important is the song choice for the post?

If you're promoting your own track, you don't have a choice — the goal is to make that song's hook land. If you're testing which moment of the song works best as a TikTok hook, post three clips with three different snippets and read the retention curves to decide.

Should I show my face?

Not required. Anonymous, faceless music posts retain fine if the on-screen text and the audio are doing the work. Face shots only beat faceless when the artist's expression is part of the hook (vulnerability, reaction, lip-sync conviction).