Shorts Captions Guide, Line Length, Fonts, and Emphasis (2026)
TL;DR: Shorts captions should be 5 to 7 words per line, 1 to 2 lines, placed in the lower third of the screen (never at the very bottom), in a bold sans-serif with a black background box or outline. Add single-word emphasis and you can hold viewers who watch with the sound off all the way to the end.
A large share of Shorts viewers watch with the sound off. That makes captions in Shorts not decoration but the main event. One well-made caption track can change your retention. Let's go through the rules one by one.
Line length: 5 to 7 words per line, 2 lines max
A vertical frame (1080x1920) is narrow, so if you use long-form caption lengths (a full sentence per line), the text either shrinks or spills into three lines. 5 to 7 words per line (roughly 30 characters) is the limit for reading at a glance.
- Bad: "Today I'm finally going to share the tip I've been saving for ages" (shown all at once)
- Good: "Today I'm finally sharing" then "the tip I've been saving" (split into two clips)
If you set a per-line character limit at the transcription stage, the splitting happens automatically. Both bakecut and tools like Vrew have this setting.
Position: lower third, but never the very bottom
The bottom edge of a Shorts screen is covered by platform UI: the title, channel name, and buttons. Push your captions too low and they get hidden. The sweet spot is 20 to 30% up from the bottom of the frame, which is both safe and natural for the eye. Also check that the captions don't cover the speaker's mouth.
Font: bold sans-serif by default, handwriting only as an accent
To be readable fast on a small screen, a heavy sans-serif (Montserrat Bold, Archivo Black, Inter Black, and similar) is the default. Handwriting and script fonts hurt readability, so save them for titles or a single accent shot.
For size, around 5 to 8% of the screen width works well. The field test: "it has to be readable holding the phone at arm's length."
Background: a box or an outline, pick at least one
Shorts backgrounds swing wildly in brightness from scene to scene, so plain white text will always disappear somewhere. Two options:
- Semi-transparent black box: best readability, covers more of the frame
- Outline (stroke): covers less of the frame, looks clumsy if too thick
The current Shorts trend is a rounded-corner black box with white text.
Emphasis: color only the key word
If the whole sentence looks the same, the eye slides right off. Highlight just one word per clip in yellow or a neon color and the gaze locks on. Go one step further and you get the "word highlight" effect where each word changes color as it's spoken, which works especially well in Shorts. This word-level emphasis is a built-in feature in bakecut, while in many other tools you have to build it manually one word at a time.
Emojis follow the same principle: one above the key word, never plastered on every sentence.
Timing: in sync with the voice, not even 0.1 seconds late
In Shorts, viewer patience is measured in single seconds. If a caption appears after the word is spoken, the lag is immediately noticeable. Automatic transcription tools capture per-word timing for you, so the recommended flow is transcribe first, then edit, rather than placing captions by hand. If accuracy is your concern, start with 5 ways to improve auto-caption accuracy.
Wrap-up: for Shorts captions, the tool is half the battle
bakecut is a tool that applies everything in this guide at once: line-length splitting, word emphasis, and emojis. It splits lines to a readable length during transcription, and word highlighting and emoji effects are built in, making it the fastest way to caption Shorts. Works on Mac and Windows, and you can start for free.
FAQ
How long should each caption line be in Shorts?
5 to 7 words per line, 2 lines max. Beyond that, the text shrinks or wraps into extra lines, and reading speed can't keep up with the video.
Where should I place captions in Shorts?
Centered, 20 to 30% up from the bottom of the screen. Avoid the very bottom, where YouTube and Instagram UI will cover them.
What font should I use for Shorts captions?
Heavy sans-serifs like Montserrat Bold, Archivo Black, or Inter Black are the standard. Check that the license allows free commercial use.
Should I burn captions into the video or upload them separately?
For Shorts, Reels, and TikTok, captions burned into the frame (hard subtitles) are the norm. Platform caption features offer limited styling, so designed captions are only possible by burning them in. The differences are covered in hard vs soft subtitles.
Subtitles on your computer, no upload.
The AI subtitle editor that styles every single word