Drop a video.
CapForge does the rest.


Six presets.
Infinite tweaks.


Locally. Privately. Yours.

What's inside

Built for speed.
Built for craft.

Every feature is there because a creator asked for it. Nothing upsold, nothing hidden behind a paywall.

On-device AI

Whisper transcription runs on your machine. No upload, no queue, no cloud bill.

Word-level timing

Each word gets its own timestamp. Karaoke, per-word highlighting, precise sync. All free.

99+ languages

Auto-detect or pick a language. Works on whatever your subject speaks.

Six caption presets

Highlight, Karaoke, Bounce, Reveal, Script, Chunky. Each is a fully editable starting point.

Runs offline

Plane, cafe, bunker. If your machine has power, CapForge works. No network required.

Transparent export

Bake captions into your video, or export a transparent MOV to composite anywhere.

Pricing

It's free.
Forever.

CapForge is independent software. No subscription, no tiers, no limits on how many captions you forge. If it saves you hours, consider buying me a coffee on Ko-fi.

Questions people ask

Quick answers

Everything you'd reasonably wonder before downloading.

Is CapForge really free?

Yes. No trial, no subscription, no feature paywall. It's a passion project. If it saves you time, you can support it on Ko-fi, but you never have to.

Does it send my video to a server?

No. All transcription happens locally using an on-device Whisper model. Your clip never leaves your machine. You can airplane-mode the whole process.

Which languages are supported?

99+ languages supported by the Whisper family of models, including English, Spanish, French, German, Portuguese, Italian, Dutch, Russian, Chinese, Japanese, Korean, Arabic, Hindi, and more. Auto-detect is on by default.

What export formats are supported?

Video: MP4, MOV, and transparent MOV (ProRes 4444 with alpha). Subtitles: SRT, VTT, and plain text. Encoding is hardware-accelerated where available.

Is it open source?

The application is distributed freely and the source lives on GitHub. See the repository for license details and release history.

What are the system requirements?

macOS 12 Monterey or later on Apple Silicon, or 64-bit Windows 10/11. Transcription is fastest with at least 8 GB of RAM and a modern CPU (Apple Silicon runs it very efficiently).

Ready?

Stop typing.
Start forging.