PORTAL / LIBRARY / transcribe

[ REPORTING & OPS ]

/transcribe

For input `~/Downloads/sales-workshop.mp4` with `--topic hormozi --slug sales-workshop`:

Download the skill file (.md)

Placeholders like ACME Agency, <id> and you@example.com mark values that are per-agency — your install fills them with YOUR clients and accounts. If a section references a helper script you don't have yet, it ships with that workflow's install.

Transcribe — Long-Form Video/Audio → Knowledge Base

Triggers

What it produces

For input ~/Downloads/sales-workshop.mp4 with --topic hormozi --slug sales-workshop:

.claude/context/hormozi/
├── README.md                     # Index — script appends a row here
├── transcripts/
│   └── sales-workshop.md         # Verbatim, segment-timestamped (Whisper)
├── synthesis/
│   └── sales-workshop.md         # Structured synthesis (Sonnet, mimics existing book format)
└── slides/sales-workshop/        # Only if --with-slides
    └── slide_0001.png ...

The transcript file is archival — large, never auto-loaded into agents. The book file is what agents Read on demand for retrieval. Match this distinction when deciding what flags to pass.

Inputs accepted

Cost (rough)

Component4hr video
Whisper API~$1.45
Vision (slides, ~50 imgs)~$0.05
Synthesis (Sonnet 4.6)~$0.50
Total~$2

Wall time: ~5-10 min for a 4hr video (Whisper runs 4-way parallel).

Workflow

Step 1 — Confirm intent with the user

Before running, confirm:

Step 2 — Run

node shared/transcribe.mjs "<input_file>" --topic <topic> --slug <slug> [--with-slides]

Run from the repo root (c:\Users\faris\agency-os). The script:

  1. Extracts audio with ffmpeg (16kHz mono mp3)
  2. Splits into ~10-min chunks
  3. Sends chunks to Whisper API in parallel (4 simultaneous)
  4. Stitches segments back with timestamps
  5. (if --with-slides) detects slide changes via ffmpeg scene detection, screenshots each, captions each via Claude Haiku
  6. Writes transcript markdown with inline screenshots
  7. (if synthesis enabled) Calls Claude Sonnet 4.6 with the topic's existing synthesis file as a style reference, produces a structured synthesis, writes to synthesis/<slug>.md
  8. Appends a row to <topic>/README.md

Step 3 — Hand back to the user

After the script finishes, output:

Step 4 — Refine the synthesis

The script's Sonnet pass is a strong V1 but won't match a hand-edited book like 100m-money-models.md in depth. After the run, suggest:

"Open synthesis/<slug>.md — the synthesis is solid but a hand pass to add cross-references to the Cross-Book Playbook (in the topic README) and to flesh out the Application Map will make this much more useful long-term."

Don't do this pass automatically — it's slow and judgment-heavy. Let Faris choose to do it.

Topic-specific notes

hormozi

  1. Update the Cross-Book Playbook in the topic README with new principles
  2. Update the Application Map with new tactics for Faris's businesses

Other topics

Common pitfalls

Why this exists

Long-form video is the highest-density learning material Faris consumes (Hormozi workshops, Brunson talks, sales training, prospect deep-dives), but it's the hardest to retrieve from. This skill turns that material into structured, agent-readable knowledge that the copywriter, media-buyer, and sales-ops agents can pull from when designing offers, ad copy, and money models.