internal field guide · under the hood

The machine behind the work: 14 agents, three layers, one compounding brain.

This is how Scale By Video actually runs, the infrastructure, the specialist agents, and the weekly learning loop that turns every client's results into a sharper next round. Internal reference, current as of June 2026.

Stack: a dedicated Hetzner box (sbv-prod-01) over Tailscale · markdown brain as source of truth · Anthropic Claude (Opus / Sonnet / Haiku) with native prompt caching + model tiering · the official Meta Ads MCP.

/ the three layers

Control, execution, memory, kept separate on purpose.

Each layer does one job. Paperclip decides and pays, OpenClaw runs the agents and touches the outside world, Hermes remembers and automates.

Paperclipcontrol + economics

The orchestration and budget layer. Routes incoming work to the right specialist agent, enforces a per-agent spend cap, and exposes the budget API that Ledger reads. Every agent is invoked through it.

paperclip.service · shared env at /etc/paperclip/env

OpenClawagent execution

The gateway that runs the 14 agents as subprocesses and connects them to the outside world: channels (Telegram capture via Scribe) and MCP connectors (the official Meta Ads MCP, HubSpot, Slack, Calendar). Where the agents actually do things.

openclaw-gateway.service

Hermesmemory + research + automation

The NousResearch runtime. Holds Honcho-backed memory, runs scheduled jobs and skills, and provides native gateway prompt caching. The agents' long-running thinking and recall live here.

hermes-gateway.service · 127.0.0.1:8642 · v0.14.x

The substrate underneath

The Brain

Markdown vault = source of truth. Contracts, doctrine, client notes, mesh signals.

Honcho

Memory daemon inside Hermes. Crystallizes learnings into vault notes nightly (2 AM).

Agent Vault

Credential broker. Secrets injected at runtime, never sit in configs.

Tailscale + Mail

Private network between Mac and VPS; per-agent email aliases.

/ how one task runs

Every agent reads before it acts, and writes when it's done.

01 · IN

Work arrives, from Ramon, a schedule, or an inbound lead

→

02 · ROUTE

Paperclip invokes the right agent under a budget cap

→

03 · RECALL

Agent's first move: read the vault + query Honcho memory

→

04 · ACT

Works via LLM + MCP connectors (Meta, HubSpot, Slack)

→

05 · WRITE

Results written back to the vault

→

06 · LEARN

Honcho ingests overnight → a richer next run

/ the roster

14 specialists, each on the right model for its job.

Cheap, deterministic work runs on Haiku; reasoning and client-facing work on Sonnet; only Boss's weekly synthesis touches Opus. That tiering plus prompt caching is what keeps the whole stack running on a few dollars a day.

◆ mesh

Boss Opus

Orchestrator. Routes work, approves deliverables, handles escalations, runs the Friday synthesis.

◆ mesh

Reach Sonnet

Meta campaign manager. Audiences, budgets, optimization, scaling. The Meta MCP operator.

◆ mesh

Scout Sonnet

Ecosystem watch. Meta algorithm/policy shifts, top-performer ad-library patterns. Feeds Reach.

◆ mesh

Probe Sonnet

Discovery & research. A two-page brief before every call; refines ICP and vertical packs.

◆ mesh

Studio Sonnet

Pre-production. Shoot scheduling, shot lists, creative briefs, footage handoff.

◆ mesh

Anchor Sonnet

Reporting. Daily monitoring, Friday Loom, bi-weekly report, health score, churn alerts.

◇ observer

Concierge Haiku

Lead response. First touch under 5 minutes, qualifies, books. Kept fast and cheap on purpose.

Pitch Sonnet

SBV's own outbound. Sources ICP-fit prospects, drafts cold outreach for approval.

Voice Sonnet

Brand voice & short-form. Ad copy, hooks, LinkedIn and organic content scripts.

Quill Sonnet

Long-form. Blog posts, case studies, lead magnets, and compliance review of ad copy.

Cutter Sonnet

Video editor. Hook variations (3 hooks × 2 formats per concept), captions, motion graphics.

Pipeline Haiku

CRM hygiene. HubSpot records, dedup, lead-source tags, qualified-lead counting.

Ledger Haiku

Internal finance. Weekly pipeline, revenue, per-agent API cost, and capacity.

Scribe Haiku

Capture surface. Receives Ramon's Telegram notes, classifies, writes to the vault in seconds.

Opus · high-judgment Sonnet · reasoning & client-facing Haiku · fast, deterministic ◆ mesh participant · ◇ observer

/ the moat

The optimization mesh: one agent's signal on Tuesday is informing the other five by Wednesday.

Without it, every client engagement is a clean slate, the same patterns re-discovered and forgotten. The mesh is a closed weekly feedback loop on ad performance. This is the compounding edge, not the tools.

/ 6 participants + 1 observer

Boss · integration & decisions Scout · external shifts Probe · customer signals Studio · creative patterns Reach · campaign performance Anchor · outcome data Concierge · observer (emit-only)

EACH WEEK

Every agent writes to its own path and drops a one-line pointer into shared by-type signal files (patterns, contradictions, questions, numbers, ideas).

FRIDAY 17:00 UTC

Boss reads the signals and hunts for 4 connection types, then writes 3–5 typed decisions into an append-only DECISIONS.md.

MONDAY 09:00

Participants load the synthesis. The next ad batch is built informed by what won and lost, across clients in the same vertical.

GUARDRAIL

A pattern stays niche-local until it replicates in 2+ verticals before graduating to shared signals. No cross-contamination.

/ three learning loops

It gets smarter on three clocks.

per action
Read & logEvery run starts with a vault + Honcho read and ends with a line in the daily note. Nothing is done from a blank slate.
nightly · 2 AM
Honcho ingestThe memory daemon crystallizes the day's transcripts and outcomes back into vault notes, so tomorrow's agents know more.
weekly · Friday
The meshSignals become decisions become the next batch. Anchor closes the loop: did the decision actually improve booked → showed → closed?

/ the operating rhythm

What the agents actually do, day to day, week to week, month to month.

This isn't a one-time setup that hands you a reel. It runs on three clocks, and the longer it runs the sharper it gets.

day to day
Keep the money movingConcierge answers every new lead in minutes and books it before it goes cold
Reach watches the campaigns, pauses fatigue, scales what's booking jobs, never killing top-of-funnel
Pipeline keeps the CRM clean and counts qualified leads honestly
Scribe captures notes; every agent reads the brain first and logs what it did
week to week
Get sharperReach reads the week's numbers, reallocates budget, tests fresh hooks
Anchor ships the Friday Loom and the written report
Scout flags Meta algorithm, policy, and top-performer shifts before they bite
The mesh: every Friday, Boss turns the week's signals into concrete decisions for the next ad batch
month to month
CompoundFresh creative replaces fatigued ads before they decay; lookalikes refreshed
Monthly retrospective (first Monday): what's working across the whole account
Each batch is built on what won and lost last month, across clients in the same vertical
Ledger reviews cost, revenue, and capacity, cost-per-booked-job trends down

/ efficiency

How it runs on a few dollars a day.

prompt caching

Native in the Hermes gateway since v0.4.0
The stable prefix (system prompt + brain docs) is cached; dynamic content goes at the tail so it never busts the cache
Cached tokens bill at ~10% — identical output, far lower cost
1-hour TTL so cron-spaced calls stay warm

model tiering

Haiku for the grunt work: capture, classification, dedup, lead replies
Sonnet for reasoning, creative, and anything client-facing
Opus gated to Boss's Friday synthesis only
No silent escalation — a cheap-tier failure errors, it doesn't quietly bill Sonnet

/ what changed recently

What's new since the last version.

The stack matured into three named layers new — Paperclip (control + budget), OpenClaw (execution), Hermes (memory + automation), plus the markdown brain and Honcho memory daemon.
Roster grew from 11 to 14 new — added Scout (ecosystem watch), Scribe (Telegram capture), and Ledger (internal finance).
The optimization mesh now exists new — a formal weekly loop, 6 participants + Concierge observer, with an append-only decisions log. This is the headline capability.
The official Meta Ads MCP is live new — Reach runs 100% natural-language Meta operations, with an account-level funnel guardrail (never kill top-of-funnel on last-click ROAS).
Cost optimization is doctrine now new — native prompt caching, Haiku tiering, Opus gated to Friday, real spend ~$13/day and falling.
The client portal is a 3-layer multi-tenant system — agents are the source of truth and write results to it (Supabase, per the current operational doc; reconcile vs the Vercel-Postgres reference before relying on it).