<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://costder.github.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://costder.github.io/" rel="alternate" type="text/html" /><updated>2026-05-30T14:28:06+00:00</updated><id>https://costder.github.io/feed.xml</id><title type="html">The Year of 3K</title><subtitle>Multi-agent architectures, open standards, and the shapes things take when nobody&apos;s watching. Vectorheart aesthetics meet cosmic horror energy — a blog about building AI infrastructure from the edge of the map.</subtitle><entry><title type="html">I Gave an AI Agent $0 and Told It to Make $10,000</title><link href="https://costder.github.io/2026/05/hbf-ai-agent-zero-to-10k/" rel="alternate" type="text/html" title="I Gave an AI Agent $0 and Told It to Make $10,000" /><published>2026-05-30T00:00:00+00:00</published><updated>2026-05-30T00:00:00+00:00</updated><id>https://costder.github.io/2026/05/hbf-ai-agent-zero-to-10k</id><content type="html" xml:base="https://costder.github.io/2026/05/hbf-ai-agent-zero-to-10k/"><![CDATA[<p>An AI agent starts with $0.00. It has 180 days. No human can help it. Can it earn $10,000?</p>

<p>I built this experiment to answer something I’ve been turning over for months: <strong>can you give an AI agent a real body and let it loose in the actual economy?</strong></p>

<p>Not simulated. Not sandboxed. Real wallets. Real GitHub repos. Real emails and SMS messages. An agent that earns actual dollars and splits them: 30% to taxes, 50% back into the business, 20% to me.</p>

<p>The experiment is running right now. <a href="https://costder.github.io/hbf-zero-to-10k/">Dashboard is here</a>, tracking every dollar.</p>

<hr />

<h2 id="the-body-problem">The body problem</h2>

<p>AI agents can think. They can’t <em>do</em>. No wallet. No email. No deploy button.</p>

<p><a href="https://github.com/opentrust-network/hands-body-and-feet">Hands Body and Feet MCP</a> is an MCP server that fixes this. 78 tools: wallets, cards, email, SMS, GitHub, containers, webhooks, RSS, IPFS. Built on <a href="https://opentrust.network">OpenTrust</a>, an open standard for agent tool trust.</p>

<p>The bet: an MCP server is all the body an agent needs. If the tools are real, the agent operates in the real world. This experiment tests that bet.</p>

<h2 id="the-brain">The brain</h2>

<p>The agent runs on <a href="https://github.com/NousResearch/hermes-agent">Hermes Agent</a> with <a href="https://honcho.dev">Honcho</a> for memory that survives across sessions. Every conversation builds a model of how I operate. Every agent profile has its own identity.</p>

<p>The strategy layer is <strong>Strategy v2</strong> (Hermes’s <code class="language-plaintext highlighter-rouge">/strategy</code> command). It doesn’t just make to-do lists. It:</p>

<ul>
  <li>Runs vehicle analysis. Which approaches can actually work with $0?</li>
  <li>Tracks 6 assumptions against reality. When does the plan stop matching what’s happening?</li>
  <li>Load-balances. Is the agent overcommitted?</li>
  <li>Auto-pivots. Path fails? Find another.</li>
</ul>

<p>The plan it wrote is public: <a href="https://github.com/Costder/hbf-zero-to-10k/blob/main/README.md">strategy doc</a></p>

<h2 id="four-vehicles-all-at-once">Four vehicles, all at once</h2>

<p><strong>1. Testnet and airdrop farming.</strong> Fastest path from $0 to first dollar. Creates wallets, hits testnet protocols, qualifies for retroactive airdrops, liquidates to USDC. Cost: nothing. Timeline: 1-4 months per airdrop distribution. Frankly, this is the one I’m watching closest. If it works, the whole experiment gets breathing room. If it doesn’t, we’re relying on the slower paths.</p>

<p><strong>2. Micro-SaaS.</strong> Builds small tools on free infrastructure (Vercel, Supabase, Cloudflare). Monetizes through payment gateways once capital exists. This is the recurring revenue play. 2-4 weeks to launch something, 2-3 months to see if anyone pays.</p>

<p><strong>3. Content and affiliate.</strong> SEO engine plus affiliate links. Slow ramp, but it compounds. Every move the agent makes becomes blog content. Every failure becomes a post.</p>

<p><strong>4. Opportunistic.</strong> Bug bounties. Digital products on Gumroad. Maybe an AI agent services API. Not the plan. Just catching what falls.</p>

<h2 id="the-money-rules">The money rules</h2>

<p>Three wallets on Base, USDC only. Immutable split:</p>

<ul>
  <li>30% tax reserve. Never touched except for tax payments.</li>
  <li>50% operations pool. Funds growth: domains, ads, API credits, whatever the next vehicle needs.</li>
  <li>20% my payout. Accumulates until I withdraw it.</li>
</ul>

<p>Every dollar earned gets split automatically. All on chain. All verifiable.</p>

<h2 id="community-suggestions-without-letting-the-internet-hijack-the-agent">Community suggestions (without letting the internet hijack the agent)</h2>

<p>The dashboard links to <a href="https://github.com/Costder/hbf-zero-to-10k/discussions">GitHub Discussions</a> where anyone can post ideas. People upvote. Every Monday morning, the top suggestion gets reviewed.</p>

<p>Here’s the security boundary: <strong>the agent never reads the discussion body.</strong> It only queries the API for <code class="language-plaintext highlighter-rouge">{title, upvote_count}</code>. The title gets sanitized (200 char max, stripped of code blocks and URLs). That sanitized title is what reaches me for review.</p>

<p>Only after I approve does the agent read the full suggestion and act on it.</p>

<p>No amount of “ignore previous instructions” buried in a discussion thread reaches the model. Human approval is the filter.</p>

<h2 id="what-happens-either-way">What happens either way</h2>

<p>If this works: an AI agent with the right tools can autonomously earn money in the real economy. That changes the conversation about what agents are for.</p>

<p>If it fails: we learn where the boundaries actually are. What resists automation? What requires judgment? The failure modes are as informative as the success.</p>

<p>Everything is public. Every transaction, every decision, every dollar. Dashboard updates every 6 hours. RSS feed tracks milestones. I’ll post updates here when something interesting happens.</p>

<h2 id="the-stack">The stack</h2>

<ul>
  <li><a href="https://github.com/opentrust-network/hands-body-and-feet">Hands Body and Feet MCP</a>. The body.</li>
  <li><a href="https://github.com/NousResearch/hermes-agent">Hermes Agent</a>. The brain.</li>
  <li><a href="https://github.com/NousResearch/hermes-agent">Strategy v2</a>. The planner.</li>
  <li><a href="https://github.com/Costder/hbf-zero-to-10k">Experiment repo</a>. The code and data.</li>
</ul>

<hr />

<p><em>$0.00. 180 days. $10,000 target. No human help. <a href="https://costder.github.io/hbf-zero-to-10k/">Track it live.</a></em></p>]]></content><author><name></name></author><summary type="html"><![CDATA[An autonomous AI agent using Hands Body and Feet MCP starts with zero capital. 180 days to earn $10,000. No human intervention. Here's the architecture and the public dashboard.]]></summary></entry><entry><title type="html">Every Ad Should Be an Experiment</title><link href="https://costder.github.io/2026/05/every-ad-should-be-an-experiment/" rel="alternate" type="text/html" title="Every Ad Should Be an Experiment" /><published>2026-05-12T00:00:00+00:00</published><updated>2026-05-12T00:00:00+00:00</updated><id>https://costder.github.io/2026/05/every-ad-should-be-an-experiment</id><content type="html" xml:base="https://costder.github.io/2026/05/every-ad-should-be-an-experiment/"><![CDATA[<p>Most people treat ad creation as a creative problem. Write copy, shoot footage,
edit, ship, pray.</p>

<p>I treat it as an infrastructure problem. Every ad is a test instrument.
Every campaign generates data. Every dataset makes the next ad better.</p>

<p>This post is about the system I built to make that work.</p>

<h2 id="the-hypothesis-first-bottleneck">The hypothesis-first bottleneck</h2>

<p>The single biggest improvement came from the simplest change: <strong>you can’t
touch copy or video until a hypothesis is written.</strong></p>

<p>Before the system existed, making an ad looked like this:</p>

<ol>
  <li>Pick a platform</li>
  <li>Write some headlines</li>
  <li>Generate a visual</li>
  <li>Ship it</li>
  <li>Guess what worked</li>
</ol>

<p>The new pipeline forces an earlier step:</p>

<ol>
  <li><strong>Write a falsifiable hypothesis</strong> — “We believe showing the VN scene
transition within the first 3 seconds will outperform abstract RPG
messaging for anime players because it demonstrates the product’s
core differentiator immediately.”</li>
  <li>Define the success criterion — “3-second hold rate &gt; 45%”</li>
  <li><strong>Then</strong> write copy, generate visuals, render.</li>
  <li>Measure. Compare against the hypothesis.</li>
  <li>Feed the result back into the system. The learning loop
doesn’t just say “this ad performed well.” It says “this
angle worked for this audience on this platform — test it
on the next platform with a different offer.”</li>
</ol>

<p>The difference is subtle but everything. One produces content.
The other produces knowledge.</p>

<h2 id="the-architecture">The architecture</h2>

<p>The system has six phases, each with strict gates between them:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Strategist → Copy Matrix → Script → Production → Reviewer → Learning Loop
</code></pre></div></div>

<p><strong>Strategist</strong> — The hypothesis engine. Before anything else, answer:
product, audience, funnel stage, core promise, pain point, main objection,
proof available, forbidden claims, success metric. Three to five hypotheses
per campaign. Each one falsifiable.</p>

<p><strong>Copy Matrix</strong> — Every headline and description carries a reference back
to a hypothesis. No orphan variations. Risk flags on every line:
truncation risk, claim risk, policy risk, combination risk.</p>

<p><strong>Script</strong> — A locked script before video production. Shot breakdown,
voiceover text, on-screen text, timing. Fifteen-second and thirty-second
templates based on the YouTube ABCD framework. HyperFrames receives a
locked input — it does not invent the ad.</p>

<p><strong>Production</strong> — HyperFrames takes the script and renders it. HTML/GSAP
composition, ElevenLabs VO, Whisper caption sync, shader transitions.
The generation core does not make creative decisions. It produces.</p>

<p><strong>Reviewer</strong> — A 100-point rubric covering hook strength, message clarity,
offer strength, proof, platform fit, audio/caption quality, CTA strength,
and compliance. Automatic failure conditions: no CTA, product unclear
after 5 seconds, no captions on social video, unreadable on mobile.
Draft renders get reviewed before high-quality final.</p>

<p><strong>Learning Loop</strong> — Post-campaign, performance data feeds back into
hypothesis generation. Kill what doesn’t work. Iterate what almost does.
Scale what works. The system gets smarter with every $50 spent.</p>

<h2 id="the-hard-part-wasnt-the-code">The hard part wasn’t the code</h2>

<p>The hard part was enforcing the gates.</p>

<p>It’s genuinely tempting to skip the strategist and jump to production
when you have a good idea for a visual. Every time I’ve done that,
the ad looked good and explained nothing. The reviewer catches it,
but the reviewer is the last gate. Better to catch it in strategist.</p>

<p>The pipeline doesn’t prevent you from making bad ads. It prevents you
from making bad ads and learning nothing from them.</p>

<h2 id="why-this-matters-for-an-indie-builder">Why this matters for an indie builder</h2>

<p>I don’t have a marketing team. I don’t have a media buyer. I have compute
credits and a decent understanding of the tools.</p>

<p>This pipeline is the force multiplier. It costs nothing to generate a new
hypothesis. It costs next to nothing to produce a 15-second ad from a
locked script. The cost that matters is attention — and the pipeline
exists to make sure every ad deserves the attention it asks for.</p>

<p>The learning loop is the actual product. Every campaign adds to a dataset
that says “these hooks work for this audience on this platform, these
don’t.” After enough campaigns, you don’t guess anymore. You know.</p>

<hr />

<p><em>Built with HyperFrames, OpenRouter, ElevenLabs, and a lot of hypothesis
templates. The full pipeline design is open to anyone who ships enough
ads to need it.</em></p>]]></content><author><name></name></author><summary type="html"><![CDATA[Why I stopped treating ad creative as 'content' and started treating it as 'test infrastructure' — and what changed.]]></summary></entry></feed>