Posts

Showing posts from May, 2026

Optimizing API Costs: Strategic Use of Claude and Open Source Models

The first time I really paid attention to my AI API bill, I'd been running the ACP Agent for about two weeks. The number wasn't catastrophic, but it was on a trajectory that would've been catastrophic by month-end if I hadn't noticed. The problem wasn't the project. The problem was that I'd been using the most expensive model for every task — including tasks a much cheaper model could handle perfectly. Categorizing logs, formatting strings, summarizing structured data: all of these were running through the premium tier when they didn't need to. API costs are the silent leak in AI-native development. They don't break anything, they don't show up as errors, and they only become a problem once they're already a problem. What This Post Covers The strategies I use to keep AI API costs predictable across production projects: matching models to tasks, designing prompts that don't waste tokens, caching aggressively where it makes sense, and...

x402 Protocol — Episode 9: Two Real Users Found My Korean Crypto API in 24 Hours

For weeks, every IP that hit my API belonged to a data center scanner. DigitalOcean, Render, Hetzner. Cataloging bots checking that my endpoints existed. Indexers verifying schemas. Useful, but not customers. Then within 24 hours, two real users showed up. Both running on cloud infrastructure, both paying USDC per call, both behaving like actual production systems. One in Singapore. One in Ohio. Neither knew the other existed. Both arrived in the same week AWS Bedrock launched their x402 reference architecture. Total revenue from real users to date: $0.41 USDC . That number sounds like nothing. It might be the most meaningful number this project has produced. What This Post Covers The first month of real x402 traffic against KR Crypto Intelligence, the moment indexers gave way to actual users, what those users' behavior tells us about how AI agents discover and evaluate APIs, and why $0.41 in revenue is the first real signal that the unit economics of agent-to-API payme...