Blog
Engineering notes on AI agents, automation, and the infrastructure behind them.

June 7, 2026 · 4 min read
The voice AI that pays back for a small business is narrow and boring: after-hours triage, booking, order status. Here is what ships and what to skip.
May 30, 2026 · 5 min read
Turn your task manager into agent memory: an adapter-not-migration CLI and MCP server with hybrid RRF retrieval over TickTick or an Obsidian vault.

May 9, 2026 · 10 min read
Pinecone vs RunPod for vector search: managed serverless against self-hosted Qdrant on rented GPU. Break-even math, latency, and the hybrid most teams ship.

May 8, 2026 · 5 min read
Pinecone and RunPod do not compete: vector DB vs GPU inference. Break-even math, real monthly costs, and a decision table for your RAG stack.

May 7, 2026 · 7 min read
Automate YouTube Shorts end-to-end: pick segments, write hooks with Claude, build CapCut drafts via CLI. Open-source pipeline + my paid blueprint.

April 29, 2026 · 16 min read
Run Claude Code on a local LLM via ANTHROPIC_BASE_URL. Native Anthropic endpoints for Ollama, LM Studio, llama.cpp, vLLM. 32K context floor.

April 17, 2026 · 13 min read
How to choose an LLM for production workloads. 7 selection criteria, a decision tree, an evaluation process, and a requirements checklist from real deployments. Download the free AI Automation Checklist.

April 16, 2026 · 15 min read
Self-hosted LLM vs API cost analysis with break-even math. When to self-host, when to stay on Claude, and the hybrid pattern most production teams actually use. Download the free AI Automation Checklist.

April 15, 2026 · 18 min read
Feature matrix, pricing, reliability and EU hosting across major LLM APIs. Where Anthropic, OpenAI and Google win, and what to pick for production.

April 14, 2026 · 11 min read
Zapier (per-task), Make (per-operation), n8n (per-execution) — same workflow at 10k / 100k / 1M runs. Where each wins, where each breaks.

April 12, 2026 · 17 min read
Claude vs ChatGPT for developers in 2026. Chat, CLI, IDE, and API compared by a practitioner running ten agents in production. Download the free AI Automation Checklist.

April 11, 2026 · 12 min read
LLM API cost comparison for 2026. Model your real workload costs with prompt caching, output tokens, reasoning, and batch API factored in. Download the free AI Automation Checklist.