Blog

Engineering notes on AI agents, automation, and the infrastructure behind them.

Pinecone vs RunPod for Vector Search: Managed vs Self-Hosted (2026)

May 9, 2026 · 10 min read
Pinecone vs RunPod for vector search: managed serverless against self-hosted Qdrant on rented GPU. Break-even math, latency, and the hybrid most teams ship.
Pinecone vs RunPod: Vector DB vs GPU Host (You Probably Need Both)

May 8, 2026 · 5 min read
Pinecone is a vector database. RunPod is a GPU host. They solve different problems. Here's what you actually compared and what your stack needs.
Automate YouTube Shorts with CapCut: The CLI + Claude Pipeline

May 7, 2026 · 7 min read
Automate YouTube Shorts end-to-end: pick segments, write hooks with Claude, build CapCut drafts via CLI. Open-source pipeline + my paid blueprint.
Claude Code with Local LLMs and ANTHROPIC_BASE_URL: Ollama, LM Studio, llama.cpp, vLLM

April 29, 2026 · 16 min read
Run Claude Code on a local LLM via ANTHROPIC_BASE_URL. Native Anthropic endpoints for Ollama, LM Studio, llama.cpp, vLLM. 32K context floor.
How to Choose an LLM for Production: 7 Criteria That Matter

April 17, 2026 · 13 min read
How to choose an LLM for production workloads. 7 selection criteria, a decision tree, an evaluation process, and a requirements checklist from real deployments. Download the free AI Automation Checklist.
Self-Hosted LLM vs API Cost: Break-Even Analysis (2026)

April 16, 2026 · 15 min read
Self-hosted LLM vs API cost analysis with break-even math. When to self-host, when to stay on Claude, and the hybrid pattern most production teams actually use. Download the free AI Automation Checklist.
LLM API Comparison 2026: Best API for Production

April 15, 2026 · 18 min read
An opinionated LLM API comparison for production. Claude vs GPT vs Gemini vs Mistral vs DeepSeek on features, developer experience, reliability, and fit. Download the free AI Automation Checklist.
Zapier vs Make vs n8n Pricing at Scale (2026)

April 14, 2026 · 11 min read
Automation platform pricing, decoded: Zapier task pricing, n8n execution pricing, and Make operation pricing side by side at realistic volumes. Download the free AI Automation Checklist.
Claude vs ChatGPT for Developers: A 2026 Practitioner Review

April 12, 2026 · 17 min read
Claude vs ChatGPT for developers in 2026. Chat, CLI, IDE, and API compared by a practitioner running ten agents in production. Download the free AI Automation Checklist.
LLM API Cost Comparison 2026: Framework, Not a Stale Table

April 11, 2026 · 12 min read
LLM API cost comparison for 2026. Model your real workload costs with prompt caching, output tokens, reasoning, and batch API factored in. Download the free AI Automation Checklist.
Make.com vs n8n Comparison 2026: Cost, Reliability, AI Agents

April 11, 2026 · 6 min read
Make.com vs n8n comparison 2026. Cost ($20 vs $500/mo), data residency, error handling, AI agents. Real numbers from production deployments.
Claude API vs OpenAI API in 2026: Which One Ships Faster?

April 11, 2026 · 6 min read
Claude vs OpenAI in 2026. Honest comparison from someone shipping on both. Tool use, pricing, compliance, and which breaks under production load. Download the free AI Automation Checklist.