What are these guides for?

Decision frameworks and comparison pieces for teams choosing an AI automation stack. Each guide leads with a verdict, then shows the numbers, tradeoffs, and the edge cases that change the answer.

Are these guides German or English?

Both. Core comparison guides ship in English; DACH-specific guides (KI-Automatisierung im Mittelstand, Make vs n8n Vergleich, freelance-ai-engineer-beauftragen) ship in German.

How current are the numbers?

Prices, limits, and model specs are verified against vendor docs at publication and refreshed quarterly. Every guide footer notes its last-verified date. Flag anything stale and I will refresh it.

Do the guides recommend specific vendors?

Yes. Each guide names a default choice for the most common case and explains when to deviate. No vendor is sponsored. Recommendations reflect what I actually ship for DACH clients.

Blog

Engineering notes on AI agents, automation, and the infrastructure behind them.

Claude API Pricing Tiers and Cost Optimization Playbook (2026)

July 5, 2026 · 12 min read

Claude API and Fable 5 pricing, plus every lever to cut Anthropic API cost without losing quality: routing, caching, batching, effort tuning.

Claude Fable 5 Cost: What It Actually Costs and How to Control It (2026)

July 5, 2026 · 6 min read

Claude Fable 5 costs $10/$50 per million tokens, twice the price of Opus 4.8. What drives the bill, when it pays for itself, and how to govern the spend.

claude-fable-5 pricing cost-optimization anthropic llm-apis

Voice AI Agents for Small Business: What Actually Ships in 2026

June 7, 2026 · 4 min read

The voice AI that pays back for a small business is narrow and boring: after-hours triage, booking, order status. Here is what ships and what to skip.

voice-ai ai-agents smb automation customer-support

Agent Memory From Your Task Manager: CLI + MCP, No Vector DB

May 30, 2026 · 5 min read

Turn your task manager into agent memory: an adapter-not-migration CLI and MCP server with hybrid RRF retrieval over TickTick or an Obsidian vault.

agent-memory mcp claude ticktick rag retrieval

Pinecone vs RunPod for Vector Search: Managed vs Self-Hosted (2026)

May 9, 2026 · 10 min read

Pinecone vs RunPod for vector search: managed serverless against self-hosted Qdrant on rented GPU. Break-even math, latency, and the hybrid most teams ship.

vector-search pinecone runpod qdrant infrastructure rag

Pinecone vs RunPod: They're Not Competitors (2026)

May 8, 2026 · 6 min read

Pinecone and RunPod are not competitors: one is a vector database, the other is GPU compute. What to actually compare, with break-even cost math.

pinecone runpod vector-db gpu-compute rag

Automate YouTube Shorts with CapCut: The CLI + Claude Pipeline

May 7, 2026 · 7 min read

Automate YouTube Shorts end-to-end: pick segments, write hooks with Claude, build CapCut drafts via CLI. Open-source pipeline + my paid blueprint.

Claude Code with Local LLMs and ANTHROPIC_BASE_URL: Ollama, LM Studio, llama.cpp, vLLM

April 29, 2026 · 16 min read

Run Claude Code on a local LLM via ANTHROPIC_BASE_URL. Native Anthropic endpoints for Ollama, LM Studio, llama.cpp, vLLM. 32K context floor.

claude-code local-llm ollama anthropic-api llama-cpp vllm

How to Choose an LLM for Production: 7 Criteria That Matter

April 17, 2026 · 13 min read

How to choose an LLM for production workloads. 7 selection criteria, a decision tree, an evaluation process, and a requirements checklist from real deployments.

llm production decision-guide ai-architecture

Self-Hosted LLM vs API Cost: Break-Even Analysis (2026)

April 16, 2026 · 15 min read

Self-hosted LLM vs API cost analysis with break-even math. When to self-host, when to stay on Claude, and the hybrid pattern most production teams actually use.

llm self-hosted infrastructure cost-optimization

LLM API Comparison 2026: Claude, OpenAI, Gemini for Production

April 15, 2026 · 19 min read

Feature matrix, pricing, reliability and EU hosting across major LLM APIs. Where Anthropic, OpenAI and Google win, and what to pick for production.

llm claude openai gemini mistral comparison

Zapier vs Make vs n8n (2026): Pricing, True Cost at Scale, and What Breaks in Production

April 14, 2026 · 17 min read

Zapier (per-task), Make (per-operation), n8n (per-execution) at 10k / 100k / 1M runs, plus what breaks first in production and when to self-host.

automation zapier make n8n pricing production