
Adding an API call between my Agent and the LLM seems like it would kill latency. How much overhead does this add vs just managing the list locally?


~20ms overhead. Built on Cloudflare's edge, so it's fast globally. The value isn't speed—it's that you stop rebuilding context infrastructure and get versioning/debugging for free.


i've always wondered (for this, portkey, etc) - why not have a parallel option that fires an extra request instead of MITM-ing the LLM call?


You can fire them in parallel for simple cases, and for single-agent flows that works fine. The problem is multi-agent setups: if context isn't persisted before a sub-agent reads it, the sub-agent acts on stale state. A single source of truth matters when multiple agents are reading and writing the same context.
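
Rough sketch of the difference (the ContextStore interface here is hypothetical, purely to illustrate the ordering, not a real API):

  // Hypothetical context store client; names are illustrative.
  type Message = { role: "user" | "assistant"; content: string };

  interface ContextStore {
    append(sessionId: string, msg: Message): Promise<void>;
    read(sessionId: string): Promise<Message[]>;
  }

  // Single-agent flow: persisting context in parallel with the LLM
  // call is fine, because nothing else reads the context mid-flight.
  async function singleAgent(
    store: ContextStore,
    id: string,
    msg: Message,
    callLLM: (history: Message[]) => Promise<string>,
  ): Promise<string> {
    const persist = store.append(id, msg); // fired in parallel, not awaited yet
    const reply = await callLLM([msg]);    // proceeds without waiting for the write
    await persist;                         // settle the write before returning
    return reply;
  }

  // Multi-agent flow: a sub-agent reads the shared context, so the
  // write must land first or the sub-agent sees stale state.
  async function handOffToSubAgent(
    store: ContextStore,
    id: string,
    msg: Message,
    subAgent: (history: Message[]) => Promise<string>,
  ): Promise<string> {
    await store.append(id, msg);          // persist before anyone else reads
    const history = await store.read(id); // single source of truth
    return subAgent(history);
  }

The single-agent version can overlap the write with the LLM call; the hand-off version has to await the write first, which is essentially the ordering a single source of truth enforces.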



