- Cortex: the engine behind the chat window
7 May 2026
AI has completely changed how I work as an engineer, and we are never going back. This is a look behind the chat window at the platform that made the change possible — two NVIDIA DGX Sparks running vLLM, an OpenAI-compatible proxy with load balancing and full tracing, a vector store with two years of institutional memory, and a fleet of MCP servers wired into the real systems we run on. The model is the smallest piece. Everything else is what makes it useful.