LLM Gateways

Filed under: Cloud Engineering · AI Infrastructure · Local Lab Where Part 1 left off In Part 1 we got Bifrost running locally, wired up Ollama with qwen3.5, and confirmed the stack end to end. Requests through the gateway, streaming, tool calling. This post adds MCP, the Model Context Protocol. Part 1 gave the model a reliable connection. This part gives it tools. By the end you’ll have a local MCP server exposing real capabilities (system info, allowlisted shell commands, math) connected through Bifrost so qwen3.5 can run them. Still no cloud. ...

LLM Gateways

The API in Front of the AI: Part 2

The API in Front of the AI