Overhaul of Agent Gateway supporting A2A, MCP, and Kubernetes Gateway API

Today, we’re excited to share the next major milestone: Agent Gateway is now a full-featured, AI-native gateway that combines deep MCP and A2A protocol awareness, robust traffic policy controls, inference gateway support, Kubernetes Gateway API support, and unified access to major LLMs, all purpose-built with Rust for real-world agentic systems.

Back when we first introduced Agent Gateway, it was designed to fill a critical gap in the AI stack: enabling structured, secure, and scalable communication between agents, tools, and LLMs using protocols like MCP and A2A. We wrote about this in our last blog post where we explained why traditional API gateways fall short in agentic environments. Since then, the project has grown tremendously.

And notably, Agent Gateway now fully supports:

  • Deep protocol awareness for MCP and A2A
  • The Kubernetes Gateway API
  • Fine-grained traffic policy controls
  • A unified LLM gateway
  • Inference routing

Let’s take a closer look at the updated capabilities.

Deep Protocol Awareness: MCP and A2A

Agent Gateway continues to deepen its native support for the emerging Model Context Protocol (MCP) and Agent-to-Agent Protocol (A2A) with updated support including:

  • Protocol-aware routing, telemetry, and tracing
  • Support for MCP server/tool aggregation (virtualized MCP servers)
  • Updated MCP Authorization support per the June 2025 spec
  • Native authorization policy engine using Cedar for fine-grained authorizations
  • Support for exposing local stdio-based MCP servers as remote targets

Agent Gateway can be used to implement authentication and authorization for your MCP servers with minimal setup. We will keep Agent Gateway updated as these specs continue to evolve.
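As a sketch of the stdio-to-remote bridging above, a gateway config might expose a local stdio MCP server as a remote target like this (the field names here are illustrative assumptions, not the exact schema, and `@modelcontextprotocol/server-everything` is just an example server package):

```yaml
# Illustrative config: expose a local stdio-based MCP server as a
# remote MCP target reachable through the gateway. Field names are
# assumptions and may differ from the current schema.
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - mcp:
          targets:
          - name: everything
            stdio:
              cmd: npx
              args: ["@modelcontextprotocol/server-everything"]
```

Clients then speak remote MCP to the gateway, which proxies to the local process over stdio.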

Kubernetes Gateway API Support

Agent Gateway now implements the Kubernetes Gateway API with support for HTTPRoute, GRPCRoute, TCPRoute, and TLSRoute. This isn’t just basic support: we’ve implemented all core and extended features, and most experimental ones, too. This means you can:

  • Match traffic based on headers, paths, and query params
  • Apply CORS, redirects, header rewrites, request mirroring
  • Configure timeouts, retries, and direct responses
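For example, a standard Gateway API HTTPRoute combining header/path matching with a per-rule timeout looks like the following (resource names like `agentgateway` and `agent-service` are placeholders):

```yaml
# Kubernetes Gateway API HTTPRoute: match on path prefix plus a header,
# apply a request timeout, and forward to a backend Service.
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: agent-route
spec:
  parentRefs:
  - name: agentgateway
  rules:
  - matches:
    - path:
        type: PathPrefix
        value: /agents
      headers:
      - name: x-team
        value: platform
    timeouts:
      request: 10s
    backendRefs:
    - name: agent-service
      port: 8080
```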

For platform teams already using Kubernetes-native tooling, this makes Agent Gateway a seamless drop-in with advanced L7 control. It’s a critical step toward integrating agentic workloads into the broader platform ecosystem.

Fine-Grained Traffic Policy Controls

In real enterprise deployments, security and reliability aren't optional. That’s why Agent Gateway now includes advanced policy features:

  • Local and remote rate limiting
  • JWT authentication and external authorization (i.e., extAuthz) hooks
  • Upstream auth (cloud identity, TLS, etc.)
  • Full OpenTelemetry support for metrics, logs, and distributed tracing
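A per-route policy combining two of the features above might be sketched like this (the structure and field names are illustrative assumptions, and the issuer/JWKS URLs are placeholders):

```yaml
# Illustrative per-route policy sketch: local rate limiting plus
# JWT authentication. Not the exact schema.
routes:
- policies:
    localRateLimit:
    - maxTokens: 100
      tokensPerFill: 10
      fillInterval: 1s
    jwtAuth:
      issuer: https://auth.example.com
      jwks:
        url: https://auth.example.com/.well-known/jwks.json
```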

These controls allow you to run production-grade agent infrastructure that meets enterprise security and observability requirements.

LLM Gateway

Agent Gateway can now be used as an “AI Gateway,” routing traffic directly to LLMs (OpenAI, Anthropic, Gemini, Bedrock, etc.). The top use cases here are a unified LLM API, prompt guarding, and resilience (failover, rate limiting, etc.).

Agent Gateway now provides a unified OpenAI-compatible API across:

  • OpenAI / Azure OpenAI
  • Anthropic
  • Google Gemini
  • Amazon Bedrock
  • Google Vertex

This enables users to seamlessly move between providers without changes to their application, even dynamically based on the health or performance of a specific provider.
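As a sketch (field names are assumptions, and the model name is just an example), pointing a route at a hosted provider behind the gateway’s OpenAI-compatible API might look like:

```yaml
# Illustrative config: a route whose backend is a hosted LLM provider.
# Clients call the gateway's OpenAI-compatible endpoint; swapping the
# provider block switches vendors without client changes.
binds:
- port: 3000
  listeners:
  - routes:
    - backends:
      - ai:
          provider:
            openAI:
              model: gpt-4o-mini
```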

Agent Gateway can also be used to implement prompt guarding, scanning and filtering prompts for sensitive information or outright direct prompt injection attacks:

  • Regex-based filters to block prompts with known unsafe patterns
  • Optional webhook integration to run pre-flight validation using custom logic
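A regex-based guard like the first bullet might be configured roughly as follows (the schema is a sketch; the pattern shown is a naive credit-card-shaped example, not a production-grade detector):

```yaml
# Illustrative prompt-guard sketch: reject requests whose prompts
# match known-unsafe patterns. Field names are assumptions.
promptGuard:
  request:
    regex:
      action: reject
      rules:
      - name: credit-card
        pattern: "[0-9]{4}-[0-9]{4}-[0-9]{4}-[0-9]{4}"
```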

Combined with the fine-grained traffic policy controls, Agent Gateway can act as a powerful LLM Gateway.

Inference Routing

If you are running your own models on self-hosted GPU infrastructure, Agent Gateway now implements the Inference Gateway extensions for more accurate, efficient prompt routing. Using the InferencePool API, you can route based on:

  • Prompt criticality
  • GPU and KV cache utilization
  • Work queue / waiting queue depth
  • LoRA adapters
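An InferencePool groups model-serving endpoints behind an endpoint-picker extension that applies signals like the ones above. A minimal example might look like this (resource names are placeholders, and fields follow the `v1alpha2` API as we understand it; check the current spec before use):

```yaml
# Gateway API Inference Extension: an InferencePool selecting
# self-hosted model server pods, with an endpoint-picker extension
# that makes the per-request routing decision.
apiVersion: inference.networking.x-k8s.io/v1alpha2
kind: InferencePool
metadata:
  name: vllm-llama
spec:
  targetPortNumber: 8000
  selector:
    app: vllm-llama
  extensionRef:
    name: endpoint-picker
```

A Gateway API route can then reference the InferencePool as a backend instead of a plain Service.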

This gives AI platform operators more efficient and cost-conscious routing. This can also form the foundation for deeper optimizations like llm-d.

Get Involved

We’re thrilled to see the community interest grow around this project. If you’re building agent infrastructure, dealing with LLM routing, or trying to enforce policy in AI-native environments, we’d love your feedback and contributions.

Agent Gateway has evolved far beyond just a proxy. It’s becoming the control point for secure, scalable agentic systems. We’re just getting started.
