Version: 0.2

Connect Azure OpenAI

This guide will help you configure Envoy AI Gateway to work with Azure OpenAI's foundation models.

There are two ways to do the Azure OpenAI authentication: Microsoft Entra ID and API Key.

We will use Microsoft Entra ID to authenticate an application to use the Azure OpenAI service. You can obtain an access token using the OAuth 2.0 client credentials grant flow. This process involves registering the application in Microsoft Entra ID (formerly Azure Active Directory), configuring the appropriate permissions, and acquiring a token from the Microsoft identity platform. The access token is then used as proof of authorization in API requests to the Azure OpenAI endpoint.

For detailed steps, refer to the official Microsoft documentation.

API Key authentication is not supported yet.

Prerequisites

Before you begin, you'll need:

Azure credentials with access to OpenAI service.
Basic setup completed from the Basic Usage guide
Basic configuration removed as described in the Advanced Configuration overview

Azure Credential Setup

An Azure account with OpenAI service access enabled
Your Azure tenant ID, client ID, and client secret key
Enabled model access to "GPT-4o"

Configuration Steps

1. Configure Azure Credentials

Edit the basic.yaml file to replace these placeholder values:

AZURE_TENANT_ID: Your Azure tenant ID
AZURE_CLIENT_ID: Your Azure client ID
AZURE_CLIENT_SECRET: Your Azure client secret

Security Note

Keep your Azure credentials secure and never commit them to version control. The credentials will be stored in Kubernetes secrets.

2. Apply Configuration

Apply the updated configuration and wait for the Gateway pod to be ready. If you already have a Gateway running, then the secret credential update will be picked up automatically in a few seconds.

kubectl apply -f basic.yaml

kubectl wait pods --timeout=2m \
  -l gateway.envoyproxy.io/owning-gateway-name=envoy-ai-gateway-basic \
  -n envoy-gateway-system \
  --for=condition=Ready

3. Test the Configuration

You should have set $GATEWAY_URL as part of the basic setup before connecting to providers. See the Basic Usage page for instructions.

curl -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [
      {
        "role": "user",
        "content": "Hi."
      }
    ]
  }' \
  $GATEWAY_URL/v1/chat/completions

Troubleshoot

If you encounter issues:

Verify your Azure credentials are correct and active
Check pod status

kubectl get pods -n envoy-gateway-system

View controller logs:

kubectl logs -n envoy-ai-gateway-system deployment/ai-gateway-controller

Common errors:
- 401/403: Invalid credentials or insufficient permissions
- 404: Model not found or not available in region
- 429: Rate limit exceeded

Configuring More Models

To use more models, add more AIGatewayRouteRules to the basic.yaml file with the model ID in the value field. For example, to use GPT-4.5 Preview

apiVersion: aigateway.envoyproxy.io/v1alpha1
kind: AIGatewayRoute
metadata:
  name: envoy-ai-gateway-basic
  namespace: default
spec:
  schema:
    name: OpenAI
  targetRefs:
    - name: envoy-ai-gateway-basic
      kind: Gateway
      group: gateway.networking.k8s.io
  rules:
    - matches:
        - headers:
            - type: Exact
              name: x-ai-eg-model
              value: gpt-4.5-preview
      backendRefs:
        - name: envoy-ai-gateway-basic-aws

Prerequisites​

Azure Credential Setup​

Configuration Steps​

1. Configure Azure Credentials​

2. Apply Configuration​

3. Test the Configuration​

Troubleshoot​

Configuring More Models​