
Gateway API

The AgentZero gateway exposes a localhost HTTP API for programmatic access to the agent runtime.

# Start the gateway with defaults
agentzero gateway
# Bind to an explicit host and port
agentzero gateway --host 127.0.0.1 --port 8080
# Generate a new pairing code
agentzero gateway --new-pairing
| Method | Path | Auth | Description |
| --- | --- | --- | --- |
| GET | / | None | Dashboard HTML page |
| GET | /health | None | Service health probe |
| GET | /metrics | None | Prometheus-compatible metrics |
| POST | /pair | Pairing code | Exchange pairing code for bearer token |
| POST | /v1/ping | Bearer | Echo test endpoint |
| POST | /v1/webhook/:channel | Bearer | Channel message dispatch |
| POST | /api/chat | Bearer | Chat with agent (JSON response) |
| POST | /v1/chat/completions | Bearer | OpenAI-compatible chat completions (supports SSE streaming) |
| GET | /v1/models | Bearer | List available models (OpenAI-compatible) |
| GET | /ws/chat | Bearer | WebSocket chat with streaming agent responses |
| POST | /webhook | Bearer | Legacy webhook endpoint |

Authentication

The gateway authenticates clients with a pairing flow:

  1. On first start, the gateway prints a one-time pairing code to the terminal
  2. POST the pairing code to /pair to get a bearer token
  3. Use the bearer token in subsequent requests
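The three steps above can be sketched in Python with the standard library. The `X-Pairing-Code` header matches the curl example below; the `token` field name in the `/pair` response is an assumption, so check your gateway's actual response shape before relying on it.

```python
import json
import urllib.request

BASE = "http://127.0.0.1:42617"  # default gateway address used in this doc

def pair(code: str) -> str:
    """Step 2: exchange the one-time pairing code for a bearer token.

    Assumes the /pair response is JSON with a "token" field.
    """
    req = urllib.request.Request(
        f"{BASE}/pair",
        method="POST",
        headers={"X-Pairing-Code": code},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["token"]

def auth_headers(token: str) -> dict:
    """Step 3: headers for subsequent authenticated requests."""
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
```

`pair()` is called once per installation; the returned token is then reused across requests via `auth_headers()`.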
# Health check (no auth required)
curl http://127.0.0.1:42617/health
{ "status": "ok" }
# Exchange pairing code for token
curl -X POST http://127.0.0.1:42617/pair \
-H "X-Pairing-Code: <code-from-terminal>"
# Authenticated request
curl -X POST http://127.0.0.1:42617/v1/ping \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json"

Chat (POST /api/chat)

Send a message to the agent and receive a complete JSON response.

curl -X POST http://127.0.0.1:42617/api/chat \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json" \
-d '{"message": "What is the weather?", "context": ""}'
{ "message": "I can help with that...", "tokens_used_estimate": 42 }

Returns 503 Service Unavailable if the gateway was started without agent configuration.

OpenAI-Compatible Completions (POST /v1/chat/completions)


Accepts the standard OpenAI chat completions format. Set stream: true for SSE streaming.

# Non-streaming
curl -X POST http://127.0.0.1:42617/v1/chat/completions \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json" \
-d '{"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "hello"}]}'
# Streaming (SSE)
curl -X POST http://127.0.0.1:42617/v1/chat/completions \
-H "Authorization: Bearer <token>" \
-H "Content-Type: application/json" \
-d '{"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "hello"}], "stream": true}'

SSE events follow the OpenAI format:

data: {"id":"chatcmpl-...","choices":[{"index":0,"delta":{"content":"token"},"finish_reason":null}]}

data: [DONE]

The model field is passed through to the agent, allowing model override per request.
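A client can consume the stream above by filtering `data:` lines and stopping at the `[DONE]` sentinel. A minimal Python sketch (the parser name is illustrative):

```python
import json

def iter_deltas(sse_lines):
    """Yield content tokens from OpenAI-style SSE 'data:' lines.

    Stops at the [DONE] sentinel; skips blank keep-alive and
    comment lines that are not data events.
    """
    for line in sse_lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            return
        chunk = json.loads(payload)
        delta = chunk["choices"][0]["delta"]
        # The final chunk's delta may omit "content" (finish_reason set).
        if "content" in delta:
            yield delta["content"]
```

Joining the yielded tokens reconstructs the full assistant message.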

WebSocket Chat (GET /ws/chat)

Upgrade to a WebSocket connection for bidirectional streaming chat. Send a text message and receive streaming delta frames:

// Incoming delta
{"type": "delta", "delta": "partial response text"}
// Stream complete
{"type": "done"}
// Error
{"type": "error", "message": "description"}
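A client-side handler for these three frame types might look like the following Python sketch (the function name and the choice to raise on error frames are illustrative):

```python
import json

def handle_frame(raw: str, out: list) -> bool:
    """Process one incoming WebSocket frame.

    Appends delta text to `out`, returns False when the stream is
    done, and raises on an error frame. Frame shapes mirror the
    delta/done/error messages shown above.
    """
    frame = json.loads(raw)
    kind = frame.get("type")
    if kind == "delta":
        out.append(frame["delta"])
        return True
    if kind == "done":
        return False
    if kind == "error":
        raise RuntimeError(frame.get("message", "unknown gateway error"))
    return True  # ignore unrecognized frame types
```

A receive loop would call `handle_frame` on each text message until it returns False, then join `out` into the full response.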

List Models (GET /v1/models)

List available models in OpenAI-compatible format.

curl http://127.0.0.1:42617/v1/models \
-H "Authorization: Bearer <token>"

Middleware

The gateway includes built-in middleware for production hardening:

Rate Limiting — Sliding window counter that rejects excess requests with 429 Too Many Requests.

Request Size Limits — Rejects requests with Content-Length exceeding the configured maximum (default: 10 MB) with 413 Payload Too Large.

CORS — Configurable origin allowlist for browser clients. Supports exact origin matching and wildcard (*). Handles preflight OPTIONS requests automatically.

Graceful Shutdown — On SIGTERM or SIGINT, the gateway drains active connections before exiting.

Configuration

[gateway]
host = "127.0.0.1" # bind interface
port = 42617 # bind port
require_pairing = true # require OTP pairing
allow_public_bind = false # allow non-loopback bind
[gateway.node_control]
enabled = false
# auth_token = "your-token"
allowed_node_ids = []

Daemon Mode

Run the gateway as a background process with automatic local AI service discovery, PID file management, and log rotation:

# Start in background
agentzero daemon start --host 127.0.0.1 --port 8080
# Check status (includes PID, uptime, address)
agentzero daemon status
agentzero daemon status --json
# Stop the daemon
agentzero daemon stop
# Run in foreground (for debugging or systemd)
agentzero daemon start --foreground

Daemon logs are written to {data_dir}/daemon.log with automatic rotation (10 MB max, 5 rotated files).

System Service

Install as a system service for automatic startup:

# Auto-detect init system (systemd or openrc)
agentzero service install
# Explicit init system
agentzero service --service-init systemd install
agentzero service --service-init openrc install
# Lifecycle
agentzero service start
agentzero service status
agentzero service restart
agentzero service stop
agentzero service uninstall

With systemd, the service installs as a user-level unit (no root required); with OpenRC, it installs system-wide (requires sudo).