Observability built into the gateway from the start rather than added as an afterthought means every operational question -- which client is causing the error spike, which endpoint has the highest p99 latency, which backend service is the bottleneck -- has a data-backed answer. Structured access logs for every API request: timestamp (UTC, millisecond precision), client ID, API key identifier, HTTP method, request path, response status code, request body size, response body size, upstream backend service, upstream response time, total gateway time, and correlation ID. Log format is JSON, shipped to CloudWatch Logs, Datadog, Elasticsearch, or Splunk via a log forwarder configured at deployment. Correlation ID generated at the gateway (UUID v4) and injected into the X-Correlation-Id request header forwarded to backend services -- backend service logs that include the correlation ID allow a single slow request to be traced across the gateway log, the service log, and the database query log with a single ID. Distributed tracing integration with OpenTelemetry (trace context propagation via W3C Trace Context headers) for teams using Jaeger, Tempo, or AWS X-Ray. Metrics published per endpoint: request count, error rate (4xx and 5xx separately), p50/p95/p99 latency, and upstream service time. Dashboards in Grafana or CloudWatch with anomaly detection alerts: error rate above baseline, p95 latency exceeding configured threshold, and sudden traffic volume changes (both spikes indicating abuse and drops indicating client-side failures). Access log export to a SIEM (Splunk, Elastic SIEM, Microsoft Sentinel) for security monitoring, audit trail, and compliance with PCI DSS, SOC 2, or ISO 27001 access logging requirements.