Limits

The limits block configures request/response limits, connection limits, and rate limiting. These settings are critical for predictable behavior, resource protection, and “sleepable operations.”

Basic Configuration

limits {
    max-header-size-bytes 8192
    max-header-count 100
    max-body-size-bytes 10485760  // 10MB
}

Header Limits

limits {
    max-header-size-bytes 8192     // Total headers size
    max-header-count 100           // Maximum header count
    max-header-name-bytes 256      // Per-header name size
    max-header-value-bytes 4096    // Per-header value size
}

Setting	Default	Description
`max-header-size-bytes`	`8192` (8KB)	Total size of all headers
`max-header-count`	`100`	Maximum number of headers
`max-header-name-bytes`	`256`	Maximum header name length
`max-header-value-bytes`	`4096` (4KB)	Maximum header value length

Security note: Large header limits can be exploited for denial-of-service. Keep defaults unless you have specific requirements.

Recommended Header Limits

Environment	header-size	header-count	header-value
Production	4096-8192	50-100	2048-4096
Development	16384	200	8192
API Gateway	8192	100	4096

Body Limits

limits {
    max-body-size-bytes 10485760       // 10MB - maximum request body
    max-body-buffer-bytes 1048576      // 1MB - buffer for inspection
    max-body-inspection-bytes 1048576  // 1MB - bytes sent to agents
}

Setting	Default	Description
`max-body-size-bytes`	`10485760` (10MB)	Maximum request body size
`max-body-buffer-bytes`	`1048576` (1MB)	Maximum buffered body size
`max-body-inspection-bytes`	`1048576` (1MB)	Maximum body sent to agents

Body Size Guidelines

Use Case	Recommended Size
API endpoints	1-10 MB
File uploads	100 MB - 1 GB
JSON APIs	1-5 MB
Form submissions	1-10 MB

Memory impact: max-body-buffer-bytes × max-in-flight-requests = potential memory usage for body buffering.

Per-Route Body Limits

Override body limits on specific routes:

routes {
    route "upload" {
        matches {
            path-prefix "/upload/"
        }
        upstream "storage"
        policies {
            max-body-size "500MB"
        }
    }

    route "api" {
        matches {
            path-prefix "/api/"
        }
        upstream "backend"
        policies {
            max-body-size "1MB"
        }
    }
}

Decompression Limits

Protect against decompression bombs:

limits {
    max-decompression-ratio 100.0       // Max 100:1 compression ratio
    max-decompressed-size-bytes 104857600  // 100MB decompressed
}

Setting	Default	Description
`max-decompression-ratio`	`100.0`	Maximum compression ratio allowed
`max-decompressed-size-bytes`	`104857600` (100MB)	Maximum decompressed size

Security note: Compression bombs (zip bombs) can use 1KB of compressed data to expand to gigabytes. These limits prevent resource exhaustion.

Connection Limits

limits {
    max-connections-per-client 100
    max-connections-per-route 1000
    max-total-connections 10000
    max-idle-connections-per-upstream 100
}

Setting	Default	Description
`max-connections-per-client`	`100`	Per-client connection limit
`max-connections-per-route`	`1000`	Per-route connection limit
`max-total-connections`	`10000`	Global connection limit
`max-idle-connections-per-upstream`	`100`	Idle connections per upstream

Connection Limit Sizing

Deployment	per-client	per-route	total
Small	50	500	5000
Medium	100	1000	10000
Large	200	2000	50000
API Gateway	100	5000	100000

Formula: max-total-connections should be less than or equal to OS file descriptor limit minus a safety margin.

Check your limits:

ulimit -n  # Current soft limit
cat /proc/sys/fs/file-max  # System limit

Request Limits

limits {
    max-in-flight-requests 10000
    max-in-flight-requests-per-worker 1000
    max-queued-requests 1000
}

Setting	Default	Description
`max-in-flight-requests`	`10000`	Total concurrent requests
`max-in-flight-requests-per-worker`	`1000`	Per-worker concurrent requests
`max-queued-requests`	`1000`	Pending requests in queue

When limits are reached:

New requests receive 503 Service Unavailable
Retry-After header indicates when to retry

Agent Limits

limits {
    max-agent-queue-depth 100
    max-agent-body-bytes 1048576     // 1MB to agents
    max-agent-response-bytes 10240   // 10KB from agents
}

Setting	Default	Description
`max-agent-queue-depth`	`100`	Pending requests per agent
`max-agent-body-bytes`	`1048576` (1MB)	Request body sent to agents
`max-agent-response-bytes`	`10240` (10KB)	Response size from agents

These limits protect the proxy from misbehaving agents and ensure bounded memory usage.

Rate Limiting

Global Rate Limits

limits {
    max-requests-per-second-global 10000
}

Global limit applies across all clients and routes.

Per-Client Rate Limits

limits {
    max-requests-per-second-per-client 100
}

Limits requests per unique client IP address.

Per-Route Rate Limits

limits {
    max-requests-per-second-per-route 1000
}

Limits total requests to each route.

Rate Limit Behavior

Rate limiting uses token bucket algorithm:

Burst capacity: 10× the per-second limit
Refill rate: Continuous refill at configured rate
Response: 429 Too Many Requests with Retry-After header

Example:

limits {
    max-requests-per-second-per-client 100
}

Burst: 1000 requests
Sustained: 100 requests/second
Over limit: 429 response, retry in ~1 second

Route-Level Rate Limits

For fine-grained control, configure rate limits per route:

routes {
    route "api" {
        matches {
            path-prefix "/api/"
        }
        upstream "backend"
        policies {
            rate-limit {
                requests-per-second 100
                burst 500
                key "client_ip"  // or "header:X-API-Key"
            }
        }
    }
}

Rate limit keys:

client_ip - Client IP address
header:X-API-Key - Header value
path - Request path
route - Route ID

Memory Limits

limits {
    max-memory-bytes 2147483648     // 2GB hard limit
    max-memory-percent 80.0         // 80% of system memory
}

Setting	Default	Description
`max-memory-bytes`	None	Absolute memory limit
`max-memory-percent`	None	Percentage of system memory

When memory limits are reached:

New connections are rejected
Request queues are flushed
Alert is raised via metrics/logs

Profile Presets

Development

limits {
    // Permissive for testing
    max-header-size-bytes 16384
    max-header-count 200
    max-body-size-bytes 104857600  // 100MB
    max-in-flight-requests 100000

    // No rate limits
    max-requests-per-second-global null
    max-requests-per-second-per-client null
}

Production (Conservative)

limits {
    // Restrictive headers
    max-header-size-bytes 4096
    max-header-count 50
    max-header-name-bytes 128
    max-header-value-bytes 2048

    // Limited body
    max-body-size-bytes 1048576  // 1MB
    max-body-buffer-bytes 524288 // 512KB

    // Connection protection
    max-connections-per-client 50
    max-total-connections 5000
    max-in-flight-requests 5000

    // Rate limiting
    max-requests-per-second-global 10000
    max-requests-per-second-per-client 100

    // Memory protection
    max-memory-percent 80.0
}

High-Traffic API

limits {
    // Standard headers
    max-header-size-bytes 8192
    max-header-count 100

    // JSON payloads
    max-body-size-bytes 10485760  // 10MB

    // High throughput
    max-connections-per-client 200
    max-connections-per-route 5000
    max-total-connections 50000
    max-in-flight-requests 50000
    max-in-flight-requests-per-worker 5000

    // Rate limiting
    max-requests-per-second-global 100000
    max-requests-per-second-per-client 1000
}

Complete Example

limits {
    // Headers
    max-header-size-bytes 8192
    max-header-count 100
    max-header-name-bytes 256
    max-header-value-bytes 4096

    // Bodies
    max-body-size-bytes 10485760
    max-body-buffer-bytes 1048576
    max-body-inspection-bytes 1048576

    // Decompression protection
    max-decompression-ratio 100.0
    max-decompressed-size-bytes 104857600

    // Connections
    max-connections-per-client 100
    max-connections-per-route 1000
    max-total-connections 10000
    max-idle-connections-per-upstream 100

    // Requests
    max-in-flight-requests 10000
    max-in-flight-requests-per-worker 1000
    max-queued-requests 1000

    // Agents
    max-agent-queue-depth 100
    max-agent-body-bytes 1048576
    max-agent-response-bytes 10240

    // Rate limiting
    max-requests-per-second-global 10000
    max-requests-per-second-per-client 100
    max-requests-per-second-per-route 1000

    // Memory
    max-memory-percent 80.0
}

Default Values Summary

Category	Setting	Default
Headers	`max-header-size-bytes`	8192
	`max-header-count`	100
	`max-header-name-bytes`	256
	`max-header-value-bytes`	4096
Bodies	`max-body-size-bytes`	10485760 (10MB)
	`max-body-buffer-bytes`	1048576 (1MB)
	`max-body-inspection-bytes`	1048576 (1MB)
Decompression	`max-decompression-ratio`	100.0
	`max-decompressed-size-bytes`	104857600 (100MB)
Connections	`max-connections-per-client`	100
	`max-connections-per-route`	1000
	`max-total-connections`	10000
	`max-idle-connections-per-upstream`	100
Requests	`max-in-flight-requests`	10000
	`max-in-flight-requests-per-worker`	1000
	`max-queued-requests`	1000
Agents	`max-agent-queue-depth`	100
	`max-agent-body-bytes`	1048576 (1MB)
	`max-agent-response-bytes`	10240 (10KB)
Rate Limits	All rate limits	None (disabled)
Memory	Memory limits	None (disabled)

Monitoring Limits

Metrics

Sentinel exposes limit-related metrics:

sentinel_header_size_exceeded_total
sentinel_body_size_exceeded_total
sentinel_connection_limit_reached_total
sentinel_rate_limit_exceeded_total
sentinel_memory_usage_bytes
sentinel_connections_active
sentinel_requests_in_flight

Logging

Limit violations are logged at WARN level:

{
  "level": "WARN",
  "message": "Request body size exceeded limit",
  "limit": 10485760,
  "actual": 15728640,
  "client_ip": "10.0.0.5",
  "trace_id": "2kF8xQw4BnM"
}

Troubleshooting

413 Payload Too Large

Request body exceeds max-body-size-bytes:

# Check current limit
grep max-body-size sentinel.kdl

# Increase for specific route
route "upload" {
    policies {
        max-body-size "100MB"
    }
}

429 Too Many Requests

Rate limit exceeded:

# Check Retry-After header
curl -I https://api.example.com/endpoint
# Retry-After: 1

# Increase limits or implement client-side throttling

503 Service Unavailable (Connection Limit)

Connection or request limits reached:

Check current connections: curl localhost:9090/admin/stats
Review max-total-connections and max-in-flight-requests
Check for connection leaks in clients

Next Steps

Server Configuration - Worker threads and global settings
Upstreams - Connection pool settings