Ultra-fast Latency

Sub-3ms policy checks that never slow down your agents. Built for production workloads at any scale with edge enforcement that delivers consistent sub-millisecond performance.

Start free trial Read the docs

<3ms

P99 Latency

Policy checks complete in under 3ms at the 99th percentile

99.99%

Uptime SLA

Enterprise-grade reliability with financially-backed guarantees

10B+

Requests/Month

Processing billions of requests across our customer base

50+

Edge Locations

Global edge network for sub-millisecond response times

Benchmarks

Real-world performance

These benchmarks were measured on production traffic with real policies and data. Performance varies based on policy complexity and payload size.

Measured at P99 latency
Production traffic patterns
Includes network round-trip
Full policy evaluation + logging

Performance Benchmarks

Simple allow/block

0.8ms125,000 req/s

PII detection (short)

1.2ms85,000 req/s

PII detection (long)

2.1ms48,000 req/s

Complex policy chain

2.8ms36,000 req/s

Full audit logging

3.2ms31,000 req/s

Architecture

Built for speed at every layer

Global Edge Network

Policy enforcement runs at the edge, close to your agents. 50+ points of presence worldwide ensure low latency everywhere.

Zero-Copy Processing

Our architecture avoids unnecessary data copies. Policies are evaluated in-place for maximum efficiency.

Auto-Scaling

Handle traffic spikes automatically. Our infrastructure scales horizontally to meet demand without manual intervention.

Real-time Metrics

Monitor latency, throughput, and error rates in real-time. Set up alerts when performance degrades.

Predictive Caching

Frequently-accessed policies are cached at the edge. Hot paths are optimized automatically based on usage patterns.

Async Mode

For non-blocking use cases, fire-and-forget mode lets you log without waiting for a response.

Comparison

Negligible overhead

Notary Labs adds less than 3ms to your agent requests—a fraction of what typical LLM API calls take.

Notary Labs policy check3ms

Typical database query15ms

OpenAI API call (fast)500ms

OpenAI API call (avg)2000ms

Security without the slowdown

Experience enterprise-grade security with sub-millisecond latency.

Start free trial