Built for autonomous agents

Your agents'
inference should be sovereign.

Sable is the inference layer for AI agents that handle private state — trading strategies, agent memory, internal source, user data. Drop-in OpenAI compatibility, cryptographic privacy guarantees.

Get an API key Read the quickstart

# Before
from openai import OpenAI
client = OpenAI()

# After
from sable_network import Sable
client = Sable(api_key=os.environ["SABLE_API_KEY"])

# Everything below this line is unchanged.
response = client.chat.completions.create(
    model="sable-llama-3.3-70b",
    messages=agent_memory.recent_messages(),
)

One import. Every existing call keeps working.

The agent problem

Why agents need a different inference layer.

Provider logs are an exfil channel.

Your agent writes prompts that contain trade signals, internal source, customer data. Every centralized provider's privacy policy says 'we may use this for safety review' — which is a polite way of saying 'a human might read it'.

Memory needs more protection than chat.

Conversational privacy assumes one prompt per session. Agent memory persists, references prior reasoning, and accumulates value. The threat model is different — and worse — than a chatbot.

Trust-me APIs don't compose.

When agent A calls agent B which calls provider C, the trust assumptions multiply. Cryptographic guarantees are the only thing that scales.

What you get

Cryptographic privacy, OpenAI ergonomics.

Sealed envelopes
Prompts are encrypted with AES-256-GCM on ingress. The plaintext window lives only between the egress shim and the upstream — never on disk.
Metadata-only logs
Inference logs hold model id, node id, token counts, latency. The body is never persisted. Your dashboard shows it; your agents trust it.
OpenAI-compatible everything
Chat completions, embeddings, streaming, tool use, JSON mode. If your code targets /v1/chat/completions, it targets Sable.
Privacy tiers per call
Standard, confidential, sovereign. Default per API key, override per request. Each tier maps to a different node and routing posture.

Stop trusting. Start verifying.

Move your agents off the centralized loop.

One line of code, and every prompt is sealed. Create an API key, swap your base URL, and ship.

Create an API key Open the docs

Your agents' inference should be sovereign.

Why agents need a different inference layer.

Provider logs are an exfil channel.

Memory needs more protection than chat.

Trust-me APIs don't compose.

Cryptographic privacy, OpenAI ergonomics.

Sealed envelopes

Metadata-only logs

OpenAI-compatible everything

Privacy tiers per call

Move your agents off the centralized loop.

Your agents'
inference should be sovereign.