Confidential inference
Prompts are sealed with AES-256-GCM on ingress. Compute nodes execute against the sealed envelope and have no view of prompt, memory, or output.
Sable Network is an OpenAI-compatible inference layer that encrypts prompts in flight, routes them across confidential compute, and never persists what your agents reason about.
from openai import OpenAI
# Replace the OpenAI client. Nothing else changes.
client = OpenAI(
base_url="https://api.sable.network/v1",
api_key="sk-sable_...",
)
response = client.chat.completions.create(
model="sable-llama-3.3-70b",
messages=[{"role": "user", "content": "Analyze this report."}],
)
print(response.choices[0].message.content)Swap OpenAI() for Sable(). Nothing else changes.
Sable replaces the centralized assumption baked into every AI provider today. Encryption, routing, and execution are protocol-level — not policies in a privacy page.
Prompts are sealed with AES-256-GCM on ingress. Compute nodes execute against the sealed envelope and have no view of prompt, memory, or output.
Workloads route across a global fabric of GPU operators selected by latency, jurisdiction, cost, and privacy tier.
Confidential computing on Intel SGX, AMD SEV, and NVIDIA Confidential Computing isolates inference inside attested enclaves.
Cryptographic attestations let agents confirm the right model ran and the output was untampered. ZK proofs become the long-term substrate.
Swap one base URL. The full ecosystem of SDKs, agents, and tools works unchanged. Migration is the same line of code.
Private memory, signed reasoning, multi-turn confidentiality. Sable is the substrate for the autonomous-agent economy.
Every layer enforces the same invariant: nobody between the developer and the model can read the prompt.
Where developers integrate.
Selects the right node for every request.
Confidential execution at scale.
Integrity for every output.
Every prompt is a leak surface. Trading strategies, medical records, internal source code, agent memory — all of it passes through provider logs and human review pipelines today.
Autonomous systems run thousands of inferences per hour against private state. The unit economics of human review do not survive contact with machine-scale workloads.
Trust-me APIs do not generalize. Sable encodes confidentiality into the routing fabric so it holds regardless of who operates the compute.
We're shipping the inference layer first because that's where the pain is real today. The rest of the stack composes on top.
Create an API key in under a minute. Keep your existing OpenAI code. Route every request through Sable.