SableNetwork

Streaming

Pass stream: true to receive Server-Sent Events. The same SDK iterators that work with OpenAI work here.

stream = client.chat.completions.create(
  model="sable-llama-3.3-70b",
  messages=[{"role":"user","content":"Stream me a haiku about privacy."}],
  stream=True,
)
for chunk in stream:
  delta = chunk.choices[0].delta.content or ""
  print(delta, end="", flush=True)

Failover behavior

If the primary upstream fails before the first byte streams, Sable transparently fails over to the next configured upstream. Once bytes start flowing, there's no failover — interrupted streams surface as an SSE disconnect.