Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.subconscious.dev/llms.txt

Use this file to discover all available pages before exploring further.

The Subconscious Cloud API is the fastest way to access our inference infrastructure. All requests go through api.subconscious.dev, our managed gateway that handles authentication, rate limiting, and routing.

How It Works

OpenAI compatible

Use the standard OpenAI SDK with no custom libraries needed

Global edge

Low-latency access from anywhere

Managed infrastructure

We handle scaling, reliability, and uptime

Architecture

Your API requests flow through a managed gateway:
  1. Your application calls api.subconscious.dev/v1/chat/completions
  2. The gateway authenticates your API key and enforces rate limits
  3. The request is routed to the inference cluster
  4. Responses stream back to your application

API Keys

API keys are created and managed from your Subconscious dashboard. Each organization can have multiple active keys.
Python
from openai import OpenAI

client = OpenAI(
    api_key="your-api-key",               # From your dashboard
    base_url="https://api.subconscious.dev/v1",
)

Default Limits

LimitDefault
Tokens per minute1,000,000
Requests per minute100
Tokens per day10,000,000
Need higher limits? See Dedicated Endpoints or contact us.

Billing

Usage is billed per token. See Pricing for current rates.
  • Credits are added from your dashboard
  • Usage is deducted automatically per request
  • Auto-pay is available for uninterrupted service
  • Usage dashboards show real-time consumption