The Subconscious Cloud API is the fastest way to access our inference infrastructure. All requests go throughDocumentation Index
Fetch the complete documentation index at: https://docs.subconscious.dev/llms.txt
Use this file to discover all available pages before exploring further.
api.subconscious.dev, our managed gateway that handles authentication, rate limiting, and routing.
How It Works
OpenAI compatible
Use the standard OpenAI SDK with no custom libraries needed
Global edge
Low-latency access from anywhere
Managed infrastructure
We handle scaling, reliability, and uptime
Architecture
Your API requests flow through a managed gateway:- Your application calls
api.subconscious.dev/v1/chat/completions - The gateway authenticates your API key and enforces rate limits
- The request is routed to the inference cluster
- Responses stream back to your application
API Keys
API keys are created and managed from your Subconscious dashboard. Each organization can have multiple active keys.Python
Default Limits
| Limit | Default |
|---|---|
| Tokens per minute | 1,000,000 |
| Requests per minute | 100 |
| Tokens per day | 10,000,000 |
Billing
Usage is billed per token. See Pricing for current rates.- Credits are added from your dashboard
- Usage is deducted automatically per request
- Auto-pay is available for uninterrupted service
- Usage dashboards show real-time consumption