On-Prem & Customer Cloud - Subconscious Docs

For organizations with strict data residency, compliance, or network isolation requirements, we deploy Subconscious inference directly into your infrastructure, whether that’s your own cloud (AWS, GCP, Azure) or an on-premises data center.

Data residency

All data stays within your network boundary, and nothing leaves your environment

Full inference stack

Model serving, API gateway, and monitoring deployed in your infrastructure

White-glove setup

Our team works directly with your infrastructure and security teams

Deployment Options

Customer Cloud

We deploy into your existing cloud environment:

AWS: EKS, EC2, or SageMaker-based deployment
GCP: GKE or Compute Engine deployment
Azure: AKS or VM-based deployment

Your data never leaves your cloud account. We provide the container images, configuration, and deployment automation.

On-Premises

For air-gapped or fully on-premises environments:

Deploy to bare metal or virtualized GPU infrastructure
No internet connectivity required after initial setup
Full control over network policies and access

What’s Included

Model weights and serving infrastructure optimized for your hardware
OpenAI- and Anthropic-compatible API endpoint running inside your network with the same API you already use
Monitoring and observability including health checks, metrics, and logging
Ongoing support with updates, patches, and direct engineering support

Requirements

GPU infrastructure (specific requirements depend on deployment size)
Container orchestration (Kubernetes preferred)
Network access for initial setup and updates (can be air-gapped post-setup)

Talk to Sales

Schedule a call to scope your on-prem deployment.

Dedicated Endpoints Local Devices

Data residency

Full inference stack

White-glove setup

​Deployment Options

​Customer Cloud

​On-Premises

​What’s Included

​Requirements