Skip to main content
For organizations with strict data residency, compliance, or network isolation requirements, we deploy Subconscious inference directly into your infrastructure, whether that’s your own cloud (AWS, GCP, Azure) or an on-premises data center.

Data residency

All data stays within your network boundary, and nothing leaves your environment

Full inference stack

Model serving, API gateway, and monitoring deployed in your infrastructure

White-glove setup

Our team works directly with your infrastructure and security teams

Deployment Options

Customer Cloud

We deploy into your existing cloud environment:
  • AWS: EKS, EC2, or SageMaker-based deployment
  • GCP: GKE or Compute Engine deployment
  • Azure: AKS or VM-based deployment
Your data never leaves your cloud account. We provide the container images, configuration, and deployment automation.

On-Premises

For air-gapped or fully on-premises environments:
  • Deploy to bare metal or virtualized GPU infrastructure
  • No internet connectivity required after initial setup
  • Full control over network policies and access

What’s Included

  • Model weights and serving infrastructure optimized for your hardware
  • OpenAI- and Anthropic-compatible API endpoint running inside your network with the same API you already use
  • Monitoring and observability including health checks, metrics, and logging
  • Ongoing support with updates, patches, and direct engineering support

Requirements

  • GPU infrastructure (specific requirements depend on deployment size)
  • Container orchestration (Kubernetes preferred)
  • Network access for initial setup and updates (can be air-gapped post-setup)

Talk to Sales

Schedule a call to scope your on-prem deployment.