How to Deploy Quorvex AI¶

Choose a deployment mode and configure it for your environment, from local development to Kubernetes auto-scaling.

Prerequisites¶

Docker and Docker Compose v2 (for all modes except local dev)
An .env or .env.prod file with required secrets
For Kubernetes: a cluster with kubectl configured and a container registry

Deployment Modes Overview¶

Mode	Use Case	Command	Scaling
Local dev	Solo developer	`make dev`	Single instance
Docker dev	Team/reproducible	`make docker-up`	Single instance
Production (Standard)	Small team, VNC	`make prod-up`	1 backend + browsers
Production (Workers)	Medium team	`make workers-up`	N browser containers
Docker Swarm	Enterprise, simpler	`make swarm-up`	Overlay networking
Kubernetes	Enterprise, auto-scale	`make k8s-deploy`	HPA auto-scaling

Local Development¶

The simplest mode. Runs the backend and frontend as local processes using SQLite.

make setup    # One-time: venv, deps, browsers
make dev      # Start backend (port 8001) + frontend (port 3000)

make dev executes start-ui.sh, which:

Kills any existing processes on ports 8001 and 3000
Starts a PostgreSQL container if Docker is available (falls back to SQLite)
Launches the FastAPI backend with uvicorn --reload
Launches the Next.js frontend with npm run dev

Services:

Service	URL
Dashboard	http://localhost:3000
Backend API	http://localhost:8001
API Docs (Swagger)	http://localhost:8001/docs

Logs are written to api.log and web.log in the project root.

make stop     # Stop all services
make logs     # Tail backend + frontend logs

Docker Development¶

Run all services in containers using docker-compose.yml.

make docker-build    # Build images
make docker-up       # Start all services
make docker-down     # Stop all services

Production Deployment (Docker Compose)¶

Production uses docker-compose.prod.yml with an .env.prod configuration file.

Prerequisites¶

Docker and Docker Compose v2
An .env.prod file with production secrets

Create Production Environment File¶

cp .env.example .env.prod

Edit .env.prod with production values:

# Required secrets
ANTHROPIC_AUTH_TOKEN=your-production-token
JWT_SECRET_KEY=$(openssl rand -hex 32)
POSTGRES_PASSWORD=$(openssl rand -hex 16)
MINIO_ROOT_PASSWORD=$(openssl rand -hex 16)

# Security settings
REQUIRE_AUTH=true
ALLOW_REGISTRATION=false

# Optional: initial admin user
INITIAL_ADMIN_EMAIL=admin@yourcompany.com
INITIAL_ADMIN_PASSWORD=your-secure-password

Standard Mode (with VNC)¶

Runs a single backend container with Playwright browsers and VNC streaming. Best for small teams that need live browser observation.

make prod-up

Services started:

Service	URL	Description
Dashboard	http://localhost:3000	Next.js frontend
API	http://localhost:8001	FastAPI backend
VNC View	http://localhost:6080	Live browser (noVNC)
MinIO Console	http://localhost:9001	Object storage admin

The backend container uses supervisord to manage: - Xvfb (virtual display at :99, 1920x1080x24) - Fluxbox (window manager) - x11vnc (VNC server) - websockify (WebSocket bridge on port 6080) - uvicorn (API server on port 8001)

Resource allocation: 24 GB memory limit, 8 CPUs, 2 GB shared memory.

Development Mode (Local Code Mounting)¶

Mount local source code into production containers for faster iteration without rebuilding:

make prod-dev

Changes to orchestrator/ auto-reload via uvicorn --reload. Changes to web/src/ auto-reload via Next.js.

Workers Mode (Isolated Browsers)¶

Separates browsers into dedicated worker containers. The backend runs as a slim image (no browsers, 4 GB memory instead of 24 GB). Workers communicate via Redis job queue.

make workers-up              # Start with default 4 workers
make workers-scale N=8       # Scale to 8 workers
make workers-status          # View worker status and resource usage
make workers-logs            # Tail worker logs
make workers-down            # Stop everything

Architecture in workers mode:

Frontend (port 3000)
    |
Backend-Slim (port 8001, no browsers)
    |
Redis (job queue)
    |
+---+---+---+---+
| W1 | W2 | W3 | W4 |   <-- browser-workers (scalable)
+---+---+---+---+

Each worker container has: - 2 GB memory limit, 2 CPUs - 1 GB shared memory - Isolated Chromium browser instance

Production Commands Reference¶

make prod-up              # Start standard mode
make prod-down            # Stop services (30s graceful timeout)
make prod-down-safe       # Backup first, then stop
make prod-restart         # Restart backend only (picks up code changes)
make prod-logs            # Tail backend + frontend logs
make prod-build           # Rebuild images (with cache)
make prod-build-no-cache  # Rebuild images (fresh, no cache)
make prod-status          # Service status + health check

Upgrading Production¶

make upgrade

This runs a 6-step procedure:

Pre-flight health check
Full backup (DB + specs + tests + PRDs)
git pull latest code
Rebuild Docker images
Run database migrations
Restart services and verify health

Rollback if something goes wrong:

make db-downgrade              # Roll back migration
git checkout <previous-tag>    # Revert code
make prod-build && make prod-up  # Rebuild and restart

Backup and Recovery¶

Running Backups¶

make backup              # Database only
make backup-full         # DB + specs + tests + PRDs + ChromaDB
make backup-status       # View backup history

Backups are stored in the backup_data Docker volume and optionally synced to MinIO.

Restoring from Backup¶

make restore-list                    # List available backups
make restore TS=20260208_143022      # Restore from timestamp
make restore-from-minio TS=...      # Restore from MinIO

Scheduled Backups¶

The backup-scheduler service runs in production and executes: - 2 AM daily: Full backup with MinIO sync - 3 AM daily: Artifact archival (hot/warm/cold retention tiers)

Artifact Retention¶

Tier	Age	Storage	Contents
Hot	0-30 days	Local (`runs/`)	All artifacts
Warm	30-90 days	MinIO	Core artifacts only (plan.json, validation.json, report.html)
Cold	90+ days	Deleted	Database metadata only

make archival             # Run archival now
make archival-dry-run     # Preview what would be archived
make storage-health       # Check storage health

Docker Swarm¶

For enterprise deployments without Kubernetes. Uses Docker's built-in orchestration with overlay networking and rolling updates.

Deploy¶

# Initialize Swarm (if not already)
docker swarm init

# Deploy the stack
make swarm-up

Scale¶

make swarm-scale N=8       # Scale browser workers to 8
make swarm-status          # View service status
make swarm-down            # Remove stack

Stack Architecture¶

Defined in docker-compose.swarm.yml:

Service	Replicas	Resources
browser-workers	4 (scalable)	2 GB / 2 CPUs per replica
backend (slim)	2	4 GB / 4 CPUs per replica
frontend	2	512 MB / 0.5 CPUs
redis	1	256 MB
postgres	1 (manager node)	4 GB / 2 CPUs

Rolling updates are configured with: - Parallelism: 2 (workers), 1 (backend) - Delay: 10 seconds between batches - Failure action: rollback

Kubernetes¶

Enterprise auto-scaling deployment with HPA (Horizontal Pod Autoscaler).

Prerequisites¶

Kubernetes cluster 1.24+
kubectl configured
Container registry for images
nginx-ingress controller (optional, for external access)
Storage class for PersistentVolumeClaims

Deploy¶

# 1. Configure secrets
cp k8s/secrets.yaml k8s/secrets.local.yaml
# Edit secrets.local.yaml with your values
kubectl apply -f k8s/secrets.local.yaml

# 2. Build and push images
docker build -t your-registry/playwright-worker:latest -f docker/browser-worker/Dockerfile .
docker build -t your-registry/playwright-backend-slim:latest -f docker/backend-slim/Dockerfile .
docker build -t your-registry/playwright-frontend:latest -f web/Dockerfile web/
docker push your-registry/playwright-worker:latest
docker push your-registry/playwright-backend-slim:latest
docker push your-registry/playwright-frontend:latest

# 3. Update kustomization.yaml with your registry
# 4. Deploy
make k8s-deploy

Auto-Scaling Configuration¶

The HPA in k8s/browser-worker-deployment.yaml is configured as:

Parameter	Value
Min replicas	2
Max replicas	20
CPU target	70% utilization
Memory target	80% utilization
Scale-up delay	Immediate
Scale-down delay	5 minutes

Manage¶

make k8s-status          # Pods, services, HPA, ingress
make k8s-scale N=10      # Manual scale (HPA may override)
make k8s-logs            # Interactive log tailing
make k8s-delete          # Remove all resources

Resource Limits¶

Component	CPU Request/Limit	Memory Request/Limit
Browser Worker	1 / 2	1 Gi / 2 Gi
Backend	1 / 4	1 Gi / 4 Gi
Frontend	250m / 500m	256 Mi / 512 Mi
PostgreSQL	500m / 2	1 Gi / 4 Gi
Redis	100m / 500m	64 Mi / 256 Mi

Persistent Volumes¶

PVC	Size	Purpose
postgres-pvc	10 Gi	Database storage
runs-pvc	50 Gi	Test run artifacts
logs-pvc	10 Gi	Application logs
specs-pvc	5 Gi	Test specifications
tests-pvc	10 Gi	Generated tests
test-results-pvc	20 Gi	Playwright reports

Kubernetes Files¶

All manifests are in k8s/:

File	Purpose
`kustomization.yaml`	Kustomize configuration and image overrides
`namespace.yaml`	`playwright-agent` namespace
`secrets.yaml`	Secret template (copy to `secrets.local.yaml`)
`configmap.yaml`	Non-secret configuration
`backend-deployment.yaml`	Backend slim deployment (2 replicas)
`frontend-deployment.yaml`	Frontend deployment
`browser-worker-deployment.yaml`	Worker deployment + HPA
`postgres-deployment.yaml`	PostgreSQL StatefulSet
`redis.yaml`	Redis deployment
`pvc.yaml`	PersistentVolumeClaims
`ingress.yaml`	Ingress rules for external access

Database Migrations¶

When deploying model changes to PostgreSQL:

make db-migrate M="describe your change"   # Generate migration
make db-upgrade                              # Apply pending migrations
make db-downgrade                            # Roll back one step
make db-history                              # View migration history
make db-stamp R=001                          # Stamp existing DB at revision

Migrations are stored in orchestrator/migrations/versions/ and managed by Alembic.

TLS/SSL with Nginx¶

An optional nginx reverse proxy is available for TLS termination:

Place certificates in nginx/certs/
Configure nginx/nginx.conf

Start with the nginx profile:

docker compose --env-file .env.prod -f docker-compose.prod.yml --profile standard --profile nginx up -d

Nginx exposes ports 80 and 443 and proxies to the backend and frontend.

Health Checks¶

All production services include health checks:

make health-check    # Hit all health endpoints

Available health endpoints:

Endpoint	Purpose
`GET /health`	Backend API status
`GET /health/storage`	Local + MinIO storage status
`GET /health/backup`	Last backup status
`GET /health/alerts`	Active alerts

Maintenance Commands¶

make docker-prune     # Remove dangling images, stopped containers, build cache
make volume-sizes     # Show Docker volume sizes
make db-vacuum        # Run VACUUM ANALYZE on PostgreSQL
make deps-lock        # Capture venv versions to requirements.freeze (NOT requirements.lock)

Verification¶

After deployment, confirm everything works:

make health-check passes all endpoints
Dashboard loads at the configured URL
Login works with admin credentials (if auth is enabled)
A test spec runs and completes through the pipeline
Backups are being created (check make backup-status)

Company Deployment -- on-premises deployment walkthrough
Disaster Recovery -- backup and recovery procedures
Authentication -- enable auth and manage users
Troubleshooting -- diagnose deployment issues

How to Deploy Quorvex AI¶

Prerequisites¶

Deployment Modes Overview¶

Local Development¶

Docker Development¶

Production Deployment (Docker Compose)¶

Prerequisites¶

Create Production Environment File¶

Standard Mode (with VNC)¶

Development Mode (Local Code Mounting)¶

Workers Mode (Isolated Browsers)¶

Production Commands Reference¶

Upgrading Production¶

Backup and Recovery¶

Running Backups¶

Restoring from Backup¶

Scheduled Backups¶

Artifact Retention¶

Docker Swarm¶

Deploy¶

Scale¶

Stack Architecture¶

Kubernetes¶

Prerequisites¶

Deploy¶

Auto-Scaling Configuration¶

Manage¶

Resource Limits¶

Persistent Volumes¶

Kubernetes Files¶

Database Migrations¶

TLS/SSL with Nginx¶

Health Checks¶

Maintenance Commands¶

Verification¶

Related Guides¶