System Architecture Overview¶

Quorvex AI converts natural language test specifications into production-ready Playwright tests through a layered architecture of user interfaces, pipeline engines, AI agents, and infrastructure services.

Why This Architecture¶

The system solves a fundamental tension: AI-powered test generation requires orchestrating multiple long-running, stateful operations (browser automation, LLM conversations, file I/O) while remaining responsive and scalable. Rather than building a monolith, the architecture separates concerns into loosely coupled layers that communicate through well-defined interfaces.

graph TB
    subgraph UI["User Interfaces"]
        CLI["CLI\norchestrator/cli.py"]
        WEB["Web Dashboard\nNext.js :3000"]
    end

    subgraph BACKEND["Backend API · FastAPI :8001"]
        API["FastAPI Application"]
        AUTH["Auth & RBAC"]
        ROUTERS["API Routers\nspecs · runs · regression\nexploration · requirements\nrtm · analytics · health"]
    end

    subgraph PIPELINE["Pipeline Engine"]
        NATIVE["Pipeline"]
        HEALER["Self-Healing Loop"]
        EXPLORER["App Explorer"]
        PRD["PRD Processor"]
    end

    subgraph AILAYER["AI Layer"]
        RUNNER["AgentRunner"]
        SDK["Claude Agent SDK"]
        AGENTS["Agent Definitions\n.claude/agents/*.md"]
    end

    subgraph MEMORY["Memory System"]
        VS["Vector Store · ChromaDB"]
        GS["Graph Store · NetworkX"]
        ES["Exploration Store · SQLModel"]
    end

    subgraph INFRA["Infrastructure"]
        BP["Browser Pool"]
        AQ["Agent Queue · Redis"]
        DB[("Database")]
        STORE["Storage · Local + MinIO"]
    end

    CLI --> NATIVE
    WEB --> API
    API --> AUTH
    API --> ROUTERS
    ROUTERS --> NATIVE
    ROUTERS --> EXPLORER

    NATIVE --> HEALER
    NATIVE --> RUNNER
    RUNNER --> SDK
    SDK --> AGENTS

    NATIVE --> BP
    EXPLORER --> BP
    RUNNER --> VS
    EXPLORER --> ES

    API --> DB
    API --> AQ
    STORE --> DB

Dual-Interface Design¶

The platform exposes two interfaces -- CLI and Web Dashboard -- that share the same backend logic but serve different use cases.

CLI (orchestrator/cli.py) is optimized for automation and CI/CD. It spawns pipeline stages as subprocesses, communicates through file artifacts (JSON, exit codes, stdout), and requires no database. This makes it suitable for scripting, one-off runs, and environments where a web server is unwanted.

Web Dashboard (web/ + orchestrator/api/) adds persistence, collaboration, and monitoring. The Next.js frontend communicates with a FastAPI backend over HTTP. This mode requires a database (PostgreSQL in production, SQLite for development) and enables features like regression batches, scheduled runs, and team collaboration.

Why two interfaces?

Many teams start with the CLI during evaluation, then adopt the dashboard as usage grows. Keeping both interfaces ensures the core pipeline logic never depends on a web server, which simplifies testing and debugging.

Component Interaction Trade-offs¶

Subprocess Isolation¶

Every pipeline stage runs as a separate subprocess. This is the single most important architectural decision. The Claude Agent SDK throws "cancel scope" errors during cleanup that can discard accumulated results. By isolating each stage, cleanup errors in one stage cannot corrupt another. The cost is slightly higher latency from process spawning, but reliability wins over speed for a tool that generates production test code.

For details on the subprocess model, see Pipeline Architecture.

Stateless API, Stateful Workers¶

The FastAPI backend is intentionally stateless -- it delegates long-running work to background processes or the agent queue. This means you can run multiple API instances behind a load balancer without sticky sessions. Browser operations are managed through the browser pool, and agent tasks flow through a Redis queue when available.

Memory as an Optimization Layer¶

The memory system is designed as an optimization, not a requirement. Pipelines work without it, but memory improves quality by passing proven selectors to generators and feeding exploration discoveries into requirements generation. This design means a failed ChromaDB instance degrades performance gracefully rather than blocking test generation.

Technology Stack¶

Layer	Technology	Why This Choice
Frontend	Next.js (App Router), React, Tailwind	Server components for fast loads; Tailwind for rapid UI iteration
Backend API	FastAPI, Uvicorn, SQLModel	Async-first for browser operations; SQLModel bridges Pydantic and SQLAlchemy
AI	Claude Agent SDK, Anthropic API	MCP tool support enables live browser interaction during generation
Database	PostgreSQL (prod), SQLite (dev)	SQLite for zero-config development; PostgreSQL for production concurrency
Browser	Playwright	Multi-browser support; best-in-class auto-wait and selector APIs
Memory	ChromaDB (vector), NetworkX (graph)	Embedded mode avoids external services; cosine similarity for selector matching
Queue	Redis	Lightweight, battle-tested; doubles as rate limit and session store
Storage	MinIO (S3-compatible)	Self-hosted object storage for artifact archival without cloud vendor lock-in

Data Flow¶

The system transforms input (markdown specs, PDFs, live URLs) into output (passing Playwright tests, requirements, traceability matrices) through feedback loops:

flowchart TB
    subgraph "Inputs"
        MD["Markdown Spec"]
        PDF["PDF PRD"]
        URL["Live URL"]
    end

    subgraph "Processing"
        PLAN["Planner"]
        GEN["Generator"]
        HEAL["Healer"]
        EXPLORE["Explorer"]
    end

    subgraph "Outputs"
        CODE["Playwright Test<br/>tests/generated/*.spec.ts"]
        REQ["Requirements"]
        RTM["RTM"]
    end

    subgraph "Feedback"
        MEM["Memory System"]
    end

    MD --> PLAN --> GEN --> CODE
    PDF -->|"extract features"| PLAN
    URL --> EXPLORE --> REQ --> RTM

    CODE -->|"fail"| HEAL -->|"fix"| CODE
    CODE -->|"pass: store patterns"| MEM
    MEM -.->|"proven selectors"| GEN

The healing loop is central to reliability. When a generated test fails, the healer analyzes the browser state at the failure point and fixes the code. This loop runs up to 3 times with the native healer and up to 20 times in hybrid mode. Each successful run feeds selector patterns back into the memory system, making future generations more accurate.

Credential Flow¶

All AI credentials load from .env via orchestrator/load_env.py. The setup_claude_env() function must be called before any Agent SDK usage. Per-project credentials (stored encrypted in Project.settings) override .env values at runtime, enabling multi-tenant deployments where different teams use different API keys.

Pipeline Architecture -- Why subprocess execution and how pipeline types differ
Memory System -- How vector and graph stores improve test quality
Browser Pool -- Concurrency management and resource protection
Infrastructure -- Deployment topology trade-offs
Security Model -- Authentication and authorization design