VCAL Documentation
Unified VCAL docs • Self-hosted AI infrastructure • Rust-based tooling

Documentation for the VCAL infrastructure stack

Find guides, configuration references, deployment notes, and architecture documentation for AI Cost Firewall, vcal-core, and VCAL Server.

Open-source gateway

AI Cost Firewall

Documentation for the OpenAI-compatible gateway that reduces wasted LLM spend with exact and semantic caching.

  • • Quickstart and Docker Compose setup
  • • Configuration reference
  • • Redis, Qdrant, Prometheus, and Grafana
  • • Semantic cache lifecycle and diagnostics
Open-source core

vcal-core

Documentation for the core VCAL library and semantic matching foundation used across the VCAL ecosystem.

  • • Concepts and architecture
  • • API and usage notes
  • • Semantic cache building blocks
  • • Integration guidance for developers
Commercial semantic cache

VCAL Server

Documentation for the production semantic cache server designed for private AI infrastructure and enterprise deployment.

  • • Installation and trial licensing
  • • Docker and binary deployment
  • • Persistence, operations, and observability
  • • API usage and production guidance
Recommended path

Choose the right documentation entry point

VCAL documentation is separated by product so each component can evolve independently, while this page stays as the stable entry point for the entire documentation ecosystem.

New to VCAL

Start with AI Cost Firewall

Use the gateway docs first if your goal is to reduce LLM cost, add request visibility, or run a quick local demo.

Building integrations

Read vcal-core

Use the core docs when you need lower-level concepts, developer integration guidance, or semantic cache internals.

Production deployment

Deploy VCAL Server

Use the server docs when semantic caching becomes part of your production AI infrastructure.