Local Deployment

UsageGuard provide the option to deploy and run core UsageGuard components inside your infrastructure.

Requirments

You will need to be able to download, host and run the following:

Core Components

OpenSearch (Documentation )
PostgreSQL (Documentation )

Documents / Search Support

AWS S3 or MinIO (Documentation )
Pinecone or Qdrant (Documentation )

Local Inference

Ollama (Documentation )

Services API Keys

You can always only work with local inference using Ollama as a provider, however you can also supply API keys for the services that you want to use like OpenAI or AWS Bedrock which will provide access to models from Anthropic, Mistral, Nova family of models and more.

You can also opt for using Qdrant as your vector database or provide API key for Pinecone (Serverless).

UsageGuard Installation

UsageGuard local deployments has two components each in its own docker image

Reverse Proxy APIs (ug-ecr-reverse-proxy)
Management Dashboards (ug-ecr-reverse-proxy-app)

Management Dashboards are optional however its highly recommended.

# Windows PowerShell
.\PullContainer.ps1 -Token "<paste-token-here>" -RepositoryName "ug-ecr-reverse-proxy"
.\PullContainer.ps1 -Token "<paste-token-here>" -RepositoryName "ug-ecr-reverse-proxy-app"

# Linux / macOS
./pull_container.sh "<your-token>" "ug-ecr-reverse-proxy"
./pull_container.sh "<your-token>" "ug-ecr-reverse-proxy-app"

Once you opt-in for local deployment contact our support to issue tokens for pulling UsageGuard images.

please get in touch with us at support.

for each UsageGuard container, mount appsettings.json into the container with your own settings or use enviroument variable to override existing settings.