> [!IMPORTANT]
> Vibe code experiment.
A lightweight proxy for LLM interactions with basic guardrails, logging, and metrics.
- Single LLM provider support (OpenAI)
- Basic guardrail for banned words
- Logging of requests and responses
- Prometheus metrics
- Config-driven setup
- Docker deployment
- Monitoring with Prometheus and Grafana
The proxy can be configured using a YAML file or environment variables:
```yaml
server:
  port: 8080

llm:
  url: "https://api.openai.com/v1/chat/completions"
  api_key: "YOUR_OPENAI_API_KEY"

guardrails:
  banned_words:
    - "bomb"
    - "attack"
```
Alternatively, the same settings can be supplied via environment variables:

- `SERVER_PORT`: Server port (default: 8080)
- `LLM_URL`: LLM API endpoint URL
- `LLM_API_KEY`: LLM API key
- `BANNED_WORDS`: Comma-separated list of banned words
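As a rough sketch, the proxy can then be started without a YAML file by exporting these variables first. This assumes the binary falls back to environment variables when no `--config` flag is given:

```bash
# Configure entirely via the environment (assumes env vars are read when --config is omitted)
export SERVER_PORT=8080
export LLM_URL="https://api.openai.com/v1/chat/completions"
export LLM_API_KEY="YOUR_OPENAI_API_KEY"
export BANNED_WORDS="bomb,attack"

./ai-proxy
```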
POST /v1/query

Request:

```json
{
  "prompt": "Your prompt to the LLM",
  "model_params": {
    "model": "gpt-3.5-turbo",
    "temperature": 0.7,
    "max_tokens": 256
  }
}
```

Response:

```json
{
  "completion": "LLM response text..."
}
```
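For programmatic use, here is a minimal Go client sketch. The endpoint and JSON shapes come from this README; the host/port and the bare-bones error handling are illustrative only:

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// Shapes mirror the request/response JSON documented above.
type queryRequest struct {
	Prompt      string                 `json:"prompt"`
	ModelParams map[string]interface{} `json:"model_params,omitempty"`
}

type queryResponse struct {
	Completion string `json:"completion"`
}

func main() {
	body, err := json.Marshal(queryRequest{
		Prompt: "Tell me a joke",
		ModelParams: map[string]interface{}{
			"model":       "gpt-3.5-turbo",
			"temperature": 0.7,
		},
	})
	if err != nil {
		panic(err)
	}

	// Assumes the proxy is reachable on localhost:8080 (the default port).
	resp, err := http.Post("http://localhost:8080/v1/query", "application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	var out queryResponse
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		panic(err)
	}
	fmt.Println(out.Completion)
}
```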
GET /metrics
Returns Prometheus-formatted metrics including:
- `llm_requests_total`: Total number of LLM requests processed
- `llm_errors_total`: Total number of errors from LLM calls
- `llm_tokens_total`: Total number of tokens used in LLM calls
- `guardrail_blocks_total`: Total number of requests blocked by guardrails
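If you run your own Prometheus instead of the bundled stack below, a scrape config along these lines should work. The job name and target are illustrative; adjust the target to wherever the proxy is reachable:

```yaml
scrape_configs:
  - job_name: "ai-proxy"             # illustrative job name
    metrics_path: /metrics
    static_configs:
      - targets: ["localhost:8080"]  # adjust to the proxy's address
```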
GET /health

Health check endpoint.
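A quick liveness check, assuming the default port; the exact response body depends on the implementation:

```bash
curl http://localhost:8080/health
```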
```bash
# Build and run
go build -o ai-proxy ./cmd/server
./ai-proxy --config config/config.yaml
```
```bash
# Build Docker image
docker build -t ai-proxy:0.1 .

# Run with configuration in environment variables
docker run -p 8080:8080 \
  -e LLM_API_KEY=your_openai_api_key \
  ai-proxy:0.1

# Or mount a custom config file
docker run -p 8080:8080 \
  -v $(pwd)/config/config.yaml:/app/config/config.yaml \
  ai-proxy:0.1
```
The project includes a complete monitoring stack with Prometheus and Grafana.
```bash
# Start the entire stack (AI Proxy, Prometheus, and Grafana)
docker-compose up -d

# Access the services:
# - AI Proxy: http://localhost:8080
# - Prometheus: http://localhost:9090
# - Grafana: http://localhost:3000 (login with admin/admin)

# Stop the services
docker-compose down
```
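Once metrics are flowing, the counters listed above can be charted in Grafana or queried directly in Prometheus. A couple of illustrative PromQL queries (the `sum()` wrappers keep the ratio valid regardless of whatever labels the counters carry):

```promql
# Request rate over the last 5 minutes
rate(llm_requests_total[5m])

# Share of requests blocked by guardrails
sum(rate(guardrail_blocks_total[5m])) / sum(rate(llm_requests_total[5m]))
```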
```bash
# Send a query
curl -X POST http://localhost:8080/v1/query \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Tell me a joke", "model_params": {"temperature": 0.7}}'

# Check metrics
curl http://localhost:8080/metrics
```
This MVP focuses on core functionality. Possible future enhancements include:
- **Enhanced Guardrails**
  - More sophisticated content filtering options
  - Support for custom filtering rules
- **Additional LLM Support**
  - Integration with other LLM providers
- **Simple Authentication**
  - Basic rate limiting
- **Performance Improvements**
  - Optional caching for common queries
  - Optimizations for high-traffic scenarios