Quick Start

This guide gets you from zero to a working Highflame integration in under ten minutes. By the end you will have routed a request through Highflame, seen a decision in Observatory, and chosen the integration path that fits your stack.


Before you start

You will need:

  • A Highflame account — sign up at studio.highflame.ai

  • Python 3.10+ or Node.js 18+ if you plan to use the SDK path

  • An API key from your LLM provider (OpenAI, Anthropic, etc.) if you plan to use the Highflame Agent Gateway


Step 1 — Get your Highflame API key

  1. Sign in to Highflame Studio

  2. Navigate to Account → Developer Settings

  3. Create an API key and export it:

export HIGHFLAME_API_KEY="hf_sk_..."

Step 2 — Make your first protected request

Choose the path that matches how your application works. Both paths use the same detection pipeline, Cedar policies, and Observatory backend — they differ only in where integration happens.


Path A — Agent Gateway (no code changes)

If your application already calls an LLM using an OpenAI-compatible client, point it at Highflame instead. Add one header and change the base URL. No other changes to your application code are required.

What changed:

  • base_url points to Highflame instead of the upstream provider

  • model uses the provider/model format so Highflame knows which provider to route to

  • x-highflame-api-key identifies your project and active policies

Highflame supports multiple providers in this format: openai/gpt-4o, anthropic/claude-sonnet-4-6, azure/my-deployment, gemini/gemini-2.0-flash, and more. See Integration Examples for the full list.


Path B — Highflame SDK (inline guardrails)

If you want explicit, per-step enforcement inside an agent workflow — guarding prompts, tool calls, and responses directly in code — use the SDK.

Install:

Make your first guarded request:

  • Safe traffic returns allow.

  • Risky traffic returns deny with the policy reason that triggered it.


Step 3 — Verify in Observatory

After making a request, open Highflame Studio and go to Observatory → Traces. You should see the request with:

  • Decision — allow, deny, redact, or monitor

  • Detector scores — which signals fired (injection score, PII flags, etc.)

  • Latency — time spent in Highflame vs. the upstream provider

  • Token usage — input and output token counts

If a policy blocked the request, it also appears in Observatory → Threats with the determining policy IDs and rejection reason.


Next steps

Where you go from here depends on what you are securing:

Goal
Start here

Understand the three integration patterns and when to use each

See more Gateway and SDK code examples

Secure AI coding assistants (Cursor, Claude Code)

Test an existing agent system for vulnerabilities

Validate model artifacts before deployment

Write tests for your Shield integration

Last updated