Quick Start
This guide gets you from zero to a working Highflame integration in under ten minutes. By the end you will have routed a request through Highflame, seen a decision in Observatory, and chosen the integration path that fits your stack.
Before you start
You will need:
A Highflame account — sign up at studio.highflame.ai
Python 3.10+ or Node.js 18+ if you plan to use the SDK path
An API key from your LLM provider (OpenAI, Anthropic, etc.) if you plan to use the Highflame Agent Gateway
Step 1 — Get your Highflame API key
Sign in to Highflame Studio
Navigate to Account → Developer Settings
Create an API key and export it:
export HIGHFLAME_API_KEY="hf_sk_..."Step 2 — Make your first protected request
Choose the path that matches how your application works. Both paths use the same detection pipeline, Cedar policies, and Observatory backend — they differ only in where integration happens.
Path A — Agent Gateway (no code changes)
If your application already calls an LLM using an OpenAI-compatible client, point it at Highflame instead. Add one header and change the base URL. No other changes to your application code are required.
What changed:
base_urlpoints to Highflame instead of the upstream providermodeluses theprovider/modelformat so Highflame knows which provider to route tox-highflame-api-keyidentifies your project and active policies
Highflame supports multiple providers in this format: openai/gpt-4o, anthropic/claude-sonnet-4-6, azure/my-deployment, gemini/gemini-2.0-flash, and more. See Integration Examples for the full list.
Path B — Highflame SDK (inline guardrails)
If you want explicit, per-step enforcement inside an agent workflow — guarding prompts, tool calls, and responses directly in code — use the SDK.
Install:
Make your first guarded request:
Safe traffic returns
allow.Risky traffic returns
denywith the policy reason that triggered it.
Step 3 — Verify in Observatory
After making a request, open Highflame Studio and go to Observatory → Traces. You should see the request with:
Decision — allow, deny, redact, or monitor
Detector scores — which signals fired (injection score, PII flags, etc.)
Latency — time spent in Highflame vs. the upstream provider
Token usage — input and output token counts
If a policy blocked the request, it also appears in Observatory → Threats with the determining policy IDs and rejection reason.
Next steps
Where you go from here depends on what you are securing:
Understand the three integration patterns and when to use each
See more Gateway and SDK code examples
Secure AI coding assistants (Cursor, Claude Code)
Test an existing agent system for vulnerabilities
Validate model artifacts before deployment
Write tests for your Shield integration
Last updated