Advanced Detection

The advanced_detection profile is an add-on that layers ML-powered, type-specific detection on top of any other profile. It does not replace a deployment profile — it extends one. Apply it alongside code_agent, data_pipeline, multi_agent, or any other profile when the baseline pattern-matching detection is not sufficient for your security requirements.


When to use advanced detection

The default policies detect secrets and PII using pattern matching — regular expressions and known formats. This covers common cases (generic API key patterns, SSN format, credit card Luhn checks) but has two limitations:

  1. Pattern coverage gaps. New secret formats, proprietary credential structures, or atypical PII representations may not be caught by patterns.

  2. Bulk detection. The defaults do not distinguish between a single incidental PII match and a response that contains dozens of personal records (a data dump).

The advanced_detection profile addresses both:

  • Type-specific secret blocking uses format-aware detection for high-value credential types rather than generic patterns

  • ML classifier-based PII operates at a confidence threshold rather than a pattern match, catching atypical representations

  • Bulk PII detection flags responses with 3 or more PII matches as a potential data dump

Use advanced_detection for:

  • Financial services, healthcare, or other regulated industries

  • Any deployment handling sensitive personal data at scale

  • High-security environments where credential leakage has severe consequences

  • Data pipelines where the content corpus is large and varied enough that pattern-based detection has meaningful gaps


Profile files


secrets.cedar — Type-specific credential blocking

Blocks secrets by specific format rather than relying on generic patterns. Each rule targets a distinct credential type.

High-risk secret types:

Credential type
Detection basis

AWS IAM access keys

AKIA... key ID format + secret key structure

GCP service account JSON

Service account JSON structure with private key

Azure client secrets / certificates

Azure credential format detection

GitHub personal access tokens

ghp_, gho_, ghu_, ghs_ prefix formats

SSH private keys

PEM header patterns (BEGIN RSA PRIVATE KEY, etc.)

Database connection strings

Connection URI formats for PostgreSQL, MySQL, MongoDB, Redis

Token types:

Bearer tokens, JWTs, and OAuth tokens/secrets are blocked when their specific format is detected. This is distinct from the default secrets policy, which uses broader contains_secrets signal — the type-specific rules fire even when the generic signal does not.


pii.cedar — ML classifier-based PII detection

Extends the default pattern-based PII detection with ML classifier output.

Bulk PII detection:

When 3 or more distinct PII instances are detected in a single response, it is treated as a data dump regardless of the PII types involved. This catches cases where individual matches might be below the blocking threshold but the aggregate volume indicates something is wrong.

ML confidence threshold:

Blocks when the ML PII classifier confidence is above 80, independent of the pattern-match result. This catches atypical PII representations that the regex patterns miss.

File operation PII block:

Blocks file reads and writes where PII is detected in the content. This is an addition to the default policy, which only evaluates prompt content.


threat_severity.cedar — Severity catch-all

Blocks any content flagged as Critical severity by any detector, regardless of the specific detection category. This ensures that future detectors — including custom detectors added via webhooks — automatically contribute to blocking at the critical tier without requiring a new policy rule.


Applying the profile

advanced_detection is always applied alongside a deployment profile, not on its own.

Common pairings:

Deployment profile
Reason for pairing

data_pipeline

Large/varied data corpus with higher gap risk in pattern detection

code_agent

Enterprise environments with AWS/GCP/Azure credential exposure risk

multi_agent

High-security orchestrated deployments

chat_assistant

Regulated industries with strict PII output requirements


Last updated