Architecture

Javelin RedTeam is built with a modular, scalable architecture designed for enterprise-grade AI security testing. The system leverages distributed agents, queue-based processing, and comprehensive data management to deliver reliable and efficient red teaming capabilities.
System Overview
The Javelin RedTeam platform consists of several interconnected components working together to provide comprehensive AI security assessments:
Core Components
Javelin-Admin Service
Javelin-Admin service exposes APIs for all operations pertaining to Javelin-Redteam. All reads are handled directly by Javelin-Admin, whereas all writes are proxied to Javelin-Redteam fastapi server.
FastAPI Server
The API server provides the primary interface for red teaming operations:
Asynchronous Processing: Non-blocking API calls with immediate response
Request Validation: Input validation and sanitization
Health Monitoring: System status and performance metrics
Redis Queue System
Distributed task processing for scalable operations:
Asynchronous Execution: Decouple API requests from execution
Horizontal Scaling: Add workers to increase throughput
Fault Tolerance: Retry failed tasks and handle worker failures
PostgreSQL Database
Enterprise-grade data persistence:
Scan Management: Track scan status, progress, and results
Vector Storage: Store and retrieve attack prompt embeddings (pgvector)
Audit Trail: Complete history of security assessments
Report Generation: Structured data for comprehensive reporting
Agent Smith (Orchestrator)
The central coordinator managing the entire red teaming workflow:
Responsibilities:
Coordinate specialized agents
Manage scan lifecycle
Handle error recovery
Monitor progress
Generate final reports
Specialized Agents
Planner Agent
Purpose: Strategic planning and attack orchestration
Capabilities:
Analyze target application characteristics
Generate comprehensive attack strategies
Output: Detailed attack plan with category priorities and execution strategy
Generator Agent
Purpose: Create sophisticated attack prompts
Process:
Base Prompt Retrieval: Fetch relevant prompts from vector database
Template Processing: Fill in contextual variables and placeholders
Engine Enhancement: Apply attack enhancement engines
Quality Validation: Ensure prompt effectiveness and relevance
Engine Integration: Works with all available engines to transform base prompts into advanced attacks
Attack Executor
Purpose: Execute attacks against target applications
Features:
Multi-Protocol Support: HTTP, gRPC, WebSocket, direct model calls
Concurrent Execution: Parallel attack execution with rate limiting
Response Collection: Capture and normalize target responses
Error Handling: Robust error handling and retry mechanisms
Attack Evaluator
Purpose: Analyze responses for vulnerabilities
Evaluation Process:
Response Analysis: Parse and structure target responses
LLM Judge Evaluation: Use specialized models to assess vulnerabilities
Severity Classification: Assign severity levels (Critical, High, Medium, Low)
Evidence Collection: Gather supporting evidence for findings
Reporter
Purpose: Generate comprehensive security reports
Report Components:
Executive Summary: High-level findings and risk assessment
Vulnerability Details: Specific issues with evidence and examples
Attack Vectors: Detailed attack scenarios and exploitation methods
Remediation Guidance: Actionable recommendations and best practices
Compliance Mapping: OWASP, NIST, and regulatory alignment
Attack Generation
Please see the engine overview section for details regarding Attack Generation in Javelin-Redteam
Vector Database Integration
Purpose
Store and retrieve attack prompts efficiently using semantic similarity and other filters The overall workflow looks like below:
Indexing: Convert attack prompts to vector embeddings
Categorization: Organize by vulnerability types and categories
Retrieval: Find relevant prompts based on target description
Enhancement: Apply engines to transform base prompts
Scalability & Performance
Horizontal Scaling
Worker Processes: Scale by adding more worker instances
Database Sharding: Distribute data across multiple database instances
Load Balancing: Distribute API requests across multiple server instances
Queue Partitioning: Separate queues for different scan types
Performance Optimizations
Connection Pooling: Efficient database connection management
Caching: Redis caching for frequently accessed data
Batch Processing: Group similar operations for efficiency
Async Operations: Non-blocking I/O for improved throughput
Resource Management
framework_config.yaml config file allows us to specify and control many aspects of resource requirements by the Javelin-Redteam framework
Security & Reliability
Security Measures
API Authentication: Bearer token authentication for API access
Input Validation: Comprehensive input sanitization and validation
Audit Logging: Complete audit trail of all operations
Secure Communication: TLS encryption for all network communications
Reliability Features
Health Checks: Continuous monitoring of system components
Data Backup: Regular database backups and recovery procedures
Error Recovery: Automatic retry mechanisms with exponential backoff (COMING SOON)
Monitoring & Observability
Metrics Collection: Performance and usage metrics
Log Aggregation: Centralized logging for debugging and analysis
Alert Systems: Automated alerts for system issues
Dashboard: Real-time system status and performance dashboards
This architecture enables Javelin RedTeam to provide enterprise-grade AI security testing while maintaining flexibility, scalability, and reliability across diverse deployment environments.
Last updated