Template library
15 starter policies covering OWASP LLM Top 10, EU AI Act, GDPR, industry regulations, shadow AI, and content safety. Clone to start observing — admins decide when to promote to Strict.
PCI-DSS Cardholder Data Block
Blocks credit card numbers (PAN), CVV, and expiry dates from entering or leaving LLM systems.
PHI / Healthcare Data Protection
Detects and blocks Protected Health Information (PHI) — medical record numbers, diagnoses, treatments, identifiers — unless system has BAA in place.
Prompt Injection Defense
Detects and blocks attempts to override system instructions through user input, including jailbreak patterns, role escape, and instruction smuggling.
Special Category Data Block
Blocks processing of GDPR Art. 9 special category data (health, racial origin, religion, biometric) unless explicit lawful basis is established.
Agent Tool Restriction
Restricts which tools an agent can invoke based on user role and application context. Prevents agents from overstepping their authorised scope.
Attorney-Client Privilege Protection
Detects attorney-client privileged communication and blocks transmission to non-privileged AI systems.
GDPR Data Minimization
Detects when prompts request more personal data than necessary for the stated purpose. Flags for review.
PII Input Redaction
Strips PII from user prompts before they reach the model, preventing accidental exposure to third-party LLM providers.
PII Output Prevention
Scans model responses for personally identifiable information (PII) — names, emails, phone numbers, national IDs — and blocks or redacts before display.
System Prompt Leakage
Detects when model responses include verbatim or near-verbatim system prompt text, blocking exposure of proprietary instructions.
Art. 50 AI Disclosure Requirement
Ensures user-facing AI systems disclose their AI nature in initial responses. Detects missing disclosures and flags for review.
EU AI Act Art. 15 Audit Log
Mandatory audit logging for High-Risk AI systems. Captures every input, output, and policy decision with cryptographic chain-of-custody.
Output Toxicity Filter
Detects toxic, harmful, hateful, or harassing content in model responses and blocks before display.
Shadow AI Request Detection
Flags requests routed to AI services not in the approved registry. Helps catch shadow AI before it becomes a compliance gap.
Unbounded Consumption Guard
Throttles requests per user and per application to prevent denial-of-wallet attacks and runaway costs.