Rules
Rules are filters for your AI Firewall. Each one uses a rigorously tested machine-learning model to scan input prompts and model outputs for malicious or otherwise high-risk material. There are different Rule Types for inputs and outputs with some overlap. There are 24 Rule Types as or right now:
Input Rule Types:
Prompt Anonymization
Prompt Injection Detection
Ban Code
Ban Competitors
Ban Topics
Ban Substrings
Code Detection
Gibberish Detection
Invisible Text Detection
Language Detection
Regex Detection
Secrets Detection
Sentiment Filter
Token Limiter
Toxicity Filter
Output Rule Types:
Ban Competitors
Ban Topics
Ban Substrings
Bias Detection
Code Detection
Prompt Deanonymization
JSON Validator
Language Detection
Language Same in Output
Malicious URL Detector
No Refusal Filter
Reading Time Filter
Fact Check
Gibberish Detection
Regex Detection
Relevance Filter
Sensitive Info Detection
Sentiment Filter
Toxicity Filter
URL Reachability Filter
Last updated