Safety & Quality
Policy violations, eval scores, and human reviews
🛡️
Safety Dashboard
Safety flags, policy violations, factuality scores, and the review queue will be shown here.
Track PII detections, jailbreak attempts, hallucinations, and eval metrics.