Question 1

How are rules authored?

Accepted Answer

Visual builder first. Each rule has a path/operator/value row for the condition, a dropdown for the action, and severity and reason fields. The UI shows a plain-English preview of what the rule does as you edit. An Advanced JSON tab is available for edge cases, but the vast majority of rules are authored in the simple view.

Question 2

Can rules reference tool arguments?

Accepted Answer

Yes. Path syntax supports nested args.<field> references (e.g. args.amount > 100). Matchers include exact match, not equals, in, contains, greater than, less than, and regex. Detectors (PII, prompt injection, jailbreak, secret) set boolean flags that rules can check via detector_flags.
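To make the path/operator/value model concrete, here is a minimal sketch of how such a condition could be evaluated against a tool call. It is not the product's implementation: the operator names (eq, neq, gt, and so on), the shape of the call dictionary, and the helpers resolve_path and matches are all assumptions made for illustration.

```python
import re
from typing import Any

# Hypothetical operator names; the product's real identifiers may differ.
OPERATORS = {
    "eq": lambda actual, expected: actual == expected,
    "neq": lambda actual, expected: actual != expected,
    "in": lambda actual, expected: actual in expected,
    "contains": lambda actual, expected: expected in actual,
    "gt": lambda actual, expected: actual > expected,
    "lt": lambda actual, expected: actual < expected,
    "regex": lambda actual, expected: re.search(expected, str(actual)) is not None,
}

_MISSING = object()  # sentinel for "path does not exist in this call"


def resolve_path(call: dict, path: str) -> Any:
    """Walk a dotted path such as 'args.amount' through nested dicts."""
    node: Any = call
    for key in path.split("."):
        if not isinstance(node, dict) or key not in node:
            return _MISSING
        node = node[key]
    return node


def matches(call: dict, path: str, operator: str, value: Any) -> bool:
    """Return True when the value at `path` satisfies the operator."""
    actual = resolve_path(call, path)
    if actual is _MISSING:
        return False  # a missing field never matches
    try:
        return bool(OPERATORS[operator](actual, value))
    except TypeError:
        return False  # incompatible types, e.g. comparing a string to a number


# Assumed call shape: tool name, args, and detector flags side by side.
call = {
    "tool": "refund",
    "args": {"amount": 250, "note": "VIP customer"},
    "detector_flags": {"pii": True},
}
print(matches(call, "args.amount", "gt", 100))
print(matches(call, "detector_flags.pii", "eq", True))
```

In this sketch a missing path or a type mismatch simply fails to match rather than raising, which keeps one malformed call from breaking evaluation. Whether the real engine behaves that way is not stated in this FAQ.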

Question 3

What's the 'test against last 100 calls' feature?

Accepted Answer

Before you ship a rule, click Test to see how it would have behaved on the last 100 real tool calls in this project. The results show matches, false positives, and the decision the rule would have emitted, so you catch overly broad rules before they block legitimate traffic.
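Conceptually, the Test button is a dry run of a draft rule over recorded calls. The sketch below shows that idea only: the dry_run function, the report fields, and the call shape are hypothetical, and deciding which matches are false positives is left to the person reviewing the list.

```python
from collections import Counter
from typing import Callable


def dry_run(
    rule_matches: Callable[[dict], bool],
    action: str,
    recent_calls: list[dict],
    limit: int = 100,
) -> dict:
    """Replay a draft rule over the most recent calls without enforcing it."""
    sample = recent_calls[-limit:] if limit > 0 else []
    matched = [call for call in sample if rule_matches(call)]
    return {
        "evaluated": len(sample),
        "matched": len(matched),
        "would_emit": action,  # the decision the rule would have produced
        "matched_tools": dict(Counter(call["tool"] for call in matched)),
        "matched_calls": matched,  # review these by hand for false positives
    }


# Hypothetical history: 150 refund calls with increasing amounts.
history = [{"tool": "refund", "args": {"amount": i}} for i in range(150)]
report = dry_run(lambda c: c["args"]["amount"] > 140, "block", history)
print(report["evaluated"], report["matched"], report["would_emit"])
```

A rule that matches a large share of the sample is a hint that its condition is too broad.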

Question 4

How does the Arden-style Activity Log work?

Accepted Answer

Every intercepted call is a row. Filter by decision, tool, agent, session, matched rule, reviewer, severity, or detector flag. Rows that matched zero rules show a dashed 'Create policy' button. Click it and a side-sheet opens with the rule builder pre-filled with the tool name and the call's arguments. Save, and the next matching call enforces the new rule.
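The pre-fill step can be pictured as turning a logged call into a draft rule. The following sketch assumes a rule is a list of path/operator/value conditions plus action, severity, and reason fields; the function draft_rule_from_call and every field name in it are illustrative, not the product's schema.

```python
def draft_rule_from_call(call: dict) -> dict:
    """Build an unsaved draft rule from an Activity Log row (hypothetical shape)."""
    # Start with a condition on the tool name.
    conditions = [{"path": "tool", "operator": "eq", "value": call["tool"]}]
    # Add one exact-match condition per argument; the author would loosen
    # or delete these in the builder before saving.
    for name, value in call.get("args", {}).items():
        conditions.append({"path": f"args.{name}", "operator": "eq", "value": value})
    return {"conditions": conditions, "action": None, "severity": None, "reason": ""}


row = {"tool": "send_email", "args": {"to": "ops@example.com", "bcc": ""}}
draft = draft_rule_from_call(row)
print(len(draft["conditions"]))
```

The action, severity, and reason stay empty in the draft because those are choices for the author to make in the builder.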

Question 5

Do detectors run automatically?

Accepted Answer

Yes. PII, secret, prompt-injection, and jailbreak detectors run on every tool call, whether or not you've authored a Guardrail rule. Detector hits surface either as flags you can reference in rule conditions or as blocks, controlled by the action_on_hit field on the DetectorRule.
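As a rough illustration of the flag-or-block behaviour, the sketch below models a DetectorRule with an action_on_hit field. Only those two names come from this FAQ. The accepted values ("flag" and "block"), the detector identifiers, and the apply_detectors helper are assumptions.

```python
from dataclasses import dataclass


@dataclass
class DetectorRule:
    detector: str        # assumed identifiers, e.g. "pii", "secret"
    action_on_hit: str   # assumed values: "flag" or "block"


def apply_detectors(hits: dict[str, bool], rules: list[DetectorRule]) -> tuple[dict[str, bool], bool]:
    """Return (detector_flags, blocked) for one tool call."""
    flags: dict[str, bool] = {}
    blocked = False
    for rule in rules:
        hit = hits.get(rule.detector, False)
        flags[rule.detector] = hit  # every detector's result becomes a flag
        if hit and rule.action_on_hit == "block":
            blocked = True  # a hit on a blocking detector stops the call
    return flags, blocked


rules = [DetectorRule("pii", "flag"), DetectorRule("secret", "block")]
flags, blocked = apply_detectors({"pii": True, "secret": False}, rules)
print(flags, blocked)
```

In this model a flagged hit does nothing on its own. It only matters once a rule condition reads it from detector_flags.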

Question 6

What happens on upstream error?

Accepted Answer

Each RuntimeKey has a fail_mode: closed (the default, which blocks on upstream error) or open (which allows on upstream error). Fail-closed is the safer default for tools that take real actions. Authentication errors always fail closed, regardless of fail_mode.
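The fail_mode behaviour can be summarised in a few lines. This is a sketch under assumptions: the exception classes UpstreamError and AuthError, the decide function, and the "allow"/"block" strings are placeholders chosen for the example, not the product's client API.

```python
from typing import Callable


class UpstreamError(Exception):
    """Placeholder for a failure reaching the policy service."""


class AuthError(Exception):
    """Placeholder for an authentication failure."""


def decide(evaluate: Callable[[dict], str], call: dict, fail_mode: str = "closed") -> str:
    """Return the policy decision, applying fail_mode when evaluation fails."""
    try:
        return evaluate(call)
    except AuthError:
        return "block"  # authentication errors always fail closed
    except UpstreamError:
        return "allow" if fail_mode == "open" else "block"


def unreachable(call: dict) -> str:
    # Simulates the upstream service being down.
    raise UpstreamError("policy service timed out")


print(decide(unreachable, {"tool": "refund"}))                    # block
print(decide(unreachable, {"tool": "refund"}, fail_mode="open"))  # allow
```

Anything other than an explicit "open" is treated as closed here, so a misconfigured value errs on the side of blocking.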

Policy enforcement at runtime. Rule authoring without DSL drama.

Four decisions. One runtime.

A rule builder that explains itself

Preset chips

Plain-English preview

Test against last 100

Activity → Policy → Activity

Detectors, pre-flight

Guardrails FAQs

Frequently Asked Questions

Stop your agent from doing the wrong thing. Before it does it.