5-Minute AI Agent Risk Score

Score one AI workflow before launch. The tool runs only in this page, does not save or send data, and does not need secrets, customer records, private screenshots, or payment details.

1. Tool and side-effect boundary

What can the agent do without a human explicitly approving the exact action?

2. Data exposure

What can appear in the context window, retrieval results, memory, logs, or tool output?

3. Untrusted instructions

Can outside content tell the model what to do?

4. Human approval and rollback

Can someone stop, inspect, and undo the risky path?

5. Launch evidence

What proof exists that the risky cases were tested?