Threat Model

This page documents the PRX threat model -- the set of threats we consider, our security assumptions, and the mitigations in place.

Threat Categories

Threat: Adversarial content in user input or retrieved data manipulates the agent into performing unintended actions.

Mitigations:

Threat: The agent uses tools in unintended ways (e.g., reading sensitive files, making unauthorized network requests).

Mitigations:

Threat: Sensitive data from the local system is sent to external services via LLM context or tool calls.

Mitigations:

Threat: Malicious plugins or dependencies compromise the agent.

Mitigations:

If you discover a security vulnerability, please report it to [email protected].