I built an OpenAI compatible firewall for AI agents. Try to break it.
A developer created an OpenAI-compatible firewall for AI agents called Arc Gate. It analyzes entire sessions rather than individual prompts, tracking authority and escalating restrictions based on user behavior. The tool aims to prevent prompt injection attacks by monitoring multi-turn interactions. You can test the firewall on the project’s GitHub page.
- Analyzes entire sessions, not just individual prompts.
- Escalates restrictions based on user behavior across turns.
- Aims to prevent prompt injection attacks in multi-turn interactions.