Add integration tests for security analyzer #8774

Draft · wants to merge 3 commits into main

Conversation

@rbren (Collaborator) commented May 28, 2025

  • This change is worth documenting at https://docs.all-hands.dev/
  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

End-user friendly description of the problem this fixes or functionality this introduces.

N/A - This is a test improvement only.


Summarize what the PR does, explaining any non-trivial design decisions.

This PR adds integration tests for the security analyzer component to ensure it properly integrates with the event stream and correctly handles actions with different security risk levels. The tests verify:

  1. High-risk actions are properly identified and blocked
  2. Low-risk actions are allowed to proceed
  3. Medium-risk actions require confirmation when confirmation mode is enabled
  4. User rejection of actions works correctly
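
To make the shape of these checks concrete, here is a minimal, self-contained sketch of the logic they exercise. The names (`Risk`, `StubAnalyzer`, `gate_action`) are illustrative stand-ins for this description only, not the actual OpenHands classes or the test code in this PR:

```python
from enum import Enum


class Risk(Enum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3


class StubAnalyzer:
    """Stand-in analyzer that returns a fixed risk for every action."""

    def __init__(self, risk: Risk):
        self.risk = risk

    def security_risk(self, action: str) -> Risk:
        return self.risk


def gate_action(action: str, analyzer, confirmation_mode: bool,
                user_confirms: bool = True) -> bool:
    """Return True if the action may reach the runtime.

    HIGH risk is always blocked, MEDIUM risk needs user confirmation when
    confirmation mode is on, LOW risk always proceeds.
    """
    risk = analyzer.security_risk(action)
    if risk is Risk.HIGH:
        return False
    if risk is Risk.MEDIUM and confirmation_mode:
        return user_confirms
    return True


def test_high_risk_is_blocked():
    assert not gate_action('rm -rf /', StubAnalyzer(Risk.HIGH), confirmation_mode=True)


def test_low_risk_is_allowed():
    assert gate_action('ls', StubAnalyzer(Risk.LOW), confirmation_mode=True)


def test_medium_risk_requires_confirmation():
    analyzer = StubAnalyzer(Risk.MEDIUM)
    assert gate_action('pip install foo', analyzer, confirmation_mode=True, user_confirms=True)


def test_user_rejection_is_respected():
    analyzer = StubAnalyzer(Risk.MEDIUM)
    assert not gate_action('pip install foo', analyzer, confirmation_mode=True, user_confirms=False)
```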

Additionally, this PR adds three new security analyzers:

  1. PushoverSecurityAnalyzer: A simple analyzer that allows all actions by marking them as low risk
  2. BullySecurityAnalyzer: A simple analyzer that blocks all actions by marking them as high risk
  3. LLMSecurityAnalyzer: An analyzer that uses an LLM to evaluate actions for security risks

These new analyzers provide different security evaluation strategies:

  • The Pushover and Bully analyzers provide simple, predictable behavior for testing and can be useful in development environments where you want to either bypass security checks or enforce strict security policies.
  • The LLM-based analyzer provides a more sophisticated approach by leveraging language models to evaluate the potential risks of actions based on their content.
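
A rough sketch of how the two trivial analyzers and the LLM-backed one might be shaped, assuming a common base class with a single `security_risk` method. The class names match the PR description, but the bodies below are illustrative guesses, and `llm.complete` is a hypothetical client method rather than a specific OpenHands API:

```python
# Risk is the toy enum from the test sketch above.

class SecurityAnalyzer:
    """Illustrative base interface: map an action to a risk level."""

    def security_risk(self, action: str) -> Risk:
        raise NotImplementedError


class PushoverSecurityAnalyzer(SecurityAnalyzer):
    """Marks every action LOW risk, so everything is allowed through."""

    def security_risk(self, action: str) -> Risk:
        return Risk.LOW


class BullySecurityAnalyzer(SecurityAnalyzer):
    """Marks every action HIGH risk, so everything is blocked."""

    def security_risk(self, action: str) -> Risk:
        return Risk.HIGH


class LLMSecurityAnalyzer(SecurityAnalyzer):
    """Asks an injected LLM client to rate the action's risk."""

    def __init__(self, llm):
        self.llm = llm  # hypothetical client with a complete(prompt) -> str method

    def security_risk(self, action: str) -> Risk:
        verdict = self.llm.complete(
            'Rate the security risk of this action as LOW, MEDIUM, or HIGH:\n'
            f'{action}'
        )
        return Risk[verdict.strip().upper()]
```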

These tests help ensure that no action gets passed to the runtime until the security analyzer has evaluated it and determined it is safe to run.
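
Putting the two sketches above together, the property being asserted is that the runtime's execute path is only reachable after the analyzer has returned a verdict. For example, again reusing the illustrative names, with a toy runtime that just records what it ran:

```python
class RecordingRuntime:
    """Toy runtime that records every action it is asked to execute."""

    def __init__(self):
        self.executed = []

    def execute(self, action: str) -> None:
        self.executed.append(action)


runtime = RecordingRuntime()
for action, analyzer in [('rm -rf /', BullySecurityAnalyzer()),
                         ('ls', PushoverSecurityAnalyzer())]:
    # The analyzer scores the action before the runtime is ever reached.
    if gate_action(action, analyzer, confirmation_mode=True):
        runtime.execute(action)

assert runtime.executed == ['ls']  # only the low-risk action got through
```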


Link to any specific issues this addresses:

N/A


To run this PR locally, use the following command:

```
docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:3a85a35-nikolaik \
  --name openhands-app-3a85a35 \
  docker.all-hands.dev/all-hands-ai/openhands:3a85a35
```
