feat(sandbox)🔒: Add secure script execution for LLM agents using Docker #166

ericmjl · 2025-01-12T15:02:18Z

Implemented a secure sandbox environment for executing agent-generated Python scripts.
Introduced Docker-based isolation with resource and security constraints.
Added metadata handling for script dependencies and execution context.
Included comprehensive tests for the sandbox functionality.
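
As a rough illustration of what Docker-based isolation with resource and security constraints can look like, the sketch below assembles a `docker run` command line. The function name, image, mount point, and limit values are assumptions for illustration and are not taken from this PR; only standard `docker run` flags are used. Networking is left enabled, since a later commit message in this PR notes that internet access is allowed from within the container.

```python
from pathlib import Path


def build_docker_command(
    script_path: Path,
    image: str = "python:3.12-slim",  # assumed image, not from the PR
    memory: str = "512m",  # assumed memory limit
    cpus: float = 1.0,  # assumed CPU limit
) -> list[str]:
    """Assemble a `docker run` invocation with common hardening flags."""
    script_path = Path(script_path).resolve()
    return [
        "docker", "run",
        "--rm",                                 # remove the container afterwards
        "--memory", memory,                     # cap RAM
        "--cpus", str(cpus),                    # cap CPU share
        "--pids-limit", "128",                  # cap the number of processes
        "--cap-drop", "ALL",                    # drop all Linux capabilities
        "--security-opt", "no-new-privileges",  # block privilege escalation
        "--read-only",                          # read-only root filesystem
        "--tmpfs", "/tmp",                      # writable scratch space only
        "-v", f"{script_path.parent}:/workspace:ro",
        "--workdir", "/workspace",
        image,
        "python", script_path.name,             # invoke the interpreter explicitly
    ]


print(" ".join(build_docker_command(Path("script.py"))))
```

A setup that installs dependencies at run time would also need a writable cache location, which this sketch leaves out.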


github-actions · 2025-01-12T15:03:33Z

PR Preview Action v1.4.8
🚀 Deployed preview to https://ericmjl.github.io/llamabot/pr-preview/pr-166/
on branch `gh-pages` at 2025-01-28 14:02 UTC


…t handling
- Added new dependencies to support enhanced functionality.
- Refactored argument handling in the ToolToCall class for better clarity and maintainability.
- Updated the pyproject.toml and lock files to reflect the latest dependency versions.

…used parameters
- Removed the 'purpose' parameter from the 'write_and_execute_script' function and related metadata.
- Added descriptions to Pydantic model fields for better documentation.
- Adjusted caching logic to skip specific tool results.

…ument structure
- Modified the test_tool_to_call_model test to use the updated ToolArguments and CachedArguments structures.
- Ensured the test validates the new list-based argument and cached result formats.


…handling for improved clarity and functionality
- Revised the `write_and_execute_script` function to accept dependencies as a comma-separated string.
- Enhanced the `ScriptExecutor` class to provide detailed execution results including stdout, stderr, and status.
- Improved metadata handling and script writing for better maintainability.
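
To make the two ideas in this commit message concrete, here is a minimal sketch of splitting a comma-separated dependency string and returning stdout, stderr, and status from a run. The helper names `parse_dependencies` and `execute` are hypothetical, the code runs a local subprocess rather than a container, and treating "status" as the process exit code is an assumption.

```python
import subprocess
import sys


def parse_dependencies(dependencies: str) -> list[str]:
    """Split 'pandas, numpy' into ['pandas', 'numpy'], ignoring empty entries."""
    return [dep.strip() for dep in dependencies.split(",") if dep.strip()]


def execute(code: str, timeout: int = 30) -> dict[str, object]:
    """Run code in a child interpreter and report stdout, stderr, and status."""
    proc = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    return {
        "stdout": proc.stdout,
        "stderr": proc.stderr,
        "status": proc.returncode,  # assumption: status means the exit code
    }


print(parse_dependencies("pandas, numpy"))
print(execute("print(1 + 1)"))
```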

…cution improvements
- Added a new tool for internet search using DuckDuckGo API.
- Improved script execution by modifying Python version handling and result processing.
- Refactored memory caching logic to exclude specific tool results.
- Enhanced sandbox environment with increased memory and dedicated cache space.
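
One way caching logic can exclude specific tool results is a simple name-based skip list, sketched below. The names `SKIP_CACHE_FOR`, `remember`, and `uncached_tool` are hypothetical, and which tools this PR actually excludes is not stated in the commit message.

```python
import hashlib
from typing import Optional

# Hypothetical skip list; the tools excluded in the PR are not named here.
SKIP_CACHE_FOR = {"uncached_tool"}


def remember(memory: dict[str, object], tool_name: str, result: object) -> Optional[str]:
    """Store a tool result under a short hash key unless the tool is excluded."""
    if tool_name in SKIP_CACHE_FOR:
        return None  # excluded tools never enter the cache
    key = hashlib.sha256(str(result).encode("utf-8")).hexdigest()[:8]
    memory[key] = result
    return key


memory: dict[str, object] = {}
remember(memory, "uncached_tool", "ignored")
remember(memory, "other_tool", "kept")
print(len(memory))
```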


- Updated the dependency version for 'litellm' in both 'pixi.lock' and 'pyproject.toml'.
- Added a new notebook 'agentbot.ipynb' demonstrating the usage of the 'AgentBot' class.
- Enhanced 'structuredbot_json.ipynb' with additional examples and model experiments.


- Updated typing for volume configuration to use dict instead of Mapping.
- Added RuntimeError raising for script execution failures with detailed error messages.
- Modified test cases to validate new error handling and output parsing.
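
The error-handling change described above might look roughly like the following sketch, which raises `RuntimeError` with the exit code and captured output when a script fails. The function name `run_or_raise` and the message format are assumptions, and a local subprocess stands in for the container.

```python
import subprocess
import sys


def run_or_raise(code: str, timeout: int = 30) -> str:
    """Run code in a child interpreter; raise RuntimeError with details on failure."""
    proc = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    if proc.returncode != 0:
        # Include both streams so the caller can see why the script failed.
        raise RuntimeError(
            f"Script failed with exit code {proc.returncode}.\n"
            f"stdout:\n{proc.stdout}\nstderr:\n{proc.stderr}"
        )
    return proc.stdout


print(run_or_raise("print('ok')").strip())
```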


ericmjl and others added 15 commits January 12, 2025 14:39

fix(sandbox)🐛: Ensure script execution uses Python explicitly in Docker container.

77e8b09

- Modified the container run command to use 'python' explicitly for script execution.
- This change prevents execution issues related to script file permissions or shebang configurations.
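
The reasoning behind this fix can be shown outside Docker: a script with no shebang and no executable bit still runs when it is passed to the interpreter as an argument. The helper name below is hypothetical, and the sketch uses a local subprocess instead of a container.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_with_explicit_interpreter(script_path: Path) -> subprocess.CompletedProcess:
    """Run a script by handing it to the interpreter instead of executing the file."""
    return subprocess.run(
        [sys.executable, str(script_path)],
        capture_output=True,
        text=True,
        timeout=30,
    )


with tempfile.TemporaryDirectory() as tmpdir:
    script = Path(tmpdir) / "hello.py"
    script.write_text('print("hello from script")\n')  # no shebang line
    script.chmod(0o644)  # readable, but not executable
    result = run_with_explicit_interpreter(script)

print(result.stdout.strip())
```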

fix(hashing)🔧: Limit hash output to 8 characters for consistency

64880dc

- Updated the `hash_result` function to return only the first 8 characters of the SHA256 hash.
- Modified test cases to validate the new hash length constraint.
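
A minimal sketch of the truncation described above is shown here. The signature and the choice to stringify the input before hashing are assumptions; the PR's actual `hash_result` may serialize results differently.

```python
import hashlib


def hash_result(result: object) -> str:
    """Return the first 8 hex characters of the SHA256 digest of `result`."""
    # Assumption: the result is converted to a string before hashing.
    digest = hashlib.sha256(str(result).encode("utf-8")).hexdigest()
    return digest[:8]


print(hash_result("abc"))
```

Eight hex characters carry 32 bits, which keeps keys short and readable at the cost of a higher collision chance than the full digest.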

Update dependencies and Ollama model names.

c6e4b54

refactor(agentbot)🛠️: Remove unused function from the AgentBot class.

1c9ae88

- Deleted the 'write_and_execute_script' function from the functions list.
- Ensured the functionality remains intact without the removed function.

Merge branch 'main' into agents-write-tools

be06f55

chore(dependencies)🔒: Update lockfile for dependency integrity.

fb91c70

- Updated the hash for the llamabot package in the lockfile.

fix(tests)🧪: Corrected the class name in test cases for consistency.

209dc6e

- Replaced 'ToolArguments' with 'ToolArgument' in test cases.
- Ensured the test cases align with the updated class definitions.

test(sandbox)🧪: Remove redundant test for script execution with dependencies.

c537fbd

- Deleted the test_script_with_dependencies function from test_sandbox.py.
- This was deemed unnecessary because we do allow access to the internet from within the container.

ericmjl merged commit b73e2b0 into main Jan 28, 2025
11 checks passed

ericmjl deleted the agents-write-tools branch January 28, 2025 23:24
