This repository presents a multi-agent reinforcement learning system for Unity’s Soccer Twos environment, built on the ML-Agents MA-POCA trainer (Multi-Agent Posthumous Credit Assignment), a PPO-based multi-agent algorithm. The project explores optimizing agent performance through sensor modifications, observation memory, and reward system enhancements, and analyzes the trade-offs between computational efficiency and learning effectiveness.
This study focuses on training AI agents in a competitive soccer simulation, evaluating different reinforcement learning configurations to optimize ELO performance, training efficiency, and resource utilization.
Key modifications include:
- Forward-Focused Ray-Cast Sensor: Restricts agents’ perception to realistic forward-facing observations, eliminating unrealistic backward vision (see the C# sketch after this list).
- Observation Memory Mechanism: Introduces short-term memory by retaining recent observations, intended to improve decision-making under partial observability.
- Custom Reward System: Implements a structured goal-oriented reward system to encourage teamwork and competitive play.
- Hyperparameter Optimization: Experiments with learning rate adjustments, network size reductions, and concurrent environment scaling.
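The first three modifications live in the Unity project’s C# agent code. The sketch below is illustrative only: the class, field names, and numeric values are hypothetical, but the `RayPerceptionSensorComponent3D` properties and the `Agent.AddReward` call are the standard ML-Agents C# API one would use for a forward-only sensor, stacked observations, and goal-based rewards.

```csharp
// Hypothetical sketch, not the repository's actual implementation.
// Assumes the Unity ML-Agents package (com.unity.ml-agents).
using Unity.MLAgents;
using Unity.MLAgents.Sensors;
using UnityEngine;

public class ForwardVisionSoccerAgent : Agent
{
    [SerializeField] RayPerceptionSensorComponent3D raySensor; // assigned in the Inspector

    void Awake()
    {
        // Forward-focused ray cast: 90 degrees or less keeps every ray in the
        // forward half-plane, removing the default backward-facing rays.
        // (Sensor settings must be applied before ML-Agents builds its sensors,
        // which is why they are set here; normally they are set in the Inspector.)
        raySensor.MaxRayDegrees = 80f;
        raySensor.RaysPerDirection = 5;
        raySensor.RayLength = 20f;

        // Observation memory: stacking the last few ray observations gives the
        // agent a short history of what it has recently seen.
        raySensor.ObservationStacks = 3;
    }

    // Custom reward hooks, called by a (hypothetical) match controller.
    public void OnTeamScored()     { AddReward(+1.0f); }  // reward scoring a goal
    public void OnOpponentScored() { AddReward(-1.0f); }  // penalize conceding
    public void OnBallTouched()    { AddReward(+0.05f); } // small shaping bonus for engaging the ball
}
```

The ±1 goal rewards encourage competitive play and the small ball-touch bonus is a common shaping term for soccer agents; the exact values used in the repository’s custom reward system may differ.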
To explore the project:
- Open the project in Unity.
- Navigate to `Project/Assets/ML-Agents` and use the pre-configured training environments.
- Adjust training parameters in the `config` files (an example configuration is sketched below) and execute training runs through the Unity ML-Agents trainer.
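The `config` files are standard ML-Agents trainer YAML. The snippet below shows a plausible shape for a POCA Soccer Twos configuration: the keys are real ML-Agents options, but the behavior name and values are illustrative rather than this repository’s exact settings.

```yaml
behaviors:
  SoccerTwos:
    trainer_type: poca
    hyperparameters:
      batch_size: 2048
      buffer_size: 20480
      learning_rate: 0.0003
      learning_rate_schedule: constant
    network_settings:
      hidden_units: 512
      num_layers: 2
      normalize: false
    reward_signals:
      extrinsic:
        gamma: 0.99
        strength: 1.0
    self_play:
      save_steps: 50000
      team_change: 200000
      swap_steps: 2000
      window: 10
      play_against_latest_model_ratio: 0.5
      initial_elo: 1200.0
    max_steps: 50000000
    time_horizon: 1000
    summary_freq: 10000
```

A run is then started with `mlagents-learn <config file> --run-id=<run name>` and pressing Play in the Unity Editor (or pointing `--env` at a built environment).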
Five different configurations were tested to evaluate training speed, ELO scores, and computational resource usage:
| Configuration | ELO Score | Training Time (s) |
|---|---|---|
| Default POCA | 1547 | 24026 |
| Increased Learning Rate | 1582 | 39091 |
| Enhanced Memory Mechanism | 1440 | 39224 |
| Reduced Network Size | 1471 | 14271 |
| Increased Concurrent Environments | 1524 | 28844 |
- Increased Learning Rate yielded the highest ELO score but significantly increased training time.
- Reduced Network Size provided the fastest training with moderate ELO performance.
- Observation Memory did not improve performance (its ELO fell below the default’s) and incurred the highest computational cost.
- Scaling Concurrent Environments improved efficiency while maintaining stable performance. (The trainer settings behind these variants are sketched below.)
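For reference, the tested variants correspond to standard ML-Agents settings. The sketch below shows where each knob lives; the values are placeholders, not the runs’ actual settings.

```yaml
behaviors:
  SoccerTwos:
    hyperparameters:
      learning_rate: 0.001      # "Increased Learning Rate" variant (baseline sketch above used 0.0003)
    network_settings:
      hidden_units: 256         # "Reduced Network Size" variant
      num_layers: 1
      memory:                   # one way to realize observation memory: a recurrent (LSTM) policy
        memory_size: 128
        sequence_length: 64
# "Increased Concurrent Environments" is a command-line option rather than a config key:
#   mlagents-learn config/SoccerTwos.yaml --run-id=soccer_envs --num-envs=4
```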
Performance was measured with the Unity Profiler (CPU/GPU load, memory usage, and frame rates). The Reduced Network Size configuration was the most efficient, while Enhanced Memory had the highest computational overhead.
- Kaan Başaran
- Antoni Rodawski
- Ahmed Metwally
- Alex Andreescu
- Bati Gozen
- Sitanshu Puranum
- Zhengzhong Carrey Huang
- Unity ML-Agents Toolkit: https://github.com/Unity-Technologies/ml-agents
- Unity Profiler Documentation: https://docs.unity3d.com/Manual/Profiler.html
- Unity: A General Platform for Intelligent Agents (ML-Agents paper): https://arxiv.org/abs/1809.02627