DRL with Population Coded Spiking Neural Network

This package is the PyTorch implementation of the Population-coded Spiking Actor Network (PopSAN) that integrates with both on-policy (PPO) and off-policy (DDPG, TD3, SAC) DRL algorithms for learning optimal and energy-efficient continuous control policies.

The paper has been accepted at CoRL 2020. The arXiv preprint is available here.

New: We have created a new GitHub repo to demonstrate the online runtime interaction with Loihi. If you are interested in using Loihi for real-time robot control, please check it out.

Citation

Guangzhi Tang, Neelesh Kumar, Raymond Yoo, and Konstantinos P. Michmizos. "Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control." 4th Conference on Robot Learning (CoRL 2020), Cambridge MA. USA.

@inproceedings{tang2020deep,
  title={Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control},
  author={Tang, Guangzhi and Kumar, Neelesh and Yoo, Raymond and Michmizos, Konstantinos P},
  booktitle={4th Conference on Robot Learning (CoRL 2020)},
  pages={1--10},
  year={2020}
}

Software Installation

Ubuntu 16.04
Python 3.5.2
MuJoCo 2.0
OpenAI GYM 0.15.3 (with mujoco_py 2.0.2.5)
PyTorch 1.2 (with CUDA 10.0 and tensorboard 2.1)
NxSDK 0.9

A CUDA enabled GPU is not required but preferred for training. The results in the paper are generated from models trained using both Nvidia Tesla K40c and Nvidia GeForce RTX 2080Ti.

Intel's neuromorphic library NxSDK is only required for SNN deployment on the Loihi neuromorphic chip. If you are interested in deploying the trained SNN on Loihi, please contact the Intel Neuromorphic Lab.

We have provided the requirements.txt for the python environment without NxSDK. In addition, we recommend setting up the environment using virtualenv.

Example Usage

1. Training PopSAN

To train PopSAN using TD3 algorithm, execute the following commands:

cd <Dir>/<Project Name>/popsan_drl/popsan_td3
python td3_cuda_norm.py --env HalfCheetah-v3

This will automatically train 1 million steps and save the trained models. The steps to train DDPG, SAC, and PPO are the same as above.

2. Deploy the trained PopSAN on Loihi

To evaluate PopSAN realization on Loihi, execute the following commands to start testing:

cd <Dir>/<Project Name>/loihi_realization
python test_loihi.py

This will test the 10 trained models on Loihi. To run the code correctly, data_dir value in the script needs to be changed to the folder that stores all trained models.

Acknowledgment

This work is supported by Intel's Neuromorphic Research Community Grant Award. Part of our code, including DRL training and PPO multiprocessing environments, were built upon OpenAI Spinning Up, OpenAI Baselines, and Stable Baselines. We would like to thank their contribution to the community.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
loihi_realization		loihi_realization
popsan_drl		popsan_drl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DRL with Population Coded Spiking Neural Network

Citation

Software Installation

Example Usage

1. Training PopSAN

2. Deploy the trained PopSAN on Loihi

Acknowledgment

About

Releases

Packages

Languages

License

treestreamymw/pop-spiking-deep-rl

Folders and files

Latest commit

History

Repository files navigation

DRL with Population Coded Spiking Neural Network

Citation

Software Installation

Example Usage

1. Training PopSAN

2. Deploy the trained PopSAN on Loihi

Acknowledgment

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages