H2O+ is An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps, which offers great flexibility to bridge various choices of offline and online learning methods, while also accounting for dynamics gaps between the real and simulation environment. Compared to H2O, H2O+ gets rid of over-conservative offline RL backbone and enjoys explorative benefits of simulation environments. Through extensive simulation and real-world robotics experiments, we demonstrate superior performance and flexibility over advanced cross-domain online and offline RL algorithms, details of which can be seen in our webpage.
To install the dependencies, run the command:
pip install -r requirements.txt
Add this repo directory to your PYTHONPATH
environment variable:
export PYTHONPATH="$PYTHONPATH:$(pwd)"
Practitioner can rewrite scripts about the task, environment, and training algorithms for Issac Sim wheel-legged robot at their use, according to the examples in SimpleSAC
.
We benchmark H2O+ and its baselines on MuJoCo simulation environment and D4RL datasets. To begin, enter the folder SimpleSAC
:
cd SimpleSAC
Then you can run H2O+ experiments using the following example commands.
python drh2o_main.py \
--env_list HalfCheetah-v2 \
--data_source medium_replay \
--unreal_dynamics gravity \
--variety_list 2.0
python drh2o_main.py \
--env_list Walker-v2 \
--data_source medium_replay \
--unreal_dynamics friction \
--variety_list 0.3
python drh2o_main.py \
--env_list HalfCheetah-v2 \
--data_source medium \
--variety_list 1.0 \
--joint_noise_std 1.0
You can resort to wandb to login your personal account with your wandb API key.
export WANDB_API_KEY=YOUR_WANDB_API_KEY
and run wandb online
to turn on the online syncronization.
If you are using H2O+ framework or code for your project development, please cite the following paper:
@inproceedings{
niu2025h2o+,
title={H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps},
author={Haoyi Niu and Tianying Ji and Bingqi Liu and Haocheng Zhao and Xiangyu Zhu and Jianying Zheng and Pengfei Huang and Guyue Zhou and Jianming HU and Xianyuan Zhan},
booktitle={IEEE International Conference on Robotics and Automation},
year={2025}
}