Skip to content
/ H2Oplus Public
forked from t6-thu/H2Oplus

[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.

Notifications You must be signed in to change notification settings

AIR-DI/H2Oplus

This branch is up to date with t6-thu/H2Oplus:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Feb 3, 2025
ea9b3c3 · Feb 3, 2025

History

6 Commits
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025
Feb 3, 2025

Repository files navigation

H2O+

H2O+ is An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps, which offers great flexibility to bridge various choices of offline and online learning methods, while also accounting for dynamics gaps between the real and simulation environment. Compared to H2O, H2O+ gets rid of over-conservative offline RL backbone and enjoys explorative benefits of simulation environments. Through extensive simulation and real-world robotics experiments, we demonstrate superior performance and flexibility over advanced cross-domain online and offline RL algorithms, details of which can be seen in our webpage.

Installation and Setups

To install the dependencies, run the command:

    pip install -r requirements.txt

Add this repo directory to your PYTHONPATH environment variable:

    export PYTHONPATH="$PYTHONPATH:$(pwd)"

Run Wheel-legged Robot Experiments with Issac Sim

Practitioner can rewrite scripts about the task, environment, and training algorithms for Issac Sim wheel-legged robot at their use, according to the examples in SimpleSAC.

Run Benchmark Experiments

We benchmark H2O+ and its baselines on MuJoCo simulation environment and D4RL datasets. To begin, enter the folder SimpleSAC:

    cd SimpleSAC

Then you can run H2O+ experiments using the following example commands.

Simulated in HalfCheetah-v2 with 2x gravity and Medium Replay dataset

    python drh2o_main.py \
        --env_list HalfCheetah-v2 \
        --data_source medium_replay \
        --unreal_dynamics gravity \
        --variety_list 2.0 

Simulated in Walker-v2 with .3x friction and Medium Replay dataset

    python drh2o_main.py \
        --env_list Walker-v2 \
        --data_source medium_replay \
        --unreal_dynamics friction \
        --variety_list 0.3 

Simulated in HalfCheetah-v2 with joint noise N(0,1) and Medium dataset

    python drh2o_main.py \
        --env_list HalfCheetah-v2 \
        --data_source medium \
        --variety_list 1.0 \
        --joint_noise_std 1.0 

Visualization of Learning Curves

You can resort to wandb to login your personal account with your wandb API key.

    export WANDB_API_KEY=YOUR_WANDB_API_KEY

and run wandb online to turn on the online syncronization.

Citation

If you are using H2O+ framework or code for your project development, please cite the following paper:

@inproceedings{
    niu2025h2o+,
    title={H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps},
    author={Haoyi Niu and Tianying Ji and Bingqi Liu and Haocheng Zhao and Xiangyu Zhu and Jianying Zheng and Pengfei Huang and Guyue Zhou and Jianming HU and Xianyuan Zhan},
    booktitle={IEEE International Conference on Robotics and Automation},
    year={2025}
}

About

[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 91.7%
  • JavaScript 3.9%
  • HTML 3.3%
  • CSS 1.1%