GitHub - hustvl/AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

🌌 AlphaDrive: Unleashing the Power of VLMs in Autonomous
Driving via Reinforcement Learning and Reasoning

Bo Jiang¹, Shaoyu Chen^1,2, Qian Zhang², Wenyu Liu¹, Xinggang Wang^1,📧

¹ Huazhong University of Science and Technology, ² Horizon Robotics, ^📧 corresponding author

vis.mp4

✨ Highlights

To the best of our knowledge, AlphaDrive is the first to integrate GRPO-based RL with planning reasoning to autonomous driving, significantly boosting both performance and training efficiency.
We are excited to discover that, following RL training, AlphaDrive exhibits some emergent multimodal planning capabilities, which is promising for improving driving safety and efficiency.

📋 News

[2025-3-26]: We have released the training and evaluation scripts of AlphaDrive.

[2025-3-11]: AlphaDrive arXiv paper released. Code are coming soon. Please stay tuned! ☕️

🎮 Getting Started

Installtion

git clone [email protected]:hustvl/AlphaDrive.git
conda create -n alphadrive python=3.11 -y
conda activate alphadrive
sh setup.sh

Data Preparation

We provide the prompt templates used in AlphaDrive for training and generating planning reasoning data, and an example QA is provided in example.json.

Training

For Supervised Fine-tuning Phase:

sh train_tools/run_train_sft.sh

For Reinforcement Learning Phase:

sh train_tools/run_train_grpo.sh

Evaluation

You can evaluate the meta-action planning accuracy using the script below.

sh eval_tools/qwen2vl_plan_cmd_eval.sh

📊 Qualitative Results

❤️ Acknowledgements

This repo is built on open-r1 and R1-V. We sincerely thank the contributors for their great work!

📚 Citation

If you find AlphaDrive useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.

@article{jiang2025alphadrive,
      title={AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning}, 
      author={Bo Jiang and Shaoyu Chen and Qian Zhang and Wenyu Liu and Xinggang Wang},
      year={2025},
      eprint={2503.07608},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.07608}, 
}

🥰 Related Projects

Check out our other awesome projects:

VAD & VADv2: Vectorized Scene Representation for Efficient Autonomous Driving.

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving.

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving.

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning.

MapTR: An End-to-End Framework for Online Vectorized HD Map Construction.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
data_tools		data_tools
eval_tools		eval_tools
src		src
train_tools		train_tools
LICENSE		LICENSE
README.md		README.md
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🌌 AlphaDrive: Unleashing the Power of VLMs in Autonomous
Driving via Reinforcement Learning and Reasoning

✨ Highlights

📋 News

🎮 Getting Started

Installtion

Data Preparation

Training

Evaluation

📊 Qualitative Results

❤️ Acknowledgements

📚 Citation

🥰 Related Projects

About

Releases

Packages

Languages

License

hustvl/AlphaDrive

Folders and files

Latest commit

History

Repository files navigation

🌌 AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

✨ Highlights

📋 News

🎮 Getting Started

Installtion

Data Preparation

Training

Evaluation

📊 Qualitative Results

❤️ Acknowledgements

📚 Citation

🥰 Related Projects

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

🌌 AlphaDrive: Unleashing the Power of VLMs in Autonomous
Driving via Reinforcement Learning and Reasoning

Packages