SiliangZeng/R1-Experiment

R1-Experiments

We use this public project to run RL reasoning experiments for Llama and Qwen models.

The source code comes from willccbb/grpo_demo.py.

Run

pip install -r requirements.txt
pip install flash-attn --no-build-isolation
pip install git+https://github.com/huggingface/trl.git
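After running the install commands, a quick check that the packages are visible to Python can save a failed training launch. The snippet below is a minimal sketch, not part of this repository: the helper name `missing_packages` is hypothetical, and the import names `trl` and `flash_attn` are assumed from the pip commands above. It only tests whether the modules can be located, not whether they work with your GPU or CUDA version.

```python
import importlib.util

# Import names assumed from the pip commands above:
# "flash-attn" is expected to provide the module "flash_attn",
# and the TRL install is expected to provide "trl".
REQUIRED = ["trl", "flash_attn"]


def missing_packages(names):
    """Return the top-level module names in `names` that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

If anything is reported missing, rerun the corresponding pip command in the same environment you plan to train in.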
