Data for dataset selection #16

Open
bOTdESUU opened this issue Mar 21, 2025 · 2 comments

Comments

@bOTdESUU

Hi, thanks for the nice work!

I am trying to develop a method for curating datasets, similar to LIMR. It would be helpful if you could release the data used to calculate the LIMR score, and potentially the model trained on the full dataset, so that I do not need to rerun the RL training. If I understand the code correctly, the data should be the ./data/output/math.8k.json file.

@bOTdESUU
Author

Thanks for the response.

If I understand correctly, the scores JSON contains the alignment scores. What I am interested in instead is the per-epoch rewards of all samples that were used to calculate the alignment score, i.e. the rewards r_i^k described in Section 2.2.1. Sorry for the confusion. Please let me know whether these are already in the repo or, if not, whether you would be willing to release them.
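
To make the request concrete, here is a minimal sketch of how per-epoch rewards might be turned into an alignment-style score. It rests on several assumptions that are not confirmed in this thread: that the rewards are stored as a samples-by-epochs array with values in [0, 1], and that the score is one minus the squared distance between a sample's reward curve and the dataset-average curve, normalized by the distance of the average curve from a perfect reward of 1. The function name `alignment_scores` and the array layout are hypothetical, and the exact formula in Section 2.2.1 may differ.

```python
import numpy as np


def alignment_scores(rewards):
    """Score each sample by how closely its reward curve tracks the average.

    rewards: array of shape (num_samples, num_epochs), where rewards[i, k]
    plays the role of r_i^k. This layout is an assumption for illustration.
    """
    rewards = np.asarray(rewards, dtype=float)
    # Average reward per epoch across all samples.
    avg_curve = rewards.mean(axis=0)
    # Squared distance of each sample's curve from the average curve.
    sq_dist = ((rewards - avg_curve) ** 2).sum(axis=1)
    # Assumed normalizer: squared distance of the average curve from reward 1.
    norm = ((1.0 - avg_curve) ** 2).sum()
    if norm == 0.0:
        raise ValueError("average reward is 1 at every epoch; score is undefined")
    return 1.0 - sq_dist / norm


if __name__ == "__main__":
    # Toy data: 3 samples, 3 epochs (made-up values, not from the repo).
    toy = [[0.0, 0.5, 1.0], [0.0, 0.5, 1.0], [0.0, 0.5, 1.0]]
    print(alignment_scores(toy))
```

With the raw r_i^k values released, a score of this kind could be recomputed, or replaced by a different selection criterion, without rerunning RL.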
