You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed that evaluate.py only has verify_float() for numerical answers. However, some ground truth in MATH dataset are strings like "p - q". Could you provide a verify function to handle these cases?
Thanks!
The text was updated successfully, but these errors were encountered:
I think the only way is to check whether p-q or $p$-$q$ exists in the output through re, or use a stronger LLM (might be slow and unstable). Hope there is a better way.
I think the only way is to check whether p-q or p -$q$ exists in the output through re, or use a stronger LLM (might be slow and unstable). Hope there is a better way.
I think the only way is to check whether p-q or p -$q$ exists in the output through re, or use a stronger LLM (might be slow and unstable). Hope there is a better way.
Hi! Thanks for your awesome work!
I noticed that evaluate.py only has verify_float() for numerical answers. However, some ground truth in MATH dataset are strings like "p - q". Could you provide a verify function to handle these cases?
Thanks!
The text was updated successfully, but these errors were encountered: