-
Notifications
You must be signed in to change notification settings - Fork 114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support model evaluation #197
Conversation
Support argo workflow system Support evaluation datasets support opencompass
support harness evaluation
🚀 The |
The TipsCodeReview Commands (invoked as MR or PR comments)
CodeReview Discussion ChatThere are 2 ways to chat with Starship CodeReview:
Note: Be mindful of the bot's finite context window. CodeReview Documentation and Community
|
for LLM evaluation, provide one-stop platform for large model evaluation, aiming to provide a fair, open, and reproducible benchmark for large model evaluation.
currently supported evaluation