prometheus-eval

All

7 repositories

prometheus-eval
Public
Evaluate your LLM's response with Prometheus and GPT4 💯
python evaluation gpt4 llm llmops vllm litellm llm-as-a-judge llm-as-evaluator
Python
•
Apache License 2.0
•60•979•12•1•Updated Apr 25, 2025Apr 25, 2025
scaling-evaluation-compute
Public
Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"
0•12•1•0•Updated Mar 25, 2025Mar 25, 2025
prometheus-vision
Public
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubric, Prometheus-Vision is a good alternative for human evaluation and GPT-4V evaluation.
Python
•
Apache License 2.0
•7•74•3•0•Updated Sep 13, 2024Sep 13, 2024
.github
Public
Organization README for prometheus-eval
0•0•0•0•Updated Jun 11, 2024Jun 11, 2024
leaderboard
Public
BiGGen-Bench Leaderboard
Python
•0•0•0•0•Updated Jun 4, 2024Jun 4, 2024
prometheus-eval.github.io
Public
Documentation and blogposts for Prometheus
1•0•0•0•Updated May 1, 2024May 1, 2024
prometheus
Public
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.
Python
•
MIT License
•18•303•4•0•Updated Nov 11, 2023Nov 11, 2023