Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness. Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena. Neural Information Processing Systems (NeurIPS), 2023.
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study. International Joint Conference on Natural Language Processing (IJCNLP), 2023.
ELI5: Long Form Question Answering. Annual Meeting of the Association for Computational Linguistics (ACL), 2019.