ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.08491
  4. Cited By
Prometheus: Inducing Fine-grained Evaluation Capability in Language
  Models

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

12 October 2023
Seungone Kim
Jamin Shin
Yejin Cho
Joel Jang
Shayne Longpre
Hwaran Lee
Sangdoo Yun
Seongjin Shin
Sungdong Kim
James Thorne
Minjoon Seo
    ALM
    LM&MA
    ELM
ArXivPDFHTML

Papers citing "Prometheus: Inducing Fine-grained Evaluation Capability in Language Models"

18 / 168 papers shown
Title
Leveraging Large Language Models for NLG Evaluation: Advances and
  Challenges
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges
Zhen Li
Xiaohan Xu
Tao Shen
Can Xu
Jia-Chen Gu
Yuxuan Lai
Chongyang Tao
Shuai Ma
LM&MA
ELM
26
9
0
13 Jan 2024
Structsum Generation for Faster Text Comprehension
Structsum Generation for Faster Text Comprehension
Parag Jain
Andreea Marzoca
Francesco Piccinno
ReLM
31
5
0
12 Jan 2024
The Critique of Critique
The Critique of Critique
Shichao Sun
Junlong Li
Weizhe Yuan
Ruifeng Yuan
Wenjie Li
Pengfei Liu
ELM
27
0
0
09 Jan 2024
RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language
  Models
RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models
Meiling Tao
Xuechen Liang
Tianyu Shi
Lei Yu
Yiting Xie
32
4
0
17 Dec 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
AlignBench: Benchmarking Chinese Alignment of Large Language Models
Xiao Liu
Xuanyu Lei
Sheng-Ping Wang
Yue Huang
Zhuoer Feng
...
Hongning Wang
Jing Zhang
Minlie Huang
Yuxiao Dong
Jie Tang
ELM
LM&MA
ALM
120
42
0
30 Nov 2023
Digital Socrates: Evaluating LLMs through Explanation Critiques
Digital Socrates: Evaluating LLMs through Explanation Critiques
Yuling Gu
Oyvind Tafjord
Peter Clark
ELM
LRM
24
2
0
16 Nov 2023
LifeTox: Unveiling Implicit Toxicity in Life Advice
LifeTox: Unveiling Implicit Toxicity in Life Advice
Minbeom Kim
Jahyun Koo
Hwanhee Lee
Joonsuk Park
Hwaran Lee
Kyomin Jung
6
6
0
16 Nov 2023
Volcano: Mitigating Multimodal Hallucination through Self-Feedback
  Guided Revision
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
Seongyun Lee
Sue Hyun Park
Yongrae Jo
Minjoon Seo
22
50
0
13 Nov 2023
LitSumm: Large language models for literature summarisation of non-coding RNAs
LitSumm: Large language models for literature summarisation of non-coding RNAs
Andrew Green
C. Ribas
Nancy Ontiveros-Palacios
Sam Griffiths-Jones
Anton I. Petrov
Alex Bateman
Blake Sweeney
13
4
0
06 Nov 2023
AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using
  Open-Source LLMs
AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs
Yann Hicke
Anmol Agarwal
Qianou Ma
Paul Denny
AI4Ed
26
24
0
05 Nov 2023
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Lianghui Zhu
Xinggang Wang
Xinlong Wang
ELM
ALM
54
106
0
26 Oct 2023
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial
  Applications
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
Manuel Faysse
Gautier Viaud
C´eline Hudelot
Pierre Colombo
25
9
0
21 Oct 2023
How Good is ChatGPT in Giving Advice on Your Visualization Design?
How Good is ChatGPT in Giving Advice on Your Visualization Design?
Nam Wook Kim
Grace Myers
Benjamin Bach
26
20
0
14 Oct 2023
Evaluating Large Language Models at Evaluating Instruction Following
Evaluating Large Language Models at Evaluating Instruction Following
Zhiyuan Zeng
Jiatong Yu
Tianyu Gao
Yu Meng
Tanya Goyal
Danqi Chen
ELM
ALM
23
166
0
11 Oct 2023
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges
Hiba Ahsan
Denis Jered McInerney
Jisoo Kim
Christopher Potter
Geoffrey S. Young
Silvio Amir
Byron C. Wallace
19
12
0
08 Sep 2023
Aligning Large Language Models through Synthetic Feedback
Aligning Large Language Models through Synthetic Feedback
Sungdong Kim
Sanghwan Bae
Jamin Shin
Soyoung Kang
Donghyun Kwak
Kang Min Yoo
Minjoon Seo
ALM
SyDa
73
67
0
23 May 2023
Can Large Language Models Be an Alternative to Human Evaluations?
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang
Hung-yi Lee
ALM
LM&MA
209
568
0
03 May 2023
Instruction Tuning with GPT-4
Instruction Tuning with GPT-4
Baolin Peng
Chunyuan Li
Pengcheng He
Michel Galley
Jianfeng Gao
SyDa
ALM
LM&MA
157
579
0
06 Apr 2023
Previous
1234