ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.09935
  4. Cited By
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation

DEBATE: Devil's Advocate-Based Assessment and Text Evaluation

16 May 2024
Alex G. Kim
Keonwoo Kim
Sangwon Yoon
    ELM
ArXivPDFHTML

Papers citing "DEBATE: Devil's Advocate-Based Assessment and Text Evaluation"

4 / 4 papers shown
Title
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
SeongYeub Chu
JongWoo Kim
MunYong Yi
53
1
0
21 Feb 2025
Can Large Language Models Be an Alternative to Human Evaluations?
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang
Hung-yi Lee
ALM
LM&MA
206
559
0
03 May 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing
  Critical Translation Errors in Sentiment-oriented Text
BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing Critical Translation Errors in Sentiment-oriented Text
Hadeel Saadany
Constantin Orasan
29
35
0
29 Sep 2021
1