
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation

12 June 2024
Jie Ruan
Wenqing Wang
Xiaojun Wan
    AAML
    ELM

Papers citing "Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation"

8 / 8 papers shown
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Haoran Que
Feiyu Duan
Liqun He
Yutao Mou
Wangchunshu Zhou
...
Ge Zhang
Junran Peng
Zhaoxiang Zhang
Songyang Zhang
Kai Chen
LM&MA
ELM
VLM
43
11
0
24 Sep 2024
q2d: Turning Questions into Dialogs to Teach Models How to Search
Yonatan Bitton
Shlomi Cohen-Ganor
Ido Hakimi
Yoad Lewenberg
Roee Aharoni
Enav Weinreb
44
4
0
27 Apr 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications
Kaitlyn Zhou
Su Lin Blodgett
Adam Trischler
Hal Daumé
Kaheer Suleman
Alexandra Olteanu
ELM
94
25
0
13 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,217
0
21 Mar 2022
Machine-in-the-Loop Rewriting for Creative Image Captioning
Vishakh Padmakumar
He He
56
17
0
07 Nov 2021
BiSECT: Learning to Split and Rephrase Sentences with Bitexts
Joongwon Kim
Mounica Maddela
Reno Kriz
Wei-ping Xu
Chris Callison-Burch
54
25
0
10 Sep 2021
Identifying Morality Frames in Political Tweets using Relational Learning
Shamik Roy
Maria Leonor Pacheco
Dan Goldwasser
27
41
0
09 Sep 2021