ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.04363
  4. Cited By
Challenges in Trustworthy Human Evaluation of Chatbots

Challenges in Trustworthy Human Evaluation of Chatbots

North American Chapter of the Association for Computational Linguistics (NAACL), 2024
5 December 2024
Wenting Zhao
Alexander M. Rush
Tanya Goyal
    ALM
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "Challenges in Trustworthy Human Evaluation of Chatbots"

6 / 6 papers shown
Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security
Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security
Ali Naseh
Anshuman Suri
Yuefeng Peng
Harsh Chaudhari
Alina Oprea
Amir Houmansadr
109
0
0
07 Oct 2025
Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
Parker Whitfill
Stewy Slocum
ALM
83
0
0
11 Aug 2025
Curiosity by Design: An LLM-based Coding Assistant Asking Clarification Questions
Curiosity by Design: An LLM-based Coding Assistant Asking Clarification Questions
Harsh Darji
Thibaud Lutellier
ELM
171
1
0
28 Jul 2025
Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
Sheshera Mysore
Debarati Das
Hancheng Cao
Bahareh Sarrafzadeh
484
4
0
21 May 2025
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
Zhiyuan Zeng
Yizhong Wang
Hannaneh Hajishirzi
Pang Wei Koh
ELM
404
15
0
11 Mar 2025
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Rui Min
Tianyu Pang
Chao Du
Qian Liu
Minhao Cheng
Min Lin
AAML
445
11
0
29 Jan 2025
1