Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2412.04363
Cited By
Challenges in Trustworthy Human Evaluation of Chatbots
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
5 December 2024
Wenting Zhao
Alexander M. Rush
Tanya Goyal
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"Challenges in Trustworthy Human Evaluation of Chatbots"
6 / 6 papers shown
Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security
Ali Naseh
Anshuman Suri
Yuefeng Peng
Harsh Chaudhari
Alina Oprea
Amir Houmansadr
109
0
0
07 Oct 2025
Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
Parker Whitfill
Stewy Slocum
ALM
83
0
0
11 Aug 2025
Curiosity by Design: An LLM-based Coding Assistant Asking Clarification Questions
Harsh Darji
Thibaud Lutellier
ELM
171
1
0
28 Jul 2025
Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
Sheshera Mysore
Debarati Das
Hancheng Cao
Bahareh Sarrafzadeh
484
4
0
21 May 2025
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
Zhiyuan Zeng
Yizhong Wang
Hannaneh Hajishirzi
Pang Wei Koh
ELM
404
15
0
11 Mar 2025
Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Rui Min
Tianyu Pang
Chao Du
Qian Liu
Minhao Cheng
Min Lin
AAML
445
11
0
29 Jan 2025
1