arXiv: 2412.04363

Challenges in Trustworthy Human Evaluation of Chatbots

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

5 December 2024

Alexander M. Rush

Papers citing "Challenges in Trustworthy Human Evaluation of Chatbots"

Text-to-Image Models Leave Identifiable Signatures: Implications for Leaderboard Security

Harsh Chaudhari

Amir Houmansadr

109

0

0

07 Oct 2025

Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback

Parker Whitfill

83

0

0

11 Aug 2025

Curiosity by Design: An LLM-based Coding Assistant Asking Clarification Questions

Thibaud Lutellier

171

1

0

28 Jul 2025

Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild

Sheshera Mysore

Bahareh Sarrafzadeh

484

4

0

21 May 2025

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Hannaneh Hajishirzi

404

15

0

11 Mar 2025

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

445

11

0

29 Jan 2025