Real-Time Visual Feedback to Guide Benchmark Creation: A
Human-and-Metric-in-the-Loop WorkflowConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023 |
Don't Blame the Annotator: Bias Already Starts in the Annotation
InstructionsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022 |
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning
TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
How Robust are Model Rankings: A Leaderboard Customization Approach for
Equitable EvaluationAAAI Conference on Artificial Intelligence (AAAI), 2021 |