All Papers
Title |
|---|
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Aligning Black-box Language Models with Human Judgments. North American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
Improving Model Evaluation using SMART Filtering of Benchmark Datasets. North American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
Mitigating Selection Bias with Node Pruning and Auxiliary Options. Annual Meeting of the Association for Computational Linguistics (ACL), 2024 |