Improving Automatic Evaluation of Large Language Models (LLMs) in Biomedical Relation Extraction via LLMs-as-the-JudgeAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark DatasetAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
Assessing and Enhancing Large Language Models in Rare Disease
Question-answering Guanchu Wang Junhao Ran Ruixiang Tang Chia-Yuan Chang Chia-Yuan Chang Yu-Neng Chuang Zirui Liu Vladimir Braverman Zhandong Liu Xia Hu |
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls
of Large Language Models on Bengali NLPInternational Conference on Language Resources and Evaluation (LREC), 2023 |