HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025 |
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture | Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Data Pruning by Information Maximization | International Conference on Learning Representations (ICLR), 2025 |