Title |
|---|
![]() The Viability of Crowdsourcing for RAG EvaluationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 |
Improving the Reusability of Conversational Search Test CollectionsEuropean Conference on Information Retrieval (ECIR), 2025 |
![]() Synthetic Test Collections for Retrieval EvaluationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2024 |
![]() Large language models can accurately predict searcher preferencesAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023 |
![]() G-Eval: NLG Evaluation using GPT-4 with Better Human AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() Simplified Data Wrangling with ir_datasetsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021 |