
Title |
|---|
![]() Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Why Uncertainty Estimation Methods Fall Short in RAG: An Axiomatic AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2025 |
![]() Towards Long Context Hallucination DetectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
![]() An Empirical Study of Evaluating Long-form Question AnsweringAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 |
![]() HalluLens: LLM Hallucination BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language ModelsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 |
![]() Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy CitationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() CoLoTa: A Dataset for Entity-based Commonsense Reasoning over Long-Tail KnowledgeAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 |