
![]() Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented
GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() Finding Blind Spots in Evaluator LLMs with Interpretable ChecklistsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() Do Multimodal Foundation Models Understand Enterprise Workflows? A
Benchmark for Business Process Management TasksNeural Information Processing Systems (NeurIPS), 2024 |
![]() Learning to Generate Answers with Citations via Factual Consistency
ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
![]() Estimating Knowledge in Large Language Models Without Generating a
Single TokenConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() Large language model validity via enhanced conformal prediction methodsNeural Information Processing Systems (NeurIPS), 2024 |