BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
MDCure: A Scalable Pipeline for Multi-Document Instruction-FollowingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
DocETL: Agentic Query Rewriting and Evaluation for Complex Document ProcessingProceedings of the VLDB Endowment (PVLDB), 2024 |
LLMMapReduce: Simplified Long-Sequence Processing using Large
Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |