CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Retrieval Augmented Spelling Correction for E-Commerce ApplicationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
BABILong: Testing the Limits of LLMs with Long Context
Reasoning-in-a-HaystackNeural Information Processing Systems (NeurIPS), 2024 |
The Faiss libraryIEEE Transactions on Big Data (IEEE Trans. Big Data), 2024 |
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level
Hallucination DetectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
CacheGen: KV Cache Compression and Streaming for Fast Language Model
ServingConference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), 2023 |
Making Retrieval-Augmented Language Models Robust to Irrelevant ContextInternational Conference on Learning Representations (ICLR), 2023 |
A Comprehensive Overview of Large Language ModelsACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023 Humza Naveed Asad Ullah Khan Shi Qiu Muhammad Saqib Saeed Anwar Muhammad Usman Naveed Akhtar Nick Barnes Lin Wang |
Lost in the Middle: How Language Models Use Long ContextsTransactions of the Association for Computational Linguistics (TACL), 2023 |