
Title |
|---|
![]() DeepEyesV2: Toward Agentic Multimodal ModelIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025 |
![]() WebWalker: Benchmarking LLMs in Web TraversalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
![]() Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentInternational Conference on Learning Representations (ICLR), 2024 |
![]() VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality DocumentsInternational Conference on Learning Representations (ICLR), 2024 |