X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic SystemAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
WebWalker: Benchmarking LLMs in Web TraversalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Infogent: An Agent-Based Framework for Web Information AggregationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
Large Language Models Empowered Personalized Web AgentsThe Web Conference (WWW), 2024 |
Beyond Browsing: API-Based Web AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
Tur[k]ingBench: A Challenge Benchmark for Web Agents Kevin Xu Yeganeh Kordi Kate Sanders Yizhong Wang Adam Byerly Kate Sanders Adam Byerly Jingyu Zhang Benjamin Van Durme Daniel Khashabi |
Multi-Level Compositional Reasoning for Interactive Instruction
FollowingAAAI Conference on Artificial Intelligence (AAAI), 2023 |
WebArena: A Realistic Web Environment for Building Autonomous AgentsInternational Conference on Learning Representations (ICLR), 2023 Shuyan Zhou Frank F. Xu Hao Zhu Xuhui Zhou Robert Lo ...Tianyue Ou Yonatan Bisk Daniel Fried Uri Alon Graham Neubig |
Referring to Screen Texts with Voice AssistantsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 |
Mind2Web: Towards a Generalist Agent for the WebNeural Information Processing Systems (NeurIPS), 2023 |
META-GUI: Towards Multi-modal Conversational Agents on Mobile GUIConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
mForms : Multimodal Form-Filling with Question AnsweringInternational Conference on Language Resources and Evaluation (LREC), 2020 |