
- SmartPlay: A Benchmark for LLMs as Intelligent Agents. International Conference on Learning Representations (ICLR), 2023
- ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving. International Conference on Learning Representations (ICLR), 2023
- You Only Look at Screens: Multimodal Chain-of-Action Agents. Annual Meeting of the Association for Computational Linguistics (ACL), 2023
- AutoDroid: LLM-powered Task Automation in Android. ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023
- AgentBench: Evaluating LLMs as Agents. International Conference on Learning Representations (ICLR), 2023
- A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis. International Conference on Learning Representations (ICLR), 2023
- Android in the Wild: A Large-Scale Dataset for Android Device Control. Neural Information Processing Systems (NeurIPS), 2023
- Large Language Models as Tool Makers. International Conference on Learning Representations (ICLR), 2023
- CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society. Neural Information Processing Systems (NeurIPS), 2023