v1v2v3v4 (latest)

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Yongliang Shen

Kaitao Song

Xu Tan

Dongsheng Li

Weiming Lu

Yueting Zhuang

MLLM

ArXiv (abs)PDF HTML HuggingFace (12 upvotes)

Papers citing "HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"

50 / 763 papers shown

Automated Phishing Detection Using URLs and Webpages

Huilin Wang

Bryan Hooi

281

03 Aug 2024

NOLO: Navigate Only Look Once

323

02 Aug 2024

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

Chong Ma

...

Tianming Liu

295

02 Aug 2024

Autonomous LLM-Enhanced Adversarial Attack for Text-to-MotionAAAI Conference on Artificial Intelligence (AAAI), 2024

289

01 Aug 2024

ReplanVLM: Replanning Robotic Tasks with Visual Language Models

190

31 Jul 2024

Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language

Hossein Rajaby Faghihi

228

30 Jul 2024

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

218

25 Jul 2024

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

Junran Peng

334

24 Jul 2024

MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery

Pei Zhou

Yanchao Yang

251

21 Jul 2024

Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights

263

20 Jul 2024

KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models

Licheng Wen

201

19 Jul 2024

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

...

331

18 Jul 2024

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

Xuesong Niu

234

17 Jul 2024

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

Hu Jinfang

Bowen Zhou

LM&MA

392

12 Jul 2024

Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents

596

11 Jul 2024

Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models

297

09 Jul 2024

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Chen Qian

332

09 Jul 2024

Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy

Zhenyu Guan

Xiangyu Kong

Fangwei Zhong

Yizhou Wang

240

09 Jul 2024

GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing

326

08 Jul 2024

LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts

Wei Wang

244

113

06 Jul 2024

Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

Jae Sung Park

Yejin Choi

323

02 Jul 2024

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs

Qiucheng Wu

Handong Zhao

Michael Stephen Saxon

T. Bui

William Yang Wang

Yang Zhang

Shiyu Chang

CoGe

220

02 Jul 2024

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

Enshu Liu

Huazhong Yang

Yu Wang

MoE

263

01 Jul 2024

Teola: Towards End-to-End Optimization of LLM-based Applications

643

29 Jun 2024

ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents

Li Zhang

Mengwei Xu

Xuhui Liu

LLMAG

387

28 Jun 2024

When Search Engine Services meet Large Language Models: Visions and Challenges

353

28 Jun 2024

DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph

229

25 Jun 2024

Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning

Trevor Darrell

237

21 Jun 2024

Transferable speech-to-text large language model alignment moduleInterspeech (Interspeech), 2024

Boyong Wu

Chao Yan

Haoran Pu

152

19 Jun 2024

AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM AgentsNeural Information Processing Systems (NeurIPS), 2024

Florian Tramèr

424

19 Jun 2024

Automatic benchmarking of large multimodal models via iterative experiment programming

Yiming Wang

243

18 Jun 2024

ARTIST: Improving the Generation of Text-rich Images by Disentanglement

Tong Yu

Tong Sun

261

17 Jun 2024

Towards Vision-Language Geo-Foundation Model: A Survey

Yue Zhou

Xue Jiang

Yiping Ke

245

13 Jun 2024

GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

379

103

12 Jun 2024

Scaling Large Language Model-based Multi-Agent Collaboration

Chen Qian

Zihao Xie

YiFei Wang

Wei Liu

Yufan Dang

...

Zhuoyun Du

Weize Chen

Cheng Yang

Zhiyuan Liu

Maosong Sun

AI4CE LLMAG LM&Ro

475

133

11 Jun 2024

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Weihua Luo

413

11 Jun 2024

RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent

471

11 Jun 2024

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Hannaneh Hajishirzi

264

10 Jun 2024

AICoderEval: Improving AI Domain Code Generation of Large Language Models

161

07 Jun 2024

LogiCode: an LLM-Driven Framework for Logical Anomaly DetectionIEEE Transactions on Automation Science and Engineering (T-ASE), 2024

245

07 Jun 2024

Tool-Planner: Task Planning with Clusters across Multiple Tools

371

06 Jun 2024

A Survey of Language-Based Communication in Robotics

William Hunt

Sarvapali D. Ramchurn

Mohammad D. Soorati

LM&Ro

715

06 Jun 2024

AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways

Sheng Wen

409

134

04 Jun 2024

Towards a copilot in BIM authoring tool using a large language model-based agent for intelligent human-machine interaction

216

02 Jun 2024

Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

Zejian Li

249

29 May 2024

Evaluating the External and Parametric Knowledge Fusion of Large Language Models

...

Lifeng Shang

Qun Liu

Yong Liu

Ruiming Tang

KELM

250

29 May 2024

MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning

319

28 May 2024

A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models

Chengxing Xie

Difan Zou

LRM LLMAG

225

28 May 2024

Tool Learning with Large Language Models: A Survey

Jun Xu

342

217

28 May 2024

LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding

303

27 May 2024