v1v2v3v4 (latest)

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Yongliang Shen

Kaitao Song

Xu Tan

Dongsheng Li

Weiming Lu

Yueting Zhuang

MLLM

ArXiv (abs)PDF HTML HuggingFace (12 upvotes)

Papers citing "HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"

50 / 754 papers shown

Mitigating Hallucination in Visual Language Models with Visual Supervision

Ming Tang

244

27 Nov 2023

Function-constrained Program Synthesis

Patrick Hajali

Ignas Budvytis

189

27 Nov 2023

See and Think: Embodied Agent in Virtual EnvironmentEuropean Conference on Computer Vision (ECCV), 2023

394

26 Nov 2023

Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied LMM-based Agent on Drones

190

25 Nov 2023

Griffon: Spelling out All Object Locations at Any Granularity with Large Language ModelsEuropean Conference on Computer Vision (ECCV), 2023

242

24 Nov 2023

Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents

Liming Zhu

377

22 Nov 2023

A Survey on Multimodal Large Language Models for Autonomous Driving

Wenqian Ye

...

342

431

21 Nov 2023

AcademicGPT: Empowering Academic Research

...

Lei Zhang

211

21 Nov 2023

InteraSSort: Interactive Assortment Planning Using Large Language ModelsSocial Science Research Network (SSRN), 2023

Saketh Reddy Karra

Theja Tulabandhula

169

20 Nov 2023

VLM-Eval: A General Evaluation on Video Large Language Models

156

20 Nov 2023

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

...

Rui Wang

363

20 Nov 2023

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

...

292

19 Nov 2023

UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework

100

16 Nov 2023

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Bin Lin

1.6K

1,181

16 Nov 2023

When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks

Hao Peng

Xiaozhi Wang

...

Bin Xu

Lei Hou

Juanzi Li

258

15 Nov 2023

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023

508

352

14 Nov 2023

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Yunfei Chu

Jin Xu

Xiaohuan Zhou

Qian Yang

Shiliang Zhang

Zhijie Yan

Chang Zhou

Jingren Zhou

AuLLM

317

595

14 Nov 2023

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Wanggui He

240

11 Nov 2023

How to Bridge the Gap between Modalities: Survey on Multimodal Large Language Model

Shasha Li

275

10 Nov 2023

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models

206

08 Nov 2023

ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents

274

06 Nov 2023

RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative SimulationInternational Conference on Machine Learning (ICML), 2023

Yufei Wang

Chuang Gan

482

169

02 Nov 2023

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xingshan Zeng

Lifeng Shang

Qun Liu

Kam-Fai Wong

ELM

286

30 Oct 2023

ControlLLM: Augment Language Models with Tools by Searching on GraphsEuropean Conference on Computer Vision (ECCV), 2023

...

Yu Qiao

392

26 Oct 2023

Managing extreme AI risks amid rapid progress

...

344

26 Oct 2023

DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language ModelsNeural Information Processing Systems (NeurIPS), 2023

314

181

25 Oct 2023

Woodpecker: Hallucination Correction for Multimodal Large Language ModelsScience China Information Sciences (Sci China Inf Sci), 2023

Enhong Chen

334

197

24 Oct 2023

A Communication Theory Perspective on Prompting Engineering Methods for Large Language ModelsJournal of Computational Science and Technology (JCST), 2023

Hanlin Gu

212

24 Oct 2023

Large Language Models are Visual Reasoning CoordinatorsNeural Information Processing Systems (NeurIPS), 2023

Bo Li

Ziwei Liu

276

23 Oct 2023

ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search

297

20 Oct 2023

Frozen Transformers in Language Models Are Effective Visual Encoder Layers

430

19 Oct 2023

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale

242

18 Oct 2023

Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle

Jiang Bian

191

17 Oct 2023

Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks

460

223

16 Oct 2023

BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in BiologyConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Odhran O'Donoghue

Aleksandar Shtedritski

292

16 Oct 2023

An Expression Tree Decoding Strategy for Mathematical Equation Generation

328

14 Oct 2023

SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network

163

13 Oct 2023

Large Language Models as Source Planner for Personalized Knowledge-grounded DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Irwin King

259

13 Oct 2023

Octopus: Embodied Vision-Language Programmer from Environmental FeedbackEuropean Conference on Computer Vision (ECCV), 2023

...

Ziwei Liu

293

12 Oct 2023

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Zicheng Liu

102

12 Oct 2023

Towards Robust Multi-Modal Reasoning via Model SelectionInternational Conference on Learning Representations (ICLR), 2023

303

12 Oct 2023

GameGPT: Multi-agent Collaborative Framework for Game Development

303

12 Oct 2023

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

Zicheng Liu

222

11 Oct 2023

An Empirical Study of Instruction-tuning Large Language Models in ChineseConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zheng Lin

199

11 Oct 2023

Lemur: Harmonizing Natural Language and Code for Language AgentsInternational Conference on Learning Representations (ICLR), 2023

...

Lingpeng Kong

Bailin Wang

Caiming Xiong

Tao Yu

240

10 Oct 2023

Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling TaskNatural Language Processing and Chinese Computing (NLPCC), 2023

...

Weiran Xu

213

10 Oct 2023

ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

272

09 Oct 2023

Establishing Trustworthiness: Rethinking Tasks and Model EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

187

09 Oct 2023

Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source ModelNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Zhiyuan Liu

216

08 Oct 2023

Self-Knowledge Guided Retrieval Augmentation for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yile Wang

Peng Li

Maosong Sun

Yang Liu

RALM KELM

236

08 Oct 2023