Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17580
Cited By
v1
v2
v3
v4 (latest)
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"
50 / 754 papers shown
Mitigating Hallucination in Visual Language Models with Visual Supervision
Zhiyang Chen
Yousong Zhu
Yufei Zhan
Zhaowen Li
Honghui Dong
Jinqiao Wang
Ming Tang
VLM
MLLM
244
52
0
27 Nov 2023
Function-constrained Program Synthesis
Patrick Hajali
Ignas Budvytis
189
1
0
27 Nov 2023
See and Think: Embodied Agent in Virtual Environment
European Conference on Computer Vision (ECCV), 2023
Zhonghan Zhao
Wenhao Chai
Xuan Wang
Li Boyi
Shengyu Hao
Shidong Cao
Tianbo Ye
Gaoang Wang
LM&Ro
LLMAG
394
54
0
26 Nov 2023
Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied LMM-based Agent on Drones
Haoran Zhao
Fengxing Pan
Huqiuyue Ping
Yaoming Zhou
AI4CE
190
17
0
25 Nov 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
European Conference on Computer Vision (ECCV), 2023
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
242
30
0
24 Nov 2023
Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents
Qinghua Lu
Liming Zhu
Xiwei Xu
Zhenchang Xing
Stefan Harrer
Jon Whittle
LM&Ro
AI4CE
LLMAG
377
21
0
22 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
342
431
0
21 Nov 2023
AcademicGPT: Empowering Academic Research
Shufa Wei
Xiaolong Xu
Xianbiao Qi
Xi Yin
Jun Xia
...
Chihao Dai
Lihua Wang
Xiaohui Liu
Lei Zhang
Yutao Xie
LM&MA
211
5
0
21 Nov 2023
InteraSSort: Interactive Assortment Planning Using Large Language Models
Social Science Research Network (SSRN), 2023
Saketh Reddy Karra
Theja Tulabandhula
169
3
0
20 Nov 2023
VLM-Eval: A General Evaluation on Video Large Language Models
Shuailin Li
Yuang Zhang
Yucheng Zhao
Qiuyue Wang
Fan Jia
Yingfei Liu
Tiancai Wang
MLLM
ELM
156
6
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
363
92
0
20 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
292
50
0
19 Nov 2023
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Cindy Yang
Yu Tian
Deshun Yang
Bang Yang
Zaoshan Huang
Zihao Li
Yuexian Zou
AI4TS
100
3
0
16 Nov 2023
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM
MLLM
1.6K
1,181
0
16 Nov 2023
When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Hao Peng
Xiaozhi Wang
Jianhui Chen
Weikai Li
Yunjia Qi
...
Zhili Wu
Kaisheng Zeng
Bin Xu
Lei Hou
Juanzi Li
258
42
0
15 Nov 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Peng Jin
Ryuichi Takanobu
Caiwan Zhang
Xiaochun Cao
Li-ming Yuan
MLLM
508
352
0
14 Nov 2023
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Yunfei Chu
Jin Xu
Xiaohuan Zhou
Qian Yang
Shiliang Zhang
Zhijie Yan
Chang Zhou
Jingren Zhou
AuLLM
317
595
0
14 Nov 2023
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System
Haoyuan Li
Hao Jiang
Tianke Zhang
Zhelun Yu
Aoxiong Yin
Hao Cheng
Siming Fu
Yuhao Zhang
Wanggui He
LLMAG
240
8
0
11 Nov 2023
How to Bridge the Gap between Modalities: Survey on Multimodal Large Language Model
Shezheng Song
Xiaopeng Li
Shasha Li
Shan Zhao
Jie Yu
Jun Ma
Xiaoguang Mao
Weimin Zhang
275
18
0
10 Nov 2023
TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models
Zhen Yang
Yingxue Zhang
Fandong Meng
Jie Zhou
VLM
MLLM
206
4
0
08 Nov 2023
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents
Shaoguang Mao
Yuzhe Cai
Yan Xia
Wenshan Wu
Xun Wang
Fengyi Wang
Tao Ge
Furu Wei
274
24
0
06 Nov 2023
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
International Conference on Machine Learning (ICML), 2023
Yufei Wang
Zhou Xian
Feng Chen
Tsun-Hsuan Wang
Yian Wang
Katerina Fragkiadaki
Zackory M. Erickson
David Held
Chuang Gan
LM&Ro
482
169
0
02 Nov 2023
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Wai-Chung Kwan
Xingshan Zeng
Yufei Wang
Yusen Sun
Liangyou Li
Lifeng Shang
Qun Liu
Kam-Fai Wong
ELM
286
13
0
30 Oct 2023
ControlLLM: Augment Language Models with Tools by Searching on Graphs
European Conference on Computer Vision (ECCV), 2023
Zhaoyang Liu
Zeqiang Lai
Zhangwei Gao
Erfei Cui
Ziheng Li
...
Lewei Lu
Qifeng Chen
Yu Qiao
Jifeng Dai
Wenhai Wang
MLLM
392
57
0
26 Oct 2023
Managing extreme AI risks amid rapid progress
Yoshua Bengio
Geoffrey Hinton
Andrew Yao
Dawn Song
Pieter Abbeel
...
Juil Sock
Stuart J. Russell
Daniel Kahneman
J. Brauner
Sören Mindermann
344
30
0
26 Oct 2023
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
Neural Information Processing Systems (NeurIPS), 2023
Ge Zheng
Bin Yang
Jiajin Tang
Hong-Yu Zhou
Sibei Yang
LRM
MLLM
314
181
0
25 Oct 2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models
Science China Information Sciences (Sci China Inf Sci), 2023
Xinglong Mao
Chaoyou Fu
Zhengye Zhang
Tong Xu
Hao Wang
Dianbo Sui
Chunjiang Ge
Ke Li
Xingguo Sun
Enhong Chen
VLM
MLLM
334
197
0
24 Oct 2023
A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
Journal of Computational Science and Technology (JCST), 2023
Wailing Ng
Yuanqin He
Xuefang Zhao
Hanlin Gu
Chen Zhang
Haijun Yang
Lixin Fan
Qiang Yang
212
7
0
24 Oct 2023
Large Language Models are Visual Reasoning Coordinators
Neural Information Processing Systems (NeurIPS), 2023
Liangyu Chen
Bo Li
Sheng Shen
Jingkang Yang
Chunyuan Li
Kurt Keutzer
Trevor Darrell
Ziwei Liu
VLM
LRM
276
91
0
23 Oct 2023
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
Yuchen Zhuang
Xiang Chen
Tong Yu
Saayan Mitra
Victor S. Bursztyn
Ryan Rossi
Somdeb Sarkhel
Chao Zhang
LLMAG
297
98
0
20 Oct 2023
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-Xiong Wang
430
47
0
19 Oct 2023
Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang
Tian Bian
Yian Yin
Qifeng Bai
Hong Cheng
Helen M. Meng
Zibin Zheng
Liang Chen
Bingzhe Wu
VLM
DiffM
242
6
0
18 Oct 2023
Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle
Xu Yang
Xiao Yang
Yuante Li
Jinhui Li
Peng Yu
Zeqi Ye
Jiang Bian
191
1
0
17 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
460
223
0
16 Oct 2023
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Odhran O'Donoghue
Aleksandar Shtedritski
John Ginger
Ralph Abboud
Ali E. Ghareeb
Justin Booth
Samuel G. Rodriques
292
29
0
16 Oct 2023
An Expression Tree Decoding Strategy for Mathematical Equation Generation
Wenqi Zhang
Yongliang Shen
Qingpeng Nong
Zeqi Tan
Zeqi Tan Yanna Ma
Weiming Lu
AIMat
328
7
0
14 Oct 2023
SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network
Lei Yao
Yong Zhang
Zilong Yan
Jialu Tian
163
4
0
13 Oct 2023
Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongru Wang
Minda Hu
Yang Deng
Rui Wang
Fei Mi
Weichao Wang
Yasheng Wang
Wai-Chung Kwan
Irwin King
Kam-Fai Wong
RALM
259
8
0
13 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
European Conference on Computer Vision (ECCV), 2023
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
293
80
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
102
28
0
12 Oct 2023
Towards Robust Multi-Modal Reasoning via Model Selection
International Conference on Learning Representations (ICLR), 2023
Xiangyan Liu
Rongxue Li
Wei Ji
Tao Lin
LLMAG
LRM
303
8
0
12 Oct 2023
GameGPT: Multi-agent Collaborative Framework for Game Development
Dake Chen
Haoyang Zhang
Hanbin Wang
Yunhao Huo
Yuzhao Li
Junjie Wang
LLMAG
303
33
0
12 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
222
14
0
11 Oct 2023
An Empirical Study of Instruction-tuning Large Language Models in Chinese
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Q. Si
Tong Wang
Zheng Lin
Xu Zhang
Yanan Cao
Weiping Wang
ALM
199
23
0
11 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
International Conference on Learning Representations (ICLR), 2023
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
240
88
0
10 Oct 2023
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task
Natural Language Processing and Chinese Computing (NLPCC), 2023
Guanting Dong
Jinxu Zhao
Tingfeng Hui
Daichi Guo
Wenlong Wan
...
Yueyan Qiu
Zhuoma Gongque
Keqing He
Zechen Wang
Weiran Xu
AAML
213
32
0
10 Oct 2023
ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
KAI-QING Zhou
Kwonjoon Lee
Teruhisa Misu
Xin Eric Wang
LRM
272
9
0
09 Oct 2023
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Robert Litschko
Max Müller-Eberstein
Rob van der Goot
Leon Weber
Barbara Plank
LRM
187
3
0
09 Oct 2023
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
LRM
216
27
0
08 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALM
KELM
236
80
0
08 Oct 2023
Previous
1
2
3
...
10
11
12
...
14
15
16
Next