Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17580
Cited By
v1
v2
v3
v4 (latest)
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"
50 / 751 papers shown
Title
Integrating Planning into Single-Turn Long-Form Text Generation
Yi Liang
You Wu
Honglei Zhuang
Li Chen
Jiaming Shen
...
Zhen Qin
Sumit Sanghai
Xuanhui Wang
Carl Yang
Michael Bendersky
202
6
0
08 Oct 2024
AgentSquare: Automatic LLM Agent Search in Modular Design Space
International Conference on Learning Representations (ICLR), 2024
Yu Shang
Yu Li
Keyu Zhao
Likai Ma
Qingbin Liu
Fengli Xu
Yong Li
LLMAG
461
49
0
08 Oct 2024
Hyperbolic Fine-tuning for Large Language Models
Menglin Yang
Aosong Feng
Bo Xiong
Jihong Liu
Irwin King
Rex Ying
293
10
0
05 Oct 2024
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation
Neural Information Processing Systems (NeurIPS), 2024
Hongcheng Wang
Peiqi Liu
Wenzhe Cai
Mingdong Wu
Zhengyu Qian
Hao Dong
275
4
0
04 Oct 2024
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML
Patara Trirat
Wonyong Jeong
Sung Ju Hwang
LLMAG
342
49
0
03 Oct 2024
Agent-Oriented Planning in Multi-Agent Systems
International Conference on Learning Representations (ICLR), 2024
Ao Li
Yuexiang Xie
Songze Li
Fugee Tsung
Bolin Ding
Yaliang Li
AIFin
884
19
0
03 Oct 2024
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal
Adi Haviv
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Gal Chechik
DiffM
128
8
0
02 Oct 2024
Moral Alignment for LLM Agents
International Conference on Learning Representations (ICLR), 2024
Elizaveta Tennant
Stephen Hailes
Mirco Musolesi
455
21
0
02 Oct 2024
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
Ahmed Hussein
Alaa Elsetohy
Sama Hadhoud
Tameem Bakr
Yasser Rohaim
Badr AlKhamissi
VLM
171
1
0
01 Oct 2024
Dynamic Planning for LLM-based Graphical User Interface Automation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Shaoqing Zhang
Zhuosheng Zhang
Kehai Chen
Xinbei Ma
Muyun Yang
Tiejun Zhao
Min Zhang
LLMAG
176
18
0
01 Oct 2024
Recent Advances in Speech Language Models: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Wenqian Cui
Dianzhi Yu
Xiaoqi Jiao
Ziqiao Meng
Guangyan Zhang
Qichao Wang
Yiwen Guo
Irwin King
AuLLM
473
62
0
01 Oct 2024
Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface
Qingfeng Lan
Mengting Wan
Shashank Vadrevu
Ryan Nadel
Yongfeng Zhang
Chi Wang
LLMAG
166
8
0
30 Sep 2024
Mitigating the Negative Impact of Over-association for Conversational Query Production
Information Processing & Management (IPM), 2024
Ante Wang
Linfeng Song
Zijun Min
Ge Xu
Xiaoli Wang
Junfeng Yao
Jinsong Su
308
4
0
29 Sep 2024
MIO: A Foundation Model on Multimodal Tokens
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
...
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLM
AuLLM
422
20
0
26 Sep 2024
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
Govind Pimpale
Arjun Panickssery
Marius Hobbhahn
Jérémy Scheurer
277
4
0
24 Sep 2024
LLM With Tools: A Survey
Zhuocheng Shen
204
35
0
24 Sep 2024
SwiftDossier: Tailored Automatic Dossier for Drug Discovery with LLMs and Agents
Gabriele Fossi
Youssef Boulaimen
Leila Outemzabet
Nathalie Jeanray
Stephane Gerart
Sebastien Vachenc
Joanna Giemza
Salvatore Raieli
159
4
0
24 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions, and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
438
1
0
23 Sep 2024
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources
Shuting Yang
Zehui Liu
Wolfgang Mayer
RALM
161
8
0
20 Sep 2024
LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Design of Multi Active/Passive Core-Agent Architectures
Information Fusion (Inf. Fusion), 2024
Amine B. Hassouna
Hana Chaari
Ines Belhaj
LLMAG
307
8
0
17 Sep 2024
Aligning AI with Public Values: Deliberation and Decision-Making for Governing Multimodal LLMs in Political Video Analysis
Tanusree Sharma
Wenbo Guo
Zachary Kilhoffer
Yun Huang
Dawn Song
Yang Wang
237
6
0
15 Sep 2024
WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks
Jingwen Tong
Jiawei Shao
Qiong Wu
Wei Guo
Zijian Li
Zehong Lin
Jun Zhang
LLMAG
LM&Ro
205
30
0
12 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
715
54
0
10 Sep 2024
Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models
Natural Language Processing and Chinese Computing (NLPCC), 2024
Xingyun Hong
Yan Shao
Zhilin Wang
Manni Duan
Jin Xiongnan
217
0
0
09 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Neural Information Processing Systems (NeurIPS), 2024
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
237
9
0
06 Sep 2024
Recent Advances in Attack and Defense Approaches of Large Language Models
Jing Cui
Yishi Xu
Zhewei Huang
Shuchang Zhou
Jianbin Jiao
Junge Zhang
PILM
AAML
339
8
0
05 Sep 2024
TinyAgent: Function Calling at the Edge
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Lutfi Eren Erdogan
Nicholas Lee
Siddharth Jha
Sehoon Kim
Ryan Tabrizi
Suhong Moon
Coleman Hooper
Gopala Anumanchipalli
Kurt Keutzer
Amir Gholami
LLMAG
386
35
0
01 Sep 2024
GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) with Intelligent Transportation Systems
Haowen Xu
Jinghui Yuan
Anye Zhou
Guanhao Xu
Wan Li
Xuegang Ban
Xinyue Ye
221
16
0
31 Aug 2024
Enhancing SQL Query Generation with Neurosymbolic Reasoning
Henrijs Princis
Cristina David
Alan Mycroft
153
4
0
25 Aug 2024
EMO-LLaMA: Enhancing Facial Emotion Understanding with Instruction Tuning
Bohao Xing
Zitong Yu
Xin Liu
Kaishen Yuan
Qilang Ye
Weicheng Xie
Huanjing Yue
Jingyu Yang
Heikki Kälviäinen
185
23
0
21 Aug 2024
PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Tri Cao
Chengyu Huang
Yuexin Li
Huilin Wang
Amy He
Nay Oo
Bryan Hooi
LLMAG
OffRL
356
24
0
20 Aug 2024
Visual Agents as Fast and Slow Thinkers
International Conference on Learning Representations (ICLR), 2024
Guangyan Sun
Haoyang Ling
Zhenting Wang
Cheng-Long Wang
Siqi Ma
Qifan Wang
Ying Nian Wu
Ying Nian Wu
Dongfang Liu
Dongfang Liu
LLMAG
LRM
486
42
0
16 Aug 2024
Can Large Language Models Reason? A Characterization via 3-SAT
Rishi Hazra
Gabriele Venturato
Pedro Zuidberg Dos Martires
Luc de Raedt
ELM
ReLM
LRM
217
15
0
13 Aug 2024
Proficient Graph Neural Network Design by Accumulating Knowledge on Large Language Models
Jialiang Wang
Hanmo Liu
Hanmo Liu
Zhili Wang
Jiachuan Wang
Lei Chen
Xiaofang Zhou
243
0
0
13 Aug 2024
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
Yuchen Dong
Xiaoxiang Fang
Yuchen Hu
Renshuang Jiang
Zhe Jiang
214
0
0
07 Aug 2024
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data
Calvin Tan
Jerome Wang
ALM
257
5
0
07 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yanjie Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
292
6
0
05 Aug 2024
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
Haolin Jin
Linghan Huang
Haipeng Cai
Jun Yan
Bo Li
Huaming Chen
366
77
0
05 Aug 2024
Automated Phishing Detection Using URLs and Webpages
Huilin Wang
Bryan Hooi
250
5
0
03 Aug 2024
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
307
1
0
02 Aug 2024
A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks
Jiaqi Wang
Hanqi Jiang
Yi-Hsueh Liu
Chong Ma
Xu-Yao Zhang
...
Shu Zhang
Wei Zhang
Dinggang Shen
Tianming Liu
Shu Zhang
VLM
AI4TS
259
80
0
02 Aug 2024
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
AAAI Conference on Artificial Intelligence (AAAI), 2024
Honglei Miao
Fan Ma
Ruijie Quan
Kun Zhan
Yi Yang
AAML
265
8
0
01 Aug 2024
ReplanVLM: Replanning Robotic Tasks with Visual Language Models
Aoran Mei
Guo-Niu Zhu
Huaxiang Zhang
Zhongxue Gan
169
38
0
31 Jul 2024
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language
Hossein Rajaby Faghihi
Aliakbar Nafar
Andrzej Uszok
Hamid Karimian
Parisa Kordjamshidi
187
2
0
30 Jul 2024
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
Zuyan Liu
Benlin Liu
Jiahui Wang
Yuhao Dong
Guangyi Chen
Yongming Rao
Ranjay Krishna
Jiwen Lu
VLM
198
25
0
25 Jul 2024
CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
Shougao Zhang
Mengqi Zhou
Yuxi Wang
Chuanchen Luo
Rongyu Wang
Yiwei Li
Xucheng Yin
Zhaoxiang Zhang
Junran Peng
266
17
0
24 Jul 2024
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou
Yanchao Yang
230
2
0
21 Jul 2024
Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights
Yanxi Chen
Yaliang Li
Bolin Ding
Jingren Zhou
219
8
0
20 Jul 2024
KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models
Kemou Jiang
Xuan Cai
Zhiyong Cui
Aoyong Li
Yilong Ren
Haiyang Yu
Hao Yang
Daocheng Fu
Licheng Wen
Pinlong Cai
LLMAG
174
18
0
19 Jul 2024
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Guoli Yin
Haoping Bai
Shuang Ma
Feng Nan
Yanchao Sun
...
Xiaoming Wang
Jiulong Shan
Meng Cao
Ruoming Pang
Zirui Wang
LLMAG
ELM
303
12
0
18 Jul 2024
Previous
1
2
3
...
5
6
7
...
14
15
16
Next