Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17580
Cited By
v1
v2
v3
v4 (latest)
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"
50 / 753 papers shown
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Zongyu Lin
Yao Tang
Xingcheng Yao
Da Yin
Ziniu Hu
Zhaoxin Fan
Kai-Wei Chang
LRM
506
11
0
04 Feb 2025
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Knowledge Discovery and Data Mining (KDD), 2023
Jingwei Yi
Yueqi Xie
Bin Zhu
Emre Kiciman
Guangzhong Sun
Xing Xie
Fangzhao Wu
AAML
451
149
0
28 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
517
34
0
23 Jan 2025
FaceOracle: Chat with a Face Image Oracle
Wassim Kabbani
Kiran Raja
Raghavendra Ramachandra
C. Busch
CVBM
182
1
0
13 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Jiayi Zhang
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
461
33
0
06 Jan 2025
AI Agent for Education: von Neumann Multi-Agent System Framework
Yuan-Hao Jiang
Ruijia Li
Yizhou Zhou
Changyong Qi
Hanglei Hu
Yuang Wei
Bo Jiang
Yonghe Wu
LLMAG
423
15
0
03 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
606
19
0
03 Jan 2025
Towards Sustainable Large Language Model Serving
ACM SIGEnergy Energy Informatics Review (SEIR), 2024
Sophia Nguyen
Beihao Zhou
Yi Ding
Sihang Liu
484
26
0
31 Dec 2024
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Neural Information Processing Systems (NeurIPS), 2024
Hao Fei
Shengqiong Wu
Hao Zhang
Tat-Seng Chua
Shuicheng Yan
475
74
0
31 Dec 2024
GAIS: A Novel Approach to Instance Selection with Graph Attention Networks
Zahiriddin Rustamov
Ayham Zaitouny
Rafat Damseh
Nazar Zaki
282
2
0
26 Dec 2024
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues
Se Jin Park
Yeonju Kim
Hyeongseop Rha
Bella Godiva
Y. Ro
152
2
0
23 Dec 2024
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization
Tan-Hanh Pham
Hoang-Nam Le
Phu-Vinh Nguyen
Chris Ngo
Truong-Son Hy
AuLLM
LRM
265
1
0
21 Dec 2024
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
Dimitrios Mallis
Ahmet Serdar Karadeniz
Sebastian Cavada
Danila Rukhovich
Niki Maria Foteinopoulou
K. Cherenkova
Anis Kacem
Djamila Aouada
603
15
0
18 Dec 2024
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Junjie Lin
Jian Zhao
Lin Liu
Yue Deng
Youpeng Zhao
Lanxiao Huang
Xia Lin
Wengang Zhou
Haoyang Li
260
2
0
16 Dec 2024
Olympus: A Universal Task Router for Computer Vision Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Juil Sock
VLM
ObjD
1.2K
2
0
12 Dec 2024
ChatDyn: Language-Driven Multi-Actor Dynamics Generation in Street Scenes
Yuxi Wei
Jingbo Wang
Yuwen Du
Dingju Wang
Liang Pan
Chenxin Xu
Yao Feng
Bo Dai
Siheng Chen
AI4CE
291
2
0
11 Dec 2024
Simulating Human-like Daily Activities with Desire-driven Autonomy
International Conference on Learning Representations (ICLR), 2024
Yiding Wang
Yuxuan Chen
Fangwei Zhong
Long Ma
Yizhou Wang
471
13
0
09 Dec 2024
Language Model as Visual Explainer
Neural Information Processing Systems (NeurIPS), 2024
Xingyi Yang
Xinchao Wang
VLM
208
1
0
08 Dec 2024
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
Bingchen Li
Xin Li
Yiting Lu
Zhibo Chen
597
1
0
05 Dec 2024
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
The Web Conference (WWW), 2024
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
362
1
0
01 Dec 2024
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
Zhihao Sun
Haoran Jiang
Haoran Chen
Yixin Cao
Jiaqi Leng
Zuxuan Wu
Yu-Gang Jiang
277
10
0
29 Nov 2024
Action Engine: Automatic Workflow Generation in FaaS
Future generations computer systems (FGCS), 2024
Akiharu Esashi
Pawissanutt Lertpongrujikorn
Shinji Kato
M. Salehi
328
0
0
29 Nov 2024
Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024
Tsun-hin Cheung
Ka-Chun Fung
Songjiang Lai
Kwan-Ho Lin
Vincent To-Yee NG
K. Lam
233
0
0
28 Nov 2024
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Computer Vision and Pattern Recognition (CVPR), 2024
Alice Heiman
Xiaoman Zhang
E. Chen
Sung Eun Kim
Pranav Rajpurkar
HILM
MedIm
652
5
0
27 Nov 2024
Autonomous Imagination: Closed-Loop Decomposition of Visual-to-Textual Conversion in Visual Reasoning for Multimodal Large Language Models
Qingbin Liu
Yumeng Li
Boyuan Xiao
Yichang Jian
Ziang Qin
Tianjia Shao
Yao-Xiang Ding
Kun Zhou
LRM
MLLM
511
4
0
27 Nov 2024
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Duo Wu
Jiangming Wang
Yuan Meng
Yanning Zhang
Le Sun
Zhi Wang
1.2K
2
0
25 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
International Conference on Learning Representations (ICLR), 2024
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
578
68
0
20 Nov 2024
Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms
Minghe Gao
Wendong Bu
Bingchen Miao
Yang Wu
Yunfei Li
Juncheng Billy Li
Siliang Tang
Qi Wu
Yueting Zhuang
Meng Wang
LM&Ro
311
7
0
17 Nov 2024
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning
Jingru Yang
Huan Yu
Yang Jingxin
C. Xu
Yin Biao
Yu Sun
Shengfeng He
136
1
0
15 Nov 2024
Spider: Any-to-Many Multimodal LLM
Jinxiang Lai
Jie Zhang
Jun Liu
Jian Li
Xiaocheng Lu
Song Guo
MLLM
528
4
0
14 Nov 2024
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey
Yang Gu
Hengyu You
Jian Cao
Muran Yu
Haoran Fan
Shiyou Qian
LM&MA
AI4CE
413
10
0
11 Nov 2024
CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation
International Conference on Learning Representations (ICLR), 2024
Jie Liu
Pan Zhou
Yingjun Du
Ah-Hwee Tan
Cees G. M. Snoek
Jan-Jakob Sonke
E. Gavves
LLMAG
378
8
0
07 Nov 2024
Understanding Generative AI in Robot Logic Parametrization
Yuna Hwang
Arissa J. Sato
Pragathi Praveena
N. White
Bilge Mutlu
LM&Ro
156
2
0
06 Nov 2024
Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
BigData Congress [Services Society] (BSS), 2024
Yu Pan
Jianxin Sun
Hongfeng Yu
Joe Luck
Geng Bai
Nipuna Chamara
Yufeng Ge
Tala Awada
251
3
0
31 Oct 2024
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
Shenghao Xie
Wenqiang Zu
Mingyang Zhao
Duo Su
Shilong Liu
Ruohua Shi
Guoqi Li
Shanghang Zhang
Lei Ma
LRM
448
11
0
29 Oct 2024
Improving In-Context Learning with Small Language Model Ensembles
M. Mehdi Mojarradi
Lingyi Yang
Robert McCraith
Adam Mahdi
179
6
0
29 Oct 2024
What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
Neural Information Processing Systems (NeurIPS), 2024
L. Qin
Qiguang Chen
Hao Fei
Zhi Chen
Min Li
Wanxiang Che
207
26
0
27 Oct 2024
Language Agents Meet Causality -- Bridging LLMs and Causal World Models
John Gkountouras
Matthias Lindemann
Phillip Lippe
E. Gavves
Ivan Titov
LRM
262
5
0
25 Oct 2024
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks
Graziano A. Manduzio
Federico A. Galatolo
M. G. Cimino
Enzo Pasquale Scilingo
Lorenzo Cominelli
LRM
182
7
0
24 Oct 2024
An Intelligent Agentic System for Complex Image Restoration Problems
International Conference on Learning Representations (ICLR), 2024
Kaiwen Zhu
Jinjin Gu
Zhiyuan You
Yu Qiao
Chao Dong
486
24
0
23 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
402
87
0
21 Oct 2024
NetSafe: Exploring the Topological Safety of Multi-agent Networks
Miao Yu
Shilong Wang
Guibin Zhang
Junyuan Mao
Chenlong Yin
Qijiong Liu
Qingsong Wen
Kun Wang
Yang Wang
281
25
0
21 Oct 2024
Who is Undercover? Guiding LLMs to Explore Multi-Perspective Team Tactic in the Game
Ruiqi Dong
Zhixuan Liao
Guangwei Lai
Yuhan Ma
Danni Ma
Chenyou Fan
LLMAG
206
1
0
20 Oct 2024
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Zhuoran Liu
Danpei Zhao
Bo Yuan
316
9
0
17 Oct 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
International Conference on Learning Representations (ICLR), 2024
Mingyang Chen
Haoze Sun
Tianpeng Li
Fan Yang
Hao Liang
Keer Lu
Tengjiao Wang
Wentao Zhang
Guosheng Dong
Weipeng Chen
LRM
336
22
0
16 Oct 2024
LLM-SmartAudit: Advanced Smart Contract Vulnerability Detection
Zhiyuan Wei
Jing Sun
Zijiang Zhang
Xianhao Zhang
Meng Li
Zhe Hou
267
24
0
12 Oct 2024
DAWN: Designing Distributed Agents in a Worldwide Network
IEEE Access (IEEE Access), 2024
Zahra Aminiranjbar
Jianan Tang
Qiudan Wang
Shubha Pant
Mahesh Viswanathan
AI4CE
LLMAG
410
6
0
11 Oct 2024
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
Konstantina Christakopoulou
Shibl Mourad
Maja Matarić
LLMAG
218
21
0
10 Oct 2024
Agent S: An Open Agentic Framework that Uses Computers Like a Human
International Conference on Learning Representations (ICLR), 2024
Saaket Agashe
Jiuzhou Han
Shuyu Gan
Jiachen Yang
Ang Li
Xin Eric Wang
LLMAG
LM&Ro
AIFin
236
96
0
10 Oct 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
International Conference on Learning Representations (ICLR), 2024
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Ji-Rong Wen
351
24
0
10 Oct 2024
Previous
1
2
3
4
5
6
...
14
15
16
Next