Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17580
Cited By
v1
v2
v3
v4 (latest)
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"
50 / 751 papers shown
Title
SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts
Jiaqi Liu
Ronghao Fu
Lang Sun
Haoran Liu
Xiao Yang
Weipeng Zhang
Xu Na
Zhuoran Duan
Bo Yang
MoE
60
0
0
02 Dec 2025
STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls
Shubhi Asthana
Bing Zhang
Chad DeLuca
Ruchi Mahindru
Hima Patel
20
0
0
01 Dec 2025
COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis
Tsz-To Wong
Ching-Chun Huang
Hong-Han Shuai
AI4TS
276
0
0
01 Dec 2025
Energy-Aware Data-Driven Model Selection in LLM-Orchestrated AI Systems
Daria Smirnova
Hamid Nasiri
Marta Adamska
Zhengxin Yu
Peter Garraghan
12
0
0
30 Nov 2025
GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents
Xinyu Zhang
Yixin Wu
Boyang Zhang
Chenhao Lin
Chao Shen
Michael Backes
Y. Zhang
36
0
0
27 Nov 2025
Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning
Shanwei Fan
Bin Zhang
Zhiwei Xu
Yingxuan Teng
Siqi Dai
Lin Cheng
Guoliang Fan
136
0
0
26 Nov 2025
EWE: An Agentic Framework for Extreme Weather Analysis
Zhe Jiang
Jiong Wang
Xiaoyu Yue
Zijie Guo
Wenlong Zhang
Fenghua Ling
Wanli Ouyang
L. Bai
128
1
0
26 Nov 2025
VICoT-Agent: A Vision-Interleaved Chain-of-Thought Framework for Interpretable Multimodal Reasoning and Scalable Remote Sensing Analysis
Chujie Wang
Zhiyuan Luo
Ruiqi Liu
Can Ran
Shenghua Fan
Xi Chen
Chu He
LRM
157
0
0
25 Nov 2025
The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
Ziheng Ouyang
Yiren Song
Y. Liu
Shihao Zhu
Qibin Hou
Ming-Ming Cheng
Mike Zheng Shou
92
0
0
25 Nov 2025
Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization
Junhao Zhu
Lu Chen
Xiangyu Ke
Ziquan Fang
Tianyi Li
Yunjun Gao
Christian S. Jensen
90
0
0
25 Nov 2025
HuggingR
4
^{4}
4
: A Progressive Reasoning Framework for Discovering Optimal Model Companions
Shaoyin Ma
Jie Song
Huiqiong Wang
Li Sun
Mingli Song
LLMAG
285
0
0
24 Nov 2025
ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Dheeraj Kulshrestha
R. Ramnath
60
0
0
22 Nov 2025
REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing
Binger Chen
Tacettin Emre Bök
Behnood Rasti
Volker Markl
Begüm Demir
96
0
0
21 Nov 2025
ARISE: Agentic Rubric-Guided Iterative Survey Engine for Automated Scholarly Paper Generation
Zi Wang
Xingqiao Wang
Sangah Lee
Xiaowei Xu
72
0
0
21 Nov 2025
AutoBackdoor: Automating Backdoor Attacks via LLM Agents
Y. Li
Z. Li
Wei Zhao
Nay Myat Min
Hanxun Huang
Xingjun Ma
Jun Sun
AAML
LLMAG
SILM
346
0
0
20 Nov 2025
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Alexis Audran-Reiss
Jordi Armengol-Estapé
Karen Hambardzumyan
Amar Budhiraja
Martin Josifoski
...
Jenny Zhang
Taco Cohen
Yossi Adi
Tatiana Shavrina
Yoram Bachrach
116
2
0
19 Nov 2025
It's LIT! Reliability-Optimized LLMs with Inspectable Tools
Ruixin Zhang
J. Donnelly
Zhicheng Guo
Ghazal Khalighinejad
Haiyang Huang
A. Barnett
Cynthia Rudin
88
0
0
18 Nov 2025
AutoTool: Efficient Tool Selection for Large Language Model Agents
Jingyi Jia
Qinbin Li
LLMAG
132
0
0
18 Nov 2025
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist
Z. Liang
D. Zhang
Huichi Zhou
Rui Huang
Bobo Li
...
Shengqiong Wu
X. Wang
Jiebo Luo
Lizi Liao
Hao Fei
VGen
173
0
0
11 Nov 2025
Towards Resource-Efficient Multimodal Intelligence: Learned Routing among Specialized Expert Models
Mayank Saini
Arit Kumar Bishwas
MoE
110
0
0
09 Nov 2025
GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
Jiarui Feng
Donghong Cai
Yixin Chen
Muhan Zhang
100
1
0
06 Nov 2025
Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework
Varun V. Kumar
George Karniadakis
AI4CE
135
1
0
05 Nov 2025
PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework
Sina Montazeri
Yunhe Feng
Kewei Sha
LLMAG
AI4TS
137
0
0
04 Nov 2025
OceanAI: A Conversational Platform for Accurate, Transparent, Near-Real-Time Oceanographic Insights
Bowen Chen
Jayesh Gajbhar
Gregory Dusek
Rob Redmon
Patrick Hogan
Paul Liu
DelWayne Bohnenstiehl
Dongkuan Xu
Ruoying He
112
0
0
02 Nov 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&Ro
AI4CE
144
0
0
23 Oct 2025
ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling
Shuyuan Zhang
Chenhan Jiang
Zuoou Li
Jiankang Deng
104
0
0
20 Oct 2025
Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents
Yihong Tang
Kehai Chen
Liang Yue
Jinxin Fan
Caishen Zhou
...
Kaiyang Guo
Xingshan Zeng
Wenjing Cun
L. Shang
Min Zhang
LLMAG
138
0
0
20 Oct 2025
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
Rong Wu
Xiaoman Wang
Jianbiao Mei
Pinlong Cai
Daocheng Fu
...
Licheng Wen
Xuemeng Yang
Yufan Shen
Yuxin Wang
Botian Shi
84
3
0
17 Oct 2025
AUGUSTUS: An LLM-Driven Multimodal Agent System with Contextualized User Memory
Jitesh Jain
Shubham Maheshwari
Ning Yu
Wen-mei W. Hwu
Humphrey Shi
RALM
124
0
0
17 Oct 2025
Disaster Management in the Era of Agentic AI Systems: A Vision for Collective Human-Machine Intelligence for Augmented Resilience
Bo Li
Junwei Ma
Kai Yin
Yiming Xiao
Chia-Wei Hsu
Ali Mostafavi
184
1
0
16 Oct 2025
ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling
Jianghao Lin
Yuanyuan Shi
Xin Peng
Renjie Ding
Hairui Wang
...
Fengshuo Bai
Huacan Chai
Weinan Zhang
Fei Huang
Y. Wen
112
0
0
16 Oct 2025
GOAT: A Training Framework for Goal-Oriented Agent with Tools
Hyunji Min
Sangwon Jung
Junyoung Sung
Dosung Lee
Leekyeung Han
Paul Hongsuck Seo
LLMAG
ALM
132
0
0
14 Oct 2025
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
Dongsen Zhang
Zekun Li
Xu Luo
Xuannan Liu
Peipei Li
Wenjun Xu
ELM
130
1
0
14 Oct 2025
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo
Habtom Kahsay Gidey
Alexander Lenz
Alois Knoll
LLMAG
LM&Ro
176
2
0
10 Oct 2025
MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
Tajamul Ashraf
Umair Nawaz
Abdelrahman M. Shaker
Rao Muhammad Anwer
Philip Torr
Fahad Shahbaz Khan
Salman Khan
190
0
0
09 Oct 2025
A
2
^2
2
Search: Ambiguity-Aware Question Answering with Reinforcement Learning
Fengji Zhang
Xinyao Niu
Chengyang Ying
Guancheng Lin
Zhongkai Hao
Zhou Fan
Chengen Huang
J. Keung
B. Chen
Junyang Lin
84
0
0
09 Oct 2025
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
Shuo Xing
Soumik Dey
Mingyang Wu
Ashirbad Mishra
Naveen Ravipati
Binbin Li
Hansi Wu
Zhengzhong Tu
163
1
0
09 Oct 2025
MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration
Lu Liu
Chunlei Cai
Shaocheng Shen
Jianfeng Liang
Weimin Ouyang
...
Huiyu Duan
Jiangchao Yao
X. Zhang
Q. Hu
Guangtao Zhai
VGen
165
4
0
09 Oct 2025
ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
Yunzhong Xiao
Yangmin Li
Hewei Wang
Yunlong Tang
Zora Zhiruo Wang
65
0
0
08 Oct 2025
Adaptive Tool Generation with Models as Tools and Reinforcement Learning
Chenpeng Wang
Xiaojie Cheng
Chunye Wang
L. Yang
Lei Zhang
LRM
93
0
0
08 Oct 2025
FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
Haotian Wu
Shufan Jiang
Chios Chen
Yiyang Feng
Hehai Lin
Heqing Zou
Yao Shu
Y. Li
AI4CE
229
0
0
08 Oct 2025
Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions
Y. Zhang
Xinhao Deng
Zhongyi Gu
Yihao Chen
Ke Xu
Qi Li
Jianping Wu
84
2
0
08 Oct 2025
AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems
Bo Ma
Hang Li
ZeHua Hu
XiaoFan Gui
LuYao Liu
Simon Liu
LRM
105
0
0
03 Oct 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM
ObjD
VLM
LRM
201
2
0
30 Sep 2025
XR Blocks: Accelerating Human-centered AI + XR Innovation
David Li
Nels Numan
Xun Qian
Yanhe Chen
Zhongyi Zhou
...
Michelle Huynh
Konrad Piascik
Ricardo Cabello
David Kim
Ruofei Du
116
1
0
29 Sep 2025
From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models
Chenyue Zhou
Mingxuan Wang
Yanbiao Ma
Chenxu Wu
Wanyi Chen
...
Guoli Jia
Lingling Li
Z. Lu
Y. Lu
Wenhan Luo
LRM
407
9
0
29 Sep 2025
Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction
Qimin Zhong
Hao Liao
Siwei Wang
Mingyang Zhou
X. Wu
Rui Mao
Wei Chen
182
0
0
27 Sep 2025
CoFFT: Chain of Foresight-Focus Thought for Visual Language Models
Xinyu Zhang
Yuxuan Dong
L. Zhang
Chengyou Jia
Zhuohang Dang
Basura Fernando
Jun Liu
Mike Zheng Shou
LRM
212
0
0
26 Sep 2025
Thinking with Sound: Audio Chain-of-Thought Enables Multimodal Reasoning in Large Audio-Language Models
Zhen Xiong
Yujun Cai
Zhecheng Li
Junsong Yuan
Yiwei Wang
AuLLM
LRM
235
1
0
26 Sep 2025
VC-Agent: An Interactive Agent for Customized Video Dataset Collection
Yidan Zhang
Mutian Xu
Yiming Hao
Kun Zhou
Jiahao Chang
Xiaoqiang Liu
Pengfei Wan
Hongbo Fu
Xiaoguang Han
VGen
160
0
0
25 Sep 2025
1
2
3
4
...
14
15
16
Next