v1v2v3v4 (latest)

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Yongliang Shen

Kaitao Song

Xu Tan

Dongsheng Li

Weiming Lu

Yueting Zhuang

MLLM

ArXiv (abs)PDF HTML HuggingFace (12 upvotes)

Papers citing "HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"

50 / 754 papers shown

PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement

293

04 Dec 2025

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

Christos Thrampoulidis

Xiaoxiao Li

03 Dec 2025

Measuring Agents in Production

...

02 Dec 2025

SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts

02 Dec 2025

STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls

01 Dec 2025

COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis

368

01 Dec 2025

Energy-Aware Data-Driven Model Selection in LLM-Orchestrated AI Systems

30 Nov 2025

GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents

27 Nov 2025

EWE: An Agentic Framework for Extreme Weather Analysis

165

26 Nov 2025

Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Shanwei Fan

Bin Zhang

Zhiwei Xu

Yingxuan Teng

Siqi Dai

Lin Cheng

Guoliang Fan

161

26 Nov 2025

Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization

138

25 Nov 2025

The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

129

25 Nov 2025

VICoT-Agent: A Vision-Interleaved Chain-of-Thought Framework for Interpretable Multimodal Reasoning and Scalable Remote Sensing Analysis

319

25 Nov 2025

$HuggingR$^{4}$: A Progressive Reasoning Framework for Discovering Optimal Model Companions$

HuggingR

^{4}

: A Progressive Reasoning Framework for Discovering Optimal Model Companions

321

24 Nov 2025

ARIAL: An Agentic Framework for Document VQA with Precise Answer Localization

Ahmad Mohammadshirazi

Pinaki Prasad Guha Neogi

Dheeraj Kulshrestha

R. Ramnath

121

22 Nov 2025

ARISE: Agentic Rubric-Guided Iterative Survey Engine for Automated Scholarly Paper Generation

21 Nov 2025

REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing

112

21 Nov 2025

AutoBackdoor: Automating Backdoor Attacks via LLM Agents

395

20 Nov 2025

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Alexis Audran-Reiss

Jordi Armengol-Estapé

...

174

19 Nov 2025

It's LIT! Reliability-Optimized LLMs with Inspectable Tools

105

18 Nov 2025

AutoTool: Efficient Tool Selection for Large Language Model Agents

Jingyi Jia

Qinbin Li

LLMAG

155

18 Nov 2025

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

...

204

11 Nov 2025

Towards Resource-Efficient Multimodal Intelligence: Learned Routing among Specialized Expert Models

Mayank Saini

Arit Kumar Bishwas

MoE

119

09 Nov 2025

GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models

132

06 Nov 2025

Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework

Varun V. Kumar

George Karniadakis

AI4CE

186

05 Nov 2025

PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework

168

04 Nov 2025

OceanAI: A Conversational Platform for Accurate, Transparent, Near-Real-Time Oceanographic Insights

DelWayne Bohnenstiehl

Dongkuan Xu

Ruoying He

149

02 Nov 2025

Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges

Andrea Agiollo

Andrea Omicini

LM&Ro AI4CE

166

23 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

158

20 Oct 2025

ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling

143

20 Oct 2025

AUGUSTUS: An LLM-Driven Multimodal Agent System with Contextualized User Memory

145

17 Oct 2025

EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle

...

112

17 Oct 2025

Disaster Management in the Era of Agentic AI Systems: A Vision for Collective Human-Machine Intelligence for Augmented Resilience

223

16 Oct 2025

ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling

...

124

16 Oct 2025

GOAT: A Training Framework for Goal-Oriented Agent with Tools

148

14 Oct 2025

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

187

14 Oct 2025

Fundamentals of Building Autonomous LLM Agents

Victor de Lamo Castrillo

204

10 Oct 2025

^2

Search: Ambiguity-Aware Question Answering with Reinforcement Learning

106

09 Oct 2025

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Tajamul Ashraf

Umair Nawaz

Abdelrahman M. Shaker

224

09 Oct 2025

Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization

179

09 Oct 2025

MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration

...

176

09 Oct 2025

ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory

08 Oct 2025

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

108

08 Oct 2025

Adaptive Tool Generation with Models as Tools and Reinforcement Learning

122

08 Oct 2025

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline

269

08 Oct 2025

AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems

127

03 Oct 2025

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

222

30 Sep 2025

XR Blocks: Accelerating Human-centered AI + XR Innovation

...

125

29 Sep 2025

From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models

...

453

29 Sep 2025

Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction

211

27 Sep 2025