ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17580
  4. Cited By
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging
  Face
v1v2v3v4 (latest)

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
    MLLM
ArXiv (abs)PDFHTMLHuggingFace (12 upvotes)

Papers citing "HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"

50 / 763 papers shown
Automated Phishing Detection Using URLs and Webpages
Automated Phishing Detection Using URLs and Webpages
Huilin Wang
Bryan Hooi
281
6
0
03 Aug 2024
NOLO: Navigate Only Look Once
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
323
2
0
02 Aug 2024
A Comprehensive Review of Multimodal Large Language Models: Performance
  and Challenges Across Different Tasks
A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks
Jiaqi Wang
Hanqi Jiang
Yi-Hsueh Liu
Chong Ma
Xu-Yao Zhang
...
Shu Zhang
Wei Zhang
Dinggang Shen
Tianming Liu
Shu Zhang
VLMAI4TS
295
86
0
02 Aug 2024
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
Autonomous LLM-Enhanced Adversarial Attack for Text-to-MotionAAAI Conference on Artificial Intelligence (AAAI), 2024
Honglei Miao
Fan Ma
Ruijie Quan
Kun Zhan
Yi Yang
AAML
289
8
0
01 Aug 2024
ReplanVLM: Replanning Robotic Tasks with Visual Language Models
ReplanVLM: Replanning Robotic Tasks with Visual Language Models
Aoran Mei
Guo-Niu Zhu
Huaxiang Zhang
Zhongxue Gan
190
42
0
31 Jul 2024
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural
  Language
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language
Hossein Rajaby Faghihi
Aliakbar Nafar
Andrzej Uszok
Hamid Karimian
Parisa Kordjamshidi
228
3
0
30 Jul 2024
Efficient Inference of Vision Instruction-Following Models with Elastic
  Cache
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
Zuyan Liu
Benlin Liu
Jiahui Wang
Yuhao Dong
Guangyi Chen
Yongming Rao
Ranjay Krishna
Jiwen Lu
VLM
218
26
0
25 Jul 2024
CityX: Controllable Procedural Content Generation for Unbounded 3D
  Cities
CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
Shougao Zhang
Mengqi Zhou
Yuxi Wang
Chuanchen Luo
Rongyu Wang
Yiwei Li
Xucheng Yin
Zhaoxiang Zhang
Junran Peng
334
18
0
24 Jul 2024
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept
  Discovery
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou
Yanchao Yang
251
2
0
21 Jul 2024
Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights
Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights
Yanxi Chen
Yaliang Li
Bolin Ding
Jingren Zhou
263
8
0
20 Jul 2024
KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with
  Large Language Models
KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models
Kemou Jiang
Xuan Cai
Zhiyong Cui
Aoyong Li
Yilong Ren
Haiyang Yu
Hao Yang
Daocheng Fu
Licheng Wen
Pinlong Cai
LLMAG
201
21
0
19 Jul 2024
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Guoli Yin
Haoping Bai
Shuang Ma
Feng Nan
Yanchao Sun
...
Xiaoming Wang
Jiulong Shan
Meng Cao
Ruoming Pang
Zirui Wang
LLMAGELM
331
12
0
18 Jul 2024
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
Jie Yang
Xuesong Niu
Nan Jiang
Ruimao Zhang
Siyuan Huang
234
22
0
17 Jul 2024
Large Language Models as Biomedical Hypothesis Generators: A
  Comprehensive Evaluation
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Biqing Qi
Kaiyan Zhang
Kai Tian
Haoxiang Li
Zhang-Ren Chen
Sihang Zeng
Ermo Hua
Hu Jinfang
Bowen Zhou
LM&MA
392
33
0
12 Jul 2024
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in
  LLM-Empowered Autonomous Agents
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents
Haoyi Xiong
Zhiyuan Wang
Xuhong Li
Jiang Bian
Bo Han
Shahid Mumtaz
Laura E. Barnes
LLMAG
596
14
0
11 Jul 2024
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks
  with Large Language Models
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
Logan Cross
Robert Z. Sparks
Agam Bhatia
Daniel L. K. Yamins
Nick Haber
LM&RoLRMLLMAG
297
22
0
09 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for
  Collaborative Intelligence
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
332
67
0
09 Jul 2024
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
Zhenyu Guan
Xiangyu Kong
Fangwei Zhong
Yizhou Wang
240
26
0
09 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and
  Editing
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLMDiffM
326
86
0
08 Jul 2024
LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual
  Contexts
LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts
Yijia Xiao
Edward Sun
Tianyu Liu
Wei Wang
LRM
244
113
0
06 Jul 2024
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and
  Aleatoric Awareness
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Khyathi Chandu
Linjie Li
Anas Awadalla
Ximing Lu
Jae Sung Park
Jack Hessel
Lijuan Wang
Yejin Choi
323
6
0
02 Jul 2024
VSP: Assessing the dual challenges of perception and reasoning in
  spatial planning tasks for VLMs
VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Qiucheng Wu
Handong Zhao
Michael Stephen Saxon
T. Bui
William Yang Wang
Yang Zhang
Shiyu Chang
CoGe
220
18
0
02 Jul 2024
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models:
  Enhancing Performance and Reducing Inference Costs
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
Enshu Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Matthew B. Blaschko
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MoE
263
22
0
01 Jul 2024
Teola: Towards End-to-End Optimization of LLM-based Applications
Teola: Towards End-to-End Optimization of LLM-based Applications
Xin Tan
Yimin Jiang
Yitao Yang
Hong-Yu Xu
643
14
0
29 Jun 2024
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
Haiyang Shen
Yue Li
Desong Meng
Dongqi Cai
Sheng Qi
Li Zhang
Mengwei Xu
Xuhui Liu
LLMAG
387
27
0
28 Jun 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
D. Yin
Sumi Helal
353
83
0
28 Jun 2024
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning
  Graph
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
229
24
0
25 Jun 2024
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang
Chancharik Mitra
Assaf Arbelle
Leonid Karlinsky
Trevor Darrell
Roei Herzig
237
36
0
21 Jun 2024
Transferable speech-to-text large language model alignment module
Transferable speech-to-text large language model alignment moduleInterspeech (Interspeech), 2024
Boyong Wu
Chao Yan
Haoran Pu
152
0
0
19 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for
  LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM AgentsNeural Information Processing Systems (NeurIPS), 2024
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAGAAML
424
82
1
19 Jun 2024
Automatic benchmarking of large multimodal models via iterative
  experiment programming
Automatic benchmarking of large multimodal models via iterative experiment programming
Alessandro Conti
Enrico Fini
Paolo Rota
Yiming Wang
Goran Frehse
Elisa Ricci
243
1
0
18 Jun 2024
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Jianyi Zhang
Jiuxiang Gu
Jiuxiang Gu
Curtis Wigington
Tong Yu
Yiran Chen
Tong Sun
Ruiyi Zhang
261
1
0
17 Jun 2024
Towards Vision-Language Geo-Foundation Model: A Survey
Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou
Xue Jiang
Yiping Ke
245
37
0
13 Jun 2024
GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
Quanfeng Lu
Wenqi Shao
Zitao Liu
Lingxiao Du
Fanqing Meng
Boxuan Li
Botong Chen
Siyuan Huang
Kaipeng Zhang
Ping Luo
379
103
0
12 Jun 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CELLMAGLM&Ro
475
133
0
11 Jun 2024
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
Sijia Chen
Yibo Wang
Yi-Feng Wu
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
Lijun Zhang
LLMAGLRM
413
36
0
11 Jun 2024
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
Wenjia Xu
Zijian Yu
Yixu Wang
Jiuniu Wang
Yuanben Zhang
Guangzuo Li
Mugen Peng
LLMAG
471
7
0
11 Jun 2024
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim
Bhargavi Paranjape
Tushar Khot
Hannaneh Hajishirzi
LM&RoELMLLMAGLRM
264
11
0
10 Jun 2024
AICoderEval: Improving AI Domain Code Generation of Large Language
  Models
AICoderEval: Improving AI Domain Code Generation of Large Language Models
Yinghui Xia
Yuyan Chen
Tianyu Shi
Jun Wang
Jinsong Yang
161
6
0
07 Jun 2024
LogiCode: an LLM-Driven Framework for Logical Anomaly Detection
LogiCode: an LLM-Driven Framework for Logical Anomaly DetectionIEEE Transactions on Automation Science and Engineering (T-ASE), 2024
Yiheng Zhang
Yunkang Cao
Xiaohao Xu
Nong Sang
245
31
0
07 Jun 2024
Tool-Planner: Task Planning with Clusters across Multiple Tools
Tool-Planner: Task Planning with Clusters across Multiple Tools
Yanming Liu
Xinyue Peng
Jiannan Cao
Yuwei Zhang
Xuhong Zhang
Sheng Cheng
Xun Wang
Jianwei Yin
Xuhong Zhang
LLMAG
371
2
0
06 Jun 2024
A Survey of Language-Based Communication in Robotics
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
715
17
0
06 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future
  Pathways
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
409
134
0
04 Jun 2024
Towards a copilot in BIM authoring tool using a large language
  model-based agent for intelligent human-machine interaction
Towards a copilot in BIM authoring tool using a large language model-based agent for intelligent human-machine interaction
Changyu Du
Stavros Nousias
André Borrmann
LLMAG
216
6
0
02 Jun 2024
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot
  Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language
  Models
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models
Tianrun Chen
Chunan Yu
Jing Li
Jianqi Zhang
Lanyun Zhu
Deyi Ji
Yong Zhang
Ying Zang
Zejian Li
Lingyun Sun
LRM
249
12
0
29 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large
  Language Models
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong Liu
Ruiming Tang
KELM
250
7
0
29 May 2024
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex
  Visual Reasoning
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Somnath Kumar
Yash Gadhia
T. Ganu
A. Nambi
LRM
319
11
0
28 May 2024
A Human-Like Reasoning Framework for Multi-Phases Planning Task with
  Large Language Models
A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models
Chengxing Xie
Difan Zou
LRMLLMAG
225
12
0
28 May 2024
Tool Learning with Large Language Models: A Survey
Tool Learning with Large Language Models: A Survey
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Jirong Wen
LLMAG
342
217
0
28 May 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for
  Universal Visual Grounding
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-Cong Chen
ObjDMLLMVLM
303
7
0
27 May 2024
Previous
123...678...141516
Next
Page 7 of 16
Pageof 16