HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
    MLLM

Papers citing "HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face"

50 / 754 papers shown
Mitigating Hallucination in Visual Language Models with Visual Supervision
Zhiyang Chen
Yousong Zhu
Yufei Zhan
Zhaowen Li
Honghui Dong
Jinqiao Wang
Ming Tang
VLM MLLM
244
52
0
27 Nov 2023
Function-constrained Program Synthesis
Patrick Hajali
Ignas Budvytis
189
1
0
27 Nov 2023
See and Think: Embodied Agent in Virtual Environment
European Conference on Computer Vision (ECCV), 2023
Zhonghan Zhao
Wenhao Chai
Xuan Wang
Li Boyi
Shengyu Hao
Shidong Cao
Tianbo Ye
Gaoang Wang
LM&Ro LLMAG
394
54
0
26 Nov 2023
Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied LMM-based Agent on Drones
Haoran Zhao
Fengxing Pan
Huqiuyue Ping
Yaoming Zhou
AI4CE
190
17
0
25 Nov 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
European Conference on Computer Vision (ECCV), 2023
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
242
30
0
24 Nov 2023
Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents
Qinghua Lu
Liming Zhu
Xiwei Xu
Zhenchang Xing
Stefan Harrer
Jon Whittle
LM&Ro AI4CE LLMAG
377
21
0
22 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
342
431
0
21 Nov 2023
AcademicGPT: Empowering Academic Research
Shufa Wei
Xiaolong Xu
Xianbiao Qi
Xi Yin
Jun Xia
...
Chihao Dai
Lihua Wang
Xiaohui Liu
Lei Zhang
Yutao Xie
LM&MA
211
5
0
21 Nov 2023
InteraSSort: Interactive Assortment Planning Using Large Language Models
Social Science Research Network (SSRN), 2023
Saketh Reddy Karra
Theja Tulabandhula
169
3
0
20 Nov 2023
VLM-Eval: A General Evaluation on Video Large Language Models
Shuailin Li
Yuang Zhang
Yucheng Zhao
Qiuyue Wang
Fan Jia
Yingfei Liu
Tiancai Wang
MLLM ELM
156
6
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG LM&Ro LRM
363
92
0
20 Nov 2023
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
Yilun Kong
Jingqing Ruan
Yihong Chen
Bin Zhang
Tianpeng Bao
...
Xiaoru Hu
Hangyu Mao
Ziyue Li
Xingyu Zeng
Rui Zhao
LLMAG
292
50
0
19 Nov 2023
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Cindy Yang
Yu Tian
Deshun Yang
Bang Yang
Zaoshan Huang
Zihao Li
Yuexian Zou
AI4TS
100
3
0
16 Nov 2023
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM MLLM
1.6K
1,181
0
16 Nov 2023
When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Hao Peng
Xiaozhi Wang
Jianhui Chen
Weikai Li
Yunjia Qi
...
Zhili Wu
Kaisheng Zeng
Bin Xu
Lei Hou
Juanzi Li
258
42
0
15 Nov 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Peng Jin
Ryuichi Takanobu
Caiwan Zhang
Xiaochun Cao
Li-ming Yuan
MLLM
508
352
0
14 Nov 2023
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Yunfei Chu
Jin Xu
Xiaohuan Zhou
Qian Yang
Shiliang Zhang
Zhijie Yan
Chang Zhou
Jingren Zhou
AuLLM
317
595
0
14 Nov 2023
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System
Haoyuan Li
Hao Jiang
Tianke Zhang
Zhelun Yu
Aoxiong Yin
Hao Cheng
Siming Fu
Yuhao Zhang
Wanggui He
LLMAG
240
8
0
11 Nov 2023
How to Bridge the Gap between Modalities: Survey on Multimodal Large Language Model
Shezheng Song
Xiaopeng Li
Shasha Li
Shan Zhao
Jie Yu
Jun Ma
Xiaoguang Mao
Weimin Zhang
275
18
0
10 Nov 2023
TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models
Zhen Yang
Yingxue Zhang
Fandong Meng
Jie Zhou
VLM MLLM
206
4
0
08 Nov 2023
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents
Shaoguang Mao
Yuzhe Cai
Yan Xia
Wenshan Wu
Xun Wang
Fengyi Wang
Tao Ge
Furu Wei
274
24
0
06 Nov 2023
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
International Conference on Machine Learning (ICML), 2023
Yufei Wang
Zhou Xian
Feng Chen
Tsun-Hsuan Wang
Yian Wang
Katerina Fragkiadaki
Zackory M. Erickson
David Held
Chuang Gan
LM&Ro
482
169
0
02 Nov 2023
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Wai-Chung Kwan
Xingshan Zeng
Yufei Wang
Yusen Sun
Liangyou Li
Lifeng Shang
Qun Liu
Kam-Fai Wong
ELM
286
13
0
30 Oct 2023
ControlLLM: Augment Language Models with Tools by Searching on Graphs
European Conference on Computer Vision (ECCV), 2023
Zhaoyang Liu
Zeqiang Lai
Zhangwei Gao
Erfei Cui
Ziheng Li
...
Lewei Lu
Qifeng Chen
Yu Qiao
Jifeng Dai
Wenhai Wang
MLLM
392
57
0
26 Oct 2023
Managing extreme AI risks amid rapid progress
Yoshua Bengio
Geoffrey Hinton
Andrew Yao
Dawn Song
Pieter Abbeel
...
Juil Sock
Stuart J. Russell
Daniel Kahneman
J. Brauner
Sören Mindermann
344
30
0
26 Oct 2023
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
Neural Information Processing Systems (NeurIPS), 2023
Ge Zheng
Bin Yang
Jiajin Tang
Hong-Yu Zhou
Sibei Yang
LRM MLLM
314
181
0
25 Oct 2023
Woodpecker: Hallucination Correction for Multimodal Large Language Models
Science China Information Sciences (Sci China Inf Sci), 2023
Xinglong Mao
Chaoyou Fu
Zhengye Zhang
Tong Xu
Hao Wang
Dianbo Sui
Chunjiang Ge
Ke Li
Xingguo Sun
Enhong Chen
VLM MLLM
334
197
0
24 Oct 2023
A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
Journal of Computational Science and Technology (JCST), 2023
Wailing Ng
Yuanqin He
Xuefang Zhao
Hanlin Gu
Chen Zhang
Haijun Yang
Lixin Fan
Qiang Yang
212
7
0
24 Oct 2023
Large Language Models are Visual Reasoning Coordinators
Neural Information Processing Systems (NeurIPS), 2023
Liangyu Chen
Bo Li
Sheng Shen
Jingkang Yang
Chunyuan Li
Kurt Keutzer
Trevor Darrell
Ziwei Liu
VLM LRM
276
91
0
23 Oct 2023
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
Yuchen Zhuang
Xiang Chen
Tong Yu
Saayan Mitra
Victor S. Bursztyn
Ryan Rossi
Somdeb Sarkhel
Chao Zhang
LLMAG
297
98
0
20 Oct 2023
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-Xiong Wang
430
47
0
19 Oct 2023
Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang
Tian Bian
Yian Yin
Qifeng Bai
Hong Cheng
Helen M. Meng
Zibin Zheng
Liang Chen
Bingzhe Wu
VLM DiffM
242
6
0
18 Oct 2023
Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle
Xu Yang
Xiao Yang
Yuante Li
Jinhui Li
Peng Yu
Zeqi Ye
Jiang Bian
191
1
0
17 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
460
223
0
16 Oct 2023
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Odhran O'Donoghue
Aleksandar Shtedritski
John Ginger
Ralph Abboud
Ali E. Ghareeb
Justin Booth
Samuel G. Rodriques
292
29
0
16 Oct 2023
An Expression Tree Decoding Strategy for Mathematical Equation Generation
Wenqi Zhang
Yongliang Shen
Qingpeng Nong
Zeqi Tan
Yanna Ma
Weiming Lu
AIMat
328
7
0
14 Oct 2023
SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network
Lei Yao
Yong Zhang
Zilong Yan
Jialu Tian
163
4
0
13 Oct 2023
Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongru Wang
Minda Hu
Yang Deng
Rui Wang
Fei Mi
Weichao Wang
Yasheng Wang
Wai-Chung Kwan
Irwin King
Kam-Fai Wong
RALM
259
8
0
13 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
European Conference on Computer Vision (ECCV), 2023
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
293
80
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM MLLM DiffM
102
28
0
12 Oct 2023
Towards Robust Multi-Modal Reasoning via Model Selection
International Conference on Learning Representations (ICLR), 2023
Xiangyan Liu
Rongxue Li
Wei Ji
Tao Lin
LLMAG LRM
303
8
0
12 Oct 2023
GameGPT: Multi-agent Collaborative Framework for Game Development
Dake Chen
Haoyang Zhang
Hanbin Wang
Yunhao Huo
Yuzhao Li
Junjie Wang
LLMAG
303
33
0
12 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
222
14
0
11 Oct 2023
An Empirical Study of Instruction-tuning Large Language Models in Chinese
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Q. Si
Tong Wang
Zheng Lin
Xu Zhang
Yanan Cao
Weiping Wang
ALM
199
23
0
11 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
International Conference on Learning Representations (ICLR), 2023
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
240
88
0
10 Oct 2023
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task
Natural Language Processing and Chinese Computing (NLPCC), 2023
Guanting Dong
Jinxu Zhao
Tingfeng Hui
Daichi Guo
Wenlong Wan
...
Yueyan Qiu
Zhuoma Gongque
Keqing He
Zechen Wang
Weiran Xu
AAML
213
32
0
10 Oct 2023
ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
KAI-QING Zhou
Kwonjoon Lee
Teruhisa Misu
Xin Eric Wang
LRM
272
9
0
09 Oct 2023
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Robert Litschko
Max Müller-Eberstein
Rob van der Goot
Leon Weber
Barbara Plank
LRM
187
3
0
09 Oct 2023
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
LRM
216
27
0
08 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALM KELM
236
80
0
08 Oct 2023