Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey

Machine Intelligence Research (MIR), 2023
20 February 2023
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
    AI4CE, VLM

Papers citing "Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey"

27 / 127 papers shown
A Survey on Image-text Multimodal Models
Ruifeng Guo
Jingxuan Wei
Linzhuang Sun
Khai-Nguyen Nguyen
Guiyong Chang
Dawei Liu
Sibo Zhang
Zhengbing Yao
Mingjun Xu
Liping Bu
VLM
328
22
0
23 Sep 2023
Bias and Fairness in Chatbots: An Overview
APSIPA Transactions on Signal and Information Processing (TASIP), 2023
Jintang Xue
Yun Cheng Wang
Chengwei Wei
Xiaofeng Liu
Jonghye Woo
C.-C. Jay Kuo
322
59
0
16 Sep 2023
SSL-Net: A Synergistic Spectral and Learning-based Network for Efficient Bird Sound Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yiyuan Yang
Kaichen Zhou
Niki Trigoni
Andrew Markham
164
6
0
15 Sep 2023
Enhancing Subtask Performance of Multi-modal Large Language Model
Yongqiang Zhao
Zhenyu Li
Feng Zhang
Xinhai Xu
Donghong Liu
LRM
82
1
0
31 Aug 2023
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
International Conference on Language Resources and Evaluation (LREC), 2023
Rui Mao
Guanyi Chen
Xulang Zhang
Frank Guerin
Xiaoshi Zhong
ELM, LM&MA
187
148
0
24 Aug 2023
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval
IEEE Transactions on Multimedia (IEEE TMM), 2023
Huafeng Li
Shedan Yang
Yafei Zhang
Dapeng Tao
Z. Yu
225
6
0
23 Aug 2023
TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series
International Conference on Learning Representations (ICLR), 2023
Chenxi Sun
Hongyan Li
Yaliang Li
Linda Qiao
AI4TS
408
190
0
16 Aug 2023
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
IEEE International Conference on Computer Vision (ICCV), 2023
Hongguang Zhu
Yunchao Wei
Xiaodan Liang
Chunjie Zhang
Yao-Min Zhao
VLM
139
36
0
14 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
376
25
0
08 Aug 2023
Improving Zero-Shot Generalization for CLIP with Synthesized Prompts
Liang Luo
Jian Liang
Ran He
Nana Xu
Zilei Wang
Tien-Ping Tan
VLM
255
69
0
14 Jul 2023
A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Ming Jin
Huan Yee Koh
Qingsong Wen
Daniele Zambon
Cesare Alippi
G. I. Webb
Irwin King
Shirui Pan
AI4TS, AI4CE
421
350
0
07 Jul 2023
Review of Large Vision Models and Visual Prompt Engineering
Yuan Liu
Zheng Liu
Lin Zhao
Zihao Wu
Chong Ma
...
Bao Ge
Yixuan Yuan
Hongtu Zhu
Tianming Liu
Shu Zhang
VLM, LRM
317
216
0
03 Jul 2023
A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation
Jeremy Gwinnup
Kevin Duh
VLM
148
8
0
12 Jun 2023
ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
Computer Vision and Pattern Recognition (CVPR), 2023
Tz-Ying Wu
Chih-Hui Ho
Nuno Vasconcelos
VLM
179
14
0
04 Jun 2023
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer
IEEE Transactions on Multimedia (IEEE TMM), 2023
Bo Jiang
S. Luo
Tianlin Li
Chuanfu Li
Jin Tang
178
11
0
30 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
408
130
0
14 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Reliability Engineering & System Safety (Reliab. Eng. Syst. Saf.), 2023
Yanfang Li
Huan Wang
Muxia Sun
LM&MA, AI4TS, AI4CE
386
93
0
10 May 2023
Exploring the Landscape of Machine Unlearning: A Comprehensive Survey and Taxonomy
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
T. Shaik
Xiaohui Tao
Haoran Xie
Lin Li
Xiaofeng Zhu
Qingyuan Li
MU
516
54
0
10 May 2023
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey
Yichi Zhang
Rushi Jiao
MedIm, VLM
294
35
0
05 May 2023
Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
Jun Zhu
Jia Jin
Zihan Yang
Xiaohao Wu
X. Wang
ViT
294
14
0
20 Apr 2023
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
International Conference on Learning Representations (ICLR), 2023
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Kang Liu
Jun Zhao
504
12
0
04 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
501
1,044
0
03 Apr 2023
RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning
Yabin Zhu
Chenglong Li
Tianlin Li
Jin Tang
Zhixiang Huang
235
15
0
26 Mar 2023
AI-Generated Content (AIGC): A Survey
Jiayang Wu
Wensheng Gan
Zefeng Chen
Shicheng Wan
Hong Lin
3DV
250
188
0
26 Mar 2023
Large Selective Kernel Network for Remote Sensing Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Yuxuan Li
Qibin Hou
Zhaohui Zheng
Mingmei Cheng
Jian Yang
Xiang Li
ObjD
328
458
0
16 Mar 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
283
107
0
08 Dec 2022
See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval
Xiujun Shu
Wei Wen
Haoqian Wu
Keyun Chen
Yi-Zhe Song
Ruizhi Qiao
Bohan Ren
Xiao Wang
317
150
0
18 Aug 2022