2302.10035
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Machine Intelligence Research (MIR), 2023
20 February 2023
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CE
VLM
Papers citing
"Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey"
27 / 127 papers shown
A Survey on Image-text Multimodal Models
Ruifeng Guo
Jingxuan Wei
Linzhuang Sun
Khai-Nguyen Nguyen
Guiyong Chang
Dawei Liu
Sibo Zhang
Zhengbing Yao
Mingjun Xu
Liping Bu
VLM
328
22
0
23 Sep 2023
Bias and Fairness in Chatbots: An Overview
APSIPA Transactions on Signal and Information Processing (TASIP), 2023
Jintang Xue
Yun Cheng Wang
Chengwei Wei
Xiaofeng Liu
Jonghye Woo
C.-C. Jay Kuo
322
59
0
16 Sep 2023
SSL-Net: A Synergistic Spectral and Learning-based Network for Efficient Bird Sound Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yiyuan Yang
Kaichen Zhou
Niki Trigoni
Andrew Markham
164
6
0
15 Sep 2023
Enhancing Subtask Performance of Multi-modal Large Language Model
Yongqiang Zhao
Zhenyu Li
Feng Zhang
Xinhai Xu
Donghong Liu
LRM
82
1
0
31 Aug 2023
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
International Conference on Language Resources and Evaluation (LREC), 2023
Rui Mao
Guanyi Chen
Xulang Zhang
Frank Guerin
Xiaoshi Zhong
ELM
LM&MA
187
148
0
24 Aug 2023
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval
IEEE Transactions on Multimedia (IEEE TMM), 2023
Huafeng Li
Shedan Yang
Yafei Zhang
Dapeng Tao
Z. Yu
225
6
0
23 Aug 2023
TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series
International Conference on Learning Representations (ICLR), 2023
Chenxi Sun
Hongyan Li
Yaliang Li
Linda Qiao
AI4TS
408
190
0
16 Aug 2023
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
IEEE International Conference on Computer Vision (ICCV), 2023
Hongguang Zhu
Yunchao Wei
Xiaodan Liang
Chunjie Zhang
Yao-Min Zhao
VLM
139
36
0
14 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
376
25
0
08 Aug 2023
Improving Zero-Shot Generalization for CLIP with Synthesized Prompts
Liang Luo
Jian Liang
Ran He
Nana Xu
Zilei Wang
Tien-Ping Tan
VLM
255
69
0
14 Jul 2023
A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Ming Jin
Huan Yee Koh
Qingsong Wen
Daniele Zambon
Cesare Alippi
G. I. Webb
Irwin King
Shirui Pan
AI4TS
AI4CE
421
350
0
07 Jul 2023
Review of Large Vision Models and Visual Prompt Engineering
Yuan Liu
Zheng Liu
Lin Zhao
Zihao Wu
Chong Ma
...
Bao Ge
Yixuan Yuan
Hongtu Zhu
Tianming Liu
Shu Zhang
VLM
LRM
317
216
0
03 Jul 2023
A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation
Jeremy Gwinnup
Kevin Duh
VLM
148
8
0
12 Jun 2023
ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
Computer Vision and Pattern Recognition (CVPR), 2023
Tz-Ying Wu
Chih-Hui Ho
Nuno Vasconcelos
VLM
179
14
0
04 Jun 2023
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer
IEEE Transactions on Multimedia (IEEE TMM), 2023
Bo Jiang
S. Luo
Tianlin Li
Chuanfu Li
Jin Tang
178
11
0
30 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
408
130
0
14 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Reliability Engineering & System Safety (Reliab. Eng. Syst. Saf.), 2023
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
386
93
0
10 May 2023
Exploring the Landscape of Machine Unlearning: A Comprehensive Survey and Taxonomy
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
T. Shaik
Xiaohui Tao
Haoran Xie
Lin Li
Xiaofeng Zhu
Qingyuan Li
MU
516
54
0
10 May 2023
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey
Yichi Zhang
Rushi Jiao
MedIm
VLM
294
35
0
05 May 2023
Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
Jun Zhu
Jia Jin
Zihan Yang
Xiaohao Wu
X. Wang
ViT
294
14
0
20 Apr 2023
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
International Conference on Learning Representations (ICLR), 2023
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Kang Liu
Jun Zhao
504
12
0
04 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
501
1,044
0
03 Apr 2023
RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning
Yabin Zhu
Chenglong Li
Tianlin Li
Jin Tang
Zhixiang Huang
235
15
0
26 Mar 2023
AI-Generated Content (AIGC): A Survey
Jiayang Wu
Wensheng Gan
Zefeng Chen
Shicheng Wan
Hong Lin
3DV
250
188
0
26 Mar 2023
Large Selective Kernel Network for Remote Sensing Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Yuxuan Li
Qibin Hou
Zhaohui Zheng
Mingmei Cheng
Jian Yang
Xiang Li
ObjD
328
458
0
16 Mar 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
283
107
0
08 Dec 2022
See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval
Xiujun Shu
Wei Wen
Haoqian Wu
Keyun Chen
Yi-Zhe Song
Ruizhi Qiao
Bohan Ren
Xiao Wang
317
150
0
18 Aug 2022