Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2103.15808
Cited By
CvT: Introducing Convolutions to Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2021
29 March 2021
Haiping Wu
Bin Xiao
Noel Codella
Xiyang Dai
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (227★)
Papers citing
"CvT: Introducing Convolutions to Vision Transformers"
50 / 857 papers shown
Title
Sparks of Artificial General Intelligence(AGI) in Semiconductor Material Science: Early Explorations into the Next Frontier of Generative AI-Assisted Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
180
0
0
17 Sep 2024
GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Yanguang Sun
Hanyu Xuan
Zhiqiang Wang
Lei Luo
ObjD
165
19
0
15 Sep 2024
Domain-Invariant Representation Learning of Bird Sounds
Ilyass Moummad
Romain Serizel
Emmanouil Benetos
Nicolas Farrugia
SSL
294
6
0
13 Sep 2024
SDformer: Efficient End-to-End Transformer for Depth Completion
Jian Qian
Miao Sun
Ashley Lee
Jie Li
Shenglong Zhuo
Patrick Chiang
ViT
MDE
234
4
0
12 Sep 2024
ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation
Fuchen Zheng
Xinyi Chen
Xuhang Chen
Haolun Li
Xiaojiao Guo
Guoheng Huang
Chi-Man Pun
Shoujun Zhou
ViT
MedIm
117
0
0
12 Sep 2024
Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
Bojian Li
Bo Liu
Dan Si
Jinghua Yue
F. Zhou
MedIm
MDE
258
4
0
12 Sep 2024
PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening
AAAI Conference on Artificial Intelligence (AAAI), 2024
Ruocheng Wu
ZiEn Zhang
ShangQi Deng
YuLe Duan
LiangJian Deng
146
4
0
11 Sep 2024
Brain-Inspired Stepwise Patch Merging for Vision Transformers
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Yonghao Yu
Dongcheng Zhao
Guobin Shen
Yiting Dong
Yi Zeng
302
0
0
11 Sep 2024
Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild
Xiongkuo Min
Yixuan Gao
Yuqin Cao
Guangtao Zhai
Wenjun Zhang
Huifang Sun
C. Chen
119
51
0
09 Sep 2024
UNIT: Unifying Image and Text Recognition in One Vision Encoder
Neural Information Processing Systems (NeurIPS), 2024
Yi Zhu
Yanpeng Zhou
Chunwei Wang
Yang Cao
Jianhua Han
Lu Hou
Hang Xu
ViT
VLM
213
9
0
06 Sep 2024
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
218
1
0
05 Sep 2024
TBConvL-Net: A Hybrid Deep Learning Architecture for Robust Medical Image Segmentation
Pattern Recognition (Pattern Recogn.), 2024
Shahzaib Iqbal
Tariq M. Khan
Syed S. Naqvi
Asim Naveed
Erik H. W. Meijering
MedIm
223
40
0
05 Sep 2024
Frequency-Spatial Entanglement Learning for Camouflaged Object Detection
European Conference on Computer Vision (ECCV), 2024
Yanguang Sun
Chunyan Xu
Zhiqiang Wang
Hanyu Xuan
Lei Luo
223
60
0
03 Sep 2024
Dreaming is All You Need
Mingze Ni
Wei Liu
112
0
0
03 Sep 2024
A Hybrid Transformer-Mamba Network for Single Image Deraining
Shangquan Sun
Wenqi Ren
Juxiang Zhou
Jianhou Gan
Rui Wang
Xiaochun Cao
Mamba
262
12
0
31 Aug 2024
SMAFormer: Synergistic Multi-Attention Transformer for Medical Image Segmentation
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Fuchen Zheng
Xuhang Chen
Weihuang Liu
Haolun Li
Yingtie Lei
Jiahui He
Chi-Man Pun
Shounjun Zhou
MedIm
600
38
0
31 Aug 2024
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
189
1
0
27 Aug 2024
Multi-Modal Instruction-Tuning Small-Scale Language-and-Vision Assistant for Semiconductor Electron Micrograph Analysis
AAAI Spring Symposia (SSS), 2024
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
221
1
0
27 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
202
0
0
24 Aug 2024
Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
224
0
0
24 Aug 2024
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
109
0
0
23 Aug 2024
Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes
Sakhinana Sagar Srinivas
Rajat Kumar Sarkar
Sreeja Gangasani
Venkataramana Runkana
219
2
0
21 Aug 2024
sTransformer: A Modular Approach for Extracting Inter-Sequential and Temporal Information for Time-Series Forecasting
Jiaheng Yin
Zhengxin Shi
Jianshen Zhang
Xiaomin Lin
Yulin Huang
Yongzhi Qi
Wei Qi
AI4TS
93
0
0
19 Aug 2024
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Beoungwoo Kang
Seunghun Moon
Yubin Cho
Hyunwoo Yu
Suk-Ju Kang
ViT
MedIm
179
20
0
14 Aug 2024
Advanced Vision Transformers and Open-Set Learning for Robust Mosquito Classification: A Novel Approach to Entomological Studies
Ahmed Akib Jawad Karim
Muhammad Zawad Mahmud
Riasat Khan
91
3
0
12 Aug 2024
Efficient Visual Representation Learning with Heat Conduction Equation
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Zhemin Zhang
Xun Gong
DiffM
3DV
191
0
0
12 Aug 2024
MacFormer: Semantic Segmentation with Fine Object Boundaries
Guoan Xu
Wenfeng Huang
Tao Wu
Ligeng Chen
Wenjing Jia
Guangwei Gao
Xiatian Zhu
Stuart W. Perry
192
3
0
11 Aug 2024
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
Tianfang Zhang
Lei Li
Yang Zhou
Wentao Liu
Chen Qian
Xiangyang Ji
ViT
179
60
0
07 Aug 2024
Multi-label Sewer Pipe Defect Recognition with Mask Attention Feature Enhancement and Label Correlation Learning
Xin Zuo
Yu Sheng
Jifeng Shen
Yongwei Shan
138
0
0
01 Aug 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViT
MDE
364
33
0
28 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
208
0
0
26 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
276
19
0
26 Jul 2024
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers
Zhengang Li
Alec Lu
Yanyue Xie
Zhenglun Kong
Mengshu Sun
...
Zhaoyang Han
Caiwen Ding
Yanzhi Wang
Xue Lin
Zhenman Fang
176
9
0
25 Jul 2024
How Lightweight Can A Vision Transformer Be
Jen Hong Tan
ViT
MoE
187
1
0
25 Jul 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
172
11
0
24 Jul 2024
HERGen: Elevating Radiology Report Generation with Longitudinal Data
Fuying Wang
Shenghui Du
Lequan Yu
MedIm
203
17
0
21 Jul 2024
DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
Xiaoya Tang
Bodong Zhang
Beatrice Knudsen
Tolga Tasdizen
ViT
MedIm
222
1
0
18 Jul 2024
SegPoint: Segment Any Point Cloud via Large Language Model
Shuting He
Henghui Ding
Xudong Jiang
Bihan Wen
3DV
MLLM
3DPC
206
33
0
18 Jul 2024
AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs
Yunling Zheng
Zeyi Xu
Fanghui Xue
Biao Yang
Jiancheng Lyu
Shuai Zhang
Y. Qi
Jack Xin
180
0
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
119
11
0
16 Jul 2024
TractGraphFormer: Anatomically Informed Hybrid Graph CNN-Transformer Network for Interpretable Sex and Age Prediction from Diffusion MRI Tractography
Yuqian Chen
Fan Zhang
Meng Wang
L. Zekelman
Suheyla Cetin Karayumak
...
J. Rushmore
N. Makris
Yogesh Rathi
Weidong Cai
L. O’Donnell
MedIm
ViT
141
6
0
11 Jul 2024
Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction
Yumin Kim
Gayoon Choi
Seong Jae Hwang
134
1
0
10 Jul 2024
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation
Guoan Xu
Wenjing Jia
Tao Wu
Ligeng Chen
Guangwei Gao
ViT
211
23
0
10 Jul 2024
iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency
Haruna Yunusa
Qin Shiyin
Abdulrahman Hamman Adama Chukkol
Isah Bello
A. Lawan
Isah Bello
261
4
0
10 Jul 2024
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
Kazi Sajeed Mehrab
M. Maruf
Arka Daw
Harish Babu Manogaran
Abhilash Neog
...
Paula Mabee
Wasila Dahdul
Anuj Karpatne
Wasila M Dahdul
Anuj Karpatne
383
6
0
10 Jul 2024
CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion
Hosam S. El-Assiouti
Hadeer El-Saadawy
M. Al-Berry
M. Tolba
ViT
192
0
0
09 Jul 2024
CBM: Curriculum by Masking
Andrei Jarca
Florinel-Alin Croitoru
Radu Tudor Ionescu
160
4
0
06 Jul 2024
Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies
Ivan Drokin
277
58
0
01 Jul 2024
Query-Efficient Hard-Label Black-Box Attack against Vision Transformers
Chao Zhou
Xiaowen Shi
Yuan-Gen Wang
ViT
AAML
147
1
0
29 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
189
1
0
27 Jun 2024
Previous
1
2
3
4
5
6
...
16
17
18
Next