ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.06717
  4. Cited By
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

13 March 2022
Xiaohan Ding
X. Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian-jun Sun
    VLM
ArXivPDFHTML

Papers citing "Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs"

50 / 153 papers shown
Title
Counting Like Human: Anthropoid Crowd Counting on Modeling the
  Similarity of Objects
Counting Like Human: Anthropoid Crowd Counting on Modeling the Similarity of Objects
Qi. Wang
Juncheng Wang
Junyuan Gao
Yuan. Yuan
Xuelong Li
17
2
0
02 Dec 2022
SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning
SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Siyuan Li
Stan Z. Li
VLM
AI4TS
14
1
0
22 Nov 2022
EHSNet: End-to-End Holistic Learning Network for Large-Size Remote
  Sensing Image Semantic Segmentation
EHSNet: End-to-End Holistic Learning Network for Large-Size Remote Sensing Image Semantic Segmentation
Wei-Neng Chen
Yansheng Li
Bo Dang
Yongjun Zhang
17
3
0
21 Nov 2022
Age Prediction Performance Varies Across Deep, Superficial, and
  Cerebellar White Matter Connections
Age Prediction Performance Varies Across Deep, Superficial, and Cerebellar White Matter Connections
Yuxiang Wei
Tengfei Xue
Yogesh Rathi
N. Makris
Fan Zhang
L. O’Donnell
11
1
0
11 Nov 2022
Demystify Transformers & Convolutions in Modern Image Deep Networks
Demystify Transformers & Convolutions in Modern Image Deep Networks
Jifeng Dai
Min Shi
Weiyun Wang
Sitong Wu
Linjie Xing
...
Lewei Lu
Jie Zhou
Xiaogang Wang
Yu Qiao
Xiao-hua Hu
ViT
18
11
0
10 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with
  Deformable Convolutions
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
20
627
0
10 Nov 2022
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted
  Decorrelation Regularization
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization
Ting Wang
Zhedong Zheng
Zunjie Zhu
Yuhan Gao
Yi Yang
Chenggang Yan
20
34
0
10 Nov 2022
Decoupled Cross-Scale Cross-View Interaction for Stereo Image
  Enhancement in The Dark
Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in The Dark
Huan Zheng
Zhao Zhang
Jicong Fan
Richang Hong
Yi Yang
Shuicheng Yan
3DV
24
6
0
02 Nov 2022
GaitMixer: Skeleton-based Gait Representation Learning via Wide-spectrum
  Multi-axial Mixer
GaitMixer: Skeleton-based Gait Representation Learning via Wide-spectrum Multi-axial Mixer
Ekkasit Pinyoanuntapong
Ayman Ali
Pu Wang
Minwoo Lee
C. L. P. Chen
CVBM
92
25
0
27 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
14
155
0
24 Oct 2022
Delving into Masked Autoencoders for Multi-Label Thorax Disease
  Classification
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification
Junfei Xiao
Yutong Bai
Alan Yuille
Zongwei Zhou
MedIm
ViT
27
59
0
23 Oct 2022
What Makes Convolutional Models Great on Long Sequence Modeling?
What Makes Convolutional Models Great on Long Sequence Modeling?
Yuhong Li
Tianle Cai
Yi Zhang
De-huai Chen
Debadeepta Dey
VLM
26
95
0
17 Oct 2022
Efficient Image Super-Resolution using Vast-Receptive-Field Attention
Efficient Image Super-Resolution using Vast-Receptive-Field Attention
Ling Zhou
Haoming Cai
Jinjin Gu
Zheyu Li
Yingqi Liu
Xiangyu Chen
Yu Qiao
Chao Dong
SupR
8
56
0
12 Oct 2022
ZITS++: Image Inpainting by Improving the Incremental Transformer on
  Structural Priors
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors
Chenjie Cao
Qiaole Dong
Yanwei Fu
28
30
0
12 Oct 2022
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Taojiannan Yang
Haokui Zhang
Wenze Hu
C. L. P. Chen
Xiaoyu Wang
ViT
9
0
0
08 Oct 2022
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical
  Transformer for Medical Image Segmentation
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation
Ho Hin Lee
Shunxing Bao
Yuankai Huo
Bennett A. Landman
OOD
MedIm
42
122
0
29 Sep 2022
Rethinking Performance Gains in Image Dehazing Networks
Rethinking Performance Gains in Image Dehazing Networks
Yuda Song
Yang Zhou
Hui Qian
Xin Du
SSeg
22
47
0
23 Sep 2022
DRKF: Distilled Rotated Kernel Fusion for Efficient Rotation Invariant
  Descriptors in Local Feature Matching
DRKF: Distilled Rotated Kernel Fusion for Efficient Rotation Invariant Descriptors in Local Feature Matching
Ranran Huang
Jiancheng Cai
Chao Li
Zhuoyuan Wu
Xinmin Liu
Z. Chai
14
0
0
22 Sep 2022
DMFormer: Closing the Gap Between CNN and Vision Transformers
DMFormer: Closing the Gap Between CNN and Vision Transformers
Zimian Wei
H. Pan
Lujun Li
Menglong Lu
Xin-Yi Niu
Peijie Dong
Dongsheng Li
ViT
28
5
0
16 Sep 2022
LKD-Net: Large Kernel Convolution Network for Single Image Dehazing
LKD-Net: Large Kernel Convolution Network for Single Image Dehazing
Pinjun Luo
Guoqiang Xiao
Xinbo Gao
Song Wu
11
26
0
05 Sep 2022
U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
Xi Jia
Joseph Bartlett
Tianyang Zhang
Wenqi Lu
Zhaowen Qiu
Jinming Duan
ViT
MedIm
25
60
0
07 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated
  Convolutions
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
12
250
0
28 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for
  real-time object detectors
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
20
6,062
0
06 Jul 2022
Overview of Deep Learning-based CSI Feedback in Massive MIMO Systems
Overview of Deep Learning-based CSI Feedback in Massive MIMO Systems
Jiajia Guo
Chao-Kai Wen
Shi Jin
Geoffrey Ye Li
21
144
0
29 Jun 2022
A Simple Baseline for Video Restoration with Grouped Spatial-temporal
  Shift
A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift
Dasong Li
Xiaoyu Shi
Y. Zhang
Ka Chun Cheung
Simon See
Xiaogang Wang
Hongwei Qin
Hongsheng Li
VGen
10
58
0
22 Jun 2022
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Yukang Chen
Jianhui Liu
X. Zhang
Xiaojuan Qi
Jiaya Jia
33
82
0
21 Jun 2022
Can CNNs Be More Robust Than Transformers?
Can CNNs Be More Robust Than Transformers?
Zeyu Wang
Yutong Bai
Yuyin Zhou
Cihang Xie
UQCV
OOD
14
46
0
07 Jun 2022
ShuffleMixer: An Efficient ConvNet for Image Super-Resolution
ShuffleMixer: An Efficient ConvNet for Image Super-Resolution
Long Sun
Jin-shan Pan
Jinhui Tang
SupR
24
82
0
30 May 2022
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense
  Prediction
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
Han Cai
Junyan Li
Muyan Hu
Chuang Gan
Song Han
21
48
0
29 May 2022
TRT-ViT: TensorRT-oriented Vision Transformer
TRT-ViT: TensorRT-oriented Vision Transformer
Xin Xia
Jiashi Li
Jie Wu
Xing Wang
Xuefeng Xiao
Min Zheng
Rui Wang
ViT
16
26
0
19 May 2022
Discovering and Explaining the Representation Bottleneck of Graph Neural
  Networks from Multi-order Interactions
Discovering and Explaining the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions
Fang Wu
Siyuan Li
Lirong Wu
Dragomir R. Radev
Stan Z. Li
11
2
0
15 May 2022
ConvMAE: Masked Convolution Meets Masked Autoencoders
ConvMAE: Masked Convolution Meets Masked Autoencoders
Peng Gao
Teli Ma
Hongsheng Li
Ziyi Lin
Jifeng Dai
Yu Qiao
ViT
19
119
0
08 May 2022
Sequencer: Deep LSTM for Image Classification
Sequencer: Deep LSTM for Image Classification
Yuki Tatsunami
Masato Taki
VLM
ViT
10
77
0
04 May 2022
Learning to Reduce Information Bottleneck for Object Detection in Aerial
  Images
Learning to Reduce Information Bottleneck for Object Detection in Aerial Images
Yuchen Shen
Dong Zhang
Zhihao Song
Xuesong Jiang
Qiaolin Ye
6
7
0
05 Apr 2022
Visual Attention Network
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
14
620
0
20 Feb 2022
Patches Are All You Need?
Patches Are All You Need?
Asher Trockman
J. Zico Kolter
ViT
214
395
0
24 Jan 2022
ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer
ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer
Pengfei Guo
Yiqun Mei
Jinyuan Zhou
Shanshan Jiang
Vishal M. Patel
ViT
MedIm
76
61
0
23 Jan 2022
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Xiaohan Ding
Honghao Chen
X. Zhang
Jungong Han
Guiguang Ding
14
68
0
21 Dec 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Yinghui Li
Li Tao
Dun Liang
Haitao Zheng
77
96
0
07 Nov 2021
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel
  Sizes
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes
David W. Romero
Robert-Jan Bruintjes
Jakub M. Tomczak
Erik J. Bekkers
Mark Hoogendoorn
J. C. V. Gemert
74
81
0
15 Oct 2021
MLP-Mixer: An all-MLP Architecture for Vision
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
239
2,554
0
04 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Accelerating Large Kernel Convolutions with Nested Winograd
  Transformation.pdf
Accelerating Large Kernel Convolutions with Nested Winograd Transformation.pdf
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
13
4
0
26 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
Bottleneck Transformers for Visual Recognition
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
267
955
0
27 Jan 2021
RepVGG: Making VGG-style ConvNets Great Again
RepVGG: Making VGG-style ConvNets Great Again
Xiaohan Ding
X. Zhang
Ningning Ma
Jungong Han
Guiguang Ding
Jian-jun Sun
117
1,484
0
11 Jan 2021
On Translation Invariance in CNNs: Convolutional Layers can Exploit
  Absolute Spatial Location
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
J. C. V. Gemert
192
231
0
16 Mar 2020
Deep High-Resolution Representation Learning for Visual Recognition
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
190
3,480
0
20 Aug 2019
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
  Applications
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,214
0
17 Apr 2017
Previous
1234
Next