ResearchTrend.AI
A ConvNet for the 2020s

10 January 2022
Zhuang Liu
Hanzi Mao
Chao-Yuan Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
    ViT
arXiv: 2201.03545

Papers citing "A ConvNet for the 2020s"

50 / 2,184 papers shown
Investigating Neural Architectures by Synthetic Dataset Design
Adrien Courtois
Jean-Michel Morel
Pablo Arias
17
4
0
23 Apr 2022
The 6th AI City Challenge
M. Naphade
Shuo Wang
D. Anastasiu
Zheng Tang
Ming-Ching Chang
...
Stan Sclaroff
Pranamesh Chakraborty
Alice Li
Shangru Li
Rama Chellappa
16
70
0
21 Apr 2022
NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results
Ren Yang
Radu Timofte
Mei Zheng
Qunliang Xing
Minglang Qiao
...
Yulin Huang
Junying Chen
I. Lee
Sunder Ali Khowaja
Jiseok Yoon
SupR
21
33
0
20 Apr 2022
Efficient Architecture Search for Diverse Tasks
Jun Shen
M. Khodak
Ameet Talwalkar
14
30
0
15 Apr 2022
2D Human Pose Estimation: A Survey
Haoming Chen
Runyang Feng
Sifan Wu
Hao Xu
F. Zhou
Zhenguang Liu
3DH
11
55
0
15 Apr 2022
ResT V2: Simpler, Faster and Stronger
Qing-Long Zhang
Yubin Yang
ViT
17
24
0
15 Apr 2022
Neighborhood Attention Transformer
Ali Hassani
Steven Walton
Jiacheng Li
Shengjia Li
Humphrey Shi
ViT
AI4TS
6
248
0
14 Apr 2022
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
21
383
0
14 Apr 2022
Efficient Deep Learning-based Estimation of the Vital Signs on Smartphones
Taha Samavati
Mahdi Farvardin
Aboozar Ghaffari
14
5
0
13 Apr 2022
Localization Distillation for Object Detection
Zhaohui Zheng
Rongguang Ye
Ping Wang
Dongwei Ren
Jun Wang
W. Zuo
Ming-Ming Cheng
19
63
0
12 Apr 2022
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in Histopathology
Maximilian Springenberg
A. Frommholz
M. Wenzel
Eva Weicken
Jackie Ma
Nils Strodthoff
MedIm
14
21
0
11 Apr 2022
Simple Baselines for Image Restoration
Liangyu Chen
Xiaojie Chu
X. Zhang
Jian Sun
43
822
0
10 Apr 2022
Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Yefei He
Luoming Zhang
Weijia Wu
Hong Zhou
MQ
19
2
0
08 Apr 2022
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
17
233
0
07 Apr 2022
The Effects of Regularization and Data Augmentation are Class Dependent
Randall Balestriero
Léon Bottou
Yann LeCun
17
94
0
07 Apr 2022
Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results
T. Ridnik
Hussam Lawen
Emanuel Ben-Baruch
Asaf Noy
25
11
0
07 Apr 2022
Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces
Simon Dahan
Hao Xu
Logan Z. J. Williams
Abdulah Fawaz
Chunhui Yang
...
A. Edwards
M. Glasser
Alistair Young
Daniel Rueckert
E. C. Robinson
ViT
MedIm
11
0
0
07 Apr 2022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Yuxin Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
6
54
0
06 Apr 2022
Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation
Yuyang Zhao
Zhun Zhong
Na Zhao
N. Sebe
G. Lee
19
98
0
06 Apr 2022
Bimodal Distributed Binarized Neural Networks
T. Rozen
Moshe Kimhi
Brian Chmiel
A. Mendelson
Chaim Baskin
MQ
24
4
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
32
621
0
04 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
28
100
0
04 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
19
262
0
04 Apr 2022
Concept Evolution in Deep Learning Training: A Unified Interpretation Framework and Discoveries
Haekyu Park
Seongmin Lee
Benjamin Hoover
Austin P. Wright
Omar Shaikh
Rahul Duggal
Nilaksh Das
Kevin Li
Judy Hoffman
Duen Horng Chau
6
2
0
30 Mar 2022
FlowFormer: A Transformer Architecture for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Ka Chun Cheung
Hongwei Qin
Jifeng Dai
Hongsheng Li
ViT
27
262
0
30 Mar 2022
How Deep is Your Art: An Experimental Study on the Limits of Artistic Understanding in a Single-Task, Single-Modality Neural Network
Mahan Agha Zahedi
Niloofar Gholamrezaei
A. Doboli
6
2
0
30 Mar 2022
SepViT: Separable Vision Transformer
Wei Li
Xing Wang
Xin Xia
Jie Wu
Jiashi Li
Xuefeng Xiao
Min Zheng
Shiping Wen
ViT
16
38
0
29 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
16
3
0
25 Mar 2022
Efficient Visual Tracking via Hierarchical Cross-Attention Transformer
Xin Chen
Ben Kang
D. Wang
Dongdong Li
Huchuan Lu
ViT
12
48
0
25 Mar 2022
The Fixed Sub-Center: A Better Way to Capture Data Complexity
Zhemin Zhang
Xun Gong
11
1
0
24 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge J. Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
11
1,507
0
23 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
22
261
0
22 Mar 2022
A Broad Study of Pre-training for Domain Generalization and Adaptation
Donghyun Kim
Kaihong Wang
Stan Sclaroff
Kate Saenko
OOD
AI4CE
14
78
0
22 Mar 2022
Disentangling Architecture and Training for Optical Flow
Deqing Sun
Charles Herrmann
F. Reda
Michael Rubinstein
David Fleet
William T. Freeman
3DPC
OOD
53
33
0
21 Mar 2022
simCrossTrans: A Simple Cross-Modality Transfer Learning for Object Detection with ConvNets or Vision Transformers
Xiaoke Shen
I. Stamos
ViT
10
5
0
20 Mar 2022
Three things everyone should know about Vision Transformers
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Jakob Verbeek
Hervé Jégou
ViT
8
118
0
18 Mar 2022
Are Vision Transformers Robust to Spurious Correlations?
Soumya Suvra Ghosal
Yifei Ming
Yixuan Li
ViT
17
28
0
17 Mar 2022
Hyperbolic Uncertainty Aware Semantic Segmentation
Bike Chen
Wei Peng
Xiaofeng Cao
Juha Roning
UQCV
14
15
0
16 Mar 2022
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
17
15
0
15 Mar 2022
Fast Autofocusing using Tiny Transformer Networks for Digital Holographic Microscopy
Stéphane Cuenat
Louis Andréoli
Antoine N. André
P. Sandoz
G. Laurent
R. Couturier
M. Jacquot
16
10
0
15 Mar 2022
CAR: Class-aware Regularizations for Semantic Segmentation
Ye Huang
Di Kang
Liang Chen
Xuefei Zhe
W. Jia
Xiangjian He
Linchao Bao
14
16
0
14 Mar 2022
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Yuan Gong
Sameer Khurana
Andrew Rouditchenko
James R. Glass
VLM
11
29
0
13 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
X. Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian Sun
VLM
40
522
0
13 Mar 2022
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
10
15
0
11 Mar 2022
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer
Haokui Zhang
Wenze Hu
Xiaoyu Wang
ViT
28
59
0
08 Mar 2022
Multi-trial Neural Architecture Search with Lottery Tickets
Zimian Wei
H. Pan
Lujun Li
Menglong Lu
Xin-Yi Niu
Peijie Dong
Dongsheng Li
ViT
21
0
0
08 Mar 2022
Monocular Robot Navigation with Self-Supervised Pretrained Vision Transformers
Miguel A. Saavedra-Ruiz
Sacha Morin
Liam Paull
MDE
ViT
17
3
0
07 Mar 2022
Continuous Self-Localization on Aerial Images Using Visual and Lidar Sensors
F. Fervers
Sebastian Bullinger
C. Bodensteiner
Michael Arens
Rainer Stiefelhagen
11
19
0
07 Mar 2022
Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification
Hussam Azzuni
Muhammad Ridzuan
Min Xu
Mohammad Yaqub
27
6
0
03 Mar 2022
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark
Yunhe Gao
Mu Zhou
Ding Liu
Zhennan Yan
Shaoting Zhang
Dimitris N. Metaxas
ViT
MedIm
18
68
0
28 Feb 2022