Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding

IEEE International Conference on Computer Vision (ICCV), 2021
29 March 2021
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
    ViT

Papers citing "Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding"

50 / 197 papers shown
CoMA: Complementary Masking and Hierarchical Dynamic Multi-Window Self-Attention in a Unified Pre-training Framework
Jiaxuan Li
Qing Xu
Xiangjian He
Ziyu Liu
Chang Xing
Zhen Chen
Daokun Zhang
Rong Qu
Chang Wen Chen
64
0
0
08 Nov 2025
HRTFformer: A Spatially-Aware Transformer for Personalized HRTF Upsampling in Immersive Audio Rendering
Xuyi Hu
Jian Li
Shaojie Zhang
Stefan Goetz
L. Picinali
Ozgur B. Akan
Aidan O. T. Hogg
73
0
0
02 Oct 2025
Saccadic Vision for Fine-Grained Visual Classification
Johann Schmidt
Sebastian Stober
Joachim Denzler
P. Bodesheim
108
0
0
19 Sep 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
1.0K
2
0
06 May 2025
Learning from Noisy Labels with Contrastive Co-Transformer
Yan Han
S. Roy
Mehrtash Harandi
L. Petersson
NoLa
231
1
0
04 Mar 2025
HSI: A Holistic Style Injector for Arbitrary Style Transfer
Computer Vision and Pattern Recognition (CVPR), 2025
Shuhao Zhang
Hui Kang
Yang Liu
Fang Mei
Hongjuan Li
101
1
0
05 Feb 2025
Multi-Exposure Image Fusion via Distilled 3D LUT Grid with Editable Mode
Xin Su
Zhuoran Zheng
201
0
0
18 Dec 2024
MuSiCNet: A Gradual Coarse-to-Fine Framework for Irregularly Sampled Multivariate Time Series Analysis
Jiexi Liu
Meng Cao
Songcan Chen
AI4TS
233
2
0
02 Dec 2024
Breaking the Low-Rank Dilemma of Linear Attention
Computer Vision and Pattern Recognition (CVPR), 2024
Qihang Fan
Huaibo Huang
Ran He
371
12
0
12 Nov 2024
Don't Look Twice: Faster Video Transformers with Run-Length Tokenization
Neural Information Processing Systems (NeurIPS), 2024
Rohan Choudhury
Guanglei Zhu
Sihan Liu
Koichiro Niinuma
Kishore Venkateshan
László A. Jeni
205
24
0
07 Nov 2024
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
Xin Fei
Wenzhao Zheng
Yueqi Duan
Weidong Zhan
Masayoshi Tomizuka
Kurt Keutzer
Jiwen Lu
3DGS
237
14
0
24 Oct 2024
On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods
British Machine Vision Conference (BMVC), 2024
Hariprasath Govindarajan
Per Sidén
Jacob Roll
Fredrik Lindsten
165
4
0
17 Oct 2024
Efficient Partitioning Vision Transformer on Edge Devices for Distributed Inference
IEEE International Conference on Distributed Computing Systems (ICDCS), 2024
Xiang Liu
Yijun Song
Xia Li
Yifei Sun
Huiying Lan
Zemin Liu
Linshan Jiang
Jialin Li
192
1
0
15 Oct 2024
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
Neural Information Processing Systems (NeurIPS), 2024
Fei Xie
Weijia Zhang
Zhongdao Wang
Chao Ma
Mamba
256
17
0
09 Oct 2024
3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos
Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024
Meiyu Qiu
Yongqian Li
Wenjun Huang
Haoyun Zhang
Weiping Zheng
Wenbin Lei
Xiaomao Fan
105
0
0
02 Sep 2024
Squid: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Wei Chen
Zhiyuan Li
Shuo Xin
Yihao Wang
206
6
0
28 Aug 2024
Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs
International Conference on Information and Knowledge Management (CIKM), 2024
Sanjay Bhargav Dharavath
Tanmoy Dam
Supriyo Chakraborty
Prithwiraj Roy
Aniruddha Maiti
ViT
151
1
0
20 Aug 2024
Graph Transformers: A Survey
Ahsan Shehzad
Xiwei Xu
Shagufta Abid
Ciyuan Peng
Shuo Yu
Dongyu Zhang
Karin Verspoor
AI4CE
342
36
0
13 Jul 2024
An Organism Starts with a Single Pix-Cell: A Neural Cellular Diffusion for High-Resolution Image Synthesis
Marawan Elbatel
Konstantinos Kamnitsas
Xuelong Li
MedIm
DiffM
182
2
0
03 Jul 2024
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
193
1
0
27 Jun 2024
Matryoshka Query Transformer for Large Vision-Language Models
Wenbo Hu
Zi-Yi Dou
Liunian Harold Li
Amita Kamath
Nanyun Peng
Kai-Wei Chang
MLLM
234
23
0
29 May 2024
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs
Zhenyu Bai
Pranav Dangi
Huize Li
Tulika Mitra
251
14
0
27 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
Chong Chen
Zhebin Zhang
Chen Li
Tianfu Wu
269
7
0
20 May 2024
Compression-Realized Deep Structural Network for Video Quality Enhancement
Hanchi Sun
Xiaohong Liu
Xinyang Jiang
Yifei Shen
Dongsheng Li
Xiongkuo Min
Guangtao Zhai
201
2
0
10 May 2024
Visual Mamba: A Survey and New Outlooks
Rui Xu
Shu Yang
Yihui Wang
Yu Cai
Bo Du
Hao Chen
Mamba
351
46
0
29 Apr 2024
Sparse Reconstruction of Optical Doppler Tomography with Alternative State Space Model and Attention
Zhenghong Li
Jiaxiang Ren
Wensheng Cheng
C. Du
Yingtian Pan
Haibin Ling
H. Ling
178
0
0
26 Apr 2024
Adaptive Patching for High-resolution Image Segmentation with Transformers
Enzhi Zhang
Isaac Lyngaas
Peng Chen
Xiao Wang
Jun Igarashi
Yuankai Huo
Mohamed Wahib
M. Munetomo
MedIm
144
4
0
15 Apr 2024
Bidirectional Long-Range Parser for Sequential Data Understanding
George Leotescu
Daniel Voinea
A. Popa
185
1
0
08 Apr 2024
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping
Hyeongjun Kwon
Jinhyun Jang
Jin-Hwa Kim
Kwonyoung Kim
Kwanghoon Sohn
298
7
0
01 Apr 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
239
4
0
26 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
276
160
0
26 Mar 2024
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta
Shufan Li
Tyler Lixuan Zhu
Jitendra Malik
Trevor Darrell
K. Mangalam
ViT
178
7
0
04 Mar 2024
Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers
Markus Hiller
Krista A. Ehinger
Tom Drummond
271
7
0
19 Feb 2024
Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain
Amin Karimi Monsefi
Payam Karisani
Mengxi Zhou
Stacey S. Choi
Nathan Doble
Heng Ji
Srinivasan Parthasarathy
R. Ramnath
228
7
0
09 Feb 2024
MSHyper: Multi-Scale Hypergraph Transformer for Long-Range Time Series Forecasting
Zongjiang Shang
Ling Chen
AI4TS
120
8
0
17 Jan 2024
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization
IEEE Transactions on Image Processing (TIP), 2024
Qi Bi
Wei Ji
Jingjun Yi
Haolan Zhan
Gui-Song Xia
434
3
0
16 Jan 2024
Deformable Audio Transformer for Audio Event Detection
Wentao Zhu
151
0
0
24 Dec 2023
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
Computer Vision and Pattern Recognition (CVPR), 2023
Xin Guo
Jiangwei Lao
Bo Dang
Yingying Zhang
Lei Yu
...
Jian Wang
Jingdong Chen
Ming Yang
Yongjun Zhang
Yansheng Li
314
211
0
15 Dec 2023
Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Haolin Qin
Daquan Zhou
Tingfa Xu
Ziyang Bian
Jianan Li
182
13
0
14 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
700
1
0
01 Dec 2023
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Lichao Sun
Jiangliu Wang
Yibing Song
Ping Luo
302
30
0
26 Nov 2023
GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Xuwei Xu
Sen Wang
Yudong Chen
Yanping Zheng
Zhewei Wei
Jiajun Liu
ViT
262
18
0
06 Nov 2023
CCMR: High Resolution Optical Flow Estimation via Coarse-to-Fine Context-Guided Motion Reasoning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Azin Jahedi
Maximilian Luz
Marc Rivinius
Andrés Bruhn
164
11
0
05 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Neural Information Processing Systems (NeurIPS), 2023
Badri N. Patro
Vijay Srinivas Agneeswaran
342
27
0
02 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
ViT
436
84
0
30 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2023
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
333
8
0
10 Oct 2023
Latent Wander: an Alternative Interface for Interactive and Serendipitous Discovery of Large AV Archives
Yuchen Yang
Linyida Zhang
182
2
0
09 Oct 2023
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2023
Shiyue Cao
Yueqin Yin
Lianghua Huang
Yu Liu
Xin Zhao
Deli Zhao
Kaiqi Huang
ViT
216
28
0
09 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows
International Conference on Learning Representations (ICLR), 2023
Vincent Leroy
Jérôme Revaud
Thomas Lucas
Philippe Weinzaepfel
ViT
227
6
0
01 Oct 2023
Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Adam Pardyl
Grzegorz Kurzejamski
Jan Olszewski
Tomasz Trzciński
Bartosz Zieliński
133
3
0
23 Sep 2023