ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.08248
  4. Cited By
How Much Position Information Do Convolutional Neural Networks Encode?

How Much Position Information Do Convolutional Neural Networks Encode?

International Conference on Learning Representations (ICLR), 2020
22 January 2020
Md. Amirul Islam
Sen Jia
Neil D. B. Bruce
    SSL
ArXiv (abs)PDFHTML

Papers citing "How Much Position Information Do Convolutional Neural Networks Encode?"

50 / 173 papers shown
Preventing Shortcuts in Adapter Training via Providing the Shortcuts
Preventing Shortcuts in Adapter Training via Providing the Shortcuts
Anujraaj Goyal
Guocheng Qian
Huseyin Coskun
Aarush Gupta
Himmy Tam
...
Ju Hu
Dhritiman Sagar
Sergey Tulyakov
Kfir Aberman
Kuan-Chieh Wang
155
3
0
23 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
252
0
0
02 Oct 2025
ARMA Block: A CNN-Based Autoregressive and Moving Average Module for Long-Term Time Series Forecasting
ARMA Block: A CNN-Based Autoregressive and Moving Average Module for Long-Term Time Series Forecasting
Myung Jin Kim
Yeonghyeon Park
I. Yun
AI4TS
123
0
0
12 Sep 2025
Encoder-Only Image Registration
Encoder-Only Image Registration
Xiang Chen
Renjiu Hu
Jinwei Zhang
Y. Zhang
Xinyao Yue
Min Liu
Yaonan Wang
Hang Zhang
262
3
0
30 Aug 2025
The Next Layer: Augmenting Foundation Models with Structure-Preserving and Attention-Guided Learning for Local Patches to Global Context Awareness in Computational Pathology
The Next Layer: Augmenting Foundation Models with Structure-Preserving and Attention-Guided Learning for Local Patches to Global Context Awareness in Computational Pathology
Muhammad Waqas
Rukhmini Bandyopadhyay
Eman Showkatian
Amgad Muneer
Anas Zafar
...
John Heymach
Natalie I Vokes
Luisa Maren Solis Soto
Jianjun Zhang
Jia Wu
MedIm
169
1
0
27 Aug 2025
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
Ryan Ramos
Vladan Stojnić
Giorgos Kordopatis-Zilos
Yuta Nakashima
Giorgos Tolias
Noa Garcia
230
3
0
14 Aug 2025
On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation
On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation
Liyao Tang
Zhe Chen
Dacheng Tao
3DPC
452
0
0
28 May 2025
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
363
4
0
28 Apr 2025
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
Feng Zhou
Pu Cao
Yiyang Ma
Pu Cao
Jianqin Yin
DiffM
392
3
0
12 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
326
3
0
07 Mar 2025
LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding
LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding
Shen Zhang
Yaning Tan
Yaning Tan
Zhaowei Chen
Linze Li
...
Shuheng Li
Zhenyu Zhao
Caihua Chen
Jiajun Liang
Yao Tang
547
1
0
06 Mar 2025
Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction
Comply: Learning Sentences with Complex Weights inspired by Fruit Fly OlfactionNeuro Inspired Computational Elements Workshop (NICE), 2025
Alexei Figueroa
Justus Westerhoff
Golzar Atefi
Dennis Fast
B. Winter
Felix Alexader Gers
Alexander Loser
Wolfang Nejdl
622
2
0
03 Feb 2025
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position EncodingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Ziyang Chen
Mingxiao Li
Zhongfu Chen
Nan Du
Xiaolong Li
Yuexian Zou
441
4
0
19 Jan 2025
TableTime: Reformulating Time Series Classification as Training-Free Table Understanding with Large Language Models
TableTime: Reformulating Time Series Classification as Training-Free Table Understanding with Large Language Models
Jiangming Wang
Mingyue Cheng
Qingyang Mao
Qi Liu
F. Xu
Xin Li
Tong Xu
X. Li
AI4TSLMTD
538
0
0
24 Nov 2024
PtychoFormer: A Transformer-based Model for Ptychographic Phase
  Retrieval
PtychoFormer: A Transformer-based Model for Ptychographic Phase Retrieval
Ryuma Nakahata
Shehtab Zaman
Mingyuan Zhang
Fake Lu
Kenneth Chiu
232
3
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
484
15
0
22 Oct 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion
  Transformers
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Enze Xie
Junsong Chen
Junyu Chen
Han Cai
Haotian Tang
...
Zhekai Zhang
Zhekai Zhang
Ligeng Zhu
Yaojie Lu
Song Han
VLM
437
264
0
14 Oct 2024
HTR-VT: Handwritten Text Recognition with Vision Transformer
HTR-VT: Handwritten Text Recognition with Vision TransformerPattern Recognition (Pattern Recogn.), 2024
Yuting Li
Dexiong Chen
Tinglong Tang
Xi Shen
ViT
237
55
0
13 Sep 2024
Searching for Effective Preprocessing Method and CNN-based Architecture
  with Efficient Channel Attention on Speech Emotion Recognition
Searching for Effective Preprocessing Method and CNN-based Architecture with Efficient Channel Attention on Speech Emotion RecognitionScientific Reports (Sci Rep), 2024
Byunggun Kim
Younghun Kwon
220
0
0
06 Sep 2024
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary
  Identification in High-Resolution Remote Sensing Images
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing ImagesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Lulin Li
Ben Chen
Xuechao Zou
Junliang Xing
Pin Tao
Mamba
554
9
0
05 Sep 2024
An Investigation on The Position Encoding in Vision-Based Dynamics
  Prediction
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
Jiageng Zhu
Hanchen Xie
Jiazhi Li
Mahyar Khayatkhoei
Wael AbdAlmageed
261
1
0
27 Aug 2024
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis
Jian-Qing Zheng
Yuanhan Mo
Yang Sun
Jiahua Li
Fuping Wu
Ziyang Wang
Tonia Vincent
Bartłomiej W. Papież
MedImDiffM
339
8
0
10 Jul 2024
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation
  Model
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model
Zhuo Zheng
Stefano Ermon
Dongjun Kim
Liangpei Zhang
Yanfei Zhong
DiffM
297
61
0
26 Jun 2024
Wound Tissue Segmentation in Diabetic Foot Ulcer Images Using Deep
  Learning: A Pilot Study
Wound Tissue Segmentation in Diabetic Foot Ulcer Images Using Deep Learning: A Pilot Study
Mou Deb
Chuanbo Wang
Yash Patel
Taiyu Zhang
J. Niezgoda
Sandeep Gopalakrishnan
Keke Chen
Zeyun Yu
204
5
0
23 Jun 2024
Region-aware Grasp Framework with Normalized Grasp Space for Efficient
  6-DoF Grasping
Region-aware Grasp Framework with Normalized Grasp Space for Efficient 6-DoF Grasping
Siang Chen
Pengwei Xie
Wei Tang
Dingchang Hu
Yixiang Dai
Guijin Wang
312
0
0
03 Jun 2024
Pseudo Channel: Time Embedding for Motor Imagery Decoding
Pseudo Channel: Time Embedding for Motor Imagery Decoding
Zhengqing Miao
Meirong Zhao
250
1
0
21 May 2024
CSTA: CNN-based Spatiotemporal Attention for Video Summarization
CSTA: CNN-based Spatiotemporal Attention for Video Summarization
Jaewon Son
Jaehun Park
Kwangsu Kim
AI4TSViT
393
28
0
20 May 2024
Towards Gradient-based Time-Series Explanations through a SpatioTemporal
  Attention Network
Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Min Hun Lee
AI4TSViTFAtt
246
4
0
18 May 2024
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for
  Medical Image Segmentation
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation
Md Mostafijur Rahman
Mustafa Munir
R. Marculescu
MedIm
430
259
0
11 May 2024
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
Mustafa Munir
William Avery
Md Mostafijur Rahman
R. Marculescu
GNN
235
36
0
10 May 2024
Gasformer: A Transformer-based Architecture for Segmenting Methane
  Emissions from Livestock in Optical Gas Imaging
Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging
Toqi Tahamid Sarker
M. Embaby
Khaled R Ahmed
A. AbuGhazaleh
207
13
0
16 Apr 2024
SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation
SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation
Shehan Perera
Pouyan Navard
Alper Yilmaz
MedIm
292
83
0
15 Apr 2024
Accuracy enhancement method for speech emotion recognition from
  spectrogram using temporal frequency correlation and positional information
  learning through knowledge transfer
Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer
Jeongho Kim
Seung-Ho Lee
190
11
0
26 Mar 2024
Spectral Norm of Convolutional Layers with Circular and Zero Paddings
Spectral Norm of Convolutional Layers with Circular and Zero Paddings
Blaise Delattre
Quentin Barthélemy
Alexandre Allauzen
411
3
0
31 Jan 2024
End-to-end Multi-Instance Robotic Reaching from Monocular Vision
End-to-end Multi-Instance Robotic Reaching from Monocular VisionIEEE International Conference on Robotics and Automation (ICRA), 2021
Zheyu Zhuang
Xin Yu
Robert E. Mahony
243
1
0
22 Jan 2024
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View
  Stereo
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View StereoInternational Conference on Learning Representations (ICLR), 2024
Chenjie Cao
Xinlin Ren
Yanwei Fu
269
62
0
22 Jan 2024
CoordGate: Efficiently Computing Spatially-Varying Convolutions in
  Convolutional Neural Networks
CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural NetworksBritish Machine Vision Conference (BMVC), 2024
S. Howard
P. Norreys
Andreas Döpp
274
5
0
09 Jan 2024
Graph Neural Networks with Diverse Spectral Filtering
Graph Neural Networks with Diverse Spectral FilteringThe Web Conference (WWW), 2023
Jingwei Guo
Kaizhu Huang
Xinping Yi
Rui Zhang
459
19
0
14 Dec 2023
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera
  Parameters via Ground Plane Embedding
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding
Karlo Koledić
Luka V. Petrović
Ivan Petrović
Ivan Marković
MDE
351
2
0
10 Dec 2023
Hacking Task Confounder in Meta-Learning
Hacking Task Confounder in Meta-LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Wenwen Qiang
Yi Ren
Changwen Zheng
Xingzhe Su
Changwen Zheng
Jingyao Wang
CML
639
9
0
10 Dec 2023
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D
  Scene Generation
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Qihang Zhang
Yinghao Xu
Yujun Shen
Bo Dai
Bolei Zhou
Ceyuan Yang
302
6
0
04 Dec 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
TransNeXt: Robust Foveal Visual Perception for Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Dai Shi
ViT
447
321
0
28 Nov 2023
Spatially Covariant Image Registration with Text Prompts
Spatially Covariant Image Registration with Text PromptsIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
375
24
0
27 Nov 2023
Vision Big Bird: Random Sparsification for Full Attention
Vision Big Bird: Random Sparsification for Full Attention
Zhemin Zhang
Xun Gong
ViT
225
1
0
10 Nov 2023
G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D
  Medical Image Segmentation
G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D Medical Image SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Md Mostafijur Rahman
R. Marculescu
MedIm
269
70
0
24 Oct 2023
ObjFormer: Learning Land-Cover Changes From Paired OSM Data and Optical
  High-Resolution Imagery via Object-Guided Transformer
ObjFormer: Learning Land-Cover Changes From Paired OSM Data and Optical High-Resolution Imagery via Object-Guided TransformerIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Hongruixuan Chen
Cuiling Lan
Jian Song
Clifford Broni-Bediako
Junshi Xia
Xiangwei Zhu
342
45
0
04 Oct 2023
Imperceptible Adversarial Attack on Deep Neural Networks from Image
  Boundary
Imperceptible Adversarial Attack on Deep Neural Networks from Image Boundary
Fahad Alrasheedi
Agnibh Dasgupta
AAML
279
2
0
29 Aug 2023
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Radio2Text: Streaming Speech Recognition Using mmWave Radio SignalsProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2023
Running Zhao
Jiang-Tao Luca Yu
Haiying Zhao
Edith C.H. Ngai
292
12
0
16 Aug 2023
On the Interplay of Convolutional Padding and Adversarial Robustness
On the Interplay of Convolutional Padding and Adversarial Robustness
Paul Gavrikov
J. Keuper
AAML
389
4
0
12 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
386
6
0
10 Aug 2023
1234
Next
Page 1 of 4