Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2001.08248
Cited By
How Much Position Information Do Convolutional Neural Networks Encode?
International Conference on Learning Representations (ICLR), 2020
22 January 2020
Md. Amirul Islam
Sen Jia
Neil D. B. Bruce
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"How Much Position Information Do Convolutional Neural Networks Encode?"
50 / 173 papers shown
Preventing Shortcuts in Adapter Training via Providing the Shortcuts
Anujraaj Goyal
Guocheng Qian
Huseyin Coskun
Aarush Gupta
Himmy Tam
...
Ju Hu
Dhritiman Sagar
Sergey Tulyakov
Kfir Aberman
Kuan-Chieh Wang
155
3
0
23 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
252
0
0
02 Oct 2025
ARMA Block: A CNN-Based Autoregressive and Moving Average Module for Long-Term Time Series Forecasting
Myung Jin Kim
Yeonghyeon Park
I. Yun
AI4TS
123
0
0
12 Sep 2025
Encoder-Only Image Registration
Xiang Chen
Renjiu Hu
Jinwei Zhang
Y. Zhang
Xinyao Yue
Min Liu
Yaonan Wang
Hang Zhang
262
3
0
30 Aug 2025
The Next Layer: Augmenting Foundation Models with Structure-Preserving and Attention-Guided Learning for Local Patches to Global Context Awareness in Computational Pathology
Muhammad Waqas
Rukhmini Bandyopadhyay
Eman Showkatian
Amgad Muneer
Anas Zafar
...
John Heymach
Natalie I Vokes
Luisa Maren Solis Soto
Jianjun Zhang
Jia Wu
MedIm
169
1
0
27 Aug 2025
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
Ryan Ramos
Vladan Stojnić
Giorgos Kordopatis-Zilos
Yuta Nakashima
Giorgos Tolias
Noa Garcia
230
3
0
14 Aug 2025
On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation
Liyao Tang
Zhe Chen
Dacheng Tao
3DPC
452
0
0
28 May 2025
A Comparative Study on Positional Encoding for Time-frequency Domain Dual-path Transformer-based Source Separation Models
Kohei Saijo
Tetsuji Ogawa
363
4
0
28 Apr 2025
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
Feng Zhou
Pu Cao
Yiyang Ma
Pu Cao
Jianqin Yin
DiffM
392
3
0
12 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
326
3
0
07 Mar 2025
LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding
Shen Zhang
Yaning Tan
Yaning Tan
Zhaowei Chen
Linze Li
...
Shuheng Li
Zhenyu Zhao
Caihua Chen
Jiajun Liang
Yao Tang
547
1
0
06 Mar 2025
Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction
Neuro Inspired Computational Elements Workshop (NICE), 2025
Alexei Figueroa
Justus Westerhoff
Golzar Atefi
Dennis Fast
B. Winter
Felix Alexader Gers
Alexander Loser
Wolfang Nejdl
622
2
0
03 Feb 2025
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ziyang Chen
Mingxiao Li
Zhongfu Chen
Nan Du
Xiaolong Li
Yuexian Zou
441
4
0
19 Jan 2025
TableTime: Reformulating Time Series Classification as Training-Free Table Understanding with Large Language Models
Jiangming Wang
Mingyue Cheng
Qingyang Mao
Qi Liu
F. Xu
Xin Li
Tong Xu
X. Li
AI4TS
LMTD
538
0
0
24 Nov 2024
PtychoFormer: A Transformer-based Model for Ptychographic Phase Retrieval
Ryuma Nakahata
Shehtab Zaman
Mingyuan Zhang
Fake Lu
Kenneth Chiu
232
3
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
484
15
0
22 Oct 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Enze Xie
Junsong Chen
Junyu Chen
Han Cai
Haotian Tang
...
Zhekai Zhang
Zhekai Zhang
Ligeng Zhu
Yaojie Lu
Song Han
VLM
437
264
0
14 Oct 2024
HTR-VT: Handwritten Text Recognition with Vision Transformer
Pattern Recognition (Pattern Recogn.), 2024
Yuting Li
Dexiong Chen
Tinglong Tang
Xi Shen
ViT
237
55
0
13 Sep 2024
Searching for Effective Preprocessing Method and CNN-based Architecture with Efficient Channel Attention on Speech Emotion Recognition
Scientific Reports (Sci Rep), 2024
Byunggun Kim
Younghun Kwon
220
0
0
06 Sep 2024
UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Lulin Li
Ben Chen
Xuechao Zou
Junliang Xing
Pin Tao
Mamba
554
9
0
05 Sep 2024
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction
Jiageng Zhu
Hanchen Xie
Jiazhi Li
Mahyar Khayatkhoei
Wael AbdAlmageed
261
1
0
27 Aug 2024
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis
Jian-Qing Zheng
Yuanhan Mo
Yang Sun
Jiahua Li
Fuping Wu
Ziyang Wang
Tonia Vincent
Bartłomiej W. Papież
MedIm
DiffM
339
8
0
10 Jul 2024
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model
Zhuo Zheng
Stefano Ermon
Dongjun Kim
Liangpei Zhang
Yanfei Zhong
DiffM
297
61
0
26 Jun 2024
Wound Tissue Segmentation in Diabetic Foot Ulcer Images Using Deep Learning: A Pilot Study
Mou Deb
Chuanbo Wang
Yash Patel
Taiyu Zhang
J. Niezgoda
Sandeep Gopalakrishnan
Keke Chen
Zeyun Yu
204
5
0
23 Jun 2024
Region-aware Grasp Framework with Normalized Grasp Space for Efficient 6-DoF Grasping
Siang Chen
Pengwei Xie
Wei Tang
Dingchang Hu
Yixiang Dai
Guijin Wang
312
0
0
03 Jun 2024
Pseudo Channel: Time Embedding for Motor Imagery Decoding
Zhengqing Miao
Meirong Zhao
250
1
0
21 May 2024
CSTA: CNN-based Spatiotemporal Attention for Video Summarization
Jaewon Son
Jaehun Park
Kwangsu Kim
AI4TS
ViT
393
28
0
20 May 2024
Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Min Hun Lee
AI4TS
ViT
FAtt
246
4
0
18 May 2024
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation
Md Mostafijur Rahman
Mustafa Munir
R. Marculescu
MedIm
430
259
0
11 May 2024
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
Mustafa Munir
William Avery
Md Mostafijur Rahman
R. Marculescu
GNN
235
36
0
10 May 2024
Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging
Toqi Tahamid Sarker
M. Embaby
Khaled R Ahmed
A. AbuGhazaleh
207
13
0
16 Apr 2024
SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation
Shehan Perera
Pouyan Navard
Alper Yilmaz
MedIm
292
83
0
15 Apr 2024
Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer
Jeongho Kim
Seung-Ho Lee
190
11
0
26 Mar 2024
Spectral Norm of Convolutional Layers with Circular and Zero Paddings
Blaise Delattre
Quentin Barthélemy
Alexandre Allauzen
411
3
0
31 Jan 2024
End-to-end Multi-Instance Robotic Reaching from Monocular Vision
IEEE International Conference on Robotics and Automation (ICRA), 2021
Zheyu Zhuang
Xin Yu
Robert E. Mahony
243
1
0
22 Jan 2024
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
International Conference on Learning Representations (ICLR), 2024
Chenjie Cao
Xinlin Ren
Yanwei Fu
269
62
0
22 Jan 2024
CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks
British Machine Vision Conference (BMVC), 2024
S. Howard
P. Norreys
Andreas Döpp
274
5
0
09 Jan 2024
Graph Neural Networks with Diverse Spectral Filtering
The Web Conference (WWW), 2023
Jingwei Guo
Kaizhu Huang
Xinping Yi
Rui Zhang
459
19
0
14 Dec 2023
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding
Karlo Koledić
Luka V. Petrović
Ivan Petrović
Ivan Marković
MDE
351
2
0
10 Dec 2023
Hacking Task Confounder in Meta-Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Wenwen Qiang
Yi Ren
Changwen Zheng
Xingzhe Su
Changwen Zheng
Jingyao Wang
CML
639
9
0
10 Dec 2023
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Qihang Zhang
Yinghao Xu
Yujun Shen
Bo Dai
Bolei Zhou
Ceyuan Yang
302
6
0
04 Dec 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Dai Shi
ViT
447
321
0
28 Nov 2023
Spatially Covariant Image Registration with Text Prompts
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Xiang Chen
Min Liu
Rongguang Wang
Renjiu Hu
Dongdong Liu
Gaolei Li
Hang Zhang
MedIm
375
24
0
27 Nov 2023
Vision Big Bird: Random Sparsification for Full Attention
Zhemin Zhang
Xun Gong
ViT
225
1
0
10 Nov 2023
G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D Medical Image Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Md Mostafijur Rahman
R. Marculescu
MedIm
269
70
0
24 Oct 2023
ObjFormer: Learning Land-Cover Changes From Paired OSM Data and Optical High-Resolution Imagery via Object-Guided Transformer
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Hongruixuan Chen
Cuiling Lan
Jian Song
Clifford Broni-Bediako
Junshi Xia
Xiangwei Zhu
342
45
0
04 Oct 2023
Imperceptible Adversarial Attack on Deep Neural Networks from Image Boundary
Fahad Alrasheedi
Agnibh Dasgupta
AAML
279
2
0
29 Aug 2023
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2023
Running Zhao
Jiang-Tao Luca Yu
Haiying Zhao
Edith C.H. Ngai
292
12
0
16 Aug 2023
On the Interplay of Convolutional Padding and Adversarial Robustness
Paul Gavrikov
J. Keuper
AAML
389
4
0
12 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
386
6
0
10 Aug 2023
1
2
3
4
Next
Page 1 of 4