Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.07036
Cited By
PnP-DETR: Towards Efficient Visual Analysis with Transformers
15 September 2021
Tao Wang
Li Yuan
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PnP-DETR: Towards Efficient Visual Analysis with Transformers"
43 / 43 papers shown
Title
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer
Wenxi Li
Yuchen Guo
Jilai Zheng
Haozhe Lin
Chao Ma
Lu Fang
Xiaokang Yang
ViT
60
1
0
11 Feb 2025
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
Jingyu Zhang
Yilei Wang
Lang Qian
Peng Sun
Zengwen Li
Sudong Jiang
Maolin Liu
Liang Song
93
1
0
14 Dec 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
46
1
0
22 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
21
2
0
16 Jul 2024
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
32
2
0
06 May 2024
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
29
5
0
27 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
36
12
0
02 Apr 2024
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Run Shao
Zhaoyang Zhang
Chao Tao
Yunsheng Zhang
Chengli Peng
Haifeng Li
VLM
35
4
0
27 Mar 2024
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Xiuquan Hou
Meiqin Liu
Senlin Zhang
Ping Wei
Badong Chen
37
22
0
24 Mar 2024
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
Manyuan Zhang
Guanglu Song
Yu Liu
Hongsheng Li
14
14
0
24 Oct 2023
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
Sheng-Hsiang Fu
Junkai Yan
Yipeng Gao
Xiaohua Xie
Wei-Shi Zheng
18
6
0
18 Aug 2023
SODFormer: Streaming Object Detection with Transformer Using Events and Frames
Dianze Li
Jianing Li
Yonghong Tian
ViT
14
26
0
08 Aug 2023
Less is More: Focus Attention for Efficient DETR
Dehua Zheng
Wenhui Dong
Hailin Hu
Xinghao Chen
Yunhe Wang
19
57
0
24 Jul 2023
Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence
Yang Tian
Jiyao Zhang
Zekai Yin
Hao Dong
25
9
0
22 Jul 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection
Mingqiao Ye
Lei Ke
Siyuan Li
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
F. I. F. Richard Yu
45
32
0
20 Jul 2023
Box-DETR: Understanding and Boxing Conditional Spatial Queries
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
ViT
26
2
0
17 Jul 2023
Joint Microseismic Event Detection and Location with a Detection Transformer
Yuanyuan Yang
C. Birnie
T. Alkhalifah
28
1
0
16 Jul 2023
Single-Stage Visual Relationship Learning using Conditional Queries
Alakh Desai
Tz-Ying Wu
Subarna Tripathi
Nuno Vasconcelos
22
7
0
09 Jun 2023
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
13
27
0
07 Jun 2023
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting
Shubin Huang
Qiong Wu
Yiyi Zhou
Weijie Chen
Rongsheng Zhang
Xiaoshuai Sun
Rongrong Ji
VLM
VPVLM
LRM
16
0
0
01 Jun 2023
Star-Net: Improving Single Image Desnowing Model With More Efficient Connection and Diverse Feature Interaction
Jia-ju Mao
Yuan Chang
Xuesong Yin
Binling Nie
28
1
0
17 Mar 2023
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li
Ailing Zeng
Siyi Liu
Hao Zhang
Hongyang Li
Lei Zhang
L. Ni
ViT
31
67
0
13 Mar 2023
What Makes for Good Tokenizers in Vision Transformer?
Shengju Qian
Yi Zhu
Wenbo Li
Mu Li
Jiaya Jia
ViT
29
13
0
21 Dec 2022
Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object Detection
Shih-Ping Wang
Xiaohui Jiang
Ying Li
3DPC
23
18
0
11 Dec 2022
DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection
Yiqun Chen
Qiang Chen
Qinghao Hu
Jian Cheng
16
7
0
25 Nov 2022
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
Yu Wang
Xin Li
Shengzhao Wen
Fu-En Yang
Wanping Zhang
Gang Zhang
Haocheng Feng
Junyu Han
Errui Ding
37
5
0
15 Nov 2022
TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers
Hyeong Kyu Choi
Joonmyung Choi
Hyunwoo J. Kim
ViT
21
35
0
14 Oct 2022
CD-FSOD: A Benchmark for Cross-domain Few-shot Object Detection
Wuti Xiong
56
13
0
11 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
17
25
0
03 Oct 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
14
9
0
31 Aug 2022
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang
Zhipeng Luo
Zichen Tian
Yingchen Yu
Jingyi Zhang
Shijian Lu
28
26
0
24 Aug 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric P. Xing
ViT
34
19
0
28 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
13
31
0
18 Jul 2022
Polar Parametrization for Vision-based Surround-View 3D Detection
Shaoyu Chen
Xinggang Wang
Tianheng Cheng
Qian Zhang
Chang Huang
Wenyu Liu
3DPC
17
68
0
22 Jun 2022
Future Object Detection with Spatiotemporal Transformers
Adam Tonderski
Joakim Johnander
Christoffer Petersson
Kalle AAstrom
ViT
23
0
0
21 Apr 2022
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Wang Zeng
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Ouyang Wanli
Xiaogang Wang
ViT
16
119
0
19 Apr 2022
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Bumsoo Kim
Jonghwan Mun
Kyoung-Woon On
Minchul Shin
Junhyun Lee
Eun-Sol Kim
26
50
0
28 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
20
28
0
24 Mar 2022
Towards Data-Efficient Detection Transformers
Wen Wang
Jing Zhang
Yang Cao
Yongliang Shen
Dacheng Tao
ViT
16
56
0
17 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
39
14
0
01 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
23
328
0
16 Feb 2022
Embracing Single Stride 3D Object Detector with Sparse Transformer
Lue Fan
Ziqi Pang
Tianyuan Zhang
Yu-xiong Wang
Hang Zhao
Feng Wang
Naiyan Wang
Zhaoxiang Zhang
ViT
14
252
0
13 Dec 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
66
330
0
11 Nov 2021
1