ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.07036
  4. Cited By
PnP-DETR: Towards Efficient Visual Analysis with Transformers

PnP-DETR: Towards Efficient Visual Analysis with Transformers

15 September 2021
Tao Wang
Li Yuan
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
    ViT
ArXivPDFHTML

Papers citing "PnP-DETR: Towards Efficient Visual Analysis with Transformers"

43 / 43 papers shown
Title
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer
Wenxi Li
Yuchen Guo
Jilai Zheng
Haozhe Lin
Chao Ma
Lu Fang
Xiaokang Yang
ViT
60
1
0
11 Feb 2025
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
Jingyu Zhang
Yilei Wang
Lang Qian
Peng Sun
Zengwen Li
Sudong Jiang
Maolin Liu
Liang Song
93
1
0
14 Dec 2024
Learning High-resolution Vector Representation from Multi-Camera Images
  for 3D Object Detection
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
46
1
0
22 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
21
2
0
16 Jul 2024
Enhancing DETRs Variants through Improved Content Query and Similar
  Query Aggregation
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
32
2
0
06 May 2024
A Hybrid Approach for Document Layout Analysis in Document images
A Hybrid Approach for Document Layout Analysis in Document images
Tahira Shehzadi
Didier Stricker
Muhammad Zeshan Afzal
29
5
0
27 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object
  Detection
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
36
12
0
02 Apr 2024
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote
  Sensing Image Understanding
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Run Shao
Zhaoyang Zhang
Chao Tao
Yunsheng Zhang
Chengli Peng
Haifeng Li
VLM
35
4
0
27 Mar 2024
Salience DETR: Enhancing Detection Transformer with Hierarchical
  Salience Filtering Refinement
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Xiuquan Hou
Meiqin Liu
Senlin Zhang
Ping Wei
Badong Chen
37
22
0
24 Mar 2024
Decoupled DETR: Spatially Disentangling Localization and Classification
  for Improved End-to-End Object Detection
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
Manyuan Zhang
Guanglu Song
Yu Liu
Hongsheng Li
14
14
0
24 Oct 2023
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive
  Sparse Anchor Generation
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
Sheng-Hsiang Fu
Junkai Yan
Yipeng Gao
Xiaohua Xie
Wei-Shi Zheng
18
6
0
18 Aug 2023
SODFormer: Streaming Object Detection with Transformer Using Events and
  Frames
SODFormer: Streaming Object Detection with Transformer Using Events and Frames
Dianze Li
Jianing Li
Yonghong Tian
ViT
14
26
0
08 Aug 2023
Less is More: Focus Attention for Efficient DETR
Less is More: Focus Attention for Efficient DETR
Dehua Zheng
Wenhui Dong
Hailin Hu
Xinghao Chen
Yunhe Wang
19
57
0
24 Jul 2023
Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose
  Estimation from Image Sequence
Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence
Yang Tian
Jiyao Zhang
Zekai Yin
Hao Dong
25
9
0
22 Jul 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection
Cascade-DETR: Delving into High-Quality Universal Object Detection
Mingqiao Ye
Lei Ke
Siyuan Li
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
F. I. F. Richard Yu
45
32
0
20 Jul 2023
Box-DETR: Understanding and Boxing Conditional Spatial Queries
Box-DETR: Understanding and Boxing Conditional Spatial Queries
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
ViT
26
2
0
17 Jul 2023
Joint Microseismic Event Detection and Location with a Detection
  Transformer
Joint Microseismic Event Detection and Location with a Detection Transformer
Yuanyuan Yang
C. Birnie
T. Alkhalifah
28
1
0
16 Jul 2023
Single-Stage Visual Relationship Learning using Conditional Queries
Single-Stage Visual Relationship Learning using Conditional Queries
Alakh Desai
Tz-Ying Wu
Subarna Tripathi
Nuno Vasconcelos
22
7
0
09 Jun 2023
Object Detection with Transformers: A Review
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViT
MU
13
27
0
07 Jun 2023
Adapting Pre-trained Language Models to Vision-Language Tasks via
  Dynamic Visual Prompting
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting
Shubin Huang
Qiong Wu
Yiyi Zhou
Weijie Chen
Rongsheng Zhang
Xiaoshuai Sun
Rongrong Ji
VLM
VPVLM
LRM
16
0
0
01 Jun 2023
Star-Net: Improving Single Image Desnowing Model With More Efficient
  Connection and Diverse Feature Interaction
Star-Net: Improving Single Image Desnowing Model With More Efficient Connection and Diverse Feature Interaction
Jia-ju Mao
Yuan Chang
Xuesong Yin
Binling Nie
28
1
0
17 Mar 2023
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li
Ailing Zeng
Siyi Liu
Hao Zhang
Hongyang Li
Lei Zhang
L. Ni
ViT
31
67
0
13 Mar 2023
What Makes for Good Tokenizers in Vision Transformer?
What Makes for Good Tokenizers in Vision Transformer?
Shengju Qian
Yi Zhu
Wenbo Li
Mu Li
Jiaya Jia
ViT
29
13
0
21 Dec 2022
Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object
  Detection
Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object Detection
Shih-Ping Wang
Xiaohui Jiang
Ying Li
3DPC
23
18
0
11 Dec 2022
DATE: Dual Assignment for End-to-End Fully Convolutional Object
  Detection
DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection
Yiqun Chen
Qiang Chen
Qinghao Hu
Jian Cheng
16
7
0
25 Nov 2022
Knowledge Distillation for Detection Transformer with Consistent
  Distillation Points Sampling
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
Yu Wang
Xin Li
Shengzhao Wen
Fu-En Yang
Wanping Zhang
Gang Zhang
Haocheng Feng
Junyu Han
Errui Ding
37
5
0
15 Nov 2022
TokenMixup: Efficient Attention-guided Token-level Data Augmentation for
  Transformers
TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers
Hyeong Kyu Choi
Joonmyung Choi
Hyunwoo J. Kim
ViT
21
35
0
14 Oct 2022
CD-FSOD: A Benchmark for Cross-domain Few-shot Object Detection
CD-FSOD: A Benchmark for Cross-domain Few-shot Object Detection
Wuti Xiong
56
13
0
11 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without
  Fine-tuning
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
17
25
0
03 Oct 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for
  Visual Recognition
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
14
9
0
31 Aug 2022
Towards Efficient Use of Multi-Scale Features in Transformer-Based
  Object Detectors
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang
Zhipeng Luo
Zichen Tian
Yingchen Yu
Jingyi Zhang
Shijian Lu
28
26
0
24 Aug 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale
  Feature Fusion
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric P. Xing
ViT
34
19
0
28 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
13
31
0
18 Jul 2022
Polar Parametrization for Vision-based Surround-View 3D Detection
Polar Parametrization for Vision-based Surround-View 3D Detection
Shaoyu Chen
Xinggang Wang
Tianheng Cheng
Qian Zhang
Chang Huang
Wenyu Liu
3DPC
17
68
0
22 Jun 2022
Future Object Detection with Spatiotemporal Transformers
Future Object Detection with Spatiotemporal Transformers
Adam Tonderski
Joakim Johnander
Christoffer Petersson
Kalle AAstrom
ViT
23
0
0
21 Apr 2022
Not All Tokens Are Equal: Human-centric Visual Analysis via Token
  Clustering Transformer
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Wang Zeng
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Ouyang Wanli
Xiaogang Wang
ViT
16
119
0
19 Apr 2022
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction
  Detection
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Bumsoo Kim
Jonghwan Mun
Kyoung-Woon On
Minchul Shin
Junhyun Lee
Eun-Sol Kim
26
50
0
28 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
20
28
0
24 Mar 2022
Towards Data-Efficient Detection Transformers
Towards Data-Efficient Detection Transformers
Wen Wang
Jing Zhang
Yang Cao
Yongliang Shen
Dacheng Tao
ViT
16
56
0
17 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary
  Detection
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
39
14
0
01 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
23
328
0
16 Feb 2022
Embracing Single Stride 3D Object Detector with Sparse Transformer
Embracing Single Stride 3D Object Detector with Sparse Transformer
Lue Fan
Ziqi Pang
Tianyuan Zhang
Yu-xiong Wang
Hang Zhao
Feng Wang
Naiyan Wang
Zhaoxiang Zhang
ViT
14
252
0
13 Dec 2021
A Survey of Visual Transformers
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
66
330
0
11 Nov 2021
1