ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXivPDFHTML

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 716 papers shown
Title
Boundary-Denoising for Video Activity Localization
Boundary-Denoising for Video Activity Localization
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Bernard Ghanem
19
10
0
06 Apr 2023
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view
  Attention for Multi-view 3D Object Detection
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection
Zhuoling Li
Chuanrui Zhang
Wei-Chiu Ma
Yipin Zhou
Linyan Huang
Haoqian Wang
SerNam Lim
Hengshuang Zhao
15
6
0
03 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via
  Historical Object Prediction
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
29
35
0
03 Apr 2023
Siamese DETR
Siamese DETR
Ze-Sen Chen
Gengshi Huang
Wei Li
Jianing Teng
Kun Wang
Jing Shao
Chen Change Loy
Lu Sheng
ViT
6
8
0
31 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
26
129
0
30 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based
  Real-time Mobile Vision Applications
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming Yang
F. Khan
ViT
35
83
0
27 Mar 2023
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every
  Detection Box
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
105
63
0
23 Mar 2023
Dense Distinct Query for End-to-End Object Detection
Dense Distinct Query for End-to-End Object Detection
Shilong Zhang
Wang xinjiang
Jiaqi Wang
Jiangmiao Pang
Chengqi Lyu
Wenwei Zhang
Ping Luo
Kai-xiang Chen
64
119
0
22 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR
  Perception
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
24
10
0
21 Mar 2023
Detecting Everything in the Open World: Towards Universal Object
  Detection
Detecting Everything in the Open World: Towards Universal Object Detection
Zhenyu Wang
Yali Li
Xi Chen
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
Shengjin Wang
ObjD
VLM
24
76
0
21 Mar 2023
Robust Table Structure Recognition with Dynamic Queries Enhanced
  Detection Transformer
Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer
Jiawei Wang
Weihong Lin
Chixiang Ma
Mingze Li
Zhengmao Sun
Lei-huan Sun
Qiang Huo
LMTD
21
14
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
38
258
0
20 Mar 2023
CCTV-Gun: Benchmarking Handgun Detection in CCTV Images
CCTV-Gun: Benchmarking Handgun Detection in CCTV Images
Srikar Yellapragada
Zhenghong Li
K. Doshi
Purva Mhasakar
Heng Fan
Jieda Wei
Erik P. Blasch
Bin Zhang
Haibin Ling
24
4
0
19 Mar 2023
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Kaixin Xiong
Shi Gong
Xiaoqing Ye
Xiao Tan
Ji Wan
Errui Ding
Jingdong Wang
Xiang Bai
3DPC
23
36
0
17 Mar 2023
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot
  Object Detection
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection
Jiawei Ma
Yulei Niu
Jincheng Xu
Shiyuan Huang
G. Han
Shih-Fu Chang
ObjD
19
36
0
16 Mar 2023
FAQ: Feature Aggregated Queries for Transformer-based Video Object
  Detectors
FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors
Yiming Cui
Linjie Yang
ViT
12
15
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjD
VLM
6
149
0
14 Mar 2023
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li
Ailing Zeng
Siyi Liu
Hao Zhang
Hongyang Li
Lei Zhang
L. Ni
ViT
31
67
0
13 Mar 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
MP-Former: Mask-Piloted Transformer for Image Segmentation
Hao Zhang
Feng Li
Hu-Sheng Xu
Shijia Huang
Siyi Liu
L. Ni
Lei Zhang
ViT
MedIm
11
58
0
13 Mar 2023
Object-Centric Multi-Task Learning for Human Instances
Object-Centric Multi-Task Learning for Human Instances
Hyeongseok Son
Sang-Il Jung
Solae Lee
Seong-heum Kim
Seungsang Park
ByungIn Yoo
3DH
19
0
0
13 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
27
161
0
12 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set
  Object Detection
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
19
1,804
0
09 Mar 2023
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature
  Mimicking
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
Peng Gao
Renrui Zhang
Rongyao Fang
Ziyi Lin
Hongyang Li
Hongsheng Li
Qiao Yu
16
18
0
09 Mar 2023
ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial
  Oriented Object Detection
ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection
Ying Zeng
Yushi Chen
Xue Yang
Qingyun Li
Junchi Yan
37
41
0
09 Mar 2023
Aberration-Aware Depth-from-Focus
Aberration-Aware Depth-from-Focus
Xinge Yang
Qiang Fu
Mohammed Elhoseiny
Wolfgang Heidrich
17
9
0
08 Mar 2023
HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices
HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices
Lotfi Abdelkrim Mecharbat
Hadjer Benmeziane
Hamza Ouarnoughi
Smail Niar
ViT
16
4
0
08 Mar 2023
CLIP-guided Prototype Modulating for Few-shot Action Recognition
CLIP-guided Prototype Modulating for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Jun Cen
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
VLM
6
53
0
06 Mar 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and
  Artificial Scenes
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Xu Ju
Ailing Zeng
Jianan Wang
Qian Xu
Lei Zhang
3DH
20
44
0
05 Mar 2023
DPA-P2PNet: Deformable Proposal-aware P2PNet for Accurate Point-based
  Cell Detection
DPA-P2PNet: Deformable Proposal-aware P2PNet for Accurate Point-based Cell Detection
Zhongyi Shui
S. Zheng
Chenglu Zhu
Shichuan Zhang
Xiaoxuan Yu
Honglin Li
Jingxiong Li
Pingyi Chen
L. Yang
3DPC
27
4
0
05 Mar 2023
Visual Exemplar Driven Task-Prompting for Unified Perception in
  Autonomous Driving
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Xiwen Liang
Minzhe Niu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
VLM
21
13
0
03 Mar 2023
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature
  Augmentation
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation
Rongyao Fang
Peng Gao
Aojun Zhou
Yingjie Cai
Si Liu
Jifeng Dai
Hongsheng Li
ViT
42
9
0
02 Mar 2023
Introducing Depth into Transformer-based 3D Object Detection
Introducing Depth into Transformer-based 3D Object Detection
Hao Zhang
Hongyang Li
Ailing Zeng
Feng Li
Siyi Liu
Xingyu Liao
Lei Zhang
ViT
3DPC
17
1
0
25 Feb 2023
KS-DETR: Knowledge Sharing in Attention Learning for Detection
  Transformer
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer
Kaikai Zhao
Norimichi Ukita
MU
18
1
0
22 Feb 2023
Team DETR: Guide Queries as a Professional Team in Detection
  Transformers
Team DETR: Guide Queries as a Professional Team in Detection Transformers
Tian Qiu
Linyun Zhou
Wenxiang Xu
Lechao Cheng
Zunlei Feng
Min-Gyoo Song
30
4
0
14 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference
  Frames
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
43
32
0
09 Feb 2023
Revisiting Pre-training in Audio-Visual Learning
Revisiting Pre-training in Audio-Visual Learning
Ruoxuan Feng
Wenke Xia
Di Hu
17
1
0
07 Feb 2023
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
Jie-jin Yang
Ailing Zeng
Siyi Liu
Feng Li
Ruimao Zhang
Lei Zhang
14
50
0
03 Feb 2023
AMD: Adaptive Masked Distillation for Object Detection
AMD: Adaptive Masked Distillation for Object Detection
Guang-hong Yang
Yin Tang
Jun Li
Jianhua Xu
Xili Wan
17
6
0
31 Jan 2023
Summarize the Past to Predict the Future: Natural Language Descriptions
  of Context Boost Multimodal Object Interaction Anticipation
Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan-George Pasca
Alexey Gavryushin
Muhammad Hamza
Yen-Ling Kuo
Kaichun Mo
Luc Van Gool
Otmar Hilliges
Xi Wang
22
14
0
22 Jan 2023
Champion Solution for the WSDM2023 Toloka VQA Challenge
Champion Solution for the WSDM2023 Toloka VQA Challenge
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
39
2
0
22 Jan 2023
Raw or Cooked? Object Detection on RAW Images
Raw or Cooked? Object Detection on RAW Images
William Ljungbergh
Joakim Johnander
Christoffer Petersson
M. Felsberg
16
17
0
21 Jan 2023
AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware
  Transformers
AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Xumin Yu
Yongming Rao
Ziyi Wang
Jiwen Lu
Jie Zhou
ViT
23
53
0
11 Jan 2023
End-to-End 3D Dense Captioning with Vote2Cap-DETR
End-to-End 3D Dense Captioning with Vote2Cap-DETR
Sijin Chen
Hongyuan Zhu
Xin Chen
Yinjie Lei
Tao Chen
YU Gang
ViT
19
51
0
06 Jan 2023
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
Junjie Yan
Yingfei Liu
Jian‐Yuan Sun
Fan Jia
Shuailin Li
Tiancai Wang
Xiangyu Zhang
ViT
3DPC
21
54
0
03 Jan 2023
Exploring Vision Transformers as Diffusion Learners
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
L. Zhang
31
10
0
28 Dec 2022
Position-Aware Contrastive Alignment for Referring Image Segmentation
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
17
8
0
27 Dec 2022
Reversible Column Networks
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
29
53
0
22 Dec 2022
Analysis and application of multispectral data for water segmentation
  using machine learning
Analysis and application of multispectral data for water segmentation using machine learning
Shubham Gupta
D. Uma
R. Hebbar
8
0
0
16 Dec 2022
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Hongsheng Li
11
23
0
14 Dec 2022
Previous
123...12131415
Next