arXiv:2203.03605
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 716 papers shown
Boundary-Denoising for Video Activity Localization
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Bernard Ghanem
19
10
0
06 Apr 2023
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection
Zhuoling Li
Chuanrui Zhang
Wei-Chiu Ma
Yipin Zhou
Linyan Huang
Haoqian Wang
SerNam Lim
Hengshuang Zhao
15
6
0
03 Apr 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Zhuofan Zong
Dong Jiang
Guanglu Song
Zeyue Xue
Jingyong Su
Hongsheng Li
Yu Liu
29
35
0
03 Apr 2023
Siamese DETR
Ze-Sen Chen
Gengshi Huang
Wei Li
Jianing Teng
Kun Wang
Jing Shao
Chen Change Loy
Lu Sheng
ViT
6
8
0
31 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
26
129
0
30 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming Yang
F. Khan
ViT
35
83
0
27 Mar 2023
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
105
63
0
23 Mar 2023
Dense Distinct Query for End-to-End Object Detection
Shilong Zhang
Xinjiang Wang
Jiaqi Wang
Jiangmiao Pang
Chengqi Lyu
Wenwei Zhang
Ping Luo
Kai-xiang Chen
64
119
0
22 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
24
10
0
21 Mar 2023
Detecting Everything in the Open World: Towards Universal Object Detection
Zhenyu Wang
Yali Li
Xi Chen
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
Shengjin Wang
ObjD
VLM
24
76
0
21 Mar 2023
Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer
Jiawei Wang
Weihong Lin
Chixiang Ma
Mingze Li
Zhengmao Sun
Lei-huan Sun
Qiang Huo
LMTD
21
14
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
38
258
0
20 Mar 2023
CCTV-Gun: Benchmarking Handgun Detection in CCTV Images
Srikar Yellapragada
Zhenghong Li
K. Doshi
Purva Mhasakar
Heng Fan
Jieda Wei
Erik P. Blasch
Bin Zhang
Haibin Ling
24
4
0
19 Mar 2023
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Kaixin Xiong
Shi Gong
Xiaoqing Ye
Xiao Tan
Ji Wan
Errui Ding
Jingdong Wang
Xiang Bai
3DPC
23
36
0
17 Mar 2023
DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection
Jiawei Ma
Yulei Niu
Jincheng Xu
Shiyuan Huang
G. Han
Shih-Fu Chang
ObjD
19
36
0
16 Mar 2023
FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors
Yiming Cui
Linjie Yang
ViT
12
15
0
15 Mar 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang
Feng Li
Xueyan Zou
Siyi Liu
Chun-yue Li
Jianfeng Gao
Jianwei Yang
Lei Zhang
ObjD
VLM
6
149
0
14 Mar 2023
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
Feng Li
Ailing Zeng
Siyi Liu
Hao Zhang
Hongyang Li
Lei Zhang
L. Ni
ViT
31
67
0
13 Mar 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
Hao Zhang
Feng Li
Hu-Sheng Xu
Shijia Huang
Siyi Liu
L. Ni
Lei Zhang
ViT
MedIm
11
58
0
13 Mar 2023
Object-Centric Multi-Task Learning for Human Instances
Hyeongseok Son
Sang-Il Jung
Solae Lee
Seong-heum Kim
Seungsang Park
ByungIn Yoo
3DH
19
0
0
13 Mar 2023
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
27
161
0
12 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
19
1,804
0
09 Mar 2023
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
Peng Gao
Renrui Zhang
Rongyao Fang
Ziyi Lin
Hongyang Li
Hongsheng Li
Qiao Yu
16
18
0
09 Mar 2023
ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection
Ying Zeng
Yushi Chen
Xue Yang
Qingyun Li
Junchi Yan
37
41
0
09 Mar 2023
Aberration-Aware Depth-from-Focus
Xinge Yang
Qiang Fu
Mohammed Elhoseiny
Wolfgang Heidrich
17
9
0
08 Mar 2023
HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices
Lotfi Abdelkrim Mecharbat
Hadjer Benmeziane
Hamza Ouarnoughi
Smail Niar
ViT
16
4
0
08 Mar 2023
CLIP-guided Prototype Modulating for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Jun Cen
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
VLM
6
53
0
06 Mar 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Xu Ju
Ailing Zeng
Jianan Wang
Qian Xu
Lei Zhang
3DH
20
44
0
05 Mar 2023
DPA-P2PNet: Deformable Proposal-aware P2PNet for Accurate Point-based Cell Detection
Zhongyi Shui
S. Zheng
Chenglu Zhu
Shichuan Zhang
Xiaoxuan Yu
Honglin Li
Jingxiong Li
Pingyi Chen
L. Yang
3DPC
27
4
0
05 Mar 2023
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Xiwen Liang
Minzhe Niu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
VLM
21
13
0
03 Mar 2023
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation
Rongyao Fang
Peng Gao
Aojun Zhou
Yingjie Cai
Si Liu
Jifeng Dai
Hongsheng Li
ViT
42
9
0
02 Mar 2023
Introducing Depth into Transformer-based 3D Object Detection
Hao Zhang
Hongyang Li
Ailing Zeng
Feng Li
Siyi Liu
Xingyu Liao
Lei Zhang
ViT
3DPC
17
1
0
25 Feb 2023
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer
Kaikai Zhao
Norimichi Ukita
MU
18
1
0
22 Feb 2023
Team DETR: Guide Queries as a Professional Team in Detection Transformers
Tian Qiu
Linyun Zhou
Wenxiang Xu
Lechao Cheng
Zunlei Feng
Min-Gyoo Song
30
4
0
14 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
43
32
0
09 Feb 2023
Revisiting Pre-training in Audio-Visual Learning
Ruoxuan Feng
Wenke Xia
Di Hu
17
1
0
07 Feb 2023
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
Jie-jin Yang
Ailing Zeng
Siyi Liu
Feng Li
Ruimao Zhang
Lei Zhang
14
50
0
03 Feb 2023
AMD: Adaptive Masked Distillation for Object Detection
Guang-hong Yang
Yin Tang
Jun Li
Jianhua Xu
Xili Wan
17
6
0
31 Jan 2023
Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan-George Pasca
Alexey Gavryushin
Muhammad Hamza
Yen-Ling Kuo
Kaichun Mo
Luc Van Gool
Otmar Hilliges
Xi Wang
22
14
0
22 Jan 2023
Champion Solution for the WSDM2023 Toloka VQA Challenge
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
39
2
0
22 Jan 2023
Raw or Cooked? Object Detection on RAW Images
William Ljungbergh
Joakim Johnander
Christoffer Petersson
M. Felsberg
16
17
0
21 Jan 2023
AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers
Xumin Yu
Yongming Rao
Ziyi Wang
Jiwen Lu
Jie Zhou
ViT
23
53
0
11 Jan 2023
End-to-End 3D Dense Captioning with Vote2Cap-DETR
Sijin Chen
Hongyuan Zhu
Xin Chen
Yinjie Lei
Tao Chen
Gang Yu
ViT
19
51
0
06 Jan 2023
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
Junjie Yan
Yingfei Liu
Jian-Yuan Sun
Fan Jia
Shuailin Li
Tiancai Wang
Xiangyu Zhang
ViT
3DPC
21
54
0
03 Jan 2023
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
L. Zhang
31
10
0
28 Dec 2022
Position-Aware Contrastive Alignment for Referring Image Segmentation
Bo Chen
Zhiwei Hu
Zhilong Ji
Jinfeng Bai
W. Zuo
17
8
0
27 Dec 2022
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
29
53
0
22 Dec 2022
Analysis and application of multispectral data for water segmentation using machine learning
Shubham Gupta
D. Uma
R. Hebbar
8
0
0
16 Dec 2022
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
Benjin Zhu
Zhe Wang
Shaoshuai Shi
Hang Xu
Lanqing Hong
Hongsheng Li
11
23
0
14 Dec 2022