ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXivPDFHTML

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 718 papers shown
Title
Computer Vision for Primate Behavior Analysis in the Wild
Computer Vision for Primate Behavior Analysis in the Wild
Richard Vogg
Timo Lüddecke
Jonathan Henrich
Sharmita Dey
Matthias Nuske
...
Alexander Gail
Stefan Treue
H. Scherberger
F. Worgotter
Alexander S. Ecker
28
3
0
29 Jan 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie-jin Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
35
378
0
25 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
39
175
0
24 Jan 2024
ChatterBox: Multi-round Multimodal Referring and Grounding
ChatterBox: Multi-round Multimodal Referring and Grounding
Yunjie Tian
Tianren Ma
Lingxi Xie
Jihao Qiu
Xi Tang
Yuan Zhang
Jianbin Jiao
Qi Tian
Qixiang Ye
18
14
0
24 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
31
13
0
23 Jan 2024
Detect-Order-Construct: A Tree Construction based Approach for
  Hierarchical Document Structure Analysis
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis
Jiawei Wang
Kai Hu
Zhuoyao Zhong
Lei-huan Sun
Qiang Huo
25
6
0
22 Jan 2024
Pixel-Wise Recognition for Holistic Surgical Scene Understanding
Pixel-Wise Recognition for Holistic Surgical Scene Understanding
Nicolás Ayobi
Santiago Rodríguez
Alejandra Pérez
Isabela Hernández
Nicolás Aparicio
...
Sebastián Pena
J. Santander
J. Caicedo
Nicolás Fernández
Pablo Arbelaez
ViT
MedIm
29
9
0
20 Jan 2024
Symbol as Points: Panoptic Symbol Spotting via Point-based
  Representation
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
Wenlong Liu
Tianyu Yang
Yuhan Wang
Qizhi Yu
Lei Zhang
3DPC
17
5
0
19 Jan 2024
Stream Query Denoising for Vectorized HD Map Construction
Stream Query Denoising for Vectorized HD Map Construction
Shuo Wang
Fan Jia
Yingfei Liu
Yucheng Zhao
Zehui Chen
Tiancai Wang
Chi Zhang
Xiangyu Zhang
Feng Zhao
23
18
0
17 Jan 2024
Small Object Detection by DETR via Information Augmentation and Adaptive
  Feature Fusion
Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion
Ji Huang
Hui Wang
ViT
19
5
0
16 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
72
1
0
15 Jan 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator
  for Vision Applications
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
Yuwen Xiong
Zhiqi Li
Yuntao Chen
Feng Wang
Xizhou Zhu
...
Hongsheng Li
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
24
49
0
11 Jan 2024
Wasserstein Distance-based Expansion of Low-Density Latent Regions for
  Unknown Class Detection
Wasserstein Distance-based Expansion of Low-Density Latent Regions for Unknown Class Detection
Prakash Mallick
Feras Dayoub
Jamie Sherrah
11
1
0
10 Jan 2024
ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic
  Polyp Detection
ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection
Yuncheng Jiang
Zixun Zhang
Yiwen Hu
Guanbin Li
Xiang Wan
Song Wu
Shuguang Cui
Silin Huang
Zhen Li
11
3
0
10 Jan 2024
Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for
  Memory-Efficient Finetuning
Dr2^22Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Chen Zhao
Shuming Liu
K. Mangalam
Guocheng Qian
Fatimah Zohra
Abdulmohsen Alghannam
Jitendra Malik
Bernard Ghanem
46
3
0
08 Jan 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
MS-DETR: Efficient DETR Training with Mixed Supervision
Chuyang Zhao
Yifan Sun
Wenhao Wang
Qiang Chen
Errui Ding
Yi Yang
Jingdong Wang
MU
25
20
0
08 Jan 2024
Exploiting Polarized Material Cues for Robust Car Detection
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong
Haiyang Mei
Ziqi Wei
Ao Jin
Sen Qiu
Qiang Zhang
Xin Yang
17
1
0
05 Jan 2024
An Open and Comprehensive Pipeline for Unified Object Grounding and
  Detection
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection
Xiangyu Zhao
Yicheng Chen
Shilin Xu
Xiangtai Li
Xinjiang Wang
Yining Li
Haian Huang
ObjD
AI4CE
37
29
0
04 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Xiangtai Li
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
30
14
0
04 Jan 2024
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video
  Grounding
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
18
12
0
31 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive
  Grouping
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
Xin Zhang
Jinheng Xie
Yuan. Yuan
Michael Bi Mi
Robby T. Tan
VOS
OCL
VLM
57
2
0
29 Dec 2023
Transformer-Based Multi-Object Smoothing with Decoupled Data Association
  and Smoothing
Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing
Juliano Pinto
Georg Hess
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
22
3
0
22 Dec 2023
Universal Noise Annotation: Unveiling the Impact of Noisy annotation on
  Object Detection
Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection
Kwang-seok Ryoo
Yeonsik Jo
Seungjun Lee
Mira Kim
Ahra Jo
S. Kim
Seungryong Kim
Soonyoung Lee
NoLa
21
1
0
21 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
26
3
0
18 Dec 2023
MatchDet: A Collaborative Framework for Image Matching and Object
  Detection
MatchDet: A Collaborative Framework for Image Matching and Object Detection
Jinxiang Lai
Wenlong Wu
Bin-Bin Gao
Jun Liu
Jiawei Zhan
Congchong Nie
Yi Zeng
Chengjie Wang
VLM
22
0
0
18 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
35
2
0
16 Dec 2023
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for
  Open-Vocabulary Object Detection
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection
Joonhyun Jeong
Geondo Park
Jayeon Yoo
Hyungsik Jung
Heesu Kim
VLM
ObjD
35
10
0
12 Dec 2023
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object
  Detection
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Hu Zhang
Jianhua Xu
Tao Tang
Haiyang Sun
Xin Yu
Zi Huang
Kaicheng Yu
ObjD
3DPC
33
12
0
12 Dec 2023
Mixed Pseudo Labels for Semi-Supervised Object Detection
Mixed Pseudo Labels for Semi-Supervised Object Detection
Ze-Yi Chen
Wenwei Zhang
Xinjiang Wang
Kai Chen
Zhi Wang
ObjD
29
10
0
12 Dec 2023
A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host
  Detection
A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host Detection
N. Gupta
Zeeshan Hayder
Ray P. Norris
Minh Huynh
Lars Petersson
11
3
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
26
5
0
11 Dec 2023
You Only Learn One Query: Learning Unified Human Query for Single-Stage
  Multi-Person Multi-Task Human-Centric Perception
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
Sheng Jin
Shuhuai Li
Tong Li
Wentao Liu
Chao Qian
Ping Luo
29
5
0
09 Dec 2023
Vision-based Learning for Drones: A Survey
Vision-based Learning for Drones: A Survey
Jiaping Xiao
Rangya Zhang
Yuhang Zhang
Mir Feroskhan
27
3
0
08 Dec 2023
Lyrics: Boosting Fine-grained Language-Vision Alignment and
  Comprehension via Semantic-aware Visual Objects
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects
Junyu Lu
Ruyi Gan
Di Zhang
Xiaojun Wu
Ziwei Wu
Renliang Sun
Jiaxing Zhang
Pingjian Zhang
Yan Song
MLLM
VLM
17
15
0
08 Dec 2023
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Hao Zhang
Hongyang Li
Feng Li
Tianhe Ren
Xueyan Zou
...
Shijia Huang
Jianfeng Gao
Lei Zhang
Chun-yue Li
Jianwei Yang
87
68
0
05 Dec 2023
Lenna: Language Enhanced Reasoning Detection Assistant
Lenna: Language Enhanced Reasoning Detection Assistant
Fei Wei
Xinyu Zhang
Ailing Zhang
Bo-Wen Zhang
Xiangxiang Chu
MLLM
LRM
27
23
0
05 Dec 2023
MobileUtr: Revisiting the relationship between light-weight CNN and
  Transformer for efficient medical image segmentation
MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation
Fenghe Tang
Bingkun Nian
Jianrui Ding
Quan Quan
Jie-jin Yang
Wei Liu
S.Kevin Zhou
ViT
MedIm
23
3
0
04 Dec 2023
Learning Efficient Unsupervised Satellite Image-based Building Damage
  Detection
Learning Efficient Unsupervised Satellite Image-based Building Damage Detection
Yiyun Zhang
Zijian Wang
Yadan Luo
Xin Yu
Zi Huang
18
4
0
04 Dec 2023
DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding
DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding
Uy Dieu Tran
Minh Luu
P. Nguyen
K. Nguyen
Binh-Son Hua
32
1
0
02 Dec 2023
Segment and Caption Anything
Segment and Caption Anything
Xiaoke Huang
Jianfeng Wang
Yansong Tang
Zheng Zhang
Han Hu
Jiwen Lu
Lijuan Wang
Zicheng Liu
MLLM
VLM
26
18
0
01 Dec 2023
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal
  Sentence Grounding in Videos
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Pilhyeon Lee
Hyeran Byun
19
10
0
30 Nov 2023
Language-conditioned Detection Transformer
Language-conditioned Detection Transformer
Jang Hyun Cho
Philipp Krahenbuhl
VLM
ObjD
42
1
0
29 Nov 2023
A Graph-Based Approach for Category-Agnostic Pose Estimation
A Graph-Based Approach for Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
21
10
0
29 Nov 2023
PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with
  Confidence-Level Prediction and Pose Tokens
PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens
Sebastian Stapf
Tobias Bauernfeind
Marco Riboldi
ViT
22
1
0
29 Nov 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Dai Shi
ViT
13
76
0
28 Nov 2023
Stable Segment Anything Model
Stable Segment Anything Model
Qi Fan
Xin Tao
Lei Ke
Mingqiao Ye
Yuanhui Zhang
Pengfei Wan
Zhong-ming Wang
Yu-Wing Tai
Chi-Keung Tang
VLM
20
6
0
27 Nov 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large
  Language Models
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
50
14
0
24 Nov 2023
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
ViT
31
41
0
24 Nov 2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Benjamin Kiefer
Lojze Žust
Matej Kristan
J. Pers
Matija Tersek
...
Magdalena Šumunec
Nadir Kapetanović
A. Michel
Wolfgang Gross
Martin Weinmann
15
4
0
23 Nov 2023
T-Rex: Counting by Visual Prompting
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
16
11
0
22 Nov 2023
Previous
123...8910...131415
Next