ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXivPDFHTML

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 718 papers shown
Title
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset
Abdelrahman Abdallah
Mahmoud Abdalla
M. Kasem
Mohamed Mahmoud
Ibrahim Abdelhalim
Mohamed Elkasaby
Yasser Elbendary
Adam Jatowt
23
0
0
06 Jun 2024
Learning 1D Causal Visual Representation with De-focus Attention
  Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
Chenxin Tao
Xizhou Zhu
Shiqian Su
Lewei Lu
Changyao Tian
...
Gao Huang
Hongsheng Li
Yu Qiao
Jie Zhou
Jifeng Dai
60
1
0
06 Jun 2024
Parameter-Inverted Image Pyramid Networks
Parameter-Inverted Image Pyramid Networks
Xizhou Zhu
Xue Yang
Zhaokai Wang
Hao Li
Wenhan Dou
Junqi Ge
Lewei Lu
Yu Qiao
Jifeng Dai
47
0
0
06 Jun 2024
Matching Anything by Segmenting Anything
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
29
22
0
06 Jun 2024
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
Qiang Chen
Xiangbo Su
Xinyu Zhang
Jian Wang
Jiahui Chen
...
Shan Zhang
Kun Yao
Errui Ding
Gang Zhang
Jingdong Wang
ViT
42
11
0
05 Jun 2024
Global Clipper: Enhancing Safety and Reliability of Transformer-based
  Object Detection Models
Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models
Qutub Syed Sha
Michael Paulitsch
Karthik Pattabiraman
Korbinian Hagn
Fabian Oboril
Cornelius Buerkle
Kay-Ulrich Scholl
Gereon Hinz
Alois C. Knoll
27
0
0
05 Jun 2024
Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution
  Detection using Budding Ensemble Architecture for Object Detection
Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection
Qutub Syed
Michael Paulitsch
Korbinian Hagn
Neslihan Kose Cihangir
Kay-Ulrich Scholl
Fabian Oboril
Gereon Hinz
Alois C. Knoll
OODD
46
1
0
05 Jun 2024
MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class
  Min-Margin Contrastive Learning for Superior Prohibited Item Detection
MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection
Mingyuan Li
Tong Jia
Hui Lu
Bowen Ma
Hao Wang
Dongyue Chen
18
1
0
05 Jun 2024
EgoSurgery-Tool: A Dataset of Surgical Tool and Hand Detection from
  Egocentric Open Surgery Videos
EgoSurgery-Tool: A Dataset of Surgical Tool and Hand Detection from Egocentric Open Surgery Videos
Ryo Fujii
Hideo Saito
Hiroki Kajita
27
4
0
05 Jun 2024
Parrot: Multilingual Visual Instruction Tuning
Parrot: Multilingual Visual Instruction Tuning
Hai-Long Sun
Da-Wei Zhou
Y. Li
Shiyin Lu
Chao Yi
...
Zhao Xu
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
MLLM
23
9
0
04 Jun 2024
ELSA: Evaluating Localization of Social Activities in Urban Streets
ELSA: Evaluating Localization of Social Activities in Urban Streets
Maryam Hosseini
Marco Cipriano
Sedigheh Eslami
Daniel Hodczak
Liu Liu
Andres Sevtsuk
Gerard de Melo
26
0
0
03 Jun 2024
CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship
  Modeling in Aerial Videos
CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos
Trong-Thuan Nguyen
Pha Nguyen
Xin Li
Jackson Cothren
Alper Yilmaz
Khoa Luu
38
3
0
03 Jun 2024
Diversifying Query: Region-Guided Transformer for Temporal Sentence
  Grounding
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding
Xiaolong Sun
Liushuai Shi
Le Wang
Sanpin Zhou
Kun Xia
Yabing Wang
Gang Hua
21
2
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
31
4
0
30 May 2024
Towards Unified Multi-granularity Text Detection with Interactive
  Attention
Towards Unified Multi-granularity Text Detection with Interactive Attention
Xingyu Wan
Chengquan Zhang
Pengyuan Lyu
Sen Fan
Zihan Ni
Kun Yao
Errui Ding
Jingdong Wang
60
1
0
30 May 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for
  Autonomous Driving
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
32
0
0
29 May 2024
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and
  Open-World Unknown Objects Supervision
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang
Bin Chen
Bin Kang
Yulin Li
Yichi Chen
Weizhi Xian
Huifeng Chang
VLM
ObjD
23
7
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
48
4
0
28 May 2024
Understanding differences in applying DETR to natural and medical images
Understanding differences in applying DETR to natural and medical images
Yanqi Xu
Yiqiu Shen
C. Fernandez‐Granda
Laura Heacock
Krzysztof J. Geras
MedIm
54
2
0
27 May 2024
The SkatingVerse Workshop & Challenge: Methods and Results
The SkatingVerse Workshop & Challenge: Methods and Results
Jian Zhao
Lei Jin
Jianshu Li
Zheng Zhu
Yinglei Teng
...
Shiníchi Satoh
Yandong Guo
Cewu Lu
Junliang Xing
Jane Shengmei Shen
AI4TS
20
0
0
27 May 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for
  Universal Visual Grounding
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-cong Chen
ObjD
MLLM
VLM
32
4
0
27 May 2024
Activator: GLU Activation Function as the Core Component of a Vision
  Transformer
Activator: GLU Activation Function as the Core Component of a Vision Transformer
Abdullah Nazhat Abdullah
Tarkan Aydin
ViT
38
0
0
24 May 2024
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object
  Detection Method
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method
Pan Liao
Feng Yang
Di Wu
Liu Bo
32
1
0
24 May 2024
YOLOv10: Real-Time End-to-End Object Detection
YOLOv10: Real-Time End-to-End Object Detection
Ao Wang
Hui Chen
Lihao Liu
Kai Chen
Zijia Lin
Jungong Han
Guiguang Ding
3DH
35
893
0
23 May 2024
Context and Geometry Aware Voxel Transformer for Semantic Scene
  Completion
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhuopu Yu
Runmin Zhang
Jiacheng Ying
Junchen Yu
Xiaohai Hu
Lun Luo
Siyuan Cao
Hui-Liang Shen
ViT
49
12
0
22 May 2024
Active Object Detection with Knowledge Aggregation and Distillation from
  Large Models
Active Object Detection with Knowledge Aggregation and Distillation from Large Models
Dejie Yang
Yang Liu
35
3
0
21 May 2024
DATR: Unsupervised Domain Adaptive Detection Transformer with
  Dataset-Level Adaptation and Prototypical Alignment
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment
Jianhong Han
Liang Chen
Yupei Wang
ViT
44
2
0
20 May 2024
DLAFormer: An End-to-End Transformer For Document Layout Analysis
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DV
ViT
22
3
0
20 May 2024
Track Anything Rapter(TAR)
Track Anything Rapter(TAR)
Tharun V. Puthanveettil
Fnu Obaid ur Rahman
24
0
0
19 May 2024
Visible and Clear: Finding Tiny Objects in Difference Map
Visible and Clear: Finding Tiny Objects in Difference Map
Bing Cao
Haiyu Yao
Pengfei Zhu
Qinghua Hu
ObjD
30
3
0
18 May 2024
Open-Vocabulary Spatio-Temporal Action Detection
Open-Vocabulary Spatio-Temporal Action Detection
Tao Wu
Shuqiu Ge
Jie Qin
Gangshan Wu
Limin Wang
ObjD
23
5
0
17 May 2024
A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells
  Detection with Morphological Attributes for Explainability
A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability
Abdul Rehman
Talha Meraj
A. Minhas
Ayisha Imran
Mohsen Ali
Waqas Sultani
24
2
0
17 May 2024
Better Sampling, towards Better End-to-end Small Object Detection
Better Sampling, towards Better End-to-end Small Object Detection
Zile Huang
Chong Zhang
Mingyu Jin
Fangyu Wu
Chengzhi Liu
Xiaobo Jin
ObjD
39
0
0
17 May 2024
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Tianhe Ren
Qing Jiang
Shilong Liu
Zhaoyang Zeng
Wenlong Liu
...
Hao Zhang
Feng Li
Peijun Tang
Kent Yu
Lei Zhang
ObjD
VLM
29
33
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks
  via Multi-modal Large Language Models
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
12
0
16 May 2024
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal
  Candidiasis Screening
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening
Yan Kong
Sheng Wang
Jiangdong Cai
Zihao Zhao
Zhenrong Shen
Yonghao Li
Manman Fei
Qian Wang
23
2
0
15 May 2024
MetaFruit Meets Foundation Models: Leveraging a Comprehensive
  Multi-Fruit Dataset for Advancing Agricultural Foundation Models
MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models
Jiajia Li
Kyle Lammers
Xunyuan Yin
Xiang Yin
Long He
Renfu Lu
Zhaojian Li
25
3
0
14 May 2024
Wild Berry image dataset collected in Finnish forests and peatlands using drones
Wild Berry image dataset collected in Finnish forests and peatlands using drones
Luigi Riz
Sergio Povoli
Andrea Caraffa
Davide Boscaini
M. L. Mekhalfi
...
Elisa Castelli
Giacomo Piccinini
L. Marchesotti
Micael S. Couceiro
Fabio Poiesi
29
1
0
13 May 2024
Replication Study and Benchmarking of Real-Time Object Detection Models
Replication Study and Benchmarking of Real-Time Object Detection Models
Pierre-Luc Asselin
Vincent Coulombe
William Guimont-Martin
William Larrivée-Hardy
38
0
0
11 May 2024
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted
  Object Detection Models?
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?
Engin Uzun
Erdem Akagündüz
30
0
0
10 May 2024
Prompt When the Animal is: Temporal Animal Behavior Grounding with
  Positional Recovery Training
Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training
Sheng Yan
Xin Du
Zongying Li
Yi Wang
Hongcang Jin
Mengyuan Liu
OOD
VLM
27
0
0
09 May 2024
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via
  Editable Gaussian Splatting
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
O. Shorinwa
Johnathan Tucker
Aliyah Smith
Aiden Swann
Timothy Chen
Roya Firoozi
Monroe Kennedy
Mac Schwager
29
22
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
79
36
0
06 May 2024
Enhancing DETRs Variants through Improved Content Query and Similar
  Query Aggregation
Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Yingying Zhang
Chuangji Shi
Xin Guo
Jiangwei Lao
Jian Wang
Jiaotuan Wang
Jingdong Chen
32
2
0
06 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM
MQ
26
12
0
06 May 2024
Multi-method Integration with Confidence-based Weighting for Zero-shot
  Image Classification
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Siqi Yin
Lifan Jiang
22
0
0
03 May 2024
Towards Consistent Object Detection via LiDAR-Camera Synergy
Towards Consistent Object Detection via LiDAR-Camera Synergy
Kai Luo
Hao Wu
Kefu Yi
Kailun Yang
Wei Hao
Rongdong Hu
38
1
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
34
6
0
02 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Hongsheng Li
EGVM
68
16
0
01 May 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target
  Identification with Large Multimodal Models
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Hongzhan Lin
Zixin Chen
Ziyang Luo
Mingfei Cheng
Jing Ma
Guang Chen
31
6
0
01 May 2024
Previous
123...567...131415
Next