Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.03605
Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 716 papers shown
Title
Enhancing, Refining, and Fusing: Towards Robust Multi-Scale and Dense Ship Detection
Congxia Zhao
Xiongjun Fu
Jian Dong
Shen Cao
Chunyan Zhang
36
1
0
10 Jan 2025
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao
Wei Wei
Dangyang Chen
Yuanyuan Fu
48
0
0
10 Jan 2025
Supervision-free Vision-Language Alignment
Giorgio Giannone
Ruoteng Li
Qianli Feng
Evgeny Perevodchikov
Rui Chen
Aleix M. Martinez
VLM
58
0
0
08 Jan 2025
Siamese-DETR for Generic Multi-Object Tracking
Qiankun Liu
Yichen Li
Yuqi Jiang
Ying Fu
VOT
55
7
0
08 Jan 2025
ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization
Kourosh Darvish
Marta Skreta
Yuchi Zhao
Naruki Yoshikawa
Sagnik Som
...
Han Hao
Haoping Xu
Alán Aspuru-Guzik
Animesh Garg
Florian Shkurti
52
21
0
08 Jan 2025
SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network
Zhaoxu Li
Wei An
Gaowei Guo
Longguang Wang
Yingqian Wang
Zaiping Lin
ViT
73
0
0
03 Jan 2025
ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing
Yuka Ogino
Yuho Shoji
Takahiro Toizumi
Atsushi Ito
39
1
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
46
3
0
31 Dec 2024
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Xiaoyang Liu
Boran Wen
Xinpeng Liu
Zizheng Zhou
Hongwei Fan
Cewu Lu
Lizhuang Ma
Yulong Chen
Y. Li
48
2
0
27 Dec 2024
To Predict or Not To Predict? Proportionally Masked Autoencoders for Tabular Data Imputation
Jungkyu Kim
Kibok Lee
Taeyoung Park
25
0
0
26 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
35
0
0
23 Dec 2024
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks
Yaming Zhang
Chenqiang Gao
Fangcen Liu
Junjie Guo
Lan Wang
Xinggan Peng
Deyu Meng
87
0
0
21 Dec 2024
OmniPrism: Learning Disentangled Visual Concept for Image Generation
Yangyang Li
Daqing Liu
Wu Liu
Allen He
Xinchen Liu
Yongdong Zhang
Guoqing Jin
DiffM
CoGe
71
0
0
16 Dec 2024
Predicting the Original Appearance of Damaged Historical Documents
Zhenhua Yang
Dezhi Peng
Yongxin Shi
Yuyi Zhang
Chongyu Liu
Lianwen Jin
64
2
0
16 Dec 2024
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
Zhaoyang Sun
Shengwu Xiong
Yaxiong Chen
Fei Du
Weihua Chen
Fan Wang
Yi Rong
DiffM
71
1
0
15 Dec 2024
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Yang Yang
Wenjuan Xi
Luping Zhou
Jinhui Tang
74
0
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
75
0
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
69
1
0
13 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
70
0
0
06 Dec 2024
Towards Real-Time Open-Vocabulary Video Instance Segmentation
Bin Yan
Martin Sundermeyer
D. Tan
Huchuan Lu
F. Tombari
VLM
VOS
86
1
0
05 Dec 2024
DEIM: DETR with Improved Matching for Fast Convergence
Shihua Huang
Zhichao Lu
Xiaodong Cun
Yongjun Yu
Xiao Zhou
Xi Shen
VLM
117
2
0
05 Dec 2024
MeasureNet: Measurement Based Celiac Disease Identification
Aayush Kumar Tyagi
Vaibhav Mishra
Ashok Tiwari
Lalita Mehra
Prasenjit Das
G. Makharia
Prathosh AP
Mausam
75
0
0
02 Dec 2024
HandOS: 3D Hand Reconstruction in One Stage
Xingyu Chen
Zhuheng Song
Xiaoke Jiang
Yaoqing Hu
Junzhi Yu
Lei Zhang
3DH
HAI
69
0
0
02 Dec 2024
BGM: Background Mixup for X-ray Prohibited Items Detection
W. Liu
R. Tao
Hongguang Zhu
Yunda Sun
Yao Zhao
Y. X. Wei
63
0
0
30 Nov 2024
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
80
0
0
30 Nov 2024
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Felix Fent
Gerhard Rigoll
77
0
0
29 Nov 2024
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video
Jinyuan Qu
Hongyang Li
Shilong Liu
Tianhe Ren
Zhaoyang Zeng
Lei Zhang
3DPC
72
1
0
27 Nov 2024
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
Qing Jiang
Gen Luo
Yuqin Yang
Yuda Xiong
Yihao Chen
Zhaoyang Zeng
Tianhe Ren
Lei Zhang
VLM
LRM
105
6
0
27 Nov 2024
Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Peng Cui
Guande He
Dan Zhang
Zhijie Deng
Yinpeng Dong
Jun Zhu
72
0
0
26 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjD
VLM
63
3
0
25 Nov 2024
Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
69
0
0
25 Nov 2024
VideoOrion: Tokenizing Object Dynamics in Videos
Yicheng Feng
Yijiang Li
Wanpeng Zhang
Sipeng Zheng
Zongqing Lu
Sipeng Zheng
Zongqing Lu
101
1
0
25 Nov 2024
DT-LSD: Deformable Transformer-based Line Segment Detection
Sebastian Janampa
Marios Pattichis
ViT
62
0
0
20 Nov 2024
CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation
Shipeng Liu
Liang Zhao
Dengfeng Chen
SSL
110
1
0
19 Nov 2024
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images
Lars Nieradzik
Henrike Stephani
Jördis Sieburg-Rockel
Stephanie Helmling
Andrea Olbrich
Stephanie Wrage
J. Keuper
66
0
0
18 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
45
0
0
16 Nov 2024
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning
Jingru Yang
Huan Yu
Yang Jingxin
C. Xu
Yin Biao
Yu Sun
Shengfeng He
23
0
0
15 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
58
1
0
12 Nov 2024
White-Box Diffusion Transformer for single-cell RNA-seq generation
Zhuorui Cui
Shengze Dong
Ding Liu
23
1
0
11 Nov 2024
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Yifan Wang
X. Yang
Fanqi Pu
Q. Liao
Wenming Yang
31
0
0
05 Nov 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
35
1
0
31 Oct 2024
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
20
0
0
30 Oct 2024
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
VLM
23
3
0
25 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
Rajat Modi
Vibhav Vineet
Y. S. Rawat
33
1
0
25 Oct 2024
DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection
Qingpeng Li
Yuxin Zhang
Leyuan Fang
Yuhan Kang
Shutao Li
Xiao Xiang Zhu
21
0
0
23 Oct 2024
AlphaChimp: Tracking and Behavior Recognition of Chimpanzees
Xiaoxuan Ma
Yutang Lin
Yuan Xu
Stephan P. Kaufhold
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
34
0
0
22 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Zhixiong Nan
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
38
0
0
22 Oct 2024
PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model
Zhongchen Deng
Zhechen Yang
Chi Chen
Cheng Zeng
Yan Meng
Bisheng Yang
17
1
0
21 Oct 2024
Adventures with Grace Hopper AI Super Chip and the National Research Platform
J. Alex Hurt
Grant J. Scott
Derek Weitzel
Huijun Zhu
16
0
0
21 Oct 2024
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Yiming Li
Hanchi Ren
Jingjing Deng
Xianghua Xie
26
0
0
21 Oct 2024
Previous
1
2
3
4
5
6
...
13
14
15
Next