Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
Title
Continual Error Correction on Low-Resource Devices
ACM SIGMM Conference on Multimedia Systems (MMSys), 2025
Kirill Paramonov
Mete Ozay
Aristeidis Mystakidis
Nikolaos Tsalikidis
Dimitrios Sotos
...
Sangdok Mo
Namwoong Kim
Woojong Yoo
J. Moon
Umberto Michieli
CLL
VLM
145
0
0
26 Nov 2025
Large Language Models for the Summarization of Czech Documents: From History to the Present
Václav Tran
Jakub Šmíd
Ladislav Lenc
Jean-Pierre Salmon
Pavel Král
24
0
0
24 Nov 2025
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Italian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
316
0
0
14 Nov 2025
PriVi: Towards A General-Purpose Video Model For Primate Behavior In The Wild
Felix B. Mueller
Jan F. Meier
Timo Lueddecke
Richard Vogg
Roger L. Freixanet
...
Liran Samuni
Oliver Schülke
Neda Shahidi
Erin G. Wessling
Alexander S. Ecker
115
0
0
12 Nov 2025
ISC-Perception: A Hybrid Computer Vision Dataset for Object Detection in Novel Steel Assembly
Miftahur Rahman
Samuel Adebayo
Dorian A. Acevedo-Mejia
David Hester
Daniel McPolin
Karen Rafferty
Debra F. Laefer
64
0
0
05 Nov 2025
Towards 3D Objectness Learning in an Open World
Taichi Liu
Zhenyu Wang
Ruofeng Liu
Guang Wang
Desheng Zhang
3DPC
VLM
85
0
0
20 Oct 2025
Prominence-Aware Artifact Detection and Dataset for Image Super-Resolution
Ivan Molodetskikh
Kirill Malyshev
Mark Mirgaleev
Nikita Zagainov
Evgeney Nikolaevich Bogatyrev
D. Vatolin
64
0
0
19 Oct 2025
FedHybrid: Breaking the Memory Wall of Federated Learning via Hybrid Tensor Management
ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2024
Kahou Tam
Chunlin Tian
Li Li
Haikai Zhao
Chengzhong Xu
FedML
149
6
0
13 Oct 2025
OTR: Synthesizing Overlay Text Dataset for Text Removal
Jan Zdenek
Wataru Shimoda
Kota Yamaguchi
48
0
0
03 Oct 2025
Model Merging to Maintain Language-Only Performance in Developmentally Plausible Multimodal Models
Ece Takmaz
Lisa Bylinina
Jakub Dotlacil
MoMe
144
0
0
02 Oct 2025
Cumulative Consensus Score: Label-Free and Model-Agnostic Evaluation of Object Detectors in Deployment
Avinaash Manoharan
Xiangyu Yin
Domenik Helm
Chih-Hong Cheng
80
0
0
16 Sep 2025
Beyond Instance Consistency: Investigating View Diversity in Self-supervised Learning
Huaiyuan Qin
M. Yang
Siyuan Hu
Peng Hu
Yu Zhang
Chen Gong
Hongyuan Zhu
113
0
0
14 Sep 2025
Safe Semantics, Unsafe Interpretations: Tackling Implicit Reasoning Safety in Large Vision-Language Models
Wei Cai
Jian Zhao
Yuchu Jiang
T. Zhang
Xuelong Li
LRM
108
2
0
12 Aug 2025
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
Wenwen Yu
Zhibo Yang
Yuliang Liu
Xiang Bai
MLLM
OffRL
LRM
64
3
0
12 Aug 2025
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
Haijing Liu
Tao Pu
Hefeng Wu
Keze Wang
Guanbin Li
ObjD
VLM
98
0
0
07 Aug 2025
From Label Error Detection to Correction: A Modular Framework and Benchmark for Object Detection Datasets
Sarina Penquitt
Jonathan Klees
Rinor Cakaj
Daniel Kondermann
Matthias Rottmann
Lars Schmarje
100
1
0
06 Aug 2025
MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing
Jinghan Yu
Junhao Xiao
Zhiyuan Ma
Yue Ma
Kaiqi Liu
Yuhan Wang
Daizong Liu
Xianghao Meng
Jianjun Li
DiffM
128
0
0
05 Aug 2025
The Early Bird Identifies the Worm: You Can't Beat a Head Start in Long-Term Body Re-ID (ECHO-BID)
Thomas M. Metz
Matthew Q. Hill
A. O’toole
126
1
0
23 Jul 2025
PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments
Minghao Zou
Qingtian Zeng
Yongping Miao
Shangkun Liu
Zilong Wang
Hantao Liu
Wei Zhou
169
1
0
07 Jun 2025
Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
Wei Li
Hebei Li
Yansong Peng
Siying Wu
Yueyi Zhang
Xiaoyan Sun
DiffM
272
1
0
27 May 2025
FNBench: Benchmarking Robust Federated Learning against Noisy Labels
Xuefeng Jiang
Jia Li
Nannan Wu
Z. F. Wu
Xujing Li
Sheng Sun
Gang Xu
Longji Xu
Qi Li
Min Liu
FedML
233
6
0
10 May 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2025
Wufei Ma
Luoxin Ye
Nessa McWeeney
Celso M de Melo
Jieneng Chen
LRM
397
20
0
01 May 2025
DataS^3: Dataset Subset Selection for Specialization
Neha Hulkund
Alaa Maalouf
Levi Cai
Daniel Yang
Tsun-Hsuan Wang
...
Ken Goldberg
Hannah Kerner
Irene Chen
Yogesh A. Girdhar
Sara Beery
217
2
0
22 Apr 2025
Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability
Conference on Fairness, Accountability and Transparency (FAccT), 2025
C. Caetano
G. O. D. Santos
Caio Petrucci
Artur Barros
Camila Laranjeira
Leo S. F. Ribeiro
Júlia F. de Mendonça
J. A. dos Santos
Sandra Avila
148
1
0
20 Apr 2025
Scaling Laws for Data-Efficient Visual Transfer Learning
Wenxuan Yang
Qingqu Wei
Wenxuan Yang
Weimin Tan
Bo Yan
142
1
0
17 Apr 2025
Object Placement for Anything
Bingjie Gao
Bo Zhang
Li Niu
OCL
187
0
0
16 Apr 2025
PATFinger: Prompt-Adapted Transferable Fingerprinting against Unauthorized Multimodal Dataset Usage
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Weinan Zhang
Ju Jia
Yang Liu
Yihao Huang
Xuzhao Li
Cong Wu
Lina Wang
AAML
228
1
0
15 Apr 2025
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results
Yuqian Fu
Xingyu Qiu
Bin Ren
Yanwei Fu
Radu Timofte
...
Dianmo Sheng
Xuanpu Zhao
Zhiyu Li
X. Ding
Wenqian Li
208
30
0
14 Apr 2025
Towards Unconstrained 2D Pose Estimation of the Human Spine
Muhammad Gul Zain Ali Khan
Stephan Krauß
Didier Stricker
3DH
174
1
0
10 Apr 2025
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
Jiancheng Pan
Yanxing Liu
Xiao He
Long Peng
Jiahao Li
Yuze Sun
Xiaomeng Huang
205
12
0
06 Apr 2025
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery
Mykola Lavreniuk
Nataliia Kussul
Andrii Shelestov
Bohdan Yailymov
Yevhenii Salii
Volodymyr Kuzin
Zoltan Szantoi
202
2
0
03 Apr 2025
A
T
^\text{T}
T
A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Computer Vision and Pattern Recognition (CVPR), 2025
Yizhe Tang
Zhimin Sun
Yuzhen Du
Ran Yi
Guangben Lu
T. Hu
Luying Li
Lizhuang Ma
Fangyuan Zou
DiffM
160
3
0
02 Apr 2025
A Dataset for Semantic Segmentation in the Presence of Unknowns
Computer Vision and Pattern Recognition (CVPR), 2025
Zakaria Laskar
Tomás Vojír
Matej Grcic
Iaroslav Melekhov
Shankar Gangisettye
Arno Solin
Jirí Matas
Giorgos Tolias
C.V. Jawahar
UQCV
171
0
0
28 Mar 2025
The Marine Debris Forward-Looking Sonar Datasets
Matias Valdenegro-Toro
Deepan Padmanabhan
Deepak Singh
Bilal Wehbe
Yvan Petillot
150
1
0
28 Mar 2025
Dual-Task Learning for Dead Tree Detection and Segmentation with Hybrid Self-Attention U-Nets in Aerial Imagery
International Journal of Applied Earth Observation and Geoinformation (JAEOG), 2025
Anis Ur Rahman
Einari Heinaro
Mete Ahishali
Samuli Junttila
196
1
0
27 Mar 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
215
9
0
24 Mar 2025
Universal Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Shengqiong Wu
Hao Fei
Tat-Seng Chua
327
2
0
19 Mar 2025
Salient Temporal Encoding for Dynamic Scene Graph Generation
Zhihao Zhu
192
0
0
15 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
412
3
0
10 Mar 2025
Revisiting Out-of-Distribution Detection in Real-time Object Detection: From Benchmark Pitfalls to a New Mitigation Paradigm
Weicheng He
Changshun Wu
Chih-Hong Cheng
Xiaowei Huang
Saddek Bensalem
OODD
311
0
0
10 Mar 2025
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Neural Information Processing Systems (NeurIPS), 2024
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
429
2
0
20 Feb 2025
One-Shot Federated Learning with Classifier-Free Diffusion Models
Obaidullah Zaland
Shutong Jin
Florian T. Pokorny
Monowar Bhuyan
174
6
0
12 Feb 2025
Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting
Computers and Electronics in Agriculture (CEA), 2025
Keyi Zhu
Jiajia Li
Kaixiang Zhang
Chaaran Arunachalam
Siddhartha Bhattacharya
R. Lu
Zhaojian Li
299
3
0
03 Feb 2025
RORem: Training a Robust Object Remover with Human-in-the-Loop
Computer Vision and Pattern Recognition (CVPR), 2025
Ruibin Li
Tao Yang
Song Guo
Guang Dai
373
10
0
01 Jan 2025
Object Detection Approaches to Identifying Hand Images with High Forensic Values
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
Thanh Thi Nguyen
Campbell Wilson
Imad Khan
Janis Dalins
3DH
227
0
0
21 Dec 2024
Classification Drives Geographic Bias in Street Scene Segmentation
Rahul Nair
Gabriel Tseng
Esther Rolf
Bhanu Tokas
Hannah Kerner
153
0
0
15 Dec 2024
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Pan Zhang
Xiaoyi Dong
Yuhang Cao
Yuhang Zang
Rui Qian
...
Xinsong Zhang
Kai Chen
Yu Qiao
Dahua Lin
Jiaqi Wang
KELM
338
31
0
12 Dec 2024
From classical techniques to convolution-based models: A review of object detection algorithms
International Conference on Image Processing, Applications and Systems (ICIPAS), 2024
Fnu Neha
Deepshikha Bhati
Deepak Kumar Shukla
Md. Amiruzzaman
ObjD
VLM
148
9
0
06 Dec 2024
Towards Real-Time Open-Vocabulary Video Instance Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Bin Yan
Martin Sundermeyer
D. Tan
Huchuan Lu
F. Tombari
VLM
VOS
256
3
0
05 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
280
3
0
04 Dec 2024
1
2
3
4
...
11
12
13
Next