The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown

Continual Error Correction on Low-Resource Devices

ACM SIGMM Conference on Multimedia Systems (MMSys), 2025

Kirill Paramonov

Mete Ozay

Aristeidis Mystakidis

...

266

26 Nov 2025

Large Language Models for the Summarization of Czech Documents: From History to the Present

24 Nov 2025

Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy

Italian National Conference on Sensors (INS), 2025

Vinit Mehta

Charu Sharma

Karthick Thiyagarajan

LM&Ro

375

14 Nov 2025

PriVi: Towards A General-Purpose Video Model For Primate Behavior In The Wild

...

213

12 Nov 2025

ISC-Perception: A Hybrid Computer Vision Dataset for Object Detection in Novel Steel Assembly

Miftahur Rahman

Samuel Adebayo

Dorian A. Acevedo-Mejia

05 Nov 2025

Towards 3D Objectness Learning in an Open World

140

20 Oct 2025

Prominence-Aware Artifact Detection and Dataset for Image Super-Resolution

Evgeney Nikolaevich Bogatyrev

D. Vatolin

120

19 Oct 2025

FedHybrid: Breaking the Memory Wall of Federated Learning via Hybrid Tensor Management

ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2024

209

13 Oct 2025

OTR: Synthesizing Overlay Text Dataset for Text Removal

Jan Zdenek

Wataru Shimoda

Kota Yamaguchi

03 Oct 2025

Model Merging to Maintain Language-Only Performance in Developmentally Plausible Multimodal Models

167

02 Oct 2025

Cumulative Consensus Score: Label-Free and Model-Agnostic Evaluation of Object Detectors in Deployment

152

16 Sep 2025

Beyond Instance Consistency: Investigating View Diversity in Self-supervised Learning

175

14 Sep 2025

Safe Semantics, Unsafe Interpretations: Tackling Implicit Reasoning Safety in Large Vision-Language Models

173

12 Aug 2025

DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding

12 Aug 2025

DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition

141

07 Aug 2025

From Label Error Detection to Correction: A Modular Framework and Benchmark for Object Detection Datasets

131

06 Aug 2025

MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing

199

05 Aug 2025

The Early Bird Identifies the Worm: You Can't Beat a Head Start in Long-Term Body Re-ID (ECHO-BID)

Thomas M. Metz

Matthew Q. Hill

A. O’Toole

241

23 Jul 2025

PhysLab: A Benchmark Dataset for Multi-Granularity Visual Parsing of Physics Experiments

213

07 Jun 2025

Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects

341

27 May 2025

FNBench: Benchmarking Robust Federated Learning against Noisy Labels

286

10 May 2025

SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models

Computer Vision and Pattern Recognition (CVPR), 2025

471

01 May 2025

DataS^3: Dataset Subset Selection for Specialization

...

273

22 Apr 2025

Neglected Risks: The Disturbing Reality of Children's Images in Datasets and the Urgent Call for Accountability

Conference on Fairness, Accountability and Transparency (FAccT), 2025

170

20 Apr 2025

Scaling Laws for Data-Efficient Visual Transfer Learning

198

17 Apr 2025

Object Placement for Anything

248

16 Apr 2025

PATFinger: Prompt-Adapted Transferable Fingerprinting against Unauthorized Multimodal Dataset Usage

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

282

15 Apr 2025

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

...

297

14 Apr 2025

Towards Unconstrained 2D Pose Estimation of the Human Spine

Muhammad Gul Zain Ali Khan

Stephan Krauß

Didier Stricker

3DH

205

10 Apr 2025

Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection

276

06 Apr 2025

Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery

250

03 Apr 2025

A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting

Computer Vision and Pattern Recognition (CVPR), 2025

220

02 Apr 2025

A Dataset for Semantic Segmentation in the Presence of Unknowns

Computer Vision and Pattern Recognition (CVPR), 2025

221

28 Mar 2025

The Marine Debris Forward-Looking Sonar Datasets

Matias Valdenegro-Toro

198

28 Mar 2025

Dual-Task Learning for Dead Tree Detection and Segmentation with Hybrid Self-Attention U-Nets in Aerial Imagery

International Journal of Applied Earth Observation and Geoinformation (JAEOG), 2025

262

27 Mar 2025

Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2025

368

24 Mar 2025

Universal Scene Graph Generation

Computer Vision and Pattern Recognition (CVPR), 2025

Shengqiong Wu

Hao Fei

Tat-Seng Chua

402

19 Mar 2025

Salient Temporal Encoding for Dynamic Scene Graph Generation

Zhihao Zhu

257

15 Mar 2025

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning

Computer Vision and Pattern Recognition (CVPR), 2025

503

10 Mar 2025

Revisiting Out-of-Distribution Detection in Real-time Object Detection: From Benchmark Pitfalls to a New Mitigation Paradigm

407

10 Mar 2025

Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments

Neural Information Processing Systems (NeurIPS), 2024

516

20 Feb 2025

One-Shot Federated Learning with Classifier-Free Diffusion Models

260

12 Feb 2025

Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting

Computers and Electronics in Agriculture (CEA), 2025

Siddhartha Bhattacharya

R. Lu

Zhaojian Li

351

03 Feb 2025

RORem: Training a Robust Object Remover with Human-in-the-Loop

Computer Vision and Pattern Recognition (CVPR), 2025

533

01 Jan 2025

Object Detection Approaches to Identifying Hand Images with High Forensic Values

IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024

291

21 Dec 2024

Classification Drives Geographic Bias in Street Scene Segmentation

200

15 Dec 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

...

371

12 Dec 2024

From classical techniques to convolution-based models: A review of object detection algorithms

International Conference on Image Processing, Applications and Systems (ICIPAS), 2024

172

06 Dec 2024

Towards Real-Time Open-Vocabulary Video Instance Segmentation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

294

05 Dec 2024

Composed Image Retrieval for Training-Free Domain Conversion

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Nikos Efthymiadis

Bill Psomas

Zakaria Laskar

Konstantinos Karantzalos

Yannis Avrithis

Ondřej Chum

Giorgos Tolias

349

04 Dec 2024