v1v2 (latest)

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown

Paint by Example: Exemplar-based Image Editing with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022

282

546

23 Nov 2022

Plug and Play Active Learning for Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022

239

21 Nov 2022

ClipCrop: Conditioned Cropping Driven by Vision-Language Model

Mingxi Cheng

Ji Li

Yoichi Sato

135

21 Nov 2022

Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query LocalizationComputer Vision and Pattern Recognition (CVPR), 2022

Juan-Manuel Perez-Rua

225

18 Nov 2022

Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision DecodingComputer Vision and Pattern Recognition (CVPR), 2022

336

200

13 Nov 2022

SSGVS: Semantic Scene Graph-to-Video Synthesis

Yuren Cong

Jinhui Yi

Bodo Rosenhahn

M. Yang

242

11 Nov 2022

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2022

...

Yu Qiao

553

958

10 Nov 2022

High-Quality Entity SegmentationIEEE International Conference on Computer Vision (ICCV), 2022

Jiuxiang Gu

Ming-Hsuan Yang

295

10 Nov 2022

SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object DetectionComputer Vision and Image Understanding (CVIU), 2022

289

107

04 Nov 2022

DEArt: Dataset of European Art

191

02 Nov 2022

Universal Deep Image Compression via Content-Adaptive Optimization with AdaptersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Koki Tsubota

Hiroaki Akutsu

Kiyoharu Aizawa

158

02 Nov 2022

Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentationEuropean Conference on Computer Vision (ECCV), 2022

323

31 Oct 2022

Two-Level Temporal Relation Model for Online Video Instance SegmentationSocial Science Research Network (SSRN), 2022

213

30 Oct 2022

A Survey on Causal Representation Learning and Future Work for Medical Image Analysis

Chang-Tien Lu

OOD BDL CML MedIm

255

28 Oct 2022

Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

144

21 Oct 2022

Similarity of Neural Architectures using Adversarial Attack TransferabilityEuropean Conference on Computer Vision (ECCV), 2022

538

20 Oct 2022

VTC: Improving Video-Text Retrieval with User CommentsEuropean Conference on Computer Vision (ECCV), 2022

Christian Rupprecht

231

19 Oct 2022

Learning to Discover and Detect ObjectsNeural Information Processing Systems (NeurIPS), 2022

Laura Leal-Taixé

263

19 Oct 2022

A Tri-Layer Plugin to Improve Occluded DetectionBritish Machine Vision Conference (BMVC), 2022

Guanqi Zhan

Weidi Xie

Andrew Zisserman

211

18 Oct 2022

Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel LogisticsInternational Conference on Machine Learning and Applications (ICMLA), 2022

179

18 Oct 2022

1st Place Solutions for the UVO Challenge 2022

Zonghai Hu

191

18 Oct 2022

Non-Contrastive Learning Meets Language-Image Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2022

210

17 Oct 2022

DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion ModelsInternational Conference on Computer Science and Artificial Intelligence (ICCSAI), 2022

143

16 Oct 2022

Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers

Tao Tang

Changlin Li

Guangrun Wang

Kaicheng Yu

Xiaojun Chang

Xiaodan Liang

ViT

212

16 Oct 2022

Active Learning from the WebThe Web Conference (WWW), 2022

Ryoma Sato

136

15 Oct 2022

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-trainingConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

305

14 Oct 2022

Caption supervision enables robust learners

205

13 Oct 2022

Exploring Long-Sequence Masked Autoencoders

181

13 Oct 2022

A survey of Identification and mitigation of Machine Learning algorithmic biases in Image Analysis

217

10 Oct 2022

A Review of Uncertainty Calibration in Pretrained Object Detectors

140

06 Oct 2022

A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific PapersInternational ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2022

S. Chintalapati

Jonathan Bragg

Lucy Lu Wang

139

27 Sep 2022

A Snapshot of the Frontiers of Client Selection in Federated Learning

302

27 Sep 2022

Paraphrasing Is All You Need for Novel Object CaptioningNeural Information Processing Systems (NeurIPS), 2022

Louis-Philippe Morency

Yu-Chiang Frank Wang

184

25 Sep 2022

BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

267

25 Sep 2022

Toward 3D Spatial Reasoning for Human-like Text-based Visual Question AnsweringIEEE Transactions on Image Processing (IEEE TIP), 2022

Hao Li

Qi Wu

376

21 Sep 2022

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world DetectionNeural Information Processing Systems (NeurIPS), 2022

Lewei Yao

Jianhua Han

Youpeng Wen

Xiaodan Liang

Dan Xu

Wei Zhang

Zhenguo Li

Chunjing Xu

Hang Xu

CLIP VLM

334

218

20 Sep 2022

Enhance the Visual Representation via Discrete Adversarial TrainingNeural Information Processing Systems (NeurIPS), 2022

232

16 Sep 2022

VIPHY: Probing "Visible" Physical Commonsense KnowledgeConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Shikhar Singh

Ehsan Qasemi

Muhao Chen

254

15 Sep 2022

PaLI: A Jointly-Scaled Multilingual Language-Image ModelInternational Conference on Learning Representations (ICLR), 2022

...

709

905

14 Sep 2022

Out-of-Vocabulary Challenge Report

163

14 Sep 2022

Pre-training image-language transformers for open-vocabulary tasks

176

09 Sep 2022

im2nerf: Image to Neural Radiance Field in the Wild

410

08 Sep 2022

Measuring the Interpretability of Unsupervised Representations via Quantized Reverse ProbingInternational Conference on Learning Representations (ICLR), 2022

Iro Laina

Yuki M. Asano

Andrea Vedaldi

SSL

165

07 Sep 2022

Scalable Regularization of Scene Graph Generation Models using Symbolic Theories

Davide Buffelli

Efthymia Tsamoura

202

06 Sep 2022

Design of the topology for contrastive visual-textual alignment

Zhun Sun

376

05 Sep 2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction DetectionNeural Information Processing Systems (NeurIPS), 2022

350

05 Sep 2022

Frido: Feature Pyramid Diffusion for Complex Scene Image SynthesisAAAI Conference on Artificial Intelligence (AAAI), 2022

Lu Yuan

232

114

29 Aug 2022

Labeling of Cultural Heritage Collections on the Intersection of Visual Analytics and Digital Humanities

C. Meinecke

121

29 Aug 2022

Towards Federated Learning against Noisy Labels via Local Self-RegularizationInternational Conference on Information and Knowledge Management (CIKM), 2022

198

25 Aug 2022

Is Medieval Distant Viewing Possible? : Extending and Enriching Annotation of Legacy Image Collections using Visual AnalyticsDigital Scholarship in the Humanities (DSH), 2022

210

20 Aug 2022