ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00982
  4. Cited By
The Open Images Dataset V4: Unified image classification, object
  detection, and visual relationship detection at scale
v1v2 (latest)

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
    ObjDVLM
ArXiv (abs)PDFHTML

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown
Paint by Example: Exemplar-based Image Editing with Diffusion Models
Paint by Example: Exemplar-based Image Editing with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Binxin Yang
Shuyang Gu
Bo Zhang
Ting Zhang
Xuejin Chen
Xiaoyan Sun
Dong Chen
Fang Wen
DiffM
282
546
0
23 Nov 2022
Plug and Play Active Learning for Object Detection
Plug and Play Active Learning for Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Chenhongyi Yang
Lichao Huang
Elliot J. Crowley
ObjD
239
30
0
21 Nov 2022
ClipCrop: Conditioned Cropping Driven by Vision-Language Model
ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Zhihang Zhong
Mingxi Cheng
Zhirong Wu
Yuhui Yuan
Yinqiang Zheng
Ji Li
Han Hu
Stephen Lin
Yoichi Sato
Imari Sato
VLMCLIP
135
8
0
21 Nov 2022
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual
  Query Localization
Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query LocalizationComputer Vision and Pattern Recognition (CVPR), 2022
Mengmeng Xu
Yanghao Li
Cheng-Yang Fu
Guohao Li
Tao Xiang
Juan-Manuel Perez-Rua
225
19
0
18 Nov 2022
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked
  Modeling for Vision Decoding
Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision DecodingComputer Vision and Pattern Recognition (CVPR), 2022
Zijiao Chen
Jiaxin Qing
Tiange Xiang
Wan Lin Yue
J. Zhou
DiffMMedIm
336
200
0
13 Nov 2022
SSGVS: Semantic Scene Graph-to-Video Synthesis
SSGVS: Semantic Scene Graph-to-Video Synthesis
Yuren Cong
Jinhui Yi
Bodo Rosenhahn
M. Yang
242
8
0
11 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with
  Deformable Convolutions
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Jiaming Song
Xiaogang Wang
Yu Qiao
VLM
553
958
0
10 Nov 2022
High-Quality Entity Segmentation
High-Quality Entity SegmentationIEEE International Conference on Computer Vision (ICCV), 2022
Lu Qi
Jason Kuen
Weidong Guo
Tiancheng Shen
Jiuxiang Gu
Jiaya Jia
Zhe Lin
Ming-Hsuan Yang
ISeg
295
77
0
10 Nov 2022
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object
  Detection
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object DetectionComputer Vision and Image Understanding (CVIU), 2022
Huayi Zhou
Fei Jiang
Hongtao Lu
ObjD
289
107
0
04 Nov 2022
DEArt: Dataset of European Art
DEArt: Dataset of European Art
Artem Reshetnikov
M. Marinescu
J. M. López
VLM3DH
191
13
0
02 Nov 2022
Universal Deep Image Compression via Content-Adaptive Optimization with
  Adapters
Universal Deep Image Compression via Content-Adaptive Optimization with AdaptersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Koki Tsubota
Hiroaki Akutsu
Kiyoharu Aizawa
158
22
0
02 Nov 2022
Max Pooling with Vision Transformers reconciles class and shape in
  weakly supervised semantic segmentation
Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentationEuropean Conference on Computer Vision (ECCV), 2022
Simone Rossetti
Damiano Zappia
Marta Sanzari
M. Schaerf
F. Pirri
ViT
323
80
0
31 Oct 2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation
Two-Level Temporal Relation Model for Online Video Instance SegmentationSocial Science Research Network (SSRN), 2022
Ç. S. Çoban
Oguzhan Keskin
Jordi Pont-Tuset
Fatma Guney
VOS
213
0
0
30 Oct 2022
A Survey on Causal Representation Learning and Future Work for Medical
  Image Analysis
A Survey on Causal Representation Learning and Future Work for Medical Image Analysis
Chang-Tien Lu
OODBDLCMLMedIm
255
0
0
28 Oct 2022
Do Vision-and-Language Transformers Learn Grounded Predicate-Noun
  Dependencies?
Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Mitja Nikolaus
Emmanuelle Salin
Stéphane Ayache
Abdellah Fourtassi
Benoit Favre
144
17
0
21 Oct 2022
Similarity of Neural Architectures using Adversarial Attack
  Transferability
Similarity of Neural Architectures using Adversarial Attack TransferabilityEuropean Conference on Computer Vision (ECCV), 2022
Ian Ryu
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
538
3
0
20 Oct 2022
VTC: Improving Video-Text Retrieval with User Comments
VTC: Improving Video-Text Retrieval with User CommentsEuropean Conference on Computer Vision (ECCV), 2022
Laura Hanu
James Thewlis
Yuki M. Asano
Christian Rupprecht
VGen
231
8
0
19 Oct 2022
Learning to Discover and Detect Objects
Learning to Discover and Detect ObjectsNeural Information Processing Systems (NeurIPS), 2022
V. Fomenko
Ismail Elezi
Deva Ramanan
Laura Leal-Taixé
Aljosa Osep
ObjD
263
13
0
19 Oct 2022
A Tri-Layer Plugin to Improve Occluded Detection
A Tri-Layer Plugin to Improve Occluded DetectionBritish Machine Vision Conference (BMVC), 2022
Guanqi Zhan
Weidi Xie
Andrew Zisserman
211
26
0
18 Oct 2022
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to
  Parcel Logistics
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel LogisticsInternational Conference on Machine Learning and Applications (ICMLA), 2022
Alexander Naumann
Felix Hertlein
Benchun Zhou
Laura Dörr
K. Furmans
179
6
0
18 Oct 2022
1st Place Solutions for the UVO Challenge 2022
1st Place Solutions for the UVO Challenge 2022
Jiajun Zhang
Boyu Chen
Zhilong Ji
Jinfeng Bai
Zonghai Hu
191
1
0
18 Oct 2022
Non-Contrastive Learning Meets Language-Image Pre-Training
Non-Contrastive Learning Meets Language-Image Pre-TrainingComputer Vision and Pattern Recognition (CVPR), 2022
Jinghao Zhou
Li Dong
Zhe Gan
Lijuan Wang
Furu Wei
VLMCLIP
210
33
0
17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using
  Image-to-Image Diffusion Models
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion ModelsInternational Conference on Computer Science and Artificial Intelligence (ICCSAI), 2022
Yueqin Yin
Lianghua Huang
Yu Liu
Kaiqiang Huang
DiffM
143
13
0
16 Oct 2022
Learning Self-Regularized Adversarial Views for Self-Supervised Vision
  Transformers
Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers
Tao Tang
Changlin Li
Guangrun Wang
Kaicheng Yu
Xiaojun Chang
Xiaodan Liang
ViT
212
1
0
16 Oct 2022
Active Learning from the Web
Active Learning from the WebThe Web Conference (WWW), 2022
Ryoma Sato
136
0
0
15 Oct 2022
Plausible May Not Be Faithful: Probing Object Hallucination in
  Vision-Language Pre-training
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-trainingConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Wenliang Dai
Zihan Liu
Ziwei Ji
Jane Polak Scowcroft
Pascale Fung
MLLMVLM
305
75
0
14 Oct 2022
Caption supervision enables robust learners
Caption supervision enables robust learners
Ben Feuer
Ameya Joshi
Chinmay Hegde
SSLCLIPVLM
205
3
0
13 Oct 2022
Exploring Long-Sequence Masked Autoencoders
Exploring Long-Sequence Masked Autoencoders
Ronghang Hu
Shoubhik Debnath
Saining Xie
Xinlei Chen
181
23
0
13 Oct 2022
A survey of Identification and mitigation of Machine Learning
  algorithmic biases in Image Analysis
A survey of Identification and mitigation of Machine Learning algorithmic biases in Image Analysis
Laurent Risser
Agustin Picard
Lucas Hervier
Jean-Michel Loubes
FaML
217
7
0
10 Oct 2022
A Review of Uncertainty Calibration in Pretrained Object Detectors
A Review of Uncertainty Calibration in Pretrained Object Detectors
Denis Huseljic
M. Herde
Mehmet Muejde
Bernhard Sick
UQCV
140
0
0
06 Oct 2022
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards
  Producing More Descriptive Alt Texts of Data Visualizations in Scientific
  Papers
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific PapersInternational ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2022
S. Chintalapati
Jonathan Bragg
Lucy Lu Wang
139
32
0
27 Sep 2022
A Snapshot of the Frontiers of Client Selection in Federated Learning
A Snapshot of the Frontiers of Client Selection in Federated Learning
Gergely Németh
M. Lozano
Novi Quadrianto
Nuria Oliver
FedML
302
17
0
27 Sep 2022
Paraphrasing Is All You Need for Novel Object Captioning
Paraphrasing Is All You Need for Novel Object CaptioningNeural Information Processing Systems (NeurIPS), 2022
Cheng Yang
Yifan Hao
Wanshu Fan
Ruslan Salakhutdinov
Louis-Philippe Morency
Yu-Chiang Frank Wang
184
6
0
25 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and
  Tracking in Video
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOSVLM
267
74
0
25 Sep 2022
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question
  Answering
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question AnsweringIEEE Transactions on Image Processing (IEEE TIP), 2022
Hao Li
Jinfa Huang
Peng Jin
Guoli Song
Qi Wu
Jie Chen
376
27
0
21 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for
  Open-world Detection
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world DetectionNeural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIPVLM
334
218
0
20 Sep 2022
Enhance the Visual Representation via Discrete Adversarial Training
Enhance the Visual Representation via Discrete Adversarial TrainingNeural Information Processing Systems (NeurIPS), 2022
Xiaofeng Mao
YueFeng Chen
Ranjie Duan
Yao Zhu
Gege Qi
Shaokai Ye
Xiaodan Li
Rong Zhang
Hui Xue
232
43
0
16 Sep 2022
VIPHY: Probing "Visible" Physical Commonsense Knowledge
VIPHY: Probing "Visible" Physical Commonsense KnowledgeConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shikhar Singh
Ehsan Qasemi
Muhao Chen
254
6
0
15 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
PaLI: A Jointly-Scaled Multilingual Language-Image ModelInternational Conference on Learning Representations (ICLR), 2022
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLMVLM
709
905
0
14 Sep 2022
Out-of-Vocabulary Challenge Report
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
163
22
0
14 Sep 2022
Pre-training image-language transformers for open-vocabulary tasks
Pre-training image-language transformers for open-vocabulary tasks
A. Piergiovanni
Weicheng Kuo
A. Angelova
VLMViT
176
12
0
09 Sep 2022
im2nerf: Image to Neural Radiance Field in the Wild
im2nerf: Image to Neural Radiance Field in the Wild
Lu Mi
Abhijit Kundu
David A. Ross
F. Dellaert
Noah Snavely
Alireza Fathi
3DV
410
14
0
08 Sep 2022
Measuring the Interpretability of Unsupervised Representations via
  Quantized Reverse Probing
Measuring the Interpretability of Unsupervised Representations via Quantized Reverse ProbingInternational Conference on Learning Representations (ICLR), 2022
Iro Laina
Yuki M. Asano
Andrea Vedaldi
SSL
165
9
0
07 Sep 2022
Scalable Regularization of Scene Graph Generation Models using Symbolic
  Theories
Scalable Regularization of Scene Graph Generation Models using Symbolic Theories
Davide Buffelli
Efthymia Tsamoura
202
2
0
06 Sep 2022
Design of the topology for contrastive visual-textual alignment
Design of the topology for contrastive visual-textual alignment
Zhun Sun
376
2
0
05 Sep 2022
RLIP: Relational Language-Image Pre-training for Human-Object
  Interaction Detection
RLIP: Relational Language-Image Pre-training for Human-Object Interaction DetectionNeural Information Processing Systems (NeurIPS), 2022
Hangjie Yuan
Jianwen Jiang
Samuel Albanie
Tao Feng
Ziyuan Huang
Dong Ni
Mingqian Tang
VLM
350
76
0
05 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image SynthesisAAAI Conference on Artificial Intelligence (AAAI), 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
232
114
0
29 Aug 2022
Labeling of Cultural Heritage Collections on the Intersection of Visual
  Analytics and Digital Humanities
Labeling of Cultural Heritage Collections on the Intersection of Visual Analytics and Digital Humanities
C. Meinecke
121
3
0
29 Aug 2022
Towards Federated Learning against Noisy Labels via Local
  Self-Regularization
Towards Federated Learning against Noisy Labels via Local Self-RegularizationInternational Conference on Information and Knowledge Management (CIKM), 2022
Xue Jiang
Sheng Sun
Yuwei Wang
Min Liu
198
47
0
25 Aug 2022
Is Medieval Distant Viewing Possible? : Extending and Enriching
  Annotation of Legacy Image Collections using Visual Analytics
Is Medieval Distant Viewing Possible? : Extending and Enriching Annotation of Legacy Image Collections using Visual AnalyticsDigital Scholarship in the Humanities (DSH), 2022
C. Meinecke
Estelle Guéville
D. Wrisley
Stefan Jänicke
210
4
0
20 Aug 2022
Previous
123...567...111213
Next