ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00982
  4. Cited By
The Open Images Dataset V4: Unified image classification, object
  detection, and visual relationship detection at scale

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
    ObjD
    VLM
ArXivPDFHTML

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 190 papers shown
Title
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
87
1
0
20 Feb 2025
Efficient Progressive Image Compression with Variance-aware Masking
Efficient Progressive Image Compression with Variance-aware Masking
Alberto Presta
Enzo Tartaglione
A. Fiandrotti
Marco Grangetto
Pamela Cosman
27
0
0
15 Nov 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
David Tschirschwitz
Volker Rodehorst
26
1
0
14 Sep 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
49
3
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
64
0
0
25 Jul 2024
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Joshua Shay Kricheli
Khoa Vo
Aniruddha Datta
Spencer Ozgur
Paulo Shakarian
32
2
0
21 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
34
5
0
18 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient
  Few-Shot Personalized Object Detection
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
37
2
0
01 Jul 2024
Controlling Rate, Distortion, and Realism: Towards a Single
  Comprehensive Neural Image Compression Model
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model
Shoma Iwai
Tomo Miyazaki
S. Omachi
47
11
0
27 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
41
12
0
23 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion
  Models
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
42
6
0
24 Apr 2024
Video Relationship Detection Using Mixture of Experts
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
30
0
0
06 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
62
12
0
05 Mar 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Hongsheng Li
Yu Qiao
Peng Gao
MLLM
128
107
0
08 Feb 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy
  Labels
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li
Guanbin Li
Hui Cheng
Zicheng Liao
Yizhou Yu
FedML
27
14
0
19 Dec 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and
  Commonsense Knowledge
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J. Taylor
26
6
0
21 Nov 2023
SniffyArt: The Dataset of Smelling Persons
SniffyArt: The Dataset of Smelling Persons
Mathias Zinnen
Azhar Hussian
Hang Tran
Prathmesh Madhu
Andreas K. Maier
Vincent Christlein
21
9
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
36
143
0
10 Nov 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
24
3
0
20 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
46
15
0
10 Oct 2023
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu
Jiangtong Li
Bo Zhang
Li Niu
DiffM
26
11
0
27 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary
  Instance Segmentation
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
64
35
0
22 Sep 2023
ReShader: View-Dependent Highlights for Single Image View-Synthesis
ReShader: View-Dependent Highlights for Single Image View-Synthesis
Avinash Paliwal
Brandon Nguyen
Andrii Tsarov
N. Kalantari
25
3
0
19 Sep 2023
Distributionally Robust Classification on a Data Budget
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
27
2
0
07 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction
  Learning
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
23
7
0
04 Aug 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
24
4
0
26 Jul 2023
In Defense of Clip-based Video Relation Detection
In Defense of Clip-based Video Relation Detection
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
36
4
0
18 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
21
8
0
08 Jul 2023
Joint Adaptive Representations for Image-Language Learning
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
26
0
0
31 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with
  Elasticsearch
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
14
3
0
08 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label
  Learning
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
40
16
0
04 May 2023
Controllable Image Generation via Collage Representations
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
37
7
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
25
18
0
25 Apr 2023
Building Multimodal AI Chatbots
Building Multimodal AI Chatbots
Mingyu Lee
29
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via
  Geometric and CLIP-based Consistency
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
27
18
0
13 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for
  Scene-Text VQA
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Yongxin Zhu
Z. Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
21
6
0
04 Apr 2023
Egocentric Auditory Attention Localization in Conversations
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
26
16
0
28 Mar 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
17
24
0
24 Feb 2023
Multistage Spatial Context Models for Learned Image Compression
Multistage Spatial Context Models for Learned Image Compression
Fangzheng Lin
Heming Sun
Jinming Liu
J. Katto
28
13
0
18 Feb 2023
Contour-based Interactive Segmentation
Contour-based Interactive Segmentation
Danil Galeev
Polina Popenova
Anna Vorontsova
Anton Konushin
24
5
0
13 Feb 2023
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
Brandon Birmingham
A. Muscat
22
1
0
07 Feb 2023
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text
  Retrieval
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Yizhen Chen
Jie Wang
Lijian Lin
Zhongang Qi
Jin Ma
Ying Shan
VLM
18
18
0
30 Jan 2023
Cut and Learn for Unsupervised Object Detection and Instance
  Segmentation
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
45
161
0
26 Jan 2023
Long-tail Detection with Effective Class-Margins
Long-tail Detection with Effective Class-Margins
Jang Hyun Cho
Philipp Krahenbuhl
33
17
0
23 Jan 2023
Toward Building General Foundation Models for Language, Vision, and
  Vision-Language Understanding Tasks
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
Xinsong Zhang
Yan Zeng
Jipeng Zhang
Hang Li
VLM
AI4CE
LRM
14
17
0
12 Jan 2023
Improving Human-AI Collaboration With Descriptions of AI Behavior
Improving Human-AI Collaboration With Descriptions of AI Behavior
Ángel Alexander Cabrera
Adam Perer
Jason I. Hong
22
34
0
06 Jan 2023
GIVL: Improving Geographical Inclusivity of Vision-Language Models with
  Pre-Training Methods
GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods
Da Yin
Feng Gao
Govind Thattai
Michael F. Johnston
Kai-Wei Chang
VLM
32
15
0
05 Jan 2023
Analyzing I/O Performance of a Hierarchical HPC Storage System for
  Distributed Deep Learning
Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning
Takaaki Fukai
Kento Sato
Takahiro Hirofuchi
22
2
0
04 Jan 2023
Generalizable Black-Box Adversarial Attack with Meta Learning
Generalizable Black-Box Adversarial Attack with Meta Learning
Fei Yin
Yong Zhang
Baoyuan Wu
Yan Feng
Jingyi Zhang
Yanbo Fan
Yujiu Yang
AAML
24
27
0
01 Jan 2023
Skew Class-balanced Re-weighting for Unbiased Scene Graph Generation
Skew Class-balanced Re-weighting for Unbiased Scene Graph Generation
Haeyong Kang
Chang-Dong Yoo
30
6
0
01 Jan 2023
1234
Next