Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.00982
Cited By
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 190 papers shown
Title
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
87
1
0
20 Feb 2025
Efficient Progressive Image Compression with Variance-aware Masking
Alberto Presta
Enzo Tartaglione
A. Fiandrotti
Marco Grangetto
Pamela Cosman
27
0
0
15 Nov 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
David Tschirschwitz
Volker Rodehorst
26
1
0
14 Sep 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
49
3
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
64
0
0
25 Jul 2024
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Joshua Shay Kricheli
Khoa Vo
Aniruddha Datta
Spencer Ozgur
Paulo Shakarian
32
2
0
21 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
34
5
0
18 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
37
2
0
01 Jul 2024
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model
Shoma Iwai
Tomo Miyazaki
S. Omachi
47
11
0
27 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
41
12
0
23 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
42
6
0
24 Apr 2024
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
30
0
0
06 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
62
12
0
05 Mar 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Hongsheng Li
Yu Qiao
Peng Gao
MLLM
128
107
0
08 Feb 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li
Guanbin Li
Hui Cheng
Zicheng Liao
Yizhou Yu
FedML
27
14
0
19 Dec 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J. Taylor
26
6
0
21 Nov 2023
SniffyArt: The Dataset of Smelling Persons
Mathias Zinnen
Azhar Hussian
Hang Tran
Prathmesh Madhu
Andreas K. Maier
Vincent Christlein
21
9
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
36
143
0
10 Nov 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
24
3
0
20 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Yikang Shen
Zhenfang Chen
Mingyu Ding
Chuang Gan
46
15
0
10 Oct 2023
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu
Jiangtong Li
Bo Zhang
Li Niu
DiffM
26
11
0
27 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
64
35
0
22 Sep 2023
ReShader: View-Dependent Highlights for Single Image View-Synthesis
Avinash Paliwal
Brandon Nguyen
Andrii Tsarov
N. Kalantari
25
3
0
19 Sep 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
27
2
0
07 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
23
7
0
04 Aug 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
24
4
0
26 Jul 2023
In Defense of Clip-based Video Relation Detection
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
36
4
0
18 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
21
8
0
08 Jul 2023
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
26
0
0
31 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
14
3
0
08 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
40
16
0
04 May 2023
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
37
7
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
25
18
0
25 Apr 2023
Building Multimodal AI Chatbots
Mingyu Lee
29
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
27
18
0
13 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Yongxin Zhu
Z. Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
21
6
0
04 Apr 2023
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
26
16
0
28 Mar 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
17
24
0
24 Feb 2023
Multistage Spatial Context Models for Learned Image Compression
Fangzheng Lin
Heming Sun
Jinming Liu
J. Katto
28
13
0
18 Feb 2023
Contour-based Interactive Segmentation
Danil Galeev
Polina Popenova
Anna Vorontsova
Anton Konushin
24
5
0
13 Feb 2023
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
Brandon Birmingham
A. Muscat
22
1
0
07 Feb 2023
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Yizhen Chen
Jie Wang
Lijian Lin
Zhongang Qi
Jin Ma
Ying Shan
VLM
18
18
0
30 Jan 2023
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
45
161
0
26 Jan 2023
Long-tail Detection with Effective Class-Margins
Jang Hyun Cho
Philipp Krahenbuhl
33
17
0
23 Jan 2023
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
Xinsong Zhang
Yan Zeng
Jipeng Zhang
Hang Li
VLM
AI4CE
LRM
14
17
0
12 Jan 2023
Improving Human-AI Collaboration With Descriptions of AI Behavior
Ángel Alexander Cabrera
Adam Perer
Jason I. Hong
22
34
0
06 Jan 2023
GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods
Da Yin
Feng Gao
Govind Thattai
Michael F. Johnston
Kai-Wei Chang
VLM
32
15
0
05 Jan 2023
Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning
Takaaki Fukai
Kento Sato
Takahiro Hirofuchi
22
2
0
04 Jan 2023
Generalizable Black-Box Adversarial Attack with Meta Learning
Fei Yin
Yong Zhang
Baoyuan Wu
Yan Feng
Jingyi Zhang
Yanbo Fan
Yujiu Yang
AAML
24
27
0
01 Jan 2023
Skew Class-balanced Re-weighting for Unbiased Scene Graph Generation
Haeyong Kang
Chang-Dong Yoo
30
6
0
01 Jan 2023
1
2
3
4
Next