Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 623 papers shown
Title
MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
ACM Transactions on Graphics (TOG), 2024
Xiaoguang Han
Yushuang Wu
Luyue Shi
Haolin Liu
Hongjie Liao
Lingteng Qiu
Weihao Yuan
Xiaodong Gu
Zilong Dong
Shuguang Cui
3DV
3DPC
138
6
0
02 Dec 2024
Efficient Progressive Image Compression with Variance-aware Masking
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Alberto Presta
Enzo Tartaglione
Attilio Fiandrotti
Marco Grangetto
Pamela Cosman
500
1
0
15 Nov 2024
JPEG AI Image Compression Visual Artifacts: Detection Methods and Dataset
Daria Tsereh
Mark Mirgaleev
Ivan Molodetskikh
Roman Kazantsev
D. Vatolin
127
3
0
11 Nov 2024
Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
ACM Multimedia (MM), 2024
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Yiming Wu
Wei Ji
Haoran Liang
Ronghua Liang
122
2
0
03 Nov 2024
Interactive4D: Interactive 4D LiDAR Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Ilya Fradlin
Idil Esen Zulfikar
Kadir Yilmaz
Theodora Kontogianni
Bastian Leibe
257
4
0
10 Oct 2024
Multimodal Markup Document Models for Graphic Design Completion
Kotaro Kikuchi
Naoto Inoue
Mayu Otani
E. Simo-Serra
Kota Yamaguchi
Kota Yamaguchi
VLM
384
9
0
27 Sep 2024
Enhanced Wavelet Scattering Network for image inpainting detection
De Computis (DC), 2024
Barglazan Adrian-Alin
Brad Remus
150
1
0
25 Sep 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
David Tschirschwitz
Volker Rodehorst
256
2
0
14 Sep 2024
Anno-incomplete Multi-dataset Detection
Yiran Xu
Haoxiang Zhong
Kai Wu
Jialin Li
Yong Liu
Chengjie Wang
Shu-Tao Xia
Hongen Liao
ObjD
132
0
0
29 Aug 2024
Tackling Noisy Clients in Federated Learning with End-to-end Label Correction
International Conference on Information and Knowledge Management (CIKM), 2024
Xuefeng Jiang
Sheng Sun
Jia Li
Jingjing Xue
Runhan Li
Zhiyuan Wu
Gang Xu
Yuwei Wang
Min Liu
FedML
293
19
0
08 Aug 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
European Conference on Computer Vision (ECCV), 2024
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
273
20
0
05 Aug 2024
Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization
Changtao Miao
Qi Chu
Tao Gong
Zhentao Tan
Zhenchao Jin
Wanyi Zhuang
Man Luo
Honggang Hu
Nenghai Yu
CVBM
190
8
0
05 Aug 2024
FedDEO: Description-Enhanced One-Shot Federated Learning with Diffusion Models
ACM Multimedia (MM), 2024
Mingzhao Yang
Shangchao Su
Bin Li
Xiangyang Xue
176
12
0
29 Jul 2024
LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification
Shuhan Cui
H. Nguyen
Trung-Nghia Le
Chun-Shien Lu
Isao Echizen
146
0
0
26 Jul 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
409
4
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
393
3
0
25 Jul 2024
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Joshua Shay Kricheli
Khoa Vo
Aniruddha Datta
Spencer Ozgur
Paulo Shakarian
295
7
0
21 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
213
17
0
18 Jul 2024
For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives
Lia Morra
A. Santangelo
Pietro Basci
Luca Piano
Fabio Garcea
Fabrizio Lamberti
Massimo Leone
162
2
0
03 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
205
4
0
01 Jul 2024
Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
Gregor Geigle
Radu Timofte
Goran Glavaš
219
2
0
20 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
IEEE Access (IEEE Access), 2024
Akchay Srivastava
Atif Memon
ELM
159
2
0
19 Jun 2024
Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation
Nikolas Koutsoubis
Yasin Yilmaz
Ravi P. Ramachandran
M. Schabath
Ghulam Rasool
195
14
0
18 Jun 2024
Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines
Honglei Zhang
Jukka I. Ahonen
Nam Le
Ruiying Yang
Francesco Cricri
124
2
0
18 Jun 2024
Comparison Visual Instruction Tuning
Wei Lin
M. Jehanzeb Mirza
Sivan Doveh
Rogerio Feris
Raja Giryes
Sepp Hochreiter
Leonid Karlinsky
226
5
0
13 Jun 2024
EUFCC-340K: A Faceted Hierarchical Dataset for Metadata Annotation in GLAM Collections
Francesc Net
Marc Folia
Pep Casals
Andrew D. Bagdanov
Lluís Gómez
180
3
0
04 Jun 2024
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Junho Kim
Hyunjun Kim
Yeonju Kim
Yong Man Ro
MLLM
164
29
0
04 Jun 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
262
7
0
31 May 2024
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model
Shoma Iwai
Tomo Miyazaki
S. Omachi
243
21
0
27 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Katherine Xu
Lingzhi Zhang
Jianbo Shi
350
27
0
23 May 2024
Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image
Jianshun Zeng
Wang Li
Yanjie Lv
Shuai Gao
Yuchu Qin
185
0
0
17 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
233
8
0
16 May 2024
MANTIS: Interleaved Multi-Image Instruction Tuning
Dongfu Jiang
Xuan He
Huaye Zeng
Cong Wei
Max Ku
Qian Liu
Wenhu Chen
VLM
MLLM
314
177
0
02 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
209
16
0
24 Apr 2024
Domain Adaptation for Learned Image Compression with Supervised Adapters
Alberto Presta
Gabriele Spadaro
Enzo Tartaglione
Attilio Fiandrotti
Marco Grangetto
92
6
0
24 Apr 2024
Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery
Yona Falinie A. Gaus
Neelanjan Bhowmik
Brian K. S. Isaac-Medina
T. Breckon
VLM
160
5
0
18 Apr 2024
`Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning
Joshua Forster Feinglass
Jayaraman J. Thiagarajan
Rushil Anirudh
T. S. Jayram
Yezhou Yang
VLM
124
3
0
12 Apr 2024
RASSAR: Room Accessibility and Safety Scanning in Augmented Reality
Xia Su
Han Zhang
Kaiming Cheng
Jaewook Lee
Qiaochu Liu
Wyatt Olson
Jon E. Froehlich
81
18
0
11 Apr 2024
3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules
Maxence Bideaux
Alice Phe
Mohamed Chaouch
B. Luvison
Q. C. Pham
ISeg
3DV
172
2
0
08 Apr 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
245
14
0
28 Mar 2024
BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection
Changshun Wu
Weicheng He
Chih-Hong Cheng
Xiaowei Huang
Saddek Bensalem
175
5
0
27 Mar 2024
Benchmarking Video Frame Interpolation
Simon Kiefhaber
Simon Niklaus
Feng Liu
Simone Schaub-Meyer
140
3
0
25 Mar 2024
Shadow Generation for Composite Image Using Diffusion model
Computer Vision and Pattern Recognition (CVPR), 2024
Qingyang Liu
Junqi You
Jianting Wang
Xinhao Tao
Bo Zhang
Li Niu
DiffM
222
18
0
22 Mar 2024
Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Jielin Qiu
William Jongwon Han
Winfred Wang
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Christos Faloutsos
Lei Li
Lijuan Wang
VLM
225
3
0
19 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained Detectors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
130
0
0
14 Mar 2024
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
153
1
0
06 Mar 2024
Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices
Younghan Lee
Sohee Jun
Yungi Cho
Woorim Han
Hyungon Moon
Y. Paek
AAML
71
3
0
05 Mar 2024
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Imad Eddine Toubal
Aditya Avinash
N. Alldrin
Jan Dlabal
Wenlei Zhou
...
Chun-Ta Lu
Howard Zhou
Ranjay Krishna
Ariel Fuxman
Tom Duerig
VLM
300
19
0
05 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
330
15
0
05 Mar 2024
A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications
Wei Guo
Fuzhen Zhuang
Qi. Wang
Yiqi Tong
Jin Dong
FedML
233
39
0
03 Mar 2024
Previous
1
2
3
4
5
...
11
12
13
Next