v1v2 (latest)

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 623 papers shown

Orientation Aware Weapons Detection In Visual Data : A Benchmark Dataset

Nazeef Ul Haq

M. Fraz

Tufail Sajjad Shah Hashmi

Muhammad Shahzad

205

04 Dec 2021

$Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks$

Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks

Tomoyoshi Shimobaba

02 Dec 2021

Object-Aware Cropping for Self-Supervised Learning

Shlok Kumar Mishra

Anshul B. Shah

Ankan Bansal

Abhyuday N. Jagannatha

411

01 Dec 2021

Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets

Marcella Cornia

Lorenzo Baraldi

G. Fiameni

Rita Cucchiara

320

24 Nov 2021

NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion

Fan Yang

298

343

24 Nov 2021

UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling

Zicheng Liu

348

134

23 Nov 2021

Class-agnostic Object Detection with Multi-modal TransformerEuropean Conference on Computer Vision (ECCV), 2021

Salman Khan

Rao Muhammad Anwer

613

116

22 Nov 2021

L-Verse: Bidirectional Generation Between Image and TextComputer Vision and Pattern Recognition (CVPR), 2021

1.0K

22 Nov 2021

Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Pasi Pyrrö

H. Naseri

Alexander Jung

17 Nov 2021

Achieving Human Parity on Visual Question Answering

...

Ji Zhang

Songfang Huang

Fei Huang

Luo Si

Rong Jin

147

17 Nov 2021

INTERN: A New Learning Paradigm Towards General Vision

Siyu Chen

...

Yu Qiao

237

16 Nov 2021

Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual ConceptsInternational Conference on Machine Learning (ICML), 2021

335

352

16 Nov 2021

A Survey of Visual TransformersIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

Yang Liu

470

477

11 Nov 2021

TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary Data

Wasu Top Piriyakulkij

368

08 Nov 2021

Resource-Efficient Federated LearningEuropean Conference on Computer Systems (EuroSys), 2021

Suhaib A. Fahmy

253

01 Nov 2021

Multi-label Classification with Partial Annotations using Class-aware Selective LossComputer Vision and Pattern Recognition (CVPR), 2021

172

21 Oct 2021

Noisy Annotation Refinement for Object DetectionBritish Machine Vision Conference (BMVC), 2021

Jiafeng Mao

261

20 Oct 2021

Does Data Repair Lead to Fair Models? Curating Contextually Fair Data To Reduce Model BiasIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

161

20 Oct 2021

EBJR: Energy-Based Joint Reasoning for Adaptive InferenceBritish Machine Vision Conference (BMVC), 2021

Mohammad Akbari

Amin Banitalebi-Dehkordi

Yong Zhang

BDL MQ

156

20 Oct 2021

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Cory Paik

Stéphane Aroca-Ouellette

Alessandro Roncone

Katharina Kann

169

15 Oct 2021

Ego4D: Around the World in 3,000 Hours of Egocentric Video

...

Antonio Torralba

Mingfei Yan

1.0K

1,464

13 Oct 2021

Aura: Privacy-preserving Augmentation to Improve Test Set Diversity in Speech EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

165

08 Oct 2021

Inferring Offensiveness In Images From Natural Language Supervision

P. Schramowski

Kristian Kersting

08 Oct 2021

FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

199

05 Oct 2021

PASS: An ImageNet replacement for self-supervised pretraining without humans

Yuki M. Asano

Christian Rupprecht

Andrew Zisserman

Andrea Vedaldi

VLM SSL

214

27 Sep 2021

PETA: Photo Albums Event Recognition using Transformers AttentionInternational Conference on Pattern Recognition (ICPR), 2021

132

26 Sep 2021

Visual Scene Graphs for Audio Source SeparationIEEE International Conference on Computer Vision (ICCV), 2021

221

24 Sep 2021

Discovering and Validating AI Errors With Crowdsourced Failure Reports

Ángel Alexander Cabrera

174

23 Sep 2021

Pix2seq: A Language Modeling Framework for Object DetectionInternational Conference on Learning Representations (ICLR), 2021

David J. Fleet

655

408

22 Sep 2021

Deep Joint Source-Channel Coding for Multi-Task Network

229

13 Sep 2021

COSMic: A Coherence-Aware Generation Metric for Image Descriptions

154

11 Sep 2021

Panoptic Narrative GroundingIEEE International Conference on Computer Vision (ICCV), 2021

251

10 Sep 2021

Learning to Generate Scene Graph from Natural Language Supervision

Yiwu Zhong

Jing Shi

Jianwei Yang

Chenliang Xu

Yin Li

SSL

261

06 Sep 2021

Identification of Driver Phone Usage Violations via State-of-the-Art Object Detection with Tracking

S. Carrell

Amir Atapour-Abarghouei

136

05 Sep 2021

Evaluating the Single-Shot MultiBox Detector and YOLO Deep Learning Models for the Detection of Tomatoes in a Greenhouse

102

142

02 Sep 2021

EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQAIEEE Access (IEEE Access), 2021

Arka Ujjal Dey

Ernest Valveny

Gaurav Harit

346

22 Aug 2021

DVM-CAR: A large-scale automotive dataset for visual marketing research and applications

149

10 Aug 2021

Pre-trained Models for Sonar Images

Matias Valdenegro-Toro

Alan Preciado-Grijalva

Bilal Wehbe

VLM

02 Aug 2021

United We Learn Better: Harvesting Learning Improvements From Class Hierarchies Across Tasks

112

28 Jul 2021

Spatial-Temporal Transformer for Dynamic Scene Graph GenerationIEEE International Conference on Computer Vision (ICCV), 2021

306

150

26 Jul 2021

Fed-ensemble: Improving Generalization through Model Ensembling in Federated LearningIEEE Transactions on Automation Science and Engineering (T-ASE), 2021

178

21 Jul 2021

Multi-Label Generalized Zero Shot Learning for the Classification of Disease in Chest Radiographs

Nasir Hayat

Hazem Lashen

Farah E. Shamout

245

14 Jul 2021

Exploiting Image Translations via Ensemble Self-Supervised Learning for Unsupervised Domain Adaptation

Fabrizio J. Piva

Gijs Dubbelman

127

13 Jul 2021

EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments

182

09 Jul 2021

PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling

253

06 Jul 2021

MSE Loss with Outlying Label for Imbalanced Classification

S. Kato

Kazuhiro Hotta

108

06 Jul 2021

Web-Scale Generic Object Detection at Microsoft Bing

180

05 Jul 2021

CBNet: A Composite Backbone Network Architecture for Object Detection

Zhi Tang

Jingdong Chen

521

204

01 Jul 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation

Jing Liu

...

299

01 Jul 2021

Making Images Real Again: A Comprehensive Survey on Deep Image Composition

529

28 Jun 2021