The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

Showing 50 of 623 citing papers

Can Transformers Capture Spatial Relations between Objects?

178

01 Mar 2024

PLReMix: Combating Noisy Labels with Pseudo-Label Relaxed Contrastive Representation Learning

Xiaoyu Liu

Beitong Zhou

Cheng Cheng

235

27 Feb 2024

Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models

169

13 Feb 2024

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

...

Yu Qiao

512

139

08 Feb 2024

Locally-Adaptive Quantization for Streaming Vector Search

308

03 Feb 2024

Category-wise Fine-Tuning: Resisting Incorrect Pseudo-Labels in Multi-Label Image Classification with Partial Labels

221

30 Jan 2024

Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding

...

Somayeh Sojoudi

191

09 Jan 2024

Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification

Bo Liu

207

02 Jan 2024

Amodal Completion via Progressive Mixed Context Diffusion

245

24 Dec 2023

Bayesian Transfer Learning

Piotr M. Suder

Jason Xu

David B. Dunson

252

20 Dec 2023

FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels

228

19 Dec 2023

Painterly Image Harmonization by Learning from Painterly Objects

AAAI Conference on Artificial Intelligence (AAAI), 2023

199

15 Dec 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense Models

Neural Information Processing Systems (NeurIPS), 2023

...

Yejin Choi

269

08 Dec 2023

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

Computer Vision and Pattern Recognition (CVPR), 2023

327

02 Dec 2023

Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Bowen Jiang

Zhijun Zhuang

Shreyas S. Shivakumar

Camillo J Taylor

268

21 Nov 2023

SniffyArt: The Dataset of Smelling Persons

Vincent Christlein

188

20 Nov 2023

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Computer Vision and Pattern Recognition (CVPR), 2023

Lu Yuan

392

383

10 Nov 2023

Exploring Dataset-Scale Indicators of Data Quality

Ben Feuer

Chinmay Hegde

193

07 Nov 2023

InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images

International Journal of Remote Sensing (IJRS), 2023

A. Silva

H. Felix

Franscisco Paulo Magalhaes Simoes

Veronica Teichrieb

Michel Mozinho dos Santos

H. Santiago

V. Sgotti

H. B. D. T. L. Neto

367

02 Nov 2023

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities

Information Fusion (Inf. Fusion), 2023

Md Farhan Ishmam

Md Sakib Hossain Shovon

M. F. Mridha

Nilanjan Dey

399

01 Nov 2023

Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Michael Backes

191

30 Oct 2023

Open-Set Image Tagging with Multi-Grained Text Supervision

Xinyu Huang

Yi-Jie Huang

Youcai Zhang

Weiwei Tian

Rui Feng

Lei Zhang

248

23 Oct 2023

OV-VG: A Benchmark for Open-Vocabulary Visual Grounding

Xiangtai Li

269

22 Oct 2023

Zone Evaluation: Revealing Spatial Bias in Object Detection

Xiang Li

Ming-Ming Cheng

276

20 Oct 2023

Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models

ACM Computing Surveys (ACM Comput. Surv.), 2023

Zhaozheng Chen

Qianru Sun

VLM

426

19 Oct 2023

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions

IEEE International Conference on Computer Vision (ICCV), 2023

Chengyang Zhao

Songlin Yang

Zhenfang Chen

Mingyu Ding

Chuang Gan

388

10 Oct 2023

Lightweight In-Context Tuning for Multimodal Unified Models

144

08 Oct 2023

Automatic and Efficient Customization of Neural Networks for ML Applications

133

07 Oct 2023

Adaptive Visual Scene Understanding: Incremental Scene Graph Generation

Neural Information Processing Systems (NeurIPS), 2023

297

02 Oct 2023

DreamCom: Finetuning Text-guided Inpainting Model for Image Composition

237

27 Sep 2023

A Survey on Image-text Multimodal Models

Ruifeng Guo

Jingxuan Wei

Linzhuang Sun

Khai-Nguyen Nguyen

Guiyong Chang

Dawei Liu

Sibo Zhang

Zhengbing Yao

Mingjun Xu

Liping Bu

VLM

320

23 Sep 2023

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

International Journal of Computer Vision (IJCV), 2023

Jiahao Xie

Wei Li

Xiangtai Li

Ziwei Liu

Yew-Soon Ong

Chen Change Loy

DiffM VLM

312

22 Sep 2023

ReShader: View-Dependent Highlights for Single Image View-Synthesis

ACM Transactions on Graphics (TOG), 2023

346

19 Sep 2023

AdSEE: Investigating the Impact of Image Style Editing on Advertisement Attractiveness

Knowledge Discovery and Data Mining (KDD), 2023

117

15 Sep 2023

Collecting Visually-Grounded Dialogue with A Game Of Sorts

International Conference on Language Resources and Evaluation (LREC), 2023

Bram Willemsen

Dmytro Kalpakchi

Gabriel Skantze

114

10 Sep 2023

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Computer Vision and Pattern Recognition (CVPR), 2023

...

Jianmin Bao

299

158

07 Sep 2023

Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory

IEEE International Conference on Computer Vision (ICCV), 2023

Yuxin Peng

Yang Liu

VLM

247

07 Sep 2023

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

...

Luke Zettlemoyer

267

162

05 Sep 2023

FACET: Fairness in Computer Vision Evaluation Benchmark

IEEE International Conference on Computer Vision (ICCV), 2023

351

31 Aug 2023

Separate and Locate: Rethink the Text in Text-based Visual Question Answering

ACM Multimedia (ACM MM), 2023

278

31 Aug 2023

SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

252

24 Aug 2023

HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks

Xin Zhan

24 Aug 2023

Seeing the Intangible: Survey of Image Classification into High-Level and Abstract Categories

Delfina Sol Martinez Pandiani

Valentina Presutti

206

21 Aug 2023

ControlCom: Controllable Image Composition using Diffusion Model

230

19 Aug 2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training

IEEE International Conference on Computer Vision (ICCV), 2023

244

18 Aug 2023

DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels

167

09 Aug 2023

Foreground Object Search by Distilling Composite Image Feature

IEEE International Conference on Computer Vision (ICCV), 2023

Bo Zhang

Jiacheng Sui

Li Niu

251

09 Aug 2023

Which Tokens to Use? Investigating Token Reduction in Vision Transformers

Joakim Bruslund Haurum

274

09 Aug 2023

Distributionally Robust Classification on a Data Budget

249

07 Aug 2023

Improving Scene Graph Generation with Superpixel-Based Interaction Learning

ACM Multimedia (ACM MM), 2023

Zhidong Deng

175

04 Aug 2023