Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
v1v2v3v4v5 (latest)

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

European Conference on Computer Vision (ECCV), 2020
    VLM

Papers citing "Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks"

50 / 1,171 papers shown
Title
Enhancing Fine-Grained Image Classifications via Cascaded Vision
  Language Models
Enhancing Fine-Grained Image Classifications via Cascaded Vision Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
138
2
0
18 May 2024
Is CLIP the main roadblock for fine-grained open-world perception?
Is CLIP the main roadblock for fine-grained open-world perception?International Conference on Content-Based Multimedia Indexing (CBMI), 2024
129
8
0
04 Apr 2024
Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement
Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation EnhancementConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
172
1
0
24 Mar 2024
Transformer based Multitask Learning for Image Captioning and Object
  Detection
Transformer based Multitask Learning for Image Captioning and Object DetectionPacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2024
111
3
0
10 Mar 2024