COBRA: Contrastive Bi-Modal Representation Algorithm

7 May 2020
Vishaal Udandarao
A. Maiti
Deepak Srivatsav
Suryatej Reddy Vyalla
Yifang Yin
R. Shah

Papers citing "COBRA: Contrastive Bi-Modal Representation Algorithm"

15 / 15 papers shown
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
58
9
0
16 Oct 2024
Meta-Learn Unimodal Signals with Weak Supervision for Multimodal Sentiment Analysis
Sijie Mai
Yu Zhao
Ying Zeng
Jianhua Yao
Haifeng Hu
38
2
0
28 Aug 2024
Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model
Elaheh Baharlouei
Mahsa Shafaei
Yigeng Zhang
Hugo Jair Escalante
Thamar Solorio
51
0
0
12 Jun 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
Vishaal Udandarao
Ameya Prabhu
Adhiraj Ghosh
Yash Sharma
Philip Torr
Adel Bibi
Samuel Albanie
Matthias Bethge
VLM
128
45
0
04 Apr 2024
ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks
Yang Liu
Xiaomin Yu
Gongyu Zhang
Christos Bergeles
Prokar Dasgupta
Alejandro Granados
Sebastien Ourselin
48
2
0
27 Feb 2024
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
Heqing Zou
Meng Shen
Chen Chen
Yuchen Hu
D. Rajan
Chng Eng Siong
SSL
45
15
0
16 May 2023
Curriculum Learning Meets Weakly Supervised Modality Correlation Learning
Sijie Mai
Ya Sun
Haifeng Hu
37
3
0
15 Dec 2022
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
Vishaal Udandarao
Ankush Gupta
Samuel Albanie
VLM
MLLM
34
103
0
28 Nov 2022
Fake News Detection with Heterogeneous Transformer
Tianle Li
Yushi Sun
Shang-ling Hsu
Yanjia Li
Raymond Chi-Wing Wong
29
3
0
06 May 2022
Hybrid Contrastive Learning of Tri-Modal Representation for Multimodal Sentiment Analysis
Sijie Mai
Ying Zeng
Shuangjia Zheng
Haifeng Hu
30
117
0
04 Sep 2021
Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions
Anil Rahate
Rahee Walambe
S. Ramanna
K. Kotecha
27
135
0
29 Jul 2021
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumder
Soujanya Poria
Roger Zimmermann
Amir Zadeh
28
6
0
19 Oct 2020
DisCont: Self-Supervised Visual Attribute Disentanglement using Context Vectors
Sarthak Bhagat
Vishaal Udandarao
Shagun Uppal
CoGe
SSL
DRL
16
6
0
10 Jun 2020
Investigating Audio, Visual, and Text Fusion Methods for End-to-End Automatic Personality Prediction
Onno P. Kampman
Elham J. Barezi
D. Bertero
Pascale Fung
51
95
0
02 May 2018
A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics
Yunchao Gong
Qifa Ke
Michael Isard
Svetlana Lazebnik
3DV
78
584
0
18 Dec 2012