ResearchTrend.AI
Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications

5 August 2021
Sandhini Agarwal
Gretchen Krueger
Jack Clark
Alec Radford
Jong Wook Kim
Miles Brundage

Papers citing "Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications"

50 / 95 papers shown
My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals
Jian Lan
Yifei Fu
Udo Schlegel
Gengyuan Zhang
Tanveer Hannan
Haokun Chen
Thomas Seidl
19
0
0
26 May 2025
Understanding Complexity in VideoQA via Visual Program Generation
Cristobal Eyzaguirre
Igor Vasiljevic
Achal Dave
Jiajun Wu
Rares Andrei Ambrus
Thomas Kollar
Juan Carlos Niebles
P. Tokmakov
80
0
0
19 May 2025
Text-to-Image Models and Their Representation of People from Different Nationalities Engaging in Activities
Abdulkareem Alsudais
86
0
0
08 Apr 2025
A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models
Leander Girrbach
Stephan Alaniz
Genevieve Smith
Zeynep Akata
143
0
0
30 Mar 2025
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images
Dongheng Lin
Han Hu
Jianbo Jiao
63
0
0
23 Mar 2025
Web Artifact Attacks Disrupt Vision Language Models
Maan Qraitem
Piotr Teterwak
Kate Saenko
Bryan A. Plummer
AAML
115
0
0
17 Mar 2025
Debiased Prompt Tuning in Vision-Language Model without Annotations
Chaoquan Jiang
Yunfan Yang
Rui Hu
Jitao Sang
VLM
95
0
0
11 Mar 2025
SB-Bench: Stereotype Bias Benchmark for Large Multimodal Models
Vishal Narnaware
Ashmal Vayani
Rohit Gupta
Swetha Sirnam
Mubarak Shah
204
3
0
12 Feb 2025
Detecting Content Rating Violations in Android Applications: A Vision-Language Approach
Dishanika Denipitiyage
B. Silva
Suranga Seneviratne
A. Seneviratne
Sanjay Chawla
83
0
0
07 Feb 2025
Joint Vision-Language Social Bias Removal for CLIP
Haoyu Zhang
Yangyang Guo
Mohan S. Kankanhalli
VLM
196
1
0
19 Nov 2024
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Leander Girrbach
Yiran Huang
Stephan Alaniz
Trevor Darrell
Zeynep Akata
VLM
145
2
0
25 Oct 2024
Debiasing Vison-Language Models with Text-Only Training
Yunfan Yang
Chaoquan Jiang
Zhiyu Lin
Jinlin Xiao
Jiaming Zhang
Jitao Sang
VLM
78
1
0
12 Oct 2024
A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks
Hoin Jung
T. Jang
Xiaoqian Wang
VLM
77
3
0
10 Oct 2024
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL
SSL
109
2
0
01 Oct 2024
Social perception of faces in a vision-language model
C. I. Hausladen
Manuel Knott
Colin F. Camerer
Pietro Perona
CVBM
VLM
143
2
0
26 Aug 2024
Fairness and Bias Mitigation in Computer Vision: A Survey
Sepehr Dehdashtian
Ruozhen He
Yi Li
Guha Balakrishnan
Nuno Vasconcelos
Vicente Ordonez
Vishnu Boddeti
143
5
0
05 Aug 2024
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models
Ali Abdollahi
Mahdi Ghaznavi
Mohammad Reza Karimi Nejad
Arash Mari Oriyad
Reza Abbasi
Ali Salesi
Melika Behjati
M. Rohban
M. Baghshah
CoGe
139
1
0
30 Jul 2024
GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing
Yisong Xiao
Aishan Liu
QianJia Cheng
Zhenfei Yin
Siyuan Liang
Jiapeng Li
Jing Shao
Xianglong Liu
Dacheng Tao
124
8
0
30 Jun 2024
"My Kind of Woman": Analysing Gender Stereotypes in AI through The Averageness Theory and EU Law
Miriam Doh
Anastasia Karagianni
98
1
0
27 Jun 2024
MoESD: Mixture of Experts Stable Diffusion to Mitigate Gender Bias
Guorun Wang
Lucia Specia
DiffM
MoE
86
0
0
25 Jun 2024
Enhancing Domain Adaptation through Prompt Gradient Alignment
Hoang Phan
Lam C. Tran
Quyen Tran
Trung Le
188
1
0
13 Jun 2024
SLANT: Spurious Logo ANalysis Toolkit
Maan Qraitem
Piotr Teterwak
Kate Saenko
Bryan A. Plummer
AAML
82
0
0
03 Jun 2024
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Li Lin
Santosh
Xin Eric Wang
Shu Hu
EGVM
158
12
0
02 Jun 2024
Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation
Kimia Hamidieh
Haoran Zhang
Swami Sankaranarayanan
Marzyeh Ghassemi
97
0
0
28 May 2024
No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models
Angeline Pouget
Lucas Beyer
Emanuele Bugliarello
Xiao Wang
Andreas Steiner
Xiao-Qi Zhai
Ibrahim Alabdulmohsin
VLM
94
9
0
22 May 2024
Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp
Rachel Hong
William Agnew
Tadayoshi Kohno
Jamie Morgenstern
107
15
0
13 May 2024
Decoding Emotions in Abstract Art: Cognitive Plausibility of CLIP in Recognizing Color-Emotion Associations
Hanna-Sophia Widhoelzl
Ece Takmaz
73
2
0
10 May 2024
FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication
Eric Slyman
Stefan Lee
Scott D. Cohen
Kushal Kafle
VLM
61
5
0
24 Apr 2024
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
Simon Schrodi
David T. Hoffmann
Max Argus
Volker Fischer
Thomas Brox
VLM
142
4
0
11 Apr 2024
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
Juhong Min
Shyamal Buch
Arsha Nagrani
Minsu Cho
Cordelia Schmid
LRM
99
31
0
09 Apr 2024
DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets
Harsh Rangwani
Pradipto Mondal
Mayank Mishra
Ashish Ramayee Asokan
R. V. Babu
100
9
0
03 Apr 2024
Vision-language models for decoding provider attention during neonatal resuscitation
Felipe Parodi
Jordan K Matelsky
Alejandra Regla-Vargas
Elizabeth E. Foglia
Charis Lim
Danielle Weinberg
Konrad Kording
Heidi Herrick
Michael L Platt
72
0
0
01 Apr 2024
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs
Sepehr Dehdashtian
Lan Wang
Vishnu Boddeti
VLM
89
15
0
22 Mar 2024
Controllable Prompt Tuning For Balancing Group Distributional Robustness
Hoang Phan
Andrew Gordon Wilson
Qi Lei
99
7
0
05 Mar 2024
What do we learn from inverting CLIP models?
Hamid Kazemi
Atoosa Malemir Chegini
Jonas Geiping
Soheil Feizi
Tom Goldstein
55
6
0
05 Mar 2024
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
Huan Ma
Yan Zhu
Changqing Zhang
Peilin Zhao
Baoyuan Wu
Long-Kai Huang
Qinghua Hu
Bing Wu
VLM
167
2
0
01 Mar 2024
The Bias of Harmful Label Associations in Vision-Language Models
C. Hazirbas
Alicia Sun
Yonathan Efroni
Mark Ibrahim
VLM
76
0
0
11 Feb 2024
KVQ: Kwai Video Quality Assessment for Short-form Videos
Yiting Lu
Xin Li
Yajing Pei
Kun Yuan
Qizhi Xie
Yunpeng Qu
Ming Sun
Chao Zhou
Zhibo Chen
113
20
0
11 Feb 2024
Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
Kathleen C. Fraser
S. Kiritchenko
106
40
0
08 Feb 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
Felix Friedrich
Katharina Hämmerl
P. Schramowski
Manuel Brack
Jindrich Libovický
Kristian Kersting
Alexander Fraser
EGVM
159
14
0
29 Jan 2024
The Neglected Tails in Vision-Language Models
Shubham Parashar
Zhiqiu Lin
Tian Liu
Xiangjue Dong
Yanan Li
Deva Ramanan
James Caverlee
Shu Kong
VLM
128
38
0
23 Jan 2024
Benchmarking PathCLIP for Pathology Image Analysis
Sunyi Zheng
Xiaonan Cui
Yuxuan Sun
Jingxiong Li
Honglin Li
Yunlong Zhang
Pingyi Chen
Xueping Jing
Zhaoxiang Ye
Lin Yang
VLM
53
7
0
05 Jan 2024
Parrot Captions Teach CLIP to Spot Text
Yiqi Lin
Conghui He
Alex Jinpeng Wang
Bin Wang
Weijia Li
Mike Zheng Shou
106
7
0
21 Dec 2023
Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment
Utkarsh Mall
Cheng Perng Phoo
Meilin Kelsey Liu
Carl Vondrick
B. Hariharan
Kavita Bala
VLM
72
42
0
12 Dec 2023
Explaining CLIP's performance disparities on data from blind/low vision users
Daniela Massiceti
Camilla Longden
Agnieszka Slowik
Samuel Wills
Martin Grayson
C. Morrison
VLM
73
10
0
29 Nov 2023
Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding
Chancharik Mitra
Abrar Anwar
Rodolfo Corona
Dan Klein
Trevor Darrell
Jesse Thomason
72
1
0
12 Nov 2023
Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models
Laura Cabello
Emanuele Bugliarello
Stephanie Brandl
Desmond Elliott
74
7
0
26 Oct 2023
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
132
19
0
24 Sep 2023
Improving CLIP Robustness with Knowledge Distillation and Self-Training
Clement Laroudie
Andrei Bursuc
Mai Lan Ha
Gianni Franchi
VLM
71
5
0
19 Sep 2023
ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang
Xuanbai Chen
Siqi Chai
Chen Henry Wu
Dmitry Lagun
Thabo Beeler
Fernando de la Torre
VLM
124
58
0
11 Sep 2023