ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.04790
  4. Cited By
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
v1v2v3 (latest)

The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

10 May 2020
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
ArXiv (abs)PDFHTML

Papers citing "The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes"

50 / 357 papers shown
Title
Visual Objectification in Films: Towards a New AI Task for Video
  Interpretation
Visual Objectification in Films: Towards a New AI Task for Video Interpretation
Julie Tores
L. Sassatelli
Hui-Yin Wu
Clement Bergman
Lea Andolfi
...
F. Precioso
Thierry Devars
Magali Guaresi
Virginie Julliard
Sarah Lecossais
129
4
0
24 Jan 2024
Red Teaming Visual Language Models
Red Teaming Visual Language Models
Mukai Li
Lei Li
Yuwei Yin
Masood Ahmed
Zhenguang Liu
Qi Liu
VLM
135
48
0
23 Jan 2024
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes
  Through Multimodal Explanations
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations
Prince Jha
Krishanu Maity
Raghav Jain
Apoorv Verma
Sriparna Saha
P. Bhattacharyya
85
9
0
18 Jan 2024
An Investigation of Large Language Models for Real-World Hate Speech
  Detection
An Investigation of Large Language Models for Real-World Hate Speech Detection
Keyan Guo
Alexander Hu
Jaden Mu
Ziheng Shi
Ziming Zhao
Nishant Vishwamitra
Hongxin Hu
118
14
0
07 Jan 2024
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse
Hongzhan Lin
Ziyang Luo
Bo Wang
Ruichao Yang
Jing Ma
256
38
0
03 Jan 2024
Generative Multimodal Models are In-Context Learners
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLMLRM
206
355
0
20 Dec 2023
Explainable Multimodal Sentiment Analysis on Bengali Memes
Explainable Multimodal Sentiment Analysis on Bengali Memes
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Sajib Kumar Saha Joy
Faisal Muhammad Shah
98
2
0
20 Dec 2023
Jack of All Tasks, Master of Many: Designing General-purpose
  Coarse-to-Fine Vision-Language Model
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick
Guangxing Han
Rui Hou
Sayan Nag
Ser-Nam Lim
Nicolas Ballas
Qifan Wang
Rama Chellappa
Amjad Almahairi
VLMMLLM
210
41
0
19 Dec 2023
Mixture of Cluster-conditional LoRA Experts for Vision-language
  Instruction Tuning
Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning
Yunhao Gou
Zhili Liu
Kai Chen
Lanqing Hong
Hang Xu
Aoxue Li
Dit-Yan Yeung
James T. Kwok
Yu Zhang
MoEMLLMVLM
264
85
0
19 Dec 2023
MATK: The Meme Analytical Tool Kit
MATK: The Meme Analytical Tool Kit
Ming Shan Hee
Aditi Kumaresan
N. Hoang
Nirmalendu Prakash
Rui Cao
Roy Ka-wei Lee
VLM
66
2
0
11 Dec 2023
PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using
  Large Language Models
PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models
Nirmalendu Prakash
Han Wang
N. Hoang
Ming Shan Hee
Roy Ka-wei Lee
100
13
0
11 Dec 2023
Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning
  Distilled from Large Language Models
Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Models
Hongzhan Lin
Ziyang Luo
Jing Ma
Long Chen
94
15
0
09 Dec 2023
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Hakan Inan
Kartikeya Upasani
Jianfeng Chi
Rashi Rungta
Krithika Iyer
...
Michael Tontchev
Qing Hu
Brian Fuller
Davide Testuggine
Madian Khabsa
AI4MH
237
605
0
07 Dec 2023
Visual Program Distillation: Distilling Tools and Programmatic Reasoning
  into Vision-Language Models
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models
Yushi Hu
Otilia Stretcu
Chun-Ta Lu
Krishnamurthy Viswanathan
Kenji Hata
Enming Luo
Ranjay Krishna
Ariel Fuxman
VLMLRMMLLM
184
55
0
05 Dec 2023
Contextualizing Internet Memes Across Social Media Platforms
Contextualizing Internet Memes Across Social Media Platforms
Saurav Joshi
Filip Ilievski
Luca Luceri
87
9
0
18 Nov 2023
Social Meme-ing: Measuring Linguistic Variation in Memes
Social Meme-ing: Measuring Linguistic Variation in Memes
Naitian Zhou
David Jurgens
David Bamman
81
4
0
15 Nov 2023
Improving Hateful Meme Detection through Retrieval-Guided Contrastive
  Learning
Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning
Jingbiao Mei
Jinghong Chen
Weizhe Lin
Bill Byrne
Marcus Tomalin
VLM
101
11
0
14 Nov 2023
Detecting and Correcting Hate Speech in Multimodal Memes with Large
  Visual Language Model
Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model
Minh-Hao Van
Xintao Wu
VLMMLLM
90
12
0
12 Nov 2023
Is GPT Powerful Enough to Analyze the Emotions of Memes?
Is GPT Powerful Enough to Analyze the Emotions of Memes?
Jingjing Wang
Joshua Luo
Grace Yang
Allen Hong
Feng Luo
ELMAI4MH
82
4
0
01 Nov 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
185
10
0
25 Oct 2023
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP
  Performance on Low-Resource Languages
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages
G. O. D. Santos
Diego A. B. Moreira
Alef Iury Ferreira
Jhessica Silva
Luiz Pereira
...
H. Maia
Nádia Da Silva
Esther Colombini
Hélio Pedrini
Sandra Avila
VLMCLIP
95
6
0
20 Oct 2023
BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification
BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification
Mithun Das
Animesh Mukherjee
95
8
0
18 Oct 2023
Reading Books is Great, But Not if You Are Driving! Visually Grounded
  Reasoning about Defeasible Commonsense Norms
Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms
Seungju Han
Junhyeok Kim
Jack Hessel
Liwei Jiang
Jiwan Chung
Yejin Son
Yejin Choi
Youngjae Yu
88
4
0
16 Oct 2023
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using
  LLMs
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang
Rishabh Rustogi
Rachel A. Brower-Sinning
Grace A. Lewis
Jane Hsieh
Tongshuang Wu
KELM
121
13
0
14 Oct 2023
MiniGPT-v2: large language model as a unified interface for
  vision-language multi-task learning
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
600
550
0
14 Oct 2023
Mapping Memes to Words for Multimodal Hateful Meme Classification
Mapping Memes to Words for Multimodal Hateful Meme Classification
Giovanni Burbi
Alberto Baldrati
Lorenzo Agnolucci
Marco Bertini
Marco Bertini
88
20
0
12 Oct 2023
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve
  Multimodal Sarcasm Detection
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Swapnil Bhosale
Abhra Chaudhuri
Alex Lee Robert Williams
Divyank Tiwari
Anjan Dutta
Xiatian Zhu
Pushpak Bhattacharyya
Diptesh Kanojia
85
4
0
29 Sep 2023
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Avamarie Brueggeman
Andrea Madotto
Mohammad Kachuee
Tushar Nagarajan
Matt Smith
...
Peyman Heidari
Yue Liu
Kavya Srinet
Babak Damavandi
Anuj Kumar
MLLM
142
102
0
27 Sep 2023
Image-Text Pre-Training for Logo Recognition
Image-Text Pre-Training for Logo Recognition
Mark Hubenthal
Suren Kumar
VLM
114
3
0
18 Sep 2023
MMICL: Empowering Vision-language Model with Multi-Modal In-Context
  Learning
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Haozhe Zhao
Zefan Cai
Shuzheng Si
Xiaojian Ma
Kaikai An
Liang Chen
Zixuan Liu
Sheng Wang
Wenjuan Han
Baobao Chang
MLLMVLM
199
162
0
14 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient
  Fine-tuning
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
115
26
0
13 Sep 2023
Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed
  Hinglish Memes
Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed Hinglish Memes
Shreyash Mishra
S. Suryavardan
Megha Chakraborty
Parth Patwa
Anku Rani
...
Amitava Das
A. Sheth
Manoj Kumar Chinnakotla
Asif Ekbal
Srijan Kumar
77
6
0
12 Sep 2023
Causal Intersectionality and Dual Form of Gradient Descent for
  Multimodal Analysis: a Case Study on Hateful Memes
Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: a Case Study on Hateful Memes
Yosuke Miyanishi
Minh Le Nguyen
163
2
0
19 Aug 2023
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual
  Questions
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Wenbo Hu
Y. Xu
Jian Wang
W. Li
Zhe Chen
Zhuowen Tu
MLLMVLM
193
166
0
19 Aug 2023
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Kaicheng Yang
Jiankang Deng
Xiang An
Jiawei Li
Ziyong Feng
Jia Guo
Jing Yang
Tongliang Liu
VLMCLIP
130
68
0
16 Aug 2023
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme
  Detection
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection
Rui Cao
Ming Shan Hee
Adriel Kuek
Wen-Haw Chong
Roy Ka-wei Lee
Jing Jiang
VLMMLLM
77
53
0
16 Aug 2023
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free
  Domain Adaptation
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
Xuefeng Hu
Ke Zhang
Lu Xia
Albert Y. C. Chen
Jiajia Luo
...
Nan Qiao
Xiao Zeng
Min Sun
Cheng-Hao Kuo
Ram Nevatia
VLM
75
37
0
04 Aug 2023
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive
  Vision-Language Models
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Anas Awadalla
Irena Gao
Josh Gardner
Jack Hessel
Yusuf Hanafy
...
Simon Kornblith
Pang Wei Koh
Gabriel Ilharco
Mitchell Wortsman
Ludwig Schmidt
MLLM
224
491
0
02 Aug 2023
Unimodal Intermediate Training for Multimodal Meme Sentiment
  Classification
Unimodal Intermediate Training for Multimodal Meme Sentiment Classification
Muzhaffar Hazman
Susan Mckeever
Josephine Griffith
119
2
0
01 Aug 2023
ARC-NLP at Multimodal Hate Speech Event Detection 2023: Multimodal
  Methods Boosted by Ensemble Learning, Syntactical and Entity Features
ARC-NLP at Multimodal Hate Speech Event Detection 2023: Multimodal Methods Boosted by Ensemble Learning, Syntactical and Entity FeaturesCASE (CASE), 2024
Umitcan Sahin
Izzet Emre Kucukkaya
Oguzhan Ozcelik
Cagri Toraman
82
13
0
25 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming-Hsuan Yang
Fahad Shahbaz Khan
VLM
200
143
0
25 Jul 2023
FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal
  Federated Learning
FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal Federated Learning
Huy Q. Le
Minh N. H. Nguyen
Chu Myaet Thwal
Yu Qiao
Chao Zhang
Choong Seon Hong
103
20
0
25 Jul 2023
Benchmarking and Analyzing Generative Data for Visual Recognition
Benchmarking and Analyzing Generative Data for Visual Recognition
Yue Liu
Haotian Liu
Liangyu Chen
Yong Jae Lee
Xuefei Liu
Yu Qiao
VLMEGVM
103
4
0
25 Jul 2023
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
Runjia Li
Shuyang Sun
Mohamed Elhoseiny
Juil Sock
115
10
0
21 Jul 2023
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph
  Transformers to Detect Hate Speech on Social Media
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
Liam Hebert
Gaurav Sahu
Yuxuan Guo
Nanda Kishore Sreenivas
Lukasz Golab
Robin Cohen
92
11
0
18 Jul 2023
One-Versus-Others Attention: Scalable Multimodal Integration for
  Clinical Data
One-Versus-Others Attention: Scalable Multimodal Integration for Clinical Data
Michal Golovanevsky
Eva Schiller
Akira Nair
Ritambhara Singh
Carsten Eickhoff
131
5
0
11 Jul 2023
AI-UPV at EXIST 2023 -- Sexism Characterization Using Large Language
  Models Under The Learning with Disagreements Regime
AI-UPV at EXIST 2023 -- Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime
Angel Felipe Magnossão de Paula
Giuliano Rizzi
Elisabetta Fersini
Damiano Spina
94
11
0
07 Jul 2023
What Matters in Training a GPT4-Style Language Model with Multimodal
  Inputs?
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Yan Zeng
Hanbo Zhang
Jiani Zheng
Jiangnan Xia
Guoqiang Wei
Yang Wei
Yuchen Zhang
Tao Kong
MLLM
175
85
0
05 Jul 2023
Evaluating AI systems under uncertain ground truth: a case study in dermatology
Evaluating AI systems under uncertain ground truth: a case study in dermatology
David Stutz
A. Cemgil
Abhijit Guha Roy
Tatiana Matejovicova
Melih Barsbey
...
Yossi Matias
Pushmeet Kohli
Yao Xiao
Arnaud Doucet
Alan Karthikesalingam
123
4
0
05 Jul 2023
Towards Language Models That Can See: Computer Vision Through the LENS
  of Natural Language
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
William Berrios
Gautam Mittal
Tristan Thrush
Douwe Kiela
Amanpreet Singh
MLLMVLM
90
63
0
28 Jun 2023
Previous
12345678
Next