Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2005.04790
Cited By
v1
v2
v3 (latest)
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
10 May 2020
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes"
50 / 357 papers shown
Title
P2P: A Poison-to-Poison Remedy for Reliable Backdoor Defense in LLMs
Shuai Zhao
Xinyi Wu
Shiqian Zhao
Xiaobao Wu
Zhongliang Guo
Yanhao Jia
Anh Tuan Luu
AAML
8
0
0
06 Oct 2025
LLM-Based Multi-Task Bangla Hate Speech Detection: Type, Severity, and Target
Md. Arid Hasan
Firoj Alam
Md Fahad Hossain
Usman Naseem
Syed Ishtiaque Ahmed
0
0
0
02 Oct 2025
Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions
Smita Khapre
Melkamu Mersha
Hassan Shakil
Jonali Baruah
Jugal Kalita
12
0
0
29 Sep 2025
InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions
Liangjian Wen
Qun Dai
Jianzhuang Liu
Jiangtao Zheng
Yong Dai
Dongkai Wang
Zhao Kang
Jun Wang
Z. Xu
Jiang Duan
4
0
0
28 Sep 2025
QoNext: Towards Next-generation QoE for Foundation Models
Yijin Guo
Ye Shen
Farong Wen
Junying Wang
Zicheng Zhang
Qi Jia
Guangtao Zhai
28
0
0
26 Sep 2025
Is GPT-4o mini Blinded by its Own Safety Filters? Exposing the Multimodal-to-Unimodal Bottleneck in Hate Speech Detection
Niruthiha Selvanayagam
Ted Kurti
4
0
0
17 Sep 2025
Multimodal Hate Detection Using Dual-Stream Graph Neural Networks
Jiangbei Yue
Shuonan Yang
Tailin Chen
Jianbo Jiao
Zeyu Fu
0
0
0
16 Sep 2025
MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs
Feilong Chen
Y. Liu
Yi Huang
Hao Wang
Miren Tian
Ya-Qi Yu
Minghui Liao
Jihao Wu
MLLM
VLM
103
0
0
15 Sep 2025
Defining, Understanding, and Detecting Online Toxicity: Challenges and Machine Learning Approaches
Gautam Kishore Shahi
Tim A. Majchrzak
32
0
0
14 Sep 2025
CEMTM: Contextual Embedding-based Multimodal Topic Modeling
Amirhossein Abaskohi
Raymond Li
Chuyuan Li
Shafiq Joty
Giuseppe Carenini
16
0
0
14 Sep 2025
Robult: Leveraging Redundancy and Modality Specific Features for Robust Multimodal Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Duy Nguyen
Abhi Kamboj
Minh N. Do
24
0
0
03 Sep 2025
MM-HSD: Multi-Modal Hate Speech Detection in Videos
Berta Céspedes-Sarrias
Carlos Collado-Capell
Pablo Rodenas-Ruiz
Olena Hrynenko
Andrea Cavallaro
24
0
0
28 Aug 2025
Data Leakage in Visual Datasets
Patrick Ramos
Ryan Ramos
Noa Garcia
PILM
52
0
0
24 Aug 2025
Towards Open World Detection: A Survey
Andrei-Stefan Bulzan
Cosmin Cernazanu-Glavan
ObjD
VLM
72
0
0
22 Aug 2025
Labels or Input? Rethinking Augmentation in Multimodal Hate Detection
Sahajpreet Singh
Rongxin Ouyang
Subhayan Mukerjee
Kokil Jaidka
VLM
32
0
0
15 Aug 2025
CATP: Contextually Adaptive Token Pruning for Efficient and Enhanced Multimodal In-Context Learning
Yanshu Li
JianJiang Yang
Zhennan Shen
Ligong Han
Haoyan Xu
Ruixiang Tang
VLM
41
3
0
11 Aug 2025
MV-Debate: Multi-view Agent Debate with Dynamic Reflection Gating for Multimodal Harmful Content Detection in Social Media
Rui Lu
Jinhe Bi
Yunpu Ma
Feng Xiao
Yuntao Du
Yijun Tian
48
1
0
07 Aug 2025
ToxicTAGS: Decoding Toxic Memes with Rich Tag Annotations
Subhankar Swain
Naquee Rizwan
Nayandeep Deb
Vishwajeet Singh Solanki
Vishwa Gangadhar S
Animesh Mukherjee
42
0
0
06 Aug 2025
BigTokDetect: A Clinically-Informed Vision-Language Modeling Framework for Detecting Pro-Bigorexia Videos on TikTok
Minh Duc Hoang Chu
Kshitij Pawar
Zihao He
Roxanna Sharifi
Ross Sonnenblick
Magdalayna Curry
Laura DÁdamo
Lindsay Young
Stuart Murray
Kristina Lerman
44
0
0
30 Jul 2025
On the Reliability of Vision-Language Models Under Adversarial Frequency-Domain Perturbations
Jordan Vice
Naveed Akhtar
Yansong Gao
Richard Hartley
Ajmal Mian
AAML
59
0
0
30 Jul 2025
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Tianhong Gao
Yannian Fu
Weiqun Wu
Haixiao Yue
Shanshan Liu
Gang Zhang
MLLM
LRM
45
0
0
29 Jul 2025
Rainbow Noise: Stress-Testing Multimodal Harmful-Meme Detectors on LGBTQ Content
Ran Tong
Songtao Wei
Jiaqi Liu
Lanruo Wang
82
1
0
24 Jul 2025
MultiVox: A Benchmark for Evaluating Voice Assistants for Multimodal Interactions
Ramaneswaran Selvakumar
Ashish Seth
Nishit Anand
Utkarsh Tyagi
Sonal Kumar
Sreyan Ghosh
Dinesh Manocha
AuLLM
23
0
0
14 Jul 2025
Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning
Fengjun Pan
Anh Tuan Luu
Xiaobao Wu
95
1
0
10 Jun 2025
Representation Decomposition for Learning Similarity and Contrastness Across Modalities for Affective Computing
Yuanhe Tian
Pengsen Cheng
Guoqing Jin
Lei Zhang
Yan Song
72
3
0
08 Jun 2025
MINT: Multimodal Instruction Tuning with Multimodal Interaction Grouping
Xiaojun Shan
Qi Cao
Xing Han
Haofei Yu
Paul Liang
132
1
0
02 Jun 2025
VModA: An Effective Framework for Adaptive NSFW Image Moderation
Han Bao
Qinying Wang
Zhi Chen
Qingming Li
Xuhong Zhang
Changjiang Li
Zonghui Wang
Shouling Ji
Wenzhi Chen
126
0
0
29 May 2025
MObyGaze: a film dataset of multimodal objectification densely annotated by experts
Julie Tores
Elisa Ancarani
L. Sassatelli
Hui-Yin Wu
Clement Bergman
...
F. Precioso
Thierry Devars
Magali Guaresi
Virginie Julliard
Sarah Lecossais
DiffM
VGen
86
0
0
28 May 2025
EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications
Ancheng Xu
Zhihao Yang
Junlin Li
Guanghu Yuan
Longze Chen
...
Zhen Qin
Hengyun Chang
Hamid Alinejad-Rokny
Bo Zheng
Min Yang
AAML
150
1
0
23 May 2025
What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse
Shijia Zhou
Siyao Peng
Simon Luebke
Jörg Haßler
Mario Haim
Saif M. Mohammad
Barbara Plank
97
0
0
22 May 2025
ICYM2I: The illusion of multimodal informativeness under missingness
Young Sang Choi
Vincent Jeanselme
Pierre Elias
Shalmali Joshi
80
0
0
22 May 2025
Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study
DongGeon Lee
Joonwon Jang
Jihae Jeong
Hwanjo Yu
163
3
0
21 May 2025
CAMA: Enhancing Multimodal In-Context Learning with Context-Aware Modulated Attention
Yanshu Li
JianJiang Yang
Ziteng Yang
Bozheng Li
Yi Cao
...
Ligong Han
Yingjie Victor Chen
Songlin Fei
Dongfang Liu
Ruixiang Tang
131
6
0
21 May 2025
TACO: Enhancing Multimodal In-context Learning via Task Mapping-Guided Sequence Configuration
Yanshu Li
Tian Yun
Tian Yun
Pinyuan Feng
Jinfa Huang
Ruixiang Tang
122
13
0
21 May 2025
Enhanced Multimodal Hate Video Detection via Channel-wise and Modality-wise Fusion
Yinghui Zhang
Tailin Chen
Yuchen Zhang
Zeyu Fu
153
3
0
17 May 2025
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning
Teli Ma
Shengfang Zhai
Mingzhe Du
Yulin Chen
Tri Cao
...
Xuzhao Li
Kun Wang
Junfeng Fang
Jiaheng Zhang
Bryan Hooi
OffRL
LRM
134
12
0
16 May 2025
Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models
Minh-Hao Van
Xintao Wu
VLM
212
0
0
30 Apr 2025
MemeBLIP2: A novel lightweight multimodal system to detect harmful memes
Jiaqi Liu
Ran Tong
Aowei Shen
Shuzheng Li
Changlin Yang
Lisha Xu
VLM
243
3
0
29 Apr 2025
CAMU: Context Augmentation for Meme Understanding
Girish A. Koushik
Diptesh Kanojia
Helen Treharne
Aditya Joshi
VLM
219
2
0
24 Apr 2025
Detecting and Understanding Hateful Contents in Memes Through Captioning and Visual Question-Answering
Ali Anaissi
Junaid Akram
Kunal Chaturvedi
Ali Braytee
97
1
0
23 Apr 2025
LLM-based Semantic Augmentation for Harmful Content Detection
Elyas Meguellati
Assaad Zeghina
S. Sadiq
Gianluca Demartini
166
2
0
22 Apr 2025
Leveraging multimodal explanatory annotations for video interpretation with Modality Specific Dataset
Elisa Ancarani
Julie Tores
L. Sassatelli
Rémy Sun
Hui-Yin Wu
F. Precioso
106
0
0
15 Apr 2025
Improving Multimodal Hateful Meme Detection Exploiting LMM-Generated Knowledge
Maria Tzelepi
Vasileios Mezaris
140
1
0
14 Apr 2025
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao
Isaac Chung
Imene Kerboua
Jamie Stirling
Xin Zhang
Márton Kardos
Roman Solomatin
Noura Al Moubayed
Kenneth Enevoldsen
Niklas Muennighoff
VLM
229
4
0
14 Apr 2025
Data Metabolism: An Efficient Data Design Schema For Vision Language Model
Jingyuan Zhang
Hongzhi Zhang
Zhou Haonan
Chenxi Sun
Xingguang Ji
Jiakang Wang
Fanheng Kong
Teli Ma
Qi Wang
Fuzheng Zhang
VLM
192
2
0
10 Apr 2025
Capybara-OMNI: An Efficient Paradigm for Building Omni-Modal Language Models
Xingguang Ji
Jiakang Wang
Hongzhi Zhang
Jingyuan Zhang
Haonan Zhou
Chenxi Sun
Teli Ma
Qi Wang
Fuzheng Zhang
MLLM
VLM
182
0
0
10 Apr 2025
M
2
^2
2
IV: Towards Efficient and Fine-grained Multimodal In-Context Learning via Representation Engineering
Yanshu Li
Yi Cao
Hongyang He
Qisen Cheng
Xiang Fu
Xi Xiao
Tianyang Wang
Ruixiang Tang
VLM
142
6
0
06 Apr 2025
Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks
Jiawei Wang
Yushen Zuo
Yuanjun Chai
Ziqiang Liu
Yichen Fu
Yichun Feng
Kin-Man Lam
AAML
VLM
238
0
0
02 Apr 2025
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
Iñigo Pikabea
Iñaki Lacunza
Oriol Pareras
Carlos Escolano
Aitor Gonzalez-Agirre
Javier Hernando
Marta Villegas
VLM
272
1
0
28 Mar 2025
MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Jiayi Ji
Jie Lou
Debing Zhang
Rongrong Ji
323
5
0
26 Mar 2025
1
2
3
4
5
6
7
8
Next