ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.04512
  4. Cited By
Defense-Prefix for Preventing Typographic Attacks on CLIP
v1v2v3 (latest)

Defense-Prefix for Preventing Typographic Attacks on CLIP

10 April 2023
Hiroki Azuma
Yusuke Matsui
    VLMAAML
ArXiv (abs)PDFHTMLGithub (6★)

Papers citing "Defense-Prefix for Preventing Typographic Attacks on CLIP"

17 / 17 papers shown
Dyslexify: A Mechanistic Defense Against Typographic Attacks in CLIP
Dyslexify: A Mechanistic Defense Against Typographic Attacks in CLIP
Lorenz Hufe
Constantin Venhoff
Maximilian Dreyer
Sebastian Lapuschkin
Wojciech Samek
Wojciech Samek
AAML
261
2
0
28 Aug 2025
Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance
Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance
Yuchu Jiang
Jian Zhao
Yuchen Yuan
Tianle Zhang
Yao Huang
...
Ya Zhang
Shuicheng Yan
Chi Zhang
Z. He
Xuelong Li
SILM
546
6
0
12 Aug 2025
Vision Transformers Don't Need Trained Registers
Vision Transformers Don't Need Trained Registers
Nick Jiang
Amil Dravid
Alexei A. Efros
Yossi Gandelsman
560
25
0
09 Jun 2025
Steering CLIP's vision transformer with sparse autoencoders
Steering CLIP's vision transformer with sparse autoencoders
Sonia Joseph
Praneet Suresh
Ethan Goldfarb
Lorenz Hufe
Yossi Gandelsman
Robert Graham
Danilo Bzdok
Wojciech Samek
Blake A. Richards
358
19
0
11 Apr 2025
SCAM: A Real-World Typographic Robustness Evaluation for Multimodal Foundation Models
SCAM: A Real-World Typographic Robustness Evaluation for Multimodal Foundation Models
Justus Westerhoff
Erblina Purellku
Jakob Hackstein
Jonas Loos
Leo Pinetzki
Lorenz Hufe
AAML
759
5
0
07 Apr 2025
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Zhaochen Wang
Yujun Cai
Zi Huang
Bryan Hooi
Yiwei Wang
Ming Yang
CoGeVLM
443
5
0
02 Apr 2025
Web Artifact Attacks Disrupt Vision Language Models
Web Artifact Attacks Disrupt Vision Language Models
Maan Qraitem
Piotr Teterwak
Kate Saenko
Bryan A. Plummer
AAML
367
3
0
17 Mar 2025
Typographic Attacks in a Multi-Image Setting
Typographic Attacks in a Multi-Image SettingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Xiaomeng Wang
Subrat Kishore Dutta
Martha Larson
AAML
235
5
0
12 Feb 2025
New Emerged Security and Privacy of Pre-trained Model: a Survey and
  Outlook
New Emerged Security and Privacy of Pre-trained Model: a Survey and Outlook
Meng Yang
Tianqing Zhu
Chi Liu
Wanlei Zhou
Shui Yu
Philip S. Yu
AAMLELMPILM
357
2
0
12 Nov 2024
Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End Vision Language Action Models
Manipulation Facing Threats: Evaluating Physical Vulnerabilities in End-to-End Vision Language Action Models
Hao Cheng
Erjia Xiao
Chengyuan Yu
Zhao Yao
Mengshu Sun
...
Juil Sock
Jindong Gu
Zhanchen Zhu
Jindong Gu
Renjing Xu
AAML
659
15
0
20 Sep 2024
Empirical Analysis of Large Vision-Language Models against Goal
  Hijacking via Visual Prompt Injection
Empirical Analysis of Large Vision-Language Models against Goal Hijacking via Visual Prompt Injection
Subaru Kimura
Ryota Tanaka
Shumpei Miyawaki
Jun Suzuki
Keisuke Sakaguchi
MLLM
332
19
0
07 Aug 2024
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
Transfer Attack for Bad and Good: Explain and Boost Adversarial Transferability across Multimodal Large Language Models
Hao-Ran Cheng
Erjia Xiao
Jiayan Yang
Jinhao Duan
Yichi Wang
...
Qiang Zhang
Le Yang
Kaidi Xu
Jindong Gu
Zhanchen Zhu
AAML
734
10
0
30 May 2024
Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving
  with Typography
Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography
N. Chung
Sensen Gao
Tuan-Anh Vu
Jie M. Zhang
Aishan Liu
Yun Lin
Jin Song Dong
Qi Guo
AAML
232
18
0
23 May 2024
Unveiling Typographic Deceptions: Insights of the Typographic
  Vulnerability in Large Vision-Language Model
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
253
18
0
29 Feb 2024
Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks
Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks
Maan Qraitem
Nazia Tasnim
Piotr Teterwak
Kate Saenko
Bryan A. Plummer
AAMLVLM
488
35
0
01 Feb 2024
Discriminative Class Tokens for Text-to-Image Diffusion Models
Discriminative Class Tokens for Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Idan Schwartz
Vésteinn Snaebjarnarson
Hila Chefer
Robert Bamler
Serge Belongie
Lior Wolf
Sagie Benaim
505
13
0
30 Mar 2023
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
CLIP-Adapter: Better Vision-Language Models with Feature AdaptersInternational Journal of Computer Vision (IJCV), 2021
Shiyang Feng
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Zelong Li
Jiaming Song
Yu Qiao
VLMCLIP
1.4K
1,614
0
09 Oct 2021
1
Page 1 of 1