ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.11455
  4. Cited By
Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training

Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training

17 February 2025
Fenghua Weng
Jian Lou
Jun Feng
Minlie Huang
Wenjie Wang
    AAML
ArXivPDFHTML

Papers citing "Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training"

1 / 1 papers shown
Title
Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots
Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots
Erfan Shayegani
G M Shahariar
Sara Abdali
Lei Yu
Nael B. Abu-Ghazaleh
Yue Dong
AAML
37
0
0
01 Apr 2025
1