On Evaluating Adversarial Robustness

18 February 2019

Wieland Brendel

Papers citing "On Evaluating Adversarial Robustness"

50 / 151 papers shown

Title
What Is AI Safety? What Do We Want It to Be? Jacqueline Harding Cameron Domenico Kirk-Giannini 68 0 0 05 May 2025
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation Vaidehi Patil Yi-Lin Sung Peter Hase Jie Peng Tianlong Chen Mohit Bansal AAML MU 83 3 0 01 May 2025
OET: Optimization-based prompt injection Evaluation Toolkit Jinsheng Pan Xiaogeng Liu Chaowei Xiao AAML 69 0 0 01 May 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments Yun Qu W. Wang Yixiu Mao Yiqin Lv Xiangyang Ji TTA 90 0 0 27 Apr 2025
Manipulating Multimodal Agents via Cross-Modal Prompt Injection Le Wang Zonghao Ying Tianyuan Zhang Siyuan Liang Shengshan Hu Mingchuan Zhang A. Liu Xianglong Liu AAML 33 1 0 19 Apr 2025
Decoding FL Defenses: Systemization, Pitfalls, and Remedies M. A. Khan Virat Shejwalkar Yasra Chandio Amir Houmansadr Fatima M. Anwar AAML 38 0 0 03 Feb 2025
The Pitfalls of "Security by Obscurity" And What They Mean for Transparent AI Peter Hall Olivia Mundahl Sunoo Park 74 0 0 30 Jan 2025
The Curious Case of Arbitrariness in Machine Learning Prakhar Ganesh Afaf Taik G. Farnadi 59 2 0 28 Jan 2025
CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers Matan Ben-Tov Daniel Deutch Nave Frost Mahmood Sharif AAML 107 0 0 20 Jan 2025
Challenging reaction prediction models to generalize to novel chemistry John Bradshaw Anji Zhang Babak Mahjour David E. Graff Marwin H. S. Segler Connor W. Coley 40 1 0 11 Jan 2025
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation Rohith Peddi Saurabh Ayush Abhay Shrivastava Parag Singla Vibhav Gogate 82 0 0 20 Nov 2024
The Effects of Multi-Task Learning on ReLU Neural Network Functions Julia B. Nakhleh Joseph Shenouda Robert D. Nowak 34 1 0 29 Oct 2024
Do Unlearning Methods Remove Information from Language Model Weights? Aghyad Deeb Fabien Roger AAML MU 40 14 0 11 Oct 2024
Improving Adversarial Robustness for 3D Point Cloud Recognition at Test-Time through Purified Self-Training Jinpeng Lin Xulei Yang Tianrui Li Xun Xu 3DPC 33 0 0 23 Sep 2024
2DSig-Detect: a semi-supervised framework for anomaly detection on image data using 2D-signatures Xinheng Xie Kureha Yamaguchi Margaux Leblanc Simon Malzard Varun Chhabra Victoria Nockles Yue-bo Wu AAML 37 0 0 08 Sep 2024
Adversarial Robustification via Text-to-Image Diffusion Models Daewon Choi Jongheon Jeong Huiwon Jang Jinwoo Shin DiffM 41 1 0 26 Jul 2024
Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors Peter Lorenz Mario Fernandez Jens Müller Ullrich Kothe AAML 78 1 0 21 Jun 2024
Leakage-Resilient and Carbon-Neutral Aggregation Featuring the Federated AI-enabled Critical Infrastructure Zehang Deng Ruoxi Sun Minhui Xue Sheng Wen S. Çamtepe Surya Nepal Yang Xiang 39 1 0 24 May 2024
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples Antonio Emanuele Cinà Jérôme Rony Maura Pintor Luca Demetrio Ambra Demontis Battista Biggio Ismail Ben Ayed Fabio Roli ELM AAML SILM 44 6 0 30 Apr 2024
Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks Yunzhen Feng Tim G. J. Rudner Nikolaos Tsilivis Julia Kempe AAML BDL 43 1 0 27 Apr 2024
A Survey of Neural Network Robustness Assessment in Image Recognition Jie Wang Jun Ai Minyan Lu Haoran Su Dan Yu Yutao Zhang Junda Zhu Jingyu Liu AAML 30 3 0 12 Apr 2024
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko Francesco Croce Nicolas Flammarion AAML 83 159 0 02 Apr 2024
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That? Egor Zverev Sahar Abdelnabi Soroush Tabesh Mario Fritz Christoph H. Lampert 50 19 0 11 Mar 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit Yao Wan Yang He Zhangqian Bi Jianguo Zhang Hongyu Zhang Yulei Sui Guandong Xu Hai Jin Philip S. Yu 30 20 0 30 Dec 2023
Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness Ambar Pal Huaijin Hao René Vidal 26 8 0 28 Sep 2023
Certified Robust Models with Slack Control and Large Lipschitz Constants M. Losch David Stutz Bernt Schiele Mario Fritz 14 4 0 12 Sep 2023
HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds Hejia Geng Peng Li AAML 34 3 0 20 Aug 2023
Robustified ANNs Reveal Wormholes Between Human Category Percepts Guy Gaziv Michael J. Lee J. DiCarlo AAML 24 6 0 14 Aug 2023
Training on Foveated Images Improves Robustness to Adversarial Attacks Muhammad Ahmed Shah Bhiksha Raj AAML 30 3 0 01 Aug 2023
A LLM Assisted Exploitation of AI-Guardian Nicholas Carlini ELM SILM 24 15 0 20 Jul 2023
On building machine learning pipelines for Android malware detection: a procedural survey of practices, challenges and opportunities Masoud Mehrabi Koushki I. Abualhaol Anandharaju Durai Raju Yang Zhou Ronnie Salvador Giagone Huang Shengqiang 18 11 0 12 Jun 2023
SoK: Pragmatic Assessment of Machine Learning for Network Intrusion Detection Giovanni Apruzzese P. Laskov J. Schneider 36 24 0 30 Apr 2023
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking Chang-Shu Liu Yinpeng Dong Wenzhao Xiang X. Yang Hang Su Junyi Zhu YueFeng Chen Yuan He H. Xue Shibao Zheng OOD VLM AAML 33 72 0 28 Feb 2023
Measuring Equality in Machine Learning Security Defenses: A Case Study in Speech Recognition Luke E. Richards Edward Raff Cynthia Matuszek AAML 16 2 0 17 Feb 2023
On the Efficacy of Metrics to Describe Adversarial Attacks Tommaso Puccetti T. Zoppi Andrea Ceccarelli AAML 17 2 0 30 Jan 2023
Benchmarking Robustness to Adversarial Image Obfuscations Florian Stimberg Ayan Chakrabarti Chun-Ta Lu Hussein Hazimeh Otilia Stretcu ... Merve Kaya Cyrus Rashtchian Ariel Fuxman Mehmet Tek Sven Gowal AAML 29 10 0 30 Jan 2023
Selecting Models based on the Risk of Damage Caused by Adversarial Attacks Jona Klemenc Holger Trittenbach AAML 24 1 0 28 Jan 2023
"Real Attackers Don't Compute Gradients": Bridging the Gap Between Adversarial ML Research and Practice Giovanni Apruzzese Hyrum S. Anderson Savino Dambra D. Freeman Fabio Pierazzi Kevin A. Roundy AAML 31 75 0 29 Dec 2022
Confidence-aware Training of Smoothed Classifiers for Certified Robustness Jongheon Jeong Seojin Kim Jinwoo Shin AAML 21 7 0 18 Dec 2022
On Evaluating Adversarial Robustness of Chest X-ray Classification: Pitfalls and Best Practices Salah Ghamizi Maxime Cordy Michail Papadakis Yves Le Traon OOD 11 2 0 15 Dec 2022
Towards Good Practices in Evaluating Transfer Adversarial Attacks Zhengyu Zhao Hanwei Zhang Renjue Li R. Sicre Laurent Amsaleg Michael Backes AAML 24 20 0 17 Nov 2022
Defending with Errors: Approximate Computing for Robustness of Deep Neural Networks Amira Guesmi Ihsen Alouani Khaled N. Khasawneh M. Baklouti T. Frikha Mohamed Abid Nael B. Abu-Ghazaleh AAML OOD 22 2 0 02 Nov 2022
Multi-view Representation Learning from Malware to Defend Against Adversarial Variants J. Hu Mohammadreza Ebrahimi Weifeng Li Xin Li Hsinchun Chen AAML 13 2 0 25 Oct 2022
Causal Information Bottleneck Boosts Adversarial Robustness of Deep Neural Network Hua Hua Jun Yan Xi Fang Weiquan Huang Huilin Yin Wancheng Ge AAML 25 1 0 25 Oct 2022
Scaling Laws for Reward Model Overoptimization Leo Gao John Schulman Jacob Hilton ALM 38 473 0 19 Oct 2022
On the Adversarial Robustness of Mixture of Experts J. Puigcerver Rodolphe Jenatton C. Riquelme Pranjal Awasthi Srinadh Bhojanapalli OOD AAML MoE 37 18 0 19 Oct 2022
Scaling Adversarial Training to Large Perturbation Bounds Sravanti Addepalli Samyak Jain Gaurang Sriramanan R. Venkatesh Babu AAML 30 22 0 18 Oct 2022
Strength-Adaptive Adversarial Training Chaojian Yu Dawei Zhou Li Shen Jun Yu Bo Han Mingming Gong Nannan Wang Tongliang Liu OOD 17 2 0 04 Oct 2022
A Closer Look at Evaluating the Bit-Flip Attack Against Deep Neural Networks Kevin Hector Mathieu Dumont Pierre-Alain Moëllic J. Dutertre AAML 19 4 0 28 Sep 2022
AdvDO: Realistic Adversarial Attacks for Trajectory Prediction Yulong Cao Chaowei Xiao Anima Anandkumar Danfei Xu Marco Pavone AAML 30 62 0 19 Sep 2022