Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1312.6199
Cited By
Intriguing properties of neural networks
21 December 2013
Christian Szegedy
Wojciech Zaremba
Ilya Sutskever
Joan Bruna
D. Erhan
Ian Goodfellow
Rob Fergus
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Intriguing properties of neural networks"
50 / 189 papers shown
Title
How Do Diffusion Models Improve Adversarial Robustness?
Liu Yuezhang
Xue-Xin Wei
25
0
0
28 May 2025
Test-time augmentation improves efficiency in conformal prediction
Divya Shanmugam
H. Lu
Swami Sankaranarayanan
John Guttag
38
0
0
28 May 2025
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains
Jiawen Zhang
Zhenwei Zhang
Shun Zheng
Xumeng Wen
Jia Li
Jiang Bian
AI4TS
AAML
82
0
0
26 May 2025
Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning
Shiji Zhao
Qihui Zhu
Shukun Xiong
Shouwei Ruan
Yize Fan
Ranjie Duan
Qing Guo
Xingxing Wei
AAML
VLM
28
0
0
23 May 2025
Towards more transferable adversarial attack in black-box manner
Chun Tong Lei
Zhongliang Guo
Hon Chung Lee
Minh Quoc Duong
Chun Pong Lau
DiffM
AAML
115
0
0
23 May 2025
SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models
Hossein Khalili
Seongbin Park
Venkat Bollapragada
Nader Sehatbakhsh
AAML
45
0
0
22 May 2025
Adversarially Pretrained Transformers may be Universally Robust In-Context Learners
Soichiro Kumano
Hiroshi Kera
Toshihiko Yamasaki
AAML
52
0
0
20 May 2025
Approximation theory for 1-Lipschitz ResNets
Davide Murari
Takashi Furuya
Carola-Bibiane Schönlieb
29
0
0
17 May 2025
AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques
Aman Raj
Lakshit Arora
Sanjay Surendranath Girija
Shashank Kapoor
Dipen Pradhan
Ankit Shetgaonkar
60
0
0
13 May 2025
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification
Dongyoon Yang
Jihu Lee
Yongdai Kim
38
0
0
10 May 2025
PRUNE: A Patching Based Repair Framework for Certifiable Unlearning of Neural Networks
Xuzhao Li
Jingyi Wang
Xiaohan Yuan
Peixin Zhang
Zhan Qin
Peng Kuang
Kui Ren
AAML
MU
59
0
0
10 May 2025
The Spotlight Resonance Method: Resolving the Alignment of Embedded Activations
George Bird
16
0
0
09 May 2025
Adversarial Attacks in Multimodal Systems: A Practitioner's Survey
Shashank Kapoor
Sanjay Surendranath Girija
Lakshit Arora
Dipen Pradhan
Ankit Shetgaonkar
Aman Raj
AAML
80
0
0
06 May 2025
Data-Driven Falsification of Cyber-Physical Systems
Atanu Kundu
Sauvik Gon
Rajarshi Ray
AAML
AI4CE
46
3
0
06 May 2025
A New Approach to Backtracking Counterfactual Explanations: A Unified Causal Framework for Efficient Model Interpretability
Pouria Fatemi
Ehsan Sharifian
Mohammad Hossein Yassaee
50
0
0
05 May 2025
Impact Analysis of Inference Time Attack of Perception Sensors on Autonomous Vehicles
Hanlin Chen
Simin Chen
Wenyu Li
Wei Yang
Yiheng Feng
AAML
186
0
0
05 May 2025
Adversarial Robustness Analysis of Vision-Language Models in Medical Image Segmentation
Anjila Budathoki
Manish Dhakal
AAML
51
1
0
05 May 2025
Towards Model Resistant to Transferable Adversarial Examples via Trigger Activation
Yi Yu
Song Xia
Xun Lin
Chenqi Kong
Wenhan Yang
Shijian Lu
Yap-Peng Tan
Alex C. Kot
AAML
SILM
338
0
0
20 Apr 2025
Bridging the Theoretical Gap in Randomized Smoothing
Blaise Delattre
Paul Caillon
Quentin Barthélemy
Erwan Fagnou
Alexandre Allauzen
AAML
69
0
0
03 Apr 2025
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions
Giulia Marchiori Pietrosanti
Giulio Rossolini
Alessandro Biondi
Giorgio Buttazzo
AAML
129
0
0
02 Apr 2025
Geometric Median Matching for Robust k-Subset Selection from Noisy Data
Anish Acharya
Sujay Sanghavi
Alexandros G. Dimakis
Inderjit S Dhillon
AAML
90
0
0
01 Apr 2025
Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization
Yize Zhang
Yingzhe Xu
Junyu Shi
L. Zhang
Shengshan Hu
Minghui Li
Yanjun Zhang
AAML
91
1
0
17 Mar 2025
AMUN: Adversarial Machine UNlearning
A. Boroojeny
Hari Sundaram
Varun Chandrasekaran
MU
AAML
54
0
0
02 Mar 2025
Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models
Ruta Binkyte
Ivaxi Sheth
Zhijing Jin
Mohammad Havaei
Bernhard Schölkopf
Mario Fritz
321
1
0
28 Feb 2025
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation
Wenyuan Wu
Zheng Liu
Yong Chen
Chao Su
Dezhong Peng
Xu Wang
AAML
73
0
0
24 Feb 2025
SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks
Yue Gao
Ilia Shumailov
Kassem Fawaz
AAML
162
0
0
21 Feb 2025
Carefully Blending Adversarial Training, Purification, and Aggregation Improves Adversarial Robustness
Emanuele Ballarin
A. Ansuini
Luca Bortolussi
AAML
92
0
0
20 Feb 2025
Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks
Jon Vadillo
Roberto Santana
J. A. Lozano
Marta Z. Kwiatkowska
BDL
AAML
98
0
0
17 Feb 2025
ExplainReduce: Summarising local explanations via proxies
Lauri Seppäläinen
Mudong Guo
Kai Puolamäki
FAtt
58
0
0
17 Feb 2025
Universal Adversarial Attack on Aligned Multimodal LLMs
Temurbek Rahmatullaev
Polina Druzhinina
Matvey Mikhalchuk
Andrey Kuznetsov
Anton Razzhigaev
AAML
118
0
0
11 Feb 2025
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento
Chuan-Sheng Foo
See-Kiong Ng
AAML
140
0
0
07 Feb 2025
Detecting APT Malware Command and Control over HTTP(S) Using Contextual Summaries
Almuthanna Alageel
Sergio Maffeis
Imperial College London
51
2
0
07 Feb 2025
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via In-the-wild Cascading Flow Optimization
Yixiao Chen
Shikun Sun
Jianshu Li
Ruoyu Li
Zhe Li
Junliang Xing
AAML
139
0
0
04 Feb 2025
CoRPA: Adversarial Image Generation for Chest X-rays Using Concept Vector Perturbations and Generative Models
Amy Rafferty
Rishi Ramaesh
Ajitha Rajan
MedIm
AAML
74
0
0
04 Feb 2025
Dimensions underlying the representational alignment of deep neural networks with humans
F. Mahner
Lukas Muttenthaler
Umut Güçlü
M. Hebart
71
5
0
28 Jan 2025
Smoothed Embeddings for Robust Language Models
Ryo Hase
Md Rafi Ur Rashid
Ashley Lewis
Jing Liu
T. Koike-Akino
K. Parsons
Yanjie Wang
AAML
62
2
0
27 Jan 2025
Formal Verification of Markov Processes with Learned Parameters
Muhammad Maaz
Timothy C. Y. Chan
77
0
0
27 Jan 2025
Defending against Adversarial Malware Attacks on ML-based Android Malware Detection Systems
Ping He
Lorenzo Cavallaro
Shouling Ji
AAML
69
0
0
23 Jan 2025
Enhancing Robust Fairness via Confusional Spectral Regularization
Gaojie Jin
Sihao Wu
Jiaxu Liu
Tianjin Huang
Ronghui Mu
132
1
0
22 Jan 2025
Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis
Long Kiu Chung
Shreyas Kousik
351
0
0
22 Jan 2025
Geometric Median (GM) Matching for Robust Data Pruning
Anish Acharya
Inderjit S Dhillon
Sujay Sanghavi
AAML
82
0
0
20 Jan 2025
Provably Safeguarding a Classifier from OOD and Adversarial Samples: an Extreme Value Theory Approach
Nicolas Atienza
Christophe Labreuche
Johanne Cohen
Michele Sebag
OODD
AAML
289
0
0
20 Jan 2025
CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers
Matan Ben-Tov
Daniel Deutch
Nave Frost
Mahmood Sharif
AAML
147
0
0
20 Jan 2025
On the Hypomonotone Class of Variational Inequalities
Khaled Alomar
Tatjana Chavdarova
35
0
0
20 Jan 2025
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity
David Williams-King
Linh Le
Adam Oberman
Yoshua Bengio
AAML
63
0
0
19 Jan 2025
On the uncertainty principle of neural networks
Jun-Jie Zhang
Dong-xiao Zhang
Jian-Nan Chen
L. Pang
Deyu Meng
69
2
0
17 Jan 2025
MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework
Ping Guo
Cheng Gong
Xi Lin
Fei Liu
Zhichao Lu
Qingfu Zhang
Zhenkun Wang
AAML
57
0
0
13 Jan 2025
Most Influential Subset Selection: Challenges, Promises, and Beyond
Yuzheng Hu
Pingbang Hu
Han Zhao
Jiaqi W. Ma
TDI
152
4
0
10 Jan 2025
Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50
Umesh Yadav
Suman Niraula
Gaurav Kumar Gupta
Bicky Yadav
SILM
118
0
0
04 Jan 2025
Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models
Martin Pawelczyk
Lillian Sun
Zhenting Qi
Aounon Kumar
Himabindu Lakkaraju
73
1
0
03 Jan 2025
1
2
3
4
Next