Intriguing properties of neural networks

21 December 2013

Joan Bruna

Papers citing "Intriguing properties of neural networks"

How Do Diffusion Models Improve Adversarial Robustness? Liu Yuezhang Xue-Xin Wei 25 0 0 28 May 2025
Test-time augmentation improves efficiency in conformal prediction Divya Shanmugam H. Lu Swami Sankaranarayanan John Guttag 38 0 0 28 May 2025
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains Jiawen Zhang Zhenwei Zhang Shun Zheng Xumeng Wen Jia Li Jiang Bian AI4TS AAML 82 0 0 26 May 2025
Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning Shiji Zhao Qihui Zhu Shukun Xiong Shouwei Ruan Yize Fan Ranjie Duan Qing Guo Xingxing Wei AAML VLM 28 0 0 23 May 2025
Towards more transferable adversarial attack in black-box manner Chun Tong Lei Zhongliang Guo Hon Chung Lee Minh Quoc Duong Chun Pong Lau DiffM AAML 115 0 0 23 May 2025
SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models Hossein Khalili Seongbin Park Venkat Bollapragada Nader Sehatbakhsh AAML 45 0 0 22 May 2025
Adversarially Pretrained Transformers may be Universally Robust In-Context Learners Soichiro Kumano Hiroshi Kera Toshihiko Yamasaki AAML 52 0 0 20 May 2025
Approximation theory for 1-Lipschitz ResNets Davide Murari Takashi Furuya Carola-Bibiane Schönlieb 29 0 0 17 May 2025
AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques Aman Raj Lakshit Arora Sanjay Surendranath Girija Shashank Kapoor Dipen Pradhan Ankit Shetgaonkar 60 0 0 13 May 2025
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification Dongyoon Yang Jihu Lee Yongdai Kim 38 0 0 10 May 2025
PRUNE: A Patching Based Repair Framework for Certifiable Unlearning of Neural Networks Xuzhao Li Jingyi Wang Xiaohan Yuan Peixin Zhang Zhan Qin Peng Kuang Kui Ren AAML MU 59 0 0 10 May 2025
The Spotlight Resonance Method: Resolving the Alignment of Embedded Activations George Bird 16 0 0 09 May 2025
Adversarial Attacks in Multimodal Systems: A Practitioner's Survey Shashank Kapoor Sanjay Surendranath Girija Lakshit Arora Dipen Pradhan Ankit Shetgaonkar Aman Raj AAML 80 0 0 06 May 2025
Data-Driven Falsification of Cyber-Physical Systems Atanu Kundu Sauvik Gon Rajarshi Ray AAML AI4CE 46 3 0 06 May 2025
A New Approach to Backtracking Counterfactual Explanations: A Unified Causal Framework for Efficient Model Interpretability Pouria Fatemi Ehsan Sharifian Mohammad Hossein Yassaee 50 0 0 05 May 2025
Impact Analysis of Inference Time Attack of Perception Sensors on Autonomous Vehicles Hanlin Chen Simin Chen Wenyu Li Wei Yang Yiheng Feng AAML 186 0 0 05 May 2025
Adversarial Robustness Analysis of Vision-Language Models in Medical Image Segmentation Anjila Budathoki Manish Dhakal AAML 51 1 0 05 May 2025
Towards Model Resistant to Transferable Adversarial Examples via Trigger Activation Yi Yu Song Xia Xun Lin Chenqi Kong Wenhan Yang Shijian Lu Yap-Peng Tan Alex C. Kot AAML SILM 338 0 0 20 Apr 2025
Bridging the Theoretical Gap in Randomized Smoothing Blaise Delattre Paul Caillon Quentin Barthélemy Erwan Fagnou Alexandre Allauzen AAML 69 0 0 03 Apr 2025
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions Giulia Marchiori Pietrosanti Giulio Rossolini Alessandro Biondi Giorgio Buttazzo AAML 129 0 0 02 Apr 2025
Geometric Median Matching for Robust k-Subset Selection from Noisy Data Anish Acharya Sujay Sanghavi Alexandros G. Dimakis Inderjit S Dhillon AAML 90 0 0 01 Apr 2025
Improving Generalization of Universal Adversarial Perturbation via Dynamic Maximin Optimization Yize Zhang Yingzhe Xu Junyu Shi L. Zhang Shengshan Hu Minghui Li Yanjun Zhang AAML 91 1 0 17 Mar 2025
AMUN: Adversarial Machine UNlearning A. Boroojeny Hari Sundaram Varun Chandrasekaran MU AAML 54 0 0 02 Mar 2025
Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models Ruta Binkyte Ivaxi Sheth Zhijing Jin Mohammad Havaei Bernhard Schölkopf Mario Fritz 321 1 0 28 Feb 2025
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation Wenyuan Wu Zheng Liu Yong Chen Chao Su Dezhong Peng Xu Wang AAML 73 0 0 24 Feb 2025
SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks Yue Gao Ilia Shumailov Kassem Fawaz AAML 162 0 0 21 Feb 2025
Carefully Blending Adversarial Training, Purification, and Aggregation Improves Adversarial Robustness Emanuele Ballarin A. Ansuini Luca Bortolussi AAML 92 0 0 20 Feb 2025
Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks Jon Vadillo Roberto Santana J. A. Lozano Marta Z. Kwiatkowska BDL AAML 98 0 0 17 Feb 2025
ExplainReduce: Summarising local explanations via proxies Lauri Seppäläinen Mudong Guo Kai Puolamäki FAtt 58 0 0 17 Feb 2025
Universal Adversarial Attack on Aligned Multimodal LLMs Temurbek Rahmatullaev Polina Druzhinina Matvey Mikhalchuk Andrey Kuznetsov Anton Razzhigaev AAML 118 0 0 11 Feb 2025
Confidence Elicitation: A New Attack Vector for Large Language Models Brian Formento Chuan-Sheng Foo See-Kiong Ng AAML 140 0 0 07 Feb 2025
Detecting APT Malware Command and Control over HTTP(S) Using Contextual Summaries Almuthanna Alageel Sergio Maffeis Imperial College London 51 2 0 07 Feb 2025
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via In-the-wild Cascading Flow Optimization Yixiao Chen Shikun Sun Jianshu Li Ruoyu Li Zhe Li Junliang Xing AAML 139 0 0 04 Feb 2025
CoRPA: Adversarial Image Generation for Chest X-rays Using Concept Vector Perturbations and Generative Models Amy Rafferty Rishi Ramaesh Ajitha Rajan MedIm AAML 74 0 0 04 Feb 2025
Dimensions underlying the representational alignment of deep neural networks with humans F. Mahner Lukas Muttenthaler Umut Güçlü M. Hebart 71 5 0 28 Jan 2025
Smoothed Embeddings for Robust Language Models Ryo Hase Md Rafi Ur Rashid Ashley Lewis Jing Liu T. Koike-Akino K. Parsons Yanjie Wang AAML 62 2 0 27 Jan 2025
Formal Verification of Markov Processes with Learned Parameters Muhammad Maaz Timothy C. Y. Chan 77 0 0 27 Jan 2025
Defending against Adversarial Malware Attacks on ML-based Android Malware Detection Systems Ping He Lorenzo Cavallaro Shouling Ji AAML 69 0 0 23 Jan 2025
Enhancing Robust Fairness via Confusional Spectral Regularization Gaojie Jin Sihao Wu Jiaxu Liu Tianjin Huang Ronghui Mu 132 1 0 22 Jan 2025
Provably-Safe Neural Network Training Using Hybrid Zonotope Reachability Analysis Long Kiu Chung Shreyas Kousik 351 0 0 22 Jan 2025
Geometric Median (GM) Matching for Robust Data Pruning Anish Acharya Inderjit S Dhillon Sujay Sanghavi AAML 82 0 0 20 Jan 2025
Provably Safeguarding a Classifier from OOD and Adversarial Samples: an Extreme Value Theory Approach Nicolas Atienza Christophe Labreuche Johanne Cohen Michele Sebag OODD AAML 289 0 0 20 Jan 2025
CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers Matan Ben-Tov Daniel Deutch Nave Frost Mahmood Sharif AAML 147 0 0 20 Jan 2025
On the Hypomonotone Class of Variational Inequalities Khaled Alomar Tatjana Chavdarova 35 0 0 20 Jan 2025
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity David Williams-King Linh Le Adam Oberman Yoshua Bengio AAML 63 0 0 19 Jan 2025
On the uncertainty principle of neural networks Jun-Jie Zhang Dong-xiao Zhang Jian-Nan Chen L. Pang Deyu Meng 69 2 0 17 Jan 2025
MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework Ping Guo Cheng Gong Xi Lin Fei Liu Zhichao Lu Qingfu Zhang Zhenkun Wang AAML 57 0 0 13 Jan 2025
Most Influential Subset Selection: Challenges, Promises, and Beyond Yuzheng Hu Pingbang Hu Han Zhao Jiaqi W. Ma TDI 152 4 0 10 Jan 2025
Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 Umesh Yadav Suman Niraula Gaurav Kumar Gupta Bicky Yadav SILM 118 0 0 04 Jan 2025
Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models Martin Pawelczyk Lillian Sun Zhenting Qi Aounon Kumar Himabindu Lakkaraju 73 1 0 03 Jan 2025