Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.06622
Cited By
v1
v2 (latest)
Robustness Verification for Transformers
International Conference on Learning Representations (ICLR), 2020
16 February 2020
Zhouxing Shi
Huan Zhang
Kai-Wei Chang
Shiyu Huang
Cho-Jui Hsieh
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Robustness Verification for Transformers"
50 / 77 papers shown
Floating-Point Neural Network Verification at the Software Level
Edoardo Manino
B. Farias
R. Menezes
F. Shmarov
Lucas C. Cordeiro
152
1
0
27 Oct 2025
Verifying Memoryless Sequential Decision-making of Large Language Models
Dennis Gross
Helge Spieker
A. Gotlieb
LRM
179
0
0
08 Oct 2025
Influence-Guided Concolic Testing of Transformer Robustness
Chih-Duo Hong
Yu Wang
Yao-Chen Chang
Fang Yu
158
0
0
28 Sep 2025
Bounded PCTL Model Checking of Large Language Model Outputs
Dennis Gross
Helge Spieker
A. Gotlieb
178
0
0
23 Sep 2025
Distributionally Robust Safety Verification of Neural Networks via Worst-Case CVaR
Masako Kishida
AAML
145
0
0
22 Sep 2025
Exact Verification of Graph Neural Networks with Incremental Constraint Solving
Minghao Liu
Chia-Hsuan Lu
Marta Kwiatkowska
AAML
241
2
0
12 Aug 2025
The Counting Power of Transformers
Marco Sälzer
Chris Köcher
Anthony Widjaja Lin
Georg Zetzsche
Anthony Widjaja Lin
430
0
0
16 May 2025
SoundnessBench: A Soundness Benchmark for Neural Network Verifiers
Xingjian Zhou
Keyi Shen
Andy Xu
Hongji Xu
Cho-Jui Hsieh
Huan Zhang
Zhouxing Shi
AAML
407
1
0
04 Dec 2024
Neural Network Verification with Branch-and-Bound for General Nonlinearities
Zhouxing Shi
Qirui Jin
Zico Kolter
Suman Jana
Cho-Jui Hsieh
Huan Zhang
569
42
0
31 May 2024
Transformer Encoder Satisfiability: Complexity and Impact on Formal Reasoning
Marco Sälzer
Eric Alsmann
Martin Lange
LRM
274
0
0
28 May 2024
A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness
Yuhao Zhang
Aws Albarghouthi
Loris Dántoni
OffRL
151
0
0
27 May 2024
GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam
Edoardo Serra
Sushil Jajodia
AAML
210
0
0
17 Apr 2024
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions
Runhao Zeng
Xiaoyong Chen
Jiaming Liang
Huisi Wu
Guangzhong Cao
Yong Guo
AAML
402
13
0
29 Mar 2024
Transformers Learn Low Sensitivity Functions: Investigations and Implications
International Conference on Learning Representations (ICLR), 2024
Bhavya Vasudeva
Deqing Fu
Tianyi Zhou
Elliott Kau
Youqi Huang
Willie Neiswanger
524
2
0
11 Mar 2024
Certifying Knowledge Comprehension in LLMs
Isha Chaudhary
Vedaant V. Jain
Gagandeep Singh
390
0
0
24 Feb 2024
Correctness Verification of Neural Networks Approximating Differential Equations
Petros Ellinas
Rahul Nellikkath
Ignasi Ventura
Jochen Stiasny
Spyros Chatzivasileiadis
256
3
0
12 Feb 2024
Robustness Verification for Knowledge-Based Logic of Risky Driving Scenes
Xia Wang
Anda Liang
Jonathan Sprinkle
Taylor T. Johnson
207
7
0
27 Dec 2023
Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention
Network and Distributed System Security Symposium (NDSS), 2023
Lujia Shen
Yuwen Pu
R. Beyah
Changjiang Li
Xuhong Zhang
Chunpeng Ge
Ting Wang
AAML
236
12
0
29 Nov 2023
STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers
Daqian Shao
Lukas Fesser
Marta Z. Kwiatkowska
244
1
0
28 Nov 2023
Fooling the Textual Fooler via Randomizing Latent Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Duy C. Hoang
Quang H. Nguyen
Saurav Manchanda
MinLong Peng
Kok-Seng Wong
Khoa D. Doan
SILM
AAML
332
2
0
02 Oct 2023
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Bochuan Cao
Yu Cao
Lu Lin
Jinghui Chen
AAML
530
222
0
18 Sep 2023
Certified Robustness for Large Language Models with Self-Denoising
Zhen Zhang
Guanhua Zhang
Bairu Hou
Wenqi Fan
Qing Li
Sijia Liu
Yang Zhang
Shiyu Chang
362
26
0
14 Jul 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Artificial Intelligence Review (AIR), 2023
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
429
171
0
19 May 2023
Rate-Adaptive Coding Mechanism for Semantic Communications With Multi-Modal Data
IEEE Transactions on Communications (IEEE Trans. Commun.), 2023
Yangshuo He
Guanding Yu
Yunlong Cai
251
43
0
18 May 2023
Efficient Error Certification for Physics-Informed Neural Networks
International Conference on Machine Learning (ICML), 2023
Francisco Eiras
Adel Bibi
Rudy Bunel
Krishnamurthy Dvijotham
Juil Sock
M. P. Kumar
PINN
430
7
0
17 May 2023
ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification
Marco Casadio
Luca Arnaboldi
M. Daggitt
Omri Isac
Tanvi Dinkar
Daniel Kienitz
Verena Rieser
Ekaterina Komendantskaya
325
6
0
06 May 2023
Robustifying Token Attention for Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2023
Yong Guo
David Stutz
Bernt Schiele
ViT
485
36
0
20 Mar 2023
Convex Bounds on the Softmax Function with Applications to Robustness Verification
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV
AAML
173
15
0
03 Mar 2023
Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend
Ning Lu
Shengcai Liu
Zhirui Zhang
Qi. Wang
Haifeng Liu
Jiaheng Zhang
AAML
386
15
0
06 Feb 2023
TextShield: Beyond Successfully Detecting Adversarial Sentences in Text Classification
International Conference on Learning Representations (ICLR), 2023
Lingfeng Shen
Ze Zhang
Haiyun Jiang
Ying-Cong Chen
AAML
468
10
0
03 Feb 2023
AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs
Helene Orsini
Hongyan Bao
Yujun Zhou
Xiangrui Xu
Yufei Han
Longyang Yi
Wei Wang
Xin Gao
Xiangliang Zhang
AAML
314
2
0
13 Dec 2022
Impact of Adversarial Training on Robustness and Generalizability of Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Enes Altinisik
Hassan Sajjad
Husrev Taha Sencar
Safa Messaoud
Sanjay Chawla
AAML
247
17
0
10 Nov 2022
Can Transformers Reason in Fragments of Natural Language?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Viktor Schlegel
Kamen V. Pavlov
Ian Pratt-Hartmann
LRM
ReLM
262
9
0
10 Nov 2022
Textual Manifold-based Defense Against Natural Language Adversarial Examples
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
D. M. Nguyen
Anh Tuan Luu
AAML
340
28
0
05 Nov 2022
Localized Randomized Smoothing for Collective Robustness Certification
International Conference on Learning Representations (ICLR), 2022
Jan Schuchardt
Thomas Wollschläger
Aleksandar Bojchevski
Stephan Günnemann
AAML
292
12
0
28 Oct 2022
Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jiahao Zhao
Wenji Mao
DRL
OOD
247
7
0
26 Oct 2022
ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty Estimation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Fan Yin
Yao Li
Cho-Jui Hsieh
Kai-Wei Chang
AAML
328
4
0
22 Oct 2022
TCAB: A Large-Scale Text Classification Attack Benchmark
Kalyani Asthana
Zhouhang Xie
Wencong You
Adam Noack
Jonathan Brophy
Sameer Singh
Daniel Lowd
351
3
0
21 Oct 2022
Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation
Neural Information Processing Systems (NeurIPS), 2022
Zhouxing Shi
Yihan Wang
Huan Zhang
Zico Kolter
Cho-Jui Hsieh
439
61
0
13 Oct 2022
3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models
Machine-mediated learning (ML), 2022
Ronghui Mu
Wenjie Ruan
Leandro Soriano Marcolino
Q. Ni
3DPC
331
12
0
15 Jul 2022
Adversarial Robustness of Deep Neural Networks: A Survey from a Formal Verification Perspective
IEEE Transactions on Dependable and Secure Computing (TDSC), 2022
Mark Huasong Meng
Guangdong Bai
Sin Gee Teo
Zhe Hou
Yan Xiao
Yun Lin
Jin Song Dong
AAML
282
68
0
24 Jun 2022
Why Robust Natural Language Understanding is a Challenge
Marco Casadio
Ekaterina Komendantskaya
Verena Rieser
M. Daggitt
Daniel Kienitz
Luca Arnaboldi
Wen Kokke
OOD
AAML
240
0
0
21 Jun 2022
CAISAR: A platform for Characterizing Artificial Intelligence Safety and Robustness
Julien Girard-Satabin
Michele Alberti
F. Bobot
Zakaria Chihani
Augustin Lemesle
380
12
0
07 Jun 2022
Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Hanjie Chen
Yangfeng Ji
OOD
AAML
VLM
275
29
0
23 Mar 2022
On Robust Prefix-Tuning for Text Classification
International Conference on Learning Representations (ICLR), 2022
Zonghan Yang
Yang Liu
VLM
270
23
0
19 Mar 2022
A Survey of Adversarial Defences and Robustness in NLP
Shreyansh Goyal
Sumanth Doddapaneni
Mitesh M.Khapra
B. Ravindran
AAML
646
36
0
12 Mar 2022
Robust Textual Embedding against Word-level Adversarial Attacks
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Yichen Yang
Xiaosen Wang
Kun He
AAML
258
21
0
28 Feb 2022
Are Transformers More Robust? Towards Exact Robustness Verification for Transformers
International Conference on Computer Safety, Reliability, and Security (SAFECOMP), 2022
B. Liao
Chih-Hong Cheng
Hasan Esen
Alois Knoll
AAML
315
3
0
08 Feb 2022
LinSyn: Synthesizing Tight Linear Bounds for Arbitrary Neural Network Activation Functions
International Conference on Tools and Algorithms for Construction and Analysis of Systems (TACAS), 2022
Brandon Paulsen
Chao Wang
AAML
262
19
0
31 Jan 2022
Identifying Adversarial Attacks on Text Classifiers
Zhouhang Xie
Jonathan Brophy
Adam Noack
Wencong You
Kalyani Asthana
Carter Perkins
Sabrina Reis
Sameer Singh
Daniel Lowd
AAML
186
11
0
21 Jan 2022
1
2
Next
Page 1 of 2