1805.10265
Training verified learners with learned verifiers
25 May 2018
Krishnamurthy Dvijotham
Sven Gowal
Robert Stanforth
Relja Arandjelović
Brendan O'Donoghue
J. Uesato
Pushmeet Kohli
OOD
Papers citing
"Training verified learners with learned verifiers"
50 / 70 papers shown
Title
TriGuard: Testing Model Safety with Attribution Entropy, Verification, and Drift
Dipesh Tharu Mahato
Rohan Poudel
Pramod Dhungana
AAML
15
0
0
17 Jun 2025
Certifying LLM Safety against Adversarial Prompting
Aounon Kumar
Chirag Agarwal
Suraj Srinivas
Aaron Jiaxun Li
Soheil Feizi
Himabindu Lakkaraju
AAML
139
196
0
06 Sep 2023
When Deep Learning Meets Polyhedral Theory: A Survey
Joey Huchette
Gonzalo Muñoz
Thiago Serra
Calvin Tsay
AI4CE
160
37
0
29 Apr 2023
RS-Del: Edit Distance Robustness Certificates for Sequence Classifiers via Randomized Deletion
Zhuoqun Huang
Neil G. Marchant
Keane Lucas
Lujo Bauer
O. Ohrimenko
Benjamin I. P. Rubinstein
AAML
94
17
0
31 Jan 2023
Probabilistic Inverse Modeling: An Application in Hydrology
Somya Sharma
Rahul Ghosh
Arvind Renganathan
Xiang Li
Snigdhansu Chatterjee
John L. Nieber
C. Duffy
Vipin Kumar
AI4CE
84
1
0
12 Oct 2022
A Scalable, Interpretable, Verifiable & Differentiable Logic Gate Convolutional Neural Network Architecture From Truth Tables
Adrien Benamira
Tristan Guérand
Thomas Peyrin
Trevor Yap
Bryan Hooi
55
2
0
18 Aug 2022
A Unified View of SDP-based Neural Network Verification through Completely Positive Programming
Robin Brown
Edward Schmerling
Navid Azizan
Marco Pavone
AAML
71
17
0
06 Mar 2022
Learning Neural Networks under Input-Output Specifications
Z. Abdeen
He Yin
V. Kekatos
Ming Jin
57
8
0
23 Feb 2022
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Boxin Wang
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Bo Li
VLM
ELM
AAML
78
226
0
04 Nov 2021
When Does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
Lijie Fan
Sijia Liu
Pin-Yu Chen
Gaoyuan Zhang
Chuang Gan
AAML
VLM
95
124
0
01 Nov 2021
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo
Yanjun Qi
AAML
195
127
0
01 Sep 2021
Neural Network Branch-and-Bound for Neural Network Verification
Florian Jaeckle
Jingyue Lu
M. P. Kumar
56
8
0
27 Jul 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
104
58
0
21 Jun 2021
Taxonomy of Machine Learning Safety: A Survey and Primer
Sina Mohseni
Haotao Wang
Zhiding Yu
Chaowei Xiao
Zhangyang Wang
J. Yadawa
84
32
0
09 Jun 2021
Generating Adversarial Examples with Graph Neural Networks
Florian Jaeckle
M. P. Kumar
GAN
AAML
51
21
0
30 May 2021
Defending Pre-trained Language Models from Adversarial Word Substitutions Without Performance Sacrifice
Rongzhou Bao
Jiayi Wang
Hai Zhao
AAML
48
43
0
30 May 2021
Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation
Chong Zhang
Jieyu Zhao
Huan Zhang
Kai-Wei Chang
Cho-Jui Hsieh
AAML
66
10
0
12 Apr 2021
Achieving Model Robustness through Discrete Adversarial Training
Maor Ivgi
Jonathan Berant
AAML
71
27
0
11 Apr 2021
Fast Certified Robust Training with Short Warmup
Zhouxing Shi
Yihan Wang
Huan Zhang
Jinfeng Yi
Cho-Jui Hsieh
AAML
85
57
0
31 Mar 2021
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
Ren Wang
Kaidi Xu
Sijia Liu
Pin-Yu Chen
Tsui-Wei Weng
Chuang Gan
Meng Wang
AAML
97
47
0
20 Feb 2021
Center Smoothing: Certified Robustness for Networks with Structured Outputs
Aounon Kumar
Tom Goldstein
OOD
AAML
UQCV
78
19
0
19 Feb 2021
Fast Training of Provably Robust Neural Networks by SingleProp
Akhilan Boopathy
Tsui-Wei Weng
Sijia Liu
Pin-Yu Chen
Gaoyuan Zhang
Luca Daniel
AAML
54
7
0
01 Feb 2021
Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers
Kaidi Xu
Huan Zhang
Shiqi Wang
Yihan Wang
Suman Jana
Xue Lin
Cho-Jui Hsieh
115
188
0
27 Nov 2020
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin Wang
Shuohang Wang
Yu Cheng
Zhe Gan
R. Jia
Bo Li
Jingjing Liu
AAML
215
116
0
05 Oct 2020
Bag of Tricks for Adversarial Training
Tianyu Pang
Xiao Yang
Yinpeng Dong
Hang Su
Jun Zhu
AAML
86
270
0
01 Oct 2020
Certifying Confidence via Randomized Smoothing
Aounon Kumar
Alexander Levine
Soheil Feizi
Tom Goldstein
UQCV
93
40
0
17 Sep 2020
SoK: Certified Robustness for Deep Neural Networks
Linyi Li
Tao Xie
Bo Li
AAML
123
131
0
09 Sep 2020
Adversarial robustness via robust low rank representations
Pranjal Awasthi
Himanshu Jain
A. S. Rawat
Aravindan Vijayaraghavan
AAML
51
23
0
13 Jul 2020
Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
Yi Zhou
Xiaoqing Zheng
Cho-Jui Hsieh
Kai-Wei Chang
Xuanjing Huang
SILM
103
48
0
20 Jun 2020
Debona: Decoupled Boundary Network Analysis for Tighter Bounds and Faster Adversarial Robustness Proofs
Christopher Brix
T. Noll
AAML
61
10
0
16 Jun 2020
Extensions and limitations of randomized smoothing for robustness guarantees
Jamie Hayes
AAML
51
21
0
07 Jun 2020
Second-Order Provable Defenses against Adversarial Attacks
Sahil Singla
Soheil Feizi
AAML
71
60
0
01 Jun 2020
Enhancing Certified Robustness via Smoothed Weighted Ensembling
Chizhou Liu
Yunzhen Feng
Ranran Wang
Bin Dong
AAML
77
12
0
19 May 2020
Efficient Exact Verification of Binarized Neural Networks
Kai Jia
Martin Rinard
AAML
MQ
46
59
0
07 May 2020
Robustness Certification of Generative Models
M. Mirman
Timon Gehr
Martin Vechev
AAML
70
21
0
30 Apr 2020
Certifiable Robustness to Adversarial State Uncertainty in Deep Reinforcement Learning
Michael Everett
Björn Lütjens
Jonathan P. How
AAML
53
42
0
11 Apr 2020
Sample-Specific Output Constraints for Neural Networks
Mathis Brosowsky
Olaf Dünkel
Daniel Slieter
Marius Zöllner
AILaw
PINN
66
10
0
23 Mar 2020
Exploiting Verified Neural Networks via Floating Point Numerical Error
Kai Jia
Martin Rinard
AAML
97
37
0
06 Mar 2020
Denoised Smoothing: A Provable Defense for Pretrained Classifiers
Hadi Salman
Mingjie Sun
Greg Yang
Ashish Kapoor
J. Zico Kolter
94
23
0
04 Mar 2020
Towards Certifiable Adversarial Sample Detection
Ilia Shumailov
Yiren Zhao
Robert D. Mullins
Ross J. Anderson
AAML
43
13
0
20 Feb 2020
Randomized Smoothing of All Shapes and Sizes
Greg Yang
Tony Duan
J. E. Hu
Hadi Salman
Ilya P. Razenshteyn
Jungshian Li
AAML
94
216
0
19 Feb 2020
T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack
Boxin Wang
Hengzhi Pei
Boyuan Pan
Han Liu
Shuohang Wang
Bo Li
AAML
61
6
0
22 Dec 2019
Certified Robustness for Top-k Predictions against Adversarial Perturbations via Randomized Smoothing
Jinyuan Jia
Xiaoyu Cao
Binghui Wang
Neil Zhenqiang Gong
AAML
60
95
0
20 Dec 2019
Practical Solutions for Machine Learning Safety in Autonomous Vehicles
Sina Mohseni
Mandar Pitale
Vasu Singh
Zhangyang Wang
84
68
0
20 Dec 2019
Online Robustness Training for Deep Reinforcement Learning
Marc Fischer
M. Mirman
Steven Stalder
Martin Vechev
OnRL
102
41
0
03 Nov 2019
Certified Adversarial Robustness for Deep Reinforcement Learning
Björn Lütjens
Michael Everett
Jonathan P. How
AAML
98
95
0
28 Oct 2019
Universal Approximation with Certified Networks
Maximilian Baader
M. Mirman
Martin Vechev
67
22
0
30 Sep 2019
Achieving Verified Robustness to Symbol Substitutions via Interval Bound Propagation
Po-Sen Huang
Robert Stanforth
Johannes Welbl
Chris Dyer
Dani Yogatama
Sven Gowal
Krishnamurthy Dvijotham
Pushmeet Kohli
AAML
91
166
0
03 Sep 2019
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
339
294
0
03 Sep 2019
ART: Abstraction Refinement-Guided Training for Provably Correct Neural Networks
Xuankang Lin
He Zhu
R. Samanta
Suresh Jagannathan
AAML
92
29
0
17 Jul 2019