Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.07519
Cited By
v1
v2
v3
v4 (latest)
DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems
20 March 2018
Lei Ma
Felix Juefei Xu
Fuyuan Zhang
Jiyuan Sun
Minhui Xue
Yue Liu
Chunyang Chen
Ting Su
Li Li
Yang Liu
Jianjun Zhao
Yadong Wang
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems"
50 / 211 papers shown
Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization
Sheikh Md Mushfiqur Rahman
Nasir Eisty
AAML
72
1
0
28 Sep 2025
Influence-Guided Concolic Testing of Transformer Robustness
Chih-Duo Hong
Yu Wang
Yao-Chen Chang
Fang Yu
128
0
0
28 Sep 2025
TopoMap: A Feature-based Semantic Discriminator of the Topographical Regions in the Test Input Space
Gianmarco De Vita
Nargiz Humbatova
Paolo Tonella
AAML
116
0
0
03 Sep 2025
DiCriTest: Testing Scenario Generation for Decision-Making Agents Considering Diversity and Criticality
Qitong Chu
Yufeng Yue
Danya Yao
Huaxin Pei
97
0
0
15 Aug 2025
Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles
Zihao Xu
Junchen Ding
Yiling Lou
Kun Zhang
Dong Gong
Yuekang Li
ELM
LRM
358
1
0
09 Apr 2025
Towards Assessing Deep Learning Test Input Generators
Seif Mzoughi
Ahmed Hajyahmed
Mohamed Elshafei
Foutse Khomh anb Diego Elias Costa
D. Costa
AAML
298
0
0
03 Apr 2025
MetaSel: A Test Selection Approach for Fine-tuned DNN Models
IEEE Transactions on Software Engineering (TSE), 2025
Amin Abbasishahkoo
Mahboubeh Dadkhah
Lionel C. Briand
Dayi Lin
446
1
0
21 Mar 2025
Breaking the Loop: Detecting and Mitigating Denial-of-Service Vulnerabilities in Large Language Models
Junzhe Yu
Yi Liu
Huijia Sun
Ling Shi
Yuqi Chen
229
1
0
01 Mar 2025
Democratic Training Against Universal Adversarial Perturbations
International Conference on Learning Representations (ICLR), 2025
Bing-Jie Sun
Jun Sun
Wei Zhao
AAML
278
0
0
08 Feb 2025
Path Analysis for Effective Fault Localization in Deep Neural Networks
Applied Soft Computing (Appl. Soft Comput.), 2023
Soroush Hashemifar
Saeed Parsa
A. Kalaee
AAML
312
0
0
28 Jan 2025
Benchmarking Generative AI Models for Deep Learning Test Input Generation
International Conference on Information Control Systems & Technologies (ICST), 2024
Maryam
Matteo Biagiola
Andrea Stocco
Vincenzo Riccio
VLM
168
6
0
23 Dec 2024
Assessing Superposition-Targeted Coverage Criteria for Quantum Neural Networks
Minqi Shao
Jianjun Zhao
AAML
443
2
0
03 Nov 2024
DNN Modularization via Activation-Driven Training
Tuan Ngo
Abid Hassan
Saad Shafiq
Nenad Medvidovic
MoMe
327
0
0
01 Nov 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
1.2K
3
0
30 Oct 2024
FAST: Boosting Uncertainty-based Test Prioritization Methods for Neural Networks via Feature Selection
International Conference on Automated Software Engineering (ASE), 2024
Jialuo Chen
Jingyi Wang
Xiyue Zhang
Youcheng Sun
Marta Kwiatkowska
Jiming Chen
Peng Cheng
226
2
0
13 Sep 2024
Towards certifiable AI in aviation: landscape, challenges, and opportunities
Hymalai Bello
Daniel Geißler
L. Ray
Stefan Muller-Divéky
Peter Muller
Shannon Kittrell
Mengxi Liu
Bo Zhou
Paul Lukowicz
214
1
0
13 Sep 2024
LeCov: Multi-level Testing Criteria for Large Language Models
Xuan Xie
Yuheng Huang
Yuheng Huang
Da Song
Fuyuan Zhang
Felix Juefei-Xu
Lei Ma
ELM
201
0
0
20 Aug 2024
Robust Black-box Testing of Deep Neural Networks using Co-Domain Coverage
Aishwarya Gupta
Indranil Saha
Piyush Rai
AAML
MLAU
165
1
0
13 Aug 2024
Contexts Matter: An Empirical Study on Contextual Influence in Fairness Testing for Deep Learning Systems
International Symposium on Empirical Software Engineering and Measurement (ESEM), 2024
Chengwen Du
Tao Chen
174
1
0
12 Aug 2024
Effective Black Box Testing of Sentiment Analysis Classification Networks
Parsa Karbasizadeh
Fathiyeh Faghih
Pouria Golshanrad
215
0
0
30 Jul 2024
A3Rank: Augmentation Alignment Analysis for Prioritizing Overconfident Failing Samples for Deep Learning Models
Zhengyuan Wei
Haipeng Wang
Qili Zhou
William Chan
149
0
0
19 Jul 2024
Constraint-based Adversarial Example Synthesis
Fang Yu
Ya-Yu Chi
Yu-Fang Chen
AAML
223
2
0
03 Jun 2024
Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets
Hyekyoung Hwang
Jitae Shin
AAML
UQCV
183
2
0
25 May 2024
Identifying phase transitions in physical systems with neural networks: a neural architecture search perspective
R. C. Terin
Z. G. Arenas
Roberto Santana
158
1
0
23 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie
Yuheng Huang
Zhehua Zhou
Yuheng Huang
Da Song
Lei Ma
OffRL
385
12
0
12 Apr 2024
A Survey of Neural Network Robustness Assessment in Image Recognition
Jie Wang
Jun Ai
Minyan Lu
Haoran Su
Dan Yu
Yutao Zhang
Junda Zhu
Jingyu Liu
AAML
308
4
0
12 Apr 2024
Machine Learning Robustness: A Primer
Houssem Ben Braiek
Foutse Khomh
AAML
OOD
467
21
0
01 Apr 2024
DeepKnowledge: Generalisation-Driven Deep Learning Testing
S. Missaoui
Simos Gerasimou
Nikolaos Matragkas
183
1
0
25 Mar 2024
Beyond Accuracy: An Empirical Study on Unit Testing in Open-source Deep Learning Projects
Han Wang
Sijia Yu
Chunyang Chen
Burak Turhan
Xiaodong Zhu
ELM
MLAU
173
4
0
26 Feb 2024
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems
Jinjing Shi
Zimeng Xiao
Heyuan Shi
Yu Jiang
Xuelong Li
AAML
194
5
0
20 Feb 2024
DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine Extraction
Journal of Systems and Software (JSS), 2024
Pouria Golshanrad
Fathiyeh Faghih
167
8
0
10 Feb 2024
Investigating White-Box Attacks for On-Device Models
M. Zhou
Yantao Du
Jing Wu
Kui Liu
Hailong Sun
Li Li
AAML
345
12
0
08 Feb 2024
Outline of an Independent Systematic Blackbox Test for ML-based Systems
H. Wiesbrock
Jürgen Grossmann
195
2
0
30 Jan 2024
Towards Enhancing the Reproducibility of Deep Learning Bugs: An Empirical Study
Mehil B. Shah
Mohammad Masudur Rahman
Foutse Khomh
288
10
0
05 Jan 2024
DREAM: Debugging and Repairing AutoML Pipelines
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Xiaoyu Zhang
Juan Zhai
Shiqing Ma
Chao Shen
300
4
0
31 Dec 2023
Efficient Representation of the Activation Space in Deep Neural Networks
Tanya Akumu
C. Cintas
G. Tadesse
Adebayo Oshingbesan
Skyler Speakman
E. McFowland
AAML
205
2
0
13 Dec 2023
GIST: Generated Inputs Sets Transferability in Deep Learning
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Florian Tambon
Foutse Khomh
G. Antoniol
AAML
417
1
0
01 Nov 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
IEEE Transactions on Software Engineering (TSE), 2023
Da Song
Xuan Xie
Yuheng Huang
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
350
9
0
22 Oct 2023
Test & Evaluation Best Practices for Machine Learning-Enabled Systems
Jaganmohan Chandrasekaran
Tyler Cody
Nicola McCarthy
Erin Lanus
Laura J. Freeman
190
8
0
10 Oct 2023
RAI4IoE: Responsible AI for Enabling the Internet of Energy
International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2023
Minhui Xue
Surya Nepal
Ling Liu
Subbu Sethuvenkatraman
Xingliang Yuan
Carsten Rudolph
Ruoxi Sun
Greg Eisenhauer
255
7
0
20 Sep 2023
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems
International Conference on Automated Software Engineering (ASE), 2023
Ahmed Haj Yahmed
Rached Bouchoucha
Houssem Ben Braiek
Foutse Khomh
CLL
AI4CE
163
0
0
23 Aug 2023
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
International Conference on Automated Software Engineering (ASE), 2023
Wenxuan Wang
Jingyuan Huang
Shu Yang
Chang Chen
Jiazhen Gu
Pinjia He
Michael R. Lyu
VLM
136
6
0
18 Aug 2023
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Shu Yang
Man Ho Lam
E. Li
Shujie Ren
Wenxuan Wang
Wenxiang Jiao
Zhaopeng Tu
Michael R. Lyu
362
66
0
07 Aug 2023
Evaluating the Robustness of Test Selection Methods for Deep Neural Networks
Qiang Hu
Yuejun Guo
Xiaofei Xie
Maxime Cordy
Wei Ma
Mike Papadakis
Yves Le Traon
NoLa
OOD
199
7
0
29 Jul 2023
Feature Map Testing for Deep Neural Networks
Dong Huang
Qi Bu
Yahao Qing
Yichao Fu
Heming Cui
122
3
0
21 Jul 2023
Neuron Sensitivity Guided Test Case Selection for Deep Learning Testing
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Dong Huang
Qi Bu
Yichao Fu
Yuhao Qing
Junjie Chen
Heming Cui
AAML
285
7
0
20 Jul 2023
CertPri: Certifiable Prioritization for Deep Neural Networks via Movement Cost in Feature Space
International Conference on Automated Software Engineering (ASE), 2023
Haibin Zheng
Jinyin Chen
Haibo Jin
AAML
175
9
0
18 Jul 2023
A Scenario-Based Functional Testing Approach to Improving DNN Performance
International Symposium on Service Oriented Software Engineering (SOSE), 2023
Hong Zhu
T. T. T. Tran
Aduen Benjumea
Andrew Bradley
118
6
0
13 Jul 2023
Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization
International Conference on Learning Representations (ICLR), 2023
Zichen Liu
Chris Xing Tian
Haoliang Li
Lei Ma
Shiqi Wang
UQCV
322
24
0
05 Jun 2023
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses
IEEE Access (IEEE Access), 2023
Joana Cabral Costa
Tiago Roxo
Hugo Manuel Proença
Pedro R. M. Inácio
AAML
370
108
0
18 May 2023
1
2
3
4
5
Next