v1v2v3v4 (latest)

DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems

20 March 2018

Ting Su

Yang Liu

Papers citing "DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems"

50 / 211 papers shown

Learning-Based Testing for Deep Learning: Enhancing Model Robustness with Adversarial Input Prioritization

Sheikh Md Mushfiqur Rahman

Nasir Eisty

AAML

28 Sep 2025

Influence-Guided Concolic Testing of Transformer Robustness

128

28 Sep 2025

TopoMap: A Feature-based Semantic Discriminator of the Topographical Regions in the Test Input Space

116

03 Sep 2025

DiCriTest: Testing Scenario Generation for Decision-Making Agents Considering Diversity and Criticality

15 Aug 2025

Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles

358

09 Apr 2025

Towards Assessing Deep Learning Test Input Generators

Seif Mzoughi

Ahmed Hajyahmed

Mohamed Elshafei

Foutse Khomh anb Diego Elias Costa

D. Costa

AAML

298

03 Apr 2025

MetaSel: A Test Selection Approach for Fine-tuned DNN ModelsIEEE Transactions on Software Engineering (TSE), 2025

446

21 Mar 2025

Breaking the Loop: Detecting and Mitigating Denial-of-Service Vulnerabilities in Large Language Models

229

01 Mar 2025

Democratic Training Against Universal Adversarial PerturbationsInternational Conference on Learning Representations (ICLR), 2025

278

08 Feb 2025

Path Analysis for Effective Fault Localization in Deep Neural NetworksApplied Soft Computing (Appl. Soft Comput.), 2023

312

28 Jan 2025

Benchmarking Generative AI Models for Deep Learning Test Input GenerationInternational Conference on Information Control Systems & Technologies (ICST), 2024

168

23 Dec 2024

Assessing Superposition-Targeted Coverage Criteria for Quantum Neural Networks

Minqi Shao

Jianjun Zhao

AAML

443

03 Nov 2024

DNN Modularization via Activation-Driven Training

327

01 Nov 2024

Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers

1.2K

30 Oct 2024

FAST: Boosting Uncertainty-based Test Prioritization Methods for Neural Networks via Feature SelectionInternational Conference on Automated Software Engineering (ASE), 2024

Jingyi Wang

Marta Kwiatkowska

Peng Cheng

226

13 Sep 2024

Towards certifiable AI in aviation: landscape, challenges, and opportunities

Daniel Geißler

Mengxi Liu

Bo Zhou

Paul Lukowicz

214

13 Sep 2024

LeCov: Multi-level Testing Criteria for Large Language Models

Xuan Xie

Yuheng Huang

Da Song

Fuyuan Zhang

Felix Juefei-Xu

Lei Ma

ELM

201

20 Aug 2024

Robust Black-box Testing of Deep Neural Networks using Co-Domain Coverage

165

13 Aug 2024

Contexts Matter: An Empirical Study on Contextual Influence in Fairness Testing for Deep Learning SystemsInternational Symposium on Empirical Software Engineering and Measurement (ESEM), 2024

Chengwen Du

Tao Chen

174

12 Aug 2024

Effective Black Box Testing of Sentiment Analysis Classification Networks

Parsa Karbasizadeh

Fathiyeh Faghih

Pouria Golshanrad

215

30 Jul 2024

A3Rank: Augmentation Alignment Analysis for Prioritizing Overconfident Failing Samples for Deep Learning Models

149

19 Jul 2024

Constraint-based Adversarial Example Synthesis

223

03 Jun 2024

Uncertainty Measurement of Deep Learning System based on the Convex Hull of Training Sets

Hyekyoung Hwang

Jitae Shin

AAML UQCV

183

25 May 2024

Identifying phase transitions in physical systems with neural networks: a neural architecture search perspective

R. C. Terin

Z. G. Arenas

Roberto Santana

158

23 Apr 2024

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

385

12 Apr 2024

A Survey of Neural Network Robustness Assessment in Image Recognition

308

12 Apr 2024

Machine Learning Robustness: A Primer

Houssem Ben Braiek

Foutse Khomh

AAML OOD

467

01 Apr 2024

DeepKnowledge: Generalisation-Driven Deep Learning Testing

S. Missaoui

Simos Gerasimou

Nikolaos Matragkas

183

25 Mar 2024

Beyond Accuracy: An Empirical Study on Unit Testing in Open-source Deep Learning Projects

173

26 Feb 2024

QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems

194

20 Feb 2024

DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine ExtractionJournal of Systems and Software (JSS), 2024

Pouria Golshanrad

Fathiyeh Faghih

167

10 Feb 2024

Investigating White-Box Attacks for On-Device Models

345

08 Feb 2024

Outline of an Independent Systematic Blackbox Test for ML-based Systems

H. Wiesbrock

Jürgen Grossmann

195

30 Jan 2024

Towards Enhancing the Reproducibility of Deep Learning Bugs: An Empirical Study

Mehil B. Shah

Mohammad Masudur Rahman

Foutse Khomh

288

05 Jan 2024

DREAM: Debugging and Repairing AutoML PipelinesACM Transactions on Software Engineering and Methodology (TOSEM), 2023

Xiaoyu Zhang

Juan Zhai

Shiqing Ma

Chao Shen

300

31 Dec 2023

Efficient Representation of the Activation Space in Deep Neural Networks

205

13 Dec 2023

GIST: Generated Inputs Sets Transferability in Deep LearningACM Transactions on Software Engineering and Methodology (TOSEM), 2023

417

01 Nov 2023

LUNA: A Model-Based Universal Analysis Framework for Large Language ModelsIEEE Transactions on Software Engineering (TSE), 2023

350

22 Oct 2023

Test & Evaluation Best Practices for Machine Learning-Enabled Systems

Jaganmohan Chandrasekaran

190

10 Oct 2023

RAI4IoE: Responsible AI for Enabling the Internet of EnergyInternational Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2023

Minhui Xue

Surya Nepal

Ling Liu

Subbu Sethuvenkatraman

255

20 Sep 2023

An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning SystemsInternational Conference on Automated Software Engineering (ASE), 2023

163

23 Aug 2023

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation SoftwareInternational Conference on Automated Software Engineering (ASE), 2023

Michael R. Lyu

136

18 Aug 2023

Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench

Michael R. Lyu

362

07 Aug 2023

Evaluating the Robustness of Test Selection Methods for Deep Neural Networks

Yuejun Guo

199

29 Jul 2023

Feature Map Testing for Deep Neural Networks

Heming Cui

122

21 Jul 2023

Neuron Sensitivity Guided Test Case Selection for Deep Learning TestingACM Transactions on Software Engineering and Methodology (TOSEM), 2023

Heming Cui

285

20 Jul 2023

CertPri: Certifiable Prioritization for Deep Neural Networks via Movement Cost in Feature SpaceInternational Conference on Automated Software Engineering (ASE), 2023

175

18 Jul 2023

A Scenario-Based Functional Testing Approach to Improving DNN PerformanceInternational Symposium on Service Oriented Software Engineering (SOSE), 2023

118

13 Jul 2023

Neuron Activation Coverage: Rethinking Out-of-distribution Detection and GeneralizationInternational Conference on Learning Representations (ICLR), 2023

Shiqi Wang

322

05 Jun 2023

How Deep Learning Sees the World: A Survey on Adversarial Attacks & DefensesIEEE Access (IEEE Access), 2023

370

108

18 May 2023