Abstract
Data-augmented deep learning models are widely used in real-world applications. However, many state-of the-art loss-based or coverage-based fuzzing techniques fail to produce fuzzing samples for them from many seeds. This paper proposes Aster, a novel technique to address this problem to enhance their fuzzing effectiveness for deep learning models trained with multi-sample data augmentation methods. Aster formulates a novel reachability-based strategy to encode the insights of every seed's direct and indirect data augmentation relation instances into the replacement seed of that seed systematically. Our experiment shows that Aster is highly effective. On average, loss-based and coverage-based fuzzing techniques can generate 166% and 110% more fuzzing samples and reduce 31% and 22% unsuccessful seeds, respectively, after adopting the replacement seeds generated by Aster to replace their original seeds. Their improved models also become up to 55% and 40% on average more robust against FGSM and PGD attacks in the experiment. Copyright © 2023 IEEE.
Original language | English |
---|---|
Title of host publication | Proceedings of 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security, QRS 2023 |
Place of Publication | Danvers, MA |
Publisher | IEEE |
Pages | 370-381 |
ISBN (Electronic) | 9798350319583 |
DOIs | |
Publication status | Published - 2023 |
Citation
Wang, H., Wei, Z., Zhou, Q., Jiang, B., & Chan, W. K. (2023). Aster: Encoding data augmentation relations into seed test suites for robustness assessment and fuzzing of data-augmented deep learning models. In Proceedings of 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security, QRS 2023 (pp. 370-381). IEEE. https://doi.org/10.1109/QRS60937.2023.00044Keywords
- Seed generation
- Fuzzing
- Testing
- Neural network
- Robustness
- Data augmentation