posted on 2012-07-03, 14:33authored byGordon Fraser, Neil Walkinshaw
Identifying a finite test set that adequately captures the essential behaviour of a program such that all faults are identified is a well-established problem. Traditional adequacy metrics can be impractical, and may be misleading even if they are satisfied. One intuitive notion of adequacy, which has been discussed in theoretical terms over the past three decades, is the idea of behavioural coverage; if it is possible to infer an accurate model of a system from its test executions, then the test set must be adequate. Despite its intuitive basis, it has remained almost entirely in the theoretical domain because inferred models have been expected to be exact (generally an infeasible task), and have not allowed for any pragmatic interim measures of adequacy to guide test set generation. In this work we present a new test generation technique that is founded on behavioural adequacy, which combines a model evaluation framework from the domain of statistical learning theory with search-based white-box test generation strategies. Experiments with our BESTEST prototype indicate that such test sets not only come with a statistically valid measurement of adequacy, but also detect significantly more defects.
History
Citation
Software Testing, Verification and Validation (ICST), 2012 IEEE Fifth International Conference on, Proceedings of, pp. 300-309
Author affiliation
/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Computer Science
Source
Software Testing, Verification and Validation (ICST), 2012 IEEE Fifth International Conference on, 17-21 April 2012, Montreal
Version
AM (Accepted Manuscript)
Published in
Software Testing
Publisher
Institute of Electrical and Electronics Engineers (IEEE)