This report is concerned with the application of software quality and evaluation measures to AI software and, more broadly, with the question of quality assurance for AI software. It considers not only the metrics that attempt to measure some aspect of software quality, but also the methodologies and techniques (such as systematic testing) that attempt to improve some dimension of quality without necessarily quantifying the extent of the improvement. The report is divided into three parts. Part 1 reviews existing software quality measures, i.e., those that have been developed for, and applied to, conventional software. Part 2 considers the characteristics of AI software and the applicability and potential utility of the measures and techniques identified in Part 1, and reviews those few methods developed specifically for AI software. Part 3 presents an assessment and recommendations for the further exploration of this important area.