1 metrics validation criteria: how do we know when a metric is worthwhile? ben smith andy meneely...

1

Metrics validation criteria: How do we know when a metric is worthwhile?

Ben Smith

Andy Meneely

Laurie Williams

Scenario

You and your team are asked to choose a set of metrics for your development company’s front-running application, iAwesome. The goal of this metrics project is to reduce post-release vulnerabilities by predicting them during the software lifecycle. How do you demonstrate to management that your metrics are meaningful and worthwhile?

2

Metric Uses

3

MetricsMetrics

Quality Assessment

Process CertificationProcess Improvement

Task Planning

Research

Prediction

Motivation

4

Software System

Component Component m=.25 m=.25Component Component m=.25 m=.25


Component Component m=. m=.0505Component Component m=. m=.0505


Component Component m=.15m=.15Component Component m=.15m=.15


Prediction

M < .2

Well, the metric was predictive…

…but may not be valid!

How do we know when a metric is valid?

5

Metrics Validation Criteria

Metrics validation criteria: boolean statements about various aspects of the validity of a metric.

Example:

Underlying theory validity: Is there an underlying theory as to why the metric was chosen?

6

Agenda

• Motivation: what is validity?

• Anatomy of a systematic literature review

• Validating a security metric for prediction

• Is prediction the only answer?

7

Objective

• Guide researchers in making

• Sound contributions to the metrics field

• Providing a practical summary

• The “superset” of all proposed metrics validation criteria

8

Foundation in the Literature

9

Systematic literature review

10

Phase Size of Source List

Literature Index 2,228

Title 536

Cross-confirmed Title 156

Abstract 44

Full-text 17

Follow-up 20

Results of the Review

• Three major categories for metrics validation criteria:– Internal: the metric correctly measures the attribute it

purports to measure.

– External: the metric is related in some way with an external quality factor.

– Construct: the gathering of a metric’s measurements is suitable for the definition of the targeted attribute.

11

Two Competing Philosophies

• Goal-driven: philosophy holds that the primary purpose of a metric is to apply it to a software process.

• Theory-driven: views that the primary purpose of a metric is to gain understanding of the nature of software.

12

Agenda





13

Scenario

You and your team are asked to choose a set of metrics for your development company’s front-running application, iAwesome. The goal of this metrics project is to reduce post-release vulnerabilities by predicting them during the software lifecycle. How do you demonstrate to management that your metrics are meaningful and worthwhile?

14

Choosing the best criteria

To succeed with this metrics project, you should chose validation criteria that:– Help with the accuracy of prediction– Prioritize business over knowledge for the sake

of knowledge– Are absolutely necessary

15

Metrics Validation Criteria

16

A priori validityActionabilityAppropriate ContinuityAppropriate GranularityAssociationAttribute validityCausal model validityCausal relationship validityContent validityConstruct validityConstructivenessDefinition validity Discriminative power Dimensional consistency Economic productivity Empirical validityExternal validity Factor independence Improvement validity Instrument validity Increasing growth validity Interaction sensitivity Internal consistency Internal validity

Monotonicity Metric Reliability Non-collinearity Non-exploitability Non-uniformity Notation validity Permutation validity Predictability Prediction system validity Process or Product Relevance Protocol validity Rank Consistency Renaming insensitivity Repeatability Representation condition Scale validity Stability Theoretical validity Trackability Transformation invariance Underlying theory validity Unit validity Usability

Reduced Metrics Validation Criteria

17

Rejected (and why)

• A metric has improvement validity if the metric is an improvement over existing metrics.

• A metric has increasing growth validity if the metric increases when concatenating two entities together.

18

Accepted (and why)

• A metric has usability if it can be cost-effectively implement in a quality assurance program.

• A metric has instrument validity if the underlying measurement is valid and properly calibrated.

19

Agenda





20

Measurement Theory

• Metrics can be used as the route to understanding the nature of software and the software development process

• Rather than a list of components, we’d like a list of action items based on a set of theories: applied science

• Reactive vs. Proactive

21

22

Questions?

1 metrics validation criteria: how do we know when a metric is worthwhile? ben smith andy meneely...

Documents

metrics project

set of metrics

metrics fieldproviding

metrics measurements

security metric

underlying theory validity

software lifecycle

software systemcomponent