relating roc and cmc curves...2020/09/15  · with multiplecmc curves • thedistribution of...

28
Relating ROC and CMC Curves Work done with Brian DeCann Arun Ross Associate Professor Michigan State University http://www.cse.msu.edu/~rossarun

Upload: others

Post on 28-Feb-2021

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Relating ROC and CMC Curves

Work done with Brian DeCann

Arun Ross Associate Professor

Michigan State University

http://www.cse.msu.edu/~rossarun

Page 2: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Introduction

• Performance of a verification system is summarizedusing Receiver Operating Characteristic (ROC) curve

• Performance of a closed-set identification system issummarizedusingCumulativeMatch Characteristic(CMC) curve

• Can the CMC curve be derived from the ROC curve and vice-versa?

© Ross 2016

Page 3: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

ROC Curve

• Biometrics samples are compared against each other • Genuine and impostor scores are generated

• False Match Rate (FMR) and False Non-match Rate (FNMR) are computedat multiple thresholds

• ROC Curve: True Match Rate versus False Match Rate • ROC Curve: AggregateStatistics

© Ross 2016

Page 4: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

ROC Curve

Match Score Distributions ROC Curve

© Ross 2016

Page 5: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

CMC Curve • Each probe biometric sample is compared against all gallery samples

• The resulting scores are sorted and ranked

• Determine the rank at which a true match occurs • True Positive Identification Rate (TPIR): Probability of observing the correct identity within the top K ranks

• CMC Curve: Plots TPIR against ranks • CMC Curve: Rank-basedmetric

© Ross 2016

Page 6: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

CMC Curve

© Ross 2016

Page 7: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

CMC versus ROC • It is reasonable to expect a good ROC curve to be associated with a good CMC curve and vice-versa

© Ross 2016

Page 8: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Predicting CMC from ROC • The CMC can be predicted from the ROC data

• Bolle et. al. (2005), Hube (2006)

• R. Bolle, J. Connell, S. Pankanti, N. Ratha, and A. Senior. TheRelation Between theROCCurveandtheCMC. AutoID 2005 • J. Hube. UsingBiometric Verification to EstimateIdentificationPerformance. BSYM 2005

© Ross 2016

Page 9: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Predicting CMC from ROC • But neithermodel perfectly predicts the empirical CMC curve

© Ross 2016

Page 10: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

GOOD ROC POOR CMC CURVE CURVE

ROC versus CMC • DeCann and Ross (2012) showed that it is possible for a good ROC curve to be associatedwith a poor CMC curve and vice-versa

B. Decann and A.Ross, "CanaP oorVerification System be aG ood Identification System?APreliminary Study,” WIFS 2012 © Ross 2016

Page 11: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

User 1

User 2

User 3

P(Sc

ore)

P(

Scor

e)

P(Sc

ore)

Match Score Histogram 0.8

0.6

0.4

0.2

0

Genuine Scores Impostor Scores

0 0.5 1 Score

Match Score Histogram 0.8

0.6

0.4

0.2

0

Genuine Scores Impostor Scores

0 0.5 1 Score

Match Score Histogram 0.8

0.6

0.4

0.2

0

Genuine Scores Impostor Scores

0 0.5 1 Score

Why did CMC prediction models fail?

• Each identity contributes uniquely tothe system performance, i.e., aggregate statistics do not necessarily predict ranked statistics

© Ross 2016

Page 12: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

1

CMC Curve

0.85

0.9

0.95

1

82% Rank-1

0.80 2 4 6 8 10

Iden

tific

atio

n Ac

cura

cy

ROC Curve

Iden

tific

atio

n Ac

cura

cy

0.8

0.6

0.4

0.2

Rank

CMC Curve

0.6

0.7

0.8

0.9

1

55% Rank-1

0.50 2 6 104 8

00 0.2 0.4 0.6 0.8 1

False Match Rate

Gen

uine

Acc

ept R

ate

Rank

12

12

One ROC Curve: Multiple CMC Curves

© Ross 2016

Page 13: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Virtual Identities

• Input: Set of genuine and impostor match scores • Output: Virtual identities with different rank-based statistics

• Method: “Reassign” match scores to virtual identitiesaccording to the “Doddington’s Zoo” concept

• Sheep: Low FMR and FNMR • Goats: High FNMR • Lambs: High FMR

© Ross 2016

Page 14: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

1

0.8

0.2

Number of

Score Reassignment

Model

Genuine Scores Imposter Scores

Virtual Identities Virtual Identities Match Score Histogram

0.8 Genuine Scores Impostor Scores Match Scores 0.6

P(Sc

ore)

P(

Scor

e)

0.4

0.2

0.6 00 0.5

Score 0.4 .. ..

Match Score Histogram 0 0.8 0 0.5 1

Genuine Scores Impostor Scores Score 0.6

0.4

0.2

Pr(S

core

)

0Identity labels 0 0.5 Score (i.e., “Sheep”, “Goat”,

“Lamb”

1

1

Reassigning Match Scores

• Set of genuine and impostor match scores

© Ross 2016

Page 15: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Sampling Match Scores

• Depending upon “Sheep”, “Goat”, “Lamb” labels

© Ross 2016

Page 16: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Sampling Rationale

• Genuine Scores: Use the label (“Sheep”, “Goat”,“Lamb”) to assign genuinematch scores to a virtualidentity

• ImpostorScores: Use the labels of “pairs” of virtual identities to assign impostor match scores to a virtualidentity

© Ross 2016

Page 17: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

REAL IDENTITIES

R1

R2

R3

R4

GENUINE SCORES

IMPOSTOR SCORES

VIRTUAL IDENTITIES

V1

V2

V3

V4

From Real to Virtual

• Aggregate Statistics do not change

© Ross 2016

Page 18: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reassigning Genuine Scores

© Ross 2016

Page 19: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reassigning Genuine Scores

© Ross 2016

Page 20: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reassigning Impostor Scores

© Ross 2016

Page 21: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reassigning Impostor Scores

© Ross 2016

Page 22: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reassigning Impostor Scores

© Ross 2016

Page 23: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Datasets Used • Face: WVU Multimodal Dataset

• 240 subjects, 5 Samples / subject • Match scores computed using VeriLook

• Gait: CASIA B dataset • 124 subjects, 6 samples / subject • Match scores computed using Gait Curves algorithm

© Ross 2016

Page 24: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Evaluation Criteria

• ROC data: Area underneath the ROC (AUC) • CMC data:WeightedRank-Mstrategy

© Ross 2016

Page 25: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Generate Virtual Identities

•Generate virtual identities with different inputparameters: (% Sheep, % Goats, % Lambs)

•Compute AUC and Rank-Mvalues Sheep

(%) Goat (%)

Lambs (%)

AUC (Face)

Rank-M (Face)

AUC (Gait)

Rank-M (Gait)

100 0 0 0.999 1.0 0.980 1.0 82 10 8 0.999 1.0 0.980 0.966 50 26 24 0.999 0.997 0.980 0.915 15 10 75 0.999 0.997 0.980 0.800

Same Aggregate Statistics Different Rank Statistics

Note:In creasing the proportion of Goats or Lambs decreases Rank-M performance

© Ross 2016

Page 26: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

A Closer Look • ROC and CMC curves for “Original” and “Reassigned” • (15% Sheep, 10% Goats, 75% Lambs)

99% to 75%

100% to 99.7%

© Ross 2016

Page 27: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Summary

• It is possible for a single ROC curve to be associated with multiple CMC curves

• The distribution of “Sheep”, “Goat”, “Lamb” in the target population results in this phenomenon

• Any ROC-CMC prediction model, should account forthis variability in user performance

• Soft biometric traits are more likely to exhibit thistype of disparity

• Reporting both ROC and CMC curves is recommended

• Note: Closed-set identification

Project sponsored by ONR © Ross 2016

Page 28: Relating ROC and CMC Curves...2020/09/15  · with multipleCMC curves • Thedistribution of “Sheep”, “Goat”, “Lamb” in the target populationresults in this phenomenon

Reading Material

• B. DeCann and A. Ross, "Relating ROC and CMC Curves via the Biometric Menagerie," Proc. of 6thIEEE International Conference on Biometrics: Theory,Applications and Systems (BTAS), (Washington DC,USA), September2013

• B. Decann and A. Ross, "Can a Poor Verification System be a Good Identification System? A Preliminary Study," Proc. of IEEE InternationalWorkshop on Information Forensics and Security(WIFS), (Tenerife, Spain), December 2012

© Ross 2016