can the calculation of a spectral global h distance ensure ... - lei … · milk analysis milk...

21
Can the calculation of a spectral Global H distance ensure the quality of international based MIR predictions? Lei ZHANG Ma Y.; DEHARENG F.; Grelet C.; COLINET F.; GENGLER N.; SOYEURT H. ICAR, Prague, 17/ 06/ 2019-21/06/2019

Upload: others

Post on 03-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Can the calculation of a spectral Global H distance

ensure the quality of international based MIR predictions?

Lei ZHANG

Ma Y.; DEHARENG F.; Grelet C.; COLINET F.; GENGLER N.; SOYEURT H.

ICAR, Prague, 17/ 06/ 2019-21/06/2019

Page 2: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Milk Sampling Scheme

© www.lait-solutions.com, 2019

Cow 1

Cow 2

Cow 3

3.5% fat

3.8% fat

4.1% fat

Milk recording

DHI

3.3% fat

Milk payment

Dairy industry

Page 3: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Milk Analysis

Milk recording(About 1 month for each

cow)

%fat

Milk MIR spectrum

EQUATION

Page 4: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

▶ Approximately 2,500-25,000nm (4,000-400 cm-1)

What is Mid-infrared spectrum?

© FOSS,2019

Page 5: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

© Bentley instrument, 2019

Principle of MIR spectrometry

© GRELEt C. et al. 2015

© Clement et al., 2015 JDS.

Page 6: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

How can we make a prediction ?

Page 7: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Can we make a prediction from those spectrum?

Can we a prediction for all spectra ?

© Modified from scikit-learn. 2019

New spectra

Calibration set

NO!Too far !

FarMilk MIR Spectrum

Page 8: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Can I make a prediction from those spectra?

Can we make a prediction for all spectra ?

Yeah

© Modified from scikit-learn. 2019

New spectra

Calibration setGH

GH

>x

≤x

Page 9: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Where: 𝑥 is PC scores of one spectrum; 𝜇 is the mean of PC scores of spectra in

the calibration set;S is covariance matrix between PC scores

of the calibration spectra

Mahalanobis Distance:

P.C. Mahalaobis(1893 – 1972)

© Wikipedia

© Modified from scikit-learn. 2019

New spectra

Calibration set

GH≤xGH>x

Page 10: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Mahalanobis Distance:

GH: Global H which is theStandardized Mahalanobis Distance

GH=𝐷𝑀

𝑛𝑃𝐶𝑠

Where: DM is the distance calculated

from the fomular;nPCs is the number of the

principal components from PCA

© Modified from scikit-learn. 2019

New spectra

Calibration set

GH≤xGH>x

Page 11: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

MilkRecording

Component(Reference)

STD

MIRSTD MIRCal

Equation

RMSE

R

MIR

What is the accuracy of prediction of international spectrum?

Samples : 198,394 milk records from Chinese Holstein cows

Instruments: Bentley FTS.

Duration: 8 months.

Prediction

Page 12: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

MilkRecording

Component(Reference)

STD

MIRSTD GH MIRCal

Equation

Prediction

Prediction

GH limitation

YesGH ≤ 3

RMSE

R

MIR

What is the accuracy of prediction of international spectrum?

Moderate loss of records when GH limitation(≤3) was applied

No

Fat Protein MFA PFA SFA UFA

N records

Without GH

198,394 198,394 198,394 198,394 198,394 198,394

With GH 172,547 174,062 159,651 159,509 174,825 159,467

Percent % 13.03 12.26 19.53 19.60 11.88 19.62

GHMean SD Minimum Maximum

1.93 2.43 0.00 475.00

GH≤3

Page 13: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Descriptive statistics

Traitsg/dL

Reference value Predicted value Predicted value(GH <= 3)

Mean SD Mean SD Mean SD

Fat 3.97 0.95 3.99 0.95 3.90 0.86

Protein 3.43 0.40 3.53 0.46 3.52 0.39

MFA 0.86 0.27 1.15 0.36 1.10 0.31

PFA 0.07 0.04 0.15 0.05 0.15 0.04

SFA 2.62 0.67 2.64 0.68 2.59 0.62

UFA 0.93 0.31 1.29 0.39 1.25 0.34

Table 1. Descriptive statistics of raw and predicted value

Page 14: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

The correlation coefficient

0,4

0,5

0,6

0,7

0,8

0,9

1

Fat Protein MFA PFA SFA UFA

Co

rre

lati

on

co

eff

icie

nt

Traits

Without GH limitation

With GH limitation

Page 15: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Squared residual and GH

0

0,1

0,2

0,3

0,4

0,5

FATPROTEIN

MFAPFA

SFAUFA

Co

eff

icie

nt

Traits

Correlation coefficient

© twitter

Page 16: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

GH limitation decreased RMSE for most traits

0

0,05

0,1

0,15

0,2

0,25

0,3

0,35

0,4

0,45

0,5

FAT PROT MFA PFA SFA UFA

Ro

ot

me

an s

qu

ared

err

or

(RM

SE,

g/d

L)

Traits

without GH limitation

with GH limitation

Δ

Page 17: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Conclusion:Predicted

Accuracy

Extrapolation Control

GH limitation

▶ GH limitation helps to ensure the quality of the MIR predictions

▶ It allows avoiding spectral extrapolation

▶ More work needed to be done to get

more accurate predictions…

Page 18: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Thanks for your attention!

Email: [email protected]

Page 19: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Ma Y.; DEHARENG F.; Grelet C.; COLINET F.; GENGLER N.; SOYEURT H.,&

Lei ZHANG

ICAR, Prague, 17/ 06/ 2019-21/06/2019

Email: [email protected]

Page 20: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Why do PCA?▶ To decrease the dimensionality of

the raw data

▶ To make it easy for calculating the inverse of the covariance matrix

Lever et al., 2017 Nature Method 2017

Additional information:

Page 21: Can the calculation of a spectral Global H distance ensure ... - Lei … · Milk Analysis Milk recording (About 1 month for each cow) %fat Milk MIR spectrum EQUATION Approximately

Why GH ≤ 3? ?Additional information: