university of milano-bicocca anna.fiori@unimib

18
Anna Maria Fiori Department of Quantitative Methods for Economics and Business Sciences University of Milano-Bicocca [email protected] Two kurtosis measures in a simulation study CNAM, Paris COMPSTAT 2010 August 23, 2010

Upload: others

Post on 24-Oct-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: University of Milano-Bicocca anna.fiori@unimib

Anna Maria Fiori

Department of Quantitative Methods for

Economics and Business Sciences

University of Milano-Bicocca

[email protected]

Two kurtosis measures in a simulation study

CNAM, Paris COMPSTAT 2010 August 23, 2010

Page 2: University of Milano-Bicocca anna.fiori@unimib

2

Overview

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

� The IF (SIF) and its role in kurtosis studies

� From inequality (Lorenz) ordering to right/left kurtosis measures

� Two kurtosis measures: SIF comparison / numerical experiments

Page 3: University of Milano-Bicocca anna.fiori@unimib

3

Background

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

From Pearson (1905) onwards statistics textbooks have defined kurtosis

operationally as the characteristic of a distribution measured by its

standardized fourth moment. However…

4

2

−=σ

µβ XE

Non-Gaussian type of

symmetry (First Ed. ESS)

Excess/defect of frequency near

the mean compared to a normal

curve (Pearson, 1894-1905)

Peakedness + tailedness

(Dyson, 1943 – Finucan, 1964)

Tailedness only (Ali, 1974)Reverse tendency

toward bimodality

(Darlington, 1970)

Dispersion around the two

values µ +/µ +/µ +/µ +/−−−− σσσσ (Moors, 1986)

Page 4: University of Milano-Bicocca anna.fiori@unimib

4

Ba

ckg

rou

nd

& m

eth

od

Ma

in r

esu

lt

Hampel (1968) – Ruppert (1987)

� Consider a probability distribution F

and the functional β2 = β2 (F)

� “How does β2 change if we throw in

an additional observation at some

point x? ”

� Contaminated distribution:

Fε = (1− ε) F + ε δx (0 < ε < 1)

( ) ( )

−−−−=

∂∂

rr zzP

122

2

222 41

1 γββββrf

where:33

1;σµγ

σµ =−= r

r

az

( ) zz

FF

1222

22

22

0

41)(

)()(lim

2

γβββ

εββ ε

εβ

−−−−=

−=↓

)(, xxxxIFIFIFIF FFFF

with:σ

µ−= xz

Kurtosis and the IFAn early intuition of L. Faleschini

Faleschini (Statistica, 1948)

� Take a frequency distribution:{ar, fr ; Σ fr = P }

� “To investigate the behaviour of

β2 when a frequency fr is altered,

we compute the partial

derivative of β2 wrt fr”

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Page 5: University of Milano-Bicocca anna.fiori@unimib

5

Kurtosis and the IFFaleschini’s derivative – computational details

22

42 µ

µβ =

( )( )

( )µµ

µµ

−=

⋅−=∂∂

− ∑∑∑

ri

rrrr

r

i

r

i

aiP

fafaf

if

1

2

1

1

1

( )( )

( )isis

r

ris

rris

r

rr

is

maP

fafaff

m

−−

−−−

−=

⋅−=∂

∂∑∑

∑1

12

s = 2, 4

( ) ( )r

iis

s

i

i

r

s

f

m

i

s

f ∂⋅∂

−=

∂∂ −

=∑

µµ0

1 ( ) ( ){ }11−⋅⋅−−−−= srs

sr saa

Pµµµµ

( ) iis

s

i

is m

i

sµµ ⋅⋅

−= −

=∑0

1

Raw moment of order s− i

∑=

−− ⋅=

n

rr

isris fa

Pm

1

1

i-th power of µ

∑=

⋅=n

rrr fa

P 1

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Page 6: University of Milano-Bicocca anna.fiori@unimib

6

Kurtosis and the IFKurtosis explained

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

( ) ( )1),;( 22

2

22

2 −−−= ββββ zFxIFIn the symmetric case:

( )12224,3,2,1 −±±= βββσµx

Local maximum: β2 at x = µ

2βσµ ±=xMinima: β2 (1 − β2) at

Unbounded

KURTOSIS =

peakedness + tailedness

but β 2 is dominated by

tailweight

This IF suggests that β2 is likely to be

overestimated by sample kurtosis

for x distant from µ, but

underestimated at

intermediate values of x

µ

CTT

F F

IF = SIF

x1 x2 x3 x4

Quartic function with four real roots:

Page 7: University of Milano-Bicocca anna.fiori@unimib

7

Roots of the quartic:

Kurtosis and the IFThe normal case and sample kurtosis

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

<- IF = SIF

σµσµσµσµ

334,2742,0

742,0334,2)4()2(

)3()1(

+=−=

+=−=

rr

rr

aa

aa

flankflank

center

tail tail

( )[ ]

+

−−

−=−−=∂∂

361

631

24222

σµ

σµβ rr

rr

aa

Pz

Pf

Page 8: University of Milano-Bicocca anna.fiori@unimib

8

flankflank

center

tail tail

-6-6

Kurtosis and the IFThe normal case and sample kurtosis

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

<- IF = SIF( )[ ]

+

−−

−=−−=∂∂

361

631

24222

σµ

σµβ rr

rr

aa

Pz

Pf

There are two intervals (flanks) of

substantial probability (22% each) in

which the IF has relatively large

negative values (minimum = − 6)

These possibly correspond to smaller

values of sample kurtosis

-> Consistent with underestimation

of β2 by sample kurtosis (on average)

and undercoverage of confidence

intervals for β2

0,22 0,22

0,54

Page 9: University of Milano-Bicocca anna.fiori@unimib

9

Kurtosis by inequalityZenga (ESS, 2006); Fiori (Comm. Statist., 2008)

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Kurtosis-increasing

transformations

Non-egalitarian

transfers on D and S

for fixed γ, δD, δS

Separate analysis

of D and S:

Pigou-Dalton transfer principle

Median

S = γ – X | X < γwith E(S) = δ S < ∞

Left deviation

D = X – γ | X ≥ γwith E(D) = δ D < ∞

Right deviation

Page 10: University of Milano-Bicocca anna.fiori@unimib

10

Kurtosis by inequalityZenga’s kurtosis ordering (ESS, 2006)

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Left kurtosis

Lorenz curve of S :

∫−=

p

SS

S dttFpL0

1 )(1

)(δ

Right kurtosis

∫−=

p

DD

D dttFpL0

1 )(1

)(δ

Lorenz curve of D :Zenga’s kurtosis curve

Represents kurtosis by a mirror plot

of the Lorenz curves for D and S

Minimum kurtosis

LD(p)LS(p)

� Unified treatment of symmetric and asymmetric distributions

� Kurtosis ordering defined via nested Lorenz curves

� Liu, Parelius and Singh (Ann. Statist., 1999) in a multivariate setting

Page 11: University of Milano-Bicocca anna.fiori@unimib

11

Kurtosis by inequalityZenga’s kurtosis measures (ESS, 2006)

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

[ ])()( SRDRK +=2

12

with:

[ ][ ]dppLpSR

dppLpDR

S

D

∫−=

−=1

0

1

0

2

2

)()(

)()(

(right & left Gini indexes)

)(1)(

2

2

DEDC Dδ−=

)(1)(

2

2

SESC Sδ−=

[ ])()(2

11 SCDCK +=

with:

(ratios of right/left scale functionals)

Normalized measures, with values between 0

(minimum kurtosis) and 1 (maximum kurtosis)

right left right left

Page 12: University of Milano-Bicocca anna.fiori@unimib

12

Kurtosis measuresA look at the SIF

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

� Symmetric distribution (γ = 0):

� Symmetric contaminant (Ruppert, 1987):

First measure of (right) kurtosis

Page 13: University of Milano-Bicocca anna.fiori@unimib

13

Kurtosis measuresA look at the SIF

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Second measure of (right) kurtosis

Page 14: University of Milano-Bicocca anna.fiori@unimib

14

Kurtosis measuresSIF comparison at standard normal

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

All the measures are increased by contamination in the tails and at the

center and are decreased by contamination in the shoulders/flanks.

Having unbounded SIF,

they are sensitive to the

location of tail outliers as

well as their frequency.

However, conventional

kurtosis is much more

sensitive (quartic SIF).

The magnitude of min SIF

is considerably larger for

conventional kurtosis.

Page 15: University of Milano-Bicocca anna.fiori@unimib

15

Kurtosis measuresMonte Carlo experiment(N = 20,000; n = 20 to 640)

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Normal Laplace Tukey λ = -0.089higher peak heavier tails

Relative bias

Relative RMSE

Page 16: University of Milano-Bicocca anna.fiori@unimib

16

Kurtosis measuresSmall/medium sample behaviour

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Bootstrap confidence intervals at 95% level (percentile method)

B = 2,000 bootstrap resamples; N = 10,000 replications

Empirical

coverage

Average

length

� K2 is the measure which is likely to be estimated with the highest accuracy

� The sample performance of K2 improves as the parent distribution

becomes more peaked (Laplace)

Page 17: University of Milano-Bicocca anna.fiori@unimib

17

Directions for research

� Rigorous asymptotic inference for the new measure K2, possibly in

view of practical (financial?) applications of right/left kurtosis

� Check compatibility between the inequality-based concept of

kurtosis and some robust (e.g. quantile-based) measures recently

proposed in literature (Groeneveld, 1998; Schmid & Trede, 2003;

Brys, Hubert & Struyf, 2006; Kotz & Seier, 2007; …)

Anna M. Fiori Two kurtosis measures… COMPSTAT 2010

Page 18: University of Milano-Bicocca anna.fiori@unimib

Anna Maria Fiori

Department of Quantitative Methods for

Economics and Business Sciences

University of Milano-Bicocca

[email protected]

Two kurtosis measures in a simulation study

CNAM, Paris COMPSTAT 2010 August 23, 2010