a skewed look at bivariate and multivariate order statistics · 2007. 6. 29. · a skewed look at...

A Skewed Look atBivariate and Multivariate

Order Statistics

Prof. N. Balakrishnan

Dept. of Mathematics & Statistics

McMaster University, Canada

bala@mcmaster.ca– p. 1/41

Presented with great pleasure as

– p. 2/41

Plenary Lecture

– p. 2/41

Plenary Lecture

at Tartu Conference, 2007

– p. 2/41

Roadmap

1. Order Statistics

2. Skew-Normal Distribution

3. OS from BVN Distribution

4. Generalized Skew-Normal Distribution

5. OS from TVN Distribution

6. OS Induced by Linear Functions

7. OS from BV and TV t Distributions

8. Bibliography

– p. 3/41

Order StatisticsLet X1, · · · , Xn be n independent identically distributed(IID) random variables from a popln. with cumulativedistribution function (cdf) F (x) and an absolutelycontinuous probability density function (pdf) f(x).

– p. 4/41

Order StatisticsLet X1, · · · , Xn be n independent identically distributed(IID) random variables from a popln. with cumulativedistribution function (cdf) F (x) and an absolutelycontinuous probability density function (pdf) f(x).

If we arrange these Xi’s in increasing order ofmagnitude, we obtain the so-called order statistics,denoted by

X1:n ≤ X2:n ≤ · · · ≤ Xn:n,

which are clearly dependent.

– p. 4/41

Order Statistics (cont.)

Using multinomial argument, we readily have forr = 1, · · · , n

Pr (x < Xr:n ≤ x + δx)

(r − 1)!(n − r)!F (x)r−1 F (x + δx) − F (x)

×1 − F (x + δx)n−r + O(

(δx)2)

– p. 5/41

Using multinomial argument, we readily have forr = 1, · · · , n

Pr (x < Xr:n ≤ x + δx)

(r − 1)!(n − r)!F (x)r−1 F (x + δx) − F (x)

×1 − F (x + δx)n−r + O(

(δx)2)

From this, we obtain the pdf of Xr:n as (for x ∈ R)

fr:n(x) =n!

(r − 1)!(n − r)!F (x)r−1 1 − F (x)n−r

– p. 5/41

Similarly, we obtain the joint pdf of (Xr:n, Xs:n) as (for1 ≤ r < s ≤ n and x < y)

fr,s:n(x, y) =n!

(r − 1)!(s − r − 1)!(n − s)!F (x)r−1 f(x)

×F (y) − F (x)s−r−1 1 − F (y)n−s f(y).

– p. 6/41

fr,s:n(x, y) =n!

(r − 1)!(s − r − 1)!(n − s)!F (x)r−1 f(x)

×F (y) − F (x)s−r−1 1 − F (y)n−s f(y).

From the pdf and joint pdf, we can derive, for example,means, variances and covariances of order statistics,and also study their dependence structure.

– p. 6/41

fr,s:n(x, y) =n!

(r − 1)!(s − r − 1)!(n − s)!F (x)r−1 f(x)

×F (y) − F (x)s−r−1 1 − F (y)n−s f(y).

From the pdf and joint pdf, we can derive, for example,means, variances and covariances of order statistics,and also study their dependence structure.

The area of order statistics has a long and rich history,and a very vast literature.

– p. 6/41

Some key references are the books by

– p. 7/41

H.A. David (1970, 1981)

B. Arnold & N. Balakrishnan (1989)

N. Balakrishnan & A.C. Cohen (1991)

B. Arnold, N. Balakrishnan & H.N. Nagaraja (1992)

N. Balakrishnan & C.R. Rao (1998 a,b)

H.A. David & H.N. Nagaraja (2003)

– p. 7/41

H.A. David (1970, 1981)

B. Arnold & N. Balakrishnan (1989)

N. Balakrishnan & A.C. Cohen (1991)

B. Arnold, N. Balakrishnan & H.N. Nagaraja (1992)

N. Balakrishnan & C.R. Rao (1998 a,b)

H.A. David & H.N. Nagaraja (2003)

However, most of the literature on order statistics havefocused on the independent case, and very little on thedependent case.

– p. 7/41

Skew-normal Distribution

The skew-normal distribution has pdf

ϕ(x) = 2 Φ(λx)φ(x), x ∈ R, λ ∈ R,

where φ(·) and Φ(·) are standard normal pdfand cdf, respectively.

– p. 8/41

ϕ(x) = 2 Φ(λx)φ(x), x ∈ R, λ ∈ R,

Note that

– p. 8/41

ϕ(x) = 2 Φ(λx)φ(x), x ∈ R, λ ∈ R,

Note that

λ ∈ R is a shape parameter;λ = 0 corresponds to std. normal case;λ → ∞ corresponds to half normal case;Location and scale parameters can beintroduced into the model as well. – p. 8/41

Skew-Normal Distribution(cont.)

Azzalini’s (1985, Scand. J. Statist.) articlegenerated a lot of work on this family.

– p. 9/41

This distribution was, however, present eitherexplicitly or implicitly in the early works of

– p. 9/41

Birnbaum (1950)Nelson (1964)Weinstein (1964)O’Hagan & Leonard (1976)

– p. 9/41

Birnbaum (1950)Nelson (1964)Weinstein (1964)O’Hagan & Leonard (1976)

Interpretation through hidden truncation /selective reporting is due to Arnold & Beaver(2002, Test) for univariate/multivariate case. – p. 9/41

New connection to OS

– p. 10/41

New connection to OSFor n = 0, 1, 2, · · · , consider the integral

Ln(λ) =

∫ ∞

−∞Φ(λx)n

φ(x) dx.

– p. 10/41

Ln(λ) =

∫ ∞

−∞Φ(λx)n

φ(x) dx.

Clearly, L0(λ) = 1 ∀λ ∈ R.

– p. 10/41

Ln(λ) =

∫ ∞

−∞Φ(λx)n

φ(x) dx.

Clearly, L0(λ) = 1 ∀λ ∈ R.

Furthermore,∫ ∞

−∞

Φ(λx) − 1

φ(x) dx = 0

since the integrand is an odd function of x,we obtain: – p. 10/41

L2n+1(λ) =2n+1∑

(−1)i+1 1

2n + 1

L2n+1−i(λ), λ ∈ R,

for n = 0, 1, 2, . . .

– p. 11/41

L2n+1(λ) =2n+1∑

(−1)i+1 1

2n + 1

L2n+1−i(λ), λ ∈ R,

for n = 0, 1, 2, . . .

For n = 0, we simply obtain

L1(λ) =

∫ ∞

−∞Φ(λx) φ(x) dx =

2L0(λ) =

2, λ ∈ R,

– p. 11/41

L2n+1(λ) =2n+1∑

(−1)i+1 1

2n + 1

L2n+1−i(λ), λ ∈ R,

for n = 0, 1, 2, . . .

For n = 0, we simply obtain

L1(λ) =

∫ ∞

−∞Φ(λx) φ(x) dx =

2L0(λ) =

2, λ ∈ R,

which leads to the skew-normal density

ϕ1(x;λ) = 2 Φ(λx)φ(x), x ∈ R, λ ∈ R.

– p. 11/41

L2(λ) =

∫ ∞

−∞Φ(λx)2

φ(x) dx, λ ∈ R,

– p. 12/41

L2(λ) =

∫ ∞

−∞Φ(λx)2

φ(x) dx, λ ∈ R,

we find

– p. 12/41

L2(λ) =

∫ ∞

−∞Φ(λx)2

φ(x) dx, λ ∈ R,

we finddL2(λ)

dλ= 2

∫ ∞

−∞x Φ(λx) φ(λx) φ(x) dx

∫ ∞

−∞Φ(λx) x exp

2x2(1 + λ2)

π(1 + λ2)√

1 + 2λ2.

– p. 12/41

L2(λ) =

∫ ∞

−∞Φ(λx)2

φ(x) dx, λ ∈ R,

we finddL2(λ)

dλ= 2

∫ ∞

−∞x Φ(λx) φ(λx) φ(x) dx

∫ ∞

−∞Φ(λx) x exp

2x2(1 + λ2)

π(1 + λ2)√

1 + 2λ2.

Solving this differential equation, we obtain– p. 12/41

L2(λ) =1

πtan−1

√1 + 2λ2, λ ∈ R,

– p. 13/41

L2(λ) =1

πtan−1

√1 + 2λ2, λ ∈ R,

which leads to another skew-normal density

– p. 13/41

L2(λ) =1

πtan−1

√1 + 2λ2, λ ∈ R,

ϕ2(x;λ) =π

tan−1√

1 + 2λ2Φ(λx)2

φ(x), x ∈ R, λ ∈ R.

– p. 13/41

L2(λ) =1

πtan−1

√1 + 2λ2, λ ∈ R,

ϕ2(x;λ) =π

tan−1√

1 + 2λ2Φ(λx)2

φ(x), x ∈ R, λ ∈ R.

Remark 1: Interestingly, this family alsoincludes standard normal (when λ = 0) andthe half normal (when λ → ∞) distributions,just as ϕ1(x;λ) does.

– p. 13/41

Next, setting n = 1 in the relation, we get

– p. 14/41

L3(λ) =3

2L2(λ) − 3

4L1(λ) +

8L0(λ)

2πtan−1

√1 + 2λ2 − 1

4, λ ∈ R,

– p. 14/41

L3(λ) =3

2L2(λ) − 3

4L1(λ) +

8L0(λ)

2πtan−1

√1 + 2λ2 − 1

4, λ ∈ R,

– p. 14/41

L3(λ) =3

2L2(λ) − 3

4L1(λ) +

8L0(λ)

2πtan−1

√1 + 2λ2 − 1

4, λ ∈ R,

ϕ3(x;λ) =1

tan−1√

1 + 2λ2 − 14

Φ(λx)3φ(x), x ∈ R.

– p. 14/41

L3(λ) =3

2L2(λ) − 3

4L1(λ) +

8L0(λ)

2πtan−1

√1 + 2λ2 − 1

4, λ ∈ R,

ϕ3(x;λ) =1

tan−1√

1 + 2λ2 − 14

Φ(λx)3φ(x), x ∈ R.

Remark 2: Interestingly, this family alsoincludes standard normal (when λ = 0) andthe half normal (when λ → ∞) distributions,just as ϕ1(x;λ) and ϕ2(x;λ) do. – p. 14/41

Remark 3: Evidently, in the special casewhen λ = 1, the densities ϕ1(x;λ), ϕ(x;λ2)and ϕ3(x;λ) become the densities of thelargest OS in samples of size 2, 3 and 4,respectively, from N(0, 1) distribution.

– p. 15/41

Remark 4: In addition, the integral Ln(λ) isalso involved in the means of OS from N(0, 1)distribution. For example, with µm:m denotingthe mean of the largest OS in a sample ofsize m from N(0, 1), we have

– p. 15/41

Remark 4: In addition, the integral Ln(λ) isalso involved in the means of OS from N(0, 1)distribution. For example, with µm:m denotingthe mean of the largest OS in a sample ofsize m from N(0, 1), we have

µ2:2 =L0√π

, µ3:3 =3L1(1)√

π, µ4:4 =

6L2(1)√π

, µ5:5 =10L3(1)√

– p. 15/41

Natural questions that arise:

– p. 16/41

First, we could consider a generalskew-normal family

ϕ(x;λ, p) = C(λ, p) Φ(λx)pφ(x), x ∈ R, λ ∈ R, p > 0.

– p. 16/41

How about the normalizing constant?

– p. 16/41

How much flexibility is in this family?

– p. 16/41

Can we develop inference for this model?

– p. 16/41

Next, can we have a similar skewed look atOS from BV and MV normal distributions?

– p. 16/41

Next, can we have a similar skewed look atOS from BV and MV normal distributions?

How about other distributions? – p. 16/41

OS from BVN Distribution

Let (X1, X2)d= BV N

µ1, µ2, σ21, σ

22, ρ)

– p. 17/41

Let (X1, X2)d= BV N

µ1, µ2, σ21, σ

22, ρ)

Let W1:2 ≤ W2:2 be the OS from (X1, X2).

– p. 17/41

Let (X1, X2)d= BV N

µ1, µ2, σ21, σ

22, ρ)

This bivariate case has been discussed bymany, including

– p. 17/41

Let (X1, X2)d= BV N

µ1, µ2, σ21, σ

22, ρ)

This bivariate case has been discussed bymany, including

S.S. Gupta and K.C.S. Pillai (1965)A.P. Basu and J.K. Ghosh (1978)H.N. Nagaraja (1982)N. Balakrishnan (1993)M. Cain (1994)M. Cain and E. Pan (1995) – p. 17/41

Generalized Skew-NormalDistribution

A variable Zλ1,λ2,ρ is said to have ageneralized skew-normal distribution,denoted by GSN(λ1, λ2, ρ), if

– p. 18/41

Zλ1,λ2,ρd= X | (Y1 < λ1X,Y2 < λ2X), λ1, λ2 ∈ R, |ρ| < 1,

where X ∼ N(0, 1) independently of(Y1, Y2) ∼ BV N(0, 0, 1, 1, ρ).

– p. 18/41

It should be mentioned that Zλ1,λ2,ρ belongs to

– p. 18/41

It should be mentioned that Zλ1,λ2,ρ belongs to

SUN1,2(0, 0, 1,Ω∗) [Arellano-Valle & Azzalini (2006)]

CSN1,2 [Farias, Molina & A.K. Gupta (2004)]

– p. 18/41

Generalized Skew-NormalDistribution (cont.)

It can shown that the pdf of Zλ1,λ2,ρ is

– p. 19/41

ϕ(z;λ1, λ2, ρ) = c(λ1, λ2, ρ) φ(z) Φ(λ1z, λ2z; ρ), z ∈ R,

with λ1, λ2 ∈ R, |ρ| < 1, and Φ(·, ·; ρ) denotingthe cdf of BV N(0, 0, 1, 1, ρ).

– p. 19/41

ϕ(z;λ1, λ2, ρ) = c(λ1, λ2, ρ) φ(z) Φ(λ1z, λ2z; ρ), z ∈ R,

with λ1, λ2 ∈ R, |ρ| < 1, and Φ(·, ·; ρ) denotingthe cdf of BV N(0, 0, 1, 1, ρ).

For determining c(λ1, λ2, ρ), we note that

c(λ1, λ2, ρ) ≡ 1

a(λ1, λ2, ρ)=

P (Y1 < λ1X,Y2 < λ2X),

– p. 19/41

Lemma 1: We have

– p. 20/41

Lemma 1: We have

a(λ1, λ2, ρ) = P (Y1 < λ1X,Y2 < λ2X)

2πcos−1

−(ρ + λ1λ2)√

1 + λ21

1 + λ22

[Kotz, Balakrishnan and Johnson (2000)].

– p. 20/41

Lemma 1: We have

a(λ1, λ2, ρ) = P (Y1 < λ1X,Y2 < λ2X)

2πcos−1

−(ρ + λ1λ2)√

1 + λ21

1 + λ22

[Kotz, Balakrishnan and Johnson (2000)].The generalized skew-normal pdf is then

– p. 20/41

Lemma 1: We have

a(λ1, λ2, ρ) = P (Y1 < λ1X,Y2 < λ2X)

2πcos−1

−(ρ + λ1λ2)√

1 + λ21

1 + λ22

[Kotz, Balakrishnan and Johnson (2000)].The generalized skew-normal pdf is then

ϕ(z;λ1, λ2, ρ) =2π

cos−1

−(ρ+λ1λ2)√1+λ2

√1+λ2

)φ(z)Φ(λ1z, λ2z; ρ)

for z, λ1, λ2 ∈ R, |ρ| < 1.– p. 20/41

Let Φ(·, ·; δ) denote cdf of BV N(0, 0, 1, 1, δ),and Φ(·; θ) denote cdf of SN(θ).

– p. 21/41

Lemma 2: We then have

– p. 21/41

Lemma 2: We then have

Φ(0, 0; δ) =1

2πcos−1(−δ),

Φ(γx, 0; δ) = Φ(0, γx; δ) =1

γx;−δ√1 − δ2

Φ(γ1x, γ2x; δ) =1

2Φ(γ1x; η1) + Φ(γ2x; η2) − I(γ1γ2) ,

where I(a) = 0 if a > 0 and 1 if a < 0,

η1 =1√

1 − δ2

− δ

, η2 =1√

1 − δ2

− δ

.– p. 21/41

Theorem 1: If M(t;λ1, λ2, ρ) is the MGF ofZλ1,λ2,ρ ∼ GSN(λ1, λ2, ρ), then

– p. 22/41

M(t;λ1, λ2, ρ) =2π

cos−1

−(ρ+λ1λ2)√1+λ2

√1+λ2

) et2/2

λ1t√

1 + λ21

1 + λ22

;ρ + λ1λ2

1 + λ21

1 + λ22

– p. 22/41

M(t;λ1, λ2, ρ) =2π

cos−1

−(ρ+λ1λ2)√1+λ2

√1+λ2

) et2/2

λ1t√

1 + λ21

1 + λ22

;ρ + λ1λ2

1 + λ21

1 + λ22

Corollary 1: Theorem 1 yields, for example,

E [Zλ1,λ2,ρ] =

cos−1

−(ρ+λ1λ2)√1+λ2

√1+λ2

λ1√

1 + λ21

1 + λ22

– p. 22/41

OS from TVN Distribution

Let (W1,W2,W3) ∼ TV N(0,Σ), where

– p. 23/41

σ21 ρ12σ1σ2 ρ13σ1σ3

ρ12σ1σ2 σ22 ρ23σ2σ3

ρ13σ1σ3 ρ23σ2σ3 σ23

is a positive definite matrix.

– p. 23/41

σ21 ρ12σ1σ2 ρ13σ1σ3

ρ12σ1σ2 σ22 ρ23σ2σ3

ρ13σ1σ3 ρ23σ2σ3 σ23

Let W1:3 = min(W1,W2,W3) < W2:3 < W3:3 =max(W1,W2,W3) denote the order statisticsfrom (W1,W2,W3).

– p. 23/41

σ21 ρ12σ1σ2 ρ13σ1σ3

ρ12σ1σ2 σ22 ρ23σ2σ3

ρ13σ1σ3 ρ23σ2σ3 σ23

Let W1:3 = min(W1,W2,W3) < W2:3 < W3:3 =max(W1,W2,W3) denote the order statisticsfrom (W1,W2,W3).

Let Fi(t; Σ) denote the cdf of Wi:3, i = 1, 2, 3.– p. 23/41

OS from TVN Distribution(cont.)

Theorem 2: The cdf of W3:3 is the mixture

– p. 24/41

F3(t; Σ) = a(θ1)Φ

σ1; θ1

+ a(θ2)Φ

σ2; θ2

+ a(θ3)Φ

σ3; θ3

– p. 24/41

F3(t; Σ) = a(θ1)Φ

σ1; θ1

+ a(θ2)Φ

σ2; θ2

+ a(θ3)Φ

σ3; θ3

where Φ(·;θ) denotes the cdf of GSN(θ),

a(λ1, λ2, ρ) =1

2πcos−1

−(ρ + λ1λ2)q

1 + λ21

1 + λ22

σ2− ρ12

1 − ρ212

σ3− ρ13

1 − ρ213

,ρ23 − ρ12ρ13

1 − ρ212

1 − ρ213

σ1− ρ12

1 − ρ212

σ3− ρ23

1 − ρ223

,ρ13 − ρ12ρ23

1 − ρ212

1 − ρ223

σ1− ρ13

1 − ρ213

σ2− ρ23

1 − ρ223

,ρ12 − ρ13ρ23

1 − ρ213

1 − ρ223

– p. 24/41

Theorem 3: The mgf of W3:3 is

– p. 25/41

M3(s; Σ) = es2/2

1 − ρ12

2s; α1

1 − ρ13

2s;α2

1 − ρ23

2s;α3

– p. 25/41

M3(s; Σ) = es2/2

1 − ρ12

2s; α1

1 − ρ13

2s;α2

1 − ρ23

2s;α3

where Φ(·; θ) is the cdf of SN(θ), and

α1 =1 + ρ12 − ρ13 − ρ23√

α2 =1 + ρ13 − ρ12 − ρ23√

α3 =1 + ρ23 − ρ12 − ρ13√

A = 6 −˘

(1 + ρ12)2 + (1 + ρ13)2 + (1 + ρ23)2¯

+2 (ρ12ρ13 + ρ12ρ23 + ρ13ρ23) .– p. 25/41

– p. 26/41

E (W3:3) =1

1 − ρ12 +√

1 − ρ13 +√

1 − ρ23

– p. 26/41

E (W3:3) =1

1 − ρ12 +√

1 − ρ13 +√

1 − ρ23

andV ar (W3:3) = 1 +

2π− E2 (W3:3) .

– p. 26/41

E (W3:3) =1

1 − ρ12 +√

1 − ρ13 +√

1 − ρ23

andV ar (W3:3) = 1 +

2π− E2 (W3:3) .

Remark 5: Similar mixture forms can bederived for the cdf of W1:3 and W2:3, and fromthem explicit expressions for their mgf,moments, etc.

– p. 26/41

Problems for further study:

– p. 27/41

Can we make use of these results concerningdistributions and moments of OS to developsome efficient inferential methods?

– p. 27/41

How far can these results be generalized toobtain such explicit expressions for the caseof OS from MVN?

– p. 27/41

OS Induced by LinearFunctions

Let X i = (X1i, . . . , Xpi)T , i = 1, . . . , n, be iid

observations from MV N(µ,Σ), whereµ = (µ1, . . . , µp)

T and Σ = ((σij)).

– p. 28/41

Let P = i1, . . . , im (m ≥ 1) be a partition of1, . . . , p, and Q its complementary partition.

– p. 28/41

Let C = (c1, . . . , cp)T be a vector of non-zero

constants.

– p. 28/41

Let C = (c1, . . . , cp)T be a vector of non-zero

constants.

Further, let us define for j = 1, . . . , n,

Xj =∑

ci Xij = CTP Xj, Yj =

ci Xij = CTQ Xj.

– p. 28/41

OS Induced by LinearFunctions (cont.)

Lemma 3: Evidently,

– p. 29/41

Lemma 3: Evidently,

µX = E(Xj) = CTP µ,

µY = E(Yj) = CTQ µ,

σ2X = V ar(Xj) = CT

P Σ CP ,

σ2Y = V ar(Yj) = CT

Q Σ CQ,

σX,Y = Cov(Xj, Yj) = CTP Σ CQ,

ρ =σX,Y

σXσY=

CTP Σ CQ

(CTP Σ CP )(CT

Q Σ CQ).

– p. 29/41

Now, let

Sj = Xj + Yj = CTP Xj + CT

QXj = CT Xj

for j = 1, . . . , n.

– p. 30/41

Now, let

QXj = CT Xj

for j = 1, . . . , n.

Let S1:n ≤ S2:n ≤ · · · ≤ Sn:n denote the orderstatistics of Sj ’s.

– p. 30/41

Now, let

QXj = CT Xj

for j = 1, . . . , n.

Let S1:n ≤ S2:n ≤ · · · ≤ Sn:n denote the orderstatistics of Sj ’s.

Let X [k:n] be the k-th induced multivariateorder statistic; i.e.,

X [k:n] = Xj whenever Sk:n = Sj .

– p. 30/41

Example 1: While analyzing extreme lakelevels in hydrology, the annual maximum levelat a location in the lake is a combination ofthe daily water level averaged over the entirelake and the up surge in local water levelsdue to wind effects at that site [ Song,Buchberger & Deddens (1992)].

– p. 31/41

Example 1: While analyzing extreme lakelevels in hydrology, the annual maximum levelat a location in the lake is a combination ofthe daily water level averaged over the entirelake and the up surge in local water levelsdue to wind effects at that site [ Song,Buchberger & Deddens (1992)].

Example 2: While evaluating the performanceof students in a course, the final grade mayoften be a weighted average of the scores inmid-term tests and the final examination.

– p. 31/41

Theorem 4: We have

X[k:n] = CTP X [k:n] and Y[k:n] = CT

Q X [k:n].

– p. 32/41

Theorem 4: We have

Q X [k:n].

Consequently, we readily obtain

E(X[k:n]) = CTP µ + αk:n

CTP ΣCP + CT

P ΣCQq

CTP ΣCP + CT

QΣCQ + 2CTP ΣCQ

where αk:n is the mean of the k-th OS from asample of size n from N(0, 1).

– p. 32/41

Theorem 4: We have

Q X [k:n].

Consequently, we readily obtain

E(X[k:n]) = CTP µ + αk:n

CTP ΣCP + CT

P ΣCQq

CTP ΣCP + CT

QΣCQ + 2CTP ΣCQ

where αk:n is the mean of the k-th OS from asample of size n from N(0, 1).

Similar expressions can be derived forV ar(X[k:n]) and other moments.

– p. 32/41

By choosing the partitions P and Q suitably, we havethe following results for within a concomitant OS.

– p. 33/41

Theorem 5: For k = 1, . . . , n, we obtain

– p. 33/41

Theorem 5: For k = 1, . . . , n, we obtain

E(Xi[k:n]) = µi + αk:n

Ppr=1 crσir

Ppr=1 crcsσrs

, i = 1, . . . , p,

V ar(Xi[k:n]) = σii − (1 − βk,k:n)

r=1 crσir

Ppr=1 crcsσrs

, i = 1, . . . , p,

Cov(Xi[k:n], Xj[k:n])) = σij − (1 − βk,k:n)

Ppr=1 crcsσirσjs

Ppr=1 crcsσrs

1 ≤ i < j ≤ p,

where βk,k:n is the variance of the k-th OSfrom a sample of size n from N(0, 1).

– p. 33/41

By choosing the partitions P and Q suitably, we havethe following results for between concomitant OS.

– p. 34/41

Theorem 6: For 1 ≤ k < ℓ < n, we obtain

– p. 34/41

Theorem 6: For 1 ≤ k < ℓ < n, we obtain

Cov(Xi[k:n],Xi[ℓ:n]) = βk,ℓ:n

r=1 crσir)2

∑ps=1

∑pr=1 crcsσrs

i = 1, . . . , p,

Cov(Xi[k:n],Xj[ℓ:n])) = βk,ℓ:n)

∑ps=1

∑pr=1 crcsσirσjs

∑ps=1

∑pr=1 crcsσrs

1 ≤ i < j ≤ p,

where βk,ℓ:n is covariance between k-th andℓ-th OS from a sample of size n from N(0, 1).

– p. 34/41

Corollary 3: V ar(X[k:n]) and V ar(Y[k:n]) canbe rewritten as

– p. 35/41

V ar(X[k:n]) = σ2X − (1 − βk,k:n)

aσ2X + bρσXσY

V ar(Y[k:n]) = σ2Y − (1 − βk,k:n)

bσ2Y + aρσXσY

where ∆ = a2σ2X + b2σ2

Y + 2abρσXσY .

– p. 35/41

V ar(X[k:n]) = σ2X − (1 − βk,k:n)

aσ2X + bρσXσY

V ar(Y[k:n]) = σ2Y − (1 − βk,k:n)

bσ2Y + aρσXσY

where ∆ = a2σ2X + b2σ2

Y + 2abρσXσY .Now, since βj,k:n > 0 [Bickel (1967)]and

∑nj=1 βj,k:n = 1 for 1 ≤ k ≤ n

[Arnold, Balakrishnan & Nagaraja (1992)],we have 0 < βk,k:n < 1.

– p. 35/41

Consequently, we observe that

V ar(X[k:n]) < σ2X and V ar(Y[k:n]) < σ2

for all k = 1, . . . , n.

– p. 36/41

Consequently, we observe that

V ar(X[k:n]) < σ2X and V ar(Y[k:n]) < σ2

for all k = 1, . . . , n.

Using a similar argument, it can be shownthat

V ar(Xi[k:n]) < σii

for i = 1, . . . , p and all k = 1, . . . , n.

– p. 36/41

OS from BV and TV tDistributions

Arellano-Valle and Azzalini (2006) presentedunified multivariate skew-elliptical distributionthrough conditional distributions.

– p. 37/41

A special case of this distribution is ageneralized skew-tν distribution.

– p. 37/41

A special case of this distribution is ageneralized skew-tν distribution.

Now, using skew-tν and generalized skew-tνdistributions, distributions and properties ofOS from BV and TV tν distributions can bestudied.

– p. 37/41

OS from BV and TV tDistributions (cont.)

– p. 38/41

How far can these results be generalized toobtain such explicit expressions for the caseof OS from MV tν?

– p. 38/41

How far can these results be generalized toobtain such explicit expressions for the caseof OS from MV tν?

Can we do this work more generally in termsof elliptically contoured distributions, forexample?

– p. 38/41

Bibliography

Most pertinent papers are:

– p. 39/41

Bibliography

Most pertinent papers are:

Arellano-Valle, R.B. and Azzalini, A. (2006). Scand. J. Statist.

Arnold, B.C. and Beaver, R.J. (2002). Test.

Azzalini, A. (1985). Scand. J. Statist.

Balakrishnan, N. (1993). Statist. Probab. Lett.

Basu, A.P. and Ghosh, J.K. (1978). J. Multivar. Anal.

Bickel, P.J. (1967). Proc. of 5th Berkeley Symposium.

Birnbaum, Z.W. (1950). Ann. Math. Statist.

Cain, M. (1994). Amer. Statist.

Cain, M. and Pan, E. (1995). Math. Scientist.

Gonzalez-Farias, G., Dominguez-Molina, A. and Gupta, A.K. (2004).J. Statist. Plann. Inf.

Gupta, S.S. and Pillai, K.C.S. (1965). Biometrika.

Nagaraja, H.N. (1982). Biometrika.– p. 39/41

Bibliography

Nelson, L.S. (1964). Technometrics.

O’Hagan, A. and Leonard, T. (1976). Biometrika.

Song, R., Buchberger, S.G. and Deddens, J.A. (1992). Statist. Probab. Lett.

Weinstein, M.A. (1964). Technometrics.

– p. 40/41

Bibliography (cont.)

Most pertinent books are:

– p. 41/41

Bibliography (cont.)

Most pertinent books are:

Arnold, B.C. and Balakrishnan, N. (1989). Relations, Bounds andApproximations for Order Statistics, Springer-Verlag, New York.

Arnold, B.C., Balakrishnan, N. and Nagaraja, H.N. (1992). A First Course inOrder Statistics, John Wiley & Sons, New York.

Balakrishnan, N. and Cohen, A.C. (1991). Order Statistics and Inference:Estimation Methods, Academic Press, Boston.

Balakrishnan, N. and Rao, C.R. (Eds.) (1998a,b). Handbook of Statistics:Order Statistics, Vols. 16 & 17, North-Holland, Amsterdam.

David, H.A. (1970, 1981). Order Statistics, 1st and 2nd editions, John Wiley& Sons, New York.

David, H. A. and Nagaraja, H.N. (2003). Order Statistics, 3rd edition, JohnWiley & Sons, Hoboken, New Jersey.

Kotz, S., Balakrishnan, N. and Johnson, N.L. (2000). ContinuousMultivariate Distributions – Vol. 1, Second edition, John Wiley & Sons,New York. – p. 41/41

a skewed look at bivariate and multivariate order statistics · 2007. 6. 29. · a skewed look at...

Documents

masters thesis - connecting repositories · 5.3...

on the bivariate and multivariate weighted generalized...

alt.binaries.humor skewed

chapter 3: descriptive study of bivariate data. univariate...

bivariate b-spline outline multivariate b-spline [neamtu 04]...

scatter diagram of bivariate measurement data. bivariate...

[chapter 5. multivariate probability...

modeling and simulation of bivariate gaussian random fields...

a gis-based comparative study of frequency ratio...

investigation of the effect of job … · organizational...

errata for warner, r. m. (2008). applied statistics: from...

multivariate statistical...

bivariate & multivariate

bivariate and multivariate spectrophotometric methods for...

topologically guaranteed bivariate solutions of under...

multivariate skewed responses: new semiparametric regression...

constructions for a bivariate beta distribution ·...

[part 4] 1/43 discrete choice modeling bivariate &...

introduction to bivariate analysis -...

bivariate statistics (continued) & multivariate statistics)