16
A comparative analysis of Spearman’s rho and Kendall’s tau in normal and contaminated normal models Weichao Xu a,n , Yunhe Hou b , Y.S. Hung b , Yuexian Zou c a Department of Automatic Control, School of Automation, Guangdong University of Technology, Guangzhou, Guangdong 510006, PR China b Department of Electrical & Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong c Advanced Digital Signal Processing Lab, Peking University Shenzhen Graduate School, Shenzhen, Guangdong 518055, PR China article info Article history: Received 13 September 2011 Received in revised form 20 July 2012 Accepted 6 August 2012 Available online 14 August 2012 Keywords: Bivariate normal Correlation theory Kendall’s tau Pearson’s product moment correlation coefficient Spearman’s rho abstract This paper analyzes the performances of Spearman’s rho (SR) and Kendall’s tau (KT) with respect to samples drawn from bivariate normal and contaminated normal populations. Theoretical and simulation results suggest that, contrary to the opinion of equivalence between SR and KT in some literature, the behaviors of SR and KT are strikingly different in the aspects of bias effect, variance, mean square error (MSE), and asymptotic relative efficiency (ARE). The new findings revealed in this work provide not only deeper insights into the two most widely used rank-based correlation coefficients, but also a guidance for choosing which one to use under the circumstances where Pearson’s product moment correlation coefficient (PPMCC) fails to apply. & 2012 Elsevier B.V. All rights reserved. 1. Introduction Correlation analysis is among the core research para- digms in nearly all branches of scientific and engineering fields, not to mention the area of signal processing [117]. Being interpreted as the strength of statistical relationship between two random variables [18], correlation should be large and positive if there is a high probability that large (small) values of one variable occur in conjunction with large (small) values of another; and it should be large and negative if the direction is reversed [19]. A number of methods have been proposed and applied in the literature to assess the correlation between two random variables. Among these methods Pearson’s product moment correla- tion coefficient (PPMCC) [20,21], Spearman’s rho (SR) [22] and Kendall’s tau (KT) [22] are perhaps the most widely used [23]. The properties of PPMCC in bivariate normal samples (binormal model) is well known thanks to the funda- mental work of Fisher [20]. It follows that, in the normal cases, (1) PPMCC is an asymptotically unbiased estimator of the population correlation r and (2) the variance of PPMCC approaches the Cramer–Rao lower bound (CRLB) as the sample size increases [18]. Due to its optimality, PPMCC has and will continue to play the dominant role when quantifying the intensity of correlation between bivariate random variables in the literature. However, sometimes the PPMCC might not be applicable when the following scenarios happen: 1. The data is incomplete, that is, only ordinal information (e.g. ranks) is available. This situation is not uncommon in the area of social sciences, such as psychology and education [22]. 2. The underlying data is complete (cardinal) and follows a bivariate normal distribution, but is attenuated more Contents lists available at SciVerse ScienceDirect journal homepage: www.elsevier.com/locate/sigpro Signal Processing 0165-1684/$ - see front matter & 2012 Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.sigpro.2012.08.005 n Corresponding author. E-mail addresses: [email protected] (W. Xu), [email protected] (Y. Hou), [email protected] (Y.S. Hung), [email protected] (Y. Zou). Signal Processing 93 (2013) 261–276

A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Contents lists available at SciVerse ScienceDirect

Signal Processing

Signal Processing 93 (2013) 261–276

0165-16

http://d

n Corr

E-m

yhhou@

zouyx@

journal homepage: www.elsevier.com/locate/sigpro

A comparative analysis of Spearman’s rho and Kendall’s tauin normal and contaminated normal models

Weichao Xu a,n, Yunhe Hou b, Y.S. Hung b, Yuexian Zou c

a Department of Automatic Control, School of Automation, Guangdong University of Technology, Guangzhou, Guangdong 510006, PR Chinab Department of Electrical & Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kongc Advanced Digital Signal Processing Lab, Peking University Shenzhen Graduate School, Shenzhen, Guangdong 518055, PR China

a r t i c l e i n f o

Article history:

Received 13 September 2011

Received in revised form

20 July 2012

Accepted 6 August 2012Available online 14 August 2012

Keywords:

Bivariate normal

Correlation theory

Kendall’s tau

Pearson’s product moment correlation

coefficient

Spearman’s rho

84/$ - see front matter & 2012 Elsevier B.V. A

x.doi.org/10.1016/j.sigpro.2012.08.005

esponding author.

ail addresses: [email protected] (W. Xu),

eee.hku.hk (Y. Hou), [email protected] (Y.S

szpku.edu.cn (Y. Zou).

a b s t r a c t

This paper analyzes the performances of Spearman’s rho (SR) and Kendall’s tau (KT)

with respect to samples drawn from bivariate normal and contaminated normal

populations. Theoretical and simulation results suggest that, contrary to the opinion

of equivalence between SR and KT in some literature, the behaviors of SR and KT are

strikingly different in the aspects of bias effect, variance, mean square error (MSE), and

asymptotic relative efficiency (ARE). The new findings revealed in this work provide not

only deeper insights into the two most widely used rank-based correlation coefficients,

but also a guidance for choosing which one to use under the circumstances where

Pearson’s product moment correlation coefficient (PPMCC) fails to apply.

& 2012 Elsevier B.V. All rights reserved.

1. Introduction

Correlation analysis is among the core research para-digms in nearly all branches of scientific and engineeringfields, not to mention the area of signal processing [1–17].Being interpreted as the strength of statistical relationshipbetween two random variables [18], correlation should belarge and positive if there is a high probability that large(small) values of one variable occur in conjunction withlarge (small) values of another; and it should be largeand negative if the direction is reversed [19]. A number ofmethods have been proposed and applied in the literatureto assess the correlation between two random variables.Among these methods Pearson’s product moment correla-tion coefficient (PPMCC) [20,21], Spearman’s rho (SR) [22]

ll rights reserved.

. Hung),

and Kendall’s tau (KT) [22] are perhaps the most widelyused [23].

The properties of PPMCC in bivariate normal samples(binormal model) is well known thanks to the funda-mental work of Fisher [20]. It follows that, in the normalcases, (1) PPMCC is an asymptotically unbiased estimatorof the population correlation r and (2) the variance ofPPMCC approaches the Cramer–Rao lower bound (CRLB)as the sample size increases [18]. Due to its optimality,PPMCC has and will continue to play the dominant rolewhen quantifying the intensity of correlation betweenbivariate random variables in the literature. However,sometimes the PPMCC might not be applicable when thefollowing scenarios happen:

1.

The data is incomplete, that is, only ordinal information(e.g. ranks) is available. This situation is not uncommonin the area of social sciences, such as psychology andeducation [22].

2.

The underlying data is complete (cardinal) and followsa bivariate normal distribution, but is attenuated more
Page 2: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276262

or less by some monotone nonlinearity in the transfercharacteristics of sensors [24].

3.

The data is complete and the majority follows abivariate normal distribution, but there exists a tinyfraction of outliers with very large variance (impulsivenoise) [25–27].

Under these circumstances, it would be more suitable toemploy the two most popular nonparametric coefficients,SR and KT, which are (1) dependant only on ranks, (2)invariant under increasing monotone transformations[22], and (3) robust against impulsive noise [28,29].Owing to these desired properties, SR and KT have drawnattentions of signal processing practitioners [30–33] inthe late 1960s shortly after their being thoroughly studiedin the literature of statistics [34–45]. However, despitethe rich history of SR and KT, it is still unclear which oneshould we use in Scenarios 1–3 where the familiar PPMCCis inapplicable, even under the fundamental binormalmodel. Some researchers, such as Fieller et al. [44],preferred KT to SR based on empirical evidences; whileothers, such as Gilpin [46], asserted that SR and KT areequivalent in terms of hypothesis testing.

Aiming at resolving such inconsistency, in this workwe investigate systematically the properties of SR and KTunder the binormal model [20]. Moreover, to deal withScenario 3 mentioned above, we also investigate theirproperties under the contaminated binormal model [28].

The rest part of this paper is structured as follows.Section 2 gives some basic definitions and summarizesthe general properties of PPMCC, SR and KT. In Section 3,we lay the foundation of the theoretical framework in thisstudy by outlining some critical results in the binormalmodel. Section 4 establishes the closed form formulaewith respect to the expectations of SR and KT in thecontaminated normal model. In Section 5, we focus on theperformances of the three estimators of r constructedfrom SR and KT. Section 6 verifies the analytical resultswith Monte Carlo simulations. Finally, in Section 7 weprovide our answers to the above raised question con-cerning the choice of Spearman’s rho and Kendall’s tau inpractice when PPMCC fails to apply.

2. Basic definitions and general properties

2.1. Definitions

Let fðXi,YiÞgni ¼ 1 denote n independent and identically

distributed (i.i.d.) data pairs drawn from a bivariatepopulation with continuous joint distribution. Let Pj

(resp. Qj) be the rank of Xj among X1, . . . ,Xn (resp. Yj

among Y1, . . . ,Yn) [22]. Let X and Y be the arithmeticmean values of Xi and Yi, respectively. Let sgnð�Þ stand forthe sign of the argument. The sample versions of threewell known classical correlation coefficient, PPMCC (rP),SR (rS), and KT (rK), are then defined as follows [19]:

rPðXi,YiÞ9Pn

i ¼ 1ðXi�X ÞðYi�Y Þ

½Pn

i ¼ 1ðXi�X Þ2Pn

i ¼ 1ðYi�Y Þ2�1=2ð1Þ

rSðXi,YiÞ91�6Pn

i ¼ 1ðPi�QiÞ2

nðn2�1Þð2Þ

rK ðXi,YiÞ9

Pniaj ¼ 1 sgnðXi�XjÞsgnðYi�YjÞ

nðn�1Þð3Þ

To ease the following discussion, we will employ thesymbol rZðXi,YiÞ, Z 2 fP,S,Kg as a compact notation for thethree coefficients. For brevity, the arguments of rZðXi,YiÞ

will be dropped in the sequel unless ambiguity occurs.

2.2. General properties

It follows that coefficients rZ, Z 2 fP,S,Kg, possess thefollowing general properties:

1.

rZ 2 ½�1,1� for all (X,Y) (standardization); 2. rZðXi,YiÞ ¼ rZðYi,XiÞ (symmetry); 3. rZ ¼ 71 if Y is a positive (negative) linear transforma-

tion of X (shift and scale invariance);

4. rS ¼ rK ¼ 71 if Y is a monotone increasing (decreasing)

function of X (monotone invariance);

5. the expectations of rZ equal zero if X and Y are

independent (independence);

6. rZðXi,YiÞ ¼ �rZð�Xi,YiÞ ¼�rZðXi,�YiÞ ¼ rZð�Xi,�YiÞ; 7. rZ converges to normal distribution when the sample

size n is large.

Note that the first six properties are discussed in [19,23],and the last property follows from the asymptotic theoryof U-statistics established by Hoeffding [38].

3. Auxiliary results in normal cases

In this section we provide some prerequisites concerningthe orthant probabilities of normal distributions. Theseprobabilities, contained in Lemma 1, are critical for thedevelopment of Lemmas 3 and 4 later on. Moreover, somewell known results about the expectation and variance ofPPMCC, SR and KT are collected in Lemma 2 for ease ofexposition. For convenience, we use symbols Eð�Þ, Vð�Þ, Cð�,�Þ,and corrð�,�Þ in the sequel to denote the mean, variance,covariance, and correlation of (between) random variables,respectively. Symbols of big oh and little oh are utilized tocompare the magnitudes of two functions u(t) and v(t) asthe argument t tends to a limit L (might be infinite). Thenotation uðtÞ ¼OðvðtÞÞ, t-L, denotes that 9uðtÞ=vðtÞ9 remainsbounded as t-L; whereas the notation uðtÞ ¼ oðvðtÞÞ, t-L,denotes that uðtÞ=vðtÞ-0 as t-L [47]. Symbols ofP0

mðZ1, . . . ,ZmÞ9PðZ140, . . . ,Zm40Þ are adopted to denotethe positive orthant probabilities associated with multivari-ate normal random vectors ½Z1 � � � Zm� of dimensionsm¼ 1, . . . ,4, respectively. The notation RðRrsÞm�m standsfor correlation matrix with each element Rrs9corrðZr ,ZsÞ,r,s¼ 1, . . . ,m. Obviously the diagonal entries in R are allunities. For compactness, we will also use the symbol P0

mðRÞ

to denote P0mðZ1, . . . ,ZmÞ in the sequel. Throughout the

symbol C stands for ‘‘approximately equals to’’.

Page 3: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276 263

3.1. Orthant probabilities for normal distributions

Lemma 1. Assume that ðZ1,Z2,Z3,Z4Þ follow a quadrivariate

normal distribution with zero means and correlation matrix

R¼ Rrs

� �4�4

. Define

HðtÞ91 ðt40Þ

0 ðtr0Þ

(ð4Þ

Then the orthant probabilities

P01ðZ1Þ ¼ EfHðZ1Þg ¼

12 ð5Þ

P02ðZ1,Z2Þ ¼ EfHðZ1ÞHðZ2Þg ¼

1

41þ

2

psin�1 R12

� �ð6Þ

P03ðZ1,Z2,Z3Þ ¼ EfHðZ1ÞHðZ2ÞHðZ3Þg

¼1

81þ

2

pX2

r ¼ 1

X3

s ¼ rþ1

sin�1 Rrs

!ð7Þ

P04ðZ1,Z2,Z3,Z4Þ ¼ EfHðZ1ÞHðZ2ÞHðZ3ÞHðZ4Þg

¼1

161þ

2

pX3

r ¼ 1

X4

s ¼ rþ1

sin�1 RrsþW

!ð8Þ

where

W91

p4

ZZZZ þ1�1

exp �1

2zRzT

� �z1z2z3z4

dz1 dz2 dz3 dz4 ð9Þ

¼X4

‘ ¼ 2

4

p2

Z 1

0

R1‘

½1�R21‘u

2�1=2sin�1 a‘ðuÞ

b‘ðuÞg‘ðuÞ

� �du ð10Þ

with

a2 ¼ R34�R23R24�½R13R14þR12ðR12R34�R14R23�R13R24Þ�u2

a3 ¼ R24�R23R34�½R12R14þR13ðR13R24�R14R23�R12R34Þ�u2

a4 ¼ R23�R24R34�½R12R13þR14ðR14R23�R13R24�R12R34Þ�u2

b2 ¼ b3 ¼ ½1�R223�ðR

212þR

213�2R12R13R23Þu

2�1=2

g2 ¼ b4 ¼ ½1�R224�ðR

212þR

214�2R12R14R24Þu

2�1=2

g3 ¼ g4 ¼ ½1�R234�ðR

213þR

214�2R13R14R34Þu

2�1=2:

Proof. The first statement (5) is trivial. The second one(6) is usually called Sheppard’s theorem in the literature,although it was proposed earlier by Stieltjes [48]. Thethird one (7) is a simple generalization of Sheppard’stheorem based on the relationship [49]

P03 ¼

1

21�

X3

r ¼ 1

P01ðZrÞþ

X2

r ¼ 1

X3

s ¼ rþ1

P02ðZr ,ZsÞ

" #:

The last one (8) is due to Childs [50] and is termed Childs’sreduction formula throughout. &

3.2. Some well known results

Lemma 2. Let fðXi,YiÞgni ¼ 1 denote n i.i.d. bivariate normal

data pairs with correlation coefficient r. Write S19sin�1 r

and S29sin�1 12r. Then

EðrPÞ ¼ r 1�1�r2

2nþOðn�2Þ

� �-r as n-1 ð11Þ

VðrPÞ ¼ð1�r2Þ

2

n�1þOðn�2Þ ð12Þ

EðrSÞ ¼6

pðnþ1Þsin�1rþðn�2Þsin�1 r

2

h ið13Þ

-6

p sin�1 r2

as n-1 ð14Þ

EðrK Þ ¼2

p sin�1r ð15Þ

VðrK Þ ¼2

nðn�1Þ1�

4S21

p2þ2ðn�2Þ

1

9�

4S22

p2

!" #: ð16Þ

Proof. The first three results, (11)–(13), were given byHotelling [43], Fisher [21], and Moran [37], respectively;whereas the last two results, (15) and (16), were derivedby Esscher [51]. &

3.3. Variance of SR

Lemma 3. Let fðXi,YiÞgni ¼ 1, S1 and S2 be defined as in

Lemma 2. Let Wc ,Wd, . . . ,Wq be defined as in (9) with

respect to Rc ,Rd, . . . ,Rq that are tabulated in Table 1. Then

the variance of rSðX,YÞ is

VðrSÞ ¼6

nðnþ1Þþ

9ðn�2Þðn�3Þ

nðn2�1Þðnþ1Þ½ðn�4ÞO1ðrÞþO2ðrÞ�

�36

p2nðn2�1Þðnþ1Þ½3ðn�2Þð3n2�15nþ22ÞS2

2

þ12ðn�2Þ2S1S2�2ðn�3ÞS21� ð17Þ

where

O1ðrÞ ¼Wcþ8Wdþ2Wf ð18Þ

O2ðrÞ ¼ 6Wgþ8Whþ6Wlþ2WnþWoþ13: ð19Þ

Moreover, when n is sufficiently large,

VðrSÞC1

n9O1ðrÞ�

324S22

p2

" #: ð20Þ

Proof. Using the technique developed by David andMallows [45], it follows that the variance of rS dependson a weighted summation of 24 quadrivariate normalorthant probabilities corresponding to Rc ,Rd, . . . ,Rq listedin Table 1. The lemma thus follows with the assistance of(10) in Lemma 1 after some tedious but straightforwardderivations. See also the work of Croux and Dehon [29]and [52] for further discussion. &

Remark 1. Due to the complicated integrals involved inthe expressions of W-terms in O1ðrÞ and O2ðrÞ, thevariance of rS cannot be expressed into elementaryfunctions in general. Nevertheless, exact results can beobtained when r assumes some particular values. It can

Page 4: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Table 1

Quantities for evaluation of EðS2Þ in Lemma 3.

Corr. matrix No. of termsa Representative term Correlation coefficients P04ðZ1 ,Z2 ,Z3 ,Z4Þ

b Wð0Þ Wð1Þ

Z1 Z2 Z3 Z4 R12 R13 R14 R23 R24 R34

Ra n½6� X1�X2 Y1�Y3 X4�X5 Y4�Y6 1

2r 0 0 0 0 1

2r 1

16þ

S2

4p þS2

2

4p2

– –

Rb 2n½5� X1�X2 Y1�Y2 X3�X4 Y3�Y5 r 0 0 0 0 1

2r 1

16þ

S1

8p þS2

8p þS1S2

4p2

– –

Rc n½5� X1�X2 Y1�Y3 X1�X4 Y1�Y5 1

2r 1

2

1

2r 1

2r 1

2

1

2r 5

48þ

S2

2p þWcðrÞ

16

1

9

1

5Rd 4n½5� X1�X2 Y1�Y3 X2�X4 Y2�Y5 1

2r �

1

2�

1

2r 0 0 1

2r 1

24þ

S2

8p þWdðrÞ

16

0 1

15Re 2n½5� X1�X2 Y1�Y3 X4�X2 Y4�Y5 1

2r 1

2

0 0 0 1

2r 1

12þ

S2

4p þWeðrÞ

16

– –

Rf 2n½5� X1�X2 Y1�Y3 X4�X3 Y4�Y5 1

2r 0 0 1

2r 0 1

2r 1

16þ

3S2

8p þWf ðrÞ

16

0 2

15Rg 4n½4� X1�X2 Y1�Y2 X1�X3 Y1�Y4 r 1

2

1

2r 1

2r 1

2

1

2r 5

48þ

S1

8pþ

3S2

8pþ

Wg ðrÞ16

1

9

1

3Rh 4n½4� X1�X2 Y1�Y2 X3�X4 Y3�Y1 r 0

�1

2r 0

�1

2

1

2r 1

24þ

S1

8p þWhðrÞ

16

0 1

3Ri 4n½4� X1�X2 Y1�Y3 X3�X4 Y3�Y2 1

2r 0 1

2r �

1

2r �

1

2

1

2r 1

24þ

S2

4p– –

Rj n½4� X1�X2 Y1�Y2 X3�X4 Y3�Y4 r 0 0 0 0 r 1

16þ

S1

4p þS2

1

4p2

– –

Rk 2n½4� X1�X2 Y1�Y3 X1�X2 Y1�Y4 1

2r 1 1

2r 1

2r 1

2

1

2r 1

S2

2p– –

Rl 4n½4� X1�X2 Y1�Y3 X4�X1 Y4�Y3 1

2r �

1

2

0�

1

2r 1

2

1

2r 1

16þ

S2

8p þWlðrÞ

16�

1

9

0

Rm n½4� X1�X2 Y1�Y3 X4�X2 Y4�Y3 1

2r 1

2

0 0 1

2

1

2r 5

48þ

S2

4p þWmðrÞ

16

– –

Rn 2n½4� X1�X2 Y1�Y3 X2�X4 Y2�Y1 1

2r �

1

2

�r 0�

1

2

1

2r 1

48�

S1

8p þS2

4p þWnðrÞ

16

1

9

0

Ro n½4� X1�X2 Y1�Y3 X4�X3 Y4�Y2 1

2r 0 1

2r 1

2r 0 1

2r 1

16þ

S2

2p þWoðrÞ

16

0 1

3Rp 2n½4� X1�X2 Y1�Y2 X2�X3 Y2�Y4 r

�1

2�

1

2r �

1

2r �

1

2

1

2r 1

48þ

S1

8p �S2

8p þWpðrÞ

16

– –

Rq 4n½4� X1�X2 Y1�Y2 X3�X2 Y3�Y4 r 1

2

0 1

2r 0 1

2r 1

12þ

S1

8p þS2

4p þWqðrÞ

16

– –

Rr 3n½3� X1�X2 Y1�Y2 X1�X3 Y1�Y3 r 1

2

1

2r 1

2r 1

2

r 1

S1

4p þS2

4p þS2

1

4p2�

S22

4p2

– –

Rs n½3� X1�X2 Y1�Y3 X1�X2 Y1�Y3 1

2r 1 1

2r 1

2r 1 1

2r 1

S2

2p– –

Rt 2n½3� X1�X2 Y1�Y3 X3�X1 Y3�Y2 1

2r �

1

2

1

2r �r

�1

2

1

2r 1

36�

S1

8p þ3S2

8p �S2

1

8p2þ

S22

8p2

– –

Ru 4n½3� X1�X2 Y1�Y2 X1�X2 Y1�Y3 r 1 1

2r r 1

2

1

2r 1

S1

4p þS2

4p– –

Rv 4n½3� X1�X2 Y1�Y2 X3�X1 Y3�Y2 r�

1

2

1

2r �

1

2r 1

2

1

2r 1

18þ

S1

8p þS2

8p þS2

1

8p2�

S22

8p2

– –

Rw 2n½3� X1�X2 Y1�Y2 X3�X1 Y3�Y1 r�

1

2�

1

2r �

1

2r �

1

2

r 1

36þ

S1

4p �S2

4p þS2

1

4p2�

S22

4p2

– –

Rx n½2� X1�X2 Y1�Y2 X1�X2 Y1�Y2 r 1 r r 1 r 1

S1

2p– –

a The symbol n½k� is a compact notation of nðn�1Þ � � � ðn�kþ1Þ, with k being a positive integer.b The orthant probability P0

4ðZ1 ,Z2 ,Z3 ,Z4Þ9EfHðZ1ÞHðZ2ÞHðZ3ÞHðZ4Þg. Notations S19sin�1 r and S29sin�1 12r are used for brevity.

W. Xu et al. / Signal Processing 93 (2013) 261–276264

be shown that

O1ð0Þ ¼19 , O2ð0Þ ¼

59 ð21Þ

O1ð1Þ ¼ 1, O2ð1Þ ¼163 ð22Þ

Substituting r¼ 0 and (21) into (17) leads directly to

VðrSÞ9r ¼ 0 ¼1

n�1ð23Þ

which is a well known result [22]. Substituting r¼ 1 and(22) into (17) and (20) together with some simplificationsyields the following obvious result:

VðrSÞ9r ¼ 1 ¼ 0

And the same relationship also holds true for r¼�1 dueto symmetry.

3.4. Covariance between SR and KT

Lemma 4. Let fðXi,YiÞgni ¼ 1, S1 and S2 be defined as in

Lemma 2. Then the covariance between rSðX,YÞ and rK ðX,YÞ is

CðrS,rK Þ ¼12

nðn2�1Þ

7n�5

18þðn�4Þ

S21

p2�5ðn�2Þ

S22

p2

"

�6ðn�2Þ2S1S2

p2þðn�2Þðn�3ÞO3ðrÞ

�ð24Þ

Page 5: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

1 Note that here we only consider the centered and symmetric type

of contamination that is frequently encountered in the literature of

signal processing. For general situations, the readers are referred to [29].

W. Xu et al. / Signal Processing 93 (2013) 261–276 265

C12

nO3ðrÞ�6

S1S2

p2

� �ðas n largeÞ ð25Þ

where

O3ðrÞ ¼ 12WgþWh: ð26Þ

Proof. See Appendix A.

Corollary 1. In Lemma 4, the covariance CðrS,rK Þ can also

be expressed as

CðrS,rK Þ ¼12

nðn2�1Þ

ðnþ1Þ2

18þðn�4Þ

S21

p2�5ðn�2Þ

S22

p2

"

�6ðn�2Þ2S1S2

p2þ

2

p2ðn�2Þðn�3ÞO4ðrÞ

�ð27Þ

C12

n

1

18þ2

O4ðrÞp2�6

S1S2

p2

� �ðas n largeÞ ð28Þ

where

O4ðrÞ ¼Z r

0sin�1 x

3

� þ2sin�1 xffiffiffi

3p

� �� �dxffiffiffiffiffiffiffiffiffiffiffiffi1�x2p

�2

Z r

0sin�1 x

2

ffiffiffiffiffiffiffiffiffiffiffiffiffiffi1�x2

9�3x2

r !dxffiffiffiffiffiffiffiffiffiffiffiffi4�x2p

þ

Z r

0sin�1 x

2

5�x2

3�x2

� �dxffiffiffiffiffiffiffiffiffiffiffiffi4�x2p

�2

Z r

0sin�1 x

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1�x2

12�6x2

r !dxffiffiffiffiffiffiffiffiffiffiffiffi4�x2p

þ2

Z r

0sin�1 x

ffiffiffiffiffiffiffiffiffiffiffiffiffiffi3�x2

4�2x2

r !dxffiffiffiffiffiffiffiffiffiffiffiffi4�x2p ð29Þ

Proof. Inverting (8) yields

W ¼ 16P04�1�

2

pX3

r ¼ 1

X4

s ¼ rþ1

sin�1Rrs: ð30Þ

Combining (26) and (30), O3ðrÞ can be rewritten in termsof P0

4 and the correlation coefficients corresponding to Rg

and Rh in Appendix 2 of [45]. This leads to

O3ðrÞ ¼1

18þ

2

p2O4ðrÞ: ð31Þ

The corollary thus follows directly by substituting (31)into (24) and (25), respectively. &

Remark 2. Both (24) and (27) are exact for any value ofnZ4 and 9r9r1. However, they are of different useful-ness according to different numerical and analyticalpurposes. Formula (24) is more convenient for computa-tion; whereas (27) is more convenient for analysis. Forexample, performing the Taylor expansion to (28) withthe assistance of (29) gives

CðrS,rK ÞC2

3nð1�1:24858961r2þ0:06830496r4

þ0:07280482r6þ0:04025528r8þ0:02189277r10þ � � �Þ

ð32Þ

which agrees with the formula (51) obtained in [41],except for the coefficients of the last two terms, whichwe find to be 0.04025528 and 0.02189277, against their0.04025526 and 0.01641362, respectively. Since O4ðrÞ in

(27) is exact, we believe that (32) is more accurate than(51) in [41].

Remark 3. Due to the complicated integrals involved in(24) and (27), CðrS,rK Þ cannot be expressed in elementaryfunctions. However, exact results are attainable for r¼ 0and r¼ 1 (�1). It follows that

O3ð0Þ ¼1

18 ð33Þ

O3ð1Þ ¼12 ð34Þ

Substituting (33) into (24) yields

CðrS,rK Þ9r ¼ 0 ¼2

3n

nþ1

n�1ð35Þ

which is more readily to obtain on substitution of r¼ 0into (27). Regarding the case for r¼ 1, it is rather difficultby means of substituting r¼ 1 into (27) and evaluatingO4ð1Þ based on (29) thereafter. Fortunately, with thehelp of (34), it follows readily from (24) and (25)that CðrS,rK Þ9r ¼ 1 ¼ 0. Due to symmetry, we also haveCðrS,rK Þ9r ¼ �1 ¼ 0:

4. New results in contaminated normal model

The PPMCC is notoriously sensitive to the non-Gaussianity caused by impulsive contamination in thedata. Even a single outlier can distort severely the valueof PPMCC and hence result in misleading inference inpractice. Assume that (X,Y) obeys the following distribu-tion [28]:

ð1�EÞN ðmX ,mY ,s2X ,s2

Y ,rÞþEN ðmX ,mY ,l2Xs

2X ,l2

Ys2Y ,r0Þ ð36Þ

where 0rEr1, lX b1, and lY b1. Under this Gaussiancontamination model1 that frequently used in theliterature of robustness analysis [25–27], it has beenshown that, no matter how small E is, the expectationof the PPMCC EðrPÞ-r0 as lX-1 and lY-1 [28].On the other hand, as shown in the theorem below,SR and KT are more robust than PPMCC under themodel (36).

Theorem 1. Let fðXi,YiÞgni ¼ 1 be i.i.d. samples generated from

the model (36). Then for E around 0,

limlX-1lY -1

EðrK ÞC2

p ð1�2EÞsin�1rþ2E sin�1 r0h i

ð37Þ

limn-1lX-1lY -1

EðrSÞC6

pð1�3EÞsin�1 r

2þE sin�1 r0

h i: ð38Þ

where the order of taking limits is arbitrary.

Proof. See Appendix B.

Remark 4. It was stated without substantial argument in[28] that, under the model (36), the first approximation of

Page 6: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276266

EðrSÞ, in terms of E, is of the following form:

EðrSÞC6

pð1�EÞsin�1 r

2þE sin�1 r0

2

� �ðnÞ

as n-1, lX-1 and lY-1. This is quite inconsistentwith our result (38) in Theorem 1. We will resolve thecontroversy between (38) and (n) by Monte Carlo simula-tions in Section 6.

5. Estimators of the population correlation

In this section, we investigate the performance ofseveral estimators of r based on SR and KT in terms ofbias, MSE and ARE to PPMCC.

5.1. Asymptotic unbiased estimators

Inverting (11), (14) and (15), we have the followingestimators of r:

rP9rP ð39Þ

rS92 sinp6

rS

� ð40Þ

rK9 sinp2

rK

� : ð41Þ

Moreover, another estimator based on a mixture of rS andrP can be constructed as [22]

rM92 sinp6

rS�p2

rK�rS

n�2

� ð42Þ

based on the following relationship:

EðrSÞ ¼6

p S2þS1�3S2

nþ1

� �: ð43Þ

In the sequel we will focus on the properties of theestimators defined in (39)–(42). Here the estimator rP in(39) is employed as a benchmark due to its optimality fornormal samples, in the sense of approaching the CRLB[18] when the sample size is sufficiently large.

5.2. Bias effect for small samples

It is noteworthy that the four estimators in (39)–(42)are unbiased only for large samples. When the samplesize is small, the bias effects, as shown in the followingtheorem, are not ignorable any more.

Theorem 2. Let rz, z 2 fP,S,K ,Mg, be defined as in (39)–(42),respectively. Define BIASz9Eðrz�rÞ. Let S1 and S2 bear the

same meanings as in Lemma 2. Write s2S9VðrSÞ, s2

K9VðrK Þ

and sS,K9CðrS,rK Þ. Then, under the same assumptions made

as in Lemma 3,

BIASP C�1

2nrð1�r2Þ ð44Þ

BIASSC

ffiffiffiffiffiffiffiffiffiffiffiffi4�r2

pnþ1

ðS1�3S2Þ�p2r72

s2S ð45Þ

BIASK C�p2r

8s2

K ð46Þ

BIASM C�1

72

p2rðn�2Þ2

½ðnþ1Þ2s2S�6ðnþ1ÞsS,Kþ9s2

K � ð47Þ

Proof. The first statement (44) follows directly from (11)in Lemma 2. Now we proceed to evaluate BIASS, BIASK

and BIASM . For convenience, write rS9EðrSÞ, rK 9EðrK Þ,dS9rS�rS , and dK9rK�rK . Expanding (40) around rS

yields

rS ¼ 2 sinp6

rS

� þp3

cosp6

rS

� dS�

p2

36sin

p6

rS

� d2

Sþ � � � :

ð48Þ

Taking expectation of both sides in (48), applyingEðdSÞ ¼ 0, Eðd2

S Þ ¼ s2S and ignoring the high order infinite-

simals, we have

EðrSÞC2 sinp6

rS

� �p2

36sin

p6

rS

� s2

S : ð49Þ

Substituting (43) into (49), expanding to the order ofðnþ1Þ�1, and subtracting r thereafter, we obtain the result(45). In a similar way we have

EðrK ÞCr�p2r8

s2K

which leads directly to (46). Performing Taylor expansionof rMðrS,rK Þ around ðrS ,rK Þ till the second order, we have

rM ¼ rMðrS ,rK Þþ@ðrMÞ

@ðrSÞdSþ

@ðrMÞ

@ðrK ÞdK

þ1

2

@2ðrMÞ

@ðrSÞ2d2

Sþ@2ðrMÞ

@ðrK Þ2d2

Kþ2@2ðrMÞdSdK

@ðrSÞ@ðrK Þ

" #þ � � � :

ð50Þ

Taking expectation of both sides in (50), ignoring highorder infinitesimals, applying the results rMðrS ,rK Þ ¼ r,EðdSÞ ¼ 0, EðdK Þ ¼ 0, Eðd2

S Þ ¼ s2S , Eðd2

K Þ ¼ s2K , EðdS,dK Þ ¼ sS,K

along with the second order partial derivatives

@2rMðrS ,rK Þ

@ðrSÞ2

¼�p2r36

ðnþ1Þ2

ðn�2Þ2

@2rMðrS ,rK Þ

@ðrK Þ2¼�

p2r4

1

ðn�2Þ2

@2rMðrS ,rK Þ

@ðrSÞ@ðrK Þ¼p2r12

nþ1

ðn�2Þ2

and subtracting r thereafter, we arrive at the fourththeorem statement (47), thus completing the proof. &

Remark 5. From (44) to (47), it follows that, for all thefour estimators,

BIASzðrÞ ¼ BIASzð�rÞ (odd symmetry); � r BIASzðrÞr0 (negative bias); � BIASz ¼ 0 for r 2 f�1,0,1g; � BIASz �Oðn�1Þ as n-1.

Moreover, contrary to BIASP and BIASK , BIASS and BIASM

cannot be expressed into elementary functions due to theintractability involved in (17) and (24), the expressions ofVðrSÞ and CðrS,rK Þ, respectively.

Page 7: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Table 2Values of O1ðrÞ and O2ðrÞ in Lemma 3.

r 0.00þ 0.10þ 0.20þ 0.30þ 0.40þ 0.50þ 0.60þ 0.70þ 0.80þ 0.90þ

0.00 0.1111111111 0.1185038469 0.1408155407 0.1784569533 0.2321489296 0.3029841008 0.3925307865 0.5030051934 0.6375648509 0.8008401854

0.01 0.1111849294 0.1200591282 0.1438805317 0.1830890232 0.2384392872 0.3110659830 0.4025930108 0.5153148070 0.6525067154 0.8189917929

0.02 0.1114063976 0.1217636535 0.1470991369 0.1878822020 0.2449020129 0.3193363285 0.4128663989 0.5278679000 0.6677394272 0.8375074957

0.03 0.1117755552 0.1236177309 0.1504719575 0.1928374325 0.2515384762 0.3277970726 0.4233536847 0.5406684104 0.6832688919 0.8563967883

0.04 0.1122924682 0.1256216966 0.1539996256 0.1979556956 0.2583500955 0.3364502180 0.4340577002 0.5537204299 0.6991012791 0.8756696820

0.05 0.1129572289 0.1277759147 0.1576828058 0.2032380114 0.2653383401 0.3452978372 0.4449813799 0.5670282120 0.7152430387 0.8953367468

0.06 0.1137699564 0.1300807778 0.1615221950 0.2086854397 0.2725047311 0.3543420748 0.4561277650 0.5805961801 0.7317009189 0.9154091574

0.07 0.1147307963 0.1325367071 0.1655185233 0.2142990814 0.2798508436 0.3635851506 0.4675000084 0.5944289365 0.7484819855 0.9358987450

0.08 0.1158399207 0.1351441527 0.1696725544 0.2200800791 0.2873783079 0.3730293618 0.4791013791 0.6085312723 0.7655936432 0.9568180554

0.09 0.1170975289 0.1379035942 0.1739850867 0.2260296186 0.2950888115 0.3826770865 0.4909352680 0.6229081771 0.7830436585 0.9781804140

0.00 0.5555555556 0.5831779199 0.6669411548 0.8096548728 1.0164469967 1.2956050434 1.6602428005 2.1318398440 2.7490961353 3.5988067890

0.01 0.5558310525 0.5889973638 0.6784952270 0.8273423444 1.0409353198 1.3279500727 1.7021306947 2.1861334471 2.8213687297 3.7041428194

0.02 0.5566576312 0.5953785532 0.6906408759 0.8456746565 1.0661538717 1.3611599188 1.7451031281 2.2419045230 2.8959717933 3.8148964800

0.03 0.5580355547 0.6023235637 0.7033822725 0.8646587057 1.0921135054 1.3952518608 1.7891893159 2.2992092121 2.9730475022 3.9318774005

0.04 0.5599652624 0.6098346629 0.7167238250 0.8843017153 1.1188255760 1.4302440445 1.8344202199 2.3581081295 3.0527562596 4.0561698333

0.05 0.5624473698 0.6179143137 0.7306701849 0.9046112480 1.1463019652 1.4661555384 1.8808286962 2.4186669028 3.1352804112 4.1892949503

0.06 0.5654826698 0.6265651776 0.7452262544 0.9255952194 1.1745551090 1.5030063944 1.9284496603 2.4809568021 3.2208290418 4.3335266816

0.07 0.5690721334 0.6357901183 0.7603971935 0.9472619127 1.2035980265 1.5408177141 1.9773202724 2.5450554807 3.3096442738 4.4925913045

0.08 0.5732169115 0.6455922055 0.7761884285 0.9696199936 1.2334443518 1.5796117219 2.0274801460 2.6110478526 3.4020096999 4.6735571704

0.09 0.5779183355 0.6559747193 0.7926056602 0.9926785275 1.2641083679 1.6194118448 2.0789715836 2.6790271403 3.4982619270 4.8941554764

(a) In the upper panel are the values of O1ðrÞ, and in the lower panel (shaded area) are the values of O2ðrÞ.(b) O1ð0Þ ¼ 1=9, O1ð1Þ ¼ 1, O1ð�rÞ ¼O1ðrÞ.(c) O2ð0Þ ¼ 5=9, O2ð1Þ ¼ 16=3, O2ð�rÞ ¼O2ðrÞ.

W.

Xu

eta

l./

Sign

al

Pro

cessing

93

(20

13

)2

61

–2

76

26

7

Page 8: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Ta

ble

3V

alu

es

ofO

3ðrÞ

inLe

mm

a4

.

r0

.00þ

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.0

00

.05

55

55

55

56

0.0

57

90

82

72

80

.06

50

48

83

62

0.0

77

23

63

47

70

.09

49

46

46

23

0.1

18

95

51

51

80

.15

05

07

46

40

0.1

91

68

42

31

20

.24

63

47

36

65

0.3

23

56

86

52

3

0.0

10

.05

55

79

01

60

0.0

58

40

40

66

10

.06

60

34

51

25

0.0

78

74

87

69

40

.09

70

47

79

74

0.1

21

74

48

81

90

.15

41

47

12

83

0.1

96

45

47

74

90

.25

28

15

53

59

0.3

33

35

72

51

8

0.0

20

.05

56

49

40

53

0.0

58

94

77

68

60

.06

70

70

84

81

0.0

80

31

67

90

20

.09

92

12

72

59

0.1

24

61

09

82

00

.15

78

84

43

90

0.2

01

36

21

18

50

.25

95

09

18

76

0.3

43

71

45

08

8

0.0

30

.05

57

66

74

77

0.0

59

53

95

71

00

.06

81

58

22

84

0.0

81

94

10

52

30

.10

14

42

27

03

0.1

27

55

51

07

10

.16

17

22

26

12

0.2

06

41

19

56

70

.26

64

43

48

35

0.3

54

73

34

97

2

0.0

40

.05

59

31

08

34

0.0

60

17

96

82

10

.06

92

97

06

10

0.0

83

62

22

29

10

.10

37

37

50

18

0.1

30

57

89

98

70

.16

56

63

63

97

0.2

11

61

04

63

70

.27

36

35

63

57

0.3

66

54

00

16

6

0.0

50

.05

61

42

46

92

0.0

60

86

83

28

70

.07

04

87

77

63

0.0

85

36

10

26

30

.10

60

99

54

31

0.1

33

68

44

90

70

.16

97

11

81

52

0.2

16

96

43

53

60

.28

11

05

33

81

0.3

79

31

22

55

6

0.0

60

.05

64

00

97

77

0.0

61

60

57

55

80

.07

17

30

82

84

0.0

87

15

81

83

30

.10

85

29

57

10

0.1

36

87

35

15

70

.17

38

70

24

20

0.2

22

48

09

49

90

.28

88

75

32

52

0.3

93

31

92

25

8

0.0

70

.05

67

06

69

83

0.0

62

39

22

27

40

.07

30

26

69

53

0.0

89

01

44

74

70

.11

10

28

81

92

0.1

40

14

81

12

10

.17

81

42

60

83

0.2

28

16

82

70

00

.29

69

72

10

99

0.4

09

00

62

84

9

0.0

80

.05

70

59

73

66

0.0

63

22

80

26

30

.07

43

75

88

04

0.0

90

93

07

11

90

.11

35

98

58

21

0.1

43

51

04

31

40

.18

25

32

85

87

0.2

34

03

51

23

30

.30

54

26

97

41

0.4

27

22

72

39

2

0.0

90

.05

74

60

21

50

0.0

64

11

34

55

00

.07

57

78

91

25

0.0

92

90

77

44

50

.11

62

40

21

77

0.1

46

96

27

47

10

.18

70

45

22

02

0.2

40

09

12

30

50

.31

42

77

33

22

0.4

50

14

81

06

0

No

teth

at

(a)O

3ð0Þ¼

1=1

8,

(b)O

3ð1Þ¼

1=2

,a

nd

(c)O

3ð�rÞ¼O

3ðrÞ.

W. Xu et al. / Signal Processing 93 (2013) 261–276268

5.3. Approximation of variances

Besides the bias effect just discussed, the variance isanother important figure of merit when comparing theperformance of the estimators rz, z 2 fP,S,K ,Mg. From(11), it follows that

VðrPÞCð1�r2Þ

2

n�1ð51Þ

By the delta method, it follows that [22]

VðrSÞCp2ð4�r2Þ

36VðrSÞ ð52Þ

VðrK ÞCp2ð1�r2Þ

4VðrK Þ ð53Þ

Now we only need to deal with VðrMÞ, which is statedbelow.

Theorem 3. Let rM be defined as in (42). Then, under the

same assumptions made as in Lemma 3,

VðrMÞCp2ð4�r2Þ

36ðn�2Þ2½ðnþ1Þ2s2

S�6ðnþ1ÞsS,Kþ9s2K � ð54Þ

Proof. Using the delta method [18], it follows that

VðrMÞC@ðrMÞ

@ðrSÞ

� �2

s2Sþ

@ðrMÞ

@ðrK Þ

� �2

s2Kþ2

@ðrMÞ

@ðrSÞ

@ðrMÞ

@ðrK ÞsS,K

ð55Þ

The theorem thus follows with the substitutions of thepartial derivatives

@rMðrS ,rK Þ

@ðrSÞ¼p6

nþ1

n�2

ffiffiffiffiffiffiffiffiffiffiffiffi4�r2

q@rMðrS ,rK Þ

@ðrK Þ¼p2

�1

n�2

ffiffiffiffiffiffiffiffiffiffiffiffi4�r2

qinto (55), respectively. &

5.4. Asymptotic relative efficiency

Thus far in this section we have established theanalytical results with an emphasis on limited-sizedbivariate normal samples. For a better understandingof the four estimators, we will shift our attention tothe asymptotic properties of rz in the sequel. Sincelimn-1EðrzÞ ¼ r, we can compare their performances bymeans of the asymptotic relative efficiency, which isdefined as [18]

AREzðrÞ9 limn-1

VðrPÞ

VðrzÞ, z 2 fP,S,K ,Mg ð56Þ

As remarked before, we employ rP as a benchmark, since,for large-sized bivariate normal samples, rP approachesthe Cramer–Rao lower bound (CRLB) [18]

CRLB¼ð1�r2Þ

2

nð57Þ

From (56) it is obvious that AREP ¼ 1. Moreover, comparing(52) and (54), it is easily seen that limn-1VðrSÞ=VðrMÞ ¼ 1,which leads readily to ARES ¼ AREM by referring to (56). Thenwe only need to focus on ARES and AREK in the followingdiscussion.

Page 9: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276 269

Theorem 4. Let ARES and AREK be defined as in (56). Then

ARESðrÞ ¼36ð1�r2Þ

2

ð4�r2Þ½9p2O1ðrÞ�324ðsin�1 12rÞ

2�

ð58Þ

AREK ðrÞ ¼9ð1�r2Þ

p2�36ðsin�1 12rÞ

2ð59Þ

Proof. Substituting (52) and (53) into (56) yields (58)and (59), respectively, and the proof completes. See alsothe proof in [29] based on influence functions. &

Remark 6. Due to the intractability of O1ðrÞ in (58), ARES

cannot be expressed into elementary functions in general.However, exact results are obtainable for r¼ 0,71. Sub-stituting r¼ 0 and O1ð0Þ ¼ 1=9 into (58), it is easy toverify that

ARESð0Þ ¼9

p2C0:9119

which is a well known result [22]. In our previous work[14] we also obtained that

ARESð71Þ ¼15þ11

ffiffiffi5p

57C0:6947: ð60Þ

Now let us investigate AREK . It follows from (59) that,AREK is expressible as elementary functions of r, and istherefore more tractable than ARES. In other words, wecan evaluate easily any value of AREK with respect to anyvalue of ra71. For example, substituting r¼ 0 into (59)yields

AREK ð0Þ ¼9

p2

which is identical to ARESð0Þ and also well known [22].However, when r-71, an extra effort is necessary, sinceboth the numerator and denominator of (59) vanish in this

Fig. 1. Comparison of BIASz , z 2 fP,S,K ,Mg for (a) n¼10 and (b) n¼20. Theoretic

based on (44)–(47), respectively; whereas the simulation results, denoted by B

case. Apply L’Hopital’s rule, we find the following result:

AREK9r-71 ¼1

4

rffiffiffiffiffiffiffiffiffiffiffiffi4�r2

psin�1 1

2r

�����r ¼ 71

¼3ffiffiffi3p

2pC0:8270 ð61Þ

which is greater than ARESð71Þ. In fact, a comparison ofARES and AREK in Section 6 suggest that ARESrAREK for allr 2 ½�1,1�.

6. Numerical results

In this section we aim at (1) tabulating the values ofO1ðrÞ, O2ðrÞ (in Lemma 3) and O3ðrÞ (in Lemma 4) thatare not expressible as elementary functions, (2) verifyingthe theoretical results established in Theorem 1, and (3)comparing the performances of the four estimators definedin (39)–(42) by means of bias effect, mean square error

(MSE) and ARE under both the normal and contaminatednormal models (Theorems 2–4). Throughout this section,Monte Carlo experiments are undertaken for 10rnr100.A sample size of n¼1000 is considered large enoughwhen we investigate the asymptotic behaviors. The num-ber of trials is set to 5� 105 for reason of accuracy.All samples are generated by functions in the MatlabStatistics ToolboxTM. Specifically, the normal samplesare generated by mvnrnd, whereas the contaminatednormal samples are generated by gmdistribution andrandom. The notation r¼ r1ðDrÞr2 represents a list of rstarting from r1 to r2 with increment Dr.

6.1. Tables of O1ðrÞ, O2ðrÞ and O3ðrÞ

Table 2 contains the values of O1ðrÞ and O2ðrÞ in (17),the first statement of Lemma 3 for r¼ 0ð0:01Þ1. In theupper panel are the values of O1ðrÞ; whereas in the lowerpanel are the values of O2ðrÞ. Due to the importance ofVðrSÞ both in theory and in practice, the table is made asintensive and accurate as possible, with the incrementDr being 0.01, and the precision being up to 10 decimal

al curves, denoted by BIASTz in the legend, are plotted over r¼�1ð0:01Þ1

IASOz in the legend, are plotted over r¼�0:9ð0:1Þ0:9.

Page 10: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

ðrKÞ

Vðr

55

)O

bs.

(56

)O

bs.

.04

10

.03

90

.04

00

.04

0

.04

00

.03

90

.04

00

.03

9

.03

80

.03

70

.03

80

.03

7

.03

40

.03

40

.03

40

.03

4

.03

00

.02

90

.03

00

.03

0

.02

40

.02

40

.02

40

.02

5

.01

80

.01

80

.01

90

.01

9

.01

20

.01

20

.01

20

.01

3

.00

60

.00

70

.00

70

.00

7

.00

20

.00

20

.00

20

.00

2

W. Xu et al. / Signal Processing 93 (2013) 261–276270

places. In Table 3 are tabulated the values of O3ðrÞ in (24)of Lemma 4 for r¼ 0ð0:01Þ1. Because of the similarreasons, the increment Dr and precision are the same asthose in Table 2. The values of O1ðrÞ, O2ðrÞ and O3ðrÞwithrespect to r not included in Tables 2 and 3 can be easilyobtained by interpolation. Given these tables, we caneasily calculate the quantities that depend on O1ðrÞ,O2ðrÞ and O3ðrÞ, including VðrSÞ, VðrSÞ, VðrMÞ, BIASðrSÞ,BIASðrMÞ, ARES, AREM , and so forth.

Sa

mp

lesi

zen¼

20

Sa

mp

lesi

zen¼

30

Vðr

Vðr

Vðr

Vðr

Vðr

Vðr

V

)O

bs.

(53

)O

bs.

(54

)O

bs.

(55

)O

bs.

(56

)O

bs.

(53

)O

bs.

(54

)O

bs.

(

80

.14

30

.05

30

.05

30

.05

80

.05

70

.06

50

.06

10

.06

40

.06

30

.03

40

.03

40

.03

80

.03

70

50

.14

10

.05

20

.05

20

.05

70

.05

60

.06

40

.06

00

.06

30

.06

20

.03

40

.03

40

.03

70

.03

70

80

.13

50

.04

90

.04

90

.05

40

.05

40

.06

00

.05

70

.05

90

.05

90

.03

20

.03

20

.03

50

.03

50

70

.12

50

.04

40

.04

50

.04

90

.04

90

.05

50

.05

20

.05

40

.05

40

.02

90

.02

90

.03

20

.03

20

20

.11

10

.03

70

.03

90

.04

30

.04

30

.04

70

.04

60

.04

70

.04

70

.02

40

.02

50

.02

80

.02

80

30

.09

40

.03

00

.03

20

.03

60

.03

60

.03

90

.03

80

.03

90

.03

90

.01

90

.02

00

.02

30

.02

30

30

.07

40

.02

20

.02

40

.02

80

.02

80

.02

90

.02

90

.03

00

.03

00

.01

40

.01

50

.01

80

.01

80

10

.05

30

.01

40

.01

60

.01

90

.02

00

.01

90

.02

00

.02

00

.02

10

.00

90

.01

00

.01

20

.01

20

90

.03

10

.00

70

.00

80

.01

10

.01

10

.01

00

.01

10

.01

10

.01

10

.00

40

.00

50

.00

70

.00

70

10

.01

20

.00

20

.00

20

.00

40

.00

40

.00

30

.00

40

.00

40

.00

40

.00

10

.00

10

.00

20

.00

20

6.2. Verification of BIASz and VðrzÞ in small samples

Fig. 1 shows the bias effects BIASz corresponding to thefour estimators rz, z 2 fP,S,K ,Mg for n¼10 and n¼20,respectively. It is clearly observed that the magnitudes ofBIASz can be ordered as 9BIASM9o9BIASP9o9BIASK9o9BIASS9. That is, the performance of rS is much worse thanthose of the other three in terms of bias effect in smallsamples. Moreover, it is also observed that (45) and (47)with respect to BIASS and BIASM are more accurate than(44) and (46) with respect to BIASP and BIASK . In otherwords, the former two formulae agree better than do thelatter two formulae with the corresponding simulationresults for a sample size n as small as 10. Nevertheless,the deviations from (44) and (46) to the correspondingsimulation results are less noticeable when the sample sizen is increased up to 20.

Table 4 lists, for each of the three samples sizes, 10, 20and 30, (1) the theoretical results (51)–(54) with respectto VðrzÞ and (2) the corresponding observed variancesfrom the Monte Carlo simulations. It can be seen that (52)and (54), with respect to VðrSÞ and VðrMÞ, are accurateenough even though the sample size is as small as n¼10.On the other hand, unfortunately, the theoretical formula(51) for VðrPÞ and (53) for VðrK Þ deviate substantiallyfrom the corresponding observed simulation results forthe same sample size n¼10. However, it appears thatthese deviations become less noticeable for n¼20 andnegligible for n¼30. Therefore, it would be save to use(51)–(54) when approximating the variances of rz fornZ20 in practice.

Ta

ble

4

Va

ria

nce

so

fVðr

zÞ,z2fP

,S,K

,Mg

for

10

,20

,30

.

rS

am

ple

size

10

Vðr

Vðr

Vðr

Vðr

(53

)O

bs.

(54

)O

bs.

(55

)O

bs.

(56

0.0

0.1

11

0.1

11

0.1

22

0.1

19

0.1

52

0.1

33

0.1

4

0.1

0.1

09

0.1

09

0.1

20

0.1

17

0.1

50

0.1

31

0.1

4

0.2

0.1

02

0.1

05

0.1

15

0.1

12

0.1

42

0.1

25

0.1

3

0.3

0.0

92

0.0

96

0.1

06

0.1

05

0.1

29

0.1

17

0.1

2

0.4

0.0

78

0.0

85

0.0

94

0.0

94

0.1

13

0.1

04

0.1

1

0.5

0.0

63

0.0

71

0.0

80

0.0

81

0.0

93

0.0

89

0.0

9

0.6

0.0

46

0.0

55

0.0

63

0.0

65

0.0

70

0.0

71

0.0

7

0.7

0.0

29

0.0

38

0.0

46

0.0

48

0.0

48

0.0

52

0.0

5

0.8

0.0

14

0.0

21

0.0

28

0.0

30

0.0

26

0.0

31

0.0

2

0.9

0.0

04

0.0

07

0.0

12

0.0

13

0.0

09

0.0

12

0.0

1

6.3. Comparison of MSE in small samples

Contrary to BIASz illustrated in Fig. 1, the magnitudesof the mean square errors

MSEz9E½ðrz�rÞ2�, z 2 fP,S,K ,Mg

cannot be ordered in a consistent manner. It appears inFig. 2 that (1) MSEM 4MSEK 4MSES4MSEP when 9r9 isaround 0, (2) MSES4MSEK 4MSEP when 9r9 exceedssome threshold, which moves towards 0 with increaseof n, and (3) the difference between MSEK and MSES

around r¼ 0 decreases steadily with increase of n.Furthermore, due to the asymptotic equivalence betweenrS and rM , MSES and MSEM becomes closer and closer asn increases.

Page 11: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Fig. 2. Comparison of observed MSEz , z 2 fP,S,K ,Mg for (a) n¼10, (b) n¼20, (c) n¼40, and (d) n¼60 over r¼�1ð0:1Þ1, respectively.

Fig. 3. Verification and comparison of ARES and AREK for n¼1000 over

r¼ 0ð0:01Þ1, for theoretical curves, and r¼ 0ð0:05Þ0:95, for simulation

results. The results (60) and (61) are used to plot the two theoretical

curves for r¼ 1, respectively.

W. Xu et al. / Signal Processing 93 (2013) 261–276 271

6.4. Verification and comparison of ARES and AREK

Fig. 3 verifies and compares the performance of rS andrK in terms of ARE. For purpose of numerical verification,simulation results for n¼1000 are superimposed uponthe corresponding theoretical curves. Due to the asymp-totic equivalence between rS and rM , the results withrespect to AREM are not included in Fig. 3. It can beobserved that (1) the simulations agree well with ourtheoretical findings in (58) and (59), respectively, (2)AREK lies consistently above ARES, indicating the super-iority of rK over rS for large samples, and (3) theperformance of rS deteriorates severely as r approaching

unity, although it performs similarly as rK when r issmall. Note that the remarks on ARES also apply to AREM

due to the equivalence between rS and rM when thesample size n is large.

6.5. Performance of rz in contaminated normal model

Fig. 4 supports to (1) verify the two statements concern-ing EðrK Þ and EðrSÞ in Theorem 1 under the contaminatedGaussian model (36) and (2) compare our formula (38) withthe result of (n) that asserted in [28]. Due to the lack ofspace, we only present the results for E¼ 0:01 and E¼ 0:05under the sample size n¼50 here. For simplicity, the restparameters of the model (36) are set to be sX ¼ sY ¼ 1,lX ¼ lY ¼ 100 and r0 ¼ 0 throughout. It is seen that theobserved values of EðrK Þ and EðrSÞ agree well with thecorresponding theoretical results of (37) and (38) estab-lished in Theorem 1. On the other hand, however, the curveswith respect to (n), especially in Fig. 4(b), deviate obviouslyfrom the corresponding observed values.

Fig. 5 illustrates, in terms of MSE, the sensitivity of rP aswell as the robustness of rS, rK and rM to impulsive noise.It is shown in Fig. 5 that the MSE of rP is dramatically largerthan those of the other three estimators, irrespective of howsmall the fraction E of impulsive component is. On theother hand, it is seen that, despite some minor negative(positive) differences for r around 0 (71), MSES and MSEM

behave similarly with MSEK for E¼ 0:01. Nevertheless, MSES

and MSEM are much larger than MSEK for E¼ 0:05 whenr falls in the neighborhood of 71. Combining Fig. 5(a)and (b), it would be reasonable to rank their performance as

Page 12: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Fig. 4. Verification of Theorem 1 for (a) E¼ 0:01 and (b) E¼ 0:05 under the sample size n¼50 over r¼ ð�1Þ0:1ð1Þ, for simulations, and r¼ ð�1Þ0:01ð1Þ, for

theoretical formulae (37) and (38). The rest parameters of the model (36) are set to be sX ¼ sY ¼ 1, lX ¼ lY ¼ 100 and r0 ¼ 0, respectively. The formula (n)

concerning EðrSÞ developed elsewhere [28] is also included for comparison.

Fig. 5. Performance comparison in terms of MSEz , z 2 fS,P,K ,Mg, over r¼�1ð0:1Þ1 in the contaminated normal model (36) for (a) E¼ 0:01 and (b) E¼ 0:05

under the sample size n¼50. The rest parameters of the model are set to be sX ¼ sY ¼ 1, lX ¼ lY ¼ 100 and r0 ¼ 0, respectively.

W. Xu et al. / Signal Processing 93 (2013) 261–276272

rK ZrS � rM brP in terms of MSE under the contaminatednormal model (36), where the symbol � stands for ‘‘issimilar to’’.

7. Concluding remarks

In this paper we have investigated systematically theproperties of Spearman’s rho and Kendall’s tau for sam-ples drawn from bivariate normal contained normalpopulations. Theoretical derivations along with MonteCarlo simulations reveal that, contrary to the opinion ofequivalence between SR and KT in some literature, e.g.[46], they behave quite differently in terms of mathema-

tical tractability, bias effect, mean square error, asymptotic

relative efficiency in the normal cases and robustness

properties in the contaminated normal model.As shown in Lemma 3, SR is mathematically less

tractable than KT, in the sense of the intractable termsO1ðrÞ and O2ðrÞ in the formula of its variance (17), incontrast with the closed form expression of VðrK Þ in (16).However, this mathematical inconvenience is, to someextent, offset by Table 2 provided in this work, especially

from the viewpoint of numerical accuracy. Moreover, asdemonstrated in Fig. 1 and Table 4, the convergencespeed of the asymptotic formulae (46) and (53) withrespect to BIASK and VðrK Þ are slower than those ofBIASS and VðrSÞ due to the high nonlinearity of thecalibration (41). As a consequence, we do not attachtoo much importance to such mathematical advantage ofKT over SR.

Now let us turn back to the question raised at the verybeginning of this paper: Which one, SR or KT, should weuse in practice when PPMCC is inapplicable? The answerto this question is different for different requirements ofthe task at hand. Specifically,

1.

If unbiasedness is on the top priority list, then neitherrS nor rK should be resorted to. The modified versionrM that employs both SR and KT is definitely the bestchoice (cf. Fig. 1).

2.

On the other hand, if minimal MSE is the criticalfeature and the sample size n is small, then rS (rK )should be employed when the population correlation ris weak (strong) (cf. Fig. 2).
Page 13: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276 273

3.

Since rK outperforms rS asymptotically in terms ofARE, then rK is the suitable statistic in large-samplecases (cf. Fig. 3).

4.

If their is impulsive noise in the data, then it would bebetter to employ rK , in terms of MSE, although there issome minor advantage of rS when r is in the neigh-borhood of 0 (cf. Fig. 5).

5.

Moreover, in terms of time complexity, rS appears tobe superior to rK —the computational load of theformer is Oðn log nÞ; whereas the computational loadof the latter is Oðn2Þ [12].

Possessing the desirable properties summarized inSection 2, Spearman’s rho and Kendall’s tau have foundwide applications in the literature other than signalprocessing. With the new insights uncovered in thispaper, these two rank based coefficients can play com-plementary roles under the circumstances where Pear-son’s product moment correlation coefficient is no longereffective.

Acknowledgments

This work was supported in part by 100-TalentsScheme Funding from Guangdong University of Technol-ogy under Grant 112418006, in part by the Talent Intro-duction Special Funds from Guangdong Province underGrant 2050205, in part by a team project from GuangdongUniversity of Technology under Grant GDUT2011-07, andin part by National Natural Science Foundation of Chinaunder Project 61271380.

Appendix A

Proof of Lemma 4. Let

S9Xn

i ¼ 1

Xn

j ¼ 1

Xn

k ¼ 1

HðXi�XjÞHðYi�YkÞ

¼Xn Xn

iaj ¼ 1

HðXi�XjÞHðYi�YjÞ

þXn Xn Xn

iajak ¼ 1

HðXi�XjÞHðYi�YkÞ ðA:1Þ

and T be the numerator of (3). Taking the expectation ofboth sides of (A.1) with the assistance of (6) in Lemma 2,it follows readily that

EðSÞ ¼ n½2�1

S1

2p

� �þn½3�

1

S2

2p

� �ðA:2Þ

where n½k�9nðn�1Þ � � � ðn�kþ1Þ, with k being a positiveinteger. Define

I9Xiaj

HðXi�XjÞHðYi�YjÞ ðA:3Þ

J9X

iajak

HðXi�XjÞHðYi�YkÞ ðA:4Þ

K9Xiaj

HðXi�XjÞ ðA:5Þ

L9Xiak

HðYi�YkÞ ðA:6Þ

Then, we have, from (3), (B.9) and (A.1) along with therelationship sgnð�Þ ¼ 2Hð�Þ�1,

S ¼ Iþ J ðA:7Þ

T ¼ 4I�2 K�2Lþn½2� ðA:8Þ

and hence

CðrS,rK Þ ¼12

n2ðn�1Þðn2�1ÞCðS,T Þ

¼12

n2ðn�1Þðn2�1Þ½EðST Þ�EðSÞEðT Þ� ðA:9Þ

From (5) and (6), it follows that

EðIÞ ¼ n½2�1

S1

2p

� �and EðKÞ ¼ EðLÞ ¼

n½2�

2

Substituting these expectation terms into (A.8) gives

EðT Þ ¼ 4n½2�1

S1

2p

� ��n½2� ¼

2n½2�

pS1 ðA:10Þ

Recall that we have obtained EðSÞ in (A.2). Now the onlydifficulty lies in the evaluation of EðST Þ in (A.9). Multi-plying (A.7) and (A.8), expanding and taking expectationsterm by term, we have

EðST Þ ¼ 4EðIJÞ�2EðKJÞ�2EðLJÞ

þ4EðI2Þ�2EðKIÞ�2EðLIÞþn½2�EðSÞ ðA:11Þ

Now, resorting to Table 5, we are ready to evaluate thefirst six terms in (A.11). From (A.3) and (A.4), it followsthat EðIJÞ is a summation of P0

4 terms of the form

EfHðXi�Xj|fflffl{zfflffl}Z1

ÞHðYi�Yj|fflffl{zfflffl}Z2

ÞHðXk�Xl|fflfflffl{zfflfflffl}Z3

ÞHðYk�Ym|fflfflfflffl{zfflfflfflffl}Z4

Þg: ðA:12Þ

Since, by (4), Hð0Þ ¼ 0, the term (A.12) vanishes for i¼ j

or k¼ l or k¼m. Then there are n2ðn�1Þ2ðn�2Þ nontrivial(A.12)-like terms left to be evaluated. It follows that thedomain of the quintuple ði,j,k,l,mÞ can be partitioned into13 disjoint and exhaustive subsets whose representativeterms, Z1, Z2, Z3, Z4, are listed in the upper panel of Table 5.Summing up the corresponding P0

4-terms in Table 5 leadsdirectly to EðIJÞ. In a similar manner we can obtain EðKJÞ

and EðLJÞ. With the assistance of the lower panel ofTable 5, we also have the expressions of EðI2

Þ, EðKIÞ andEðLIÞ. Substituting these results and (A.2) into (A.11),subtracting the multiplication of (A.2) and (A.10) andsubstituting the resultant back into (A.9), we find thatCðrS,rK Þ is of the form (24) with

O3ðrÞ ¼ 14 Wgþ

14 Wpþ

12 Whþ

12Wq

which simplifies to (26) by applying the identities

We ¼ 2Wd, Wg ¼Wp, Wh ¼Wq, and Wm ¼ 2Wlþ13

obtained from an application of the relationship (8) toAppendix 2 of [45]. The lemma then follows. &

Page 14: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

Table 5Quantities for evaluation of EðST Þ in Lemma 4.

Corr. matrix No. of termsa Representative term P04ðZ1 ,Z2 ,Z3 ,Z4Þ

b P03ðZ2 ,Z3 ,Z4Þ

b P03ðZ1 ,Z3 ,Z4Þ

a

Z1 Z2 Z3 Z4

Rb n½5� X1�X2 Y1�Y2 X3�X4 Y3�Y5 1

16þ

S1

8p þS2

8p þS1S2

4p2

1

S2

4p1

S2

4pRg n½4� X1�X2 Y1�Y2 X1�X3 Y1�Y4 5

48þ

S1

8p þ3S2

8p þWg ðrÞ

16

1

S2

2p1

S2

2pRh n½4� X1�X2 Y1�Y2 X3�X4 Y3�Y1 1

24þ

S1

8p þWhðrÞ

16

1

12þ

S2

4p1

8Rh n½4� X1�X2 Y1�Y2 X3�X1 Y3�Y4 1

24þ

S1

8p þWhðrÞ

16

1

12þ

S2

4p1

8Rp n½4� X1�X2 Y1�Y2 X2�X3 Y2�Y4 1

48þ

S1

8p�

S2

8pþ

WpðrÞ16

1

12

1

12Rq n½4� X1�X2 Y1�Y2 X3�X2 Y3�Y4 1

12þ

S1

8p þS2

4p þWqðrÞ

16

1

S2

2p1

S2

4pRq n½4� X1�X2 Y1�Y2 X3�X4 Y3�Y2 1

12þ

S1

8p þS2

4p þWqðrÞ

16

1

S2

2p1

S2

4pRu n½3� X1�X2 Y1�Y2 X1�X2 Y1�Y3 1

S1

4pþ

S2

4p1

S1

4pþ

S2

4p1

S2

2pRu n½3� X1�X2 Y1�Y2 X1�X3 Y1�Y2 1

S1

4p þS2

4p1

S1

4p þS2

4p1

S2

2pRv n½3� X1�X2 Y1�Y2 X3�X1 Y3�Y2 1

18þ

S1

8p þS2

8p þS2

1

8p2�

S22

8p2

1

6

1

12þ

S2

2pRv n½3� X1�X2 Y1�Y2 X3�X2 Y3�Y1 1

18þ

S1

8p þS2

8p þS2

1

8p2�

S22

8p2

1

6

1

12þ

S2

2pR1 n½3� X1�X2 Y1�Y2 X2�X1 Y2�Y3 0 1

12�

S1

4p þS2

4p0

R2 n½3� X1�X2 Y1�Y2 X2�X3 Y2�Y1 0 0 1

12�

S1

4p þS2

4p

Rj n½4� X1�X2 Y1�Y2 X3�X4 Y3�Y4 1

16þ

S1

4p þS2

1

4p2

1

S1

4p1

S1

4pRr n½3� X1�X2 Y1�Y2 X1�X3 Y1�Y3 1

S1

4p þS2

4p þS2

1

4p2�

S22

4p2

1

S1

4p þS2

4p1

S1

4p þS2

4pRr n½3� X1�X2 Y1�Y2 X3�X2 Y3�Y2 1

S1

4p þS2

4p þS2

1

4p2�

S22

4p2

1

S1

4p þS2

4p1

S1

4p þS2

4pRw n½3� X1�X2 Y1�Y2 X3�X1 Y3�Y1 1

36þ

S1

4p �S2

4p þS2

1

4p2�

S22

4p2

1

12þ

S1

4p �S2

4p1

12þ

S1

4p �S2

4pRw n½3� X1�X2 Y1�Y2 X2�X3 Y2�Y3 1

36þ

S1

4p �S2

4p þS2

1

4p2�

S22

4p2

1

12þ

S1

4p�

S2

4p1

12þ

S1

4p�

S2

4pRx n½2� X1�X2 Y1�Y2 X1�X2 Y1�Y2 1

S1

2p1

S1

2p1

S1

2p

a The symbol n½k� is a compact notation of nðn�1Þ � � � ðn�kþ1Þ, with k being a positive integer.b P0

4ðZ1 ,Z2 ,Z3 ,Z4Þ9EfHðZ1ÞHðZ2ÞHðZ3ÞHðZ4Þg, P03ðZ2 ,Z3 ,Z4Þ9EfHðZ2ÞHðZ3ÞHðZ4Þg, and P0

3ðZ1 ,Z3 ,Z4Þ9EfHðZ1ÞHðZ3ÞHðZ4Þg. Notations S19sin�1 r and

S29sin�1 12r are used for brevity.

W. Xu et al. / Signal Processing 93 (2013) 261–276274

Appendix B

Proof of Theorem 1. For the ease of the followingdiscussion, we will use fðx,yÞ and cðx,yÞ to denote thepdfs of the two bivariate normal components in (36),respectively. From [37] and (A.8), it follows that thenumerator T of (3) can be simplified to

T ¼ 4Xn Xn

iaj ¼ 1

HðXi�XjÞHðYi�YjÞ�n½2� ðB:1Þ

which yields

EðT Þ ¼ 4n½2�E½HðX1�X2ÞHðY1�Y2Þ�|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}E1

�n½2� ðB:2Þ

by the i.i.d. assumption. To evaluate E1 in (B.2), weneed the joint distribution of ðX1,Y1,X2,Y2Þ, denoted byjðx1,y1,x2,y2Þ, which is readily obtained as

j¼ ½ð1�EÞf1þEc1�½ð1�EÞf2þEc2�

¼ ð1�EÞ2|fflfflffl{zfflfflffl}a1

f1f2|fflffl{zfflffl}j1

þEð1�EÞ|fflfflffl{zfflfflffl}a2

f1c2|fflffl{zfflffl}j2

þEð1�EÞ|fflfflffl{zfflfflffl}a3

f2c1|fflffl{zfflffl}j3

þ E2|{z}a4

c1c2|fflffl{zfflffl}j4

ðB:3Þ

where j, fi, ci are compact notations of jðx1,y1,x2,y2Þ,fðxi,yiÞ and cðxi,yiÞ, i¼1,2, respectively. Write

U9X1�X2ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiVðX1�X2Þ

p and V9Y1�Y2ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiVðY1�Y2Þ

p :

Then, with respect to j1, j2, j3, and j4 in (B.3), (U,V)follows four standard bivariate normal distributions withcorrelations

R1 ¼ r ðB:4Þ

R2 ¼rþlXlYr0ffiffiffiffiffiffiffiffiffiffiffiffiffi

1þl2X

q ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -r0 as lX ,lY-1 ðB:5Þ

R3 ¼rþlXlYr0ffiffiffiffiffiffiffiffiffiffiffiffiffi

1þl2X

q ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -r0 as lX ,lY-1 ðB:6Þ

Page 15: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276 275

R4 ¼ r0 ðB:7Þ

respectively. An application of Sheppard’s theorem (6) to(B.2) along with (B.4)–(B.7) yields

EðT Þ ¼ 4n½2�X4

i ¼ 1

ai1

1

2p sin�1 Ri

� ��n½2�

¼2n½2�

p½a1 sin�1rþ2a2 sin�1R2þa4 sin�1 r0�: ðB:8Þ

Now it is not difficult to verify that the first statement(37) holds by (1) dividing both sides of (B.8) by n½2�,(2) letting lX-1 and lY-1, and (3) ignoring the OðE2Þ

terms.To prove the second statement (38), it suffices to

evaluate EðSÞ by the relationship [37]

rS ¼S� 1

4 ðn�1Þ2

n

12

n2�1: ðB:9Þ

Taking expectations of both sides in (A.1) along with thei.i.d. assumptions gives

EðSÞ ¼ n½2�E1þn½3�E½HðX1�X2ÞHðY1�Y3Þ�|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}E2

: ðB:10Þ

Since we have known E1 in the above development,now we only need to work out E2 in (B.10). Let$ðx1,y1,x2,y2,x3,y3Þ, abbreviated as $, denote the pdf ofthe joint distribution of ðX1,Y1,X2,Y2,X3,Y3Þ. Then, from(36) and the i.i.d. assumption,

$¼ ½ð1�EÞf1þEc1�½ð1�EÞf2þEc2�½ð1�EÞf3þEc3�

¼ ð1�EÞ3f1f2f3|fflfflfflfflffl{zfflfflfflfflffl}$1

þEð1�EÞ2ðf1f2c3|fflfflfflfflffl{zfflfflfflfflffl}$2

þf1c2f3|fflfflfflfflffl{zfflfflfflfflffl}$3

þc1f2f3|fflfflfflfflffl{zfflfflfflfflffl}$4

Þ

þE2ð1�EÞðf1c2c3|fflfflfflfflffl{zfflfflfflfflffl}$5

þc1f2c3|fflfflfflfflffl{zfflfflfflfflffl}$6

þc1c2f3|fflfflfflfflffl{zfflfflfflfflffl}$7

ÞþE3c1c2c3|fflfflfflfflffl{zfflfflfflfflffl}$8

ðB:11Þ

where fi and ci are compact notations of fðxi,yiÞ andcðxi,yiÞ, i¼ 1,2,3, respectively. Define

V 0 ¼Y1�Y3ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiVðY1�Y3Þ

p :

Then, with respect to $1 to $8 in (B.11), ðU,V 0Þ followseight standard bivariate normal distributions with corre-lations

R5 ¼r2

ðB:12Þ

R6 ¼1ffiffiffi2p

rffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -0 as lY-1 ðB:13Þ

R7 ¼1ffiffiffi2p

rffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

X

q -0 as lX-1 ðB:14Þ

R8 ¼lXlYr0ffiffiffiffiffiffiffiffiffiffiffiffiffi

1þl2X

q ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -r0 as lX ,lY-1 ðB:15Þ

R9 ¼rffiffiffiffiffiffiffiffiffiffiffiffiffi

1þl2X

q ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -0 as lX ,lY-1 ðB:16Þ

R10 ¼1ffiffiffi2p

lXr0ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

X

q -r0ffiffiffi

2p as lX-1 ðB:17Þ

R11 ¼1ffiffiffi2p

lYr0ffiffiffiffiffiffiffiffiffiffiffiffiffi1þl2

Y

q -r0ffiffiffi

2p as lY-1 ðB:18Þ

R12 ¼r0

2ðB:19Þ

Using Sheppard’s theorem (6) again together with(B.11)–(B.19), we can obtain the expression of E2 and henceEðSÞ in terms of n, E and R1 to R12. Substituting EðSÞ into(B.9), letting n, lX , lY-1 and ignoring the OðE2Þ terms, wearrive at (38), the second theorem statement. &

References

[1] P. Schreier, A unifying discussion of correlation analysis for com-plex random vectors, IEEE Transactions on Signal Processing 56 (4)(2008) 1327–1336.

[2] J. Via, I. Santamaria, Correlation matching approaches for blindOSTBC channel estimation, IEEE Transactions on Signal Processing56 (12) (2008) 5950–5961.

[3] X. Xu, Z. Ye, Y. Zhang, DOA estimation for mixed signals in thepresence of mutual coupling, IEEE Transactions on Signal Proces-sing 57 (9) (2009) 3523–3532.

[4] E. Dougherty, S. Kim, Y. Chen, Coefficient of determination innonlinear signal processing, Signal Processing 80 (10) (2000)2219–2235.

[5] M. Barkat, S. Dib, Cfar detection for two correlated targets, SignalProcessing 61 (3) (1997) 289–295.

[6] J. Lian, G. Garner, D. Muessig, V. Lang, A simple method to quantifythe morphological similarity between signals, Signal Processing90 (2) (2010) 684–688.

[7] J. Girault, D. Kouame, A. Ouahabi, Analytical formulation of thefractal dimension of filtered stochastic signals, Signal Processing90 (9) (2010) 2690–2697.

[8] J. Delmas, H. Abeida, Asymptotic distribution of circularity coeffi-cients estimate of complex random variables, Signal Processing89 (12) (2009) 2670–2675.

[9] Y. He, T. Gan, W. Chen, H. Wang, Multi-stage image denoising basedon correlation coefficient matching and sparse dictionary pruning,Signal Processing 92 (1) (2012) 139–149.

[10] S. Zhou, H. Liu, Y. Zhao, L. Hu, Target spatial and frequencyscattering diversity property for diversity mimo radar, SignalProcessing 91 (2) (2011) 269–276.

[11] W. Xu, C. Chang, Y.S. Hung, S.K. Kwan, P.C.W. Fung, Order statisticcorrelation coefficient and its application to association measure-ment of biosignals, in: Proceedings of the International Conferenceon Acoustics, Speech, Signal Processing (ICASSP), vol. 2, 2006,pp. II-1068–II-1071.

[12] W. Xu, C. Chang, Y.S. Hung, S.K. Kwan, P.C.W. Fung, Order statisticscorrelation coefficient as a novel association measurement withapplications to biosignal analysis, IEEE Transactions on SignalProcessing 55 (12) (2007) 5552–5563.

[13] W. Xu, C. Chang, Y.S. Hung, P.C.W. Fung, Asymptotic properties oforder statistics correlation coefficient in the normal cases, IEEETransactions on Signal Processing 56 (6) (2008) 2239–2248.

[14] W. Xu, Y.S. Hung, M. Niranjan, M. Shen, Asymptotic mean andvariance of Gini correlation for bivariate normal samples, IEEETransactions on Signal Processing 58 (2) (2010) 522–534.

[15] M. Greco, F. Gini, A. Farina, Radar detection and classification ofjamming signals belonging to a cone class, IEEE Transactions onSignal Processing 56 (5) (2008) 1984–1993.

[16] F. Gini, A radar application of a modified Cramer–Rao bound:parameter estimation in non-Gaussian clutter, IEEE Transactionson Signal Processing 46 (7) (1998) 1945–1953.

[17] F. Gini, J. Michels, Performance analysis of two covariance matrixestimators in compound-Gaussian clutter, IEE Proceedings Part-F146 (3) (1999) 133–140.

[18] A. Stuart, J.K. Ord, Kendall’s Advanced Theory of Statistics: ClassicalInference and Relationship, 5th edition, vol. 2, Edward Arnold,London, 1991.

Page 16: A comparative analysis of Spearman's rho and Kendall's tau ...yhhou/pdf/1-s2.0-S0165168412002721-mai… · A comparative analysis of Spearman’s rho and Kendall’s tau in normal

W. Xu et al. / Signal Processing 93 (2013) 261–276276

[19] J.D. Gibbons, S. Chakraborti, Nonparametric Statistical Inference,3rd edition, Marcel Dekker, New York, 1992.

[20] R.A. Fisher, On the ‘probable error’ of a coefficient of correlationdeduced from a small sample, Metron 1 (1921) 3–32.

[21] R.A. Fisher, Statistical Methods, Experimental Design, and ScientificInference, Oxford University Press, New York, 1990.

[22] M. Kendall, J.D. Gibbons, Rank Correlation Methods, 5th edition,Oxford University Press, New York, 1990.

[23] D.D. Mari, S. Kotz, Correlation and Dependence, Imperial CollegePress, London, 2001.

[24] S. Tumanski, Principles of Electrical Measurement, Taylor & Francis,New York, 2006.

[25] D. Stein, Detection of random signals in Gaussian mixture noise,IEEE Transactions on Information Theory 41 (6) (1995) 1788–1801.

[26] R. Chen, X. Wang, J. Liu, Adaptive joint detection and decoding inflat-fading channels via mixture Kalman filtering, IEEE Transactionson Information Theory 46 (6) (2000) 2079–2094.

[27] Z. Reznic, R. Zamir, M. Feder, Joint source-channel coding of aGaussian mixture source over the Gaussian broadcast channel, IEEETransactions on Information Theory 48 (3) (2002) 776–781.

[28] G.L. Shevlyakov, N.O. Vilchevski, Robustness in Data Analysis:Criteria and Methods, Modern Probability and Statistics, VSP,Utrecht, 2002.

[29] C. Croux, C. Dehon, Influence functions of the Spearman andKendall correlation measures, Statistical Methods & Applications19 (4) (2010) 497–515.

[30] D. Ruchkin, Error of correlation coefficient estimates from polaritycoincidences (corresp.), IEEE Transactions on Information Theory11 (2) (1965) 296–297.

[31] M.C. Cheng, The clipping loss in correlation detectors for arbitraryinput signal-to-noise ratios, IEEE Transactions on InformationTheory 14 (3) (1968) 382–389.

[32] H. Chadwick, L. Kurz, Rank permutation group codes based onKendall’s correlation statistic, IEEE Transactions on InformationTheory 15 (2) (1969) 306–315.

[33] V. Hansen, Detection performance of some nonparametric ranktests and an application to radar, IEEE Transactions on InformationTheory 16 (3) (1970) 309–318.

[34] F. Esscher, On a method of determining correlation from the ranksof variates, Skandinavisk Aktuarietidskrift 7 (2) (1924) 201–219.

[35] M.G. Kendall, S.F.H. Kendall, B.B. Smith, The distribution of Spear-man’s coefficient of rank correlation in a universe in which allrankings occur an equal number of times, Biometrika 30 (3/4)(1939) 251–273.

[36] H.E. Daniels, The relation between measures of correlation inthe universe of sample permutations, Biometrika 33 (2) (1944)129–135.

[37] P.A.P. Moran, Rank correlation and product-moment correlation,Biometrika 35 (1/2) (1948) 203–206.

[38] W. Hoeffding, A class of statistics with asymptotically normaldistribution, Annals of Mathematical Statistics 19 (3) (1948)293–325.

[39] M.G. Kendall, Rank and product-moment correlation, Biometrika36 (1/2) (1949) 177–193.

[40] H.E. Daniels, Rank correlation and population models, Journal of theRoyal Statistical Society: Series B (Statistical Methodology) 12 (2)(1950) 171–191.

[41] S.T. David, M.G. Kendall, A. Stuart, Some questions of distribution inthe theory of rank correlation, Biometrika 38 (1/2) (1951) 131–140.

[42] J. Durbin, A. Stuart, Inversions and rank correlation coefficients,Journal of the Royal Statistical Society: Series B (Statistical Meth-odology) 13 (2) (1951) 303–309.

[43] H. Hotelling, New light on the correlation coefficient and itstransforms, Journal of the Royal Statistical Society: Series B (Sta-tistical Methodology) 15 (2) (1953) 193–232.

[44] E.C. Fieller, H.O. Hartley, E.S. Pearson, Tests for rank correlationcoefficients. i, Biometrika 44 (3/4) (1957) 470–481.

[45] F.N. David, C.L. Mallows, The variance of Spearman’s rho in normalsamples, Biometrika 48 (1/2) (1961) 19–28.

[46] A.R. Gilpin, Table for conversion of Kendall’s tau to Spearman’s rhowithin the context of measures of magnitude of effect for meta-analysis, Educational and Psychological Measurement 53 (1) (1993)87–92.

[47] R.J. Serfling, Approximation Theorems of Mathematical Statistics,Wiley Series in Probability and Mathematical Statistics, WileyNew York, 2002.

[48] A. Stuart, J.K. Ord, S.F. Arnold, Kendall’s Advanced Theory ofStatistics: Distribution Theory, 6th edition, vol. 1, Edward Arnold,London, 1991.

[49] S.S. Gupta, Probability integrals of multivariate normal and multi-variate t, Annals of Mathematical Statistics 34 (3) (1963) 792–828.

[50] D.R. Childs, Reduction of the multivariate normal integral tocharacteristic form, Biometrika 54 (1/2) (1967) 293–300.

[51] F. Esscher, On a method of determining correlation from the ranksof the variates, Skandinavisk Aktuarietidskrift 7 (1924) 201–219.

[52] C.B. Borkowf, Computing the nonnull asymptotic variance and theasymptotic relative efficiency of Spearman’s rank correlation,Computational Statistics & Data Analysis 39 (3) (2002) 271–286.