Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 )...

22
Applied Statistics I (IMT224β/AMT224β) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 1/22

Transcript of Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 )...

Page 1: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Applied Statistics I(IMT224β/AMT224β)

Department of MathematicsUniversity of Ruhuna

A.W.L. Pubudu Thilan

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 1/22

Page 2: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Chapter 5

Measures of skewness

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 2/22

Page 3: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Introduction

As mentioned in previous two chapters, we can characterizeany set of data by measuring its central tendency, variation,and shape.

In this chapter we are going to discuss about shape of a givendata set.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 3/22

Page 4: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

IntroductionCont...

The need to study these concepts arises from the fact that themeasures of central tendency and dispersion fail to describe adistribution completely.

It is possible to have frequency distributions which differwidely in their nature and composition and yet may have samecentral tendency and dispersion.

Thus, there is need to supplement the measures of centraltendency and dispersion.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 4/22

Page 5: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Concept of skewness

The skewness of a distribution is defined as the lack ofsymmetry.

In a symmetrical distribution, mean, median, and mode areequal to each other.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 5/22

Page 6: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Concept of skewnessCont...

The presence of extreme observations on the right hand sideof a distribution makes it positively skewed.

We shall in fact have

Mean > Median > Mode

when a distribution is positively skewed.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 6/22

Page 7: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Concept of skewnessCont...

On the other hand, the presence of extreme observations tothe left hand side of a distribution make it negatively skewedand the relationship between mean, median and mode is:

Mean < Median < Mode.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 7/22

Page 8: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Different measures of skewness

The direction and extent of skewness can be measured in variousways. We shall discuss three measures of skewness in this section.

1 Pearson’s coefficient of skewness

2 Bowley’s coefficient of skewness

3 Kelly’s coefficient of skewness

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 8/22

Page 9: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

[1] Karl Pearson coefficient of skewness

The mean, median and mode are not equal in a skeweddistribution.

The Karl Pearson’s measure of skewness is based upon thedivergence of mean from mode in a skewed distribution.

Skp1 =mean - mode

standard deviationor Skp2 =

3(mean - median)

standard deviation

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 9/22

Page 10: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Properties of Karl Pearson coefficient of skewness

1 −1 ≤ Skp ≤ 1.

2 Skp = 0 ⇒ distribution is symmetrical about mean.

3 Skp > 0 ⇒ distribution is skewed to the right.

4 Skp < 0 ⇒ distribution is skewed to the left.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 10/22

Page 11: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Advantage and disadvantage

AdvantageSkp is independent of the scale. Because (mean-mode) andstandard deviation have same scale and it will be canceled outwhen taking the ratio.

DisadvantageSkp depends on the extreme values.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 11/22

Page 12: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Example

Calculate Karl Pearson coefficient of skewness of the following dataset (S = 1.7).

Value (x) 1 2 3 4 5 6 7

Frequency (f) 2 3 4 4 6 4 2

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 12/22

Page 13: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

ExampleSolution

mean = x =

∑ni=1 fixi∑n

i fi

=1× 2 + 2× 3 + ...+ 7× 2

25

=104

25= 4.16

mode = 5

Skp =mean-mode

standard deviation

=4.16− 5

1.7= −0.4941

Since Skp < 0 distribution is skewed left.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 13/22

Page 14: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

[2] Bowley’s coefficient of skewness

This measure is based on quartiles.

For a symmetrical distribution, it is seen that Q1, and Q3 areequidistant from median (Q2).

Thus (Q3 − Q2)− (Q2 − Q1) can be taken as an absolutemeasure of skewness.

Skq =(Q3 − Q2)− (Q2 − Q1)

(Q3 − Q2) + (Q2 − Q1)

=Q3 + Q1 − 2Q2

Q3 − Q1

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 14/22

Page 15: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Properties of Bowley’s coefficient of skewness

1 −1 ≤ Skq ≤ 1.

2 Skq = 0 ⇒ distribution is symmetrical about mean.

3 Skq > 0 ⇒ distribution is skewed to the right.

4 Skq < 0 ⇒ distribution is skewed to the left.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 15/22

Page 16: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Advantage and disadvantage

AdvantageSkq does not depend on extreme values.

DisadvantageSkq does not utilize the data fully.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 16/22

Page 17: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Example

The following table shows the distribution of 128 familiesaccording to the number of children.

No of children No of families

0 201 152 253 304 185 106 67 3

8 or more 1

Find the Bowley’s coefficient of skewness.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 17/22

Page 18: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

ExampleSolution

No of children No of families Cumulative frequency

0 20 201 15 352 25 603 30 904 18 1085 10 1186 6 1247 3 127

8 or more 1 128

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 18/22

Page 19: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

ExampleSolution⇒Cont...

Q1 =

(128 + 1

4

)th

observation

= (32.25)th observation

= 1

Q2 =

(128 + 1

2

)th

observation

= (64.5)th observation

= 3

Q3 = 3

(128 + 1

4

)th

observation

= (96.75)th observation

= 4

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 19/22

Page 20: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

ExampleSolution⇒Cont...

Skq =Q3 + Q1 − 2Q2

Q3 − Q1

=4 + 1− 2× 3

4− 1

= −1

3= −0.333

Since Skq < 0 distribution is skewed left.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 20/22

Page 21: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

[3] Kelly’s coefficient of skewness

Bowley’s measure of skewness is based on the middle 50% ofthe observations because it leaves 25% of the observations oneach extreme of the distribution.

As an improvement over Bowley’s measure, Kelly hassuggested a measure based on P10 and, P90 so that only 10%of the observations on each extreme are ignored.

Sp =(P90 − P50)− (P50 − P10)

(P90 − P50) + (P50 − P10)

=P90 + P10 − 2P50

P90 − P10.

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 21/22

Page 22: Applied Statistics I - math.ruh.ac.lkpubudu/app5.pdf · Applied Statistics I (IMT224 /AMT224 ) Department of Mathematics University of Ruhuna A.W.L. Pubudu Thilan Department of Mathematics

Thank You

Department of Mathematics University of Ruhuna — Applied Statistics I(IMT224β/AMT224β) 22/22