53
Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Embed Size (px)

Citation preview

Page 1: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall Storiesor how a simple question doesn’t always have a simple

answer

Neil SheldonRoyal Statistical Society

Centre for Statistical Education

neilsheldon.net

Page 2: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Some people think statistics is a branch of

mathematics ...

Page 3: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

... but they’re wrong.It’s more important

than that ...

Page 4: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

... it’s a life skill

Page 5: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

The purpose of statistics is understanding,

not numbers

Page 6: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Understanding statistics is understanding the

world around you

Page 7: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Understanding statistics enables you take better

decisions

Page 8: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

The statistics tell a story ...

Page 9: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

... but first you have to understand the story behind the statistics

Page 10: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall Stories

or how a simple question doesn’t always have a simple answer

Page 11: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall stories

• How tall am I ...

Page 12: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall stories

• How tall am I ...

– in absolute terms? What is my height in feet and inches?

Page 13: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall stories

• How tall am I ...

– in absolute terms? What is my height in feet and inches?

– in relative terms? Am I short, or tall, or about average?

Page 14: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall stories

• What factors influence the answers to– the absolute question?– the relative question?

Page 15: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall stories

• What factors influence the answers to– the absolute question?– the relative question?

• Variation,variation, variation!

Page 16: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

If only ...

Page 17: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Men, white From the US National Health And Nutrition Examination Survey ‘NHANES III’ 1988-94

Page 18: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Men, black

Page 19: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Men, Hispanic

Page 20: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Men, other

Page 21: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Women, white

Page 22: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Women, black

Page 23: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Women, Hispanic

Page 24: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Women, other

Page 25: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

But where do the data come from?

And what are the implications of sampling variation?

NormalSimulation.xlsx

Page 26: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Cross-sectional and longitudinal studies

Age differences in height derived from cross-sectional studies can be the result of differential secular influences among the age cohorts. To determine the magnitude of height loss that accompanies aging, longitudinal studies are required. The authors studied 2,084 men and women aged 17–94 years enrolled from 1958 to 1993 in the Baltimore Longitudinal Study of Aging, Baltimore, Maryland. On average, men's height was measured nine times during 15 years and women's height five times during 9 years. The rate of decrease in height was greater for women than for men. For both sexes, height loss began at about age 30 years and accelerated with increasing age. Cumulative height loss from age 30 to 70 years averaged about 3 cm for men and 5 cm for women; by age 80 years, it increased to 5 cm for men and 8 cm for women. Am J Epidemiol 1999;150:969-77.

Page 27: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Longitudinal data

Page 28: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Overlapping longitudinal data

Page 29: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Judged by the height of the doorframes he built, medieval man seems to have been short by today’s standards.

• But evidence gathered from 3,000 skeletons reveals that human height has varied little over the past 1,000 years.

• From the 10th century through to the 19th, the average height of adult men was 5ft 7in or 170cm - just 2in below today's average.

• Women were an average of 5ft 2in or 158cm - just over an inch shorter than today.

All the bones in the study came from the

medieval St Peter's Church in Barton

upon Humber, North East Lincolnshire.

They were shorter back then ...

Page 30: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Based on a modest sample of skeletons from northern Europe, average heights fell from 173.4 cm in the early Middle Ages to a low of roughly 167 cm during the 17th and 18th centuries. Taking the data at face value, this decline of approximately 6.4 cm substantially exceeds any prolonged downturns found during industrialization in several countries that have been studied. Significantly, recovery to levels achieved in the early Middle Ages was not attained until the early 20th century. It is plausible to link the decline in average height to climate deterioration; growing inequality; urbanization and the expansion of trade and commerce, which facilitated the spread of diseases; fluctuations in population size that impinged on nutritional status; the global spread of diseases associated with European expansion and colonization; and conflicts or wars over state building or religion. Because it is reasonable to believe that greater exposure to pathogens accompanied urbanization and industrialization, and there is evidence of climate moderation, increasing efficiency in agriculture, and greater interregional and international trade in foodstuffs, it is plausible to link the reversal of the long-term height decline with dietary improvements. Richard H Steckel

... Or were they?

Page 31: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Variation in height during the day

Page 32: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Did you know that astronauts are up to 2 inches taller while they're in space? As soon as they come back to Earth, though, they return to their normal height.

Imagine that the vertebrae in your back form a giant spring. Pushing down on the spring keeps it coiled tightly. When the force is released, the spring stretches out. In the same way, the spine elongates by up to three percent while humans travel in space.

To some degree, a similar stretching of the spine happens to you every night. When you lie down, gravity isn't pushing down on your vertebrae. Measure your height carefully as soon as you get up or while you are still lying down. You will find that you're about a centimeter or two taller.

Variation in height during the day

Page 33: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Variation with wealth

Page 34: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Mean height of Dutch adults

Page 35: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Time series data

Page 36: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Variation by sex

Men are, on average, taller than women.

But some women are taller than some men.

How can we quantify this?

men and women heights overlap.xls

Page 37: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Data, data everywhere ...

Human height - Wikipedia, the free encyclopedia.htm

Page 38: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Self-reported height

Page 39: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Genetic variation

• Children of tall parents are, on average, – as tall as– taller than– shorter than

their parents ... ?• Parents of tall children are, on average, – as tall as– taller than– shorter than

their children ... ?

Page 40: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Regression to the mean

P P

C C

Individual heights are measured in standard deviations from male mean or female mean as appropriate. Then P is the average of father’s height and mother’s height

C = P C = P

Page 41: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation

–A very simple question, ‘How tall am I’, raises many issues to do with variation

–These issues go to the heart of many statistical concepts

Page 42: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation within groups

This is ‘the usual’ concept of variation: the variability within a population or a sample is measured by the standard deviation or the inter-quartile range

Page 43: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation between groups

Groups may differ from one another. Sometimes the variation between groups is more important, sometimes it is less important, than the variation within groups. (Analysis of variance treats this in fine detail.)

Page 44: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation within individuals

Sometimes the attribute to be measured is not constant:

it may have a cyclical variation it may have a trend over time ... and it may have both

Page 45: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation over time

Where an attribute is observed to vary with time, the variation may be

cross-sectional: “the older ones were like that when they were young”

longitudinal: “that’s what happens as you get older”

... or a combination of the two

Page 46: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–historical variation

A longer-term variation that may be quite distinct from longitudinal or cross-sectional variation

Page 47: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation in definition

Any attribute being measured or counted has first to be defined. It is very common for definitions to vary from one situation to another

Page 48: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–sampling “error” variation

A misnomer, as it is in the nature of samples to vary: it’s not a bug but a feature. We all know that samples vary, but we are often tempted to read more information into a sample than it can actually offer

Page 49: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–sampling bias variation

There are many ways in which a non-random sample can be unrepresentative. Opportunity sampling – measuring or counting whatever is at hand – may be the most common and the most dangerous

Page 50: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–self-reporting variation

Lacks objectivity and so can be deeply misleading. It’s like anecdotal evidence on a large scale

Page 51: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation and correlation

Strong correlations reduce variation: knowing the value of one variable can reduce the uncertainty about another

Page 52: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

• Variation, variation, variation–variation by error

And, underpinning everything else, there are the errors we all make in counting and measuring, recording and tabulating.Even if all the other sources of variation are controlled and understood, our own fallibility ensures that there will always be variation in the data

Page 53: Tall Stories or how a simple question doesn’t always have a simple answer Neil Sheldon Royal Statistical Society Centre for Statistical Education neilsheldon.net

Tall Storieshow a simple question doesn’t always have a simple answer

Neil SheldonRoyal Statistical Society

Centre for Statistical Education

neilsheldon.net