51
CHS October 28, 2003 1 Statistical Rules of Thumb Gerald van Belle Departments of Biostatistics and Environmental Health University of Washington, Seattle, WA.

Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

  • Upload
    vucong

  • View
    225

  • Download
    3

Embed Size (px)

Citation preview

Page 1: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 1

Statistical Rules of Thumb

Gerald van BelleDepartments of Biostatistics and

Environmental HealthUniversity of Washington,

Seattle, WA.

Page 2: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 2

Outline A. Statistical rule of thumbB. Experimental or observational studiesC. CovariationD. Sample sizeE. Presentation of resultsF. Recapitulation

Slides available on WEB at:www.vanbelle.org

Page 3: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 3

A. Statistical rule of thumb-1

A statistical rule of thumb is defined as a widely applicable guide to statistical practice—with sound theoretical basis. Characteristics include intuitive appeal, elegance, and transparency.

Page 4: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 4

A. Statistical rule of thumb-2

1. Statistical rule as quick response a. Typically committee meetingsb. Consultation sessionsc. Need to know the basicsd. May be preaching to the choir

Page 5: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 5

A. Statistical rule of thumb-3

2. What are the basics?a. Definition of statistical rule of thumb b. Characteristics of rule of thumbc. Substantive areas are idiosyncratic. In my

case: environmental studies, epidemiology, statistical consulting

Page 6: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 6

B. Randomized and Observational Studies-1

Rule 1.1 Distinguish between observational and randomized studies.

a. Hohum!b. What’s so nice about randomized studies?c. What’s nice about observational studies?d. Gradation in observational studies.e. Some references.

Page 7: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 7

B. Randomized and observational studies-2

a. Hohum!?* Epidemiology vs biostatistics* Selection issues* Missing data issues* Fragility of causal models

Arm Waving

Page 8: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 8

B. Randomized and observational studies-3

a. Hohum!b. What’s so nice about randomized studies?* Provides a probability model* Rule 6.1 “Randomization puts systematic

sources of variability into the error term.” (DeLury)

* Lack of randomization leads to arm waving

Page 9: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 9

B. Randomized and observational studies-4

a. Hohum!b. What’s so nice about randomized studies?c. What’s nice about observational studies?

* Easier to carry out* Majority of data; e.g. administrative data bases* Ethical constraints on randomization* Lots of data

Page 10: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 10

B. Randomized and observational studies-5

a. Hohum!b. What’s so nice about randomized studies?c. What’s nice about observational studies?d. Gradation in observational studies.

Registry/Cohort

CaseReport

Page 11: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 11

B. Randomized and observational studies-6

e. Some references.Benson and Hartz (2000) NEJMConcato, Shah and Horwitz (2000) NEJMCopas and Li (1997) JRSS BCopas and Shi (2000) BMJHays (2001) Am. Scientist(See May 2002 ROM on website)

Page 12: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 12

C. Statistical rules of thumb for covariation-1

Rule 3.1: “Before choosing a measure of covariation determine the source of the data (sampling scheme), the nature of the variables, and the symmetry status of the measure.”

Page 13: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 13

C. Statistical rules of thumb for covariation-2

Page 14: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 14

C. Statistical rules of thumb for covariation-3

Rule 3.2: “Do not summarize regression sampling schemes with correlations.”

Assume simple linear regression, Y on X

rregression2

1− rregression2 =

rrandom2

1− rrandom2 ×

sx,regression2

sx,random2

Page 15: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 15

C. Statistical rules of thumb for covariation-5

0.630.500.31040.300.200.10010.100.060.0300.250.300.200.100

Ratios2(x)/s2(true)

True r2

Page 16: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 16

C. Statistical rules of thumb for covariation-6

Rule 3: “Assess agreement by addressing accuracy, scale differential, and precision. Accuracy can be thought of as the lack of bias.”

Page 17: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 17

C. Statistical rules of thumb for covariation-7

Model: Y1 and Y2 measured on the same “objects”

212

212

212

21 )1(2)()()( σσρσσµµ −+−+−=−YYE

TotalDeviance

+ +=Bias Scale

differentialImprecision

Lin (1989)

Page 18: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 18

C. Statistical rules of thumb for covariation-8

Model: Y1 and Y2 measured on the same “objects”

)1(2

)(2

)(2

)(

21

221

21

221

21

221 ρ

σσσσ

σσµµ

σσ−+

−+

−=

− YYE

Deviance Bias Scaledifferential

+ Imprecision= +

Page 19: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 19

D. Sample size responsibilities-1

221

212/1 )(

2

+= −−

σµµ

βα zzn

Investigator

Statistician

Investigator

Page 20: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 20

D. Sample size responsibilities-2

Type I error = 0.05 Type II error = 0.20(Power = 0.80)

Two sample(default)

n =16

µ1 − µ2

σ

2

Topic for discussion:Treatment effect+Variability=Effect Size

Page 21: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 21

D. Sample size responsibilities-3Effect Sizeµ1 − µ2

σ

= Effect Size

n4SizeEffect =

Page 22: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 22

E. Presentation of results-1

Rule 7.1: When text, when tables, when graphs?

“Use sentence structure for displaying 2 to 5 numbers, tables for displaying more numerical information, and graphs for complex relationships.”

Page 23: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 23

E. Presentation of results-2

Rule 7.1: When text, when tables, when graphs?

a. Illustration-version 1“The blood type of the population of the

United States is approximately 40%, 11%, 4% and 45% A, B, AB, and O, respectively.”

Page 24: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 24

E. Presentation of results-3

Rule 7.1: When text, when tables, when graphs?

a. Illustration-version 2“The blood type of the population of the

United States is approximately 40% A, 11% B, 4% AB, and 45% O.”

Page 25: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 25

E. Presentation of results-4Rule 7.1: When text, when tables, when graphs?a. Illustration-version 3“The blood type of the population of the United

States is approximately, O 45%A 40%B 11%AB 4%

Page 26: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 26

E. Presentation of results-5

Rule 7.2: Table Structure“Arrange rows and columns in meaningful way,Limit the number of significant digits,Make the table as self-contained as possible,Use white space and lines to organize rows and columns,Do not stint on table headings”

Page 27: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 27

Rule 7.2: Table structureOriginal table on right. 1. Different degrees of precision, due to different sources of data.2. Ordering by alphabet; Spanish version would look different.

Page 28: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 28

Rule 7.2: Table structure1. Reduced number of digits,2. Rows arranged by frequency,3. White space suggests similar groups

Page 29: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 29

E. Presentation of results-6

Rule 7.2: Table StructureSignificant digits for frequencies

Page 30: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 30

Page 31: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 31

Table 7.5 ReformattedNumber of activities

Age70-74

Age 75-79

Age75-79

Age85+

Women %

% % %

0 1-2 3-4 5-7

Mean

17

2765

5.0

1 10 28 61

4.8

2123254

4.5

3193839

4.0 Men

0 1-2 3-4 5-7

Mean

2102661

4.8

2 13 30 55

4.5

3163744

4.2

5233636

4.0

Page 32: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 32

E. Presentation of results-7

Rule 7.3: Graph data“When possible graph the data.”

Page 33: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 33

Page 34: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 34

The following three tables and figures are from a great paper:Gelman, A., Pasarica, C.

and Dohdia, R. (2002). Let’s practice what we preach: Turning tables into graphs. The American Statistician, 56: 121-130.

Page 35: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 35

E. Presentation of results-8

Rule 7.4: Pie Charts“Never use a pie chart. Present a simple list of

percentages, or whatever constitutes the divisions of the pie chart.”

Page 36: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 36

Page 37: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 37

E. Presentation of results-9

Rule 7.5: Bar Charts“Always think of alternatives to a bar graph.”

Page 38: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 38

Page 39: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 39

Page 40: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 40

E. Presentation of results-10

Rule 7.6: Stacked bar charts“There are much more effective ways of

showing data structure than stacked bar charts.”

Page 41: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 41

Page 42: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 42

E. Presentation of results-11

Rule 7.7: Three-dimensional bar graphs“Never use three-dimensional bar graphs.”

Page 43: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 43

Page 44: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 44

E. Presentation of results-12

Rule 7.8: Longitudinal data“In the case of longitudinal data identify both

cross-sectional and longitudinal patterns.”

Page 45: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 45

Page 46: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 46

E. Presentation of results-13

Rule 7.9: High dimensional data“Three key aspects of presenting high dimensional data are: rendering, manipulation, and linking. Rendering determines what is to be plotted, manipulation determines the structure of the relationships, and linkingdetermines what information will be shared between plots or sections of the graph.”

Page 47: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 47

Page 48: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 48

E. Presentation of results-14

1. Chew words carefully2. Watch table manners3. Avoid pies4. Stay away from bars

Page 49: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 49

F. Summary and recapitulation

1. Concept of statistical rule of thumb2. Observation studies are fragile3. Covariation needs to be described correctly4. Logic to presentation of results

Page 50: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 50

G. Resources1. Gerald van Belle, (2002). Statistical Rules

of Thumb, John Wiley and Sons, New York, NY.

2. WEB sites: evolving rapidly4. Statistical books and journals: Audience

recommendations4. My choices at http://www.vanbelle.org5. General purpose books and journals on

science 6. Colleagues. Find a consultants’ consultant

Page 51: Statistical Rules of Thumb - vanbelle.orgCHSTalkNoPictures.pdf · CHS October 28, 2003 3 A. Statistical rule of thumb-1 A statistical rule of thumb is defined as a widely applicable

CHS October 28, 2003 51

H. Acknowledgments

1. DeLury (University of Toronto)2. Steve Millard, Jim Hughes, Michael Levin3. Paul Crane4. Biostatistics, statistics, and epidemiology

colleagues 5. Consultees who sharpened my skills