
1966-1980 WRITTEN EXAMINATIONS FOR THE DEGREES …boos/library/mimeo.archive/ISMS_1981… · 1966-1980 written examinations for the degrees master of public health and master of science


1966-1980 WRITTEN EXAMINATIONS FOR THE DEGREES
MASTER OF PUBLIC HEALTH AND MASTER OF SCIENCE IN PUBLIC HEALTH

IN THE
Department of Biostatistics

School of Public Health
University of North Carolina at Chapel Hill

assembled and edited by

DANA QUADE and MICHAEL J. SYMONS

Institute of Statistics Mimeo Series No. 1329
February 1981


TABLE OF CONTENTS

Page

Introduction 1

1966 4

1967 7

27 April 1968 11

3 May 1969 13

1969 (Re-examination) 14

4 April 1970 15

24 April 1971 19

2 August 1971 (Re-examination) 27

8 April 1972 31

12 May 1972 (Re-examination) 37

31 March 1973 40

13 April 1974 47

12 April 1975 53

3 April 1976 58

2 April 1977 62

8 April 1978 65

7 April 1979 69

12 April 1980 76


INTRODUCTION

This publication contains the written examinations which the Department

of Biostatistics has set for candidates for the degrees Master of

Public Health (MPH) and Master of Science in Public Health (MSPH). A

Department-wide written examination for these degrees was first

instituted in 1966, in order to satisfy the Graduate School requirement

that every Master's candidate pass a comprehensive examination covering

all course work done for the degree. Until recently most candidates

completed these programs in one year of concentrated study, especially

if they had prior experience in public health practice, and they

took the examination during their second semester of graduate work.

However, candidates in two-year programs, which have by now become

typical, generally take the examination in their fourth semester. At

any rate, the examination has been scheduled each Spring (usually in

April) starting from 1966 to the present; there have also been 3

special reexaminations, in 1969, 1971, and 1972.

The Examinations Committee prepares and conducts all Department-wide

written examinations, and handles arrangements for their grading.

The rules under which the examination was conducted for the first

four years are not entirely clear. In 1966 it consisted of 3

questions, while in 1967, 1968, and 1969 (both regular and special)

there were only 2 questions. Starting from 1970, however, the rules

have been explicit and formal. From 1971 through 1976 the examination

was closed-book, with 3 hours allowed in which to answer 3 of 6 questions

(exceptions: 2 of 3 in the 1971 special exam, and 3 of 4 in the 1972

special exam). In 1977 the examination was changed to open-book, with

4 hours allowed in which to answer 3 of 5 questions.

In 1980 the rules changed again, so that the same examination


-2-

could be taken not only by MPH and MSPH candidates but also by candidates

for the degree Master of Science (MS) with major in Biostatistics. Now

the examination is composed of two Parts, Theory and Applications, and

the number of questions to be answered in each part depends on the degree

program as follows:

        Part One: Closed Book Examination    Part Two: Open Book Examination
        (Theory)                             (Applications)

MPH     2 Questions (2 hours)                2 Questions (2 hours)

MSPH    3 Questions (3 hours)                2 Questions (2 hours)

MS      3 Questions (3 hours)                3 Questions (3 hours)

In each part there are two Sections: MPH candidates may choose any 2

questions out of the whole set, MSPH candidates are required to answer

at least 1 question from each Section in Part One and any 2 in Part Two,

while MS candidates are required to answer at least 1 question from

each Section in each Part. Those MSPH candidates not desiring to go on

to the DrPH may answer 2 questions in Part One and 3 questions in Part

Two.

A team of two graders is appointed for each question. Where possible,

all graders are members of the Department of Biostatistics and of the

Graduate Faculty, and no individual serves on more than one team for

the same examination (the two Parts of the Master's Examination counting

separately in this context). The members of each grading team are to

prepare for their question an "official answer" covering at least

the key points. They agree beforehand on the maximum score possible

for each component, the total for any question being 25.

The papers are coded so that the graders are unaware of the

candidates' identities, and each candidate's answer is marked


-3-

independently by each of the two graders. The score awarded reflects

the effective proportion correctly answered of the question. The two

graders then meet together and attempt to clear up any major discrepancies

between their scores. Their joint report is to include comments on

serious shortcomings in any candidate's answer.

On the basis of a candidate's total score on a paper, the

Examinations Committee recommends to the faculty whether the candidate

is to be passed, failed, or passed conditionally. In the last case,

the condition is specified together with a time limit. All final

decisions are by vote of the faculty. Examination papers are not

identified as to candidate until after the verdicts of PASS and FAIL

have been rendered.

Once the decisions have been made, advisors are free to tell their

students unofficially; the official notification, however, is by letter from

the Chairman of the Examinations Committee. Actual scores are

never released, but the "official answers" are made public, and

candidates who are not passed unconditionally are permitted to see

the graders' comments on their papers. When performance was not of

the standard required, candidates are reexamined at a later date

set by the Examinations Committee.

Candidates whose native language is not English are not to be

allowed extra time on Department-wide (not individual course)

examinations. This condition may be waived for individual candidates

at the discretion of the Department Chairman upon petition by the

candidate at least one week prior to the examination.

NOTE. Most of what follows reproduces the examinations exactly as they

were originally set; however, minor editorial changes and corrections

have been made, particularly in order to save space.


-4-

BIOSTATISTICS EXAMINATION

FOR ALL MASTER'S DEGREE STUDENTS

1966.

1. A medical research worker wanted to know whether, for a certain
specified type of patient, a new drug X would be more effective in
relieving pain (i.e., produce higher "pain relief scores") than the
standard drug C. He divided a sample of 30 patients into two equal
groups at random, gave one drug to each group, and obtained the
following scores:

Drug X: 0,0,1,1,2,2,4,5,6,6,6,6,6,6,6

$\sum_{i=1}^{15} x_i = 57$, $\sum_{i=1}^{15} x_i^2 = 303$

Drug C: 0,1,1,2,2,2,3,3,3,3,4,4,4,4,6

$\sum_{i=1}^{15} y_i = 42$, $\sum_{i=1}^{15} y_i^2 = 150$

Give several essentially different statistical descriptions of these
data, and base a test of significance on one of them. Justify your
choice of test and state the assumptions it involves. What conclusion
do you draw with respect to the research worker's original question?
Write a brief report including your descriptions, analysis, and
conclusion.
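As one illustration of the kind of description and test this question asks for, the sketch below recomputes the quoted sums and forms a pooled two-sample t statistic. The choice of the t test is an assumption made purely for illustration (the many tied scores might equally argue for a rank test), and the helper name `pooled_t` is hypothetical.

```python
from math import sqrt
from statistics import mean, variance

drug_x = [0, 0, 1, 1, 2, 2, 4, 5, 6, 6, 6, 6, 6, 6, 6]
drug_c = [0, 1, 1, 2, 2, 2, 3, 3, 3, 3, 4, 4, 4, 4, 6]


def pooled_t(a, b):
    """Two-sample t statistic assuming equal variances (illustrative choice)."""
    na, nb = len(a), len(b)
    sp2 = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(sp2 * (1 / na + 1 / nb))


# Sums and sums of squares, to check against the values printed with the data.
sum_x, sumsq_x = sum(drug_x), sum(v * v for v in drug_x)
sum_c, sumsq_c = sum(drug_c), sum(v * v for v in drug_c)

# Refer this statistic to a t table on 15 + 15 - 2 = 28 degrees of freedom.
t_stat = pooled_t(drug_x, drug_c)
```

Since the research worker's question is directional (is X more effective?), a one-sided reading of the statistic would be the natural one under this choice of test.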


-5-

2. The following data give the number of pounds of weight gained in a
random sample of 20 children in the first and second years of life. It
has been suggested that the weight gain in the second year might be
predicted by simply subtracting a constant from the weight gain in the
first year. If so, what constant should be subtracted? Can you find
a significantly better prediction method?

Child    First Year   Second Year      Child    First Year   Second Year
Number      (x)          (y)           Number      (x)          (y)

  1.       20.2         17.2            11.       29.6         23.3
  2.       20.4         22.3            12.       30.3         26.4
  3.       22.5         19.5            13.       30.4         30.3
  4.       25.7         24.2            14.       32.3         29.4
  5.       25.8         21.0            15.       32.7         28.7
  6.       25.9         23.9            16.       35.0         33.2
  7.       27.7         24.5            17.       37.1         31.7
  8.       28.1         27.5            18.       37.4         32.1
  9.       28.3         24.4            19.       38.1         38.1
 10.       29.4         28.7            20.       39.1         32.9

$\sum x = 596.0$, $\sum x^2 = 18361.12$, $\sum (x-\bar{x})^2 = 600.3200$

$\sum y = 539.3$, $\sum y^2 = 15064.33$, $\sum (y-\bar{y})^2 = 522.1055$

$\bar{x} = 29.800$, $\sum xy = 16585.00$, $\sum (x-\bar{x})(y-\bar{y}) = 513.8600$

$\bar{y} = 26.965$
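One possible line of attack, sketched here from the summary statistics alone, is to compare the "subtract a constant" rule with an ordinary least-squares line. Treating the simple linear regression as the candidate "better method" is an assumption for illustration; the variable names are hypothetical.

```python
# Summary statistics quoted with the data (n = 20 children).
n = 20
x_bar, y_bar = 29.800, 26.965
sxx, syy, sxy = 600.3200, 522.1055, 513.8600

# Model A: y = x - c. Least squares gives c = x_bar - y_bar.
c = x_bar - y_bar
rss_a = syy - 2 * sxy + sxx          # residual sum of squares, n - 1 d.f.

# Model B: y = a + b * x (simple linear regression).
b = sxy / sxx
a = y_bar - b * x_bar
rss_b = syy - sxy ** 2 / sxx         # residual sum of squares, n - 2 d.f.

# F statistic on 1 and n - 2 d.f. for the improvement of B over A,
# which is equivalent to testing whether the slope equals 1.
f_stat = (rss_a - rss_b) / (rss_b / (n - 2))
```

The statistic `f_stat` would be referred to an F table with 1 and 18 degrees of freedom to judge whether the fitted line is a significantly better predictor than the constant-difference rule.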


-6-

3. The Congress of the United States is currently discussing the
question whether or not to continue the food supplementation program
in the public schools. This consists of providing milk and other
foods for the school lunch program so that every child can participate
either at no cost or at a very nominal price.

Suppose the Director of Public Health Nutrition in X State
Health Department consults you as the staff biostatistician to help
him to evaluate the effectiveness of this program in X state. He
feels that if sound data can be presented to the Department of
Agriculture regarding the effectiveness of this program, that
Department will have a stronger case in arguing the merits of
continuing the school lunch program.

Describe what you would recommend to the Director of Public

Health Nutrition and what questions you might ask him about the

school lunch program. Assuming a given set of answers to those

questions, how would you proceed to design a study to answer the

question of effectiveness of the school lunch program? If your

recommendation includes the collection of data in certain schools,

indicate how you would select the schools, how many schools, how many

children, and other details necessary in the planning of such a study.


-7-

BIOSTATISTICS EXAMINATION

FOR ALL MASTER'S DEGREE STUDENTS

1967

The aim of this examination is to find out something about the

student's general ability and ingenuity in tackling a problem of the kind

which might be brought to him as a statistician in a health agency. You

will thus see that the two questions which follow ask you to describe

approaches to solution; they do not call for detailed solutions. Be as

brief and specific as you can in giving your answers, with the purpose of

showing clearly what you have in mind and why it is appropriate.

1. It is believed that the position of the lower incisor (tooth) plays an
important role in the aesthetics of facial features. It is commonly
assumed that the angle called IMPA (Lower Incisor - Mandibular Plane
Angle), which we will denote by y, should have a certain value related
to general facial features; this value varies between about 80° and 110°
for various combinations of facial features.

In view of the above, an orthodontic treatment is to place (push
or pull) the mandible so that the desired angular position can be
approximately obtained.

But it has been observed that, after some period, the teeth of
some individuals return back to the initial position, those of others
move partially back, and the teeth of some stay on the position to
which they were placed by the treatment.

We will, then, distinguish three positions of the lower incisor:

(i) Position 1, that is, the position at the initial stage,
before the treatment was applied; the measurements of IMPA here are denoted
by Y1;


-8-

(ii) Position 2, that is, the position of the lower incisor
at the conclusion of the treatment, at the time where the patient was
dismissed; measurements of IMPA here are denoted by Y2;

(iii) Position 3, that is, the position which the lower incisor
reaches at some stated time after the treatment; the measurements of
IMPA here are denoted by Y3.

In a certain study, there were three time periods after which
Y3 was measured in three independent groups of patients:

Group I     Y3 measured after 1-2 years (15 patients)
Group II    Y3 measured after 2-3 years (14 patients)
Group III   Y3 measured after 3 years (7 patients).

The general problem which arises here is: Why does the mandible,
after treatment, try to return back to the initial position in some
individuals, but stay in the position attained by the treatment in other
individuals?

An important question in the matter is: Does the "amount of
relapse" depend on the amount of treatment (magnitude of change in y
from Y1 to Y2), or is it related to the initial position, or to both:
amount of treatment and initial position?

1. Describe and comment on statistical methods which you would
like to apply to answer this question. Use the enclosed data to make
some representative calculations. (A complete analysis is not expected.)

2. How could you improve the design of the experiment?


-9-

Stability of the Mandibular Incisor

Patient   Group I (1-2 years)   Group II (2-3 years)   Group III (over 3 years)
No.         Y1    Y2    Y3        Y1    Y2    Y3          Y1    Y2    Y3

  1         85    82    82        94    94    99         103   106   106
  2         95    97    96        98    93    89          95   100    97
  3         94    88    84?       97    94    96          94   101    95
  4         99    91    91       100   105    89          93    87    90
  5         86    85    89       100   101   107          92    99    99
  6         93    87    88        93   101   103          85    82    86
  7        100    90    88        91    94    95          84    87    83
  8         92    98   103        90    82    89
  9         99    96    92        86    86    78
 10         91    91    94        89    88    88
 11         83    86    86        95    99    92
 12         77    87    87        92    94    98
 13         98    96    96       108   101   103
 14         97    97   100        94    93    90
 15         91    89    89
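A "representative calculation" of the kind the question invites might look like the sketch below, which uses only the seven Group III patients. Defining relapse as Y3 - Y2 and amount of treatment as Y2 - Y1, and summarizing their association with a Pearson correlation, are assumptions chosen for illustration; `pearson_r` is a hypothetical helper.

```python
from math import sqrt

# Group III (over 3 years): columns Y1, Y2, Y3 from the table above.
y1 = [103, 95, 94, 93, 92, 85, 84]
y2 = [106, 100, 101, 87, 99, 82, 87]
y3 = [106, 97, 95, 90, 99, 86, 83]

treatment = [b - a for a, b in zip(y1, y2)]   # change produced by treatment
relapse = [c - b for b, c in zip(y2, y3)]     # movement after dismissal


def pearson_r(u, v):
    """Product-moment correlation between two equal-length lists."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / sqrt(suu * svv)


r_treatment = pearson_r(treatment, relapse)   # relapse vs. amount of treatment
r_initial = pearson_r(y1, relapse)            # relapse vs. initial position
```

With only seven patients such correlations are very imprecise; a fuller answer would also consider a regression of relapse on both predictors within each group.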

2. Assume that a program has just been initiated in a region of the
size of North Carolina to provide for more widespread use of the
latest medical advances in diagnosis and treatment of heart disease,
cancer, and stroke. The program will be administered by a medical
school and place primary emphasis on dissemination of new knowledge
about care for these diseases to medical personnel throughout the
region for use in patient care. Numerous special short training
courses will be given to assist in providing up to date information
to medical personnel.


-10-

The region to be served by the program has a population of
approximately 5,000,000 persons about equally divided between urban
and rural communities. About one fourth of the population is over
45. Over half of the deaths have been due to heart disease, cancer,
or stroke in recent years. The region has three medical schools,
about a dozen major hospitals affiliated with the medical schools,
and scattered smaller general hospitals and physicians in private
practice.

You have been asked to serve as a consultant in planning

and subsequent evaluation of training programs for the region.

Briefly indicate analyses you would suggest on:

1. The nature and magnitude of the problem of morbidity

and mortality due to heart disease, cancer, and stroke

among different population groups in the region.

2. Availability of and needs for trained physicians, nurses,
and related personnel for care of patients with the
specified diseases.

3. Training provided under the program, such as number
of persons of various medical specialties trained,
kinds of training, and costs.

4. Effectiveness of the training programs in bringing
better patient care and reduced morbidity and mortality
due to heart disease, cancer, and stroke.

5. Possible uses of computers in preparing and maintaining
desired information for use in the program.

Give particular attention to the fourth item in your discussion.


-11-

BIOSTATISTICS EXAMINATION
FOR ALL MASTER'S DEGREE STUDENTS

27 April 1968

I. Experiment:

It has been postulated that the volume of a cake is influenced very
substantially by batter temperature. Using a specific cake recipe, a
large amount of batter was made, and then divided into 20 equal portions
(aliquots). It was decided to see if the cake volume was different at
4 different batter temperatures, 60°, 70°, 80°, and 90°. Each aliquot
was allotted to a temperature at random until there were 5 aliquots on
each temperature. The cakes were baked one at a time and the data were
recorded as follows:

Cake No.   Temp.   Volume (coded)      Cake No.   Temp.   Volume (coded)

   1        60°         6                 11       80°         2
   2        60°         8                 12       80°         4
   3        60°         8                 13       80°         5
   4        60°         4                 14       80°         2
   5        60°         4                 15       80°        +2
   6        70°         6                 16       90°         2
   7        70°         9                 17       90°         4
   8        70°        10                 18       90°         3
   9        70°         8                 19       90°        -2
  10        70°         7                 20       90°        -2

The Analysis of Variance was computed for this example and is
recorded as follows:

ANOVA

Source of Variation    d.f.    s.s.    m.s.      F
Total                   19     211
Bet. Temperatures        3     145     48.33    11.71*
Residual                16      66      4.125

Requirements:

1. State the assumptions underlying the analysis of variance done
in the above table.

2. Indicate where each assumption is needed in the analysis of the
above data.

3. Devise a method for checking at least two of the assumptions.

4. Complete the analysis of the experiment indicating your
conclusions (use α = 5%).
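As a check on the tabulated analysis of variance, the following sketch recomputes the between-temperature and residual sums of squares directly from the twenty coded volumes. It is a plain one-way layout calculation; the variable names are illustrative only.

```python
# Coded cake volumes by batter temperature, as tabulated above.
volumes = {
    60: [6, 8, 8, 4, 4],
    70: [6, 9, 10, 8, 7],
    80: [2, 4, 5, 2, 2],
    90: [2, 4, 3, -2, -2],
}

all_obs = [v for group in volumes.values() for v in group]
grand_mean = sum(all_obs) / len(all_obs)

# Between-temperature sum of squares: group size times squared deviation
# of each group mean from the grand mean.
ss_between = sum(
    len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in volumes.values()
)
# Residual (within-temperature) sum of squares.
ss_within = sum(
    (v - sum(g) / len(g)) ** 2 for g in volumes.values() for v in g
)

df_between = len(volumes) - 1
df_within = len(all_obs) - len(volumes)
f_ratio = (ss_between / df_between) / (ss_within / df_within)
```

The group means (6, 8, 3, and 1) suggest that completing the analysis could include comparisons among temperatures or a trend in temperature, though the question leaves that choice to the candidate.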


II.

-12-

As statistician in the State Health Department, you have been
given the task of designing a study to determine the relationship (if
any) between maternal rubella (German measles) and congenital
malformations in infants. The choice between a retrospective and prospective
study is yours. Ignoring cost, prepare and briefly discuss your design
for such a study for the State of North Carolina. Emphasize statistical
problems which might be encountered at all stages (including analysis)
and your solution to them.


-13-

DEPARTMENT OF BIOSTATISTICS

EXAMINATION FOR ALL MASTER'S DEGREE CANDIDATES

May 3, 1969

I. The data below constitute independent random samples from two Poisson
distributions with parameters λ1 and λ2.

              0    1    2    3    4    5   Total

Sample 1     18   19   11    1    0    1     50

Sample 2      9   16   23    0    0    2     50

a) Describe two distinct ways of testing the hypothesis H0: λ1 = λ2.

b) Perform one of these tests, with α = .05. State why you chose this
test. Write up your conclusions.
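One of the possible tests can be sketched as follows: because the sum of independent Poisson counts is again Poisson, the two sample totals can be compared conditionally on their sum. Choosing this conditional (binomial) test, with its normal approximation, is an assumption for illustration; other valid approaches exist.

```python
from math import comb, sqrt

values = [0, 1, 2, 3, 4, 5]
sample1 = [18, 19, 11, 1, 0, 1]
sample2 = [9, 16, 23, 0, 0, 2]

# Total number of events observed in each sample of 50.
t1 = sum(v * f for v, f in zip(values, sample1))
t2 = sum(v * f for v, f in zip(values, sample2))
n = t1 + t2

# Conditional on n, t1 is Binomial(n, 1/2) under H0 because the two
# samples have the same size (50 each).
z = (t1 - t2) / sqrt(n)               # normal approximation

# Exact two-sided p-value: double the smaller binomial tail.
lower_tail = sum(comb(n, k) for k in range(0, min(t1, t2) + 1)) / 2 ** n
p_exact = min(1.0, 2 * lower_tail)
```

A second, distinct approach would compare the two frequency distributions as a contingency table, which does not use the Poisson assumption as directly.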

II. Suppose you have been called into a county as a statistical consultant to
help set up a Comprehensive Health Planning Agency. Primarily you are asked
to construct the health information system necessary for community health
planning on a broad scale.

The permanent population of this county is about 200,000, but it increases
seasonally with an influx of tourists and migrant laborers. The principal
sources of income are agriculture, tourism, and light industry. It is
approximately 60% urban.

You will have a generous budget, with computing facilities available and
personnel to staff and operate any system that you design. The Planning
Agency has liaison with all county and city government agencies, hospitals,
the local chapter of the Medical Society, and other voluntary agencies. You
can secure information from any of these sources.

The Health Information System will gather, tabulate, analyze, and interpret
data as well as monitor community health, evaluate programs, and define unmet needs.

a) Describe the type of system and data needed to accomplish the tasks,
using examples to illustrate the general considerations.

b) What statistical techniques or studies will be used to gather and
analyze the various types of needed information? Again, examples may
be employed to suggest specific illustrations.

c) Discuss the type of quality checks necessary to test adequacy of the
data. Also outline the phases that the system will go through,
distinguishing between the information needed for initial planning
and what will be gathered as the planning agency becomes more
sophisticated in its needs and operation.


-14-

(1969 RE-EXAMINATION)

MASTER'S EXAMINATION

1. According to binomial theory, if we take two fair coins and toss them 100 times
each, we should observe 0 heads 25 times, 1 head 50 times, and 2 heads 25 times.
That is, the expected outcomes should be

Outcome                    Frequency

0 heads   T,T                 25
1 head    T,H or H,T          50
2 heads   H,H                 25

In a trial, the following results were obtained:

Outcome      Frequency

0 heads         20
1 head          40
2 heads         40

a) Test to see if these results agree with binomial theory. Use α = .05.

b) Is the test you use a test of independence, homogeneity or goodness-of-fit?
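The comparison of observed with expected frequencies in part a) can be sketched with a Pearson chi-square statistic. Using this particular statistic is an assumption about the intended method; the closed-form tail probability below holds only because there are exactly 2 degrees of freedom.

```python
from math import exp

observed = [20, 40, 40]   # 0, 1, 2 heads in the trial
expected = [25, 50, 25]   # binomial theory for two fair coins

chi_sq = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1    # no parameters are estimated from the data

# For exactly 2 degrees of freedom the chi-square upper tail is exp(-x / 2).
p_value = exp(-chi_sq / 2)
reject = p_value < 0.05
```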

2. Race-specific perinatal mortality rates* for North Carolina are computed annually
for each county in the State. The Maternal and Child Health Director wishes to
set up some scheme whereby he can determine whether the annual race-specific rates
for each county are "normal", "too high", or "too low". His ultimate objective is
to identify counties with rates "too low" and then attempt to find out why they
are so low. Similarly, he wants to identify counties with rates "too high", find
out why, and take remedial action to lower the rates.

What scheme would you recommend to him for making the determinations of "normal",
"too low", and "too high" perinatal rates? If there are limitations in your
scheme which the MCH Director should know about, specify what they are. Also,
specify any other precautions (if any) which he should be aware of in using your
scheme for his intended purpose.

In 1967 there were [illegible] white and [illegible] nonwhite perinatal deaths in N. C. The
rates were: Total State 35.2; white 28.3; nonwhite 50.6.

*Perinatal mortality rate:

$$\frac{\text{fetal deaths 20 wks. gestation or over} + \text{deaths to live born infants occurring under 28 days}}{\text{fetal deaths 20 weeks gestation or over} + \text{all live births}} \times 1000$$


1.

-15-

DEPARTMENT OF BIOSTATISTICS

Examination for M.P.H. and M.S.P.H. Candidates

Saturday, April 4, 1970
9:00-12:00 A.M.

EDITORIAL NOTE: A table was included which gave the natural logarithms

of the integers from 2 to 22.

a) Massachusetts and Mississippi each claimed a lower tuberculosis death
rate than the other in 1950. Using the following data, show how both
claims can be substantiated, and explain the paradox.

Population and Number of Tuberculosis Deaths, by Color, in Massachusetts
and Mississippi, 1950

                      Total        White      Nonwhite

Massachusetts
  Population      4,690,514    4,611,503        79,011
  TB deaths*          1,005          942            63

Mississippi
  Population      2,178,914    1,188,632       990,282
  TB deaths*            573          191           382

* Deaths from all forms of tuberculosis

b) Given that the United States had a population of 151,325,798 in 1950,
of which 89.31% was white, calculate suitably standardized tuberculosis
mortality rates for Massachusetts and Mississippi.

c) Discuss how it might be decided whether the rates for these two states
are significantly different.
(But do not actually carry out the calculations for any statistical test.)
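The paradox in part a) and one version of the standardization in part b) can be sketched numerically. Direct standardization to the 1950 United States color distribution is assumed here as the "suitable" method, and rates are expressed per 100,000; both choices, and all variable names, are illustrative.

```python
# (population, TB deaths) by state and color, from the table above.
data = {
    "Massachusetts": {"white": (4_611_503, 942), "nonwhite": (79_011, 63)},
    "Mississippi": {"white": (1_188_632, 191), "nonwhite": (990_282, 382)},
}
# Standard population: 1950 United States, 89.31% white.
std_weights = {"white": 0.8931, "nonwhite": 0.1069}


def rate(pop, deaths):
    """Death rate per 100,000 population."""
    return 100_000 * deaths / pop


crude, specific, standardized = {}, {}, {}
for state, groups in data.items():
    pop = sum(p for p, _ in groups.values())
    deaths = sum(d for _, d in groups.values())
    crude[state] = rate(pop, deaths)
    specific[state] = {g: rate(p, d) for g, (p, d) in groups.items()}
    # Directly standardized rate: color-specific rates weighted by the
    # standard population's color distribution.
    standardized[state] = sum(
        std_weights[g] * specific[state][g] for g in groups
    )
```

Massachusetts has the lower crude rate, while Mississippi has the lower rate within each color group; the standardized rates remove the effect of the two states' very different color compositions.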


2.

-16-

a) List the major functional components of a computer.

b) List two sources or packages of statistical programs.

c) If you were about to store a large data set, what features of the data
set would determine whether to store it on cards, disk or tape?

d) What is the essential characteristic of a sequential data set? Give
an example of (i) a sequential data set; (ii) a data set with some
organization other than sequential (machine readable or not).

e) What is the purpose of having physical records which contain more than
one logical record? Elaborate.

f) (i) Which field is major?
   (ii) Which field is minor?
   (iii) Is field 1 ascending or descending?
   (iv) Is field 2 ascending or descending?
   (v) Is field 3 ascending or descending?

Field:   1   2   3

         4   1   4
         3   1   4
         2   1   4
         1   1   4
         4   2   4
         3   2   4
         2   2   4
         2   2   4
         1   2   4
         4   1   0 (zero)
         3   1   0
         2   1   0
         2   2   0
         1   2   0


-17-

3. An opinion poll showed that, of 3600 people interviewed, 2304 were in
favor of allowing birth control information to be made available to single
women.

Note: 2304/3600 = .64

(a) Find an approximate two-sided 95% confidence interval for p (the
proportion in the population who are in favor).

(b) Test the hypothesis

H0: p = .7

against

HA: p ≠ .7

with α = .05

(c) What is the power of this test if p = .65?
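The three parts of this question can be sketched with the usual large-sample normal approximation to the binomial. The Wald form of the interval and the use of the null standard error in the test are conventional choices assumed here, not details taken from the examination.

```python
from math import sqrt
from statistics import NormalDist

n, favor = 3600, 2304
p_hat = favor / n
z_crit = NormalDist().inv_cdf(0.975)   # two-sided 5% critical value

# (a) Approximate 95% confidence interval for p.
se_hat = sqrt(p_hat * (1 - p_hat) / n)
ci = (p_hat - z_crit * se_hat, p_hat + z_crit * se_hat)

# (b) Two-sided test of H0: p = 0.7 using the standard error under H0.
p0 = 0.7
se0 = sqrt(p0 * (1 - p0) / n)
z = (p_hat - p0) / se0
reject = abs(z) > z_crit

# (c) Power when the true proportion is 0.65: probability that the
# sample proportion falls in either tail of the rejection region.
p1 = 0.65
sampling = NormalDist(mu=p1, sigma=sqrt(p1 * (1 - p1) / n))
power = sampling.cdf(p0 - z_crit * se0) + 1 - sampling.cdf(p0 + z_crit * se0)
```

With a sample of 3600 the standard error is under one percentage point, which is why a true value of .65 is detected almost surely.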

4. One of the so-called Pascal distributions has the probability function

f(x) = [formula illegible in source]

(a) Find the maximum likelihood estimator of the parameter a, given a
random sample of size n from a population having the above distribution.

(b) Given a random sample as in (a), determine the form of the critical
region for the likelihood ratio test of the hypothesis H0: a = a0
against the alternative HA: a ≠ a0.

(c) Using the fact that, for large n, the distribution of -2 ln λ, where
λ is the likelihood ratio statistic, is approximately the chi-square
distribution with 1 degree of freedom, apply the test (b) to testing
H0: a = 2 versus HA: a ≠ 2, at the 5% level of significance, given a
random sample of [illegible] observations yielding sample mean 1.

5. In the United States, the vital registration system and the decennial

Census provide data essential in many areas. Evaluate this "dual system"

from the point of view of providing demographic statistics needed to adequately

measure population change. Suggest remedies for any deficiencies

you find.


-18-

6. A school mental health consultation program is to be evaluated at a public
elementary school. The purpose of the consultation program is to increase
the level of mental health of each school child and/or decrease the level
of mental [illegible]. This is to be accomplished by offering consultation
to the classroom teacher so that she or he can handle mental health problems
which do occur in the classroom as well as promote positive mental health in
the classroom. Assume for this design that the teacher can, with consultation,
handle all children in the classroom and does not need to refer any
children for more extensive, individualized aid.

Four "equivalent" classrooms in the school are chosen. Two of the classrooms
are given a pre-test which measures the level of mental health of every
child in the classroom. One of these classrooms gets the consultation program;
the other classroom does not. The other two classrooms do not receive
the pre-test. One of these classrooms receives the consultation program,
and the other classroom does not. All four classrooms receive the post-test
at the end of the school year. (The post-test is the same test as the
pre-test.) These classrooms are equivalent in the sense that if all four
classrooms were pre-tested, the average level of mental health would be the
same for all four classrooms.

This design can be represented as

               Pre-Test         Consultation Program            Post-Test
               October, 1969    Nov., 1969 thru April, 1970     May, 1970

Classroom 1    yes              yes                             yes
Classroom 2    yes              no                              yes
Classroom 3    no               yes                             yes
Classroom 4    no               no                              yes

The first three questions below are statistical "design" questions. In
answering these questions, simply state which classrooms you would use and
in what manner (e.g. pre-test scores on Classroom X compared to difference
scores between pre-test and post-test on Classroom Y). Do not specify
statistical analyses such as analysis of variance, contingency tables, etc.

a) How would you test for the effectiveness of the consultation program?

b) How would you test for the possibility that the pre-test affects the
post-test in the absence of the consultation program?

c) How would you test for the possibility that the pre-test affects the
post-test in the presence of the consultation program, i.e. that the
pre-test and the consultation program interact?

d) In this design classrooms are confounded with teachers since the
classrooms in the school are self-contained. Assuming that you can get four
classrooms that are equivalent, what variables would you like to measure
to assess whether or not the four teachers are equivalent for the
purposes of this program evaluation?

e) Comment on the use of the same test for pre-test and post-test measures.


-19-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 A.M., Saturday, April 24, 1971)

1.

(15 points) a) The following data are from "Increases in Divorces -

United States - 1967," Series 21, Number 20 of Vital and Health

Statistics, published by National Center for Health Statistics,

December 1970. The table below gives the age specific death

rates for married and single males in the U. S. in the period

1959-1961. (Data for divorced and widowed males are available

in the publication but not included here.)

Death Rates per 1000 Population, by Age and Marital Status:
U. S. Males, 1959-61

Age in Years    Married    Single

Total, 15+        12.8        7.5

15-19              1.3        1.3
20-24              1.3        2.2
25-29              1.3        2.8
30-34              1.6        4.1
35-39              2.3        5.9
40-44              3.8        8.8
45-49              6.3       12.5
50-54             10.6       18.1
55-59             15.9       23.5
60-64             25.0       37.4
65-69             36.6       53.2
70-74             52.4       74.9
75-79             76.8      105.0
80-84            121.3      158.7
85+              191.1      231.6

Note that, for any given age category, the death rate for

married males never exceeds the death rate for single males.

However, the overall death rate for married males (12.8) is

70% larger than the overall death rate for single males (7.5).

Explain mathematically how this can happen, and give the

particular reason why it happens in this set of data.


-20-

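(Question 1, continued)

(10 points) b) In an article on patients with coronary occlusion, the following

data were presented for 330 patients who had had pre-existing

anginal symptoms.

Distribution of time between onset of symptoms and

first attack of coronary occlusion

Time            Number of patients

1 day                    4
2-6 days                12
1-4 weeks               20
1-2 months              28
3-6 months              54
7-12 months             45
1-2 years               50
2-3 years               23
3-5 years               38
5-10 years              36
10+ years               20

Total                  330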

Comment on the author's remark: "Occlusion occurs most

frequently between 3 months and 2 years after onset of

anginal symptoms".


-21-

2.

(6 points) a) List the three most important types of statements in JCL and

briefly explain the purpose of each (what does it do?).

(4 points) b) Briefly describe a "standard data matrix." Illustrate with

a figure.

(9 points) c) Give a short description or definition of each of the following

terms as used in data processing.

(i) byte

(ii) field

(iii) logical record

(iv) physical record (what is the distinction between

logical and physical records?)

(6 points) d) (i) For a sequential data set to be written on magnetic

tape, why would one generally prefer a blocksize of

8000 bytes to a blocksize of 80 bytes?

(ii) What is the principal advantage of shorter blocksizes?


-22-

3. Suppose you have designed a before-after experiment with control group.

In other words, you have matched two groups, pair by pair, and have before

and after measures on both groups. Make use of the t test to test for the

effectiveness of your experimental variable

(5 points) a) using only the "after" scores, ignoring the "before" scores,

(5 points) b) using the "before" and "after" scores of the experimental

group only, and

(10 points) c) using all four sets of scores.

(5 points) Contrast the advantages and disadvantages of methods (a) and (b).

What are the advantages of (c) over both (a) and (b)?

        Control group       Experimental group
Pair    Before    After     Before    After

A         72       75         66       77
B         61       60         61       65
C         48       37         43       49
D         55       64         55       53
E         81       76         76       91
F         50       59         52       68
G         42       49         40       51
H         64       55         65       74
I         77       75         67       79
J         69       78         64       63

Note. Do not perform any computations but indicate what needs to be

calculated for each of (a), (b), and (c).


-23-

4. In a certain probability situation, the probability of the occurrence of

either event A or event B (or both) is 0.7, the conditional probability

of A given B is 2/3, and the conditional probability of B given A is 4/5.

Find the probability of each of the following events:

(6 points) a) A occurs,

(6 points) b) B occurs,

(7 points) c) both A and B occur,

(6 points) d) neither A nor B occurs.
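
One way to check an answer to Question 4 is to express every probability in terms of x = P(A and B) and apply the addition rule. The sketch below is an illustration only and is not part of the original examination; the variable names are hypothetical.

```python
from fractions import Fraction

# Quantities given in the question.
p_union = Fraction(7, 10)      # P(A or B)
p_a_given_b = Fraction(2, 3)   # P(A | B)
p_b_given_a = Fraction(4, 5)   # P(B | A)

# With x = P(A and B): P(A) = x / P(B|A) and P(B) = x / P(A|B).
# The addition rule P(A or B) = P(A) + P(B) - x then gives
# P(A or B) = x * (1/P(B|A) + 1/P(A|B) - 1), which is solved for x.
x = p_union / (1 / p_b_given_a + 1 / p_a_given_b - 1)

p_both = x
p_a = x / p_b_given_a
p_b = x / p_a_given_b
p_neither = 1 - p_union

print(p_a, p_b, p_both, p_neither)
```

Exact fractions are used so that the check is free of rounding error.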

EDITORIAL NOTE: Question #5 begins on the next page.

6. Assume that for a population of animals you obtain the following mortality

statistics per 1000. All animals who die, do so at the anniversary of

their birth.

Age (x)            0     1     2     3     4     5
Deaths at age x    0   100   150   200   250   300

Calculate

(10 points) a) The life expectancy.

(5 points) b) The mean age of the stationary population.

(5 points) c) The probability that an animal approaching its second

birthday will survive until age 4.

(5 points) d) The life table death rate.
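
The life table quantities in this question can be sketched numerically as follows. This is an illustration under stated assumptions, not an official solution: because all deaths fall exactly on birthdays, the number alive is taken to be constant on each interval [x, x+1), and the two readings of "survive until age 4" in part (c) are both computed since the wording admits either. All names are hypothetical.

```python
from fractions import Fraction

# Deaths per 1000 births at exact age x (all deaths occur on birthdays).
deaths = {0: 0, 1: 100, 2: 150, 3: 200, 4: 250, 5: 300}
radix = sum(deaths.values())  # 1000 births

# a) Life expectancy at birth = mean age at death.
e0 = Fraction(sum(age * d for age, d in deaths.items()), radix)

# Number alive throughout [x, x+1): everyone who has not died at an age <= x.
alive = {}
remaining = radix
for age in sorted(deaths):
    remaining -= deaths[age]
    alive[age] = remaining

# b) Mean age of the stationary population: animal-years lived in [x, x+1)
#    weighted by the interval midpoint x + 1/2.
animal_years = sum(alive.values())
mean_age = Fraction(sum(n * (2 * age + 1) for age, n in alive.items()),
                    2 * animal_years)

# c) An animal approaching its second birthday is one of alive[1].
p_alive_just_before_4 = Fraction(alive[3], alive[1])   # reaches age 4
p_alive_just_after_4 = Fraction(alive[4], alive[1])    # also survives that birthday

# d) Life table death rate = deaths per animal-year lived = 1 / e0.
death_rate = Fraction(radix, animal_years)

print(e0, mean_age, p_alive_just_before_4, p_alive_just_after_4, death_rate)
```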


-24-

5. "Q-Sort Methodology" is used in the social sciences to obtain from a person

a description of himself or some other person. For example a "Q-Sort deck"

may contain N=94 cards, where each card has a statement about a child's

behavior such as "My child has temper tantrums almost every day." The child's

parents, independently of one another, are then asked to "sort" these 94

statements as they pertain to their child. The 94 cards are to be sorted

into the following categories, where the number of cards which end up in

each category is fixed in advance.

TABLE 1

Specification of Q-Sort, N=94 Statements

Category and
Score for that                                                              Number of Cards
Statement        Description of Category                                    in this Category

1                Highly uncharacteristic of my child                               5
2                Quite uncharacteristic of my child                               10
3                Somewhat uncharacteristic of my child                            18
4                Neither characteristic nor uncharacteristic of my child          28
5                Somewhat characteristic of my child                              18
6                Quite characteristic of my child                                 10
7                Highly characteristic of my child                                 5

                                                                                  94

Hence, each parent will finish the Q-sort by identifying exactly 5

statements out of the 94 that are highly characteristic of the child,

exactly 10 statements that are quite characteristic of the child, etc.


-25-

(Question 5, continued)

The "score" for statement j, j=1,2,...,94, is determined by the category

into which the parent puts statement j. Let $x_j$ and $y_j$, j=1,2,...,94, be

the assigned score for statement j by the mother and father, respectively.

Thus, the response of each parent can be described by a multivariate

observation vector of 94 elements, where each element assumes the value

of some integer in the set {1,2,3,4,5,6,7} - with the restriction that

the frequency distribution of the 94 elements follows the specification

in TABLE 1.

(5 points) a) Show that the average score is the same for each parent.

That is, show that $\bar{x} = \bar{y}$, where

$$\bar{x} = \frac{1}{94}\sum_{j=1}^{94} x_j \quad\text{and}\quad \bar{y} = \frac{1}{94}\sum_{j=1}^{94} y_j .$$

(5 points) b) Show that the variability of the scores for each parent is

the same. That is, show that $S_x^2 = S_y^2$, where

$$S_x^2 = \frac{1}{94}\sum_{j=1}^{94} (x_j-\bar{x})^2 \quad\text{and}\quad S_y^2 = \frac{1}{94}\sum_{j=1}^{94} (y_j-\bar{y})^2 .$$


-26-

(Question 5, continued)

(15 points) c) One purpose of having both parents do the Q-sort is to compare

the two Q-sorts to find out if both parents view their child

in similar or different ways. Two ways of measuring the

similarity between the 2 Q-sorts are: the correlation

coefficient $r_{xy}$ and the sum of squared differences $D^2$. The

formulas for these measures are:

$$r_{xy} = \frac{\frac{1}{N}\sum_{j=1}^{N} (x_j-\bar{x})(y_j-\bar{y})}{S_x S_y}, \qquad D^2 = \sum_{j=1}^{N} (x_j-y_j)^2 .$$

In this example, N=94.

Show that these two similarity measures are mathematically

related by the equation

$$r_{xy} = 1 - \frac{D^2}{2NS^2}, \quad\text{for } N=94,$$

where $S^2 = S_x^2 = S_y^2$.
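
The relations in parts (a) to (c) can be checked numerically. Because both parents assign the same fixed multiset of scores, the means and variances coincide, and expanding the sum of squared differences gives r = 1 - D^2/(2 N S^2). The sketch below is an illustration only: the two "parents" are hypothetical random orderings of the forced distribution in TABLE 1, and the seed is arbitrary.

```python
import random
from fractions import Fraction

# Forced distribution from TABLE 1: score -> number of cards.
spec = {1: 5, 2: 10, 3: 18, 4: 28, 5: 18, 6: 10, 7: 5}
scores = [s for s, count in spec.items() for _ in range(count)]
N = len(scores)  # 94

def mean(values):
    return Fraction(sum(values), len(values))

def variance(values):
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / len(values)

# Hypothetical parents: the same multiset of scores in two different orders.
rng = random.Random(0)
x = scores[:]
y = scores[:]
rng.shuffle(x)
rng.shuffle(y)

x_bar, y_bar = mean(x), mean(y)
S2 = variance(x)  # equals variance(y), so S_x * S_y = S2

cov = sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / N
r = cov / S2
D2 = sum((a - b) ** 2 for a, b in zip(x, y))

print(x_bar, S2, r, D2)
```

For N=94 and this specification the denominator 2NS^2 is a constant, so r is a linear function of D^2.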

EDITORIAL NOTE: Question #6 is printed between Questions #4 and #5.


-27-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 A.M., Monday, August 2, 1971)

1.

(15 points) a) The following data are from "Increases in Divorces -

United States - 1967," Series 21, Number 20 of Vital and Health

Statistics, published by National Center for Health Statistics,

December 1970. The table below gives the age specific death

rates for married and single males in the U. S. in the period

1959-1961. (Data for divorced and widowed males are available

in the publication but not included here.)

Death Rates per 1000 Population, by Age and Marital Status:
U. S. Males, 1959-61

Age in Years    Married    Single

Total, 15+        12.8        7.5

15-19              1.3        1.3
20-24              1.3        2.2
25-29              1.3        2.8
30-34              1.6        4.1
35-39              2.3        5.9
40-44              3.8        8.8
45-49              6.3       12.5
50-54             10.6       18.1
55-59             15.9       23.5
60-64             25.0       37.4
65-69             36.6       53.2
70-74             52.4       74.9
75-79             76.8      105.0
80-84            121.3      158.7
85+              191.1      231.6

Note that, for any given age category, the death rate for

married males never exceeds the death rate for single males.

However, the overall death rate for married males (12.8) is

70% larger than the overall death rate for single males (7.5).


-28-

(Question 1, continued)

Explain mathematically how this can happen, and give the

particular reason why it happens in this set of data.

(10 points) b) In an article on patients with coronary occlusion, the following

data were presented for 330 patients who had had pre-existing

anginal symptoms.

Distribution of time between onset of symptoms and

first attack of coronary occlusion

Time            Number of patients

1 day                    4
2-6 days                12
1-4 weeks               20
1-2 months              28
3-6 months              54
7-12 months             45
1-2 years               50
2-3 years               23
3-5 years               38
5-10 years              36
10+ years               20

Total                  330

Comment on the author's remark: "Occlusion occurs most frequently

between 3 months and 2 years after onset of anginal

symptoms".


-29-

2. Suppose you have designed a before-after experiment with control group.

In other words, you have matched two groups, pair by pair, and have before

and after measures on both groups. Make use of the t test to test for the

effectiveness of your experimental variable

(5 points) a) using only the "after" scores, ignoring the "before" scores,

(5 points) b) using the "before" and "after" scores of the experimental

group only, and

(10 points) c) using all four sets of scores.

(5 points) Contrast the advantages and disadvantages of methods (a) and (b).

What are the advantages of (c) over both (a) and (b)?

        Control group       Experimental group
Pair    Before    After     Before    After

A         72       75         66       77
B         61       60         61       65
C         48       37         43       49
D         55       64         55       53
E         81       76         76       91
F         50       59         52       68
G         42       49         40       51
H         64       55         65       74
I         77       75         67       79
J         69       78         64       63

Note. Do not perform any computations but indicate what needs to be

calculated for each of (a), (b), and (c).


-30-

3. Assume that for a population of animals you obtain the following mortality

statistics per 1000. All animals who die, do so at the anniversary of

their birth.

Age (x)            0     1     2     3     4     5
Deaths at age x    0   100   150   200   250   300

Calculate

(10 points) a) The life expectancy.

(5 points) b) The mean age of the stationary population.

(5 points) c) The probability that an animal approaching its second birthday

will survive until age 4.

(5 points) d) The life table death rate.


-31-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 A.M., Saturday, April 8, 1972)

1. The following table gives the number of live births, infant deaths

and infant death rates for the years 1965-1969 for Barry County,

Michigan.

Year                            1965     66     67     68     69

Live Births                      594    595    626    626    626
Infant Deaths                     14     17      5      9     15
Infant Death Rate (Per 1000)    23.6   28.6    8.0   14.4   24.0

a) Comment briefly (not more than 5-10 lines) on the variability

observed in these rates.

b) Find a 95% confidence interval for the 1969 infant death rate.

c) If one assumes that the underlying infant death rate is constant

over the period covered by this table, find a shorter 95%

confidence interval for the infant death rate.
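
Parts (b) and (c) can be sketched with a normal approximation to the binomial distribution of infant deaths among live births. This is one reasonable approach rather than the examiners' intended method (a Poisson or exact binomial interval would also be defensible with counts this small); the function name `rate_ci` is hypothetical.

```python
from math import sqrt

births = [594, 595, 626, 626, 626]   # 1965-1969, from the table
deaths = [14, 17, 5, 9, 15]
Z_95 = 1.96                          # two-sided 95% normal quantile

def rate_ci(d, n, z=Z_95):
    """Return (lower, estimate, upper) per 1000 live births,
    using the normal approximation to a binomial proportion."""
    p = d / n
    half_width = z * sqrt(p * (1 - p) / n)
    return 1000 * (p - half_width), 1000 * p, 1000 * (p + half_width)

# b) 1969 alone.
ci_1969 = rate_ci(deaths[-1], births[-1])

# c) Constant underlying rate: pool all five years.
ci_pooled = rate_ci(sum(deaths), sum(births))

print(ci_1969)
print(ci_pooled)
```

Pooling multiplies the number of births by roughly five, so the pooled interval is narrower, as part (c) anticipates.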


-32-

2. A certain manufacturing company claims that the average lifetime of

its electric lights is 750 hours. A bulb is defective if it lasts

less than 500 hours.

a) In a random sample of 30 light bulbs, three were observed to be

defective. Is this sufficient evidence to dispute the manufacturer's

claim that no more than 5% of the bulbs are defective?

b) In this same sample of 30, twelve bulbs burned longer than 750

hours. Under suitable assumptions (which you will state in your

answer) test whether these data support the claim of an average

life of 750 hours.
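
Both parts can be approached with exact binomial tail probabilities. The sketch below is an illustration, not the official solution. For part (b) it assumes a symmetric lifetime distribution, so that the mean equals the median and a sign test applies; that assumption is introduced here and is not stated in the question.

```python
from math import comb

def binom_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

n = 30

# a) H0: proportion defective = 0.05. One-sided p-value P(X >= 3).
p_value_defective = 1 - sum(binom_pmf(k, n, 0.05) for k in range(3))

# b) Sign test: if the median lifetime is 750 hours, the number of bulbs
#    exceeding 750 hours is Binomial(30, 1/2). Observed count: 12.
p_lower_tail = sum(binom_pmf(k, n, 0.5) for k in range(13))   # P(X <= 12)
p_value_sign = min(1.0, 2 * p_lower_tail)                     # two-sided

print(round(p_value_defective, 4), round(p_lower_tail, 4), round(p_value_sign, 4))
```

Neither p-value falls below the conventional 0.05 level under these assumptions.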

3. Let the capacity C of an environmental engineering system and the

demand D upon it be random variables. The quantity P = k(C-D),

where k is a known positive constant, is a measure of the performance

of the system, whether it be "inadequate" (P<0) or "over-designed"

(P large and positive).

a) Find the mean and variance of P in terms of the means and

variances of C and D and the correlation coefficient ρ between

C and D.

b) Express the probability that the performance of a system is

"inadequate" in terms of a probability statement concerning C

and D.

c) For a particular system, suppose that C and D are normal random

variables with E(C) = 12, Var(C) = 9, E(D) = 2, Var(D) = 16, and

ρ = 0. Find the probability of "inadequate" performance for this

system.
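
For part (c), performance is inadequate exactly when C - D < 0, since k is positive, and C - D is normal when C and D are jointly normal. A minimal numerical sketch using the values given in the question follows; the variable names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

mean_c, var_c = 12, 9
mean_d, var_d = 2, 16
rho = 0.0   # correlation between C and D

# C - D has mean E(C) - E(D) and variance
# Var(C) + Var(D) - 2 * rho * sd(C) * sd(D).
mean_diff = mean_c - mean_d
var_diff = var_c + var_d - 2 * rho * sqrt(var_c) * sqrt(var_d)

# "Inadequate" means P = k(C - D) < 0, i.e. C - D < 0, for any k > 0.
p_inadequate = NormalDist(mean_diff, sqrt(var_diff)).cdf(0)

print(round(p_inadequate, 4))
```

The constant k does not affect the answer because it only rescales P.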

4. Discuss the following quotation from A. J. Coale:

"We find at the end, then, that although the birth rate determines how old a population is, the death rate determines what the average birth rate in the long run must be. If prolonged life produces by its direct effects a younger population, it is nevertheless compatible only with an older population."


-33-

5. Let $X_{ik}$, k=1,2,...,K denote K test scores for person i, i=1,...,N.

We can define a distance $D_{ij}$ between person i and person j by the

formula

$$D_{ij}^2 = \sum_{k=1}^{K} (X_{ik}-X_{jk})^2 .$$

In some instances, it may be desirable to first remove each individual's

elevation from his scores before computing the distance

between individuals. That is, $X_{ik}$ is replaced by $X'_{ik} = X_{ik}-\bar{X}_{i.}$, where

$$\bar{X}_{i.} = \frac{1}{K}\sum_{k=1}^{K} X_{ik} .$$

The distance formula then becomes

$$D'^{\,2}_{ij} = \sum_{k=1}^{K} (X'_{ik}-X'_{jk})^2 .$$

In other instances, it may be desirable to remove both an individual's

elevation and also his scatter before computing distances.

That is, $X_{ik}$ is replaced by $X''_{ik} = X'_{ik}/S_i$, where

$$S_i = \Big[\sum_{k=1}^{K} (X_{ik}-\bar{X}_{i.})^2\Big]^{1/2},$$

and distances are computed as

$$D''^{\,2}_{ij} = \sum_{k=1}^{K} (X''_{ik}-X''_{jk})^2 .$$

Show that the following relationships hold among the different

distance measures:

a) $D'^{\,2}_{ij} = D_{ij}^2 - K(\bar{X}_{i.}-\bar{X}_{j.})^2$

b) $D''^{\,2}_{ij} = \dfrac{D'^{\,2}_{ij} - (S_i-S_j)^2}{S_i S_j}$


-34-

6. This question pertains to the job setup which follows.

a) 1. What is the jobname?

2. How much core storage will be allocated to this job?

3. Name and briefly describe the programs or procedures used in this job.

b) In STEP1

4. What is the DDNAME of the output data set?

5. What does the '*' in the //SYSUT1 DD * card indicate?

c) In STEP2

6. What does the 'A' in the //SYSOUT DD card refer to?

7. What does '*.STEP1.SYSUT2' do?

8. What does 'DISP=(OLD,PASS)' mean?

9. What is the purpose of the cards 'SORTWK01' to 'SORTWK06'?

10. What are the 'SORT' and 'END' cards called?

11. What is the maximum number of records that can be processed?

d) In STEP3

12. Where is the input data?

13. In which positions of the record would variable 'Y' be found?

14. How many subfiles are there and how many cases are in each?

15. What SPSS procedure(s) is being used?


-35-

//BASIC JOB UNC.B.E1234,'B-JONES',MSGLEVEL=1,REGION=100K T=1,P=50

//STEP1 EXEC PGM=IEBGENER

//SYSPRINT DD SYSOUT=A

//SYSIN DD DUMMY

//SYSUT2 DD DSN=UNC.B.E1234.JONES.DATA,DISP=(OLD,PASS),UNIT=DISK

//SYSUT1 DD *

10014 3105750 1105 45.9

10~21 893428 875 26.6

10033 750902 600 31.4

10M2 156541lf 1004 35.2

10"j53 2043917 1235 40.0

10061 909100 910 10.7

10073 1998433 1000 30.8

10083 25~5488 2000 41.6

10092 17WnC'lfi 11)71) 1? 1

10Hi2 987541 1010 15.5

/*

//STEP2 EXEC PGM=SORT

//SYSOUT DD SYSOUT=A

//SORTLIB DD DSN=SYS1.SORTLIB,DISP=SHR

//SORTIN DD DSN=*.STEP1.SYSUT2,DISP=OLD,UNIT=DISK

//SORTOUT DD DSN=UNC.B.E1234.JONES.SORTDATA,DISP=(OLD,PASS),UNIT=DISK

//SORTWK01 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)

//SORTWK02 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)

//SORTWK03 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)

//SORTWK04 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)

//SORTWK05 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)

//SORTWK06 DD UNIT=DISK,SPACE=(TRK,25,,CONTIG)


-36-

//SYSIN DD *

SORT FIELDS=(5,1,CH,A),SIZE=10

END

/*

//STEP3 EXEC SPSS

//G.TAPEIN DD DSN=*.STEP2.SORTOUT,DISP=OLD,UNIT=DISK

//G.SYSIN DD *

VARIABLE LIST

SUBFILE LIST

INPUT MEDIUM

# OF CASES

INPUT FORMAT

VAR LABELS

PROCESS SBFILES

CONDESCRIPTIVE

OPTIONS

STATISTICS

READ INPUT DATA

PROCESS SBFILES

CONDESCRIPTIVE

STATISTICS

FINISH

END

/*

//

NE, NW, SE, SW

DISK

X,POPULATION/Y,DENSITY/Z,% POVERTY

ALL

X,Z

2

ALL

ALL


-37-

MASTERS EXAMINATION

May 12, 1972 9:00 A.M.

WORK THREE PROBLEMS

1. a) State the assumptions of analysis of variance

b) Complete the following ANOVA table and answer the questions given

after the table.

Source

Between

Within

Total

SS

102

df

4

15

MS

20

F

(i) What are your conclusions?

(ii) How many treatments were there?

(iii) Were there equal numbers of observations in each treatment?

2. A physician observes the systolic blood pressure (SBP) of a patient and

makes further tests if SBP > 160. Assume that in healthy people SBP is

normally distributed with μ = 120 and σ = 20. In diseased persons it

is normally distributed with μ = 190 and σ = 30.

a) What is the probability that the physician makes further tests

on a healthy person?

b) What is the probability that a diseased person is not detected

by this procedure?

c) If 10% of the population is diseased, what is the probability

that a person detected by this procedure is actually diseased?
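
The three parts reduce to two normal tail areas and one application of Bayes' theorem. A minimal sketch using the parameters given in the question follows; the variable names are hypothetical and the code is offered as a check, not as the examiners' solution.

```python
from statistics import NormalDist

healthy = NormalDist(mu=120, sigma=20)    # SBP in healthy people
diseased = NormalDist(mu=190, sigma=30)   # SBP in diseased persons
cutoff = 160                              # further tests if SBP > 160
prevalence = 0.10                         # part (c)

# a) Healthy person sent for further tests.
p_false_positive = 1 - healthy.cdf(cutoff)

# b) Diseased person not detected.
p_false_negative = diseased.cdf(cutoff)

# c) Bayes' theorem: P(diseased | SBP > 160).
p_detected_if_diseased = 1 - p_false_negative
p_diseased_given_detected = (prevalence * p_detected_if_diseased) / (
    prevalence * p_detected_if_diseased
    + (1 - prevalence) * p_false_positive
)

print(round(p_false_positive, 4),
      round(p_false_negative, 4),
      round(p_diseased_given_detected, 4))
```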


-38-

3. Consider a population of gains in weights of swine during a 20 day period.

These weight gains approximate a normal distribution with μ = 30 and

σ² = 100. Random samples of size n = 10 were repeatedly drawn (511 times)

from this population and for each sample the sample variance, s², was

computed. The following is a tabulation of these sample variances.

class         class        class relative
boundaries    frequency    frequency

10-30            12          0.023
30-50            47          0.092
50-70            92          0.180
70-90            93          0.182
90-110           72          0.141
110-130          73          0.143
130-150          42          0.082
150-170          29          0.057
170-190          26          0.051
190-210          11          0.021
210-230           8          0.016
230-250           2          0.004
250-270           1          0.002
270-290           0          0.000
290-310           1          0.002
310-330           1          0.002
330-350           1          0.002

a. Draw a histogram of the sample variances.

b. Draw a cumulative relative frequency polygon.

c. Using the graph of the cumulative distribution, estimate the 10th,

50th, and 95th percentiles of this distribution.

d. What is the theoretical 95th percentile of the distribution of s² and

how does it compare with the estimate in part c?


-39-

4. In a sequence of four births, find

a) The probability that there are two boys and two girls.

b) The probability that at least two of the children will be girls.

c) The probability that none of the children will be girls.

d) The probability of exactly two boys given there is at least one boy.

Assume the probability of a girl is 1/2.
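
With a probability of 1/2 for each sex and independent births, all 16 birth sequences are equally likely, so each part can be checked by direct enumeration. The helper `prob` below is a hypothetical name introduced for this illustration.

```python
from fractions import Fraction
from itertools import product

# All 2**4 = 16 equally likely sequences of boys (B) and girls (G).
outcomes = list(product("BG", repeat=4))

def prob(event, given=lambda o: True):
    """P(event | given), computed by counting equally likely outcomes."""
    pool = [o for o in outcomes if given(o)]
    return Fraction(sum(1 for o in pool if event(o)), len(pool))

p_a = prob(lambda o: o.count("B") == 2)                    # two boys, two girls
p_b = prob(lambda o: o.count("G") >= 2)                    # at least two girls
p_c = prob(lambda o: o.count("G") == 0)                    # no girls
p_d = prob(lambda o: o.count("B") == 2,
           given=lambda o: o.count("B") >= 1)              # exactly 2 boys | >= 1 boy

print(p_a, p_b, p_c, p_d)
```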


-40-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 A.M., Saturday, March 31, 1973)

1. Examination of 80 men revealed 40 with hypertension and 30 with elevated

cholesterol, including 10 having both ailments. If a man is picked at

random from the group, what is the probability that he is free of both

ailments? What is the probability that he has both ailments if he has

one of them?

2. In evaluating mental health programs, some of the following research

designs might be used. Here, an O represents the process of observation

or measurement, an X represents the exposure of a group of subjects to

the experimental variable or "treatment", X's and O's on the same line

refer to the same specific subjects, while parallel rows represent

different groups of subjects. Temporal order is indicated by the

left-to-right direction.

Discuss each design in terms of how it controls or fails to control

for various sources of invalidity. Discuss where and for what purposes

random assignment can be utilized.

x

1) X

2) 01

3) X

X


-41-

3. Under constant environmental conditions, the number of bacteria Q in a

tank is known to increase proportionately to $e^{\lambda T}$, where T is the time

and the parameter $\lambda$ is the growth rate ($0<\lambda<1$). If k is the number of

bacteria in the tank at time T=0, then

$$Q = k e^{\lambda T}, \qquad Q \ge k .$$

a) If the time permitted for growth (as determined, say, by the time

required for the charge of water in which the bacteria are living

to pass through a filter) is a random variable with cumulative

distribution function $F_T(t)$, t>0, find that particular t (say, t*),

which is a function of $\lambda$, k, and q, such that

$$F_Q(q) = \Pr(Q \le q) = F_T(t^*) .$$

b) If $F_T(t) = 1 - e^{-t}$, t>0, find $f_Q(q)$, the probability density function

of Q.

c) For $F_T(t)$ as in (b), find E(Q). [HINT: You do not need to know the

answer to (b).]

d) How is E(Q) related to $E(T^r)$?


-42-

4. This question concerns the job setup which follows.

a) To what account is the cost of this job charged?

b) How much time is allowed for the job?

c) How many "steps" does the job have?

d) In general, what is a "ddname"? (Check the best answer.)

[ ] a name which is stored in every record of a data set

[ ] the symbolic name by which a program refers to a data set

[ ] the actual name of a particular data set

[ ] the name of the programmer who created the data set

e) In general, what is a "dsname"? (Check the best answer.)

[ ] a name which is stored in every record of a data set

[ ] the symbolic name by which a program refers to a data set

[ ] the actual name of a particular data set

[ ] the name of the programmer who created the data set

f) What do the ampersands ("&&" ) mean on card #4?

g) What does the asterisk ("*") mean on card #24?

h) What is the function of card #8?

i) Does card #2 specify a program or a procedure to be executed? (Indicate which.)

j) How many logical records are in the data set described by card #14?

k) How many logical records are in each physical record (block) of the data set described on cards #4 and #5?

l) In general, what does it mean to "sort" a data set?

m) With respect to sorting, what is: a "major" field?

a "minor" field?

n) Does a data set necessarily need to be in sorted order before it can be updated by a program such as the Biostatistics UPDATE II program?

(Yes or No)


-43-

o) What is the function of the "MISSING VALUES" card (#30)?

p) What is the function of the "RECODE" cards (#31-33)?

q) If the "INPUT MEDIUM" card (#27) specified "CARD" (instead of "DISK"), where (at what point in the job deck) would the data for analysis be located?

r) What task (or analysis) does the "MARGINALS" card (#35) specify?

s) What task (or analysis) does the "CROSSTABS" card (#39) specify?

t) Name another statistical package (other than SPSS).


-44-

 1 //EXSAMP JOB UNC.B.X566K,'SORANT.A-031024',M=1,T=(,59)
 2 //SRT EXEC DASORT
 3 //SORTIN DD DSN=UNC.B.X566K.SORANT.UPDT,DISP=OLD
 4 //SORTOUT DD DSN=&&SORTED,DISP=(NEW,PASS),SPACE=(6400,(2,1)),
 5 // UNIT=DISK,DCB=(RECFM=FB,LRECL=80,BLKSIZE=6400)
 6 //SYSIN DD *
 7  SORT FIELDS=(11,1,CH,D,73,8,CH,A)
 8 /*
 9 //UPD EXEC PGM=UPDATE2,REGION=110K
10 //STEPLIB DD DSN=UNC.BIOS.LIB,DISP=SHR
11 //SYSPRINT DD SYSOUT=A
12 //MASTER DD DSN=UNC.B.X566K.SORANT.OLD,DISP=OLD
13 //DETAIL DD DSN=&&SORTED,DISP=(OLD,DELETE)
14 //SYSIN DD *
15 CONTROL FIELD SPECS
16 C 11 1 -1
17 C 73 8 +1
18 END CONTROL FIELD SPECS
19 /*
20 //REPORT DD DSN=UNC.B.X566K.SORANT.NEW,DISP=(OLD,PASS),
21 // DCB=(RECFM=FB,LRECL=80,BLKSIZE=6400)
22 //TAB EXEC SPSS
23 //TAPEIN DD DSN=*.UPD.REPORT,DISP=OLD
24 //SYSIN DD *
25 RUN NAME         SAMPLE TABULATION OF EDUCATION, INCOME
26 VARIABLE LIST    GRP, ED, INC
27 INPUT MEDIUM     DISK
28 # OF CASES       ESTIMATED 200
29 INPUT FORMAT     FIXED (T11,F1.0,T21,F2.0,T31,F6.0)
30 MISSING VALUES   ED, INC (0)
31 RECODE           INC (1 THRU 4000 = 1) (4001 THRU 8000 = 2)
32                  (8001 THRU 12000 = 3) (12001 THRU 16000 = 4)
33                  (16001 THRU HIGHEST = 5)
34 SELECT IF        (GRP EQ 3)
35 MARGINALS        ED, INC
36 OPTIONS          1
37 STATISTICS       ALL
38 READ INPUT DATA
39 CROSSTABS        ED BY INC
40 FINISH
41 /*
42 //


-45-

5. The Secretary of Health in country Z ordered his statistician to determine

whether or not there was any evidence that ethnic group A had different

size families than ethnic group B. The statistician selected an independent

random sample of 25 and 30 year old women from the two groups and determined

how many children each woman had. Means and standard errors were calculated

and are shown in the following table:

Statistic                        Age (years)
and Group                        25        30        Total

Number of Women:
  A                             100       200        300
  B                             200       100        300

Mean Number of Children:
  A                             1.8       3.6        3.0
  B                             1.8       3.6        2.4

Standard Error of the Mean:
  A                           0.070     0.072       .072
  B                           0.050     0.100       .071

The statistician wrote a memorandum to the Secretary stating that:

"The mean number of children per woman is significantly higher in group A

than in group B. The 95% confidence interval for the difference is 0.6 ± 0.2."

a) Do you agree with the statistician's conclusion? Explain your answer.

b) Briefly discuss the possibility that including older women in the study

might change the results.


-46-

6. A highway-safety research investigator is interested in testing the acclaimed

success of "air-bags" in the reduction of risk of death and serious injury

to the driver in highway crashes. He feels that the air-bags would be most

effective in head-on collisions. Suppose that the risk of a vehicle being

involved in a crash during a year is 1/9. The chance of the crash being

head-on is about 1/4 and the risk of death or serious injury to the driver

in such a crash without air-bags is 1/4.

a) The number X of deaths and serious injuries from head-on collisions

during a year in a fleet of N vehicles is assumed to be a Poisson

random variable. Please comment on the basic philosophy underlying

such an assumption. Also, under this assumption, what is E(X) for a

fleet of N vehicles without air-bags?

b) It is planned to equip a fleet of N vehicles for one year with air

bags to test their effectiveness in reducing the risk of death and

serious injury to the driver. Let p, $0 \le p \le 1/4$, denote the

probability of death or serious injury for the driver in a head-on

collision involving a vehicle equipped with air-bags. Consider the

test of the null hypothesis

$$H_0: p = 1/4$$

versus

$$H_a: p < 1/4 .$$

Use the normal approximation to the Poisson distribution to determine

the fleet size N required for a test with Type I and Type II error

probabilities $\alpha = \beta = 0.05$. In particular, what fleet size is required

when p = 1/16?


-47-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 a.m., Saturday, April 13, 1974)

1. A diagnostic test for a disease D is often classified by its sensitivity

and specificity. Sensitivity is the probability that the test gives a

positive result when applied to a person who actually has D. Specificity

is the probability that the test gives a negative result when applied to

a person who is actually free of D. Of great importance to the usefulness

of such a test is the probability that a tested person actually has D if

the test gives a positive result. In order for this probability to be at

least 90%, what must the lower bound on the prevalence of D be (i.e.,

what is the smallest allowable value of P(D)) if the diagnostic test

has 99% sensitivity and 98% specificity?
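
The relationship between prevalence and the probability of disease given a positive test is Bayes' theorem. A minimal sketch, reading the garbled sensitivity as 99% and using hypothetical function names, is:

```python
def ppv(prevalence, sensitivity, specificity):
    """P(D | positive test) by Bayes' theorem."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)


def min_prevalence(sensitivity, specificity, target_ppv):
    """Smallest P(D) for which P(D | positive) reaches target_ppv.

    Obtained by setting ppv(...) equal to target_ppv and solving for P(D).
    """
    false_pos_rate = 1 - specificity
    return target_ppv * false_pos_rate / (
        sensitivity * (1 - target_ppv) + target_ppv * false_pos_rate
    )


bound = min_prevalence(sensitivity=0.99, specificity=0.98, target_ppv=0.90)
print(round(bound, 4), round(ppv(bound, 0.99, 0.98), 4))
```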


-48-

2. a. i. From a population of size 50, ten elements are drawn by simple

random sampling. Their associated values are:

15, 31, 16, 27, 22, 23, 25, 19, 20, 16.

Estimate the population mean $\mu$ and also the variance of the

sampling distribution of your estimator.

ii. A population of size 50 is stratified into two subclasses of

respective sizes 35 and 15. Five elements are drawn from

each stratum by stratified random sampling, yielding:

Stratum I: 32, 40, 39, 42, 35

Stratum II: 60, 69, 64, 55, 63

Estimate the population mean and also the variance of the

sampling distribution of your estimator. Is this proportional

allocation?

b. Discuss the advantages and disadvantages of

i. simple random sampling

ii. stratified random sampling

iii. cluster random sampling

in terms of the interplay between precision and cost.

3. For the exponential distribution having probability density function

$f(x) = \frac{1}{\mu} e^{-x/\mu}, \quad x > 0,$

the maximum likelihood estimator of $\mu$ is the mean $\bar{X}$ of a random sample

of size n from the population f(x). When n is reasonably large, the

distribution of $\bar{X}$ can be satisfactorily approximated by a normal distribution.

Using this approximation, derive a 100(1-$\alpha$)% confidence

interval for $\mu$. Apply the result to obtain a 95% confidence interval

when a sample of 49 yields $\bar{X} = 11.52$.
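
One possible form of the interval is sketched below. It assumes the pivot $\sqrt{n}(\bar{X} - \mu)/\mu$ is treated as standard normal (the exponential standard deviation equals its mean) and is then inverted for $\mu$; substituting $\bar{X}$ for $\mu$ in the standard error would give a slightly different interval. The function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def exponential_mean_ci(xbar, n, conf=0.95):
    """Large-sample interval for the mean of an exponential distribution.

    Assumption: sqrt(n) * (xbar - mu) / mu is approximately N(0, 1);
    solving |pivot| <= z for mu gives the two end points (needs z < sqrt(n)).
    """
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    return xbar / (1 + z / sqrt(n)), xbar / (1 - z / sqrt(n))


lower, upper = exponential_mean_ci(xbar=11.52, n=49)
print(round(lower, 2), round(upper, 2))
```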


-49-

4. Researchers believe that the prevalence (Y) of byssinosis in

workers in textile manufacturing plants is linearly related to

the mean daily cotton dust level (X) in mg/m³.

a) Under the assumption that zero cotton dust level implies

zero prevalence, write down an appropriate statistical

model expressing Y as a straight-line function of X.

Make sure to define precisely all terms in the model

you suggest.

b) Given n pairs of data points $(X_i, Y_i)$, i = 1, 2, ..., n, provide

explicit expressions for the least squares estimates of all

parameters associated with your model.

c) Using the above results, determine whether there is evidence

of a significant relationship between X and Y based on the

following set of data:

 Y    X
 2    1
 3    2
 7    3
 7    4
10    5


-50-

5. The research designs often used in evaluating mental health programs

may be open to several sources of error, including the following:

a. History - extraneous events may occur simultaneously with the

treatment.

b. Maturation - the effect may be due to the passage of time alone.

c. Testing - a "before" measure may stimulate change regardless of

treatment.

d. Regression to Mean - individuals with extreme scores at one time

tend to score nearer the mean at another.

e. Selection - different groups may be composed of individuals with

differing degrees of susceptibility to treatment.

Give examples of various research designs which do and do not control

for each of these sources of error. Be sure to discuss your choice

of examples in detail.


-51-

6. The following table shows the distribution of the number of hospital out-

patient attendances for males in a small town during a one-year period.

For example, the table tells us that out of 35,008 males in the town 734

had 3 outpatient attendances during the year.

Distribution of the number of hospital outpatient attendances for males over a one-year period

                NUMBER OF OUTPATIENT ATTENDANCES
     0        1        2       3      ≥4     TOTAL     X̄      S²      S
  27,411    4,339    1,539    734    985    35,008    0.43    1.26    1.12

It is desirable to develop a distributional form to describe this data

as a first step in developing a mathematical model describing the delivery

of health services. If we assume for each individual that

i) successive outpatient attendances occur independently,

ii) the probability of a given number of attendances is the same

for all time intervals of the same length, and

iii) only one alternative occurs at a time,

then we have the basic assumptions for a Poisson Process. Under these

assumptions, the distribution of the number of attendances for a fixed

time period of one year will be Poisson; i.e.,

$P\{i \text{ attendances for a given individual in one year}\} = e^{-\lambda} \lambda^i / i!$

The parameter $\lambda$ will be a measure of the tendency for an individual to

use the outpatient service. If we further assume that all individuals

have the same $\lambda$, then the proportion of persons in the population with

i outpatient attendances will be given by the Poisson distribution.


-52-

Investigate whether the Poisson distribution provides a suitable

model for the above data. Have you any suggestions for an improved

model?

It is suggested that you estimate the parameter of the Poisson

distribution by the method of moments. Note that

$e^{-0.43} = 0.651$.

For ease of computation, if

$P_i = e^{-\lambda} \lambda^i / i!$,

then

$P_0 = e^{-\lambda}$,

$P_1 = \lambda P_0$,

$P_2 = \frac{1}{2} \lambda P_1$,

$P_3 = \frac{1}{3} \lambda P_2$.
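
The recursion above lends itself to a short computation of expected frequencies. The sketch below assumes the last column of the table means "4 or more attendances" and takes the method-of-moments estimate of $\lambda$ to be the sample mean 0.43; the variable names are illustrative only.

```python
from math import exp

# Observed counts for 0, 1, 2, 3 and (assumed) 4-or-more attendances.
observed = [27411, 4339, 1539, 734, 985]
n = sum(observed)
lam = 0.43  # method of moments: lambda estimated by the sample mean

# P_0 = exp(-lambda), then P_i = (lambda / i) * P_{i-1} for i = 1, 2, 3.
probs = [exp(-lam)]
for i in range(1, 4):
    probs.append(lam * probs[-1] / i)
probs.append(1 - sum(probs))  # remaining tail probability P(X >= 4)

expected = [n * p for p in probs]
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

for o, e in zip(observed, expected):
    print(o, round(e, 1))
print("chi-square:", round(chi_square, 1))
```

With five cells and one estimated parameter, the statistic would be referred to a chi-square distribution with 3 degrees of freedom. The sample variance (1.26) being much larger than the mean (0.43) already hints at how the comparison turns out.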


-53-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 a.m., Saturday, April 12, 1975)

1. Suppose that a proportion p of a large population has a particular disease,

which can be detected by a blood test. Suppose that a random sample of n

people is to be tested. This testing can be done in two ways:

i) Each of the n people is tested separately.

ii) The blood samples of the n people are mixed together and analyzed. If

the test is negative, all of them are healthy (that is, just this one

test is needed). If the test is positive, each of the n persons must

be tested separately (that is, totally (n+1) tests are needed).

a) For procedure (ii), give the probability distribution of X, the number

of tests required.

b) Show that

$E(X) = (n+1) - n(1-p)^n$.

c) For n = 2, show that the difference in the number of tests required by

procedures (i) and (ii) on the average, that is

$2 - E(X)$,

is positive for $p < 1 - 1/\sqrt{2}$.
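
A small numerical check of parts (b) and (c) can be sketched as follows, assuming the n individuals are independent so that a pooled sample is negative with probability $(1-p)^n$. The function name `expected_tests` is hypothetical.

```python
from math import sqrt


def expected_tests(n, p):
    """E(X) for the pooled procedure: 1 test if the pool is negative, n + 1 otherwise."""
    q = (1 - p) ** n  # probability that all n people are disease-free
    return 1 * q + (n + 1) * (1 - q)


threshold = 1 - 1 / sqrt(2)  # claimed break-even prevalence for n = 2

for p in (0.10, threshold, 0.50):
    print(round(p, 4), round(2 - expected_tests(2, p), 4))
```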


-54-

2. Assume that you have been asked to serve as statistician for a Division of

Preventive Health Services in a health department for a state with a population

of approximately 6 million persons. The Division has responsibility

for program development, operation, and evaluation for chronic disease,

communicable disease, dental health, maternal and child health, and mental

health. Specific responsibilities include identification of health needs

in various population groups, study of factors associated with needs, determination

of available manpower and facilities to help meet them, and provision

and evaluation of needed services. The Division has been asked to

include specific consideration of the growing hazards of our physical environment

for the health of the people.

Your first assignment is to prepare a source book containing readily

available data, and references to other data sources (including tapes, punch

cards, or tabulations) that might be used in program development, operation,

and evaluation.

List some of the major sources and types of data that you might

include in this source book, and discuss some of the ways in which this information

might be of use to the Division.


-55-

3. Nine animals were each starved for six days, each at a different humidity.

The nine humidities (X) ranged from 0% to 93%, and weight loss (Y) was the

response of interest. For the n=9 data points, the following quantities

were calculated:

$\sum_{i=1}^{9} X_i = 453.5$, $\sum_{i=1}^{9} Y_i = 54.2$, $\sum_{i=1}^{9} (X_i - \bar{X})(Y_i - \bar{Y}) = -441.82$, $\sum_{i=1}^{9} (X_i - \bar{X})^2 = 8{,}301.39$,

Assuming a model of the form

$Y = \beta_0 + \beta_1 X + \varepsilon$,

a) Find the least squares estimates $\hat{\beta}_0$ and $\hat{\beta}_1$ of $\beta_0$ and $\beta_1$.

b) Construct an appropriate analysis of variance table.

c) Obtain and interpret a 95% confidence interval for the true

average value of Y when X = 25.

4. In a certain measuring process, the distribution of measurements is normal

with standard deviation 8. Because the mean has a tendency to shift upward

from time to time, quality control personnel make periodic checks by taking

random samples and comparing sample means against a prescribed standard.

Each such check is a test of the hypothesis that the process mean is 100

against the alternative hypothesis that the process mean is greater than

100; the sample is composed of 16 measurements, and the 5% level of significance

is used.

(a) Specify the quality control decision rule in terms of the sample mean.

(b) What is the probability of a correct decision if the process mean is truly 100?

(c) What is the probability of a correct decision if the process mean is truly 105?
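
The three parts can be explored numerically with the standard library's normal distribution. This is only a sketch of one reading of the problem: a one-sided z-test with known standard deviation 8 and n = 16, where a "correct decision" means retaining the hypothesis when the mean is 100 and rejecting it when the mean is 105.

```python
from math import sqrt
from statistics import NormalDist

sigma, n, mu0, alpha = 8, 16, 100, 0.05
se = sigma / sqrt(n)  # standard error of the sample mean

# Decision rule: reject "mean = 100" when the sample mean exceeds the cutoff.
cutoff = mu0 + NormalDist().inv_cdf(1 - alpha) * se


def prob_reject(true_mean):
    """Probability that the sample mean exceeds the cutoff for a given true mean."""
    return 1 - NormalDist(true_mean, se).cdf(cutoff)


print("cutoff:", round(cutoff, 2))
print("P(correct | mean = 100):", round(1 - prob_reject(100), 4))
print("P(correct | mean = 105):", round(prob_reject(105), 4))
```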


-56-

5. Goal Attainment Scaling is a model which is currently in use in the evaluation

of mental health programs. A patient entering a mental health service will be

scored at follow-up on k variables $X_1, X_2, \ldots, X_k$, where k may be different for

different patients. The variables $X_1, X_2, \ldots, X_k$ are constructed and scaled so

that each one can be considered to have a normal distribution with mean zero

and variance one. An overall score is determined for each patient by the formula

$T = 50 + 10 \, \dfrac{\sum_{i=1}^{k} w_i X_i}{\left[ \operatorname{Var}\left( \sum_{i=1}^{k} w_i X_i \right) \right]^{1/2}}$

where $w_1, w_2, \ldots, w_k$ are constants reflecting the relative importance of

$X_1, X_2, \ldots, X_k$.

a) Show that $E\left( \sum_{i=1}^{k} w_i X_i \right) = 0$.

b) What are E(T) and Var(T), and what is the distribution of T?

c) If $\rho_{ij}$ is the correlation between $X_i$ and $X_j$ and if $\rho_{ij} = \rho$ for all $i \neq j$,

show that T can be written in the form

$T = 50 + 10 \, \dfrac{\sum_{i=1}^{k} w_i X_i}{\left[ (1-\rho) \sum_{i=1}^{k} w_i^2 + \rho \left( \sum_{i=1}^{k} w_i \right)^2 \right]^{1/2}}$


-57-

6. A group of 100 rabbits is being used in a nutrition study. A pre-study

weight is recorded for each rabbit. The average of these weights is 3.1

pounds. After two months, the experimenter wants to obtain a rough approximation

to the average weight of the 100 rabbits. A simple random sample

of n=lO rabbits is selected and each rabbit is weighed. The pre-study

weights and current weights for the sample are presented below:

RABBIT NUMBER              1     2     3     4     5     6     7     8     9    10
PRE-STUDY WT. (X_i)      3.2   3.0   2.9   2.8   2.8   3.1   3.0   3.2   2.9   2.8
CURRENT WT. (Y_i)        4.1   4.0   4.1   3.9   3.7   4.1   4.2   4.1   3.9   3.8
DIFF. d_i = Y_i - X_i    0.9   1.0   1.2   1.1   0.9   1.0   1.2   0.9   1.0   1.0

$\sum_{i=1}^{10} Y_i = 39.9$, $\sum_{i=1}^{10} Y_i^2 = 159.43$, $\sum_{i=1}^{10} X_i = 29.7$,

$\sum_{i=1}^{10} X_i^2 = 88.43$, $\sum_{i=1}^{10} X_i Y_i = 118.67$, $\sum_{i=1}^{10} d_i = 10.2$

i) Using the $d_i$'s, estimate the average current weight of the population

of 100 rabbits in the study and place a bound on the error of estimation.

ii) Discuss an alternative method to that used in (i) to estimate the

average current weight of the 100 rabbits (still using information on

pre-study weight), and then comment on the relative merits of the

different methods.


-58-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9-12 a.m., Saturday, April 3, 1976)

1. A study was conducted to determine if method A of teaching reading to

first grade students was superior to the presently used method, method B.

Twelve pairs of students were identified and one member of each pair was

taught using method A while the second member was taught by method B. After

three months of training, a reading test was given to the students. For 9 of

the 12 pairs, the student taught by method A scored higher than his counterpart

taught by method B.

(4) a) State the null and alternative hypotheses for the purposes of this

study.

(5) b) Compute the exact probability of the occurrence of this result or

worse under the null hypothesis.

(8) c) If there had been 64 pairs, and the results had been 48 students

taught by method A scored higher, what conclusion would you draw for

$\alpha = .05$, using large sample theory?

(8) d) If you had been asked prior to the experiment "How many pairs of

students are needed for this study?", what additional information would

you need?
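
Parts (b) and (c) amount to a sign test. The sketch below assumes that "this result or worse" means 9 or more of the 12 pairs favoring method A, that ties are absent, and that under the null hypothesis each pair favors A with probability 1/2; part (c) uses the normal approximation without a continuity correction. The helper name `upper_tail` is hypothetical.

```python
from math import comb, sqrt
from statistics import NormalDist


def upper_tail(n, k, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, j) * p ** j * (1 - p) ** (n - j) for j in range(k, n + 1))


# (b) exact one-sided probability of 9 or more successes in 12 pairs
exact_p = upper_tail(12, 9)

# (c) large-sample test: 48 of 64 pairs favor method A
z = (48 - 64 * 0.5) / sqrt(64 * 0.5 * 0.5)
approx_p = 1 - NormalDist().cdf(z)

print(round(exact_p, 4), z, approx_p < 0.05)
```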


-59-

2. Two dentists A and B made a survey of the condition of the teeth of 200

children in a village. Dentist A selects a simple random sample of 20

children and counts the number of decayed teeth for each child, with the

following results:

Child number (i):                 1   2   3   4   5   6   7   8   9  10
Number of decayed teeth (y_i):    2   0   1   2   0   0   1   0   3   4

Child number (i):                11  12  13  14  15  16  17  18  19  20
Number of decayed teeth (y_i):   10   0   0   5   0   3   1   0   1   9

$\sum_{i=1}^{20} y_i = 42$, $\sum_{i=1}^{20} y_i^2 = 252$

Dentist B, using the same dental techniques, examines all 200 children,

recording merely those who have no decayed teeth. He finds 60 children

with no decayed teeth.

(10) a) Estimate the total number of decayed teeth in the village children

using Dentist A's results. Is your estimate unbiased? Provide an

estimate of the variance of this estimate of the total number of decayed

teeth.

(10) b) Consider the above random sample as a sample of 8 from the 60

children Dentist B found with no decayed teeth and a sample of 12

from the 140 children with one or more decayed teeth. Provide an

alternate estimate of the total number of decayed teeth in the village

children. Is this estimate unbiased? Provide an estimate of the

variance of this estimate.

(5) c) Which procedure is more precise? How do you explain this?


-60-

3. A random sample of size n is taken from a population having the

exponential distribution defined by the probability density function

$f(x) = \frac{1}{\mu} e^{-x/\mu}, \quad x > 0,$

wherein the mean is $\mu$ and variance is $\mu^2$.

(20) a) For the large-sample case, apply the Central Limit Theorem to

show how an approximate confidence interval (confidence coefficient

$1 - \alpha$) can be based on the sample mean as the only sample parameter

required.

(5) b) What is the specific 90% confidence interval of this kind in the

case where a random sample of 64 observations yields $\bar{x} = 24.12$?

4. An executive is willing to hire a secretary who has applied for a

position unless the applicant averages more than one error per typed

page. A random sample of five pages is selected from some material

typed by this secretary and the errors per page are: 3, 4, 3, 1, 2.

(10) a) Assuming that the number of errors per page has a Poisson

distribution with parameter $\lambda$, give an expression for the exact

p-value for a test of $H_0: \lambda = 1$ versus the alternative $H_a: \lambda > 1$.

(10) b) Suppose that the executive looks at a random sample of 225

pages of the secretary's work and finds 252 errors. Approximate

the p-value for the same test as in part a.

(5) c) With $\alpha = 0.05$, what is the executive's decision based on the

results in part b?

5. Briefly describe a typical research data management system for a project

with sufficient data flow to warrant computerized data management.

Be sure your description includes each major component of such a system,

including all major data processing phases from data collection through

statistical analysis, and includes brief descriptions of measures taken

to assure data quality, data security, and data confidentiality.


-61-

6. Consider the following descriptions of two populations, A and B.

                         A                               B
Age Group    Population Size    Deaths     Population Size    Deaths
             (July 1, 1975)     (1975)     (July 1, 1975)     (1975)
0-19               32              4             144             18
20-39              27              5              89             19
40-59              22              6              51             16
60+                19              8              17              8
Total             100             23             301             61

(5) (a) Calculate the crude death rate, and the death rates specific to

the enumerated age groups for populations A and B.

(5) (b) Calculate the directly age-standardized death rate for

population B, using population A as the standard.

(5) (c) Explain the reasons for the change in the death rates for

population B obtained under (a) and (b).

(5) (d) Assume that no information is available on deaths by age in population

B. Explain a procedure to calculate a standardized death rate

for population B, using population A as the standard.

(5) (e) Write a short note on the merits and demerits of standardized death

rates.
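
The rate calculations in parts (a), (b) and (d) can be sketched as follows. The function names are hypothetical, and the indirect method shown for part (d) (standardized mortality ratio times the crude rate of the standard population) is one common convention, assumed here rather than taken from the examination.

```python
pop_a, deaths_a = [32, 27, 22, 19], [4, 5, 6, 8]
pop_b, deaths_b = [144, 89, 51, 17], [18, 19, 16, 8]


def crude_rate(pop, deaths):
    return sum(deaths) / sum(pop)


def specific_rates(pop, deaths):
    return [d / p for p, d in zip(pop, deaths)]


def direct_rate(study_pop, study_deaths, standard_pop):
    """Apply the study population's age-specific rates to the standard age structure."""
    rates = specific_rates(study_pop, study_deaths)
    return sum(r * w for r, w in zip(rates, standard_pop)) / sum(standard_pop)


def indirect_rate(study_pop, study_total_deaths, standard_pop, standard_deaths):
    """Needs only total deaths in the study population (no deaths by age)."""
    std_rates = specific_rates(standard_pop, standard_deaths)
    expected_deaths = sum(r * n for r, n in zip(std_rates, study_pop))
    smr = study_total_deaths / expected_deaths
    return smr * crude_rate(standard_pop, standard_deaths)


print("crude A, B:", crude_rate(pop_a, deaths_a), round(crude_rate(pop_b, deaths_b), 4))
print("direct B on A:", round(direct_rate(pop_b, deaths_b, pop_a), 4))
print("indirect B on A:", round(indirect_rate(pop_b, sum(deaths_b), pop_a, deaths_a), 4))
```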

EDITORIAL NOTE: Two tables were appended to this examination:

a) Binomial Coefficients C(N,X) for N=1(1)20, X=1(1)10

b) Standard normal distribution function Φ(x) for x=0(.05)3


-62-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9:00 a.m.-l:OO p.m., Saturday, April 2, 1977)

1. (a) Massachusetts and Mississippi each claimed a lower tuberculosis death rate than the other in 1950. Using the following data, show how both claims can be substantiated, and explain the paradox.

Population and Number of Tuberculosis Deaths, by Color, in Massachusetts and Mississippi, 1950

                       Total         White       Nonwhite
Massachusetts
  Population         4,690,514     4,611,503       79,001
  TB deaths*             1,005           942           63
Mississippi
  Population         2,178,914     1,188,632      990,282
  TB deaths*               573           191          382

*Deaths from all forms of tuberculosis

(b) Given that the United States had a population of 151,325,798 in 1950, of which 89.31% was White, calculate suitably standardized tuberculosis mortality rates for Massachusetts and Mississippi.

2. For each of the following distributions: write down the probability function, specify a model, including all necessary assumptions, which would produce data having the distribution, and give an illustrative example from the biological or health sciences.

(a) Binomial

(b) Poisson

(c) Multinomial

(d) Normal

Also, if Var(X) = 50, Var(X + Y) = 80 and Var(X - Y) = 40, find

(e) Var(Y) and Cov (X,Y)
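
Reading the first given variance as Var(X) = 50, part (e) follows from the identity Var(X ± Y) = Var(X) + Var(Y) ± 2 Cov(X, Y). A worked sketch under that reading:

```latex
\begin{align*}
\operatorname{Var}(X+Y) + \operatorname{Var}(X-Y) &= 2\operatorname{Var}(X) + 2\operatorname{Var}(Y)
  \;\Rightarrow\; 80 + 40 = 100 + 2\operatorname{Var}(Y)
  \;\Rightarrow\; \operatorname{Var}(Y) = 10, \\
\operatorname{Var}(X+Y) - \operatorname{Var}(X-Y) &= 4\operatorname{Cov}(X,Y)
  \;\Rightarrow\; 80 - 40 = 4\operatorname{Cov}(X,Y)
  \;\Rightarrow\; \operatorname{Cov}(X,Y) = 10.
\end{align*}
```

These values are admissible, since $\operatorname{Cov}(X,Y)^2 = 100 \le \operatorname{Var}(X)\operatorname{Var}(Y) = 500$.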

3. (a) Define simple random sampling with and without replacement. Compare and contrast, briefly. Explain requisite formulae for estimates produced and standard errors.

(b) What is stratified random sampling and in what circumstances is it a preferred procedure? Explain requisite formulae for estimates produced and standard errors.


-63-

4. It has been suggested that a linear relationship exists between the amount of mail handled in a post office and the man-hours required to handle it. In order to test out this hypothesis the following data were collected:

Four-week period,     Pieces of mail handled (X)    Man-hours used (Y)
fiscal year 1962      (in millions)                 (in thousands)
        1                     157                          572
        2                     161                          570
        3                     169                          599
        4                     186                          645
        5                     183                          645
        6                     184                          671
        7                     268                         1053
        8                     184                          655
        9                     180                          637
       10                     188                          667
       11                     184                          656
       12                     182                          640
       13                     179                          609

Assume the following model:

$Y = \beta_0 + \beta_1 X + \varepsilon$,

where X is a fixed (independent) variate, and $\varepsilon$ is $N(0, \sigma^2)$.

Calculations

$\sum X = 2405$, $\sum X^2 = 453517$, $\bar{X} = \sum X / N = 185$

$\sum Y = 8619$, $\sum Y^2 = 5892485$, $\bar{Y} = \sum Y / N = 663$

$\sum XY = 1633249$

$\sum X^2 - \frac{(\sum X)^2}{N} = 8592$, $\sum XY - \frac{(\sum X)(\sum Y)}{N} = 38734$, $\sum Y^2 - \frac{(\sum Y)^2}{N} = 178088$.

(a) Calculate the ANOVA table for the assumed model using the above figures.

(b) Assuming all the above calculations are correct, test the hypothesis that $\beta_1 = 0$ using an $\alpha$-risk of 5%.

(c) Comment on the validity of the error term in the Analysis.

(d) How do you explain that the best estimate of the intercept is negative?

(e) Is there any evidence here that the linear model is inadequate? Justify your answer.
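
The ANOVA quantities for parts (a), (b) and (d) can be derived from the corrected sums alone. The sketch below assumes the straight-line model with an intercept and uses only the figures given under "Calculations"; the variable names are illustrative.

```python
n = 13
sxx, sxy, syy = 8592, 38734, 178088  # corrected sums of squares and products
x_bar, y_bar = 185, 663

slope = sxy / sxx
intercept = y_bar - slope * x_bar

ss_regression = sxy ** 2 / sxx       # 1 degree of freedom
ss_residual = syy - ss_regression    # n - 2 degrees of freedom
ms_residual = ss_residual / (n - 2)
f_statistic = ss_regression / ms_residual

print("slope:", round(slope, 3), "intercept:", round(intercept, 1))
print("SS regression:", round(ss_regression, 1), "SS residual:", round(ss_residual, 1))
print("F(1, 11):", round(f_statistic, 1))
```

The F statistic would be compared with the tabled 5% point of F(1, 11), about 4.84. Period 7, with far more mail than any other period, dominates both corrected sums, which bears on parts (c) to (e).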


-64-

5. A test for the presence or absence of gout is based on the serum uric acid level x in the blood. Assume x is a normally distributed random variable. In healthy individuals x has a mean of 5.0 mg/100 ml and a standard deviation of 1.0 mg/100 ml, and in diseased persons it has a mean of 8.5 mg/100 ml and a standard deviation of 1.0 mg/100 ml.

A test T for the presence of gout is to classify those persons with a serum uric acid level of at least 7.0 mg/100 ml as diseased.

(a) In epidemiologic terminology, the "sensitivity" of a test for a disease is defined as the probability of detecting the disease when it is present. Express the sensitivity of T as a test for gout in terms of the tabulated values of the standard normal distribution.

(b) The "specificity" of a test is defined as the probability of the test not detecting the disease when it is absent. Express the specificity of T similarly.

Alternatively, this can be regarded as a statistical test of the null hypothesis

$H_0: \mu = 5.0$ mg/100 ml

against the alternative hypothesis

$H_a: \mu = 8.5$ mg/100 ml.

The data are a single observation on the serum uric acid x of the person to be classified.

(c) What is the rejection region of the test T?

(d) What is the size of the test (expressed in terms of tabulated values of the standard normal distribution)?

(e) Relate the epidemiological concepts of sensitivity and specificity to $\alpha$, $\beta$, and $1 - \beta$, the Type I and Type II error probabilities and the power of T, respectively.
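
The correspondence asked for in part (e) can be illustrated numerically. The sketch assumes the means 5.0 and 8.5 mg/100 ml, a common standard deviation of 1.0, and the cutoff 7.0, with "healthy" playing the role of the null hypothesis.

```python
from statistics import NormalDist

healthy = NormalDist(mu=5.0, sigma=1.0)   # distribution of x under H0
diseased = NormalDist(mu=8.5, sigma=1.0)  # distribution of x under Ha
cutoff = 7.0                              # classify as diseased when x >= cutoff

sensitivity = 1 - diseased.cdf(cutoff)    # P(x >= 7 | diseased) = Phi(1.5)
specificity = healthy.cdf(cutoff)         # P(x < 7 | healthy)   = Phi(2.0)

alpha = 1 - specificity                   # Type I error probability (size)
beta = 1 - sensitivity                    # Type II error probability
power = 1 - beta                          # equals the sensitivity

print(round(sensitivity, 4), round(specificity, 4), round(alpha, 4), round(beta, 4))
```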


-65-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9:30 a.m.-l:30 p.m., Saturday, April 8, 1978)

1. Suppose you were going to carry out a survey of a rural area of North Carolina of about 5000 families to establish the need for a health clinic in the area. The geographic area corresponded approximately to the service area of two telephone exchanges (a telephone exchange corresponds to the first three digits of a seven digit telephone number), and a telephone directory covering these exchanges was available.

a) Outline briefly some of the advantages and disadvantages of carrying out the survey by telephone.

b) Suppose you established that the telephone listings for the two exchanges covered 90% of the 5000 families that comprised the target population. Cost considerations prevented personal interviews and so the survey population was taken to be the 4500 families with listed telephone numbers. Further, suppose that the survey budget was sufficient to allow 300 families to be telephoned with adequate follow-up. Assuming an overall response rate of 80%, with what precision would a proportion 0.5 be estimated given that the sample was selected by simple random sampling?

c) Consider the possibility of stratifying by telephone exchange and selecting a simple random sample from each exchange. How might such a procedure influence the precision of your estimate? Explain your answer.

d) Suppose you decide to select a stratified random sample, each exchange comprising one stratum. You are particularly interested in estimating the overall proportion of families with a problem of accessing health care. The data available from the survey are as follows.

                                        Stratum 1             Stratum 2
                                        First Telephone       Second Telephone
                                        exchange              exchange
Population size (N)                         1500                  3000
# respondents = Sample size (n)               80                   160
# of households in sample
  with access problem                         10                    60

Estimate the overall proportion of families with a problem accessing health care.

What is the standard error of your estimate?
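
The stratified estimate in part (d) can be sketched as below. The variance formula is an assumption: it uses the finite population correction and the divisor $n_h - 1$ within each stratum, which is one textbook convention among several. The function name is hypothetical.

```python
from math import sqrt


def stratified_proportion(sizes, samples, counts):
    """Weighted proportion and its estimated standard error.

    sizes: stratum population sizes N_h; samples: sample sizes n_h;
    counts: number in each sample with the attribute.
    """
    total = sum(sizes)
    estimate, variance = 0.0, 0.0
    for N_h, n_h, a_h in zip(sizes, samples, counts):
        w_h = N_h / total
        p_h = a_h / n_h
        estimate += w_h * p_h
        # Assumed convention: fpc times p_h(1 - p_h) / (n_h - 1)
        variance += w_h ** 2 * (1 - n_h / N_h) * p_h * (1 - p_h) / (n_h - 1)
    return estimate, sqrt(variance)


p_st, se_st = stratified_proportion([1500, 3000], [80, 160], [10, 60])
print(round(p_st, 4), round(se_st, 4))
```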


-66-

2. The table gives the results of an experiment to compare 4 treatments with a control.

                                      Total    Mean
Control     12, 14, 15, 20, 24          85      17
#1           5,  8, 10, 17              40      10
#2           4,  7,  8, 13              32       8
#3          13, 16, 19, 24              72      18
#4           7, 10, 10, 17              44      11

a) State a linear model for this experiment, define all the terms in it, and estimate all its parameters.

b) Construct the standard analysis of variance table and test the hypothesis of no differences among treatments and control ($\alpha$ = 5%).

c) The original intention was to compare each treatment individually with the control. Use the Bonferroni (addition) method to decide which treatments are significantly different from the control, with an overall error rate not to exceed 5%.

d) After seeing the data, the experimenter decides to compare the treatments with each other. By Scheffe's method, which pairwise differences are significant, with an overall error rate not exceeding 5%?

e) Same as (d), but use Tukey's method.


-67-

3. A regression analysis is considered in which the sampling units are

the counties of a state. The dependent variable is a five-year age-

race-sex adjusted lung cancer mortality rate; the investigators wish to

find determinants of lung cancer. In planning the study, the investigators

list the following as some of the independent variables of interest to

be included in the model (there may be other variables also):

X1 = % urban of county population

X2 = % rural of county population

X3 = -1, 0, 1 if the county is in the South, Central or North region

X4 = mean per capita income of the county

X5 = % white of county population

Critique this; redefine variables where necessary in commenting on:

(a) Dependent variable

(b) X1

(c) X2

(d) X3

(e) X4

(f) X5

(g) The basic design with respect to the research problem of finding determinants of lung cancer.


-68-

4. The number of machine malfunctions per shift at a factory is recorded for 200 shifts and the following data are obtained:

No. of malfunctions (X)     0    1    2    3    4    5    6    Total
No. of shifts              77   68   38   11    3    2    1     200

a) What is a reasonable probability model for this type of data? Justify your answer with suitable reasons for choosing the particular model.

b) What is the maximum likelihood estimator of $\mu$, the true average number of malfunctions per shift?

c) Test if the model in (a) adequately describes the data.

d) The management claims that $\mu = 1$ while the workers insist that $\mu > 1$. Test whether you agree to accept the management's claim based on the above data.

5. Suppose we observe the number of defective items in a certain production process. Let p be the probability of obtaining a defective item on any single trial.

a) In a sample of $n_1 = 150$ items, $r_1 = 6$ were found to be defective. Find the 90 percent confidence interval for p.

b) Two other samples from the same process were obtained with the following results:

Sample 2: $n_2 = 100$, $r_2 = 5$

Sample 3: $n_3 = 200$, $r_3 = 10$.

Find the "best" estimator of p and the 90 percent confidence interval for p using the information from these samples.


-69-

DEPARTMENT OF BIOSTATISTICS

Examination for MPH and MSPH Candidates

(9:00 a.m. - 1:00 p.m., Saturday, April 7, 1979)

1. A city transportation department is conducting a survey to

determine the gasoline usage of its residents. Stratified

random sampling is used and the four city wards are treated as

the strata. The amount of gasoline purchased in the last week

is recorded for each household sampled. The strata sizes and

the summary information obtained from the sample are:

STRATA                         I       II      III      IV
Stratum size                 3750    3272    1387    2475
Sample size                    50      45      30      30
Sample mean (in gallons)     12.6    14.5    18.6    13.8
Sample variance               2.8     2.9     4.8     3.2

a) Estimate the mean weekly gasoline usage per household for

the city population and construct a 95% error bound for

your estimate.

b) Estimate the width of a 95% error bound for the estimator

in simple random sampling (ignoring stratification when n = 155).

c) Do you agree with the surveyor that stratification has led

to some reduction in the sampling error of the estimation

(over simple random sampling)?
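
Part (a) can be sketched as follows, assuming the usual stratified estimator with finite population corrections and taking the "95% error bound" to be two estimated standard errors (a common survey-sampling convention, assumed here). The function name is hypothetical.

```python
from math import sqrt


def stratified_mean(sizes, samples, means, variances):
    """Stratified estimate of the population mean and its estimated standard error."""
    total = sum(sizes)
    estimate = sum(N_h * m_h for N_h, m_h in zip(sizes, means)) / total
    variance = sum(
        (N_h / total) ** 2 * (1 - n_h / N_h) * s2_h / n_h
        for N_h, n_h, s2_h in zip(sizes, samples, variances)
    )
    return estimate, sqrt(variance)


mean_st, se_st = stratified_mean(
    sizes=[3750, 3272, 1387, 2475],
    samples=[50, 45, 30, 30],
    means=[12.6, 14.5, 18.6, 13.8],
    variances=[2.8, 2.9, 4.8, 3.2],
)
bound = 2 * se_st  # assumed meaning of "95% error bound"
print(round(mean_st, 2), round(bound, 3))
```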


-70-

2. A forester seeking information on basic dimensions obtains

the following measurements of the diameter 4.5 feet above the

ground and the height of 12 sugar maple trees.

Diameter x (in inches): 0.9, 1.2, 2.9, 3.1, 3.3, 3.9, 4.3, 6.2, 9.6, 12.6, 16.1, 25.8

Height y (in feet): 18, 26, 32, 36, 44.5, 35.6, 40.5, 57.5, 67.3, 84, 67, 87.5

The forester wished to determine if the diameter measurements

can be used to predict tree height.

a) Plot the scatter diagram and comment on the appropriateness

of a straight line relation.

b) Determine an appropriate linearizing transformation. In

particular, try x' =log x and y' = log y .

c) Fit a straight line regression to the transformed data.

d) What proportion of the variability is explained by the

fitted model?

e) Find a 95% confidence bound for predicted height when x = 10.

[Computations up to the second decimal place should be good enough.]
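
Parts (b) through (d) can be illustrated with a short least-squares sketch. It assumes that the twelve diameters and twelve heights are paired in the order printed, and it uses natural logarithms (the choice of base changes the intercept but not the slope or the proportion of variability explained). The back-transformed value at x = 10 is only a point prediction; the interval asked for in part (e) is not computed here.

```python
import math

# Assumption: diameters and heights are matched pairwise in the order listed.
x = [0.9, 1.2, 2.9, 3.1, 3.3, 3.9, 4.3, 6.2, 9.6, 12.6, 16.1, 25.8]
y = [18, 26, 32, 36, 44.5, 35.6, 40.5, 57.5, 67.3, 84, 67, 87.5]

# Transform both variables: x' = log x, y' = log y (natural logs here).
lx = [math.log(v) for v in x]
ly = [math.log(v) for v in y]
n = len(lx)

mean_x = sum(lx) / n
mean_y = sum(ly) / n

# Corrected sums of squares and cross-products on the log scale.
sxx = sum((a - mean_x) ** 2 for a in lx)
syy = sum((b - mean_y) ** 2 for b in ly)
sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))

# Least-squares line y' = intercept + slope * x'.
slope = sxy / sxx
intercept = mean_y - slope * mean_x

# Proportion of variability in y' explained by the fitted line.
r2 = sxy ** 2 / (sxx * syy)

# Point prediction at x = 10, back-transformed to feet.
pred_height = math.exp(intercept + slope * math.log(10))

print(f"slope = {slope:.2f}, intercept = {intercept:.2f}, R^2 = {r2:.2f}")
print(f"predicted height at x = 10: {pred_height:.1f} feet")
```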

-71-

3. An experiment is conducted to determine the soil moisture deficit

resulting from varying amounts of residual timber left after

cutting trees in a forest. The three treatments are Treatment I:

no timber left; Treatment II: 2000 bd. ft. left; and Treatment III:

8000 bd. ft. left (board feet is a particular unit of measurement

of timber volume). The measurements of moisture deficit are

b) Obtain simultaneous confidence regions for the mean

differences between individual pairs of treatments.

c) Use (b) to provide a multiple comparison test for paired

differences.

[Computations up to the third decimal place should be good enough.]

-72-

4. A biologist wishes to investigate the incidence of a particular

disease in a certain tribal population. Let p be the fraction

of the population affected by this disease.

a) Individual members are to be randomly chosen and examined

one at a time until exactly m (≥ 1) affected persons are

found in the sample. How many persons (M) are expected

to be examined using this method of sampling when the units

are drawn with replacement? What is the variance of M?

b) A second biologist decides to draw a random sample of size

n and to examine and count the number of persons (X) affected

by the disease. Derive the formulae for the

expectation and variance of X when sampling is made

(a) with replacement; and (b) without replacement.

-73-

5. Surgery was performed on glaucoma patients in order to reduce

intraocular pressure (IOP) and thereby provide relief from the

symptoms of glaucoma. In some cases, medication was provided

to the patients; these patients would take the drug periodically

over the days following surgery. Assume that the patients

receiving medication were selected at random. At a certain

time following surgery, the patients were reexamined and their

post-operation IOP measurements were taken and recorded with

their pre-operation IOP measurements. The surgery plus medicine

group is composed of 15 patients (26 eyes) and the surgery alone

group is composed of 5 patients (9 eyes). The pre-surgery IOP,

the follow-up IOP, and the absolute and percent changes in IOP

are given in the following tables.

a) Summarize the data in a manner interpretable to physicians

who are interested in whether or not the medication had any

benefits in addition to the benefits of surgery regarding

IOP. Provide appropriate counts, means, standard

deviations, standard errors, ranges, or whatever you think

appropriate.

b) Perform at least one appropriate test of significance on the

data in addressing the question of interest (see part a).

Consider both parametric and nonparametric tests in planning

the analysis.

c) What additional information, if any, pertinent to exploring

the data would you attempt to get in discussions with the

physicians?

-74-

Surgery + Medications

Patient    IOP Before    Follow-up    IOP       Percent
No.        Surgery       IOP          Change    Change

 1.        RE = 26       27            +1        +3.8
           LE = 40       25           -15       -37.5
 3.        LE = 28       19            -9       -32.1
 4.        RE = 25       19            -6       -24.0
 5.        RE = 26       19            -7       -26.9
           LE = 22       19            -3       -13.6
 6.        RE = 30       20           -10       -33.3
           LE = 40       17           -23       -57.5
 7.        RE = 40       14           -26       -65.0
           LE = 58       18           -40       -69.0
 8.        RE = 30       18           -12       -40.0
           LE = 40       18           -22       -55.0
10.        RE = 36       22           -14       -38.9
           LE = 39       24           -15       -38.5
11.        RE = 40       22           -18       -45.0
           LE = 40       24           -16       -40.0
14.        RE = 24       22            -2        -8.3
           LE = 25       18            -7       -28.0
15.        RE = 30       20           -10       -33.3
20.        LE = 30       19           -11       -36.7
21.        RE = 28       19            -9       -32.1
           LE = 30       21            -9       -30.0
24.        RE = 35       35             0         0
           LE = 38       27           -11       -28.9
25.        RE = 46       30           -16       -34.8
           LE = 32       31            -1        -3.1

* RE and LE denote the right and left eye, respectively.

-75-

Surgery Alone

Patient    IOP Before    Follow-up    IOP       Percent
No.        Surgery       IOP          Change    Change

 2.        RE = 34       17           -17       -50.0
           LE = 53        8           -45       -84.9
 9.        RE = 22       14            -8       -36.4
           LE = 30       16           -14       -46.7
17.        RE = 26       13           -13       -50.0
18.        RE = 26       22            -4       -15.4
           LE = 25       10           -15       -60.0
23.        RE = 21       18            -3       -14.3
           LE = 20       17            -3       -15.0

-76-

BASIC MASTER LEVEL WRITTEN EXAMINATION IN BIOSTATISTICS

PART I

(April 12, 1980)

SECTION A

Question 1 The following questions apply to stratified sampling:

3 points a) Briefly describe stratified (simple) random sampling.

3 points b) What makes stratified random sampling a type of probability sampling?

6 points c) Briefly discuss the set of guidelines which you would use in forming strata in this type of design.

3 points d) Briefly describe the relationship between an epsem (or self-weighting) design and proportionate stratified sampling.

10 points e) Noting that the estimated variance of $\bar{y}_0 = (1/n)\sum_{j=1}^{n} y_j$ from a simple random sample of size n from a population of size N is

where

derive an estimator for the variance of the estimated mean from a stratified random sample,

$\bar{y}_{wo} = \sum_{h=1}^{H} W_h \bar{y}_{ho}$,

where $W_h = N_h/N$, $N_h$ is the number of elements in the h-th stratum of the population, $N = \sum_{h=1}^{H} N_h$, H is the total number of strata, and $n_h$ is the number of elements in the sample from the h-th stratum.

-77-

Question 2 Consider the regression model

$Y_i = \alpha + \beta_1 X_i + \beta_2 x_i + e_i$, i = 1, ..., n.

Suppose that you want to test

5 points a) State the usual assumptions you need to make for this purpose.

5 points b) Derive the expression for the mean square due to error.

5 points c) Write down the formula for the test statistic.

5 points d) Suppose that you want to test also

$H_0^*: \beta_1 = \beta_2 = 0$ vs. $H_1^*: (\beta_1, \beta_2) \neq \mathbf{0}$.

5 points e) What are the critical regions for the testing problems in (c) and (d)?

Question 3 A large machine consists of 50 components. Past experience has shown that a particular component will fail during an 8-hour shift with probability .1. The equipment will work if no more than one component fails during an 8-hour shift.

7 points a) Calculate the probability that the machine will work throughout an entire 8-hour shift, assuming the binomial distribution is applicable. Carefully define any notation and state the assumptions required for the valid application of the binomial procedure.

6 points b) State the general situation and assumptions which are required for the Poisson model to be applicable to this situation. Give the Poisson density function.

5 points c) The Poisson distribution may be used to approximate the binomial distribution when n is "large" and p is "small". Use the Poisson distribution to find an approximation to the probability computed in part (a).

3 points d) Calculate an approximation to the probability obtained in part (a) by using the normal approximation to the binomial distribution. State the assumptions under which the approximation is reasonable.

4 points e) Compare the accuracy and usefulness of these three calculations.
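
The three calculations compared in part (e) can be checked numerically. The sketch below reads the number of components as n = 50 (the scan is unclear at that point) with failure probability p = 0.1, and computes P(X ≤ 1) three ways. The continuity correction in the normal approximation is a choice made here, not something the question specifies.

```python
import math

n, p = 50, 0.1  # assumption: 50 components, each failing with probability 0.1

# Exact binomial probability that at most one component fails.
binom = sum(math.comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(2))

# Poisson approximation with lambda = n * p.
lam = n * p
poisson = sum(math.exp(-lam) * lam ** k / math.factorial(k) for k in range(2))

def std_normal_cdf(z):
    """Standard normal distribution function, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Normal approximation with mean n*p and variance n*p*(1-p),
# using a continuity correction: P(X <= 1) is approximated by P(Z <= 1.5).
mu = n * p
sigma = math.sqrt(n * p * (1 - p))
normal = std_normal_cdf((1.5 - mu) / sigma)

print(f"binomial = {binom:.4f}, Poisson = {poisson:.4f}, normal = {normal:.4f}")
```

With these inputs the Poisson value lies closer to the exact binomial value than the normal value does, which is the kind of comparison part (e) asks for.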

-78-

SECTION B

Question 4 Suppose that X has a chi-square distribution

x > 0, ν > 0,

and Y has a beta distribution

$f_Y(y) = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, y^{\alpha-1}(1-y)^{\beta-1}$, $\alpha > 0$, $\beta > 0$, $0 < y < 1$.

6 points a) Obtain formulae for $E(X^r)$ where r is a positive integer.

7 points b) If $r < [\nu/2]$ show that

$E(X^{-r}) = [(\nu-2)(\nu-4)\cdots(\nu-2r)]^{-1}$

6 points c) Find $E(Y^{-r})$. What condition must r satisfy?

6 points d) Assume that X and Y are independent, and let Z = X/Y. Find E(Z).

Question 5 Suppose that the lifetime (T) of a certain mechanical device has a Weibull distribution with PDF

$f_T(t) = \frac{c}{\theta^c}\, t^{c-1} \exp\left[-\left(\frac{t}{\theta}\right)^c\right]$, $\theta > 0$, $c > 0$, $t > 0$.

12 points a) Obtain the formula for the cumulative distribution function, $F_T(t)$, and evaluate the expected proportion of failures

(i) before time T;
(ii) after time T.

13 points b) Suppose that N items were put on test at t = 0, and n were observed to fail before time T. Suppose that exact failure times were recorded; let $t_i$ denote the failure time of the i-th item among those items which failed (i = 1, ..., n). Construct the likelihood function.

EDITORIAL NOTE: Two tables were appended to this examination:
a) Standard normal distribution function Φ(x) for x = -3(.01)3
b) Natural logarithms ln(x) for x = 1(.01)10

-79-

BASIC MASTER LEVEL WRITTEN EXAMINATION IN BIOSTATISTICS

PART II

(April 12, 1980)

M.P.H. and M.S.P.H. students are to answer any two questions during the two-hour period (1 pm - 3 pm). M.S. students are to answer three questions, of which not more than 2 should be from Section A - time period 1 pm - 4 pm.

You are required to answer only what is asked in the questions and not all you know about the topics.

SECTION A

Question 1 A survey of 320 families, each with 5 children, revealed the following distribution:

No. of girls        0     1     2     3     4     5   Total
No. of families    18    56   110    88    40     8     320

15 points a) Is the result consistent with the hypothesis that male and female births are equally probable? Test this hypothesis at the significance levels α = 0.05 and α = 0.01.

10 points b) What is the maximum likelihood estimate of the probability of a female birth?
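
One way to approach both parts is a chi-square goodness-of-fit test against a binomial(5, 1/2) distribution, sketched below. It assumes that births within and across families are independent with a common probability, and the critical values 11.070 and 15.086 are the usual table values for 5 degrees of freedom, hard-coded here rather than computed.

```python
import math

# Observed number of families with 0, 1, ..., 5 girls.
observed = [18, 56, 110, 88, 40, 8]
children_per_family = 5
n_families = sum(observed)

# Expected counts if the number of girls is binomial(5, 1/2).
expected = [n_families * math.comb(children_per_family, k) * 0.5 ** children_per_family
            for k in range(children_per_family + 1)]

# Pearson chi-square statistic; 6 cells with no estimated parameters give 5 df.
chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Table values of the chi-square distribution with 5 df (assumed, not computed).
crit_05, crit_01 = 11.070, 15.086
reject_05 = chi2 > crit_05
reject_01 = chi2 > crit_01

# Maximum likelihood estimate of P(girl): total girls over total children.
total_girls = sum(k * o for k, o in enumerate(observed))
p_hat = total_girls / (children_per_family * n_families)

print(f"chi-square = {chi2:.2f}, reject at 0.05: {reject_05}, at 0.01: {reject_01}")
print(f"MLE of P(girl) = {p_hat:.4f}")
```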

-80-

Question 2

7 points A. Briefly describe or explain the following terms:

(a) OS
(b) TSO
(c) Track (on magnetic disk)
(d) Block
(e) Logical Record
(f) byte
(g) JCL

6 points B. Compare and contrast:

(a) OS dataset
(b) SAS database
(c) SAS dataset

(That is, demonstrate that you know what each of these terms means and the differences between them.)

C. The printout in Figure 1 was produced by the PROC PRINT statement in line 190 of the SAS program shown in Figure 2. Show what will be printed as a result of

(a) the PROC PRINT statement in line 230 in Figure 2. (Be careful to write out the entire output to be produced by SAS except titles and page headings.)

(b) Show what will be printed as a result of the PROC MEANS (Figure 2, line 250) and related statements.

(c) Show what will be printed as a result of the PROC PRINT statement in line 440 of Figure 2.

(d) Show what will be printed as a result of the PROC PRINT statement in line 530 of Figure 2.

Figure 1. Printout Produced by the PROC PRINT Statement in line 190 of Figure 2.

S T A T I S T I C A L   A N A L Y S I S   S Y S T E M

OBS    NAME          SEX    AGE    HEIGHT    WEIGHT
  1    ALFRED         M      14      69        112
  2    ALICE          F      13      56         84
  3    BERNADETTE     F      13      65         98
  4    BARBARA        F      14      62        102
  5    JAMES          M      12      57         83

-81-

Figure 2. SAS Program for Problem 2

00050 // EXEC SAS
00060 //* PW=EXAM
00070 //REX DD *
00080 M 14 69 112 ALFRED
00090 F 13 56 84 ALICE
00100 F 13 65 98 BERNADETTE
00110 F 14 62 102 BARBARA
00120 M 12 57 83 JAMES
00130 //SYSIN DD *
00140 DATA STUDENT1;
00150 INFILE REX;
00160 LENGTH NAME $ 20 SEX $ 1;
00170 INPUT SEX AGE HEIGHT WEIGHT NAME;
00180
00190 PROC PRINT; * THIS PRODUCES FIGURE 1;
00200
00210 PROC SORT; BY SEX AGE HEIGHT;
00220
00230 PROC PRINT; * PROBLEM 3(A);
00240
00250 PROC MEANS MEAN [illegible]; * PROBLEM 3(B);
00260 BY SEX;
00270 VAR AGE HEIGHT;
00280
00290 PROC SORT; BY NAME SEX AGE;
00300
00310 DATA CHANGES;
00320 LENGTH NAME $ 20 SEX $ 1;
00330 MISSING _;
00340 INPUT SEX AGE HEIGHT WEIGHT NAME;
00350 CARDS;
00360 F 13 50 . ALICE
00370 M 12 . 89 JAMES
00380 M 14 [illegible] ALFRED
00390 F 14 [illegible] . BARBARA
00400 F 16 62 102 BARBARA
00410 F [illegible] JANE
00420 ; * END OF CHANGE DATA;
00430
00440 PROC PRINT; * PROBLEM 3(C);
00450
00460 PROC SORT; BY NAME SEX AGE;
00470
00480 DATA STUDENT2;
00490 UPDATE STUDENT1 CHANGES;
00500 BY NAME SEX AGE;
00510 IF HEIGHT > 0;
00520
00530 PROC PRINT; * PROBLEM 3(D);

-82-

Question 3 A sample survey of 800 adults is conducted in a large city in order to determine citizen attitudes toward national health insurance. The sampling frame for the survey is a list of adult taxpayers from which four strata are formed corresponding to each of four major sections of the city. The design calls for selecting a simple random sample within each stratum. General data for the survey sample as well as results for an opinion question on national health insurance are as follows:

                                                       Stratum (h)
                                       (Inner    (Blue      (Middle Class   (Affluent
                                        City)    Collar)     Suburbs)        Suburbs)     TOTAL

Total number of adults (N_h)           20,000    50,000      20,000          10,000      100,000

Number of adults interviewed
in the sample (n_h)                       200       200         200             200          800

Number of interviewed sample
adults who are in favor of
national health insurance (x_h)           190       180          40              30          440

5 points a) Calculate a stratified estimate of the proportion (P) of adults in the city who favor national health insurance.

5 points b) Calculate an estimate of the variance of the estimate produced in Part (a).

5 points c) A colleague argues that since a "random sample" of adults has been chosen and since simple random sampling was used, the estimate and its variance can be calculated as if a simple random sample of n = 800 adults had been selected. Calculate the estimate and its variance as the colleague suggests.

5 points d) Comment briefly on the difference between your analysis and your colleague's analysis.

5 points e) If you decide to use Neyman allocation for a similar survey on national health insurance in the future, how would you then allocate a sample of n = 800 adults to the same four strata?
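
The calculations in parts (a), (b), (c) and (e) can be sketched as below. Several choices here are assumptions rather than anything the question fixes: the garbled table entries are read as a total sample of 800 and 180 favorable responses in the second stratum (consistent with the printed total of 440), variances use the n - 1 divisor with a finite population correction, and the Neyman allocation takes the sample proportions as planning values for the stratum standard deviations.

```python
import math

# Stratum population sizes, sample sizes and numbers in favor
# (assumed readings of the table: n = 800 overall, x_2 = 180).
N_h = [20000, 50000, 20000, 10000]
n_h = [200, 200, 200, 200]
x_h = [190, 180, 40, 30]

N = sum(N_h)
n = sum(n_h)
p_h = [x / nh for x, nh in zip(x_h, n_h)]

# (a) Stratified estimate of the proportion, with weights W_h = N_h / N.
p_st = sum(Nh / N * ph for Nh, ph in zip(N_h, p_h))

# (b) Estimated variance of the stratified estimate.
v_st = sum((Nh / N) ** 2 * (1 - nh / Nh) * ph * (1 - ph) / (nh - 1)
           for Nh, nh, ph in zip(N_h, n_h, p_h))

# (c) The colleague's analysis: treat all 800 adults as one simple random sample.
p_srs = sum(x_h) / n
v_srs = (1 - n / N) * p_srs * (1 - p_srs) / (n - 1)

# (e) Neyman allocation: n_h proportional to N_h * S_h, with S_h
# approximated by sqrt(p_h * (1 - p_h)) from the current sample.
size_times_sd = [Nh * math.sqrt(ph * (1 - ph)) for Nh, ph in zip(N_h, p_h)]
neyman = [n * w / sum(size_times_sd) for w in size_times_sd]

print(f"stratified: p = {p_st:.3f}, variance = {v_st:.6f}")
print(f"as SRS:     p = {p_srs:.3f}, variance = {v_srs:.6f}")
print("Neyman allocation:", [round(a) for a in neyman])
```

The gap between the two point estimates comes from the equal allocation of 200 interviews to strata of very different sizes, which is the issue part (d) asks about.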

-83-

SECTION B

Question 4 Read the attached article by Topoff and Mirenda from a recent issue

of Science. Note that from the data of Table 1 the authors were led

to calculate a chi-squared statistic. Discuss the statistical

appropriateness and the numerical accuracy of their analysis.

EDITORIAL NOTE: The article referred to, including Table 1, was reproduced and appended to this examination. It originally appeared in Science 207: 1099-1100 (7 March 1980).

Question 5 Ten observations on three independent variables (X1, X2, X3) and one dependent variable were obtained as follows:

 X1     X2     X3     Y

2.7    5.5    6.0    2.3
2.2    6.2    6.8    1.8
2.7    5.5    6.0    2.7
3.0    6.2    7.3    2.7
3.2    7.9    7.1    2.8
2.2    6.2    6.8    2.2
2.7    5.5    6.0    2.4
4.0    9.0    7.8    1.6
3.0    6.2    7.3    2.9
4.0    9.0    7.8    2.2

It is postulated that the model

is reasonable.

5 points (i) Inspect the model and the data and comment on the appropriateness of the model.

6 points (ii) Determine an estimate of the variance of the random error, $\sigma^2$, for these data.

8 points (iii) By plotting the data or otherwise, select a model with few parameters that appears appropriate and fit that model. Comment on the fit.

6 points (iv) Does $R^2$ tell you the same thing as a direct test of lack of fit based on an estimate of pure error?
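
Because several settings of (X1, X2, X3) are repeated, a model-free estimate of the error variance is available from the replicates. The sketch below pools the within-setting sums of squares to get such a pure-error estimate. It assumes that the garbled X2 entries in the third and seventh rows are 5.5, matching the first row; the variable names are illustrative.

```python
from collections import defaultdict

# Rows are (X1, X2, X3, Y); the 5.5 entries in rows 3 and 7 are an assumed reading.
data = [
    (2.7, 5.5, 6.0, 2.3),
    (2.2, 6.2, 6.8, 1.8),
    (2.7, 5.5, 6.0, 2.7),
    (3.0, 6.2, 7.3, 2.7),
    (3.2, 7.9, 7.1, 2.8),
    (2.2, 6.2, 6.8, 2.2),
    (2.7, 5.5, 6.0, 2.4),
    (4.0, 9.0, 7.8, 1.6),
    (3.0, 6.2, 7.3, 2.9),
    (4.0, 9.0, 7.8, 2.2),
]

# Group the responses by their setting of the independent variables.
groups = defaultdict(list)
for x1, x2, x3, y in data:
    groups[(x1, x2, x3)].append(y)

# Pool the within-group sums of squares and degrees of freedom.
ss_pure = 0.0
df_pure = 0
for ys in groups.values():
    group_mean = sum(ys) / len(ys)
    ss_pure += sum((v - group_mean) ** 2 for v in ys)
    df_pure += len(ys) - 1

# Pure-error estimate of the error variance.
s2_pure = ss_pure / df_pure

print(f"distinct settings = {len(groups)}, pure-error df = {df_pure}")
print(f"pure-error SS = {ss_pure:.4f}, estimate of sigma^2 = {s2_pure:.4f}")
```

With only five distinct settings, no more than five mean parameters can be estimated from these data, which bears on part (i).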