4

Click here to load reader

Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questionscourses.umass.edu/biep640w/pdf/BIOSTATS 640 Spring 2016 Exam 2... · BIOSTATS’640–’Spring2016

Embed Size (px)

Citation preview

Page 1: Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questionscourses.umass.edu/biep640w/pdf/BIOSTATS 640 Spring 2016 Exam 2... · BIOSTATS’640–’Spring2016

BIOSTATS  640  –  Spring  2016                                                                                                                                                                                                                                                          Exam  2  Study  Guide  

… 2016\docu\BIOSTATS 640 spring 2016 Exam 2 Study Guide.docx Page 1 of 4  

BIOSTATS 640 - Intermediate Biostatistics Spring 2016

Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questions  

   

Study  Guide    Unit  3  (Discrete  Distributions)    Take  care  to  know  how  to  do  the  following!  Learning  Objective   See:  1.    Write  down  the  expression  for  obtaining  a  probability  calculation  for  each  of  the  following:    Binomial,  Poisson,  Central  Hypergeometric.  

Notes  3,  pages  13  (binomial),  20  (poisson)  and  30  (central  hypergeometric)  

2.    Starting  with  a  study  design  (2  group  cohort,  or  case-­‐control,  or  surveillance),  write  down  the  expression  for  the  likelihood  of  the  observed  data  

Notes  4,  actually.    See  pages  5-­‐8.  

3.    Know  how  to  do  a  probability  calculation  for  a  given  situation  using  2  approaches:    binomial  and  poisson  

Notes  3,  pages  21-­‐22.  

4.    Know  how  to  do  a  Fisher  Exact  test  by  hand   Notes  3,  pages  31-­‐34  

    Unit  4  (Categorical  Data  Analysis)    Take  care  to  know  how  to  do  the  following!  Learning  Objective   See:  

1.    Know  how  to  do,  by  hand,    a  chi  square  test  for  general  association  in  a  RxC  table.    

Notes  4,  pages  10-­‐14  

2.  Know  how  to  do,  by  hand,    and  explain  a  stratified  (K=2)  analysis  of  2x2  tables.  

Consider  multiple  resources  here,  as  my  notes  may  need  improvement.  (a)    Notes  4,  pages  26-­‐39  (b)    Visit  FAQ  2  on  the  course  website  page  for  Regression  (c)    See  again,  the  handout:  “decision  tree”  for  evaluating  effect  modification  and  confounding.    

3.    Know  how  to  do,  by  hand,  a  test  of  zero  trend  in  a  2xC  table.  

Notes  4,  pages  40-­‐42    (actually  –  not  necessary  to  go  beyond  page  42  unless  you  are  very  interested!)  

                     

Page 2: Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questionscourses.umass.edu/biep640w/pdf/BIOSTATS 640 Spring 2016 Exam 2... · BIOSTATS’640–’Spring2016

BIOSTATS  640  –  Spring  2016                                                                                                                                                                                                                                                          Exam  2  Study  Guide  

… 2016\docu\BIOSTATS 640 spring 2016 Exam 2 Study Guide.docx Page 2 of 4  

 Unit  5  (Logistic  Regression)    Take  care  to  know  how  to  do  the  following!  Learning  Objective   See:  

1.    Write  down  the  likelihood  L  for  a  single  individual  outcome  distributed  Bernoulli  that  incorporates  a  linear  model  of  the  logit  of  the  event  probability.    

Notes  5,  appendix  2  is  a  good  place  to  start.    See  especially    page  63  

2.    Starting  with  a  given  fitted  logistic  model,  extract  estimated  OR,  and  predicted  probability.    As  well,  know  how  to  compute  the  estimated  OR  for  the  comparison  of  two  profiles  of  values  of  the  predictor  variables.  

For  simple  OR,  notes  5  pages  12  For  predicted  probability,  notes  5,  pages  9  and  17.  For  OR  comparison  of  two  profiles,  notes  5,  pages  14-­‐15  

3.    Know  how  to  write  out  models  for  the  expected  logit  in  various  settings:    (a)  basic,  (b)  confounding,  and  (c)  effect  modification  

 I  will  talk  about  this  in  class.    

4.    Know  how  to  assess  a  current  model.  Is  it  statistically  significantly  better  than  a  model  with  no  predictors  in  it?  

 I  will  talk  about  this  in  class.  

5.    Compare  two  hierarchical  models  using  the  likelihood  ratio  (LR)  test.    Specifically,  show  how  the  test  statistic  is  calculated,  then  calculate  it,  then  interpret  it.  

Notes  5,  pages  20-­‐21.    An  example  is  detailed  on  page  21.  

   

Page 3: Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questionscourses.umass.edu/biep640w/pdf/BIOSTATS 640 Spring 2016 Exam 2... · BIOSTATS’640–’Spring2016

BIOSTATS  640  –  Spring  2016                                                                                                                                                                                                                                                          Exam  2  Study  Guide  

… 2016\docu\BIOSTATS 640 spring 2016 Exam 2 Study Guide.docx Page 3 of 4  

Practice  Questions  Practice Question 1 Both the Binomial and Poisson distributions have been used to model the quantal nature of synaptic transmission. Crudely, the quantal hypothesis says that a nerve terminal contains a very large number of “quanta”, each with a small probability of releasing acetylcholine (ACh) in response to a nerve stimulus. Suppose it is known that, for a given stimulus, the probability of Ach release is 0.01 for each quantum and is the same for all quanta. You may assume the quantal responses are independent. (a) Using the Binomial distribution, what is the probability that in a nerve terminal containing 200 quanta zero Ach is released in response to stimulus? (b) Using the Poisson distribution, what is the probability that in a nerve terminal containing 200 quanta zero Ach is released in response to stimulus? Practice Question 2 A logistic regression analysis was used to explore the relationship between the diabetes (presence or absence) and body mass index (BMI). The Y-variable for this analysis was Y=Diabetes and was coded Y=1 for persons with diabetes and Y=0 for persons without diabetes. The X-variable for this analysis was X=BMI where BMI is measured as kg/m2. The following fitted model was obtained:

π̂ln = -3.034 + 0.075X

ˆ1 - π⎛ ⎞⎜ ⎟⎝ ⎠

With the following values of ln-likelihood: Ln-Likihood (intercept only) = -116.652 Ln-Likelihood (intercept + BMI) = -113.852

(a) Using the information given in the fitted model, together with your understanding of logistic regression, complete the following table by filling in the five blanks in the 2nd row .

Coefficient Standard Error Wald Statistic p-value OR 95% CI for OR

Intercept

Not asked

0.893

Not asked

Not asked -

-

BMI

_________

0.032

________

_______

_______

____________

(b) Using the information given in the fitted model, calculate the value of the estimated odds ratio for the outcome of diabetes in relationship to a 5 kg/m2 increase in BMI.

Practice Questions 3 and 4

Page 4: Exam 2 (Units 3, 4 & 5) Study Guide and Practice Questionscourses.umass.edu/biep640w/pdf/BIOSTATS 640 Spring 2016 Exam 2... · BIOSTATS’640–’Spring2016

BIOSTATS  640  –  Spring  2016                                                                                                                                                                                                                                                          Exam  2  Study  Guide  

… 2016\docu\BIOSTATS 640 spring 2016 Exam 2 Study Guide.docx Page 4 of 4  

Consider again the same logistic regression analysis setting of practice question 2. Further analysis of diabetes explored two additional predictors: treatment with digoxin (X2) and non-white race (X3). The following is a full coding manual.

Variable Label Codings Outcome Y Diabetes 1 = yes, 0 = no

Predictors X1 BMI Continuous kg/m2

X2 Digoxin 1 = yes, 0 = no X3 Race 1 = non-white, 0=other

The fitted logit model is now the following.

1 2 3π̂ln = -2.948 + 0.081X - 0.796X + 0.904X

ˆ1 - π⎛ ⎞⎜ ⎟⎝ ⎠

The following ln-likelhood values are provided for you: Ln-Likihood (intercept only model) = -116.652 Ln-Likelihood (intercept + X1 model) = -113.852 Ln-Likelihood (intercept + X1 + X2 + X3 model) = -108.917

Practice Question 3 Using the fitted logit model, calculate the estimated probability of diabetes for a person with BMI of 24 kg/m2, on digoxin treatment, and being of non-white race. Practice Question 4 Carry out the appropriate likelihood ratio test to compare the reduced model containing X1 = BMI with a full model containing all three predictors X1 = BMI, X2 = Digoxin and X3 = Race

Practice Question 5 A logistic regression analysis of likelihood (π) of mortality considered several variables: shock (SHOCK: coded 1=shock, 0=NO shock), malnutrition (MALNUT; coded 1=malnourished, 0 = NOT malnourished), alcoholism (ALC: coded 1=alcoholic 0=NOT alcoholic), age (AGE: continuous), and bowel infarction (INFARCT: coded 1=infarction, 0=NO infarction). The following fitted model was obtained:

ˆ ˆlogit[π] = -9.754 + 3.674[SHOCK] + 1.217[MALNUT] + 3.355[ALC] + 0.09215[AGE] + 2.798[INFARCT] What is the estimated probability of death for a 60 year old malnourished patient with no evidence of shock, but with symptoms of alcoholism and prior bowel infarction? In developing your answer write out the formula you use and provide the numeric estimate.