33
1 Data Extraction Handbook, Chapter 7

Data Extraction

Embed Size (px)

DESCRIPTION

Cochrane Review author training workshop, January 22-23, 2009 at the University of Calgary Health Sciences Centre

Citation preview

Page 1: Data Extraction

1

Data Extraction

Handbook, Chapter 7

Page 2: Data Extraction

2

Data Collection Form

Importance Linked directly to review question and

planned assessment Historical record of decisions through the

review process Data repository, from where the analysis will

emerge Vary across reviews, but fundamental

components

Page 3: Data Extraction

3

Components of Data Form

Paper vs electronic forms → Chapter 7.5.2 for considerations

Consider how much information to collect (too much vs too little)

Careful thought and planning

Logical to entry into RevMan, especially for electronic forms

Page 4: Data Extraction

4

Components of Data Form (continued)

Items to include:– Review title, author name, who collecting data– Review unique ID, Study ID (RevMan), unique

report ID (multiple reports)– Date (for multiple chronologic versions)– Notes section (up front) – use for RevMan– Verification of study eligibility Table of excluded studies and reasons required– Study characteristics, information to assess bias,

results– Accurate coding: instructions and decision rules,

use of ‘Not reported’ and ‘Unclear’

Page 5: Data Extraction

5

Details for Protocol

Data categories to be collected– Again, the study is the unit of interest

Who and how many for collection– 2 people, independent– Content and non-content expert?

Piloting, training, existence of coding instructions for the data form– Piloting for information and instructions– Kappa not routinely done, but can and only for the

most important data

Page 6: Data Extraction

6

Details for Protocol (continued)

How data extracted for multiple reports of the same study– How to collate: CONSORT flow diagrams may

help How disagreements handled

– Process (consensus → arbitrator → study authors → report disagreement in review)

Blinding to aspects of study reports not generally recommended

Page 7: Data Extraction

7

Data Form: Process

Think – what are the needs? Design – draft form Pilot – sample of papers, compare completed

forms Refine – modify form, instructions Extract – further revisions may be needed

once data extraction underway, this is okay

Consider retraining or recoding with passage of time, also for updating

Page 8: Data Extraction

8

Characteristics of Included Studies

Extracted data:– Methods– Participants– Interventions– Comparisons– Outcomes– Information for risk of bias (later)– Notes

Page 9: Data Extraction

9

Characteristics of Included Studies

Page 10: Data Extraction

10

Characteristics of Included Studies

Page 11: Data Extraction

11

Characteristics of Included Studies

Page 12: Data Extraction

12

Table of Excluded Studies

Page 13: Data Extraction

13

Studies Awaiting Classification

Page 14: Data Extraction

14

Table of Ongoing Studies

Page 15: Data Extraction

15

Data Types

Dichotomous Continuous Ordinal Counts and rates Time-to-event

Page 16: Data Extraction

16

Dichotomous (binary) Outcomes

The outcome is one of two possibilities, cannot be both e.g., pregnant vs. not

Intervention

Event No event

a b

c d

Event No event

Control

a+b = nI

c+d = nC

Data required: ‘event’ and ‘no event’ for

each group

Page 17: Data Extraction

17

Dichotomous outcomes (continued)

May experience difficulties with:– Poor reporting– Numbers may need to be derived from

percentage data provided in the report (which denominator to use, compatibility with more than one numerator)

Sometimes ordered categorical data (ordinal) are treated as dichotomous data

Page 18: Data Extraction

18

Continuous outcomes

Outcomes that can take any value in a specific range eg, weight, length of stay

Sometimes data from ordered categories (ordinal) are treated as continuous

Check if data can be treated as continuous (Consult CRG statistician)

Effect measures: mean difference (difference in means) or standardized mean difference (factors in standard deviation)

Page 19: Data Extraction

19

Ordinal outcomes

Categories with a natural order Can range in number of categories Measurement scales: important to know if

validated Discussion on how to analyze: Section 9.2.4 Analyze as dichotomous? Continuous? As

is? Consult CRG statistician Extract data in all forms in which reported

Page 20: Data Extraction

20

Counts of events

Events that can happen more than once to a

given individual Eg, MI, adverse event Common vs. rare events Different methods exist for analysis Consult CRG statistician Extract data in the form they are reported in

Page 21: Data Extraction

21

Time-to-event outcomes

Analysis of whether the event occurred and when ‘Survival data’ in statistics E.g., survival, disease recurrence For each individual:

– ‘no event’ period– at the end of that period, whether event occurred

or is just the end of observation (censored) Hazard ratio the most appropriate effect measure Methods of meta-analysis Section 9.4.9; consult

CRG statistician

Page 22: Data Extraction

22

Planning Your Analysis

Handbook, Chapter 9

Page 23: Data Extraction

23

Planning Your Analysis

Specify comparisons First and most important step! Back to PICO – should relate clearly and directly

Page 24: Data Extraction

24

What Comparisons are Important?

Pair-wise comparisons– Glucosamine versus placebo– Glucosamine versus no therapy– Glucosamine versus NSAIDs– Acupuncture plus vitamin B12 injections versus

B12 injections alone

Specify the main comparisons in the protocol If need to modify in light of the data (eg,new

comparison)…document!

Page 25: Data Extraction

25

Characteristics of Intervention and Control

Are the interventions or controls all the same?

Different types of drugs used (eg inhaled steroids? NSAIDs?)

Different dosages/duration of therapy/preparation

If not the same, are they similar enough to be combined?

Page 26: Data Extraction

26

Population Description

Separate or combine? E.g. Mild versus severe rheumatoid arthritis Age issues? Defining separation points?

Page 27: Data Extraction

27

Outcomes

Determine what to combine – shouldn’t be too diverse

Start with the outcomes that are considered most important (specified in protocol)

Eg, mortality, pain, function

May be same variable, but could be categorical in one study and continuous in another

Eg, pain – continuous: 0-10 mmVAS, 1-5 Likertdichotomous: ‘moderate’, ‘severe’

RevMan: outcomes are entered after the comparisons have been set up

Page 28: Data Extraction

28

Hierarchy of Data and Analysis Section

• Results of studies• Tabular• Fixed format• Forest plots automatically generated

Page 29: Data Extraction

29

Sample Table

Hyperlink to Forest plotHyperlink to Forest plot

Page 30: Data Extraction

30

Link out to Forest Plot

Page 31: Data Extraction

31

Make a Plan

Sketch it out on paper Decide what you want to do before you start

entering into RevMan Document changes between the protocol and

conducting the review

Page 32: Data Extraction

32

Critical Elements

Input from more than one person, including “expert”

Transparency; state post-hoc decisions Clear, consistent

Page 33: Data Extraction

33

Data extraction exercise