50
The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research Eileen C King, PhD Research Associate Professor, Biostatistics Acting Director, Data Management Center

The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Embed Size (px)

Citation preview

Page 1: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

The Importance of Good Clinical Data Management and Statistical

Programming Practices to Reproducible Research

Eileen C King, PhD Research Associate Professor, Biostatistics Acting Director, Data Management Center

Page 2: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Reproducible Research The term first proposed by Jon Claerbout at Stanford

University and refers to the idea that the ultimate

product of research is the paper along with the full

computational environment used to produce the

results in the paper such as the code, data, etc.

necessary for reproduction of the results and

building upon the research.

Page 3: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Why? • Scholarship can be recreated, better

understood and verified.

• Others can start from the current state of the art

• Simplifies task of comparing a new method to existing methods

• Create earlier results again in a later stage of the research

Page 4: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Reproducibility of Clinical Research is Cornerstone of Drug

Development Process • Phase III trials - Requirement for two

trials that produce similar results (reproducibility)

• Summary of Clinical Efficacy Document – Displays the reproducibility of the efficacy

results

Page 5: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Drug Development Submissions • Required to provide all data along with

complete documentation

• May be required to provide statistical analysis code along with complete documentation

Page 6: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Why? • Regulators (e.g. FDA) will recreate the

results in your submission

• Regulators will use the data to evaluate efficacy and safety issues across a drug class – cardiac effects from NSAIDS – effectiveness of antihistamines in OTC cold

products

Page 7: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Academic Research Centers should

steal shamelessly from the Drug Development Process in order to facilitate Reproducible

Research

Page 8: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Why? • NIH is requiring:

– Data Sharing Plan in large grants • Data and Documentation • Statistical Programming Code and Documentation

– Posting of clinical study results to

clinicaltrials.gov

Page 9: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Facilitating Reproducible Research

Page 10: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Develop Standard Operating Procedures (SOPs)

• SOPs should be written to cover all key elements of the conduct of a study

• Provide enough detail to ensure steps are consistently carried out

• Don’t provide so much detail as to end up with violations due to normal variations in working

• Written at a high level to outline: – Required tasks – Sign-offs – Checks performed

Page 11: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Develop Work Instructions • Work instructions can be more specific to

a particular division or study

• Can document in more specific detail how to do things

• Generally not formally audited by regulatory agencies on work instructions

Page 12: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Assemble Multi-Functional Teams

Assemble as soon as study planning is underway Team should provide input into the protocol regarding their functional area

Page 13: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Study Team

Data Management

Investigator

Clinical Operations

Statistical Programming

Statistics QA

Biomedical Informatics

Regulatory

Pharmaco-vigilance

Coordinator

Page 14: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Cannot Work in Functional Silos • Expertise is needed in each of these areas

from protocol development through manuscript submission

• Amount of effort varies dependent on stage of study

Page 15: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Documents Required for Reproducible Research

• Study Protocol

• Manual of Procedures (MOP)

• Data Management Plan (DMP)

• Statistical Analysis Plan (SAP)

Page 16: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Importance of Data Management for Reproducible

Research

Page 17: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

“As its importance has grown, clinical data

management has changed from an

essentially clerical task in the late 1970s and

early 1980s to the highly computerized

specialty it is today”

Susanne Prokscha (2007): Practical Guide to Clinical Data Management

Page 18: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Society of Clinical Data Management (SCDM)

• Founded to advance the discipline of clinical data management (CDM)

• Organized exclusively for educational and scientific purposes

• Mission: Promoting clinical data management excellence including promotion of standards of good practice within clinical data management

• Provides certification: The CCDM® program establishes eligibility criteria and standards of knowledge as measured by a rigorous examination to qualified applicants.

Page 19: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Good Clinical Data Management Practices (GCDMP)

Charter: The review and approval of new pharmaceuticals by federal regulatory agencies is contingent upon a trust that the clinical trials data presented are of sufficient integrity to ensure confidence in the results and conclusions presented by the sponsor company. Important to obtaining that trust is adherence to quality standards and practices. To this same goal, companies must assure that all staff involved in the clinical development program are trained and qualified to perform those tasks for which they are responsible From SCDM GCDMP document

Page 20: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Certified Clinical Data Manager (CCDM)

• SCDM certification program was designed to meet the following goals: – Establish and promote professional practice standards

throughout CDM

– Identify qualified professionals within the profession

– Ensure recognition of expertise

– Enhance the credibility and image of the profession

Page 21: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Data Management Plan (DMP) • Details how data will be:

– Collected – Stored – Managed – Archived

• Describes and defines all data management

activities for a study – What – Who – When – How

Page 22: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Timing of DMP • Development begins after the protocol and

Case Report Forms are drafted

• Should be completed before the study begins

• Must be kept current – reflect important changes to the data

management process and computer systems that took place during the study

Page 23: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

General DMP Contents

• Protocol Summary • Study

Personnel/Roles • CRF Design/Tracking • Database

Development • Data Entry and

Processing • Data Cleaning

• Reports • Audit Plans • External Data

Transfers • Managing Lab Data • SAE Handling • Coding • Training • Study Closeout

Page 24: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Benefits of DMP • Forces planning • Process and tasks become more visible to

the project team

• Expected documents are listed at the start of the study so they can be produced during the study

Page 25: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Benefits (Cont) • Provides continuity of process and a

history of a project

– Useful for long-term studies – Useful for personnel turnover and growing DM

groups

• Regulatory Requirement – Auditors will ask for it

Page 26: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

DMP • Planning upfront saves time at end and

improves data quality

• First DMP is time-consuming to do

• Use templates and previous DMPs to ease upfront burden

Page 27: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Development of the Data Management Plan

is so important that the DMC is

recommending tracking of the percent of

studies for which a DMP was written prior to

first patient enrolled.

Page 28: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Selection of Database

Page 29: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

21 CFR part 11 In March of 1997, FDA issued final part 11 regulations that provide criteria for acceptance by FDA, under certain circumstances, of electronic records, electronic signatures, and handwritten signatures executed to electronic records as equivalent to paper records and handwritten signatures executed on paper. These regulations, which apply to all FDA program areas, were intended to permit the widest possible use of electronic technology, compatible with FDA's responsibility to protect the public health.

Page 30: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Implications • If data are being captured in an electronic

system with no paper CRFs then should(must) use compliant system

• System can be compliant but user must follow standard procedures to assure total process is compliant

• At minimum, use system with audit trails

Page 31: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Current Options • REDCap

– Not yet

• Request for Proposals to purchase 21CFR part 11 compliant system – Oracle RDC – Medidata Rave – Study Manager – Omnicomm Trialmaster – TargetHealth

Estimated timing for purchase: Summer 2012 Estimated timing for installation: End of 2012

Page 32: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

NOT Options • EXCEL spreadsheets

• Microsoft Access

Page 33: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Data Base Design • Investigators, Coordinators, Data

Managers, Biomedical Informatics, and Statisticians must work together

• Poor database design affects timings for: – Data Entry – Data cleaning – Extraction – Statistical Analysis

Page 34: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Things to do to quality of data in data base

• Enter data as soon as possible after it is received

• Run cleaning procedures throughout the study so that queries go out early

• Identify missing CRF pages and lab data by knowing what is expected. Use tracking systems

• Code AEs and meds frequently

Page 35: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Improving Quality (cont) • Reconcile SAEs periodically throughout the study

– Get listings from safety system early so you know what to expect

• Begin to audit data against the CRF early to detect systematic problems Continue to audit as the study proceeds to monitor quality.

• Open the study documentation at the start of the study and make an effort to keep it updated as the study progresses.

Page 36: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Data Standards Facilitate Reproducible Research

Page 37: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Value of standard modules in CRF Design

• Completion instructions are the same

• Database design is the same

• Edit checks are the same

• Associated statistical analysis programs are the same

• Easier to compare and combine data across studies

Page 38: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Developing Standards • Standards can include:

– Case Report Forms / Instructions – Variable names – Codelists

• Must be a commitment to following the

standards that are developed

Page 39: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Resource for Standards

• Clinical Data Interchange Standards Consortium (CDISC) – Global, open, multidisciplinary, non-profit

organization that has established standards to support the acquisition, exchange, submission and archive of clinical research data and metadata.

Page 40: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

CDISC Mission

• To develop and support global, platform-independent data standards that enable information system interoperability to improve medical research and related areas of healthcare

• Standards are freely available via the CDISC website.

Page 41: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

• Leading the development of standards for data collection is goal for Data Management Center

• Will need standards committee to develop and maintain which involves: – Clinical teams – Data Management – Statistics – Pharmacovigilance – Biomedical Informatics

Page 42: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Importance of Good Statistical Programming Practices to Reproducible Research

Page 43: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Statistical Programmers

• Work closely with Data Management to: – Specify format of data base that will be

received by statistics

– Assist with data cleaning by running analysis programs on live data base

• Benefit: – Statisticians don’t spend time cleaning data

Page 44: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Statistical Analysis Plan Includes: • Hypotheses to be tested

• Identification of primary and secondary endpoints

• Statistical analysis approach for each hypothesis

– Assumption testing – Alternative approaches – Data handling decisions

• Draft tables and figures for reports/manuscripts

Page 45: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Good Statistical Programming Practices

• Clear documentation contained within programming code

• Creation of Analysis Datasets – Ensures all programs are using a consistent set of

derived data.

• Validation of Statistical Programs – How: Second programming or code review – What: All analysis datasets and tables and figures

Page 46: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Tips to Facilitate Reproducibility

• Produce Report-Ready Output – RTF files cut and pasted into

manuscript/report – Authors need not edit the numerical output – Authors can change titles and footnotes

• Conduct quality assurance audits of

manuscripts/reports

Page 47: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Reproducibility (cont) • Programs for Tables and Graphs

– Archive SAS Programs, list and log files – All tables should contain a footnote with the

location and name of the SAS program that created it

• When report is complete, change permissions to read-only – All SAS programs, list and log files – Analysis datasets

Page 48: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Where Can I Get Help?

http://cctst.uc.edu/

Go to Research Central on Centerlink

CCTST: Center for Clinical and

Translational Science and Training

CCTST can identify resources for your study

Page 49: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Resources Available

• Clinical Trial Office

• Biomedical Informatics

• Data Management Center

• Biostatistical Consulting Units

• Drug Poison Information Center

Page 50: The Importance of Good Clinical Data Management and ... King 2012-01...The Importance of Good Clinical Data Management and Statistical Programming Practices to Reproducible Research

Thank you!