Introduction to VINCI data services
HSR&D Cyber Seminar
October 11, 2011
VA Informatics and Computing Infrastructure

Agenda
What is VINCI (review)
Available data
VINCI data services
VINCI data access/analysis tools
Planned enhancements/new features
Questions?

What is VINCI?
Secure analytical workspace
Protect PHI/PII
Regulated data access for research
Access to data analysis software tools
Access to high performance servers
Large high speed data storage
Custom software
RDP connection – Cloud computing
Staff to assist you

VINCI Organization
Veterans Affairs
VHA
OI&T
ORD
BISL/CDW
HSR&D
VINCI

What is VINCI
Data Center in AITC (Austin, Texas)
Staff mostly in Salt Lake City field office
VINCI works closely with NDS for data access approval
Serving both research and operations users
CDW as data provider, VINCI providing analysis service – working in partnership
VINCI includes custom software development work for research use

Poll Question
Have you already used VINCI data/services?
Yes
No
Did not know about VINCI

Available Data
CDW Production Data
CDW Raw Data
SAS Datasets

CDW Production Data
Data warehouse modeled data
Hosted in SQL Server 2008 R2
Fact and dimension tables
Extracted through CDW's journaling process from shadow servers
Unique CDW identifiers
Indexed for fast queries
Updated nightly

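The fact-and-dimension layout described above can be illustrated with a small, self-contained sketch. It uses Python's built-in sqlite3 module rather than SQL Server, and every table and column name in it is hypothetical: none of them are actual CDW names. It only shows the general pattern of joining an indexed fact table to its dimension tables through surrogate identifiers.

```python
import sqlite3

# Hypothetical schema for illustration only; these are NOT actual CDW tables or columns.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_patient (patient_sid INTEGER PRIMARY KEY, birth_year INTEGER);
    CREATE TABLE dim_vital_type (vital_type_sid INTEGER PRIMARY KEY, vital_name TEXT);
    CREATE TABLE fact_vital_sign (
        vital_sign_sid INTEGER PRIMARY KEY,
        patient_sid INTEGER REFERENCES dim_patient(patient_sid),
        vital_type_sid INTEGER REFERENCES dim_vital_type(vital_type_sid),
        taken_date TEXT,
        result REAL
    );
    -- An index on the join column keeps patient lookups fast as the fact table grows.
    CREATE INDEX ix_vital_patient ON fact_vital_sign (patient_sid);
""")

# Made-up rows, no real patient data.
conn.executemany("INSERT INTO dim_patient VALUES (?, ?)", [(1, 1950), (2, 1962)])
conn.executemany("INSERT INTO dim_vital_type VALUES (?, ?)", [(10, "PULSE"), (11, "WEIGHT")])
conn.executemany(
    "INSERT INTO fact_vital_sign VALUES (?, ?, ?, ?, ?)",
    [(100, 1, 10, "2011-01-05", 72.0),
     (101, 1, 11, "2011-01-05", 180.5),
     (102, 2, 10, "2011-02-10", 88.0)],
)

# Join the fact table to both dimensions and keep only pulse readings.
rows = conn.execute("""
    SELECT p.patient_sid, t.vital_name, f.taken_date, f.result
    FROM fact_vital_sign AS f
    JOIN dim_patient AS p ON p.patient_sid = f.patient_sid
    JOIN dim_vital_type AS t ON t.vital_type_sid = f.vital_type_sid
    WHERE t.vital_name = 'PULSE'
    ORDER BY p.patient_sid
""").fetchall()
```

The same join pattern would apply in SQL Server; only the connection and the real table names would differ.
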
CDW Production Data
Patient Demographics
Vital Signs
Outpatient Pharmacy
Consults
Health Factors
Outpatient Encounters
Immunization
Lab Chemistry
PCMM

CDW Production Data
Orders
Appointments
More domains coming soon

CDW Production Data
For updated production domain information see:
http://vaww.vinci.med.va.gov/vincicentral/data.aspx
http://vaww.dwh.cdw.portal.va.gov/Pages/welcome.aspx

CDW Raw Data
Extracted directly from Vista Cache databases
Snapshot data
Same structure as source, not modeled
Data types and column names standardized by VINCI – indexes added
Updated weekly, monthly, as needed
Can be joined with production and other data
Additional data domains can be requested

CDW Raw Data
Bill Claims
Fee Basis
Non-VA Meds
MyHealtheVet
Patient Treatment File
Prosthetic Surgery
Inventory
Travel

CDW Raw Data
Intravenous Meds
TIU
Allergies/Adverse Events
Radiology

Text Integration Utilities (TIU)
Medical document information such as clinical notes and nurses' notes
Available for all regions (working on last 7 stations)
Full Text Indexed – Very fast searches
Contains real SSN – access regulated by NDS
Updated as needed
Does not include radiology and pathology reports

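Why a full-text index makes note searches fast can be shown with a toy inverted index. This is a minimal sketch in plain Python, not SQL Server's full-text search feature, and the note text is made up. The idea is that each word maps directly to the set of documents containing it, so a query intersects a few small sets instead of scanning every note.

```python
import re
from collections import defaultdict

# Made-up note text for illustration; no real clinical data.
notes = {
    1: "Patient reports chest pain on exertion.",
    2: "No chest pain. Follow up in 6 months.",
    3: "Knee pain after fall; x-ray ordered.",
}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(documents):
    """Map each word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index

def search(index, query):
    """Return ids of documents containing every word in the query."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

index = build_index(notes)
matches = search(index, "chest pain")
```

Note that this sketch matches words only. It would also return note 2, which negates the finding, so word matching alone is not a substitute for reading the notes or for NLP tooling.
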
Allergies/Adverse Event (Raw)
Contains patient allergy records and adverse reaction events
Includes: allergy recorded dates, locations, and allergy reactants such as food, drug, and others
Includes all stations
Updated monthly

Radiology (Raw)
Includes report notes, impression text and additional clinical history
Radiology procedures include: MRI, CT scan, bone scan, ultrasound, echocardiogram
Includes all stations
Updated monthly

SAS Datasets
Uploaded from AITC mainframe
Regularly updated with mainframe data
Available in both SAS and SQL Server format
Combined in VINCI for faster & easier access

DSS NDE (SAS)
All NDEs, including LAB, LAR, OUT, DISCH, PHA, and RAD
FY 2005-current combined and provided by DSS team
Data prior to FY 2005 was transformed, combined and loaded to SQL Server by VINCI staff
Monthly or quarterly update

MedSAS (SAS)
Available in VINCI in both SAS and SQL Server table format
Transformed & loaded to SQL Server by VINCI
FY 2000 – 2010
Combined over multiple years in SQL Server
Includes:
  Outpatient SE/SF files
  Inpatient Encounter (SE)
  Inpatient datasets
  Inpatient census datasets

Vital Status
Includes date of death, Medicare yearly enrollment indicators, date of last activity, and Veteran status flags, etc.
Transformed and loaded from SAS to SQL Server tables by VINCI
Updated quarterly

Other Research Data
PBM
CMS
HERC
Active duty personnel data
Registry data
Birth/death certificate data, or cause of death

Poll Question
Which data analysis tools do you use?
SQL Server
SAS
STATA
PASW (SPSS)
R
Others

VINCI Data Services
Data description documents
Preparatory consulting
Cohort selection
Data extraction
Data formats
Data access and security
External data

Data Description Documents
http://vaww.vinci.med.va.gov/vincicentral/data.aspx

Preparatory Consulting
Data needs assessment
Work with VINCI prior to NDS approval to solidify requirements
Contact: [email protected]

Cohort Selection
If the study does not have a cohort, we will help to create one
Research Study Team provides specific criteria
VINCI provides data necessary for calculations (if any)
Research Study Team completes cohort calculations (if any)

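As a rough illustration of turning study criteria into a cohort, the sketch below applies two made-up criteria (born in or before a given year, and at least two outpatient visits within a date window) to toy data. It uses Python's built-in sqlite3 module; the table names, column names, and criteria are assumptions for the example and do not describe VINCI's actual cohort process or data.

```python
import sqlite3

# Hypothetical tables and made-up rows; not actual VINCI or CDW data.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE patient (patient_id INTEGER PRIMARY KEY, birth_year INTEGER);
    CREATE TABLE outpatient_encounter (
        encounter_id INTEGER PRIMARY KEY,
        patient_id INTEGER REFERENCES patient(patient_id),
        visit_date TEXT
    );
""")
conn.executemany("INSERT INTO patient VALUES (?, ?)", [(1, 1940), (2, 1980), (3, 1935)])
conn.executemany(
    "INSERT INTO outpatient_encounter VALUES (?, ?, ?)",
    [(1, 1, "2010-03-01"), (2, 1, "2010-07-15"),
     (3, 2, "2010-04-02"), (4, 2, "2010-05-09"),
     (5, 3, "2010-06-20")],
)

# Criteria as the study team might state them (illustrative values only).
criteria = {
    "max_birth_year": 1945,
    "start": "2010-01-01",
    "end": "2010-12-31",
    "min_visits": 2,
}

# Keep patients who meet the birth-year limit and have enough visits in the window.
cohort = [row[0] for row in conn.execute("""
    SELECT p.patient_id
    FROM patient AS p
    JOIN outpatient_encounter AS e ON e.patient_id = p.patient_id
    WHERE p.birth_year <= :max_birth_year
      AND e.visit_date BETWEEN :start AND :end
    GROUP BY p.patient_id
    HAVING COUNT(*) >= :min_visits
    ORDER BY p.patient_id
""", criteria)]
```

Keeping the criteria in one dictionary makes it easy to review them against the approved protocol before the query is run.
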
Data Extraction
Research Study Team receives project approval from NDS
Data Access Group created including Research Study Team participants
Used for granting access to data & workspace
Research Study Team receives alerts that the Correspondence Site has been created
All communication takes place on the Site
Correspondence Site is easy to use

Correspondence Site

Data Extraction
Research Study Team uploads cohort to a secure site or VINCI creates cohort per requirements
Research Study Team completes the Data Selection Forms on correspondence site
NDS approved data domains are extracted and provided
Analysis performed by research project team staff

Data Formats
SQL Server database
SAS files
PASW (SPSS)
STATA
Excel
Flat files
Others
Tools available for data conversion

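Converting between the formats listed above usually amounts to reading rows from one store and writing them to another. The sketch below shows one such conversion, from a database table to a CSV flat file, using only Python's standard library. It is not one of the VINCI conversion tools, and the table, columns, and `export_to_csv` helper are hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical table with made-up rows; not actual VINCI data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE lab_result (patient_id INTEGER, test_name TEXT, result REAL)")
conn.executemany(
    "INSERT INTO lab_result VALUES (?, ?, ?)",
    [(1, "GLUCOSE", 98.0), (2, "GLUCOSE", 110.5)],
)

def export_to_csv(connection, query, out_file):
    """Run a query and write a header row plus all result rows as CSV."""
    cursor = connection.execute(query)
    writer = csv.writer(out_file, lineterminator="\n")
    writer.writerow([column[0] for column in cursor.description])
    writer.writerows(cursor)

# Write to an in-memory buffer here; a real export would open a file instead.
buffer = io.StringIO()
export_to_csv(
    conn,
    "SELECT patient_id, test_name, result FROM lab_result ORDER BY patient_id",
    buffer,
)
csv_text = buffer.getvalue()
```

A flat file produced this way can be read by SAS, STATA, SPSS, R, or Excel, which is why CSV is a common interchange format.
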
Data Access & Security
Access groups created based on IRB and NDS approved research team
Only research team members have access to the data
Data stored on secure VINCI servers
Regular data backup and archiving
Workspace vs. collaboration site
Project work can be performed in VINCI
Export final result & publication

External Data
Research Study Team may upload other data for analysis into project database or workspace
Secure data upload process
Optional direct database upload
VINCI data managers work on behalf of research team to upload data from other data providers

Data Processing/Analysis tools
SQL Server as primary data store
Multiple high performance servers
Most data queries performed in SQL Server
Accessible by all analysis software
SSIS, SSRS, SSAS
High speed intra-server network will allow distributed queries

SAS
High performance SAS server
2 TB of RAM, 64 cores, 2 TB SSD
Launch grid jobs
SAS grid – very large data analysis work
10 high performance servers
Most advanced SAS implementation in VA
Dedicated SAN
Additional SAS modules
SAS knowledge base SharePoint site
Dedicated SAS administrator

Other Data Analysis Tools
STATA
SPSS
R
Annotation & NLP tools
Terminology databases
Other software can be purchased as needed

Enhancements/New Features
New CDW data domains
Additional Raw data & standardization
Metadata & data profile reports
Cubes
Aggregate reports
Knowledge base
DOD research data sharing
Cohort selection tools
I2B2 pilot project

Poll Question
Do you plan on using VINCI data/services in the next:
6 months
Year
Not sure
Not planning on using VINCI

Questions?
Contact Information:
[email protected]
http://vaww.vinci.med.va.gov/vincicentral