21
© 2015 IHS. ALL RIGHTS RESERVED. KNOWLEDGE ARCHITECTURE AND BIG DATA How to Apply Knowledge Architecture to Big Data David Meza Chief Knowledge Architect NASA Johnson Space Center Federal Reserve June 15, 2016

KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

©2015IHS.ALLRIGHTSRESERVED.

KNOWLEDGEARCHITECTUREANDBIGDATA

HowtoApplyKnowledgeArchitecturetoBigData

DavidMezaChiefKnowledgeArchitectNASAJohnsonSpaceCenter

FederalReserveJune15,2016

Page 2: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

AGENDA

•  KnowledgeArchitecture•  NASADataStrategy•  CogniPveCompuPng

2

Page 3: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

“ThemostimportantcontribuPonmanagementneedstomakeinthe21stCenturyistoincreasetheproducPvityofknowledgeworkandtheknowledgeworker.”PETERF.DRUCKER,1999

Page 4: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

ToconvertdatatoknowledgeaconvergenceofKnowledgeManagement,InformaPonArchitectureandDataScienceisnecessary.

4

KnowledgeManagement

DataScienceInformaPonArchitecture

Page 5: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

KnowledgeArchitecture•  Thepeople,processes,andtechnologyofdesigning,implemenPng,andapplying

theintellectualinfrastructureoforganizaPons.

•  Whatisanintellectualinfrastructure?

•  ThesetofacPviPestocreate,capture,organize,analyze,visualize,present,

anduPlizetheinformaPonpartoftheinformaPonage..

•  InformaPon+Contexts=Knowledge

•  InformaPonArchitecture+KnowledgeManagement+DataScience=Knowledge

Architecture

•  KMwithoutapplicaPonsisempty(StrategyOnly)

•  ApplicaPonswithoutKAareblind(ITbasedKM)

•  DataSciencetransformyourdatatoknowledge

5

Page 6: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

KnowledgeManagement"Knowledgemanagementistheprocessofcapturing,distribuPng,andeffecPvely

usingknowledge.”

ThisdefiniPonhasthevirtueofbeingsimple,stark,andtothepoint.Afewyearslater,the

GartnerGroupcreatedanotherseconddefiniPonofKM,whichisperhapsthemostfrequently

citedone(Duhon,1998):

"Knowledgemanagementisadisciplinethatpromotesanintegratedapproachto

idenPfying,capturing,evaluaPng,retrieving,andsharingallofanenterprise's

informaPonassets.Theseassetsmayincludedatabases,documents,policies,

procedures,andpreviouslyun-capturedexperPseandexperienceinindividual

workers.”

6

Page 7: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

InformaPonArchitectureTheintentistoachieveavarietyofcapabiliPestoenabletheAgencytoefficiently

acquireorgenerate,findandaccess,useandreuse,shareandexchange,manageand

govern,andstoreandrePreourdata.

7

Page 8: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

DataScienceDatascienceisaninterdisciplinaryfieldaboutprocessesandsystemstoextract

knowledgeorinsightsfromdatainvariousforms,eitherstructuredorunstructured,

whichisaconPnuaPonofsomeofthedataanalysisfieldssuchasstaPsPcs,data

mining,andpredicPveanalyPcs,similartoKnowledgeDiscoveryinDatabases(KDD).TheKnowledgeDiscoveryinDatabases(KDD)processiscommonlydefinedwiththestages:(1)SelecPon(2)Pre-processing(3)TransformaPon(4)DataMining(5)InterpretaPon/EvaluaPon.

8

Page 9: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

DataStrategy

9

Key Recommendations : •  Data Management •  Unified Data Lifecycle •  Data Governance •  Data Analytics Lab •  Data Fellows Program •  Data Stewards

Page 10: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

DataStrategyFramework

10

Challenge Example Opportunity RecommendaEonLackofanexplicitdatamanagementframework,fragmenteddatalifecycleandlackofdataintegraPon

NoAgency-widearchitectureandstandardsforinformaPoninteroperability.MuchofthedataNASAproducesisinaccessibleorhuman-readableonly,withnomethodtodraw-in,parse,organize,ormakeuseofthisdata.

Improvedarchitecture,standardsandaccessibilitypermimngquickerandmoreeffecPvecollecPon,digiPzaPonanddiscovery;increasedfocusonmission-specificdataneedsandtype-specificapproaches

1.  DataManagement2.  UnifiedDataLifecycle3.  DataGovernanceProgram

NeedfornewemergingdataanalyPcstechnologiesandcapabiliPestoaddressmissionspecificchallenges

ManyofNASA’scurrentdatasystemsaresignificantlyoutdatedandcannotscaletomeetdemand.

ExperimenPngwithnewalgorithms,applicaPons,andtechniques

4.DataAnalyPcsLab

DataexperPsegap DatascienPstsareinlowsupplyandhighdemand,andNASAwillneedtocompetewithindustrytoapractthebest&brightest.

CollaboraPvepartnershipstobuildinternalcapacityandexperPseanduPlizeexternaltalent,tools,andinformaPon

5.DataFellowsProgram

NeedtoeffecPvelyaddresscultureandpolicyissuesalongsidetechnology

Inmanycases,individualsarenotmoPvatedtosharedataforcollaboraPveusewithothers.

Increasedcross-agencyandcross-stakeholderownershipandapproachtodatamanagementanddataanalyPcschallenges

6.DataStewards

Page 11: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

KNOWLEDGEARCHITECTURE–ANALYTICSFRAMEWORK

11

IT&IntellectualInfrastructure

Security,DataQuality,WorkflowManagement,DataManagement,ResourceManagement

DataProducts:•  PredicPons•  Models•  VisualizaPons•  DecisionAnalysis•  Wiki

Sources:•  Sensor•  Experimental•  Computed

(modeling&simulaPon)

Forms:•  Digital•  Text•  VisualOrganizaPon:•  Structured•  Semi-Structured•  Unstructured

FuncPons:•  Governance•  Taxonomy•  Ontology•  Comm.Plan•  OperaPons

Management•  Security•  MasterData

Management•  Content

Management•  Metadata•  DataQuality

Tools&Environments:•  Largescalestorage•  RDBMS•  ParallelRDBMS•  NOSQL•  HadoopOrganizaPon:•  Structured•  Semi-Structured•  Unstructured

Tools&Environments:•  ComputaPon&data

access•  DataMining•  TextMining•  OpPmizaPon•  NetAlgorithm•  NewAlgorithm•  VisualizaPonAccessPapern:•  Structured•  Semi-Structured•  Unstructured•  Predictable•  Unpredictable

DataAcquisiPon&CreaPon

DataManagement

DataWarehousing

DataAnalyPcs,BI

(KnowledgeExtracPon)

KnowledgePresentaPon

andVisualizaPon

Source User

Page 12: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

“Wehaveanopportunityforeveryoneintheworldtohaveaccesstoalltheworld’sinformaPon.Thishasneverbeforebeenpossible.WhyisubiquitousinformaPonsoprofound?Itisatremendousequalizer.InformaPonispower.”ERICSCHMIDT(FORMERCEOOFGOOGLE)

Page 13: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

30%oftotalR&DspendiswastedduplicaPngresearchandworkpreviouslydone.Source:Na+onalBoardofPatentsandRegistra+on(PRH),WIPO,IFA

54%ofdecisionsaremadewithincomplete,inconsistentandinadequateinformaPonSource:InfoCentricResearch

46%Workerscan’tfindtheinformaPontheyneedalmosthalfthePme.Source:IDC

Page 14: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

KnowledgeArchitecture:TheNextPhase

14

Page 15: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

15

Page 16: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

16

Page 17: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

17

PushversusPull

Page 18: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

18

WHATCOULDYOUACCOMPLISHIFYOUCOULD:

•  Empowerfasterandmoreinformeddecision-making

•  Leveragelessonsofthepasttominimizewaste,rework,re-invenPonandredundancy

•  Reducethelearningcurvefornewemployees

•  EnhanceandextendexisPngcontentanddocumentmanagementsystems

Page 19: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

19

JSCKnowledgeArchitectureServices:§  AnalyPcs

§  WebPlauormforAnalysisandVisualizaPon

§  NOSQL-Neo4jandMongoDB

§  VisualizaPonServices-BusinessIntelligence

§  RepositorySpecificSearch

§  WikiFarm

§  CodeSharingandProjectcollaboraPon

§  Training

Page 20: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

Contact Information

David Meza – [email protected]

Twitter - @davidmeza1

Linkedin - hpps://www.linkedin.com/pub/david-meza/16/543/50b

Github – davidmeza1

Blog davidmeza1.github.io

20

Page 21: KNOWLEDGE ARCHITECTURE AND BIG DATA - …files.meetup.com/19117935/NASA - Knowledge Architecture...data needs and type-specific approaches 1. Data Management 2. Unified Data Lifecycle

Contents

©2015IHS.ALLRIGHTSRESERVED. 21ReportName/Month2015

QUESTIONS?