22
SILVERCHAIR Content Preparation and Delivery to Support Artificial Intelligence Jake Zarnegar Chief Product Officer, Silverchair

Zarneger Content Preparation and Delivery to Support AI

Embed Size (px)

Citation preview

Page 1: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

ContentPreparationandDeliverytoSupportArtificialIntelligence

JakeZarnegarChiefProductOfficer,Silverchair

Page 2: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Page 3: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Muchlike“thecloud,”“bigdata,”and“machinelearning”beforeit,theterm“artificialintelligence”hasbeenhijackedbymarketersandadvertisingcopywriters.

Ifthehypeleavesyouasking“WhatisA.I.,really?,”don’tworry,you’renotalone.Iaskedvariousexpertstodefinetheterm andgotdifferentanswers.Theonlythingtheyallseemtoagreeonisthatartificialintelligence isasetoftechnologiesthattry toimitateoraugmenthumanintelligence.

Tome,theemphasisisonaugmentation,inwhich intelligentsoftwarehelpsusinteractanddealwiththeincreasinglydigitalworldwelivein.

OmMalik,TheNewYorkerhttp://www.newyorker.com/business/currency/the-hype-and-hope-of-artificial-intelligence

Page 4: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

GeneralAugmentation

http://www.cnn.com/2017/01/26/health/ai-system-detects-skin-cancer-study/

SpecificAugmentation

TwoTypesofAugmentation

Page 5: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

1:OngoingIndependentLearningFromInteractionwithStimuli

2:ComplexInteractionwithHumans

TwoAI“Augmentation”ConditionsThatMostAgreeOn

Page 6: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Howcanscientificandscholarlypublishersbestprepareanddelivercontenttoassist(andnothinder)

theadvancementofAIsystems?

Today’sQuestion

Page 7: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

KnowledgeDiscoveryinDatabases(KDD)[1]isdividedinfourmainphases:domainexploration,datapreparation,datamining,andinterpretationofresults.

1. Thefirstphaseisresponsibleforunderstandingtheproblemandwhatdatawillbeusedintheknowledgediscoveryprocess.

2. Thenextphaseselects,cleans,andtransformsthedatatoaformatthatissuitableforaspecificdataminingalgorithm.

3. Inthethirdphase,thechosendataminingalgorithmperformssomeintelligenttechniquestodiscoverpatternsthatcanbeofpotentialuse.

4. Thelastphaseisresponsibleformanipulatingtheextractedpatternstogenerateinterpretableknowledgeforhumans…

Page 8: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

…Mostoftheresearchcarriedoutinthisareafocusonthedataminingphase,whichusesartificialintelligencealgorithmslikedecisiontrees,artificialneuralnetworks,evolutionarycomputation,amongothers[2]todiscoverknowledge.Ontheotherhand, thedatapreparationphase,responsibleforintegration,cleaning,andtransformationofdata,hasnotbeenthesubjectofmuchresearch.Infact,Pyle[3]arguesthat “datapreparationconsumes60to90%ofthetimeneededtominedata– andcontributes75to90%totheminingproject’ssuccess”.

From: PauloM.Goncalves Jr.and RobertoS.M.Barros, "AutomatingDataPreprocessingwithDMPMLandKDDML," 10thIEEE/ACISInternationalConferenceonComputerandInformationScience,2011, DOI:10.1109/ICIS.2011.23.

Page 9: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

“Idownloaded2TBofArxiv contentlastweekbutI can’tbringmyselftoopenitandstartworkingonanalyzingitbecauseIknowIhaveatleast6monthsofpainstakingdatacleanup&preparationaheadofmebeforeIcanbegin.”

--MikeM.,FastForwardLabs

Page 10: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

WhereWeAreNow:ApplyingtheLevelsofCognitiveLearningtoSoftware

Page 11: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR Bloom, et al. 1956

Page 12: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• We’vemasteredthis!• Thefundamentalsofthe

permanentscholarlyrecord(DOI,CLOCKSS,PDF,etc.)

Page 13: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Alsostrongincreatinginterfacesthatassistunderstandingfromhumanreaders

Page 14: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

FosteringSoftwareUnderstanding

Insomewayswe'vegotagoodfoundation– detailed,consistentcontenttaggingtoaidwithsoftwareUnderstanding

• StructureunderstandingthroughnormalizedXML:whatisthetitle,authors,abstract, wheretheconclusionsareinthepaper,etc.

Increasedtaggingofnamedentities:understandingwhatisagene,whatisaclinicaltrialID,whatisaperson

• Thiscanstillgoawry: "BethIsrael,""BethIsraelDeaconess"examples

Page 15: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

http://anesthesiology.pubs.asahq.org/article.aspx?articleid=2592740

https://academic.oup.com/rheumatology/article/doi/10.1093/rheumatology/kex082/3101351/Musculoskeletal-manifestations-of-Ebola-virus

Page 16: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

1:Interfacesstillprimarilyvisual,narrative

2:HelpfulunderlyingXMLstructurenotshared

3:Littletonotaggingabove“Understanding”level

3ObstaclestoHigherSoftwareCognition

Page 17: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

WhereWe’reGoing:TheRacetotheTop

Page 18: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR Bloom, et al. 1956

Page 19: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Full-textnormalizedXML(orJSON)• Separateproduct/subscriptionforsale• Separatedeliverymechanism(nohumaninterface)butcan

piggybackonexistingcontentworkflows• Accessesanewclassofcustomerw/deeppockets

(AIcreatorsorimplementers)• Requiresnewvetting/legalagreements

ConsiderProvidingYourStructuredContentasaNewProduct

Page 20: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

…Mostoftheresearchcarriedoutinthisareafocusonthedataminingphase,whichusesartificialintelligencealgorithmslikedecisiontrees,artificialneuralnetworks,evolutionarycomputation,amongothers[2]todiscoverknowledge.Ontheotherhand, thedatapreparationphase,responsibleforintegration,cleaning,andtransformationofdata,hasnotbeenthesubjectofmuchresearch.Infact,Pyle[3]arguesthat “datapreparationconsumes60to90%ofthetimeneededtominedata– andcontributes75to90%totheminingproject’ssuccess”.

From: PauloM.Goncalves Jr.and RobertoS.M.Barros, "AutomatingDataPreprocessingwithDMPMLandKDDML," 10thIEEE/ACISInternationalConferenceonComputerandInformationScience,2011, DOI:10.1109/ICIS.2011.23.

Page 21: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Developyourownsoftware(ordevelopanalysis,applicationandevaluation)higherupthecognitionpyramid

• Ifthat’sthecase,don’tshareyourstructuredcontentwithpotentialcompetitors

OrConsiderCompetingDirectly!

Page 22: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

ThankYou

JakeZarnegarChiefProductOfficer,Silverchair