Upload
others
View
6
Download
0
Embed Size (px)
Citation preview
CS6510AppliedMachineLearning
CourseIntroduc;on
6Aug2016
VineethNBalasubramanian
Afewrecentquotes
5-Aug-16CS6510-AppliedMachineLearning
2
• “AbreakthroughinmachinelearningwouldbeworthtenMicrosoGs”(BillGates,Chairman,MicrosoG)• “MachinelearningisthenextInternet” (TonyTether,Director,DARPA)• Machinelearningisthehotnewthing” (JohnHennessy,President,Stanford)• “WebrankingstodayaremostlyamaUerofmachinelearning”(PrabhakarRaghavan,ex-Dir.Research,Yahoo)• “Machinelearningisgoingtoresultinarealrevolu;on”(GregPapadopoulos,CTO,Sun)• “Machinelearningistoday’sdiscon;nuity” (JerryYang,ex-CEO,Yahoo)
WhatisMachineLearning?
5-Aug-16CS6510-AppliedMachineLearning
3
WhatisMachineLearning?
5-Aug-16CS6510-AppliedMachineLearning
4
• Makingpredic;onsordecisionsfromdata• “Programmingcomputerstoop;mizeaperformancecriterionusingexampledataorpastexperience”(EthemAlpaydin,MachineLearning,2010)• “AcomputerprogramissaidtolearnfromexperienceEwithrespecttosomeclassoftasksTandperformancemeasureP,ifitsperformanceattasksinT,asmeasuredbyP,improveswithexperienceE.”(TomMitchell,MachineLearning,1997)• “Learninggeneralmodelsfromadataofpar;cularexamples”• “Buildamodelthatisagoodandusefulapproxima1ontothedata.“
Today
5-Aug-16CS6510-AppliedMachineLearning
5
Source:Domingos
Tradi'onalProgramming
MachineLearning
ComputerData
ProgramOutput
ComputerData
OutputProgram
Magic?
5-Aug-16CS6510-AppliedMachineLearning
6
No,morelikegardening• Seeds=Algorithms• Nutrients=Data• Gardener=You• Plants=Programs
Source:Domingos
RelatedTerms
5-Aug-16CS6510-AppliedMachineLearning
7
MachineLearning,DataMining,KnowledgeDiscovery,Ar;ficialIntelligence,Sta;s;calLearning,PaUernRecogni;on,
Computa;onalLearning
WhenisMachineLearningUsed?
5-Aug-16CS6510-AppliedMachineLearning
8
• Humanexper;sedoesnotexist• E.g.naviga;ngonMars
• Humansareunabletoexplaintheirexper;se• E.g.speechrecogni;on
• Solu;onchangesin;me• E.g.rou;ngonacomputernetwork
• Solu;onneedstobeadaptedtopar;cularcases• E.g.userbiometrics
• Dataischeapandabundant;knowledgeisexpensiveandscarce
Applica;onsofMachineLearning
5-Aug-16CS6510-AppliedMachineLearning
9
Applica;onsofMachineLearning
5-Aug-16CS6510-AppliedMachineLearning
10
Applica;onsofMachineLearning
5-Aug-16CS6510-AppliedMachineLearning
11
Applica;onsofMachineLearning
5-Aug-16CS6510-AppliedMachineLearning
12
Applica;onsofMachineLearning
6-Aug-16CS6510-AppliedMachineLearning
13
MoreMLApplica;ons
6-Aug-16CS6510-AppliedMachineLearning
14
• Science(Astronomy,neuroscience,medicalimaging,• bio-informa;cs)• Environment(energy,climate,weather,resources)• Retail(Intelligentstockcontrol,demographicstore• placement)• Manufacturing(Intelligentcontrol,automatedmonitoring,• detec;onmethods)• Security(Intelligentsmokealarms,frauddetec;on)• Marke;ng(promo;ons,...)• Management(Scheduling,;metabling)• Finance(creditscoring,riskanalysis...)• Webdata(informa;onretrieval,informa;onextrac;on,...)
MoreRecentMLApplica;ons
6-Aug-16CS6510-AppliedMachineLearning
15
• AlphaGo!• Automa;ngEmployeeAccessControl• Iden;fyingwhalesinoceanbasedonaudiorecordings• Predictwait;mesforpa;entsinemergencyrooms• Extractheartfailurediagnosiscriteriafromfree-textphysiciannotes• Predic;nghospitalreadmissions• Is(s)heapyschopath?Source:hUp://www.forbes.com/sites/85broads/2014/01/06/six-novel-machine-learning-applica;ons/#6b6f9a9e67bf
WhenareMLalgorithmsnotneeded?
6-Aug-16CS6510-AppliedMachineLearning
16
• Whentherela;onshipsbetweenallsystemvariables(input,output,andhidden)iscompletelyunderstood!
• ThisisNOTthecaseforalmostanyrealsystem!
OverviewofML
6-Aug-16CS6510-AppliedMachineLearning
17
• Supervisedlearning• Predictanoutputywhengivenaninputx• Forcategoricaly:classifica;on.• Forreal-valuedy:regression.
• Unsupervisedlearning• Createaninternalrepresenta;onoftheinput,e.g.clustering,dimensionality• ThisisimportantinmachinelearningasgennglabelsisoGendifficultandexpensive
• OtherareasofML• Learningtopredictstructuredobjects(e.g.,graphs,trees)• Reinforcementlearning(learningfrom“rewards”)• Semi-supervisedlearning(combinessupervised+unsupervised)
Classifica;on(SupervisedLearning)
6-Aug-16CS6510-AppliedMachineLearning
18
Classifica;on(SupervisedLearning)
6-Aug-16CS6510-AppliedMachineLearning
19
Regression(SupervisedLearning)
6-Aug-16CS6510-AppliedMachineLearning
20
Ranking(SupervisedLearning)
6-Aug-16CS6510-AppliedMachineLearning
21
Givenaqueryandasetofwebpages,rankthemaccordingtorelevance
• Otherapplica;ons• Userpreference,e.g.Nerlix“MyList”--moviequeueranking• Flightsearch(searchingeneral)• …
Clustering(UnsupervisedLearning)
6-Aug-16CS6510-AppliedMachineLearning
22
ReinforcementLearning
6-Aug-16CS6510-AppliedMachineLearning
23
• Learningapolicy:Asequenceofoutputs• Nosupervisedoutputbutdelayedreward• E.g.Gameplaying• E.g.Robotinamaze
• Mul;pleagents,par;alobservability,...• Examples:• hUps://www.youtube.com/watch?v=DCjbk4m1G6I• hUps://www.youtube.com/watch?v=VCdxqn0fcnE
DimensionalityReduc;on
6-Aug-16CS6510-AppliedMachineLearning
24
• Largesamplesizeisrequiredforhigh-dimensionaldata• Queryaccuracyandefficiencydegraderapidlyasthedimensionincreases• Strategies• Featurereduc;on• Featureselec;on• Manifoldlearning• Kernellearning
MLProblems
6-Aug-16CS6510-AppliedMachineLearning
25
MLinPrac;ce
6-Aug-16CS6510-AppliedMachineLearning
26
• Understandingdomain,priorknowledge,andgoals• Dataintegra;on,selec;on,cleaning,pre-processing,etc.• Learningmodels• Interpre;ngresults• Consolida;nganddeployingdiscoveredknowledge• Loop
TrainingandTes;ngMLModels
6-Aug-16CS6510-AppliedMachineLearning
27
Trainingset(observed)
Universalset(unobserved)
Tes;ngset(unobserved)
Dataacquisi;on Prac;calusage
TrainingandTes;ngMLModels
6-Aug-16CS6510-AppliedMachineLearning
28
• Trainingistheprocessofmakingthesystemabletolearn.• Nofreelunchrule:• Trainingsetandtes;ngsetcomefromthesamedistribu;on• Needtomakesomeassump;onsorbias
TypesofModels
6-Aug-16CS6510-AppliedMachineLearning
29
• Induc;vevsTransduc;veLearning• OnlinevsOfflineLearning• Genera;vevsDiscrimina;veModels• ParametricvsNon-ParametricModels
MLDatasets
6-Aug-16CS6510-AppliedMachineLearning
30
• UCIRepository:hUp://www.ics.uci.edu/~mlearn/MLRepository.html• Statlib:hUp://lib.stat.cmu.edu/
MLResources
6-Aug-16CS6510-AppliedMachineLearning
31
• MOOCs• Coursera
• Conferences/Journals• JMLR,MachineLearning,IEEETransac;onsonNeuralNetworksandLearningSystems,IEEETransac;onsonPaUernAnalysisandMachineIntelligence,AnnalsofSta;s;cs• ICML,NIPS,ACMSIG-KDD,IJCAI,AAAI,ICDM
Mathema;calBasis
6-Aug-16CS6510-AppliedMachineLearning
32
• Func;ons,LogarithmsandExponen;als• Vectors,DotProducts,Orthogonality• Matrices,MatrixOpera;ons,LinearTransforma;ons,Eigendecomposi;on• Calculus,Differen;a;on,Integra;on• ProbabilityandSta;s;cs• Func;onalAnalysis,HilbertSpaces
CourseDetails
6-Aug-16CS6510-AppliedMachineLearning
33
• Timings/Loca'on:• Sat9:30am–12:30pm• Un;lmid-OctoberforEMDSstudents• Block-ALH1(LH2occasionally,ifrequired)
• Instructor:VineethNB• Email:[email protected]• Office:Block-E,324
• TAs• ArghyaPal,SupriyaPandhre,MausamJain,AkileshB,SahilManocha,HrishikeshVaidya
• Web:• hUp://www.iith.ac.in/~vineethnb/teaching/f2016/cs6510-aml.htm• WewillsoonhaveaGoogleClassroomportalforclassmanagement
CourseTopics
6-Aug-16CS6510-AppliedMachineLearning
34
• Introduc;ontoMachineLearning:OverviewandApplica;ons• Classifica;onMethods• MachineLearningSystemDesign• RegressionMethods• ClusteringMethods• DimensionalityReduc;onMethods• FeatureSelec;onMethods• AnomalyDetec;onMethods• AdvancedMethodsMethods
• GraphicalModels,DeepLearning,NewSenngs(Ac;veLearning,TransferLearning,StructuredPredic;on,Mul;taskLearning,Mul;pleInstanceLearning
Topicsnottobecovered(likely)
6-Aug-16CS6510-AppliedMachineLearning
35
• DeepLearning(notinits“depth”,atleast)• BayesianNetworks(notindepth)• ReinforcementLearning• LearningTheory
CourseEvalua;onRubric
6-Aug-16CS6510-AppliedMachineLearning
36
• 15%:Quizzes• 25%:Homework(Theory+Programming)• 30%:Compe;;veCodingAssignments(Challengestyle)• 30%:End-semesterexamina;on
• 5gracedaystostartwith–pleaseusethemwisely.Maynotapplytosomedeadlines,whichwillbepointedout.
• Best80%(approx,tenta;ve)ofquizzeswillbeconsidered• NotaUemp;ngtheend-semesterexamwillresultinanFR.
Programming
6-Aug-16CS6510-AppliedMachineLearning
37
• Python• Libraries• Numpy,Scipy–numerical/scien;ficcompu;ng,linearalgebra• Matplotlib–forplonng• Scikitlearn–formachinelearning
References
6-Aug-16CS6510-AppliedMachineLearning
38
• KeyReferences• PaUernRecogni;onandMachineLearning,byChristopherBishop• Ar;cles/Blogs/Papers/MOOCsonline
• OtherRecommendedReferences• R.Duda,P.Hart&D.Stork,PaUernClassifica;on(2nded.)• T.Mitchell,MachineLearning,
Appropriatereadingmaterialsforlectureswillbeposted
Homework
6-Aug-16CS6510-AppliedMachineLearning
39
• Gothroughremedialvideosformathfounda;onsat:• hUps://www.youtube.com/channel/UC7gOYDYEgXG1yIH_rc2LgOw/playlists
• Programming• LearnPython
• hUps://try.jupyter.org/• hUps://docs.python.org/3/tutorial/• VideoTutorials:hUps://www.youtube.com/watch?v=cpPG0bKHYKc
• Noteofcau;on:Python2.7vsPython3.4• hUp://sebas;anraschka.com/Ar;cles/2014_python_2_3_key_diff.html
• PlaywithNumpy