39
CS6510 Applied Machine Learning Course Introduc;on 6 Aug 2016 Vineeth N Balasubramanian

Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

CS6510AppliedMachineLearning

CourseIntroduc;on

6Aug2016

VineethNBalasubramanian

Page 2: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Afewrecentquotes

5-Aug-16CS6510-AppliedMachineLearning

2

•  “AbreakthroughinmachinelearningwouldbeworthtenMicrosoGs”(BillGates,Chairman,MicrosoG)•  “MachinelearningisthenextInternet” (TonyTether,Director,DARPA)• Machinelearningisthehotnewthing” (JohnHennessy,President,Stanford)•  “WebrankingstodayaremostlyamaUerofmachinelearning”(PrabhakarRaghavan,ex-Dir.Research,Yahoo)•  “Machinelearningisgoingtoresultinarealrevolu;on”(GregPapadopoulos,CTO,Sun)•  “Machinelearningistoday’sdiscon;nuity” (JerryYang,ex-CEO,Yahoo)

Page 3: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

WhatisMachineLearning?

5-Aug-16CS6510-AppliedMachineLearning

3

Page 4: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

WhatisMachineLearning?

5-Aug-16CS6510-AppliedMachineLearning

4

• Makingpredic;onsordecisionsfromdata• “Programmingcomputerstoop;mizeaperformancecriterionusingexampledataorpastexperience”(EthemAlpaydin,MachineLearning,2010)• “AcomputerprogramissaidtolearnfromexperienceEwithrespecttosomeclassoftasksTandperformancemeasureP,ifitsperformanceattasksinT,asmeasuredbyP,improveswithexperienceE.”(TomMitchell,MachineLearning,1997)• “Learninggeneralmodelsfromadataofpar;cularexamples”• “Buildamodelthatisagoodandusefulapproxima1ontothedata.“

Page 5: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Today

5-Aug-16CS6510-AppliedMachineLearning

5

Source:Domingos

Tradi'onalProgramming

MachineLearning

ComputerData

ProgramOutput

ComputerData

OutputProgram

Page 6: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Magic?

5-Aug-16CS6510-AppliedMachineLearning

6

No,morelikegardening• Seeds=Algorithms• Nutrients=Data• Gardener=You• Plants=Programs

Source:Domingos

Page 7: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

RelatedTerms

5-Aug-16CS6510-AppliedMachineLearning

7

MachineLearning,DataMining,KnowledgeDiscovery,Ar;ficialIntelligence,Sta;s;calLearning,PaUernRecogni;on,

Computa;onalLearning

Page 8: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

WhenisMachineLearningUsed?

5-Aug-16CS6510-AppliedMachineLearning

8

• Humanexper;sedoesnotexist•  E.g.naviga;ngonMars

• Humansareunabletoexplaintheirexper;se•  E.g.speechrecogni;on

•  Solu;onchangesin;me•  E.g.rou;ngonacomputernetwork

•  Solu;onneedstobeadaptedtopar;cularcases•  E.g.userbiometrics

• Dataischeapandabundant;knowledgeisexpensiveandscarce

Page 9: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Applica;onsofMachineLearning

5-Aug-16CS6510-AppliedMachineLearning

9

Page 10: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Applica;onsofMachineLearning

5-Aug-16CS6510-AppliedMachineLearning

10

Page 11: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Applica;onsofMachineLearning

5-Aug-16CS6510-AppliedMachineLearning

11

Page 12: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Applica;onsofMachineLearning

5-Aug-16CS6510-AppliedMachineLearning

12

Page 13: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Applica;onsofMachineLearning

6-Aug-16CS6510-AppliedMachineLearning

13

Page 14: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MoreMLApplica;ons

6-Aug-16CS6510-AppliedMachineLearning

14

•  Science(Astronomy,neuroscience,medicalimaging,•  bio-informa;cs)•  Environment(energy,climate,weather,resources)•  Retail(Intelligentstockcontrol,demographicstore•  placement)•  Manufacturing(Intelligentcontrol,automatedmonitoring,•  detec;onmethods)•  Security(Intelligentsmokealarms,frauddetec;on)•  Marke;ng(promo;ons,...)•  Management(Scheduling,;metabling)•  Finance(creditscoring,riskanalysis...)•  Webdata(informa;onretrieval,informa;onextrac;on,...)

Page 15: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MoreRecentMLApplica;ons

6-Aug-16CS6510-AppliedMachineLearning

15

• AlphaGo!• Automa;ngEmployeeAccessControl• Iden;fyingwhalesinoceanbasedonaudiorecordings• Predictwait;mesforpa;entsinemergencyrooms• Extractheartfailurediagnosiscriteriafromfree-textphysiciannotes• Predic;nghospitalreadmissions• Is(s)heapyschopath?Source:hUp://www.forbes.com/sites/85broads/2014/01/06/six-novel-machine-learning-applica;ons/#6b6f9a9e67bf

Page 16: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

WhenareMLalgorithmsnotneeded?

6-Aug-16CS6510-AppliedMachineLearning

16

• Whentherela;onshipsbetweenallsystemvariables(input,output,andhidden)iscompletelyunderstood!

• ThisisNOTthecaseforalmostanyrealsystem!

Page 17: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

OverviewofML

6-Aug-16CS6510-AppliedMachineLearning

17

•  Supervisedlearning•  Predictanoutputywhengivenaninputx•  Forcategoricaly:classifica;on.•  Forreal-valuedy:regression.

• Unsupervisedlearning•  Createaninternalrepresenta;onoftheinput,e.g.clustering,dimensionality•  ThisisimportantinmachinelearningasgennglabelsisoGendifficultandexpensive

• OtherareasofML•  Learningtopredictstructuredobjects(e.g.,graphs,trees)•  Reinforcementlearning(learningfrom“rewards”)•  Semi-supervisedlearning(combinessupervised+unsupervised)

Page 18: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Classifica;on(SupervisedLearning)

6-Aug-16CS6510-AppliedMachineLearning

18

Page 19: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Classifica;on(SupervisedLearning)

6-Aug-16CS6510-AppliedMachineLearning

19

Page 20: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Regression(SupervisedLearning)

6-Aug-16CS6510-AppliedMachineLearning

20

Page 21: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Ranking(SupervisedLearning)

6-Aug-16CS6510-AppliedMachineLearning

21

Givenaqueryandasetofwebpages,rankthemaccordingtorelevance

• Otherapplica;ons•  Userpreference,e.g.Nerlix“MyList”--moviequeueranking•  Flightsearch(searchingeneral)•  …

Page 22: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Clustering(UnsupervisedLearning)

6-Aug-16CS6510-AppliedMachineLearning

22

Page 23: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

ReinforcementLearning

6-Aug-16CS6510-AppliedMachineLearning

23

•  Learningapolicy:Asequenceofoutputs• Nosupervisedoutputbutdelayedreward•  E.g.Gameplaying•  E.g.Robotinamaze

• Mul;pleagents,par;alobservability,...•  Examples:•  hUps://www.youtube.com/watch?v=DCjbk4m1G6I•  hUps://www.youtube.com/watch?v=VCdxqn0fcnE

Page 24: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

DimensionalityReduc;on

6-Aug-16CS6510-AppliedMachineLearning

24

•  Largesamplesizeisrequiredforhigh-dimensionaldata• Queryaccuracyandefficiencydegraderapidlyasthedimensionincreases•  Strategies•  Featurereduc;on•  Featureselec;on•  Manifoldlearning•  Kernellearning

Page 25: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MLProblems

6-Aug-16CS6510-AppliedMachineLearning

25

Page 26: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MLinPrac;ce

6-Aug-16CS6510-AppliedMachineLearning

26

• Understandingdomain,priorknowledge,andgoals• Dataintegra;on,selec;on,cleaning,pre-processing,etc.• Learningmodels• Interpre;ngresults• Consolida;nganddeployingdiscoveredknowledge• Loop

Page 27: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

TrainingandTes;ngMLModels

6-Aug-16CS6510-AppliedMachineLearning

27

Trainingset(observed)

Universalset(unobserved)

Tes;ngset(unobserved)

Dataacquisi;on Prac;calusage

Page 28: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

TrainingandTes;ngMLModels

6-Aug-16CS6510-AppliedMachineLearning

28

•  Trainingistheprocessofmakingthesystemabletolearn.• Nofreelunchrule:•  Trainingsetandtes;ngsetcomefromthesamedistribu;on•  Needtomakesomeassump;onsorbias

Page 29: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

TypesofModels

6-Aug-16CS6510-AppliedMachineLearning

29

• Induc;vevsTransduc;veLearning• OnlinevsOfflineLearning• Genera;vevsDiscrimina;veModels• ParametricvsNon-ParametricModels

Page 30: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MLDatasets

6-Aug-16CS6510-AppliedMachineLearning

30

• UCIRepository:hUp://www.ics.uci.edu/~mlearn/MLRepository.html• Statlib:hUp://lib.stat.cmu.edu/

Page 31: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

MLResources

6-Aug-16CS6510-AppliedMachineLearning

31

• MOOCs•  Coursera

• Conferences/Journals•  JMLR,MachineLearning,IEEETransac;onsonNeuralNetworksandLearningSystems,IEEETransac;onsonPaUernAnalysisandMachineIntelligence,AnnalsofSta;s;cs•  ICML,NIPS,ACMSIG-KDD,IJCAI,AAAI,ICDM

Page 32: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Mathema;calBasis

6-Aug-16CS6510-AppliedMachineLearning

32

• Func;ons,LogarithmsandExponen;als• Vectors,DotProducts,Orthogonality• Matrices,MatrixOpera;ons,LinearTransforma;ons,Eigendecomposi;on• Calculus,Differen;a;on,Integra;on• ProbabilityandSta;s;cs• Func;onalAnalysis,HilbertSpaces

Page 33: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

CourseDetails

6-Aug-16CS6510-AppliedMachineLearning

33

•  Timings/Loca'on:•  Sat9:30am–12:30pm•  Un;lmid-OctoberforEMDSstudents•  Block-ALH1(LH2occasionally,ifrequired)

•  Instructor:VineethNB•  Email:[email protected]•  Office:Block-E,324

•  TAs•  ArghyaPal,SupriyaPandhre,MausamJain,AkileshB,SahilManocha,HrishikeshVaidya

• Web:•  hUp://www.iith.ac.in/~vineethnb/teaching/f2016/cs6510-aml.htm•  WewillsoonhaveaGoogleClassroomportalforclassmanagement

Page 34: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

CourseTopics

6-Aug-16CS6510-AppliedMachineLearning

34

•  Introduc;ontoMachineLearning:OverviewandApplica;ons•  Classifica;onMethods• MachineLearningSystemDesign•  RegressionMethods•  ClusteringMethods• DimensionalityReduc;onMethods•  FeatureSelec;onMethods•  AnomalyDetec;onMethods•  AdvancedMethodsMethods

•  GraphicalModels,DeepLearning,NewSenngs(Ac;veLearning,TransferLearning,StructuredPredic;on,Mul;taskLearning,Mul;pleInstanceLearning

Page 35: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Topicsnottobecovered(likely)

6-Aug-16CS6510-AppliedMachineLearning

35

• DeepLearning(notinits“depth”,atleast)• BayesianNetworks(notindepth)• ReinforcementLearning• LearningTheory

Page 36: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

CourseEvalua;onRubric

6-Aug-16CS6510-AppliedMachineLearning

36

• 15%:Quizzes• 25%:Homework(Theory+Programming)• 30%:Compe;;veCodingAssignments(Challengestyle)• 30%:End-semesterexamina;on

•  5gracedaystostartwith–pleaseusethemwisely.Maynotapplytosomedeadlines,whichwillbepointedout.

•  Best80%(approx,tenta;ve)ofquizzeswillbeconsidered•  NotaUemp;ngtheend-semesterexamwillresultinanFR.

Page 37: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Programming

6-Aug-16CS6510-AppliedMachineLearning

37

• Python• Libraries•  Numpy,Scipy–numerical/scien;ficcompu;ng,linearalgebra•  Matplotlib–forplonng•  Scikitlearn–formachinelearning

Page 38: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

References

6-Aug-16CS6510-AppliedMachineLearning

38

• KeyReferences•  PaUernRecogni;onandMachineLearning,byChristopherBishop•  Ar;cles/Blogs/Papers/MOOCsonline

• OtherRecommendedReferences•  R.Duda,P.Hart&D.Stork,PaUernClassifica;on(2nded.)•  T.Mitchell,MachineLearning,

Appropriatereadingmaterialsforlectureswillbeposted

Page 39: Course Introduc;on - IIT Hyderabad › ~vineethnb › teaching › f2016 › ... · CS6510 - Applied Machine Learning 2 • “A breakthrough in machine learning would be worth ten

Homework

6-Aug-16CS6510-AppliedMachineLearning

39

• Gothroughremedialvideosformathfounda;onsat:•  hUps://www.youtube.com/channel/UC7gOYDYEgXG1yIH_rc2LgOw/playlists

• Programming•  LearnPython

•  hUps://try.jupyter.org/•  hUps://docs.python.org/3/tutorial/•  VideoTutorials:hUps://www.youtube.com/watch?v=cpPG0bKHYKc

•  Noteofcau;on:Python2.7vsPython3.4•  hUp://sebas;anraschka.com/Ar;cles/2014_python_2_3_key_diff.html

•  PlaywithNumpy