Page 1: Tour Anaconda Enterprise Without Leaving Your Desk

© 2016 Continuum Analytics - Confidential & Proprietary

Tour Anaconda Enterprise Without Leaving Your Desk
Accelerate. Connect. Empower.

Ian Stokes-Rees
Computational Scientist

December 15, 2016

Page 2: Tour Anaconda Enterprise Without Leaving Your Desk

Join us for the inaugural AnacondaCON

Discover What #OpenDataScience Means

http://anacondacon17.io

Speakers from industry, government, academia

Demos, BoFs, Panels, Exhibits, Partner Showcase

Page 3: Tour Anaconda Enterprise Without Leaving Your Desk

Ian Stokes-Rees @ijstokes
Computational Scientist, Continuum Analytics

• Ph.D. at the University of Oxford, working on the CERN LHCb particle physics experiment

• Harvard University, working on computational techniques for protein structure determination

• Joined Continuum Analytics in 2013

• Greatest interest: enabling communication, collaboration and discovery using high performance computing infrastructure

Page 4: Tour Anaconda Enterprise Without Leaving Your Desk

Agenda

• Intro to Anaconda Enterprise Notebooks for effortless collaboration

• Publish analytics using Anaconda Enterprise

• Discover deep insights with interactive visualizations

• Communicate work to all levels through interactive visualizations

• Deploy data science with Anaconda and engage your team with intuitive and relevant data science narratives

• Q&A

Page 5: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Distribution

Page 6: Tour Anaconda Enterprise Without Leaving Your Desk

ANACONDA Accelerates Adoption of Open Data Science for Enterprises

• Easy to install

• Agile data exploration

• Powerful data analysis

• Simple to collaborate

• Accessible to everyone

PYTHON & R OPEN SOURCE ANALYTICS

NumPy, SciPy, Pandas, Scikit-learn, Jupyter/IPython, Numba, Matplotlib, Spyder, Numexpr, Cython, Theano, Scikit-image, NLTK, NetworkX, IRKernel, dplyr, shiny, ggplot2, tidyr, caret, PySpark & 720+ packages

Page 7: Tour Anaconda Enterprise Without Leaving Your Desk

221 W. 6th Street, Suite #1550, Austin, TX 78701
+1 512.222.5440

@ContinuumIO

Full Featured Analytics Platform

Page 8: Tour Anaconda Enterprise Without Leaving Your Desk

Hundreds of Analytics Tools - Integrated

Page 9: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Distribution Promise

Free for everyone
• Individuals
• Government
• Commercial
• Students

Free for any use
• Educational
• Research
• Application embedding
• Commercial

Free forever
• No time limits
• No trials
• No license files
• No expiry

Page 10: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Solves Many Analytics Problems

• Deployment: Windows, Mac, Linux
• Reproducibility
• Extensibility and Flexibility
• Over 100,000 Conda packages available today
• Multi-language: Python, R, Scala, Julia and more
• Widely used:
  • Hundreds of companies
  • Millions of users
  • Millions of annual downloads
• Analytics sandboxes without VMs or containers

Page 11: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Distribution

Who is it for?
• Single user
• Single system
• Unrestricted access to public Internet
• Access to Anaconda Cloud

When do you need Anaconda Enterprise?
• Multiple users
• Collaboration
• Compute clusters
• Hadoop
• On-premises package mirror
• Private package repository

Page 12: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Enterprise

Page 13: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Platform

Page 14: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Enterprise Architecture

[Architecture diagram. Labels shown:]

• Data Scientist (Mac)
• Business Analyst (Win)
• DevOps Engineer (Linux)
• Anaconda Enterprise Notebook Server
• Web Interface
• Computation
• Internal Anaconda Repository
• Package Control
• Authentication
• Active Directory / LDAP (optional)
• Data Lab: shared analytics cluster
• Production analytics cluster
• Publish
• Fetch

Page 16: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Enhanced Jupyter Notebooks

• Data lineage
• Interactive visualizations
• Advanced notebook extensions

Page 17: Tour Anaconda Enterprise Without Leaving Your Desk

Excel + Python + Jupyter

Page 18: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Fusion Excel Integration

BRING interactive visualizations, machine learning and ETL to Excel

BRIDGE Excel Data to Python & R through notebooks

ACCESS all the power of Python and Big Data, natively embedded inside Excel

Anaconda Fusion brings Open Data Science to Microsoft Excel

Page 19: Tour Anaconda Enterprise Without Leaving Your Desk

Parallel Data Processing

Page 20: Tour Anaconda Enterprise Without Leaving Your Desk

Dask: Parallel Data Processing

• Parallel and distributed Pandas and NumPy
• Low latency workflow manager
• Graphical tools
• Simple APIs
• Extensible and generalizable to other data structures
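
To make the Dask bullets concrete, here is a minimal sketch (not taken from the webinar) that uses two public Dask interfaces: `dask.array` for chunked, NumPy-style computation and `dask.delayed` for building a small task graph. The array shape, chunk size, and the helper functions `square` and `add` are illustrative assumptions.

```python
import dask
import dask.array as da

# NumPy-like API: a 2000 x 2000 array split into 500 x 500 chunks.
# Operations build a lazy task graph; nothing runs until .compute().
x = da.ones((2000, 2000), chunks=(500, 500))
total = (x + x.T).sum()
array_sum = float(total.compute())  # chunks are processed in parallel

# Workflow-style API: wrap ordinary functions as lazy tasks.
# `square` and `add` are hypothetical helpers for illustration only.
@dask.delayed
def square(n):
    return n * n

@dask.delayed
def add(a, b):
    return a + b

graph = add(square(3), square(4))  # still lazy: only a graph so far
task_result = graph.compute()      # runs the two squares, then the add
```

The same code runs on a laptop with the default local scheduler; pointing Dask at a distributed scheduler is a deployment choice and is not shown here.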

Page 22: Tour Anaconda Enterprise Without Leaving Your Desk

Interactive Data Visualization Apps

Page 23: Tour Anaconda Enterprise Without Leaving Your Desk

Interactive Data Visualization

• Interactive viz, widgets, and tools
• Versatile high level graphics
• Streaming, dynamic, large data
• Optimized for the browser
• No JavaScript
• With or without a server

Page 24: Tour Anaconda Enterprise Without Leaving Your Desk

Rapid Prototyping Visual Apps

• Python interface
• R interface
• Smart plotting

Page 25: Tour Anaconda Enterprise Without Leaving Your Desk

Geoviews and Datashader

Page 26: Tour Anaconda Enterprise Without Leaving Your Desk

Datashader: Rendering a Billion Points of Data

• datashader provides a fast, configurable visualization pipeline for faithfully revealing even very large datasets

• Each of these visualizations requires just a few lines of code and no magic numbers to adjust by trial and error.

Page 27: Tour Anaconda Enterprise Without Leaving Your Desk

Datashader

Page 28: Tour Anaconda Enterprise Without Leaving Your Desk

Anaconda Accelerate

Page 29: Tour Anaconda Enterprise Without Leaving Your Desk

GPU Acceleration in Python

Fast algorithms for NVIDIA GPUs
Linear algebra, FFTs, sorting, random number generation

Data profiling in Jupyter Notebooks
Track what size and type of data is being passed through your algorithm for better optimization decision-making

Works with Numba
Designed to be used in conjunction with the Numba Python compiler for CPUs and GPUs

Page 30: Tour Anaconda Enterprise Without Leaving Your Desk

Linear Algebra on the GPU with Anaconda Accelerate

Double precision matrix-matrix multiplication
Intel Core i7-4820K 3.70GHz CPU vs. NVIDIA Tesla K20c

Page 31: Tour Anaconda Enterprise Without Leaving Your Desk

Data Profiling

Page 32: Tour Anaconda Enterprise Without Leaving Your Desk

Data Science Workflows

Page 33: Tour Anaconda Enterprise Without Leaving Your Desk

Laptop to Cluster

Page 34: Tour Anaconda Enterprise Without Leaving Your Desk

Analytics artifact repository

Page 35: Tour Anaconda Enterprise Without Leaving Your Desk

From Data Lab to Production Analytics

Page 36: Tour Anaconda Enterprise Without Leaving Your Desk

Questions?

Page 37: Tour Anaconda Enterprise Without Leaving Your Desk

Next Steps

• Try it out yourself
Sign up for an Anaconda Enterprise Test Drive: know.continuum.io/Anaconda-Enterprise-Test-Drive.html

• Meet up with other thought leaders like you
Register for AnacondaCON – February 7-9, 2017: anacondacon17.io

• Learn more about the Anaconda Platform
Check out the “Resources” tab for webinars, whitepapers and more: continuum.io/

Page 38: Tour Anaconda Enterprise Without Leaving Your Desk

Continuum Analytics
We empower data science teams to make the world a better place

221 W. 6th Street, Suite #1550, Austin, TX 78701
+1 512.222.5400