29
Open Platform for Next-Gen Analytics Director, Enterprise Segment Datacenter and Connected System Group Patrick Buddenbaum

Big Data launch keynote Singapore Patrick Buddenbaum

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Big Data launch keynote Singapore Patrick Buddenbaum

Open Platform for Next-Gen Analytics

Director, Enterprise Segment

Datacenter and Connected System Group

Patrick Buddenbaum

Page 2: Big Data launch keynote Singapore Patrick Buddenbaum

Today’s presentations contain forward-looking statements. All statements made that are not historical facts are subject to a number of risks and uncertainties, and actual results may differ materially. Please refer to our most recent Earnings Release and our most recent Form 10-Q or 10-K filing for more information on the risk factors that could cause actual results to differ.

If we use any non-GAAP financial measures during the presentations, you will find on our website, intc.com, the required reconciliation to the most directly comparable GAAP financial measure.

INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.

Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, reference www.intel.com/software/products.

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult otherinformation and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.

Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative to obtain Intel's current plan of record product roadmaps.

Legal Information

Page 3: Big Data launch keynote Singapore Patrick Buddenbaum

Making Sense of One Petabyte

50xTo read

in Library of Congress

13yTo view

as HD Video

11sTo generate

in 2012

Sources: IDC 2012, The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far Easthttp://blogs.loc.gov/digitalpreservation/2011/07/transferring-libraries-of-congress-of-data/

Page 4: Big Data launch keynote Singapore Patrick Buddenbaum

Analysis of Data can Transform Society

Enhance understanding, drive innovation, and accelerate medical cures

Create new business models and transform organizational processes

Improve public safety and increase energy efficiency with smart grids

Page 5: Big Data launch keynote Singapore Patrick Buddenbaum

Virtuous Cycle of Data-Driven User Experience

CLOUD

Richer data to analyze

CLIENTS

Richer data from devices

Richer user experiences

INTELLIGENT SYSTEMS

Page 6: Big Data launch keynote Singapore Patrick Buddenbaum

Democratize Data Analysis from Edge to Cloud

Unlock value in silicon

Support open platforms

Intelligent Systems Framework

Page 7: Big Data launch keynote Singapore Patrick Buddenbaum

Intel at the Intersection of Big Data Forces

Enabling exascale computing on massive data sets

Helping enterprises build open interoperable clouds

Contributing code and fostering ecosystem

HPC Cloud Open Source

Intel®TrueScaleInfiniband

* Other names and brands may be claimed as the property of others.

Page 8: Big Data launch keynote Singapore Patrick Buddenbaum

Research

Benchmarking

TuningOptimization

Product

History of Intel and Apache Hadoop*

2009 2013

Open Cirrus*

HiBenchRelease 1.0

(2011)

* Other names and brands may be claimed as the property of others.

Release 2.0(2012)Telco Smart City

Web

RetailHealthcare

Page 9: Big Data launch keynote Singapore Patrick Buddenbaum

Announcing Availability ofIntel® Distribution for Apache Hadoop* software

Hardware-enhanced performance & security

Enables partner innovation in analytics

Strengthens Apache Hadoop* ecosystem

* Other names and brands may be claimed as the property of others.

Page 10: Big Data launch keynote Singapore Patrick Buddenbaum

Intel® Distribution for Apache Hadoop* software

• Up to 20x faster decryption with AES-NI*• Granular access controls for Hbase

• Optimized with SSD and Cache Acceleration• Up to 8.5X faster queries in Hive• Hardware-enhanced compression with AVX & SSE4.2

• Automated tuning with Intel® Active Tuner

*Based on internal testing

Page 11: Big Data launch keynote Singapore Patrick Buddenbaum

Intel Distribution for Apache Hadoop* software

* Other names and brands may be claimed as the property of others.

Intel® Manager for Apache Hadoop softwareDeployment, Configuration, Monitoring, Alerts, and Security

HDFSHadoop Distributed File System

YARN (MRv2)Distributed Processing Framework

HB

ase

Colu

mna

r St

ore

Zook

eepe

rCo

ordi

natio

n

Flum

eLo

g Co

llect

orSq

oop

Dat

a Ex

chan

ge PigScripting

HiveSQL Query

OozieWorkflow

MahoutMachine Learning

R connectorsStatistics

Intel enhancements contributed back to open source

Open source components included without change

Intel unique

Page 12: Big Data launch keynote Singapore Patrick Buddenbaum

Sold with World-Class Intel Support

Annual Subscription with Technical Support

Support Coverage Options: 24x7 or 8x5

Via Solution Vendors and Service Providers

Page 13: Big Data launch keynote Singapore Patrick Buddenbaum

Continued Innovation

* Other names and brands may be claimed as the property of others.

Pipeline of innovation from Intel Labs• Machine Learning, Graph Lab & Graph Builder• Data-Intensive Algorithms & Computer Architecture

Roadmap of open source from Intel Software• Project Rhino: Hardening Apache Hadoop• Project Panthera: Standard SQL on Apache Hadoop

Page 14: Big Data launch keynote Singapore Patrick Buddenbaum

Backed by Broad Portfolio of Datacenter ProductsSoftware

CacheAccelerationSoftware

NetworkStorage & MemoryServer

Page 15: Big Data launch keynote Singapore Patrick Buddenbaum

* Other names and brands may be claimed as the property of others.

Antoine HueRegional Sales Manager

APJC Data Center

Page 16: Big Data launch keynote Singapore Patrick Buddenbaum

>4 Hours to 7 MinutesIntel Platform Benefits for Sorting 1TB Data

Intel® Xeon 5690

7200 HDD

1GbE Adapters

Intel® Xeon®

E5-2690processor

~50%improved

Intel® SSD 520

Series

~80%improved

Intel® 10GbE

Adapters

~50%improved

Deploy IntelDistribution for Apache Hadoop*

~40%improved

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.

Source: Intel Internal testingFor more information go to : intel.com/performance

`

>4 Hours

~7 mins

Page 17: Big Data launch keynote Singapore Patrick Buddenbaum

Proven in the Enterprise

Using the Intel® Distribution to gain tremendous results

* Other names and brands may be claimed as the property of others.

IT

Page 18: Big Data launch keynote Singapore Patrick Buddenbaum

Customer Video

Page 19: Big Data launch keynote Singapore Patrick Buddenbaum

With Broad Support from the Ecosystem

* Other names and brands may be claimed as the property of others.

Page 20: Big Data launch keynote Singapore Patrick Buddenbaum

Chris LevanesDirector of Cloud Business Development

Savvis Asia

Page 21: Big Data launch keynote Singapore Patrick Buddenbaum
Page 22: Big Data launch keynote Singapore Patrick Buddenbaum

The Promise of Big Data Requires Industrialized Services

Page 23: Big Data launch keynote Singapore Patrick Buddenbaum

• Trusted, mission critical, high-powered computing solutions

• Robust security options

• Enterprise-grade global storage capabilities

• Highly available compute power

• Cloud-based economic model

• Expert consulting services to aide in transformation of data assets

Big Data Customers Need

BIGDATA

Page 24: Big Data launch keynote Singapore Patrick Buddenbaum

A Longstanding Successful Alliance

Page 25: Big Data launch keynote Singapore Patrick Buddenbaum

Enterprise-Grade, Industrialized Infrastructure Services for Intel Distribution for Apache Hadoop Software

Page 26: Big Data launch keynote Singapore Patrick Buddenbaum

Summary

• Intel announced Intel® Distribution for Apache Hadoop* software

• Delivers performance, security and ease of deployment

• Backed by broad portfolio of Intel data center products

• Contributes to open source and supports Apache Hadoop

• Enabling ecosystem of partners to innovate on analytics solutions

Page 27: Big Data launch keynote Singapore Patrick Buddenbaum

Q&A

Page 28: Big Data launch keynote Singapore Patrick Buddenbaum

Legal DisclaimersAll products, computer systems, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice.Intel processor numbers are not a measure of performance. Processor numbers differentiate features within each processor family, not across different processor families. Go to: http://www.intel.com/products/processor_number

Intel, processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request.

Intel® Virtualization Technology requires a computer system with an enabled Intel® processor, BIOS, virtual machine monitor (VMM). Functionality, performance or other benefits will vary depending on hardware and software configurations. Software applications may not be compatible with all operating systems. Consult your PC manufacturer. For more information, visit http://www.intel.com/go/virtualization

No computer system can provide absolute security under all conditions. Intel® Trusted Execution Technology (Intel® TXT) requires a computer system with Intel® Virtualization Technology, an Intel TXT-enabled processor, chipset, BIOS, Authenticated Code Modules and an Intel TXT-compatible measured launched environment (MLE). Intel TXT also requires the system to contain a TPM v1.s. For more information, visit http://www.intel.com/technology/security

Intel, Intel Xeon, Intel Atom, Intel Xeon Phi, Intel Itanium, the Intel Itanium logo, the Intel Xeon Phi logo, the Intel Xeon logo and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.

Other names and brands may be claimed as the property of others.

Copyright © 2013, Intel Corporation. All rights reserved.

Page 29: Big Data launch keynote Singapore Patrick Buddenbaum

Apache Hadoop Performance Test Configuration4 hours to 7 minutes

Cluster Configuration 1 Head Node (name node, job tracker) 10 Workers (data nodes, task trackers) 10-Gigabit Switch: Cisco Nexus 5020

Software Configuration Intel Distribution for Apache Hadoop 2.1.1 Apache Hadoop 1.0.3 RHEL 6.3 Oracle Java 1.7.0_05

29

Head Node Hardware 1 x Dell r710 1U servers

Intel: 2x3.47GHz Intel® Xeon®

processor X5690 Memory: 48G RAM Storage: 10K SAS HDD Intel® Ethernet 10 Gigabit SFP+ Intel® Ethernet 1 Gigabit

Worker Node Hardware 10 x Dell r720 2U servers

Intel: 2 x 2.90Ghz Intel® Xeon® processor E5-2690 Memory: 128G RAM Storage: 520 Series SSDs Intel® Ethernet 10 Gigabit SFP+ Intel® Ethernet 1 Gigabit