Don’t Lose The Data Battle Before It Even Begins Maneesh Joshi Senior Director, Product Marketing and Strategy SnapLogic

Don't Lose the Data Battle Before it Begins | #EDW13 Enterprise Data World 2013 | Session by Maneesh Joshi

DESCRIPTION

In this session, SnapLogic's Maneesh Joshi will share his perspective on why the data integration market is ripe for disruption and what that means for data scientists and data integration professionals. The session covers critical topics for companies that desperately need big insights to remain competitive but struggle to digest massive amounts of data from a variety of sources. Because analytics algorithms are only as good as the comprehensiveness of the data they process, keeping data current and relevant is crucial before the battle for data insights even begins. This session covers:

==> SnapLogic’s vision for the future of integration, and integration’s role in empowering companies to be more agile and competitive
==> Pragmatic techniques for integrating across today’s increasingly disparate data varieties (unstructured vs. flat files), growing volumes of information (Hadoop clusters vs. data warehouses), and increasing velocities of data (real time vs. batch)
==> Tips for integrating Hadoop data with other data sources, including leading Business Intelligence (BI) apps, for better information flow and decision-making
==> Best practices for dramatically lowering integration costs and improving time to value

To learn more, visit: http://www.snaplogic.com/.

About Maneesh: Maneesh Joshi has over 15 years of experience in the enterprise software space, primarily in application and data integration. In his current role as Senior Director of Product Marketing at leading enterprise cloud integration company SnapLogic, he is responsible for its global go-to-market strategy and product marketing. Prior to this position, Maneesh was the head of platform product marketing at Informatica. He started his career as a key member of the team that built Oracle’s Service Oriented Architecture and Business Process Management businesses. Before running product marketing for this group, he managed product planning, architecture, and engineering operations for Oracle’s integration products. Maneesh holds a B.S. in Engineering from the Indian Institute of Technology, Kharagpur, where he graduated with honors. He also received an M.S. in Engineering from the University of California, Davis, and an M.B.A. from The Wharton School at the University of Pennsylvania.

Page 1

Don’t Lose The Data Battle Before It Even Begins

Maneesh Joshi, Senior Director, Product Marketing and Strategy, SnapLogic

Page 2

The data battle is brewing

Commodity Hardware

Open Source

Algorithm Output = f(Your Data)

Page 3

Data is everywhere

Users

Mobile

Enterprise

Cloud Big Data

Data Center

ESB RDBMS

Page 4

Simplifying Data Access

Users

Mobile

Enterprise

Cloud Big Data

Data Center

ESB RDBMS

Page 5

Data Containerization with Snaps

BUY

• SnapStore

• Certified and supported by SnapLogic

BUILD

• SDK + API

• Java, Python

• Customer, Partner or SnapLogic

Page 6

Big Data as a Service Architecture

Structured Data

Unstructured Data

DB

1. Collect

2. Translate & Enrich

3. Distribute

DataView

DB

Amazon Redshift
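
The three stages named on this slide (Collect, Translate & Enrich, Distribute) can be sketched as a chain of generators. This is only an illustration of the staging idea, not SnapLogic's implementation: the sample records, field names, `price_band` rule, and target names below are all hypothetical, and plain Python lists stand in for the databases and views shown on the slide.

```python
from typing import Iterable, Iterator

# Hypothetical in-memory sources standing in for structured and unstructured inputs.
STRUCTURED_ROWS = [{"sku": "A1", "price": "19.99"}, {"sku": "B2", "price": "5.00"}]
UNSTRUCTURED_LINES = ["sku=C3 price=7.50 note=clearance"]


def collect() -> Iterator[dict]:
    """Stage 1: gather records from every source into one stream."""
    for row in STRUCTURED_ROWS:
        yield {"source": "structured", **row}
    for line in UNSTRUCTURED_LINES:
        # Parse loose key=value text into a dict.
        fields = dict(token.split("=", 1) for token in line.split())
        yield {"source": "unstructured", **fields}


def translate_and_enrich(records: Iterable[dict]) -> Iterator[dict]:
    """Stage 2: normalize types and add a derived field (assumed rule)."""
    for record in records:
        price = float(record["price"])
        band = "low" if price < 10 else "high"
        yield {**record, "price": price, "price_band": band}


def distribute(records: Iterable[dict]) -> dict:
    """Stage 3: route each record to named targets (plain lists here)."""
    targets = {"warehouse": [], "data_view": []}
    for record in records:
        targets["warehouse"].append(record)
        if record["price_band"] == "low":
            targets["data_view"].append(record["sku"])
    return targets


result = distribute(translate_and_enrich(collect()))
print(result["data_view"])
```

Because each stage only consumes an iterable of records, a new source or target can be added without touching the other stages.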

Page 7

Select Customers


Corporate Power

Page 8

Agile Cloud ETL for CouchDB on AWS

BACKGROUND

• BBY Open vision to encourage a vibrant reseller and developer community.

• Data propagation from 15-20 major backend systems, accrued over 12-15 years

• Backend systems are continually changing (30+ per month), hence the need to move away from hand coding

• Million+ SKUs: Product information, Warranty Plans with 100+ Attributes and Pricing (with 16 Localization Scenarios)

• External traffic is expanding by million hits per month

APPROACH

• Store information on CouchDB in AWS as semi-structured JSON documents with 100+ attributes

• SnapLogic, TIBCO and Java programs to be used in “ingestion” layer to maintain data in CouchDB

• Use AWS Elastic Search Layer on top of CouchDB to provide querying

BENEFITS TO THE CLIENT

• 6x improvement in development time compared to TIBCO and hand-coding

• Intuitive graphical designer allows for agile changes in response to requirements

• Seamless integration between cloud applications and on-premise legacy applications, with conversion between structured and semi-structured data

• Building Snaps in SnapLogic was determined to be the fastest way to connect to new systems

• Fully automated one-touch deployment allows for elastic scaling of the SnapLogic cluster
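
The conversion between structured and semi-structured data mentioned on this slide can be illustrated with a small sketch that folds flat attribute rows into one nested JSON document per SKU. It is an assumption-laden example, not the client's actual pipeline: the row layout, the dotted-key convention for localized prices, and the sample values are hypothetical. The only CouchDB-specific detail used is `_id`, the field CouchDB uses as a document's identifier.

```python
import json

# Hypothetical flat rows, as a relational backend might export them:
# one row per (sku, attribute, value) triple.
rows = [
    ("SKU-1", "name", "HD Television"),
    ("SKU-1", "warranty_plan", "2-year"),
    ("SKU-1", "price.en_US", "499.00"),
    ("SKU-1", "price.fr_CA", "649.00"),
    ("SKU-2", "name", "Soundbar"),
]


def rows_to_document(sku, rows):
    """Fold the attribute rows for one SKU into a nested, JSON-ready dict."""
    doc = {"_id": sku}  # CouchDB identifies documents by the "_id" field
    for row_sku, key, value in rows:
        if row_sku != sku:
            continue
        # Dotted keys (an assumed convention) become nested objects.
        *parents, leaf = key.split(".")
        target = doc
        for part in parents:
            target = target.setdefault(part, {})
        target[leaf] = value
    return doc


document = rows_to_document("SKU-1", rows)
print(json.dumps(document, indent=2, sort_keys=True))
```

A document shaped this way can carry any number of attributes per SKU without a schema change, which is the property the slide attributes to semi-structured storage.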

Page 9

Summary

• Algorithm Output = f(Your Data)

• Without comprehensive data inputs, the battle of Big Data is lost before it even begins

• SnapLogic speeds up access to structured and unstructured data, both in the cloud and on-premise

Page 10

Q&A