DESCRIPTION
What if you could make data preparation 20 percent of your effort so you can focus 80 percent of your time on executing and improving your business? Come to this session to learn how you can easily use guided search across all Hadoop Distributed File System (HDFS) files with automated data enrichment; highlight which attributes are important, which data elements have statistical meaning, and which have quality issues; use visualization to identify multiple segments of data that matter most; and fix data quality problems and create new data elements—all this and more with big data discovery.
Oracle Big Data Discovery: Unlock Potential in Big Data Reservoir
Gokula Mishra, Premjith Balakrishnan
Business Analytics Product Group
September 29, 2014
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
Oracle Confidential – Internal 3
Agenda
1. Introduction to Big Data Discovery
2. Q&A
Hadoop Data Reservoir Concept Gaining Momentum
[Figure labels: Data Warehouse, Data Reservoir; Existing Sources, Emerging Sources]
Source: wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017
Source: 451 Research – Total Data Warehousing: 2013-2018
Source: The Forrester Wave™: Big Data Hadoop Solutions, Q1 2014
Not Easy to Get Analytic Value from Hadoop Data Reservoir
• Volume, Variety, Velocity = Complexity
– Data not organized
– Complex, non-integrated tools
– Specialized skills required
• Impact: Lack of Analytic Agility
– 80% effort spent on data preparation vs. analytics
• Path to Production Unclear
– Difficult to share with masses
– Hard to secure
– Lack of governance
• Impact: Poor Enterprise Adoption
– Insights not widely leveraged across the organization
The Big Data Opportunity
What if we could reverse:
• 80% - Data Preparation
• 20% - Analysis
Requires a Fundamentally New Approach
An intuitive, interactive and visual user interface for anyone to quickly find, explore, transform and analyze data in Hadoop, then share results for enterprise leverage
• Users: Data Scientist, Business Analyst, Business User
• Steps: Find, Explore, Transform, Discover
• Share results with: Data Warehouse, Business Intelligence, Advanced Analytics, Other Hadoop Tools
• Goals: Increase Analytic Agility, Maximize Enterprise Adoption
Oracle Big Data Discovery. The Visual Face of Hadoop
Find, Explore, Transform, Discover
Easily Find Relevant Data Sets
• Navigate a rich catalog of all data in the Hadoop cluster
• Familiar search and guided navigation for ease of use
• Access data set summaries, annotation and recommendations
• Provision your own data through self-service upload
• Browse personal big data projects and those shared by the community
Explore the Data and Understand Potential
• Understand shape of the data. Visualize attributes by type
• Entropy-based sorting by information potential
• View attribute statistics, data quality and outliers
• Use scratch pad to see statistical correlations between attribute combinations
• Evaluate whether a data set is worthy of further investment
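The slides do not say how "information potential" is computed. As a minimal sketch, assuming it resembles the Shannon entropy of each attribute's value distribution, attributes could be ranked as follows. The function names and sample rows are hypothetical and are not part of Oracle Big Data Discovery.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of `values`."""
    counts = Counter(values)
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_attributes(rows):
    """Return attribute names sorted from highest to lowest entropy."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    scores = {name: shannon_entropy(vals) for name, vals in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical sample records, for illustration only.
rows = [
    {"country": "US", "status": "active", "plan": "basic"},
    {"country": "US", "status": "active", "plan": "pro"},
    {"country": "US", "status": "inactive", "plan": "basic"},
    {"country": "US", "status": "active", "plan": "team"},
]

ranking = rank_attributes(rows)
print(ranking)
```

In this sample, the constant `country` attribute has zero entropy and sorts last, which matches the intuition that an attribute with a single value carries no information. Raw entropy also ranks unique identifiers highest, so a practical ranking would likely normalize or filter them. That refinement is an assumption, not something the slides describe.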
Transform and Enrich Data to Make it Ready
• Intuitive user-driven data wrangling
• Library of data transformations to replace values, convert types, collapse, reshape, pivot, group, custom tag, merge and much more
• Data enrichments for inferring location and language. Theme, entity and sentiment enrichments for text
• Preview results, undo, commit and replay transforms
• Run on sample data in memory or full data set in Hadoop
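The preview, undo, commit and replay workflow can be pictured as an ordered script of transforms that is tried on a sample and then replayed on the full data set. The sketch below is plain Python and only illustrates the idea. `TransformScript`, `replace_value` and `convert_type` are hypothetical names, not the product's API, and the in-memory list stands in for data that would actually live in Hadoop.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Row = Dict[str, object]
Transform = Callable[[Row], Row]


@dataclass
class TransformScript:
    """An ordered list of row-level transforms that can be undone and replayed."""
    steps: List[Transform] = field(default_factory=list)

    def add(self, step: Transform) -> None:
        self.steps.append(step)

    def undo(self) -> None:
        # Drop the most recently added step, if any.
        if self.steps:
            self.steps.pop()

    def apply(self, rows: List[Row]) -> List[Row]:
        # Apply every step to a copy of each row; the input is left untouched.
        result = []
        for row in rows:
            new_row = dict(row)
            for step in self.steps:
                new_row = step(new_row)
            result.append(new_row)
        return result


def replace_value(column: str, old: object, new: object) -> Transform:
    def step(row: Row) -> Row:
        return {**row, column: new} if row.get(column) == old else row
    return step


def convert_type(column: str, caster: Callable[[object], object]) -> Transform:
    def step(row: Row) -> Row:
        return {**row, column: caster(row[column])}
    return step


# Hypothetical data: the "full" set and a small sample of it.
full_data = [
    {"city": "NYC", "amount": "10"},
    {"city": "New York", "amount": "25"},
    {"city": "NYC", "amount": "7"},
]
sample = full_data[:2]

script = TransformScript()
script.add(replace_value("city", "NYC", "New York"))
script.add(convert_type("amount", int))
script.add(replace_value("city", "New York", "Boston"))  # a mistaken step
script.undo()                                            # remove the mistake

preview = script.apply(sample)       # preview on the sample
committed = script.apply(full_data)  # replay the same script on all rows
print(preview)
print(committed)
```

Keeping the transforms as a replayable list is what lets the same steps run first on a sample and later on the full data. How the product stores or executes its transform scripts is not stated in the slides.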
Analyze the Data to Discover New Insights
• Mash up different data sets for deeper perspectives
• Drag and drop from a rich library of interactive visualizations to compose discovery dashboards
• Filter through data with powerful search and intuitive guided navigation
• Publish blended data sets back to Hadoop
• Share projects, bookmarks and snapshots with team members for collaboration
Share Results and Publish for Enterprise Leverage
• Share and collaborate with the team
– Share projects, bookmarks and snapshots then collaborate and iterate
• Publish back to Hadoop
– Transforms and enrichments may be applied to original data sets in Hadoop
– Publish blended data sets back to HDFS
• Leverage results in other tools
– Publish data to Hadoop in format optimized for advanced analytic tools (e.g. ORAAH)
– Hadoop compliant BI tools (e.g. OBIFS) can burst out to the masses
– Leverage any native Hadoop tooling (e.g. Pig, Hive, Impala, Python, etc.)
– Integrate BDD data sets with DWH to secure, govern and optimize for query performance (e.g. Oracle Big Data SQL)
Oracle Big Data Discovery plays well with the big data ecosystem
[Figure labels: Find, Explore, Transform, Discover; Share & Collaborate; raw data, transformed data; data reservoir (HDFS); Publish; Leverage: data warehouse, business intelligence, advanced analytics, other Hadoop tools]
Oracle’s Unified Big Data Management and Analytics Strategy
[Figure labels: Experiment, Prototype & Collaborate; Productize, Secure & Govern; Data Reservoir (Unstructured Data); Data Warehouse (Structured Data); Hadoop (HDFS); Oracle Database; Oracle Big Data Discovery; Oracle Big Data SQL; Oracle R for Hadoop; Oracle Advanced Analytics; Tables in Hadoop; Tables in DB; SQL join; Oracle SQL Queries; Oracle BI Foundation Suite; In-Memory Appliance; Exalytics; Exadata; BDA]
• Experiment, Prototype, Collaborate
– Quickly find, explore, transform, discover and share in BDD
– Publish results to HDFS
– Use to build predictive models with Oracle R for Hadoop
• Productize, Secure, Govern
– Connect published HDFS files to secure Oracle DB using Oracle Big Data SQL
– No data movement required
– Seamlessly extends existing DWH and BI investments with non-traditional data in Hadoop
• Available as Engineered Systems
Oracle Big Data Discovery. A Game Changing Platform
Benefits to the Business
• Get Value Faster. Rapidly turn raw data into actionable insights that can be leveraged across the enterprise
• Democratize Value from Big Data. Increase the size, diversify the skills, and improve the efficiency of Big Data project teams
Benefits to IT
• Destroy Existing Technical Barriers. Run natively on Hadoop cluster for maximum scalability and performance
• Share, Publish, Secure and Leverage. Integrate with Hadoop open standards and leverage the Oracle big data ecosystem