Upload
mongodb
View
136
Download
0
Tags:
Embed Size (px)
Citation preview
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75551
Bo BorlandVP Field Technical Sales, Pentaho
Twitter: @boborland
June 2015
Blending Hadoop & MongoDB with Pentaho
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75552
About this session
Today You Will Learn
… to prepare and blend MongoDB and Hadoop data for reporting and analyses
Agenda❯ Overview and preparation❯ Blending data❯ Information presentation
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75553
The Universe of Data is Exploding And Data is Connecting People & Things
New Technology Paradigms have Emerged
1
2
3
New data architectures must blend and analyze all data regardless of source
Users will consume analytics in new ways: data driven apps
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75554 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75554 © 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75554
Entry
Tran
sform
Advanced
Op
tim
ize
What The Market Is Deploying Today And Planning For Tomorrow
Data Warehouse Optimization
Data Refinery
Big Data Exploration
Customer 360 Degree View
Internet of Things
Next Generation Applications
Internal Big Data as a Service
On-Demand Big Data Blending
Big Data Predictive Analytics
Use Case Complexity
Busi
ness
Im
pact
Monetize My Data
Data Warehouse Optimization
Data Warehouse Optimization
Data Refinery
360 Degree View
A Spectrum of Big Data Use Cases
Internet of Things
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75555 © 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75555 © 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75555
Blending the Two Worlds Together
2014,
Customer
Billing
ProvisioningBI TOOLSDATA MARTSEDW
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75556
Blending the Two Worlds Together
Customer
Billing
ProvisioningBI TOOLSDATA MARTSEDW
Location
Sensors
NetworkWeb
Social Media
Blending the Two Worlds Together
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75557
Blending the Two Worlds Together
EDW
Customer
Billing
Provisioning
Location
Sensors
NetworkWeb
Social Media
DATA MARTS
SKILLS
TIME
GOVERNANCE
FLEXIBILITY
COST/VALUE
Blending the Two Worlds Together
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75558
Big Data Blending: Example Overview
❯ Fictional e-commerce data for retail sales
❯ Want to promote product sales through website wish-lists
❯ We have been promoting wish-lists
❯ How is this affecting sales?
❯ Combine web-hits and sales data
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-75559
Big Data Blending: Example Overview
Web server data
Analytics Business User
TrendsSales data
PDI
Archive Aggregate
Web Clicks
Real-time
Ecommerce
Sales
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755510
Big Data Blending: Query MongoDB
❯ Discover MongoDB collection information
❯ Obtain sales information
❯ Use the MongoDB aggregation framework (ELT)
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755511
Big Data Blending: Example Overview
Web server data
Real-time
Ecommerce
Sales
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755512
Big Data Blending: Prepare Hadoop
❯ Process relevant web log lines
❯ Derive information from web log lines
❯ Aggregate by month, log line type
❯ Store intermediate data
❯ Typically schedule daily
❯ Run inside of the Hadoop cluster (ELT)
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755513
Big Data Blending: Example Overview
Web server data
Archive Aggregate
Web Clicks
Mapper
Reducer
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755514
DEMONSTRATION ❯Pentaho Map/Reduce
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755515
Big Data Blending: Hits vs Sales
❯ Join with Hadoop and Mongo web-hits and sales data
❯ De-normalize hits and sales metrics
❯ Blend data on a month level
❯ Make the information available as a virtual database table
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755516
Big Data Blending: Example Overview
PDI
Blended Sales Data
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755517
DEMONSTRATION ❯Blending
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755518
Summary
What We Covered Today:❯ PDI big data handling is easy and scalable❯ PDI blending removes the need to stage big data❯ Analytics and data integration are “better together”
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755519
Next Steps
Want to learn more?❯Take these next steps
❯Try it out yourself!
❯Visit our documentation for samples
❯Talk to us!
© 2015, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-755520
Thank You
blog.pentaho.com
@Pentaho
Facebook.com/Pentaho
Pentaho Business Analytics
JOIN THE CONVERSATION. YOU CAN FIND US ON: