30
ETL with WSO2 Enterprise Middleware Platform Prabath Abeysekara - Associate Technical Lead

ETL with WSO2 Enterprise Middleware Platform

  • Upload
    wso2

  • View
    1.415

  • Download
    4

Embed Size (px)

Citation preview

Page 1: ETL with WSO2 Enterprise Middleware Platform

ETL with

WSO2 Enterprise Middleware PlatformPrabath Abeysekara - Associate Technical Lead

Page 2: ETL with WSO2 Enterprise Middleware Platform
Page 3: ETL with WSO2 Enterprise Middleware Platform
Page 4: ETL with WSO2 Enterprise Middleware Platform

Outline● A Classic Use Case

● What’s ETL and How It Is Interpreted In The Modern World?

● Why ETL?

● Challenges In Implementing ETL Solutions

● Why Traditional Standalone ETL Products Are Considered

Dead In The Modern World?

● What Factors To Be Considered When Implementing ETL In

Re-Architecting A System?

Page 5: ETL with WSO2 Enterprise Middleware Platform

Outline contd..

● Impact Of Tooling

● Reference Architecture

○ How to build an “efficient, robust, scalable, auditable,

performing and maintainable” ETL solution with WSO2

EMP?

● Demo - Data Mapping With WSO2 Developer Studio

● Summary

● Q&A

Page 6: ETL with WSO2 Enterprise Middleware Platform

A Classic Use Case - Financial Sector

Financial Reporting

Revenue Predictions

Other Analytics &

BI fronts

RDBMS

XML, Web Services

Flat files

ETL Process Enterprise

Data Warehouse

Page 7: ETL with WSO2 Enterprise Middleware Platform

What’s ETL? - Traditional Interpretation

● Extract

● Transform

● Load

Page 8: ETL with WSO2 Enterprise Middleware Platform

What’s ETL? - Modern Interpretation

● Extract

● Monitor

● Profile/Audit

● Analyze

● Cleanse

● Transform

● Load

Page 9: ETL with WSO2 Enterprise Middleware Platform

Why ETL?

● Generally, to build and maintain data repositories with “single version of the truth” out of the multiple heterogenous data sources scattered across an organization or a business domain.

● Then, the business users can use that data for,○ Predictive Analysis○ Revenue predictions and comparisons○ Monitor Overall Growth of an organization○ Business Policies○ Strategic Decisions

Page 10: ETL with WSO2 Enterprise Middleware Platform

Challenges

● Data definition establishment

● Need for expert knowledge

● Scalability and Performance

● Business user acceptance and seamless support for wide range of business use cases

● Maintenance, Data Archival

● Real-time or Near Real-time data synchronization

Page 11: ETL with WSO2 Enterprise Middleware Platform

Why Standalone ETL Products Are Dead?

● Modern day organizations are evolving as it’s never been before.

● Tendency to adopt architecture patterns such as SOA to reduce IT costs and have flexible business processes is rapidly increasing.

● Organizations are more focussed towards “Connected businesses”.

● Thus, it’s very likely that an organization might have a IT infrastructure in place already.

Page 12: ETL with WSO2 Enterprise Middleware Platform

Why Standalone ETL Products Are Dead?

● Adopting a standalone ETL product? Possible, but worthwhile?

● Generally less support for open standards. Extension points? Connectors? More custom code!

● Usually, relies on some proprietary data integration patterns, inducing high maintenance costs.

● Additional licensing costs, need for separate expert/operational assistance, again inducing high maintenance costs.

Page 13: ETL with WSO2 Enterprise Middleware Platform

Why Standalone ETL Products Are Dead?

● Tendency to use in-house re-usable business components leveraging the benefits of SOA

● Less operational costs

● Scalability is a main focus nowadays.

● Having a similar process implemented enables, horizontal scalability at different layers as the need arises.

Page 14: ETL with WSO2 Enterprise Middleware Platform

Re-Architecting A System’s DIL?

● Data Integration is always cumbersome

● Need for ensuring policy compliance of data at its target containers. (usually Enterprise Data Warehouses, Central MDM repositories, etc)

● Flexibility

● Ensuring acceptable Performance

● What about Reliability?

Page 15: ETL with WSO2 Enterprise Middleware Platform

Re-Architecting A System’s DIL?

● How to deal with the freshness of data?

● When to synchronize?

● Need for tuning the system to meet various SLAs

Page 16: ETL with WSO2 Enterprise Middleware Platform

Impact Of Tooling

XSLT Custom Code

Scripts

Page 17: ETL with WSO2 Enterprise Middleware Platform

Impact Of Tooling

● Numerous ETL solutions fail because of the lack of tooling.

● Developers/Solution composers are left with manual coding of XSLT, Custom mappers, etc.

● Not scalable!

● Often requires a powerful flexible tooling platform particularly, as the system grows and matures.

Page 18: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture

Page 19: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture - Big Picture

ESB

BAM

DSS

DS

MB MB

DSS

Enterprise DW

Page 20: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture - Reliable extraction

ESB

DSS

DS

MB

Scheduled Tasks

Page 21: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture - Validate & Transform

Data Model X Data Model Y

ESB

WSO2 Data Mapper

Input Data Model

Output Data Model

Page 22: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture - Auditing

ESB

BAM

Data Quality Reports/ Dashboards

Data Policy Compliance Reports/ Dashboards

Page 23: ETL with WSO2 Enterprise Middleware Platform

Reference Architecture - Reliable Loading

ESB MB

DSS

Enterprise DW

Page 24: ETL with WSO2 Enterprise Middleware Platform

Tooling - Smooks Editor

Page 25: ETL with WSO2 Enterprise Middleware Platform

Tooling - WSO2 Data Mapper

Page 26: ETL with WSO2 Enterprise Middleware Platform

Demo

● Building a transformation between two simple data models using the Smooks Editor shipped with WSO2 Developer Studio.

Page 27: ETL with WSO2 Enterprise Middleware Platform

Summary

● ETL, plays a pivotal role in any business organization.

● Often requires a lot of effort put into implementing a proper ETL process within an organization.

● Standalone ETL solutions can be costly.

● Re-architecting data models is made easy with WSO2 Enterprise Middleware Platform.

Page 28: ETL with WSO2 Enterprise Middleware Platform

References

[1] How to use the Smooks Editor shipped with WSO2 Developer Studio

http://wso2.com/library/tutorials/2011/06/perform-data-mapping-smooks-editor-wso2-carbon-studio/

Page 29: ETL with WSO2 Enterprise Middleware Platform

Q&A

Page 30: ETL with WSO2 Enterprise Middleware Platform