9
Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics An Oracle White Paper October 2013 Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

Oracle Data Integrator 12c (ODI12c) - Powering Big Data ... · PDF fileOracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics ... Oracle Data Integrator

Embed Size (px)

Citation preview

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

An Oracle White Paper

October 2013

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

2

Introduction:

The value of analytics is so widely recognized today that all mid to large scale enterprises have

deployed some level of Business Intelligence (BI) tools and technologies. In spite of this

widespread implementation of analytics, the promise of operational effectiveness is not being

fully realized. According to a recent survey conducted by independent analyst firm, BI

Scorecard, adoption of Business Intelligence (BI) remains surprisingly low—reaching only 25

percent of potential users. For many, the Business Intelligence (BI) tools available are difficult

to use and slow to respond and the content they deliver is of little relevance. As a result,

business users default to making decisions based on incomplete information or “gut feeling.”

The same survey also noted that while 26 percent of respondents reported that their Business

Intelligence deployments were successful, the vast majority of Business Intelligence (BI)

programs are falling short of their potential to increase revenues, improve customer service,

reduce costs, and use strategic insights to create innovative new products or services

Is Data Integration Limiting Business Analytics?

There are a number of factors why organizations have not realized their full potential in analytics.

These are three key factors where Data Integration technologies can help play a key role in solving:

Business Analytics systems are running on stale and incorrect data. Without real-time good

quality data in the data warehouses, data loses its relevance. This negatively impacts operational

efficiencies and jeopardizes the carefully crafted customer experiences.

Business Analytics systems are not designed for Big and Fast Data. Future-proofing business

analytics and data warehousing system is becoming a growing concern to address the increasing

volume, velocity and variety of big data.

Business Analytics systems are disconnected. When Business Intelligence (BI) systems are

fragmented from various planning systems, transactional systems and packaged applications (i.e. ERP,

CRM), facts and assumptions are misaligned. The pitfalls include cost overruns, inventory shortage and

surplus, unnecessary risks, and delayed responses to threats and opportunities.

Each of these obstacles can be overcome by utilizing key best practices in data integration as the core

foundation to business analytics and data warehousing. These strategies include integrating data in real-

time, effectively tapping the value by integrating big data, and finally, integrating data end-to-end with

business analytics systems. In this paper we shall discuss these requirements and how the latest release

of Oracle Data Integrator 12c (ODI12c) along with supporting Oracle Technologies addresses this

growing need of powering Business Analytics for organizations.

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

3

The Enterprise Data Warehouse – Feeding Business Analytics

Even in today’s world of Big Data, the enterprise data warehouse plays a critical role in feeding

Business Intelligence applications for businesses. Most organizations have a traditional Data warehouse

that needs to be kept current and integrated with their reporting structures to make sure that relevant

data is being surfaced to line of business operational reports and management dashboards. These are

increasingly being integrated with new analytic data warehouses and Big Data clusters and real time

data feeds to provide a comprehensive data store to the Analytics technologies. This population of the

data store requires a Data Integration technology that

Handles bulk loads of traditional and Big Data in and out of the sources and data warehouses

Incorporates Change Data Capture technologies to enable Real Time data movement.

Should be part of a Fast Data ecosystem to handle Big Data, and most importantly

Should be tightly integrated with an Enterprise Business Analytics Solution.

Analyzing Big and Fast Data

Businesses today need to analyze huge amounts of data, including social media activity, web logs,

warranty claims, call center activity, movement of assets with RFID tags, and sensor data. In most

cases, organizations need to integrate unstructured data with data that resides in multiple data sources

such as data warehouses, enterprise business applications, and online analytic processing (OLAP)

cubes. This ‘big data’ is having a tremendous impact on the solution space for business analytics, data

warehousing and data integration. This has to do primarily with the fact that big data poses new

questions for the best ways to process volumes and varieties of data at higher speeds and at faster

velocity. One recent new standard that has emerged, Fast Data, specifically deals especially with this

velocity component of big data and it’s increasingly seen as an important element that ties together

concepts in real-time and in-memory processing with big data and analytics.

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

4

Fig1: Big Data and Enterprise Data should be used in conjunction for best results.

Loading and Transforming Big Data using Oracle Data Integrator

Oracle Data Integrator is the flagship Oracle Data Integration product. Oracle Data Integrator is

critical to big data projects in providing much needed connectivity to load and transform data from

Hadoop sources and to support structured and relational sources. Oracle Data Integrator leverages

unified tooling for both big data and enterprise data which translates into a faster learning curve as well

as seamless usability so that the data scientist or data analyst can focus on integration versus usability.

Flow based declarative designs in the new release of Oracle Data Integrator, ODI12c helps build

complex expressions that are easy to maintain and support. Two representations of the same ELT

mappings, the logical representation and the physical representation, provide customized working

environments for business analysts and data scientists. Oracle Data Integrator Application Adapter for

Hadoop simplifies data loading and movement between Hadoop and an Oracle Database through

Oracle Data Integrator 12c (ODI12c)’s easy to use rebuilt interfaces.

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

5

Fig 2: By providing efficient connectivity between Oracle Database and Hadoop, Oracle Big Data Connectors enables

analysis of all data, both structured and unstructured, in enterprise data warehouses.

The Need for Speed - Real-time Data Warehousing and Analytics:

Real time data is absolutely required for efficient operational decision-making. As latency increases,

data becomes less relevant and the ability to act on it decreases, robbing a company’s opportunity to

improve operational efficiencies and increase revenue. Real-time data integration ensures that the data

used in decision-making is as current as possible and in some cases, virtually in sync with operational

information.

Traditional Business Intelligence and Data Warehousing solutions give users an excellent view of past

events and entities by accessing historical data. These solutions typically monitor slowly moving trends,

such as product defects or sales performance. Users gain insight into these activities by querying a data

warehouse, which is updated periodically.

Historical analysis is important, but it mainly targets strategic, long-term decision-making by analyzing

trends, and it doesn’t let people respond to operational events happening in the organization. As data

latency increases, its relevance to operational events diminishes. BI solutions for day-to-day operations

need to leverage the most current data to enable employees to take action right away. Data integration

solutions that leverage heterogeneous Change Data Capture (CDC) technology can establish real-time

links to production data sources in addition to performing bulk updates. These solutions work

efficiently by moving only the changed data to the decision support system.

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

6

Real-time Business Analytics - Oracle Data Integrator with Oracle GoldenGate

Real time reporting is best delivered when coupled with effective Change Data Capture (CDC)

technology. Oracle’s strategy for enabling real time is realized through Oracle GoldenGate. To enable

real-time data acquisition, Oracle GoldenGate uses log-based, real-time CDC capabilities to provide

continuous capture and delivery of the most recently changed data between OLTP systems and the

data warehouse. The application offers transactional, real-time data capture, routing, transformations,

and delivery, using the push approach. Real time changes to the data are moved to the data warehouse

where it can drive enhanced, strategic, and operational BI capabilities. Oracle GoldenGate can perform

basic, row-level transformations at the point of capture or at the time of delivery.

For heavy transformation requirements Oracle GoldenGate delivers transactional data to a staging area

in the data warehouse for in-database transformations with Oracle Data Integrator to support an E-LT

architecture. Oracle Data Integrator utilizes the native database or execution platform to ensure that

performance gains are not just limited to within the Oracle technology but also in the platform

investments that forms a large part of the Data Integration landscape. Because of this E-LT

architecture, query performances are faster and consequently transformations are faster and more

efficient.

Fig 3: Oracle GoldenGate and Oracle Data Integration combine to provide real time Data warehousing.

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

7

Integrated BI Applications – Oracle BI Powered by Oracle Data Integrator

Oracle’s latest Business Intelligence Applications (OBIA) uses Oracle Data Integrator (ODI) for its

data integration requirements. Built into the OBIA solution, Oracle Data Integrator provides load

plans, pre built Knowledge Modules and user roles to populate and maintain fact and dimension tables

for OBIA. Load plans, are executables that comprises of multiple dependent and parallel ETL steps

that can be configured to load fact tables from sources. A load plan generator plug is provided that

helps in creating load plans. For more information on understanding how ODI works with OBIA click

here.

Previously, for OBIA versions before 11.1.1.7.1, this ETL part was performed via 3rd party tools. With

the integration of ODI into the solution monitoring and user management is seamless with the ODI

console acting as a monitoring and restarting during failovers and or job statuses.

Dedicated BIApps Knowledge Modules can be used for updating Facts, Slowly Changing Dimensions,

targets and periods. The use of Knowledge Modules eases development cycles in integrating data into

source and target table definitions and allows your whole Business Intelligence solutions to be

maintained as a single unit, delivering a single point of support, following a logical product lifecycle and

lowering TCO.

Oracle Data Integrator enables organizations using BI Applications to increase IT efficiency and

reduce costs with a comprehensive data integration platform that covers all data integration

requirements – including big data, application integration, as well as BI / data warehousing.

Fig 4: Complete integrated Data Integration components for multiple use scenario

Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics

8

Summary

Only Oracle provides a complete and best-of-breed platform for integrated business analytics solutions

to handle increasing volume, velocity & variety of big data. The new releases of Oracle Data Integrator

12c (ODI12c) and Oracle GoldenGate 12c has cemented Oracle’s commitment to generate increased

value for customers looking to implement real time analytics use cases, better tap big data sources. The

components of this platform are engineered to work together. In addition, each component is based

on open standards and interfaces that allow customers to leverage their existing enterprise standard

components in areas where investments may have already been made. Oracle is the only vendor today

that can add even more increased value in analytics with unmatched performance, an eye to the future

trends and lower TCO.

To learn more about Oracle Data Integration: Oracle Data Integration

To learn more about Oracle Business Analytics: Oracle BI

White Paper Title

October 2013

Author: Madhu R Nair

Contributing Authors: Dain C Hansen

Oracle Corporation

World Headquarters

500 Oracle Parkway

Redwood Shores, CA 94065

U.S.A.

Worldwide Inquiries:

Phone: +1.650.506.7000

Fax: +1.650.506.7200

oracle.com

Copyright © 2013, Oracle and/or its affiliates. All rights reserved.

This document is provided for information purposes only, and the contents hereof are subject to change without notice. This

document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in

law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any

liability with respect to this document, and no contractual obligations are formed either directly or indirectly by this document. This

document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our

prior written permission.

Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.

Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and

are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are

trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group. 0113