16
The Next-Generation EDW Is The Big Data Warehouse Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics by Noel Yuhanna August 29, 2016 FOR ENTERPRISE ARCHITECTURE PROFESSIONALS FORRESTER.COM Key Takeaways Without Modernizing Your Current EDW Platform, You Will Likely Fail Business users are demanding faster, more real- time, and integrated customer analytics from multiple sources, so they can make better decisions and increase their company’s competitiveness. Current EDW platforms have gaps and limitations that fail to meet these new requirements. Forrester’s Big Data Warehouse Strategy Extends The Existing EDW Framework Based on interviews of customers and vendors, Forrester has laid out an architecture to guide enterprise architects in creating a big data warehouse framework tailored to their firm’s requirements to support both existing and new actionable business insights. You Need A Big Data Warehouse Strategy To Succeed Big data warehouse is a modern data warehouse architecture that leverages traditional and new data repositories, in-memory, cloud, and other technologies. Why Read This Report EDW is not dead; it’s evolving! Enterprise data warehouses have come a long way in delivering value by predicting trends, minimizing churn, and identifying new business opportunities. However, in the era of big data, traditional EDW is failing to meet new business requirements, such as support for real-time and ad hoc customer analytics, new sources of data, and self-service capabilities. Enterprise architects should read this report to learn how the new big data warehouse addresses these gaps by delivering timely and actionable insights to gain competitive edge and enable innovation and growth.

The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

The Next-Generation EDW Is The Big Data WarehouseBig Data Warehouses Drive Faster, Integrated, Self-Service Analytics

by Noel YuhannaAugust 29, 2016

For ENtErprisE ArchitEcturE proFEssioNAls

ForrESTEr.com

Key takeawaysWithout modernizing Your current EDW Platform, You Will Likely FailBusiness users are demanding faster, more real-time, and integrated customer analytics from multiple sources, so they can make better decisions and increase their company’s competitiveness. current EDW platforms have gaps and limitations that fail to meet these new requirements.

Forrester’s Big Data Warehouse Strategy Extends The Existing EDW FrameworkBased on interviews of customers and vendors, Forrester has laid out an architecture to guide enterprise architects in creating a big data warehouse framework tailored to their firm’s requirements to support both existing and new actionable business insights.

You Need A Big Data Warehouse Strategy To SucceedBig data warehouse is a modern data warehouse architecture that leverages traditional and new data repositories, in-memory, cloud, and other technologies.

Why read this report

EDW is not dead; it’s evolving! Enterprise data warehouses have come a long way in delivering value by predicting trends, minimizing churn, and identifying new business opportunities. however, in the era of big data, traditional EDW is failing to meet new business requirements, such as support for real-time and ad hoc customer analytics, new sources of data, and self-service capabilities. Enterprise architects should read this report to learn how the new big data warehouse addresses these gaps by delivering timely and actionable insights to gain competitive edge and enable innovation and growth.

Page 2: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

2

6

10

12

13

© 2016 Forrester research, inc. opinions reflect judgment at the time and are subject to change. Forrester®, technographics®, Forrester Wave, roleView, techradar, and total Economic impact are trademarks of Forrester research, inc. All other trademarks are the property of their respective companies. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

Forrester research, inc., 60 Acorn park Drive, cambridge, MA 02140 usA+1 617-613-6000 | Fax: +1 617-613-5000 | forrester.com

table of contents

EDW Has Been The Analytics Platform King For Decades

But New Business requirements Are changing EDW requirements

EDW technology Gaps Are Making Enterprises look Elsewhere

The Big Data Warehouse Extends The EDW Platform

Big Data Fabric connects the superset of Your Data sources — including Your BDWs

the BDW provides A comprehensive View And integrated Analytics

The Major EDW Vendors Provide BDW Components

BDW use cases Go Beyond traditional Analytics

recommendations

Extend Your Current EDW Platforms Toward A BDW Strategy

Supplemental Material

Notes & resources

Forrester interviewed various customers in the financial, oil and gas, retail, and healthcare sectors.

related research Documents

Big Data Fabric Drives innovation And Growth

the Forrester Wave™: Enterprise Data Warehouse, Q4 2015

techradar™: Big Data, Q1 2016

For ENtErprisE ArchitEcturE proFEssioNAls

The Next-Generation EDW Is The Big Data WarehouseBig Data Warehouses Drive Faster, Integrated, Self-Service Analytics

by Noel Yuhannawith Gene leganza and shreyas Warrier

August 29, 2016

Page 3: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

2

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

EDW has Been the Analytics platform King For Decades

the enterprise data warehouse is an architecture, not a technology. the traditional EDW platform has served and continues to serve a broad range of business users, including enterprise architecture (EA) pros, feeding both analytical and operational systems. EDWs:

› organize and aggregate historical analytical data from functional domains. EDWs house information from data subject areas such as customer, manufacturing, finance, and human resources that align with key processes, applications, and roles. Most of the traditional EDW platform has been built using relational database management system (DBMs) and columnar database platforms using extract-transform-load (Etl), change data capture (cDc), and replication technology (see Figure 1).

› offer a strong decision support framework. EDWs provide in-database analytics, predictive models, and embedded business algorithms to drive business decisions.

› Are central to a firm’s data ecosystem. the EDW is a proven ecosystem that supports integration with data models and security frameworks, automation, and a broad range of business intelligence (Bi) and visualization tools.1

› Provide the foundation for BI. EDWs support timely reports, ad hoc queries, and dashboards and supply other analytics applications with trusted and integrated data. Many use the EDW to deliver operational intelligence — in the form of query responses, reports, dashboards, charts, and other analytic views — in support of various decision scenarios.

Page 4: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

3

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

FIGUrE 1 the traditional Enterprise Data Warehouse platform

Relational

Columnar

Business intelligence

Operationalreporting

Analytics

Predictiveanalytics

Social

OLTP

CRM

ETL/CDC/replication

On-premises

• Modeling• Data quality• Security governance

transformation• Integration

Hybrid

ERP

SaaS

Compute/processingStorage/persistenceSource

Cloud

But New Business requirements Are changing EDW requirements

today, business users are demanding real-time analytics that’s integrated from legacy, social, and cloud sources, while business execs want self-service and autonomous access to fit-for-purpose customer data insights. in our 2016 global survey, 59% of respondents stated that leveraging big data and analytics was a critical or high priority (see Figure 2). But increasing data volume and dealing with multimodel customer data are slowing down timely analytics and putting constraints on traditional warehouse platforms, causing firms to revisit their EDW architectures. Businesses are reporting that current EDW platforms:

› can’t share current data quickly enough for timely business decisions. With increasing big data comes a major challenge for any enterprise: knowing what to look for and where, and then making sense of it. in our survey, 30% of businesses reported growth of data volume and variety affecting their Bi strategy (see Figure 3). Firms are realizing that traditional data warehouses fall short when it comes to real-time analytics.2

“With data explosion and increasing demand for real-time analytics by the business, we are finding it challenging to support our loB users. While we already use hadoop, our traditional data warehouses still are important for analytics, but we are now looking at modernizing that architecture.” (Enterprise architect, oil and gas, North America)

› Don’t support ad hoc and dynamic analytics for new customer trends. EDWs were built for a limited set of uses, providing answers to known questions. But 27% of enterprises report that fast-changing analytics and reporting requirements are one of the biggest challenges when orchestrating

Page 5: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

4

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

their Bi strategy, while 30% cite the growth and variety of their data. processes using traditional EDWs don’t scale well when you introduce ambiguity or add new and dynamic questions. EDWs need to ingest, process, and curate data continuously and support dynamic insights.

“We are now looking to [build] a modern data warehouse that can provide insights to all kinds of tough questions critical for our business to succeed. including identifying business risks and opportunity.” (Business analyst, financial services, Europe)

› Don’t provide a self-service platform for strategic and operational decision-making. When executives need to determine why something is happening or what the best course of action is, they can’t wait for a data processing cycle to make data available. Analysts need to be able to aggregate and prepare data sets without technology management’s involvement. twenty-seven percent of companies reported lack of end user self-service capabilities as one of the biggest challenges in executing their Bi strategy. self-service customer analytics has become critical for organizations to succeed.

“self-service for all data is our long-term strategic direction, and we know it’ll take us some time to get there, but we have to start somewhere. We have started to integrate our current EDW appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial services, North America)

FIGUrE 2 Big Data And Analytics have Become A priority

0%

9%

28%

40%

19%

1%

Not on our agenda

Low priority

Moderate priority

High priority

Critical priority

Don’t know

“Which of the following initiatives are likely to be your organization’s top business priorities?”(Better leverage big data and analytics in business decision-making)

Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016

Base: 3,343 data and analytics decision-makers

Page 6: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

5

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

FIGUrE 3 Data Growth And Variety Are Affecting Business intelligence And Analytics strategy

4%

16%

17%

17%

19%

20%

21%

22%

22%

25%

25%

25%

27%

30%

35%

Don’t know/does not apply

Lack of business C-level executive support

Widespread utilization of insights fordecision-making and planning

Inadequate change management programs(communications, incentives, etc.)

Lack of access to data and insights

Lack of end user self-service capabilities

Lack of data standards

Legal and regulatory compliance

Inadequate or missing relevant internal skills

Poor data quality

Lack of adequate user training

Lack of alignment between IT and business

Fast-changing analytic and reporting requirements

Growth of data volume/variety

Data security and privacy

“What are the biggest challenges your firm faces when orchestratingits business intelligence strategy?”

Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016

Base: 3,343 data and analytics decision-makers

EDW Technology Gaps Are making Enterprises Look Elsewhere

While traditional data warehouses often took years to build, deploy, and reap benefits from, today’s organizations want more simplified, agile, integrated, cost-effective, and automated solutions. Firms are revisiting their EDW strategies, as they spend too much time loading, unloading, transforming, securing, integrating, and curating customer data. Enterprises face:

Page 7: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

6

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

› A data volume explosion that’s affecting customer analytics. traditional structured data continues to grow rapidly, slowing down legacy data warehouse systems and affecting analytics and timely insights. regulatory requirements now mandate storing compliant data for several years, and business growth is generating more data at a faster pace than ever before.

“We are experiencing tremendous data explosion for traditional data sets that’s impacting our data warehouses. While we are still looking at improving the performance of existing data warehouses for the short term, we are now starting to look at alternatives, both supplementary and replacement as longer-term strategy.” (Enterprise architect, oil and gas, North America)

› Data variety that’s making it harder to support using traditional warehouses. Business users can’t easily spot patterns and trends in content such as documents, email, images, audio, and social media. in addition, storing, processing, and accessing unstructured data in data warehouses pushes the limits of traditional technologies and architectures, which were not designed to handle such data types.3

› Data speed that’s making it harder to keep up. New sources of data are coming in a lot faster, such as sensor and machine data, log and clickstream data, cloud and software-as-a-service (saas) data, and other streaming data. storing, transforming, and processing such data requires new technologies and systems to support new customer analytics, real-time analytics, and operational intelligence reporting.4

“For us, real-time data sharing is critical internally among business users but also with various partners that we engage with. currently, not all of our data is available to everyone, but we are looking at ways of expanding to support a more self-service real-time big data platform.” (Data scientist, biotechnology company, North America)

the Big Data Warehouse Extends the EDW platform

Firms are already using a variety of technologies in their big data strategy to support new, next-generation analytics (see Figure 4). the big data warehouse (BDW) is a modern data warehouse architecture that leverages traditional data warehouse architectures as well as modern big data technologies (see Figure 5). Forrester defines the big data warehouse as:

A specialized, cohesive set of data repositories and platforms used to support a broad variety of analytics running on-premises, in the cloud, or in a hybrid environment. BDW leverages both traditional and new technologies such as Hadoop, columnar and row-based data warehouses, ETL and streaming, and elastic in-memory and storage frameworks.

Page 8: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

7

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

FIGUrE 4 cloud, streaming, And Distributed in-Memory Are Already part of Firms’ Big Data strategy

“Which of the following are included in your plans for big data?”

8%

16%

18%

22%

23%

23%

26%

26%

27%

28%

30%

33%

36%

40%

Don’t know

NoSQL other than Hadoop

A massively parallel processing (MPP)data warehouse

Semantic technologies (ontology building,search, autocuration, graph, etc.)

Hadoop (including Hbase or Accumulo)

Data anonymization or de-identi�cation

Creating or building out a data lake

Marketing or digital data management platforms andservice providers that brand their offerings as . . .

Packaged analytics technologies that brandthemselves as big data

Unstructured data mining/analytics

Distributed in-memory databases, grids,analytics tools

Streaming analytics/computing

Large-scale predictive modelling, data mining,or other advanced analytics

Public cloud big data services

Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016

Base: 2,094 data and analytics decision-makers

Page 9: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

8

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

FIGUrE 5 Big Data Warehouse Architecture

Relational

Columnar

Apache Hadoop

Businessintelligence

Operationalreporting

Analytics

Predictiveanalytics

Social

Devices

OLTP

ERP

ETL/CDC/replication

Machine learning

Data quality

Security

Governance

Hybrid

CRM

SaaS

Sensors Real-timeanalytics

Streaming

Transformation

IntegrationIn-memory/

Apache Spark

Self-service

Ad hocinteractions

Modeling

On-premises

Storage/computeprocessing ManagementSources Use casesInteraction

Cloud

Big Data Fabric connects The Superset of Your Data Sources — Including Your BDWs

the big data warehouse is part of a larger big data fabric architecture, which embodies data from multiple — potentially distributed — data sources, including BDWs and data lakes. the big data fabric architecture enables integration, data quality, security, governance, data curation, data preparation, and data management to support an end-to-end, real-time big data platform (see Figure 6).5 the two architectures:

› can exist separately but work best as complements. Multiple traditional EDWs, BDWs, and data lakes have become the new norm to support the variety of analytical workloads. While both BDWs and big data fabric architectures can exist independent of each other, typically firms leverage both to deliver a blend of real-time and batch across various distributed enterprise data sets to support broader use cases. For example, some financial services organizations use the BDW to support mostly financial data analytics — leveraging columnar data warehouses, hadoop, and Etl technologies. the BDW also acts as a source within the big data fabric architecture that delivers real-time customer analytics across BDW, twitter, salesforce, and clickstream data.

Page 10: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

9

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

› Vary significantly in the amount of data transformation required. We often see big data fabric used for real-time analytical use cases that integrate data across many disparate sources, including BDW, with the BDW used mostly for batch and near-real-time analytics for data stored in a data warehouse and hadoop clusters that require aggregation, transformation, and further processing before becoming available to Bi users or analytical processes. Exploration occurs within the fabric, with transformations captured within the BDW.

FIGUrE 6 Big Data Fabric Architecture integrated With Big Data Warehouse

Big data fabric

Data ingestion(streaming/replication/batch)

Processing andpersistence

On-premises sources Cloud sources

HadoopSpark

BDW

Hadoop

EDW

Spark

New York

Singapore

The BDW Provides A comprehensive View And Integrated Analytics

A key component of the BDW architecture is the ability to leverage various specialized data repositories such as traditional relational data warehouses, columnar data warehouses, and hadoop. unlike traditional data warehouses, the BDW minimizes complexity and hides heterogeneity by embodying a trusted model, supports all kinds of data types including unstructured data, and adapts to changing business requirements more rapidly through a self-service platform. the BDW centralizes administration of distributed data repositories, in-memory compute resources, metadata, storage, access, and processing functions. it leverages new technologies such as:

› Hadoop to support diverse data sets and distributed computing. By leveraging hadoop, the BDW enables organizations to deal with a wider variety of data structures than traditional EDWs. hadoop can also deal with extremely large data sets that are inappropriate for traditional EDW

Page 11: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

10

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

platforms. Enterprise architects can choose to store data in relational, columnar, wide columns, or hadoop based on business needs. For example, a retailer leverages legacy structured data stored in a traditional data warehouse, and hadoop for clickstream data, and integrates them to deliver a 360-degree view of the customer for recommendations and churn analysis.

› In-memory to enable faster customer analytical capabilities. A key component of the BDW is the ability to use in-memory to deliver performance and faster access to business data. We are heading toward having large memory platforms that will store petabytes in DrAM and Flash/ssD in the coming years. For example, several retailers are using BDW to leverage customer-related data to determine product discounting strategy, optimize product distribution across stores, and enable personalized customer experiences.

› Streaming engines to support new data channels for ingestion and processing. Market data, clickstream, mobile devices, and sensors are new sources for analytical information that are not in your existing data warehouse. streaming technology boosts integrating, transforming, and curating data on diverse data streams in real time.6 integrating streaming technology with data platforms such as hadoop and spark — as well as traditional data warehouses — has become critical. For example, we see oil and gas industry firms leveraging streaming technology for insights into new business opportunities, such as predicting staffing and resource requirements for various drilling sites and performing machine failure analysis.

the Major EDW Vendors provide BDW components

From an implementation viewpoint, most enterprises are currently building BDW platforms themselves by integrating their traditional data warehouses with Apache spark, hadoop, storm, and in-memory technologies. Forrester sees many enterprises already using an extract-hadoop-load (Ehl) approach to:

1. Extract data from various source systems such as traditional databases and flat files.

2. load data into hadoop to perform aggregation and transformation using Apache hadoop ecosystem tools.

3. Finally load the result into the EDW platform.7

BDW Use cases Go Beyond Traditional Analytics

Adoption of BDW architectures will accelerate as enterprises run into existing EDW challenges. But building a BDW platform internally will require more time and effort, which will likely put pressure on the overall business technology (Bt) agenda. the good news is that solutions are starting to emerge from vendors such as iBM, Microsoft, oracle, sAp, snowflake, and teradata that provide some or all of the components to build and deploy a BDW strategy.8 Enterprises are already using BDWs to support social analytics, risk analysis, campaign analysis, fraud assessment, and pricing trends. the top BDW use cases include:

Page 12: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

11

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

› Integrated analytics. A key challenge in the traditional EDW approach was that if data didn’t exist in the warehouse, you couldn’t do any analytics — full stop. With BDW architecture, you can perform integrated analytics across data warehouse and hadoop clusters. hadoop can store and process large sets of semistructured and unstructured data, log files, and streaming data with ease. For example, health research often requires looking at complex patient data and determining how effective a treatment is likely to be based on factors like age, sex, and health status. the BDW enables gathering and storing millions of data points in hadoop and performing complex navigation and modeling using traditional data warehouse and in-memory technology.

› Internet-of-things (IoT) analytics. traditional data warehouses don’t deal with iot data. however, the BDW offers the ability to store, process, and access large volumes of iot data from sensors and devices in hadoop repositories efficiently through automation and machine learning technologies. Manufacturers deal with highly sophisticated machinery to support their plants, whether they’re building a car, airplane, or tire or bottling wine or soda. Every minute of machine downtime can cost a manufacturer dearly. iot analytics on BDW platforms enables manufacturers to predict machine failures based on sensor data, minimizing or eliminating production slowdown.

› right-time business analytics. traditional EDW architectures were based on mostly batch processing, with Etl doing the heavy lifting of data from traditional systems to operational systems to data warehouses. As a result, by the time data arrived in data warehouses, it was already 12 to 48 hours old. BDWs enable right-time analytics by leveraging streaming and replication with direct access to data sources, whether on-premises or cloud, bypassing traditional Etl approaches. the financial services industry has been an early adopter of BDW to support right-time analytics for portfolio management, fraud detection, and asset management.

› Adaptive, self-service analytics. Most EDWs use predefined data sources to deliver predictive analytics, trends, and insights. the BDW enables organizations to dynamically leverage new data sources quickly to deliver new insights. it enables self-service capabilities for business users to ask complex and new questions so they can make more accurate decisions. the BDW adapts to the new sources and can help correlate data using machine learning and adaptive intelligence. For example, a major European bank recently built a BDW framework that business units now use to support self-service for making better decisions on investments and risks. the platform represents a major shift from the static reports the bank used previously.

Page 13: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

12

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

recommendations

Extend Your current EDW platforms toward A BDW strategy

Don’t throw away your existing EDW platform! the investments you have already made in EDWs will form the foundation of the next-generation BDW strategy. however, attaining this demands that you rearchitect your existing EDW platform and invest in new technologies to deliver on a new vision of right-time analytics, self-service, and intelligent and contextualized customer analytics. Forrester recommends that enterprise architects extend existing EDW platforms toward a BDW strategy by leveraging:

› Hadoop for low-cost storage and processing of big data. let hadoop be the first stop for your big data that has no other home in your data warehouse. hadoop offers the ability to store very large volumes of data (including unstructured data) more efficiently than traditional warehouses — and at a fraction of cost. in addition, hadoop helps you offload data from traditional warehouses and leverage a distributed computing framework to perform transformation, aggregation, and curation quickly.

› In-memory technology to support right-time analytics. Without in-memory technology, customer analytics, personalization, and right-time analytics will run slowly. this could cause you to miss key trends like customer churn or miss the opportunity to offer new products and services or identify weak markets. You can also use data from the BDW as part of the bigger big data fabric framework that leverages distributed in-memory computing to deliver a broader enterprise information fabric.

› Hybrid platforms to support on-demand and scalable BDWs. storing all of your data on-premises need no longer be the default. cloud platforms like those from Amazon Web services, Google, iBM, Microsoft (Azure), oracle, and rackspace offer pay-as-you-go facilities to store, process, and access any amount of data.9 hybrid is the new norm — look at utilizing both on-premises and cloud data warehouse platforms as part of your BDW architecture, with a common administration facility.

› Vendor solutions that help achieve faster time-to-value. Data warehouse, hadoop, and other big data solutions from vendors such as cloudera, hortonworks, iBM, Mapr technologies, Microsoft, oracle, sAp, and teradata can reduce time-to-value by automating and simplifying various BDW functions and implementation steps. look at vendors that support broader solutions and can support your business data. Ask your vendor how it plans to provide the BDW vision. review the various components that the vendor has integrated and ask how it plans to fill any gaps.

Page 14: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

13

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

supplemental Material

Forrester’s Global Business technographics® Data And Analytics survey, 2016 was fielded in March 2016. this online survey included 3,343 respondents in Australia, Brazil, canada, china, France, Germany, india, New Zealand, the uK, and the us from companies with 100 or more employees.

Forrester’s Business technographics ensures that the final survey population contains only those with significant involvement in the planning, funding, and purchasing of business and technology products and services. research Now fielded this survey on behalf of Forrester. survey respondent incentives include points redeemable for gift certificates.

please note that the brand questions included in this survey should not be used to measure market share. the purpose of Forrester’s Business technographics brand questions is to show usage of a brand by a specific target audience at one point in time.

Engage With An Analyst

Gain greater confidence in your decisions by working with Forrester thought leaders to apply our research to your specific business and technology initiatives.

Forrester’s research apps for iPhone® and iPad®

stay ahead of your competition no matter where you are.

Analyst Inquiry

to help you put research into practice, connect with an analyst to discuss your questions in a 30-minute phone session — or opt for a response via email.

learn more.

Analyst Advisory

translate research into action by working with an analyst on a specific engagement in the form of custom strategy sessions, workshops, or speeches.

learn more.

Webinar

Join our online sessions on the latest research affecting your business. Each call includes analyst Q&A and slides and is available on-demand.

learn more.

Page 15: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

For EntErprisE ArchitEcturE proFEssionAls

The Next-Generation EDW Is The Big Data WarehouseAugust 29, 2016

© 2016 Forrester research, inc. unauthorized copying or distributing is a violation of copyright law. [email protected] or +1 866-367-7378

14

Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics

Endnotes1 today, organizations still rely on EDW platforms to deliver actionable, timely, and trustworthy intelligence. EDW

technology organizes and aggregates analytical data from various functional domains and serves as a critical repository for organizations’ operations. see the “the Forrester Wave™: Enterprise Data Warehouse, Q4 2015” Forrester report.

2 it takes a long time to measure a business process. Enterprise data hubs need to accommodate more data and an infinite set of queries. see the “create A road Map For A real-time, Agile, self-service Data platform” Forrester report.

3 Data consumers — from casual data analysts to data scientists to your customers — are looking across a broad variety of data today to find answers to their questions. see the “compose Digital Data to create A symphony of insight” Forrester report.

4 Data bottlenecks create business bottlenecks. the days of provisioning data to simply meet the requirements of systems of record are over. Business stakeholders at the executive and line-of-business levels need data faster to keep up with customers, competitors, and partners. see the “create A road Map For A real-time, Agile, self-service Data platform” Forrester report.

5 Forrester defines big data fabric as “bringing together disparate big data sources automatically, intelligently, and securely, and processing them in a big data platform technology, such as hadoop and Apache spark, to deliver a unified, trusted, and comprehensive view of customer and business data.” see the “Big Data Fabric Drives innovation And Growth” Forrester report.

6 streaming technology helps integrating, transforming, and curating data on diverse data streams in real time. see the “the Forrester Wave™: Big Data streaming Analytics, Q1 2016” Forrester report.

7 Forrester sees many enterprises already using an extract-hadoop-load approach to extract data from various source systems, such as iot devices and cloud and traditional platforms, then load it into hadoop, perform aggregation and transformation, and finally load it into the EDW to support business analytics. see the “the Forrester Wave™: Enterprise Data Warehouse, Q4 2015” Forrester report.

8 Most big data integration vendors focus on making classic processes faster with tools for moving data into a lake and working with it there. three innovative vendors — looker Data sciences, snaplogic, and snowflake computing — offer alternative approaches. see the “Breakout Vendors: Big Data integration” Forrester report.

9 According to Forrester customer feedback, such cloud-based storage is typically over 20% less expensive than on-premises deployment.

Page 16: The Next-Generation EDW Is The Big Data Warehouse · appliances to hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial

We work with business and technology leaders to develop customer-obsessed strategies that drive growth.

Products and services

› core research and tools › data and analytics › Peer collaboration › analyst engagement › consulting › events

Forrester research (nasdaq: Forr) is one of the most influential research and advisory firms in the world. We work with business and technology leaders to develop customer-obsessed strategies that drive growth. through proprietary research, data, custom consulting, exclusive executive peer groups, and events, the Forrester experience is about a singular and powerful purpose: to challenge the thinking of our clients to help them lead change in their organizations. For more information, visit forrester.com.

client suPPort

For information on hard-copy or electronic reprints, please contact client support at +1 866-367-7378, +1 617-613-5730, or [email protected]. We offer quantity discounts and special pricing for academic and nonprofit institutions.

Forrester’s research and insights are tailored to your role and critical business initiatives.

roles We serve

Marketing & Strategy ProfessionalscMoB2B MarketingB2c Marketingcustomer experiencecustomer insightseBusiness & channel strategy

Technology Management Professionalscioapplication development & delivery

› enterprise architectureinfrastructure & operationssecurity & risksourcing & vendor Management

Technology Industry Professionalsanalyst relations

128005