20
PRESENTATION ON DATA WAREHOUSING Presented By: Jagnesh Chawla Manpreet Singh Mintu

data warehousing

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: data warehousing

PRESENTATION ON

DATA WAREHOUSING

Presented By:Jagnesh ChawlaManpreet SinghMintu

Page 2: data warehousing

CONTENTS:

Meaning Of data warehousing Benefit of data warehousing Problems Architecture of data warehouse Main components Data flows Tools and technologies Data Mart

Page 3: data warehousing

MEANING: Data warehouse is data management and

data analysis

Goal: is to integrate enterprise wide corporate data into a single reository from which users can easily run queries

Page 4: data warehousing

BENEFITS:

The major benefit of data warehousing are high returns on investment.

Increased productivity of corporate decision-makers

Page 5: data warehousing

PROBLEMS: Underestimation of resources for data

loading Hidden problems with source systems Required data not captured Increased end-user demands Data homogenization High demand for resources Data ownership High maintenance Long-duration projects Complexity of integration

Page 6: data warehousing

ARCHITECTURE:

Operational data source1 Query Manage

Warehouse Manager

DBMS

Operationaldata source 2

Meta-dataHigh

summarized data

Detailed data

Lightly summarized

data

Operational data store (ods)

Operational data source n

Archive/backup data

Load Manager

Data mining

OLAP(online analytical processing) tools

Reporting, query,application development, and EIS(executive information system) tools

End-useraccess toolsTypical architecture of a data warehouse

Operational data store (ODS)

Page 7: data warehousing

MAIN COMPONENTS:

Operational data sourcesfor the DW is supplied from mainframe operational data held in first generation hierarchical and network databases, departmental data held in proprietary file systems, private data held on workstaions and private serves and external systems such as the Internet, commercially available DB, or DB assoicated with and organization’s suppliers or customers

Operational datastore(ODS)is a repository of current and integrated operational data used for analysis. It is often structured and supplied with data in the same way as the data warehouse, but may in fact simply act as a staging area for data to be moved into the warehouse

Page 8: data warehousing

MAIN COMPONENTS:

query manageralso called backend component, it performs all the operations associated with the management of user queries. The operations performed by this component include directing queries to the appropriate tables and scheduling the execution of queries

end-user access toolscan be categorized into five main groups: data reporting and query tools, application development tools, executive information system (EIS) tools, online analytical processing (OLAP) tools, and data mining tools

Page 9: data warehousing

DATA FLOW:

Inflow- The processes associated with the extraction, cleansing, and loading of the data from the source systems into the data warehouse.

upflow- The process associated with adding value to the data in the warehouse through summarizing, packaging , packaging, and distribution of the data

downflow- The processes associated with archiving and backing-up of data in the warehouse

Page 10: data warehousing

DATA FLOW:

outflow- The process associated with making the data availabe to the end-users.

Meta-flow- The processes associated with the management of the meta-data

Page 11: data warehousing

Operational data source1

Warehouse Manager

DBMS

Meta-data High summarized data

Detailed data

Lightly summarized

data

Operational data store (ods)

Operational data source n

Archive/backup data

Load Manager

Data mining toolsEnd-useraccess tools

Information flows of a data warehouse

Downflow

Inflow

Meta-flow

Upflow Query Manage

Outflow

Warehouse Manager

Page 12: data warehousing

TOOLS AND TECHNOLOGIES: The critical steps in the construction of a

data warehouse:

a. Extraction

b. Cleansing c. Transformation

Page 13: data warehousing

TOOLS AND TECHNOLOGIES: after the critical steps, loading the results

into target system can be carried out either by separate products, or by a single, categories:

code generators

database data replication tools

dynamic transformation engines

Page 14: data warehousing

MANAGEMENT TOOLS: For the various types of meta-data and the

day-to-day operations of the data warehouse, the administration and management tools must be capable of supporting those tasks:

Monitoring data loading from multiple sources

Data quality and integrity checks

Managing and updating meta-data

Monitoring database performance to ensure efficient query response times and resource utilization

Page 15: data warehousing

Auditing data warehouse usage to provide user chargeback information

Replicating, subsetting, and distributing data Maintaining effient data storage

management Purging data; Archiving and backing-up data Implementing recovery following failure Security management

Page 16: data warehousing

DATA MART:

Data mart a subset of a data warehouse that supports the requirements of particular department or business function

The characteristics that differentiate data marts and data warehouses include:

A data mart focuses on only the requirements of users associated with one department or business function

Page 17: data warehousing

Operational data source1

Warehouse Manager

DBMS

Operationaldata source 2

Meta-dataHigh

summarized data

Detailed data

Lightly summarized

data

Operational data store (ods)

Operational data source n

Archive/backup data

LoadManager

End-useraccess tools

Typical data warehouse adn data mart architecture

Operational data store (ODS)

QueryManage

summarizeddata(Relational database)

Summarized data (Multi-dimension database)

Data Mart

(First Tier) (Third Tier)

(Second Tier)

Warehouse Manager

Page 18: data warehousing

DATA MART ISSUES: Data mart functionalitythe capabilities of data marts

have increased with the growth in their popularity

Data mart sizethe performance deteriorates as data marts grow in size, so need to reduce the size of data marts to gain improvements in performance

Data mart load performancetwo critical components: end-user response time and data loading performanceto increment DB updating so that only cells affected by the change are updated and not the entire MDDB structure

Page 19: data warehousing

REFERENCES:

Book of DBMS Google.com Wikipedia, the free encyclopedia InformIT.com Allfree-stuff.com

Page 20: data warehousing