francis-york
IBM Software Group
© IBM Corporation
IBM Information Server
Understand - Information Analyzer
IBM Information Server: Delivering information you can trust
• Understand: Discover, model, and govern information structure and content
• Cleanse: Standardize, merge, and correct information
• Transform: Combine and restructure information for new uses
• Deliver: Synchronize, virtualize, and move information for in-line delivery
Platform Services: Parallel Processing, Connectivity, Metadata, Deployment, Administration
Support for Service-Oriented Architectures
The IBM Solution: IBM Information Server
Delivering information you can trust
• Understand, Cleanse, Transform, Deliver
• Parallel Processing
• Rich Connectivity to Applications, Data, and Content
• Unified Deployment
• Unified Metadata Management
Understand: Information Analyzer
Data profiling for understanding what data you have and how it relates to other data, plus data analysis for measuring and monitoring ongoing data quality.
Data Profiling
Critical Problems:
• You don't know what data is really in your legacy systems
• Sources have changed or are new and unknown
Why?
• Data values and relationships are inconsistent and divergent from documented rules
• Incomplete and missing documentation
• Data sources are never static and frequently change without warning
Alternative Approach:
• Labor intensive, resource devouring process
• Never review 100% of data elements
• No infrastructure to support maintenance
• No standardized approach across projects
• 1st generation tools document but don't address the problem resolution
Data Sources:
• Mainframe manufacturing system
• Demographic
• Contact
• Billing / Accounts
• External Lists
• Distribution
• ERP from acquisition
• Parts BOM
About Information Analyzer
Automates your data discovery process
Enables you to understand your data before starting development
Eliminates the risk and uncertainty of using bad data
Useful in any type of data migration project
Analyzes every data attribute and reverse engineers the true metadata of your source
Reduces time to analyze data
IBM Information Analyzer
Reduce Time to Value of Data Projects
Increase the Productivity of Data Personnel
Assess Data Quality & Consistency across the Enterprise
Results sharable across IBM Information Server
Data Profiling: the process of analyzing a data source to determine its content, quality, and structure
What does Information Analyzer provide?
Source System Analysis: Provides the key understanding of the source data
• Column & Domain analysis
• Table/Primary Key analysis
• Foreign Key analysis
• Cross-Domain analysis
Iterative Analysis: Leverages the analysis to facilitate iterative tests
• Baseline analysis
Diagram: Column Analysis, Primary Key Analysis, and Foreign Key & Cross-Domain Analysis applied across Source 1 and Source 2
Source System Analysis
Column & Domain analysis: Infers a column's classification, physical properties, and frequency distribution from its content.
Table/Primary Key analysis: Validates the uniqueness of the identified key column, which ensures that a given row of data can be clearly identified and related to other data.
Cross-Domain & Foreign Key analysis: Synchronizes the structure, relationships, and integrity of data environments by finding and validating otherwise unknown relationships and identifying critical integrity violations that need to be rectified.
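To make the foreign key idea concrete, here is a minimal Python sketch of the kind of check such an analysis performs: how many values in a candidate child column appear in a candidate parent key, and which values are orphans. This is an illustration only, not Information Analyzer's implementation; the function name `foreign_key_check` and the sample columns are hypothetical.

```python
def foreign_key_check(child_values, parent_values):
    """Compare a candidate foreign key column against a candidate parent key."""
    parent = set(parent_values)
    # Nulls are ignored here; whether a null counts as a violation is a policy choice.
    child = [v for v in child_values if v is not None]
    matched = sum(1 for v in child if v in parent)
    orphans = sorted({v for v in child if v not in parent})
    match_pct = 100.0 * matched / len(child) if child else 0.0
    return {"match_pct": match_pct, "orphans": orphans}


# Hypothetical sample data: order rows referencing customer ids.
orders_customer_id = [101, 102, 102, 105, None, 999]
customers_id = [101, 102, 103, 104, 105]

result = foreign_key_check(orders_customer_id, customers_id)
print(result)
```

A high match percentage suggests a relationship worth validating, and the orphan list points at the integrity violations that would need to be rectified.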
Column Analysis: Tabular View
Column Analysis: Chart View
Frequency Distribution
• View the frequency distribution in either tabular or graph form
• Add user-defined values to the frequency distribution
• Generate reference tables
• Sort and filter frequency data
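As a rough illustration of what a frequency distribution contains, the following Python sketch counts each distinct value in a column and reports its share of the rows, sorted by frequency. It is a simplified stand-in, not the product's logic; `frequency_distribution` and the sample `states` column are hypothetical.

```python
from collections import Counter


def frequency_distribution(values):
    """Return (value, count, percent) rows, most frequent first."""
    counts = Counter(values)
    total = len(values)
    rows = [
        (value, count, round(100.0 * count / total, 1))
        for value, count in counts.items()
    ]
    # Sort by descending count, then by the value's text for a stable order.
    rows.sort(key=lambda row: (-row[1], str(row[0])))
    return rows


# Hypothetical column with an inconsistent spelling and a missing value.
states = ["NY", "CA", "NY", "N.Y.", "CA", "NY", None, "TX"]
dist = frequency_distribution(states)

for value, count, pct in dist:
    print(f"{value!s:6} {count:3} {pct:5.1f}%")
```

Low-frequency rows such as `N.Y.` are the ones a reviewer would typically inspect, since they often indicate variant spellings of a valid value.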
Column Analysis: Properties
Properties
• Six property values are inferred for each column: Data Type, Length, Precision, Scale, Nullability, and Cardinality Type.
• The distribution of data types, lengths, precisions, and scales is displayed graphically.
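The six inferred properties can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the rules Information Analyzer applies: values arrive as strings, only three data types are distinguished, and the cardinality labels (`unique`, `constant`, `not constrained`) are assumed names. The function `infer_properties` is hypothetical.

```python
from decimal import Decimal, InvalidOperation


def _as_decimal(text):
    """Return a finite Decimal for text, or None if it is not numeric."""
    try:
        d = Decimal(text.strip())
    except InvalidOperation:
        return None
    return d if d.is_finite() else None


def infer_properties(values):
    """Infer type, length, precision, scale, nullability, and cardinality."""
    present = [v for v in values if v not in (None, "")]
    decimals = [_as_decimal(v) for v in present]
    if present and all(d is not None for d in decimals):
        # Scale: most digits after the decimal point seen in any value.
        scale = max(max(0, -d.as_tuple().exponent) for d in decimals)
        # Integer digits: most digits before the decimal point seen in any value.
        int_digits = max(
            max(0, len(d.as_tuple().digits) + d.as_tuple().exponent)
            for d in decimals
        )
        precision = int_digits + scale
        data_type = "decimal" if scale > 0 else "integer"
    else:
        data_type, precision, scale = "string", None, None

    distinct = len(set(present))
    if distinct == len(present):
        cardinality = "unique"
    elif distinct == 1:
        cardinality = "constant"
    else:
        cardinality = "not constrained"

    return {
        "data_type": data_type,
        "length": max((len(v) for v in present), default=0),
        "precision": precision,
        "scale": scale,
        "nullable": len(present) < len(values),
        "cardinality": cardinality,
    }


props = infer_properties(["12.50", "3.1", "100", None])
print(props)
```

Comparing inferred properties like these against the defined column metadata is what exposes mismatches, for example a column defined as text that only ever holds integers.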
Primary Key Analysis Results
Reviewing Duplicates
• View a summary of distinct and duplicated values
• Display a list of all primary key values with the number and percentage duplicated
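A duplicate review of this kind can be approximated with a short Python sketch that summarizes distinct and duplicated values for a candidate key. The function name `primary_key_summary` and the sample keys are hypothetical, and the percentage definition (share of rows whose key value occurs more than once) is an assumption rather than the product's documented formula.

```python
from collections import Counter


def primary_key_summary(keys):
    """Summarize distinct and duplicated values for a candidate primary key."""
    counts = Counter(keys)
    total = len(keys)
    duplicated = {key: count for key, count in counts.items() if count > 1}
    # Rows whose key value appears more than once anywhere in the column.
    duplicate_rows = sum(duplicated.values())
    return {
        "total_rows": total,
        "distinct_values": len(counts),
        "duplicated_values": duplicated,
        "duplicate_pct": round(100.0 * duplicate_rows / total, 1) if total else 0.0,
    }


# Hypothetical key column; tuples could be used instead for a multi-column key.
summary = primary_key_summary(["A1", "A2", "A3", "A2", "A4", "A2"])
print(summary)
```

A column qualifies as a primary key candidate only when `duplicated_values` is empty, so any entries listed there are the rows to investigate.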
Cross Domain Analysis Results
Baseline Analysis
Baseline Differences
• Detailed results at the column level.
• Results include column-level summaries of differences for both Structure (Defined and Inferred) and Content.
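The idea of comparing a later analysis against a saved baseline can be sketched in Python as a column-by-column diff of summary properties. This is an illustrative assumption about the shape of the data, not the product's storage format; `baseline_differences` and the property names (`defined_type`, `inferred_type`, `distinct`) are hypothetical.

```python
def baseline_differences(baseline, current):
    """Return, per column, the properties whose values changed since the baseline."""
    diffs = {}
    for column in sorted(set(baseline) | set(current)):
        before = baseline.get(column, {})
        after = current.get(column, {})
        changed = {
            prop: (before.get(prop), after.get(prop))
            for prop in sorted(set(before) | set(after))
            if before.get(prop) != after.get(prop)
        }
        if changed:
            diffs[column] = changed
    return diffs


# Hypothetical snapshots: structure (defined and inferred type) and content (distinct count).
baseline = {"zip": {"defined_type": "char", "inferred_type": "integer", "distinct": 120}}
current = {"zip": {"defined_type": "char", "inferred_type": "string", "distinct": 131}}

diffs = baseline_differences(baseline, current)
print(diffs)
```

Each entry maps a property to a (baseline, current) pair, so unchanged properties such as `defined_type` drop out and only the structural and content changes remain.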
Sharing Analysis across Information Server
ROI: Food Distribution
Company Facts:
• Largest distributor in North America
• Four major acquisitions in last two years
• 12,000 branded products
• 30,000 clients
• 11 operating centers
Project Goals:
• Integration of supply chain management systems
• Profit margin analysis systems
• Field expansion, and take along project
Challenges:
• Staff changes and limited documentation related to acquired systems
• Only 7% of data being analyzed, but bad data causing 20% of cost overruns
• Estimate 10k hours and $650k in costs to support first four projects
Results:
• 80% productivity gain for analyzing data sources
• $504,000 annual savings in lower development and maintenance costs
• Repeatable process for all future projects that ensures good, actionable data
ROI: Top US Life Insurance Company
Company Facts:
• #1 largest life insurance company in the USA
• US$138 billion in assets under management
• Offers a complete line of life insurance, investment, retirement, and related products
Project Goals:
• Competitive pressures require the company to further enhance an existing competitive advantage: a 360-degree customer view and 24/7 data availability.
Challenges:
• Detailed customer data resided in ten disparate legacy systems with little to no documentation. Presenting raw detailed data 24/7 was impossible.
Results:
• Leveraging IBM allows the company to apply consistent data formats, validate data domains, and define business rules linking policy data.
• Better customer visibility.
• Reduced costs by eliminating expensive and time-consuming investigations of detailed data.
• Redeploying an investigator saves $130k annually.
Thank You