Upload
others
View
4
Download
0
Embed Size (px)
Citation preview
Enhancing Data Quality and Governancewith IBM Information Server 11.7
Brian MayerInformation Governance Specialist
Matt Crittenden Information Governance Architect
WEDNESDAY, NOVEMBER 14, 2018
Please noteIBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice and at IBM’s sole discretion.
Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision.
The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract.
The development, release, and timing of any future features or functionality described for our products remains at our sole discretion.
Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here. 2
Agenda• Introduction• Brief intro to Governance• Overview of IBM Information Server• My top 10 features for speeding up governance efforts
3
Use Cases Driving a Unified Governance Strategy
GOVERNANCE FOR COMPLIANCE
Discover, classify and manage information in ways that meet
the obligations enforced by both regulatory and corporate
mandates
GOVERNANCE FOR INSIGHTS
Provide safe access to trusted, high quality data while
facilitating effective collaboration among team members to become a data
driven organization
Confidence Availability Analytics Value4
Business Ready Data=
Right Data+
Right Place+
Right Time6
Business Outcomes: Market Share & Operational Efficiency
IBM Information Governance Process Framework
7
Siloed Efforts
Reactive
Proactive
Business Ready
Data Quality is not a focus at point of creation. Continuous Improvement in your Information Supply Chain does not exist.
Departmental Data
Improvements
Enterprise level information governance funded and sustained as a part of “How You Do Business.”
Your data is Business Ready for all consumers now and as tomorrow’s requirements emerge.
Limited metrics not directly tied to governance
Range of disconnected,
discipline-specific tools
Data Stewards, Policies &
Rules Business Focused
Defined, formally reviewed
governance metrics
Enterprise-based
integration & governance
tools with LOB access
Journey Map: Organizations are at different points
8
Best Practices:
Imperative to design initial scope around a specific business problem
Articulated in terms the business understands
Root cause of program failure is the lack of linkage to business value
Quantitative Metrics
Qualitative Metrics
Define Business Need/Problem
9
Best Practices:
Start small
Celebrate and communicate successes
Define Business Need/Problem
10
11
Best Practices:
Executive support is essential to the success of program
Must show value to gain/maintain support
Identify sponsor through interest in key business initiatives driven by data◦ If project is Business driven (i.e., top down), leaders should be executive level ◦ If project is IT driven (i.e., bottom up), leaders should be senior IT leaders who are
respected by executive level business leaders and have deep experience and understanding of the business processes
Build team around business need/problem…include IT and LOB
Establish/Expand Executive Sponsorship
12
Best Practices:
Roadmap outlines people, process and technology tasks◦ People: Appoint executive leadership to lead the design & implementation of Governance
Operating Model ◦ Process: Promote new habits for all information governance consumers across the
business (Scope out Change Management and Accountability programs)◦ Technology: Tooling to facilitate data quality, traceability, lineage, monitoring and
accountability
Create guardrails and guideposts to assist with setting data policies and standards
Build Roadmap/Outline, Guardrails & Guideposts
13
Information is a company asset. It will be managed according to the prescribed governance policies.
Information is identified and classified. All information stored by the company will be identified and classified according to its sensitivity and content. This classification will determine how the information will be governed.
Information is a sharable resource. It should be made available for all legitimate needs of the company.
Information is owned. There is an individual responsible for the appropriate management and governance of each information collection.
Information users are identified. An individual will be identified and be accountable for each access and change they make to information.
Information is protected. Information is secured from unauthorized access and use.
Information users are responsible. Individuals are responsible for safeguarding the information that they own, access and use.
Decision makers use appropriate data. Decision makers are responsible for ensuring they are using information of appropriate integrity for their work.
Information is kept as long as it is needed. Information is disposed of appropriately when it is no longer needed.
Information quality is everyone’s responsibility. Information is validated and where necessary it is corrected and made complete.
Information is managed in a cost effective manner. This is achieved through a well-defined information architecture that follows standards and best practices.
Information and analytics will only be used for approved, ethical purposes. Each new use of analytics is reviewed to ensure it will not damage the reputation of the organization.
Example Guardrails & Guideposts
Governance Council(CDO?)
Data Stewards
Data Policies
Data Standards
Establish Operating Model
14
Business Outcomes: Market Share & Operational Efficiency
IBM Information Governance Process Framework
15
DataGoneBad
WellGoverned
Data
IBM Unified Governance StrategyA modernized governance approach would need to safely democratize data while ensuring compliance mandates are met in parallel
– Governance as a service
– Structured and unstructured
– Private and public cloud
– Self service
– Open and extensible
–Automated and Intelligent
–Easy to deploy– Infused with industry knowledge
RULES
REQUIREMENTS
LAW
STANDARDS
REGULATIONS
GOVERNANCE
TRANSPARENCY
POLICIES
Insights and Compliance
17
Data Quality•Analyze and Classify•Cleanse and Standardize•Define and Monitor Data
Rules
Data Integration•Massive Scalability•Power for any complexity•Deliver in batch and/or real-
time with change capture
Data Governance•Create an Enterprise
Language•Document Requirements•Catalog of Governed Data•Support Compliance
thru Lineage
IBM Information ServerThe Foundation for Unified Governance
InfoSphereInformation
ServerData
IntegrationData
Quality
Data Governance
18
10. Advanced
Search? ? ? ?
The advanced gear is hiding in plain site… This will take you to the Advanced Search!19
10. Advanced
Search? ? ? ?
The magical gear allows you to access
all attributes (including Custom)
for the asset chosen.
20
10. Advanced
Search? ? ? ?
Advanced Search will narrow down the attribute list to only those shared across the asset list.
21
10. Advanced
Search? ? ? ?
Advanced Search is far more visible as “Filters” in
current releases!
22
10.Advanced
Search
9. BatchEdit
? ? ?
Along with Advanced Search is
Batch Edit. From your
search results,
select items and click Edit from
menu.
23
10.Advanced
Search
9. BatchEdit
? ? ?
Your edit options are always limited to attributes
shared across all asset types as before.
24
10.Advanced
Search
9. BatchEdit
? ? ?
“Add or Replace” is incredibly valuable making
updates easier!
25
10.Advanced
Search
9. BatchEdit
8.Query
Builder? ?
Governance often requires a quick answer to a new question. Enter Query Builder! Choose what you
wish to see on the DISPLAY panel. 26
10.Advanced
Search
9. BatchEdit
8.Query
Builder? ?
and choose your selection criteria/parameters on the CRITERIA panel then click
Run.27
10.Advanced
Search
9. BatchEdit
8.Query
Builder? ?
Notice the boxes? Queries are another way to do an Advanced Search allowing a Batch Edit!
28
10.Advanced
Search
9. BatchEdit
8.Query
Builder? ?
This…
With this!!!29
10.Advanced
Search
9. BatchEdit
8.Query
Builder
7.Reformat Screens
?
30
10.Advanced
Search
9. BatchEdit
8.Query
Builder
7.Reformat Screens
?
Simply name your query with a leading $ followed by the screen name and sub-topic separated by a
period.
Your query needs to be about asset type in the sub-topic of the name!
Screen
Sub-Topic
31
10.Advanced
Search
9. BatchEdit
8.Query
Builder
7.Reformat Screens
6.“All”Data
32
5. Automation
Rules? ? ? ?
Automation Rules speed quality efforts based off Business logic! 33
5. Automation
Rules
4. Discovery/
Assignment? ? ?
Discovery allows you to speed up the governance efforts by combining many activities into one simplified flow!
With just a data connection you can execute multiple steps in your governance process!
34
5. Automation
Rules
4. Discovery/
Assignment? ? ?
Based on:1) Column name to term name matching2) Indirect assignment of a term when a data class is assigned to a column3) Supervised Machine Learning
35
5. Automation
Rules
4. Discovery/
Assignment? ? ?
ENRICH YOUR GLOSSARYAdditional benefit comes along --
Enriching your glossary with IBM Industry Models!
36
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search? ?
37
5. Automation
Rules
4. Discovery/
Assignment
3.Enterprise
Search
2.Custom
Relations?
38
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations?
Relationships work like other custom attributes…
and support bi-directional attributes with additional restrictions. 39
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations?
40
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations
1. View
Layer!
CMVIEWS
Views related to common assets that are shared across multiple InfoSphere Information Server offerings. These views include users and groups, custom attributes, annotations, stewardship, database tables, data files, data connections, and published analysis results such as quality, profiling, and classification.
IAVIEWS Views related to InfoSphere Information Analyzer workspaces, analyzed data sets and data rules. These views include analysis results and rule definitions.
IGVIEWSViews related to governance assets that are managed by the InfoSphere Governance Catalog. These views include business categories and terms, policies and rules, and rules and term assignments.
REMVIEWS Views related to assets that are managed by using the Data Quality Exception Console. These views include Exception Sets, their properties, and association to IT assets.
41
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations
1. View
Layer!
42
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations
1. View
Layer!
Three Questions in 30 Minutes! Can you show me…
A distribution of terms with in a
category structure?
43
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations
1. View
Layer!
Terms that have been modified
within the last…?
Three Questions in 30 Minutes! Can you show me…
44
5. Automation
Rules
4. Discovery/
Assignment
3. Enterprise
Search
2.Custom
Relations
1. View
Layer!
Three Questions in 30 Minutes! Can you show me…
A term distribution to their parent
categories?
45
Open API/CLIOpen View Layer
Open Standards
Openness is Critical for Data Governance!
Open Extension
46
Open Link to IBM Roadmaps
Open Source
Questions?
Thank You!