Data Governance and Data Stewardship on how to reach global adoption and systematic monitoring of data policy through software Dr. Pieter De Leenheer Co-founder

Data Stewardship and Governance: how to reach global adoption and systematic monitoring of data policy through software ?


DESCRIPTION

Data quality and regulations are perpetual drivers for Data Governance solutions that systematically monitor the execution of data policy. And yet, there is a long road ahead to achieve Data Governance: the term is still relatively unknown, there is no political forum in the form of a Data Governance Council, and software support is moderate. Time for change! Data Governance requires automation on the one hand and wide adoption from business to ICT on the other. In this lecture, we set out the basic principles to successfully develop Data Governance. By way of example, we show how to translate these into Collibra's Data Governance Center. We pay particular attention to identifying and modelling data policies and rules, and to empowering them on the basis of data stewardship and configurable workflows across silos and functions in the organization. The example is drawn from the Flanders Research Information Space, where data quality is critical to drive and boost pan-European research policy.


Page 1: Data Stewardship and Governance: how to reach global adoption and systematic monitoring of data policy through software ?

Data Governance and Data Stewardship on how to reach global adoption and systematic monitoring of data policy through software

Dr. Pieter De Leenheer, Co-founder

Page 2

What we talk about when we talk about no Data Governance

Who approved this?!

I wish these guys spoke our language!

I can’t understand this report!!

I’ve never seen this code! Who introduced this?!

This doesn’t seem right. Are we sure this data is correct?!

The Problem

This rule is different in our country!!

This is an exception to the rule!!

Page 3

Data Management Challenges

•  Data Service = data sharing agreement across organization silos, policies, regulations, semantic assumptions

•  No clear balance between data ownership and control:
    •  responsibilities are not set
    •  for each data point: increasing exposure to risk regarding quality and policy compliance
    =>  ask Alice, she knows

Regulatory compliance risks continue to persist and remain a solid driver for governance, risk and compliance technologies. However, more hype is being generated by external risks posed by third parties, suppliers and customers.

(Gartner Hype Cycle on Risk and Compliance Tech, 2013)

Page 4

Flanders Research Information Space

•  Providing Scientific Research Information and Services

•  Easy

•  Transparent

•  Open

•  Timely

•  Unambiguous

•  Supported by Data Governance

•  Qualitative meta data: e.g., definition for project, funding codes, mappings, classifications, etc.

•  Roles and responsibilities for Information Providers and Stiweto

•  Collaborative workflows between Information Providers and Stiweto

By courtesy of G. Van Grootel, EWI

Page 5

FRIS’ Data-driven Innovation Engine

By courtesy of G. Van Grootel, EWI

Page 6

Context & Necessity

•  Services are increasingly
    •  knowledge-intensive, relying on millions of data points from
        •  Partners
        •  Third parties
        •  Customers
    •  co-produced in federated, decentralised, multi-tier settings
    •  multi-disciplinary:
        •  Algorithm: e.g., Big Data Analytics
        •  Infrastructure: e.g., Internet of Things
        •  Service Innovation Methods: e.g., Living Labs
        •  Marketing: e.g., Service-dominant Logic
•  sufficient…? No

Page 7

Defining Data Stewardship & Governance

•  Ownership + => Power + Control

Page 8

Data Stewardship & Governance

•  Ownership + Responsibility => Power + Control
    1.  (global) data stewardship
        =>  Requirement 1: people who define data policy
        =>  E.g., multilingualism policy
    2.  (systematic) data governance
        =>  Requirement 2: processes that enforce data policy
        =>  E.g., every project abstract must be in English and Dutch

•  Now let’s build software for it…

“New information infrastructure technologies must enable organizations to define, organize, share, integrate and govern data and content to create business value”

(Gartner Hype Cycle on Information Infrastructure Tech, 2013).

Page 9

Yet contradicting forces… Borrowed from Dirk Coutuer (ING)

Page 10

...and not all data points are created equal

Critical Data Elements

•  External: Auditors, Clients, Counterparties ...
•  Corporate Functions: Risk, Compliance, Finance ...
•  Business Lines: Equities, Fixed Income, Wealth Management ...
•  Regulations: Dodd Frank Act, Basel III, FATCA ...

Borrowed from Predrag Dizdarevic (Element 22 NYC)

Page 11

Can technology globalise and systematise data policy scoping, definition and enforcement which is by nature a human process?

Page 12

Process-driven Data Governance

Page 13

[Diagram; several elements are marked "?!"]

Tools

•  Policy: Multilingualism
•  Business Rule: Abstract must be in English
•  Business Rule: Abstract must be in Dutch
•  Code Values: 4250, 4.3, G3
•  Business Terms: Researcher, Publication, Project, Actie er Onder...

Funding Community

•  Funding Sources Glossary
    •  Business term: Actie ter ondersteuning van de Strategische prioriteiten van de Federale overheid
    •  Business term: POD wetenschapsbeleid - Federale Impulsprogramma's
•  Generation 1 Funding Codes (Codelist)
    •  Code Set: Generation 1 Funding Codes
    •  Code Values: 4250, 4.3
•  Generation 2 Funding Codes (Codelist)
    •  Code Set: Generation 2 Funding Codes
    •  Code Value: 368
•  Funding Stream Codes (Codelist)
    •  Code Set: Funding Stream Codes
    •  Code Value: G3
•  Accounting Codes (Codelist)
    •  Code Set: Accounting Codes
    •  Code Value: xxxx

Relation labels in the diagram: contains, code


Page 15

Data Governance Council

Example for Funding Source Terms and Codes

Funding Community

•  Funding Sources Glossary
    •  Business term: Actie ter ondersteuning van de Strategische prioriteiten van de Federale overheid
    •  Business term: POD wetenschapsbeleid - Federale Impulsprogramma's
•  Generation 1 Funding Codes (Codelist)
    •  Code Set: Generation 1 Funding Codes
    •  Code Values: 4250, 4.3
•  Generation 2 Funding Codes (Codelist)
    •  Code Set: Generation 2 Funding Codes
    •  Code Value: 368
•  Funding Stream Codes (Codelist)
    •  Code Set: Funding Stream Codes
    •  Code Value: G3
•  Accounting Codes (Codelist)
    •  Code Set: Accounting Codes
    •  Code Value: xxxx

Relation labels in the diagram: contains, code

Page 16

Load, Define & Enforce Data Governance Council: Governance Operating Model

[Diagram labels, grouped by kind]

•  Governance Operating Model: Roles & Responsibilities; Processes & Workflow; Asset Types & Traceability
•  Layers: Data Governance Organization; Data Stewardship Activities; IT / Operational Data Management Activities
•  Arrows between layers: Establishes & drives; Aligns & Coordinates; Reports & Escalates; Monitors & Remediates
•  Products: Collibra Business Semantics Glossary (BSG); Collibra Reference Data Accelerator (RDA); Collibra Data Stewardship Manager (DSM); Collibra Platform; Other Data Management Vendor products ...
•  Activities: Data Quality Development; Data Modeling; Metadata Lineage; Metadata Scanning; Reference Data Authoring; Data Integration; Hierarchy Management; Business & Data Definitions; Business Traceability; Semantic Modeling; Mapping Specifications; Policy Management; Business Rules; Data Quality Rules; Data Quality Reporting; Issue Management; Reference Data Crosswalks; Master Data Stewardship; Data Quality Profiling; DQ Defect Resolution
•  Steps: Load…; Scope, select, define; enforce

Page 17

5 Modeling Concepts in DGC Operating Model

[Diagram labels: Community, Name, Domain, Asset, relation, Attribute]

Assets are fundamental building blocks or resources for which you want to capture information. An asset belongs to exactly one domain. An asset has a unique name within its domain.

E.g., Personal Privacy Policy, Customer, ISO 3166, CRM, Customer Gender Disclosure Issue

Attributes are literal values such as strings or numbers that do not form an asset in their own right. E.g., the Description attribute for asset “Customer” is “Person that placed at least one order for at least one product with Bank and Insurance”

Relations semantically relate 2 assets.

E.g., between assets “Customer” and “CRM”: “Customer has system of record / is system of record for CRM”

E.g., between assets “Customer” and “Gender”: “Customer has gender / gender of Gender”

Domains logically group assets (according to their function, project, or knowledge area) and are owned by exactly one community. A domain has a domain type that specifies which asset types can be created in the domain.

E.g., Customer Domain groups all assets related to customer relationship management

E.g., Enterprise Rules and Policies Domain collects all valid policies and rules in the organisation

Communities are groups of people. They often correspond to functional divisions in a company and should be aligned with the company's governance organization. A community can control/own various domains.

E.g., Finance Community includes relevant people in the finance function, and controls the Customer Domain.
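
The five concepts above can be sketched as plain data structures. This is a minimal illustration in Python and not Collibra's actual data model or API: the class and method names are hypothetical, and placing "CRM" in an "Application Assets" domain under an "Enterprise Architecture" community is an assumption made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    """Building block for which information is captured."""
    name: str
    asset_type: str   # e.g. "Business Term", "System"
    domain: str       # an asset belongs to exactly one domain
    attributes: dict = field(default_factory=dict)  # literal values only


@dataclass
class Domain:
    """Logical group of assets, owned by exactly one community."""
    name: str
    community: str
    assets: dict = field(default_factory=dict)

    def add_asset(self, name, asset_type):
        # Enforce: an asset has a unique name within its domain.
        if name in self.assets:
            raise ValueError(f"{name!r} already exists in domain {self.name!r}")
        asset = Asset(name, asset_type, domain=self.name)
        self.assets[name] = asset
        return asset


@dataclass
class Community:
    """Group of people that can control/own various domains."""
    name: str
    domains: dict = field(default_factory=dict)

    def add_domain(self, name):
        domain = Domain(name, community=self.name)
        self.domains[name] = domain
        return domain


@dataclass
class Relation:
    """Semantic link between two assets, readable in both directions."""
    source: Asset
    role: str
    co_role: str
    target: Asset


finance = Community("Finance")
customer_domain = finance.add_domain("Customer Domain")
customer = customer_domain.add_asset("Customer", "Business Term")
customer.attributes["Description"] = (
    "Person that placed at least one order for at least one product "
    "with Bank and Insurance"
)

# Assumed placement of the CRM system, for illustration only.
architecture = Community("Enterprise Architecture")
crm = architecture.add_domain("Application Assets").add_asset("CRM", "System")

link = Relation(customer, "has system of record", "is system of record for", crm)
```

Keeping the uniqueness check inside `Domain.add_asset` means the "unique name within its domain" rule cannot be bypassed by callers that go through the domain.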

Page 18

DGC Asset Types

Asset

•  Business Asset: subsumes asset types such as Business Term, KPI, and Report
•  Data Asset: subsumes asset types such as Code Value
•  Technology Asset: includes asset types such as System and Database
•  Governance Asset: includes asset types such as Policy and Rule
•  Issue

Asset Types allow you to formally specify what type an asset is, as a kind of template. They are assigned to one or more Domain Types.

E.g., Business Term is type for “Customer” and “Gender”

E.g., Code Value is type for “CG_NA”

E.g., System is type for “CRM”

We distinguish between 4 main types of asset, and 1 special type called Issue.

Page 19

Traceability of Assets across Domains

Assigning types to assets, relations and domains gives meaning, and brings a better understanding of different viewpoints on DG.

[Diagram labels, grouped by kind]

•  Communities: Enterprise Architecture; Finance; Working Group on Rules and Policies
•  Domains: Application Assets; CRM Application Reference Data; Enterprise Rules and Policies; Customer Domain
•  Assets:
    •  Business Term: Customer (description: "Person or […] and Insurance")
    •  System: CRM
    •  Business Term: Gender
    •  Code Values: CG_MA, CG_FE, CG_NA
    •  Policy: Personal Privacy Policy
    •  Issue: Gender Disclosure Issue
•  Relation labels: has system of record; has gender; allowed value; governs / complies to; violates
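
Traceability across domains amounts to walking the relations between assets. The sketch below stores the relations from this slide as triples and runs a breadth-first search from the issue. It is an illustration only: `trace` is a hypothetical helper, and the endpoints chosen for "governs" and "violates" are assumptions, since the flattened diagram does not show which assets those arrows connect.

```python
from collections import deque

# (source asset, relation label, target asset)
RELATIONS = [
    ("Customer", "has system of record", "CRM"),
    ("Customer", "has gender", "Gender"),
    ("Gender", "allowed value", "CG_MA"),
    ("Gender", "allowed value", "CG_FE"),
    ("Gender", "allowed value", "CG_NA"),
    # Assumed endpoints for the next two relations.
    ("Personal Privacy Policy", "governs", "Gender"),
    ("Gender Disclosure Issue", "violates", "Personal Privacy Policy"),
]


def trace(start, relations):
    """Map every asset reachable from `start` to its distance in hops.

    Relations are followed in both directions, because each relation
    can be read from either side (role / co-role).
    """
    neighbours = {}
    for source, _label, target in relations:
        neighbours.setdefault(source, set()).add(target)
        neighbours.setdefault(target, set()).add(source)

    distance = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for nxt in neighbours.get(current, ()):
            if nxt not in distance:
                distance[nxt] = distance[current] + 1
                queue.append(nxt)
    return distance


impact = trace("Gender Disclosure Issue", RELATIONS)
for asset, hops in sorted(impact.items(), key=lambda item: item[1]):
    print(hops, asset)
```

Under these assumptions the issue reaches the CRM system through the policy, the Gender term and the Customer term, which is the kind of cross-domain view the slide argues for.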

Page 20

Use-cases

Page 21

Business Glossary

Page 22

DG in Cloud Provider

Page 23

Data Dictionary

Page 24

Business Glossary at the #1 Chocolate Factory

Page 25

Reference Data

FWO Disciplines

IWETO Disciplines

ECOOM Hasselt

Page 26

Issue Management

Data Governance Council

Funding Source Not Found
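
A "Funding Source Not Found" issue could be raised automatically when an incoming code matches no known code set. The sketch below is a hypothetical illustration, not the DGC issue workflow: the function names and the issue record layout are invented, the grouping of code values under code sets follows the earlier funding codes example as read from the flattened diagram, and "9999" is a made-up unknown code.

```python
# Code sets and values as they appear in the funding codes example.
CODE_SETS = {
    "Generation 1 Funding Codes": {"4250", "4.3"},
    "Generation 2 Funding Codes": {"368"},
    "Funding Stream Codes": {"G3"},
}


def find_code_set(code, code_sets=CODE_SETS):
    """Return the name of the code set containing `code`, or None."""
    for name, values in code_sets.items():
        if code in values:
            return name
    return None


def check_funding_code(code, issues):
    """Look up a code; log an issue for the Council if it is unknown."""
    code_set = find_code_set(code)
    if code_set is None:
        issues.append({
            "type": "Funding Source Not Found",
            "code": code,
            "assigned_to": "Data Governance Council",
            "status": "open",
        })
    return code_set


issues = []
check_funding_code("4250", issues)   # known code, no issue
check_funding_code("9999", issues)   # hypothetical unknown code
print(issues)
```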

Page 27

Reference Data & Issue Mgt at Health Insurance Co.

•  http://prezi.com/ve1ws8jmpqcn/workflow/

Page 28

Policy Management

•  Policy: Multilingualism
•  Business Rule: Abstract must be in Dutch
•  Business Rule: Abstract must be in English
•  Business Term: Project
•  Data Entity: cfProj
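
The chain from policy to business rule to data entity can be sketched as an executable check. This is an illustration under assumptions: the record layout (a dict with an `abstracts` mapping keyed by ISO 639-1 language code) is hypothetical and is not the cfProj schema, and the check only tests that a language-tagged abstract is present, not that its text is actually written in that language.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BusinessRule:
    name: str
    language: str  # ISO 639-1 code the abstract must be available in

    def is_satisfied_by(self, project):
        # Satisfied when a non-empty abstract exists for the language.
        abstract = project.get("abstracts", {}).get(self.language, "")
        return bool(abstract.strip())


# The Multilingualism policy is modelled as the set of rules it groups.
MULTILINGUALISM_POLICY = [
    BusinessRule("Abstract must be in Dutch", "nl"),
    BusinessRule("Abstract must be in English", "en"),
]


def violations(project, policy=MULTILINGUALISM_POLICY):
    """Return the names of the rules that the project record breaks."""
    return [rule.name for rule in policy if not rule.is_satisfied_by(project)]


# Hypothetical project record with only an English abstract filled in.
project = {
    "id": "P-001",
    "abstracts": {"en": "Example abstract.", "nl": ""},
}
print(violations(project))
```

Each violation could then be logged as an issue and routed to the responsible steward.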

Page 29

FRIS Data Governance: Funding Sources Glossary Scenario

Funding Sources Glossary (FSG)

Data Governance Council

ECOOM, UGent, VUB

Data governance officers in the Council are delegated by each institute

Page 30

FRIS Data Governance: Funding Sources Glossary Scenario (2)

•  5 (fictional) workflows for different phases in the lifecycle of a term:

candidate > proposed > draft > in-review > accepted

[Workflow diagram: Funding Sources Glossary (FSG), Data Governance Council, ECOOM, UGent, VUB]

Entry points: Ticket Request, Create, Import, Discover

Workflows, numbered as in the diagram:

1.  on-boarding candidate FSG term
2.  delegating proposed FSG term
3.  draft term
4.  approving in-review term (shown three times)
5.  mapping accepted FSG terms
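
The term lifecycle can be read as a small state machine in which each workflow moves a term to the next status. The sketch below is a hypothetical illustration, not the DGC workflow engine: the `Term` class is invented, and pairing each workflow with exactly one status transition is an assumption. The fifth workflow, mapping accepted FSG terms, operates on terms that are already accepted and is therefore not modelled as a transition.

```python
LIFECYCLE = ["candidate", "proposed", "draft", "in-review", "accepted"]

# Assumed pairing of workflows with status transitions.
WORKFLOWS = {
    ("candidate", "proposed"): "on-boarding candidate FSG term",
    ("proposed", "draft"): "delegating proposed FSG term",
    ("draft", "in-review"): "draft term",
    ("in-review", "accepted"): "approving in-review term",
}


class Term:
    """A glossary term that can only move forward through the lifecycle."""

    def __init__(self, name):
        self.name = name
        self.status = LIFECYCLE[0]
        self.history = []  # workflows that have been completed so far

    def advance(self):
        index = LIFECYCLE.index(self.status)
        if index == len(LIFECYCLE) - 1:
            raise ValueError(f"{self.name!r} is already accepted")
        next_status = LIFECYCLE[index + 1]
        self.history.append(WORKFLOWS[(self.status, next_status)])
        self.status = next_status
        return self.status


term = Term("Example funding source")  # hypothetical term name
while term.status != "accepted":
    term.advance()
print(term.history)
```

Because `advance` only ever moves to the next status, a term cannot skip the review step or be accepted twice.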

Page 31

on-boarding, delegating and drafting a Funding Source term

candidate > proposed > draft > in-review > accepted

Page 32

Approving Funding Source Glossary term

candidate > proposed > draft > in-review > accepted

Page 33

Demonstration in the DGC Software Tool

•  5 workflows for different phases in the lifecycle of a term:

candidate > proposed > draft > in-review > accepted

[Workflow diagram: Funding Sources Glossary (FSG), Data Governance Council, VUB, UGent, ECOOM]

Entry points: Ticket Request, Create, Import, Discover

Workflows, numbered as in the diagram:

1.  on-boarding candidate FSG term
2.  delegating proposed FSG term
3.  draft term
4.  approving in-review term (shown three times)
5.  mapping accepted FSG terms

Steps and roles in the demonstration:

1. Start-user who requests: Bob Brown

2. DGO Secretary motivates request: Mike Jones

3. Officers vote onboarding: John Fisher

4. DGO Secretary moves the onboarded term: Mike Jones

5. Steward drafts term: Pieter DL

6. Subject Matter Expert reviews: John West

7. Stakeholder comments: Judy Clarke

8. Co-Stewards vote: Mary Smith

Page 34

Conclusion

•  FRIS Service = Qualitative Data Sharing

•  Qualitative => Unambiguous, Timely, Accurate, Open, Complete, Consistent, Valid, etc.

•  Data Stewardship highlights the Responsibility aspect of Data Ownership

•  Data Governance programs enforce Data Quality Policy and Regulations

•  Data Governance Technologies are promising to handle these issues that hamper service innovation

Page 35

Conclusions

•  Services are data-intensive

•  Their coproduction requires data sharing across organisation policies / modelling assumptions / regulations

•  Data Stewardship highlights the responsibility aspect of Data Power

•  Data Governance programs enforce data policy and regulations

•  Data Governance Technologies are promising to overcome these issues that hamper service innovation

Page 36

Questions & Feedback?