94
Data as a Service, Data Marketplace and Data Lake Models, Data Concerns and Engineering Hong-Linh Truong Faculty of Informatics, TU Wien [email protected] http://dsg.tuwien.ac.at/staff/truong @linhsolar 1 ASE Summer 2018 Advanced Services Engineering, Summer 2018 Lecture 4

Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data as a Service, Data Marketplace and

Data Lake – Models, Data Concerns and

Engineering

Hong-Linh Truong

Faculty of Informatics, TU Wien

[email protected]://dsg.tuwien.ac.at/staff/truong

@linhsolar

1ASE Summer 2018

Advanced Services Engineering,

Summer 2018 – Lecture 4

Page 2: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Outline

Data-as-a-Service concepts

Data governance & Data concerns for DaaS

Evaluating data concerns

Data marketplace

Datalake

ASE Summer 2018 2

Page 3: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

From last year projects

ASE Summer 2018 3

„Use of several health, food and recipe

services, in order to collect general

food information”

“Latest data on air quality is fetched

from London Air API”

“Measure and report water quality

metrics”

„collect location-data from multiple

Sources …. combine location- with social-data“

“give data about crimes in an area

…. ranking of data quality

„real time production information from photovoltaic panels”

Page 4: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATA AS A SERVICE

ASE Summer 2018 4

Page 5: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data versus data assets

ASE Summer 2018 5

Data

Data Assets

Data management

and provisioning

Data concerns

Data collection,

assessment and

enrichment

Page 6: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data provisioning activities and

issues

ASE Summer 2018 6

Collect

• Data sources

• Ownership

• License

• Quality assessment and enrichment

Store

• Query and backup capabilities

• Local versus cloud, distributed versus centralized storage

Access

• Interface

• Public versus private access

• Access granularity

• Pricing and licensing model

Utilize

• Alone or in combination with other data sources

• Redistribution

• Updates

Non-exhausive list! Add your own issues!

Provisioning Models

Page 7: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Stakeholders in data provisioning

ASE Summer 2018 7

Data

Data Provider

• People (individual/crowds/organization)

• Software, Things

Service Provider

• Software and people

Data Consumer

• People, Software, Things

Data Aggregator/Integrator

• Software

• People + software

Data Assessment

• Software and people

Stakeholder classes can be further divided!

Domain-specific versus domain-independent functions

Page 8: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Conceptual data service unit

ASE Summer 2018 8

Service model

Unit Concept

Data service

unit

Data

Can be used for private

or public

Can be elastic or not

„basic

component“/“basic

function“ modeling

and description

Consumption,

ownership,

provisioning, price, etc.

Microservices mindset!

Page 9: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data service units in clouds

Provide data capabilities rather than provide

computation or software capabilities

Providing data in edges/clouds is an increasing

trend

In both business and e-science environments

Now often in a combination of data + analytics

of the data + AI to provide data assets

9ASE Summer 2018

Page 10: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data service unit

10

Data service units in distributed

edge and cloud systems

data

Edge/Cloud Infrastructure

Data service unit

People

data

Data service unit

Things

ASE Summer 2018

data data

Page 11: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data as a Service -- characteristics

On-demand self-service

Capabilities to provision data at different granularities

Resource pooling

Multiple types of data, big, static or near-realtime,raw data and

high-level information

Broad network access

Can be access from anywhere

Rapid elasticity

Easy to add/remove data sources

Measured service

Measuring, monitoring and publishing data concerns and usage

ASE Summer 2018 11

Let us use NIST‘s definition

Page 12: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data-as-a-Service – service models

Data as a Service – service models

and deployment models

ASE Summer 2018 12

Storage-as-a-Service

(Basic storage functions)

Database-as-a-Service

(Structured/non-structured

querying systems)

Data publish/subcription

middleware as a service

Sensor-as-a-Service

IoT, Edge & Cloud Systems

deploy

Page 13: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Examples of DaaS

ASE Summer 2018 13

Page 14: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DaaS design & implementation –

APIs

Read-only DaaS versus CRUD DaaS APIs

Service APIs versus Data APIs

They are not the same w.r.t. data/service

concerns

REST & Streaming data APIs

ASE Summer 2018 14

Page 15: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

ASE Summer 2018 15

DaaS design & implementation –

service provider vs data provider

The DaaS provider is separated from the data

provider

DaaS

db

file

Consumer

DaaS

Sensor

DaaS

Consumer DaaS provider Data

provider

Page 16: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DaaS design & implementation –

structures

DaaS and data providers have the right to

publish the data

ASE Summer 2018 16

DaaS

• Service APIs

• Data APIs for the whole resource

Data Resource

• Data APIs for particular resources

• Data APIs for data items

Data Items

• Data APIs for data items

Three levels

Page 17: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

17

DaaS design & implementation –

structures (2)

Data

items

Data

items

Data

items

Data resource

Data

assets

Data resource Data resource

Data resourceData resource

Consumer

Consumer

DaaS

ASE Summer 2018

Page 18: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DaaS design & implementation –

patterns for „turning data to DaaS“ (1)

ASE Summer 2018 18

DaaSdata Build Data

Service

APIs

Deploy

Data

Service

Examples: using WSO2 data service

Page 19: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Storage/Database

-as-a-Service

DaaS design & implementation –

patterns for „turning data to DaaS“ (2)

ASE Summer 2018 19

data

Examples: using

Amazon S3

DaaS

Page 20: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Storage/Databa

se/Middleware

DaaS design & implementation –

patterns for „turning data to DaaS“ (3)

ASE Summer 2018 20

data

Examples: using

Crowd-sourcing

with Pachube in

2013 (Note: the

information was

not up-to-date)

Things

One Thing 10000... Things

DaaS

Page 21: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Storage/Database

/Middleware

DaaS design & implementation –

patterns for „turning data to DaaS“ (4)

ASE Summer 2018 21

data

Examples:

using Twitter

PeopleDaaS

Page 22: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

....

DaaS design & implementation –

not just „functional“ aspects (1)

ASE Summer 2018 22

data DaaS.... data assets

Data

concerns

Quality of

dataOwnership

PriceLicense ....

EnrichmentCleansing

Profiling

Integration ...

Data Assessment

/Improvement

APIs, Querying, Data Management, etc.

Page 23: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DaaS design & implementation –

not just „functional“ aspects (2)

ASE Summer 2018 23

Understand the DaaS ecosystem

Specifying, Evaluating and Provisioning Data

concerns and Data Contract

Page 24: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Example

ASE Summer 2018 24

https://www.informatica.com/products/data-quality/data-as-a-service.html

Page 25: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATALAKE

ASE Summer 2018 25

Page 26: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Example: Linking data in telco

management

26

Alarm

BTS

Infrastructure NodeB

You can continue to have different data sources like that but you

need to make sure they are linked

KPI

record

record

record

Time

ASE Summer 2018 26

Page 27: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data lakes

A lake of data

Ingest and integrate as many as possible types of

data

To archive a lot of data so that potentially many

analytics and applications can access

Data lake is a concept so you can implement it based

on your requirements and needs

27ASE Summer 2018 27

Page 28: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Example

28

Cedrine Madera and Anne Laurent. 2016. The next information architecture evolution: the data lake wave. In Proceedings of

the 8th International Conference on Management of Digital EcoSystems (MEDES). ACM, New York, NY, USA, 174-180.

DOI: https://doi.org/10.1145/3012071.3012077

ASE Summer 2018 28

Page 29: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Implementation

Can we build a

data lake using

the concept of

“data space”

29

Source: http://www.gigaspaces.com/logistics-and-shipping-management

ASE Summer 2018 29

Page 30: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data Lake through Data Access API &

API Management

Data access APIs can be built based on well-defined interfaces

Help to bring the data object close to the programming language objects

30

Relational

Database

(e.g. MySQL)

REST APIREST API

Object-based

storage

(e.g. S3)

Document-

based

Database

Graph-

based

Database

REST API Tool-specific

API

Data Lake API & API Management Layer

ASE Summer 2018 30

Page 31: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATA GOVERNANCE

ASE Summer 2018 31

Page 32: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data Governance

32

“Data governance is a control that ensures that the data entry

by an operations team member or by automated processes

meets precise standards, such as a business rule, a data

definition and data integrity constraints in the data model.”

From https://en.wikipedia.org/wiki/Data_governance

ASE Summer 2018 32

Page 33: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data governance Process

33

Sunil Soares. 2010. The IBM Data Governance Unified Process: Driving Business Value with IBM Software and

Best Practices. MC Press, LLC.

ASE Summer 2018 33

Page 34: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Decision domains for data

governance

34

Vijay Khatri and Carol V. Brown.

2010. Designing data governance.

Commun. ACM 53, 1 (January

2010), 148-152.

DOI=http://dx.doi.org/10.1145/1629

175.1629210

ASE Summer 2018 34

Page 35: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Framework for

domain

decisions

35ASE Summer 2018 35

Vijay Khatri and Carol V. Brown.

2010. Designing data governance.

Commun. ACM 53, 1 (January

2010), 148-152.

DOI=http://dx.doi.org/10.1145/1629

175.1629210

Page 36: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATA CONCERNS

ASE Summer 2018 36

Page 37: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

....

What are data concerns?

data DaaS.... data assets

APIs, Querying, Data Management, etc.

Located

in US?

free?

price?

redistribution?Service

quality?

37ASE Summer 2018

Quality of data? Privacy

problem?

Read: Carlo Batini, Monica Scannapieco: Data and Information Quality - Dimensions, Principles and Techniques. Data-

Centric Systems and Applications, Springer 2016, ISBN 978-3-319-24104-3, pp. 1-449

Page 38: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

....

DaaS concerns

ASE Summer 2018 38

data DaaS.... data assets

Data

concerns

Quality of

dataOwnership

PriceLicense ....

APIs, Querying, Data Management, etc.

DaaS concerns include QoS, quality of data (QoD),

service licensing, data licensing, data governance, etc.

Page 39: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Why DaaS/data concerns are

important?

Overloading data returned to the

consumer/integrator are not good

Results are returned without a clear usage and

ownership causing data compliance problems

Consumers want to deal with dynamic changes

ASE Summer 2018 39

Ultimate goal: to provide relevant data with

acceptable constraints on data concerns in

different provisioning models

Page 40: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DaaS concerns analysis and

specification

Which concerns are important in which

situations?

How to specify concerns?

40ASE Summer 2018

Hong Linh Truong, Schahram Dustdar On analyzing and specifying concerns for data as a service. APSCC 2009: 87-

94

Page 41: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data governance

ASE Summer 2018 41

Important factor, for example, the security and

privacy compliance, data distribution, and auditing

Storage/Database

-as-a-Servicedata DaaS

Data governance

Page 42: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Types of concerns

Quality of data:

For example, the accurary and compleness of the

data, whether the data is up-to-date

Data and service usage:

In particular, price, data and service APIs licensing, law

enforcement, and Intellectual Property rights

Quality of service:

in particular, availability, response time, dependability, and

security

Context:

useful factor, such as classification and service type (REST),

location

ASE Summer 2018 42

Page 43: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Conceptual model for DaaS

concerns and contracts

43ASE Summer 2018

Page 44: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

44

Example of implementations

Check http://www.infosys.tuwien.ac.at/prototyp/SOD1/dataconcerns

ASE Summer 2018

Page 45: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

45

Populating DaaS concerns

DaaS

Concerns

evaluate, specify,

publish and manage

specify, select,

monitor, evaluate

monitor and

evaluate

The role of stakeholders in the most trivial view

Data Aggregator/Integrator

Data Consumer

Data Assessment

Service Provider

Data Provider

ASE Summer 2018

Page 46: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

SOFTWARE DESIGN FOR

EVALUATING DATA

CONCENRS FOR DATA

ASSETS

ASE Summer 2018 46

Page 47: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data concern measure

They are domain-specific and data-specific

You need to look at specific definitions in other to

calculate/determine values of metrics

But key principles of software design are quite

similar

We are focusing on software design

ASE Summer 2018 47

Page 48: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Patterns for „turning data to DaaS“

ASE Summer 2018 48

Storage/Database

-as-a-Servicedata DaaS

Storage/Databa

se/Middleware

data

ThingsDaaS

Storage/Database

/Middleware

data

PeopleDaaS

DaaSdata Build Data

Service

APIs

Deploy

Data

Service

Page 49: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data-related activities

ASE Summer 2018 49

Wrapping

data

Publishing DaaS

interface

Typical activities for data wrapping and publishing

Typical activities for data updating & retrieval

Updating

data

Selecting

datadata

Provisioning

data

Page 50: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Typical data concern evaluation

ASE Summer 2018 50

Evaluating data

concerns

Describing data

concerns

Data Concerns

Evaluation ToolsData Concerns

Representation Models

Populating data

concerns

Publishing services

What do we need in order to perform these activities?

Page 51: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

51

Data concern-aware DaaS

engineering process Typical activities

for data wrapping

and publishing

Typical activities

for data updating &

retrieval

ASE Summer 2018

Hong Linh Truong, Schahram Dustdar: On Evaluating and Publishing

Data Concerns for Data as a Service. APSCC 2010: 363-370

Page 52: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating data concerns – the

three important points

52

• At which level the evaluation is performed?

evaluation scope

• When the evaluation is done?

evaluation modes

• How the evaluation tool is invoked?

integration model

ASE Summer 2018

Hong Linh Truong, Schahram Dustdar: On Evaluating and Publishing Data Concerns for Data as a Service. APSCC

2010: 363-370

Page 53: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating data concerns – some

patterns (1)

53

Pull, pass-by-references

ASE Summer 2018

Page 54: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating data concerns – some

patterns (2)

54

Pull, pass-by-values

ASE Summer 2018

Page 55: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating data concerns – some

patterns (3)

55

Push, pass-by-values (1)

ASE Summer 2018

Page 56: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating data concerns – some

patterns (4)

56

Push, pass-by-values (2)

ASE Summer 2018

Page 57: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluation Tool – Internal Software

components

Self-developed or third-party software

components for evaluation tool

Advantages

Tightly couple integration performance, security,

data compliance

Customization

Disadvantages

Usually cannot be integrated with other features

(e.g., data enrichment)

Costly (e.g., what if we do not need them)

ASE Summer 2018 57

Page 58: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluation tool – using cloud

services

Evaluation features are provided by cloud

services

Several implementations Informatica Cloud Data Quality Web Services, StrikeIron,

Advantages Pay-per-use, combined features

Disadvantages Features are limited (with certain types of data)

Performance issues with large-scale data

Data compliance and security assurance

ASE Summer 2018 58

Page 59: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluation Tool -- using human

computation capabilities Professionals and Crowds can act as data

concerns evaluators For complex quality assessment that cannot be done by

software

Issues Subjective evaluation

Performance

Limited type of data (e.g., images, documents, etc.)

ASE Summer 2018 59

Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A Novel

Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112

Maribel Acosta, Amrapali Zaveri, Elena Simperl, Dimitris Kontokostas, Sören Auer, Jens Lehmann: Crowdsourcing Linked

Data Quality Assessment. International Semantic Web Conference (2) 2013: 260-276

Óscar Figuerola Salas, Velibor Adzic, Akash Shah, and Hari Kalva. 2013. Assessing internet video quality using

crowdsourcing. In Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia (CrowdMM '13).

ACM, New York, NY, USA, 23-28. DOI=10.1145/2506364.2506366 http://doi.acm.org/10.1145/2506364.2506366

Page 60: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Evaluating concerns in in the big

data pipeline

ASE Summer 2018 60

data concerns: why, how and what?

Page 61: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Abstract software components for

identifying “How” and “What”

ASE Summer 2018 61

Hong-Linh Truong, Manfred Halper, „Characterizing Incidents in Cloud-based IoT Data Analytics”, Proceedings of The

42nd IEEE International Conference on Computers, Software and Applications (COMPSAC 2018)

Page 62: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Monitor and evaluating concerns

ASE Summer 2018 62

Identify where and when to do instrumentation

and monitoring

Hong-Linh Truong, Manfred Halper, „Characterizing Incidents in Cloud-based IoT Data Analytics”, Proceedings of The

42nd IEEE International Conference on Computers, Software and Applications (COMPSAC 2018)

Page 63: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data quality monitoring at large-

scale

ASE Summer 2018 63

Figure taken from the keynote talk “Data Intelligence and Analytics at Alibaba”

of Dr. Jingren Zhou, Alibaba Group at the 2018 International Conference on

Cloud Engineering (IC2E 2018)

Page 64: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATA MARKETPLACE

ASE Summer 2018 64

Page 65: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data marketplaces

More than just DaaS

DaaS focuses on data provisioning features

Stakeholders in data marketplaces

Multiple data providers and consumers

Marketplace providers

Marketplace authorities

Analytics providers

Data transportation providers

Billing and payment providers

ASE Summer 2018 65

Page 66: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Example of stakeholders

ASE Summer 2018 66

Specific data market or generic data market?

Tien-Dung Cao, Quang-Hieu Vu, Duc-Hung Le, Hong-Linh Truong, Schahram Dustdar: MARSA: A Marketplace for Realtime Human-Sensing Data.

http://dungcao.github.io/marsa/

Page 67: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Technical services, protocols,

mechanisms in data marketplaces

Multiple DaaS provisioning

Access models and interfaces

Complex interactions among DaaS providers,

data providers, data consumers, marketplace

providers, etc.

Data exchange as well as payment

Complex billing and pricing models

Market dynamics

Service and data contracts

ASE Summer 2018 67

Page 68: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data marketplaces and related

components/services

ASE Summer 2018 68

Page 69: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Data contracts

Give a clear information about data usage

Have a remedy against the consumer for illegal

data usage

Limit the liability of data providers in case of

failure of the provided data;

Specify information on data delivery,

acceptance, and payment

69ASE Summer 2018

Page 70: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

70

Data contracts

Well-researched contracts for services but not

for DaaS and data marketplaces

But service APIs != data APIs =! data assets

Several open questions

Right to use data? Quality of data in the data

agreement? Search based on data contract? Etc.

➔ Require extensible models

➔ Capture contractual terms for data contracts

➔ Support (semi-)automatic data service/data selection

techniques.

Hong-Linh Truong, Marco Comerio, Flavio De Paoli, G.R. Gangadharan, Schahram Dustdar, "Data Contracts for

Cloud-based Data Marketplaces ", International Journal of Computational Science and Engineering, 2012 Vol.7, No.4,

pp.280 - 295

ASE Summer 2018

Page 71: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Study of main data contract terms

Data rights

Derivation, Collection, Reproduction, Attribution

Quality of Data (QoD)

Not mentioned, Not clear how to establish QoD metrics

Regulatory Compliance

Sarbanes-Oxley, EU data protection directive, etc.

Pricing model

Different models, pricing for data APIs and for data assets

Control and Relationship

Evolution terms, support terms, limitation of liability, etc

71

Most information is in human-readable form

ASE Summer 2018

Page 72: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

72

Representing data contract terms

Contract term: (termName,termValue)

Term name: common terms or user-specific terms

Term value: a single value, a set, or a range

ASE Summer 2018

Page 73: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

HOW DOES NEAR-REALTIME DATA IMPACT

ON DATA CONTRACT EXCHANGE?

Discussion time

ASE Summer 2018 73

Page 74: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

DATA MARKETPLACE IN

BLOCKCHAIN AGE

ASE Summer 2018 74

Page 75: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

IoT Data Market without

Marketplace?

ASE Summer 2018 75

Dominic Wörner and Thomas von

Bomhard. 2014. When your sensor

earns money: exchanging data for

cash with Bitcoin. In Proceedings of the

2014 ACM International Joint Conference

on Pervasive and Ubiquitous Computing:

Adjunct Publication (UbiComp '14

Adjunct). ACM, New York, NY, USA, 295-

298.

Kay Noyen, Dirk Volland, Dominic

Wörner, Elgar Fleisch:

When Money Learns to Fly: Towards

Sensing as a Service Applications Using

Bitcoin.

But what about data

contract? smart contract

Page 76: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Generic Contract-aware Framework Architecture

ASE Summer 2018 76

Florin-Bogdan Balint, Hong-Linh Truong:

On Supporting Contract-Aware IoT Dataspace Services. MobileCloud 2017: 117-124

Page 77: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

IoT Data Contract Design

ASE Summer 2018 77

Page 78: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

IoTA Data marketplace

ASE Summer 2018 78

Figure captured from https://data.iota.org/

The principle is very much like any other data

marketplaces

Payment via IoTA

Page 79: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Monetasa

Using smart

contract

contract

ASE Summer 2018 79

Figure captured from https://monetasa.com/

Page 80: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Databroker DAO

ASE Summer 2018 80

Figure captured from https://databrokerdao.com/

Page 81: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Exercises

Read mentioned papers

Check characteristics, service models and

deployment models of mentioned DaaS (and

find out more)

Identify services in the ecosystem of some DaaS

Turn some data to DaaS using existing tools

ASE Summer 2018 81

Page 82: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Exercises (2)

Identify and analyze the relationships between

data concerns evaluation tools and types of data

Analyze trade-offs between on-line and off-line

evaluation and when we can combine them

Analyze how to utilize evaluated data concerns

for optimizing data compositions

Analyze situations when software cannot be

used to evaluate data concerns

ASE Summer 2018 82

Page 83: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Exercises (3)

Develop some specific data contracts for open

government data

Work on some algorithms for checking data

contract compatibility

Incorporate data marketplaces concepts into

your scenario

Build your own mini data marketplace

ASE Summer 2018 83

Page 84: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

CASE STUDY – DESIGN DATA

MARKETPLACE

MARSA: A Marketplace for Realtime Human-Sensing Data

Cao, Tien-Dung ; Pham, Tran-Vu ; Vu, Quang-Hieu ; Le, Duc-Hung

; Truong, Hong-Linh ; Dustdar, Schahram

ACM Transactions on Internet Technology, 2016

http://dungcao.github.io/marsa/

ASE Summer 2018 84

Page 85: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Traffic problems in HoChiMinh City

ASE Summer 2018 85

Figure sources: Internet

Crowded and

unpredictable

Needs a lot of data to

understand traffics

Lack infrastructures for

collecting traffic

information

Common problems in

developing countries

Cannot buy

expensive traffic

data collection

systems!

Page 86: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Market-oriented View of traffic data

scenarios

ASE Summer 2018 86

4000 citybus fleet, 0.25MB per day per bus (7.5MB/month/bus), 30GB for the fleet

1MB of GPS data =20 USD cent 6000 USD for the fleet operators

A mobile phone, like a bus, can receive 1.5 USD per month ½ of 3G data bill

Page 87: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Costs and benefits

ASE Summer 2018 87

Page 88: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

MARSA Design overview

ASE Summer 2018 88

Page 89: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

MARSA description for human-

sensing data marketplace

ASE Summer 2018 89

Page 90: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Cost model

ASE Summer 2018 90

Quality of data has not supported yet

Page 91: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Implementation

ASE Summer 2018 91

Page 92: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Testbed

ASE Summer 2018 92

Page 93: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

Example of bills

ASE Summer 2018 93

Page 94: Data-as-a-Service, Data marketplace, data lakes: Models, Data … · Consumption, ownership, provisioning, price, etc. Microservices mindset! Data service units in clouds Provide

94

Thanks for your attention

Hong-Linh Truong

Distributed Systems Group, TU Wien

[email protected]

http://dsg.tuwien.ac.at/staff/truong

@linhsolar

ASE Summer 2018