59
Data and Processes: Can we Marry Them . . . and Make the Marriage Last? Diego Calvanese Research Centre for Knowledge and Data (KRDB) Free University of Bozen-Bolzano, Italy INRIA Saclay Paris – 18/3/2016

Data and Processes: Can we Marry Them . . . and Make the Marriage Last?

Embed Size (px)

Citation preview

Data and Processes: Can we Marry Them . . .and Make the Marriage Last?

Diego Calvanese

Research Centre for Knowledge and Data (KRDB)Free University of Bozen-Bolzano, Italy

..

KRDB1

INRIA Saclay Paris – 18/3/2016

Dichotomy Analysis Marriage Strengthening Conclusions

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (1/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Outline

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 Conclusions

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (2/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Processes and data

The information assets of an organization are constituted by data andprocesses:

Data are the main information source about the history of the domain ofinterest, and about the relevant aspects of the current state of affairs.

A (business) process consists of a set of activities that are performed incoordination in an organizational and technical environment [Weske 2007].

Activities change the real world: The corresponding updates arereflected into the organizational information system(s).

Data trigger decision-making, which in turn determines the next stepsto be taken in the process.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (3/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Our starting point

Marrying processes and data is a must if we

really want to understand the functioning of

complex systems in the real world.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (4/52)

Dichotomy Analysis Marriage Strengthening Conclusions

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 ConclusionsDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (5/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Experts’ dichotomy

Survey by Forrester [Karel, Richardson, and Moore 2009]: lack of interactionbetween data and process experts:

Business process management professionals: think that data aresubsidiary to processes, and neglect importance of data quality.

Data management experts: claim that data are the main driver of theorganizational processes, and only focus on data quality.

Forrester: 83 out of 100 . . . no interaction at all between these two groups!

This isolation propagates to languages and tools, which never properlyaccount for the process-data connection.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (6/52)

Dichotomy Analysis Marriage Strengthening Conclusions

One side: conventional data modeling

Focus: relevant entities, relations, static constraints

Supplier ManufacturingProcurement/Supplier

Sales

Customer PO Line Item

Work OrderMaterial PO

*

*

spawns0..1

Material

But. . . how do data evolve?Where can we find the “state” of a purchase order?

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (7/52)

Dichotomy Analysis Marriage Strengthening Conclusions

The other side: conventional process modeling

Focus: control flow of activities in response to events

But. . . how do activities update data?What is the impact of canceling an order?

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (8/52)

Dichotomy Analysis Marriage Strengthening Conclusions

IT integration: Spaghetti!

Manage Cancelation

ShipAssembleManageMaterial POs

DecomposeCustomer PO

Activities

Process

Data

Activities

Process

Data

Activities

Process

Data

Activities

Process

Data

Activities

Process

Data

Customers Suppliers&CataloguesCustomer POs Work Orders Material POs

IT integration: system is difficult to manage, understand, evolve.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (9/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Too late to reconstruct the missing pieces

Where is our data?

part is in the DBs,part is hidden in the process execution engine.

Where are the relevant business rules, and how are they modeled?

At the DB level? Which DB? How to import the process data?(Also) in the business model? How to import data from the DBs?

DataProcess

Supplier ManufacturingProcurement/Supplier

Sales

Customer PO Line Item

Work OrderMaterial PO

*

*

spawns0..1

Determine cancelation

penaltyNotify penalty

Material

Process Engine

Process State

Business rulesFor each work order W For each material PO M in W if M has been shipped add returnCost(M) to penalty

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (10/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Data and processes in AI

Artificial Intelligence, traditionally has given important contributions to bothsettings:

Data: knowledge bases, conceptual models, ontologies, ontology-baseddata access and integration, inconsistency-tolerant semantics, . . .

Processes: reasoning about actions, temporal/dynamic logics,situation/event calculus, temporal reasoning, planning, verification,synthesis, . . .

Why is there still isolation?

To attack the complexity – Divide et impera!

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (11/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Data and processes in BPM

Need for conceptual integration recognized by the (business) process modelingcommunity as well.

Data-process integration is crucial to assess the value of processes andevaluate KPIs [Meyer, Smirnov, and Weske 2011].

Data-process integration is crucial to aggregate all relevant information,and to suitably inject business rules into the system [Dumas 2011].

Process and data are just two sides of the same coin [Reichert 2012].

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (12/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Overcoming the dichotomy

Strong need for:

suitable modeling formalisms supporting the integrated management ofprocesses and data;

methodologies for the design of systems based on such formalisms;

systems and tools that implement these languages and methodologies.

This, in turn requires a foundational approach to tackle important issues.1 Provide a clear understanding of (data-aware) process models w.r.t.

semantic properties, andcomputational properties.

2 Enable static analysis of such formalisms.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (13/52)

Dichotomy Analysis Marriage Strengthening Conclusions

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 ConclusionsDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (14/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Analysis in DB theory

In DB theory, data-related analysis is well-established:

intensional reasoning over queries: containment, equivalence

database dependencies: axiomatization, satisfaction, implication, . . .

semantic and conceptual data models

reasoning over views

· · ·

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (15/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Business process analysis

In BPM, process model analysis is considered the second most influentialtopic in the last decade (after process modeling languages) [Aalst 2012].

Basic assumption: control-flow is captured by a (possibly infinite-state)propositional labeled transition system:

Labels represent the process tasks/activities.

Concurrency is represented by interleaving.

Transition system usually not represented explicitly, but is implicitly“folded” into a Petri net.

However:

Data has been abstracted away.

Emphasis has been on the control-flow dimension:; sophisticated techniques for absence of deadlocks, boundedness,soundness, or domain-dependent properties expressed in LTL or CTL.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (16/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Verification of complex system behaviour

Automated analysis of aformal model of the system

against a property of interest,considering all possible system behaviors.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (17/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Verification via model checking

Verification of software and hardware systems via model checking.[2007 Turing Award: Clarke, Emerson, Sifakis]

Dynamic properties of interest are formulated in a temporal logic (LTL,CTL, µ-calculus, . . . ).

The transition system mathematically capturing the dynamics of thesystem of interest is (implicitly or explicitly) represented.

The temporal logic formulas are checked (i.e., evaluated) over thetransition system.

Model checking technology applicable only over a finite transition system.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (18/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Analysis of data-aware processes

The presence of data complicates analysis significantly:

States must be modeled relationally rather than propositionally.

The resulting transition system is typically infinite state.

Query languages for analysis need to combine two dimensions:

A temporal dimension to query the process execution flow.A first-order dimension to query the data present in the relational structures.

; We need first-order variants of temporal logics.

Model checking data-aware processes becomes immediately undecidable!

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (19/52)

Dichotomy Analysis Marriage Strengthening Conclusions

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 ConclusionsDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (20/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Marrying data and processes

How can we marry data and processes, mediating between:

the expressiveness of the language for temporal properties, and

the form of the data-aware processes,

in such a way that

1 we are able to capture notable, real-world scenarios, but

2 analysis stays decidable, and possibly efficient.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (21/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Business entities/artifacts

Data-centric paradigm for process modeling.

First: elicitation of relevant business entities that are evolved withingiven organizational boundaries.

Then: definition of the lifecycle of such entities, and how tasks triggerthe progression within the lifecycle.

Information model Lifecycle Artifact

Is an active research area, with concrete languages (e.g., IBM GSM, OMGCMMN).

Cf. EU project ACSI (2010–2013). a   i S C  Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (22/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Concrete models for artifacts

Key questions:

How and where to store data maintained by the information model?

How to specify the lifecycle of an artifact?

At which level of abstraction?

Some concrete information models:

Relational database (with nested records).

Knowledge base, e.g., expressed in some ontology language.

Some concrete lifecycle models:Finite-state machines. State = phase; events trigger transitions.

Implemented in the Siena prototype by IBM.

Guard-Stage-Milestone lifecycles, based on declarative(event-condition-action)-like rules.

Implemented in the Barcelona prototype by IBM.

Proclets (interacting Petri nets).Emphasise many-to-many relationships between artifacts.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (23/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Data-Centric Dynamic Systems (DCDS)

Abstract model underlying variants of artifact-centric systems.

Semantically equivalent to the most expressive models for business processsystems (e.g., GSM).

Data Process Data+Process

Data Layer: Relational databases / ontologies

Data schema, specifying constraints on the allowed statesData instance: state of the DCDS

Process Layer: key elements are

Atomic actionsCondition-action-rules for application of actionsService calls: communication with external environment, new data!

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (24/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Deterministic vs. non-deterministic services

DCDSs admit two different semantics for service-execution:

Deterministic services semantics

Along a run, when the same service is called again with the same arguments, itreturns the same result as in the previous call.

Are used to model an environment whose behavior is completely determined bythe parameters.Example: temperature, given the location and the date and time

Non-deterministic services semantics

Along a run, when the same service is called again with the same arguments, itmay return a different value than in the previous call.

Are used to model:

an environment whose behavior is determined by parameters that areoutside the control of the system;input of external users, whose choices depend on external factors.

Example: current temperature, given the locationDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (25/52)

Dichotomy Analysis Marriage Strengthening Conclusions

An example: Hotels and price conversion

Data Layer: Info about hotels and room prices

Cur = 〈UserCurrency〉 CH = 〈Hotel ,Currency〉 PEntry = 〈Hotel ,Price,Date〉

Process Layer/1

User selection of a currency.

Process: true 7−→ ChooseCur()

Service call for currency selection: uInputCurr()

Models user input with non-deterministic behavior.

ChooseCur() :

{Cur(c) del{Cur(c)}true add{Cur(uInputCurr())}

}

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (26/52)

Dichotomy Analysis Marriage Strengthening Conclusions

An example: Hotels and price conversion

Data Layer: Info about hotels and room prices

Cur = 〈UserCurrency〉 CH = 〈Hotel ,Currency〉 PEntry = 〈Hotel ,Price,Date〉

Process Layer/2

Price conversion for a hotel.

Process: Cur(c) ∧ CH(h, ch) ∧ ch 6= c 7−→ ApplyConv(h, c)

Service call for currency selection: conv(price, from, to, date)

Models historical conversion with deterministic behavior.

ApplyConv(h, c) : PEntry(h, p, d) del{PEntry(h, p, d)}PEntry(h, p, d) ∧ CH(h, cold) ∧ Cur(c) add{PEntry(h,conv(p, cold , c, d), d)}

CH(h, cold) del{CH(h, cold)}, add{CH(h, c)}

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (26/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Run of the system

HC

h1 eurh2 eur

PEntry

h1 95 apr-25h1 80 sep-18h2 80 sep-18

HC

h1 eurh2 eur

PEntry

h1 95 apr-25h1 80 sep-18h2 80 sep-18

Cur

usd

HC

h1 usdh2 eur

PEntry

h1 115 apr-25h1 95 sep-18h2 80 sep-18

Cur

usd

HC

h1 usdh2 usd

PEntry

h1 115 apr-25h1 95 sep-18h2 95 sep-18

Cur

usd

ChooseCur(): uInputCurr() =

?

usd

ApplyConv(h1,usd):conv(95,eur,usd,apr-25) = ?115conv(80,eur,usd,sep-18) = ?95

ChooseCur()

ApplyConv(h2,usd)

ChooseCur()

ApplyConv(h2,usd)conv(80,eur,usd,sep-18) = 95

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (27/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Semantics via transition systems

Semantics of a DCDS S is given in terms of a transition system ΥS :

Each state of ΥS has an associated DB over a common schema.

The initial state is associated to the initial DB of the DCDS.

s0

s1

s3

s4

s6

s7

Note: ΥS is in general infinite state:

Infinite branching, due to the results of service calls.

Infinite runs, since infinitely many DBs may occur along a run.

Associated to the states we have DBs of unbounded size.Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (28/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Verification for DCDSs

We are interested in the verification of temporal properties over ΥS .

Idea to overcome infiniteness:

1 Devise a finite-state transition system ΘS that is a faithful abstractionof ΥS independent of the formula to verify.

2 Reduce the verification problem ΥS |= Φ to the verification of ΘS |= Φ.

Problem: Verification of DCDSs is undecidable even for propositionalreachability properties.; We need to pose restrictions on DCDSs.

We could draw inspiration from chase termination for tgds in data exchange,and specifically from weak-acyclicity.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (29/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Restrictions on DCDSs

Run-bounded DCDS

Runs cannot accumulate more than a fixed number of different values.

Transition system is still infinite-state due to infinite branching.

This is a semantic condition, whose checking is undecidable.; Sufficient syntactic condition: Weak-acyclicity.

Run-boundedness is very restrictive for DCDSs with nondeterministicservices.

State-bounded DCDS

States cannot contain more than a fixed number of different values.

Relaxation of run-boundedness.

Infinite runs are possible.

This is a semantic condition, whose checking is undecidable.; Sufficient syntactic condition: e.g., GR-acyclicity.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (30/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Verification formalisms for DCDSs

Boundedness is not sufficient for decidability.We introduce two extensions of the modal µ-calculus µL / LTL with restrictedforms of first order quantification.

History-Preserving quantification: µLA / LTL-FOA

FO quantification ranges over current active domain only.

Examples:LTL-FOA : ∀x.live(x) ∧ Customer(x)→ F Gold(x)

µLA : ∀x.live(x) ∧ Customer(x)→ µZ.Gold(x) ∨ [−]Z

Persistence-Preserving quantification: µLP / LTL-FOP

FO quantification ranges over persisting individuals only.

Examples:LTL-FOP : ∀x.live(x) ∧ Gold(x)→ G Gold(x)

µLP : ∀x.live(x) ∧ Gold(x)→ νZ.Gold(x) ∧ live(x) ∧ [−]Z

LTL

µL

µLFO/LTL-FO

µLA/LTL-FOA

µLP /LTL-FOP

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (31/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Towards decidability

We need to tame the two sources of infinity inDCDSs:

infinite branching, due to external input;

infinite runs, i.e., runs visiting infinitelymany DBs.

P(a) P(a)

P(b)

. . .

. . .

. . .

. . .

To prove decidability of model checking for a specific restriction and a specificverification formalism:

We use bisimulation as a tool.

We show that restricted DCDSs have a finite-state bisimilar transitionsystem.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (32/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Bisimulation between transition systems

States sA and sB of transition systems A and B are bisimilar if:1 sA and sB are isomorphic;2 If there exists a state sA1 of A such that sA ⇒A sA1 , then there exists a

state sB1 of B such that sB ⇒B sB1 , and sA1 and sB1 are bisimilar;3 The other direction!

A and B are bisimilar, if their initial states are bisimilar.

A BsA sB

sA1 sB1

sB2sA2

µL invariance property of bisimulation:

Bisimilar transition systems satisfy the same set of µL properties.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (33/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Adapting the notion of bisimulation

History PreservingBisimulation Invariant Languages

Persistence PreservingBisimulation Invariant Languages

Bisimulation Invariant Languages

L

CTL

µL

LP

µLP

LA

µLA

µLFOP

rop

ositio

na

lT

emp

ora

lL

og

icsF

irstO

rder

Tem

po

ral

Lo

gics

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (34/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Decidability of µL extensions for run-bounded systems

Theorem

Verification of µLA over run-bounded DCDSs is decidable and can be reducedto model checking of propositional µ-calculus over a finite transition system.

Idea: use isomorphic types instead ofactual values.

Remember: runs are bounded!

...

...

...

...

. . .

A-bisimilar

non A-bisimilar

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (35/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Decidability of µL extensions for state-bounded systems

Theorem

Verification of µLP over state-bounded DCDSs is decidable and can be reducedto model checking of propositional µ-calculus over a finite transition system.

Steps:

1 Prune infinite branching (isomorphic types).2 Finite abstraction along the runs:

µLP looses track of previous values that do notexist anymore.New values can be replaced with old, non-persistingones.This eventually leads to recycle the old valueswithout generating new ones.

......

......

......

......

...

...

...

. . .

P-bisimilar

non P-bisimilar

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (36/52)

Dichotomy Analysis Marriage Strengthening Conclusions

What about LTL-FO?

For verification of LTL-FO over DCDSs, analogous decidability results hold:

Theorem

Verification of LTL-FOA over run-bounded DCDSs, andLTL-FOP over state-bounded DCDSs

are decidable and can be reduced to model checking of propositional LTL over afinite transition system.

Moreover:

Theorem

Verification of LTL-FOA over state-bounded DCDSs is undecidable.

Intuition: LTL-FOA can arbitrarily quantify over the infinitely many valuesencountered during a single run, and start comparing them.

Proof is based on a reduction from satisfiability of LTL with freeze quantifiersover infinite data words.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (37/52)

Dichotomy Analysis Marriage Strengthening Conclusions

And verification of µLA over state-bounded DCDSs?

Well-known

Propositional LTL can be expressed in µL, i.e., the propositional µ-calculus.

Folklore “theorem” (see, e.g., [Okamoto 2010])

This correspondence carries over to the FO-variants, i.e., LTL-FO can beexpressed in µLFO.

Note: This, together with the undecidability of LTL-FOA verification overstate-bounded DCDSs, would imply that also:

Verification of µLA over state-bounded DCDSs is undecidable.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (38/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Verification of µLFO over state-bounded DCDSs

Instead, the following positive result holds:

Theorem

Verification of µLFO (and hence µLA) over state-bounded DCDSs is decidable.

Relies on the fact that DCDSs generate transition systems that are generic:Intuitively, if a state s has a successor state s′ with fresh values ~v, then ithas also all successor states that are obtained from s′ by varying in allpossible ways the fresh values ~v.This is a consequence of the fact that the progression mechanism isdefined by means of a logical specification.

Lemma

For generic TSs (with infinite domain), persistence-preserving bisimilarityand bisimilarity (and hence history-preserving bisimilarity) coincide.

For TSs of state-bounded DCDSs, we can devise finite state abstractionsthat are faithful for µLFO formulas (although such abstractions maydepend on the formula).

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (39/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Results on decidability of verification for DCDSs

Un

rest

rict

edD

CD

Ss

(Tu

rin

gco

mp

lete

)

Sta

te-b

ou

nd

edD

CD

Ss

Ru

n-b

ou

nd

edD

CD

Ss

Fin

ite-

sta

teD

CD

Ss

GR+-acyclic DCDSs

GR-acyclic DCDSs

Weakly-acyclic DCDSsfor det. services

Finite-range DCDSs

Unrestricted State-bounded Run-bounded Finite-state

LTL-FO / µLFO U U / N ? / N D

LTL-FOA / µLA U U / N D D

LTL-FOP / µLP U D D D

LTL / µL U D D D

D: decidable U: undecidable N: decidable, but no finite abstraction

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (40/52)

Dichotomy Analysis Marriage Strengthening Conclusions

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 ConclusionsDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (41/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Semantically-Governed Artifact Systems (SASs)

The data layer in an artifact system might be very complex, and difficult tointeract with.

Hence we can resort to ontology-based technology and ontology-based dataaccess techniques to support users:

We install “on top” of an artifact system an ontology, capturing thedomain of interest at a higher level of abstraction.

We connect the ontology to the underlying artifact system via declarativemappings.

Such a setting gives rise to a very rich and still largely unexplored framework, inwhich we have various choices for:

the language used to express the ontology;

the form of the mappings, and the language used to express them;

the assumptions we make about the dynamics of the system;

the kind of analysis tasks we want to perfom.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (42/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Semantically-Governed Artifact Systems (SASs)

Artifacts System conceptual schema (TBox) composed of semantic constraintsthat define the “data boundaries” of the artifact system.

TBox

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (43/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Semantic layer and snapshots

Actual data are concretely maintained at the artifact layer.Snapshot: database instances of artifacts.

Da

Db

Dc

Artifact System Snapshot

TBox

...

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (44/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Mappings

Each snapshot is conceptualized in the ontology as instance data.Mappings define how to obtain the virtual ABox from the source data.

Da

Db

Dc

Artifact System Snapshot

Mappings

TBox

ABox1

...

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (45/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Action execution to evolve the system

The system evolves due to actions/process executed over the artifact layer,invoking external services to inject new data.

Da

Db

Dc

Artifact System Snapshot

D'a

D'b

D'c

Artifact System Snapshot

Mappings Mappings

Semantic Layer Snapshot

TBox

ABox1

TBox

Semantic Layer Snapshot

ABox2

Transition... ...

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (46/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Understanding the evolution

Semantic layer used to understand the evolution at the conceptual level,by posing queries over the ontology.

Da

Db

Dc

Artifact System Snapshot

D'a

D'b

D'c

Artifact System Snapshot

Mappings Mappings

Semantic Layer Snapshot

TBox

ABox1

TBox

Semantic Layer Snapshot

ABox2

Transition... ...

... ...

queries

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (47/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Semantic Governance

Semantic layer used to regulate the execution of actions at the artifact layer byrejecting actions that lead to violations of constraints in the ontology.

Da

Db

Dc

Artifact System Snapshot

D'a

D'b

D'c

Artifact System Snapshot

Mappings Mappings

Semantic Layer Snapshot

TBox

ABox1

Semantic Layer Snapshot

...

ABox2

TBox

Transition

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (48/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Temporal Verification over Semantic Layer

Temporal properties expressed as:

queries over the ontology combined with

temporal operators to talk about the dynamics of the system.

System evolves at the Artifact Layer.

Rewriting of temporal properties

The temporal part is maintained unaltered, because thesystem evolves at the Artifact Layer.

Faithful transformation of a temporal property overSemantic Layer:

1 Rewriting of ontology queries to compile away the TBox.2 Unfolding of temporal property wrt mappings to obtain a

corresponding temporal property over the Artifact Layer.

D

A

T

M

Q0 = rew(Q, T )

Q

unfold(Q0,M)

Hence, verification of temporal properties expressed over the ontology isreduced to verification of temporal properties over the artifacts.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (49/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Decidability of Verification over SASs

We obtain that the verification of (restricted first-order) temporal properties isdecidable, provided the transition system at the Artifact Layer satisfies suitableboundedness conditions.

Results

The following are decidable, and can be reduced to model checking ofpropositional LTL/mu-calculus over a finite transition system:

Verification of LTL-FOA/µLA properties over run-bounded SASs withdeterministic services.

Verification of LTL-FOP /µLP properties over state-bounded SASs (bothwith deterministic and with non-deterministic services).

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (50/52)

Dichotomy Analysis Marriage Strengthening Conclusions

1 Data and processes: a dichotomy

2 Analysing data and processes

3 Marrying data and processes

4 Strengthening the marriage

5 ConclusionsDiego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (51/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Additional, ongoing, and future work

Dealing with state-boundedness:

relaxation of syntactic restrictionsboundedness “by design”

Further developments of the semantically enriched setting:

More sophisticated treatment of inconsistency w.r.t. the ontology[C., Kharlamov, et al. 2013; C., Montali, and Santoso 2015].Handling of contextual information [C., Ceylan, et al. 2014].Allow the system to evolve also at the semantic layer, and propagate theupdates to the artifact layer.

Enriching of the data model with ordered data types [C., Delzanno, and

Montali 2015].

Investigate how to deal with the exponential explosion w.r.t. the data.

Implementation of the approach on the top of a relational database[C., Montali, Patrizi, et al. 2015]. We aim at using state-of-the-artfinite-state model checkers.

Other reasoning services, e.g., composition, adversarial synthesis[C., De Giacomo, et al. 2013].

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (52/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Acknowledgements

Thanks to my friends and colleagues with whom this work was carried out!

Babak Bagheri Hariri (graduated in Bolzano)

Giuseppe De Giacomo (Sapienza University of Rome)

Alin Deutsch (University of California San Diego)

Marco Montali (Bolzano)

Fabio Patrizi (Bolzano)

Ario Santoso (PhD student in Bolzano, graduating soon)

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (53/52)

Dichotomy Analysis Marriage Strengthening Conclusions

Thank you for your attention!

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (54/52)

References References

References I

[1] Mathias Weske. Business Process Management: Concepts, Languages,Architectures. Springer, 2007. isbn: 9783540735212.

[2] Rob Karel, Clay Richardson, and Connie Moore. Warning: Don’t AssumeYour Business Processes Use Master Data – Synchronize Your BusinessProcess And Master Data Strategies. Report. Forrester, Sept. 2009.

[3] Andreas Meyer, Sergey Smirnov, and Mathias Weske. Data in BusinessProcesses. Tech. rep. 50. Available online athttp://opus.kobv.de/ubp/volltexte/2011/5304/.Hasso-Plattner-Institut for IT Systems Engineering, Universitat Potsdam,2011.

[4] Marlon Dumas. “On the Convergence of Data and Process Engineering”.In: Proc. 15th Int. Conf. of the Advances in Databases and InformationSystems (ADBIS). Vol. 6909. Lecture Notes in Computer Science.Springer, 2011, pp. 19–26. doi: 10.1007/978-3-642-23737-9_2.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (55/52)

References References

References II

[5] Manfred Reichert. “Process and Data: Two Sides of the Same Coin?” In:Proc. of the On the Move Confederated Int. Conf. (OTM 2012).Vol. 7565. Lecture Notes in Computer Science. Springer, 2012, pp. 2–19.doi: 10.1007/978-3-642-33606-5_2.

[6] Wil M. P. van der Aalst. “A Decade of Business Process ManagementConferences: Personal Reflections on a Developing Discipline”. In: Proc.of the 10th Int. Conf. on Business Process Management (BPM).Vol. 7481. Lecture Notes in Computer Science. Springer, 2012, pp. 1–16.

[7] Keishi Okamoto. “Comparing Expressiveness of First-Order Modalµ-calculus and First-Order CTL*”. In: RIMS Kokyuroku 1708 (2010),pp. 1–14.

[8] Diego C., Evgeny Kharlamov, et al. “Verification of Inconsistency-AwareKnowledge and Action Bases”. In: Proc. of the 23rd Int. Joint Conf. onArtificial Intelligence (IJCAI). 2013.

[9] Diego C., Marco Montali, and Ario Santoso. “Verification of GeneralizedInconsistency-Aware Knowledge and Action Bases”. In: Proc. of the 24thInt. Joint Conf. on Artificial Intelligence (IJCAI). 2015.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (56/52)

References References

References III

[10] Diego C., Ismail Ilkan Ceylan, et al. “Verification of Context-SensitiveKnowledge and Action Bases”. In: Proc. of the 14th Eur. Conf. onLogics in Artificial Intelligence (JELIA). Vol. 8761. Lecture Notes inComputer Science. Springer, 2014, pp. 514–528.

[11] Diego C., Giorgio Delzanno, and Marco Montali. “Verification ofRelational Multiagent Systems with Data Types”. In: Proc. of the 29thAAAI Conf. on Artificial Intelligence (AAAI). AAAI Press, 2015,pp. 2031–2037. isbn: 9781577356981.

[12] Diego C., Marco Montali, Fabio Patrizi, et al. “ImplementingData-Centric Dynamic Systems over a Relational DBMS”. In: Proc. ofthe 9th Alberto Mendelzon Int. Workshop on Foundations of DataManagement (AMW). 2015.

[13] Diego C., Giuseppe De Giacomo, et al. “Verification and Synthesis inDescription Logic Based Dynamic Systems”. In: Proc. of the 7th Int.Conf. on Web Reasoning and Rule Systems (RR). Vol. 7994. LectureNotes in Computer Science. Springer, 2013, pp. 50–64.

Diego Calvanese (FUB) Foundations of Data-Aware Process Analysis INRIA Saclay Paris – 18/3/2016 (57/52)