34
Reconciling Rationality and Stochasticity: Rich Behavioral Models in Two-Player Games Mickael Randour Computer Science Department, ULB - Universit´ e libre de Bruxelles, Belgium July 24, 2016 GAMES 2016 - 5th World Congress of the Game Theory Society

Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Reconciling Rationality and Stochasticity:

Rich Behavioral Models in Two-Player Games

Mickael Randour

Computer Science Department, ULB - Universite libre de Bruxelles, Belgium

July 24, 2016

GAMES 2016 - 5th World Congress of the Game Theory Society

Page 2: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

The talk in one slideTwo traditional paradigms for agents

in complex systems

Fully rational

System=

(multi-player) game

Fully stochastic

System=

large stochastic process

In some fields (e.g., computer science), needto go beyond: rich behavioral models

Illustration: planning a journeyin an uncertain environment

Reconciling Rationality and Stochasticity Mickael Randour 1 / 21

Page 3: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Advertisement

Full paper available on arXiv [Ran16a]: abs/1603.05072

Reconciling Rationality and Stochasticity Mickael Randour 2 / 21

Page 4: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

1 Rationality & stochasticity

2 Planning a journey in an uncertain environment

3 Synthesis of reliable reactive systems

4 Conclusion

Reconciling Rationality and Stochasticity Mickael Randour 3 / 21

Page 5: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

1 Rationality & stochasticity

2 Planning a journey in an uncertain environment

3 Synthesis of reliable reactive systems

4 Conclusion

Reconciling Rationality and Stochasticity Mickael Randour 4 / 21

Page 6: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Rationality hypothesis

Rational agents [OR94]:

clear personal objectives,

aware of their alternatives,

form sound expectations about any unknowns,

choose their actions coherently (i.e., regarding some notion ofoptimality).

=⇒ In the particular setting of zero-sum games: antagonisticinteractions between the players.

↪→ Well-founded abstraction in computer science.E.g., processes competing for access to a shared resource.

Reconciling Rationality and Stochasticity Mickael Randour 5 / 21

Page 7: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Stochasticity

Stochastic agents:

often a sufficient abstraction to reason about macroscopicproperties of a complex system,

agents follow stochastic models that can be based onexperimental data (e.g., traffic in a town).

Several models of interest:

fully stochastic agents =⇒ Markov chain [Put94],

rational agent against stochastic agent =⇒ Markovdecision process [Put94],

two rational agents + one stochastic agent =⇒ stochasticgame or competitive MDP [FV97].

Reconciling Rationality and Stochasticity Mickael Randour 6 / 21

Page 8: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Choosing the appropriate paradigm matters!

As an agent having to choose a strategy, the assumptions madeon the other agents are crucial.

=⇒ They define our objective hence the adequate strategy.

=⇒ Illustration: planning a journey.

Reconciling Rationality and Stochasticity Mickael Randour 7 / 21

Page 9: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

1 Rationality & stochasticity

2 Planning a journey in an uncertain environment

3 Synthesis of reliable reactive systems

4 Conclusion

Reconciling Rationality and Stochasticity Mickael Randour 8 / 21

Page 10: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Aim of this illustration

Flavor of 6= types of useful strategies in stochastic environments.

� Based on a series of papers, most in a computer sciencesetting (more on that later) [Ran13, BFRR14b, BFRR14a,RRS15a, RRS15b, BCH+16].

Applications to the shortest path problem.

A

B

C

D

E

30

10

20

5

10

20

5

↪→ Find a path of minimal length in a weighted graph(Dijkstra, Bellman-Ford, etc) [CGR96].

Reconciling Rationality and Stochasticity Mickael Randour 9 / 21

Page 11: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Aim of this illustration

Flavor of 6= types of useful strategies in stochastic environments.

� Based on a series of papers, most in a computer sciencesetting (more on that later) [Ran13, BFRR14b, BFRR14a,RRS15a, RRS15b, BCH+16].

Applications to the shortest path problem.

A

B

C

D

E

30

10

20

5

10

20

5

What if the environment is uncertain? E.g., in case of heavytraffic, some roads may be crowded.

Reconciling Rationality and Stochasticity Mickael Randour 9 / 21

Page 12: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Planning a journey in an uncertain environment

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Each action takes time, target = work.

� What kind of strategies are we looking for when theenvironment is stochastic (MDP)?

Reconciling Rationality and Stochasticity Mickael Randour 10 / 21

Page 13: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 1: minimize the expected time to work

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

� “Average” performance: meaningful when you journey often.

� Simple strategies suffice: no memory, no randomness.

� Taking the car is optimal: EσD(TSwork) = 33.

Reconciling Rationality and Stochasticity Mickael Randour 11 / 21

Page 14: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 2: traveling without taking too many risks

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Minimizing the expected time to destination makes sense if we traveloften and it is not a problem to be late.

With car, in 10% of the cases, the journey takes 71 minutes.

Reconciling Rationality and Stochasticity Mickael Randour 12 / 21

Page 15: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 2: traveling without taking too many risks

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Most bosses will not be happy if we are late too often. . .

; what if we are risk-averse and want to avoid that?

Reconciling Rationality and Stochasticity Mickael Randour 12 / 21

Page 16: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 2: maximize the probability to be on time

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Specification: reach work within 40 minutes with 0.95 probability

Sample strategy: take the train ; PσD[TSwork ≤ 40

]= 0.99

Bad choices: car (0.9) and bike (0.0)

Reconciling Rationality and Stochasticity Mickael Randour 13 / 21

Page 17: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 3: strict worst-case guarantees

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Specification: guarantee that work is reached within 60 minutes(to avoid missing an important meeting)

Sample strategy: bike ; worst-case reaching time = 45 minutes.Bad choices: train (wc = ∞) and car (wc = 71)

Reconciling Rationality and Stochasticity Mickael Randour 14 / 21

Page 18: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 3: strict worst-case guarantees

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Worst-case analysis ; two-player zero-sum game against a ratio-nal antagonistic adversary (bad guy)

� forget about probabilities and give the choice of transitions tothe adversary

Reconciling Rationality and Stochasticity Mickael Randour 14 / 21

Page 19: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 4: minimize the expected time under strictworst-case guarantees

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Expected time: car ; E = 33 but wc = 71 > 60

Worst-case: bike ; wc = 45 < 60 but E = 45 >>> 33

Reconciling Rationality and Stochasticity Mickael Randour 15 / 21

Page 20: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 4: minimize the expected time under strictworst-case guarantees

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

In practice, we want both! Can we do better?

� Beyond worst-case synthesis [BFRR14b, BFRR14a]:minimize the expected time under the worst-case constraint.

Reconciling Rationality and Stochasticity Mickael Randour 15 / 21

Page 21: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 4: minimize the expected time under strictworst-case guarantees

home

waitingroom

trainlight

trafficmediumtraffic

heavytraffic

work

railway, 2 car, 1

wait, 3

relax, 35

go back, 2

bike, 45

drive, 20 drive, 30 drive, 70

0.1 0.9 0.20.7

0.1

0.1 0.9

Sample strategy: try train up to 3 delays then switch to bike.

; wc = 58 < 60 and E ≈ 37.34 << 45; Strategies need memory ; more complex!

Reconciling Rationality and Stochasticity Mickael Randour 15 / 21

Page 22: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 5: multiple objectives ⇒ trade-offs

home

workcar

wreck

bus, 30, 3 taxi, 10, 20

0.7 0.99 0.01

0.3

Two-dimensional weights on actions: time and cost.

Often necessary to consider trade-offs: e.g., between the probabilityto reach work in due time and the risks of an expensive journey.

Reconciling Rationality and Stochasticity Mickael Randour 16 / 21

Page 23: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 5: multiple objectives ⇒ trade-offs

home

workcar

wreck

bus, 30, 3 taxi, 10, 20

0.7 0.99 0.01

0.3

Solution 2 (probability) can only ensure a single constraint.

C1: 80% of runs reach work in at most 40 minutes.

� Taxi ; ≤ 10 minutes with probability 0.99 > 0.8.

C2: 50% of them cost at most 10$ to reach work.

� Bus ; ≥ 70% of the runs reach work for 3$.

Taxi 6|= C2, bus 6|= C1. What if we want C1 ∧ C2?

Reconciling Rationality and Stochasticity Mickael Randour 16 / 21

Page 24: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 5: multiple objectives ⇒ trade-offs

home

workcar

wreck

bus, 30, 3 taxi, 10, 20

0.7 0.99 0.01

0.3

C1: 80% of runs reach work in at most 40 minutes.

C2: 50% of them cost at most 10$ to reach work.

Study of multi-constraint percentile queries [RRS15a].

� Sample strategy: bus once, then taxi. Requires memory .

� Another strategy: bus with probability 3/5, taxi withprobability 2/5. Requires randomness.

Reconciling Rationality and Stochasticity Mickael Randour 16 / 21

Page 25: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Solution 5: multiple objectives ⇒ trade-offs

home

workcar

wreck

bus, 30, 3 taxi, 10, 20

0.7 0.99 0.01

0.3

C1: 80% of runs reach work in at most 40 minutes.

C2: 50% of them cost at most 10$ to reach work.

Study of multi-constraint percentile queries [RRS15a].

In general, both memory and randomness are required.

6= previous problems ; more complex!

Reconciling Rationality and Stochasticity Mickael Randour 16 / 21

Page 26: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

1 Rationality & stochasticity

2 Planning a journey in an uncertain environment

3 Synthesis of reliable reactive systems

4 Conclusion

Reconciling Rationality and Stochasticity Mickael Randour 17 / 21

Page 27: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Controller synthesis

Setting:

� a reactive system to control,� an interacting environment,� a specification to enforce.

For critical systems (e.g., airplane controller, power plants,ABS), testing is not enough!

⇒ Need formal methods.

Automated synthesis of provably-correct and efficientcontrollers:� mathematical frameworks,

↪→ e.g., games on graphs [GTW02, Ran13, Ran14]

� software tools.

Reconciling Rationality and Stochasticity Mickael Randour 18 / 21

Page 28: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Strategy synthesis in stochastic environments

Strategy = formal model of how to control the system

systemdescription

environmentdescription

informalspecification

model as aMarkov DecisionProcess (MDP)

model asa winningobjective

synthesis

is there awinning

strategy ?

empower systemcapabilitiesor weaken

specificationrequirements

strategy=

controller

no yes

1 How complex is it to decide ifa winning strategy exists?

2 How complex such a strategyneeds to be? Simpler isbetter.

3 Can we synthesize oneefficiently?

⇒ Depends on the winningobjective, the exact type ofinteraction, etc.

Reconciling Rationality and Stochasticity Mickael Randour 19 / 21

Page 29: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

Some other objectives

The example was about shortest path objectives, but there aremany more! Some examples based on energy applications.

� Energy: operate with a (bounded) fuel tank and never runout of fuel [BFL+08].

� Mean-payoff: average cost/reward (or energy consumption)per action in the long run [EM79].

� Average-energy: energy objective + optimize the long-runaverage amount of fuel in the tank [BMR+15].

Also inspired by economics:

� Discounted sum: simulates interest or inflation [BCF+13].

Reconciling Rationality and Stochasticity Mickael Randour 20 / 21

Page 30: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Rationality & stochasticity Planning a journey Synthesis Conclusion

ConclusionOur research aims at:

defining meaningful strategy concepts,providing algorithms and tools to compute those strategies,classifying the complexity of the different problems from atheoretical standpoint.↪→ Is it mathematically possible to obtain efficient algorithms?

Take-home message

Rich behavioral models are natural and important in computerscience (e.g., synthesis).

Maybe they can be useful in other areas too. E.g., in economics:combining sufficient risk-avoidance and profitable expected return,

value-at-risk models.

Thank you! Any question?Reconciling Rationality and Stochasticity Mickael Randour 21 / 21

Page 31: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

References I

T. Brazdil, T. Chen, V. Forejt, P. Novotny, and A. Simaitis.

Solvency Markov decision processes with interest.In Proc. of FSTTCS, volume 24 of LIPIcs, pages 487–499. Schloss Dagstuhl - Leibniz-Zentrum fuerInformatik, 2013.

R. Brenguier, L. Clemente, P. Hunter, G.A. Perez, M. Randour, J.-F. Raskin, O. Sankur, and M. Sassolas.

Non-zero sum games for reactive synthesis.In Proc. of LATA, LNCS 9618, pages 3–23. Springer, 2016.

P. Bouyer, U. Fahrenberg, K.G. Larsen, N. Markey, and J. Srba.

Infinite runs in weighted timed automata with energy constraints.In Proc. of FORMATS, LNCS 5215, pages 33–47. Springer, 2008.

V. Bruyere, E. Filiot, M. Randour, and J.-F. Raskin.

Expectations or guarantees? I want it all! A crossroad between games and MDPs.In Proc. of SR, EPTCS 146, pages 1–8, 2014.

V. Bruyere, E. Filiot, M. Randour, and J.-F. Raskin.

Meet your expectations with guarantees: Beyond worst-case synthesis in quantitative games.In Proc. of STACS, LIPIcs 25, pages 199–213. Schloss Dagstuhl - LZI, 2014.

P. Bouyer, N. Markey, M. Randour, K.G. Larsen, and S. Laursen.

Average-energy games.In Proc. of GandALF, EPTCS 193, pages 1–15, 2015.

Reconciling Rationality and Stochasticity Mickael Randour 22 / 21

Page 32: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

References II

B.V. Cherkassky, A.V. Goldberg, and T. Radzik.

Shortest paths algorithms: Theory and experimental evaluation.Math. programming, 73(2):129–174, 1996.

A. Ehrenfeucht and J. Mycielski.

Positional strategies for mean payoff games.International Journal of Game Theory, 8:109–113, 1979.

J. Filar and K. Vrieze.

Competitive Markov decision processes.Springer, 1997.

E. Gradel, W. Thomas, and T. Wilke, editors.

Automata, Logics, and Infinite Games: A Guide to Current Research, LNCS 2500. Springer, 2002.

M.J. Osborne and A. Rubinstein.

A Course in Game Theory.MIT Press, 1994.

M.L. Puterman.

Markov Decision Processes: Discrete Stochastic Dynamic Programming.John Wiley & Sons, Inc., New York, NY, USA, 1st edition, 1994.

M. Randour.

Automated synthesis of reliable and efficient systems through game theory: A case study.In Proceedings of the European Conference on Complex Systems 2012, Springer Proceedings in ComplexityXVII, pages 731–738. Springer, 2013.

Reconciling Rationality and Stochasticity Mickael Randour 23 / 21

Page 33: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

References III

M. Randour.

Synthesis in Multi-Criteria Quantitative Games.PhD thesis, Universite de Mons, Belgium, 2014.

M. Randour.

Reconciling rationality and stochasticity: Rich behavioral models in two-player games.CoRR, abs/1603.05072, 2016.

M. Randour.

Reconciling rationality and stochasticity: Rich behavioral models in two-player games.Talk at GAMES 2016 - 5th World Congress of the Game Theory Society, 2016.

M. Randour, J.-F. Raskin, and O. Sankur.

Percentile queries in multi-dimensional Markov decision processes.In Proc. of CAV, LNCS 9206, pages 123–139. Springer, 2015.

M. Randour, J.-F. Raskin, and O. Sankur.

Variations on the stochastic shortest path problem.In Proc. of VMCAI, LNCS 8931, pages 1–18. Springer, 2015.

Reconciling Rationality and Stochasticity Mickael Randour 24 / 21

Page 34: Reconciling Rationality and Stochasticity: Rich Behavioral ...di.ulb.ac.be/verif/randour/slides/GAMES2016.pdf · ,! e.g., games on graphs [GTW02,Ran13,Ran14] software tools. Reconciling

Algorithmic complexity: hierarchy of problemsFor shortest path

com

ple

xity

NP∩coNP

P

LOGSPACE

LOG

NP

NP-c

coNP

coNP-c

PSPACE

EXPTIME

EXPSPACE

...

ELEMENTARY...

2EXPTIME

PR

DECIDABLE

UNDECIDABLE

not computable by an algorithm

Solutions 1 (E)and 3 (wc)

Solution 4 (BWC)Solutions 2 (P)and 5 (percentile)

Reconciling Rationality and Stochasticity Mickael Randour 25 / 21