17
Markov Game Analysis for Attack and Defense of Power Networks Chris Y. T. Ma, David K. Y. Yau, Xin Lou, and Nageswara S. V. Rao

Markov Game Analysis for Attack and Defense of Power Networks

  • Upload
    ziv

  • View
    39

  • Download
    0

Embed Size (px)

DESCRIPTION

Markov Game Analysis for Attack and Defense of Power Networks. Chris Y. T. Ma, David K. Y. Yau, Xin Lou, and Nageswara S. V. Rao. Power Networks are Important Infrastructures (And Vulnerable to Attacks). Growing reliance on electricity Aging infrastructure - PowerPoint PPT Presentation

Citation preview

Page 1: Markov Game Analysis for  Attack and Defense of  Power Networks

Markov Game Analysis for Attack and Defense of

Power NetworksChris Y. T. Ma, David K. Y. Yau,

Xin Lou, and Nageswara S. V. Rao

Page 2: Markov Game Analysis for  Attack and Defense of  Power Networks

Power Networks are Important Infrastructures(And Vulnerable to Attacks)

• Growing reliance on electricity• Aging infrastructure• Introduced more connected digital sensing and

control devices (and attract attacks on cyber space)• Hard and expensive to protect• Limited budget• How to allocate the limited resources?– Optimal deployment to maximize long-term payoff

Page 3: Markov Game Analysis for  Attack and Defense of  Power Networks

Modeling the Interactions – Game Theoretic Approaches

• Static game– Each player has a set of actions available– Outcome and payoff determined by action of all

players– Players act simultaneously

Page 4: Markov Game Analysis for  Attack and Defense of  Power Networks

Static Game

• Example

Defend &Attack

Defend & No Attack

No defend& Attack

No defend & No Attack

Page 5: Markov Game Analysis for  Attack and Defense of  Power Networks

Modeling the Interactions – Game Theoretic Approaches

• Leader-follower game (Stackelberg game)– Defender as the leader– Adversary as the follower– Bi-level optimization – minimax operation• Inner level: follower maximizes its payoff given a

leader’s strategy• Outer level: leader maximizes its payoff subject to the

follower’s solution of the inner problem

Page 6: Markov Game Analysis for  Attack and Defense of  Power Networks

Stackelberg Game

• Example

Defend No defend

Attack NoAttack Attack No

Attack

Only model one-time interactions

Page 7: Markov Game Analysis for  Attack and Defense of  Power Networks

Modeling the Interactions – Markov Decision Process

• Markov Decision Process (MDP)– System modeled as set of states with Markov

transitions between them– Transition depends on action of one player and

some passive disruptors of known probabilistic behaviors (acts of nature)

Page 8: Markov Game Analysis for  Attack and Defense of  Power Networks

Markov Decision Process (MDP)

• Example (2 states, each has 2 actions available)

up down

Defend

No defend

Recover

No recover

0.9

0.6

0.1

0.9

0.1

0.4

0.9

0.1

Only models one intelligent player

Page 9: Markov Game Analysis for  Attack and Defense of  Power Networks

Our Approach – Markov Game

• Generalizations of MDP to an adversarial setting– Models the continual interactions between

multiple players• Players interact in the new state with different payoffs

– Models probabilistic state transition because of inherent uncertainty in the underlying physical system (e.g., random acts of nature)

Page 10: Markov Game Analysis for  Attack and Defense of  Power Networks

Problem Formulation

• Defender and adversary of a power network– Two-player zero-sum game

• Game formulation:– Adversary

• Actions: which link to attack• Payoff: cost of load shedding by the defender because of the

attack– Defender

• Actions: which (up) link to reinforce or which (down) link to recover

• Payoff: cost of load shedding because of the attack

Page 11: Markov Game Analysis for  Attack and Defense of  Power Networks

Markov Game – Reward Overview• Assume five links; link 4 both attacked and defended

(u,u,u,u,u) (u,u,u,u,u)

(u,u,u,d,u)

(u,u,u,u,u)

(u,u,u,d,u)

p1

1-p1

• Immediate reward of such actions is the weighted sum of successful attack and successful defense

• Assume at state (u,u,u,d,u), link 4 both attacked and defended again

p2

1-p2

• Immediate reward at state (u,u,u,d,u) is then the weighted sum of successful recovery and failed recovery

• This immediate reward is further “propagated” back to the original state (u,u,u,u,u) with a discount factor

• Hence, actions taken in a state will accrue a long-term reward

Page 12: Markov Game Analysis for  Attack and Defense of  Power Networks

Solving the Markov Game – Definitions

Page 13: Markov Game Analysis for  Attack and Defense of  Power Networks

Finding the Optimal Strategy – Solving a Linear Program

Page 14: Markov Game Analysis for  Attack and Defense of  Power Networks

Solving the Markov Game – Value Iteration

• Dynamic program (value iteration) to solve the Markov game

Page 15: Markov Game Analysis for  Attack and Defense of  Power Networks

Experiment Results

Link diagram

State {u,u,u,u,u}

Links 4 and 5 both connect to generator, and generator at bus 4 has higher output

Page 16: Markov Game Analysis for  Attack and Defense of  Power Networks

Experiment Results

Payoff Matrix of state {u,u,u,u,u} for the static game.

Payoff Matrix of state {u,u,u,u,u} for the Markov game. (ϒ = 0.3)

Page 17: Markov Game Analysis for  Attack and Defense of  Power Networks

Conclusions

• Using Markov game to model the attack and defense of a power network between two players

• Results show the action of players depends not only on current state, but also later states– To obtain the optimal long term benefit