SE 420 1

This time: Outline

• Game playing
• The minimax algorithm
• Resource limitations
• alpha-beta pruning
• Elements of chance

SE 420 2

What kind of games?

• Abstraction: to describe a game we must capture every relevant aspect of the game, for example:
  • Chess
  • Tic-tac-toe
  • …

• Accessible environments: Such games are characterized by perfect information

• Search: game-playing then consists of a search through possible game positions

• Unpredictable opponent: introduces uncertainty, so game-playing must deal with contingency problems

SE 420 3

Searching for the next move

• Complexity: many games have a huge search space
  • Chess: b = 35, m = 100, so nodes = 35^100 ≈ 10^154

    If each node takes about 1 ns to explore, then each move would take on the order of 10^135 millennia to calculate.

• Resource (e.g., time, memory) limit: optimal solution not feasible/possible, thus must approximate

1. Pruning: makes the search more efficient by discarding portions of the search tree that cannot improve the quality of the result.

2. Evaluation functions: heuristics to evaluate utility of a state without exhaustive search.
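As a rough sanity check on the chess estimate above, the following sketch recomputes the node count and the running time in log10 space, so that no huge integers are needed. The values b = 35, m = 100 and 1 ns per node come from the slide; the length of a year (365.25 days) is an assumption made here.

```python
import math

b, m = 35, 100                        # branching factor and game length from the slide
log10_nodes = m * math.log10(b)       # log10(35^100)

# At 1 ns per node, time in seconds is nodes * 1e-9.
log10_seconds = log10_nodes - 9

# Assumption: one millennium = 1000 years of 365.25 days each.
SECONDS_PER_MILLENNIUM = 1000 * 365.25 * 24 * 3600
log10_millennia = log10_seconds - math.log10(SECONDS_PER_MILLENNIUM)

print(f"nodes     ~ 10^{log10_nodes:.1f}")
print(f"millennia ~ 10^{log10_millennia:.1f}")
```

Either way the conclusion is the same: exhaustive search of the chess tree is out of reach, which is what motivates pruning and evaluation functions.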

SE 420 4

Two-player games

• A game formulated as a search problem:

  • Initial state: ?
  • Operators: ?
  • Terminal state: ?
  • Utility function: ?

SE 420 5

Two-player games

• A game formulated as a search problem:

  • Initial state: board position and turn
  • Operators: definition of legal moves
  • Terminal state: conditions for when the game is over
  • Utility function: a numeric value that describes the outcome of the game, e.g., -1, 0, 1 for loss, draw, win (AKA payoff function)
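The four components above can be sketched for tic-tac-toe roughly as follows. This is a minimal illustration, not the course's reference code: the function names (`initial_state`, `legal_moves`, `result`, `is_terminal`, `utility`) and the tuple-based board encoding are assumptions chosen for this example.

```python
from typing import List, Optional, Tuple

Board = Tuple[str, ...]        # 9 cells, each "X", "O" or " "
State = Tuple[Board, str]      # (board position, player to move)

LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),    # rows
         (0, 3, 6), (1, 4, 7), (2, 5, 8),    # columns
         (0, 4, 8), (2, 4, 6)]               # diagonals

def initial_state() -> State:
    """Initial state: empty board, X to move."""
    return (" ",) * 9, "X"

def legal_moves(state: State) -> List[int]:
    """Operators: any empty cell may be played."""
    board, _ = state
    return [i for i, cell in enumerate(board) if cell == " "]

def result(state: State, move: int) -> State:
    """Apply a move and hand the turn to the other player."""
    board, player = state
    new_board = board[:move] + (player,) + board[move + 1:]
    return new_board, ("O" if player == "X" else "X")

def winner(board: Board) -> Optional[str]:
    for a, b, c in LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None

def is_terminal(state: State) -> bool:
    """Terminal test: someone has three in a row, or the board is full."""
    board, _ = state
    return winner(board) is not None or " " not in board

def utility(state: State, player: str = "X") -> int:
    """Utility: +1 win, 0 draw, -1 loss, from `player`'s point of view."""
    w = winner(state[0])
    if w is None:
        return 0
    return 1 if w == player else -1
```

Keeping the state immutable (tuples) means `result` never disturbs the position it was called on, which is convenient once a search algorithm starts exploring many branches from the same state.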

SE 420 6

Game vs. search problem

SE 420 7

Example: Tic-Tac-Toe

SE 420 8

Type of games

SE 420 9

Type of games

SE 420 10

The minimax algorithm

• Perfect play for deterministic environments with perfect information

• Basic idea: choose the move with the highest minimax value = best achievable payoff against best play

• Algorithm:
  1. Generate the game tree completely
  2. Determine the utility of each terminal state
  3. Propagate the utility values upward in the tree by applying the MIN and MAX operators on the nodes in the current level
  4. At the root node, use the minimax decision to select the move with the max (of the min) utility value

• Steps 2 and 3 in the algorithm assume that the opponent will play perfectly.
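The steps above can be sketched on a small, fully generated tree. In this illustration the tree is a nested Python list whose leaves are utilities; the representation, the function names, and the particular two-ply tree are assumptions made for the example, not part of the slides' pseudocode.

```python
from typing import List, Tuple, Union

Tree = Union[int, List["Tree"]]    # a leaf utility, or a list of subtrees

def minimax_value(node: Tree, maximizing: bool) -> int:
    """Back up utilities: MAX levels take the max, MIN levels the min."""
    if isinstance(node, int):      # terminal state: return its utility
        return node
    values = [minimax_value(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

def minimax_decision(root: List[Tree]) -> Tuple[int, int]:
    """Return (index of best move, its value) for MAX at the root."""
    values = [minimax_value(child, maximizing=False) for child in root]
    best = max(range(len(values)), key=values.__getitem__)
    return best, values[best]

# Illustrative two-ply tree: MAX to move, three MIN replies each.
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(minimax_decision(tree))      # (0, 3)
```

The MIN nodes back up 3, 2 and 2, so MAX picks the first move with value 3: the maximum of the minimums.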

SE 420 11

Generate Game Tree

SE 420 12

Generate Game Tree

[Figure: root of the tic-tac-toe game tree with X's possible first moves]

SE 420 13

Generate Game Tree

[Figure: tic-tac-toe game tree extended with O's replies to X's first move]

SE 420 14

Generate Game Tree

[Figure: tic-tac-toe game tree with X's first move and O's replies, annotated with "1 ply" and "1 move"]

SE 420 15

A subtree

[Figure: subtree of the tic-tac-toe game tree, expanded down to terminal positions marked win, lose, or draw]

SE 420 16

What is a good move?

[Figure: the same tic-tac-toe subtree, with terminal positions marked win, lose, or draw]

SE 420 17

Minimax

[Figure: two-ply game tree with leaf values 3, 12, 8 | 2, 4, 6 | 14, 5, 2]

• Minimize opponent’s chance
• Maximize your chance

SE 420 18

Minimax

[Figure: the same tree with leaf values 3, 12, 8 | 2, 4, 6 | 14, 5, 2; the MIN level backs up the values 3, 2, 2]

• Minimize opponent’s chance
• Maximize your chance

SE 420 19

Minimax

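[Figure: the same tree with leaf values 3, 12, 8 | 2, 4, 6 | 14, 5, 2; the MIN level backs up 3, 2, 2 and the MAX root takes the value 3]

• Minimize opponent’s chance
• Maximize your chance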

SE 420 20

Minimax

[Figure: the same tree with leaf values 3, 12, 8 | 2, 4, 6 | 14, 5, 2; MIN level 3, 2, 2; MAX root 3]

• Minimize opponent’s chance
• Maximize your chance

SE 420 21

minimax = maximum of the minimum

1st ply

2nd ply

SE 420 22

Minimax: Recursive implementation

Complete: ?
Optimal: ?

Time complexity: ?
Space complexity: ?

SE 420 23

Minimax: Recursive implementation

Complete: Yes, for finite state-space
Optimal: Yes

Time complexity: O(b^m)
Space complexity: O(bm) (= DFS; does not keep all nodes in memory)
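To make the two complexity figures concrete, the sketch below walks a uniform tree depth-first, the way recursive minimax does, and records how many nodes it visits versus how deep the recursion ever gets. The uniform tree (every node has exactly b children, all leaves at depth m) and the function name `count_nodes` are simplifying assumptions for illustration.

```python
def count_nodes(b: int, m: int) -> dict:
    """Depth-first walk of a uniform tree with branching b and depth m."""
    stats = {"visited": 0, "leaves": 0, "peak_depth": 0}

    def visit(depth: int) -> None:
        stats["visited"] += 1
        stats["peak_depth"] = max(stats["peak_depth"], depth)
        if depth == m:                 # terminal level reached
            stats["leaves"] += 1
            return
        for _ in range(b):             # expand each of the b successors
            visit(depth + 1)

    visit(0)
    return stats

stats = count_nodes(b=3, m=4)
print(stats)   # {'visited': 121, 'leaves': 81, 'peak_depth': 4}
```

The number of leaves is b^m (here 3^4 = 81), which dominates the running time, while only the current path of at most m nodes is on the call stack at any moment. If all b successors of each node on that path are generated at once, the memory in use is proportional to b·m.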

SE 420 24

1. Move evaluation without complete search

• Complete search is too complex and impractical

• Evaluation function: evaluates the value of a state using heuristics and cuts off the search

• New MINIMAX:
  • CUTOFF-TEST: cutoff test to replace the termination condition (e.g., deadline, depth-limit, etc.)
  • EVAL: evaluation function to replace the utility function (e.g., number of chess pieces taken)
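A minimal sketch of minimax with CUTOFF-TEST and EVAL plugged in, under stated assumptions: the tree is a nested list with numeric leaves, the cutoff is a plain depth limit, and the heuristic (`mean_eval`, the average of the leaves below a node) is a made-up stand-in for a real evaluation function.

```python
from statistics import mean

def leaf_values(node):
    """Collect all leaf values below a node (a number is its own leaf)."""
    if isinstance(node, (int, float)):
        return [node]
    return [v for child in node for v in leaf_values(child)]

def mean_eval(node):
    """Hypothetical EVAL: average of the leaves below the node."""
    return mean(leaf_values(node))

def depth_limit(limit):
    """Build a CUTOFF-TEST that stops at leaves or at the given depth."""
    def cutoff_test(node, depth):
        return isinstance(node, (int, float)) or depth >= limit
    return cutoff_test

def cutoff_minimax(node, depth, maximizing, cutoff_test, eval_fn):
    """Minimax where CUTOFF-TEST replaces the terminal test and EVAL the utility."""
    if cutoff_test(node, depth):
        return eval_fn(node)
    values = [cutoff_minimax(child, depth + 1, not maximizing, cutoff_test, eval_fn)
              for child in node]
    return max(values) if maximizing else min(values)

tree = [[[3, 12], [8, 4]], [[4, 6], [14, 5]]]
shallow = cutoff_minimax(tree, 0, True, depth_limit(2), mean_eval)   # heuristic at depth 2
full = cutoff_minimax(tree, 0, True, depth_limit(99), mean_eval)     # reaches every leaf
print(shallow, full)
```

On this toy tree the depth-2 search returns 6 while the full search returns 8, which shows that a cutoff search is only as good as its evaluation function.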

SE 420 25

Evaluation functions

• Weighted linear evaluation function: to combine n heuristics

f = w_1 f_1 + w_2 f_2 + … + w_n f_n

E.g., the w's could be the values of pieces (1 for pawn, 3 for bishop, etc.) and the f's could be the number of each type of piece on the board.
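A small sketch of such a weighted linear evaluation for chess material. The pawn and bishop weights come from the slide; the knight, rook and queen weights are the conventional values and are an assumption here, as is the choice of using the difference in piece counts (own minus opponent) as each feature f_i.

```python
# Weights w_i: pawn and bishop from the slide, the rest are conventional (assumed).
PIECE_WEIGHTS = {"pawn": 1, "knight": 3, "bishop": 3, "rook": 5, "queen": 9}

def evaluate(own: dict, opponent: dict) -> int:
    """f = sum_i w_i * f_i, with f_i = own count minus opponent count of piece i."""
    return sum(weight * (own.get(piece, 0) - opponent.get(piece, 0))
               for piece, weight in PIECE_WEIGHTS.items())

own = {"pawn": 8, "bishop": 2, "rook": 2, "queen": 1}
opponent = {"pawn": 7, "bishop": 1, "rook": 2, "queen": 1}
print(evaluate(own, opponent))   # 4: one extra pawn (1) plus one extra bishop (3)
```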

SE 420 26

Note: exact values do not matter

SE 420 27

Minimax with cutoff: viable algorithm?

Assume we have 100 seconds and can evaluate 10^4 nodes/s; then we can evaluate 10^6 nodes/move.
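To see what that budget buys, the sketch below converts it into a search depth. The time and speed figures are from the slide; the branching factor b = 35 is the chess value used earlier in the deck, and treating the tree as uniform is a simplifying assumption.

```python
import math

time_per_move_s = 100
nodes_per_second = 10**4
budget = time_per_move_s * nodes_per_second    # 10^6 nodes per move

b = 35                                         # chess branching factor from the earlier slide
depth = math.log(budget, b)                    # solve b^depth = budget

print(f"budget = {budget} nodes, reachable depth ~ {depth:.1f} ply")
```

Since 35^3 is about 43 thousand and 35^4 is about 1.5 million, the budget covers roughly a 4-ply look-ahead with plain minimax, which is very shallow for chess.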

SE 420 28

2. α-β pruning: search cutoff

• Pruning: eliminating a branch of the search tree from consideration without exhaustive examination of each node

• α-β pruning: the basic idea is to prune portions of the search tree that cannot improve the utility value of the max or min node, by just considering the values of nodes seen so far.

• Does it work? Yes, with good move ordering it roughly cuts the branching factor from b to √b, resulting in look-ahead twice as deep as pure minimax
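A rough way to see why the look-ahead doubles, assuming the best case of perfect move ordering (an assumption added here): the number of nodes examined at depth m is

```latex
N_{\text{minimax}}(m) = O\!\left(b^{m}\right), \qquad
N_{\alpha\beta}(m) = O\!\left(b^{m/2}\right) = O\!\left((\sqrt{b})^{m}\right)
\quad\Longrightarrow\quad
N_{\alpha\beta}(2m) = O\!\left(b^{m}\right)
```

so the budget that lets minimax reach depth m lets α-β reach about depth 2m. For chess with b = 35, the effective branching factor becomes about √35 ≈ 5.9.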

SE 420 29

α-β pruning: example

[Figure: MAX root with value 6 over a MIN level; the first MIN node has value 6, backed up from leaves 6, 12, 8]

SE 420 30

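α-β pruning: example

[Figure: MAX root with value 6; first MIN node has value 6 from leaves 6, 12, 8; second MIN node has value 2 after its first leaf 2]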

SE 420 31

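α-β pruning: example

[Figure: MAX root with value 6; first MIN node has value 6 from leaves 6, 12, 8; second MIN node has value 2 after leaf 2; third MIN node has value 5 after leaf 5]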

SE 420 32

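α-β pruning: example

[Figure: the same tree (MAX root 6; MIN nodes 6, 2, 5; leaves 6, 12, 8, 2, 5), with the branch to the first MIN node marked "Selected move"]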

SE 420 33

α-β pruning: general principle

[Figure: tree with alternating Player and Opponent levels, and nodes labeled m, n, and v]

If α > v, then MAX will choose m, so prune the tree under n

Similarly for β for MIN

SE 420 34

Properties of α-β

SE 420 35

The α-β algorithm:

SE 420 36

More on the α-β algorithm

• Same basic idea as minimax, but prune (cut away) branches of the tree that we know will not contain the solution.

SE 420 37

More on the α-β algorithm: start from Minimax

SE 420 38

Remember: Minimax: Recursive implementation

Complete: Yes, for finite state-space
Optimal: Yes

Time complexity: O(b^m)
Space complexity: O(bm) (= DFS; does not keep all nodes in memory)

SE 420 39

More on the α-β algorithm

• Same basic idea as minimax, but prune (cut away) branches of the tree that we know will not contain the solution.

• Because minimax is depth-first, let’s consider nodes along a given path in the tree. Then, as we go along this path, we keep track of:
  • α: best choice so far for MAX
  • β: best choice so far for MIN

SE 420 40

More on the α-β algorithm: start from Minimax

Note: these are both local variables. At the start of the algorithm, we initialize them to α = -∞ and β = +∞.
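A minimal sketch of minimax extended with the two local bounds α and β, assuming a nested-list tree with integer leaves. The function names follow the Max-Value / Min-Value naming used on the slides, but the code itself, the `visited` list (kept only to show which leaves get evaluated), and the two-branch example tree are assumptions made for illustration.

```python
import math
from typing import List, Union

Tree = Union[int, List["Tree"]]
visited: List[int] = []            # leaves actually evaluated, for illustration only

def max_value(node: Tree, alpha: float, beta: float) -> float:
    if isinstance(node, int):
        visited.append(node)
        return node
    for child in node:
        alpha = max(alpha, min_value(child, alpha, beta))
        if alpha >= beta:          # cutoff: MIN above would never let play reach here
            return beta
    return alpha

def min_value(node: Tree, alpha: float, beta: float) -> float:
    if isinstance(node, int):
        visited.append(node)
        return node
    for child in node:
        beta = min(beta, max_value(child, alpha, beta))
        if beta <= alpha:          # cutoff: MAX above already has something at least as good
            return alpha
    return beta

tree = [[5, 10, 6], [2, 8, 7]]     # MAX root over two MIN nodes
value = max_value(tree, -math.inf, math.inf)
print(value, visited)              # 5 [5, 10, 6, 2]
```

After the first MIN node returns 5, the root holds α = 5. In the second MIN node the first leaf gives β = 2, which is not better than α, so the loop ends early and the leaves 8 and 7 are never evaluated.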

SE 420 41

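More on the α-β algorithm

[Figure: three-level tree (MAX, MIN, MAX, …) with leaf values 5, 10, 6 | 2, 8, 7]

Initial call: α = -∞, β = +∞

In Min-Value (Min-Value loops over the leaves 5, 10, 6): α = -∞, β = 5 (shown three times, once per leaf)

Max-Value loops over the MIN nodes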

SE 420 42

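More on the α-β algorithm

[Figure: three-level tree (MAX, MIN, MAX, …) with leaf values 5, 10, 6 | 2, 8, 7]

Initial call: α = -∞, β = +∞

At the leaves 5, 10, 6: α = -∞, β = 5 (shown three times, once per leaf)

In Max-Value (Max-Value loops over the MIN nodes): α = 5, β = +∞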

SE 420 43

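More on the α-β algorithm

[Figure: three-level tree (MAX, MIN, MAX, …) with leaf values 5, 10, 6 | 2, 8, 7]

Initial call: α = -∞, β = +∞

At the leaves 5, 10, 6: α = -∞, β = 5 (shown three times, once per leaf)

In Min-Value (Min-Value loops over the leaves 2, 8, 7): starts with α = 5, β = +∞; after leaf 2: α = 5, β = 2. End loop and return 5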

SE 420 44

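More on the α-β algorithm

[Figure: three-level tree (MAX, MIN, MAX, …) with leaf values 5, 10, 6 | 2, 8, 7]

Initial call: α = -∞, β = +∞

At the leaves 5, 10, 6: α = -∞, β = 5 (shown three times, once per leaf)

Second MIN node: starts with α = 5, β = +∞; after leaf 2: α = 5, β = 2. End loop and return 5

In Max-Value (Max-Value loops over the MIN nodes): α = 5, β = +∞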

SE 420 45

Example

SE 420 46

α-β algorithm:

SE 420 47

Solution

(-I and +I stand for -∞ and +∞.)

NODE  TYPE  ALPHA  BETA  SCORE
A     Max   -I     +I
B     Min   -I     +I
C     Max   -I     +I
D     Min   -I     +I
E     Max   10     10    10
D     Min   -I     10
F     Max   11     11    11
D     Min   -I     10    10
C     Max   10     +I
G     Min   10     +I
H     Max   9      9     9
G     Min   10     9     9
C     Max   10     +I    10
B     Min   -I     10
J     Max   -I     10
K     Min   -I     10
L     Max   14     14    14
K     Min   -I     10    10
…
J     Max   10     10    10
B     Min   -I     10    10
A     Max   10     +I
Q     Min   10     +I
R     Max   10     +I
S     Min   10     +I
T     Max   5      5     5
S     Min   10     5     5
R     Max   10     +I
V     Min   10     +I
W     Max   4      4     4
V     Min   10     4     4
R     Max   10     +I    10
Q     Min   10     10    10
A     Max   10     10    10

SE 420 48

State-of-the-art for deterministic games

SE 420 49

Nondeterministic games

SE 420 50

Algorithm for nondeterministic games

SE 420 51

Remember: Minimax algorithm

SE 420 52

Nondeterministic games: the element of chance

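[Figure: game tree with a CHANCE level; each chance outcome has probability 0.5, and the values still to be computed are marked "?"]

expectimax and expectimin: expected values over all possible outcomes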

SE 420 53

Nondeterministic games: the element of chance

[Figure: game tree with a CHANCE node whose two outcomes, each with probability 0.5, have values 3 and 5]

CHANCE: 4 = 0.5*3 + 0.5*5

Expectimax

Expectimin

SE 420 54

Evaluation functions: Exact values DO matter

Order-preserving transformations do not necessarily behave the same!
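A small sketch of why exact values matter once chance nodes are involved. The move names, probabilities and leaf values below are invented for illustration; the rescaling keeps the leaves in the same order but changes which move has the higher expected value.

```python
def expected_value(outcomes):
    """Value of a chance node: probability-weighted average of its outcomes."""
    return sum(p * v for p, v in outcomes)

def best_move(moves):
    """MAX picks the move whose chance node has the highest expected value."""
    return max(moves, key=lambda name: expected_value(moves[name]))

# Hypothetical position: each move leads to a chance node with two outcomes.
moves = {"A1": [(0.9, 2), (0.1, 3)],
         "A2": [(0.9, 1), (0.1, 4)]}

# Order-preserving rescaling of the leaf values: 1 < 2 < 3 < 4 becomes 1 < 20 < 30 < 400.
rescale = {1: 1, 2: 20, 3: 30, 4: 400}
rescaled = {name: [(p, rescale[v]) for p, v in outs] for name, outs in moves.items()}

print(best_move(moves))      # A1 (2.1 vs 1.3)
print(best_move(rescaled))   # A2 (21 vs 40.9)
```

With plain minimax only the ordering of the leaf values affects the decision, but an expected value is an average, so the evaluation function has to be meaningful in magnitude and not only in rank.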

SE 420 55

State-of-the-art for nondeterministic games

SE 420 56

Summary

SE 420 57

Exercise: Game Playing

Consider the following game tree in which the evaluation function values are shown below each leaf node. Assume that the root node corresponds to the maximizing player. Assume the search always visits children left-to-right.

[Figure: four-level game tree, with levels labeled Max, Min, Max, Min from the root down. Root: A. Second level: B, C, D. Third level: E, F, G, H, I, J, K. Leaves, left to right: L, M, N, O, P, Q, R, S, T, U, V, W, X, Y, with values 2, 3, 8, 5, 7, 6, 0, 1, 5, 2, 8, 4, 10, 2.]

(a) Compute the backed-up values computed by the minimax algorithm. Show your answer by writing values at the appropriate nodes in the above tree.

(b) Compute the backed-up values computed by the alpha-beta algorithm. What nodes will not be examined by the alpha-beta pruning algorithm?

(c) What move should Max choose once the values have been backed-up all the way?
