18
Empirical Analysis of Ideal Crossover on Random Additively Decomposable Problems Kumara Sastry 1 , Martin Pelikan 2 , David E. Goldberg 1 1 Illinois Genetic Algorithms Laboratory (IlliGAL) University of Illinois at Urbana-Champaign, Urbana, IL 61801 2 Missouri Estimation of Distribution Algorithm Lab (MEDAL) University of Missouri at St. Louis, St. Louis, MO [email protected] , [email protected] , [email protected] http://www.illigal.uiuc.edu , http://medal.cs.umsl.edu Supported by AFOSR FA9550-06-1-0096, NSF DMR 03-25939, and CAREER ECS-0547013. Computational results obtained using CSE’s Turing cluster.

Empirical Analysis of ideal recombination on random decomposable problems

Embed Size (px)

DESCRIPTION

This paper analyzes the behavior of a selectorecombinative genetic algorithm (GA) with an ideal crossover on a class of random additively decomposable problems (rADPs). Specifically, additively decomposable problems of order k whose subsolution fitnesses are sampled from the standard uniform distribution U[0,1] are analyzed. The scalability of the selectorecombinative GA is investigated for 10,000 rADP instances. The validity of facetwise models in bounding the population size, run duration, and the number of function evaluations required to successfully solve the problems is also verified. Finally, rADP instances that are easiest and most difficult are also investigated.

Citation preview

Page 1: Empirical Analysis of ideal recombination on random decomposable problems

Empirical Analysis of Ideal Crossover on Random Additively Decomposable Problems

Kumara Sastry1, Martin Pelikan2, David E. Goldberg1

1Illinois Genetic Algorithms Laboratory (IlliGAL)University of Illinois at Urbana-Champaign, Urbana, IL 618012Missouri Estimation of Distribution Algorithm Lab (MEDAL)

University of Missouri at St. Louis, St. Louis, MO

[email protected], [email protected], [email protected]://www.illigal.uiuc.edu, http://medal.cs.umsl.edu

Supported by AFOSR FA9550-06-1-0096, NSF DMR 03-25939, and CAREER ECS-0547013. Computational results obtained using CSE’s Turing cluster.

Page 2: Empirical Analysis of ideal recombination on random decomposable problems

2

RoadmapAdversarial test problem design

Random additively decomposable problems

Ideal crossover

Scalability of selectorecombinative GAsPopulation sizing and Run duration

Experimental Procedure

Key Results

Summary and Conclusions

Page 3: Empirical Analysis of ideal recombination on random decomposable problems

3

Adversarial Test Problem Design

Test systems on boundary of design envelopeCommon approach in designing complex systems

GAs are complex systems [Goldberg, 2002]

GA design envelope characterized by different dimensions of problem difficulty

Thwart the mechanism of GAs to the extreme

P

Fluctuating

Deception NoiseScaling R

Page 4: Empirical Analysis of ideal recombination on random decomposable problems

4

Random Additively Decomposable Problem

Focus on nearly decomposable problems [Simon, 1960]

Three desired featuresScalability: Able to control problem size and difficultyKnown optimum: Allows comparison of different solversEasy problem instance generation

rADP fitness function:

Si represents variable subset for ith subproblemEach subset consists of k bitsgi is the fitness of the ith subproblemgi is sampled from uniform distribution U[0,1]

Page 5: Empirical Analysis of ideal recombination on random decomposable problems

5

Ideal Crossover: Exchange Building Blocks

Population sizing and run duration models assume good exchange of building blocks

Simulate what we ideally want to achieve with model-building GAs

For example, extended compact GA [Harik, 1999]

Ideal recombination operatorEffectively exchange building blocksDon’t disrupt any building block

Uniform building-block-wise crossoverExchange BBs with probability 0.5 BBs #1 and #3 exchanged

Page 6: Empirical Analysis of ideal recombination on random decomposable problems

6

Purpose: Analyze Ideal Crossover on rADPs

Analyze behavior of selectorecombinative GAs on rADPs

Verify the validity of lessons learned from adversarial test problems

Expand the pool of test problems

Page 7: Empirical Analysis of ideal recombination on random decomposable problems

7

Selectorecombinative GA Population Sizing

Gambler’s ruin model[Harik, et al, 1997]

Combines decision making and supply models

Additive Gaussian noise with variance σ2

N

Noise-to-fitness variance ratio

Error toleranceSignal-to-Noise ratio # Competing sub-components

# Components (# BBs)

Page 8: Empirical Analysis of ideal recombination on random decomposable problems

8

GA Run Duration (Selection)

Selection-Intensity based model [Bulmer, 1980; Mühlenbein & Schlierkamp-Voosen, 1993; Thierens & Goldberg, 1994; Bäck, 1994; Miller & Goldberg, 1995 & 1996]

Derived for the OneMax problemApplicable to additively-separable problems [Miller, 1997]

Selection Intensity

Problem size (m·k )

[Miller & Goldberg, 1995; Sastry & Goldberg, 2002]

Page 9: Empirical Analysis of ideal recombination on random decomposable problems

9

GA Run Duration (Drift)

Accumulation of stochastic errors due to finite population

Proportion of competing sub-solutions change due to drift

Drift time [Goldberg & Segrest, 1987]:

Substituting population sizing bound

Page 10: Empirical Analysis of ideal recombination on random decomposable problems

10

Signal-to-Noise Ratio for rADPs

Signal d is the fitness difference between best and second best sub-solutions

jth order statistic follows a Beta distribution with α = j and β = 2k-j+1

Probability density function (p.d.f) of d:

p.d.f. of sub-solution fitness variance approximation

E[1/d] = 2k and E[σ2BB] ≈ 1/12

Page 11: Empirical Analysis of ideal recombination on random decomposable problems

11

Assumptions and Experimental SetupNon-overlapping sub-problems

Identical sub-problems across different partitionsg1 = g2 = … = gm

Selectorecombinative GABinary tournament selection

10,000 random problem instancesm = 5 – 50, k = 3, 4, and 5

Minimum population size determined by bisection methodPopulation correctly converges to at least m-1 out of m BBs in 49 out of 50 independent runsAveraged over 30 bisection runsResults averaged over 1,500 GA runs

Page 12: Empirical Analysis of ideal recombination on random decomposable problems

12

Population Sizing & Run Duration Histograms

Tail increases with m0.15-0.59% of rADP instances require # evals greater than 3σ from the median

Population sizem = 50

Run durationm = 50

Page 13: Empirical Analysis of ideal recombination on random decomposable problems

13

Easy and Hard Problem Instances

Sorted subsolution index

Subs

olut

ion

fitn

ess

Sorted subsolution index

Subs

olut

ion

fitn

ess

Hard instance

Easy instance

Min signalMax noise

Max signalMin noise

Page 14: Empirical Analysis of ideal recombination on random decomposable problems

14

Population Sizing Scalability

Gambler’s ruin model bounds population sizing

Page 15: Empirical Analysis of ideal recombination on random decomposable problems

15

Run Duration Scalability

Selection-intensity based run-duration model bounds median convergence time

Drift-time model bounds convergence time

Page 16: Empirical Analysis of ideal recombination on random decomposable problems

16

Number of Function Evaluations Scalability

Facetwise models are applicable to rADPsTesting on adversarial problems bounds performance of GAs on rADPs

Page 17: Empirical Analysis of ideal recombination on random decomposable problems

17

Easy and Hard Scalable Problem Instances

Easy instances have large signal differenceHard instances have very small signal difference

Easy ScalableInstances

Hard ScalableInstances

Page 18: Empirical Analysis of ideal recombination on random decomposable problems

18

Summary and Conclusions

Empirically analyzed behavior of selectorecombinative GA with ideal crossover:

Class of random additively decomposable problemsSub-solution fitness sampled from uniform distribution

Verified applicability of facetwise models:Developed based on adversarial problemsGA scales subquadratically with problem size

Analyzed easy and hard problem instances:Easy problem instances have large signal, small variance.Hard problem instances have small signal, large variance