Levels of Abstraction in Probabilistic Modeling and Sampling


Moshe Looks
November 18th, 2005


Outline

• Global Optimization with Graphical Models

• Where Do Our Random Variables Come From?

• Incorporating Levels of Abstraction

• Results

• Conclusions


Global Optimization

• User determines
  – How instances are represented
    • E.g., fixed-length bit-strings drawn from {0,1}^n
  – How instance quality is evaluated
    • E.g., a fitness function from {0,1}^n to R
    • May be expensive to compute

• Assumes a black box
  – No additional problem knowledge


Approaches

• Blind search
  – Generate and test random instances

• Local search (hill-climbing, annealing, etc.)
  – Search from a single (best) instance seen

• Population-based search
  – Search from a collection of good instances seen


Population-Based Search

1. Generate a population of random instances

2. Recombine promising instances in the population to create new instances

3. Remove the worst instances from the population

4. Go to step 2
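A minimal Python sketch of this loop; the fitness function, the random_instance generator, and the recombine operator are problem-specific placeholders, not part of the original slides:

```python
import random

def population_search(fitness, random_instance, recombine,
                      pop_size=100, generations=50):
    """Generic population-based search skeleton (steps 1-4 above)."""
    # 1. Generate a population of random instances
    pop = [random_instance() for _ in range(pop_size)]
    for _ in range(generations):
        # 2. Recombine promising instances to create new instances
        promising = sorted(pop, key=fitness, reverse=True)[:pop_size // 2]
        children = [recombine(random.choice(promising),
                              random.choice(promising))
                    for _ in range(pop_size // 2)]
        # 3. Remove the worst instances from the population
        pop = sorted(pop + children, key=fitness, reverse=True)[:pop_size]
        # 4. Loop back to step 2
    return max(pop, key=fitness)
```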


Population-Based Search

• How to generate new instances?

• Genetic Algorithms
  – Crossover + mutation

• Estimation of Distribution Algorithms (EDAs)
  – Generate instances by sampling from a probability distribution reflecting the good instances
  – How to represent the distribution?


Probability-Vector EDA (Bit-String Case)

• Each position in the bit-string corresponds to a random variable in the model; X = {X1, X2, …, Xn}

• Assume independence
  – For the population 001, 111, and 101:
    • P(X1=0) = 1/3, P(X1=1) = 2/3
    • P(X2=0) = 2/3, P(X2=1) = 1/3
    • P(X3=0) = 0,   P(X3=1) = 1
  – E.g., P(011) = P(X1=0) · P(X2=1) · P(X3=1) = 1/3 · 1/3 · 1 = 1/9
  – Can generate instances according to the distribution

• Population-Based Incremental Learning (PBIL) – Baluja, 1995
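A minimal sketch of the probability-vector idea, using the population from this slide; it illustrates the fully factorized model, not Baluja's exact PBIL update rule:

```python
import random

def learn_marginals(population):
    """Estimate P(Xi = 1) independently for each bit position."""
    n = len(population[0])
    return [sum(inst[i] for inst in population) / len(population)
            for i in range(n)]

def sample(probs):
    """Draw one instance from the fully factorized distribution."""
    return [1 if random.random() < p else 0 for p in probs]

# The population from the slide: 001, 111, 101
population = [[0, 0, 1], [1, 1, 1], [1, 0, 1]]
probs = learn_marginals(population)   # [2/3, 1/3, 1.0]
print(sample(probs))                  # e.g., [1, 0, 1]
```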


Graphical Models

• Probability + graph theory
  – Nodes are random variables
  – Graph structure encodes variable dependencies

• Great for
  – Uncertainty
  – Complexity
  – Learning


The Bayesian Optimization Algorithm (Pelikan, Goldberg, and Cantú-Paz, 1999)

• Dynamically learn dependencies between variables (nodes in a Bayesian network)

• Without dependencies:
  P(X1 X2 X3 X4) = P(X1) · P(X2) · P(X3) · P(X4)

• With dependencies:
  P(X1 X2 X3 X4) = P(X1) · P(X2 | X1) · P(X3) · P(X4 | X1, X3)
  – Dependencies (edges) correspond to partitions of the target node's distribution based on the source node's distribution

• Variables must now be sampled in topological order
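A minimal sketch of sampling in topological order (ancestral sampling) for the four-variable factorization above; the conditional probability tables are invented purely for illustration:

```python
import random

# Hypothetical tables for P(X1)·P(X2|X1)·P(X3)·P(X4|X1,X3); values made up.
p_x1 = 0.7                                   # P(X1 = 1)
p_x2_given_x1 = {0: 0.2, 1: 0.9}             # P(X2 = 1 | X1)
p_x3 = 0.5                                   # P(X3 = 1)
p_x4_given_x1_x3 = {(0, 0): 0.1, (0, 1): 0.4,
                    (1, 0): 0.6, (1, 1): 0.95}  # P(X4 = 1 | X1, X3)

def bernoulli(p):
    return 1 if random.random() < p else 0

def sample_instance():
    """Parents are always drawn before their children."""
    x1 = bernoulli(p_x1)
    x3 = bernoulli(p_x3)                     # X3 has no parents
    x2 = bernoulli(p_x2_given_x1[x1])
    x4 = bernoulli(p_x4_given_x1_x3[(x1, x3)])
    return (x1, x2, x3, x4)

print(sample_instance())
```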


Where Do Our Random Variables Come From?

• The real world is a mess!
  – Boundaries are fuzzy and ambiguous
  – Even in discrete domains, ambiguity remains:
    • E.g., in DNA, a gene's position is sometimes critical and sometimes irrelevant

• Consider abstracted features, defined in terms of "base-level variables"
  • E.g., contains a prime number of ones
  • E.g., does not contain the substring AATGC


Features

• Features are predicates over instances, describing the presence or absence of some pattern (sketched below)

• Base-level variables Xi, 1 ≤ i ≤ n, are a special case (i.e., "Xi = 1" is a feature)

• Any well-defined feature f may be introduced as a node in the graph for probabilistic modeling

• What about model-based instance generation?
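A small sketch of features as predicates, mirroring the examples on the previous slide; the function names are illustrative, not from the original:

```python
def prime_number_of_ones(bits):
    """Feature: instance contains a prime number of ones."""
    k = sum(bits)
    return k > 1 and all(k % d for d in range(2, int(k ** 0.5) + 1))

def base_level(i):
    """Base-level variables as the special case: the feature "Xi = 1"."""
    return lambda bits: bits[i - 1] == 1   # 1-based index, as in X1..Xn

print(prime_number_of_ones([1, 0, 1, 1]))  # True: three ones, 3 is prime
print(base_level(2)([1, 0, 1, 1]))         # False: X2 = 0
```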


Definitions for Feature-Based Sampling

• Given a set of base variables X = {X1, X2, …, Xn} and an instance x = (x1 x2 … xn):
  – S ⊆ X is sufficient for f_x if f is true for every solution with the same assignment as x for the variables in S
  – S is minimally sufficient if none of its subsets are sufficient
  – The unique grounding of f_x is the union of all minimally sufficient sets
    • If f is not present in x, the grounding of f_x is the empty set

• E.g., for f = "contains the substring 11", the grounding of f_10110111 is {X3, X4, X6, X7, X8}
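A small sketch of computing the grounding for substring features specifically, where each occurrence of the substring is one minimally sufficient set (an observation that holds for this feature class; this is not a general minimal-sufficiency algorithm):

```python
def grounding(instance, sub):
    """Grounding of the feature "contains the substring `sub`" in `instance`.

    Each occurrence of `sub` covers one minimally sufficient set of
    positions; the grounding is their union. Positions are 1-based,
    matching the X1..Xn notation on the slides.
    """
    positions = set()
    for start in range(len(instance) - len(sub) + 1):
        if instance[start:start + len(sub)] == sub:
            positions.update(range(start + 1, start + len(sub) + 1))
    return positions  # empty set if the feature is absent

# The slide's example: grounding of f_10110111 for f = "contains 11"
print(sorted(grounding("10110111", "11")))   # [3, 4, 6, 7, 8]
```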


Generalizing Variable Assignment in Instance Generation

• Assigning f_x to a new instance means:
  – When the feature is present in x:
    • Partitioning the distribution to include the grounding of f_x
  – When the feature is absent from x:
    • Partitioning the distribution to exclude the groundings of f for all instances in the population


Generalizing Variable Assignment in Instance Generation

• Consider assignment with f = "contains the substring 11"

We may now generate instances as before, although some assignments may fail (i.e., features may overlap)

[Figure: assignment with f_110 (instance i1) and with f_010 (instance i3)]
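A rough illustration of the two cases under simplifying assumptions: clamping the grounded positions when the feature is present, and rejection sampling when it is absent (which approximates excluding the population's groundings). This is an illustration only, not the paper's exact procedure:

```python
import random

def bernoulli(p):
    return 1 if random.random() < p else 0

def assign_feature(probs, x, ground, present, has_feature, tries=100):
    """Sketch of generalized assignment. probs: per-position marginals;
    x: the donor instance; ground: grounding of f in x (1-based positions);
    present: whether f is present in x; has_feature: predicate for f."""
    for _ in range(tries):
        inst = [bernoulli(p) for p in probs]
        if present:
            for i in ground:            # include the grounding of f_x:
                inst[i - 1] = x[i - 1]  # copy x's values on those positions
            return inst
        if not has_feature(inst):       # exclude the feature when absent
            return inst
    return None  # assignment failed (e.g., conflicting feature constraints)
```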


Feature-Based BOA With Motifs

• Acceptance criterion for a motif f: [formula given on the original slide], where:

• F is the set of existing features

• count(A, B) is the number of instances in the population with all features in A and none in B

• spread(f) is the relative frequency of f across possible substring positions (there are more possibilities for short strings)
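Sketches of the two quantities as defined above; the population representation, and the reading of spread as a per-position frequency averaged over the population, are assumptions:

```python
def count(pop_features, A, B):
    """count(A, B): instances with all features in A and none in B.

    `pop_features` maps each instance to the set of features it exhibits;
    the name and representation are illustrative assumptions.
    """
    return sum(1 for feats in pop_features
               if A <= feats and not (B & feats))

def spread(instances, motif):
    """spread(f): fraction of possible substring positions where the
    motif occurs, pooled over the population (one plausible reading
    of the slide's definition)."""
    total, hits = 0, 0
    for inst in instances:
        for start in range(len(inst) - len(motif) + 1):
            total += 1
            hits += inst[start:start + len(motif)] == motif
    return hits / total if total else 0.0
```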


Feature-Based BOA With Motifs

• N (the population size) random substrings are tested as possible motifs

• c = 0.4 was chosen based on ad-hoc experimentation with small (n = 30, 60) OneMax instances

• Motif learning is O(n^2·N), assuming a fixed upper bound on the number of motifs

• The general complexity of BOA modeling is O(n^3 + n^2·N)


Test Problems

• OneMax should be very easy for fbBOA
  – Features are strings of all ones

• TwoMax (aka Twin Peaks)
  – Global optima at 0^n and 1^n
  – Features are strings of all ones and strings of all zeros
  – Requires dependency learning

• 3-deceptive
  – Hard because of traps (local optima)
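For concreteness, standard formulations of these fitness functions; the 3-deceptive block values follow a common version from the BOA literature and may differ from the ones used in this work:

```python
def onemax(bits):
    """OneMax: count of ones; optimum is the all-ones string."""
    return sum(bits)

def twomax(bits):
    """TwoMax: two global optima, at all zeros and at all ones."""
    return max(sum(bits), len(bits) - sum(bits))

def three_deceptive(bits):
    """3-deceptive: concatenated 3-bit traps. Per-block values are one
    common formulation; two ones score worst, luring search toward the
    deceptive all-zeros local optimum."""
    block_value = {0: 0.9, 1: 0.8, 2: 0.0, 3: 1.0}
    return sum(block_value[sum(bits[i:i + 3])]
               for i in range(0, len(bits), 3))
```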


TwoMax ~ Results

Qualitatively similar results were obtained for OneMax and 3-deceptive


More Test Problems

• One-Dimensional Ising Model
  – Penalizes every transition between 0 and 1
  – Large flat regions in the fitness landscape

• Hierarchical IFF (H-IFF) and Hierarchical XOR (H-XOR)
  – Adjacent pairs of variables are grouped recursively (i.e., problem size is 2^k)
  – Global optima achieved when all levels are synchronized (0^n and 1^n for H-IFF; 01101001 and 10010110 for H-XOR)
  – See the H-IFF sketch below

[Figure: cross-section of H-IFF-64's fitness landscape]
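A sketch of H-IFF fitness in the usual recursive formulation (after Watson et al.), assuming the problem size is a power of two:

```python
def hiff(bits):
    """H-IFF: a block scores its length if its bits all agree, and every
    block also passes its two halves down for recursive scoring."""
    if len(bits) == 1:
        return 1
    half = len(bits) // 2
    bonus = len(bits) if len(set(bits)) == 1 else 0
    return bonus + hiff(bits[:half]) + hiff(bits[half:])

print(hiff([1] * 8))   # 32: all levels synchronized (a global optimum)
```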


Results ~ Ising Model


Results ~ H-IFF


Results ~ H-XOR


Conclusions

• Generalized probabilistic modeling and sampling with an additional level of abstraction: features

• Constraining instance generation on an abstract level can speed up the discovery of optima by an Estimation of Distribution Algorithm

• Current/Future Work: Strings to Trees
  – Dynamically rewrite trees
    • Identify meaningful variables
  – New kinds of features
    • Semantics rather than syntax

