Imprecise Probability in Statistical Modelling: A Critical Review

T. Augustin, M. Cattaneo, P. Fink, J. Plaß, G. Schollmeyer, G. Walter, A. Wiencierz, F. Coolen, U. Pötter, M. Seitz

University of Munich (LMU)

Toulouse, May 27th, 2015

Augustin et al.: 1 / 54


Table of contents

1 Introduction

2 Imprecise Sampling Models: Robustness/Neighbourhood Models

3 Imprecise Priors: Prior Data-Conflict

4 Imprecise Observations: Ontic View

5 Imprecise Observations: Epistemic View

6 Concluding Remarks: Outlook


Imprecision in statistics

– hide/neglect imprecision!

– model imprecision away!

!! take imprecision into account in a reliable way!

!! imprecision as a modelling tool


1. Introduction

[Diagram: Statistics, the data generation process (DGP), and Data, connected by arrows]


Two kinds of imprecision

data imprecision: imprecise observations, data are subsets of the intended sample space

* imprecise observations of something precise → epistemic

* precise observations of something imprecise → ontic

Couso & Dubois (2014, IJAR), Couso, Dubois & Sánchez (2014, Springer)

model imprecision: imprecise probability models $P(\text{Data} \,\|\, \text{Parameter})$, maybe also $P(\text{Parameter})$

set-valued approaches: take sets of values/probability distributions as the basic entity


On the power of IP in statistical modelling

defensive point of view

IP protects against the potentially disastrous consequences of applying standard procedures under violated assumptions → robustness in:

frequentist and

Bayesian settings


On the power of IP in statistical modelling

offensive point of view

IP is a most powerful methodology, allowing for

separation of variability (variance) from indeterminism

active modelling of ignorance

active modelling of conflicting/surprising information

active use of weak knowledge that cannot be used in the traditional setting

2. Imprecise Sampling Models: Robustness/Neighbourhood Models

The mantra of statistical modelling

Box & Draper (1987, Empirical Model Building and Response Surfaces, p. 424)

“Essentially, all models are wrong,

but some of them are useful”,

and sometimes dangerous


Assumptions may matter!

Figure: Densities of the Normal(0,1) and the Cauchy(0,0.79) distribution (horizontal axis: x from −8 to 8; vertical axis: density(x) from 0.0 to 0.4).


Assumptions may matter!

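Consider the sample mean $\bar{X}$.

If $X_1, \ldots, X_n$ are i.i.d. $\mathcal{N}(\mu, 1)$ (normally distributed), then

$$\bar{X} \sim \mathcal{N}\left(\mu, \tfrac{1}{n}\right)$$

Learning from the sample: with increasing sample size, the variance of $\bar{X}$ decreases.

If $X_1, \ldots, X_n$ are i.i.d. $\mathcal{C}(\mu, 1)$ (Cauchy-distributed), then

$$\bar{X} \sim \mathcal{C}(\mu, 1)$$

The distribution does not depend on $n$; no learning via the sample mean is possible.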
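The contrast between the two cases can be checked by simulation. The following is a minimal sketch, not part of the original slides: the function names, the number of replications and the use of the interquartile range (IQR) as a spread measure are choices made here, since the Cauchy distribution has no variance to compare.

```python
import math
import random
import statistics


def sample_mean_iqr(draw, n, replications=2000, seed=0):
    """Interquartile range of the sample mean over repeated samples of size n."""
    rng = random.Random(seed)
    means = [
        sum(draw(rng) for _ in range(n)) / n
        for _ in range(replications)
    ]
    q1, _, q3 = statistics.quantiles(means, n=4)
    return q3 - q1


def draw_normal(rng, mu=0.0):
    # Normal(mu, 1), as in the example.
    return rng.gauss(mu, 1.0)


def draw_cauchy(rng, mu=0.0):
    # Inverse-CDF sampling for the Cauchy(mu, 1) distribution.
    return mu + math.tan(math.pi * (rng.random() - 0.5))


if __name__ == "__main__":
    for n in (1, 10, 100):
        print(n,
              round(sample_mean_iqr(draw_normal, n), 3),
              round(sample_mean_iqr(draw_cauchy, n), 3))
```

Under the normal model the IQR of the sample mean should shrink roughly like 1/sqrt(n); under the Cauchy model it should stay close to 2 (the IQR of a Cauchy distribution with scale 1) for every n.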


Assumptions may matter! robustness

many optimal procedures show very bad properties under minimal deviations from the ideal model

instead of $f(x \,\|\, \vartheta)$: model "approximately $f(x \,\|\, \vartheta)$", i.e. consider all distributions "close to $f(x \,\|\, \vartheta)$" → neighbourhood models
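One common way to make "close to" concrete is the ε-contamination neighbourhood, the set of all mixtures (1 − ε)·P0 + ε·Q with Q arbitrary. The slides do not say which neighbourhood is meant, so this choice, the central model N(0,1) and the function names are assumptions of this sketch. For an event that is neither empty nor the whole sample space, the lower and upper probabilities follow by placing the contaminating mass entirely outside or entirely inside the event.

```python
import math


def normal_upper_tail(x):
    """P0(X > x) for the central model P0 = N(0, 1)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def contamination_bounds(p0_event, eps):
    """Lower and upper probability of an event A (neither empty nor the whole
    sample space) over the set {(1 - eps) * P0 + eps * Q : Q arbitrary}."""
    if not 0.0 <= eps <= 1.0:
        raise ValueError("eps must lie in [0, 1]")
    if not 0.0 <= p0_event <= 1.0:
        raise ValueError("p0_event must be a probability")
    lower = (1.0 - eps) * p0_event        # contaminating mass placed outside A
    upper = (1.0 - eps) * p0_event + eps  # contaminating mass placed inside A
    return lower, upper


if __name__ == "__main__":
    # Probability of the event {X > 2} under 5% contamination.
    print(contamination_bounds(normal_upper_tail(2.0), eps=0.05))
```

The width of the resulting probability interval equals ε, whatever the event, which illustrates how even a small neighbourhood can dominate a small tail probability.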


Huber & Strassen approach

Huber & Strassen (1973, AnnStat): globally least favorable pairs for optimal Neyman-Pearson testing between two-monotone capacities; surveyed, e.g., in Augustin, Walter & Coolen (2014, Intro IP, Wiley)

* applicable to most neighbourhood models of precise probabilities

* extension to neighbourhood models of many IP models

* construction procedures

* going beyond two-monotonicity
  ▸ parametrically constructed models
  ▸ locally least favorable pairs

3. Imprecise Priors: Prior Data-Conflict

Bayesian inference with sets of priors I: ignorance

so-called ’noninformative priors’ do contain information

consider the set of all (non-degenerate) distributions instead, e.g., Walley (1996, JRSSB), Benavoli & Zaffalon (2012, JSPI)
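A well-known instance of this idea for categorical data is the imprecise Dirichlet model (IDM) of Walley (1996), which uses the set of all Dirichlet priors with a fixed total prior strength s. The sketch below computes the resulting lower and upper posterior predictive probabilities of a category; treating the IDM as the illustration, the function name and the default s = 2 are assumptions made here.

```python
def idm_bounds(count, total, s=2.0):
    """Lower and upper posterior probability that the next observation falls
    in a category seen `count` times among `total` observations, under the
    imprecise Dirichlet model with hyperparameter s > 0."""
    if not 0 <= count <= total:
        raise ValueError("need 0 <= count <= total")
    if s <= 0:
        raise ValueError("s must be positive")
    lower = count / (total + s)
    upper = (count + s) / (total + s)
    return lower, upper


if __name__ == "__main__":
    print(idm_bounds(0, 0))    # before any data: the vacuous interval
    print(idm_bounds(3, 10))   # after 3 occurrences in 10 observations
```

Before any data are seen the interval is vacuous (0 to 1), and its width s / (total + s) shrinks as observations accumulate.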


Bayesian inference with sets of priors II: prior-data conflict

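Bayesian models are understood to express prior knowledge (or to "borrow strength")

what happens when this prior knowledge is wrong?

example: $X_1, \ldots, X_n$ i.i.d. data, $X_i \sim \mathcal{N}(\mu, \sigma_0^2)$

conjugate prior: $\mu \sim \mathcal{N}(\nu, \varrho^2)$, then the posterior parameters are

$$\nu' = \frac{\bar{x}\,\varrho^2 + \nu \cdot \frac{\sigma_0^2}{n}}{\varrho^2 + \frac{\sigma_0^2}{n}}, \qquad \varrho^{2\prime} = \frac{\varrho^2 \cdot \frac{\sigma_0^2}{n}}{\varrho^2 + \frac{\sigma_0^2}{n}}$$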


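let, for the sake of simplicity, $\varrho^2 = \frac{\sigma_0^2}{n}$, then

$$\hat{\mu} = \nu' = \frac{\bar{x} + \nu}{2} \qquad \text{and} \qquad \varrho^{2\prime} = \frac{\varrho^4}{2\varrho^2} = \frac{\varrho^2}{2}.$$

then, e.g., $\bar{x} = 0.9$ and $\nu = 1.1$, and $\bar{x} = -100$ and $\nu = 102$, lead to the same posterior distribution (equal mean $\nu' = 1$ and equal variance)

general effect for canonical exponential families

much more intuitive behaviour when prior parameters are imprecise, e.g., are interval-valued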
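The insensitivity of the precise conjugate update to prior-data conflict can be reproduced numerically. This is a small sketch of the normal update with known variance; the values n = 4 and sigma0_sq = 1 are hypothetical and chosen only so that the prior variance equals sigma0_sq / n, as in the simplification above.

```python
def normal_posterior(xbar, n, sigma0_sq, nu, rho_sq):
    """Posterior mean and variance of mu for N(mu, sigma0_sq) data and a
    N(nu, rho_sq) prior (conjugate update with known variance)."""
    data_var = sigma0_sq / n
    post_mean = (xbar * rho_sq + nu * data_var) / (rho_sq + data_var)
    post_var = (rho_sq * data_var) / (rho_sq + data_var)
    return post_mean, post_var


# Hypothetical sample size and variance; rho_sq is set to sigma0_sq / n.
n, sigma0_sq = 4, 1.0
rho_sq = sigma0_sq / n

agreement = normal_posterior(0.9, n, sigma0_sq, 1.1, rho_sq)
conflict = normal_posterior(-100.0, n, sigma0_sq, 102.0, rho_sq)

if __name__ == "__main__":
    print(agreement, conflict)
```

Both calls give posterior mean 1 and posterior variance rho_sq / 2, although the second scenario combines a prior and a sample that contradict each other strongly.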

Figure (six panels in two rows; horizontal axis: n from 0 to 100, vertical axis from 0 to 10). Panel titles in each row: y(0) = 3.5, n(0) = 5; y(0) ∈ [3;4], n(0) = 5; y(0) ∈ [3;4], n(0) ∈ [1;25].

Source: Walter & Augustin (2009, JStTheorPract, p. 268)
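The prior specifications in the panel titles use a location parameter y(0) and a prior strength n(0). A sketch of how interval-valued choices propagate through the update follows, assuming the standard canonical conjugate update y(n) = (n(0)·y(0) + n·x̄) / (n(0) + n). The sample size n = 10 and the two sample means are hypothetical values chosen here, not numbers from the source.

```python
from itertools import product


def update_y(y0, n0, xbar, n):
    """Canonical conjugate update of the prior location parameter y0 with
    prior strength n0, given a sample mean xbar of n observations."""
    return (n0 * y0 + n * xbar) / (n0 + n)


def posterior_y_range(y0_bounds, n0_bounds, xbar, n):
    """Range of the updated parameter over a rectangular prior parameter set.

    The update is monotone in y0 for fixed n0 and monotone in n0 for fixed
    y0, so the extremes are attained at the corners of the rectangle."""
    values = [
        update_y(y0, n0, xbar, n)
        for y0, n0 in product(y0_bounds, n0_bounds)
    ]
    return min(values), max(values)


if __name__ == "__main__":
    # Hypothetical data: one sample agreeing with the prior, one in conflict.
    for xbar in (3.5, 8.0):
        print(xbar,
              posterior_y_range((3, 4), (5, 5), xbar, n=10),
              posterior_y_range((3, 4), (1, 25), xbar, n=10))
```

With a precise n(0) the width of the posterior interval is the same for both samples; only when n(0) is interval-valued as well does the interval widen under prior-data conflict.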

4. Imprecise Observations: Ontic View

Ontic imprecision: example

Plass, Fink, Schöning & Augustin (2015, ISIPTA)

pre-election study (GLES 2013: German Longitudinal Election Study)

a considerable number of voters are still undecided, but mostly only between two or three parties

these voters constitute different subgroups of their own with specific characteristics (which have to be neglected in the traditional analysis)

here, for the moment, NO forecast is aimed at; instead, analysis of individual preferences as they are


Ontic imprecision: modelling idea

"precise observations of something imprecise"

modelled by random conjunctive sets

change sample space 𝒮 = {CD,SPD,Green,Left, . . .} into 𝒮∗ ⊂ 𝒫(𝒮)

observations are precise observations in 𝒮∗ and can be treated like traditional categorical data

whole statistical modelling framework can be applied, here logistic regression

for each non-empty element of 𝒮∗: a vector of regression coefficients
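The recoding step can be sketched in a few lines: each set-valued answer becomes one precise category of the enlarged sample space, after which ordinary categorical tools apply. The responses below are invented for illustration, and the sketch stops at relative frequencies rather than fitting the logistic regression mentioned above.

```python
from collections import Counter

# Hypothetical survey answers: each respondent names the set of parties
# still under consideration (a single party if already decided).
responses = [
    {"CD"}, {"SPD"}, {"CD", "SPD"}, {"Green", "SPD"},
    {"CD"}, {"Green", "SPD"}, {"Left"}, {"CD", "SPD"},
]


def to_category(answer):
    """Map a set-valued answer to one precise category of the new sample space."""
    return frozenset(answer)


def category_label(category):
    """Readable, order-independent label such as 'CD/SPD'."""
    return "/".join(sorted(category))


counts = Counter(to_category(r) for r in responses)
relative_frequencies = {
    category_label(c): k / len(responses) for c, k in counts.items()
}

if __name__ == "__main__":
    for label, freq in sorted(relative_frequencies.items()):
        print(label, freq)
```

Using `frozenset` makes the undecided group "CD/SPD" a category in its own right, distinct from both "CD" and "SPD", which is the point of the ontic view.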


Ontic imprecision: example, Plass et al (2015, Table 4)

5. Imprecise Observations: Epistemic View

Epistemic data imprecision

imprecise observations of something precise

missing data (refusals, treatment design)

data protection

data merging with partially overlapping categories

secondary data analysis

forecasts derived from set-valued (ontic) observations

refined responses of primary refusals, typically coarsening/missing not at random


Spinney of deficiencies


The two-layers perspective

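[Diagram: two layers. Ideal layer: ideal Y_i and ideal X_i, linked by "effects". Observable layer: observable Y*_i and observable X*_i (the data), each linked to its ideal counterpart by a deficiency model. Inference is drawn from the data.]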


Traditional treatment of deficiencies

model the deficiency process!

characterize situations where the deficiency may be ignored or when one can correct for it!

but typically very restrictive (often untestable) assumptions are needed to ensure identifiability, i.e., a precise solution


For instance, in measurement error models (“classical model of testing theory”):

measurement error model must be known precisely

– type of error, especially assumptions on (conditional) independence

– independence of true value

– independence of other covariates

– independence of other measurements

– type of error distribution

– moments of error distribution

validation studies typically not available


Interval data: example

German General Social Survey (ALLBUS) 2010: 2827 observations in total, approx. 2000 report personal income (30% missing). An additional 10% report only income brackets.

[Figure: histogram of reported income; horizontal axis from 0 to 8000, vertical axis: frequencies from 0 to 100.]


1 we see heaping at 1000 €, 2000 €, ..., less so at 500 €, 1500 €, ...

2 both heaping and grouping depend on the amount of income reported.

3 missingness (some 20% of the data) might as well depend on the amount of income.

Consequences:

1 missingness, grouping, and heaping can often be represented by intervals.

2 missingness, grouping, and heaping will rarely conform to the assumption of “coarsening at random” (CAR).

3 missingness, grouping, and heaping add an additional type of uncertainty apart from classical statistical uncertainty. This uncertainty can’t be decreased by sampling more data.
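The first and third consequences can be illustrated with the simplest statistic, the sample mean. Without any coarsening assumption, the set of sample means compatible with interval data runs from the mean of the lower end points to the mean of the upper end points. The income values below are invented, and representing a missing answer by the interval [0, 8000] is an assumption made only for this sketch.

```python
def mean_bounds(intervals):
    """Smallest and largest sample mean compatible with interval-valued data.

    Each observation is a pair (lower, upper); a precise value v is (v, v)."""
    if not intervals:
        raise ValueError("need at least one observation")
    for lo, hi in intervals:
        if lo > hi:
            raise ValueError("interval end points out of order")
    n = len(intervals)
    lower = sum(lo for lo, _ in intervals) / n
    upper = sum(hi for _, hi in intervals) / n
    return lower, upper


# Hypothetical incomes in euros: exact reports, brackets, one missing answer.
incomes = [
    (1200, 1200), (2000, 2000), (950, 950),  # exact reports
    (1000, 1500), (2000, 2500),              # income brackets
    (0, 8000),                               # missing, assumed range
]

if __name__ == "__main__":
    print(mean_bounds(incomes))
    print(mean_bounds(incomes * 10))  # ten times as much data of the same kind
```

Collecting ten times as many observations with the same mix of exact, bracketed and missing answers leaves the bounds unchanged, which is the sense in which this uncertainty does not decrease with sample size.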


Wrongly assuming CAR (binary data)

Source: Plass, Augustin, Cattaneo, Schollmeyer (2015, ISIPTA)

Figure: Relative bias of $\hat{p}_A$ if CAR is assumed ($p_A = 0.6$), shown by absolute value (legend from 0.2 to 0.6) and sign (−/+) over the coarsening parameters $q_{AB|A}$ (horizontal axis) and $q_{AB|B}$ (vertical axis), each ranging from 0.1 to 0.9.
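How such a bias arises can be sketched for binary data under assumptions made here: the true categories are A and B, a respondent with true value A gives the coarse answer "A or B" with probability `q_ab_given_a` (and analogously for B), and the CAR-based estimate treats coarse answers as uninformative, so that it converges to the share of A among the precise answers. Whether this is exactly the quantity plotted in the figure is not confirmed by the source.

```python
def car_relative_bias(p_a, q_ab_given_a, q_ab_given_b):
    """Asymptotic relative bias of the CAR-based estimate of p_a."""
    obs_a = p_a * (1.0 - q_ab_given_a)          # share of precise answers 'A'
    obs_b = (1.0 - p_a) * (1.0 - q_ab_given_b)  # share of precise answers 'B'
    if obs_a + obs_b == 0.0:
        raise ValueError("no precise answers: estimate undefined")
    car_limit = obs_a / (obs_a + obs_b)
    return (car_limit - p_a) / p_a


if __name__ == "__main__":
    print(car_relative_bias(0.6, 0.3, 0.3))  # equal coarsening: CAR holds
    print(car_relative_bias(0.6, 0.1, 0.9))  # B coarsened more often
    print(car_relative_bias(0.6, 0.9, 0.1))  # A coarsened more often
```

The bias vanishes when both coarsening probabilities coincide and changes sign depending on which true category is coarsened more often.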


Manski’s Law of Decreasing Credibility

Reliability !? Credibility ?

"The credibility of inference decreases with the strength of the assumptions maintained." (Manski (2003, p. 1))


Reliable inference instead of overprecision!!

Consequences to be drawn from the Law of Decreasing Credibility:

adding untenable assumptions to produce a precise solution may destroy the credibility of the statistical analysis, and therefore its relevance for the subject matter questions.

make realistic assumptions and let the data speak for themselves!

the results may be imprecise, but are more reliable

the extent of imprecision is related to the data quality!

as a welcome by-product: clarification of the implication of certain assumptions

often still sufficient to answer the subject matter question


Much IP work on epistemic data imprecision, e.g.,

de Cooman & Zaffalon (2004, AI), Zaffalon & Miranda (2009, JAIR)

Utkin & Augustin (2007, IJAR), Troffaes & Coolen (2009, IJAR)

Utkin & Coolen (2011, ISIPTA)

Cattaneo & Wiencierz (2012, IJAR)

Schollmeyer & Augustin (2015, IJAR)

Denoeux (2014, IJAR)


Relation to work in econometrics, biometrics and engineering

partial identification: e.g., Manski (2003, Springer), Tamer (2010, Annu Rev Econ)

systematic sensitivity analysis: e.g., Vansteelandt, Goetghebeur, Kenward, Molenberghs (2006, Stat. Sinica)

reliable computing, interval computation: e.g., Ferson et al. (2007, Sandia TR), Nguyen, Kreinovich, Wu & Xiang (2011, Springer)


Recent likelihood approach

Plass, Augustin, Cattaneo, Schollmeyer (2015, ISIPTA)

utilize invariance of likelihood under parameter transformation

observable part: set-valued observations, parameter $\vartheta$, maximum likelihood estimator $\hat{\vartheta}$

latent part: parameter of interest $\gamma$

related via observation model: expressed by mapping $\Phi$

set-valued maximum likelihood estimator $\hat{\Gamma} = \{\gamma \mid \Phi(\gamma) = \hat{\vartheta}\}$

application also to some basic logistic regression models
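For binary data with one coarse category the set-valued estimator can be written down in closed form. The sketch assumes true categories A and B, observed categories 'A', 'B' and 'A or B', and latent parameters (p_A, q_ab_given_a, q_ab_given_b) with unrestricted coarsening probabilities in [0, 1]. This parametrization and the function name are illustrative choices, not taken from the cited paper.

```python
def set_valued_mle(n_a, n_b, n_ab):
    """Interval of all p_A compatible with the MLE of the observable part.

    n_a, n_b: counts of the precise answers 'A' and 'B';
    n_ab: count of the coarse answer 'A or B'."""
    n = n_a + n_b + n_ab
    if n == 0:
        raise ValueError("need at least one observation")
    theta_a = n_a / n  # MLE of P(observe 'A')
    theta_b = n_b / n  # MLE of P(observe 'B')
    # Phi maps (p_A, q_ab_given_a, q_ab_given_b) to the observable pair
    # (p_A * (1 - q_ab_given_a), (1 - p_A) * (1 - q_ab_given_b)).
    # Phi(gamma) = theta_hat is solvable with both q's in [0, 1] exactly
    # when theta_a <= p_A <= 1 - theta_b.
    return theta_a, 1.0 - theta_b


if __name__ == "__main__":
    print(set_valued_mle(50, 30, 20))
    print(set_valued_mle(6, 4, 0))  # no coarse answers: a single point
```

The width of the interval equals the observed share of coarse answers, so the estimate collapses to the usual point estimate when nothing is coarsened.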


Estimating equations

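Generalizing from the linear case, suppose there is a consistent (score-) estimating equation for the ideal model $\{\mathcal{P}_\vartheta \mid \vartheta \in \Theta\}$, i.e.:

$$\forall \vartheta \in \Theta: \quad \mathbb{E}_\vartheta\big(\psi(X, Y; \vartheta)\big) = 0$$

With interval data, one gets a set of estimating equations, one for each random vector (selection) $(X, Y) \in (\mathfrak{X}, \mathfrak{Y})$:

$$\Psi(\mathfrak{X}, \mathfrak{Y}; \vartheta) := \{\psi(X, Y; \vartheta) \mid X \in \mathfrak{X}, Y \in \mathfrak{Y}\}$$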
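As a toy instance, consider regression through the origin with precise covariates and interval-valued responses, where the score function for one observation is x·(y − βx). This setting is a simplification chosen here for illustration. Each selection of responses yields one zero of the summed estimating equation, and collecting these zeros over all selections gives an interval for the slope.

```python
def slope_bounds(x, y_intervals):
    """Set of zeros of sum_i x_i * (y_i - beta * x_i) = 0 over all selections
    y_i in [lo_i, hi_i], for precise covariates x_i.

    For a fixed selection the zero is beta = sum(x_i * y_i) / sum(x_i ** 2),
    which is linear in every y_i, so the set of zeros is an interval whose
    end points are attained at interval end points."""
    if len(x) != len(y_intervals):
        raise ValueError("x and y_intervals must have the same length")
    sxx = sum(xi * xi for xi in x)
    if sxx == 0:
        raise ValueError("all covariates are zero: slope not identified")
    lower = sum(
        xi * (lo if xi >= 0 else hi) for xi, (lo, hi) in zip(x, y_intervals)
    ) / sxx
    upper = sum(
        xi * (hi if xi >= 0 else lo) for xi, (lo, hi) in zip(x, y_intervals)
    ) / sxx
    return lower, upper


if __name__ == "__main__":
    # Hypothetical data: the second response is only known up to an interval.
    print(slope_bounds([1, 2, 3], [(1, 1), (3, 5), (6, 6)]))
```

With interval-valued covariates as well, the zero is no longer linear in the selection and the resulting set generally needs numerical optimisation.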

6. Concluding Remarks: Outlook

Concluding remarks

law of decreasing credibility !

reliable use of information

set-valued analysis: imprecise data, imprecise models

imprecise but reliable results; often sufficient!

natural behaviour of imprecision!

use this actively in modelling!

towards a general framework for reliable analysis of non-idealized data!


Future directions

Popularize the defensive point of view

case studies, illustrating the power of imprecision

robust procedures for generalized linear models etc.

cautious data completion for generalized linear models etc.

(disc. with H. Rieder): for each result, complement the p-value routinely by a stability level: the smallest level of contamination where the result is no longer significant

Propagate the offensive view

case studies, illustrating the power of imprecisions

separation of variability (variance) from indeterminism

active modelling of ignorance

active modelling of conflicting/surprising information

active use of weak knowledge that cannot be used in the traditional setting


statisticians start to think from data → improve understanding of imprecise sampling models

▸ imprecise probabilities for the observables!
▸ generalized sampling theory: imprecise selection probabilities
▸ utilize variety of independence concepts (model slight dependence)
▸ develop methodology of estimation from imprecise sampling models

develop simulation techniques for imprecise probabilities

how to handle regression models?


develop heuristics, "semi-imprecise" methods

"IP should make life better or easier (or both)" (Frank Coolen)

develop direct methods
▸ leave the necessarily more complicated "set-of traditional model views"
▸ direct processing of information (e.g., statistics with desirable gambles?)


develop a methodology for statistical modelling with sets of models

▸ generalized linear models
▸ nonparametric regression models → smoothing
▸ variable selection
▸ realistic measurement error and random effect models
▸ importance of unbiased estimation equations
▸ evaluation / comparison of models with different levels of imprecision
