72
Mitglied der Helmholtz-Gemeinschaft Skeptical View on AI Application in Science February 28, 2020 edrzej Rybicki

Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Mitg

lied

derH

elm

holtz

-Gem

eins

chaf

t

Skeptical View on AI Applicationin Science

February 28, 2020 Jedrzej Rybicki

Page 2: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Disclaimer

Opinions are mine not my employer

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 2 43

Page 3: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro

big successes of AI in recent years

... or is it just a big hype?

resulting funding opportunities in science

(lots of AI products on the market)

Skeptik

Skeptik but not denier. Critical thinking, seeing not only powers but also

limitations.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 3 43

Page 4: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro

big successes of AI in recent years

... or is it just a big hype?

resulting funding opportunities in science

(lots of AI products on the market)

Skeptik

Skeptik but not denier. Critical thinking, seeing not only powers but also

limitations.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 3 43

Page 5: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro

big successes of AI in recent years

... or is it just a big hype?

resulting funding opportunities in science

(lots of AI products on the market)

Skeptik

Skeptik but not denier. Critical thinking, seeing not only powers but also

limitations.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 3 43

Page 6: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro

big successes of AI in recent years

... or is it just a big hype?

resulting funding opportunities in science

(lots of AI products on the market)

Skeptik

Skeptik but not denier. Critical thinking, seeing not only powers but also

limitations.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 3 43

Page 7: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro: What are we talking about

Classical AI:

rules & heuristics

almost forgotten by now?

clearly limited when applied outside of its “domain”

reasoning

ML AI:

automatic algorithm creation (“getting computers to act without being

explicitly programmed” Andrew Ng)

data driven (data hungriness)

mostly Deep Learning

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 4 43

Page 8: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Intro: What are we talking about

Classical AI:

rules & heuristics

almost forgotten by now?

clearly limited when applied outside of its “domain”

reasoning

ML AI:

automatic algorithm creation (“getting computers to act without being

explicitly programmed” Andrew Ng)

data driven (data hungriness)

mostly Deep Learning

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 4 43

Page 9: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Science: Scientific method

1 empirical method of acquiring knowledge

2 develop a more sophisticated understanding over time (novelty)

3 replication, testable outcomes⇒ falsification

4 counterfactual situations

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Modeltraining

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 5 43

Page 10: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Science: Scientific method

1 empirical method of acquiring knowledge

2 develop a more sophisticated understanding over time (novelty)

3 replication, testable outcomes⇒ falsification

4 counterfactual situations

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Modeltraining

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 5 43

Page 11: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Science: Scientific method

1 empirical method of acquiring knowledge

2 develop a more sophisticated understanding over time (novelty)

3 replication, testable outcomes⇒ falsification

4 counterfactual situations

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Modeltraining

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 5 43

Page 12: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Science: Scientific method

1 empirical method of acquiring knowledge

2 develop a more sophisticated understanding over time (novelty)

3 replication, testable outcomes⇒ falsification

4 counterfactual situations

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Modeltraining

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 5 43

Page 13: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Model: What are we talking about

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 6 43

Page 14: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Model: Geometric View

Learning is optimization problem: minimize the error between model and

training set (Cost)

DL Model: chain of simple geometric continuous transformations

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 7 43

Page 15: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Deep Neural Networks

Input #1

Input #2

Input #3

Input #4

Output

z[1] = w [1]x + b[1]

a[1] = g[1](z[1])

z[2] = w [2]a[1] + b[2]

a[2] = g[2](z[2])

x

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 8 43

Page 16: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

DL Model: the transformation is chain of simple geometric continuous

transformations

model is a function

currently: continuous (which is already a limitation)

it makes mathematical sense outside of the domain

at best it can interpolate over the input

⇒ f(x) = x network by Gary Marcus (filling the gaps)

it is not programming

even simple task like sorting cannot be accomplished (efficiently)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 9 43

Page 17: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

DL Model: the transformation is chain of simple geometric continuous

transformations

model is a function

currently: continuous (which is already a limitation)

it makes mathematical sense outside of the domain

at best it can interpolate over the input

⇒ f(x) = x network by Gary Marcus (filling the gaps)

it is not programming

even simple task like sorting cannot be accomplished (efficiently)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 9 43

Page 18: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

DL Model: the transformation is chain of simple geometric continuous

transformations

model is a function

currently: continuous (which is already a limitation)

it makes mathematical sense outside of the domain

at best it can interpolate over the input

⇒ f(x) = x network by Gary Marcus (filling the gaps)

it is not programming

even simple task like sorting cannot be accomplished (efficiently)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 9 43

Page 19: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Special case: target domain can be set of (human) concepts

... but it does not mean that the model understands or uses the concepts

“superhuman” performance on ImageNet: what does it mean? (⇒overattribution)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 10 43

Page 20: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Special note: Reinforced learning

Agent Environment

Goal

observations

actions

Application criteria (based on Alpha-0):

1 huge combinational space

2 clear objective (function/metric)

3 data (or simulation)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 11 43

Page 21: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Special note: Reinforced learning

Agent Environment

Goal

observations

actions

Application criteria (based on Alpha-0):

1 huge combinational space

2 clear objective (function/metric)

3 data (or simulation)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 11 43

Page 22: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Reinforced learning

is this the way how we learn? we rather understand in

terms of things that we already understand Agent Environment

Goal

observations

actions

Alpha-Go

Alpha-0

universal framework that learn any game

⇒ Atari 2600 games

very good on Breakout

very bad on Montezuma’s Revenge

Breakout: unless you rotate the screen or even move paddle 2 pixels

higher

lots of anticipating but 0 understanding

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, Kansky et al.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 12 43

Page 23: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Reinforced learning

is this the way how we learn? we rather understand in

terms of things that we already understand Agent Environment

Goal

observations

actions

Alpha-Go

Alpha-0

universal framework that learn any game

⇒ Atari 2600 games

very good on Breakout

very bad on Montezuma’s Revenge

Breakout: unless you rotate the screen or even move paddle 2 pixels

higher

lots of anticipating but 0 understanding

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, Kansky et al.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 12 43

Page 24: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Reinforced learning

is this the way how we learn? we rather understand in

terms of things that we already understand Agent Environment

Goal

observations

actions

Alpha-Go

Alpha-0

universal framework that learn any game

⇒ Atari 2600 games

very good on Breakout

very bad on Montezuma’s Revenge

Breakout: unless you rotate the screen or even move paddle 2 pixels

higher

lots of anticipating but 0 understanding

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, Kansky et al.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 12 43

Page 25: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Reinforced learning

is this the way how we learn? we rather understand in

terms of things that we already understand Agent Environment

Goal

observations

actions

Alpha-Go

Alpha-0

universal framework that learn any game

⇒ Atari 2600 games

very good on Breakout

very bad on Montezuma’s Revenge

Breakout: unless you rotate the screen or even move paddle 2 pixels

higher

lots of anticipating but 0 understanding

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, Kansky et al.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 12 43

Page 26: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Reinforced learning

is this the way how we learn? we rather understand in

terms of things that we already understand Agent Environment

Goal

observations

actions

Alpha-Go

Alpha-0

universal framework that learn any game

⇒ Atari 2600 games

very good on Breakout

very bad on Montezuma’s Revenge

Breakout: unless you rotate the screen or even move paddle 2 pixels

higher

lots of anticipating but 0 understanding

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics, Kansky et al.February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 12 43

Page 27: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Training/Model Summary

despite the hype: very simple (limited?) idea

DL can do lots of interesting things

... but also completely unable to do others

performance is not understanding (image recognition)

brute force⇒ question of efficiency (ResNet18: 11689512 parameters.

Optimal configuration: a point in the 11689512 dimensional space.)

correlations between features rather than abstractions

trend of hard-coding domain knowledge into the neural networks

(Convolutional neural networks)

limited application outside of the domain

In principle, given infinite data, deep learning systems are powerful enough

to represent any finite deterministic “mapping" between any given set of

inputs and a set of corresponding outputs

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 13 43

Page 28: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Training/Model Summary

despite the hype: very simple (limited?) idea

DL can do lots of interesting things

... but also completely unable to do others

performance is not understanding (image recognition)

brute force⇒ question of efficiency (ResNet18: 11689512 parameters.

Optimal configuration: a point in the 11689512 dimensional space.)

correlations between features rather than abstractions

trend of hard-coding domain knowledge into the neural networks

(Convolutional neural networks)

limited application outside of the domain

In principle, given infinite data, deep learning systems are powerful enough

to represent any finite deterministic “mapping" between any given set of

inputs and a set of corresponding outputs

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 13 43

Page 29: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Training/Model Summary

despite the hype: very simple (limited?) idea

DL can do lots of interesting things

... but also completely unable to do others

performance is not understanding (image recognition)

brute force⇒ question of efficiency (ResNet18: 11689512 parameters.

Optimal configuration: a point in the 11689512 dimensional space.)

correlations between features rather than abstractions

trend of hard-coding domain knowledge into the neural networks

(Convolutional neural networks)

limited application outside of the domain

In principle, given infinite data, deep learning systems are powerful enough

to represent any finite deterministic “mapping" between any given set of

inputs and a set of corresponding outputs

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 13 43

Page 30: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Training/Model Summary

despite the hype: very simple (limited?) idea

DL can do lots of interesting things

... but also completely unable to do others

performance is not understanding (image recognition)

brute force⇒ question of efficiency (ResNet18: 11689512 parameters.

Optimal configuration: a point in the 11689512 dimensional space.)

correlations between features rather than abstractions

trend of hard-coding domain knowledge into the neural networks

(Convolutional neural networks)

limited application outside of the domain

In principle, given infinite data, deep learning systems are powerful enough

to represent any finite deterministic “mapping" between any given set of

inputs and a set of corresponding outputs

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 13 43

Page 31: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Training/Model Summary

despite the hype: very simple (limited?) idea

DL can do lots of interesting things

... but also completely unable to do others

performance is not understanding (image recognition)

brute force⇒ question of efficiency (ResNet18: 11689512 parameters.

Optimal configuration: a point in the 11689512 dimensional space.)

correlations between features rather than abstractions

trend of hard-coding domain knowledge into the neural networks

(Convolutional neural networks)

limited application outside of the domain

In principle, given infinite data, deep learning systems are powerful enough

to represent any finite deterministic “mapping" between any given set of

inputs and a set of corresponding outputs

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 13 43

Page 32: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Data:

1 Data deluge

2 Data hungriness

Ways of increasing size of data:

increasing number of rows

increasing number of columns

increasing density of rows

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 14 43

Page 33: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Data:

1 Data deluge

2 Data hungriness

Ways of increasing size of data:

increasing number of rows

increasing number of columns

increasing density of rows

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 14 43

Page 34: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Cautionary note: Quality vs. Quantity

1936 U.S. election: “Literary Digest” conducted huge poll with 2.3 million

voters: Alf Landon. George Gallup conducted a far smaller (but more

scientifically based) survey, correctly predicted Roosevelt’s victory.

statistics would say: better to have 5% random than 90% non-random

learning algorithm will not work (not enough iterations)

data from different sources

usually pre-processed⇒ have different probability distributions

hard to say what is representative

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 15 43

Page 35: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Cautionary note: Quality vs. Quantity

1936 U.S. election: “Literary Digest” conducted huge poll with 2.3 million

voters: Alf Landon. George Gallup conducted a far smaller (but more

scientifically based) survey, correctly predicted Roosevelt’s victory.

statistics would say: better to have 5% random than 90% non-random

learning algorithm will not work (not enough iterations)

data from different sources

usually pre-processed⇒ have different probability distributions

hard to say what is representative

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 15 43

Page 36: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Cautionary note: Quality vs. Quantity

1936 U.S. election: “Literary Digest” conducted huge poll with 2.3 million

voters: Alf Landon. George Gallup conducted a far smaller (but more

scientifically based) survey, correctly predicted Roosevelt’s victory.

statistics would say: better to have 5% random than 90% non-random

learning algorithm will not work (not enough iterations)

data from different sources

usually pre-processed⇒ have different probability distributions

hard to say what is representative

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 15 43

Page 37: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Old stories...

Nate Silver’s model... On election day.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 16 43

Page 38: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Old stories...

Nate Silver’s model... On election day.

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 16 43

Page 39: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: combining sources

Kidney stone treatment study

Treatment A Treatment B

Small stones 93% 87%

Large stones 73% 69%

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 17 43

Page 40: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: combining sources

Kidney stone treatment study

Treatment A Treatment B

Small stones 81/87 (93%) 234/270 (87%)

Large stones 192/263 (73%) 55/80 (69%)

Overall 273/350 (78%) 289/350 (83%)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 17 43

Page 41: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: combining sources

Kidney stone treatment study

Treatment A Treatment B

Small stones 81/87 (93%) 234/270 (87%)

Large stones 192/263 (73%) 55/80 (69%)

Overall 273/350 (78%) 289/350 (83%)

Simpson’s paradox

a trend appears in several different groups of data but disappears or reverses

when these groups are combined

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 17 43

Page 42: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 43: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 44: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 45: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 46: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 47: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

Ramsey theory

A branch of mathematics that studies the conditions under which order must

appear. (Wikipedia)

Example: the minimum number of guests that must be invited so that at least

m will know each other and at least n does not?

e1

e2

e3

e4

e5

e6 for a given graph size of 6

⇒ you will find a clique of 3!

pattern-finding!

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 18 43

Page 48: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

van der Waerden’s theorem

van der Waerden’s theorem is a theorem about the existence of arithmetic

progressions in sets. In a series of length W(r , k) r colors at least k form an

arithmetic progression.

Example: in a series of length W(r = 2, k = 3) ≥ 9

1 2 3 4 5 6 7 8 9

B R R B B R R B ?

B R R B B R R B B

B R R B B R R B R

Ramifications:

how big is the structure to find a given substructure

correlation is result of data size

complete disorder is not possible

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 19 43

Page 49: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

van der Waerden’s theorem

van der Waerden’s theorem is a theorem about the existence of arithmetic

progressions in sets. In a series of length W(r , k) r colors at least k form an

arithmetic progression.

Example: in a series of length W(r = 2, k = 3) ≥ 9

1 2 3 4 5 6 7 8 9

B R R B B R R B ?

B R R B B R R B B

B R R B B R R B R

Ramifications:

how big is the structure to find a given substructure

correlation is result of data size

complete disorder is not possible

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 19 43

Page 50: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

van der Waerden’s theorem

van der Waerden’s theorem is a theorem about the existence of arithmetic

progressions in sets. In a series of length W(r , k) r colors at least k form an

arithmetic progression.

Example: in a series of length W(r = 2, k = 3) ≥ 9

1 2 3 4 5 6 7 8 9

B R R B B R R B ?

B R R B B R R B B

B R R B B R R B R

Ramifications:

how big is the structure to find a given substructure

correlation is result of data size

complete disorder is not possible

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 19 43

Page 51: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of rows

van der Waerden’s theorem

van der Waerden’s theorem is a theorem about the existence of arithmetic

progressions in sets. In a series of length W(r , k) r colors at least k form an

arithmetic progression.

Example: in a series of length W(r = 2, k = 3) ≥ 9

1 2 3 4 5 6 7 8 9

B R R B B R R B ?

B R R B B R R B B

B R R B B R R B R

Ramifications:

how big is the structure to find a given substructure

correlation is result of data size

complete disorder is not possibleFebruary 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 19 43

Page 52: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of columns

Data Leakage

Data leakage is when information from outside the training dataset is used to

create the model.

Examples:

“it rains on rainy days”

IBM training set

normalize or standardize your entire dataset

⇒ data rescaling process that you performed had knowledge of the full

distribution of data

Results:

overestimation of model’s performance

reversing an anonymization and obfuscation (sensitive data)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 20 43

Page 53: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of columns

Data Leakage

Data leakage is when information from outside the training dataset is used to

create the model.

Examples:

“it rains on rainy days”

IBM training set

normalize or standardize your entire dataset

⇒ data rescaling process that you performed had knowledge of the full

distribution of data

Results:

overestimation of model’s performance

reversing an anonymization and obfuscation (sensitive data)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 20 43

Page 54: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of columns

Data Leakage

Data leakage is when information from outside the training dataset is used to

create the model.

Examples:

“it rains on rainy days”

IBM training set

normalize or standardize your entire dataset

⇒ data rescaling process that you performed had knowledge of the full

distribution of data

Results:

overestimation of model’s performance

reversing an anonymization and obfuscation (sensitive data)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 20 43

Page 55: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of columns

Data Leakage

Data leakage is when information from outside the training dataset is used to

create the model.

Examples:

“it rains on rainy days”

IBM training set

normalize or standardize your entire dataset

⇒ data rescaling process that you performed had knowledge of the full

distribution of data

Results:

overestimation of model’s performance

reversing an anonymization and obfuscation (sensitive data)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 20 43

Page 56: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: increasing number of columns

Data Leakage

Data leakage is when information from outside the training dataset is used to

create the model.

Examples:

“it rains on rainy days”

IBM training set

normalize or standardize your entire dataset

⇒ data rescaling process that you performed had knowledge of the full

distribution of data

Results:

overestimation of model’s performance

reversing an anonymization and obfuscation (sensitive data)

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 20 43

Page 57: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: Summary

often you require lots of data to create complex model

or you are overloaded with the data anyways

data collection might be harder than you think (end-to-end control)

danger of emerging patterns (Ramsey & van der Waerden)

The Curse of Dimensionality

long-tail problem (things that don’t happen so often)

Illusion of Invariants: Data that span several order of magnitude leads to

high R2 and makes invariants notable.

The Illusion of Invariant Quantities in Life Histories Sean Nee, Nick Colegrave, Stuart A. West, Alan

Grafen

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 21 43

Page 58: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Data: Summary

often you require lots of data to create complex model

or you are overloaded with the data anyways

data collection might be harder than you think (end-to-end control)

danger of emerging patterns (Ramsey & van der Waerden)

The Curse of Dimensionality

long-tail problem (things that don’t happen so often)

Illusion of Invariants: Data that span several order of magnitude leads to

high R2 and makes invariants notable.

The Illusion of Invariant Quantities in Life Histories Sean Nee, Nick Colegrave, Stuart A. West, Alan

Grafen

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 21 43

Page 59: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Deduction

Observation Hypothesis Verificationinduction deduction

Data Model Insighttraining explainability

Crucial for:

replication, testable outcomes (trust)

falsification

novelty (looking into the black box for new insights)

counterfactual situations

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 22 43

Page 60: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

LIME: Local Interpretable Model-Agnostic Explanations

“Why Should I Trust You?" Explaining the Predictions of Any Classifier Marco Tulio Ribeiro, Sameer

Singh, and Carlos Guestrin

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 23 43

Page 61: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

SHAP: SHapley Additive exPlanations

LIME:

generate artificial points around an observation

local approximation by Linear Regression

SHAP:

“generalization" of LIME

local explainer is not LR

more sophisticated model types for explainer

Data Complex Model

Explainer

Prediction

Explaination

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 24 43

Page 62: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Adversarial Examples

“Explaining and Harnessing Adversarial Examples" Ian J. Goodfellow, Jonathon Shlens, Christian

Szegedy

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 25 43

Page 63: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Adversarial Examples

“Synthesizing Robust Adversarial Examples" Anish Athalye, Logan Engstrom, Andrew Ilyas, Kevin Kwok

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 25 43

Page 64: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Deduction

Summary:

ML model is valid outside of the input

⇒ but often does not make much sense

Current approaches to explainability: not really deduction

⇒ simplyfing “transformations” for single point

Neural networks can be tricked

⇒ worrisome and shows how much “intuition" people have

DL models can even be better than e.g., random forest

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 26 43

Page 65: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

AI stories I

Google flu trends:

2008: paper in Nature claiming to beat Centers for Disease Control and

Prevention

2013: misses the peak of the flu season by 140 percent

⇒ overfitting and missing changes in search behavior over time

side note: data

IBM Watson for Oncology:

data from doctor’s notes (leakage, non-representative?), medical studies

and clinical guidelines

treatment recommendations are based on training by human overseers

“through AI, [...] generate new insights and identify,[...] new approaches

to cancer care"

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 27 43

Page 66: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

AI stories II

⇒ Cancelled after unsafe treatment recommendations

it is much easier to make prediction than suggest an action to change the

outcome (counterfactual)

side note: no scientific papers demonstrating how the technology affects

physicians and patients

Kitano “Artificial Intelligence to Win the Nobel Prize and Beyond" (2016)

human cognitive limitations

1 mln papers/year, some contradictory, inaccurate (partly language

problem)

explosion of experimental data

hope of getting rid of bias

discovery is beyond current knowledge

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 28 43

Page 67: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

AI stories III

⇒ hypothesis generation and verification (robotics)

Playing games:

is it really a proxy for intelligence?

shown that if you play for 200 years your are better

Recommendation systems:

successful on manipulating you to buy more things

cannot explain, reason, convince you

Autonomous cars:

failure of Volvo, ... and Tesla?

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 29 43

Page 68: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Summary

DL models are just transformations: overattribution

DL not able to generalize, explain, fill the gaps. Do not resemble scientific

approach.

Limitations hidden in data (hinder creation of really large data sets)

⇒ Complex vs. simpler models (Model-free, policy-based learning to help?)

limits of (inefficient) “just learning from data”⇒ reasoning

Dangers:

over-hype

trivialization of science: moving to problems that can be tackled with AI

(recipe vs. understand) vs. trivialization of AI

what we learn from solutions?

already dealing with a replication crisis (black box models, questionable

reproducibility, limited explainability, and lack of uncertainty quantification)

technical sustainability of brute-force-based progress

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 30 43

Page 69: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Summary

DL models are just transformations: overattribution

DL not able to generalize, explain, fill the gaps. Do not resemble scientific

approach.

Limitations hidden in data (hinder creation of really large data sets)

⇒ Complex vs. simpler models (Model-free, policy-based learning to help?)

limits of (inefficient) “just learning from data”⇒ reasoning

Dangers:

over-hype

trivialization of science: moving to problems that can be tackled with AI

(recipe vs. understand)

vs. trivialization of AI

what we learn from solutions?

already dealing with a replication crisis (black box models, questionable

reproducibility, limited explainability, and lack of uncertainty quantification)

technical sustainability of brute-force-based progress

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 30 43

Page 70: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Summary

DL models are just transformations: overattribution

DL not able to generalize, explain, fill the gaps. Do not resemble scientific

approach.

Limitations hidden in data (hinder creation of really large data sets)

⇒ Complex vs. simpler models (Model-free, policy-based learning to help?)

limits of (inefficient) “just learning from data”⇒ reasoning

Dangers:

over-hype

trivialization of science: moving to problems that can be tackled with AI

(recipe vs. understand) vs. trivialization of AI

what we learn from solutions?

already dealing with a replication crisis (black box models, questionable

reproducibility, limited explainability, and lack of uncertainty quantification)

technical sustainability of brute-force-based progress

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 30 43

Page 71: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

Summary

DL models are just transformations: overattribution

DL not able to generalize, explain, fill the gaps. Do not resemble scientific

approach.

Limitations hidden in data (hinder creation of really large data sets)

⇒ Complex vs. simpler models (Model-free, policy-based learning to help?)

limits of (inefficient) “just learning from data”⇒ reasoning

Dangers:

over-hype

trivialization of science: moving to problems that can be tackled with AI

(recipe vs. understand) vs. trivialization of AI

what we learn from solutions?

already dealing with a replication crisis (black box models, questionable

reproducibility, limited explainability, and lack of uncertainty quantification)

technical sustainability of brute-force-based progress

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 30 43

Page 72: Skeptical View on AI Application in Science€¦ · Skeptical View on AI Application in Science February 28, 2020 Jedrz˛ ej Rybicki. Disclaimer ... February 28, 2020Jedrz˛ ej RybickiSkeptical

[email protected]

February 28, 2020 Jedrzej Rybicki Skeptical View on AI Application in Science 31 43