109
1 UNC, Stat & OR Standard Approach: Lee et al (1999): Formulate & Solve Optimization Major Challenge: Not Nested , () Nonnegative Matrix Factorization

1 UNC, Stat & OR Nonnegative Matrix Factorization

Embed Size (px)

Citation preview

Page 1: 1 UNC, Stat & OR Nonnegative Matrix Factorization

1

UNC, Stat & OR

Standard Approach:

Lee et al (1999):

Formulate & Solve Optimization

Major Challenge:

Not Nested, ()

Nonnegative Matrix Factorization

Page 2: 1 UNC, Stat & OR Nonnegative Matrix Factorization

2

UNC, Stat & OR

Standard NMF

But Note

Not Nested

No “Multi-scale”

Analysis

Possible (Scores Plot?!?)

Nonnegative Matrix Factorization

Page 3: 1 UNC, Stat & OR Nonnegative Matrix Factorization

3

UNC, Stat & OR

Same Toy

Data Set

Rank 1

Approx.

Properly

Nested

Nonnegative Nested Cone Analysis

Page 4: 1 UNC, Stat & OR Nonnegative Matrix Factorization

4

UNC, Stat & OR

5-d Toy Example Rank 3 NNCA Approx.

Current

Research:

How Many

Nonneg.

Basis El’ts

Needed?

Nonnegative Nested Cone Analysis

Page 5: 1 UNC, Stat & OR Nonnegative Matrix Factorization

5

UNC, Stat & OR

How generally applicable is

Backwards approach to PCA?

Potential Application: Principal Curves

Hastie & Stuetzle, (1989)

(Foundation of Manifold Learning)

An Interesting Question

Page 6: 1 UNC, Stat & OR Nonnegative Matrix Factorization

6

UNC, Stat & OR

How generally applicable is

Backwards approach to PCA?

An Attractive Answer:

James Damon, UNC Mathematics

Key Idea: Express Backwards PCA as

Nested Series of Constraints

An Interesting Question

Page 7: 1 UNC, Stat & OR Nonnegative Matrix Factorization

7

UNC, Stat & OR

New Topic

Curve Registration

Page 8: 1 UNC, Stat & OR Nonnegative Matrix Factorization

8

UNC, Stat & OR

Context

Functional Data AnalysisCurves as Data Objects

Toy Example:

How Can WeUnderstandVariation?

Page 9: 1 UNC, Stat & OR Nonnegative Matrix Factorization

9

UNC, Stat & OR

Context

Functional Data AnalysisCurves as Data Objects

Toy Example:

How Can WeUnderstandVariation?

Page 10: 1 UNC, Stat & OR Nonnegative Matrix Factorization

10

UNC, Stat & OR

Functional Data Analysis

InsightfulDecomposition

Vertical Variation

Horiz’l Var’n

Page 11: 1 UNC, Stat & OR Nonnegative Matrix Factorization

11

UNC, Stat & OR

Challenge

Fairly Large Literature

Many (Diverse) Past Attempts

Limited Success (in General)

Surprisingly Slippery

(even mathematical formulation)

Page 12: 1 UNC, Stat & OR Nonnegative Matrix Factorization

12

UNC, Stat & OR

Challenge (Illustrated)

Thanks to Wei Wu

Page 13: 1 UNC, Stat & OR Nonnegative Matrix Factorization

13

UNC, Stat & OR

Challenge (Illustrated)

Thanks to Wei Wu

Page 14: 1 UNC, Stat & OR Nonnegative Matrix Factorization

14

UNC, Stat & OR

Functional Data Analysis

AppropriateMathematicalFramework? Vertical Variation

Horiz’l Var’n

Page 15: 1 UNC, Stat & OR Nonnegative Matrix Factorization

15

UNC, Stat & OR

Landmark Based Shape Analysis

Approach: Identify objects that are:

• Translations

• Rotations

• Scalings

of each other

Mathematics: Equivalence Relation

Results in: Equivalence Classes

Which become the Data Objects

Page 16: 1 UNC, Stat & OR Nonnegative Matrix Factorization

16

UNC, Stat & OR

Curve Registration

What are the Data Objects?

Consider “Time Warpings”

(smooth)

More Precisely: Diffeomorphisms

Page 17: 1 UNC, Stat & OR Nonnegative Matrix Factorization

17

UNC, Stat & OR

Time Warping Intuition

Elastically Stretch & Compress Axis

Page 18: 1 UNC, Stat & OR Nonnegative Matrix Factorization

18

UNC, Stat & OR

Curve Registration

Say curves and are equivalent,

When so that

Page 19: 1 UNC, Stat & OR Nonnegative Matrix Factorization

19

UNC, Stat & OR

Data Objects I

Equivalence Classes of Curves

(Set of AllWarps ofGiven Curve)

Notation: for a “representor”

Page 20: 1 UNC, Stat & OR Nonnegative Matrix Factorization

20

UNC, Stat & OR

Data Objects I

Equivalence Classes of Curves

(Set of AllWarps ofGiven Curve)

Next Task: Find Metric on Equivalence Classes

Page 21: 1 UNC, Stat & OR Nonnegative Matrix Factorization

21

UNC, Stat & OR

Metrics in Curve Space

Find Metric on Equivalence Classes

Start with Warp Invariance on Curves& Extend

Page 22: 1 UNC, Stat & OR Nonnegative Matrix Factorization

22

UNC, Stat & OR

Metrics in Curve Space

Traditional Approach to Curve

Registration:

• Align curves, say and

• By finding optimal time warp, , so:

• Vertical var’n: PCA after alignment

• Horizontal var’n: PCA on s

Page 23: 1 UNC, Stat & OR Nonnegative Matrix Factorization

23

UNC, Stat & OR

Metrics in Curve Space

Problem:

Don’t have proper metric

Since:

Because:

Page 24: 1 UNC, Stat & OR Nonnegative Matrix Factorization

24

UNC, Stat & OR

Metrics in Curve Space

Thanks toXiaosun Lu

Page 25: 1 UNC, Stat & OR Nonnegative Matrix Factorization

25

UNC, Stat & OR

Metrics in Curve Space

Note:VeryDifferentL2 norms

Thanks toXiaosun Lu

Page 26: 1 UNC, Stat & OR Nonnegative Matrix Factorization

26

UNC, Stat & OR

Metrics in Curve Space

Solution:

Look for Warp Invariant Metric

Where:

Page 27: 1 UNC, Stat & OR Nonnegative Matrix Factorization

27

UNC, Stat & OR

Metrics in Curve Space

I.e. Have “Parallel”

Representatives

Of Equivalence

Classes

Page 28: 1 UNC, Stat & OR Nonnegative Matrix Factorization

28

UNC, Stat & OR

Metrics in Curve Space

Warping Invariant Metric

Developed in context of:

Likelihood Geometry

Fisher – Rao Metric:

Page 29: 1 UNC, Stat & OR Nonnegative Matrix Factorization

29

UNC, Stat & OR

Metrics in Curve Space

Fisher – Rao Metric:

Computation Based on

Square Root Velocity Function (SRVF)

Page 30: 1 UNC, Stat & OR Nonnegative Matrix Factorization

30

UNC, Stat & OR

Metrics in Curve Space

Square Root Velocity Function (SRVF)

Page 31: 1 UNC, Stat & OR Nonnegative Matrix Factorization

31

UNC, Stat & OR

Metrics in Curve Space

Fisher – Rao Metric:

Computation Based on SRVF:

So work with SRVF

Page 32: 1 UNC, Stat & OR Nonnegative Matrix Factorization

32

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 33: 1 UNC, Stat & OR Nonnegative Matrix Factorization

33

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Thanks to

Xiaosun Lu

Page 34: 1 UNC, Stat & OR Nonnegative Matrix Factorization

34

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 35: 1 UNC, Stat & OR Nonnegative Matrix Factorization

35

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 36: 1 UNC, Stat & OR Nonnegative Matrix Factorization

36

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 37: 1 UNC, Stat & OR Nonnegative Matrix Factorization

37

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 38: 1 UNC, Stat & OR Nonnegative Matrix Factorization

38

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 39: 1 UNC, Stat & OR Nonnegative Matrix Factorization

39

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 40: 1 UNC, Stat & OR Nonnegative Matrix Factorization

40

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 41: 1 UNC, Stat & OR Nonnegative Matrix Factorization

41

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots?

Page 42: 1 UNC, Stat & OR Nonnegative Matrix Factorization

42

UNC, Stat & OR

Metrics in Curve Space

Why

square

roots? Dislikes Pinching

Focusses Well On

Peaks of Unequal Height

Page 43: 1 UNC, Stat & OR Nonnegative Matrix Factorization

43

UNC, Stat & OR

Metrics in Curve Space

Note on SRVF representation:

Can show: Warp Invariance

Follows from Jacobean calculation

Page 44: 1 UNC, Stat & OR Nonnegative Matrix Factorization

44

UNC, Stat & OR

Metrics in Curve Quotient Space

Above was Invariance for Individual

Curves

Now extend to:

Equivalence Classes of Curves

I.e. Orbits as Data Objects

I.e. Quotient Space

Page 45: 1 UNC, Stat & OR Nonnegative Matrix Factorization

45

UNC, Stat & OR

Metrics in Curve Quotient Space

Define Metric on Equivalence Classes:

For & , i.e. &

Independent of Choice of &

By Warping Invariance

Page 46: 1 UNC, Stat & OR Nonnegative Matrix Factorization

46

UNC, Stat & OR

Mean in Curve Quotient Space

Benefit of a Metric:

Allows Definition of a “Mean”

Fréchet Mean

Geodesic Mean

Barycenter

Karcher Mean

Page 47: 1 UNC, Stat & OR Nonnegative Matrix Factorization

47

UNC, Stat & OR

Mean in Curve Quotient Space

Given Equivalence Class Data Objects:

The Karcher Mean is:

Page 48: 1 UNC, Stat & OR Nonnegative Matrix Factorization

48

UNC, Stat & OR

Mean in Curve Quotient Space

The Karcher Mean is:

Intuition: Can Show, for Euclidean

Data

Minimizer = Conventional

Page 49: 1 UNC, Stat & OR Nonnegative Matrix Factorization

49

UNC, Stat & OR

Mean in Curve Quotient Space

Next Define “Most Representative”

Choice of

As Representer of

Page 50: 1 UNC, Stat & OR Nonnegative Matrix Factorization

50

UNC, Stat & OR

Mean in Curve Quotient Space

“Most Representative” in

Given a candidate

Consider warps to each

Choose to make

Karcher mean of warps = Identity

(under Fisher Rao metric)

Page 51: 1 UNC, Stat & OR Nonnegative Matrix Factorization

51

UNC, Stat & OR

Mean in Curve Quotient Space

“Most Representative” in

Thanks to Anuj Srivastava

Page 52: 1 UNC, Stat & OR Nonnegative Matrix Factorization

52

UNC, Stat & OR

Toy Example – (Details Later)

EstimatedWarps

(Note:RepresentedWith KarcherMean At Identity)

Page 53: 1 UNC, Stat & OR Nonnegative Matrix Factorization

53

UNC, Stat & OR

Mean in Curve Quotient Space

“Most Representative” in

Terminology: The “Template Mean”

Page 54: 1 UNC, Stat & OR Nonnegative Matrix Factorization

54

UNC, Stat & OR

More Data Objects

Final Curve Warps:

• Warp Each Data Curve,

• To Template Mean,

• Denote Warp Functions

Gives (Roughly Speaking):

Vertical Components

(Aligned Curves)

Horizontal Components

Page 55: 1 UNC, Stat & OR Nonnegative Matrix Factorization

55

UNC, Stat & OR

More Data Objects

Final Curve Warps:

• Warp Each Data Curve,

• To Template Mean,

• Denote Warp Functions

Gives (Roughly Speaking):

Vertical Components

(Aligned Curves)

Horizontal Components

Data Objects II

Page 56: 1 UNC, Stat & OR Nonnegative Matrix Factorization

56

UNC, Stat & OR

More Data Objects

Final Curve Warps:

• Warp Each Data Curve,

• To Template Mean,

• Denote Warp Functions

Gives (Roughly Speaking):

Vertical Components

(Aligned Curves)

Horizontal Components

Data Objects II

~ Kendall’s Shapes

Page 57: 1 UNC, Stat & OR Nonnegative Matrix Factorization

57

UNC, Stat & OR

More Data Objects

Final Curve Warps:

• Warp Each Data Curve,

• To Template Mean,

• Denote Warp Functions

Gives (Roughly Speaking):

Vertical Components

(Aligned Curves)

Horizontal Components

Data Objects III

Page 58: 1 UNC, Stat & OR Nonnegative Matrix Factorization

58

UNC, Stat & OR

More Data Objects

Final Curve Warps:

• Warp Each Data Curve,

• To Template Mean,

• Denote Warp Functions

Gives (Roughly Speaking):

Vertical Components

(Aligned Curves)

Horizontal Components

Data Objects III

~ Chang’s Transfo’s

Page 59: 1 UNC, Stat & OR Nonnegative Matrix Factorization

59

UNC, Stat & OR

Computation

Several Variations of

Dynamic Programming

Done by Eric Klassen, Wei Wu

Page 60: 1 UNC, Stat & OR Nonnegative Matrix Factorization

60

UNC, Stat & OR

Toy Example

Raw Data

Page 61: 1 UNC, Stat & OR Nonnegative Matrix Factorization

61

UNC, Stat & OR

Toy Example

Raw Data

BothHorizontalAndVerticalVariation

Page 62: 1 UNC, Stat & OR Nonnegative Matrix Factorization

62

UNC, Stat & OR

Toy Example

ConventionalPCAProjections

Page 63: 1 UNC, Stat & OR Nonnegative Matrix Factorization

63

UNC, Stat & OR

Toy Example

ConventionalPCAProjections

PowerSpreadAcrossSpectrum

Page 64: 1 UNC, Stat & OR Nonnegative Matrix Factorization

64

UNC, Stat & OR

Toy Example

ConventionalPCAProjections

PowerSpreadAcrossSpectrum

Page 65: 1 UNC, Stat & OR Nonnegative Matrix Factorization

65

UNC, Stat & OR

Toy Example

ConventionalPCAScores

Page 66: 1 UNC, Stat & OR Nonnegative Matrix Factorization

66

UNC, Stat & OR

Toy Example

ConventionalPCAScores

Views of1-d CurveBendingThrough4 Dim’ns’

Page 67: 1 UNC, Stat & OR Nonnegative Matrix Factorization

67

UNC, Stat & OR

Toy Example

ConventionalPCAScores

PatternsAre“Harmonics”In Scores

Page 68: 1 UNC, Stat & OR Nonnegative Matrix Factorization

68

UNC, Stat & OR

Toy Example

Scores PlotShows DataAre “1”Dimensional

So NeedImprovedPCA Decomp.

Page 69: 1 UNC, Stat & OR Nonnegative Matrix Factorization

69

UNC, Stat & OR

Visualization

Vertical Variation:

• PCA on Aligned Curves,

• Projected Curves

Page 70: 1 UNC, Stat & OR Nonnegative Matrix Factorization

70

UNC, Stat & OR

Toy Example

AlignedCurves

(Clear1-dVerticalVar’n)

Page 71: 1 UNC, Stat & OR Nonnegative Matrix Factorization

71

UNC, Stat & OR

Toy Example

AlignedCurvePCAProjections

All Var’nIn 1st

Component

Page 72: 1 UNC, Stat & OR Nonnegative Matrix Factorization

72

UNC, Stat & OR

Visualization

Horizontal Variation:

• PCA on Warps,

• Projected Curves

Page 73: 1 UNC, Stat & OR Nonnegative Matrix Factorization

73

UNC, Stat & OR

Toy Example

EstimatedWarps

Page 74: 1 UNC, Stat & OR Nonnegative Matrix Factorization

74

UNC, Stat & OR

Toy Example

Warps,PCProjections

Page 75: 1 UNC, Stat & OR Nonnegative Matrix Factorization

75

UNC, Stat & OR

Toy Example

Warps,PCProjections

Mostly1st PC

Page 76: 1 UNC, Stat & OR Nonnegative Matrix Factorization

76

UNC, Stat & OR

Toy Example

Warps,PCProjections

Mostly1st PC,But 2nd

Helps Some

Page 77: 1 UNC, Stat & OR Nonnegative Matrix Factorization

77

UNC, Stat & OR

Toy Example

Warps,PCProjections

Rest isNotImportant

Page 78: 1 UNC, Stat & OR Nonnegative Matrix Factorization

78

UNC, Stat & OR

Toy Example

Horizontal Var’n Visualization Challenge:

(Complicated) Warps Hard to Interpret

Approach:

Apply Warps to Template Mean

(PCA components)

Page 79: 1 UNC, Stat & OR Nonnegative Matrix Factorization

79

UNC, Stat & OR

Toy Example

WarpCompon’ts(+ Mean)Applied toTemplateMean

Page 80: 1 UNC, Stat & OR Nonnegative Matrix Factorization

80

UNC, Stat & OR

Refined Calculations

Current Development:

Better Exploit Manifold Data Space:

Principal Nested Spheres

Page 81: 1 UNC, Stat & OR Nonnegative Matrix Factorization

81

UNC, Stat & OR

PNS on SRVF Sphere

Toy Example

Tangent Space

PCA

(on Horiz. Var’n)

Thanks to Xiaosun Lu

Page 82: 1 UNC, Stat & OR Nonnegative Matrix Factorization

82

UNC, Stat & OR

PNS on SRVF Sphere

Toy Example

PNS Projections

(Fewer Modes)

Page 83: 1 UNC, Stat & OR Nonnegative Matrix Factorization

83

UNC, Stat & OR

PNS on SRVF Sphere

Toy Example

Tangent Space

PCA

Note: 3 Comp’s

Needed for This

Page 84: 1 UNC, Stat & OR Nonnegative Matrix Factorization

84

UNC, Stat & OR

PNS on SRVF Sphere

Toy Example

PNS Projections

Only 2 for This

Page 85: 1 UNC, Stat & OR Nonnegative Matrix Factorization

85

UNC, Stat & OR

TIC testbed

Serious Data Challenge:

TIC (Total Ion Count) Chromatograms

Modern type of “chemical spectra”

Thanks to

Peter

Hoffmann

Page 86: 1 UNC, Stat & OR Nonnegative Matrix Factorization

86

UNC, Stat & OR

TIC testbed

Raw Data: 15 TIC Curves (5 Colors)

Page 87: 1 UNC, Stat & OR Nonnegative Matrix Factorization

87

UNC, Stat & OR

TIC testbed

Special Feature: Answer Key of Known Peaks

Found by MajorTime &LaborInvestment

Page 88: 1 UNC, Stat & OR Nonnegative Matrix Factorization

88

UNC, Stat & OR

TIC testbed

Special Feature: Answer Key of Known Peaks

Goal:FindWarpsTo AlignThese

Page 89: 1 UNC, Stat & OR Nonnegative Matrix Factorization

89

UNC, Stat & OR

TIC testbed

Fisher – Rao Alignment

Page 90: 1 UNC, Stat & OR Nonnegative Matrix Factorization

90

UNC, Stat & OR

TIC testbed

Fisher – Rao Alignment

Spike-InPeaks

Page 91: 1 UNC, Stat & OR Nonnegative Matrix Factorization

91

UNC, Stat & OR

TIC testbed

Next Zoom in on This Region

Page 92: 1 UNC, Stat & OR Nonnegative Matrix Factorization

92

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Page 93: 1 UNC, Stat & OR Nonnegative Matrix Factorization

93

UNC, Stat & OR

TIC testbed

Before Alignment

Page 94: 1 UNC, Stat & OR Nonnegative Matrix Factorization

94

UNC, Stat & OR

TIC testbed

Next Zoom in on This Region

Page 95: 1 UNC, Stat & OR Nonnegative Matrix Factorization

95

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Page 96: 1 UNC, Stat & OR Nonnegative Matrix Factorization

96

UNC, Stat & OR

TIC testbed

Before Alignment

Page 97: 1 UNC, Stat & OR Nonnegative Matrix Factorization

97

UNC, Stat & OR

TIC testbed

Next Zoom in on This Region

Page 98: 1 UNC, Stat & OR Nonnegative Matrix Factorization

98

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Page 99: 1 UNC, Stat & OR Nonnegative Matrix Factorization

99

UNC, Stat & OR

TIC testbed

Before Fisher-Rao Alignment

Page 100: 1 UNC, Stat & OR Nonnegative Matrix Factorization

100

UNC, Stat & OR

TIC testbed

Next Zoom in on This Region

Page 101: 1 UNC, Stat & OR Nonnegative Matrix Factorization

101

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Page 102: 1 UNC, Stat & OR Nonnegative Matrix Factorization

102

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Note:VeryChallenging

Page 103: 1 UNC, Stat & OR Nonnegative Matrix Factorization

103

UNC, Stat & OR

TIC testbed

Before Alignment

Page 104: 1 UNC, Stat & OR Nonnegative Matrix Factorization

104

UNC, Stat & OR

TIC testbed

Next Zoom in on This Region

Page 105: 1 UNC, Stat & OR Nonnegative Matrix Factorization

105

UNC, Stat & OR

TIC testbed

Zoomed Fisher – Rao Alignment

Page 106: 1 UNC, Stat & OR Nonnegative Matrix Factorization

106

UNC, Stat & OR

TIC testbed

Before Alignment

Page 107: 1 UNC, Stat & OR Nonnegative Matrix Factorization

107

UNC, Stat & OR

TIC testbed

Warping Functions

Page 108: 1 UNC, Stat & OR Nonnegative Matrix Factorization

108

UNC, Stat & OR

References for Much More

Big Picture Survey:Marron, Ramsay, Sangalli & Srivastava (2014)

TIC Proteomics Example:Koch, Hoffman & Marron (2014)

Page 109: 1 UNC, Stat & OR Nonnegative Matrix Factorization

109

UNC, Stat & OR

Take Away Message

Curve Registration is Slippery Thus, Careful Mathematics is Useful Fisher-Rao Approach:

Gets the Math Right Intuitively Sensible Computable Generalizable Worth the Complication