
Sparse optimization and machine learning in image reconstruction

Aleksandra Pižurica

Group for Artificial Intelligence and Sparse Modelling (GAIM)
Department of Telecommunications and Information Processing
Ghent University

2nd Annual Meeting

B-Q MINDED

Siemens Healthineers, Belgium, January 29-30, 2020


Outline

1 Model-based iterative reconstruction algorithms
    Sparse optimization
    Solution strategies: greedy methods vs. convex optimization
    Optimization methods in sparse image reconstruction
2 Structured sparsity
    Wavelet-tree sparsity
    Markov Random Field (MRF) priors
3 Machine learning in image reconstruction
    Main ideas and current trends


A fairly general formulation

Reconstruct a signal (image) x ∈ X from data y ∈ Y, where

$$y = T(x) + n$$

X and Y are Hilbert spaces, T : X → Y is the forward operator, and n is noise.

A common model-driven approach is to minimize the negative log-likelihood L:

$$\min_{x \in X} L(T(x), y)$$

This is typically ill-posed and leads to over-fitting. Variational regularization, also called model-based iterative reconstruction, seeks to minimize a regularized objective function

$$\min_{x \in X} L(T(x), y) + \tau \phi(x)$$

where φ : X → R ∪ {−∞, +∞} is a regularization functional, and τ ≥ 0 governs the influence of the a priori knowledge against the need to fit the data.


Linear inverse problems

Many image reconstruction problems can be formulated as a linear inverse problem. A noisy indirect observation y of the original image x is then

$$y = Ax + n$$

Matrix A is the forward operator; x ∈ R^n and y, n ∈ R^m (or x ∈ C^n and y, n ∈ C^m). Here, image pixels are stacked into vectors (raster scanning). In general, m ≠ n.

Some examples:

    CT: A is the system matrix modeling the X-ray transform
    MRI: A is a (partially sampled) Fourier operator
    OCT: A is the first Born approximation for the scattering
    Compressed sensing: A is a measurement matrix (dense or sparse)
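As a concrete toy example, the CS-MRI forward operator can be simulated as a subsampled 2-D FFT. This is a minimal numpy sketch under illustrative assumptions (the mask density and noise level below are arbitrary, not from the slides):

import numpy as np

def forward_masked_fft(x, mask):
    # Toy CS-MRI forward operator: keep only the k-space samples where mask is True.
    return np.fft.fft2(x, norm="ortho")[mask]

rng = np.random.default_rng(0)
n = 64
x = rng.standard_normal((n, n))      # stand-in for an image
mask = rng.random((n, n)) < 0.15     # ~15% random k-space samples (illustrative)
y = forward_masked_fft(x, mask)      # y = Ax
y = y + 0.01 * (rng.standard_normal(y.shape) + 1j * rng.standard_normal(y.shape))  # + n
print(y.shape)                       # m complex measurements, m < n*n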


Linear inverse problems

For the linear inverse problem y = Ax + n, model-based reconstruction seeks to solve:

$$\min_x \tfrac{1}{2}\|Ax - y\|_2^2 + \tau \phi(x) \qquad \text{(Tikhonov formulation)}$$

Alternatively,

$$\min_x \phi(x) \;\text{ subject to }\; \|Ax - y\|_2^2 \le \varepsilon \qquad \text{(Morozov formulation)}$$

$$\min_x \|Ax - y\|_2^2 \;\text{ subject to }\; \phi(x) \le \delta \qquad \text{(Ivanov formulation)}$$

Under mild conditions, these are all equivalent [Figueiredo and Wright, 2013], and which one is more convenient is problem-dependent.



Sparse optimization

A common assumption: x is sparse in a well-chosen transform domain. We refer to a wavelet representation, meaning any wavelet-like multiscale representation, including curvelets and shearlets:

$$x = \Psi \theta, \qquad \theta \in \mathbb{R}^d, \quad \Psi \in \mathbb{R}^{n \times d}$$

The columns of Ψ are the elements of a wavelet frame (an orthogonal basis or an overcomplete dictionary).

The main results hold for learned dictionaries, trained on a set of representative examples to yield optimally sparse representations for a particular class of images.


Compressed sensing

Consider y = Ax + n, with y, n ∈ R^m, x = Ψθ, x ∈ R^n, θ ∈ R^d, m < n.

$$\hat{\theta} = \arg\min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \phi(\theta), \qquad \hat{x} = \Psi\hat{\theta}$$

Commonly:

$$\min_{\theta} \|\theta\|_0 \;\text{ s.t. }\; \|A\Psi\theta - y\|_2^2 \le \varepsilon \qquad \text{or} \qquad \min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \|\theta\|_1$$

[Candes et al., 2006], [Donoho, 2006], [Lustig et al., 2007]


Compressed sensing: recovery guarantees

Consider y = Ax + n, with y, n ∈ R^m, x = Ψθ, x ∈ R^n, m < n.

The matrix Φ = AΨ has the K-restricted isometry property (K-RIP) with constant ε_K < 1 if, for every K-sparse θ (having only K non-zero entries):

$$(1 - \varepsilon_K)\|\theta\|_2^2 \le \|\Phi\theta\|_2^2 \le (1 + \varepsilon_K)\|\theta\|_2^2$$

Suppose the matrix A ∈ R^{m×n} is formed by subsampling a given sampling operator A ∈ R^{n×n}. The mutual coherence between A and Ψ is

$$\mu(A, \Psi) = \max_{i,j} |a_i^{\top} \psi_j|$$

If m > C µ²(A, Ψ) K n log(n), for some constant C > 0, then

$$\min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \|\theta\|_1$$

recovers x with high probability, given that the K-RIP holds for Φ = AΨ.
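The mutual coherence is easy to evaluate numerically. A minimal numpy sketch (the random A and orthonormal Ψ below are illustrative assumptions; rows and columns are normalized so the inner products are true correlations):

import numpy as np

def mutual_coherence(A, Psi):
    # max_{i,j} |<a_i, psi_j>| over rows a_i of A and columns psi_j of Psi.
    Ar = A / np.linalg.norm(A, axis=1, keepdims=True)
    Pc = Psi / np.linalg.norm(Psi, axis=0, keepdims=True)
    return np.max(np.abs(Ar @ Pc))

rng = np.random.default_rng(1)
n = 128
A = rng.standard_normal((40, n))                     # toy measurement rows
Psi, _ = np.linalg.qr(rng.standard_normal((n, n)))   # random orthonormal basis
print(mutual_coherence(A, Psi))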


Analysis vs. synthesis formulation

Consider y = Ax + n, with y ∈ R^m, x = Ψθ, x ∈ R^n, θ ∈ R^d.

Synthesis approach:

$$\min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \phi(\theta)$$

or in a constrained form:

$$\min_{\theta} \phi(\theta) \;\text{ subject to }\; \|A\Psi\theta - y\|_2^2 \le \varepsilon$$

Analysis approach, in a form that also applies to wavelet regularization:

$$\min_{x} \tfrac{1}{2}\|Ax - y\|_2^2 + \tau \phi(Px)$$

or in a constrained form:

$$\min_{x} \phi(Px) \;\text{ subject to }\; \|Ax - y\|_2^2 \le \varepsilon$$

P: a wavelet transform operator, or P = I (standard analysis)




Solution strategies: greedy methods vs. convex optimization

The solution strategy is problem-dependent. For non-convex problems like

$$\min_x \|x\|_0 \;\text{ subject to }\; \|Ax - y\|_2^2 \le \varepsilon$$

greedy algorithms can be applied, e.g.,

    Matching Pursuit (MP) [Mallat and Zhang, 1993]
    OMP [Tropp, 2004], CoSaMP [Needell and Tropp, 2009]
    IHT [Blumensath and Davies, 2009]

or convex relaxation, leading to:

$$\min_x \|x\|_1 \;\text{ subject to }\; \|Ax - y\|_2^2 \le \varepsilon \qquad\text{or}\qquad \min_x \tfrac{1}{2}\|Ax - y\|_2^2 + \tau \|x\|_1$$

known as the LASSO [Tibshirani, 1996] or BPDN [Chen et al., 2001] problem.


Greedy methods: OMP

OMP algorithm for solving min_x ‖x‖_0 subject to Ax = y

Require: k = 1, r^(1) = y, Λ^(0) = ∅
1: repeat
2:   λ^(k) = arg max_j |A_j^T r^(k)|          identify the 'important' column of A
3:   Λ^(k) = Λ^(k−1) ∪ {λ^(k)}                augment the index set
4:   x^(k) = arg min_x ‖A_{Λ^(k)} x − y‖_2    solve the least-squares problem
5:   y^(k) = A_{Λ^(k)} x^(k)                  the portion of y explained by A_{Λ^(k)} x^(k)
6:   r^(k+1) = r^(k) − y^(k)                  update the residual by removing the explained portion of y
7:   k = k + 1
8: until stopping criterion satisfied

A_j is the j-th column of A, and A_Λ a sub-matrix of A with the columns indexed by Λ.

'Important' column = the one with maximum absolute correlation with the residual.
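The listing translates almost line by line into numpy. A sketch (the stopping rule via k_max and tol is an assumption):

import numpy as np

def omp(A, y, k_max, tol=1e-8):
    # Orthogonal Matching Pursuit for min ||x||_0 s.t. Ax = y (approximately).
    n = A.shape[1]
    r = y.copy()                                   # r(1) = y
    support = []                                   # Lambda(0) = empty set
    x_s = np.zeros(0)
    for _ in range(k_max):
        j = int(np.argmax(np.abs(A.T @ r)))        # most correlated column
        if j not in support:
            support.append(j)
        x_s = np.linalg.lstsq(A[:, support], y, rcond=None)[0]  # LS on the support
        r = y - A[:, support] @ x_s                # remove the explained portion of y
        if np.linalg.norm(r) < tol:
            break
    x = np.zeros(n)
    x[support] = x_s
    return x

# Usage: recover a 5-sparse vector from 60 Gaussian measurements.
rng = np.random.default_rng(2)
A = rng.standard_normal((60, 200)) / np.sqrt(60)
x_true = np.zeros(200)
x_true[rng.choice(200, size=5, replace=False)] = rng.standard_normal(5)
print(np.allclose(omp(A, A @ x_true, k_max=5), x_true))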



Proximal operator

Many state-of-the-art image reconstruction algorithms solve problems of the kind

$$\min_x \tfrac{1}{2}\|Ax - y\|_2^2 + \tau \phi(Px)$$

making use of the proximity operator, i.e., the Moreau proximal mapping [Combettes and Wajs, 2005]:

$$\mathrm{prox}_{\tau\phi}(y) = \arg\min_x \tfrac{1}{2}\|x - y\|_2^2 + \tau \phi(x)$$

For certain choices of φ(x), this operator has a closed form, e.g.,

$$\phi(x) = \|x\|_1 \;\rightarrow\; \mathrm{prox}_{\tau \ell_1}(y) = \mathrm{soft}(y, \tau) \qquad \text{component-wise soft thresholding}$$

$$\phi(x) = \|x\|_0 \;\rightarrow\; \mathrm{prox}_{\tau \ell_0}(y) = \mathrm{hard}(y, \sqrt{2\tau}) \qquad \text{component-wise hard thresholding}$$

Another common regularization function is total variation (TV):

$$\phi(x) = \|x\|_{TV} \;\rightarrow\; \mathrm{prox}_{\tau TV}(y) \qquad \text{Chambolle's algorithm [Chambolle, 2004]}$$
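The two closed-form mappings above are one-liners. A minimal sketch:

import numpy as np

def soft(y, t):
    # prox of t*||.||_1: component-wise soft thresholding.
    return np.sign(y) * np.maximum(np.abs(y) - t, 0.0)

def hard(y, t):
    # Component-wise hard thresholding at level t;
    # the prox of tau*||.||_0 corresponds to t = sqrt(2*tau).
    return np.where(np.abs(y) > t, y, 0.0)

tau = 0.5
y = np.array([-1.2, 0.3, 0.9, -0.4])
print(soft(y, tau))                 # prox of tau*||.||_1
print(hard(y, np.sqrt(2 * tau)))    # prox of tau*||.||_0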


Iterative shrinkage/thresholding (IST) and extensions

The standard algorithm for solving

$$\min_x \tfrac{1}{2}\|Ax - y\|_2^2 + \tau \phi(x)$$

is the iterative shrinkage/thresholding (IST) algorithm [Figueiredo and Nowak, 2003], [Daubechies et al., 2004]:

$$x^{(k+1)} = \mathrm{prox}_{\tau\phi}\Big( x^{(k)} - \tfrac{1}{\gamma} \underbrace{A^H (A x^{(k)} - y)}_{\text{gradient of the data-fidelity term}} \Big)$$

Its key ingredient is the proximity operator prox_{τφ}(y) = argmin_x ½‖x − y‖²₂ + τφ(x).

A general approach in [Daubechies et al., 2004]: φ(x) is a weighted ℓ_p norm of the coefficients of x with respect to a wavelet basis.

Different accelerated versions exist:

    TwIST [Bioucas-Dias and Figueiredo, 2007]
    FISTA [Beck and Teboulle, 2009]
    SpaRSA [Wright et al., 2009]
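A minimal numpy sketch of IST for the ℓ1 case. The prox of the scaled subproblem absorbs the step size, giving threshold τ/γ; setting γ from the spectral norm of A is an assumption that keeps the gradient step contractive:

import numpy as np

def ista(A, y, tau, n_iter=200):
    # IST for min 0.5*||Ax - y||_2^2 + tau*||x||_1.
    gamma = np.linalg.norm(A, 2) ** 2        # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)             # gradient of the data-fidelity term
        z = x - grad / gamma                 # gradient step
        x = np.sign(z) * np.maximum(np.abs(z) - tau / gamma, 0.0)  # soft-threshold prox
    return x

FISTA adds a momentum term on top of exactly this iteration; TwIST and SpaRSA modify the step in other ways.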


Variable splitting

A very old idea (going back at least to [Courant, 1943]): represent min_x f_1(x) + f_2(Gx) as

$$\min_{x,z} f_1(x) + f_2(z) \;\text{ subject to }\; Gx = z$$

The rationale: it may be easier to solve the constrained problem.

Variable splitting (VS) together with the augmented Lagrangian method (ALM) and nonlinear block Gauss-Seidel (NLBGS) leads to a form of the Alternating Direction Method of Multipliers (ADMM). It is this interpretation:

(VS + ALM + NLBGS) → ADMM

that we give in the next few slides, following [Afonso et al., 2010].


Variable splitting and Augmented Lagrangian Method

Represent min_x f_1(x) + f_2(Gx) as

$$\min_{x,z} f_1(x) + f_2(z) \;\text{ subject to }\; Gx = z$$

$$L_\mu(x, z, \lambda) = \underbrace{f_1(x) + f_2(z) + \lambda^{\top}(Gx - z)}_{\text{Lagrangian}} + \underbrace{\tfrac{\mu}{2}\|Gx - z\|_2^2}_{\text{``augmentation''}}$$

The basic augmented Lagrangian method (ALM), a.k.a. the method of multipliers (MM):

$$(x^{(k)}, z^{(k)}) = \arg\min_{(x,z)} L_\mu(x, z, \lambda^{(k-1)}), \qquad \lambda^{(k)} = \lambda^{(k-1)} + \mu (Gx^{(k)} - z^{(k)})$$

This goes back at least to [Hestenes, 1969], [Powell, 1969]. After a simple "complete-the-squares" step, ALM/MM yields [Afonso et al., 2010]:

$$(x^{(k)}, z^{(k)}) = \arg\min_{(x,z)} f_1(x) + f_2(z) + \tfrac{\mu}{2}\|Gx - z - d^{(k-1)}\|_2^2, \qquad d^{(k)} = d^{(k-1)} - (Gx^{(k)} - z^{(k)})$$


ADMM as Variable splitting and ALM

Use variable splitting (VS) to represent min_x f_1(x) + f_2(Gx) as

$$\min_{x,z} f_1(x) + f_2(z) \;\text{ subject to }\; Gx = z$$

ALM/MM yields:

$$(x^{(k)}, z^{(k)}) = \arg\min_{(x,z)} f_1(x) + f_2(z) + \tfrac{\mu}{2}\|Gx - z - d^{(k-1)}\|_2^2 \qquad \text{(P)}$$
$$d^{(k)} = d^{(k-1)} - (Gx^{(k)} - z^{(k)})$$

Solving (P) with one step of NLBGS gives the "scaled" ADMM version [Boyd et al., 2011]:

$$x^{(k)} = \arg\min_x f_1(x) + \tfrac{\mu}{2}\|Gx - z^{(k-1)} - d^{(k-1)}\|_2^2$$
$$z^{(k)} = \arg\min_z f_2(z) + \tfrac{\mu}{2}\|Gx^{(k)} - z - d^{(k-1)}\|_2^2$$
$$d^{(k)} = d^{(k-1)} - (Gx^{(k)} - z^{(k)})$$


ADMM algorithm

ADMM algorithm for solving min_x f_1(x) + f_2(Gx)

Require: k = 0, µ > 0, z^(0), d^(0)
1: repeat
2:   x^(k) = argmin_x f_1(x) + (µ/2)‖Gx − z^(k−1) − d^(k−1)‖²₂
3:   z^(k) = argmin_z f_2(z) + (µ/2)‖Gx^(k) − z − d^(k−1)‖²₂
4:   d^(k) = d^(k−1) − (Gx^(k) − z^(k))
5:   k = k + 1
6: until stopping criterion is satisfied

Equivalent to the split-Bregman method [Goldstein and Osher, 2009].
Connections with Douglas-Rachford splitting [Setzer, 2009].


ADMM algorithm for linear inverse problems

Instantiate ADMM to our linear inverse problem: min_x ‖Ax − y‖²₂ + τφ(Px)

Require: k = 0, µ > 0, z^(0), d^(0)
1: repeat
2:   x^(k) = argmin_x ‖Ax − y‖²₂ + (µ/2)‖Px − z^(k−1) − d^(k−1)‖²₂
3:   z^(k) = argmin_z τφ(z) + (µ/2)‖Px^(k) − z − d^(k−1)‖²₂ = prox_{τφ/µ}(Px^(k) − d^(k−1))
4:   d^(k) = d^(k−1) − (Px^(k) − z^(k))
5:   k = k + 1
6: until stopping criterion is satisfied
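For φ = ‖·‖1 and P = I, each step has a closed form. A minimal numpy sketch (using ½‖Ax − y‖²₂ as the data term and pre-factoring the x-update system once; these normalizations are assumptions):

import numpy as np

def admm_l1(A, y, tau, mu=1.0, n_iter=100):
    # ADMM for min 0.5*||Ax - y||_2^2 + tau*||x||_1 (the case P = I).
    n = A.shape[1]
    Aty = A.T @ y
    L = np.linalg.cholesky(A.T @ A + mu * np.eye(n))   # factor (A^T A + mu*I) once
    x = np.zeros(n); z = np.zeros(n); d = np.zeros(n)
    for _ in range(n_iter):
        rhs = Aty + mu * (z + d)
        x = np.linalg.solve(L.T, np.linalg.solve(L, rhs))        # x-update (quadratic)
        v = x - d
        z = np.sign(v) * np.maximum(np.abs(v) - tau / mu, 0.0)   # z-update = prox of (tau/mu)*l1
        d = d - (x - z)                                          # multiplier update
    return x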


A variant of the ADMM algorithm for more than two functions

Consider min_{x ∈ R^n} Σ_{j=1}^{J} g_j(H_j x) and map it onto the previous form min_x f_1(x) + f_2(Gx), with z = Gx:

$$f_1(x) = 0, \qquad f_2(z) = \sum_{j=1}^{J} g_j(z_j), \qquad G = \begin{bmatrix} H_1 \\ \vdots \\ H_J \end{bmatrix} \in \mathbb{R}^{p \times n}, \qquad z^{(k)} = \begin{bmatrix} z_1^{(k)} \\ \vdots \\ z_J^{(k)} \end{bmatrix}, \qquad d^{(k)} = \begin{bmatrix} d_1^{(k)} \\ \vdots \\ d_J^{(k)} \end{bmatrix}$$

The resulting updates are:

$$x^{(k)} = \Big( \sum_{j=1}^{J} H_j^{\top} H_j \Big)^{-1} \Big( \sum_{j=1}^{J} H_j^{\top} \big( z_j^{(k-1)} + d_j^{(k-1)} \big) \Big)$$

$$z_j^{(k)} = \mathrm{prox}_{g_j/\mu}\big( H_j x^{(k)} - d_j^{(k-1)} \big), \quad j = 1, \dots, J \qquad \text{C-SALSA [Afonso et al., 2011]}$$

$$d^{(k)} = d^{(k-1)} - (Gx^{(k)} - z^{(k)})$$


Example: MRI reconstruction with shearlet regularization

A transversal slice of a FLAIR sequence, resampled along a non-Cartesian trajectory based on an Archimedean spiral (sampling rate 15%) [Aelterman et al., 2011].


Example: CT reconstruction with shearlet regularization

$$\hat{x} = \arg\min_x \tfrac{1}{2}\|C^{-1}(Ax - y)\|_2^2 + \tau \|Px\|_1$$

Matrix C is a "prewhitener" for the acquisition system [Vandeghinste et al., 2013].


Example: CT reconstruction with shearlet regularization

Top left: reference; top right: SIRT; bottom left: ADMM with TV regularization; bottom right: ADMM with shearlet regularization [Vandeghinste et al., 2013].


Modelling structured sparsity

There are two main approaches to modelling structured sparsity in image reconstruction:

    in the acquisition stage
    in the reconstruction stage

In the following we focus only on the second approach. For the improved design of sampling patterns/trajectories that makes use of structured sparsity, see [Roman et al., 2015], [Adcock et al., 2017], [Gozcu, 2018].



Wavelet-tree sparsity

Wavelet coefficients of natural images are organized in trees across scales, with parent-child dependencies [Jacob et al., 2009], [He and Carin, 2009], [Rao et al., 2011]. Application to MRI: [Chen and Huang, 2014].


Wavelet-tree sparsity

Standard CS approach with ℓ1 regularization:

$$\hat{\theta} = \arg\min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \|\theta\|_1$$

Wavelet-tree sparsity approach (illustration in [Rao et al., 2011]): a group Lasso estimator

$$\hat{\theta}_{GL} = \arg\min_{\theta} \tfrac{1}{2}\|A\Psi\theta - y\|_2^2 + \tau \sum_{g \in G} \|\theta_g\|_2$$

where G is the collection of all parent-child groups and g is one such group. Whole groups are set to zero if their ℓ2 norm is relatively small.
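For non-overlapping groups the prox of the group penalty has a closed form, block soft thresholding. A sketch (overlapping parent-child groups need the variable-duplication tricks of [Rao et al., 2011], not shown here):

import numpy as np

def block_soft(theta, groups, t):
    # prox of t * sum_g ||theta_g||_2 for disjoint groups (lists of indices):
    # shrink each group's l2 norm by t, zeroing the group if the norm is below t.
    out = theta.copy()
    for g in groups:
        nrm = np.linalg.norm(theta[g])
        out[g] = 0.0 if nrm <= t else (1.0 - t / nrm) * theta[g]
    return out

theta = np.array([0.1, -0.2, 2.0, 1.5, 0.05])
print(block_soft(theta, [[0, 1], [2, 3], [4]], t=0.5))   # weak groups vanish as a whole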



Sparse reconstruction with Markov Random Field priors

Consider (slightly simpler): y = Ax + n, with y, n ∈ R^m, x ∈ R^n, and a binary support field s with s_i ∈ {0, 1}.

$$P(s) = \frac{1}{Z} \exp\Big[ -\Big( \sum_i \alpha s_i + \sum_{\langle i,j \rangle} \beta s_i s_j \Big) \Big]$$

$$P(s, x, y) = P(y|x)\, P(x|s)\, P(s)$$

$$[\hat{x}, \hat{s}] = \arg\max_{x,s} \sum_{\langle i,j \rangle} \beta s_i s_j + \sum_i \big[ \alpha s_i + \log p(x_i | s_i) \big] - \frac{1}{2\sigma^2} \|y - Ax\|_2^2$$
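For concreteness, a small numpy sketch evaluating the unnormalized log-prior on a 2-D support grid, assuming a 4-connected neighborhood (the α, β values are illustrative; with the sign convention of P(s) above, β < 0 rewards spatial clustering):

import numpy as np

def mrf_log_prior(s, alpha, beta):
    # -(alpha * sum_i s_i + beta * sum_<i,j> s_i s_j) for a binary field s on a
    # 2-D grid; <i,j> ranges over horizontal and vertical neighbor pairs.
    singles = s.sum()
    pairs = (s[:, :-1] * s[:, 1:]).sum() + (s[:-1, :] * s[1:, :]).sum()
    return -(alpha * singles + beta * pairs)

s = np.zeros((8, 8), dtype=int)
s[2:5, 2:5] = 1                                   # a spatially clustered support
print(mrf_log_prior(s, alpha=1.0, beta=-2.0))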


Sparse reconstruction with Markov Random Field priors

Use a Markov Random Field (MRF) as a statistical model for the spatial clustering of important wavelet coefficients [Cevher et al., 2010], [Pizurica et al., 2011].



Sparse MRI reconstruction with MRF priors

Consider y = Ax + n, with y, n ∈ C^m, x ∈ C^n, x = Ψθ, θ ∈ C^d, s_i ∈ {0, 1}.

Let Ω_s = {i : s_i = 1}. Define a model for θ that conforms to the support s:

$$\mathcal{M}_s = \{ \theta \in \mathbb{C}^d : \mathrm{supp}(\theta) = \Omega_s \}$$

Our objective is:

$$\min_{x \in \mathbb{C}^n} \|Ax - y\|_2^2 \;\text{ subject to }\; Px \in \mathcal{M}_s$$

or equivalently:

$$\min_{x \in \mathbb{C}^n} \|Ax - y\|_2^2 + \iota_{\Omega_s}(\mathrm{supp}(Px))$$

where

$$\iota_Q(q) = \begin{cases} 0, & q \in Q \\ +\infty, & \text{otherwise} \end{cases}$$

LaSB [Pizurica et al., 2011], GreeLa [Panic et al., 2016], LaSAL [Panic et al., 2017]


Example: CS-MRI with LaSB - early motivating results for using MRFs

SB (split-Bregman) and LaSB implemented with the same shearlet transform.


Example: CS-MRI with MRF priors

20% measurements, with variable-density random sampling.

    Left: zero fill (PSNR = 19.87 dB)
    Middle: WaTMRI [Chen and Huang, 2014] (wavelet-tree; PSNR = 28.78 dB)
    Right: LaSAL [Panic et al., 2017] (MRF-based; PSNR = 33.43 dB)


Example: CS-MRI with MRF priors

Reconstructions from 20% measurements, with radial sampling.
Left: reconstructions; right: error images; top: LaSAL; bottom: WaTMRI.



Machine learning in image reconstruction

Covered in many recent workshops, special sessions and special issues of journals.

Three main directions have been proposed:

    learned post-processing or learned denoisers
    learning a regularizer and using it in a classical variational regularization scheme
    learning the full reconstruction operator


Machine learning in image reconstruction: Special issues

IEEE Signal Processing Magazine, November 2017: Deep Learning for Visual Understanding, Part 1.


Machine learning in image reconstruction: Special issues

IEEE Signal Processing Magazine, January 2018: Deep Learning for Visual Understanding, Part 2.


Learning fast approximations of sparse coding

Core idea: a time-unfolded version of an iterative reconstruction algorithm, like IST, truncated to a fixed number of iterations, with the matrices and thresholds of each iteration treated as learnable parameters.

Representatives: LISTA [Gregor and LeCun, 2010], [Moreau and Bruna, 2017]
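The unrolled computation is compact. A minimal forward-pass sketch in numpy (in LISTA, W, S and the threshold t are learned by backpropagation; here they are merely initialized at their IST-equivalent values from A, so this forward pass reproduces truncated IST):

import numpy as np

def soft(v, t):
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def lista_forward(y, W, S, t, n_layers=10):
    # Time-unfolded IST with a fixed number of layers.
    z = soft(W @ y, t)
    for _ in range(n_layers - 1):
        z = soft(W @ y + S @ z, t)
    return z

rng = np.random.default_rng(3)
A = rng.standard_normal((30, 100)) / np.sqrt(30)
gamma = np.linalg.norm(A, 2) ** 2
W = A.T / gamma                            # IST-equivalent initialization
S = np.eye(100) - (A.T @ A) / gamma
y = A @ np.where(rng.random(100) < 0.05, 1.0, 0.0)   # sparse ground truth
print(np.count_nonzero(lista_forward(y, W, S, t=0.01)))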


Deep CNN models in image reconstruction

A central question is whether one can combine elements of model-driven and data-driven approaches for solving ill-posed inverse problems [McCann et al., 2017].


Summary

Sparse optimization is a fundamental concept in inverse problems like image reconstruction.

We covered some basic components of sparse image recovery algorithms, including ADMM-based methods.

The concept of structured sparsity was highlighted, with particular attention to Markov Random Field priors in sparse image recovery.

A new frontier: machine learning in image reconstruction. Great potential, and a huge variety of approaches.

References

Adcock, B., Hansen, A., Poon, C., and Roman, B. (2017). Breaking the coherence barrier: A new theory for compressed sensing. Forum of Mathematics, Sigma, 5(e4).

Aelterman, J., Luong, H. Q., Goossens, B., Pizurica, A., and Philips, W. (2011). Augmented Lagrangian based reconstruction of non-uniformly sub-Nyquist sampled MRI data. Signal Processing, 91(12):2731–2742.

Afonso, M. V., Bioucas-Dias, J. M., and Figueiredo, M. A. (2010). Fast image recovery using variable splitting and constrained optimization. IEEE Transactions on Image Processing, 19(9):2345–2356.

Afonso, M. V., Bioucas-Dias, J. M., and Figueiredo, M. A. T. (2011). An augmented Lagrangian approach to the constrained optimization formulation of imaging inverse problems. IEEE Transactions on Image Processing, 20(3):681–695.

Beck, A. and Teboulle, M. (2009). A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2(1):183–202.

Bioucas-Dias, J. M. and Figueiredo, M. A. (2007). A new TwIST: Two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Transactions on Image Processing, 16(12):2992–3004.

Blumensath, T. and Davies, M. (2009). Sampling theorems for signals from the union of finite-dimensional linear subspaces. IEEE Transactions on Information Theory, 55(4):1872–1882.

Boyd, S., Parikh, N., Chu, E., Peleato, B., and Eckstein, J. (2011). Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 3(1):1–122.

Candes, E. J., Romberg, J., and Tao, T. (2006). Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on Information Theory, 52(2):489–509.

Cevher, V., Indyk, P., Carin, L., and Baraniuk, R. G. (2010). Sparse signal recovery and acquisition with graphical models. IEEE Signal Processing Magazine, 27(6):92–103.

Chambolle, A. (2004). An algorithm for total variation minimization and applications. Journal of Mathematical Imaging and Vision, 20(1-2):89–97.

Chen, C. and Huang, J. (2014). Exploiting the wavelet structure in compressed sensing MRI. Magnetic Resonance Imaging, 32(10):1377–1389.

Chen, S., Donoho, D., and Saunders, M. (2001). Atomic decomposition by basis pursuit. SIAM Review, 43(1):129–159.

Combettes, P. L. and Wajs, V. R. (2005). Signal recovery by proximal forward-backward splitting. Multiscale Modeling & Simulation, 4(4):1168–1200.

Courant, R. (1943). Variational methods for the solution of problems of equilibrium and vibrations. Bulletin of the American Mathematical Society, 49(1):1–23.

Daubechies, I., Defrise, M., and De Mol, C. (2004). An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Communications on Pure and Applied Mathematics, 57(11):1413–1457.

Donoho, D. L. (2006). Compressed sensing. IEEE Transactions on Information Theory, 52(4):1289–1306.

Figueiredo, M. and Nowak, R. (2003). An EM algorithm for wavelet-based image restoration. IEEE Transactions on Image Processing, 12(8):906–916.

Figueiredo, M. and Wright, S. (2013). Applications of sparse optimization.

Goldstein, T. and Osher, S. (2009). The split Bregman method for ℓ1-regularized problems. SIAM Journal on Imaging Sciences, 2(2):323–343.

Gozcu, B. et al. (2018). Learning-based compressive MRI. IEEE Transactions on Medical Imaging, 37(6).

Gregor, K. and LeCun, Y. (2010). Learning fast approximations of sparse coding. In Proceedings of the 27th International Conference on Machine Learning.

He, L. and Carin, L. (2009). Exploiting structure in wavelet-based Bayesian compressive sensing. IEEE Transactions on Signal Processing, 57(9):3488–3497.

Hestenes, M. R. (1969). Multiplier and gradient methods. Journal of Optimization Theory and Applications, 4(5):303–320.

Jacob, L., Obozinski, G., and Vert, J.-P. (2009). Group Lasso with overlap and graph Lasso. In Proceedings of the 26th International Conference on Machine Learning.

Lustig, M., Donoho, D., and Pauly, J. M. (2007). Sparse MRI: The application of compressed sensing for rapid MR imaging. Magnetic Resonance in Medicine, 58(6):1182–1195.

Mallat, S. and Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12):3397–3415.

McCann, M., Jin, K., and Unser, M. (2017). Convolutional neural networks for inverse problems in imaging. IEEE Signal Processing Magazine, 34(6):85–95.

Moreau, T. and Bruna, J. (2017). Understanding trainable sparse coding via matrix factorization. In Proceedings of ICLR.

Needell, D. and Tropp, J. A. (2009). CoSaMP: Iterative signal recovery from incomplete and inaccurate samples. Applied and Computational Harmonic Analysis, 26(3):301–321.

Panic, M., Aelterman, J., Crnojevic, V., and Pizurica, A. (2017). Sparse recovery in magnetic resonance imaging with a Markov random field prior. IEEE Transactions on Medical Imaging, 36(10):2104–2115.

Panic, M., Vukobratovic, D., Crnojevic, V., and Pizurica, A. (2016). Sparse MRI with a Markov random field prior for the subband coefficients. In Proceedings of the third International Traveling Workshop on Interactions between Sparse models and Technology (iTWIST'16), pages 56–58.

Pizurica, A., Aelterman, J., Bai, F., Vanloocke, S., Luong, Q., Goossens, B., and Philips, W. (2011). On structured sparsity and selected applications in tomographic imaging. In SPIE Conference on Wavelets and Sparsity XIV, volume 8138, pages 81381D–1–12.

Powell, M. (1969). A method for nonlinear constraints in minimization problems. In Fletcher, R., editor, Optimization, pages 283–298. Academic Press, New York.

Rao, N., Nowak, R., Wright, S., and Kingsbury, N. (2011). Convex approaches to model wavelet sparsity patterns. In IEEE International Conference on Image Processing.

Roman, B., Adcock, B., and Hansen, A. (2015). On asymptotic structure in compressed sensing. In Proceedings of the third International Traveling Workshop on Interactions between Sparse models and Technology (iTWIST'16).

Setzer, S. (2009). Split Bregman algorithm, Douglas-Rachford splitting and frame shrinkage. In International Conference on Scale Space and Variational Methods in Computer Vision, pages 464–476. Springer.

Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society, Series B, 58(1):267–288.

Tropp, J. (2004). Greed is good: Algorithmic results for sparse approximation. IEEE Transactions on Information Theory, 50(10):2231–2242.

Vandeghinste, B., Goossens, B., Van Holen, R., Vanhove, C., Pizurica, A., Vandenberghe, S., and Staelens, S. (2013). Iterative CT reconstruction using shearlet-based regularization. IEEE Transactions on Nuclear Science, 60(5):3305–3317.

Wright, S. J., Nowak, R. D., and Figueiredo, M. A. (2009). Sparse reconstruction by separable approximation. IEEE Transactions on Signal Processing, 57(7):2479–2493.