
Modern iterative methods

Basic iterative methods converge linearly; modern iterative methods converge faster.

– Krylov subspace methods
• Steepest descent method
• Conjugate gradient (CG) method --- most popular
• Preconditioning CG (PCG) method
• GMRES for nonsymmetric matrices

– Other methods (read yourself)
• Chebyshev iterative method
• Lanczos methods
• Conjugate gradient normal residual (CGNR)

Basic iterative methods (splitting form):
$$A x = b \;\Longleftrightarrow\; D x = R x + c \;\Longrightarrow\; x^{(m+1)} = D^{-1} R\, x^{(m)} + D^{-1} c$$

Modern methods instead reformulate $A x = b$ as a minimization problem:
$$\min_{x\in\mathbb{R}^n}\ \phi(x) := \tfrac12\, x^T A\, x - x^T b$$

Modern iterative methods

Ideas:
– Minimizing the residual
– Projecting to a Krylov subspace

Thm: If A is an n-by-n real symmetric positive definite matrix, then
$$A x = b \qquad\text{and}\qquad \min_{x\in\mathbb{R}^n}\ \phi(x) := \tfrac12\, x^T A\, x - x^T b$$
have the same solution.
Proof: see details in class

Moreover, the solution is $x^* = A^{-1} b$ with $\phi(x^*) = -\tfrac12\, b^T A^{-1} b$.

This suggests an iterative framework: given a search direction $d^{(m)}$, set
$$x^{(m+1)} = x^{(m)} + \alpha_m d^{(m)},\qquad \alpha_m := \arg\min_{\alpha}\ \phi\big(x^{(m)} + \alpha\, d^{(m)}\big).$$
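A quick sketch of the reasoning behind the theorem above (the full proof is given in class, as noted): since A is symmetric,
$$\nabla\phi(x) = A x - b, \qquad \nabla^2\phi(x) = A \succ 0,$$
so $\phi$ is strictly convex and its unique minimizer $x^*$ satisfies $\nabla\phi(x^*) = A x^* - b = 0$, i.e. it is exactly the solution of $A x = b$.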

Steepest descent method

Suppose we have an approximation $x_c \approx x^*$. Choose the direction $d_c$ as the negative gradient of $\phi(x)$ at $x_c$:
$$d_c := -\nabla\phi(x)\big|_{x = x_c} = b - A x_c =: r_c$$
– If $r_c = b - A x_c = 0$, then $x_c$ is the exact solution!
– Else, choose $\alpha_c$ to minimize $\phi(x_c + \alpha\, d_c)$

Steepest descent method

Computation

$$\phi(x_c + \alpha d_c) = \tfrac12\,(x_c + \alpha d_c)^T A\,(x_c + \alpha d_c) - (x_c + \alpha d_c)^T b
= \phi(x_c) - \alpha\, d_c^T r_c + \tfrac12\,\alpha^2\, d_c^T A\, d_c$$

Minimizing over $\alpha$, choose $\alpha_c$ as
$$\alpha_c = \frac{d_c^T d_c}{d_c^T A\, d_c} = \frac{r_c^T r_c}{r_c^T A\, r_c}\qquad(\text{since } d_c = r_c).$$

Algorithm – Steepest descent method

Choose an initial guess $x^{(0)}$
Compute $r_0 = b - A x^{(0)}$ & set $m = 0$
while $r_m \ne 0$ (e.g. until $\|r_m\| / \|r_0\| \le \varepsilon$ for a prescribed tolerance $\varepsilon$)
    $\alpha_m = (r_m^T r_m) / (r_m^T A\, r_m)$
    $x^{(m+1)} = x^{(m)} + \alpha_m r_m$  &  $r_{m+1} = b - A x^{(m+1)}$
    $m = m + 1$
end while
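A minimal NumPy sketch of the steepest descent iteration above; the tolerance tol and the iteration cap max_iter are illustrative assumptions, since the slide's exact stopping threshold is not legible.

```python
import numpy as np

def steepest_descent(A, b, x0, tol=1e-10, max_iter=1000):
    """Steepest descent for symmetric positive definite A (sketch)."""
    x = np.asarray(x0, dtype=float).copy()
    r = b - A @ x                      # r_0 = b - A x^(0)
    r0_norm = np.linalg.norm(r)
    for _ in range(max_iter):
        if np.linalg.norm(r) <= tol * r0_norm:
            break                      # stop once the relative residual is small
        Ar = A @ r
        alpha = (r @ r) / (r @ Ar)     # alpha_m = (r_m^T r_m) / (r_m^T A r_m)
        x = x + alpha * r              # x^(m+1) = x^(m) + alpha_m r_m
        r = r - alpha * Ar             # r_(m+1) = r_m - alpha_m A r_m
    return x
```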

Theory

Suppose A is symmetric positive definite. Define the A-inner product
$$(x, y)_A := (A x, y) = y^T A\, x$$
and the A-norm
$$\|x\|_A := \sqrt{(x, x)_A} = \sqrt{x^T A\, x}.$$

Steepest descent method, rewritten in this notation: with an initial guess $x^{(0)}$,
$$r_m = b - A x^{(m)},\qquad \alpha_m = \frac{(r_m, r_m)}{(r_m, r_m)_A},\qquad x^{(m+1)} = x^{(m)} + \alpha_m r_m .$$

Theory

Thm: For the steepest descent method, we have
$$\phi\big(x^{(m+1)}\big) - \phi(x^*) \le \Big(1 - \frac{1}{k_2(A)}\Big)\Big(\phi\big(x^{(m)}\big) - \phi(x^*)\Big),
\qquad \phi(x^*) = -\tfrac12\, b^T A^{-1} b .$$

Proof: Exercise

Theory

Rewrite the steepest descent method as
$$r_m = b - A x^{(m)},\qquad \alpha_m = \frac{(r_m, r_m)}{(r_m, r_m)_A},\qquad x^{(m+1)} = x^{(m)} + \alpha_m r_m .$$

Let the errors be
$$e^{(m)} := x^{(m)} - x^*,\qquad \tilde e^{(m)} := x^{(m+1)} - x^{(m)} = \alpha_m r_m ,$$
so that $e^{(m+1)} = e^{(m)} + \tilde e^{(m)}$ and $r_m = -A\, e^{(m)}$.

Lemma: For the method, we have
$$\big(e^{(m+1)},\, \tilde e^{(m)}\big)_A = 0 .$$
– Proof: See details in class or as an exercise

Theory

Thm: For the steepest descent method, we have
$$\|e^{(m+1)}\|_A^2 = \|e^{(m)}\|_A^2 - \|\tilde e^{(m)}\|_A^2 \le \Big(1 - \frac{1}{k_2(A)}\Big)\,\|e^{(m)}\|_A^2 ,$$
i.e. the algorithm converges monotonically in the sense of the A-norm.

Proof: See details in class (or as an exercise)

Steepest descent method

Performance
– Converges globally, for any initial data
– If $k_2(A) = O(1)$, then it converges very fast
– If $k_2(A) \gg 1$, then it converges very slowly!!!

Geometric interpretation
– The contour plots of $\phi$ are flat (elongated) when $k_2(A) \gg 1$!!
– The local best direction (the steepest descent direction) is not necessarily a globally best direction
– Computational experience shows that the method suffers a decreasing convergence rate after a few iteration steps, because the search directions become linearly dependent!!!

Conjugate gradient (CG) method

Since A is symmetric positive definite, the A-norm is
$$\|x\|_A = \sqrt{(x, x)_A} = \sqrt{x^T A\, x}.$$

In the CG method, the direction vectors are chosen to be A-orthogonal (and are called conjugate vectors), i.e.
$$(d^{(i)})^T A\, d^{(m)} = 0,\qquad i \ne m.$$

CG method

In addition, we take the new direction vector as a linear combination of the old direction vector and the descent direction:
$$d^{(m+1)} = r_{m+1} + \beta_m d^{(m)},\qquad r_{m+1} = b - A x^{(m+1)}.$$
By the A-orthogonality assumption $(d^{(m+1)})^T A\, d^{(m)} = 0$ we get
$$0 = \big(r_{m+1} + \beta_m d^{(m)}\big)^T A\, d^{(m)}
\quad\Longrightarrow\quad
\beta_m = -\,\frac{r_{m+1}^T A\, d^{(m)}}{(d^{(m)})^T A\, d^{(m)}}.$$

Algorithm – CG method

Choose an initial guess $x^{(0)}$
Compute $r_0 = b - A x^{(0)}$ & set $d_0 = r_0$
For $m = 0, 1, \dots$ do
    Compute $\alpha_m = (r_m^T r_m) / (d_m^T A\, d_m)$
    $x^{(m+1)} = x^{(m)} + \alpha_m d_m$  &  $r_{m+1} = r_m - \alpha_m A\, d_m$
    If $\|r_{m+1}\|_2 \le \varepsilon$ (a prescribed tolerance), then
        stop
    endif
    $\beta_m = (r_{m+1}^T r_{m+1}) / (r_m^T r_m)$  &  $d_{m+1} = r_{m+1} + \beta_m d_m$
endfor
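A minimal NumPy sketch of the CG algorithm above; tol is an assumed stopping tolerance (the slide's exact threshold is not legible) and at most n iterations are taken by default.

```python
import numpy as np

def conjugate_gradient(A, b, x0, tol=1e-10, max_iter=None):
    """Conjugate gradient for symmetric positive definite A (sketch)."""
    x = np.asarray(x0, dtype=float).copy()
    r = b - A @ x                      # r_0 = b - A x^(0)
    d = r.copy()                       # d_0 = r_0
    rs_old = r @ r
    for _ in range(max_iter or len(b)):
        Ad = A @ d
        alpha = rs_old / (d @ Ad)      # alpha_m = (r_m^T r_m) / (d_m^T A d_m)
        x = x + alpha * d              # x^(m+1) = x^(m) + alpha_m d_m
        r = r - alpha * Ad             # r_(m+1) = r_m - alpha_m A d_m
        rs_new = r @ r
        if np.sqrt(rs_new) <= tol:
            break                      # ||r_(m+1)||_2 small enough
        beta = rs_new / rs_old         # beta_m = (r_(m+1)^T r_(m+1)) / (r_m^T r_m)
        d = r + beta * d               # d_(m+1) = r_(m+1) + beta_m d_m
        rs_old = rs_new
    return x
```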

An example

Solve $A x = b$ with
$$A = \begin{pmatrix} 5 & 1 & 1\\ 1 & 5 & 1\\ 1 & 1 & 5\end{pmatrix},\qquad
b = \begin{pmatrix}7\\7\\7\end{pmatrix},\qquad
x = \begin{pmatrix}1\\1\\1\end{pmatrix}.$$

Initial guess
$$x^{(0)} = \begin{pmatrix}0.0000\\0.0000\\0.0000\end{pmatrix}.$$

The approximate solutions:
$$r_0 = d_0 = \begin{pmatrix}7.0000\\7.0000\\7.0000\end{pmatrix},\qquad
\alpha_0 = 0.1429,\qquad
x^{(1)} = \begin{pmatrix}1.0003\\1.0003\\1.0003\end{pmatrix}.$$
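The first CG step of this example can be reproduced directly with NumPy. Note that $0.1429 \times 7 = 1.0003$, so the slide's $x^{(1)}$ reflects the rounded $\alpha_0$; exact arithmetic with $\alpha_0 = 1/7$ gives $(1, 1, 1)^T$.

```python
import numpy as np

A = np.array([[5., 1., 1.],
              [1., 5., 1.],
              [1., 1., 5.]])
b = np.array([7., 7., 7.])

x0 = np.zeros(3)
r0 = b - A @ x0                             # r_0 = d_0 = (7, 7, 7)^T
alpha0 = (r0 @ r0) / (r0 @ (A @ r0))        # = 1/7, i.e. 0.1429 to 4 digits
x1_exact = x0 + alpha0 * r0                 # (1, 1, 1)^T
x1_rounded = x0 + round(alpha0, 4) * r0     # (1.0003, 1.0003, 1.0003)^T, as on the slide
print(r0, round(alpha0, 4), x1_exact, x1_rounded)
```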

CG method

In the CG method, $d^{(m)}$ and $d^{(m+1)}$ are A-orthogonal!
$$(d^{(m+1)})^T A\, d^{(m)} = \big(d^{(m+1)}, d^{(m)}\big)_A = 0,
\qquad\text{i.e. } d^{(m+1)} \perp d^{(m)} \text{ with respect to } (\cdot,\cdot)_A .$$

Define the linear space
$$\operatorname{span}\{d_0, d_1, \dots, d_m\} := \Big\{\, y \;\Big|\; y = \sum_{i=0}^{m} \gamma_i\, d_i \,\Big\}.$$

Lemma: In the CG method, for m = 0, 1, …, we have
$$\operatorname{span}\{d_0, d_1, \dots, d_m\} = \operatorname{span}\{r_0, r_1, \dots, r_m\}
= \operatorname{span}\{r_0, A r_0, \dots, A^m r_0\}.$$
– Proof: See details in class or as an exercise

CG method

In the CG method, $d_{m+1}$ is A-orthogonal to $d_0, d_1, \dots, d_m$, or
$$d_{m+1} \perp \operatorname{span}\{d_0, d_1, \dots, d_m\}\quad\text{with respect to } (\cdot,\cdot)_A .$$

Lemma: In the CG method, we have
$$(d_i)^T A\, d_j = 0,\ \ i \ne j \qquad\text{and}\qquad r_i^T r_j = 0,\ \ i \ne j .$$
– Proof: See details in class or as an exercise

Thm (error estimate for the CG method): for m = 0, 1, 2, …,
$$\|e^{(m)}\|_A \le 2\left(\frac{\sqrt{k_2(A)} - 1}{\sqrt{k_2(A)} + 1}\right)^{m} \|e^{(0)}\|_A ,$$
where $e^{(m)} = x^{(m)} - x^*$, $\|x\|_A = \sqrt{x^T A\, x}$, and $k_2(A) = \lambda_{\max}(A)/\lambda_{\min}(A)$.

CG method

Computational cost
– At each iteration, 2 matrix-vector multiplications; this can be further reduced to 1 matrix-vector multiplication
– In at most n steps, we get the exact solution!!!

Convergence rate depends on the condition number
– $k_2(A) = O(1)$: converges very fast!!
– $k_2(A) \gg 1$: converges slowly, but can be accelerated by preconditioning!!

Preconditioning

Ideas: replace $A x = b$ by $\tilde A\, \tilde x = \tilde b$ with
$$\tilde A = C^{-1} A\, C^{-1},\qquad \tilde x = C x,\qquad \tilde b = C^{-1} b ,$$
satisfying
– C is symmetric positive definite
– $\tilde A$ is well-conditioned, i.e. $k_2(\tilde A) \ll k_2(A)$
– Linear systems with C can be easily solved

Conditions for choosing the preconditioning matrix
– $k_2(\tilde A)$ as small as possible
– The action of $C^{-1}$ is easy to compute
– Trade-off between the two
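A one-line check that the transformed system above is equivalent to the original one, under the stated choice of $\tilde A$, $\tilde x$, $\tilde b$:
$$\tilde A\,\tilde x = C^{-1} A\, C^{-1}(C x) = C^{-1}(A x) = C^{-1} b = \tilde b ,$$
so solving $\tilde A\tilde x = \tilde b$ and mapping back via $x = C^{-1}\tilde x$ solves $A x = b$; moreover $\tilde A = C^{-1} A C^{-1}$ is again symmetric positive definite when A and C are.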

Algorithm – PCG method

Choose an initial guess $x^{(0)}$ & compute $r_0 = b - A x^{(0)}$
Solve $C\, \tilde r_0 = r_0$ & set $d_0 = \tilde r_0$
For $m = 0, 1, \dots$ do
    Compute $\alpha_m = (\tilde r_m^T r_m) / (d_m^T A\, d_m)$
    $x^{(m+1)} = x^{(m)} + \alpha_m d_m$  &  $r_{m+1} = r_m - \alpha_m A\, d_m$
    If $\|r_{m+1}\|_2 \le \varepsilon$ (a prescribed tolerance), then
        stop
    endif
    Solve $C\, \tilde r_{m+1} = r_{m+1}$
    $\beta_m = (\tilde r_{m+1}^T r_{m+1}) / (\tilde r_m^T r_m)$  &  $d_{m+1} = \tilde r_{m+1} + \beta_m d_m$
endfor
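A minimal NumPy sketch of the PCG iteration above; z plays the role of $\tilde r$. For concreteness the preconditioner C is taken as the diagonal part of A (the Jacobi-type choice from the list on the next slide); this choice, the tolerance, and the iteration cap are illustrative assumptions.

```python
import numpy as np

def pcg(A, b, x0, tol=1e-10, max_iter=None):
    """Preconditioned CG (sketch) with C = diag(A)."""
    C = np.diag(A).copy()              # Jacobi preconditioner: C z = r is solved elementwise
    x = np.asarray(x0, dtype=float).copy()
    r = b - A @ x                      # r_0 = b - A x^(0)
    z = r / C                          # solve C z_0 = r_0
    d = z.copy()                       # d_0 = z_0
    rz_old = r @ z
    for _ in range(max_iter or len(b)):
        Ad = A @ d
        alpha = rz_old / (d @ Ad)      # alpha_m = (z_m^T r_m) / (d_m^T A d_m)
        x = x + alpha * d
        r = r - alpha * Ad
        if np.linalg.norm(r) <= tol:
            break
        z = r / C                      # solve C z_(m+1) = r_(m+1)
        rz_new = r @ z
        beta = rz_new / rz_old         # beta_m = (z_(m+1)^T r_(m+1)) / (z_m^T r_m)
        d = z + beta * d               # d_(m+1) = z_(m+1) + beta_m d_m
        rz_old = rz_new
    return x
```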

Preconditioning

Ways to choose the matrix C (read yourself)
– Diagonal part of A
– Tridiagonal part of A
– m-step Jacobi preconditioner
– Symmetric Gauss-Seidel preconditioner
– SSOR preconditioner
– Incomplete Cholesky decomposition
– Incomplete block preconditioning
– Preconditioning based on domain decomposition
– ……

Extension of the CG method to nonsymmetric matrices

Biconjugate gradient (BiCG) method:
– Solve $A x = b$ & $A^T y = b$ simultaneously
– Works well when A is positive definite but not symmetric
– If A is symmetric, BiCG reduces to CG

Conjugate gradient squared (CGS) method
– Useful when A has a special formula for computing $A x$ but its transpose does not
– Multiplication by A is efficient, but multiplication by its transpose $A^T$ is not

Krylov subspace methods

Problem I. Linear system:
$$A x = b$$
Problem II. Variational formulation: find $x \in \mathbb{R}^n$ such that
$$(A x, v) = (b, v)\qquad \forall\, v \in \mathbb{R}^n$$
Problem III. Minimization problem:
$$\min_{x\in\mathbb{R}^n}\ \phi(x) := \tfrac12\, x^T A\, x - x^T b = \tfrac12\, (A x, x) - (b, x)$$

– Thm 1: Problem I is equivalent to Problem II
– Thm 2: If A is symmetric positive definite, all three problems are equivalent

Krylov subspace methods

To reduce the problem size, we replace $\mathbb{R}^n$ by a subspace: with an initial guess $x^{(0)}$,
$$x^{(m)} \in x^{(0)} + S_m \subset \mathbb{R}^n,\qquad
S_m = \operatorname{span}\{d_0, d_1, \dots, d_{m-1}\},\qquad
x^{(m)} = x^{(0)} + \sum_{k=0}^{m-1}\gamma_k\, d_k .$$

Subspace minimization:
– Find $x^{(m)} \in x^{(0)} + S_m$
– Such that
$$\phi\big(x^{(m)}\big) = \min_{x \in x^{(0)} + S_m}\ \phi(x) := \tfrac12\, x^T A\, x - x^T b = \tfrac12\,(A x, x) - (b, x)$$

Subspace projection: find $x^{(m)} \in x^{(0)} + S_m$ such that
$$\big(A x^{(m)}, v\big) = (b, v)\quad \forall\, v \in S_m,
\qquad\text{i.e.}\qquad
\big(A x^{(m)}, d_k\big) = (b, d_k),\quad k = 0, 1, \dots, m-1 .$$

Krylov subspace methods

To determine the coefficients, we have the normal equations:
$$d_k^T A\, x^{(m)} = d_k^T b,\quad k = 0, 1, \dots, m-1
\qquad\Longleftrightarrow\qquad
\sum_{l=0}^{m-1} \gamma_l\, d_k^T A\, d_l = d_k^T\big(b - A x^{(0)}\big) = d_k^T r_0,\quad k = 0, 1, \dots, m-1 .$$
– It is a linear system of size m!!

m = 1: line minimization (or linear search, or 1D projection)
$$x^{(1)} = x^{(0)} + \gamma_0\, d_0,\qquad \gamma_0 = \frac{d_0^T r_0}{d_0^T A\, d_0} .$$

By converting this formula into an iteration, we reduce the original problem to a sequence of line minimizations (successive line minimization).

For symmetric matrices

Positive definite
– Steepest descent method: $d_k = r_k$
– CG method: $d_{k+1} = r_{k+1} + \beta_k d_k$ with $(d_{k+1}, d_k)_A = 0$
– Preconditioning CG (PCG) method

Non-positive definite
– MINRES (minimum residual method): minimize $\|b - A x^{(m)}\|_2$ over
$$x^{(m)} \in x^{(0)} + S_m,\qquad
S_m = \operatorname{span}\{r_0, A r_0, \dots, A^{m-1} r_0\} = K_m(A, r_0) .$$

For nonsymmetric matrices

Normal equations method (or CGNR method): solve
$$\tilde A\, x = \tilde b \qquad\text{with}\qquad \tilde A = A^T A \ \ \&\ \ \tilde b = A^T b .$$

GMRES (generalized minimum residual method)
– Saad & Schultz, 1986
– Ideas:
• In the m-th step, minimize the residual $\|b - A x^{(m)}\|_2$ over the set
$$x^{(m)} \in x^{(0)} + S_m,\qquad
S_m = \operatorname{span}\{r_0, A r_0, \dots, A^{m-1} r_0\} = K_m(A, r_0)$$
• Use Arnoldi (full orthogonalization) vectors instead of Lanczos vectors
• If A is symmetric, it reduces to the conjugate residual method

Algorithm – GMRES

Choose an initial guess $x^{(0)}$
Compute $r_0 = b - A x^{(0)}$, set $h_{1,0} = \|r_0\|_2$ & $k = 0$
while $h_{k+1,k} \ne 0$
    $q_{k+1} = r_k / h_{k+1,k}$ & $k = k + 1$ & $r_k = A\, q_k$
    for $i = 1, \dots, k$
        $h_{i,k} = q_i^T r_k$  &  $r_k = r_k - h_{i,k}\, q_i$
    end
    $h_{k+1,k} = \|r_k\|_2$
    $x^{(k)} = x^{(0)} + Q_k\, y_k$ with $y_k = \arg\min_y \big\|\, h_{1,0}\, e_1 - \bar H_k\, y \,\big\|_2$
until $\|x^{(k)} - x^{(k-1)}\| \le \varepsilon$ (a prescribed tolerance)
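A compact NumPy sketch of GMRES in the spirit of the algorithm above: build Arnoldi vectors, then solve the small least-squares problem for $y_k$. The restart-free loop length m_max and the tolerance tol are assumed parameters.

```python
import numpy as np

def gmres(A, b, x0, m_max=50, tol=1e-10):
    """GMRES (sketch): Arnoldi process + small least-squares problem."""
    n = len(b)
    x0 = np.asarray(x0, dtype=float)
    r0 = b - A @ x0
    beta = np.linalg.norm(r0)              # h_{1,0} = ||r_0||_2
    if beta == 0.0:
        return x0.copy()                   # x^(0) is already the exact solution
    Q = np.zeros((n, m_max + 1))
    H = np.zeros((m_max + 1, m_max))
    Q[:, 0] = r0 / beta
    x = x0.copy()
    for k in range(m_max):
        w = A @ Q[:, k]                    # new Krylov direction
        for i in range(k + 1):             # Arnoldi orthogonalization
            H[i, k] = Q[:, i] @ w
            w = w - H[i, k] * Q[:, i]
        H[k + 1, k] = np.linalg.norm(w)
        if H[k + 1, k] > 1e-14:
            Q[:, k + 1] = w / H[k + 1, k]
        # y_k minimizes || beta * e_1 - Hbar_k y ||_2
        e1 = np.zeros(k + 2)
        e1[0] = beta
        y, *_ = np.linalg.lstsq(H[:k + 2, :k + 1], e1, rcond=None)
        x = x0 + Q[:, :k + 1] @ y          # x^(k) = x^(0) + Q_k y_k
        if np.linalg.norm(b - A @ x) <= tol:
            break
    return x
```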

More topics on matrix computations

Eigenvalue & eigenvector computations
$$A \in \mathbb{R}^{n\times n}:\qquad A x = \lambda x \quad\text{with}\quad 0 \ne x \in \mathbb{R}^n$$

If A is symmetric: Power method

If A is a general matrix
– Householder matrix (transform)
$$P = I - \frac{2\, v v^T}{v^T v},\qquad 0 \ne v \in \mathbb{R}^n,\qquad P = P^T,\quad P^T P = P P^T = I$$
– QR method
$$A = Q R \ \text{ with } Q \text{ an orthogonal matrix and } R \text{ an upper triangular matrix}$$
$$U^T A\, U = C \ \text{ with } C \text{ an upper Hessenberg matrix and } U = P_1 P_2 \cdots P_{n-2}$$
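A minimal NumPy sketch of the power method mentioned above for the dominant eigenpair of a symmetric matrix; the iteration count, tolerance, and random starting vector are illustrative assumptions.

```python
import numpy as np

def power_method(A, num_iter=1000, tol=1e-12, seed=0):
    """Power method (sketch): dominant eigenvalue/eigenvector of symmetric A."""
    x = np.random.default_rng(seed).standard_normal(A.shape[0])
    x /= np.linalg.norm(x)
    lam = 0.0
    for _ in range(num_iter):
        y = A @ x                          # apply A
        lam_new = x @ y                    # Rayleigh quotient estimate of lambda
        x = y / np.linalg.norm(y)          # normalize the iterate
        if abs(lam_new - lam) <= tol * max(abs(lam_new), 1.0):
            lam = lam_new
            break
        lam = lam_new
    return lam, x
```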

More topics on matrix computations

Singular value decomposition (SVD)

Thm: Let A be an m-by-n real matrix ($A \in \mathbb{R}^{m\times n}$). There exist orthogonal matrices
$$U = [u_1, u_2, \dots, u_m] \in \mathbb{R}^{m\times m}
\qquad\&\qquad
V = [v_1, v_2, \dots, v_n] \in \mathbb{R}^{n\times n}$$
such that
$$U^T A\, V = \Sigma = \operatorname{diag}(\sigma_1, \sigma_2, \dots, \sigma_p) \in \mathbb{R}^{m\times n},\qquad p = \min\{m, n\},$$
with $\sigma_1 \ge \sigma_2 \ge \dots \ge \sigma_r > \sigma_{r+1} = \dots = \sigma_p = 0$ and $r = \operatorname{rank}(A)$.

Proof: Exercise
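The theorem can be checked numerically with NumPy's SVD routine; the small test matrix below is an arbitrary illustration.

```python
import numpy as np

A = np.array([[1., 2., 0.],
              [2., 4., 0.]])                       # a 2-by-3 matrix of rank 1

U, s, Vt = np.linalg.svd(A)                        # A = U @ Sigma @ Vt, s sorted descending
Sigma = np.zeros_like(A)
Sigma[:len(s), :len(s)] = np.diag(s)               # Sigma = diag(sigma_1, ..., sigma_p), p = min{m, n}

print(s)                                           # sigma_1 >= sigma_2 >= ... >= 0
print(np.allclose(U @ Sigma @ Vt, A))              # U^T A V = Sigma  <=>  A = U Sigma V^T
print(np.linalg.matrix_rank(A), int(np.sum(s > 1e-12)))  # r = rank(A) = number of nonzero sigmas
```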
