6
Mathematics and Computers in Simulation 76 (2007) 149–154 On a MIC(0) preconditioning of non-conforming mixed FEM elliptic problems S. Margenov a,, P. Minev b a Institute for Parallel Processing, Bulgarian Academy of Sciences, Sofia, Bulgaria b Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, Canada T6G 2G1 Available online 23 January 2007 Abstract In this paper we analyze a preconditioner for mixed finite element systems arising in the approximation of a second order elliptic problem with Neumann boundary conditions by triangular non-conforming elements. This problem stems from the so called projection methods for the unsteady Navier–Stokes equations and is one of the most computationally intensive parts of the method. The present study is focused on the efficient implementation of the modified incomplete LU factorization MIC(0) as a preconditioner in the PCG iterative method for the reduced Schur complement system following from the mixed formulation of the pressure Poisson problem. The presented model analysis and the related set of numerical tests illustrate well the potential of the proposed approach. © 2007 IMACS. Published by Elsevier B.V. All rights reserved. AMS Classification: 65F10; 65N30 Keywords: Non-conforming mixed FEM; Preconditioning 1. Introduction The projection methods are probably the most popular methods for solving the unsteady Navier–Stokes equations. Their popularity is due to the fact that they reduce the solution of the generalized Stokes problem to the solution of a vectorial advection-diffusion equation for the velocity and a second order elliptic problem for the pressure. In a recent paper (see [4]) it was proposed to use P 1 elements for the discretization of the advection-diffusion equation that follows from the projection approach, and first order nonconforming Crouzeix–Raviart elements for the discretization of the projection step. As a result, the projection step takes the following mixed form: u −∇p = f in ¯ Ω, ∇· u = 0 in Ω, u · n = 0 on ∂Ω, (1) where Ω is a convex polygonal domain in R 2 and f is a given smooth vector valued function. In the paper we use the CFD terminology and refer to the vector u as a velocity and p as a pressure. The weak formulation of the problem (1) reads: For a given f (L 2 (Ω)) 2 find (u,p) (V,Q), satisfying (u, v) + (p, ∇· v) = (f, v) v V, (∇· u,q) = 0 q Q (2) where V ={v (L 2 (Ω)) 2 ; ∇· v L 2 (Ω); v · n = 0 on ∂Ω} and Q ={q L 2 (Ω); Ω q dΩ = 0}. Corresponding author. E-mail addresses: [email protected] (S. Margenov), [email protected] (P. Minev). 0378-4754/$32.00 © 2007 IMACS. Published by Elsevier B.V. All rights reserved. doi:10.1016/j.matcom.2007.01.021

On a preconditioning of non-conforming mixed FEM elliptic problems

Embed Size (px)

Citation preview

Mathematics and Computers in Simulation 76 (2007) 149–154

On a MIC(0) preconditioning of non-conformingmixed FEM elliptic problems

S. Margenov a,∗, P. Minev b

a Institute for Parallel Processing, Bulgarian Academy of Sciences, Sofia, Bulgariab Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, Canada T6G 2G1

Available online 23 January 2007

Abstract

In this paper we analyze a preconditioner for mixed finite element systems arising in the approximation of a second orderelliptic problem with Neumann boundary conditions by triangular non-conforming elements. This problem stems from the so calledprojection methods for the unsteady Navier–Stokes equations and is one of the most computationally intensive parts of the method.The present study is focused on the efficient implementation of the modified incomplete LU factorization MIC(0) as a preconditionerin the PCG iterative method for the reduced Schur complement system following from the mixed formulation of the pressure Poissonproblem. The presented model analysis and the related set of numerical tests illustrate well the potential of the proposed approach.© 2007 IMACS. Published by Elsevier B.V. All rights reserved.

AMS Classification: 65F10; 65N30

Keywords: Non-conforming mixed FEM; Preconditioning

1. Introduction

The projection methods are probably the most popular methods for solving the unsteady Navier–Stokes equations.Their popularity is due to the fact that they reduce the solution of the generalized Stokes problem to the solution of avectorial advection-diffusion equation for the velocity and a second order elliptic problem for the pressure. In a recentpaper (see [4]) it was proposed to use P1 elements for the discretization of the advection-diffusion equation that followsfrom the projection approach, and first order nonconforming Crouzeix–Raviart elements for the discretization of theprojection step. As a result, the projection step takes the following mixed form:

u − ∇p = f in Ω, ∇ · u = 0 in Ω, u · n = 0 on ∂Ω, (1)

where Ω is a convex polygonal domain inR2 and f is a given smooth vector valued function. In the paper we use the CFDterminology and refer to the vector u as a velocity and p as a pressure. The weak formulation of the problem (1) reads:

For a given f ∈ (L2(Ω))2

find (u, p) ∈ (V, Q), satisfying

(u, v) + (p, ∇ · v) = (f, v) ∀v ∈ V, (∇ · u, q) = 0 ∀q ∈ Q (2)

where V = {v ∈ (L2(Ω))2; ∇ · v ∈ L2(Ω); v · n = 0 on ∂Ω} and Q = {q ∈ L2(Ω);

∫Ω

q dΩ = 0}.

∗ Corresponding author.E-mail addresses: [email protected] (S. Margenov), [email protected] (P. Minev).

0378-4754/$32.00 © 2007 IMACS. Published by Elsevier B.V. All rights reserved.doi:10.1016/j.matcom.2007.01.021

150 S. Margenov, P. Minev / Mathematics and Computers in Simulation 76 (2007) 149–154

Elliptic problems can be discretized by a variety of methods and each of them has its advantages and disadvantages.It is usually the particular application that favours one choice versus another. For CFD applications, in particular, theuse of nonconforming (discontinuous pressure) elements for the projection yields a locally divergence-free velocitywhich is a desirable feature in many occasions.

The discretization of (2) proceeds as follows. Suppose that Ω is partitioned using triangular elements. Thepartition is denoted by Th and is assumed to be quasi-uniform with a characteristic mesh-size h (for a pre-cise definition of a quasi-uniform mesh see [10], p. 106). Then the discretization of V is given by Vh ={vh ∈ (L2(Ω))

2; vh ∈ (P1(e))2, ∀e ∈ T; vh is continuous at mi,e ∈ Ω; vh(mj,e) · n = 0, ∀mj,e ∈ ∂Ω}, where mj,e is the

midpoint of the i-th side of the element e (i = 1, 2, 3). The pressure is approximated by piecewise constants on Tand the corresponding discrete space is Qh = {qh ∈ L2(Ω); qh|e = P0(e), ∀e ∈ T;

∫Ω

qh = 0}. The spaces (Vh, Qh)satisfy the inf-sup stability condition associated with saddle point problems. In general, this is not enough to guar-antee the convergence of the projection algorithm as demonstrated in [15]. However, it was shown in [4] that if theCrouzeix–Raviart elements are used to discretize the projection step (1) then the corresponding approximation for thevelocity converges to the exact solution of the original generalized Stokes problem.

The finite element discretization of the problem leads to the system

Pwh = b, (3)

where wTh = {uT

h, ph} with uTh, ph being the vectors of the velocity and pressure unknowns. The matrix P results from

a symmetric saddle point problem and therefore is symmetric and indefinite, i.e.,

P =

⎡⎢⎣

M B1

M B2

BT1 BT

2

⎤⎥⎦ . (4)

Here, M is the mass matrix corresponding to the scalar FEM space used to approximate the velocity components. Note,that M is a diagonal matrix. This follows directly from the fact, that the quadrature formula on a triangle with nodes inthe midpoints of the edges is exact for second degree polynomials.

The solution of linear systems resulting from saddle point problems has been discussed by many authors (see [3]and the references therein). For this system we can either eliminate the pressure unknowns and end up with a systemfor uh that corresponds to a divergence-free basis in Vh (see [9]) or, since the mass matrix is diagonal, we can eliminatethe velocity unknowns and derive a system for the pressure with a symmetric and positive semidefinite matrix

S = BT1 M−1B1 + BT

2 M−1B2. (5)

In this paper we discuss the second approach and derive a preconditioner for the Schur complement (5) based on themodified Cholesky factorization MIC(0).

The rest of the paper is organized as follows. In Sections 2 we present the MIC(0) algorithm. A model analysisof the proposed PCG algorithm for the considered mixed FEM problem is given in Section 3. Finally, in Section 4we present numerical results on a test problem, that illustrate the computational efficiency of the considered iterativesolution method.

2. MIC(0) factorization

Let us first recall some known facts about the modified incomplete factorization MIC(0) preconditioner applied to amatrix A. Our presentation at this point follows the setting of [6]; see also [12,13] for some more details. Let us rewritethe real N × N matrix A = (aij) in the form

A = D − L − LT, (6)

where D is the diagonal and (−L) is the strictly lower triangular part of A. Then we consider the approximatefactorization of A which has the following form:

CMIC(0)(A) = (X − L)X−1(X − L)T,

S. Margenov, P. Minev / Mathematics and Computers in Simulation 76 (2007) 149–154 151

where X = diag(x1, . . . , xN ) is a diagonal matrix determined by the condition of equal rowsums:

CMIC(0)(A)e = Ae, e = (1, . . . , 1)T ∈RN.

For the purpose of preconditioning, we are interested in the case when X > 0 and thus CMIC(0)(A) is positive definite.If this holds, we call the MIC(0) factorization stable. We have the following theorem about the stability of the MIC(0)factorization.

Theorem 1. Let A = (aij) be a symmetric real N × N m atrix and let A = D − L − Lt be the splitting (6) of A. Letus assume that

L ≥ 0, Ae ≥ 0, Ae + Lte > 0, e = (1, . . . , 1)t ∈RN,

i.e. that A is a weakly diagonally dominant matrix with non-positive off-diagonal entries and that A + Lt = D − L isstrictly diagonally dominant.

Then the relation

xi = aii −i−1∑k=1

aik

xk

N∑j=k+1

akj

gives positive values xi and the diagonal matrix X = diag(x1, . . . , xN ) defines a stable MIC(0) factorization of A.

For a complete proof the reader is referred to [12].

Remark 2. The numerical tests presented in the last section are performed using the perturbed version of theMIC(0)algorithm, where the incomplete factorization is applied to the matrix A = A + D. The diagonal perturbationD = D(ξ) = diag(d1, . . . dN ) is defined as follows:

di ={

ξaii if aii ≥ 2wi

ξ1/2aii if aii < 2wi

,

where 0 < ξ < 1 is a constant and wi = ∑j>i − aij .

Let us focus on the preconditioned conjugate gradient (PCG) solution of the reduced system for the Schur com-plement S, using the preconditioner C = CMIC(0)(S). Each PCG iteration consists of one solution of problem with thepreconditioning matrix C, one matrix vector multiplication with the original matrix S, two inner products, and threelinked vector triads of the form v:=αv + u. It is easy to show, that the matrix S remains sparse in the considered setting.Moreover, there are at most four nonzero entries in each of its rows. Therefore, the computational complexity of onePCG iteration is given by

NitPCG ≈ N(C−1v) + N(Sv) + 10N.

Then,

N(C−1v) ≈ 9N, N(Sv) ≈ 7N,

and finally

CCNitPCG ≈ 26N. (7)

Obviously, the algorithm under consideration is relatively cheap, since the solution of the preconditioning system takesonly about one third of the total cost.

3. Model analysis

This model analysis concerns mostly the structure of the Schur complement matrix S and its properties and triesto determine if it is possible to apply MIC(0) factorization for a PCG iterative solution of the reduced problem forthe pressure Ph. Before we start, we should note that it is quite difficult to impose the condition

∫Ω

qh dΩ = 0 on thespace for the pressure Qh. Therefore, the usual approach is to relax this requirement on Qh, impose a homogeneous

152 S. Margenov, P. Minev / Mathematics and Computers in Simulation 76 (2007) 149–154

Fig. 1. Schur complement four point stencil for the pressure.

Dirichlet condition to one, let say the first, of the components of the vector ph, and then impose the integral conditionabove on the so computed solution. This procedure yields a symmetric and positive definite Schur complement S. Letus consider the problem in the unit square Ω = (0, 1)2. Let us also assume that a uniform mesh is used with a meshsize parameter h = 1/n. Then, the mass matrix is easy to compute, namely

M = h2

3I.

The Schur complement is a discrete elliptic operator acting on Qh. As it was already mentioned, due to the diagonalstructure of M, the matrix S remains sparse. In fact, S could be interpreted as a stiffness matrix corresponding to aT-shaped four point stencil. The particular coefficients for the model problem are shown in Fig. 1. The conclusion fromFig. 1 is that the Schur complement is an M-matrix. Obviously, the first two conditions for a stable MIC(0) factorization(see Theorem 1) are satisfied. They are independent of the ordering of the unknowns, corresponding to S. This is notthe case with the last condition of the theorem. Here we consider two possible element numberings presented in Fig. 2.Both are natural from the FEM point of view. The first one follows the vertical columns of elements, and is denoted byTW (triangle-wise). The second variant follows the vertical columns of square cells, and is therefore denoted by SW(square-wise).

In both cases, the Schur complement S has a block-tridiagonal form, i.e.,

S = blocktridiag (Si,i−1, Si,i, Si,i+1).

The diagonal blocks Si,i are diagonal matrices for variant TW which could be very attractive from algorithmic pointof view. This could in particular be very useful for the parallel implementation of the algorithm. Unfortunately,D − L is not strictly diagonally dominant in this case, i.e., the last condition from Theorem 1 is not fulfilled.The structure of S does not look so nicely if elements’ numbering SW is applied, but the conditions for a sta-ble MIC(0) factorization are fully satisfied in this case. We have used the variant SW to implement the resultingalgorithm.

Fig. 2. Element numbering of the pressure unknowns (a) variant TW; (b) variant SW.

S. Margenov, P. Minev / Mathematics and Computers in Simulation 76 (2007) 149–154 153

Table 1CG and PCG iterations: MIC(0) factorization of S

n N CG PCG

16 512 86 1332 2048 177 1964 8192 358 28

128 32768 737 41256 131072 1487 59512 524288 3010 87

4. Numerical tests

The numerical tests presented below illustrate the PCG convergence rate when the size of the discrete problemvaries. A relative stopping criterion

(C−1rnit , rnit )

(C−1r0, r0)< ε

is used in the PCG algorithm, where ri stands for the residual at the ith iteration step, (·, ·) is the standard Euclideaninner product, and ε = 10−6. The computational domain is the unit square Ω = (0, 1)2. A uniform mesh is used, whereh = 1/n, and the size of the discrete problem is N = 2n2.

The qualitative analysis of the results given in Table 1 clearly confirms that: (a) the number of iterations for CGmethod is O(n) = O(N1/2); (b) the number of iterations for MIC(0) PCG method is O(n1/2) = O(N1/4), namely, itgrows proportionally to

√n.

5. Concluding remarks

In this paper we propose a preconditioning technique for the iterative solution of mixed finite element systemsarising in approximations of second order elliptic boundary value problems by non-conforming finite elements. Ourstudy was motivated by more general applications in CFD, where the Crouzeix–Raviart non-conforming elements areshown to be very a attractive approximation tool because it yields a velocity field which is locally divergence-free.

The considered approach is based on the elimination of the velocity unknowns in the system that results fromthe mixed FE discretization of the problem. It was shown, that the Schur complement matrix for the reduced pressureproblem preserves a strong sparsity satisfying the conditions for a stable MIC(0) factorization. The presented numericaltests well illustrate the potential of the reported method, algorithm and code, for the efficient solution of large scaleproblems.

Our further plans include a generalization to 3D problems on tetrahedral meshes. We plan also to consider a two-level modification of our algorithm based on a first step of aggregation/differentiation of the Schur complement (see,e.g., [7,14]). Depending on the aggregation strategy, we could get for the second (elliptic) diagonal block of theobtained hierarchical presentation of S: (a) a more regular structure, like the standard five point (seven point in 3Dcase) stiffness matrix, or (b) a better for parallel implementation structure, which is similar to what we have in case ofRannacher–Turek rotated quadrilateral elements [5,16].

Acknowledgments

This work was supported in part by BiS21++ EC Grant, Bulgarian NSF Grant I-1402/2004 and a Discovery Grantof NSERC.

References

[3] O. Axelsson, M. Neytcheva, Preconditioned methods for linear systems arising in constrained optimization problems, Numer. Linear AlgebraAppl. 10 (2003) 3–31.

154 S. Margenov, P. Minev / Mathematics and Computers in Simulation 76 (2007) 149–154

[4] B. Bejanov, J.-L. Guermond, P. Minev, A locally div-free projection scheme for incompressible flows based on non-conforming finite elements,Int. J. Numer. Meth. Fluids 49 (2005) 239–258.

[5] G. Bencheva, S. Margenov, Parallel incomplete factorization preconditioning of rotated linear FEM systems, J. Comput. Appl. Mech. 4 (2)(2003) 105–117.

[6] R. Blaheta, Displacement decomposition-incomplete factorization preconiditioning techniques for linear elasticity problems, Numer. LinearAlgebra Appl. 1 (1994) 107–126.

[7] R. Blaheta, S. Margenov, M. Neytcheva, Uniform estimate of the constant in the strengthened CBS inequality for anisotropic non-conformingFEM systems, Numer. Linear Algebra Appl. 11 (2004) 309–326.

[9] S. Brenner, A nonconforming multigrid method for the stationary Stokes equations, Math. Comp. 55 (1990) 411–437.[10] S. Brenner, L. Scott, The mathematical theory of finite element methods, Springer, 1994.[12] I. Gustafsson, Modified incomplete Cholesky (MIC) factorization, in: D.J. Evans (Ed.), Preconditioning Methods; Theory and Applications,

Gordon and Breach, 1984, pp. 265–293.[13] I. Gustafsson, An incomplete factorization preconditioning method based on modification of element matrices, BIT 36 (11) (1996) 86–100.[14] R. Lazarov, S. Margenov, On two-level MIC(0) preconditioning of Crouzeix–Raviart nonconforming FEM systems, Springer Lecture Notes

Comp. Sci. 2542 (2003) 192–201.[15] K.A. Mardal, X.-C. Tai, R. Winther, A robust finite element method for Darcy–Stokes flow, SIAM J. Numer. Anal. 40 (5) (2002) 1605–1631.[16] R. Rannacher, S. Turek, Simple nonconforming quadrilateral Stokes Element, Numer. Meth. PDEs 8 (2) (1992) 97–112.