Parallel, Adaptive Grid Computing of Multiphase Flows in

American Institute of Aeronautics and Astronautics

1

Parallel, Adaptive Grid Computing of Multiphase Flows in

Spacecraft Fuel Tanks

Chih-Kuang Kuan* and Jaeheon Sim

†

University of Michigan, Ann Arbor, MI, 48109

and

Wei Shyy‡

University of Michigan, Ann Arbor, MI, 48109

Hong Kong University of Science and Technology, Kowloon, Hong Kong

A parallel adaptive Eulerian-Lagrangian method is developed for effective large scale

multiphase flow computation. The intricate complexity of parallelism for the Eulerian-

Lagrangian method is discussed and spatial domain decomposition strategies are proposed.

An efficient communication approach is presented with MPI standard communication and

the performance of the parallel Eulerian-Lagrangian method is evaluated. The speedup on

problems having million of cells shows super-linear behavior due to the effect of cache hit

rate, and the parallel efficiency peaks at the trade-off point of the cache hit rate and the

computation-to-communication ratio. A first-principle-based phase change model is

introduced by computing the rate of phase change in each phase as well as across

liquid/vapor interface. The self-pressurization in a liquid hydrogen fuel tank is simulated

and it is shown that the phase change in a liquid phase has a dominant contribution

comparing phase change on the liquid/vapor surface, and the rate decreases as saturation

temperature increases. The effects of heat flux amount and liquid fuel fill-level are

investigated, and the pressurization accelerates showing longer transition period with higher

pressure rise rate as heat flux and fill level increases.

I. Introduction

he reliable and economic management of cryogenic liquids in a spacecraft including storing and transferring

becomes more critical for the long duration future space missions such as interplanetary exploration. Cryogenic

liquid is required for human life support systems and spacecraft propulsion systems, and its management determines

human survival in space and the success of the mission. Even a small heat leak from incident solar radiation,

propulsion system, aerodynamics heating, or the spacecraft structure can cause vaporization of the liquid due to

extremely low boiling temperatures and long duration missions. The vaporization results in the increase of pressure

in storage tanks which causes safety issues and design difficulties in liquid storages. The vent system releasing high

pressured vapor can result in loss of cryogenic liquids. Improving the cryogenic storage and lowering the

pressurization leads to significant cost savings through the reduction of launch mass1. Various pressure control

techniques have been proposed; passive thermal insulation, cryocooler mixing, (passive) thermodynamic vent

systems, and active thermodynamic vent systems with heat exchanger and fluid mixing techniques1, 2

. The concept

of zero boiloff (ZBO) storage of cryogenics has been developed by NASA for future space mission with duration on

the order of years1.

In order to understand and estimate the self-pressurization in cryogenic liquid storages, many theoretical,

numerical, and experimental studies have been conducted. The homogeneous model3-6

(or thermodynamic

model) and surface evaporation model have been widely used due to their mathematical simplicity. The

* Graduate Student, Department of Aerospace Engineering, University of Michigan; AIAA Student member.

† Post-doctoral Research Fellow, Department of Aerospace Engineering, University of Michigan; AIAA member.

† Post-doctoral Research Fellow, Department of Aerospace Engineering, University of Michigan; AIAA member.

‡Provost & Chair Professor, Department of Mechanical Engineering, Hong Kong University of Science and

Technology; Adjunct Professor, Department of Aerospace Engineering, University of Michigan; AIAA Fellow.

T

50th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition09 - 12 January 2012, Nashville, Tennessee

AIAA 2012-0761

Copyright © 2012 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


2

homogeneous model assumes that both liquid and vapor phases are at the thermodynamic equilibrium states

with the saturation temperature of the cryogenic fluid at the total tank pressure . This model shows lower

pressurization than experiment, and is referenced usually as the lower limit of pressure rise. The surface

evaporation model assumes that all the heat influx is used to evaporate liquid, maintaining the initial liquid

temperature. The vapor temperature is thus at the saturation temperature corresponding to the final system

pressure. This surface evaporation model is referred to as the upper limit of pressure rise. In the experimental

studies, several different variables were studied, including the effect of gravity, heat flux amount and

distribution, tank size, tank geometry, fill level, and mixing. One of the noticeable studies was conducted by

Aydelott3, 4

, who showed the rate of pressure rise is lower in reduced-gravity than normal gravity because of

the increased liquid-wetted wall area and increasing natural convection. He also mentioned the faster

increasing vapor temperatures is the difference between the homogeneous model and experiment. Poth and

Van Hook showed that a mixing jet could be used to minimize thermal stratification and reduce pressure rise

in a tank7, 8

. Many other studies showed that the difference between the homogeneous model and experiment

are much large in a larger storage and/or higher heat influx5. However, the final rate of pressure rise after

initial transient is very close to that of homogeneous model. Nevertheless, the final pressure rise cannot be

predicted correctly without appropriate initial transient studies. The numerical studies followed for deeper

understanding and overcome the experimental limitation. However, due to its complexity including flow

motions, heat transfer, and phase change, the numerical and theoretical studies are conducted in a very

limited condition. Liquid Conduction-only model9 are implemented in the beginning, and later, the lumped

vapor model were the most popular approaches by solving thermal effect in cryogenic liquid region with

assumption of homogenous condition in vapor phases6, 10

. However, the lack of agreement between these

simple mathematical/numerical approaches and experiment, and the difficulties in the experimental studies

encourage finer numerical approaches by solving full Navier-Stokes and energy equation for both phases.

Considering microgravity conditions in a typical spacecraft environment, the capillary effect is critical

due to small Bond number, the ratio of body (or gravitational) forces to surface tension forces. Thus, a study

on the multiphase flow including interfacial dynamics is required in order to understand the liquid fuel

dynamics in a spacecraft. However, experimental studies are limited because the microgravity conditions are

hard to realize on the ground. Thus, a high fidelity numerical simulation of such a multiphase flow in a space

environment is crucial to compensate the limitation of experiment. Though numerical simulation is a useful

approach, it is still challenging because of the multiple time/length scales, large variations in fluid properties,

moving boundaries, and phase changes in multiphase flow. To fully capture the physics of interfacial

dynamic, detail geometrical features of irregular interface shape has to be resolved by fine resolution. Such a

multiple length scale problem demands expensive computing resource in modeling. The varying source terms

in governing equations, such as mass transfer and surface tension, may restrict the time step for numerical

stability. Moreover, an additional difficulty arise when the jump of density across fluid interface is large, the

encounter of numerical stiffness is inevitable.

To provide a compatible computational platform with simulating large-scale multiphase flows, distributed

computation via parallel computing is preferable. Parallelized multiphase flow simulations using the volume

of fluid method is studied broadly and showed good parallel performance for varying large problems11

.

Sussman demonstrated efficient parallel algorithm based on coupled level set/volume-of-fluid method.12

. In

case of Eulerian-Lagrangian methods, its’ applicability to parallelization is complex since interactions

between Eulerian and Lagrangian data is intricate. Unlike the volume of fluid and level-set method which are

both categorized as Eulerian computations, the major challenge of Eulerian-Lagrangian methods is the

decomposition of computation on Lagrangian domain and the mapping of data in between Eulerian and

Lagrangian coordinates. Parallelism for both Eulerian and Lagrangian coordinates have to be considered.

In the present study, we have further developed the previously reported 3-D adaptive Eulerian-Lagrangian

method13-15

by parallel implementation for both continuous and sharp interface methods and energy equation

with a phase change effect. This numerical method utilizes the stationary (Eulerian) frame to resolve the flow

field, and the marker-based triangulated moving (Lagrangian) surface meshes to treat the phase boundary

interfaces. The large-deformable fluid boundaries are modeled using a continuous interface method, and the

surface tension between fluid interfaces is smeared within finite distance. The solid boundaries are treated by

a sharp interface method along with the ghost cell method by reconstructing the solution on the ghost cell

based on the known solid boundary condition. The contact line where the fluid interface meets the solid

boundary is modeled using a contact line force model, which enforces the given contact angle dynamically.

For moving contact line treatment, local slip condition is applied around the contact line. The first-principle-

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


3

based phase change model is introduced by computing the amount of a phase change in the pure material

region as well as across liquid/vapor interface. For more effective computation in large scale problem, the

adaptive Cartesian grid method and the multi-domain parallel computing technique is implemented with the

help of the MPI standard communication and METIS library. The contribution of phase change in each phase

and on the interface surface are investigated, and the effects of various heat flux amount and liquid fuel fill-

levels are followed on a liquid hydrogen fuel tank under normal gravity.

II. Eulerian-Lagrangian Methods

A. Governing equations The marker-based 3-D adaptive Eulerian-Lagrangian method is a multiphase flow algorithm which is able to

resolve interfacial dynamic with surface tension computation and phase change modeling. The bulk flow variables

are solved on the stationary (Eulerian) background grid, whereas interface variables are handled by moving

(Lagrangian) surface meshes. Figure 1 shows a brief framework of the present Eulerian-Lagrangian method, with

the detailed numerical method found in the previous works13-15

. Figure 1 also shows the schematic drawing of the

present approach for multiphase flow computation.

Non-dimensionalized continuity, momentum and energy equations for incompressible Newtonian fluid can be

written in Eqs. (1)–(3), which account for the interfacial dynamics as source terms in the governing equations;

surface tension effects of fluid interfaces as a momentum forcing term ( ), and the latent heat effects across fluid

interfaces as a energy source term ( ). In Eqs. (1) and (2), is the velocity vector, and , , and is the density,

viscosity, and pressure, respectively.

(1)

(2)

(3)

Here, variables are non-dimensionalized by a characteristic velocity ( ) and length scale ( ), standard gravity

(a) (b) (c)

Figure 1. Eulerian-Lagrangian methods. (a) Interface representation and tracking using moving meshes

(red) on the stationary Cartesian grid. (b) Solution procedures. (c) Illustration of the present numerical

approach.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


4

( ), and liquid material properties (density , viscosity , and surface tension ). The Reynolds, Froude, and

Weber numbers in Eq. (2) are defined as, , , and . In addition, the

capillary number and bond number can be defined as and ,

respectively.

Using the continuous interface method, the single set of equations for all fluid phases in the whole domain is

achieved with the help of an approximate Dirac delta function ( ) and an indicator function ( ). The approximate

Dirac delta function, originally proposed by Peskin,16

is calculated over finite thickness of four cell widths. It

provides a weighting projection for variables communication between the Eulerian and Lagrangian domains.

Besides this, an indicator function varying from zero to one smoothly across the interface has a value of 0.5 at the

interface location. The smoothed fluid properties such as density and viscosity are computed by indicator function in

Eqs. (4)-(7).

(4)

(5)

(6)

(7)

Separate indicator functions for fluid and solid interfaces are used to separate the designation of the solid

interface, modeled by a sharp interface method, from the fluid interface modeled by a continuous interface method.

The indicator function is computed using a discrete form of the Heaviside step function in Eq. (8) by integrating the

1-D form of discrete Dirac delta function. This approach is known being more general than the Poisson equation

solver method since it requires only distance information from interface, and thus gives accurate values even at the

boundaries.14

(8)

The governing equations (1)-(3) are solved using a projection method on staggered grid finite volume

formulation. The pressure and fluid properties are stored at the cell center and face-normal velocities are stored on

the Cartesian cell faces. The convection term ( ) is discretized using a 3rd order ENO scheme in space and a 2nd

order Runge-Kutta integration in time. The central difference scheme and Crank-Nicholson method is implemented

for the viscous term ( ). The discretized solution procedures are summarized in Eqs. (9)-(12). In Eq. (13),

term is zero if the phase change doesn’t occur and mass is not transferred across the interface.

(9)

(10)

(11)

(12)

The pressure Poisson equation of Eq. (13) is derived by taking divergence of Eq. (11).

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


5

(13)

B. Interface representation and tracking

In this marker-based Eulerian-Lagrangian method, the interface is represented by markers in coordination with

each other for maintaining the interface connectivity information. The geometrical structure is established via line-

segments in two-dimensional and triangles in three-dimensional domains, as represented in Figure 1(a). The marker

locations, denoted by in Lagrangian frame, are updated from the velocities at its location, , in Eq. (14).

(14)

Fluid interfaces use the computed flow solution field to obtain the marker velocities as shown in Eq. (15). In this

equation, the discrete Dirac delta function, , is employed for converting the Eulerian velocity field, ,

to Lagrangian form, . The interface velocity is exactly same to fluid velocity if there is no mass transfer in Eq.

(15). However, with phase change, the velocity component from mass transfer should be considered. On the other

hand, solid interfaces use the prescribed velocity field to advance the marker points using Eq. (14).

(15)

In order to maintain consistent computational accuracy, the spacing between marker points is rearranged by

adding or deleting markers whenever two markers come too close or too far from each other. The level-contour-

based interface reconstruction technique with connectivity information is also implemented to handle topological

changes such as merger or break-ups. The detailed numerical method can be found in the previous works15

.

C. Interfacial dynamics modeling

The surface force computation of Eq. (16), where is the surface tension and is the curvature of interface, is

applied as a source term in the momentum equation, Eq. (2).

(16)

Solid interfaces are modeled using the sharp interface method by reconstructing solution fields around an

interface to incorporate the given boundary condition on the solid interface in an Eulerian Cartesian grid. The

velocity reconstruction on the solid side is implemented via ghost cells, which are defined as solid cells having at

least one neighboring fluid cell. This approach works well even with contact lines where fluid interface meets a solid

interface. The basic idea of these two different approaches is shown in Figure 2, and the details can be found in our

previous work15

.

(a) (b)

Figure 2. Interfacial dynamics modeling. (a) Fluid interface: continuous interface method with smeared

interface within finite zone. (b) Solid interface: sharp interface method with the zero-thickness interface.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


6

When a fluid-fluid interfaces intersects a solid surface, the treatment of the tri-junction locations, called contact

lines, is required to account for the presence and interactions of all three phases, fluid-fluid-solid. One of the most

discussed issues for modeling these contact lines with Navier-Stokes equations is that the imposed no-slip condition

for velocity leads to a non-integrable singularity in stress. In the present research, the contact line force is imposed

with local slip condition to overcome this singularity issue.

D. Mass transfer due to phase change across liquid/vapor interface

Mass transfer computation is one of the key issues in the phase change process since it is directly related to the

movement of phase boundaries and the amount of latent heat in the energy transfer process. In the energy equation

of Eq. (3), the latent heat is computed by Eq. (17), where is mass transfer per unit area across interface due to

phase change and is the latent heat.

(17)

Similar to the transformation of surface tension, the latent heat of the interface due to phase change is also

transformed from a Lagrangian quantity ( ) to an Eulerian quantity ( ) via the approximate discrete Dirac delta

function, .

In the present study, the mass transfer is computed in Eq. (18) based on the Stefan condition using the

temperature gradient with discontinuous material properties for simplicity and accuracy. is the latent heat, is

thermal conductivity and q1 and q2 are heat flux vectors transporting across the liquid-vapor interface. The interface

temperature is assumed equal to the saturation temperature which is an adequate assumption in macroscopic

problems.17

(18)

The energy equation of Eq. (3) is solved using continuous treatment with smoothed material properties in Eqs.

(4)-(7), and the projection method in Eqs. (9)-(13) is applied to solve momentum equation. The mass conservation

equation in Eq. (1) is enforced by assuming when the pressure Poisson equation is solved in Eq. (13).

However, the divergence of the velocity is not zero around the interface if the phase change occurs and mass is

transferred across the interface. Shin and Juric developed the conservation of mass in Eq. (19),18

and in the present

work, the modified version of Eq. (20) is implemented for the non-conservative form of the equation.

(19)

(20)

In typical phase change models, the mass transfer occurs only across the interface between liquid fuel and vapor.

However, this assumption is not appropriate because the liquid can be heated up to higher temperature than the

saturation condition as heat is absorbed. In fact, phase change should occur if the liquid temperature is higher than

the saturation temperature. Moreover, condensation happens if vapor temperature is lower than saturation

temperature. We further define our phase change model more strictly by considering the mass flux per unit volume

due to phase change in a liquid and a vapor region, denoted as and , and as well as an interface region as

followings.

• Interface region (0<I<1): phase change occurs based on the Stefan condition

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


7

• Liquid region (I=1): phase change occurs if T > T sat

and

• Vapor region (I=0): phase change occurs if T < T sat

and

E. Thermodynamic model for spacecraft fuel tank computation

In the present study, the incompressible flow solver can be applicable to the spacecraft self-pressurization

problems by implementing Boussinesq approximation for buoyancy effect and modifying thermodynamic properties

of vapor, and saturation pressure and temperature according to the given thermodynamic conditions as phase change

occurs.

The total phase change rate from liquid to vapor can be defined as Eq. (21) in a partially filled cryogenic storage

tank.

(21)

Where, and are mass and volume, and subscript and means liquid and vapor, respectively.

Then, the vapor volume change rate, and vapor volume at a given time, , can be computed in Eq. (22) and

Eq. (23). The density of liquid, , is assumed constant.

(22)

(23)

From the total mass and volume conservation in Eq. (24) and (25), the vapor density of Eq. (26) can be obtained.

(24)

(25)

(26)

The saturation pressure and temperature can be computed from the ideal gas law with compressibility

factor, , and Clausius-Clapeyron relation in Eq. (27) and (28) .

(27)

(28)

In the spacecraft fuel tank self-pressurization problem, the temperature in a tank is very low - around 20 K.

In this low temperature environment, the material properties are not constant, and the ideal gas law is not

appropriate as depicted in Figure 3. Therefore, the properties should be modified according to the temperature.

In the present study, the real gas data related to the material properties and compressibility factor are obtained

from NIST chemistry webbook.

F. Adaptive Grid

Multiphase flow problems involve multiple length scales. In order to effectively resolve the flow features in such

cases, adaptive grids based on isotropic refinement is implemented. The cells are split into four and eight equal

sibling cells in two- and three-dimensions, respectively, to better handle regions which require higher resolution.

The grid is represented using unstructured data that connects cells through cell faces. The details of the algorithm

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


8

can be found in Singh and Shyy.13

In present study, only the static grid adaption is implemented for the parallel

Eulerian-Lagrangian method. The static grid adaption method refines the Cartesian cells around the solid and fluid

interfaces at the begging of simulation to provide finer resolution. The dynamic grid adaption involving repartition

of Eulerian domain at runtime is not included in the current parallel code.

III. Parallelization of Eulerian-Lagrangian Method

We now present the parallelization of the adaptive Eulerian-Lagrangian method. Our goal of current study is to

develop a parallel Eulerian-Lagrangian method for large problems that provides substantial computational power

with scalability, compact data structures and efficient communication methodologies. For the purpose of portability,

this parallel program is designed for distributed memory multiprocessors architecture with MPI standard for

communication. Third party libraries, PETSc19

and HYPRE are implemented for solving linear system of equations.

As addressed in previous sections, the Eulerian-Lagrangian method resolves the flow filed on a stationary

Eulerian grid with a moving Lagrangian mesh superimposed on it. Eulerian and Lagrangian variables are stored on

different spatial domains but interact with each other by projecting quantities through the Dirac delta function.

Therefore, two issues need to be considered: decomposition strategies for both the Eulerian and Lagrangian domain

and additional communication due to Eulerian-Lagrangian interaction.

Spatial domain decomposition is the most widely adopted for computational fluid dynamic methods because of

its repeated calculation on a data set originated from discretized differential equation. Using spatial domain

decomposition, the concurrent operation of each processor working on one piece of entire data can maintain load

balance easily. In case of Eulerian-Lagrangian method, which has data stored on two different coordinates, the data

parallelism framework favorable for Eulerian domain, but a suitable approach for decomposition of Lagrangian

interface should be evaluated. The parallelism adopted for Lagrangian domain should provide load balance and

minimize communication in frequency and data size as well.

The detail approaches of parallel implementation are addressed in the following sections.

A. Domain Decomposition of Eulerian domain

The partitioning methodology of Eulerian domain needs to consider the following issues: load balance,

communication cost; data dependency arising from ghost cell method. Among many approaches for spatial

decomposition of a graph, the most popular two methods are space filling curves method and heuristic method such

as partitioning libraries METIS20

. A space-filling curve traverses N-dimensional graph and map it bijectively to a

one-dimensional array. The ordered array is split into P parts which is the total number of processors. Consequently,

the load balance is satisfied but total edges cut, that is, the size of partition boundary requiring information exchange

is not an objective of space-filling curve method. On the other hand, graph partitioning packages, such as METIS

using k-way multilevel heuristic approach as partition algorithm usually provide satisfactory load balance and fewer

edges cut compared with space-filling curves method21

. Moreover, it is easy to implement and provide flexible

weighting function to manipulate preferred partition scenario. In this study METIS is implemented as the

partitioning tool.

In our Eulerian-Lagrangian method, the boundary condition of solid interface is enforced by ghost cell method.

First of all, solid boundaries are represented by immersed sharp interface that identifies the fluid and solid side. As

shown in Figure 4(a), a cell in solid side adjacent to the immersed boundary would be recognized as ghost cell to

help the boundary condition enforcement. Image points IP of a ghost cell are determined by extending the local

Figure 3. Low temperature effects on the compressibility factor of hydrogen with saturation temperature.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


9

normal of solid boundary into fluid side in a fixed distance. The variables at an image point is obtained by

interpolation method using information from surrounding fluid cells, which are denoted with dash line. Then ghost

cell velocity is calculated by linear extrapolation from the known values at image point and velocity at solid point

SP, which is zero for no-slip boundary condition. Pressure and temperature at ghost cells are obtained as the same

manner. As a result, a ghost cell has geometrical dependency to its corresponding fluid cells. Such one-to-multiple

dependency of ghost cells on solid boundary results in ordering difficulties if using space-filling curve, but no hurdle

whenever METIS is applied. In addition, a partition methodology that supports additional constrain for cell

workload can help even distribution of ghost cells on requested processors, which decreases possibilities of

processor idling. Vertex weighting of METIS provides this functionality for users to adjust flexible workload.

The procedure of decomposition Eulerian domain starts with the creating of background grid and identification

of ghost and fluid cells by computing indicator functions. By providing connectivity information of a graph to

METIS with appropriate weighting information, an index array is returned with an integer indicating in which

processor a cell is placed. Figure 4(b) shows a problem with a circle representing solid interface situated in the

center of Cartesian grid is decomposed into 4 partitions. Each partition acquires 3 layers of cells from neighbor

partitions to be the overlapping zone. As shown in Figure 4(c), a sub-domain occupied by a processor, is then

defined by the combination of a processor’s partition and overlapping zone. The overlapping zone serves as a buffer

region for synchronized of information.

B. Domain decomposition of Lagrangian interface

The parallelisms for Lagrangian interface considered are task parallelism, atomic decomposition, and spatial

decomposition. An example in Figure 5 illustrates how a problem having Lagrangian markers denoted in circle

superposing on the Eulerian Cartesian grid could be decomposed with 4 processors.

The first feasible parallelism is task parallelism which is reserving specific processors to dealing with

Lagrangian data exclusively and using other processors for calculation of Eulerian data. This approach takes

advantage of the concurrency of Eulerian and Lagrangian tasks to accelerate computation. One of the difficulties is

the optimal arrangement of fair work load among Eulerian and Lagrangian tasks for general problems. The ratio of

workload between Eulerian and Lagrangian domain is very problem-dependent and varies with the evolution of the

interface. Complex management is required to avoid processes idling due to racing among tasks. Moreover, because

of dependency between Eulerian and Lagrangian variables, data exchange under task parallelism intensely relies on

all to one and one to all collective communication that may lead to communication overhead.

On the other hand, data parallelism distributes the entire data over processors to be processed concurrently.

Under the data parallelism, one of the choice to decompose Lagrangian interface is atomic decomposition22

as

illustrated in Figure 5(c). Atomic decomposition evenly distributes Lagrangian markers to all processors regardless

their locality. It achieves load balance automatically. However, frequent communication for updating interface

information between processors due to low locality is inevitable. Thus deterioration of parallel efficiency is expected

arising from communication overhead.

Unlike atomic decomposition, spatial decomposition spreads entire Lagrangian interface into sub-domains on the

basis of spatial inhabitation of markers. In Figure 5(d), Lagrangian markers encompassed by a sub-domain owned

(a) (b) (c)

Figure 4. The Eulerian sub-domain with sharp interface method; (a) Ghost cell method using

interpolation of image point to impose boundary condition (b) A circular solid interface located inside

two-dimensional domain partitioned by 4 processors (c) Definition of overlapping zone for a sub-

domain

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


10

by a particular processor should be assigned to this processor. This approach takes the advantage of data locality that

interface information is available in each processor, and both communication frequency and data exchanged can be

minimized. The load balance of Lagrangian computation may not be satisfied but reasonable scalability is expected

in general cases since the cost of pure calculation of Lagrangian data is much smaller than cost of flow solver.

Besides, the methodology of spatial decomposition of the Lagrangian domain is similar to Eulerian domain that

makes the implementation straightforward. As a result, the spatial decomposition is considered as the partitioning

approach for the Lagrangian interface in this study.

The procedure of Lagrangian domain decomposition is summarized as following. A marker locality array is used

to account marker inhabitation by providing a primary processor that situates the marker. As shown in Figure 6,

localities of the solid and dashed markers inside a particular Eulerian sub-domain are assigned with this primary

processor ID. The collection of partition-own markers, represented by solid circle, is the sub-set of local Lagrangian

sub-domains that operations looping on, and dashed markers in overlapping zone are served as buffers whose

information is updated by neighboring processors at the end of each time step. Note that the global Lagrangian

interface connectivity will not be discarded but preserved for repartitioning. Repartitioning is necessary when a

marker moves out of its Eulerian sub-domain. In addition, any geometrical operation such as coarsening, refining,

and smoothing on Lagrangian interface affecting global connectivity requires repartitioning to keep consistent

connectivity. In such case the data locality has to be updated and Lagrangian markers repartitioned

C. Communication methodology Data in an overlapping zone of a sub-domain needs to be synchronized with neighbor sub-domains for data

consistency. For point-to-point communication, i.e., a sub-domain synchronizing data in overlapping zone with

neighbor partitions, the non-blocking send and receive are used to avoid blocking delay. The sent-out data of a sub-

domain is collected and concatenated into an array on the sequence add-up manner. Figure 7(a) illustrates how the

data in overlapping zone is grouped in a chunk of continuous memory. This problem is executed with 4 processors

and the partitioned domain of processor 1 is adjacent to processor 0, 2 and 3 with 3, 4, and 5 entries in overlapping

zone respectively. A temporary array is then created by concatenation of prepared data. The pointer of this array is

passed to the subroutine, MPI_ISEND to initialize communication process. The procedure of receiving data from

neighbor partitions has similar approach by allocating one-dimensional space beforehand for arriving data. An

Figure 6. Lagrangian markers inhabiting in a sub-domain

(a) (b) (c) (d)

Figure 5. Strategies of decomposition of Lagrangian interface (a) 2D Lagrangian mesh on Eulerian

domain (b) task parallelism (c) atomic decomposition (d) spatial decomposition

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


11

MPI_WAIT is placed in the end of initialization of send and receive subroutines to check if procedures are

completed correctly. The pseudo code of the communication procedure is shown in Figure 7(b). This design allows a

processor finishing synchronization earlier to work on following operations earlier. Such design provides a sort of

overlapping between communication and computation that improves parallel efficiency.

The performance of our methodology is evaluated with a one million cell problem. The problem is executed on

the NYX machine in the Center of Advance Computing at University of Michigan. The nodes we used on NYX are

comprised with Intel Nehalem Xeon E5540 CPU with InfiniBand networking whose latency is about 10-6

second.

Each Intel Xeon E5540 CPU has 4 cores with 8M Bytes L3 shared cache and total memory available per CPU is

12G Bytes.

The averaged data size of velocity exchanged in a process of communication is from 108K to 41 Bytes on 4 to

96 processors. The start-up time represents the preparation and calling of MPI_ISEND and MPI_IRECV procedures,

and the time to finish synchronization stands for time staying on MPI_WAITALL. The cost of one velocity

synchronization is on the order of 2*10-5

second in spite of number of processers used. Our communication strategy

has been tested and trivial latency is observed. This communication design is applied for both Eulerian domain and

Lagrangian domain communication in present work.

D. Solving Procedure

We summarize the procedures of parallel Eulerian-Lagrangian method in Figure 8. Note that those operations in

italic involving both Eulerian and Lagrangian variables. First of all, the initialization of simulation creates a

Cartesian grid to encompass solid interface and refine it adaptively to designated resolution with the identification of

ghost cells. The Eulerian domain is then partitioned by METIS, and the Lagrangian domain is spatially decomposed

based on marker’s locality. In the time integration loop, a marker is updated to new position by a velocity field

interpolated from the Eulerian domain and interface shape will be refined if needed. The updated interface renders a

new distribution of fluid properties that inter-processors communication is needed to retain properties such as

consistency at overlapping zone of sub-domains. Two source terms, surface tension and mass transfer across fluid

interface distributes their impact to cells at a fixed distance. The velocity field is updated with Runge-Kutta Crank-

Nicolson method by projection method. PETSc and HYPRE are both implemented as the linear solver for the

discretized Poisson equation. Dynamic mesh refinement, which is a complex but computationally efficient approach

Table 1. Communication cost for velocity filed in 1 millions cells problem

# of processors 4 8 16 64 96

Start-up time(s) 3.0*10-5

2.4~10-6

2.0*10-5

2.1*10-5

2.1*10-5

Time to finish synchronization(s) 9.5*10-6

9.7*10-6

9.5*10-6

1.0*10-5

1.1*10-5

Averaged data size (K Bytes) 108 105 84 50 40.5

Do : index0 = 1, total number of processors

IF NOT neighbor partition, CYCLE

recv recv_data(index0); source = index0

tag1 = processor_id

CALL MPI_IRECV(recv, # of receive, type, source , tag1, comm., reqs)

send send_data(index0)

tag2 = processor_id; dest. = sent entry index

CALL MPI_ISEND(send, # of send , type, dest. , tag2, comm, reqs)

CALL MPI_WAITALL(reqs, stat, comm)

(a) (b)

Figure 7. (a) An example of concatenated array of data in overlapping zone of processor ID 0 on 4 processors

computation (b) Pseudo code of non-block receiving/sending communication

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


12

is not addressed in the present study but will be implemented in the near future. Once the divergence-free velocity

filed is obtained on Eulerian domain, it is projected to Lagrangian markers by Dirac delta function.

IV. Validation and Performance of Parallel Implementation

A. Validation

Here we validate the parallel Eulerian-Lagrangian method by comparing solutions from serial and parallel

computation: the lid-driven cavity flow, the uniform flow past a cylinder with sharp interface method, and a natural

convection problem to verify the energy equation. The three tests are computed with ENO scheme around solid

boundary and second order central difference scheme in the rest region.

The first case is lid-driven cavity flow at Reynolds number 1000 with grid size 80 by 80. The streamline of at

steady state together with colored sub-domain mesh from 16 processors computation is given in Figure 9(a). The

overlapping zone between sub-domains having three layers of cell width is shaded to highlight overlapping.

Computational results confirm the solution produced by parallel code is identical with solution from serial code in

an error range of 10-7

, which is the convergence criterion of the pressure Poisson equation. The velocity profile

shown in Figure 9(b) and agrees with Ghia’s benchmark solution.

The second test is the uniform flow past a cylinder at Reynolds number 40 and 200 with 5 layer of static mesh

adaption around solid interface on 32 processors. At Reynolds number 40, the computed drag coefficient is 1.52 and

length of circulation zone is 2.23 which are same as result from serial code. The time-evaluation of drag and lift

coefficient at Re = 200 are plotted for parallel and serial computation in Figure 10(b). Drag and lift coefficient

computed by the parallel solver is marked with thick black line and the solution form the serial solver in green and

red lines. It is clear that parallel code produce numerical identical solutions to the serial code. The solution of

parallel code agrees with solution of serial code up to 9 digits, which is the convergence criterion of the Poisson

equation.

The natural convection in a square cavity at Rayleigh number 105 is selected as the validation test for examining

the parallel implementation of energy equation. Dirichlet temperature boundary conditions are applied at left and

right wall and insulated wall boundary conditions for top and bottom wall of the cavity. The grid size is 80 by 80

with 16 processors used in this simulation. The sharp interface method is used as the enforcement of the boundary

condition of solid interface. The temperature contour and plots of temperature profile along horizontal center and

vertical center are presented in Figure 11. The solution of the parallel computation agrees with the serial

computation exactly. These three tests demonstrate of correct implementation of the parallel Eulerian-Lagrangian

method.

Initialization: Eulerian domain decomposition(METIS)

Lagrangian domain decomposition

Time integration loop :

Marker movement

Interface Restruction: Smooth, refine, coarsen

Determine material and cell properties

Source term: Surface tension

Mass transfer/ Heat flux at fluid interface

Intermediate velocity U* by RK-CN integration

Pressure Poisson equation using HYPRE/PETSc

Corrected velocity Un+1

Energy equation by RK-CN integration

Dynamic adaptive mesh refinement

Eulerian domain decomposition(ParaMetis)

Lagrangian mesh decomposition

data reallocation

Velocity interpolation to Lagrangian markers

Reconstruction of Lagrangian interface

Figure 8. The solving procedure of parallel Eulerian-Lagrangian method

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


13

(a) (b)

Figure 11 Natural convection in square cavity at Rayleigh number 105 on 16 processors; (a) Temperature

contour of natural convection (b) Temperature along vertical center and horizontal center of the square

cavity

(a) (b)

Figure 10 Uniform flow past circular cylinder 32 processors (a) Stream line plot at Re = 40; The drag

coefficient is 1.52 and the length of circulation zone is 2.24 (b) Lift and drag coefficient of uniform flow

past circular cylinder at Re = 200

(a) (b)

Figure 9 Lid-driven cavity flow at Re = 1000; (a) Stream line on 16 processors; The shadow of edges

represents overlapping zone between sub-domains. (b) Velocity profile of U and V along the central

vertical line and central horizontal line of cavity; grid size 128*128 on 32 processors at.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


14

B. Parallel performance

We evaluate parallel performance by simulating the lid-driven cavity flow with three problem size: problem 1

has 1.6x105 cells; problem 2 has 1 million cells, and problem 3 has 4 million cells. This experiment is executed on

the NYX machine. Its specification has been addressed in the preceding section.

The linear solver library used in this test is PETSc’ conjugate gradient method with Jacobi preconditioner. Since

the graph of a linear system of discretized Poisson equation is changed with the number of partitions, the iteration

number reaching a constant rounding-error varies with the number of processors used. In order to control the

workload spent on solving a linear system on varying number of processors used in a computation, executing time is

measured under the basis of the fixed iteration number of the linear solver.

It is observed that up to 90% of cost of a time step is spent on solving the linear system of discretized Poisson

equation. As a result, overall parallel performance is dominated by the efficiency of linear solver, which is

controlled by two effects: the Computation-to-communication ratio and shared cache size per CPU. For a fixed-size

problem, increasing the number of processor decreases computation-to-communication ratio that results in the

slowdown of speedup. In case of shared cache effect, its impact to parallel performance is related to the size of

working data in solving a linear system. A typical parallel program assigns a fraction of the entire data to each

processor. It occurs that the entire data set does not fit into the cache on a single processor execution but a parallel

computations executing on the same problem with multiple processors may have fraction of entire data assigned to

each processor fits in its local shared cache. In case of a computation having decomposed working data fully cached,

it leads to higher hit rate and produces super-linear speedup.

The executing time and speedup is shown in Figure 12. In problem 1, the speedup increases gradually with

increasing number of processor but slowdown on 16 processors due to decreasing computation-to-communication

ratio. In problem 2, super-linear speedup due to higher cache hit rate is observed in between 64 to 108 processors

and the slowdown of speedup due to decreasing computation-to-communication ratio on 140 processors. Problem 3

has 4 millions cells such that the working data is way too large with respect to the 8M-Bytes L3 cache on serial

execution, and results in significant low cache hit rate that overestimates the speedup of multiple-CPU computation.

The parallel efficiency gradually ascends to 1.2 on 240 processors (60 CPUs). Deterioration of the efficiency arising

from the decreasing computation-to-communication ratio is not observed in the scope of 240 processors.

Parallel efficiency on the basis of number of cells per processor is given in Figure 13. In general, the efficiency

reaches peak roughly at 1.5x104 cells per processor and deterioration due to decreasing computation-to-

communication ratio is around 104 cells per processor. It suggests that parallel efficiency doesn’t benefit from

further decomposition of entire data. On the other hand, relative low efficiency at larger number of cells per

processor is observed because of lower cache hit rate. Not until the fraction of working data can be cached is the

highest efficiency reached. As a result, it is said the 8M-Bytes shared cache affects the behaviors parallel efficiency

at larger problems. Moreover, for working data that can be fully cached on serial execution, the cache effect on

parallel efficiency will not be seen as the case of 0.16 million cells problem. The overall performance of the

Eulerian-Lagrangian method demonstrates outstanding efficiency for problems in the scope of several millions of

cells on Intel Nehalem Xeon architecture.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


15

Figure 13. Parallel efficiency with respect to number of cells per processor

P

1:

1.6

x10

5

P

2:

1x1

06

P

3:

4x1

06

Figure 12. Executing time and speedup as a function of number of processors for various problem size

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


16

(a) (b)

Figure 14. The numerical configuration. (a) Fuel tank geometry. (b) Axisymmetric computational domain

with uniform tank-wall heat distributions.

V. Self-pressurization in a Spacecraft Cryogenic Fuel Tank

The self-pressurization in a cylindrical liquid hydrogen fuel tank is considered. Figure 14(a) shows the geometric

configuration of the test case. The ground test at normal gravity is assumed since the interface can be more distorted

and the liquid position and vapor bubble distributions are usually unknown under microgravity condition. The

axisymmetric numerical computation in Figure 14(b) is conducted due to its effective computation in order to

establish a guideline on the amount of self-pressurization at a given heat leak.

First, qH = 1.0 W/m2 case is simulated with uniform heating distribution at 50% fill level in order to understand

the self-pressurization mechanism in a cryogenic fuel tank. For studying the influence of heat amount, four different

uniform heating cases (qH=0.1, 0.5, 1.0, and 2.0 W/m2) are simulated with 50% fill level. Investigation of the liquid

fill-level effect is followed for uniform heating case with three different fill-levels of 25%, 50% and 75%. The total

heat power input is QT=4.71mW for qH=0.1 W/m2 case, and intial temperature is same to saturation temperature for

all cases. The used material properties at T=20.369 K and non-dimensional parameters are summarized in the Table

2 and Table 3.

Table 3. Reference scales and non-dimensional parameters.

Reference case ( , )

Reference length scale, [m] 0.05

Laplace number,

3.88×10

7

Bond number,

893

Prandtl number,

1.033

Rayleigh number,

2.90×10

8

Table 2. Material properties at T=20.369 K.

Vapor hydrogen Liquid hydrogen

Density, [kg/m3] 1.3322 70.850

Thermal conductivity, k [J/m·s·K] 0.017085 0.10382

Dynamic viscosity, [kg/m·s] 1.0786×10-6

1.3320×10-5

Specific heat, Cp [J/kg·K] 1.2036×104 9.772×10

3

Thermal diffusivity, [m2/s] 2.56×10

-6 1.82×10

-7

Thermal expansion coefficient, [K-1

] 0.04909 0.0168

Surface tension, [N/m] 1.9452×10-3

Latent heat, [J/kg] 4.456×105

Normal boiling temperature, [K] 20.369

Normal saturation pressure, [Pa] 1.01400×105

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


17

Uniform heating case with heat flux qH = 1.0 W/m2 at 50% fill level is first tested in order to understand

the self-pressurization mechanism in a cryogenic fuel tank. The grid convergence test in Figure 15 shows

almost identical results for saturation pressure computation when the grid is finer than 3200cells with

adaptive meshes. From the temperature contour and velocity vector plot, we may need finer grid in order to

resolve the flow fields up to desirable resolution, but in the present study, 3200-7200 cells are used for

computation of self-pressurization with the help of adaptive mesh refinement. Figure 16(a) shows the

saturation pressure change, which increases with time, but the rise rate decreases and almost constant after

transient time. The implemented first principle-based phase change model is supported in Figure 16(b), where

the rate of phase change for each phases and on the interface surface are depicted. It is shown that the phase

change in liquid phase has dominant contribution comparing phase change on the interface surface. As

expected, the phase change in vapor region is negligible since the vapor temperature is higher than saturation

temperature. The rate of phase change in liquid phase is dominant, but decreases in time as the saturation

temperature rises and the heat is used to warm up the liquid to the saturation temperature before vaporization

and to heat up the vapor on the top surface. The increasing saturation temperature and following thermal

stratification in the vapor region is depicted in Figure 16.

(a) (b)

(c) (d)

Figure 16. Self-pressurization results with uniform qH=1.0W/m2 at 50% fill level. (a) The saturation

pressure change. (b) The rate of phase change for each phases and interface surface. (c) Saturation

temperature change. (d) Temperature profile along center line of the cylindrical tank.

(a) (b)

Figure 15. A sensitivity study on the spatial resolution. (a) grid convergence test. (b) Temperature contour

with velocity vector.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


18

A. Influence of heat flux amount

The effects of heat flux amount are studied with the same 50% fill level and four different heat flux amount; 0.1,

0.5, 1.0, 2.0 W/m2, where the total heat power is 4.71 mW for 0.1 W/m

2 heat flux case. In Figure 17(a), the

saturation pressure increases in time as vaporization occurs due to heat influx from the tank walls. The saturation

pressure increases more rapidly with increasing heat flux amount as expected and previous researches addressed.

The pressure rise rate is plotted in Figure 17(b), where the smallest heat flux case with 0.1 W/m2 shows almost

constant pressure rise rate after short transient period due to weak natural convection and almost constant saturation

temperature. The larger heat amount shows longer transient period with steeper initial pressure rise, but the rate

decreases as time goes. In Figure 17(c), the pressure rise rates are normalized by the heat flux amount, and they are

not perfectly proportional to the heat amount. Except the smallest heat flux case, the pressure rise rate is high in the

beginning, and then decreases. It can be noticed that the higher heat flux amount produces less pressure increase per

heat flux amount after transient period. This can be explained in Figure 18. With higher heat influx, the saturation

temperature increases faster, and the absorbed heat is utilized to warm up the liquid before vaporization since liquid

vaporization occur when the liquid temperature reaches saturation temperature. It is also shown that the thermal

stratification in vapor region increases as heat flux amount increases, and thus, the heat absorbed is used for

increasing vapor temperature instead of vaporizing liquid.

(a) (b)

Figure 18. The influence of heat flux amount on the saturation temperature. (a) Saturation temperature

(b) temperature profile along storage center line.

(a) (b) (c)

Figure 17. The influence of heat flux amount on the saturation pressure. (a) Saturation pressure (b)

pressure rise rate (c) pressure rise rate normalized by heat flux amount.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


19

B. Effect of liquid fuel fill-level

The influence of the liquid fill-levels is investigated with three different levels; 25%, 50%, and 75%. The

higher fill level shows higher saturation pressure in Figure 19(a). The pressure rise rate of Figure 19(b) shows

that 75% fill-level has dramatic rise rate in the beginning due to faster heat transport from the top surfaces

and larger wet surface area where heat is absorbed directly into liquid. Furthermore, the small volume ratio of

a vapor region results in faster pressurization since even the same amount of phase change makes higher

density and pressure rise of vapor in a smaller vapor volume. In lower fill -level, the absorbed heat is used to

increase vapor temperature instead of vaporization of liquid. From the vapor temperature history of Figure

19(d), we can find that more heat remains in the vapor region for lower fill level. Therefore, the pressure will

increase with increasing fill level as it is in the present study. However, such a huge difference in the pressure

rise rate diminishes after transient time.

In the present study, higher fill level shows higher pressure rise rate. However, many other previous studies

showed quite different results in the influence of the liquid fill level. The experimental studies in an oblate

spheroidal storage, the pressure rise rate at 29% is higher than 83%, and 49% shows the lowest pressure rise3, 4

. On

the other hand, the lower fill level shows higher pressure rise rate in the homogeneous model6. The lumped vapor

model shows the same trends with homogeneous model, but the trend are the opposite in the beginning6. The

homogeneous model assumes homogeneous condition on the whole storage, and the lumped vapor model assumes

homogeneous condition for vapor region, and these assumptions may cause these difference. Moreover, in the

present test, the cylindrical storage tank is tested unlike the oblate spheroidal tank in other studies. The different fill

level in the oblate spheroidal tank has large influence on the liquid surface area and vapor volume ratio. Thus, in

order to understand the effect of fill level correctly and explain these disagreements, more parameter studies are

needed including tank shapes, tank sizes, the liquid surface area, wet surface area and the vapor volume ratio.

(a) (b)

(c) (d)

Figure 19. The effects of liquid hydrogen fill-level for fuel tank pressurization. (a) The saturation

pressure in a fuel tank. (b) The saturation pressure rise rates. (c) Saturation temperature. (d)The vapor

temperature history at 95% fill level.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


20

VI. Summary and Conclusions

In this paper, a 3-D adaptive Eulerian-Lagrangian method is further developed with multi-domain parallel

computation technique for the large scale multiphase flow computation. Spatial domain decomposition parallelism is

adopted for both Eulerian and Lagrangian domain. The multi-domain parallel computation presents the capability

for various large scale problems including arbitrary solid boundary and thermal effects, and the parallel performance

of the current Eulerian-Lagrangian method shows remarkable speedup for problems in the scope of several millions

cells on Intel Nehalem Xeon architecture. The highlights of parallel computing of the Eulerian-Lagrangian method

are addressed as following.

(1) The interactions between Eulerian and Lagrangian domain results in difficulties of problem decomposition,

especially communications complexities between both domains. With the spatial domain decomposition for

Lagrangian markers based on the their inhabitation to an Eulerian sub-domain, communication cost over

processors could be minimized,

(2) The communication methodology reported in the current study shows that latency of a synchronization

process is weakly dependent on the local data size of a processor,

(3) Experiments show monotonically ascending speedup until the problem is over-partitioned, Parallel

efficiency reaches the peak around 1.5x104 cells per processor, which is a trade-off between cache hit rate

and communication latency.

The first principle-based phase change model is introduced by computing the rate of phase change in each phase

as well as across the liquid/vapor interface. Specifically, the following observations can be made in the fuel tank

self-pressurization case:

(1) The rate of phase change in a liquid phase is dominant, and should be considered with the phase change on

the liquid surface in the fuel tank self-pressurization. The rate of phase change decreases with increasing

saturation temperature,

(2) With increasing heat flux amount, the pressure rise rate and thermal stratification in vapor region also

increase and larger heat amount shows longer transition with steep initial pressure rise,

(1) Higher liquid fill level shows higher pressure rise rate due to smaller vapor region and larger wet surface

area, but the rate becomes similar after transient time with steep pressure rise. The liquid fill level

determines various important parameters such as the liquid volume, wet area, liquid surface area and their

ratios according to its geometric tank shape, and more parameter study is required to understand the

influence of liquid fill level correctly.

Acknowledgments

The work reported in this paper has been partially supported by NASA Constellation University Institutes

Program (CUIP), Claudia Meyer and Jeff Rybak program managers. We have benefited from the discussion with

John Peugeot and Jeff West of NASA Marshall Space Flight Center.

References 1Hastings, L.J., Plachta, D.W., Salerno, L., and Kittel, P., "An overview of NASA efforts on zero boiloff storage of cryogenic

propellants," Cryogenics, Vol. 41, No. 11-12, 2001, pp. 833 - 839. 2Lin, C., Dresar, N.V., and Hasan, M., "Pressure Control Analysis of Cryogenic Storage Systems," Journal of Propulsion and

Power, Vol. 20, No. 3, 2004, pp. 480-485. 3Aydelott, J.C., "Effect of gravity on self-pressurization of spherical liquid-hydrogen tankage," NASA-TN-D-4286, 1967. 4Aydelott, J.C. and Spuckler, C.M., "Effect of size on normal-gravity self- pressurization of spherical liquid hydrogen

tankage," NASA-TN-D-5196, 1969. 5Hasan, M.M., Lin, C.S., and Van Dresar, N.T., "Self-pressurization of a flightweight liquid hydrogen storage tank subjected

to low heat flux," NASA-TM-103804, 1991. 6Barsi, S. and Kassemi, M., "Numerical and experimental comparisons of the self-pressurization behavior of an LH2 tank in

normal gravity," Cryogenics, Vol. 48, No. 3-4, 2008, pp. 122 - 129. 7Poth, L.J. and van Hook, J.R., "Control of the Thermodynamic State of Space-Stored Cryogens by Jet Mixing," Journal of

Spacecraft and Rockets, Vol. 9, No. 5, 1972, pp. 332-336. 8Aydelott, J.C., "Modeling of space vehicle propellant mixing," NASA-TP-2107, 1983. 9Hochstein, J.I.J.H.-C.A.J.C., "Prediction of self-pressurization rate of cryogenic propellant tankage," Journal of Propulsion

and Power, Vol. 6, 1990, pp. 0748-4658.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1


21

10Panzarella, C.H. and Kassemi, M., "On the validity of purely thermodynamic descriptions of two-phase cryogenic fluid

storage," Journal of Fluid Mechanics, Vol. 484, 2003, pp. 41-68. 11Agbaglah, G., Delaux, S.a., Fuster, D., Hoepffner, J.e., Josserand, C., Popinet, S.h., Ray, P., Scardovelli, R., and Zaleski,

S.h., "Parallel simulation of multiphase flows using octree adaptivity and the volume-of-fluid method," Comptes Rendus

M?anique, Vol. 339, No. 2-3, pp. 194-207. 12Sussman, M., "A parallelized, adaptive algorithm for multiphase flows in general geometries," Computers & Structures,

Vol. 83, No. 6-7, 2005, pp. 435 - 444. 13Singh, R. and Shyy, W., "Three-dimensional adaptive Cartesian grid method with conservative interface restructuring and

reconstruction," Journal of Computational Physics, Vol. 224, No. 1, 2007, pp. 150-167. 14Uzgoren, E., Sim, J., and Shyy, W., "Marker-based, 3-D adaptive Cartesian grid method for multiphase flow around

irregular geometries," Communications in Computational Physics, Vol. 5, No. 1, 2009, pp. 1-41. 15Sim, J. and Shyy, W., "Interfacial flow computations using adaptive Eulerian-Lagrangian method for spacecraft

applications," International Journal for Numerical Methods in Fluids, 2010. 16Peskin, C.S., "The immersed boundary method," Acta Numerica, Vol. 11, 2003, pp. 479-517. 17Juric, D. and Tryggvason, G., "Computations of boiling flows," International Journal of Multiphase Flow, Vol. 24, No. 3,

1998, pp. 387-410. 18Shin, S. and Juric, D., "Modeling three-dimensional multiphase flow using a level contour reconstruction method for front

tracking without connectivity," Journal of Computational Physics, Vol. 180, No. 2, 2002, pp. 427-470. 19Balay, S., Brown, J., Buschelman, K., Gropp, W.D., Kaushik, D., Knepley, M.G., McInnes, L.C., Smith, B.F., and Zhang,

H., PETSc Web page. 20Karypis, G. and Kumar, V., MeTis: Unstructured Graph Partitioning and Sparse Matrix Ordering System, Version 4.0.

2009, \url{http://www.cs.umn.edu/~metis}. 21Schamberger, S. and Wierum, J.-M., Graph Partitioning in Scientific Simulations: Multilevel Schemes versus Space-Filling

Curves, in Parallel Computing Technologies, V. Malyshkin, Editor. Springer Berlin / Heidelberg. 2003, p. 165-179. 22Plimpton, S., "Fast parallel algorithms for short-range molecular dynamics," J. Comput. Phys., Vol. 117, 1995, pp. 1--19.

Dow

nloa

ded

by U

NIV

ER

SIT

Y O

F M

ICH

IGA

N o

n A

pril

3, 2

013

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/6

.201

2-76

1

http://www.cs.umn.edu/~metis%7d

Documents

Parallel, Adaptive Grid Computing of Multiphase Flows in