web.fe.up.ptfpinho/pdfs/msc_thesis_npimentel_vfinal.pdf · direct numerical simulation of turbulent...

Direct Numerical Simulation of Turbulent Planar Jets with

Polymer Additives

Nuno Filipe Cabral Pimentel

Thesis to obtain the Master of Science Degree in

Aerospace Engineering

Supervisor: Prof. Carlos Frederico Neves Bettencourt da Silva

Examination Committee

Chairperson: Prof. Filipe Szolnoky Ramos Pinto CunhaSupervisor: Prof. Carlos Frederico Neves Bettencourt da Silva

Member of the Committee: Prof. José Manuel da Silva Chaves Ribeiro Pereira

November 2018

”Nothing in this world can take the place of persistence. Talent will not; nothing is more common than

unsuccessful people with talent. Genius will not; unrewarded genius is almost a proverb. Education will

not; the world is full of educated failures. Persistence and determination alone are omnipotent.”

- Calvin Coolidge

iii

Acknowledgments

Firstly, I would like to start thanking Professor Carlos Silva for giving me the opportunity to work in this

area. Also, for his availability to explain and teach many interesting things in the area, along with his

advices during the thesis.

I would like to thank Mateus Guimaraes, for the data post-processing and obtaining the first results

with the the algorithm developed within this work.

I would like to give a special thanks to my IST colleagues and friends Hugo Abreu and Afonso Ghira

for introducing me to Professor Carlos Silva and for the meaningful discussions throughout the thesis

development, which were crucial to clarify many things.

I would also like to give a special thanks to Millennium BCP, namely Dr. Antonio Carreira, for providing

a crucial time off work required to develop this work and supporting me throughout this thesis.

I would like to thank my friends and colleagues Bruno Conceicao, Diogo Antunes, Ricardo Verıssimo

and Pedro Abracos for their continuous support and motivation.

Finally, I would like to thank my family for supporting me throughout all my IST course, for their

understanding and motivation. This work is devoted to them.

v

Resumo

Neste trabalho foram efetuadas simulacoes numericas diretas (DNS) do jato espacial plano turbulento

com moleculas polimericas para estudar o mecanismo de interacao do polımero no avanco do es-

coamento do jato. Estas simulacoes foram conseguidas com a implemtancao do modelo numerico

de fluido visco-elastico, representado pelo modelo constitutivo reologico FENE-P. O modelo numerico

para a equacao de transporte do tensor de conformacao foi adaptado as condicoes de fronteira nao

periodicas do escoamento, nao estando presente na literatura referencias ao caso em estudo. Na

concretizacao deste, foi tido em conta o desempenho computacional da simulacao, para o qual se

implementou um mecanismo de celulas fantasma por forma a reduzir o tempo de computacao. O

algoritmo desenvolvido foi alvo de uma extensa verificacao numerica, de forma a garantir a resolucao

correta das equacoes governantes para o escoamento com moleculas de polımero diluıdas. O algoritmo

e testado em simulacoes numericas diretas com variacao das caracterısticas fısicas das moleculas do

polımero, nomeadamente a concentracao polimerica e tempo de relaxacao. Os resultados permitiram

observer um decrescimo da espessura do jato com a presenca de fluido visco-elastico, assim como

uma reducao da dissipacao da energia viscosa. De notar que o algoritmo implementado representa um

enorme progresso nas simulacoes numericas de escoamento viscoelastico turbulento de jatos espaci-

ais, fornecendo os primeiros resultados numericos em estudos do genero.

Palavras-chave: Turbulencia visco-elastica, DNS, FENE-P, Jato plano turbulento espacial

vii

Abstract

In this study it was performed Direct Numerical Simulations (DNS) of a spatial turbulent planar jet with

polymer additives to further understand the mechanism of polymer interaction on jet flows. These sim-

ulations were achieved with the implementation of the visco-elastic fluid numerical model, represented

by the rheological constitutive FENE-P model. The numerical model for the conformation tensor trans-

port equation was adapted for the non-periodic boundary conditions of the flow, for which no references

are present in the literature. The development of the numerical algorithm took into consideration the

computational performance of the simulations, in which it was implement a ghost cell mechanism to

decrease the computation time. The developed algorithm has been extensively verified to ensure the

correct resolution of the governing equations for turbulent jet flow with polymer additives. The numerical

model was tested in direct numerical simulation for different polymer molecules physical characteristics,

namely the polymeric concentration and the relaxation time. The observed results verified a decrease in

the jet width on the presence of a visco-elastic flow, together with a decrease of the viscous energy dis-

sipation rate. It should be noted that the implemented numerical model represents a major progress on

the numerical simulations of turbulent spatial jet flow with polymer additives, providing the first numerical

results for this kind of study.

Keywords: Visco-elastic turbulence, DNS, FENE-P, Spatial turbulent planar jet

ix

Contents

Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v

Resumo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii

Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix

List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

Nomenclature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix

1 Introduction 1

1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1.2 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.3 Thesis Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

2 Background 3

2.1 What is turbulence? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

2.2 Turbulent Planar Jets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.3 Turbulence in dilute polymer solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

2.3.1 Experimental discoveries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

2.3.2 Numerical simulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

2.4 High performance computing and parallelization strategies . . . . . . . . . . . . . . . . . . 15

2.4.1 Parallelization Efficiency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

2.4.2 Load Balancing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

2.4.3 Granularity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

2.4.4 Distributed vs. Shared memory parallelization . . . . . . . . . . . . . . . . . . . . . 17

2.4.5 Hybrid parallelization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

3 Governing equations and Numerical Methods 21

3.1 Velocity field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

3.2 The FENE-P constitutive model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

3.3 Properties of the conformation tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

3.4 Computational domain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

3.5 Spatial discretization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

3.5.1 Pseudo-spectral scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

xi

3.5.2 Compact finite differences discretization . . . . . . . . . . . . . . . . . . . . . . . . 26

3.5.3 Conformation Tensor Transport Equation . . . . . . . . . . . . . . . . . . . . . . . 30

3.6 Temporal discretization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

3.6.1 Pressure-velocity coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

3.6.2 Non-Linear Term in the Momentum Equation . . . . . . . . . . . . . . . . . . . . . 36

3.6.3 Conformation Tensor Transport Equation . . . . . . . . . . . . . . . . . . . . . . . 37

3.7 Stability condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

3.8 Boundary and Initial Conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

3.8.1 Inlet boundary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

3.8.2 Lateral boundaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

3.8.3 Outlet boundary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

3.8.4 Poisson equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

3.9 Code architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

3.9.1 Parallel architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

3.9.2 Parallel domain decomposition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

3.9.3 FFT calculation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

3.9.4 I/O . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

4 Code Verification 47

4.1 Conformation Tensor Transport Equation Verification . . . . . . . . . . . . . . . . . . . . . 48

4.2 Velocity - Conformation Tensor Coupling Term Verification . . . . . . . . . . . . . . . . . . 58

5 Results 63

6 Conclusions 73

6.1 Achievements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

6.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

Bibliography 75

A Stability condition proof 81

xii

List of Tables

3.1 Gauss-Legendre integration nodes and weights. From [60]. . . . . . . . . . . . . . . . . . 33

4.1 Simulation parameters for transport equation verification. . . . . . . . . . . . . . . . . . . 50

4.2 Coupling term simulation parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

5.1 Computational domain parameters for the simulations. Number of grid points (nx, ny, nz);

non-dimensional grid size (Lx/h, Ly/h, Lz/h). . . . . . . . . . . . . . . . . . . . . . . . . . 64

5.2 Physical parameters for the simulations. Weissenberg number Wi; slot width, initial max-

imum jets velocity, solvent viscosity based Reynolds number Re; Averaged Taylor turbu-

lence micro-scale centreline velocity solvent viscosity based Reynolds number on the x

streamwise direction Reλ; polymer relaxation time τp; ratio of the solvent to solvent plus

polymer viscosity β; polymer normalized maximum extensibility L; mesh resolution nor-

malized by the Kolmogorov small scale at x/h = 14. . . . . . . . . . . . . . . . . . . . . . 64

xiii

List of Figures

2.1 Examples of turbulent flows in engineering applications and nature. . . . . . . . . . . . . . 3

2.2 Time history of the axial component of velocity on the centreline of a turbulent jet. From

[11]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.3 Energy spectrum at high Re under the Kolmogorov hypothesis for a Newtonian fluid. Flow

scales are expressed as the log of the wavenumber, k = 1/l. L is the flow characteristic

length, l0 is the characteristics size of the largest eddies and η is the Kolmogorov scale.

lEI is the start of the inertial sub-range and lDI is the start of the dissipation sub-range.

From [16]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2.4 Model spectrum for turbulence in polymer solutions indicating the three regions: (I) κ <

1/lp, Kolmogorov’s inertial cascade; (E) 1/lp < κ < 1/ηp, Elastic subrange where fractions

of turbulence kinetic energy are transferred to elastic energy, which is then dissipated by

viscous drag of relaxing polymers and internal friction between the monomers of a single

polymer; (V) κ > 1/ηp, Viscous dissipation range where turbulence kinetic energy is

dissipated by viscous forces, just as the Newtonian spectrum. From [17]. . . . . . . . . . 6

2.5 Turbulent planar jet. From [18]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.6 Turbulent planar jet regions. From [16]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

2.7 Turbulent mixing of jets of water (left) and PEO solution (right). From [25]. . . . . . . . . . 9

2.8 Turbulent boundary layer velocity profile. From [26]. . . . . . . . . . . . . . . . . . . . . . 10

2.9 Schematic illustration of the onset of drag reduction and the maximum drag reduction

asymptote. The dotted line represents the case with a fixed polymer concentration C

and increasing Re. The dashed line is the case for a fixed Re, where the onset of drag

reduction is first observed, and the polymer concentration C is increased. f refers to the

friction drag for pipe flows. From [28]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

2.10 Mean velocity profiles during drag reduction. From [23]. . . . . . . . . . . . . . . . . . . . 12

2.11 FENE-P dumbbell model with a spring between two beads. From [35]. . . . . . . . . . . . 13

2.12 One-dimensional schematic of a shock (thick, solid line). From [36]. . . . . . . . . . . . . 14

2.13 Evolution of efficiency with the number of processors. From [18]. . . . . . . . . . . . . . . 16

2.14 Illustration of OpenMP architecture. From [50]. . . . . . . . . . . . . . . . . . . . . . . . . 17

2.15 Illustration of MPI architecture. From [50]. . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

2.16 Illustration of Hybrid MPI+OpenMP architecture. From [50]. . . . . . . . . . . . . . . . . . 19

xv

3.1 View of the computational domain with reference frame and notation. From [16]. . . . . . 24

3.2 Mean velocity profile imposed at the inlet. H stands for the width of the inlet slot. From [18]. 39

3.3 View of the domain partitioning with reference frame and notation. . . . . . . . . . . . . . 43

3.4 Ghost cell configuration data exchange on the streamwise direction for a domain parti-

tioned in 3 normal slices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

4.1 u velocity profile in the normal y direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

4.2 Numerical solution for the Cij tensor considering a u velocity profile in the normal y direction. 51

4.3 Numerical (n) and analytical (a) solutions for the Cii tensor considering a u velocity profile

in the normal y direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

4.4 Numerical (n) and analytical (a) solutions for the Cij , i 6= j tensor considering a u velocity

profile in the normal y direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

4.5 u velocity profile in the spanwise z direction. . . . . . . . . . . . . . . . . . . . . . . . . . . 53

4.6 Cij numerical (n) and analytical (a) solutions comparison considering a u velocity profile

in the spanwise z direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

4.7 w velocity profile in the normal y direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

4.8 Cij numerical (n) and analytical (a) solutions comparison considering a w velocity profile

in the normal y direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

4.9 v velocity profile in the spanwise z direction. . . . . . . . . . . . . . . . . . . . . . . . . . . 55

4.10 Cij numerical (n) and analytical (a) solutions comparison considering a v velocity profile

in the spanwise z direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

4.11 w velocity profile in the streamwise x direction. . . . . . . . . . . . . . . . . . . . . . . . . 56

4.12 Cij numerical (n) and analytical (a) solutions comparison considering a w velocity profile

in the streamwise x direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

4.13 v velocity profile in the streamwise x direction. . . . . . . . . . . . . . . . . . . . . . . . . 57

4.14 Cij numerical (n) and analytical (a) solutions comparison considering a v velocity profile

in the streamwise x direction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

4.15 C12 and C13 profiles prescribed for the coupling term on the u velocity component. H is

the unit of length used in the code. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

4.16 Coupling term for u velocity for numerical (red) and analytical (blue) solutions. . . . . . . . 60

4.17 C12 and C23 profiles prescribed for the coupling term on the v velocity component. h is



4.19 C13 and C23 profiles prescribed for the coupling term on the w velocity component. h is



5.1 Streamwise evolution of non-dimensional (a) jet’s half-width and (b) centreline velocity for

fluids with different relaxation time for cases A, J, K and L. From [69]. . . . . . . . . . . . . 66

xvi

5.2 Normalized mean velocity: (a) streamwise and (b) normal direction components, (c-e)

velocity root mean squared components and (f) Reynolds shear stress profiles for fluids

with different relaxation time for cases A, J, K and L. From [69]. . . . . . . . . . . . . . . . 67

5.3 Mean (a-c) and root mean squared (d-f) components of the conformation tensor for fluids

with different relaxation time at x/h = 12 for cases J, K and L. From [69]. . . . . . . . . . . 68

5.4 Centreline root mean squared evolution of (a) streamwise, (b) normal and (c) spanwise

directions velocity components and (d-f) conformation tensor components for fluids with

different relaxation time for cases J, K and L. From [69]. . . . . . . . . . . . . . . . . . . . 69

5.5 Stream-wise evolution of centreline Weissenberg numbers for all visco-elastic cases.

From [69]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

5.6 Stream-wise evolution of centreline turbulent kinetic energy dissipation of the solvent for

all cases. From [69]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

5.7 Stream-wise evolution of centreline mean conformation tensor components for cases J, K

and L. From [69]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71

xvii

Nomenclature

API Application Programming Interface.

CFD Computational Fluid Dynamics.

CFL Courant-Friedrichs-Lewy condition.

DFT Discrete Fourier Transform.

DNS Direct Numerical Simulations.

FENE-P Finitely Extensible Non-linear Elastic Peterlin model.

FFT Fast Fourier Transform.

HIT Homogeneous Isotropic Turbulence.

I/O Input/Output.

KT Kurganov-Tadmor.

LASEF Laboratory of Fluid Simulation in Energy and Fluids.

MDR Maximum Drag Reduction.

MHR Magneto-hydrodynamics.

MPI Message Passing Interface.

NUMA Non-uniform memory access.

PEO Polyethylene Oxide.

ppm parts per million.

SPD Symmetric Positive Definite.

T/NT Turbulent/Non-turbulent interface.

Greek symbols

αp, βp Runge-Kutta Coefficients.

β Polymer Concentration.

xix

∆t Time step.

δ Flow width.

δij Kronecker’s delta.

ε Turbulent kinetic energy dissipation.

η Kolmogorov length scale.

λ Taylor microscale.

λi Eigenvalues of Cij .

µ Dynamic Viscosity.

ν Kinematic Viscosity.

ρ Fluid density.

τη Kolmogorov time scale.

τp Zimm relaxation time of the polymer.

ξi Gauss-Legendre integration nodes.

Roman symbols

C Conformation tensor.

H Convective flux tensor.

u Velocity vector.

Ai Gauss-Legendre integration weights.

Cij Components i, j of the Conformation tensor.

E(k) Kinetic energy density function.

f(Ckk) Peterlin function.

h Width of the inlet slot.

k Wavenumber.

L Maximum polymer extension.

l Characteristic length scale.

Lx Computational domain length in the x direction.

Ly Computational domain length in the y direction.

Lz Computational domain length in the z direction.

xx

nx Number of grid points in the x direction.

ny Number of grid points in the y direction.

nz Number of grid points in the z direction.

p Pressure.

ri Instantaneous orientation of a polymer dumbbell.

Re Reynolds Number.

Reλ Reynolds Number based on the Taylor scale.

Sij Strain Rate tensor.

t Time.

Tij Stress tensor.

u First component (streamwise) of the velocity vector.

ui Velocity vector.

U1L Mean local co-flow velocity.

U1 Co-flow velocity.

U2 Peak velocity at the inlet.

Uc Mean local centreline velocity.

v Second component (normal) of the velocity vector.

w Third component (spanwise) of the velocity vector.

x First spatial (streamwise) coordinate.

y Second spatial (normal) coordinate.

z Third spatial (spanwise) coordinate.

Subscripts

i, j, k Computational indexes.

x, y, z Cartesian components.

Superscripts

[p] Polymer.

[s] Solvent.

xxi

Chapter 1

Introduction

The objective of the current chapter is to describe the motivation underlying the present work and to

provide a brief description of the outline of the document.

1.1 Motivation

Since Toms (1948) [1] reported turbulent drag reduction by the addition of long-chain polymers by up to

80 percent, this effect has been studied for both practical and theoretical purposes. Currently, industrial

applications regarding polymer drag reduction are mostly related to long-distance liquid transportation

pipeline systems, marine vehicles and heating/cooling system water-circulating devices.

This work will focus on the study of this phenomenon on turbulent jet flows, to further understand the

mechanism of polymer interaction on jet flows and advance our understanding of fluid turbulence and

visco-elastic flows. It is important to note that the framework of this study has potential applications on

the aerospace industry, namely on heat transfer reduction by injection of micro-jets into turbine blades

[2]. There is a potential application on combining visco-elastic fluids with the micro-jet structures, as

the heat transfer behaviour of visco-elastic fluids has been studied recently, namely focused on cooling

applications for turbine disks [3, 4].

Moreover, it is possible to make a formal analogy between the Finite Extensible Nonlinear Elastic -

Peterlin (FENE-P) set of equations for visco-elastic flows and magneto-hydrodynamics (MHD) equations,

as mentioned in [5, 6]. Thus, the numerical model considered here to simulate polymer solutions (FENE-

P) will have a strong resemblance and application to the methodology used in numerical simulations of

MHD flows. As for aerospace industry applications, in recent years plasma flows have rapidly become

an important set of new technologies that find application to hypersonic propulsion, being evident a

growing interest in plasma-based flow manipulation through MHD forces [7–9]. This step, together with

the combining technological developments in electromagnetics, aerodynamics, and chemical kinetics,

might lead to a breakthrough in propulsion systems for improving aerospace vehicle performance [10].

As the current work will comprise the resolution of Navier-Stokes equations coupled with polymer

models using Direct Numerical Simulations (DNS), it is of the uttermost importance the optimization of

1

the computational performance of the numerical model. Such is entailed by considering parallelization

techniques for high performance computing, with transversal applications to numerical simulations in the

industry.

1.2 Objectives

The main objectives for the work presented here are:

1. To implement and verify a computational method to simulate turbulent jets with polymer additives

(FENE-P model) through Direct Numerical Simulations (DNS) on an existing turbulent jet DNS

code.

2. To optimize the computational performance of the resulting computational model.

3. To analyse the effects of polymer additives on the development of turbulent jets.

1.3 Thesis Outline

The present document is organized into six chapters.

In the first chapter, it is described the underlying usefulness of the current work, highlighting potential

future applications, along with a statement of the main objectives to be accomplished.

In the second chapter, it is introduced the main subjects fundamental to the present work. The topics

of turbulence and turbulent planar jet are presented, along with a review on flows with diluted polymer

solutions. It is also presented a review on parallelization techniques for high performance computing.

In the third chapter, it is described the constitutive equations concerning the velocity field and the

polymer transport equation, along with the detailed definition of the numerical methods that were used

to implement the equations for the spatial discretization, temporal advancement and the boundary con-

ditions. It is provided detail regarding the parallelization strategy, namely on the enhancements that

were developed during this work.

In the fourth chapter, it is presented the code verification with analytical solutions, for example, veri-

fication was performed with the Couette flow. The verification exercise was extended to all components

from the conformation tensor transport equation and, also, the coupling term on the velocity field equa-

tions.

In the fifth chapter, it is presented the results obtained from DNS of turbulent planar jet with polymer

additives. Several quantities are analysed for different polymer physical characteristics. It is worth

mentioning that these results are the first numerical results of DNS visco-elastic turbulent planar jet.

The conclusions are presented in the sixth chapter, as well as proposals for future work.

2

Chapter 2

Background

This chapter defines the concept of turbulence and planar jet flows, along with some of its characteristics.

Furthermore, it is introduced an overview of numerical methods for the phenomenon of polymer-induced

drag reduction (FENE-P) and High Performance Computing, with emphasis on parallelization strategies.

2.1 What is turbulence?

As highlighted by [11], turbulent flows are prevalent in engineering applications and nature. Indeed,

flows such as waterfalls, rivers or flows around vehicles (e.g. airplanes, vehicles) are of turbulent nature.

Figure 2.1 illustrates examples of this phenomena.

(a) Space shuttle Atlantis launch in 2006. Fromwww.nasa.gov/centers/marshall/history.

(b) View of the Mayon volcano at January 24, 2018.From Romeo Ranoco/Reuters.

Figure 2.1: Examples of turbulent flows in engineering applications and nature.

3

Turbulence is a characteristic of the flow, being generally associated with flows that are highly un-

steady with chaotic variations of velocity and pressure in space and time. The transition from a laminar

to a turbulent flow is defined by Reynolds number [12], which is given by:

Re =ρUl

µ=Ul

ν(2.1)

where µ is the molecular viscosity, ρ is the density and ν is the kinematic viscosity. U and l are the

characteristic velocity and length scale of the flow, respectively. In most cases, when Re exceeds a

certain critical value, the flow transits from laminar to turbulent flow. The Reynolds number expresses

the ratio of inertial/inviscid forces and viscous forces and indicates the flow’s reaction to disturbances,

because perturbations can be damped by viscosity or amplified by inviscid forces.

The turbulence chaotic nature is noted by watching the velocity variation in time measured on the

centreline of a turbulent jet, as illustrated on Figure 2.2.

Figure 2.2: Time history of the axial component of velocity on the centreline of a turbulent jet. From [11].

Despite the random behaviour observed on velocity U1(t), observations show that the average ve-

locity < U1 > and its fluctuations present a stable statistically behaviour, which justifies the study of

turbulent flows by means of statistics.

The first concept of energy transfer through scales in turbulence was introduced by Richardson [13],

which stated that turbulence is produced at a large scale and is progressively broken down into smaller

and smaller scales by inviscid forces, until reaching a sufficiently small scale where viscosity becomes

dominant and the mechanical energy is dissipated into heat.

This conceptual picture was translated into a quantitative theory by Kolmogorov [14], who summa-

rized the following hypothesis to define the energy transfer process in turbulence:

• The local isotropy hypothesis: At sufficiently high Reynolds number, the small-scale turbulent

motions are statistically isotropic.

• The first similarity hypothesis: At sufficiently high Reynolds number, the statistics of small-scale

4

motions have a universal form dependent only on viscosity and dissipation rate. This region is

named universal equilibrium range.

• The second similarity hypothesis: At sufficiently high Reynolds number, the statistics of motions

at scales that are much smaller than the integral scale and much larger than the smallest scale

have a universal form dependent only by the dissipation rate. This region is named inertial sub-

range. The zone in which the viscous effects start being dominant is named dissipation sub-range.

The smallest scales of turbulence are called the Kolmogorov scales. By using the first similarity

hypothesis, and performing a dimensional analysis, length (η), time (τη) and velocity (uη) are determined

by the viscosity ν and dissipation rate ε and are given by:

η ≡(ν3

ε

)1/4

, τη ≡√ν

ε, uη ≡ (νε)

1/4 (2.2)

Obukhov [15] formulated Kolmogorov’s hypothesis in the spectral space, in which the turbulence

eddies characteristics scale l is described as a function of a wave number k = 2π/l.

Figure 2.3: Energy spectrum at high Re under the Kolmogorov hypothesis for a Newtonian fluid. Flowscales are expressed as the log of the wavenumber, k = 1/l. L is the flow characteristic length, l0 is thecharacteristics size of the largest eddies and η is the Kolmogorov scale. lEI is the start of the inertialsub-range and lDI is the start of the dissipation sub-range. From [16].

The kinetic turbulence energy distribution along the spectrum of wave numbers is expressed in Figure

2.3. Ek is the turbulence kinetic energy, L is the characteristics length of the flow, l0 is the characteristics

size of the largest eddies and η is the Kolmogorov scale. lEI is the start of the inertial sub-range and

lDI is the start of the dissipation sub-range.

Using Kolmogorov’s second similarity hypothesis, that at the inertial sub-range viscous effects are

negligible, becoming only function of the dissipation rate and the eddies length scale l, it follows that the

5

energy spectrum is proportional to ε2/3κ−5/3 for a Newtonian fluid and expressed by:

E(k) = Ckε2/3κ−5/3 (2.3)

in which k is such that lEI > l > lDI , and Ck is the Kolmogorov’s constant.

For the case of a visco-elastic fluid, the polymer interaction with the flow leads to a steepening of

the kinetic energy spectrum beyond a wavenumber lp, which is the Lumley length-scale. In [17], it is

presented a model spectrum for the visco-elastic fluid consisting of 3 major regions: (I) Inertial cascade,

(E) Elastic subrange and (V) Viscous dissipation. This model spectrum is illustrated on Figure 2.4.

It is important to notice that the model spectrum for the visco-elastic fluid on Figure 2.4 has been

verified with experimental results by [17].

Figure 2.4: Model spectrum for turbulence in polymer solutions indicating the three regions: (I) κ < 1/lp,Kolmogorov’s inertial cascade; (E) 1/lp < κ < 1/ηp, Elastic subrange where fractions of turbulencekinetic energy are transferred to elastic energy, which is then dissipated by viscous drag of relaxingpolymers and internal friction between the monomers of a single polymer; (V) κ > 1/ηp, Viscous dis-sipation range where turbulence kinetic energy is dissipated by viscous forces, just as the Newtonianspectrum. From [17].

The (I) Inertial cascade on Figure 2.4 consists on a Newtonian inertial cascade at low wavenumbers,

whose governing equations have been above-mentioned.

According to [17], the (E) Elastic subrange is a spectral region separated from the inertial cascade

by the Lumley scale lp, which is determined by the elastic properties of the fluid and the turbulence

dissipation rate. In this region, fractions of the turbulence kinetic energy arriving from the inertial sub-

range are converted into elastic energy by polymer stretching. Subsequent relaxation of the stretched

polymers dissipates a part of this elastic energy due to the viscous drag of the polymer molecules in the

solvent and interactions between monomers of a single polymer. The remaining elastic energy is trans-

formed back to turbulence kinetic energy. This phenomenon is called back reaction and, consequently,

the energy flux on this region is continuously reduced from higher to lower wavenumbers.

6

Experimental data [17] suggested that this region follows a power-law with a slope of−3, as illustrated

on Figure 2.4, where the polymer stresses overcome the viscous stresses. The energy spectrum in a

turbulent structure of wavenumber k decreases according to:

E(k) = Ckε2/30 l5/3p (lpκ)

−3 (2.4)

where ε0 is equal to the energy flux from the Newtonian inertial cascade.

The (V) Viscous subrange consists on a region dominated by viscous stresses which transform

turbulence kinetic energy into heat, similarly to the dissipation range in the Newtonian spectrum, in

which the new viscous dissipation scale is given by:

ηp ≡(ν3

εv

)1/4

(2.5)

where εv is the energy flux contributing to the viscous dissipation at the end of the elastic subrange.

2.2 Turbulent Planar Jets

The turbulent plane jet flow is a type of flow belonging to the free shear flow type, which are charac-

terised by developing far away from the interaction with boundaries, and that advances along a preferred

direction. An example of a two-dimensional plane jet flow is presented in Figure 2.5.

Figure 2.5: Turbulent planar jet. From [18].

This type of flow is characterised by being statistically two-dimensional, with a dominant direction of

mean flow across the streamwise x direction and a mean velocity in the spanwise z direction of zero.

It is important to note some features of this kind of flow, namely that the velocity in the streamwise x

direction is much greater than the velocity in the normal y direction (u >> v), whilst the normal gradient

∂/∂y is much larger than the streamwise gradient ∂/∂x (∂/∂y >> ∂/∂x), as stated in [16]. Moreover, it

is worth mentioning the relation of flow scales for this type of flow, namely the shear layer thickness (δ)

and the flow streamwise length (L), for which L >> δ.

As the remaining shear flows, the turbulent plane jet is characterised by presenting a sudden change

from a turbulent rotational region, to an irrotational one. This transition is observed at the turbulent/non-

turbulent interface (T/NT), in which occurs exchanges of mass, momentum and other scalars while the

jet develops in the streamwise direction, spreading out. This dispersion of the flow makes it swell,

7

transmitting momentum to the adjoining fluid in the form of momentum and vorticity, dragging mass into

it in a phenomenon known as turbulent entrainment [16].

The plane jet can be split across different regions, as illustrated on Figure 2.6:

• The potential core region: It corresponds to the flow between the shear layers near the jet inlet,

originated by the high velocity gradient between the flow incoming from the jet inlet and that of the

remaining environment.

• The transition region: It comprises all the area in which the flow experiences transition into a fully

developed turbulent state.

• The self-similarity region: Region in which the flow characteristics, such as velocity and other

scalar properties, can be collapsed seamlessly after being properly scaled following the Kol-

mogorov’s hypothesis.

Figure 2.6: Turbulent planar jet regions. From [16].

2.3 Turbulence in dilute polymer solutions

2.3.1 Experimental discoveries

Since the discovery of the polymer drag reduction by Toms [1], there have been numerous experiments

done and theories proposed to explain the mechanisms underlying the drag reduction by dilute polymers

on turbulent flows. Indeed, Nadolink and Haigh [19] included over 2500 entries on the bibliography

review over this phenomenon. Nonetheless, Jin [20] refers that the most cited literature on polymer drag

reduction in boundary layers are Lumley [21, 22], Virk [23] and de Gennes [24], which also provide the

most successful attempts to explain the mechanism of polymer drag reduction.

8

Lumley [21] defined drag reduction as the reduction of skin friction in turbulent flow below that of

the solvent alone, by the addition of trace percentages of polymers. This phenomenon was observed

to occur in thermodynamically dilute solutions of long, flexible, expanded high-molecular-weight linear

polymers, considering typical polymer-solvent systems, such as the polyethylene oxide (PEO or Polyox)

in water. This one has become the most common solution due to its inexpensiveness, easiness to handle

and effectiveness (33% reduction in skin friction over that of water alone by the addition of 18 parts per

million (ppm) of PEO of molecular weight of 0.76× 106), as stated by [21]. An illustrative example of the

effect of PEO solution on turbulent flows is presented on Figure 2.7.

Figure 2.7: Turbulent mixing of jets of water (left) and PEO solution (right). From [25].

Further to that, Lumley [22] said that in regions where polymer molecules are elongated, viscosity is

enhanced. Moreover, Lumley stated the at sufficiently high wall shear stresses, the fluctuating strain rate

causes polymer molecules to expand and, thus, increasing viscosity. Furthermore, Lumley concluded

that in turbulent flows elongated polymer molecules are found in the wall layer, but not in the viscous

sub-layer, as vorticity and strain flow rate are not correlated (molecules do not expand, since there is

only simple shear on the viscous sub-layer).

Therefore, increased viscosity will only occur in the wall layer, leading to dissipation of turbulence

9

Figure 2.8: Turbulent boundary layer velocity profile. From [26].

fluctuations and mitigation of turbulence stresses. Small eddies on the wall layer will be damped by

the increased viscosity, and the resulting lower Reynolds stress at the buffer layer will thicken the vis-

cous sub-layer. The large eddies will expand on the viscous sub-layer and lead to higher streamwise

fluctuating velocities in this region. On the maximum drag reduction (MDR) regime large eddies will be

dominant.

Virk [27] compiled many experimental data available at that time on pipe flows with polymer additives

and smooth surfaces. Virk summarized three regimes of pipe flows with polymer additives:

• Laminar regime: Under a laminar flow regime, no drag reduction is observed, and the skin friction

obeys Poiseuille’s law, because the overall viscosity is very similar to the solvent viscosity in the

dilute solution.

• Polymeric regime: The flow is turbulent and the drag reduction effect is observed. Furthermore,

Virk states that the onset of drag reduction occurs at a universal characteristic wall shear stress,

and that the amount of drag reduction is dependent on the polymer characteristics, such as poly-

mer molecular weight and polymer concentration.

• Maximum drag reduction (MDR): It was determined the existence of a universal drag reduction

saturation asymptote, i.e. independent of system and polymer properties. The maximum drag

reduction is observed in the velocity profile and in the friction flow rate domain.

The drag reduction phenomenon appears to lie between two universal asymptotes: the Newtonian

turbulent flow (the so-called Prandtl-Karman law) and the MDR (see Figure 2.9). The experimental data

compiled in [23] confirmed the agreement with the MDR asymptote and drag reduction onset shear

stress relation. Indeed, it was concluded that MDR occurs when the effects of polymers are felt over all

scales, causing the buffer layer thickness to extend across the entire boundary layer.

10

Figure 2.9: Schematic illustration of the onset of drag reduction and the maximum drag reduction asymp-tote. The dotted line represents the case with a fixed polymer concentration C and increasing Re. Thedashed line is the case for a fixed Re, where the onset of drag reduction is first observed, and thepolymer concentration C is increased. f refers to the friction drag for pipe flows. From [28].

Moreover, Virk [23] observed and stated that the mean velocity profile can be divided into three zones

(see Figure 2.10):

• Standard viscous sub-layer: Usual region from the wall outward.

• Elastic sub-layer: This layer exists between the ”Newtonian plug” region and the usual viscous

sub-layer. It is characteristic of drag reduction and its extent increases with increasing drag reduc-

tion, up to the maximum drag reduction. On the latter situation, this layer is spread through the

entire cross section.

• Newtonian log layer: Also named ”Newtonian plug”, this layer has the universal slope logarithmic

profile. On this layer the velocity profile is shifted upwards, but parallel to the Newtonian law of the

wall.

The turbulence structure information can be analogously categorized into the same three zones.

Years later, experiments were performed in which it was observed that drag reduction initiates as

polymer enters lower inertial wall layer from above or below, suggesting that the source of the drag

reduction lies within the lower inertial range [24, 29]. Therefore, De Gennes [24] proposed a model based

on the elastic properties of the polymer particles, instead of the viscosity effects stated by Lumley. The

elastic theory postulates that the elastic energy stored by the partially stretched polymers is an important

variable for drag reduction and suppression of turbulence, and the increase in the effective viscosity is

small and inconsequential.

Moreover, the onset of drag reduction occurs when the cumulative elastic energy stored by the par-

tially stretched polymers becomes comparable with the kinetic energy in the buffer layer. The small

scales will be damped, leading to a buffer layer increase and reduced drag. The polymer concentration

11

Figure 2.10: Mean velocity profiles during drag reduction. From [23].

is included in the onset criterion because the cumulative elastic energy of the polymers is a function of

the concentration [24].

Recently, experiments have been made with fully developed three-dimensional turbulence that do

not support convincingly elastic theory. Instead, an approach based on an ”energy flux theory” is pro-

posed, that states that the turbulence energy flux in the cascade process is gradually reduced by the

energy transfer into the elastic motion of polymers, which becomes dominant in the small scales. An

experimental work that supports this approach is presented by [30].

2.3.2 Numerical simulations

Over the past 20 years, Direct Numerical Simulations (DNS) have played an increasingly important

role in the investigation of turbulent drag reduction mechanisms by polymer additives, namely for wall-

bounded turbulent flows. Despite the limitedness of DNS accuracy, due to the model for polymer stress,

inability to resolve all polymer scales and potential numerical instability, DNS provides the advantage of

describing the orientation of the polymer micro-structure in addition to the velocity field and Reynolds

stresses. This is especially relevant over laboratory experiments, allowing additional meaningful and

comprehensive insight over drag reduction mechanisms.

Typically, numerical simulations consider polymer molecules as two beads connected by an elastic

spring. The polymer dynamics are then described by the evolution of the end-to-end vector connecting

the two beads (see Figure 2.11), using constitutive equations such as Oldroyd-B and FENE-P [31].

Recently, the FENE-P polymer model has been widely used in DNS to study turbulence polymer drag

reduction, in such works as [32], [33] and [34].

The polymer orientation is represented as a continuous second-order tensor field, the so-called con-

formation tensor. This conformation tensor is defined as the normalised second moment of the end-to-

12

Figure 2.11: FENE-P dumbbell model with a spring between two beads. From [35].

end vector between the two beads [36], described as:

Cij =〈rirj〉

13 〈r2〉eq

(2.6)

where ri is the instantaneous orientation of a polymer dumbbell, r2eq is the square of the equilibrium

separation distance, and the angle brackets imply an ensemble average over the configuration space of

the dumbbell.

The conformation tensor Cij is a symmetric and positive definite (SPD) matrix. This property chal-

lenges the DNS of turbulence coupled FENE-P, as it is required that Cij remains semi-positive. While

mathematically both constraints are satisfied by the governing equations, these properties can be lost

due to cumulative numerical errors. Early attempts at numerical simulation of visco-elastic turbulence

were plagued by Hadamard instabilities that resulted from the numerical loss of the SPD property [37].

Sureshkumar and Beris [38] overcame these instabilities by introducing a stress diffusion term into

the equation for the conformation tensor. Variations of this approach were used in several investigations.

In 1997, the first DNS of channel flow [32] was able to show the Drag Reduction phenomenon, although

with lower Reynolds numbers in the numerical simulations in comparison to the conditions under which

Drag Reduction is experimentally observed with dilute polymers.

Vaithianathan [39] exploited the SPD property of Cij to derive independent equations for the eigen-

vectors and eigenvalues of the conformation tensor. In this formulation, Cij must remain greater than

zero and the eigenvalues of Cij should comply with the finite extensibility of the polymer:

λ1 + λ2 + λ3 ≤ L2 (2.7)

where λi are the eigenvalues of Cij and L is the maximum polymer extension (non-dimensional). Even

though this implicit formulation guarantees that Equation 2.7 is satisfied, the compact finite-difference

method that was used by [40] did not guarantee the eigenvalues remain positive. Instead, realizability

was enforced by setting the negative eigenvalues to zero before constructing the conformation tensor,

ensuring numerical stability. However, the uncontrolled, spatially distributed adjustments of the eigenval-

ues destroyed overall conservation of the conformation tensor, and spatial averages of the conformation

tensor contained spurious contributions from the convective term.

13

Thus, the decomposition applied by [39] guaranteed stability (by providing easy access to the eigen-

values) but did not guarantee conservation. This issue can be traced back to early numerical approaches

to compressible flows, which often suffered from loss of conservation [41]. It is related with the hyper-

bolic nature of the equation for Cij in the Oldroyd-B, FENE-P and Giesekus models, which admits shocks

(discontinuities) in the polymer stress tensor [42]. Discontinuities in the polymer stress cannot be fully

resolved by a grid, and so the main responsibility of the numerical scheme is to correctly predict the

jump magnitude. Jumps in the conformation tensor should satisfy the overall conservation balance to

guarantee correct elastic wave propagation. This behaviour is illustrated on Figure 2.12.

Figure 2.12: One-dimensional schematic of a shock (thick, solid line). Black dots represent the gridpoints. Thick dashed line is an ideal representation of the shock on the grid. A spectral representation,without an artificial stress diffusivity, would look like the thin dashed line, with overshoots and under-shoots (Gibbs phenomenon). The dotted line indicates the effect of adding the stress diffusivity to thespectral representation. From [36].

The solid line indicates a jump in the polymer stress tensor across a discontinuity. The other curves

illustrate the equivalent numerical representation based on a finite-difference and spectral scheme. The

Gibbs phenomenon observed in the spectral representation can be attenuated by introducing an artificial

diffusivity. However, artificial diffusion can also reduce the magnitude of the jump. There are more

sophisticated approaches to filtering the spectral modes to (just) eliminate the ringing, but they are still

at a relatively early stage of development [43]. In contrast, specific finite-difference schemes have been

designed to maintain the magnitude of the jump and avoid excessive spreading of the discontinuity.

Early hyperbolic solvers were first-order in space, robust and reliable, yet often overly dissipative

[41]. More recent approaches have overcome these shortcomings while still preserving the simplicity

of implementation and robustness. The approach taken by [36, 44] consists on an algorithm based

on the method of Kurganov and Tadmor (KT). This second-order scheme guarantees that a positive

scalar will remain so at all points and it was generalized to guarantee that an SPD tensor also remains

SPD. Furthermore, the method dissipates less elastic energy than methods based on artificial diffusion,

resulting in stronger polymer–flow interactions. This approach has been successfully used in DNS of

Shear Flow, Decaying and Forced HIT [45–47], and, as such, will be used in this dissertation.

14

2.4 High performance computing and parallelization strategies

As stated in [48], there is an increasing need of computational resources to perform increasingly complex

DNS simulations. In the case of large computational simulations of fluid flow, and to take advantage of

these resources, it is necessary to develop codes that can run in parallel and are suited to the cluster

configurations that exist nowadays [49].

2.4.1 Parallelization Efficiency

In this subsection, it is presented the metrics used for the performance of the code, namely the speed-up

and the efficiency of the parallel code. While the speed-up provides a good measure of the performance,

it would be irrelevant if the number of processors was not known, while the efficiency provides a way of

comparison between the different configurations.

The speed-up is defined as the ratio between the sequential time ts and the parallel equivalent tp,

given by equation (2.8):

Speed− up =tstp

=σ(n) + ψ(n)

σ(n) + ψ(n)p + κ(n, p)

(2.8)

in which σ is the time in inherently serial tasks, ψ is the time spent in parallelizable tasks, p is the number

of processing units, n is a measure of the problem size (such as mesh points), and κ is the measure of

communication introduced by parallelization, which is a function of both the problem size and the number

of processors. The σ(n) + ψ(n) reflects the total sequential time of the code. It is convenient to define

this metric in terms of fractions of the code with reference to the total time. Thus, it is introduced the

parallel Fp and sequential Fp fractions of the code. Ideally the parallel fraction would be 100%, but it is

not possible since there are always some initialization procedures that cannot be performed in parallel.

The parallel and sequential fractions are related according to equation (2.9):

Fp =ψ(n)

σ(n) + ψ(n)= 1− Fs (2.9)

If one considers that κ is neglectable, it is obtained the Amdahl’s Law from (2.10). Amdahl’s Law provides

an upper limit to the performance of a parallel code and it shows that unless the sequential fraction is

very small, there will be no gain in increasing the number of processors.

Speed− up =1

1 + Fp

(1p − 1

) (2.10)

The efficiency of a parallel code (equation (2.11)) is given by dividing the speed-up by the number

of processes. It provides a comparable measure of performance between configurations that differ in

problem size and/or number of processors.

Efficiency =1

p+ Fp (1− p)(2.11)

In Figure 2.13 it is shown the evolution of efficiency with the number of processors. It becomes clear why

15

scalability is critical, as larger numbers of processors will yield less and less performance improvements.

Figure 2.13: Evolution of efficiency with the number of processors. From [18].

2.4.2 Load Balancing

Load balancing is an important concept to assess performance of parallelization code. It consists on

dividing a task among different processing units in a synchronized way, that is each unit takes the same

amount of time to complete its part, without having processes finishing earlier than the others and

waiting, resulting in idling resources. The slowest process will be the limiting factor when assessing load

balancing.

Another form of imbalance is related to memory imbalance between processors. On machines with

little memory per node, load imbalance can become burdensome.

2.4.3 Granularity

Code granularity refers to measure between computation and communication, once a process in parallel

programming is split across multiple processing units that work simultaneously. The following types of

granularity are defined in [49]:

• Coarser grain: The task is divided in large components, with greater computation work between

communication stages, which implies a better opportunity for performance increase. The load

balancing is more difficult to achieve, with possibly longer synchronization waiting periods.

• Finer grain: The task is divided into small components and less computation is required between

communication events. Despite being an advantage for load balancing, it will lead to higher com-

munication overheads, thus leading to potential worse performance.

16

2.4.4 Distributed vs. Shared memory parallelization

Shared memory parallelization

Shared memory parallelization refers to code being executed by various processors accessing a shared

memory. Thread-based models deal with each thread behaving as a processing unit that has no sepa-

rate memory of its own and shares its memory with the other threads, each thread doing its part of the

processing on the same shared memory, contributing to the desired result. This implies that all threads

or processes see and have access to the same memory. An illustration of this set-up is presented in

Figure 2.14.

Figure 2.14: Illustration of OpenMP architecture. From [50].

The standard application for shared memory configuration is OpenMP. It uses the Fork-Join model, in

which the master thread is responsible for synchronizing the run and distributing the code amongst other

threads. Then, the code is executed between threads and, after the parallelization step, the remaining

threads are joined with the master thread so the code proceeds in sequential mode.

According to [50], this configuration presents the following limitations:

• All processes or processors demand access to the same memory. Even though a software layer

that deals with simulating a shared memory can be used, the performance will be severed.

• Scalability is physically limited by the size of the shared memory node on which it is running.

• Some variables and locations in memory also must be duplicated to avoid race conditions, where

multiple threads try to update the same shared variable simultaneously. Care must be taken not to

overlap multiple processors changing the same place in memory simultaneously.

• Not suitable for dynamic problems (i.e. when the workload fluctuates rapidly during the execution).

Distributed memory parallelization

Contrary to shared memory parallelization, distributed memory relies on communication between pro-

cessing units, each one with its individual memory.

17

Message Passing Interface, or MPI, is the standard API used to parallelize programs that run on

distributed memory architectures. It provides different types of communications, such as point-to-point

and collective (e.g. gathering or broadcasting of data). The communications must be explicitly handled

by the user. Moreover, it provides other functionalities, such as parallel I/O and derived datatypes. An

illustration of this configuration is presented in Figure 2.15.

Figure 2.15: Illustration of MPI architecture. From [50].

It is important to note that the current work considers a parallelized code under MPI.

According to [50], this configuration presents the following limitations:

• Final speed-up limited by the purely sequential fraction of the code.

• Scalability is limited due to the additional costs related to the MPI library and load balancing man-

agement.

• MPI is a flat model, treating each MPI process as a separate physical processor and memory,

independently of physical architecture. This means that unnecessary communications might take

place between processes that physically have access to the same memory.

• Certain types of collective communications become more and more time-consuming as the number

of processes increases, namely MPIAlltoall, which is used in this work.

• As each processor gets its own portion to process before the task is under-way, the assumption of

equal load distribution is taken, which might not be the case as the processing progresses. This

load unbalance might be accounted for by the user, but it involves great programming effort as MPI

is not developed with this taken into consideration, and overheads of more frequent communication

for load distribution appear.

2.4.5 Hybrid parallelization

Hybrid parallel programming consists of mixing several parallel programming paradigms in order to ben-

efit from the advantages of the different approaches. In general, MPI is used for communication between

18

processes, and another paradigm, such as OpenMP, is used inside each process. An illustration of this

configuration is presented in Figure 2.16.

Figure 2.16: Illustration of Hybrid MPI+OpenMP architecture. From [50].

A hybrid programming configuration presents the following advantages in comparison to the dis-

tributed and shared memory [49]:

• Improved scalability through a reduction in both the number of MPI messages and the number of

processes involved in collective communications. This gain is specially verified if the non-hybrid

code uses communications of type MPI AlltoAll.

• More adequate to the architecture of modern supercomputers (e.g. interconnected shared-memory

nodes, NUMA machines, ...), whereas MPI used alone is a flat approach.

• Optimization of the total memory consumption, due to the OpenMP shared-memory approach;

less replicated data in the MPI processes. According to [49], benchmark of pure MPI to Hybrid

provided a memory gain between 80% to 480%.

• Fewer simultaneous accesses in I/O and a larger average record size, with potential significant

time savings on a massively parallel application.

• Better load balancing, by joining OpenMP load balancing features with MPI.

However, implementation of hybrid programming comes with higher level of complexity, with gains in

performance not being guaranteed according to the final application [50].

19

Chapter 3

Governing equations and Numerical

Methods

This chapter describes in detail the implementation of the Navier-Stokes equations that govern viscous

fluid flows, considering a coupling mechanism to represent the polymer additive-turbulence interaction,

along with the numerical methods considered in the main code that were used for the simulation of turbu-

lent planar jet with polymer additives. It is highlighted the discretization schemes considered throughout

jet simulations, as well as the parallelization techniques.

It is important to note that the starting point of the numeric work here presented is a DNS code for

turbulent jet described in Ricardo Reis’ PhD thesis [16] and Diogo Lopes’ PhD thesis [18]. Both thesis

were concerned about DNS of Newtonian fluids, presenting extensive verification and validation of the

DNS of turbulent jets. Furthermore, the core of the parallelization strategy was developed in the above-

mentioned works. Subsequently, the main features that were added to the DNS code are as follows:

• Introduction of the Conformation tensor transport equation;

• Coupling of the Navier-Stokes momentum equation with the Polymer Stress Tensor;

• Use of ghost cells data on streamwise slab configuration.

An overview regarding the discretization schemes considered on this work is now presented:

• The Navier-Stokes momentum equation is solved considering a pseudo-spectral scheme for spa-

tial discretization, together with fully explicit third order Runge-Kutta for temporal advancement;

• The Conformation tensor transport equation is solved according to the following structure:

– The convective term is discretized according to the Kurganov-Tadmor (KT);

– The stretching term is discretized using the finite difference method;

– The method used for temporal advancement is the explicit third order Runge-Kutta method.

21

3.1 Velocity field

On the velocity field governing equations, it is assumed that the fluid is incompressible and that it satisfies

the continuity and momentum equations with an additional term related with divergence of polymer

stress. Below it is presented the continuity and momentum, respectively:

∂ui∂xi

= 0 (3.1)

∂ui∂t

+ uk∂ui∂xk

= −1

ρ

∂p

∂xi+

1

ρ

∂Tij∂xj

(3.2)

where ui is the i component of the velocity vector, ρ is the constant fluid density, p is the local pressure

and Tij is the combination of the viscous and polymer stress tensors. The stress tensor can be ex-

pressed as a linear sum of contributions from the Newtonian stress (T [s]ij ) and the polymer stress (T [p]

ij ),

as shown below:

Tij = T[s]ij + T

[p]ij (3.3)

where the Newtonian stress tensor T [s]ij is given by:

T[s]ij = 2ν[s] ∂Sij

∂xj(3.4)

being ν[s] the kinematic viscosity of the fluid. The Sij is the Strain Rate Tensor defined as:

Sij =1

2

(∂ui∂xj

+∂uj∂xi

)(3.5)

The Polymer Stress Tensor T [p]ij is described in the next sub-section.

3.2 The FENE-P constitutive model

Among the conformation models used in turbulence polymer simulations, the Finitely Extensible Non-

Linear Elastic Peterlin Model (FENE-P) is the most widely used configuration [28, 45]. FENE-P model

considers the polymer solution as a flowing suspension sufficiently dilute that the polymer molecules

do not interact with each other and that each molecule is idealized as an elastic dumbbell composed

by two beads connected by a non-linear spring with a maximum length [31]. Under this configuration,

and considering a uniform polymer concentration field, the constitutive equation of the Polymer Stress

Tensor yields the following relationship:

T[p]ij =

ρν[p]

τp[f(Ckk)Cij − δij ] (3.6)

where Cij is the conformation tensor, ν[p] the zero shear-rate polymeric viscosity, δij is the Kronecker

delta, L is the maximum possible extension of polymers and τp is the Zimm relaxation time of the

22

polymer. The polymer viscosity ν[p] is included in the model by a non-dimensional parameter β, which

represents the ratio between the solvent and the zero-shear-rate viscosity:

β =ν[s]

ν[p] + ν[s](3.7)

The function f(Ckk) is the Peterlin function given by:

f(Ckk) ≡ L2 − 3

L2 − Ckk(3.8)

where Ckk = Cxx + Cyy + Czz is the trace of the conformation tensor, which represents the extension

length. This function ensures finite extensibility, as it gives rise to a non-linear spring force that diverges

as√Ckk → L, preventing the spring from extending beyond L [36].

To complement the continuity equation and the conservation of momentum equation, a transport

equation for the conformation tensor Cij is required. Under the FENE-P model, the conformation tensor

evolution equation is given by:

∂Cij∂t

+ uk∂Cij∂xk

=∂ui∂xk

Cjk +∂uj∂xk

Cik −1

ρ

T[p]ij

ν[p](3.9)

The former equation is solved simultaneously with the velocity field flow equations, ensuring polymer-

flow interaction as the polymer molecules are deformed by the velocity field and, in turn, the resulting

conformation tensor introduces a polymeric stress on the flow structure.

3.3 Properties of the conformation tensor

The effects of polymer stretching on the flow development may be understood as either an elastic effect

or a viscous effect. In the framework of the FENE-P model, if the problem is approached from the

perspective of the elastic theory of drag reduction, the transport equation for the elastic energy can be

studied to understand the energy transfer between the polymers and the flow [51, 52]. Alternatively, if

polymer interaction is approached from the perspective of the viscous theory, Benzi [53] highlighted the

different physical roles played by the different components of the conformation tensor Cij , highlighting

the important role of Cyy which appears in the momentum and kinetic energy equations as an effective

viscosity.

Furthermore, the conformation tensor Cij is a measure of the second-order moment of the end-

to-end distance vector of the polymer dumbbell. It can be written as (2.6) where the vector r is the

separation vector between the two beads of the dumbbell. From the definition, it follows that the confor-

mation tensor is a symmetric positive definite (SPD) matrix [39]. Hulsen [54] proved that during exact

time evolution the conformation tensor must remain positive definite if it were initially. Nonetheless, cu-

mulative numerical errors that arise from virtually all initial value problem algorithms can give rise to

negative eigenvalues, which in turn cause the unbounded growth of Hadamard instabilities that quickly

overwhelm the calculation [55].

23

Another important property of the conformation tensor is that the trace, which represents the square

of the separation distance, must always be less than the square of the maximum extension, i.e., r2 <=

L2. The model guarantees this property through the force term f(Ckk), which diverges in strength as

this limit is approached, as mentioned previously. Hence for flows of arbitrary strength, the restoring

force is always sufficient to maintain this constraint. However, numerical errors in the evaluation of Ckk

can lead to violations of this constraint. Extension past L2 causes the force to change sign, resulting in

the rapid divergence of the calculation [39].

3.4 Computational domain

The computational domain for the turbulent jet spatial simulation consists of a box with uniform grid and

with Lx,Ly and Lz dimensions in each of the x (streamwise), y (normal) and z (spanwise) directions

(see Figure 3.1). The y and z directions have periodic boundary conditions, whilst the streamwise x is a

non-periodic direction with inflow and outflow boundaries.

Figure 3.1: View of the computational domain with reference frame and notation. From [16].

3.5 Spatial discretization

This section details all the spatial discretization procedures. The code employs the following structure:

Momentum equation terms highly accurate pseudo-spectral schemes to solve derivative on the normal

and spanwise directions and uses a compact scheme on the streamwise direction. As for the Con-

formation Tensor Transport equation, it is employed the 2nd order scheme Kurganov-Tadmor and finite

difference method.

3.5.1 Pseudo-spectral scheme

Pseudo-spectral schemes are used in spatial discretization in the normal and spanwise direction, as

these schemes are limited to the treatment of periodic functions. These schemes allow the representa-

24

tion of periodic functions in the Fourier space (also called spectral) in terms of frequencies for temporal

functions, and in terms of wavenumbers for spatial functions. Due to their high-level of accuracy without

numerical dispersion (to the machine precision), pseudo-spectral schemes are extensively used in CFD

[56]. Given a periodic flow variable along the normal y and spanwise directions z in the physical space,

φ(x, y, z, t), an inverse 2D discrete Fourier transform can be used to expand it through (3.10),

φ(x, y, z, t) =

ny2 −1∑

j=−ny2

nz2 −1∑

k=−nz2

φ(x, ky, kz, t)eι(kyy+kzz) (3.10)

where j, k are the indexes of the points along the normal y and spanwise z direction, respectively, and

(ky, kz) are the Fourier wave numbers defined by,

ky =2π

Lyj

kz =2π

Lzk. (3.11)

where i =√−1 is the imaginary unit, Ly and Lz are the spatial dimensions along the y and z directions,

and ny and nz are the number of discrete mesh points along those same directions, respectively.

Each Fourier coefficient φ(x, y, z, t), can then be obtained using the (direct) 2D discrete Fourier

transform,

φ(x, ky, kz, t) =1

nynz

ny−1∑j=0

nz−1∑k=0

φ(x, y, z, t)e−ι(kyy+kzz) (3.12)

It is important to note that the spectral variable φ is defined as a function of streamwise distance x,

as this direction does not present a periodic behaviour on the planar jet. Having defined the general

Discrete Fourier Transform (DFT), it is important to note that one of the main features of this method

is the simplicity of most operations in the spectral space [57]. In fact, derivatives in the physical space

result from the simple multiplication in the Fourier space, like

∂φ

∂y= ιkyφ

∂φ

∂z= ιkzφ (3.13)

resulting in a computationally light operation, with the accuracy only suffering from the rounding errors

and the discretization for the Fourier Transform. In addition to the high-level accuracy, the Fourier Trans-

form became more computationally interesting in 1965 with the publication of the Fast Fourier Transform

(FFT) algorithm. This algorithm presents a computational cost proportional to N log2N , where N is the

number of discretization points used. This cost is quite low comparing to the straightforward computation

of one-dimension Fourier transform with a proportional computational cost of N2 [58].

25

3.5.2 Compact finite differences discretization

The streamwise direction does not present a periodic behaviour, with one boundary being the jet inlet

with a predefined velocity and noise, and the other boundary being the outlet which ensures the flow is

not perturbed. Hence, spectral schemes are excluded to discretize the streamwise direction. In order

to maintain high accuracy, the code employs a 6th order compact finite difference scheme, described

in various references such as [40], [16] and [59]. It has been shown that these schemes provide easily

high order accuracy and possess some good ”spectral characteristics” [59] [18]. The basic idea of this

scheme is the definition of a derivative in terms of its neighbouring derivatives, which means the scheme

is implicit and requires solving a linear system of equations.

Considering a uniform grid with spatial coordinates xi = (i − 1)∆x, with ∆x = Const. (i ∈ [1, nx],

xi ∈ [0, Lx]), and using Taylor expansions of the function fi = f(xi) = f(x) with first derivative f ′i =

f ′(xi) = dfdx (xi) and second derivative f ′′i = f ′′(xi) = d2f

dx2 (xi), one can defined the general formula as

p∑j=−p

αjf′i+j =

q∑k=−q

akfi+k +O(∆xn) (3.14)

with p, q ∈ N. The order of the approximation n depends on the restrictions imposed upon the numbers

αj and ak. For the 6th order compact scheme, p = 2 and q = 3 in (3.14). Thus, (3.15) is obtained for the

first derivative calculation.

βf ′i−2 + αf ′i−1 + f ′i + αf ′i+1 + βf ′i+2 = afi+1 − fi−1

2∆x+ b

fi+2 − fi−2

4∆x+ c

fi+3 − fi−3

6∆x+O(∆xn) (3.15)

For this case the restrictions on (α, β, a, b, c) are:

• a+ b+ c = 1 + 2α+ 2β if n ≥ 2.

• a+ 22b+ 32c = 2 3!2! (α+ 22β) if n ≥ 4.

• a+ 24b+ 34c = 2 5!4! (α+ 24β) if n ≥ 6.

• ...

The calculation of each streamwise derivative depends on each particular point location. All interior

points, i.e., points between i = 3, ..., nx − 2, are calculated with a 6th order accuracy central compact

formulation given by:

αf ′i−1 + f ′i + αf ′i+1 = afi+1 − fi−1

∆x+ b

fi+2 − fi−2

∆x(3.16)

where α = 1/3 and enforcing the following restrictions on a and b,

a = (α+ 2)/3 = 7/9

b = (4α− 1)/12 = 1/36(3.17)

For those points near the two boundaries along the streamwise direction, i.e., i = 2 and i = nx−1, a

classical 4th order Pade scheme was used, corresponding to setting α = 1/4 and,

26

a = (α+ 2)/3 = 3/4

b = (4α− 1)/12 = 0.(3.18)

To complete the calculation of the derivatives in the streamwise x direction, a non-centred 3rd order

implicit scheme is used with forward and backward differencing for the inlet (i = 1) and outlet boundary

(i = nx), respectively.

f ′1 + 2f ′2 =1

2∆x(−5f1 + 4f2 + f3) (3.19)

f ′nx+ 2f ′nx−1 =

1

2∆x(5fnx

− 4fnx−1 − fnx−2) (3.20)

The linear system of equations to calculate the first derivative in the streamwise direction is sum-

marised in (3.21) in matrix form. In practice, the calculation of a single derivative at given point

1 α1

α2 1 α2

α 1 α

. . . . . . . . .

α 1 α

. . . . . . . . .

α 1 α

αn−1 1 αn−1

αn 1

f ′1

f ′2

f ′3...

f ′i...

f ′nx−2

f ′nx−1

f ′nx

=

1

∆x

a1 b1 c1

−a2 0 a2

−b −a 0 a b

. . . . . . . . . . . . . . .

−b −a 0 a b

. . . . . . . . . . . . . . .

−b −a 0 a b

−an−1 0 an−1

cn bn an

f1

f2

f3

...

fi...

fnx−2

fnx−1

fnx

(3.21)

with the following coefficients, as discussed above,

α1 = αn = 2 , a1 = −an = −5

2, b1 = −bn = 2 , c1 = −cn =

1

2

α2 = αn−1 =1

4, a2 = an−1 =

3

4

27

α =1

3, a =

7

9, b =

1

36

The compact scheme for the second derivative is obtained in a similar manner as the first derivative.

An analogous equation to (3.22), where f ′i− > f ′′i , can be written as

βf ′′i−2 + αf ′′i−1 + f ′′i + αf ′′i+1 + βf ′′i+2 = afi+1 − 2fi + fi−1

∆x2+

bfi+2 − 2fi + fi−2

4∆x2+ c

fi+3 − 2fi + fi−3

9∆x2+O(∆xn)

(3.22)

and the restrictions on (α, β, a, b, c) become:

• a+ b+ c = 1 + 2α+ 2β if n ≥ 2.

• a+ 22b+ 32c = 2 4!2! (α+ 22β) if n ≥ 4.

• a+ 24b+ 34c = 2 6!4! (α+ 24β) if n ≥ 6.

• ...

Like for the first derivatives, the second derivative of all interior points (from i = 3 to i = nx − 2) were

calculated with a 6th order central compact formulation, given by

αf ′′i−1 + f ′′i + αf ′′i+1 = afi+1 − 2fi + fi−1

∆x2+ b

fi+2 − 2fi + fi−2

∆x2(3.23)

where α = 2/11 and,

a = 4(1− α)/3 = 12/11b = (4α− 1)/12 = 3/44 (3.24)

As before, the points next to boundary (at i = 2 and i = nx−1) are obtained with a classical 4th order

Pade scheme. For the second derivative this is done with α = 1/10 and,

a = 4(1− α)/3 = 12/10b = (4α− 1)/12 = 0 (3.25)

At the boundary points (at i = 1 and i = nx) it is considered a non-centred 3rd order implicit scheme

given by

f ′′1 + 11f ′′2 =1

2∆x2(13f1 − 27f2 + 15f3 − f4) (3.26)

f ′′nx+ 11f ′′nx−1 =

1

2∆x2(13fnx

− 27fnx−1 + 15fnx−2 − fnx−3) (3.27)

28

The final system of equations for the second derivative results in,

1 α1

α2 1 α2

α 1 α

. . . . . . . . .

α 1 α

. . . . . . . . .

α 1 α

αn−1 1 αn−1

αn 1

f ′′1

f ′′2

f ′′3...

f ′′i...

f ′′nx−2

f ′′nx−1

f ′′nx

=

1

∆x2

a1 b1 c1 d1

a2 −2a2 a2

b a −2(a+ b) a b

. . . . . . . . . . . . . . .

b a −2(a+ b) a b

. . . . . . . . . . . . . . .

b a −2(a+ b) a b

an−1 −2an−1 an−1

dn cn bn an

f1

f2

f3

...

fi...

fnx−2

fnx−1

fnx

(3.28)

with the coefficients,

α1 = αn = 11 , a1 = an = 13 , b1 = bn = −27 , c1 = cn = 15 , d1 = dn = −1

α2 = αn−1 =1

10, a2 = an−1 =

12

10

α =2

11, a =

12

11, b =

3

44

(3.21) and (3.28) are presented more compactly by

A1f′ =

1

∆xB1f (3.29)

A2f′′ =

1

∆x2B2f (3.30)

where A1, A2, B1 and B2 are (nx × nx) matrices and f , f ′ and f ′′ are (nx × 1) vectors.

Having defined the matrices A1, A2, B1 and B2, the calculation of the derivative vectors f ′ and f ′′

from vector f is the following:

1. Inversion of A1 and A2 matrices to obtain A1−1 and A2

−1

29

2. Multiplication of matrices B1 and B2 by the vector 1∆xf , resulting in two new vectors of size (nx×1)

(1

∆xB1f

),

(1

∆x2B2f

)(3.31)

3. Multiplication of matrices A1−1 and A2

−1 by the vectors 1∆xB1f and 1

∆x2B2f to obtain the first and

second derivatives, respectively:

A1−1

(1

∆xB1f

), A2

−1

(1

∆x2B2f

). (3.32)

It is important to note that this procedure is computationally lighter than computing the inversion of

M1 = A1−1B1 and M2 = A2

−1B2 matrices and then performing the multiplication of M1 and M2 by f .

As M1 and M2 are full matrices, this option would require several operations proportional to n2x, whilst

the above procedure only requires the inversion of matrices A1 and A2, resulting in a computational cost

proportional to nx.

3.5.3 Conformation Tensor Transport Equation

The spectral and high-order compact schemes are not suitable for solving the conformation tensor trans-

port equation, since they lose spectral convergence near the descontinuities [36]. Therefore, in spatial

domain, a second-order central difference scheme is adopted for the discretization, except for the con-

vection term in which it is considered the Kurganov-Tadmor scheme (KT).

∂Cij∂t

+ uk∂Cij∂xk︸︷︷︸

Convection

=∂ui∂xk

Cjk +∂uj∂xk

Cik︸︷︷︸Stretching

−1

ρ

T[p]ij

ν[p](3.33)

Convection Term: Kurganov-Tadmor (KT) method

The convection term is discretized with a second-order KT scheme, described in [36], to guarantee

the Symmetric and Positive Definite (SPD) property of the conformation tensor, C, at all times and all

points. Spatial derivatives are second-order accurate everywhere, except for the grid points losing SPD

property. Where it occurs the scheme automatically reverts to first-order accurate for that grid point to

maintain the SPD property is maintained. The equations for the six independent components of C are

coupled to the velocity field and must be solved simultaneously. This scheme is given by:

u · ∇C =Hxi+1/2,j,k −Hx

i−1/2,j,k

∆x+

Hyi,j+1/2,k −Hy

i,j−1/2,k

∆y+

Hzi,j,k+1/2 −Hz

i,j,k−1/2

∆z, (3.34)

where the convective flux tensor H in each direction is given by:

Hxi+1/2,j,k =

1

2ui+1/2,j,k(C+

i+1/2,j,k + C−i+1/2,j,k)− 1

2|ui+1/2,j,k|(C+

i+1/2,j,k −C−i+1/2,j,k) (3.35)

30

Hyi,j+1/2,k =

1

2vi,j+1/2,k(C+

i,j+1/2,k + C−i,j+1/2,k)− 1

2|vi,j+1/2,k|(C+

i,j+1/2,k −C−i,j+1/2,k) (3.36)

Hzi,j,k+1/2 =

1

2wi,j,k+1/2(C+

i,j,k+1/2 + C−i,j,k+1/2)− 1

2|wi,j,k+1/2|(C+

i,j,k+1/2 −C−i,j,k+1/2) (3.37)

The superscripts ‘+’ and ‘–’ on the right hand side in (3.35), (3.36) and (3.37) designate the values

of the conformation tensor at the interface obtained in the limit approaching the point of interest from

the right (+) or left (–) side. The conformation tensor C at the interface is constructed from the following

second-order linear approximations:

C±i+1/2,j,k = Ci+1/2±1/2,j,k ∓(

∆x

2

)(∂C

∂x

)i+1/2±1/2,j,k

(3.38)

C±i,j+1/2,k = Ci,j+1/2±1/2,k ∓(

∆y

2

)(∂C

∂y

)i,j+1/2±1/2,k

(3.39)

C±i,j,k+1/2 = Ci,j,k+1/2±1/2 ∓(

∆z

2

)(∂C

∂z

)i,j,k+1/2±1/2

(3.40)

This central difference approximation to the convective term not only allows to capture the potentially

sharp variations in the conformation field, but also by writing the convective flux with a difference for-

mula the conservation of the mean conformation tensor is automatically satisfied [36]. To complete the

expression, approximations for the spatial derivatives of the conformation tensor are required. These

quantities must be defined in a way that maintains SPD property for each C±. The potential candidates

for approximating the gradient are:

(∂C∂x

)i,j,k

=

Ci+1,j,k−Ci,j,k

∆x

Ci,j,k−Ci−1,j,k

∆x

Ci+1,j,k−Ci−1,j,k

2∆x

(3.41)

It is selected the derivative approximation that can yield SPD results for C+i−1/2 and C−i+1/2. When two

or more candidates satisfy the criterion, it is selected the one which maximizes the minimum eigenvalue

for these two tensors. When none of them meet this criterion, the derivative is set to zero, reducing to

first-order accurate. This slope limiting procedure will ensure that all C± are SPD.

The update of the conformation tensor C requires the area-averaged velocity at the edge of the

volume surrounding each grid point. However, a complication of using a pseudo spectral code in the

velocity update is that the area-averaged velocities at cell edges that are not known. As mentioned in

[36], a simple estimate would be to average the nearest neighbouring grid points, but this approach

31

would not satisfy:

ui+1/2,j,k − ui−1/2,j,k

∆x+vi,j+1/2,k − vi,j−1/2,k

∆y+wi,j,k+1/2 − wi,j,k−1/2

∆z= 0, (3.42)

which follows from the continuity equation over a control volume. Violation of (3.42) leads to numerical

errors in the volume average of C. The exact expression for the area-averaged velocity is given by:

ui+1/2,j,k =1

∆y∆z

∫ yj+∆y/2

yj−∆y/2

∫ zk+∆z/2

zk−∆z/2

u

(xi +

∆x

2, y, z

)dydz (3.43)

vi,j+1/2,k =1

∆x∆z

∫ xi+∆x/2

xi−∆x/2

∫ zk+∆z/2

zk−∆z/2

u

(x, yi +

∆y

2, z

)dxdz (3.44)

wi,j,k+1/2 =1

∆x∆y

∫ xi+∆x/2

xi−∆x/2

∫ yj+∆y/2

yj−∆y/2

u

(x, y, zk +

∆z

2

)dxdy (3.45)

where ui+1/2,j,k, vi,j+1/2,k and wi,j,k+1/2 are the area-averaged velocities at the edge of the volume

surrounding each grid point. In [45] and [36], the velocities u(x, y, z), v(x, y, z) and w(x, y, z) are ob-

tained from the inverse transform of the Fourier coefficients u(kx, ky, kz), v(kx, ky, kz) and w(kx, ky, kz).

However, since the streamwise direction x is not periodic this method is not applicable. Thus, the area-

averaged velocities are computed considering the physical domain along the streamwise direction x and

in the spectral domain along the normal y and spanwise z directions. The resulting expressions for the

cell velocities are:

ui+1/2,j,k = FT−1

{u(x, ky, kz)

sin(ky∆y/2)

ky∆y/2

sin(kz∆z/2)

kz∆z/2

}(3.46)

vi,j+1/2,k =1

∆x

∫ xi+∆x/2

xi−∆x/2

FT−1

{v(x, ky, kz)e

iky∆y/2 sin(kz∆z/2)

kz∆z/2

}dx (3.47)

wi,j,k+1/2 =1

∆x

∫ xi+∆x/2

xi−∆x/2

FT−1

{w(x, ky, kz)e

ikz∆z/2 sin(ky∆y/2)

ky∆y/2

}dx (3.48)

where FT−1 is the inverse fast Fourier transform, u(x, ky, kz), v(x, ky, kz) and w(x, ky, kz) are the Fourier

coefficients at each grid point. For the streamwise direction it is required to calculate the velocity and

integrate along x in the physical space. Along the streamwise direction x, the velocity on the cell edge

is interpolated with 5th order Lagrange Polynomials and the integration along the streamwise direction

is performed considering Gauss-Legendre 3 point rule. The computation procedure to assess the area

averaged velocity is the following:

1. Interpolation of velocity along the x direction for the required points. When assessing the area

averaged velocity u(x, y, z), the streamwise velocity component is interpolated on xi+1/2. When

assessing the area averaged velocities v(x, y, z) and w(x, y, z), the required points to interpolate

are the Gauss-Legendre nodes for the 3 point rule. In both cases, the interpolation is considered

a centred 5th Lagrange Polynomial for interior points (from i = 3 to i = nx−3). On the remaining

points, the polynomial order is maintained and the grid points used to interpolate are shifted in

32

order to only consider information within the domain (for example, to interpolate the velocity at

i = 1 + 1/2 it is considered the points from i = 1 to i = 6). Moreover, on the velocity points that are

outside the domain boundaries (for example, edge i− 1/2 for i = 1 and edge i+ 1/2 for i = nx) it

is assumed that the edge u velocity value is equal to the i grid point u velocity value;

2. 2D discrete Fourier transformation of the velocity field;

3. Computation of the integral along the periodic directions y and z on the spectral space;

4. Computation of the cell edge velocity for the velocities v and w along the periodic directions on the

spectral space;

5. Inverse 2D discrete Fourier transform of the velocity field and computation of the velocity integral

along the streamwise direction x. It is important to note that this step is not required for the area

averaged u velocity. The integration on the physical space is performed with the Gauss-Legendre

3 point rule, except for the boundary points in which the Gauss-Legendre reverts to the 1 point rule.

The Gauss-Legendre 3 point formulation for the area averaged normal velocity v can be written

as:

v(i, j + 1/2, k) =1

2

[A1F

(x+

∆x

2ξ1

)+A2F

(x+

∆x

2ξ2

)+A3F

(x+

∆x

2ξ3

)](3.49)

F (x) = FT−1

{v(x, ky, kz)e

iky∆y/2 sin(kz∆z/2)

kz∆z/2

}(3.50)

where ξi and Ai are the Gauss nodes and weights, respectively, and are given by

3 point rule 1 point rule

ξi Ai ξi Ai

node

1 −√

15/5 5/9 0 2

2 0 8/9

3√

15/5 5/9

Table 3.1: Gauss-Legendre integration nodes and weights. From [60].

The area averaged spanwise velocity w is obtained analogously to v.

It should be noted that the formulation presented is applicable to the velocity on positive edges i+ 1/2,

j + 1/2 and k + 1/2. The area averaged velocities on negative edges i − 1/2, j − 1/2 and k − 1/2

are obtained by shifting one position backwards the positive edge velocity field in the respective velocity

direction. This shifting requires special treatment on the boundaries. For the normal v and spanwise

w velocities, the lower boundary for the negative edge cell velocities is equal to the upper boundary

positive edge cell velocities, thus ensuring a period behaviour. Regarding the streamwise velocity u, the

negative cell edge velocity at the inlet boundary is assumed to be the velocity value u at the inlet.

33

Stretching Term and Polymer Stress Coupling Term: Finite difference method

The code was developed for use with a finite difference approach in which the differential equations

are approximated at each point. For the stretching term and the polymer stress coupling term, the

first derivatives were computed with a central difference scheme second-order accurate everywhere in

space, except for the boundaries on the streamwise direction.

For the stretching term, this scheme results in:

∂ui∂xk

Cjk +∂uj∂xk

Cik =(ui)m+1 − (ui)m−1

2∆xkCjk +

(uj)m+1 − (uj)m−1

2∆xkCik (3.51)

where i, j and k are the computational indexes and m is an auxiliary variable that represents i, j and

k for the ∂/∂x, ∂/∂y and ∂/∂z derivatives, respectively. On the inlet and outlet boundaries the central

difference scheme is reverted to a forward and backward finite difference first-order accurate scheme,

respectively. The forward finite difference scheme is given by:

∂ui∂x

Cj1 +∂uj∂x

Ci1 =(ui)2 − (ui)1

∆xCj1 +

(uj)2 − (uj)1

∆xCi1 (3.52)

The backward finite difference scheme is given by:

∂ui∂x

Cj1 +∂uj∂x

Ci1 =(ui)nx

− (ui)nx−1

∆xCj1 +

(uj)nx− (uj)nx−1

∆xCi1 (3.53)

For the polymer stress coupling term on the momentum equations, this scheme results in:

∂T[p]ij

∂xj=ν[p]

τp

[(f(Ckk)Cij − δij)m+1 − (f(Ckk)Cij − δij)m−1

2∆xk

](3.54)

where i and j are the computational indexes and m is an auxiliary variable that represents i, j and k

for the ∂/∂x, ∂/∂y and ∂/∂z derivatives, respectively. Similarly to the stretching term, the coupling term

on the inlet and outlet boundaries the central difference scheme is reverted to a forward and backward

finite difference first-order accurate scheme, respectively. The forward finite difference scheme is given

by:∂T

[p]ij

∂xj=ν[p]

τp

[(f(Ckk)Cij − δij)2 − (f(Ckk)Cij − δij)1

∆x

](3.55)

The backward finite difference scheme is given by:

∂T[p]ij

∂xj=ν[p]

τp

[(f(Ckk)Cij − δij)nx

− (f(Ckk)Cij − δij)nx−1

∆x

](3.56)

34

3.6 Temporal discretization

The time advancement method of the momentum equation and for the conformation tensor transport

equation is an explicit 3rd order Runge-Kutta scheme of numerical integration. The equations to be

solved can be written as follows:

∂ui∂t

= N (ui) + L[s] (ui) + L[p] (Cij)−∇πp (3.57)

where N(ui) is the non-linear convective term

N(ui) = −uj∂ui∂xj

+∂

∂xi

ujuj2

= ui × ωi (3.58)

and L[s](ui) represents the viscous term due to the solvent

L[s](ui) = ν∂2ui∂xj∂xj

= ν∇2u (3.59)

and L[p](Cij) represents the viscous term due to the polymer

L[p] (Cij) = ν[p]∇ ·(f(Ckk)Cij − δij

τp

)(3.60)

and πp is the modified pressure given by

π =p

ρ+ujuj

2(3.61)

Having determined the terms in (3.59) to (3.61), it is employed the 3 step, 3rd order Runge-Kutta scheme,

which calculates each sub-step velocity up from the last two time sub-steps p− 1 and p− 2 using

uip − uip−1

∆t= αp

[N(ui

p−1) + L[s](uip−1) + L[p](Cij

p−1)]

+ βp

[N(ui

p−2) + L[s](uip−2) + L[s](Cij

p−2)]− ∂πp

∂xi

(3.62)

The coefficients αp and βp in (3.62) for a 3rd order accuracy with 3 time sub-steps p = 1, 2, 3 are defined

in (3.63) [61].

α1 = 8/15 β1 = 0

α2 = 5/12 β2 = −17/60

α3 = 3/4 β3 = −5/12

(3.63)

3.6.1 Pressure-velocity coupling

To solve (3.62), the pressure field π must be known at each time sub-step p. In order to know the

pressure term, each Runge-Kutta step is divided into two sub-steps using a method called fractional

step ([62, 63] through [16]). This method is used to solve the simultaneous calculation of the velocity and

pressure field. This is necessary to insure incompressibility of the velocity field at each time sub-step and

35

it implies that a Poisson equation must be solved at each sub-step of the Runge-Kutta scheme. Firstly,

the terms N(ui), L[s](ui) and L[p](Cij) are computed, and an intermediate velocity uip∗ is calculated as

uip∗ − uip−1

∆t= αp

[N(ui

p−1) + L[s](uip−1) + L[p](Cij

p−1)]

+ βp

[N(ui

p−2) + L[s](uip−2) + L[p](Cij

p−2)] (3.64)

Secondly, the pressure field is calculated by imposing the continuity equation. Subtracting (3.64) from

(3.62), one obtainsuip∗ − uip−1

∆t= −∂π

p

∂xi(3.65)

By taking the divergence of (3.65), the Poisson equation (3.66) is obtained.

∇ · uip −∇ · uip∗

∆t= −∇2πp (3.66)

At each Runge-Kutta step the continuity equation must be verified,

∇ · uip = 0 (3.67)

When this is applied to (3.66), one obtains:

−∇ · uip∗

∆t= −∇2πp (3.68)

Thus, solving the Poisson equation (3.68) will result in the modified pressure, which, when used in (3.65),

yields the velocity at the end of each Runge-Kutta step p.

3.6.2 Non-Linear Term in the Momentum Equation

The terms of velocity momentum equation are discretised in the Fourier space for the spanwise and

normal directions (y, z) and 6th order compact differences for the derivatives in the streamwise direction

(x) [16]. The velocity at each sub-step p is determined by the following equations:

uip∗ − uip−1

∆t= αp

[Nx

p−1+ ν(

d2

dx− k2)ui

p−1 + Lx[p]

(Cijp−1)

]+ βp

[Nx

p−2+ ν(

d2

dx− k2)ui

p−2 + Lx[p]

(Cijp−2)

]vip∗ − vip−1

∆t= αp

[Ny

p−1+ ν(

d2

dx− k2)vi

p−1 + Ly[p]

(Cijp−1)

]+ βp

[Ny

p−2+ ν(

d2

dx− k2)vi

p−2 + Ly[p]

(Cijp−2)

]wip∗ − wip−1

∆t= αp

[Nz

p−1+ ν(

d2

dx− k2)wi

p−1 + Lz[p]

(Cijp−1)

]+ βp

[Nz

p−2+ ν(

d2

dx− k2)wi

p−2 + Lz[p]

(Cijp−2)

]

(3.69)

36

where d/dx represents the operator of compact second order differentiation and each component of the

non-linear term is given by:

Nxp = vpωpz − wpωpy

Nyp = wpωpx − upωpz

Nzp = upωpy − vpωpx

(3.70)

The vorticity components in (3.70) are given by:

ωpx = ikywp − ikz vp

ωpy = ikzup − dwp

dx

ωpz = dvp

dx − ikyup

(3.71)

In (3.70), k2 is the norm of the wavenumber vector k that appears from the calculation of a second

derivative in the spectral space.

k2 = k2y + k2

z (3.72)

After solving (3.69), the correction pressure field πp is obtained solving the Poisson problem. This

computation is performed on the Fourier space and can be written as:

(d2

dx2− k2

)πp =

1

∆t

(dup∗

dx+ iky v

p∗ + ikzwp∗)

(3.73)

Having calculated the pressure field, the final velocity field uip at the end of each Runge-Kutta step is

obtained by

up = up∗ −∆t(dπp

dx

)vp = vp∗ −∆t (ikyπ

p)

wp = wp∗ −∆t (ikzπp)

(3.74)

3.6.3 Conformation Tensor Transport Equation

To know each new velocity at each new sub-step, it is necessary to know the conformation tensor at the

previous sub-step(s). Equation (3.9) can be written in the following form:

∂Cij∂t

= N ′ (ui, Cij) +M (ui, Cij) + L[p]′ (Cij) (3.75)

where N ′ (ui, Cij) is the convection term

N ′ (ui, Cij) = −uk∂Cij∂xk

(3.76)

and M (ui, Cij) is the stretching term

M (ui, Cij) = Cjk∂ui∂xk

+ Cik∂uj∂xk

(3.77)

37

and L[p]′ (Cij) is the viscous term

L[p]′ (Cij) = −f(Ckk)Cij − δijτp

(3.78)

The same process that was applied before to the Navier-Stokes momentum equation in (3.62), is

done here. Thus, the following is obtained:

Cijk − Cijk−1

∆t= αk

[N ′(uik−1, Cij

k−1)

+M(uik−1, Cij

k−1)

+ L[p]′(Cij

k−1)]

+βk

[N ′(uik−2, Cij

k−2)

+M(uik−2, Cij

k−2)

+ L[p]′(Cij

k−2)] (3.79)

where αk and βk are the coefficients of (3.63).

It is important to notice that the Runge-Kutta step of equation (3.79) is not the same as that of the

equation (3.69). In equation (3.79), Cijk−2 ≡ Cijn−2 and Cij

k ≡ Cijn−1, whereas in equation (3.69),

Cijk−2 ≡ Cijn−1.

3.7 Stability condition

The time step used in the numerical resolution of momentum and conformation tensor transport equa-

tions is bounded by stability restrictions. In convection dominated problems (such as the one considered

in this work), the main parameter that governs the stability limits imposed on the size of the mesh ele-

ments and the time step is the Courant-Friedrich-Levy condition or Courant number (CFL) expressed by

(3.80) [58]:

CFL =|ui|∆t

∆x(3.80)

In the simulations, the Courant number is found by assessing (3.80) at each iteration for each flow

direction. The resulting ∆t can be written as:

∆t = CFL×min

{∆x

|u|max,

∆y

π|v|max,

∆z

π|w|max

}(3.81)

where in (3.81) |u|max, |v|max, |w|max are the maximum velocity in each respective direction in the veloc-

ity field. The π factor in the y and z direction arises due to the spectral nature of the spatial discretisation

in those directions [64]. In the current work, a visco-elastic fluid is considered with a conformation tensor

having a second order discretisation scheme. Vaithianathan [36] demonstrated that to ensure the con-

formation tensor Cij remains positive as the simulation develops, the following Courant condition must

be satisfied:

CFL <1

6≈ 0.1667 (3.82)

which represents a more severe stability condition than the one considered for Newtonian fluids CFL <

0.6 [18]. For this reason, the simulations consider a maximum CFL number of 0.16. The derivation of

the maximum CFL number is presented in appendix A.

38

3.8 Boundary and Initial Conditions

In this section it is described the different boundary conditions deployed in each location of the domain

for the spatial plane jet.

3.8.1 Inlet boundary

The inlet conditions are imposed by forcing a velocity profile and a conformation tensor profile at each

time step for the streamwise component of the velocity (U ) and for the polymer conformation tensor

(Cij), respectively.

The velocity profile imposed at the inlet at each timestep is given by:

u(x0, t) = Umed(x0) + Unoise (3.83)

where x0 = (x = 0, y, z) is the inlet plane, Umed is the mean inlet profile and Unoise is a superimposed

random numerical noise. For the planar jet, the mean velocities in the normal y and spanwise z directions

are set to zero at the inlet. In the streamwise direction x the prescribed velocity profile is based in a

hyperbolic tangent profile used in [65] and [16]. The mean inlet velocity is expressed in (3.84).

Umed(x0) =U2 + U1

2+U2 − U1

2tanh

[h

4θ0

(1− 2|y|

h

)](3.84)

In (3.84) θ0 is the momentum thickness of the initial shear layer, h is the jet inlet slot width and y is the

distance from the centre of the jet, i.e., y = 0. U1 is the co-flow velocity and U2 is the centreline velocity.

The co-flow is required to allow the growth of the jet’s shear layer, otherwise hampered by the lack of

natural entrainment from the periodic lateral boundaries. However, U1 cannot be too high, as it might

change the instability characteristic of the initial mean velocity profile [16]. An example of the mean

velocity profile is shown in Figure 3.2, exhibiting inlet velocity from y/h = −0.5 to y/h = 0.5.

Figure 3.2: Mean velocity profile imposed at the inlet. H stands for the width of the inlet slot. From [18].

39

The numerical noise, Unoise, is superposed to the mean profile in (3.84), at each timestep t by

imposing a three-component fluctuating velocity field (u, v and w), to exhibit the statistical characteristics

of isotropic turbulence [65]. The velocity noise profile is given by

Unoise = ampUbase~u′ (3.85)

where amp is an amplification factor typically ranging from [0, 0.15] and ~u′ = (u′, v′, w′) comes from a

random number generator chosen to comply with the following energy spectrum

E(k) ∝ kse− s2 (k/k0)2 (3.86)

where s is the noise spectrum slope in the large scales region and, k0 the wave number location of

every maximum. These parameters are chosen to provide an energy input dominant at the small scales.

The numerical noise is only imposed in the shear-layer region of the mean streamwise velocity profile

through a proper convolution function.

An inlet boundary is also considered for the polymer conformation tensor. The inlet profile is based

on the assumption of steady flow, fully developed in the streamwise direction x. Considering the inlet ve-

locity profile without co-flow (see (3.84)), it is possible to notice that there are no normal v and spanwise

w velocities. Taking the previous remarks into account, one obtains an analytical profile for the polymer

conformation tensor, given by the real root of the following cubic equation:

C223 +

L2

2τp2(∂U∂y

)2C22 −L2

2τp2(∂U∂y

)2 = 0 (3.87)

Further detail regarding the derivation of this solution, as well as the value for each component of the

conformation tensor, is presented in section 4.1.

3.8.2 Lateral boundaries

As explained in section 3.4, the normal y and spanwise z directions of the domain are discretized with

spectral methods, which required the boundaries to be periodic. This periodicity of the lateral boundaries

is stated for all three components of the velocity field, the pressure and the polymer conformation tensor.

These can be written as:~u(x, y, z, t) = ~u(x, y + Ly, z, t)

~u(x, y, z, t) = ~u(x, y, z + Lz, t)

p(x, y, z, t) = p(x, y + Ly, z, t)

p(x, y, z, t) = p(x, y, z + Lz, t)

Cij(x, y, z, t) = Cij(x, y + Ly, z, t)

Cij(x, y, z, t) = Cij(x, y, z + Lz, t)

(3.88)

The boundaries must be set sufficiently far away from the turbulent region, otherwise any perturbation

that leaves the domain will influence the opposite side of the computational domain and the simulation

40

will diverge [66]. Moreover, the distance of the lateral boundaries is also dictated by the need to avoid

spurious reflections which would upset the flow. It must be noted that the above-mentioned issue only

affects the normal direction y under the planar jet configuration, as the spanwise direction z is naturally

periodic.

3.8.3 Outlet boundary

The outlet boundary is the most sensitive boundary condition of the planar jet. The fluid structures that

leave the domain must exit unperturbed by the outflow condition and ensuring that reflected waves do not

go back into the domain. Therefore, the code employs a non-reflective boundary condition used by [67],

where all the terms of the Navier-Stokes equations - convective and viscous - are explicitly advanced.

This method begins with the x momentum equation, in which the x component of the diffusive term

Re−1 ∂2u∂x2 and the pressure gradient ∂p

∂x and included in an unknown variable Cu(y, z). Thus, the final

equation is:∂u

∂t= −Cu(y, z)

∂u

∂x− v ∂u

∂y− w∂u

∂z+

1

Re

(∂u2

∂y2+∂u2

∂z2

)(3.89)

The Cu(y, z) term is calculated at i = nx − 1 to determine the value of streamwise velocity u at each

Runge-Kutta sub-timestep p with(3.90):

up−1 − up−2

∆t= −Cu(y, z)

∂u

∂x− v ∂u

∂y− w∂u

∂z+

1

Re

(∂u2

∂y2+∂u2

∂z2

)(3.90)

Using (3.90) the streamwise derivative is calculated with a 1st order backward differences scheme,

whilst the normal and spanwise directions are determined with a pseudo-spectral scheme. Next, the

same equation is used with the Runge-Kutta time scheme to obtain the streamwise velocity at the outlet

plane i = nx. After this, the code solves the Poisson equation, as described in section 3.8.4, to obtain

the corrected normal v and spanwise w velocities at i = nx with the pressure gradient that result from

the solution of the Poisson equation. This step is shown in (3.91).

v(y, z) = v∗y(y, z)−∆t ∂p∂y

w(y, z) = w∗z(y, z)−∆t∂p∂z

(3.91)

where v∗y,z and w∗y,z are the velocity components for the normal and spanwise directions, respectively,

obtained from the Navier-Stokes equations without the pressure gradient term. This scheme was shown

by Reis [16] to ensure a smooth exit of the coherent turbulent structures, with negligible perturbations

from the presence of the outflow boundary condition. It must be noted that no special treatment of the

boundary outlet condition is required on the polymer conformation tensor, as the outflow condition will

be reflected through the velocity field on the polymer conformation transport equation.

3.8.4 Poisson equation

In the case of the Poisson equation, the lateral boundaries for the spanwise and normal direction are

periodic, due to the pseudo-spectral scheme. For the inlet and outlet boundaries, the pressure boundary

41

conditions are of Neumann type. This requires values of the pressure gradient outside of the computa-

tional domain, for x = −∆x and x = Lx + ∆x]. The values on these points are taken as being equal to

the corresponding boundary points:∂p

∂x x=−∆x=

∂p

∂x x=0(3.92)

∂p

∂x x=Lx+∆x=

∂p

∂x x=Lx

(3.93)

The streamwise pressure gradients at the inlet are obtained from the imposed velocity fields at the inlet.

At the outlet it is obtained from the intermediate velocity at each Runge-Kutta step before the correction.

These steps are illustrated on (3.94) and (3.95), respectively.

−∆t∂p

∂x i=1= ui=1 − u∗i=1 (3.94)

−∆t∂p

∂x i=nx

= ui=nx− u∗i=nx

(3.95)

For discretisation purposes, (3.92), (3.95) corresponds to

f ′i=0 = f ′i=1 (3.96)

f ′i=nx+1= f ′i=nx

(3.97)

where 0 and nx + 1 are fictitious nodes placed outside the computational domain. Consequently, the set

of expressions in (3.98) is obtained, which allows for the inversion of the Poisson equation.

fi=1 = 87fi=2 − 1

7fi=3 − 67∆xf ′i=1

fi=0 = fi=1 −∆xf ′i=1

fi=nx= 8

7fi=nx−1 − 17fi=nx−2 − 6

7∆xf ′i=nx

fi=nx+1 = fi=nx−∆xf ′i=nx

(3.98)

3.9 Code architecture

As mentioned previously in section 3, the starting point of the numeric work here presented is a DNS

code for turbulent jet described in Ricardo Reis’ PhD thesis [16] and Diogo Lopes’ PhD thesis [18], having

also defined the core of the parallelization strategy was developed in the above-mentioned works. In this

section it is described the parallel code architecture, such as the data dependency, read/write operations

and domain data decomposition.

3.9.1 Parallel architecture

The parallelization strategy was designed with the aim of minimizing communication between proces-

sors, whilst being able to meet the requirements of the problem. The current numerical work is paral-

lelized with MPI and is described in detail in [16]. The main operations to be performed by the numerical

42

scheme for each Runge-Kutta step are the calculation of the non-linear and the viscous terms, to obtain

the velocity field, followed by the resolution of the Poisson equation to obtain the pressure field and the

corrected velocity.

3.9.2 Parallel domain decomposition

The computational domain is implemented on a mixed decomposition, with the co-existance of stream-

wise slabs and normal slices (see Figure 3.3). The reasoning underlying this configuration is presented

in [16].

Figure 3.3: View of the domain partitioning with reference frame and notation. From [18].

The data field configuration is swapped according to the operation taking place, according to the

following structure:

• The direct and inverse Fourier transforms are computed in independent yz planes, thus being

intrinsically connected to the normal slice configuration;

• The compact scheme for the streamwise derivatives is computed on the slab configuration, as it

requires a complete line of data on the streamwise direction not available on normal slice configu-

ration;

• The streamwise derivatives for the conformation tensor transport equation do not require complete

lines of data on the streamwise x direction. Thus, it was opted to perform the calculation on

the normal slice configuration with the aid of ghost cells, avoiding the need to perform intensive

communications between processors to swap data configuration.

The data exchange between the two data configurations uses the function MPI AlltoAll and is per-

formed in every Runge-Kutta sub-step.

Data exchange with ghost cells

Ghost cells (also referred as halo cells on the literature [50]) here consist in replicated data from neigh-

bouring partitions that is passed between processors. The replicated data is only performed on the

streamwise direction, as the remaining directions are periodic. This additional chunk of data in each

43

normal slice allows the computation of the streamwise derivative, avoiding the use of a MPI AlltoAll

communication transfer. This process requires data exchange at each Runge-Kutta sub-step. For a

given centred domain partitioned on normal slices, two streamwise data planes are communicated on

each side of the slices, except on the ones that are located on the streamwise boundaries. An illustrative

example of ghost cell application is presented in Figure 3.4.

(a) Normal slice configuration without ghost cell data on the xy plane.

(b) Normal slice configuration with ghost cell data on the xy plane.

Figure 3.4: Ghost cell configuration data exchange on the streamwise direction for a domain partitionedin 3 normal slices.

3.9.3 FFT calculation

The Fast Fourier Transforms are calculated with the Fastest Fourier Transform in the West (FFTW)

library [68]. FFTW is a high-performance library, released under the GNU General Public License1,

with portable features, maintaining the program performance on most computer architectures without

the need of modification. Due to the nature of FFT algorithms, the Fourier transform computation speed

is maximized when the prime factorization of the number of grid points in the spectral directions is

comprised by low prime numbers [18].

3.9.4 I/O

DNS programs have input and output (I/O) intensive procedures, being one of the slowest operations

found in computing systems [16]. Moreover, the amount of space required is quite demanding, increas-

ing three times with the mesh size. For example, the reference simulation performed in [18] comprised a

1See https://www.fftw.org

44

600 million point mesh, resulting in 2.3GB per data file. Several I/O methods were considered, which are

described in detail in [16] and [18]. The preferred I/O method is the MPI-IO, which allows the processors

to write and read their part of a common file. In addition, as there is a provision to dynamically adjust the

scope of action for each processor when interacting with the file, this method allows simulation restarts

using a different number of processors.

45

Chapter 4

Code Verification

To ensure full reliability on the results generated by the spatial jet DNS code with FENE-P, extensive code

verification tests with known results were performed. The code verification exercise was conceptualized

with the purpose of assessing all terms related to the visco-elastic characteristics of the simulation that

were added to the existing DNS code. Thus, the exercise was organized according to the following

structure:

1. Conformation Tensor Transport Equation

• It was considered a 2D Couette flow for verification purposes. With this particular flow, an

analytical solution was computed for the polymer conformation transport equation;

• The code was modified in order to allow the interaction of the numerical scheme with this flow.

Namely, the initial velocity profile was frozen for each Runge-Kutta step, which means it was

kept constant throughout the simulation;

• Being a 2D flow scheme, the verification analysis was performed for all 2D planes to trigger

and verify all the equation components.

2. Velocity - Conformation Tensor Coupling term

• It was performed the verification of the visco-elastic coupling term on the momentum equation

by triggering it with a known Cij conformation tensor;

• By having a known imposed Cij , an analytical solution was found and compared to the result

from the numerical scheme.

It should be noted that the verification exercise of the overall 3D numerical simulation code does

not comprise a comparison to other simulation results, as no comparable simulation set-ups, i.e., con-

sidering non-periodic boundary conditions on a spatial jet with polymer additives, were found on the

literature.

47

4.1 Conformation Tensor Transport Equation Verification

The Conformation Tensor Transport Equation was verified by performing an exercise with a frozen in

time 2D Couette flow, thus allowing the comparison with a steady analytical solution. The velocity profile

was kept constant throughout the simulation by imposing it in each Runge-Kutta step.

Being a 2D flow scheme and as the streamwise direction does not present a periodic behaviour in

the spatial jet flow (e.g. discretization is not performed with a pseudo-spectral scheme in this direction),

the verification analysis was performed for all 2D planes so that all the transport equation components

are triggered.

In this section it is presented in detail the analytical solution for the transport equation, assuming a

2D steady flow for the u velocity component fully-developed in the x direction. The derivation consid-

ering these assumptions is also presented on [57]. It should be noted that the derivation is performed

analogously for the remaining velocity profiles considered in this verification exercise.

The velocity profile for the u velocity component in function of y normal direction is presented in

Figure 4.1.

Figure 4.1: u velocity profile in the normal y direction.

From Figure 4.1 it is noticeable that the velocity profile is a mirrored Couette profile, due to the nu-

merical scheme and the need to ensure periodic conditions. Therefore, the discontinuity points observed

on the profile (y/h = −5, 0, 5 for Figure 4.1) are not comparable to the analytical solution.

The profile of the conformation tensor was set to an identity matrix of size 3 at the beginning of the

simulation.

The conformation tensor transport equation is presented previously in (3.9). Considering the Polymer

Stress Tensor relationship from (3.6), the transport equation can be written as follows:

∂Cij∂t

+ uk∂Cij∂xk

=∂ui∂xk

Cjk +∂uj∂xk

Cik −1


As mentioned previously on Section 3.3, the conformation tensor Cij is an SPD matrix by defini-

48

tion. For this reason, Cij = Cji and, therefore, the system of linear equations only encompasses six

independent variables, corresponding to the independent components of the conformation tensor.

As it is considered a steady and fully-developed flow in the streamwise x direction, the following

assumptions are enforced:∂

∂t= 0,

∂

∂x= 0 (4.2)

Given the two-dimensional flow (velocity profile u(y)) and the conservation of mass, the following is

obtained:

v = 0, w = 0,∂

∂z= 0 (4.3)

Taking into consideration the flow assumptions from Equations (4.2) and (4.3), the conformation

tensor transport equation can be rewritten in this case as:

uk∂Cij∂xk

=∂ui∂xk

Cjk +∂uj∂xk

Cik −1


Having equation (4.4), each entry of the conformation tensor Cij must be assessed. The resulting

equations for each entry are presented below.

For component C11:

2∂u

∂yC12 −

1

τp[f(Ckk)C11 − 1] = 0 (4.5)

For component C12:∂u

∂yC22 −

1

τp[(Ckk)C12] = 0 (4.6)

For component C13:∂u

∂yC23 −

1

τp[(Ckk)C13] = 0 (4.7)

For component C22:

f(Ckk)C22 − 1 = 0 (4.8)

For component C23:1

τp[(Ckk)C23] = 0 (4.9)

For component C33:

f(Ckk)C33 − 1 = 0 (4.10)

From equation (4.9) it follows that C23 = 0, as the diagonal component of the conformation tensor

Cij are positive. Having this result, equation (4.7) can be simplified, leading to C13 = 0.

Moreover, comparing equations (4.8) and (4.10), it is noticeable that the following relationship must

be held: C22 = C33.

Rearranging equation (4.8), and considering that the Peterlin function is defined as (3.8), the following

is obtained:L2 − 3

L2 − (C11 + 2C22)C22 = 1 (4.11)

49

from which it is obtained

C11 = C22

(1− L2

)+ L2 (4.12)

From equation (4.5), it is possible to organize it as function of C22, if one considers the relationship

between variables C11, C12 and C22 from equations (4.6) and (4.12). By doing so, the following equation

is achieved:

2

(∂u

∂y

)2

τ2pC

222 =

1

C22

(C22

(1− L2

)+ L2

)− 1 (4.13)

Rearranging (4.13) for C22, it is obtained a cubic polynomial in function of C22:

C322 + C22

L2

2(∂u∂y

)2

τ2p

− L2

2(∂u∂y

)2

τ2p

= 0 (4.14)

The real root of the cubic polynomial (4.14) is the following:

C22 = 3

√√√√√ L2

4(∂u∂y

)2

τ2p

+

√√√√ L6

216(∂u∂y

)6

τ6p

+L4

16(∂u∂y

)4

τ4p

+

3

√√√√√ L2

4(∂u∂y

)2

τ2p

−√√√√ L6

216(∂u∂y

)6

τ6p

+L4

16(∂u∂y

)4

τ4p

(4.15)

The verification exercise for the conformation tensor equation considers the following simulation pa-

rameters:

General properties Polymer properties

Box height h 2π Zero shear-rate viscosity ν[p] 0.002

Box dimensions 0.9h× h× 0.118h Maximum molecular extensibility L 10

Mesh 128× 128× 128 Relaxation time (s) τp 0.1

Maximum velocity umax 1 Polymer concentration β 0.8

Table 4.1: Simulation parameters for transport equation verification.

Considering parameters from Table 4.1 and the velocity profile from 4.1, the following velocity u

derivatives are obtained:

∂u

∂y=

1π , if yh < 0

− 1π , if yh ≥ 0

(4.16)

Taking into consideration the previous remarks, the following analytical solution for the conformation

tensor Cij is obtained:

50

Cij ≈

1.002 0.0318 0

0.0318 1.000 0

0 0 1.000

for yh < 0

1.002 −0.0318 0

−0.0318 1.000 0

0 0 1.000

for yh ≥ 0

(4.17)

The numerical solution obtained from this simulation is presented in Figure 4.2.

Figure 4.2: Numerical solution for the Cij tensor considering a u velocity profile in the normal y direction.

Having both analytical and numerical results, it is performed the comparison between both. This

verification is presented in Figures 4.3 and 4.4 for the diagonal and non-diagonal conformation tensor

components, respectively. It is important to note that for each figure the dashed lines correspond to the

numerical solution, whilst the solid lines refer to the analytical solution.

From Figures 4.3 and 4.4 it is possible to observe that the numerical and analytical solutions present

the same results, thus ensuring the verification of the triggered equation components in this scenario.

51

Figure 4.3: Numerical (n) and analytical (a) solutions for the Cii tensor considering a u velocity profile inthe normal y direction.

Figure 4.4: Numerical (n) and analytical (a) solutions for the Cij , i 6= j tensor considering a u velocityprofile in the normal y direction.

52

Now it is presented the verification results for the remaining 2D planes.

For a u velocity component in function of z spanwise direction:

Figure 4.5: u velocity profile in the spanwise z direction.

As it is considered a steady and fully-developed flow in the streamwise x direction, the following

assumptions are enforced:∂

∂t= 0,

∂

∂x= 0 (4.18)

Given the two-dimensional flow (velocity profile u(z)) and from the continuity equation, the following

is obtained:

v = 0, w = 0,∂

∂y= 0 (4.19)

(a) Cii tensor components (b) Cij , i 6= j tensor components

Figure 4.6: Cij numerical (n) and analytical (a) solutions comparison considering a u velocity profile inthe spanwise z direction.

53

For a w velocity component in function of y normal direction:

Figure 4.7: w velocity profile in the normal y direction.

As it is considered a steady and fully-developed flow in the spanwise z direction, the following as-

sumptions are enforced:∂

∂t= 0,

∂

∂z= 0 (4.20)

Given the two-dimensional flow (velocity profile w(y)) and from the continuity equation, the following

is obtained:

u = 0, v = 0,∂

∂x= 0 (4.21)


Figure 4.8: Cij numerical (n) and analytical (a) solutions comparison considering a w velocity profile inthe normal y direction.

54

For a v velocity component in function of z spanwise direction:

Figure 4.9: v velocity profile in the spanwise z direction.

As it is considered a steady and fully-developed flow in the normal y direction, the following assump-

tions are enforced:∂

∂t= 0,

∂

∂y= 0 (4.22)

Given the two-dimensional flow (velocity profile v(z)) and from the continuity equation, the following

is obtained:

u = 0, w = 0,∂

∂x= 0 (4.23)


Figure 4.10: Cij numerical (n) and analytical (a) solutions comparison considering a v velocity profile inthe spanwise z direction.

55

For a w velocity component in function of x streamwise direction:

Figure 4.11: w velocity profile in the streamwise x direction.

As it is considered a steady and fully-developed flow in the spanwise z direction, the following as-


∂t= 0,

∂

∂z= 0 (4.24)

Given the two-dimensional flow (velocity profile w(x)) and from the continuity equation, the following

is obtained:

u = 0, v = 0,∂

∂y= 0 (4.25)


Figure 4.12: Cij numerical (n) and analytical (a) solutions comparison considering a w velocity profile inthe streamwise x direction.

56

For a v velocity component in function of x streamwise direction:

Figure 4.13: v velocity profile in the streamwise x direction.

As it is considered a steady and fully-developed flow in the spanwise y direction, the following as-


∂t= 0,

∂

∂y= 0 (4.26)

Given the two-dimensional flow (velocity profile v(x)) and from the continuity equation, the following

is obtained:

u = 0, w = 0,∂

∂z= 0 (4.27)


Figure 4.14: Cij numerical (n) and analytical (a) solutions comparison considering a v velocity profile inthe streamwise x direction.

It is possible to observe that the numerical and analytical solutions present the same values for

all steps in the verification exercise of the conformation tensor transport equation. Thus, the code is

deemed as verified.

57

4.2 Velocity - Conformation Tensor Coupling Term Verification

As mentioned before, the velocity - conformation tensor coupling term was verified by imposing a known

Cij field and comparing the obtained result from the numerical scheme with an analytical solution. It

should be noted that this verification assumes a frozen in time Cij field to match the analytical solution.

The velocity - conformation tensor coupling term, expressed in (3.2) and (3.6), can be written as:

∂T[p]ij

∂xj=ν[p]

τp

∂ [f(Ckk)Cij − δij ]∂xj

(4.28)

On (4.28) the terms ν[p], τp and δij are constants. Moreover, the coupling term must be verified for each

of the velocity components. Therefore, for the purpose of obtaining an analytical solution for the coupling

term, it was assumed that the main diagonal values of the Cij tensor are set to one, as this setting would

nullify the coupling term’s derivative on the same direction of the respective velocity component. The

non-diagonal values of the Cij tensor were imposed and adapted to each velocity component coupling

term.

The analytical solution is derived with detail in this section for the u velocity component. It should be

noted that the coupling term for the remaining velocity components was derived analogously.

The coupling term for the u velocity component can be expanded to the following form:

∂T[p]ij

∂x=ν[p]

τp

(∂ [f(Ckk)C11 − δ11]

∂x+∂ [f(Ckk)C12 − δ12]

∂y+∂ [f(Ckk)C13 − δ13]

∂z

)(4.29)

From (4.29) it is noticeable that the ∂x term is zero, as the Peterlin function f(Ckk) and C11 are

equal to one. For the terms C12 and C13, it was assumed a linear profile along the y and z directions,

respectively.

Considering the previous remarks, equation (4.29) can be rewritten as follows:

∂T[p]ij

∂x=ν[p]

τp

(∂C12

∂y+∂C13

∂z

)(4.30)

Therefore, C12 and C13 profiles were computed as a function with independent variable equal to the

respective partial derivative on equation (4.30). The considered profiles are shown in 4.15.

Profiles shown on 4.15 can be written as:

C12 =

zhπ5 + π, if yh < 0

− zhπ5 + π, if yh ≥ 0

(4.31)

C13 =z

h

π

5(4.32)

For this verification exercise, the following simulation parameters were considered. It should be noted

that these parameters are valid for the coupling term verification exercise in all velocity components.

Thus, for the verification test, given the properties from Table 4.2 and profiles from Figure 4.15, the

58

(a) C12 profile (b) C13 profile

Figure 4.15: C12 and C13 profiles prescribed for the coupling term on the u velocity component. H is theunit of length used in the code.

General properties Polymer properties

Box height h 2π Zero shear-rate viscosity ν[p] 0.002

Box dimensions 10× 10× 10 Maximum molecular extensibility L 100

Mesh 128× 128× 128 Relaxation time (s) τp 0.1

Polymer concentration β 0.8

Table 4.2: Coupling term simulation parameters.

analytical solution for tensor Cij is the following:

∂T[p]ij

∂x'

0.0398, if yh < 0

0, if yh ≥ 0

(4.33)

Having computed the analytical solution, it is performed the comparison with the numerical solution.

The result for the coupling term on the u velocity is presented in Figure 4.16.

From Figure 4.16, it is possible to see that the dashes lines regarding the numerical (red colour) and

analytical (blue colour) solutions follow the same trend and present the same values.

As mentioned before, the verification exercise was performed analogously for the remaining velocity

components. Now it is presented the verification results for the coupling term in the v velocity component

in Figures 4.17 and 4.18.

As observed on the coupling term for the u velocity, the dashes lines regarding the numerical (red

colour) and analytical (blue colour) solutions follow the same trend and present the same values on the

coupling term for the v velocity.

Similarly, to the v velocity component, it is shown the verification results for the coupling term in the

w velocity component in Figures 4.19 and 4.20.

As observed on the coupling term for the u velocity, the dashes lines regarding the numerical (red

colour) and analytical (blue colour) solutions follow the same trend and present the same values on the

coupling term for the w velocity.

59

Figure 4.16: Coupling term for u velocity for numerical (red) and analytical (blue) solutions.


Figure 4.17: C12 and C23 profiles prescribed for the coupling term on the v velocity component. h is theunit of length used in the code.

Having obtained the results for the coupling term verification exercise, it is possible to conclude that

the code is verified given the precision of the results between analytical and numerical solutions.

60



Figure 4.19: C13 and C23 profiles prescribed for the coupling term on the w velocity component. h is theunit of length used in the code.


61

Chapter 5

Results

This chapter is devoted to the analysis of new results obtained with the code developed within this thesis,

namely the study of the influence of polymer additives on the turbulent planar jet. It should be noted that

the results were produced as part of an ongoing investigation with Mateus C. Guimaraes [69].

The addition of polymer additives to a Newtonian solvent has proven to strongly influence the flow

physics. However, the flow physics of the visco-elastic scenario still lack understanding in comparison

to the Newtonian case, mainly due to its complexity and lack of data at an experimental and numerical

level. Some studies have been performed considering homogeneous isotropic turbulence [35, 47, 57],

but flows with a non-zero mean shear, such as planar jets, have not yet been investigated numerically

or experimentally.

It is important to note that the validation of the Newtonian code with other literature references is not

presented in this section. There has been an extensive validation process presented by [16, 18] for the

Newtonian turbulent planar jet.

Having the code been verified (see Chapter 4 for further detail), studies were carried out using DNS

of FENE-P turbulent plane jets considering the simulation parameters shown in Tables 5.1 and 5.2.

The physical length of the computational box in the x streamwise direction was chosen to allow for

the simulation of a fully developed turbulent jet. Concerning the normal direction y, the computational

domain must be large enough to accommodate the jet without interacting with the normal boundaries.

According to [18], the mesh size should be of the order of the Kolmogorov small scale, to be able to

capture the turbulent phenomenon. Moreover, since turbulence is isotropic at the small scales, the

mesh size is required to be approximately equal in all direction. In addition to this, the number of points

in the y normal and z spanwise directions are required to be decomposable into powers of 2 for better

performance of the FFTW library. Also, due to the parallelization strategy the number of points in the x

streamwise and y spanwise directions must be even.

63

nx ny nz Lx/h Ly/h Lz/h

512 512 128 18 18 4

Table 5.1: Computational domain parameters for the simulations. Number of grid points (nx, ny, nz);non-dimensional grid size (Lx/h, Ly/h, Lz/h).

It should be noted that parameters from Table 5.1 are equal to all cases presented on Table 5.2.

Moreover, the grid is uniform in the three spatial directions and that the total number of mesh points

amounts to a total of just over 33 million.

As for the physical simulation parameters, results in this section considered the cases presented in

Table 5.2.

Case Wi Re Reλ τp(s) β L ∆x/η14h

A 0 3500 100 0 1 0 4.0

B 0.33 3500 110 0.025 0.8 100 4.0

C 0.65 3500 110 0.05 0.8 100 4.0

D 1.19 3500 110 0.10 0.8 100 3.9

E 1.66 3500 140 0.20 0.8 100 3.4

F 0.20 3500 120 0.025 0.6 100 3.8

G 0.27 3500 160 0.20 0.6 100 3.3

H 1.35 3500 160 0.025 0.4 100 3.3

I 1.55 3500 180 0.20 0.4 100 3.1

J 2 3500 200 0.40 0.8 100 2.8

K 2.6 3500 220 0.60 0.8 100 2.5

L 3.3 3500 250 0.80 0.8 100 2.3

Table 5.2: Physical parameters for the simulations. Weissenberg number Wi; slot width, initial maximumjets velocity, solvent viscosity based Reynolds number Re; Averaged Taylor turbulence micro-scale cen-treline velocity solvent viscosity based Reynolds number on the x streamwise direction Reλ; polymerrelaxation time τp; ratio of the solvent to solvent plus polymer viscosity β; polymer normalized maximumextensibility L; mesh resolution normalized by the Kolmogorov small scale at x/h = 14.

The Wi represents the Weissenberg (non-dimensional) number, which is given by the ratio between

the polymer relaxation time and Kolmogorov time scale:

Wi =τpτs

(5.1)

where τs represents the Kolmogorov time scale (τs = (ν[s]/ε[s])(1/2)).

The Reynolds number based on the Taylor microscale is defined as:

Reλ =u′λ

ν[s](5.2)

where u′ is the reference perturbation velocity and λ is the Taylor scale. The Taylor scale λ is given by

64

the viscous dissipation rate assuming local isotropy at the jet centreline.

ε[s] = 15ν[s]

⟨(∂u′

∂x

)2⟩

= 15ν[s] u′2

λ2⇔ λ =

√15ν[s]u′

2

ε[s](5.3)

As the Taylor scale is a function of the x streamwise direction, the value of Reλ presented in Table

5.2 refers to an averaged value on z spanwise direction and time along the centreline for y = 0.

In order to simulate the smallest scales that are present in the flow, the grid size must be of the order

of the Kolmogorov scale η, defined in terms of the solvent turbulent viscous energy dissipation rate ε[s]

and the solvent viscosity ν[s], as shown in equation (5.4).

η =

((ν[s])3

ε[s]

)1/4

(5.4)

According to [18], several authors state that the ratio of the element size ∆x to the Kolmogorov scale

η should have a value of the order of 2.0 to 3.0. In [11], it is stated that ∆x/η = 2.1 is a good reference

value for DNS simulations of isotropic turbulence.

∆x/η presents an evolution across the x streamwise direction on the centreline, with the highest

value located at the beginning of the self-similar region and decreasing from then on. It was opted to

select to analyse this quantity at x/h = 14, as this region refers to a fully developed turbulent region

of the jet, subject to the current study. By having mesh resolution normalized by the Kolmogorov small

scales within the values stated in the literature, it was considered that the mesh size was adequate for

the study of turbulence in the fully developed region of the jet, namely for cases J to L which are the

focus of this work.

The simulations are started with variable time step and constant Courant number (CFL = 1/6).

Once self-similarity is attained, the time step is kept constant at a lower level than the minimum reached

during the first part of the simulation. At this stage, statistical quantities are computed by accumulating

data over 150.000 iterations (3 weeks computing run).

The half-width or mean flow thickness δ is defined as follows according to [69]:

δ =

∫ ∞0

u− U2L

Uc − U2Ldy (5.5)

where Uc is the centreline velocity and U2L is the local co-flow velocity.

The streamwise velocity is non-dimensionalized according to the following expression:

(U1 − U2

〈Uc〉 − U2

)2

(5.6)

where U1 is the inlet centreline velocity and U2 is the inlet co-flow velocity.

It is important to notice that all variables presented in this section are an average on time and on the

z spanwise direction.

65

0 5 10 15 20

x/h

0.5

1

1.5

2δ/h

(a)

δ = 0.110xδ = 0.083xτp = 0.0τp = 0.4τp = 0.6τp = 0.8

0 5 10 15 20

x/h

1

1.5

2

2.5

3

3.5

[(Uc−U2)/(U

1−U2)]−2

(b)

0.210(x/h − 1)0.158(x/h − 1)τp = 0.0τp = 0.4τp = 0.6τp = 0.8

Figure 5.1: Streamwise evolution of non-dimensional (a) jet’s half-width and (b) centreline velocity forfluids with different relaxation time for cases A, J, K and L. The Newtonian case refers to τp = 0. From[69].

From Figure 5.1, it is possible to identify the self-similar region by fitting straight lines to the linear

region of each curve. A good linear fit was found between x/h = 6 to x/h = 17 for the Newtonian case,

whilst for the visco-elastic case with τp = 0.8 a good linear fit was observed spanning from x/h = 10 to

x/h = 17. These windows identify the self-similar region. Despite the good linear fit observed on Figure

5.1 (b), results from case τp = 0.8 display the need of additional convergence.

Moreover, it is possible to observe that for higher polymer relaxation time there is a different flow

behaviour between the Newtonian case and the FENE-P solution. From 5.1 (a) it is noticeable a lower jet

width on the visco-elastic cases in comparison to the Newtonian case. Similarly, the centreline velocity

shows a larger value on the visco-elastic cases than on the Newtonian case. Both these remarks are

indicative of a lower energy dissipation rate for the FENE-P solutions due to the interaction of the polymer

particles on the flow physics. This result follows the expected trend, as polymer particles are expected

to absorb elastic energy at a higher rate from the flow, reducing the overall energy dissipation of the flow

and, consequently, inducing drag reduction.

On Figure 5.2 it is shown the normalized mean velocities by the local Uc centreline velocity for a fully

turbulent streamwise cross-section (x/h = 12) for cases A, J, K and L. It is also shown the root mean

squared turbulent velocity perturbation components and the Reynolds shear stress profiles for different

relaxation time.

It is possible to observe that the streamwise u velocity profiles collapse for different relaxation time.

This behaviour is like the one observed on a Newtonian case for different streamwise cross-sections, for

which the streamwise u velocity profiles collapse for different self-similar stations locations [18]. Con-

cerning the normal y velocity profile, it is noted that a higher polymer relaxation time leads to, typically,

a lower average normal velocity magnitude. It is important to notice that the spanwise w average ve-

locity is not presented, since it presents an average zero value for a planar jet [18]. As for the velocity

perturbation root mean squared components, it is observed an overall decrease of all components with

increasing polymer relaxation time. Moreover, it is seen that all perturbation velocities converge to zero

with higher distances from the centreline, as the flow is moving towards zones not perturbed by the jet.

66

Concerning the Reynolds shear stresses, the examination of Figure 5.2 indicates that for higher polymer

relaxation time the shear stress reduces, namely for cases K and L. This observation reinforces the

previous remarks, indicating a smaller energy dissipation rate for the polymer solution case due to the

energy-absorption effect of the polymers.

0 0.5 1 1.5 2 2.50

0.5

1

(u−U2L)/(U

c−U2L)

(a)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

0 1 2-0.06

-0.04

-0.02

0

0.02

v/(Uc−U2L) (b)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

0 1 2 30

0.1

0.2

√

u′2/(Uc−U2L)

(c)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

0 1 2 30

0.1

0.2√

v′2/(Uc−U2L)

(d)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

0 1 2 3

y/δ

0

0.1

0.2

√

w′2/(Uc−U2L)

(e)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

0 1 2 3

y/δ

0

0.01

0.02

0.03

u′v′/(Uc−U2L)2 (f)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

x/h = 12

Figure 5.2: Normalized mean velocity: (a) streamwise and (b) normal direction components, (c-e) ve-locity root mean squared components and (f) Reynolds shear stress profiles for fluids with differentrelaxation time for cases A, J, K and L. From [69].

67

The conformation tensor Cij are plotted for the same streamwise cross-section (x/h = 12) in Fig-

ure 5.3. It is worth mentioning that the averaged conformation tensor components C13 and C23 are not

presented in Figure 5.3, as due to the nature of the turbulent flow is it possible to prove that the aver-

age value of these components is zero. This was observed on the simulation results. Concerning the

remaining conformation tensor components, it is seen that with increasing polymer relaxation time, the

magnitude of the averaged Cij increases. Moreover, one notices that the average values converges to

the polymer equilibrium state, that is value 1 for the diagonal tensor terms and 0 for the remaining tensor

terms, on the non-perturbed region of the flow, as expected. As the elastic energy stored by a stretched

polymer is proportional to the trace of the conformation tensor [70], the observed higher average values

of the conformation tensor components with increasing polymer relaxation time indicate that the polymer

molecules have an increasing absorbed elastic energy.

0 1 2 3

0

20

40

Cij

(a)ij = 11

ij = 22

ij = 33

ij = 12

τp = 0.4

0 1 2 3

-50

0

50

100

150

Cij

(b)ij = 11

ij = 22

ij = 33

ij = 12

τp = 0.6

0 1 2 3

y/δ

0

100

200

Cij

(c)ij = 11

ij = 22

ij = 33

ij = 12

τp = 0.8

0 1 2 30

50

100√

c′2 ij

(d)ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.4

0 1 2 30

50

100

150

√

c′2 ij

(e)ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.6

0 1 2 3

y/δ

0

100

200

√

c′2 ij

(f)ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.8

Figure 5.3: Mean (a-c) and root mean squared (d-f) components of the conformation tensor for fluidswith different relaxation time at x/h = 12 for cases J, K and L. From [69].

68

On Figure 5.4 it is presented the velocity perturbation components on the streamwise direction nor-

malized by the local centreline velocity, along with the root mean squared conformation tensor compo-

nents.

It is observed that with increasing polymer relaxation time, the perturbation velocity profile (typically

starting around x/h = 3) for all velocity components presents a smoother transition behaviour with a

lower perturbation velocity magnitude.

As for the root mean squared conformation tensor components, it is noticeable their values are

increasing with higher polymer relaxation time, with foremost relevant changes on terms C11 and C12.

Also, it is clear the presence of an abrupt increase on the root mean squared conformation tensor

components, which is indicative of a regime modification in the flow.

0 5 10 15 200

0.1

0.2

√

u′2/(Uc−U2L)

(a)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

0 5 10 15 200

0.1

0.2

√

v′2/(Uc−U2L)

(b)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

0 5 10 15 20

0

0.05

0.1

0.15

√

w′2/(Uc−U2L)

(c)

τp = 0.0τp = 0.4τp = 0.6τp = 0.8

-5 0 5 10 150

20

40

60

80

√

c′2 ij

(d)

ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.4

-5 0 5 10 15

x/h

0

50

100

√

c′2 ij

(e)ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.6

-5 0 5 10 15

x/h

0

50

100

150

√

c′2 ij

(f)ij = 11ij = 22ij = 33ij = 12ij = 13ij = 23

τp = 0.8

Figure 5.4: Centreline root mean squared evolution of (a) streamwise, (b) normal and (c) spanwise direc-tions velocity components and (d-f) conformation tensor components for fluids with different relaxationtime for cases J, K and L. From [69]

69

The Weissenberg number compares the elastic forces to the viscous forces and an analysis for the

visco-elastic simulations is presented in Figure 5.5. It is observed that for higher polymer relaxation

time with the same β (especially clear on cases with β = 0.8), Wi number increases. Simultaneously,

the same cases when viewed from a turbulent kinetic energy dissipation perspective (see Figure 5.6)

present a lower viscous energy dissipation, which in turn would contribute to a lower Wi number. Thus,

this indicates that the polymer elastic forces on the flow are far more significant than the solvent viscous

forces.

If one considers cases with an equal τp, it is noticeable that higher β leads to a higher Wi number.

This is observed, for example, in cases with τp = 0.2s. Nonetheless, the variation on the ratio between

elastic and viscous forces on the flow is clearly smaller with the influence of β than with the influence of

τp.

0 5 10 15 20 25

x/h

0

1

2

3

Wi=

τ p/√

ν[s] /ε[s] τp = 0.025, β = 0.8

τp = 0.025, β = 0.6τp = 0.025, β = 0.4τp = 0.050, β = 0.8τp = 0.100, β = 0.8τp = 0.200, β = 0.8τp = 0.200, β = 0.6τp = 0.200, β = 0.4τp = 0.400, β = 0.8τp = 0.600, β = 0.8τp = 0.800, β = 0.8

Figure 5.5: Stream-wise evolution of centreline Weissenberg numbers for all visco-elastic cases. From[69].

As for the viscous energy dissipation rate presented in Figure 5.6, it is noted that the presence of

polymer additives on the flow leads to a lower viscous energy dissipation rate. Moreover, for the same τp

a lower β leads to lower viscous energy dissipation. Similarly, for the same β a higher τp leads to lower

viscous energy dissipation.

0 5 10 15 20 25 30

x/h

0

2

4

6

8

10

ε[s]h/U

3 1

Newtonian fluidτp = 0.025, β = 0.8τp = 0.025, β = 0.6τp = 0.025, β = 0.4τp = 0.050, β = 0.8τp = 0.100, β = 0.8τp = 0.200, β = 0.8τp = 0.200, β = 0.6τp = 0.200, β = 0.4τp = 0.400, β = 0.8τp = 0.600, β = 0.8τp = 0.800, β = 0.8

Figure 5.6: Stream-wise evolution of centreline turbulent kinetic energy dissipation of the solvent for allcases. From [69].

70

In Figure 5.7 it is presented the average conformation tensor trace components along the x stream-

wise direction.

It is noticeable that a higher τp induces higher conformation trace components. This behaviour is

also verified for the root mean squared components presented in Figure 5.4. Together with the observed

lower jet width and overall lower turbulent energy dissipation rate for higher polymer relaxation time, it is

indicative that the polymer stress due to larger polymer elongations (Cij) compensates the effect of the

increased relaxation time in reducing the polymer dissipation.

0 5 10 15 20 25

x/h

0

50

100

Cij

C11, τp = 0.4C22, τp = 0.4C33, τp = 0.4C11, τp = 0.6C22, τp = 0.6C33, τp = 0.6C11, τp = 0.8C22, τp = 0.8C33, τp = 0.8

Figure 5.7: Stream-wise evolution of centreline mean conformation tensor components for cases J, Kand L. From [69].

Future work will be developed on this area and ongoing investigation will be presented in [71].

71

Chapter 6

Conclusions

6.1 Achievements

In the present work it was implemented the Finite Extensible Nonlinear Elastic continuous model closed

with the Peterlin approximation (FENE-P) within an existing DNS tool, resulting in a DNS tool for turbulent

spatial jet with polymer additives. It is important to stress that DNS or experimental analyses of turbulent

spatial jet have not been found in the literature. As such, this work provides the first time a turbulent

spatial jet is analysed with a FENE-P visco-elastic configuration, leading to a new ground breaking

numerical tool with potential for new study areas. It is also worth mentioning that there is already

being performed investigation on a more thorough level regarding the influence of polymer additives on

turbulent planar jets with polymer additives in LASEF, which will result in conference proceeding and

technical articles [71].

During the development of the current work, extensive care was taken concerning the numerical

implementation to ensure impact minimization of the implementation of FENE-P on the overall DNS code

performance. Such was achieved, at a certain level, with the inclusion of the ghost cells mechanism and

minimization of the call of MPI commands, which allowed a relevant performance enhancement due

to the need of less communication time. In fact, iteration times with the inclusion of FENE-P on the

spatial jet code resulted in an increase of 7-fold, whilst for comparable homogeneous isotropic turbulent

FENE-P simulations a 10-fold increase was observed.

Further to the previous remarks, it was performed an extensive verification of the implemented code

with satisfactory result. This step is rather important because no comparable simulations were found in

the literature regarding FENE-P. Having the code verified, the results obtained were able to provide a

first meaningful insight regarding turbulent spatial jet simulations with polymer additives.

As for the results of DNS analyses of turbulent planer jets with polymer additives, it was possible to

obtain meaningful insight on the physics of the turbulent flow.

On the turbulent regime, it was analysed the evolution of the jet’s characteristics, the conformation

tensor components evolution and the turbulent energy dissipation rate. Regarding the jet’s characteris-

tics, focus was given on the jet’s width, velocity components evolution and Reynolds shear stress profile.

73

For all the abovementioned, the difference between the Newtonian fluid solution and FENE-P solution

is clear. The jet width in the FENE-P solution is lower than the one observed for the Newtonian case,

which indicates that the viscous energy dissipation is smaller for the polymer solution case. Similarly,

the centreline velocity shows a higher value for polymer solution than for the Newtonian case, confirm-

ing the previous remark. As for the remaining components, results show that velocity perturbations on

turbulent regime are lower in the visco-elastic solution and that the Reynolds shear stress profiles de-

creases on the FENE-P solution. Both previous remarks confirm a smaller energy dissipation rate for

the polymer solution case due to the energy-absorption effect of the polymers, which is indicative of the

drag reduction phenomenon.

As for the conformation tensor components evolution, it is observed an increase of these components

with higher polymer relaxation time. This is followed by a lower jet width and overall lower turbulent

energy dissipation rate for higher polymer relaxation time. Such indicates that the polymer stress due to

larger polymer elongations (Cij) compensates the effect of the increased relaxation time in reducing the

polymer dissipation. On works such as [47] related to homogeneous isotropic turbulence, it has been

shown that for increasingly large Wi there is a decrease in the fraction of the power dissipated by the

polymers. This scenario was not observed on the turbulent planar jet with polymer additives.

6.2 Future Work

In order to continue the work done in the current thesis, the following topics could be addressed:

• To analyse with more detail the interaction of polymer particles on the turbulent spatial jet. Addi-

tional analyses could include a better understanding the energy dissipation of the polymer or study

the polymer characteristics to achieve the upper limit of drag reduction. As mentioned previously,

there is already being performed investigation on a more thorough level regarding the influence of

polymer additives on turbulent planar jets with polymer additives in LASEF.

• Study the implementation of a hybrid parallelization approach, as the problem under studying is

highly scalable and would benefit from an hybrid MPI/OpenMP approach, namely on communica-

tion time and I/O. It should be noted that a first step has been performed for this scenario in [72],

with promising results for the Newtonian turbulent jet case.

74

Bibliography

[1] B. A. Toms, “Some observations on the flow of linear polymers solutions through straight tubes at

large reynolds numbers,” in Proc. 1st Intern. Congr. Rheol., (North Holland, Amsterdam), pp. 135–

141, 1948.

[2] K. Anand, Numerical Heat Transfer Analysis of Micro-scale Jet-impingement Cooling in a High-

pressure Turbine Vane. PhD thesis, Ryerson University, 2011.

[3] F. Khani, M. Darvishi, and R. S. R. Gorla, “Analytical investigation for cooling turbine disks with a

non-newtonian viscoelastic fluid,” Computers and Mathematics with Applications, vol. 61, pp. 1728–

1738, 2011.

[4] A. Dogonchi and D. Ganji, “Investigation of heat transfer for cooling turbine disks with a non-

newtonian fluid flow using dra,” Case Studies in Thermal Engineering, vol. 6, pp. 40–51, 2015.

[5] A. Fouxon and V. Lebedev, “Spectra of turbulence in dilute polymer solutions,” Physics of Fluids,

vol. 15, no. 7, pp. 2060–2072, 2003.

[6] E. Balkovsky, A. Fouxon, and V. Lebedev, “Turbulence of polymer solutions,” Physical Review E,

vol. 64, no. 056301, 2001.

[7] R. Joussot, S. Coumar, and V. Lago, “Plasmas for high speed flow control,” Plasmas for Aeronautics,

vol. 10, no. AL10-04, 2015.

[8] P. S. Rawat, X. Zhong, and V. Singh, “Numerical simulation of secondary flow in a weakly ion-

ized supersonic flow with applied electromagnetic field,” in 36th AIAA Plasmadynamics and Lasef

Conference, no. AIAA-2005-5050, 2005.

[9] S. O. Macheret, M. N. Shneider, and R. B. Miles, “Potential performance of supersonic mhd power

generators,” in 39th AIAA Aerospace Science Meeting and Exhibit, no. AIAA-2001-0795, 2015.

[10] P. A. Czysz, C. Bruno, and B. Chudoba, Future Spacecraft Propulsion Systems and Integration -

Enabling Technologies for Space Exploration. Springer-Verlag, 3rd ed., 2018.

[11] S. B. Pope, Turbulent Flows. Cambridge University Press, 2000.

[12] O. Reynolds, “An experimental investigation of the circumstances which determine whether the

motion of water is direct or sinuous, and the law of resistance in parallel channels,” Phil. Trans. Roy.

Soc., vol. 174, pp. 935–983, 1883.

75

[13] L. F. Richardson, Weather Prediction by Numerical Process. Cambridge University Press, 1922.

[14] A. N. Kolmogorov, “The local structure of turbulence in incompressible viscous fluid for very large

reynolds number,” Dokl. Akad. Nauk. SSSR, vol. 30, pp. 299–303, 1941.

[15] A. M. Obukhov, “On the distribution of energy in the spectrum of turbulent flow,” Dokl. Akad. Nauk

SSSR, vol. 32, pp. 22–243, 1941. (In Russian).

[16] R. N. Reis, The dynamics of coherent vortices near the turbulent/non-turbulent interface analysed

by direct numerical simulations. PhD thesis, IST, Lisboa, 2011.

[17] R. Vonlanthen, The Effects of Fluid Elasticity on Grid Turbulence. PhD thesis, Ecole Polytechnique

Federale de Lausanne, Suisse, 2010.

[18] D. C. Lopes, Direct and Large-Eddy Simulations of the Turbulent Entrainment of Passive Scalars in

Planar Jets. PhD thesis, IST, Lisboa, 2014.

[19] R. H. Nadolink and W. W. Haigh, “Bibliography on skin friction reduction with polymers and other

boundary-layer additives,” ASME Appl. Mech. Rev., vol. 48, pp. 351–460, 1995.

[20] S. Jin, Numerical simulations of a dilute polymer solution in isotropic turbulence. PhD thesis, Phd

Thesis, Cornell University, 2007.

[21] J. L. Lumley, “Drag reduction by additives,” Annu. Rev. Fluid Mech., vol. 1, pp. 367–384, 1969.

[22] J. L. Lumley, “Drag reduction in turbulent flow by polymer additives,” J. Polym. Sci., vol. 7, pp. 263–

290, 1973.

[23] P. S. Virk, “Drag reduction fundamentals,” AICHE J., vol. 21, pp. 625–656, 1975.

[24] P. G. De Gennes, Introduction to Polymer Dynamics. Cambridge University Press, 1990.

[25] G. Gadd, “Turbulence damping and drag reduction produced by certain additives in water,” Nature,

vol. 206, pp. 463–467, may 1965.

[26] M. P. Schultz et al., “Three models to relate detachment of low form fouling at laboratory and ship

scale,” Biofouling, vol. 19, pp. 17–26, 2003.

[27] P. S. Virk, E. W. Merril, H. S. Mickley, K. A. Smith, and E. L. Mollo-Christensen, “The toms phe-

nomenon: turbulent pipe flow of dilute polymer solutions,” J. Fluid Mech., vol. 20, pp. 22–30, 1967.

[28] C. M. White and M. G. Mungal, “Mechanics and prediction of turbulent drag reduction with polymer

additives,” Annu. Rev. Fluid. Mech., vol. 40, pp. 235–256, 2008.

[29] K. R. Sreenivasan and C. M. White, “The onset of drag reduction by dilute polymers and maximun

drag reduction asymptote,” J. Fluid Mech., vol. 409, pp. 149–164, 2000.

[30] H.-D. Xi, E. Bodenschatz, and H. Xu, “Elastic energy flux by flexible polymers in fluid turbulence,”

Phys. Rev. Lett., vol. 111, p. 024501, 2013.

76

[31] R. B. Bird, C. F. Curtiss, R. C. Armstrong, and O. Hassager, Dynamics of Polymeric Liquids, vol. vol.

2. John Wiley, 2nd ed ed., 1987.

[32] R. Sureshkumar, A. N. Beris, and R. A. Handler, “Direct numerical simulation of the turbulent chan-

nel flow of a polymer solution,” Phys. Fluids, vol. 9, no. 3, pp. 743–755, 1997.

[33] C. D. Dimitropoulos, R. Sureshkumar, A. N. Beris, and R. A. Handler, “Budgets of reynolds stress, ki-

netic energy and streamwise enstrophy in viscoelastic turbulent channel flow,” Phys. Fluids, vol. 13,

no. 4, pp. 1016–1027, 2001.

[34] C. D. Dimitropoulos, Y. Dubief, E. S. G. Shaqfeh, P. Moin, and S. K. Lene, “Direct numerical sim-

ulation of polymer-induced drag reduction in turbulent boundary layer flow,” Phys. Fluids, vol. 17,

2005.

[35] P. Ferreira, “Large-eddy simulations of turbulent flows with polymer additives,” Master’s thesis, In-

stituto Superior Tecnico, University of Lisbon, 2016.

[36] T. Vaithianathan, A. Robert, J. G. Brasseur, and L. R. Collins, “An improved algorithm for simulating

three-dimensional, viscoelastic turbulence,” J. Non-Newt. Fluid Mech., vol. 140, no. 1, pp. 3–22,

2006.

[37] F. Dupret and J. M. Marchal, “Loss of evolution in the flow of viscoelastic fluids,” J. Non-Newtonian

Fluid Mech., vol. 20, pp. 143–171, 1986.

[38] R. Sureshkumar and A. N. Beris, “Effect of artificial stress difusivity on the stability of numerical

calculations and the flow dynamics of time-dependent viscoelastic flows,” J. Non-Newt. Fluid Mech.,

vol. 60, no. 1, pp. 53–80, 1995.

[39] T. Vaithianathan and L. R. Collins, “Numerical approach to simulating turbulent flow of a viscoelastic

polymer solution,” J. Comp. Phys., vol. 187, pp. 1–21, 2003.

[40] S. K. Lele, “Compact finite difference scheme with spectral-like resolution,” Journal of Computa-

tional Physics, vol. 103(1), pp. 16–42, 1992.

[41] C. Laney, Computational Gas Dynamics. Cambridge University Press, 2000.

[42] A. W. El-Kareh and L. G. Leal, “Existence of solutions for all deborah numbers for a non-newtonian

model modified to include diffusion,” J. Non-Newtonian Fluid Mech., vol. 33, pp. 257–287, 1989.

[43] D. Gottlieb and J. S. Hesthaven, “Spectral methods for hyperbolic problems,” J. Comput. Appl.

Math., vol. 128, pp. 83–131, 2001.

[44] A. Kurganov and E. Tadmor, “New high-resolution central schemes for nonlinear conservation laws

and convection-diffusion equations,” J. Comp. Phys., vol. 160, no. 1, pp. 241–282, 2000.

[45] W.-H. Cai, F.-C. Li, and H.-N. Zhang, “Dns study of decaying homogeneous isotropic turbulence

with polymer additives,” J. Fluid Mech., 2010.

77

[46] A. Robert, T. Vaithianathan, L. R. Collins, and J. G. Brasseur, “Polymer-laden homogeneous shear-

driven turbulent flow: a model of polymer drag reduction,” J. Fluid. Mech., vol. 657, pp. 189–226,

2010.

[47] P. C. Valente, C. B. da Silva, and F. T. Pinho, “The effect of viscoelasticity on the turbulent kinetic

cascade,” J. Fluid. Mech., vol. 760, pp. 39–62, 2014.

[48] R. S. Rogallo, “High performance parallel computing for computational fluid dynamics (cfd),” Ko-

matsu Technical Report VOL.51 NO.156, 2005.

[49] A. Maheo, Improving the Hybrid model MPI+ Threads through Applications, Runtimes and Perfor-

mance tools. PhD thesis, Versailles-St Quentin en Yvelines, 2015.

[50] P. F. Lavalle and P. Wautelet, Hybrid MPI-OpenMP Programming. CNRS - IDRIS, version 2.0 ed.,

June 2014.

[51] T. Min, H. Choi, and J. Y. Yoo, “Maximum drag reduction in turbulent channel flow by polymer

additives,” J. Fluid Mech., vol. 492, pp. 91–100, 2003.

[52] T. Min, J. Y. Yoo, H. Choi, and D. D. Joseph, “Drag reduction by polymer additives in turbulent

channel flow,” J. Fluid Mech., vol. 486, pp. 213–238, 2004.

[53] R. Benzi, E. De Angelis, V. S. L’vov, I. Procaccia, and V. Tiberkevich, “Maximum drag reduction

asymptotes and the cross-over to the newtonian plug,” J. Fluid Mech., vol. 551, pp. 185–195, 2006.

[54] M. A. Hulsen, “A sufficient condition for a positive definite configuration tensor in different models,”

J. Non-Newtonian Fluid Mech., vol. 38, pp. 93–100, 1990.

[55] A. N. Beris and R. Sureshkumar, “Effect of artificial stress diffusity on the stability of numerical

calculations and the flow dynamics of time-dependent viscoelastic flows,” J. Non-Newtonian Fluid

Mech., vol. 60, pp. 53–80, 1995.

[56] C. Canuto, M. Hussaini, A. Quarteroni, and T. Zhang, Spectral Methods in Fluid Dynamics.

Springer-Verlag, 1988.

[57] A. Mosca, “Energy cascade analysis in a viscoelastic turbulent flow,” Master’s thesis, Instituto Su-

perior Tecnico, University of Lisbon, 2012.

[58] J. H. Ferziger and M. Perich, Computational methods for fluid dynamics, vol. 2. Springer Berlin,

1999.

[59] C. B. da Silva, The role of coherent structures in the control and interscale interactions of round,

plane and coaxial jets. PhD thesis, INPG, Grenoble, 2001.

[60] H. Pina, Metodos Numericos. Mc Graw-Hill, 1998.

[61] J. Williamson, “Low-storage runge-kutta schemes,” Journal of Computational Physics, vol. 35(1),

pp. 48–56, 1980.

78

[62] P. Moin and J. Kim, “Application of a fractional-step method to incompressible navier-stokes equa-

tions,” Journal of Computational Physics, vol. 59(2), pp. 308–323, 1985.

[63] P. Moin and H. Le, “An improvement of fractional step methods for the incompressible navier-stokes

equations,” Journal of Computational Physics, vol. 92(2), pp. 369–379, 1991.

[64] C. Canuto, M. Hussaini, A. Quarteroni, and T. Zang, Spectral Methods in Fluid Dynamics. Springer-

Verlag, 1987.

[65] C. B. da Silva and O. Metais, “On the influence of coherent structures upon interscale interactions

in turbulent plane jets,” J. Fluid Mech., vol. 473, pp. 103–145, 2002.

[66] S. Stanley and S. Sarkar, “Influence of nozzle conditions and discrete forcing on turbulent planar

jets,” AIAA journal, vol. 38(9), pp. 1615–1623, 2000.

[67] I. Orlanski, “A simple boundary condition for unbounded hyperbolic flows,” Journal of Computational

Physics, vol. 21(3), pp. 251–269, 1976.

[68] M. Frigo and S. G. Johnson, “The fastest fourier transform in the west,” tech. rep., Massachusetts

Institute of Technology, 1997.

[69] M. C. Guimaraes, “Report.” Instituto Superior Tecnico, University of Lisbon, September 2018.

[70] R. D. B. Marques, “Numerical study of the entrainment small scale features/characteristics in vis-

coelastic turbulent jets,” Master’s thesis, Instituto Superior Tecnico, University of Lisbon, 2012.

[71] M. Guimaraes, C. B. Silva, and N. Pimentel, “Direct numerical simulations of turbulent planar

viscoelastic jets.” 71st Annual Meeting of the APS Division of Fluid Dynamics, November 2018.

D35.00007.

[72] P. Alves, “Hybrid parallelization of a spatial direct numerical simulation,” Master’s thesis, Instituto

Superior Tecnico, University of Lisbon, 2016.

79

Appendix A

Stability condition proof

In this appendix, it is shown the proof for the Courant condition that must be satisfied in order to ensure

that the conformation tensor Cij remains positive definite. The proof is presented in [36] and is based

on the fact that a convex sum of positive definite matrices is itself positive definite. For example, matrix

C∗ defined as

C∗ =

N∑l=1

slCl (A.1)

where the coefficients sl ≥ 0 satisfy:N∑l=1

sl = 1 (A.2)

is positive definite if the matrices Cl are positive definite.

The Courant condition for numerical stability is obtained by considering the convection term of equa-

tion (3.33).

The numerical approximation of the convective term in many schemes strives to satisfy the maximum

principle of the underlying partial differential equation, leading to a convex representation. This is a

convenient representation for a bounded scalar (tensor), as it immediately implies the bounds will be

obeyed. For first-order methods, the sl weights depend only on the advecting velocity field and are

independent of the scalar (tensor) values themselves. However, for a second-order update, the weights

are functions of the velocity and the scalar (tensor) values at the node of interest and the neighbouring

nodes. Therefore, ensuring the weights remain convex therefore depends upon how the scalar (tensor)

values at the cell edges are approximated.

Considering the convection term alone, the following relationship can be derived from equations

(3.35), (3.36) and (3.37):

Ci,j,k =1

6

(C−i+1/2,j,k + C+

i−1/2,j,k + C−i,j+1/2,k + C+i,j−1/2,k + C−i,j,k+1/2 + C+

i,j,k−1/2

)(A.3)

81

Applying the previous relationship to equation (3.33), it is obtained:

C[n+1]i,j,k = s1C

+i+1/2,j,k + s2C

−i+1/2,j,k + s3C

+i−1/2,j,k + s4C

−i−1/2,j,k + s5C

+i,j+1/2,k + s6C

−i,j+1/2,k+

s7C+i,j−1/2,k + s8C

−i,j−1/2,k + s9C

+i,j,k+1/2 + s10C

−i,j,k+1/2 + s11C

+i,j,k−1/2 + s12C

−i,j,k−1/2

(A.4)

where the sl weights are given by:

s1 ≡(−ui+1/2,j,k + |ui+1/2,j,k|

)( ∆t

2∆x

), s2 ≡

1

6+(−ui+1/2,j,k − |ui+1/2,j,k|

)( ∆t

2∆x

),

s3 ≡1

6+(ui−1/2,j,k − |ui−1/2,j,k|

)( ∆t

2∆x

), s4 ≡

(ui−1/2,j,k + |ui−1/2,j,k|

)( ∆t

2∆x

),

s5 ≡(−vi,j+1/2,k + |vi,j+1/2,k|

)( ∆t

2∆y

), s6 ≡

1

6+(−vi,j+1/2,k − |vi,j+1/2,k|

)( ∆t

2∆y

),

s7 ≡1

6+(vi,j−1/2,k − |vi,j−1/2,k|

)( ∆t

2∆y

), s8 ≡

(vi,j−1/2,k + |vi,j−1/2,k|

)( ∆t

2∆y

),

s9 ≡(−wi,j,k+1/2 + |wi,j,k+1/2|

)( ∆t

2∆z

), s10 ≡

1

6+(−wi,j,k+1/2 − |wi,j,k+1/2|

)( ∆t

2∆z

),

s11 ≡1

6+(wi,j,k−1/2 − |wi,j,k−1/2|

)( ∆t

2∆z

), s12 ≡

(wi,j,k−1/2 + |wi,j,k−1/2|

)( ∆t

2∆z

)

(A.5)

The condition that the weights sl ≥ 0 is ensured if the following Courant number is satisfied:

CFL ≡ max(|u|∆x

,|v|∆y

,|w|∆z

)∆t <

1

6(A.6)

Summing the sl weights, one obtains:

∑l

sl = 1− ∆t

2

[ui+1/2,j,k − ui−1/2,j,k

∆x+vi,j+1/2,k − vi,j−1/2,k

∆y+wi,j,k+1/2 − wi,j,k−1/2

∆z

](A.7)

From equation (A.7) it is possible to notice that the term in brackets is equal to the term presented

in equation (3.42), which has been shown to be equal to zero. In combination with equation (A.1), it is

ensured that the eigenvalues of C[n+1]i,j,k remain positive and that its sum remain below L2.

82

web.fe.up.ptfpinho/pdfs/msc_thesis_npimentel_vfinal.pdf · direct numerical simulation of turbulent...

Documents