Turbulence Enrichment using Generative Adversarial Networks

What and why

Turbulent flow is important in many engineering applications. However, simulating turbulence is very computationally expensive due to extremely high resolution requirements. Large Eddy Simulations (LES) that simulate only the large scales have become popular due much lower cost, but require explicit modeling of the small scales. Here, we propose to enrich LES data by populating it with small scales obtained using a Generative Adversarial Network (GAN).

Modeling turbulence accurately is extremely challenging especially in capturing high order statistics due to its intermittent nature. GANs have been shown to perform better than other data driven approaches like PCA in capturing high order moments [1]. In addition, generating physically realistic realizations are important for physical simulation data; a constraint not present when using generative models for images. Incorporating this constraint into the GAN framework would be crucial to its performance. The constraint is given by the governing equations for the incompressible flow given below:

\begin{align*}
\nabla \cdot \bm{u} &= 0 \\
\frac{\partial \bm{u}}{\partial t} + \left( \bm{u} \cdot \nabla \right) \bm{u} &= - \nabla p + \frac{1}{\mathrm{Re}} \nabla^2 \bm{u}
\end{align*}

Data

We generated data for this work using an in-house fluid flow simulation code (PadeOps) which was run on a compute cluster using 256 processors for multiple days. We performed a time-evolving simulation on a 64x64x64 uniform grid and collected snapshots separated in time by more than one integral time scale. The output is comprised of four fields — three components of the velocity vector $(u, v, w)$ and one of the pressure $(p)$ each of size $64 \times 64 \times 64$.

Low-resolution data is generated by filtering the HR data down to $16 \times 16 \times 16$ using the explicit filter shown below that’s derived as a best approximation to the sharp spectral filter.

Comparison of the high-resolution (top) and low resolution (bottom) data:

Comparison of the power spectral density of the high-resolution (blue) and low-resolution (orange) data:

The $1260$ datasets are split into $920\ (79.3\%)$ train, $120\ (10.3\%)$ dev and $120\ (10.3\%)$ test sets.

Model

For the task of upsampling the low resolution data in a physically consistent manner, we use a GAN[2] in a fashion similar to super-resolution applications for image data [3].

The generator has a deep residual network architecture with each residual block having convolutional layers with batch normalization. The discriminator has a deep convolutional architecture with fully connected layers in the end for binary classification. The architectures of the generator and discriminator are depicted pictorially below:

Loss Functions

The flow field is constrained by the continuity and pressure Poisson equations:

\begin{gather}
\nabla \cdot \bm{u} = 0, \tag{\text{continuity}} \label{eqn:continuity} \\
- \nabla^2 p = \nabla \bm{u} : \nabla \bm{u}^T \tag{\text{pressure Poisson}} \label{eqn:pressure_poisson}
\end{gather}

The above equations might not be satisfied exactly by the model’s generated output. To combat this, the residual of the above equations can be used as a regularizer for the model through a physics loss.

The loss function minimized for the generator network during training is a combination of

\begin{align*}
\mathcal{L}_\mathrm{GAN} &= \left( 1 - \lambda_\mathrm{A} \right) \mathcal{L}_\mathrm{resnet} + \lambda_\mathrm{A} \mathcal{L}_\mathrm{adversarial}\\
\mathcal{L}_\mathrm{resnet} &= \left( 1 - \lambda_\mathrm{P} \right) \mathcal{L}_\mathrm{content} + \lambda_\mathrm{P} \mathcal{L}_\mathrm{physics} \\
\mathcal{L}_\mathrm{content} &= \left( 1 - \lambda_\mathrm{E} \right) \mathcal{L}_\mathrm{MSE} + \lambda_\mathrm{E} \mathcal{L}_\mathrm{enstrophy} \\
\mathcal{L}_\mathrm{physics} &= \left( 1 - \lambda_\mathrm{C} \right) \mathcal{L}_\mathrm{pressure} + \lambda_\mathrm{C} \mathcal{L}_\mathrm{continuity}
\end{align*}

$\mathcal{L}_\mathrm{MSE}$: Mean squared error between the high resolution and generated fields

$\mathcal{L}_\mathrm{enstrophy}$: Mean squared error in the derived enstrophy field $\Omega$ ($\Omega = \bm{\omega} \cdot \bm{\omega},\ \mathrm{where}\ \bm{\omega}=\nabla \times \bm{u}$) to sensitize the generator to high frequency content

Residuals of the continuity ($\mathcal{L}_\mathrm{continuity}$) and pressure Poisson ($\mathcal{L}_\mathrm{pressure}$) equations given above similar to [4].

To train the discriminator, we use the logistic loss based on predicted labels for real and generated data.

Training

TEResNet

A model with just the residual generator network without the adversarial component is termed TEResNet. We first train TEResNet to convergence and tune hyperparameters like the number of residual blocks and the physics loss parameters.

TEGAN

The model with both the residual network generator and the discriminator depicted above is termed TEGAN. The generator in TEGAN is first initialized using the weights from the trained TEResNet while the discriminator is initialized using the Xavier-He initialization.

For the first few iterations in the training process (~300), the discriminator alone is trained to negate the advantage that the generator has because of its pre-trained weights. Then, both the generator and discriminator are trained together with the active adversarial loss until the losses saturate and the discriminator’s output saturates at 0.5.

Model Performance

Content loss Physics loss
Dev Test Dev Test
TEResNet 0.049 0.050 0.078 0.085
TEGAN 0.047 0.047 0.070 0.072
% difference 4.1 6.0 10.3 15.2

References

[1] Chan, Shing, and Ahmed H. Elsheikh. “Parametrization and Generation of Geological Models with Generative Adversarial Networks.” arXiv preprint arXiv:1708.01810 (2017).

[2] Goodfellow, Ian, et al. “Generative adversarial nets.” Advances in neural information processing systems. 2014.

[3] Ledig, Christian, et al. “Photo-realistic single image super-resolution using a generative adversarial network.” arXiv preprint (2016).

[4] Raissi, Maziar, Paris Perdikaris, and George Em Karniadakis. “Physics Informed Deep Learning (Part I): Data-driven Solutions of Nonlinear Partial Differential Equations.” arXiv preprint arXiv:1711.10561 (2017).

[5] Arjovsky, Martin, Soumith Chintala, and Léon Bottou. “Wasserstein GAN.” arXiv preprint arXiv:1701.07875 (2017)

People

Raunak Borker, Sravya Nimmagadda, Akshay Subramaniam and Man-Long Wong