Detailed Usage
This page describes how to perform a wide variety of user interactions with nekRS for setting boundary conditions, converting between mesh formats, defining and running device kernels, writing output files, and much more. Please first consult the Input File Syntax page for an overview of the purpose of each nekRS input file to provide context on where the following instructions fit into the overall code structure. Throughout this section, variables and data structures in the nekRS source code are referenced - a list defining these variables and structures is available on the Commonly Used Variables page for reference.
Defining Variables to Access in Device Kernels
The customization of a nekRS problem to a specific case is one with both the host-side
user functions in the .udf file, as well as device-side user functions in the .oudf
file. For convenience purposes, nekRS supports setting non-pointer-type variables in the
.udf file that are accessible in the device kernels in the .oudf file. This section
shows an example of this usage.
Suppose that a device kernel requires a parameter representing a pressure gradient, which
is then used to determine a forcing kernel. One option would be to pass that pressure gradient
to the device kernel through its function parameters. The kernel in the .oudf file
would look something like the following.
@kernel void myForcingKernel(const dfloat dp_dx, /* more parameters */)
{
double foo = 2 * dp_dx;
// do something
}
Alternatively, we can define a variable, p_dp_dx, that we set from the .udf file.
While this variable propagation can be done in any of the user-defined functions that
has nrs as an input parameter, for consistency purposes we will use the UDF_LoadKernels
function for this purpose.
To set p_dp_dx to 5.5 from the .udf file, write to the kernelInfo object
on the nrs object. The defines/<p_name> syntax indicates that a variable on
the device is being declared with a name p_name that will be accessible simply as
p_name in the device kernels.
void UDF_LoadKernels(nrs_t * nrs)
{
occa::properties & kernelInfo = *nrs->kernelInfo;
kernelInfo["defines/p_dp_dx"] = 5.5;
// other stuff related to loading the kernels
}
Then, the kernel would be simplified to the following. You will note that nothing needs
to be passed through the kernel function arguments - p_dp_dx is simply available as
if it were a local variable to the function.
@kernel void myForcingKernel(/* more parameters */)
{
double foo = 2 * p_dp_dx;
// do something
}
If you grep for kernelInfo["defines in the nekRS source code, you will see that
this variable propagation features is also used extensively throughout a normal problem
setup. For instance, the number of velocity fields to solve for is propagated to the device
in the nrsSetup function.
nrs_t* nrsSetup(MPI_Comm comm, occa::device device, setupAide &options, int buildOnly)
{
// ...
kernelInfo["defines/p_NVfields"] = nrs->NVfields;
// ...
}
Again, the convention is to precede all such propagated variables with the p_ prefix.
No list of all such variables propagated automatically within a nekRS simulation is
maintained, so always check if the information you’d like to propagate is perhaps
already automatically propagated.
Setting Boundary Conditions with Device Kernels
Because all nekRS solves are performed on the device, boundary conditions on the
solution (which may change from time step to time step and be arbitrary functions
of the solution itself) are also applied on the device. The types of boundary conditions
on each solution field are specified in the .par file with the boundaryTypeMap
key.
Setting Custom Properties
Custom material properties can be set for the flow and passive scalar equations
by assigning the udf.properties function pointer to a function with a signature
that takes the nrs pointer to the nekRS solution object, the simulation time
time, the velocity solution on the device o_U, the passive scalar solution
on the device o_S, the flow material properties on the device o_UProp,
and the passive scalar material properties on the device o_SProp.
This section provides an example of setting \(\mu\) and \(\rho\) for the flow
equations and \(k\) and \(\rho C_p\) for two passive scalars. Suppose our problem
contains velocity, pressure, temperature, and two passive scalars. The [VELOCITY],
[PRESSURE], [TEMPERATURE], [SCALAR01], and [SCALAR02] sections of the
.par file would be as follows. Because we will be setting custom properties for
the pressure, velocity, and first two passive scalars (temperature and SCALAR01),
we can let nekRS assign the default values of unity to all properties for those
governing equations until we override them in our custom property function. We still
need to define the material properties for SCALAR02, however, because we will not
be overriding those properties in our function.
[PRESSURE]
residualTol = 1e-6
[VELOCITY]
boundaryTypeMap = v, O, W
residualTol = 1e-8
[TEMPERATURE]
boundaryTypeMap = t, O, I
residualTol = 1e-8
[SCALAR01]
boundaryTypeMap = t, O, I
residualTol = 1e-8
[SCALAR02]
boundaryTypeMap = t, O, t
residualTol = 1e-7
conductivity = 3.5
rhoCp = 2e5
Also suppose that our problem contains conjugate heat transfer, such that some of the mesh is fluid while some of the mesh is solid.
In UDF_Setup, we next need to assign an address to the udf.properties function
pointer to a function with the correct signature where we eventually assign our custom
properties. Our UDF_Setup function would be as follows.
void UDF_Setup(nrs_t* nrs)
{
udf.properties = &material_props;
}
Here, material_props is our name for a function in the .udf file that sets the
material properties. Its name is arbitrary, but it must have the following signature.
void material_props(nrs_t* nrs, dfloat time, occa::memory o_U, occa::memory o_S,
occa::memory o_UProp, occa::memory o_SProp)
{
// set the material properties
}
This function is called after the solve has been performed on each time step, so the material properties are lagged by one time step with respect to the simulation.
Note
You must place the material_props function before UDF_Setup (and before any other
function that uses material_props) in the .udf file in order for the just-in-time
compilation to succeed.
Suppose we would like to set \(\rho=1000.0\) and \(\mu=2.1e-5 e^{-\phi_0/500}(1+z)\) for the flow equations; because only the fluid domain has flow, we do not need to set these properties on the solid part of the domain. For the first passive scalar \(\phi_0\), we would like to set \((\rho C_p)_f=2e3(1000+PV_x)\) and \(k_f=2.5\) in the fluid domain, and \((\rho C_p)_s=2e3(1000+PV_x)\) and \(k_s=3.5\) in the solid domain. Here, \(P\) is the thermodynamic pressure and \(V_x\) is the \(x\)-component velocity. For the second passive scalar \(\phi_1\), we would like to set \(\rho C_p=0\) and \(k=5+\phi_0\) in both the fluid and solid domains. Our material property function would be as follows. Note that these boundary conditions are selected just to be comprehensive and show all possible options for setting constant and non-constant properties with dependencies on properties - they do not necessarily represent any realistic physical case.
// declare all the kernels we will be writing
static occa::kernel viscosityKernel;
static occa::kernel constantFillKernel;
static occa::kernel heatCapacityKernel;
static occa::kernel conductivityKernel;
void material_props(nrs_t* nrs, dfloat time, occa::memory o_U, occa::memory o_S,
occa::memory o_UProp, occa::memory o_SProp)
{
mesh_t* mesh = nrs->mesh;
// viscosity and density for the flow equations
const occa::memory o_mue = o_UProp.slice(0 * nrs->fieldOffset * sizeof(dfloat));
const occa::memory first_scalar = o_S.slice(0 * cds->fieldOffset * sizeof(dfloat));
viscosityKernel(mesh->Nelements, first_scalar, mesh->o_z, o_mue);
const occa::memory o_rho = o_UProp.slice(1 * nrs->fieldOffset * sizeof(dfloat));
constantFillKernel(nrs->mesh->Nelements, 1000.0, 0.0 /* dummy */, nrs->o_elementInfo, o_rho);
// conductivity and rhoCp for the first passive scalar
int scalar_number = 0;
const occa::memory o_con = o_SProp.slice((0 + 2 * scalar_number) *
cds->fieldOffset * sizeof(dfloat));
constantFillKernel(mesh->Nelements, 2.5, 3.5, nrs->o_elementInfo, o_con);
const occa::memory o_rhocp = o_SProp.slice((1 + 2 * scalar_number) *
cds->fieldOffset * sizeof(dfloat));
heatCapacityKernel(mesh->Nelements, o_U, nrs->o_P, o_rhocp);
// conductivity and rhoCp for the second passive scalar
scalar_number = 1;
const occa::memory o_con_2 = o_SProp.slice((0 + 2 * scalar_number) *
cds->fieldOffset * sizeof(dfloat));
conductivityKernel(mesh->Nelements, first_scalar, o_con_2);
const occa::memory o_rhocp_2 = o_SProp.slice((1 + 2 * scalar_number) *
cds->fieldOffset * sizeof(dfloat));
constantFillKernel(mesh->Nelements, 0.0, 0.0, nrs->o_elementInfo, o_rhocp_2);
}
The o_UProp and o_SProp arrays hold all material
property information for the flow equations and passive scalar equations, respectively.
In this function, you see six “slice” operations performed on o_UProp and o_SProp
in order to access the two individual properties (diffusive constant and time derivative constant)
for the three equations (momentum, scalar 0, and scalar 1). The diffusive constant
(\(\mu\) for the momentum equations and \(k\) for the passive scalar equations)
is always listed first in these arrays, while the coefficient on the time derivative
(\(\rho C_p\) for the momentum equations and \(\rho C_p\) for the passive scalar
equations) is always listed second in these arrays.
To further elaborate, \(\mu\) and \(\rho\) are accessed as slices on o_UProp.
Because viscosity is listed before density, the offset in the o_UProp array to get
the viscosity is zero, while the offset to get the density is nrs->fieldOffset.
\(k\) and \(\rho C_p\) are accessed as slices in o_SProp. Because the
passive scalars are listed in order and the conductivity is listed first for each user,
the offset in the o_SProp array to get the conductivity for the first passive scalar
is zero, while the offset to get the heat capacity for the first passive scalar
is cds->fieldOffset. Finally, the offset in the o_SProp array to get the conductivity
for the second passive scalar is 2 * cds->fieldOffset, while the offset to get the
heat capacity for the second passive scalar is 3 * cds->fieldOffset.
The viscosityKernel, constantFillKernel, heatCapacityKernel,
and conductivityKernel functions are all user-defined device kernels. These
functions must be defined in the .oudf file, and the names are arbitrary. For each
of these kernels, we declare them at the top of the .udf file. In order to link
against our device kernels, we must instruct nekRS to use its just-in-time compilation
to build those kernels. We do this in UDF_LoadKernels by calling the
udfBuildKernel function for each kernel. The second argument to the udfBuildKernel
function is the name of the kernel, which appears as the actual function name of
the desired kernel in the .oudf file.
void UDF_LoadKernels(nrs_t* nrs)
{
viscosityKernel = udfBuildKernel(nrs, "viscosity");
constantFillKernel = udfBuildKernel(nrs, "constantFill");
heatCapacityKernel = udfBuildKernel(nrs, "heatCapacity");
conductivityKernel = udfBuildKernel(nrs, "conductivity");
}
In order to write these device kernels, you will need some background in programming with OCCA. Please consult the OCCA documentation before proceeding [1].
First, let’s look at the constantFill kernel. Here, we want to write a device kernel
that assigns a constant value to a material property. So that we can have a general
function, we will write this such that it can be used to set constant (but potentially
different) properties in the fluid and solid phases for conjugate heat transfer
applications.
Note
Material properties for the flow equations (i.e. viscosity and density) do not need to be specified in the solid phase. If you define flow properties in solid regions, they are simply not used.
The constantFill kernel is defined in the .oudf file as follows [2]. OCCA
kernels operate on the device. As input parameters, they can take non-pointer objects
on the host (such as Nelements, fluid_val, and solid_val in this example),
as well as pointers to objects of type occa::memory, or device-side memory. The
device-side objects are indicated with the @restrict tag.
Note
Device-side memory in nekRS is by convention preceded with a o_ prefix in order
to differentiate from the host-side objects. In the initialization of nekRS, most of
the simulation data is copied over to the device. All calculations are done on the
device. The device-side solution is then only copied back onto the host for the
purpose of writing output files.
Warning
Because nekRS by default only copies the device-side solution back to the host for
the purpose of writing output files, if you touch any host-side objects in your
user-defined functions, such as in UDF_ExecuteStep, you must ensure
that you only use the host-side objects after they have been copied from device back
to the host. Otherwise, they would not be “up to date.” You can ensure that the host-
side objects reflect the real-time nekRS solution by either (a) only touching the
host-side solution on output writing steps (which can be determined based on the
nrs->isOutputStep variable), or (b) calling the appropriate routines in nekRS
to force data to be copied from the device back to the host. For the latter option,
please refer to the Copying From Device to Host section.
For this example, we
loop over all the elements. The eInfo parameter represents a mask, and takes a value
of zero for solid elements and a value of unity for fluid elements. Next, we loop over
all of the GLL points on the element, or p_Np. This variable is set within
nekRS to be the same as mesh->Np using the device variable feature described in
the Defining Variables to Access in Device Kernels
section. This particular variable is always available, and you do not need to pass it
explicitly into device functions. Finally, we set the value of the property to the
value specified in the function parameters.
@kernel void constantFill(const dlong Nelements, const dfloat fluid_val,
const dfloat solid_val, @restrict const dlong* eInfo, @restrict dfloat* property)
{
for (dlong e = 0; e < Nelements; ++e ; @outer(0))
{
const bool is_solid = eInfo[e];
for (int n = 0; n < p_Np; ++n ; @inner(0))
{
const int id = e * p_Np + n;
property[id] = fluid_val;
if (is_solid)
property[id] = solid_val;
}
}
}
Now, let’s look at the slightly more complex conductivity kernel. Here, our function
signature is very different from that of the constantFill kernel. While we still
pass the number of elements, we no longer need to check whether we are in a fluid element
or a solid element, since the conductivity for the second passive scalar is going to be
the same in both phases. All that we need to pass in is the coupled scalar scalar,
or \(\phi_0\) in our material property correlation \(k=5+\phi_0\) that we listed
earlier. The property passed in then should represent the conductivity we are setting.
@kernel void conductivity(const dlong Nelements, @restrict const dfloat* scalar,
@restrict dfloat* property)
{
for (dlong e = 0; e < Nelements; ++e ; @outer(0))
{
for (int n = 0; n < p_Np; ++n ; @inner(0))
{
const int id = e * p_Np + n;
const dfloat scalar = scalar[id];
property[id] = 5.0 + scalar;
}
}
}
A key aspect of writing device kernels is that the device kernel can only operate on
non-pointer objects or pointers to device memory. Whatever the form of your material properties,
you just need to be sure to pass in all necessary information. Now, let’s look at the even
more complex viscosity kernel. Here, we need to pass in the scalar \(\phi_0\) and the
\(z\)-coordinate that appear in the viscosity model.
@kernel void viscosity(const dlong Nelements, @restrict const dfloat* scalar,
@restrict const dfloat* z, @restrict dfloat* property)
{
for (dlong e = 0; e < Nelements; ++e ; @outer(0))
{
for (int n = 0; n < p_Np; ++n ; @inner(0))
{
const int id = e * p_Np + n;
const dfloat scalar = scalar[id];
const dfloat z = z[id];
property[id] = 2.1E-5 * exp(-scalar / 500.0) * (1.0 + z);
}
}
}
The final kernel that wraps up this example is the heatCapacity kernel.
Setting Custom Source Terms
Custom source terms can be added to the momentum conservation equation and/or the
energy conservation equation by assigning the udf.uEqnSource and
udf.sEqnSource function pointers, respectively, to functions with the appropriate signature.
Each of these cases are described separately next. The process is conceptually very similar
to the process for declaring custom properties in Setting Custom Properties,
so you may find it useful to first review that section.
The Momentum Equation
To set a custom source term for the momentum equation, you must assign the
udf.uEqnSource function pointer to a function with a signature that takes the nrs pointer
to the nekRS solution object, the simulation time time, the velocity solution on the device
o_U, and the momentum source term on the device o_FU. In UDF_Setup,
we need to assign an address to the udf.uEqnSource function pointer to a function
with the correct signature where we will eventually compute a momentum source. Our
UDF_Setup function would be as follows.
void UDF_Setup(nrs_t * nrs)
{
udf.uEqnSource = &custom_source;
}
Here, custom_source is our name for a function in the .udf file that computes the
momentum source. Its name is arbitrary, but it must have the following signature.
void custom_source(nrs_t * nrs, dfloat time, occa::memory o_U, occa::memory o_FU)
{
// compute the momentum source
}
Note
You must place the custom_source function _before_ UDF_Setup (and before any other
function that uses custom_source) in the .udf file in order for the just-in-time
compilation to success.
Suppose we would like to add a gravitational force to the \(z\) momentum equation, of form \(-\rho_fg\). For the momentum equation, the source term is defined on a per-mass basis; in other words, we must provide the vector \(\vec{f}\) for a source with strong form \(\rho\vec{f}\). Our custom source function would be as follows.
// declare all kernels we will be writing
static occa::kernel constantFillKernel;
void custom_source(nrs_t * nrs, dfloat time, occa::memory o_U, occa::memory o_FU)
{
mesh_t * mesh = nrs->mesh;
// what momentum equation we want to add gravity to
int component = 2;
constantFillKernel(nrs->mesh->Nelements, -9.81, component * nrs->fieldOffset, o_FU);
}
The constantFillKernel is a user-defined device kernel. This function must now be defined
in the .oudf file; the name is arbitrary. In order to link against our device kernels, we must
also instruct nekRS to use its just-in-time compilation to build those kernels. We do this in
UDF_LoadKernels by calling the udfBuildKernel function for the kernel. The second argument
to the udfBuildKernel function is the name of the kernel, which appears as the actual function
name of the desired kernel in the .oudf file.
void UDF_LoadKernels(nrs_t * nrs)
{
constantFillKernel = udfBuildKernel(nrs, "constantFill");
}
The constantFill kernel is now defined in the .oudf file as follows.
@kernel void constantFill(const dlong Nelements, const dfloat value,
const int offset, @restrict dfloat * source)
{
for (dlong e = 0; e < Nelements; ++e ; @outer(0))
{
for (int n = 0; n < p_Np; ++n ; @inner(0))
{
const int id = e * p_Np + n + offset;
source[id] = value;
}
}
}
The Energy Equation
Copying From Device to Host
All solutions take place on the host, and data transfer of the solution back to the host
must be manually performed by the user if you would like to access nrs->U, nrs->p,
nrs->cds->S, or other solution objects, in host-side functions. To copy the solution
from the device to the host, use the nek_ocopyFrom(double time, int tstep) routine in the
nekInterfaceAdapter.cpp file. This function performs the following actions:
1. Copy the nekRS solution from the nekRS device arrays to the nekRS host arrays - that is,
nrs->o_U is copied to nrs->U, and so on. This
allows you to access the solution on the host as nrs->U, nrs->p, nrs->S, etc.
Copy the nekRS solution from the nekRS host arrays to the Nek5000 backend arrays.
If you only want to access the nekRS host side arays such as nrs->U, you can skip the
second part by directly using OCCA memory copy functions like the following, which
copies from the device array nrs->o_U to the host array nrs->U.
nrs->o_U.copyTo(nrs->U);
Calculating the Distance to a Wall
nekRS allows users to access many Nek5000 “backends” through the (optional)
<case>.usr file. A common use case is to calculate the distance from each
GLL point to a boundary, such as for setting initial conditions for turbulent quantities
or other closures. The procedure to compute and then use these values is as follows.
First, in the usrdat2 subroutine, make sure that all boundaries for which
you want to compute the distance for are marked as “wall” boundaries in the cbc array.
In the example shown below, we assume that
the mesh already has sidesets defined in it (assigned through Cubit/gmsh/however else
the mesh was created). We then loop over all the GLL points and determine
if the point is on the boundary of interest by checking if the boundary ID is
equal to the sideset of interest. This is done by checking the absolute difference
between the bc array and the sideset value of interest (in this example, the sideset
is 7). If the boundary ID matches the sideset of interest, then we set the cbc array
to W, or the character that indicates a no-slip wall boundary.
subroutine usrdat2
include 'SIZE'
include 'TOTAL'
integer e,f
n = lx1*ly1*lz1*nelv
nxz = nx1*nz1
nface = 2*ldim
do iel=1,nelv
do ifc=1,2*ndim
if (abs((bc(5,ifc,iel,1)-7.0)).lt.1e-4) cbc(ifc,iel,1)= 'W '
enddo
enddo
return
end
In other words,
if your wall boundaries were instead boundaries 3 and 4, the if (abs...) lines
in the above example would become:
if (abs((bc(5,ifc,iel,1)-3.0)).lt.1e-4) cbc(ifc,iel,1)= 'W '
if (abs((bc(5,ifc,iel,1)-4.0)).lt.1e-4) cbc(ifc,iel,1)= 'W '
Next, in the usrdat3 subroutine, you simply need to call the
dist function, which loops over all boundaries with W type
and determines the distance of all GLL points to those boundaries.
The result of the calculation should be stored into the nrs_scptr(1) pointer,
which is then what we will access in the .udf file.
subroutine usrdat3
include 'SIZE'
include 'TOTAL'
common /scrach_o1/
w1(lx1*ly1*lz1*lelv)
,w2(lx1*ly1*lz1*lelv)
,w3(lx1*ly1*lz1*lelv)
,w4(lx1*ly1*lz1*lelv)
,w5(lx1*ly1*lz1*lelv)
common /scrach_o2/
ywd(lx1,ly1,lz1,lelv)
COMMON /NRSSCPTR/ nrs_scptr(1)
integer*8 nrs_scptr
call distf(ywd,7,'W ',w1,w2,w3,w4,w5)
nrs_scptr(1) = loc(ywd)
return
end
In other words, if your wall boundaries were instead boundaries 3 and 4, the
call distf... lines in the above example would become:
call distf(ywd,3,'W ',w1,w2,w3,w4,w5)
call distf(ywd,4,'W ',w1,w2,w3,w4,w5)
Then, you can access the results of the distance-to-wall calculation in the .udf
by assigning a pointer to the nek::scPtr(1) array. Note that this call must be
within UDF_ExecuteStep so that the Nek5000 backend will have been called first.
void UDF_ExecuteStep(nrs_t * nrs, dfloat time, int tstep)
{
double * wall_distance = (double *) nek::scPtr(1);
// then, you can copy it into some device-side memory so you can use it in
// BCs if you want
auto mesh = nrs->meshV;
int n_gll_points = mesh->Np * mesh->Nelements;
int write_location = 2; // "slice" into which you want to write, in case nrs->o_usrwrk holds other info
nrs->o_usrwrk.copyFrom(wall_distance, n_gll_points * sizeof(dfloat), write_location * nrs->fieldOffset * sizeof(dfloat));
}
Footnotes