MOST wall model for ABL simulations by loretet · Pull Request #2385 · ExtremeFLOW/neko

loretet · 2026-03-16T17:06:08Z

This PR implements the Monin-Obukhov Similarity Theory (MOST) wall model on CPU and GPU.

Description

MOST is the most common wall model in atmospheric science. The friction velocity $u_*$ is computed from empirical similarity functions that depend on the thermal stratification of the boundary layer (this version uses the same functions as the PALM model). The wall shear stress (WSS) $\tau$ is then computed from $u_*$ in the usual way and applied as a boundary condition.
More info can be found in Stull (1988) or Rotach and Holtslag (2025).
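To make the idea concrete, here is a minimal sketch of a stability-corrected log law. It is not the code in this PR: it assumes the classic Businger-Dyer similarity functions as a stand-in (the PR uses the PALM functions, which are not reproduced here), takes the Obukhov length `L_ob` as already known, and the function names are hypothetical.

```python
import math

KAPPA = 0.4  # von Karman constant, same value as in the example case file


def psi_m(zeta):
    """Integrated momentum stability correction (Businger-Dyer form, assumed)."""
    if zeta >= 0.0:
        # Stable or neutral stratification: linear correction
        return -5.0 * zeta
    # Unstable stratification
    x = (1.0 - 16.0 * zeta) ** 0.25
    return (2.0 * math.log((1.0 + x) / 2.0)
            + math.log((1.0 + x * x) / 2.0)
            - 2.0 * math.atan(x) + math.pi / 2.0)


def friction_velocity(u_mag, h, z0, L_ob):
    """u_* from the velocity magnitude sampled at height h.

    L_ob = math.inf recovers the neutral rough log law.
    """
    denom = math.log(h / z0) - psi_m(h / L_ob) + psi_m(z0 / L_ob)
    return KAPPA * u_mag / denom


def wall_shear_stress(u_star, rho=1.0):
    """Magnitude of the wall shear stress, tau = rho * u_*^2."""
    return rho * u_star ** 2
```

With `L_ob = math.inf` the correction terms vanish and the expression reduces to the rough log law, which is why the two wall models share so much structure.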

The MOST implementation is very similar to the rough_log_law wall model, but it is more complex because the algorithm has to branch: the similarity functions and the stress expression take different forms depending on whether a Dirichlet or Neumann boundary condition is used for temperature, and on the local stratification of the flow (which has to be computed at every point and every timestep). I suspect this makes the GPU implementation not very efficient (especially when it is coded by a noob like me :D).
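The branching can be pictured with the following sketch. It only illustrates the decision structure and is not the PR's algorithm: the sign conventions, the choice of the surface temperature as reference in the bulk Richardson number, the tolerance, and all function names are assumptions.

```python
def bulk_richardson(g, h, theta_air, theta_surf, u_mag):
    """Bulk Richardson number between the surface and the sampling height h.

    Assumption: the surface temperature is used as the reference temperature.
    """
    return g * h * (theta_air - theta_surf) / (theta_surf * u_mag ** 2)


def classify_regime(bc_type, bc_value, *, g=9.80665, h=None,
                    theta_air=None, u_mag=None, tol=1e-8):
    """Return 'stable', 'neutral' or 'unstable' for one boundary point.

    bc_type  : 'neumann' (bc_value is the surface heat flux) or
               'dirichlet' (bc_value is the surface temperature)
    """
    if bc_type == "neumann":
        # Assumed convention: positive (upward) heat flux means unstable
        indicator = -bc_value
    elif bc_type == "dirichlet":
        indicator = bulk_richardson(g, h, theta_air, bc_value, u_mag)
    else:
        raise ValueError(f"unknown temperature b.c. type: {bc_type!r}")

    if abs(indicator) < tol:
        return "neutral"
    return "stable" if indicator > 0.0 else "unstable"
```

Each boundary point can land in a different branch at each timestep, so threads in the same GPU warp may diverge, which is one plausible source of the inefficiency mentioned above.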

Implementation

The MOST wall model requires more parameters than rough_log_law. An example case-file entry is as follows:

"boundary_conditions": [
    {
        "type": "wall_model",
        "model": "most",
        "kappa": 0.4,  // von Karman constant
        "z0": 0.1,  // the momentum roughness length
        "z0h": 0.1,  // the heat roughness length (if negative, the Zilitinkevich formula is used)
        "type_of_temp_bc": "neumann",  // supports either Neumann or Dirichlet as b.c.
        "bottom_bc_flux_or_temp": 0.05,  // the temperature or flux value imposed at the boundary
        "zone_indices": [
            5
        ],
        "h_index": 1  // the model samples the flow at the first index above the boundary
    },
....
]
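As a rough illustration of how these parameters constrain each other, the sketch below checks one such entry after it has been parsed into a Python dictionary. The helper name `check_most_params` is hypothetical, and treating every key as mandatory and the b.c. type as case-insensitive are assumptions, not behaviour confirmed by the PR.

```python
REQUIRED_KEYS = ("kappa", "z0", "z0h", "type_of_temp_bc",
                 "bottom_bc_flux_or_temp", "zone_indices", "h_index")


def check_most_params(bc):
    """Return a list of problems found in one MOST boundary-condition entry."""
    problems = []
    if bc.get("type") != "wall_model" or bc.get("model") != "most":
        problems.append("entry is not a MOST wall model")
    for key in REQUIRED_KEYS:
        if key not in bc:
            problems.append(f"missing key: {key}")
    # The log law needs log(h / z0), so z0 has to be strictly positive
    if "z0" in bc and bc["z0"] <= 0.0:
        problems.append("z0 must be positive")
    # Negative z0h selects the Zilitinkevich formula; zero is meaningless
    if "z0h" in bc and bc["z0h"] == 0.0:
        problems.append("z0h must be nonzero")
    if "type_of_temp_bc" in bc and \
            str(bc["type_of_temp_bc"]).lower() not in ("neumann", "dirichlet"):
        problems.append("type_of_temp_bc must be 'neumann' or 'dirichlet'")
    return problems
```

An empty list means the entry passed these basic checks; it says nothing about whether the values are physically sensible for a given case.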

Limitations/issues

At the moment, a typical ABL simulation requires specifying the same variables (e.g. g or q) multiple times in the case file. This redundancy can be reduced by using the constants group (#2228).
Another limitation is that, for now, the wall model can only be used on one boundary at a time (it is unlikely that anybody will ever need it on two different boundaries, but it is still possible future work).

Results

Here is a comparison between the CPU and GPU implementations of the wall model in Neko. Both the rough log-law and the MOST model appear. The simulation had a geostrophic wind of 10 m/s and a bottom heat flux of 0.05 K m s$^{-1}$, used buoyancy-corrected Vreman as the SGS model, and had 24 elements (polynomial degree 7).
Coding-wise, the CPU and GPU implementations look exactly the same to me, and I think the slight difference in the results is acceptable, although maybe a bit surprising (tested on Dardel). Here, $v$ is the spanwise velocity.

Note: Nek5000 doesn't appear in the plots. This is because Nek5000 and Neko have always given slightly different results in some ABL flows, and we are still investigating this issue. However, the discrepancy between the codes should not be due to the wall model, hence the PR.

loretet · 2026-03-17T10:13:40Z

Quick update after the March 17 developers' meeting:

The difference between CPU and GPU is concerning for Niclas. Philipp suggested investigating the influence of the time averaging by computing batch averages, and Siavash suggested also looking at time averages taken shortly after restarting from the same file. I will do that and post some updates.
Tim suggested opening a separate PR/issue/discussion for the difference between the CPU and GPU implementations of the rough_log_law model, which shows differences even though it is a much simpler model. I will do that as well.
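One possible way to do the batch-average check is sketched below. It is only an illustration: the function names are hypothetical, and the standard error is only meaningful if each batch is long compared to the correlation time of the signal.

```python
import math


def batch_means(samples, n_batches):
    """Split a time series into equal batches and return each batch mean."""
    size = len(samples) // n_batches
    if n_batches < 2 or size == 0:
        raise ValueError("need at least 2 batches with 1 sample each")
    return [sum(samples[i * size:(i + 1) * size]) / size
            for i in range(n_batches)]


def mean_and_stderr(samples, n_batches):
    """Grand mean and its standard error estimated from the batch means."""
    means = batch_means(samples, n_batches)
    grand = sum(means) / n_batches
    var = sum((m - grand) ** 2 for m in means) / (n_batches - 1)
    return grand, math.sqrt(var / n_batches)


def compatible(cpu_series, gpu_series, n_batches, n_sigma=2.0):
    """True if the two means agree within n_sigma combined standard errors."""
    m_cpu, e_cpu = mean_and_stderr(cpu_series, n_batches)
    m_gpu, e_gpu = mean_and_stderr(gpu_series, n_batches)
    return abs(m_cpu - m_gpu) <= n_sigma * math.hypot(e_cpu, e_gpu)
```

If the CPU and GPU profiles are compatible in this sense, the difference in the plots may simply be an averaging artefact; if not, it points to a real difference between the backends.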


linneahuusko

Nice work! I didn't look much at the device files since I don't really know what to look for there, but the rest looks good. I left some minor comments and some thoughts on things we should consider.

linneahuusko · 2026-03-18T09:02:37Z

src/wall_models/bcknd/cpu/most_cpu.f90

+    real(kind=rp) :: magu, utau, normu, z0h
+    real(kind=rp) :: L_ob, L_upper, L_lower, L_old
+    real(kind=rp) :: Ri_b, f, dfdl, fd_h, L_new, L_sign
+    real(kind=rp), parameter :: g = 9.80665_rp


The gravity should not be hardcoded here, since we allow it as a parameter in the case file for other components (e.g., Boussinesq, Deardorff). I guess you are assuming the direction of gravity to be wall-normal, which makes sense in our cases, but how can we generalize it? Making it truly general would be a lot of work, I think, but we should at least let the user provide a g_vector and then raise an error if it has any nonzero component other than the one in the z direction.

Yes, you are right, we can/should definitely generalise it. I have to include rho (and mu) in the wall model inputs as well, so I will add g too (then the user can specify them all in a constants block in the case file).
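The check suggested above could look roughly like this. It is a sketch of the logic only, assuming the wall normal is aligned with z; the function name and the tolerance are hypothetical.

```python
def wall_normal_gravity(g_vector, tol=1e-12):
    """Return the magnitude of gravity if it acts along z only.

    Raises ValueError if the x or y component is nonzero, since the
    wall model is assumed to treat gravity as wall-normal.
    """
    gx, gy, gz = g_vector
    if abs(gx) > tol or abs(gy) > tol:
        raise ValueError(
            f"MOST wall model assumes gravity along z, got g_vector={g_vector!r}")
    return abs(gz)
```

For example, `wall_normal_gravity([0.0, 0.0, -9.80665])` returns `9.80665`, while a vector with a nonzero x or y component raises an error instead of silently giving a wrong Obukhov length.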


linneahuusko · 2026-03-18T09:20:12Z

src/wall_models/bcknd/cpu/most_cpu.f90

+    ! Print some indicative quantities (these are just point quantities: don't trust 100%)
+    call neko_log%section('Wall model quick look')
+    write(log_buf, '(A,E15.7)') 'Ri_b: ', Ri_b
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'L_ob: ', L_ob
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'utau: ', utau
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'magu: ', magu
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'ts: ', ts
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'ti: ', ti
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'q: ', q
+    call neko_log%message(trim(log_buf))
+    write(log_buf, '(A,E15.7)') 'hi: ', hi
+    call neko_log%message(trim(log_buf))
+    call neko_log%end_section()
+


Here I think the goal should be to get the mean, min and max of these variables, and also to write them to a separate file.

I agree. I didn't want to include this part at first, but I left it in so that the user can get a very rough idea of whether there are weird values or NaNs. Better diagnostics are definitely necessary.
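A minimal sketch of such diagnostics is given below, assuming the per-point values are available as plain lists on a single rank (a parallel run would need global reductions instead). The function names and the CSV layout are hypothetical.

```python
import csv
import math


def summarize(values):
    """Min, mean and max over the finite entries, plus a count of NaN/inf."""
    finite = [v for v in values if math.isfinite(v)]
    n_bad = len(values) - len(finite)
    if not finite:
        return {"min": math.nan, "mean": math.nan, "max": math.nan,
                "n_bad": n_bad}
    return {"min": min(finite), "mean": sum(finite) / len(finite),
            "max": max(finite), "n_bad": n_bad}


def write_diagnostics(stream, step, fields):
    """Append one CSV row per quantity: step, name, min, mean, max, n_bad."""
    writer = csv.writer(stream)
    for name, values in fields.items():
        s = summarize(values)
        writer.writerow([step, name, s["min"], s["mean"], s["max"], s["n_bad"]])
```

Reporting the number of non-finite values separately keeps the NaN check that the current point-wise log provides, while the min/mean/max describe the whole boundary rather than a single point.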


linneahuusko · 2026-03-18T09:31:54Z

src/wall_models/most.f90

+    u => neko_registry%get_field("u")
+    v => neko_registry%get_field("v")
+    w => neko_registry%get_field("w")
+    temp => neko_registry%get_field("temperature")


Maybe we should allow names other than "temperature" for the scalar field? Like in the buoyancy-corrected Vreman, where we put "scalar_field" in the JSON file (that does add another parameter to the JSON, but it might be good not to assume field names).


timofeymukha · 2026-03-18T10:42:20Z

src/wall_models/bcknd/device/hip/most_kernel.h

+
+        // Zilitinkevich 1995 correlation for thermal roughness
+        T z0h;
+        if (z0h_in < 0.0) {
+            const T nu = 1.46e-5; // Kinematic viscosity of air [m^2/s]
+            const T Re_z0 = (utau * z0) / nu;
+            z0h = z0 * exp(-0.1 * sqrt(Re_z0));
+        } else {
+            z0h = z0h_in;
+        }


Just a first comment, haven't gotten to look at the entire code yet.

I really don't like the hardcoded nu here, because it is completely independent of the material properties in the simulation's case file. I think one either picks up the mu / rho from the case file (The wall model should have access to these, I think), or we just make z0h mandatory, and people have to use whatever correlation they want, but outside Neko.

Yes I agree, consider it some kind of placeholder for now: I want to include rho in the wall model computation and I will include mu as well (rho HAS to be there so it's a natural step to add mu as well). Thanks for spotting it!
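The first option discussed here (taking the viscosity from the case-file material properties) could look like the sketch below. It mirrors the kernel excerpt above, with `nu = mu / rho` replacing the hardcoded value; the function name and argument list are hypothetical.

```python
import math


def thermal_roughness(z0, z0h_in, utau, mu, rho):
    """Heat roughness length z0h.

    A non-negative z0h_in is used as given. A negative z0h_in selects the
    Zilitinkevich (1995) correlation, as in the kernel excerpt above, but
    with the kinematic viscosity computed from mu and rho.
    """
    if z0h_in >= 0.0:
        return z0h_in
    nu = mu / rho                 # kinematic viscosity from material properties
    re_z0 = utau * z0 / nu        # roughness Reynolds number
    return z0 * math.exp(-0.1 * math.sqrt(re_z0))
```

Because `z0h` now follows the simulation's own `mu` and `rho`, a non-dimensional case no longer picks up a viscosity of air in SI units by accident.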

timofeymukha · 2026-03-18T10:44:53Z

@loretet I'm going to order a copilot review now. It will probably spit out a lot of comments, and for now you can just focus on those related to the documentation. Like, it will hopefully pick up that the copyright year is wrong in some files, and similar stuff. For "tougher" comments, read them if you want, or you can wait for me to triage them first.

Copilot

Pull request overview

This PR adds a new Monin–Obukhov Similarity Theory (MOST) wall model to Neko, including both CPU and GPU (CUDA/HIP) implementations, and wires it into the wall-model factory and build system.

Changes:

Add most wall model type with JSON-driven configuration and compute path selecting CPU vs device backend.
Add CPU kernel and CUDA/HIP device kernels + wrappers for MOST.
Register most in the wall model factory and include new sources/headers in the build/dependency lists.

Reviewed changes

Copilot reviewed 13 out of 14 changed files in this pull request and generated 24 comments.


| File | Description |
| --- | --- |
| `src/wall_models/wall_model_fctry.f90` | Registers `most` type in the wall-model factory. |
| `src/wall_models/wall_model.f90` | Indentation-only change in `wall_model_find_points`. |
| `src/wall_models/most.f90` | New MOST wall model type, JSON parsing, compute dispatch, lifecycle methods. |
| `src/wall_models/bcknd/cpu/most_cpu.f90` | New CPU implementation of MOST wall stress computation. |
| `src/wall_models/bcknd/device/most_device.F90` | Fortran-side device wrapper and C-bind interfaces for MOST. |
| `src/wall_models/bcknd/device/hip/most_kernel.h` | New HIP kernel header implementing MOST logic. |
| `src/wall_models/bcknd/device/hip/most.hip` | HIP launcher that selects kernel variant based on `bc_type`. |
| `src/wall_models/bcknd/device/cuda/most_kernel.h` | New CUDA kernel header implementing MOST logic. |
| `src/wall_models/bcknd/device/cuda/most.cu` | CUDA launcher that selects kernel variant based on `bc_type`. |
| `src/wall_models/bcknd/device/hip/rough_log_law_kernel.h` | Adds licence header. |
| `src/wall_models/bcknd/device/cuda/rough_log_law_kernel.h` | Adds licence header; adjusts a comment (currently incorrect). |
| `src/Makefile.am` | Adds MOST sources to CPU + HIP/CUDA build lists and EXTRA_DIST. |
| `src/.depends` | Updates Fortran dependency list for new MOST modules. |
| `src/.depends_device` | Updates device dependency list for new MOST CUDA/HIP objects. |


timofeymukha · 2026-03-18T11:09:12Z

src/wall_models/bcknd/device/most_device.F90

+       type(c_ptr), value :: u_d, v_d, w_d, temp_d
+       type(c_ptr), value :: ind_r_d, ind_s_d, ind_t_d, ind_e_d
+       type(c_ptr), value :: n_x_d, n_y_d, n_z_d, h_d
+       type(c_ptr) :: bc_type


See also below. Let's avoid all of this by not passing a string to device kernels.
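One way to avoid the string is to convert it to an integer flag on the host and pass only that integer to the kernel launcher. The sketch below shows the host-side mapping; the enum name, its values, and the helper are hypothetical and not taken from the PR.

```python
from enum import IntEnum


class TempBC(IntEnum):
    """Integer codes for the temperature b.c. type (values are an assumption)."""
    NEUMANN = 0
    DIRICHLET = 1


def parse_temp_bc(name):
    """Map the case-file string to an integer flag, once, on the host."""
    try:
        return TempBC[name.strip().upper()]
    except KeyError:
        raise ValueError(f"unknown type_of_temp_bc: {name!r}") from None
```

The string is then validated a single time at initialisation, and the device interface only needs a plain integer argument by value instead of a `c_ptr` to character data.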

timofeymukha · 2026-03-18T11:09:41Z

src/wall_models/bcknd/device/hip/most.hip

+ POSSIBILITY OF SUCH DAMAGE.
+*/
+
+#include <hip/hip_runtime.h>


Correct, but will be resolved if we don't use strings for bc_type


Co-authored-by: Linnea Huusko <113937514+linneahuusko@users.noreply.github.com>

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

loretet added 30 commits February 3, 2026 11:19

first submission

102fa20

first submission

faddf61

placeholder files

a1bdc86

added call to neko error

3e9fe47

first non-working commit of joint stable/unstable model

483e473

general debugging (first round)

cfd20d1

implemented neutral branch

22efd0f

renaming + minor fixes

ae2c913

split 'splt_bc_type' function in three

869976a

z0t renamed to z0h

d32f8c7

added z0h computation as in Zilitinkevich (commented out)

1141b23

added 'most' to wall models

3c2f43b

first commit

f5c26aa

fixed missing 'import rp' statement

02b2b0d

fixed slaw interfaces imports

2367ccb

fixed spelling errors

ac44926

fixed some function callings

ca2ad4f

fixed neko_error call

4b82c2a

fixed spelling mistake

62ee1ed

changed 'most_convective' to 'most'

c2b794d

fixed dependencies

e5d460a

minor fix

7575fb0

added logging

9c91fd8

fixed iterator logic

04617ac

fixed log indentation

2d63e92

fixed variable order in slaw call

6b7494f

added surf temp retrieval

20ec494

fixed ts declaration

d1cbeeb

switched to json_get for zone_indices retrieval

82a8f62

fixed wrong zone_idx dimension

8ba36a2

loretet added the don't merge Don't merge yet! label Mar 16, 2026

loretet added this to Neko v1.1.0 release Mar 16, 2026

github-project-automation bot moved this to 📋 Todo in Neko v1.1.0 release Mar 16, 2026

loretet added 4 commits March 17, 2026 12:02

Merge branch 'dev/most' of github.com:loretet/neko into dev/most

7a864b8

Merge branch 'develop' into dev/most

c9eb7e8

Merge branch 'develop' into dev/most

afb96b0

Merge remote-tracking branch 'refs/remotes/origin/dev/most' into dev/…

fa8e0c4

…most

linneahuusko reviewed Mar 18, 2026


Merge branch 'develop' into dev/most

73968bd

timofeymukha reviewed Mar 18, 2026


timofeymukha requested a review from Copilot March 18, 2026 10:45

Copilot started reviewing on behalf of timofeymukha March 18, 2026 10:45 View session

Copilot AI reviewed Mar 18, 2026


loretet and others added 15 commits March 18, 2026 14:11

changed k to 0.4

a456839

deleted repeated line

52ee4b2

changed CUDA to HIP in comment

5921cfc

Co-authored-by: Linnea Huusko <113937514+linneahuusko@users.noreply.github.com>

added comment on z0h<0

1279b6e

removed call to Vreman sgs

23ff39e

modified year

a79c74b

removed ts from logged quantities

6f865b5

changed CUDA to HIP in comment

261d325

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

typo in comment

50598fc

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

fixed most_compute call

b8baa7f

typo

6866df0

modified a comment in header

36c35a5

removed some comments

a7e9195

removed some unused vars

51b6801

Merge branch 'feature/most' of github.com:ExtremeFLOW/neko into dev/most

e63a8bb
