File-By-File GPU Audit¶
Generated by tools/generate_gpu_file_audit.py on 2026-05-19 11:11 UTC.
This page is a release-gate inventory for the GPU branch. It lists every top-level Fortran source, its runtime relevance, GPU status, retained environment switches, and developer notes. The status is generated from source markers plus curated role metadata; update the role map in the generator when ownership changes.
Summary¶
- Top-level Fortran files audited:
69 - LESGO-owned GPU checkpoints:
17 - CPU reference checkpoints preserved but excluded from GPU count:
2 - System probes:
2
Retained GPU Checkpoints¶
LESGO_ATM_DIAG_TIMINGLESGO_ATM_LB_AUTO_SELECTLESGO_ATM_LB_VALIDATELESGO_ATM_POINT_OWNER_LBLESGO_ATM_POINT_OWNER_TARGETEDLESGO_MPI_CUDA_DEBUGLESGO_MPI_CUDA_SYNCLESGO_PRESS_DIRECT_THOMAS_OUTLESGO_PRESS_RHS_HALO_COMBINEDLESGO_PRESS_STAGE_TIMINGLESGO_PRESS_TRANSPOSE_GENERICLESGO_PRESS_TRANSPOSE_TIMINGLESGO_PROJECT_STAGE_TIMINGLESGO_SGS_CALCSIJ_EXPLICITLESGO_SGS_HALO_COMBINEDLESGO_SGS_STAGE_TIMINGLESGO_SGS_STRICT_SYNC
CPU Reference Checkpoints¶
LESGO_CPU_REF_TIME_FORCINGLESGO_CPU_REF_TIME_TOTAL
Default Half-Channel SGS Audit Note¶
The latest 128^3 half-channel validation uses sgs_model=5, so it exercises the Lagrangian scale-dependent SGS path rather than only the short actuator-turbine benchmark path. The relevant runtime files are sgs_stag_util.f90, scaledep_dynamic.f90, interpolag_Sdep.f90, and lagrange_Sdep.f90; these cover the calc_Sij/Nu_t/tau construction plus the F_NN, F_QN, F_MM, and F_LM update work after model initialization.
Audit Matrix¶
| File | Area | Role | Relevance | CMake/Option | GPU Status | Procedures | Switches | Loop Markers | GPU/MPI Markers | Developer Note |
|---|---|---|---|---|---|---|---|---|---|---|
actuator_turbine_model.f90 |
ATM | Turbine physics, yaw/rotation, blade force logic | Runtime hot path when USE_ATM | USE_ATM, USE_TURBINES | GPU-enabled | actuator_turbine_model, atm_model_cuda_check, atm_model_cuda_sync, atm_initialize, atm_read_actuator_points, atm_read_restart, +19 more |
- | 60 | CUF/global=6; CUDA/device=40; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
atm_base.f90 |
ATM | ATM type definitions | Setup/runtime support | USE_ATM | Review if runtime-active | atm_base, error, interpolate, vector_add, vector_divide, vector_multiply, +5 more |
- | 2 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
atm_input_util.f90 |
ATM | ATM input parsing | I/O/setup | USE_ATM, USE_TURBINES | GPU-enabled | atm_input_util, read_input_conf, read_turbine_model_variables, atm_print_initialize, read_airfoil, readline, +1 more |
- | 15 | CUF/global=0; CUDA/device=34; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
atm_lesgo_interface.f90 |
ATM | GPU LESGO/ATM interface, sampling, gather, point-owner LB | Runtime hot path when USE_ATM | USE_ATM, USE_HIT, USE_TURBINES | GPU-enabled | atm_lesgo_interface, atm_lb_auto_record, atm_diag_event_start, atm_diag_event_stop, atm_diag_event_flush, atm_cuda_check, +43 more |
LESGO_ATM_DIAG_TIMING, LESGO_ATM_LB_AUTO_SELECT, LESGO_ATM_LB_VALIDATE, LESGO_ATM_POINT_OWNER_LB, LESGO_ATM_POINT_OWNER_TARGETED |
149 | CUF/global=23; CUDA/device=141; MPI=198 | Legacy ATM path is default; point-owner LB remains experimental and must validate force sums. |
cfl_util.f90 |
Diagnostics | GPU CFL reduction | Runtime diagnostic | default/core | GPU-enabled | cfl_util, cfl_cuda_sync, get_max_cfl, get_cfl_dt |
- | 6 | CUF/global=2; CUDA/device=21; MPI=10 | Runtime loops have GPU coverage; preserve device-resident data flow. |
clocks.f90 |
Timing | Wall-clock timers | Runtime support | default/core | Review if runtime-active | clock_m, start, stop |
- | 0 | CUF/global=0; CUDA/device=0; MPI=4 | Audit before using in a new active configuration. |
concurrent_precursor.f90 |
CPS | Concurrent precursor coordination | Optional runtime | USE_CPS, USE_SCALARS | GPU-enabled | concurrent_precursor, initialize_cps, synchronize_cps, inflow_cps |
- | 4 | CUF/global=1; CUDA/device=9; MPI=30 | Runtime loops have GPU coverage; preserve device-resident data flow. |
convec.f90 |
Convection | Production GPU convection path | Runtime hot path | default/core | GPU-enabled | convec_cuda_m, convec_cuda_impl, ensure_convec_cuda, padd_3d_dp, unpadd_3d_dp, check_convec_cuda, +3 more |
- | 53 | CUF/global=16; CUDA/device=127; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
coriolis.f90 |
Forcing | GPU Coriolis forcing loops | Runtime | default/core | GPU-enabled | coriolis, coriolis_cuda_sync, coriolis_init, coriolis_finalize, coriolis_calc |
- | 9 | CUF/global=3; CUDA/device=22; MPI=7 | Runtime loops have GPU coverage; preserve device-resident data flow. |
cuda_mpi_debug.f90 |
MPI/GPU debug | Concise CUDA-aware MPI pointer/sync diagnostics | Debug support | default/core | GPU-enabled | cuda_mpi_debug, cuda_mpi_debug_init, print_env, cuda_pre, cuda_post, probe_real, +5 more |
LESGO_MPI_CUDA_DEBUG, LESGO_MPI_CUDA_SYNC |
0 | CUF/global=0; CUDA/device=107; MPI=41 | Retained switch should be documented before changing behavior. |
derivatives.f90 |
Derivatives | GPU spatial derivative kernels | Runtime hot path | default/core | GPU-enabled | derivatives, filt_da_cuda, xy_derivs_cuda, stress_uv_xy_derivs_cuda, stress_w_xy_derivs_cuda, stress_uv_div_cuda, +19 more |
- | 114 | CUF/global=38; CUDA/device=305; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
divstress_uv.f90 |
SGS/divstress | GPU horizontal stress divergence | Runtime hot path | default/core | GPU-enabled | divstress_uv |
- | 5 | CUF/global=2; CUDA/device=25; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
divstress_w.f90 |
SGS/divstress | GPU vertical stress divergence | Runtime hot path | default/core | GPU-enabled | divstress_w |
- | 24 | CUF/global=6; CUDA/device=37; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
emul_complex.f90 |
Math utility | Complex arithmetic compatibility | Shared | USE_SCALARS | Review if runtime-active | emul_complex, procedure, procedure, procedure, mul_real_complex_imag_scalar, mul_real_complex_2D, +2 more |
- | 6 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
fft.f90 |
FFT | FFT plan/state support | Runtime support | default/core | Review if runtime-active | fft, padd, unpadd, init_fft, init_wavenumber |
- | 2 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
finalize.f90 |
Finalization | Shutdown and cleanup | Finalization | USE_ATM, USE_TURBINES | Review if runtime-active | finalize |
- | 0 | CUF/global=0; CUDA/device=0; MPI=4 | Audit before using in a new active configuration. |
forcing.f90 |
Forcing/projection | GPU forcing, applied force reset, projection timing | Runtime hot path | USE_ATM, USE_HIT | GPU-enabled | forcing, forcing_cuda_sync, forcing_random, forcing_applied, forcing_induced, project, +7 more |
LESGO_PROJECT_STAGE_TIMING |
41 | CUF/global=13; CUDA/device=88; MPI=99 | Retained switch should be documented before changing behavior. |
fringe.f90 |
Inflow | Fringe-region support | Runtime | default/core | GPU-enabled | fringe, procedure, constructor |
- | 2 | CUF/global=0; CUDA/device=2; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
functions.f90 |
Utility | Shared GPU-enabled helper loops and math | Shared/runtime | default/core | GPU-enabled | functions, procedure, procedure, procedure, procedure, procedure, +15 more |
- | 12 | CUF/global=2; CUDA/device=21; MPI=19 | Runtime loops have GPU coverage; preserve device-resident data flow. |
grid.f90 |
Grid | Grid metadata and coordinates | Runtime data | default/core | Review if runtime-active | grid_m, build |
- | 5 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
hit_inflow.f90 |
HIT | Optional HIT input/reference path | Optional | USE_HIT | GPU-enabled | hit_inflow, initialize_HIT, extract_HIT_data, compute_HIT_plane_data, inflow_HIT, hit_write_restart, +2 more |
- | 27 | CUF/global=2; CUDA/device=60; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
hit_inflow_gpu.f90 |
HIT | GPU HIT inflow helpers | Optional runtime | USE_HIT | GPU-enabled | hit_inflow_gpu, hit_gpu_setup, hit_compute_plane_gpu, hit_apply_fringe_gpu |
- | 9 | CUF/global=2; CUDA/device=131; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
inflow.f90 |
Inflow | GPU inflow forcing/runtime loops | Runtime | USE_CPS, USE_HIT | GPU-enabled | inflow, inflow_cuda_sync, inflow_init, apply_inflow, inflow_uniform |
- | 4 | CUF/global=1; CUDA/device=41; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
init_random_seed.f90 |
Initialization | Random seed control | Setup | default/core | Review if runtime-active | init_random_seed, lcg |
LESGO_RANDOM_SEED |
2 | CUF/global=0; CUDA/device=0; MPI=0 | Retained switch should be documented before changing behavior. |
initial.f90 |
Initialization | Initial condition setup and device state | Initialization | USE_SCALARS, USE_TURBINES | GPU-enabled | initial, initial_cuda_sync, initial_cuda_touch_velocity, ic_uniform, check_for_interp, ic_file, +3 more |
- | 27 | CUF/global=2; CUDA/device=24; MPI=10 | Runtime loops have GPU coverage; preserve device-resident data flow. |
initialize.f90 |
Initialization | Startup allocation and setup | Initialization | USE_ATM, USE_CPS, USE_HIT, USE_LVLSET, USE_SCALARS, USE_TURBINES | Review if runtime-active | initialize |
- | 0 | CUF/global=0; CUDA/device=0; MPI=5 | Audit before using in a new active configuration. |
input_util.f90 |
Configuration | Input parsing helpers | I/O/setup | USE_HIT, USE_SCALARS, USE_TURBINES | CPU acceptable: setup/I/O | input_util, procedure, read_input_conf, domain_block, model_block, coriolis_block, +10 more |
- | 13 | CUF/global=0; CUDA/device=0; MPI=0 | CPU is acceptable unless this path becomes repeated inside the timestep. |
interpolag_Sdep.f90 |
SGS | Lagrangian scale-dependent interpolation | Runtime | default/core | GPU-enabled | interpolag_Sdep, interpolag_sdep_cuda_sync |
- | 17 | CUF/global=2; CUDA/device=22; MPI=17 | Exercised by the default half-channel sgs_model=5 validation; preserve device-resident Lagrangian data flow. |
interpolag_Ssim.f90 |
SGS | Lagrangian scale-sim interpolation | Runtime | default/core | GPU-enabled | interpolag_Ssim, interpolag_ssim_cuda_sync |
- | 11 | CUF/global=2; CUDA/device=22; MPI=13 | Runtime loops have GPU coverage; preserve device-resident data flow. |
io.f90 |
Output | Output and optional CGNS paths | I/O | USE_TURBINES | GPU-enabled | io, openfiles, energy, write_tau_wall_bot, write_tau_wall_top, write_parallel_cgns, +7 more |
- | 40 | CUF/global=1; CUDA/device=18; MPI=33 | Runtime loops have GPU coverage; preserve device-resident data flow. |
iwmles.f90 |
Wall model | Integral wall model support | Runtime | default/core | GPU-enabled | iwmles, iwm_cuda_sync, iwm_wallstress, iwm_init, iwm_finalize, iwm_calc_lhs, +5 more |
- | 28 | CUF/global=5; CUDA/device=38; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
lagrange_Sdep.f90 |
SGS | Lagrangian scale-dependent averaging | Runtime | default/core | GPU-enabled | lagrange_Sdep, lagrange_sdep_cuda_sync |
- | 23 | CUF/global=10; CUDA/device=59; MPI=19 | Covers F_NN, F_QN, F_MM, and F_LM update work in the default half-channel validation. |
lagrange_Ssim.f90 |
SGS | Lagrangian scale-sim averaging | Runtime | default/core | GPU-enabled | lagrange_Ssim, lagrange_ssim_cuda_sync |
- | 21 | CUF/global=9; CUDA/device=53; MPI=15 | Runtime loops have GPU coverage; preserve device-resident data flow. |
level_set.f90 |
Level set | Optional level-set runtime loops | Optional runtime | USE_HIT, USE_LVLSET | GPU-enabled | level_set, level_set_cuda_sync, level_set_init, level_set_vel_err, level_set_Cs_lag_dyn, level_set_lag_dyn, +35 more |
- | 106 | CUF/global=8; CUDA/device=34; MPI=165 | Runtime loops have GPU coverage; preserve device-resident data flow. |
level_set_base.f90 |
Level set | Level-set shared definitions | Optional setup | USE_LVLSET | GPU-enabled | level_set_base, level_set_base_init |
- | 0 | CUF/global=0; CUDA/device=3; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
linear_simple.f90 |
Math utility | Linear helper routines | Shared | default/core | Review if runtime-active | linear_simple, procedure, solve_linear, assert_eq2, assert_eq3, assert_eq4, +4 more |
- | 3 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
main.f90 |
Main driver | Timestep orchestration, timing, CPU/GPU comparison output | Runtime | USE_SCALARS, USE_TURBINES | GPU-enabled | main, main_cuda_sync |
LESGO_CPU_REF_TIME_FORCING, LESGO_CPU_REF_TIME_TOTAL |
33 | CUF/global=13; CUDA/device=49; MPI=29 | Retained switch should be documented before changing behavior. |
messages.f90 |
Messages | User-facing message helpers | Setup | default/core | CPU acceptable: setup/I/O | messages, procedure, procedure, message_a, message_ai, message_aiai, +20 more |
- | 0 | CUF/global=0; CUDA/device=0; MPI=0 | CPU is acceptable unless this path becomes repeated inside the timestep. |
mpi_defs.f90 |
MPI | MPI decomposition, rank/GPU binding, communication helpers | Runtime | USE_CPS | GPU-enabled | mpi_defs, initialize_mpi, bind_cuda_device, create_mpi_comms_cps, mpi_sync_real_array, sync_down, +2 more |
- | 1 | CUF/global=0; CUDA/device=52; MPI=90 | Runtime loops have GPU coverage; preserve device-resident data flow. |
mpi_transpose_mod.f90 |
MPI transpose | Transpose support used by pressure and spectral paths | Runtime | default/core | GPU-enabled | mpi_transpose_mod, mpi_transpose |
- | 8 | CUF/global=0; CUDA/device=1; MPI=14 | Runtime loops have GPU coverage; preserve device-resident data flow. |
param.f90 |
Configuration | Global parameters and CMake/runtime option state | Setup | default/core | CPU acceptable: setup/I/O | param |
- | 0 | CUF/global=0; CUDA/device=0; MPI=3 | CPU is acceptable unless this path becomes repeated inside the timestep. |
param_output.f90 |
Output | Output configuration | I/O/setup | default/core | CPU acceptable: setup/I/O | param_output |
- | 4 | CUF/global=0; CUDA/device=0; MPI=0 | CPU is acceptable unless this path becomes repeated inside the timestep. |
pid.f90 |
Utility | PID/control helper | Setup | default/core | Review if runtime-active | pid_m, procedure, constructor, advance_noset, advance_set |
- | 0 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
press_stag_array.f90 |
Pressure | GPU pressure RHS, halos, cuFFT orchestration | Runtime hot path | default/core | GPU-enabled | press_stag_array, press_cuda_sync, press_queue_event_start, press_queue_event_stop, press_queue_report, press_rhs_halo_audit, +12 more |
CUDA_VISIBLE_DEVICES, LESGO_PRESS_RHS_HALO_COMBINED, LESGO_PRESS_STAGE_TIMING, MPICH_GPU_SUPPORT_ENABLED |
80 | CUF/global=27; CUDA/device=250; MPI=137 | Pressure math and zero-mode handling are release-sensitive; validate divergence and wall stress after edits. |
rmsdiv.f90 |
Diagnostics | GPU divergence metric/reduction | Runtime diagnostic | default/core | GPU-enabled | rmsdiv |
- | 6 | CUF/global=1; CUDA/device=19; MPI=3 | Runtime loops have GPU coverage; preserve device-resident data flow. |
scalars.f90 |
Scalars | Optional scalar transport GPU loops | Optional runtime | USE_SCALARS | GPU-enabled | scalars, scalars_cuda_sync, scalars_copy_rhs_gpu, scalars_advective_gpu, scalars_flux_gpu, scalars_rhs_theta_gpu, +12 more |
- | 65 | CUF/global=20; CUDA/device=58; MPI=17 | Runtime loops have GPU coverage; preserve device-resident data flow. |
scaledep_dynamic.f90 |
SGS | Scale-dependent dynamic SGS model support | Runtime | default/core | GPU-enabled | scaledep_dynamic, scaledep_dynamic_cuda_sync |
- | 17 | CUF/global=8; CUDA/device=31; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
sgs_param.f90 |
SGS | SGS parameters | Setup/runtime support | default/core | GPU-enabled | sgs_param, sgs_param_init |
- | 0 | CUF/global=0; CUDA/device=7; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
sgs_stag_util.f90 |
SGS | GPU SGS model/stress construction and halos | Runtime hot path | default/core | GPU-enabled | sgs_stag_util, sgs_cuda_sync, sgs_cuda_barrier, sgs_event_record, sgs_event_elapsed_seconds, sgs_diag_time, +26 more |
CUDA_VISIBLE_DEVICES, LESGO_SGS_CALCSIJ_EXPLICIT, LESGO_SGS_HALO_COMBINED, LESGO_SGS_STAGE_TIMING, LESGO_SGS_STRICT_SYNC, MPICH_GPU_SUPPORT_ENABLED |
122 | CUF/global=33; CUDA/device=275; MPI=205 | Do not change SGS formulas while editing timing or halo paths; validate wall stress and the default half-channel SGS statistics. |
shifted_inflow.f90 |
Inflow | Shifted inflow support | Runtime | USE_SCALARS | GPU-enabled | shifted_inflow, shifted_inflow_cuda_sync, shifted_inflow_init, inflow_shifted |
- | 10 | CUF/global=3; CUDA/device=65; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
sim_param.f90 |
Configuration | Simulation parameters | Setup | default/core | GPU-enabled | sim_param, sim_param_init |
- | 0 | CUF/global=0; CUDA/device=1; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
sponge.f90 |
Boundary | GPU sponge damping loops | Runtime | default/core | GPU-enabled | sponge, sponge_cuda_sync, sponge_init, sponge_force |
- | 7 | CUF/global=2; CUDA/device=19; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
stability.f90 |
Stability | Stability helper routines | Runtime support | default/core | Review if runtime-active | stability, calc_phi, calc_psi_m, calc_psi_h |
- | 0 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
stat_defs.f90 |
Statistics | Statistics definitions | Runtime support | USE_TURBINES | Review if runtime-active | stat_defs |
- | 0 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
std_dynamic.f90 |
SGS | Standard dynamic SGS model support | Runtime | default/core | GPU-enabled | std_dynamic, std_dynamic_cuda_sync |
- | 13 | CUF/global=6; CUDA/device=37; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
string_util.f90 |
Utility | String parsing helpers | Setup | USE_HIT | Review if runtime-active | string_util, procedure, procedure, procedure, numtostr_r, numtostr_i, +36 more |
- | 3 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
test_filtermodule.f90 |
Filtering | GPU test-filter support for dynamic SGS | Runtime hot path | default/core | GPU-enabled | test_filtermodule, require_test_filter_cufft_success, test_filter_cuda_sync, test_filter_cuda_barrier, ensure_test_filter_cuda_plan, ensure_test_filter_cuda_many_plan, +10 more |
- | 8 | CUF/global=3; CUDA/device=102; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
time_average.f90 |
Statistics | Time averaging and output accumulation | Runtime/I/O | USE_ATM, USE_TURBINES | Review if runtime-active | time_average, init, compute, finalize, checkpoint, write_parallel_cgns, +1 more |
- | 20 | CUF/global=0; CUDA/device=0; MPI=39 | Audit before using in a new active configuration. |
trees_base_ls.f90 |
Level set | Tree geometry base data | Optional setup | USE_LVLSET | Review if runtime-active | trees_base_ls, cross_product, delta, grid_initialize, grid_nearest_of_pt, grid_of_pt, +2 more |
- | 1 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
trees_global_fmask_ls.f90 |
Level set | Tree/fmask preprocessing | Optional setup | USE_LVLSET | GPU-enabled | trees_global_fmask_ls, global_fmask_init, calc_global_fmask_ta, read_global_fmask, write_global_fmask, write_fmt_global_fmask, +5 more |
- | 34 | CUF/global=5; CUDA/device=21; MPI=7 | Runtime loops have GPU coverage; preserve device-resident data flow. |
trees_io_ls.f90 |
Level set | Tree I/O | Optional I/O | USE_LVLSET | Review if runtime-active | trees_io_ls, draw_cross_section, set_h, set_radius, write_tecplot_zone_hdr, draw_tree_array, +4 more |
- | 9 | CUF/global=0; CUDA/device=0; MPI=0 | Audit before using in a new active configuration. |
trees_pre_ls.f90 |
Level set | Tree preprocessing | Optional setup | USE_LVLSET | GPU-enabled | trees_pre_ls_mod, trees_pre_ls |
- | 13 | CUF/global=0; CUDA/device=3; MPI=13 | Runtime loops have GPU coverage; preserve device-resident data flow. |
trees_setup_ls.f90 |
Level set | Tree setup | Optional setup | USE_LVLSET | GPU-enabled | trees_setup_ls, sdistfcn_tree_array, dist_plate, dist_circle, dist_circle_b, dist_circle_c, +16 more |
- | 16 | CUF/global=1; CUDA/device=16; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
tridag_array.f90 |
Pressure | GPU tridiagonal solve and transpose-Thomas helper | Runtime hot path | default/core | GPU-enabled | tridag_transpose_gpu_mod, tridag_array, tridag_array_transpose_thomas_cuda, tridag_array_spike2_cuda, tridag_array_replicated_cuda, tridag_array |
CUDA_VISIBLE_DEVICES, LESGO_PRESS_DIRECT_THOMAS_OUT, LESGO_PRESS_TRANSPOSE_GENERIC, LESGO_PRESS_TRANSPOSE_TIMING, MPICH_GPU_SUPPORT_ENABLED |
89 | CUF/global=34; CUDA/device=308; MPI=158 | Pressure math and zero-mode handling are release-sensitive; validate divergence and wall stress after edits. |
turbine_indicator.f90 |
Turbines | Turbine indicator setup/runtime helpers | Optional setup/runtime | USE_TURBINES | GPU-enabled | turbine_indicator, val, init |
- | 12 | CUF/global=7; CUDA/device=40; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
turbines.f90 |
Turbines | Optional actuator disk/turbine routines | Optional runtime | USE_TURBINES | GPU-enabled | turbines, turbines_init, turbines_nodes, turbines_forcing, turbines_finalize, turbines_checkpoint, +3 more |
- | 35 | CUF/global=0; CUDA/device=5; MPI=31 | Runtime loops have GPU coverage; preserve device-resident data flow. |
turbines_gpu.f90 |
Turbines | GPU helper routines for optional turbine paths | Optional runtime | USE_TURBINES | GPU-enabled | turbines_gpu, turbines_interp_w_to_uv_gpu |
- | 5 | CUF/global=2; CUDA/device=8; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
types.f90 |
Shared types | Core precision and type definitions | Shared | default/core | CPU acceptable: setup/I/O | types |
- | 0 | CUF/global=0; CUDA/device=0; MPI=0 | CPU is acceptable unless this path becomes repeated inside the timestep. |
wallstress.f90 |
Wall model | GPU wall-stress related loops and diagnostics | Runtime | default/core | GPU-enabled | wallstress, wallstress_cuda_sync, ws_free_lbc, ws_free_ubc, ws_dns_lbc, ws_dns_ubc, +2 more |
- | 26 | CUF/global=9; CUDA/device=39; MPI=0 | Runtime loops have GPU coverage; preserve device-resident data flow. |
Procedure Inventory¶
actuator_turbine_model.f90¶
- Area: ATM
- Runtime relevance: Runtime hot path when USE_ATM
- GPU status: GPU-enabled
- Procedures/modules found:
actuator_turbine_model,atm_model_cuda_check,atm_model_cuda_sync,atm_initialize,atm_read_actuator_points,atm_read_restart,atm_write_restart,atm_initialize_output,atm_create_points,atm_update,atm_control_yaw,atm_computeRotorSpeed,atm_rotateBlades,atm_compute_cl_correction,atm_compute_cl_correction_gpu,s_fit,atm_calculate_variables,atm_computeBladeForce,atm_computeNacelleForce,atm_integrate_u,atm_yawNacelle,atm_output,atm_compute_power,atm_write_blade_points,atm_process_output - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
atm_base.f90¶
- Area: ATM
- Runtime relevance: Setup/runtime support
- GPU status: Review if runtime-active
- Procedures/modules found:
atm_base,error,interpolate,vector_add,vector_divide,vector_multiply,vector_mag,rotatePoint,matrix_vector,cross_product,distance - Retained switches: -
- Developer note: Audit before using in a new active configuration.
atm_input_util.f90¶
- Area: ATM
- Runtime relevance: I/O/setup
- GPU status: GPU-enabled
- Procedures/modules found:
atm_input_util,read_input_conf,read_turbine_model_variables,atm_print_initialize,read_airfoil,readline,eat_whitespace - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
atm_lesgo_interface.f90¶
- Area: ATM
- Runtime relevance: Runtime hot path when USE_ATM
- GPU status: GPU-enabled
- Procedures/modules found:
atm_lesgo_interface,atm_lb_auto_record,atm_diag_event_start,atm_diag_event_stop,atm_diag_event_flush,atm_cuda_check,atm_cuda_sync,atm_lesgo_reset_turbine_gpu,atm_interp_w_to_uv,atm_prepare_direct_w,atm_lesgo_apply_force_gpu,atm_lesgo_convolute_force_gpu_atpoint,atm_lesgo_force_gpu_atpoint,atm_lesgo_nacelle_force_gpu_atpoint,atm_point_owner_sample_turbine,atm_point_owner_force_turbine,atm_lb_ensure_targeted_buffers,atm_lb_refresh_targeted_sample_slots,atm_lb_targeted_velocity_exchange,atm_lb_pack_force_turbine,atm_lb_unpack_force_turbine,atm_point_owner_lb_gather_targeted,atm_point_owner_lb_force,atm_point_owner_lb_gather,atm_point_owner_lb_validate,atm_lesgo_initialize,atm_lesgo_finalize,atm_lesgo_report_timing,atm_lesgo_diag_load,atm_lesgo_lb_plan, +19 more - Retained switches:
LESGO_ATM_DIAG_TIMING,LESGO_ATM_LB_AUTO_SELECT,LESGO_ATM_LB_VALIDATE,LESGO_ATM_POINT_OWNER_LB,LESGO_ATM_POINT_OWNER_TARGETED - Developer note: Legacy ATM path is default; point-owner LB remains experimental and must validate force sums.
cfl_util.f90¶
- Area: Diagnostics
- Runtime relevance: Runtime diagnostic
- GPU status: GPU-enabled
- Procedures/modules found:
cfl_util,cfl_cuda_sync,get_max_cfl,get_cfl_dt - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
clocks.f90¶
- Area: Timing
- Runtime relevance: Runtime support
- GPU status: Review if runtime-active
- Procedures/modules found:
clock_m,start,stop - Retained switches: -
- Developer note: Audit before using in a new active configuration.
concurrent_precursor.f90¶
- Area: CPS
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
concurrent_precursor,initialize_cps,synchronize_cps,inflow_cps - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
convec.f90¶
- Area: Convection
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
convec_cuda_m,convec_cuda_impl,ensure_convec_cuda,padd_3d_dp,unpadd_3d_dp,check_convec_cuda,convec_cuda_sync,require_convec_cufft,convec - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
coriolis.f90¶
- Area: Forcing
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
coriolis,coriolis_cuda_sync,coriolis_init,coriolis_finalize,coriolis_calc - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
cuda_mpi_debug.f90¶
- Area: MPI/GPU debug
- Runtime relevance: Debug support
- GPU status: GPU-enabled
- Procedures/modules found:
cuda_mpi_debug,cuda_mpi_debug_init,print_env,cuda_pre,cuda_post,probe_real,probe_complex,mpi_dbg_sendrecv_r,mpi_dbg_sendrecv_c,mpi_dbg_send_r,mpi_dbg_recv_r - Retained switches:
LESGO_MPI_CUDA_DEBUG,LESGO_MPI_CUDA_SYNC - Developer note: Retained switch should be documented before changing behavior.
derivatives.f90¶
- Area: Derivatives
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
derivatives,filt_da_cuda,xy_derivs_cuda,stress_uv_xy_derivs_cuda,stress_w_xy_derivs_cuda,stress_uv_div_cuda,stress_w_div_cuda,filt_da_vel_cuda,ensure_filt_da_cuda_plan,ensure_filt_da_vel_cuda_plan,ensure_stress_uv_xy_cuda_plan,ensure_stress_w_xy_cuda_plan,derivatives_cuda_sync,require_filt_da_cufft_success,require_filt_da_cuda_success,stress_uv_xy_derivs,stress_w_xy_derivs,ddx,ddy,ddxy,filt_da,filt_da_vel,ddz_vel,ddz_uv,ddz_w - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
divstress_uv.f90¶
- Area: SGS/divstress
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
divstress_uv - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
divstress_w.f90¶
- Area: SGS/divstress
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
divstress_w - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
emul_complex.f90¶
- Area: Math utility
- Runtime relevance: Shared
- GPU status: Review if runtime-active
- Procedures/modules found:
emul_complex,procedure,procedure,procedure,mul_real_complex_imag_scalar,mul_real_complex_2D,mul_real_complex_imag_2D,mul_real_complex_real_2D - Retained switches: -
- Developer note: Audit before using in a new active configuration.
fft.f90¶
- Area: FFT
- Runtime relevance: Runtime support
- GPU status: Review if runtime-active
- Procedures/modules found:
fft,padd,unpadd,init_fft,init_wavenumber - Retained switches: -
- Developer note: Audit before using in a new active configuration.
finalize.f90¶
- Area: Finalization
- Runtime relevance: Finalization
- GPU status: Review if runtime-active
- Procedures/modules found:
finalize - Retained switches: -
- Developer note: Audit before using in a new active configuration.
forcing.f90¶
- Area: Forcing/projection
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
forcing,forcing_cuda_sync,forcing_random,forcing_applied,forcing_induced,project,project_sync_velocity_halos_cuda,project_sync_velocity_direct_halos_cuda,project_sync_velocity_direct_halos_overlap_cuda,project_stage_report,project_ensure_halo_buffers,project_pack_velocity_halos_cuda,project_unpack_velocity_halos_cuda - Retained switches:
LESGO_PROJECT_STAGE_TIMING - Developer note: Retained switch should be documented before changing behavior.
fringe.f90¶
- Area: Inflow
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
fringe,procedure,constructor - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
functions.f90¶
- Area: Utility
- Runtime relevance: Shared/runtime
- GPU status: GPU-enabled
- Procedures/modules found:
functions,procedure,procedure,procedure,procedure,procedure,procedure,tau_wall_cuda_sync,interp_to_uv_grid,interp_to_w_grid,bilinear_interp_sa_nocheck,bilinear_interp_sa,bilinear_interp_aa,linear_interp_sa_nocheck,linear_interp_sa,linear_interp_aa,cross_product,binary_search,get_tau_wall_bot,get_tau_wall_top,count_lines - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
grid.f90¶
- Area: Grid
- Runtime relevance: Runtime data
- GPU status: Review if runtime-active
- Procedures/modules found:
grid_m,build - Retained switches: -
- Developer note: Audit before using in a new active configuration.
hit_inflow.f90¶
- Area: HIT
- Runtime relevance: Optional
- GPU status: GPU-enabled
- Procedures/modules found:
hit_inflow,initialize_HIT,extract_HIT_data,compute_HIT_plane_data,inflow_HIT,hit_write_restart,hit_read_restart,interpolate3D - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
hit_inflow_gpu.f90¶
- Area: HIT
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
hit_inflow_gpu,hit_gpu_setup,hit_compute_plane_gpu,hit_apply_fringe_gpu - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
inflow.f90¶
- Area: Inflow
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
inflow,inflow_cuda_sync,inflow_init,apply_inflow,inflow_uniform - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
init_random_seed.f90¶
- Area: Initialization
- Runtime relevance: Setup
- GPU status: Review if runtime-active
- Procedures/modules found:
init_random_seed,lcg - Retained switches:
LESGO_RANDOM_SEED - Developer note: Retained switch should be documented before changing behavior.
initial.f90¶
- Area: Initialization
- Runtime relevance: Initialization
- GPU status: GPU-enabled
- Procedures/modules found:
initial,initial_cuda_sync,initial_cuda_touch_velocity,ic_uniform,check_for_interp,ic_file,ic_interp,ic_dns,ic_les - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
initialize.f90¶
- Area: Initialization
- Runtime relevance: Initialization
- GPU status: Review if runtime-active
- Procedures/modules found:
initialize - Retained switches: -
- Developer note: Audit before using in a new active configuration.
input_util.f90¶
- Area: Configuration
- Runtime relevance: I/O/setup
- GPU status: CPU acceptable: setup/I/O
- Procedures/modules found:
input_util,procedure,read_input_conf,domain_block,model_block,coriolis_block,time_block,flow_cond_block,output_block,level_set_block,turbines_block,scalars_block,checkentry,readline,parse_vector_real,parse_vector_point3D - Retained switches: -
- Developer note: CPU is acceptable unless this path becomes repeated inside the timestep.
interpolag_Sdep.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
interpolag_Sdep,interpolag_sdep_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
interpolag_Ssim.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
interpolag_Ssim,interpolag_ssim_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
io.f90¶
- Area: Output
- Runtime relevance: I/O
- GPU status: GPU-enabled
- Procedures/modules found:
io,openfiles,energy,write_tau_wall_bot,write_tau_wall_top,write_parallel_cgns,write_null_cgns,output_loop,inst_write,force_tot,checkpoint,output_final,output_init - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
iwmles.f90¶
- Area: Wall model
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
iwmles,iwm_cuda_sync,iwm_wallstress,iwm_init,iwm_finalize,iwm_calc_lhs,iwm_slv,iwm_calc_wallstress,iwm_monitor,iwm_checkPoint,iwm_read_checkPoint - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
lagrange_Sdep.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
lagrange_Sdep,lagrange_sdep_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
lagrange_Ssim.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
lagrange_Ssim,lagrange_ssim_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
level_set.f90¶
- Area: Level set
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
level_set,level_set_cuda_sync,level_set_init,level_set_vel_err,level_set_Cs_lag_dyn,level_set_lag_dyn,modify_beta,neumann_F_MM,mpi_sync_F_MM,zero_F_LM,modify_dutdn,extrap_tau_simple,extrap_tau_log,enforce_un,enforce_log_profile,interp_scal,interp_tij_u,fill_f,interp_tij_w,fill_f,interp_phi,interp_vel,smooth_tau,level_set_smooth_vel,smooth,level_set_global_CA,level_set_Cs,mpi_sync,mpi_sync_tau,level_set_BC, +11 more - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
level_set_base.f90¶
- Area: Level set
- Runtime relevance: Optional setup
- GPU status: GPU-enabled
- Procedures/modules found:
level_set_base,level_set_base_init - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
linear_simple.f90¶
- Area: Math utility
- Runtime relevance: Shared
- GPU status: Review if runtime-active
- Procedures/modules found:
linear_simple,procedure,solve_linear,assert_eq2,assert_eq3,assert_eq4,ludcmp,lubksb,outerprod,swap - Retained switches: -
- Developer note: Audit before using in a new active configuration.
main.f90¶
- Area: Main driver
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
main,main_cuda_sync - Retained switches:
LESGO_CPU_REF_TIME_FORCING,LESGO_CPU_REF_TIME_TOTAL - Developer note: Retained switch should be documented before changing behavior.
messages.f90¶
- Area: Messages
- Runtime relevance: Setup
- GPU status: CPU acceptable: setup/I/O
- Procedures/modules found:
messages,procedure,procedure,message_a,message_ai,message_aiai,message_aiar,message_al,message_aii,message_air,message_ai_array,message_aiai_array,message_ar,message_ar_array,message_aiar_array,warn,error_a,error_ai,error_ai_array,error_aia,error_aiai,error_aiar,error_arar,error_al,error_ar,error_ar_array - Retained switches: -
- Developer note: CPU is acceptable unless this path becomes repeated inside the timestep.
mpi_defs.f90¶
- Area: MPI
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
mpi_defs,initialize_mpi,bind_cuda_device,create_mpi_comms_cps,mpi_sync_real_array,sync_down,sync_up,sync_downup_nb - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
mpi_transpose_mod.f90¶
- Area: MPI transpose
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
mpi_transpose_mod,mpi_transpose - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
param.f90¶
- Area: Configuration
- Runtime relevance: Setup
- GPU status: CPU acceptable: setup/I/O
- Procedures/modules found:
param - Retained switches: -
- Developer note: CPU is acceptable unless this path becomes repeated inside the timestep.
param_output.f90¶
- Area: Output
- Runtime relevance: I/O/setup
- GPU status: CPU acceptable: setup/I/O
- Procedures/modules found:
param_output - Retained switches: -
- Developer note: CPU is acceptable unless this path becomes repeated inside the timestep.
pid.f90¶
- Area: Utility
- Runtime relevance: Setup
- GPU status: Review if runtime-active
- Procedures/modules found:
pid_m,procedure,constructor,advance_noset,advance_set - Retained switches: -
- Developer note: Audit before using in a new active configuration.
press_stag_array.f90¶
- Area: Pressure
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
press_stag_array,press_cuda_sync,press_queue_event_start,press_queue_event_stop,press_queue_report,press_rhs_halo_audit,press_rhs_halo_report,press_rhs_assembly_report,press_stage_report,press_pack_rhs_cuda,press_pack_rhs_halo_cuda,press_unpack_rhs_halo_cuda,press_pack_rhs_halo_combined_cuda,press_unpack_rhs_halo_combined_cuda,press_rhs_prep_cuda,press_assemble_rhs_cuda,press_assemble_rhs_range_cuda,press_zero_mode_cuda - Retained switches:
CUDA_VISIBLE_DEVICES,LESGO_PRESS_RHS_HALO_COMBINED,LESGO_PRESS_STAGE_TIMING,MPICH_GPU_SUPPORT_ENABLED - Developer note: Pressure math and zero-mode handling are release-sensitive; validate divergence and wall stress after edits.
rmsdiv.f90¶
- Area: Diagnostics
- Runtime relevance: Runtime diagnostic
- GPU status: GPU-enabled
- Procedures/modules found:
rmsdiv - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
scalars.f90¶
- Area: Scalars
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
scalars,scalars_cuda_sync,scalars_copy_rhs_gpu,scalars_advective_gpu,scalars_flux_gpu,scalars_rhs_theta_gpu,buoyancy_force_gpu,scalars_init,ic_scal,ic_scal_file,ic_scal_les,ic_scal_interp,scalars_checkpoint,scalars_deriv,obukhov,scalars_transport,to_big,buoyancy_force - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
scaledep_dynamic.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
scaledep_dynamic,scaledep_dynamic_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
sgs_param.f90¶
- Area: SGS
- Runtime relevance: Setup/runtime support
- GPU status: GPU-enabled
- Procedures/modules found:
sgs_param,sgs_param_init - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
sgs_stag_util.f90¶
- Area: SGS
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
sgs_stag_util,sgs_cuda_sync,sgs_cuda_barrier,sgs_event_record,sgs_event_elapsed_seconds,sgs_diag_time,sgs_calc_diag_begin,sgs_calc_diag_start,sgs_calc_diag_stop,sgs_calc_cpu_start,sgs_calc_cpu_stop,sgs_calc_set_zrange,sgs_calc_set_audit,sgs_tau_detail_begin,sgs_tau_detail_start,sgs_tau_detail_stop,sgs_tau_detail_add_bytes,sgs_tau_detail_add_msg,sgs_dwdz_detail_begin,sgs_dwdz_detail_start,sgs_dwdz_detail_stop,sgs_dwdz_detail_add_msg,sgs_dwdz_path_audit,sgs_pointer_env_audit,sgs_pointer_env_audit_device,sgs_stag,calc_Sij,calc_Sij_nut_dynamic_cuda,sgs_sync_dwdz_down_cuda,sgs_calc_sij_detail_report, +2 more - Retained switches:
CUDA_VISIBLE_DEVICES,LESGO_SGS_CALCSIJ_EXPLICIT,LESGO_SGS_HALO_COMBINED,LESGO_SGS_STAGE_TIMING,LESGO_SGS_STRICT_SYNC,MPICH_GPU_SUPPORT_ENABLED - Developer note: Do not change SGS formulas while editing timing or halo paths; validate tau/wall-stress behavior.
shifted_inflow.f90¶
- Area: Inflow
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
shifted_inflow,shifted_inflow_cuda_sync,shifted_inflow_init,inflow_shifted - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
sim_param.f90¶
- Area: Configuration
- Runtime relevance: Setup
- GPU status: GPU-enabled
- Procedures/modules found:
sim_param,sim_param_init - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
sponge.f90¶
- Area: Boundary
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
sponge,sponge_cuda_sync,sponge_init,sponge_force - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
stability.f90¶
- Area: Stability
- Runtime relevance: Runtime support
- GPU status: Review if runtime-active
- Procedures/modules found:
stability,calc_phi,calc_psi_m,calc_psi_h - Retained switches: -
- Developer note: Audit before using in a new active configuration.
stat_defs.f90¶
- Area: Statistics
- Runtime relevance: Runtime support
- GPU status: Review if runtime-active
- Procedures/modules found:
stat_defs - Retained switches: -
- Developer note: Audit before using in a new active configuration.
std_dynamic.f90¶
- Area: SGS
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
std_dynamic,std_dynamic_cuda_sync - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
string_util.f90¶
- Area: Utility
- Runtime relevance: Setup
- GPU status: Review if runtime-active
- Procedures/modules found:
string_util,procedure,procedure,procedure,numtostr_r,numtostr_i,eat_whitespace,uppercase,split_string,count_string_occur,string_concat_a,string_concat_r,string_concat_i,string_concat_ai,string_concat_ar,string_concat_aia,string_concat_ara,string_concat_aiaia,string_concat_arara,string_concat_aiai,string_concat_arar,string_concat_araia,string_concat_arai,string_concat_aiaiai,string_concat_ararar,string_concat_aiaiaia,string_concat_ararara,string_splice_aa,string_splice_ar,string_splice_ai, +12 more - Retained switches: -
- Developer note: Audit before using in a new active configuration.
test_filtermodule.f90¶
- Area: Filtering
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
test_filtermodule,require_test_filter_cufft_success,test_filter_cuda_sync,test_filter_cuda_barrier,ensure_test_filter_cuda_plan,ensure_test_filter_cuda_many_plan,apply_test_filter_cuda,apply_test_filter_cuda_3,apply_test_filter_cuda_6,test_filter_init,test_filter,test_filter_3,test_filter_6,test_test_filter,test_test_filter_3,test_test_filter_6 - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
time_average.f90¶
- Area: Statistics
- Runtime relevance: Runtime/I/O
- GPU status: Review if runtime-active
- Procedures/modules found:
time_average,init,compute,finalize,checkpoint,write_parallel_cgns,write_null_cgns - Retained switches: -
- Developer note: Audit before using in a new active configuration.
trees_base_ls.f90¶
- Area: Level set
- Runtime relevance: Optional setup
- GPU status: Review if runtime-active
- Procedures/modules found:
trees_base_ls,cross_product,delta,grid_initialize,grid_nearest_of_pt,grid_of_pt,mag,pt_of_grid - Retained switches: -
- Developer note: Audit before using in a new active configuration.
trees_global_fmask_ls.f90¶
- Area: Level set
- Runtime relevance: Optional setup
- GPU status: GPU-enabled
- Procedures/modules found:
trees_global_fmask_ls,global_fmask_init,calc_global_fmask_ta,read_global_fmask,write_global_fmask,write_fmt_global_fmask,calc_global_fmask,filter_global_fmask,calc_global_fmask_gpu,filter_global_fmask_gpu,truncgauss_kernel_3d - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
trees_io_ls.f90¶
- Area: Level set
- Runtime relevance: Optional I/O
- GPU status: Review if runtime-active
- Procedures/modules found:
trees_io_ls,draw_cross_section,set_h,set_radius,write_tecplot_zone_hdr,draw_tree_array,read_tree_data,read_ta_data,write_tree_data,write_ta_data - Retained switches: -
- Developer note: Audit before using in a new active configuration.
trees_pre_ls.f90¶
- Area: Level set
- Runtime relevance: Optional setup
- GPU status: GPU-enabled
- Procedures/modules found:
trees_pre_ls_mod,trees_pre_ls - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
trees_setup_ls.f90¶
- Area: Level set
- Runtime relevance: Optional setup
- GPU status: GPU-enabled
- Procedures/modules found:
trees_setup_ls,sdistfcn_tree_array,dist_plate,dist_circle,dist_circle_b,dist_circle_c,dist_circle_bc,dist_circle_nobc,dist_square,dist_square_b,dist_square_c,dist_square_bc,dist_square_nobc,sdistfcn_branch_square_gpu,fill_tree_array,read_trees_conf,case_n_tree,case_tree,init_tree,cartesian_correction,heightwidth_bbox_br,set_zone_branch - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
tridag_array.f90¶
- Area: Pressure
- Runtime relevance: Runtime hot path
- GPU status: GPU-enabled
- Procedures/modules found:
tridag_transpose_gpu_mod,tridag_array,tridag_array_transpose_thomas_cuda,tridag_array_spike2_cuda,tridag_array_replicated_cuda,tridag_array - Retained switches:
CUDA_VISIBLE_DEVICES,LESGO_PRESS_DIRECT_THOMAS_OUT,LESGO_PRESS_TRANSPOSE_GENERIC,LESGO_PRESS_TRANSPOSE_TIMING,MPICH_GPU_SUPPORT_ENABLED - Developer note: Pressure math and zero-mode handling are release-sensitive; validate divergence and wall stress after edits.
turbine_indicator.f90¶
- Area: Turbines
- Runtime relevance: Optional setup/runtime
- GPU status: GPU-enabled
- Procedures/modules found:
turbine_indicator,val,init - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
turbines.f90¶
- Area: Turbines
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
turbines,turbines_init,turbines_nodes,turbines_forcing,turbines_finalize,turbines_checkpoint,turbine_vel_init,place_turbines,read_control_files - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
turbines_gpu.f90¶
- Area: Turbines
- Runtime relevance: Optional runtime
- GPU status: GPU-enabled
- Procedures/modules found:
turbines_gpu,turbines_interp_w_to_uv_gpu - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.
types.f90¶
- Area: Shared types
- Runtime relevance: Shared
- GPU status: CPU acceptable: setup/I/O
- Procedures/modules found:
types - Retained switches: -
- Developer note: CPU is acceptable unless this path becomes repeated inside the timestep.
wallstress.f90¶
- Area: Wall model
- Runtime relevance: Runtime
- GPU status: GPU-enabled
- Procedures/modules found:
wallstress,wallstress_cuda_sync,ws_free_lbc,ws_free_ubc,ws_dns_lbc,ws_dns_ubc,ws_equilibrium_lbc,ws_equilibrium_ubc - Retained switches: -
- Developer note: Runtime loops have GPU coverage; preserve device-resident data flow.