
Batched C14 ML Inference #42

Merged

Mikolaj-A-Kowalski merged 1 commit into CLUBB_ML from 30-batched-inference on Apr 29, 2026

Conversation

@Mikolaj-A-Kowalski
Collaborator

This PR is stacked on top of #39 and #40.

Closes #30 and #35

It is not great in its current form since it suffers from two defects:

  • We can only batch all cells at once. This requires a memory buffer of size 6 * nz * ngrid, so basically like storing 6 extra fields. It is a lot.
  • We transpose the data in Fortran when loading into the buffer. The resulting memory layout is not ideal, with a large stride between inputs belonging to the same index. It appears fine at the moment, but that may not remain the case as the number of columns grows (see the sketch after this list).
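
For illustration, a minimal sketch of the packing step described above; the names (pack_batch, x, nz, ngrid) are placeholders, not the actual routine in this PR:

```fortran
! Hypothetical sketch of the batching buffer: all (nz, ngrid) cells are
! packed into one batch of size nz*ngrid, transposed so that the batch
! index becomes the first (contiguous) Fortran dimension.
subroutine pack_batch(nz, ngrid, x, buffer)
  integer, intent(in) :: nz, ngrid
  real, intent(in)    :: x(6, nz, ngrid)      ! 6 ML inputs per cell
  real, intent(out)   :: buffer(nz*ngrid, 6)  ! batched layout for the net
  integer :: i, k, b

  do i = 1, ngrid
    do k = 1, nz
      b = (i - 1)*nz + k
      ! Transpose on the Fortran side: the 6 inputs of one cell end up
      ! nz*ngrid elements apart in memory, i.e. the large stride noted above.
      buffer(b, :) = x(:, k, i)
    end do
  end do
end subroutine pack_batch
```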

In discussion with @jatkinson1000 we decided to 'kick the can down the road' when it comes to reducing the buffer space. We will address it when we need it.

Memory layout could be improved, but permuting a torch_tensor on construction or after construction is a bit clunky ATM (I am punished for not merging Cambridge-ICCS/FTorch#423 😅). I will poke a bit more to see what it would look like, so we can choose between potentially better performance and a 'hacky' implementation.

The speedup from batching is significant. On a single-column model with BOMEX we are talking ~10x (from ~0.5 s to ~0.05 s).

Comment thread on src/CLUBB_core/advance_xp2_xpyp_module.F90 (outdated)
@Mikolaj-A-Kowalski
Collaborator Author

Mikolaj-A-Kowalski commented Apr 22, 2026

Memory layout could be improved, but permuting a torch_tensor on construction or after construction is a bit clunky ATM (I am punished for not merging Cambridge-ICCS/FTorch#423 😅). I will poke a bit more to see what it would look like, so we can choose between potentially better performance and a 'hacky' implementation.

See 77227e6

As indicated in the commit message, I am not sure it is legal. It does work on ifx 2023.2.4, though.

EDIT: I think it is perfectly legal now...
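
To illustrate what 'permuting on construction' could mean here, below is a minimal hypothetical sketch that hands FTorch the un-transposed Fortran array and expresses the permutation through the layout argument of torch_tensor_from_array. Whether this usage maps cleanly onto the current FTorch API is exactly the open question above, so treat the signatures and semantics as assumptions, not the code in 77227e6:

```fortran
program permuted_tensor_sketch
  use ftorch
  implicit none
  integer, parameter :: nin = 6, nbatch = 1000
  ! Natural Fortran layout: the inputs of one cell stay contiguous,
  ! so no Fortran-side transpose/copy is needed.
  real, target :: raw(nin, nbatch)
  ! Assumed semantics: map Fortran dim 2 -> tensor dim 1 (batch) and
  ! dim 1 -> tensor dim 2 (feature), so the net sees shape (nbatch, nin).
  integer, parameter :: layout(2) = [2, 1]
  type(torch_tensor) :: in_tensor

  raw = 0.0
  call torch_tensor_from_array(in_tensor, raw, layout, torch_kCPU)
  call torch_tensor_delete(in_tensor)
end program permuted_tensor_sketch
```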

Collaborator

@vopikamm left a comment


LGTM! I agree with tackling the buffer space issue when we need to.

@jatkinson1000 linked two issues Apr 24, 2026 that may be closed by this pull request
Comment thread on src/CLUBB_core/advance_xp2_xpyp_module.F90 (outdated)
All the cells in the problem are batched into a single forward model
evaluation. The input data is also transposed on the Fortran side, which
results in an inefficient memory layout.

In this form we require quite a large memory buffer. The non-optimal
memory layout probably has little effect in a single-column model, but
may become significant in larger problems.

Batching does offer a significant advantage over the 'loop' approach,
though. On a single-column BOMEX test case we observe a 10x speedup in
ML inference time.
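
For context, a minimal sketch of what the single batched forward call might look like with FTorch. The names (nbatch, nout, buffer, 'net.pt') are placeholders and the exact call signatures are assumptions based on the documented FTorch API, not the code from this PR:

```fortran
program batched_forward_sketch
  use ftorch
  implicit none
  integer, parameter :: nbatch = 1000, nin = 6, nout = 1
  integer, parameter :: layout(2) = [1, 2]
  real, target :: buffer(nbatch, nin)   ! packed inputs for all cells
  real, target :: result(nbatch, nout)  ! outputs for all cells
  type(torch_model)  :: model
  type(torch_tensor) :: in_t(1), out_t(1)

  buffer = 0.0
  call torch_model_load(model, 'net.pt', torch_kCPU)
  call torch_tensor_from_array(in_t(1), buffer, layout, torch_kCPU)
  call torch_tensor_from_array(out_t(1), result, layout, torch_kCPU)

  ! One forward evaluation covering all cells, replacing the per-cell loop.
  call torch_model_forward(model, in_t, out_t)

  call torch_tensor_delete(in_t(1))
  call torch_tensor_delete(out_t(1))
  call torch_model_delete(model)
end program batched_forward_sketch
```

The ~10x figure quoted above comes from replacing many per-cell forward calls with one call of this shape, which amortises the inference overhead across the whole batch.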
@Mikolaj-A-Kowalski merged commit b4acdc4 into CLUBB_ML Apr 29, 2026
@jatkinson1000 deleted the 30-batched-inference branch Apr 29, 2026 09:10

Development

Successfully merging this pull request may close these issues.

  • Convert net to run using batched input
  • Implement batching for C14
