Add test for writing and reading Legendre polynomials from disk by samhatfield · Pull Request #413 · ecmwf-ifs/ectrans

samhatfield · 2026-06-09T13:12:51Z

Before merging #409 I want to add a test for doing a direct transform with Legendre polynomials read from disk. First, I want to check whether we can call SETUP_TRANS with disk-read polynomials. This PR adds a test which calls SETUP_TRANS once to write the polynomials to disk and then again in the same instance to read the polynomials from the same file. However, the test currently segfaults, which indicates either a bug in the code or improper use. @wdeconinck can you see anything wrong with how I'm calling SETUP_TRANS?

Contributor Declaration

By opening this pull request, I affirm the following:

All authors agree to the Contributor License Agreement.
The code follows the project's coding standards.
I have performed self-review and added comments where needed.
I have added or updated tests to verify that my changes are effective and functional.
I have run all existing tests and confirmed they pass.

wdeconinck · 2026-06-10T15:08:25Z

This read/write capability was originally added for serial postprocessing at operational (high) resolution, using the following stack hierarchy:
MARS -> MIR -> Atlas -> transi -> trans
The Legendre coefficients could then be cached, or precomputed. For operational resolution these are quite big, and computing them in serial is too expensive. In our parallel IFS context the recomputation is not an issue, which makes the caching an extra complexity.
All this dates from the time when trans was still part of the IFS and transi was a standalone repository.
For that reason I created a unit test in transi to exercise this: https://github.com/ecmwf-ifs/ectrans/blob/develop/tests/transi/transi_test_io.c
It may serve as a hint on how to do this with Fortran alone.
Although we don't really exercise this in any Fortran context, it is still a useful Fortran test to add.
Note that all of this applies to a serial context (NPROC==1) only.

samhatfield · 2026-06-11T10:50:45Z

It was just a simple precision bug: the writer and reader always assumed 8-byte reals, so the single-precision tests failed. I've fixed this by setting the type of the polynomials based on JPRB and embedding it in the legpol file metadata.
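
A minimal Python sketch of the idea behind this fix, not the Fortran in WRITE_LEGPOL/READ_LEGPOL: the writer records the size of a real in bytes ahead of the data, and the reader refuses a file whose precision differs from what it expects. The function names and the one-field header are hypothetical and chosen only for illustration.

```python
import struct

# Map from real size in bytes to the struct format character
REAL_FORMATS = {4: "f", 8: "d"}


def write_reals(values, real_bytes):
    """Serialise reals, prefixed by a 4-byte integer holding their element size."""
    fmt = REAL_FORMATS[real_bytes]
    header = struct.pack("<i", real_bytes)
    body = struct.pack(f"<{len(values)}{fmt}", *values)
    return header + body


def read_reals(blob, expected_real_bytes):
    """Read the element size first and reject a precision mismatch."""
    (real_bytes,) = struct.unpack_from("<i", blob, 0)
    if real_bytes != expected_real_bytes:
        raise ValueError(
            f"file holds {real_bytes}-byte reals, reader expects {expected_real_bytes}"
        )
    fmt = REAL_FORMATS[real_bytes]
    count = (len(blob) - 4) // real_bytes
    return list(struct.unpack_from(f"<{count}{fmt}", blob, 4))
```

Without the size field, a reader that assumes 8-byte reals would request twice as many bytes as a single-precision writer produced, which is the kind of mismatch described above.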

Copilot

Pull request overview

This PR adds an API test intended to validate that SETUP_TRANS can (1) write Legendre polynomials to disk and (2) re-initialize in the same process reading those polynomials back from the same file. To support this, it also extends the Legendre polynomial on-disk header with the real-element byte size (IRBYTES) in both CPU and GPU implementations.

Changes:

Add a new setup_trans API test that writes Legendre polynomials to disk and then reads them back in a second SETUP_TRANS call.
Extend Legendre polynomial file headers (CPU/GPU) to include IRBYTES and use it when sizing binary reads/writes.
Update CMake test list/excludes to register the new test and skip it for MPI>0 configurations.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| tests/trans/api/setup_trans/setup_trans_test_suite.F90 | Adds the new write/read Legendre polynomial test case. |
| tests/trans/api/setup_trans/CMakeLists.txt | Registers the new test and excludes it for MPI configurations. |
| src/trans/gpu/internal/write_legpol_mod.F90 | Writes extended Legendre header including `IRBYTES` (GPU path). |
| src/trans/gpu/internal/read_legpol_mod.F90 | Reads extended Legendre header including `IRBYTES` (GPU path). |
| src/trans/cpu/internal/write_legpol_mod.F90 | Writes extended Legendre header including `IRBYTES` (CPU path). |
| src/trans/cpu/internal/read_legpol_mod.F90 | Reads extended Legendre header including `IRBYTES` (CPU path). |

wdeconinck · 2026-06-14T11:30:45Z

Because of the extra entry in HEADER, would you say that files generated before this PR can no longer be read in?
If it is no longer backwards compatible, I would reserve some extra padding bytes in the header that could be filled later.
I would also add a version number which can be checked for compatibility.
It is important to try to stay backwards compatible in future, so that caching mechanisms don't need to adapt.
It is probably OK to break backwards compatibility at this time because MARS/MIR has moved away from using this, in favour of an Atlas-serial implementation for arbitrary grids.

samhatfield · 2026-06-23T15:57:29Z

> Because of the extra entry in HEADER, would you say that files generated before this PR can no longer be read in? If it is no longer backwards compatible, I would reserve some extra padding bytes in the header that could be filled later. I would also add a version number which can be checked for compatibility. It is important to try to stay backwards compatible in future, so that caching mechanisms don't need to adapt. It is probably OK to break backwards compatibility at this time because MARS/MIR has moved away from using this, in favour of an Atlas-serial implementation for arbitrary grids.

Correct, they're no longer backwards compatible. I've added 4 bytes to the header for storing the packed version integer, and two extra integers in case of future changes. Look good to you?

wdeconinck · 2026-06-23T23:46:53Z

Hi @samhatfield, what I meant by version is the version of the file format, not necessarily the ectrans version, although that is also good to know. The file format version only gets bumped when we actually change something in the file format. We can set it to 1 now, treating 0 as the previous version.

If we want to design this really well, and we have the chance now, I suggest the header contains this:

<"ECTRANS_LEGPOL  ":16*char> <BOM:int32> <file_format_version:int32> <ectrans_version:int32> <polynomial_type:8*char> <spectral_truncation:int32> <NGauss:int32> <precision_bytes:int32> <padding:x*bytes>

16 bytes; First a string that we can check to confirm we actually have the expected file format
4 bytes; The BOM is a Byte-Order-Marker to detect endianness of the written data. Typically it can be written to the file with an int32 with hexadecimal value:
```
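use, intrinsic :: iso_fortran_env, only: int32
integer(int32) :: BOM
! A BOZ literal is only standard-conforming as an argument to intrinsics such as INT
BOM = int(z'12345678', int32)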
```
If the BOM reads back as z'78563412', the data was written with the opposite (non-native) endianness.
4 bytes; The version of this file format. Any future change to the layout may only affect what comes after this field. We can set this to 1 now, as in version 1.
4 bytes; Version of ectrans used (useful but not crucial)
8 bytes; polynomial type
4 bytes; spectral truncation
4 bytes; gaussian number
4 bytes; precision bytes
X*bytes; padding

The ordering of 4..8 does not matter, but what's here and was already here is a good choice.
We can choose now to have the padding (X*bytes) arbitrarily large. We should not try to make the header very small as the data that comes after is quite huge. For instance with X=16, the header in total will be 64 bytes.
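
A small Python sketch of the two points above, under the assumption that the marker is stored as a plain 4-byte integer: how a reader could tell which byte order the BOM was written in, and how the field sizes add up to 64 bytes when X=16. The helper name `detect_byte_order` is hypothetical.

```python
import struct

BOM = 0x12345678


def detect_byte_order(raw4):
    """Return '<' (little-endian) or '>' (big-endian) for a 4-byte marker."""
    for order in ("<", ">"):
        if struct.unpack(order + "i", raw4)[0] == BOM:
            return order
    raise ValueError("not a recognised byte-order marker")


# Field sizes of the proposed header, in order:
# magic string, BOM, format version, ectrans version, polynomial type,
# spectral truncation, NGauss, precision bytes, padding (X = 16)
FIELD_BYTES = [16, 4, 4, 4, 8, 4, 4, 4, 16]
HEADER_BYTES = sum(FIELD_BYTES)

# A marker written little-endian but decoded big-endian shows up byte-swapped
swapped = struct.unpack(">i", struct.pack("<i", BOM))[0]
print(hex(swapped), HEADER_BYTES)
```

Reading the marker with the wrong byte order yields 0x78563412, which is the value a reader would compare against to detect a non-native file.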

Finally, after all the data is written, I'd also add another string, "ECTRANS_LEGPOL_END"

Then, when reading the file back in, we need to add assertions:

file format matches "ECTRANS_LEGPOL"
endianness is as expected (native)
version is as expected

Then we can further read in the rest safely.

Finally verify that we encounter "ECTRANS_LEGPOL_END"
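
The checks listed above could be sketched as follows in Python. This is an illustration of the proposed layout only, not the ecTrans implementation: the function names are hypothetical, the padding is fixed at 16 bytes, and the reader is simply told how many data bytes to expect, whereas a real reader would derive that from the header fields.

```python
import io
import struct

MAGIC = b"ECTRANS_LEGPOL  "        # 16 characters, space padded
END_MARKER = b"ECTRANS_LEGPOL_END"
BOM = 0x12345678
FORMAT_VERSION = 1
# magic, BOM, format version, ectrans version, polynomial type,
# truncation, NGauss, precision bytes, 16 bytes of padding (native byte order)
HEADER = struct.Struct("=16siii8siii16x")


def write_legpol(stream, ectrans_version, poly_type, truncation, ngauss,
                 real_bytes, data):
    """Write header, raw data bytes, then the closing marker."""
    stream.write(HEADER.pack(MAGIC, BOM, FORMAT_VERSION, ectrans_version,
                             poly_type, truncation, ngauss, real_bytes))
    stream.write(data)
    stream.write(END_MARKER)


def read_legpol(stream, data_bytes):
    """Validate magic string, endianness and version before reading the data."""
    raw = stream.read(HEADER.size)
    if len(raw) != HEADER.size:
        raise ValueError("file too short to contain a header")
    (magic, bom, version, _ectrans_version,
     poly_type, truncation, ngauss, real_bytes) = HEADER.unpack(raw)
    if magic != MAGIC:
        raise ValueError("not an ECTRANS_LEGPOL file")
    if bom != BOM:
        raise ValueError("file was not written with native endianness")
    if version != FORMAT_VERSION:
        raise ValueError(f"unsupported file format version {version}")
    data = stream.read(data_bytes)
    if stream.read(len(END_MARKER)) != END_MARKER:
        raise ValueError("end marker missing: file incomplete or corrupt")
    return poly_type, truncation, ngauss, real_bytes, data
```

Checking the magic string, BOM and version before touching the data means a wrong or truncated file fails with a clear message rather than a crash deeper in the read.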

samhatfield · 2026-06-24T13:10:48Z

All good ideas. Let me take a look.

samhatfield · 2026-06-24T15:20:58Z

@wdeconinck the header now looks like the following:

! Layout:
! 1. 20 bytes: the string "ECTRANS_LEGPOL_START" indicating the start of the header
! 2. 4 bytes: byte-order-marker indicating endianness of this platform
! 3. 4 bytes: version of the polynomial file
! 4. 4 bytes: version of ecTrans as packed integer
! 5. 8 bytes: polynomial type (one of the strings "LEGPOLBF" or "LEGPOL  ")
! 6. 4 bytes: spectral truncation
! 7. 4 bytes: number of northern latitudes
! 8. 4 bytes: size of real numbers in bytes
! 9. 32 bytes: padding reserved in case of future use
! 10. 20 bytes: the string "ECTRANS_LEGPOL_FINAL"
! Total: 104 bytes

I'm not that familiar with BOZ literals in Fortran so if you could take a look at how I've handled the BOM and whether I've done it correctly, that would be appreciated...
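
As a cross-check of the layout listed above, here is a short Python sketch that derives each field's byte offset and the 104-byte total from the field sizes. The field names are labels invented for this illustration and do not come from the Fortran source.

```python
import struct

# (label, struct format) for each entry of the layout, in order
LAYOUT = [
    ("start_marker", "20s"),     # "ECTRANS_LEGPOL_START"
    ("bom", "i"),
    ("file_version", "i"),
    ("ectrans_version", "i"),
    ("poly_type", "8s"),         # "LEGPOLBF" or "LEGPOL  "
    ("truncation", "i"),
    ("n_north_lats", "i"),
    ("real_bytes", "i"),
    ("padding", "32x"),
    ("end_marker", "20s"),       # "ECTRANS_LEGPOL_FINAL"
]

# Accumulate offsets; "=" selects standard sizes with no alignment padding
offsets = {}
position = 0
for label, fmt in LAYOUT:
    offsets[label] = position
    position += struct.calcsize("=" + fmt)
total_bytes = position

for label, offset in offsets.items():
    print(f"{offset:4d}  {label}")
print("total:", total_bytes)
```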

wdeconinck · 2026-06-25T08:31:56Z

It's nice to have a HEADER_END marker; it does not necessarily have to be a large string.
I would use ECTRANS_LEGPOL_FINAL at the very end of the file (after writing all the actual data). This can be used to check whether a file is incomplete.
It would also be nice, if possible, to add to the header the number of bytes in the data section, from just after the HEADER_END section all the way up to (not including) ECTRANS_LEGPOL_FINAL. So some kind of precomputation of the expected bytes...
That could make it possible to read all the data into memory in advance, and also to jump ahead in the file to the ECTRANS_LEGPOL_FINAL string to verify that the file is complete.

samhatfield · 2026-06-25T09:55:19Z

> It's nice to have a HEADER_END marker; it does not necessarily have to be a large string. I would use ECTRANS_LEGPOL_FINAL at the very end of the file (after writing all the actual data); This can be used to check if a file was incomplete.

There's already an end marker 'LEGPOL---EOF-EOF' which is checked in READ_LEGPOL.

> What would also be nice, if possible to add to the header is the number of bytes in the data section, following the HEADER_END section all the way up to (not including) ECTRANS_LEGPOL_FINAL. So some kind of precomputation of expected bytes... That could make it possible to read all the data into memory in advance, and also allow to file-jump to the ECTRANS_LEGPOL_FINAL string to verify the file is complete.

I've added this functionality to WRITE_LEGPOL, but it isn't currently used in READ_LEGPOL. I'm not sure how to skip ahead by a prescribed number of bytes like you suggest. Does BYTES_IO_WRITE increment an offset from the start of the file? Do I just need to read a dummy buffer of NBYTES and then check that the next n bytes matches the expected end marker ('LEGPOL---EOF-EOF')? I suppose I would then need to rewind to finish reading the header.
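
For illustration, the skip-ahead check could look like this in Python, using an ordinary seekable stream. Whether the BYTES_IO routines offer an equivalent seek is not assumed here; the function name and arguments are hypothetical.

```python
import io

END_MARKER = b"LEGPOL---EOF-EOF"


def file_is_complete(stream, header_bytes, data_bytes):
    """Jump past the data section, check the end marker, then restore the position."""
    resume_at = stream.tell()
    # Absolute offset from the start of the file, so no dummy read is needed
    stream.seek(header_bytes + data_bytes)
    complete = stream.read(len(END_MARKER)) == END_MARKER
    # Return to where the caller was, e.g. partway through the header
    stream.seek(resume_at)
    return complete
```

With a seek available, there is no need to read a dummy buffer of NBYTES; the reader records its current position, checks the marker and carries on from where it left off.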

wdeconinck · 2026-06-25T10:01:30Z

Great that the EOF marker is already there. The capability to rewind or skip is not necessary now, but it is good to have in the file format if possible.

samhatfield · 2026-06-25T10:32:51Z

I checked whether the precomputed file size matches the actual size for the test, and it does:

Single precision
Expected file size = 112 (header) + 1063040 (body) + 16 (end marker) = 1063168 bytes. Matches actually-written polynomial file.

Double precision
Expected file size = 112 (header) + 2125440 (body) + 16 (end marker) = 2125568 bytes. Matches actually-written polynomial file.
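
The arithmetic above can be reproduced with a few lines of Python. The constants are the header and end-marker sizes quoted in this comment; the helper names and the on-disk comparison via `os.path.getsize` are only a sketch of how such a check might be automated.

```python
import os

HEADER_BYTES = 112       # header size quoted above
END_MARKER_BYTES = 16    # length of the end marker quoted above


def expected_file_size(body_bytes):
    """Total size of a polynomial file with the given data-section size."""
    return HEADER_BYTES + body_bytes + END_MARKER_BYTES


def size_matches(path, body_bytes):
    """Compare the size of the file on disk with the precomputed expectation."""
    return os.path.getsize(path) == expected_file_size(body_bytes)


print(expected_file_size(1063040))  # single precision body
print(expected_file_size(2125440))  # double precision body
```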

samhatfield added 4 commits June 9, 2026 13:10

Add test for writing and reading Legendre polynomials from disk

8f36946

Exclude legpoly read/write test for nproc > 1

52eecab

Remove NPROC > 1 logic from legpoly read/write test

9329130

Fix abor1 include

c5d1d66

samhatfield added 2 commits June 11, 2026 10:48

Enforce MPI=off for test

6bd3b7a

Embed type of polynomials in header

659ea36

samhatfield added 2 commits June 11, 2026 10:51

Disable mpi1 legpol test

7f8a96b

Also fix legpol reading/writing for GPU

2c5d541

samhatfield requested a review from Copilot June 11, 2026 12:14

samhatfield marked this pull request as ready for review June 11, 2026 12:14

Copilot started reviewing on behalf of samhatfield June 11, 2026 12:14

samhatfield requested a review from wdeconinck June 11, 2026 12:14

github-actions Bot assigned marsdeno Jun 11, 2026

github-actions Bot requested a review from marsdeno June 11, 2026 12:14

Copilot AI reviewed Jun 11, 2026

samhatfield and others added 6 commits June 11, 2026 13:24

Enforce build-time precision matches legpol file precision

81f71bf

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Fix type bug

3f87d33

Use header buffer correctly

ae201ba

Properly teardown OpenMP version

0e224bd

Fix potential string overflow bug

1a548c7

Add missing character

15a8458

Add version number to polys header

6a7728e

samhatfield force-pushed the feat/add_legpoly_read_test branch from 6fb498d to 6a7728e Compare June 23, 2026 15:57

Improve legpoly file header

f11a392

Properly cast BOZ literal to int

92581d1

samhatfield force-pushed the feat/add_legpoly_read_test branch from ad52119 to 92581d1 Compare June 24, 2026 15:06

samhatfield added 2 commits June 25, 2026 10:10

Fix indentation

ff33ad4

Add total legpoly data size to header

be35eb6

Fix wrong header size bug

2b2f668
