SeisSol / PUMGen

Mesh generation for SeisSol
BSD 3-Clause "New" or "Revised" License

Towards Large Meshes #59

Closed by davschneller 10 months ago

davschneller commented 1 year ago

As stated, this PR adds large mesh support; more precisely, it corrects the few remaining spots where that was not yet the case (SimModeler should have been able to handle large meshes already). For large meshes to work, this PR requires a SCOREC/PUMI version newer than v2.2.7; at the moment, only building directly from master seems to work (i.e. a version which would not have worked before #58). Older PUMI versions are still supported, and PUMGen should work as usual (cf. #58), but they will likely not support larger meshes, due to the element data type in APF.

Some additional features:

Some of the older mesh formats may break with this update and may be removed sometime soon.

davschneller commented 1 year ago

Thanks, that sounds great!

About the compression, a few more notes: using any of the deflate compression options requires HDF5 to be compiled with deflate support (both for writing and for reading). I'd guess that most clusters have it enabled, but if something throws an error, that could be the reason.

Other than that, a small note regarding the compression: I tried h5repack with deflate at strength 1 some time ago, and, as mentioned, it reduced the file (including the other meshes) to roughly the size of the Zenodo download for the SC21 paper. In particular, the group and boundary data compressed exceptionally well. Here's the command I used:

```
h5repack -l connect:CHUNK=8192x4 -l geometry:CHUNK=8192x3 -l group:CHUNK=8192 -l boundary:CHUNK=8192 \
  -f boundary:GZIP=1 -f group:GZIP=1 -f geometry:GZIP=1 -f connect:GZIP=1 infile.h5 compressed-outfile.h5
```
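To see why the group and boundary datasets compress so well, here is a small plain-Python sketch (using zlib, which is the same deflate algorithm GZIP=1 selects; this is an illustration, not HDF5 or PUMGen code). The group array is mocked up as a hypothetical dataset where most cells share a handful of group IDs, which makes the byte stream extremely repetitive:

```python
import struct
import zlib

# Hypothetical stand-in for a "group" dataset: one integer per cell,
# where cells fall into only a few groups. Long runs of identical
# values are exactly what deflate compresses best.
num_cells = 100_000
groups = [i // 25_000 for i in range(num_cells)]  # four long constant runs

raw = struct.pack(f"<{num_cells}q", *groups)  # stored as 8-byte integers
compressed = zlib.compress(raw, 1)            # deflate strength 1, as in GZIP=1

print(f"raw: {len(raw)} bytes, compressed: {len(compressed)} bytes")
print(f"ratio: {len(raw) / len(compressed):.0f}x")
```

Real group/boundary data is less uniform than this toy array, but the effect points in the same direction: low-entropy integer datasets shrink by orders of magnitude even at the weakest deflate level.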

One more thing to add: the scaleoffset compression does not combine well with the compactify-datatypes option, since it only accepts 1-, 2-, 4-, or 8-byte data types. But scaleoffset is also the weakest of the compression options.

compactify-datatypes itself is not a compression option per se (meaning it does not require HDF5 to be compiled with any filter support, nor is a filter recorded in the file itself). It merely cuts off unused bytes in integers. For example: if a mesh has, say, 7 million nodes/cells, then all connectivity indices fit into a 3-byte integer (which can hold up to roughly 16 million values, or a bit over 8 million if signed). The compactify-datatypes option then stores the data as 3-byte integers instead of the usual 8 bytes.
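To make the 3-byte example concrete, here is a short Python sketch (not PUMGen code; the function name is made up for illustration) of the width computation such an option has to do:

```python
def minimal_byte_width(max_value: int, signed: bool = False) -> int:
    """Smallest number of bytes that can store indices 0..max_value."""
    # A signed type needs one extra bit for the sign.
    bits = max_value.bit_length() + (1 if signed else 0)
    return max(1, (bits + 7) // 8)

# 7 million connectivity indices fit into 3 bytes either way:
print(minimal_byte_width(7_000_000))               # 3 (unsigned holds up to 16,777,215)
print(minimal_byte_width(7_000_000, signed=True))  # 3 (signed holds up to 8,388,607)
print(minimal_byte_width(20_000_000))              # 4 (past the 3-byte range)
```

For the connectivity array of a 7-million-cell tetrahedral mesh, going from 8-byte to 3-byte integers cuts that dataset to 37.5% of its original size before any filter is even applied.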

Also, the compactify-datatypes option cannot be applied post mortem by h5repack (whereas deflate and scaleoffset can be added or removed with h5repack).

krenzland commented 1 year ago

How would the compression work then, when integrating it into SeisSol? Does it work well with the parallel I/O? Naively, I would assume that it chunks the data and then compresses it?

We might also consider this then for the output writing?

davschneller commented 1 year ago

> How would the compression work then, when integrating it into SeisSol? Does it work well with the parallel I/O? Naively, I would assume that it chunks the data and then compresses it?

Yup, that's exactly how it works. But the chunking needs to be done explicitly; cf. the h5repack command above, where we change the data layout from contiguous to chunked. As I understand it, each chunk is then compressed separately. Apparently this is supported with parallel I/O, as long as we don't also use additional HDF5 data transforms (which we shouldn't... I think). (We can still run PUMGen with MPI, and it doesn't throw any errors.)
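As a language-agnostic sketch of the idea (plain Python with zlib, not the HDF5 implementation): the dataset is split into fixed-size chunks, each chunk is deflated independently, and a reader can decompress any single chunk without touching the others. That independence is what makes the scheme compatible with parallel access:

```python
import zlib

CHUNK_ELEMS = 8192  # elements per chunk, matching the CHUNK settings above

def compress_chunked(data: bytes, chunk_bytes: int) -> list:
    """Compress each fixed-size chunk independently (as HDF5 does per chunk)."""
    return [zlib.compress(data[i:i + chunk_bytes], 1)
            for i in range(0, len(data), chunk_bytes)]

def read_chunk(chunks: list, index: int) -> bytes:
    """Random access: only the requested chunk is decompressed."""
    return zlib.decompress(chunks[index])

# Stand-in for an 800 kB dataset of 8-byte elements.
data = bytes(8) * 100_000
chunks = compress_chunked(data, CHUNK_ELEMS * 8)

# Reassembling all chunks recovers the original data.
assert b"".join(read_chunk(chunks, i) for i in range(len(chunks))) == data
```

In HDF5 terms, the layout change corresponds to H5Pset_chunk plus H5Pset_deflate on the dataset creation property list; the chunk dimensions then also determine the granularity at which compressed data can be read back.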

For an example in code, cf. e.g. https://github.com/SeisSol/PUMGen/blob/7748c255f8c69fe7ded75d9a0eee1ed71586cba2/src/pumgen.cpp#L335-L346

(and yes, it's still a lot of duplicated code in there at the moment—maybe it can be slightly generalized as in PUMLcube https://github.com/SeisSol/Meshing/pull/49 )

> We might also consider this then for the output writing?

That's a good idea; if we work on I/O nodes which support that, then why not?