jimbraun / XCDF

XCDF: eXplicitly Compacted Data Format. See documentation at Read the Docs:
https://xcdf.readthedocs.io/en/latest/
Other
14 stars 8 forks source link

XCDF/HAWCSim Corruption #90

Closed jimbraun closed 2 years ago

jimbraun commented 3 years ago

Hi Andy,

I've continued working on this problem. I've found that if I fill the missing data at the end of the event with either zeroes or ones, I can re-write the file without errors. Interestingly, the re-written file is slightly smaller, which implies some differences between my environment (on OSX) and the one used by HAWCSim.

Can you please point me to the HAWCSim source file where the XCDF serialization takes place?

To summarize where we are:

The problem is constrained to one of the following:

  1. An error in XCDF that manifests on the HAWC cluster, but not on my machine

  2. An XCDF user error in HAWCSim that is not caught by XCDF

  3. Memory corruption or something else very bad happening in HAWCSim.

Issues to fix in the XCDF source code are at least:

jimbraun commented 2 years ago

From Andy:

I suspect that you are correct. By your suggestion, I removed the compression and the problem of producing unreadable files goes away. The bigger problem is that large events take a zillion hours to simulate and contain more hits than fit in memory, so in the end, the arrays that hold the data in XCDF are not the culprit, just where the memory is actually allocated.

I’m good with you closing this out.