Open ackagel opened 2 years ago
Hi,
The error code is not overflown, libZSTD actually reports them as high unsigned ints. Having said that: at a first glance there doesn't seem to be anything obviously wrong with the metadata, so, it's hard to tell what's happening without getting a look at the file itself. Can you upload it somewhere, or do you need to keep it confidential?
Hello,
And what is the value of bad frame and what are the minimal and maximal frames in the dataset? Just want to eliminate the usual suspects first :)
Best wishes
@michalsta This particular run should probably stay confidential, but it sounds like our team can put together a run which we can share by sometime next week. For context, this same error code pops up on pretty much all our tdf's thus far, so it's likely the issue will be replicated.
@MatteoLacki minimum frame, in terms of id, is 1, while the maximum ranges between 70k-110k. A bad frame typically pops up around 3,000-8,000, and then all the subsequent frames throw this error. The intensities/ion-mobilities that come from the prior 'error-free' frames look correct and don't seem to be corrupted.
@michalsta sorry for the delay, I can share some offending tdf/tdf_bin files now. Is sharing via GoogleDrive alright?
Yes, absolutely. Just post/send me the link
ok, that's weird. I downloaded these and... works for me ;) So. Let's take a few shots in the dark:
Maybe there's some zstd version mismatch and somehow it uses not the built-in but system-wide zstd?
What do you get when you run: python -c "import zstd; print(zstd.version())"
1.5.1.0 for me.
What's your system (not Python) zstd version? like:
$ ls /usr/lib/libzstd.so -l
lrwxrwxrwx 1 root root 16 Feb 19 19:04 /usr/lib/libzstd.so -> libzstd.so.1.5.2
Maybe something got changed in transit? What's the md5sum of the two files you sent? For me it's
2c41f1053df0315db5c331d3979f37d5 analysis.tdf
3af802550b9c7f3f2643899257a68f04 analysis.tdf_bin
Could you check out newest opentims, install the Python version from devel branch - it'll install a script "opentims_verify.py", could you run it on your machine, both with and without -c option?
Also, can you check which frame number is the crashy one?
good to hear it works on your end! I'll try extracting on a different system for now; probably a linux machine since this is sounding more and more like a "windows is cranky about something ambiguous" problem.
As for the zstd version, looks like no zstd module is installed for python; but pip installing zstd didn't improve anything. Pretty sure I don't have a system zstd installed (if I do, not sure where that DLL would be on Windows). md5sums match, and opentims_verify.py also crashes on the problem frame. Looks like the first crashy frame is 8622.
Ah, it's on Windows. Okay, now I can reproduce it. Will have a look.
Hello, is there any progress on this problem? I am getting the same error for my files from DIA experiments. From a certain frame on, so far always at around 42000 of 67700, all subsequent frames cause the error. I also run opentimspy in Windows.
I keep encountering this error:
when querying frames beyond some frame id, in some tdf files (e.g
D.query(frames=[bad_frame], columns=('intensity', ...))
). The error code reads like a u64 underflow (-10 or -8 I think). Here's some of the tdf metadata for context: