momentoscope / hextof-processor

Code for preprocessing data from the HEXTOF instrument at FLASH, DESY in Hamburg (DE)
https://hextof-processor.readthedocs.io/en/latest/
GNU General Public License v3.0
7 stars 4 forks source link

Permission denied when generating parquet data for lab runs #108

Open steinnymir opened 2 years ago

steinnymir commented 2 years ago

Beamtime directories generated for lab data seem to have restricted permissions to few users. This is absoultely correct, but limits the work that other users (me and Zain for example) can do.

Any suggestions how to solve this?

steinnymir commented 2 years ago

@zainsohail04 suggested using local directories for test purpouses. The parquet dir was hardcoded but I added an optional argument to read_data to overwrite this.

now the parquet folder can be arbitrarily selected, and data readout works as intended

zain-sohail commented 2 years ago

Thanks. I think I will restructure the code because this functionality is also in express dataframe creator class and can be generalized for both.

kutnyakhov commented 2 years ago

We are currently trying to write parquet in the "raw" folder, but they should go into "processed" folder /asap3/fs-flash-o/gpfs/hextof/2022/data/11013726/processed For this folder, everyone should have write permission.

kutnyakhov commented 2 years ago

One of our last gray days at FLASH, where we were using fs-flash-o/hextof for data storage all parquet files were written into folder /asap3/fs-flash-o/gpfs/hextof/2021/data/11013350/processed/parquet/per_file

So it was not even trying to write into "raw" folder - could it be that this was changed and now code trying to write into current ("raw") folder?

zain-sohail commented 2 years ago

When using readData, there is a optional argument for parquet_path. The default is it adds a new parquet dir to the the file path.