Open will-moore opened 1 year ago
Reimport still in progress - cancelled once because of long wait on FILESET_UPLOAD_PREP. The new import has been in progress since 8 March, also at FILESET_UPLOAD_PREP (with parallel-upload=10).
Since we will be working on that study, we should take the opportunity to also fix the location metadata.
Imported without chunks and exchanged the symlink in ManagedRepo similarly to the idr0013 case. The new Plate on pilot-idrtesting is http://localhost:1080/webclient/?show=plate-254 and the name is idr0015-nochunks. All looks good, the thumbs and full viewer work fine.
Estimate data volume...
uint8, 4 channels, Z: 20, 2048 x 2048, 22 x 18 wells, 84 plates. 11 TB
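The 11 TB figure can be reproduced with plain shell arithmetic. This is a rough sketch that assumes one uncompressed 2048 x 2048 stack per well and 1 byte per uint8 pixel; the variable names are only illustrative.

```shell
# All figures come from the estimate above; uint8 means 1 byte per pixel.
bytes_per_pixel=1
channels=4
z_sections=20
width=2048
height=2048
wells=$((22 * 18))
plates=84

# Uncompressed size of one well, one plate and the whole screen, in bytes.
per_well=$((bytes_per_pixel * channels * z_sections * width * height))
per_plate=$((per_well * wells))
total=$((per_plate * plates))

echo "per plate: $((per_plate / 1000000000)) GB"
echo "total:     $((total / 1000000000000)) TB"
```

This is an upper bound for raw pixels only; the converted plates on disk can be smaller because the chunks are compressed.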
Starting to free-up some space...
(base) [wmoore@pilot-zarr1-dev data]$ df -h /data
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 4.4T 587G 89% /data
$ cd /data
$ sudo rm -rf idr0011/
$ df -h ./
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 3.6T 1.3T 74% /data
Convert 1 screen...
screen -S idr0015_ngff
/home/wmoore/bioformats2raw-0.6.0-24/bin/bioformats2raw /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.screen TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.ome.zarr
EDIT: permission denied - chown -R wmoore idr0015
then re-ran at 13:16...
Make bucket...
$ aws --endpoint-url https://uk1s3.embassy.ebi.ac.uk s3 mb s3://idr0015
make_bucket: idr0015
$ aws --endpoint-url https://uk1s3.embassy.ebi.ac.uk s3api put-bucket-policy --bucket idr0015 --policy file://policy.json
$ aws --endpoint-url https://uk1s3.embassy.ebi.ac.uk s3api put-bucket-cors --bucket idr0015 --cors-configuration file://cors.json
Upload a previously-created plate from pilot-zarr1-dev
cd /data/idr0015
$ /home/wmoore/mc cp -r TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.ome.zarr uk1s3/idr0015/zarr
.../V/9/0/3/0/4/9/0/0: 53.04 GiB / 53.04 GiB ━━━━━━━━━━━━━━━ 25.02 MiB/s 36m10s
Looks good in validator
And vizarr (although the omero rendering settings in the .zarr are different from what's in IDR).
E.g. only 3 channels active, although 5 are exported OK:
Uploaded a 2nd plate, recently generated above:
(base) [wmoore@pilot-zarr1-dev idr0015]$ /home/wmoore/mc cp -r TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.ome.zarr uk1s3/idr0015/zarr
...me.zarr/V/9/0/3/0/4/9/0/0: 79.03 GiB / 79.03 GiB ━━━━━━━━━━━━━━━━━━━━ 37.09 MiB/s 36m21
Converting 4 more plates...
for i in TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U01--V01 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U00--V01 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U01--V01 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V0; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw --memo-directory ../memo /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen ${i%.*}.ome.zarr; done
Seeing errors writing memo files...
2023-07-02 20:38:56,486 [main] WARN loci.formats.Memoizer - failed to save memo file: ../memo/uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/.TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U01--V01.screen.bfmemo
java.io.IOException: No such file or directory
And then this, caused by a typo in my command above (--V0 should be --V01):
Exception in thread "main" picocli.CommandLine$ExecutionException: Error while calling command (com.glencoesoftware.bioformats2raw.Converter@63a65a25): java.io.FileNotFoundException: /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V0.screen (No such file or directory)
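A typo like this only shows up hours into the loop, so one option is to check that every .screen file exists before starting any conversion. The helper below is a hypothetical sketch (check_screens is not part of bioformats2raw); it only tests for the files and does not run anything.

```shell
# Hypothetical helper: report every plate name whose .screen file is missing.
# $1 is the patterns directory, the remaining arguments are plate names.
# Returns non-zero if at least one file is missing.
check_screens() {
    dir=$1
    shift
    missing=0
    for name in "$@"; do
        if [ ! -f "$dir/$name.screen" ]; then
            echo "missing: $dir/$name.screen" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

Calling it with the same list of names as the conversion loop, and only starting the loop if it succeeds, would have flagged the --V0 name immediately.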
Try to avoid memo issues by not using memo directory, but allow writing to source dir...
$ cd /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/
$ sudo chown wmoore patterns/
$ df -h /data/
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 3.6T 1.4T 74% /data
Running 12 more (including repeat of last typo fix above):
for i in TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V01 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U00--V01 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U01--V01 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U00--V01 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U01--V01 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U00--V01 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U01--V01 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U00--V01 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U01--V01 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U00--V01 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U01--V01 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U00--V01; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen ${i%.*}.ome.zarr; done
Started to zip some... First 4 zarrs...
screen -S idr0015_zip
for i in TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U01--V01 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U00--V01 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U01--V01 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V0; do zip -r "${i%/}.ome.zarr.zip" "$i.ome.zarr"; done
Still seeing memo issues...
2023-07-03 18:41:39,091 [pool-1-thread-4] WARN loci.formats.Memoizer - skipping memo: directory not writeable - /uod/idr/filesets/idr0015-UNKNOWN-taraoceans/20150918-tara/RAW_DATA/TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14/slide--S00/chamber--U00--V01/field--X17--Y21
Zip command above created 3 zips but failed with same typo as earlier! (oops again):
zip warning: name not matched: TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V0.ome.zarr
Conversion above was generating memo errors as before. Last was:
2023-07-03 21:47:48,790 [pool-1-thread-4] WARN loci.formats.Memoizer - skipping memo: directory not writeable - /uod/idr/filesets/idr0015-UNKNOWN-taraoceans/20150918-tara/RAW_DATA/TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10/slide--S00/chamber--U00--V01/field--X17--Y21
but ran to completion OK...
Current status...
$ ls -alh /data/idr0015
total 161G
drwxrwxr-x. 19 wmoore dlindner 4.0K Jul 4 05:08 .
drwxrwxr-x. 14 root idr-data 270 Jul 2 22:53 ..
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 2 13:17 TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 2 18:23 TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 48G Jul 3 15:20 TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 2 20:03 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 63G Jul 3 22:49 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 2 21:33 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 51G Jul 4 05:08 TARA_HCS1_H5_G100001988_G100001989--2013_09_23_19_42_50_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 06:27 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore dlindner 4.0K Mar 9 11:30 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore dlindner 115K Mar 9 10:13 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.screen
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 07:54 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 09:33 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 10:44 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 11:55 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 13:03 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 14:14 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 15:24 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 16:29 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 18:42 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U00--V01.ome.zarr
drwxrwxr-x. 16 wmoore wmoore 232 Jul 3 20:34 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 21:47 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U00--V01.ome.zarr
Available space getting low...
$ df -h /data
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 4.7T 234G 96% /data
Need to delete data... $ sudo rm -rf idr0036
Upload first 3 zips...
$ ./ascp -P33001 -i ../etc/asperaweb_id_dsa.openssh -d /data/idr0015/idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/13xxxxxxxx
Deleted 3 zips uploaded above and their .zarr dirs.
Move all remaining .zarr to batch1 for zipping and upload...
mkdir batch1
mv *.zarr batch1
cd batch1
for i in */; do zip -r "${i%/}.zip" "$i"; done
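For reference, a small sketch of what the parameter expansions in these loops do. The plate name here is a shortened, made-up stand-in for the real TARA names.

```shell
# A directory name as produced by the */ glob (note the trailing slash).
i="plate--U00--V01.ome.zarr/"
# ${i%/} strips one trailing slash, giving the archive name.
zip_name="${i%/}.zip"
echo "$zip_name"

# A plate name with no dot in it, as used in the conversion loops.
j="plate--U00--V01"
# ${j%.*} strips the shortest suffix starting with a dot; there is none here,
# so the name is unchanged.
zarr_name="${j%.*}.ome.zarr"
echo "$zarr_name"
```

So with the */ glob the `${i%/}` is what keeps the slash out of the zip file name, while `${i%.*}` in the conversion loops is effectively a no-op for these plate names.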
Converting 10 Filesets into "batch2"...
screen -r idr0015_ngff
mkdir batch2
for i in TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U01--V01 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U00--V01 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U01--V01 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U00--V01 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U01--V01 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U00--V01 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U01--V01 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U00--V01 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U01--V01 TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U00--V01; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch2/${i%.*}.ome.zarr; done
Deleting individual ome.zarr filesets from batch1 once their ome.zarr.zip has been created.
Also upload a random (last) ome.zarr to s3 from batch1 to validate we're still good...
$ /home/wmoore/mc cp -r TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U00--V01.ome.zarr uk1s3/idr0015/zarr
...me.zarr/A/1/0/0/0/0/5/0/0: 7.45 MiB
...me.zarr/V/9/0/3/0/4/9/0/0: 60.48 GiB / 60.48 GiB ━━━━━━━━━━━━━━━━━━━━ 28.41 MiB/s 36m19s
Current state of batch1:
In a day of generating zips, we only have 4 zips created (5th due soon):
Zips take 4-5 hours to generate, compared to only about 1.5 hours to convert the ome.zarr.
(base) [wmoore@pilot-zarr1-dev ~]$ ls -alh /data/idr0015/batch1
total 196G
drwxrwxr-x. 14 wmoore wmoore 4.0K Jul 6 20:21 .
drwxrwxr-x. 4 wmoore dlindner 138 Jul 5 22:55 ..
-rw-rw-r--. 1 wmoore wmoore 49G Jul 6 04:39 TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 32G Jul 6 08:48 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 8 wmoore dlindner 107 Jul 6 23:09 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 31G Jul 6 13:12 TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 07:54 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 61G Jul 6 20:14 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 09:33 TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 10:44 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 11:55 TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 13:03 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 14:14 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 15:24 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 16:29 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 18:42 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U00--V01.ome.zarr
drwxrwxr-x. 16 wmoore wmoore 232 Jul 3 20:34 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 3 21:47 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U00--V01.ome.zarr
-rw-------. 1 wmoore wmoore 25G Jul 6 23:10 zitPNp7I
Current state of batch2 (started generating zips too):
(base) [wmoore@pilot-zarr1-dev ~]$ ls -alh /data/idr0015/batch2
total 88G
drwxrwxr-x. 12 wmoore wmoore 4.0K Jul 6 22:37 .
drwxrwxr-x. 4 wmoore dlindner 138 Jul 5 22:55 ..
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 00:29 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 38G Jul 6 17:24 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 01:41 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 45G Jul 6 22:29 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 02:53 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 04:06 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 05:11 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 06:27 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 07:45 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 08:59 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 10:16 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 6 11:37 TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U00--V01.ome.zarr
-rw-------. 1 wmoore wmoore 5.1G Jul 6 23:14 ziBWfTRP
Convert another 10 in batch3...
for i in TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U01--V01 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U00--V01 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U01--V01 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U00--V01 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U01--V01 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U00--V01 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U01--V01 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U00--V01 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch3/${i%.*}.ome.zarr; done
Upload completed zips from batch1...
(base) [wmoore@pilot-zarr1-dev batch1]$ ls /data/idr0015/batch1/idr0015/
TARA_HCS1_H5_G100001472_G100001473--2013_09_28_19_45_25_chamber--U00--V01.ome.zarr.zip
TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U00--V01.ome.zarr.zip
TARA_HCS1_H5_G100002411_G100002481--2013_08_28_14_46_59_chamber--U01--V01.ome.zarr.zip
TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U00--V01.ome.zarr.zip
TARA_HCS1_H5_G100002479_G100002163--2013_08_26_14_59_40_chamber--U01--V01.ome.zarr.zip
TARA_HCS1_H5_G100002567_G100002568--2013_09_25_12_55_58_chamber--U00--V01.ome.zarr.zip
$ ./ascp -P33001 -i ../etc/asperaweb_id_dsa.openssh -d /data/idr0015/batch1/idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/136xxxxxxxxxx
TARA_HCS1_H5_G100001472_G100001473--2013_09_2 100% 48GB 328Mb/s 23:53
TARA_HCS1_H5_G100002411_G100002481--2013_08 100% 31GB 364Mb/s 38:41
TARA_HCS1_H5_G100002411_G100002481--2013_08_2. 100% 30GB 290Mb/s 53:57
TARA_HCS1_H5_G100002479_G100002163--2013_08_2 100% 60GB 118Mb/s 1:32:42
TARA_HCS1_H5_G100002479_G100002163--2013_08_2 100% 60GB 304Mb/s 2:03:27
TARA_HCS1_H5_G100002567_G100002568--2013_09_2 100% 33GB 91.7Mb/s 2:20:28
Zipping ran out of space...
adding: TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U00--V01.ome.zarr/A/1/0/0/0/3/4/1/ (stored 0%)
adding: TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U00--V01.ome.zarr/A/1/0/0/0/3/4/1/0
zip I/O error: No space left on device
zip error: Output file write failure (write error on zip file)
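One way to catch this before zip starts is to compare the size of the directory with the free space on its filesystem. The helper below is a hypothetical sketch using only du and df; it takes the worst case where the archive is as large as the source (no compression gain).

```shell
# Hypothetical helper: succeed only if the filesystem holding $1 has at least
# as much free space as the directory $1 currently occupies (both in KiB).
has_room_for_copy() {
    src=$1
    # du -sk prints "<KiB><TAB><path>"; keep the number.
    need_kb=$(du -sk "$src" | awk '{print $1}')
    # df -Pk prints a header line, then one line per filesystem;
    # column 4 is the available space in KiB.
    avail_kb=$(df -Pk "$src" | awk 'NR==2 {print $4}')
    [ "$avail_kb" -ge "$need_kb" ]
}
```

In a zip loop it could be used as `has_room_for_copy "$i" || break` before each `zip -r`. According to the Info-ZIP zip man page, -m removes the originals only once the archive has been written without error, so the same headroom is likely needed with that option too.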
For batch3 try zipping with -0 (no compression) and -m (move files into zip - delete originals)...
$ screen -r idr0015_zip
$ cd /data/idr0015/batch3
$ for i in */; do zip -0mr "${i%/}.zip" "$i"; done
Upload 5 more from batch2, then delete
$ ls -alh /data/idr0015/batch2/idr0015
total 196G
drwxrwxr-x. 2 wmoore wmoore 4.0K Jul 7 14:58 .
drwxrwxr-x. 8 wmoore wmoore 4.0K Jul 8 00:04 ..
-rw-rw-r--. 1 wmoore wmoore 38G Jul 6 17:24 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 45G Jul 6 22:29 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 43G Jul 7 03:30 TARA_HCS1_H5_G100003584_G100003586--2014_06_26_15_58_43_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 37G Jul 7 08:14 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 34G Jul 7 12:17 TARA_HCS1_H5_G100003741_G100003739--2013_09_30_14_59_10_chamber--U01--V01.ome.zarr.zip
$ ./ascp -P33001 -i ../etc/asperaweb_id_dsa.openssh -d /data/idr0015/batch2/idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/136e8d-xxxx
TARA_HCS1_H5_G100003584_G100003586--2013_09_2 100% 37GB 313Mb/s 16:46
TARA_HCS1_H5_G100003584_G100003586--2014_06_2 100% 44GB 177Mb/s 38:08
TARA_HCS1_H5_G100003584_G100003586--2014_06_2 100% 42GB 271Mb/s 58:21
TARA_HCS1_H5_G100003741_G100003739--2013_09_3 100% 36GB 291Mb/s 1:16:30
TARA_HCS1_H5_G100003741_G100003739--2013_09_3 100% 33GB 64.6Mb/s 1:32:49
$ rm -rf /data/idr0015/batch2/idr0015/*
Looks like zipping of batch3 (with -m and -0) stopped prematurely for some reason:
screen -r idr0015_zip2
...
adding: TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01.ome.zarr/A/1/0/0/0/4/19/0/ (stored 0%)
adding: TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01.ome.zarr/A/1/0/0/0/4/19/0/0 (stored 0%)
adding: TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01.ome.zarr/A/1/0/0/0/4/19/0/1 (stored 0%)
Last 2 zips are smaller than expected. Probably need to regenerate
(base) [wmoore@pilot-zarr1-dev batch3]$ ls -alh
total 614G
drwxrwxr-x. 2 wmoore wmoore 4.0K Jul 9 04:32 .
drwxrwxr-x. 5 wmoore dlindner 156 Jul 7 08:03 ..
-rw-rw-r--. 1 wmoore wmoore 96G Jul 8 05:04 TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 80G Jul 8 09:04 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 79G Jul 8 12:29 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 72G Jul 8 16:00 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 76G Jul 8 19:06 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 75G Jul 8 22:08 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 74G Jul 9 01:30 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 59G Jul 9 04:09 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 6.6G Jul 9 04:31 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 55M Jul 9 04:32 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01.ome.zarr.zip
but looks like space didn't run out:
$ df -h /data
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 4.5T 482G 91% /data
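To avoid spotting truncated zips by eye, a size threshold can be applied with find. This is only a sketch: list_small_zips is a made-up helper and the threshold is a judgement call based on the sizes in the listing above, not a real integrity check.

```shell
# Hypothetical helper: list .zip files directly inside directory $1 that are
# smaller than $2 bytes.
list_small_zips() {
    find "$1" -maxdepth 1 -type f -name '*.zip' -size -"$2"c
}
```

For example, `list_small_zips /data/idr0015/batch3 $((10 * 1024 * 1024 * 1024))` would list anything under 10 GiB, which here should be just the 6.6G and 55M files. Running `unzip -t` on a suspect archive would be a stronger (but much slower) test.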
Zip remaining zarrs in batch1 (and batch2)
screen -S idr0015_zip(2)
cd batch1(2)
for i in */; do zip -0mr "${i%/}.zip" "$i"; done
Moved all zips for batch3 and upload (except for 2 zips that appear too small - corrupted?).
(base) [wmoore@pilot-zarr1-dev batch3]$ ls -alh idr0015/
total 608G
-rw-rw-r--. 1 wmoore wmoore 96G Jul 8 05:04 TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 80G Jul 8 09:04 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 79G Jul 8 12:29 TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 72G Jul 8 16:00 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 76G Jul 8 19:06 TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 75G Jul 8 22:08 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 74G Jul 9 01:30 TARA_HCS1_H5_G100007472_G100006130--2013_10_31_18_54_20_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 59G Jul 9 04:09 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U00--V01.ome.zarr.zip
(base) [wmoore@pilot-zarr1-dev batch3]$ ls -alh
total 6.7G
drwxrwxr-x. 2 wmoore wmoore 4.0K Jul 10 07:30 idr0015
-rw-rw-r--. 1 wmoore wmoore 6.6G Jul 9 04:31 TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 55M Jul 9 04:32 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01.ome.zarr.zip
Upload...
./ascp -P33001 -i ../etc/asperaweb_id_dsa.openssh -d /data/idr0015/batch3/idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/13xxxxxxx
As suggested by Jason & Josh, looking at 7zip... https://www.7-zip.org/download.html
$ wget https://www.7-zip.org/a/7z2201-linux-x64.tar.xz
Seems that we actually need p7zip, downloading from https://sourceforge.net/projects/p7zip/files/p7zip/16.02/
wget https://sourceforge.net/projects/p7zip/files/p7zip/16.02/p7zip_16.02_src_all.tar.bz2/download
That doesn't give me anything useful. Can't actually find any install instructions!
Upload kinda slow (since they're not compressed):
(base) [wmoore@pilot-zarr1-dev bin]$ ./ascp -P33001 -i ../etc/asperaweb_id_dsa.openssh -d /data/idr0015/batch3/idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/136e8d-xxxxxxxxx
TARA_HCS1_H5_G100004906_G100002201--2013_08_2 100% 95GB 175Mb/s 1:00:51
TARA_HCS1_H5_G100006116_G100006253--2013_10_3 100% 79GB 91.5Mb/s 1:52:26
TARA_HCS1_H5_G100006116_G100006253--2013_10_3 100% 78GB 215Mb/s 2:52:26
TARA_HCS1_H5_G100006175_G100006179--2013_11_0 100% 71GB 179Mb/s 3:38:16
TARA_HCS1_H5_G100006175_G100006179--2013_11_0 100% 75GB 341Mb/s 4:27:16
TARA_HCS1_H5_G100007472_G100006130--2013_10_3 100% 74GB 179Mb/s 5:07:43
TARA_HCS1_H5_G100007472_G100006130--2013_10_3 100% 73GB 337Mb/s 5:46:02
TARA_HCS1_H5_G100007665_G100007576--2013_10_2 100% 58GB 145Mb/s 6:15:54
Batch4 (12 plates - including last 2 from batch3 which failed to complete)
mkdir batch4
for i in TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U01--V01 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U00--V01 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U01--V01 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U01--V01 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U00--V01 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U01--V01 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U00--V01 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U00--V01; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch4/${i%.*}.ome.zarr; done
Started uploading last completed zips from batch2:
(base) [wmoore@pilot-zarr1-dev bin]$ ls -lh /data/idr0015/batch2
total 423G
-rw-rw-r--. 1 wmoore wmoore 89G Jul 10 07:34 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 89G Jul 10 11:17 TARA_HCS1_H5_G100004339_G100004341--2013_09_27_15_24_28_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 72G Jul 10 15:03 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 72G Jul 10 18:22 TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 103G Jul 10 22:53 TARA_HCS1_H5_G100004906_G100002201--2013_08_25_19_31_15_chamber--U00--V01.ome.zarr.zip
...
TARA_HCS1_H5_G100004339_G100004341--2013_09_2 100% 88GB 165Mb/s 42:18
TARA_HCS1_H5_G100004339_G100004341--2013_09_2 100% 88GB 352Mb/s 1:24:00
TARA_HCS1_H5_G100004727_G100004940--2013_12_0 100% 71GB 242Mb/s 1:58:25
TARA_HCS1_H5_G100004727_G100004940--2013_12_0 100% 71GB 271Mb/s 2:32:27
TARA_HCS1_H5_G100004906_G100002201--2013_08_2 100% 102GB 231Mb/s 3:21:41
and batch1
(base) [wmoore@pilot-zarr1-dev bin]$ ls -lh /data/idr0015/batch1
total 485G
-rw-rw-r--. 1 wmoore wmoore 55G Jul 10 06:45 TARA_HCS1_H5_G100002655_G100002656--2013_09_24_15_21_06_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 55G Jul 10 09:10 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 55G Jul 10 12:14 TARA_HCS1_H5_G100002978_G100002980--2013_08_29_12_36_46_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 143G Jul 10 18:06 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 119G Jul 10 22:50 TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 61G Jul 11 01:43 TARA_HCS1_H5_G100003584_G100003586--2013_09_26_15_23_10_chamber--U00--V01.ome.zarr.zip
...
TARA_HCS1_H5_G100002655_G100002656--2013_09_2 100% 54GB 298Mb/s 25:40
TARA_HCS1_H5_G100002978_G100002980--2013_08_2 100% 54GB 439Mb/s 50:48
TARA_HCS1_H5_G100002978_G100002980--2013_08_2 100% 54GB 238Mb/s 1:15:34
TARA_HCS1_H5_G100003406_G100004906--2013_08_2 100% 142GB 192Mb/s 2:20:05
TARA_HCS1_H5_G100003406_G100004906--2013_08_2 100% 118GB 169Mb/s 3:15:32
TARA_HCS1_H5_G100003584_G100003586--2013_09_2 100% 60GB 196Mb/s 3:43:59
Completed: 508294083K bytes transferred in 13440 seconds
(309811K bits/sec), in 6 files, 1 directory.
Zip batch4 (not all exported)
(base) [wmoore@pilot-zarr1-dev batch4]$ ls -lh
total 40K
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 02:32 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 04:02 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 05:34 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 07:25 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 08:43 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 10:01 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 11:16 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 12:31 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 13:46 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 15:15 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U00--V01.ome.zarr
(base) [wmoore@pilot-zarr1-dev batch4]$ for i in */; do zip -mr "${i%/}.zip" "$i"; done
10 hours later, only 2 zips created for batch4:
(base) [wmoore@pilot-zarr1-dev ~]$ ls -lh /data/idr0015/batch4
total 77G
-rw-rw-r--. 1 wmoore wmoore 35G Jul 11 20:41 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 39G Jul 12 01:35 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 05:34 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U01--V01.ome.zarr
...
Not much space left:
(base) [wmoore@pilot-zarr1-dev ~]$ df -h /data
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 4.8T 175G 97% /data
Installed p7zip on pilot-zarr1-dev: https://github.com/IDR/idr-metadata/issues/651#issuecomment-1632133497
Current state of batch4 zipping...
(base) [wmoore@pilot-zarr1-dev ~]$ ls -alh /data/idr0015/batch4
total 136G
drwxrwxr-x. 9 wmoore wmoore 4.0K Jul 12 06:17 .
drwxrwxr-x. 5 wmoore dlindner 156 Jul 10 23:09 ..
-rw-rw-r--. 1 wmoore wmoore 35G Jul 11 20:41 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 39G Jul 12 01:35 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 39G Jul 12 06:02 TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 07:25 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 08:43 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 10:01 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 11:16 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 12:31 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U00--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 13:46 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01.ome.zarr
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 15:15 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U00--V01.ome.zarr
-rw-------. 1 wmoore wmoore 25G Jul 12 09:08 ziIj7CU5
Cancelled that and tried p7zip...
$ 7za a -sdel TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01.ome.zarr.zip TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01.ome.zarr
Moved that zip into batch4/idr0015 for upload...
Zip the remaining batch4 zarrs...
cd /data/idr0015/batch4
for i in */; do 7za a "${i%/}.zip" "$i"; done
Certainly a bit faster using 7za than regular zip: about 40 minutes per zip.
But the originals aren't deleted, because this loop omits the -sdel flag used above:
total 267G
drwxrwxr-x. 9 wmoore wmoore 4.0K Jul 13 05:48 .
drwxrwxr-x. 7 wmoore dlindner 190 Jul 13 03:58 ..
drwxrwxr-x. 2 wmoore wmoore 10 Jul 13 03:48 idr0015
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 08:43 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 46G Jul 13 03:13 TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 10:01 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 37G Jul 13 03:45 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 11:16 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 37G Jul 13 04:18 TARA_HCS1_H5_G100008152_G100008154--2013_11_04_20_31_35_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 12:31 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 47G Jul 13 05:00 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U00--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 13:46 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 47G Jul 13 05:40 TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01.ome.zarr.zip
drwxrwxr-x. 25 wmoore wmoore 4.0K Jul 11 15:15 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U00--V01.ome.zarr
-rw-rw-r--. 1 wmoore wmoore 56G Jul 13 06:28 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U00--V01.ome.zarr.zip
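To avoid leaving the originals behind, a variant of the loop could delete each zarr only once its zip has been written successfully. A minimal sketch, assuming 7za is on the PATH; the ARCHIVER variable and the zip_then_delete name are made up for illustration and are not part of the commands run above.

```shell
#!/usr/bin/env bash
# Sketch: zip each directory, then remove it only if the archiver succeeded.

# Archiver command; overridable so the logic can be tried without 7za (assumption).
ARCHIVER=${ARCHIVER:-7za}

zip_then_delete() {
  local dir name
  for dir in "$@"; do
    name=${dir%/}                       # drop a trailing slash, if any
    if "$ARCHIVER" a "$name.zip" "$name" > /dev/null; then
      rm -rf -- "$name"                 # safe: the zip was written
    else
      echo "zip failed, keeping $name" >&2
    fi
  done
}

# Example, from inside a batch directory:
# zip_then_delete */
```

This differs from `7za a -sdel` in that the deletion is done by the shell after 7za exits, so a failed or interrupted zip leaves the zarr untouched.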
Uploading...
batch5...
screen -r idr0015_ngff
mkdir batch5
mkdir memo
for i in TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U01--V01 TARA_HCS1_H5_G100008608_G100008610--2013_11_05_20_28_35_chamber--U00--V01 TARA_HCS1_H5_G100008608_G100008610--2013_11_05_20_28_35_chamber--U01--V01 TARA_HCS1_H5_G100008990_G100009437--2013_12_07_22_33_03_chamber--U00--V01 TARA_HCS1_H5_G100008990_G100009437--2013_12_07_22_33_03_chamber--U01--V01 TARA_HCS1_H5_G100009617_G100008907--2013_12_06_20_46_40_chamber--U00--V01 TARA_HCS1_H5_G100009617_G100008907--2013_12_06_20_46_40_chamber--U01--V01 TARA_HCS1_H5_G100009725_G100010454--2013_12_05_21_08_22_chamber--U00--V01 TARA_HCS1_H5_G100009725_G100010454--2013_12_05_21_08_22_chamber--U01--V01 TARA_HCS1_H5_G100010173_G100010177--2013_08_22_17_12_07_chamber--U00--V01 TARA_HCS1_H5_G100010173_G100010177--2013_08_22_17_12_07_chamber--U01--V01; do echo $i; ~/bioformats2raw-0.6.0-24/bin/bioformats2raw --memo-directory ./memo /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch5/${i%.*}.ome.zarr; done
Speed of zipping with 7zip...
45 - 50 minutes per zip:
(base) [wmoore@pilot-zarr1-dev idr0015]$ ls -lh
-rw-rw-r--. 1 wmoore wmoore 53G Jul 13 12:20 TARA_HCS1_H5_G100008302_G100008304--2013_12_02_21_30_23_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 37G Jul 13 13:06 TARA_HCS1_H5_G100008608_G100008610--2013_11_05_20_28_35_chamber--U00--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 38G Jul 13 13:47 TARA_HCS1_H5_G100008608_G100008610--2013_11_05_20_28_35_chamber--U01--V01.ome.zarr.zip
-rw-rw-r--. 1 wmoore wmoore 41G Jul 13 14:32 TARA_HCS1_H5_G100008990_G100009437--2013_12_07_22_33_03_chamber--U00--V01.ome.zarr.zip
Convert last 25 (still need to go back and fix 3 that failed earlier)... Not a ton of space...
ssh pilot-zarr1-dev
$ df -h /data
Filesystem Size Used Avail Use% Mounted on
/dev/vdb 4.9T 4.2T 713G 86% /data
cd /data
mkdir idr0015 && cd idr0015
mkdir batch6
mkdir memo
screen -r idr0015_ngff
for i in TARA_HCS1_H5_G100010173_G100010177--2014_06_24_12_25_12_chamber--U00--V01 TARA_HCS1_H5_G100010173_G100010177--2014_06_24_12_25_12_chamber--U01--V01 TARA_HCS1_H5_G100010237_G100010241--2013_08_20_15_53_43_chamber--U00--V01 TARA_HCS1_H5_G100010237_G100010241--2013_08_20_15_53_43_chamber--U01--V01 TARA_HCS1_H5_G100010237_G100010241--2014_06_25_13_42_23_chamber--U00--V01 TARA_HCS1_H5_G100010237_G100010241--2014_06_25_13_42_23_chamber--U01--V01 TARA_HCS1_H5_G100010241_G100010731--2013_09_29_19_14_59_chamber--U00--V01 TARA_HCS1_H5_G100010241_G100010731--2013_09_29_19_14_59_chamber--U01--V01 TARA_HCS1_H5_G100010607_G100010623--2014_06_23_16_05_11_chamber--U00--V01 TARA_HCS1_H5_G100010607_G100010623--2014_06_23_16_05_11_chamber--U01--V01 TARA_HCS1_H5_G100010623_G100010731--2013_08_23_17_27_23_chamber--U00--V01 TARA_HCS1_H5_G100010623_G100010731--2013_08_23_17_27_23_chamber--U01--V01 TARA_HCS1_H5_G100010623_G100010731--2014_06_20_18_48_17_chamber--U00--V01 TARA_HCS1_H5_G100010623_G100010731--2014_06_20_18_48_17_chamber--U01--V01 TARA_HCS1_H5_G100010824_G100010826--2013_12_03_22_14_42_chamber--U00--V01 TARA_HCS1_H5_G100010824_G100010826--2013_12_03_22_14_42_chamber--U01--V01 TARA_HCS1_H5_G100010891_G100010893--2013_12_04_21_23_54_chamber--U00--V01 TARA_HCS1_H5_G100010891_G100010893--2013_12_04_21_23_54_chamber--U01--V01 TARA_HCS1_H5_G100011084_G100010958--2013_10_04_09_31_04_chamber--U00--V01 TARA_HCS1_H5_G100011084_G100010958--2013_10_04_09_31_04_chamber--U01--V01 TARA_HCS1_H5_G100011162_G100012868--2013_10_03_13_08_50_chamber--U00--V01 TARA_HCS1_H5_G100011162_G100012868--2013_10_03_13_08_50_chamber--U01--V01 TARA_HCS1_H5_G100012537_G100012477--2013_10_01_12_30_33_chamber--U00--V01 TARA_HCS1_H5_G100012537_G100012477--2013_10_01_12_30_33_chamber--U01--V01 TARA_HCS1_H5_G100012694_G100012776--2013_10_02_14_27_33_chamber--U00--V01 TARA_HCS1_H5_G100012694_G100012776--2013_10_02_14_27_33_chamber--U01--V01; do
~/bioformats2raw-0.6.0-24/bin/bioformats2raw --memo-directory ./memo /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch6/${i%.*}.ome.zarr; done
Need a simple script to process the zarrs generated above.
It creates its own idr0015 dir, which we know is empty.
Create the script at /data/idr0015/upload/zip_upload.sh:
mkdir -p idr0015
# Zip, upload and clean up each zarr in turn
for i in *.zarr; do
  echo "$i"
  7za a "idr0015/$i.zip" "$i"
  ~/.aspera/cli/bin/ascp -P33001 -i ~/.aspera/cli/etc/asperaweb_id_dsa.openssh -d idr0015 bsaspera_w@hx-fasp-1.ebi.ac.uk:5f/136xxxxx
  # Remove the zip so the idr0015 dir is empty for the next upload
  rm -f "idr0015/$i.zip"
  # rm -rf "$i"
done
Then we can move files into upload/ etc...
mv *.zarr upload
cd upload
./zip_upload.sh
Seems to work OK, but this plate didn't fully export (so the zip is small). Need to re-export.
TARA_HCS1_H5_G100010241_G100010731--2013_09_29_19_14_59_chamber--U00--V01.ome.zarr
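Incomplete exports like this one show up as unusually small zips, so they could be flagged before upload. A rough sketch; the function name is hypothetical and the size threshold is left to the caller, since the complete zips listed above range from about 35G to 56G.

```shell
#!/usr/bin/env bash
# Sketch: list zips smaller than a minimum size, as candidates for re-export.

# Print every *.zip in directory $1 whose size is below $2 bytes.
list_small_zips() {
  local dir=$1 min_bytes=$2 zip size
  for zip in "$dir"/*.zip; do
    [ -f "$zip" ] || continue            # skip when the glob matches nothing
    size=$(( $(wc -c < "$zip") ))        # arithmetic strips any padding from wc
    if [ "$size" -lt "$min_bytes" ]; then
      echo "$zip"
    fi
  done
}

# Example (10 GiB threshold is an assumption):
# list_small_zips idr0015 $(( 10 * 1024 * 1024 * 1024 ))
```
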
However, the zip_upload.sh script above takes a long time before anything is deleted, and it runs all the steps for each zarr in series.
That is useful for processing a bunch of data over longer periods without tending to the process, but it is not as fast as manually running the zip, upload and delete steps in parallel as above, so will continue with this approach...
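A middle ground might be to overlap the steps inside the script: zip the next plate while the previous zip is still uploading. The following is only a sketch of that idea. zip_one, upload_one and zip_upload_pipelined are hypothetical names, and upload_one is a placeholder that would need to be replaced with the real ascp call from zip_upload.sh.

```shell
#!/usr/bin/env bash
# Sketch: overlap zipping of the next plate with the upload of the previous one.

# Zip one zarr directory ($1) into the output dir ($2). Assumes 7za is installed.
zip_one() {
  7za a "$2/$1.zip" "$1" > /dev/null
}

# Placeholder upload step (assumption): swap in the real upload command.
upload_one() {
  echo "would upload $1"
}

# Usage: zip_upload_pipelined OUTDIR ZARR [ZARR...]
zip_upload_pipelined() {
  local outdir=$1; shift
  local zarr upload_pid=""
  mkdir -p "$outdir"
  for zarr in "$@"; do
    # This zip runs while the previous upload (if any) is still in the background.
    zip_one "$zarr" "$outdir" || return 1
    # Only one upload at a time: wait for the previous one before starting the next.
    if [ -n "$upload_pid" ]; then wait "$upload_pid" || return 1; fi
    ( upload_one "$outdir/$zarr.zip" && rm -f "$outdir/$zarr.zip" ) &
    upload_pid=$!
  done
  if [ -n "$upload_pid" ]; then wait "$upload_pid" || return 1; fi
}
```

At most two zips exist on disk at any time (one uploading, one being written), which matters given how little space is left on /data.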
https://www.ebi.ac.uk/biostudies/submissions/files?path=%2Fuser%2Fidr0015 lists 82 plates, but we need 84. To find missing plates, use JS Console to load names from IDR and compare...
let url = "https://idr.openmicroscopy.org/webclient/api/plates/?id=1201"
let idr_plates = await fetch(url).then(rsp => rsp.json());
let idr_names = idr_plates.plates.map(p => p.name);
let names = [];
[].forEach.call(document.querySelectorAll("div [role='row'] .ag-cell[col-id='name']"), function(div) {
names.push(div.innerHTML.trim().replace(".ome.zarr.zip", ""));
});
idr_names.forEach(n => {if (names.indexOf(n) == -1) {console.log(n)}; });
TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01
TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01
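The same comparison could also be done in the shell once both lists are saved to disk. A sketch, assuming two hypothetical text files with one name per line: the expected plate names from IDR and the uploaded zip names from BioStudies.

```shell
#!/usr/bin/env bash
# Sketch: print names present in the expected list but absent from the uploaded list.

# $1 = file of expected plate names, $2 = file of uploaded *.ome.zarr.zip names.
missing_plates() {
  local expected=$1 uploaded=$2
  # Strip the .ome.zarr.zip suffix, then keep lines unique to the expected list.
  comm -23 \
    <(sort -u "$expected") \
    <(sed 's/\.ome\.zarr\.zip$//' "$uploaded" | sort -u)
}

# Example (file names are assumptions):
# missing_plates idr_names.txt biostudies_names.txt
```
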
2 plates missing from batch3 above. Re-export...
screen -r idr0015_ngff
cd /data/idr0015/batch3
for i in TARA_HCS1_H5_G100007665_G100007576--2013_10_28_21_05_26_chamber--U01--V01 TARA_HCS1_H5_G100007887_G100007889--2013_10_29_19_56_21_chamber--U00--V01; do
~/bioformats2raw-0.6.0-24/bin/bioformats2raw --memo-directory ./memo /uod/idr/metadata/idr0015-colin-taraoceans/screenA/patterns/$i.screen batch6/${i%.*}.ome.zarr; done
Moved those 2 .zarrs into the upload dir, then...
cd upload
./zip_upload.sh
22 out of 84 filesets "viewable" at https://uk1s3.embassy.ebi.ac.uk/bia-integrator-data/pages/S-BIAD861.html
On idr0125-pilot, take first 10 for testing...
idr0015/TARA_HCS1_H5_G100010607_G100010623--2014_06_23_16_05_11_chamber--U01--V01.ome.zarr,S-BIAD861/00fc2a08-e352-4720-beac-13fd06cda6b2,21164
idr0015/TARA_HCS1_H5_G100010237_G100010241--2014_06_25_13_42_23_chamber--U00--V01.ome.zarr,S-BIAD861/0fb7c57c-fc23-43a0-b5aa-b12d691784b4,21159
idr0015/TARA_HCS1_H5_G100006175_G100006179--2013_11_01_23_35_08_chamber--U01--V01.ome.zarr,S-BIAD861/17a8eb33-2684-40af-9801-c25254537e92,21109
idr0015/TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U00--V01.ome.zarr,S-BIAD861/17b51dec-bd98-44af-8041-0e4615e755d0,21051
idr0015/TARA_HCS1_H5_G100006116_G100006253--2013_10_30_19_38_12_chamber--U01--V01.ome.zarr,S-BIAD861/31a566c3-55a9-4efc-8565-b711a1b0bf6b,21107
idr0015/TARA_HCS1_H5_G100008236_G100008238--2013_11_06_21_52_31_chamber--U01--V01.ome.zarr,S-BIAD861/39063bfc-ff55-4f8e-a062-1ae1e8da24a2,21123
idr0015/TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U00--V01.ome.zarr,S-BIAD861/48eb9947-fb37-417a-bbb7-cda59f112229,20306
idr0015/TARA_HCS1_H5_G100010623_G100010731--2014_06_20_18_48_17_chamber--U01--V01.ome.zarr,S-BIAD861/4b1f9efb-965a-481f-bafc-195dc96c1219,21168
idr0015/TARA_HCS1_H5_G100008033_G100007130--2013_11_02_21_39_19_chamber--U00--V01.ome.zarr,S-BIAD861/67e274a4-78db-4ace-bb66-d2010ec9891b,21116
idr0015/TARA_HCS1_H5_G100003406_G100004906--2013_08_24_19_23_14_chamber--U01--V01.ome.zarr,S-BIAD861/73d2e66a-a737-46b8-b174-6d60e9145b45,20903
Found prefix demo_2/2016-06/11 // 02-07-11.805 for fileset 21164
Checking for prefix_dir /data/OMERO/ManagedRepository/demo_2/2016-06/11/02-07-11.805
Creating dir at /data/OMERO/ManagedRepository/demo_2/2016-06/11/02-07-11.805_mkngff
Creating symlink /data/OMERO/ManagedRepository/demo_2/2016-06/11/02-07-11.805_mkngff/00fc2a08-e352-4720-beac-13fd06cda6b2.zarr -> /bia-integrator-data/S-BIAD861/00fc2a08-e352-4720-beac-13fd06cda6b2/00fc2a08-e352-4720-beac-13fd06cda6b2.zarr
...
Re-ran on idr0138-pilot with the same inputs, but the server must have restarted at some point because the last few were not logged in. I then needed to manually update the SQL scripts with a new SECRET, so I only did the first 3 above...
BEGIN
mkngff_fileset
----------------
5811649
(1 row)
COMMIT
e.g. 3rd plate above... Waiting on memo file...
In IDR, we have 2 duplicate Plates named
TARA_HCS1_H5_G100004727_G100004940--2013_12_08_21_26_28_chamber--U00--V01
https://idr.openmicroscopy.org/webclient/?show=plate-4801
https://idr.openmicroscopy.org/webclient/?show=plate-4751
but we have NO plate named TARA_HCS1_H5_G100008990_G100009437--2013_12_07_22_33_03_chamber--U00--V01
which exists on disk, and for which we have generated an NGFF plate: https://ome.github.io/ome-ngff-validator/?source=https://uk1s3.embassy.ebi.ac.uk/bia-integrator-data/S-BIAD861/9b150385-9a50-4454-9a4e-18c477f938ea/9b150385-9a50-4454-9a4e-18c477f938ea.zarr/
We can't use the same NGFF plate for the new Filesets of both duplicate plates, since OMERO doesn't allow duplicate path/names in the OriginalFiles table. Probably easiest to simply delete one of the Plates (as we did with idr0004).
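Duplicate names like this could be spotted up front from a saved list of plate names. A short sketch, assuming a hypothetical text file with one plate name per line (for example, saved from the plates API response used earlier):

```shell
#!/usr/bin/env bash
# Sketch: print each plate name that occurs more than once in the file $1.
find_duplicate_names() {
  # uniq -d only sees adjacent repeats, so the input must be sorted first.
  sort "$1" | uniq -d
}

# Example (file name is an assumption):
# find_duplicate_names idr_names.txt
```
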
https://uk1s3.embassy.ebi.ac.uk/bia-integrator-data/pages/S-BIAD861.html still has 1 Plate not on s3:
IM30 | idr0015/TARA_HCS1_H5_G100008060_G100008062--2013_11_03_22_51_32_chamber--U00--V01.ome.zarr.zip
Will need to update later...
Running on idr0125-pilot... Using idr0015.csv without the incomplete last line...
(venv3) (base) bash-4.2$ for r in $(cat $IDRID.csv); do
> biapath=$(echo $r | cut -d',' -f2)
> uuid=$(echo $biapath | cut -d'/' -f2)
> fsid=$(echo $r | cut -d',' -f3)
> omero mkngff sql $fsid "/bia-integrator-data/$biapath/$uuid.zarr" >> "$IDRID/$fsid.sql"
> done
Using session for demo@localhost:4064. Idle timeout: 10 min. Current group: Public
Found prefix: demo_2/2016-06/09/07-02-56.345 for fileset: 21152
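The loop above relies on word splitting of `$(cat ...)`, which works here because the rows contain no spaces. A slightly more defensive way to derive the same arguments is sketched below. The function names are hypothetical, and the sketch only prints the fileset ID and zarr path for each row; it does not call omero itself.

```shell
#!/usr/bin/env bash
# Sketch: turn CSV rows of the form  name,S-BIADxxx/uuid,fsid  into
# "<fsid> <zarr path>" pairs, matching the arguments built in the loop above.

row_to_mkngff_args() {
  local name biapath fsid uuid
  IFS=, read -r name biapath fsid <<< "$1"
  uuid=${biapath##*/}                   # part after the last slash
  echo "$fsid /bia-integrator-data/$biapath/$uuid.zarr"
}

# Apply row_to_mkngff_args to every non-empty line of the CSV file $1.
csv_to_mkngff_args() {
  local row
  # The "|| [ -n ... ]" also handles a last line without a trailing newline.
  while IFS= read -r row || [ -n "$row" ]; do
    [ -n "$row" ] || continue
    row_to_mkngff_args "$row"
  done < "$1"
}
```

Each output line could then be split with `read -r fsid path` and passed to `omero mkngff sql` as in the loop above.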
Failed due to goofys after 16 Filesets. Remounted, edited idr0015.csv to remove the first 16 lines and restarted (on idr0125-pilot)...
(venv3) (base) bash-4.2$ for r in $(cat $IDRID.csv); do
> biapath=$(echo $r | cut -d',' -f2)
> uuid=$(echo $biapath | cut -d'/' -f2)
> fsid=$(echo $r | cut -d',' -f3)
> omero mkngff sql $fsid "/bia-integrator-data/$biapath/$uuid.zarr" >> "$IDRID/$fsid.sql"
> done
Using session for demo@localhost:4064. Idle timeout: 10 min. Current group: Public
Found prefix: demo_2/2016-06/05/09-15-45.736 for fileset: 21107
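Instead of editing the CSV by hand after a failure, the rows still to be processed could be worked out from the .sql files already written. A sketch under that assumption; pending_rows is a hypothetical helper and takes the fileset ID to be the last comma-separated field, as in idr0015.csv.

```shell
#!/usr/bin/env bash
# Sketch: print the rows of CSV $1 whose fileset has no non-empty $2/<fsid>.sql yet.
pending_rows() {
  local csv=$1 outdir=$2 row fsid
  while IFS= read -r row || [ -n "$row" ]; do
    [ -n "$row" ] || continue
    fsid=${row##*,}                     # last field of the row
    [ -s "$outdir/$fsid.sql" ] || echo "$row"
  done < "$csv"
}

# Example (paths follow the loop above):
# pending_rows "$IDRID.csv" "$IDRID" > remaining.csv
```

One caveat: a run that dies part-way through a fileset may leave a partial but non-empty .sql file behind, so the file for the fileset that failed should be checked or removed before resuming.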
idr0015-colin-taraoceans