arschat opened 7 months ago
Summary in spreadsheet:

| | Number of Projects | Size in TiB |
|---|---|---|
| all dirs & files in bucket | 449 | 197.65 |
| non-bionetwork list | 299 | 122.68 |
| non-bionetwork list & non-hca publication | 260 | 95.64 |
| backup projects | 12 | 8.87 |
| not in DCP (-submitted for next release) | 27 | 13.48 |
| not in ingest | 14 | 8.87 |
| has open submission | 40 | 13.46 |
We don't have a specific target for storage area reduction, but we'll do a first pass targeting a 30% reduction (about 59 TiB of the 197.65 TiB total). This way we can free up some space while minimising the time spent triaging the areas to be removed.
After we're done with this first pass I'll check in with Mary and Travis.
Candidates for the first pass
All areas need to be checked except for hca-publications.
Triage of areas:
Remove the non-bionetwork list & organ-of-known-bionetworks projects.
Below is the list of UUIDs that satisfy the following criteria (a sketch of the filter is included after the counts):
- projectTitle != FALSE
- hasOpenSubmission == FALSE
- notAtlas == TRUE
- nextRelease == FALSE
101 projects -> 50.35 TiB
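For illustration, here's a minimal pandas sketch of that filter, assuming the tracking spreadsheet is exported as CSV; the file name and the `uuid`/`sizeTiB` columns are assumptions, and only the four flag columns come from the criteria above.

```python
import pandas as pd

# Assumed CSV export of the tracking spreadsheet (hypothetical file name).
df = pd.read_csv("staging_area_projects.csv")

candidates = df[
    (df["projectTitle"] != False)         # has a real project title
    & (df["hasOpenSubmission"] == False)  # no open submission
    & (df["notAtlas"] == True)            # not part of an atlas
    & (df["nextRelease"] == False)        # not submitted for the next release
]

# Expect 101 projects / 50.35 TiB per the counts above.
print(len(candidates), "projects,", round(candidates["sizeTiB"].sum(), 2), "TiB")
print("\n".join(candidates["uuid"]))
```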
And here is the list of the projects that are backups or integration tests and are safe to remove:
- safe for deletion == yes
- contents == Integration Test

10 projects -> 6.58 TiB
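The same filter sketch above applies to this group, just swapping the conditions for `safe for deletion == yes` and `contents == Integration Test` (column names as they appear in the spreadsheet).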
There is also one project that has been permanently deleted from the DCP, which is probably safe to remove from here too:
dd7ada84-3f14-4765-b7ce-9b64642bb3dc
1 project -> 1.14 TiB
The sum of those three options is 112 projects with a total size of 58.07 TiB, which is 24.94% of all projects and 29.38% of the total size.
In DCP Demo today, there was interest in the dev staging area size and whether we can reduce it.

| Metric | Value |
|---|---|
| Sum (TiB) | 4.64 |
| Number of Projects | 810 |
| >1 TiB | 1 |
| >1 GiB | 19 |
| >1 MiB | 104 |
| >1 KiB | 685 |
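As a rough sketch (not the script actually used), the threshold rows can be derived from a per-project size mapping like this; `sizes_bytes` is a hypothetical input, e.g. parsed from `gsutil du -s` output, and the counts are cumulative, as the table suggests.

```python
# Hypothetical input: {project_uuid: size_in_bytes}, e.g. parsed
# from `gsutil du -s` output for each project directory.
TIB, GIB, MIB, KIB = 2**40, 2**30, 2**20, 2**10

def bucket_counts(sizes_bytes: dict[str, int]) -> dict[str, int]:
    # Cumulative counts: a 2 TiB project is included in every row.
    thresholds = [(">1 TiB", TIB), (">1 GiB", GIB),
                  (">1 MiB", MIB), (">1 KiB", KIB)]
    return {label: sum(1 for size in sizes_bytes.values() if size > limit)
            for label, limit in thresholds}
```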
ida to check if we've reduced the volume of data by enough
I've confirmed that the storage we've freed up is enough for now
Import team requested more free space. Re-opening to investigate options.
Did some more digging. From the list they provided, I created Sheet6 in the previous spreadsheet.
I wanted to investigate the number of projects for which we hold all the data on our AWS servers alongside the GCP staging area. Scripts used: aws_staging.txt gsutil_staging.txt
| AWS vs GCP file count | Number of projects |
|---|---|
| equal (==) | 206 |
| not equal (!=) | 136 |
| no info | 30 |
| no files in AWS | 126 |
Since in the GCP area we upload the spreadsheet as a supplementary file, I extracted the number of filenames matching the `*metadata*xlsx` pattern and subtracted it from the total number of files in GCP before comparing (a rough sketch of the comparison follows).
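The real logic is in aws_staging.txt and gsutil_staging.txt (not reproduced here); a rough Python equivalent of the comparison might look like the sketch below. The AWS bucket name and the per-project prefix layout are assumptions; the GCP bucket is the prod staging bucket mentioned below.

```python
import fnmatch
import boto3
from google.cloud import storage

def aws_file_count(bucket: str, prefix: str) -> int:
    # Count objects under the project's prefix in S3.
    s3 = boto3.client("s3")
    pages = s3.get_paginator("list_objects_v2").paginate(Bucket=bucket, Prefix=prefix)
    return sum(len(page.get("Contents", [])) for page in pages)

def gcp_file_count(bucket: str, prefix: str) -> int:
    # Count objects in the GCP staging area, excluding the supplementary
    # metadata spreadsheets we upload there.
    blobs = storage.Client().list_blobs(bucket, prefix=prefix)
    return sum(1 for b in blobs if not fnmatch.fnmatch(b.name, "*metadata*xlsx"))

def compare(project_uuid: str) -> str:
    # The AWS bucket name here is hypothetical.
    aws = aws_file_count("example-aws-staging-bucket", f"{project_uuid}/")
    gcp = gcp_file_count("broad-dsp-monster-hca-prod-ebi-storage", f"prod/{project_uuid}/")
    return "==" if aws == gcp else "!="
```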
Projects in the first group are potentially safer to delete from staging, since we hold all the data needed to re-export everything if an update is needed.
There was a request to reduce the size of the prod Google bucket staging area (i.e. `gs://broad-dsp-monster-hca-prod-ebi-storage/prod/`). If we remove a project from the bucket, we won't be able to do a partial update on that project.
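For context, a minimal sketch of how per-project usage in that bucket could be tallied (assuming one top-level directory per project UUID under `prod/`; this mirrors what `gsutil du -s` reports):

```python
from collections import defaultdict
from google.cloud import storage

client = storage.Client()
sizes: dict[str, int] = defaultdict(int)
for blob in client.list_blobs("broad-dsp-monster-hca-prod-ebi-storage", prefix="prod/"):
    parts = blob.name.split("/")  # prod/<project-uuid>/<file...>
    if len(parts) > 2:
        sizes[parts[1]] += blob.size or 0

# Largest projects first, in TiB.
for uuid, size in sorted(sizes.items(), key=lambda kv: -kv[1]):
    print(f"{size / 2**40:8.2f} TiB  {uuid}")
```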
Action points: