filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] Kernelogic - Human PanGenomics Project (4/4) #1546

Closed kernelogic closed 5 months ago

kernelogic commented 1 year ago

Data Owner Name

Human Pangenome Reference Consortium

Data Owner Country/Region

United States

Data Owner Industry

Life Science / Healthcare

Website

https://humanpangenome.org/

Social Media

https://twitter.com/HumanPangenome

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Custom multisig

Identifier

No response

Share a brief history of your project and organization

I have participated every Slingshot phase and is probably the best performing as a "small individual client". 

Even though Slingshot v2 has ended, there are still strong demand from SPs to onboard useful data. This application is to onboard open dataset from AWS.

I will provide a web UI (https://singularity-browser.kernelogic.ca/) to index all files onboarded and provide ways to retrieve.

I have successfully completed a few LDNs on other datasets and I have record to show I have been following the rules of decentralization and have zero self dealing.

Some of the recent LDNs I completed:
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1108
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1107
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1106
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1104
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/983

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

Storage working groups, BigD exchange, singularity deal making tool.

Describe the data being stored onto Filecoin

https://github.com/human-pangenomics/hpgp-data

This dataset includes sequencing data, assemblies, and analyses for the offspring of ten parent-offspring trios.

Total size: about 1.2PB from bucket arn:aws:s3:::human-pangenomics

I will apply a total of 20PB DC to store 12 copies (considering car padding)

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://registry.opendata.aws/hpgp-data/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe, Australia (continent)

How will you be distributing your data to storage providers

HTTP or FTP server

How do you plan to choose storage providers

Slack, Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

PIKNIK f01904630,f01873432
GreaterHeat f01971600,f01992630
HarryM-Filet f02301,f03223,f0240185
BEWELL TECHNOLOGIES LIMITED f01944744,f01943663,f01928097
And many others from BigDExchange

How do you plan to make deals to your storage providers

Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

1PiB

Client address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f01858410

Client address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

DataCap allocation requested

256TiB

Id

39fb22b1-de4a-48f5-8fa0-a252a98af54d

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

xinaxu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceczmzcw5q4qztt7lsmreipi3mlqxvvym4aiqtiiqwjq323iehwtfo

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

256.00TiB

Signer Address

f1k3ysofkrrmqcot6fkx4wnezpczlltpirmrpsgui

Id

39fb22b1-de4a-48f5-8fa0-a252a98af54d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceczmzcw5q4qztt7lsmreipi3mlqxvvym4aiqtiiqwjq323iehwtfo

flyworker commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebk4firrdjzbeqoiynbupttfzopat4cq2v6ycflm5fz3s4v6zrdj4

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

256.00TiB

Signer Address

f1hlubjsdkv4wmsdadihloxgwrz3j3ernf6i3cbpy

Id

39fb22b1-de4a-48f5-8fa0-a252a98af54d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebk4firrdjzbeqoiynbupttfzopat4cq2v6ycflm5fz3s4v6zrdj4

filplusapp commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f01858410

Client address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

DataCap allocation requested

512TiB

Id

b02774d1-2321-4f84-9ed2-196d4ef88a42

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

Tom-OriginStorage commented 1 year ago

Viewed by the robot, the encapsulation complies with the rules, the nodes are distributed in multiple areas, and there is no CID sharing,I tried to retrieve it is also normal

image

Tom-OriginStorage commented 1 year ago

i would like to support it

Tom-OriginStorage commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedlq3eso5wu6t3zis3gdsviwixala77o4ntsxja575aoj4h3pxvpq

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

512.00TiB

Signer Address

f1q6bpjlqia6iemqbrdaxr2uehrhpvoju3qh4lpga

Id

b02774d1-2321-4f84-9ed2-196d4ef88a42

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedlq3eso5wu6t3zis3gdsviwixala77o4ntsxja575aoj4h3pxvpq

xiaoyuaiheshui commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

xiaoyuaiheshui commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceb6jaaiamijgafotwtbia3fxazjox46vdlvqflmv4nui4t3ybxsv6

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

512.00TiB

Signer Address

f122qmy25wdtt5mxd77kndiq7z5x2n3iwiuz2wdsa

Id

b02774d1-2321-4f84-9ed2-196d4ef88a42

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb6jaaiamijgafotwtbia3fxazjox46vdlvqflmv4nui4t3ybxsv6

herrehesse commented 1 year ago

Dear Filecoin+ Github applicant,

We have noticed that the dataset is already (partly) on chain. While we appreciate your enthusiasm to contribute to the Filecoin network, we want to remind you that this behaviour may not be beneficial to the network. Can you explain to me what happend here?

Thank you for your understanding and cooperation.

Screenshot 2023-02-22 at 11 27 40
kernelogic commented 1 year ago

Hi, my explanation is the following:

  1. There is no rule about only one dataset can only be chosen by one person.
  2. This dataset is not "overly crowded"
  3. The only earlier applicant GhostByteInc has not made significant progress on it yet.

Thanks

NiwanDao commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacect2a5k57qryi4golqaor44ggrryldh74zwywp4nu3qzir6vtxjem

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

512.00TiB

Signer Address

f1a2lia2cwwekeubwo4nppt4v4vebxs2frozarz3q

Id

b02774d1-2321-4f84-9ed2-196d4ef88a42

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacect2a5k57qryi4golqaor44ggrryldh74zwywp4nu3qzir6vtxjem

NDLABS-Leo commented 1 year ago

Based on ND's review standards and the excellent performance of check bot, we are willing to support it. And the retrieval test of the node was carried out, and the performance was good.

NDLABS-Leo commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedxnythr3gtr7izhfins73mpjd7ju4sipfdhushnkc4rf2gydathu

Address

f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq

Datacap Allocated

512.00TiB

Signer Address

f1yayfsv6whu3rheviucvventj3y6t542xfpb47ei

Id

b02774d1-2321-4f84-9ed2-196d4ef88a42

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedxnythr3gtr7izhfins73mpjd7ju4sipfdhushnkc4rf2gydathu

kernelogic commented 1 year ago

keepalive

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

kernelogic commented 11 months ago

Need to keep this open. Still onboarding slowly.

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

kernelogic commented 11 months ago

Need to keep this open. Still onboarding slowly.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 10 months ago

Actively preparing more cars now.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 10 months ago

Still actively preparing more cars now.

github-actions[bot] commented 9 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 9 months ago

Still actively preparing more cars now.

github-actions[bot] commented 9 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 9 months ago

Still actively preparing more cars now.

github-actions[bot] commented 9 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 9 months ago

Still actively preparing more cars now.

github-actions[bot] commented 8 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 8 months ago

Still actively preparing more cars now.

github-actions[bot] commented 8 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 8 months ago

Still actively preparing more cars now.

github-actions[bot] commented 8 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 8 months ago

keepalive

github-actions[bot] commented 7 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 7 months ago

keepalive

github-actions[bot] commented 7 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

kernelogic commented 7 months ago

keepalive, holiday season and mainnet upgrade, things going slow

large-datacap-requests[bot] commented 7 months ago

Client address f1v4jt6gssxwf6oafe4muelr3sv6szzoqezwumhtq is present in other Fil+ applications (#1545, #1544, #1543). This may cause unexpected behavior.

Sunnyiscoming commented 7 months ago

If you already have a list of storage providers to work with, fill out their names and provider IDs below PIKNIK f01904630,f01873432 GreaterHeat f01971600,f01992630 HarryM-Filet f02301,f03223,f0240185 BEWELL TECHNOLOGIES LIMITED f01944744,f01943663,f01928097 And many others from BigDExchange

Please provide ID, City, Country, Organization of each SP here.