filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] International Neuroimaging Data-Sharing Initiative (INDI) #1906

Closed Parker35sun closed 1 year ago

Parker35sun commented 1 year ago

Data Owner Name

Child Mind Institute

Data Owner Country/Region

United States

Data Owner Industry

Life Science / Healthcare

Website

https://childmind.org/

Social Media

https://www.facebook.com/ChildMindInstitute
https://twitter.com/ChildMindInst

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

800TiB

On-chain address for first allocation

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Data Type of Application

None

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Child Mind Institute is dedicated to transforming the lives of children and families struggling with mental health and learning disorders by giving them the help they need. We’ve become the leading independent nonprofit in children’s mental health by providing gold-standard evidence-based care, delivering educational resources to millions of families each year, training educators in underserved communities, and developing tomorrow’s breakthrough treatments.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Neuroimaging data for the International Neuroimaging Data-Sharing Initiative (INDI)
This bucket contains multiple neuroimaging datasets that are part of the International Neuroimaging Data-Sharing Initiative. Raw human and non-human primate neuroimaging data include 1) Structural MRI; 2) Functional MRI; 3) Diffusion Tensor Imaging; 4) Electroencephalogram (EEG) In addition to the raw data, preprocessed data is also included for some datasets.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

lotus, singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://registry.opendata.aws/fcp-indi/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How do you plan to choose storage providers

Slack, Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

Can you introduce your organizaion? Please share more detailed information about sps you will cooperate with.

Parker35sun commented 1 year ago

@Sunnyiscoming Ok. I'm a DP, who are responsible for the downloading, processing, and distribution of CAR files to SP. I am confirming the list of sp and we will soon reach a cooperation. Hope my application will be approved.

cryptowhizzard commented 1 year ago

Dear @Parker35sun

Thank you for applying for datacap. As Filecoin FIL+ notary i am screening your application and conducting due diligence.

Looking at your application i have some questions: As you are brand new on Github and have no history of past applications. Can you give me an introduction who you are and an introduction about your company?

You stated that you are a Data preparer. What hardware do you have over there to prepare this dataset? What is your internet connection speed? Do you already have the data downloaded on the premises?

It seems to me that applying for 5PB of datacap is a lot. One needs comprehensive knowledge of Filecoin, packing of data, distribution of data and all it's requirements coming with it. Are you brand new in the Filecoin space or have you applied for datacap in the past on different Github account names?

Can you show us some visible proof of the size of your data and the storage / packing hardware you have there?

As last question i would like you to fill out this form to provide us with the necessary information to make a educated decision on your LDN request if we would like to support it.

Thanks!

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

800TiB

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

DataCap allocation requested

256TiB

Id

aef3d914-fcb7-4bd7-85d5-55e55112f2c4

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

DataCap allocation requested

256TiB

Id

83c8c324-c12a-407f-a23e-f43e40f7918c

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No application info found for this issue on https://filplus.d.interplanetary.one/clients.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

Casey-PG commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedxy6jgjitx7ao6a4gnstaadpbn5uxmorhiv2a4cu3w5xgpp2k34s

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

256.00TiB

Signer Address

f1d4yb3wags3mtddzesxoo63jv7dmlec3bq4yteni

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedxy6jgjitx7ao6a4gnstaadpbn5uxmorhiv2a4cu3w5xgpp2k34s

Casey-PG commented 1 year ago

Since this is currently the first round of applications, there are no bot check results available for review. We believe this application meets the Fil+ criteria based on the applicant's description and the distribution plan.

Bennyyangpu commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedx7iwlqu3m3udy5kexjleu63bpblakl732ftfxacchkxzqtlishg

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

256.00TiB

Signer Address

f174fg3bqbln3zjnkxtyf6s54txqkr7yqkj6cig7y

Id

83c8c324-c12a-407f-a23e-f43e40f7918c

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedx7iwlqu3m3udy5kexjleu63bpblakl732ftfxacchkxzqtlishg

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

DataCap allocation requested

512TiB

Id

35834b1e-6fa3-49b1-975f-9cb17a325668

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Rule to calculate the allocation request amount

10% of total dc amount requested

DataCap allocation requested

512TiB

Total DataCap granted for client so far

256TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.75PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
null null 256TiB null 71.75TiB
cryptowhizzard commented 1 year ago

Hello,

I have tried retrieval but none of your SP's seem to support it. This is against FIL+ rules & guidelines. Secondly, everything is stored in one region and not distributed outside Asia. This is against FIL+ rules & guidelines.

Please advise.

Scherm­afbeelding 2023-05-05 om 19 56 10
TakiChain commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 73.42% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

TakiChain commented 1 year ago

Hope to improve this indicator of Deal Data Replication in the next few rounds. willing to help onboard more valuable dataset.

TakiChain commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacebrjeya45i5tvbt5kpr6odpfnmzypenazzkciykqu37nsvu2tnsda

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

512.00TiB

Signer Address

f15impf3j2zcaex4lhyxndxswuuhv24vzstuqtxsi

Id

35834b1e-6fa3-49b1-975f-9cb17a325668

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebrjeya45i5tvbt5kpr6odpfnmzypenazzkciykqu37nsvu2tnsda

BobbyChoii commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecyfbgtwdhzyxdt5vjgwq7bemje54a7prjxmpkk34uzehitodjfl6

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

512.00TiB

Signer Address

f1irqs2gmctiv3jcdfwuch7oxvf4ixh3k4b2wc24i

Id

35834b1e-6fa3-49b1-975f-9cb17a325668

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecyfbgtwdhzyxdt5vjgwq7bemje54a7prjxmpkk34uzehitodjfl6

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

DataCap allocation requested

1PiB

Id

e39fa630-d845-44a7-92c5-be99a3178830

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Rule to calculate the allocation request amount

20% of total dc amount requested

DataCap allocation requested

1PiB

Total DataCap granted for client so far

465661.3YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

-5.62B

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
null null 512TiB null 133.40TiB
Bennyyangpu commented 1 year ago

The client contacted me by slack and shared more of their information . I'd like to see more public data joining the network!

Bennyyangpu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaced6yg3dv4pw4inxerqxfw36zmpogamdbi2fvmwj2yfzgd7u7n3n7i

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

1.00PiB

Signer Address

f174fg3bqbln3zjnkxtyf6s54txqkr7yqkj6cig7y

Id

e39fa630-d845-44a7-92c5-be99a3178830

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaced6yg3dv4pw4inxerqxfw36zmpogamdbi2fvmwj2yfzgd7u7n3n7i

Casey-PG commented 1 year ago

Glad to support this application in this round.

Casey-PG commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceabzpl57xvdqem3yk2vjqpwbgfvtwdp5aujqt6y3l6dxn6echzzqm

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

1.00PiB

Signer Address

f1d4yb3wags3mtddzesxoo63jv7dmlec3bq4yteni

Id

e39fa630-d845-44a7-92c5-be99a3178830

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceabzpl57xvdqem3yk2vjqpwbgfvtwdp5aujqt6y3l6dxn6echzzqm

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

MEIYAN666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

DataCap allocation requested

2PiB

Id

f358de05-d8d7-4702-b1e4-e4c008cabe67

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Rule to calculate the allocation request amount

40% of total dc amount requested

DataCap allocation requested

2PiB

Total DataCap granted for client so far

931322574615478927360.0YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

-1.12B

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
26592 16 1PiB 19.12 235TiB
TakiChain commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceafpspygubgmbhd2tytgufzpxs7lycecnyqqaz2q4hqm4omug43kc

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

2.00PiB

Signer Address

f15impf3j2zcaex4lhyxndxswuuhv24vzstuqtxsi

Id

f358de05-d8d7-4702-b1e4-e4c008cabe67

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceafpspygubgmbhd2tytgufzpxs7lycecnyqqaz2q4hqm4omug43kc

BobbyChoii commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebzoqtymxitinvemm7bggtxr56m7t4lur4yfn2zwzohxc4yhih3mq

Address

f1zqpqw7kavxxu3zdaomwveibcsvj4ida7iwmskia

Datacap Allocated

2.00PiB

Signer Address

f1irqs2gmctiv3jcdfwuch7oxvf4ixh3k4b2wc24i

Id

f358de05-d8d7-4702-b1e4-e4c008cabe67

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebzoqtymxitinvemm7bggtxr56m7t4lur4yfn2zwzohxc4yhih3mq

BobbyChoii commented 1 year ago

The client followed the allocation he disclosed before and the datacap usageis in compliance with regulations.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

cryptowhizzard commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

cryptowhizzard commented 1 year ago

@TakiChain @BobbyChoii @Aifabot-Cloud @Parker35sun

This data is unretrievable. This data is fully centralised in China. There has been no due diligence done.

Parker35sun commented 1 year ago

@cryptowhizzard Thank you for your reminder.

1 We've asked and confirmed SPs who are cooperating with us to open http retrieval. However, due to the retrieval bot, it cannot be reflected in the report in real time. We learned that Caro (PL) has noticed the problem and tried to resolve it. 3

2 It's true that most of the SPs we worked with are in Greater China, but that's allowed in the community rules. China is, after all, a geographically vast country.

3 I saw that most of notaries have left due diligence in my application. Very few notaries may have forgotten to leave the message because of the second signing. It was very hard for me to get their support, please don't blame them too much.

Lastly, thank you for your supervision and reminder, we will continue to look for SPs who are willing to follow the rules and cooperate with them.

herrehesse commented 1 year ago
  1. Retrieval bot works, the miners are unreachable.
  2. No it is not.
  3. No these notaries are the same ones found on almost all disputes.
cryptowhizzard commented 1 year ago

1, I tested HTTP, still no avail. 2, No. your LDN indicated: Greater China, Asia other than Greater China, North America. This is not true. 3, rules are in place to be followed upon.

cryptowhizzard commented 1 year ago

Hello @Parker35sun

It really seems the SP's used are involved in CID sharing and have retrieval disabled. The issue LDN's are in the column auditTrail if you want to check yourself.

Let me know if you need any more information.

Scherm­afbeelding 2023-07-31 om 13 17 49

Parker35sun commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 35.31% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

Parker35sun commented 1 year ago

@Carohere I saw that you are helping to correct the gaslighting behavior of some notaries, could you give me a hand?

Carohere commented 1 year ago

@Parker35sun Thanks for your trust. Based on the report, it's true that all of their comments were not supported by written rules and corroborating evidence. But your application is closed by bot and I am not able to help you.

Parker35sun commented 1 year ago

thanks @simonkim0515 @Carohere, I would place the status update in the LDN application itself.