filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] Foldingathome COVID-19 Dataset #1024

Closed Megan008 closed 1 year ago

Megan008 commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

I have participated in some projects and hackathon. I have experience on it.

What is the primary source of funding for this project?

Personal income.

What other projects/ecosystem stakeholders is this project associated with?

No.

Use-case details

Describe the data being stored onto Filecoin

[Folding@home](http://foldingathome.org/) is a massively distributed computing project that uses biomolecular simulations to investigate the [molecular origins of disease](https://foldingathome.org/diseases/) and accelerate the discovery of new therapies.

Where was the data in this dataset sourced from?

Simulations of SARS-CoV-2 and associated host proteins, with emphasis on discovering druggable cryptic pockets, documented at the [MolSSI COVID Hub](https://covid.molssi.org//simulations/#foldinghome-simulations-of-the-sars-cov-2-spike-protein-spike-spike-binding).

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this. 

https://registry.opendata.aws/foldingathome-covid19/

         Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, it's a public dataset.

What is the expected retrieval frequency for this data?

Multiple times.

For how long do you plan to keep this dataset stored on Filecoin?

2 years.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

North america; Korea; China.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

75% data will be distributed by offline data transfer. Other data will use online transfer for distributing with storage providers who close to me.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

I would let 1 sp who used to cooperate with me for this deal. Now I'm chatting with other sps. f023495, f0508988

How will you be distributing deals across storage providers?

I have communicated with 4 sp. In first time, I will divide 1/4 data to each sp. If I find out more sp, I will decrease the percentage of deals to them --- for decentralized storage. 

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes.
Bennyyangpu commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

Bennyyangpu commented 1 year ago

According to the report, the situation in all aspects is relatively good.

Bennyyangpu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedv6vuvy6esbvl2qny5vwwpyzrdfuu6bfisxjb6ixhgbp46rio46c

Address

f1au3nipqjprr5xp2mwsarr7obvpx2dwy4is6qn4y

Datacap Allocated

400.00TiB

Signer Address

f174fg3bqbln3zjnkxtyf6s54txqkr7yqkj6cig7y

Id

ce15296d-34d7-4f78-a693-db943fcaeec6

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedv6vuvy6esbvl2qny5vwwpyzrdfuu6bfisxjb6ixhgbp46rio46c

BobbyChoii commented 1 year ago

Didn't find any CID sharing as some notary mentioned from the application. Retrieval is ok according to the Retrieval Dashboard. . Willing to give my sign and help to onboard this application.

BobbyChoii commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacead5ev4ndiasciqqckdiwlkzz5lnkv3562sonc2rzaahiy5w4axoq

Address

f1au3nipqjprr5xp2mwsarr7obvpx2dwy4is6qn4y

Datacap Allocated

400.00TiB

Signer Address

f1irqs2gmctiv3jcdfwuch7oxvf4ixh3k4b2wc24i

Id

ce15296d-34d7-4f78-a693-db943fcaeec6

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacead5ev4ndiasciqqckdiwlkzz5lnkv3562sonc2rzaahiy5w4axoq

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 7

Multisig Notary address

f02049625

Client address

f1au3nipqjprr5xp2mwsarr7obvpx2dwy4is6qn4y

DataCap allocation requested

400TiB

Id

d9dee3ed-014a-41e7-900c-821ac9129c52

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1au3nipqjprr5xp2mwsarr7obvpx2dwy4is6qn4y

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

400TiB

Total DataCap granted for client so far

3.6379788070917166e+64YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

3.6379788070917166e+64YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
33372 16 400TiB 15.41 106.96TiB
cryptowhizzard commented 1 year ago

This applicant has a long history of fraud and refuses to start working on the right path. Applications still not provide retrieval / data stored is not the data stored the applicant is saying.

Scherm­afbeelding 2023-08-19 om 17 09 00
Megan008 commented 1 year ago

image @cryptowhizzard Is it mean not provide retrieval? Why did you lie through your teeth? Can you please do any check before you do anything? @raghavrmadya @dkkapur Is it allow to dispute anyone just on his own words?

cryptowhizzard commented 1 year ago

image @cryptowhizzard Is it mean not provide retrieval? Why did you lie through your teeth? Can you please do any check before you do anything? @raghavrmadya @dkkapur Is it allow to dispute anyone just on his own words?

It is public knowledge that the HTTP retrieval bot is gamed.

http://www.datasetcreators.com/downloadedcarfiles/logs/1024.log

Here you can find the log. Since you have range retrieval disabled ( Something natively enabled in boost ) it is clear you attempt to avoid that someone is retrieving the whole carfile to unpack it and do due diligence.

This is what your retrieval looks like. It is all junk and scam.

Scherm­afbeelding 2023-08-22 om 12 02 50

Megan008 commented 1 year ago

I will let SPs for check.

It is public knowledge that the HTTP retrieval bot is gamed.

This tool is from PL team, do you mean that it is useless? I think I can only trust people from official team.

Carohere commented 1 year ago

@cryptowhizzard Could you share the link to the file you downloaded? @Megan008 Retrieval bots can be largely trusted, but sometimes they are not accurate.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

raghavrmadya commented 1 year ago

Hi, I'm following up on the dispute here - https://www.notion.so/filecoin/No-retrieval-supported-bfcfebbcbdbd475fab52cccaf83d4674?pvs=4

Client is requested to provide an update on retrievals if they are not satisfied with the evidence provided by @cryptowhizzard.

Until then, application will remain under dispute and notaries are encourage to not sign without providing evidence of retrieval compliance

Megan008 commented 1 year ago

@raghavrmadya First, I've proved that we support retrieval and the retrieval report can also show it.

image @cryptowhizzard Is it mean not provide retrieval? Why did you lie through your teeth? Can you please do any check before you do anything? @raghavrmadya @dkkapur Is it allow to dispute anyone just on his own words?

Then, this is the retrieval download which is given by SPs. 6e356c07-4dbf-40dd-ae57-

All thing means that we support retrieval.

cryptowhizzard commented 1 year ago

Dear Megan008,

As notary I am doing due diligence on your LDN. I could not get retrieval to work. Can you please upload the car file of CID baga6ea4seaqofl35yu6stkuaeo4nbpe543355wtaglyv74pfwtyx5uqhpag34ii ?

You can use our upload system at http://send.datasetcreators.com. Please select 7 days for the system to keep the file and post the link you received here so I (and other notaries) can download your content.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

-- Commented by Stale Bot.

Megan008 commented 1 year ago

Dear Megan008,

As notary I am doing due diligence on your LDN. I could not get retrieval to work. Can you please upload the car file of CID baga6ea4seaqofl35yu6stkuaeo4nbpe543355wtaglyv74pfwtyx5uqhpag34ii ?

You can use our upload system at http://send.datasetcreators.com. Please select 7 days for the system to keep the file and post the link you received here so I (and other notaries) can download your content.

I can not open the link to do upload. It's better that you check my answer as below. It can give you what you want.

large-datacap-requests[bot] commented 10 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided. We could not find Website \/ Social Media field in the information provided We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided We could not find On-chain address for first allocation field in the information provided We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.
large-datacap-requests[bot] commented 8 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release