filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application]Radio telescope——National Radio Astronomy Observatory #2045

Closed nicelove666 closed 11 months ago

nicelove666 commented 1 year ago

Data Owner Name

National Radio Astronomy Observatory

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

United States

Data Owner Industry

Life Science / Healthcare

Website

https://data.nrao.edu/portal/#/

Social Media

https://data.nrao.edu/portal/#/

Total amount of DataCap being requested

27PiB

Expected size of single dataset (one copy)

1P

Number of replicas to store

10

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Founded in 1956, NRAO provides the most advanced radio telescope facilities and information to the international scientific community. Currently, https://data.nrao.edu/portal/#/ has stored 4.3PB of data.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Founded in 1956, NRAO provides the most advanced radio telescope facilities and information to the international scientific community. Currently, https://data.nrao.edu/portal/#/ has stored 4.3PB of data.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus, singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

We counted the data on this website, a total of 500,800 pieces of information, with a total capacity of 4.2P。

https://docs.google.com/spreadsheets/d/1F26TunJBid_6SqMYOQscSpm3793y4xYu/edit?usp=share_link&ouid=109823390606932719085&rtpof=true&sd=true

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives, Lotus built-in data transfer

How do you plan to choose storage providers

Slack, Filmine, Big Data Exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

f02182867、
f0427989、
f02182798、
f02204960、
f02182743、
f02182802、
f02182902、
f02105219、
f0427989、
f02145020、
f021255、
f02125861、
f02181415、
f02145020、
f021255

How do you plan to make deals to your storage providers

Boost client, Lotus client, Droplet client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

Expected size of single dataset (one copy)Number of replicas to store≠Total amount of DataCap being requested 1P10≠15P Can you explain about that?

Whether the data of this application overlaps with the data of the previous application?

Can you introduce your organizaion?

Sunnyiscoming commented 1 year ago

https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1947 https://github.com/data-preservation-programs/filplus-checker-assets/blob/main/filecoin-project/filecoin-plus-large-datasets/issues/1947/1686267632151.md Can you explain the reason for low data retrieval success rate?

nicelove666 commented 1 year ago

Dear officials, Hello, we can retrieve before the last version upgrade, because our retrieval is through the storage of CAR files, and then due to the new version upgrade, our developers did not follow up, I have informed them to work overtime this weekend to solve the retrieval problem, in addition, the retrieval rate of 1948 will be 5 times higher than 1947, please have a look. Finally, there are three SP that do not support retrieval, for which we have cancelled our cooperation with SP. Thank you for your attention and questions, we will certainly improve in the next round!

As there are many SP partners, the situation of each SP is different, but we can promise that if customers will not search, we will help them and guide them; if customers are not willing to search, we will not cooperate with customers. Hope to get everyone's support, so that we can see our improvement!

Sunnyiscoming commented 1 year ago

Expected size of single dataset (one copy)Number of replicas to store≠Total amount of DataCap being requested 1P10≠15P Can you explain about that?

Whether the data of this application overlaps with the data of the previous application?

Can you introduce your organizaion?

nicelove666 commented 1 year ago

Hi, dear~ https://data.nrao.edu/portal/#/The data here is 4.2P, we save 10 copies, and can apply for a DC of 4.2X10=42P in total. We applied for 10P in 1947 and 1948, and 15P in 2045, we will applied for 12P after 1947, 1948, and 2045 were basically used up. I want to emphasize that, compared to most people who tend to store the same data in countless copies, such as ocean data, covid-19 virus has been applied for many times by many different people and you all passed it quickly, Radio telescope data Should not be stored by anyone else Regarding my organization, I already answered your question in 1947: I am a Filecoin investor and enthusiast. I have worked in traditional cloud storage companies for 8 years. In 2016, he quit his high-paying job to join the blockchain. In 2018, we paid attention to IPFS and Filecoin. Later, some of our friends created IPFS media, some created a huge Filecoin mining company, and some made dapps. Although we are very low-key, we have been quietly contributing to Filecoin. We've brought together some tech geeks. It's a non-profit, we don't have a company because there is no money involved. Let me briefly introduce our members. For example, WeChat search video account: Atomic View, search WeChat official account: IPFS Club. We recommended a very well-known client to the Filecoin network, which is the Chinese jewelry giant (1299). After many communications, they are willing to store data on the Filecoin network, and V guest is also a Filecoin pioneer in China (1903), and 670、1845-1848, these are our contributions! I think you will also agree that, compared to the beautiful official website, PPT and other external things, the truly valuable contribution is how many SPs have been helped to join Filecoin, how many enterprise-level customers have recognized Filecoin, and joined FIL+.

Sunnyiscoming commented 1 year ago

Now, the maxmium of datacap requested is not 15 PB. If you want to apply for this dataset later, why did you apply for 15 PB this time, not total amount of all remaining data?

nicelove666 commented 1 year ago

Thank you for your kind reminder, we have revised the application, thank you again for your dedication and patience,I hope to see you help us, on behalf of all the SPs and customers we cooperate with, I would like to express my gratitude to you~

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

27PiB

Expected weekly DataCap usage rate

1PiB

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

DataCap allocation requested

512TiB

Id

f3a68908-acf7-41e8-bdb9-10d9144c9016

Chuangshi1 commented 1 year ago

i will suport the first round.

Chuangshi1 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceclmi6xpfedqvwhrrst7kbvtjsvgbrhlee2s2whu6dypi3ks6oess

Address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Datacap Allocated

512.00TiB

Signer Address

f1mdk7s2vntzm6hu35yuo6vjubtrpfnb2awhgvrri

Id

f3a68908-acf7-41e8-bdb9-10d9144c9016

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceclmi6xpfedqvwhrrst7kbvtjsvgbrhlee2s2whu6dypi3ks6oess

zcfil commented 1 year ago

After browsing the historical LDN and historical responses, I am willing to support the first round and follow up on the data situation in the future.

zcfil commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecmaisjcmxblmia4pm2eb5hyxaomhuwlgumu32oofiepnpiw3xug6

Address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Datacap Allocated

512.00TiB

Signer Address

f1cjzbiy5xd4ehera4wmbz63pd5ku4oo7g52cldga

Id

f3a68908-acf7-41e8-bdb9-10d9144c9016

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecmaisjcmxblmia4pm2eb5hyxaomhuwlgumu32oofiepnpiw3xug6

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

nicelove666 commented 1 year ago

It takes time to prepare data and mail the hard drive, we will start it within 10 days, thank you

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

DataCap allocation requested

512TiB

Id

15bb9727-621b-4abd-8bf9-d31d2237328f

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Rule to calculate the allocation request amount

100% weekly > 0.5PiB, requesting 0.5PiB

DataCap allocation requested

512TiB

Total DataCap granted for client so far

512TiB

Datacap to be granted to reach the total amount requested by the client (27PiB)

26.5PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
0 0 512TiB NaN 129.12TiB
mnxspl commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

stcloudlisa commented 1 year ago

After checking the application history, I am willing to support it for the time being and will keep paying attention

igoovo commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

stcloudlisa commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedqeukgp7vxhzy7at7qoc2kgbsm5p4ximjh5p3dkbudh65ky7dni4

Address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Datacap Allocated

512.00TiB

Signer Address

f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci

Id

15bb9727-621b-4abd-8bf9-d31d2237328f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedqeukgp7vxhzy7at7qoc2kgbsm5p4ximjh5p3dkbudh65ky7dni4

igoovo commented 1 year ago

This process looks pretty healthy.

igoovo commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceczoyaho4ym252cygcc3vj2nguct67g3zcgfyxorjatn6yjxkk4dy

Address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Datacap Allocated

512.00TiB

Signer Address

f1shnsfayxqll77svffaxnjenms7bbbysbqcatrpy

Id

15bb9727-621b-4abd-8bf9-d31d2237328f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceczoyaho4ym252cygcc3vj2nguct67g3zcgfyxorjatn6yjxkk4dy

nicelove666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 77.68% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

cryptowhizzard commented 1 year ago

@nicelove666

Can i ask why you are storing this on different miners then indicated in your LDN and only with one entity?

Are you aware that if you do this that next allocations cannot be granted according to Fil+ rules and notary guidelines?

Scherm­afbeelding 2023-07-06 om 12 48 52

cryptowhizzard commented 1 year ago

@notary's please check on #1947

nicelove666 commented 1 year ago

The reason we are responding so late is because we have identified:

1、SP is independent and belongs to different companies

2、sp does not use VPN

3、The computer rooms of the different sps you mentioned are in different places, and they don't use vpn either.

For example, f02105219 is an sp from Guangdong, China. They don’t use any vpn. This is their first time joining FIL+. They don’t have any fraud, but you show “ipfsfy.com”.

For example, f02125861 is a SP in Hong Kong, and they don't use any vpn, but you show "ipfsfy.com".

I asked the sp you listed and the sp has absolutely no idea what "ipfsfy.com" is or what ipfsfy.com does, the ipfsfy.com website doesn't even open.

You think that all the sps using "ipfsfy.com" belong to the same group.

According to your logic, ipfsfy.com is developed by a Filecoin company, so the sp of this company uses products developed by its own company. Excuse me, do you think it is possible?

Please be careful! For example, you can observe 500 sps, and you will find that there may be 120 sps that use "ipfsfy.com". Excuse me, do these 120 sps belong to the same group?

Please don't accuse us for no reason. It is very easy to accuse, but it is very easy to make people sad, I believe your actions are out of goodwill, you love Filecoin, and we also love Filecoin, so there is no need to hurt each other!

nicelove666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

nicelove666 commented 1 year ago

The data has been updated to a certain extent. At present, you have seen 4 sps. Within 2 days, you will see 8 sps. There is a big delay in the statistics of the Filecoin network. I think that instead of slandering others, you can spend time Research how to enable bots to synchronize the latest data in real time.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

DataCap allocation requested

1PiB

Id

b01c22b9-65cf-4527-922a-892e76148d44

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1y5mkyvzsfxsapuecbbs4hrrmio2te6ajdqpgedq

Rule to calculate the allocation request amount

200% weekly > 1PiB, requesting 1PiB

DataCap allocation requested

1PiB

Total DataCap granted for client so far

465661.3YiB

Datacap to be granted to reach the total amount requested by the client (27PiB)

465661.3YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
14234 5 512TiB 28.03 128.75TiB
nicelove666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

nicelove666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

herrehesse commented 1 year ago

LDN under dispute, notaries not to sign.

nicelove666 commented 1 year ago

Everything looks healthy, but the retrieval rate doesn't look high due to a bug in the robot:

f01435542 , the retrieval of the SP feedback node is completely normal, but statistically, the retrieval success rate is 0. We randomly sampled a few pieces of data from this node for retrieval and all of them can be retrieved normally.

WechatIMG8031

The community can search by itself. According to our observation, there are indeed problems, and we ask official members for help. @xinaxu

xinaxu commented 1 year ago

This is indeed a bug for f01435542. The multiaddr of the SP was updated but it wan't updated in the bot due to caching. This is now fixed.

nicelove666 commented 1 year ago

thank you! @xinaxu

nicelove666 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

nicelove666 commented 1 year ago

First, this sp has just been repaired, so the retrieval rate will be improved in the next few days. Second, the other two SPs have not solved the retrieval problem for a long time, so we decided not to cooperate with these two SPs. We will develop new SP. Only by supporting us can we see the next round of progress, Thanks

This is indeed a bug for f01435542. The multiaddr of the SP was updated but it wan't updated in the bot due to caching. This is now fixed.

ipollo00 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

woshidama323 commented 1 year ago

Please improve retrieval success rate ASAP in the next round