filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Kernelogic - Post Slingshot 2.8 continuation on NEXRAD dataset (1/3) #1004

Closed kernelogic closed 1 year ago

kernelogic commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

Similarly to this LDN comment https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/432#issuecomment-1204902669 from @dkkapur, as Slingshot 2.8 has ended, I'd like to continue storing the whole dataset to its completion under a new LDN, following the same rules as before.

This dataset is 2PB+. I have already got one 5PB allocation, so this time I am requesting 15PB so that I can have approximately 10 replicas in total.

I have participated in every Slingshot phase and am probably the best-performing "small individual client".

I have successfully completed a few LDNs on other datasets, and I have a record showing that I have been following the rules of decentralization and have zero self-dealing.

https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/60
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/59
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/46
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/297
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/298
https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/304

What is the primary source of funding for this project?

Self-funded, BigD exchange.

What other projects/ecosystem stakeholders is this project associated with?

enterprise-sp-wg, BigD exchange.

Use-case details

Describe the data being stored onto Filecoin

Real-time and archival data from the Next Generation Weather Radar (NEXRAD) network.

Where was the data in this dataset sourced from?

https://registry.opendata.aws/noaa-nexrad/

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

The data is primarily compressed binary data. The site below demonstrates how to consume and render the data:
https://nbviewer.org/gist/dopplershift/356f2e14832e9b676207

s3://noaa-nexrad-level2/2021/01/01/TSDF/TSDF20210101_235417_V08
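
As a rough illustration of how the sample object above is addressed, here is a minimal Python sketch that splits the URI into bucket, radar site, scan time, and format version. The layout `s3://<bucket>/YYYY/MM/DD/<SITE>/<SITE>YYYYMMDD_HHMMSS_V<nn>` is inferred from this one sample, and treating the timestamp as UTC is an assumption; the names `NexradKey` and `parse_nexrad_uri` are hypothetical helpers, not part of any library.

```python
import re
from datetime import datetime, timezone
from typing import NamedTuple


class NexradKey(NamedTuple):
    bucket: str
    station: str
    timestamp: datetime
    version: str


# Assumed layout, inferred from the single sample URI in this application.
# The file name must repeat the site and date found in the directory path.
_PATTERN = re.compile(
    r"^s3://(?P<bucket>[^/]+)/(?P<y>\d{4})/(?P<m>\d{2})/(?P<d>\d{2})/"
    r"(?P<site>[A-Z0-9]{4})/"
    r"(?P=site)(?P=y)(?P=m)(?P=d)_(?P<hms>\d{6})_(?P<ver>V\d{2})$"
)


def parse_nexrad_uri(uri: str) -> NexradKey:
    """Split a Level II object URI into its components."""
    match = _PATTERN.match(uri)
    if match is None:
        raise ValueError(f"unrecognized NEXRAD object URI: {uri!r}")
    stamp = datetime.strptime(
        match["y"] + match["m"] + match["d"] + match["hms"], "%Y%m%d%H%M%S"
    ).replace(tzinfo=timezone.utc)  # assumption: scan times are UTC
    return NexradKey(match["bucket"], match["site"], stamp, match["ver"])


key = parse_nexrad_uri(
    "s3://noaa-nexrad-level2/2021/01/01/TSDF/TSDF20210101_235417_V08"
)
print(key.station, key.timestamp.isoformat(), key.version)
```

A parser like this is only useful for grouping objects by site and day when preparing CAR files; it does not read the radar data itself.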

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

AWS open dataset

What is the expected retrieval frequency for this data?

Infrequent. However, all details are available in my browser at https://slingshot.kernelogic.ca/nexrad.html?v=2.8

For how long do you plan to keep this dataset stored on Filecoin?

Between 365 and 520 days.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

All regions.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

I will upload my prepared CAR files to a web server and coordinate with providers to download and propose offline deals.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

Besides the SPs I have worked with previously, I also use BigD exchange to further decentralize the storage.

To name a few from the community that I deal with regularly: PIKNIK, Holon, CabrinaHuang, HarryM, BigBear, j1v, XinAn Xu, WillTechMusing.

From BigD exchange: Mog Li, Devin Chen, DSS Nathanial Marsh, Rabinovitch, Vin K, arockpool Tony

How will you be distributing deals across storage providers?

Evenly across all providers I propose deals to, if they can handle it. If a miner is itself a notary, that notary will receive no more than 20% of the total granted DataCap.
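
The distribution rule above can be sketched as a small check. This is only an illustration under stated assumptions: the provider IDs (`f0aaa`, `f0bbb`, `f0ccc`), the amounts, and the helper names `even_share_tib` and `notary_cap_violations` are hypothetical, and the 20% cap is measured here against the total granted DataCap as the answer describes.

```python
def even_share_tib(granted_tib: float, provider_count: int) -> float:
    """Target amount per provider when DataCap is split evenly."""
    if provider_count <= 0:
        raise ValueError("provider_count must be positive")
    return granted_tib / provider_count


def notary_cap_violations(
    deals_tib: dict[str, float],
    notary_providers: set[str],
    granted_tib: float,
    cap: float = 0.20,
) -> dict[str, float]:
    """Return notary-run providers whose share of granted DataCap exceeds cap."""
    if granted_tib <= 0:
        raise ValueError("granted_tib must be positive")
    return {
        provider: tib / granted_tib
        for provider, tib in deals_tib.items()
        if provider in notary_providers and tib / granted_tib > cap
    }


# Hypothetical example: f0aaa and f0bbb are run by notaries, f0ccc is not.
deals = {"f0aaa": 300.0, "f0bbb": 100.0, "f0ccc": 100.0}
violations = notary_cap_violations(deals, {"f0aaa", "f0bbb"}, granted_tib=1000.0)
print(violations)
```

Only `f0aaa` is flagged in this made-up example, because it holds 30% of the granted amount while `f0bbb` stays under the cap.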

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

I have all I need to start making deals.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

raghavrmadya commented 2 years ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

750TiB

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

large-datacap-requests[bot] commented 2 years ago

DataCap Allocation requested

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

DataCap allocation requested

256TiB

psh0691 commented 2 years ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecevv46q6gmrjyqmf4ure5virhkhtuzel7rs7ith2rtw7zhdic2bg

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

256.00TiB

Signer Address

f1qdko4jg25vo35qmyvcrw4ak4fmuu3f5rif2kc7i

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecevv46q6gmrjyqmf4ure5virhkhtuzel7rs7ith2rtw7zhdic2bg

xiaoyuaiheshui commented 2 years ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacear66il2z2q5psn3gts7nt4fp576caslncayop5ksetpsi55lkqf6

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

256.00TiB

Signer Address

f122qmy25wdtt5mxd77kndiq7z5x2n3iwiuz2wdsa

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacear66il2z2q5psn3gts7nt4fp576caslncayop5ksetpsi55lkqf6

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

DataCap allocation requested

512TiB

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Last two approvers

jggapp & psh0691

Rule to calculate the allocation request amount

10% of total dc amount requested

DataCap allocation requested

512TiB

Total DataCap granted for client so far

256TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.75PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 6296 | 6 | 256TiB | 25.86 | 63.78TiB |

flyworker commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaced7ns27ekwl2xghcllec2kxg2qckwrstn7dlwhfqnom27jx2qaquu

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

512.00TiB

Signer Address

f1hlubjsdkv4wmsdadihloxgwrz3j3ernf6i3cbpy

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaced7ns27ekwl2xghcllec2kxg2qckwrstn7dlwhfqnom27jx2qaquu

liyunzhi-666 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedhngedt4vpf6f7ei3ieutl2khpveg4hqjhbizxwjroxiik7neham

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

512.00TiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedhngedt4vpf6f7ei3ieutl2khpveg4hqjhbizxwjroxiik7neham

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

DataCap allocation requested

1PiB

Id

9eeed983-96be-444f-8132-e8c8a83a0980

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Last two approvers

liyunzhi-666 & flyworker

Rule to calculate the allocation request amount

20% of total dc amount requested

DataCap allocation requested

1PiB

Total DataCap granted for client so far

768TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.25PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 20283 | 14 | 512TiB | 11.81 | 127.40TiB |

liyunzhi-666 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacectujix4g5cp74ett6ho2q67xblk7s7nllmfi32jfvplz32tr32sc

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

1.00PiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

Id

9eeed983-96be-444f-8132-e8c8a83a0980

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacectujix4g5cp74ett6ho2q67xblk7s7nllmfi32jfvplz32tr32sc

Tom-OriginStorage commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceahh4x47rz6lki4zgvjyyuck5pyjwzptw7ealoli7fxtjqa6aanbg

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

1.00PiB

Signer Address

f1q6bpjlqia6iemqbrdaxr2uehrhpvoju3qh4lpga

Id

9eeed983-96be-444f-8132-e8c8a83a0980

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceahh4x47rz6lki4zgvjyyuck5pyjwzptw7ealoli7fxtjqa6aanbg

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

DataCap allocation requested

2PiB

Id

f335582f-b10a-4627-b82a-551d9c348a2b

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Last two approvers

llifezou & liyunzhi-666

Rule to calculate the allocation request amount

40% of total dc amount requested

DataCap allocation requested

2PiB

Total DataCap granted for client so far

1.75PiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

3.25PiB
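
The figures in the bot's "Stats & Info" comments for requests 2 to 4 can be reproduced with simple arithmetic. The sketch below assumes 1 PiB = 1024 TiB and takes the first 256TiB allocation as given, since the thread does not state the rule behind it; the 10%, 20%, and 40% rules are copied from the comments, and the variable names are illustrative only.

```python
TIB_PER_PIB = 1024

total_requested_tib = 5 * TIB_PER_PIB  # "Total DataCap requested: 5PiB"
first_allocation_tib = 256             # taken from the thread as-is
rule_percent = [10, 20, 40]            # rules quoted for requests 2, 3, 4

granted_tib = first_allocation_tib
history = []
for percent in rule_percent:
    # Tranche size is a fixed percentage of the total amount requested.
    tranche_tib = total_requested_tib * percent // 100
    remaining_tib = total_requested_tib - granted_tib
    history.append((tranche_tib, granted_tib, remaining_tib))
    granted_tib += tranche_tib

for tranche_tib, granted_before, remaining in history:
    print(
        f"request {tranche_tib} TiB | granted so far {granted_before} TiB "
        f"| still to grant {remaining / TIB_PER_PIB:.2f} PiB"
    )
```

Each printed row corresponds to one bot comment: the tranche requested, the total granted before it, and the amount still needed to reach 5 PiB.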

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 50150 | 29 | 1PiB | 8.76 | 246.28TiB |

newwebgroup commented 1 year ago

The distribution of SPs is very scattered, very good.

newwebgroup commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecbgpgalgiyunz6xeb7cjh444z7nvkccqawirxz6kgkjmkqffov4e

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

2.00PiB

Signer Address

f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq

Id

f335582f-b10a-4627-b82a-551d9c348a2b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecbgpgalgiyunz6xeb7cjh444z7nvkccqawirxz6kgkjmkqffov4e

cryptowhizzard commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecalwlxkscxhahpsadf3yktcb2s7fhyoomrjmbhjkoe6e2bgtaahg

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

2.00PiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

f335582f-b10a-4627-b82a-551d9c348a2b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecalwlxkscxhahpsadf3yktcb2s7fhyoomrjmbhjkoe6e2bgtaahg

flyworker commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecszm5k7p4dn3kiwzqstxwrbskuimzsiexdoyawcdxvmrdvt3ob5i

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

2.00PiB

Signer Address

f1hlubjsdkv4wmsdadihloxgwrz3j3ernf6i3cbpy

Id

f335582f-b10a-4627-b82a-551d9c348a2b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecszm5k7p4dn3kiwzqstxwrbskuimzsiexdoyawcdxvmrdvt3ob5i

liyunzhi-666 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebtrcxwdkjtiv4tlza7m65ymdj5zfea36gicdvyxzqwnyd6wtvnu6

Address

f1yy7riqoc3vm7jv6nawupnytj4m6sajfuq7kqn6q

Datacap Allocated

2.00PiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

Id

f335582f-b10a-4627-b82a-551d9c348a2b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebtrcxwdkjtiv4tlza7m65ymdj5zfea36gicdvyxzqwnyd6wtvnu6

filplus-checker commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes a verified deal, it will be marked as new.

For most DataCap applications, the restrictions below should apply.

✔️ Storage provider distribution looks healthy.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01949267 | Shanghai, Shanghai, CN | 137.31 TiB | 8.89% | 137.31 TiB | 0.00% |
| f01949260 | Shanghai, Shanghai, CN | 131.19 TiB | 8.50% | 131.19 TiB | 0.00% |
| f01904630 new | Las Vegas, Nevada, US | 88.56 TiB | 5.74% | 88.56 TiB | 0.00% |
| f01927554 | Shenzhen, Guangdong, CN | 75.88 TiB | 4.91% | 75.69 TiB | 0.25% |
| f01915033 | Chengdu, Sichuan, CN | 75.00 TiB | 4.86% | 75.00 TiB | 0.00% |
| f01923787 | Shenzhen, Guangdong, CN | 74.69 TiB | 4.84% | 74.25 TiB | 0.59% |
| f01660795 | Shenzhen, Guangdong, CN | 74.19 TiB | 4.80% | 73.91 TiB | 0.38% |
| f01873432 | Las Vegas, Nevada, US | 57.63 TiB | 3.73% | 57.63 TiB | 0.00% |
| f01889668 | San Jose, California, US | 56.25 TiB | 3.64% | 56.25 TiB | 0.00% |
| f01518369 | San Jose, California, US | 56.25 TiB | 3.64% | 56.25 TiB | 0.00% |
| f01918045 | Kuala Lumpur, Kuala Lumpur, MY | 54.03 TiB | 3.50% | 54.03 TiB | 0.00% |
| f01909705 | Kuala Lumpur, Kuala Lumpur, MY | 54.00 TiB | 3.50% | 54.00 TiB | 0.00% |
| f01918046 | Kuala Lumpur, Kuala Lumpur, MY | 53.84 TiB | 3.49% | 53.84 TiB | 0.00% |
| f01928097 | Hong Kong, Central and Western, HK | 50.25 TiB | 3.25% | 50.25 TiB | 0.00% |
| f0143858 | Clifton, New Jersey, US | 48.44 TiB | 3.14% | 48.44 TiB | 0.00% |
| f03223 | San Jose, California, US | 46.59 TiB | 3.02% | 46.59 TiB | 0.00% |
| f0240185 | Clifton, New Jersey, US | 46.28 TiB | 3.00% | 46.28 TiB | 0.00% |
| f02301 | San Jose, California, US | 44.56 TiB | 2.89% | 44.56 TiB | 0.00% |
| f01938674 | Shenzhen, Guangdong, CN | 40.97 TiB | 2.65% | 40.94 TiB | 0.08% |
| f0142637 | Chengdu, Sichuan, CN | 37.44 TiB | 2.42% | 37.44 TiB | 0.00% |
| f01938671 | Hong Kong, Central and Western, HK | 34.53 TiB | 2.24% | 34.53 TiB | 0.00% |
| f01923786 | Hong Kong, Central and Western, HK | 34.16 TiB | 2.21% | 34.09 TiB | 0.18% |
| f01946104 | Chengdu, Sichuan, CN | 34.00 TiB | 2.20% | 34.00 TiB | 0.00% |
| f01859603 | Shenzhen, Guangdong, CN | 26.13 TiB | 1.69% | 26.13 TiB | 0.00% |
| f0397083 | Tokyo, Tokyo, JP | 18.75 TiB | 1.21% | 18.75 TiB | 0.00% |
| f01985775 | Dallas, Texas, US | 18.75 TiB | 1.21% | 18.75 TiB | 0.00% |
| f033462 | Dallas, Texas, US | 18.75 TiB | 1.21% | 18.75 TiB | 0.00% |
| f01943663 new | Hong Kong, Central and Western, HK | 18.72 TiB | 1.21% | 18.72 TiB | 0.00% |
| f01907578 | Fuzhou, Fujian, CN | 18.31 TiB | 1.19% | 18.31 TiB | 0.00% |
| f01985745 | Dallas, Texas, US | 18.22 TiB | 1.18% | 18.22 TiB | 0.00% |
| f01969779 | Clifton, New Jersey, US | 544.00 GiB | 0.03% | 544.00 GiB | 0.00% |

Provider Distribution

Deal Data Replication

The table below shows how the unique data is replicated across storage providers.

⚠️ 38.32% of deals are for data replicated across fewer than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 59.31 TiB | 59.31 TiB | 1 | 3.84% |
| 119.59 TiB | 239.19 TiB | 2 | 15.49% |
| 97.72 TiB | 293.16 TiB | 3 | 18.98% |
| 64.09 TiB | 256.38 TiB | 4 | 16.60% |
| 28.13 TiB | 140.91 TiB | 5 | 9.12% |
| 21.13 TiB | 126.78 TiB | 6 | 8.21% |
| 33.13 TiB | 232.41 TiB | 7 | 15.05% |
| 21.81 TiB | 174.56 TiB | 8 | 11.30% |
| 1.19 TiB | 10.75 TiB | 9 | 0.70% |
| 992.00 GiB | 9.72 TiB | 10 | 0.63% |
| 96.00 GiB | 1.03 TiB | 11 | 0.07% |
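
The warning percentage can be approximated from the table itself. The checker's actual implementation is not shown in this thread, so the sketch below is an assumption about the calculation: it sums "Total Deals Made" for rows with fewer than 4 providers and divides by the total across all rows, using the rounded TiB values as printed.

```python
# (number of providers, total deals made in TiB), copied from the table above.
replication_rows = [
    (1, 59.31), (2, 239.19), (3, 293.16), (4, 256.38),
    (5, 140.91), (6, 126.78), (7, 232.41), (8, 174.56),
    (9, 10.75), (10, 9.72), (11, 1.03),
]


def low_replication_share(rows, min_providers=4):
    """Percentage of deal volume whose data sits on fewer than min_providers SPs."""
    total = sum(tib for _, tib in rows)
    low = sum(tib for providers, tib in rows if providers < min_providers)
    return 100.0 * low / total


share = low_replication_share(replication_rows)
print(f"{share:.1f}% of deal volume is on fewer than 4 providers")
```

Because the inputs are already rounded to two decimals, the result lands at roughly 38.3% and may differ from the reported 38.32% in the last digit.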

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data, which should not resolve to the same CID.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
| --- | --- | --- | --- | --- |
| f154a4iq5mxq76avoooc5a3unchfbrjg7itkjfl6y | Fei Yan - Kernelogic | 1.59 PiB | 11,273 | LDN v3 multisig |
| f1tszopsyo4fs75f4ia5h25btfob7t6upmj2w27qq | Fei Yan - Kernelogic | 1.11 PiB | 9,545 | LDN v3 multisig |
| f3waskb5svh6ywfzhm2khvsomjlgb5rif6flf2bcyn3ift7xswlffifn32jmvttf7o43zchku2manh7y34w4pq | Unknown | 320.00 GiB | 2 | Unknown |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

kernelogic commented 1 year ago

keepalive

large-datacap-requests[bot] commented 10 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided.

- We could not find Website / Social Media field in the information provided
- We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided
- We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided
- We could not find On-chain address for first allocation field in the information provided
- We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.

large-datacap-requests[bot] commented 7 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release

large-datacap-requests[bot] commented 3 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release