filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] <SXX Future Data> - <Web3 Cloud Drive> #1613

Open sxxfuture-official opened 1 year ago

sxxfuture-official commented 1 year ago

Data Owner Name

Sxx Future Data

Data Owner Country/Region

China

Data Owner Industry

Web3 / Crypto

Website

https://cloud.sxxfuture.com/products/cloud-drive

Social Media

https://twitter.com/sxxfuture
https://medium.com/@sxxfuture.official
https://discord.gg/GFsJaUFS9K
https://weibo.com/u/7801654981

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

500TiB

On-chain address for first allocation

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

Custom multisig

Identifier

efil

Share a brief history of your project and organization

Web3 Cloud Drive is a decentralized file storage application for the Web3 era, which is based on blockchain technology and IPFS decentralized distributed storage technology, and can provide massive storage space for users who require data security and decentralization, achieving data trust, multi-node backup, decentralized storage, fast retrieval and other functions, ensuring that files are safe and untamperable, fully protecting users' privacy data. It is suitable for data backup, data distribution, collaborative sharing, NFT storage, etc. in Web3 era.
This project is developed and maintained by SXX Future Data. Founded on July 29, 2021 and headquartered in Hubei, China, SXX Future Data focuses on distributed storage cloud services, blockchain product development and application, and Web3 new generation Internet ecology construction.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

We are gonna cooperate with more SPs from other regions.

Describe the data being stored onto Filecoin

The data stored in Web3 Cloud Drive is basically private data that users use for backup, including but not limited to documents and multimedia files.

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://ipfs.crab.merak.sxxfuture.com:8443/ipfs/bafybeiasw6ht2wuns2vdstcega6tusdnizmg4e3anwu7pbw2aabtnv6vk4?filename=Web30%E6%8A%95%E8%9E%8D%E8%B5%84%E7%9B%91%E7%AE%A1%E5%90%88%E8%A7%84%E5%AE%8C%E5%85%A8%E6%89%8B%E5%86%8C.pdf

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China

How will you be distributing your data to storage providers

IPFS

How do you plan to choose storage providers

Slack, Filmine, Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

f01964215
f01986236
What’s more, We will provide those small SPs who newly join the network with technology support.

How do you plan to make deals to your storage providers

Boost client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for datacap. As Filecoin FIL+ notary i am screening your application and conducting due diligence.

As last question i would like you to fill out this form to provide us with the necessary information to make a educated decision on your LDN request if we would like to support it.

Thanks!

kevzak commented 1 year ago

@sxxfuture-official is this a public dataset as listed above? Or private?

sxxfuture-official commented 1 year ago

It is private.

cryptowhizzard commented 1 year ago

Hi @sxxfuture-official

Thanks for submitting your LDN.

On review i noticed you provided SP's , however f01964215 is your own SP.

Awaiting your progress.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

kevzak commented 1 year ago

Hello @sxxfuture-official for a private dataset application, please include more information regarding the SPs:

Also see update application as needed to meet storage guidelines from https://github.com/filecoin-project/filecoin-plus-large-datasets

Best practice for storing large datasets includes ideally, storing it in 3 or more regions, with 4 or more storage provider operators or owners, and having at least 5 replicas of the dataset. No more than one replica should be stored with one SP ID, and if the data cannot leave a particular geographic boundary, then it is expected that replication will still happen across different locations (cities, datacenters, etc.). If you cannot follow these practices due to policy or any other issues, you may explain your case in the application and provide to the community what method you can do instead. These are recommendations and not strict rules that every client must follow.

kevzak commented 1 year ago

Also this applicant needs to complete KYB check before proceeding.

sxxfuture-official commented 1 year ago
kevzak commented 1 year ago

Hello - I can confirm that @sxxfuture-official has completed the KYC and KYB check.

Additionally they have completed the full upfront check registration form. For notaries interested in reviewing this application, send a comment to me, I will share the proof of data set size, data sample, and SP distribution plan.

kevzak commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! :exclamation: We have found some problems in the information provided. The request cannot be posted because the identifier in the issue cannot be retrieved

Please, take a look at the request and edit the body of the issue providing all the required information.
kevzak commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f01940930

Client address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

DataCap allocation requested

250TiB

Id

94572b2e-e2c9-4c04-ade5-395e440e6c41

liyunzhi-666 commented 1 year ago

hey @sxxfuture-official I noticed that you are storing private data of the platform users, have you consulted the platform users? Also, only one PDF document is not enough to prove that you have 5PiB storage requirements. cc @kevzak can you share the proof of data set size, data sample, and SP distribution plan with me?

kevzak commented 1 year ago

hey @sxxfuture-official I noticed that you are storing private data of the platform users, have you consulted the platform users? Also, only one PDF document is not enough to prove that you have 5PiB storage requirements. cc @kevzak can you share the proof of data set size, data sample, and SP distribution plan with me?

Sent @liyunzhi-666

kevzak commented 1 year ago

@sxxfuture-official please share a data sample here for notaries.

Also, please explain the 5PiB DataCap request. Your dataset is 600TiB x 5 copies, correct?

sxxfuture-official commented 1 year ago

@kevzak @liyunzhi-666 image

So far, our project has received more than 600T of customer data, as shown in the figure below: Lk5nMpq2GS

This time we will use the E-FIL+ storage standard to store data, and the number of backups is greater than 5.

liyunzhi-666 commented 1 year ago

600TiB x 5 copies doesn't seem to meet the storage needs of 5PiB, @sxxfuture-official is there any new data to upload later?

sxxfuture-official commented 1 year ago

@liyunzhi-666 600TiB is the original size of the data, and the size of the car file will not be 32GB (about 19GB) after cutting, so the amount occupied will be larger. And the number of copies >=5, the total amount occupied can be selected 5PiB.

Of course, on the other hand, due to the continuous operation of the product, the volume of data has been increasing, but 5PiB is enough for the time being.

liyunzhi-666 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacebxajy2h4qmsesdcv7vtxr736onggh36j6sgztvxhidymhy7xopn6

Address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

Datacap Allocated

250.00TiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebxajy2h4qmsesdcv7vtxr736onggh36j6sgztvxhidymhy7xopn6

laurarenpanda commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceb2oaob6wtg54c73a3hrcbnus3fhsboahtbaivw6stecqscwjuiaw

Address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

Datacap Allocated

250.00TiB

Signer Address

f1bp3tzp536edm7dodldceekzbsx7zcy7hdfg6uzq

Id

94572b2e-e2c9-4c04-ade5-395e440e6c41

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb2oaob6wtg54c73a3hrcbnus3fhsboahtbaivw6stecqscwjuiaw

laurarenpanda commented 1 year ago

@kevzak @liyunzhi-666 image

  • About MinerID: Part of the MinerIDs that will to participate : f01964215 \ f01986236 \ f01985775 \ f01732345 The rest of the Miner will be newly initialized, and the specific ID is not yet known.
  • About Business entity name SXX Future Data / New Web Group / Toploong Tec / Greater Heat and more in future
  • About Region/Location We will distribute the data to different regions and countries in the world. Include :Mainland China, Hong Kong, South America, Singapore ...

So far, our project has received more than 600T of customer data, as shown in the figure below: Lk5nMpq2GS

This time we will use the E-FIL+ storage standard to store data, and the number of backups is greater than 5.

The above info looks great, so I'm willing to sign this round and will pay more attention to its Checker report later.

sxxfuture-official commented 1 year ago

@laurarenpanda Thanks for supporting!

Joss-Hua commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

kevzak commented 1 year ago

Hello @sxxfuture-official - there is now one additional step as part of E-Fil+ application process: To validate your applicant GitHub ID, we ask you to complete the KYC check (a third party ID verification process).

Steps:

Also note:

Let me know if you have any issues or questions.

sxxfuture-official commented 1 year ago

image

kevzak commented 1 year ago

apologies, I will ask the team to fix the bot issue here. This is a new feature under testing.

data-programs commented 1 year ago
KYC

This user’s identity has been verified through filplus.storage

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

sxxfuture-official commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

cryptowhizzard commented 1 year ago

@sxxfuture-official

Can you elaborate on what is going on here?

sxxfuture-official commented 1 year ago

@cryptowhizzard ? working

cryptowhizzard commented 1 year ago

@sxxfuture-official

There is no retrieval?

cryptowhizzard commented 1 year ago

Scherm­afbeelding 2023-07-31 om 08 16 51

sxxfuture-official commented 1 year ago

@cryptowhizzard I mean this is still under sealing for first round, I will urge the SPs to resolve this matter before the next round of sign.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f01940930

Client address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

DataCap allocation requested

500TiB

Id

3371ceb8-0510-4769-96f2-2f3e1a139a11

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01940930

Client address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

500TiB

Total DataCap granted for client so far

250TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.75PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
5310 4 250TiB 37.66 54.06TiB
Joss-Hua commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

Joss-Hua commented 1 year ago

Regarding the controversy over the retrieval in the message, the client told me that although E-FIL does not need to support retrieval, due to their incorrect operation, they checked support retrieval and are now open for retrieval. Based on the 3 reports, I will support this round.

Joss-Hua commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedgun2cvls4ss3aadeflc4d5uh6s2qht44ghdklq5ataveav3mecu

Address

f1ov55gc6aga6hlurmd6mgmpucitazq6dxygcr3wa

Datacap Allocated

500.00TiB

Signer Address

f1tfg54zzscugttejv336vivknmsnzzmyudp3t7wi

Id

3371ceb8-0510-4769-96f2-2f3e1a139a11

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedgun2cvls4ss3aadeflc4d5uh6s2qht44ghdklq5ataveav3mecu

sxxfuture-official commented 1 year ago

image 7a730a39-a0c6-402b-89cc-19401a09eaa5

The display of filplus-checker-app bot is delayed, and the retrieval of all nodes has actually been completed

Tom-OriginStorage commented 1 year ago

checker:manualTrigger