filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <Sparklink Protocol> - <Sparklink Protocol> #1671

Closed Erreth-Akbe closed 1 year ago

Erreth-Akbe commented 1 year ago

Data Owner Name

Sparklink protocol

Data Owner Country/Region

United States

Data Owner Industry

Information, Media & Telecommunications

Website

Sparklink.io

Social Media

https://twitter.com/SparkLink_io

https://discord.gg/3EV9V7Tenb

https://medium.com/@SparkLink_Protocol

Total amount of DataCap being requested

2PiB

Weekly allocation of DataCap requested

50TiB

On-chain address for first allocation

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Custom multisig

Identifier

None

Share a brief history of your project and organization

SparkLink Protocol is a unique decentralized protocol for NFT content creation and sharing. Based on Web 3.0 concepts, it lets anyone create their own NFT works through SparkLink and set their own sale and distribution fees. Any sale or dissemination of an NFT after it is generated produces revenue for the owner or node user. SparkLink provides users with a new decentralized way to distribute NFTs and spread content. We have developed a new extension protocol, ERC-721S, based on the current NFT protocol ERC-721. It combines the scarcity and irreplaceable value of NFTs with the sharing-incentive property of a larger content layer, allowing content in any format to be split indefinitely and to reap benefits through NFTs.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

The project is deployed on Binance Smart Chain and Polygon.

Describe the data being stored onto Filecoin

Pictures, music, and media materials

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

Pinata

How do you plan to prepare the dataset

IPFS

If you answered "other/custom tool" in the previous question, enter the details here

null

Please share a sample of the data

We have tens of thousands of pages of files on the Pinata server, including pictures, videos, and audio. Here are some sample files (some of which have been asymmetrically encrypted):
https://i.328888.xyz/2023/02/23/xB5EF.png
sparklink.mypinata.cloud
https://sparklink.mypinata.cloud/ipfs/QmSnBxzqunw9xWBHGmV7Mkoq2dBrGWxgcFhKzyYg355Kbb?_gl=1*13a4ura*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyNzk4MC42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/QmTQhGVnL8bSrZNVg5ZqgSN3YXW5bHRrs8PCi8xZGpWCCx?_gl=1*13a4ura*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyNzk4MC42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/QmTpKk6sijZUBraQgkRP9xdr2qd5MwXGTZFbNBM6SbrLpp?_gl=1*13a4ura*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyNzk4MC42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/QmeSGXg2WeaAcQ9J1ttWL7r1AoJWHt2TZPeiZCMw7LXSFh?_gl=1*1r47vai*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyODA0NS42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/QmaH7o9RQxA1wUs283bFn1hxXzA7WKwSeAoDbFo8oSRP7b?_gl=1*1r47vai*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyODA0NS42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/QmfFgPHGFThrp9TaFi7Hy4afdamY2vv8iAaZypHDLFmo5G?_gl=1*1r47vai*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyODA0NS42MC4wLjA.
https://sparklink.mypinata.cloud/ipfs/Qma3Lo8apbvN2A4cvGyJZC2aFdU2svEyqtabTnM1sHGWkJ?_gl=1*1r47vai*_ga*MTY4NDI2MzAxOC4xNjczMDIxNzcx*_ga_5RMPXG14TE*MTY3NzEyNjk2Ny4yLjEuMTY3NzEyODA0NS42MC4wLjA.
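The sample links above are gateway URLs of the form `/ipfs/<CID>` followed by an analytics query string. As a minimal sketch, assuming only that URL shape, the CID can be pulled out with the Python standard library; the function name `cid_from_gateway_url` is hypothetical and is not part of any Pinata or IPFS tooling.

```python
from urllib.parse import urlsplit


def cid_from_gateway_url(url: str) -> str:
    """Return the path segment that follows /ipfs/ in a gateway URL."""
    parts = urlsplit(url)  # the query string (?_gl=...) ends up in parts.query and is ignored
    segments = [s for s in parts.path.split("/") if s]
    if len(segments) < 2 or segments[0] != "ipfs":
        raise ValueError(f"not an /ipfs/<cid> URL: {url}")
    return segments[1]


# One of the sample links from the application; the query string is shortened here.
sample = (
    "https://sparklink.mypinata.cloud/ipfs/"
    "QmSnBxzqunw9xWBHGmV7Mkoq2dBrGWxgcFhKzyYg355Kbb?_gl=1*13a4ura"
)
print(cid_from_gateway_url(sample))
```

This only parses the string. It does not check that the CID is well formed or that the content is retrievable.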

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

The DApp has an unlock function: some files can only be viewed through “pay to unlock”. Those files are encrypted using asymmetric encryption, and only files for which this function was selected are affected. This is an important feature of the product.

What is the expected retrieval frequency for this data

Weekly

For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server

How do you plan to choose storage providers

Slack, Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for DataCap. As a Filecoin FIL+ notary, I am screening your application and conducting due diligence.

Looking at your application, I have some questions: Can you show us visible proof of the size of your data and the storage systems you have there?

As a last question, I would like you to fill out this form to provide us with the necessary information to make an educated decision on whether we would like to support your LDN request.

Thanks!

Erreth-Akbe commented 1 year ago

We have completed the documentation. As for proof of the size of the stored data: since we used Pinata's service before, all we have are the CIDs of these files. I can list them below, along with the Pinata debit records.

cdcf84134ca50ff4b5af13dabb6324b
QmSnBxzqunw9xWBHGmV7Mkoq2dBrGWxgcFhKzyYg355Kbb
QmTQhGVnL8bSrZNVg5ZqgSN3YXW5bHRrs8PCi8xZGpWCCx
QmURYSDPFYpE7ABK4zhWw2YNtwrtvtxK9zzWQ8roScTuE5
QmWWJHVM26pa7mWwYckNzrPAVbZbD3cZwXJZfKcRKc1XZv
QmeSGXg2WeaAcQ9J1ttWL7r1AoJWHt2TZPeiZCMw7LXSFh

If you need the complete CIDs of all our files, we can crawl and export them, but a large part of the files are AES symmetric encrypted.

cryptowhizzard commented 1 year ago

Good morning.

Thanks for sending me the data and KYC.

I am missing the data onboarding plan (the SPs you are going to use). Please let me know when you are ready so I can check them, followed by a proposal when they are OK.

Sunnyiscoming commented 1 year ago

I think maybe the dataset you want to store is more suitable for applying for E-Fil+. @kevzak Hope you can give him a hand.

Erreth-Akbe commented 1 year ago

Sorry, I didn't explain clearly. To be precise, only part of the files are AES encrypted. In our DApp, the JSON generated on upload is never encrypted. Each JSON has a file attached, and the user can choose whether that file is AES encrypted. Currently only about 1/4 of the NFT works are released encrypted.

Erreth-Akbe commented 1 year ago

Also, the data we store is user data, and it is in fact accessed at high frequency. For example, if a user stores an encrypted file, our server detects whether another user has purchased that user's NFT, and if so, we distribute a decryption key to the buyer. So it is actually high-frequency access.
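The unlock flow described in this comment (check for a purchase, then hand out a key) can be sketched roughly as follows. This is an illustration only: the class `UnlockService`, its in-memory dictionaries, and the method names are all hypothetical, and the real service presumably reads purchase records from the chain rather than from memory.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class UnlockService:
    # token_id -> decryption key held by the server (hypothetical storage)
    keys: Dict[str, bytes] = field(default_factory=dict)
    # token_id -> addresses recorded as having purchased that NFT (hypothetical)
    purchases: Dict[str, Set[str]] = field(default_factory=dict)

    def record_purchase(self, token_id: str, buyer: str) -> None:
        """Remember that `buyer` purchased the NFT identified by `token_id`."""
        self.purchases.setdefault(token_id, set()).add(buyer)

    def request_key(self, token_id: str, requester: str) -> Optional[bytes]:
        """Return the decryption key only if `requester` has purchased the NFT."""
        if requester in self.purchases.get(token_id, set()):
            return self.keys.get(token_id)
        return None


service = UnlockService(keys={"nft-1": b"example-key"})
service.record_purchase("nft-1", "buyer-address")
print(service.request_key("nft-1", "buyer-address"))   # key is released
print(service.request_key("nft-1", "someone-else"))    # no purchase, no key
```

Every retrieval of an encrypted file goes through a check like `request_key`, which is why the applicant describes the access pattern as high frequency.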

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

But the large dataset program is open only to fully public data, without any encrypted files. Can you tell me how much fully public data you have?

Sunnyiscoming commented 1 year ago

Any update here?

Erreth-Akbe commented 1 year ago

We now choose the Fil+ service only to store public data, because it is more cost-effective than using IPFS indirectly through Pinata. The storage space needed depends on how often users use the service. Based on the average usage, it will take about 50TiB per week. We will store only unencrypted data in this space.

Erreth-Akbe commented 1 year ago

If you want to see more proofs, you can also search for our contract address on any NFT platform to see the NFT data generated by users. We have user records on BSC, ETH, POLYGON.

Sunnyiscoming commented 1 year ago

Large dataset applications are open to existing public datasets. Can you tell me how much data you already have?

Erreth-Akbe commented 1 year ago

We are currently using the Pinata service to indirectly store hundreds of terabytes of data, and we are adding more every day.

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

2PiB

Expected weekly DataCap usage rate

50TiB

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! :exclamation: We have found some problems in the information provided. The request cannot be posted because the identifier in the issue cannot be retrieved

Please, take a look at the request and edit the body of the issue providing all the required information.

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

2PiB

Expected weekly DataCap usage rate

50TiB

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! :exclamation: We have found some problems in the information provided. The request cannot be posted because the identifier in the issue cannot be retrieved

Please, take a look at the request and edit the body of the issue providing all the required information.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

2PiB

Expected weekly DataCap usage rate

50TiB

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

DataCap allocation requested

25TiB

Id

38510d49-3f75-4e53-ac9d-872ab134efbf

Erreth-Akbe commented 1 year ago

Thank you!

cryptowhizzard commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedo5tllfhtmt2iwl7tz3fzd5m2ialyp5kdmfoxo3i6t76jja4pv7k

Address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Datacap Allocated

25.00TiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

38510d49-3f75-4e53-ac9d-872ab134efbf

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedo5tllfhtmt2iwl7tz3fzd5m2ialyp5kdmfoxo3i6t76jja4pv7k

sxxfuture-official commented 1 year ago

Thanks for your application. As a Filecoin FIL+ notary, I am screening your application and conducting due diligence. I hope you can answer the following two questions:

Erreth-Akbe commented 1 year ago

Sorry, we don't have a mail service set up for this domain. What should I do in this case? Can I use the official contract deployment wallet address for signature verification? Or I can provide some other information; for example, you can see that the owner of the official GitHub org is my GitHub account.

Erreth-Akbe commented 1 year ago

We have replied to your Twitter account through our official account for verification, please confirm: https://twitter.com/sxxfuture/status/1638450562126729219

Erreth-Akbe commented 1 year ago

f02009673 f02008876 f02008883 f02012674

The sealing plan is in Asia minus GCR and Greater China SPs.

sxxfuture-official commented 1 year ago

We have replied to your Twitter account through our official account for verification, please confirm: https://twitter.com/sxxfuture/status/1638450562126729219

Received your sealing plan, but the Twitter reply has not been found yet. Alternatively, you can use the official Twitter account @SparkLink_io to post a KYC message, and I will sign it for you this round.

sxxfuture-official commented 1 year ago

OK, I found it.

sxxfuture-official commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceapyp7xkaeunbgg3flazepnh54ozxzlommsaapoyjyrve7ichwcpa

Address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Datacap Allocated

25.00TiB

Signer Address

f1foiomqlmoshpuxm6aie4xysffqezkjnokgwcecq

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceapyp7xkaeunbgg3flazepnh54ozxzlommsaapoyjyrve7ichwcpa

herrehesse commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

DataCap allocation requested

50TiB

Id

0e78dad5-d61d-428b-8397-37757d499f9e

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

50TiB

Total DataCap granted for client so far

25TiB

Datacap to be granted to reach the total amount requested by the client (2PiB)

1.97PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 0 | 0 | 25TiB | NaN | 1.25TiB |
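The figures in this bot comment can be checked with a little arithmetic. The sketch below assumes binary units (1 PiB = 1024 TiB) and uses only numbers quoted in the thread; the variable names are illustrative and are not taken from the bot's code.

```python
TIB_PER_PIB = 1024  # binary units: 1 PiB = 1024 TiB

total_requested_tib = 2 * TIB_PER_PIB  # "Total DataCap requested": 2PiB
weekly_tib = 50                        # "Expected weekly DataCap usage rate": 50TiB
granted_so_far_tib = 25                # "Total DataCap granted for client so far": 25TiB
rule_percent = 100                     # "100% of weekly dc amount requested"

# Size of the second allocation under the stated rule.
next_allocation_tib = weekly_tib * rule_percent // 100

# DataCap still to be granted before the client reaches the 2PiB total.
remaining_tib = total_requested_tib - granted_so_far_tib
remaining_pib = remaining_tib / TIB_PER_PIB

print(next_allocation_tib, "TiB requested in this round")
print(remaining_tib, "TiB still to be granted, i.e.", remaining_pib, "PiB")
```

The remaining amount comes out slightly above 1.975 PiB, which agrees with the reported 1.97PiB if the bot truncates to two decimals instead of rounding. That truncation is an assumption, not something the thread confirms.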

spaceT9 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.
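To make the "Deal Data Replication" warning in this report concrete, here is a rough sketch of how such a percentage could be computed from a list of deals. It is not the checker's actual implementation: the function name, the `(piece, provider)` tuple layout, and the choice to count deals rather than weight them by size are all assumptions, and the deal data is made up.

```python
from collections import defaultdict
from typing import Iterable, List, Tuple


def low_replication_share(deals: Iterable[Tuple[str, str]], min_providers: int = 3) -> float:
    """Percentage of deals whose piece is stored with fewer than `min_providers` providers."""
    deal_list: List[Tuple[str, str]] = list(deals)
    if not deal_list:
        return 0.0
    # Collect the distinct providers that hold each piece.
    providers_by_piece = defaultdict(set)
    for piece, provider in deal_list:
        providers_by_piece[piece].add(provider)
    low = sum(1 for piece, _ in deal_list if len(providers_by_piece[piece]) < min_providers)
    return 100.0 * low / len(deal_list)


# Made-up deals: no piece reaches three distinct providers.
example_deals = [
    ("piece-a", "sp-1"),
    ("piece-a", "sp-2"),
    ("piece-b", "sp-1"),
    ("piece-c", "sp-3"),
]
print(low_replication_share(example_deals))
```

With data shaped like this, every deal counts as under-replicated, which is the situation the 100.00% warning describes. Storing each piece with at least three distinct providers would bring the figure down.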

spaceT9 commented 1 year ago

The retrieval rate is too low and needs to be improved.

Aaron01230 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

Aaron01230 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

a1991car commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

a1991car commented 1 year ago

Retrieval of the backups doesn't look great and needs to be improved, but the retrieval rate is improving. Considering this is only the second round, and the first round was only 25TiB, I support it for now.

a1991car commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecena3rjzw2nqhmi6fwr5rabi4zr4lbu2bl52idh4zbtkdbt2znx4

Address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Datacap Allocated

50.00TiB

Signer Address

f1qnumecdypgrbaebtkdfjnwt5ndacadcuas3deiq

Id

0e78dad5-d61d-428b-8397-37757d499f9e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecena3rjzw2nqhmi6fwr5rabi4zr4lbu2bl52idh4zbtkdbt2znx4

Aaron01230 commented 1 year ago

The retrieval rate is improving. Judging from https://datacapstats.io/clients/f02031033/breakdown, the data distribution statistics shown in the report are not accurate. I will support this round.

Aaron01230 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceave2kqxsz2mx7gqdlpcktkr7pdzmfooy7frlmgwb6e24ek4obwag

Address

f1hjiip3jr6u4a5wgrktfyvgtqrmf3xyxbhpcvbki

Datacap Allocated

50.00TiB

Signer Address

f1xrnysd4gimg64d4l6qi7ulzwwq22c6vfg6lpw3i

Id

0e78dad5-d61d-428b-8397-37757d499f9e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceave2kqxsz2mx7gqdlpcktkr7pdzmfooy7frlmgwb6e24ek4obwag

NewHuoPool commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

NewHuoPool commented 1 year ago

Given the smaller allocation in the first round, it's understandable that these problems were reported. I will keep an eye on whether the community rules are followed in later rounds.