filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Define NFT Platform #91

Closed Alex11801 closed 1 year ago

Alex11801 commented 2 years ago

Large Dataset Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

DeFine is a social NFT platform for all creators, including artists, musicians, influencers, gamers, and athletes. The platform facilitates social interaction, engagement, and communication between creators and their fanbase using digital assets such as NFTs and social/fan tokens, as well as real assets on the blockchain. Owners of social/fan tokens receive benefits such as special access to the creators' NFTs, merchandise, and content, while being part of a private community. It is also a social platform for all participants in the digital world, where they can identify and interact with each other through NFT social profiles based on their contributions and achievements in the digital world. Ultimately, DeFine will serve as a social platform where creators and users define how to engage with each other and build communities in the digital world.

What is the primary source of funding for this project?

Funding from venture investors, plus income from platform revenue.

What other projects/ecosystem stakeholders is this project associated with?

We started with NFTs on Ethereum and have since extended to multiple chains, including Binance Smart Chain, TRON, and Polygon, with plans to expand to more chains. We also have partners across the world, such as Abyss, 3LAU, Mymusictaste, and Huobi Ventures. Details are available here: https://define.one/#partners

Use-case details

Describe the data being stored onto Filecoin

The platform is NFT-based, so it is important to move NFT data to a decentralized storage network, which reduces the risk of data loss. It is also important for users to be able to verify whether NFT data is valid by checking it against the copy stored on Filecoin.

Where was the data in this dataset sourced from?

All the data comes from users and artists on the DeFine platform who created assets and minted them as NFTs, including artwork, paintings, music, and videos.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

https://api.de-fine.art/api/tokens/mainnet/0x70A76282752b5D2F09f81fe86D49d80ED8B53DC7/20

https://define-art-static-prod.s3-ap-northeast-1.amazonaws.com/token/image/mainnet/ERC721/20.png

https://define-art-static-prod.s3-ap-northeast-1.amazonaws.com/token/video/mainnet/ERC721/28.mp4

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, we confirm that the data is public and can be retrieved by anyone on the network.

What is the expected retrieval frequency for this data?

We may need to retrieve data from the Filecoin network to restore it when NFT data stored on our servers is lost, or to verify an NFT when there is doubt about whether it is genuine. We expect to retrieve the data about once every six months on average.

For how long do you plan to keep this dataset stored on Filecoin?

We expect to keep it for about 5 years, though this may vary case by case according to general network requirements.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

We plan to make deals across Asia, North America, and more regions.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Small NFT data will be distributed to storage providers online. For large data, such as a large collection of videos, we prefer offline data transfer. The detailed offline transfer process will be worked out when it is needed.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We will consult storage provider reputation platforms in the Filecoin ecosystem and ask the Filecoin team and community for suggestions. We may first try a small amount of DataCap with several eligible storage providers and then expand our cooperation on storage deals.

How will you be distributing deals across storage providers?

We plan to distribute deals to 3-5 or more miners, and we need multiple copies of the data across those miners.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Although we do not yet know the Filecoin network and the Filecoin Plus program well, our technical teammates have been learning about them for some time, and we have sufficient budget to make storage deals on Filecoin. We may need some support from the community and the Filecoin team when onboarding with the first allocation of DataCap. Thanks.

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes a verified deal, it will be marked as new.

For most DataCap applications, the restrictions below should apply.

⚠️ 50.31% of total deals sealed by f01602479 are duplicate data.

⚠️ 50.25% of total deals sealed by f01606849 are duplicate data.

⚠️ 50.13% of total deals sealed by f01641612 are duplicate data.

⚠️ 50.00% of total deals sealed by f01716466 are duplicate data.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01380788 | Shanghai, Shanghai, CN | 110.17 TiB | 29.30% | 97.76 TiB | 11.26% |
| f01602479 | Longueuil, Quebec, CA | 74.63 TiB | 19.85% | 37.08 TiB | 50.31% |
| f01606849 | Montréal, Quebec, CA | 74.53 TiB | 19.82% | 37.08 TiB | 50.25% |
| f01641612 | Montréal, Quebec, CA | 74.34 TiB | 19.77% | 37.08 TiB | 50.13% |
| f01666984 (new) | Dorval, Quebec, CA | 42.33 TiB | 11.26% | 37.08 TiB | 12.40% |
| f01716466 | Singapore, Singapore, SG | 8.00 GiB | 0.00% | 4.00 GiB | 50.00% |
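The Percentage and Duplicate Deals columns appear to follow from the other two columns: a provider's share is its sealed total divided by the sum of all sealed totals, and its duplicate rate is the part of its sealed total that is not unique data. The sketch below recomputes both from the figures in the provider table. It is an assumption about how the numbers relate, not the checker's actual code, and the function name `summarize` is made up for illustration. Because the table values are already rounded, a recomputed figure can differ from the report in the last digit.

```python
# Figures copied from the provider table, in TiB (8.00 GiB = 8/1024 TiB).
# Each entry is (total deals sealed, unique data).
providers = {
    "f01380788": (110.17, 97.76),
    "f01602479": (74.63, 37.08),
    "f01606849": (74.53, 37.08),
    "f01641612": (74.34, 37.08),
    "f01666984": (42.33, 37.08),
    "f01716466": (8.00 / 1024, 4.00 / 1024),
}


def summarize(table):
    """Return {provider: (share_pct, duplicate_pct)} for a table of
    (total_sealed, unique_data) pairs. Hypothetical helper."""
    grand_total = sum(total for total, _ in table.values())
    result = {}
    for provider, (total, unique) in table.items():
        share = 100 * total / grand_total          # share of all sealed deals
        duplicate = 100 * (total - unique) / total  # non-unique part of own deals
        result[provider] = (share, duplicate)
    return result


summary = summarize(providers)
for provider, (share, duplicate) in summary.items():
    print(f"{provider}: {share:.2f}% of deals, {duplicate:.2f}% duplicate")
```

Under this reading, the warnings in the report correspond to providers whose duplicate rate is about 50%, meaning roughly every unique piece was sealed twice by the same provider.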

Provider Distribution

Deal Data Replication

The table below shows how unique data is replicated across storage providers.

✔️ Data replication looks healthy.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 97.77 TiB | 110.18 TiB | 1 | 29.30% |
| 37.08 TiB | 265.83 TiB | 4 | 70.70% |
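A replication table like this one can be thought of as grouping each unique piece of data by the number of distinct providers that hold it. The following is a minimal sketch of that grouping, assuming deals are available as `(piece_cid, provider, size)` tuples. The function name, the tuple layout, and the sample deals are all hypothetical and do not come from the checker.

```python
from collections import defaultdict


def replication_summary(deals):
    """Group unique pieces by how many distinct providers store them.

    deals: iterable of (piece_cid, provider_id, size_tib) tuples.
    Returns {provider_count: {"unique": ..., "total": ...}} where "unique"
    counts each piece once and "total" counts every deal.
    """
    providers_by_piece = defaultdict(set)
    size_by_piece = {}
    deal_total_by_piece = defaultdict(float)
    for piece_cid, provider, size in deals:
        providers_by_piece[piece_cid].add(provider)
        size_by_piece[piece_cid] = size
        deal_total_by_piece[piece_cid] += size

    buckets = defaultdict(lambda: {"unique": 0.0, "total": 0.0})
    for piece_cid, piece_providers in providers_by_piece.items():
        bucket = buckets[len(piece_providers)]
        bucket["unique"] += size_by_piece[piece_cid]
        bucket["total"] += deal_total_by_piece[piece_cid]
    return dict(buckets)


# Hypothetical deals, not taken from the report.
deals = [
    ("piece-a", "sp1", 1.0),
    ("piece-a", "sp1", 1.0),  # same piece sealed twice by one provider
    ("piece-b", "sp1", 2.0),
    ("piece-b", "sp2", 2.0),
    ("piece-b", "sp3", 2.0),
]
replication = replication_summary(deals)
print(replication)
```

In this toy input, `piece-a` lands in the one-provider bucket even though it was sealed twice, which mirrors the first row of the report's table, where the total deal size exceeds the unique data size for a single provider.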

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually different applications own different data, so their data should not resolve to the same CID.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
| --- | --- | --- | --- | --- |
| f3vygs6wslfvenruzrrbez5yelczv2jjoii4wlvd6yuyb3snm7sajh2ngjflhdntzxewuzmeczy4j6q56fqdsq | Ecology Limited | 1.72 TiB | 1 | LDN #36 |
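The reason a shared CID is a warning sign is that a CID is derived from the content itself, so two clients end up with the same identifier only when they store the same bytes. The sketch below illustrates the idea with a plain SHA-256 digest as a stand-in for a CID. Real Filecoin CIDs are computed differently, and the client names and payloads here are invented for the example.

```python
import hashlib


def content_id(data: bytes) -> str:
    """Stand-in for a CID: a SHA-256 hex digest of the content.
    Real CIDs use a different encoding; this only shows content addressing."""
    return hashlib.sha256(data).hexdigest()


# Hypothetical uploads by two clients; one payload is identical in both.
uploads = {
    "client-a": [b"artwork-1", b"video-7"],
    "client-b": [b"video-7", b"dataset-x"],
}

ids_by_client = {
    client: {content_id(item) for item in items}
    for client, items in uploads.items()
}

# Identifiers that appear under both clients indicate identical content.
shared = ids_by_client["client-a"] & ids_by_client["client-b"]
print(f"{len(shared)} shared identifier(s)")
```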

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with the Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

Sunnyiscoming commented 1 year ago

Hello @Alex11801, per https://github.com/filecoin-project/notary-governance/issues/922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant, and please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by the Fil+ Governance team to confirm its validity, and then the application will be allowed to move forward for additional notary review.