filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] FileDrive Labs - Datasets Landing Plan - [3/3] #1268

Closed laurarenpanda closed 1 year ago

laurarenpanda commented 1 year ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

FileDrive Datasets Landing Plan is a project for onboarding more valuable public datasets onto the Filecoin network. Through several phases, we plan to bring 10 PiB of data to Filecoin and drive 100 PiB of storage power growth.

About FileDrive Datasets

FileDrive Datasets is a platform to effectively connect the huge storage market that Filecoin has built with publishers of public datasets.
The Filecoin network provides reliable, secure, and affordable decentralized storage services, and FileDrive Labs wants to deliver these benefits to end-users by building a public dataset platform.
It is challenging to attract traditional cloud storage and object-based storage users to the Filecoin network and help them benefit from it. Developers in the Filecoin ecosystem, such as FileDrive Labs, need to face this challenge together.
As a member of the Filecoin ecosystem, FileDrive Labs has been committed to developing useful tools that make it easier for users to store their data on the Filecoin network.

FileDrive Datasets integrates a group of tools to provide a storage service that is compatible with both cloud storage and object-based storage and offers a better user experience to attract more users.
Ongoing projects behind it:
- Go-Graphsplit: https://github.com/filedrive-team/go-graphsplit
- DS-Cluster: https://github.com/filedrive-team/go-ds-cluster
- Filejoy: https://github.com/filedrive-team/filejoy

Article about FileDrive Datasets on Filecoin Blog:
- Large Datasets: FileDrive: https://filecoin.io/blog/posts/large-datasets-filedrive/

About FileDrive Labs

FileDrive Labs has always defined itself as a tool developer and infrastructure builder in the Filecoin ecosystem. Since 2019, we have focused continuously on technical solutions and development based on the IPFS protocol and the Filecoin network, and we do our best to contribute to the community.
Over 80% of our team are qualified engineers, and half of them have more than 10 years of development experience in multiple industries, including communications, the Internet, and blockchain.
Since 2020, we have participated in the Slingshot Competition, become one of the top teams, and stored over 5 PiB of useful data from public datasets on the Filecoin network.
To contribute to the Filecoin community, we developed an open-source data prep tool, Graphsplit; a FIL+ project dashboard, filplus.info; and a storage provider discovery platform, filfind.info.
In addition, we have held a weekly online event named FileDrive Meetup since March 2022, which aims to provide a platform for community members to follow the latest trends of the Filecoin network as well as our work and research.

Please check the following links for more details.
- GitHub: https://github.com/filedrive-team
- Twitter: https://twitter.com/FileDrive1
- Eventbrite: https://www.eventbrite.hk/o/filedrive-labs-42456337463
- YouTube Channel: https://www.youtube.com/channel/UCxcZC1dtBUlQvZY7DX13W1w
- Medium: https://medium.com/@FileDrive1

What is the primary source of funding for this project?

FileDrive Labs, rewards from the Slingshot Competition, Filecoin DevGrants, Microgrants, and a series of hackathons.

What other projects/ecosystem stakeholders is this project associated with?

FileDrive Datasets is an open dataset platform on the IPFS network, and all data will be stored on the Filecoin network through the Filecoin Plus project. The primary ecosystem stakeholders are therefore IPFS and Filecoin.

Use-case details

Describe the data being stored onto Filecoin

FileDrive Datasets Landing Plan #1
- Datasets: 6
- Total data capacity: 2451.1TiB

List of Datasets in #1:

1. ZINC Database
- 3D models for molecular docking screens.
- Size: 924.5 TiB

2. Transiting Exoplanet Survey Satellite (TESS)
- The Transiting Exoplanet Survey Satellite (TESS) is a multi-year survey that will discover exoplanets in orbit around bright stars across the entire sky using high-precision photometry. The survey will also enable a wide variety of stellar astrophysics, solar system science, and extragalactic variability studies. More information about TESS is available at MAST and the TESS Science Support Center.
- Size: 285.6 TiB

3. Smithsonian Open Access
- The Smithsonian’s mission is the "increase and diffusion of knowledge" and has been collecting since 1846. The Smithsonian, through its efforts to digitize its multidisciplinary collections, has created millions of digital assets and related metadata describing the collection objects. On February 25th, 2020, the Smithsonian released over 2.8 million CC0 interdisciplinary 2-D and 3-D images, related metadata, and additionally, research data from researchers across the Smithsonian. The 2.8 million "open access" collections are a subset of the Smithsonian’s 155 million objects, 2.1 million library volumes and 156,000 cubic feet of archival collections held in 19 museums, 9 research centers, libraries, archives and the National Zoo. Digitization of collections is ongoing.
- Size: 621.2 TiB

4. Community Earth System Model v2 ARISE (CESM2 ARISE)
- Data from ARISE-SAI Experiments with CESM2
- Size: 263.5 TiB

5. 3DCoMPaT: Composition of Materials on Parts of 3D Things
- 3D CoMPaT is a richly annotated large-scale dataset of rendered compositions of Materials on Parts of thousands of unique 3D Models. This dataset primarily focuses on stylizing 3D shapes at part-level with compatible materials. Each object with the applied part-material compositions is rendered from four equally spaced views as well as four randomized views. We introduce a new task, called Grounded CoMPaT Recognition (GCR), to collectively recognize and ground compositions of materials on parts of 3D objects. We present two variations of this task and adapt state-of-art 2D/3D deep learning methods to solve the problem as baselines for future research. We hope our work will help ease future research on compositional 3D Vision.
- Size: 42.8 TiB

6. Reference Elevation Model of Antarctica (REMA)
- The Reference Elevation Model of Antarctica - 2m GSD Digital Elevation Models (DEMs) and mosaics from 2009 to the present. The REMA project seeks to fill the need for high-resolution time-series elevation data in the Antarctic. The time-dependent nature of the strip DEM files allows users to perform change detection analysis and to compare observations of topography data acquired in different seasons or years. The mosaic DEM tiles are assembled from multiple strip DEMs with the intention of providing a more consistent and comprehensive product over large areas. REMA data is constructed from in-track and cross-track high-resolution (~0.5 meter) imagery acquired by the Maxar constellation of optical imaging satellites.
- Size: 313.5 TiB
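
As a quick consistency check, the six sizes listed above can be summed and compared with the stated total of 2451.1 TiB. The sketch below uses only the figures from the list; the shortened dataset names are labels chosen here, and the PiB conversion assumes binary units (1 PiB = 1024 TiB).

```python
# Dataset sizes in TiB, copied from the list above (names shortened).
DATASET_SIZES_TIB = {
    "ZINC Database": 924.5,
    "TESS": 285.6,
    "Smithsonian Open Access": 621.2,
    "CESM2 ARISE": 263.5,
    "3DCoMPaT": 42.8,
    "REMA": 313.5,
}

TIB_PER_PIB = 1024  # assumption: binary units

# Round to one decimal to absorb floating-point noise in the sum.
total_tib = round(sum(DATASET_SIZES_TIB.values()), 1)
total_pib = total_tib / TIB_PER_PIB

print(f"{len(DATASET_SIZES_TIB)} datasets, {total_tib} TiB (~{total_pib:.2f} PiB)")
```

The sum agrees with the "Total data capacity" figure given for Landing Plan #1.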

Where was the data in this dataset sourced from?

All data is from public open datasets.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

FileDrive Datasets: 
https://datasets.filedrive.io/

Original Source:
https://registry.opendata.aws/zinc15/
https://registry.opendata.aws/tess/
https://registry.opendata.aws/smithsonian-open-access/
https://registry.opendata.aws/ncar-cesm2-arise/
https://registry.opendata.aws/3dcompat/
https://registry.opendata.aws/pgc-rema/

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, it is. All data can be retrieved by anyone on Filecoin Network.

What is the expected retrieval frequency for this data?

The data in FileDrive Datasets will be pinned on the IPFS network before being stored on Filecoin, which means users will mainly have two ways to retrieve data: through IPFS or through Filecoin. The retrieval frequency therefore depends on users' needs.

For how long do you plan to keep this dataset stored on Filecoin?

This data will be stored for at least 1 year on Filecoin, so the verified deals will use a 1-year minimum deal duration (from 356 to 530 days).
Ideally, this project will be a permanent archive on the Filecoin network, as long as there are actual users and data requirements.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

All regions, as long as data transmission can be stable and successful.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Through both online and offline storage deals. A data transfer strategy may be considered in certain situations.
The expected data onboarding rate is 500 TiB per week, which is roughly 71 TiB per day.
However, this rate will be influenced by factors such as the speed of data transmission (especially inter-regional transmission), the daily power growth of storage providers, base fees, and equipment performance. For these reasons, the actual data onboarding rate may differ from our expectations.
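
The rate arithmetic can be sketched as follows. The 500 TiB weekly rate is from the answer above; the 5 PiB total is the amount requested in the DataCap request trigger later in this thread. The week count is only a lower bound under the stated rate and ignores the slowdowns the applicant mentions.

```python
TIB_PER_PIB = 1024  # assumption: binary units

weekly_rate_tib = 500  # expected onboarding rate from the application
requested_pib = 5      # total DataCap requested in this issue

daily_rate_tib = weekly_rate_tib / 7
weeks_needed = requested_pib * TIB_PER_PIB / weekly_rate_tib

print(f"~{daily_rate_tib:.1f} TiB/day, ~{weeks_needed:.2f} weeks for {requested_pib} PiB")
```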

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

Main criteria for choosing storage providers:
- location
- transmission speed
- deal success rate
- previous experience of real data storage
- stability of their nodes
- reputation score

How will you be distributing deals across storage providers?

All storage deals will be verified deals, provided the storage providers have enough FIL to pledge and their equipment can handle the load.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes, we do.
Suggestions and feedback can help us optimize this project and cooperate with more great storage providers from all over the world.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

raghavrmadya commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

250TiB

Id

a6ea69b5-ee1d-429e-8393-57b350d12299

newwebgroup commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecww5jshyi3airyqg2ghvim2jhwrg2yvvdjf5xidvpxavlsehohxo

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

250.00TiB

Signer Address

f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq

Id

a6ea69b5-ee1d-429e-8393-57b350d12299

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecww5jshyi3airyqg2ghvim2jhwrg2yvvdjf5xidvpxavlsehohxo

Joss-Hua commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceagww7xkpqhtn6iwe3b4xz3ij3mgg34lxpt634ppy3lya36yh4nha

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

250.00TiB

Signer Address

f1tfg54zzscugttejv336vivknmsnzzmyudp3t7wi

Id

a6ea69b5-ee1d-429e-8393-57b350d12299

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceagww7xkpqhtn6iwe3b4xz3ij3mgg34lxpt634ppy3lya36yh4nha
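
The two notary comments above share one allocation Id and carry different signer addresses. A minimal sketch of checking that pairing is shown below. The function name, the record layout, and the threshold of two distinct signers are assumptions made for illustration; they are not taken from the notary tooling.

```python
from collections import defaultdict

# (allocation id, action, signer address) copied from the two comments above.
MESSAGES = [
    ("a6ea69b5-ee1d-429e-8393-57b350d12299", "proposed",
     "f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq"),
    ("a6ea69b5-ee1d-429e-8393-57b350d12299", "approved",
     "f1tfg54zzscugttejv336vivknmsnzzmyudp3t7wi"),
]


def is_fully_signed(messages, allocation_id, required_signers=2):
    """Return True if the allocation has at least one proposal and one
    approval, from `required_signers` distinct signers (assumed threshold)."""
    signers_by_action = defaultdict(set)
    for alloc_id, action, signer in messages:
        if alloc_id == allocation_id:
            signers_by_action[action].add(signer)
    proposed = signers_by_action["proposed"]
    approved = signers_by_action["approved"]
    return bool(proposed) and bool(approved) and len(proposed | approved) >= required_signers


print(is_fully_signed(MESSAGES, "a6ea69b5-ee1d-429e-8393-57b350d12299"))
```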

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

500TiB

Id

766aa69a-9345-4f5a-992f-aad3663a586d

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ All storage providers are located in the same region.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f0118330 | Hong Kong, Central and Western, HK | 81.33 TiB | 53.86% | 81.33 TiB | 0.00% |
| f0134516 | Hong Kong, Central and Western, HK | 49.96 TiB | 33.08% | 49.96 TiB | 0.00% |
| f01227975 | Hong Kong, Central and Western, HK | 19.72 TiB | 13.06% | 19.72 TiB | 0.00% |
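
The percentages in this table follow from each provider's sealed volume divided by the total, and the region warning follows from all three rows sharing one location. The sketch below reproduces both from the table values. The helper names and the use of a country code for the region are illustrative choices, not the checker's actual implementation.

```python
# (provider, region, total deals sealed in TiB) from the table above.
PROVIDERS = [
    ("f0118330", "HK", 81.33),
    ("f0134516", "HK", 49.96),
    ("f01227975", "HK", 19.72),
]


def provider_shares(providers):
    """Percentage of all sealed data held by each provider, to 2 decimals."""
    total = sum(size for _, _, size in providers)
    return {pid: round(100 * size / total, 2) for pid, _, size in providers}


def single_region(providers):
    """True when every provider reports the same region."""
    return len({region for _, region, _ in providers}) == 1


shares = provider_shares(PROVIDERS)
print(shares)
print("all in one region:", single_region(PROVIDERS))
```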

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 13.03 TiB | 13.03 TiB | 1 | 8.63% |
| 68.99 TiB | 137.98 TiB | 2 | 91.37% |

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
| --- | --- | --- | --- | --- |
| f14uhjnqrocqcenbjfaergw2uvaimysi4snv2oepy | FileDrive Labs | 101.32 TiB | 1,620 | LDN v3 multisig |
| f1bycr5r3ymkgqvkuxoemgsmnuawyawptwj44mqdi | FileDrive Labs | 52.46 TiB | 1,635 | LDN v3 multisig |
| f3wgfwtrs5p6jrkwfl2mksqa2ivgbgdjjrhjbefy3n7qzvotc3y6sazmp5gfyj7um6jlgdvlbiepzawnc6wxtq | FileDrive Labs | 1.56 TiB | 30 | LDN v3 multisig |
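
CID sharing means that two client addresses have made deals for data resolving to the same CID. One way to detect it, given each client's set of deal CIDs, is a pairwise set intersection. This is a sketch of the idea only: the function name and the placeholder data are hypothetical, and the checker's real data source and logic are not described in this thread.

```python
from itertools import combinations


def shared_cids(deals_by_client):
    """Map each pair of clients to the set of CIDs both have made deals for."""
    overlaps = {}
    for (a, cids_a), (b, cids_b) in combinations(sorted(deals_by_client.items()), 2):
        common = set(cids_a) & set(cids_b)
        if common:
            overlaps[(a, b)] = common
    return overlaps


# Hypothetical placeholder data; not real client addresses or CIDs.
example = {
    "client-A": {"cid-1", "cid-2", "cid-3"},
    "client-B": {"cid-2", "cid-3"},
    "client-C": {"cid-9"},
}
print(shared_cids(example))
```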

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

newwebgroup commented 1 year ago

CID Checker looks good

newwebgroup commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacea6lsurjeqtg7fcyqv6e3znni66wptg3z25cc3hzvrykxovyebsva

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

500.00TiB

Signer Address

f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq

Id

766aa69a-9345-4f5a-992f-aad3663a586d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea6lsurjeqtg7fcyqv6e3znni66wptg3z25cc3hzvrykxovyebsva

kernelogic commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceabuoqb5ud4a4zysgpaajdo7cdws33kumpbcdanjatn25w266kqio

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

500.00TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

766aa69a-9345-4f5a-992f-aad3663a586d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceabuoqb5ud4a4zysgpaajdo7cdws33kumpbcdanjatn25w266kqio

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

1000.0TiB

Id

f15e2778-0b28-4a08-822c-da58fd9f015d

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Last two approvers

kernelogic & newwebgroup

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

1000.0TiB

Total DataCap granted for client so far

250TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.75PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 8000 | 3 | 500TiB | 40.25 | 120GiB |

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

⚠️ f0118330 has sealed 40.25% of total datacap.

⚠️ f01227975 has sealed 39.76% of total datacap.

⚠️ All storage providers are located in the same region.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f0118330 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 100.57 TiB | 40.25% | 100.57 TiB | 0.00% |
| f01227975 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 99.36 TiB | 39.76% | 99.36 TiB | 0.00% |
| f0134516 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 49.96 TiB | 19.99% | 49.96 TiB | 0.00% |

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 50.00 TiB | 50.00 TiB | 1 | 20.01% |
| 99.94 TiB | 199.88 TiB | 2 | 79.99% |

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Approvers |
| --- | --- | --- | --- | --- |
| f1bycr5r3ymkgqvkuxoemgsmnuawyawptwj44mqdi | FileDrive Labs | 200.57 TiB | 4,800 | 1 Joss-Hua<br/>1 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f14uhjnqrocqcenbjfaergw2uvaimysi4snv2oepy | FileDrive Labs | 101.32 TiB | 1,620 | 1 Joss-Hua<br/>1 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f3wgfwtrs5p6jrkwfl2mksqa2ivgbgdjjrhjbefy3n7qzvotc3y6sazmp5gfyj7um6jlgdvlbiepzawnc6wxtq | FileDrive Labs | 1.56 TiB | 30 | 1 GaryGJG<br/>1 IreneYoung<br/>3 Joss-Hua<br/>1 liyunzhi-666<br/>1 MegTei<br/>1 MetaWaveInfo<br/>3 newwebgroup<br/>2 psh0691 |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

herrehesse commented 1 year ago

Dear Applicant,

Due to the increased amount of erroneous Filecoin+ data recently, we feel compelled, on behalf of the entire community, to look more deeply into DataCap requests. We do this to ensure that the overall value of the Filecoin network and the Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple email and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

Should this be solely for acquiring DataCap, it is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended as a program to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

herrehesse commented 1 year ago

@newwebgroup Could you explain to me why you signed this while the applicant has stored all files in the same region which is not allowed by Filecoin+ rules?

cryptowhizzard commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacea4s4iyur6ttcy7dqnwkiazqatqyilx6opt2yca5muz45smr4bh46

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1000.00TiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

f15e2778-0b28-4a08-822c-da58fd9f015d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea4s4iyur6ttcy7dqnwkiazqatqyilx6opt2yca5muz45smr4bh46

kernelogic commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedrmzdxr3reivxqoluvykj4lealvrcei6llmz3zbfcysw7jhr2x2i

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1000.00TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

f15e2778-0b28-4a08-822c-da58fd9f015d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedrmzdxr3reivxqoluvykj4lealvrcei6llmz3zbfcysw7jhr2x2i

kernelogic commented 1 year ago

DD done in #1266

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

1.95PiB

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Last two approvers

kernelogic & cryptowhizzard

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.95PiB
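
The allocation sizes in this thread are consistent with a doubling schedule based on the 500 TiB weekly rate: 250 TiB, 500 TiB, 1000 TiB under the "200%" rule, then 1.95 PiB under the "400%" rule. The sketch below reproduces those figures. The 50% and 100% multipliers for the first two requests and the cap at the outstanding amount are inferences from the numbers here, not rules stated in this thread, and the function name is hypothetical.

```python
TIB_PER_PIB = 1024  # assumption: binary units


def tranche_sizes_tib(weekly_tib, total_tib, multipliers=(0.5, 1, 2, 4, 8)):
    """Yield successive allocation sizes: multiplier * weekly rate,
    capped at whatever is still outstanding (assumed behaviour)."""
    remaining = total_tib
    for m in multipliers:
        if remaining <= 0:
            break
        size = min(m * weekly_tib, remaining)
        remaining -= size
        yield size


sizes = list(tranche_sizes_tib(weekly_tib=500, total_tib=5 * TIB_PER_PIB))
for n, size in enumerate(sizes, start=1):
    print(f"request {n}: {size:g} TiB (~{size / TIB_PER_PIB:.2f} PiB)")
```

Under these assumptions a fifth request would be capped at 1370 TiB, about 1.34 PiB, which matches request number 5 later in the thread.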

Total DataCap granted for client so far

750TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.26PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 21225 | 5 | 1000.0TiB | 26.77 | 87.22TiB |

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

✔️ Storage provider distribution looks healthy.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01227975 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 165.26 TiB | 25.56% | 165.26 TiB | 0.00% |
| f0118330 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 163.57 TiB | 25.30% | 163.57 TiB | 0.00% |
| f0134516 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 118.88 TiB | 18.39% | 118.88 TiB | 0.00% |
| f01993339 | Singapore, Singapore, SG<br/>Amazon.com, Inc. | 99.07 TiB | 15.32% | 99.04 TiB | 0.03% |
| f01984580 | Singapore, Singapore, SG<br/>HUAWEI CLOUDS | 99.82 TiB | 15.44% | 99.79 TiB | 0.03% |

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

⚠️ 69.62% of deals are for data replicated across less than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 183.70 TiB | 183.70 TiB | 1 | 28.41% |
| 130.84 TiB | 261.75 TiB | 2 | 40.48% |
| 1.56 TiB | 4.69 TiB | 3 | 0.72% |
| 49.11 TiB | 196.45 TiB | 4 | 30.38% |
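
The 69.62% warning above can be recomputed from this table as the share of total deal volume stored with fewer than four providers. The sketch below takes that reading of the metric and uses the table values directly; the function name and the `min_providers` parameter are illustrative rather than taken from the checker.

```python
# (unique data TiB, total deals TiB, number of providers) from the table above.
REPLICATION = [
    (183.70, 183.70, 1),
    (130.84, 261.75, 2),
    (1.56, 4.69, 3),
    (49.11, 196.45, 4),
]


def under_replicated_pct(rows, min_providers=4):
    """Share of total deal volume whose data sits on fewer than
    `min_providers` storage providers, as a percentage."""
    total = sum(deals for _, deals, _ in rows)
    low = sum(deals for _, deals, n in rows if n < min_providers)
    return round(100 * low / total, 2)


print(under_replicated_pct(REPLICATION))
```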

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Approvers |
| --- | --- | --- | --- | --- |
| f14uhjnqrocqcenbjfaergw2uvaimysi4snv2oepy | FileDrive Labs | 897.10 TiB | 9,587 | 1 1ane-1<br/>1 cryptowhizzard<br/>1 Joss-Hua<br/>2 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f1bycr5r3ymkgqvkuxoemgsmnuawyawptwj44mqdi | FileDrive Labs | 448.24 TiB | 8,000 | 1 1ane-1<br/>1 cryptowhizzard<br/>1 Joss-Hua<br/>2 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f3wgfwtrs5p6jrkwfl2mksqa2ivgbgdjjrhjbefy3n7qzvotc3y6sazmp5gfyj7um6jlgdvlbiepzawnc6wxtq | FileDrive Labs | 1.72 TiB | 35 | 1 GaryGJG<br/>1 IreneYoung<br/>3 Joss-Hua<br/>1 liyunzhi-666<br/>1 MegTei<br/>1 MetaWaveInfo<br/>3 newwebgroup<br/>2 psh0691 |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

BDEio commented 1 year ago

@laurarenpanda Hi! Congratulations on your DataCap approval! BDE is a verified deals auction house helping you to get paid storing your data with reliable storage providers. If you need any help, please get in touch.

cryptowhizzard commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecpqhtnjg3kcmcl7wzzvsbflzd5w6qjpvpnppk6byd6zslczm6534

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.95PiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecpqhtnjg3kcmcl7wzzvsbflzd5w6qjpvpnppk6byd6zslczm6534

kernelogic commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaokeheklsybhkorzo33f2gti4dwq6tssi46ggzffr5rt4zsdhbwe

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.95PiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceaokeheklsybhkorzo33f2gti4dwq6tssi46ggzffr5rt4zsdhbwe

MEIYAN666 commented 1 year ago

checker:manualTrigge

cryptowhizzard commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaced55aknssyi4f6i7thvqkqyhasofgganb4bcm7a5mqjqj7y2kwnbw

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.95PiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaced55aknssyi4f6i7thvqkqyhasofgganb4bcm7a5mqjqj7y2kwnbw

xiaoyuaiheshui commented 1 year ago

checker:manualTrigger

cryptowhizzard commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacec76wpvdg5sjda7fvrf225xexewcay5mgse55oioxdh6nmi6txr46

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.95PiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacec76wpvdg5sjda7fvrf225xexewcay5mgse55oioxdh6nmi6txr46

NDLABS-Leo commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

✔️ Storage provider distribution looks healthy.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01228105 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 243.58 TiB | 5.55% | 243.58 TiB | 0.00% |
| f01228100 | San Jose, California, US<br/>Alibaba (US) Technology Co., Ltd. | 242.80 TiB | 5.53% | 242.80 TiB | 0.00% |
| f01228089 | Frankfurt am Main, Hesse, DE<br/>Alibaba (US) Technology Co., Ltd. | 239.64 TiB | 5.46% | 239.64 TiB | 0.00% |
| f01227975 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 236.41 TiB | 5.39% | 236.41 TiB | 0.00% |
| f01228087 | London, England, GB<br/>Alibaba (US) Technology Co., Ltd. | 193.94 TiB | 4.42% | 193.94 TiB | 0.00% |
| f0118330 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 163.57 TiB | 3.73% | 163.57 TiB | 0.00% |
| f0134516 | Hong Kong, Central and Western, HK<br/>Alibaba (US) Technology Co., Ltd. | 149.94 TiB | 3.42% | 149.94 TiB | 0.00% |
| f01228000 | Seoul, Seoul, KR<br/>Alibaba (US) Technology Co., Ltd. | 100.00 TiB | 2.28% | 100.00 TiB | 0.00% |
| f0522948 | Singapore, Singapore, SG<br/>Alibaba (US) Technology Co., Ltd. | 95.86 TiB | 2.19% | 95.86 TiB | 0.00% |
| f0867300 | Tokyo, Tokyo, JP<br/>Alibaba (US) Technology Co., Ltd. | 82.70 TiB | 1.89% | 82.70 TiB | 0.00% |
| f01228008 | Sydney, New South Wales, AU<br/>Alibaba (US) Technology Co., Ltd. | 70.28 TiB | 1.60% | 70.28 TiB | 0.00% |
| f01984593 | Ashburn, Virginia, US<br/>Amazon.com, Inc. | 299.97 TiB | 6.84% | 299.97 TiB | 0.00% |
| f01975338 | Tokyo, Tokyo, JP<br/>Amazon.com, Inc. | 299.97 TiB | 6.84% | 299.97 TiB | 0.00% |
| f01975316 | Ashburn, Virginia, US<br/>Amazon.com, Inc. | 299.94 TiB | 6.84% | 299.94 TiB | 0.00% |
| f01975336 | Seoul, Seoul, KR<br/>Amazon.com, Inc. | 299.94 TiB | 6.84% | 299.94 TiB | 0.00% |
| f01975326 | Montréal, Quebec, CA<br/>Amazon.com, Inc. | 299.94 TiB | 6.84% | 299.94 TiB | 0.00% |
| f01993339 | Singapore, Singapore, SG<br/>Amazon.com, Inc. | 169.63 TiB | 3.87% | 169.60 TiB | 0.02% |
| f01993388 | Boardman, Oregon, US<br/>Amazon.com, Inc. | 125.08 TiB | 2.85% | 125.08 TiB | 0.00% |
| f01984576 | Singapore, Singapore, SG<br/>HUAWEI CLOUDS | 299.97 TiB | 6.84% | 299.97 TiB | 0.00% |
| f01984586 | Bangkok, Bangkok, TH<br/>HUAWEI CLOUDS | 299.97 TiB | 6.84% | 299.97 TiB | 0.00% |
| f01984580 | Singapore, Singapore, SG<br/>HUAWEI CLOUDS | 173.91 TiB | 3.96% | 173.82 TiB | 0.05% |

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

⚠️ 39.53% of deals are for data replicated across less than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 194.48 TiB | 194.48 TiB | 1 | 4.43% |
| 282.30 TiB | 564.72 TiB | 2 | 12.87% |
| 324.99 TiB | 974.96 TiB | 3 | 22.22% |
| 164.24 TiB | 656.95 TiB | 4 | 14.97% |
| 31.08 TiB | 155.39 TiB | 5 | 3.54% |
| 6.28 TiB | 43.97 TiB | 7 | 1.00% |
| 99.98 TiB | 799.88 TiB | 8 | 18.23% |
| 32.00 GiB | 288.00 GiB | 9 | 0.01% |
| 34.16 TiB | 341.56 TiB | 10 | 7.79% |
| 59.53 TiB | 654.84 TiB | 11 | 14.93% |

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Approvers |
| --- | --- | --- | --- | --- |
| f1bycr5r3ymkgqvkuxoemgsmnuawyawptwj44mqdi | FileDrive Labs | 2.91 PiB | 20,016 | 1 1ane-1<br/>2 cryptowhizzard<br/>1 Joss-Hua<br/>2 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f14uhjnqrocqcenbjfaergw2uvaimysi4snv2oepy | FileDrive Labs | 1.87 PiB | 20,274 | 1 1ane-1<br/>3 cryptowhizzard<br/>1 Joss-Hua<br/>3 kernelogic<br/>1 NDLABS-OFFICE<br/>1 newwebgroup<br/>1 steven004 |
| f3wgfwtrs5p6jrkwfl2mksqa2ivgbgdjjrhjbefy3n7qzvotc3y6sazmp5gfyj7um6jlgdvlbiepzawnc6wxtq | FileDrive Labs | 2.34 TiB | 55 | 1 GaryGJG<br/>1 IreneYoung<br/>3 Joss-Hua<br/>1 liyunzhi-666<br/>1 MegTei<br/>1 MetaWaveInfo<br/>3 newwebgroup<br/>2 psh0691 |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

stcloudlisa commented 1 year ago

I would like to support them for the following reasons:

- Data can be retrieved
- Reasonable SP distribution

stcloudlisa commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaj23pmmqwpkpufg4zl6zl5f6qejgrtja7t5kzrloio33lh6c2j5g

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.95PiB

Signer Address

f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci

Id

875382f2-8fc6-43a4-acd0-173ea93ded27

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceaj23pmmqwpkpufg4zl6zl5f6qejgrtja7t5kzrloio33lh6c2j5g

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 5

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

1.34PiB

Id

daccf34d-fbb3-4a9d-8230-89a335fe35cb

Joss-Hua commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecaxj6irfhen3y67z22fzbfamur7fibi6s47zdw5gtwy2q3obw27m

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.34PiB

Signer Address

f1tfg54zzscugttejv336vivknmsnzzmyudp3t7wi

Id

daccf34d-fbb3-4a9d-8230-89a335fe35cb

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecaxj6irfhen3y67z22fzbfamur7fibi6s47zdw5gtwy2q3obw27m

cryptowhizzard commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacea6jxemjoyd4mpqpuq244hfj4d4svejbya43rbwvsumwlnulnv6se

Address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Datacap Allocated

1.34PiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

daccf34d-fbb3-4a9d-8230-89a335fe35cb

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea6jxemjoyd4mpqpuq244hfj4d4svejbya43rbwvsumwlnulnv6se

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 6

Multisig Notary address

f02049625

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

DataCap allocation requested

1.03TiB

Id

5dc9969b-1bcc-4e1d-b1f9-b8281d3060f4

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1sejgqbuwsf74qifuxqykwotyu5aswuwhubxghqa

Last two approvers

cryptowhizzard & Joss-Hua

Rule to calculate the allocation request amount

800% of weekly dc amount requested

DataCap allocation requested

1.03TiB

Total DataCap granted for client so far

7.55PiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

-2881160269424231B
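
The negative byte count is easier to read in PiB. Assuming binary units (1 PiB = 2^50 bytes), it converts to roughly -2.56 PiB, which is in line with the 5 PiB requested minus the 7.55 PiB the bot reports as granted so far.

```python
BYTES_PER_PIB = 2 ** 50  # assumption: binary units

remaining_bytes = -2881160269424231  # value printed by the bot above
remaining_pib = remaining_bytes / BYTES_PER_PIB

print(f"{remaining_pib:.2f} PiB")
```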

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 224380 | 21 | 1.34PiB | 7.59 | 172.08TiB |

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 35.91% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

⚠️ CID sharing has been observed. (Top 3)

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...
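
The two trigger forms described in the footnotes, the bare keyword and the keyword followed by related addresses, can be parsed with a short sketch like the one below. The regular expression and the function name are assumptions for illustration; how the checker bot actually matches comments is not documented in this thread.

```python
import re

# Assumed pattern: the literal keyword, optionally followed by addresses.
TRIGGER_RE = re.compile(r"^checker:manualTrigger(?:\s+(?P<addrs>\S.*))?$")


def parse_trigger(comment):
    """Return the list of extra addresses if `comment` is a trigger
    command, or None if it is not one."""
    match = TRIGGER_RE.match(comment.strip())
    if match is None:
        return None
    return (match.group("addrs") or "").split()


print(parse_trigger("checker:manualTrigger"))
print(parse_trigger("checker:manualTrigger f1aaa f1bbb"))  # placeholder addresses
print(parse_trigger("checker:manualTrigge"))
```

With this pattern a misspelled keyword such as `checker:manualTrigge` is not recognised as a trigger.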

Full report

Click here to view the full report.

data-programs commented 1 year ago

KYC

This user’s identity has been verified through filplus.storage