Closed conneygabriele closed 1 year ago
Thanks for your request!
Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
Thanks for your request!
Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
Thanks for your request!
Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
Hi, thanks for providing holistic data samples. Can you share how many copies you plan to store? Such information will assist the notaries in their due diligence process.
Total DataCap requested
5PiB
Expected weekly DataCap usage rate
200TiB
Client address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
f02049625
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
100TiB
Additionally, if you can share the SPs you have been in touch with and the associated IDs, it will help gain community confidence
I'd love to know how you get to the 5PiB estimate!
Thanks for RG and Dannyob's attention and reply About this dataset It is divided into: YouTube source video+material: 70TIB+ My Youtube account has been in operation for 4 years, with 51900 subscribers and 3938806 views We uploaded about 1800 videos In addition to Youtube, I have produced teaching videos on many social media platforms in China, including BiliBili, Youku, Tiktok, watermelon videos, etc
Shanghai International Community College: 300TIB+ We have set up a blockchain finance training school "Shanghai International Community College" in China and the Shanghai government of China. The college was established in December 2017. Robertli is the dean of the college. In the past five years, we have cooperated with Shanghai Jiaotong University, Fudan University, Tsinghua University, Shanghai University of Foreign Trade, Hong Kong Business School, Malaysia Client University and other universities to jointly launch courses and textbooks. It has trained senior talents in blockchain and digital economy, and promoted the research, development, application and popularization of blockchain technology.
Our Dataset Sources 518 blockchain salon (once a week for 6 years) Blockchain Intermediate Qualification Certificate Training Course Video Blockchain Digital Economy Direction Doctor of Business Administration (DBA) Course Video Course related textbooks Project research topic To sum up, our existing data is about 400TIB. In addition to some sector fills, we plan to do 8 backups worldwide, so we need 5PIB quota.
Our monthly data increment is about 10TIB+ —————— The SPs we are currently in contact with include: Great heat, New web group, water drop cloud, Chain up cloud, F3, SXX and other Filecoin technology service companies. In addition, we plan to find more and better SPs through BDE and Slack Xi'an: f01907545 Shenzhen: f01099999, f01412203 Hong Kong: f01128206, f01885088, f01851482 Singapore: f01877571 Germany: f01926792 Australia: f01928022
8 copies of 400TiB adds up to 3.1 PiBs, not 5.PiBS
As there is a discrepancy in the information provided, this application is being closed. Please open a new application with the correct request as well as link this application
Please rectify the information in the LDN for notaries to proceed with due diligence
Hi RG, Sorry, I'm a little busy these days. Our current dataset is 400TIB, but for the sake of data security and dispersion. After communication with SP, we plan to fill 20G - 25G data in each 32G sector. The rest is filled with data. This operation is compliant So we need a copy of (400TIB 1024)/(20G~25G) 32G/1024 ≈ 640TIB - 512TIB 8 replicas are required 640TIB ~ 512TIB * 8 = 5120TIB ~ 4096TIB Considering the growth of data sets in the packaging period, we hope to have 5PIB storage space.
Robertli contacted us and we met through Zoom. We reviewed the data samples and relevant qualifications, and we are willing to support them in the first round.
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzacedwg7wfruca7peo3z5oivzrcfoybbakp4emaeykdjidxnkqfj4xrs
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
100.00TiB
Signer Address
f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq
Id
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedwg7wfruca7peo3z5oivzrcfoybbakp4emaeykdjidxnkqfj4xrs
Your Datacap Allocation Request has been approved by the Notary
bafy2bzacebw6tdxlh35gtuhgzk5nfabrrwnpsc7v5zbcwbuyfeneuc4londkg
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
100.00TiB
Signer Address
f1a2lia2cwwekeubwo4nppt4v4vebxs2frozarz3q
Id
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebw6tdxlh35gtuhgzk5nfabrrwnpsc7v5zbcwbuyfeneuc4londkg
The material is convincing for now and I will check the distribution later on.
@conneygabriele Hi! Great to see you have gotten approval for DataCap and advancing the mission of preserving humanity’s most important information. If you are looking for storage providers to store these data or have any questions, please visit #bigdata-exchange on Filecoin Slack or reply here.
We have strong demand from a diverse group of SPs, who are actively looking to onboard more data.
f02049625
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
200TiB
43f50030-a755-4a4e-bc70-80ca548e9eb9
f01858410
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
xingjitansuo & newwebgroup
100% of weekly dc amount requested
200TiB
100TiB
4.90PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
2328 | 3 | 100TiB | 34.36 | 2.25TiB |
BLOCKCHAIN METAVERSE ACADEMY PTY LTD
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1
newwebgroup1
xingjitansuo
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
Since this is the 3rd allocation, the following restrictions have been relaxed:
⚠️ f01566485 has unknown IP location.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f01566485 | UnknownUnknown |
25.00 TiB | 59.22% | 25.00 TiB | 0.00% |
f0469055 | Hong Kong, Central and Western, HKANLIAN NETWORK TECHNOLOGY CO., LIMITED |
640.00 GiB | 1.48% | 640.00 GiB | 0.00% |
f01520487 | Xiamen, Fujian, CNChina Mobile Communications Group Co., Ltd. |
11.78 TiB | 27.91% | 11.78 TiB | 0.00% |
f01699999 | Zhongshan, Guangdong, CNChina Unicom IP network China169 Guangdong province |
4.81 TiB | 11.40% | 4.81 TiB | 0.00% |
The below table shows how each many unique data are replicated across storage providers.
Since this is the 3rd allocation, the following restrictions have been relaxed:
⚠️ 94.45% of deals are for data replicated across less than 3 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
32.38 TiB | 32.38 TiB | 1 | 76.68% |
3.75 TiB | 7.50 TiB | 2 | 17.76% |
800.00 GiB | 2.34 TiB | 3 | 5.55% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.
✔️ No CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzacea54zxrf7syxo5gqrsprxafhmxql2cokkavbqzwgppaqzicpk4tzc
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
200.00TiB
Signer Address
f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa
Id
43f50030-a755-4a4e-bc70-80ca548e9eb9
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea54zxrf7syxo5gqrsprxafhmxql2cokkavbqzwgppaqzicpk4tzc
There is no problem with the number of SPs and data checks, but your data is too centralized, which is not in line with the filecoin principle. I will support you this time, but you need to adjust the subsequent packaging to ensure that the data is evenly distributed.
checker:manualTrigger
BLOCKCHAIN METAVERSE ACADEMY PTY LTD
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1
kernelogic1
newwebgroup1
xingjitansuo
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
Since this is the 3rd allocation, the following restrictions have been relaxed:
⚠️ f01566485 has unknown IP location.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f01566485 | UnknownUnknown |
25.00 TiB | 44.22% | 25.00 TiB | 0.00% |
f0469055 | Hong Kong, Central and Western, HKANLIAN NETWORK TECHNOLOGY CO., LIMITED |
1.28 TiB | 2.27% | 1.28 TiB | 0.00% |
f01520487 | Xiamen, Fujian, CNChina Mobile Communications Group Co., Ltd. |
23.47 TiB | 41.51% | 23.47 TiB | 0.00% |
f01699999 | Zhongshan, Guangdong, CNChina Unicom IP network China169 Guangdong province |
6.78 TiB | 12.00% | 6.78 TiB | 0.00% |
The below table shows how each many unique data are replicated across storage providers.
Since this is the 3rd allocation, the following restrictions have been relaxed:
⚠️ 93.53% of deals are for data replicated across less than 3 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
38.25 TiB | 38.25 TiB | 1 | 67.66% |
7.31 TiB | 14.63 TiB | 2 | 25.87% |
1.22 TiB | 3.66 TiB | 3 | 6.47% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.
✔️ No CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
Your Datacap Allocation Request has been approved by the Notary
bafy2bzaceckqxndthnwwvublk33bh7d3764s7ahslts6z3uaczzln6yqhngei
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
200.00TiB
Signer Address
f1fg6jkxsr3twfnyhdlatmq36xca6sshptscds7xa
Id
43f50030-a755-4a4e-bc70-80ca548e9eb9
You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceckqxndthnwwvublk33bh7d3764s7ahslts6z3uaczzln6yqhngei
f02049625
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
400TiB
938f9520-a523-4b5c-82be-609b031ff651
f01858410
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
fireflyHZ & kernelogic
200% of weekly dc amount requested
400TiB
100TiB
4.90PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
3192 | 4 | 200TiB | 25.94 | 256GiB |
BLOCKCHAIN METAVERSE ACADEMY PTY LTD
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1
kernelogic1
newwebgroup1
xingjitansuo1
YuanHeHK
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
⚠️ f01566485 has sealed 44.22% of total datacap.
⚠️ f01566485 has unknown IP location.
⚠️ f01520487 has sealed 41.51% of total datacap.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f01566485 | UnknownUnknown |
25.00 TiB | 44.22% | 25.00 TiB | 0.00% |
f0469055 | Hong Kong, Central and Western, HKANLIAN NETWORK TECHNOLOGY CO., LIMITED |
1.28 TiB | 2.27% | 1.28 TiB | 0.00% |
f01520487 | Xiamen, Fujian, CNChina Mobile Communications Group Co., Ltd. |
23.47 TiB | 41.51% | 23.47 TiB | 0.00% |
f01699999 | Zhongshan, Guangdong, CNChina Unicom IP network China169 Guangdong province |
6.78 TiB | 12.00% | 6.78 TiB | 0.00% |
The below table shows how each many unique data are replicated across storage providers.
⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
38.25 TiB | 38.25 TiB | 1 | 67.66% |
7.31 TiB | 14.63 TiB | 2 | 25.87% |
1.22 TiB | 3.66 TiB | 3 | 6.47% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.
✔️ No CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
Dear Applicant,
Due to the increased amount of erroneous/wrong Filecoin+ data recently, on behalf of the entire community, we feel compelled to go deeper into datacap requests. Hereby to ensure that the overall value of the Filecoin network and Filecoin+ program increases and is not abused.
Please answer the questions below as comprehensively as possible.
Customer data
We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple email and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.
Did the customer specify the amount of data involved in this relevant correspondence?
Why does the customer in question want to use the Filecoin+ program?
Should this only be soley for acquiring datacap this is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program which is intended as a program to store useful and public datasets on the network.
(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)
Files and Processing
Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.
checker:manualTrigger
BLOCKCHAIN METAVERSE ACADEMY PTY LTD
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1
kernelogic1
newwebgroup1
xingjitansuo1
YuanHeHK
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
⚠️ f01520487 has sealed 30.96% of total datacap.
⚠️ f01527777 has sealed 33.90% of total datacap.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f0469055 | Hong Kong, Central and Western, HKANLIAN NETWORK TECHNOLOGY CO., LIMITED |
1.28 TiB | 0.74% | 1.28 TiB | 0.00% |
f01520487 | Xiamen, Fujian, CNChina Mobile Communications Group Co., Ltd. |
53.97 TiB | 30.96% | 53.97 TiB | 0.00% |
f01527777 | Nanning, Guangxi, CNChina Telecom |
59.09 TiB | 33.90% | 59.09 TiB | 0.00% |
f01699999 | Zhongshan, Guangdong, CNChina Unicom IP network China169 Guangdong province |
34.97 TiB | 20.06% | 34.97 TiB | 0.00% |
f01566485 | Shenzhen, Guangdong, CNCHINANET-BACKBONE |
25.00 TiB | 14.34% | 25.00 TiB | 0.00% |
The below table shows how each many unique data are replicated across storage providers.
⚠️ 91.61% of deals are for data replicated across less than 4 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
23.16 TiB | 23.16 TiB | 1 | 13.28% |
25.75 TiB | 51.50 TiB | 2 | 29.54% |
28.34 TiB | 85.03 TiB | 3 | 48.78% |
3.66 TiB | 14.63 TiB | 4 | 8.39% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.
✔️ No CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
A client came to me on slack asking for a review of their LDN, and so far everything looks good, so I'm willing to support it. I hope that other members of the community can also conduct information review, and I will continue to pay attention.
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzaceb5xihg4s3culxyt22wtzjwh6wqnxh2jxlomtcqymex635gsbpmhi
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
400.00TiB
Signer Address
f1yayfsv6whu3rheviucvventj3y6t542xfpb47ei
Id
938f9520-a523-4b5c-82be-609b031ff651
You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb5xihg4s3culxyt22wtzjwh6wqnxh2jxlomtcqymex635gsbpmhi
We are now 500 TiB on way with this datacap requests there are some questions to be raised here.
@conneygabriele i have some questions:
What software tools are you using to extract your Youtube video's or do you have a cold storage copy yourself? What software tools have been used to prepare this dataset for distribution until now? Are you distributing the dataset yourself or if not, who or which company is distributing it for you? How many TiB's of data have you packed already and do you have a list of CID's for us that are created?
Fil+ data should be stored in multiple regions. This data is stored only in the greater China region. Do you intent to store data on other continents according to the Fil+ guidelines? You mention offline shipping of harddrives. How do you intent to ship that data to other continents?
Fil+ is a community program where transparency is key as it is incentivized by us all. I am looking forward to your answers.
Your Datacap Allocation Request has been approved by the Notary
bafy2bzacebo2r6g7asdafhqthzajikbh5as2cygxptzed7sxwj4uisxqyc2bq
Address
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
Datacap Allocated
400.00TiB
Signer Address
f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci
Id
938f9520-a523-4b5c-82be-609b031ff651
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebo2r6g7asdafhqthzajikbh5as2cygxptzed7sxwj4uisxqyc2bq
@stcouldlisa no interest in any of the questions asked right?
@stcouldlisa why do you sign this? There are clearly things very wrong with the datacap usage of this applicant.
f02049625
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
800TiB
42bb9c17-a34f-4e1f-a75e-2e92e7ac5dd7
f01858410
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1LISA2 & not found
400% of weekly dc amount requested
800TiB
300TiB
4.70PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
7472 | 5 | 400TiB | 26.38 | 66.5TiB |
BLOCKCHAIN METAVERSE ACADEMY PTY LTD
f12k5kukgldua3ag5kzdhi27bxy52cvwf4kn7phfy
1
kernelogic1
NDLABS-OFFICE1
newwebgroup1
stcouldlisa1
xingjitansuo1
YuanHeHK
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
⚠️ f01520487 has sealed 30.96% of total datacap.
⚠️ f01527777 has sealed 33.88% of total datacap.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f0469055 | Hong Kong, Central and Western, HKANLIAN NETWORK TECHNOLOGY CO., LIMITED |
1.28 TiB | 0.73% | 1.28 TiB | 0.00% |
f01520487 | Xiamen, Fujian, CNChina Mobile Communications Group Co., Ltd. |
54.03 TiB | 30.96% | 54.03 TiB | 0.00% |
f01527777 | Nanning, Guangxi, CNChina Telecom |
59.13 TiB | 33.88% | 59.13 TiB | 0.00% |
f01699999 | Zhongshan, Guangdong, CNChina Unicom IP network China169 Guangdong province |
35.09 TiB | 20.11% | 35.09 TiB | 0.00% |
f01566485 | Shenzhen, Guangdong, CNCHINANET-BACKBONE |
25.00 TiB | 14.32% | 25.00 TiB | 0.00% |
The below table shows how each many unique data are replicated across storage providers.
⚠️ 91.62% of deals are for data replicated across less than 4 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
23.09 TiB | 23.09 TiB | 1 | 13.23% |
25.66 TiB | 51.31 TiB | 2 | 29.40% |
28.50 TiB | 85.50 TiB | 3 | 48.99% |
3.66 TiB | 14.63 TiB | 4 | 8.38% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.
✔️ No CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
@conneygabriele the data is highly centralized and stored in the same region, which is against the rules of the Filecoin+ program. There is also 91.62% of deals are for data replicated across less than 4 storage providers.
Can you explain how you will resolve this and why you did not follow the guidelines of the program?
@stcouldlisa - can you explain why you signed without due diligence?
@herrehesse Sorry for the late reply, I signed because the client told me that he has already answered your question, and also contacted you or your colleagues on slack, I saw the client's explanation, I think the client's The explanation is reasonable.
Of course, I think @conneygabriele should find out the reason why he cannot log in to GitHub as soon as possible, and then reply to your question on GitHub,
Large Dataset Notary Application
To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.
Core Information
Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.
Project details
Share a brief history of your project and organization.
What is the primary source of funding for this project?
What other projects/ecosystem stakeholders is this project associated with?
Use-case details
Describe the data being stored onto Filecoin
Where was the data in this dataset sourced from?
Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.
Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).
What is the expected retrieval frequency for this data?
For how long do you plan to keep this dataset stored on Filecoin?
DataCap allocation plan
In which geographies (countries, regions) do you plan on making storage deals?
How will you be distributing your data to storage providers? Is there an offline data transfer process?
How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.
How will you be distributing deals across storage providers?
Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?