filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <We Doctor> - <Digital Healthcare life> #962

Closed vincen1989 closed 11 months ago

vincen1989 commented 1 year ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

We are an internationally leading digital health platform company, established on 2 March 2016. Over the past years we have built a health ecosystem around digital hospitals, working closely with clients that include government hospitals, pharmacies, medical insurance institutions, and qualified doctors. Internet-based healthcare has become a significant part of medical services. Our goals and mission are: 1) to provide patients with eHealth solutions and give the public a smoother, more convenient experience by setting up digital hospitals; 2) to build a healthcare ecosystem across cities by sharing and optimizing databases and maximizing data intelligence. One of our significant projects is "Digital Healthcare Life", which provides our clients with an intelligent dataset so that they can serve their customers better and reach diagnoses faster.

What is the primary source of funding for this project?

Our own funds and business income.

What other projects/ecosystem stakeholders is this project associated with?

This project is associated with government healthcare projects, as well as some prominent insurance companies and local hospitals.

Use-case details

Describe the data being stored onto Filecoin

The data we want to store on Filecoin is a public internet dataset that can be shared with the public and can help promote the social healthcare system.

Where was the data in this dataset sourced from?

Mainly from our clients' data and from data accumulated in our projects.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

Previously, I provided the agreement between the two companies and sent a hello email to the official Filecoin email address to prove my identity. Thanks.

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

In the previous application, someone in the community asked whether this would invade customers' privacy. I would like to clarify again: we plan to store client data that is already public or ready to be made public. We already host this data anyway. We have not yet spoken with all clients about transferring their data, or making another copy of it, to the Filecoin network. We will proceed step by step and see how it goes. So far we have chosen a couple of clients' data to onboard first; those clients have approved and are looking forward to it.

What is the expected retrieval frequency for this data?


For how long do you plan to keep this dataset stored on Filecoin?

5 years at least

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Greater China, Southeast Asia

How will you be distributing your data to storage providers? Is there an offline data transfer process?


How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We plan to select SPs with a strong reputation that already participate in the project.

How will you be distributing deals across storage providers?

We will identify at least 5 to 7 SPs as our long-term partners and, ideally, distribute deals evenly across them.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

raghavrmadya commented 1 year ago

Datacap Request Trigger

Total DataCap requested


Expected weekly DataCap usage rate


Client address


large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address


Client address


DataCap allocation requested


Alex11801 commented 1 year ago

@vincen1989 Hello. There are some people's photos in the data sample you provided. Will there be any copyright disputes? Because there are some photos which I think may be downloaded from the Internet.

vincen1989 commented 1 year ago

> @vincen1989 Hello. There are some people's photos in the data sample you provided. Will there be any copyright disputes? Because there are some photos which I think may be downloaded from the Internet.

Hi Alex, thanks for your question. The community raised the same point in the previous application (which was also approved), and I am happy to answer it here to address your concern. The data link that everyone can see is our partner's website, and I have also provided the Fil team with the contract signed between us, plus an official email confirmation. To further clarify the privacy question that everyone cares about: we plan to store client data that is already public or ready to be made public. We already host this data anyway. We have not yet spoken with all clients about transferring their data, or making another copy of it, to the Filecoin network. We will proceed step by step and see how it goes. So far we have chosen a couple of clients' data to onboard first; those clients have approved and are looking forward to it.

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address


You can check the status of the message here:

Alex11801 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address


You can check the status of the message here:

BDE-io commented 1 year ago

@vincen1989 Hi! Great to see you have gotten approval for DataCap and advancing the mission of preserving humanity’s most important information. If you are looking for storage providers to store these data or have any questions, please visit #bigdata-exchange on Filecoin Slack or reply here.

We have strong demand from a diverse group of SPs, who are actively looking to onboard more data.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address


Client address


DataCap allocation requested


large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address


Client address


Last two approvers

Alex11801 & kernelogic

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested


Total DataCap granted for client so far


Datacap to be granted to reach the total amount requested by the client (1PiB)



| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 0 | 0 | 50TiB | 0 | 11.53TiB |

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address


You can check the status of the message here:

liyunzhi-666 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address


You can check the status of the message here:

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address


Client address


DataCap allocation requested


large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address


Client address


Last two approvers

liyunzhi-666 & kernelogic

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested


Total DataCap granted for client so far


Datacap to be granted to reach the total amount requested by the client (1PiB)



| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 4637 | 9 | 100TiB | 17.86 | 8.12TiB |

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address


You can check the status of the message here:

stcloudlisa commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address



You can check the status of the message here:

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address


Client address


DataCap allocation requested




large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address


Client address


Last two approvers

1LISA2 & kernelogic

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested


Total DataCap granted for client so far


Datacap to be granted to reach the total amount requested by the client (1PiB)



| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 4898 | 10 | 200TiB | 16.90 | 42.53TiB |

NDLABS-Leo commented 1 year ago

Looks great! I would like to support this.

NDLABS-Leo commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address




You can check the status of the message here:

Defil2022 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address




You can check the status of the message here:

filplus-checker commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider has taken a verified deal, it is marked as new.

For most DataCap applications, the restrictions below should apply.

⚠️ f01384160 has sealed 25.91% of total datacap.

⚠️ 87.21% of total deal sealed by f01384160 are duplicate data.

⚠️ 87.99% of total deal sealed by f01943959 are duplicate data.

⚠️ f01943959 has unknown IP location.

⚠️ 81.86% of total deal sealed by f01853077 are duplicate data.

⚠️ 81.12% of total deal sealed by f01878534 are duplicate data.

⚠️ 78.86% of total deal sealed by f01852363 are duplicate data.

⚠️ 79.07% of total deal sealed by f01890456 are duplicate data.

⚠️ 82.04% of total deal sealed by f01854772 are duplicate data.

⚠️ 66.91% of total deal sealed by f01384209 are duplicate data.

⚠️ 63.40% of total deal sealed by f01215819 are duplicate data.

⚠️ 63.28% of total deal sealed by f01271208 are duplicate data.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01384160 | Shenzhen, Guangdong, CN | 135.63 TiB | 25.91% | 17.34 TiB | 87.21% |
| f01943959 | Unknown | 110.31 TiB | 21.08% | 13.25 TiB | 87.99% |
| f01853077 | Singapore, Singapore, SG | 55.66 TiB | 10.63% | 10.09 TiB | 81.86% |
| f01878534 | Los Angeles, California, US | 51.97 TiB | 9.93% | 9.81 TiB | 81.12% |
| f01852363 | Singapore, Singapore, SG | 50.41 TiB | 9.63% | 10.66 TiB | 78.86% |
| f01890456 | Los Angeles, California, US | 49.88 TiB | 9.53% | 10.44 TiB | 79.07% |
| f01854772 | Los Angeles, California, US | 39.50 TiB | 7.55% | 7.09 TiB | 82.04% |
| f01384209 (new) | Shenzhen, Guangdong, CN | 12.75 TiB | 2.44% | 4.22 TiB | 66.91% |
| f01215819 | Shenzhen, Guangdong, CN | 11.78 TiB | 2.25% | 4.31 TiB | 63.40% |
| f01271208 (new) | Shenzhen, Guangdong, CN | 5.53 TiB | 1.06% | 2.03 TiB | 63.28% |
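
The Percentage and Duplicate Deals columns above can be reproduced from the other two numeric columns. The sketch below is not the checker's own code; it assumes that Percentage is a provider's share of the summed Total Deals Sealed and that Duplicate Deals is the part of a provider's deals that is not unique data. Both assumptions agree with the reported figures to within rounding. The function names are made up for illustration.

```python
# Rows copied from the provider table above:
# (provider, total deals sealed in TiB, unique data in TiB)
ROWS = [
    ("f01384160", 135.63, 17.34),
    ("f01943959", 110.31, 13.25),
    ("f01853077", 55.66, 10.09),
    ("f01878534", 51.97, 9.81),
    ("f01852363", 50.41, 10.66),
    ("f01890456", 49.88, 10.44),
    ("f01854772", 39.50, 7.09),
    ("f01384209", 12.75, 4.22),
    ("f01215819", 11.78, 4.31),
    ("f01271208", 5.53, 2.03),
]


def duplicate_pct(total_tib: float, unique_tib: float) -> float:
    """Assumed definition: share of sealed deals that is not unique data."""
    return (total_tib - unique_tib) / total_tib * 100.0


def share_pct(total_tib: float, grand_total_tib: float) -> float:
    """Assumed definition: provider's share of all sealed deals."""
    return total_tib / grand_total_tib * 100.0


grand_total = sum(total for _, total, _ in ROWS)

if __name__ == "__main__":
    for provider, total, unique in ROWS:
        print(
            f"{provider}: share {share_pct(total, grand_total):.2f}%, "
            f"duplicates {duplicate_pct(total, unique):.2f}%"
        )
```

Because the table values are already rounded to two decimals, recomputed percentages can differ from the reported ones in the last digit.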

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across fewer than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 63.75 TiB | 425.88 TiB | 1 | 81.37% |
| 11.06 TiB | 87.53 TiB | 2 | 16.72% |
| 1.13 TiB | 10.00 TiB | 3 | 1.91% |
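
The replication warning follows directly from this table: every row has fewer than 4 providers, so all deal volume falls under the limit. A minimal sketch of that check, assuming Deal Percentage is each row's share of the summed Total Deals Made (the helper names are illustrative, not part of the checker):

```python
# Rows copied from the replication table above:
# (number of providers holding the data, total deals made in TiB)
BUCKETS = [(1, 425.88), (2, 87.53), (3, 10.00)]


def deal_pct(size_tib: float, buckets) -> float:
    """Share of all deal volume represented by one row."""
    total = sum(size for _, size in buckets)
    return size_tib / total * 100.0


def under_replicated_pct(buckets, min_providers: int = 4) -> float:
    """Share of deal volume whose data sits on fewer than min_providers SPs."""
    total = sum(size for _, size in buckets)
    low = sum(size for n, size in buckets if n < min_providers)
    return low / total * 100.0


if __name__ == "__main__":
    for n, size in BUCKETS:
        print(f"{n} provider(s): {deal_pct(size, BUCKETS):.2f}% of deals")
    print(f"below 4 providers: {under_replicated_pct(BUCKETS):.2f}%")
```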

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
| --- | --- | --- | --- | --- |
| f1o2nxreefzqzc2mvaqneaolpdat5yk3275r6rofa | iPolloverse SG PTE. LTD | 77.88 TiB | 451 | LDN v3 multisig |
| f1ng6g57r4u62q67u6lm33ftijfsggyzzjzb2l4cy | NFTSTAR | 52.88 TiB | 406 | LDN v3 multisig |
| f1vajbbihmcqkcjlnicmjto6fflklvhoifips7uzq | rctAI | 27.47 TiB | 217 | LDN v3 multisig |
| f1ocw63vnfpg3lkhiqxtzvzqke4d42km6ow3shkba | M Space | 12.47 TiB | 85 | LDN v3 multisig |
| f13fjnwckkgnkbpapcmnpupbdf4ohduomdso4eqga | Asia Blockchain Gaming Alliance | 11.72 TiB | 84 | LDN v3 multisig |
| f1k7y7jd3ly42cg5ysty2ngc5smxmaery5mglbynq | MikaeLa | 8.56 TiB | 80 | LDN v3 multisig |
| f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey | Blue Storm Information Technology | 6.97 TiB | 50 | LDN v3 multisig |
| f1fsoyq7oiwdaapagpnlncbohkrkzolxxzkex6hxq | SemiDrive Technology | 6.06 TiB | 39 | LDN v3 multisig |
| f1mgvvj45ce5i3ikpyixtubqzcytcgmdlwgjldqri | Blockchain Media | 4.94 TiB | 32 | LDN v3 multisig |
| f1guplg5wyjdn6bv4forsb5eu2lohexvdlttkavpq | Ipollo | 4.00 TiB | 13 | LDN # 168 |
| f1ji2xty4dtg2a4pfnslltkw7pbhy76kvihizkfmq | Pow Power | 2.53 TiB | 10 | LDN v3 multisig |
| f1gmiepn73zoa5gz2oiqyugjmrsecwj5qxd42vmyi | `` | 2.41 TiB | 12 | LDN v3 multisig |
| f1dfkdmjhuvvol6okun57mix447wdpolwcd323ktq | BITRISE CAPITAL FOUNDATION PTE.LTD | 1.94 TiB | 7 | LDN # 139 |
| f14kei3mbjobgsfszu5cndhosuobkbbtoaikc52pq | Proya | 1.63 TiB | 10 | LDN v3 multisig |
| f17d2n363326qlln4uva7ylxteamtdj3lq6wuaewi | Metadata Labs Inc | 1.16 TiB | 5 | LDN # 200 |
| f1qpkmibcsitxtxwg6ayghxwkssoxs4bfyzwzslza | GAMEWAY PTE. LTD. | 416.00 GiB | 2 | LDN # 158 |
| f1raeideaex2fwvoemftky6x5jzbqdawrgeuz2mry | BBNews | 288.00 GiB | 2 | LDN # 218 |
| f1ewwoms3aiairedun4h2uayqhlmtjsulnqy2xnrq | AsiaBlockchain Gaming Alliance | 224.00 GiB | 1 | LDN # 179 |
| f1df64fi3a4pmo5mxvdrz2zs7n2bfd4wnixomudcq | UZERO | 128.00 GiB | 1 | LDN v3 multisig |
| f1pnsnrfp5dqfqwospzj6vis6jxhni765i37tne5q | TIMAGE | 128.00 GiB | 1 | LDN v3 multisig |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 5

Multisig Notary address


Client address


DataCap allocation requested




large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address


Client address


Last two approvers

DeFIL123 & not found

Rule to calculate the allocation request amount

800% of weekly dc amount requested

DataCap allocation requested


Total DataCap granted for client so far


Datacap to be granted to reach the total amount requested by the client (1PiB)



| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 20784 | 12 | 400TiB | 23.00 | 88.06TiB |

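
Across requests 2 to 5 the bot quotes a rule of 100%, 200%, 400%, and 800% of the weekly DataCap amount, doubling each time. The sketch below only illustrates that progression. The weekly amount is not shown in this thread, so `weekly_tib` is a hypothetical input; a value of 100 TiB would be consistent with the Previous DC Allocated column (100TiB, 200TiB, 400TiB). Capping a tranche at what is left of the 1PiB total is also an assumption, not something the bot states here.

```python
# Multipliers of the weekly DataCap amount, as quoted by the bot for each
# request number in this thread. Request 1 is not covered by the quoted rules.
WEEKLY_MULTIPLIER = {2: 1.0, 3: 2.0, 4: 4.0, 5: 8.0}

TOTAL_REQUESTED_TIB = 1024.0  # 1 PiB, the total named in the stats comments


def tranche_tib(
    request_number: int,
    weekly_tib: float,          # hypothetical: not stated in the thread
    granted_so_far_tib: float,  # hypothetical: not stated in the thread
    total_requested_tib: float = TOTAL_REQUESTED_TIB,
) -> float:
    """Tranche size under the quoted rule, capped at the remaining total.

    The cap is an assumption made for this sketch.
    """
    by_rule = WEEKLY_MULTIPLIER[request_number] * weekly_tib
    remaining = max(total_requested_tib - granted_so_far_tib, 0.0)
    return min(by_rule, remaining)


if __name__ == "__main__":
    granted = 50.0  # hypothetical first tranche
    for request_number in (2, 3, 4, 5):
        size = tranche_tib(request_number, weekly_tib=100.0, granted_so_far_tib=granted)
        print(f"request {request_number}: {size:.0f} TiB")
        granted += size
```
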
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider has taken a verified deal, it is marked as new.

For most DataCap applications, the restrictions below should apply.

⚠️ 88.65% of total deal sealed by f01943959 are duplicate data.

⚠️ f01943959 has unknown IP location.

⚠️ 87.21% of total deal sealed by f01384160 are duplicate data.

⚠️ 81.86% of total deal sealed by f01853077 are duplicate data.

⚠️ 81.12% of total deal sealed by f01878534 are duplicate data.

⚠️ 78.86% of total deal sealed by f01852363 are duplicate data.

⚠️ 79.07% of total deal sealed by f01890456 are duplicate data.

⚠️ 82.04% of total deal sealed by f01854772 are duplicate data.

⚠️ 88.80% of total deal sealed by f01983523 are duplicate data.

⚠️ f01983523 has unknown IP location.

⚠️ 66.91% of total deal sealed by f01384209 are duplicate data.

⚠️ 63.40% of total deal sealed by f01215819 are duplicate data.

⚠️ 63.28% of total deal sealed by f01271208 are duplicate data.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01943959 | Unknown | 145.63 TiB | 23.47% | 16.53 TiB | 88.65% |
| f01384160 | Shenzhen, Guangdong, CN | 135.63 TiB | 21.86% | 17.34 TiB | 87.21% |
| f01853077 | Singapore, Singapore, SG | 55.66 TiB | 8.97% | 10.09 TiB | 81.86% |
| f01878534 | Los Angeles, California, US | 51.97 TiB | 8.38% | 9.81 TiB | 81.12% |
| f01852363 | Singapore, Singapore, SG | 50.41 TiB | 8.12% | 10.66 TiB | 78.86% |
| f01890456 | Los Angeles, California, US | 49.88 TiB | 8.04% | 10.44 TiB | 79.07% |
| f01834291 | Los Angeles, California, US | 41.97 TiB | 6.76% | 40.84 TiB | 2.68% |
| f01854772 | Los Angeles, California, US | 39.50 TiB | 6.37% | 7.09 TiB | 82.04% |
| f01983523 | Unknown | 19.81 TiB | 3.19% | 2.22 TiB | 88.80% |
| f01384209 (new) | Shenzhen, Guangdong, CN | 12.75 TiB | 2.05% | 4.22 TiB | 66.91% |
| f01215819 | Shenzhen, Guangdong, CN | 11.78 TiB | 1.90% | 4.31 TiB | 63.40% |
| f01271208 (new) | Shenzhen, Guangdong, CN | 5.53 TiB | 0.89% | 2.03 TiB | 63.28% |
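
Unlike the earlier report, this one carries no warning about a single provider's share of total DataCap: the largest share has dropped from 25.91% to 23.47%. A rough sketch of how such per-provider warnings could be derived from the table is shown below. The 25% cutoff is an assumption picked to fit those two data points, not a value stated in the thread, and the function is illustrative rather than the checker's implementation.

```python
# (provider, location, share of total deals sealed in %) from the table above.
PROVIDERS = [
    ("f01943959", "Unknown", 23.47),
    ("f01384160", "Shenzhen, Guangdong, CN", 21.86),
    ("f01853077", "Singapore, Singapore, SG", 8.97),
    ("f01878534", "Los Angeles, California, US", 8.38),
    ("f01852363", "Singapore, Singapore, SG", 8.12),
    ("f01890456", "Los Angeles, California, US", 8.04),
    ("f01834291", "Los Angeles, California, US", 6.76),
    ("f01854772", "Los Angeles, California, US", 6.37),
    ("f01983523", "Unknown", 3.19),
    ("f01384209", "Shenzhen, Guangdong, CN", 2.05),
    ("f01215819", "Shenzhen, Guangdong, CN", 1.90),
    ("f01271208", "Shenzhen, Guangdong, CN", 0.89),
]

MAX_SHARE_PCT = 25.0  # assumed cutoff, inferred from the two reports


def provider_warnings(providers, max_share_pct: float = MAX_SHARE_PCT):
    """Return warning strings for concentration and unknown location."""
    warnings = []
    for provider, location, share in providers:
        if share > max_share_pct:
            warnings.append(f"{provider} has sealed {share:.2f}% of total datacap.")
        if location == "Unknown":
            warnings.append(f"{provider} has unknown IP location.")
    return warnings


if __name__ == "__main__":
    for line in provider_warnings(PROVIDERS):
        print(line)
```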

Provider Distribution

Deal Data Replication

The table below shows how much unique data is replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across fewer than 4 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 110.09 TiB | 522.97 TiB | 1 | 84.28% |
| 11.06 TiB | 87.53 TiB | 2 | 14.11% |
| 1.13 TiB | 10.00 TiB | 3 | 1.61% |

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
| --- | --- | --- | --- | --- |
| f1o2nxreefzqzc2mvaqneaolpdat5yk3275r6rofa | iPolloverse SG PTE. LTD | 81.75 TiB | 469 | LDN v3 multisig |
| f1ng6g57r4u62q67u6lm33ftijfsggyzzjzb2l4cy | NFTSTAR | 53.38 TiB | 408 | LDN v3 multisig |
| f1vajbbihmcqkcjlnicmjto6fflklvhoifips7uzq | rctAI | 40.31 TiB | 602 | LDN v3 multisig |
| f13fjnwckkgnkbpapcmnpupbdf4ohduomdso4eqga | Asia Blockchain Gaming Alliance | 37.03 TiB | 777 | LDN v3 multisig |
| f1ocw63vnfpg3lkhiqxtzvzqke4d42km6ow3shkba | M Space | 12.47 TiB | 85 | LDN v3 multisig |
| f1dob4zdjy6b3iinf6evbqzxf6nwgnftcksewabxq | Dr.ji | 9.72 TiB | 311 | LDN v3 multisig |
| f1k7y7jd3ly42cg5ysty2ngc5smxmaery5mglbynq | MikaeLa | 8.56 TiB | 80 | LDN v3 multisig |
| f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey | Blue Storm Information Technology | 7.88 TiB | 60 | LDN v3 multisig |
| f1fsoyq7oiwdaapagpnlncbohkrkzolxxzkex6hxq | SemiDrive Technology | 6.06 TiB | 39 | LDN v3 multisig |
| f1mgvvj45ce5i3ikpyixtubqzcytcgmdlwgjldqri | Blockchain Media | 4.94 TiB | 32 | LDN v3 multisig |
| f1guplg5wyjdn6bv4forsb5eu2lohexvdlttkavpq | Ipollo | 4.13 TiB | 14 | LDN # 168 |
| f1ji2xty4dtg2a4pfnslltkw7pbhy76kvihizkfmq | Pow Power | 2.53 TiB | 10 | LDN v3 multisig |
| f1gmiepn73zoa5gz2oiqyugjmrsecwj5qxd42vmyi | `` | 2.41 TiB | 12 | LDN v3 multisig |
| f1dfkdmjhuvvol6okun57mix447wdpolwcd323ktq | BITRISE CAPITAL FOUNDATION PTE.LTD | 2.03 TiB | 8 | LDN # 139 |
| f14kei3mbjobgsfszu5cndhosuobkbbtoaikc52pq | Proya | 1.63 TiB | 10 | LDN v3 multisig |
| f17d2n363326qlln4uva7ylxteamtdj3lq6wuaewi | Metadata Labs Inc | 1.16 TiB | 5 | LDN # 200 |
| f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi | Ganku Co., Ltd. | 896.00 GiB | 25 | LDN v3 multisig |
| f1qpkmibcsitxtxwg6ayghxwkssoxs4bfyzwzslza | GAMEWAY PTE. LTD. | 416.00 GiB | 2 | LDN # 158 |
| f1raeideaex2fwvoemftky6x5jzbqdawrgeuz2mry | BBNews | 288.00 GiB | 2 | LDN # 218 |
| f1ewwoms3aiairedun4h2uayqhlmtjsulnqy2xnrq | AsiaBlockchain Gaming Alliance | 224.00 GiB | 1 | LDN # 179 |
| f1df64fi3a4pmo5mxvdrz2zs7n2bfd4wnixomudcq | UZERO | 128.00 GiB | 1 | LDN v3 multisig |
| f1pnsnrfp5dqfqwospzj6vis6jxhni765i37tne5q | TIMAGE | 128.00 GiB | 1 | LDN v3 multisig |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

large-datacap-requests[bot] commented 11 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided.

We could not find Organization Name field in the information provided

We could not find Website / Social Media field in the information provided

We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided

We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided

We could not find On-chain address for first allocation field in the information provided

We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.

aggregation-and-compliance-bot[bot] commented 7 months ago

Client f01936823 does not follow the DataCap usage rules. More info here. This application has been failing the requirements for 7 days. Please take appropriate action to fix the following DataCap usage problems.

| Criteria | Threshold | Reason |
| --- | --- | --- |
| CID checker score | > 25% | The client has a CID checker score of 1%. This should be greater than 25%. To find out more about CID checker score please look at this issue: |
| Shared data percent | < 20% | 22.41% of the client's data is shared with other clients. This should be less than 20%. |
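
The two criteria in this comment reduce to simple threshold comparisons. A minimal sketch follows, using the thresholds and client values quoted above; the function and field names are made up for illustration and do not come from the compliance bot.

```python
from dataclasses import dataclass

# Thresholds as quoted in the compliance comment above.
MIN_CID_CHECKER_SCORE_PCT = 25.0  # score must be greater than this
MAX_SHARED_DATA_PCT = 20.0        # shared data must be less than this


@dataclass
class Criterion:
    name: str
    passed: bool
    detail: str


def check_client(cid_checker_score_pct: float, shared_data_pct: float):
    """Evaluate both usage criteria and return one result per criterion."""
    return [
        Criterion(
            "CID checker score",
            cid_checker_score_pct > MIN_CID_CHECKER_SCORE_PCT,
            f"{cid_checker_score_pct:g}% (needs > {MIN_CID_CHECKER_SCORE_PCT:g}%)",
        ),
        Criterion(
            "Shared data percent",
            shared_data_pct < MAX_SHARED_DATA_PCT,
            f"{shared_data_pct:g}% (needs < {MAX_SHARED_DATA_PCT:g}%)",
        ),
    ]


if __name__ == "__main__":
    # Values reported for client f01936823 in the comment above.
    for result in check_client(cid_checker_score_pct=1.0, shared_data_pct=22.41):
        status = "ok" if result.passed else "FAIL"
        print(f"{status}: {result.name} = {result.detail}")
```

With the reported values (score 1%, shared data 22.41%), both criteria fail, which matches the bot's message.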