Closed huangqian2021 closed 1 year ago
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
@galen-mcandrew @Kevin-FF-USA @dkkapur Is there any information we need to provide for our issue? Because I haven't seen any progress about my issue while many issues submitted later than me have already gone to the next process. Please help to check . Thank you.
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
In China, all of the dataset about Public Security and Police can NOT be used to apply DataCap. I propose to close this issue immediately in order to avoid more trouble.
@huangqian2021 Hello
The data comes from video surveillance and wechat working group.The data from the monitoring system is mainly geographic information.The wechat data is captured by an automatic capturing software regardless of the file type and then sent to our own platform.
Is video surveillance authorized for data use by relevant organizations? Can you provide relevant materials?
Data sample is invalid, please check it.
In China, all of the dataset about Public Security and Police can NOT be used to apply DataCap. I propose to close this issue immediately in order to avoid more trouble.
The platform is a public security command system based on police comprehensive platform, information research .It is illegal to transmit relevant public security data
Any update?@huangqian2021
@AnthnoySmith @Yvette516 We are a privately held company wholly owned by individuals. We gather data from different sources and applying analytics capabilities to improve safety, efficiency and productivity as well as security. Thanks a lot for bring this up. We are fully aware of video surveillance is one of the data protection areas that raises questions, and the 1st patch we would store with Filecoin network is the data coming from cameras along the railway and expressway which are mainly environmental information related to ecological governance. These data counts around 18% of our 11Pb total public data. We also have additional 5.8Pb data stored at designated data centers that so far would not be able to move over to Filecoin network.
@Sunnyiscoming For the data we planned to store with the filecoin network,our company installed the cameras,own the video,and we apply analytics capabilities to derive actionable,and quantifiable intelligence from video content. Yes,we can provide relavant materials. here are the samples: https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/13CF9E80FC334A58C8BB2EDB9CD25A10.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/29B02C01FC377B226F7EEDD55321A948.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/3DC6B82FAD0C5DA26C009EBC2A2E4B1F.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/6EE144CD0DFCDBA49B1CE3468575B3A4.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/73987E8F8FBB8780A38556BC091CDD7F.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/8500FAD668F5CED4C622FDB8066574C7.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/A084DC70D9ABE4D4449052A41D805B27.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/A5251EE10EDCDD6679D88A4DF527F77A.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/EBEFB7510C137EEE4B573A46AA3E25A3.jpg https://image-blue-storm.oss-cn-hangzhou.aliyuncs.com/image/F1BB492D9E8BE883ECF6FAC08B3C9680.jpg
Hello @Kevin-FF-USA Any feedback from Next Governance Call: Tues June 14 8am PT / 3pm UTC -and- 4pm PT / 2300 UTC? We used to pitch clients with such presentations, but first time in front of cloud service providers. I hope it was not too technical or industrial focused. If anything else you and the team would be interested, please let us know. We are also open to take emails or phone calls for any questions or suggestions.
Hello @galen-mcandrew @dkkapur, do we have a next step to follow to proceed?
The installation and shooting of the camera should also be authorized by the relevant institutions. Is the authorization of the data also approved? Can you provide such data use authorization certificate?
@Sunnyiscoming Yes, video surveillance is approved. We’ve been in video surveillance for several years with major partners in multiple areas, and we are fully aware of it. As previous stated, we gather data from different sources and apply analytics capabilities to improve safety, efficiency and productivity as well as security. In other words, valuable information is digested from raw data and stored/managed by the clients. We remove clients’ data from raw data and artificial intelligently manipulate and reproduce a clean version data, which we could authorize for public use.
@huangqian2021 Hey. Could you send an email with data use authorization certificate to filplus@fil.org with your official domain?
Have you sent the email? If not, you'd better to name your email with issue id #323, which can make it easily be found.
Following up on Sunny's comment. Please send an email from official account to filplus-app-review@fil.org. No response by end of week will lead to application closure
Hi Sunny, I sent the email, thanks for your following.
Hi Raghav, just sent the Email, pls check it. Thx a lot!
Note for notaries:
Total DataCap requested
5 PiB
Expected weekly DataCap usage rate
100 TiB
Client address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
f02049625
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
50TiB
Note for notaries: Please do further diligence on if " dataset about Public Security and Police can NOT be used to apply DataCap" as mentioned in comments above before approving the first tranche.
Hello @huangqian2021 , How do you prove that your data is publicly available? Do you have any relevant credentials? If you have, DM me through slack @DeFIL or you can shown on Github directly.
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzaced3qbbhzvvvzke6g4t2ytzmuonbs7iyjsrd35b54kyvjldpcgmsvy
Address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
Datacap Allocated
50.00TiB
Signer Address
f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci
You can check the status of the message here: https://filfox.info/en/message/bafy2bzaced3qbbhzvvvzke6g4t2ytzmuonbs7iyjsrd35b54kyvjldpcgmsvy
Your Datacap Allocation Request has been approved by the Notary
bafy2bzaceaaj3oor6it5yhar6sji4oeu4q3uuvchgitb6mj7svssnd3ajkurg
Address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
Datacap Allocated
50.00TiB
Signer Address
f1dnb3uz7sylxk6emti3ififcvu3nlufnnsjui6ea
You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceaaj3oor6it5yhar6sji4oeu4q3uuvchgitb6mj7svssnd3ajkurg
您的邮件我已收到~
@huangqian2021 Hi! Great to see you have gotten approval for DataCap. If you are looking for storage providers to store these data or have any questions, please visit #bigdata-exchange on Filecoin Slack or reply here.
We have strong demand from a diverse group of SPs, who are actively looking to onboard more data.
“警务云”采用世界领先的数据算法和计算引擎,可以在1秒内从海量数据中检索到关注人员的基础信息、活动轨迹、案件事件、网络行为等所有关联信息,在5至10秒内完成关注人员轨迹碰撞、路径推演、时空分析等,实现千亿数据秒级计算、数据检索秒级响应,为全警大数据应用创造了有力条件。同时,还依托“警务云”对海量数据进行智能分析,建立以身份证号码、电话号、车牌号等唯一值为索引、全数据关联的12类主题数据库,实现全量数据关联融合,并根据各警种数据应用需求,进一步细化数据分类,建立人员关系、人员轨迹等68个专题数据库,为实战应用提供全方位、精细化的数据服务。
http://jlrbszb.cnjiwang.com/pc/paper/c/201808/07/content_59869.html
@huangqian2021 As a Chinese company, don't you know why Didi was fined and taken offline? Not to mention that the data involved in your application is of the highest privacy. NO country would allow police data to be stored externally. This has clearly violated the privacy of citizens and I have flagged this application many times. I would like to know @raghavrmadya what is your reason for approving this?
Hi @Yvette516, thanks for flagging. In the capacity of a governance team member, I did the initial KYC by asking questions and the client responded. I or a governance team member do not approve applications. This responsibility lies with the notaries. In this case, I would invite @stcouldlisa and @Defil2022 to respond to your comment. If the community comes to the conclusion that this application violates the guidelines of the Fil+ program, we can take necessary steps
The company is the joint development and practice base of intelligent police mobile application of Hunan Police College. It has maintained close cooperation with public institutions and government agencies in nearly 50% of cities in Hunan Province ,which covers 7 cities with a land area of 100,000 km² and a population of 30 million,to develop smart public security platform.The platform is a public security command system based on police comprehensive platform, information research and judgment platform, geographic information platform and big data center, supplemented by video surveillance system, GPS satellite positioning system and image transmission system, and supported by mobile alarm positioning, police service management and modern communication technology.
@raghavrmadya Thanks for quick response but have you heard of a non-state run police academy anywhere? As the leader of the trust and transparency team, I believe you need to improve your discernment about the dataset from the application itself. I also saw the note you left for notaries, but it is unreasonable to set TRIGGER knowing that this would violate the law and doesn't fit the on board client regulation. @huangqian2021 It is impossible for police systems to authorize data use to non-state administered organizations. The proof you mentioned is very suspicious.
I suggest suspending the application until this company can provide authentic authorization for data storage in filecoin or proper KYC via the police cloud center email.
Hey @raghavrmadya @Yvette516 Thanks for the heads up. I did thorough diligence about this application. First, they can provide emails to prove their identity. Their application form about whether the data can be publicly retrieval is filled out as follows.
the data could be accessed by anyone.Data related to policing and privacy will not be stored in Filecoin at this stage.
In other words, valuable information is digested from raw data and stored/managed by the clients. we remove clients' data from raw data and We remove clients' data from raw data and artificially intelligently manipulate and reproduce a clean version data, which we could authorize for public use.
My understanding is that the data their company handles is divided into publicable and non-publicable. Their non-public data will not be stored on the filecoin network. So it makes sense.
Second, he has documents that prove authorization. I think this is the type of documentation that no one would risk falsifying, so it is reasonable.
Based on the above review, we are willing to support their first round of allocation so that we can retrieve the stored data for further review. We also hope that more community members and notaries will participate in the review.
f02049625
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
100TiB
f01858410
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
DeFIL123 & 1LISA2
100% of weekly dc amount requested
100TiB
50TiB
4.95PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
1396 | 6 | 50TiB | 20.42 | 6.37TiB |
Flagging the relative nodes f01740934\f01834291\f01834253\f01893023\f01890456 and two notaries @stcouldlisa @Defil2022 have suspicious behaviors with this issue.
This is a very serious accident, proposed to have a discussion in the next gov-call about how to avoid such mistakes in the future! @raghavrmadya
“警务云”采用世界领先的数据算法和计算引擎,可以在1秒内从海量数据中检索到关注人员的基础信息、活动轨迹、案件事件、网络行为等所有关联信息,在5至10秒内完成关注人员轨迹碰撞、路径推演、时空分析等,实现千亿数据秒级计算、数据检索秒级响应,为全警大数据应用创造了有力条件。同时,还依托“警务云”对海量数据进行智能分析,建立以身份证号码、电话号、车牌号等唯一值为索引、全数据关联的12类主题数据库,实现全量数据关联融合,并根据各警种数据应用需求,进一步细化数据分类,建立人员关系、人员轨迹等68个专题数据库,为实战应用提供全方位、精细化的数据服务。
http://jlrbszb.cnjiwang.com/pc/paper/c/201808/07/content_59869.html
@huangqian2021 As a Chinese company, don't you know why Didi was fined and taken offline? Not to mention that the data involved in your application is of the highest privacy. NO country would allow police data to be stored externally. This has clearly violated the privacy of citizens and I have flagged this application many times. I would like to know @raghavrmadya what is your reason for approving this?
Hi @Yvette516 @stcouldlisa @Defil2022 @raghavrmadya, As I previously stated, we gather data from different sources and applying analytics capabilities to improve safety, efficiency and productivity as well as security. You are totally right, the raw data we collected for some parties is highly private. Even though we have collected nearly 6Pb such data stored at designated data centers, we don’t own such data. We don’t make decisions on where the data is stored. Police data is one of such cases. I recall we briefly discussed this concern in June. The 1st patch data we would store with Filecoin network is the video surveillance data coming from cameras along the railway and expressway which are mainly environmental information related to ecological governance. Such data is owned by us, both raw data and analytically manipulated data. Both are public to different group of people. We planned to store less than 20% of such public data with Filecoin network, and such data does not include raw data of the environmental information yet. There are rules and regulations regarding video surveillance, most people would not imagine how weird some rules are. In many cases, we can’t even fully manage our own data which doesn’t even related to any personal, household, or demography info. We are trying to bring as much as possible to filecoin network, at the same time, we are also working on alternative solutions for cost reduction.
@Chris00618 What's the meaning of "suspicious behaviors"? what's the matter with those SPs? Plz take responsibility for your words, THANK YOU! @huangqian2021 Thanks for your reply. Please keep in touch with community. I do recommend more notaries to do investigation on this and i will follow this up.
The 1st patch data we would store with Filecoin network is the video surveillance data coming from cameras along the railway and expressway which are mainly environmental information related to ecological governance. Such data is owned by us, both raw data and analytically manipulated data. Both are public to different group of people.
@huangqian2021 Do you mean that the surveillance data of railway and expressway in the police cloud system belongs to your company? "For different groups of people" can you please classify it clearly? I would also like to see the breakdown of the surveillance videos that you state in dataset.
The company is the joint development and practice base of intelligent police mobile application of Hunan Police College. It has maintained close cooperation with public institutions and government agencies in nearly 50% of cities in Hunan Province ,which covers 7 cities with a land area of 100,000 km² and a population of 30 million,to develop smart public security platform.The platform is a public security command system based on police comprehensive platform, information research and judgment platform, geographic information platform and big data center, supplemented by video surveillance system, GPS satellite positioning system and image transmission system, and supported by mobile alarm positioning, police service management and modern communication technology.
@raghavrmadya Thanks for quick response but have you heard of a non-state run police academy anywhere? As the leader of the trust and transparency team, I believe you need to improve your discernment about the dataset from the application itself. I also saw the note you left for notaries, but it is unreasonable to set TRIGGER knowing that this would violate the law and doesn't fit the onboard client regulation. @huangqian2021 It is impossible for police systems to authorize data use to non-state administered organizations. The proof you mentioned is very suspicious.
I suggest suspending the application until this company can provide authentic authorization for data storage in filecoin or proper KYC via the police cloud center email.
I or the governance team do not have the final say on LDN applications. If a client provides response to questions asked for kYC, the governance team will not block on applications. It is on the notaries and community members to assess the dataset and flag inconsistencies.
The 1st patch data we would store with Filecoin network is the video surveillance data coming from cameras along the railway and expressway which are mainly environmental information related to ecological governance. Such data is owned by us, both raw data and analytically manipulated data. Both are public to different group of people.
@huangqian2021 Do you mean that the surveillance data of railway and expressway in the police cloud system belongs to your company? "For different groups of people" can you please classify it clearly? I would also like to see the breakdown of the surveillance videos that you state in dataset.
Hi Yvette516 Although we collect data, clean/manipulate raw data, and later apply ananlytics including AI/ML upon request from clients, we don't own them. We donn't bear the cost of storage and we don't make decisions where the clients store their data. The current surveillance data of railway and exressway to store with Filecoin network is not in the police cloud system, so that we could pull and mix up with data from other resources and apply all sort of data mining algorithms to answer different client inquiries. We have 3rd party research, analytics, and consulting companies build up qualitative or quantitative models with our data resources. They take our clean and manipulated data for further research and analysis. There are also people just search and view the raw data. You have some good quesstions and valid concerns. I think we need more connections and communications between decentralized storage founders/executives and centralized storage users. You are more than welcom to come to our comany for a site visit. It'll probably help us all in long run to allocate more data to decentralized way of cloud storage.
f02049625
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
200TiB
d6f34292-6d9b-4241-8e46-0cbf876c6753
f01858410
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
1475Notary & swatchliu
200% of weekly dc amount requested
200TiB
150TiB
4.85PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
3488 | 7 | 100TiB | 22.59 | 2.09TiB |
This app wrapper looks fine, and I'll support this signature
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzaceatopaaclch47gjomi3467uuclaadkrzx6bd4wanx7ok2rynqayzq
Address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
Datacap Allocated
200.00TiB
Signer Address
f1fg6jkxsr3twfnyhdlatmq36xca6sshptscds7xa
Id
d6f34292-6d9b-4241-8e46-0cbf876c6753
You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceatopaaclch47gjomi3467uuclaadkrzx6bd4wanx7ok2rynqayzq
Your Datacap Allocation Request has been approved by the Notary
bafy2bzacedjvf5ayp7l63la6kxwhcgoj2so5bvhu6qq7peyabupu36nzcrirq
Address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
Datacap Allocated
200.00TiB
Signer Address
f1q6bpjlqia6iemqbrdaxr2uehrhpvoju3qh4lpga
Id
d6f34292-6d9b-4241-8e46-0cbf876c6753
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedjvf5ayp7l63la6kxwhcgoj2so5bvhu6qq7peyabupu36nzcrirq
f02049625
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
400TiB
7fb79b10-86af-44f7-805b-733be4575e10
f01858410
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
llifezou & fireflyHZ
400% of weekly dc amount requested
400TiB
150TiB
4.85PiB
Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
---|---|---|---|---|
3488 | 7 | 200TiB | 22.59 | 2.09TiB |
Looks good so far.
Your Datacap Allocation Request has been proposed by the Notary
bafy2bzacebfifevha3oozteigudrspkitqlj524unvpirzthvhuuej72zxvjm
Address
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
Datacap Allocated
400.00TiB
Signer Address
f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq
Id
7fb79b10-86af-44f7-805b-733be4575e10
You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebfifevha3oozteigudrspkitqlj524unvpirzthvhuuej72zxvjm
Blue Storm Information Technology
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes verified deal, it will be marked as new
.
For most of the datacap application, below restrictions should apply.
⚠️ f01943959 has sealed 25.71% of total datacap.
⚠️ 86.05% of total deal sealed by f01943959 are duplicate data.
⚠️ f01943959 has unknown IP location.
⚠️ 78.99% of total deal sealed by f01878534 are duplicate data.
⚠️ 69.80% of total deal sealed by f01893023 are duplicate data.
⚠️ 75.66% of total deal sealed by f01890456 are duplicate data.
⚠️ 79.66% of total deal sealed by f01852363 are duplicate data.
⚠️ 62.99% of total deal sealed by f01834291 are duplicate data.
⚠️ 64.57% of total deal sealed by f01740934 are duplicate data.
⚠️ 63.96% of total deal sealed by f01834253 are duplicate data.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f01943959 | Unknown | 41.00 TiB | 25.71% | 5.72 TiB | 86.05% |
f01878534 | Los Angeles, California, US | 25.44 TiB | 15.95% | 5.34 TiB | 78.99% |
f01893023 | Los Angeles, California, US | 24.63 TiB | 15.44% | 7.44 TiB | 69.80% |
f01890456 | Los Angeles, California, US | 23.63 TiB | 14.81% | 5.75 TiB | 75.66% |
f01852363 | Singapore, Singapore, SG | 21.97 TiB | 13.78% | 4.47 TiB | 79.66% |
f01834291 | Los Angeles, California, US | 7.94 TiB | 4.98% | 2.94 TiB | 62.99% |
f01740934 | Los Angeles, California, US | 7.94 TiB | 4.98% | 2.81 TiB | 64.57% |
f01834253 | Los Angeles, California, US | 6.94 TiB | 4.35% | 2.50 TiB | 63.96% |
The below table shows how each many unique data are replicated across storage providers.
⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
29.34 TiB | 137.66 TiB | 1 | 86.32% |
1.09 TiB | 5.63 TiB | 2 | 3.53% |
1.81 TiB | 16.19 TiB | 3 | 10.15% |
The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.
⚠️ CID sharing has been observed.
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.
This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!
Large Dataset Notary Application
To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.
Core Information
Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.
Project details
Share a brief history of your project and organization.
What is the primary source of funding for this project?
What other projects/ecosystem stakeholders is this project associated with?
Use-case details
Describe the data being stored onto Filecoin
Where was the data in this dataset sourced from?
Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.
Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).
What is the expected retrieval frequency for this data?
For how long do you plan to keep this dataset stored on Filecoin?
DataCap allocation plan
In which geographies (countries, regions) do you plan on making storage deals?
How will you be distributing your data to storage providers? Is there an offline data transfer process?
How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.
How will you be distributing deals across storage providers?
Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?