filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] - <100market> #1216

Closed 100MarketOfficial closed 1 year ago

100MarketOfficial commented 1 year ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

The 100market genuine digital product market was founded in 2014, and in April 2019 the investor, Guangyuan Investment, completed a first round of financing of over 10 million yuan. The 100market digital market consists of 100Audio (copyrighted music), 100image (copyrighted design material), 100wa (copyrighted video material), 100web (enterprise websites), and other e-commerce platforms, which together cover a wide range of production needs. To date, it has included hundreds of thousands of exclusive works from all over the world, provides genuine digital product licensing solutions for tens of millions of people, and is a fixed licensing supplier for the government, mainstream media, Fortune 500 companies, and 4A advertising companies. Since its establishment, it has used a brand-new e-commerce model to help customers purchase copyright-protected digital products efficiently and quickly.


What is the primary source of funding for this project?

business income


What other projects/ecosystem stakeholders is this project associated with?

no


Use-case details

Describe the data being stored onto Filecoin

The dataset we want to store on Filecoin are some videos/audios/images which copy rights are owned by 100market.


Where was the data in this dataset sourced from?

videos/audios/images that we hold the copyright on


Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this. 

https://100audio.com/
https://100wa.com/
https://100wa.com/?fwp_download_categories=stock-footage&fwp_download_tags=uk#search
https://100wa.com/?fwp_download_categories=stock-footage&fwp_download_tags=aerial#search
https://100image.com/


Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

absolutely


What is the expected retrieval frequency for this data?

everyday


For how long do you plan to keep this dataset stored on Filecoin?

3 years


DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

globally


How will you be distributing your data to storage providers? Is there an offline data transfer process?

offline


How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

we plan to work with SPs with Network stability and fast retrieval, and to focus on SPs in China first.


How will you be distributing deals across storage providers?

at least 2 copies per client, and we will distribute the data as fairly as we can.


Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

sure

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1uloycctvjmyfis4xggi7urlthmld3hhd7lze4sq

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1uloycctvjmyfis4xggi7urlthmld3hhd7lze4sq

DataCap allocation requested

50TiB

Id

7a87db5c-b9cc-4288-915b-e60a16cbce7b

psh0691 commented 1 year ago

Looking at the application form and website, you sell copyrighted digital products. Is that suitable for Filecoin Plus, which stores public data?

100MarketOfficial commented 1 year ago

Yes, we are a platform that sells digital products whose copyright we hold through business cooperation. Once stored, I think it is a public dataset. To further clarify the privacy concern you raised, we talked with some key customers about this transition and got positive support. That is why we are here. We will surely also look into Filecoin's work to see whether we can put more and more here.

newwebgroup commented 1 year ago

About KYB/KYC

  1. Could you send an email to filplus-app-review@fil.org? The content should include the number of the LDN application. If possible, please attach copies of the business license and other valid certificates.

  2. Can you provide more detailed information about the other storage providers participating in this program, for example a list of the SPs you have contacted so far?

  3. How large is your existing dataset? How much does the data grow per month?

newwebgroup commented 1 year ago

Canceled Request

The following request has been canceled by the notary, thus should not be considered as valid anymore.

Message sent to Filecoin Network

bafy2bzacecislegzypz44nb45ss3jrizhzomksbu6saz6gg3ghal3nbroa6qe

Address

f1uloycctvjmyfis4xggi7urlthmld3hhd7lze4sq

Datacap Allocated

50.00TiB

Signer Address

f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecislegzypz44nb45ss3jrizhzomksbu6saz6gg3ghal3nbroa6qe

herrehesse commented 1 year ago

It seems (given my crowded inbox) that in the past 15 minutes @newwebgroup has approved 10+ datacap requests.

I wonder if this blind approval is allowed?

Would like to see per project what due diligence you did to explain the approval @newwebgroup.

cryptowhizzard commented 1 year ago

@newwebgroup

What due diligence has been done here? What data is stored here? None of the websites given above by the applicant are reachable. There is no storage plan or distribution plan either.

[Screenshot 2023-01-04 at 14:49:18]

Why did you approve this?

@raghavrmadya can you intervene please and explain to this notary that this is not the way to go forward?

cryptowhizzard commented 1 year ago

https://100audio.com/

This website states that all music on this website is copyrighted by the authors. Musicians can upload there and get paid for their music / copyrights.

The 100WA website is not reachable, so we cannot verify it.

The 100image.com website is also down and not reachable.

[Screenshot 2023-01-04 at 15:24:35]

herrehesse commented 1 year ago

Dear Applicant,

Due to the increased amount of erroneous/wrong Filecoin+ data recently, on behalf of the entire community, we feel compelled to go deeper into datacap requests. Hereby to ensure that the overall value of the Filecoin network and Filecoin+ program increases and is not abused.

Please answer the questions below as comprehensively as possible.

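Customer data

  • Could you demonstrate exactly how and to what extent customer contact occurred?

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple emails and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

  • Did the customer specify the amount of data involved in this relevant correspondence?
  • Why does the customer in question want to use the Filecoin+ program?

Should this be solely for acquiring datacap, this is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended as a program to store useful and public datasets on the network.

  • Why is the customer data considered Filecoin+ eligible?

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

  • Could you please demonstrate to us how you envision processing and transporting the customer data in question to any location for preparation?
  • Would you demonstrate to us that the customer, the preparer and the intended storage providers all have adequate bandwidth to process the set with its corresponding size?
  • Would you tell us how the data set preparer takes into account the prevention of duplicates in order to prevent data cap abuse?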

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

100MarketOfficial commented 1 year ago

Hi Notaries

Please refer to the screenshot below for the links. I updated my answers to your concerns here.

[Screenshot: 100market]

Is it a VPN issue? Can you just try again?

In terms of the copyright you are concerned about, we have talked with some authors cooperating with our platform to see if we can move forward in this way. Most of them seem to be very interested and positive.

If you have any further questions, kindly let me know.

Thanks

100MarketOfficial commented 1 year ago

Dear Applicant,

Due to the increased amount of erroneous/wrong Filecoin+ data recently, on behalf of the entire community, we feel compelled to go deeper into datacap requests. Hereby to ensure that the overall value of the Filecoin network and Filecoin+ program increases and is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

  • Could you demonstrate exactly how and to what extent customer contact occurred?

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple emails and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

  • Did the customer specify the amount of data involved in this relevant correspondence?
  • Why does the customer in question want to use the Filecoin+ program?

Should this be solely for acquiring datacap, this is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended as a program to store useful and public datasets on the network.

  • Why is the customer data considered Filecoin+ eligible?

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

  • Could you please demonstrate to us how you envision processing and transporting the customer data in question to any location for preparation?
  • Would you demonstrate to us that the customer, the preparer and the intended storage providers all have adequate bandwidth to process the set with its corresponding size?
  • Would you tell us how the data set preparer takes into account the prevention of duplicates in order to prevent data cap abuse?

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

Totally understood. I will look into your questions and get back to you and the team later. And again, thanks for all the questions here, which help me and our team better understand Filecoin+ and its strong value and ecosystem.

herrehesse commented 1 year ago

@100MarketOfficial appreciated. Looking forward to your answers.

raghavrmadya commented 1 year ago

Given the numerous flags raised, the T&T WG recommends that notaries not sign this application further until due diligence is completed.

newwebgroup commented 1 year ago

I am doing due diligence on this LDN and have raised several related questions.

As for why this LDN was proposed: it was caused by an operational error. Because I was doing batch approvals, this LDN was also approved by mistake. Thanks for the flags and reminders. I will revoke the proposal later. @herrehesse @raghavrmadya @cryptowhizzard

newwebgroup commented 1 year ago

@herrehesse Nothing specifies the time and frequency at which LDNs must be signed. LDN approval requires a specific environment and equipment. If you need to sign many LDNs, you need to repeat the operation many times, which takes a lot of time. In fact, the signing itself only takes a short time, and due diligence does not have to happen at the moment of signing; it can be completed in advance. Everyone has different work habits. I am used to separating due diligence from signing, which makes my notary work more effective. Before signing, I conduct due diligence across various scenarios and devices in my spare time and screen out the LDNs that need to be signed, and then I batch-sign them in a specific period of time.

In addition, please pay attention to your wording, which is a blow to a positive notary.

newwebgroup commented 1 year ago

The wording of this community member is not friendly and has escalated to a personal attack. @raghavrmadya

newwebgroup commented 1 year ago

I have canceled the proposal triggered by mistake

herrehesse commented 1 year ago

Hello @newwebgroup, thank you for taking the time to answer me.

We find it very strange that you feel personally attacked as soon as questions are raised about the legitimacy of a datacap request. This in and of itself gives us the sense that you do know about some of the fraudulent practices within Filecoin+'s ecosystem.

We will maintain a heightened focus on each and every request on behalf of the entire community to ensure that all datacap requests are legitimate and the data in question actually complies with Filecoin+'s regulations.

newwebgroup commented 1 year ago

@herrehesse The community welcomes anyone's supervision and construction. But this does not mean that anyone can slander others "for the good of the community" at will. Your words make me feel violated. Please pay attention to your wording, "blind" and "taking bribes" will make everything less friendly.

herrehesse commented 1 year ago

@newwebgroup you indicate that "Your words make me feel violated", which of course is very unpleasant for you. I am not here to harass anyone or hurt anyone's feelings. However, you are not the only one feeling violated.

The fact that datacap requests are being approved that turn out to be fraudulent has an awfully big impact on Filecoin. It also makes me feel terrible. And how do you view that?

All the work that real data preparers and clients want to do to make the Filecoin network succeed is undone by fraudulent requests approved by you and others.

Let me be direct here: your feelings do not come before the interest of the entire community. It is annoying that you feel this way, but fraudulent practices need to be addressed, period.

100MarketOfficial commented 1 year ago

Hi herrehesse

Sorry, it was busy yesterday.

Reading all the conversations here, I feel a bit confused. As I see it, everything came from a misunderstanding: someone could not open the links due to some internet or VPN issue, and I have explained and shown that they are working. Please kindly refer to my answers below; I hope they help everyone here get more information.

Thanks

1. Could you demonstrate exactly how and to what extent customer contact occurred?
All the customers cooperate with our platforms, and some of them are key customers with whom we would like to work on this project first. We have discussed future storage approaches with them, considering the growing business need in terms of storage method and cost. That said, we do not have much experience, so we need to see how it goes before involving more customers and more capacity.

2. We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple emails and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.
Not sure I have understood what you mean here. Multiple emails?

3. Did the customer specify the amount of data involved in this relevant correspondence?
Going back to my answer to the 1st question, we need to work with some key customers first and build a path for the others, with a successful and smooth experience in this project, which depends on the project itself, the community, and also the SPs.

4. Why does the customer in question want to use the Filecoin+ program?
Two reasons:
4.1 IPFS - we talked about this tech as future opportunity to store valuable
4.2 Our influence - they are our customers and we have had a long-term and solid relationship over the past years. We move forward together on some key decisions.
From our POV, embracing change is necessary, and Web3.0 is currently so hot all over the world. As a content provider platform, we have met some challenges such as the creator/originator economy, which urges us to take a small step first.

5. Why is the customer data considered Filecoin+ eligible?
Considering the business needs and the values Filecoin+ wants to deliver in the community (not only GitHub or Slack but the whole real world).
(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

6. Could you please demonstrate to us how you envision processing and transporting the customer data in question to any location for preparation?
We are in the initial stage of contacting some potential SPs and working on the details. If you can recommend more, you are welcome to.

7. Would you demonstrate to us that the customer, the preparer and the intended storage providers all have adequate bandwidth to process the set with its corresponding size?
Sure. Adequate bandwidth is necessary, as I said in the application: we plan to work with SPs with Network stability and fast retrieval.

8. Would you tell us how the data set preparer takes into account the prevention of duplicates in order to prevent data cap abuse?
Data cap abuse? We will distribute the data to the SPs as fairly as we can.

9. Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

Again, I think it is just a misunderstanding here. Appreciate all notaries’ due diligence. With your support, the community might have more positive attendees to build up together as you hope.

herrehesse commented 1 year ago

Hello @100MarketOfficial - Thank you for your fast and kind response! I will try to answer them one by one.

  1. Makes sense. Thank you.

  2. Looking at the answer to question 1, it's OK.

  3. Looking at the answer to question 1, it's OK.

  4. If the above answers are correct and you use your customers and business in that way, you are not eligible for the Filecoin+ program; you should be looking at FIL-E, since the usage is not "public and useful datasets for humanity". You yourself have the copy rights to the content, but others do not. Hence again, not applicable for Filecoin+.

You are describing the benefits of Filecoin & IPFS, which are indeed awesome, but I am talking about this specific program, not the ecosystem as a whole. This program is not the same as Filecoin. You could store all data without the need of Filecoin+.

  5. As mentioned in 4, FIL-E is a solution here.

  6. If you are still working on a plan and contacting Storage Providers, we always recommend starting with a small 100T datacap request. Asking for 5PiB directly seems quite aggressive as a start.

  7. Again, a 5P request seems very high if you have no direct answer to these questions. Start with a lower number and first find out whether you can find the right Storage Providers to work with you.

  8. All right.

  9. I really appreciate your honest answers here @100MarketOfficial and I would love to help and assist to get you further. But we need to stay careful; there are many entities who respond in a much different manner. Again, appreciated.

100MarketOfficial commented 1 year ago

Hi herrehesse

Glad to further discuss the pending issues here.

4 - How is "public and useful datasets for humanity" defined, and who defines it? I didn't see a specific definition in the Fil+ documentation; I just went back to check to make sure I didn't misunderstand :) I saw your LDN application, which refers to health data. From your POV, health is important and creation is not. Of course, arguing about this open topic in a very respectful manner, sorry, I cannot agree with you. “You yourself have the copy rights to the content, but others do not”: I don't understand this point. We have the copyright on all the content whose owners are working with us, so is it not suitable for us, as a platform and on behalf of our partners, to store useful/valuable data (from our POV) that can be retrieved by the public?

5 - Sorry, it might be my lack of knowledge about FIL-E, since I cannot find any useful information. Can you provide an official link for me to check?

6 - Yes, I still want to work on the LDN plan. In your POV, 5P is too aggressive. I'm glad to take the advice with the community's support, and I will apply for an allocation that is reasonable by the community's common sense. 500TiB as you suggested? I prefer 1P.

Again, thanks for your questions and your support with the clarification. I still insist on the LDN, though its allocation might be adjusted.

Look forward to your reply.

herrehesse commented 1 year ago

@100MarketOfficial dear applicant,

I would strongly advise you to have a chat on GitHub or Slack with @kevzak. He can assist you with a FIL-E application. Besides that, your applications seem far from suitable and the data is copyrighted.

Filecoin+ is not the right place for this request.

100MarketOfficial commented 1 year ago

Hi herrehesse

I take your words as advice or debate, with all my respect. As far as I know, public data with useful meaning (which can be seen from various angles) can apply for LDN.

Thank you again for all your questions and advice.

herrehesse commented 1 year ago

@100MarketOfficial You can always apply for an LDN. But that does not mean the system owes you any datacap. There are still rules that need to be met and data-forms that are not meant for the FIL+ program.

100MarketOfficial commented 1 year ago

@herrehesse Thanks. And sorry, I have to say I don't like the word "owe". You asked questions, and I answered with all my respect. Now you make me feel you are the law here - sorry for that feeling.

Absolutely the system does not owe me any datacap, not only me but also any entity here. Open community here but within rules. Cheers.

Have a nice day/night.

herrehesse commented 1 year ago

@100MarketOfficial

"The dataset we want to store on Filecoin are some videos/audios/images which copy rights are owned by 100market."

As stated, this data does not belong at FIL+ but rather at FIL-E since the data is copyrighted. I am not supportive of continuation of this LDN.

cryptowhizzard commented 1 year ago

@simonkim0515

I did checks on the websites again today. They are all not reachable.

I suggest closing this.

100MarketOfficial commented 1 year ago

@cryptowhizzard Hello. Can you share how you tried to open it? It did work from my side.

Thanks

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!