filecoin-project / notary-governance

115 stars 58 forks source link

v5 Notary Allocator Application: Data Preparer Pathway #1030

Closed laurarenpanda closed 8 months ago

laurarenpanda commented 11 months ago

v5 Notary Allocator Application

To apply to be an allocator, organizations will submit one application for each proposed pathway to DataCap. If you will be designing multiple specific pathways, you will need to submit multiple applications.

Please complete the following steps:

1. Fill out the information below and create a new GitHub Issue

  1. Notary Allocator Pathway Name (This can be your name, or the name of your pathway/program. For example "E-Fil+"): Data Preparer Pathway
  2. Organization Name: FileDrive Labs
  3. On-chain address for Allocator (Provide a NEW unique address. During ratification, you will need to initialize this address on-chain): f1vewqfhde62bjccdiy3qbjbhg37ldr4y7nm5wv6q
  4. Country of Operation (Where your organization is legally based): PRC
  5. Region of Operation (What region will you serve?): All regions
  6. Type of Allocator, diligence process: (Automated/programmatic, Market-based, or Manual (human-in-the-loop at some phase): Market-based & Manual
  7. DataCap requested for allocator for 12 months of activity (This should be an estimate of overall expected activity. Estimate the total amount of DataCap you will be distributing to clients in 12 months, in TiB or PiB): 200PiB

2. Access allocator application (download to save answers)

Click link below to access a Google doc version of the allocator application that can be used to save your answers if you are not prepared to fully submit the application in Step 3. https://docs.google.com/document/d/1-Ze8bo7ZlIJe8qX0YSFNPTka4CMprqoNB1D6V7WJJjo/copy

3. Submit allocation application

Clink link below to access full allocator questionnaire and officially submit your answers: https://airtable.com/appvyE0VHcgpAkt4Z/shrQxaAIsD693e1ns

Note: Sections of your responses WILL BE posted back into the GitHub issue tracking your application. The final section (Additional Disclosures) will NOT be posted to GitHub, and will be maintained by the Filecoin Foundation. Application information for notaries not accepted and ratified in this round will be deleted.

Kevin-FF-USA commented 11 months ago

Hi @laurarenpanda

Wanted to let you know this application has been received along with the Airtable detailed answers - the public answers will be posted in a thread below soon. If you have any questions - please let me know.

ghost commented 10 months ago

Basic Information

1. Notary Allocator Pathway Name: Data Preparer Pathway

2. Organization: FileDrive Labs

3. On Chain Address for Allocator: f1vewqfhde62bjccdiy3qbjbhg37ldr4y7nm5wv6q

4. Country of Operation: PRC

5. Region(s) of operation: South America, North America, Japan, Oceania, Europe, Greater China, Asia minus GCR, Africa , Other

6. Type of Allocator: Manual

7. DataCap requested for allocator for 12 months of activity: 200PiB

8. Is your allocator providing a unique, new, or diverse pathway to DataCap? How does this allocator differentiate itself from other applicants, new or existing?: In this pathway, data preparers are the principal applicants for the DC application because they have more previous understanding of the Filecoin network and FIL+ than data storage clients, and they have more specialized tools and solutions for data preprocessing than storage providers on average.

9. As a member in the Filecoin Community, I acknowledge that I must adhere to the Community Code of Conduct, as well other End User License Agreements for accessing various tools and services, such as GitHub and Slack.: Acknowledge

Client Diligence

10. Who are your target clients?: Other (specified above), Small-scale developers or data owners

11. Describe in as much detail as possible how you will perform due diligence on clients. If you are proposing an automated pathway, what diligence mechanism will you use to determine client eligibility?: For due diligence, it is necessary to collect basic information from applicants, including:

With the required info, we could know better about applicants and their businesses, verify their qualifications and authenticity, review data samples, and decide to allocate certain DataCap according to our allocation strategy rating form.

Clients with great credit and reputation in Filecoin Plus could gradually increase their limitations for their one-kick DataCap allocation application.

12. Please specify how many questions you’ll ask, and provide a brief overview of the questions.: Questions about the Client's qualifications:

  1. Entity Information of data preparer and data owner

    • Formation documents: includes certificates of registration/incorporation/information
    • government-issued identification number for the entity
  2. Authorized Signatory Information

    • evidence of the authorized signatory's authority to act on behalf of the application entity (evidence include written consent of the company board of directors, a secretary's certificate, a signed letter from the applicant's CFO, or may be included in the company operating agreement);
    • close-up photo of the signatory's face next to the visible passport or other government-issued photo-bearing ID
  3. Beneficial Owners If there are any 25%+ shareholders, we need a document, including a capitalization table, operating agreement, or similar, to verify the ownership, and also the following information for each of them:

    • legal name;
    • date of birth;
    • address;
    • a government-issued identification number, which shall be either an SSN, ITIN, passport number and country of issuance, alien identification card number, or number and country of issuance of any other government-issued document evidencing nationality or residence;
      • a copy of an unexpired passport or other government-issued document evidencing the individual's nationality or residence, which bears a photograph of the individual.

Question about the DataCap allocation plan:

  1. Introduction of yourself/ your organization(with links)/data owners’ details
  2. Use case
  3. Location
  4. Max DataCap Allocation
  5. Filecoin address
  6. SPs you intend to allocate DataCap with allocation proportion
  7. Specific requirements of SPs and storage service
  8. For-profit or not

13. Will you use a 3rd-party Know your client (KYC) service?: Will consider.

14. Can any client apply to your pathway, or will you be closed to only your own internal clients? (eg: bizdev or self-referral): My pathway is open to every data preparer in the Filecoin ecosystem.

15. How do you plan to track the rate at which DataCap is being distributed to your clients?: Yes

Data Diligence

16. As an operating entity in the Filecoin Community, you are required to follow all local & regional regulations relating to any data, digital and otherwise. This may include PII and data deletion requirements, as well as the storing, transmit: Acknowledge

17. What type(s) of data would be applicable for your pathway?: Public Open Dataset (Research/Non-Profit), Public Open Commercial/Enterprise, Private Commercial/Enterprise, Private Non-Profit/Social Impact

18. How will you verify a client’s data ownership? Will you use 3rd-party KYB (know your business) service to verify enterprise clients?: KYB service + Manual method This will depend on the enforceability of the validation and local laws and regulations.

19. How will you ensure the data meets local & regional legal requirements?: Applicants are required to demonstrate data compliance in their application.

20. What types of data preparation will you support or require?: In this pathway, the applicant, that is, the data preparer, needs to preprocess the data per the common data preprocessing standards of the network and then transfer processed data to the storage provider. FileDrive Labs is willing to share open-source tools and operation experience to support more community members in understanding data processing.

21. What tools or methodology will you use to sample and verify the data aligns with your pathway?: FIL+ Dashboard filplus.info: developed by FileDrive Labs to monitor the whole working flow of DC CID Checker Bot: track data distribution, retrieval success rate and CID sharing Random data retrieval: manually verified data

Data Distribution

22. How many replicas will you require to meet programmatic requirements for distribution?: 3+

23. What geographic or regional distribution will you require?: Same as the FIL+ LDN requirement but will also support data owners to specify the geographic location of their data storage.

24. How many Storage Provider owner/operators will you require to meet programmatic requirements for distribution?: 5+

25. Do you require equal percentage distribution for your clients to their chosen SPs? Will you require preliminary SP distribution plans from the client before allocating any DataCap?: We will regularly communicate with applicants about the data distribution plan, but in view of the actual situation, it will not be mandatory to distribute evenly throughout the process, but the overall distribution percentage of the application will be required, which is no single storage provider can obtain more than 10% of the DC. We will conduct this process with reference to existing information collection methods in order to reduce the cost of learning for applicants.

26. What tooling will you use to verify client deal-making distribution?: FIL+ Dashboard filplus.info: developed by FileDrive Labs to monitor the whole working flow of DC

27. How will clients meet SP distribution requirements?: We encourage data preparers to build their own data distribution platforms, establish credit systems, and establish cooperation with more high-quality storage providers. At the same time, if there is an urgent need for a data distribution platform in the market, we will also consider launching corresponding projects to build it together with the community.

28. As an allocator, do you support clients that engage in deal-making with SPs utilizing a VPN?: VPN is allowed to be used on the premise that the SPs’ geographical location can be proved.

DataCap Allocation Strategy

29. Will you use standardized DataCap allocations to clients?: No, client specific

30. Allocation Tranche Schedule to clients:: • First: lesser of 5% of total DataCap requested or 50% of weekly allocation rate • Second: lesser of 10% of total DataCap requested or 100% of weekly allocation rate • Third: lesser of 20% of total DataCap request or 200% of weekly allocation rate • Fourth: lesser of 40% of total DataCap requested or 400% of weekly allocation rate • Max per client overall: lesser of 80% of total DataCap request or 800% of weekly allocation rate

31. Will you use programmatic or software based allocations?: Yes, standardized and software based

32. What tooling will you use to construct messages and send allocations to clients?: Existing notary registry tooling with some more practical optimizations.

33. Describe the process for granting additional DataCap to previously verified clients.: We will use FIL+ Dashboard filplus.info to track the percentage of DC and further adjust the subsequent allocation of the next stage. We have plans to develop related tools.

34. Describe in as much detail as possible the tools used for: • client discoverability & applications • due diligence & investigation • bookkeeping • on-chain message construction • client deal-making behavior • tracking overall allocator health • disput:

  1. GitHub & Slack: communication and Q&A
  2. Existing notary registry tooling: DC allocation
  3. Dashboard filplus.info: track DC flow
  4. CID Check Bot: T&T analysis
  5. Others: necessary tools needed to be developed

Tools and Bookkeeping

35. Will you use open-source tooling from the Fil+ team?: Existing notary registry tooling & CID Check Bot

36. Where will you keep your records for bookkeeping? How will you maintain transparency in your allocation decisions?: Local database, open-sourced Github repo, and periodic information synchronization with Filecoin Foundation

Risk Mitigation, Auditing, Compliance

37. Describe your proposed compliance check mechanisms for your own clients.: We will allow at least two weeks for the applicant to present and explain the issue before we identify the issue and make any decisive decisions, and we will allow the applicant to continue to receive the DC assignment as long as the reasons are credible and logical. At the same time, we will establish a record file and credit system, record all problems and disputes, and give the corresponding processing results for reference.

38. Describe your process for handling disputes. Highlight response times, transparency, and accountability mechanisms.: We are willing to use the way the T&T team operates as a reference, and solicit community suggestions through suggestion boxes or online meetings before assigning phased initiatives.

39. Detail how you will announce updates to tooling, pathway guidelines, parameters, and process alterations.: We plan to make updates or changes on a quarterly basis and ensure that any changes are discussed openly.

40. How long will you allow the community to provide feedback before implementing changes?: Max 3 months.

41. Regarding security, how will you structure and secure the on-chain notary address? If you will utilize a multisig, how will it be structured? Who will have administrative & signatory rights?: Ledger & Multisig (in considering) with core members of FileDrive Labs.

42. Will you deploy smart contracts for program or policy procedures? If so, how will you track and fund them?: Still under discussion.

Monetization

43. Outline your monetization models for the services you provide as a notary allocator pathway.: No monetization models at this stage. Will consider it after this pathway runs smoothly.

44. Describe your organization's structure, such as the legal entity and other business & market ventures.: Shanghai Xunling Technology Co,.Ltd Legal entity: a limited company registered in Shanghai, China Business: software services

45. Where will accounting for fees be maintained?: Haven’t decided.

Past Experience, Affiliations, Reputation

46. If you've received DataCap allocation privileges before, please link to prior notary applications.: https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1716 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1688 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1623 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1624 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1625 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1626 https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1627

47. How are you connected to the Filecoin ecosystem? Describe your (or your organization's) Filecoin relationships, investments, or ownership.: FileDrive Labs is a technical team aiming to provide a high-customized decentralized storage service for Web3 data and develop storage applications with more reliability and availability based on IPFS/FIlecoin. In 2019, FileDrive Labs was established by core members in Shanghai, China, and began to do in-depth research and development around the IPFS technology stack. Now, FileDrive Labs has ten core team members focusing on self-developed tools and programs contributing to IPFS/Filecoin ecosystem and Web 3.0 World. Open-source programs: - Go-Graphsplit: https://github.com/filedrive-team/go-graphsplit - Go-Parallel-Graphsync: https://github.com/filedrive-team/go-parallel-graphsync - Go-DS-Cluster: https://github.com/filedrive-team/go-ds-cluster - FileDAG Storage: https://github.com/filedag-project/filedag-storage Tool for FIL+: - filplus.info: https://filplus.info/ Software for Filecoin Network: - HASS: https://hass.filedrive.io/ Part of Filecoin ecosystem: - Filecoin Data Onboarding WG with FileDrive Storage Solutions: https://dataonboarding.filecoin.io/ - Filecoin Retrieval Market with ParallelGrapgSync: https://retrieval.market/ - Protocol Labs Network Directory: https://www.plnetwork.io/directory/teams - Member of FVM Early Builder Foundry - Spaceship Member of Filecoin Orbit Community: https://orbitcommunity.filecoin.io/

48. How are you estimating your client demand and pathway usage? Do you have existing clients and an onboarding funnel?: According to more than 3-year experience in FIL+ and data onboarding in Filecoin Network and feedback from the community and our partners.

galen-mcandrew commented 8 months ago

Datacap Request for Allocator

Address

f2n2cvq3wi344zua7ofax4jygkcqb4lwbhdtofhmq

Datacap Allocated

5PiB

filplus-bot commented 8 months ago

The request has been signed by a new Root Key Holder

Message sent to Filecoin Network

bafy2bzaceb2amdjxqhswxebivizmwkrimvtgooqfuushqj6eflwtbjaggvxzm

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb2amdjxqhswxebivizmwkrimvtgooqfuushqj6eflwtbjaggvxzm