filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] <Venus Team > - <DealAccelerator 3> #1725

Closed Joss-Hua closed 10 months ago

Joss-Hua commented 1 year ago

Data Owner Name

Venus team

Data Owner Country/Region

American Samoa

Data Owner Industry

Other

Website

https://venushub.io

Social Media

https://linktr.ee/filecoinvenus

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

200TiB

On-chain address for first allocation

f1fppo2nhn3zd2zfpa6fdqwuqwlho3nerbkrenquq

Custom multisig

Identifier

No response

Share a brief history of your project and organization

The team: Venus team

Venus team leads the development and practice of Filecoin Venus, established in 2020, Shanghai, China. Now we have decades of members from all over the world focusing on code dev and community.

Through the core dev and ecological activities and programs, we hope (and already) to have more storage service providers, users, and enthusiasts join Filecoin or provide more contributions after joining Filecoin.

The project: Venus Deal Accelerator (https://venushub.io/accelerator/)

Venus is committed to offer a fully functional deal-making experience for both storage clients and storage providers on the scale. As the Filecoin network grows and the community strives towards a more storage deal weighted growth than committed capacity growth, the Venus community takes on the challenge to help shape this vision with the Venus Deal Accelerator program.

The goal of the Venus Deal Accelerator program is to distribute as much storage deals as it can to the broader storage provider community with focuses on seamlessly bridging the sealing experience that storage providers are already familiar with to the Filecoin deal taking experience. Venus Deal Accelerator program will be responsible for applying large datacap with approved open datasets from fil-plus program and distribute storage deals to its participants running Venus storage systems.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The data stored in Filecoin is from publicly available datasets in various machine learning.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

graphsplit

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://registry.opendata.aws/nasa-gcms/
s3://gcms-samdata-mlchallenge/
https://registry.opendata.aws/comonscreens/
s3://common-screens/
https://registry.opendata.aws/allenai-tqa/
3://ai2-public-datasets/
https://registry.opendata.aws/multi-token-completion/
s3://multi-token-completion/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, Shipping hard drives, Others

How do you plan to choose storage providers

Slack, Filmine, Big data exchange, Others

If you answered "Others" in the previous question, what is the tool or platform you plan to use

We define VenusHub as a platform for community projects. The Deal Accelerator mentioned here is one of them. The Deal Accelerator is aimed at the storage providers of real data storage. We complete the SP screening through they' application and screening rules, and they come from the Filecoin community, so they have no interest relationship.

If you already have a list of storage providers to work with, fill out their names and provider IDs below

This list is a part of it. We are still expanding the available real data storage providers.
https://github.com/data-preservation-programs/filplus-checker-assets/tree/main/filecoin-project/filecoin-plus-large-datasets/issues/345
https://github.com/data-preservation-programs/filplus-checker-assets/tree/main/filecoin-project/filecoin-plus-large-datasets/issues/1444
We will focus on more new SPs (miners) and hope that more miners will real data provide storage for the client.

How do you plan to make deals to your storage providers

Others/custom tool

If you answered "Others/custom tool" in the previous question, enter the details here

venus-market (Droplet). The support of venus-market(Droplet) for real data storage is very mature.

Can you confirm that you will follow the Fil+ guideline

Yes

Joss-Hua commented 1 year ago

Hi @Sunnyiscoming and team, the balance at this address has been exhausted, but the robot has not triggered the next round and has been waiting for several weeks. We need your help to proceed to its next step.

herrehesse commented 12 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 12 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 55.12% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

⚠️ CID sharing has been observed. (Top 3)

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

Joss-Hua commented 11 months ago

keep on

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

Joss-Hua commented 11 months ago

keep going

Sunnyiscoming commented 11 months ago

SP List provided: [{"providerID":"f02813417","City":"Beijing","Country":"CN","SPOrg","shiliu"}, {"providerID":"f020522","City":"Shenzhen","Country":"CN","SPOrg","wenchu"}, {"providerID":"f02104858","City":"HongKong","Country":"CN","SPOrg","Sanchuang"}, {"providerID":"f01968296","City":"Beijing","Country":"CN","SPOrg","YueKeYun"}, {"providerID":"f034548","City":"Texas","Country":"US","SPOrg","ByteByte"},]

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

-- Commented by Stale Bot.

Joss-Hua commented 10 months ago

Sorry for the late reply, please reopen it @Sunnyiscoming @Kevin-FF-USA , thanks, keep going~