filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Daxian Tec - Industry Datasets #2312

Open jeterhunt opened 7 months ago

jeterhunt commented 7 months ago

Data Owner Name

Beijing Daxian Technology Co., Ltd.

What is your role related to the dataset

Dataset Owner

Data Owner Country/Region

China

Data Owner Industry

IT & Technology Services

Website

https://daxiancloud.com/

Social Media

WeChat Official Account: 哒线云
https://daxiancloud.com/

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

800T

Number of replicas to store

7

Weekly allocation of DataCap requested

800TiB

On-chain address for first allocation

f1zshv3gxdkg3lzotybw2hgga5w2lqrgsr3ismpoa

Data Type of Application

Private Commercial/Enterprise

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Beijing Daxian Technology Co., Ltd. operates an industry-oriented SaaS platform dedicated to helping enterprises tap the procurement opportunities and channel resources of large customers, and to empowering enterprises through big data and AI technology.
Our team members all have decades of SaaS-related background, strong big data analysis ability, and rich project management experience; they are experts in the product, technology, data, and sales fields. The company focuses on in-depth mining of customer data resources in industry segments, and has a comprehensive presence in the education, medical, finance, energy, transportation, and government industries to help enterprises find precise customers and core channels, understand the market situation, and allocate sales resources sensibly.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The products are targeted at large financial and economic customers. Multi-dimensional, in-depth data mining and knowledge graph construction help science and technology innovation enterprises achieve intelligent sales in education, medical care, finance, government, and other fields. The platform relies on big data technology to provide users with market analysis, business opportunity mining, channel expansion, intelligent CRM, and other features.
1. Market analysis: based on historical recruitment and acquisition transaction data, conduct statistical analysis, extrapolation, and prediction across products, customer groups, regions, time, and other dimensions, to assist users in making market layout and sales decisions;
2. Business opportunity mining: predict the purchaser's future procurement demand through data collection and correlation analysis of the purchaser's historical procurement, industry trends, project announcements, academic papers, and other data;
3. Channel development: based on in-depth modeling of enterprises in each industry segment, build profiles and relationship maps of channel dealers, and break through the relationship barriers between customers and buyers;
4. Intelligent CRM: provide intelligent and efficient lead and customer management functions, and realize a full-chain closed loop of sales lead insight, mining, outreach, closing, and after-sales management.
We save large datasets to Filecoin for cold storage backup and user retrieval, including marketing statistics, industry news, product information, and so on.

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

China

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

singularity
Boost
lotus

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1016
A bot error caused the previous issue to be closed. We request reopening the LDN so that encapsulation (sealing) can continue.

Please share a sample of the data

https://www.daxiancloud.com/file/articles_0101.zip
https://www.daxiancloud.com/file/articles_0110.zip
https://www.daxiancloud.com/file/articles_0119.zip
https://www.daxiancloud.com/file/articles_0128.zip
https://www.daxiancloud.com/file/articles_0206.zip
https://www.daxiancloud.com/file/articles_0215.zip
https://www.daxiancloud.com/file/articles_0224.zip
https://www.daxiancloud.com/file/articles_0305.zip
https://www.daxiancloud.com/file/articles_0314.zip
https://www.daxiancloud.com/file/articles_0323.zip
https://www.daxiancloud.com/file/articles_0401.zip
https://www.daxiancloud.com/file/articles_0410.zip
https://www.daxiancloud.com/file/articles_0419.zip
https://www.daxiancloud.com/file/articles_0428.zip
https://www.daxiancloud.com/file/articles_0507.zip
https://www.daxiancloud.com/file/articles_0516.zip
https://www.daxiancloud.com/file/articles_0525.zip
https://www.daxiancloud.com/file/articles_0603.zip
https://www.daxiancloud.com/file/articles_0612.zip
https://www.daxiancloud.com/file/articles_0621.zip
...

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, Shipping hard drives

How do you plan to choose storage providers

Slack, Partners, Others

If you answered "Others" in the previous question, what is the tool or platform you plan to use

Wechat, X

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 7 months ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 7 months ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

github-actions[bot] commented 7 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

jeterhunt commented 6 months ago

@Sunnyiscoming Please check it

kevzak commented 6 months ago

@jeterhunt this application qualifies as an Enterprise project. You'll need to follow E-Fil+ guidelines listed here: https://efilplus.super.site/

let me know what questions you have.

jeterhunt commented 6 months ago

@kevzak I had some knowledge of E-FIL+ before, but I recreated this LDN only because the previous one, https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1016, was closed due to an unexpected bug in the bot. This application was opened to continue that project, so I do not want to change this LDN to the E-FIL+ type. I hope you can help me continue the current project.

jeterhunt commented 6 months ago

I have completed the KYC email confirmation in #1016. Considering that it will soon enter the V5 stage, please help me continue the storage work as soon as possible. Thank you!

kevzak commented 6 months ago

@jeterhunt you would need to complete EFil guidelines to continue here. However, the notaries are not active for this program.

I can potentially work with you on this application. The Fil+ program is transitioning this month into new allocator pathways.

Since this is a private dataset, I'd advise you to apply via my allocator, the Enterprise Data Pathway.

I'll have more details in 1-2 weeks