X-lab2017 / open-digger

Open source analysis tools
https://open-digger.cn
Apache License 2.0
291 stars 86 forks source link

[Batch Label Data] Add and revise multiple label data for Database technical area labled on dbdb.io and DB-Engines Ranking by June 10th, 2023. #1313

Closed birdflyi closed 1 year ago

birdflyi commented 1 year ago

Description

Description

I want to add some labeled data into OpenDigger to help us for our community analysis. The data is based on a dataset fused by data from dbdb.io and DB-Engines by June 10th, 2023. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1283, which is based on data by May 7th, 2023.

Filter conditions: Collected by dbdb.io on June 10th, 2023 OR Rankings in the DB-Engines Rankings table on June 10th, 2023; Has open source license; Has repository link on GitHub.

Features:

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Array

Type: Tech-1

Repos:

Label: Key-value

Type: Tech-1

Repos:

Label: Relational

Type: Tech-1

Repos:

Label: Search engine

Type: Tech-1

Repos:

Label: Vector DBMS

Type: Tech-1

Repos:

birdflyi commented 1 year ago

/parse-github-id

github-actions[bot] commented 1 year ago

Get repo and org/user ids done.

"### Description\n\nDescription

I want to add some labeled data into OpenDigger to help us for our community analysis.
The data is based on a dataset fused by data from dbdb.io and DB-Engines by June 10th, 2023. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1283, which is based on data by May 7th, 2023.

Filter conditions: Collected by dbdb.io on June 10th, 2023 OR Rankings in the DB-Engines Rankings table on June 10th, 2023; Has open source license; Has repository link on GitHub.

Features:
- A new label has appeared: Vector DBMS; # not found
- The labels of a small portion of the data have changed. # not found

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Array

Type: Tech-1

Repos:

- 534109388 # repo:EuclidOLAP/EuclidOLAP

Label: Key-value

Type: Tech-1

Repos:

- 606125384 # repo:paypal/junodb

Label: Relational

Type: Tech-1

Repos:

- 496817075 # repo:GlareDB/glaredb
- 20587599 # repo:apache/flink
- 91715647 # repo:gnocchixyz/gnocchi
- 507829396 # repo:openGemini/openGemini

Label: Search engine

Type: Tech-1

Repos:

- 60377070 # repo:vespa-engine/vespa

Label: Vector DBMS

Type: Tech-1

Repos:

- 201403923 # repo:activeloopai/deeplake
- 546206616 # repo:chroma-core/chroma
- 208728772 # repo:milvus-io/milvus
- 268163609 # repo:qdrant/qdrant
- 55072677 # repo:semi-technologies/weaviate
- 195619075 # repo:vdaas/vald

"

birdflyi commented 1 year ago

/self-assign