X-lab2017 / open-digger

Open source analysis tools
https://open-digger.cn
Apache License 2.0
280 stars 78 forks source link

[Batch Label Data] Add more label data for Database technical area labled on dbdb.io and DB-Engines Ranking up to May 30, 2024. #1573

Closed birdflyi closed 1 month ago

birdflyi commented 1 month ago

Description

I want to add some labeled data into OpenDigger to help us for our community analysis. The data is based on a dataset fused by data from dbdb.io and DB-Engines by May 30, 2024. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1551, which is based on data by Apr 30, 2024.

Filter conditions: Collected by dbdb.io on May 30, 2024 OR Rankings in the DB-Engines Rankings table on May 30, 2024; Has open source license; Has repository link on GitHub.

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Document

Type: Tech-1

Repos:

Label: Key-value

Type: Tech-1

Repos:

Label: Object Oriented

Type: Tech-1

Repos:

Label: Relational

Type: Tech-1

Repos:

Label: Time Series

Type: Tech-1

Repos:

Label: Vector

Type: Tech-1

Repos:

birdflyi commented 1 month ago

/parse-github-id

github-actions[bot] commented 1 month ago

Get repo and org/user ids done.

"### Description\n\nI want to add some labeled data into OpenDigger to help us for our community analysis.
The data is based on a dataset fused by data from dbdb.io and DB-Engines by May 30, 2024. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1551, which is based on data by Apr 30, 2024.

Filter conditions: Collected by dbdb.io on May 30, 2024 OR Rankings in the DB-Engines Rankings table on May 30, 2024; Has open source license; Has repository link on GitHub.

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Document

Type: Tech-1

Repos:

- 411979983 # repo:elmarti/camadb

Label: Key-value

Type: Tech-1

Repos:

- 580788054 # repo:web3-storage/pail
- 20433978 # repo:xtreemfs/babudb

Label: Object Oriented

Type: Tech-1

Repos:

- 160528020 # repo:mateusfreira/nun-db

Label: Relational

Type: Tech-1

Repos:

- 590507334 # repo:StereoDB/StereoDB
- 114619105 # repo:apache/kyuubi

Label: Time Series

Type: Tech-1

Repos:

- 441299511 # repo:reductstore/reductstore

Label: Vector

Type: Tech-1

Repos:

- 793209340 # repo:carsonpo/haystackdb

"

birdflyi commented 1 month ago

/self-assign