X-lab2017 / open-digger

Open source analysis tools
https://open-digger.cn
Apache License 2.0
300 stars 87 forks source link

[Batch Label Data] Add more label data for Database technical area labled on dbdb.io and DB-Engines Ranking by Sep 26, 2023. #1393

Closed birdflyi closed 1 year ago

birdflyi commented 1 year ago

Description

I want to add some labeled data into OpenDigger to help us for our community analysis. The data is based on a dataset fused by data from dbdb.io and DB-Engines by Sep 26, 2023. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1376, which is based on data by Aug 26, 2023.

Filter conditions: Collected by dbdb.io on Sep 26, 2023 OR Rankings in the DB-Engines Rankings table on Sep 26, 2023; Has open source license; Has repository link on GitHub.

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Array

Type: Tech-1

Repos:

Label: Document

Type: Tech-1

Repos:

Label: Object Oriented

Type: Tech-1

Repos:

Label: Relational

Type: Tech-1

Repos:

Label: Search Engine

Type: Tech-1

Repos:

Label: Vector

Type: Tech-1

Repos:

birdflyi commented 1 year ago

/parse-github-id

github-actions[bot] commented 1 year ago

Get repo and org/user ids done.

"### Description\n\nI want to add some labeled data into OpenDigger to help us for our community analysis.
The data is based on a dataset fused by data from dbdb.io and DB-Engines by Sep 26, 2023. It is an incremental version of labeled data submited in https://github.com/X-lab2017/open-digger/issues/1376, which is based on data by Aug 26, 2023.

Filter conditions: Collected by dbdb.io on Sep 26, 2023 OR Rankings in the DB-Engines Rankings table on Sep 26, 2023; Has open source license; Has repository link on GitHub.

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Array

Type: Tech-1

Repos:

- 664133375 # repo:epsilla-cloud/vectordb

Label: Document

Type: Tech-1

Repos:

- 642912355 # repo:awa-ai/awadb
- 587246478 # repo:endatabas/endb
- 628503457 # repo:jdagdelen/hyperDB
- 635374751 # repo:jina-ai/vectordb
- 632269609 # repo:kagisearch/vectordb
- 520096046 # repo:marqo-ai/marqo
- 223462217 # repo:resilientdb/resilientdb
- 186332888 # repo:vearch/vearch
- 634656527 # repo:vector5ai/vector5db

Label: Object Oriented

Type: Tech-1

Repos:

- ADBSQL/AntDB # not found
- 681142 # repo:Bobris/BTDB
- 52080367 # repo:CUBRID/cubrid
- 329729926 # repo:CondensationDB/Condensation-java
- 8799170 # repo:DevrexLabs/OrigoDB
- 516821813 # repo:HydrasDB/hydra
- 1244027 # repo:ModeShape/modeshape
- 150752008 # repo:SapphireDb/SapphireDb
- 124423054 # repo:The-Alchemist/perst
- 569871553 # repo:VeloxDB/VeloxDB
- 206403 # repo:apache/jackrabbit
- 246343828 # repo:atoti/atoti
- 396856161 # repo:authzed/spicedb
- 95070401 # repo:devrexlabs/memstate
- 95817032 # repo:edgedb/edgedb
- 15001136 # repo:etoile/CoreObject
- 42143916 # repo:fern4lvarez/piladb
- 240387847 # repo:gaia-platform/GaiaPlatform
- 204261394 # repo:iboxdb/db4o-gpl
- 5453989 # repo:jankotek/mapdb
- 1776883 # repo:kimchy/compass
- 25225465 # repo:markmeeus/MarcelloDB
- 273564373 # repo:morecraf/Siaqodb
- 351806852 # repo:neondatabase/neon
- 79901405 # repo:objectbox/objectbox-java
- 7083240 # repo:orientechnologies/orientdb
- 432844875 # repo:orioledb/orioledb
- 37285717 # repo:pilgr/Paper
- 14702444 # repo:pipelinedb/pipelinedb
- 927442 # repo:postgres/postgres
- 1917262 # repo:realm/realm-core
- 3893984 # repo:tzaeschke/zoodb
- 88111990 # repo:zhihu/Matisse
- 7357595 # repo:zopefoundation/ZODB

Label: Relational

Type: Tech-1

Repos:

- 587246478 # repo:endatabas/endb
- 183929744 # repo:erikgrinaker/toydb
- 26774602 # repo:proullon/ramsql

Label: Search Engine

Type: Tech-1

Repos:

- 341374920 # repo:apache/solr
- 507775 # repo:elastic/elasticsearch
- 95614931 # repo:manticoresoftware/manticoresearch
- 520096046 # repo:marqo-ai/marqo
- 130688011 # repo:meilisearch/meilisearch
- 334274271 # repo:opensearch-project/OpenSearch
- 36992044 # repo:sphinxsearch/sphinx
- 79317191 # repo:typesense/typesense
- 60377070 # repo:vespa-engine/vespa
- 735981 # repo:xapian/xapian

Label: Vector

Type: Tech-1

Repos:

- 642912355 # repo:awa-ai/awadb
- 304530333 # repo:featureform/embeddinghub
- 628503457 # repo:jdagdelen/hyperDB
- 635374751 # repo:jina-ai/vectordb
- 632269609 # repo:kagisearch/vectordb
- 242383787 # repo:marekgalovic/anndb
- 520096046 # repo:marqo-ai/marqo
- 478288303 # repo:nuclia/nucliadb
- 40127179 # repo:pilosa/pilosa
- 186332888 # repo:vearch/vearch
- 634656527 # repo:vector5ai/vector5db

"

birdflyi commented 1 year ago

/self-assign