apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0
12.8k stars, 3.29k forks

Doris Roadmap 2023 #16392

Closed: morningman closed this issue 10 months ago

morningman commented 1 year ago

This is Apache Doris Roadmap 2023.

The plan is currently under discussion, so if you have comments or suggestions on any aspect of the plan or beyond, please feel free to leave a comment or send an email to dev@doris.apache.org.

We encourage developers to discuss anything on the dev mailing list. To subscribe to the mailing list, please refer to How to subscribe.

We will gradually create issues for each direction of the plan to describe and track progress in detail. Developers who wish to contribute are also welcome to create issues directly and link them to this one (just leave a comment).

Roadmap 2022

Our Main Focus

Release Schedule

We plan to release Apache Doris at the following pace:

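| Month | V 1.2.x | V 2.0.x | V 2.1.x | V 2.2.x |
|-------|---------|---------|---------|---------|
| Jan. | 1.2.1 | | | |
| Feb. | 1.2.2 | | | |
| Mar. | 1.2.3 | 2.0.0 alpha | | |
| Apr. | 1.2.4 | 2.0.0 beta1 | | |
| May | 1.2.5 | 2.0.0 beta2 | | |
| Jun. | 1.2.6 | 2.0.0 | | |
| Jul. | | 2.0.1 | | |
| Aug. | | 2.0.2 | | |
| Sept. | | 2.0.3 | 2.1.0 alpha | |
| Oct. | | 2.0.4 | 2.1.0 beta | |
| Nov. | | | 2.1.0 | |
| Dec. | | | 2.1.1 | |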

Features

We plan to develop or continuously optimize these features:

Hybrid Workloads

Semi-Structure Data Analysis

Lakehouse & Data Integration

New Optimizer (Nereids)

Cost Efficiency & Performance

Data Modeling & Storage Engine

Ecosystem

Utility & Stability

emerkfu commented 1 year ago

Feature request: dynamic target-table updates for the flink-doris-connector sink.

The current flink-doris-connector handles writes to an already configured table well. However, adding a new target table to the same Flink job requires stopping the job and reloading the target table names. If flink-doris-connector could be configured to read the target table from a field in each stream record, the Flink job would not need to be stopped, and operations and maintenance would become easier.
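The routing idea in this request can be sketched in a few lines. This is only an illustration of picking the target table from a field in each record; it is not the flink-doris-connector API. The function name `route_by_table_field` and the field name `target_table` are hypothetical.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List


def route_by_table_field(
    records: Iterable[Dict[str, Any]],
    table_field: str = "target_table",  # hypothetical routing field name
) -> Dict[str, List[Dict[str, Any]]]:
    """Group records into per-table batches using a field carried in each record."""
    batches: Dict[str, List[Dict[str, Any]]] = defaultdict(list)
    for record in records:
        if table_field not in record:
            raise ValueError(f"record has no '{table_field}' field: {record!r}")
        table = record[table_field]
        # Drop the routing field so that only the row data is written.
        row = {k: v for k, v in record.items() if k != table_field}
        batches[table].append(row)
    return dict(batches)


stream = [
    {"target_table": "db.orders", "id": 1},
    {"target_table": "db.users", "id": 2},
    {"target_table": "db.orders", "id": 3},
]
batches = route_by_table_field(stream)
```

With routing of this kind, a record that names a previously unseen table simply opens a new batch, so no restart is needed to add a target table.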

wangbo commented 1 year ago

I think CCR (Cross-Cluster Replication) is a nice idea. If we can clone a cluster partially or completely from the production environment, we can do many things with the cloned cluster: 1. Replay online queries against the cloned cluster to find bugs that occur in production. 2. Verify a major version upgrade, such as upgrading the cloned cluster from 1.x to 2.x, to find potential problems.

To achieve the above goals, we still need some tools, such as query replay, load, and DDL tools. This would undoubtedly be very helpful for improving stability and enabling rapid development.

yangzhg commented 1 year ago

Column-level authorization is the capability of controlling access to specific columns or fields within a database or table, rather than just the entire table itself. This type of authorization allows for fine-grained control over data access and can enhance security and privacy by preventing unauthorized users from accessing sensitive information. So I think this is a very attractive feature.
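As a rough illustration of the concept, a column-level check compares the columns a query references against the columns granted to the user for that table. The grant store and the function `denied_columns` below are hypothetical and do not reflect how Doris stores or checks privileges.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical grant store: (user, table) -> columns the user may read.
Grants = Dict[Tuple[str, str], Set[str]]


def denied_columns(
    grants: Grants, user: str, table: str, columns: Iterable[str]
) -> List[str]:
    """Return the requested columns that the user is not allowed to read."""
    allowed = grants.get((user, table), set())
    return [c for c in columns if c not in allowed]


grants: Grants = {("analyst", "db.customers"): {"id", "city"}}

# The analyst may read id and city, but not the sensitive phone column.
blocked = denied_columns(grants, "analyst", "db.customers", ["id", "phone"])
```

A table-level check would only ask whether the user may read `db.customers` at all; the per-column set is what makes the control fine-grained.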

siriume commented 1 year ago

Support for multiple group_concat(distinct xxx order by xx) expressions is something we urgently need now. Our current workaround implements it by joining multiple subqueries. Because so many rows are scanned, it is very slow. Is there any plan to add this function in the future?
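To make the request concrete, the sketch below computes several distinct, ordered concatenations per group in a single pass over the rows, instead of one subquery per column. It is an illustration only, not Doris code. The function name is hypothetical, and it assumes each column is ordered by its own values compared as strings.

```python
from typing import Any, Dict, Iterable, List, Set


def multi_group_concat_distinct(
    rows: Iterable[Dict[str, Any]],
    group_key: str,
    value_keys: List[str],
    sep: str = ",",
) -> Dict[Any, Dict[str, str]]:
    """For each group, join the distinct values of every requested column."""
    groups: Dict[Any, Dict[str, Set[Any]]] = {}
    for row in rows:
        per_column = groups.setdefault(
            row[group_key], {k: set() for k in value_keys}
        )
        for k in value_keys:
            per_column[k].add(row[k])  # the set keeps only distinct values
    return {
        group: {
            # Assumption: order each column by its own values, as strings.
            k: sep.join(sorted(str(v) for v in values))
            for k, values in columns.items()
        }
        for group, columns in groups.items()
    }


rows = [
    {"uid": 1, "tag": "b", "city": "x"},
    {"uid": 1, "tag": "a", "city": "x"},
    {"uid": 1, "tag": "b", "city": "y"},
    {"uid": 2, "tag": "c", "city": "z"},
]
result = multi_group_concat_distinct(rows, "uid", ["tag", "city"])
```

Each input row is scanned once regardless of how many columns are concatenated, which is the cost the subquery-join workaround cannot avoid.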

wangbo commented 1 year ago

How about linking the related PRs/issues to each item in the list?

subkanthi commented 1 year ago

I'm interested in contributing to some of the work. How do I go about doing it? Should I create a separate issue and add it here in the comments? Please let me know.

luzhijing commented 1 year ago

> I'm interested in contributing to some of the work. How do I go about doing it? Should I create a separate issue and add it here in the comments? Please let me know.

That's great! Welcome to the Apache Doris community! Feel free to exchange any ideas in the comments.

wangshisan commented 1 year ago

> Writing Doris data format file externally

What's the status of this task, any update?

morningman commented 1 year ago

> > Writing Doris data format file externally
>
> What's the status of this task, any update?

Still a work in progress. It may be released in the middle of this year.

wangshisan commented 1 year ago

> > > Writing Doris data format file externally
> >
> > What's the status of this task, any update?
>
> Still a work in progress. It may be released in the middle of this year.

May I ask how you would implement this? By implementing a full Java writer, or by using JNI to invoke the existing C++ code in the backend?

morningman commented 1 year ago

By JNI, possibly

fakeyanss commented 1 year ago

> Full support for K8s deployment

When did this start, or what's the current status?

HunterPan commented 1 year ago

Will Doris support time-series data, like YMatrix or Greenplum? I hope so.

hqx871 commented 11 months ago

Hi team, any update about MergeSortJoin?