StarRocks / starrocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
https://starrocks.io
Apache License 2.0

[Good First Issue]StarRocks Hands-on Tasks 2024 #40894

Open wangsimo0 opened 6 months ago

wangsimo0 commented 6 months ago

Hi Rockstars,

This is a list of proposed hands-on tasks. If you're new to StarRocks and eager to engage with the community, here are some issues that are well-suited for you to dive into :) They are suitable for gaining hands-on experience and becoming familiar with StarRocks development. This is also an open list: you are welcome to propose more tasks.

Please mention @kateshaowanjou or @wangsimo0 to book an issue, and add a comment in the issue you picked so that it won't be assigned to others. Always discuss the design with the community before you start developing. Some of the issues are really big, so don't hesitate to seek help from the community.

External Catalog related issues

Information Schema

External Catalog

In version 3.2 and later, StarRocks improves compatibility with BI tools by supporting the information_schema database in External Catalog. This database is a useful way to obtain structured metadata. Several views within information_schema currently return empty results, and work is underway to support these views for more complete coverage. Because StarRocks follows the MySQL protocol, it aligns with MySQL's pattern for information_schema. We should maintain compatibility with MySQL, provide as much information as we can, and keep the time these queries consume to a minimum.

  • [ ] Columns view
  • [ ] Views view

Default Catalog

Trino's Compatibility Issues

In version 3.0 and later, StarRocks supports Trino's SQL_dialect mode; however, ongoing enhancements are necessary to further optimize this functionality.

New Functions

Function Mapping

| Done | Trino's function/expression | StarRocks' function/expression | Comment | Assignee |
|---|---|---|---|---|
| [ ] | map_agg(key, value) → map<K,V> | map() | | @Jcnessss |
| [ ] | show schemas from | Show databases from | | |
| [ ] | array_sort(array(T), function(T, T, int)) -> array(T) | array_sortby(, array0 [, array1...]) | Pay attention to the input order. | |
| [ ] | sequence(start, stop), sequence(start, stop, step) (integer data types) | array_generate([start,] end [, step]) | | |
| [ ] | last_day_of_month(x) → date | last_day(x,'month'); | | |
| [ ] | map_from_entries(array(row(K, V))) -> map(K, V) | map_from_arrays | Pay attention to the transformation: SELECT map_from_entries(ARRAY[(1, 'x'), (2, 'y')]); is equivalent to SELECT map_from_arrays([1,2],['x','y']); | |
| [x] | current_catalog | catalog() | | thanks to @macroguo-ghy |
| [x] | current_schema | database() | | thanks to @macroguo-ghy |
| [x] | slice(x, start, length) → array | array_slice(input, offset, length) | | |
| [ ] | approx_set(x) → HyperLogLog | HLL_HASH(column_name) | | |
| [ ] | empty_approx_set() → HyperLogLog | HLL_EMPTY() | | |
| [ ] | merge(HyperLogLog) → HyperLogLog | HLL_RAW_AGG(hll) | | |
| [ ] | show schemas from | show databases from | https://github.com/StarRocks/starrocks/issues/40868 | |
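
To make the two kinds of mapping in the table above concrete, here is a minimal Python sketch. It is not how StarRocks implements its Trino dialect; the helper names `rename_functions` and `entries_to_arrays` and the `SIMPLE_RENAMES` table are hypothetical and exist only for illustration. The first helper covers rows that look like plain 1:1 renames, and the second shows the data reshaping that the map_from_entries row asks contributors to watch out for.

```python
import re

# Rows from the table that look like plain 1:1 renames (Trino -> StarRocks).
# Rows that need argument changes (array_sort, sequence, last_day_of_month,
# map_from_entries) are deliberately left out of this illustrative table.
SIMPLE_RENAMES = {
    "slice": "array_slice",
    "approx_set": "HLL_HASH",
    "empty_approx_set": "HLL_EMPTY",
}

# Matches an identifier immediately followed by "(", i.e. a function call.
_CALL = re.compile(r"\b([A-Za-z_][A-Za-z0-9_]*)\s*\(")


def rename_functions(sql: str) -> str:
    """Rename known Trino function calls by name only (hypothetical helper).

    This is a text-level sketch, not a SQL parser: a matching name inside
    a string literal or comment would be rewritten as well.
    """
    def swap(match: re.Match) -> str:
        new_name = SIMPLE_RENAMES.get(match.group(1).lower())
        return match.group(0) if new_name is None else new_name + "("

    return _CALL.sub(swap, sql)


def entries_to_arrays(entries):
    """Split [(k, v), ...] into (keys, values).

    map_from_entries takes one array of (key, value) rows, while
    map_from_arrays takes a key array and a value array, so the
    rewrite has to unzip the pairs.
    """
    keys = [k for k, _ in entries]
    values = [v for _, v in entries]
    return keys, values


if __name__ == "__main__":
    print(rename_functions("SELECT slice(a, 1, 2), approx_set(b) FROM t"))
    print(entries_to_arrays([(1, "x"), (2, "y")]))
```

The second helper mirrors the example in the table: the pairs (1, 'x') and (2, 'y') become the key array [1, 2] and the value array ['x', 'y'].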

Other Enhancements

Apache Hudi & Delta Lake Capabilities

More Connectors

More Capabilities

alberttwong commented 6 months ago

I'd add iceberg tagging and branch query

alberttwong commented 6 months ago

https://github.com/StarRocks/starrocks/issues/37959

241600489 commented 5 months ago

I want to pick #38989 @wangsimo0

mygrsun commented 4 months ago

I want to pick #40881 @wangsimo0

yangzho12138 commented 3 months ago

I want to pick #37089 @wangsimo0

kateshaowanjou commented 3 months ago

> I want to pick #37089 @wangsimo0

You need to also comment under the issue #37089 so I can assign it to you. If you have any issues during the development process, I can introduce you to the relevant discussion group. https://853921.ma3you.cn/articles/b12e90J/

yangzho12138 commented 2 months ago

I want to pick #46105 @wangsimo0

FLAYhhh commented 2 weeks ago

@wangsimo0 Hi, I want to add Delta Lake Compatibilities. Has this requirement been resolved?

kateshaowanjou commented 2 weeks ago

> @wangsimo0 Hi, I want to add Delta Lake Compatibilities. Has this requirement been resolved?

Are you referring to the "Add Delta Lake sink" function? There's no one working on it at the moment and it'd be awesome if you are willing to give it a try!😎

FLAYhhh commented 2 weeks ago

Sure thing! I'd be happy to take this on.

amoghmargoor commented 2 weeks ago

@kateshaowanjou @wangsimo0 Can I pick this issue: https://github.com/StarRocks/starrocks/issues/38989 if it's not being worked on by anyone?

kateshaowanjou commented 2 weeks ago

> Sure thing! I'd be happy to take this on.

This issue is not the easiest one so feel free to add my WeChat:wanjoushao if you need help!

Jcnessss commented 5 days ago

@kateshaowanjou @wangsimo0 We are migrating from Trino to StarRocks and working on the functions. Can I pick the map_agg issue?

SoraNimi commented 1 day ago

I want to pick this issue #46060 @wangsimo0 @kateshaowanjou