StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
https://starrocks.io
Apache License 2.0
8.91k stars 1.79k forks source link

String case conversion functions doesnt work with cyrilic text #45064

Open dnbnero opened 6 months ago

dnbnero commented 6 months ago

I tried to convert some cyrilic text to lowercase, but lower() function did not affect these lines. Also upper and ucase dont work as excepted. These functions work as expected with english lines.

Steps to reproduce the behavior

  1. SELECT upper('теКст') as to_upper, lower('ТЕкСТ') as to_lower

Expected behavior

to_upper = 'ТЕКСТ' to_lower = 'текст'

Real behavior

to_upper = 'теКст' to_lower = 'ТЕкСТ'

StarRocks version

dnbnero commented 6 months ago

Possible these funtions work only with ascii characters - select ucase('für') returns FüR, but FÜR is excepted

github-actions[bot] commented 2 days ago

We have marked this issue as stale because it has been inactive for 6 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to StarRocks!

eshishki commented 2 days ago

this is still very much needed