apache / superset

Apache Superset is a Data Visualization and Data Exploration Platform
https://superset.apache.org/
Apache License 2.0
61.59k stars 13.45k forks source link

Cannot load filter Error: ORA-00979: not a GROUP BY expression #28327

Open mujohiddin opened 4 months ago

mujohiddin commented 4 months ago

Bug description

I faced this problem with superset oracle connection. There is a connection with oracle and there is data, but there is this error in the filter, this process happened after connecting to oracle postres, this problem is not there, because it is the superset settings or the queries are wrong? photo_2024-05-03_09-25-33

and where does such an error 1 appear in the console?

photo_2024-05-03_09-29-25

How to reproduce the bug

I don't know

Screenshots/recordings

photo_2024-05-03_09-25-33 photo_2024-05-03_09-29-25

Superset version

3.1.2

Python version

3.9

Node version

16

Browser

Chrome

Additional context

No response

Checklist

mujohiddin commented 4 months ago

@mistercrunch @villebro @michael-s-molina @betodealmeida

Can you help me, I have not been able to solve it for several weeks

mujohiddin commented 4 months ago

I faced this problem with superset oracle connection. There is a connection with oracle and there is data, but there is this error in the filter, this process happened after connecting to oracle postres, this problem is not there, because it is the superset settings or the queries are wrong?

michael-s-molina commented 4 months ago

Hi @mujohiddin. Oracle does not support GROUP BY 1. I believe you are using a virtual dataset? If that's the case, just replace GROUP BY 1 with GROUP BY "CODE".

mujohiddin commented 4 months ago

@michael-s-molina but 5 months ago version didn't have this error, I got this error when upgrading to new version.

mujohiddin commented 4 months ago

@michael-s-molina GROUP BY 1 with GROUP BY "CODE". I could not find this part, can you tell me where to replace it

michael-s-molina commented 4 months ago

I could not find this part, can you tell me where to replace it

1 - Go to the filter configuration 2 - Check what's the dataset that's being used for the filter 3 - Go to the datasets list and check if that dataset is a virtual dataset 4 - If it is, you can edit its query

mujohiddin commented 4 months ago

@michael-s-molina , I followed the steps what you suggested, but there problem is not in the dataset. The above query was generated by Superset itself on filter pane. I think issue is with SQLA dialect, because the source was change to Postgre, it works, since Postgre supports Group BY 1, but Oracle does not. Therefore I am guesing it is due to SQLA dialect on newer version of Superset.

Moreover, I cannot create Table, Pivot charts , they are failling with the same error : photo_2024-05-03_16-58-19 photo_2024-05-03_16-58-24

michael-s-molina commented 4 months ago

The above query was generated by Superset itself on filter pane. I think issue is with SQLA dialect, because the source was change to Postgre, it works, since Postgre supports Group BY 1, but Oracle does not.

@rusackas @betodealmeida Does any of you have an Oracle instance to test this? Or do you know who's the Oracle dialect maintainer?

michael-s-molina commented 4 months ago

@mujohiddin Can you also share what was the previous version where this was working?

mujohiddin commented 4 months ago

@michael-s-molina

  1. I don't remember the exact version, it was about 6 months ago, It works successfully with postgre, clickhouse, oracle
  2. The last new superset I installed recently was 1 month before the project in this case there is a problem with oracle
rusackas commented 4 months ago

Sadly, I don't know of anyone with Oracle. It's still on the wish list in the rolodex, if anyone on this thread would be kind enough to sign up.

mujohiddin commented 4 months ago

photo_2024-05-06_10-52-19 how can i detect the old superset version, 0.0.dev is the same in the interface

@mistercrunch @villebro @michael-s-molina @betodealmeida @john-bodley

RavshanbekXB commented 4 months ago

I have also faced this issue with Oracle, any updates @rusackas , @michael-s-molina , @mistercrunch ? Any workaround applied ? @mujohiddin

mujohiddin commented 4 months ago

i have old version working with oracle but i have problems in new version, do i need to add some parts from old version or some scprits? exactly in which python file or class is this generation process taking place. If you tell me that, I will send it to you, and we will compare the status of the two versions

@mistercrunch @villebro @michael-s-molina @betodealmeida @john-bodley @rusackas

mujohiddin commented 4 months ago

I have also faced this issue with Oracle, any updates @rusackas , @michael-s-molina , @mistercrunch ? Any workaround applied ? @mujohiddin

No, the solution has not been found yet. I am waiting for a response from contributions

rusackas commented 4 months ago

Let's see if our AI overlords can help at all...

@dosu-bot

dosubot[bot] commented 4 months ago

To address the "ORA-00979: not a GROUP BY expression" error in Superset when using Oracle, it seems the issue stems from Oracle's lack of support for the "GROUP BY 1" shorthand syntax, which is not an issue with databases like PostgreSQL that support this syntax. The error arises because Oracle requires explicit column names in the GROUP BY clause instead of positional references.

A potential solution involves customizing the SQL compilation process in SQLAlchemy, which Superset uses to interact with databases. You can create a custom SQLAlchemy dialect for Oracle or modify the existing one to ensure that GROUP BY expressions are correctly handled. This involves overriding the visit_group_by method in a custom compiler class to ensure that column names are used in GROUP BY clauses instead of positional references.

Additionally, changes in newer versions of Superset, particularly in how SQL statements are generated for Oracle databases, could contribute to this issue. These changes might involve adjustments in time grain expressions, date-time conversion methods, and enforced column alias quoting, which could affect the generation of GROUP BY clauses.

A specific commit addressing a similar issue by modifying the SQL generation logic in Superset to comply with Oracle's requirements might also offer a clue. This commit ensures that columns used in the ORDER BY clause are included in the SELECT clause, as required by Oracle.

Implementing these solutions requires a good understanding of Python, SQLAlchemy, and Superset's architecture. If you're comfortable making these adjustments, you could potentially resolve the issue. Otherwise, it might be beneficial to reach out to the Superset community for further assistance or consider hiring a developer with the necessary expertise.

#### Details **Sources** - [Group by filters broken in 1.5.0](https://github.com/apache/superset/issues/20000) - [superset/models/helpers.py](https://github.com/apache/superset/blob/master/superset/models/helpers.py) - [UPDATING.md](https://github.com/apache/superset/blob/master/UPDATING.md) - [superset/db_engine_specs/oracle.py](https://github.com/apache/superset/blob/master/superset/db_engine_specs/oracle.py)
**About Dosu** This response is meant to be useful and save you time. It is **not meant to be a precise solution**, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

To continue the conversation, mention @dosu.

villebro commented 4 months ago

Debugging this actually only requires having access to the Oracle SQLA dialect. I assume @mujohiddin you are using the default dialect included in SQLA? If so, I think we may be doing something wrong, as I'd be extremely surprised if SQLA has bug of this magnitude..

vranjan25 commented 3 months ago

This is an issue from superset filter, it's not using the column name in group by. DATABASE: Oracle SQLALCHEMY URI: oracle+cx_oracle://myUser:XXXXXXXXXX@dev

image

image

image

mistercrunch commented 3 months ago

I'm not clear on what may trigger sqlalchemy to use ordinals (GROUP BY 1) while using its group_by() clause, for anything. In my experience it will always use aliases.

And why would it do it for Oracle, one of the databases that seems to not support this (?)

vranjan25 commented 3 months ago

That's the question why superset is doing this?

mujohiddin commented 2 months ago

Is this problem still not fixed? it was working in old version ie below versions 3.0.0, 3.0.1, 3.0.2 new version is showing this problem. I know there have been a lot of changes since then, but it's important that later we still have this problem, we need to find it. @mistercrunch @villebro @michael-s-molina @betodealmeida @john-bodley @rusackas

rusackas commented 2 months ago

it's important that later we still have this problem, we need to find it.

100% agreed, but I don't think a single one of the people you mentioned here has an Oracle DB to test against. If you do, would you be able or willing to do some digging here? It's times like this where we can really use the help of external contributions to the project. The comment from the Dosu bot above may be accurate in that a different or custom SQLAlchemy dialect may be required, but an Oracle DB would be needed to test any of that.

fraannco commented 2 months ago

Hi, I have just installed this version of superset on my server, is there any way to install an older version that does not have this problem with oracle? I get the same message at the time of making the charts.

mistercrunch commented 2 months ago

You can git checkout any version tag and build from there.

mujohiddin commented 2 months ago

This was working in versions 3.0.1 and 3.0.2, so you should see changes in later versions

mistercrunch commented 2 months ago

so you should see changes in later versions

Lots of changes since then, it'd be great if someone could bissect this one. @rusackas published this recently -> https://preset.io/blog/using-git-bisect-to-find-and-fix-bugs/

mujohiddin commented 2 months ago

@Sumeet photo_2024-06-27_10-40-57

this bug was not present in version 3.0.1 and 3.0.2, this problem should be fixed in the new version. this error is omitted

mujohiddin commented 2 months ago

@sumeet-singh04

wangyan1856 commented 3 weeks ago

I found a word around for this: comment out this code in the superset/models/helpers.py file:

         try.
             sql = SQLStatement(sql, engine=self.db_engine_spec.engine).format()
         except SupersetParseError.
             logger.warning("Unable to parse SQL to format it, passing it as-is")

Here the sqlglot tool is used to parse the sql, the purpose is to make sure that there is only one statement, I guess it should be in order to avoid potential problems with custom SQL queries, but it shouldn't be necessary for the sql generated by superset itself