apache / drill

Apache Drill is a distributed MPP query layer for self describing data
https://drill.apache.org/
Apache License 2.0

Error when generating metadata of file with 155 columns #2847

Open Maegor opened 10 months ago

Maegor commented 10 months ago

Hello! I'm having trouble generating the metadata for a Parquet file in S3 with 156 columns (some with long names). The Drill version is 1.21.1. In my IDE I see the error: "SQL Error: INTERNAL_ERROR ERROR: Code generation error - likely code error. Fragment: 0:0"

I've attached the complete log (error_drill_metadata.log), which shows much more detail. For example: "Caused by: java.lang.Exception: File 'org.apache.drill.exec.compile.DrillJavaFileObject[StreamingAggregatorGen1872.java]', Line 13982, Column 18: StreamingAggregatorGen1872.java:13982: error: code too large".

Steps to reproduce:

  1. Upload the empty_parquet.parquet file (inside empty_parquet.zip) to an S3 bucket, for example "s3://your-bucket/empty_parquet/empty_parquet.parquet".
  2. Configure Drill to access the bucket
  3. Execute the command "ANALYZE TABLE s3.empty_parquet refresh METADATA;".
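
For anyone who wants to run step 3 without an IDE, here is a minimal sketch that submits the same statement through Drill's REST query endpoint. It assumes a Drillbit reachable at `localhost` on the default web port 8047 with no authentication, and that the `s3` storage plugin from step 2 is already configured; the helper name `build_request` is hypothetical and only for illustration.

```python
import json
import urllib.error
import urllib.request

# Assumption: Drill's web/REST interface is on the default port of a local Drillbit.
DRILL_URL = "http://localhost:8047/query.json"


def build_request(sql, url=DRILL_URL):
    """Build (but do not send) a POST request for Drill's REST query endpoint."""
    body = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Statement from step 3; the trailing semicolon is left out here
    # (assumption: the REST endpoint does not need it).
    req = build_request("ANALYZE TABLE s3.empty_parquet REFRESH METADATA")
    try:
        with urllib.request.urlopen(req) as resp:
            print(json.loads(resp.read()))
    except urllib.error.HTTPError as err:
        # On failure, print whatever error body the server returned.
        print(err.code, err.read().decode("utf-8", errors="replace"))
```

If the setup matches the report, the response body should carry the same code generation error instead of a result set.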