trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache License 2.0

Release notes for 362 #9015

Closed martint closed 2 years ago

martint commented 2 years ago

Ashhar Hasan

ebyhr commented 2 years ago
BigQuery
* Fix query failure for predicate pushdown to BigQuery `datetime` type. ({issue}`9005`)

9005

findepi commented 2 years ago
General
* Fix query failure when query contains a cast from `varchar` to a shorter `char`. ({issue}`9036`)

9036 https://github.com/trinodb/trino/pull/9038

ebyhr commented 2 years ago
BigQuery
* Fix incorrect results when using parameterized `numeric` type with non-default precision and scale. ({issue}`9060`)

9060

sopel39 commented 2 years ago
General
* Improve join performance. ({issue}`8974`)

https://github.com/trinodb/trino/pull/8974

dain commented 2 years ago
Hive
* Limit maximum file size to 1 GB by default.  The limit is not exact and is performed best effort.  The limit can be set with the `target_max_file_size` session property, or the `hive.target-max-file-size` configuration property. ({issue}`7991`)

7991
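
As a rough illustration of the session property named above, the limit might be adjusted per session as follows. The catalog name `hive` and the `512MB` value are assumptions made for this example; only the property name comes from the note.

```sql
-- Assumes the Hive connector is mounted as a catalog named "hive".
-- Lower the best-effort target size for files written in this session.
SET SESSION hive.target_max_file_size = '512MB';

-- Return to the configured default.
RESET SESSION hive.target_max_file_size;
```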

dain commented 2 years ago
General
* Fix `ALTER SCHEMA ... SET AUTHORIZATION` to resolve roles using the catalog of the schema instead of the session catalog. ({issue}`9066`)

Hive
* Fix file-based metastore to always add the admin role by default. ({issue}`9066`)

9066
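
A minimal sketch of the statement affected by the General fix, assuming a catalog named `hive`, a schema named `web`, and a role named `admin` that exists in that catalog (all three names are hypothetical):

```sql
-- With the fix, the role "admin" is resolved in the catalog of the schema
-- ("hive" here) rather than in the current session catalog.
ALTER SCHEMA hive.web SET AUTHORIZATION ROLE admin;
```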

ebyhr commented 2 years ago
BigQuery
* Fix query failure when accessing tables having unsupported type columns. ({issue}`9086`)

9086

ebyhr commented 2 years ago
Cassandra
* Change minimum number of speculative executions from 2 to 1. ({issue}`9096`)

9096

dain commented 2 years ago
Security
* Add `IN catalog` clause to `CREATE ROLE`, `DROP ROLE`, `GRANT ROLE`, `REVOKE ROLE`, and `SET ROLE` to specify the target catalog of the statement instead of using the current session catalog.  This change is necessary to support system roles in the future.  The old behavior of these statements can be restored by setting the `deprecated.legacy-catalog-roles` config property. ({issue}`9087`)

9087
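
The new clause could be used roughly as follows. This is only a sketch: the catalog `hive`, the role `analyst`, and the user `alice` are hypothetical names, and `SET ROLE` succeeds only if the current user has been granted the role.

```sql
-- Each statement names its target catalog explicitly instead of relying
-- on the current session catalog.
CREATE ROLE analyst IN hive;
GRANT analyst TO USER alice IN hive;
SET ROLE analyst IN hive;
REVOKE analyst FROM USER alice IN hive;
DROP ROLE analyst IN hive;
```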

ebyhr commented 2 years ago
BigQuery
* Fix query failure for predicate pushdown to BigQuery `float64` type with `+infinity` or `-infinity` value. ({issue}`9122`)

9122

sopel39 commented 2 years ago
General
* Improve performance of queries that contain `DISTINCT` clause. ({issue}`8967`)

https://github.com/trinodb/trino/pull/8967

dain commented 2 years ago
TPCH
* Add support for switching the mapping of floating point values between SQL `DOUBLE` and `DECIMAL` types.  The mapping can be set via the `tpch.double-type-mapping` configuration property. ({issue}`7990`)

7990

sopel39 commented 2 years ago
General
* Improve query performance by reducing CPU overhead of repartitioning data across worker nodes. ({issue}`9102`)

https://github.com/trinodb/trino/pull/9102

ebyhr commented 2 years ago
BigQuery
* Change type mapping of BigQuery `datetime` from `timestamp(3)` to `timestamp(6)` in Trino. ({issue}`9052`)
* Change type mapping of BigQuery `time` from `time with time zone` to `time(6)` in Trino. ({issue}`9052`)
* Change type mapping of BigQuery `timestamp` from `timestamp(3) with time zone` to `timestamp(6) with time zone` in Trino. ({issue}`9052`)

9052

sopel39 commented 2 years ago
General
* Prevent failure when querying `system.materialized_views` and materialized view is dropped concurrently. ({issue}`9050`)

https://github.com/trinodb/trino/pull/9050

findepi commented 2 years ago
Iceberg
* Fix the mapping of nested fields between table metadata and Parquet file metadata. This 
  enables evolution of `row` typed columns for Iceberg tables stored in Parquet. ({issue}`9124`)

(wording borrowed from 352 notes for similar change for ORC) https://github.com/trinodb/trino/issues/8750 https://github.com/trinodb/trino/pull/9124

findepi commented 2 years ago
Hive
* Fix failure when writing Parquet files with Snappy compression on ARM64. ({issue}`9148`)

https://github.com/trinodb/trino/pull/9148

martint commented 2 years ago
* Add support for {func}`LISTAGG`. ({issue}`4835`)

https://github.com/trinodb/trino/issues/4835
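
For readers unfamiliar with the function, a small self-contained query along these lines shows the basic shape. The inline `VALUES` table and the column names are made up for illustration.

```sql
-- Concatenate the values of a group into one string, ordered within the group.
-- For the three rows below, the result is the single value a,b,c.
SELECT LISTAGG(value, ',') WITHIN GROUP (ORDER BY value) AS csv_value
FROM (VALUES 'a', 'c', 'b') AS t(value);
```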

ebyhr commented 2 years ago
BigQuery
* Support case insensitive name matching for BigQuery views. ({issue}`9164`)

9164

hashhar commented 2 years ago
Pinot
* Fix handling of `IN` and `NOT IN` clauses for `VARCHAR` columns in presence of aggregations. ({issue}`9133`)

9133, #9181

hashhar commented 2 years ago
Pinot
* Support pushdown of filters on `VARBINARY` columns to Pinot. ({issue}`9180`)

9180

kasiafi commented 2 years ago
General
* Fix planning of recursive queries when the recursion, the base plan, or the recursion step plan produce duplicate outputs.

https://github.com/trinodb/trino/pull/9153

sopel39 commented 2 years ago
General
* Split metrics that are reported in operator stats into `connectorMetrics` and `metrics`. ({issue}`9156`)

https://github.com/trinodb/trino/pull/9156

martint commented 2 years ago
Hive
* Fix performance regression when reading Parquet files that contain dictionaries. ({issue}`9161`)

https://github.com/trinodb/trino/pull/9161

Praveen2112 commented 2 years ago
Hive
* Allow reads on tables that are bucketed on an unsupported data type.

https://github.com/trinodb/trino/pull/8986

Praveen2112 commented 2 years ago
Hive
* Allow specifying the schemas for which a filter on a partition key must be present in queries against their tables. The schemas can be set with the
  `hive.query-partition-filter-required-schemas` config property or the `query_partition_filter_required_schemas` session property.

9106
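
A hedged sketch of the session-level form, assuming a Hive catalog named `hive` and schemas named `web` and `sales` (hypothetical). It also assumes that the session property takes an array of schema names and that the list only matters once the partition filter requirement itself is enabled; check both against the connector documentation before relying on them.

```sql
-- Assumption: the schema list is only consulted when the requirement is on.
SET SESSION hive.query_partition_filter_required = true;

-- Assumption: the property accepts an array of schema names.
SET SESSION hive.query_partition_filter_required_schemas = ARRAY['web', 'sales'];
```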

kasiafi commented 2 years ago
General
* Fix planning failure of INSERT statement when source table has hidden columns. ({issue}`9150`)

https://github.com/trinodb/trino/issues/9150, https://github.com/trinodb/trino/pull/9201

phd3 commented 2 years ago
Iceberg
* Fix failure for queries filtering on columns with array, map, or row type. ({issue}`8822`)

8822

hashhar commented 2 years ago
Pinot
* Fix failure for queries on `REAL` or `DOUBLE` columns having `+Infinity` or `-Infinity` values. ({issue}`9180`)

9180 (2nd and 3rd commits)

findepi commented 2 years ago
Hive
* Support reading Parquet timestamp stored with millisecond or microsecond precision as INT64 with logical type annotations when Hive timestamp precision is `NANOSECONDS`. ({issue}`9139`)
* Support reading Parquet timestamp stored as INT64 with nanosecond precision. ({issue}`9188`)
* Support writing Parquet timestamp stored as INT64 with nanosecond precision when experimental Parquet writer is enabled. 
  To use this, the Hive timestamp precision should be set to `NANOSECONDS`. ({issue}`9188`)

9139 #9188

kokosing commented 2 years ago
Hive
* Support loading of S3 security mappings from an HTTP or HTTPS URL. The `hive.s3.security-mapping.config-file` property can now point to either a local file or a URL. ({issue}`6210`)

6210

phd3 commented 2 years ago
General
* Fix failure when querying `optimizer_rule_stats` system table. ({issue}`8700`)

8700, #9237

losipiuk commented 2 years ago
Hive
* Fix incorrect table locking in Hive metastore when querying a transactional, non-partitioned table. ({issue}`9070`)

9070

losipiuk commented 2 years ago
Hive
 * Extend set of statistics reported by JMX to include metrics for calls made to the Glue statistics API. ({issue}`9100`)

9100

losipiuk commented 2 years ago
General
* Limit the number of metadata calls made for queries reading from `system.jdbc.columns` when a filter on the schema name is present. ({issue}`9214`)

9214

(should we mention Metadata API exposed by JDBC driver instead?)

findepi commented 2 years ago
Hive
* Fix `ArrayIndexOutOfBoundsException` when using the experimental Parquet writer. ({issue}`5518`)

Iceberg
* Fix `ArrayIndexOutOfBoundsException` when writing Parquet data. ({issue}`5518`)

5518 #9245

findepi commented 2 years ago

(should we mention Metadata API exposed by JDBC driver instead?)

@losipiuk yes

martint commented 2 years ago
* Fix incorrect results for aggregations that contain `DISTINCT` and `FILTER` clauses. ({issue}`9265`)

https://github.com/trinodb/trino/pull/9265

martint commented 2 years ago
* Improve performance for aggregations that contain a `DISTINCT` clause. ({issue}`9194`)

https://github.com/trinodb/trino/pull/9194

kasiafi commented 2 years ago
General
* Fix incorrect query results when row pattern involves back references. ({issue}`9109`)

https://github.com/trinodb/trino/pull/9109

phd3 commented 2 years ago
General
* Fix failure for queries that push projections into the connectors. ({issue}`6200`)

6200, #9185, #9222

losipiuk commented 2 years ago

(should we mention Metadata API exposed by JDBC driver instead?)

@losipiuk yes

https://github.com/trinodb/trino/pull/9216/files#r711073226

electrum commented 2 years ago
## Server RPM

* Change RPM architecture to `noarch` to allow installing on any machine type. ({issue}`9187`)

9187

electrum commented 2 years ago
## Web UI

* Add query error info to cluster overview page. ({issue}`8762`)

8762

electrum commented 2 years ago
## Hive Connector

* Fix reading Avro tables written with older Avro versions that had certain semi-invalid schemas. ({issue}`9114`)

## Iceberg Connector

* Fix query failures caused by errors reading certain Avro metadata files. ({issue}`9114`)

9114

electrum commented 2 years ago
## SPI

* Change `Connector.isSingleStatementWritesOnly()` to return `true` by default. ({issue}`8872`)

8872

findepi commented 2 years ago
Iceberg
* Add support for Iceberg `uuid` type. ({issue}`6663`)

6663 #8747
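
A minimal sketch of what this enables, assuming an Iceberg catalog named `iceberg` with a schema `demo`; the table and column names are hypothetical as well.

```sql
-- Create an Iceberg table with a uuid column.
CREATE TABLE iceberg.demo.events (
    id uuid,
    name varchar
);

-- uuid() generates a random UUID value for the new row.
INSERT INTO iceberg.demo.events VALUES (uuid(), 'first event');

SELECT id, name FROM iceberg.demo.events;
```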

losipiuk commented 2 years ago
## Hive Connector

* Fix possible `INSERT`/`UPDATE`/`ANALYZE` query failure when Glue metastore is in use and table statistics collection is enabled. ({issue}`9297`)

9297

martint commented 2 years ago
* Fix planning timeout for queries containing `IS NULL`, `AND` and `OR` predicates in the `WHERE` clause. ({issue}`9250`)
* Fix failure for queries containing `ORDER BY ... LIMIT` when columns in the subquery are known to be constant. ({issue}`9171`)

9250, #9171