Open wanglinsong opened 5 days ago
~~@gupteaj Ajay, BEFORE syntax support for presto engine.
1. Release note
2. The documentation in iceberg.rst.
I see they are added.
@tanjialiang Somehow the release note for [[native] Remove spill stats reporting from presto native(https://github.com/prestodb/presto/pull/22751#top)#22751 was not picked up. But anyways your original release note might be too long. I 'd like to use the following:
Replace the presto_cpp.spill_metrics_name stats to velox.spill_metrics_name.
Do you think this is correct?
@tangjiangling Can you please confirm and complete the following release note for [native] Migrate away stats reporting for allocator and cache ? Or you can provide a release note that follows https://github.com/prestodb/presto/wiki/Release-Notes-Guidelines.
Move the stats of memory allocator, in-memory cache and ssd cache from presto_cpp namespace to velox. For example, presto_cpp.memoryCacheStats.numEntries is now velox.memoryCacheStats.numEntries. The affected metrics are: ....
@kewang1024 Can you please add a release note for "Fix hash calculation for Timestamp "? Thanks!
Added
@konjac-h Is there notable performance improvement to CTE selections? If yes, can you please provide a release note? Does the following sound good?
Improve the performance of reading common table expressions (CTE).
confirmed
@konjac-h Is there notable performance improvement to CTE selections? If yes, can you please provide a release note? Does the following sound good?
Improve the performance of reading common table expressions (CTE).
The fix reduces the unnecessary call to metastore API and improves the performance. Yes, the release note looks good to me.
@yingsu00 please add review in the file directly, so that I know the location to update.
@yingsu00 please add review in the file directly, so that I know the location to update.
I want to get the author's confirmation first. Then will add review.
@ClarenceThreepwood The release note for Support for NOT NULL column constraints needs a bit touch up. Does it mean "NOT NULL" is not supported in the DDLs in general, but only take effects for Hive connector? What about Iceberg connector or Hudi? Could you please refine the release note to be more precise and easier to understand by the users?
@ClarenceThreepwood The release note for Support for NOT NULL column constraints needs a bit touch up. Does it mean "NOT NULL" is not supported in the DDLs in general, but only take effects for Hive connector? What about Iceberg connector or Hudi? Could you please refine the release note to be more precise and easier to understand by the users?
It is not supported in iceberg or hudi yet. You can only use the vanilla hive connector to create constraints in HMS. I tweaked the note slightly to reflect this - let me know if you have a better rewrite in mind
@xiaoxmeng Can you please confirm if the following release notes are correct? I made some changes to your PR message, but they still need to be clearer. For example you used "the minimum amount of memory", does it mean this amount is guaranteed to be reserved at least and it could be more? Would it be more precise just to say is this the total amount of memory reserved for xxx? You mentioned "A query can only allocate from the reserved query memory if its current capacity is less than the minimal memory capacity as specified by 'memory-pool-reserved-capacity'. Does it mean query-reserved-memory-gb has to be less than memory-pool-reserved-capacity?
Add two system configuration properties to specify the reserved query memory capacity on native clusters:
``query-reserved-memory-gb`` is the total amount of memory in gigabytes reserved for a
query. ``memory-pool-reserved-capacity`` is the total amount of memory in bytes reserved for each query memory pool.
cc @steveburnett
@yingsu00 can you please apply these suggestions to Linsong's branch? You can push directly to this branch.
Missing Release Notes
Ajay Gupte
Amit Dutta
Christian Zentgraf
Deepak Majeti
Emanuel F
Jialiang Tan
Jimmy Lu
Karteekmurthys
Ke
Konjac Huang
Linsong Wang
Sreeni Viswanadha
Vivek
Zac Blanco
jackychen718
wypb
xiaoxmeng
Extracted Release Notes
21416 (Author: aditi-pandit): [native] SystemConnector to query system.runtime.tasks table
21622 (Author: Jalpreet Singh Nanda (:imjalpreet)): Introduce AWS Security Mapping
22386 (Author: Rebecca Schlussel): Implement new NaN behavior
use-new-nan-definition
tofalse
. This configuration property is intended to be temporary to ease migration in the short term, and will be removed in a future release.22417 (Author: kiersten-stokes): Add support for REST catalog in Iceberg connector
22417
.22417
.22492 (Author: Wills Feng): Support forwarded https
http-server.authentication.allow-forwarded-https
configuration property to recognize X-Forwarded-Proto header, :pr:22492
.22538 (Author: Rebecca Schlussel): Fix no value present scheduling failure with grouped execution
22539 (Author: Feilong Liu): Fix analyzer for lambda in aggregation
22554 (Author: wangd): [Iceberg]Support metadata delete with predicate on non-identity partition columns
22556 (Author: Nidhin Varghese): Add Support for non default keystore and truststore type in presto CLI and JDBC
22562 (Author: Andrii Rosa): Make number of preferred nodes configurable
node-scheduler.max-preferred-nodes
configuration property to allow changing number of preferred nodes when soft affinity scheduling is enabled. :pr:22562
.22563 (Author: Andrii Rosa): Allow different scheduling affinity for different sections of a file
hive.affinity-scheduling-file-section-size
configuration property andaffinity_scheduling_file_section_size
session property. The default file size is 256MB. :pr:22563
.22606 (Author: Feilong Liu): Add HBO support for CTE materialization
22606
.22609 (Author: wangd): Support procedure expire_snapshots for iceberg
expire_snapshots
to remove old snapshots in Iceberg. :pr:22609
.22611 (Author: 8dukongjian): Fix correctness issue in regr_r2
22618 (Author: wangd): Revert "Revert "Preserve case for RowType's field name and JSON content when
CAST
""22620 (Author: Steve Burnett): [docs] Add prestissimo/prestissimo-properties.rst
22620
.22652 (Author: Kevin Wilfong): array_join adds an extra delimeter if the last element is null
22652
.22661 (Author: Feilong Liu): Revert "Add histograms for optimizer cost calculation"
22665 (Author: Steve Burnett): [docs] Add PR number to the release notes entry
22665
.22681 (Author: Feilong Liu): Broadcast join if build estimation is small and from HBO
property-use_broadcast_when_buildsize_small_probeside_unknown
to do broadcast join when probe side size is unknown and build side estimation from HBO is small.22700 (Author: jaystarshot): Fix cte filter pushdown wrong results by splitting SpecialFormExpressions
22700
.22715 (Author: Jonathan Hehir): Documented noisy aggregate functions in new page
/functions/noisy
, including :func:noisy_approx_distinct_sfm
and :func:noisy_approx_set_sfm
(:pr:21290
, :pr:22715
).22717 (Author: Steve Burnett): [docs] Restructure and expand presto_cpp docs
22717
.22733 (Author: Zac Blanco): Support format arguments in EXPLAIN ANALYZE
/sql/explain-analyze
statement to support aformat
argument with values of<TEXT|JSON>
:pr:22733
.22737 (Author: aditi-pandit): [Native] Add support for bucketed (but not partitioned) tables
22737
.22753 (Author: wangd): [Iceberg] Refine the partition specs that really need to be checked
22753
.22762 (Author: kiersten-stokes): Update
io.jsonwebtoken
artifacts for Java 11 compatibility22762
.22764 (Author: Swapnil Tailor): Add support for memoization to resource group state endpoint
cluster-resource-group-state-info-expiration-duration
to a non-zero duration. :pr:22764
.22765 (Author: Feilong Liu): Log detail name for optimizers of RowExpressionRewriteRuleSet and StatsRecordingPlanOptimizer
22765
.22769 (Author: Feilong Liu): Record estimation stats during query optimization
22769
.22783 (Author: Ge Gao): Support functionc call rewrite by Presto Verifier
22799 (Author: Fazal Majid): added docs for noisy_approx_set_sfm_from_index_and_zeros
noisy_approx_set_sfm_from_index_and_zeros
.22806 (Author: Linsong Wang): update to build native docker image
22806
.22809 (Author: Andrii Rosa): Allow configuring driver threads based on the number of cores
task.max-worker-threads
configuration property to<multiplier>C
. For example, setting the property to2C
configures the worker thread pool to create up to twice as many threads as there are cores available on a machine. :pr:22809
.22851 (Author: Ajay Gupte): Connector changes for time travel BEFORE clause.
BEFORE
syntax for Iceberg tables to return historical data :pr:22851
.VERSION (SYSTEM_VERSION)
syntax to include snapshot id using bigint data type :pr:22851
.TIMESTAMP (SYSTEM_TIME)
syntax to include timestamp-with-time-zone data type :pr:22851
.22853 (Author: Feilong Liu): Skip hbo stats recording for nodes with dynamic filter
22853
.22885 (Author: Steve Burnett): [docs] Add Presto C++ config properties doc
/presto_cpp/properties
documentation :pr:22885
.22888 (Author: Rebecca Schlussel): Remove deprecated feature group-by-uses-equal
deprecated.group-by-uses-equal
, which allowed group by to use equal to rather than distinct semantics.22917 (Author: Rebecca Schlussel): Throw exception for cast of nan and infinity to int types
NUMERIC_VALUE_OUT_OF_RANGE
toINVALID_CAST_ARGUMENT
.22918 (Author: Steve Burnett): [docs] Improve README.md and CONTRIBUTING.md
22918
.22926 (Author: Denodo Research Labs): Disable timestamp with time zone in create, alter and insert statements
22926
.22958 (Author: Zac Wen): Upgrade Alluxio to 313
22960 (Author: Nikhil Collooru): Propagate runtime stats to page source provider
ConnectorPageSourceProvider
.All Commits
expire_snapshots
for iceberg (wangd)...iceberg.procedure
(wangd)CAST
"" (wangd)CAST
" (Neerad Somanchi)