Closed renovate[bot] closed 2 months ago
This PR contains the following updates:
50.0
52.0
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.
This PR contains the following updates:
50.0
->52.0
Release Notes
apache/arrow-rs (arrow)
### [`v52.2.0`](https://togithub.com/apache/arrow-rs/blob/HEAD/CHANGELOG.md#5220-2024-07-24) [Compare Source](https://togithub.com/apache/arrow-rs/compare/52.1.0...52.2.0) [Full Changelog](https://togithub.com/apache/arrow-rs/compare/52.1.0...52.2.0) **Implemented enhancements:** - Faster min/max for string/binary view arrays [#6088](https://togithub.com/apache/arrow-rs/issues/6088) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Support casting to/from Utf8View [#6076](https://togithub.com/apache/arrow-rs/issues/6076) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Min/max support for String/BinaryViewArray [#6052](https://togithub.com/apache/arrow-rs/issues/6052) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Improve performance of constructing `ByteView`s for small strings [#6034](https://togithub.com/apache/arrow-rs/issues/6034) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Fast UTF-8 validation when reading StringViewArray from Parquet [#5995](https://togithub.com/apache/arrow-rs/issues/5995) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - Optimize StringView row decoding [#5945](https://togithub.com/apache/arrow-rs/issues/5945) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Implementing `deduplicate` / `intern` functionality for StringView [#5910](https://togithub.com/apache/arrow-rs/issues/5910) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] - Add `FlightSqlServiceClient::new_from_inner` [#6003](https://togithub.com/apache/arrow-rs/pull/6003) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Complete `StringViewArray` and `BinaryViewArray` parquet decoder: [#6004](https://togithub.com/apache/arrow-rs/pull/6004) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add begin/end_transaction methods in FlightSqlServiceClient [#6026](https://togithub.com/apache/arrow-rs/pull/6026) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Read Parquet statistics as arrow `Arrays` [#6046](https://togithub.com/apache/arrow-rs/pull/6046) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([efredine](https://togithub.com/efredine)) **Fixed bugs:** - Panic in `ParquetMetadata::memory_size` if no min/max set [#6091](https://togithub.com/apache/arrow-rs/issues/6091) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - BinaryViewArray doesn't roundtrip a single `Some(&[])` through parquet [#6086](https://togithub.com/apache/arrow-rs/issues/6086) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] - Parquet `ColumnIndex` for null columns is written even when statistics are disabled [#6010](https://togithub.com/apache/arrow-rs/issues/6010) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] **Documentation updates:** - Fix typo in GenericByteViewArray documentation [#6054](https://togithub.com/apache/arrow-rs/pull/6054) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([progval](https://togithub.com/progval)) - Minor: Improve parquet PageIndex documentation [#6042](https://togithub.com/apache/arrow-rs/pull/6042) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) **Closed issues:** - Potential performance improvements for reading Parquet to StringViewArray/BinaryViewArray [#5904](https://togithub.com/apache/arrow-rs/issues/5904) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] **Merged pull requests:** - Faster `GenericByteView` construction [#6102](https://togithub.com/apache/arrow-rs/pull/6102) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add benchmark to track byte-view construction performance [#6101](https://togithub.com/apache/arrow-rs/pull/6101) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Optimize `bool_or` using `max_boolean` [#6100](https://togithub.com/apache/arrow-rs/pull/6100) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([simonvandel](https://togithub.com/simonvandel)) - Optimize `max_boolean` by operating on u64 chunks [#6098](https://togithub.com/apache/arrow-rs/pull/6098) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([simonvandel](https://togithub.com/simonvandel)) - fix panic in `ParquetMetadata::memory_size`: check has_min_max_set before invoking min()/max() [#6092](https://togithub.com/apache/arrow-rs/pull/6092) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([Fischer0522](https://togithub.com/Fischer0522)) - Implement specialized min/max for `GenericBinaryView` (`StringView` and `BinaryView`) [#6089](https://togithub.com/apache/arrow-rs/pull/6089) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add PartialEq to ParquetMetaData and FileMetadata [#6082](https://togithub.com/apache/arrow-rs/pull/6082) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([adriangb](https://togithub.com/adriangb)) - Enable casting from Utf8View [#6077](https://togithub.com/apache/arrow-rs/pull/6077) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([a10y](https://togithub.com/a10y)) - StringView support in arrow-csv [#6062](https://togithub.com/apache/arrow-rs/pull/6062) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([2010YOUY01](https://togithub.com/2010YOUY01)) - Implement min max support for string/binary view types [#6053](https://togithub.com/apache/arrow-rs/pull/6053) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Minor: clarify the relationship between `file::metadata` and `format` in docs [#6049](https://togithub.com/apache/arrow-rs/pull/6049) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) - Minor API adjustments for StringViewBuilder [#6047](https://togithub.com/apache/arrow-rs/pull/6047) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add parquet `StatisticsConverter` for arrow reader [#6046](https://togithub.com/apache/arrow-rs/pull/6046) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([efredine](https://togithub.com/efredine)) - Directly decode String/BinaryView types from arrow-row format [#6044](https://togithub.com/apache/arrow-rs/pull/6044) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Clean up unused code for view types in offset buffer [#6040](https://togithub.com/apache/arrow-rs/pull/6040) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Avoid using Buffer api that accidentally copies data [#6039](https://togithub.com/apache/arrow-rs/pull/6039) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - MINOR: Fix `hashbrown` version in `arrow-array`, remove from `arrow-row` [#6035](https://togithub.com/apache/arrow-rs/pull/6035) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([mbrobbel](https://togithub.com/mbrobbel)) - Improve performance reading `ByteViewArray` from parquet by removing an implicit copy [#6031](https://togithub.com/apache/arrow-rs/pull/6031) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add begin/end_transaction methods in FlightSqlServiceClient [#6026](https://togithub.com/apache/arrow-rs/pull/6026) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Unsafe improvements: core `parquet` crate. [#6024](https://togithub.com/apache/arrow-rs/pull/6024) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([veluca93](https://togithub.com/veluca93)) - Additional tests for parquet reader utf8 validation [#6023](https://togithub.com/apache/arrow-rs/pull/6023) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([alamb](https://togithub.com/alamb)) - Update zstd-sys requirement from >=2.0.0, <2.0.12 to >=2.0.0, <2.0.13 [#6019](https://togithub.com/apache/arrow-rs/pull/6019) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([dependabot\[bot\]](https://togithub.com/apps/dependabot)) - fix doc ci in latest rust nightly version [#6012](https://togithub.com/apache/arrow-rs/pull/6012) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([Rachelint](https://togithub.com/Rachelint)) - Do not write `ColumnIndex` for null columns when not writing page statistics [#6011](https://togithub.com/apache/arrow-rs/pull/6011) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([etseidl](https://togithub.com/etseidl)) - Fast utf8 validation when loading string view from parquet [#6009](https://togithub.com/apache/arrow-rs/pull/6009) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Deduplicate strings/binarys when building view types [#6005](https://togithub.com/apache/arrow-rs/pull/6005) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Complete `StringViewArray` and `BinaryViewArray` parquet decoder: implement delta byte array and delta length byte array encoding [#6004](https://togithub.com/apache/arrow-rs/pull/6004) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) - Add `FlightSqlServiceClient::new_from_inner` [#6003](https://togithub.com/apache/arrow-rs/pull/6003) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Rename `Schema::all_fields` to `flattened_fields` [#6001](https://togithub.com/apache/arrow-rs/pull/6001) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] \[[arrow-flight](https://togithub.com/apache/arrow-rs/labels/arrow-flight)] ([lewiszlw](https://togithub.com/lewiszlw)) - Refine documentation and examples for `DataType` [#5997](https://togithub.com/apache/arrow-rs/pull/5997) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([alamb](https://togithub.com/alamb)) - implement `DataType::try_form(&str)` [#5994](https://togithub.com/apache/arrow-rs/pull/5994) \[[arrow](https://togithub.com/apache/arrow-rs/labels/arrow)] ([samuelcolvin](https://togithub.com/samuelcolvin)) - Implement dictionary support for reading ByteView from parquet [#5973](https://togithub.com/apache/arrow-rs/pull/5973) \[[parquet](https://togithub.com/apache/arrow-rs/labels/parquet)] ([XiangpengHao](https://togithub.com/XiangpengHao)) \* *This Changelog was automatically generated by [github_changelog_generator](https://togithub.com/github-changelog-generator/github-changelog-generator)* ### [`v52.1.0`](https://togithub.com/apache/arrow-rs/compare/52.0.0...52.1.0) [Compare Source](https://togithub.com/apache/arrow-rs/compare/52.0.0...52.1.0) ### [`v52.0.0`](https://togithub.com/apache/arrow-rs/compare/51.0.0...52.0.0) [Compare Source](https://togithub.com/apache/arrow-rs/compare/51.0.0...52.0.0) ### [`v51.0.0`](https://togithub.com/apache/arrow-rs/compare/50.0.0...51.0.0) [Compare Source](https://togithub.com/apache/arrow-rs/compare/50.0.0...51.0.0)Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.