Skip to content

Update arrow-rs to v58 - autoclosed#7971

Closed
renovate[bot] wants to merge 1 commit into
developfrom
renovate/major-58-arrow-rs
Closed

Update arrow-rs to v58 - autoclosed#7971
renovate[bot] wants to merge 1 commit into
developfrom
renovate/major-58-arrow-rs

Conversation

@renovate
Copy link
Copy Markdown
Contributor

@renovate renovate Bot commented May 18, 2026

ℹ️ Note

This PR body was truncated due to platform limits.

This PR contains the following updates:

Package Type Update Change
arrow-cast dependencies major 5758
parquet dependencies major 5758

Warning

Some dependencies could not be looked up. Check the Dependency Dashboard for more information.


Release Notes

apache/arrow-rs (arrow-cast)

v58.3.0

Compare Source

Full Changelog

Implemented enhancements:

  • Add DatePart::from_str API #​9930 [arrow]
  • should use DictionaryArray::with_values instead of try_new on the dictionary fast path #​9889 [arrow]
  • [arrow-string] add concat_elements for BinaryViewArray and FixedSizeBinary #​9875 [arrow]
  • Expose eq ignore ascii case from arrow-string #​9870 [arrow]
  • Configurable data page v2 compression threshold #​9827 [parquet]

Fixed bugs:

  • [arrow-cast] incorrect Time32 -> Time64 conversion #​9851 [arrow]
  • Panic when reading malformed compact-Thrift bool fields in Parquet page metadata #​9839 [parquet]
  • Parquet DeltaBitPackDecoder::skip could panic on "non-standard" miniblocks #​9793 [parquet]

Documentation updates:

Closed issues:

  • GenericByteDictionaryBuilder::with_capacity does not pre-size dedup HashTable #​9907 [arrow]
  • [arrow-buffer] Integer overflow in repeat_slice_n_times leads to undefined behavior #​9904 [arrow]
  • [arrow-buffer] Integer overflow in BitChunks::new leads to undefined behavior #​9903 [arrow]
  • [arrow-row] Integer overflow in Rows::row index handling leads to undefined behavior #​9901 [arrow]
  • [arrow-data] Integer overflow in ArrayData validation leads to undefined behavior #​9900 [arrow]
  • [arrow-data] Integer overflow in ArrayData::slice leads to undefined behavior #​9899 [arrow]
  • [arrow-array] Integer overflow in FixedSizeBinaryArray::value leads to undefined behavior #​9898 [arrow]
  • [arrow-buffer] Integer overflow in BufferBuilder::reserve leads to undefined behavior #​9897 [arrow]
  • arrow-csv: integer overflow panic in Reader::records::flush #​9885 [arrow]
  • Make an API to help with the pattern of 'replaces the values of the REE array' #​9854 [arrow]
  • Parquet reader rejects canonical UNKNOWN logical type on BOOLEAN physical columns #​9844 [parquet]
  • ColumnIndex length mismatch can cause panic during decoding in Parquet #​9832 [parquet]
  • Bug converting json to fixed list of zero size #​9780 [arrow]

Merged pull requests:

* This Changelog was automatically generated by github_changelog_generator

v58.2.0: arrow 58.2.0

Compare Source

Changelog
58.2.0 (2026-04-28)

Full Changelog

Implemented enhancements:

  • Expose ColumnCloseResult on ArrowColumnChunk #​9774 [parquet]
  • Expose FFI data structures fields #​9771 [arrow]
  • short-circuit last predicate in RowFilter when with_limit(N) is set #​9765 [parquet]
  • vectorise dict-index bounds check #​9747 [parquet]
  • Refactor RleEncoder::flush_bit_packed_run #​9734 [parquet]
  • Add benchmark for cast from/to decimals #​9728 [arrow]
  • Add a security policy for arrow-rs #​9727 [parquet] [arrow] [arrow-flight]
  • Support FixedSizeList in arrow-json reader #​9714 [arrow]
  • [Variant] Add VariantArrayBuilder::append_nulls API #​9684
  • [Json] RunEndEncoded decoder optimization #​9645 [arrow]
  • [Variant] variant_get(..., List<_>) non-Struct types support #​9615
  • [Variant] Add unshredded Struct fast-path for variant_get(..., Struct) #​9596
  • Allow setting custom line terminator for CSV writer #​9571 [arrow]
  • [Variant] Align cast logic for variant_get to cast kernel for numeric/bool types #​9564 [arrow]
  • ci: use ubuntu-slim where applicable #​9536
  • Publicly export arrow_string::Predicate and its methods? #​9480
  • Don't create CompressionContext when no compression is selected [IPC] #​9463 [arrow]
  • Parquet: Raw level buffering causes unbounded memory growth for sparse columns #​9446 [parquet]
  • Parallel Parquet Reading #​9381 [parquet]

Fixed bugs:

  • [Variant] unshred_variant panics on malformed bytes despite returning Result #​9740
  • RecordBatch::normalize() does not propagate top level null bitmap into the results #​9732 [arrow]
  • Incorrect accounting in DictEncoder::estimated_memory_size #​9719 [parquet]
  • arrow-ipc writer does not comply with spec for empty variable-size arrays #​9716 [arrow]
  • Panic when reading corrupt parquet file with truncated data instead of ParquetError #​9705 [parquet]
  • NOTICE.txt is inaccurate #​9703 [arrow]
  • Unnecessary dependency on regex crate #​9672
  • [arrow-avro] Avro reader produces incorrect results when reader schema and writer schema differ #​9655 [arrow]
  • parquet docs are broken on docs.rs #​9649
  • [Parquet] ArrowWriter with CDC panics on nested ListArrays #​9637 [parquet] [arrow] [arrow-flight]
  • Use release KEYS file for verification instead of dev KEYS #​9603
  • IPC reader: handling of dictionaries with only null values #​9595 [arrow]
  • Parquet RleDecoder::get_batch_with_dict panics on oob dictionary indices #​9434 [parquet]

Documentation updates:

Performance improvements:

Closed issues:

  • Incorrect buffer skipping for V4 Union types in IPC skip_field #​9828 [arrow]
  • Replace wildcard match in skip_field with explicit DataType handling #​9821 [arrow]
  • Column projection misalignment for ListView / LargeListView in IPC reader #​9805 [arrow]
  • Avoid panic on malformed compressed buffer prefix in IPC #​9801 [arrow]
  • DeltaByteArrayDecoder panics on invalid prefix lengths #​9796 [parquet]
  • Use NullBufferBuilder when reading json #​9781 [arrow]
  • Perfectly shredded arrays with top-level null values loss nullability when typed_value is extracted #​9701
  • [Parquet Metadata] API to determine page-index presence separately from page-index load #​9693
  • Union cast is incorrect for duplicate field names #​9664 [arrow]
  • List and ListView are missing take benchmarks #​9627 [arrow]
  • Support RunEndEncoded arrays in comparison kernels (eq, lt, etc.) #​9620 [arrow]
  • variant_get should follow JSONpath semantics #​9606
  • GenericByteViewArray: support finding total length of all strings #​9435 [arrow]

Merged pull requests:

* This Changelog was automatically generated by github_changelog_generator

v58.1.0: arrow 58.1.0

Compare Source

Changelog
58.1.0 (2026-03-20)

Full Changelog

Implemented enhancements:

  • Reuse compression dict lz4_block #​9566
  • [Variant] Add variant_to_arrow Struct type support #​9529
  • [Variant] Add unshred_variant support for Binary and LargeBinary types #​9526
  • [Variant] Add shred_variant support for LargeUtf8 and LargeBinary types #​9525
  • [Variant] variant_get tests clean up #​9517
  • parquet_variant: Support LargeUtf8 typed value in unshred_variant #​9513
  • parquet-variant: Support string view typed value in unshred_variant #​9512
  • Deprecate ArrowTimestampType::make_value in favor of from_naive_datetime #​9490 [arrow]
  • Followup for support ['fieldName'] in VariantPath #​9478
  • Speedup DELTA_BINARY_PACKED decoding when bitwidth is 0 #​9476 [parquet]
  • Support CSV files encoded with charsets other than UTF-8 #​9465 [arrow]
  • Expose Avro writer schema when building the reader #​9460 [arrow]
  • Python: avoid importing pyarrow classes ever time #​9438
  • Add append_nulls to MapBuilder #​9431 [arrow]
  • Add append_non_nulls to StructBuilder #​9429 [arrow]
  • Add append_value_n to GenericByteBuilder #​9425 [arrow]
  • Optimize from_bitwise_binary_op #​9378 [arrow]
  • Configurable Arrow representation of UTC timestamps for Avro reader #​9279 [arrow]

Fixed bugs:

  • MutableArrayData::extend does not copy child values for ListView arrays #​9561 [arrow]
  • ListView interleave bug #​9559 [arrow]
  • Flight encoding panics with "no dict id for field" with nested dict arrays #​9555 [arrow] [arrow-flight]
  • "DeltaBitPackDecoder only supports Int32Type and Int64Type" but unsigned types are supported too #​9551 [parquet]
  • Potential overflow when calling util::bit_mask::set_bits (soundness issue) #​9543 [arrow]
  • handle Null type in try_merge for Struct, List, LargeList, and Union #​9523 [arrow]
  • Invalid offset in sparse column chunk data for multiple predicates #​9516 [parquet]
  • debug_assert_eq! in BatchCoalescer panics in debug mode when batch_size < 4 #​9506 [arrow]
  • Parquet Statistics::null_count_opt wrongly returns Some(0) when stats are missing #​9451 [parquet]
  • Error "Not all children array length are the same!" when decoding rows spanning across page boundaries in parquet file when using RowSelection #​9370 [parquet]
  • Avro schema resolution not properly supported for complex types #​9336 [arrow]

Documentation updates:

Performance improvements:

Closed issues:

  • Duplicate macro definition: partially_shredded_variant_array_gen #​9492
  • Enable LargeList / ListView / LargeListView for VariantArray::try_new [#&

Note

PR body was truncated to here.


Configuration

📅 Schedule: (UTC)

  • Branch creation
    • Between 12:00 AM and 03:59 AM, only on Monday (* 0-3 * * 1)
  • Automerge
    • At any time (no schedule defined)

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about these updates again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@renovate renovate Bot added the changelog/chore A trivial change label May 18, 2026
@codspeed-hq
Copy link
Copy Markdown

codspeed-hq Bot commented May 18, 2026

Merging this PR will improve performance by 15.01%

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚠️ Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 5 improved benchmarks
✅ 1216 untouched benchmarks

Performance Changes

Mode Benchmark BASE HEAD Efficiency
Simulation chunked_varbinview_canonical_into[(1000, 10)] 198.2 µs 162 µs +22.34%
Simulation chunked_varbinview_into_canonical[(100, 100)] 358.4 µs 325.4 µs +10.13%
Simulation new_bp_prim_test_between[i16, 32768] 132.2 µs 120 µs +10.21%
Simulation new_alp_prim_test_between[f32, 16384] 118.1 µs 103.6 µs +13.99%
Simulation new_alp_prim_test_between[f32, 32768] 181.9 µs 153 µs +18.88%

Tip

Curious why this is faster? Comment @codspeedbot explain why this is faster on this PR, or directly use the CodSpeed MCP with your agent.


Comparing renovate/major-58-arrow-rs (cbb40a5) with develop (3c9f0e2)

Open in CodSpeed

@renovate renovate Bot force-pushed the renovate/major-58-arrow-rs branch from fcfb9e8 to ef98e4e Compare May 18, 2026 09:42
@renovate renovate Bot force-pushed the renovate/major-58-arrow-rs branch from ef98e4e to cbb40a5 Compare May 18, 2026 10:29
@renovate renovate Bot changed the title Update arrow-rs to v58 Update arrow-rs to v58 - autoclosed May 18, 2026
@renovate renovate Bot closed this May 18, 2026
@renovate renovate Bot deleted the renovate/major-58-arrow-rs branch May 18, 2026 14:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

changelog/chore A trivial change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants