diff --git a/CHANGELOG.md b/CHANGELOG.md index 6101f5d3cac25..5774601dab83b 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,4 +1,216 @@ +# Apache Arrow 19.0.0 (2025-01-10) + +## New Features and Improvements + +* [GH-23995](https://github.com/apache/arrow/issues/23995) - [C#] Make PrimitiveArrayBuilder constructor public (#44596) +* [GH-27919](https://github.com/apache/arrow/issues/27919) - [CI][C++] Add a nightly job to test offline build (#44721) +* [GH-32206](https://github.com/apache/arrow/issues/32206) - [C++] GcsFileSystem::Make should return Result (#44503) +* [GH-35589](https://github.com/apache/arrow/issues/35589) - [Ruby] Add support or JRuby (#44346) +* [GH-36954](https://github.com/apache/arrow/issues/36954) - [Python] Add more FlightInfo / FlightEndpoint attributes (#43537) +* [GH-38837](https://github.com/apache/arrow/issues/38837) - [Format] Add the specification for statistics schema (#45058) +* [GH-39212](https://github.com/apache/arrow/issues/39212) - [Release] Retry download binary subprocess in case of OpenSSL error (#39213) +* [GH-40592](https://github.com/apache/arrow/issues/40592) - [C++][Parquet] Implement SizeStatistics (#40594) +* [GH-40713](https://github.com/apache/arrow/issues/40713) - [Docs] Trim down ADBC page (#44313) +* [GH-41141](https://github.com/apache/arrow/issues/41141) - [C++] Reduce string inlining in Substrait serde (#45174) +* [GH-41706](https://github.com/apache/arrow/issues/41706) - [C++][Acero] Enhance asof_join to work in multi-threaded execution by sequencing input (#44083) +* [GH-43080](https://github.com/apache/arrow/issues/43080) - [CI][Dev] Enable shellcheck (#44724) +* [GH-43410](https://github.com/apache/arrow/issues/43410) - [Python] Support Arrow PyCapsule stream objects in write_dataset (#43771) +* [GH-43535](https://github.com/apache/arrow/issues/43535) - [C++] Support the AWS S3 SSE-C encryption (#43601) +* [GH-43547](https://github.com/apache/arrow/issues/43547) - [R][CI] Add recheck workflow for checking reverse dependencies on GHA (#43784) +* [GH-43570](https://github.com/apache/arrow/issues/43570) - [CI][Dev][Docs] Update references to "docker-compose" with "docker compose" (#43575) +* [GH-43598](https://github.com/apache/arrow/issues/43598) - [C++][Parquet] Parquet Metadata Printer supports print sort-columns (#43599) +* [GH-43631](https://github.com/apache/arrow/issues/43631) - [C++] Add C++ implementation of Async C Data Interface (#44495) +* [GH-43631](https://github.com/apache/arrow/issues/43631) - [C][Format] Add ArrowAsyncDeviceStreamHandler interface (#43632) +* [GH-43683](https://github.com/apache/arrow/issues/43683) - [Python] Use pandas StringDtype when enabled (pandas 3+) (#44195) +* [GH-43693](https://github.com/apache/arrow/issues/43693) - [C++][Acero] Support AVX2 swiss join decoding (#43832) +* [GH-43808](https://github.com/apache/arrow/issues/43808) - [C++] skip `-0117` in StrptimeZoneOffset for old glibc (#44621) +* [GH-43951](https://github.com/apache/arrow/issues/43951) - [CI][Python] Use GitHub Packages for vcpkg cache (#44644) +* [GH-44010](https://github.com/apache/arrow/issues/44010) - [C++] Add `arrow::RecordBatch::MakeStatisticsArray()` (#44252) +* [GH-44018](https://github.com/apache/arrow/issues/44018) - [R] Treat builds on R-universe as like `NOT_CRAN=true` (#44476) +* [GH-44065](https://github.com/apache/arrow/issues/44065) - [Java] Implement C Data Interface for RunEndEncodedVector (#44241) +* [GH-44066](https://github.com/apache/arrow/issues/44066) - [Python] Add Python wrapper for JsonExtensionType (#44070) +* [GH-44084](https://github.com/apache/arrow/issues/44084) - [C++] Improve merge step in chunked sorting (#44217) +* [GH-44101](https://github.com/apache/arrow/issues/44101) - [C++][Parquet] Tools: Debug Print for Json should be valid JSON (#44532) +* [GH-44223](https://github.com/apache/arrow/issues/44223) - [Dev] Use "Gandiva" instead of "C++ - Gandiva" label (#44722) +* [GH-44308](https://github.com/apache/arrow/issues/44308) - [C++][FS][Azure] Implement SAS token authentication (#45021) +* [GH-44364](https://github.com/apache/arrow/issues/44364) - [C++] Don't export template class (#44365) +* [GH-44392](https://github.com/apache/arrow/issues/44392) - [GLib] Add GArrowDecimal32DataType (#44580) +* [GH-44404](https://github.com/apache/arrow/issues/44404) - [CI] Stop running CI "push" jobs for Dependabot (#44405) +* [GH-44416](https://github.com/apache/arrow/issues/44416) - [C++][Docs] Update the URL to C++ Development in README.md (#44427) +* [GH-44444](https://github.com/apache/arrow/issues/44444) - [Java][CI] Add Java implementation of Flight do_exchange integration test (#44445) +* [GH-44464](https://github.com/apache/arrow/issues/44464) - [C++] Added rvalue-reference-qualified overload for arrow::Result::status() returning value instead of reference (#44477) +* [GH-44474](https://github.com/apache/arrow/issues/44474) - [Website][Docs] Improve project description in more places (#44522) +* [GH-44474](https://github.com/apache/arrow/issues/44474) - [Website] Improve project description (#44492) +* [GH-44480](https://github.com/apache/arrow/issues/44480) - [Release][Packaging] Use `--platform` explicitly for `docker run` (#44481) +* [GH-44491](https://github.com/apache/arrow/issues/44491) - [C++] StatusConstant- cheaply copied const Status (#44493) +* [GH-44518](https://github.com/apache/arrow/issues/44518) - [R] Update NEWS for 18.0.0 (#44520) +* [GH-44528](https://github.com/apache/arrow/issues/44528) - [Dev] Introduce a new feature: specific language reformatting (#44529) +* [GH-44555](https://github.com/apache/arrow/issues/44555) - [C++][Compute] Allow casting struct to bigger nullable struct (#44587) +* [GH-44556](https://github.com/apache/arrow/issues/44556) - [Release][MSYS2] Update python-pyarrow too (#44557) +* [GH-44558](https://github.com/apache/arrow/issues/44558) - [Release][Website] Remove needless "Apache Arrow ${VERSION}" section (#44559) +* [GH-44569](https://github.com/apache/arrow/issues/44569) - [GLib] Add GArrowDecimal64DataType (#44571) +* [GH-44575](https://github.com/apache/arrow/issues/44575) - [C#] Replace LINQ expression with for loop (#44576) +* [GH-44578](https://github.com/apache/arrow/issues/44578) - [Release][Packaging] Verify wheel version (#44593) +* [GH-44579](https://github.com/apache/arrow/issues/44579) - [C++] Use array type to compute min/max statistics Arrow type (#45094) +* [GH-44581](https://github.com/apache/arrow/issues/44581) - [C++] Minor: ArrayData ctor can assign null_count directly (#44582) +* [GH-44588](https://github.com/apache/arrow/issues/44588) - [GLib] Add GArrowDecimal64 Class (#44591) +* [GH-44589](https://github.com/apache/arrow/issues/44589) - [GLib] Add GArrowDecimal32 class (#44597) +* [GH-44590](https://github.com/apache/arrow/issues/44590) - [C++] Add `const` and `&` to `arrow::Array::statistics()` return type (#44592) +* [GH-44603](https://github.com/apache/arrow/issues/44603) - [GLib] Add GArrowDecimal64Array and GArrowDecimal64ArrayBuilder (#44605) +* [GH-44604](https://github.com/apache/arrow/issues/44604) - [GLib] Add Decimal32Array (#44617) +* [GH-44614](https://github.com/apache/arrow/issues/44614) - [Python][C++] Add version suffix to libarrow_python* libraries (#44702) +* [GH-44618](https://github.com/apache/arrow/issues/44618) - [GLib] Add GArrowDecimal64Scalar (#44620) +* [GH-44619](https://github.com/apache/arrow/issues/44619) - [GLib] Add GArrowDecimal32Scalar (#44628) +* [GH-44648](https://github.com/apache/arrow/issues/44648) - [CI] Remove autotune and rebase from commentbot (#44649) +* [GH-44656](https://github.com/apache/arrow/issues/44656) - [GLib] Add GArrowBinaryViewDataType (#44659) +* [GH-44667](https://github.com/apache/arrow/issues/44667) - [Archery] Suppress pull/push progress logs (#44669) +* [GH-44686](https://github.com/apache/arrow/issues/44686) - [GLib] Add GArrowStringViewDataType (#44687) +* [GH-44690](https://github.com/apache/arrow/issues/44690) - [C++] NumericBuilder::AppendValues append vector prevent from ub (#44794) +* [GH-44700](https://github.com/apache/arrow/issues/44700) - [C++][Parquet] Remove obsolete parquet_constants generated files from old thrift (#44772) +* [GH-44703](https://github.com/apache/arrow/issues/44703) - [CI][MATLAB][Packaging] Update MATLAB CI and `crossbow` packaging workflows to build against MATLAB `R2024b` (#44704) +* [GH-44705](https://github.com/apache/arrow/issues/44705) - [MATLAB][PACKAGING] Update the crossbow MATLAB workflow to use `macOS 13` instead of `macOS 12` (#44858) +* [GH-44710](https://github.com/apache/arrow/issues/44710) - [Docs][C++] Add `arrow::ArrayStatistics` to API doc (#44764) +* [GH-44713](https://github.com/apache/arrow/issues/44713) - [Python] Add support for Decimal32 and Decimal64 types (#44882) +* [GH-44744](https://github.com/apache/arrow/issues/44744) - [C++] Upgrade ORC to 2.0.3 (#44745) +* [GH-44749](https://github.com/apache/arrow/issues/44749) - [CI][Dev] Apply ShellCheck lint to ci/scripts/c_glib_test.sh (#44750) +* [GH-44770](https://github.com/apache/arrow/issues/44770) - [Java] Update minor protobuf version to avoid CVE-2024-7254 (#44775) +* [GH-44784](https://github.com/apache/arrow/issues/44784) - [C++][Parquet] Add `arrow::Result` version of `parquet::arrow::OpenFile()` (#44785) +* [GH-44788](https://github.com/apache/arrow/issues/44788) - [C++] Fix a couple of maybe-uninitialized warnings (#44789) +* [GH-44795](https://github.com/apache/arrow/issues/44795) - [C++] Use arrow::util::span on arrow::util::bitmap_builders_utilities instead of std::vector (#44796) +* [GH-44805](https://github.com/apache/arrow/issues/44805) - [Dev] Add `.editorconfig` file (#44870) +* [GH-44808](https://github.com/apache/arrow/issues/44808) - [C++][Parquet] Add `arrow::Result` version of `parquet::arrow::FileReader::GetRecordBatchReader()` (#44809) +* [GH-44811](https://github.com/apache/arrow/issues/44811) - [C++] minor optimize cancel and thread pool (#44812) +* [GH-44815](https://github.com/apache/arrow/issues/44815) - [C++][Parquet] Add an example to dump statistics read as `arrow::ArrayStatistics` (#44816) +* [GH-44828](https://github.com/apache/arrow/issues/44828) - [Java][Release] Remove Java from release scripts (#44854) +* [GH-44829](https://github.com/apache/arrow/issues/44829) - [Java][CI] Remove Java related test CI except integration test (#44946) +* [GH-44830](https://github.com/apache/arrow/issues/44830) - [Java][CI] Disable Java Dependabot (#44832) +* [GH-44831](https://github.com/apache/arrow/issues/44831) - [CI] Build Java for integration tests from arrow-java (#44932) +* [GH-44843](https://github.com/apache/arrow/issues/44843) - [CI][Doc] Remove building Java documentation from this repository (#45000) +* [GH-44878](https://github.com/apache/arrow/issues/44878) - [Archery] Only suppress Docker progress logs when running on CI (#44865) +* [GH-44903](https://github.com/apache/arrow/issues/44903) - [C++] Add the Expm1(exponent) scalar arithmetic function (#44904) +* [GH-44915](https://github.com/apache/arrow/issues/44915) - [C++] Add WithinUlp testing functions (#44906) +* [GH-44922](https://github.com/apache/arrow/issues/44922) - [MATLAB] Add IPC `RecordBatchStreamFileWriter` MATLAB class (#44925) +* [GH-44923](https://github.com/apache/arrow/issues/44923) - [MATLAB] Add IPC `RecordBatchStreamReader` MATLAB class (#45068) +* [GH-44934](https://github.com/apache/arrow/issues/44934) - [Dev] Remove JIRA remnants (#44936) +* [GH-44948](https://github.com/apache/arrow/issues/44948) - [Dev][Archery] Remove JIRA remnants from Archery (#45091) +* [GH-44952](https://github.com/apache/arrow/issues/44952) - [C++][Python] Add Hyperbolic Trig functions (#44630) +* [GH-44962](https://github.com/apache/arrow/issues/44962) - [Python] Clean-up name / field_name handling in pandas compat (#44963) +* [GH-44976](https://github.com/apache/arrow/issues/44976) - [C++] Enable mimalloc by default, disable jemalloc by default and more (#44951) +* [GH-44982](https://github.com/apache/arrow/issues/44982) - [C++] Add support for building system OpenTelemetry (#44983) +* [GH-44994](https://github.com/apache/arrow/issues/44994) - [C++][CMake] Use librt only for Linux (#44984) +* [GH-45005](https://github.com/apache/arrow/issues/45005) - [C++] Support for fixed-size list in conversion of range tuple (#45008) +* [GH-45015](https://github.com/apache/arrow/issues/45015) - [C++][Parquet] Allow configuring the default footer read size (#45016) +* [GH-45047](https://github.com/apache/arrow/issues/45047) - [CI][Python][Packaging] Test 3.12 wheels on Ubuntu 24.04 (#45042) +* [GH-45050](https://github.com/apache/arrow/issues/45050) - [CI][Dev] Apply ShellCheck lint to c_glib/test/run-test.sh (#45052) +* [GH-45075](https://github.com/apache/arrow/issues/45075) - [C++] Remove result_internal.h (#45066) +* [GH-45076](https://github.com/apache/arrow/issues/45076) - [CI][Packaging][Python] Simplify dev/tasks/python-wheels/github.linux.yml (#45077) +* [GH-45079](https://github.com/apache/arrow/issues/45079) - [FlightRPC][C++] Deprecate InitializeFlightUcx before removing UCX (#45080) +* [GH-45092](https://github.com/apache/arrow/issues/45092) - [C++][Parquet] Add GetReadRanges function to FileReader (#45093) +* [GH-45096](https://github.com/apache/arrow/issues/45096) - [C++] Apply a cstdint patch to bundled Thrift for GCC 15 (#45097) +* [GH-45135](https://github.com/apache/arrow/issues/45135) - [C++] Remove useless "hash table ready" states in swiss join (#45136) +* [GH-45137](https://github.com/apache/arrow/issues/45137) - [CI][C++] Add a GCC 15 job (#45138) +* [GH-45140](https://github.com/apache/arrow/issues/45140) - [Dev][Release] Release guide improvements (#45141) +* [GH-45142](https://github.com/apache/arrow/issues/45142) - [C++] Ensure using `cpp/cmake_modules/*.cmake` (#45143) +* [GH-45164](https://github.com/apache/arrow/issues/45164) - [CI][Integration] Follow build script name change in apache/arrow-java (#45199) +* [GH-45164](https://github.com/apache/arrow/issues/45164) - [CI][Integration] Remove "java_" prefix from Java build scripts (#45165) +* [GH-45166](https://github.com/apache/arrow/issues/45166) - [CI][C++] Upgrade Alpine Linux to 3.18 from 3.16 (#45168) +* [GH-45175](https://github.com/apache/arrow/issues/45175) - [Python] Honor the strings_to_categorical keyword in to_pandas for string view type (#45176) +* [GH-45178](https://github.com/apache/arrow/issues/45178) - [CI] Remove clcache related codes (#45182) +* [GH-45198](https://github.com/apache/arrow/issues/45198) - [CI][Integration] Follow build script name change in apache/arrow-java + + +## Bug Fixes + +* [GH-15233](https://github.com/apache/arrow/issues/15233) - [C++] Fix CopyFiles when destination is a FileSystem with background_writes (#44897) +* [GH-39914](https://github.com/apache/arrow/issues/39914) - [pyarrow] Reorder to_pandas extension dtype mapping (#44720) +* [GH-40633](https://github.com/apache/arrow/issues/40633) - [C++][Python] Fix ORC crash when file contains unknown timezone (#45051) +* [GH-41326](https://github.com/apache/arrow/issues/41326) - [Python] Converting month\_day\_nano\_interal to numpy crashes +* [GH-41536](https://github.com/apache/arrow/issues/41536) - [C++] Replace std::aligned_storage that is deprecated in C++23 (#45019) +* [GH-41667](https://github.com/apache/arrow/issues/41667) - [C++][Parquet] Refuse writing non-nullable column that contains nulls (#44921) +* [GH-43124](https://github.com/apache/arrow/issues/43124) - [C++] Initialize offset vector head as 0 after memory allocated in grouper.cc (#43123) +* [GH-43949](https://github.com/apache/arrow/issues/43949) - [C++] io::BufferedInput: Fix invalid state after SetBufferSize (#44387) +* [GH-43994](https://github.com/apache/arrow/issues/43994) - [C++][Parquet] Fix schema conversion from two-level encoding nested list (#43995) +* [GH-44344](https://github.com/apache/arrow/issues/44344) - [Java] fix VectorSchemaRoot.getTransferPair for NullVector (#44631) +* [GH-44368](https://github.com/apache/arrow/issues/44368) - [C++] Use "lib" for generating bundled dependencies even with "clang-cl" (#44391) +* [GH-44372](https://github.com/apache/arrow/issues/44372) - [C++] Fix unaligned load/store implementation for clang-18 (#44468) +* [GH-44384](https://github.com/apache/arrow/issues/44384) - [C++] Use CMAKE_LIBTOOL on macOS (#44385) +* [GH-44396](https://github.com/apache/arrow/issues/44396) - [CI][C++] Use setup-python on hosted runner (#44411) +* [GH-44412](https://github.com/apache/arrow/issues/44412) - [CI][R] Use Ubuntu 22.04 not 24.04 for Arrow C++ 15.0.2 deb packages (#44413) +* [GH-44428](https://github.com/apache/arrow/issues/44428) - [CI][C#] Use setup-python to use "pip install" (#44429) +* [GH-44430](https://github.com/apache/arrow/issues/44430) - [CI][Swfit] Use setup-python to use "pip install" (#44431) +* [GH-44432](https://github.com/apache/arrow/issues/44432) - [Swift] Use flatbuffers v24.3.7 (#44433) +* [GH-44441](https://github.com/apache/arrow/issues/44441) - [Release] Add missing `--repo apache/arrow` in `02-source.sh` (#44442) +* [GH-44455](https://github.com/apache/arrow/issues/44455) - [C++] Update vendored date to 3.0.3 (#44482) +* [GH-44465](https://github.com/apache/arrow/issues/44465) - [GLib][C++] Meson searches libraries with specific versions. (#44475) +* [GH-44478](https://github.com/apache/arrow/issues/44478) - [GLib] Prevent Arrow::ExtensionType.new (#44498) +* [GH-44479](https://github.com/apache/arrow/issues/44479) - [CI][Archery] Add missing Flight integration targets (#44691) +* [GH-44483](https://github.com/apache/arrow/issues/44483) - [GLib] Don't check invalid decimal precision message in test (#44484) +* [GH-44526](https://github.com/apache/arrow/issues/44526) - [C++][Acero] Fix crash when thread in asof_join is not running (#44584) +* [GH-44541](https://github.com/apache/arrow/issues/44541) - [C++] NumericArray should not use ctor from parent directly (#44542) +* [GH-44563](https://github.com/apache/arrow/issues/44563) - [C++] `FunctionOptions::{Serialize,Deserialize}()` return an error without `ARROW_IPC` (#45171) +* [GH-44564](https://github.com/apache/arrow/issues/44564) - [Java][FlightSQL] Fix native libraries relocation (#44565) +* [GH-44570](https://github.com/apache/arrow/issues/44570) - [Release][R][Docs] Update `r/pkgdown/assets/versions.html` (#44572) +* [GH-44574](https://github.com/apache/arrow/issues/44574) - [Release] Ensure using the release tag to build binaries (#44577) +* [GH-44585](https://github.com/apache/arrow/issues/44585) - [JS][Release] Skip bin dir in npm-release.sh (#44861) +* [GH-44601](https://github.com/apache/arrow/issues/44601) - [GLib] Fix the wrong GARROW_AVAILABLE_IN declaration (#44602) +* [GH-44624](https://github.com/apache/arrow/issues/44624) - [CI][JS] Increase "AMD64 macOS 13 NodeJS 18" timeout (#44625) +* [GH-44626](https://github.com/apache/arrow/issues/44626) - [Java] fix SplitAndTransfer throws for empty MapVector (#44627) +* [GH-44651](https://github.com/apache/arrow/issues/44651) - [Python] Allow from_buffers to work with StringView on Python (#44701) +* [GH-44657](https://github.com/apache/arrow/issues/44657) - [CI][Dev] Add write permission to the crossbow comment bot (#44658) +* [GH-44668](https://github.com/apache/arrow/issues/44668) - [Docs] Fix ColumnChunkMetaData offset documentation in pyarrow (#44670) +* [GH-44677](https://github.com/apache/arrow/issues/44677) - [C++][Acero] Enhance partition sort example (#44678) +* [GH-44679](https://github.com/apache/arrow/issues/44679) - [C++][Python] Fix Flight Timestamp precision, revert workaround from #43537 (#44681) +* [GH-44695](https://github.com/apache/arrow/issues/44695) - [C++] Add S3 option to ignore SIGPIPE signals (#44735) +* [GH-44706](https://github.com/apache/arrow/issues/44706) - [Release][Archery][Packaging] Add "so_version" variable (#44707) +* [GH-44711](https://github.com/apache/arrow/issues/44711) - [Docs][Python] Add missing canonical extension types to PyArrow arrays and datatypes docs (#44880) +* [GH-44714](https://github.com/apache/arrow/issues/44714) - [C++] Keep field metadata for keys and values when importing a map type via the C data interface (#44715) +* [GH-44716](https://github.com/apache/arrow/issues/44716) - [Dev][Integration] Add numpy to archery integration deps (#44717) +* [GH-44726](https://github.com/apache/arrow/issues/44726) - [CI] Update substrait consumer call to use updated producer and consumer parameters (#44727) +* [GH-44728](https://github.com/apache/arrow/issues/44728) - [Python] Trigger manual Garbage collection before checking allocated bytes for dlpack tests (#44793) +* [GH-44734](https://github.com/apache/arrow/issues/44734) - [C++][CI] Fix arrow-c-bridge-test timeout with threading disabled (#44737) +* [GH-44742](https://github.com/apache/arrow/issues/44742) - [Ruby] Fix a bug that empty struct list value can't be built (#44763) +* [GH-44754](https://github.com/apache/arrow/issues/44754) - [C++] Use lowercased `windows.h` to enable cross-platform builds (#44755) +* [GH-44767](https://github.com/apache/arrow/issues/44767) - [C++] Fix Float16.To{Little,Big}Endian on big endian machines (#44768) +* [GH-44769](https://github.com/apache/arrow/issues/44769) - [C++][Parquet] Fix read/write of metadata length footer on big-endian systems (#44787) +* [GH-44773](https://github.com/apache/arrow/issues/44773) - [Dev][Doc] Remove obsolete Read the docs configuration (#44774) +* [GH-44797](https://github.com/apache/arrow/issues/44797) - [CI] Update pkg-config to pkgconf on Homebrew (#44798) +* [GH-44802](https://github.com/apache/arrow/issues/44802) - [C++][CI] Migrate to `arrow::Result` based `parquet::arrow::OpenFile() API` in example tutorials (#44807) +* [GH-44820](https://github.com/apache/arrow/issues/44820) - [CI] Uninstall pkg-config entirely on macOS jobs (#44821) +* [GH-44844](https://github.com/apache/arrow/issues/44844) - [CI] Uninstall pkg-config entirely on verification and java-jars macOS jobs (#44845) +* [GH-44846](https://github.com/apache/arrow/issues/44846) - [C++] Fix thread-unsafe access in ConcurrentQueue::UnsyncFront (#44849) +* [GH-44855](https://github.com/apache/arrow/issues/44855) - [Python][Packaging] Use delvewheel to repair Windows wheels (#35323) +* [GH-44885](https://github.com/apache/arrow/issues/44885) - [Dev][Release] Update test condition in utils-prepare.sh (#44862) +* [GH-44898](https://github.com/apache/arrow/issues/44898) - [C++] Fix compilation error on GCC 8 (#44899) +* [GH-44911](https://github.com/apache/arrow/issues/44911) - [C#] Choose port numbers dynamically for ArrowStreamWriterTests (#44912) +* [GH-44918](https://github.com/apache/arrow/issues/44918) - [Ruby] Fix a bug that empty list is ignored in builder (#44933) +* [GH-44954](https://github.com/apache/arrow/issues/44954) - [C++][CI] Silence protobuf-generated deprecations (#44955) +* [GH-44968](https://github.com/apache/arrow/issues/44968) - [CI][Dev] Remove nonexistent `getJiraInfo` reference (#44972) +* [GH-44974](https://github.com/apache/arrow/issues/44974) - [Dev] Fix minor issue handling (#44975) +* [GH-44980](https://github.com/apache/arrow/issues/44980) - [CI] Remove retrieval of Arrow version from Java on Spark integration and update test structure for PySpark (#44981) +* [GH-44985](https://github.com/apache/arrow/issues/44985) - [C++] Use recommended downloads URLs for ORC and Thrift (#44977) +* [GH-44991](https://github.com/apache/arrow/issues/44991) - [CI][Python] Fix and modernize AppVeyor build (#44999) +* [GH-45003](https://github.com/apache/arrow/issues/45003) - [Python][Docs] Update docstrings for metadata methods on Field and Schema classes (#45004) +* [GH-45006](https://github.com/apache/arrow/issues/45006) - [CI][Python] Fix test_memory failures (#45007) +* [GH-45027](https://github.com/apache/arrow/issues/45027) - [C++] Include path in the documentation is wrong (#45031) +* [GH-45034](https://github.com/apache/arrow/issues/45034) - [C++] Remove Parquet requirement from Arrow Acero and from Arrow Dataset when not necessary (#45035) +* [GH-45039](https://github.com/apache/arrow/issues/45039) - [CI][Packaging][Python] Fix Docker push step for free-threaded wheel builds (#45040) +* [GH-45041](https://github.com/apache/arrow/issues/45041) - [Packaging][Python] Use ORC from vcpkg instead of bundled on Linux and macOS (#45046) +* [GH-45053](https://github.com/apache/arrow/issues/45053) - [C++] Add support for Boost 1.87.0 (#45057) +* [GH-45059](https://github.com/apache/arrow/issues/45059) - [C++][CI] Fix test-build-cpp-fuzz failures (#45060) +* [GH-45069](https://github.com/apache/arrow/issues/45069) - [R][CI] Fix r-binary-packages crossbow job (#45070) +* [GH-45071](https://github.com/apache/arrow/issues/45071) - [Packaging][Docs] Fix NumPy v2 include directory for Emscripten, and update Pyodide-related documentation (#45072) +* [GH-45073](https://github.com/apache/arrow/issues/45073) - [C++][Parquet] Fix generation of repetition levels for encryption test data (#45074) +* [GH-45099](https://github.com/apache/arrow/issues/45099) - [C++] Avoid static const variable in the status.h (#45100) +* [GH-45118](https://github.com/apache/arrow/issues/45118) - [Packaging] Use armored keyring for APT repository (#45131) +* [GH-45119](https://github.com/apache/arrow/issues/45119) - [Ruby] Use #size for Arrow::Column#size backend (#45133) +* [GH-45151](https://github.com/apache/arrow/issues/45151) - [C++][Parquet] Fix Null-dereference READ in parquet::arrow::ListToSchemaField (#45152) +* [GH-45157](https://github.com/apache/arrow/issues/45157) - [CI][Packaging] Remove explicit pins of gemfury version in dev/tasks/macros.jinja (#45158) +* [GH-45183](https://github.com/apache/arrow/issues/45183) - [C++][Release] Add llvm-dev back to setup-ubuntu.sh (#45184) +* [GH-45212](https://github.com/apache/arrow/issues/45212) - [C++][Parquet] Fix uninitialized size_statistics_level property (#45213) + + + # Apache Arrow 6.0.1 (2021-11-18) ## Bug Fixes diff --git a/c_glib/meson.build b/c_glib/meson.build index bd7843d8bc362..c668e30e67361 100644 --- a/c_glib/meson.build +++ b/c_glib/meson.build @@ -35,7 +35,7 @@ project('arrow-glib', 'c', 'cpp', # * 22.04: 0.61.2 meson_version: '>=0.53.2') -version = '19.0.0-SNAPSHOT' +version = '19.0.0' if version.endswith('-SNAPSHOT') version_numbers = version.split('-')[0].split('.') version_tag = version.split('-')[1] diff --git a/c_glib/vcpkg.json b/c_glib/vcpkg.json index f2717f7e27cf2..88b010d1b3e9a 100644 --- a/c_glib/vcpkg.json +++ b/c_glib/vcpkg.json @@ -1,6 +1,6 @@ { "name": "arrow-glib", - "version-string": "19.0.0-SNAPSHOT", + "version-string": "19.0.0", "dependencies": [ "glib", "gobject-introspection", diff --git a/ci/scripts/PKGBUILD b/ci/scripts/PKGBUILD index 4c567d550b92a..792df1376d90f 100644 --- a/ci/scripts/PKGBUILD +++ b/ci/scripts/PKGBUILD @@ -18,7 +18,7 @@ _realname=arrow pkgbase=mingw-w64-${_realname} pkgname="${MINGW_PACKAGE_PREFIX}-${_realname}" -pkgver=18.1.0.9000 +pkgver=19.0.0 pkgrel=8000 pkgdesc="Apache Arrow is a cross-language development platform for in-memory data (mingw-w64)" arch=("any") diff --git a/cpp/CMakeLists.txt b/cpp/CMakeLists.txt index 103e0f08445d9..67189526eff7e 100644 --- a/cpp/CMakeLists.txt +++ b/cpp/CMakeLists.txt @@ -79,7 +79,7 @@ if(POLICY CMP0170) cmake_policy(SET CMP0170 NEW) endif() -set(ARROW_VERSION "19.0.0-SNAPSHOT") +set(ARROW_VERSION "19.0.0") string(REGEX MATCH "^[0-9]+\\.[0-9]+\\.[0-9]+" ARROW_BASE_VERSION "${ARROW_VERSION}") diff --git a/cpp/vcpkg.json b/cpp/vcpkg.json index 6192e4efbd21e..75264a31b849e 100644 --- a/cpp/vcpkg.json +++ b/cpp/vcpkg.json @@ -1,6 +1,6 @@ { "name": "arrow", - "version-string": "19.0.0-SNAPSHOT", + "version-string": "19.0.0", "dependencies": [ "abseil", { diff --git a/csharp/Directory.Build.props b/csharp/Directory.Build.props index 1994aa74bd9f4..993ca1a59582b 100644 --- a/csharp/Directory.Build.props +++ b/csharp/Directory.Build.props @@ -29,7 +29,7 @@ Apache Arrow library Copyright 2016-2024 The Apache Software Foundation The Apache Software Foundation - 19.0.0-SNAPSHOT + 19.0.0 diff --git a/dev/tasks/homebrew-formulae/apache-arrow-glib.rb b/dev/tasks/homebrew-formulae/apache-arrow-glib.rb index df068812492e8..9c6ed8c587423 100644 --- a/dev/tasks/homebrew-formulae/apache-arrow-glib.rb +++ b/dev/tasks/homebrew-formulae/apache-arrow-glib.rb @@ -29,7 +29,7 @@ class ApacheArrowGlib < Formula desc "GLib bindings for Apache Arrow" homepage "https://arrow.apache.org/" - url "https://www.apache.org/dyn/closer.lua?path=arrow/arrow-19.0.0-SNAPSHOT/apache-arrow-19.0.0-SNAPSHOT.tar.gz" + url "https://www.apache.org/dyn/closer.lua?path=arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz" sha256 "9948ddb6d4798b51552d0dca3252dd6e3a7d0f9702714fc6f5a1b59397ce1d28" license "Apache-2.0" head "https://github.com/apache/arrow.git", branch: "main" diff --git a/dev/tasks/homebrew-formulae/apache-arrow.rb b/dev/tasks/homebrew-formulae/apache-arrow.rb index a3730db719e05..2400461c156b1 100644 --- a/dev/tasks/homebrew-formulae/apache-arrow.rb +++ b/dev/tasks/homebrew-formulae/apache-arrow.rb @@ -29,7 +29,7 @@ class ApacheArrow < Formula desc "Columnar in-memory analytics layer designed to accelerate big data" homepage "https://arrow.apache.org/" - url "https://www.apache.org/dyn/closer.lua?path=arrow/arrow-19.0.0-SNAPSHOT/apache-arrow-19.0.0-SNAPSHOT.tar.gz" + url "https://www.apache.org/dyn/closer.lua?path=arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz" sha256 "9948ddb6d4798b51552d0dca3252dd6e3a7d0f9702714fc6f5a1b59397ce1d28" license "Apache-2.0" head "https://github.com/apache/arrow.git", branch: "main" diff --git a/dev/tasks/linux-packages/apache-arrow-apt-source/debian/changelog b/dev/tasks/linux-packages/apache-arrow-apt-source/debian/changelog index 7326bde3ee2d0..da35e4a4f8deb 100644 --- a/dev/tasks/linux-packages/apache-arrow-apt-source/debian/changelog +++ b/dev/tasks/linux-packages/apache-arrow-apt-source/debian/changelog @@ -1,3 +1,9 @@ +apache-arrow-apt-source (19.0.0-1) unstable; urgency=low + + * New upstream release. + + -- Bryce Mecum Sat, 11 Jan 2025 04:17:56 -0000 + apache-arrow-apt-source (18.1.0-1) unstable; urgency=low * New upstream release. diff --git a/dev/tasks/linux-packages/apache-arrow-release/yum/apache-arrow-release.spec.in b/dev/tasks/linux-packages/apache-arrow-release/yum/apache-arrow-release.spec.in index ff08b13902923..6f826ad0a360e 100644 --- a/dev/tasks/linux-packages/apache-arrow-release/yum/apache-arrow-release.spec.in +++ b/dev/tasks/linux-packages/apache-arrow-release/yum/apache-arrow-release.spec.in @@ -102,6 +102,9 @@ else fi %changelog +* Sat Jan 11 2025 Bryce Mecum - 19.0.0-1 +- New upstream release. + * Wed Nov 13 2024 Bryce Mecum - 18.1.0-1 - New upstream release. diff --git a/dev/tasks/linux-packages/apache-arrow/debian/changelog b/dev/tasks/linux-packages/apache-arrow/debian/changelog index 43fd92f73f68e..10acb10b28ec0 100644 --- a/dev/tasks/linux-packages/apache-arrow/debian/changelog +++ b/dev/tasks/linux-packages/apache-arrow/debian/changelog @@ -1,3 +1,9 @@ +apache-arrow (19.0.0-1) unstable; urgency=low + + * New upstream release. + + -- Bryce Mecum Sat, 11 Jan 2025 04:17:56 -0000 + apache-arrow (18.1.0-1) unstable; urgency=low * New upstream release. diff --git a/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in b/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in index e5d8c87f45996..9ca9898fa9b15 100644 --- a/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in +++ b/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in @@ -894,6 +894,9 @@ Documentation for Apache Parquet GLib. %endif %changelog +* Sat Jan 11 2025 Bryce Mecum - 19.0.0-1 +- New upstream release. + * Wed Nov 13 2024 Bryce Mecum - 18.1.0-1 - New upstream release. diff --git a/docs/source/_static/versions.json b/docs/source/_static/versions.json index f1a20098d4caf..370915c479856 100644 --- a/docs/source/_static/versions.json +++ b/docs/source/_static/versions.json @@ -1,15 +1,20 @@ [ { - "name": "19.0 (dev)", + "name": "20.0 (dev)", "version": "dev/", "url": "https://arrow.apache.org/docs/dev/" }, { - "name": "18.1 (stable)", + "name": "19.0 (stable)", "version": "", "url": "https://arrow.apache.org/docs/", "preferred": true }, + { + "name": "18.1", + "version": "18.1/", + "url": "https://arrow.apache.org/docs/18.1/" + }, { "name": "18.0", "version": "18.0/", diff --git a/js/package.json b/js/package.json index 17bb95b7a5f51..7aba1c1fee3a7 100644 --- a/js/package.json +++ b/js/package.json @@ -120,5 +120,5 @@ "engines": { "node": ">=12.0" }, - "version": "19.0.0-SNAPSHOT" + "version": "19.0.0" } diff --git a/matlab/CMakeLists.txt b/matlab/CMakeLists.txt index c11decb5a77a2..fb02a40a771b6 100644 --- a/matlab/CMakeLists.txt +++ b/matlab/CMakeLists.txt @@ -100,7 +100,7 @@ endfunction() set(CMAKE_CXX_STANDARD 17) -set(MLARROW_VERSION "19.0.0-SNAPSHOT") +set(MLARROW_VERSION "19.0.0") string(REGEX MATCH "^[0-9]+\\.[0-9]+\\.[0-9]+" MLARROW_BASE_VERSION "${MLARROW_VERSION}") project(mlarrow VERSION "${MLARROW_BASE_VERSION}") diff --git a/python/CMakeLists.txt b/python/CMakeLists.txt index 80d1cd31ac231..714c6448453fa 100644 --- a/python/CMakeLists.txt +++ b/python/CMakeLists.txt @@ -28,7 +28,7 @@ project(pyarrow) # which in turn meant that Py_GIL_DISABLED was not set. set(CMAKE_NO_SYSTEM_FROM_IMPORTED ON) -set(PYARROW_VERSION "19.0.0-SNAPSHOT") +set(PYARROW_VERSION "19.0.0") string(REGEX MATCH "^[0-9]+\\.[0-9]+\\.[0-9]+" PYARROW_BASE_VERSION "${PYARROW_VERSION}") # Generate SO version and full SO version diff --git a/python/pyproject.toml b/python/pyproject.toml index 85bdbec0915ed..ef2043f6d0b71 100644 --- a/python/pyproject.toml +++ b/python/pyproject.toml @@ -81,4 +81,4 @@ root = '..' version_file = 'pyarrow/_generated_version.py' version_scheme = 'guess-next-dev' git_describe_command = 'git describe --dirty --tags --long --match "apache-arrow-[0-9]*.*"' -fallback_version = '19.0.0a0' +fallback_version = '19.0.0' diff --git a/r/DESCRIPTION b/r/DESCRIPTION index b45022f5aa90b..cd57f9a524a52 100644 --- a/r/DESCRIPTION +++ b/r/DESCRIPTION @@ -1,6 +1,6 @@ Package: arrow Title: Integration to 'Apache' 'Arrow' -Version: 18.1.0.9000 +Version: 19.0.0 Authors@R: c( person("Neal", "Richardson", email = "neal.p.richardson@gmail.com", role = c("aut")), person("Ian", "Cook", email = "ianmcook@gmail.com", role = c("aut")), diff --git a/r/NEWS.md b/r/NEWS.md index 83d09157b9038..437867d3ad482 100644 --- a/r/NEWS.md +++ b/r/NEWS.md @@ -17,7 +17,7 @@ under the License. --> -# arrow 18.1.0.9000 +# arrow 19.0.0 # arrow 18.1.0 diff --git a/r/pkgdown/assets/versions.html b/r/pkgdown/assets/versions.html index b9148ef4ff092..cbeff74fa4ce4 100644 --- a/r/pkgdown/assets/versions.html +++ b/r/pkgdown/assets/versions.html @@ -1,7 +1,8 @@ -

18.1.0.9000 (dev)

-

18.1.0 (release)

+

19.0.0.9000 (dev)

+

19.0.0 (release)

+

18.1.0

17.0.0

16.1.0

16.0.0

diff --git a/r/pkgdown/assets/versions.json b/r/pkgdown/assets/versions.json index 8c8d1203f01a1..105c4d2f3d0e2 100644 --- a/r/pkgdown/assets/versions.json +++ b/r/pkgdown/assets/versions.json @@ -1,12 +1,16 @@ [ { - "name": "18.1.0.9000 (dev)", + "name": "19.0.0.9000 (dev)", "version": "dev/" }, { - "name": "18.1.0 (release)", + "name": "19.0.0 (release)", "version": "" }, + { + "name": "18.1.0", + "version": "18.1/" + }, { "name": "17.0.0", "version": "17.0/" diff --git a/ruby/red-arrow-cuda/lib/arrow-cuda/version.rb b/ruby/red-arrow-cuda/lib/arrow-cuda/version.rb index 284afd290f655..2a659bdca1931 100644 --- a/ruby/red-arrow-cuda/lib/arrow-cuda/version.rb +++ b/ruby/red-arrow-cuda/lib/arrow-cuda/version.rb @@ -16,7 +16,7 @@ # under the License. module ArrowCUDA - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-arrow-dataset/lib/arrow-dataset/version.rb b/ruby/red-arrow-dataset/lib/arrow-dataset/version.rb index 26cf368019f68..9b70cd29321f0 100644 --- a/ruby/red-arrow-dataset/lib/arrow-dataset/version.rb +++ b/ruby/red-arrow-dataset/lib/arrow-dataset/version.rb @@ -16,7 +16,7 @@ # under the License. module ArrowDataset - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-arrow-flight-sql/lib/arrow-flight-sql/version.rb b/ruby/red-arrow-flight-sql/lib/arrow-flight-sql/version.rb index a4fabe6737aa8..f2cd2f5175866 100644 --- a/ruby/red-arrow-flight-sql/lib/arrow-flight-sql/version.rb +++ b/ruby/red-arrow-flight-sql/lib/arrow-flight-sql/version.rb @@ -16,7 +16,7 @@ # under the License. module ArrowFlightSQL - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-arrow-flight/lib/arrow-flight/version.rb b/ruby/red-arrow-flight/lib/arrow-flight/version.rb index a497405047e0d..70e896ce3d43e 100644 --- a/ruby/red-arrow-flight/lib/arrow-flight/version.rb +++ b/ruby/red-arrow-flight/lib/arrow-flight/version.rb @@ -16,7 +16,7 @@ # under the License. module ArrowFlight - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-arrow/lib/arrow/version.rb b/ruby/red-arrow/lib/arrow/version.rb index 3033a470dd697..f369fd223a93c 100644 --- a/ruby/red-arrow/lib/arrow/version.rb +++ b/ruby/red-arrow/lib/arrow/version.rb @@ -16,7 +16,7 @@ # under the License. module Arrow - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-gandiva/lib/gandiva/version.rb b/ruby/red-gandiva/lib/gandiva/version.rb index d8e7688f2b454..5226a34f1e012 100644 --- a/ruby/red-gandiva/lib/gandiva/version.rb +++ b/ruby/red-gandiva/lib/gandiva/version.rb @@ -16,7 +16,7 @@ # under the License. module Gandiva - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-") diff --git a/ruby/red-parquet/lib/parquet/version.rb b/ruby/red-parquet/lib/parquet/version.rb index 836e3ce846c86..e5c9050744164 100644 --- a/ruby/red-parquet/lib/parquet/version.rb +++ b/ruby/red-parquet/lib/parquet/version.rb @@ -16,7 +16,7 @@ # under the License. module Parquet - VERSION = "19.0.0-SNAPSHOT" + VERSION = "19.0.0" module Version numbers, TAG = VERSION.split("-")