pkgsrc-WIP-changes archive


apache-arrow: Update to 11.0.0



Module Name:	pkgsrc-wip
Committed By:	Matthew Danielson <matthewd%fastmail.us@localhost>
Pushed By:	matthewd
Date:		Thu Feb 2 18:59:17 2023 -0800
Changeset:	96cd1dacd1d09ec424309c03e423eec1a3ab7638

Modified Files:
	apache-arrow/Makefile
	apache-arrow/PLIST
	apache-arrow/distinfo
	apache-arrow/version.mk

Log Message:
apache-arrow: Update to 11.0.0

11.0.0:
Thanks for your contributions and participation in the project!
Columnar Format Notes
Arrow Flight RPC notes
In the C++/Python Flight clients, DoAction now properly streams the results, instead of blocking until the call finishes. Applications that did not consume the iterator before should fully consume the result. (#15069)
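As a rough illustration of why the result should now be drained, here is a plain-Python stand-in. It uses an ordinary generator, not the pyarrow.flight API; do_action, run_and_drain and the result strings are hypothetical names made up for this sketch. With a lazily streamed result, later items and any trailing error only surface as the caller iterates:

```python
def do_action(steps):
    """Hypothetical stand-in for a streaming call: yields results lazily."""
    for i in range(steps):
        yield f"result-{i}"
    # Code after the last yield only runs once the caller has pulled
    # every result, e.g. to surface a final status or error.
    raise RuntimeError("server reported a failure at end of stream")


def run_and_drain(steps):
    """Consume the whole stream so that a trailing error is not missed."""
    results = []
    try:
        for item in do_action(steps):
            results.append(item)
    except RuntimeError as exc:
        return results, str(exc)
    return results, None
```

A caller that takes only the first item with next() never reaches the trailing error in this sketch. That is the kind of situation the note appears to warn about.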
C++ notes
    It is now possible to specify alignment when making allocations with a MemoryPool GH-33056
    It is now possible to run an ExecPlan without using any CPU threads
    Added kernel for slicing list values GH-33168
    Added kernel for slicing binary arrays GH-20357
    When comparing list arrays for equality the list field name is now ignored GH-30519
    Add support for partitioning on columns that contain special characters GH-33448
    Added a streaming reader for JSON GH-33140
    Added support for incremental writes to the ORC writer GH-33047
    Added support for casting decimal to string and writing decimal to CSV GH-33002
    Fixed an assert in the scanner that would occur when batch_readahead was set to 0 GH-15264
    Fixed bug where arrays with a null data buffer would not be accepted when imported via the C data API GH-14875
    Fixed bug where arrays with a zero-case union data type would not be accepted when imported via the C data API GH-14855
    Fixed bug where case_when could return incorrect values GH-33382
    Fixed bug where RecordBatch::Equals was ignoring field names GH-33285
C# notes
No major changes to C#.
Go notes
    Go’s benchmarks will now get added to Conbench alongside the benchmarks for other implementations GH-32983
    Exposed FlightService_ServiceDesc and RegisterFlightServiceServer to allow easily incorporating a flight service into an existing gRPC server GH-15174
Arrow
    Function ApproxEquals was implemented for scalar values GH-29581
    UnmarshalJSON for the RecordBuilder now properly handles extra unknown fields with complex/nested values GH-31840
    Decimal128 and Decimal256 type support has been added to the CSV reader GH-33111
    Fixed bug in array.UnionBuilder where Len method always returned 0 GH-14775
    Fixed bug for handling slices of Map arrays when marshalling to JSON and for IPC GH-14780
    Fixed memory leak when compressing IPC message body buffers GH-14883
    Added the ability to easily append scalar values to array builders GH-15005
Compute
    Scalar binary (add/subtract/multiply/divide/etc.) and unary (abs/neg/sqrt/sign/etc.) arithmetic has been implemented for the compute package, including convenience functions such as compute.Add and compute.Divide GH-33086
    Scalar boolean functions like AND/OR/XOR/etc. have been implemented for compute GH-33279
    Scalar comparison function kernels have been implemented for compute (equal/greater/greater_equal/less/less_equal) GH-33308
    Scalar compute functions are compatible with dictionary encoded arrays by casting them to their value types GH-33502
Parquet
    Panic when decoding a delta_bit_packed encoded column has been fixed GH-33483
    Fixed memory leak from Allocator in pqarrow.WriteArrowToColumn GH-14865
    Fixed writer.WriteBatch to properly handle writing encrypted parquet columns and no longer silently fail, but instead propagate an error GH-14940
Java notes
    Implement support for writing compressed files (#15223)
    Improve performance by short-circuiting null checks when comparing non null field types (#15106)
    Several enhancements to dictionary encoding (#14891, #14902, #14874)
    Extend Table to support additional vector types (#14573)
    Enhance and simplify handling of allocation management by integrating C Data into allocator hierarchy (#14506)
    Make ComplexCopier agnostic of specific implementation of MapWriter (#14557)
    Distribute Apple M1 compatible JNI libraries via mavencentral (#14472)
    Extend Table copy functionality, and support returning copies of individual vectors (#14389)
JavaScript notes
    Bugfixes and dependency updates.
    Arrow now requires BigInt support. GH-33681
Python notes
Compatibility notes:
    PyArrow now requires pandas >= 1.0 (ARROW-18173)
    The pyarrow.parquet.ParquetDataset() class now uses the new Dataset API under the hood by default (use_legacy_dataset=False). You can still pass use_legacy_dataset=True to get the legacy implementation, but this option will be removed in a future release (ARROW-16728).
New features:
    Added support for the DataFrame Interchange Protocol for pyarrow.Table (GH-33346).
    New kernels: list_slice() to slice each list element of a ListArray returning a new ListArray (ARROW-17960).
    A new filter() method on the Dataset class as additional API to filter a Dataset before consuming it (ARROW-16616).
    New sort() method for (Chunked)Array and sort_by() method for RecordBatch, providing a convenience on top of the sort_indices kernel (GH-14778), and a new Dataset.sort_by() method (GH-14975).
Other improvements:
    Support for custom metadata of record batches in the IPC read and write APIs (ARROW-16430).
    Support URIs and the filesystem parameter in pyarrow.parquet.ParquetFile (ARROW-18272) and pyarrow.parquet.write_metadata (ARROW-18225).
    When writing a dataset to IPC using pyarrow.dataset.write_dataset(), you can now specify IPC specific options, such as compression (ARROW-17991)
    The pyarrow.array() function now allows constructing a MapArray from a sequence of dicts (in addition to a sequence of tuples) (ARROW-17832).
    The struct_field() kernel now also accepts field names in addition to integer indices (ARROW-17989).
    Casting to string is now supported for duration (ARROW-15822) and decimal (ARROW-17458) types, which also means those can now be written to CSV.
    When writing to CSV, you can now specify the quoting style (GH-14755).
    The pyarrow.ipc.read_schema() function now accepts a Message object (ARROW-18423).
    The Time32Scalar, Time64Scalar, Date32Scalar and Date64Scalar classes got a .value attribute to access the underlying integer value, similar to the other date-time related scalars (ARROW-18264)
    Duration type is now supported in the hash kernels like dictionary_encode (GH-15226).
    Fix silent overflow when converting datetime.timedelta to duration type (ARROW-15026).
Relevant bug fixes:
    Numpy conversion for ListArray is improved by taking the sliced offset into account, avoiding increased memory usage (GH-20512).
    Fix writing files with multi-byte characters in file name (ARROW-18123).
R notes
    map_batches() is lazy by default; it now returns a RecordBatchReader instead of a list of RecordBatch objects unless lazy = FALSE. GH-14521
    A substantial reorganisation and rewrite of, and additions to, many of the vignettes and the README. GH-14514
For more on what’s in the 11.0.0 R package, see the R changelog.
Ruby and C GLib notes
Ruby
    Arrow::Table#save now always returns self instead of the result of its raw_records GH-15289
    Improve the GC-related crash prevention system by guarding the shared objects from GC ARROW-18161
    Add Arrow::HalfFloat and raw_records support in Arrow::HalfFloatArray ARROW-18086
    Support omitting join keys in Table#join GH-15084
    Add support for Arrow::Table.load(uri, schema:) ARROW-15206
    Add Arrow::ColumnContainable#column_names (e.g. Arrow::Table#column_names) GH-15085
    Add to_arrow_chunked_array methods to support converting to Arrow::ChunkedArray ARROW-18405
C GLib
    Add garrow_chunked_array_new_empty() GH-33671
    Add GArrowProjectNodeOptions GH-33670
    Add GADatasetHivePartitioning GH-15257
    The signature of garrow_execute_plan_wait() was changed to take the error argument and to return the finished status GH-15254
    Add support for half float GH-15168
    Add GADatasetFinishOptions GH-15146
Rust notes
The Rust projects have moved to separate repositories outside the main Arrow monorepo. For notes on the latest release of the Rust implementation, see the latest Arrow Rust changelog.

10.0.1:
Arrow Flight RPC notes
The Arrow Flight SQL JDBC driver now properly handles executeUpdate (ARROW-18294), and will no longer try to handle URIs that it does not recognize (ARROW-18296).
C++ notes
    Add support for ARMv6 (ARROW-18255), along with some other minor fixes.
Go notes
    Added option to support dictionary deltas with IPC (ARROW-18326)
    Fix dictionary replacement during IPC stream (ARROW-18317)
    Fix StructBuilder premature release fields (ARROW-18274)
Python notes
New features and improvements:
    Added support and wheels for Python 3.11 (ARROW-17487).
    Updated OpenSSL bundled on wheels to 3.0.7 due to CVE-2022-3786 (ARROW-18302).
R notes
    Fix for failing test after lubridate 1.9 release (ARROW-18285)
    Add deprecation cycle for pull() change (ARROW-18132)
    Fix to correctly handle .data pronoun in group_by() (ARROW-18131)
    Fix for dev purrr (ARROW-18305)

10.0.0:
Columnar Format Notes
Arrow Flight RPC notes
A JDBC driver based on Arrow Flight SQL is now available, courtesy of a code donation from Dremio (ARROW-7744). For more details, see “Expanding Arrow’s Reach with a JDBC Driver for Arrow Flight SQL”. Flight SQL is now supported in Go (ARROW-17326). Protocol definitions for transactions and Substrait plans were added to Flight SQL and are implemented in C++ and Java (ARROW-17688). General “best practices” documentation was added for C++ (ARROW-17407). The C++ implementation now has basic OpenTelemetry integration (ARROW-14958).
C++ notes
C++11 is no longer supported
The Arrow C++ codebase has moved to C++17 as its language compatibility standard (ARROW-17545). This means Arrow C++, including its header files, now requires a C++17-compliant compiler and standard library to be used. Such compilers are widely available on most platforms.
Compatibility backports of C++14 and C++17 features, such as std::string_view or std::variant, have been removed in favor of the standard library version of these APIs (ARROW-17546). This will also make integration of Arrow C++ with other codebases easier.
It is expected that the Arrow C++ codebase will be gradually modernized to use C++17 features in subsequent releases, when the need arises.
Plasma is deprecated
The Plasma module is deprecated and will be removed in a future release.
Compute / Acero
Extension types are now supported in hash joins (ARROW-16695).
Datasets
The fragments of a dataset can now be iterated in an asynchronous fashion, using Dataset::GetFragmentsAsync (ARROW-17318).
Filesystems
It is now possible to configure a timeout policy for S3 (ARROW-16521).
Error messages for S3 have been improved to give more context about the error (ARROW-17079).
GetFileInfoGenerator has been optimized for local filesystems, with dedicated options to tune chunking and readahead (ARROW-17306).
JSON
Previously, the JSON reader could only read Decimal fields from JSON strings (i.e. quoted). Now it can also read Decimal fields from JSON numbers (ARROW-17847).
Parquet
Before Arrow 3.0.0, data pages version 2 were incorrectly written out, making them unreadable with spec-compliant readers. A compatibility fix has been introduced so that they can still be read with contemporary versions of Arrow (ARROW-17100).
Substrait
The Substrait consumer, which allows Substrait plans to be executed by the Acero execution engine, has received some improvements:
    Aggregations are now supported (ARROW-15591).
    Conversion options have been added so that the level of compliance and roundtrippability can be chosen when converting between Substrait and Acero representations of a plan (ARROW-16988).
    Support for many standard Substrait functions has been added (ARROW-15582, ARROW-17523)
Some work has also been done in the reverse direction, to allow Acero execution plans to be serialized as Substrait plans (ARROW-16855).
Other changes
Our CMake package files have been overhauled (ARROW-12175). As a result, namespaced targets are now exported, such as Arrow::arrow_shared. Legacy (non-namespaced) names are still available, for example arrow_shared.
Compiling in release mode now uses -O2, not -O3, by default (ARROW-17436).
The RISC-V architecture is now recognized at build time (ARROW-17440).
The PyArrow-specific C++ code was moved into the PyArrow source tree (see below, ARROW-16340). The ARROW_PYTHON CMake variable has been deprecated and will be removed in a later release; you should instead enable the necessary components explicitly (ARROW-17868).
Some classes with an Equals method now also support operator== (ARROW-6772). It was decided to only do this when equality is computationally cheap (i.e. not on data collections such as Array, RecordBatch…).
C# notes
Bug Fixes
    Fixed DecimalArray incorrectly appending very large and very small values (ARROW-17223)
Gandiva notes
Gandiva has been migrated to use LLVM opaque pointer types, as typed pointers had been deprecated (ARROW-17790).
Go notes
    A new CI job has been added to run all of the tests with the -asan option using go1.18 (ARROW-17324)
    Go now passes all integration tests on data types and IPC handling.
    The Go Arrow and Parquet packages now require go1.17+ (ARROW-17646)
Compute
The compute package that was importable via github.com/apache/arrow/go/v9/arrow/compute is now a separate module which requires go1.18+ (only the compute module, the rest of the packages still work fine under go1.17). (ARROW-17456).
Scalar and Vector kernel infrastructure has been implemented for performing compute operations providing the following functionality:
    casting Arrow Arrays from one type to another (ARROW-17454)
    Using Filter and Take functions on an Arrow Array, Record Batch, or Table (ARROW-17669)
Arrow
    Sparse and Dense Union Arrays have been implemented along with appropriate builders and data type support including in IPC reading and writing. (ARROW-3678, ARROW-17276). This includes scalar types for Dense and Sparse union (ARROW-17390)
    LargeBinary, LargeString and LargeList arrays have been implemented for handling arrays with 64-bit offsets. This also included fixing a bug so that binary builders are correctly resettable. (ARROW-8226, ARROW-17275)
    Support for Decimal256 arrays has been implemented (ARROW-10600)
    Automatic Endianness Conversion for non-native endianness is now an option for IPC streams (ARROW-17219)
    CSV Writer now supports Timestamp, Date32 and Date64 types (ARROW-17273)
    CSV Writer now supports custom formatting for boolean values (ARROW-17277)
    The Go Arrow Library now provides a FlightSQL client and server implementation (ARROW-17326). An example server implementation is provided for a FlightSQL server using SQLite (ARROW-17359)
    CSV Reader now supports schema type inference via NewInferringReader, along with options for specifying the type of some columns and skipping columns (ARROW-17778)
Parquet
    RowGroupReader.Column(index int) no longer panics if provided an invalid column index. Instead the signature has been changed to now return (PageReader, error) similar to other methods in the codebase. (ARROW-17274)
    Bitpacking and other internal required implementations for ppc64le have been added for the Parquet library (ARROW-17372)
    A bug has been fixed that caused inconsistent row information data from a table written by Athena (ARROW-17453)
    Fixed a bug that caused panics when writing a Nullable List of Structs (ARROW-17169)
    Key Value metadata in an Arrow Schema will be propagated to the Parquet file when using pqarrow even when not using StoreSchema (ARROW-17627)
    A memory leak when using statistics on ByteArray columns has been fixed (ARROW-17573)
Java notes
Many important features, enhancements, and bug fixes are included in this release, as are documentation enhancements, and a large number of improvements to build processes and project infrastructure. Selected highlights can be found below.
New Features and Enhancements
    JDBC Driver for Arrow Flight SQL (13800)
    Initial implementation of immutable Table API (14316)
    Substrait, transaction, cancellation for Flight SQL (13492)
    Read Arrow IPC, CSV, and ORC files by NativeDatasetFactory (13811, 13973, 14182)
    Add utility to bind Arrow data to JDBC parameters (13589)
Build enhancements
    Add Windows build script that produces DLLs (14203)
    C Data Interface and Dataset libraries can now be built with mvn commands (13881, 13889)
Java notes:
    Java Plasma JNI bindings have been deprecated (14262)
JavaScript notes
No major changes.
Python notes
Compatibility notes:
    Some deprecated APIs (deprecated at least since pyarrow 1.0.0) have been removed: the RecordBatchReader.get_next_batch method, DataType.num_children attribute, etc (ARROW-17649).
    When writing to Arrow IPC file format with pyarrow.dataset.write_dataset using format="ipc" or format="arrow", the default extension for the resulting files is changed to .arrow instead of .feather. You can still use format="feather" to write identical files but using the .feather extension (ARROW-17089).
New features and improvements:
    Filters in pq.read_table() can be passed as an expression in addition to the legacy list of tuples. For example, filters=pc.field("col") < 4 is equivalent to filters=[("col", "<", 4)] (ARROW-17483).
    The batch_readahead and fragment_readahead arguments for scanning Datasets are exposed in Python (ARROW-17299).
    ExtensionArrays can now be created from a storage array through the pa.array(..) constructor (ARROW-17834).
    Converting ListArrays containing ExtensionArray values to numpy or pandas works by falling back to the storage array (ARROW-17813).
    The pyarrow.substrait.run_query() function gained a table_provider keyword to run the query against in-memory tables (ARROW-17521).
    The StructType class gained a field() method to retrieve a child field (ARROW-17131).
    Casting Tables to a new schema now honors the nullability flag in the target schema (ARROW-16651).
    Parquet files are now explicitly closed after reading (ARROW-13763).
Further, the Python bindings benefit from improvements in the C++ library (e.g. new compute functions); see the C++ notes above for additional details.
Build notes
The PyArrow-specific C++ code, previously part of the Arrow C++ codebase, is now integrated into PyArrow. The tests are run automatically as part of the PyArrow test suite. (See ARROW-16340, ARROW-17122 and the PyArrow C++ API notes.)
The build process is generally not affected by the change, but the ARROW_PYTHON CMake variable has been deprecated and will be removed in a later release; you should instead enable the necessary components explicitly (ARROW-17868).
R notes
Many improvements to Arrow dplyr queries are added in this version, including:
    dplyr::across() can be used to apply the same computation across multiple columns;
    long-running queries can now be cancelled;
    the data source file name can be added as a column when reading multi-file datasets with add_filename();
    joins now support extension arrays;
    and all supported Arrow dplyr functions are now documented on the R documentation site.
For more on what’s in the 10.0.0 R package, see the R changelog.
Ruby and C GLib notes
Ruby
    Plasma binding has been deprecated (ARROW-17864)
C GLib
    Plasma binding has been deprecated (ARROW-17862)
Rust notes
The Rust projects have moved to separate repositories outside the main Arrow monorepo. For notes on the latest release of the Rust implementation, see the latest Arrow Rust changelog.

To see a diff of this commit:
https://wip.pkgsrc.org/cgi-bin/gitweb.cgi?p=pkgsrc-wip.git;a=commitdiff;h=96cd1dacd1d09ec424309c03e423eec1a3ab7638

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.

diffstat:
 apache-arrow/Makefile   |   4 +-
 apache-arrow/PLIST      | 121 ++++++++++++++++--------------------------------
 apache-arrow/distinfo   |  12 ++---
 apache-arrow/version.mk |   2 +-
 4 files changed, 48 insertions(+), 91 deletions(-)

diffs:
diff --git a/apache-arrow/Makefile b/apache-arrow/Makefile
index 6e418867a9..660cd94655 100644
--- a/apache-arrow/Makefile
+++ b/apache-arrow/Makefile
@@ -14,7 +14,7 @@ LICENSE=	apache-2.0
 # These packages are built within arrow, and it
 # looks difficult to decouple them
 # They come from ./cpp/thirdparty/versions.txt
-XSIMD=		8.1.0.tar.gz
+XSIMD=		9.0.1.tar.gz
 JEMALLOC=	jemalloc-5.3.0.tar.bz2
 
 DISTFILES+=		${DISTNAME}${EXTRACT_SUFX}
@@ -54,7 +54,7 @@ CONFIGURE_ENV+=	ARROW_XSIMD_URL=/${DISTDIR}/${XSIMD}
 CMAKE_ARGS+=	-DARROW_BUILD_TESTS=OFF
 
 post-install:
-	${MV} ${DESTDIR}${PREFIX}/share/gdb/auto-load/${LOCALBASE}/lib/libarrow.so.900.0.0-gdb.py ${DESTDIR}${PREFIX}/share/arrow/gdb/
+	${MV} ${DESTDIR}${PREFIX}/share/gdb/auto-load/${LOCALBASE}/lib/libarrow.so.${APACHE_ARROW_VERSION:S/.//g}.0.0-gdb.py ${DESTDIR}${PREFIX}/share/arrow/gdb/
 
 BUILD_DEPENDS+=		cmake-[0-9]*:../../devel/cmake
 # Needs gnu install for -v option
diff --git a/apache-arrow/PLIST b/apache-arrow/PLIST
index e0940f0fba..3ce51f812e 100644
--- a/apache-arrow/PLIST
+++ b/apache-arrow/PLIST
@@ -1,4 +1,6 @@
 @comment $NetBSD$
+bin/arrow-file-to-stream
+bin/arrow-stream-to-file
 include/arrow/api.h
 include/arrow/array.h
 include/arrow/array/array_base.h
@@ -38,6 +40,7 @@ include/arrow/compute/cast.h
 include/arrow/compute/exec.h
 include/arrow/compute/exec/accumulation_queue.h
 include/arrow/compute/exec/aggregate.h
+include/arrow/compute/exec/asof_join_node.h
 include/arrow/compute/exec/benchmark_util.h
 include/arrow/compute/exec/bloom_filter.h
 include/arrow/compute/exec/exec_plan.h
@@ -45,12 +48,13 @@ include/arrow/compute/exec/expression.h
 include/arrow/compute/exec/hash_join.h
 include/arrow/compute/exec/hash_join_dict.h
 include/arrow/compute/exec/hash_join_node.h
-include/arrow/compute/exec/ir_consumer.h
 include/arrow/compute/exec/key_hash.h
 include/arrow/compute/exec/key_map.h
+include/arrow/compute/exec/map_node.h
 include/arrow/compute/exec/options.h
 include/arrow/compute/exec/order_by_impl.h
 include/arrow/compute/exec/partition_util.h
+include/arrow/compute/exec/query_context.h
 include/arrow/compute/exec/schema_util.h
 include/arrow/compute/exec/swiss_join.h
 include/arrow/compute/exec/task_util.h
@@ -149,38 +153,6 @@ include/arrow/memory_pool.h
 include/arrow/memory_pool_test.h
 include/arrow/pch.h
 include/arrow/pretty_print.h
-include/arrow/python/api.h
-include/arrow/python/arrow_to_pandas.h
-include/arrow/python/benchmark.h
-include/arrow/python/common.h
-include/arrow/python/csv.h
-include/arrow/python/datetime.h
-include/arrow/python/decimal.h
-include/arrow/python/deserialize.h
-include/arrow/python/extension_type.h
-include/arrow/python/filesystem.h
-include/arrow/python/flight.h
-include/arrow/python/gdb.h
-include/arrow/python/helpers.h
-include/arrow/python/inference.h
-include/arrow/python/init.h
-include/arrow/python/io.h
-include/arrow/python/ipc.h
-include/arrow/python/iterators.h
-include/arrow/python/numpy_convert.h
-include/arrow/python/numpy_interop.h
-include/arrow/python/numpy_to_arrow.h
-include/arrow/python/parquet_encryption.h
-include/arrow/python/pch.h
-include/arrow/python/platform.h
-include/arrow/python/pyarrow.h
-include/arrow/python/pyarrow_api.h
-include/arrow/python/pyarrow_lib.h
-include/arrow/python/python_to_arrow.h
-include/arrow/python/serialize.h
-include/arrow/python/type_traits.h
-include/arrow/python/udf.h
-include/arrow/python/visibility.h
 include/arrow/record_batch.h
 include/arrow/result.h
 include/arrow/scalar.h
@@ -215,8 +187,8 @@ include/arrow/util/algorithm.h
 include/arrow/util/align_util.h
 include/arrow/util/aligned_storage.h
 include/arrow/util/async_generator.h
+include/arrow/util/async_generator_fwd.h
 include/arrow/util/async_util.h
-include/arrow/util/atomic_shared_ptr.h
 include/arrow/util/base64.h
 include/arrow/util/basic_decimal.h
 include/arrow/util/benchmark_util.h
@@ -243,6 +215,7 @@ include/arrow/util/bpacking_simd256_generated.h
 include/arrow/util/bpacking_simd512_generated.h
 include/arrow/util/byte_size.h
 include/arrow/util/byte_stream_split.h
+include/arrow/util/bytes_view.h
 include/arrow/util/cancel.h
 include/arrow/util/checked_cast.h
 include/arrow/util/compare.h
@@ -271,17 +244,16 @@ include/arrow/util/key_value_metadata.h
 include/arrow/util/launder.h
 include/arrow/util/logging.h
 include/arrow/util/macros.h
-include/arrow/util/make_unique.h
 include/arrow/util/map.h
 include/arrow/util/math_constants.h
 include/arrow/util/memory.h
 include/arrow/util/mutex.h
-include/arrow/util/optional.h
 include/arrow/util/parallel.h
 include/arrow/util/pcg_random.h
 include/arrow/util/print.h
 include/arrow/util/queue.h
 include/arrow/util/range.h
+include/arrow/util/regex.h
 include/arrow/util/rle_encoding.h
 include/arrow/util/simd.h
 include/arrow/util/small_vector.h
@@ -290,7 +262,6 @@ include/arrow/util/spaced.h
 include/arrow/util/stopwatch.h
 include/arrow/util/string.h
 include/arrow/util/string_builder.h
-include/arrow/util/string_view.h
 include/arrow/util/task_group.h
 include/arrow/util/tdigest.h
 include/arrow/util/test_common.h
@@ -305,7 +276,6 @@ include/arrow/util/unreachable.h
 include/arrow/util/uri.h
 include/arrow/util/utf8.h
 include/arrow/util/value_parsing.h
-include/arrow/util/variant.h
 include/arrow/util/vector.h
 include/arrow/util/visibility.h
 include/arrow/util/windows_compatibility.h
@@ -327,13 +297,11 @@ include/arrow/vendored/double-conversion/fixed-dtoa.h
 include/arrow/vendored/double-conversion/ieee.h
 include/arrow/vendored/double-conversion/strtod.h
 include/arrow/vendored/double-conversion/utils.h
-include/arrow/vendored/optional.hpp
 include/arrow/vendored/pcg/pcg_extras.hpp
 include/arrow/vendored/pcg/pcg_random.hpp
 include/arrow/vendored/pcg/pcg_uint128.hpp
 include/arrow/vendored/portable-snippets/debug-trap.h
 include/arrow/vendored/portable-snippets/safe-math.h
-include/arrow/vendored/string_view.hpp
 include/arrow/vendored/strptime.h
 include/arrow/vendored/xxhash.h
 include/arrow/vendored/xxhash/xxhash.h
@@ -382,6 +350,7 @@ include/parquet/level_conversion.h
 include/parquet/level_conversion_inc.h
 include/parquet/metadata.h
 include/parquet/murmur3.h
+include/parquet/page_index.h
 include/parquet/parquet_version.h
 include/parquet/pch.h
 include/parquet/platform.h
@@ -396,63 +365,51 @@ include/parquet/type_fwd.h
 include/parquet/types.h
 include/parquet/windows_compatibility.h
 include/parquet/windows_fixup.h
-lib/cmake/arrow/ArrowConfig.cmake
-lib/cmake/arrow/ArrowConfigVersion.cmake
-lib/cmake/arrow/ArrowDatasetConfig.cmake
-lib/cmake/arrow/ArrowDatasetConfigVersion.cmake
-lib/cmake/arrow/ArrowDatasetTargets-release.cmake
-lib/cmake/arrow/ArrowDatasetTargets.cmake
-lib/cmake/arrow/ArrowOptions.cmake
-lib/cmake/arrow/ArrowPythonConfig.cmake
-lib/cmake/arrow/ArrowPythonConfigVersion.cmake
-lib/cmake/arrow/ArrowPythonTargets-release.cmake
-lib/cmake/arrow/ArrowPythonTargets.cmake
-lib/cmake/arrow/ArrowTargets-release.cmake
-lib/cmake/arrow/ArrowTargets.cmake
-lib/cmake/arrow/FindArrow.cmake
-lib/cmake/arrow/FindArrowDataset.cmake
-lib/cmake/arrow/FindArrowPython.cmake
-lib/cmake/arrow/FindBrotli.cmake
-lib/cmake/arrow/FindGLOG.cmake
-lib/cmake/arrow/FindParquet.cmake
-lib/cmake/arrow/FindSnappyAlt.cmake
-lib/cmake/arrow/FindThrift.cmake
-lib/cmake/arrow/Findlz4Alt.cmake
-lib/cmake/arrow/Findre2Alt.cmake
-lib/cmake/arrow/Findutf8proc.cmake
-lib/cmake/arrow/Findzstd.cmake
-lib/cmake/arrow/ParquetConfig.cmake
-lib/cmake/arrow/ParquetConfigVersion.cmake
-lib/cmake/arrow/ParquetTargets-release.cmake
-lib/cmake/arrow/ParquetTargets.cmake
-lib/cmake/arrow/arrow-config.cmake
+lib/cmake/Arrow/ArrowConfig.cmake
+lib/cmake/Arrow/ArrowConfigVersion.cmake
+lib/cmake/Arrow/ArrowOptions.cmake
+lib/cmake/Arrow/ArrowTargets-release.cmake
+lib/cmake/Arrow/ArrowTargets.cmake
+lib/cmake/Arrow/FindBrotliAlt.cmake
+lib/cmake/Arrow/FindGLOG.cmake
+lib/cmake/Arrow/FindSnappyAlt.cmake
+lib/cmake/Arrow/FindThriftAlt.cmake
+lib/cmake/Arrow/Findlz4Alt.cmake
+lib/cmake/Arrow/Findre2Alt.cmake
+lib/cmake/Arrow/Findutf8proc.cmake
+lib/cmake/Arrow/FindzstdAlt.cmake
+lib/cmake/Arrow/arrow-config.cmake
+lib/cmake/ArrowDataset/ArrowDatasetConfig.cmake
+lib/cmake/ArrowDataset/ArrowDatasetConfigVersion.cmake
+lib/cmake/ArrowDataset/ArrowDatasetTargets-release.cmake
+lib/cmake/ArrowDataset/ArrowDatasetTargets.cmake
+lib/cmake/Parquet/ParquetConfig.cmake
+lib/cmake/Parquet/ParquetConfigVersion.cmake
+lib/cmake/Parquet/ParquetTargets-release.cmake
+lib/cmake/Parquet/ParquetTargets.cmake
 lib/libarrow.a
 lib/libarrow.so
-lib/libarrow.so.900
-lib/libarrow.so.900.0.0
+lib/libarrow.so.1100
+lib/libarrow.so.1100.0.0
 lib/libarrow_bundled_dependencies.a
 lib/libarrow_dataset.a
 lib/libarrow_dataset.so
-lib/libarrow_dataset.so.900
-lib/libarrow_dataset.so.900.0.0
-lib/libarrow_python.a
-lib/libarrow_python.so
-lib/libarrow_python.so.900
-lib/libarrow_python.so.900.0.0
+lib/libarrow_dataset.so.1100
+lib/libarrow_dataset.so.1100.0.0
 lib/libparquet.a
 lib/libparquet.so
-lib/libparquet.so.900
-lib/libparquet.so.900.0.0
+lib/libparquet.so.1100
+lib/libparquet.so.1100.0.0
 lib/pkgconfig/arrow-compute.pc
 lib/pkgconfig/arrow-csv.pc
 lib/pkgconfig/arrow-dataset.pc
 lib/pkgconfig/arrow-filesystem.pc
 lib/pkgconfig/arrow-json.pc
-lib/pkgconfig/arrow-python.pc
 lib/pkgconfig/arrow.pc
 lib/pkgconfig/parquet.pc
 share/arrow/gdb/gdb_arrow.py
-share/arrow/gdb/libarrow.so.900.0.0-gdb.py
+share/arrow/gdb/libarrow.so.1100.0.0-gdb.py
 share/doc/arrow/LICENSE.txt
 share/doc/arrow/NOTICE.txt
 share/doc/arrow/README.md
+@pkgdir share/gdb/auto-load/home/matthew/pkgsrc/install.20221225/lib
diff --git a/apache-arrow/distinfo b/apache-arrow/distinfo
index 043ad6e52d..087e12b5f1 100644
--- a/apache-arrow/distinfo
+++ b/apache-arrow/distinfo
@@ -1,11 +1,11 @@
 $NetBSD$
 
-BLAKE2s (8.1.0.tar.gz) = 37f9e9f450b26a016e97a460f8fc9b304867c36d6a2008999b8df35d67f88f02
-SHA512 (8.1.0.tar.gz) = 539f7b565b45e8225c6476ca1becc8243a84ae7fb51b45a584231e7d36aee10a09d7d30fb87d89cb77813fb063a7b7617bcf01fdf996f59d99e8d474d2a044ee
-Size (8.1.0.tar.gz) = 190180 bytes
-BLAKE2s (apache-arrow-9.0.0.tar.gz) = e06c46febd9849343357b449d8f85a6375825dd47431ede47f1ba936fa87a93d
-SHA512 (apache-arrow-9.0.0.tar.gz) = 6eb14c766e4542e110a2d6748cea19a71d7035b8a3fda095efe36b5f02c134abc8c5bbcc9cd72dbc40ae1e87c28399a116501d983b5501b9b277fbcbfa09d7e5
-Size (apache-arrow-9.0.0.tar.gz) = 18314915 bytes
+BLAKE2s (9.0.1.tar.gz) = a785e1ad5fd5df76c95e7cf9a6eadeb86ffbc46ea4342f49f19381434bd0f78c
+SHA512 (9.0.1.tar.gz) = ed56287f608ccdf5bc5d5fc2918e313e7c4cecdd9ef2c9993a72ea900d9ff662c57ac5326c7a809eb11505c6f39d4599f3f161b97b6e03c65783b824b8d700d2
+Size (9.0.1.tar.gz) = 215065 bytes
+BLAKE2s (apache-arrow-11.0.0.tar.gz) = ee159d17a2ce4daea917a054e5f3faed538042fe80f4bb7bea0246adeb61f9f8
+SHA512 (apache-arrow-11.0.0.tar.gz) = 46df4fb5a703d38d0a74fde9838e9f9702b24b442cb225517516c335a5ab18955699000bf0b2fc7d1698ada6d2e890ba3860933b6280f5160b0fce8a07484d0e
+Size (apache-arrow-11.0.0.tar.gz) = 19956186 bytes
 BLAKE2s (jemalloc-5.3.0.tar.bz2) = 285e6145b9d3b575b1ec5cfdae8af40b461149085f001839d64685c0d56e2689
 SHA512 (jemalloc-5.3.0.tar.bz2) = 22907bb052096e2caffb6e4e23548aecc5cc9283dce476896a2b1127eee64170e3562fa2e7db9571298814a7a2c7df6e8d1fbe152bd3f3b0c1abec22a2de34b1
 Size (jemalloc-5.3.0.tar.bz2) = 736023 bytes
diff --git a/apache-arrow/version.mk b/apache-arrow/version.mk
index f75d6721ae..a99e65faaa 100644
--- a/apache-arrow/version.mk
+++ b/apache-arrow/version.mk
@@ -1,2 +1,2 @@
 # $NetBSD$
-APACHE_ARROW_VERSION=	9.0.0
+APACHE_ARROW_VERSION=	11.0.0
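
The Makefile hunk above replaces the hard-coded "900" with ${APACHE_ARROW_VERSION:S/.//g}. The make :S modifier does literal (non-regex) substitution, so with the g flag it strips every dot from the version string. A minimal Python sketch of the same transformation follows; the function names are hypothetical and only mirror what the modifier does for the two versions seen in this commit:

```python
def soname_suffix(version: str) -> str:
    """Mimic ${APACHE_ARROW_VERSION:S/.//g}: remove every literal "."."""
    return version.replace(".", "")


def gdb_script_name(version: str) -> str:
    """Build the gdb auto-load file name that post-install moves."""
    return f"libarrow.so.{soname_suffix(version)}.0.0-gdb.py"
```

For 9.0.0 and 11.0.0 this gives the names listed in the old and new PLIST entries. The commit does not show whether the substitution matches upstream's naming for releases with a non-zero minor or patch component, so that case is left open here.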

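The distinfo hunk records a BLAKE2s digest, a SHA512 digest and a size for each distfile. As a hedged sketch of how such a line could be checked by hand, the following uses only Python's hashlib; the helper names are hypothetical, and this is not the tool pkgsrc itself uses for verification:

```python
import hashlib
import re

# Matches distinfo lines such as "SHA512 (file.tar.gz) = <hex digest>".
_LINE = re.compile(
    r"^(?P<algo>BLAKE2s|SHA512) \((?P<name>[^)]+)\) = (?P<digest>[0-9a-f]+)$"
)

# Digest constructors for the two algorithms seen in this distinfo.
_ALGORITHMS = {"BLAKE2s": hashlib.blake2s, "SHA512": hashlib.sha512}


def parse_distinfo_line(line):
    """Return (algorithm, file name, hex digest), or None for other lines."""
    m = _LINE.match(line.strip())
    if m is None:
        return None
    return m.group("algo"), m.group("name"), m.group("digest")


def digest_matches(line, data):
    """Check the bytes of a distfile against one distinfo digest line."""
    parsed = parse_distinfo_line(line)
    if parsed is None:
        raise ValueError("not a digest line: %r" % line)
    algo, _name, expected = parsed
    return _ALGORITHMS[algo](data).hexdigest() == expected
```

In use, data would be the bytes of the downloaded tarball (for example read with open(path, "rb")). The "Size" and "$NetBSD$" lines are deliberately not treated as digest lines.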
