pkgsrc-WIP-changes archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

py-dask: Update to 3.5.1



Module Name:	pkgsrc-wip
Committed By:	Matthew Danielson <matthewd%fastmail.us@localhost>
Pushed By:	matthewd
Date:		Thu Jun 1 15:02:09 2023 -0700
Changeset:	849f7727d7db3e700ec3641e4c881d77627632d1

Modified Files:
	py-dask/Makefile
	py-dask/PLIST
	py-dask/distinfo

Log Message:
py-dask: Update to 3.5.1

2023.5.1
Released on May 26, 2023
Note This release drops support for Python 3.8. As of this release Dask supports Python 3.9, 3.10, and 3.11. See this community issue for more details.
Enhancements
    Drop Python 3.8 support (GH#10295) Thomas Grainger
    Change Dask Bag partitioning scheme to improve cluster saturation (GH#10294) Jacob Tomlinson
    Generalize dd.to_datetime for GPU-backed collections, introduce get_meta_library utility (GH#9881) Charles Blackmon-Luca
    Add na_action to DataFrame.map (GH#10305) Patrick Hoefler
    Raise TypeError in DataFrame.nsmallest and DataFrame.nlargest when columns is not given (GH#10301) Patrick Hoefler
    Improve sizeof for pd.MultiIndex (GH#10230) Patrick Hoefler
    Support duplicated columns in a bunch of DataFrame methods (GH#10261) Patrick Hoefler
    Add numeric_only support to DataFrame.idxmin and DataFrame.idxmax (GH#10253) Patrick Hoefler
    Implement numeric_only support for DataFrame.quantile (GH#10259) Patrick Hoefler
    Add support for numeric_only=False in DataFrame.std (GH#10251) Patrick Hoefler
    Implement numeric_only=False for GroupBy.cumprod and GroupBy.cumsum (GH#10262) Patrick Hoefler
    Implement numeric_only for skew and kurtosis (GH#10258) Patrick Hoefler
    mask and where should accept a callable (GH#10289) Irina Truong
    Fix conversion from Categorical to pa.dictionary in read_parquet (GH#10285) Patrick Hoefler
Bug Fixes
    Spurious config on nested annotations (GH#10318) crusaderky
    Fix rechunking behavior for dimensions with known and unknown chunk sizes (GH#10157) Hendrik Makait
    Enable drop to support mismatched partitions (GH#10300) James Bourbeau
    Fix divisions construction for to_timestamp (GH#10304) Patrick Hoefler
    pandas ExtensionDtype raising in Series reduction operations (GH#10149) Patrick Hoefler
    Fix regression in da.random interface (GH#10247) Eray Aslan
    da.coarsen doesn’t trim an empty chunk in meta (GH#10281) Irina Truong
    Fix dtype inference for engine="pyarrow" in read_csv (GH#10280) Patrick Hoefler
Documentation
    Add meta_from_array to API docs (GH#10306) Ruth Comer
    Update Coiled links (GH#10296) Sarah Charlotte Johnson
    Add docs for demo day (GH#10288) Matthew Rocklin
Maintenance
    Explicitly install anaconda-client from conda-forge when uploading conda nightlies (GH#10316) Charles Blackmon-Luca
    Configure isort to add from __future__ import annotations (GH#10314) Thomas Grainger
    Avoid pandas Series.__getitem__ deprecation in tests (GH#10308) James Bourbeau
    Ignore numpy.find_common_type warning from pandas (GH#10307) James Bourbeau
    Add test to check that DataFrame.__setitem__ does not modify df inplace (GH#10223) Patrick Hoefler
    Clean up default value of dropna in value_counts (GH#10299) Patrick Hoefler
    Add pytest-cov to test extra (GH#10271) James Bourbeau

2023.5.0
Released on May 12, 2023
Enhancements
    Implement numeric_only=False for GroupBy.corr and GroupBy.cov (GH#10264) Patrick Hoefler
    Add support for numeric_only=False in DataFrame.var (GH#10250) Patrick Hoefler
    Add numeric_only support to DataFrame.mode (GH#10257) Patrick Hoefler
    Add DataFrame.map to dask.DataFrame API (GH#10246) Patrick Hoefler
    Adjust for DataFrame.applymap deprecation and all NA concat behaviour change (GH#10245) Patrick Hoefler
    Enable numeric_only=False for DataFrame.count (GH#10234) Patrick Hoefler
    Disallow array input in mask/where (GH#10163) Irina Truong
    Support numeric_only=True in GroupBy.corr and GroupBy.cov (GH#10227) Patrick Hoefler
    Add numeric_only support to GroupBy.median (GH#10236) Patrick Hoefler
    Support mimesis=9 in dask.datasets (GH#10241) James Bourbeau
    Add numeric_only support to min, max and prod (GH#10219) Patrick Hoefler
    Add numeric_only=True support for GroupBy.cumsum and GroupBy.cumprod (GH#10224) Patrick Hoefler
    Add helper to unpack numeric_only keyword (GH#10228) Patrick Hoefler
Bug Fixes
    Fix clone + from_array failure (GH#10211) crusaderky
    Fix dataframe reductions for ea dtypes (GH#10150) Patrick Hoefler
    Avoid scalar conversion deprecation warning in numpy=1.25 (GH#10248) James Bourbeau
    Make sure transform output has the same index as input (GH#10184) Irina Truong
    Fix corr and cov on a single-row partition (GH#9756) Irina Truong
    Fix test_groupby_numeric_only_supported and test_groupby_aggregate_categorical_observed upstream errors (GH#10243) Irina Truong
Documentation
    Clean up futures docs (GH#10266) Matthew Rocklin
    Add Index API reference (GH#10263) hotpotato
Maintenance
    Warn when meta is passed to apply (GH#10256) Patrick Hoefler
    Remove imageio version restriction in CI (GH#10260) Patrick Hoefler
    Remove unused DataFrame variance methods (GH#10252) Patrick Hoefler
    Un-xfail test_categories with pyarrow strings and pyarrow>=12 (GH#10244) Irina Truong
    Bump gpuCI PYTHON_VER 3.8->3.9 (GH#10233) Charles Blackmon-Luca

2023.4.1
Released on April 28, 2023
Enhancements
    Implement numeric_only support for DataFrame.sum (GH#10194) Patrick Hoefler
    Add support for numeric_only=True in GroupBy operations (GH#10222) Patrick Hoefler
    Avoid deep copy in DataFrame.__setitem__ for pandas 1.4 and up (GH#10221) Patrick Hoefler
    Avoid calling Series.apply with _meta_nonempty (GH#10212) Patrick Hoefler
    Unpin sqlalchemy and fix compatibility issues (GH#10140) Patrick Hoefler
Bug Fixes
    Partially revert default client discovery (GH#10225) Florian Jetter
    Support arrow dtypes in Index meta creation (GH#10170) Patrick Hoefler
    Repartitioning raises with extension dtype when truncating floats (GH#10169) Patrick Hoefler
    Adjust empty Index from fastparquet to object dtype (GH#10179) Patrick Hoefler
Documentation
    Update Kubernetes docs (GH#10232) Jacob Tomlinson
    Add DataFrame.reduction to API docs (GH#10229) James Bourbeau
    Add DataFrame.persist to docs and fix links (GH#10231) Patrick Hoefler
    Add documentation for GroupBy.transform (GH#10185) Irina Truong
    Fix formatting in random number generation docs (GH#10189) Eray Aslan
Maintenance
    Pin imageio to <2.28 (GH#10216) Patrick Hoefler
    Add note about importlib_metadata backport (GH#10207) James Bourbeau
    Add xarray back to Python 3.11 CI builds (GH#10200) James Bourbeau
    Add mindeps build with all optional dependencies (GH#10161) Charles Blackmon-Luca
    Provide proper like value for array_safe in percentiles_summary (GH#10156) Charles Blackmon-Luca
    Avoid re-opening hdf file multiple times in read_hdf (GH#10205) Thomas Grainger
    Add merge tests on nullable columns (GH#10071) Charles Blackmon-Luca
    Fix coverage configuration (GH#10203) Thomas Grainger
    Remove is_period_dtype and is_sparse_dtype (GH#10197) Patrick Hoefler
    Bump actions/checkout from 3.5.0 to 3.5.2 (GH#10201)
    Avoid deprecated is_categorical_dtype from pandas (GH#10180) Patrick Hoefler
    Adjust for deprecated is_interval_dtype and is_datetime64tz_dtype (GH#10188) Patrick Hoefler

2023.4.0
Released on April 14, 2023
Enhancements
    Override old default values in update_defaults (GH#10159) Gabe Joseph
    Add a CLI command to list and get a value from dask config (GH#9936) Irina Truong
    Handle string-based engine argument to read_json (GH#9947) Richard (Rick) Zamora
    Avoid deprecated GroupBy.dtypes (GH#10111) Irina Truong
Bug Fixes
    Revert grouper-related changes (GH#10182) Irina Truong
    GroupBy.cov raising for non-numeric grouping column (GH#10171) Patrick Hoefler
    Updates for Index supporting numpy numeric dtypes (GH#10154) Irina Truong
    Preserve dtype for partitioning columns when read with pyarrow (GH#10115) Patrick Hoefler
    Fix annotations for to_hdf (GH#10123) Hendrik Makait
    Handle None column name when checking if columns are all numeric (GH#10128) Lawrence Mitchell
    Fix valid_divisions when passed a tuple (GH#10126) Brian Phillips
    Maintain annotations in DataFrame.categorize (GH#10120) Hendrik Makait
    Fix handling of missing min/max parquet statistics during filtering (GH#10042) Richard (Rick) Zamora
Deprecations
    Deprecate use_nullable_dtypes= and add dtype_backend= (GH#10076) Irina Truong
    Deprecate convert_dtype in Series.apply (GH#10133) Irina Truong
Documentation
    Document Generator based random number generation (GH#10134) Eray Aslan
Maintenance
    Update dataframe.convert_string to dataframe.convert-string (GH#10191) Irina Truong
    Add python-cityhash to CI environments (GH#10190) Charles Blackmon-Luca
    Temporarily pin scikit-image to fix Windows CI (GH#10186) Patrick Hoefler
    Handle pandas deprecation warnings for to_pydatetime and apply (GH#10168) Patrick Hoefler
    Drop bokeh<3 restriction (GH#10177) James Bourbeau
    Fix failing tests under copy-on-write (GH#10173) Patrick Hoefler
    Allow pyarrow CI to fail (GH#10176) James Bourbeau
    Switch to Generator for random number generation in dask.array (GH#10003) Eray Aslan
    Bump peter-evans/create-pull-request from 4 to 5 (GH#10166)
    Fix flaky modf operation in test_arithmetic (GH#10162) Irina Truong
    Temporarily remove xarray from CI with pandas 2.0 (GH#10153) James Bourbeau
    Fix update_graph counting logic in test_default_scheduler_on_worker (GH#10145) James Bourbeau
    Fix documentation build with pandas 2.0 (GH#10138) James Bourbeau
    Remove dask/gpu from gpuCI update reviewers (GH#10135) Charles Blackmon-Luca
    Update gpuCI RAPIDS_VER to 23.06 (GH#10129)
    Bump actions/stale from 6 to 8 (GH#10121)
    Use declarative setuptools (GH#10102) Thomas Grainger
    Relax assert_eq checks on Scalar-like objects (GH#10125) Matthew Rocklin
    Upgrade readthedocs config to ubuntu 22.04 and Python 3.11 (GH#10124) Thomas Grainger
    Bump actions/checkout from 3.4.0 to 3.5.0 (GH#10122)
    Fix test_null_partition_pyarrow in pyarrow CI build (GH#10116) Irina Truong
    Drop distributed pack (GH#9988) Florian Jetter
    Make dask.compatibility private (GH#10114) Jacob Tomlinson

2023.3.2
Released on March 24, 2023
Enhancements
    Deprecate observed=False for groupby with categoricals (GH#10095) Irina Truong
    Deprecate axis= for some groupby operations (GH#10094) James Bourbeau
    The axis keyword in DataFrame.rolling/Series.rolling is deprecated (GH#10110) Irina Truong
    DataFrame._data deprecation in pandas (GH#10081) Irina Truong
    Use importlib_metadata backport to avoid CLI UserWarning (GH#10070) Thomas Grainger
    Port option parsing logic from dask.dataframe.read_parquet to to_parquet (GH#9981) Anton Loukianov
Bug Fixes
    Avoid using dd.shuffle in groupby-apply (GH#10043) Richard (Rick) Zamora
    Enable null hive partitions with pyarrow parquet engine (GH#10007) Richard (Rick) Zamora
    Support unknown shapes in *_like functions (GH#10064) Doug Davis
Documentation
    Add to_backend methods to API docs (GH#10093) Lawrence Mitchell
    Remove broken gpuCI link in developer docs (GH#10065) Charles Blackmon-Luca
Maintenance
    Configure readthedocs sphinx warnings as errors (GH#10104) Thomas Grainger
    Un-xfail test_division_or_partition with pyarrow strings active (GH#10108) Irina Truong
    Un-xfail test_different_columns_are_allowed with pyarrow strings active (GH#10109) Irina Truong
    Restore Entrypoints compatibility (GH#10113) Jacob Tomlinson
    Un-xfail test_to_dataframe_optimize_graph with pyarrow strings active (GH#10087) Irina Truong
    Only run test_development_guidelines_matches_ci on editable install (GH#10106) Charles Blackmon-Luca
    Un-xfail test_dataframe_cull_key_dependencies_materialized with pyarrow strings active (GH#10088) Irina Truong
    Install mimesis in CI environments (GH#10105) Charles Blackmon-Luca
    Fix for no module named ipykernel (GH#10101) Irina Truong
    Fix docs builds by installing ipykernel (GH#10103) Thomas Grainger
    Allow pyarrow build to continue on failures (GH#10097) James Bourbeau
    Bump actions/checkout from 3.3.0 to 3.4.0 (GH#10096)
    Fix test_set_index_on_empty with pyarrow strings active (GH#10054) Irina Truong
    Un-xfail pyarrow pickling tests (GH#10082) James Bourbeau
    CI environment file cleanup (GH#10078) James Bourbeau
    Un-xfail more pyarrow tests (GH#10066) Irina Truong
    Temporarily skip pyarrow_compat tests with p`andas 2.0 (GH#10063) James Bourbeau
    Fix test_melt with pyarrow strings active (GH#10052) Irina Truong
    Fix test_str_accessor with pyarrow strings active (GH#10048) James Bourbeau
    Fix test_better_errors_object_reductions with pyarrow strings active (GH#10051) James Bourbeau
    Fix test_loc_with_non_boolean_series with pyarrow strings active (GH#10046) James Bourbeau
    Fix test_values with pyarrow strings active (GH#10050) James Bourbeau
    Temporarily xfail test_upstream_packages_installed (GH#10047) James Bourbeau

To see a diff of this commit:
https://wip.pkgsrc.org/cgi-bin/gitweb.cgi?p=pkgsrc-wip.git;a=commitdiff;h=849f7727d7db3e700ec3641e4c881d77627632d1

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.

diffstat:
 py-dask/Makefile | 8 ++++++--
 py-dask/PLIST    | 4 ++++
 py-dask/distinfo | 6 +++---
 3 files changed, 13 insertions(+), 5 deletions(-)

diffs:
diff --git a/py-dask/Makefile b/py-dask/Makefile
index 64f072d937..26aaf28709 100644
--- a/py-dask/Makefile
+++ b/py-dask/Makefile
@@ -1,6 +1,6 @@
 # $NetBSD$
 
-GITHUB_TAG=	2023.3.1
+GITHUB_TAG=	2023.5.1
 DISTNAME=	dask-${GITHUB_TAG}
 PKGNAME=	${PYPKGPREFIX}-${DISTNAME}
 GITHUB_PROJECT=	dask
@@ -16,14 +16,17 @@ LICENSE=	modified-bsd
 .include "../../mk/bsd.prefs.mk"
 .include "options.mk"
 
+PYTHON_VERSIONS_INCOMPATIBLE=	27 37 38
+
 BUILD_DEPENDS+=       ${PYPKGPREFIX}-wheel>=0:../../devel/py-wheel
+BUILD_DEPENDS+=       ${PYPKGPREFIX}-versioneer>=0.28:../../devel/py-versioneer
 
 DEPENDS+=	${PYPKGPREFIX}-toolz>=0.8.2:../../devel/py-toolz
 DEPENDS+=	${PYPKGPREFIX}-numpy>=1.18:../../math/py-numpy
 DEPENDS+=	${PYPKGPREFIX}-pandas>=1.0:../../math/py-pandas
 DEPENDS+=	${PYPKGPREFIX}-partd>=0.3.10:../../wip/py-partd
 DEPENDS+=	${PYPKGPREFIX}-yaml>=5.3.1:../../textproc/py-yaml
-DEPENDS+=	${PYPKGPREFIX}-distributed>=2022.12.0:../../wip/py-distributed
+DEPENDS+=	${PYPKGPREFIX}-distributed>=2023.5.1:../../wip/py-distributed
 DEPENDS+=	${PYPKGPREFIX}-cloudpickle>=1.1.1:../../wip/py-cloudpickle
 DEPENDS+=	${PYPKGPREFIX}-fsspec>=2022.7.1:../../sysutils/py-fsspec
 DEPENDS+=	${PYPKGPREFIX}-zict>=2.1.0:../../wip/py-zict
@@ -34,6 +37,7 @@ TEST_DEPENDS+=	${PYPKGPREFIX}-test-[0-9]*:../../devel/py-test
 TEST_DEPENDS+=	${PYPKGPREFIX}-test-rerunfailures-[0-9]*:../../devel/py-test-rerunfailures
 TEST_DEPENDS+=	${PYPKGPREFIX}-test-xdist-[0-9]*:../../devel/py-test-xdist
 TEST_DEPENDS+=	${PYPKGPREFIX}-multipledispatch>=0.6.0*:../../devel/py-multipledispatch
+TEST_DEPENDS+=	${PYPKGPREFIX}-importlib-metadata>=6.6.0*:../../devel/py-importlib-metadata
 TEST_DEPENDS+=	${PYPKGPREFIX}-pre-commit-[0-9]*:../../wip/py-pre-commit
 # TEST_DEPENDS+=	${PYPKGPREFIX}-s3fs>=2022*:../../wip/py-s3fs
 # TEST_DEPENDS+=	${PYPKGPREFIX}-sparse>=0.11.2:../../wip/py-sparse
diff --git a/py-dask/PLIST b/py-dask/PLIST
index 4a11846584..64051b7acc 100644
--- a/py-dask/PLIST
+++ b/py-dask/PLIST
@@ -14,6 +14,8 @@ ${PYSITELIB}/dask/__init__.py
 ${PYSITELIB}/dask/__init__.pyc
 ${PYSITELIB}/dask/__main__.py
 ${PYSITELIB}/dask/__main__.pyc
+${PYSITELIB}/dask/_compatibility.py
+${PYSITELIB}/dask/_compatibility.pyc
 ${PYSITELIB}/dask/_version.py
 ${PYSITELIB}/dask/_version.pyc
 ${PYSITELIB}/dask/array/NUMPY_LICENSE.txt
@@ -351,6 +353,8 @@ ${PYSITELIB}/dask/dataframe/tests/test_indexing.py
 ${PYSITELIB}/dask/dataframe/tests/test_indexing.pyc
 ${PYSITELIB}/dask/dataframe/tests/test_merge_column_and_index.py
 ${PYSITELIB}/dask/dataframe/tests/test_merge_column_and_index.pyc
+${PYSITELIB}/dask/dataframe/tests/test_methods.py
+${PYSITELIB}/dask/dataframe/tests/test_methods.pyc
 ${PYSITELIB}/dask/dataframe/tests/test_multi.py
 ${PYSITELIB}/dask/dataframe/tests/test_multi.pyc
 ${PYSITELIB}/dask/dataframe/tests/test_numeric.py
diff --git a/py-dask/distinfo b/py-dask/distinfo
index 0d4c7a8572..f3ce0f894e 100644
--- a/py-dask/distinfo
+++ b/py-dask/distinfo
@@ -1,5 +1,5 @@
 $NetBSD$
 
-BLAKE2s (dask-2023.3.1.tar.gz) = 06057d863ee083daae47e20ab7fc979b02da60525f801768a72fcb6d290ecc35
-SHA512 (dask-2023.3.1.tar.gz) = e4b8306a900453b27bc19055b5a5bfd0d0ed2dc143be854d8f056ede8b1e4bdc5e91a54c674201b1b1f27cbc0c9ba908e5a82a12f52d337c3cb23cbc2e6ef199
-Size (dask-2023.3.1.tar.gz) = 8520302 bytes
+BLAKE2s (dask-2023.5.1.tar.gz) = 9b43bcfd39089b3966c43f36e495bf33bf3463acc4242de9a270c8746fe63040
+SHA512 (dask-2023.5.1.tar.gz) = 603d0deb0e88cc2ae5a5fdbc4352f454e68f836b7c3a9039ec20f9493b2b4c72684bb5981d6e985de5bb32d73fc48d0faa2834c9c56a04277755ea3c8e77fb75
+Size (dask-2023.5.1.tar.gz) = 8523514 bytes


Home | Main Index | Thread Index | Old Index