pkgsrc-WIP-changes archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

py-scikit-learn: Update to 1.2.2



Module Name:	pkgsrc-wip
Committed By:	Matthew Danielson <matthewd%fastmail.us@localhost>
Pushed By:	matthewd
Date:		Thu Mar 16 04:50:08 2023 -0700
Changeset:	d15bf54c075c2e05082988a9d0a9a799e7c9b5c0

Modified Files:
	py-scikit-learn/Makefile
	py-scikit-learn/PLIST
	py-scikit-learn/distinfo

Log Message:
py-scikit-learn: Update to 1.2.2

Version 1.2.2
March 2023
Changelog
sklearn.base
    Fix When set_output(transform="pandas"), base.TransformerMixin maintains the index if the transform output is already a DataFrame. #25747 by Thomas Fan.
sklearn.calibration
    Fix A deprecation warning is raised when using the base_estimator__ prefix to set parameters of the estimator used in calibration.CalibratedClassifierCV. #25477 by Tim Head.
sklearn.cluster
    Fix Fixed a bug in cluster.BisectingKMeans, preventing fit to randomly fail due to a permutation of the labels when running multiple inits. #25563 by Jérémie du Boisberranger.
sklearn.compose
    Fix Fixes a bug in compose.ColumnTransformer which now supports empty selection of columns when set_output(transform="pandas"). #25570 by Thomas Fan.
sklearn.ensemble
    Fix A deprecation warning is raised when using the base_estimator__ prefix to set parameters of the estimator used in ensemble.AdaBoostClassifier, ensemble.AdaBoostRegressor, ensemble.BaggingClassifier, and ensemble.BaggingRegressor. #25477 by Tim Head.
sklearn.feature_selection
    Fix Fixed a regression where a negative tol would not be accepted any more by feature_selection.SequentialFeatureSelector. #25664 by Jérémie du Boisberranger.
sklearn.inspection
    Fix Raise a more informative error message in inspection.partial_dependence when dealing with mixed data type categories that cannot be sorted by numpy.unique. This problem usually happen when categories are str and missing values are present using np.nan. #25774 by Guillaume Lemaitre.
sklearn.isotonic
    Fix Fixes a bug in isotonic.IsotonicRegression where isotonic.IsotonicRegression.predict would return a pandas DataFrame when the global configuration sets transform_output="pandas". #25500 by Guillaume Lemaitre.
sklearn.preprocessing
    Fix preprocessing.OneHotEncoder.drop_idx_ now properly references the dropped category in the categories_ attribute when there are infrequent categories. #25589 by Thomas Fan.
    Fix preprocessing.OrdinalEncoder now correctly supports encoded_missing_value or unknown_value set to a categories’ cardinality when there is missing values in the training data. #25704 by Thomas Fan.
sklearn.tree
    Fix Fixed a regression in tree.DecisionTreeClassifier, tree.DecisionTreeRegressor, tree.ExtraTreeClassifier and tree.ExtraTreeRegressor where an error was no longer raised in version 1.2 when min_sample_split=1. #25744 by Jérémie du Boisberranger.
sklearn.utils
    Fix Fixes a bug in utils.check_array which now correctly performs non-finite validation with the Array API specification. #25619 by Thomas Fan.
    Fix utils.multiclass.type_of_target can identify pandas nullable data types as classification targets. #25638 by Thomas Fan.

Version 1.2.1
January 2023
Changed models
The following estimators and functions, when fit with the same data and parameters, may produce different models from the previous version. This often occurs due to changes in the modelling logic (bug fixes or enhancements), or in random sampling procedures.
    Fix The fitted components in MiniBatchDictionaryLearning might differ. The online updates of the sufficient statistics now properly take the sizes of the batches into account. #25354 by Jérémie du Boisberranger.
    Fix The categories_ attribute of preprocessing.OneHotEncoder now always contains an array of object`s when using predefined categories that are strings. Predefined categories encoded as bytes will no longer work with `X encoded as strings. #25174 by Tim Head.
Changes impacting all modules
    Fix Support pandas.Int64 dtyped y for classifiers and regressors. #25089 by Tim Head.
    Fix Remove spurious warnings for estimators internally using neighbors search methods. #25129 by Julien Jerphanion.
    Fix Fix a bug where the current configuration was ignored in estimators using n_jobs > 1. This bug was triggered for tasks dispatched by the auxillary thread of joblib as sklearn.get_config used to access an empty thread local configuration instead of the configuration visible from the thread where joblib.Parallel was first called. #25363 by Guillaume Lemaitre.
Changelog
sklearn.base
    Fix Fix a regression in BaseEstimator.__getstate__ that would prevent certain estimators to be pickled when using Python 3.11. #25188 by Benjamin Bossan.
    Fix Inheriting from base.TransformerMixin will only wrap the transform method if the class defines transform itself. #25295 by Thomas Fan.
sklearn.datasets¶
    Fix Fixes an inconsistency in datasets.fetch_openml between liac-arff and pandas parser when a leading space is introduced after the delimiter. The ARFF specs requires to ignore the leading space. #25312 by Guillaume Lemaitre.
    Fix Fixes a bug in datasets.fetch_openml when using parser="pandas" where single quote and backslash escape characters were not properly handled. #25511 by Guillaume Lemaitre.
sklearn.decomposition
    Fix Fixed a bug in decomposition.MiniBatchDictionaryLearning where the online updates of the sufficient statistics where not correct when calling partial_fit on batches of different sizes. #25354 by Jérémie du Boisberranger.
    Fix decomposition.DictionaryLearning better supports readonly NumPy arrays. In particular, it better supports large datasets which are memory-mapped when it is used with coordinate descent algorithms (i.e. when fit_algorithm='cd'). #25172 by Julien Jerphanion.
sklearn.ensemble
    Fix ensemble.RandomForestClassifier, ensemble.RandomForestRegressor ensemble.ExtraTreesClassifier and ensemble.ExtraTreesRegressor now support sparse readonly datasets. #25341 by Julien Jerphanion
sklearn.feature_extraction
    Fix feature_extraction.FeatureHasher raises an informative error when the input is a list of strings. #25094 by Thomas Fan.
sklearn.linear_model
    Fix Fix a regression in linear_model.SGDClassifier and linear_model.SGDRegressor that makes them unusable with the verbose parameter set to a value greater than 0. #25250 by Jérémie Du Boisberranger.
sklearn.manifold
    Fix manifold.TSNE now works correctly when output type is set to pandas #25370 by Tim Head.
sklearn.model_selection
    Fix model_selection.cross_validate with multimetric scoring in case of some failing scorers the non-failing scorers now returns proper scores instead of error_score values. #23101 by András Simon and Thomas Fan.
sklearn.neural_network
    Fix neural_network.MLPClassifier and neural_network.MLPRegressor no longer raise warnings when fitting data with feature names. #24873 by Tim Head.
    Fix Improves error message in neural_network.MLPClassifier and neural_network.MLPRegressor, when early_stopping=True and partial_fit is called. #25694 by Thomas Fan.
sklearn.preprocessing
    Fix preprocessing.FunctionTransformer.inverse_transform correctly supports DataFrames that are all numerical when check_inverse=True. #25274 by Thomas Fan.
    Fix preprocessing.SplineTransformer.get_feature_names_out correctly returns feature names when extrapolations="periodic". #25296 by Thomas Fan.
sklearn.tree
    Fix tree.DecisionTreeClassifier, tree.DecisionTreeRegressor tree.ExtraTreeClassifier and tree.ExtraTreeRegressor now support sparse readonly datasets. #25341 by Julien Jerphanion
sklearn.utils
    Fix Restore utils.check_array’s behaviour for pandas Series of type boolean. The type is maintained, instead of converting to float64. #25147 by Tim Head.
    API Change utils.fixes.delayed is deprecated in 1.2.1 and will be removed in 1.5. Instead, import utils.parallel.delayed and use it in conjunction with the newly introduced utils.parallel.Parallel to ensure proper propagation of the scikit-learn configuration to the workers. #25363 by Guillaume Lemaitre.

To see a diff of this commit:
https://wip.pkgsrc.org/cgi-bin/gitweb.cgi?p=pkgsrc-wip.git;a=commitdiff;h=d15bf54c075c2e05082988a9d0a9a799e7c9b5c0

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.

diffstat:
 py-scikit-learn/Makefile |  2 +-
 py-scikit-learn/PLIST    | 14 ++++++++++++++
 py-scikit-learn/distinfo |  6 +++---
 3 files changed, 18 insertions(+), 4 deletions(-)

diffs:
diff --git a/py-scikit-learn/Makefile b/py-scikit-learn/Makefile
index 148bef2e0d..eb718eb54a 100644
--- a/py-scikit-learn/Makefile
+++ b/py-scikit-learn/Makefile
@@ -1,6 +1,6 @@
 # $NetBSD: Makefile,v 1.18 2022/04/10 00:57:15 gutteridge Exp $
 
-DISTNAME=	scikit-learn-1.2.0
+DISTNAME=	scikit-learn-1.2.2
 PKGNAME=	${PYPKGPREFIX}-${DISTNAME}
 CATEGORIES=	math python
 MASTER_SITES=	${MASTER_SITE_PYPI:=s/scikit-learn/}
diff --git a/py-scikit-learn/PLIST b/py-scikit-learn/PLIST
index 2223f4d5f5..76db074203 100644
--- a/py-scikit-learn/PLIST
+++ b/py-scikit-learn/PLIST
@@ -230,6 +230,12 @@ ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1119/api-v1-jdl-dn-adult-cens
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1119/api-v1-jdl-dn-adult-census-l-2-s-act-.json.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1119/api-v1-jdq-1119.json.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1119/data-v1-dl-54002.arff.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/__init__.py
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/__init__.pyc
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/api-v1-jd-1590.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/api-v1-jdf-1590.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/api-v1-jdq-1590.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_1590/data-v1-dl-1595261.arff.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_2/__init__.py
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_2/__init__.pyc
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_2/api-v1-jd-2.json.gz
@@ -285,6 +291,12 @@ ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_40966/api-v1-jdl-dn-miceprote
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_40966/api-v1-jdl-dn-miceprotein-l-2-s-act-.json.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_40966/api-v1-jdq-40966.json.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_40966/data-v1-dl-17928620.arff.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/__init__.py
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/__init__.pyc
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/api-v1-jd-42074.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/api-v1-jdf-42074.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/api-v1-jdq-42074.json.gz
+${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42074/data-v1-dl-21552912.arff.gz
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42585/__init__.py
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42585/__init__.pyc
 ${PYSITELIB}/sklearn/datasets/tests/data/openml/id_42585/api-v1-jd-42585.json.gz
@@ -1186,6 +1198,8 @@ ${PYSITELIB}/sklearn/utils/murmurhash.pxd
 ${PYSITELIB}/sklearn/utils/murmurhash.so
 ${PYSITELIB}/sklearn/utils/optimize.py
 ${PYSITELIB}/sklearn/utils/optimize.pyc
+${PYSITELIB}/sklearn/utils/parallel.py
+${PYSITELIB}/sklearn/utils/parallel.pyc
 ${PYSITELIB}/sklearn/utils/random.py
 ${PYSITELIB}/sklearn/utils/random.pyc
 ${PYSITELIB}/sklearn/utils/sparsefuncs.py
diff --git a/py-scikit-learn/distinfo b/py-scikit-learn/distinfo
index 888cabfabd..473707898f 100644
--- a/py-scikit-learn/distinfo
+++ b/py-scikit-learn/distinfo
@@ -1,5 +1,5 @@
 $NetBSD: distinfo,v 1.9 2021/10/26 10:56:04 nia Exp $
 
-BLAKE2s (scikit-learn-1.2.0.tar.gz) = ca5c2755c852a32c37f9fc89bf87fe088d157fd14628456f587b78e4e81f7523
-SHA512 (scikit-learn-1.2.0.tar.gz) = 37b26c6e5eb0cb186e823862ae2e417fc4c1221934ffaa47fe0df24a808cce8e2b16a2521902a24566b1a45670b1d967a34ed0887d3aa35d1fe0f0f8a8ca0d44
-Size (scikit-learn-1.2.0.tar.gz) = 7208996 bytes
+BLAKE2s (scikit-learn-1.2.2.tar.gz) = 4a5c5f003b2d60739eaf67a27c4dd977e0db244ea74a912fad3dba0d56f69269
+SHA512 (scikit-learn-1.2.2.tar.gz) = 73ebcd49f49607cefbd4c2200e9379ab1b1277067d0a10c09d80e3969d4924506ef90d52ad3173cf1e05268ad4c7812218b2e955798ac123bb078ff08330309e
+Size (scikit-learn-1.2.2.tar.gz) = 7269934 bytes


Home | Main Index | Thread Index | Old Index