pkgsrc-Changes archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

CVS commit: pkgsrc/textproc/R-data.table



Module Name:    pkgsrc
Committed By:   mef
Date:           Sun Jun  6 13:32:51 UTC 2021

Modified Files:
        pkgsrc/textproc/R-data.table: Makefile distinfo

Log Message:
(textproc/R-data.table) updated  1.12.2 to 1.14.0

NEWS.md from 1.12.2 to 1.14.0 has over 1,000 lines, See following
URL for full text:
https://github.com/Rdatatable/data.table/blob/master/NEWS.md

Here is only for 'POTENTIALLY BREAKING CHANGES' in 1.140.0

# data.table [v1.14.0](https://github.com/Rdatatable/data.table/milestone/23?closed=1)

## POTENTIALLY BREAKING CHANGES

1. In v1.13.0 (July 2020) native parsing of datetime was added to
`fread` by Michael Chirico which dramatically improved
performance. Before then datetime was read as type character by
default which was slow. Since v1.13.0, UTC-marked datetime
(e.g. `2020-07-24T10:11:12.134Z` where the final `Z` is present) has
been read automatically as POSIXct and quickly. We provided the
migration option `datatable.old.fread.datetime.character` to revert to
the previous slow character behavior. We also added the `tz=` argument
to control unmarked datetime; i.e. where the `Z` (or equivalent UTC
postfix) is missing in the data. The default `tz=""` reads unmarked
datetime as character as before, slowly. We gave you the ability to
set `tz="UTC"` to turn on the new behavior and read unmarked datetime
as UTC, quickly. R sessions that are running in UTC by setting the TZ
environment variable, as is good practice and common in production,
have also been reading unmarked datetime as UTC since v1.13.0, much
faster. Note 1 of v1.13.0 (below in this file) ended `In addition to
convenience, fread is now significantly faster in the presence of
dates, UTC-marked datetimes, and unmarked datetime when tz="UTC" is
provided.`.

    At `rstudio::global(2021)`, Neal Richardson, Director of
    Engineering at Ursa Labs, compared Arrow CSV performance to
    `data.table` CSV performance, [Bigger Data With Ease Using Apache
    Arrow](https://rstudio.com/resources/rstudioglobal-2021/bigger-data-with-ease-using-apache-arrow/). He
    opened by comparing to `data.table` as his main point. Arrow was
    presented as 3 times faster than `data.table`. He talked at length
    about this result. However, no reproducible code was provided and
    we were not contacted in advance in case we had any comments. He
    mentioned New York Taxi data in his talk which is a dataset known
    to us as containing unmarked
    datetime. [Rebuttal](https://twitter.com/MattDowle/status/1360073970498875394).

    `tz=`'s default is now changed from `""` to `"UTC"`. If you have
    been using `tz=` explicitly then there should be no change. The
    change to read UTC-marked datetime as POSIXct rather than
    character already happened in v1.13.0. The change now is that
    unmarked datetimes are now read as UTC too by default without
    needing to set `tz="UTC"`. None of the 1,017 CRAN packages
    directly using `data.table` are affected. As before, the migration
    option `datatable.old.fread.datetime.character` can still be set
    to TRUE to revert to the old character behavior. This migration
    option is temporary and will be removed in the near future.


To generate a diff of this commit:
cvs rdiff -u -r1.2 -r1.3 pkgsrc/textproc/R-data.table/Makefile
cvs rdiff -u -r1.1 -r1.2 pkgsrc/textproc/R-data.table/distinfo

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.

Modified files:

Index: pkgsrc/textproc/R-data.table/Makefile
diff -u pkgsrc/textproc/R-data.table/Makefile:1.2 pkgsrc/textproc/R-data.table/Makefile:1.3
--- pkgsrc/textproc/R-data.table/Makefile:1.2   Thu Aug  8 19:53:56 2019
+++ pkgsrc/textproc/R-data.table/Makefile       Sun Jun  6 13:32:51 2021
@@ -1,13 +1,18 @@
-# $NetBSD: Makefile,v 1.2 2019/08/08 19:53:56 brook Exp $
+# $NetBSD: Makefile,v 1.3 2021/06/06 13:32:51 mef Exp $
 
 R_PKGNAME=     data.table
-R_PKGVER=      1.12.2
+R_PKGVER=      1.14.0
 CATEGORIES=    textproc
 
 MAINTAINER=    bacon%NetBSD.org@localhost
 COMMENT=       Extension of 'data.frame'
 LICENSE=       mpl-2.0
 
+TEST_DEPENDS+= R-bit64-[0-9]*:../../devel/R-bit64
+TEST_DEPENDS+= R-bit-[0-9]*:../../devel/R-bit
+TEST_DEPENDS+= R-xts-[0-9]*:../../math/R-xts
+TEST_DEPENDS+= R-zoo-[0-9]*:../../math/R-zoo
+
 USE_LANGUAGES= c
 
 .include "../../math/R/Makefile.extension"

Index: pkgsrc/textproc/R-data.table/distinfo
diff -u pkgsrc/textproc/R-data.table/distinfo:1.1 pkgsrc/textproc/R-data.table/distinfo:1.2
--- pkgsrc/textproc/R-data.table/distinfo:1.1   Wed Jul 31 13:43:56 2019
+++ pkgsrc/textproc/R-data.table/distinfo       Sun Jun  6 13:32:51 2021
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.1 2019/07/31 13:43:56 brook Exp $
+$NetBSD: distinfo,v 1.2 2021/06/06 13:32:51 mef Exp $
 
-SHA1 (R/data.table_1.12.2.tar.gz) = 3583fc54515c8419925098061c7a2add9ecccd4c
-RMD160 (R/data.table_1.12.2.tar.gz) = 5f75feea4718479f1d87eb9cededcf292783e000
-SHA512 (R/data.table_1.12.2.tar.gz) = addfddd432cdda8c37959ed08b30ad0d030938d1ea531e299f8398c895d1da00ba409dcc88ae0677c71e060dcd665e484f9562c791b406e02bc8f68bb2de9e2c
-Size (R/data.table_1.12.2.tar.gz) = 3343698 bytes
+SHA1 (R/data.table_1.14.0.tar.gz) = 6ca317410f49abd751905572e943fb3f09790eb0
+RMD160 (R/data.table_1.14.0.tar.gz) = b724e1a468411d678fbe29806b921e8251fdcdbe
+SHA512 (R/data.table_1.14.0.tar.gz) = db90998904270f42cc654faa0b333f8e720e4044c7f2c82f52efbb69e1ed30c5b2c1b034f7a5baff4543151eda7fb33aa454817889ac95a5d17d6aba364396df
+Size (R/data.table_1.14.0.tar.gz) = 5303849 bytes



Home | Main Index | Thread Index | Old Index