pkgsrc-Changes archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

CVS commit: pkgsrc/math/py-pandas



Module Name:    pkgsrc
Committed By:   bad
Date:           Sat Feb 16 00:02:19 UTC 2013

Modified Files:
        pkgsrc/math/py-pandas: Makefile distinfo

Log Message:
Update pandas to 0.10.1.

Release date: 2013-01-22

New features:

        Add data inferface to World Bank WDI pandas.io.wb (GH2592)

API Changes:

        Restored inplace=True behavior returning self (same object) with
          deprecation warning until 0.11 (GH1893)
        HDFStore
            refactored HFDStore to deal with non-table stores as objects, will
              allow future enhancements
            removed keyword compression from put (replaced by keyword complib
              to be consistent across library)
            warn PerformanceWarning if you are attempting to store types that
              will be pickled by PyTables

Improvements to existing features:

        HDFStore
            enables storing of multi-index dataframes (closes GH1277)
            support data column indexing and selection, via data_columns
              keyword in append
            support write chunking to reduce memory footprint, via chunksize
              keyword to append
            support automagic indexing via index keyword to append
            support expectedrows keyword in append to inform PyTables about
              the expected tablesize
            support start and stop keywords in select to limit the row
              selection space
            added get_store context manager to automatically import with pandas
            added column filtering via columns keyword in select
            added methods append_to_multiple/select_as_multiple/
              select_as_coordinates to do multiple-table append/selection
            added support for datetime64 in columns
            added method unique to select the unique values in an indexable
              or data column
            added method copy to copy an existing store (and possibly upgrade)
            show the shape of the data on disk for non-table stores when
              printing the store
            added ability to read PyTables flavor tables (allows compatiblity
              to other HDF5 systems)
        Add logx option to DataFrame/Series.plot (GH2327, GH2565)
        Support reading gzipped data from file-like object
        pivot_table aggfunc can be anything used in GroupBy.aggregate (GH2643)
        Implement DataFrame merges in case where set cardinalities might
          overflow 64-bit integer (GH2690)
        Raise exception in C file parser if integer dtype specified and have
          NA values. (GH2631)
        Attempt to parse ISO8601 format dates when parse_dates=True in read_csv
          for major performance boost in such cases (GH2698)
        Add methods neg and inv to Series
        Implement kind option in ExcelFile to indicate whether it's an XLS
          or XLSX file (GH2613)

Bug fixes:

        Fix read_csv/read_table multithreading issues (GH2608)
        HDFStore
            correctly handle nan elements in string columns; serialize via the
              nan_rep keyword to append
            raise correctly on non-implemented column types (unicode/date)
            handle correctly Term passed types (e.g. index<1000, when index is
              Int64), (closes GH512)
            handle Timestamp correctly in data_columns (closes GH2637)
            contains correctly matches on non-natural names
            correctly store float32 dtypes in tables (if not other float types
              in the same table)
        Fix DataFrame.info bug with UTF8-encoded columns. (GH2576)
        Fix DatetimeIndex handling of FixedOffset tz (GH2604)
        More robust detection of being in IPython session for wide DataFrame
          console formatting (GH2585)
        Fix platform issues with file:/// in unit test (GH2564)
        Fix bug and possible segfault when grouping by hierarchical level that
          contains NA values (GH2616)
        Ensure that MultiIndex tuples can be constructed with NAs (GH2616)
        Fix int64 overflow issue when unstacking MultiIndex with many levels
          (GH2616)
        Exclude non-numeric data from DataFrame.quantile by default (GH2625)
        Fix a Cython C int64 boxing issue causing read_csv to return incorrect
          results (GH2599)
        Fix groupby summing performance issue on boolean data (GH2692)
        Don't bork Series containing datetime64 values with to_datetime (GH2699)
        Fix DataFrame.from_records corner case when passed columns, index
          column, but empty record list (GH2633)
        Fix C parser-tokenizer bug with trailing fields. (GH2668)
        Don't exclude non-numeric data from GroupBy.max/min (GH2700)
        Don't lose time zone when calling DatetimeIndex.drop (GH2621)
        Fix setitem on a Series with a boolean key and a non-scalar as value
          (GH2686)
        Box datetime64 values in Series.apply/map (GH2627, GH2689)
        Upconvert datetime + datetime64 values when concatenating frames
          (GH2624)
        Raise a more helpful error message in merge operations when one
          DataFrame has duplicate columns (GH2649)
        Fix partial date parsing issue occuring only when code is run at EOM
          (GH2618)
        Prevent MemoryError when using counting sort in sortlevel with
          high-cardinality MultiIndex objects (GH2684)
        Fix Period resampling bug when all values fall into a single bin
          (GH2070)
        Fix buggy interaction with usecols argument in read_csv when there is
          an implicit first index column (GH2654)


To generate a diff of this commit:
cvs rdiff -u -r1.2 -r1.3 pkgsrc/math/py-pandas/Makefile \
    pkgsrc/math/py-pandas/distinfo

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.




Home | Main Index | Thread Index | Old Index