Skip to content

Latest commit

 

History

History
197 lines (149 loc) · 8.23 KB

CHANGES.rst

File metadata and controls

197 lines (149 loc) · 8.23 KB

Changelog

Version 3.7.1 (2020-02-XX)

  • GH227 Fix a Type error when loading categorical data in dask without specifying it explicitly

Version 3.7.0 (2020-02-12)

Improvements

Version 3.6.2 (2019-12-17)

Improvements

Bug fixes

Version 3.6.1 (2019-12-11)

Bug fixes

  • Fix a regression introduced in 3.5.0 where predicates which allow multiple values for a field would generate duplicates

Version 3.6.0 (2019-12-03)

New functionality

Bug fixes

  • Fix addition of bogus index columns to Parquet files when using sort_partitions_by.
  • Fix bug where partition_on in write path drops empty DataFrames and can lead to datasets without tables.

Version 3.5.1 (2019-10-25)

Version 3.5.0 (2019-10-21)

New functionality

Bug fixes

  • Input to normalize_args is properly normalized to list
  • MetaPartition.load_dataframes now raises if table in columns argument doesn't exist
  • require urlquote>=1.1.0 (where urlquote.quoting was introduced)
  • Improve performance for some cases where predicates are used with the in operator.
  • Correctly preserve :class:`~kartothek.core.index.ExplicitSecondaryIndex` dtype when index is empty
  • Fixed DeprecationWarning in pandas CategoricalDtype
  • Fixed broken docstring for store_dataframes_as_dataset
  • Internal operations no longer perform schema validations. This will improve performance for batched partition operations (e.g. partition_on) but will defer the validation in case of inconsistencies to the final commit. Exception messages will be less verbose in these cases as before.
  • Fix an issue where an empty dataframe of a partition in a multi-table dataset would raise a schema validation exception
  • Fix an issue where the dispatch_by keyword would disable partition pruning
  • Creating dataset with non existing columns as explicit index to raise a ValueError

Breaking changes

  • Remove support for pyarrow < 0.13.0
  • Move the docs module from io_components to core

Version 3.4.0 (2019-09-17)

  • Add support for pyarrow 0.14.1
  • Use urlquote for faster quoting/unquoting

Version 3.3.0 (2019-08-15)

  • Fix rejection of bool predicates in :func:`~kartothek.serialization.filter_array_like` when bool columns contains None
  • Streamline behavior of store_dataset_from_ddf when passing empty ddf.
  • Fix an issue where a segmentation fault may be raised when comparing MetaPartition instances
  • Expose a date_as_object flag in kartothek.core.index.as_flat_series

Version 3.2.0 (2019-07-25)

Version 3.1.1 (2019-07-12)

Version 3.1.0 (2019-07-10)

Breaking:

Version 3.0.0 (2019-05-02)

  • Initial public release