Releases: xorbitsai/xorbits
v0.6.1
What's new in 0.6.1 (2023-09-15)
These are the changes in xorbits v0.6.1.
New features
- FEAT: Impl series setitem by @ChengjieLi28 in #694
Enhancements
- ENH: Add a status monitor to trace the running stage of subtask by @Flying-Tom in #677
- ENH: impl array protocol for series and index by @UranusSeven in #531
- BLD: ADLFS import error and docker build failed by @ChengjieLi28 in #698
Bug fixes
- BUG: Tensor map chunk error when func is `tolist` by @ChengjieLi28 in #697
Others
- CHORE: Deprecate `use_inf_as_na` option by @ChengjieLi28 in #689
- CHORE: Fix asv CI by @aresnow1 in #654
Full Changelog: v0.6.0...v0.6.1
v0.6.0
What's new in 0.6.0 (2023-09-08)
These are the changes in xorbits v0.6.0.
New features
- FEAT: support loading zip file for read_parquet by @YibinLiu666 in #662
Enhancements
- ENH: Use memoryview to open reader in MMAP storage backend by @ChengjieLi28 in #675
- ENH: Compatible with Pandas 2.1.0 by @ChengjieLi28 in #679
- ENH: Improve use_arrow_dtype for pandas 2.0+ by @codingl2k1 in #580
- ENH: Default enable arrow dtype for read_parquet if pandas>=2.1 by @codingl2k1 in #685
Bug fixes
- BUG: ipython data missing columns due to column pruning by @ChengjieLi28 in #681
Documentation
- DOC: Readme for tpch benchmark script by @ChengjieLi28 in #676
Others
- Add support to import xorbits.numpy.special functions by @luweizheng in #683
- CHORE: Fix Pandas <= 2.0.3 for now by @ChengjieLi28 in #678
- CHORE: fsspec as dependency by @aresnow1 in #686
Full Changelog: v0.5.2...v0.6.0
v0.5.2
What's new in 0.5.2 (2023-08-30)
These are the changes in xorbits v0.5.2.
New features
- FEAT: MMAP storage backend by @ChengjieLi28 in #674
Enhancements
- ENH: Use arrow table to transfer dataset meta by @codingl2k1 in #663
Bug fixes
- BUG: Fix dtypes of `pd.pivot_table` by @aresnow1 in #668
- BUG: Fix tpch by @codingl2k1 in #670
Tests
- TST: Add test_put_get_arrow by @codingl2k1 in #656
Others
- CHORE: Modify required reviews by @aresnow1 in #665
- CHORE: disable infer_datetime_format option to reduce warnings by @ChengjieLi28 in #673
Full Changelog: v0.5.1...v0.5.2
v0.5.1
What's new in 0.5.1 (2023-08-14)
These are the changes in xorbits v0.5.1.
New features
- FEAT: Export dataset by @codingl2k1 in #639
- FEAT: Iterable dataset by @codingl2k1 in #643
Enhancements
- ENH: Compatible with scikit-learn 1.3.0 by @ChengjieLi28 in #648
- BLD: fix docker build on python 3.11 by @ChengjieLi28 in #642
- BLD: add datasets dependency in docker base image by @ChengjieLi28 in #644
- BLD: git reset to fix dirty tag by @aresnow1 in #655
- BLD: Fix source building by @aresnow1 in #657
- BLD: Fix uploading wheel when it exists by @aresnow1 in #658
Bug fixes
- BUG: Fix Huggingface dataset loader filelock conflict by @codingl2k1 in #633
- BUG: Fix pre-commit arg and CI node version by @Flying-Tom in #622
- BUG: Fix missing ArrowDtype by @codingl2k1 in #641
- BUG: Fix `_generate_value` when building mock dataframe by @aresnow1 in #645
- BUG: Fix issues when connecting to a real yarn cluster by @ChengjieLi28 in #647
Tests
- TST: K8s with juicefs CI may lead to pv conflict by @ChengjieLi28 in #646
Others
- Fix copyright by @codingl2k1 in #651
- CHORE: Remove win32 support by @aresnow1 in #652
Full Changelog: v0.5.0...v0.5.1
v0.5.0
What's new in 0.5.0 (2023-07-28)
These are the changes in xorbits v0.5.0.
New features
- FEAT: Basic datasets feature by @codingl2k1 in #600
- FEAT: Raise clearer errors when encountering OOM by @Flying-Tom in #621
- Feat: Basic datasets getitem by @codingl2k1 in #628
Bug fixes
- BUG: Fix `pd.read_csv` with encoding parameter by @aresnow1 in #614
- BUG: fix from_tensor by @Hank0626 in #625
- BUG: fix drop_duplicates no dtypes by @Hank0626 in #627
Documentation
- DOC: Add best practice for loading data by @aresnow1 in #620
- Doc: Datasets doc by @codingl2k1 in #619
Others
- CHORE: add msbuild to CI by @YibinLiu666 in #631
New Contributors
- @Flying-Tom made their first contribution in #621
Full Changelog: v0.4.4...v0.5.0
v0.4.4
v0.4.3
What's new in 0.4.3 (2023-07-20)
These are the changes in xorbits v0.4.3.
Enhancements
- ENH: Support xoscar copy_to by @ChengjieLi28 in #582
Bug fixes
- BUG: Fix Read Parquet by @Hank0626 in #590
- BUG: Fix LGBM CI by @Hank0626 in #599
- BUG: Fetch to local when print object data by @aresnow1 in #598
- BUG: Fix compatibility with the newest lightgbm by @yifeis7 in #601
Others
- Bump semver from 5.7.1 to 5.7.2 in /python/xorbits/web/ui by @dependabot in #594
- Bump word-wrap from 1.2.3 to 1.2.4 in /python/xorbits/web/ui by @dependabot in #606
Full Changelog: v0.4.2...v0.4.3
v0.4.2
What's new in 0.4.2 (2023-07-10)
These are the changes in xorbits v0.4.2, a hotfix release that addresses a few crucial issues.
Bug fixes
- BUG: Fix StringAccessor getitem error by @codingl2k1 in #588
- BUG: Fix RuntimeError: cannot schedule new futures after shutdown by @codingl2k1 in #589
Full Changelog: v0.4.1...v0.4.2
v0.4.1
What's new in 0.4.1 (2023-07-07)
These are the changes in xorbits v0.4.1.
New features
- FEAT: Support base `Index.str` accessor by @RayJi01 in #511
- FEAT: External Storage JuiceFS on K8S by @Zhou1213CN in #497
- FEAT: Support pandas option by @Hank0626 in #535
- FEAT: Support nlargest and nsmallest op by @ZJU3190105746 in #530
- FEAT: Support read parquet from http urls by @aresnow1 in #573
Enhancements
- ENH: Re-impl DataFrameNunique op by @ChengjieLi28 in #577
- BLD: Fix scipy version in pyproject.toml by @aresnow1 in #585
- CLN: Fix github urls by @qinxuye in #568
Bug fixes
- BUG: Fix password lost when calling `pd.read_sql` by @aresnow1 in #545
- BUG: Fix error for unsupported input type of `read_sql` by @yifeis7 in #547
- BUG: fix sparse divide CI by @Hank0626 in #550
- BUG: fix joblib CI by @Hank0626 in #554
- BUG: Fix build error on macos by @codingl2k1 in #565
- BUG: Fix dataframe shape property by @yifeis7 in #566
- BUG: Match more error message by @aresnow1 in #575
- BUG: Fix importing learn.contrib is slow by @codingl2k1 in #569
Documentation
- DOC: Documentation for JuiceFS on Kubernetes by @Zhou1213CN in #538
- DOC: Add Chinese doc for external storage JuiceFS on K8S by @Zhou1213CN in #551
- DOC: update why xorbits by @onesuper in #556
- DOC: update readme by @onesuper in #557
- DOC: make the slurm docs more concise by @luweizheng in #561
Others
- Bump scipy from 1.9.2 to 1.10.0 in /CI by @dependabot in #584
- CHORE: Restrict fork repositories from using public ci resources by @ChengjieLi28 in #574
New Contributors
- @qinxuye made their first contribution in #559
- @onesuper made their first contribution in #556
- @codingl2k1 made their first contribution in #565
- @ZJU3190105746 made their first contribution in #530
Full Changelog: v0.4.0...v0.4.1
v0.4.0
What's new in 0.4.0 (2023-06-21)
These are the changes in xorbits v0.4.0.
Bug fixes
- BUG: fix `sort_index` bug of multi-level indexed Dataframe with multi-chunk by @pangyoki in #506
- BUG: fix dataframe sample by @Hank0626 in #532
- Bug: Fix numpy CI by @Hank0626 in #537
- BUG: fix lightgbm by @Hank0626 in #541
- BUG: Suppress scipy warnings by @aresnow1 in #543
Documentation
- DOC: Init Xgboost Doc by @Hank0626 in #525
- DOC: Updated doc for python support version by @shark-21 in #518
- DOC: Install xgboost for doc building by @aresnow1 in #539
- DOC: Add lightgbm doc by @yifeis7 in #540
Full Changelog: v0.3.2...v0.4.0