Releases: iterative/datachain
0.7.10
What's Changed
- add some missing type hint by @shibuiwilliam in #648
- update readme - lower use cases to the ground by @shcheklein in #668
- Add 'bit_hamming_distance' and 'byte_hamming_distance' SQLite functions by @dreadatour in #669
- feat: update documentation and enhance navigation structure by @yathomasi in #670
- Allow testing external contributions using secrets by @0x2b3bfa0 in #667
- Fix readme by @dreadatour in #674
New Contributors
- @shibuiwilliam made their first contribution in #648
Full Changelog: 0.7.9...0.7.10
0.7.9
What's Changed
- Migrate the datasets and job urls by @amritghimire in #652
- pytorch dataset: pass cache/prefetch to DataChain instances by @skshetry in #653
- Use 'setup' in HuggingFace llm example by @dreadatour in #662
- Bump ultralytics from 8.3.29 to 8.3.37 by @dependabot in #626
- to_pytorch: enable prefetching by @skshetry in #664
- order datasets and versions in list_datasets query by @mattseddon in #665
Full Changelog: 0.7.8...0.7.9
0.7.8
What's Changed
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #654
- Update base Func class and tests by @dreadatour in #641
Full Changelog: 0.7.7...0.7.8
0.7.7
What's Changed
- Remove 'partition_by' requirement from 'group_by' method by @dreadatour in #649
- Support 'case' operator inside functions by @dreadatour in #651
- Add test for 'group_by' without 'partition_by' by @dreadatour in #650
Full Changelog: 0.7.6...0.7.7
0.7.6
What's Changed
- Using version uuid in datachain pull by @ilongin in #621
- Add option to run the datachain query to Studio by @amritghimire in #579
- Refactor the cli commands for the datasets by @amritghimire in #622
- remove catalog.get_file_from_row by @mattseddon in #645
- Removing id generator by @ilongin in #594
Full Changelog: 0.7.5...0.7.6
0.7.5
What's Changed
- do not use orjson to parse dataset version preview by @mattseddon in #646
Full Changelog: 0.7.4...0.7.5
0.7.4
What's Changed
- remove unused _parse_datasets method from abstract db metastore by @mattseddon in #643
- do not pre-load dataset version preview from string by @mattseddon in #642
Full Changelog: 0.7.3...0.7.4
0.7.3
What's Changed
- Add studio flag for rm-dataset and edit-dataset by @amritghimire in #572
- docs: update README.rst by @eltociear in #639
- add DatasetListRecord and DatasetListVersion dataclasses to only extract required fields when listing datasets by @mattseddon in #634
Full Changelog: 0.7.2...0.7.3
0.7.2
What's Changed
- feat(dc): make expand work if some fields are optional by @shcheklein in #607
- fix(tests): minor fix in comment by @shcheklein in #618
- test_query_e2e: do not interrupt running script multiple times by @skshetry in #614
- sqlitedb: retain db_file in _connect method to ensure proper connection on db.clone() by @skshetry in #620
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #623
- Bump astral-sh/setup-uv from 3 to 4 by @dependabot in #625
- dataset_select_paginated: do not clone on each paginated query by @skshetry in #629
- docs: document prefetch by @skshetry in #630
- fix type issues produced by new version of pydantic by @mattseddon in #638
- Finish SQL functions refactoring by @dreadatour in #543
Full Changelog: 0.7.1...0.7.2
0.7.1
What's Changed
- Increase resolution of train_test_split by @0x2b3bfa0 in #611
- Update ultralytics models (take two) by @dreadatour in #613
Full Changelog: 0.7.0...0.7.1