Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Abstract dataset #125

Closed
wants to merge 44 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
44 commits
Select commit Hold shift + click to select a range
d027ac5
Merge pull request #19 from quantumblacklabs/release/0.14.0
idanov May 16, 2019
c947de9
Merge pull request #51 from quantumblacklabs/release/0.14.1
nakhan98 May 31, 2019
8b218ec
Merge pull request #81 from quantumblacklabs/release/0.14.2
nakhan98 Jun 11, 2019
c9ca369
Merge pull request #105 from quantumblacklabs/release/0.14.3
nakhan98 Jun 26, 2019
f06a603
Merge pull request #184 from quantumblacklabs/release/0.15.0
921kiyo Aug 13, 2019
f69bd54
Merge pull request #232 from quantumblacklabs/release/0.15.1
Sep 12, 2019
f4e6ea8
Release 0.15.2
Oct 8, 2019
fc68c72
Merge pull request #307 from quantumblacklabs/release/0.15.4
DmitriiDeriabinQB Oct 30, 2019
e3cf344
Merge pull request #352 from quantumblacklabs/release/0.15.5
andrii-ivaniuk Dec 12, 2019
5d29840
Merge pull request #455 from quantumblacklabs/release/0.15.6
limdauto Feb 26, 2020
9ff268e
Merge pull request #477 from quantumblacklabs/hotfix/0.15.8
921kiyo Mar 5, 2020
0c9afa0
Merge pull request #607 from quantumblacklabs/release/0.16.0
limdauto May 20, 2020
7c9e434
Fix DataSet string representation for falsy values (#418)
DmitriiDeriabinQB Jun 23, 2020
5eca512
Fixed versioning on Windows (#673)
andrii-ivaniuk Jun 26, 2020
ee95b11
Fix broken links to documentation (#473)
jeverling Aug 3, 2020
cd72df9
Make Pylint 2.6.0 happy again (#765)
921kiyo Aug 24, 2020
573977b
Convert all format strings to f-strings (#574)
WaylonWalker Nov 20, 2020
fb4a328
Merge pull request #923 from quantumblacklabs/develop
limdauto Dec 17, 2020
255a65c
Update Copyright date to 2021 (#941)
merelcht Jan 13, 2021
e736726
Upgrade project's `requirements.txt` #ked 2540 (#1126)
ignacioparicio Jun 7, 2021
e5051aa
Improve error msg when versioning an existing dataset (#1144)
ignacioparicio Jun 11, 2021
4d75d89
DataSetError 'parsing config' thrown when ModuleNotFound (#1201)
Aug 2, 2021
1fe5988
Update docstrings to mention the meaning of the _SINGLE_PROCESS flag …
jiriklein Aug 3, 2021
8b4f8b5
implement __all__ in __init__ modules (#874)
WaylonWalker Sep 9, 2021
5de06ed
Update release notes post release (#1240)
Sep 20, 2021
468b97b
Use `pyupgrade` to replace format strings and more (#1242)
deepyaman Sep 22, 2021
d681a82
Remove McK headers + Licence from codebase + docs (#1285)
SajidAlamQB Nov 1, 2021
1ea8a92
Fix versioning tracking datasets (#1016)
merelcht Nov 8, 2021
5c4ff50
Ensure path is of type `str` if `overwrite` is set (#1094)
deepyaman Dec 8, 2021
51e589d
Fix error message whitespace (#1206)
puneet29 Feb 7, 2022
af55fd0
Merge pull request #1397 from kedro-org/develop
Mar 31, 2022
487eb33
Add abfss to the list of cloud protocols (#1348)
philomine May 24, 2022
35c1e45
Replace backticks with single quotes (#1584)
AhdraMeraliQB Jun 8, 2022
d490f9e
Generic typings abstract dataset (#1612)
daBlesr Jul 4, 2022
ee39e5e
Add gdrive to CLOUD_PROTOCOLS list (#1708)
SajidAlamQB Sep 9, 2022
c713e7d
Update Error message for `VersionNotFoundError` to handle Permission …
ankatiyar Oct 3, 2022
781fa1e
Remove redundant `resolve_load_version` call (#1911)
noklam Oct 20, 2022
5c134e5
Make Kedro instantiate datasets from `kedro_dataset`with higher prior…
noklam Nov 2, 2022
8b03ab6
Add support for Azure Data Lake Storage Gen2 URIs (#2190)
jmholzer Jan 16, 2023
860a4ee
Fix `s3n` and `s3a` bug by adding them to CLOUD_PROTOCOLS (#2326)
MigQ2 Feb 24, 2023
bd1ebfd
Added kedro/io/core.py kedro/io/__init__.py and kedro/utils.py from p…
bpmeek Mar 9, 2023
85125f2
Make AbstractDataSets from `kedro_datasets` instead of `kedro`
bpmeek Mar 9, 2023
389d34e
Removed `kedro~=0.18.4` as requirement, added `cachetools~=5.3`
bpmeek Mar 9, 2023
6cc3b09
Updated RELEASE.md to reflect changes
bpmeek Mar 9, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions kedro-datasets/RELEASE.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,10 @@
| `polars.CSVDataSet` | A `CSVDataSet` backed by [polars](https://www.pola.rs/), a lighting fast dataframe package built entirely using Rust. | `kedro_datasets.polars` |
| `snowflake.SnowparkTableDataSet` | Work with [Snowpark](https://www.snowflake.com/en/data-cloud/snowpark/) DataFrames from tables in Snowflake. | `kedro_datasets.snowflake` |

* Patched `kedro.io.core.py` and `kedro.utils` to `io.core.py` and `io.core.utils` respectively,
allowing for implementations of `AbstractDataSet` and `AbstractVersionedDataSet` to be shared with
and used by non-Kedro users.

## Bug fixes and other changes
* Add `mssql` backend to the `SQLQueryDataSet` DataSet using `pyodbc` library.

Expand Down
2 changes: 1 addition & 1 deletion kedro-datasets/kedro_datasets/api/api_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@
from typing import Any, Dict, Iterable, List, NoReturn, Union

import requests
from kedro.io.core import AbstractDataSet, DataSetError
from kedro_datasets.io.core import AbstractDataSet, DataSetError
from requests.auth import AuthBase


Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@

import fsspec
from Bio import SeqIO
from kedro.io.core import AbstractDataSet, get_filepath_str, get_protocol_and_path
from kedro_datasets.io.core import AbstractDataSet, get_filepath_str, get_protocol_and_path


class BioSequenceDataSet(AbstractDataSet[List, List]):
Expand Down
2 changes: 1 addition & 1 deletion kedro-datasets/kedro_datasets/dask/parquet_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@
import dask.dataframe as dd
import fsspec
import triad
from kedro.io.core import AbstractDataSet, get_protocol_and_path
from kedro_datasets.io.core import AbstractDataSet, get_protocol_and_path


class ParquetDataSet(AbstractDataSet[dd.DataFrame, dd.DataFrame]):
Expand Down
2 changes: 1 addition & 1 deletion kedro-datasets/kedro_datasets/email/message_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
from typing import Any, Dict

import fsspec
from kedro.io.core import (
from kedro_datasets.io.core import (
AbstractVersionedDataSet,
DataSetError,
Version,
Expand Down
2 changes: 1 addition & 1 deletion kedro-datasets/kedro_datasets/geopandas/geojson_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@

import fsspec
import geopandas as gpd
from kedro.io.core import (
from kedro_datasets.io.core import (
AbstractVersionedDataSet,
DataSetError,
Version,
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@

import fsspec
import holoviews as hv
from kedro.io.core import (
from kedro_datasets.io.core import (
AbstractVersionedDataSet,
DataSetError,
Version,
Expand Down
22 changes: 22 additions & 0 deletions kedro-datasets/kedro_datasets/io/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
"""``kedro.io`` provides functionality to read and write to a
number of data sets. At core of the library is ``AbstractDataSet``
which allows implementation of various ``AbstractDataSet``s.
"""

from .core import (
AbstractDataSet,
AbstractVersionedDataSet,
DataSetAlreadyExistsError,
DataSetError,
DataSetNotFoundError,
Version,
)

__all__ = [
"AbstractDataSet",
"AbstractVersionedDataSet",
"DataSetAlreadyExistsError",
"DataSetError",
"DataSetNotFoundError",
"Version",
]
Loading