Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

master merge for 1.0.0 release #1816

Merged
merged 55 commits into from
Sep 16, 2024
Merged
Changes from 1 commit
Commits
Show all changes
55 commits
Select commit Hold shift + click to select a range
fbf0ef4
Feat/1492 extend timestamp config (#1669)
donotpush Aug 30, 2024
1723faa
Fix/1571 Incremental: Optionally raise, load, or ignore raise records…
willi-mueller Aug 30, 2024
36c0d14
fix installation command" (#1741)
novica Sep 2, 2024
dd973c5
Don't use Custom Embedding Functions (#1771)
Pipboyguy Sep 3, 2024
e9468cb
sets default concurrency for blob upload for adlfs to 1 (#1779)
rudolfix Sep 3, 2024
8995f70
Add historic docs versions
sh-rp Sep 6, 2024
144d3b8
disable docusaurus contextual search
sh-rp Sep 6, 2024
632e823
Docs/update deploy dagster (#1761)
mariarice15 Sep 6, 2024
301d0a2
updated the documentation (#1784)
dat-a-man Sep 6, 2024
1b66e0e
Correct wrong code example for apply_hints( incremental(xx) ) (#1785)
w0ut0 Sep 6, 2024
84a8e25
Refined documentation with minor improvements (#1760)
dat-a-man Sep 6, 2024
8dd1467
Updated performance docs for `file_max_items` and `file_max_bytes` (#…
dat-a-man Sep 6, 2024
68b0bb8
Fix/1790 support incremental load with arrow when cursor column is no…
willi-mueller Sep 8, 2024
51516b1
move rest_api, sql_database and filesystem sources to dlt core (#1728)
willi-mueller Sep 8, 2024
84f9fa7
controls row groups and empty tables (#1782)
rudolfix Sep 8, 2024
dad2a08
prepares for nested references (#1774)
rudolfix Sep 9, 2024
b1bcf21
Fix link and add env var naming example (#1763)
VioletM Sep 10, 2024
79c70c9
Feat/1749 abort load package and raise exception on terminal errors i…
willi-mueller Sep 10, 2024
f7be6f0
Use consistent capitalization in the docs (#1798)
burnash Sep 10, 2024
a6857c9
migrates complex data type and nested reference hints (#1792)
rudolfix Sep 10, 2024
6451bd7
bumps to prerelease 0.9.9a1
rudolfix Sep 10, 2024
8624c9a
Move sources and destinations to the top level
VioletM Aug 27, 2024
4958238
Update the css
VioletM Aug 28, 2024
c875022
Remove the integrations page
VioletM Sep 10, 2024
25e03d1
Update the icons
VioletM Sep 10, 2024
0c5df29
Minor
VioletM Sep 10, 2024
2a636a1
Fix icons
VioletM Sep 10, 2024
e3a9507
Fix icons
VioletM Sep 10, 2024
b5faf1c
Change link to the custom source
VioletM Sep 11, 2024
bc99c93
Merge pull request #1750 from dlt-hub/docs/move_sources_and_destinations
akelad Sep 11, 2024
5115a99
Fix typo "frequenly" (#1800)
ruudwelten Sep 11, 2024
555a918
Update blog link in the docs' menu (#1802)
burnash Sep 11, 2024
27539c9
Refactor filesystem doc (#1745)
VioletM Sep 12, 2024
0baebde
structural and content changes to the sql_database doc (#1623)
rahuljo Sep 12, 2024
dd7709a
Update REST API docs (#1795)
akelad Sep 12, 2024
4df648f
adding the sql_database tutorial (#1796)
rahuljo Sep 12, 2024
9e46b5c
skips tables without jobs when merging delta tables (#1803)
rudolfix Sep 12, 2024
62187c7
Add filesystem tutorial (#1775)
VioletM Sep 13, 2024
12e2f12
creates a single source in extract for all resource instances passed …
rudolfix Sep 14, 2024
325d927
Enable BigQuery schema auto-detection with partitioning and clusterin…
Pipboyguy Sep 14, 2024
9580baf
Sqlalchemy destination (#1734)
steinitzu Sep 14, 2024
eb4b1ba
Docs: update the introduction, add the rest_api tutorial (#1729)
burnash Sep 14, 2024
4e45ea4
Feat/1730 extend filesystem sftp (#1769)
donotpush Sep 14, 2024
5b92fea
Reorder sidebar (#1787)
mariarice15 Sep 14, 2024
1714801
Masks secrets in traces. (#1797)
willi-mueller Sep 14, 2024
4127a6c
Docs: fix broken code detection/highlighting (#1810)
burnash Sep 16, 2024
84b828b
sources re-ordering
sh-rp Sep 16, 2024
f1d733f
fix docusaurus deployment
sh-rp Sep 16, 2024
c056b83
Fix link in credentials
VioletM Sep 16, 2024
866bce3
bumps to 1.0.0 + docs cleanup (#1809)
rudolfix Sep 16, 2024
8f325c3
Update CSS class names (#1814)
burnash Sep 16, 2024
21fba1b
Update terminology in filesystem destination (#1804)
burnash Sep 16, 2024
ef929ca
Update redirects in netlify.toml (#1815)
burnash Sep 16, 2024
b1d20ac
adds new versioning scheme to README
rudolfix Sep 16, 2024
e48f641
Merge branch 'master' into devel
rudolfix Sep 16, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Move sources and destinations to the top level
VioletM committed Sep 11, 2024

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
commit 8624c9a0056957ec1ec8ea242c0a0c9220c24627
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
---
title: Filesystem
description: dlt verified source for Readers Source and Filesystem
description: AWS S3, Google Cloud Storage, Azure Blob Storage, local files
keywords: [readers source and filesystem, filesystem, readers source]
---
import Header from './_source-info-header.md';
@@ -123,7 +123,7 @@ For more information, read the

3. You can pass the bucket URL and glob pattern or use `config.toml`. For local filesystems, use
`file://` as follows:

```toml
[sources.filesystem] # use [sources.readers.credentials] for the "readers" source
bucket_url='file://Users/admin/Documents/csv_files'
@@ -136,7 +136,7 @@ For more information, read the
bucket_url='~\Documents\csv_files\'
file_glob="*"
```

In the example above we use Windows path to current user's Documents folder. Mind that literal toml string (single quotes)
was used to conveniently use the backslashes without need to escape.

46 changes: 30 additions & 16 deletions docs/website/docs/dlt-ecosystem/verified-sources/index.md
Original file line number Diff line number Diff line change
@@ -1,31 +1,45 @@
---
title: Verified sources
description: List of verified sources
keywords: ['verified source']
title: Sources
description: Available sources
keywords: ['source']
---
import DocCardList from '@theme/DocCardList';
import Link from '../../_book-onboarding-call.md';
import DocCardList from '@theme/DocCardList';
import {useCurrentSidebarCategory} from '@docusaurus/theme-common';

Choose from our collection of verified sources, developed and maintained by the dlt team and community. Each source is rigorously tested against a real API and provided as Python code for easy customization.

Planning to use dlt in production and need a source that isn't listed? We're happy to help you build it: <Link />.
Planning to use dlt in production and need a source that isn't listed? We're happy to help you build it: <Link/>.

### Popular sources
### Core sources

- [SQL databases](sql_database). Supports PostgreSQL, MySQL, MS SQL Server, BigQuery, Redshift, and more.
- [REST API generic source](rest_api). Loads data from REST APIs using declarative configuration.
- [OpenAPI source generator](openapi-generator). Generates a source from an OpenAPI 3.x spec using the REST API source.
- [Cloud and local storage](filesystem). Retrieves data from AWS S3, Google Cloud Storage, Azure Blob Storage, local files, and more.
<DocCardList items={useCurrentSidebarCategory().items.filter(
item => item.label === '30+ SQL Databases' || item.label === 'REST API generic source' || item.label === 'Filesystem'
)} />

### Full list of verified sources
### Verified sources

<DocCardList />
Choose from our collection of verified sources, developed and maintained by the dlt team and community. Each source is rigorously tested against a real API and provided as Python code for easy customization.

:::tip
If you're looking for a source that isn't listed and it provides a REST API, be sure to check out our [REST API generic source](rest_api)
source.
If you couldn't find a source implementation, you can easily create your own, check our [tutorial](../../tutorial) to learn how!
:::

<DocCardList items={useCurrentSidebarCategory().items.filter(
item => item.label !== '30+ SQL Databases' && item.label !== 'REST API generic source'&& item.label !== 'Filesystem'
)} />

### What's the difference?

The main difference between the [core sources](#core-sources) and [verified sources](#verified-sources) lies in their structure.
Core sources are generic collections, meaning they can connect to a variety of systems. For example, [SQL Database source](sql_database) can connect to any
database which supports SQLAlchemy.

According to our telemetry, core sources are the most widely used among our users!

It's also important to note that core sources are integrated into the `dlt` core library,
whereas verified sources are maintained in a separate [repository](https://github.com/dlt-hub/verified-sources).
To use a verified source, you need to run the dlt init command, which will download the verified source code to
your working directory.


### Get help

Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
---
title: REST API generic source
description: dlt verified source for REST APIs
description: Loads data from REST APIs using declarative configuration
keywords: [rest api, restful api]
---
import Header from './_source-info-header.md';
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
---
title: 30+ SQL Databases
description: dlt pipeline for SQL Database
description: PostgreSQL, MySQL, MS SQL Server, BigQuery, Redshift, and more
keywords: [sql connector, sql database pipeline, sql database]
---
import Header from './_source-info-header.md';
138 changes: 64 additions & 74 deletions docs/website/sidebars.js
Original file line number Diff line number Diff line change
@@ -44,84 +44,74 @@ const sidebars = {
},
{
type: 'category',
label: 'Integrations',
label: 'Sources',
link: {
type: 'doc',
id: 'dlt-ecosystem/index',
id: 'dlt-ecosystem/verified-sources/index',
},
items: [
{
type: 'category',
label: 'Sources',
link: {
type: 'doc',
id: 'dlt-ecosystem/verified-sources/index',
},
items: [
'dlt-ecosystem/verified-sources/airtable',
'dlt-ecosystem/verified-sources/amazon_kinesis',
'dlt-ecosystem/verified-sources/arrow-pandas',
'dlt-ecosystem/verified-sources/asana',
'dlt-ecosystem/verified-sources/chess',
'dlt-ecosystem/verified-sources/facebook_ads',
'dlt-ecosystem/verified-sources/filesystem',
'dlt-ecosystem/verified-sources/freshdesk',
'dlt-ecosystem/verified-sources/github',
'dlt-ecosystem/verified-sources/google_ads',
'dlt-ecosystem/verified-sources/google_analytics',
'dlt-ecosystem/verified-sources/google_sheets',
'dlt-ecosystem/verified-sources/hubspot',
'dlt-ecosystem/verified-sources/inbox',
'dlt-ecosystem/verified-sources/jira',
'dlt-ecosystem/verified-sources/kafka',
'dlt-ecosystem/verified-sources/matomo',
'dlt-ecosystem/verified-sources/mongodb',
'dlt-ecosystem/verified-sources/mux',
'dlt-ecosystem/verified-sources/notion',
'dlt-ecosystem/verified-sources/personio',
'dlt-ecosystem/verified-sources/pg_replication',
'dlt-ecosystem/verified-sources/pipedrive',
'dlt-ecosystem/verified-sources/rest_api',
'dlt-ecosystem/verified-sources/openapi-generator',
'dlt-ecosystem/verified-sources/salesforce',
'dlt-ecosystem/verified-sources/scrapy',
'dlt-ecosystem/verified-sources/shopify',
'dlt-ecosystem/verified-sources/sql_database',
'dlt-ecosystem/verified-sources/slack',
'dlt-ecosystem/verified-sources/strapi',
'dlt-ecosystem/verified-sources/stripe',
'dlt-ecosystem/verified-sources/workable',
'dlt-ecosystem/verified-sources/zendesk'
]
},
{
type: 'category',
label: 'Destinations',
link: {
type: 'doc',
id: 'dlt-ecosystem/destinations/index',
},
items: [
'dlt-ecosystem/destinations/bigquery',
'dlt-ecosystem/destinations/databricks',
'dlt-ecosystem/destinations/duckdb',
'dlt-ecosystem/destinations/mssql',
'dlt-ecosystem/destinations/synapse',
'dlt-ecosystem/destinations/clickhouse',
'dlt-ecosystem/destinations/filesystem',
'dlt-ecosystem/destinations/postgres',
'dlt-ecosystem/destinations/redshift',
'dlt-ecosystem/destinations/snowflake',
'dlt-ecosystem/destinations/athena',
'dlt-ecosystem/destinations/weaviate',
'dlt-ecosystem/destinations/lancedb',
'dlt-ecosystem/destinations/qdrant',
'dlt-ecosystem/destinations/dremio',
'dlt-ecosystem/destinations/destination',
'dlt-ecosystem/destinations/motherduck'
]
},
],
'dlt-ecosystem/verified-sources/airtable',
'dlt-ecosystem/verified-sources/amazon_kinesis',
'dlt-ecosystem/verified-sources/arrow-pandas',
'dlt-ecosystem/verified-sources/asana',
'dlt-ecosystem/verified-sources/chess',
'dlt-ecosystem/verified-sources/facebook_ads',
'dlt-ecosystem/verified-sources/filesystem',
'dlt-ecosystem/verified-sources/freshdesk',
'dlt-ecosystem/verified-sources/github',
'dlt-ecosystem/verified-sources/google_ads',
'dlt-ecosystem/verified-sources/google_analytics',
'dlt-ecosystem/verified-sources/google_sheets',
'dlt-ecosystem/verified-sources/hubspot',
'dlt-ecosystem/verified-sources/inbox',
'dlt-ecosystem/verified-sources/jira',
'dlt-ecosystem/verified-sources/kafka',
'dlt-ecosystem/verified-sources/matomo',
'dlt-ecosystem/verified-sources/mongodb',
'dlt-ecosystem/verified-sources/mux',
'dlt-ecosystem/verified-sources/notion',
'dlt-ecosystem/verified-sources/personio',
'dlt-ecosystem/verified-sources/pg_replication',
'dlt-ecosystem/verified-sources/pipedrive',
'dlt-ecosystem/verified-sources/rest_api',
'dlt-ecosystem/verified-sources/openapi-generator',
'dlt-ecosystem/verified-sources/salesforce',
'dlt-ecosystem/verified-sources/scrapy',
'dlt-ecosystem/verified-sources/shopify',
'dlt-ecosystem/verified-sources/sql_database',
'dlt-ecosystem/verified-sources/slack',
'dlt-ecosystem/verified-sources/strapi',
'dlt-ecosystem/verified-sources/stripe',
'dlt-ecosystem/verified-sources/workable',
'dlt-ecosystem/verified-sources/zendesk'
]
},
{
type: 'category',
label: 'Destinations',
link: {
type: 'doc',
id: 'dlt-ecosystem/destinations/index',
},
items: [
'dlt-ecosystem/destinations/bigquery',
'dlt-ecosystem/destinations/databricks',
'dlt-ecosystem/destinations/duckdb',
'dlt-ecosystem/destinations/mssql',
'dlt-ecosystem/destinations/synapse',
'dlt-ecosystem/destinations/clickhouse',
'dlt-ecosystem/destinations/filesystem',
'dlt-ecosystem/destinations/postgres',
'dlt-ecosystem/destinations/redshift',
'dlt-ecosystem/destinations/snowflake',
'dlt-ecosystem/destinations/athena',
'dlt-ecosystem/destinations/weaviate',
'dlt-ecosystem/destinations/lancedb',
'dlt-ecosystem/destinations/qdrant',
'dlt-ecosystem/destinations/dremio',
'dlt-ecosystem/destinations/destination',
'dlt-ecosystem/destinations/motherduck'
]
},
{
type: 'category',