Serverless Data Lake Framework (SDLF)

An AWS Professional Service open source initiative | [email protected]

The Serverless Data Lake Framework (SDLF) is a collection of reusable artifacts aimed at accelerating the delivery of enterprise data lakes on AWS, shortening the deployment time to production from several months to a few weeks. It can be used by AWS teams, partners and customers to implement the foundational structure of a data lake following best practices.

Motivation

A data lake gives your organization agility. It provides a repository where consumers can quickly find the data they need and use it in their business projects. However, building a data lake can be complex; there’s a lot to think about beyond the storage of files. For example, how do you catalog the data so you know what you’ve stored? What ingestion pipelines do you need? How do you manage data quality? How do you keep the code for your transformations under source control? How do you manage development, test and production environments? Building a solution that addresses these use cases can take many weeks and this time can be better spent innovating with data and achieving business goals. The SDLF is a collection of production-hardened, best practice templates which accelerate your data lake implementation journey on AWS, so that you can focus on use cases that generate value for business.

Customers using the SDLF

If you would like us to include your company’s name and/or logo in the README file to indicate that your company is using the AWS Serverless Data Lake Framework, please raise a "Support the SDLF" issue. If you would like us to display your company’s logo, please raise a linked pull request to provide an image file for the logo. Note that by raising a Support the SDLF issue (and related pull request), you are granting AWS permission to use your company’s name (and logo) for the limited purpose described here and you are confirming that you have authority to grant such permission.

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
docs		docs
sdlf-cicd		sdlf-cicd
sdlf-datalakeLibrary		sdlf-datalakeLibrary
sdlf-dataset		sdlf-dataset
sdlf-foundations		sdlf-foundations
sdlf-pipLibrary		sdlf-pipLibrary
sdlf-pipeline		sdlf-pipeline
sdlf-stageA		sdlf-stageA
sdlf-stageB		sdlf-stageB
sdlf-team		sdlf-team
sdlf-utils		sdlf-utils
thirdparty-scms		thirdparty-scms
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
deploy.sh		deploy.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Serverless Data Lake Framework (SDLF)

Motivation

Customers using the SDLF

Workshop

Read The Docs

Ingestion/Processing Library

About

Releases

Packages

Languages

License

prajeshrawat25/aws-serverless-data-lake-framework

Folders and files

Latest commit

History

Repository files navigation

Serverless Data Lake Framework (SDLF)

Motivation

Customers using the SDLF

Workshop

Read The Docs

Ingestion/Processing Library

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages