Build cloud infrastructure #4

Open
josephmachado opened this issue Sep 1, 2023 · 8 comments

Comments

@josephmachado
Owner

josephmachado commented Sep 1, 2023

Description

Currently the project is only meant to be run locally. Implement a solution to deploy it to the cloud.

Areas to cover

  1. Platform to use
  2. CI/CD
  3. Environment variable management (with secrets management)
  4. Cloud storage & processing
  5. Scheduling & orchestration
  6. Logging, metadata & debugging

Choice of platform

Please leave a comment with your choices for areas 1 - 5 above.

@andreale28

Hi Joseph

Thanks for sharing your project.

As for the next area to cover, I would love to see this project expanded to cover orchestration and/or the platform. Otherwise, how about some discussion of open table formats?

@abdelhaqs

All the topics mentioned above would be interesting to see.

@flaviassantos

Great initiative @josephmachado! I would love to see more about points 2, 3 & 5. :)

@cjj1120

cjj1120 commented Nov 13, 2023

I would like to further explore 4 & 5; those two provide the best value for me as a data engineer. I would be glad to support if you can help with those two! Great job, and thanks for the website. I appreciate it.

@kottargiveer

Great project. Thank you.

However, it would be great if you could teach us how to create such projects from scratch.

How do we create those containers so that we can ship projects from dev to QA to PROD?

For example, the underlying scripts for these commands:
make up # Spin up containers
make ddl # Create tables & views
make ci # Run checks & tests
make etl # Run etl
make spark-sh # Spark shell to check created tables

@cjj1120

cjj1120 commented Feb 29, 2024

@kottargiveer The underlying commands are all in the Makefile, if that's what you're asking.
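
To make the order of those targets concrete, here is a minimal sketch that chains them from Python. The target names and descriptions are copied from the list quoted above. The helper names (`build_commands`, `run_pipeline`) and the `dry_run` flag are hypothetical and not part of the project; the actual recipes behind each target live in the Makefile and are not reproduced here. `make spark-sh` is left out because it opens an interactive shell.

```python
import subprocess

# Target names and descriptions as quoted in this thread.
MAKE_TARGETS = [
    ("up", "Spin up containers"),
    ("ddl", "Create tables & views"),
    ("ci", "Run checks & tests"),
    ("etl", "Run etl"),
]


def build_commands(targets=MAKE_TARGETS):
    """Return one `make <target>` argument list per target, in order."""
    return [["make", name] for name, _ in targets]


def run_pipeline(dry_run=True):
    """Print each command; run it only when dry_run is False.

    Hypothetical helper: with check=True, the first failing target
    raises CalledProcessError and the remaining targets are skipped.
    """
    for (_, description), cmd in zip(MAKE_TARGETS, build_commands()):
        print(f"{' '.join(cmd)}  # {description}")
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default, so nothing is started unless you opt in.
    run_pipeline(dry_run=True)
```

Calling `run_pipeline(dry_run=False)` from the repository root would invoke the real targets, assuming `make` and the project's containers are available on that machine.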

@cjj1120

cjj1120 commented Feb 29, 2024

My top 3 choices:
4 -> 5 -> 2

@GEJ1

GEJ1 commented Apr 5, 2024

Great project! I would be interested in all the topics mentioned above. Thank you very much!

7 participants