Docker environment for data processing with Spark and Airflow, using PostgreSQL as the backing database
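Create the shared network. Run only one of the two commands; the second one pins the subnet, and creating a network whose name already exists fails:
docker network create --driver bridge data-processing
docker network create --subnet=172.30.0.0/16 --driver bridge data-processing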
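Because docker network create returns an error when the network name is already taken, rerunning the setup can stop at this step. The following is a minimal sketch of an idempotent wrapper. The function name ensure_network is hypothetical and not part of this repository; the network name and subnet are the ones used above.

```shell
#!/bin/sh
# Values taken from the commands above.
NETWORK="data-processing"
SUBNET="172.30.0.0/16"

# Hypothetical helper: create the bridge network only if it is missing.
ensure_network() {
    # `docker network inspect` exits non-zero when the network does not exist.
    if docker network inspect "$1" >/dev/null 2>&1; then
        echo "network $1 already exists"
    else
        docker network create --subnet="$2" --driver bridge "$1" >/dev/null &&
            echo "network $1 created"
    fi
}
```

After sourcing the snippet, it can be called as: ensure_network "$NETWORK" "$SUBNET"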
cd postgres
docker compose up -d
cd airflow
mkdir -p ./dags ./logs ./plugins ./config ./wallet
docker compose up airflow-init
docker compose up -d
Adjust the user and password in the .env file
cd spark
mkdir -p ./data ./jobs ./logs ./wallet
docker compose up -d --scale worker=3
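To confirm that the scale request took effect, the containers of the worker service can be counted. This is a rough sketch: the helper names count_service_containers and check_scale are hypothetical, and it assumes the commands are run from the spark directory so that Compose picks up the same compose file.

```shell
#!/bin/sh
# Hypothetical helper: count the containers Compose lists for one service.
count_service_containers() {
    # `docker compose ps -q SERVICE` prints one container ID per line.
    docker compose ps -q "$1" | wc -l | tr -d ' '
}

# Hypothetical helper: compare the listed count with the expected scale.
check_scale() {
    actual=$(count_service_containers "$1")
    if [ "$actual" -eq "$2" ]; then
        echo "$1: $actual of $2 containers listed"
    else
        echo "$1: expected $2, found $actual"
        return 1
    fi
}
```

For the command above the call would be: check_scale worker 3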
http://localhost:9091
http://localhost:18081
http://localhost:8080
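Once all stacks are up, the three addresses above can be probed from the host. The sketch below assumes curl is installed; the function names check_url and check_all are hypothetical. It only reports whether each address answers without an HTTP error and makes no claim about which service sits behind which port.

```shell
#!/bin/sh
# Hypothetical helper: report whether a URL answers without an HTTP error.
check_url() {
    # -f makes curl fail on HTTP status >= 400, -s silences progress output,
    # --max-time stops the probe from hanging on an unreachable port.
    if curl -fs -o /dev/null --max-time 5 "$1" 2>/dev/null; then
        echo "UP $1"
    else
        echo "DOWN $1"
    fi
}

# Probe the three addresses listed above.
check_all() {
    for url in http://localhost:9091 http://localhost:18081 http://localhost:8080; do
        check_url "$url"
    done
}
```

Calling check_all prints one UP or DOWN line per address.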