Directions

To start the containers individually, run the command listed under each service below.

Spark 3.3 with Jupyter Notebook

docker-compose up notebook

MinIO (S3-compatible storage layer)

docker-compose up minio

Nessie (transactional catalog for Apache Iceberg)

docker-compose up nessie

Dremio (data lakehouse platform: query engine, access layer, and more)

docker-compose up dremio
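
The four commands above each run in the foreground, so every service needs its own terminal. As a rough sketch of an alternative, the loop below starts each service in the background with the `-d` (detached) flag. It assumes the service names in docker-compose.yml match the ones used above; the `start_service` helper and the `DRY_RUN` variable are hypothetical names introduced here for illustration and are not part of this repo.

```shell
#!/bin/sh
# Service names copied from the commands above (assumed to match docker-compose.yml).
services="notebook minio nessie dremio"

# DRY_RUN=1 (the default in this sketch) only prints each command;
# set DRY_RUN=0 to actually start the containers.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: start one service, or print the command in dry-run mode.
start_service() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "docker-compose up -d $1"
  else
    # -d runs the container in the background so the loop can continue
    docker-compose up -d "$1"
  fi
}

for service in $services; do
  start_service "$service"
done
```

In detached mode the container output no longer appears in the terminal; `docker-compose logs -f notebook` follows the logs of a single service, and `docker-compose ps` lists what is running.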

Three folders in this repo are mapped specifically to the spark/notebook container; the volume mappings in docker-compose.yml show which ones. The top level of the repo contains:

datasets
lesson_code
notebooks
warehouse
docker-compose.yml
readme.md
