Skip to content

Latest commit

 

History

History
35 lines (23 loc) · 738 Bytes

readme.md

File metadata and controls

35 lines (23 loc) · 738 Bytes

Directions

To start up the different containers individually.

spark 3.3 with jupyter notebook

docker-compose up notebook

minio (s3-compatible storage layer)

docker-compose up minio

nessie (transactional catalog for Apache Iceberg)

docker-compose up nessie

Dremio (data lakehouse platform (query engine, access layer, more))

docker-compose up dremio

There are three folders in this repo mapped specifically to the spark/notebook container which are:

  • datasets use this for any sample datasets
  • notebooks for storing notebooks
  • warehouse to be used as a warehouse for written data

Find Guides and Tutorials Here