This is an Insight Data Engineering project. The objective is to construct a data pipeline to extract analytics from time-series satellite images. I implement the pipeline on AWS with the serverless Lambda service to get flexible performance scaling and cost efficiency.
Development and deployment: Docker, serverless
AWS: S3, Lambda, SQS, DynamoDB
Frontend: Dash
- Lambda has strict source code size limit. How do we include all dependencies of the image processing in the package?
- If the region spans to more than 1 scenes, how to combine them?
- How to avoid clouded images and access the quality of each scene?
A web interface that allows user to specify a region (by a geojson file) and returns historical urban region graph.
- AWS Account
- awscli
- Docker
- npm (serverless)
cp sample.env .env
vi .env
Put your AWS access key id and secret access key in .env
.
make build
npm install -g serverless
sls deploy