Skip to content

Latest commit

 

History

History
108 lines (79 loc) · 6.25 KB

README.md

File metadata and controls

108 lines (79 loc) · 6.25 KB

Python DB Project Template

A project template for Python applications that need to use a relational database, helping to simplify the process for managing schema migrations and setting up a containerised DB instance for testing.

Involves an implementation of a Python service running on AWS with a PostgreSQL database, but can be adapted to other contexts.

Intended for use on a Mac or Linux development machine.

Features

  • A collection of commands/scripts and configurations leveraging the Atlas tool to make the process of creating, validating and applying schema migrations simple, safe and automated:
    • create-migrations command for generating migration files from changes to DB schema
    • check-migrations command for validating schema & migration state & integrity, and showing migration-related warnings (can be used as a pre-commit hook and in CI)
    • AWS Lambda-backed custom resource for applying migrations upon CloudFormation stack deployment (With a custom parameter that causes the CR to only be triggered when migration state changes, and a Lambda Layer to contain the Atlas binary).
  • Automatically installs Atlas and a license-free docker runtime for Mac machines (colima)
  • Pytest fixtures for managing a local containerised PostgreSQL database for testing, and providing a SQLAlchemy session wih per-test isolation.

Installation

Clone this repository, adapt it to your existing project if required.

Run make install to install project dependencies, including:

  • Docker engine runtime (colima)
  • Atlas DB migration tool
  • Python project dependencies
  • Downloads the postgres:16-alpine docker image for local testing and migration verification

Usage

Configure Database Schema Source

This template comes with an example configuration using SQLAlchemy as a DB schema source. Check the Atlas docs for how to update the atlas.hcl configuration file to use a different schema source.

Add your app project code

Add your application code to the src/app/ directory. Your application could be a Lambda function, a Flask/Fast API app running on AWS, or any other Python application that needs to interact with a relational database. You will also need to add appropriate resources to cloud-formation.yaml, or another solution to deploy your application.

Create Migrations

Delete the existing files in the src/db/migrations/ directory, then run make create-migrations from the project root to create initial migration and hash files.

The MigrationHash property of the AWS Custom Resource will be automatically updated to the new migration hash value, so that it will be considered updated and trigger the Lambda to run migrations when deployed.

Validate Migration files

Running make check-migrations will:

  • Verify that the MigrationHash property of the AWS Custom Resource is set appropriately
  • Verify hash integrity of migration files
  • Exit early if there are no changes to DB schema or migrations
  • Verify that DB schema and migrations are in sync (this involves running the migrations against a temporary database, which will raise any SQL semantic issues)
  • Run Atlas linting which generates warnings for common migration issues

The make lint command will run check-migrations along with other project linting and type checking. This command can be added to a Git pre-commit hook, and should be used in your automated CI process as part of PR validation. Linting warning outputs from the command should be included in the PR description for review.

Set configuration in AWS SAM template

The cloud-formation.yaml AWS SAM template file needs to be updated with the correct parameters for your project. You can use references to outputs from the database infrastructure resources, which should be defined in a separate stack.

Testing

Use the db_session_maker fixture in your tests to get a SQLAlchemy session maker with a connection to an automatically created PostgreSQL DB container. The DB tables will automatically be created, and content cleared between tests.

Example usage:

def test_my_function_that_needs_a_session(db_session: Session):
      # Set up test data in the DB
      ...
      db_session.flush()
  
      # Test your function that requires a SQLAlchemy session
      result = my_function(db_session)
      db_session.flush()
      
      # Verify state of the DB after the function has run
      ...

def test_my_function_that_needs_a_session_maker(db_session_maker):
    with db_session_maker.begin() as session:
        # Set up test data in the DB
        ...
    
    # Test your function that requires a SQLAlchemy session maker
    result = my_function(db_session_maker)
    
    with db_session_maker.begin() as session:
        # Verify state of the DB after the function has run
        ...

Run make test to run tests.

Apply Migrations

Configure your deployment pipeline to run the following commands (if using Lambda-backed custom resource to apply migrations):

# Download atlas binary to be packaged into the Lambda Layer
./scripts/download_atlas.sh 'atlas_layer/bin/'
# Deploy the SAM template
sam deploy --template-file cloud-formation.yaml

If using AWS with the provided Lambda-backed custom resource, migrations will automatically be applied when the SAM template is deployed. The Lambda logs will contain details of the outcome.

To manually apply migrations in other contexts, run:

atlas migrate apply -u <DB_CONNECTION_STRING> --dir file://src/db/migrations/

Notes

  • Uses the Atlas versioned workflow for migrations, and only free features (no Atlas Cloud).
  • Any other services or application code that needs to reference the DB schema (e.g. SQLAlchemy models) should be included within this same project, so that they are kept in sync and deployed together.
  • The mechanism for applying migrations (Lambda-backed custom resource) can be changed to suit your project needs, while retaining the commands used for local development and in CI.
  • Any additional AWS resources relating to this project/services should have a DependsOn: DBSchemaMigration set, so that they are only deployed after migrations have successfully completed.