crab
is a simple unix toolkit for working with local development environments.
It is intended to allow developers to run multiple twelve-factor style web applications locally and address them via hostnames rather than port numbers. It is deliberately designed to be as simple as possible, and therefore does not attempt to manage multiple processes itself.
Crab does the following:
- Understands env files.
- Understands procfiles.
- Can manipulate the
$PATH
environment variable. - Manages ports and provides a local virtual host router.
The crab binary can be called in three ways:
- With a literal command to run:
crab python manage.py shell
- With the name of a process from a procfile:
crab web
- As a special-case to start a virtual host router:
crab router
(see below).
Crab itself is configured entirely through environment variables, and so a developer can set a few simple env vars (in .bashrc
, say) to match how their projects are laid out, and then Crab just does the "right thing".
An env file is a text file containing key/value pairs, like this:
SOME_VARIABLE=somevalue
ANOTHER_VARIABLE=anothervalue
Following 12 factor guidelines, an app's config (everything that is likely to vary between different environments) should be stored in environment variables
. Env files provide a simple way for developers to specify environment variables for a project.
By default, Crab will look for an env file called .env
inside the project directory (ie wherever crab
is executed from). This can be overridden using the ENV_FILE
environment variable. ENV_FILE
can be a comma-separated list of file paths, which will be parsed in order. For example, you could use configs/development/env
as your checked-in env file, and then have a local .env
to override individual variables. In this case, use export ENV_FILE=configs/development/env,.env
.
$ echo 'FOO=bar' > .env
$ crab sh -c 'echo $FOO'
bar
A procfile is a text file that defines the processes that need to be running for your application to work. It contains a mapping from process names to commands, like this:
web: python manage.py runserver
worker: python manage.py worker
By default, Crab will look for a procfile called Procfile
inside the project directory (ie wherever crab
is executed from). This can be overridden using the PROC_FILE
environment variable. For example export PROC_FILE=configs/development/procfile
. To specify multiple procfile locations, which are used in sequence, use export PROC_FILE="configs/development/Procfile,configs/development/procfile"
To run a process defined in a procfile, use crab <processname>
. For example, crab web
would start the web
process defined in the example procfile above. Note that Crab only runs a single process from the procfile. It cannot start all of the processes in the procfile at the same time. This is by design. If you wish to use multiple processes from the procfile, just start each one separately with crab
in a separate terminal split or tab.
Tools that are designed to isolate per-project dependency environments often work by making a copy of the language binary and libraries inside a project-specific subdirectory. The main example is Python's virtualenv
tool.
Crab can add the path to this virtualenv onto the front of the $PATH
environment variable. This means that you do not need to "activate" the virtualenv before using it - simply running crab python manage.py runserver
will automatically use the python
binary inside the virtualenv.
By default, the path env/bin
is prepended to $PATH
. You can override this by setting the BIN_DIRS
environment variable.
Many developers work on multiple projects and/or multiple services at the same time. When each requires its own web server, each server process requires a different port to bind to. For example, Django's manage.py runserver
binds to port 8000 by default. If you're working on two Django projects simulateously, you will have to run the second on a different port, say manage.py runserver 0.0.0.0:8001
. Once you're developing on five or ten microservices, this can become very difficult to manage.
Crab helps by providing a free port to processes that need one. The port is provided in the environment (as $PORT
) as well as substituted into the command. For example:
crab python manage.py '0.0.0.0:$PORT'
You'll see Django's development server start up with (say):
Django version 2.2, using settings 'project.settings'
Starting development server at http://0.0.0.0:63601/
Quit the server with CONTROL-C.
A port is provided to any command with the name "web
" in a Procfile, or any command containing the string "$PORT
". In other circumstances, it can be explicitly requested by setting the environment variable CRAB_PROVIDE_PORT
.
(Note that for non-procfile commands, the variable must be quoted, or else the shell will try to substitute $PORT
, which won't work).
The ports functionality above is only useful in combination with another component of Crab: the virtual host router. Start this up by typing crab router
in a separate terminal tab or split. By default, the router binds to port 8080
(see below for more details on this).
Now, if any of the other processes you run have an environment variable called VIRTUAL_HOST
, the router can "see" them and automatically route traffic to the port they've been provided with.
You can set this environment variable in the .env
file for your project e.g.
VIRTUAL_HOST=mywebsite.localhost
Then you can start (or restart) your project, visit http://mywebsite.localhost:8080
in your browser, and the traffic will magically be routed to the right place.
(Note that at least Chrome automatically routes everything with the TLD .localhost
to 127.0.0.1. Other browsers may or may not follow this standard).
The port that the router binds to can be changed by setting the CRAB_ROUTER_PORT
env var. If this is not set, the router will first try to bind to port 80
, and then fall back to 8080
if it fails. This means that if you start the router with sudo crab router
, you can then just use http://mywebsite.localhost
in your browser - even better!
The router is designed for local development only, so binds to 127.0.0.1
by default. You can set CRAB_ROUTER_HOST
to customize this.
Python doesn't have a great built-in way of installing command line tools. There are a few options:
Crab can be downloaded from our homebrew tap:
brew install dabapps/tap/crab
This will also add the router as a service, which can be started with brew services start crab
.
You can try pip install --user crabtools
. This will install crab
and its dependencies globally. Depending on how you set up your development environment, this may not be desirable.
You can create a virtualenv somewhere on your machine, pip install crabtools
into it, and then put that virtualenv's bin
dir on your $PATH
(for example, by setting $PATH
in your .bashrc
) or link the binary onto somewhere that's already on your $PATH
(eg sudo ln -s /path/to/your/venv/bin/crab /usr/local/bin/crab
).
pipx
is a great tool for managing command line programs written in Python. It basically creates and manages virtualenvs containing isolated command line tools. Follow the instructions to install pipx
and then pipx install crabtools
.
Please ensure all code conforms to Black formatting rules. Install black
in your virtualenv and then crab black crab/ setup.py
.