diff --git a/episodes/01-introduction.md b/episodes/01-introduction.md
new file mode 100644
index 0000000..bfefa6a
--- /dev/null
+++ b/episodes/01-introduction.md
@@ -0,0 +1,241 @@
+---
+title: "Running commands with Maestro"
+teaching: 30
+exercises: 30
+---
+
+::: questions
+- "How do I run a simple command with Maestro?"
+:::
+
+::: objectives
+- "Create a Maestro YAML file"
+:::
+
+
+## What is the workflow I'm interested in?
+
+In this lesson we will set up an experiment that takes an application which runs
+in parallel and investigate its scalability. To do that we will need to gather
+data; in this case, that means running the application multiple times with
+different numbers of CPU cores and recording the execution time. Once we've
+done that, we need to create a visualisation of the data to see how it compares
+against the ideal case.
+
+From the visualisation we can then decide at what scale it
+makes most sense to run the application in production to maximise the use of
+our CPU allocation on the system.
+
+We could do all of this manually, but there are useful tools to help us manage
+data analysis pipelines like the one in our experiment. Today we'll learn about
+one of those: Maestro.
+
+In order to get started with Maestro, let's begin by taking a simple command
+and see how we can run that via Maestro. Let's choose the command `hostname`,
+which prints out the name of the host where the command is executed:
+
+```bash
+janeh@pascal83:~$ hostname
+```
+```output
+pascal83
+```
+
+That prints out the result, but Maestro relies on files to know the status of
+your workflow, so let's redirect the output to a file:
+
+```bash
+janeh@pascal83:~$ hostname > hostname_login.txt
+```
+
+## Writing a Maestro YAML
+
+Create a new text file named `hostname.yaml`.
+
+Contents of `hostname.yaml`:
+
+```yml
+description:
+    name: Hostnames
+    description: Report a node's hostname.
+
+study:
+    - name: hostname-login
+      description: Write the login node's hostname to a file
+      run:
+          cmd: |
+              hostname > hostname_login.txt
+```
+
+::: callout
+
+## Key points about this file
+
+1. The name of `hostname.yaml` is not very important; it gives us information
+   about file contents and type, but Maestro will behave the same if you rename
+   it to `hostname` or `foo.txt`.
+1. The file specifies fields in a hierarchy. For example, `name`, `description`,
+   and `run` are all passed to `study` and are at the same level in the hierarchy.
+   `description` and `study` are both at the top level in the hierarchy.
+1. Indentation indicates the hierarchy and should be consistent. For example, all
+   the fields passed directly to `study` are indented relative to `study` and
+   their indentation is all the same.
+1. The commands executed during the study are given under `cmd`. Starting this
+   entry with `|` and a newline character allows us to specify multiple commands.
+1. The example YAML file above is pretty minimal; all fields shown are required.
+1. The names given to `study` can include letters, numbers, and special characters.
+
+
+:::
+
+Back in the shell we'll run our new study. At this point, we may see an error if
+a required field is missing or if our indentation is inconsistent.
+
+```bash
+$ maestro run hostname.yaml
+```
+
+::: callout
+
+## `bash: maestro: command not found...`
+
+If your shell tells you that it cannot find the command `maestro`, then we need
+to make the software available somehow. In our case, this means activating the
+Python virtual environment where Maestro is installed.
+```bash +source /usr/global/docs/training/janeh/maestro_venv/bin/activate +``` + +You can tell this command has already been run when `(maestro_venv)` appears +before your command prompt: + + +```bash +janeh@pascal83:~$ source /usr/global/docs/training/janeh/maestro_venv/bin/activate +(maestro_venv) janeh@pascal83:~$ +``` + +Now that the `maestro_venv` virtual environment has been activated, the `maestro` +command should be available, but let's double check + +```bash +(maestro_venv) janeh@pascal83:~$ which maestro +``` +```output +/usr/global/docs/training/janeh/maestro_venv/bin/maestro +``` +::: + + +## Running maestro + +Once you have `maestro` available to you, run `maestro run hostname.yaml` +and enter `y` when prompted + +```bash +(maestro_venv) janeh@pascal83:~$ maestro run hostname.yaml +[2024-03-20 15:39:34: INFO] INFO Logging Level -- Enabled +[2024-03-20 15:39:34: WARNING] WARNING Logging Level -- Enabled +[2024-03-20 15:39:34: CRITICAL] CRITICAL Logging Level -- Enabled +[2024-03-20 15:39:34: INFO] Loading specification -- path = hostname.yaml +[2024-03-20 15:39:34: INFO] Directory does not exist. Creating directories to /g/g0/janeh/Hostnames_20240320-153934/logs +[2024-03-20 15:39:34: INFO] Adding step 'hostname-login' to study 'Hostnames'... +[2024-03-20 15:39:34: INFO] +------------------------------------------ +Submission attempts = 1 +Submission restart limit = 1 +Submission throttle limit = 0 +Use temporary directory = False +Hash workspaces = False +Dry run enabled = False +Output path = /g/g0/janeh/Hostnames_20240320-153934 +------------------------------------------ +Would you like to launch the study? [yn] y +Study launched successfully. +``` + +and look at the outputs. You should have a new directory whose name includes a +date and timestamp and that starts with the `name` given under `description` +at the top of `hostname.yaml`. + +In that directory will be a subdirectory for every `study` run from +`hostname.yaml`. 
The subdirectories for each study include all output files
+for that study.
+
+```bash
+(maestro_venv) janeh@pascal83:~$ cd Hostnames_20240320-153934/
+(maestro_venv) janeh@pascal83:~/Hostnames_20240320-153934$ ls
+```
+```output
+batch.info  Hostnames.pkl  Hostnames.txt  logs  status.csv
+hostname-login  Hostnames.study.pkl  hostname.yaml  meta
+```
+```bash
+(maestro_venv) janeh@pascal83:~/Hostnames_20240320-153934$ cd hostname-login/
+(maestro_venv) janeh@pascal83:~/Hostnames_20240320-153934/hostname-login$ ls
+```
+```output
+hostname-login.2284862.err  hostname-login.2284862.out  hostname-login.sh  hostname_login.txt
+```
+
+::: challenge
+
+To which file will the login node's hostname, `pascal83`, be written?
+
+1. hostname-login.2284862.err
+2. hostname-login.2284862.out
+3. hostname-login.sh
+4. hostname_login.txt
+
+:::::: solution
+(4) hostname_login.txt
+
+In the original `hostname.yaml` file that we ran, we specified that
+the hostname would be written to `hostname_login.txt`, and this is where
+we'll see that output, if the run worked!
+::::::
+:::
+
+::: challenge
+
+This one is tricky! In the example above, `pascal83` was written to
+`.../Hostnames_{date}_{time}/hostname-login/hostname_login.txt`.
+
+Where would `hello` be written for the following YAML?
+
+```yml
+description:
+    name: MyHello
+    description: Report a greeting.
+
+study:
+    - name: give-salutation
+      description: Write a greeting to a file
+      run:
+          cmd: |
+              echo "hello" > greeting.txt
+```
+
+
+1. `.../give-salutation_{date}_{time}/greeting/greeting.txt`
+2. `.../greeting_{date}_{time}/give_salutation/greeting.txt`
+3. `.../MyHello_{date}_{time}/give-salutation/greeting.txt`
+4. `.../MyHello_{date}_{time}/greeting/greeting.txt`
+
+:::::: solution
+
+(3) `.../MyHello_{date}_{time}/give-salutation/greeting.txt`
+
+The toplevel folder created starts with the `name` field under `description`; here, that's `MyHello`.
+Its subdirectory is named after the `study`; here, that's `give-salutation`.
+The file created is `greeting.txt`, and this stores the output of `echo "hello"`.
+
+::::::
+:::
+
+::: keypoints
+
+- "You execute `maestro run` with a YAML file including information about your run."
+- "Your run includes a description and at least one study (a step in your run)."
+- "Your Maestro run creates a directory with subdirectories and outputs for each study."
+
+:::
diff --git a/episodes/02-maestro_on_the_cluster.md b/episodes/02-maestro_on_the_cluster.md
new file mode 100644
index 0000000..a14be16
--- /dev/null
+++ b/episodes/02-maestro_on_the_cluster.md
@@ -0,0 +1,190 @@
+---
+title: "Running Maestro on the cluster"
+teaching: 30
+exercises: 20
+---
+
+::: objectives
+
+- "Define steps to run locally and on the cluster"
+
+:::
+
+# How do I run Maestro on the cluster?
+
+What happens when we want to run on the cluster ("to run a batch job") rather
+than the login node? The cluster we are using runs Slurm, and Maestro has
+built-in support for Slurm. We just need to tell Maestro which resources we
+need Slurm to grab for our run.
+
+First, we need to add a `batch` block to our YAML file, where we'll provide the
+names of the machine, bank, and queue in which our jobs should run.
+
+```yml
+batch:
+    type: slurm
+    host: pascal    # enter the machine you'll run on
+    bank: lc        # enter the bank to charge
+    queue: pvis     # enter the partition in which your job should run
+```
+
+Second, we need to specify the number of nodes, number of processes, and walltime
+for *each study* to be run from our YAML file. This information goes under each
+study's `run` field:
+
+```yml
+(...)
+      run:
+          cmd: |
+              hostname >> hostname.txt
+          nodes: 1
+          procs: 1
+          walltime: "00:00:30"
+```
+
+Here we specify 1 node, 1 process, and a time limit of 30 seconds. **Note** that
+the walltime is given in the format "HH:MM:SS", and the quotation marks are part
+of the format.
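If it helps to sanity-check a walltime string, here is a small Python sketch (our own illustration, not part of Maestro) that converts the "HH:MM:SS" format into a total number of seconds:

```python
# Illustration only (not Maestro code): interpret a "HH:MM:SS" walltime string.
def walltime_to_seconds(walltime):
    """Convert an "HH:MM:SS" walltime string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in walltime.split(":"))
    return hours * 3600 + minutes * 60 + seconds

print(walltime_to_seconds("00:00:30"))  # 30
print(walltime_to_seconds("00:01:30"))  # 90
```

So the `"00:00:30"` above asks Slurm for a 30-second time limit.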
+
+With these changes, our updated YAML file might look like
+
+```yml
+description:
+    name: Hostnames
+    description: Report a node's hostname.
+
+batch:
+    type: slurm
+    host: pascal    # machine to run on
+    bank: lc        # bank
+    queue: pvis     # partition
+
+study:
+    - name: hostname-login
+      description: Write the login node's hostname to a file
+      run:
+          cmd: |
+              hostname > hostname_login.txt
+    - name: hostname_batch
+      description: Write the node's hostname to a file
+      run:
+          cmd: |
+              hostname >> hostname.txt
+          nodes: 1
+          procs: 1
+          walltime: "00:00:30"
+```
+
+Note that we left the step `hostname-login` as is. Because we do not specify any info for Slurm under this original step's `run` field -- like nodes, processes, or walltime -- this step will continue running on the login node, and only `hostname_batch` will be handed off to Slurm.
+
+::: challenge
+## Running on the cluster
+
+Modify your YAML file, `hostname.yaml`, to execute `hostname` on the _cluster_.
+Run with 1 node and 1 process, using the bank `guest` on the partition
+`psummer` on `quartz`.
+
+If you run this multiple times, do you always run on the same node?
+(Is the hostname printed always the same?)
+
+:::::: solution
+
+The contents of `hostname.yaml` should look something like:
+
+```yml
+description:
+    name: Hostnames
+    description: Report a node's hostname.
+
+batch:
+    type: slurm
+    host: quartz    # machine to run on
+    bank: guest     # bank
+    queue: psummer  # partition
+
+study:
+    - name: hostname-login
+      description: Write the login node's hostname to a file
+      run:
+          cmd: |
+              hostname > hostname_login.txt
+    - name: hostname_batch
+      description: Write the node's hostname to a file
+      run:
+          cmd: |
+              hostname >> hostname.txt
+          nodes: 1
+          procs: 1
+          walltime: "00:00:30"
+```
+
+When you run this job, a directory called `Hostnames_...` will be created. If you look in the subdirectory `hostname_batch`, you'll find a file called `hostname.txt` with info about the compute node where the `hostname` command ran.
If you run the job multiple times, you will probably land on different nodes; this means you'll see different node numbers in different hostname.txt files. If you see the same number more than once, don't worry! If you get an answer other than `pascal83`, you're doing it correctly. :) + +:::::: + +::: + +## Outputs from a batch job + +When running in batch, `maestro run...` will create a new directory with the +same naming scheme as seen in episode 1, and that directory will contain +subdirectories for all studies. The `hostname_batch` subdirectory has four +output files, but this time the file ending with extension `.sh` is a slurm +submission script + +```bash +(maestro_venv) janeh@pascal83:~/Hostnames_20240320-170150/hostname_batch$ ls +hostname.err hostname.out hostname.slurm.sh hostname.txt +(maestro_venv) janeh@pascal83:~/Hostnames_20240320-170150/hostname_batch$ cat hostname.slurm.sh +#!/bin/bash +#SBATCH --nodes=1 +#SBATCH --partition=pvis +#SBATCH --account=lc +#SBATCH --time=00:00:30 +#SBATCH --job-name="hostname" +#SBATCH --output="hostname.out" +#SBATCH --error="hostname.err" +#SBATCH --comment "Write the node's hostname to a file" + +hostname > hostname.txt +``` + +Maestro uses the info from your YAML file to write this script and then +submits it to the scheduler for you. Soon after you run on the cluster via +`maestro run hostname.yaml`, you should be able to see the job +running or finishing up in the queue with the command `squeue -u > amdahl.out + nodes: 1 + procs: 1 + walltime: "00:00:30" +``` + +Exact wording for names and descriptions is not important, but should +help you to remember what this file and its study are doing. + +:::::: +::: + +::: challenge + +After checking that `amdahl.yaml` looks similar to the solution above, +run `maestro run amdahl.yaml`. Then, update the number of `nodes` and +`procs` each to `2`. You should also increase the walltime a bit, to +a minute or minute and a half. Then rerun `maestro run amdahl.yaml`. 
+How does the output change? How did you expect it to change?
+
+*Hint:* Remember that if you run `squeue -u <username>`, you can
+see the node(s) assigned to your Slurm job.
+
+:::::: solution
+
+In your output files (`amdahl.out` if using the script in the last
+solution), you probably see output looking something like
+
+```
+Doing 30.000000 seconds of 'work' on 1 processor,
+ which should take 30.000000 seconds with 0.800000 parallel proportion of the workload.
+
+ Hello, World! I am process 0 of 1 on pascal17. I will do all the serial 'work' for 5.324555 seconds.
+ Hello, World! I am process 0 of 1 on pascal17. I will do parallel 'work' for 22.349517 seconds.
+
+Total execution time (according to rank 0): 27.755552 seconds
+```
+
+Notice that this output refers to only "1 processor" and mentions
+only `pascal17`, even in the job that requested and received two
+nodes. You will likely see a different node number, but most likely you will
+still see only a single processor mentioned in the output. If you
+ran `squeue -u <username>` while the job was in the queue, you should
+have seen two unique node numbers assigned to your job.
+
+So what's going on? If your job were really *using* both nodes
+that were assigned to it, then both processes would have written
+to `amdahl.out`.
+
+::::::
+:::
+
+## Maestro and MPI
+
+We didn't really run an MPI application in the last section, as we only ran on
+one processor. How do we request to run on multiple processors for a single
+step?
+
+The answer is that we have to tell Slurm that we want to use MPI. In the Intro
+to HPC lesson, the episodes introducing Slurm and running parallel jobs showed
+that commands to run in parallel need to use `srun`. `srun` talks to MPI and
+allows multiple processors to coordinate work. A call to `srun` might look
+something like
+
+```bash
+srun -N {number of nodes} -n {number of processes} amdahl >> amdahl.out
+```
+
+To make this easier, Maestro offers the shorthand `$(LAUNCHER)`.
Maestro
+will replace instances of `$(LAUNCHER)` with a call to `srun`, specifying
+as many nodes and processes as we've already told Slurm we want to use.
+
+::: challenge
+
+Update `amdahl.yaml` to include `$(LAUNCHER)` before the call to `amdahl`
+in your study's `run` field. Re-run Maestro with the updated YAML and
+explore the outputs. How many nodes are mentioned in `amdahl.out`?
+In the Slurm submission script created by Maestro (included in the same
+subdirectory as `amdahl.out`), what text was used to replace `$(LAUNCHER)`?
+
+:::::: solution
+
+The updated YAML should look something like
+
+```yml
+description:
+    name: Amdahl
+    description: Run a parallel program
+
+batch:
+    type: slurm
+    host: quartz    # machine to run on
+    bank: guest     # bank
+    queue: pbatch   # partition
+
+study:
+    - name: amdahl
+      description: run in parallel
+      run:
+          # Here's where we include our MPI wrapper:
+          cmd: |
+              $(LAUNCHER) amdahl >> amdahl.out
+          nodes: 2
+          procs: 2
+          walltime: "00:01:30"
+```
+
+Your output file `Amdahl_.../amdahl/amdahl.out` should include
+"Doing 30.000000 seconds of 'work' on 2 processors" and the submission
+script `Amdahl_.../amdahl/amdahl.slurm.sh` should include the line
+"srun -n 2 -N 2 amdahl >> amdahl.out". Maestro substituted
+`srun -n 2 -N 2` for `$(LAUNCHER)`!
+
+::::::
+:::
+
+::: callout
+## Commenting Maestro YAML files
+
+In the solution from the last challenge, the line beginning with `#` is a comment. Hopefully
+you are already in the habit of adding comments to your own scripts. Good comments make any
+script more readable, and this is just as true for our YAML files.
+
+:::
+
+
+## Customizing amdahl output
+
+Another thing about our application `amdahl` is that we ultimately want to
+process the output to generate our scaling plot. The output right now is useful
+for reading but makes processing harder. `amdahl` has an option that actually
+makes this easier for us.
To see the `amdahl` options, we can use
+
+```bash
+(maestro_venv) janeh@pascal83:~$ amdahl --help
+```
+```output
+usage: amdahl [-h] [-p [PARALLEL_PROPORTION]] [-w [WORK_SECONDS]] [-t] [-e]
+
+options:
+  -h, --help            show this help message and exit
+  -p [PARALLEL_PROPORTION], --parallel-proportion [PARALLEL_PROPORTION]
+                        Parallel proportion should be a float between 0 and 1
+  -w [WORK_SECONDS], --work-seconds [WORK_SECONDS]
+                        Total seconds of workload, should be an integer greater than 0
+  -t, --terse           Enable terse output
+  -e, --exact           Disable random jitter
+```
+The option we are looking for is `--terse`, which will make `amdahl` print
+output in a format that is much easier to process: JSON. JSON format in a file
+typically uses the file extension `.json`, so let's add that option to our
+`cmd` _and_ change the name of the output file to match our new
+command:
+
+```yml
+description:
+    name: Amdahl
+    description: Run a parallel program
+
+batch:
+    type: slurm
+    host: quartz    # machine to run on
+    bank: guest     # bank
+    queue: pbatch   # partition
+
+study:
+    - name: amdahl
+      description: run in parallel
+      run:
+          # Here's where we include our MPI wrapper:
+          cmd: |
+              $(LAUNCHER) amdahl --terse >> amdahl.json
+          nodes: 2
+          procs: 2
+          walltime: "00:01:30"
+```
+
+There was another parameter for `amdahl` that caught my eye. `amdahl` has an
+option `--parallel-proportion` (or `-p`) which we might be interested in
+changing, as it changes the behaviour of the code and therefore has an impact on
+the values we get in our results.
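To build some intuition for why the parallel proportion matters so much, here is a quick sketch of Amdahl's law itself (our own Python illustration; this helper is not part of the `amdahl` package):

```python
# Illustration only: Amdahl's law, which the amdahl application simulates.
def amdahl_speedup(p, n):
    """Theoretical speedup on n processors when a proportion p of the
    workload can be parallelised (the rest must run serially)."""
    return 1.0 / ((1.0 - p) + p / n)

# With the default p = 0.8, four processors give well under a 4x speedup:
print(round(amdahl_speedup(0.8, 4), 2))  # 2.5
# Raising the parallel proportion to 0.9 helps noticeably:
print(round(amdahl_speedup(0.9, 4), 2))  # 3.08
```

Changing `-p` changes the serial fraction of the simulated workload, which is exactly what limits how far the runtime can drop as we add processors.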
Let's try specifying a parallel proportion +of 90%: + +```yml +description: + name: Amdahl + description: Run a parallel program + +batch: + type: slurm + host: quartz # machine to run on + bank: guest # bank + queue: pbatch # partition + +study: + - name: amdahl + description: run in parallel + run: + # Here's where we include our MPI wrapper: + cmd: | + $(LAUNCHER) amdahl --terse -p .9 >> amdahl.json + nodes: 2 + procs: 2 + walltime: "00:01:30" +``` + +Our current directory is probably starting to fill up with directories +starting with `Amdahl_...`, distinguished only by dates and timestamps. +It's probably best to group runs into separate folders to keep things tidy. +One way we can do this is by specifying an `env` section in our YAML +file with a variable called `OUTPUT_PATH` specified in this format: + +```yml +env: + variables: + OUTPUT_PATH: ./Episode3 +``` + +This `env` block goes above our `study` block. In this case, directories +created by runs using this `OUTPUT_PATH` will all be grouped inside the +directory `Episode3`, to help us group runs by where we are in the lesson. + + +::: challenge + +Create a YAML file for a value of `-p` of 0.999 (the default value is 0.8) +for the case where we have a single node and 6 parallel processes. +Directories for subsequent runs should be grouped into a shared parent +directory (for example, `Episode3`, as above). 
+ +:::::: solution + +```yml +description: + name: Amdahl + description: Run a parallel program + +batch: + type: slurm + host: quartz # machine to run on + bank: guest # bank + queue: pbatch # partition + +env: + variables: + OUTPUT_PATH: ./Episode3 + +study: + - name: amdahl + description: run in parallel + run: + # Here's where we include our MPI wrapper: + cmd: | + $(LAUNCHER) amdahl --terse -p .999 >> amdahl.json + nodes: 1 + procs: 6 + walltime: "00:01:30" +``` + +:::::: +::: + +## Dry-run (`--dry`) mode + +It's often useful to run Maestro in `--dry` mode, which causes Maestro to create scripts +and the directory structure without actually running jobs. You will see this parameter +if you run `maestro run --help`. + +::: challenge + +Do a dry-run using the script created in the last challenge. This should help you +verify that a new directory gets created for runs from this episode. + +:::::: solution + +After running + +```bash +maestro run --dry amdahl.yaml +``` +a directory path of the form `Episode3/Amdahl_{DATE}_{TIME}/amdahl` should +be created. + +:::::: +::: + + +::: keypoints + +- "Adding `$(LAUNCHER)` before commands signals to Maestro to use MPI via `srun`." +- "New Maestro runs can be grouped within a new directory specified by the environment +variable `OUTPUT_PATH`" +- You can use `--dry` to verify that the expected directory structure and scripts +are created by a given Maestro YAML file. + +::: diff --git a/episodes/04-placeholders.md b/episodes/04-placeholders.md new file mode 100644 index 0000000..dc05775 --- /dev/null +++ b/episodes/04-placeholders.md @@ -0,0 +1,235 @@ +--- +title: "Placeholders" +teaching: 40 +exercises: 30 +--- + +::: questions +- "How do I make a generic rule?" +::: + +::: objectives +- "Learn to use variables as placeholders" +- "Learn to run many similar Maestro runs at once" +::: + +::: callout +## D.R.Y. 
(Don't Repeat Yourself)
+
+In many programming languages, the bulk of the language features are
+there to allow the programmer to describe long-winded computational
+routines as short, expressive, beautiful code. Features in Python,
+R, or Java, such as user-defined variables and functions, are useful in
+part because they mean we don't have to write out (or think about)
+all of the details over and over again. This good habit of writing
+things out only once is known as the "Don't Repeat Yourself"
+principle, or D.R.Y.
+:::
+
+Maestro YAML files are a form of code and, in any code, repetition can
+lead to problems (e.g. we rename a data file in one part of the YAML
+but forget to rename it elsewhere).
+
+In this episode, we'll set ourselves up with ways to avoid repeating
+ourselves by using *environment variables* as *placeholders*.
+
+
+## Placeholders
+
+At the end of our last episode, our YAML file contained the sections
+
+```yml
+(...)
+
+env:
+    variables:
+        OUTPUT_PATH: ./Episode3
+
+study:
+    - name: amdahl
+      description: run in parallel
+      run:
+          # Here's where we include our MPI wrapper:
+          cmd: |
+              $(LAUNCHER) amdahl --terse -p .999 >> amdahl.json
+          nodes: 1
+          procs: 6
+          walltime: "00:01:30"
+```
+
+Here we were already using a placeholder -- `$(LAUNCHER)` -- which held
+the place of, and was later swapped out for, a call to `srun` specifying
+the nodes and tasks wanted.
+
+Let's create another environment variable in the `variables` section under
+`env`. We can define a new parallel proportion as `P: .999`. Then, under
+`run`'s `cmd`, we can call this environment variable with the syntax
+`$(P)`. `$(P)` holds the place of `.999` and will be substituted by it when
+Maestro creates a Slurm submission script for us
+
+```yml
+(...)
+ +env: + variables: + P: .999 + OUTPUT_PATH: ./Episode4 + +study: + - name: amdahl + description: run in parallel + run: + # Here's where we include our MPI wrapper: + cmd: | + $(LAUNCHER) amdahl --terse -p $(P) >> amdahl.json + nodes: 1 + procs: 6 + walltime: "00:01:30" +``` + +(Note that the `OUTPUT_PATH` was also updated to reflect the current +episode.) + +It may also be helpful to create a variable for our output file, like this: + +```yml +(...) + +env: + variables: + P: .999 + OUTPUT: amdahl.json + OUTPUT_PATH: ./Episode4 + +study: + - name: amdahl + description: run in parallel + run: + # Here's where we include our MPI wrapper: + cmd: | + $(LAUNCHER) amdahl --terse -p $(P) >> $(OUTPUT) + nodes: 1 + procs: 6 + walltime: "00:01:30" +``` + +## Maestro's global.parameters + +We're almost ready to perform our scaling study -- to see how the amount of work per processor +changes as we use more processors in the job. One way to do this would be to update the line + +```yml + procs: 6 +``` + +and to manually re-run `maestro run...` several times with different numbers of processes. + +An alternative is to avoid repeating ourselves by defining a **parameter** that lists multiple +values of tasks and runs a separate job for each value. We do this by adding a +`global.parameters` section at the bottom of the script. We then list individual parameters +within this section. Each parameter must include a list of its values and a label, using the +following syntax: + + +```yml +global.parameters: + TASKS: + values: [2, 4, 8, 18, 24, 36] + label: TASKS.%% +``` + +Note that the parameter is `TASKS` and that its label starts with the same name, but is followed +by a `.%%`. 
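As a rough illustration of what that label does (our own Python sketch, not Maestro's internal implementation), Maestro substitutes each value in `values` for the `%%` in the label, and the resulting strings distinguish the per-parameter runs:

```python
# Illustration only: expanding a Maestro parameter label.
# The `%%` in the label acts as a placeholder for each value in turn.
values = [2, 4, 8, 18, 24, 36]
label = "TASKS.%%"

labels = [label.replace("%%", str(value)) for value in values]
print(labels)
# ['TASKS.2', 'TASKS.4', 'TASKS.8', 'TASKS.18', 'TASKS.24', 'TASKS.36']
```

One labelled run is launched per value, so this single `global.parameters` block replaces six manual edit-and-rerun cycles.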
+
+We would then update the line under `run` defining `procs` to include the name
+of the parameter enclosed in `$()`:
+
+```yml
+          procs: $(TASKS)
+```
+
+The full YAML file will look like
+
+```yml
+description:
+    name: Amdahl
+    description: Run a parallel program
+
+batch:
+    type: slurm
+    host: quartz    # machine to run on
+    bank: guest     # bank
+    queue: pbatch   # partition
+
+env:
+    variables:
+        P: .999
+        OUTPUT: amdahl.json
+        OUTPUT_PATH: ./Episode4
+
+study:
+    - name: amdahl
+      description: run in parallel
+      run:
+          # Here's where we include our MPI wrapper:
+          cmd: |
+              $(LAUNCHER) amdahl --terse -p $(P) >> $(OUTPUT)
+          nodes: 1
+          procs: $(TASKS)
+          walltime: "00:01:30"
+
+global.parameters:
+    TASKS:
+        values: [2, 4, 8, 18, 24, 36]
+        label: TASKS.%%
+```
+
+::: challenge
+
+Run `maestro run --dry amdahl.yaml` using the above YAML file
+and investigate the resulting directory structure. How does
+the list of task values under `global.parameters` change the
+output directory organization?
+
+:::::: solution
+
+Under your current working directory, you should see a directory structure
+created with the following format -- `Episode4/Amdahl_-