I was looking for ways to work with and extract information from noisy 3-dimensional images of bacteria called tomograms. I read a book about ways of making the most of messy data using linear algebra and optimization1. In the process, I learned about Principal Component Pursuit.
Here I apply Principal Component Pursuit to a video I shot in my lab. Using this method, I am able to extract the background and foreground of a video, using nothing more than linear algebra and a simple convex optimization problem.
- The top video is the original, which I shot of myself in my lab.
- The middle video is the "background" (not moving) component of the video.
- The bottom video is the "foreground" (moving) component of the video.
One can use such a decomposition in a number of ways. One is to detect and quantify motion, as depicted in the below video.
I have yet to successfully apply this method to tomograms, but my advisor and I thought it was fascinating, so we feel like we succeeded anyway! If you would like to learn more details, read the longer summary below.
I work in a biophysics research group, in which we perform analysis on 3-dimensional images of bacteria called tomograms. These images are often very noisy2, so my advisor and I started reading a textbook1 to learn more ways to deal with the dimensionality and messiness of our data.
One thing we read about was a way to separate an almost-low-rank matrix into a sparse component and a low-rank component using convex optimization. The authors claimed that it could be applied to video, but they did not provide code, so I decided to test it myself. I implemented the method, called Principal Component Pursuit, in Julia.
Although we have yet to find a way to apply Principal Component Analysis to tomograms, we thought the concept was fascinating.
I recorded the top video in my lab. I wanted a video with a mostly static background and something moving in the foreground.
Using the method, which I will describe next, I separate the video into a static component and a moving component. The static component is the middle video. The moving component is the bottom video. In other words, the bottom video added to the middle video yields the top video.
Each frame of a grayscale video can be thought of as a matrix of grayscale values. For each frame, I take that matrix and flatten it into a vector. Thus, each frame of the video can be represented as a long vector with as many elements as there are pixels in a frame. I will call this vector a "frame vector".
By representing the frames as vectors, the entire video can itself be represented as a matrix. This is done by stacking all of the frame vectors side by side into a huge matrix, with as many columns as there are frames in the video. I will call this matrix a "video matrix".
In a video with a mostly static background and something moving in the foreground, the video matrix is almost low rank, since the frame vectors are mostly the same (since most of the pixels don't change). But it isn't, because of the movement in the foreground. Nevertheless, we can find a video matrix that is nearly equal to the original video matrix but is in fact low rank. In simpler terms, we can extract the background of the video.
Let
One could set this up as an optimization problem
for some tuning parameter
Rather, we set up the problem using convex surrogate norms:
where
This new problem is convex! It is easy to solve with off-the-shelf convex optimizers. I have opted to implement the optimizer myself, but other libraries like CVXPY (in Python) or Convex.jl (for Julia) should work fine.
By solving
we find video matrices
First, we want to minimize the rank of
Second, we want to minimize the number of nonzero elements of
For more formal justification for these choices of norm, consult Wright and Ma's textbook1.
The code should be viewed with a font capable of handling mathematical Unicode characters like 𝒮, ϵ, and 𝐕ᵀ. I don't generally code this way, but since Julia offers the support, I thought I'd experiment with making the code match the book's notation. Not everyone agrees with this practice, and I don't think I would name the variables this way if I were to code it again. Below is a comparison of the ADMM algorithm pseudocode in Wright & Ma's book and my implementation in PCP.jl
. Julia makes this kind of mathematical coding easy.