-
At first glance (all I have time for at the moment), I think this sounds like something worth investigating. I suspect it will also make some of my tinkering on multi-membership models more straightforward because the sparsity pattern for those degrades even faster -- they're not even block diagonal in the (1,1) block. In general, then, this implies that we would have the sparse (1,1) block plus one dense block per blocking variable? I also need to think a bit about whether this change would be technically breaking.
-
See also your previous thoughts in #234.
-
To help with benchmarking, we could create a non-exported …
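As a hedged sketch of what such a benchmark might look like, assuming the existing `setθ!`, `updateL!`, and `objective` functions together with BenchmarkTools (the sleepstudy model is just a convenient example):

```julia
using BenchmarkTools, MixedModels

# Fit a small example model, then time a single objective evaluation
# at a fixed parameter value (sketch only; not the proposed helper).
m = fit(MixedModel,
        @formula(reaction ~ 1 + days + (1 + days | subj)),
        MixedModels.dataset(:sleepstudy))
θ = m.θ
@benchmark objective(updateL!(setθ!($m, $θ)))
```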
-
The important benchmark will be for models fit to subject-item data sets such as …
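A hedged sketch of fitting such a crossed subject-item model, using the `mrk17_exp1` data set mentioned below; note that the response and covariate names `rt` and `cond` here are placeholders, not the actual schema of that data set:

```julia
using MixedModels

# Hypothetical subject-item benchmark model. :rt and :cond are
# placeholder column names; :subj and :item are the two crossed
# grouping factors discussed in this thread.
df = MixedModels.dataset(:mrk17_exp1)
form = @formula(rt ~ 1 + cond + (1 + cond | subj) + (1 + cond | item))
m = fit(MixedModel, form, df)
```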
-
This has come up in the code refactoring of the `leverage` method for `LinearMixedModel`, but it could also apply more generally to the overall evaluation of the objective. At present, the symmetric matrix `A = [Z X y]'[Z X y]` is stored (lower triangle only) in blocked form. The rows and columns are divided into `length(reterms) + 1` blocks according to the grouping factors for the random effects (the last block is from `Xymat`, which contains the model matrix for the fixed-effects parameters and the response). The lower triangular Cholesky factor, `L`, of this system is similarly blocked.
For example, in a model for the sleepstudy data with vector-valued random effects for the 18 subjects, the block structure is a 36×36 block-diagonal (1,1) block (18 diagonal blocks of size 2×2) plus a dense block of 3 rows from `Xymat`; the sketch below shows one way to display it.
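A minimal way to inspect that blocking, assuming a recent MixedModels.jl with the `BlockDescription` helper and the bundled `sleepstudy` data:

```julia
using MixedModels

m = fit(MixedModel,
        @formula(reaction ~ 1 + days + (1 + days | subj)),
        MixedModels.dataset(:sleepstudy))
# Shows the rows and storage type of each block of L: a block-diagonal
# 36-row (1,1) block (18 subjects × 2 random effects) followed by the
# dense rows contributed by [X y].
BlockDescription(m)
```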
For a model fit with vector-valued random effects for item and for subject in the `mrk17_exp1` data set, by contrast, the blocking has three row blocks, of dimensions 1200, 438, and 33 (the last, again, from `Xymat`).

For the leverage calculation, we need to solve a linear system defined by `L` to get the leverage of each observation. The big savings in computation comes from the (1,1) block, which is block diagonal. The nature of the system involved results in all blocks except one having zeros on the right-hand side, so only that one block needs to be solved: we go from a dimension-1200 system to a dimension-5 system. After that, things get complicated and the full system involving the (2,1) and (2,2) blocks and the (3,1), (3,2), and (3,3) blocks needs to be solved. A toy illustration of the savings follows.
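As a hedged, self-contained illustration (toy data, not the package internals) of why a right-hand side that is zero outside one diagonal block collapses the solve to that block:

```julia
using LinearAlgebra

# Toy version of the (1,1) block: a block-diagonal lower triangular
# factor made of many small diagonal blocks, as in the subject-item model.
k, nblk = 5, 240                 # 240 blocks of size 5 → dimension 1200
Lblocks = [LowerTriangular(rand(k, k) + 2I) for _ in 1:nblk]

b = zeros(k)                     # RHS restricted to the j-th block;
b[1] = 1.0                       # zero in every other block
j = 17
x_j = Lblocks[j] \ b             # a dimension-5 solve instead of dimension-1200
```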
Essentially, as soon as one block in the first column of blocks is not sparse, everything needs to be treated as dense. The logic of the algorithm gets complicated for the blocks other than the first and the last. So my idea is to always reduce the problem to exactly three blocks by combining all the blocks after (1,1). The (2,1) block would then have dimension (438+33) by 1200 and the (2,2) block would be lower triangular of dimension (438+33) by (438+33). For the leverage function we don't need the last row (from the response), so the (2,1) and (2,2) blocks could be reduced to 470 rows. A sketch of the amalgamation is below.
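A minimal sketch of that amalgamation, with illustrative names rather than the package API: the old (2,1) and (3,1) blocks are stacked vertically, and the new (2,2) block is assembled as `[L22 0; L32 L33]`, which is again lower triangular.

```julia
using LinearAlgebra

# Combine the (2,1)/(3,1) blocks vertically and form the new (2,2)
# block as [L22 0; L32 L33]; the zero block sits above the diagonal,
# so the result stays lower triangular.
function combine_tail(L21, L22, L31, L32, L33)
    newL21 = vcat(L21, L31)      # (438+33) × 1200 in the example above
    z = zeros(size(L22, 1), size(L33, 2))
    newL22 = LowerTriangular([Matrix(L22) z; Matrix(L32) Matrix(L33)])
    return newL21, newL22
end

# e.g. with the dimensions from the example above:
# combine_tail(rand(438, 1200), LowerTriangular(rand(438, 438) + 2I),
#              rand(33, 1200), rand(33, 438), LowerTriangular(rand(33, 33) + 2I))
```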
It is likely that this reduction in the number of blocks would benefit the objective evaluation as well as the leverage calculation. After the first block, essentially all the operations in the update of `L` become dense matrix operations, which can benefit from OpenBLAS, MKL, and whatever the JuliaSIMD folks come up with. The only downside I can see is for nested grouping factors, where blocks like (2,1) are `BlockedSparse`, but those cases are relatively rare.