@@ -148,7 +148,57 @@ class CholeskyCorr(Transform):
148148
149149 #### Mathematical Details
150150
151- [Include detailed mathematical explanations similar to the original TFP bijector.]
151+ This bijector provides a change of variables from unconstrained reals to a
152+ parameterization of the CholeskyLKJ distribution. The CholeskyLKJ distribution
153+ [1] is a distribution on the set of Cholesky factors of positive definite
154+ correlation matrices. The CholeskyLKJ probability density function is
155+ obtained from the LKJ density on n x n matrices as follows:
156+
157+ 1 = int p(A | eta) dA
158+ = int Z(eta) * det(A) ** (eta - 1) dA
159+ = int Z(eta) L_ii ** {(n - i - 1) + 2 * (eta - 1)} ^dL_ij (0 <= i < j < n)
160+
161+ where Z(eta) is the normalizer; the matrix L is the Cholesky factor of the
162+ correlation matrix A; and ^dL_ij denotes the wedge product (or differential)
163+ of the strictly lower triangular entries of L. The entries L_ij are
164+ constrained such that each entry lies in [-1, 1] and the norm of each row is
165+ 1. The norm includes the diagonal; which is not included in the wedge product.
166+ To preserve uniqueness, we further specify that the diagonal entries are
167+ positive.
168+
169+ The image of unconstrained reals under the `CorrelationCholesky` bijector is
170+ the set of correlation matrices which are positive definite. A [correlation
171+ matrix](https://en.wikipedia.org/wiki/Correlation_and_dependence#Correlation_matrices)
172+ can be characterized as a symmetric positive semidefinite matrix with 1s on
173+ the main diagonal.
174+
175+ For a lower triangular matrix `L` to be a valid Cholesky-factor of a positive
176+ definite correlation matrix, it is necessary and sufficient that each row of
177+ `L` have unit Euclidean norm [1]. To see this, observe that if `L_i` is the
178+ `i`th row of the Cholesky factor corresponding to the correlation matrix `R`,
179+ then the `i`th diagonal entry of `R` satisfies:
180+
181+ 1 = R_i,i = L_i . L_i = ||L_i||^2
182+
183+ where '.' is the dot product of vectors and `||...||` denotes the Euclidean
184+ norm.
185+
186+ Furthermore, observe that `R_i,j` lies in the interval `[-1, 1]`. By the
187+ Cauchy-Schwarz inequality:
188+
189+ |R_i,j| = |L_i . L_j| <= ||L_i|| ||L_j|| = 1
190+
191+ This is a consequence of the fact that `R` is symmetric positive definite with
192+ 1s on the main diagonal.
193+
194+ We choose the mapping from x in `R^{m}` to `R^{n^2}` where `m` is the
195+ `(n - 1)`th triangular number; i.e. `m = 1 + 2 + ... + (n - 1)`.
196+
197+ L_ij = x_i,j / s_i (for i < j)
198+ L_ii = 1 / s_i
199+
200+ where s_i = sqrt(1 + x_i,0^2 + x_i,1^2 + ... + x_(i,i-1)^2). We can check that
201+ the required constraints on the image are satisfied.
152202
153203 #### Examples
154204
0 commit comments