From 8c403140229764d69da1b20054c72ea6d98fea0c Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Mon, 28 Oct 2024 14:05:13 -0400 Subject: [PATCH 1/7] Update to PQMass I have done the following 1) Updated the README to include more detail about PQMass as well as examples (in distribution and out of distribution) as well as advanced options 2) Made PQMass GPU compatible that have the same results as the cpu (default) option 3) Renamed "whiten" to "z_score_norm" as whiten doesn't tell the user what they are working with 4) Updated notebooks to include device parameter to showcase that it could be 'cuda' if the user has access to GPU To Do Later 1) More encompassing test cases --- README.md | 143 +++++++++++++---- media/Voronoi.png | Bin 0 -> 24838 bytes notebooks/mnist.ipynb | 8 +- notebooks/test.ipynb | 14 +- notebooks/time_series.ipynb | 10 +- requirements.txt | 3 +- src/pqm/pqm.py | 298 +++++++++++++++++++++++++++--------- src/pqm/test_gaussian.py | 2 +- 8 files changed, 364 insertions(+), 114 deletions(-) create mode 100644 media/Voronoi.png diff --git a/README.md b/README.md index 20dd82f..06c7503 100644 --- a/README.md +++ b/README.md @@ -6,63 +6,152 @@ ![PyPI - Downloads](https://img.shields.io/pypi/dm/pqm) [![arXiv](https://img.shields.io/badge/arXiv-2402.04355-b31b1b.svg)](https://arxiv.org/abs/2402.04355) -Implementation of the PQMass two sample test from Lemos et al. 2024 [here](https://arxiv.org/abs/2402.04355) + + +[PQMass](https://arxiv.org/abs/2402.04355) is a new sample-based method for evaluating the quality of generative models as well assessing distribution shifts. ## Install -Just do: +To install PQMass, run the following: -``` +```python pip install pqm ``` ## Usage -This is the main use case: +PQMass takes in $x$ and $y$ two datasets and determines if they come from the same underlying distribution. 
For instance, in the case of generative models, $x$ represents the samples generated by your model, while $y$ corresponds to the real data or test set. + +![Headline plot showing an example tessellation for PQMass](media/Voronoi.png "") +PQMass partitions the space by taking reference points from $x$ and $y$ and creating Voronoi tessellations around the reference points. On the left is an example of one such region, which we note follows a Binomial Distribution; the samples are either inside or outside the region. On the right is the entire space partitioned, allowing us to see that this is a multinomial distribution, a given sample can be in region P or any other region. This is crucial as it allows for two metrics to be defined that can be used to determine if $x$ and $y$ come from the same underlying distribution. The first is the $\chi_{PQM}^2$ +$$\chi_{PQM}^2 \equiv \sum_{i = 1}^{n_R} \left[ \frac{(k({\bf x}, R_i) - \hat{N}_{x, i})^2}{\hat{N}_{x, i}} + \frac{(k({\bf y}, R_i) - \hat{N}_{y, i})^2}{\hat{N}_{y, i}} \right]$$ + +and the second is the $\text{p-value}(\chi_{PQM}^2)$ +$$\text{p-value}(\chi_{PQM}^2) \equiv \int_{-\infty}^{\chi^2_{\rm {PQM}}} \chi^2_{n_R - 1}(z) dz$$ + +For $\chi_{PQM}^2$ metric, given your two sets of samples, if they come from the same +distribution, the histogram of your $\chi_{PQM}^2$ values should follow the $\chi^2$ +distribution. The degrees of freedom (DoF) will equal `DoF = num_refs - 1` The +peak of this distribution will be at `DoF - 2`, the mean will equal `DoF`, and +the standard deviation will be `sqrt(2 * DoF)`. If your $\chi_{PQM}^2$ values are too +high (`chi^2 / DoF > 1`), it suggests that the samples are out of distribution. +Conversely, if the values are too low (`chi^2 / DoF < 1`), it indicates +potential duplication of samples between `x` and `y` (i.e. +memorization for generative models). 
+ +If your two samples are drawn from the same distribution, then the $\text{p-value}(\chi_{PQM}^2)$ +should be drawn from the random $\mathcal{U}(0,1)$ distribution. This means that if +you get a very small value (i.e., 1e-6), then you have failed the null +hypothesis test, and the two samples are not drawn from the same distribution. +If you get values approximately equal to 1 every time then that suggests +potential duplication of samples between `x` and `y`. + +PQMass can work for any two datasets as it measures the distribution shift between the $x$ and $y$, which we show below. + +## Example + +We are using 100 regions. Thus, the DoF is 99, our expected $\chi^2$ peak of the distribution is 97, the median is 99, and the standard deviation should be 14.07. With this in mind, we set up our example. For the p-value, we expect to be between 0 and 1 and a significantly small p-value (e.g., $< 0.05$ or $< 0.01$) would mean we reject the null hypothesis and thus $x$ and $y$ do not come from the same distribution. + +Our expected p-value should be around 0.5 to pass the null hypothesis test; any significant deviation away from this would indicate failure of the null hypothesis test. + +Given two distributions, $x$ and $y$, sampling from a $\mathcal{N}(0, 1)$ in 10 dimensions, the goal is to determine if they come from the same underlying distribution. This is considered the null test as we know they come from the same distribution, but we show how one would use PQMass to determine this. 
```python from pqm import pqm_pvalue, pqm_chi2 import numpy as np -x_sample = np.random.normal(size = (500, 10)) -y_sample = np.random.normal(size = (400, 10)) +p = np.random.normal(size = (500, 10)) +q = np.random.normal(size = (400, 10)) + +# To get chi^2 from PQMass +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000) +print(np.mean(chi2_stat), np.std(chi2_stat)) # 98.51, 11.334 # To get pvalues from PQMass -pvalues = pqm_pvalue(x_sample, y_sample, num_refs = 100, re_tessellation = 50) -print(np.mean(pvalues), np.std(pvalues)) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000) +print(np.mean(pvalues), np.std(pvalues)) # 0.50, 0.26 +``` + +We see that both $\chi_{PQM}^2$ and $\text{p-value}(\chi_{PQM}^2)$ follow the expected $\chi^2$ indicatiing that both $x$ and $y$ come from the same underlying distribution. + +Another such example in which we do $\textit{not}$ expect $x$ and $y$ to come from the same distribution is if $x$ is again sampled from a $\mathcal{N}(0, 1)$ in 10 dimensions whereas $y$ is sampled from a $\mathcal{U}(0, 1)$ in 10 dimensions. + +```python +from pqm import pqm_pvalue, pqm_chi2 +import numpy as np + +p = np.random.normal(size = (500, 10)) +q = np.random.uniform(size = (400, 10)) # To get chi^2 from PQMass -chi2_stat = pqm_chi2(x_sample, y_sample, num_refs = 100, re_tessellation = 50) -print(np.mean(chi2_stat), np.std(chi2_stat)) +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000) +print(np.mean(chi2_stat), np.std(chi2_stat)) # 577.29, 25.74 + +# To get pvalues from PQMass +pvalues = pqm_pvalue(p, q, re_tessellation = 1000) +print(np.mean(pvalues), np.std(pvalues)) # 3.53e-56, 8.436e-55 ``` -If your two samples are drawn from the same distribution, then the p-value -should be drawn from the random uniform(0,1) distribution. This means that if -you get a very small value (i.e., 1e-6), then you have failed the null -hypothesis test, and the two samples are not drawn from the same distribution. 
-If you get values approximately equal to 1 every time then that suggests -potential duplication of samples between `x_samples` and `y_samples`. +Here it is clear that both $\chi_{PQM}^2$ and $\text{p-value}(\chi_{PQM}^2)$ are not close to their expected results, thus showing that $x$ and $y$ do $\textbf{not}$ come from the same underlying distribution. -For the chi^2 metric, given your two sets of samples, if they come from the same -distribution, the histogram of your chi^2 values should follow the chi^2 -distribution. The degrees of freedom (DoF) will equal `DoF = num_refs - 1` The -peak of this distribution will be at `DoF - 2`, the mean will equal `DoF`, and -the standard deviation will be `sqrt(2 * DoF)`. If your chi^2 values are too -high (`chi^2 / DoF > 1`), it suggests that the samples are out of distribution. -Conversely, if the values are too low (`chi^2 / DoF < 1`), it indicates -potential duplication of samples between `x_samples` and `y_samples` (i.e. -memorization for generative models). +Thus, PQMass can be used to identify if any two distributions come from the same underlying distributions if enough samples are given. We encourage users to look through the paper to see the varying experiments and use cases for PQMass! + +## Advanced Usage + +Depending on the data you are working with we show other uses of the parameters for PQMass. + +### Z-Score Normalization +If you determine that you need to normalize $x$ and $y$, there is a z-score normalization function built into PQMass, and one can call it by setting `z_score_norm = True`: + +```python +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000, z_score_norm = True) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000, z_score_norm = True) +``` + +### Modification to how references points are selected + +The default setup for selecting reference points is to take the number of regions and then sample from $x$ and $y$ proportional to each length, respectively. 
However, if, for your case, you want to only sample the reference points from $x$ by setting `x_frac = 1.0`: + +```python +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000, x_frac = 1.0) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000, x_frac = 1.0) +``` + +Alternatively, you can sample the reference points only from $y$ by setting `x_frac = 0`: + +```python +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000, x_frac = 0) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000, x_frac = 0) +``` + +Similary you can sample reference points equally from both $x$ and $y$ by setting `x_frac = 0.5`: + +```python +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000, x_frac = 0.5) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000, x_frac = 0.5) +``` + +Lastly one could not sample reference points from either $x$ or $y$ but instead sample from a Guassian by using the `guass_frac = 1.0`: + +```python +chi2_stat = pqm_chi2(p, q, re_tessellation = 1000, guass_frac = 1.0) +pvalues = pqm_pvalue(p, q, re_tessellation = 1000, guass_frac = 1.0) +``` + +### GPU Compatibility + +PQMass now works on both CPU and GPU. All that is needed is to pass what device you are on via `device = 'cuda'` or `device = 'cpu'` ## Developing If you're a developer then: -``` +```python git clone git@github.com:Ciela-Institute/PQM.git cd PQM git checkout -b my-new-branch pip install -e . ``` -But make an issue first so we can discuss implementation ideas. \ No newline at end of file +But make an issue first so we can discuss implementation ideas. 
diff --git a/media/Voronoi.png b/media/Voronoi.png new file mode 100644 index 0000000000000000000000000000000000000000..978679a300e5309b25032aaa2851a7763a1aff3c GIT binary patch literal 24838 zcmbSy^Lu1n&~0pMVjC0Nw(X9MiJeI@v2EM7ZJRT(ZQp*sd%u6+{&3E-`<$+Rp4xkN z?W(m_g)1pYBEsRqfq;M@N=u2UfPjG30mo}FP{4acPD?rP0hN`gsFIDSq^N_fgOjSG zk%^hGse`eZilhiLH#Zju2z9&(0H7j8Pd#RY1^|pr)6u{=xvPYQMX3P%`}>B82m8kI zrt(tK^z{B=qx>5LfndVL^y!3BgM~@WR;M~H3NTQe^3Is(tCI#^ZcW+(;H7kFiSU)PZN;zDP=eQRS zoCK5vB?3a8V`$kIJ`9qgL_I4EGlWpsbUXlXV>r>*cLz;kzA)KWClneG;7=0pZUJhyu8b{iFwPBXH)p%(SG<<>f(WfnyjD&`>K72;c}5 zc;Nys5D>6Du>XAmT9*g@zhjWK|6Vo+_mY5s2!Tk839EU4Uiw0Nplhx?#oGqiF%c}< zEqY(0z;u(Nz)%!HFAztQXHK_{Q~tGgw3@23zkAr>Z40giRc|7Q?~AlC!_|I7iv?}M`*m?TE0{EsQx zf2O?uf2IK*%_w}I4~s5`6PUEREma$`rrhB|Q^a*PkvNSdqbK=;*G|wJXts-s(L*o^LPvvFhs8 z836&Fh=hD;860*M6~@|9q$sj5d@uikDmuL{H`u-I#;Md(S55jA1wWG{V(_KaE7ayq z?jxE{BN==0goB`zFE^Sa30Ci8gP@UBh9fY`oen3w>`RX9HXALF8MNwzT)Qi9u*n&j zYM(AQD{J(yl3<45z~Us9&?azw@tds{Ln<_D6{>YxH8wB9l6l=Ou!o0-ap0W)MRJUi zR_nE^xV9eRrdgMn3ja6L!o=+C$un8pg&#|gSm+K@7ttc2$V4TrJ6@C2)6tf^6yTB6 zA9qJ+peCeB)jI4v-cMB|xPSA6!Fl{XxDF2waSFOS-LFf`r!#O0s^jg8tTBy?@My0c z=ppJn{LYqW>QU^fQoJ4PL`1-L_w6`FpSFGO{!B2jjv{x3`I32m+Kj z1}p-jd?Ka%A6(^YgsQr_I9yJ9`Y+PA2E2o#gaDiaVVvr8CjA=2UV-GF`Wu$=&l?8V ztVSUS=*L?6rqDulaf7Tgv$Lft#WJ45a>!KmNKez8e^2sh3^q}Sh!h7y5bt#)a=L|7horwa}z zQ*CsuFjgsXv@itq*4e{_s8N{`NT9KQsyTbLN^Es`HqPn&xoa0!-A)J)UtCP`{T11l z#qDf45`|+w+s+HzOo9vk-|vR*r3cJZ5K*QwvF3vfin9n81Hte~iHRoEp*E$fB&3}{ zYX~>bMMsi4k6YvX1RQqjq%&P4B7v-xTElBiF_7hCNoC(;ODij9fnNM&WjBpApQ=+< zK|_V!#bUG9%cuP0aye0A&aFlkc==7}d4L)ygW9ieF`L716{P*TP%KOEEEcx`4voa+ zH<_IPLu}18b>i&oESs}*irO8L%;WVyO@sR73YYyelL9C0Cz0$0dxeD}IwRok*)v%z z#J9EI)4B!eaJch7y&^~vHV-lLllemE?(eVny^%4pW4WPfBOI)BQ)nV1;_+(DkZ;cXe9uMuB+bNeu5*-mm=;n$(GE zy#HD8qtc?HFbs+~j|Z_ZoyB?+Nq+Wm6uWqYDU?_m4qVUoOLLu0j)1Rc{~!wkIAbMr zkv%YyiRaJfzl`(<4m(}a-!G$rWy*yTf#W1%YSlW;Ns}-{Z5iw~Jbse|&oJ2XV4{q~ zr^NVaIVU~uSA|hg2>lh4O%F&E~Mf7oYt@4$-q6g#lvSAzwwO(L7qp 
zr);>}xEP6viT_=r-5Uu2kjj98)kXf}W477qkP7Mp?t!fW1`opK8U72UuL?^*U*?2r zhy_A8x#&-aNRyKmIjIdJObD#aY7OGphk+W3=y#N5_*$Kj=)}atg1i@ImyJXiRr%Bi# zK=X#jw36?6bQ=;BCMmrlCcOG%_i_vQN-yA+mWBzUATS`{7Xhz%p=hY>hsqoC9oV3p z0R?K94#ZmL?`_1e82ozff`XT`D%90}Kp*&z2&s(4!8*&ghf`ASGRCMw&u17hNO17Q ziSc@6r-o>7*Nrt{br$B;Q9F~#j==ppGx8q=72ho2cVz%^-eoAA_~3(#IMd);SCFKbr9r!L}74H8r2!$wCXId3Hl zD8=aNv?@nd9AAEB!Wcj#c zgTT5OS=%FXp)bS9trL=>l6Ie2XFLoSs;K4W<|ei(l(ku$`ngankAJczhJynzKEu{R zD@FZH^Ris2`M7|%e5q6^A=dYKfex9J<{bWWOR9*@bKKNO-8fGLo3xjHcRXV!5UvtR zfeNxPFyItIR-a=qAye@PPm{Vwu0iXYg@W=-1+Me$vemTPPagndUXqM)l%;UERIW-& zxR>eBLvwZidYtQgH?_S-#$va@HhLdq)!2mK+?Q`X8dmaff{mSqD~d$TV5`Z9J_}AG z&@XoM@~@~)zte4grCOI@9-tOL51%X~#8QVAC$<`J7XD+*I-)et2X7gW9LJple&}=| z$X^{yv+!f(1cDiSQ3TFez7$My6KLFMUDeSxeRnun4h+HrNgL~)I2FZimm8=mwOU&0 z6di`~kl;fGP=g9dOG~CzH|ESTG2pe@mo^-xHJ7Q) zQDmoU6kujdt*W{j(HiJHW0VF*@<(+xGzdv>ij@_MIXYbDimM72y+f(Q3Umkz7wmz( zs4I8x7{c;@5lTx)B#?)^?`C$U)Lz@5tFcxe{pIhTwOIa7hn!7J?CtNz5$967vjr+T z@=R5ofZ7RPt)0}OIV7U1u2eNJWmMDTLMX1$Z&cRH3fH&s`75UlJvlaZqlLYoQ4WR3 zm-BoKvC?heu?rbkp`+pmV|kN~Py>2r%?rlqu;Jl5LiynsL)d(_ti(hfOBcNg&Vocx z@ZkH7?~h|EW0)%c9b{y1h1=B_H=9SQ=kW~jO zX&Y5~f9$KfMTn6voESYte#ryHx?oS*=CuD#h}N5;kBzPmL0;uz5Lc2;S8G*yRWvsfZO_Z(9U zhXHkD!$7l?*2fONaZ9T%N56}rEAiGtwout;32o&vAPh{9NVWQ8;0Qs>MbrZFn|~W} zWa9C+(geues(W$}eeg)PJCP3nj1ZD(A*ocR@R!q>D^EnGu!MOqt4xILxJ<`T-mW^Y zFILjJ-16g`X)(#0#j^PHsHFM9ie*#HUG-T=I=sXYDGAT6^}5K_uEWDQ>_hq6gsXL0 zbDVDzTK)1bt*op_yJhZ=rX8|j`U$c5R(i9dLqr@i)5=v?n22IG%-e;zL%F&`qK8SL zg&l|ceh2ID>5{RRs;E;b>7^^gpnY8dwkz%RJ0H}0_;lfM^*G!((L-riu-QDI3;4Kh zQT%uBD!XiWFdgD`I$Y0>8twJ2Yy1Tmqwc^WM4icTu!$99SsTr0o@_-51NaSrM^R`- z0Ku`NJrh1G7TKTKAQSexM$$OZs)T;Q#C(6-k~S8#l5yOK+K1n5+Is0VNn57bFR~T9 z(K71Z@m1}eG@R!+4SI$@)PfQc!+fN@N;o$fJBheFUqO?^#jeZ#NsA^0Y) zgj3{Z7D&*0Qg%r)F$YS3pLsN6)VS6;ZPkUNQ&rfdY&wO#XJp%5-K$Yx$HfrCeNnc`6>?drdwhfp!Mq+WCPoo_hKY|Rvv)gZJ(MQpp zag-q|=C(i!RU2&ENr!-fva_Ljgi4iZ9SyKTnDUY;K50-qakF=%PCu$ zmPQQO)>_2wyvm`K19}dRD3vGlO4=UGrg=r$M{@v2DT=q4A@@A%*qF`3Zm5mgEVY)t 
z;KTv_r88xq*1HEWD2n-{)rAg24QEQ85~%TgagJ#AC}~-u7$cJMv!?CwG#M6?f(ow| zI}|n~$O~KiQ>`%xH2ECKHJ^$RitYNd`f+xVDi95g)mc-Rev0NsZ7AdR0|cNFa9Yn* zV7Gk$wL%F*)OR!+BlV!Luf6ik=?r$Vs%Gg0q<}hEKv;A}@^Apt>kJb(|1XiwnLh=T znaL(OhZiX7qp5gOlo})%=y*v7=!sIA4S0|s~;fte_m9RB;H)B-|49V?T^1er4n+L17(Ek`4Fbc}SBzjD6T zw@ldw3WXZBL@}+PNcgFt?w+CkWKp?YRcTHi~NzG%tC?^p+K|?EFyQ*)%@F+JPb4xt-Pv0M-+HEs= zAeh`17N<#Rn8=y=QLweB5lKXrCB??lSj-|EmHe-}pSg1O5c{X%%y9jsI-LVodFzlV zLm7NV0A}Yg{q2O{lzU0oC-rF zD+8wC1Ji{-s6k*8HXKn-&d73Rb1)Q{HztUCz(pVlX+Z!FuK;Wgq>q78Wx;<4TPy5l zsYoYtdZ6BW_U>(ER*wk!_Asv-zd3a|xZSp^|5Pho?qgH93roXsSC_VnPnU%Wn)+=^ zxahl3vi=GVlOy~kaRFbu(*p8bREuOF$)MUxzO(aPQ`OwOL@U*Ly`fmDe!n&VQeaDo zX0H|qFBmuoBsy=BocI5MMej;sU67tqMVWweLkRma%x3Kg%^a{FLcKzQauS?;TuyR! zgppPIHdlZQG28pyGg$s_PfgPGs7S+HAXSSh0uw~ltPM6L5B@|~7n zAJ=8liEM9!L|=Wk;M|*LY2_SLb_7fS5cv$Ya09PX3c}5+UOFRtjlR}OeY}KJX>8PF zMj}UcVSQ>GEmR+^IQ@@vMVI>#_6dP(8T6cvY>Q(ouiaoSoi<0J>7>7 z8T0kF+KZN8@y7Bs_kJJa@=gxEPX!WLZkL&=oNMn|Zk_D(Fb1x9a$kGd$QDTvL+DE# z+u{0jRc2eSm0qu%c8YmNl)3cd=pe;kWiFmd;ONiIxw@vBtV%eY+Jm_J* zHOR4I;1S!Cn3*|hLK6D-Q*UB=SI{E32`KUm>+N=oT}KIdN2OwjbtoP5!IX zaTm9E)lRs!>y9(VCT+q`*Rng>`B(X=xaB0^(`Knm`Qb3taG|ocXTn+Yl`82Y7=BpUsXx-bRmvrv*m)`Tb-L6R!1lvO22L7bX$^3PZL>v zSs3-w(zJ9wxM^Vl#}B8C4>R7Z9mzn%gN`68JK)w? 
z&}jl7h=*ITr{rn7-W0XbL~qZ`UiX(fS&o5U#kXpL^SRmU-wad3zha6`)E zFvH1KXNV^s%@larpC{b97-MHceST5{sDNE>4x0icV&mcxm8Tu|hcM0wuv>2m)M+g{ z6Lcc!D$qY?<7x6|FU{QbVND-b1l(9 zLwNnSLY*#tKwP5Y876)eUAe!{c5EIEgoLY<;u@lOb^Zr9-SoMooo-TLvrP)b9Gz#l z7_krpiUFuU=v)`Kp=z%Ig9m{Ym2z<8+g!Jd47;5+rr!p^#|iU=l8ZgQ@0P$) zXSdx5V|(}pVQNIV5FQEt-^Cby%YfoWgB518eF^e%zRhALM}@d1LTXloSF;pP@2wfbXvxgO0MZ-d4??m3AQh8m}?n5qdns;@nX}*0+?+n5H~is==++ z?^a5CeRt-h05BAULvZCk9lJHyVh{tS$4T&N2elk_Fm(MpngCavOPv;~Pk#d;Yre{) z_a)RCb*$bVkH`Hukv!pJ@$2hx`~V5%8+bs(IJd+7TH&wXd-Z?Bh(zBU(CMJ)?yz-h z{&4!N+EF4J+xrrBdSCT_kQSe5D;+)YR?U@4=_<#ToV?nV6sCvM6Aj2MvZ-;_0!`qG zcAci&zGQ%Q$ABPJ_*hR`K7RvO6UHoF;my|SLIeKJvK~q_NJ*q?S;5&{h1D7ZC0}m{ zznPg9i|!FFxaREPxqQ)e|4XoHgKiyHyuxZ?A18SDb!iQX(*@+I5F|&*q2G5)sN!#X z!8-kaB8uv`FhRv`6S8=_6W`pdj>_G(@vpafIHnC7!f-jP>7C$St<#nx@`|5M2%eI# z-puBD-biPmV}B#=aq&)w$nJ*!c^FOx0~c#+%(DfOY`?5q68?u1BK75Pp^Y2n5u&4Z7@f!|jC|56B7v`${fDPb2i5(N zXm0WF9`wTjNSfSK`=+;`H;nX3boDOx3D!9Z@WwhXDu6}smLb`yGv+a1pj+9o1_;aoI7sW6*n z@hjCA{fT_tWUycmKyzTQ@cCuTV?I!>4^2+udXhTp$f0QH1A=C~3M(~d=HFapPor0f zo;2h*kFW5zBEMda$j`PN7?%}n_8Rq*=`y*s7+8LwP!)v`@bHh zg0G-(vnGSBsBnL;SQ90ntz4tLO>X9*Q7Y0TPNCu^`I<#yaQ^juCICXU2`Ti-4qYX@ zP$4*E2&_VBuo>IKenvBfV{%CFd>W-Jy-6G8zJlR&Z*Nmnzm_1E(E}6?h91 zjcqO_zd`If_4s8KD<(e_J>lM zkK}_(yuS&Es$VFoUOKcCBiSYV-fp2U3au5PXa7!=JBg;|3&-N!&a-l!wZI37K+xcC zup$}g;cvY92L}xC(N{vX_w%AC>J1Q}O8UJ!j8C7J@(jc)Y6kW+5h=N4TM`ZtTDi+H z^gms4GGqgPvKo6hWPC6BpT4pGO{Wa((N>1+;$xh;>mSnF*1Zun>~yQvg7-QrFMaW4 zC|tG+s)4TLlB)1NO^B|R_k5KA?I5gl6wk`e@qgJUnc{!&Tn zvRaVM+oHP7Lk~uUsm;(qcQ9IizL;{q5y%fx`ly4nomL+)5!1dMiwi~Mlu4y%z1zhG zs-p|z(L>Gf_GgP0oaJud>BExA=XN&I$=dL0&r$u*(U+X&*CTjxrXyAQ^-nhO`PbL-@!MndFNQ^|VB?(e)l*Z!sNAQ>ze)o$luySs?gW zagHD<+=O#Jme!?FWvMw1?ff^HPIKXCI+I&-lWqzj$V9j!z7Hf!Yjv-j6)QITJMRl7 z1d!ZlGYV(Dte4qLCjQp2U5CtrcYZZU5v*RJm2##wdptaY*e_J@rZd%ZGd+kvFR(K+ zre=u08_1L(!ZN32YyjqDt~RqjS|3}h6xZZ-h+h$(F+{Y^BfMhTi@%_0gcV5%ck9%@ zJUc)ZjM(1z!@^ff1|}F2sX}FV3T;ijr9oG|(cD+A_kDYEU=Io5ym{3C{pg-)ZH}xz z8YGH-Op*CN^@90qjyyV*!jjm;5+*qOa?obCH`TICvr;>TT^2)jF=O#`1V+h+M(6Z6 
zv+A>7>wKYNvr#0sYf3!9hWT8N{v>zTw0d(ZY+DxTxT$GcxdA`DN#C+?nap+@ zNSU8D&VKdN4hY`0ZHxN8xfJw%UOEi8bh%&e3}8dWbTOI5YKYvpnCAPvot6SOHwWla zP~u z)D{|f-iLhE_3)(jVm@EqW+Y6#jPu59>G0w;RNi#gqe;tueVslj$SojgDaObsw)%Ho zz9b-`Kd?^#Pyc|DZbWBQGma8HDc95G7umtH^hKaD}vJq$kg zzTT$bmTYe93iR@%LAgm}{}_>odT~dH>{qrc=$b(evA*ZKq1Sk%!IX zien@ne7Q<{@v-TH0re&vI9oI3fCoQBHlhZ|avhE_W5L{P&fp^-cjV zYfqm*xK@b}HaQc0l|}L~2m(MSmj4MmGrKYoCSREbx^bPF414ZBHh-zt-`|L(*6bO~ zTSupb_;*JgcEs?|i$|vsC{#gbv(OOOR)ZDJ<5fZQLopj>)P$jVvEF%?yh4j3l&%Kj zN5({`Jh0mBv51;@{KaKv(7wyN7Mvh^# ziwOj=(nnbWvZz@H1SJkn31j)Q#CtuPAt@+ys6(kjzTYSO6vbunC391%)?_BJ6pIDG z3F&|BEP_hO0aZIF30QZYK#W}Je5LwyhWGj)T*yo@)sv4f8V$RrKiB6jJdW@E@%*^{ zIv^(u8+@<|xko-gX}QAO?W7qt5>SS+&CIyueKJQPBi$uZuHI}02xwYgWEb@94e(1d zarhh7*}#wK?9E5UpX_$EJ)I)wT`rwiVm~<+G#?FOQwbnp)PwrCq>0j1 z=hd1GOlo#L*I4n(DgdH*%I{B?Ru^maq{bnm@V;4np#q^6C|)eN&-140rSXJN^CDO6 zRBWiaqyv2l*$tIn2?@6N>I02$v*;o)_)4`o{1WM{&FRzWlV!4aQk-v;{y3@54jnFK z^1XE!8ohqtsNYksS+%}CFaKh}M#YA_9-E=|Dg`_Rx?qF?Cng&1#Ad{nD>K(X`o7;w zi_`xcK(4|lqW6D#b1p1TmcEGkhJJIx>|?4|9xM(%=fa%Cu-hyZxm8=wWf2MFdlPvL zD?_g|#^7`B0o~$}i*G<-Hed#yooziv1P@`i_83`EuAf9Is%`i%TI7L9nzS2ubN>2x zb-6o`_Bt(2^$3Jmu6e?6gM#YC_i!WoM?XMFNu{g3_0;R!LmwDR$S-s25@NkPr*WA7 zNd{9hLYq9Pe7VHhWadoidbYJ(JHd1>F+;{wlSpM}Yi}>2^j$^;uPdsZZbL=flLV9{ z;8YosbRpNX0rsV%zKIlkn0&SEXLH}#5l=QV1Z%!xRmu8!U8IP-_NN`nB(G6jc8zNQ zI$E;>9T;g?4V0a_vnrH!L;A3r$v0~?G^YQkXq`Q%?sSF$Vf+b%3Ng7VnU ze^%7bYkC@NBe?Hs6dpbr6_r|Z2JsV8G9KxpvfH>i6C0+Fi0;8qB=o);jG?9o7)_;% zlF%BX-y8hD{W1i6JL0-T?2^!c^POy1tz+Kdff|AxR$u(-jZn}2eZ(xIXj9{R$W!%e zwkJ`oFvWige!EO(nS2kAlon(qtGnF9b?x$ZhN*ZO#S3Izjif2>oc0p9KM&-iw_8j2 zbW?uB^OeG$V)u;M@ocu3uXn3s*+kZ?_@}OO;_las@i5l>c~R))S_y5K z4r;wtcZ%I?yQoMH0==;%1f`ev9C6?p@emvUEpoyVe@UDWy>Q!oJew}o>h6HHz>d)E z@(t%7=daxode#0t{5$i{VZ0DSC1Nv_2DvZII#L`pdoBa@-*5kKAeMM8q2J*Gi!JKU z7{vqCVs?*fyEVsg9|{ zrRMif=6V6t3*I1G=v%FFS&*<$(oZ;s=DLJ>HX!2XsYDT0hBFIwY(lrD`$8^&=_A78(EG2E}@H8 z=;OvRe3RQW+Sz7l=1%5Wqp$Okdf`$>P?=7*Fq@72xEZ7zMCr7i1et_jf5pO>c994e zeDiRQV%-uJAUU5UB&DJKnT(3I8<{FST(^~Duvn zI>u!%4Lc}E2o(nX@uG3| 
z?a;e|y{5G%L1z_z+gN1`yqg4fIS@Jyb^ zV@e{s)5RSh-i3-YH5-K+Kb}1NcVJ(GVTnp4<#L&f*?wm}vF>opqinf^t9Q~K;-n_^ zHpV6-e5y0V+l*SP-?{PV=V!p9^-S8Hy{EdmI*ej41|4iznJoygqo0P*9Vh8K^d+m_GK z7M~EBq)236_Mip*F7GGm*z9bv;g9Kp<6PKU3T^6s%Y_pjv&+}!A%>Y8Ud754q7uRG z$78hP5ANTV)KbzxWgODdl!UT%CN~cBIMTi_M!UcorZ8VLR3#FN;Uj>S+zgR1fJA$1 zC%dxD>>da=b@3nWQ_U_eXH9(UHQi}U9djdahy@7aA`S^2fNTPh-6rdGgb69d{0yV8 zJQS-o|Fq^bC`3Z}L%JfU<)q2f{NcJ$bn%Q*JkF2W8S1k^5sg)$JL}CBhR2Kb#qXGq ze{h4Wb=kbFs!*XzH2X;e+z?#iT{1&4dcCIzuGhby-`gOc*Wx4*&%uHE8Lsa3=^7q z4WD~CPh_YKj=aT&2@QKG73KiJE&>lZKJUF$4a8fNrC;U=4bzR4-=6?hX{ogDsQ(4|)V#^ijkVo43ao~9hl3B!KtC{A+OhHJ(o1i9l~sFRwImHqv2k6s>Wpt5Z&&vly@{WX zB*`(-*zp(`e+#GTYIP(rX-kzepM2TRb%4sD9IXs+ zq!2_XtT1%emA`fPzki?Hcqlc4^ynOCo^+HbqEMJa;3KdNs)%9J7CvVbO2MJPX_ylV>XL?I@8z?F~gqQ#-vH5g<*1mX~iRLbzXUh{yF% zHWW4xHj3k|!2SB$RaY z_M^btIUmn{rd(mq2{due3tu|>>r6-+zN<`cycsXW_2F{ZVxx$R4Mlr0IK{up+?`(| zD%9nC0|_Ox1Wj7=Y#wq^832&0+3rfxX{E-&l?cU#UM_M?rz$f^4ft?^s;xz?Ai*_6 zNc}P1#RkRx`^@c9OB&KMwM>2TWG37F&XmJ$-7n$__tgc2I`}C;NIguZ63|ooc)SbL z;yv_36U~7m^YtZ8drefkw6c?f^SYd%P*CZWoRwp`K$oQ;^7tIEX-~P9{o1G0(V#m| zfZ;+(6SQCfvTG8W@b@tzR&A_yBegE8Vf9IR68_dN0GzZCrtpe?T8M_VK;^7*r`u&y zjltp@!gybct~uD8 ze|Bg$TUaBli(CP$?i}bzuN+h zS34WlL(Y&{U-`eOMC#?J8lD=EcL>i`C_3bi{a;-ni&t%=qchJ~nW>gS&A1W6dv{Ak zZm(u?1#@GF1QlWhzuG#a5jSs-iTI>ega%?Qmq10C>r^kwQ{DHmLp$e7#iJx<|B%_^ z7h>7HotbNtsP{f7A+xaRdXCUFFgMl+YFK@3NiWq}utq2{RgpBzO1g(5q5`Wy$>*)- zz}bd)+Pu`qS|As@5P?Z|(tey{(u(D}vle`@CPy!5VhoV_wNqgQsXwJ5 z@>tg}t5P>0qB78qpF_GYK{p;gkt3(@}QVp?NEg&ah? 
z;<**&VqF$(@|3C;N0^a4Ed-y9ydz~53?+?ZyzBo9GT#%_unny^4Jp%%?@1b=RnhWc z_#9psVjbKQXwmv7=yVVkCJtQ!aRzJzAR+rH@t9#RY;ntp=)cdaVB8b7y`Bj=jHb1A1ud_ zH7F~Ka{Y6cmbCd?Gi73ACc)k~j>|R?(Q7~AxXEqGTl@;9dACb?nXZa zxKsEu(W~BN>20G!CV^kcbfM~SLAi)9a#oLL%Scdv@MY7=x^GAD0zFBN@qB&(TS7$W zG;1(M`+g^t!N4%Al*r@tC-ohm(BS(^dG2P^gTUXQ7Gb)YH0Nage83Zy99HLJ9S87zE{Qvy0g7iTpKT&?W&Gb9<-A;oul04cnsz=NG3a{{S3{2o<{pRV9XX3*fO?5oy*?&q* z-!aR|ITn(lE?A|dcR(s@rSEw%{A6I5?*DCFPYy$K3heX27|mK{Mi%^V40ne{MFWdY zP^$64^!g}owYXr;X7T1#p_H>9 z>zTi`#rO8hfRg`v)4e+h?ji}1$W~)o$5YO#*ee7B!(J1o>ps{2iyMfDsmjHCh?C_| zNprY|5b}*Q#mg?YjyWPA=nqacxnF~3DOO+tlhxEG0%PZY3OGqB8YZp*Uigv8K2OCd zP2T|RoT9Xooq7TqXGmUy)-Ii@sQcJSP5ZfBbV`NWJl~&`X>B^5m!nf99a`VanpN1D zD%h1ibME|cd-p+0TvrSG)W?JKE0oh64H!>EIxmalUgwwPOTCKDXRF5?`c?e~y@XV0 zKOD7M>q*t8vE?9BkGz7-zQB-}S6g_CSC32PiSG|agQ-a-Z>xbnupMbd5F5ycJff^jE?^z2 zE4EGoXFJCy#`fSD-0oD_Jif}OhElDXf*GRg{C=|SHf}#UXFH1NURw3jYE-y$CAO0F zFS_nyO-5(urvufjSNLvl$kHe(n`Glied}u`{9-H=;s)vn$)!pEMF=G?@jQG?5-r@! z&n{~I{aNZz@*yQ9jAq?XFY{>H`s36zHWlU_Zn->LRxygd;-l!A%E9HLb2FXWh8eNP z89|sImzqR0P!%12nR+30AB2^KVT)o-+aM{W0>8{6x#8dl!vU8KM4s+v_$LayV&q zeKgEwJC<&IoaKM9(VaSz)vDAhjLc2+WRn|ALMVwXTL;qZN)v_2goB}QBW?oADv_38 z!Urk3dk4-7Z0`cnvf4!w^FG=9*dT#jD{(e}owux<9EIkqT%k-(yCNb#9l1-LltRjA zYb$#k=f}m|KNLtCDjr;!P8t_^ML7%v@|8Lixi`nrZdU>d$(79o8_nSEO{Rl9@tW}3 zMvMkcx1?R;IVFG8L+0i;x5CVCHsi=qZ$gAl7slD%0#eIuV)Gv^SN%(u&nXwdpPx!M znytM$r>>pj+J1K5GOTm?;GQhtsg_NZZT{)dEw8epn>`<;HZdD{{zRs;ngahGOdQs= zmOS)g*F~c4w>jfLIyX)2YiP71wIHZrZ){5VXC4LD?1N$gyHfUxuTx&gJ|6|m$I>XB z$9{{;hz(;vH@N$(EWR%*KFC$GbVi^1d6w0Mi#Pd*eobBqp&S$T^xUvfi^|?YiNgTi zIeEZm8-8K;iJ!B~rgf($!izuF6?6(Keiu7K#KLNJsKX1}q3w%XEyBPig^B`9ta&kG zj_&&%^YP@Kxcm85FN4`x;IO_pR|Dd7QfH`3sq}F}zJuAh*>IUrzbB+Ztx~dj146 z-AfIF{7sg_kUEWDDE)tFwf<}V=)j}SWI#>JEPE*cC;~zIN6YVRh-h4b`Sd4CgUas$L zM;f46V_Oh~fWW8T8oMr{#i;hGAEh1D<=SN{LxIcr#mKhX-aEe)?0pgx+gs>-{L`az z=)67io}*6HGsgsh#$?>-4Q*TR-!?$B++${oqG}jhXxU~(6A7{_7^+Z%K@MX5kQBji zL`fC^JT^(WI%Bb!Nda3W?d 
zd^q{~yv#|JT5<-zQxphJCA&piZ_|i2x<#PXdCMgUi;eEI-P-9rdRA|xHZ~+_31szL|`g@4-E8x=EQj{QwfOq7^BX zDJG`hwr1#e5nTYbvz%W|&c|*UohKI7E5te3QH$E&)nCHoZdNic@PCj9_i8OgI>tP3 z#~$x+zft>aQ6ulnCensR%OnGpT_puN`fc$=bNvmMfsk@B6f2m zk1p#bCUxX^7!my}T5JFKpk!i>AzF&mi-?L8Qi{x=+fw3v^2{+8c-?1 z7jW*?6-SW6+G<#^>4oIPPvIYm0*c)sbT=_8ndFwRpc$t{W?9L^aD1y)u7c?GH0(dm zkNkz~d%tb{c6Xlcg86hpQX>C`y+!mO5>XOS61@}r#{2WTzJKm@uAP}{&dfRU+|PZ#UcEzE+c9oV z1RBNe(6(->b^*2gkNZGDtCz=+78K5)5f$~fxF=o38i#37F#8LKTmOw}msKfZKSY4T z3@%PSrhD#OGt_*wWKX-T$X|+w$EJP50$5xaF?eLx`; zgNCQ)4=;rRs{^tA*)&}l9F;QPNc~gV~(yszl1PMt_J~@NuTS> z6T{2%F6Ku6KjJF!GjAUK9wAM2vVQE~4u`9qhv(2h|CIm>8_QCt7vJA;EHq}~MfBTI zlh`TD+mK4hV3|_Uz86^o=?%2y zjqlZKq7M;up~}G|3hHd39Vz8{g}kUPzP0@3th(>!(a8x7WHv%;2elXUvL)Sm`x7DUa&k?oa#8scX+Fqle#O*+);nvmJEKhbGqFt`DakRND1KOWJm# zy`-$}p1ha3oVcIf=n*@J9C5R1{#90d{%}RWdt_q`4*WJ|!c(fDzL^dAoW|ft5f59_QQ}aa<*{>vIr!BY>D_fh#oPbaip{vX5 z>7+dpey>XuaEe1rE=O2A-Zy4^n_dTQW4w){VI6a^c6x~Mw>ci zi}RjWi<4<;hU;5?*Q(8aZyrPtA3IAtx!@f%{jSdtsrw43L>DhGW^NL6Pq{6~k6_vK zJVarW;Y8$fyn+eVA!n=Th%l0XC+(K>M*>J;w0iCdO$lo`@#pA?Hk$^QpXEL7<9vTL z+f53yOqE{wS(a@#ADx)%6#(!F0s`k}EE)-+k@-a*%OtXf-!`peMe^;tK0eA#%rGjK z^ZhGscHes;i}d-W8(@cQ%quxWoP2*4$$kr`ofcLZw4co{^<*MVeepzSq4YOkA7CAp zv%5QRV$@e&^KiOo$fH$fYP+(1zXi$80ye4`oQWQ5zfXk;%r77Pu2~p*reM|a^y|+W ztm`X@Q0S}AB=kGuKW}k(aWD`+U+ToqmGiCIJ?hR47D{)t4N(j}O*-1TmwiqVlsEIF zVY2Si@zcqVX|iD=Rvmg8dbt&<1GOPPlKNZNyfrr#Ne!<#hUXY_oM_}#%< z>6(u(>2_r>j}uHFS*(dn`y>89KV~$--n`8(O!q1SoT0=H!%E&_K4Q7NqAHw9YMwy!`!BGHALX$zh@%8VibcvlRif3QgC^QZ30&Y9F@Z#aPbAvGS(2 zzgq;hzWLcgXVe>AS4A5`e~3u$SA0H8+n3!!3P`!ld_U;h#{f{4C8Z+s#MZ;fH-!`D zQizGyHo>|zxn+|JJ?Br0hq-;$#xMBGES)sIKAzi~jqE^Mp_eMYm+8=N&KtY>X80*- z+mb<)-9O=6_U`y4l_o8)mG-?6nGzM$(;u@|4Sx$KE0*0y#i8wJ*1vmzV@h+?_l6SJ zH|P;Xf9~oS6yj4cy+af*QM!;V-jV{D=8Yk)^8jv%2*j}JJ&(z-CaG=@uT`D30;TC8 zi-t=FyS4PKH!qE~K=#C8rKPk1OR0;7m(ls_funfAr@J)aHqMf%2*}*YVyI|Pe}6I? 
z+kNWCll1&yE(AYov>`I;EY8|M*8vi?M7=&)!^SKq&(Co`Pcc}oW!UF=lR2sT#$;Ke z+pI_c;Qeu82i+ah7l+?18dSaE&nQx(aos>CbOY#laB2!qsJLQe-g$ge_c95&I$M){ z!SyIP!{`QPqpbw7BB3T1zg|)foZemE%>8>G!#)m#f;Yc5ysqz5U7FXVu~+`Rv-l+W zHIpM}P|gIKLO=%3g96<#`2)-$=TUf*qpfvkT(qGya^(?^^D^HLwTJ6Qe&h)Q+u@B> zwMqc#<#A-^Qdfj*!gH%-!qrhz-I8zqwb29NgSK7xw|dR?EhjG-JFGFhisU?)zw~CF zUK(bq*Xt$8>!}*7%ks}Rw&MpK)WDj=bf4=8Bx<{he?^#3bP8@y_4tQ(4ioZUkHZ3s zOddr2@{Uu=rf%8kxIiwMYw|4f2tKN<_m+DiK_XuNbXz#}i;4%QKMUQi%v>q zPVe*`Lf?&izaLsavJVv)lP%24m_5qYo|%&rl`l&O6V$Ph{&n^Ht2GNTs{~gQDm$^k z)B?W5-W~;03u3=apY5e1*!>U$pmF%`-FpSt$F}^|$YpkGq^NImJ-9-y=>1Yw-L>+J zL7=iT2dGN>yNmbMhR`J`1K$VOLUn<>UpZi4I$QW?)&@|76cD~}s!@N7^&JdVa5vg+ zLs0MGzg6Y6WLo0>xUTr=J^S6K!NAjys*dM+7gYawWTX@@-c@;0?q7d3Yo$q-`C|H& zTwpS*k`GqP0CgR3oM7meihUjLy*h;c`KRtJFBRkzPDg+lVdJN$@|!{1UTQOH6jQ>>IWmiyiIvX7=|B#1y}H$C z-cr+?v-fghs>QwYvn9!j0)*`Tgwv~eK90}vk#`aK+DWKh|zZP@H?ButZ2zsc3u4toN062NgnJvtoRG2}|io7fWDH3QtaA zRtB>?Hk6r?l|w(h&FdtKPpViHMFr|bDw16i-01!_ZM*f4$TH{ghs8@hN7?5c(Cph3 zcj4a8Fx)e`r`%Dprt1S8InR9iK1^hP|K*+Ey{12?^NEl~|^M)ug*^}Ng(vGf%FJxf^%s{J0%g7>)WLBTl*^(l?e_KyNEp%^@O!;B5 zAOcFP4)(&?yzVg0ncQsbJ0Gl@P-@utkBgm;g`2Z=J1CVaqXe|0t!paTBUx52(!Y)T zS^!N;Kc=(Zt_K8FojxXo^6a8&f_=GapE`9H7OaccJ$g{6n12o=*zU;rj2li-%+kFl zc5cz&VHN@?TrH|g%Q0-_LGRk~flczmRIBW}kjnrXJBLm*eydBm0W$|9jV0X5HQ<^MxZsGn>vL=lzgZFI}PmAYa_k94r(gLo}4?K-`Uknt?#g zdCQAO-4Ua&|Ki=$n# zOQ~Pp7pP5FyIJH{z%qI94^aYm0JWG+%0#U@^%|xjUih+_u_;!DUup&5w^(#?H z-jUQU;0D^NCU}{${x_*7Kk^jMenFE6_2p&ID4q-p_-K>cEZLDA-wrgvcYuXZRo@a+ zdONk!E(Je1=t(ajRVqJ{UC)waC!jz!zUZB+G4GGftTiqIbtHN*IQ1)DE-5S2#700) zges0a{iD4@T%Dff<;~l zZow0Cb0eF$>mDj;X96=WKT~wizONOY%o~Ks#BufCenZo)ZKu2_drFq}LUsw@w(w>* z0%#`Scl$Kj_rNCPEQ)xSDNi#BLlW6{3+PwF+u-MaFu5X7lB4Z)yI638I{&pGI=*(9kXAoDDQQN#u&dODpSl zuy??DYg1cuO24Wwmx6L<{FbtpPCikG&(-hR1eIxbqa*4&C`d&xMsYidkM96Ge(fNJ z;@O4%IKX=8Yqe0IW1k*av1O;2jO(57;|#a*u%7DAodae=`}K>bOl091vxT!A?7{r}@gQhA zD%;shgPTy$c9B-+`8$ggTeSpIHgZS#XTPd@6ocTm5%B7))gC|0&~LY{`K|oKXZoof zd4}bj^o*r&EnEcNbQn<#6*8kihanKZNI2U8Aht{b 
zpL|itbH*kKu399NEYu;7YRvv{D$1y#c+Q2=+=rXs_m`67h>7=D*p^gID_Zu>(A;ik z`#e|`A&Z$_jt;*gqEgZ9`4S_TZj{~gG4dIe*EK_&<2Kx8KRb({X}rZLw03w$R6?>P zQM(k}hR6RTyDZ^h&}co+_%GEi91N`cZpw;@Phr8h=`u$~`k9Z}w8{cy zzx?8IYc6w?pslR|Hyde6OZSzId&%7YlB#_Dwy8=tG^<0^7nrj3gS!?%JO}^6;@_Az z&5ffw@`e3!sB$7{P5zdCq5`A{vRk5-EcURc`FrjrqNQ+YoawZ?NLKU-FEJ=ojh$Pi z9V*mkT&d#Srl%hcPna&cf*jG^!Fy+0VD^SKX+?-n=^V{@R7d3X!?RErX1 z+epY0f$6pv&R8U=Sjs0w{Q2@Tj#9_EW2xuD#c6}QN)nSV^CR9DR?Frf-Qj$Ht*WOL z0brU^DWIFZYXt%Ey4m%_IQ7iW^MxhVL`;Yf`96m>Ja$vEWts3jO_;{TfK2qqH&G7C z{bC(h^JaQtZ~39yi~Z9^vjx$Ie!DgwQl!T|4#R?#gOp6C*~XSX_3;iV9p}6A^qWwq;;eV%B7*~r1Ir&qL&x>F-wb(gdB;>U zwy-RdH$3}$&wc*ysBBM0h9BOK|NMiE{6c0>SoVs7C*#Q%MMDL=1rlPClPwj;DmokC z;4XIZu{_)>nxCt?o$UElq-Q`1BEID*-Y7cqp`vcX-oN^2{$eRF1gK=96){5V;ct4% z8IO>k^+HCBw=%o_>5NEa=Z)nvMcW0bozOutf)^Arx}dfQ;P>FMz0 zJ!4M~uU&XFE`kckCcG*nGBUV=NOF{Iet~9XR@rRp#rh1Ce;>Q@@S16CMEfkEM{%aJXbqgS1w+ zrd@0Ws|SjKC;YYE!?>n!{Gb}g^yBp~F zQ!G-8iHE~~vFT+u%Ot%~vDyyNHoYGINGOhrVyxS>NkUjYrbVWLAb62BEhWyT!>rhT zlHSi!GeSO=f;(zYNj3IPSEGNC9U_V3)%b$y{z?`eg@z!%ZpFP*RkBab_J#Isj`}!T zxH@T=jqN|B7kFER=3U==S*SvB$Zn>l^{=L@JA)F*2tBr|i$%%Q2>NNKN`qT9X1H+k zckV;@5EV!NUnMkVHoXBdz!$t@^}li0orf=;Iy-wnO5t#N(;jKIF9C|TEbu~x^0_<`4ilPZ>$$a8_k$_`mIiLDmL$9Zlm_$%YDY+DzSA)lEV6I9Nikdu>-YhvJp{ZO};_#pb>Rk0SKM+KwwfP<=q+&(ob6dSU!BOps=~{bP!I+9G zGUF-9mzqCwnfJOl@d~JTc!neh%tCbr2fZq~GSarb=5xlBiE;J(Ti6NV?N;GmvutLT zb}usj$I96qg8+SI7L|S5m79q&{H%O`2Di;U)BPSl)kkOd^>x=vV{9JzG2Q-o3ak^O z{ZQ5G2eEzq$^&L>lH79)z1Kq>v@k}BTMLB2;FSihiIx66vE^N(ydyUr!xl1FWyY&2 z@_Muz3@$$y{P$P1O9r6w2;yviF&JLW(0p12ShK6I*=!eaMv;h-?T4JEBT9xwLxfBr zmLXp~N#^KY(AclHQLYoRspgywnO}3xsom`Py0{C{>*x0q}dmj za#b7ew=8?F>gvC_7=E~JsW@{C9KVUTRyh>ck<9%Lz=x>JMP!9b#F;&D_s4gD09Ei~ zrJI82cT3Niq{n3OOYB3XK5*aNTt0K#gq??5%$#OECHyHg!-)smv;ybM%hWD zr5S?eFb25)*YmjVe->5yNr1Z-CaC^EwozK%?a zY8h68OC3hqZo|Ua%Ep)7NF|d3ewC);e_g!{2KX8J5*>Kagyotn@iLl#WFv>fb3L7R z?~`=y1;6WciV196WJ=RNIB+y{Pw0nmjJ)z!c!+$j|3b7{En)Cn;1e@0TjWkoBDnO& zH;N7N2qkZyNZER@EYSnUL`@LlxkDrVZOynoajFv#l$J0{^~Yl-|LLB~Uv_cmk6^w^ 
z=lN#O``1A1J9;_a$5*t*&ITGeKoAswkRzFl!(MZ1lPObk{9GJq=r31(S4N)VdeAS_aCpSSCdr?~n1HxBE98nET z_HN~_DT8Mpw@I`+`WEW}@v0mo!_6`J}z33Ock%xfHkzD7JPH1>XN<6)@li?)>1W>&_k`jihh1eda>=Y$f6NF^v}MW&+e>(DcV zuyA6nSOi}b_|WxBU~XgYGj7}RdUM*UKoR1hTPQPm=0YU17OBvDK}F0}tl1Ozknfq{ z2kBL;`De<641WD&C4tD^L5sG$IO$ZzL*zP`H1sxY703W}~$Xq_2~Y$F;6!=Y zf5h@>7}`j{^YY!am52^eEIOl;0HkNl^WP9H$g}>{@J{G;VDnjbhWOJr3Iu6kF3FYE z27o>jD+UrUuHs6w1{5mZ{s7jm+uFo*fFQZR4M-Jy%1StOOxiX>F*87`UJ)Bo*9Oq& z0U9?F(R515)TgHUT?PK3&hD6|w)7i)qz}w49t>C~hd3noBe(cpO)Q>hw9_b5sR}@Jl$yYpw3h;~Ypdkb>|4Q;La@q0avZzrZ!%bF!y& z+vSOii-VvNDuIbV)Q37TS0Ja0n=vaGIt4H@N`{rzqJQ^SB=vgdA82Z^k+dr)er0WE z$I~!rihK*y`6vs^ml6yF^(rGj|5@xaUR*DAI&`{Cr@0cF6c~<)n$@FCYAq{)D2pBk zwFFqnxAdj{F)$&`^DA%j@M#8@BXOiG_=?ufhP!_@0EhQ%1vhkz&fS&#)6@SW$d8i~ zJiVnW`atEKDsWeC81nyA0JZnyQd-o_mEOS2MFDBMJc}3h?SC`IMGyi5R3aO%q3-{G zlp=ZnBtP{jJ!R{E40;IuN9q)c5z!V$;FSk%FFOrz7C_Qn)f#+$!>GUcn+;D^10k7X^e24p7j`#jIj_ec#T_N 0 + O = O[mask] + E = E[mask] + + # Compute chi-squared statistic + chi2_stat = ((O - E) ** 2 / E).sum() + + # Move dof and chi2_stat to the same device + dof = torch.tensor(dof, dtype=torch.float32, device=chi2_stat.device) + + # Compute p-value using the survival function (1 - CDF) + p_value = torch.special.gammaincc(dof / 2, chi2_stat / 2).item() + + return chi2_stat, p_value, dof, E + + def _pqm_test( - x_samples: np.ndarray, - y_samples: np.ndarray, + x_samples: torch.Tensor, + y_samples: torch.Tensor, num_refs: int, - whiten: bool, - x_frac: Optional[float] = None, - gauss_frac: float = 0.0, + z_score_norm: bool, + x_frac: Optional[float], + gauss_frac: float, + device: str, ): """ Helper function to perform the PQM test and return the results from @@ -38,14 +122,22 @@ def _pqm_test( Parameters ---------- x_samples : np.ndarray - Samples from the first distribution, test samples. + Samples from the first distribution. Must have shape (N, *D) N is the + number of x samples, and D is the dimensionality of the samples. 
y_samples : np.ndarray - Samples from the second distribution, reference samples. + Samples from the second distribution. Must have shape (M, *D) M is the + number of y samples, and D is the dimensionality of the samples. num_refs : int - Number of reference samples to use. - whiten : bool - If True, whiten the samples by subtracting the mean and dividing by the - standard deviation. + Number of reference samples to use. These samples will be drawn from + x_samples, y_samples, and/or a Gaussian distribution, see the note + below. + re_tessellation : Optional[int] + Number of times pqm_pvalue is called, re-tesselating the space. No + re_tessellation if None (default). + z_score_norm : bool + If True, z_score_norm the samples by subtracting the mean and dividing by the + standard deviation. mean and std are calculated from the combined + x_samples and y_samples. x_frac : float Fraction of x_samples to use as reference samples. ``x_frac = 1`` will use only x_samples as reference samples, ``x_frac = 0`` will use only @@ -58,6 +150,21 @@ def _pqm_test( support of the reference samples if pathological behavior is expected. Default: 0.0 no gaussian samples. + Note + ---- + When using ``x_frac`` and ``gauss_frac``, note that the number of + reference samples from the x_samples, y_samples, and Gaussian + distribution will be determined by a multinomial distribution. This + means that the actual number of reference samples from each distribution + may not be exactly equal to the requested fractions, but will on average + equal those numbers. The mean relative number of reference samples drawn + from x_samples, y_samples, and Gaussian is ``Nx=x_frac*(1-gauss_frac)``, + ``Ny=(1-x_frac)*(1-gauss_frac)``, and ``Ng=gauss_frac`` respectively. + For best results, we suggest using a large number of re-tessellations, + though this is our recommendation in any case. + device : str + Device to use for computation. Default: 'cpu'. 
If 'cuda' is selected, + Note ---- When using ``x_frac`` and ``gauss_frac``, note that the number of @@ -76,8 +183,8 @@ def _pqm_test( tuple Results from scipy.stats.chi2_contingency function. """ - nx, *D = x_samples.shape - ny, *D = y_samples.shape + nx = x_samples.shape[0] + ny = y_samples.shape[0] if (nx + ny) <= num_refs + 2: raise ValueError( "Number of reference samples (num_ref) must be less than the number of x/y samples. Ideally much less." @@ -86,7 +193,7 @@ def _pqm_test( warnings.warn( "Number of samples is small (less than twice the number of reference samples). Result will have high variance and/or be non-discriminating." ) - if whiten: + if z_score_norm: mean, std = _mean_std(x_samples, y_samples) y_samples = (y_samples - mean) / std x_samples = (x_samples - mean) / std @@ -94,48 +201,73 @@ def _pqm_test( # Determine fraction of x_samples to use as reference samples x_frac = nx / (nx + ny) if x_frac is None else x_frac - # Determine number of samples from each distribution (x_samples, y_samples, gaussian) - Nx, Ny, Ng = np.random.multinomial( - num_refs, - [x_frac * (1.0 - gauss_frac), (1.0 - x_frac) * (1.0 - gauss_frac), gauss_frac], + # Determine number of samples from each distribution + probs = torch.tensor( + [ + x_frac * (1.0 - gauss_frac), + (1.0 - x_frac) * (1.0 - gauss_frac), + gauss_frac, + ], + device=device, + ) + + counts = Multinomial(total_count=num_refs, probs=probs).sample() + counts = counts.round().long() + Nx, Ny, Ng = counts.tolist() + assert (Nx + Ny + Ng) == num_refs, ( + f"Something went wrong. Nx={Nx}, Ny={Ny}, Ng={Ng} should sum to num_refs={num_refs}" ) - assert (Nx + Ny + Ng) == num_refs, f"Something went wrong. 
Nx={Nx}, Ny={Ny}, Ng={Ng} should sum to num_refs={num_refs}" # fmt: skip # Collect reference samples from x_samples - xrefs = np.random.choice(nx, Nx, replace=False) - xrefs, x_samples = x_samples[xrefs], np.delete(x_samples, xrefs, axis=0) + x_indices = torch.randperm(nx, device=device) + if Nx > nx: + raise ValueError("Cannot sample more references from x_samples than available") + xrefs_indices = x_indices[:Nx] + x_samples_indices = x_indices[Nx:] + + xrefs = x_samples[xrefs_indices] + x_samples = x_samples[x_samples_indices] # Collect reference samples from y_samples - yrefs = np.random.choice(ny, Ny, replace=False) - yrefs, y_samples = y_samples[yrefs], np.delete(y_samples, yrefs, axis=0) + y_indices = torch.randperm(ny, device=device) + if Ny > ny: + raise ValueError("Cannot sample more references from y_samples than available") + yrefs_indices = y_indices[:Ny] + y_samples_indices = y_indices[Ny:] + + yrefs = y_samples[yrefs_indices] + y_samples = y_samples[y_samples_indices] # Join the full set of reference samples - refs = np.concatenate([xrefs, yrefs], axis=0) + refs = torch.cat([xrefs, yrefs], dim=0) - # get gaussian reference points if requested + # Get gaussian reference points if requested if Ng > 0: m, s = _mean_std(x_samples, y_samples) - gauss_refs = np.random.normal( - loc=m, - scale=s, - size=(Ng, *D), + gauss_refs = torch.normal( + mean=m.repeat(Ng, 1), + std=s.repeat(Ng, 1), ) - refs = np.concatenate([refs, gauss_refs], axis=0) + refs = torch.cat([refs, gauss_refs], dim=0) - # Build KDtree to measure distances - tree = KDTree(refs) + num_refs = refs.shape[0] - idx = tree.query(x_samples, k=1, workers=-1)[1] - counts_x = np.bincount(idx, minlength=num_refs) + # Compute nearest reference for x_samples + distances = torch.cdist(x_samples, refs) + idx = distances.argmin(dim=1) + counts_x = torch.bincount(idx, minlength=num_refs) - idx = tree.query(y_samples, k=1, workers=-1)[1] - counts_y = np.bincount(idx, minlength=num_refs) + # Compute nearest 
reference for y_samples + distances = torch.cdist(y_samples, refs) + idx = distances.argmin(dim=1) + counts_y = torch.bincount(idx, minlength=num_refs) # Remove reference samples with no counts C = (counts_x > 0) | (counts_y > 0) - counts_x, counts_y = counts_x[C], counts_y[C] + counts_x = counts_x[C] + counts_y = counts_y[C] - n_filled_bins = np.sum(C) + n_filled_bins = C.sum().item() if n_filled_bins == 1: raise ValueError( """ @@ -155,17 +287,20 @@ def _pqm_test( """ ) - return chi2_contingency(np.stack([counts_x, counts_y])) + # Perform chi-squared test + counts = torch.stack([counts_x, counts_y]) + return _chi2_contingency(counts, device) def pqm_pvalue( - x_samples: np.ndarray, - y_samples: np.ndarray, + x_samples, + y_samples, num_refs: int = 100, re_tessellation: Optional[int] = None, - whiten: bool = False, + z_score_norm: bool = False, x_frac: Optional[float] = None, gauss_frac: float = 0.0, + device: str = 'cpu', ): """ Perform the PQM test of the null hypothesis that `x_samples` and `y_samples` @@ -187,8 +322,8 @@ def pqm_pvalue( re_tessellation : Optional[int] Number of times pqm_pvalue is called, re-tesselating the space. No re_tessellation if None (default). - whiten : bool - If True, whiten the samples by subtracting the mean and dividing by the + z_score_norm : bool + If True, z_score_norm the samples by subtracting the mean and dividing by the standard deviation. mean and std are calculated from the combined x_samples and y_samples. x_frac : float @@ -215,6 +350,8 @@ def pqm_pvalue( ``Ny=(1-x_frac)*(1-gauss_frac)``, and ``Ng=gauss_frac`` respectively. For best results, we suggest using a large number of re-tessellations, though this is our recommendation in any case. + device : str + Device to use for computation. Default: 'cpu'. If 'cuda' is selected, Returns ------- @@ -222,30 +359,40 @@ def pqm_pvalue( pvalue(s). Null hypothesis that both samples are drawn from the same distribution. 
""" + # Move samples to torch tensors on the selected device + x_samples = torch.tensor(x_samples, device=device) + y_samples = torch.tensor(y_samples, device=device) + if re_tessellation is not None: return [ pqm_pvalue( x_samples, y_samples, num_refs=num_refs, - whiten=whiten, + z_score_norm=z_score_norm, x_frac=x_frac, gauss_frac=gauss_frac, + device=device, ) for _ in range(re_tessellation) ] - _, pvalue, _, _ = _pqm_test(x_samples, y_samples, num_refs, whiten, x_frac, gauss_frac) - return pvalue + chi2_stat, p_value, dof, _ = _pqm_test( + x_samples, y_samples, num_refs, z_score_norm, x_frac, gauss_frac, device + ) + + # Return p-value as a float + return p_value if isinstance(p_value, float) else float(p_value) def pqm_chi2( - x_samples: np.ndarray, - y_samples: np.ndarray, + x_samples, + y_samples, num_refs: int = 100, re_tessellation: Optional[int] = None, - whiten: bool = False, + z_score_norm: bool = False, x_frac: Optional[float] = None, gauss_frac: float = 0.0, + device: str = 'cpu', ): """ Perform the PQM test of the null hypothesis that `x_samples` and `y_samples` @@ -265,10 +412,10 @@ def pqm_chi2( x_samples, y_samples, and/or a Gaussian distribution, see the note below. re_tessellation : Optional[int] - Number of times pqm_pvalue is called, re-tesselating the space. No + Number of times pqm_chi2 is called, re-tesselating the space. No re_tessellation if None (default). - whiten : bool - If True, whiten the samples by subtracting the mean and dividing by the + z_score_norm : bool + If True, z_score_norm the samples by subtracting the mean and dividing by the standard deviation. mean and std are calculated from the combined x_samples and y_samples. x_frac : float @@ -282,6 +429,8 @@ def pqm_chi2( determined from the combined x_samples/y_samples. This ensures full support of the reference samples if pathological behavior is expected. Default: 0.0 no gaussian samples. + device : str + Device to use for computation. Default: 'cpu'. 
If 'cuda' is selected, Note ---- @@ -312,26 +461,31 @@ def pqm_chi2( float or list chi2 statistic(s). """ + # Move samples to torch tensors on the selected device + x_samples = torch.tensor(x_samples, device=device) + y_samples = torch.tensor(y_samples, device=device) + if re_tessellation is not None: return [ pqm_chi2( x_samples, y_samples, num_refs=num_refs, - whiten=whiten, + z_score_norm=z_score_norm, x_frac=x_frac, gauss_frac=gauss_frac, + device=device, ) for _ in range(re_tessellation) ] - chi2_stat, _, dof, _ = _pqm_test(x_samples, y_samples, num_refs, whiten, x_frac, gauss_frac) + chi2_stat, _, dof, _ = _pqm_test( + x_samples, y_samples, num_refs, z_score_norm, x_frac, gauss_frac, device + ) + + # Rescale chi2 statistic if necessary if dof != num_refs - 1: - # Rescale chi2 to new value which has the same cumulative probability - if chi2_stat / dof < 10: - cp = chi2.sf(chi2_stat, dof) - chi2_stat = chi2.isf(cp, num_refs - 1) - else: - chi2_stat = chi2_stat * (num_refs - 1) / dof - dof = num_refs - 1 - return chi2_stat + chi2_stat = rescale_chi2(chi2_stat, dof, num_refs - 1, device) + + # Return chi2_stat as a float + return chi2_stat.item() if isinstance(chi2_stat, torch.Tensor) else float(chi2_stat) \ No newline at end of file diff --git a/src/pqm/test_gaussian.py b/src/pqm/test_gaussian.py index b67398b..f99cd6d 100644 --- a/src/pqm/test_gaussian.py +++ b/src/pqm/test_gaussian.py @@ -8,6 +8,6 @@ def test(): y_samples = np.random.normal(size=(500, 50)) x_samples = np.random.normal(size=(250, 50)) - new.append(pqm_pvalue(x_samples, y_samples)) + new.append(pqm_pvalue(x_samples, y_samples, device = 'cpu')) assert np.abs(np.mean(new) - 0.5) < 0.15 From 4cbe01dd841af052a5aa6e1708e0123e49061959 Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Mon, 28 Oct 2024 14:35:25 -0400 Subject: [PATCH 2/7] Updated test_gaussian to change from whiten to z_score_norm --- tests/test_gaussian.py | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git 
a/tests/test_gaussian.py b/tests/test_gaussian.py index f4d011c..0006746 100644 --- a/tests/test_gaussian.py +++ b/tests/test_gaussian.py @@ -3,16 +3,16 @@ import pytest -@pytest.mark.parametrize("whiten", [True, False]) +@pytest.mark.parametrize("z_score_norm", [True, False]) @pytest.mark.parametrize("num_refs", [20, 100]) @pytest.mark.parametrize("ndim", [1, 50]) -def test_pass_pvalue(whiten, num_refs, ndim): +def test_pass_pvalue(z_score_norm, num_refs, ndim): new = [] for _ in range(50): y_samples = np.random.normal(size=(500, ndim)) x_samples = np.random.normal(size=(250, ndim)) - new.append(pqm_pvalue(x_samples, y_samples, whiten=whiten, num_refs=num_refs)) + new.append(pqm_pvalue(x_samples, y_samples, z_score_norm=z_score_norm, num_refs=num_refs)) # Check for roughly uniform distribution of p-values assert np.abs(np.mean(new) - 0.5) < 0.15 @@ -45,17 +45,17 @@ def test_fail_pvalue(num_refs, ndim): assert np.mean(new) < 1e-3 -@pytest.mark.parametrize("whiten", [True, False]) +@pytest.mark.parametrize("z_score_norm", [True, False]) @pytest.mark.parametrize("num_refs", [20, 100]) @pytest.mark.parametrize("ndim", [1, 50]) -def test_fail_chi2(whiten, num_refs, ndim): +def test_fail_chi2(z_score_norm, num_refs, ndim): new = [] for _ in range(100): y_samples = np.random.normal(size=(500, ndim)) y_samples[:, 0] += 5 # one dim off by 5sigma x_samples = np.random.normal(size=(250, ndim)) - new.append(pqm_chi2(x_samples, y_samples, whiten=whiten, num_refs=num_refs)) + new.append(pqm_chi2(x_samples, y_samples, z_score_norm=z_score_norm, num_refs=num_refs)) new = np.array(new) assert (np.mean(new) / (num_refs - 1)) > 1.5 From 99a8e9b673720092afd31dc4c6f2276fd0c7701d Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Sun, 3 Nov 2024 19:27:19 -0500 Subject: [PATCH 3/7] Updated PQM with better handling between CPU (numpy) and GPU (torch) --- src/pqm/pqm.py | 562 ++++++++++++++++++++++++++++++++++--------------- 1 file changed, 388 insertions(+), 174 deletions(-) diff --git 
a/src/pqm/pqm.py b/src/pqm/pqm.py index 1460760..8b40432 100644 --- a/src/pqm/pqm.py +++ b/src/pqm/pqm.py @@ -3,12 +3,12 @@ import torch import numpy as np from scipy.stats import chi2_contingency, chi2 -from scipy.spatial import KDTree +from scipy.spatial.distance import cdist from torch.distributions import Multinomial +from typing import Optional, Union, Tuple __all__ = ("pqm_chi2", "pqm_pvalue") - def _mean_std(sample1, sample2, dim=0): """Get the mean and std of two combined samples without actually combining them.""" n1 = sample1.shape[dim] @@ -29,15 +29,17 @@ def _mean_std(sample1, sample2, dim=0): ) return m, s - def rescale_chi2(chi2_stat, orig_dof, target_dof, device): """ Rescale chi2 statistic using appropriate methods depending on the device. """ - - # Move tensors to CPU and convert to NumPy - chi2_stat_cpu = chi2_stat.cpu().item() # Convert to float - orig_dof_cpu = orig_dof.cpu().item() # Convert to float + if device.type == 'cuda': + # Move tensors to CPU and convert to NumPy + chi2_stat_cpu = chi2_stat.cpu().item() # Convert to float + orig_dof_cpu = orig_dof.cpu().item() # Convert to float + else: + chi2_stat_cpu = chi2_stat + orig_dof_cpu = orig_dof if orig_dof_cpu == target_dof: return chi2_stat_cpu @@ -48,83 +50,316 @@ def rescale_chi2(chi2_stat, orig_dof, target_dof, device): return chi2.isf(cp, target_dof) else: # Use simple scaling for large values - return chi2_stat_cpu * target_dof / orig_dof_cpu - + return chi2_stat_cpu * target_dof / orig_dof_cpu +def _chi2_contingency_torch( + counts_x: torch.Tensor, + counts_y: torch.Tensor +) -> Tuple[torch.Tensor, float, torch.Tensor, torch.Tensor]: + """ + Perform chi-squared contingency test using PyTorch tensors. + + Returns: + chi2_stat (torch.Tensor): Chi-squared statistic. + p_value (float): p-value. + dof (torch.Tensor): Degrees of freedom. + expected (torch.Tensor): Expected frequencies. 
+ """ + counts = torch.stack([counts_x, counts_y]) + + # Observed counts + O = counts.float() + + # Row sums and column sums + row_sums = O.sum(dim=1, keepdim=True) # shape (2, 1) + col_sums = O.sum(dim=0, keepdim=True) # shape (1, N) + total = O.sum() + + # Expected counts under the null hypothesis of independence + E = row_sums @ col_sums / total # shape (2, N) + + # Degrees of freedom + dof = (O.size(0) - 1) * (O.size(1) - 1) + + # Avoid division by zero + mask = E > 0 + O_masked = O[mask] + E_masked = E[mask] + + # Compute chi-squared statistic + chi2_stat = ((O_masked - E_masked) ** 2 / E_masked).sum() + + # Move dof and chi2_stat to the same device + dof = torch.tensor(dof, dtype=torch.float32, device=chi2_stat.device) + + # Compute p-value using the survival function (1 - CDF) + p_value = torch.special.gammaincc(dof / 2, chi2_stat / 2).item() + + return chi2_stat, p_value, dof, E -def _chi2_contingency(counts, device): +def _sample_reference_indices_numpy(Nx, nx, Ny, ny, Ng, x_samples, y_samples): """ - Computes the chi-squared statistic and p-value for a contingency table. + Helper function to sample references for CPU-based NumPy computations. Parameters ---------- - counts: torch.Tensor - 2xN tensor of counts for each category. - device : str - Device to use for computation. Default: 'cpu'. If 'cuda' is selected, + Nx : int + Number of references to sample from x_samples. + nx : int + Number of samples in x_samples. + Ny : int + Number of references to sample from y_samples. + ny : int + Number of samples in y_samples. + Ng : int + Number of references to sample from a Gaussian distribution. Returns ------- - tuple - chi2_stat, p_value, dof, expected + np.ndarray + References samples. 
""" - if device == 'cpu': - counts_np = counts.cpu().numpy() - chi2_stat, p_value, dof, expected = chi2_contingency(counts_np) - chi2_stat = torch.tensor(chi2_stat, device=device) - dof = torch.tensor(dof, device=device) - return chi2_stat, p_value, dof, expected - else: - # Observed counts - O = counts.float() - # Row sums and column sums - row_sums = O.sum(dim=1, keepdim=True) # shape (2, 1) - col_sums = O.sum(dim=0, keepdim=True) # shape (1, N) - total = O.sum() + if Nx > nx: + raise ValueError("Cannot sample more references from x_samples than available") + if Ny > ny: + raise ValueError("Cannot sample more references from y_samples than available") + + # Reference samples from x_samples + xrefs_indices = np.random.choice(nx, Nx, replace=False) + xrefs = x_samples[xrefs_indices] + x_samples = np.delete(x_samples, xrefs_indices, axis=0) + + # Reference samples from y_samples + yrefs_indices = np.random.choice(ny, Ny, replace=False) + yrefs = y_samples[yrefs_indices] + y_samples = np.delete(y_samples, yrefs_indices, axis=0) + + # Combine references + refs = np.concatenate([xrefs, yrefs], axis=0) + + # Gaussian references + if Ng > 0: + m, s = _mean_std(x_samples, y_samples) + gauss_refs = np.random.normal( + loc=m, + scale=s, + size=(Ng, ) + tuple(x_samples.shape[1:]) + ) + refs = np.concatenate([refs, gauss_refs], axis=0) + + return refs - # Expected counts under the null hypothesis of independence - E = row_sums @ col_sums / total # shape (2, N) +def _compute_distances_numpy(x_samples, y_samples, refs, current_num_refs, num_refs): + """ + Helper function to calculate distances for CPU-based NumPy computations. - # Degrees of freedom - dof = (O.size(0) - 1) * (O.size(1) - 1) + Parameters + ---------- + x_samples : np.ndarray + Samples from the first distribution. Must have shape (N, *D) N is the + number of x samples, and D is the dimensionality of the samples. + y_samples : np.ndarray + Samples from the second distribution. 
Must have shape (M, *D) M is the + number of y samples, and D is the dimensionality of the samples. + refs : np.ndarray + Reference samples. Must have shape (num_refs, *D) where D is the + dimensionality of the samples. + current_num_refs : int + Number of reference samples used in the test. + num_refs : int + Number of reference samples to use. - # Avoid division by zero - mask = E > 0 - O = O[mask] - E = E[mask] + Returns + ------- + tuple + Results from scipy.stats.chi2_contingency. + """ + + # Compute distances + distances_x = cdist(x_samples, refs, metric='euclidean') + distances_y = cdist(y_samples, refs, metric='euclidean') + + # Nearest references + idx_x = np.argmin(distances_x, axis=1) + idx_y = np.argmin(distances_y, axis=1) + + # Counts + counts_x = np.bincount(idx_x, minlength=current_num_refs) + counts_y = np.bincount(idx_y, minlength=current_num_refs) + + # Remove references with no counts + C = (counts_x > 0) | (counts_y > 0) + counts_x = counts_x[C] + counts_y = counts_y[C] + + n_filled_bins = np.sum(C) + if n_filled_bins == 1: + raise ValueError( + """ + Only one Voronoi cell has samples, so chi^2 cannot + be computed. This is likely due to a small number + of samples or a pathological distribution. If possible, + increase the number of x_samples and y_samples. + """ + ) + if n_filled_bins < (num_refs // 2): + warnings.warn( + """ + Less than half of the Voronoi cells have any samples in them. + Possibly due to a small number of samples or a pathological + distribution. Result may be unreliable. If possible, increase the + number of x_samples and y_samples. + """ + ) + + # Perform chi-squared test using SciPy + contingency_table = np.stack([counts_x, counts_y]) + return chi2_contingency(contingency_table) - # Compute chi-squared statistic - chi2_stat = ((O - E) ** 2 / E).sum() +def _sample_reference_indices_torch(Nx, nx, Ny, ny, Ng, x_samples, y_samples, device): + """ + Helper function to sample references for GPU-based Torch computations. 
- # Move dof and chi2_stat to the same device - dof = torch.tensor(dof, dtype=torch.float32, device=chi2_stat.device) + Parameters + ---------- + Nx : int + Number of references to sample from x_samples. + nx : int + Number of samples in x_samples. + Ny : int + Number of references to sample from y_samples. + ny : int + Number of samples in y_samples. + Ng : int + Number of references to sample from a Gaussian distribution. - # Compute p-value using the survival function (1 - CDF) - p_value = torch.special.gammaincc(dof / 2, chi2_stat / 2).item() + Returns + ------- + np.ndarray + References samples. + """ + + if Nx > nx: + raise ValueError("Cannot sample more references from x_samples than available") + if Ny > ny: + raise ValueError("Cannot sample more references from y_samples than available") + + # Reference samples from x_samples + x_indices = torch.randperm(nx, device=device) + xrefs_indices = x_indices[:Nx] + x_samples_indices = x_indices[Nx:] + xrefs = x_samples[xrefs_indices] + x_samples = x_samples[x_samples_indices] + + # Reference samples from y_samples + y_indices = torch.randperm(ny, device=device) + yrefs_indices = y_indices[:Ny] + y_samples_indices = y_indices[Ny:] + yrefs = y_samples[yrefs_indices] + y_samples = y_samples[y_samples_indices] + + # Combine references + refs = torch.cat([xrefs, yrefs], dim=0) + + # Gaussian references + if Ng > 0: + m, s = _mean_std(x_samples, y_samples) + # Ensure m and s have the correct shape + if m.dim() == 1: + m = m.unsqueeze(0) + if s.dim() == 1: + s = s.unsqueeze(0) + gauss_refs = torch.normal( + mean=m.repeat(Ng, 1), + std=s.repeat(Ng, 1), + ) + refs = torch.cat([refs, gauss_refs], dim=0) + return refs + +def _compute_distances_torch(x_samples, y_samples, refs, current_num_refs, num_refs): + """ + Helper function to calculate distances for GPU-based Torch computations. - return chi2_stat, p_value, dof, E + Parameters + ---------- + x_samples : torch.Tensor + Samples from the first distribution. 
Must have shape (N, *D) N is the + number of x samples, and D is the dimensionality of the samples. + y_samples : torch.Tensor + Samples from the second distribution. Must have shape (M, *D) M is the + number of y samples, and D is the dimensionality of the samples. + refs : torch.Tensor + Reference samples. Must have shape (num_refs, *D) where D is the + dimensionality of the samples. + current_num_refs : int + Number of reference samples used in the test. + num_refs : int + Number of reference samples to use. + Returns + ------- + tuple + Results from the PyTorch implementation of chi2_contingency. + """ + + # Compute distances and find nearest references + distances_x = torch.cdist(x_samples, refs) + idx_x = distances_x.argmin(dim=1) + counts_x = torch.bincount(idx_x, minlength=current_num_refs) + + distances_y = torch.cdist(y_samples, refs) + idx_y = distances_y.argmin(dim=1) + counts_y = torch.bincount(idx_y, minlength=current_num_refs) + + # Remove references with no counts + C = (counts_x > 0) | (counts_y > 0) + counts_x = counts_x[C] + counts_y = counts_y[C] + + n_filled_bins = C.sum().item() + if n_filled_bins == 1: + raise ValueError( + """ + Only one Voronoi cell has samples, so chi^2 cannot + be computed. This is likely due to a small number + of samples or a pathological distribution. If possible, + increase the number of x_samples and y_samples. + """ + ) + if n_filled_bins < (num_refs // 2): + warnings.warn( + """ + Less than half of the Voronoi cells have any samples in them. + Possibly due to a small number of samples or a pathological + distribution. Result may be unreliable. If possible, increase the + number of x_samples and y_samples. 
+ """ + ) + + # Perform chi-squared test using the PyTorch implementation + chi2_stat, p_value, dof, expected = _chi2_contingency_torch(counts_x, counts_y) + return chi2_stat, p_value, dof, expected def _pqm_test( - x_samples: torch.Tensor, - y_samples: torch.Tensor, + x_samples: Union[np.ndarray, torch.Tensor], + y_samples: Union[np.ndarray, torch.Tensor], num_refs: int, z_score_norm: bool, x_frac: Optional[float], gauss_frac: float, - device: str, -): + device: str = 'cpu', +) -> Tuple: """ Helper function to perform the PQM test and return the results from - chi2_contingency. + chi2_contingency (using SciPy or a PyTorch implementation). Parameters ---------- - x_samples : np.ndarray + y_samples : np.ndarray or torch.Tensor Samples from the first distribution. Must have shape (N, *D) N is the number of x samples, and D is the dimensionality of the samples. - y_samples : np.ndarray + y_samples : np.ndarray or torch.Tensor Samples from the second distribution. Must have shape (M, *D) M is the number of y samples, and D is the dimensionality of the samples. num_refs : int @@ -149,19 +384,6 @@ def _pqm_test( determined from the combined x_samples/y_samples. This ensures full support of the reference samples if pathological behavior is expected. Default: 0.0 no gaussian samples. - - Note - ---- - When using ``x_frac`` and ``gauss_frac``, note that the number of - reference samples from the x_samples, y_samples, and Gaussian - distribution will be determined by a multinomial distribution. This - means that the actual number of reference samples from each distribution - may not be exactly equal to the requested fractions, but will on average - equal those numbers. The mean relative number of reference samples drawn - from x_samples, y_samples, and Gaussian is ``Nx=x_frac*(1-gauss_frac)``, - ``Ny=(1-x_frac)*(1-gauss_frac)``, and ``Ng=gauss_frac`` respectively. 
- For best results, we suggest using a large number of re-tessellations, - though this is our recommendation in any case. device : str Device to use for computation. Default: 'cpu'. If 'cuda' is selected, @@ -181,116 +403,84 @@ def _pqm_test( Returns ------- tuple - Results from scipy.stats.chi2_contingency function. + Results from scipy.stats.chi2_contingency or the PyTorch implementation. """ + + # Determine if we're working with NumPy or PyTorch + is_numpy = isinstance(x_samples, np.ndarray) and isinstance(y_samples, np.ndarray) + is_torch = isinstance(x_samples, torch.Tensor) and isinstance(y_samples, torch.Tensor) + + if not (is_numpy or is_torch): + raise TypeError("x_samples and y_samples must both be either NumPy arrays or PyTorch tensors.") + + # Validate sample sizes nx = x_samples.shape[0] ny = y_samples.shape[0] if (nx + ny) <= num_refs + 2: raise ValueError( - "Number of reference samples (num_ref) must be less than the number of x/y samples. Ideally much less." + "Number of reference samples (num_refs) must be less than the number of x/y samples. Ideally much less." ) elif (nx + ny) < 2 * num_refs: warnings.warn( - "Number of samples is small (less than twice the number of reference samples). Result will have high variance and/or be non-discriminating." + "Number of samples is small (less than twice the number of reference samples). " + "Result will have high variance and/or be non-discriminating." 
) + + # Z-score normalization if z_score_norm: mean, std = _mean_std(x_samples, y_samples) - y_samples = (y_samples - mean) / std - x_samples = (x_samples - mean) / std - + if is_numpy: + x_samples = (x_samples - mean) / std + y_samples = (y_samples - mean) / std + elif is_torch: + x_samples = (x_samples - mean) / std + y_samples = (y_samples - mean) / std + # Determine fraction of x_samples to use as reference samples - x_frac = nx / (nx + ny) if x_frac is None else x_frac - + if x_frac is None: + x_frac = nx / (nx + ny) + # Determine number of samples from each distribution - probs = torch.tensor( - [ - x_frac * (1.0 - gauss_frac), - (1.0 - x_frac) * (1.0 - gauss_frac), - gauss_frac, - ], - device=device, - ) - - counts = Multinomial(total_count=num_refs, probs=probs).sample() - counts = counts.round().long() - Nx, Ny, Ng = counts.tolist() - assert (Nx + Ny + Ng) == num_refs, ( - f"Something went wrong. Nx={Nx}, Ny={Ny}, Ng={Ng} should sum to num_refs={num_refs}" - ) - - # Collect reference samples from x_samples - x_indices = torch.randperm(nx, device=device) - if Nx > nx: - raise ValueError("Cannot sample more references from x_samples than available") - xrefs_indices = x_indices[:Nx] - x_samples_indices = x_indices[Nx:] - - xrefs = x_samples[xrefs_indices] - x_samples = x_samples[x_samples_indices] - - # Collect reference samples from y_samples - y_indices = torch.randperm(ny, device=device) - if Ny > ny: - raise ValueError("Cannot sample more references from y_samples than available") - yrefs_indices = y_indices[:Ny] - y_samples_indices = y_indices[Ny:] - - yrefs = y_samples[yrefs_indices] - y_samples = y_samples[y_samples_indices] - - # Join the full set of reference samples - refs = torch.cat([xrefs, yrefs], dim=0) - - # Get gaussian reference points if requested - if Ng > 0: - m, s = _mean_std(x_samples, y_samples) - gauss_refs = torch.normal( - mean=m.repeat(Ng, 1), - std=s.repeat(Ng, 1), + if is_numpy: + counts = np.random.multinomial( + num_refs, + 
[x_frac * (1.0 - gauss_frac), (1.0 - x_frac) * (1.0 - gauss_frac), gauss_frac], ) - refs = torch.cat([refs, gauss_refs], dim=0) - - num_refs = refs.shape[0] - - # Compute nearest reference for x_samples - distances = torch.cdist(x_samples, refs) - idx = distances.argmin(dim=1) - counts_x = torch.bincount(idx, minlength=num_refs) - - # Compute nearest reference for y_samples - distances = torch.cdist(y_samples, refs) - idx = distances.argmin(dim=1) - counts_y = torch.bincount(idx, minlength=num_refs) - - # Remove reference samples with no counts - C = (counts_x > 0) | (counts_y > 0) - counts_x = counts_x[C] - counts_y = counts_y[C] - - n_filled_bins = C.sum().item() - if n_filled_bins == 1: - raise ValueError( - """ - Only one Voronoi cell has samples, so chi^2 cannot - be computed. This is likely due to a small number - of samples or a pathological distribution. If possible, - increase the number of x_samples and y_samples. - """ + Nx, Ny, Ng = counts + elif is_torch: + probs = torch.tensor( + [ + x_frac * (1.0 - gauss_frac), + (1.0 - x_frac) * (1.0 - gauss_frac), + gauss_frac, + ], + device=device, ) - if n_filled_bins < (num_refs // 2): - warnings.warn( - """ - Less than half of the Voronoi cells have any samples in them. - Possibly due to a small number of samples or a pathological - distribution. Result may be unreliable. If possible, increase the - number of x_samples and y_samples. - """ + counts_tensor = Multinomial(total_count=num_refs, probs=probs).sample() + counts = counts_tensor.round().long().cpu().numpy() + Nx, Ny, Ng = counts.tolist() + + # Validate counts + if Nx + Ny + Ng != num_refs: + raise ValueError( + f"Something went wrong. 
Nx={Nx}, Ny={Ny}, Ng={Ng} should sum to num_refs={num_refs}" ) - - # Perform chi-squared test - counts = torch.stack([counts_x, counts_y]) - return _chi2_contingency(counts, device) - + + # Sampling reference indices + if is_numpy: + refs = _sample_reference_indices_numpy(Nx, nx, Ny, ny, Ng, x_samples, y_samples) + elif is_torch: + refs = _sample_reference_indices_torch(Nx, nx, Ny, ny, Ng, x_samples, y_samples, device) + + # Update num_refs in case Gaussian samples were added + current_num_refs = refs.shape[0] + + # Compute nearest references and counts + if is_numpy: + return _compute_distances_numpy(x_samples, y_samples, refs, current_num_refs, num_refs) + + elif is_torch: + return _compute_distances_torch(x_samples, y_samples, refs, current_num_refs, num_refs) def pqm_pvalue( x_samples, @@ -320,7 +510,7 @@ def pqm_pvalue( x_samples, y_samples, and/or a Gaussian distribution, see the note below. re_tessellation : Optional[int] - Number of times pqm_pvalue is called, re-tesselating the space. No + Number of times _pqm_test is called, re-tesselating the space. No re_tessellation if None (default). z_score_norm : bool If True, z_score_norm the samples by subtracting the mean and dividing by the @@ -337,6 +527,8 @@ def pqm_pvalue( determined from the combined x_samples/y_samples. This ensures full support of the reference samples if pathological behavior is expected. Default: 0.0 no gaussian samples. + device : str + Device to use for computation. Default: 'cpu'. Note ---- @@ -350,8 +542,7 @@ def pqm_pvalue( ``Ny=(1-x_frac)*(1-gauss_frac)``, and ``Ng=gauss_frac`` respectively. For best results, we suggest using a large number of re-tessellations, though this is our recommendation in any case. - device : str - Device to use for computation. Default: 'cpu'. If 'cuda' is selected, + Returns ------- @@ -359,9 +550,20 @@ def pqm_pvalue( pvalue(s). Null hypothesis that both samples are drawn from the same distribution. 
""" - # Move samples to torch tensors on the selected device - x_samples = torch.tensor(x_samples, device=device) - y_samples = torch.tensor(y_samples, device=device) + # Check the device and convert to the respective type (Numpy or Torch) and call their respective _pqm_test function + + if device.type == 'cpu': + # Check if x_samples and y_samples are not already NumPy arrays + if not isinstance(x_samples, np.ndarray): + x_samples = x_samples.cpu().numpy() + if not isinstance(y_samples, np.ndarray): + y_samples = y_samples.cpu().numpy() + elif device.type == 'cuda': + # Check if x_samples and y_samples are not already torch tensors + if not torch.is_tensor(x_samples): + x_samples = torch.tensor(x_samples, device=device) + if not torch.is_tensor(y_samples): + y_samples = torch.tensor(y_samples, device=device) if re_tessellation is not None: return [ @@ -376,14 +578,14 @@ def pqm_pvalue( ) for _ in range(re_tessellation) ] - chi2_stat, p_value, dof, _ = _pqm_test( + + _, p_value, _, _ = _pqm_test( x_samples, y_samples, num_refs, z_score_norm, x_frac, gauss_frac, device ) # Return p-value as a float return p_value if isinstance(p_value, float) else float(p_value) - def pqm_chi2( x_samples, y_samples, @@ -401,10 +603,10 @@ def pqm_chi2( Parameters ---------- - x_samples : np.ndarray + x_samples : np.ndarray or torch.Tensor Samples from the first distribution. Must have shape (N, *D) N is the number of x samples, and D is the dimensionality of the samples. - y_samples : np.ndarray + y_samples : np.ndarray or torch.Tensor Samples from the second distribution. Must have shape (M, *D) M is the number of y samples, and D is the dimensionality of the samples. num_refs : int @@ -412,7 +614,7 @@ def pqm_chi2( x_samples, y_samples, and/or a Gaussian distribution, see the note below. re_tessellation : Optional[int] - Number of times pqm_chi2 is called, re-tesselating the space. No + Number of times _pqm_test is called, re-tesselating the space. 
No re_tessellation if None (default). z_score_norm : bool If True, z_score_norm the samples by subtracting the mean and dividing by the @@ -430,7 +632,7 @@ def pqm_chi2( support of the reference samples if pathological behavior is expected. Default: 0.0 no gaussian samples. device : str - Device to use for computation. Default: 'cpu'. If 'cuda' is selected, + Device to use for computation. Default: 'cpu'. Note ---- @@ -461,9 +663,20 @@ def pqm_chi2( float or list chi2 statistic(s). """ - # Move samples to torch tensors on the selected device - x_samples = torch.tensor(x_samples, device=device) - y_samples = torch.tensor(y_samples, device=device) + + # Check the device and convert to the respective type (Numpy or Torch) and call their respective _pqm_test function + if device.type == 'cpu': + # Check if x_samples and y_samples are not already NumPy arrays + if not isinstance(x_samples, np.ndarray): + x_samples = x_samples.cpu().numpy() + if not isinstance(y_samples, np.ndarray): + y_samples = y_samples.cpu().numpy() + elif device.type == 'cuda': + # Check if x_samples and y_samples are not already torch tensors + if not torch.is_tensor(x_samples): + x_samples = torch.tensor(x_samples, device=device) + if not torch.is_tensor(y_samples): + y_samples = torch.tensor(y_samples, device=device) if re_tessellation is not None: return [ @@ -479,6 +692,7 @@ def pqm_chi2( for _ in range(re_tessellation) ] + chi2_stat, _, dof, _ = _pqm_test( x_samples, y_samples, num_refs, z_score_norm, x_frac, gauss_frac, device ) From 2642cc63f654e52a0c1c32e4b9ee2b2752cb46d5 Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Sun, 3 Nov 2024 19:30:45 -0500 Subject: [PATCH 4/7] Updated test_guassian --- README.md | 16 +++++++++++----- src/pqm/test_gaussian.py | 4 ++-- tests/test_gaussian.py | 2 +- 3 files changed, 14 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index 06c7503..44b7427 100644 --- a/README.md +++ b/README.md @@ -6,15 +6,13 @@ ![PyPI - 
Downloads](https://img.shields.io/pypi/dm/pqm) [![arXiv](https://img.shields.io/badge/arXiv-2402.04355-b31b1b.svg)](https://arxiv.org/abs/2402.04355) - - -[PQMass](https://arxiv.org/abs/2402.04355) is a new sample-based method for evaluating the quality of generative models as well assessing distribution shifts. +[PQMass](https://arxiv.org/abs/2402.04355) is a new sample-based method for evaluating the quality of generative models as well as assessing distribution shifts to determine if two datasets come from the same underlying distribution. ## Install To install PQMass, run the following: -```python +```bash pip install pqm ``` @@ -50,7 +48,7 @@ PQMass can work for any two datasets as it measures the distribution shift betwe ## Example -We are using 100 regions. Thus, the DoF is 99, our expected $\chi^2$ peak of the distribution is 97, the median is 99, and the standard deviation should be 14.07. With this in mind, we set up our example. For the p-value, we expect to be between 0 and 1 and a significantly small p-value (e.g., $< 0.05$ or $< 0.01$) would mean we reject the null hypothesis and thus $x$ and $y$ do not come from the same distribution. +We are using 100 regions. Thus, the DoF is 99, our expected $\chi^2$ peak of the distribution is 97, the mean is 99, and the standard deviation should be 14.07. With this in mind, we set up our example. For the p-value, we expect to be between 0 and 1 and a significantly small p-value (e.g., $< 0.05$ or $< 0.01$) would mean we reject the null hypothesis and thus $x$ and $y$ do not come from the same distribution. Our expected p-value should be around 0.5 to pass the null hypothesis test; any significant deviation away from this would indicate failure of the null hypothesis test. @@ -96,6 +94,14 @@ Here it is clear that both $\chi_{PQM}^2$ and $\text{p-value}(\chi_{PQM}^2)$ are Thus, PQMass can be used to identify if any two distributions come from the same underlying distributions if enough samples are given. 
We encourage users to look through the paper to see the varying experiments and use cases for PQMass! +## How to Interpret Results + +We have shown what to expect for PQMass when working with $\chi_{PQM}^2$ or $\text{p-value}(\chi_{PQM}^2)$; however, when working with $\chi_{PQM}^2$, there is the case in which it will return 0's. There are a couple of reasons why this could happen: + +- For generative models, 0's indicate memorization. Samples are duplicates of the data the model has been trained on. +- For the non-generative model scenario, it is typically due to a lack of samples, especially in high dimensions. Increasing samples should alleviate the issue. +- Another scenario in which one could get 0's in a non-generative model case is that it can also be an indicator of duplicate samples in $x$ and $y$. + ## Advanced Usage Depending on the data you are working with we show other uses of the parameters for PQMass. diff --git a/src/pqm/test_gaussian.py b/src/pqm/test_gaussian.py index f99cd6d..db9f7de 100644 --- a/src/pqm/test_gaussian.py +++ b/src/pqm/test_gaussian.py @@ -8,6 +8,6 @@ def test(): y_samples = np.random.normal(size=(500, 50)) x_samples = np.random.normal(size=(250, 50)) - new.append(pqm_pvalue(x_samples, y_samples, device = 'cpu')) + new.append(pqm_pvalue(x_samples, y_samples)) - assert np.abs(np.mean(new) - 0.5) < 0.15 + assert np.abs(np.mean(new) - 0.5) < 0.15 \ No newline at end of file diff --git a/tests/test_gaussian.py b/tests/test_gaussian.py index 0006746..170900f 100644 --- a/tests/test_gaussian.py +++ b/tests/test_gaussian.py @@ -68,4 +68,4 @@ def test_fracs(x_frac, gauss_frac): x_samples[:, 0] += 5 # one dim off by 5sigma pval = pqm_pvalue(x_samples, y_samples, x_frac=x_frac, gauss_frac=gauss_frac) - assert pval < 1e-3 + assert pval < 1e-3 \ No newline at end of file From cd8344d41b0ce692384b4009968d2f608a55607e Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Sun, 3 Nov 2024 19:35:51 -0500 Subject: [PATCH 5/7] Updated the default device from 'cpu' to 
torch.device('cpu') --- src/pqm/pqm.py | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/src/pqm/pqm.py b/src/pqm/pqm.py index 8b40432..3c02a3c 100644 --- a/src/pqm/pqm.py +++ b/src/pqm/pqm.py @@ -348,7 +348,7 @@ def _pqm_test( z_score_norm: bool, x_frac: Optional[float], gauss_frac: float, - device: str = 'cpu', + device: str = torch.device("cpu"), ) -> Tuple: """ Helper function to perform the PQM test and return the results from @@ -490,7 +490,7 @@ def pqm_pvalue( z_score_norm: bool = False, x_frac: Optional[float] = None, gauss_frac: float = 0.0, - device: str = 'cpu', + device: str = torch.device("cpu"), ): """ Perform the PQM test of the null hypothesis that `x_samples` and `y_samples` @@ -594,7 +594,7 @@ def pqm_chi2( z_score_norm: bool = False, x_frac: Optional[float] = None, gauss_frac: float = 0.0, - device: str = 'cpu', + device: str = torch.device("cpu"), ): """ Perform the PQM test of the null hypothesis that `x_samples` and `y_samples` From 34879274f487b0a2099bdf77f332f0ca41295fc4 Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Sun, 3 Nov 2024 19:46:45 -0500 Subject: [PATCH 6/7] Updated _mean_std function and passed all test locally --- src/pqm/pqm.py | 56 +++++++++++++++++++++++++++++++++++--------------- 1 file changed, 40 insertions(+), 16 deletions(-) diff --git a/src/pqm/pqm.py b/src/pqm/pqm.py index 3c02a3c..224a3c8 100644 --- a/src/pqm/pqm.py +++ b/src/pqm/pqm.py @@ -11,23 +11,47 @@ def _mean_std(sample1, sample2, dim=0): """Get the mean and std of two combined samples without actually combining them.""" - n1 = sample1.shape[dim] - n2 = sample2.shape[dim] - # Get mean/std of combined sample - mx = torch.mean(sample1, dim=dim) - sx = torch.std(sample1, dim=dim, unbiased=True) - my = torch.mean(sample2, dim=dim) - sy = torch.std(sample2, dim=dim, unbiased=True) - m = (n1 * mx + n2 * my) / (n1 + n2) - s = torch.sqrt( - ( - (n1 - 1) * (sx ** 2) - + (n2 - 1) * (sy ** 2) - + n1 * n2 * (mx - my) ** 2 / (n1 + n2) + # Check if 
both samples are PyTorch tensors + if isinstance(sample1, torch.Tensor) and isinstance(sample2, torch.Tensor): + n1 = sample1.shape[dim] + n2 = sample2.shape[dim] + + mx = torch.mean(sample1, dim=dim) + sx = torch.std(sample1, dim=dim, unbiased=True) + my = torch.mean(sample2, dim=dim) + sy = torch.std(sample2, dim=dim, unbiased=True) + + m = (n1 * mx + n2 * my) / (n1 + n2) + s = torch.sqrt( + ( + (n1 - 1) * (sx ** 2) + + (n2 - 1) * (sy ** 2) + + n1 * n2 * (mx - my) ** 2 / (n1 + n2) + ) + / (n1 + n2 - 1) ) - / (n1 + n2 - 1) - ) - return m, s + return m, s + + # Check if both samples are NumPy arrays + elif isinstance(sample1, np.ndarray) and isinstance(sample2, np.ndarray): + n1 = sample1.shape[dim] + n2 = sample2.shape[dim] + + mx = np.mean(sample1, axis=dim) + sx = np.std(sample1, axis=dim, ddof=1) + my = np.mean(sample2, axis=dim) + sy = np.std(sample2, axis=dim, ddof=1) + + m = (n1 * mx + n2 * my) / (n1 + n2) + s = np.sqrt( + ( + (n1 - 1) * (sx ** 2) + + (n2 - 1) * (sy ** 2) + + n1 * n2 * (mx - my) ** 2 / (n1 + n2) + ) + / (n1 + n2 - 1) + ) + return m, s def rescale_chi2(chi2_stat, orig_dof, target_dof, device): """ From 34e2ae64bfabe2e9ff71fa75ad69cdeb26f44c38 Mon Sep 17 00:00:00 2001 From: Sammy Sharief Date: Mon, 4 Nov 2024 14:39:50 -0500 Subject: [PATCH 7/7] Updated repo post comments --- README.md | 3 +-- src/pqm/pqm.py | 30 ++++++++++++++---------------- 2 files changed, 15 insertions(+), 18 deletions(-) diff --git a/README.md b/README.md index 44b7427..3e26801 100644 --- a/README.md +++ b/README.md @@ -34,8 +34,7 @@ peak of this distribution will be at `DoF - 2`, the mean will equal `DoF`, and the standard deviation will be `sqrt(2 * DoF)`. If your $\chi_{PQM}^2$ values are too high (`chi^2 / DoF > 1`), it suggests that the samples are out of distribution. Conversely, if the values are too low (`chi^2 / DoF < 1`), it indicates -potential duplication of samples between `x` and `y` (i.e. -memorization for generative models). 
+potential duplication of samples between `x` and `y`. If your two samples are drawn from the same distribution, then the $\text{p-value}(\chi_{PQM}^2)$ should be drawn from the random $\mathcal{U}(0,1)$ distribution. This means that if diff --git a/src/pqm/pqm.py b/src/pqm/pqm.py index 224a3c8..5d441a7 100644 --- a/src/pqm/pqm.py +++ b/src/pqm/pqm.py @@ -57,7 +57,7 @@ def rescale_chi2(chi2_stat, orig_dof, target_dof, device): """ Rescale chi2 statistic using appropriate methods depending on the device. """ - if device.type == 'cuda': + if device.type == 'cuda' or device == 'cuda': # Move tensors to CPU and convert to NumPy chi2_stat_cpu = chi2_stat.cpu().item() # Convert to float orig_dof_cpu = orig_dof.cpu().item() # Convert to float @@ -329,11 +329,12 @@ def _compute_distances_torch(x_samples, y_samples, refs, current_num_refs, num_r # Compute distances and find nearest references distances_x = torch.cdist(x_samples, refs) - idx_x = distances_x.argmin(dim=1) - counts_x = torch.bincount(idx_x, minlength=current_num_refs) - distances_y = torch.cdist(y_samples, refs) + + idx_x = distances_x.argmin(dim=1) idx_y = distances_y.argmin(dim=1) + + counts_x = torch.bincount(idx_x, minlength=current_num_refs) counts_y = torch.bincount(idx_y, minlength=current_num_refs) # Remove references with no counts @@ -391,7 +392,7 @@ def _pqm_test( x_samples, y_samples, and/or a Gaussian distribution, see the note below. re_tessellation : Optional[int] - Number of times pqm_pvalue is called, re-tesselating the space. No + Number of times _pqm_test is called, re-tesselating the space. No re_tessellation if None (default). 
z_score_norm : bool If True, z_score_norm the samples by subtracting the mean and dividing by the @@ -453,12 +454,9 @@ def _pqm_test( # Z-score normalization if z_score_norm: mean, std = _mean_std(x_samples, y_samples) - if is_numpy: - x_samples = (x_samples - mean) / std - y_samples = (y_samples - mean) / std - elif is_torch: - x_samples = (x_samples - mean) / std - y_samples = (y_samples - mean) / std + + x_samples = (x_samples - mean) / std + y_samples = (y_samples - mean) / std # Determine fraction of x_samples to use as reference samples if x_frac is None: @@ -497,7 +495,7 @@ def _pqm_test( refs = _sample_reference_indices_torch(Nx, nx, Ny, ny, Ng, x_samples, y_samples, device) # Update num_refs in case Gaussian samples were added - current_num_refs = refs.shape[0] + current_num_refs = Nx + Ny + Ng # Compute nearest references and counts if is_numpy: @@ -576,13 +574,13 @@ def pqm_pvalue( """ # Check the device and convert to the respective type (Numpy or Torch) and call their respective _pqm_test function - if device.type == 'cpu': + if device.type == 'cpu' or device == 'cpu': # Check if x_samples and y_samples are not already NumPy arrays if not isinstance(x_samples, np.ndarray): x_samples = x_samples.cpu().numpy() if not isinstance(y_samples, np.ndarray): y_samples = y_samples.cpu().numpy() - elif device.type == 'cuda': + elif device.type == 'cuda' or device == 'cuda': # Check if x_samples and y_samples are not already torch tensors if not torch.is_tensor(x_samples): x_samples = torch.tensor(x_samples, device=device) @@ -689,13 +687,13 @@ def pqm_chi2( """ # Check the device and convert to the respective type (Numpy or Torch) and call their respective _pqm_test function - if device.type == 'cpu': + if device.type == 'cpu' or device == 'cpu': # Check if x_samples and y_samples are not already NumPy arrays if not isinstance(x_samples, np.ndarray): x_samples = x_samples.cpu().numpy() if not isinstance(y_samples, np.ndarray): y_samples = y_samples.cpu().numpy() 
- elif device.type == 'cuda': + elif device.type == 'cuda' or device == 'cuda': # Check if x_samples and y_samples are not already torch tensors if not torch.is_tensor(x_samples): x_samples = torch.tensor(x_samples, device=device)