Testing Complete Spatial Randomness

Testing CSR

Monte Carlo Tests

Let $T$ be any test statistic where larger $T$ cast doubt on the null hypothesis.
Let $t_1$ be the value of $T$ calculated from dataset.
For convenience, assume that the null sampling distribution of $T$ is continuous.

Let $t_2,\ldots,t_s$ be the values of $T$ calculated from $s-1$ independent simulations of $H_0$. Then under $H_0$, the values $t_1,\ldots,t_s$ are exchangeable, i.e.

\[P(t_i=t_{(j)})=\frac{1}{s}, j=1,\ldots,s\]

Hence, if $R$ denotes the number of $t_i>t_1$ then $P(R\leq r) = \frac{r+1}{s}.$ Which means that the p-value of Monte Carlo test is $(r+1)/s$.

Inter-event distance based Test

Let $T$ be the distance between two events independently and uniformly distributed in $A$.
For a unit square of $A$
\[\begin{aligned} H(t) &= P(T\leq t) \\ &= \begin{cases} \pi t^2 - {8\over 3}t^3+{1\over 2}t^4,\quad 0\leq t\leq 1 \\ {1\over3}-2t^2-{1\over2}t^4+{4\over3}(t^2-1)^{1\over2}(2t^2+1)+2t^2\sin^{-1}(2t^{-2}-1),\quad 1\leq t\leq \sqrt{2} \end{cases} \end{aligned}\]
For a circle of unit radius $A$,
\[\begin{aligned} H(t) = 1+\pi^{-1}[2(t^2-1)\cos^{-1}({t\over2})-t(1+{t^2\over2})\sqrt{1-{t^2\over4}}],\quad 0\leq t\leq2 \end{aligned}\]
Consider empirical distribution function(EDF) of inter-event distances as:
\[\hat{H}_1(t)={2\over n(n-1)}\sum_{i<j}I(t_{ij}\leq t)\]
where $t_{ij}$ are observed inter-event distances from data.
Monte Carlo-based approach is used for this test.
Generating Monte-Carlo Samples

Generate $s-1$ times of $n$ events in $A$ under CSR assumption

Calculate $\hat{H}_i(t), i=2,\ldots,s$

Calculate envelopes:

\[\begin{aligned} U(t) &=\max_{2\leq i\leq s}\lbrace \hat{H}_i(t)\rbrace \\ L(t) &=\min_{2\leq i\leq s}\lbrace \hat{H}_i(t)\rbrace \end{aligned}\]

Two common MC test approaches
1. Choose appropriate $t_0$ and define $u_i=\hat{H}_i(t_0)$ under CSR. Note that under $H_0$ at MC test,
  \[P(u_1=u_{(j)})=1/s\]
  If $u_1$ ranks $k$th largest or higher than $k$th, the test that rejects CSR based on that gives an exact one-sided test of size $k/s$.
  
  example : $k=5, s=100, u_1\geq u_{(5)}$ then size = 0.05
2. Define
  \[u_i = \int(\hat{H}_i(t)-H(t))^2 dt.\tag{*}\]
  Then proceed to a test based on the rank of $u_1$.
Note that the approach 2 is more objective but known to have weak power.

Nearest neighbor distance based Test

Let $Y$ be the nearest neighbor distance under CSR when there are $n$ events in a region $A$
Theorical distribution of $Y$ is quite difficult, instead use an approximation.
Note that an event being within distance $y$ from known(specified) event is
\[\frac{\pi y^2}{|A|}\]
Then, the CDF can be approximated by as follows:
\[\begin{aligned} G(y)=P(Y\leq y)&\approx 1-(1-\pi y^2|A|^{-1})^{n-1} \\ &\approx 1-\exp(-\lambda\pi y^2),\quad y\geq 0 \end{aligned}\]
where (2) is a further approximation with large $n$ with $\lambda=n\vert A\vert^{-1}$.
Empirical CDF is gien as:
\[\hat{G}_1(y) = \frac{1}{n}\sum_i I(y_i\leq y)\]
Let $\bar{G}_i(y)=\frac{1}{s-1}\sum_{j\neq i}\hat{G}_j(y)$ then the MC test is given as same as (*) where
\[u_i = \int(\hat{G} _i(y)-\bar{G}_i(y))^2 dt.\]

Testing CSR

Monte Carlo Tests

Inter-event distance based Test

Nearest neighbor distance based Test

Leave a comment