Statistics
Testing Complete Spatial Randomness
Testing CSR Monte Carlo Tests - Let $T$ be any test statistic where larger $T$ cast doubt on the null hypothesis. - Let $t1$ be the value of $T$ calculated from dataset. - For convenience, assume that the null samplin...
Testing CSR
Monte Carlo Tests
-
Let be any test statistic where larger cast doubt on the null hypothesis.
-
Let be the value of calculated from dataset.
-
For convenience, assume that the null sampling distribution of is continuous.
Let be the values of calculated from independent simulations of . Then under , the values are exchangeable, i.e.
Hence, if denotes the number of then Which means that the p-value of Monte Carlo test is .
Inter-event distance based Test
-
Let be the distance between two events independently and uniformly distributed in .
-
For a unit square of
H(t) &= P(T\leq t) \\ &= \begin{cases} \pi t^2 - {8\over 3}t^3+{1\over 2}t^4,\quad 0\leq t\leq 1 \\ {1\over3}-2t^2-{1\over2}t^4+{4\over3}(t^2-1)^{1\over2}(2t^2+1)+2t^2\sin^{-1}(2t^{-2}-1),\quad 1\leq t\leq \sqrt{2} \end{cases} \end{aligned}$$ -
For a circle of unit radius ,
H(t) = 1+\pi^{-1}[2(t^2-1)\cos^{-1}({t\over2})-t(1+{t^2\over2})\sqrt{1-{t^2\over4}}],\quad 0\leq t\leq2 \end{aligned}$$ -
Consider empirical distribution function(EDF) of inter-event distances as:
where are observed inter-event distances from data.
-
Monte Carlo-based approach is used for this test.
-
Generating Monte-Carlo Samples
Generate times of events in under CSR assumption
Calculate
Calculate envelopes:
-
Two common MC test approaches
-
Choose appropriate and define under CSR. Note that under at MC test,
If ranks th largest or higher than th, the test that rejects CSR based on that gives an exact one-sided test of size .
example : then size = 0.05
-
Define
u_i = \int(\hat{H}_i(t)-H(t))^2 dt.\tag{*}
Then proceed to a test based on the rank of .
-
-
Note that the approach 2 is more objective but known to have weak power.
Nearest neighbor distance based Test
-
Let be the nearest neighbor distance under CSR when there are events in a region
-
Theorical distribution of is quite difficult, instead use an approximation.
-
Note that an event being within distance from known(specified) event is
Then, the CDF can be approximated by as follows:
G(y)=P(Y\leq y)&\approx 1-(1-\pi y^2|A|^{-1})^{n-1} \\ &\approx 1-\exp(-\lambda\pi y^2),\quad y\geq 0 \end{aligned}$$ where (2) is a further approximation with large $n$ with $\lambda=n\vert A\vert^{-1}$. -
Empirical CDF is gien as:
-
Let then the MC test is given as same as (*) where