
# A remark on the convergence of Betti numbers in the thermodynamic regime

## Abstract

The convergence of the expectations of Betti numbers of Čech complexes built on binomial point processes in the thermodynamic regime is established.

## Terminologies and main results

### Definition 1.1

Let $$\mathfrak {X} = \{x_{1}, x_{2}, \dots, x_{n}\}$$ be a collection of points in $$\mathbb {R}^{d}$$. The Čech complex $$\mathcal {C}(\mathfrak {X},r)$$, for r>0, is constructed as follows.

• The 0-simplices (vertices) are the points in $$\mathfrak {X}$$.

• A k-simplex $$\left [x_{i_{0}}, \dots, x_{i_{k}}\right ]$$ is in $$\mathcal {C}(\mathfrak {X}, r)$$ if $$\bigcap _{j = 0}^{k} B_{r} (x_{i_{j}}) \ne \emptyset$$.

Here $$B_{r}(x) = \left \{y \in \mathbb {R}^{d} : \|y - x\| \le r\right \}$$ denotes the closed ball of radius r and center x, and $$\|x\|$$ is the Euclidean norm of x. The Čech complex can also be constructed from an infinite collection of points.
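To make Definition 1.1 concrete, the following is a minimal Python sketch, not taken from the source, that lists the simplices of dimension at most 2 of the Čech complex of a planar point set. It relies on the standard fact that closed balls of radius r around given points have a common point exactly when the smallest ball enclosing those points has radius at most r. The function names `min_enclosing_radius` and `cech_complex` are illustrative, and the restriction to the plane and to dimension at most 2 is an assumption made only to keep the sketch short.

```python
import itertools
import math


def min_enclosing_radius(points):
    """Radius of the smallest closed ball containing 1, 2 or 3 planar points."""
    if len(points) == 1:
        return 0.0
    if len(points) == 2:
        return math.dist(points[0], points[1]) / 2
    a, b, c = points
    # Side lengths in increasing order; `longest` is the largest one.
    s1, s2, longest = sorted([math.dist(a, b), math.dist(b, c), math.dist(a, c)])
    # Right, obtuse or degenerate triangle: the ball spanned by the longest side suffices.
    if s1 ** 2 + s2 ** 2 <= longest ** 2:
        return longest / 2
    # Acute triangle: the smallest enclosing ball is the circumscribed one.
    area = abs((b[0] - a[0]) * (c[1] - a[1]) - (c[0] - a[0]) * (b[1] - a[1])) / 2
    return s1 * s2 * longest / (4 * area)


def cech_complex(points, r, max_dim=2):
    """Simplices (as index tuples) of C(points, r) of dimension <= min(max_dim, 2)."""
    simplices = []
    for k in range(min(max_dim, 2) + 1):
        for idx in itertools.combinations(range(len(points)), k + 1):
            # The balls B_r(x_i) intersect iff the points fit in a ball of radius r.
            if min_enclosing_radius([points[i] for i in idx]) <= r:
                simplices.append(idx)
    return simplices
```

For the vertices of an equilateral triangle with side length 1, the three edges are present for r slightly above 1/2, while the 2-simplex appears only once r exceeds the circumradius $$1/\sqrt{3} \approx 0.577$$.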

Let $$X_{1}, X_{2}, \dots$$ be a sequence of i.i.d. (independent and identically distributed) $$\mathbb {R}^{d}$$-valued random variables with common probability density function f(x). Define the induced binomial point processes as $$\mathfrak {X}_{n} = \left \{X_{1}, \dots, X_{n}\right \}$$. The object of study is the Čech complex $$\mathcal {C}\left (\mathfrak {X}_{n}, r_{n}\right)$$ built on $$\mathfrak {X}_{n}$$, where the radius $$r_{n}$$ also varies with n. Denote by $$\beta _{k}(\mathcal {K})$$ the kth Betti number, that is, the rank of the kth homology group, of a simplicial complex $$\mathcal {K}$$. The limiting behaviour of the Betti numbers $$\beta _{k}\left (\mathcal {C}\left (\mathfrak {X}_{n}, r_{n}\right)\right)$$ in various regimes has been studied recently by many authors. See  for a brief survey. The aim of this paper is to refine a limit theorem in the thermodynamic regime, that is, the regime in which $$n^{1/d} r_{n} \to r \in (0, \infty)$$.

In the thermodynamic regime, the expectations of the kth Betti numbers, for $$1 \le k \le d - 1$$, grow linearly in n, that is, $$c_{1} n \le \mathbb {E} \left [\beta _{k}\left (\mathcal {C}\left (\mathfrak {X}_{n}, r_{n}\right)\right)\right ] \le c_{2} n$$ as $$n \to \infty$$. After centering, the strong law of large numbers holds,

\begin{aligned} \frac{1}{n} \left(\beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right) \right.& \left. - \mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right)\right] \right)\\ & \to 0\, \text{almost surely as}\ n \to \infty, \end{aligned}
(1)

provided that the density function f has compact, convex support and is bounded away from zero and bounded above on its support [9, Theorem 4.6]. A remaining problem is to describe the exact limiting behaviour of the expected values of the Betti numbers. This paper gives a solution to that problem. Note that the 0th Betti number, which counts connected components in a random geometric graph, was completely described in [5, Chapter 13]. Note also that the kth Betti number of the Čech complex built on a finite set of points in $$\mathbb {R}^{d}$$ vanishes if $$k \ge d$$. These facts explain why we only need to consider the case $$1 \le k \le d - 1$$.

Betti numbers are tightly related to the number of j-simplices in $$\mathcal {C}(\mathfrak {X}, r)$$, denoted by $$S_{j}(\mathcal {C}(\mathfrak {X},r))$$ or simply by $$S_{j}(\mathfrak {X}, r)$$, which can be expressed as

$$S_{j}(\mathfrak{X}, r) = \frac{1}{j + 1} \sum_{x \in \mathfrak{X}} \xi(x, \mathfrak{X}),$$

where $$\xi (x, \mathfrak {X})$$ is the number of j-simplices containing x. Note that $$\xi (x, \mathfrak {X})$$ is a local function in the sense that it depends only on points near x. Then in the thermodynamic regime, the weak and strong laws of large numbers for $$S_{j}(\mathcal {C}(\mathfrak {X}_{n}, r_{n}))$$ hold as a consequence of general results in [6, 7],

$$\frac{S_{j}\left(\mathfrak{X}_{n}, r_{n}\right)}{n} \to \hat S_{j}(r)\, \text{almost surely as}\ n \to \infty.$$
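The vertex-based expression for $$S_{j}(\mathfrak{X}, r)$$ reflects the fact that each j-simplex is counted once by each of its j+1 vertices. A minimal Python sketch of this double counting, written for an arbitrary list of simplices rather than for a Čech complex specifically (the function name and the input format are illustrative assumptions, not from the source):

```python
from collections import Counter


def simplex_count_via_vertices(simplices, j):
    """Return (S_j, sum over vertices x of xi(x)) for simplices given as vertex tuples."""
    j_simplices = [s for s in simplices if len(s) == j + 1]
    # xi[x] is the number of j-simplices that contain the vertex x.
    xi = Counter(v for s in j_simplices for v in s)
    return len(j_simplices), sum(xi.values())


# A filled triangle on vertices 0, 1, 2 with an extra edge from 2 to 3.
example = [(0,), (1,), (2,), (3,), (0, 1), (0, 2), (1, 2), (2, 3), (0, 1, 2)]
for j in range(3):
    s_j, total = simplex_count_via_vertices(example, j)
    # The identity S_j = total / (j + 1) holds for every j.
    assert total == (j + 1) * s_j
```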

The limit $$\hat S_{j}(r)$$ can be explicitly calculated. However, Betti numbers do not admit an expression of the above form, and hence those general results cannot be applied.

To establish a limit theorem for Betti numbers, we exploit the following two properties. The first one is the nearly additive property of Betti numbers that was used in  to study Betti numbers of Čech complexes built on stationary point processes. The second one is the property that binomial point processes behave locally like a homogeneous Poisson point process. The latter property is also a key tool to establish the law of large numbers for local geometric functionals [6, 7].

Now let us go into more detail in order to state the main result of the paper. We begin with the definition of a homogeneous Poisson point process. Let $$\mathbf {N}$$ be the set of all counting measures on $$\mathbb {R}^{d}$$ which are finite on any bounded Borel set and for which the measure of a point is at most 1. Define $$\mathcal {N}$$ as the σ-algebra generated by sets of the form

$$\left\{\mu \in \mathbf{N} : \mu(A) = k\right\},$$

where A is a bounded Borel set and k is an integer. Then a point process Φ is a measurable mapping from some probability space into $$(\mathbf {N}, \mathcal {N})$$. For a Borel set A, let Φ(A) denote the number of points in A. By definition of the σ-algebra $$\mathcal {N}$$, Φ(A) is a random variable. For some basic properties of point processes, see , for example. A homogeneous Poisson point process is defined as follows.

### Definition 1.2

The point process $$\mathcal {P}$$ is said to be a Poisson point process with density λ>0 if

• for disjoint Borel sets $$A_{1}, \dots, A_{k}$$, the random variables $$\mathcal {P}(A_{1}), \dots, \mathcal {P}(A_{k})$$ are independent;

• for any bounded Borel set A, the number of points in A has Poisson distribution with parameter $$\lambda |A|$$, $$\mathcal {P}(A) \sim \text {Pois}~(\lambda |A|)$$, that is,

$$\mathbb{P}(\mathcal{P}(A) = k) = e^{- \lambda |A|} \frac{\lambda^{k} |A|^{k}}{k!}, \quad k = 0,1, \dots,$$

where |A| denotes the Lebesgue measure of A.
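On a bounded window, a process satisfying Definition 1.2 can be simulated by the standard two-step construction: draw the number of points from Pois(λ|A|), then place that many independent uniform points in A. The sketch below does this for a cube centered at the origin, using only the Python standard library. The helper names `poisson_variate` and `sample_poisson_process` are hypothetical, and the Poisson variate is generated by counting unit-rate exponential arrivals, which is one of several possible methods.

```python
import random


def poisson_variate(mean, rng):
    """Sample Pois(mean) by counting unit-rate exponential arrivals in [0, mean]."""
    count, total = 0, rng.expovariate(1.0)
    while total <= mean:
        count += 1
        total += rng.expovariate(1.0)
    return count


def sample_poisson_process(lam, side, d, rng):
    """Points of a homogeneous Poisson process with density lam on a cube of the given side."""
    # The number of points in the window is Pois(lam * |A|) with |A| = side ** d.
    n_points = poisson_variate(lam * side ** d, rng)
    # Given the count, the points are independent and uniform on the window.
    return [tuple(rng.uniform(-side / 2, side / 2) for _ in range(d))
            for _ in range(n_points)]


rng = random.Random(0)
points = sample_poisson_process(lam=2.0, side=3.0, d=2, rng=rng)
```

With density 0 the sampler returns no points, in line with the convention used later for $$\mathcal{P}(0)$$.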

For homogeneous Poisson point processes, the following law of large numbers for Betti numbers was established in [9]. Let $$\mathcal {P}(\lambda)$$ be a homogeneous Poisson point process on $$\mathbb {R}^{d}$$ with density λ>0. Denote by $$\mathcal {P}_{A}(\lambda)$$ the restriction of $$\mathcal {P}(\lambda)$$ to a Borel set A. For simplicity, we write $$\mathcal {P}_{L}(\lambda)$$ instead of $$\mathcal {P}_{W_{L}}(\lambda)$$ when $$W_{L}$$ is a window of the form $$W_{L} = [-\frac {L^{1/d}}2, \frac {L^{1/d}}2)^{d}$$. Then for $$1 \le k \le d - 1$$, there is a constant $$\hat \beta _{k} (\lambda, r)$$ such that [9, Theorem 3.5],

$$\frac{\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{L}(\lambda), r\right)\right)} {L} \to \hat \beta_{k}(\lambda, r)\, \text{almost surely as}\ L \to \infty.$$

The Poisson point process $$\mathcal {P}(0)$$ is understood as a point process with no points. Thus we set $$\hat \beta _{k}(0, r) = 0$$ for all r>0. Now we can state our main result.

### Theorem 1.3

Assume that the common probability density function f(x) has compact support, is bounded and Riemann integrable. Then as $$n \to \infty$$ with $$n^{1/d} r_{n} \to r \in (0, \infty)$$,

$$\frac{\mathbb{E}\left[\beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right)\right]}{n} \to \int_{\mathbb{R}^{d}} \hat\beta_{k}\left(f(x), r\right) dx.$$

Consequently, together with (1), we have the following law of large numbers.

### Corollary 1.4

Assume that the support of f is compact and convex and that

$$0 < \inf_{x \in \mathrm{s}\mathrm{u}\mathrm{p}\mathrm{p}(f)} f(x) \le \sup_{x \in \mathrm{s}\mathrm{u}\mathrm{p}\mathrm{p}(f)} f(x) < \infty.$$

Assume further that f is Riemann integrable. Then for $$1 \le k \le d - 1$$,

$$\frac{\beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right)}{n} \to \int_{\mathbb{R}^{d}} \hat \beta_{k}(f(x), r) dx \quad \text{almost surely as}\ n \to \infty.$$
(2)
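As an illustration of the limit in Theorem 1.3, suppose (an assumption made only for this example) that f is the uniform density on a compact convex set S with $$0 < |S| < \infty$$, that is, f = 1/|S| on S and f = 0 elsewhere. Since $$\hat \beta_{k}(0, r) = 0$$, the integral reduces to

$$\int_{\mathbb{R}^{d}} \hat \beta_{k}(f(x), r) dx = |S|\, \hat \beta_{k}\left(\frac{1}{|S|}, r\right) = \hat \beta_{k}\left(1, |S|^{-1/d} r\right),$$

where the last equality uses the identity $$\hat \beta_{k}(\lambda, r) = \lambda \hat \beta_{k}\left(1, \lambda^{1/d} r\right)$$ from Lemma 3.2. In particular, for the uniform density on the unit cube the limit is $$\hat \beta_{k}(1, r)$$.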

It is noted that the method here can be applied to show the convergence of persistence diagrams of Čech complexes built on binomial point processes. The convergence of Betti numbers and persistence diagrams related to i.i.d. sampling were observed in  by numerical simulation. Here we give a rigorous mathematical proof of the convergences.

For the proof, we need a Poissonized version of the binomial processes. Let $$N_{n}$$ be a random variable which is independent of $$\{X_{n}\}_{n \ge 1}$$ and has Poisson distribution with parameter n. Let

$$\bar{\mathcal{P}}_{n} = \left\{X_{1}, X_{2}, \dots, X_{N_{n}}\right\}.$$

Then $$\bar {\mathcal {P}}_{n}$$ becomes a non-homogeneous Poisson point process with intensity function n f(x). Here a non-homogeneous Poisson point process is defined as follows.

### Definition 1.5

Let f(x)≥0 be a locally integrable function on $$\mathbb {R}^{d}$$. The point process $$\mathcal {P}$$ is said to be a (non-homogeneous) Poisson point process with intensity function f(x) if

• for disjoint Borel sets $$A_{1}, \dots, A_{k}$$, the random variables $$\mathcal {P}\left (A_{1}\right), \dots, \mathcal {P}\left (A_{k}\right)$$ are independent;

• for any bounded Borel set A, $$\mathcal {P}(A)\sim \text {Pois}\left (\int _{A} f(x) dx\right)$$.
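The Poissonized sample $$\bar{\mathcal{P}}_{n} = \{X_{1}, \dots, X_{N_{n}}\}$$ introduced above is straightforward to simulate: draw $$N_{n} \sim \text{Pois}(n)$$ and then that many i.i.d. points with density f. The following is a minimal standard-library sketch. The names `poisson_variate`, `poissonized_sample` and `uniform_on_unit_square` are hypothetical, and the uniform density on the unit square is chosen only as an example of f.

```python
import random


def poisson_variate(mean, rng):
    """Sample Pois(mean) by counting unit-rate exponential arrivals in [0, mean]."""
    count, total = 0, rng.expovariate(1.0)
    while total <= mean:
        count += 1
        total += rng.expovariate(1.0)
    return count


def poissonized_sample(n, sample_point, rng):
    """Return X_1, ..., X_{N_n}, where N_n ~ Pois(n) and the X_i are i.i.d. draws."""
    n_points = poisson_variate(n, rng)
    return [sample_point(rng) for _ in range(n_points)]


def uniform_on_unit_square(rng):
    """Example density (an assumption of this sketch): f = 1 on [0, 1)^2."""
    return (rng.random(), rng.random())


rng = random.Random(1)
sample = poissonized_sample(40, uniform_on_unit_square, rng)
```

For this example density, the number of sampled points in the half-square A = [0, 1/2) × [0, 1) should be Poisson with mean $$n \int_{A} f(x) dx = n/2$$, which can be checked empirically by repeating the draw.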

As proved later, Theorem 1.3 is equivalent to the following result.

### Theorem 1.6

Assume that the common probability density function f(x) has compact support, is bounded and Riemann integrable. Then as $$n \to \infty$$ with $$n^{1/d} r_{n} \to r \in (0, \infty)$$,

$$\frac{\mathbb{E}\left[\beta_{k}\left(\mathcal{C}\left(\bar{\mathcal{P}}_{n}, r_{n}\right)\right)\right]}{n} \to \int_{\mathbb{R}^{d}} \hat\beta_{k}\left(f(x), r\right) dx.$$

All the proofs will be given in Section 3 after discussing some basic properties of Betti numbers in the next section.

## Simplicial complexes and Betti numbers

This section introduces some basic concepts in algebraic topology such as simplicial complexes and Betti numbers. It is mainly taken from the book .

An abstract simplicial complex $$\mathcal {K}$$ on a finite set V is a collection of nonempty subsets of V which is closed under the inclusion relation, that is, if $$\sigma \in \mathcal {K}$$, then $$\tau \in \mathcal {K}$$ for any nonempty subset $$\tau \subset \sigma$$. An element $$\sigma \in \mathcal {K}$$ with $$|\sigma| = k + 1$$ is called a k-simplex or a simplex of dimension k. A 0-simplex (resp. 1-simplex) is usually called a vertex (resp. edge). Čech complexes are examples of geometric complexes which are constructed over points in some metric space with respect to certain conditions.

We assign orientations to simplices in the following way. For a k-simplex $$\sigma = \{v_{0}, \dots, v_{k}\}$$ with k>0, define two orderings of its vertex set to be equivalent if they differ from one another by an even permutation. The orderings of the vertices of σ then fall into two equivalence classes. Each of these classes is called an orientation of σ. We write $$\langle v_{0}, \dots, v_{k} \rangle$$ for an oriented simplex. Let us fix an ordering of the vertex set V. Then the notation $$\langle \sigma \rangle$$ means the oriented simplex which belongs to the equivalence class of the natural ordering. A 0-simplex has only one orientation.

Let $$\mathbf {F}$$ be a field. For each k, let

$$C_{k}(\mathcal{K}) = \left\{\sum \alpha_{i} \left\langle{\sigma_{i}}\right\rangle : \alpha_{i} \in \mathbf{F}, \sigma_{i} \in \mathcal{K}_{k} \right \}$$

be a vector space with the basis $$\left \{\left \langle {\sigma }\right \rangle : \sigma \in \mathcal {K}_{k}\right \}$$, where $$\mathcal {K}_{k}$$ is the set of all k-simplices in $$\mathcal {K}$$. The space $$C_{k}(\mathcal K)$$ is called a chain group.

The dimension of $$\mathcal {K}$$, denoted by $$\dim (\mathcal {K})$$, is defined to be the maximum dimension of simplices in $$\mathcal {K}$$. For $$1\le k \le \dim (\mathcal {K})$$, a boundary operator $$\partial _{k} \colon C_{k}(\mathcal {K}) \to C_{k - 1} (\mathcal {K})$$ is a linear map whose value on an oriented simplex $$\langle v_{0}, \dots, v_{k} \rangle$$ is given by

$$\partial_{k}\left(\left\langle{v_{0}, \dots, v_{k}}\right\rangle \right) = \sum_{i = 0}^{k} (-1)^{i} \left\langle{v_{0}, \dots, \hat{v}_{i}, \dots, v_{k}}\right\rangle,$$

where the symbol $${\hat {~}}$$ over $$v_{i}$$ indicates that the vertex is removed from the sequence. Then we get a chain complex

\begin{aligned} &0 \longrightarrow C_{\dim(\mathcal{K})} \overset{\partial_{\dim(\mathcal{K})}}{\longrightarrow} \cdots \longrightarrow C_{k + 1}(\mathcal{K}) \overset{\partial_{k+1}}{\longrightarrow}\\ & \qquad C_{k}(\mathcal{K}) \overset{\partial_{k}}{\longrightarrow}C_{k-1}(\mathcal{K}) \longrightarrow \cdots \overset{\partial_{1}}\longrightarrow C_{0}(\mathcal{K}) \longrightarrow 0. \end{aligned}

The subspaces $$B_{k}(\mathcal {K}) := \text {Im} \partial _{k + 1}$$ and $$Z_{k} (\mathcal {K}) := \ker \partial _{k}$$ are called the kth boundary group and the kth cycle group, respectively. By definition, it is straightforward to show that $$\partial_{k} \circ \partial_{k + 1} = 0$$. Thus $$B_{k}(\mathcal {K})$$ is a subspace of $$Z_{k}(\mathcal {K})$$. The quotient space

$$H_{k} (\mathcal{K}) = Z_{k}(\mathcal{K}) / B_{k}(\mathcal{K})$$

is called the kth homology group of $$\mathcal {K}$$, and its rank is the kth Betti number,

$$\beta_{k}(\mathcal{K}) = \text{rank} H_{k}(\mathcal{K}).$$

In computational topology, it is convenient to consider the case where $$\mathbf {F} = \mathbf {F}_{2} = \{0, 1\}$$ because in this case we do not need orientations of simplices.

Let $$A_{k}$$ be the matrix representation of the boundary operator $$\partial_{k}$$ with respect to the standard bases $$\mathcal {K}_{k}$$ and $$\mathcal {K}_{k - 1}$$. It is clear that $$A_{k}$$ is an $$f_{k - 1} \times f_{k}$$ matrix with entries in $$\{0, \pm 1\}$$, where $$f_{k} = \dim \left (C_{k}(\mathcal {K})\right) = \left |\mathcal {K}_{k}\right |$$. Then the kth Betti number can be expressed as

$$\beta_{k}(\mathcal{K}) = \dim \left(\ker A_{k}\right) - \text{rank}\, A_{k + 1} = f_{k} - \text{rank}\, A_{k} - \text{rank}\, A_{k + 1}.$$
(3)
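Formula (3) translates directly into a computation. The sketch below, which is not from the source, works over $$\mathbf{F}_{2}$$ so that orientations can be ignored: it builds each boundary matrix as a list of bitmasks, computes ranks by Gaussian elimination, and applies (3). The function names are illustrative, the input is assumed to list all simplices of a valid complex, and the conventions rank $$A_{0} = 0$$ and rank $$A_{\dim(\mathcal{K}) + 1} = 0$$ are assumptions used to cover the extreme degrees.

```python
import itertools


def rank_gf2(rows):
    """Rank over F_2 of a matrix whose rows are given as integer bitmasks."""
    rows = list(rows)
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot == 0:
            continue
        rank += 1
        low_bit = pivot & -pivot
        # Clear the pivot column in every remaining row.
        rows = [row ^ pivot if row & low_bit else row for row in rows]
    return rank


def betti_numbers_gf2(simplices):
    """Betti numbers over F_2 of a complex given by the list of all its simplices."""
    by_dim = {}
    for s in simplices:
        by_dim.setdefault(len(s) - 1, []).append(tuple(sorted(s)))
    top = max(by_dim)
    ranks = {0: 0, top + 1: 0}  # A_0 and A_{top + 1} are treated as zero maps
    for k in range(1, top + 1):
        index = {s: i for i, s in enumerate(by_dim[k - 1])}
        # One bitmask per k-simplex: bit i is set when the i-th (k-1)-simplex is a face.
        columns = [sum(1 << index[face] for face in itertools.combinations(s, k))
                   for s in by_dim[k]]
        ranks[k] = rank_gf2(columns)
    # Formula (3): beta_k = f_k - rank A_k - rank A_{k+1}.
    return [len(by_dim[k]) - ranks[k] - ranks[k + 1] for k in range(top + 1)]


hollow_triangle = [(0,), (1,), (2,), (0, 1), (0, 2), (1, 2)]
print(betti_numbers_gf2(hollow_triangle))  # [1, 1]
```

Adding the 2-simplex (0, 1, 2) fills the loop, and the same function then returns [1, 0, 0].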

Let $$\{\mathcal {K}^{(i)}\}_{i\in I}$$ be a finite collection of disjoint simplicial complexes. Then $$\cup _{i \in I}\mathcal {K}^{(i)}$$ becomes a simplicial complex. We easily see that

$$\beta_{k} \left(\bigcup_{i \in I}\mathcal{K}^{(i)} \right) = \sum_{i \in I} \beta_{k}\left(\mathcal{K}^{(i)}\right).$$
(4)

Next assume that $$\mathcal {K}$$ is a sub-complex of $$\tilde {\mathcal {K}}$$, that is, $$\mathcal {K} \subset \tilde {\mathcal {K}}$$. We use symbols like $$\tilde {\beta }_{k}, \tilde {f}_{k}$$ and $$\tilde {A}_{k}$$ to denote the corresponding quantities of $$\tilde {\mathcal {K}}$$. By arranging the basis of $$C_{k}(\tilde {\mathcal {K}})$$ so that its first elements coincide with the elements of the basis of $$C_{k}(\mathcal {K})$$, the matrix $$\tilde {A}_{k}$$ has the following form

$$\tilde{A}_{k} = \begin{pmatrix} A_{k} & * \\ \boldsymbol{0} & * \end{pmatrix}.$$

It then follows that $$0 \le \text {rank}\, \tilde {A}_{k} - \text {rank}\, A_{k} \le \tilde {f}_{k} - f_{k}$$. This inequality, together with the relation (3), implies the following result ([9, Lemma 2.2]).

### Lemma 2.1

Let $$\mathcal {K},\tilde {\mathcal {K}}$$ be two finite simplicial complexes such that $$\mathcal {K} \subset \tilde {\mathcal {K}}$$. Then for every k≥1,

$$\left| \beta_{k}(\mathcal{K}) - \beta_{k} (\tilde{\mathcal{K}}) \right| \le \sum\limits_{j = k}^{k + 1} \#\left\{j\text{-simplices in}\ \tilde{\mathcal{K}} \setminus \mathcal{K}\right\}.$$
(5)
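A small example, not taken from the source, shows that the bound (5) can be attained. Let $$\mathcal{K}$$ be the hollow triangle (three vertices and three edges) and let $$\tilde{\mathcal{K}}$$ be the filled triangle obtained by adding the single 2-simplex. Then $$\beta_{1}(\mathcal{K}) = 1$$ and $$\beta_{1}(\tilde{\mathcal{K}}) = 0$$, while $$\tilde{\mathcal{K}} \setminus \mathcal{K}$$ contains no 1-simplices and one 2-simplex, so that for k = 1,

$$\left| \beta_{1}(\mathcal{K}) - \beta_{1}(\tilde{\mathcal{K}}) \right| = 1 = 0 + 1 = \sum_{j = 1}^{2} \#\left\{j\text{-simplices in}\ \tilde{\mathcal{K}} \setminus \mathcal{K}\right\}.$$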

We have mentioned that Betti numbers are nearly additive because of the two properties (4) and (5). Note that $$\beta_{0}$$ counts the number of connected components in the undirected graph $$G = (V, E)$$, where $$E =\mathcal {K}_{1}$$, which is independent of the underlying field $$\mathbf {F}$$.

## Proofs of main theorems

We will use the following two important properties of Poisson point processes. Denote by $$\mathcal {P}(f(x))$$ the non-homogeneous Poisson point process with intensity function f(x).

• Scaling property. For any θ>0 and $$t \in \mathbb {R}^{d}$$,

$$\theta (\mathcal{P}(f(x)) - t) \overset{d}{=} \mathcal{P}(\theta^{-d} f(t + \theta^{-1}x)),$$

where ‘$$\overset {d}{=}$$’ denotes the equality in distribution. In particular, $$\theta (\mathcal {P}(\lambda) - t) \overset {d}{=} \mathcal {P}(\theta ^{-d} \lambda)$$.

• Coupling property. Let $$\mathcal {P}(g(x))$$ be a Poisson point process with intensity function g(x) which is independent of $$\mathcal {P}(f(x))$$. Then

$$\mathcal{P}(f(x)) + \mathcal{P}(g(x)) \overset{d}{=} \mathcal{P}(f(x) + g(x)).$$

Here ‘ + ’ means the superposition of two point processes.

We begin with a result for the simplices counting function.

### Lemma 3.1

(cf. [9, Lemma 3.2]) Let $$S_{j}(\lambda, r; L)$$ be the number of j-simplices in $$\mathcal {C}(\mathcal {P}_{L}(\lambda), r)$$. Then for fixed r>0,

$$\frac{\mathbb{E}\left[S_{j}(\lambda, r; L)\right]}{L} \to \hat S_{j}(\lambda, r) \quad \text{as}\ L \to \infty,\ \text{uniformly for}\ 0 \le \lambda \le \Lambda.$$

In addition, for fixed r, the limit $$\hat S_{j}(\lambda, r)$$ is a continuous function of λ on $$[0, \infty)$$.

### Proof

For convenience, let $$A_{l}(\lambda) := S_{j}\left (\lambda, r; l^{d}\right) = S_{j}\left (\mathcal {C}\left (\mathcal {P}_{V_{l}}(\lambda), r\right)\right)$$, where $$V_{l} = [-\frac {l}{2}, \frac {l}{2})^{d}$$. Our aim now is to show that

$$\frac{\mathbb{E}\left[A_{l}(\lambda)\right]}{l^{d}}\ \text{converges uniformly for}\ \lambda \in [0, \Lambda]\ \text{as}\ l \to \infty,$$

and that $$\mathbb {E}\left [A_{l}(\lambda)\right ]$$ is continuous for $$\lambda \in [0, \infty)$$. Let us first show the continuity of $$\mathbb {E}\left [A_{l}(\lambda)\right ]$$. For $$0 \le \lambda < \mu$$, we use the coupling $$\mathcal {P}(\mu) = \mathcal {P}(\lambda) + \mathcal {P}(\mu - \lambda)$$. Here $$\mathcal {P}(\lambda)$$ and $$\mathcal {P}(\mu - \lambda)$$ are two independent Poisson point processes with densities λ and $$\mu - \lambda$$, respectively. Let $$N_{\lambda}$$ (resp. $$N_{\mu; \lambda}$$) be the number of points of $$\mathcal {P}(\lambda)$$ (resp. $$\mathcal {P}(\mu - \lambda)$$) in $$V_{l}$$, which has Poisson distribution with parameter $$\lambda l^{d}$$ (resp. $$(\mu - \lambda) l^{d}$$). Then the continuity follows from the elementary estimate

$$\begin{array}{*{20}l} 0 \le A_{l}(\mu) - A_{l}(\lambda) \le N_{\mu; \lambda} \left(N_{\mu; \lambda} + N_{\lambda}\right)^{j}. \end{array}$$

Next, we show the uniform convergence. The proof here is similar to that of the pointwise convergence ([9, Lemma 3.2]). Define the function

$$h(\mathcal{P}(\lambda)) := \frac{1}{j + 1} \sum_{x \in \mathcal{P}_{1}(\lambda)} \#\left\{j\text{-simplices in}\ \mathcal{C}(\mathcal{P}(\lambda), r)\ \text{containing}\ x\right\}.$$
(6)

Then for l>4r+1,

$$\sum_{z \in \mathbb{Z}^{d} \cap V_{l -4r -1}}\!\! h(\mathcal{P}(\lambda) - z) \le A_{l}(\lambda) \le\! \sum_{z \in \mathbb{Z}^{d} \cap V_{l + 1}} h(\mathcal{P}(\lambda) - z).$$

Consequently, by the stationarity of the Poisson point process $$\mathcal {P}(\lambda)$$,

$${} (l - 4r - 2)^{d} \mathbb{E}\, [\!h(\mathcal{P}(\lambda))] \le \mathbb{E}\left[A_{l}(\lambda)\right] \le (l+2)^{d} \mathbb{E}\, [\!h(\mathcal{P}(\lambda))].$$

Note that $$\mathbb {E}\,[\!h(\mathcal {P}(\lambda))]$$ is non-decreasing in λ and for any λ>0,

$$\mathbb{E}\, [\!h(\mathcal{P}(\lambda))] \le \mathbb{E} \left[\mathcal{P}\left(\lambda; V_{1 + 4r}\right)^{j + 1}\right] < \infty.$$

Here $$\mathcal {P}(\lambda ; V_{1 + 4r})$$ is the number of points of $$\mathcal {P}(\lambda)$$ in $$V_{1 + 4r}$$. Therefore uniformly for $$0 \le \lambda \le \Lambda$$,

$$\frac{\mathbb{E}\left[A_{l}(\lambda)\right]}{ l^{d}} \to \mathbb{E}\,[h(\mathcal{P}(\lambda))] \quad \text{as}\ l \to \infty.$$

The proof is complete. □

For the sake of simplicity, we denote by $$\beta_{k}(\lambda, r; L)$$ the kth Betti number of the Čech complex $$\mathcal {C}(\mathcal {P}_{W_{L}} (\lambda), r)$$, where $$W_{L}$$ is any rectangle of the form $$x + [-\frac {L^{1/d}}{2}, \frac {L^{1/d}}{2})^{d}$$.

### Lemma 3.2

For fixed r>0, uniformly for $$0 \le \lambda \le \Lambda$$,

$$\frac{\mathbb{E}\,[\beta_{k} (\lambda, r; L)]}{L} \to \hat \beta_{k}(\lambda, r) \quad \text{as}\ L \to \infty.$$

The limit $$\hat \beta _{k}(\lambda, r)$$ has the following scaling property,

$$\hat \beta_{k}(\lambda, r) = \frac{1}{\theta}\hat \beta_{k}\left(\lambda \theta, \frac{r}{\theta^{1/d}} \right) \quad \text{for any}\ \theta > 0.$$

In particular, $$\hat \beta _{k}(\lambda, r) = \lambda \hat \beta _{k} \left (1, \lambda ^{1/d} r\right)$$ is a continuous function in both λ and r, and $$\hat \beta _{k}(\lambda, r) > 0$$ if λ>0 and r>0.

### Proof

For fixed r>0 and fixed λ>0, the convergence of the expectations of Betti numbers was shown in [9, Lemma 3.3]. The positivity is a consequence of [8, Theorem 4.2]. Here we will show the uniform convergence for $$0 \le \lambda \le \Lambda$$. We use the following criterion for uniform convergence on a compact set, which is related to the Arzelà–Ascoli theorem. The sequence of continuous functions $$\{a_{L}(\lambda)\}_{L > 0}$$ converges uniformly on $$[0, \Lambda]$$ if and only if it converges pointwise and is equicontinuous, that is, for any ε>0, there are δ>0 and $$L_{0} > 0$$ such that

\begin{aligned} \left|a_{L}\left(\lambda_{1}\right) \right.& \left.- a_{L}\left(\lambda_{2}\right) \right| < \varepsilon\, \text{for all}\ \lambda_{1}, \lambda_{2} \in \,[\!0, \Lambda],\\ &\quad \left|\lambda_{1} - \lambda_{2}\right| < \delta,\, \text{and all}\, L > L_{0}. \end{aligned}
(7)

Our task now is to show that the sequence $$\left \{L^{-1} \mathbb {E}\left [ \beta _{k} (\lambda, r; L)\right ]\right \}$$ is equicontinuous. Let λ<μ. By using the coupling $$\mathcal {P}(\mu) = \mathcal {P}(\lambda) + \mathcal {P}(\mu - \lambda)$$, the Čech complex $$\mathcal {C}(\mathcal {P}_{L}(\lambda), r)$$ becomes a sub-complex of $$\mathcal {C}\left (\mathcal {P}_{L}(\mu), r\right)$$. Thus, by Lemma 2.1,

$$\begin{aligned} \left|\beta_{k}(\mu, r ; L) - \beta_{k}(\lambda, r ; L)\right| &\le \sum_{j = k}^{k + 1} \#\left\{j\text{-simplices in}\ \mathcal{C}(\mathcal{P}_{L}(\mu), r) \setminus \mathcal{C}(\mathcal{P}_{L}(\lambda), r) \right\} \\ &=\sum_{j = k}^{k + 1} \left(S_{j}(\mu, r; L) - S_{j}(\lambda, r; L)\right). \end{aligned}$$
(8)

Therefore

\begin{aligned} {}&\left| \frac{\mathbb{E} [ \beta_{k}(\mu, r ; L)]}{L} - \frac{\mathbb{E} [ \beta_{k}(\lambda, r; L)]} {L} \right| \\ &\le \sum_{j = k}^{k + 1} \left(\frac{\mathbb{E} \left[S_{j}(\mu, r ; L)\right]}{L} - \frac{\mathbb{E} \left[ S_{j}(\lambda, r; L)\right]}{L} \right). \end{aligned}
(9)

The sequence $$\left \{L^{-1}\mathbb {E}\left [S_{j}(\lambda, r; L)\right ]\right \}$$ converges uniformly on $$[0, \Lambda]$$ by Lemma 3.1 and hence is equicontinuous, which by (9) implies the equicontinuity of the sequence $$\left \{L^{-1} \mathbb {E}\left [ \beta _{k} (\lambda, r; L)\right ]\right \}$$.

By observing that $$\theta ^{-1/d} \mathcal {P}(\lambda)$$ has the same distribution as $$\mathcal {P}(\lambda \theta)$$, we obtain the scaling property of $$\hat \beta _{k}(\lambda, r)$$. It then follows from the scaling property that $$\hat \beta _{k}(\lambda, r)$$ is continuous in both λ and r. The lemma is proved. □
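The identity in the last part of Lemma 3.2 is the scaling property with the particular choice θ = 1/λ, for λ>0. Indeed, for this choice λθ = 1 and $$r / \theta^{1/d} = \lambda^{1/d} r$$, so that

$$\hat \beta_{k}(\lambda, r) = \frac{1}{\theta}\hat \beta_{k}\left(\lambda \theta, \frac{r}{\theta^{1/d}} \right) \bigg|_{\theta = 1/\lambda} = \lambda \hat \beta_{k}\left(1, \lambda^{1/d} r\right).$$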

Let us now consider the scaled Poissonized version

$$\mathcal{P}_{n} = \left\{n^{1/d}X_{1}, n^{1/d} X_{2}, \dots, n^{1/d}X_{N_{n}}\right\}.$$

Recall that $$N_{n}$$ is independent of $$\{X_{n}\}$$ and has Poisson distribution with parameter n. Then $$\mathcal {P}_{n} = n^{1/d}\bar {\mathcal {P}}_{n}$$ is a non-homogeneous Poisson point process with the intensity function $$f_{n}(x) := f(x / n^{1/d})$$. It is clear that $$\mathcal {C}\left (\mathcal {P}_{n}, n^{1/d} r_{n}\right) \cong \mathcal {C}\left (\bar {\mathcal {P}}_{n}, r_{n}\right)$$. Thus Theorem 1.6 can be rewritten as follows.

### Theorem 3.3

Assume that the common probability density function f(x) has compact support, is bounded and Riemann integrable. Then in the regime that $$\tilde {r}_{n} \to r \in (0, \infty)$$,

\begin{aligned} &{}\frac{\mathbb{E} \left[\beta_{k} \left(\mathcal{C}\left(\mathcal{P}_{n}, \tilde{r}_{n}\right)\right)\right]}{n} \to \int_{\mathbb{R}^{d}} \hat{\beta}_{k}(f(x), r) dx\\ &\qquad\qquad\qquad\,\,\,\, =\! \! \int_{\mathbb{R}^{d}} \!\hat{\beta}_{k}\!\left(\!1, {f(x)^{1/d}} r\right) \! f(x) dx\, \text{as}\ n \!\to\! \! \infty. \end{aligned}
(10)

### Remark 3.4.

Note that $$\mathcal {P}'_{n} = \left (r/\tilde {r}_{n}\right) \mathcal {P}_{n}$$ is also a non-homogeneous Poisson point process. Moreover, as a result of scaling, $$\mathcal {C}\left (\mathcal {P}_{n}, \tilde {r}_{n}\right) \cong \mathcal {C}\left (\mathcal {P}_{n}', r\right)$$. Thus it is enough to prove Theorem 3.3 with $$\tilde {r}_{n} = r$$.

### Lemma 3.5

Assume that $$f(x), g(x) \le \Lambda$$ in $$W_{L}$$, where $$W_{L} \subset \mathbb {R}^{d}$$ is any Borel set of volume L. Then there exists a constant $$c = c(k, \Lambda L)$$ such that

\begin{aligned} {}\left| \mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{W_{L}}(f(x)), r\right)\right)\right] \right.&\left.- \mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{W_{L}}(g(x)), r\right)\right)\right] \right|\\ &\le c \int_{W_{L}}\left|f(x) - g(x)\right|dx. \end{aligned}
(11)

### Proof

By considering $$f(x) := f(x)|_{W_{L}}$$ and $$g(x) := g(x)|_{W_{L}}$$, we omit the subscript $$W_{L}$$ in formulae. Let $$h(x) = \max \{f(x), g(x)\}$$. A key idea here is the following coupling

$$\mathcal{P}(h(x)) = \mathcal{P}(f(x)) + \mathcal{P}(h(x) - f(x)).$$

Let $$t = \int (h(x) - f(x)) dx = \int (g(x) - f(x))^{+} dx$$ and let $$N_{t}$$ be the number of points of $$\mathcal {P}(h(x)- f(x))$$ in $$W_{L}$$. Then $$N_{t}$$ has Poisson distribution with parameter t. The total number of points of $$\mathcal {P}(h(x))$$ is bounded by $$N_{t} + N_{\Lambda L - t}$$, where $$N_{\Lambda L - t}$$ has Poisson distribution with parameter $$\Lambda L - t$$ and is independent of $$N_{t}$$. It now follows from Lemma 2.1 that

$$\begin{array}{*{20}l} &\left|\beta_{k}(\mathcal{C}(\mathcal{P}(f(x)), r)) - \beta_{k}\left(\mathcal{C}(\mathcal{P}(h(x)), r)\right) \right| \\ &\le \sum_{j = k}^{k + 1} S_{j} \left(\mathcal{C}(\mathcal{P}(h(x)), r) \setminus \mathcal{C}(\mathcal{P}(f(x)), r) \right) \\ &\le 2N_{t} \left(N_{t} + N_{\Lambda L - t}\right)^{k+ 1}, \end{array}$$

and hence,

\begin{aligned} &{} \left|\mathbb{E}\left[\beta_{k}(\mathcal{C}(\mathcal{P}(f(x)), r))\right] - \mathbb{E}\left[\beta_{k}(\mathcal{C}(\mathcal{P}(h(x)), r)) \right]\right| \\&\,\,\,\qquad\qquad\quad\qquad \le 2\mathbb{E}\left[ N_{t} \left(N_{t} + N_{\Lambda L - t}\right)^{k+1}\right]. \end{aligned}
(12)

The right-hand side is a polynomial in t whose lowest-order term has degree 1. Since $$t \le \Lambda L$$, it is bounded by $$c\, t$$, where the constant $$c = c(k, \Lambda L)$$ depends only on k and ΛL. Namely, we have

\begin{aligned} &{} \left|\mathbb{E}\left[\beta_{k}(\mathcal{C}(\mathcal{P}(f(x)), r))\right] - \mathbb{E}\left[\beta_{k}(\mathcal{C}(\mathcal{P}(h(x)), r)) \right]\right| \\ &\,\,\,\qquad\qquad\qquad\quad\le c \int (g(x) - f(x))^{+}dx. \end{aligned}
(13)

An analogous estimate holds when we compare the kth Betti number of $$\mathcal {C}(\mathcal {P}(g(x)), r)$$ and that of $$\mathcal {C}(\mathcal {P}(h(x)), r)$$. The proof is complete. □
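The passage from (12) to (13) can be made explicit as follows. This is one possible way to see the bound and is not taken from the source. Write $$N := N_{t} + N_{\Lambda L - t}$$, which has Poisson distribution with parameter ΛL. Conditionally on N, the variable $$N_{t}$$ is binomial with N trials and success probability t/(ΛL), so that $$\mathbb{E}[N_{t} \mid N] = N t / (\Lambda L)$$ and

$$2\, \mathbb{E}\left[ N_{t} \left(N_{t} + N_{\Lambda L - t}\right)^{k+1}\right] = 2\, \mathbb{E}\left[ N^{k+1}\, \mathbb{E}[N_{t} \mid N] \right] = \frac{2\, \mathbb{E}\left[N^{k+2}\right]}{\Lambda L}\, t.$$

The factor multiplying t is a moment of a Poisson variable with parameter ΛL divided by ΛL, hence depends only on k and ΛL, in agreement with the constant c(k, ΛL) in Lemma 3.5.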

### Proof of Theorem 3.3

Let S be the support of f and $$\Lambda := \sup f(x)$$. Divide $$\mathbb {R}^{d}$$ according to the lattice $$(L/n)^{1/d}\mathbb {Z}^{d}$$ and let $$\{C_{i}\}$$ be the cubes which intersect S. Since we also consider the Poisson point process with density 0, we may assume that $$S = \bigcup_{i} C_{i}$$.

Let $$W_{i}$$ be the image of $$C_{i}$$ under the map $$x \mapsto n^{1/d} x$$. Then $$W_{i}$$ is a cube of side length $$L^{1/d}$$. Let $$\beta_{k}(W_{i}, r)$$ be the kth Betti number of $$\mathcal {C}(\mathcal {P}_{n}|_{W_{i}}, r)$$. We now compare the kth Betti number of $$\mathcal {C}(\mathcal {P}_{n}, r)$$ and that of $$\cup _{i} \mathcal {C}(\mathcal {P}_{n}|_{W_{i}}, r)$$ by taking into account Lemma 2.1,

{{\begin{aligned} {}\left| \beta_{k}\left(\mathcal{C}(\mathcal{P}_{n}, r)\right) \,-\, \beta_{k}\left(\!\bigcup_{i}\!\mathcal{C}\!\left(\mathcal{P}_{n}|_{W_{i}}, r\right)\!\right) \right| \!&\!\le\! \sum_{j = k}^{k + 1} S_{j}\!\left(\!\mathcal{C}\!\left(\mathcal{P}_{n}, r\right) \!\setminus \!\bigcup_{i}\!\mathcal{C}\!\left(\mathcal{P}_{n}|_{W_{i}}, r\right) \!\right) \\ &\le \sum_{j = k}^{k + 1} S_{j}\left(\mathcal{P}_{n}, r; \cup_{i} \left(\partial W_{i}\right)^{(2r)}\right). \end{aligned}}}

Here $$S_{j}\left (\mathcal {P}_{n}, r; A\right)$$ is the number of j-simplices in $$\mathcal {C}\left (\mathcal {P}_{n}, r\right)$$ which have a vertex in A, $$\partial A$$ denotes the boundary of the set A, and $$A^{(2r)}$$ is the set of points at distance at most 2r from A. The second inequality holds because any simplex in $$\mathcal {C}\left (\mathcal {P}_{n}, r\right) \setminus \cup _{i}\mathcal {C}(\mathcal {P}_{n}|_{W_{i}}, r)$$ must have a vertex in $$\bigcup_{i} \left(\partial W_{i}\right)^{(2r)}$$.

Next, by the coupling $$\mathcal {P}(\Lambda) = \mathcal {P}_{n} + \mathcal {P}\left (\Lambda - f\left (x/n^{1/d}\right)\right)$$, it follows that for any bounded Borel set A,

$$\begin{array}{*{20}l} &\mathbb{E} \left[S_{j}\left(\mathcal{P}_{n}, r; A\right)\right] \le \mathbb{E} \left[S_{j}(\mathcal{P}(\Lambda), r ; A)\right] \\ &\le \mathbb{E}\left[ \sum_{x \in \mathcal{P}(\Lambda) \cap A} \mathcal{P}\left(\Lambda; B_{2r}(x)\right)^{j}\right]=: \mu_{\Lambda, r, j} (A) < \infty. \end{array}$$

It turns out that $$\mu_{\Lambda, r, j}$$ is a translation invariant measure on $$\mathbb {R}^{d}$$ which is finite on bounded Borel sets. Thus $$\mu_{\Lambda, r, j}(A) = c(\Lambda, r, j) |A|$$ for some constant $$c(\Lambda, r, j)$$ depending only on Λ, r and j. Now by taking the expectation in (5), we get

$$\begin{array}{*{20}l} &\left| \mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{n}, r\right)\right)\right] - \sum_{i} \mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{n}|_{W_{i}}, r\right)\right) \right] \right| \\ &\le c \sum_ i \left|\left(\partial W_{i}\right)^{(2r)}\right| \le c' \frac{n |S|}{L} L^{(d-1)/d} = c' \frac{n |S|}{L^{1/d}}, \end{array}$$

where c and $$c'$$ are constants which do not depend on n and L. Therefore,

$${} \limsup_{n \to \infty} \left| \frac{\mathbb{E} \left[\beta_{k}\left(\mathcal{C}\left(\mathcal{P}_{n}, r\right)\right)\right]}{n} - \frac{1}{n} \sum_{i} \mathbb{E} \left[\beta_{k}\left(W_{i}, r\right) \right] \right| \!\le\! c' \frac{|S|}{L^{1/d}}.$$
(14)

Let $$f^{*}_{i} := \sup _{x \in C_{i}} f(x)$$ and let $$\beta _{k}\left (f_{i}^{*}, r; L\right)$$ be the kth Betti number of the Čech complex built on a homogeneous Poisson point process $$\mathcal {P}_{W_{i}}\left (f_{i}^{*}\right)$$ with density $$f_{i}^{*}$$ restricted to $$W_{i}$$. Then by Lemma 3.5,

{{\begin{aligned} {} \left|\mathbb{E} \!\left[\beta_{k}\!\left(W_{i}, r\right)\right] \,-\, \mathbb{E} \left[\beta_{k}\left(f^{*}_{i}, r;L\right)\right] \right| &\!\le\! c(k, \Lambda L) \int_{W_{i}}{\left(f^{*}_{i} \,-\, f\left(x/n^{1/d}\right)\right) \!dx} \\ &= c(k, \Lambda L) n \int_{C_{i}} \left(f^{*}_{i} - f(x)\right) dx. \end{aligned}}}

Here $$c(k, \Lambda L)$$ is the constant in Lemma 3.5. Consequently,

$$\begin{array}{*{20}l} &\left| {\frac 1n \sum_{i} \mathbb{E}\left[ \beta_{k}\!\left(W_{i}, r\right)\right] - \frac 1n \sum_{i} \mathbb{E} \left[\beta_{k}\!\!\left(f^{*}_{i}, r;L\right)\right]}\right| \\ &\le c(k, \Lambda L) \sum_{i} \int_{C_{i}} \left(f_{i}^{*} - f(x)\right) dx \to 0\, \text{as}\ n \to \infty, \end{array}$$

because the function f(x) is assumed to be Riemann integrable.

By comparing $$\mathbb {E}\! \left [\beta _{k}\left (f_{i}^{*}, r;L\right)\right ]$$ with the limit $$\hat \beta _{k}(\lambda, r)$$, we get

$$\begin{array}{*{20}l} &\left| {\frac 1n \sum_{i} \mathbb{E} \!\left[\beta_{k}\left(f^{*}_{i}, r;L\right)\right] - \frac{L}{n} \sum_{i} \hat \beta_{k}\left(f^{*}_{i}, r\right)}\right| \\ &\le \frac{L}{n} \#\left\{C_{i} \right\} \sup_{0 \le \lambda \le \Lambda} \left|\frac{\mathbb{E} \!\left[\beta_{k}(\lambda,r; L)\right]}{L} - \hat \beta_{k}(\lambda, r)\right| \\ &= |S| \sup_{0 \le \lambda \le \Lambda} \left|\frac{\mathbb{E} [\! \beta_{k}(\lambda,r; L)]}{L} - \hat \beta_{k}(\lambda, r)\right|. \end{array}$$

Note that for fixed L, as $$n \to \infty$$,

$$\sum_{i} \hat \beta_{k}\left(f^{*}_{i}, r\right) \frac Ln \to \int_{S} \hat \beta_{k}(f(x), r) dx.$$

Therefore

$$\begin{array}{*{20}l} &\limsup_{n \to \infty} \left| \frac{1}{n} \sum_{i} \mathbb{E} \!\left[\beta_{k}(W_{i}, r)\right] - \int_{S} \hat \beta_{k}(f(x), r) dx \right| \\ &\le |S| \sup_{0 \le \lambda \le \Lambda} \left|\frac{\mathbb{E} [\!\beta_{k}(\lambda, r;L)]}{L} - \hat \beta_{k}(\lambda, r)\right|. \end{array}$$
(15)

Combining the two estimates (14) and (15) and then letting $$L \to \infty$$, we get the desired result. The proof is complete. □

The result for binomial point processes will follow from Theorem 1.6 and the following result.

### Lemma 3.6

As $$n \to \infty$$ with $$n^{1/d} r_{n} \to r$$,

$$\left| \frac{\mathbb{E}\!\left[\beta_{k}\left(\mathcal{C}(\bar{\mathcal{P}}_{n}, r_{n})\right)\right]}{n} - \frac{\mathbb{E}\!\left[\beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right)\right]}{n} \right| \to 0.$$

### Proof

By Lemma 2.1 again, we have,

$$\begin{aligned} &\left| \beta_{k}\left(\mathcal{C}\left(\bar{\mathcal{P}}_{n}, r_{n}\right)\right) - \beta_{k}\left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right) \right| \\ &\quad\le \sum_{j = k}^{k + 1} \left|S_{j}\left(\mathcal{C}\left(\bar{\mathcal{P}}_{n}, r_{n}\right)\right) - S_{j} \left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right) \right|. \end{aligned}$$

The right-hand side, divided by n, converges to 0 as a consequence of general results in [6, 7] applied to $$S_{j}$$. Here we give an easy proof.

For any m, let

$$S_{j}(m, n) = \left|S_{j}\left(\mathcal{C}\left(\mathfrak{X}_{m}, r_{n}\right)\right) - S_{j} \left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right) \right|.$$

Since the probability density function f(x) is bounded, in the regime $$n^{1/d} r_{n} \to r$$, the probability of the event $$\left\{X_{1} \in B_{x}(r_{n})\right\}$$ is bounded by

$$\mathbb{P}\left(X_{1} \in B_{x} \left(r_{n}\right)\right) \le \frac{c}{n},$$

for some constant c which depends on neither n nor x.
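This bound can be checked directly. The following short derivation is a sketch that uses only the boundedness of f. The symbol $$\omega_{d}$$, denoting the volume of the unit ball in $$\mathbb{R}^{d}$$, is notation introduced here for illustration:

$$\begin{aligned} \mathbb{P}\left(X_{1} \in B_{x}(r_{n})\right) &= \int_{B_{x}(r_{n})} f(y)\, dy \le \left(\sup_{y} f(y)\right) \omega_{d}\, r_{n}^{d} \\ &= \left(\sup_{y} f(y)\right) \omega_{d}\, \frac{\left(n^{1/d} r_{n}\right)^{d}}{n} \le \frac{c}{n}. \end{aligned}$$

One admissible choice is $$c = \left(\sup_{y} f(y)\right) \omega_{d} \sup_{n} \left(n^{1/d} r_{n}\right)^{d}$$, which is finite because the convergent sequence $$n^{1/d} r_{n}$$ is bounded.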

For $$m > n \ge j$$, since each j-simplex in $$\mathcal {C}\left (\mathfrak {X}_{m}, r_{n}\right) \setminus \mathcal {C}\left (\mathfrak {X}_{n}, r_{n}\right)$$ must contain at least one vertex in $$\left\{X_{n+1}, \dots, X_{m}\right\}$$, we have

$$\begin{aligned} \mathbb{E}\left[S_{j}(m, n)\right] &\le (m - n)\, \mathbb{E}\left[\#\left\{j\text{-simplices in } \mathcal{C}(\mathfrak{X}_{m}, r_{n}) \text{ containing } X_{m}\right\}\right] \\ &\le (m - n) \binom{m}{j} \mathbb{P}\left(X_{1} \in B_{X_{m}}(r_{n}), \dots, X_{j} \in B_{X_{m}}(r_{n})\right) \\ &\le (m - n) \frac{m!}{j! (m - j)!} \left(\frac{c}{n}\right)^{j} \\ &\le c_{1} (m - n) \left(\frac{m}{n}\right)^{j}. \end{aligned}$$
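The combinatorial observation behind this estimate, that every simplex gained when passing from the first n points to m > n points uses at least one of the new points, can be checked on a small example. The sketch below is illustrative only and makes assumptions not in the text: it treats the case j = 1 (edges), where two vertices span a 1-simplex of the Čech complex exactly when their distance is at most 2r, and it uses uniform points in the unit square. The function names are hypothetical.

```python
import itertools
import math
import random


def cech_edges(points, r):
    """Index pairs (a, b), a < b, whose closed r-balls intersect (distance <= 2r)."""
    return {
        (a, b)
        for a, b in itertools.combinations(range(len(points)), 2)
        if math.dist(points[a], points[b]) <= 2 * r
    }


def edge_count_difference(points, n, r):
    """Compare S_1 on all m points with S_1 on the first n points.

    Returns (S_1(m) - S_1(n), number of edges using a vertex with index >= n).
    """
    edges_m = cech_edges(points, r)
    edges_n = cech_edges(points[:n], r)
    # Since a < b, an edge uses a new vertex exactly when b >= n (0-based indices).
    new_edges = {edge for edge in edges_m if edge[1] >= n}
    return len(edges_m) - len(edges_n), len(new_edges)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is reproducible
    n, m = 200, 260
    points = [(rng.random(), rng.random()) for _ in range(m)]
    r = 0.5 / math.sqrt(n)  # thermodynamic scaling n^{1/d} r_n = 0.5 with d = 2
    print(edge_count_difference(points, n, r))
```

The two returned numbers always agree, which is the identity that the first inequality above bounds in expectation.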

When $$j \le m < n$$, we interchange the roles of m and n to get

$$\mathbb{E}\!\left[S_{j}(m, n)\right] \le (n - m) \binom n j \left(\frac{c}{n} \right)^{j} \le c_{2} (n - m).$$

Combining the two estimates, we have

$$\mathbb{E}\!\left[S_{j}(m, n)\right] \le c_{3} |m - n| \left[1 + \left(\frac{m}{n} \right)^{j} \right].$$

Therefore,

$$\begin{array}{*{20}l} &\mathbb{E} \!\left[\left|S_{j}\left(\mathcal{C}\left(\bar{\mathcal{P}}_{n}, r_{n}\right)\right) - S_{j} \left(\mathcal{C}\left(\mathfrak{X}_{n}, r_{n}\right)\right) \right| \right] \\ &\le c_{3} \mathbb{E} \!\left[ \left|N_{n} - n \right| \left(1 + \frac{\left(N_{n}\right)^{j}}{n^{j}} \right) \right] \\ &\le c_{3} \mathbb{E}\!\left[\left(N_{n} - n\right)^{2}\right]^{1/2} \mathbb{E} \!\left[ \left(1 + \frac{(N_{n})^{j}}{n^{j}} \right)^{2} \right]^{1/2}. \end{array}$$

Here, in the last inequality, we have used the Cauchy–Schwarz inequality. Note that, for each j, $$\mathbb {E}\left [\left (N_{n}\right)^{j}\right ]$$ is a polynomial in n of degree j. Expanding the square and applying this to the moments of order j and 2j shows that the second factor in the above estimate remains bounded as $$n \to \infty$$. Note also that

$$\mathbb{E}\!\left[\left(N_{n} - n\right)^{2}\right] = \text{Var}\left[N_{n}\right] = n.$$

Therefore,

$${}\frac{\mathbb{E}\! \left[\left|S_{j}\left(\mathcal{C}\left(\bar{\mathcal{P}}_{n}, r_{n}\right)\right) \! -\! S_{j} \left(\mathcal{C}\! \left(\mathfrak{X}_{n}, r_{n}\right)\right) \right| \right]}{n} \! \le\! \frac{c_{4}}{n^{1/2}} \to 0\, \text{as}\ n \to \infty,$$

which completes the proof of Lemma 3.6. □
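The two factors in the Cauchy–Schwarz step can be evaluated numerically. The following is a minimal sketch, assuming that $$N_{n}$$ is Poisson distributed with mean n (the source of Var[N_n] = n above) and truncating the Poisson sums far in the upper tail. The function names and the truncation parameter `tail` are hypothetical choices made for illustration.

```python
import math


def poisson_pmf(k, mean):
    """P(N = k) for N ~ Poisson(mean), evaluated in log space to avoid overflow."""
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))


def cauchy_schwarz_factors(n, j, tail=12.0):
    """Return (lhs, first, second) for N ~ Poisson(n), where

    lhs    = E[|N - n| (1 + N^j / n^j)],
    first  = E[(N - n)^2]^(1/2),
    second = E[(1 + N^j / n^j)^2]^(1/2).

    The sums are truncated about `tail` standard deviations above the mean,
    so each value is a very slight underestimate of the true expectation.
    """
    upper = int(n + tail * math.sqrt(n)) + 20
    lhs = 0.0
    variance = 0.0
    second_moment = 0.0
    for k in range(upper + 1):
        p = poisson_pmf(k, n)
        weight = 1.0 + (k / n) ** j
        lhs += abs(k - n) * weight * p
        variance += (k - n) ** 2 * p
        second_moment += weight ** 2 * p
    return lhs, math.sqrt(variance), math.sqrt(second_moment)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        lhs, first, second = cauchy_schwarz_factors(n, j=2)
        print(n, lhs / n, first * second / n)
```

In this sketch the first factor equals $$n^{1/2}$$ up to truncation error and the second factor stays bounded, so both printed ratios decay at the rate $$n^{-1/2}$$ obtained in the proof.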

## References

1. Bobrowski, O, Kahle, M: Topology of random geometric complexes: a survey. Topology in Statistical Inference, the Proceedings of Symposia in Applied Mathematics. To appear.

2. Meester, R, Roy, R: Continuum Percolation. In: Cambridge Tracts in Mathematics, p. 238. Cambridge University Press, Cambridge (1996).

3. Munkres, JR: Elements of Algebraic Topology. Addison-Wesley Publishing Company, Menlo Park (1984).

4. Nakamura, T, Hiraoka, Y, Hirata, A, Escolar, EG, Nishiura, Y: Persistent homology and many-body atomic structure for medium-range order in the glass. Nanotechnology. 26(30), 304001 (2015).

5. Penrose, M: Random Geometric Graphs. In: Oxford Studies in Probability, p. 330. Oxford University Press, Oxford (2003).

6. Penrose, MD: Laws of large numbers in stochastic geometry with statistical applications. Bernoulli. 13(4), 1124–1150 (2007).

7. Penrose, MD, Yukich, JE: Weak laws of large numbers in geometric probability. Ann. Appl. Probab. 13(1), 277–303 (2003).

8. Yogeshwaran, D, Adler, RJ: On the topology of random complexes built over stationary point processes. Ann. Appl. Probab. 25(6), 3338–3380 (2015).

9. Yogeshwaran, D, Subag, E, Adler, RJ: Random geometric complexes in the thermodynamic regime. Probab. Theory Relat. Fields. 167(1-2), 107–142 (2017).


## Acknowledgements

The author would like to thank Professor Tomoyuki Shirai and Dr. Kenkichi Tsunoda for useful discussions. The author would also like to thank the referees for their valuable comments. This work is partially supported by JST CREST Mathematics (15656429). The author is partially supported by JSPS KAKENHI Grant No. 16K17616.

### Competing interests

The author declares that he has no competing interests.

## Author information

### Corresponding author

Correspondence to Khanh Duy Trinh.

## Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
