2. Methods
Let $L$ and $X$ be compact subsets of $\mathbb{R}^d$ with respect to the Euclidean metric. For points $x, y \in \mathbb{R}^d$, define $\mathrm{path}(x,y)$ to be the set of bounded piecewise-$C^1$ paths from $x$ to $y$, parametrized by Euclidean arc length. Similarly, $\mathrm{path}(x,S)$ denotes all paths from $x$ to a set $S$.
For any compact set $L \subset \mathbb{R}^d$, define $f_L : \mathbb{R}^d \to \mathbb{R}$ by:
$$f_L(z) := \min_{z' \in L} \|z - z'\|.$$
The length of a unit-speed path $\gamma : [0, \ell] \to \mathbb{R}^d$ is denoted as:
$$\mathrm{len}(\gamma) := \ell.$$
For $x, y \in \mathbb{R}^d \setminus L$, define:
$$d_L(x, y) := \inf_{\gamma \in \mathrm{path}(x,y)} \int_0^{\mathrm{len}(\gamma)} \frac{\mathrm{d}t}{f_L(\gamma(t))}$$
and:
$$\hat{d}_L(x, y) := \frac{\|x - y\|}{f_L(x)}.$$
Note that $d_L$ is a distance function, while $\hat{d}_L$ is not. The latter can be interpreted as a first-order approximation of the former.
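The quantities above are straightforward to approximate numerically when $L$ and $X$ are given as finite point sets. The sketch below is illustrative only (the names `f_L`, `d_hat`, and `induced_metric` are ours, not from the text): it evaluates the Euclidean distance-to-$L$ function, the first-order approximation $\hat{d}_L$, and a graph-based estimate of $d_L$ obtained by weighting each short edge by its Euclidean length divided by a local value of $f_L$, which approximates the integral defining $d_L$.

```python
# Minimal numerical sketch of f_L, the approximation d_hat, and a graph-based
# estimate of the induced metric d_L.  Assumes finite point arrays; all names
# here are illustrative and not taken from the paper.
import numpy as np
from scipy.sparse.csgraph import shortest_path
from scipy.spatial.distance import cdist

def f_L(points, L):
    """Euclidean distance from each row of `points` to the finite set L."""
    return cdist(points, L).min(axis=1)

def d_hat(x, y, L):
    """First-order approximation ||x - y|| / f_L(x); note it is not symmetric."""
    return np.linalg.norm(x - y) / f_L(x[None, :], L)[0]

def induced_metric(points, L, k=8):
    """Estimate d_L by shortest paths in a k-nearest-neighbor graph whose edge
    (u, v) is weighted by ||u - v|| divided by the average of f_L at u and v."""
    f = f_L(points, L)
    D = cdist(points, points)
    W = np.full_like(D, np.inf)                      # inf marks a missing edge
    for i in range(len(points)):
        nbrs = np.argsort(D[i])[1:k + 1]             # k nearest neighbors of i
        W[i, nbrs] = D[i, nbrs] / ((f[i] + f[nbrs]) / 2.0)
    W = np.minimum(W, W.T)                           # make the graph symmetric
    return shortest_path(W, method="D")              # all-pairs estimate of d_L
```

The edge weight is a two-point quadrature of the integral $\int \mathrm{d}t / f_L(\gamma(t))$ along the straight segment between neighboring points, so the estimate improves as the point set becomes denser.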
Definition 1. For any compact set $X \subseteq \mathbb{R}^d \setminus L$, for some compact set $L \subset \mathbb{R}^d$, the α-offsets with respect to $d_L$ are:
$$X^\alpha_L := \{ y \in \mathbb{R}^d \setminus L \mid d_L(y, X) \le \alpha \}, \quad \text{where } d_L(y, X) := \inf_{x \in X} d_L(y, x).$$

The distance function $f_L$ can be transformed into an arbitrarily close smooth function $\tilde{f}_L$ [18], yielding a Riemannian metric $\tilde{d}_L$ defined in an identical manner as $d_L$. From this, one has corresponding α-offsets $\tilde{X}^\alpha_L$ that are arbitrarily close to $X^\alpha_L$. We will encounter this smoother version in Section 3.3.
We will approximate the offsets by a union of balls as follows.
Definition 2. For any compact set $X \subseteq \mathbb{R}^d \setminus L$, for some compact set $L \subset \mathbb{R}^d$, the approximate α-offsets with respect to $\hat{d}_L$ are:
$$\hat{X}^\alpha_L := \{ y \in \mathbb{R}^d \mid \hat{d}_L(x, y) \le \alpha \text{ for some } x \in X \} = \bigcup_{x \in X} \mathrm{ball}(x, \alpha f_L(x)).$$

A useful property of $d_L(\cdot, X)$ is that it is a one-Lipschitz function. In general, a function $f$ between two metric spaces $(\mathcal{X}, d_{\mathcal{X}})$ and $(\mathcal{Y}, d_{\mathcal{Y}})$ is said to be $k$-Lipschitz if for all $x, y \in \mathcal{X}$, $d_{\mathcal{Y}}(f(x), f(y)) \le k\, d_{\mathcal{X}}(x, y)$.
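When $X$ and $L$ are finite, membership in the approximate offsets can be tested directly from the union-of-balls form above. The following sketch is our own illustrative code (the names are hypothetical): it checks whether a query point lies in some ball $\mathrm{ball}(x, \alpha f_L(x))$.

```python
# Membership test for the approximate alpha-offsets, read as a union of balls
# ball(x, alpha * f_L(x)) over a finite set X; names are illustrative.
import numpy as np
from scipy.spatial.distance import cdist

def in_approx_offset(y, X, L, alpha):
    """True if y lies in ball(x, alpha * f_L(x)) for some row x of X."""
    radii = alpha * cdist(X, L).min(axis=1)       # adaptive radius per point of X
    return bool((cdist(y[None, :], X)[0] <= radii).any())

# Example: balls far from L are large, balls near L are small.
rng = np.random.default_rng(0)
X, L = rng.random((50, 2)), np.array([[0.0, 0.0]])
print(in_approx_offset(np.array([0.5, 0.5]), X, L, alpha=0.3))
```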
Lemma 1. The function $d_L(\cdot, X)$ is one-Lipschitz from the metric space $(\mathbb{R}^d \setminus L, d_L)$ to $(\mathbb{R}, |\cdot|)$.
Proof. Fix any $x, y \in \mathbb{R}^d \setminus L$. There exists a point $x' \in X$ and a path $\gamma \in \mathrm{path}(x, x')$ such that the length of $\gamma$ in the induced metric is $d_L(x, X)$. Likewise, there exists $\gamma' \in \mathrm{path}(y, x)$ whose length in the induced metric is $d_L(y, x)$.
This implies that the concatenation of $\gamma'$ and $\gamma$ is a path in $\mathrm{path}(y, X)$. Thus, $d_L(y, X) \le d_L(y, x) + d_L(x, X)$. As this holds for all $x, y$, we conclude that $|d_L(x, X) - d_L(y, X)| \le d_L(x, y)$, as desired. □
We can use $d_L$ to define the Hausdorff distance, which is a metric between compact sets. This metric is useful for stating bounds on the quality, or uniformity, of a sample near a set.
Definition 3. The Hausdorff distance between two compact sets $A$ and $B$, with respect to an underlying metric $d$, is defined as:
$$d_H(A, B) := \max\left\{ \max_{a \in A} d(a, B),\; \max_{b \in B} d(b, A) \right\}.$$

If the Hausdorff distance between a compact set and a sample is bounded, Lemma 3 shows that their α-offsets are interleaved at particular scales.
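For finite point sets, Definition 3 reduces to a max–min computation over the pairwise distance matrix. A short sketch (illustrative, using the Euclidean metric as the underlying distance):

```python
# Hausdorff distance between finite point sets A and B under the Euclidean
# metric: the larger of the two directed (max of min) distances.
import numpy as np
from scipy.spatial.distance import cdist

def hausdorff(A, B):
    """Hausdorff distance between finite point sets A and B (rows are points)."""
    D = cdist(A, B)
    return max(D.min(axis=1).max(),   # max over a in A of dist(a, B)
               D.min(axis=0).max())   # max over b in B of dist(b, A)
```

The same routine computes the Hausdorff distance with respect to $d_L$ if the Euclidean matrix `D` is replaced by a matrix of (approximate) induced distances.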
Lemma 2. Let be such that . Then, for all , and .
Proof. Let be any point. By the definition of , we have . Therefore, there exists such that . The Hausdorff assumption that implies that for all , we have . By Lemma 1, , implying . The second inclusion is proven by a symmetric argument. □
The following is the definition of an adaptive sample we will use throughout. For the special case when X is a manifold and L is its medial axis, it corresponds to the ε-sample used in surface reconstruction.
Definition 4. Given a compact set $L \subset \mathbb{R}^d$ and compact sets $P \subseteq X \subset \mathbb{R}^d \setminus L$, we say that $P$ is an ε-sample of $X$, for $0 < \varepsilon < 1$, if for all $x \in X$, there exists $p \in P$ such that $\|x - p\| \le \varepsilon f_L(x)$.
This definition is closely related to that of the approximate α-offsets, because if $P$ is an ε-sample of $X$, then for all $\alpha \ge \frac{\varepsilon}{1-\varepsilon}$, $X \subseteq \hat{P}^{\alpha}_L$.
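For finite sets, the adaptive sampling condition can be checked directly. The sketch below assumes the condition as stated in Definition 4, namely $\|x - p\| \le \varepsilon f_L(x)$ for every $x \in X$; the function names are ours.

```python
# Check the adaptive (epsilon-)sample condition of Definition 4 for finite
# point sets, assuming the condition ||x - p|| <= eps * f_L(x) for all x in X.
import numpy as np
from scipy.spatial.distance import cdist

def is_adaptive_sample(P, X, L, eps):
    """True if every x in X has a sample point p in P with ||x-p|| <= eps*f_L(x)."""
    f = cdist(X, L).min(axis=1)          # f_L evaluated at each point of X
    nearest = cdist(X, P).min(axis=1)    # distance from each x to its nearest p
    return bool((nearest <= eps * f).all())
```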
3. Results
Lemma 3. Consider compact sets $X, P \subset \mathbb{R}^d \setminus L$ such that the Hausdorff distance between $X$ and $P$ with respect to $d_L$ is at most $\varepsilon$. Then, for all $\alpha \ge 0$, $X^\alpha_L \subseteq P^{\alpha+\varepsilon}_L$ and $P^\alpha_L \subseteq X^{\alpha+\varepsilon}_L$.
Proof. Fix $y \in X^\alpha_L$. By definition, $d_L(y, X) \le \alpha$, which implies that there exists $x \in X$ such that $d_L(y, x) \le \alpha$. The Hausdorff assumption implies that for all $x \in X$, $d_L(x, P) \le \varepsilon$. Now, by Lemma 1, $d_L(y, P) \le d_L(y, x) + d_L(x, P)$, implying $d_L(y, P) \le \alpha + \varepsilon$, so $y \in P^{\alpha+\varepsilon}_L$. By a symmetric argument, the other statement holds. □
Lemma 4 relates the length of a path with respect to two distance-to-set functions, assuming the two sets are close in Hausdorff distance with respect to the Euclidean metric.
Lemma 4. Let $L$ and $L'$ be two compact sets such that $d_H(L, L') \le \delta$ for some $\delta > 0$. For all unit-speed paths $\gamma$ along which $f_L \ge c$ for some positive $c$, we have the following inequalities.
Proof. Take an arbitrary unit-speed path $\gamma$ along which $f_L \ge c$. Since the image of the path is a subset of $\mathbb{R}^d \setminus L$, then for all $t$, $f_L(\gamma(t)) \ge c$. By the Hausdorff distance between $L$ and $L'$, we have $f_{L'}(\gamma(t)) \ge f_L(\gamma(t)) - \delta$. Likewise, we have that $f_L(\gamma(t)) \ge f_{L'}(\gamma(t)) - \delta$. Rearranging both of these, we have that $f_L$ and $f_{L'}$ differ by at most $\delta$ pointwise along $\gamma$.
By the definition of the path lengths with respect to $d_L$ and $d_{L'}$, these inequalities imply the claimed bounds. □
The following lemma bounds how close to $L$ a shortest path to a compact set $X$ can come and provides a constant $c$ satisfying Lemma 4 that depends on the compact set in question.
Lemma 5. Take compact set , compact set , and , for . If γ is the shortest path from y to X with respect to , then: Proof. Since , , so there exists such that . Take as the shortest path from y to X. For all , .
By Lemma 10, , and by being Lipschitz, we have that . This means that every point on the path is at least distance away from L. □
We define a noisy ε-sample, for $\varepsilon > 0$, of a compact set $X$ with respect to the distance to $L$, for some compact set $L$, as a compact set $P$ such that for all $x \in X$, there exists $p \in P$ with $\|x - p\| \le \varepsilon f_L(x)$. Likewise, for all $p \in P$, there exists $x \in X$ such that $\|x - p\| \le \varepsilon f_L(x)$. The following lemmas relate a noisy ε-sample to the Hausdorff distance between the sample and the set X and vice versa.
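For finite sets, this two-sided condition can be checked in the same way as Definition 4. The sketch below assumes both directions are measured against $\varepsilon f_L$ evaluated at the point of $X$, which is one plausible reading of the definition above; adjust the normalization if the condition is read differently.

```python
# Check a two-sided (noisy) adaptive sample condition for finite sets, assuming
# both directions use the threshold eps * f_L(x), with x the point of X.
import numpy as np
from scipy.spatial.distance import cdist

def is_noisy_sample(P, X, L, eps):
    """True if every x in X is near some p in P and every p in P is near some
    x in X, where 'near' means ||x - p|| <= eps * f_L(x)."""
    f = cdist(X, L).min(axis=1)                              # f_L at points of X
    D = cdist(X, P)                                          # |X| x |P| distances
    x_covered = (D.min(axis=1) <= eps * f).all()             # every x has a close p
    p_covered = (D <= eps * f[:, None]).any(axis=0).all()    # every p is close to some x
    return bool(x_covered and p_covered)
```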
Lemma 6. Consider compact set L and compact . If is a noisy ε-sample of X with respect to , for , then .
Proof. Given , by definition, there exists such that . By Lemma 10, , so for all, , .
Furthermore, given , there exists such that , so for all , ; thus, . □
Lemma 7. Consider compact set L and sets . If , then is a noisy -sample of X with respect to .
Proof. implies that for all , . Thus, there exists such that . By Lemma 10, .
Similarly, implies that for all , ; thus, there exists such that , and thus, . Since , then , so is a noisy -sample of X. □
Lemma 8. Given compact set and compact set , for , .
Proof. Take so that . Thus, there exists such that . By Lemma 10, this implies that , which implies that . □
Lemma 9. Given compact set and compact set , for , .
Proof. Consider . Thus, , for some , so . Applying Lemma 10, we then have that , and as , . □
3.1. Adaptive Sampling
In this section, we prove that a uniform sample in the induced metric corresponds to an adaptive sample in the Euclidean metric and vice versa. The key to this proof is the following lemma, which will also be used for the more elaborate interleaving results of Section 3.2.
Lemma 10. Let $L \subset \mathbb{R}^d$ be a compact set, and let $a, b \in \mathbb{R}^d \setminus L$. Then, the following two statements hold for all $\varepsilon \in (0, 1)$.
- (i) If $d_L(a, b) \le \varepsilon$, then $\|a - b\|$ is correspondingly bounded in terms of $\varepsilon$ and $f_L(a)$.
- (ii) If $\|a - b\| \le \varepsilon f_L(a)$, then $d_L(a, b)$ is correspondingly bounded in terms of $\varepsilon$.
Proof. To prove (i), we assume $d_L(a, b) \le \varepsilon$. Let $\gamma$ be the path in $\mathrm{path}(a, b)$ realizing $d_L(a, b)$. Then, the Lipschitz property of $f_L$ yields inequalities bounding $f_L$ along $\gamma$ in terms of $f_L(a)$. It follows that the Euclidean length of $\gamma$ is bounded accordingly. Because $\|a - b\|$ is the length of the shortest path between $a$ and $b$ in the Euclidean metric, we conclude that $\|a - b\|$ satisfies the claimed bound.
Next, we prove (ii). Assume $\|a - b\| \le \varepsilon f_L(a)$. For all points $z$ in the straight line segment from $a$ to $b$, $f_L(z) \ge f_L(a) - \|a - z\| \ge (1 - \varepsilon) f_L(a)$. This implies the desired bound on $d_L(a, b)$. □
We can now state the main theorem relating adaptive samples in the Euclidean metric to uniform samples in the metric induced by a set L.
Theorem 1. Let $L$ and $X$ be compact sets; let $P \subseteq X$ be a sample; and let $\varepsilon \in (0, 1)$ be a constant. If $P$ is an ε-sample of $X$ with respect to the distance to $L$, then the Hausdorff distance between $X$ and $P$ with respect to $d_L$ is bounded in terms of $\varepsilon$. Furthermore, if this Hausdorff distance is at most $\varepsilon$, then $P$ is an $\varepsilon'$-sample of $X$ with respect to the distance to $L$, for an $\varepsilon'$ depending only on $\varepsilon$.
Proof. Given , there exists such that . By Lemma 10, , so for all , . As , this proves .
Furthermore, implies that for all , ; thus, there exists such that . Thus, by Lemma 10 . Since , then , so is an -sample of X. □
3.2. Interleaving
A filtration is a nested family of sets. In this paper, we consider filtrations $F$ parameterized by a real number $\alpha$ so that $F_\alpha \subseteq \mathbb{R}^d$, and whenever $\alpha \le \beta$, we have $F_\alpha \subseteq F_\beta$. Often, our filtrations are sublevel filtrations of a real-valued function $f : \mathbb{R}^d \to \mathbb{R}$. The sublevel filtration $F$ corresponding to the function $f$ is defined as:
$$F_\alpha := f^{-1}\big((-\infty, \alpha]\big) = \{ x \in \mathbb{R}^d \mid f(x) \le \alpha \}.$$
Definition 5. A pair of filtrations $(F, G)$ is $(h_1, h_2)$-interleaved in an interval $I$ if $F_\alpha \subseteq G_{h_1(\alpha)}$ whenever $\alpha \in I$ and $G_\alpha \subseteq F_{h_2(\alpha)}$ whenever $\alpha \in I$. We require the functions $h_1$ and $h_2$ to be nondecreasing in $\alpha$.
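When two filtrations are the sublevel filtrations of functions sampled on a common finite domain, the interleaving condition of Definition 5 can be checked pointwise: $F_\alpha \subseteq G_{h_1(\alpha)}$ for every $\alpha$ is equivalent, for nondecreasing $h_1$, to $g(p) \le h_1(f(p))$ at every sample point $p$, and symmetrically for $h_2$. The sketch below ignores the restriction to an interval $I$ and uses illustrative names.

```python
# Pointwise check that the sublevel filtrations of two functions, sampled on a
# common finite domain, are (h1, h2)-interleaved (interval restriction ignored).
import numpy as np

def is_interleaved(f_vals, g_vals, h1, h2):
    """Check g <= h1(f) and f <= h2(g) pointwise, for nondecreasing h1, h2."""
    f_vals, g_vals = np.asarray(f_vals, float), np.asarray(g_vals, float)
    forward = (g_vals <= np.vectorize(h1)(f_vals)).all()   # F_a contained in G_{h1(a)}
    backward = (f_vals <= np.vectorize(h2)(g_vals)).all()  # G_a contained in F_{h2(a)}
    return bool(forward and backward)

# Example: additive interleaving of two distance-like functions.
f = np.array([0.1, 0.4, 0.9])
g = np.array([0.2, 0.5, 0.8])
print(is_interleaved(f, g, h1=lambda a: a + 0.1, h2=lambda a: a + 0.1))
```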
The following lemma gives us an easy way to combine interleavings.
Lemma 11. If $(F, G)$ is $(h_1, h_2)$-interleaved in $I$ and $(G, H)$ is $(h_1', h_2')$-interleaved in $I$, then $(F, H)$ is $(h_1'', h_2'')$-interleaved in $I$, where $h_1'' = h_1' \circ h_1$ and $h_2'' = h_2 \circ h_2'$.
Proof. If $\alpha \in I$, then we have $F_\alpha \subseteq G_{h_1(\alpha)} \subseteq H_{h_1'(h_1(\alpha))}$. Similarly, if $\alpha \in I$, then $H_\alpha \subseteq G_{h_2'(\alpha)} \subseteq F_{h_2(h_2'(\alpha))}$. □
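As a quick illustration of the composition order in Lemma 11 (with hypothetical bound functions), the combined maps are obtained by chaining the two interleavings:

```python
# Compose interleaving bound functions as in Lemma 11: the forward map of the
# combined interleaving is h1' after h1, and the backward map is h2 after h2'.
def compose_interleavings(h1, h2, h1p, h2p):
    """Combine an (h1, h2)-interleaving of (F, G) with an (h1', h2')-interleaving
    of (G, H) into the bound functions of an interleaving of (F, H)."""
    return (lambda a: h1p(h1(a))), (lambda a: h2(h2p(a)))

# Example: composing an additive shift with a multiplicative one.
h1pp, h2pp = compose_interleavings(lambda a: a + 0.1, lambda a: a + 0.1,
                                   lambda a: 2 * a, lambda a: 2 * a)
print(h1pp(1.0), h2pp(1.0))   # 2.2 and 2.1: the order of composition matters
```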
3.2.1. Approximating X with a Sample
Ultimately, the goal is to relate $X^\alpha_L$, the offsets in the induced metric, to $\hat{P}^\alpha_{L'}$, the approximate offsets computed from approximations (or samples) to both $X$ and $L$. This relationship will be given by an interleaving that is built up from an interleaving for each approximation step. For each of the following lemmas, let $X$, $P$, $L$, and $L'$ be compact sets, with $P$ a sample of $X$ and $L'$ an approximation of $L$.
Lemma 12. If the Hausdorff distance between $X$ and $P$ with respect to $d_L$ is at most $\varepsilon$, then $(X^\alpha_L)$ and $(P^\alpha_L)$ are $(h, h)$-interleaved in $(0, \infty)$, where $h(\alpha) = \alpha + \varepsilon$.
Proof. This lemma is a reinterpretation of Lemma 3 in the interleaving notation. □
3.2.2. Approximating the Induced Metric
It is much easier to use a union of Euclidean balls to model the sublevel sets of the induced distance function. Below, we show that this is a reasonable approximation. The following results may also be viewed as a strengthening of the adaptive sampling result of the previous section (Theorem 1).
Lemma 13. The pair is -interleaved in , where .
Proof. It will suffice to show that for , , and for , .
Take so that . Thus, there exists such that . By Lemma 10, this implies that , which implies that .
Consider any point . For some , we have , so . Applying Lemma 10, we have that . Finally, , because . □
3.2.3. Approximating L with L′
Usually, the set $L$ is unknown at the start and must be estimated from the input. For example, if $L$ is the medial axis of $X$, there are several known techniques for approximating $L$ by taking some vertices of the Voronoi diagram [5,6]. We would like to give some sampling conditions that allow us to replace $L$ with an approximation $L'$. Interestingly, the sampling conditions for $L'$ are dual to those used for $P$: we require that $L'$ be close to $L$ in the metric induced by $X$. In other words, $L'$ must be an adaptive sample with respect to the distance to $X$.
Lemma 14. If , then is -interleaved in , where .
Proof. Fix any $x \in \hat{P}^\alpha_L$. There is a point $p \in P$ such that $x \in \mathrm{ball}(p, \alpha f_L(p))$. Moreover, there is a nearest point $\ell \in L$ to $x$, so that $\|x - \ell\| = f_L(x)$. Lemma 10 and the assumption on $L'$ together imply that there exists a point of $L'$ close to $\ell$. It then follows from the definitions that $f_{L'}$ and $f_L$ are comparable at the relevant points. Therefore, we can bound the radius of the ball around $p$ with respect to $L'$ in terms of its radius with respect to $L$, and so, we conclude that $x$ lies in the corresponding approximate offset with respect to $L'$. The proof is symmetric to show that the other containment holds. □
3.2.4. Putting It All Together
We can now use Lemma 11 to combine the interleavings established in Lemmas 12–14.
Theorem 2. Let $X$, $P$, $L$, and $L'$ be compact sets. If $P$ and $L'$ satisfy the sampling conditions of Lemmas 12–14 with respect to $X$ and $L$, then $(X^\alpha_L)$ and $(\hat{P}^\alpha_{L'})$ are $(h_1, h_2)$-interleaved in the corresponding interval, where $h_1$ and $h_2$ are the compositions of the bound functions from those lemmas.
Proof. We use Lemma 11 to combine the interleavings from Lemmas 12–14 to conclude that the pair $(X^\alpha_L, \hat{P}^\alpha_{L'})$ is interleaved with the composed bound functions. To complete the proof, we expand these compositions to obtain the stated forms of $h_1$ and $h_2$. □
3.3. Smooth Adaptive Distance and Homology Inference
In the preceding sections, we showed how to approximate (via interleaving) $X^\alpha_L$, the sublevel sets of the distance to $X$ in the induced metric, using a finite set of Euclidean balls, $\hat{P}^\alpha_{L'}$. Now, we show how and when such an approximation gives a guarantee about the underlying space $X$ itself. This is substantially more difficult, because it requires us to relate the sublevel sets of the induced metric to an object we do not have direct access to. As such, we will require some stronger hypotheses.
We will first review the critical point theory of distance functions. Next, we show how to smooth the induced metric to an arbitrarily close Riemannian metric, rendering the critical point theory applicable. Finally, we put these together to prove the main inference result of the paper, Theorem 3.
3.3.1. Critical Points of Distance Functions
In this section, we give a minimal presentation of the critical point theory of distance functions to explain and motivate the results about interleaving offsets of distance functions in Riemannian manifolds. The main fact we use is that such interleavings lead immediately to results about homology inference (Lemma 16).
For a smooth Riemannian manifold $M$ and a compact subset $X \subset M$, one can consider the function $d_X : M \to \mathbb{R}$ that maps each point in $M$ to the distance to its nearest point in $X$, as measured by the metric on the manifold. The gradient of $d_X$ can be defined on $M$, and the critical points are those points for which the gradient is zero. The critical values of $d_X$ are those values of $r$ such that the level set $d_X^{-1}(r)$ contains a critical point. The critical point theory of distance functions developed by Grove and others [11] extends the ideas from Morse theory to such distance functions. In particular, the theory gives the following result.
Lemma 15 (Grove [11]). If the interval $[r_1, r_2]$ contains no critical values of $d_X$, then the inclusion $d_X^{-1}([0, r_1]) \hookrightarrow d_X^{-1}([0, r_2])$ is a homotopy equivalence.
This means that for intervals that do not contain critical values, the inclusion maps in the filtration are all homotopy equivalences and therefore induce isomorphisms in homology. This is used to give some information about the homology of filtrations that are interleaved with $F$.
We write $\mathrm{H}_*(\cdot)$ to denote homology over a field. Therefore, for a set $A$, we have a vector space $\mathrm{H}_*(A)$, and for a continuous map $f : A \to B$, we have a linear map $\mathrm{H}_*(f) : \mathrm{H}_*(A) \to \mathrm{H}_*(B)$. For the canonical inclusion map of a subset $A \subseteq B$, we will denote the corresponding linear map in homology as $\mathrm{H}_*(A \hookrightarrow B)$. The image of this map is denoted $\mathrm{im}\,\mathrm{H}_*(A \hookrightarrow B)$.
Lemma 16. Let be the distance function to a compact set in a Riemannian manifold such that contains no critical values of . Let F be the sublevel filtration of , and let G be a filtration such that are -interleaved in . If , then: Proof. The interleaving and the hypotheses imply that we have the following inclusions.
The preceding lemma implies that the maps , , and all induce isomorphisms in homology. It follows that , because the inclusion of spaces in G is factored through a space in F, and it factors an inclusion of spaces, all of which are isomorphic in homology. □
3.3.2. Smoothing the Metric
To apply the critical point theory of distance functions to the induced metric directly, we would need it to be a smooth Riemannian manifold. Although it is not smooth, we can smooth it with an arbitrarily small change. The process, though a little technical, is not surprising, nor very difficult. It proceeds in three steps.
We smooth the distance to $L$. This is the source of non-smoothness in the induced metric. This replaces $f_L$ with a smooth approximation, $\tilde{f}_L$.
The smoothed distance to $L$ is used to define the smoothed induced metric $\tilde{d}_L$ analogously to the original construction of $d_L$.
The induced distance function $d_L(\cdot, X)$ can then be replaced by its smoothed version $\tilde{d}_L(\cdot, X)$, and the corresponding smoothed offsets are then well defined.
The complete construction of the smoothed offsets is presented in Appendix A. The end result is an interleaving between the induced offsets and the smoothed version, as expressed in the following lemma.
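As a concrete (if crude) illustration of the first smoothing step, one standard device is to replace the minimum over $L$ by a soft-min: for a finite $L$, the log-sum-exp of the individual distances is smooth away from $L$ and converges to $f_L$ as the sharpness parameter grows. This is only a sketch of the idea; it is not the specific construction of Appendix A, and the parameter `beta` below is an assumption of this example.

```python
# Soft-min (log-sum-exp) smoothing of the distance to a finite set L: one
# standard way to obtain an arbitrarily close smooth surrogate for f_L.
# Illustrative only; not the construction used in Appendix A.
import numpy as np
from scipy.spatial.distance import cdist
from scipy.special import logsumexp

def smooth_f_L(points, L, beta=200.0):
    """Smooth approximation of f_L; the error is at most log(|L|) / beta."""
    D = cdist(points, L)                          # Euclidean distances to L
    return -logsumexp(-beta * D, axis=1) / beta   # soft-min over the columns
```

A smoothed induced metric can then be built from `smooth_f_L` exactly as $d_L$ was built from $f_L$, for example by reweighting the graph edges in the earlier `induced_metric` sketch.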
Lemma 17. Given , consider compact sets and compact sets , such that and , then are -interleaved on , where and .
3.3.3. The Weak Feature Size
Chazal and Lieutier [19] introduced the weak feature size (wfs) as the least positive critical value of a Riemannian distance function. We denote the weak feature size of a compact set $S$ with respect to a metric $d$ as $\mathrm{wfs}^d(S)$. In light of the critical point theory of distance functions, a bound on the weak feature size gives a guaranteed interval with no critical points. This allows one to infer the homology from another filtration (usually one that is discrete and built from data) as long as the second filtration is interleaved in that critical point free interval.
Lemma 18 (Adapted from [19], Theorem 4.2; see also [20]). Let $S$ and $S'$ be compact subsets of $\mathbb{R}^d$. If $d_H(S, S') \le \varepsilon$ and $\varepsilon$ is sufficiently small compared to $\mathrm{wfs}(S)$, then for all sufficiently small $\alpha > 0$, the homology of the α-offsets of $S$ is recovered as the image of the map in homology induced by including offsets of $S'$ at appropriate scales.
The key idea in that proof is that the Hausdorff bound gives an interleaving, while the weak feature size bound gives the interval without critical points. The technical condition regarding sufficiently small $\alpha$ is present to account for strange compact sets that may be homologically different from their arbitrarily small offsets. It is reasonable to assume that, for some sufficiently small $\alpha$, the α-offset of $S$ has the same homology as $S$ itself, and thus, one could "compute" the homology of $S$ using only the sample $S'$.
Most previous uses of the weak feature size have been applied in Euclidean spaces, but the critical point theory of distance functions can be applied more broadly to other smooth Riemannian manifolds. This is why we introduced it as $\mathrm{wfs}^d$ (with the superscript) to indicate the underlying metric.
3.3.4. Homology Inference
We have now introduced all the necessary pieces to prove our main homology inference result.
Theorem 3. Given $\varepsilon > 0$, consider compact sets $X$ and $P$ and compact sets $L$ and $L'$ satisfying the sampling conditions of the preceding sections, and define the corresponding real-valued bound functions $h_1$ and $h_2$. Given any $\alpha \le \beta$ such that the relevant interval contains no critical values of the smoothed distance function, the homology of the smoothed offsets of $X$ is isomorphic to the image of the map in homology induced by the inclusion of the approximate offsets of $P$ at scale $\alpha$ into those at scale $\beta$.
Proof. Given $\alpha \le \beta$ as in the statement, we have the following sequence of inclusions as a result of Lemma 17.
As we assume that the interval contains no critical values, by the definition of the weak feature size, Lemma 16 implies that two of the inclusions in this sequence are homotopy equivalences. We remind the reader that if two spaces are homotopy equivalent, all the induced homology maps between the spaces are isomorphisms. By applying homology to each space and inclusion in the previous sequence, we obtain a sequence of homology groups in which the two corresponding maps are isomorphisms.
The aforementioned isomorphisms factor through the maps induced by the inclusions of the approximate offsets, proving that one of those maps is surjective and the other is injective. We then have the claimed isomorphism. □
3.3.5. Computing the Homology
The last step is to relate the smoothed offsets to something that can be computed. It will generally be the case that the approximation $P$ of $X$ is not just compact, but also finite. Then, for any scale $\alpha$, the approximate offset $\hat{P}^\alpha_{L'}$ is the union of a finite set of Euclidean balls.
The nerve theorem provides a natural way to compute the homology of a union of Euclidean balls. The nerve of a collection $U$ of sets is the set of all subsets of $U$ that have a nonempty common intersection. It has the structure of a simplicial complex, whose homology can be directly computed by standard matrix reduction algorithms. When all nonempty intersections are contractible, the cover is said to be good. A cover by Euclidean balls (or any convex shape) is always good. For good covers, the nerve theorem, a standard result in algebraic topology [21], implies that the union of the sets in $U$ is homotopy equivalent to the nerve, and in particular has the same homology.
This is the most basic way to compute the homology of a union of balls and is used throughout topological data analysis.
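For a finite sample, the nerve of the adaptive union of balls $\{\mathrm{ball}(p, \alpha f_{L'}(p))\}$ can be enumerated directly. The sketch below is illustrative (the names and the tolerance are ours): it tests whether a subset of balls has a common point by numerically minimizing $\max_i(\|y - c_i\| - r_i)$, a convex feasibility problem, and collects the simplices up to a chosen dimension. Off-the-shelf topological data analysis software can then compute the homology of the resulting complex and of inclusions between nerves at two scales.

```python
# Enumerate the nerve of a finite union of adaptive balls ball(p, alpha*f_L(p)).
# A subset of balls spans a simplex when the balls share a common point, tested
# here by a numerical convex feasibility check.  Illustrative only.
from itertools import combinations
import numpy as np
from scipy.optimize import minimize
from scipy.spatial.distance import cdist

def balls_intersect(centers, radii):
    """Approximate test for a common point of the given balls."""
    obj = lambda y: np.max(np.linalg.norm(centers - y, axis=1) - radii)
    res = minimize(obj, centers.mean(axis=0), method="Nelder-Mead")
    return res.fun <= 1e-9

def nerve(P, L, alpha, max_dim=2):
    """Simplices (index tuples) of the nerve of {ball(p, alpha*f_L(p)) : p in P},
    where P and L are numpy arrays whose rows are points."""
    radii = alpha * cdist(P, L).min(axis=1)          # adaptive radius per sample
    simplices = [(i,) for i in range(len(P))]
    for k in range(2, max_dim + 2):                  # edges, triangles, ...
        for idx in combinations(range(len(P)), k):
            sel = list(idx)
            if balls_intersect(P[sel], radii[sel]):
                simplices.append(idx)
    return simplices
```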
In our case, we are not just computing the homology of the union, but also the homology of the inclusion map. This computation will require a slightly stronger result. The persistent nerve lemma [20], applied to Diagram (4) when combined with the above isomorphisms, yields the following.
This last statement turns the isomorphism into an algorithm, because standard algorithms [22] can compute the homology of the inclusion of the nerves.
4. Conclusions
We present an alternative metric in Euclidean space that connects adaptive sampling and uniform sampling. We show how to apply classical results from the critical point theory of distance functions to infer topological properties of the underlying space from such samples. This provides a connection between methods in surface reconstruction (based on adaptive sampling) and homology inference (based on uniform sampling).
We show in Theorem 1 that there is a precise relationship between samples that are uniform with respect to $d_L$ at some scale and those same samples being adaptive in the Euclidean metric. In Theorem 2, we show that we can interleave the sublevel sets of our distance function under this alternative metric with the metric balls resulting from our approximation of the metric, assuming that both $X$ and $L$ are uniformly well sampled with respect to the corresponding Hausdorff distances. Finally, we show how to fully extend the critical point theory of distance functions and the weak feature size to give theoretical guarantees on homology inference from finite samples of $X$ and $L$ using the induced metric (Theorem 3).
The main limitation of adaptive metrics is that they require two sets as input, one to define the set and one to define the metric. In many instances, this is not available. However, we expect that the approach could find wider use in problems with labeled data. For example, data with binary labels may be viewed as the two sets X and L. Then, each set defines a metric on the other, where the metric is scaled according to how close it is to the other set. This is the subject of ongoing and future work.