1. Introduction
The continued fraction expansions of real numbers have long been known as an interesting and fruitful venue for applications of extreme value theory (EVT). They are interesting because continued fractions digits are almost surely unbounded and have infinite expectation which raises natural questions about the behaviour of their largest digits. They are fruitful because continued fraction digits are closely connected to a dynamical system with nice mixing properties and an explicit invariant measure.
We briefly recall some facts about regular continued fractions necessary for further discussion. Every
may be written as a fraction of the form:
where the (possibly finite) sequence
consists of natural numbers which we refer to as the regular continued fraction (RCF) digits of
x. (Often the terminology
partial quotients of
x is used for the numbers
; however, we prefer
digits. Furthermore, in many instances, regular continued fractions are simply referred to as continued fractions. However, since we will discuss multiple different continued fraction algorithms in this paper, we will remain specific throughout.) For simplicity we will leave out the dependence on
x and write
unless the context requires specification.
is the integer part of
x, hence, for
, we have
, and only the fraction in (
1) remains. Furthermore, the sequence
is infinite if and only if
x is irrational. Since only the infinite case is of interest for extreme value behaviour and since the rational numbers make up a null set anyway, we will focus on numbers in
.
The RCF digits of a number
may also be computed using the Gauss map
which is defined by
Here,
and
denote the fractional and integer part of
x, respectively. The
s are then recursively given by
The Gauss map is invariant under the Gauss measure on
X which is given by
where
is Lebesgue measurable. Let
denote the
-algebra of Lebesgue measurable subsets of
X. As a triple,
constitutes dynamical systems with many nice and interesting properties. The first of these is the explicit form of
. For many dynamical systems, it is possible to prove the existence of an invariant measure, however, there are few systems for which this measure is explicitly known. Interestingly, the measure was first stated by Gauss in 1800 [
1], however, he gave no explanation for how he discovered it and their rationale remains a mystery to this day. Equipped with the Gauss measure, we may think of
X as a probability space and the
as a sequence of random variables. The second property of
which we highlight is a quantitative mixing property with respect to sets belonging to certain sub-
-algebras of
. We define the property in a general setting.
Definition 1 (
-mixing)
. Let denote a probability space and let denote a stationary sequence of random variables. For natural numbers , let denote the smallest σ-algebra for which are measurable. Then, is said to be ψ-mixing if for any sets and we have:where is a function for which as .
In the case of RCF digits, the random variables
form a stationary sequence due to the invariance of the Gauss measure with respect to
T. The
s are known to be
-mixing with respect to the Gauss measure and the function
is known to vanish at an exponential rate. This follows from independent work of Kuzmin and Levy in the late 1920s, see for example [
2] for the proof and a discussion of its history.
1.1. Extreme Value Theory for RCF Digits
Extreme value theory for RCF digits first emerged in the 1970s, specifically through Galambos, in 1972. He used the Gauss measure and -mixing to prove the following extreme value law.
Theorem 1 ([
3])
. Let . Then, for we have: Readers familiar with extreme value theory will recognize this as a Frechet distribution with an extremal index equal to 1. Galambos generalized this result in [
4], showing that
may be replaced with any measure that is absolutely continuous with respect to Lebesgue. In 1977, Iosifescu gave the following more general result, proving a Poisson law for general stationary,
-mixing sequences.
Theorem 2 ([
5])
. Let denote a stationary and ψ-mixing sequence of random variables. For and , setAssume that there exists a sequence of functions such that: for some real-valued function τ. Then, for all and any : It was straightforward for Iosifescu to apply this theorem to RCF digits ((Theorem 2) [
5]) since
-mixing for RCF digits was known and property (
3) easily follows from the form of the Gauss measure. Similar to Galambos, he gave an argument for why the Poisson law holds for any measure which is absolutely continuous with respect to Lebesgue. Note that one obtains Galambos’ theorem by setting
in Iosifescu’s result for RCF digits. It is interesting to note that Iosifescu’s result for RCF digits appeared as early as 1940 in a paper by Doeblin [
6]. However, as Iosifescu explains in [
5], Doeblin’s proof contains a mistake. Galambos was unaware of Doeblin’s result when proving their extreme value law, and Iosifescu obtained their proof by applying ideas from a Galambos paper to fix Doeblin’s mistake.
The era offered more results concerning the maximum of RCF digits. Galambos proved an iterated logarithm type theorem for the maximal RCF digits [
7], which was improved by Philipp [
8] to give a complete answer to a conjecture of Erdos. Furthermore, in [
8], an upper bound on the rate of convergence in (
2) was provided. Diamond and Vaaler [
9] showed that the partial maximum was responsible for the failure of the law of large numbers for RCF digits.
The topic of extreme value statistics for RCF digits has gained interest again in recent years. Philipp’s rate of convergence was improved by Ghosh, Kirsebom and Roy [
10], while a refinement of Iosifescu’s theorem was presented by Zweimüller in [
11]. On a slightly different note, Chang and Chen [
12] investigated the Hausdorff dimension of certain sets defined via the largest RCF digits.
1.2. Extreme Value Theory for Other CF Algorithms
The regular continued fractions considered to date are by far not the only continued fractions in existence. For RCF digits, we have a good understanding of the statistical behaviour of largest digits. For other CF algorithms, however, results of this nature are scarce and many interesting questions remain open. To our knowledge, the only works on extreme value theory for other algorithms are the papers by Chang and Ma [
13] which consider the case of Oppenheim CF, by Shen, Xu and Jing [
14] which studies the case of continued fraction defined over the field of formal Laurent series and by Nakada and Natsui [
15] which investigate fibred systems. In a different but related direction, González Robert [
16] recently proved a Borel–Bernstein Theorem for Hurwitz complex continued fractions. The methods used in [
13] are somewhat different to the ones described for RCFs, since the invariant measure associated with Oppenheim CF is infinite. Instead, the metric theory is developed for the Lebesgue measure with respect to which the associated dynamical system is not
-mixing. Furthermore, [
14] is somewhat different in that an iterated logarithm type result is proven as opposed to a distributional result.
On the contrary, [
15] took the same approach as described above and applied it to fibred systems. Many CF algorithms, including several complex CF algorithms, satisfy the conditions for being fibred systems, see [
17] for some examples.
Under certain assumptions, fibred systems were proven by Waterman [
18] to admit an invariant measure absolutely continuous with respect to Lebesgue and under further assumptions Schweiger [
17,
19] showed that the fibred system was
-mixing with respect to this invariant measure. Nakada and Natsui use this fact to formulate general sufficient conditions on the invariant measure in order to obtain analogues of Theorem 1 as well as the results in [
8,
9].
They further proved that these various assumptions are satisfied for the Jacobi–Perron multidimensional CF algorithm, but for many other CF algorithms, this is not known.
The main aim of this paper was to develop extreme value theory for complex continued fractions, more specifically the variant introduced by Hurwitz [
20]. In the process, we “pick up” analogue results for the nearest integer continued fractions which we present first before continuing to the complex continued fractions and our main results.
1.3. Extreme Value Theory for Nearest Integer Continued Fractions
This subsection serves two purposes. First, it allows us to state some new results concerning extreme value theory for nearest integer continued fractions (NICF). These results are new in the sense that they appear not to have been stated elsewhere before. However, aside from a small calculation, the proofs simply combine results proven elsewhere, hence in that sense, the novelty is limited. Second, the complex continued fractions which are the main focus of this paper are a generalization of NICF for real numbers. Hence, this subsection serves as a stepping stone between the RCFs and the Hurwitz complex continued fractions.
NICFs work similar to RCFs, the main difference being that we round to the nearest integer. We consider the fundamental domain
and for
, we take its inverse
and subtract the integer nearest to it. This brings us back to
X where we repeat the process. This enables us to write
x as
where
denotes the nearest integer function. Similar to the RCF, the NICF digits may be generated through a dynamical system. However, the layout of the digits will vary slightly from (
5), the benefit being that our transformation obtains a simpler expression. We define the transformation
by
where
denotes the sign of
x. Using this definition, the modulus of the nearest integers and their signs are recorded in separate digits. The NICF expansion of
then becomes:
where:
and:
In this notation, the
’s record whether the nearest integer digits from (
5) change sign. Indeed, the NICF digits in (
5) may be written as
The map
admits an invariant measure
on
, whose density is given by
where
is the golden ratio. It is also known to be
-mixing with respect to this measure. Indeed, the function
is known to decay faster than
where
. Both of these results were proven by Rieger ([
21,
22]).
Consider now
as a sequence or random variables. By
-invariance,
forms a stationary sequence. Define:
Now, the only property missing in order to apply Theorem 2 to
is the property (
3). However, this follows from the ensuing calculation. We shorten the notation by writing
instead of
. Furthermore, we use the notation ∼ to indicate that the related quantities have the same limit. We obtain:
In the above, the fact that we did not change the limit by ignoring the floor function and the added
requires a small calculation but is intuitively clear. Furthermore, we made use of the power series expansion of
. From this, we see that:
which verifies (
3) for
. Hence, we proved the Poisson law of exceedances for the NICF digits.
Theorem 3. Let . For all and any : This leads to the immediate corollaries.
Corollary 1. Let and let denote the k’th largest element among .
The last result is a direct analogue of Galambos’ theorem for RCF, the difference only appearing in the constant used to normalize the maximum.
Remark 1. When comparing the RCF with the NICF of certain real numbers, it appears plausible that the behaviour of their largest digits should be similar. Taking π as an example, we have the two different expansions (see https://oeis.org/A001203 (accessed on 24 June 2021) and https://oeis.org/A133593 (accessed on 24 June 2021) for more digits of either expansion.):
One observes a certain similarity in the larger digits, say 292 and 294 as well as 84 and 85. A big difference is the lack of 1s in the NICF expansion leading to shorter intervals between the large digits. Heuristically, this indicates that the partial maximum for NICF digits should grow faster than the partial maximum for RCF digits. This is indeed reflected in the extreme value laws for the two expansions. In the NICF case, the normalizing sequence must be multiplied with the larger constant in order to obtain the same distribution as for the RCF.
We also obtain a rate of convergence for the limits in Theorem 3 and Corollary 1. Let
be a sequence which satisfies:
where again,
. Note that, in particular:
.
Theorem 4. For all and all , the rate of convergence in (12)–(14) is bounded by Proof. The proof follows exactly the proof of Theorem 1.1 of [
10]. Only in a few places, small adjustments have to be made due to the different density of the invariant measure. The adjustments are done using (
7) and the derivations leading up to it, in particular, the power series expansion of
. Due to the simplicity of the adaptations, we leave the details to the reader. □
2. Complex Continued Fractions and Main Results
While there are many CF algorithms for real numbers, the RCF algorithm has a strong sense of being the most “natural”. This is due to its simplicity, strong properties and its many connections to other fields like analytic number theory, dynamical systems and hyperbolic geometry. In the complex realm, no CF algorithm reigns supreme in the same way. Part of the explanation is that the RCF algorithm for real numbers does not generalize in a meaningful way to the complex numbers (the argument that follows was kindly presented to me by Gerardo González Robert). A naive approach to generalizing the RCF would see us using complex inversion on the square:
and applying the floor function in both dimensions. More precisely, define the transformation
by
where
denotes the complex floor function given by
. Set
and
. Using these definitions, uncountably many different elements of
S would be assigned the same sequence of digits
. A specific example is the region bounded by the circles
and
as well as the line segment connecting 1 and
. This region becomes mapped to itself under
T and in the process, every element
z in this region is assigned
for all
. Clearly this does not lead to a useful continued fractions representation of numbers in
S.
Many alternative approaches to complex continued fractions exist, see [
23] for an overview of some of them and their properties and references. The approach which we study in this paper is a generalization of the NICF algorithm for real numbers.
Hurwitz Complex Continued Fractions
Denote by , the Gaussian integers and denote by the Gaussian integer nearest to z. We apply the convention that ties are broken by approximating them down into both real and complex parts, for example if for . This ensures that is well defined, however, the choice of rounding up will play no role in our results since the convention only relates to a set of measure zero.
Let
. Analogous to the Gauss map, we define the Hurwitz map
by
For a given
, consider the sequence
given by
and:
for
. By setting
and
for
, we obtain the Hurwitz complex continued fractions (HCCF) expansion of
z written as
It is well known that this expansion converges and provides a meaningful representation of complex numbers, see [
24] for an introduction to the HCCF expansion. Let
denote the
-algebra of Borel subsets of
B and let
be the Lebesgue measure on
B. It is known that there exists a unique measure on
B which is
T-invariant and absolutely continuous with respect to Lebesgue (see [
24], Section 5.7). We denote this measure by
. Thus, we have a dynamical system
where
is a
-preserving map. Consider the sequence of real-valued random variables
. As previously indicated, we are particularly interested in the occurrence of large values of
.
We are finally ready to state the main results of this article. The first is a Poisson law for HCCF, analogue to the Iosifescus result applied to RCFs.
Theorem 5. For and , let: There exist such that for all and any : This leads to the immediate corollaries:
Corollary 2. Let and let denote the k’th largest element among . There exists such that for all : Note that (
14) provides an analogue of Galambos’s extreme value law for HCCF’s. The corollary is simply obtained by setting
in Theorem 5.
3. The Invariant Measure with Respect to the Hurwitz Map
The strategy of proof of Theorem 5 is clear. If we can satisfy the conditions of Theorem 2, the result follows. Fortunately, the
-mixing property of
T is known and it is even known to be mixing at an exponential rate, i.e.,
for some
. This exponential
-mixing result was first stated by Nakada in ([
25] (Corollary 2)), however, the proof relied on work by Schweiger [
26] which turned out to contain a serious gap. As explained in [
27], Schweiger was later able to recover the result with a sub-exponential rate in [
28], and finally the exponential rate in [
29], meaning that Nakada’s result is indeed correct. Note also that the sequence
forms a stationary sequence, as this is an easy consequence of
being
T-invariant.
The greater challenge is to satisfy (
3) in Theorem 2. For this, we need more specific information about the unique
T-invariant measure absolutely continuous with respect to Lebesgue. We first cite the following theorem by Hensley which is central to our proof.
Lemma 1 ([
24], Theorem 5.5).
The density function ρ of the measure μ is continuous except possibly along the intersections of the circles and with B. It is a real analytic on each of the 12 open regions into which the interior of B is dissected by these circles. Moreover, the measure μ is symmetric under complex conjugation and multiplication by i. The 8 arcs and 12 regions in
B are denoted by
and
, respectively. See
Figure 1.
An important further fact about the density
is that the limits:
exist and are strictly positive, i.e.,
. This can be seen with the help of various facts from recent works of Hiary and Vandehey [
30] as well as Ei, et al. [
27]. As explained in [
30], the density
may be expressed as
where
is a certain set related to
whose exact definition we will not get into here. The sets
are not completely understood at this point (if they were, we would understand the measure
much better), but they were thoroughly studied in [
27] where it was shown, among other things, that they are measurable and of positive Lebesgue measure. Hence, we see that from (
15), the limit for
inside either of the regions
,
, exists and is equal to the Lebesgue measure of the corresponding set
. Due to the symmetries mentioned in Theorem 1, the limits inside
,
,
and
must be identical and the same applies to the limits inside
,
,
and
. Hence, we may set:
Nakada had much earlier showed [
25] (Theorem 2) that the measure
is equivalent to the Lebesgue measure, i.e., there exists a constant
such that for all
:
Proof of Main Results
The main technical lemma of this paper is the following.
Lemma 2. For some constant , we have: Proof. For the purpose of this proof, we occasionally shorten notation by writing instead of .
Set:
hence, our goal is to determine the asymptotics of
. Let
be given by
and notice that
is a bijection which maps disks of radius
to complements of disks of radius
. Then, for
:
We define two sets
and
by
A simple geometric argument (see
Figure 2) shows that:
Figure 2 shows
,
and
for some unspecified
as well as the set
B.
is the (unbounded) grey region,
is the region
outside the green circle while
is the region
outside the red circle. The blue dots are the Gaussian integers. The set
A, not drawn in the figure, is some neighbourhood of 0 inside
B.
Applying
to (
17) we obtain:
Taking the
-measure in the above inclusion followed by a small computation which makes use of the fact that
is equivalent with respect to Lebesgue, we deduce that:
We now proceed to estimate
, i.e., the
-measure of the ball centred at 0 with radius
. Set
and:
Note that the intersection with
,
,
, and
is empty for a sufficiently large
j, hence we may ignore these regions for our purposes. We have:
The last equality follows from the symmetries noted in Theorem 1. We can now use the fact that
is continuous on each
and that the limits:
exist and are strictly positive to estimate
and
. Let
be given. For a sufficiently large
j, we have:
hence:
We may now estimate
as follows:
The lower bound is motivated by
Figure 3 which suggests that
becomes insignificant compared to
when
j becomes large. We make this precise by computing an upper bound for
.
We see from
Figure 3 that
is contained in a rectangle of side length
and
. Here the vertical side length is found as the imaginary value of the intersection point between
and the boundary of
D, i.e., solving:
leading to
and side lengths
. Hence, we obtain the bound:
We also use this bound to estimate
. Namely, we have
and:
Inserting these estimates in (
20), we obtain:
Since we are interested in the asymptotic behaviour as
, we can pick
to be arbitrarily small. Together with (
18), we come to the conclusion that for
, we have:
This completes the proof of the lemma. □
Remark 2. We repeat the point here that followed (15), namely that the constant H could potentially be made explicit if one could find a way of computing the Lebegue measure of the sets which are defined and studied in [27,30]. Proof of Theorem 5. As stated earlier, to finish the proof of Theorem 5, we need to show that condition (
3) in Theorem 2 is satisfied. However, at this stage, we only need to observe that, by choosing the sequence,
with
we obtain:
and the proof is complete. □