1. Introduction
Let
be a random process,
are given time moments, and
are finite-dimensional distributions of the process. The Monte Carlo method of modeling a process
usually consists of modeling its finite-dimensional distributions. Extensive literature is devoted to the algorithms of such modeling (for example, [
1] and some cited sources). The methods developed in these papers allow us to express realizations of a random vector
through a certain set of randomizations of a random variable
uniformly distributed on
, where
and, generally speaking, these can be random.
Ultimately, the problem solution
is represented as the expectation of a certain function
of a random vector
for selected finite values
n and
M. That means
, and the computational process consists in multiple (
N times) calculations of the independent realizations
of the vector
and in estimation of
using the arithmetic mean
. Let us notice that the value
may depend on the number of the realization
j. The maximum value
is called a constructive dimension of the algorithm. Finally, recall that
is expressed in terms of realizations of uniformly distributed numbers, i.e.,
, the integral over the
-dimensional unit hypercube, has the following form
Further, it is convenient to change the designations and talk about the calculation of the integral
using the
s-dimensional unit hypercube,
,
;
. The problem of calculating this integral using the Monte Carlo method is well known. If it is estimated using the sum
, where
are independent realizations of a random variable uniformly distributed on
, and
f is a quadratically integrable function, then for the error it is possible to construct a confidence interval of width
. At the same time, a number of articles [
2,
3,
4] based on the theory of numbers considerations, pointed out sequences of
s-dimensional vectors
, for which the error
decreased as
, for functions that have the first partial derivative with respect to each variable. This result is obviously almost
times better than the Monte Carlo method. The sequences
, possessing the property mentioned above, are called quasi-random. Extensive literature is devoted to their properties and applications (see, for example, [
5] and the bibliography available there).
As a rule, the authors consider that quasi-random sequences are significantly better than the pseudo-random sequences used in the Monte Carlo method. The legitimacy of such comparisons is studied in detail below.
2. Koksma-Hlawka Inequality and Random Quadrature Formulas
One of the well-known approaches to constructing the sum
where
are constants,
, which is used in the integral
calculation, is as follows. It is assumed that
belongs to a linear normed space of functions, the error of the integration formula
is considered as a functional in this space and parameters of the formula (1) are
and
,
are chosen so as to minimize the norm of the functional [
6].
This task is usually very difficult. It is enough to note that it is possible to obtain the explicit expression of the above-mentioned functional norm only in a few particular cases. One of these cases is
,
,
D is a unit hypercube
and
is a space of functions of bounded variation in the Hardy-Krause sense. In this case one can use the Koksma-Hlawka inequality [
7,
8]
where
is the variation mentioned above, and
is the error norm, which is called a discrepancy in the non-Russian literature or a deviation in the Russian one. Sometimes it is also called a star discrepancy. In the number-theoretical sense, this quantity characterizes the uniformity of the sequences distribution and equals to
where
is the volume of the multidimensional box
, and
is a number of points of sequences belonging to
.
Similar explanations regarding the value
can be found in
Appendix A for this article. For our purposes, it suffices to note that for
, one knows the upper bound for the asymptotic behavior of
[
9], namely
The authors of [
2,
3,
4] indicate of the algorithms for constructing sequences
for which equality is achieved in (5). These sequences are called quasi-random due to the formal similarity of algorithms of quasi-random numerical integration with the Monte Carlo method of calculating multiple integrals. As we have already said, the authors of numerous articles devoted to the study and use of quasi-random numbers usually note that the Monte-Carlo method has an asymptotic error decrease of the same order as
, while from (5) we can conclude that quasi-random methods provide an error decrease as
, more precisely
for any arbitrarily small
.
The proof of Inequality (3) given in the literature is quite large and complex. As we have already noted, it can easily be obtained by means of functional analysis. For
we showed it in the application. The general case simply requires more complex definitions. It can be noted that using the theory of cubature formulas [
1], one can obtain analogs of Inequality (3) for Sobolev functions and many other classes of functions and specify sequences for these classes that have faster order of the error decrease. However, at the same time, computational algorithms are usually significantly complicated.
Other important applications of classical computational mathematics arise while estimating the error of quasi-random methods. The construction of the confidence interval in the Monte Carlo method automatically gives an estimate of the error, but for the quasi-Monte Carlo in its pure form there is only the Inequality (3), which is of little use when solving a specific problem. This difficulty can be overcome by randomizing quasi-random points.
Suppose
are quasi-random vectors of dimension
s, and
are realizations of vectors of the same dimension, uniformly distributed in
. The sum
is an unbiased estimate of the integral
The curly brackets denote the operation of taking the fractional part performed on the components of the vector. The error for this randomized sum can be approximated using the central limit theorem. The following statement is trivial
where
is a vector uniformly distributed in
, is a random quadrature formula with one free node. The theory of such formulas is given in detail in [
1]. With the help of this theory, one can establish for which class of functions the formula is exact. A number of papers consider other methods for randomizing quasi-random numbers, known as scrambling methods.
3. Numerical Error Estimation Experiment: Monte Carlo and Quasi-Monte Carlo
In this paper we show that references to Inequality (3) when evaluating a computational algorithm are not completely correct, at least for
, and we suggest some correct approaches to determining the asymptotical error decrease of the quasi-random methods. It can be immediately noted that in the case of the Monte Carlo (MC) method, we are talking about the width of the confidence interval and the asymptotic behavior in the central limit theorem is already seen for small
N (
, for example). The situation is different for the expression (5). The multiplier
plays a significant role already for
. The asymptotics of order
can be reached when
, but as follows from
Table 1, for
the rate of error decrease of the two methods becomes approximately equal with
, for
, with
, for
with
.
The data given in
Table 1 clearly confirm the unsuitability of Inequality (5) to show the advantages of the quasi-Monte Carlo (QMC) method for calculating integrals of large multiplicity and, with increasing multiplicity of the integral, this situation worsens. Many authors (for example, [
10]) confirm that the real error decrease for different values of
N for
does not obey inequality (5), but behaves like
with
. Thus, one should speak about the quality of a particular quasi-random sequence only for reasonable values of
N, when the asymptotic behavior indicated by equality (3) hasn’t been fulfilled yet and one should introduce a reasonable quality criterion only from empirical considerations. First, we discuss the behavior of the integration error in some numerical examples. Let
be defined and integrable in the
s-dimensional unit cube
and the integral
is calculated using the cubature formula (1).
The exact value of the integrals
,
,
are known, they are equal to 1 for all
s.
Table 2 shows the absolute error of the QMC method, calculated for
and for the quasi-random Sobol (ErrS) and Halton (ErrH) sequences.
The obtained calculations lead to the following conclusions. The error values differ by no more than one order of magnitude for the considered sequences. The well-known estimate
cannot serve as a reasonable estimate of the error in this case (see the last line in the
Table 2). As already noted, the practical application of quasi-Monte Carlo for large
s is limited. The calculations of integrals by the QMC method, the exact value of which is unknown, do not allow us to compare the accuracy of the various quasi-sequences used among themselves.
4. The Criterion of Decreasing Residual
We show that the use of the randomized quasi-Monte Carlo (RQMC) procedure allows us to effectively estimate the error of the quasi-Monte Carlo method. Consider a new criterion, that will allow us to judge about the error decrease depending on N, and will provide an opportunity to compare the quality of various quasi-random integration methods.
As a quality criterion, we will use the average order of the error decrease.
Consider the error change interval for some randomized quasi-random quadrature formula. The average value for a given number of realizations N, is denoted and we will approximate using the least squares method with the function . The value of is called the average order of the error decrease. In the case when the exact value of the integral is unknown we use the error estimate obtained by randomization.
Let us estimate the average order of the error decrease of the numerical integration for integrals
. The exact value of these integrals is not known. The calculations were carried out for
(the number of nodes) and
(the number of randomizations) for quasi-random sequences of the Sobol and Halton method. Moreover, the total number of nodes is
. The value of random error
is approximated on the following intervals:
. The results of the calculations are given in
Table 3 and
Table 4, where
is the average order of the error decrease value on a given interval.
Conducted calculations show the relative stability of the estimate of for changing and .
5. Conclusions
Error analysis of numerical methods plays a major role in the choice of algorithms for solving a problem. The main goals of this article are to propose a new quality criterion for algorithms for calculating multidimensional integrals; point out the incorrect use of the Koksma-Hlawka inequality when comparing the asymptotic error behavior of the Monte Carlo and the quasi-Monte Carlo methods for calculating integrals; and propose a new quality criterion for calculation algorithm integrals with large multiplicity.
This will allow, in particular, to choose the ratio between the number of random and quasi-random components of the nodes used in quadrature formulas, when their number (constructive dimension) is very large. The results obtained are confirmed by numerical examples. It makes possible to judge the comparative quality of various quasi-random sequences in some cases.
The results obtained, can be useful in solving other problems (for example, optimization problems). However, this requires separate studies that are beyond the scope of this work.