Next Article in Journal
Optically Transparent Dual-Band Metamaterial Absorber Using Ag Nanowire Screen-Printed Second-Order Cross-Fractal Structures
Next Article in Special Issue
Analysis of Pore Characterization and Energy Evolution of Granite by Microwave Radiation
Previous Article in Journal
Convolution Kernel Function and Its Invariance Properties of Bone Fractal Operators
Previous Article in Special Issue
Changes in Pore Structure and Fractal Characteristics of Solvents Pretreated High-Rank Coal under Supercritical CO2
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statistical Study of the Bias and Precision for Six Estimation Methods for the Fractal Dimension of Randomly Rough Surfaces

by
Jorge Luis Flores Alarcón
1,
Carlos Gabriel Figueroa
2,
Víctor Hugo Jacobo
1,
Fernando Velázquez Villegas
1 and
Rafael Schouwenaars
1,3,*
1
Departamento de Diseño y Manufactura, Facultad de Ingeniería, Edificio O, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Coyoacan 04510, Ciudad de Mexico, Mexico
2
Departamento de Materiales y Manufactura, Facultad de Ingeniería, Universidad Nacional Autónoma de México, Via de la Innovación 410, PIIT, Apodaca 66629, Nuevo León, Mexico
3
Department of Electromechanical, Systems and Metals Engineering, Ghent University, Technologiepark 46 Zwijnaarde, 9052 Ghent, Belgium
*
Author to whom correspondence should be addressed.
Fractal Fract. 2024, 8(3), 152; https://doi.org/10.3390/fractalfract8030152
Submission received: 15 January 2024 / Revised: 29 February 2024 / Accepted: 4 March 2024 / Published: 7 March 2024
(This article belongs to the Special Issue Fractal Analysis and Its Applications in Geophysical Science)

Abstract

:
The simulation and characterisation of randomly rough surfaces is an important topic in surface science, tribology, geo- and planetary sciences, image analysis and optics. Extensions to general random processes with two continuous variables are straightforward. Several surface generation algorithms are available, and preference for one or another method often depends on the specific scientific field. The same holds for the methods to estimate the fractal dimension D. This work analyses six algorithms for the determination of D as a function of the size of the domain, variance, and the input value for D, using surfaces generated by Fourier filtering techniques and the random midpoint displacement algorithm. Several of the methods to determine fractal dimension are needlessly complex and severely biased, whereas simple and computationally efficient methods produce better results. A fine-tuned analysis of the power spectral density is very precise and shows how the different surface generation algorithms deviate from ideal fractal behaviour. For large datasets defined on equidistant two-dimensional grids, it is clearly the most sensitive and precise method to determine fractal dimension.

1. Introduction

The description of surface topography by the superposition of harmonic waves with randomly oriented wave vectors and phase angles was pioneered by Longuet-Higgins in relation to the moving sea surface [1,2]. It uses the relationship between the magnitude of the wave vector and the power spectrum (PS). Whitehouse et al. [3] were among the first to analyse the roughness of engineering surfaces under the concept of random process theory. With increasing measurement size and resolution, it was found that roughness increases with the length (area) of the measured domain [4,5,6,7].
Fractals are often associated with graphics in a plane, such as the Julia and Mandelbrot sets [8] or with problems like the coastline paradox [9], explained in terms of a Hausdorff dimension in a classical paper by Mandelbrot [10]. A geometrical analysis of fractal dimension follows naturally from the graphical nature of the object studied. For random fractal surfaces, or more generally, for the self-affine stochastic processes found in many scientific disciplines, the problem must be approached differently. A mathematical definition of the basic problem is the most direct way to start.
The increase in the variance with an increase in the size of the domain over which the random variable is measured follows from Plancherel’s theorem [11] for self-affine surfaces with a power-law PS: let the surface topography be described as a function z ( x , y ) , with an isotropic PS:
P ( r ) = { 0 r < r l C r 2 ( H + 1 ) r l < r < r u 0 r u
Here, r = k 2 + l 2 , the magnitude of the wave vector k with components ( k , l ) . r l and r u are the lower and upper cut-off radii in the frequency domain. C is a constant. The measurement length corresponds to the maximum wavelength λ u = 2 π / r l and the resolution of the measurement λ l = 2 π / r u , where H is the Hurst exponent and the fractal dimension D = 3 H . The variance of the height is given by the following:
z 2 = 0 2 π r P ( r ) d r = π C r l 2 H r u 2 H H
The operator < · > refers to the expected value. For r l / r u 1 , the second term in the numerator can be neglected, and the standard deviation σ = z 2 ~ r l h ~ λ u h , i.e., the root mean square roughness increases with the measurement length λ u .
It has been shown that λ l can be of atomic size in diamond coatings [12,13], while λ u can be as large as the size of the observable universe [14]. The cut-off radii are generally not imposed by the physics of the phenomenon but by instrumental limitations, although a combination of measurement techniques at overlapping observation scales has been used to successfully extend λ u / λ l [4,12,13]. In geo- and planetary sciences, multifractal descriptions are often used, where the value of H is allowed to vary with λ u and/or λ l [15,16,17]. In materials science and tribology, an upper cut-off λ u is universally observed, beyond which the roughness no longer increases with the measurement length [12,13,18]. λ u can be associated with the intuitive concept of coarseness, while H can be connected to complexity [19]. In numerical simulations, λ u / λ l is determined by the size of the dataset, and λ u can be set to 1 without loss of generality.
Different definitions of the fractal dimension can give different results for D [20]. A distinction must be made between the mathematical definition of D and the numerical methods used to estimate D from a discrete set of measured or simulated data. From a mathematical viewpoint, it is sufficient to find a single example in which two definitions of D produce different results. Although there are many counterexamples, the Minkowski and Hausdorff dimensions coincide in almost all cases, and objective criteria exist to test this coincidence [21].
In numerical methods, if D M is the estimate of D by a given method M . If two methods produce distinct values D M , it must be checked if they correspond to established mathematical exceptions. If not, the difference is more likely due to the numerical procedure missing some essential details of the geometry or because of the finite size and resolution of the available dataset. All definitions of fractal dimensions involve a limit for λ l 0 . Limitations on λ u / λ l may severely distort the relationship between D M   and D .
Most methods for determining D are only applicable to a limited subset of geometries [20]. This study assumes that a smooth reference surface exists on a simply connected domain on which a function z ( x , y ) can be defined. For engineering surfaces and many geological and geomorphological studies, z is the height. More generally, z can be any random variable measured along a line or surface, such as the grey scale or colour value of pixels in an image [22,23,24,25] or the temperature of the cosmic background radiation [26,27]. Contours of constant z 0 = z ( x , y ) (or w 0 = w ( x , y , z ) in three dimensions [28]) cannot be analysed by the methods studied here, except for box counting methods (BCM) and a strongly modified power spectral density method [29].
This work will provide clear definitions of the bias and precision of the different methods M for the determination of H, with the goal of determining which method provides the best estimate for H. Section 2 of this paper describes the methods used to simulate the random surfaces and calculate H. Section 2.1 explains how this combination defines a random process. Section 2.2 presents three surface generation algorithms. It justifies the selection of only two of them for further analysis. The following seven subsections describe the methods used for determining H. Section 3 describes the simulation scheme, identifies the parameters of interest, and provides a statistical analysis of the results based on linear regression and analysis of variance (ANOVA). The results are discussed in terms of bias, dispersion, precision, and computational and information efficiency. This will allow us to reach a clear conclusion with respect to the problem posed in the title of this paper.

2. Methods

2.1. Surface Simulation and Characterisation as a Random Process

Simulated surfaces will be used to analyse the methods for estimating H. The process is illustrated in Figure 1. For Gaussian surfaces with zero mean, the input values are the variance < z 2 > i n , H i n , λ u , and λ l . These are deterministic values. The surface generation algorithm S acts on a finite set { g i }   of random values, producing a single representation z S ( x , y ) . Because of the random nature of { g i } , the mean value of μ S 0 and < z 2 > S < z 2 > i n in general. Shifting and rescaling the simulated surface can correct these random variations. A numerical procedure M then calculates the simulated Hurst exponent H S M (or   H o u t ), which is a random variable, which will be characterised by its mean and standard deviation.
The bias is defined as B S M ( H i n ) = < H S M > H i n . The variance < H S M 2 > < H S 2 > ; the right-hand side is the variance of the unknown Hurst exponent of the surface simulated with S . The total dispersion can be characterised by the standard deviation σ o u t ( H i n ) = < ( H S M H i n ) 2 > . < H S 2 > cannot be estimated independently of M , but the smallest < H S M 2 > obtained from a given set of methods will give an upper bound estimate for < H S 2 > .

2.2. Surface Generation

Surfaces used in this study are defined on an equidistant set of points ( x i , y j ) , with i , j : 1 2 n 0 and n 0 = 8, 9, or 10. The side of the square domain can be arbitrarily set equal to 1. Then, λ u = 2 and λ l = 2 n 0 . Three surface generation methods S are commonly used to generate z ( x , y ) . The generalised Weierstrass–Mandelbrot function [30,31] ( S = W M ) has recently recovered some popularity [32,33,34,35,36,37,38]. It is described by the following:
z ( x , y ) = λ u ( G λ u ) D 2 log γ M m = 1 M n = 0 N γ ( D 3 ) n cos ( φ n m cos ( 2 π γ n x 2 + y 2 cos ( arctan ( x 2 y 2 ) π   m M ) + φ n m ) ) .  
G is called the fractal roughness, and γ is the lacunarity. The random phase angles φ m n are chosen from a uniform distribution on [ π , π ] . The observation that the second cosine term in the sum is independent from the random phase angles allows speeding up the calculation of the WM surface during Monte Carlo simulations. Even so, the method is not computationally efficient compared to the following algorithms.
The midpoint displacement method ( S = M P ) has been widely used in computer graphics [39] and in early studies on fractal landscapes [40,41]. The original midpoint algorithm suffers from the problem that the initial few values can dominate the entire simulated surface, creating clear cross-patterns in the result. This can be corrected by modifying the way in which random values are added during each simulation step [39]. It is also feasible to ensure periodicity of the surface, which is important in some simulations, e.g., in contact mechanics, where a Fourier transform boundary element method is employed [42,43].
The Fourier transform method ( S = F T ) [43,44] uses normally distributed variables { g i j } on a regular grid with a size of 2 n 0 × 2 n 0 . This set is transformed to the frequency domain using a fast Fourier transform (FFT), defining { g i j ^ } = F { g i j } . By multiplying with the PS P ( k , l ) in the frequency domain and applying the inverse Fourier transform (IFT), { g i j ^ }   is filtered:
z ( x , y ) = F 1 ( ( R e ( g i j ^ ) + i   I m ( g i j ^ ) ) P ( k i 2 + l i 2 ) ) ,
where i = 1 ; the linear combination of real and imaginary parts in Equation (4) generates a purely real z ( x , y ) .
It shall be remembered that the FFT is only an efficient way of summing cosine and sine terms in a discrete Fourier transform (DFT). This is equivalent to the original approach by Longuet-Higgins [1], but Equation (4) allows for the efficient inclusion of 2 2 n 0 harmonic terms, which would require unacceptable computer times if the surface were generated by naive algorithms. However, by using the FFT, aliasing is introduced, which results in a statistical height distribution that is not Gaussian. This aspect can be improved by eliminating the longest wavelengths from the analysis [45].
Figure 2 shows three topographies generated by WM, MP, and FT. Their differences are seen in P S and the autocorrelation R S (Section 2.7). P W M shows many individual peaks corresponding to the individual combinations ( m , n ) in Equation (3) and has ridges along the coordinate axes. The rotational symmetry inherent to Equation (3) is visible in R W M as small wrinkles in the conical part of the surface. Elaborating Equation (3) is computationally inefficient and limits the use of WM for large-scale simulations.
P M P and R M P show pronounced edges along the coordinate axes. The preferential 90°–45° orientations imposed by the algorithm are clearly visible in R M P . P F T consists of a noisy signal around the centre of the spectrum that decays according to Equation (1). R F T consists of a conical peak superposed on a background ridge. Only the conical part is important for the analysis (Section 2.8).

2.3. Box Counting Methods

Box counting methods (BCM) are, in principle, numerical implementations of the theoretical definition of a Minkowski dimension [18]. For fractal curves defined on a unit square domain, the method consists of dividing the image into a set of m × m identical squares with length a = 1/m and counting the number nm of squares which contain at least one point of the curve. The box counting dimension is determined as the following:
D B C M = lim m log n m log m
If there is an algorithm for infinitely refining the scale of the curve, m is limited only by computational precision, and the method is very accurate [46,47,48]. For experimental data, m is limited by the spatial resolution. The procedure is analogous in 3 dimensions [28].
For functions defined as z ( x , y ) on a finite grid, the method produces considerable bias [49]. Several recent publications have focussed on the rule that nm should be the infimum of all possible covering configurations at a given size [22,50]. This infimum can be approximated by shifting the covering boxes with respect to their ideal position [51]. Differential box counting (DBC) adapts for differences in vertical and horizontal scale by calculating the range z over the domain and using boxes of vertical size z / m . A lack of vertical resolution is addressed by considering only the highest and lowest boxes that contain part of the surface in each column of boxes; all intermediate boxes will contain a part of the surface, even if the corresponding points do not appear in the dataset [49]. This modifies Equation (5) as the following:
D D B C = lim m log i m M i m i z log m
where the sum is taken over all m 2 boxes, · represents the ceiling function, and M i and m i are the maximum and minimum values within square i. The method is closely related to the Hall–Wood estimator [52,53]. Differences between the original definition of a box counting dimension, BCM and DBC, are illustrated in Figure 3.
Notice that Figure 3, Figure 4 and Figure 5 use time series to exemplify the basic concepts of the methods and are for illustrative purposes only. The curve shown in Figure 3a and Figure 4a corresponds to an arbitrary contour line of a surface produced by FT. The data series represents an arbitrary vertical section through this surface in arbitrary units. All methods described in this paper refer to the fully three-dimensional functions, as shown in Figure 2.

2.4. Triangular Prism Method

The triangular prism method (TPM) [40,41,54] is an extension of the Hausdorff dimension as used by Mandelbrot [10] to explain the coastline paradox [9]. The original method is applied to contour lines in the plane, where the number n i of segments (yardsticks) with a given length l required to cover the entire contour is determined by geometrical construction. TPM extends this approach by covering a surface with a mesh of triangles of a given size. The total surface area s T o t is the sum of all areas obtained by approximating the surface by triangles over a grid of square domains with side a and the following equation:
D T P M = lim a 0 log s T o t log a
Because surface area does not linearly scale with height, this method is sensitive to the vertical scale, which would limit its use to self-similar surfaces [55]. However, it has been proven that Equation (7) always converges to the correct value of D, but very small values of a may be required, corresponding to very large datasets [46]. The difference between the original definition of a Hausdorff dimension and its application to a time series is shown in Figure 4. For more details on the geometry in the case of two-dimensional surfaces, the reader is referred to Ref. [49].

2.5. Detrended Fluctuation

Detrended fluctuation (DTF) was originally developed for the study of single-variable processes that show nonstationarity [56]. Instead of analysing the variance of the function over a time interval t , a linear or polynomial fit is made to the accumulated function over t , and the residual variance of this fit is analysed as a function of t . The method was adapted to two-variable random processes by Gu and Zhou [57,58], with an emphasis on multifractal analysis.
Here, the cumulative function Z ( x , y ) is calculated over each square subdivision with side a. A fitting polynomial f ( x , y ) is adjusted to Z ( x , y ) on the individual squares. The residual variance v ¯ a , averaged over all squares, obeys v ¯ a ~ a H , hence the following equation:
D D T F = 3 lim a 0 log v ¯ a log a
Linear fits and second-degree polynomials are commonly used for f ( x , y ) [49]. This work uses the following Equation:
f ( x , y ) = c 00 + c 10 x + c 01 y + c 20 x 2 + c 02 x y + c 11 y 2

2.6. Roughness–Length Method

The roughness–length method (RLM) is based on the log–log plot of the variance measured over square areas of length a, according to Equation (2). Early Monte Carlo simulations [59] showed that its reliability is comparable to the power spectral method for time series. Its use is popular in the field of rock mechanics [60]. It was used to analyse the quality of surfaces generated by the FT and WM methods [32]. The number of simulations in this study was limited.
The implementation and use of the RLM are simple. Its theoretical basis (Equation (2)) is straightforward. Therefore, extended studies on its theoretical background or numerical improvement have not been published, contrary to TPM, BCM, DBC, and DTF. Its use for the characterisation of random surfaces is nonetheless interesting, as will be shown in Section 3. An illustration of the difference between RLM and DTF is shown in Figure 5 for the simplified case of time series.

2.7. Power Spectral Density and Related Functions

In most cases of interest, the function z ( x , y ) can be described with respect to a fixed mean value, which can be set to 0 without loss of generality. This is certainly true for the numerically generated surfaces studied here. Three closely related functions are defined for discrete datasets on a regular grid ( x i , y j ) with i , j : 1 2 n 0 . The autocorrelation of a real-valued function z ( x , y ) is defined as the following:
R ( ξ , ψ ) = z ( x , y ) z ( x ξ , y ψ ) d x d y
or for discrete data:
R ( ξ k , ψ l ) = 1 2 2 n 0 i = 1 2 n 0 j = 1 2 n 0 z ( x i , y j ) z ( x i ξ k , y j ψ l )
The convolution in Equation (10) can be easily performed as a multiplication in the frequency domain. Likewise, the discrete convolution in Equation (11) is most easily performed as a multiplication after FFT followed by IFT. The power spectral density (PSD) is the Fourier transform of the autocorrelation function, i.e.,
P ( k , l ) = F ( R ( x , y ) ) = | F ( z ( x , y ) ) | 2 .
The variogram [61], height correlation function [62], or structure function [63] is defined as follows:
γ ( ξ , ψ ) = < ( h ( x , y ) h ( x ξ , y ψ ) ) 2 >
Given that γ ( ξ , ψ ) = 1 R ( ξ , ψ ) , all three functions hold the same information [62,64]. Their numerical implementation and fields of application may differ, which may induce differences in the estimated value of D M , with M referring to PSA (power spectral analysis) or SFA (structure–function analysis).
Variograms or structure functions are widely used in geosciences [15,16,17] but are seldom found in tribology and surface sciences. One reason is that the variogram can be applied without defining a reference level (mean value) and can be calculated for limited datasets obtained from irregularly spaced sample points, which are typical for geological exploration [65]. The other reason is that surface profilers and atomic force microscopes usually come with an extensive postprocessing package, which includes background corrections and PSD calculation in the form of black-box procedures [66,67].

2.8. Tuning of the PSA

A common strategy to calculate the PSD from a measured or simulated surface is to apply an FFT to each individual line z ( x , y j ) and averaging the results. In this work, P ( k 2 + l 2 ) is calculated and averaged over all values with identical radii. Both approaches are illustrated in Figure 6. While there is a large dispersion on the individual P F T ( k , l j ) , the average over all j is restricted to a very narrow band. P F T ( k 2 + l 2 ) shows a much larger spread. Meanwhile, for j : 1 1025 for the line profiles, there are 82,798 different values for r = k 2 + l 2 in a 1025 × 1025 grid (1,050,625 points). The lower number of values for each value of r produces a larger variance of the sample mean.
The data in Figure 6b present two inconveniences for correct regression analysis. The residual variance is not constant, and the density of data points increases strongly with r. Figure 6c shows the results of applying a low-pass filter to the average data of Figure 6b, with threshold radii r T h r = 1 , 2 4 , 2 8 . With the highest threshold, linearity is respected over the entire log 2 r range, but dispersion is considerable for large values of r. As r T h r is increased, dispersion decreases, but the curve flattens for small r T h r . This phenomenon is artificial and should not be confused with an upper cutoff. For the statistical simulations reported in Section 3.3, r T h r = 2 5 was used, with linear regression performed for r : 2 6 2 1 .
An important aspect of PSA is illustrated in Figure 7. P F T ( r ) correctly reproduces the power-law PSD used as an input for FT. This is not a trivial result because the algorithm used to determine H by PSA does not “know” what kind of surface is used as an input, i.e., PSA does not use a trivial deconvolution of { g i j ^ } and P ( r ) because the input { g i j ^ } are unknown to the algorithm. P M P ( r ) flattens at high frequencies. Although the theoretical analysis of MP shows that this method produces a power-law PSD [39], its numerical implementation induces a deviation of this theoretical behaviour. P W M ( r ) shows the strongest deviation from the ideal behaviour. The peaks in the spectrum are due to the finite values of N and M in Equation (3). Each peak corresponds to a circle of spikes in P W M ( k , l ) (Figure 2). Using PSA without accounting for this behaviour leads to large errors [32].

2.9. Tuning of the SFA

A reference curve for the structure function γ ( r ) can be calculated by eliminating C from Equations (1) and (2) and taking the limit for r u for P ( r ) . This produces the following:
P ( r ) = { 0 r < r l < z 2 > H r l 2 H π r 2 ( H + 1 ) r l r
with the help of mathematical software, the IFT of Equation (14) can be found as the following:
R ( ρ ) < z 2 > = 1 < z 2 > F 1 ( P ( r ) ) = 4 H r l 2 H ρ H Γ ( 1 H ) Γ ( 1 + H ) + 1 F 2 ( H ; 1 , 1 H ; 1 4 r l 2 ρ 2 ) ,
where Γ ( · ) is the gamma function, and 1F2(·) is the generalised hypergeometric function. As R ( ρ ) has zero crossings (Figure 8a), its representation in a log 2 log 2   plot is meaningless. The log–log plot for γ ( ρ ) = 1 R ( ρ ) shows asymptotic linear behaviour for log 2 ρ < 0 (Figure 8b). Figure 8c shows the average of γ ( ρ ) over all points with the same ρ for H = 0.25, 0.5, 0.75 and S = F T .
Contrary to the radial average of the PSD, no filtering is needed for SFA, but the range where the theoretical prediction and linear approach are valid is limited compared with the filtered PSD. There is a visible difference between the slope of the simulated γ ( ρ ) and the theoretical curves. The horizontal offset of the simulated γ ( ρ ) is a random value with broad dispersion. For calculating H, this offset is irrelevant.

3. Simulations and Results

3.1. Simulation Scheme

Five factors were considered in this study: the surface generating algorithm, the size of the dataset, the vertical scale σ 0 = < z 2 > i n , the method for calculating H, and the value of Hin. The domain is defined as a grid of 2 n 0 × 2 n 0 points, with n 0 taking values of 8, 9, and 10. An earlier study [49] showed that TPM, and to a lesser extent, DTF, depend on vertical scaling; three roughness values σ 0 equal to 0.25, 0.5, and 1 are considered. The methods used are DBC, TPM, DTF, PSA, SFA, and RLM.
The value of H i n was varied from 1/128 to 127/128 in steps of 1/64. These values are chosen to exploit the perfectly parallel nature of the calculations on a processor with eight cores. Two surfaces were created for each combination of S ,   n 0 and H i n . For each of these surfaces, the three values of σ 0 and all six methods were applied. The scheme allows the assessment of bias and dispersion by standard regression analysis and the comparison of results by ANOVA.

3.2. Data Analysis

For both S , each M and n 0 , H S M (or H o u t ) is plotted vs. H i n , for each value of σ 0 . The linear regression is calculated with the following Equation:
H S M = a 0 + a 1 H i n
is determined by standard least squares. A fourth regression line is estimated for the pooled data, disregarding the value of σ 0 . The residual variance σ O u t 2 of the pooled data is compared with the mean residual variance of the three individual regression lines. The resulting F-ratio is compared with the 90% quantile of the F-distribution with the corresponding degrees of freedom, and the p-value is calculated. If p > 0.1, it is concluded that M is insensitive to σ 0 in the range evaluated.
The value of   a 1 characterises the bias of the methods. Although this value cannot be studied entirely independently of a 0 , a 1 affects the precision of a method. Having determined H S M (Equation (16)) and knowing a 0 and a 1 , a corrected value can be found as the following:
H c r r = H S M a 0 a 1
σ o u t represents the dispersion of the method. The precision of M can be defined as the difference between H i n and H c r r for a measured H S M , quantified, for example, as the standard deviation between the predicted value H c r r and H i n is as follows:
σ P r e d = < ( H i n H c r r ) 2 >
σ o u t interacts with a 1 in determining σ P r e d . Small values of a 1 magnify σ o u t when projected on the horizontal axis of the H S M vs. H i n -plot. σ P r e d is obtained from the regression analysis on a H i n vs. H out -plot, under the assumption that the residuals of this regression are independent of H S M . If the residuals follow a normal distribution, the confidence intervals can be calculated. The assumption of normality was assessed using the Kolmogorov–Smirnov test.

3.3. Results for S = F T

The complete results for n 0 = 10, σ 0 = 0.25, 0.5, and 1 and all M for S = F T   are presented in Figure 9. As an example, Table 1 presents the complete analysis of the regression results for DTF and DBC. For all M , the regression results are independent of σ 0 , although the P F -values for TPM are close to 0.1. A summary of the pooled data for all M is shown in Table 2 and Table 3, for FT and MP and all n 0 , together with P F and the 90% confidence interval for H c r r . None of the P K M indicate deviations from normality, but note that for n 0 = 8, P F = 0.025 for TPM, confirming its dependence on σ 0 , as observed in an earlier study [49]. All methods except RLM and PSA present a visible increase in the variance with H i n . This effect is not studied in detail and is neglected in the statistical analysis.

3.4. Results for S = M P

The results for S = M P differ considerably from S = F T . Figure 10 shows that SFA has a large bias, especially for high values of H (low D), where the predominance of the first few values in MP is not masked by the strong fluctuations associated with low H. In the statistical results (Table 3), the precision for H T P M and H D B C varies significantly with σ 0 , as found in an earlier study [49]. This effect depends on both the surface simulation method and the algorithm used for the calculation of D.

4. Discussion

4.1. Bias, Dispersion, and Precision

Most studies on the fractal dimension of measured data conclude that different methods yield different results, as is clearly confirmed here. However, if the exact input value H i n is a priori unknown, there is no way to decide which H S M is closer to this value. In the field of image recognition, there is a tradition of applying different methods to a standard library of bitmaps and evaluating whether a method can distinguish between similar images [23,24,25,51]. Methods are then compared based on their consistency with earlier research, but such an approach does not provide information on bias or statistical dispersion.
Using simulated data with known H i n , two factors become clear at once. The first is the bias B ( H i n ) , as captured by a 1 . DBC is clearly underperforming in this regard. However, by Equation (17), this can be corrected by if a 0 and a 1 are calibrated through numerical simulation. Dispersion (σout) cannot be corrected. In this aspect, DBC outperforms the other methods, except for PSA. σout combines with B to determine σ P r e d and the confidence interval for H c r r . A low value of a 1 will magnify the dispersion on H c r r for a given H o u t . The low dispersion found in DBC cancels out the effect of bias for S = P S . This effect has not received due attention in the literature; most studies comparing different methods for the determination of D do not define precision.

4.2. Computational Efficiency

In the context of this study, efficiency can be interpreted either in terms of computational speed or the optimal use of information. Computation speed is often a secondary concern if the effort involved in measuring z ( x , y ) is much greater than the work involved in the calculation of D. However, as the size of the datasets increases, the computational cost may blow up. In applications such as real-time pattern recognition, computational speed is essential. PSA, DBC, and RLM will have an advantage here.
To compare computational efficiency, it must be ensured that the most efficient algorithm is used for each method. A straightforward example is the use of FFT in SFA and PSA, but efficient algorithms are also known for BCM [68] and direct calculation of R ( ρ ) and γ ( ρ ) [69]. As the algorithms in this study have not been fully optimised for efficiency, only the most evident effects will be mentioned. As an example, the full PSA routine used here for n 0 = 10 takes 5″ using a single core on an 11th Gen Intel Core i9-11950H @ 2.60 GHz. Additional software optimisation can certainly increase this speed.
Calculation of the cumulative function in DTF and of the variance in RLM can be significantly sped up using the results at length scale a to obtain the values for 2a. A naive calculation of z 2 was used in RLM because, even without optimisation, this method is very fast. In contrast, DTF blows up rapidly with n 0 . The use of the cumulative function instead of the original values, combined with a regression analysis for each subset in each step, generates computational overhead. TPM is not very efficient either and is not very precise.
Further development of DTF and TPM to analyse physical measurements in the form of z ( x i , y j ) shows little promise. Considering the algorithms used in this study, DBC, RLM, PSA, and SFA are at least ten times more efficient than TPM, which is significantly faster than DTF. SFA takes about twice the time of PSA simply because of the calculation of the IFT in addition to the FFT. Given the lower precision of SFA, PSA is preferred.

4.3. Information Efficiency

Information efficiency describes how much of the available information in the dataset is effectively used to estimate D, including the individual values of z ( x i , y j ) and their spatial correlation. Power spectral analysis and autocorrelation were originally developed for the analysis of time series. Contact surface profilers measure roughness along a line [3]; hence, estimating fractal dimensions by analysing data along a line is a deep-rooted tradition [61,63,70,71].
Most modern applications generate large datasets over a rectangular array of regularly spaced points, as is the case for images obtained by digital cameras, atomic force microscopes, optical roughness measurements, or photogrammetry data. To analyse single lines out of a full set of f ( x i , y j ) data induces a considerable loss of information and may even distort the conclusions of the analysis [72,73]. The effect is clearly illustrated in Figure 6a, where the power spectrum of single lines shows extreme dispersion, but the average value of the 1025 lines appears as a well-defined straight line. Single-line analysis should never be performed if data are available over a 2D domain.
Even so, DTF, TPM, DBC, and RLM do not use all available information because they are typically implemented by subsequent division of intervals by two in each step. For the periodic surfaces generated by FT, there are 2 2 n 0 possible positions to collocate the corner of the first interval. While the computational cost of DTF and TPM does not encourage exploration of this effect, it has been used to decrease the statistical dispersion in DBC [20,21,51] and can be used in RLM. According to the definition of the Minkowski dimension [20], in DBC, the infimum of n m should be determined, while in RLM, the mean value can be used. Calculating all possible values would significantly increase the calculation time for RLM and DBC. SFA and PSA are not subject to this observation.

4.4. Effect of Vertical Scaling and Resolution

Vertical scaling clearly affects TPM but was found to be insignificant in all other methods for FT. It also affects DTF for MP. It is always possible to apply vertical rescaling such that σ 0 is of the same order of λ u if the significant number of digits for x, y, and z is equal, i.e., image depth should be equal to image resolution [49]. If not, scaling only increases the gaps between consecutive vertical values. Many scientific cameras obey this condition, and optical roughness measurements often have a higher vertical resolution than lateral resolution. In numerical simulations, the vertical resolution is limited by machine precision, while the lateral resolution is defined by n 0 .
In terms of cameras and measurement systems available in many laboratory settings, n 0 = 10 is rather modest. However, for time-consuming measurements, large sample sets, or computationally complex simulations, it may still be interesting to work at lower n 0 . It follows from Table 2 and Table 3 that σ o u t increases with decreasing n 0 . The effect is small for all methods except for PSA, where the increase in σ o u t is proportional to the reduction in spatial resolution. Still, at n 0 = 8 , PSA outperforms all other methods at n 0 = 10 , i.e., better estimates can be obtained using smaller datasets.

4.5. Effect of the Surface Generation Algorithm

PSA is more precise than the other methods for FT by a factor of 10. P F T ( r ) is related to the input function P ( r ) in Equation (4) by { g i j ^ } , which randomises the result. It has been claimed that using PSA with S = F T is “unfair” [32] because one is presumably the inverse of the other. It was pointed out in Section 2.8 that this is not the case; the implementation of PSA used in this work is independent of the surface generating algorithm. The excellent results obtained by PSA from surfaces generated by FT show that both methods are consistent.
The radial PS of surfaces defined by MP has a reduced linear section in the log–log plot (Figure 7). DTF is slightly more successful here than for S = F T (Table 2 and Table 3) because the principal characteristic of DTF is to filter out effects of nonstationarity, which is inherent to MP [37]. Zhang et al. [32] detected a breakdown of PSA for WM. This can be associated with the discrete peaks in the PS (Figure 2b and Figure 7c). These peaks are the reason why WM was not included in the analysis, as they do not appear in the experimental spectra available in the literature [12,13,14,15,16,17,18].
As a reference, a series of simulations was performed for WM with n 0 = 10 using RLM and PSA. The latter method does not produce useful results over the entire range of H. RLM gave H W M   R L M = 0.082 + 0.81 H i n , with σ o u t = 0.02 . This is comparable to RLM and SFA for FT. It follows that an approximate value for H can be obtained, even if the initial assumption (Equation (1)) is not fulfilled. Without the analysis of the PS, the peaks in the PS for WM would be overlooked. An extensive statistical analysis of the available surface generation algorithms, in terms of their PS, H, and the probability density distribution of surface heights, is an interesting topic for future investigation.
If the purpose of a study is to simulate a surface which conforms to Equation (1), the peaks in the PS created by WM induce a deviation from the basic hypothesis. Whether such preferred frequencies are important in the simulation depends on the application. For optical phenomena, they can be very important as they may interact with the frequency of the optical signal. In contact and fracture mechanics, the effect of the details of the PS has not been critically analysed. MP introduces a smaller deviation than WM, but its effect on specific applications has not been quantified either.
If the goal is to estimate the fractal dimension of some measured dataset z = z ( x , y ) , it is important to investigate whether the hypothesis of a power-law PS is fulfilled. This can be carried out by plotting P ( k 2 + l 2 ) as in Figure 7 or by studying the residual standard deviation and coefficient of determination of the linear fit to the log 2 log 2 plot for P ( r ) . Failure to do so may lead to incorrect or incomplete conclusions about the character of the data.
The simulations made in this work are meant as a reference for the analysis of experimental data. In an experimental setting, D is a priory unknown, so it is not possible to compare and calibrate the methods objectively. For measurements on an irregular grid, the present version of PSA is not feasible, and other methods may be selected. For experimental results on a regular grid, which may show deviations from the exact power law (Equation (1)), PSA can detect such deviations, while techniques such as DBC and DTF will mask them, which may represent an unacceptable loss of information.
Refined methods for the characterisation of more complex random processes on a two-dimensional domain can be devised. Substituting the simple power law by more advanced fitting functions may be the first approach to this problem. For the simulation of surfaces which purposedly deviate from Equation (1), the FT technique can use more refined expressions for P ( k , l ) in Equation (4).

5. Conclusions

A basic rule in scientific research is that measurement techniques must be calibrated to reliable standards. Neither calibration techniques nor standards have been clearly defined for determining the Hurst exponent of random fractal surfaces defined as z = z ( x , y ) on a unit square. This study defines the bias, dispersion, and precision of the methods. Bias refers to the difference between the output and input values of the Hurst exponent. It is a function of the deterministic input value H i n . Dispersion is the standard deviation of the random variable H o u t , and precision is the standard deviation after correction for bias, i.e., on H c r r .
The consistency and high precision of the power spectral analysis for surfaces produced by the Fourier transform method establishes the latter as a useful standard for the other methods. The detrended fluctuation and triangular prism methods can be rejected because of their computational cost and low precision. Based on the bias, differential box counting should be rejected. Box counting methods were developed to characterise contours z 0 = z ( x , y ) in the plane but not for surfaces z = z ( x , y ) in three dimensions, where they underperform.
Differential box counting and detrended fluctuation can filter out deviations of the perfect power law behaviour of the power spectrum, but power spectral analysis accurately detects these deviations. Future research should, therefore, focus on exploring the unique capabilities of power spectral analysis, rather than trying to improve methods that are computationally inefficient or make suboptimal use of the information available in the measurements.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/fractalfract8030152/s1, Mathematica notebook: FractDimStatSim.nb.

Author Contributions

J.L.F.A., Methodology, Software, Investigation, Formal analysis, Visualisation, and Writing—Original draft; C.G.F., Resources, Funding acquisition, Supervision, and Writing—Review and editing; V.H.J., Conceptualisation, Methodology, and Writing—Review and editing; F.V.V., Software, Supervision, Writing—Review and editing; R.S., Conceptualisation, Methodology, Software, Formal analysis, Supervision, and Writing—Final text. All authors have read and agreed to the published version of the manuscript.

Funding

The PhD scholarship for JLFA was provided by CONACYT (Mexico). This research was partially funded by the DGAPA project PAPIIT-IA106720.

Data Availability Statement

Numerical values of the data presented in the figures will be made available upon reasonable request. The software used to create the data is attached as a Mathematica notebook (Supplementary Materials).

Acknowledgments

Technical support by I. Cueva, E. Ramos, J.L. Romero, and G. Álvarez is greatly acknowledged.

Conflicts of Interest

The authors declare that there are no conflicts of interest in the elaboration and publication of this work.

Abbreviations

PSPower spectrum
PSDPower spectral density
ANOVAAnalysis of variance
WMWeierstrass–Mandelbrot
FFTFast Fourier transform
IFTInverse fast Fourier transform
DTFDiscrete Fourier transform
F ( · ) Fourier transform
F 1 ( · ) Inverse Fourier transform
Re ( · ) Real part
Im ( · ) Imaginary part
· Ceiling
< · > Expected value
Γ ( · ) Gamma function
1 F 2 ( · ) generalised hypergeometric function
x, y, z, ξ , ψ Coordinates in physical domain
x i , y j , ξ k , ψ l Points in physical domain
ρ Radius in physical domain
z ( x , y ) Fractal function on a square domain
w ( x , y , z ) Density function in physical space
z 0 ,   w 0 Arbitrary   values   for   z ( x , y ) ,   w ( x , y , z )
z 2 variance of z ( x , y )
λ u Longest wavelength in physical space
λ l Shortest wavelength in physical space
kWave vector in the frequency domain
k, lCoordinates in the frequency domain
k i , l j Points in the frequency domain
rRadial coordinate in the frequency domain
r l upper cut-off radius in frequency domain
r u upper cut-off radius in frequency domain
DFractal dimension
HHurst exponent
CArbitrary normalisation constant
GFractal roughness (WM)
γ Lacunarity (WM)
P ( r ) Power spectrum as a function of r
R ( ξ , ψ ) Autocorrelation of z ( x , y )
γ ( ξ , ψ ) Structure function of z ( x , y )
< z 2 > i n Input   value   of   z 2   for   simulation
σ 0 Input value of standard deviation
H i n Input value of H for simulation
S Surface generation method (generic)
WMWeierstrass–Mandelbrot (method)
MPRandom midpoint (method)
FTFourier transform (method)
{ g i } Set of random numbers (generic)
z S ( x , y ) Random   surface   generated   by   S
z S ( x i , y j ) individual   values   of   z S ( x , y )
2 n 0 Number   of   x ,   y   values   in   z S ( x i , y j )
μ S Mean   value   of   z S ( x , y )
< z 2 > S Variance   of   z S ( x , y )
M, NNumber of terms in WM
{ g i j } 2 n 0 × 2 n 0 matrix of Gaussian values
{ g i j ^ } Fourier   transform   of   { g i j }
φ n m Random phase angles in WM
M Method to determine H (generic)
BCMBox counting methods (generic)
DBCDifferential box counting (method)
DTFDetrended fluctuation (method)
TPMTriangular prism method
RLMRoughness–length method
PSAPower spectrum analysis (method)
SFAStructure–function analysis (method)
aSize of square sub-domain
m 2 Number of sub-domains
z Range   of   z S ( x i , y j ) over entire domain
M i Maximum of z S ( x i , y j ) in sub-domain i.
m i Minimum of z S ( x i , y j ) in sub-domain i.
f ( x , y ) Regression function for DTF
v ¯ a Residual variance for sub-domain size a
c 00 , c 10 , c 01 , c 20 , c 02 , c 11 , a 0 , a 1 parameters
s T o t Total surface determined in TPM
P S ( k , l ) ,   P S ( r ) Power   spectrum   for   method   S
R S ( x , y ) ,   R S ( ρ ) Autocorrelation for method S
γ S ( ρ ) Structure   function   for   method   S
r T h r Treshold radius for low-pass filter
H S M Estimated   H   for   S ,   M ,   and   H i n
H c r r Corrected   H   for   S ,   M ,   and   H i n
B S M Bias   for   methods   S   and   M
σ o u t Standard   error   on   H S M (dispersion)
σ P r e d Standard   error   on   H c r r (precision)
R2Coefficient of determination

References

  1. Longuet-Higgins, M.S. The statistical analysis of a random, moving surface. Philos. Trans. R. Soc. Lond. Ser. A Math. Phys. Sci. 1957, 249, 321–387. [Google Scholar] [CrossRef]
  2. Longuet-Higgins, M.S. Statistical properties of an isotropic random surface. Philos. Trans. R. Soc. Lond. Ser. A Math. Phys. Sci. 1957, 250, 157–174. [Google Scholar] [CrossRef]
  3. Whitehouse, D.J.; Archard, J.F. The properties of random surfaces of significance in their contact. Proc. R. Soc. Lond. Ser. A. Math. Phys. Sci. 1970, 316, 97–121. [Google Scholar] [CrossRef]
  4. Pfeifer, P. Fractal dimension as working tool for surface-roughness problems. Appl. Surf. Sci. 1984, 18, 146–164. [Google Scholar] [CrossRef]
  5. Pfeifer, P.; Wu, Y.J.; Cole, M.W.; Krim, J. Multilayer adsorption on a fractally rough surface. Phys. Rev. Lett. 1989, 62, 1997–2000. [Google Scholar] [CrossRef]
  6. Wang, C.L.; Krim, J.; Toney, M.F. Roughness and porosity characterization of carbon and magnetic films through adsorption isotherm measurements. J. Vac. Sci. Technol. A 1989, 7, 2481–2485. [Google Scholar] [CrossRef]
  7. Majumdar, A.; Bhushan, B. Role of Fractal Geometry in Roughness Characterization and Contact Mechanics of Surfaces. J. Tribol. 1990, 112, 205–216. [Google Scholar] [CrossRef]
  8. Mandelbrot, B.B. The Fractal Geometry of Nature; WH Freeman: New York, NY, USA, 1982. [Google Scholar]
  9. Richardson, L.F. The problem of contiguity: An appendix to statistics of deadly quarrels. Gen. Syst. Yearb. 1961, 6, 139–187. [Google Scholar]
  10. Mandelbrot, B. How long is the coast of Britain? Statistical self-similarity and fractional dimension. Science 1967, 156, 636–638. [Google Scholar] [CrossRef]
  11. Plancherel, M.; Leffler, M. Contribution à ľétude de la représentation d’une fonction arbitraire par des intégrales définies. Rend. Circ. Mat. Palermo (1884–1940) 1910, 30, 289–335. [Google Scholar] [CrossRef]
  12. Gujrati, A.; Khanal, S.R.; Pastewka, L.; Jacobs, T.D.B. Combining TEM, AFM, and profilometry for quantitative topography characterization across all scales. ACS Appl. Mater. Interfaces 2018, 10, 29169–29178. [Google Scholar] [CrossRef] [PubMed]
  13. Gujrati, A.; Sanner, A.; Khanal, S.R.; Moldovan, N.; Zeng, H.; Pastewka, L.; Jacobs, T.D.B. Comprehensive topography characterization of polycrystalline diamond coatings. Surf. Topogr. Metrol. Prop. 2021, 9, 014003. [Google Scholar] [CrossRef]
  14. Philcox, O.H.E.; Torquato, S. Disordered Heterogeneous Universe: Galaxy Distribution and Clustering across Length Scales. Phys. Rev. X 2023, 13, 011038. [Google Scholar] [CrossRef]
  15. Smith, M.W. Roughness in the Earth Sciences. Earth-Sci. Rev. 2014, 136, 202–225. [Google Scholar] [CrossRef]
  16. Pardo-Igúzquiza, E.; Dowd, P.A. Fractal analysis of the Martian landscape: A study of kilometre-scale topographic roughness. Icarus 2022, 372, 114727. [Google Scholar] [CrossRef]
  17. Pardo-Igúzquiza, E.; Dowd, P.A. The roughness of Martian topography: A metre-scale fractal analysis of six selected areas. Icarus 2022, 384, 115109. [Google Scholar] [CrossRef]
  18. Persson, B.N.J. On the fractal dimension of rough surfaces. Tribol. Lett. 2014, 54, 99–106. [Google Scholar] [CrossRef]
  19. Theiler, J. Estimating fractal dimension. J. Opt. Soc. Am. A 1990, 7, 1055–1073. [Google Scholar] [CrossRef]
  20. Falconer, K. Fractal Geometry: Mathematical Foundations and Applications; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
  21. Bandt, C.; Hung, N.; Rao, H. On the open set condition for self-similar fractals. Proc Amer. Math. Soc. 2006, 134, 1369–1374. [Google Scholar] [CrossRef]
  22. Panigrahy, C.; Seal, A.; Mahato, N.K. Quantitative texture measurement of gray-scale images: Fractal dimension using an improved differential box counting method. Measurement 2019, 147, 106859. [Google Scholar] [CrossRef]
  23. Nayak, S.R.; Mishra, J. An improved method to estimate the fractal dimension of colour images. Perspect. Sci. 2016, 8, 412–416. [Google Scholar] [CrossRef]
  24. Nayak, S.R.; Mishra, J.; Palai, G. Analysing roughness of surface through fractal dimension: A review. Image Vis. Comput. 2019, 89, 21–34. [Google Scholar] [CrossRef]
  25. Liang, H.; Tsuei, M.; Abbott, N.; You, F. AI framework with computational box counting and Integer programming removes quantization error in fractal dimension analysis of optical images. Chem. Eng. J. 2022, 446, 137058. [Google Scholar] [CrossRef]
  26. Coles, P.; Barrow, J.D. Non-Gaussian statistics and the microwave background radiation. Mon. Not. R. Astron. Soc. 1987, 228, 407–426. [Google Scholar] [CrossRef]
  27. Heavens, A.F.; Sheth, R.K. The correlation of peaks in the microwave background. Mon. Not. R. Astron. Soc. 1999, 310, 1062–1070. [Google Scholar] [CrossRef]
  28. Wang, R.; Singh, A.K.; Kolan, S.R.; Tsotsas, E. Fractal analysis of aggregates: Correlation between the 2D and 3D box-counting fractal dimension and power law fractal dimension. Chaos Solitons Fractals 2022, 160, 112246. [Google Scholar] [CrossRef]
  29. Florindo, J.B.; Bruno, O.M. Closed contour fractal dimension estimation by the Fourier transform. Chaos Solitons Fractals 2011, 44, 851–861. [Google Scholar] [CrossRef]
  30. Ausloos, M.; Berman, D.H. A multivariate Weierstrass–Mandelbrot function. Proc. R. Soc. Lond. Ser. A. Math. Phys. Sci. 1985, 400, 331–350. [Google Scholar] [CrossRef]
  31. Yan, W.; Komvopoulos, K. Contact analysis of elastic-plastic fractal surfaces. J. Appl. Phys. 1998, 84, 3617–3624. [Google Scholar] [CrossRef]
  32. Zhang, X.; Xu, Y.; Jackson, R.L. An analysis of generated fractal and measured rough surfaces in regards to their multi-scale structure and fractal dimension. Tribol. Int. 2017, 105, 94–101. [Google Scholar] [CrossRef]
  33. Xiao, H.; Sun, Y.; Chen, Z. Fractal modeling of normal contact stiffness for rough surface contact considering the elastic–plastic deformation. J. Braz. Soc. Mech. Sci. Eng. 2019, 41, 11. [Google Scholar] [CrossRef]
  34. Wei, D.; Zhai, C.; Hanaor, D.; Gan, Y. Contact behaviour of simulated rough spheres generated with spherical harmonics. Int. J. Solids Struct. 2020, 193–194, 54–68. [Google Scholar] [CrossRef]
  35. Yu, X.; Sun, Y.; Zhao, D.; Wu, S. A revised contact stiffness model of rough curved surfaces based on the length scale. Tribol. Int. 2021, 164, 107206. [Google Scholar] [CrossRef]
  36. Liu, L.; Shi, Y.; Hu, F. Application of the Weierstrass–Mandelbrot function to the simulation of atmospheric scalar turbulence: A study for carbon dioxide. Fractals 2022, 30, 2250086. [Google Scholar] [CrossRef]
  37. Kant, K.; Sibin, K.; Pitchumani, R. Novel fractal-textured solar absorber surfaces for concentrated solar power. Sol. Energy Mater. Sol. Cells 2022, 248, 112010. [Google Scholar] [CrossRef]
  38. Shen, F.; Li, Y.-H.; Ke, L.-L. A novel fractal contact model based on size distribution law. Int. J. Mech. Sci. 2023, 249, 108255. [Google Scholar] [CrossRef]
  39. Saupe, D. Algorithms for random fractals. In The Science of Fractal Images; Peitgen, H.O., Saupe, D., Eds.; Springer: New York, NY, USA, 1988; pp. 71–136. [Google Scholar]
  40. Lam, N.S.-N.; Qiu, H.-L.; Quattrochi, D.A.; Emerson, C.W. An evaluation of fractal methods for characterizing image complexity. Cartogr. Geogr. Inf. Sci. 2002, 29, 25–35. [Google Scholar] [CrossRef]
  41. Zhou, G.; Lam, N.S.-N. A comparison of fractal dimension estimators based on multiple surface generation algorithms. Comput. Geosci. 2005, 31, 1260–1269. [Google Scholar] [CrossRef]
  42. Burger, H.; Forsbach, F.; Popov, V.L. Boundary Element Method for Tangential Contact of a Coated Elastic Half-Space. Machines 2023, 11, 694. [Google Scholar] [CrossRef]
  43. Yastrebov, V.A.; Anciaux, G.; Molinari, J.-F. The role of the roughness spectral breadth in elastic contact of rough surfaces. J. Mech. Phys. Solids 2017, 107, 469–493. [Google Scholar] [CrossRef]
  44. Hu, Y.Z.; Tonder, K. Simulation of 3-D random rough surface by 2-D digital filter and Fourier analysis. Int. J. Mach. Tools Manuf. 1992, 32, 83–90. [Google Scholar] [CrossRef]
  45. Yastrebov, V.A.; Anciaux, G.; Molinari, J.-F. From infinitesimal to full contact between rough surfaces: Evolution of the contact area. Int. J. Solids Struct. 2015, 52, 83–102. [Google Scholar] [CrossRef]
  46. Foroutan-Pour, K.; Dutilleul, P.; Smith, D. Advances in the implementation of the box-counting method of fractal dimension estimation. Appl. Math. Comput. 1999, 105, 195–210. [Google Scholar] [CrossRef]
  47. Wu, J.; Jin, X.; Mi, S.; Tang, J. An effective method to compute the box-counting dimension based on the mathematical definition and intervals. Results Eng. 2020, 6, 100106. [Google Scholar] [CrossRef]
  48. So, G.-B.; So, H.-R.; Jin, G.-G. Enhancement of the Box-Counting Algorithm for fractal dimension estimation. Pattern Recognit. Lett. 2017, 98, 53–58. [Google Scholar] [CrossRef]
  49. Schouwenaars, R.; Jacobo, V.H.; Ortiz, A. The effect of vertical scaling on the estimation of the fractal dimension of randomly rough surfaces. Appl. Surf. Sci. 2017, 425, 838–846. [Google Scholar] [CrossRef]
  50. Wu, M.; Wang, W.; Shi, D.; Song, Z.; Li, M.; Luo, Y. Improved box-counting methods to directly estimate the fractal dimension of a rough surface. Measurement 2021, 177, 109303. [Google Scholar] [CrossRef]
  51. Liu, C.; Zhan, Y.; Deng, Q.; Qiu, Y.; Zhang, A. An improved differential box counting method to measure fractal dimensions for pavement surface skid resistance evaluation. Measurement 2021, 178, 109376. [Google Scholar] [CrossRef]
  52. Gneiting, T.; Ševčíková, H.; Percival, D.B. Estimators of fractal dimension: Assessing the roughness of time series and spatial data. Stat. Sci. 2012, 27, 247–277. [Google Scholar] [CrossRef]
  53. Hall, P.; Wood, A. On the performance of box-counting estimators of fractal dimension. Biometrika 1993, 80, 246–251. [Google Scholar] [CrossRef]
  54. Clarke, K.C. Computation of the fractal dimension of topographic surfaces using the triangular prism surface area method. Comput. Geosci. 1986, 12, 713–722. [Google Scholar] [CrossRef]
  55. De Santis, A.; Fedi, M.; Quarta, T. A revisitation of the triangular prism surface area method for estimating the fractal dimension of fractal surfaces. Ann. Geophys. 1997, 15, 811–821. [Google Scholar] [CrossRef]
  56. Kantelhardt, J.W.; Zschiegner, S.A.; Koscielny-Bunde, E.; Havlin, S.; Bunde, A.; Stanley, H. Multifractal detrended fluctuation analysis of nonstationary time series. Phys. A Stat. Mech. Its Appl. 2002, 316, 87–114. [Google Scholar] [CrossRef]
  57. Gu, G.-F.; Zhou, W.-X. Detrended fluctuation analysis for fractals and multifractals in higher dimensions. Phys. Rev. E 2006, 74, 061104. [Google Scholar] [CrossRef]
  58. Gu, G.-F.; Zhou, W.-X. Detrending moving average algorithm for multifractals. Phys. Rev. E 2010, 82, 011136. [Google Scholar] [CrossRef]
  59. Malinverno, A. A simple method to estimate the fractal dimension of a self-affine series. Geophys. Res. Lett. 1990, 17, 1953–1956. [Google Scholar] [CrossRef]
  60. Kulatilake, P.H.S.W.; Ankah, M.L.Y. Rock Joint Roughness Measurement and Quantification—A Review of the Current Status. Geotechnics 2023, 3, 116–141. [Google Scholar] [CrossRef]
  61. Wen, R.; Sinding-Larsen, R. Uncertainty in fractal dimension estimated from power spectra and variograms. Math. Geol. 1997, 29, 727–753. [Google Scholar] [CrossRef]
  62. Kondev, J.; Henley, C.L.; Salinas, D.G. Nonlinear measures for characterizing rough surface morphologies. Phys. Rev. E 2000, 61, 104–125. [Google Scholar] [CrossRef]
  63. Jiang, K.; Liu, Z.; Tian, Y.; Zhang, T.; Yang, C. An estimation method of fractal parameters on rough surfaces based on the exact spectral moment using artificial neural network. Chaos Solitons Fractals 2022, 161, 112366. [Google Scholar] [CrossRef]
  64. Bhushan, B. Surface roughness analysis and measurement techniques. In Modern Tribology Handbook; Bhushan, B., Ed.; two volume set; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar]
  65. Cheng, Q.; Agterberg, F. Fractal Geometry in Geosciences. In Encyclopedia of Mathematical Geosciences; Springer International Publishing: Cham, Switzerland, 2022; pp. 1–24. [Google Scholar]
  66. Wang, H.; Chi, G.; Jia, Y.; Ge, C.; Yu, F.; Wang, Z.; Wang, Y. Surface roughness evaluation and morphology reconstruction of electrical discharge machining by frequency spectral analysis. Measurement 2020, 172, 108879. [Google Scholar] [CrossRef]
  67. Eftekhari, L.; Raoufi, D.; Eshraghi, M.J.; Ghasemi, M. Power spectral density-based fractal analyses of sputtered yttria-stabilized zirconia thin films. Semicond. Sci. Technol. 2022, 37, 105011. [Google Scholar] [CrossRef]
  68. Kruger, A. Implementation of a fast box-counting algorithm. Comput. Phys. Commun. 1996, 98, 224–234. [Google Scholar] [CrossRef]
  69. Marcotte, D. Fast variogram computation with FFT. Comput. Geosci. 1996, 22, 1175–1186. [Google Scholar] [CrossRef]
  70. Zuo, X.; Peng, M.; Zhou, Y. Influence of noise on the fractal dimension of measured surface topography. Measurement 2020, 152, 107311. [Google Scholar] [CrossRef]
  71. Wu, J.-J. Structure function and spectral density of fractal profiles. Chaos Solitons Fractals 2001, 12, 2481–2492. [Google Scholar] [CrossRef]
  72. Nayak, P.R. Random Process Model of Rough Surfaces. J. Lubr. Technol. 1971, 93, 398–407. [Google Scholar] [CrossRef]
  73. Greenwood, J.A. A unified theory of surface roughness. Proc. R. Soc. Lond. Ser. A. Math. Phys. Sci. 1984, 393, 133–157. [Google Scholar] [CrossRef]
Figure 1. Representation of the process which creates H S M for a given input value H i n through a surface generating algorithm S and a method M . By the random input { g i } , z S ( x , y ) is a random process, and H S M is a random value.
Figure 1. Representation of the process which creates H S M for a given input value H i n through a surface generating algorithm S and a method M . By the random input { g i } , z S ( x , y ) is a random process, and H S M is a random value.
Fractalfract 08 00152 g001
Figure 2. Height maps z S ( x , y ) , power spectra P S ( k , l ) , and autocorrelation R S ( x , y ) (see Section 2.5) for S = W M (N = 18, M = 23), S = M P , and S = F T ; n 0 = 10 .
Figure 2. Height maps z S ( x , y ) , power spectra P S ( k , l ) , and autocorrelation R S ( x , y ) (see Section 2.5) for S = W M (N = 18, M = 23), S = M P , and S = F T ; n 0 = 10 .
Fractalfract 08 00152 g002
Figure 3. (a) Original concept of BCM as a method to estimate the Minkowski dimension of a fractal contour. The grey boxes contain part of the curve and are counted in n m . (b) BCM with vertical scaling. The boxes marked by * and ** are missed because the curve passes through the box, but there are no data points inside the box due to a lack of vertical resolution. (c) DBC allows for a vertical shift and counts the total range in individual intervals. Box ** is still missed.
Figure 3. (a) Original concept of BCM as a method to estimate the Minkowski dimension of a fractal contour. The grey boxes contain part of the curve and are counted in n m . (b) BCM with vertical scaling. The boxes marked by * and ** are missed because the curve passes through the box, but there are no data points inside the box due to a lack of vertical resolution. (c) DBC allows for a vertical shift and counts the total range in individual intervals. Box ** is still missed.
Fractalfract 08 00152 g003
Figure 4. (a) Original concept of the yardstick method to estimate the Hausdorff dimension of a fractal contour. The contour is approximated by lines of a fixed length. (b) Trapezium method [39] for the estimation of the fractal dimension of a time series. The length of the lines between the dots is approximated by the length of the blue lines over intervals of width a in successive refinements. By Pythagoras’ theorem, the method is sensitive to vertical scaling, and the finite resolution of the data means that the limit for a 0 can only be approached crudely.
Figure 4. (a) Original concept of the yardstick method to estimate the Hausdorff dimension of a fractal contour. The contour is approximated by lines of a fixed length. (b) Trapezium method [39] for the estimation of the fractal dimension of a time series. The length of the lines between the dots is approximated by the length of the blue lines over intervals of width a in successive refinements. By Pythagoras’ theorem, the method is sensitive to vertical scaling, and the finite resolution of the data means that the limit for a 0 can only be approached crudely.
Fractalfract 08 00152 g004
Figure 5. (a) Illustration of RLM for a time series. The vertical variation (standard deviation) is marked as dashed lines relative to the mean (blue line). (b) Cumulative data (black) with superposed piecewise fitting curves (blue, Equation (9)), according to DBC. (c) Fitting residuals, showing the residual standard deviation (dashed line) for each interval. Standard deviations are shown for plotting purposes; RLM and DBC use the corresponding variance.
Figure 5. (a) Illustration of RLM for a time series. The vertical variation (standard deviation) is marked as dashed lines relative to the mean (blue line). (b) Cumulative data (black) with superposed piecewise fitting curves (blue, Equation (9)), according to DBC. (c) Fitting residuals, showing the residual standard deviation (dashed line) for each interval. Standard deviations are shown for plotting purposes; RLM and DBC use the corresponding variance.
Fractalfract 08 00152 g005
Figure 6. (a) log 2 log 2 plot of P F T ( k , l j ) ,   j : 1 1025 for a surface with 1025 × 1025 points, together with the average for all 1025-line profiles. (b) Individual values and averages over all points with the same radius. (c) Filtered data. As the low-pass threshold radius r T h r decreases, dispersion is reduced, as is the linear part of the plot.
Figure 6. (a) log 2 log 2 plot of P F T ( k , l j ) ,   j : 1 1025 for a surface with 1025 × 1025 points, together with the average for all 1025-line profiles. (b) Individual values and averages over all points with the same radius. (c) Filtered data. As the low-pass threshold radius r T h r decreases, dispersion is reduced, as is the linear part of the plot.
Fractalfract 08 00152 g006
Figure 7. Filtered mean values of the log 2 log 2 plot for (a) P F T ( r ) ; (b) P M P ( r ) ; and (c) P W M ( r ) .
Figure 7. Filtered mean values of the log 2 log 2 plot for (a) P F T ( r ) ; (b) P M P ( r ) ; and (c) P W M ( r ) .
Fractalfract 08 00152 g007
Figure 8. (a) presents R ( ρ ) (Equation (15)) for H = 0.25, 0.5, 0.75, and S = F T . (b) shows a log 2 log 2 plot of the theoretical value (Equation (15)), with linear fits (dashed) in the ρ = 2 10 2 4 range. (c) shows the theoretical curves (th.) and simulated γ ( ρ ) (sim.). The latter shows considerable horizontal offsets. Variations in H are much smaller but not negligible.
Figure 8. (a) presents R ( ρ ) (Equation (15)) for H = 0.25, 0.5, 0.75, and S = F T . (b) shows a log 2 log 2 plot of the theoretical value (Equation (15)), with linear fits (dashed) in the ρ = 2 10 2 4 range. (c) shows the theoretical curves (th.) and simulated γ ( ρ ) (sim.). The latter shows considerable horizontal offsets. Variations in H are much smaller but not negligible.
Fractalfract 08 00152 g008
Figure 9. Plots of H F T M vs. H i n for three values of σ 0 and all M , with the ideal case H F T M = H i n marked as a red line. The methods are listed in order the precision σ P r e d .
Figure 9. Plots of H F T M vs. H i n for three values of σ 0 and all M , with the ideal case H F T M = H i n marked as a red line. The methods are listed in order the precision σ P r e d .
Fractalfract 08 00152 g009
Figure 10. Plots of H M P M vs. H i n for three values of σ 0 and all M . For DTF and TPM, the results depend significantly on σ 0 ; for DBC and SFA, the bias is much higher than in H F T M .
Figure 10. Plots of H M P M vs. H i n for three values of σ 0 and all M . For DTF and TPM, the results depend significantly on σ 0 ; for DBC and SFA, the bias is much higher than in H F T M .
Fractalfract 08 00152 g010
Table 1. Results of the regression analysis for n 0 = 10, σ 0 = 0.25, 0.5, and 1 for S = F T and M = D T F and DBC. The first three lines present the results for individual σ 0 . R2 is the coefficient of determination, and n is the number of simulations. The next line presents the results for the pooled values. Line 6 presents the ANOVA results for the effect of σ 0 ; F90% and P σ 0 are the 90% confidence level and p-value for the Fisher test, respectively. Line 8 gives the residual standard deviation σ P r e d , the 90% confidence interval for H c r r , and the p-value for the Kolmogorov–Smirnov test ( P K M ) applied to the residuals of the H i n vs. H S M regression line.
Table 1. Results of the regression analysis for n 0 = 10, σ 0 = 0.25, 0.5, and 1 for S = F T and M = D T F and DBC. The first three lines present the results for individual σ 0 . R2 is the coefficient of determination, and n is the number of simulations. The next line presents the results for the pooled values. Line 6 presents the ANOVA results for the effect of σ 0 ; F90% and P σ 0 are the 90% confidence level and p-value for the Fisher test, respectively. Line 8 gives the residual standard deviation σ P r e d , the 90% confidence interval for H c r r , and the p-value for the Kolmogorov–Smirnov test ( P K M ) applied to the residuals of the H i n vs. H S M regression line.
DTFa0a1soutR2n
σ 0 = 1 0.0140.8480.0460.968128
σ 0 = 0.5 0.0020.8520.0460.968128
σ 0 = 0.25 0.0020.8660.0440.972128
Pooled0.0060.8550.0450.969384
ANOVA   σ 0 F-ratioF90% P σ 0
1.0111.1390.458
PrecisionsPred90% CIPKM
0.052±0.0870.755
DBCa0a1σoutR2n
σ 0 = 1 0.3080.5940.0160.992128
σ 0 = 0.5 0.3050.6010.0180.99128
σ 0 = 0.25 0.3060.5980.0180.99128
Pooled0.3060.5970.0170.991384
ANOVA σF-ratioF90% P σ 0
1.0031.1390.488
PrecisionσPred90% CIPKM
0.028±0.0480.97
Table 2. Summary of the statistical results for S = F T and all M . The results are pooled over all σ 0 . Notice how σ o u t and the 90% CI on H c r r increase with decreasing n 0 .
Table 2. Summary of the statistical results for S = F T and all M . The results are pooled over all σ 0 . Notice how σ o u t and the 90% CI on H c r r increase with decreasing n 0 .
Method n 0 a0a1σoutR2 P σ 0 90% CI
DTF100.0020.8660.0440.9720.458±0.087
9−0.0040.8410.0490.9620.458±0.105
80.0020.8450.0690.9270.494±0.13
TPM100.1370.7440.030.9820.11±0.067
90.1530.7160.0330.9760.15±0.078
80.1750.6910.0390.9640.025±0.095
RLM100.1440.7410.0240.9880.487±0.053
90.1550.7230.0250.9860.472±0.06
80.1820.6920.0290.9790.481±0.071
DBC100.3060.5970.0170.9910.488±0.048
90.3380.5590.0180.9880.442±0.053
80.3750.520.0230.9780.489±0.074
SFA100.1070.7870.0160.9950.487±0.034
90.1280.7580.0180.9930.491±0.039
80.1530.7260.0180.9930.479±0.042
PSA100.0011.0010.00210.4134±0.0032
90.0031.0010.00410.4984±0.0068
80.0071.0070.0080.9990.4692±0.0137
Table 3. Summary of the statistical results for S = M P and all M . The results are pooled over all σ 0 . Generally, bias is higher, and precision is lower than in FT.
Table 3. Summary of the statistical results for S = M P and all M . The results are pooled over all σ 0 . Generally, bias is higher, and precision is lower than in FT.
Method n 0 a0a1σoutR2 P σ 0 90% CI
DTF10−0.0140.8840.0470.9670.004±0.089
9−0.0280.8740.050.9620.008±0.077
8−0.0420.860.0620.9420.011±0.064
TPM100.0340.8930.0290.9880.015±0.083
90.0330.8890.0340.9830.023±.064
80.0450.8590.040.9750.027±0.077
RLM100.0140.9220.0210.9940.468±0.039
90.0130.9170.0240.9920.477±.044
80.0170.9010.0320.9850.415±0.06
DBC100.2680.6410.0180.990.488±0.05
90.30.5990.0210.9860.479±0.059
80.3390.5460.0250.9750.376±0.079
SFA100.1480.5490.0470.9190.466±0.036
90.1550.5350.0430.9290.486±0.049
80.1650.5110.0370.9430.488±0.062
PSA10−0.0651.0380.0130.9980.446±0.022
9−0.0591.0460.020.9960.485±0.033
8−0.0331.0560.030.990.47±0.048
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Flores Alarcón, J.L.; Figueroa, C.G.; Jacobo, V.H.; Velázquez Villegas, F.; Schouwenaars, R. Statistical Study of the Bias and Precision for Six Estimation Methods for the Fractal Dimension of Randomly Rough Surfaces. Fractal Fract. 2024, 8, 152. https://doi.org/10.3390/fractalfract8030152

AMA Style

Flores Alarcón JL, Figueroa CG, Jacobo VH, Velázquez Villegas F, Schouwenaars R. Statistical Study of the Bias and Precision for Six Estimation Methods for the Fractal Dimension of Randomly Rough Surfaces. Fractal and Fractional. 2024; 8(3):152. https://doi.org/10.3390/fractalfract8030152

Chicago/Turabian Style

Flores Alarcón, Jorge Luis, Carlos Gabriel Figueroa, Víctor Hugo Jacobo, Fernando Velázquez Villegas, and Rafael Schouwenaars. 2024. "Statistical Study of the Bias and Precision for Six Estimation Methods for the Fractal Dimension of Randomly Rough Surfaces" Fractal and Fractional 8, no. 3: 152. https://doi.org/10.3390/fractalfract8030152

APA Style

Flores Alarcón, J. L., Figueroa, C. G., Jacobo, V. H., Velázquez Villegas, F., & Schouwenaars, R. (2024). Statistical Study of the Bias and Precision for Six Estimation Methods for the Fractal Dimension of Randomly Rough Surfaces. Fractal and Fractional, 8(3), 152. https://doi.org/10.3390/fractalfract8030152

Article Metrics

Back to TopTop