1. Introduction
Suppose
is an arbitrary function. Finding the zeros of
f is one of the main tasks that arise from real-world problems. Iteration methods are among the most efficient tools for approximating the zeros of
f. Undoubtedly, the most famous iteration methods in the literature are Newton’s method, Halley’s method [
1] and Chebyshev’s method [
2]. It is well known that Newton’s method converges quadratically, while Halley and Chebyshev’s methods converge cubically to simple zeros, but all of them converge only linearly for multiple zeros. In 1870, Schröder [
3] presented the following modification of Newton’s method:
which restores the quadratic convergence for zero with known multiplicity
. In 1963, Obreshkov [
4] developed the following modifications of Halley and Chebyshev’s methods:
which converge cubically to zeros with multiplicity
m. It is worth noting that, Traub [
5] has suggested using any method for
or
instead of
f to preserve its convergence order. For a more detailed historical survey about these famous methods, we refer the reader to the elegant papers of Ypma [
6], Scavo and Thoo [
7] and Ezquerro et al. [
8].
In 2009, Proinov [
9] established a local convergence theorem about Newton’s method (
1) for multiple zeros of analytic functions. Very recently, based on Proinov’s general convergence theory developed in [
9,
10,
11], Ivanov [
12] has proven a general convergence theorem about Picard iteration
where
is some iteration function and, as an application, has proven a local convergence result about Halley’s method (
2) for simple and multiple zeros of analytic functions.
In this paper, using the approach of [
12], we study the local convergence of Chebyshev’s method (
3) for simple and multiple zeros of analytic functions. Thus, we obtain a local convergence theorem (Theorem 3) that provides initial conditions and error estimates to guarantee the
Q-cubic convergence of Chebyshev’s method (
3). More precisely, our result provides the following:
Exact domains of initial approximations, that is, exact Q-convergence balls (Theorem 4), that ensure the Q-cubic convergence of Chebyshev’s method right from the first iteration;
A priory and a posteriori error estimates;
An assessment of the asymptotic error constant of the method.
To the best of our knowledge, there is no such convergence theorem about Chebyshev’s method (
3), except Theorem 2.1 of [
13], which, in fact, only detects the convergence order of the Chebyshev–Halley family (see [
14]), which, in turn, includes Chebyshev’s method (
3). So, Theorem 3 improves and complements the mentioned Osada’s theorem as regards Chebyshev’s method. Moreover, our results allow us to compare Chebyshev’s method with the famous Halley one, theoretically (see Remark 3). Note that, the present study is a kind of continuation of [
15], where a detailed local convergence analysis of Chebyshev’s method (
3) for simple and multiple polynomial zeros has been presented.
On the other hand, the dynamics of Chebyshev’s method has been widely studied for simple polynomial zeros (see [
16,
17,
18,
19] and references therein). Here, we present the basins of attraction of Chebyshev’s method (
3) applied to polynomial and transcendental equations arising from some real-world problems. Our examples (see Examples 4 and 5) show that neither Chebyshev’s method nor Halley’s one satisfies the ‘nearest root principal’ when applied to irrational or transcendental equations instead to polynomials with only simple zeros. A comparison with Halley’s method (
2) based on the convergence behavior and dynamics of the two methods is also provided.
2. Preliminaries
Let
be analytic in a neighborhood of
, which is a zero of
f with multiplicity
, then we define the function
with
where
is defined by (see Yakoubsohn [
20])
The following definition introduces a wide class of iteration functions associated with the function
E defined by (
5).
Definition 1 ([
12] Definition 3.1)
. The function is called gamma iteration function at the point if there exists a quasi-homogeneous function of degree , such that for each with , it follows thatwhere the function E is defined by (5). The function ϕ shall be called a control function of T. The following general convergence theorem shall be used for the proof of our main result:
Theorem 1 ([
12] Theorem 3.4)
. Let be a gamma iteration function at the point with control function ϕ of the exact degree , and be an initial guess satisfyingThen, ξ is a unique fixed point of T, Picard iteration (4) is well defined and converges to ξ with Q-order , and for all , we have the following error estimates:where . In addition, if for sufficiently large k, then we have the following estimate of the asymptotic error constant: Definition 2 ([
12] Definition 3.5)
. Let and . The set is said to be a Q-convergence ball (with center ξ and radius ρ) of some iterative method if for any initial point , the iterative sequence generated by the method is well defined and converges to ξ with a Q-order of at least . The next theorem provides a
Q-convergence ball of Picard iteration (
4).
Theorem 2 ([
12] Theorem 3.6)
. Let be a gamma iteration function at the point with a control function ϕ of the exact degree , and R be the minimal positive solution of the equation . Then, is a Q-convergence ball of Picard iteration (4) with Q-order and with error estimates (9). Also, we have the following estimate of the asymptotic error constant: We end this section with two technical lemmas that play a crucial role in the proofs of our results.
Lemma 1 ([
12] Lemma 4.1)
. Let , be a zero of f with multiplicity , and , where the function is defined by (5). Then, the following estimates hold true:where the numbers and l are defined bywith defined by Remark 1. As for the numbers defined by (11), it is not hard to verify that . Lemma 2 ([
12] Lemma 4.2)
. Let and be a zero of f with multiplicity . If f is analytic around ξ, then we can define and c with (11) and the following claims hold true:- (i)
If is not a zero of f and , then we have - (ii)
If is not a zero of and , then we have
3. Local Convergence Analysis
In this section, we present a comprehensive local convergence analysis of Chebyshev’s method for simple and multiple zeros of analytic functions,
where Chebyshev’s iteration function
is defined by
In what follows, for
, we define the number
with
and the real functions
and
with
and
Note that
is the unique solution of the equation
in the interval
, and the function
is quasi-homogeneous of exact second degree ([
21] [Definition 8]) in
.
The following is our main lemma in this paper.
Lemma 3. Let , be a zero of f with multiplicity , and f be analytic around ξ. Then Chebyshev’s iteration function (13) is a gamma iteration function at the point ξ with a control function ϕ defined by (17). Proof. Define the function
E with (
5) and the number
with (
14). Let
be such that
; then, we have
where the function
H is defined by (
15). Now, we need to prove that
where the function
is defined by (
17). Let
. Define the numbers
a,
b,
c and
l with (
11). Then, from the triangle inequality, the second inequality of (
10) and (
18), we obtain
which means that
, and so, according to Lemma 2 (i), we deduce that
.
Now, it remains to prove (
19). From the definition of Chebyshev’s iteration function (
13) and Lemma 2, we obtain
where (see Remark 1)
From this, the triangle inequality and the estimates (
10) and (
20), we obtain
which, together with (
21), proves (
19) and completes the proof. □
The following is the main theorem of this paper:
Theorem 3. Let , be a zero of f with multiplicity , and f be analytic in a neighborhood of ξ. Suppose is an initial approximation satisfyingwhere the function E is defined by (5) and , with H and G defined by (15). Then, Chebyshev’s iteration (12) is well defined and converges Q-cubically to ξ, with the following error estimates for all :where and the function ϕ is defined by (17). Additionally, if for sufficiently large k, then the following estimate of the asymptotic error constant holds: Proof. Since
in the interval
, from
, we obtain
, which implies that
and
, where
is defined by (
14). This means that the initial condition (
24) implies that (
8) with
, and so, the proof follows immediately from Lemma 3 and Theorem 1. □
Remark 2. The initial condition (24) can be represented in the form , where R is the minimal positive solution of the equation in the interval , where τ is defined by (14). Note that such R exists because Ψ is continuous, and . Mindful of Theorem 2, we formulate Theorem 3 in the following equivalent form:
Theorem 4. Let be a zero of with multiplicity , and f be analytic in a neighborhood of ξ. If R is the minimal positive solution of the equation , where the function Ψ is the same as in Theorem 3, then is a Q-convergence ball that guarantees Q-cubical convergence of Chebyshev’s method with estimates (25) and (26). In the case of , i.e., when is a simple zero of f, from Theorem 3, we obtain the following result:
Theorem 5. Let be a simple zero of , which is supposed to be analytic in a neighborhood of ξ. Suppose is an initial approximation satisfyingwhere the function E is defined by (5) with and is the minimal positive solution of the equationThen, Chebyshev’s iteration (12) (for ) is well defined and converges Q-cubically to ξ with error estimate (25), where the function ϕ is defined byAdditionally, if for sufficiently large k, then the following estimate of the asymptotic error constant holds: Proof. This proof follows directly from Theorem 3, since (
27) and (
24) are equivalent for
. □
Analogically to Theorem 4, we formulate Theorem 5 as a ball convergence one.
Theorem 6. Let be a simple zero of , which is supposed to be analytic in a neighborhood of ξ. Then, with defined by (27) is a Q-convergence ball that guarantees Q-cubical convergence of Chebyshev’s method with estimate (25), where ϕ is defined by (29), and the estimate of the asymptotic error constant (30). Remark 3. What is interesting is that in the cases of , from Theorem 3, we obtain theorems with larger convergence domains (balls) and better error estimates than Theorems 5 and 6. In fact, Theorem 3 gives the largest convergence domain when ξ is a zero of f with multiplicity . Bearing in mind Remark 2, the values of R for are given in Table 1 with three decimal places. In light of Smale’s concept of
approximate zero of the second kind ([
22,
23]), we give the following definition associated with Chebyshev’s method (
12).
Definition 3. Let and be a zero of f. The number is said to be an approximate zero of the second kind for Chebyshev’s method if the iterative sequence (12) is well defined and According to Definition 3, the next consequence of Theorem 4 provides a ball of approximate zeros of the second kind for the Chebyshev’s method in the case of .
Corollary 1. Let be a simple zero of a function , which is analytic in a neighborhood of ξ. Then, any is an approximate zero of the second kind for Chebyshev’s method, where is the minimal positive solution of the equation Hereinafter, we aim to compare the convergence properties of Chebyshev’s method (
12) and Halley’s one (
2) using Theorem 5 and Theorem 4.4 of [
12], which we formulate in the following way:
Theorem 7 ([
12] Theorem 4.4)
. Let , be a zero of f with multiplicity and f be analytic in a neighborhood of ξ. Suppose is an initial approximation, such thatwhere the function E is defined by (5) and is the minimal positive solution of the equation , with h and g defined byThen the Halley’s iteration (2) is well defined and converges Q-cubically to ξ with error estimate (25), where , and with the following estimate of the asymptotic error constant: Remark 4. One can see from Table 1 that, in the case of simple zeros, Theorem 7 gives a larger convergence domain and therefore better error estimates for Halley’s method than Theorem 5 for Chebyshev’s one. For all other multiplicities (), Chebyshev’s method behaves better than Halley’s one. On the other hand, it is easy to check that the estimate of the asymptotic error constant (33) is better than (26) for all . 4. Convergence and Dynamics in Some Real-World Applications
In this section, we apply Chebyshev’s method (
3) and Halley’s method (
2) for the approximation of the roots of six nonlinear equations arising from different real-world problems. A numerical comparison based on the convergence behavior and dynamics of the two methods is conducted within the examples. In addition, we wish to note that we are in the process of developing integrated physics, mathematics and art courses within STEAM (Science, Technology, Engineering, the Arts and Mathematics) education at the Bulgarian high-school level. More precisely, we propose implementing the Polynomiography approach (see [
24]) to high-school physics education in Bulgaria to increase students’ interest, demonstrating the beauty of science through the ‘chain’
Physics–Mathematics–Art. So, we intend to use the following examples within these courses.
Suppose
, and
is an initial approximation. In the examples below, we apply Chebyshev and Halley’s methods for the approximation of a zero
of
f and for obtaining their basins of attraction. For any example, the dynamical planes are plotted using a mesh of
points and 50 iterations. The yellow points in the figures depict the roots of the corresponding equation. For the convergence tests, we define the following error estimate
In the tables below, for any example, we present the approximated root and the estimates and for the two methods. All results in the tables are given with at least six decimal places. All calculations and figures are performed in CAS Wolfram Mathematica 8.0.
Example 1 (Adiabatic flame temperature [
25])
. It is known that the adiabatic energy balance can be presented by the following cubic equation:where the variable z represents the adiabatic flame temperature in Kelvin. Note that Equation (34) possesses the following roots: and . The basins of attraction of the roots of (34) are obtained by the Chebyshev and Halley’s methods in the squareand the initial guess is randomly taken from the same square. Despite the ‘rough’ choice of the initial guess, one can see from
Table 2 that the root
is found by the two methods only in the third iteration, with an error estimation of less than
for Chebyshev’s method and less than
for Halley’s one, while at the next iteration, the root is found with accuracy of
and
, respectively. The basins of attraction for this example are plotted in
Figure 1.
Example 2 (Chemical equilibrium [
25])
. The equilibrium conversion of nitrogen and hydrogen to ammonia starting with of hydrogen and 1 mol of nitrogen in reaction conditions of 250 atm and °C is described by the following polynomial equation:where z represents the moles of nitrogen reacted at equilibrium, which, for of nitrogen, also represents the fractional conversion of nitrogen. Equation (35) has four simple roots: , and . The basins of attraction of the roots of (35) are obtained by Chebyshev and Halley’s methods in the squareThe initial guess , again, is randomly taken from the same square. The numerical results for this example are given in
Table 3 and the basins of attraction are plotted in
Figure 2. It is seen in the table that Halley’s method behaves slightly better that Chebyshev’s one.
In the following example, we shall consider a well-known engineering problem that leads to a polynomial equation with multiple roots.
Example 3 (Deflection of horizontal beam [
26] [Section 5.1])
. Consider a 10-meter beam with embedded ends, i.e., the ends are not only supported at both ends, but are also clamped horizontally to something (for example, walls). The deflection of the beam is described by the following Euler–Bernoulli equation:where E is Young’s modulus of elasticity for the material, I is the moment of inertia of a cross-section of the beam and is the load per unit length, which is constant when the beam has a uniform cross-section and is supporting its own weight. Taking , and , as in [26] [Example 5.1], we obtain the following boundary value problem:with the solutionNote that f has two simple zeros of and , and a zero of with multiplicity .For this example, the basins of attraction of the zeros of (36) are obtained by Chebyshev and Halley’s methods in the rectangleThe initial guess is randomly taken from the same rectangle. The numerical results for this example are given in
Table 4 and the basins of attraction are plotted in
Figure 3.
Example 4 (Blood rheology [
27,
28])
. It is well known that the blood is modeled as Casson Fluid. The Casson fluid model says that basic fluids such as blood and water flow through a tube such that the velocity gradient near the wall of the tube and the center core of the fluids travel as plugs with minimal deformation. So, to calculate the plug flow of Casson fluids, we use the following irrational equationwith zeros of and . The basins of attraction for this example are obtained in the squareand the initial approximation is randomly taken from the same square. The numerical results for this example are given in
Table 5 and the basins of attraction are plotted in
Figure 4. What is interesting is that from one and the same initial approximation, the two methods converge to different roots. Taking a closer look at the initial guess and the approximated roots, one can see that neither Chebyshev’s method nor Halley’s satisfy the ‘nearest root principal’ (see [
17] and references therein) when applied to irrational equations. In fact, this is easily observable from
Figure 4, where the white points depict the initial guess
while the black ones are the approximated zeros.
In the rest of the examples, we shall consider two problems that lead to transcendental equations.
Example 5 (Fractional conversion in a reactor [
25])
.
The fractional chemical conversion in a reactor is proposed to be determined by the equationwith , where X denotes the fractional conversion of the limiting reactant. Equation (38) has two real roots, and , which can be easily used for finding X.For this example, the basins of attraction of the zeros of (38) are obtained in the rectangleThe initial approximation is randomly taken from the same rectangle. The numerical results and the dynamical planes for this example are given in
Table 6 and
Figure 5. One can see from the table that Halley’s method behaves considerably better than Chebyshev’s, while the figure shows that both methods do not satisfy the ‘nearest root principal’ when applied to transcendental equations.
Example 6 (Torsional vibrations of a ball [
29])
.
The natural vibrations of a spherical cavity located in an infinitely elastic medium are considered in this example. The transcendental equationdescribes the complex spectrum of the natural frequencies z of the torsional vibrations of a spherical cavity in a continuous elastic medium. It is easy to see that Equation (39) has an infinite number of real roots and singularities. Since the main difficulty of applying the two methods is the need for the existence of a second derivative, we enlarge the domain of (39), and so we shall consider the equationwhich has the same roots as (39) but no singularities. We shall seek the roots of (40) in the interval which are and . The basins of attraction of the zeros of (40) are obtained in the squarewhile the initial approximation is randomly taken from the squarebecause we wish to approximate the roots inside it but not outside. The numerical results and the dynamical planes for this example are given in
Table 7 and
Figure 6. It is easy to see that for this example, Halley’s method again behaves better than Chebyshev’s method.
5. Conclusions
A detailed local convergence analysis of Chebyshev’s method (
12) for simple and multiple zeros of analytic functions has been provided. As a result, a local convergence theorem (Theorem 3) that provides the exact domains of the initial conditions, together with a priori and a posteriori error estimates to guarantee the
Q-cubic convergence of Chebyshev’s method (
12) right from the first iteration, has been obtained. To the best of the authors’ knowledge, there are no existing results pertaining to this method, except the theorem of Osada [
13], which only detects the convergence order of the method. As consequences, two ball
Q-convergence theorems (Theorems 4 and 6) as well as a Smale-type result (Corollary 1) that provides a ball of approximate zeros of the second kind for Chebyshev’s method in the case of
, have been obtained.
Furthermore, theoretical and numerical comparisons between Chebyshev’s method and the famous Halley’s one have been conducted on the bases of Theorem 3 and the convergence behavior and dynamics of the two methods when applied to six real-world problems. The comparison shows that Chebyshev’s method (
12) has some theoretical advantages (see Remark 4) over Halley’s method (
2), but this is not the case in the numerical comparison, where, in almost all examples, Halley’s method behaves better than Chebyshev’s one. In addition, our examples (Examples 4 and 5) have shown that neither Chebyshev’s method nor Halley’s one satisfies the ‘nearest root principal’ when applied to irrational or transcendental equations. As for the basins of attraction, one can see that the two methods produce their own specific shapes in the plain, which hints that all iteration methods have their own ‘art style’. For example, in all figures produced by Chebyshev’s method, clover-like shapes are observed, while in those produced by Halley’s one, balloon-like shapes prevail. Along this line, we intend to further implement the provided examples, and others, in some new integrated physics, mathematics and art courses within STEAM education in Bulgarian high-schools in order to increase students’ interest by revealing the art in the science.
Finally, we can further extend the convergence analysis of Chebyshev’s method for operator equations in Banach spaces.