1. Introduction
It is common practice to approximate a solution
of the nonlinear problem (
1) using Newton’s method.
where
is a Fréchet differentiable operator between Banach spaces
and
, and
is an open convex set.
Several modifications of Newton’s method have been studied [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10] to accelerate the convergence (i.e., improve the convergence order) or to reduce the number of functional evaluations in each step (i.e., improve the computational efficiency). In [
1] (also see [
11]), Cordero and Torregrosa considered the following modifications of Newton’s method, called the Newton–Simpson (NS) method defined for
by
where
when
It is proved in [
1] that method (
2) is of order three. Recall that the iterative method has order of convergence
if for
The parameter
is called convergent rate.
The proof in [
1] required the operator
to be at least four times differentiable, which reduces the applicability of the NS method. The analysis in [
1] is based on Taylor expansion and is restricted to Euclidean space.
The NS method is studied in this paper for general Banach space setting, and our analysis does not depend on Taylor expansions. Hence, we do not require the assumptions on the derivatives of of order more than two. In fact, we obtained the convergence order three for the NS method using assumptions on the derivatives of of order up to two. Thus, our analysis improves the applicability of these methods.
For example: Let
Define
f on
by
Then, we get
and
Obviously
is not bounded on
Thus, the convergence of method (
2) is not guaranteed by the analyses in [
1], although it may converge.
The main contributions of this paper are: (1) we obtain the convergence order of three for method (
2) with assumptions on
and
thus the applicability of method (
2) is extended to problems involving operators whose first and second derivatives exist (the analysis in [
1] requires the operator to be differentiable at least four times); (2) we extend the method (
2) to a method with convergence order five (see (
3) below) and to a method with convergence order six (see (
4) below); (3) semilocal convergence of methods (
2)–(
4) are provided in this paper.
In
Section 2, we prove that the method (
2) is of order three. Furthermore, we extend the order of method (
2) to methods with order of convergence five in
Section 3, and with order of convergence six in
Section 4, using Cordero et al.’s [
12,
13] technique. The extended fifth order method is defined for
as follows:
and the extended sixth order method is defined for
as follows:
Semilocal convergences of methods (
2)–(
4) are given in
Section 5; numerical examples are given in
Section 6. The dynamics of the methods are given in
Section 7, and the paper ends with a conclusion in
Section 8.
5. Semilocal Convergence
We develop a common analysis based on scalar majorizing sequences and the concept of
continuity [
14,
15].
Let us first deal with the method (
2). Define the scalar sequences
and
using two continuous and nondecreasing functions
for
by
These sequences are shown to be majorizing for the method (
2) in Theorem 4. However, a general convergence result is presented for these methods.
Lemma 1. Suppose that for each there exists such thatThen, the sequences generated by the Formula (32) are convergent to some and Proof. It follows from the Formula (
32), the properties of the functions
, and the condition (
33), that sequences
are nondecreasing and bounded from above by
Hence, they are convergent to
□
Remark 1. (i) The parameter λ is the unique and common least upper bound of the sequences and
- (ii)
If the function is strictly increasing, then a possible choice for
- (iii)
Suppose that the function has a minimal zero Then, the function ω can be restricted on the interval and
Next, we connect the functions
the sequence (
32) and the limit point
to the operators on the method (
2).
Suppose:
- (c1)
There exists a starting point and a parameter such that and
- (c2)
for all
Set
- (c3)
for all
- (c4)
The condition (
33) holds for
and
- (c5)
We now have the tools to develop the semilocal convergence for the method (
2).
Theorem 4. Suppose that the conditions (c1)–(c5) hold. Then, the sequences developed by the method (2) are convergent to some solving the equation Proof. The verification of the following assertions is needed:
and
The method of mathematical induction is employed. From the condition (c1) and the Formula (
32), it follows that
Thus, the assertion (
34) holds if
and the iterate
Pick a point
Then, the application of the condition (c2) and the definition of
give
Hence, we get
and
We also need the estimate
so
Suppose that the assertions (
34) and (
35) hold ∀
The induction hypothesis gives
Furthermore, we can write in turn following the second substep of the method (
2)
We also get
In view of (
32), (
36) (for
), (
37)–(
39), we obtain
These estimates show that the iterate
and the assertion (
35) hold. Then, we can write following the first substep of the method (
2)
It follows from (
32), (c2), (c3) and (
41) that
Hence, the first substep of the method (
2), (
36) (for
) and (
42) give
and
Therefore the induction for the assertions (
34), (
35) is completed and
for all
By the condition (c4) and Lemma 1, the sequences
,
are Cauchy. Consequently, by (
34) and (
35), the sequences
are also Cauchy, and as such convergent to some
Finally, the continuity of the operator
and (
42) if
imply
□
A uniqueness region for the solution is satisfied.
Proposition 2. Suppose:
- (a)
There exists a simple solution of the equation for some .
- (b)
he condition (c2) holds on the ball .
- (c)
There exists such that
Set Then the equation is uniquely solvable by d in the region .
Proof. Let
with
. Define the linear operator
By applying (c2) and (
43), we get in turn that
so
□
Remark 2.
(a) The condition (c5) can be replaced by .
- (b)
Under all the conditions (c1)–(c5) we can choose and .
Similarly, we develop the semilocal convergence analysis of the method (3) and the method (4). A majorizing sequence
,
,
for the method (
3) is given by
Moreover, a majorizing sequence for the method (
4) is
Clearly, the corresponding convergence conditions to (
33) are, respectively,
These conditions replace (c4) respectively.
The limit point is not necessarily the same for all three methods, but in order to simplify the notation, we use the same symbol
. Under these modifications, we present the semilocal convergence of the method (
3) and the method (
4).
Theorem 5. Suppose that the conditions (c1)–(c5) hold. Then, there exists satisfying under the method (3). Proof. We proceed as in the proof of Theorem 4, but there are some differences. We get
so
Notice that
leading to
So,
and
The rest is identical to Theorem 4. □
Theorem 6. Suppose that the conditions (c1)–(c5) hold. Then, there exists satisfying under the method (4). Proof. The third substep of the method (
4) gives instead
The rest follows as in Theorem 5. □
Notice that the uniqueness of the solution region has been given in Proposition 2.
7. Basins of Attraction
In order to obtain a visual region of convergence, we study the Fatou sets and Julia set of the methods (
2)–(
4). Recall that for a sequence
produced by the above methods starting with
converging to
the set
is called the Basin of Attraction (BA) or Fatou set [
18] and
, the complement of
S, is known as a Julia set. The BAs associated with the roots of the three systems of equations are studied for the methods (
2)–(
4).
Example 4.
with solutions { }.
Example 5.
with solutions { }.
Example 6.
with solutions { }.
We consider the rectangular region and find the basins of attraction associated with a given root in R. Consider an equidistant grid of 401 × 401 points and consider each point as an initial point, then check whether the point gives convergence to any of these roots. A maximum of 50 iterations are performed for each of the points, and the point which does not give convergence with error tolerance of is considered a point at which the iterative function does not converge. Corresponding to each root, a color is assigned, and the initial points which give convergence to that root are marked with their respective colors. Dark regions represent the points which do not give convergence.
All the calculations in this paper were performed on a 16-core 64-bit Windows machine with Intel Core i7-10700 CPU @ 2.90GHz, using MATLAB.
In
Figure 1 (corresponding to Example 4), the red region is the set of all initial points from which the iterate (
2) converges to
the blue region is the set of all initial points from which the iterate (
2) converges to
and the green region is the set of all initial points from which the iterate (
2) converges to
The black region represents the Julia set.
In
Figure 2 (corresponding to Example 4), the red region is the set of all initial points from which the iterate (
3) converges to
the blue region is the set of all initial points from which the iterate (
3) converges to
and the green region is the set of all initial points from which the iterate (
3) converges to
The black region represents the Julia set.
In
Figure 3 (corresponding to Example 4), the red region is the set of all initial points from which the iterate (
4) converges to
the blue region is the set of all initial points from which the iterate (
4) converges to
and the green region is the set of all initial points from which the iterate (
4) converges to
The black region represents the Julia set.
In
Figure 4 (corresponding to Example 5), the red region is the set of all initial points from which the iterate (
2) converges to
the blue region is the set of all initial points from which the iterate (
2) converges to
and the green region is the set of all initial points from which the iterate (
2) converges to
The black region represents the Julia set.
In
Figure 5 (corresponding to Example 5), the red region is the set of all initial points from which the iterate (
3) converges to
the blue region is the set of all initial points from which the iterate (
3) converges to
and the green region is the set of all initial points from which the iterate (
3) converges to
The black region represents the Julia set.
In
Figure 6 (corresponding to Example 5), the red region is the set of all initial points from which the iterate (
4) converges to
the blue region is the set of all initial points from which the iterate (
4) converges to
and the green region is the set of all initial points from which the iterate (
4) converges to
The black region represents the Julia set.
In
Figure 7 (corresponding to Example 6), the red region is the set of all initial points from which the iterate (
2) converges to
the blue region is the set of all initial points from which the iterate (
2) converges to
the green region is the set of all initial points from which the iterate (
2) converges to
and the yellow region is the set of all initial points from which the iterate (
2) converges to
The black region represents the Julia set.
In
Figure 8 (corresponding to Example 6), the red region is the set of all initial points from which the iterate (
3) converges to
the blue region is the set of all initial points from which the iterate (
3) converges to
the green region is the set of all initial points from which the iterate (
3) converges to
and the yellow region is the set of all initial points from which the iterate (
3) converges to
The black region represents the Julia set.
In
Figure 9 (corresponding to Example 6), the red region is the set of all initial points from which the iterate (
4) converges to
the blue region is the set of all initial points from which the iterate (
4) converges to
the green region is the set of all initial points from which the iterate (
4) converges to
and the yellow region is the set of all initial points from which the iterate (
4) converges to
The black region represents the Julia set.