1. Introduction
In the field of numerical analysis, numerical methods for solving nonlinear equations play a significant role. Because analytical solutions are rarely available, iterative techniques are required to approximate the solutions. One of the foremost reasons to use numerical methods for solving nonlinear transcendental equations is their ability to handle non-analytic and complex functions. Such equations arise in diverse disciplines across science, engineering, and the applied sciences [1,2,3,4]. For example, in physics, nonlinear equations often describe the behavior of systems with multiple interacting components, such as the Navier–Stokes equations in fluid dynamics. In engineering, nonlinear equations are used to model the behavior of materials under different loads and conditions. The ability to handle large and complex systems is another essential reason to use numerical methods. Nonlinear equations generally describe the behavior of systems with many interacting components, and solving them analytically can be extremely difficult, if not impossible. Numerical methods provide a way to break these large systems down into smaller, more manageable parts and to find approximate solutions using iterative techniques.
A plethora of iterative methods are used for solving nonlinear transcendental equations, including fixed-point iteration, bracketing root-finding methods, and the Newton–Raphson method. Each method has its own strengths and limitations, and the selection of a method depends on the particular equation being solved and the required accuracy. For instance, the bisection method is one of the simplest and most robust methods for finding the roots of an equation, but it converges slowly and requires an initial interval that brackets a root. The Newton–Raphson method, on the other hand, is faster, but it requires the derivative of the function and may not converge for certain functions or starting points.
Moreover, the choice of numerical method depends on the specific equation being solved, the interval containing the solutions, the number of solutions, and the desired accuracy. For example, the bisection method is a good choice for isolating a solution in a given interval, while the Newton–Raphson method is better suited to refining a specific solution from an initial guess. In numerical optimization, root-finding methods are used to solve the nonlinear equations that describe the behavior of the system, which enables the design of more efficient and more robust algorithms. Several root-finding methods for solving nonlinear transcendental equations appear in the literature. Some common methods include:
- 1. The bisection method: a simple yet robust method that repeatedly bisects an interval and determines which subinterval a root lies in.
- 2. The Newton–Raphson method: uses an initial guess and an iterative process to converge on a root; it requires the ability to compute the derivative of the function.
- 3. The secant method: similar to the Newton–Raphson method but uses the slope of the secant line between two points rather than the derivative of the function.
- 4. Fixed-point iteration: finds a fixed point of a function by an iterative process; it requires the equation to be rewritten in a specific fixed-point form.
- 5. Muller's method: an extension of the secant method that can also locate complex roots.
- 6. Bairstow's method: used for finding the roots of polynomials with real coefficients, including polynomials of degree greater than two.
- 7. Aitken's delta-squared method: used for accelerating the convergence of fixed-point iteration.
- 8. Hybrid methods: as the name suggests, these combine two or more methods to find the root of a nonlinear equation.
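To make the first two entries of this list concrete, the following is a minimal Python sketch of the bisection and Newton–Raphson methods; the test function f(x) = x^3 - 2x - 5, the bracketing interval, and the tolerances are illustrative choices, not examples taken from this paper:

```python
def bisection(f, a, b, tol=1e-12, max_iter=200):
    """Bisection: repeatedly halve [a, b], keeping the half that brackets a root."""
    if f(a) * f(b) > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    for _ in range(max_iter):
        c = (a + b) / 2.0
        if f(c) == 0 or (b - a) / 2.0 < tol:
            return c
        if f(a) * f(c) < 0:
            b = c          # root lies in the left half
        else:
            a = c          # root lies in the right half
    return (a + b) / 2.0

def newton_raphson(f, df, x0, tol=1e-12, max_iter=100):
    """Newton-Raphson: x_{n+1} = x_n - f(x_n)/f'(x_n); needs the derivative df."""
    x = x0
    for _ in range(max_iter):
        step = f(x) / df(x)
        x -= step
        if abs(step) < tol:
            return x
    return x

f = lambda x: x**3 - 2.0 * x - 5.0
df = lambda x: 3.0 * x**2 - 2.0
root_bis = bisection(f, 2.0, 3.0)
root_new = newton_raphson(f, df, 2.0)
# Both locate the same real root near x = 2.0945514815
```

The sketch illustrates the trade-off stated above: bisection needs only sign changes, while Newton–Raphson needs the derivative but converges in far fewer iterations.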
As a workaround, iterative methods have been developed to approximate solutions of nonlinear equations of the form

F(x) = 0, (1)

where F is a Fréchet differentiable operator mapping a Banach space B1 into a Banach space B2, and D is a convex and open subset of B1. The determination of a solution x* of the equation, whose analytical form is rarely attainable, is very important in many disciplines [1,2,3,4]. This is the case since applications are formulated as an equation such as (1) using mathematical modeling [1,2,3,5]. This explains why iterative methods producing sequences that approximate x* are introduced. There is extensive literature on the convergence of iterative methods motivated by algebraic or geometrical considerations [3,5,6,7,8].
A widely used method to solve (1) is Newton's method (NM), which is defined for each n = 0, 1, 2, ... by

x_{n+1} = x_n − F′(x_n)^{−1} F(x_n). (2)

NM uses one function evaluation and one inverse per iteration, and it is of convergence order two [5]. It is always important to develop iterative methods of higher convergence order, as they provide an efficient approximation and more accuracy in finding the solution. There is a plethora of such methods (see [9,10,11,12,13,14] and the references therein) proposed by various researchers.
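In finite dimensions, the Newton step is computed by solving a linear system with the Jacobian rather than forming its inverse explicitly. The following Python/NumPy sketch performs one function evaluation and one linear solve per iteration; the 2 x 2 test system is an illustrative assumption, not one of the paper's examples:

```python
import numpy as np

def newton_system(F, J, x0, tol=1e-12, max_iter=50):
    """Newton's method for F(x) = 0: one F evaluation and one linear solve per step."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        # Solve J(x) * step = F(x) instead of inverting J(x).
        step = np.linalg.solve(J(x), F(x))
        x = x - step
        if np.linalg.norm(step) < tol:
            break
    return x

# Illustrative system: x^2 + y^2 = 1 and y = x, with a root at (1/sqrt(2), 1/sqrt(2)).
F = lambda v: np.array([v[0]**2 + v[1]**2 - 1.0, v[1] - v[0]])
J = lambda v: np.array([[2.0 * v[0], 2.0 * v[1]], [-1.0, 1.0]])
root = newton_system(F, J, [1.0, 0.5])
# root is approximately [0.70710678, 0.70710678]
```

Solving the linear system instead of inverting the Jacobian is both cheaper and numerically safer, and it mirrors the "one inverse per iteration" cost accounting used above.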
In particular, we investigate the convergence of a fourth-order method, defined for each n = 0, 1, 2, ... by a two-substep iteration (3) whose weights are built from sequences of nonnegative parameters, where k is a natural number. The authors in [8,9], motivated by the quadrature formula, studied the local convergence of this method utilizing the Taylor series expansion of the operator F in a special case of the parameters, where m is a natural number. The benefits over other methods of the same convergence order were also explained in [8]. The convergence is established under differentiability assumptions on the operator. However, these results assure the convergence only in case the operator is five times differentiable, although the method may converge under weaker assumptions. Let us look at a simple example: one can clearly see that the results in [8,9] do not apply when the higher-order derivative involved is unbounded on the domain. Other problems include:
- (1) The uniqueness of the solution region is not provided.
- (2) The choice of the starting point is a "shot in the dark".
- (3) There are no error estimates that can be computed in advance based on the properties of the operator F.
- (4) The semilocal convergence of the method has not been studied.
- (5) The derivatives of order higher than one used in the local convergence analysis do not appear in the method.
It is worth noticing that the aforementioned problems appear in numerous other methods. These problems motivate the writing of this paper. In particular, we positively address all of these problems by utilizing only the operators appearing on the method and very general continuity conditions on the operator [1,7]. In the case of the semilocal convergence, the concept of majorizing sequences is employed [1,6,7]. The idea of this paper can also be applied to other methods [6,15,16,17] analogously, since it depends only on the inverse of the operators involved and not on the method itself [12]. Moreover, see the related papers [18,19,20,21].
The paper is structured as follows: the local convergence in Section 2 is followed by the semilocal convergence in Section 3. The numerical applications and concluding remarks, appearing in Section 4 and Section 5, respectively, complete the paper.
2. Convergence I: Local
We denote the interval by M for brevity. Suppose: the first scalar function has a smallest root in M; a second scalar function is then defined, which also has a smallest root in M. Then, in Theorem 1 the parameter r, given in terms of these roots, is proven to be a radius of convergence for the method (3).
Set .
It is implied by these definitions that for each
and
The sets B(x, ρ) and B̄(x, ρ) denote, respectively, the open and closed balls with center x and of radius ρ > 0.
The parameter r and the scalar functions, among them w, are connected to the operator F as follows, provided that x* denotes a solution of the Equation (1).
for each .
Set
for each .
and
.
The local convergence of the method (3) follows next, based on the preceding terminology and conditions.
Theorem 1. Suppose the conditions – hold. Then, the sequence is convergent to provided that the starting point .
Proof. We shall establish using induction the assertions
and
with
r,
, and
as previously defined.
By applying the condition
for
, we obatin, in turn, by (
4) and (
5)
The Banach lemma for invertible linear operators [1,2,3,16] and the estimate (11) imply that the linear operator is invertible and the bound (12) holds. In particular, for n = 0 in (12), the first iterate is well defined, and we can write the corresponding identity by the first substep of the method (3) for n = 0. In view of (4), (8), (12) (each for n = 0), and (13), we have, in turn, that
Thus, the iterate belongs to the ball, and the assertion (9) holds for n = 0. Moreover, the next iterate is well defined by the second substep of the method (3) for n = 0. Similarly, we first obtain the analogous estimates; hence, the iterate belongs to the ball, and the assertion (10) holds for n = 0.
By switching the preceding iterates with the subsequent ones in these calculations, the induction for the assertions (9) and (10) is completed. Therefore, the resulting estimate, whose factor is less than one, gives the convergence of the iterates to x*, and the limit remains in the closed ball. □
The uniqueness of the solution region is determined in the next result.
Proposition 1. Suppose:
- (1) A solution of the equation exists for some radius.
- (2) The continuity condition holds on the corresponding ball.
- (3) There exists a radius such that the associated scalar condition holds.
Set the region accordingly. Then, the equation (1) is uniquely solvable by x* in this region.
Proof. Let us define the linear operator T as the mean value of the derivative along the segment joining the two solutions. It follows by the conditions (1)–(3) that T is invertible; thus, the two solutions coincide. □
Remark 1. We can choose provided that all hypotheses – of the Theorem 1 hold.
3. Convergence II: Semilocal
We still rely on the generalized continuity of the derivative, but a scalar majorizing sequence is also employed.
Let the given scalar functions be nondecreasing and continuous (NCFs). Given the initial parameters, define the scalar sequences by the recurrences (20). These scalar sequences are shown to be majorizing for the method (3). However, first, some general convergence conditions are needed for them.
Lemma 1. Suppose that there exists a bound such that the condition (21) holds for each n. Then, the sequences given by the formula (20) are convergent to some limit. Proof. The Formula (20) and the Condition (21) imply that the sequences are nondecreasing and bounded from above. Hence, the result follows. □
Remark 2.
- (a) If the function is strictly increasing on the interval, then we can choose the bound accordingly.
- (b) If the smallest positive root of the function exists, then we can set the bound equal to it.
The functions, among them v, and the parameters relate to the operators F and F′, provided that the starting point satisfies the stated conditions.
Suppose:
for each .
Set , where is the smallest positive root of the function .
for each .
and
.
The semilocal convergence follows for the method (3).
Theorem 2. Suppose that the conditions – hold. Then, the sequence is convergent to some solving the equation and such that Proof. The following assertions are shown using induction.
and
The assertion (23) holds for n = 0 by the choice of the initial parameters and the first substep of the method (3). It follows that the iterate
belongs to the ball. By switching the indices in the corresponding conditions and estimates, we obtain
the analogous bounds. We can write, by the second substep of the method (3), the next identity; thus, the corresponding estimates follow. Hence, the iterate belongs to the ball and (23) holds. We can write, by the first substep of the method (3), in turn, the analogous identity; thus, and consequently, we obtain the corresponding bounds. Hence, the induction is completed and the iterate remains in the ball.
It follows by Lemma 1 and the condition that the scalar sequences are convergent and, hence, Cauchy. Then, by (23) and (24), the iterate sequences are also Cauchy and, as such, they are convergent to some limit. Moreover, by letting n → ∞ in (28) and using the continuity of the operator F, we deduce that the limit solves the equation. Furthermore, for an arbitrary integer, using the estimation (29) and passing to the limit, we conclude that (22) holds. □
Next, the uniqueness region is provided.
Proposition 2. Suppose:
- (1) There exists a solution of the Equation (1) for some radius.
- (2) The condition holds on the corresponding ball.
- (3) There exists a radius such that the associated scalar condition holds.
Set the region accordingly. Then, the equation is uniquely solvable in this region.
Proof. As in Proposition 1, define the linear operator for some point of the ball with the stated property. Then, the invertibility follows in turn from the conditions, and thus we conclude again that the two solutions coincide. □
Remark 3.
- (i) Under all the conditions of Theorem 2, we can make the natural choices for the point and the radius.
- (ii) The condition can be replaced by a condition given in closed form.
4. Examples and Numerical Calculations
Numerical experiments are essential for validating and verifying theoretical results. This section comprises six numerical problems, based on three applied science problems, to check the theoretical results obtained in the preceding sections. Two types of convergence analysis are the focus: local and semilocal.
In order to evaluate the effectiveness of the method (3), some applications are simulated and the results are analyzed. In particular, the residual errors, the number of iterations, the convergence radii, and the expected order of convergence are computed. The computational order of convergence (COC) is approximated by

COC ≈ ln(‖x_{n+1} − x*‖ / ‖x_n − x*‖) / ln(‖x_n − x*‖ / ‖x_{n−1} − x*‖),

and the approximate computational order of convergence (ACOC), which does not require the exact solution, by

ACOC ≈ ln(‖x_{n+1} − x_n‖ / ‖x_n − x_{n−1}‖) / ln(‖x_n − x_{n−1}‖ / ‖x_{n−1} − x_{n−2}‖).

We observe that the iterations terminate when the error is sufficiently small, according to the following stopping criteria:
- (i) ‖x_{n+1} − x_n‖ < ε and
- (ii) ‖F(x_n)‖ < ε,
where ε denotes the error tolerance. The stopping criteria ensure that the computed approximations are accurate to a pre-decided level of precision. The numerical examples are simulated using the Mathematica 11 software.
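Assuming the standard difference-based definition of the ACOC, its computation can be sketched in Python; the iterates below are generated synthetically by Newton's method on f(x) = x^2 − 2 and are not data from the paper's tables:

```python
import math

def acoc(xs):
    """Approximate computational order of convergence from the last iterates."""
    e = [abs(xs[i + 1] - xs[i]) for i in range(len(xs) - 1)]  # successive differences
    return math.log(e[-1] / e[-2]) / math.log(e[-2] / e[-3])

# Synthetic Newton iterates for f(x) = x^2 - 2 starting at x0 = 2.0.
# Newton's method is of order two, so the ACOC should come out close to 2.
xs = [2.0]
for _ in range(4):
    x = xs[-1]
    xs.append(x - (x * x - 2.0) / (2.0 * x))
print(round(acoc(xs), 1))  # approximately 2.0
```

Only the last four iterates enter the formula, so in practice the ACOC is evaluated once the stopping criteria above are nearly met but before the differences reach machine precision.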
The first four examples are based on local convergence. In the last two, we apply the method (3) to discretized applied problems.
Example 1. Let and define F on D for by The first Fréchet derivative is given by Then, we find that , and . Then, taking , , the smallest positive roots of for are and . Then, the radius of convergence is given as .
Example 2. Let . Define F on by Then, clearly . For , , , and . The smallest positive roots of for are and . Then, the radius of convergence is given as .
Example 3. Consider the nonlinear integral equation of mixed Hammerstein type, where . Clearly, . For , , , , and . The smallest positive roots of for are and . Then, the radius of convergence is given as .
Example 4. Consider the function defined on by
The unique solution is . Then, we find that for , , , , and . Then, the smallest positive roots of for are and . Then, the radius of convergence is given as .
Example 5. Consider the following nonlinear partial differential equation, also known as the problem of molecular interaction, subject to the stated boundary conditions. Discretizing the PDE (30) by applying the central divided difference produces a system of nonlinear equations. For instance, we obtain a system of the indicated size. The COC, the number of iterations, the residual errors, the CPU timing, and the error difference between two iterations for Example 5 are mentioned in Table 1.
Example 6. Let us consider the following Van der Pol equation, which governs the flow of current in a vacuum tube, with the stated boundary conditions, and consider the partition of the given interval. If we discretize the above problem (31) by using the second-order divided differences for the first and second derivatives, then we obtain a system of nonlinear equations. The obtained results are depicted in Table 2. Method (3) converges to the following estimated zero:
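The discretize-then-solve pipeline used in Examples 5 and 6 can be sketched in Python/NumPy. As a stand-in for the Van der Pol system, which is not fully reproduced here, the sketch below uses the illustrative boundary value problem y'' = 6y^2, y(0) = 1, y(1) = 1/4, whose exact solution 1/(1+x)^2 lets the result be checked; none of these choices come from the paper:

```python
import numpy as np

def solve_bvp_newton(f, dfdy, a, b, ya, yb, n, tol=1e-12, max_iter=50):
    """Discretize y'' = f(x, y) with central differences on n interior nodes
    and solve the resulting nonlinear system with Newton's method."""
    h = (b - a) / (n + 1)
    x = np.linspace(a, b, n + 2)[1:-1]      # interior grid points
    y = np.linspace(ya, yb, n + 2)[1:-1]    # linear initial guess
    for _ in range(max_iter):
        # Residual: (y_{i-1} - 2 y_i + y_{i+1}) / h^2 - f(x_i, y_i) = 0
        ypad = np.concatenate(([ya], y, [yb]))
        F = (ypad[:-2] - 2.0 * ypad[1:-1] + ypad[2:]) / h**2 - f(x, y)
        # Tridiagonal Jacobian of the residual
        J = (np.diag(np.full(n, -2.0 / h**2) - dfdy(x, y))
             + np.diag(np.full(n - 1, 1.0 / h**2), 1)
             + np.diag(np.full(n - 1, 1.0 / h**2), -1))
        step = np.linalg.solve(J, F)
        y = y - step
        if np.linalg.norm(step) < tol:
            break
    return x, y

# Test problem with a known solution: y'' = 6 y^2, y(0) = 1, y(1) = 1/4,
# whose exact solution is y(x) = 1/(1 + x)^2.
f = lambda x, y: 6.0 * y**2
dfdy = lambda x, y: 12.0 * y
x, y = solve_bvp_newton(f, dfdy, 0.0, 1.0, 1.0, 0.25, n=49)
err = np.max(np.abs(y - 1.0 / (1.0 + x)**2))  # O(h^2) discretization error
```

As in Example 6, the finite-difference stencil turns the differential equation into a system of n nonlinear equations, and each Newton step solves one tridiagonal linear system.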