1. Introduction
Compressed sensing (see, for example, [1,2,3,4,5,6,7]) was used to invert incomplete Fourier transforms in the context of sparse signal/image processing, and the $\ell_1$-norm was applied as a regularization for reconstructing an object from randomly selected incomplete frequency samples. Both the sparse regularization method and the compressed sensing method use the $\ell_1$-norm as a regularization to impose sparsity on the reconstructed signal under certain transforms. Because the models based on the $\ell_1$-norm are convex, they can be solved efficiently by available algorithms. Recently, the application of non-convex metrics as alternative approaches to the $\ell_1$-norm has been favored; see, for example, [8,9,10,11]. The main goal of this paper is to suggest the use of the Moreau envelope associated with the $\ell_0$-norm as a regularization. Note that the sparsity of a vector is originally measured by the $\ell_0$-norm of the vector, i.e., the number of its nonzero components. However, the $\ell_0$-norm is discontinuous at the origin, which is not appropriate from a computational point of view. The Moreau envelope of the $\ell_0$-norm is a Lipschitz surrogate of the $\ell_0$-norm, which is nonconvex. By [7], a local minimizer of a function that is the sum of a convex function and the Moreau envelope of the $\ell_0$-norm can be identified with a global minimizer of a convex function, which permits the algorithmic machinery of convex optimization to be used. For inverting incomplete Fourier transforms, the use of the envelope of the $\ell_0$-norm allows us to formulate a sparsity regularization model that can reduce artifacts and outliers in the reconstructed signal. It also allows us to design an efficient algorithm for the resulting nonconvex and nonsmooth optimization problem by means of a fixed-point formulation. Moreover, the link between this minimization problem and a related convex minimization problem permits us to prove convergence of our proposed algorithm. Furthermore, a connection with proximal/projection gradient methods is also provided by appealing to two key formulas.
2. A Sparse Regularization Model
In order to focus on the essential information to share, we adopt the same paper outline as in [6], and we assume the reader has some basic knowledge of monotone operator theory and convex analysis, as can be found, for example, in [12,13,14,15].
In what follows, we propose an extension of a sparse regularization model based on the Moreau envelope of the $\ell_0$-norm for inverting incomplete Fourier transforms considered in [6]. Likewise, relying on properties of the Moreau envelope of the $\ell_0$-norm, we obtain an equivalent formulation favorable for algorithmic development.
Given two Euclidean spaces of dimensions $N$ and $d$, a nonempty, closed and convex subset $Q \subseteq \mathbb{R}^d$ and a matrix $A \in \mathbb{R}^{d \times N}$, we are interested in this work in the following problem:
$$\text{find } y \in \mathbb{R}^N \text{ such that } Ay \in Q. \qquad (1)$$
This formalism is also at the heart of the modeling of many inverse problems, such as those posed by phase retrieval and other real-world problems; see [16] and references therein.
Our job is to describe the sparse regularization model for Equation (1) in order to obtain a sparse vector $y$. The $\ell_0$-norm, which counts the number of nonzero components of a vector $x \in \mathbb{R}^N$, is naturally used to measure its sparsity and is defined by
$$\|x\|_0 := \sum_{i=1}^{N} |x_i|_0, \qquad (2)$$
with $|x_i|_0 = 1$ if $x_i \neq 0$ and $|x_i|_0 = 0$ if $x_i = 0$.
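As a simple illustration (our own sketch, not part of the model itself; the function name l0_norm is ours), the $\ell_0$-norm can be evaluated as follows:

```python
import numpy as np

def l0_norm(x: np.ndarray) -> int:
    """Number of nonzero components of x (the l0-'norm')."""
    return int(np.count_nonzero(x))

# A vector with three nonzero entries has l0-norm 3.
x = np.array([0.0, 1.5, 0.0, -2.0, 0.0, 0.3])
print(l0_norm(x))  # 3
```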
Now, let $P_Q$ be the projection from $\mathbb{R}^d$ onto the set $Q$. Since the constraint $Ay \in Q$ is equivalent to the fact that $Ay = P_Q(Ay)$, we derive the following equivalent Lagrangian formulation
$$\min_{y \in \mathbb{R}^N} \Big\{ \|y\|_0 + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2 \Big\},$$
with $\lambda > 0$ a Lagrangian multiplier.
Both the non-convexity and the discontinuity of the $\ell_0$-norm at the origin lead to computational difficulties. To overcome these problems, we use a Lipschitz regularization of the $\ell_0$-norm given by its Moreau envelope. According to [14,17], for a positive number $\beta$, the Moreau envelope of $\|\cdot\|_0$ with index $\beta$ at $x \in \mathbb{R}^N$ is defined by
$$\operatorname{env}_{\beta}\|\cdot\|_0(x) := \min_{u \in \mathbb{R}^N} \Big\{ \frac{1}{2\beta}\,\|u - x\|^2 + \|u\|_0 \Big\}. \qquad (3)$$
The function $\operatorname{env}_{\beta}\|\cdot\|_0$ is continuous and locally convex near the origin. Moreover, as $\beta \to 0^{+}$, $\operatorname{env}_{\beta}\|\cdot\|_0(x) \to \|x\|_0$, so $\operatorname{env}_{\beta}\|\cdot\|_0$ is a good approximation of $\|\cdot\|_0$ when $\beta$ is small enough. Therefore, with an appropriate choice of the parameter $\beta$, $\operatorname{env}_{\beta}\|\cdot\|_0$ can be used as a measure of sparsity and allows us to avoid the drawbacks of $\|\cdot\|_0$.
For a fixed $\beta > 0$ and for $y \in \mathbb{R}^N$, we let
$$F_{\beta}(y) := \operatorname{env}_{\beta}\|\cdot\|_0(y) + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2, \qquad (4)$$
where $\lambda$ is a positive parameter.
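For intuition, the envelope (3) is separable across components and admits the closed form $\operatorname{env}_{\beta}\|\cdot\|_0(x) = \sum_{i=1}^{N} \min\big(x_i^2/(2\beta),\, 1\big)$, which the following sketch (our own illustration) evaluates, showing the approximation of $\|x\|_0$ as $\beta \to 0^{+}$:

```python
import numpy as np

def env_l0(x: np.ndarray, beta: float) -> float:
    """Moreau envelope of the l0-'norm' with index beta.

    Separable closed form: sum_i min(x_i**2 / (2*beta), 1).
    """
    return float(np.minimum(x**2 / (2.0 * beta), 1.0).sum())

x = np.array([0.0, 0.05, 2.0, -0.3])  # ||x||_0 = 3
for beta in (1.0, 0.1, 0.001):
    print(beta, env_l0(x, beta))  # tends to 3.0 as beta -> 0+
```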
To recover a sparse vector $y$ from (1), we now propose the following sparse regularization model based on the Moreau envelope of the $\ell_0$-norm:
$$\min_{y \in \mathbb{R}^N} \Big\{ \operatorname{env}_{\beta}\|\cdot\|_0(y) + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2 \Big\}. \qquad (5)$$
Since $\operatorname{env}_{\beta}\|\cdot\|_0$ is an approximation of $\|\cdot\|_0$, we expect that the proposed model enjoys nice properties and can be solved by efficient iteration algorithms. As was pointed out earlier, $\operatorname{env}_{\beta}\|\cdot\|_0$ is an excellent sparsity-promoting function; therefore, we adopt it in this paper.
We reformulate problem (5) to obtain a problem that is well suited and favorable for computation. Relying on definition (3) of $\operatorname{env}_{\beta}\|\cdot\|_0$ in problem (5), with the parameters $\beta$ and $\lambda$ being fixed, we introduce the following function
$$F(x, y) := \|x\|_0 + \frac{1}{2\beta}\,\|x - y\|^2 + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2, \qquad x, y \in \mathbb{R}^N. \qquad (6)$$
The non-convex function $F$ is a special case of those considered in [7]. We then consider the problem
$$\min_{(x, y) \in \mathbb{R}^N \times \mathbb{R}^N} F(x, y). \qquad (7)$$
Next, we prove that problems (5) and (7) are essentially equivalent. A global minimizer of any of these problems will also be called a solution of the problem. We first present a relation between $F$ and $\operatorname{env}_{\beta}\|\cdot\|_0$. Remember that, for $x \in \mathbb{R}^N$, the proximity operator of $\beta\|\cdot\|_0$ at $x$ is defined by
$$\operatorname{prox}_{\beta\|\cdot\|_0}(x) := \operatorname*{argmin}_{u \in \mathbb{R}^N} \Big\{ \frac{1}{2\beta}\,\|u - x\|^2 + \|u\|_0 \Big\}. \qquad (8)$$
Clearly, if $x^* \in \operatorname{prox}_{\beta\|\cdot\|_0}(y)$, then we have that
$$\operatorname{env}_{\beta}\|\cdot\|_0(y) = \frac{1}{2\beta}\,\|x^* - y\|^2 + \|x^*\|_0. \qquad (9)$$
By relation (9), we obtain
$$\min_{(x, y)} F(x, y) = \min_{y} \Big\{ \operatorname{env}_{\beta}\|\cdot\|_0(y) + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2 \Big\}.$$
We now give a direct proof of ([6], Proposition 1).
Proposition 1. Let $\beta > 0$ and $\lambda > 0$. A pair $(x^*, y^*)$ solves problem (7) if, and only if, $y^*$ solves problem (5), with $x^*$ verifying the relation $x^* \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^*)$. Proof. This follows directly from the following successive equalities:
$$\min_{(x, y)} F(x, y) = \min_{y} \min_{x} F(x, y) = \min_{y} \Big\{ \operatorname{env}_{\beta}\|\cdot\|_0(y) + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2 \Big\}.$$
□
Based on the fact that problems (5) and (7) are essentially equivalent, it suffices to establish that a local minimizer of the nonconvex problem (7) is a minimizer of a convex problem on a subdomain. To that end, we first present a convex optimization problem on a proper subdomain of $\mathbb{R}^N \times \mathbb{R}^N$ related to problem (7), and we recall the notion of the support of a vector $x \in \mathbb{R}^N$, denoted by $\operatorname{supp}(x)$, namely the index set on which the components of $x$ are nonzero, that is, $\operatorname{supp}(x) := \{ i \in \{1, \ldots, N\} : x_i \neq 0 \}$. Note that when the support of $x$ in problem (7) is specified, the non-convex problem (7) reduces to a convex one. Based on this observation, we introduce a convex function by
$$G(x, y) := \frac{1}{2\beta}\,\|x - y\|^2 + \frac{\lambda}{2}\,\|Ay - P_Q(Ay)\|^2, \qquad x, y \in \mathbb{R}^N.$$
Clearly, $F = \|\cdot\|_0 + G$, and $G$ is convex and differentiable on $\mathbb{R}^N \times \mathbb{R}^N$.
We now define, for a given index set $\mathbb{S} \subseteq \{1, \ldots, N\}$, a subspace of $\mathbb{R}^N \times \mathbb{R}^N$ by setting
$$D_{\mathbb{S}} := \{ (x, y) \in \mathbb{R}^N \times \mathbb{R}^N : \operatorname{supp}(x) \subseteq \mathbb{S} \}.$$
$D_{\mathbb{S}}$ is convex and closed (see [6]), and we consider then the minimization problem on $D_{\mathbb{S}}$ defined by
$$\min_{(x, y) \in D_{\mathbb{S}}} G(x, y). \qquad (13)$$
Problem (13) is convex, thanks to the convexity of both the function $G$ and the set $D_{\mathbb{S}}$. Next, we will show the equivalence between the non-convex problem (7) and the convex problem (13) with an appropriate choice of the index set $\mathbb{S}$. To this end, we investigate properties of the support set of certain sequences in $\mathbb{R}^N$ and, for a given index set $\mathbb{S}$, we define an operator $P_{\mathbb{S}}$ by
$$\big(P_{\mathbb{S}}(x)\big)_i := \begin{cases} x_i, & i \in \mathbb{S}, \\ 0, & i \notin \mathbb{S}. \end{cases}$$
This operator is indeed the orthogonal projection from $\mathbb{R}^N$ onto the subspace $\{ x \in \mathbb{R}^N : \operatorname{supp}(x) \subseteq \mathbb{S} \}$ (see [6], Lemma 3). A convenient identification of the proximity operator of the $\ell_0$-norm with the projection $P_{\mathbb{S}}$, together with some properties of the sequence generated by Algorithm (18) below, regarding the existence of an integer beyond which the support remains invariant, were developed in Lemmas 4–7 together with Proposition 2 of [6], and these results are still valid in our context.
Recall also the closed-form formula for the proximity operator of $\beta\|\cdot\|_0$. For all $x \in \mathbb{R}^N$ and $i \in \{1, \ldots, N\}$,
$$\big(\operatorname{prox}_{\beta\|\cdot\|_0}(x)\big)_i = \begin{cases} x_i, & |x_i| > \sqrt{2\beta}, \\ \{0, x_i\}, & |x_i| = \sqrt{2\beta}, \\ 0, & |x_i| < \sqrt{2\beta}. \end{cases}$$
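As an illustration (our own sketch, selecting the value $0$ on the tie set $|x_i| = \sqrt{2\beta}$), this hard-thresholding formula can be implemented as follows:

```python
import numpy as np

def prox_l0(x: np.ndarray, beta: float) -> np.ndarray:
    """Hard thresholding: a selection of prox of beta*||.||_0.

    Keeps x_i when |x_i| > sqrt(2*beta) and zeroes it otherwise
    (on the tie set |x_i| = sqrt(2*beta) we pick 0).
    """
    out = x.copy()
    out[np.abs(x) <= np.sqrt(2.0 * beta)] = 0.0
    return out

# threshold sqrt(2*0.125) = 0.5: entries 0.1 and -0.5 are zeroed.
print(prox_l0(np.array([0.1, -0.5, 2.0]), beta=0.125))  # [0. 0. 2.]
```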
A connection between problems (7) and (13) is given by the following result.
Theorem 1. Let $\beta > 0$ and $\lambda > 0$ be given. The pair $(x^*, y^*)$ is a local minimizer of the non-convex problem (7) if, and only if, $(x^*, y^*)$ is a minimizer of the convex problem (13) with $\mathbb{S} = \operatorname{supp}(x^*)$.
Proof. This follows directly by using ([6], Corollary 4.9) in our setting. □
Following the same lines as ([6], Propositions 1 and 3), we can identify and connect global and local minimizers of (7) with those of (5).
3. A Fixed Point Approach
We will propose an iterative method for finding a local minimizer of (7), relying on a fixed-point formulation. For all the facts we will use, we refer to [14].
Let us begin with a characterization of the solutions of the convex problem (13).
Proposition 2. Suppose $\mathbb{S} \subseteq \{1, \ldots, N\}$. If $\beta, \lambda > 0$, then problem (13) with a given $\mathbb{S}$ has a solution, and a pair $(x^*, y^*)$ solves (13) if, and only if,
$$x^* = P_{\mathbb{S}}(y^*), \qquad y^* + \beta\lambda\, A^{T}\big(Ay^* - P_Q(Ay^*)\big) = x^*. \qquad (16)$$
Proof. The existence of solutions follows from the closedness of $D_{\mathbb{S}}$ together with the coercivity of $G$ with respect to the second variable. On the other hand, the optimality condition of the minimization problem (13) reads as
$$0 \in \partial\big(G + \iota_{D_{\mathbb{S}}}\big)(x^*, y^*),$$
or, equivalently, the relations (16).
□
Application of both Theorem 1 and Proposition 2 leads to the following characterization of a local minimizer of problem (7).
Theorem 2. Let $\beta, \lambda > 0$ be fixed. A pair $(x^*, y^*)$ is a local minimizer of (7) if, and only if, $(x^*, y^*)$ verifies (16) with $\mathbb{S} = \operatorname{supp}(x^*)$.
Let us now give a fixed-point characterization of a local minimizer of (7).
Theorem 3. Let $\beta, \lambda > 0$ be fixed. If a pair $(x^*, y^*)$ is a local minimizer of (7), then $(x^*, y^*)$ satisfies the relations
$$x^* \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^*), \qquad y^* = \big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1}(x^*). \qquad (17)$$
Conversely, if a pair $(x^*, y^*)$ verifies (17), then $(x^*, y^*)$ is a local minimizer of (7).
Proof. The optimality condition of the minimization problem defining the second relation of (17) reads as
$$\frac{1}{\beta}\,(y^* - x^*) + \lambda\, A^{T}\big(Ay^* - P_Q(Ay^*)\big) = 0,$$
or, equivalently, the second relation of (16); the conclusion then follows from Theorem 2 and the identification of $P_{\mathbb{S}}$ with $\operatorname{prox}_{\beta\|\cdot\|_0}$ recalled above.
□
In view of Theorem 3, we propose the following explicit–implicit algorithm for solving problem (7): given $y^0 \in \mathbb{R}^N$, iterate
$$x^{k+1} \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^k), \qquad y^{k+1} + \beta\lambda\, A^{T}\big(Ay^{k+1} - P_Q(Ay^{k+1})\big) = x^{k+1}. \qquad (18)$$
When the projection $P_Q$ can be computed efficiently, the updates of both variables $x$ and $y$ in Algorithm (18) can be efficiently implemented at each iteration.
Proposition 3. If $\beta, \lambda > 0$, then the operator $I + \beta\lambda\, A^{T}(I - P_Q)A$ is invertible. Hence, the second part of (18) reads as $y^{k+1} = \big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1}(x^{k+1})$. Proof. It is well known that $A^{T}(I - P_Q)A$ is a maximal monotone operator (more precisely, it is an inverse strongly monotone operator; see the beginning of the proof of Theorem 4), hence $I + \beta\lambda\, A^{T}(I - P_Q)A$ is invertible by Minty's Theorem, and its inverse, which is the so-called resolvent operator, is single-valued and firmly nonexpansive. □
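To make the scheme concrete, here is a small numerical sketch (our own illustration, with hypothetical names such as algorithm_18; the inner loop evaluating the resolvent is one possible choice, contractive when $\beta\lambda\|A\|^2 < 1$):

```python
import numpy as np

def prox_l0(x, beta):
    """Hard thresholding (a selection of prox of beta*||.||_0)."""
    out = x.copy()
    out[np.abs(x) <= np.sqrt(2.0 * beta)] = 0.0
    return out

def algorithm_18(A, proj_Q, y0, beta, lam, n_outer=200, n_inner=50):
    """Explicit-implicit iteration (18).

    x-step: hard thresholding of y^k (explicit).
    y-step: resolvent (I + beta*lam*A^T(I - P_Q)A)^{-1}(x^{k+1}),
    evaluated here by an inner fixed-point loop.
    """
    y, c = y0.copy(), beta * lam
    for _ in range(n_outer):
        x = prox_l0(y, beta)
        for _ in range(n_inner):  # solve y + c*A^T(Ay - P_Q(Ay)) = x
            y = x - c * (A.T @ (A @ y - proj_Q(A @ y)))
    return x, y

# Toy example: Q is the box [-1, 1]^d, A a small random matrix.
rng = np.random.default_rng(0)
A = rng.standard_normal((5, 20)) / 5.0
proj_Q = lambda z: np.clip(z, -1.0, 1.0)
x_star, y_star = algorithm_18(A, proj_Q, rng.standard_normal(20),
                              beta=0.05, lam=2.0)
print(np.count_nonzero(x_star))  # the recovered vector is sparse
```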
4. Convergence Analysis
In this section, we investigate the convergence behavior of Algorithm (18). As in [6], after a finite number of iterations, the support of the sparse variable $x^k$ defined by Algorithm (18) remains unchanged; hence, solving the non-convex optimization problem (7) by Algorithm (18) reduces to solving a convex optimization problem on the support.
First, we consider a function $E$, which is closely related to both functions $F$ and $G$, and we define
$$E(w) := \frac{1}{2}\,\|w - P_Q(w)\|^2, \qquad w \in \mathbb{R}^d;$$
the composition $E \circ A$ will be denoted for short by $E_A$.
Now, we prove a convergence result for Algorithm (18).
Theorem 4. Let $(x^k, y^k)$ be a sequence generated by Algorithm (18) with an initial point $y^0$ for problem (7). If $\beta, \lambda$ are positive numbers, then we have the following properties:
1. $F(x^{k+1}, y^{k+1}) \le F(x^k, y^k)$ for all $k \ge 1$, and the sequence $\big(F(x^k, y^k)\big)$ is convergent;
2. The sequence $(x^k, y^k)$ has a finite length, namely $\sum_{k} \|(x^{k+1}, y^{k+1}) - (x^k, y^k)\|^2 < +\infty$;
3. The sequence $(x^k, y^k)$ is asymptotically regular, that is, $\lim_{k \to \infty} \|(x^{k+1}, y^{k+1}) - (x^k, y^k)\| = 0$.
Proof. The function $E$ is differentiable with a 1-Lipschitz continuous gradient. Indeed, $\nabla E = I - P_Q$, which, $P_Q$ being firmly nonexpansive, is itself firmly nonexpansive (this is the inverse strong monotonicity invoked in Proposition 3); this ensures that $\nabla E$ is 1-Lipschitz continuous. Moreover, the characterization of the orthogonal projection, namely
$$\langle w - P_Q(w),\, q - P_Q(w) \rangle \le 0 \quad \text{for all } w \in \mathbb{R}^d,\ q \in Q,$$
assures that $E$ is convex, and the celebrated descent lemma (see, for example, [12]) yields, for all $y, z \in \mathbb{R}^N$,
$$E_A(z) \le E_A(y) + \langle \nabla E_A(y),\, z - y \rangle + \frac{\|A\|^2}{2}\,\|z - y\|^2,$$
where $\nabla E_A = A^{T}(I - P_Q)A$ is Lipschitz continuous with constant $\|A\|^2$.
On the other hand, since $x^{k+1} \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^k)$, the definition of the proximity operator, with $x^k$ as a competitor, gives
$$\|x^{k+1}\|_0 + \frac{1}{2\beta}\,\|x^{k+1} - y^k\|^2 \le \|x^k\|_0 + \frac{1}{2\beta}\,\|x^k - y^k\|^2,$$
that is, $F(x^{k+1}, y^k) \le F(x^k, y^k)$.
Now, by using the second equation of (18) and by taking into account the fact that the resolvent $\big(I + \beta\lambda \nabla E_A\big)^{-1}$ is single-valued and firmly nonexpansive, we see that $y^{k+1}$ is the unique minimizer of the $\tfrac{1}{\beta}$-strongly convex function $y \mapsto \frac{1}{2\beta}\,\|y - x^{k+1}\|^2 + \lambda E_A(y)$. Therefore,
$$\frac{1}{2\beta}\,\|y^{k+1} - x^{k+1}\|^2 + \lambda E_A(y^{k+1}) \le \frac{1}{2\beta}\,\|y^k - x^{k+1}\|^2 + \lambda E_A(y^k) - \frac{1}{2\beta}\,\|y^{k+1} - y^k\|^2,$$
that is, $F(x^{k+1}, y^{k+1}) \le F(x^{k+1}, y^k) - \frac{1}{2\beta}\,\|y^{k+1} - y^k\|^2$. Combining the last two inequalities yields, for all $k \ge 1$,
$$F(x^{k+1}, y^{k+1}) \le F(x^k, y^k) - \frac{1}{2\beta}\,\|y^{k+1} - y^k\|^2.$$
To prove the first property, notice that the sequence $\big(F(x^k, y^k)\big)$ is thus non-increasing and bounded from below by zero; hence it is convergent.
By summing the last inequality and by taking into account the convergence of the sequence $\big(F(x^k, y^k)\big)$, we deduce first that $\sum_k \|y^{k+1} - y^k\|^2 < +\infty$. In addition, the second part of (18) can be read as $x^{k+1} = y^{k+1} + \beta\lambda\, A^{T}\big(Ay^{k+1} - P_Q(Ay^{k+1})\big)$, so that, $\nabla E_A$ being $\|A\|^2$-Lipschitz,
$$\|x^{k+1} - x^k\| \le \big(1 + \beta\lambda\,\|A\|^2\big)\,\|y^{k+1} - y^k\|, \qquad k \ge 1,$$
and the second property follows. The latter properties clearly ensure the asymptotic regularity of the sequence $(x^k, y^k)$. □
As in ([6], Lemma 12), in our setting we also have that the invariant support set of the sequence defined by Algorithm (18) exists for the nonconvex problem (7). Now, let us prove, more directly than in [6], the convergence of the sequence $(x^k, y^k)$ generated by (18), relying on averaged operators and the Krasnoselskii–Mann Theorem. Averaged mappings are convenient in studying the convergence of sequences generated by iterative algorithms for fixed-point problems, thanks to the following celebrated theorem; see, for example, [12,13].
Theorem 5 (Krasnoselskii–Mann Theorem). Let $M$ be averaged and assume $\operatorname{Fix} M \neq \emptyset$. Then, for any starting point $x^0$, the sequence $(M^k x^0)_{k \in \mathbb{N}}$ converges weakly to a fixed point of $M$.
Recall also the definitions of nonexpansive and averaged operators, which appear naturally when using iterative algorithms for solving fixed-point problems and which are commonly encountered in the literature; see, for instance, [13]. A mapping $T$ is said to be nonexpansive if $\|Tx - Ty\| \le \|x - y\|$ for all $x, y$, and firmly nonexpansive if $2T - I$ is nonexpansive or, equivalently, $\|Tx - Ty\|^2 \le \langle Tx - Ty,\, x - y \rangle$ for all $x, y$. It is well known that $T$ is firmly nonexpansive if and only if $T$ can be written as $T = \frac{1}{2}(I + S)$, where $S$ is nonexpansive. Recall also that a mapping $M$ is said to be averaged if it can be expressed as $M = (1 - \alpha)I + \alpha S$, with $S$ a nonexpansive mapping and $\alpha \in (0, 1)$. Thus, firmly nonexpansive mappings (e.g., projections onto convex, closed and nonempty subsets and resolvents of maximal monotone operators) are averaged mappings.
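As a small numerical illustration of Theorem 5 (our own sketch, not taken from the paper), the Krasnoselskii–Mann iteration applied to the $\tfrac{1}{2}$-averaged map built from the projection onto the unit ball converges to one of its fixed points, namely a point of the ball:

```python
import numpy as np

# M is 1/2-averaged: M = (1/2) I + (1/2) proj_ball, with proj_ball
# nonexpansive; Fix(M) = Fix(proj_ball) = the closed unit ball.
proj_ball = lambda x: x / max(1.0, float(np.linalg.norm(x)))
M = lambda x: 0.5 * x + 0.5 * proj_ball(x)

x = np.array([4.0, -3.0])
for _ in range(100):
    x = M(x)
print(x, np.linalg.norm(x))  # approx [0.8, -0.6], norm approx 1
```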
Mimicking the analysis in [6], $(x^*, y^*)$ is a solution of (13) if, and only if, it satisfies (16), and thus $y^*$ verifies
$$y^* = \big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1} \circ P_{\mathbb{S}}\,(y^*). \qquad (29)$$
Similarly, using the same arguments, we derive that the sequence $(y^k)$ generated by (18), once the support has stabilized, satisfies, for all such $k$,
$$y^{k+1} = \big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1} \circ P_{\mathbb{S}}\,(y^k).$$
It is well known that firmly nonexpansive mappings (including orthogonal projections onto closed convex nonempty subsets and resolvent mappings of maximal monotone operators) are averaged operators. In view of the fact that the composite of finitely many averaged mappings is averaged (see, for instance, [12]), and by applying the Krasnoselskii–Mann Theorem, we deduce the convergence of the sequence $(y^k)$ to a solution $y^*$ of (29).
Moreover, $\operatorname{supp}(x^k) \subseteq \mathbb{S}$, which is closed, and we also have, in view of ([6], Lemma 7), $x^{k+1} = P_{\mathbb{S}}(y^k)$ for $k$ large enough. Since $P_{\mathbb{S}}$ is nonexpansive, for all such $k$ we can write
$$\|x^{k+2} - x^{k+1}\| \le \|y^{k+1} - y^k\|.$$
The sequence $(y^k)$, being convergent, is a Cauchy sequence, and thus so is the sequence $(x^k)$, which, in turn, converges to some limit $x^*$. Now, by passing to the limit in (18) and taking into account the continuity of the resolvent, we obtain $y^* = \big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1}(x^*)$, where $(x^*, y^*)$ is a solution of (16), which ensures that $(x^*, y^*)$ is a solution of (13) with $\mathbb{S} = \operatorname{supp}(x^*)$.
Based on the above considerations, we obtain the following theorem.
Theorem 6. Let $(x^k, y^k)$ be a sequence generated by (18) from an initial point $y^0$. If $\beta, \lambda > 0$ are chosen as above, then $(x^k, y^k)$ converges to a local minimizer of (7). Moreover, $\big(F(x^k, y^k)\big)$ is a convergent sequence and if, in addition, $x^* \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^*)$, then $y^*$ is a local minimizer of (5).
Finally, let us point out that, since $\big(I + \beta\lambda\, A^{T}(I - P_Q)A\big)^{-1} = \big(I + \beta\lambda \nabla E_A\big)^{-1} = \operatorname{prox}_{\beta\lambda E_A}$, the fixed point iteration in (18) turns into
$$y^{k+1} = \operatorname{prox}_{\beta\lambda E_A}\big(\operatorname{prox}_{\beta\|\cdot\|_0}(y^k)\big).$$
In particular, when $A = I$ and $\beta\lambda = 1$, this reduces to
$$y^{k+1} = \frac{1}{2}\big(x^{k+1} + P_Q(x^{k+1})\big), \qquad x^{k+1} \in \operatorname{prox}_{\beta\|\cdot\|_0}(y^k).$$
We used both the fact that, for any maximal monotone operator $B$, we have $(I + B_1)^{-1} = I - (B_1)_1$, $B_1$ being the Yosida operator of $B$ with parameter 1, and that, for all $\lambda, \mu > 0$, we have $(B_\lambda)_\mu = B_{\lambda + \mu}$. We used also the fact that the Yosida operator of the subdifferential of the indicator function of $Q$ with parameter 2 (the latter is nothing but the normal cone to $Q$) is exactly $\frac{1}{2}(I - P_Q)$.
Therefore, the proposed algorithms are nothing else than a proximal gradient and a projection gradient algorithm. Nevertheless, the crucial idea (namely, that the support of the sparse main variable generated by the algorithm remains unchanged after a finite number of iterations), which permits us to identify a local minimizer of the nonconvex optimization problem with a global minimizer of a convex one, deserves great interest.
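As a quick numerical sanity check (our own illustration) of the reduction above in the case $A = I$, $\beta\lambda = 1$ and $Q$ a box, the resolvent value $\big(I + (I - P_Q)\big)^{-1}x$, i.e., the solution $y$ of $y + (y - P_Q(y)) = x$, coincides with $\tfrac{1}{2}\big(x + P_Q(x)\big)$:

```python
import numpy as np

proj_Q = lambda z: np.clip(z, -1.0, 1.0)  # P_Q for the box [-1, 1]^d

rng = np.random.default_rng(1)
x = 3.0 * rng.standard_normal(4)
y = 0.5 * (x + proj_Q(x))                   # candidate resolvent value
print(np.allclose(y + (y - proj_Q(y)), x))  # True: y solves the equation
```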
Clearly, the analysis developed here can be extended to split feasibility problems, namely
$$\text{find } y \in C \text{ such that } Ay \in Q,$$
with $C \subseteq \mathbb{R}^N$, $Q \subseteq \mathbb{R}^d$ being two closed, convex subsets and $A$ a given matrix. Since the sum of a maximal monotone operator (the normal cone to $C$) and a monotone Lipschitz one (the operator $A^{T}(I - P_Q)A$) is still maximal monotone [14], this can be naturally extended to the following general minimization problem, by means of its regularized version:
$$\min_{y \in \mathbb{R}^N} \big\{ f(y) + g(Ay) \big\},$$
with $f, g$ being two proper, convex, lower semicontinuous functions defined on $\mathbb{R}^N$ and $\mathbb{R}^d$, respectively, and $A \in \mathbb{R}^{d \times N}$. The proximity operator of $g$ and the subdifferential of $f$ will then act as the projection onto the set $Q$ and the normal cone to $C$, respectively, since they share the same properties.