Review

What Do You Mean by “Nonlinear Eigenvalue Problems”?

by
Raffaele Chiappinelli
Dipartimento di Ingegneria dell’Informazione e Scienze Matematiche, Università di Siena, I-53100 Siena, Italy
Axioms 2018, 7(2), 39; https://doi.org/10.3390/axioms7020039
Submission received: 30 March 2018 / Revised: 5 June 2018 / Accepted: 6 June 2018 / Published: 9 June 2018

Abstract

A nonlinear eigenvalue problem is generally described by an equation of the form $F(\lambda, x) = 0$, where $F(\lambda, 0) = 0$ for all $\lambda$, and contains by definition two unknowns: the eigenvalue parameter $\lambda$ and the “nontrivial” vector(s) $x$ corresponding to it. The nonlinear dependence of $F$ can be in either of them (and of course in both), and research in this area accordingly seems to follow two quite different directions. In this review paper, we try to collect some points of possible common interest for both fields.
MSC:
Primary 47J10; Secondary 47A56

1. Introduction

Nonlinear eigenvalue problems are generally described by equations of the form
$$F(\lambda, x) = 0 \qquad (\lambda \in \mathbb{K},\ x \in E) \tag{1}$$
where $\mathbb{K}$ ($= \mathbb{R}$ or $\mathbb{C}$) is the field of real or complex numbers, and $E$ is a real or complex Banach space that can in particular be the $n$-space $\mathbb{R}^n$ or $\mathbb{C}^n$. In Equation (1), $F$ is a continuous map of $\mathbb{K} \times E$ into $E$, and it is assumed that $F(\lambda, 0) = 0$ for all scalars $\lambda$. That is to say, $x = 0$ trivially solves Equation (1) for all $\lambda$; one therefore looks for those $\lambda$’s (the eigenvalues of $F$) such that Equation (1) has a solution $x \neq 0$ (an eigenvector of $F$ corresponding to $\lambda$).
Of course, Equation (1) contains as a (very) special case the proper eigenvalue-eigenvector equation of Linear Algebra and Linear Functional Analysis,
$$F(\lambda, x) = Ax - \lambda x = (A - \lambda I)x = 0 \tag{2}$$
in which $A \in L(E)$, the space of all bounded linear operators acting in $E$, and $I$ is the identity map; to stress the linearity of $A$, we write as usual $Ax$ rather than $A(x)$. In addition to Equation (2), consider now the following special forms of Equation (1):
$$F(\lambda, x) = G(\lambda)x = 0, \tag{3}$$
and
$$F(\lambda, x) = A(x) - \lambda C(x) = 0. \tag{4}$$
Evidently, both Equations (3) and (4) encompass the classical case Equation (2). However, there is quite a difference between them: in Equation (3), $F$ depends linearly on $x$ and arbitrarily on $\lambda$, the latter dependence being driven by a map $G : \mathbb{K} \to L(E)$, while, in Equation (4), it is rather the opposite, for here it is the dependence on $x$ that is (possibly) nonlinear as dictated by the continuous maps $A, C : E \to E$. One first consequence is that the terms eigenvalue/eigenvector/eigenspace retain their usual significance in the case of Equation (3), while on the contrary they have in general little meaning in the case of Equation (4). On the other hand, in the latter case, assuming that $C(x) \neq 0$ for $x \neq 0$, the eigenvalue associated with an eigenvector is uniquely determined, for $A(\hat{x}) - \lambda_1 C(\hat{x}) = A(\hat{x}) - \lambda_2 C(\hat{x})$ with $\hat{x} \neq 0$ implies that $\lambda_1 = \lambda_2$, while in the former case this is not necessarily true.
In fact, in the past decades both Equations (3) and (4) have usually been referred to as Nonlinear Eigenvalue Problems. Those of the type in Equation (3), especially with $E = \mathbb{K}^n$, have been extensively studied in Numerical Analysis and Matrix Analysis (see, for instance, the review paper [1], where the abbreviation NLEVP is used to designate them), while problems of the type in Equation (4) have formed a main subject in Nonlinear Functional Analysis and its applications to differential equations, and are at the basis of, among others, Bifurcation Theory; see, for instance, the by now classic book [2] or the more recent [3].
Historically, the study of Nonlinear Eigenvalue Problems can be dated back to nearly one century ago, if we look in particular to the work—inspired by D. Hilbert and E. Landau—of E. Schmidt and A. Hammerstein, and subsequently of M. Golomb, on parameter-dependent nonlinear integral equations of the form
$$\lambda u(x) = \int_{\Omega} k(x, y)\, f(y, u(y))\, dy.$$
On the eastern side of Europe, the investigation of this kind of problem received a strong impulse in the former Soviet Union through the work of M.A. Krasnosel’skii and I.T. Gohberg. They were both pupils of M.G. Krein, and their subsequent work during many decades of the second half of the last century seems to have developed mainly on problems of the type in Equation (3) by Gohberg, and mainly on problems of the type in Equation (4) by Krasnosel’skii. For this reason, and in honor of these two true giants of Nonlinear Functional Analysis, I will often refer in the sequel to Equation (3) as describing problems of type G, and to Equation (4) as describing problems of type K.
The present paper does not contain new results in either field. It is rather a tentative review, whose main aim is to indicate some problems and methods followed in each of the two classes, with a view to possible future interactions between them. This is done in the main part of the paper (Section 2). In fact, as to problems of type G (of which I became aware only a short time ago), my presentation (Section 2.1) will be that of a beginner, and limited to a few historical remarks, accompanied by some indication for further study and some motivation from ODEs.
The reader will find somewhat more about problems of type K, for which I have focused on a short account of some basic results and methods from Bifurcation Theory on the one hand (Section 2.2), and on a brief description of a very special (maybe the “closest to linear”) nonlinear eigenvalue problem on the other (Section 2.3). The latter is the p-Laplace equation
$$-\operatorname{div}\left(|\nabla u|^{p-2} \nabla u\right) = \mu\, |u|^{p-2} u \ \text{ in } \Omega, \qquad u = 0 \ \text{ on } \partial\Omega \tag{5}$$
where $p > 1$ and $\Omega$ is a bounded domain in $\mathbb{R}^n$. Here, one can prove the existence (exactly as for the classical Laplace operator, $p = 2$) of countably many eigenvalues which can be naturally arranged in an increasing sequence
$$\mu_1 < \mu_2 \le \dots \le \mu_k \le \dots, \qquad \mu_k \to +\infty. \tag{6}$$
The importance of this example is also due to the fact that it shows, via the Lusternik–Schnirelmann theory, the full strength of the variational methods and of Critical Point Theory in particular. As is well known, these consist in seeking a solution of a given equation as a critical point of a functional (i.e., a point where the derivative of the functional vanishes), and are of the utmost importance both for equations of the form
$$A(x) = 0 \tag{7}$$
and for equations of the form
$$A(x) = \lambda C(x) \tag{8}$$
the latter being in fact the nonlinear eigenvalue problem in Equation (4) for the pair $(A, C)$. Indeed, if $A = \nabla f$ and $C = \nabla g$, then solutions of Equation (7) are the free critical points of $f$, while solutions of Equation (8) are (modulo technicalities) the constrained critical points of $f$ on the manifold $M = \{x : g(x) = \text{const.}\}$. As explained for instance in [4], the Lusternik–Schnirelmann theory not only guarantees, under appropriate assumptions on the nonlinear operators $A$ and $C$, the existence of infinitely many distinct eigenvalues of Equation (8), and thus in particular of Equation (5), but also provides for them a “minimax” characterization of the form (when $C = I$)
$$\lambda_n = \sup_{\mathcal{K}_n} \inf_{K} \langle A(u), u \rangle,$$
obtained via suitable families $\mathcal{K}_n$ of subsets $K$ of the unit sphere. This realizes a conceptually beautiful (and also practically useful, see for instance [5]) extension of the Courant–Weyl principle for the eigenvalues of a linear compact self-adjoint operator. The variational characterization of the eigenvalues seems to be a main point of common interest for both types of nonlinear eigenvalue problems, see, e.g., [6] and the references quoted in Section 2.1 and Section 2.3.
In the second part of the paper (Section 3), we return to Equation (1) and look at the case in which a small “perturbation parameter” $\epsilon$ enters the problem, giving rise to an equation of the form
$$F(\epsilon, \lambda, x) = 0$$
of which Equation (1) is seen as the unperturbed form for $\epsilon = 0$. We consider parameter-dependent forms of both Equations (3) and (4), precisely
$$G(\epsilon, \lambda)\, x = 0 \tag{10}$$
and, taking in Equation (4) $C = I$ and adding to a linear $A$ a nonlinear perturbation $\epsilon B$,
$$Ax + \epsilon B(x) = \lambda x. \tag{11}$$
In both Equations (10) and (11), one common problem is, in the light of what is done for linear operators [7], to see how the perturbed eigenvalues $\lambda(\epsilon)$ (provided that they exist) depend on $\epsilon$ near a given unperturbed eigenvalue $\lambda_0$. To this purpose, we review the main points of the recent contributions [8,9], respectively, to Equation (10) and to Equation (11).
Two more points deserve to be mentioned before closing this Introduction. The first is that, for a better understanding, Nonlinear Eigenvalue Problems (both of type G and of type K) should be set in the more general respective context of Nonlinear Spectral Theory. References for this are [10,11], respectively. The interested reader might look at [12] for a recent contribution to the latter. The second fact, clear enough from this Introduction, is that we have not even attempted to mention the various numerical methods used for the practical solution of Equation (3) in the case $E = \mathbb{K}^n$. The reader interested in this rich and fundamental research field might look into the excellent and very recent survey paper [13].
Let me repeat in conclusion that the only reasonable aim of this paper is to possibly arouse the curiosity of some expert in either field towards the problems treated in the other, and to offer possible inspiration for further study.

2. The Two Types of NLEVP

2.1. Problems of Type G: (Linear) Operator- and Matrix-Valued Functions

A good point to start a presentation of nonlinear eigenvalue problems of the type in Equation (3) is perhaps R.E.L. Turner’s paper [6] of 1968. Given a complex Hilbert space $H$, rather than considering the spectrum of a single bounded linear operator $A$ acting in $H$, he considers for $\lambda \in \mathbb{C}$ operators of the form
$$A_B(\lambda) \equiv A - \sum_{k=1}^{N} \lambda^k B_k \tag{12}$$
where $B_k \in L(H)$ are given ($k = 1, \dots, N$), with $B_1 = I$; thus if $N = 1$ we are back to the familiar $A - \lambda I$ considered in linear spectral theory. The spectrum of $A_B(\lambda)$ is defined as the set of those $\lambda \in \mathbb{C}$ for which $A_B(\lambda)$ fails to be a homeomorphism of $H$ onto itself. In particular, a point $\lambda_0 \in \mathbb{C}$ such that $A_B(\lambda_0)$ is not injective, i.e., such that the nullspace $\operatorname{Ker}(A_B(\lambda_0)) \neq \{0\}$, is an eigenvalue of $A_B(\lambda)$. Note that, in the case $N = 1$, these definitions of spectrum and eigenvalues of $A - \lambda I$ yield what we usually call the spectrum and eigenvalues of $A$. The new point of view is that the spectrum is now attributed to the (polynomial) function of $\mathbb{C}$ into $L(H)$ defined by putting
$$G(\lambda) = A_B(\lambda). \tag{13}$$
In the case $H = \mathbb{C}$, the spectrum so defined consists very simply of the zeroes of the polynomial $G$ itself. Now recall (see e.g., [14] or [15]) that, if $A$ is compact, self-adjoint and nonnegative, then:
  • The spectrum of $A$ is at most countable and consists of a finite or infinite decreasing sequence of non-negative eigenvalues $(\lambda_n)$:
    $$\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_k \ge \dots$$
    If the sequence is infinite, then $\lambda_n \to 0$.
  • The eigenvectors $(u_n)$ associated with the eigenvalues $(\lambda_n)$ form an orthonormal basis of $H$.
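Turner first generalizes this to operators as in Equation (12) where $A$ is compact, self-adjoint and non-negative, $B_k$ is self-adjoint and non-negative for $k = 1, \dots, N$ and $A$ belongs to the Schatten class $C_r$ (i.e., its eigenvalues $(\alpha_i)$ satisfy the condition $\sum_i (\alpha_i)^r < \infty$) for some $r < \frac{1}{2}$. Another basic fact concerning the spectrum of an operator $A$ as above is the variational characterization of its positive eigenvalues $(\lambda_n)$: indeed,
$$\lambda_n = \max_{x \perp u_1, \dots, u_{n-1}} \frac{\langle Ax, x \rangle}{\langle x, x \rangle} = \min_{x \in [u_1, \dots, u_n]} \frac{\langle Ax, x \rangle}{\langle x, x \rangle} \tag{14}$$
$$\phantom{\lambda_n} = \min_{V \in \mathcal{V}_{n-1}} \max_{x \perp V} \frac{\langle Ax, x \rangle}{\langle x, x \rangle} = \max_{V \in \mathcal{V}_n} \min_{x \in V} \frac{\langle Ax, x \rangle}{\langle x, x \rangle} \tag{15}$$
where $V$ is a vector subspace of $H$, and $\mathcal{V}_n$ denotes the family of all vector subspaces of dimension $n$.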
Turner generalizes the variational principle as follows. For $x \in H$, $x \neq 0$, let $Z(x)$ be the unique non-negative zero of the polynomial $\lambda \mapsto \langle G(\lambda)x, x \rangle$ [6]. Note that, in the case $N = 1$, as $\langle G(\lambda)x, x \rangle = \langle Ax, x \rangle - \lambda \langle x, x \rangle$, we have
$$Z(x) = \frac{\langle Ax, x \rangle}{\langle x, x \rangle} \tag{16}$$
so that the function $Z$ is the usual Rayleigh quotient of $A$, of which the eigenvalues are extremal values as shown by Equation (14). Then, under the stated assumptions on $A$ and $B_k$, if moreover the eigenvectors of $A_B(\lambda)$, corresponding to non-negative eigenvalues, form a basis for $H$, then the variational principles in Equations (14) and (15) hold replacing $\langle Ax, x \rangle / \langle x, x \rangle$ with $Z(x)$.
Finally, we have by definition of $Z(x)$ that
$$\langle G(Z(x))x, x \rangle = 0 \quad \text{for all } x \in H,\ x \neq 0. \tag{17}$$
Results similar to those of Turner, and practically at the same time, were obtained by K.P. Hadeler in [16,17]. He considered several-parameter dependent operators of the form
$$A - (\lambda_1 B_1 + \lambda_2 B_2 + \dots + \lambda_N B_N)$$
with $B_j$ bounded self-adjoint for $j = 1, \dots, N$, and in connection with the variational property of their eigenvalues introduced the general concept of Rayleigh functional of a matrix function as follows. Let $\alpha \mapsto T(\alpha)$ be a differentiable mapping of the real interval $(a, b)$ to the set $S_n$ of real symmetric matrices of order $n$. Then, a Rayleigh functional of $T$ is a continuous real-valued function $p$ on $\mathbb{R}^n \setminus \{0\}$ such that $p(x) \in (a, b)$ for all $x \in \mathbb{R}^n \setminus \{0\}$ and
  • $p(cx) = p(x)$ if $c \neq 0$
  • $\langle T(p(x))x, x \rangle = 0$
  • $\langle T'(p(x))x, x \rangle > 0$.
The last is a definiteness condition that can be replaced by $\langle T'(p(x))x, x \rangle < 0$, and is plainly satisfied in the basic case $T(\alpha) = A - \alpha I$, where $T'(\alpha) = -I$. Thus, looking at Equations (16) and (17), we see that this is a sensible extension of the definition and properties of the Rayleigh quotient.
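As a small numerical illustration of the Rayleigh functional, the following sketch takes the finite-dimensional quadratic case $G(\lambda) = A - \lambda I - \lambda^2 B_2$ of Equation (12) (so $N = 2$), with $A$ and $B_2$ symmetric and non-negative. The matrices, the test vector and the helper names (`rayleigh_functional`, `quadratic_form`) are assumptions chosen only for illustration; they do not come from [6] or [16,17].

```python
import math

def inner(u, v):
    """Euclidean scalar product <u, v>."""
    return sum(a * b for a, b in zip(u, v))

def matvec(M, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [inner(row, x) for row in M]

def quadratic_form(A, B2, x, lam):
    """<G(lam)x, x> for G(lam) = A - lam*I - lam^2*B2."""
    a = inner(matvec(A, x), x)
    b = inner(matvec(B2, x), x)
    n = inner(x, x)
    return a - lam * n - lam ** 2 * b

def rayleigh_functional(A, B2, x):
    """Unique non-negative zero Z(x) of lam -> <G(lam)x, x> (x != 0)."""
    a = inner(matvec(A, x), x)    # <Ax, x>   >= 0 by assumption
    b = inner(matvec(B2, x), x)   # <B2 x, x> >= 0 by assumption
    n = inner(x, x)               # <x, x>    >  0
    if b == 0.0:
        return a / n              # reduces to the Rayleigh quotient (16)
    # non-negative root of b*lam^2 + n*lam - a = 0
    return (-n + math.sqrt(n * n + 4.0 * a * b)) / (2.0 * b)

# Illustrative data (assumed): symmetric, non-negative matrices
A = [[2.0, 1.0], [1.0, 2.0]]
B2 = [[1.0, 0.0], [0.0, 3.0]]
ZERO = [[0.0, 0.0], [0.0, 0.0]]
x = [1.0, -2.0]

z = rayleigh_functional(A, B2, x)
z_scaled = rayleigh_functional(A, B2, [3.0 * xi for xi in x])
z_linear = rayleigh_functional(A, ZERO, x)
```

Here `z` satisfies Equation (17) up to rounding, `z_scaled` agrees with `z` (homogeneity of degree zero), and `z_linear` is the ordinary Rayleigh quotient. In this example the derivative of $\lambda \mapsto \langle G(\lambda)x, x \rangle$ at $Z(x)$ is negative, which corresponds to the "$< 0$" variant of the definiteness condition.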
The results of Turner and Hadeler indicated above were developed and improved by, among others, H. Langer. For instance, in [18], studying combinations $T(\lambda)$ of bounded self-adjoint operators of the form of Equation (12) considered by Turner, he assumed that, for each nonzero vector $x$, the polynomial $p_x(\lambda) \equiv \langle T(\lambda)x, x \rangle$ has only real roots
$$\lambda_1(x) \le \lambda_2(x) \le \dots \le \lambda_n(x).$$
Under this assumption he showed that the ranges $\Lambda_i$ of the functions $\lambda_i$ are intervals, called spectral zones, whose interiors do not overlap.
A systematization of the spectral theory (that is, of the properties of eigenvalues and eigenvectors) of polynomial operator pencils, as one calls the families
$$A(\lambda) = A_0 + \lambda A_1 + \dots + \lambda^n A_n$$
where $\lambda \in \mathbb{C}$ is a spectral parameter, and $A_i$, $i = 1, \dots, n$, are linear operators in a Hilbert space, was given by A.S. Markus in his book [19]. Among others, he considered in depth the problem of the factorization of pencils, which in the simplest case consists in representing a quadratic pencil $A(\lambda) = \lambda^2 I + \lambda B + C$ in the form
$$A(\lambda) = (\lambda I - Y)(\lambda I - Z).$$
The importance of many results in [19] is due to the fact that they hold for the more general case of holomorphic (i.e., analytic) operator-valued functions, namely operators $A(\lambda)$ expressed as the sum of convergent power series in $L(E)$:
$$A(\lambda) = \sum_{n=0}^{\infty} \lambda^n A_n.$$
For an updated reference reviewing the spectral properties of self-adjoint analytic operator functions, and in particular the factorization problem, see [10]. On the other hand, for further work on the variational characterization of eigenvalues as well as for the development of the theory of Rayleigh functionals, the interested reader can look for instance into the quite recent papers by Binding, Eschwé and Langer [20], Hasanov [21], Voss [22], and Schwetlick and Schreiber [23], and the references therein.
Let us now add some more specific indication for the case in which $E = \mathbb{K}^m$, so that the function $G$ appearing in Equation (3) takes its values in the space $\mathbb{K}^{m \times m}$ of $m \times m$ real or complex matrices. We shall stress the finite-dimensionality of the ambient space $E$ using the letter $M$ rather than $G$, and often the letter $v$ rather than $x$ for the vectors of $E$. A well known reference book for the matter is the one by Gohberg, Lancaster and Rodman [24], and the very Introduction to this book explains to us that problems of the form
$$M(\lambda)\, v = 0, \qquad \lambda \in \mathbb{C},\ v \in \mathbb{C}^m,\ v \neq 0 \tag{20}$$
where $M(\lambda) \in \mathbb{C}^{m \times m}$ appear naturally when dealing with linear systems of higher order ordinary differential equations (ODE) with constant coefficients:
$$\frac{d^n u}{dt^n} + A_{n-1} \frac{d^{n-1} u}{dt^{n-1}} + \dots + A_1 \frac{du}{dt} + A_0 u = 0 \tag{21}$$
where $A_i \in \mathbb{C}^{m \times m}$ for $i = 0, 1, \dots, n-1$. Indeed, looking for solutions of the form $u(t) = e^{\lambda t} v$ ($\lambda \in \mathbb{C}$, $v \in \mathbb{C}^m$) of Equation (21) leads to the equation
$$e^{\lambda t} \left\{ \lambda^n I + \lambda^{n-1} A_{n-1} + \dots + \lambda A_1 + A_0 \right\} v = 0 \tag{22}$$
which (as long as $v \neq 0$, and putting $A_n = I$) is equivalent to Equation (20) with
$$M(\lambda) = \sum_{i=0}^{n} \lambda^i A_i. \tag{23}$$
Thus, $e^{\lambda_0 t} v_0$ is a nontrivial solution of Equation (21) if and only if $\lambda_0$ is an eigenvalue of Equation (20), i.e., it is a root of the characteristic equation
$$\det M(\lambda) = 0 \tag{24}$$
and $v_0 \in \operatorname{Ker} M(\lambda_0)$. More generally, the function
$$u(t) = e^{\lambda_0 t} \left\{ \frac{t^k}{k!} v_0 + \dots + \frac{t}{1!} v_{k-1} + v_k \right\} \tag{25}$$
is a solution of Equation (21) if and only if the vectors $v_0, v_1, \dots, v_k$ satisfy the relations
$$\sum_{j=0}^{l} \frac{1}{j!} \frac{d^j M}{d\lambda^j}(\lambda_0)\, v_{l-j} = 0, \qquad l = 0, 1, \dots, k. \tag{26}$$
Such a set of vectors $v_0, v_1, \dots, v_k$ is called a Jordan chain of length $k + 1$ for the matrix function $M(\lambda)$, corresponding to the eigenvalue $\lambda_0$ and starting with the eigenvector $v_0$. The above definitions extend from matrix polynomials as in Equation (23) to any analytic matrix function $M(\lambda)$. It is good to see the explicit form of Equation (26), which is
$$\begin{aligned}
l = 0: &\quad M(\lambda_0) v_0 = 0 \\
l = 1: &\quad M(\lambda_0) v_1 + \frac{1}{1!} \frac{dM}{d\lambda}(\lambda_0) v_0 = 0 \\
l = 2: &\quad M(\lambda_0) v_2 + \frac{1}{1!} \frac{dM}{d\lambda}(\lambda_0) v_1 + \frac{1}{2!} \frac{d^2 M}{d\lambda^2}(\lambda_0) v_0 = 0 \\
&\quad \vdots \\
l = k: &\quad M(\lambda_0) v_k + \frac{1}{1!} \frac{dM}{d\lambda}(\lambda_0) v_{k-1} + \dots + \frac{1}{k!} \frac{d^k M}{d\lambda^k}(\lambda_0) v_0 = 0.
\end{aligned} \tag{27}$$
If $n = 1$ in Equation (23), we have $M(\lambda) = \lambda A_1 + A_0$; and if moreover $A_1 = -I$, then $M(\lambda) = A_0 - \lambda I$. In this case, $\frac{dM}{d\lambda}(\lambda_0) = -I$, while $\frac{d^j M}{d\lambda^j}(\lambda_0) = 0$ for all $j > 1$, so that the above equalities reduce to (putting $A_0 = A$)
$$\begin{aligned}
(A - \lambda_0 I) v_0 &= 0 \\
(A - \lambda_0 I) v_1 &= v_0 \\
&\ \,\vdots \\
(A - \lambda_0 I) v_k &= v_{k-1}
\end{aligned} \tag{28}$$
and are those defining an ordinary Jordan chain for the matrix $A$ corresponding to $\lambda_0$ and $v_0$, used to represent $A$ in its Jordan canonical form and in particular to construct a basis of the generalized eigenspace $E_{\lambda_0}(A)$ associated with $\lambda_0$. We recall that this is defined as
$$E_{\lambda_0}(A) = \operatorname{Ker}\left( (A - \lambda_0 I)^p \right) \tag{29}$$
where $p$ is the least integer such that $\operatorname{Ker}((A - \lambda_0 I)^p) = \operatorname{Ker}((A - \lambda_0 I)^{p+1})$, and that the dimension $\dim E_{\lambda_0}(A)$ of $E_{\lambda_0}(A)$ is equal to the algebraic multiplicity of $\lambda_0$, that is, the multiplicity of the eigenvalue as a zero of the characteristic polynomial $\det(A - \lambda I)$. We say that $\lambda_0$ is semisimple if $p = 1$ in Equation (29) (that is, if the algebraic multiplicity coincides with the geometric multiplicity of $\lambda_0$, defined as $\dim \operatorname{Ker}(A - \lambda_0 I)$) and that $\lambda_0$ is simple if they are both equal to 1.
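For a concrete instance of Equations (20)–(24), the sketch below computes the eigenvalues of a quadratic matrix polynomial $M(\lambda) = \lambda^2 I + \lambda A_1 + A_0$ (the case $n = 2$ of Equation (23)) by rewriting the second-order ODE as a first-order system and taking the eigenvalues of the resulting companion matrix. This linearization is a standard device and is not claimed to be the method of [24]; the matrices `A0`, `A1` and the function names are assumptions for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def quadratic_pencil_eigenvalues(A0, A1):
    """Eigenvalues of M(lam) = lam^2 I + lam A1 + A0 via a companion matrix.

    With w = u', the ODE u'' + A1 u' + A0 u = 0 becomes
    (u, w)' = C (u, w), C = [[0, I], [-A0, -A1]], and the eigenvalues
    of C are exactly the roots of det M(lam) = 0.
    """
    m = A0.shape[0]
    C = np.block([[np.zeros((m, m)), np.eye(m)],
                  [-A0, -A1]])
    return np.linalg.eigvals(C)

def M(lam, A0, A1):
    """Evaluate the matrix polynomial at a (possibly complex) point lam."""
    return lam ** 2 * np.eye(A0.shape[0]) + lam * A1 + A0

# Illustrative coefficient matrices (assumed)
A0 = np.array([[2.0, 1.0], [0.0, 3.0]])
A1 = np.array([[3.0, 0.0], [1.0, 4.0]])

eigs = quadratic_pencil_eigenvalues(A0, A1)
# Each computed eigenvalue should make det M(lam) vanish up to rounding
residuals = [abs(np.linalg.det(M(lam, A0, A1))) for lam in eigs]
```

A polynomial of degree $n$ in dimension $m$ with $A_n = I$ has $nm$ eigenvalues counted with multiplicity, so here `eigs` has four entries.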
These familiar concepts from Linear Algebra, concerning the basic case $M(\lambda) = A_0 - \lambda I$, need to be extended to analytic matrix functions $M(\lambda)$. To this purpose, we quote from [25]; see also ([26], Chapter 7).
  • Let $x_0$ be an eigenvector corresponding to an eigenvalue $\lambda_0$. The maximal length of a Jordan chain starting at $x_0$ is called the multiplicity of $x_0$ and denoted by $m(x_0)$. An eigenvalue $\lambda_0$ is said to be normal if it is an isolated eigenvalue and the multiplicity of each corresponding eigenvector is finite.
  • Suppose that $\lambda_0$ is a normal eigenvalue. Then, a corresponding canonical system of Jordan chains
    $$x_0^k, x_1^k, \dots, x_{m_k - 1}^k \qquad (k = 1, \dots, N)$$
    is defined by the following rules:
    (1) The vectors $x_0^1, \dots, x_0^N$ form a basis of $\operatorname{Ker} M(\lambda_0)$ (and so $N = \dim \operatorname{Ker} M(\lambda_0)$).
    (2) $x_0^1, x_1^1, \dots, x_{m_1 - 1}^1$ is a Jordan chain of the maximal length $m_1 \equiv m(x_0^1)$.
    (3) Once the vectors $x_0^1, x_0^2, \dots, x_0^{k-1}$ ($1 \le k < N$) have been chosen, pick an eigenvector $x_0^k$ linearly independent from $x_0^1, x_0^2, \dots, x_0^{k-1}$ and form a Jordan chain $x_0^k, x_1^k, \dots, x_{m_k - 1}^k$ of the maximal length $m_k \equiv m(x_0^k)$.
  • A canonical system is not defined uniquely; however, the numbers $m_1, m_2, \dots, m_N$ do not depend on the choice of Jordan chains and are called partial multiplicities of the eigenvalue $\lambda_0$. The number $Q(\lambda_0) \equiv m_1 + \dots + m_N$ is the algebraic multiplicity of the eigenvalue $\lambda_0$.
The next statement, which is based on results found in [27], shows that these definitions are a coherent generalization of the usual ones.
Proposition 1.
An eigenvalue $\lambda_0$ is a zero of $\det M(\lambda)$ of multiplicity $Q(\lambda_0)$.
Based on Proposition 1, the definitions of simple and semisimple eigenvalue carry over to the case of matrix polynomials and more generally to analytic matrix functions. For instance, one may check that the matrix function
$$M_2(\lambda) = \begin{pmatrix} \lambda - 1 + e^{-\lambda} & 0 \\ 0 & \lambda + 1 \end{pmatrix} \tag{30}$$
considered in [8] has $\lambda_0 = 0$ as a double (i.e., of algebraic multiplicity 2), nonsemisimple (i.e., of geometric multiplicity 1) eigenvalue, with Jordan chain
$$H_0 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad H_1 = \begin{pmatrix} \alpha \\ 0 \end{pmatrix} \tag{31}$$
for any $\alpha \in \mathbb{R}$. This example also shows that in the nonlinear case, generalized eigenvectors do not need to be linearly independent. Indeed, in the construction (and notation) recalled above, the generating vectors $x_0^1, \dots, x_0^N$ of the system of Jordan chains are chosen to be linearly independent, but it is not necessarily so for the vectors in each corresponding chain, generated by the rules given by the system in Equation (27).
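The claims about $M_2(\lambda)$ can be checked directly against the chain relations in Equation (26). The following sketch hard-codes the derivatives of $M_2$ and evaluates the left-hand sides of Equation (27) at $\lambda_0 = 0$; the function names and the value of `alpha` are assumptions made for illustration.

```python
import math

def M2_derivative(lam, j):
    """j-th derivative of M2(lam) = [[lam - 1 + exp(-lam), 0], [0, lam + 1]]."""
    if j == 0:
        return [[lam - 1.0 + math.exp(-lam), 0.0], [0.0, lam + 1.0]]
    if j == 1:
        return [[1.0 - math.exp(-lam), 0.0], [0.0, 1.0]]
    # for j >= 2 only the exponential term survives
    return [[(-1.0) ** j * math.exp(-lam), 0.0], [0.0, 0.0]]

def matvec(M, v):
    """Product of a 2x2 matrix (list of rows) with a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def chain_residuals(chain, lam0):
    """Left-hand sides of Equation (26) for l = 0, ..., len(chain) - 1."""
    out = []
    for l in range(len(chain)):
        total = [0.0, 0.0]
        for j in range(l + 1):
            w = matvec(M2_derivative(lam0, j), chain[l - j])
            total = [t + wi / math.factorial(j) for t, wi in zip(total, w)]
        out.append(total)
    return out

alpha = 2.5                                   # any real number works
chain = [[1.0, 0.0], [alpha, 0.0]]            # H0, H1 from Equation (31)
residuals = chain_residuals(chain, 0.0)       # both entries should vanish

# Trying to extend the chain by a third vector (here the zero vector)
extended = chain_residuals(chain + [[0.0, 0.0]], 0.0)
```

The third relation cannot be satisfied: since the first row of $M_2(0)$ is zero, the first component of the $l = 2$ residual equals $\frac{1}{2}$ whatever third vector is chosen, so the maximal chain length is 2, in agreement with the algebraic multiplicity. Note also that $H_0$ and $H_1$ are linearly dependent for every $\alpha$.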
An especially important source for the study of NLEVP is provided by Delay Differential Equations (DDE), or systems of them. For instance, [26] considers the so-called Wright equation
$$x'(t) = -\alpha\, x(t-1)\, [1 + x(t)] \tag{32}$$
where $\alpha > 0$. The objective is to determine the periodic orbits (if any) of Equation (32). To do this, one must first look at the linearization of Equation (32) near $x \equiv 0$, which is
$$x'(t) = -\alpha\, x(t-1). \tag{33}$$
Solutions $e^{\lambda t}$ of this exist iff $\lambda$ satisfies the characteristic equation
$$\lambda e^{\lambda} + \alpha = 0. \tag{34}$$
For $\alpha = \pi/2$, this has $\lambda = i\frac{\pi}{2}$ as a simple purely imaginary root, corresponding to the periodic solution $e^{i\frac{\pi}{2}t}$. Studying the properties of these nonlinear eigenvalues, that is of the roots $\lambda(\alpha)$ of Equation (34) as a function of $\alpha$, and using deep topological and functional-analytic results from [26], it is possible to demonstrate that Equation (32) has a Hopf bifurcation at $\alpha = \pi/2$, and that for every $\alpha > \pi/2$ Equation (32) has a nonconstant periodic solution with period close to 4. Finally, the authors show that for $p > 4$, there is a periodic solution of Equation (32) of period $p$.
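The behavior of the root $\lambda(\alpha)$ of Equation (34) near $\alpha = \pi/2$ can be explored numerically. The sketch below uses a plain Newton iteration in the complex plane, started near $i\pi/2$; it is only an illustration of the root crossing the imaginary axis, not the analysis carried out in [26], and the function name, starting guess and tolerances are assumptions.

```python
import cmath
import math

def characteristic_root(alpha, guess=1.5j, tol=1e-14, max_iter=50):
    """Newton iteration for f(lam) = lam * exp(lam) + alpha = 0."""
    lam = complex(guess)
    for _ in range(max_iter):
        f = lam * cmath.exp(lam) + alpha
        df = (1.0 + lam) * cmath.exp(lam)   # derivative of f
        step = f / df
        lam -= step
        if abs(step) < tol:
            break
    return lam

root_critical = characteristic_root(math.pi / 2)        # expected near i*pi/2
root_below = characteristic_root(math.pi / 2 - 0.1)     # slightly smaller alpha
root_above = characteristic_root(math.pi / 2 + 0.1)     # slightly larger alpha

# Period of the oscillation exp(lam*t) at the critical value
period_critical = 2.0 * math.pi / root_critical.imag
```

In this experiment the real part of the root is negative for $\alpha$ slightly below $\pi/2$ and positive slightly above, which is the transversal crossing one expects at a Hopf bifurcation, and the period at the critical value equals 4.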
One can also consider systems of DDE, for instance
$$x'(t) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} x(t) + \begin{pmatrix} -1 & 0 \\ 0 & 0 \end{pmatrix} x(t-1)$$
whose characteristic matrix is precisely that displayed in Equation (30). The general form of a system of $N$ delay differential equations, with delays $\tau_1, \dots, \tau_N$ is
$$x'(t) = A_0 x(t) + \sum_{i=1}^{N} A_i\, x(t - \tau_i)$$
with $A_0, A_i \in \mathbb{C}^{N \times N}$, and the corresponding characteristic matrix is
$$M(\lambda) = \lambda I - A_0 - \sum_{i=1}^{N} A_i e^{-\lambda \tau_i}.$$
More general forms of Equation (35) are considered in Section 3.

2.2. Problems of Type K: Nonlinear Operators and Bifurcation

Throughout this Section $E$ will be a real Banach space, of finite or infinite dimension. Originally, bifurcation theory deals with the local study of Equation (1) near a point $(\lambda_0, 0) \in \mathbb{R} \times E$, and studies precisely the conditions under which from the given point $(\lambda_0, 0)$ of the line $\mathbb{R} \times \{0\} \subset \mathbb{R} \times E$ of the trivial solutions of Equation (1), there bifurcates a branch of nontrivial solutions, that is, of solutions $(\lambda, x)$ with $x \neq 0$. Of course, the basic situation that comes to mind is the case $F(\lambda, x) = Ax - \lambda x$, with $\lambda_0$ an eigenvalue of the linear operator $A$, the “branch” being here the special subset $\{\lambda_0\} \times (\operatorname{Ker}(A - \lambda_0 I) \setminus \{0\})$ of $\mathbb{R} \times E$. The interesting case is when $F$ depends in a less obvious way on $\lambda$ and $x$; an easy example of what we mean is given for instance by the equation
$$F(\lambda, x) \equiv ax + bx^3 - \lambda x = 0, \qquad (\lambda, x) \in \mathbb{R}^2$$
in which the parabola $\lambda = a + bx^2$ bifurcates at the point $(a, 0)$ from the line of the trivial solutions. For a motivating introduction to the theory, and a discussion of some important physical problems that fall in this context, an excellent source is the old review paper by Stakgold [28].
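The scalar example above is simple enough to solve in closed form, which makes the branching picture explicit. In the sketch below, the function name and the sample values of $a$ and $b$ are assumptions for illustration; for $b > 0$ nontrivial solutions exist only for $\lambda > a$ and shrink to $0$ as $\lambda \to a$.

```python
import math

def nontrivial_solutions(lam, a, b):
    """Nonzero real roots x of a*x + b*x**3 - lam*x = 0 (b != 0).

    Dividing by x != 0 gives x**2 = (lam - a) / b, i.e. the parabola
    lam = a + b*x**2.
    """
    t = (lam - a) / b
    if t <= 0.0:
        return []              # only the trivial solution x = 0
    r = math.sqrt(t)
    return [-r, r]

a, b = 1.0, 2.0                # illustrative coefficients
at_bifurcation = nontrivial_solutions(a, a, b)
before = nontrivial_solutions(a - 0.5, a, b)
after = nontrivial_solutions(a + 0.5, a, b)
near = nontrivial_solutions(a + 1e-8, a, b)
```

With these values, `after` contains the two points $\pm 0.5$ on the parabola, while `near` shows that the nontrivial solutions approach $x = 0$ as $\lambda \to a^+$, so every neighborhood of $(a, 0)$ contains nontrivial solutions.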
The previous “naive” idea of bifurcation needs to be made both more precise and more general, and this is done by saying that $(\lambda_0, 0)$ is a bifurcation point for Equation (1) if any neighborhood of $(\lambda_0, 0)$ in $\mathbb{R} \times E$ contains nontrivial solutions of Equation (1). For this definition to make sense, it is enough that $F$ be defined in an open set $U \subset \mathbb{R} \times E$ with $(\lambda_0, 0) \in U$, and this is what we assume from now on. For the next step, we further assume that $F$ is differentiable at the point $(\lambda_0, 0)$, so that $F$ can be linearized near that point as
$$F(\lambda, x) = F(\lambda_0, 0) + D_\lambda F(\lambda_0, 0)(\lambda - \lambda_0) + D_x F(\lambda_0, 0)x + R(\lambda, x) = D_x F(\lambda_0, 0)x + R(\lambda, x) \tag{36}$$
where the remainder term $R$ satisfies
$$R(\lambda, x) = o(\|(\lambda, x)\|) \quad \text{as } (\lambda, x) \to (\lambda_0, 0).$$
Some more regularity on F yields immediately a necessary condition for bifurcation:
Theorem 1.
Suppose that $F$ is of class $C^1$ in a neighborhood of $(\lambda_0, 0)$. If $D_x F(\lambda_0, 0)$ is a homeomorphism of $E$ onto itself, then $(\lambda_0, 0)$ cannot be a bifurcation point for Equation (1).
Proof. 
The assumption implies, via the Implicit Function Theorem, that there is a neighborhood $I \times V$ of $(\lambda_0, 0)$ such that, for any $\lambda \in I$, there is a unique $x = x(\lambda) \in V$ such that $F(\lambda, x) = 0$. As by assumption $F(\lambda, 0) = 0$ for any $\lambda$, we must have $x(\lambda) = 0$ for $\lambda \in I$, so that there is no nontrivial solution to Equation (1) in the neighborhood $I \times V$ of $(\lambda_0, 0)$. ☐
For simplicity, we shall henceforth consider only the special case
$$F(\lambda, x) = A(x) - \lambda x \tag{37}$$
where $A(0) = 0$ and $A$ is of class $C^1$ near $x = 0$. Here, $D_x F(\lambda_0, 0) = A'(0) - \lambda_0 I$, and we have a more explicit form of the remainder term in the linearized form in Equation (36) of $F$: for we can write $A(x) = A'(0)x + B(x)$ with $B(x) = o(\|x\|)$ as $x \to 0$, so that Equation (37) yields
$$F(\lambda, x) = A'(0)x + B(x) - \lambda x$$
$$\phantom{F(\lambda, x)} = (A'(0) - \lambda_0 I)x + B(x) - (\lambda - \lambda_0)x$$
and comparing this with Equation (36) we see that $R(\lambda, x) = B(x) - (\lambda - \lambda_0)x$ in this special case. Summing up, the equation we want to study is
$$A(x) - \lambda x = 0 \tag{38}$$
with $A(0) = 0$ and $A$ of class $C^1$ near $x = 0$, and can be written as
$$Tx - \lambda_0 x + B(x) = (\lambda - \lambda_0)x$$
where $T \equiv A'(0)$ and $B(x) = o(\|x\|)$ as $x \to 0$. The necessary condition for bifurcation implicitly stated in Theorem 1 can now be rephrased as follows:
$$(\lambda_0, 0) \text{ bifurcation point of } A(x) - \lambda x = 0 \ \Longrightarrow\ \lambda_0 \in \sigma(A'(0)).$$
The standard case considered in the literature is when $\lambda_0$ is in the point spectrum of $A'(0)$, and we formalize this more precisely under the form of a basic assumption, which is plainly satisfied if $\dim E < \infty$:
H0. 
$\lambda_0$ is an isolated eigenvalue of $T \equiv A'(0)$ and $T - \lambda_0 I$ is a Fredholm operator of index zero.
Let us recall (see, e.g., [14]) that a bounded linear operator $L$ between two Banach spaces $E$ and $F$ is said to be a Fredholm operator if its nullspace $\operatorname{Ker} L$ has finite dimension and its range $\operatorname{Im} L$ is closed and has finite codimension; in this case, the index of $L$, $\operatorname{ind} L$, is defined as
$$\operatorname{ind} L = \dim \operatorname{Ker} L - \operatorname{codim} \operatorname{Im} L.$$
Thus, if $\dim E = \dim F < \infty$, then any linear operator is Fredholm of index zero. From the Riesz–Schauder theory of such operators (see e.g., [15]), it is known that also the nullspaces $\operatorname{Ker} L^j$ ($j > 1$) are finite-dimensional, and that they stabilize for $j$ sufficiently large; with reference to the case $L = T - \lambda_0 I$, this means that there exists a least integer $p$ such that $\operatorname{Ker}(T - \lambda_0 I)^p = \operatorname{Ker}(T - \lambda_0 I)^{p+1}$, and moreover one has
$$E = \operatorname{Ker}(T - \lambda_0 I)^p \oplus \operatorname{Im}(T - \lambda_0 I)^p.$$
It follows in particular that the algebraic multiplicity of $\lambda_0$ is finite, where in general this is defined (consistently with the definition recalled in Section 2.1 for the case $\dim E < \infty$) as the dimension of the subspace
$$\bigcup_{j=1}^{\infty} \operatorname{Ker}(T - \lambda_0 I)^j.$$
In the following, when speaking of multiplicity of an eigenvalue, we refer to the algebraic multiplicity. We recall that this coincides with the geometric multiplicity $\dim \operatorname{Ker}(T - \lambda_0 I)$ when $T$ is a self-adjoint operator in a Hilbert space.
Remark 1.
The assumption H0 alone is not sufficient to guarantee that an eigenvalue of the “linear part” T of A at 0 is a bifurcation point for A. To see this, consider the example ([29], Chapter 11) given by the system
$$x + y^3 = \lambda x, \qquad y - x^3 = \lambda y. \tag{40}$$
Here, $E = \mathbb{R}^2$ and we have (in our notations) $T = I$, $\lambda_0 = 1$ and $B(x, y) = (y^3, -x^3)$. Multiplying the first equation by $y$, the second by $x$ and subtracting the second from the first, we obtain $x^4 + y^4 = 0$, showing that Equation (40) has no nontrivial solution whatsoever. One way of seeing this is that the two-dimensional eigenspace associated with $\lambda_0$ is completely destroyed by the addition of the perturbing term $B$.
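The elimination step in Remark 1 is a one-line algebraic identity, and it can be confirmed mechanically. The sketch below writes the two equations of (40) as residuals and forms the combination "$y$ times the first minus $x$ times the second"; the function names and the sample points are assumptions for illustration.

```python
def residuals(lam, x, y):
    """Residuals of system (40): both vanish exactly at a solution."""
    r1 = x + y ** 3 - lam * x
    r2 = y - x ** 3 - lam * y
    return r1, r2

def combination(lam, x, y):
    """y * (first residual) - x * (second residual)."""
    r1, r2 = residuals(lam, x, y)
    return y * r1 - x * r2

# Integer sample points keep the arithmetic exact
samples = [(3, 2, -1), (0, 1, 1), (-2, 5, 4), (1, 0, 7)]
identity_holds = all(
    combination(lam, x, y) == x ** 4 + y ** 4 for lam, x, y in samples
)
```

Since the combination equals $x^4 + y^4$ for every $\lambda$, any solution of (40) forces $x^4 + y^4 = 0$, hence $x = y = 0$, independently of $\lambda$.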
Three typical situations are then considered, each of them guaranteeing bifurcation from λ 0 , and described by the following assumptions, respectively:
H1. 
$\lambda_0$ is a simple eigenvalue of $A'(0)$.
H2. 
$A$ is compact and $\lambda_0 \neq 0$ is an eigenvalue of odd multiplicity of $A'(0)$.
H3. 
$A$ is a gradient operator in a Hilbert space and $\lambda_0$ is an isolated eigenvalue of finite multiplicity of $A'(0)$.
These assumptions call immediately for some explanation. In fact, it could be noted at once that both H2 and H3 are a strengthening of H0. However, to proceed with some order, in the remaining part of this subsection, we shall give a precise statement for each of the three bifurcation results roughly indicated above, preceded by a comment on the respective assumption, and followed by an indication of the proof.
Thus, starting with H2, we recall that, if $A$ is compact, then the linear operator $A'(0)$ is compact, too [30]. Therefore H0 is redundant in this case, as it is a basic spectral property of any such operator [14].
Theorem 2.
If H2 is satisfied, then $\lambda_0$ is a bifurcation point for Equation (38). Moreover, it is a global bifurcation point in the following sense: if $S$ denotes the closure in $\mathbb{R} \times E$ of the set of nontrivial solutions of Equation (38), then $S \cup \{(\lambda_0, 0)\}$ has a connected subset $S_{\lambda_0}$ containing $(\lambda_0, 0)$, and which is either unbounded in $\mathbb{R} \times E$ or contains a point $(\lambda_1, 0)$ with $\lambda_1$ an eigenvalue of odd multiplicity of $T$.
Proof. 
The proof relies on the Leray–Schauder degree. Roughly speaking, this is a topological tool to detect the fixed points of a compact map and can be briefly introduced as follows (see, for instance, [3] (Part I) for a complete presentation). Suppose we have a continuous compact map $C$ of $E$ into itself, a bounded open set $\Omega \subset E$ with $0 \in \Omega$, and suppose that $C(x) \neq x$ for $x \in \partial\Omega$. Then, there exists an integer, denoted $d(I - C, \Omega, 0)$ and called the (Leray–Schauder) degree of $I - C$ relative to the set $\Omega$ and to the point 0, having the following properties:
(i) If $d(I - C, \Omega, 0) \neq 0$, then there exists an $x \in \Omega$ such that $x = C(x)$.
(ii) $d(I, \Omega, 0) = 1$.
(iii) If $\Omega_0 \subset \Omega$ and $I - C$ has no zeroes in $\Omega \setminus \overline{\Omega}_0$, then $d(I - C, \Omega, 0) = d(I - C, \Omega_0, 0)$.
(iv) Suppose $C_1, C_2 : E \to E$ are compact maps, and put
$$H(t, x) = x - [C_1(x) + t(C_2(x) - C_1(x))], \qquad t \in [0, 1],\ x \in E.$$
If $H(t, x) \neq 0$ for $t \in [0, 1]$ and $x \in \partial\Omega$, then
$$d(I - C_1, \Omega, 0) = d(I - C_2, \Omega, 0).$$
(v) If $C$ is a linear compact map and $I - C$ is injective, then
$$d(I - C, \Omega, 0) = (-1)^{\nu}$$
where $\nu$ is the number of eigenvalues $> 1$ of $C$, each counted with its algebraic multiplicity.
To prove that $\lambda_0$ is a bifurcation point, it is enough to show that for any sufficiently small $r > 0$, there exists a solution $(\lambda_r, x_r)$ of Equation (38) with $\lambda_r \in [\lambda_0 - r, \lambda_0 + r]$ and $\|x_r\| = r$. Thus, let $B_r = \{x \in E : \|x\| < r\}$ be the open ball centered at $x = 0$ and with radius $r$; we consider the degree of various maps with respect to this neighborhood of 0. Precisely, assume for instance $\lambda_0 > 0$ and write $A(x) - \lambda x = -\lambda(x - \mu A(x))$, $\mu = 1/\lambda$, for $\lambda$ near $\lambda_0$. Consider thus the equivalent equation
$$x - \mu A(x) = 0 \tag{41}$$
and let $\mu$ vary in an interval $[\underline{\mu}, \overline{\mu}]$ containing as interior point $\mu_0 = 1/\lambda_0$ and no other characteristic values (as the reciprocals of the nonzero eigenvalues are named) of $T \equiv A'(0)$ except $\mu_0$. Assume by way of contradiction that $x - \mu A(x) \neq 0$ for $\|x\| = r$ and $\mu \in [\underline{\mu}, \overline{\mu}]$; then using the homotopy invariance Property (iv) with $C_1 = \underline{\mu} A$, $C_2 = \overline{\mu} A$ we would have
$$d(I - \underline{\mu} A, B_r, 0) = d(I - \overline{\mu} A, B_r, 0). \tag{42}$$
On the other hand, for small $r > 0$, using again Property (iv) we have
$$d(I - \underline{\mu} A, B_r, 0) = d(I - \underline{\mu} T, B_r, 0) \tag{43}$$
because $I - \underline{\mu} A$ is homotopic to $I - \underline{\mu} T$ on $B_r$; indeed, since the latter operator is a homeomorphism and since $B(x) = o(\|x\|)$ as $x \to 0$, we have (diminishing $r$ if necessary)
$$\|x - \underline{\mu}(Tx + tB(x))\| \geq \|x - \underline{\mu} Tx\| - \underline{\mu}\, t\, \|B(x)\| \geq k \|x\|$$
for some $k > 0$ and for all $(t, x) \in [0, 1] \times \overline{B}_r$. Similarly,
$$d(I - \overline{\mu} A, B_r, 0) = d(I - \overline{\mu} T, B_r, 0). \tag{44}$$
However, using Property (v), we have
$$d(I - \underline{\mu} T, B_r, 0) = (-1)^{\underline{\nu}}, \qquad d(I - \overline{\mu} T, B_r, 0) = (-1)^{\overline{\nu}} \tag{45}$$
where $\overline{\nu} = \underline{\nu} + h$, $h$ an odd integer (the algebraic multiplicity of $\lambda_0$); therefore the two degrees in Equation (45) are different, contradicting the previous equalities in Equations (42)–(44). This proves that $x_r - \mu_r A(x_r) = 0$ for some $x_r \in \partial B_r$ and some $\mu_r \in [\underline{\mu}, \overline{\mu}]$, and therefore that there is bifurcation from $(\mu_0, 0)$ for Equation (41), or equivalently from $(\lambda_0, 0)$ for Equation (38). The proof that under the stated assumptions the bifurcation has a global character, in the sense described by the statement of Theorem 2, requires the much deeper topological analysis performed by P.H. Rabinowitz in his famous paper [31]. ☐
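The parity argument in the proof can be made concrete in finite dimensions, where the degree of an invertible linear map on a ball around 0 is the sign of its determinant. The following is only a minimal sketch under stated assumptions: $E = \mathbb{R}^3$, $T$ is diagonal (so its eigenvalues are read off directly), and the function names `degree_linear` and `sign_det` are hypothetical helpers introduced for illustration. It checks that $(-1)^{\nu}$ from Property (v) agrees with the sign of $\det(I - \mu T)$ and that it flips when $\mu$ crosses the characteristic value $\mu_0 = 1/\lambda_0$ of a simple eigenvalue.

```python
# Finite-dimensional illustration of Property (v).
# Assumption of this sketch: E = R^3 and T is diagonal with the listed eigenvalues.

def degree_linear(mu, eigenvalues_T):
    """Return (-1)^nu, nu = number of eigenvalues of C = mu*T that exceed 1."""
    nu = sum(1 for lam in eigenvalues_T if mu * lam > 1)
    return (-1) ** nu

def sign_det(mu, eigenvalues_T):
    """Sign of det(I - mu*T) for diagonal T: product of the signs of 1 - mu*lam."""
    sign = 1
    for lam in eigenvalues_T:
        sign *= 1 if 1 - mu * lam > 0 else -1
    return sign

eigs_T = [2.0, 0.5, 0.25]    # lambda_0 = 2 is a simple eigenvalue (odd multiplicity)
mu_low, mu_high = 0.4, 0.6   # interval around mu_0 = 1/lambda_0 = 0.5

for mu in (mu_low, mu_high):
    # The two columns agree, and the value changes sign across mu_0.
    print(mu, degree_linear(mu, eigs_T), sign_det(mu, eigs_T))
```

With an eigenvalue of even multiplicity (for instance eigenvalues `[2.0, 2.0, 0.5]`) the two degrees coincide, which is why the argument needs the odd-multiplicity assumption.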
We now go on to comment on assumption H3, and to briefly discuss the corresponding bifurcation result. For the next definition, and the statements following it, see for instance [2] or [30].
Definition 1.
Let $H$ be a real Hilbert space with scalar product denoted $\langle \cdot, \cdot \rangle$. An operator $A : H \to H$ is said to be a gradient (or potential) operator if there exists a differentiable functional $a : H \to \mathbb{R}$ such that
$$\langle A(x), y \rangle = a'(x) y \quad \text{for all } x, y \in H. \tag{46}$$
One then writes $A = \nabla a$; the functional $a$ (the potential of $A$) is uniquely determined by the requirement that $a(0) = 0$, and is explicitly given by the formula
$$a(x) = \int_0^1 \langle A(tx), x \rangle \, dt. \tag{47}$$
A bounded linear operator is a gradient if and only if it is self-adjoint. Moreover, if a gradient operator $A$ is differentiable at a point $x_0$, then $A'(x_0)$ is self-adjoint.
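As a quick sanity check of the potential formula $a(x) = \int_0^1 \langle A(tx), x \rangle\, dt$, the sketch below compares it numerically with a known potential. The choice $H = \mathbb{R}^2$ and $a(x) = (x_1^2 + x_2^2)^2/4 + x_1 x_2$ is a hypothetical example picked for illustration, and the midpoint rule is just one convenient quadrature.

```python
# Hypothetical example on H = R^2: A is the gradient of
# a(x) = (x1^2 + x2^2)^2 / 4 + x1*x2, which satisfies a(0) = 0.

def A(x):
    x1, x2 = x
    s = x1 * x1 + x2 * x2
    return (s * x1 + x2, s * x2 + x1)

def a_exact(x):
    x1, x2 = x
    return (x1 * x1 + x2 * x2) ** 2 / 4 + x1 * x2

def a_from_formula(x, n=1000):
    """Midpoint-rule approximation of the integral of <A(t x), x> over t in [0, 1]."""
    total = 0.0
    for k in range(n):
        t = (k + 0.5) / n
        Ax = A((t * x[0], t * x[1]))
        total += Ax[0] * x[0] + Ax[1] * x[1]
    return total / n

x = (0.7, -1.3)
print(a_exact(x), a_from_formula(x))  # the two values agree up to quadrature error
```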
Theorem 3.
If H3 is satisfied, then $\lambda_0$ is a bifurcation point of Equation (38). Moreover, for each $r > 0$ sufficiently small, Equation (38) has at least two distinct solutions $(\lambda_r, x_r)$ such that $\|x_r\| = r$.
Proof. 
The proof makes use of the Lyapunov–Schmidt method (see, for instance, ([3], Chapter 2) or ([29], Chapter 11)), which allows one to reduce the infinite-dimensional problem in Equation (38) to a problem in the finite-dimensional space $\operatorname{Ker}(T - \lambda_0 I)$. Indeed, consider the equivalent form, Equation (39), of Equation (38), and rewrite it as
$$Lx + B(x) = \delta x \tag{48}$$
where $L = T - \lambda_0 I$ and $\delta = \lambda - \lambda_0$. Now recalling that $T = A'(0)$, the assumption H3 implies that $H$ is the orthogonal sum
$$H = \operatorname{Ker} L \oplus \operatorname{Im} L. \tag{49}$$
Then, letting $P, Q$ denote the orthogonal projections of $H$ onto $\operatorname{Ker} L$ and $\operatorname{Im} L$, respectively, we have
$$x = Px + Qx \equiv v + w$$
and using this in Equation (48), we obtain the equivalent system
$$P B(v + w) = \delta v \tag{51}$$
$$L w + Q B(v + w) = \delta w. \tag{52}$$
The restriction $L|_{\operatorname{Im} L}$ of $L$ to $\operatorname{Im} L$ is a homeomorphism of $\operatorname{Im} L$ onto itself. A standard application of the Implicit Function Theorem, together with the condition $B(x) = o(\|x\|)$ as $x \to 0$, then allows one to solve the complementary equation, Equation (52), in the form
$$w = w(\delta, v)$$
with $w(0, 0) = 0$, where $\delta$ and $v$ belong to suitably small neighborhoods $J$ and $V$ of $\delta = 0$ and $v = 0$, respectively in $\mathbb{R}$ and in $\operatorname{Ker} L$. Replacing this in Equation (51) first yields
$$\langle P B(v + w(\delta, v)), v \rangle = \delta \|v\|^2$$
whence, applying once more the Implicit Function Theorem, one can recover $\delta$ as a function of $v$,
$$\delta = \delta(v), \quad \delta(0) = 0,$$
for $v$ in a neighborhood $V_0 \subset V$ of 0 in $\operatorname{Ker} L$. Finally, putting
$$\phi(v) = w(\delta(v), v), \quad v \in V_0$$
and replacing this in Equation (51), one is left with the finite-dimensional equation (the bifurcation equation)
$$F_0(v) \equiv P B(v + \phi(v)) = \delta(v) v. \tag{57}$$
Any solution $v \in V_0$, $v \neq 0$, of this equation will give rise to a solution $(\delta, x)$, $x \neq 0$,
$$(\delta, x) = (\delta(v), v + \phi(v))$$
of the original Equation (48), and the continuity (in fact, $C^1$ regularity) of the maps $\delta = \delta(v)$, $w = w(\delta, v)$ will ensure that this solution $(\delta, x)$ stays in a given small neighborhood of $(0, 0)$ in $\mathbb{R} \times H$ provided that $v$ is small enough. Thus, proving bifurcation from $\lambda_0$ for Equation (38) (or equivalently, bifurcation from $\delta = 0$ for Equation (48)) reduces to proving that Equation (57) has solutions $v \neq 0$ of arbitrarily small norm.
Remark 2.
The Lyapunov–Schmidt reduction can be applied more generally, and with minor modifications, in a Banach space $E$ whenever the basic assumption H0 (i.e., that $L = T - \lambda_0 I$ is Fredholm of index zero) holds and is supplemented by the transversality condition
$$\operatorname{Ker} L \cap \operatorname{Im} L = \{0\} \tag{58}$$
which is plainly satisfied when $T$ is self-adjoint, as the two subspaces in Equation (58) are then orthogonal. Note that Equation (58) is in general equivalent to $\operatorname{Ker} L = \operatorname{Ker} L^2$, and thus to the fact that the algebraic and geometric multiplicities of $\lambda_0$ coincide. H0 and Equation (58) imply the direct decomposition of $E$ into (closed) subspaces as in Equation (49), and therefore allow for the same reduction on taking for $P, Q$ the (continuous) projections associated with Equation (49).
Returning to the proof of Theorem 3, we now bring in the assumption that the whole $A$, and therefore also its “nonlinear part” $B$, is a gradient. Here, we limit ourselves to giving the main idea of the particularly clear demonstration provided by C. Stuart [32]. Thus, let $f$ be such that $\nabla f = L + B$, and consider the reduced functional $f_0 : V_0 \subset \operatorname{Ker} L \to \mathbb{R}$ defined by putting
$$f_0(v) \equiv f(v + \phi(v)). \tag{59}$$
Moreover, for small $r > 0$ put
$$M_r = \{ v \in V_0 : g(v) \equiv \|v\|^2 + \|\phi(v)\|^2 = r^2 \}. \tag{60}$$
Then, $M_r$ is the level set of the $C^1$ functional $g$, and is compact because it is a closed and bounded subset of the finite-dimensional space $\operatorname{Ker} L$. Thus, $f_0$ attains its minimum and its maximum on $M_r$, and if $v_0 \in M_r$ is such an extremal point we have, by the Lagrange multiplier rule,
$$f_0'(v_0) = \lambda g'(v_0).$$
Performing the computations of $f_0'(v_0)$ and $g'(v_0)$ by the definitions in Equations (59) and (60), and using the fact that $w(\delta, v)$ satisfies the complementary equation, Equation (52), one checks that $\lambda = \delta(v_0)$ and that Equation (57) is satisfied. ☐
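The "two solutions on each small sphere" conclusion of Theorem 3 can be seen in a toy case. The sketch below assumes $H = \mathbb{R}^2$ and the hypothetical gradient operator $A(x) = \lambda_0 x + (x_1^3, -x_2^3)$, whose potential is $a(x) = \lambda_0 \|x\|^2/2 + (x_1^4 - x_2^4)/4$. Here $T = A'(0) = \lambda_0 I$, so $\operatorname{Ker}(T - \lambda_0 I)$ is all of $\mathbb{R}^2$ (even multiplicity) and the reduction is trivial ($\phi = 0$); the maximum and the minimum of $a$ on the sphere $\|x\| = r$ give two distinct eigenpairs. Names such as `two_solutions` are illustrative only.

```python
import math

LAMBDA0 = 1.0  # hypothetical unperturbed eigenvalue, of multiplicity 2

def A(x):
    """Gradient of a(x) = LAMBDA0*|x|^2/2 + (x1^4 - x2^4)/4 on R^2."""
    x1, x2 = x
    return (LAMBDA0 * x1 + x1 ** 3, LAMBDA0 * x2 - x2 ** 3)

def residual(lam, x):
    """Norm of A(x) - lam*x."""
    Ax = A(x)
    return math.hypot(Ax[0] - lam * x[0], Ax[1] - lam * x[1])

def two_solutions(r):
    """Maximizer and minimizer of a on the sphere |x| = r, with their eigenvalues."""
    return [(LAMBDA0 + r * r, (r, 0.0)), (LAMBDA0 - r * r, (0.0, r))]

for r in (0.1, 0.01):
    for lam, x in two_solutions(r):
        # Both eigenvalues tend to LAMBDA0 as r -> 0, and both residuals vanish.
        print(r, lam, residual(lam, x))
```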
We finally come to H1. Unlike H2 and H3, in general H1 is independent from H0, and must be supplemented with it to guarantee bifurcation. Of course, when E is finite dimensional, H0 does not play any role, and indeed H1 can in this case be viewed as a special case of H2, because any continuous map is then compact.
Theorem 4.
If H0 and H1 are satisfied, then $\lambda_0$ is a bifurcation point of Equation (38). Moreover, if $A$ is of class $C^2$ in a neighborhood of $x = 0$, then near $(\lambda_0, 0)$ the solution set of Equation (38) consists of the trivial solutions $\{(\lambda, 0)\}$ and of a $C^1$ curve
$$\gamma(t) = (\lambda(t), x(t)), \quad t \in \, ]-\delta, \delta[$$
with $\gamma(0) = (\lambda_0, 0)$ and $x(t) \neq 0$ for $t \neq 0$. Finally, if $\operatorname{Ker} L = [\phi]$, then as $t \to 0$
$$x(t) = t\phi + o(|t|), \qquad \lambda(t) = \lambda_0 + o(1).$$
The statement of Theorem 4 means that near $(\lambda_0, 0)$, the solution set of Equation (38) is topologically equivalent to the “cross”
$$( \, ]-1, 1[ \, \times \{0\}) \cup (\{0\} \times \, ]-1, 1[ \, ).$$
As to the proof, its first part goes along the same lines used to prove the previous Theorem 3, that is, using the Lyapunov–Schmidt decomposition in the sense indicated in Remark 2. What is specific here is that, since $\dim \operatorname{Ker} L = 1$, one ends with an equation in $\mathbb{R}$; a further nontrivial application of the Implicit Function Theorem then leads to the result: see, for instance, ([3], Chapter 2).

2.3. A Very Special Nonlinear Problem: The p-Laplace Equation

Let $\Omega$ be a bounded open set in $\mathbb{R}^n$, let $p > 1$, and let $E$ be the Sobolev space $W_0^{1,p}(\Omega)$, equipped with the norm
$$\|v\|_{W_0^{1,p}} = \left( \int_\Omega |\nabla v|^p \, dx \right)^{1/p}.$$
That this is actually a norm in $W_0^{1,p}(\Omega)$, equivalent to the standard one of $W^{1,p}(\Omega)$, is a consequence of Poincaré’s inequality (see e.g., [14]), stating that
$$\int_\Omega |v|^p \, dx \leq C \int_\Omega |\nabla v|^p \, dx \tag{64}$$
for some $C > 0$ and for all $v \in W_0^{1,p}(\Omega)$. Let $E^* = W^{-1,p'}(\Omega)$ (with $1/p + 1/p' = 1$) be the dual space of $E$. A (weak) solution of the p-Laplace Equation (5) is a function $u \in E$ such that
$$A_p(u) = \lambda B_p(u) \tag{65}$$
where $\lambda = \mu^{-1}$ (it will soon be clear that $\mu = 0$ is not an eigenvalue of Equation (5)) and $A_p, B_p : E \to E^*$ are defined by duality via the equations
$$\langle A_p(u), v \rangle = \int_\Omega |u|^{p-2} u v \, dx, \qquad \langle B_p(u), v \rangle = \int_\Omega |\nabla u|^{p-2} \nabla u \cdot \nabla v \, dx \tag{66}$$
where $u, v \in E$ and $\langle \cdot, \cdot \rangle$ denotes the duality pairing between $E$ and $E^*$.
The proof of the existence of countably many eigenvalues and eigenfunctions of Equation (65) relies on the Lusternik–Schnirelmann (LS) theory of critical points for an even functional on a symmetric manifold. Complete presentations of this theory, in both finite and infinite dimensional spaces, can be found, among others, in [3,4,29,33,34]. Theorem 5 below is essentially a simplified version of Theorem A in [35], save that with respect to [35] we have for expository convenience interchanged the roles of the operators $A$ and $B$. Thus, let $E$ be a real, infinite dimensional, uniformly convex Banach space with dual $E^*$, and consider the problem
$$A(u) = \lambda B(u) \tag{67}$$
where $A, B : E \to E^*$ are continuous gradient operators with potentials $a, b$, respectively: $A = \nabla a$, $B = \nabla b$. Definition 1 of gradient operator extends of course to mappings of $E$ into $E^*$, replacing the scalar product with the duality pairing.
Suppose that $b(u) > 0$ for $u \neq 0$; then, the eigenvectors of Equation (67) satisfying a normalization condition $b(u) = r$ ($r > 0$) are precisely the constrained critical points of $a$ on the level set
$$M_r = \{ u \in E : b(u) = r \}.$$
The additional key assumptions that we make on A and B are as follows:
  • $A, B$ are odd (that is, $A(-u) = -A(u)$ for $u \in E$, and similarly for $B$).
  • $A$ is non-negative (that is, $\langle A(u), u \rangle \geq 0$ for $u \in E$) and strongly sequentially continuous (that is, if $(u_n) \subset E$ converges weakly to $u_0 \in E$, then $A(u_n)$ converges strongly to $A(u_0)$ in $E^*$).
  • $B$ is strongly monotone in the following sense: there exist constants $k > 0$ and $p > 1$ such that, for all $u, v \in E$,
    $$\langle B(u) - B(v), u - v \rangle \geq k \|u - v\|^p.$$
By the above assumptions on $B$, $M_r$ is symmetric (that is, $u \in M_r \Rightarrow -u \in M_r$) and sphere-like, in the sense that each ray through the origin hits $M_r$ in exactly one point. If $K \subset M_r$ is compact and symmetric, then the genus of $K$, denoted $\gamma(K)$, is defined as
$$\gamma(K) = \inf \{ n \in \mathbb{N} : \text{there exists a continuous odd map of } K \text{ into } \mathbb{R}^n \setminus \{0\} \}.$$
If $V$ is a subspace of $E$ with $\dim V = n$, then $\gamma(M_r \cap V) = n$. For $n \in \mathbb{N}$ put
$$\mathcal{K}_n(r) = \{ K \subset M_r : K \text{ compact and symmetric}, \; \gamma(K) \geq n \}. \tag{69}$$
Theorem 5.
Let $A, B : E \to E^*$ be as above. Suppose moreover that $a(u) \neq 0$ implies $A(u) \neq 0$. For $n \in \mathbb{N}$ and $r > 0$, put
$$C_n(r) \equiv \sup_{K \in \mathcal{K}_n(r)} \inf_{u \in K} a(u)$$
where $\mathcal{K}_n(r)$ is as in Equation (69). Then
$$\sup_{u \in M_r} a(u) = C_1(r) \geq \dots \geq C_n(r) \geq C_{n+1}(r) \geq \dots \geq 0.$$
Moreover, $C_n(r) \to 0$ as $n \to \infty$, and if $C_n(r) > 0$ then $C_n(r)$ is attained and is a critical value of $a$ on $M_r$: thus, there exist $u_n(r) \in M_r$ and $\lambda_n(r) \in \mathbb{R}$ such that
$$C_n(r) = a(u_n(r)) \tag{72}$$
and
$$A(u_n(r)) = \lambda_n(r) B(u_n(r)). \tag{73}$$
Here are a few indications for the Proof of Theorem 5:
(i) The sequence $(C_n(r))$ is non-increasing because, for any $n \in \mathbb{N}$, we have $\mathcal{K}_n(r) \supset \mathcal{K}_{n+1}(r)$ as shown by Equation (69), so that the supremum defining $C_{n+1}(r)$ is taken over a smaller family. (ii) In addition, $C_1(r) = \sup_{M_r} a(u)$ because $\mathcal{K}_1(r)$ contains all sets of the form $\{x\} \cup \{-x\}$, $x \in M_r$. (iii) The proof that $C_n(r) \to 0$ as $n \to \infty$, together with a lot of related information, can be found for instance in [34]. (iv) Finally, the assumption that $A(u) \neq 0$ whenever $a(u) \neq 0$, together with the stated continuity properties of $A$ and $B$, ensures that $a$ satisfies the crucial Palais–Smale (PS) condition on $M_r$ at any level $C > 0$, needed to prove the final (and most important) assertion of the Theorem via the standard deformation methods of Critical Point Theory; see for this any of the above cited references.
Of special importance, with reference to the p-Laplace equation, is the case in which $A$ and $B$ have the additional property of being positively homogeneous of the same degree $p - 1 > 0$, meaning that $A(tu) = t^{p-1} A(u)$ for $u \in E$ and $t > 0$, and similarly for $B$. In this case, we have from Equation (47)
$$a(u) = \frac{\langle A(u), u \rangle}{p}, \qquad b(u) = \frac{\langle B(u), u \rangle}{p} \tag{74}$$
so that $a(u) \neq 0$ implies $A(u) \neq 0$. Moreover, the use of Equation (74) in Equations (72) and (73) yields at once the relation $C_n(r) = \lambda_n(r)\, r$. In fact, here, $\lambda_n(r)$ is independent of $r > 0$: to see this, it is convenient to re-parameterize the level sets by putting, for $R > 0$,
$$M_R = \left\{ u \in E : b(u) = \frac{R^p}{p} \right\} = \{ u \in E : \langle B(u), u \rangle = R^p \}.$$
As $a$ and $b$ are $p$-homogeneous, it follows that $M_R = R M_1$, that each $K \in \mathcal{K}_n(R)$ is the image of the corresponding set in $\mathcal{K}_n(1)$ under the map $u \mapsto Ru$, and that $C_n(R) = R^p C_n(1)$. By these remarks, we thus have the equalities
$$\lambda_n(R) \frac{R^p}{p} = C_n(R) = R^p C_n(1)$$
showing as expected that $\lambda_n(R)$ is independent of $R$, and precisely that
$$\lambda_n(R) = p\, C_n(1) = \sup_{K \in \mathcal{K}_n} \inf_{u \in K} \langle A(u), u \rangle \equiv \lambda_n \tag{75}$$
where $\mathcal{K}_n \equiv \mathcal{K}_n(1)$. From Theorem 5, we then get immediately the following statements about $\lambda_n$:
  • $\sup_{u \in M_1} \langle A(u), u \rangle = \lambda_1 \geq \lambda_2 \geq \dots \geq \lambda_n \geq \dots$;
  • $\lambda_n \to 0$ as $n \to \infty$; and
  • if $\lambda_n > 0$, then there exists $u_n \in M_1$ (that is, $\langle B(u_n), u_n \rangle = 1$) such that $A(u_n) = \lambda_n B(u_n)$; in particular, $\lambda_n = \langle A(u_n), u_n \rangle$.
Remark 3.
The situation just described contains as a more special case that of two linear operators $A$ and $B$, in which the above formulae hold with $p = 2$. Suppose in particular that $A$ acts in a real Hilbert space $H$ and $B = I$; then $M_1 = \{ u \in H : \|u\| = 1 \}$ is the unit sphere in $H$, while $A$ is a compact, self-adjoint, non-negative linear operator (strong sequential continuity and compactness are equivalent properties for a linear operator acting in a reflexive Banach space, see e.g., [15]). Then, Equation (75) and the statements following this formula yield a good part of the familiar spectral properties of such operators: indeed it is not hard to see that the LS variational characterization in Equation (75) of $\lambda_n$ reduces in this case to the classical Courant minimax principle expressed by Equation (15), so that the sequence in Equation (75) of the LS eigenvalues of $A$ coincides with the decreasing sequence of all the eigenvalues of $A$, each counted with its multiplicity.
Returning finally to the p-Laplacian, it is now a matter of applying the above information to the operators $A_p, B_p$ defined in Equation (66). One can check (see [36,37], for instance) that they satisfy all the requirements for the application of Theorem 5. Moreover, they are evidently positively homogeneous of degree $p - 1$, and finally $A_p$ is (strictly) positive, for
$$\langle A_p(v), v \rangle = \int_\Omega |v|^p \, dx > 0 \quad \text{for } v \in E, \; v \neq 0.$$
This implies that each of the numbers $\lambda_n$ defined in Equation (75) for the pair $A_p, B_p$ is strictly positive, whence it follows, using the last statement of Theorem 5, that the eigenvalue problem in Equation (65) for the p-Laplacian possesses an infinite sequence of eigenvalues $\lambda_n > 0$, each given by
$$\lambda_n = \sup_{K \in \mathcal{K}_n} \inf_{v \in K} \int_\Omega |v|^p \qquad (n = 1, 2, \dots)$$
where
$$\mathcal{K}_n = \left\{ K \subset \left\{ v \in W_0^{1,p} : \int_\Omega |\nabla v|^p = 1 \right\}, \; K \text{ compact and symmetric}, \; \gamma(K) \geq n \right\}.$$
Setting $\mu_n = \lambda_n^{-1}$, this finally proves the properties of Equation (5) stated in the Introduction, and in particular Equation (6).
Remark 4.
For the very special properties owned by the first eigenvalue $\mu_1$ in the sequence in Equation (6) and by the associated eigenfunctions, see for instance [37]. Anyway, it follows by our discussion that $\lambda_1 = \mu_1^{-1}$ is the best constant in Poincaré’s inequality, Equation (64):
$$\lambda_1 = \sup_{v \in W_0^{1,p}, \, v \neq 0} \frac{\int_\Omega |v|^p}{\int_\Omega |\nabla v|^p}.$$
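For orientation, the characterization of $\lambda_1$ as a supremum of quotients can be checked numerically in the simplest linear case. The sketch below assumes $p = 2$ and $\Omega = (0, 1)$, where the first Dirichlet eigenfunction is $\sin(\pi x)$ and the best Poincaré constant is the classical value $1/\pi^2$; the helper `quotient` and the midpoint quadrature are illustrative choices, not part of the original argument.

```python
import math

def quotient(v, dv, n=20000):
    """Midpoint-rule approximation of (integral of v^2) / (integral of (v')^2) on (0, 1)."""
    num = den = 0.0
    for k in range(n):
        x = (k + 0.5) / n
        num += v(x) ** 2
        den += dv(x) ** 2
    return num / den

# First eigenfunction of the Dirichlet Laplacian on (0, 1): attains the supremum.
q_sin = quotient(lambda x: math.sin(math.pi * x),
                 lambda x: math.pi * math.cos(math.pi * x))

# Another admissible function (vanishing at 0 and 1): gives a smaller quotient.
q_par = quotient(lambda x: x * (1.0 - x),
                 lambda x: 1.0 - 2.0 * x)

print(q_sin, 1.0 / math.pi ** 2, q_par)
```

Any other test function vanishing at the endpoints can be substituted for the parabola; its quotient should not exceed $1/\pi^2$.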
To conclude this section, let us remark that the study and research in problems related to the p-Laplacian has grown enormously in the last decades, and even remaining in the strict context of a “spectral theory” for Equation (5), one should at least mention the following relevant points: (i) the problem of the asymptotic distribution of the LS eigenvalues (along the classical Weyl’s law for the Laplacian); (ii) the question of the existence of other eigenvalues outside the LS sequence; and (iii) the Fredholm alternative for perturbed non-homogeneous versions of Equation (5). For information on these issues, we refer the reader to [37,38,39] and to the recent and very clear review paper [36]. Related material can be found in [40].

3. Nonlinear Perturbation of an Isolated Eigenvalue

As a way to introduce and motivate the more specific content of this section, let me start by recalling a famous and beautiful result of F. Rellich in perturbation theory of linear eigenvalue problems:
Theorem 6.
([41], Theorem 1). Let $A(\epsilon)$ be a family of Hermitian $n \times n$ matrices depending analytically on the real parameter $\epsilon$ for $\epsilon$ near 0. Let $\lambda_0$ be an eigenvalue of multiplicity $m > 1$ of $A = A(0)$. Then, for $\epsilon$ near 0, $A(\epsilon)$ possesses $m$ eigenvalues
$$\lambda_1(\epsilon), \dots, \lambda_m(\epsilon)$$
and corresponding orthonormal eigenvectors $u_1(\epsilon), \dots, u_m(\epsilon)$; that is, for all sufficiently small $\epsilon$, we have
$$A(\epsilon) u_i(\epsilon) = \lambda_i(\epsilon) u_i(\epsilon) \qquad (i = 1, \dots, m). \tag{76}$$
Moreover, $\lambda_i(0) = \lambda_0$ for all $i = 1, \dots, m$ and the functions $\lambda_i$ and $u_i$ depend analytically on $\epsilon$ near $\epsilon = 0$.
As is well known, the “ideal” situation described by Theorem 6 for the splitting of the multiple eigenvalue does not hold in general. In Rellich’s words, “...our question about the eigenvalues reduces to asking whether or not the zeroes of a polynomial [in the case, the characteristic polynomial of a matrix whose elements depend analytically on a parameter $\epsilon$] are themselves regular analytic functions of $\epsilon$ for small $\epsilon$. In general the answer is no; a counterexample is $\lambda^2 + \epsilon$. What is true is that if $\lambda = \lambda(0)$ is a zero for $\epsilon = 0$, then the zero $\lambda(\epsilon)$ can be written as a convergent (for small $\epsilon$) power series in $\epsilon^{1/h}$ (Puiseux series) where $h$ is the multiplicity of $\lambda = \lambda(0)$.” The example indicated by Rellich can be displayed as
$$A(\epsilon) \equiv \begin{pmatrix} 0 & 1 \\ -\epsilon & 0 \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix} + \epsilon \begin{pmatrix} 0 & 0 \\ -1 & 0 \end{pmatrix} \equiv A + \epsilon B$$
and shows the unperturbed eigenvalue $\lambda_0 = 0$ of $A$, of multiplicity $h = 2$, splitting into the two simple eigenvalues $\lambda(\epsilon) = \pm(-\epsilon)^{1/2}$ of $A(\epsilon)$. In general, if $\lambda_0$ has multiplicity $h$, the perturbed eigenvalue(s) $\lambda(\epsilon)$ will admit an expansion such as
$$\lambda(\epsilon) = \lambda_0 + \epsilon^{1/h} \lambda_1 + \epsilon^{2/h} \lambda_2 + \dots + \epsilon \lambda_h + \dots = \lambda_0 + \sum_{i=1}^{\infty} \epsilon^{i/h} \lambda_i. \tag{78}$$
For the special case that $A(\epsilon)$ is Hermitian, using the reality of $\lambda(\epsilon)$ Rellich showed in [41] that only integral powers of $\epsilon$ can have non-zero coefficients in the expansion of Equation (78), thus proving the analytic dependence on $\epsilon$ of the perturbed eigenvalues as stated in Theorem 6. Rellich’s work was a main starting point for the very vast literature concerning the systematic analysis of the perturbation of eigenvalues of linear operators, both in finite and infinite dimensional spaces; see Kato’s book [7] and the references therein. Our aim in this section is to indicate some partial results about similar questions for nonlinear eigenvalue problems, both of type G and of type K, recently appearing in [8] and [9], respectively.
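The contrast between the non-Hermitian example and the Hermitian case can be seen with a few lines of arithmetic. This is a minimal sketch, assuming the $2 \times 2$ family $\begin{pmatrix} 0 & 1 \\ -\epsilon & 0 \end{pmatrix}$ (characteristic polynomial $\lambda^2 + \epsilon$) and, for comparison, the hypothetical Hermitian family $\begin{pmatrix} 0 & \epsilon \\ \epsilon & 0 \end{pmatrix}$ (characteristic polynomial $\lambda^2 - \epsilon^2$); the function names are illustrative.

```python
import cmath

def eig_nonhermitian(eps):
    """Eigenvalues of [[0, 1], [-eps, 0]]: the two roots of lambda^2 + eps = 0."""
    root = cmath.sqrt(-eps)
    return (root, -root)

def eig_hermitian(eps):
    """Eigenvalues of [[0, eps], [eps, 0]]: the two roots of lambda^2 - eps^2 = 0."""
    return (eps, -eps)

for eps in (1e-2, 1e-4, 1e-6):
    lam = eig_nonhermitian(eps)[0]
    # |lambda|/eps grows like eps^(-1/2): the branch is a power series in
    # eps^(1/2), not in eps. The Hermitian branch is linear in eps.
    print(eps, abs(lam) / eps, eig_hermitian(eps)[0] / eps)
```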

3.1. A Perturbation Problem of Type G

In the paper [8], the authors study the splitting of a multiple eigenvalue of the nonlinear eigenvalue problem, depending on the real parameter $\epsilon$,
$$M(\lambda; \epsilon) v = 0, \quad \lambda \in \mathbb{C}, \; v \in \mathbb{C}^n, \; v \neq 0. \tag{79}$$
Here, $M(\lambda; \epsilon)$ is an $n \times n$ complex matrix having an eigenvalue $\lambda_0$ for $\epsilon = 0$ (i.e., $\det M(\lambda_0; 0) = 0$). As in the linear case, a perturbation theory for the eigenvalue $\lambda_0$ consists in the study of the eigenvalues of Equation (79), and of the corresponding eigenvectors, in the vicinity of $\lambda_0$, and will focus precisely on the behaviour of such eigenvalues/eigenvectors as functions $\lambda(\epsilon), v(\epsilon)$ of the parameter $\epsilon$ for $\epsilon$ near 0; one assumes that the solutions of Equation (79) for $\epsilon = 0$ are known, i.e., that the nullspace of $M(\lambda_0; 0)$ is known. In the linear case, we have
$$M(\lambda; \epsilon) = A(\epsilon) - \lambda I$$
for some assigned function $A$ of $\epsilon$ into $\mathbb{C}^{n \times n}$, and Rellich’s theorem can be rephrased by saying that if this function is analytic and with Hermitian values, and if
$$\dim \operatorname{Ker}(A(0) - \lambda_0 I) = m$$
then there exist $m$ pairs of analytic functions $\lambda_i(\epsilon), u_i(\epsilon)$ such that $\lambda_i(0) = \lambda_0$, $u_i(0) \in \operatorname{Ker}(A(0) - \lambda_0 I)$, each pair satisfying identically Equation (76) for $\epsilon$ near 0.
For the study of Equation (79), it is assumed that $M(\lambda; \epsilon)$ depends regularly on $\lambda$ and $\epsilon$ in the following sense: there exists an open set $\Omega \subset \mathbb{C}$ containing $\lambda_0$, and an open interval $I \subset \mathbb{R}$ containing zero, such that for all $\epsilon \in I$ the entries of $M$ are analytic functions of $\lambda$ in $\Omega$, and for all $\lambda \in \Omega$ the entries of $M$ are smooth functions of $\epsilon$ in $I$. In the first part of [8], the authors develop previous work on the subject and consider the case in which the geometric multiplicity of $\lambda_0$ (that is, the dimension of the nullspace of $M(\lambda_0, 0)$) is one, while its algebraic multiplicity (that is, the multiplicity of $\lambda_0$ as a root of the characteristic equation $\det M(\lambda; 0) = 0$) is $m > 1$. Thus, $\lambda_0$ is a multiple, nonsemisimple eigenvalue of $M$ for $\epsilon = 0$. The following notations are used in the sequel:
$$M_\epsilon \equiv \frac{\partial M}{\partial \epsilon}(\lambda_0, 0); \quad M_\lambda \equiv \frac{\partial M}{\partial \lambda}(\lambda_0, 0); \quad M_{\lambda\lambda} \equiv \frac{\partial^2 M}{\partial \lambda^2}(\lambda_0, 0); \quad \dots \quad M_{\lambda^m} \equiv \frac{\partial^m M}{\partial \lambda^m}(\lambda_0, 0).$$
Theorem 7.
[8]. Let $\lambda_0$ be an eigenvalue of Equation (79) for $\epsilon = 0$, with algebraic multiplicity equal to $m$ and geometric multiplicity one, with Jordan chain $(H_0, \dots, H_{m-1})$. Let $U_0$ be the corresponding left eigenvector. Assume that the condition $U_0^* M_\epsilon H_0 \neq 0$ holds. Then, around $\epsilon = 0$, the eigenvalues in the vicinity of $\lambda_0$ can be expanded as the branches of the Puiseux series in Equation (78), where
$$\lambda_1^m = - \frac{U_0^* M_\epsilon H_0}{U_0^* \left( \frac{1}{1!} M_\lambda H_{m-1} + \frac{1}{2!} M_{\lambda\lambda} H_{m-2} + \dots + \frac{1}{m!} M_{\lambda^m} H_0 \right)}. \tag{80}$$
Remark 5.
In the classical terminology of Numerical Analysis (see e.g., [42], p. 137), a (column) vector $U \in \mathbb{C}^n = \mathbb{C}^{n \times 1}$ is a left eigenvector of a matrix $M$ if $U^* M = 0$, where $U^*$ denotes the transpose of the conjugate. “Starring both sides”, this is equivalent to $M^* U = 0$, that is, $U$ is a (“right”) eigenvector of the adjoint matrix $M^*$. With the same notations, for the scalar product in $\mathbb{C}^n$ we have
$$\langle x, y \rangle = \sum_{i=1}^{n} x_i \overline{y_i} = y^* x$$
(the last product being the matrix product between $y^* \in \mathbb{C}^{1 \times n}$ and $x \in \mathbb{C}^{n \times 1}$), and therefore Equation (80) reads
$$\lambda_1^m = - \frac{\langle M_\epsilon H_0, U_0 \rangle}{\langle Z, U_0 \rangle}$$
with $Z = \frac{1}{1!} M_\lambda H_{m-1} + \frac{1}{2!} M_{\lambda\lambda} H_{m-2} + \dots + \frac{1}{m!} M_{\lambda^m} H_0$.
To some extent, the proof of Theorem 7 relies on previous work by Lancaster et al. [25,43] on the perturbation of analytic matrix functions. To indicate the main idea followed to obtain Equation (80), consider that by definition the perturbed $\lambda(\epsilon), v(\epsilon)$ have to satisfy for all $\epsilon$ the condition
$$M(\lambda(\epsilon), \epsilon) v(\epsilon) = 0. \tag{81}$$
Now use the Taylor expansion of $M(\lambda, \epsilon)$ around $(\lambda_0, 0)$,
$$M(\lambda, \epsilon) = M(\lambda_0, 0) + M_\epsilon \epsilon + \frac{1}{1!} M_\lambda (\lambda - \lambda_0) + \frac{1}{2!} M_{\lambda\lambda} (\lambda - \lambda_0)^2 + \dots$$
and replace $\lambda$ with the expansion of Equation (78) for $\lambda(\epsilon)$. Using a similar expansion for $v(\epsilon)$,
$$v(\epsilon) = V_0 + \sum_{i=1}^{\infty} \epsilon^{i/m} V_i$$
starting with an eigenvector $V_0$ associated with $\lambda_0$, and putting all this in Equation (81) yields (equating to zero the coefficients of the increasing powers of $\epsilon^{1/m}$) $m + 1$ recursive equations that contain the elements of a Jordan chain built upon the unknown vectors $V_1, \dots, V_m$. Solving these equations with the help of a technical lemma (Lemma 2.1 in [8]) that relates all possible Jordan chains corresponding to the same eigenvalue, one returns to the original chain $(H_0, \dots, H_{m-1})$ and finally obtains Equation (80).
Example 1.
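[8]. Consider the perturbed matrix
$$M_2(\lambda, \epsilon) = \begin{pmatrix} \lambda - 1 + e^{-\lambda} + \epsilon & 0 \\ 0 & \lambda + 1 + \epsilon \end{pmatrix}$$
that for $\epsilon = 0$ reduces to Equation (30). For the unperturbed eigenvalue $\lambda_0 = 0$, in addition to the Jordan chain in Equation (31), one has
$$U_0 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad M_\epsilon = I, \quad M_\lambda = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}, \quad M_{\lambda\lambda} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}.$$
Substituting these values in Equation (80), one obtains
$$\lambda_1^2 = -2, \quad \text{that is,} \quad \lambda_1 = \pm i \sqrt{2}.$$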
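The leading Puiseux coefficient of this example can be cross-checked numerically. The sketch below assumes that the relevant scalar equation is the $(1,1)$ entry of the perturbed matrix, $\lambda - 1 + e^{-\lambda} + \epsilon = 0$ (the other diagonal entry only has the root $-1 - \epsilon$, far from $\lambda_0 = 0$), and solves it by Newton's method in the complex plane; the function name and the choice of Newton's method are illustrative, not taken from [8]. Since $\lambda - 1 + e^{-\lambda} = \lambda^2/2 - \dots$, the perturbed eigenvalue should satisfy $\lambda(\epsilon)^2 / \epsilon \to -2$ as $\epsilon \to 0^+$.

```python
import cmath

def perturbed_eigenvalue(eps, iterations=50):
    """Newton iteration for lam - 1 + exp(-lam) + eps = 0, starting near 0."""
    lam = cmath.sqrt(-2.0 * eps)  # initial guess taken from the leading Puiseux term
    for _ in range(iterations):
        f = lam - 1.0 + cmath.exp(-lam) + eps
        df = 1.0 - cmath.exp(-lam)   # derivative with respect to lam
        lam -= f / df
    return lam

for eps in (1e-2, 1e-4, 1e-6):
    lam = perturbed_eigenvalue(eps)
    # The ratio lam^2 / eps approaches -2 as eps decreases.
    print(eps, lam, lam ** 2 / eps)
```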
In the second part of [8], the authors consider general linear functional differential equations of the form
$$x'(t) = \int_{-\tau_{max}}^{0} d\mu(\theta)\, x(t + \theta), \quad x(t) \in \mathbb{C}^n \tag{84}$$
where $\mu : [-\tau_{max}, 0] \to \mathbb{C}^{n \times n}$ is a function of bounded variation such that $\mu(0) = 0$. We refer to Chapter 7 of [26] for a thorough discussion of this kind of problem. Note that Equation (84) contains as special cases both equations with discrete delay and equations with continuous delay: indeed, Equation (84) takes the form of Equation (35) if one lets
$$0 = \tau_0 < \tau_1 < \dots < \tau_k \equiv \tau_{max}$$
and defines $\mu : [-\tau_{max}, 0] \to \mathbb{C}^{n \times n}$ as follows:
$$\mu(0) = 0, \qquad \mu(\theta) = - \sum_{i = 0, \; -\tau_i > \theta}^{k} A_i \;\; \text{for } \theta \in (-\tau_{max}, 0), \qquad \mu(-\tau_{max}) = - \sum_{i=0}^{k} A_i.$$
On the other hand, taking in Equation (84) $\mu(\theta) = -\int_\theta^0 A(s)\, ds$ for $\theta \in [-\tau_{max}, 0]$, with $A$ a continuous function from $[-\tau_{max}, 0]$ to $\mathbb{C}^{n \times n}$, yields the system with distributed delay
$$x'(t) = \int_{-\tau_{max}}^{0} A(\theta)\, x(t + \theta)\, d\theta.$$
The relation of Equation (84) with Equation (79) is as follows: looking for solutions $x(t) = e^{\lambda t} v$ ($v \in \mathbb{C}^n$) of Equation (84) yields the equation
$$\lambda v = \int_{-\tau_{max}}^{0} d\mu(\theta)\, e^{\lambda \theta} v \equiv N(\lambda) v \tag{86}$$
that is a non-parametric form of Equation (79) with $M(\lambda) = \lambda I - N(\lambda)$. The authors then consider the infinite-dimensional vector space $X = \mathbb{C}^n \times L^2([-\tau_{max}, 0], \mathbb{C}^n)$ and a suitably defined linear operator $\mathcal{A}$ acting in $X$ and having the property that Equation (84) can be rewritten as the abstract ordinary differential equation $z'(t) = \mathcal{A} z(t)$ in $X$. Moreover, $\lambda_0$ is an eigenvalue of the nonlinear eigenvalue problem in Equation (86) if and only if it is an eigenvalue of the linear operator $\mathcal{A}$, and in Theorem 3.1 of [8] it is shown how to build an (ordinary) Jordan chain for $\mathcal{A}$ corresponding to $\lambda_0$ starting from a Jordan chain for $\lambda_0$ as an eigenvalue of the NLEVP in Equation (86), and vice versa; for this matter, see also ([26], Chapter 7, Theorem 4.2). Further exploiting this functional-analytic point of view, the authors are then able to deal with parameter-dependent forms of Equation (84) (that is, with functions $\mu = \mu(\theta, \epsilon)$) and to reformulate the sensitivity formula in Equation (80) for the eigenvalues $\lambda_\epsilon$ of the perturbed matrix $N(\lambda; \epsilon)$, corresponding to $\mu = \mu(\theta, \epsilon)$ as in Equation (86), in terms of eigenvectors and generalized eigenvectors of the linear operator $\mathcal{A}(\epsilon)$ acting in $X$. This produces a more readable formula, given in Theorem 3.2 of [8], for the coefficient $\lambda_1$ of the leading term in the expansion in Equation (78).
The concluding section of [8] shows applications of the theory to some numerical examples, that deal in particular with a planar time-delay system containing an uncertain delay τ + ϵ and with a model problem for spectral abscissa optimization.
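To make the passage from a delay equation to a nonlinear eigenvalue problem concrete, consider the simplest scalar case with one discrete delay, $x'(t) = a\, x(t - \tau)$: the ansatz $x(t) = e^{\lambda t}$ gives the characteristic equation $\lambda - a e^{-\lambda \tau} = 0$, i.e. $M(\lambda) = \lambda - N(\lambda)$ with $N(\lambda) = a e^{-\lambda \tau}$. The sketch below is only an illustration under these assumptions (the values $a = -1$, $\tau = 1$, the initial guess and the helper name `characteristic_root` are hypothetical choices, and Newton's method is not the numerical method of [8]).

```python
import cmath

def characteristic_root(a, tau, guess, iterations=50):
    """Newton iteration for M(lam) = lam - a*exp(-lam*tau) = 0."""
    lam = complex(guess)
    for _ in range(iterations):
        e = cmath.exp(-lam * tau)
        f = lam - a * e
        df = 1.0 + a * tau * e   # derivative of M with respect to lam
        lam -= f / df
    return lam

a, tau = -1.0, 1.0
lam = characteristic_root(a, tau, guess=-0.3 + 1.3j)
residual = abs(lam - a * cmath.exp(-lam * tau))
print(lam, residual)  # a root with negative real part, and a residual near zero
```

Unlike a matrix eigenvalue problem, this single scalar equation has infinitely many complex roots; the initial guess selects which one the iteration finds.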

3.2. A Perturbation Problem of Type K

In [9] we have considered the following parameter-dependent version of Equation (39),
$$T x + \epsilon B(x) = \lambda x, \quad x \in S \tag{87}$$
where, as in Section 2.2, $T$ is a self-adjoint bounded linear operator acting in a real Hilbert space $H$ and having $\lambda_0 \in \mathbb{R}$ as an isolated eigenvalue of finite multiplicity. In Equation (87), $S$ stands for the unit sphere in $H$, so that $S \cap \operatorname{Ker}(T - \lambda_0 I)$ is the unit sphere in some $\mathbb{R}^n$. As to the nonlinear term $B$, we shall soon give precise assumptions, but roughly speaking we can say that the factor $\epsilon$ appearing before it in Equation (87) replaces the condition $B(x) = o(\|x\|)$ as $x \to 0$ previously considered for bifurcation in Equation (39). Indeed, rather than looking for solutions of small norm as in Equation (39), we now look for normalized eigenvectors of the perturbed eigenvalue problem $T x + \epsilon B(x) = \lambda x$. Here is our result for Equation (87):
Theorem 8.
Let $T$ be a self-adjoint bounded linear operator acting in a real Hilbert space $H$, and having $\lambda_0$ as an isolated eigenvalue of finite multiplicity. Suppose that $B$ is a $C^1$ map of $H$ into itself, and suppose moreover that at least one of the following conditions is satisfied: either
(a) the dimension of the nullspace $N \equiv \operatorname{Ker}(T - \lambda_0 I)$ is odd; or
(b) $B$ is a gradient operator.
Then, there exist $\epsilon_0 > 0$, $\delta_0 > 0$ such that for any $\epsilon \in [-\epsilon_0, \epsilon_0]$, there exist $\lambda_\epsilon \in [\lambda_0 - \delta_0, \lambda_0 + \delta_0]$ and $x_\epsilon \in S$ such that
$$T x_\epsilon + \epsilon B(x_\epsilon) = \lambda_\epsilon x_\epsilon.$$
If moreover $B$ is bounded on $S$, then $\lambda_\epsilon \to \lambda_0$ as $\epsilon \to 0$. Finally, if we suppose in addition that $B(0) = 0$ and that $B$ is Lipschitz continuous in the unit ball $U = \{ x \in H : \|x\| \leq 1 \}$ of $H$, i.e., that there exists $k > 0$ such that
$$\|B(x) - B(y)\| \leq k \|x - y\| \tag{89}$$
for $x, y \in U$, then putting
$$C = \inf_{0 < \|v\| \leq 1, \; v \in N} \frac{\langle B(v), v \rangle}{\|v\|^2}, \qquad D = \sup_{0 < \|v\| \leq 1, \; v \in N} \frac{\langle B(v), v \rangle}{\|v\|^2}$$
the following asymptotic estimate for $\lambda_\epsilon$ holds as $\epsilon \to 0^+$:
$$\lambda_0 + \epsilon C + O(\epsilon^2) \leq \lambda_\epsilon \leq \lambda_0 + \epsilon D + O(\epsilon^2). \tag{91}$$
The same estimate, with reversed inequalities, holds for $\epsilon \to 0^-$.
Remark 6.
The bounds in Equation (91) are sharp in the sense that there exist perturbing operators $B$ satisfying all the assumptions of the Theorem, and perturbed eigenvalues $\lambda_+(\epsilon), \lambda_-(\epsilon)$ of $T + \epsilon B$ that satisfy at least one of the inequalities in Equation (91) with the equality sign. To see this, just consider a linear operator $B_0$ acting in the finite-dimensional subspace $N$, and then extend it to all of $H$ by putting $B(x) = B_0(v)$ for all $x \in H$, with $v$ the orthogonal projection of $x$ onto $N$. If $B_0 : N \to N$ is taken to be self-adjoint, then it has $n$ eigenvalues (counting multiplicities) $\mu_0^1 \leq \dots \leq \mu_0^n$ with normalized eigenvectors $v_1, \dots, v_n$, say; that is, $B_0 v_i = \mu_0^i v_i$ and $\|v_i\| = 1$. Then, putting for each $i = 1, \dots, n$
$$\lambda_\epsilon = \lambda_0 + \epsilon \mu_0^i, \qquad x_\epsilon = v_i$$
we have $n$ families of eigenvalues/eigenvectors satisfying Equation (87) for all $\epsilon \in \mathbb{R}$. We have $\mu_0^i = \langle B_0 v_i, v_i \rangle = \langle B v_i, v_i \rangle$ for each $i$, and the variational characterization of the eigenvalues of $B_0$ gives in particular
$$\mu_0^1 = \inf_{0 < \|v\| \leq 1, \; v \in N} \frac{\langle B(v), v \rangle}{\|v\|^2} = C$$
and similarly $\mu_0^n = D$. Therefore taking $\lambda_-(\epsilon) = \lambda_0 + \epsilon \mu_0^1$ (respectively, $\lambda_+(\epsilon) = \lambda_0 + \epsilon \mu_0^n$), the left-hand side (respectively, the right-hand side) of Equation (91) is satisfied with equality sign and $O(\epsilon^2) = 0$.
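The construction in this remark is easy to reproduce in a small finite-dimensional model. The sketch below assumes $H = \mathbb{R}^3$, $T = \operatorname{diag}(\lambda_0, \lambda_0, 3)$ with $\lambda_0 = 1$ (so that $N$ is spanned by the first two coordinate vectors), and the hypothetical self-adjoint matrix $B_0 = \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix}$ on $N$, whose eigenvalues are $1$ and $3$; all names are illustrative. It checks that $\lambda_\epsilon = \lambda_0 + \epsilon \mu_0^i$, $x_\epsilon = v_i$ solves $T x + \epsilon B(x) = \lambda x$ exactly, for any $\epsilon$.

```python
import math

LAMBDA0 = 1.0
T_DIAG = (LAMBDA0, LAMBDA0, 3.0)      # T = diag(lambda_0, lambda_0, 3) on R^3
B0 = ((2.0, 1.0), (1.0, 2.0))         # self-adjoint operator on N = span(e1, e2)

def T(x):
    return tuple(t * xi for t, xi in zip(T_DIAG, x))

def B(x):
    """B(x) = B0(Px), with P the orthogonal projection of R^3 onto N."""
    v1, v2 = x[0], x[1]
    return (B0[0][0] * v1 + B0[0][1] * v2,
            B0[1][0] * v1 + B0[1][1] * v2,
            0.0)

def residual(eps, lam, x):
    """Norm of T x + eps*B(x) - lam*x."""
    Tx, Bx = T(x), B(x)
    return math.sqrt(sum((Tx[i] + eps * Bx[i] - lam * x[i]) ** 2 for i in range(3)))

s = 1.0 / math.sqrt(2.0)
# Pairs (eigenvalue of B0, normalized eigenvector); here C = 1 and D = 3.
families = [(1.0, (s, -s, 0.0)), (3.0, (s, s, 0.0))]

for eps in (-0.5, 0.1, 2.0):
    for mu, v in families:
        print(eps, LAMBDA0 + eps * mu, residual(eps, LAMBDA0 + eps * mu, v))
```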
The first part of Theorem 8 is proved following the track indicated in Section 2.2, that is, performing the Lyapunov–Schmidt reduction of Equation (87). One non-trivial difference is that here a global version of the Implicit Function Theorem is employed in order to obtain a mapping
$$(\delta, \epsilon, v) \mapsto w(\delta, \epsilon, v)$$
defined in an open neighborhood $Y_1 = I_1 \times J_1 \times V_1 \subset \mathbb{R} \times \mathbb{R} \times N$ of $\{0\} \times \{0\} \times S$ by the rule that the $\epsilon$-dependent complementary equation (see Equation (52)) can be solved uniquely with respect to $w$ for each given $(\delta, \epsilon, v) \in Y_1$. Moreover, $w(0, 0, v) = 0$ for any $v \in S$, and the mapping $(\delta, \epsilon, v) \mapsto w(\delta, \epsilon, v)$ of $Y_1$ into $W$ is of class $C^1$. Next, expressing $\delta$ as a $C^1$ function $\delta(\epsilon, v)$ of $(\epsilon, v)$ in a possibly smaller neighborhood $J \times V \subset J_1 \times V_1$, and putting for convenience
$$\phi(\epsilon, v) \equiv w(\delta(\epsilon, v), \epsilon, v), \quad (\epsilon, v) \in J \times V$$
we arrive at the $\epsilon$-dependent form of the bifurcation Equation (57), namely
$$\epsilon P B(v + \phi(\epsilon, v)) = \delta(\epsilon, v) v \tag{92}$$
that is here accompanied by the norm constraint
$$\|v + \phi(\epsilon, v)\|^2 = \|v\|^2 + \|\phi(\epsilon, v)\|^2 = 1. \tag{93}$$
A solution $(\lambda, x)$ of the original problem in Equation (87) will then be given by the formulae
$$\lambda = \lambda_0 + \delta(\epsilon, v), \qquad x = v + \phi(\epsilon, v).$$
In either of the two Cases (a) and (b) listed in Theorem 8, using as needed either of the methods (topological or variational) recalled in general in Section 2.2, we find for each small $\epsilon$ a solution $v_\epsilon$ of Equations (92) and (93). Therefore, making an appropriate choice of $\delta_0, \epsilon_0$ for the intervals $I_0 \equiv [-\delta_0, \delta_0]$, $J_0 \equiv [-\epsilon_0, \epsilon_0]$ and putting
$$\delta_\epsilon = \delta(\epsilon, v_\epsilon) \quad \text{and} \quad w_\epsilon = \phi(\epsilon, v_\epsilon) = w(\delta_\epsilon, \epsilon, v_\epsilon) \tag{95}$$
the first part of Theorem 8, asserting the existence of at least one solution $(\lambda_\epsilon, x_\epsilon)$ of Equation (87) with $\delta_\epsilon \in I_0$ and $x_\epsilon \in S$ for each $\epsilon \in J_0$, is proved with $\lambda_\epsilon = \lambda_0 + \delta_\epsilon$ and $x_\epsilon = v_\epsilon + w_\epsilon$.
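Some words are now in order to explain the estimates in Equation (91). One first shows that the component $w_\epsilon$ of $x_\epsilon$ (as defined in Equation (95)) satisfies $w_\epsilon \to 0$ as $\epsilon \to 0$, uniformly with respect to $v_\epsilon$, and consequently with respect to $x_\epsilon$. This in turn implies that $\lambda_\epsilon \to \lambda_0$ as $\epsilon \to 0$, uniformly with respect to $x_\epsilon$. Indeed, using Equation (92), we have
$$\epsilon P B(v_\epsilon + w_\epsilon) = \delta_\epsilon v_\epsilon$$
for all $\epsilon$, whence taking the scalar product of both members with $v_\epsilon$ (and recalling that $P v_\epsilon = v_\epsilon$), we obtain
$$\epsilon \langle B(v_\epsilon + w_\epsilon), v_\epsilon \rangle = \delta_\epsilon \|v_\epsilon\|^2. \tag{96}$$
Therefore,
$$\delta_\epsilon = \epsilon\, \frac{\langle B(x_\epsilon), v_\epsilon \rangle}{\|v_\epsilon\|^2}. \tag{97}$$
Moreover, as $x_\epsilon = v_\epsilon + w_\epsilon \in S$ and $w_\epsilon \to 0$ as indicated above, then necessarily $\|v_\epsilon\| \to 1$ as $\epsilon \to 0$. Therefore, since
$$\frac{|\langle B(x_\epsilon), v_\epsilon \rangle|}{\|v_\epsilon\|^2} \le \frac{\|B(x_\epsilon)\|}{\|v_\epsilon\|}$$
it follows, by the boundedness assumption on $B$, that the term multiplying $\epsilon$ in Equation (97) remains bounded as $\epsilon \to 0$, implying that $\delta_\epsilon = O(\epsilon)$ as $\epsilon \to 0$, uniformly with respect to $x_\epsilon$.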
We can now prove the asymptotic formula in Equation (91) on $\lambda_\epsilon$ if $B$ satisfies Equation (89). In this respect, the utility of Equation (89) is twofold. First, it allows us to improve significantly the information on $w_\epsilon$, as it yields by means of straightforward computations the estimate
$$\|w(\delta, \epsilon, v)\| \le C_1 |\epsilon| \|v\| \tag{98}$$
holding for some constant $C_1 > 0$ and all $(\delta, \epsilon, v) \in [-\delta_0, \delta_0] \times [-\epsilon_0, \epsilon_0] \times (U \cap N)$. Moreover, Equation (89) implies via the Schwarz inequality that, for any $v$ and $w$ such that $v, v + w \in U$, one has
$$|\langle B(v + w), v \rangle - \langle B(v), v \rangle| \le k \|v\| \|w\|.$$
Writing this for $w(\delta, \epsilon, v)$ and using Equation (98), we then get the inequality
$$|\langle B(v + w(\delta, \epsilon, v)), v \rangle - \langle B(v), v \rangle| \le C_2 |\epsilon| \|v\|^2$$
with $C_2 = k C_1$, valid for all the (possible) solutions $(\delta, x = v + w(\delta, \epsilon, v))$ of Equation (87) having sufficiently small $\epsilon$. Using in turn this estimate in Equation (96) for the actual solutions $(\delta_\epsilon, x_\epsilon = v_\epsilon + w_\epsilon)$ we see that as $\epsilon \to 0$
$$\delta_\epsilon \|v_\epsilon\|^2 = \epsilon \langle B(v_\epsilon + w_\epsilon), v_\epsilon \rangle = \epsilon \langle B(v_\epsilon), v_\epsilon \rangle + \|v_\epsilon\|^2 O(\epsilon^2).$$
This implies that
$$\delta_\epsilon = \epsilon\, \frac{\langle B(v_\epsilon), v_\epsilon \rangle}{\|v_\epsilon\|^2} + O(\epsilon^2)$$
as $\epsilon \to 0$, and thus for $\epsilon > 0$ yields immediately the estimate of Equation (91) in view of the definition in Equation (90) of $C$ and $D$.
Example 2.
Theorem 8 can be used to evaluate the convergence rate as $\epsilon \to 0$ of the eigenvalues $\mu_\epsilon$ of the nonlinear elliptic problem
$$\begin{cases} -\Delta u = \mu\,(u + \epsilon f(x, u)) & \text{in } \Omega \\ u = 0 & \text{on } \partial\Omega \end{cases} \tag{102}$$
near an eigenvalue $\mu_0$ of the unperturbed linear problem $-\Delta u = \mu u$ in $\Omega$, $u = 0$ on $\partial\Omega$. Here, $\Omega$ is a bounded domain in $\mathbb{R}^N$ ($N \ge 1$) with boundary $\partial\Omega$, and $\Delta = \sum_{i=1}^{N} \frac{\partial^2}{\partial x_i^2}$ is the familiar Laplace operator acting on sufficiently smooth real functions $u$ defined in $\Omega$. Under appropriate hypotheses on $f$, and assuming in particular that
$$m t^2 \le f(x, t)\, t \le M t^2 \qquad (x \in \Omega,\ t \in \mathbb{R}) \tag{103}$$
for some real constants $0 \le m \le M$, one proves that as $\epsilon \to 0^+$
$$\mu_0 - \epsilon \mu_0 M + O(\epsilon^2) \le \mu_\epsilon \le \mu_0 - \epsilon \mu_0 m + O(\epsilon^2). \tag{104}$$
These inequalities can be used for actual computation, once an efficient approximation of the linear eigenvalue $\mu_0$ is available and the bounds in Equation (103) for $f$ are known with accuracy. For instance, if $f(x, t) = f(t) = \frac{t}{1 + t^2}$, just put in Equation (104)
$$m = \inf_{t \ne 0} \frac{f(t)\, t}{t^2} = 0, \qquad M = \sup_{t \ne 0} \frac{f(t)\, t}{t^2} = 1. \tag{105}$$
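As a rough numerical companion to Example 2, the following Python sketch samples the quotient $f(t)t/t^2$ on a grid to estimate $m$ and $M$, and then evaluates the two sides of Equation (104) with the $O(\epsilon^2)$ remainders simply dropped. The helper names `estimate_m_M` and `first_order_bounds`, the sampling grid, and the sample values `mu0 = 1.0` and `eps = 0.01` are illustrative assumptions and do not come from the paper.

```python
def f(t):
    """Sample nonlinearity of Example 2: f(t) = t / (1 + t^2)."""
    return t / (1.0 + t * t)


def estimate_m_M(func, t_max=50.0, n=20001):
    """Sample func(t) * t / t^2 on a symmetric grid (skipping t = 0).

    Returns the smallest and largest sampled values. These only approximate
    the true infimum and supremum over all t != 0.
    """
    ratios = []
    for i in range(n):
        t = -t_max + 2.0 * t_max * i / (n - 1)
        if t != 0.0:
            ratios.append(func(t) * t / (t * t))
    return min(ratios), max(ratios)


def first_order_bounds(mu0, eps, m, M):
    """Both sides of Equation (104) with the O(eps^2) remainders dropped."""
    return mu0 - eps * mu0 * M, mu0 - eps * mu0 * m


m_est, M_est = estimate_m_M(f)
print("sampled m, M:", m_est, M_est)

# Hypothetical data: mu0 = 1.0 and eps = 0.01, with the exact m = 0, M = 1.
lower, upper = first_order_bounds(mu0=1.0, eps=0.01, m=0.0, M=1.0)
print("first-order bounds on mu_eps:", lower, upper)
```

On a finite grid the sampled extremes only approach the values $m = 0$ and $M = 1$ of Equation (105): the infimum is a limit as $|t| \to \infty$ and the supremum a limit as $t \to 0$, and neither is attained for $t \ne 0$.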

4. Concluding Remarks, Open Problems and Applicability

To summarize and motivate again the content of this paper, let me define it as an attempt to identify and logically re-connect (or at least give a common frame to) two important and presently distinct research areas in Mathematical Analysis and its applications that appear in the current literature under the same name of Nonlinear Eigenvalue Problems. As better explained in the Introduction, problems in these two areas are described (in abstract operator form) by the two equations
$$G(\lambda)\, x = 0 \tag{106}$$
(“problems of type G”) and
$$A(x) - \lambda C(x) = 0 \tag{107}$$
(“problems of type K”). Some basic facts and solution methods about each of the two equations are reported in Section 2. Section 3 is devoted to discussing two specific problems, one for each type, with the aim of giving samples of very recent research in either field.
While for the problem discussed in Section 3.1 there are already concrete numerical examples [8], these are still missing for the problem presented in Section 3.2 [9]. The main aim of this final section is to partially fill this gap by further commenting (in Section 4.1) on the formula
$$\lambda_0 + \epsilon C + O(\epsilon^2) \le \lambda_\epsilon \le \lambda_0 + \epsilon D + O(\epsilon^2) \tag{108}$$
proved in Theorem 8 and by finally providing, at least in a special case, a recipe ready for use in numerical simulation (Section 4.2).

4.1. Open Problems

The basic idea standing behind the formula in Equation (108) is that if the (algebraic and geometric) multiplicity $m(\lambda_0)$ of the unperturbed eigenvalue $\lambda_0$ of the linear operator $T$ in Equation (87) is equal to $m$, then upon perturbation by a small term $\epsilon B$ there are potentially $m$ eigenvalue functions $\lambda_1(\epsilon), \dots, \lambda_m(\epsilon)$ of $T + \epsilon B$ satisfying Equation (108). This is what actually happens for a linear operator $B$ (essentially under the further assumption that $T$ and $B$ are self-adjoint) as described by Rellich’s Theorem 6. Our idea is that something of this persists also for nonlinear operators that have good similarity with the linear ones: the class of Lipschitz continuous maps considered in Theorem 8 is apparently quite close to that of bounded linear maps, and in fact properly contains the latter. Indeed, the conclusions of Theorem 8 point in this direction. However, many problems remain open and we describe here three of them (in increasing order of interest and difficulty):
  • Verify on specific examples of nonlinear ODE/PDE/equations in $\mathbb{R}^n$ (by means of explicit computation or of numerical analysis) the existence of at least one “eigenvalue branch” $\lambda(\epsilon)$ satisfying the bounds in Equation (108) as predicted by the theory.
  • Verify by the same means that the bounds in Equation (108) are optimal by producing examples of nonlinear problems (in the same fields as above) where at least one eigenvalue function exists that satisfies the RHS (LHS) bound in Equation (108) with equality sign.
  • Exhibit examples of “nonlinear splitting of the multiple eigenvalue”, that is, of nonlinear problems in which starting from an unperturbed eigenvalue $\lambda_0$ of multiplicity 2 there exist two different families $\lambda_+(\epsilon), \lambda_-(\epsilon)$ respecting Equation (108), and possibly each satisfying the RHS (LHS) bound of it with equality sign.
Here is a very simple example that highlights the above issues, and the last in particular. For the linear case, these questions were answered by Remark 6.
Example 3.
Consider the system
$$\begin{cases} x + \epsilon x^3 = \lambda x \\ y + \epsilon y^3 = \lambda y. \end{cases} \tag{109}$$
In the notations of Equation (87) and of Theorem 8, we have here $H = \mathbb{R}^2$, $T = I$, $\lambda_0 = 1$ and $B(x, y) = (x^3, y^3)$. Solving Equation (109) with the constraint $x^2 + y^2 = 1$ gives the solutions
$$(x, y) = (0, \pm 1),\quad (x, y) = (\pm 1, 0),\quad \lambda = 1 + \epsilon; \qquad (x, y) = \left(\pm \tfrac{1}{\sqrt{2}}, \pm \tfrac{1}{\sqrt{2}}\right),\quad \lambda = 1 + \epsilon/2.$$
Therefore, $\lambda_0 = 1$ splits into the two eigenvalue functions (each carrying four distinct eigenvectors)
$$\lambda_+(\epsilon) = 1 + \epsilon, \qquad \lambda_-(\epsilon) = 1 + \epsilon/2.$$
This is in full agreement with Equation (108), for
$$D = \sup_{0 < \|v\| \le 1,\ v \in N} \frac{\langle B(v), v \rangle}{\|v\|^2} = \sup_{0 < x^2 + y^2 \le 1} \frac{x^4 + y^4}{x^2 + y^2} = 1$$
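and similarly, replacing “sup” with “inf” and restricting to the unit circle $x^2 + y^2 = 1$, on which the solutions lie, we find the value $1/2$, attained at $(\pm 1/\sqrt{2}, \pm 1/\sqrt{2})$. Without this restriction the quotient tends to $0$ as $(x, y) \to (0, 0)$, so for this cubic $B$ the infimum over the whole punctured ball $0 < x^2 + y^2 \le 1$ is $0$ rather than $1/2$.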
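The explicit solutions of Example 3 can be checked numerically. The Python sketch below, written under the assumption of a sample value `eps = 0.1` and with hypothetical helper names `residual` and `rayleigh_on_circle`, substitutes the listed eigenpairs into the system in Equation (109) and scans the quotient $\langle B(v), v \rangle / \|v\|^2 = x^4 + y^4$ over the unit circle, where the constraint $x^2 + y^2 = 1$ holds.

```python
import math


def residual(x, y, lam, eps):
    """Residual of x + eps*x^3 = lam*x, y + eps*y^3 = lam*y (Equation (109))."""
    return (x + eps * x**3 - lam * x, y + eps * y**3 - lam * y)


def rayleigh_on_circle(theta):
    """<B(v), v> / |v|^2 for v = (cos theta, sin theta), where |v| = 1."""
    x, y = math.cos(theta), math.sin(theta)
    return x**4 + y**4


eps = 0.1  # sample perturbation size (an assumption for illustration)
r = 1.0 / math.sqrt(2.0)

# One representative eigenvector per sign pattern, paired with its eigenvalue.
branches = [
    ((0.0, 1.0), 1.0 + eps),
    ((1.0, 0.0), 1.0 + eps),
    ((r, r), 1.0 + eps / 2.0),
    ((r, -r), 1.0 + eps / 2.0),
]
max_residual = max(
    abs(c) for (x, y), lam in branches for c in residual(x, y, lam, eps)
)

# Scan the quotient on a uniform grid of the unit circle.
values = [rayleigh_on_circle(2.0 * math.pi * k / 3600) for k in range(3600)]
q_max, q_min = max(values), min(values)

print("largest residual:", max_residual)
print("quotient on the unit circle: max =", q_max, "min =", q_min)
```

The extreme values of the quotient on the circle correspond to the two slopes of $\lambda_+(\epsilon)$ and $\lambda_-(\epsilon)$ found above.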

4.2. Applicability

As indicated in Example 2, the bounds in Equation (108) proved in Theorem 8 for the operator Equation (87) can be used for concrete nonlinear elliptic problems in a bounded domain $\Omega \subset \mathbb{R}^N$ ($N \ge 1$) such as Equation (102), or more general forms of it in which $-\Delta$ is replaced by a uniformly elliptic second order operator in divergence form. In particular for $N = 1$ this applies to the Sturm–Liouville problem
$$\begin{cases} -(p(x) u')' + q(x) u = \mu\,(u + \epsilon f(x, u)) & \text{in } ]a, b[ \\ u(a) = u(b) = 0 \end{cases} \tag{112}$$
where $p \in C^1([a, b])$, $p > 0$ and $q \in C([a, b])$. The first remark on the applicability of Theorem 8 to problems of this kind, in one or more space variables, is that one need not worry about the multiplicity of the unperturbed eigenvalue because, as recalled for instance in [9], the operator $B$ corresponding to the nonlinear term $f$ is a gradient operator, so that assumption (b) of Theorem 8 is satisfied. It is also recalled in [9] that these problems can be cast in the operator form of Equation (87) on taking as Hilbert space the Sobolev space $H_0^1(\Omega) \equiv W_0^{1,2}(\Omega)$, equipped with the scalar product
$$\langle u, v \rangle = \int_\Omega \nabla u(x) \cdot \nabla v(x)\, dx \tag{113}$$
and that the operator $B$ mentioned above is defined via the duality relation
$$\langle B(u), v \rangle = \int_\Omega f(x, u(x))\, v(x)\, dx. \tag{114}$$
The representation in Equation (114) of $B$ is the key formula to be used to gain information on the constants $C, D$ appearing in Equation (108), for these are defined by the formulae in Equation (90) that involve precisely the nonlinear Rayleigh quotient of $B$. Indeed, using Equation (114) and the fact (see e.g., [9]) that for $v \in N$ we have
$$\|v\|^2 = \int_\Omega |\nabla v(x)|^2\, dx = \mu_0 \int_\Omega v^2(x)\, dx \tag{115}$$
where $\mu_0$ is the unperturbed eigenvalue and $N$ the corresponding eigenspace, yields the following quite readable expression for $D$:
$$D = \sup_{0 < \|v\| \le 1,\ v \in N} \frac{\langle B(v), v \rangle}{\|v\|^2} = \sup_{0 < \|v\| \le 1,\ v \in N} \frac{\int_\Omega f(x, v(x))\, v(x)\, dx}{\mu_0 \int_\Omega v^2(x)\, dx} \tag{116}$$
Thus, essentially, in the applications of the theory to elliptic PDE or ODE, estimating the Rayleigh quotient of $B$ reduces to estimating the ratio appearing in the RHS of Equation (116). In turn, this can be easily obtained from pointwise bounds on $f$: for clearly if $f$ satisfies Equation (103), then it follows that for every nonzero $v \in H$ (and in fact for every nonzero $v \in L^2(\Omega)$)
$$m \le \frac{\int_\Omega f(x, v(x))\, v(x)\, dx}{\int_\Omega v^2(x)\, dx} \le M.$$
We conclude by Equation (116) and the dual formula for $C$ that
$$\frac{m}{\mu_0} \le C, \qquad D \le \frac{M}{\mu_0}. \tag{117}$$
Using the inequalities in Equations (117) and (108) and putting $\lambda_\epsilon = 1/\mu_\epsilon$ (so that $\lambda_0 = 1/\mu_0$) yields bounds on the perturbed eigenvalues $\mu_\epsilon$ of Equation (102). Considering for instance the right-hand side of Equation (108), we obtain
$$\frac{1}{\mu_\epsilon} \le \frac{1}{\mu_0} + \epsilon \frac{M}{\mu_0} + O(\epsilon^2) = \frac{1}{\mu_0} \left( 1 + \epsilon M + O(\epsilon^2) \right) \tag{118}$$
and doing the same with the lower bound thus yields
$$\frac{\mu_0}{1 + \epsilon m + O(\epsilon^2)} \ge \mu_\epsilon \ge \frac{\mu_0}{1 + \epsilon M + O(\epsilon^2)}. \tag{119}$$
Remark 7.
Note that Equation (119) contains, as it should, the equality
$$\mu_\epsilon = \frac{\mu_0}{1 + \epsilon a}$$
which plainly holds for the eigenvalues of Equation (102) in the linear case $f(x, s) = a s$, $a = \text{const}$.
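The consistency check of Remark 7 is easy to reproduce. The Python sketch below evaluates the two sides of Equation (119) with the $O(\epsilon^2)$ terms dropped, compares them with the exact linear-case eigenvalue, and also shows how far the linearized form $\mu_0(1 - \epsilon M)$ is from the rational one. The function names and the sample values `mu0 = 2.0`, `eps = 0.1`, `a = 0.5` are assumptions made for illustration only.

```python
def rational_bounds(mu0, eps, m, M):
    """(lower, upper) sides of Equation (119), O(eps^2) terms dropped, eps > 0."""
    return mu0 / (1.0 + eps * M), mu0 / (1.0 + eps * m)


def linear_case_eigenvalue(mu0, eps, a):
    """Exact perturbed eigenvalue when f(x, s) = a * s (Remark 7)."""
    return mu0 / (1.0 + eps * a)


def linearized_lower_bound(mu0, eps, M):
    """First-order expansion mu0 * (1 - eps * M) of the rational lower bound."""
    return mu0 * (1.0 - eps * M)


mu0, eps, a = 2.0, 0.1, 0.5  # hypothetical sample data

# With m = M = a both bounds collapse onto the exact linear-case eigenvalue.
tight_lower, tight_upper = rational_bounds(mu0, eps, a, a)
exact = linear_case_eigenvalue(mu0, eps, a)
print(tight_lower, exact, tight_upper)

# With cruder constants m = 0, M = 1 the exact value lies strictly in between.
loose_lower, loose_upper = rational_bounds(mu0, eps, 0.0, 1.0)
print(loose_lower, exact, loose_upper)

# Gap between the rational bound and its linearization; of size about eps^2.
gap = loose_lower - linearized_lower_bound(mu0, eps, 1.0)
print("linearization gap:", gap)
```

The last quantity reflects the identity $1/(1+x) - (1 - x) = x^2/(1+x)$, which is the source of the $O(\epsilon^2)$ remainder when passing from the rational bounds to their first-order form.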
Finally, using in Equation (119) the asymptotic relation $(1 + x)^{-1} = 1 - x + O(x^2)$ for $x \to 0$, we obtain as $\epsilon \to 0^+$ the formula in Equation (104), which as remarked is ready for use in numerical experiments once $\mu_0$, $m$ and $M$ are known. For instance, taking $f(x, s) = f(s) = \frac{s}{1 + s^2}$ and using the bounds $m = 0$, $M = 1$ (see Equation (105)) yields, for $\epsilon \to 0^+$,
$$\mu_0 - \epsilon \mu_0 + O(\epsilon^2) \le \mu_\epsilon \le \mu_0 + O(\epsilon^2). \tag{120}$$
The case of the simple eigenvalue. More information can be gained in the case that $\dim N = 1$, so that $N = \{ t\phi,\ t \in \mathbb{R} \}$ for some $\phi$ that we normalize taking $\|\phi\| = 1$. Then, the Rayleigh quotient of $B$ simplifies as
$$\frac{\langle B(v), v \rangle}{\|v\|^2} = \frac{\langle B(t\phi), t\phi \rangle}{t^2 \|\phi\|^2} = \frac{1}{t} \langle B(t\phi), \phi \rangle \equiv h(t), \qquad 0 < |t| \le 1. \tag{121}$$
Note that $h$ is bounded since $B$ is sublinear (that is, $\|B(u)\| \le k \|u\|$ for all $u \in H$) as follows from Equation (89) and the condition $B(0) = 0$. It follows by Equations (114) and (121) that
$$h(t) = \frac{1}{t} \int_\Omega f(x, t\phi(x))\, \phi(x)\, dx.$$
Considering as above the example $f(x, s) = f(s) = \frac{s}{1 + s^2}$, we obtain
$$h(t) = \int_\Omega \frac{\phi^2(x)}{1 + t^2 \phi^2(x)}\, dx$$
showing that $h$ can be extended continuously to $t = 0$ and that it is an even function of $t$ in $[-1, 1]$, decreasing in $|t|$. Therefore, using also Equation (115) and the condition $\|\phi\| = 1$, we get
$$D = \sup_{-1 \le t \le 1} h(t) = \sup_{0 \le t \le 1} h(t) = \int_\Omega \phi^2(x)\, dx = \frac{1}{\mu_0}$$
while
$$C = \inf_{-1 \le t \le 1} h(t) = \inf_{0 \le t \le 1} h(t) = \int_\Omega \frac{\phi^2(x)}{1 + \phi^2(x)}\, dx \equiv K.$$
These computations show that in the present case:
  • The upper bound $\frac{M}{\mu_0} = \frac{1}{\mu_0}$ given by Equation (117) for $D$ is optimal.
  • The lower bound $0$ given by Equation (117) for $C$ can be improved to $C = K > 0$.
Proceeding in the same way as before (see Equations (118) and (119)) and using as before the asymptotic expansion for $\frac{1}{1 + x}$ as $x \to 0$, we see that, as $\epsilon \to 0^+$, Equation (120) can be replaced by the more precise formula
$$\mu_0 - \epsilon \mu_0 + O(\epsilon^2) \le \mu_\epsilon \le \mu_0 - \epsilon \mu_0^2 K + O(\epsilon^2). \tag{125}$$
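To see what the constant $K$ looks like in a concrete case, the following Python sketch evaluates it by a composite Simpson rule. It assumes the one-dimensional setting $\Omega = \,]0, \pi[$ with $p \equiv 1$, $q \equiv 0$ and the first eigenpair $\mu_0 = 1$, $\phi(x) = \sqrt{2/\pi}\, \sin x$; the helper names `simpson` and `bounds_125` and the value `eps = 0.01` are illustrative choices, not taken from the paper.

```python
import math


def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(a + i * h)
    return total * h / 3.0


def phi(x):
    """First Dirichlet eigenfunction on ]0, pi[, normalized in the H_0^1 norm."""
    return math.sqrt(2.0 / math.pi) * math.sin(x)


mu0 = 1.0  # first eigenvalue of -u'' = mu * u on ]0, pi[, u(0) = u(pi) = 0

# Sanity check of Equation (115): mu0 * integral of phi^2 should be close to 1.
norm_check = mu0 * simpson(lambda x: phi(x) ** 2, 0.0, math.pi)

# K = integral over ]0, pi[ of phi^2 / (1 + phi^2)
K = simpson(lambda x: phi(x) ** 2 / (1.0 + phi(x) ** 2), 0.0, math.pi)


def bounds_125(mu0, eps, K):
    """Both sides of Equation (125) with the O(eps^2) remainders dropped."""
    return mu0 - eps * mu0, mu0 - eps * mu0 ** 2 * K


eps = 0.01
lower, upper = bounds_125(mu0, eps, K)
print("norm check:", norm_check)
print("K =", K)
print("first-order bounds on mu_eps:", lower, upper)
```

Since $0 < K < 1/\mu_0$, the upper side computed here is strictly below $\mu_0$, which is the improvement of Equation (125) over Equation (120).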
The practical use of Equation (125) for numerical purposes requires explicit knowledge of the eigenvalue $\mu_0$ and of the corresponding eigenfunction $\phi$. Typical cases in which these data are available are:
  • $N = 2$, $\Omega$ a rectangle or a circle, and $\mu_0$ the first eigenvalue of the Dirichlet Laplacian in $\Omega$ (see e.g., [44]).
  • $N = 1$, $\Omega = \,]a, b[$ and $\mu_0 = \mu_n$ any eigenvalue of the Sturm–Liouville problem in Equation (112) with simple forms of the coefficients $p$ and $q$. For instance, if $]a, b[ \,=\, ]0, \pi[$, $p \equiv 1$ and $q \equiv 0$ we have
    $$\mu_n = n^2, \qquad \phi_n(x) = \frac{1}{n} \sqrt{\frac{2}{\pi}}\, \sin nx \qquad (0 \le x \le \pi).$$
    As to the expression of $\phi_n$, recall that we have normed $H_0^1(a, b)$ via the formula in Equation (113), which in this case reduces to $\langle u, v \rangle = \int_0^\pi u'(x)\, v'(x)\, dx$.
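For the case $]a, b[ \,=\, ]0, \pi[$, $p \equiv 1$, $q \equiv 0$, $n = 1$ and $f(s) = s/(1 + s^2)$, one possible numerical experiment is a shooting method: integrate $-u'' = \mu (u + \epsilon f(u))$ from $u(0) = 0$ and adjust $\mu$ and the initial slope until $u(\pi) = 0$ and $\int_0^\pi u'^2\, dx = 1$. The Python sketch below does this with a hand-written RK4 integrator and a Newton iteration using a finite-difference Jacobian. The whole scheme, the names `shoot` and `solve_branch`, the step counts and the value `eps = 0.05` are assumptions made for illustration; this is not a computation reported in the paper.

```python
import math


def f(s):
    """Nonlinearity used in the text: f(s) = s / (1 + s^2)."""
    return s / (1.0 + s * s)


def shoot(mu, slope, eps, n=400):
    """RK4 for -u'' = mu*(u + eps*f(u)) on [0, pi], u(0) = 0, u'(0) = slope.

    Returns u(pi) and the integral of u'(x)^2 over [0, pi].
    """
    h = math.pi / n

    def rhs(state):
        u, du, _ = state
        return (du, -mu * (u + eps * f(u)), du * du)

    def shifted(state, k, c):
        return tuple(s + c * ki for s, ki in zip(state, k))

    state = (0.0, slope, 0.0)
    for _ in range(n):
        k1 = rhs(state)
        k2 = rhs(shifted(state, k1, 0.5 * h))
        k3 = rhs(shifted(state, k2, 0.5 * h))
        k4 = rhs(shifted(state, k3, h))
        state = tuple(s + h / 6.0 * (a + 2 * b + 2 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state[0], state[2]


def solve_branch(eps, mu=1.0, slope=math.sqrt(2.0 / math.pi), tol=1e-12):
    """Newton iteration for the two conditions u(pi) = 0 and ||u|| = 1."""
    d = 1e-6  # finite-difference step for the Jacobian
    for _ in range(30):
        g1, energy = shoot(mu, slope, eps)
        g2 = energy - 1.0
        if abs(g1) + abs(g2) < tol:
            break
        a1, a2 = shoot(mu + d, slope, eps)
        b1, b2 = shoot(mu, slope + d, eps)
        j11, j12 = (a1 - g1) / d, (b1 - g1) / d
        j21, j22 = (a2 - energy) / d, (b2 - energy) / d
        det = j11 * j22 - j12 * j21
        mu, slope = (mu - (g1 * j22 - g2 * j12) / det,
                     slope - (j11 * g2 - j21 * g1) / det)
    return mu, slope


# K = integral of phi^2 / (1 + phi^2) with phi(x) = sqrt(2/pi) * sin(x), midpoint rule
PTS = 2000
K = sum((2 / math.pi) * math.sin(x) ** 2 / (1 + (2 / math.pi) * math.sin(x) ** 2)
        for x in ((i + 0.5) * math.pi / PTS for i in range(PTS))) * math.pi / PTS

eps = 0.05
mu_eps, slope_eps = solve_branch(eps)
# Equation (125) with mu0 = 1 and the O(eps^2) remainders dropped
lower, upper = 1.0 - eps, 1.0 - eps * K
print(f"mu_eps = {mu_eps:.6f}, first-order bounds: [{lower:.6f}, {upper:.6f}]")
```

According to Equation (125), the computed `mu_eps` should lie between `lower` and `upper` up to a discrepancy of order $\epsilon^2$, so a value slightly outside the printed first-order interval is not by itself a contradiction. Repeating the run for several values of `eps` would indicate whether the discrepancy scales like $\epsilon^2$.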

Acknowledgments

The author wishes to thank the Referees for their valuable suggestions that have helped to improve the quality of the present paper.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Betcke, T.; Higham, N.J.; Mehrmann, V.; Schröder, C.; Tisseur, F. NLEVP: A collection of nonlinear eigenvalue problems. ACM Trans. Math. Softw. 2013, 39, 28.
  2. Berger, M.S. Nonlinearity and Functional Analysis; Academic Press: Cambridge, MA, USA, 1977.
  3. Ambrosetti, A.; Malchiodi, A. Nonlinear Analysis and Semilinear Elliptic Problems; Cambridge University Press: Cambridge, UK, 2007.
  4. Palais, R.S. Critical point theory and the minimax principle. In Proceedings of the Symposia in Pure Mathematics, Berkeley, CA, USA, 1–26 July 1968; American Mathematical Society: Providence, RI, USA, 1970; Volume XV, pp. 185–212.
  5. Chabrowski, J. On nonlinear eigenvalue problems. Forum Math. 1992, 4, 359–375.
  6. Turner, R.E.L. A class of nonlinear eigenvalue problems. J. Funct. Anal. 1968, 2, 297–322.
  7. Kato, T. Perturbation Theory for Linear Operators, 2nd ed.; Springer: Berlin, Germany; New York, NY, USA, 1976.
  8. Michiels, W.; Boussaada, I.; Niculescu, S.I. An explicit formula for the splitting of multiple eigenvalues for nonlinear eigenvalue problems and connections with the linearization for the delay eigenvalue problem. SIAM J. Matrix Anal. Appl. 2017, 38, 599–620.
  9. Chiappinelli, R. Approximation and convergence rate of nonlinear eigenvalues: Lipschitz perturbations of a bounded self-adjoint operator. J. Math. Anal. Appl. 2017, 455, 1720–1732.
  10. Langer, H.; Markus, A.; Matsaev, V. Linearization, factorization, and the spectral compression of a self-adjoint analytic operator function under the condition (VM). In A Panorama of Modern Operator Theory and Related Topics; Birkhäuser/Springer Basel AG: Basel, Switzerland, 2012; pp. 445–463.
  11. Appell, J.; De Pascale, E.; Vignoli, A. Nonlinear Spectral Theory; de Gruyter: Berlin, Germany, 2004.
  12. Chiappinelli, R. Surjectivity of coercive gradient operators in Hilbert space and Nonlinear Spectral Theory. Ann. Funct. Anal. 2018, 9. in press.
  13. Güttel, S.; Tisseur, F. The nonlinear eigenvalue problem. Acta Numer. 2017, 26, 1–94.
  14. Brezis, H. Functional Analysis, Sobolev Spaces and Partial Differential Equations; Springer: Berlin/Heidelberg, Germany, 2011.
  15. Taylor, A.; Lay, D. Introduction to Functional Analysis; Wiley: Hoboken, NJ, USA, 1980.
  16. Hadeler, K.P. Mehrparametrige und nichtlineare Eigenwertaufgaben. Arch. Ration. Mech. Anal. 1967, 27, 306–328.
  17. Hadeler, K.P. Variationsprinzipien bei nichtlinearen Eigenwertaufgaben. Arch. Ration. Mech. Anal. 1968, 30, 297–307.
  18. Langer, H. Über eine Klasse polynomialer Scharen selbstadjungierter Operatoren im Hilbertraum. J. Funct. Anal. 1973, 12, 13–29.
  19. Markus, A.S. Introduction to the Spectral Theory of Polynomial Operator Pencils; Translations of Mathematical Monographs, 71; American Mathematical Society: Providence, RI, USA, 1988.
  20. Binding, P.; Eschwé, D.; Langer, H. Variational principles for real eigenvalues of self-adjoint operator pencils. Integral Equ. Oper. Theory 2000, 38, 190–206.
  21. Hasanov, M. An approximation method in the variational theory of the spectrum of operator pencils. Acta Appl. Math. 2002, 71, 117–126.
  22. Voss, H. A minmax principle for nonlinear eigenproblems depending continuously on the eigenparameter. Numer. Linear Algebra Appl. 2009, 16, 899–913.
  23. Schwetlick, H.; Schreiber, K. Nonlinear Rayleigh functionals. Linear Algebra Appl. 2012, 436, 3991–4016.
  24. Gohberg, I.; Lancaster, P.; Rodman, L. Matrix Polynomials; Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers]: New York, NY, USA; London, UK, 1982.
  25. Hryniv, R.; Lancaster, P. On the perturbation of analytic matrix functions. Integral Equ. Oper. Theory 1999, 34, 325–338.
  26. Hale, J.K.; Verduyn Lunel, S.M. Introduction to Functional Differential Equations; Applied Mathematical Sciences, 99; Springer: New York, NY, USA, 1993.
  27. Gohberg, I.; Lancaster, P.; Rodman, L. Invariant Subspaces of Matrices With Applications; A Wiley-Interscience Publication; John Wiley and Sons, Inc.: New York, NY, USA, 1986.
  28. Stakgold, I. Branching of solutions of non-linear equations. SIAM Rev. 1971, 13, 289–332.
  29. Rabinowitz, P.H. Minimax Methods in Critical Point Theory With Applications to Differential Equations; CBMS Regional Conference Series Mathematics; American Mathematical Society: Providence, RI, USA, 1986; Volume 65.
  30. Krasnoselskii, M.A. Topological Methods in the Theory of Nonlinear Integral Equations; Pergamon Press: Oxford, UK, 1964.
  31. Rabinowitz, P.H. Some global results for nonlinear eigenvalue problems. J. Funct. Anal. 1971, 7, 487–513.
  32. Stuart, C.A. An introduction to bifurcation theory based on differential calculus. In Nonlinear Analysis and Mechanics: Heriot-Watt Symposium; Research Notes in Mathematics; Pitman: Totowa, NJ, USA, 1979; Volume IV, pp. 76–135.
  33. Mawhin, J.; Willem, M. Critical Point Theory and Hamiltonian Systems; Applied Mathematical Sciences, 74; Springer: New York, NY, USA, 1989.
  34. Zeidler, E. Nonlinear Functional Analysis and Its Applications. III. Variational Methods and Optimization; Springer: New York, NY, USA, 1985.
  35. Amann, H. Liusternik-Schnirelman theory and non-linear eigenvalue problems. Math. Ann. 1972, 199, 55–72.
  36. Fernandez Bonder, J.; Pinasco, J.P.; Salort, A.M. Quasilinear eigenvalues. Rev. Union Mat. Argent. 2015, 56, 1–25.
  37. Lindqvist, P. A nonlinear eigenvalue problem. In Topics in Mathematical Analysis; World Scientific Publishing: Hackensack, NJ, USA, 2008; pp. 175–203.
  38. Drábek, P. On the variational eigenvalues which are not of Ljusternik-Schnirelmann type. Abstr. Appl. Anal. 2012, 434631.
  39. Drábek, P.; Robinson, S.B. Resonance problems for the p-Laplacian. J. Funct. Anal. 1999, 169, 189–200.
  40. Appell, J.; Drábek, P.; Chiappinelli, R. (Eds.) Mini-Workshop: Nonlinear Spectral and Eigenvalue Theory with Applications to the p-Laplace Operator; Abstracts from the Mini-Workshop held 15–21 February 2004; Oberwolfach Report; Mathematisches Forschungsinstitut Oberwolfach: Oberwolfach, Germany, 2004; pp. 407–437.
  41. Rellich, F. Perturbation Theory of Eigenvalue Problems; Gordon and Breach Science Publishers: New York, NY, USA; London, UK; Paris, France, 1969.
  42. Isaacson, E.; Keller, H.B. Analysis of Numerical Methods; John Wiley and Sons, Inc.: New York, NY, USA; London, UK; Sydney, Australia, 1966.
  43. Lancaster, P.; Markus, A.S.; Zhou, F. Perturbation theory for analytic matrix functions: The semisimple case. SIAM J. Matrix Anal. Appl. 2003, 25, 606–626.
  44. Courant, R.; Hilbert, D. Methods of Mathematical Physics; Wiley: Hoboken, NJ, USA, 1953; Volume I.
