1. Introduction
After the discovery of second-harmonic generation (SHG) by Franken et al. [
1], progress in the field of nonlinear optics (NLO) increased at a rapid pace with valuable applications ranging from thin-film material characterization, nanoscale optical sensing, nanoscale engineering, and various other fields. The nonlinear effects originate from the material response to the incoming field. This response is proportional to the electrical field and its higher powers. In the case of SHG, a single incoming wavelength involving two-photon absorption results in a nonlinear response that produces light with a frequency twice the incoming light frequency. Therefore, the amplitude of the SHG frequency generated by a sample is proportional to the square of the incoming field. It soon became apparent from the early theoretical works by Armstrong [
2] and Bloembergen [
3] that the analysis of the higher-order susceptibility tensorial components is essential to understand the nature of SHG generation in inversion-symmetric material.
It is now well understood that centrosymmetric structures such as silicon (Si) materials possess the vital property of bulk inversion symmetry, where parity symmetry forbids the nonlinear dipole contribution inside the bulk [
3,
4]. However, the contribution from SHG at the surface can exist due to symmetry breaking [
5,
6]. This discovery, together with the fact that NLO enables probing surfaces in situ and nondestructively, makes it a powerful tool to study semiconductor surfaces in detail and at the nanoscale. One of the major theoretical challenges in understanding SHG from semiconductor surfaces by optical probes, e.g., by SHG rotational anisotropy (RA), lies in separating the various nonlinear contributions that may arise. For the case of noncentrosymmetric semiconductor material, the SHG bulk dipolar contribution usually dominates due to the sheer amount of bulk layers inside the material. However, when the semiconductor possesses inversion symmetry, the sources become more complex and detailed knowledge of the material symmetry is required. Because physics takes place in real time and space as well as at the nanoscale (e.g., atomic level), a description of the phenomena using quantum mechanics seems natural and more fundamental, but requires a substantial amount of complex computations and lacks a pictorial insight [
7]. Therefore, phenomenological models are sometimes combined with quantum mechanical calculations. Examples include the work of Mendoza and Mochán [
8], and later Arzata and Mendoza [
9], which exhibit interacting dipole bond model to investigate SHG from surface local-field effects. Other modeling efforts involving SHG from centrosymmetric material are mainly phenomenological, applying Green’s function formalism developed by Sipe et al. [
10,
11,
12].
The challenge, however, was the number of independent parameters required to model SHG in inversion-symmetric structure that was still complicated, even if the SH signal was only generated from the surface/interface. This resulted in the difficulty to interpret experimental data in a simple way. This issue was resolved by the development of the simplified bond hyperpolarizability model (SBHM) proposed in 2002 by Dave Aspnes through a breakthrough paper [
13], where direct products of bond vectors replace the tensorial formalism.
In this work, we show how the SBHM can produce a significant improvement over previous models using tensorial analysis. First, the SBHM approaches the nonlinear tensor components from a classical physical model rather than an abstract group theoretical calculation, which often hides the physics, thus providing a better understanding of the nonlinear phenomena at the atomic scale. Second, the SBHM relates some previously assumed to be independent tensor components obtained by GT because of the different approaches in obtaining them (through bond vector products), thus substantially reducing the number of independent parameters. For example, a remarkable achievement of the SBHM was to reproduce rotational anisotropy second-harmonic generation (RASHG) intensity experimental data from [
14] with a high precision by only three normalized complex hyperpolarizabilities and two relative phases. The result was a reduction from the previous 14 normalized tensor coefficients in [
10]. Third, because of this simplification, previous RASHG experiments can be better understood. Furthermore, the SBHM might resolve several important issues regarding the long-standing problem of determining the actual contribution of surface and bulk effects, which was previously difficult due to a large amount of fitting parameters [
10] even in highly symmetric semiconductors such as Si and gallium arsenide (GaAs). The SBHM is thus opening the way for better surface characterization and real-time surface deposition monitoring at the nanoscale. Another attractive future potential application of this work is to study tensorial phase control of nonlinear metasurfaces, which enables SHG analysis at the subwavelength level with potential application in ultrathin, free-space photonic devices for nonlinear imaging [
15,
16].
To better appreciate the novelty of the third point, we mention the work given by Peng et al. [
17], who applied four steps of the Ewald–Oseen dipole model to SHG. His work shows additional SH contributions in the form of spatial dispersion and magnetic terms. Applying the SBHM, we showed very recently that spatial dispersion contribution in Si(111), due to the incoming electric field decay, also contributes to the RASHG intensity data in nonvicinal Si(111) surfaces [
18]. This effect can be described using a third rank tensor by redefining spatial dispersion from an earlier work by Peng et al. [
17], which described spatial dispersion in the form of a fourth rank tensor. It is well known that the nonlinear polarization of the SH field depends on the material symmetry, which is described either by a third- or fourth-order nonlinear susceptibility tensor. It is interesting to study how this tensor is obtained from the SBHM and compared with the conventional group theoretical approach. Therefore, in this work, we access this tensor from both viewpoints and compare them later on. More specifically, we show that the SHG Si(111) surface part obtained from the SBHM only consists of two independent fitting parameters rather than the four independent parameters if GT is applied. Thus, the SBHM better estimates the surface to bulk SHG contribution.
However, despite the various advantages of this model previously mentioned, we also have to point out several shortcomings: (1) the SBHM is a phenomenological model that requires experimental fitting to obtain the nonlinear hyperpolarizability. Ab initio models using quantum mechanics should be applied to solve this issue but lie beyond the scope of this work. (2) This work only assumes oscillation parallel to the bond and neglects perpendicular oscillation. The latter might be significant for third-harmonic generation studies, but the exclusion of such an assumption for SHG still shows excellent agreement with experiments. (3) For complex semiconductors such as perovskite, the determination of the bond vector can be challenging, and calculations can be consuming. However, we show that many bond vectors cancel out when symmetry is considered, and simplified bond vectors can be applied instead.
An experimental comparison with the SBHM that included quadrupolar contribution from the Si bulk was performed using arbitrary input polarization to reproduce RASHG intensity in nonvicinal Si surfaces [
19]. It was later shown that these effects were related to the first-order derivatives of the microscopic response function [
20]. The contribution of spatial dispersion to SHG due to the incoming field was also incorporated very recently [
18]. Investigation using this model to study the effect of H
2 exposure on the RASHG spectra for Si surfaces was also performed using the SBHM with success [
21]. An SBHM study on the SH spectra from zincblende sample was also performed where the dominant SH contribution was indeed coming from bulk dipole radiation [
22].
For the sake of clarity, we proceed with explaining the basics of the SBHM, followed by a description of group theory. Afterwards, we explain the main results of this work, namely, comparing the third rank tensor using both approaches for the case of a Si(111) and Si(001) surface and showing how this method can also be applied to more complex structures, e.g., perovskite. We then explain how this result can be applied to determine the different surface and bulk contributions in Si and show how it can increase our understanding regarding the polar–nonpolar nature of perovskite. Finally, a brief conclusion is given at the end.
2. Simplified Bond Hyperpolarizability Model
The simplified bond hyperpolarizability model (SBHM) is a classical phenomenological theory, whose history can be traced back to Clausius–Mossotti polarizable point models. This equation relates the atomic-scale polarizability
and the volume density
n of a collection of points, which in SBHM are renamed as bonds, to the macroscopic complex dielectric function [
7]. In the SBHM, the dielectric response function is derived from the Newtonian equation of motion of a charge
. The charges oscillate harmonically and anharmonically parallel to the covalent atomic bonds. The oscillation is driven by the local incoming electric field as illustrated in
Figure 1. For the illustration, we chose a Si(001) surface, meaning that the Si surface of the film is a (001) Miller plane through the Si lattice (e.g., in the cubic lattice system, the direction (hkl) defines a vector direction normal to the surface of a particular plane or facet). For simplicity, let us consider a 1D Lorentzian equation of motion for the case of second-harmonic (SH) and third-harmonic (TH) generation [
23]:
here
is the charge position in the local coordinate system along the
direction,
is the charge equilibrium position,
is the local field experienced by the charge at the position of the atom, which is related to the macroscopic field through
. In Equation (
1),
,
, and
are, respectively, the harmonic, first anharmonic, and second anharmonic spring constant. These constants will give rise to the harmonic, SH, and TH radiation. A frictional coefficient
is introduced in the equation to account for the dissipation. Here the direction of the
j-th bond is defined by the unit vector
.
Before we proceed with finding the solution for
, it is necessary to discuss the specific assumptions in this model. Equation (
1) readily shows that each charge oscillates harmonically and parallel to the atomic bonds. The incoming electric field drives these oscillations. Thus, only the charges within the optical penetration depth radiate dipolar and quadrupolar electromagnetic waves. The total amount of dipole and quadrupole far-field radiation is thus given by the summation of all electromagnetic radiation directions [
24] that are produced by oscillating charges along each of the bonds. Here, the induced dipole oscillates under the action of an applied electric field and appropriate restoring and dissipation forces. The number of nearest neighbors determines the number of bonds. Therefore, they depend on the symmetry of the crystal. The nonlinear part of the motion is represented in terms of complex hyperpolarizabilities obtained by solving Equation (
1). According to classical physics, an accelerated charge radiates electromagnetic waves. The total far field is calculated by summation of all the fields radiated by each dipole/quadrupole. Therefore, the model extends the Ewald–Oseen theorem to nonlinear optics.
Figure 2 depicts the coordinate system used in the model.
It should be mentioned that the superpositions of all atomic dipoles driven by the external field are coherently driven dipoles, which means that the fundamental and the second/third harmonic outputs radiate in phase. The coherence is a consequence of the difference between the distance between dipoles, which is in the order of the atomic bond distance (1 nm), and the dimension of the incoming wavelength (100 nm). Hence, Huygens’ principle implies that the coherent dipole field superposition gives a constructive interference only if the outgoing angle (
) is equal to the incoming angle (
) as depicted in
Figure 2. In transmission, a coherent superposition occurs, but this time
, since the constructive angle is given by Snell’s law.
We will now derive a solution from Equation (
1) with the assumption that
can be written as an expansion:
Inserting Equation (
2) in Equation (
1) the linear
, quadratic
, and cubic
dipole polarization of the
j-th charge are given by
here
,
, and
are, respectively, the atomic-scale (microscopic) linear polarizability, first-order, and second-order nonlinear hyperpolarizability of the
j-th bond. The hyperpolarizability parameters are usually obtained from experimental fitting. Therefore, the model is usually called phenomenological, while future improvements to incorporate ab initio quantum mechanical calculations [
8] for the derivation of hyperpolarizability parameters may improve the model accuracy. The incoming
s- and
p-polarized field are expressed in terms of
and
.
The total polarization can be readily obtained by summing over all the bonds in the conventional cell, where
N is the atomic units of dipoles per unit volume (
V)
The relation of the atomic-scale hyperpolarizabilities in Equation (
6) with the macroscopic nonlinear susceptibility is straightforward
where
,
,
are, respectively, the first, second, and third-order susceptibility tensors of the system.
In RASHG experiments, the sample is often rotated along the
z-axis to produce a polar nonlinear intensity plot. In our calculation, the azimuthal angle
is measured from the
x-axis and attached to each bond direction by applying the rotation matrix
:
This readily gives us the formula for the nonlinear SH in terms of the atomic-scale nonlinear hyperpolarizability:
as well as the expression for the TH atomic-scale nonlinear susceptibility:
Equations (
3)–(
5) can be applied to obtain an expression of the far field:
Here
is the unit tensor and
is the outgoing wave vector pointing in the direction of the observer/detector. The derivation of Equation (
11) from classical electrodynamics by applying the vector potential
which satisfies the Lorentz gauge can be found in [
22]. A possible phase difference can exist between the radiated fields of different bonds. However, it can be neglected because the light wavelength
is large compared to the distance between the atomic nucleus and the bond. This work considers the case where the RASHG experiments have been performed at a single wavelength. The sample is being rotated along the
z-axis. The obtained nonlinear intensity can readily be obtained by multiplying Equation (
11) with its complex conjugate. The considered intensity are obtained for
s- and
p-polarizations of the incoming fundamental and outgoing nonlinear field resulting in four combinations labeled
-
,
,
, and
.
We focus here on the nonlinear susceptibility of the SH and in some cases TH given in Equations (
9)–(
10), respectively. The nonlinear susceptibility in the SBHM is obtained via bond vector summation rather than the standard group theoretical approach by assessing the crystal symmetry with regard to reflection and rotation. We describe in the following section the methods to obtain the nonlinear tensor via group theory.
3. Group Theory
Although the derivation of the higher-order tensor from the SBHM in Equations (
9)–(
10) is straightforward and clear, it is unfortunate that the SBHM is not widely used in the analysis of recent nonlinear optical studies. Earlier phenomenological models such as [
10,
11,
12] apply group-theoretical analysis to derive the higher-order tensor, which as we will see, may result in unnecessary fitting parameters due to a large number of independent components in the tensor. Before we describe how the SBHM can simplify the number of independent components, it is best to briefly recall how the higher-order tensors are usually obtained using group theory (GT).
In this context, group theory (GT) can be understood as a mathematical structure that can be applied to the symmetry operations that are allowed in a crystal. The symmetry operations consist of rotations and reflections and can be represented mathematically by a matrix. Certain crystals have structural properties that may be unchanged upon specific angles of rotation and some specific existing mirror planes. An ensemble of matrices that represents all the rotations and mirror planes that keep the above-mentioned properties unchanged for a particular crystal is called a symmetry group, or more precisely called a point group [
25]. Furthermore, these specific symmetry operations must preserve the crystal physical properties. This is in agreement with Neumann’s principle, or the principle of symmetry: “if a crystal is invariant with respect to certain symmetry operations, any of its physical properties must also be invariant with respect to the same symmetry operations or otherwise stated, the symmetry operations of any physical property of a crystal must include the symmetry operations of the point group of the crystal”.
To understand the matter more clearly let us recall the general macroscopic relation between the nonlinear polarization and the incoming driving field in terms of component form, which is
The first term includes the linear susceptibility denoted by subscript
i and
j. This is a 3 × 3 matrix containing nine elements. In contrast, the second is a third rank susceptibility denoted by subscripts
i,
j, and
k containing 9 × 3 = 27 elements. The third term is the fourth rank susceptibility tensor with subscripts
i,
j,
k and
l therefore consisting of 9 × 9 = 81 components. The label
i,
j,
k, and
l refer to the three Cartesian coordinates
x,
y, and
z. For clarity, an explicit representation of a general third rank tensor
is given below:
As can be seen in Equation (
13) the first index “
i” in the tensor
is related to the rows in the main matrix. Therefore, all the elements in the first row of the inner 3 × 3 matrix have
indices and for the second and third row it will be
and
, respectively. Analogously, the indices that follow after “
i”, which are “
j” and “
k”, refer to the usual way of labeling a 3 × 3 matrix, namely the rows and columns in the inner 3 × 3 matrix, respectively.
At first, analyzing nonlinear effects involving a third rank (or worse fourth rank) tensor looks pretty complicated due to the many elements involved. Fortunately, the symmetry of the crystal can reduce the amount of independent tensorial elements. The simplest form of the tensor that still fully represents the crystal symmetry group can be called the irreducible form of the tensor. In GT, an irreducible tensor can be obtained by performing all the symmetry operation that belongs to a certain crystal group on the most general tensor, such as in Equation (
13). Therefore, in GT, rather than assessing the nonzero components using bond vectors as in Equations (
9)–(
10), the latter uses the conventional symmetry analysis of the crystal structure.
Now, if a symmetry operation is applied to the crystal, the third rank tensor can be described mathematically as a linear transform through the relation
or for the case of a fourth rank tensor
where
is a matrix defining a symmetry operation, e.g., it can be in the form of a mirror/inversion operation or general rotation of an arbitrary angle. Neuman’s principle then says that if symmetry operations belonging to the point group of the crystal are performed, then the initial and final tensor of Equations (
14)–(
15) must be equal, e.g.,
.
For simplicity, let us calculate the resulting third rank susceptibility tensor for a crystal with a symmetry point group C
2v. Certain semiconductor surfaces can possess such symmetry. We apply Equation (
14) to calculate the susceptibility third rank tensor for a crystal with C
2 symmetry (inversion symmetric under 180
rotation). Equation (
9) can be readily applied where
is now the rotational matrix given in Equation (
8) and applied to the most general form of a third rank tensor in Equation (
13). Contracting the matrices with the tensor to the right side of Equation (
14) results in:
Now, for an element to be the same as its negative, the values must be zero. The final tensor thus takes the form:
It can be seen that there are 13 nonzero components in the tensor in Equation (
17). Although the form of the tensor is not similar to that in the standard GT literature [
26,
27], this discrepancy simply stems from the arbitrary choice of the initial coordinate system, where the latter uses the
y-coordinate rotation
rather than
in Equation (
8). Repeating the step for a rotation in the
y-coordinate, this value produces results similar to [
26] and its step-by-step derivation can be seen in [
28]. In the following section, we focus our analysis on the higher-order susceptibility tensor diamond-like structures such as Si, and show that the SBHM and GT are in perfect agreement. Afterwards, we discuss different semiconductors’ crystal symmetry such as zincblende and wurtzite.
5. Third Rank Tensors of Perovskite Semiconductor Structures
We now proceed with a more complex structure, namely perovskite, which is a semiconductor with ABO
3 symmetry. To the best of our knowledge this is the first time that the SBHM is applied to the perovskite structure. In this work, we focus on the room temperature methyl ammonium lead iodide (MAPbI
3) tetragonal structure. As before, we apply the SBHM to investigate the third rank tensor associated with SHG. The perovskite 3D structure under analysis is given by
Figure 5.
As can be seen in
Figure 5, the perovskite MAPbI
3 surface and bulk can be described using a different symmetry. The surface structure possesses a C
2v point group symmetry which is noncentrosymmetric whereas the bulk structure is centrosymmetric and is described by the D
4h. However, the centrosymmetry of the bulk structure is still under debate as whether such bulk structure is polar or nonpolar [
29]. If the MA ions can be neglected as an SHG source as claimed by [
30], then it can be assumed that the bulk structure made of Pb and I covalent bonds are centrosymmetric.
We now show using the tensor formalism of the SBHM that the neglect of MA ions indeed results in a fully zero third rank susceptibility tensor. The main challenge in obtaining the third rank tensor expression using the SBHM is determining how many bond vectors are required and how they are arranged. Afterwards, one can create an effective or reduced structure based on translational analysis of the bond vector. The MAPbI
3 bulk structure and its bond vector creation is given in
Figure 6. As can be seen, we apply four Pb atoms surrounded by I atoms forming a tetragonal structure. The total involved bond vectors, in this case, is 36. Inserting all the bond vectors, one readily obtains a zero third rank tensor, confirming the centrosymmetry of the MAPbI
3 bulk structure. However, such a cumbersome calculation can be avoided if we look at the reduced form of the bond vector. Using translation to merge the vectors into a centralized point vector structure, one can see, as depicted in
Figure 7 that the 26 bond vectors will exactly cancel each other out, thus producing a zero net vector. The complete expression of the perovskite bulk and surface bond vectors is avaiable as
Supplementary Materials.
We now apply the same procedure to investigate the MAPbI
3 surface symmetry. The difference between the surface and bulk lies in the absence of the four up bond vector. This breaks the centrosymmetry into a C
2v structure, as can be seen in
Figure 8. As before, we apply four Pb atomic centers for the SBHM model. From the reduced form of the bond vectors, it can be seen that the twofold rotational symmetry is readily obtained due to the four downward bond vectors. The number of bond vectors that are now involved consists of 40 bonds. Each of the bonds represents, as before, the covalent bond between the Pb and I atoms, neglecting the MA ions. Note that the number of bond vectors required is 40 or 4 more bonds than the previous 36 used to model the perovskite bulk structure.
The third rank susceptibility tensor from the SBHM can now be obtained. Using the same procedure by inserting all 40 bond vectors, the tensor takes the following form:
where
Based on Equations (
41)–(
43), we can conclude that the SBHM gives seven nonzero elements with only one fitting parameter, namely the effective Pb-I nonlinear hyperpolarizability (
).
The C
2v third rank tensor from GT can be obtained by performing the point group rotation and mirror symmetry operation as given in the previous example of finding the C
2v tensor, but now adding to the mirror symmetry operation. If this is performed, one readily obtains:
Therefore, GT requires seven nonzero elements and as in the Si(001) case requires five independent elements (
, and
) but with Kleinman symmetry, this is readily reduced to three elements. Still, the SBHM is far more simple by requiring just one fitting parameter. State-of-the-art RASHG experimental results by Frohna et al. [
30] show that there is no SHG contribution from within the bulk of inorganic halide tetragonal perovskite such as MAPbI
3. One of the implications is that the dynamic and large static Rashba effects, which come from within the bulk, cannot be attributed to the SHG source. Thus, the SHG must come from the
surface. Using the SBHM, we confirm their results. In addition, we found that the three independent fitting parameters in the GT SHG surface tensor of perovskite can be reduced to only one fitting parameter, namely the nonlinear effective perovskite hyperpolarizability (
). This result is important because it allows the future analysis of more complex perovskite surfaces.
Before we conclude, we would like to briefly discuss the accuracy of the proposed model with experimental results, particularly current results from the SHG intensity in various RASHG experiments. As mentioned earlier, the SBHM is a phenomenological model because the complex nonlinear hyperpolarizabilities must be obtained by experimental fitting. More precisely, after the nonlinear SHG tensor form is obtained using bond vector direct products in Equation (
9), the far field can be obtained through Equation (
11) and, when multiplied with its complex conjugate, it readily gives the SHG intensity that can be fitted with the experiment to obtain the hyperpolarizability parameter. Because of the complexity of the various SHG sources in nonvicinal Si, an estimate of the residual fitting in vicinal Si(111) where the surface effects dominate has been offered instead by Powel et al. [
13]. Applying the SBHM, they found using two independent complex parameters (the SHG up and down hyperpolarizabilities) a perfect fit with the experiment with a residual statistical error of only 0.0039 (e.g., see
Figure 2 in the reference). A similar analysis of SBHM on GaAs [
22] where the RASHG intensity experimental data are assumed to stem only from within the bulk also has been analyzed using the SBHM with a fitting statistical error of only 0.017, which is a good validation of the model.
6. Conclusions
We have shown that SBHM can simplify the nonlinear tensor compared to group theory. For the case of both Si(111) and Si(001) surfaces, SBHM requires only two fitting parameters namely and whereas GT with no Kleinman symmetry shows that four () and five (, and ) independent parameters are required for the Si(111) and Si(001) surfaces, respectively. For the case of perovskite, the SBHM surprisingly requires only one independent fitting parameter. This parameter is the effective perovskite hyperpolarizability (). Meanwhile, GT requires five (, and ) independent elements when modeling the tetragonal perovskite surfaces. Therefore, the SBHM provides a major simplification even if Kleinman symmetry is invoked.
Our results open the possibilities for future research directions. First, with the nonlinear tensor containing fewer independent parameters, one can start to estimate better the various bulk and surface nonlinear contributions in other semiconductors, paving the way for a better understanding of nonlinear processing at the nanoscale. Second, the SBHM calculation of perovskite can be developed further to produce the SHG intensity that can be compared with existing experiments, e.g., Frohna et al. [
30]. For the case of perovskite, this approach seems to agree with their result that the SHG comes from the perovskite surface and that the large static and dynamic Rashba effect is not responsible for the SHG intensity. Third, real-time atomic/molecular deposition can be monitored and modeled more easily with the SBHM because we can attribute various features of the SHG intensity to single atomic bonds, including the bonds connected to the deposited atoms/molecules.