1. Introduction
All X-ray diffraction theories have limits, and knowing these is crucial to interpretating the data correctly [
1]. Typically, conventional kinematical theory starts with Bragg’s law and assumes a perfect crystal [
2]. The profile shape is interpreted by including smearing functions, e.g., those due to crystal size broadening and diffuse scattering from defects. Ideally, a theory would include all known information about the X-ray photons and the full nature of the crystal under investigation. This is presently impractical since this requires intensive calculations. There is also the influence of the diffractometer to consider. The conventional kinematical theory is used in powder diffraction and single crystal structure determination, whereas the closest example of the more complete approach is used in the study of near perfect semiconductor crystals with dynamical theory. The reason for this is that the sample is very well defined and dynamical theory is a more exact description of the diffraction process. If, for example, Bragg’s law and kinematical theory were applied to semiconductor heterostructures, the derived information will contain large errors [
3], which are resolved by applying dynamical theory. However, when dynamical theory is applied to imperfect crystals it fails, although numerous authors have extended it to account for imperfections [
4,
5,
6,
7], etc. However, as will be illustrated, dynamical theory within this conventional formulation has its limits and cannot reproduce the whole of the diffraction pattern observed, even with perfect crystals.
The intensity calculated with the simpler kinematical theory and the dynamical theory are very different, unless the crystals are very small, i.e., <1 μm. This is a result of extinction, where the diffracted beam is diffracted back into the path of the incident beam, reducing the diffracted intensity and the forward incident intensity. Dynamical theory includes this effect, and kinematical theory requires a subsequent correction. Therefore, understanding the nature of the crystal is important when judging which theory to use.
Suppose we have a perfect crystal, a perfect instrument and apply conventional two-beam dynamical theory (one incident beam and one diffracted beam [
8]) but use it over a large angular range or in extreme geometries, e.g., when the incident or diffracted beam is close to the surface, then the theory will fail to reproduce the experimental profiles [
1]. This is because the diffraction from one reflection can interfere with another. By introducing a specular beam (surface reflection), the diffraction profile is more complete; see
Figure 1 (effectively including all four tie points (intersections) of the dispersion surface in the two-beam theory). This improvement alone is still insufficient to produce the correct profile along the <001> line of reflections from planes parallel to the surface. The two-beam dynamical theory only generates one reflection and requires as many beams as there are reflections to account for all the interferences in this profile. Introducing more reflections requires three-beam, four-beam, five-beam, etc., dynamical theory. These calculations become completely unwieldy [
9] because they require the solution of n interacting dispersion surfaces to include n-beams. There is, though, an underlying problem in that, however generalized this dynamical theory becomes, it uses the structure factors for a discrete set of reciprocal lattice vectors. A kinematical theory can overcome this, but it cannot account for extinction and the peak intensities are incorrect. These examples indicate the limits of the conventional dynamical theory [
5,
6,
8,
10,
11,
12], etc. However, it is widely used and very useful within its limits, i.e., in the vicinity of one reflection. The two-beam (with two-tie points) dynamical theory is available commercially.
2. An Alternative Dynamical Diffraction Description
Suppose the calculation is considered in a different way, by using the well-established Fresnel equations [
13]. This approach generates all the transmitted and scattered waves and their interferences in a crystal, and is therefore fully dynamical. It removes Bragg’s law and the dispersion surfaces and most of limitations of conventional dynamical theory [
14]. The crystal is modelled as many very thin lamellae of constant scattering ability, which are parallel to the crystal planes (and in the most straightforward case, parallel to the surface). These lamellae have a thickness that is a small fraction of the repeat distance, or unit cell (determined by trial and error), and the scattering ability is averaged laterally. This removes the discrete nature of the structure factor because the scattering factor is included as the profile of the electron distribution through the whole crystal.
The Fresnel formula was applied to each lamella, with the scattering from each being used as the input for the adjacent lamellae. The full diffraction profile is built by including all the lamellae throughout the crystal; see
Figure 2. The whole diffraction profile is complete, including all diffraction orders. The calculation time takes longer than the commercially available two-beam dynamical diffraction calculations, but with optimisation it should be comparable, i.e., within seconds, depending on the complexity of the structure.
This Fresnel approach has been applied to the
hhh diffraction from a 111 surface orientated Si crystal [
15]. The 222 diffraction peak appears and does not require any assumptions about asymmetric bonding, as suggested by Bragg [
16]. The conventional (textbook) derivation of the structure factor would require the 222 intensity to be zero, which has led to a large field in determining the bonding asymmetry in crystals. This more complete theory based on the Fresnel formula suggests that these interpretations may be in error, unless a more complete theory is used as the base before interpreting deviations.
This Fresnel approach is computationally cumbersome, especially when deriving the scattering from inclined planes [
14], but it has been used to predict some very subtle diffraction effects that have been observed experimentally [
1], e.g., Aufhellung and Umweganregung [
17]. It has also been used to simulate defects by modifying the average lateral scattering ability.
It appears, therefore, that this Fresnel dynamical diffraction theory is more complete for modelling perfect crystals compared with conventional dynamical theory. Despite this improvement, there are persistent diffraction features that are still not accommodated. These are revealed in experiments using very high resolution Reciprocal Space Mapping [
18,
19]. This mapping indicates intensity streaking close to the Bragg scattering angle 2
θB over large crystal plane rotations away from the Bragg condition; see
Figure 3a. A 2
θ profile scan reveals this as a peak (see
Figure 3b and [
20]), a feature that is easily overlooked. None of the theories above explain this streak or peak of intensity.
3. Explaining the Diffraction Streak
The theories described above assume that the crystal is a perfect array or a time-averaged structure. The kinematical theory starts with point scatterers, dynamical theory with a polarizable scattering medium that responds to an electromagnetic wave, and the Fresnel approach assumes a laterally averaged refractive index over a small thickness. They all assume that the crystal is perfect or that diffraction comes from an average crystal configuration.
It is important to consider how a crystal appears to a photon travelling at the speed of light. Each atom (~0.1 nm) will be sampled in ~3 × 10
−19 s, but because all atoms vibrate with a much longer time-period (e.g., ~10
−13 s to ~10
−14 s [
21]) they will appear stationary to a photon and displaced from their averaged positions. Atom vibrations are unavoidable. The diffraction from a real crystal will, therefore, be the average diffraction pattern from all the photons, not the diffraction pattern from a crystal with the average atomic positions.
In conventional theories, the effects of thermal vibrations and defects are incorporated as a perturbation on the average perfect structure, which is clearly an inadequate approximation for vibrations that should be included before any calculation. This has consequences; for example, if an observed peak is labelled as a Bragg peak when it is not, because it arises from a streak, then the subsequent analysis will start to go astray (see
Figure 3b).
The origin of this diffraction streak can be understood by reverting to a simple model. Take a hypothetical crystal with two planes, and only one atom on each plane A and P; see
Figure 4. An incident beam, I, will be scattered radially from each point atom to form spherical waves [
22]. The different path lengths along any given direction will result in phase differences and interference with a series of streaks of maximum intensity; see
Figure 5a. The detector is assumed to be at a large distance compared to the atom separation, and the far-field or Fraunhofer diffraction theory is applicable [
13]. The intensity is plotted with coordinates associated with the incident angle to the planes and the scattering angle (deviation from the incident beam direction). The important point to note is that the streak of intensity stays close to 2
θB over a large range of incident angles. The higher angle streak corresponds to the second diffraction order, where the path difference corresponds to two wavelengths, 2λ.
By increasing the number of atoms to form a column (on many repeated planes), the streaks narrow; see
Figure 5b. If more of these columns are combined by including more atoms on each plane, the streaking is suppressed. For infinite sized planes the streak disappears, which can be explained by the small shift in phase from column to column (due to the rotation) until one can be paired to the original that is exactly out of phase and their contributions cancel out. A finite size crystal will have interference features that correspond to the Fourier transform of the shape [
23], which will not generally have streaks close to 2
θB. However, this is the interpretation from a perfect array but not a real crystal. Any imperfections, unavoidable atom vibrations, etc., will weaken the phase cancellation, and the streaks, as in
Figure 5, will begin to reappear.
The experimental evidence in
Figure 3a can be explained by this description. Further support for this explanation comes from experiments using a multi-wavelength source, because the streak will simultaneously scatter for each wavelength at a fixed incident beam angle, as in
Figure 3b [
20]. Within the conventional description, only a single wavelength can be scattered at a single incident angle. This new theory indicates that a single incident beam will not only show peaks where a detector scan (2
θ) intersects the expected shape interference pattern, but in addition will show a peak close to 2
θB [
15,
20].
If the crystal is imperfect, e.g., if the crystal planes are not perfectly flat, then for a given incident beam direction there will be regions that satisfy the Bragg condition and scatter at 2
θB and some that do not, but with scattered intensity close to 2
θB. As the imperfections increase, a smaller proportion of the scattering can satisfy the Bragg condition at any given incident beam direction, which reduces the extinction effects in proportion and the diffracted intensity can be approximated by kinematical theory [
15]. This is also compounded by the instrument used to collect the data.
A single crystal will have a diffraction width of the order of 5 to 10 s of arc (depending on various factors), and the beam divergence experienced by each point on a crystal plane will be typically 80 s of arc (i.e., assuming a slit-collimator, and a 200 μm source at 0.5 m). This results in ~10% of the incident beam within this divergence angle that satisfies the Bragg condition, and the remainder will be contributing to non-Bragg intensity of a varying magnitude. The Bragg condition will be stronger than non-Bragg intensities. This argument indicates that the ratio of non-Bragg to Bragg contributions is modified by the instrument.
Since the conventional description requires the observed peak intensity to come only from the Bragg condition, it must follow dynamical theory. For most crystals, kinematical theory applies and to explain this the crystals are assumed to be ideally imperfect and consists of mosaic blocks [
8,
24]. These blocks must be ~<1 μm to follow kinematical theory. However, there is little evidence that the scattering from every crystal that follows kinematical theory has this specific microstructure. If, though, there are regions not in the Bragg condition due to local curvature from defects, there are still non-Bragg contributions that will contribute intensity towards 2
θB. When this becomes a significant proportion of the scattering, the intensity will follow kinematical theory rather than dynamical theory. Point defects and dislocations are omnipresent in crystals, whereas mosaic blocks are not.
4. Simulating the Diffraction Streak
Consider a crystal in more detail, or more specifically a crystal that an X-ray photon experiences. The parameters of a photon, including wavelength dispersion and length, are determined by its generation and, in the case of a laboratory source, the electron transition between energy levels in the target material. There is an uncertainty at the individual levels, and this gives an energy spread in the emitted photon ΔE, which in turn gives the length of the photon Δx (~Eλ/ΔE) and a wavelength spread Δλ [
25]. The length of a photon gives rise to a ‘coherence’ length (the maximum path difference that a photon can interfere with itself, and it can only interfere with itself). Typically, for Cu
Kα1 radiation this coherence length is ~2 μm. This is significant when examining crystals larger than a few microns. As the scattering angle increases, the path difference between points of scatter decreases, and therefore the size of the regions that scatter coherently increases above the coherence length, i.e., 2
θ = 0, by 1/cos2
θ. The diffractometer geometry can change the coherence length, e.g., non-dispersive geometry can reduce the overall path difference to give more overlap of the photon paths, which is comparable to increasing the coherence length.
It is clearly prohibitive to model the diffraction at a fundamental level because of the shear complexity, but an indication can be obtained by representing atoms as point scatterers and a simple geometrical approach. Since the crystal must be small to have a manageable number of atoms even in this simple form, the influence of size effects can dominate (~0.1 μm in this case, with 1 nm
3 unit cells). To isolate the influence of the features of interest, the model structure was chosen to be an octagonal crystal with all the facets facing out of the scattering plane and stationary atoms. Each row of atoms was also varied randomly by 5% to further reduce the dominance of the size effects. This gave a reference diffraction pattern from a crystal modelled in the conventional way; see
Figure 6a. The diffraction pattern still has fringing normal to the facets (which fade more rapidly when they are out of the scattering plane).
When atomic vibrations are included, ~0.005 nm, the pattern changes significantly; see
Figure 6b. These small random atom displacements produce an additional streak of intensity close to 2
θB. In a real experiment, the diffraction pattern will be the sum of all the diffraction patterns from each photon used to collect the data. Calculated diffraction patterns from these atom vibrations all have the same broad characteristics.
6. Concluding Remarks
The conventional kinematical and dynamical theories account for most features in a diffraction pattern. However, there are clear limits with dynamical theory away from the Bragg condition and questionable statistics associated with kinematical theory, e.g., the number and reliability of the peaks observed in powder diffraction, and the Bragg peaks in single crystal studies follow kinematical theory rather than dynamical theory, which is required at the Bragg condition.
There exists subtle streaking close to the Bragg scattering angle that is neither addressed nor can be accommodated in conventional theories and requires a physical explanation. This streak is always present, but weak, e.g., ~10−3 to ~10−5 of the peak intensity in perfect crystals.
This streak can be explained by considering that each X-ray photon forms a diffraction snapshot of a crystal. The photon samples the atoms when they are distributed about their average positions through thermal vibrations. That is, the experimental observations are averages of the snapshots, NOT the average of the atom positions. Each snapshot no longer occurs from a perfect array, which in turn prevents the phase-cancellation of waves generated outside the Bragg condition. The effect is subtle, but profound.
The detector will intersect this streak and register weak peaks appearing close to the Bragg scattering angle, which can be remote from the Bragg condition. These peaks will be additive and create measurable intensity in powder diffraction scans. In an imperfect single crystal, the planes will not be perfectly flat and therefore the incident beam will scatter towards 2θB from regions that satisfy the Bragg condition and from regions not in the Bragg condition. Although the latter may be considerably weaker than the former, as the proportion of the non-Bragg diffraction increases compared to the Bragg condition diffraction, the intensity will change from dynamical to kinematical in nature.