1. Introduction
A vast spectrum of scientific and engineering fields employ and benefit from increasingly more complex artificial intelligence (AI) algorithms, which in turn drive the demand for faster and more energy-efficient computational hardware. Brain-inspired, neuro-morphic, hardware architectures are being investigated [
1,
2], among which photonic implementations are studied for their ultra-fast and parallel processing capabilities [
3]. Spiking neural network (SNN) hardware has attracted a lot of interest for its similarity with information processing (neural behavior) in the human brain, which is the most energy efficient neural network known and where data are encoded using short electrical signals or spikes [
4]. It has been shown that photonic implementations can result in spiking neurons employing optical pulses at a much shorter time scale [
5,
6]. The spikes in the human brain are digital in amplitude but analogue in time, generated by individual neurons and transmitted through axons [
7]. In [
4], an extensive comparison between the human brain and digital computing in terms of energy efficiency is given.
The principles of a biological SNN can be transferred to integrated photonics due to the spiking capabilities of semiconductor lasers [
5,
8,
9]. Integrated photonic SNNs benefit from high switching speed, high communication bandwidth, low crosstalk [
5,
9] and temporal characteristics governed by ultra-fast carrier dynamics [
10,
11]. This results in the operation of an optical neuron orders of magnitude faster than its biological counterpart. These semiconductor lasers can be the building block in an all-optical SNN on a photonic integrated circuit (PIC).
One successful realization of a photonic spiking neuron is by using an integrated two-section Fabry–Pérot-type (FP) semiconductor laser, where one section operates as the gain and the other as saturable absorber. This configuration is known to exhibit, apart from self-pulsations and CW operation, a form of excitability when operating near, but below threshold [
12,
13,
14]. In excitable conditions, the laser emits a short optical pulse when triggered by an optical input pulse of sufficiently large energy and thus can operate as an artificial neuron. We will demonstrate, both experimentally [
15] and in simulations, that near the lasing threshold, such an optical trigger can also be caused by a sufficiently large positive intensity fluctuation due to spontaneous-emission noise. As this “spontaneous” emission of output pulses is a source of errors in the functionality of this artificial neuron, it is relevant to investigate this phenomenon in more detail as to its statistics, which is the purpose of the present study.
A semiconductor laser with saturable absorber is accurately described by the well-known Yamada model [
12,
16,
17], which will be used in this paper to investigate the effects of optical noise on the laser’s self-spiking behaviour and in simulations. By interpreting the noise-triggered self-pulsations in the excitability parameter region, in terms of first-passage events for the intensity, we derive analytic approximations for the self-pulsation timing statistics, which can be compared with simulated and observed timing statistics [
15].
2. Device under Study and Observations
The gain and saturable absorber integrated laser was fabricated in a commercially available active–passive multi-project wafer (MPW) InP integration platform [
18]. The device basically consists of two electrically isolated semiconductor optical amplifiers (SOAs), one for the gain and the other for the saturable absorption, between two mirrors. The output of the laser is coupled to the edge of the chip, where the emitted light is collected using lensed fibers. A schematic of the two-section laser and a photograph of one of the fabricated devices can be seen in
Figure 1. The measurement set up and method are described in [
15].
The linear cavity contains an 80 μm saturable absorber, 500 μm gain section and a 500 μm phase shifter. One mirror is a 350 μm distributed Bragg reflector (DBR), optimized for reflections at 1550 nm, and the other a multimode interference reflector (MIR), with estimated reflectivities of 0.76 and 0.40, respectively. The electrical isolation sections are 30 μm in length. In the measurement setup, an optical isolator prevents back reflections from the measurement equipment to the chip (see [
15]).
The gain and absorber currents are set to values for which simulations indicate the existence of the excitability regime (see [
15]). This corresponds to laser operation just below threshold, which for our two-section laser is 50.54 mA at the saturable-absorber voltage of 0.72 V. To investigate the emission of output pulses in the excitability regime under the influence of spontaneous-emission noise, the absorption is decreased, so that the laser operates even closer to, but still below, threshold and the relative intensity fluctuations become large enough to have a measurable probability to overcome the excitability threshold. The noise-triggered pulses are observed first when the gain section is biased at 50.11 mA and the absorber voltage at 0.720 V. For this setting, the laser starts to generate pulses, without any optical injection. By increasing the absorber voltage slightly, the absorption decreases and the pulse density increases. In
Figure 2, two examples of measured time traces of the laser output at absorber voltages of 0.727 V and 0.730 V are given. These pulses do not occur at a fixed repetition rate, as would be expected for a self-pulsating laser, but occur in a random manner and rather are triggered by spontaneous-emission noise.
Figure 3 shows a histogram of observed consecutive pulse timings for an injection current of 50.12 mA for the gain section and voltage 1.38 V for the absorber. The time unit is
with
the sampling rate of the oscilloscope. The pulse-repetition time distribution is characterized by an initial time interval without pulses, a very steep initial flank and a maximum, followed by a slower decaying tail. We interpret the time at which the pulse-timing distribution starts as the refractory time
associated with the excitability at hand [
12]. The skew distribution will be analyzed and explained in terms of a first-passage-time distribution in
Section 3. Simulations based on the Yamada model (see next section) give rise to pulse timing histograms, that will be shown in
Section 4 (see also Figure 13 in [
15]), which are similar to the histogram in
Figure 3. In the next section, we will derive respective asymptotic expressions for short-time and long-time pulse-timing distributions in terms of first-passage times and explain the typical form of the pulse-timing distribution.
3. Theoretical Description
The theoretical analysis and simulations are based on the Yamada model [
12,
16,
17],
where all variables and parameters are dimensionless and their meanings summarized in
Table 1. The fluctuating Langevin noise term
describes the effect of spontaneous emission events on the intensity dynamics and it has the correlation properties [
19]
where <…> denotes the mean value. In fact,
,
and
are stochastic variables and their values given by probability distributions. Denoting the probability distribution for
at time
by
, it can be shown that
is a solution of a Fokker–Planck type diffusion equation [
20]
subject to adequate boundary conditions, and where
and
are the diffusion coefficient and drift, respectively, and related to (3) and (4) by
To obtain an explicit expression for
in terms of
, we apply the adiabatic-following solution for
and
, by equating (1) and (2) to 0. This yields
and
, so that in this adiabatic approximation, we find
If we then express the stationary solution of (5) as
with
a constant normalizing
to unity, that is,
and
the potential that can be associated with the dynamical system (1), (2) and (3) (in the adiabatic approximation), Equation (5), with the right-hand side set to 0, leads to
Solving (10) for
, we find
where we fixed the integration constant such that
. Hence, we can write
In
Figure 4, an example is shown for
,
and the corresponding stationary intensity distribution
. For this value of the spontaneous-emission rate (
), the intensity is nearly Gaussian distributed with mean <
> = 1.72 and FWHM = 0.44.
Now, we consider the situation where the intensity fluctuations are driven by the diffusion coefficient (6) in the force field described by the drift (7). We are interested in the time it takes for the intensity, initially at , to hit the excitability threshold value for the first time, where is the most probable intensity, <>. When this happens, a response pulse will be emitted by the laser. It then takes one refractory period before the laser is ready for the next excitation. This time , the first-passage time (FPT), is obviously a stochastic quantity itself and thus given by a distribution, the first-passage-time density (FPTD) ) for the intensity to diffuse from to . The pulse timing distribution is then given by when and ) for . Strictly speaking, since the refractory time follows a distribution itself, the final pulse repetition statistics is a convolution of the FPTD and the refractory-time distribution. As we have not found an analytical solution for the refractory-time distribution, we will treat as a fixed given quantity.
It is shown in [
20] that the FPTD can be expressed entirely in terms of the stationary intensity distribution
. The main general results will be repeated here; the full details can be found in [
20]. First of all, we introduce the moment-generating function
with
.
is a Laplace transform with negative argument
and can be derived from a function
such that
Introducing another function
and expressing
as a power series in
,
the following result is derived in [
20]:
The problem of determining
is, in principle, solved now, apart from the actual calculation of the integrals. It follows directly from definition (13) that the mean first-passage time from
to
is given by
By repeatedly using (18), the moments can be expressed in multiple integrals, the number of which rapidly increases with increasing order .
So far, (13) to (19) are exact and generally valid. In order to derive an explicit expression for the FPTD, we proceed by making some approximations, based on the observation that the function
has two sharp maxima on the interval
, one at
, the other at
, if
. Thus, for
, we have
implying that
for all
and
where the last equality follows from (19). An approximated expression for
when
is derived in
Appendix A with the result
where
is a characteristic time related to the first passage from
to
, defined as
Although may be of the same order of magnitude as the average first-passage time (19), it should not be confused with that.
By inverse Laplace transform, (22) with
leads to the following explicit analytic form for the FPTD:
where
is the modified Bessel function of the first kind. It can easily be checked that (24) is not valid for short times. In fact, for
, (24) yields
(see [
21] (p. 375)), whereas on physical grounds,
should vanish there. Another argument why (24) is an asymptotic expression for large
is given in
Appendix A. There, it is argued that the approximation leading to (22) can only be accurate for small
, implying large
.
For the short-time behavior, we assume that the fluctuating intensity near the mean value
behaves under the influence of the spontaneous emission noise as a random walk, with fixed diffusion coefficient
, flat potential
and zero drift,
. For such a case, the probability density for the first-passage time from initial intensity
to the absorption point at
in this random-walk approximation is given by [
22,
23]
which has a maximum for
, the most likely first-passage time.
It should be realized that the random-walk approximation for the first-passage time can be correct only for small
but for large
, the tail of the distribution will approach zero faster. This is a consequence of the potential not being flat (as in the random walk), but forming a hill slope upward for intensities near the threshold intensity
. Therefore, the probability for a first passage at larger times
will be smaller than in the random-walk case. Indeed, it can be shown that for large
, the tail of the distribution in (24) behaves asymptotically as [
21]
In
Figure 5a, the two approximations for
are depicted. The yellow curve is the large-
approximation (24) for
with
,
and,
, while the green curve is the random-walk approximation (25). These data pertain to the situation of
Figure 4. The two curves connect around
. This can be seen more convincingly in the logarithmic plots of
Figure 5b, which also illustrates that the long-time tail of the distribution decreases faster than exponentially (see (26)), i.e., much faster than the random-walk approximation.
We have applied the asymptotic expressions (24) and (25) to the histogram of measured data (
Figure 2), the result of which is depicted in
Figure 6. The green curve is the long-time approximation (24), the yellow curve—the short-time approximation (25).
5. Discussion
We have analyzed and explained the emission of irregularly timed optical pulses from a two-section semiconductor laser with saturable absorber, operating near threshold in a regime of excitability. It is shown that the pulses are triggered by the relatively large spontaneous-emission intensity noise. When such a laser system is used as a neuron, this “spontaneous” emission of output pulses is a source of quasi-random spikes following the derived distribution. It could be employed to simulate random spiking in biological neurons [
24], because this phenomenon has similarities to the stochastic behavior of neurons being studied in neuroscience [
25,
26,
27], with models based on the Fokker–Planck-type equation. This work contributes to the prospect that stochastic neural networks can be simulated with photonics and that problems such as random number generation [
28], tunable rate encoding for SNNs [
29] or analysis of network population dynamics [
30] can be tackled in the future. We have investigated this phenomenon in more detail by focusing on the statistics of the spontaneously emitted pulses.
The numerical simulations and analytical theory are based on the Yamada model for a single-mode semiconductor laser with saturable absorber. By using this model, successful reproduction of observed trends in pulse statistics was obtained in [
15]. The observed irregular pulse trains are analyzed in terms of an initial refractory time, whereafter the time interval until the next emitted pulse follows a first-passage-time distribution, for which analytic asymptotic short-time and long-time approximations are derived, which very well explain measured and simulated pulse-repetition time distributions. This provides physical insight into the noise-triggering mechanism.
The noise in the system is mostly dominated by spontaneous emission and its coupling into the lasing mode can be tailored through technology choices, such as the choice of index or gain guiding, variation of the active geometry and the choice of gain material. Hence, the amount of noise can be altered if deterministic spiking is desired.
An interesting research question for further investigation is the observation of coherence resonance [
12] in the device studied here. The relevant quantity to investigate then is the normalized jitter
/<T>, as a function of the absorber voltage
, where
<T> is the mean pulse-repetition time.
Finally, there are major challenges in moving from a single photonic spiking neuron towards a spiking network on-chip. The output of a single laser neuron needs to be able to trigger another subsequent laser neuron, making it cascadable. This depends on the interconnection architecture, on-chip losses and the required excitation pulse energy. Another challenge is to address on-chip spurious reflections, originating from interfaces and other components in a photonic neural network, prompting for more research in active laser feedback compensation techniques. The third challenge is related to the accurate control of many interconnected laser neurons. This requires high-density electrical interconnects, control electronics and optical monitoring functions to be co-integrated with the photonic chip. Furthermore, low-energy components that perform synaptic weighting on the photonic chip are subject of active research and required in an all-optical SNN.