1. Introduction
Spherical harmonics (SHs) are one of the fundamental approaches for spherical microphone array processing [
1,
2,
3,
4]. They have been widely adopted in various acoustic tasks, including beamforming [
5,
6], source localization [
7,
8,
9,
10], noise cancellation [
11,
12,
13,
14], and sound field recording [
15]. However, in SH-based methods, implementing the required spherical microphone array can be challenging, especially in applications such as attaching these arrays to drones for whole-sphere sampling. To address the whole sphere sampling challenge, an orthonormal function set was developed for an accurate representation of the pressure over an arbitrary spherical sector region, named spherical sector harmonics (SSHs) [
16]. Although SSHs have the potential to be implemented in various audio and acoustic problems, a thorough investigation of this needs to be included. In this paper, we conduct comprehensive investigations of SSHs and reveal three main limitations of them, offering valuable insights for their future implementation.
SHs can decompose the sound field into angular-dependent orthonormal functions and corresponding coefficients, allowing for the analysis of a spatial sound field with a finite number of SH coefficients. The concept of using an open spherical microphone array to capture high-order sound fields was first introduced by Abhayapala and Ward [
1]. Subsequently, Meyer and Elko developed a microphone array over a rigid sphere to implement this approach [
2]. Further advancements in the design and analysis of spherical arrays were made by Rafaely [
3] and Duraiswami [
17], with Rafaely later proposing a dual-sphere microphone array [
4]. Although SH-based spherical array processing has greatly benefited spatial audio and acoustics, the requirement for whole-sphere sampling poses challenges for implementation in certain applications.
McCormack et al. [
18] developed a method for sound field visualization based on sector design. To approximate the directivity pattern with data available only within a limited range of directions, Zotter and Pomberger introduced the concept of spherical Slepian functions [
19]. These functions form an orthogonal basis specifically designed for the restricted directional range of the sphere, particularly for rotationally symmetric regions such as spherical caps or segments [
19,
20]. While spherical Slepian functions enable the decomposition and reconstruction of the sound field within a spherical cap or segment, they require frequency-dependent matrix inversion and may introduce significant errors for sources outside the restricted range [
20]. By solving the Helmholtz equation, with sound-soft or sound-hard boundary conditions, the spherical cap and segment harmonics were proposed in [
20,
21] and have been validated by microphone array prototypes [
22,
23]. As the spherical cap and segment harmonics satisfy the Helmholtz equation, they can be implemented to interpolate and extrapolate the sound field. However, to apply the spherical cap and segment harmonics, the target region has to be bounded by a sound-soft or sound-hard boundary, limiting its broader implementation.
Further addressing the need for microphone processing over arbitrary spherical sector regions, Kumari and Kumar developed SSHs that decompose and represent the sound pressure over a spherical sector region [
16]. This advancement reduces the sampling requirements, simplifies microphone array construction, and expands its application to active noise control (ANC) [
24], beamforming [
25,
26], and sound source localization [
25,
27]. In our previous work, we implemented SSHs in a spherical sector sound field extrapolation problem [
28]. Compared to effectively representing sound pressure over a spherical sector region, SSHs do not require the sound-soft or sound-hard boundary to bind the spherical sector region; therefore, it can potentially be applied to broader applications.
While SSHs have demonstrated potential in spatial audio applications, a thorough investigation of them needs to be conducted. In this paper, we delve into the properties and performance of SSHs, uncovering their limitations and offering valuable insights for their future implementation. Specifically, we investigate and prove the following limitations of SSHs: (i) SSHs are not solutions of the Helmholtz equation due to the shifting and scaling of the input arguments for the associated Legendre polynomials and exponential function. (ii) The spherical Hankel and Bessel functions are not the corresponding radial functions for SSHs to perform radial extrapolation. (iii) There is severe distortion around the edge of the sector region when using SSHs for sound field reconstruction. Our proof shows that SSHs are better suited for decomposing and processing the sound field within the measured sector region without any radial extrapolation. Additionally, the measurement sector region should be larger than the target sector region to mitigate distortion.
The main contribution of this paper is a comprehensive investigation of the performance and limitations of SSHs. This study will help future users identify suitable applications for SSHs and those that are not appropriate.
The structure of this paper is as follows:
Section 2 introduces the background theory of SSHs and discusses its three main limitations.
Section 3 provides a mathematical proof showing that SSHs do not satisfy the Helmholtz equation. In
Section 4, we further demonstrate that the spherical Hankel and Bessel functions are unsuitable for radius extrapolation in SSHs.
Section 5 examines the distortion issues arising from these inappropriate functions.
Section 6 highlights the limitations of SSHs and offers insights for implementation. Finally,
Section 7 concludes the paper.
2. Spherical Sector Harmonics
This section introduces the background theory of SHs and SSHs and highlights the differences between them. As illustrated in
Figure 1, spherical coordinates
are used to specify the position of a point, where
r represents the radial distance from the origin,
is the polar angle between the radial line and the positive z-axis, and
is the azimuthal angle, measured as the counterclockwise rotation of the radial line around the positive x-axis on the xy-plane [
29].
A sound field over a spherical region can be decomposed by SHs to
where
is the wave number with
f as the frequency and
c as the speed of sound propagation;
is the radial independent SH coefficient;
is the radial function, which is the solution of the Helmholtz equation in the radial direction for the spherical coordinate;
(
is the ceiling operation) [
30] is the truncation order of SH decomposition; and
is the spherical harmonics of
n-th order and
m-th degree, defined by
where
is the associated Legendre polynomial of
n-th order and
m-th degree, and
is the exponential function of
m-th degree.
For the interior case, where the source is located outside the sphere,
is the first kind of spherical Bessel function of the
n-th order; for the exterior case, where the source is inside the sphere,
is the spherical Hankel function of the second kind of
n-th order. Generally, the sound field can be a combination of these two cases with distinct radial-independent SH coefficients for the interior and exterior parts [
31].
Similarly, a sound field over a spherical sector region (
,
) can be decomposed by SSHs to [
16]
where
is the spherical sector harmonics of
n-th order and
m-th degree,
are the SSH coefficients, and
is the truncation order of SSH decomposition [
16].
can be expressed as [
16]
where
is the normalization constant, which can be calculated by
where
and
are the shift coefficients that transform
to the shifted associated Legendre polynomials
, and
u is the scale coefficient that changes the exponential function
to the scaled exponential functions
. These coefficients depend on the spherical sector region, represented by
As
and
, based on (
6), we have
and
. When
,
,
, and
, SSHs reduce to SH.
According to (
2) and (
4), the definition of SSHs is similar to the definition of SH but contains the shifted associated Legendre polynomials
and scaled exponential functions
rather than the associated Legendre polynomials
and exponential functions
. SH has been successfully implemented in the spatial audio and acoustics field, but the following limitations of SSHs can hinder their further application.
Firstly, to preserve the orthogonality of the associated Legendre polynomials and exponential functions in the original SH, SSHs replace these functions with shifted associated Legendre polynomials,
, and scaled exponential functions,
. This replacement effectively maps the sector region in SSHs back to the entire sphere in SH. However, this mapping process violates the Helmholtz equation in both the elevational and azimuthal directions, as demonstrated in
Section 3.
Secondly, due to this violation of the Helmholtz equation,
no longer accurately represents the variation in and propagation of the sound field in the radial direction. As a result,
cannot be used to extend the sound field from one measured sector to another sector with the same angular range of
and
but a different radius. Additionally, the SSH coefficients,
, become radius-dependent, making radial extrapolation with SSHs challenging and impractical for direct application. Further details are provided in
Section 4.
Third, the mapping process of SSHs can cause severe distortion around the edge of the sector region when using (
3) to reconstruct a sound field. For a sector region
(
), the shifted associated Legendre polynomial maps two latitudinal edges of the sector region (
or
) to two poles of the sphere (
or
), and the scaled exponential function maps two longitudinal edges of the sector region (
,
or
,
) to one longitude line of the sector region (
,
). Due to the mapping process, the information of the sound field over the edge of
is lost, and the reconstructed sound field has severe distortion around the edge. The distortion problem is further investigated in
Section 5.
3. Violation of the Helmholtz Equation
This section proves that SSHs are not the angular part solution of the Helmholtz equation. Consider an arbitrary spherical sector region
,
. The related SSH function is shown in (
4), defined in the spherical coordinates. Therefore, if SSHs are solutions of the Helmholtz equation, they should satisfy the Helmholtz equation in spherical coordinates, shown in (
7) below:
where
p is the sound pressure at an observation point
at time
t. By separating the variables, we have
and (
7) can be rewritten to four ordinary differential equations:
Considering a single frequency case, with (
3) and (
4), we have
where
is the radial frequency. Then, based on (
8)–(
10), for SSHs to be considered solutions of the Helmholtz equation,
should satisfy (9b), and
should satisfy (9a).
3.1. Elevational Direction
We first provide our proof in the elevational direction. We use
to represent the shifted associated Legendre polynomials:
where
,
. Let
, and we have
. Based on the chain rule [
32], the left-hand side of (9b) can be rewritten as
has the following property [
31]:
when
and
, (
12) can be simplified as
From (
13) and (
14), we observe that when
and
,
satisfies (9b). However, for
or
, (
12) does not match the left-hand side of (
13).
To further prove that SSHs are not solutions of the Helmholtz equation, we take the specific example of
[
33] into (
12) and have
where
or
. Thus, the shifted associated Legendre polynomials in SSHs violate (9b) and are not a solution of the Helmholtz equation.
3.2. Azimuthal Direction
The validity of the azimuthal direction is also examined. We use
to represent the scaled exponential function and have
By taking (
16) into the left-hand side of (9a), we have
From (
17), we can find that only when
,
satisfies (9a).
Based on the proof provided above, it is evident that SSHs can satisfy the Helmholtz equation only when , , and . In this special case, SSHs become SHs, which are indeed solutions of the Helmholtz equation. However, for any other values of , , and u, SSHs do not fulfill the Helmholtz equation. The violation of the Helmholtz equation can affect the application of SSHs in certain scenarios, such as sound field radial extrapolation, which is investigated in the following section.
4. Limitations on Radial Extrapolation
Sound field radial extrapolation is an essential property of SHs. With (
1), we can extrapolate the sound field from the measured sphere to another sphere with a different radius. This property enables us to analyze the sound field over the space with a measurement on a sphere and is implemented in many acoustics applications, such as ANC [
14] and sound field reproduction [
34]. As SSHs are not solutions of the Helmholtz equation, shown in
Section 3, the radial function is not theoretically guaranteed to present the sound field changes in the radial direction. Therefore, analyzing the feasibility of conducting the radial extrapolation with SSHs is important. In our previous work [
28], we proposed a sector sound field radial extrapolation method using the mapping relationship between SHs and SSHs, where we conducted radial extrapolation with SHs and then mapped SHs to SSHs to reconstruct the sound field in the extrapolated sector region. However, obtaining accurate SH coefficients from a restricted measurement region required a dual-array measurement and a large angular range of the measurement region. In this section, we investigate whether SSHs possess a similar extrapolation property to SHs and can decompose the sound field with radially independent coefficients.
As shown in
Figure 2, the radial extrapolation of a spherical sector sound field involves expanding or contracting the sectorial sound field from the measured region,
, to a target sectorial region,
, which shares the same angular range for
and
but has a different radius. This process can be divided into two cases: the exterior and interior cases. In the exterior case, illustrated in
Figure 2a, the sound sources are located inside a sphere,
, and we expand the sound field to an outer target region,
. Conversely, in the interior case shown in
Figure 2b, the sound sources are positioned outside a spherical region,
, and we contract the sound field to an inner target region,
.
To test the performance of radial functions in SSH radial extrapolation, we create a similar equation, Equation (
18), with reference to (
1):
where
is the radial independent SSH coefficient. If (
18) is valid,
can be calculated by
where
denotes the pseudo-inverse operation and
denotes a
matrix that contains the multiplication of SSHs and the radial function, expressed as
is a
vector that contains the sound pressure measurements over the sector region by
Q microphones (
), with the coordinate
.
Simulations are conducted to test the validity of (
18) for both exterior and interior cases. To calculate the extrapolation error,
points are placed over
according to the nearly uniform sampling. The extrapolation error
is defined as
where
is the exact sound field over
, and
is the extrapolated sound field over
.
4.1. Exterior Case
We test the validity of (
18) using three unit strength point sources with Cartesian coordinates
,
, and
. First, we investigate the extrapolation performance within a small sector region defined by
,
,
, and
. We then estimate the sound field using (
18) and (
19).
The extrapolation results are presented in
Figure 3. Notably, when
, the sound field estimation is accurate and matches the ground truth. However, when
, the extrapolated sound field deviates from the actual sound field, which means we cannot conduct an accurate extrapolation with (
18) in this example.
Subsequently, we investigate the extrapolation performance under various conditions, including different frequencies ( 300, 500, 700, and 900 Hz), sizes of (, and ), and distances between and (ranging from 0 to 19, where is the wavelength). We keep constant, and the distance between and is represented in units of wavelength (). Additionally, we ensure an adequate number of microphones () for the simulations.
The following observations can be made from the results shown in
Figure 4: (i) the extrapolation error (
) increases rapidly within a distance of
and gradually saturates as the distance grows; (ii) lower frequencies yield faster convergence of
to smaller values; and (iii) smaller sector regions tend to result in larger
values.
Based on the simulation results, we find that (
18) alone cannot accurately extrapolate the sector sound field under the exterior case. However, under certain conditions, such as low frequencies, large sector regions, and small distances from the measurement region, (
18) can still provide acceptable estimations of the extrapolated sound field.
4.2. Interior Case
4.2.1. Plane Wave
For the interior case, we first validate the extrapolation with a unit-amplitude plane wave source arriving from the direction
,
. The setups for
and the microphone placement are the same as in the exterior case. We estimate the sound field for different inner concentric sector regions with
and
. The results are shown in
Figure 5 below. We observe that (i) when
, we can accurately estimate the plane wave shape over
; (ii) compared to the exterior case, when
, the estimation error is significantly larger.
Then, we analyze the extrapolation performance with different settings of frequencies (
300, 500, 700, and 900 Hz), sizes of
(
, and
), and distances between
and
. We consistently set
and
. With
, we only analyze the distances between
and
from 0 to 0.7
. The results are shown in
Figure 6, which demonstrates that (i)
increases sharply within 0.1
distance; (ii)
reaches the largest value when the distance is in the range of 0.3
to 0.5
and decreases or fluctuates around the peak value when the distance increases further; (iii) compared to
Figure 6, we have a larger
in
Figure 6, and the dependency of the error on the frequency and the size of
is less regular in
Figure 6.
We additionally explore the impact of the arrival direction by varying
from
to
in increments of
, while keeping
and
. We conduct simulations with varying
settings and distances between
and
. The results are illustrated in
Figure 7, leading us to the following observations: (i) the extrapolation error
exhibits symmetry around
; (ii) for
,
is small and particularly responsive to changes in
, especially with larger
; (iii) when
,
rises from its minimum value to a peak at
, and then descends back to its minimum again; (iv) if
,
is larger and less sensitive to directions in
; (v) with
,
diminishes with the increment of
, whereas with
,
rises with the increment of
.
4.2.2. Point Source
We further investigate the sound field extrapolation performance in the interior case with a unit strength point source, with the Cartesian coordinates
. The experimental setup, including settings of
and the microphone placement, is maintained as described in
Section 4.2.1. We estimate the sound field for various inner concentric sector regions with
and
. The results, shown in
Figure 8, demonstrate that, consistent with the findings in the preceding subsections, the accurate estimation of the sector sound field is achievable when
. However, significant errors arise when
.
Continuing our investigation, we conduct additional simulations to analyze the extrapolation error (
) using various setups while maintaining a point source instead of a plane wave. The results are depicted in
Figure 9, which reveals the following: (i) the results are similar to those for a plane wave in
Figure 6; (ii) the impact of frequency and the size of
on
remains less regular; (iii) compared to
Figure 6, the trajectories of
in
Figure 9 are more concentrated across different
setups. This suggests that the size of
has a relatively minor effect on the extrapolation accuracy with point sources, in contrast to plane waves.
In conclusion, the findings from both the exterior and interior case simulations underscore the limitations of conducting radial extrapolation with SSHs. While accurate extrapolation is challenging when
, (
18) shows potential for estimating the extrapolation in the exterior case, especially under specific conditions involving low frequencies, large sector regions, and small extrapolation distances.
5. Near Edge Distortion Problem
The sound field decomposition and reconstruction are fundamental steps in SSH-based sound field processing. For a spherical sector region, the related SSH coefficients
can be found by
where
denotes the
matrix of SSH, expressed as
After estimating
with (
22), the sound field can be reconstructed using (
3). While this process is similar to sound field decomposition and reconstruction in SH processing, it can lead to severe distortions around the edges of the spherical sector region. These distortions are caused by mapping the spherical sector to a whole sphere. As illustrated in
Figure 10, the points over the two latitudinal edges of the sector region (
or
) are mapped to the two poles of the sphere (
or
) during the mapping process of shifted associated Legendre polynomials. Similarly, the points over the two longitudinal edges of the sector region (
,
or
,
) are mapped to one longitudinal line of the sector region (
,
) during the mapping process of scaled exponential functions. These mappings lead to information loss over the edge of the sector region, causing distortion around the sector edge.
We investigate this distortion problem using simulations. For the exterior case, we place three unit strength point sources with the Cartesian coordinates
,
, and
. For the interior case, we place three unit strength point sources with the Cartesian coordinates
,
, and
. We analyze the sound field reconstruction distortion over a sector region
with
, and
. The measurement region
is the same as
. Microphones are placed over
with a nearly uniform distribution, with
for the 300 Hz setup and
for the 700 Hz setup. The reconstruction error over elevation direction
and azimuth direction
are defined as
and
where
is the number of samples along the azimuth direction, and
represents the number of samples along the elevation direction. In the simulation, we set
, resulting in a total of 41,209 sampling points over region
.
The results depicted in
Figure 11 and
Figure 12 reveal that the distortion appears around all four edges of
in both the exterior and interior cases, and the distortion is more severe as it approaches the edges. Due to this distortion issue, to enhance the performance of SSH in decomposition and reconstruction, the measurement region
should be larger than
.
6. Discussion
Although SSHs were derived from SHs and share high similarities in their formulations, according to the analysis in
Section 3,
Section 4 and
Section 5, it is evident that SSHs have essential limitations compared to SHs. The first limitation arises from violating the Helmholtz equation, a fundamental requirement for many sound field processing applications. Consequently, SSHs cannot accurately conduct sound field radial extrapolation. As a result, SSHs remain suitable solely for analyzing measurement regions and are incapable of extending sound field analysis into the broader space. The simulation results have shown significant errors in extrapolating the sound field using SSHs, except under specific conditions such as the exterior case with low frequencies, large sector sizes, and small extrapolation distances. Furthermore, the distortion problem affects the reconstruction accuracy over the edge areas of the sector region. Overall, SSHs remain reliable tools for decomposing and reconstructing the sound field within the measured spherical sector region, with applications in beamforming and localization. However, users should be mindful of the noted limitations to ensure they are applied appropriately.
To improve the performance of SSHs, it is crucial to ensure that the measurement region is larger than the target sector region both longitudinally and latitudinally to mitigate the distortion issues and improve the accuracy of sound field reconstruction. Considering these limitations, future implementations of spherical sector harmonics should primarily focus on processing and analyzing the measurement sector region without any extrapolation process. Alternatively, if a dual-array setup is available, the method proposed in [
28] can be employed to enhance the extrapolation performance. Additionally, for the interior case, an improved reconstruction accuracy can be achieved when the source is oriented towards the sector region.