The inequality (23) means that the KSD keeps invariant when the tails of the two distributions, , are altered. Therefore, the KSD does not reflect the difference between the two PDFs on tails. However, the tails of the amplitude distributions of sea clutter are important in the control of the false alarm rate in target detection of maritime surveillance radars.
Owing to the logarithmic difference in the integration, the KLD better reflects the difference of the PDFs on tails. Herein, the RMMSE, KSD, and KLD are used in experiments for a full evaluation.
3.1. Performance Evaluation by Using Simulated Data
It is difficult to analyze the RMMSEs of the joint estimates of the two parameters in the
TML estimation method (13). Instead, we analyze the RMSME of one of the parameters when the other is known. When the scale parameter is known, the
TML estimate of the inverse shape parameter is given by the following:
The Equation (25) shows that the
TML estimate at the known scale parameter is equivalent to its estimate as the scale parameter is one. Thus, the RMMSE of the inverse shape parameter is independent of the scale parameter as it is known. Similarly, when the inverse shape parameter is known, the
TML estimate of the scale parameter is given by the following:
In terms of the definition in (21), the RRMSE of the scale parameter is independent of the scale parameter itself if only the ratio
is independent of the scale parameter. The ratio is given by the following:
Note that the normalized data
are independent of the scale parameter. Thus, the RRMSE of the scale parameter is independent of the scale parameter and depends on the inverse shape parameter. In the TML estimation method, the scale and inverse shape parameters are iteratively estimated, and the two parameters are unknown. When the two parameters are unknown, it is difficult to prove that the RRMSEs of the two parameters are independent of the scale parameter. For other estimation methods [
15,
16], it is verified to be true by Monte-Carlo tests.
In the first experiment, we examine this point for the TML estimation method and set up the sample number
N = 5 × 10
3. The inverse shape parameter is taken as 0.1, 1, 4, and 10, and the scale parameter varies from 0.1 to 15, and 10
5 independent trials to calculate the RRMSE at each parameter setup. As shown in
Figure 1, the RRMSEs of the two parameters are almost horizontal lines and thus are independent of the scale parameter. In subsequent simulation experiments, the scale parameter is always taken as one. Moreover, the RRMSEs of the two parameters depend on the inverse shape parameter. As
υ gets smaller, its RRMSEs become smaller first and then become larger. The difference is that the RRMSE of the scale parameter decreases with the decrease in
υ, which is because the clutter power distribution has a wider range due to the heavier tail of the amplitude distribution, thus resulting in more loss of estimation accuracy.
In the second experiment, we examine the influence of the selection of truncation rate on the TML estimation method in the context of sample data consisting only of pure clutter. In the BiP estimation method, the selection of the upper percentile position is a compromise between estimation accuracy and outlier robustness [
18]. The closer to one the upper percentile position is, the more samples are used, and the higher the estimation accuracy is in the case of pure clutter data, but the weaker the robustness to outliers is at the same time. For the TML estimation method, the truncation rate has the same effect as in
β of the BiP estimation method. When
β = 1, the TML estimator gets the IML estimator, which achieves the highest accuracy in the case of pure clutter data but is sensitive to outliers.
Figure 2 illustrates the RRMSEs of the TML estimation method when
β = 0.85, 0.9, 0.95 and 1, and the sample size
N varies from 500 to 10
4, and
υ = 2. As the truncation ratio
β is close to one, the TML estimation method is similar to the IML estimation method in form and RRMSE curve. The TML estimation method is an extension of the IML estimation method to adapt to a cluttered environment with outliers. Moreover, the straight lines of the RRMSE versus sample size on the logarithmic plot show that the RRMSEs of the TML estimation method are inversely proportional to
N1/2.
The third experiment compares the TML estimator with
β = 0.95 with the existing estimators in the case without an outlier. The existing estimators include the MoM estimator [
8], the LOME estimator [
15], the IML estimator and the BiP estimator with
α = 0.5 and
β = 0.95. The setup of simulations is the same as in the second experiment.
Figure 3 demonstrates the RRMSEs of the two parameters by the five estimators. For the inverse shape parameter, the IML estimator attains the best performance, followed by the TML estimator, the LOME estimator, the BiP estimator, and the MoM estimator. For the scale parameter, the IML estimator keeps the best one, followed by the LOME and MoM estimators, the TML estimator, and the BiP estimator. Note that the scale parameter in the MoM and LOME estimators is estimated by the second-order sample moment of data, which is not affected by the estimates of the inverse shape parameter. For the IML, TML, and BiP estimators, the estimates of the scale parameter are always affected by that of the inverse shape parameter due to the iteration process or the order of the two parameters to be estimated. On the whole, for data without outliers, all five estimators are comparable in performance, though there exist some differences. In other words, when data are without outliers, anyone among the five estimators is acceptable in applications.
In what follows, we examine the influence of the inverse shape parameter on the RRMSEs. In the experiment, we set up the sample size as
N = 5000 and the inverse shape parameter ranges from 0.1 to 15, and 10
4 independent trials to calculate the RRMSEs at each parameter setup.
Figure 4 demonstrates the RRMSEs of the five estimators. The RRMSE curves of the five estimators have the same trends. As the inverse shape parameter alters from 0 to 15, the RRMSE of the inverse shape parameter first becomes smaller and then slowly increases, while that of the scale parameter monotonously increases. In the case without an outlier, the IML estimator achieves the smallest RRMSEs, followed by the TML estimator, the LOME estimator, the BiP estimator, and the MoM estimator at the inverse shape parameter. For the scale parameter, the situation is slightly different and the BiP estimator and the TML estimator have poorer performance, and the IML estimator and the MoM and LOME estimators have better performance, which is due to the fact that the errors of the inverse shape parameter propagate into the estimates of the scale parameter in the TML and BiP estimators. Moreover, it is found that the error of the MoM estimator at the inverse shape parameter is much larger than the other four estimators. Therefore, for data without outliers, the MoM estimator is not suggested to be used in applications.
The use of pure clutter data alone is insufficient to illustrate the performance of the TML estimation method. So, in the sequent experiment, we use data with outliers as validation. In this case, the MoM estimator, the LOME estimator, and the IML estimator suffer from quite a severe degradation in performance. Thus, only the outlier-robust BiP estimation method is compared with the TML estimation method. In each trial, δ
1 percentages of the pure clutter data, which two parameters are given are replaced with outliers whose amplitude is δ
2 times the average amplitude of the clutter data, where δ
1 follows the uniform distribution of the interval [0, 0.02
N] and δ
2 follows the uniform distribution of the interval [20
0.5, 20], so as to increase the randomicity of outliers in trials. In comparison, the truncation ratio in the TML estimator and the position of the upper percentile in the BiP estimator are taken as
β = 0.95 so that they have the same outlier robustness.
Figure 5 illustrates the RRMSEs of the two estimators, where
υ ranges between 0.1 and 15 and
N = 5000. The TML estimator obtains smaller RRMSEs than the BiP estimator. Unlike the BiP estimator, which uses only two special sorted samples in estimation, the TML estimates are obtained by fitting the truncated empirical CDF of the data using the biparametric CDF. Obviously, the curve fitting is better than the simple two-point interpolation.
Besides, the TML estimation method is outlier-robust instead of outlier-free [
20]. In other words, outliers also bring the degradation of the estimation performance of the TML estimators even though the percentage of outliers is smaller than one minus the truncation ratio
β. The reason is straightforward. The existence of outliers influences the right end of the truncated CDF of the data. This situation is illustrated by the sixth experiment. We compare the TML estimator with the truncation ratio
β = 0.80, 0.9, 0.95, and 1 as
N = 5000 when there exist outliers of not more than 2% of all the samples, as in the fifth experiment.
Figure 6 plots the RRMSE curves of the two parameters as
υ alters from 0.1 to 15. Several facts can be observed in
Figure 6. Firstly, the IML estimator with
β = 1 has the poorest performance for its sensitivity to outliers. Secondly, the TML estimator with
β = 0.95 has better performance in the case without outliers than in the case with outliers, which shows that the TML estimation method is outlier-robust rather than outlier-free. In other words, the TML estimation method can avoid sharp degradation from outliers but fails to avoid their effect on performance. Besides, the TML estimator with
β = 0.95 is not the best one among the four TML estimators, because outliers severely affect the right end of the truncated CDF of the data. Thirdly, the TML estimator with
β = 0.80 attains the best performance. In fact, the selection of the truncation ratio is a tradeoff between the influence of outliers on the truncated CDF of the data and the utilization rate of samples. The TML estimator with
β = 0.90 attains almost same performance as one with
β = 0.80. In sequent experiments using measured data, the TML estimator with
β = 0.90 is used to assure wholly good performance in cases with or without outliers.
3.2. Performance Evaluation Using Measured Data
Two measured radar datasets from the open and recognized IPIX database [
30] and CSIR database [
31] are used to evaluate the TML estimation method. The dataset from the IPIX database is ‘19980223_184853_ANTSTEP’ with a range resolution of three meters and a pulse repetition frequency of 1000 Hz. It consists of the complex radar returns at the 27 adjacent range cells during one minute. The amplitude map is plotted in
Figure 7a, where a deep red stripe on the 20–24th range cells is the radar returns of the target under test, a floating small boat, with an average signal-to-clutter ratio (SCR) of 22.4 dB at the 21st cell. Due to the small grazing angle range of the 27 range cells, the sea clutter data in
Figure 7a can be regarded to follow the same amplitude distribution. At first, the sea clutter data at the 7–13th range cells, labeled as area A in
Figure 7a, which contains 210,000 samples, are used to calculate the empirical APDF and determine the optimal type from the K-distributions, the IGCG distributions, the generalized Pareto distributions, and the GK-LN distributions. Under each type, the parameters are estimated by the numerical/iterative ML estimators or the tri-percentile estimator. As shown in
Figure 7b, the K-distribution fails to model the APDF of the data, and the other three distributions have comparable goodness-of-the-fit. In terms of their KSD and KLD, the IGCG distribution (KSD = 0.0240, KLD = 0.0018) is better than the K distribution (KSD = 0.0951, KLD = 0.0031), the GK-LN distribution (KSD = 0.0319, KLD = 0.0019) and the generalized Pareto distribution (KSD = 0.0241, KLD = 0.0018).
Area B contains half the samples of area A and does not contain outliers. When the IGCG distributions are used to model the data, the empirical APDF and the fitted APDFs by using the five estimators are plotted in
Figure 7c. The five fitted APDFs are comparable in goodness-of-the-fit. In other words, for clean data without outliers, all five estimation methods can provide satisfactory parameter estimation. Area C contains 3% outliers that come from target returns.
Figure 7d plots the empirical APDF of the data and the fitted IGCG distributions by using the five estimators. The outlier-sensitive MoM, IML, and LOME estimators fail to fit the APDFs of the data due to their poor parameter estimation abilities for data with outliers. The fitted APDFs by using the outlier-robust BiP and TML estimators accord with the empirical APDF of the data.
For a quantitative assessment,
Table 1 lists the estimates of the inverse shape parameter and scale parameters by using the five estimators on the three areas and the KSDs and KLDs of the fitted APDF and the empirical truncated APDF of the data (truncation ratio 0.90). Area A contains a large number of samples and does not contain any outliers. When the IML estimator is used in area A, the error indexes of parameter estimation, including KSD and KLD, are very small. Therefore, the estimates for area A are often used as the true values of the parameters to assess other estimators. In area B without outliers, the errors of the estimates, KSD, and KLD give a consistent evaluation of the five estimators. The IML estimator is the best one, followed by the TML, BiP, LOME, and MoM estimators. In area C with outliers, the estimates obtained from the outlier-sensitive MoM, LOME, and IML estimators severely deviate from the true values of the two parameters. Hence, the three estimators fail to be used in real clutter environments. Differently, the outlier-robust BiP and TML estimators still give high-precision estimates of the two parameters. Besides, the estimates by the TML estimator are much better than those by the BiP estimator, which is owing to the fact that the TML estimator uses most of the points on the empirical APDF while the BiP estimator uses only the two special points on the empirical APDF.
The other dataset comes from the CSIR database [
31], an X-band high-resolution sea clutter database, and its filename is ‘TFA10_001.01’. The range resolution of the dataset is fifteen meters, the PRF is 2500 Hz, and it was collected at a high sea state and a small grazing angle. The data consist of the radar returns at 64 range cells during 24 s. The amplitude map on the range-slow time plane is plotted in
Figure 8a. The target under test, a moving small boat, is at the 16–17th range cell and the target returns have an average SCR of 19.3 dB. Besides the red stripe from the target returns, there exist some regular orange slant stripes, which originate from the power modulation of long waves on sea clutter at high sea state. It is referred to as the “structural textures in sea clutter” [
32]. Sea clutter data on a large-sized area A from the 20th to 36th range of cells is first used to determine the optimal distribution type and parameters. As shown in
Figure 8b, the dotted line is the empirical APDF of the data. When the four types of biparametric amplitude distributions are used to fit the empirical APDF, the fitted APDFs are illustrated in
Figure 8b. The fitted K-distribution severely deviates from the empirical APDF and is unsuitable to model the data. The other three fitted APDFs can fit well with the empirical APDF of the data. In terms of their KSDs and KLDs, the IGCG distribution provides the best goodness-of-the-fit. The KSD of the KLD of the IGCG distribution is obviously larger than that in the first measured data. In fact, due to the existence of structural textures in the second measured data, one mixture of two compound-Gaussian distributions [
32] is a better choice. In applications, the biparametric IGCG model is more suitable, owing to simple parameter estimation and effective adaptive detection under the model.
For area B without an outlier, the empirical APDF of the data and the fitted ICCG distributions using the five estimators are demonstrated in
Figure 8c. The fitted APDFs by using the five estimators highly accord with the empirical APDF of the data, and the five estimators are usable for clean sea clutter data. Area C contains about 2% of outliers from target returns. In the case of outliers, the empirical APDF of the data and the fitted IGCG distributions using the five estimators are illustrated in
Figure 8d. Obviously, the fitted distributions by using the MoM, LOME, and IML estimators severely deviate from the empirical APDF of the data. The outlier-sensitive estimators fail to use them in the case of outliers. The fitted distributions by using the BiP and TML estimators can better fit the empirical APDF of the data. It is necessary to use outlier-robust estimation methods when data contain outliers.
A quantitative assessment for the five estimators is listed in
Table 2, where the estimates of the two parameters, the KSDs and KLDs of the empirical APDF of the data and the fitted APDFs for each area are included. The IML estimates on area A are used as the true values of the two parameters to assess other estimators, owing to their high precision in the case without outliers and large sample size. In area B without outliers, in terms of the KSDs and KLDs, the IML estimator is the best one, followed by the TML, BiP estimators, the LOME estimator, and the MoM estimator. Particularly, the MoM estimator suffers from large errors in the inverse shape parameter and thus is not recommended for use. In area C with outliers, the estimates by the three outlier-sensitive estimators fully depart from the true values of the two parameters. Particularly, the estimates of the inverse shape parameter have too large errors. The two outlier-robust estimators still give acceptable estimates of the two parameters, though there is some performance loss in comparison with the results in area B. Moreover, the TML estimator behaves much better than the BiP estimator at the inverse shape parameter. In terms of the KSDs and KLDs, the TML estimator is the best one, followed by the BiP estimator, the IML estimator, the LOME estimator, and the MoM estimator.