1. Introduction
A nonlinear regression model is a regression model in which the relationship between variables is not linear. Nonlinear regression models have been widely used across disciplines. For instance, Hong [1] applied a nonlinear regression model to economic system prediction; Wang et al. [2] studied its application in detecting protein layer thickness; Chen et al. [3] utilized a nonlinear regression model in the price estimation of surface-to-air missiles; and Archontoulis and Miguez [4] used nonlinear regression models in agricultural research.
The principle of median-of-means (MOM) was first introduced by Alon, Matias, and Szegedy [5] to approximate frequency moments with limited space complexity. Lecué and Lerasle [6] proposed new estimators for robust machine learning based on MOM estimators of the mean of real-valued random variables; these estimators achieve optimal rates of convergence under minimal assumptions on the dataset. Lecué et al. [7] proposed the MOM-minimizers estimator based on the MOM method, which is very effective when the hypotheses may have been corrupted by outliers. Zhang and Liu [8] applied the MOM method to estimate the parameters of multiple linear regression models and AR error models for repeated measurement data.
For the unknown parameters of a nonlinear regression model, Radchenko [9] proposed the nonlinear least squares estimator. Ding [10] introduced the empirical likelihood (EL) estimator of the parameters of the nonlinear regression model based on the empirical likelihood method. However, as noted by Gao and Li [11], when outliers are present these general methods are sensitive and easily distorted. Building on the work of Zhang and Liu [8], this paper applies the MOM method to estimate the parameters of nonlinear regression models and obtains more robust results.
The paper is organized as follows. In Section 2, we review the definition of the nonlinear regression model, introduce the MOM method, and prove the consistency and asymptotic properties of the MOM estimator. In Section 3, we introduce a new test based on the empirical likelihood method for the median. Section 4 illustrates the advantages of the MOM method through simulation studies. A real application to GDP data is given in Section 5, and conclusions are discussed in the last section.
2. Median-of-Means Method Applies to Nonlinear Regression Model
We consider the following nonlinear regression model introduced by Wu [12]:

y_i = f(x_i, θ) + ε_i,  i = 1, …, T,

where θ is a fixed unknown parameter column vector, x_i is the i-th "fixed" input vector with observation y_i, f is a known functional form (usually nonlinear), and the ε_i are i.i.d. errors with mean 0 and unknown variance.
According to Zhang and Liu [8], the MOM estimator of the parameter is produced by the following steps:
Step I: We separate the T observations into g groups, each containing m = T/g observations (for convenience of calculation, we assume that T is always divisible by g). Regarding the choice of the grouping number g, Emilien et al. [13] suggest choosing g through a ceiling-function rule that depends on the desired confidence level, where ⌈·⌉ is the ceiling function. In fact, the structure of the observations is always unknown and the diagnosis of outliers is complicated, so we usually fix g using some constant C regardless of outliers.
Step II: We estimate the parameter separately in each group by the nonlinear least squares estimator; denote the estimate from group j by θ̂_j.
Step III: The MOM estimator is defined as the median of the group estimates, θ̂_MOM = median(θ̂_1, …, θ̂_g).
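The three steps above can be sketched as follows. This is a minimal illustration assuming a one-parameter model f(x, θ) = exp(θx) and a hand-rolled, clipped Gauss-Newton fit; both choices are ours for exposition and are not the paper's models.

```python
import numpy as np

def nls_fit(x, y, theta0=0.0, n_iter=50):
    """Nonlinear least squares for the illustrative model f(x, theta) = exp(theta*x),
    via a simple clipped Gauss-Newton iteration (Step II)."""
    theta = theta0
    for _ in range(n_iter):
        f = np.exp(theta * x)
        grad = x * f                        # derivative of f with respect to theta
        resid = y - f
        step = grad @ resid / (grad @ grad)
        theta = float(np.clip(theta + step, -10.0, 10.0))  # keep the iterate bounded
    return theta

def mom_estimator(x, y, g):
    """Step I: split the T observations into g groups; Step II: NLS in each group;
    Step III: return the median of the g group estimates."""
    T = len(y)
    m = T // g                              # group size (we assume g divides T)
    ests = [nls_fit(x[j * m:(j + 1) * m], y[j * m:(j + 1) * m]) for j in range(g)]
    return np.median(ests)

rng = np.random.default_rng(0)
T, g, theta_true = 600, 5, 0.5
x = rng.uniform(0.0, 2.0, T)
y = np.exp(theta_true * x) + rng.normal(0.0, 0.1, T)
y[:10] += 50.0                              # gross outliers, all landing in the first group
print(mom_estimator(x, y, g))               # close to theta_true despite the outliers
```

Only one of the five groups is contaminated, so the median ignores its corrupted estimate; a single pooled NLS fit on the same data would be pulled toward the outliers.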
The asymptotic properties of the MOM estimator are summarized in the following theorems; their proofs are postponed to Appendix A.
Theorem 1. For some constant C and any positive integer g, we suppose the following:
(I) Θ is an open interval (finite or infinite) of the real axis, and the regularity conditions of Ivanov's Lemma 1 hold; for sufficiently large positive ρ, the constant c does not depend on n and ρ.
(II) The required derivatives of f exist for all parameter values near the true value, and the true value lies in the interior of Θ.
(III) The stated limit condition holds as T → ∞.
(IV) There exists a constant such that the stated bound holds for all i = 1, …, n.
Under conditions (I)–(IV), for any fixed x, we obtain:
(1) Suppose g is fixed and T → ∞. Let Z_1, Z_2, …, Z_g be i.i.d. standard normal random variables; then the first limit statement holds.
(2) Suppose g → ∞ as T → ∞. Then the following asymptotic normality holds.

3. Empirical Likelihood Test Based on MOM Method
In Section 2, we used the MOM method to estimate the parameters of the nonlinear regression model. In this section, we consider testing the hypothesis that the parameter equals a given value, based on the empirical likelihood method.
Because different groups are disjoint, the group estimators are i.i.d. We treat them as a sample and apply empirical likelihood. For each j, we define the corresponding statistic; its required moment property follows from the proof of Theorem 1 in Appendix A. Given the restrictive conditions, the empirical likelihood ratio of the parameter is
Using the Lagrange multiplier method to find the maximum point, we obtain the following equation, where the multiplier satisfies the equation
Theorem 3. According to Theorem 2 and Owen [14], as the number of groups tends to infinity, the stated chi-square limit holds. Using Theorem 3, the rejection region for the hypothesis with significance level α can be constructed, where the critical value is the upper α-th quantile of the limiting chi-square distribution.
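As a rough illustration of this test, the sketch below computes a generic empirical likelihood ratio statistic for a scalar sample with hypothesized mean zero, solving the Lagrange-multiplier equation by Newton's method and comparing the statistic with the chi-square(1) critical value. The helper names and the example data are ours; the paper's statistics are built from the centered group estimates.

```python
import numpy as np

CHISQ1_95 = 3.841  # upper 5% quantile of the chi-square distribution with 1 df

def el_statistic(z, n_iter=50):
    """-2 log empirical likelihood ratio for H0: E[z] = 0, scalar sample z.
    The Lagrange multiplier lam solves sum z_j / (1 + lam*z_j) = 0 (Newton)."""
    z = np.asarray(z, dtype=float)
    lam = 0.0
    for _ in range(n_iter):
        d = 1.0 + lam * z
        f = np.sum(z / d)
        fp = -np.sum(z ** 2 / d ** 2)
        lam_new = lam - f / fp
        while np.min(1.0 + lam_new * z) <= 1e-8:   # keep all EL weights positive
            lam_new = 0.5 * (lam + lam_new)
        lam = lam_new
    return 2.0 * np.sum(np.log(1.0 + lam * z))

# Under H0 the statistic is asymptotically chi-square(1); reject when it
# exceeds the upper alpha-quantile (3.841 at the 0.05 level).
rng = np.random.default_rng(1)
z = rng.normal(0.0, 1.0, 40)   # stand-in for centered group estimates
stat = el_statistic(z)
print(stat, stat > CHISQ1_95)
```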
4. Simulation Study
In this section, we use R software for simulation. Simulation experiments are carried out to compare the performance of the MOM estimator with the nonlinear least squares (NLS) estimator and the EL estimator under the "no outliers" and "with outliers" cases in Examples 1–3. The mean square error (MSE) of the three estimators is defined as follows: in formula (8), the estimated value and the true value of the parameter appear respectively, and D represents the total number of simulations (D = 1000 in this article). The MSE results in Table 1, Table 2 and Table 3 are all multiplied by 100 and reported to three decimal places. In Examples 4–6, we compare our proposed method with the empirical likelihood inference proposed by Jiang [15].
We report the empirical sizes and powers of the two methods. Size represents the probability of rejecting the null hypothesis when it is true; we set the nominal significance level to 0.05, so a size close to 0.05 is desirable. Power represents the probability of rejecting the null hypothesis when it is false; a power close to 1 is desirable. The empirical size or power is the number of rejections in the D simulations divided by D. In Table 4, Table 5 and Table 6 of this article, the size value refers to the empirical size, and power refers to the empirical power; the empirical size estimates the size, and the empirical power estimates the power. We consider the following three forms of nonlinear regression models, which were also considered by Hong [16].
In this paper, for convenience, we fix the number of groups in the simulations; the resulting choice is consistent with the grouping formula suggested by Emilien et al. [13]. Throughout the paper, the distribution abbreviations B, U, N, and P represent the binomial, uniform, normal, and Poisson distributions respectively, and N(0,1) represents the standard normal distribution. We set the number of repeated observations T to 100, 200, …, 1000.
Example 1. We consider the first model. For the observation data, the grouping is carried out according to the grouping principle, taking into consideration the effect of dispersion in the data sets (the accuracy of the estimator may be affected by the dispersion in the data set). The inputs and errors are generated from the stated distributions, and the output variable has outliers. There are three cases of outliers, drawn from three different distributions. The results are shown in Table 1.
Example 2. We consider the second model, with inputs and errors generated from the stated distributions. The output variable has outliers; there are three cases, with outliers chosen from B(22, 1/2) and two other distributions. The results are shown in Table 2.
Example 3. We consider the third model, with inputs and errors generated from the stated distributions. The output variable has outliers in three cases. The results are shown in Table 3.
- (1)
The MSE decreases for all estimators as T becomes large, whether or not there are outliers.
- (2)
When there are no outliers, the MSEs of the three estimators are basically the same.
- (3)
When there are outliers, the MSE of the MOM estimator is smaller than the MSEs of the other two estimators. From Table 1 and Table 3, the results show that there are no significant differences between the MSEs of the two estimators when T is large.
Example 4. We consider the first model, with data generated from the stated distributions. For the power, we use the stated alternative hypothesis. The results are shown in Table 4. MOMEL represents the empirical likelihood test based on the MOM method, and EL represents the hypothesis test based on the EL estimator.
Example 5. We consider the second model, with data generated from the stated distributions. For the power, we use the stated alternative hypothesis. The results are shown in Table 5.
Example 6. We consider the third model, with data generated from the stated distributions. For the power, we use the stated alternative hypothesis. The results are shown in Table 6.
From the simulation results displayed in Table 4, Table 5 and Table 6, we can see that the size of the proposed test is close to 0.05 and the power approaches 1 as T increases. Especially when T is small, the results of MOM are significantly better than those of EL. As T increases, the MOM method still performs better in terms of size and power, although the power of both methods tends to one. In summary, our method is preferable.
5. The Real Data Analysis
In this section, we apply the MOM method to analyze the data of the top 50 Chinese cities by GDP in 2019. Based on the presentation of Zhu et al. [17], there are many methods to test whether there are outliers in the data, such as the 4d test, the 3σ principle, the Chauvenet method, the t-test and the Grubbs test. Sun et al. [18] also introduced the box plot method. Different test methods will identify different outliers, so we use the box plot shown in Figure 1 to confirm the existence of outliers in the actual data, following the suggestion of Sun et al. [18]. The outliers are 381.55, 353.71, 269.27, and 236.28 (unit: ten billion RMB).
We also use the 3σ principle to test whether there are outliers; the result shows that the outliers are 381.55 and 353.71. From these two tests, we can judge that there are outliers in this real data set.
Yin and Du [19] introduced a power-law distribution. For the purpose of accurately predicting the GDP development trend of major Chinese cities, we fit the curve with the EL method, the MOM method and the NLS method respectively, where the predictor represents the rank of the GDP of the 50 cities in descending order. The dataset is from www.askci.com (accessed on 15 February 2021).
The EL gives the nonlinear regression equation
The MOM gives the nonlinear regression equation
The NLS gives the nonlinear regression equation
In Figure 2, the red line represents the fitting result of the NLS method, the blue line represents the fitting result of the MOM method, the black line represents the fitting result of the EL method, and the yellow points represent the true values of GDP.
In the actual data, the true values of the parameters are unknown, so we cannot calculate the MSE of the parameter estimates. Instead, we use the mean absolute error (MAE), the average value of the absolute errors; its definition is given below.
In the actual data, the first quantity refers to the true value of GDP and the second to the value obtained from the fitted nonlinear regression model, so we calculate the MAE. The MAE of the EL method is 11.984, the MAE of the NLS method is 12.024, and the MAE of the MOM method is 11.982. Cross-validation is used to examine the accuracy of forecasting. Specifically, we randomly take 40 data points as experimental data and the other 10 as forecasting data, with 1000 independent replications. The MAEs of EL, NLS and MOM are 14.206, 14.271 and 12.242 respectively. These results suggest that MOM is more plausible than NLS and EL.
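The MAE and the random cross-validation scheme described above can be sketched as follows. The power-law curve and the log-log fitting rule are our illustrative choices, not the paper's estimators.

```python
import numpy as np

def mae(y_true, y_fit):
    """Mean absolute error between observed and fitted values."""
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_fit))))

def cv_mae(x, y, fit, predict, n_train=40, n_rep=1000, seed=3):
    """Random cross-validation as in the text: fit on n_train points,
    evaluate MAE on the held-out points, average over n_rep splits."""
    rng = np.random.default_rng(seed)
    n = len(y)
    scores = []
    for _ in range(n_rep):
        idx = rng.permutation(n)
        tr, te = idx[:n_train], idx[n_train:]
        theta = fit(x[tr], y[tr])
        scores.append(mae(y[te], predict(x[te], theta)))
    return float(np.mean(scores))

# Toy power-law data y = a * x**(-b), with x the descending GDP rank.
x = np.arange(1.0, 51.0)
y = 400.0 * x ** -0.8
fit = lambda xs, ys: np.polyfit(np.log(xs), np.log(ys), 1)       # log-log least squares
predict = lambda xs, coef: np.exp(np.polyval(coef, np.log(xs)))
print(cv_mae(x, y, fit, predict, n_rep=50))
```

On this noiseless toy curve the cross-validated MAE is essentially zero; on real data it quantifies out-of-sample forecast error, as in the comparison of EL, NLS and MOM above.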
6. Conclusions
It has been shown that the NLS method is not robust to outliers (Gao and Li [11]). In this paper, we first apply the MOM method to the nonlinear regression model and develop its theory, giving results on the asymptotic normality and consistency of the MOM estimator. Second, we propose a new test based on the empirical likelihood method. Third, we use the MOM method to estimate the parameters of three forms of nonlinear regression models and compare the MSEs of the three estimators. The results in Table 1, Table 2 and Table 3 show that the MSE of the MOM estimator is the smallest, and the sizes and powers in Table 4, Table 5 and Table 6 demonstrate the superiority of the MOM method. Finally, the MOM method is applied to predict the GDP development of Chinese cities; the MAE values show that the prediction of the MOM method is better than that of the NLS method. In summary, the MOM method does not require eliminating outliers: whether or not there are outliers in the data, the MOM method yields a robust estimate.
Author Contributions
Conceptualization, P.L.; methodology, P.L.; M.Z. and Q.Z.; software, R.Z.; writing—original draft, R.Z.; writing—review and editing, M.Z., R.Z. and Q.Z. All authors have read and agreed to the published version of the manuscript.
Funding
Pengfei Liu’s research is supported by the National Natural Science Foundation of China (NSFC11501261, NSFC52034007) and the State Scholarship funded by China Scholarship Council (CSC201808320107). Ru Zhang’s research is supported by the Project funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions. Qin Zhou’s research is supported by the National Natural Science Foundation of China (NSFC11671178).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Acknowledgments
We thank Shaochen Wang and Wang Zhou for their help, and the reviewers for their constructive comments.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
In this appendix, we give the technical proofs of Theorems 1–3.
Lemma A1 (Chernoff's inequality, cf. Vershynin [20], Theorem 2.3.1). Let X_1, …, X_N be independent Bernoulli random variables with parameters p_i. Consider their sum S_N = X_1 + … + X_N and its mean μ = E[S_N]. Then, for any t > μ, we have P(S_N ≥ t) ≤ e^{−μ}(eμ/t)^t.
Proof of Theorem 1. In accordance with condition (I) of Theorem 1 and Lemma 1 of Ivanov [21], for sufficiently large positive ρ, the constant c does not depend on n and ρ, and we have
According to Wu [22], the least squares estimate of the parameter in group j (j = 1, …, g) is
According to formula (2), conditions (II)–(IV), and Theorem 5 of Wu [12], we know that
According to Pinelis [23], with a constant in the bound, we know that
where Φ represents the cumulative distribution function of the standard normal distribution.
According to formula (A4), we have
For each
, suppose
, we have
for all
, according to the elementary inequality
where
for large n and fixed
, hence
Similarly, we can get
where
is a constant that depends on H but not n, so we have
It is easy to verify that
so we have the conclusion
Define the Bernoulli random variables
then we have
by formula (A9). It can be seen that the event E occurs if and only if the sum of these variables is larger than the stated threshold, hence
We have used Lemma A1 in the last step. This ends the proof of Theorem 1.
For any fixed x, we define i.i.d random variables
and suppose
according to Formula (A4)
for all real
x. The following lemma gives the central limit theorem for the partial sums of
. □
Lemma A2. Suppose as . We havefor the fixed x, as , Proof of Lemma 2. For convenience, we write
as
. By independence, for any real t and
, we have
by Taylor's expansion, we have
where we used the formula
, when
and
,
so the first conclusion of Lemma A2 follows from formula (A13).
For the second conclusion, we find that the above calculations still hold if we replace
x with
and note the fact that
We can prove formula (A15) by virtue of Slutsky's theorem. □
Proof of Theorem 2. (1) This follows immediately by formula (A4) and the continuous mapping theorem since the Median function is continuous.
We first assume g is odd and for any real
x, and we have
under the above lemma, it tends to
.
If g is even, we can know
and
The right-hand sides of the above two inequalities tend to the stated limit as g → ∞. □
Proof of Theorem 3. Recall that
where
, so formula (6) is
set that
, and we have
Combining the constraint condition
, we can get that
, and
The last equality follows from formula (A20). So,
and according to Lemma A2, we can get
, we can get
so
The final term in formula (A26) above has a norm bounded by
Therefore
where
.
Through formula (A26) and using Taylor expansion, we can find that
holds for some finite
,
,
as
and
.
This completes the proof. □
References
- Hong, Z. The application of nonlinear regression model to the economic system prediction. J. Jimei Inst. Navig. 1996, 4, 48–52.
- Wang, D.; Jiang, D.; Cheng, S. Application of nonlinear regression model to detect the thickness of protein layer. J. Biophys. 2000, 16, 33–74.
- Chen, H.; Wang, J.; Zhang, H. Application of nonlinear regression analysis in establishing price model of ground-to-air missile. J. Abbr. 2005, 4, 77–79.
- Archontoulis, S.V.; Miguez, F.E. Nonlinear regression models and applications in agricultural research. Agron. J. 2015, 105, 1–13.
- Alon, N.; Matias, Y.; Szegedy, M. The space complexity of approximating the frequency moment. J. Comput. Syst. Sci. 1999, 58, 137–147.
- Lecué, G.; Lerasle, M. Robust machine learning by median-of-means: Theory and practice. Ann. Stat. 2017, 32, 4711–4759.
- Lecué, G.; Lerasle, M.; Mathieu, T. Robust classification via MOM minimization. Mach. Learn. 2018, 32, 1808–1837.
- Zhang, Y.; Liu, P. Median-of-means approach for repeated measures data. Commun. Stat. Theory Methods 2020, 2020, 1–10.
- Radchenko, P.P. Nonlinear least-squares estimation. J. Multivar. Anal. 2006, 97, 548–562.
- Ding, X.; Xu, L.; Lin, J. Empirical likelihood diagnosis of nonlinear regression model. Chin. J. Appl. Math. 2012, 4, 693–702.
- Gao, S.; Li, X. Analysis on the robustness of least squares method. Stat. Decis. 2006, 15, 125–126.
- Wu, C.-F. Asymptotic theory of nonlinear least squares estimation. Ann. Stat. 1981, 9, 501–513.
- Emilien, J.; Gábor, L.; Roberto, I.O. Sub-Gaussian estimators of the mean of a random vector. Ann. Stat. 2017, 47, 440–451.
- Owen, A.B. Empirical likelihood ratio confidence intervals for a single functional. Biometrika 1988, 75, 237–249.
- Jiang, Y. Empirical Likelihood Inference of Nonlinear Regression Model Parameters. Master's Thesis, Beijing University of Technology, Beijing, China, 2005.
- Ratkowski, H.Z. Nonlinear Regression Model: A Unified Practical Method; Nanjing University Press: Nanjing, China, 1986; pp. 12–25.
- Zhu, J.; Bao, Y.; Li, C. Discussion on data outlier test and processing method. Univ. Chem. 2018, 33, 58–65.
- Sun, X.; Liu, Y.; Chen, W.; Jia, Z.; Huang, B. The application of box and plot method in the outlier inspection of animal health data. China Anim. Quar. 2010, 27, 66–68.
- Yin, C.; Du, J. The collision theory reaction rate coefficient for power-law distributions. Phys. A Stat. Mech. Its Appl. 2014, 407, 119–127.
- Vershynin, R. High-Dimensional Probability (An Introduction with Applications in Data Science); Cambridge University Press: Cambridge, UK, 2018; pp. 70–97.
- Ivanov, A.V. An asymptotic expansion for the distribution of the least squares estimator of the nonlinear regression parameter. Theory Probab. Appl. 1977, 21, 557–570.
- Wu, Q. Asymptotic normality of least squares estimation in nonlinear models. J. Guilin Inst. Technol. 1998, 18, 394–400.
- Pinelis, I.; Molzon, R. Optimal-order bounds on the rate of convergence to normality in the multivariate delta method. Electron. J. Stat. 2016, 10, 1001–1063.
Publisher's Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).