Next Article in Journal
Solving Inverse Conductivity Problems in Doubly Connected Domains by the Homogenization Functions of Two Parameters
Next Article in Special Issue
Convergence Behavior of Optimal Cut-Off Points Derived from Receiver Operating Characteristics Curve Analysis: A Simulation Study
Previous Article in Journal
A New Parameter-Uniform Discretization of Semilinear Singularly Perturbed Problems
Previous Article in Special Issue
Short- and Medium-Term Power Demand Forecasting with Multiple Factors Based on Multi-Model Fusion
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Parametric Frailty Analysis in Presence of Collinearity: An Application to Assessment of Infant Mortality

1
Department of Statistics, Faculty of Science, University of Tabuk, Tabuk 47512, Saudi Arabia
2
Department of Mathematics, Sharda University, Greater Noida 201310, India
3
Department of Statistics, Institute of Science, Banaras Hindu University, Varanasi 221005, India
4
Department of Mathematics, College of Science and Arts in Gurayat, Jouf University, Gurayat 77454, Saudi Arabia
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(13), 2255; https://doi.org/10.3390/math10132255
Submission received: 17 May 2022 / Revised: 16 June 2022 / Accepted: 19 June 2022 / Published: 27 June 2022
(This article belongs to the Special Issue Computational Statistics and Data Analysis)

Abstract

:
This paper analyzes the time to event data in the presence of collinearity. To address collinearity, the ridge regression estimator was applied in multiple and logistic regression as an alternative to the maximum likelihood estimator (MLE), among others. It has a smaller mean square error (MSE) and is therefore more precise. This paper generalizes the approach to address collinearity in the frailty model, which is a random effect model for the time variable. A simulation study is conducted to evaluate its performance. Furthermore, the proposed method is applied on real life data taken from the largest sample survey of India, i.e., national family health survey (2005–2006 ) data to evaluate the association of different determinants on infant mortality in India.

1. Introduction

In a survival analysis, the frailty model is a popular approach to model time to event data, which considers both observable and unobservable factors simultaneously [1]. A large number of studies have demonstrated that the consideration of unobservable variability improves the fitness of the model [2,3,4,5]. In the frailty model there is an assumption on the explanatory variables that the variable are independent to each other, while various practical situations do not satisfy this assumption. Several studies of infant survival considered factors such as place of residence, birth order, breastfeeding, mother education, tetanus toxoid, antenatal care, iron folic acid tablets, place of delivery, etc. [6,7,8,9]. Some of the variables, for example tetanus toxoid and antenatal care, are correlated to each other in the data set, which is analyzed in this study. The fact that parameter estimates become unstable when the explanatory variables are correlated in the case of the frailty model, means that the collinearity problem has seldom been considered.
To overcome the problem of collinearity ridge estimation is the alternative of the maximum likelihood estimation. Ridge estimator has smaller MSE than MLE. In the case of high multicollinearity, there is a considerable reduction in MSE. Ridge regression was originally proposed by Hoerl and Kennard (1970a,b) [10,11], and later on Schafer et al. (1984) [12] generalized this approach to logistic regression. A slightly different approach in the connection of the ridge type estimator is given by Daffy and Santner (1989) [13]. This approach is proposed in a standard linear regression, while Cessie and Houwelingen (1992) [14] generalized the same approach in logistic regression. However, this method is not adapted for frailty models.
This study developed a ridge regression estimator in the parametric frailty model and evaluated its performance through a simulation study. Further, the proposed approach is applied to examine the risk of infant mortality in India.

2. Ridge Estimator for Parametric Frailty Model

In this section, the approach of Cessie and Houwelingen (1992) [14] is extended to the parametric frailty model. The ridge estimator is derived as a restricted maximum likelihood estimator.
Frailty model is a mixture model in the survival analysis, where the risk of the individual death (hazard function) is a function of observable factors and a random effect term (unobserved frailty) [1,15]. The form of such a model is given as
λ ( τ i x i , α i ) = α i λ 0 ( τ i ) e x p ( β x i )
where λ ( τ i x i , α i ) is the hazard of i-th individual at time τ i , i = 1 , 2 , , n with covariate vector x i = ( x i 1 , x i 2 , , x i q ) and unobserved frailty variable α i . λ 0 ( τ i ) is the base line hazard function and β = ( β 1 , β 2 , , β q ) is the vector of regression coefficients. Here, q represent the number of regressors. It is assumed that the unobserved frailty follow a gamma distribution and is captured by its variance σ 2 [16,17]. This paper also assumes that the unobserved frailty follows gamma distribution. One assumption is also added on the response variable that it follows Weibull distribution with one parameter θ . On the basis of this assumption, λ 0 ( τ i ) = θ τ i θ 1 , θ > 0 . The likelihood function of the frailty model (1) is given by,
L ( β ) = Π i = 1 n ( α i λ 0 ( τ i ) e β x i ) δ i e α i γ 0 ( τ i ) e β x i
Using the assumption of the distribution (i.e., gamma distribution) of frailty, the random terms are resulted out, making the likelihood function,
L ( β ) = i = 1 n ( λ 0 ( τ i ) e β x i 1 + σ 2 γ 0 ( τ i ) e β x i ) δ i ( 1 + σ 2 γ 0 ( τ i ) e β x i ) 1 / σ 2
The log likelihood form of (3) is
l ( β ) = i = 1 n [ δ i [ l o g ( λ 0 ( τ i ) ) + β x i l o g ( 1 + σ 2 γ 0 ( τ i ) e β x i ) ] ( 1 / σ 2 ) l o g ( 1 + σ 2 γ 0 ( τ i ) e β x i ) ]
The maximization of (4) gives the ordinary MLE β ^ for β . Since (4) is non linear in β , m-th, the approximation is obtained by using the Newton–Raphson method, i.e.,
β ^ ( m ) = β ^ ( m 1 ) + F ( β ^ ( m 1 ) ) 1 U ( β ^ ( m 1 ) )
where U ( β ^ ( m 1 ) ) is the derivative of l ( β ) in respect to β using β ^ ( m 1 ) ,
U ( β ^ ( m 1 ) ) = i = 1 n [ δ i x i { ( 1 + δ i σ 2 ) γ 0 ( τ i ) x i e β ( m 1 ) x i } / { 1 + σ 2 γ 0 ( τ i ) e β ( m 1 ) x i } ]
and F ( β ^ ( m 1 ) ) is the negative of the matrix of the second derivative of l ( β ) in respect to β using β ^ ( m 1 ) , represented as
F ( β ^ ( m 1 ) ) = X V X
where, V is the n x n diagonal matrix, whose ( i , i ) -th element is v i i = τ i θ e β ^ ( m 1 ) x i ( 1 + δ i σ 2 ) / ( 1 + σ 2 τ i θ e β ^ ( m 1 ) x i ) . The estimation in the frailty model is based on the iteration and continues until the small changes, which are decided as β ( m ) β ( m 1 ) 10 5 . The Duffy and Santner (1989) [13] procedure of the ridge estimation is utilized to reduce the problem of collinearity. On the basis of this procedure, the log likelihood function is modified with the addition of a penalty on β .
l k ( β ) = l ( β ) k β 2
where l k ( β ) and l ( β ) are the restricted and unrestricted log likelihood functions, respectively. Here, β = ( j = 1 q β j 2 ) 1 / 2 is the norm of the parameter vector β . The maximization of (8) yields β ^ k . The ridge parameter k controls the amount of shrinkage of the norm of β . When k = 0 , the solution will remain the same as the ordinary MLE; however, if k , then all β s 0 .
Collinearity among the covariates induces the unstable parameter estimates. Shrinking the β s towards zero and allowing a little bias will stabilize the system and give estimates with smaller variance. Therefore, for a good choice of k, the restricted estimate ( β ^ k ) is expected to be, on average, closer to true value of β than unrestricted MLE. The rule of selecting k is not fixed but the most common choice is to make a little increment in MSE. In the case of the logistic and Cox regression model, k is chosen as “ 1 / β ^ β ^ , q / β ^ β ^ and ( q + 1 ) / β ^ β ^ [12,18,19]. These three choices were investigated for parametric frailty model in our simulation studies.
Such as the unrestricted MLE, β ^ k is obtained by the Newton–Raphson method. The m-th order approximation for β ^ k is
β ^ k ( m ) = β ^ k ( m 1 ) + F k ( β ^ k ( m 1 ) ) 1 U k ( β ^ k ( m 1 ) )
where U k ( β ^ k ( m 1 ) ) is the derivative of the restricted log likelihood function (8) and F k ( β ^ k ( m 1 ) ) is the negative of the matrix of the second derivative,
U k ( β ^ k ( m 1 ) ) = U ( β ^ k ( m 1 ) ) 2 k β ^ k ( m 1 )
F k ( β ^ k ( m 1 ) ) = F ( β ^ k ( m 1 ) ) + 2 k I
where I is a q × q identity matrix. Under certain regularity conditions, β ^ is asymptotically unbiased, i.e., the E ( β ^ ) = β with th covariance matrix ( X V X ) 1 [14,20]. The asymptotic bias of β ^ k becomes
E ( β ^ k β ) = 2 k ( X V X + 2 k I ) 1 β
and the asymptotic variance of β ^ k is
( X V X + 2 k I ) 1 X V X ( X V X + 2 k I ) 1
Similarly, asymptotic MSE can be obtained by taking the summation on the asymptotic variance and square of asymptotic bias.

3. Simulation Study

To evaluate the performance of estimators of β , they are compared in terms of their MSE. In this paper, the survival function is taken as
S ( τ i ) = e x p ( γ 0 ( τ i ) α i e β x i )
Then, the cumulative distribution function is
F ( τ i ) = 1 e x p ( γ 0 ( τ i ) α i e β x i )
The cumulative distribution function follows a uniform distribution with a range between 0 and 1. If M U ( 0 , 1 ) , then 1 M U ( 0 , 1 ) . Therefore, the survival function also follows a uniform distribution with the range [0–1].
M = e x p ( γ 0 ( τ i ) α i e β x i ) U ( 0 , 1 )
and
τ = γ 0 1 ( l o g ( M ) / α i e β x i )
In this study, it is assumed that the baseline distribution is the Weibull with one parameter θ = 0.5 , then γ 0 1 ( ) = t 1 / θ , ; presently, the failure time is
τ = ( l o g ( M ) / α i e β x i ) 1 / θ
Using the expression (10), failure times are generated and the sample sizes are set as, 50, 200, and 2000. The frailty α i is generated from the gamma distribution with scale and shape parameters 2 and 0.5, respectively, and M is generated randomly from U ( 0 , 1 ) . This paper considered two covariates in the model and generated by the normal distribution, where the mean is taken as zero and the variance is one. The parameters are taken as 0.25 and 0.691. Pair wise correlation is taken in four categories, namely, small ( ρ = 0.4 ) , moderate ( ρ = 0.6 ) , high ( ρ = 0.8 ) and very high ( ρ = 0.95 ) . It is assumed that the censoring times have a uniform distribution with a range from 0 to 4.5, which allows for censoring rates of approximately 39%. A total of 100 data sets were generated under each model and applied the parametric gamma frailty for each data set. The averaged MSE over q covariates for both ML and ridge estimators are computed and represented in Table 1.
The findings of Table 1, Table 2 and Table 3 demonstrate that the ridge estimators have a smaller MSE than ML. Among the three categories of k, the ridge estimator performed better for k = ( q + 1 ) / β ^ β ^ . In the case of moderate correlation ( ρ = 0.6 ), if k changes from 1 / β ^ β ^ to ( q + 1 ) / β ^ β ^ then the percentage reduction will change from 37% to 66%. It is revealed from the Table 1, Table 2 and Table 3 that if ρ increases then the percentage reduction in MSE also increases. Figure 1 shows the graphical representation of MSE with respect to the deferent values of ρ .

4. Real Life Application

The proposed ridge regression estimator for the parametric frailty model is applied to a data set from the National Family Health Survey IIIrd Phase (NFHS III). The data set has 11,581 infants, out of which 10,448 infants are right censored. The response variable is the time of survival (in months) from the birth of the infant to the death of the infant. In this paper, the frailty model included 12 covariates of infant mortality, i.e., place of residence, breastfeeding, sex of infant, birth order, place of delivery, marital status of parents, tetanus toxoid (TT), antenatal care (ANC), age of mother, mother education, iron folic acid (IFA) and family size corresponding to each infant. Out of these covariates, TT and ANC are moderately correlated to each other and the correlation value is 0.6.
In some situations, the correlation cannot be considered as the standard measure of collinearity. The condition number is an alternative to measuring the collinearity. This study followed the Ozkale (2021) [21] approach to measure the condition number. On the basis of this approach, the condition number is calculated for the matrix [ ( X * ) ( X * ) ] , where X * = V ( 1 / 2 ) X . The value of the condition number is 36.51, which is greater than 10. Therefore, collinearity is present in the data set [22,23].
Table 4 shows the distribution of births and deaths with different covariates in India 2005–2006. About 68% deaths occurred, where infants were not provided with breastfeeding. Approximately 53% infants were delivered at home, and the mothers were not educated in the same proportion. About 46% of births occurred when the size of the family was large, i.e., more than six members in the household. A total of 15%, 18% and 32% of mothers had not taken tetanus toxoid, antenatal care and IFA, respectively, during their pregnancy.
A frailty model is implemented on the data set and the results related to maximum likelihood and ridge estimates are summarized in Table 5. A hazard ratio (exp( β ^ )) less than one indicates a decreased death risk, while a greater than unity indicates an increased risk, and the interpretation of the hazard ratio is either the number of times the risk changes or the percentage change in the death risk.
Some significant results of the ridge regression are given as: Infants who were breastfeeding have a 99% lower death risk as compared to those who were not breastfeeding. Old married couples (more than nine years of marriage) have a 1.98 times higher risk of infant death as compared to newly married couples (0 to 4 years of marriage). If the age of the mother is more than 34 years, then the risk of infant death is increased by 1.5 times as compared to the infant of the reference category (mothers aged 18 to 24 years). The uneducated mother has a 9.82 times higher risk of infant death as compared to the highly educated mother. The risk of infant death is twice if the size of family is small (1 to 3 members in the household) and this risk is 28% lower for a large family (more than 6 members in the household) as compared to the reference category (4 to 6 members in the household). The estimated value of the σ 2 is significantly non zero ( σ 2 = 2.727 ), which means the unobserved variables have an impact on the survival times of the infants.
The results related to MSE corresponding to ridge estimators and the MLE are given in Table 6, which shows that the MSE is the lesser for the ridge estimator as compared to the MLE, and the benefits are more when the ridge parameter is ( q + 1 ) / β ^ β ^ .

5. Discussion and Conclusions

This paper generalized the approach of ridge regression in the frailty model. The frailty model deals with the time to event data and considered both factors (i.e., measurable and unmeasurable) simultaneously. Here, the parametric form of the frailty model is utilized. A simulation study is conducted and explained that the ridge estimator is more accurate and precise, as compared to MLE. The benefits are higher if the ridge parameter k is taken as ( q + 1 ) / β ^ β ^ .
The method is then applied to assess the risk of infant death in India. Data are taken from the third phase of NFHS and revealed that the proposed method produced more precise estimates as compared to MLE. Ridge estimates demonstrate that breastfeeding and mothers’ education are highly associated with infant mortality. The findings of this study demonstrates that maternal education is positively associated with infant mortality, i.e., the increment in the level of education reflects the decrement in the infant mortality; these results are significant. There is a high risk of infant death if the mother’s age is less than 18 years, and the reason for this finding may be considered as the complications of delivery, premature birth, pregnancy and some other related issues of teenagers. A bigger family size has a positive impact on infant death.

6. Future Scope

This study utilized the ridge as an alternative of MLE in the presence of collinearity, while other alternatives such as lasso, elastic net, etc., are also available in the existence of collinerity. In future, these alternatives can also be applied in the frailty model and can be compared with the existing one.

Author Contributions

Data curation, O.A.; Formal analysis, P.K.R.; Methodology, A.S.; Software, A.R.A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wienke, A. Frailty Models in Survival Analysis; Chapman & Hall: London, UK, 2011; pp. 1–320. [Google Scholar]
  2. Beard, R.E. Note on some mathematical mortality models. In The Lifespan of Animals; Wolstenholme, G.E.W., Conner, M.O., Eds.; Ciba Foundation Colloquium on Ageing, Little, Brown: Boston, MA, USA, 1959; pp. 302–311. [Google Scholar]
  3. Blossfeld, H.P.; Hamerle, A. Unobserved heterogeneity in event history models. Qual. Quant. 1992, 26, 157–168. [Google Scholar] [CrossRef]
  4. Heckman, J.; Hotz, J.; Walker, J. New evidence on the timing and spacing of births. Am. Econ. Rev. Pap. Proc. 1985, 75, 179–184. [Google Scholar]
  5. Sastry, N. A nested frailty model for survival data, with an application to the study of child survival in Northeast Brazil. J. Am. Stat. Assoc. 1997, 92, 426–435. [Google Scholar] [CrossRef]
  6. Magadi, M.A.; Diamond, I.; Rodrigues, R.N. The determinants of delivery care in Kenya. Soc. Biol. 2001, 47, 164–188. [Google Scholar] [CrossRef]
  7. Omariba, D.; Beaujot, R.; Rajulton, F. Determinants of infant and child mortality in Kenya: An analysis controlling for frailty effects. Popul. Res. Policy Rev. 2007, 26, 299–321. [Google Scholar] [CrossRef]
  8. Pebley, A.R.; Goldman, N.; Rodriguez, G. Prenatal and delivery care and childhood immunization in Guatemala: Do family and community matter? Demography 1996, 33, 231–247. [Google Scholar] [CrossRef]
  9. Singh, K.K.; Pandey, N.; Gautam, A. Effect of Breastfeeding and Maternal Health Care programme on infant mortality. Demogr. India 2007, 36, 253–266. [Google Scholar]
  10. Hoerl, A.E.; Kennard, R.W. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 1970, 12, 55–67. [Google Scholar] [CrossRef]
  11. Hoerl, A.E.; Kennard, R.W. Ridge regression: Some simulations. Commun. Stat. 1970, 4, 1105–1123. [Google Scholar] [CrossRef]
  12. Schafer, R.L.; Roi, L.D.; Wolfe, R.A. A ridge logistic estimator. Commun. Stat. Theory Methods 1984, 13, 99–113. [Google Scholar] [CrossRef]
  13. Duffy, D.E.; Santner, T.J. On the small sample properties of norm-restricted maximum likelihood estimators for logistic regression models. Commun. Stat. Theory Methods 1989, 18, 959–980. [Google Scholar] [CrossRef]
  14. Cessie, S.L.; Houwelingen, J.C.V. Ridge estimators in logistic regression. Appl. Stat. 1992, 41, 191–201. [Google Scholar] [CrossRef]
  15. Vaupel, J.; Manton, K.; Stallard, E. Impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 1979, 16, 439–454. [Google Scholar] [CrossRef] [PubMed]
  16. Oakes, D. A model for association in bivariate survival data. J. R. Stat. Soc. (B) 1982, 44, 414–422. [Google Scholar] [CrossRef]
  17. Sastry, N. Community characteristics, individual and household attributes, and child survival in Brazil. Demography 1996, 33, 211–229. [Google Scholar] [CrossRef] [PubMed]
  18. Smith, K.R.; Slattery, M.L.; French, T.K. Collinear nutrients and the risk of colon cancer. J. Clin. Epidemiol. 1991, 44, 715–723. [Google Scholar] [CrossRef]
  19. Xue, X.; Kim, M.Y.; Roy, E.S. Cox regression analysis in presence of collinearity: An application to assessment of health risks associated with occupational radiation exposure. Lifetime Data Anal. 2007, 12, 333–350. [Google Scholar] [CrossRef] [PubMed]
  20. Cox, D.R.; Hinkley, D.V. Theoretical Statistics; Chapman and Hall: London, UK, 1974. [Google Scholar]
  21. Ozkale, M.R. The red indicator and corrected VIFs in generalizedlinear models. Commun. Stat. Simul. Comput. 2021, 50, 4144–4170. [Google Scholar] [CrossRef]
  22. Mackinnon, M.J.; Puterman, A.L. Collinearity in generalized linear models. Commun. Stat. Theory Methods 1989, 18, 3463–3472. [Google Scholar] [CrossRef]
  23. Weissfeld, L.A.; Sereika, S.M. A multicollinearity diagnostic for generalized linear modelsy. Commun. Stat. Theory Methods 1991, 20, 1183–1198. [Google Scholar] [CrossRef]
Figure 1. Comparison of MLE and ridge estimators of β in simulation study. Here, k1, k2 and k3 are ridge parameters such that; k1 = 1 / β ^ β ^ , k2 = q / β ^ β ^ and k3 = ( q + 1 ) / β ^ β ^ .
Figure 1. Comparison of MLE and ridge estimators of β in simulation study. Here, k1, k2 and k3 are ridge parameters such that; k1 = 1 / β ^ β ^ , k2 = q / β ^ β ^ and k3 = ( q + 1 ) / β ^ β ^ .
Mathematics 10 02255 g001
Table 1. Comparison of MSE between the MLE and ridge estimators in simulation; n = 50.
Table 1. Comparison of MSE between the MLE and ridge estimators in simulation; n = 50.
ρ MethodMSE ( β )% Reduction of MSE ( β )
0.4MLE0.0801
Ridge estimator for k = 1 / β ^ β ^ 0.057927.72
Ridge estimator for k = q / β ^ β ^ 0.044444.57
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.035655.56
0.6MLE0.1056
Ridge estimator for k = 1 / β ^ β ^ 0.065837.69
Ridge estimator for k = q / β ^ β ^ 0.046555.97
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.035566.38
0.8MLE0.1871
Ridge estimator for k = 1 / β ^ β ^ 0.077255.74
Ridge estimator for k = q / β ^ β ^ 0.045775.57
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.031982.95
0.95MLE0.6908
Ridge estimator for k = 1 / β ^ β ^ 0.065490.55
Ridge estimator for k = q / β ^ β ^ 0.030495.60
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.020697.02
Table 2. Comparison of MSE between the MLE and ridge estimators in simulation; n = 200.
Table 2. Comparison of MSE between the MLE and ridge estimators in simulation; n = 200.
ρ MethodMSE ( β )% Reduction of MSE ( β )
0.4MLE0.0197
Ridge estimator for k = 1 / β ^ β ^ 0.01818.12
Ridge estimator for k = q / β ^ β ^ 0.016715.23
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.015521.32
0.6MLE0.0259
Ridge estimator for k = 1 / β ^ β ^ 0.022712.35
Ridge estimator for k = q / β ^ β ^ 0.020222
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.018130.11
0.8MLE0.0459
Ridge estimator for k = 1 / β ^ β ^ 0.035023.74
Ridge estimator for k = q / β ^ β ^ 0.027939.22
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.023049.89
0.95MLE0.1693
Ridge estimator for k = 1 / β ^ β ^ 0.064362.02
Ridge estimator for k = q / β ^ β ^ 0.035079.33
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.022786.59
Table 3. Comparison of MSE between the MLE and ridge estimators in simulation; n = 2000.
Table 3. Comparison of MSE between the MLE and ridge estimators in simulation; n = 2000.
ρ MethodMSE ( β )% Reduction of MSE ( β )
0.4MLE0.00196
Ridge estimator for k = 1 / β ^ β ^ 0.001941.02
Ridge estimator for k = q / β ^ β ^ 0.001931.5
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.001922.04
0.6MLE0.00257
Ridge estimator for k = 1 / β ^ β ^ 0.002541.16
Ridge estimator for k = q / β ^ β ^ 0.002512.33
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.002473.89
0.8MLE0.00458
Ridge estimator for k = 1 / β ^ β ^ 0.004443.05
Ridge estimator for k = q / β ^ β ^ 0.004325.60
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.004218.07
0.95MLE0.01687
Ridge estimator for k = 1 / β ^ β ^ 0.014921.15
Ridge estimator for k = q / β ^ β ^ 0.0133320.98
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 0.0120040.58
Table 4. Descriptive statistics of NFHS data.
Table 4. Descriptive statistics of NFHS data.
VariablesBirthDeath
NumberPercentageNumberPercentage
Place of residence
Rural734863.477668.49
Urban423336.635731.51
Breastfeeding
No9258.076667.61
Yes10,65692.036732.39
Sex
Female564448.752346.16
Male593751.361053.84
Birth order
1380932.931527.80
2325928.126823.65
3179915.515813.95
> = 4271423.439234.60
Place of delivery
Home611052.869961.69
Govt. hospital280224.220418.01
Prvt. Hospital264822.922419.77
Other210.260.53
Marital status
Unmarried110.110.09
0–4 Years496442.932428.60
5–9 Years385033.235731.51
> = 10 Years275623.845139.81
ANC
No216518.731127.45
1–2268723.230326.74
> = 3672958.151945.81
Age of mother
< 18 Years7396.4766.71
18–24 Years507543.839034.42
25–29 Years345329.831627.89
30–34 Years153713.320618.18
> = 35 Years7776.714512.80
Mother education
No45123959852.78
Primary168714.619617.30
Secondary443638.330927.27
Higher9468.2302.65
IFA
No370832.048442.72
Yes787368.064957.28
Family size
1–310719.226022.95
4–6521445.051445.37
> = 7529645.735931.69
Table 5. Estimation in frailty model.
Table 5. Estimation in frailty model.
CategoryMLERidge Estimator
exp ( β ^ ) S.E. 1 ( β ^ ) exp ( β ^ k ) S.E. 1 ( β ^ k )
Place of residence
Urban a ----
Rural1.1560.1041.1620.103
Breastfeeding
No a ----
Yes8.28 × 10 5 *0.0938.49 × 10 5 *0.092
Sex
Male a ----
Female1.0440.0891.0470.089
Birth order
11.1760.1761.2090.171
21.2430.1501.2650.147
31.0870.1541.0990.151
> = 4 a ----
Place of delivery
Home1.1490.1341.1620.133
Govt. hospital0.9350.1450.9470.143
Prvt. Hospital a ----
Other1.3500.6881.3360.614
Marital status
Unmarried0.5071.6520.6740.985
0–4 Years a ----
5–9 Years1.546 *0.1371.568 *0.134
> = 10 Years1.939 *0.1921.988 *0.187
TT
No1.2460.1531.2530.151
11.0770.1621.0790.160
> = 2 a ----
ANC
No0.9870.1610.9950.159
1–21.1620.1181.1690.117
> = 3 a ----
Age of mother
<18 Years1.4590.1931.461 *0.191
18–24 Years a ----
25–29 Years1.316 *0.1251.317 *0.123
30–34 Years1.723 *0.1701.717 *0.167
> = 35 Years1.545 *0.2061.543 *0.201
Mother education
No10.592 *0.2329.821 *0.221
Primary10.430 *0.2429.639 *0.231
Secondary7.560 *0.2237.036 *0.213
Higher a ----
IFA
No a ----
Yes1.0520.1121.0630.111
Family size
1–32.175 *0.1382.175 *0.137
4–6 a ----
> = 70.715 *0.0990.720 *0.098
a Reference category; * statistical significance at 5% level of significance; 1 S.E.: Standard Error.
Table 6. Comparison of MSE in NFHS data.
Table 6. Comparison of MSE in NFHS data.
MethodMSE ( β )Percentage Reduction of MSE ( β )
MLE3.819
Ridge estimator for k = 1 / β ^ β ^ 3.5536.97
Ridge estimator for k = q / β ^ β ^ 2.11044.75
Ridge estimator for k = ( q + 1 ) / β ^ β ^ 2.04846.37
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Albalawi, O.; Sirohi, A.; Rai, P.K.; Alanzi, A.R.A. Parametric Frailty Analysis in Presence of Collinearity: An Application to Assessment of Infant Mortality. Mathematics 2022, 10, 2255. https://doi.org/10.3390/math10132255

AMA Style

Albalawi O, Sirohi A, Rai PK, Alanzi ARA. Parametric Frailty Analysis in Presence of Collinearity: An Application to Assessment of Infant Mortality. Mathematics. 2022; 10(13):2255. https://doi.org/10.3390/math10132255

Chicago/Turabian Style

Albalawi, Olayan, Anu Sirohi, Piyush Kant Rai, and Ayed R. A. Alanzi. 2022. "Parametric Frailty Analysis in Presence of Collinearity: An Application to Assessment of Infant Mortality" Mathematics 10, no. 13: 2255. https://doi.org/10.3390/math10132255

APA Style

Albalawi, O., Sirohi, A., Rai, P. K., & Alanzi, A. R. A. (2022). Parametric Frailty Analysis in Presence of Collinearity: An Application to Assessment of Infant Mortality. Mathematics, 10(13), 2255. https://doi.org/10.3390/math10132255

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop