Symmetry in Statistics and Data Science

A special issue of Symmetry (ISSN 2073-8994). This special issue belongs to the section "Mathematics".

Deadline for manuscript submissions: closed (1 June 2023) | Viewed by 16654

Special Issue Editor


E-Mail Website
Guest Editor
Department of Mathematics, Université de Caen, LMNO, Campus II, Science 3, 14032 Caen, France
Interests: mathematical statistics; applied statistics; data analysis; probability; applied probability; analytic inequalities
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Symmetry is a central notion in statistics and data science, appearing in various forms in artificial intelligence, data analysis, distribution theory, modeling, networks, nonparametric estimation, parametric estimation, high dimensional data, statistical tests, as well as in many other branches of modern interest.
The objective of this Special Issue is to publish highly motivated, original, and innovative research articles that use the notion of symmetry on current topics in statistics and data science.
The scope includes but is not limited to the following topics:

  • Artificial intelligence;
  • Bayes methods;  
  • Data analysis;
  • Dimension reduction and variable selection;
  • Distribution theory;
  • Econometrics;
  • Estimation;
  • Inference with high-dimensional data;
  • Inference of stochastic processes;
  • Machine learning;
  • Modelling;
  • Nonparametric function estimation;
  • Sample surveys;
  • Statistical algorithms;
  • Statistical methods for imaging;
  • Time series analysis.

Dr. Christophe Chesneau
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Symmetry is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (12 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Editorial

Jump to: Research

3 pages, 161 KiB  
Editorial
Introduction to the Special Issue in Symmetry Titled “Symmetry in Statistics and Data Science”
by Christophe Chesneau
Symmetry 2023, 15(5), 1103; https://doi.org/10.3390/sym15051103 - 18 May 2023
Viewed by 1026
Abstract
In order to introduce this Special Issue, some motivational facts are given [...] Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)

Research

Jump to: Editorial

16 pages, 364 KiB  
Article
Properties and Maximum Likelihood Estimation of the Novel Mixture of Fréchet Distribution
by Wikanda Phaphan, Ibrahim Abdullahi and Wirawan Puttamat
Symmetry 2023, 15(7), 1380; https://doi.org/10.3390/sym15071380 - 7 Jul 2023
Viewed by 1528
Abstract
In recent decades, there have been numerous endeavors to develop a novel category of survival distributions possessing enhanced flexibility through the extension of existing distributions. This article constructs and validates the statistical properties of a novel survival distribution in order to obtain an [...] Read more.
In recent decades, there have been numerous endeavors to develop a novel category of survival distributions possessing enhanced flexibility through the extension of existing distributions. This article constructs and validates the statistical properties of a novel survival distribution in order to obtain an alternative distribution that is suitable for analyzing survival data by presenting the novel mixture of the Fréchet distribution along with statistical properties such as the probability density function (PDF), cumulative distribution function (CDF), rth ordinary moment, skewness, kurtosis, moment-generating function, mean, variance, mode, survival function, hazard function, and asymptotic behavior, as well as constructing the estimators of the unknown parameter by employing the expectation-maximization (EM) algorithm, and simulated annealing. Additionally, the performance of the proposed estimators was compared with bias, mean squared errors (MSE), and simulated variances, and given an illustrative example of the proposed distribution to the survival data set in order to show that the proposed distribution is appropriate for the right-skewed data. This will be extremely advantageous in survival analysis. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

15 pages, 374 KiB  
Article
A New Detection Function Model for Distance Sampling Based on the Burr XII Model
by Ayed R. A. Alanzi, Farrukh Jamal, Muhammad H. Tahir, Christophe Chesneau, Sana Kanwal and Waqas Sami
Symmetry 2023, 15(3), 620; https://doi.org/10.3390/sym15030620 - 1 Mar 2023
Cited by 2 | Viewed by 1540
Abstract
One of the most used techniques for determining animal abundance is distance sampling. The distance sampling framework depends on the idea of a detection function, and a number of options have been suggested. In this paper, we provide a new flexible parametric model [...] Read more.
One of the most used techniques for determining animal abundance is distance sampling. The distance sampling framework depends on the idea of a detection function, and a number of options have been suggested. In this paper, we provide a new flexible parametric model based on the Burr XII distribution. To be more specific, we use the survival function of the Burr XII distribution for novel purposes in this context. The proposed model is appealing because it meets all of the requirements for a reliable detection function model, such as being monotonically decreasing and having a shoulder at the origin. It also has the features of having various asymmetric properties and a heavy right tail, which are rare properties in this setting. In the first part, we provide its key characteristics, such as shapes and moments. Then, the inferential aspect of the model is investigated. The maximum likelihood estimation method is used to estimate the parameters in a data-fitting scenario. The estimates of population abundance are derived and compared with some existing parametric estimates. A simulation is run to assess how well the resulting estimates perform in comparison to other widely applied estimates from the literature. The model is then tested using two real-world data sets. Based on the famous goodness-of-fit statistics, we show that it is preferable to some of the well-established models. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

15 pages, 324 KiB  
Article
Robust Procedure for Change-Point Estimation Using Quantile Regression Model with Asymmetric Laplace Distribution
by Fengkai Yang
Symmetry 2023, 15(2), 447; https://doi.org/10.3390/sym15020447 - 8 Feb 2023
Cited by 1 | Viewed by 1790
Abstract
The usual mean change-point detecting method based on normal linear regression is not robust to heavy-tailed data with potential outlying points. We propose a robust change-point estimation procedure based on a quantile regression model with asymmetric Laplace error distribution and develop a non-iterative [...] Read more.
The usual mean change-point detecting method based on normal linear regression is not robust to heavy-tailed data with potential outlying points. We propose a robust change-point estimation procedure based on a quantile regression model with asymmetric Laplace error distribution and develop a non-iterative sampling algorithm from a Bayesian perspective. The algorithm can generate independently and identically distributed samples approximately from the posterior distribution of the position of the change-point, which can be used for statistical inferences straightforwardly. The procedure combines the robustness of quantile regression and the computational efficiency of the non-iterative sampling algorithm. A simulation study is conducted to illustrate the performance of the procedure with satisfying findings, and finally, real data is analyzed to show the usefulness of the algorithm by comparison with the usual change-point detection method based on normal regression. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

26 pages, 1644 KiB  
Article
Analysis of Adaptive Progressive Type-II Hybrid Censored Dagum Data with Applications
by Heba S. Mohammed, Mazen Nassar, Refah Alotaibi and Ahmed Elshahhat
Symmetry 2022, 14(10), 2146; https://doi.org/10.3390/sym14102146 - 14 Oct 2022
Cited by 8 | Viewed by 1741
Abstract
In life testing and reliability studies, obtaining whole data always takes a long time and lots of monetary and human resources. In this case, the experimenters prefer to gather data using censoring schemes that make a balance between the length of the test, [...] Read more.
In life testing and reliability studies, obtaining whole data always takes a long time and lots of monetary and human resources. In this case, the experimenters prefer to gather data using censoring schemes that make a balance between the length of the test, the desired sample size, and the cost. Lately, an adaptive progressive type-II hybrid censoring scheme is suggested to enhance the efficiency of the statistical inference. By utilizing this scheme, this paper seeks to investigate classical and Bayesian estimations of the Dagum distribution. The maximum likelihood and Bayesian estimation methods are considered to estimate the distribution parameters and some reliability indices. The Bayesian estimation is developed under the assumption of independent gamma priors and by employing symmetric and asymmetric loss functions. Due to the tough form of the joint posterior distribution, the Markov chain Monte Carlo technique is implemented to gather samples from the full conditional distributions and in turn obtain the Bayes estimates. The approximate confidence intervals and the highest posterior density credible intervals are also obtained. The effectiveness of the various suggested methods is compared through a simulated study. The optimal progressive censoring plans are also shown, and number of optimality criteria are explored. To demonstrate the applicability of the suggested point and interval estimators, two real data sets are also examined. The outcomes of the simulation study and data analysis demonstrated that the proposed scheme is adaptable and very helpful in ending the experiment when the experimenter’s primary concern is the number of failures. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

20 pages, 477 KiB  
Article
Statistical Inference of Weighted Exponential Distribution under Joint Progressive Type-II Censoring
by Yinuo Qiao and Wenhao Gui
Symmetry 2022, 14(10), 2031; https://doi.org/10.3390/sym14102031 - 28 Sep 2022
Cited by 7 | Viewed by 1482
Abstract
The weighted exponential distribution is a promising skewed distribution in the life-testing experiment. The joint progressive type-II censoring (JPC) scheme is an effective approach to reducing costs. In this paper, we consider the estimates of parameters of the weighted exponential distribution with the [...] Read more.
The weighted exponential distribution is a promising skewed distribution in the life-testing experiment. The joint progressive type-II censoring (JPC) scheme is an effective approach to reducing costs. In this paper, we consider the estimates of parameters of the weighted exponential distribution with the JPC data. Two populations, whose scale parameters are the same but the shape parameters of which are different, are chosen to be studied. We first evaluate the parameters with the maximum likelihood method. Because the maximum likelihood estimates of parameters cannot be obtained in closed form, we apply the Newton–Raphson method in this part. Bayesian estimates and the corresponding credible intervals under the squared error loss function are computed by using the Markov Chain Monte Carlo method. After that, we use the bootstrap method to calculate the associated confidence intervals of the unknown parameters. A simulation has been performed to test the feasibility of the above methods and real data analysis is also provided for illustrative purposes. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

29 pages, 574 KiB  
Article
On Odd Perks-G Class of Distributions: Properties, Regression Model, Discretization, Bayesian and Non-Bayesian Estimation, and Applications
by Ibrahim Elbatal, Naif Alotaibi, Ehab M. Almetwally, Salem A. Alyami and Mohammed Elgarhy
Symmetry 2022, 14(5), 883; https://doi.org/10.3390/sym14050883 - 26 Apr 2022
Cited by 38 | Viewed by 2661
Abstract
In this paper, we present a new univariate flexible generator of distributions, namely, the odd Perks-G class. Some special models in this class are introduced. The quantile function (QFUN), ordinary and incomplete moments (MOMs), generating function (GFUN), moments of residual and reversed residual [...] Read more.
In this paper, we present a new univariate flexible generator of distributions, namely, the odd Perks-G class. Some special models in this class are introduced. The quantile function (QFUN), ordinary and incomplete moments (MOMs), generating function (GFUN), moments of residual and reversed residual lifetimes (RLT), and four different types of entropy are all structural aspects of the proposed family that hold for any baseline model. Maximum likelihood (ML) and maximum product spacing (MPS) estimates of the model parameters are given. Bayesian estimates of the model parameters are obtained. We also present a novel log-location-scale regression model based on the odd Perks–Weibull distribution. Due to the significance of the odd Perks-G family and the survival discretization method, both are used to introduce the discrete odd Perks-G family, a novel discrete distribution class. Real-world data sets are used to emphasize the importance and applicability of the proposed models. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

25 pages, 702 KiB  
Article
Sec-G Class of Distributions: Properties and Applications
by Luciano Souza, Wilson Rosa de Oliveira, Cícero Carlos Ramos de Brito, Christophe Chesneau, Renan Fernandes and Tiago A. E. Ferreira
Symmetry 2022, 14(2), 299; https://doi.org/10.3390/sym14020299 - 1 Feb 2022
Cited by 19 | Viewed by 2357
Abstract
Although there are many continuous distributions in the literature, only a handful take advantage of the modeling power provided by trigonometric functions. To our knowledge, none of them are based on the so-called secant function, defined as the reciprocal of the cosine function. [...] Read more.
Although there are many continuous distributions in the literature, only a handful take advantage of the modeling power provided by trigonometric functions. To our knowledge, none of them are based on the so-called secant function, defined as the reciprocal of the cosine function. The secant function can go to large values whenever the cosine function goes to small values. The idea is to profit from this trigonometric property to modify well-known distribution tails and overall skewness features. With this in mind, in this paper, a new class of trigonometric distributions based on the secant function is introduced. It is called the Sec-G class. We discuss the main mathematical characteristics of this class, including series expansions of the corresponding cumulative distribution and probability density functions, as well as several probabilistic measures and functions. In particular, we present the moments, skewness, kurtosis, Lorenz, and Bonferroni curves, reliability coefficient, entropy measure, and order statistics. Throughout the study, emphasis is placed on the unique four-parameter continuous distribution of this class, defined with the Kumaraswamy-Weibull distribution as the baseline. The estimation of the model parameters is performed using the maximum likelihood method. We also carried out a numerical simulation study and present the results in graphic form. Three referenced datasets were analyzed, and it is proved that the proposed secant Kumaraswamy-Weibull model outperforms important competitors, including the Kumaraswamy-Weibull, Kumaraswamy-Weibull geometric, Kumaraswamy-Weibull Poisson, Kumaraswamy Burr XII, and Weibull models. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

20 pages, 12543 KiB  
Article
Spatial-Temporal 3D Residual Correlation Network for Urban Traffic Status Prediction
by Yin-Xin Bao, Quan Shi, Qin-Qin Shen and Yang Cao
Symmetry 2022, 14(1), 33; https://doi.org/10.3390/sym14010033 - 28 Dec 2021
Cited by 6 | Viewed by 2137
Abstract
Accurate traffic status prediction is of great importance to improve the security and reliability of the intelligent transportation system. However, urban traffic status prediction is a very challenging task due to the tight symmetry among the Human–Vehicle–Environment (HVE). The recently proposed spatial–temporal 3D [...] Read more.
Accurate traffic status prediction is of great importance to improve the security and reliability of the intelligent transportation system. However, urban traffic status prediction is a very challenging task due to the tight symmetry among the Human–Vehicle–Environment (HVE). The recently proposed spatial–temporal 3D convolutional neural network (ST-3DNet) effectively extracts both spatial and temporal characteristics in HVE, but ignores the essential long-term temporal characteristics and the symmetry of historical data. Therefore, a novel spatial–temporal 3D residual correlation network (ST-3DRCN) is proposed for urban traffic status prediction in this paper. The ST-3DRCN firstly introduces the Pearson correlation coefficient method to extract a high correlation between traffic data. Then, a dynamic spatial feature extraction component is constructed by using 3D convolution combined with residual units to capture dynamic spatial features. After that, based on the idea of long short-term memory (LSTM), a novel architectural unit is proposed to extract dynamic temporal features. Finally, the spatial and temporal features are fused to obtain the final prediction results. Experiments have been performed using two datasets from Chengdu, China (TaxiCD) and California, USA (PEMS-BAY). Taking the root mean square error (RMSE) as the evaluation index, the prediction accuracy of ST-3DRCN on TaxiCD dataset is 21.4%, 21.3%, 11.7%, 10.8%, 4.7%, 3.6% and 2.3% higher than LSTM, convolutional neural network (CNN), 3D-CNN, spatial–temporal residual network (ST-ResNet), spatial–temporal graph convolutional network (ST-GCN), dynamic global-local spatial–temporal network (DGLSTNet), and ST-3DNet, respectively. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

13 pages, 1209 KiB  
Article
Estimating the Variance of Estimator of the Latent Factor Linear Mixed Model Using Supplemented Expectation-Maximization Algorithm
by Yenni Angraini, Khairil Anwar Notodiputro, Henk Folmer, Asep Saefuddin and Toni Toharudin
Symmetry 2021, 13(7), 1286; https://doi.org/10.3390/sym13071286 - 17 Jul 2021
Cited by 1 | Viewed by 1841
Abstract
This paper deals with symmetrical data that can be modelled based on Gaussian distribution, such as linear mixed models for longitudinal data. The latent factor linear mixed model (LFLMM) is a method generally used for analysing changes in high-dimensional longitudinal data. It is [...] Read more.
This paper deals with symmetrical data that can be modelled based on Gaussian distribution, such as linear mixed models for longitudinal data. The latent factor linear mixed model (LFLMM) is a method generally used for analysing changes in high-dimensional longitudinal data. It is usual that the model estimates are based on the expectation-maximization (EM) algorithm, but unfortunately, the algorithm does not produce the standard errors of the regression coefficients, which then hampers testing procedures. To fill in the gap, the Supplemented EM (SEM) algorithm for the case of fixed variables is proposed in this paper. The computational aspects of the SEM algorithm have been investigated by means of simulation. We also calculate the variance matrix of beta using the second moment as a benchmark to compare with the asymptotic variance matrix of beta of SEM. Both the second moment and SEM produce symmetrical results, the variance estimates of beta are getting smaller when number of subjects in the simulation increases. In addition, the practical usefulness of this work was illustrated using real data on political attitudes and behaviour in Flanders-Belgium. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

20 pages, 1139 KiB  
Article
Bayesian Reference Analysis for the Generalized Normal Linear Regression Model
by Vera Lucia Damasceno Tomazella, Sandra Rêgo Jesus, Amanda Buosi Gazon, Francisco Louzada, Saralees Nadarajah, Diego Carvalho Nascimento, Francisco Aparecido Rodrigues and Pedro Luiz Ramos
Symmetry 2021, 13(5), 856; https://doi.org/10.3390/sym13050856 - 12 May 2021
Cited by 3 | Viewed by 2975
Abstract
This article proposes the use of the Bayesian reference analysis to estimate the parameters of the generalized normal linear regression model. It is shown that the reference prior led to a proper posterior distribution, while the Jeffreys prior returned an improper one. The [...] Read more.
This article proposes the use of the Bayesian reference analysis to estimate the parameters of the generalized normal linear regression model. It is shown that the reference prior led to a proper posterior distribution, while the Jeffreys prior returned an improper one. The inferential purposes were obtained via Markov Chain Monte Carlo (MCMC). Furthermore, diagnostic techniques based on the Kullback–Leibler divergence were used. The proposed method was illustrated using artificial data and real data on the height and diameter of Eucalyptus clones from Brazil. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

17 pages, 389 KiB  
Article
Computing Expectiles Using k-Nearest Neighbours Approach
by Muhammad Farooq, Sehrish Sarfraz, Christophe Chesneau, Mahmood Ul Hassan, Muhammad Ali Raza, Rehan Ahmad Khan Sherwani and Farrukh Jamal
Symmetry 2021, 13(4), 645; https://doi.org/10.3390/sym13040645 - 11 Apr 2021
Cited by 4 | Viewed by 2424
Abstract
Expectiles have gained considerable attention in recent years due to wide applications in many areas. In this study, the k-nearest neighbours approach, together with the asymmetric least squares loss function, called ex-kNN, is proposed for computing expectiles. Firstly, [...] Read more.
Expectiles have gained considerable attention in recent years due to wide applications in many areas. In this study, the k-nearest neighbours approach, together with the asymmetric least squares loss function, called ex-kNN, is proposed for computing expectiles. Firstly, the effect of various distance measures on ex-kNN in terms of test error and computational time is evaluated. It is found that Canberra, Lorentzian, and Soergel distance measures lead to minimum test error, whereas Euclidean, Canberra, and Average of (L1,L) lead to a low computational cost. Secondly, the performance of ex-kNN is compared with existing packages er-boost and ex-svm for computing expectiles that are based on nine real life examples. Depending on the nature of data, the ex-kNN showed two to 10 times better performance than er-boost and comparable performance with ex-svm regarding test error. Computationally, the ex-kNN is found two to five times faster than ex-svm and much faster than er-boost, particularly, in the case of high dimensional data. Full article
(This article belongs to the Special Issue Symmetry in Statistics and Data Science)
Show Figures

Figure 1

Back to TopTop