1. Introduction
The widespread adoption of electric vehicles (EVs) has emerged as a prominent trend within the automotive industry and is primarily propelled by the commitment of numerous nations to achieve net-zero emissions by 2050 without increasing atmospheric carbon concentrations [
1]. This global shift towards EVs has been accompanied by escalating sales figures and more stringent governmental directives, driving an increase in battery production capacity on a global scale. Batteries constitute up to 30% of an EV’s weight [
2] and contribute to approximately 30% of the overall cost [
3]. Consequently, optimizing battery manufacturing quality to align with economic objectives, such as reduced reduction, increased yield, and minimized scrap rates, becomes critical. These have a substantial influence on the overall impact and viability of EVs.
The packaging of batteries is decisive in determining key performance parameters such as lifetime, cyclability, ruggedness, safety, and cost. At the core of battery packaging technology lies the electrochemical cell, which represents the smallest unit of a battery. Packaging tailors the battery to specific applications, encompassing aspects such as sealing, form factor, temperature and charge monitoring, and overall management. In the case of high-voltage or high-capacity EV batteries, multiple electrochemical cells may be interconnected in parallel or series, or both. Furthermore, these cells are integrated into modules that monitor individual cells and control the temperature. The modules are subsequently assembled to form a complete battery pack (
Figure 1).
The flatness of the side plate and the bottom plate is important as it directly impacts the successful integration of the battery into the module and, consequently, the module into the battery pack. Hence, flatness represents one of the most crucial and widely considered geometric tolerances that determine the quality of the products. The definition of flatness is the degree to which the surface of a measured plane or the median plane contains all its elements. The definition of flatness tolerance is such an area, that is, two parallel planes, which can contain the median plane and the entire area of the measured plane. The concept of flatness pertains to a fundamental geometric primitive. The geometrical product specifications (GPS) standard describes the verification process for flatness as encompassing partitioning, extraction, association, and evaluation. The new GPS standard recognizes two types of reference planes: the Minimum Zone Reference Planes (MZPL) and the Least Squares Reference Planes (LSPL). Corresponding to these reference planes, two association methods for flatness error evaluation are used: the Minimum Zone Method (MZM) and the Least Squares Method (LSM) [
4,
5].
Several researchers have developed new techniques to improve the flatness tolerance evaluation. The literature provides insights into various techniques used to develop methodologies for obtaining minimum zone solutions. These techniques include nonlinear optimization, metaheuristics, and approximation based on linear programming [
6,
7,
8,
9,
10,
11,
12,
13,
14,
15,
16,
17]. Although a smaller area can be found eventually, the algorithm has a long computation time and is complex to implement. In modern equipment like coordinate measuring machines (CMM), the LSM is often used for flatness evaluation at present [
18,
19]. However, in battery manufacturing, the flatness of large planes, such as the side plates and end plates of the battery module, needs to be monitored closely, making it challenging to obtain the distribution characteristics of flatness in depth. Most classical quality monitoring techniques in parts manufacturing rely on assumptions regarding sample distribution. Hence, the theoretical distribution of flatness plays a crucial role in both design and manufacturing processes. Chelishcev et al. [
20] used a CMM to measure the rotating parts and used the probability density function method to predict the size distribution of the measured parts and demonstrated the applicability of a distribution-free model in predicting the sample size, minimum content, and confidence level for CMM-based Geometric Dimensioning and Tolerancing inspection without the need for prior measurements. Berrado et al. [
21] put forth a method that considers the measurement round-off and small sample sizes. They employed the Moran log spacing statistic to evaluate the suitability of a normal distribution in fitting the data. The results of their research suggest that the normal distribution adequately fits all the examined datasets. The study has demonstrated that the assumption of normality is adequately met for 20 industrial datasets encompassing four processes and three distinct material types. Ghie et al. [
22] proposed an analysis method based on the Jacobian–Torsor model combined with the uncertainty of measurement and used a probability density function to simulate the distribution of part manufacturing sizes. Wang et al. [
23] presented the actual measurement and statistical processing of error data to show that the distribution law of shape and position error obeys the folded normal process data (also known as the absolute normal distribution) under certain conditions. Chatterjee et al. [
24] provided examples of situations commonly associated with folded normal distributions. These instances encompass measurements pertaining to flatness or straightness, as well as the distance between two objects. A variety of planes are involved in the manufacture of battery modules/PACK, among which the plane is an important geometric element that constitutes mechanical parts [
25]. It is often used as the datum plane for testing. To control the quality of the plane, it is necessary to measure the flatness. Investigate the distribution of flatness. And flatness also has an important application in mechanical design, such as in modern technologies like CAD/CAM/CAE [
4].
The novelty of this paper is related to three aspects: Based on the definition of flatness and the principle of multivariate statistics, flatness statistics are constructed, and their theoretical distribution is deduced.
The theoretical distribution of flatness is validated against experimental flatness datasets using the maximum likelihood estimation (MLE) and Kolmogorov–Smirnov (KS) test.
The application scope of the distribution form of flatness is extended to large sample data acquired on a long plane. Then, the tolerances related to flatness are optimized.
The subsequent sections of this article are organized as follows:
Section 2 introduces a statistical model of flatness. The theoretical distribution of flatness is deduced as a normal range distribution, and the normal range distribution function and density function are calculated using MLE and numerical integration.
Section 3 describes the implementation of experiments, while
Section 4 presents a comprehensive summary and discussion of the key points and findings presented in the article.
2. Materials and Methods
2.1. Statistical Model of Flatness
This investigation primarily focuses on the flatness distribution. A flowchart to study the form of the distribution is shown in
Figure 2. Form tolerances typically define a designated area in which a consideration plane, line, axis, or center plane should be located. In accordance with the measurement standard [
26], flatness tolerance specifies that all points on the surface must fall within the regions defined by two parallel planes. Specifically, the tolerance value represents the distance between these two parallel planes.
In practical applications, a limited number of coordinate points on the plane are measured to determine the flatness value. It is necessary for all measurement points to fall within the specified area defined by the measurement criteria. The measurement of flatness can be represented by the distance between two parallel planes that contain all the measurement points. One effective approach to meet this requirement is to fit the set of measurement points with a regression plane and calculate the flatness using the residual.
Figure 3 provides a visual representation of this concept, depicting a cross-section of a panel. In the figure, the symbol
represents the region enclosing the fitting plane, and it indicates the distance between the parallel planes that enclose all the measurement points. The symbol
represents the distance between the measurement point and the fitting plane. The symbol
represents the vertical distance between the measurement point and the horizontal plane. The dots represent the set of points taken during measurement. The blue line represents the actual shape of the plane. The yellow line represents the fitted plane. Mathematically, it is feasible to evaluate the flatness distribution by the LSM. This method has excellent premise assumptions:
- 1.
; the independent variable is currently correlated with the dependent variable;
- 2.
; the mean value of the error is 0; the variance is fixed and obeys the normal distribution;
- 3.
The errors are independent of each other; .
Therefore, to simplify the process of determining the flatness, we make the following assumptions: the fitting plane is fixed, and the measured points on the plane obey the three-dimensional normal distribution with a fixed mean value of 0 and a variance (, , ).
As shown in
Figure 3,
represents the distance from the
th point to the fitted plane (points are positive above the plane and negative below the plane). We consider the
(flatness) statistic of the form.
In the next section, we aim to derive the distribution form of the statistic .
2.2. Exact Distribution of R (Flatness)
This section shows that
statistic can be expressed as the range form of a multivariate independent normal distribution. Therefore, its exact distribution can be obtained easily from the result of Anderson [
27] on multivariate statistics. Taking the points on the long plane is mainly performed by CMM, and the coordinates of each point are
.
According to the model established in the previous section, let
be the vertical distance between the
th point and the plane. Geometrically, the difference between
and
is only a constant
(that is, the cosine of the angle between the fitting plane and the horizontal plane), where
Due to the randomness of the points,
and
can be regarded as a random variable and
,
as the sample point. The joint density function of n-dimensional random variables
,
are
,
, respectively. According to the Jacobian transformation of multivariate random variables, we have
, where
Therefore, to simplify the derivation of the flatness distribution, it is only necessary to derive the distribution of the distance
between the point and the vertical direction of the fitting plane. According to the assumption that the error
is a normal distribution that is independent and homoscedastic, it can be seen that
is an n-dimensional normal distribution given by
where
The joint density function of the n-dimensional normal distribution, denoted as
, can be expressed as the following:
Then, the joint density function of the n-dimensional random variable
through a Jacobian transformation is
Hence, are independent normal distributions with the same variance . Let the distribution function be , and its density function be . Next, the distribution of flatness is solved by deriving the range distribution .
Let
be the statistical population. Take a sample
and obtain the corresponding order statistic
, where
is the
order statistic
.
is the minimum order statistic.
is the maximum order statistic.
is the sample range. The joint density function of
and
is
Then, its inverse transformation is
Its Jacobian determinant
. The joint density function of
and
is
The distribution function
of the flatness
is
On computation, the density function
of the flatness
is
where
Figure 4 shows the density and distribution function plots of the normal range distribution under different m values.
2.3. Parameter Estimation of the Normal Range Distribution
The parameter
is estimated using the maximum likelihood estimation (MLE) method. Because the normal range distribution and density function are extraordinarily complex and have no analytical solution, a simplified approach is required to maximize the function. Many scholars have tried to solve the problem of seeking the optimal value of the objective function and achieved good results [
28,
29]. The grid search method is a well-established and uncomplicated approach for identifying extreme values, which can be used to solve extreme value problems [
30]. It is a fundamental algorithm for optimizing parameters.
Let
be
simple random samples from a normal range distribution, then the log-likelihood function based on these samples is
Subsequently, we need an estimate of
, as
Since
has no analytical solution, it is necessary to use the Simpson integration method and truncate it first:
There is a truncation error.
According to the precondition, since the plane shows relatively small fluctuations in the vertical direction, the standard deviation of its values along the
axis is much smaller than that in the
and
axes and is generally between 0.001 and 0.1. Moreover, the inclination angle between the fitted plane and the actual plane is relatively small, the
value is close to 1, and the flatness value is generally not greater than 1. The number of points
on the plane generally does not exceed 100. Let
, so the truncation error
When using numerical integration for approximate calculations, such as
, we have the computational errors
obtained by ignoring the truncation error and calculation errors in numerical integration. The Simpson integration formula is
The extreme value solution of (16) can be obtained using the grid method to obtain the MLE
. The estimation performance of model parameters is evaluated using the root mean squared error (RMSE). As depicted in
Table 1, the RMSE values are minimal, and the parameters estimated by MLE are accurate (one hundred simulations were conducted with different values of
and
).
2.4. Kolmogorov–Smirnov Test
The two-sided, one-sample Kolmogorov–Smirnov (KS) statistic is widely used as a test-of-fit statistic to assess the consistency between the distribution of a random sample and a predefined theoretical distribution [
31,
32,
33,
34]. For a given ordered sample
of size
from the theoretical distribution
, its empirical distribution is
.
is the probability distribution function of the theoretical distribution pre-specified under the null hypothesis
; that is, the two data distributions are consistent. The two-sided K-S statistic is defined as
, and
can be represented as follows:
When
is calculated from observations,
will be rejected; otherwise, accept the
hypothesis.
Figure 5 is a KS test chart of normal distribution and normal range distribution for a measurement data group, respectively, where
and
. The KS test value
of the normal distribution is 0.0685 (
p-value = 0.01842 < 0.05), which means that the null hypothesis is rejected, and the data do not obey the normal distribution.
The KS test value of the normal range distribution is 0.0354 (p-value = 0.5461 > 0.05), which means that the null hypothesis can be accepted and the data obeys the normal range distribution.
3. Results
3.1. Verifying Flatness Distribution Based on Experimental Data
Enterprises often use the normal distribution to fit flatness data, calculate dimensional quantities, and process capability indices. However, the flatness data measured on this long plane often cannot pass the KS test when fitting the normal distribution; that is, the flatness distribution is not normal. Hence, calculating the process capability index, such as CPK/PPK, will lead to deviations in the result, distortion of product quality, and an increase in the monitoring cost for product quality. The data set for verifying the flatness distribution comes from two parts: CMM measures the coordinate points on the plane of the aluminum plate and calculates the flatness data; the dimensions of the aluminum plate (specification 6061) is 500 mm in length, 200 mm in width, and 3 mm in height. As shown in
Figure 6, in the experiment, the flatness data sets with sample sizes of
,
,
, and
were obtained when the number of points
, and
were taken (points were taken at equal intervals in the X direction and Y direction of the aluminum plate, respectively).
Firstly, the first data set obtained through experiments is processed. The
of the corresponding data set is obtained through the parameter estimation method in the second chapter. Subsequently, the KS test is performed for the normal distribution and the normal range distribution. The results are shown in
Table 2. The results show that for three different values of m, there are the following conclusions: when the sample size is small (less than or equal to 100), both the normal distribution and normal range distribution pass the power KS test; however, when the sample size exceeds 500, the normal distribution cannot pass the KS test (
p-value < 0.05).
In contrast, the normal range distribution always fits the flatness data (p-value > 0.05), and the surface flatness data is not normally distributed. Therefore, the larger the sample size, the closer the flatness data is to the true distribution form, that is, the normal range distribution.
3.2. Application of Flatness Distribution Form in Dimensional Chain
Tolerance runs through the two stages of product design and manufacturing. It determines whether the product can meet the technical (quality) requirements of the product while enabling the product to be processed most economically. Dimensional chain calculation and tolerance analysis have significant applications in the design and manufacture of products. During the product design process, the accuracy of the assembly of the product can be analyzed and calculated using the dimensional chain, and the accumulation of part tolerances in the product assembly process can be analyzed. For the mechanical manufacturing process, the accumulation and synthesis of manufacturing errors during processing can be analyzed using dimensional chain calculations. The probability method is commonly used for these calculations. The probability method expresses dimensional changes as probability distributions, which requires knowing the true distribution form of shape tolerances such as flatness. The statistical tolerance of the closed loop can be obtained by the following formula when calculated by the probability method:
In the formula,
: Closed ring tolerance;
: Tolerance of the component ring;
: Relative distribution coefficient of the size of the constituent ring;
: Transfer coefficient of the constituent ring;
: Relative distribution coefficient of the closed loop, generally approximating 1.
When the size chain of a particular product comprises flatness values, assuming that the assembly tolerance requirement is ±0.5, the normal distribution and the normal range distribution are used to distribute the tolerance by the probability method.
Figure 7 shows the influence of the two distribution forms on the differences between the parts in terms of dimensional tolerance requirements (assuming that each dimension in the dimensional chain is assigned the same tolerance):
Where
is related to the form of the distribution,
is the relative standard deviation, which is equal to the ratio of the standard deviation to half of the tolerance, that is
;
is the relative standard deviation of the normal distribution, take the tolerance
(corresponding to a confidence level of 99.73%) when
, so
. For a normal range distribution
, its expectation
and variance
are, respectively,
The tolerance can be obtained by . To facilitate the calculation, we can normalize the subsample , let , , then ; let , so there are , , .
In the example in
Figure 7,
,
,
. And finally, it can be found that
The tolerance values for different numbers of rings are obtained by Formula (22).
Using the normal range distribution for flatness will relax the tolerance of some rings. In the high-value scenario of battery manufacturing, this can make the processing of parts more economical and suitable for mass production.
4. Discussion
As mentioned earlier, flatness is a form tolerance critical to ensuring the quality of long straight plane assemblies. Out-of-flatness issues can result in substantial and expensive assembly rejection problems. Consequently, monitoring the flatness data corresponding to the different number of points on the plane is crucial. In this paper, we deduce that flatness has a normal range distribution based on a statistical theoretical model; the flatness data with different
values gained by the experiment all fit the normal range distribution, which has an excellent fitting effect. And it can be explained that the distribution pattern of flatness is dependent on the number of sampling points m. In the experiment results, we have shown that a normal distribution can be fitted under small samples, which is consistent with the conclusions derived by Berrado et al. [
3]. However, the normal distribution cannot be fitted for large samples.
As a large amount of data were required for the analysis, we used multiple CMM measurements of a certain number of aluminum plates to study the flatness. We anticipate that these findings can also apply to flatness measurements in other processes. To enhance the credibility of this statistical theoretical model, it should be validated further by utilizing additional flatness datasets collected from diverse components. This expanded validation process will help generate more substantial evidence and strengthen the applicability of the model across various scenarios. Further research is also required for numerical calculation and parameter optimization to increase compliance with high product quality standards and reduce errors in monitoring the manufacturing process.