1. Introduction
With the rapid and excessive development of motorization, most cities worldwide are confronted with traffic congestion, air deterioration, noise pollution, fossil fuel consumption, and carbon emissions [
1]. Public transport, especially the metro, has excellent advantages in alleviating traffic congestion, reducing fossil energy consumption, improving transportation efficiency, and reducing carbon emissions [
2]. However, unlike private cars, the metro provides a “station-to-station” service rather than a “door-to-door” service. Therefore, the last-mile problem has become one of the main obstacles for travelers choosing public transport [
3,
4]. The most common last-mile travel modes include walking, shared bicycles, and buses. However, each mode has its advantages and disadvantages. For example, walking provides excellent flexibility, economy, and environmental friendliness and it benefits the physical and mental health of the traveler [
5,
6]. However, walking is only suitable for travel within 10 min [
7], and its use is restricted in bad weather (e.g., rain and snow). Shared bicycles solve the problems of access and parking, but are likely to cause traffic accidents and traffic congestion [
1]. Similar to walking, shared bicycles are also restricted in bad weather. Taking a bus presents the advantages of short travel time and low travel cost, but the walking and waiting times are relatively long, and the attractiveness of taking the bus is not high enough.
Autonomous vehicles rely on artificial intelligence, visual computing, radar, surveillance devices, and global positioning systems working in tandem to allow computers to automatically and safely operate motorized vehicles without human initiative. Autonomous vehicle (AV) technology has made great progress in the past few years. The application of AVs would reduce traffic accidents [
8,
9,
10], alleviate traffic congestion [
11,
12], improve fuel economy [
13,
14,
15], and reduce carbon emissions [
16,
17]. It would also improve the mobility of the elderly and the disabled [
18], reduce the travel pressure on drivers [
19], and improve the efficiency of multi-task work [
20]. Therefore, shared AVs have the potential to solve the problem of deciding on a transportation method for the last mile of metro trips.
Although there is much in the literature regarding the impact of shared AVs on travel behavior, most studies mainly focus on the impacts of vehicle travel mileage [
21,
22,
23,
24], travel mode choice [
25,
26,
27,
28], and vehicle travel time [
29,
30]. Few studies have explored the use of shared AVs to solve the last-mile problem. In addition, most of the studies in the literature detail the influence of an individual’s attitude and perception towards AVs and other latent psychological variables on the behavioral intention of choosing AVs. However, a mature theoretical framework that combines behavior with attitude [
31], such as a combination of the technology acceptance model and the theory of planned behavior, has not been adopted.
In order to fill these gaps, we studied the latent psychological variables of travelers toward AVs in this paper, based on the mature theory of planned behavior, including attitudes (ATTs), subjective norms (SNs), perceived behavior control (PBC), and behavioral intention of use (BIU). Then, these variables were incorporated into the discrete choice model to establish a hybrid model, with Wuhan as an example, to conduct an empirical analysis of the influencing factors and degrees of influence of the last-mile travel mode choice behavior of travelers taking metro trips (beginning and ending in the city). The results of this paper are expected to improve the service level of the metro and provide insights for transportation planning within 1.5 km of subway stations. The main contributions of this paper are listed as follows:
(1) There was significant heterogeneity for travelers in their travel mode choices for the last mile of metro trips. (2) Respondents’ latent psychological variables towards AVs had significant impacts on their travel behavior, but the impacts varied among different segments. (3) Demographic characteristics, such as education, career, and monthly household income, had a significant impact on the membership of each latent class (LC). (4) The willingness to pay for walking and waiting times, as well as in-vehicle time, varied significantly among travelers in different segments. (5) Elastic analysis results illustrated that a 1% increase in the travel cost for shared AVs in segment 1 led to a 7.598% decrease in the probability of choosing shared AVs.
The paper is structured as follows:
Section 2 presents the literature review. In
Section 3, the structural equation models (SEMs) and LC choice models are briefly discussed.
Section 4 presents the survey design, data collection process, and descriptive statistics.
Section 5 shows and discusses the results of the final estimated model. Finally, conclusions and recommendations for further research are presented in
Section 6. An AV, in this paper, refers to a fully self-driving vehicle.
2. Literature Review
Travel time and travel costs are considered the most critical factors affecting travel behavior. Ortúzar [
32] investigated travel mode choice in the Garforth Corridor in West Yorkshire, England, and found that in-vehicle time, out-of-vehicle time, and travel costs were significant factors affecting travelers’ travel mode choices. Stern [
33] asserted that travel cost was related to travel mode choice for elderly and disabled people in rural Virginia. Ewing et al. [
34] confirmed that travel time significantly influenced the school travel mode choices of students in Gainesville, Florida. Frank et al. [
35] explored factors affecting the travel mode choices and trip-chaining patterns of residents in the Central Puget Sound (Seattle) region and ascertained that travel time and cost significantly affected travel behavior. Wang et al. [
36] believed that travel cost determines the use of shared travel modes in Beijing.
Sociodemographic factors also have a substantial impact on travel behavior. Schwanen et al. [
37] confirmed that the contributory factors of travel mode choices for senior citizens taking leisure trips included age, gender, car ownership, possession of a driver’s license, and educational attainment. Zhang [
38] analyzed the travel mode choices of travelers in Metropolitan Boston and Hong Kong and found that their choices were affected by socioeconomic characteristics, such as age, job, homeownership, children, and car availability. Verplanken et al. [
39] believed that the travel mode choices of university employees were related to age and gender. Tilahun et al. [
40] explored the travel mode choices of commuters in the northeastern Illinois area and demonstrated that their choices were effected by gender, age, vehicle/household size, income, and vehicle availability.
Besides travel time, travel costs, and socioeconomic characteristics, the latent psychological variables, such as values, norms, ATTs, perceptions, and desire, play a pivotal role in an individual’s travel mode choice [
39,
41,
42]. Numerous studies have delved into individuals’ attitudes and perceptions regarding autonomous vehicles (AVs). For instance, Sanbonmatsu et al. [
43] found that an individual with a higher awareness of AVs has a stronger intention to use them. Panagiotopoulos et al. [
44] asserted that latent variables such as perceived usefulness, perceived ease to use, perceived trust, and social influence, significantly impact respondents’ behavioral intentions towards AV usage. Choi et al. [
45] and Kaur et al. [
46] concluded that perceived trust positively affects the adoption of AVs. Haboucha et al. [
47] stated that pro-AV sentiments, environmental concern, and technology interest are related to users’ preferences regarding AVs. Lavieri et al. [
48] suggested that privacy sensitivity is related to individuals’ willingness to share trips with strangers. Nevertheless, most studies have not adopted a mature theoretical framework that combines behavior with attitude [
31], such as the technology acceptance model and theory of planned behavior.
Discrete choice models have been widely used to study individual travel behavior. The multinomial logit (MNL) model is the most common in practical applications due to fewer sample requirements, mature technology, and easy implementation [
49]. However, the MNL model is not without limitations. Specifically, it assumes uniform preferences across individuals and upholds the principle of independence of irrelevant alternatives (IIA). When alternatives are inherently independent, the IIA assumptions may not align with real-world scenarios, leading to issues exemplified by the “red bus or blue bus” problem [
50,
51]. The nested logit (NL) model came into being in response to the flaw of IIA. The NL model establishes a tree structure based on the correlation between the alternatives: the alternatives are dependent on the same nest but independent among different nests, which overcomes the IIA problem to a certain extent [
32]. The significant challenge in employing the NL model is to determine the tree structure reasonably [
52]. Unlike the fixed coefficients in the MNL model, the mixed logit (ML) model assumes that the coefficients of the explanatory variables are random and obey a specific probability distribution. Therefore, the ML model can solve the preference heterogeneity problem [
53,
54]. The ML model can also be called the random parameter logit (RPL) model. The ML model needs to determine the distribution type that the model coefficients obey in advance, and then the corresponding parameter values can be estimated [
55]. Typical parameter values are the mean and standard deviation. The former reflects the average preference, while the latter is the magnitude of the preference difference. Like the ML model, the latent class (LC) model handles the problem of random preference heterogeneity by dividing the respondents into several classes and applying different coefficients, respectively [
56]. Although both LC and ML models serve as primary instruments for navigating preference heterogeneity, empirical studies often indicate a slight superiority of the LC model over the ML in aspects such as goodness of fit, theoretical foundation, and information depth [
57,
58]. Based on the above discussion, this study aims to analyze the effects of travel time, travel cost, socioeconomic characteristics, and psychological latent variables on autonomous vehicles to solve the last-mile problem of metro trips.
5. Results
5.1. Results of Latent Variable Model
Stata 15.0 was used to test the latent variable model in the study. A confirmatory factor analysis (CFA) was conducted to ascertain the influence of variables on the adoption of AVs. The data needed to be evaluated for reliability and validity before performing CFA.
Table 4 provides details of the reliability and convergent validity of constructs. Standardized factor loadings are a measure of the strength of the relationship between an observed variable and a latent variable, which indicates the degree of linear relationship that exists between an observed variable and the factor to which it belongs. The standardized factor loadings of 12 observed variables ranged from 0.851 to 0.961, exceeding the standard of 0.5 [
67]. Cronbach’s alpha is a measure of the reliability of a scale or test. All Cronbach’s alpha values of four latent variables were above the acceptable level of 0.70 [
68]. The composite reliability (CR) of a latent variable is the combination of the reliabilities of all its observed variables, and this indicator is used to analyze the consistency among the observed variables of the latent variable. The minimum CR value was 0.921, and all values were higher than the minimum threshold of 0.7 [
69]. The average variance extracted (AVE) calculates the variance explanatory power of the latent variable, i.e., it directly shows how much of the variance explained by the latent variable is from measurement error. The AVE values of all constructs were between 0.797 and 0.893, indicating that the measurement model has a good structural reliability and convergence validity [
69].
Table 5 presents the results from the discriminant validity examination. All square values of AVE are higher than the inter-construct correlations, demonstrating that the latent variables have acceptable discriminant validity. The measurement model has been validated and used for structural model analysis.
The estimation results of latent variable measurement and SEMs indicated that the model fits the data well based on fit indices such as chi-square/degree of freedom (
), the root mean squared error of approximation (RMSEA), the comparative fit index (CFI), the Tucker–Lewis index (TLI), and standardized root mean square residual (SRMR). The chi-square/degree of freedom is a hypothesis testing method used in data analysis to detect the relationship between two categorical variables. The root mean squared error of approximation is an index that evaluates the model’s lack of fit; if it is close to 0 it indicates a good fit, and conversely, the further away from 0 the worse the fit. The comparative fit index is obtained when comparing a hypothetical model with an independent model. Its value ranges from 0 to 1, with a value closer to 0 indicating a worse fit and closer to 1 indicating a better fit. The Tucker–Lewis index is a comparative fit index that takes values between 0 and 1, with a value closer to 0 indicating a worse fit and closer to 1 indicating a better fit. The standardized root mean square residual is an absolute goodness-of-fit index, which is used to assess the average size of the difference between the observed and expected correlation matrices.
= 3.775 (critical value is between 1 and 5 when the sample size exceeds 500 according to [
60]), RMSEA = 0.064 (less than the critical value of 0.08 based on [
70]), CFI = 0.983 (more than the critical value of 0.90 on the basis of [
71]), TLI = 0.977 (more than the critical value of 0.90 in accordance with [
68]), SRMR = 0.024 (less than the critical value of 0.08 on the basis of [
72]).
5.2. Results of Latent Class Choice Model
In principle, more classes mean better goodness of fit at the cost of decreasing parsimony. The Bayesian information criterion (
BIC) is a widely recognized statistical method employed for choosing the most appropriate model from a defined set of candidates. It computes the probability function and adds a penalty term for the number of parameters in the model. The consistent Akaike information criterion (
CAIC) is an adapted form of the classic Akaike information criterion, which was developed by modifying penalties. This helps to avoid overfitting and provides a balanced approach to model selection. Both
BIC and
CAIC were proposed to penalize the number of classes.
Table 6 summarizes these measures concerning the models with one to six classes. Among them, the four-segment LC model has the lowest
BIC and
CAIC, and the rho-bar squared between 4 and 5 is 0.0009. Four classes might be the optimal number of classes considering the objective of the study and its simplicity. As shown in
Table 7, all models with LCs outperform the single-segment model (MNL model), confirming the heterogeneity of the preferences of the sample.
The chosen four-segment model exhibits a rho-bar squared value of 0.400. The class probability model includes socioeconomic characteristics as explanatory variables, and the parameters are shown in
Table 8.
Table 7 presents the model statistics of multinomial and four-segment LC models. Notably, segment 4 comprises only 7.3% of the overall sample. Within this segment, the coefficients for walking and waiting time are significantly positive, whereas those for in-vehicle time and trip cost prove to be statistically inconsequential. This indicates that respondents within this segment may not have fully grasped the essence of the choice task presented to them.
Segment 1 comprises 28.7% of the total sample. The parameter estimates and corresponding z values in segment 1 indicate that both walking and waiting time and trip cost are statistically significant, suggesting that increasing walking and waiting time and travel cost will reduce travelers’ willingness to use a certain travel mode. A significant majority of respondents favored walking (93.9%), while a comparatively smaller portion preferred shared bikes (4.8%). The results indicate that segment 1 is interested in walking. According to the class member model, respondents who belonged to segment 1 are more likely to possess bachelor’s degrees and upper middle income (CNY 10,001–20,000 monthly) and are less likely to have education levels of secondary school and below. Furthermore, the latent psychological variables such as ATT and BIU are related to the total utility for using AVs. The ATT towards AVs has a strongly negative contribution (marginal value equals −4.251) to the total utility of using AVs as last-mile transport. The results show that respondents who are positive about adopting AVs are less willing to use AVs as egress mode. The latent psychological variable BIU contributes positively to the total utility, indicating that a higher BIU decreases the disutility of using AVs for last-mile trips.
Segment 2 consists of 37.8% of the respondents. Within this segment, the transportation mode preferences are distributed as follows: walking (21.8%), shared bikes (33.5%), shared AVs (16.2%), and buses (28.4%). Demographically, individuals in this segment predominantly possess a bachelor’s degree and are employed as public servants, in public institutions, or within enterprises. Their income levels tend to be median when compared to other segments. The parameter estimates of travel characteristics are all statistically significant. However, none of the latent psychological variables regarding AVs influence travelers’ preference for AVs as egress mode in segment 2. This observation suggests that respondents in segment 2 are not familiar with AVs.
Segment 3 includes 26.2% of the overall sample. A striking majority of these respondents, 91.3%, favor shared bikes, indicating a dominant inclination towards this mode of transportation for last-mile journeys within this group. In terms of socioeconomic characteristics, individuals in this segment are more likely to possess a bachelor’s degree in the upper-middle-income category, demonstrating that shared bikes are attractive for these travelers in the last mile trip. The waiting and walking time, in-vehicle time, and travel cost proved statistically significant. BIU has a significant positive influence, while PBC has a significant negative effect on the utility function of choosing AVs for last-mile transport.
5.3. Elasticities
Table 9 presents both the direct and cross-elasticities related to the travel cost for each of the travel modes, aiming to analyze the variation in preferences among the three latent classes (LCs), excluding the residual segment 4. The elasticities illustrate the percentage change in the choice probability of four travel modes due to a 1% change in the level of travel cost. For example, a 1% increase in the travel cost for shared AVs leads to a 0.649% decrease in the choice probability of shared AVs (i.e., direct elasticity), while it causes a 0.105% increase in the probability of choosing walking, shared bikes, and buses (i.e., cross-elasticity) when considering the entire sample. The direct elasticities of travel cost for all travel modes of the MNL model and class2 were bigger than negative 1, showing that a 1% increase in the travel cost for all travel modes will decrease choice probabilities by less than 1%. However, the direct elasticities of travel cost for all travel modes of class1 are smaller than negative 1 and those in the MNL model, class2, and class3, which indicates that respondents from class1 are more sensitive to travel cost than individuals from class2 and class3. In class1, a 1% increase in the travel cost for shared AVs leads to a 7.598% decrease in the choice probability of shared AVs.
5.4. Willingness to Pay
Although travelers from different segments have the same signs for preferences, there are differences in sensitivities to walking and waiting time, in-vehicle time, and trip cost. The travelers’ willingness to pay is calculated as a trade-off ratio between the estimated time parameter and the estimated cost parameter.
Table 10 shows travelers’ willingness to pay values across distinct segments. Individuals from segment 1 who are sensitive to travel costs were willing to pay CNY 14.5 to reduce a one-hour walking and waiting time. In-vehicle time was found to have an insignificant influence on respondents’ last-mile egress mode. Conversely, travelers in segment 3 were willing to pay up to CNY 441.3 and CNY 63.0 to decrease one-hour walking and waiting time and in-vehicle time. The results are consistent with previous research. Seelhorst and Liu (2015) postulated that price-sensitive travelers were willing to pay less to reduce travel time [
73]. Wen and Lai (2010) discovered that travelers with high incomes were willing to pay more to improve service quality [
74].
6. Discussion
6.1. Summary of Results
This study positioned shared AVs in the public transport market and applied an LC model to understand the unobserved preference heterogeneity across respondents. Four distinct market segments concerning the preference were identified for the last-mile travel mode choice of metro trips. By analyzing the preference heterogeneity and group characteristics of these LCs, we can determine the target group for using shared AVs in the last mile of metro trips. Travelers who choose shared AVs with the highest proportion belong to segment 2. These people are more likely to possess a bachelor’s degree, work in public servants/public institutions, and be enterprise employees with a middle income. Different strategies should be used to increase the attractiveness of shared AVs in response to the heterogeneity of choice preferences of travelers from different segments.
Latent psychological variables exerted differential impacts across segments. The latent psychological variables of attitude towards AVs have a significant negative impact on the trips of travelers from segment 1 while they have positive effect on the travelers from segment 4, indicating that travelers from segment 1 lack sufficient knowledge of AVs. In the process of promoting and popularizing AVs, it is important to consider the psychological attitudes of travelers towards AVs. The direct elasticity analysis shows that the travelers from segment 1 are most sensitive to travel costs, and the direct elasticity value reaches −7.598, which indicates that a 1% increase in the travel cost for shared AVs leads to a 7.598% decrease in the choice probability of shared AVs. The cross-elasticity analysis shows that the cost of shared bicycles has the greatest impact on shared AVs. In segment 3, a 1% increase in the travel cost for shared bikes will cause a 0.472% increase in the choice probability of shared AVs. This suggests that an astutely calibrated pricing strategy can facilitate the promotion and adoption of shared AVs.
Travelers from different segments have different willingness to pay for walking and waiting time and in-vehicle time. For travelers from segment 1, in-vehicle time does not significantly influence their willingness to pay. Conversely, those from segment 2 are willing to pay CNY 20.4 to reduce one-hour in-vehicle time, while for segment 3, this value rises to CNY 60.3 per hour. Walking and waiting times significantly impact travelers from all segments, but the magnitude of the impact varies greatly. Travelers from segment 2 are willing to pay CNY 11.0 to reduce one-hour walking and waiting time, while segment 3 travelers are willing to pay up to CNY 441.3 for the same reduction. The results show that the value of travel time for high-income groups is relatively higher. Hence, the travel experience of travelers can be improved by rationally designing the operation routes and departure intervals of shared AVs to reduce the walking and waiting time.
6.2. Contributions and Comparison to Literature
The contribution of this paper mainly includes three aspects. From the perspective of research objects, although Chinese AVs have made rapid progress, there is no research on Chinese shared AVs to solve the problem of the last mile of metro trips. Although AVs have been studied in the Netherlands, Atlanta area, Ann Arbor–Detroit Area, and other areas [
3,
4,
75], findings from these areas may not be applicable to other areas.
From the perspective of research methods, diverging from prevailing methodologies that deploy cluster analysis to discern user groups [
76], this paper is the first time that the LC logit model has been used to study travelers’ preferences to choose shared AVs to solve the last-mile problem of metro trips.
From the perspective of influencing factors, in addition to socioeconomic attributes and travel characteristic variables that have an impact on travel behavior, travelers’ psychological latent variables have a significant impact on travel mode choice. For example, the perceived trust and perceived reliability of AVs affect travelers’ preference for AVs [
3]. Nevertheless, few studies focused on mature theoretical frameworks that combine behavior and attitude [
31]. Addressing this gap, this paper adopts the theory of planned behavior to study latent psychological variables, including ATTs, SNs, PBC, and BIU of travelers towards AVs.
6.3. Limitations and Further Work
This area still needs to be further studied. Firstly, the results of this paper are mainly specific to the research area. Research results in different countries and regions will vary. Secondly, the COVID-19 pandemic has markedly reshaped travel behaviors worldwide. Future research can consider the impact of COVID-19 on travelers’ preference for shared AVs. Thirdly, this paper mainly studies the option of metro and transfer while ignoring the option of private AVs throughout the journey. More options for travel indicate a more complex system and greater uncertainty. Studying more modes of travel helps to understand this uncertainty. Finally, this paper does not fully consider the influence of distinct geographic and regional characteristics on travel behaviors. For instance, the impact of significant landmarks such as the Yangtze River on residents’ travel behaviors or the discrepancies in travel mode choices between primary urban areas and their peripheral counterparts warrants attention in future studies.