1. Introduction
High-definition maps are electronic maps with absolute accuracy of up to the centimeter level, which can provide rich static and dynamic traffic information. They are important guarantees for the safe and efficient travel of autonomous vehicles and can provide accurate positioning, optimized routing, and vehicle control for autonomous vehicles [
1]. The higher the autonomous driving level, the higher the requirements for high-definition maps (
Table 1). Starting from level L3 autonomous driving, the subject of real-time environmental perception has changed from human drivers to autonomous driving systems [
2].
Compared with traditional maps, high-definition maps have the characteristics of high accuracy, strong timeliness, large amounts of data, and high dynamics; can provide more accurate positioning services, more abundant traffic information, more reasonable route planning, and other services; and play an important directing role in the scenario of autonomous driving [
3]. With the continuous development of intelligent driving, users’ demands for the richness of high-definition map road environment information, the timeliness of information transmission, and the personalization of information are becoming increasingly strong. However, currently, high-precision maps still have problems such as large amounts of data, serious data redundancy, and weak data association, which make autonomous driving vehicles fall into difficulties such as difficult data query and low data transmission timeliness [
4], which are not conducive to the efficient control and timely communication of autonomous driving vehicles and further make it difficult to improve user satisfaction. In response to these problems, a large number of experts and scholars have made efforts, especially in the aspect of autonomous vehicle control and communication technology. For example, MA et al. [
5] proposed a human–machine shared steering controller based on the Nash game strategy and designed a continuous weight adjustment algorithm to balance the steering control rights of the driver and the intelligent system, significantly improving the vehicle control rights coordination degree and the vehicle path tracking accuracy, which is of great significance for improving the dynamic positioning accuracy of autonomous driving vehicles. Sun et al. [
6] proposed a state-aware event-triggered communication strategy, which can dynamically adjust the coupling of vehicle control and communication transmission, which is of great significance for optimizing information transmission. Ding et al. [
7] proposed a vehicle control method of human–machine collaborative predictive steering control, which has dynamic path planning and tracking functions and improves the human–machine collaborative steering performance. Using pre-stored data for trajectory prediction is of great importance for avoiding the risks caused by low timeliness of data transmission. Ding et al. [
8] took a hybrid heavy-duty truck as the experimental object and designed a steering control system based on the safe elastic trigger output feedback LKC (lane-keeping control) to counter the aperiodic energy-bounded DoS (denial-of-service) attack, greatly reducing the driver’s steering burden and the conflict of driving control rights. It also proposed a communication scheme based on the safety and adaptive adjustment of auxiliary variables, which can greatly save information transmission resources and is of great significance for improving the timeliness of information transmission. The emergence of these achievements has a mitigating effect on problems such as difficult data query and low data transmission timeliness of autonomous driving vehicles, but it still needs the support of a complete and accurate high-definition map information system to provide a guarantee for the enrichment of map data information, the optimization of data quality, and the improvement of the degree of data association [
9]. In this regard, it is necessary to explore a complete and accurate high-definition map information system model in the autonomous driving scenario and simultaneously analyze the role played by data at different levels of the model in the autonomous driving scenario so as to provide assistance for the development of high-definition maps in coordination with intelligent driving.
2. High-Definition Map Information System Model (HDMISM)
According to the Kolacny theory in the cartography theory, the traditional map information transmission model mainly includes the cartographers (H
1), the map users (H
2), the objective world (W), the cartographers’ spatial environment cognitions (C
1), the map users’ spatial environment cognitions (C
2), and the map (M
1), that is, Model
1 = (H
1, H
2, C
1, C
2, W, M
1). This theory points out that the traditional map information transmission mainly follows the basic principle of one-way transmission, that is, W→H
1→C
1→M
1→C
2→H
2. The objective world is recognized by the cartographers and forms the cognition of the world’s spatial environment. The cartographers then use a certain map language to create the map based on their own cognition, and the map users parse the map language to further form the cognition of the spatial environment of the objective world [
10]. However, the cognitions are mainly based on the cartographers’ cognitions of the objective world, and the entire information transmission process follows the principle of one-way transmission, with a clear division between the cartographers and the map users. The information transmitted by the traditional map information transmission model mainly consists of relatively static spatial cognition with low timeliness, which fails to convey dynamic real-time environmental cognition in a timely manner. This is not conducive to the real-time cognition of the dynamic environment by automatic driving vehicles and the making of driving decisions and requires the support of high-definition maps with stronger real-time performance.
This article believes that high-definition maps are more intelligent than traditional maps and that the content included in the information transmission model is richer. For example, both the cartographers and the map users need to not only cognize the geographic spatial environment but also form cognitions of the spatiotemporal environment and complete the perception with a large number of sensors, making the cognitions of the cartographers and the map users dynamic (
,
). In addition, it is necessary to add the personalized demand information and cognitive characteristics of the map users (D) so that the high-definition map (M
2) can meet the needs of the map user in real time and have self-adaptability. It should be noted that there is no clear identity boundary between the cartographers and the map users of the high-definition map, and, in certain cases, the identities of the two overlap, and some map users will participate in the map production. As a result, the information transmission model of the high-definition map is Model
2 = (H
1, H
2,
,
, W, M
2, D). Under this kind of information transmission model, the transmission of information mainly includes six stages: ① Based on D, a user model is built to lay the foundation for leading the cartographers to produce maps that meet the requirements; ② the cartographers obtain information about the objective world and form a cognition (
); ③ the cartographers process the information and form the map M; ④ the map users interpret the map information and form cognitions (
); ⑤ there is interactive feedback between the cartographers and the map users to provide guidance for optimizing the personalized service functions of the map; and ⑥ the cognitions of the map users guide the implementation of their actions. The information transmission model of high-definition maps can be described as shown in
Figure 1.
As shown in
Figure 1, compared with the traditional Kolacny map information transmission model, the information transmission model of high-definition maps is no longer one-way information transmission, and the identities of the cartographers and the map users are no longer clearly divided. Some map users will also participate in mapping in the form of popular crowdsourcing to ensure the dynamic nature of map information. The map information is mutually transmitted between the map users and professional cartographers, and the cognitions of both sides are integrated under the traction of user demands, making the interaction feedback between the two more connected, ensuring that the personalized needs of the map users can be understood by the cartographers. From this, it can be known that the high-definition map information transmission model mainly has the characteristics of crowdsourcing, personalization, and dynamization. The discussion of the high-definition map information transmission model helps to accurately grasp the logical hierarchy of the high-definition map information system (HDMIS) and then clarify the information interaction relationships and methods at different levels, which has important guiding significance for strengthening the application effect of high-definition maps in the autonomous driving scenario [
11]. In this regard, an HDMISM will be proposed below on the basis of the above information transmission model, and its data logical structure and application in the autonomous driving scenario will be analyzed.
The academic community, map vendors, and auto companies have continuously increased their attention on the HDMIS but failed to unify their understanding of its data logical structure [
12]. For example, in China, He Yong and others believed that the HDMIS can be divided into the traffic sign information layer, the lane line information layer, the lane network information layer, and the road network information layer [
13]. The German Bosch Group (BOSCH) divided the HDMIS into the dynamic data layer, the semi-dynamic data layer, the semi-static data layer, and the static data layer [
14]. The Li Keqiang team from Tsinghua University in China divided the HDMIS for autonomous driving into a total of seven layers: the road-level road network layer, the traffic information layer, the road-lane connection layer, the lane-level road network layer, the map feature information layer, the dynamic perception container layer, and the intelligent decision support layer [
15]. HERE, a company under Nokia in Finland, based on cloud services, divided the HDMIS into the road model layer, the lane model layer, and the positioning model layer [
16]. In China, E-Maptech Technology Co., Ltd., jointly joined hands with Tsinghua University in Beijing, Guoqi Intelligent Connected Automobile Research Institute Co., Ltd. (Beijing, China), Beijing NavInfo Technology Co., Ltd. (Beijing, China), Baidu Online Network Technology (Beijing) Co., Ltd. (Beijing, China), Amap Software Co., Ltd. (Beijing, China), and China Wuhan Zhonghaiting Data Technology Co., Ltd. (Wuhan, China) and more than 20 map vendors, auto manufacturers, and software and hardware vendors in the field of autonomous driving to be responsible for formulating the “Automatic Driving Map Collection Element Model and Exchange Format” standard. This standard divided high-definition maps into the collection scene basic information layer, the road traffic sign layer, the road traffic marking layer, other road safety facility layers, and the intelligent roadside equipment layer [
17].
The division of the HDMISM mentioned above was mainly based on different perspectives, and the resulting division results were different and could not be unified. However, taken together, it can be found that high-definition maps mainly include three aspects: rich static road data information, dynamic driving environment information, and user-personalized demand information [
18]. Static road data information mainly describes the static data of the road and the environment, and this type of data has a lower update frequency and longer timeliness; the dynamic driving environment information includes real-time road information and vehicle dynamic information, the real-time road information mainly refers to the road condition information at the macro level, and the vehicle dynamic information mainly refers to the instantaneous information of the local traffic scene centered on the vehicle, while the user-personalized demand information mainly refers to various types of demand information and characteristics of passengers, drivers, or map platform administrators. The existing research results have not focused on considering user demand information, and this paper believes that user-personalized demand information is also important information and an important driving force for the optimized development of high-definition maps and needs to be considered separately. In this regard, based on the division characteristics of people’s HDMIS mentioned above, combined with standards such as the OpenDRIVE formulated by Germany, the “Navigation Data Standard” jointly developed by several multinational enterprises in Europe, the “Geographic Data Files” formulated by the European Standardization Committee, the “Road High-Definition Navigation Electronic Map Data Specification” formulated by China, and the “Intelligent Transportation System Intelligent Driving Electronic Map Data Model and Exchange Format” created in China [
19], this paper believes that the HDMIS can be divided into a static map layer, real-time road data layer, vehicle dynamic data layer, and user model layer. The data of the four layers form the whole of the HDMIS, that is, the four-layer HDMISM. This information system covers traditional static map information (map), information considering the constant changes in the road (road), the instantaneous environment information around the vehicle (vehicle), and user demand information (person), taking “map, road, vehicle, person” as the main consideration elements, which clearly define the system thinking of “map-road-vehicle-person” and can more conveniently express the logical relationship of map data in the automatic driving scene. As shown in
Figure 2, it is the approximate contents contained in the four levels. Later, the contents of different data levels will be expounded to provide references for research on high-definition map data association, the discussion of information interaction methods, etc., which has great significance for the development of high-definition maps.
2.1. Static Map Layer
This layer of HDMIS is the basic data upgraded and optimized based on traditional navigation electronic maps, with roads as the main body, expressing more refined and richer traffic environment information, service point information, and information connecting with other transportation systems [
20]. The static map layer contains various types of static data (
Figure 3), which can accurately describe the static traffic environment and help autonomous vehicles understand the driving environment and control the vehicle’s driving status.
Road network information mainly includes data such as road direction, road baseline, and road intersections, which can describe the relationship between road geometry and traffic facilities, providing global navigation information for autonomous vehicles and supporting road-level route planning. Lane network information is more detailed than road network information, mainly providing detailed information about lanes for autonomous vehicles, such as lane passage direction, curvature, slope, width, etc. [
21]. It also has a more detailed description of lane baselines, lane markings, and lane baseline connection points, which can support lane-level path planning. Traffic facility information mainly describes point, linear, or area traffic facility information such as traffic lights, roadside telephone booths, guardrails, soundproof walls, traffic signs, toll stations, landmark buildings, etc. It is the long-term data of the HDMIS. Traffic facility information contains rich national geographic data, so the government is required to work together with the high-definition map management center and related enterprises to ensure the security of these basic data [
22]. The positioning feature reference layer is derivative positioning service information, mainly released by different enterprises, which can complement basic geographic information data. Combined with the traffic facility information layer, it can provide positioning services for autonomous driving scenarios under no-signal or weak-signal conditions. With the gradual enrichment of people’s needs, service point information in navigation electronic maps is also included in high-definition maps. Service point information includes service area entrances, hotels, rest areas, restaurants, train stations, ski cable car stations, banks, tourist attractions, toll stations, stadiums, airports, etc. Such service point information can not only meet the personalized needs of users but also serve as feature references to assist vehicle positioning [
23]. Different passengers or drivers have different lengths of routes, and the transportation tools they need to take are also diversified. Incorporating the connection point information of other transportation systems such as subways, buses, waterways, and railways into high-definition maps could enable people to transfer conveniently.
Overall, in the autonomous driving scenario, the static map layer of the HDMIS can provide rich, detailed, comprehensive, and accurate road information and environmental information (such as road boundaries, lane boundaries, and lane centerlines covered by snow, water, or leaves) for autonomous driving vehicles, thereby guiding the vehicles to drive correctly and safely. The validity time of static map layer data is relatively long, the update frequency is relatively low, the cycle is relatively long, and the update cost is also relatively high, but the static map layer is the most important data layer of high-definition maps and lays an important data foundation for other data layers.
2.2. Real-Time Data Layer
This layer of HDMIS mainly focuses on the real-time traffic conditions of roads and lanes. This type of data has a higher update frequency and has an important impact on the real-time driving route of vehicles in autonomous driving scenarios. The real-time data layer has a large variety of information, which comes from vehicle sensors, meteorological bureaus, road management departments, traffic management departments, and road sensor networks but can be categorized into two types: real-time traffic information and traffic scene information (
Figure 4).
Real-time traffic information can affect the vehicle driving route in autonomous driving scenarios [
24], can support vehicles that access the HDMIS to change the driving route, and can help vehicles optimize navigation decisions. Traffic scene information is a supplement to real-time traffic information, which can provide relevant information about public parking lot scenes for autonomous driving vehicles, as well as provide relevant information about normal driving scenes, temporary road maintenance scenes, and traffic accident scenes, providing support for autonomous parking of autonomous driving vehicles and ensuring the optimization of vehicle navigation decisions.
To sum up, this layer can provide real-time traffic information for autonomous driving vehicles, avoiding delays in travel time due to congestion during driving and the occurrence of dangerous situations due to poor road conditions. The update frequency of the information of the real-time data layer is relatively high, which is between that of the static data layer and the dynamic data layer, and can provide important references for global route planning and optimization.
2.3. Dynamic Data Layer
The dynamic data layer mainly focuses on autonomous driving vehicles, paying attention to the driving behavior of vehicles and the change in the surrounding local scenes in the autonomous driving scenario. Vehicles obtain dynamic information through active and passive sensing methods and simultaneously perform instantaneous control of vehicles in combination with the instantaneous state information of the vehicle [
25] (as shown in
Figure 5).
Active sensing dynamic information refers to the surrounding traffic signals, pedestrians, other vehicles, and other environmental information [
26] that autonomous driving vehicles directly obtain through their own radar, cameras, and other sensors. The dynamic information of the local road scene provided by non-autonomous vehicle sensor systems such as roadside units (RSUs) is passive perceptual dynamic information. For example, the cloud distributes intersection information and traffic accident information to autonomous vehicles, helping them deal with blind spots when turning at intersections in advance, modify travel paths, and so on, thereby avoiding traffic accidents or becoming stuck near traffic accident scenes [
27]. Passive sensing dynamic information makes up for the shortcomings of the incomplete active perception of autonomous vehicles and provides a guarantee for the safe operation of autonomous driving. In addition, the status information of autonomous vehicles, such as the vehicle’s position, driving direction, current speed, and acceleration, also needs to be considered to ensure the safe driving of autonomous vehicles. In this way, autonomous vehicles can effectively predict special scenes, such as reverse driving, overtaking, abnormal vehicle temporary parking, and running red lights, and then take timely countermeasures and reasonably control the vehicle to avoid collisions.
The information of the dynamic data layer has a very high update frequency. Information acquisition is mainly carried out by vehicle-mounted sensors, and by cooperating with the information of the static map layer and the real-time data layer, it can ensure the efficient and safe driving of autonomous vehicles and then meet the user needs from the user model layer.
2.4. User Model Layer
The user model layer of the HDMIS mainly focuses on data sharing based on user (passengers and drivers at L2/L3 levels) requirements, providing services for them. The main data types include user data sets and vehicle characteristic data sets (
Figure 6) [
28].
Drivers are involved in the driving of low-level autonomous vehicles, and the characteristic information such as their reaction speed, mental state, physical condition, driving skills, driving habits, cognitive level, cultural background, and driving experience all affect the vehicle operation. Therefore, the characteristic information of the driver needs to be included in the user model layer. In addition, the characteristic information of the passengers of autonomous vehicles, such as the number, age, personality, cultural background, mental state, and physical condition, also affects the driving of autonomous vehicles [
29]. In order to better meet the needs of users, it is necessary to include the relevant characteristic information of users in the user model layer of the high-definition map. In addition, the characteristic data of the vehicle’s configuration, mechanical performance, and equipment performance determine the operation control of the autonomous vehicle. This is a clear difference from the vehicle status information in the aforementioned dynamic data layer, but the user layer needs to cooperate with the vehicle status information in the dynamic data layer to optimize vehicle control.
The above four data layers cooperate with each other to provide perception, positioning, decision-making, and control services for autonomous vehicles. However, the mutual cooperation of the data at these levels requires the support of some key technologies. The following will present two relatively key technologies to provide support for the update of high-definition maps, the formation of local maps, and so on.
5. Discussion
The HDMISM is a comprehensive concept, and its data logical structure is different, resulting in differences in data association methods. Unifying the data logical structure of the HDMISM is of great significance for formulating high-definition map production standards, unifying information interaction relationships, and standardizing map data association, etc.
The four-layer integrated HDMISM proposed in this paper forms a simple model structure with the four core elements of “map, road, vehicle, and person” as key nodes, making the data logical structure of the system model clearer. In the autonomous driving scenario, the “map” is the most basic tool, and no matter what the data logical structure of the high-definition map is, it cannot be separated from the static map data information. The “road” is a process that the autonomous vehicle must go through, and in this process, what dynamic influencing factors will be faced is what the autonomous vehicle needs to consider and understand. The “vehicle” is the carrier of autonomous driving, which is indispensable in the autonomous driving scenario, but the environment around the vehicle changes all of the time, and relevant information needs to be obtained in time to properly control the vehicle’s travel. “Person” is the cause of vehicle travel in the autonomous driving scenario, and even for an autonomous vehicle without passengers, its travel purpose must also originate from the “person”. In this regard, this paper believes that “map, road, vehicle, person” are the core elements of the high-definition map in the autonomous driving scenario, and it is feasible to divide the data logically with them as key nodes. The proposed map information system model has an important reference value for unifying the data logical structure of the high-definition map information system model.
L5-level autonomous driving is the pursuit goal in the future field of autonomous driving. At that time, autonomous vehicles will be able to drive highly autonomously but still need the support of the four core elements of “map, road, vehicle, person”. Although some scholars believe that “sensors + AI” can replace the human senses and brain to complete vehicle driving without the need for maps, this may be limited to local autonomous driving. In long-distance autonomous driving, rich data support from high-definition maps is still required, especially real-time road information (such as road sections with floods or landslides) that traditional maps do not have. Therefore, in the future production of high-definition maps, “map, road, vehicle, person” can be key factors to consider, unifying the production standards of high-precision maps and ensuring that high-definition maps can achieve functions such as self-learning, self-adaptation, and self-evaluation.