1. Introduction
Despite a decrease in emissions and ambient levels of pollutants over the years, air quality remains poor in many areas in Europe. As a result, air pollution still forms the biggest environmental risk to human health in Europe, especially in urban areas [
1]. One of the main pollutants causing health effects is particulate matter (PM). The health effects of PM are partly dependent on the size of the particles [
2]. Particle pollution is grouped into two main categories: parts smaller than 10 μm (PM
10) and smaller than 2.5 μm (PM
2.5). The ability of the human body to prevent penetration of particulate matter into human tissue decreases with smaller particle size, causing far reaching health consequences [
3]. It is estimated that long-term exposure to PM
2.5 caused 417,000 premature deaths throughout Europe in 2018. For the Netherlands, the yearly number of premature deaths is estimated at 9900 [
1].
Hence, the European Union (EU) has set up limits regarding maximum levels of particulate matter [
4]. Annual average PM
2.5 concentrations should be maintained below 25 μg/m
3 and PM
10 concentrations should remain below 40 μg/m
3. However, even below these limits, adverse health effects might occur [
3], and according to the WHO “no threshold has been identified below which no damage to health is observed” [
5].
EU legislation requires member states to monitor PM levels with official measurement methods, as defined in the European Air Quality Directive (2008/50/EC) [
4]. In the Netherlands, monitoring takes place using the National Air Quality Monitoring Network (Dutch: Landelijk Meetnet Luchtkwaliteit (LML)) [
6]. In addition, air quality assessment is performed using an extensive set of modeling within the framework of the National Air Quality Cooperation Program (NSL) [
7,
8,
9].
The costs of a national system with high-quality reference stations are high. Consequently, the network is sparse, limiting the ability to capture spatial variability in air pollutant concentrations, which cannot always be compensated for by additional modeling work.
To improve the spatial and temporal resolution of the existing air quality monitoring network, alternative air quality monitoring solutions, such as the deployment of a large number of low-cost sensors, are suggested [
10]. Low-cost sensors might be added to the existing network to improve its spatial representation [
11,
12], and do not need to be operated by official institutions [
13,
14], which makes it an attractive option for governments and citizens alike. Interest in the use of low-cost sensors is rapidly increasing, for a large part driven by their low cost and ease of operation. In Europe, many thousands of low-cost PM sensors are already deployed, primarily through initiatives like “Sensor.Community”. In the Netherlands, several experiments with low-cost sensors have also been performed [
15]. Associated sensor data are available at sensors.rivm.nl. Wesseling et al. (2019) analyzed these stationary sensor measurements and concluded that the rise of citizen science and increasingly better performing low-cost sensors will change the role of Environmental Protection Agencies (EPAs) significantly in the coming years [
13]. In recent years, several studies have given an extensive overview of the use of low-cost sensors, their calibration options and other technical considerations [
16,
17,
18]
Another reason for the increasing interest in low-cost sensors is their potential for personal exposure monitoring [
17,
19]. Typically, estimates for exposure are based on modeled PM concentrations at home addresses or zip codes. However, these studies cannot account for exposure to pollutants in other locations, or, for instance, while commuting. Low-cost sensors offer an attractive method to assess individual exposure to particulate matter as they can be used as portable sensors [
20]. However, typical studies on personal exposure characterization using portable sensors have few participants, which makes it difficult to use these data to improve the spatial resolution of PM measurements. When there are many individual participants in an experiment new challenges arise as the quality of the measurements may vary significantly [
14].
In this study, we aim to improve the spatial and temporal resolution of PM
2.5 measurements by deploying a large number of low-cost PM sensors mounted on bicycles. Moreover, we specifically investigate whether it is possible to estimate exposure to traffic related PM
2.5 using these low-cost sensors. This is important because it has been suggested that traffic-related emissions, particularly soot emissions, are most harmful to human health [
21].
In the Netherlands, more than one-quarter of all trips are made by bicycle [
22] while the average number of bikes per person in 2020 was 1.3 [
23]. Cycle paths in the Netherlands are often part of a dense network, particularly in urban areas. Because of this dense network, there is a large variety in the type of cycle paths. Some cycle paths are adjacent to major roads for motorized traffic, while others are completely segregated, e.g., through the woods. This makes it possible to compare different cycle routes with different expected contributions of traffic-related PM. Modeling and empirical research show that PM concentrations in the Netherlands are highest around major transportation routes [
24,
25,
26,
27]
Our study has three main objectives. First, we will discuss the quality and validity of PM2.5 measurements using low-cost sensors on bikes. Secondly, we will use these measurements to estimate exposure to traffic related PM2.5 during typical commutes. Finally, we will reflect on the use of mobile, low-cost air quality sensors and their potential to estimate personal exposure to PM.
2. Materials and Methods
For this study, we use PM
2.5 sensor data obtained from the “Sniffer bike” project for the year 2020 (in Dutch:
https://snuffelfiets.nl/ (accessed on 25 May 2021)). In this project, around 500 cyclists used the Sniffer bike sensor kit to measure PM
2.5 concentrations. The Sniffer bike is a co-operation between citizens, provinces/municipalities, private companies, and a research institute. The province of Utrecht provides the financing, the project- and program management, and platform for the community. Within the organization of the province, multiple policy areas are involved (e.g., health, mobility and data, knowledge, and innovation). Local volunteers are being involved by municipalities, platforms, such as cyclist associations, groups of cycling friends, and through their jobs as bicycle couriers. SODAQ, an Internet of Things company, participates as a partner for the delivery and maintenance of the sensors. The company Civity is responsible for the data management and the data platform. RIVM, the Dutch National Institute of Public Health and the Environment, validates and analyses the collected data.
The area of the Sniffer bike project in Utrecht is shown in
Figure 1. The province of Utrecht has a very high population density and consists of several urban centers, most notably the city of Utrecht (population of ~360,000; located in the west) and Amersfoort (population of ~158,000; located in the northeast). The center of the province is dominated by National Park “De Utrechtse Heuvelrug”, the second largest forest area in the Netherlands. There are many relatively quiet trails (non-motorized) for walking and biking in this national park. As a result, because of the urban centers and the presence of a national park, there is a large variety of different cycle paths in the study area. Sniffer bike sensors are not geographically restricted to the province of Utrecht; they have been deployed elsewhere, and people are allowed to use them wherever they are. However, for this study, we concentrated on the measurements in the province of Utrecht as the Sniffer-bike project originated there, with most measurements performed there. Of the approximately 25.5 million Sniffer bike measurements in the Netherlands in 2020, about 20 million were located in the area of the province of Utrecht.
The sensor kit in the Sniffer bike contains several sensors and additional hardware, including:
Sensirion SPS30 dust sensor for air quality (PM1, PM2.5, and PM10);
Bosch BME680 temperature and humidity sensor;
Accelerometer to collect data about the pavement quality (LSM303AGR—part of SODAQ Sara SFF board);
GPS and time observations to display the cycled route (u-blox EVA-M8M—part of SODAQ Sara SFF board).
The housing is mounted onto the handlebar of the bicycle (see
Figure 2). The housing contains the sensors, the battery and the communication unit. The communication is facilitated via Long Term Evolution (4G), category M1 (LTE-M), which is a wireless communication standard which belongs to the category of low-power wide-area networks (LPWAN), enabling to connect devices that need small amounts of data, low bandwidth, and long battery life. The IMEI-number is the unique identification of the sensor kit, used throughout the whole process. Being on a mobile platform, the power of the sensors is provided by batteries that have to be charged before the trips. In theory, a charged set of batteries should allow for up to 8 h of measurements, more than enough for daily use. In practice, this was an upper limit and not all users kept the batteries sufficiently charged. A guide on how to build your own “Sniffer bike” sensor kit (of the initial pilot tests) is shown on the website of the manufacturer [
28].
The sensor kit is only active when moving. Every ten seconds a sample of air is taken, and the measurements, including location, are sent to a server at the company Civity. The raw data are then sent to RIVM where it is stored into a dedicated InfluxDB data base and subsequently calibrated. The calibrated data are finally POSTed back to Civity (see
Figure 3 for an overview of the data streams). During the project, we found that the GPS had issues getting a fix on the position when the bikes passed through the woods. In these cases, concentration data were provided without a position and without the possibility to estimate the speed. Therefore, these data could not be used in the analysis. The data of the temperature sensors will be analyzed in a separate project. The other data have not been used so far. The price of the measuring kit is roughly 500 Euros. On top of this, there are costs for creating a data infrastructure and a data portal for participants. The present project was sponsored by the province of Utrecht.
For the calibration of the sensor data, we use official measurements. RIVM performs official measurements of the PM
2.5 concentration in accordance with the EU directive 2008/EC/50 [
4]. This is done using four stations from the LML network. The hourly PM
2.5 measurements are performed with the Met One BAM 1020 [
29], for which was shown that the measurement results are equivalent to those of the reference method. The hourly results of the measurements, additional information, and historic data are available at
www.luchtmeetnet.nl (accessed on 25 May 2021).
Figure 1 shows the locations of the official measurements used in this study.
The dust sensor used in this project is the Sensirion SPS30 [
30]. Based on our previous experiences with this sensor, and other studies [
31], we know that this sensor is not very good at measuring the coarse fraction of PM (PM
10), but reasonably able to estimate PM
2.5-concentrations. Therefore, in this study, we only consider the concentration of PM
2.5.
2.1. Clean-Up and Calibration of the Sensor Data
In this section, we describe the clean-up and calibration of the data. First, we describe the filtering of the data that took place. Next, we discuss how the sensors were calibrated using information obtained by co-locating a set of sensors on official measuring locations. Finally, we discuss the variability and uncertainty of measurements obtained while cycling (i.e., the effect of having to deal with mobile sensors).
The sensor data were first processed to filter out potential improper use of the sensor (mainly indoor use) or malfunctioning sensors. This was necessary as it was not possible to monitor the correct functioning of each individual sensor during the campaign. In the processing, all speeds of the sensor kits were estimated to filter out unreasonably high speeds or stationary use. All measurements obtained at speeds higher than 45 km/h and lower than 5 km/h were excluded. The upper limit corresponds to the maximum speed possible using a “Speed Pedelec” e-bike. After applying this filter, almost all measurements located on highways and train tracks disappeared. To exclude stationary use, we applied a lower speed limit of 5 km/h. This also got rid of most indoor measurements, as these are practically stationary. After applying these speed limits, roughly 60% of the measurements were accepted as valid for further processing.
A small number of sensors provided large numbers of measurements.
Figure 4a shows the relation between the number of measurements and the average calibrated concentrations reported by the sensors.
Figure 4b shows the relation between the average official concentrations in the Utrecht area and the average calibrated concentrations reported by the sensors. So, for each sensor kit, we took the average concentrations of the official measurements during the hours when that specific kit was active and compared this to the average of the kit itself.
Some sensors report values that are relatively low or high with respect to the bulk of the measurements (which scatters around 10 μg/m3). The very low and high values may be correct but could also indicate issues with the sensors. For instance, there is a group of sensors showing very low average concentrations, below 3 μg/m3. These results hardly correspond to the average concentration in the region at the same hours of these sensor measurements. The average concentration in the area was 2 to 8 times higher than the sensors, suggesting malfunctioning sensor kits or improper use. For example, very low values are reported if the inlet of the PM sensor is covered. On the other hand, one sensor reported about 30,000 measurements with an average concentration of 36 μg/m3. As the average concentration in the area during the hours measured by this sensor was only 9 μg/m3, this suggested that the sensor was not functioning properly. The suspect data (average concentrations below 3 μg/m3 and the one sensor with an average of 36 μg/m3), is observed in 21% of the deployed sensor kits reporting valid data, but represents only 0.7% of the total data that were available after initial filtering. Excluding these data from the analyses did not seriously impact any of the results and conclusions.
After the cleanup and filtering, the sensor data were calibrated. Calibration is needed because concentrations reported by low-cost sensors are affected by several factors, such as meteorological circumstances (e.g., humidity). For this calibration, we relied on data from co-located sensors at three official reference stations in the province of Utrecht. A fourth official location was close enough to co-located sensors to also participate in de measurements. As a first check, we compared the official hourly PM
2.5 measurements with the average values of all sensors passing by that location (within 1 km) and with sensors co-located at that station (
Figure 5).
At a first glance, the comparison between official measurements and the results of the sensors is quite satisfying. However, when looking in more detail, there are non-trivial differences between sensors and official measurements, especially at concentrations below 15 μg/m
3. There are correlations between the absolute concentrations, the humidity and the ratio between sensor and official measurement. We did not fit correction factors as a function of environmental parameters but decided to estimate a general hourly correction factor based on the ratios observed at the locations with co-located sensors. As we are looking at measurements in a relatively small and uniform region in the Netherlands, we expect such a general correction factor to be applicable in the whole region. We write the hourly calibration as
, with the hourly correction factor defined by:
here
is the combination of values of sensors co-located at official locations and sensors reporting data while passing within one km of official locations. If
n < 2, then only one station is too limited to estimate a calibration for the whole area and we take
To assess the uniformity of the concentration field in the area, we calculate the standard deviation
of the available official measurements. A large standard deviation indicates a limited representativity of the correction value for the whole area. We impose that
and for
we only apply the hourly correction if it has a relatively small uncertainty:
. In practice, this means that the raw data are used if the calibrations at the available official locations differ too much. We explicitly do not only use the single calibration factor closest to a sensor measurement as this creates a dependency on one co-located sensor and one official measurement. Less representative measurements by either one will then directly impact all sensors in that area. Having multiple sensor kits at all the locations of the official measurements resolves this. A drawback of using a common hourly correction factor for all sensors is that specifics of the sensors, such as aging or malfunctioning, are not considered. However, users of the sensor kits had an app that provided them with information about their trips and measurements. In case they observed suspicious measurements or malfunctions of the sensor, they reported this and were supplied with a new or repaired sensor (if possible). However, given the amount and geographic distribution of the sensors during the project, there was no realistic alternative to our calibration approach, like repeated individual calibration of the sensors.
Comparing the raw sensor data to the official measurements results in standard deviations of the differences between 4 and 6 μg/m3. There is an average bias of −1.2 μg/m3 and 95% of the sensor data lies within ±4 μg/m3 of the official data (95% CI of 8 μg/m3). The distribution of the differences between hourly results of sensors and official measurements shows relatively high and long tails, indicating the potential of sensors to report substantial deviations from the real values. Using the calibration system described above, comparing all available calibrated co-located sensor data to official data results in differences with a 95% CI of 6 μg/m3 and a bias of 0.3 μg/m3. Of course, this is partly because the stations themselves were used in the calibration. It should be noted that the locations of Kardinaal de Jongweg (along a moderately busy open street) and Griftpark (in an urban park) are just 500 m apart. The yearly average concentrations at these locations differ only 1 μg/m3, roughly 10%. Given the close proximity, the data from the sensor co-located at Kardinaal de Jongweg is also combined with the official data at Griftpark. So, this sensor contributes to two calibration factors, one as if it is located at a street location and another as if it is at an urban background location. Of course, other sensors, passing by in the neighborhood of official locations also contribute to the calibration factors.
To test the calibration algorithm, we calculated hourly correction factors for the sensors
without using the official data at location “Kardinaal de Jongweg” and then applied this factor to the sensor located at “Kardinaal de Jongweg” (Leave Station Out, LSO). The results are shown in
Figure 6. The underestimation of the concentration at lower concentration levels and higher relative humidity’s was substantially reduced. Overall, the correlation looks better, although it only slightly improves from 0.86 to 0.87.
The test was performed for all co-located sensors. For each location, the hourly correction factors in the year 2020 were calculated excluding the location itself. These correction factors were subsequently applied to the sensor at that location and the results were compared to the official measurements. When each sensor is corrected using data from the other co-located sensors in the area (Leave Station Out calibration) the differences between sensor data and official measurements decrease and the bias is corrected for. The short distance between Kardinaal de Jongweg and Griftpark contributes to a good performance in the LSO test. Therefore, we have tested the LSO calibration for these stations, using only calibration factors obtained at Cabauw and Breukelen (both well outside of the city). For both the raw sensor data and the LSO calibrated data the mean concentrations, the Mean Average Error (MAE), the Pearson correlation and the Root Mean Square Error (RMSE) have been calculated. The results of the calibration-tests are summarized in
Table 1.
As mentioned before, an important benefit of the calibration is the reduction of the bias, which practically disappears. On average, the Pearson correlation decreases from 0.81 to 0.80. The MAE and RMSE improve by the calibration, for all but one case. In the analysis of the bike data, for an individual sensor in the city of Utrecht the calibration will be based on information of official measurements that were excluded for the purpose of testing the calibration. As a result, the calibration is expected to perform better than shown in these tests. Based on the tests described above, we assume the 95% CI of the calibrated hourly sensor measurements to be in the order of 6 μg/m3. As the hourly concentrations of the co-located sensors are based on many 10-s measurements it is clear that the observed uncertainty must be of a systematic nature. This suggests that the estimate of the uncertainty will also hold to a large extent for measurements that do not last a full hour but only 30 or 15 min.
The Leave Station Out calibration also allows estimating the uncertainty of the calibration regarding the yearly average concentrations. Throughout the whole year of 2020, the average bias between the yearly average concentrations by the sensors (after calibration with the LSO method) was 0.1 μg/m3 and the standard deviation of the differences was 1.1 μg/m3. This is interpreted as a 95% CI of 2.2 μg/m3.
2.2. Scatter of Measurements While on a Bike
To assess the quality of measurements while the sensor kits are mounted on bikes and moving, we look at the values of different sensors when they coincidently pass through the same 50 × 50 m2 grid cell in the same hour. The estimate of the speed is based on the last two positions. This implies that data of bikes that have just stopped at an intersection or at traffic lights are also considered. We require at least two sensors to report data. This results in about 5000 h/cell combinations with co-incident sensor measurements in 2020. First, we take the average of the measurements of each sensor in the cell and then calculate the standard deviation of these averages in the cell. The scatter between co-incident measurements is, with roughly 25–30%, comparable with that between LML and sensors. We therefore conclude that the scatter in measurements does not substantially increase when the sensors are used on (moving) bikes. If the duration of a bike trip takes in the order of one hour, we assume an uncertainty in the average concentration as derived above for the co-located sensors, with the 95% CI of the mobile measurements in the order of 6 μg/m3. As discussed above, we assume the uncertainty to be roughly similar when there are only measurements during a substantial part of an hour.
2.3. Variations of Measurements during Bike Trips
The above uncertainties are all for hourly average concentrations and differences between individual sensors. In order to estimate PM2.5 concentrations during bike trips, the variations of individual sensors over smaller time scales than hours are important.
The 10-s measurements during a bike trip are mainly influenced by (1) the changes in the overall PM2.5 concentrations in the province of Utrecht; (2) the overall calibration of the sensors; (3) random fluctuations in the sensors themselves; and (4) the local sources surrounding the bikes (mainly traffic). For our analysis, it is important that the effects of (1), (2), and (3) are small compared to those of (4) or, alternatively—that we can prevent these effects from influencing the analysis.
To check the effect of large-scale variations of the concentrations in the whole province on the results of individual bike trips (issue 1), we excluded data in hours where the average variation in PM
2.5 concentration measured at the official locations in Utrecht city exceeded 2.5 μg/m
3 per hour. Given that bike trips, on average, take 17 min, this implies that we do not expect a significant influence of changes in the general concentrations in the province on the estimated concentrations along a bike trip. As a result of this constraint, for 6.3% of the bike trips, the concentrations along the trip were not included in the sensitivity analysis (see
Section 3.2). Evidently, incidental fast changes in concentration cannot be excluded but these do not occur often. Moreover, during longer bike trips, there may be an effect of changes in the general background concentration.
As discussed in
Section 2.1, the overall calibration (issue 2) can be addressed using the co-located sensors. We use the hourly calibration factors determined using all available co-located sensors to correct all sensors in the field. As a result, the differences in concentrations measured along bike trips are also calibrated.
Data of measurements during several bike trips suggested that the short-term random fluctuations of the sensors (issue 3) are quite limited. To verify this, we looked at several months of data of the sensors co-located at official locations as these sensors produced data all the time. As bike trips take on average 17 min, just over 1000 s, the standard deviations of every series of 100 consecutive datapoints were calculated. With an average standard deviation of roughly 1 μg/m3, the random fluctuations of the 10-s measurements are indeed quite low.
From the above we conclude that fluctuations in measured concentrations exceeding 0.5–1.0 μg/m3 on a short time scale of 10–30 s are most likely caused by the varying ambient concentrations during a bike trip. At this time scale, the concentrations around the bikes may change substantially. Even at moderate speeds of the bike of 10 km/h it travels some 2.8 m per second. The time between two measurements is enough to cross a street with busy traffic. Between three reported measurements by the sensor kit, it probably traveled (at least) almost 100 m. This represents a large enough distance to travel from, for instance, a relatively quiet street in a residential area into a street with substantial amounts of traffic at short distance from the bike.
As the measured concentrations are calibrated using the hourly calibration factors, changes in these calibration factors during bike trips also influence the estimated contributions from traffic. Assuming the correction factors to change linearly within hours, the standard deviation of changes in the applied calibrations is in the order of 15%, leading to an estimated (systematic) 95% CI of 30%. The fluctuations of the sensors between subsequent measurements lead to a random uncertainty of 1/sqrt(n), with
n the number of measurements during a bike trip. Several typical time series of measurements in bikes are shown in
Figure 7.
In short, we have calibrated sensors using co-located sensors at official stations. Using a “Leave Station Out” approach, we find that systematic biases in the sensor data are corrected for and measurements are more in line with official measurements. Moreover, by checking data from sensors that are, by chance, simultaneously present in the same grid cell, we show that differences in sensor data for different sensors (moving on bikes) are limited. Finally, we checked the fluctuations in measured concentrations at a higher temporal resolution than for the hourly values and find that there generally no large deviations from the overall trend in concentrations.
2.4. Estimation of Exposure to Traffic-Related PM
In order to estimate the exposure of the bikes to emissions of the surrounding traffic we take the concentration profile during a bike trip and assume that at least part of the trip will be at locations where there is little traffic nearby. Typically, a trip on a bike in the Netherlands starts at the home address, most of which are not located directly on a street with busy traffic. Moreover, many office spaces, schools, and other likely destinations (or starting points) of a trip on a bike are not directly on a street with busy traffic. Many bicycle trips will pass through or go along a number of streets and roads with substantial traffic. Most bicycle trips do not last very long and are limited to several kilometers. We take the lowest measured concentration (assuming no uncertainty) during the trip as an estimate for the local background concentration during the bike trip. In practice, there are complications and we cannot simply take the lowest value. At the start of a trip, the sensor needs some time to start measuring properly, especially when the sensor kit is taken from an indoor location, a substantial change in temperature and humidity at the start of the measurements can occur. Depending on the circumstances, we estimate that it may take up to a few minutes for the sensor kit to sufficiently cool down or warm up. Apart from the potential initialization issues, we have to take the uncertainty and scatter of the sensor values into account. Due to these effects, the lowest measured value will probably be lower than the background level we are looking for. We therefore do not use the lowest measurements, but use the 10% percentile of the concentrations as an estimation for the background concentration. Taking either the 5%, 10%, or 15% percentile results in a difference of roughly 0.4 μg/m3 in the derived average exposure to traffic emissions between each step. This difference is treated as an indication of the systematic uncertainty of the approach. The possible variations in correction factors during bike trips lead to a systematic uncertainty of 15% in the estimated exposure. On top of this, the fluctuations of the sensors at the smallest time scale add a random uncertainty. However, the observed random variations between subsequent measurements of the sensors of 1 μg/m3 will, given enough measurements in a bike trip, not contribute significantly to uncertainties of the estimated exposure.
4. Discussion
This study shows that mobile low-cost sensors mounted on bikes can be a useful way to assess PM
2.5 exposure along roads and to estimate PM
2.5 exposure specifically from traffic sources nearby. Our results show that an average cyclist in the province of Utrecht has an estimated traffic-related PM exposure of 2 μg/m
3, on top of the background concentrations, a small but not insignificant contribution. Moreover, our results show clear differences between busy main roads and quiet back roads. Although the absolute differences in PM
2.5 concentration between these types of roads are small, the relative difference is consistently present and in the same direction (i.e., measured PM concentrations are higher close to busy roads). Because of the large number of measurements, it is therefore reasonable to suggest that this difference is caused by PM
2.5 emissions from traffic. Other studies also suggest that it is possible to measure PM from traffic using low-cost sensors [
32].
4.1. Calibration and Uncertainties
It is well known that low-cost PM sensors have various measurement issues and are not as reliable as official measuring equipment. PM sensors are influenced by, for instance, meteorological conditions, such as relative humidity [
33]. Moreover, low-cost sensors might differ in their particle-size selectivity.
Kuula et al. show that the Sensirion SPS30 is able to reasonably measure PM1 and PM
2.5 fractions but does not measure PM
10 accurately [
31]. Initial tests performed as part of the present project also indicated that PM
10 values produced by the Sensirion SPS30 are usually very close to the PM
2.5 values, irrespective of the PM
10 reported by official measurements. We have therefore decided not to report and use the PM
10 values at all.
Many authors have tested the calibration of low-cost sensors. Part of the tests on low-cost particulate matter sensors are performed at relatively high concentration, varying the composition of the particles, e.g., [
34]. These values are very different from the concentrations encountered in the present study, where the average ambient PM
2.5 concentrations are in the order of 10 μg/m
3. Furthermore, the present experiment has no way to assess the size distribution and composition of PM; the calibration has to deal with these types of variations and effects on a real-time basis. Several tests are described in the literature using different schemes to account for the ways environmental conditions influence the behavior of sensors [
35]. Instead of using a calibration as a function of separately measured environmental parameters, we decided to test the relatively simple approach described in [
15], where the hourly calibration of co-located sensors is assumed to be (sufficiently) representative for the area around those locations. Given the relatively small and uniform nature of the project area, we expected this straightforward method to work sufficiently well. It cannot be excluded that a more elaborate calibration scheme will lead to a better calibration of the data from the Sniffer bikes.
Using co-located sensors to correct for systematic biases on an hourly basis, leads to corrected/calibrated sensor values that are much more in line with official measurements, although the variability in the measured concentrations remains large.
We use the 10% percentile of measured concentrations as an estimate of the background concentration. Using the 5% or 15% percentile does not change our results much, although the estimation of traffic related PM2.5 does change. Requiring more measurements in a bike trip also does not influence the results much. We also checked the effect of global variations in PM2.5 concentration in the province on the traffic related PM2.5, which was also limited. The relative difference in concentrations close to busy and quiet roads is consistently present and in the expected direction. Based on the variations in the traffic related PM2.5, depending on the specific choices in the analysis, the uncertainty is estimated to be in the order of 0.5 μg/m3 for the 95% CI.
There are few exceptions where the estimated contribution of local traffic is high at places where this is not expected based on the type of road. In these cases, there are often very few or just one cyclist cycling this route. We expect our estimations to become better when more cyclists are using a Sniffer bike sensor and are cycling for longer periods of time.
It must be noted that the estimated contributions from the traffic are not representative for yearly average concentration contributions as the bicycle trips are not distributed evenly over the whole day and year. Practically all trips took place between morning and evening (see
Figure 8). Moreover, we assume that differences in exposure between busy and quiet roads are due to local traffic. In theory, other local sources close to busy roads might explain the difference, although this is not likely as most other sources of PM
2.5 will be further away from roads (power plants, wood burning).
4.2. Use of Mobile Sensors
The use of mobile air quality sensors makes it possible to estimate exposure to PM
2.5 while riding on a bike. As a result, it can be used to improve exposure assessments in real-world situations. Several other studies highlight this, showing that pollution levels are generally higher close to breathing levels, compared to levels obtained through stationary measurements [
36,
37]. Moreover, by mounting air quality sensors on bikes, the willingness to participate in air quality measuring experiments increases significantly. Interest from local governments is also high because the sensor kits also measure additional parameters such as location, speed, temperature and road conditions. In doing so, these sensor kits also contribute to other policy objectives.
Important limitations of using bikes are the limited spread in routes they take and the distribution of the trips over the day. In practice, there are hardly any bike trips in the late evening or night. As a result, it is not possible to compare the averages obtained by the bike to official yearly average values. A further challenge in using mobile sensor measurements is the limited time the sensors are operational. The sensor is only active while moving around, so the total measuring time is low as bike rides are typically around 15 min long. This short operating time makes it more difficult to assess the behavior of an individual sensor. Therefore, the use of mobile sensors would be improved if the sensor is also active while stationary.
Our results suggest that the measured values are often quite high at the start and end of each bike ride. This might be caused by a change in external circumstances, such as the transfer of the sensor from indoor to outdoor, suggesting sensors need a certain amount of initialization time. Therefore, we have not considered the first and last 5% of each bike ride.
4.3. Effect of Corona Lock Down
For this study, we used data for the year 2020. Our results might be influenced by the fact that the year 2020 was an atypical year because of coronavirus-related lockdowns. In the Netherlands, several lockdown periods were implemented, most notably at the end of March, mid-October, and at the end of December. During these lockdowns, there was a large reduction in commuting while the number of recreational bike trips increased substantially. As a result, there were significant effects of the lockdown on the emissions and concentration of PM
2.5 [
38]. Therefore, our results might underestimate the contribution of local traffic sources because there was less traffic through the studied period and fewer cyclists may have commuted during rush hours.
4.4. Future Directions
There is a lot of interest from local governments to use Sniffer bike sensors, as it has the potential to serve multiple policy objectives. For instance, sensors might be used to characterize not only air pollution exposure, but also heat stress, road quality, and so on. Therefore, we believe the uptake of these sensors will accelerate in the near future. The associated increase in PM2.5 measurements will allow us to better estimate exposure to PM2.5. Moreover, it would be interesting to include a route planner for the lowest PM2.5 exposure and investigate if this leads to meaningful differences in PM exposure over the long-term.
In addition, we believe it is important to have sensors that can be used both stationary and mobile. This allows for a better characterization of sensor behavior and, thus, a better estimation of the quality of individual sensor measurements.
5. Conclusions
In this study, we show how mobile air quality sensors (using the Sensirion SPS30) can be used to estimate typical PM2.5 concentrations to which cyclists are exposed in the province of Utrecht, the Netherlands.
Our study looked at effects that are well within the uncertainty of individual sensors. Due to the large number of measurements in 2020 (more than 8 million) and the combination of these in many bike trips (almost 68,000), we were able to estimate not only the exposure to absolute PM2.5 concentrations during bike trips, but also variations in concentrations during bike trips. We found that the average PM2.5 concentration that bicyclists in the project were exposed to in 2020 was 9.9 μg/m3. We estimate the average traffic-related exposure in the order of 2 μg/m3. During the summer months, we estimate a slightly lower exposure of 1.7 μg/m3. In these months, the combination of absolute concentration levels and dispersion of traffic related emissions may lead to fluctuations in concentrations during bike trips that are in the order of the random uncertainty of the sensor and the analysis.
Important limitations of using sensors mounted on bicycles are the limited spread in routes and the uneven distribution of the trips over the day and year. Similar experiments should consider using even more bikes and aim for more diversity in the use of the Sniffer bike sensor units, such as sharing sensor units or enabling use of the sensors while stationary. Quality control should also improve, making sure that sensor kits that produce little or seemingly unrealistic data are quickly checked and, if needed, fixed.
Nevertheless, overall, we conclude that the use of mobile sensors, mounted on bicycles, allows for better characterization of individual exposure to PM2.5, while using a bicycle for commuting or for recreational purposes. The mobile PM2.5 measurements represents a useful addition to a network of stationary air quality sensors.