To conduct the CCAM-CTM modelling system performance evaluation in a robust way, the meteorological modelling predictions from CCAM must be evaluated first. The performance statistics at each site for the predicted hourly surface temperature, surface winds, and precipitation against observations were investigated and a more detailed evaluation can be found in [
31]. The main outcomes from the CCAM evaluations show: (1) CCAM generally overpredicts surface temperature (°C) in both SPS1 and SPS2 periods across OEH and BOM sites. The values of averaged MB and R are 1.14 and 0.92 for SPS1, and 0.93 and 0.82 for SPS2, which show CCAM has a tendency to overpredict temperature in summer (SPS1) rather than in autumn months (SPS2). (2) CCAM overpredicted surface wind speed (m/s) across OEH and BOM sites in both SPS1 and SPS2 periods. The averaged MB at OEH sites for surface wind predictions is 1.93 (SPS1) to 1.97 (SPS2), and 0.52 (SPS1) to 0.62 (SPS2) at BOM sites. The results show that CCAM tends to predict stronger surface winds at OEH sites compared to that at BOM sites in both summer and autumn months.
4.1. PM2.5
The model predictions of hourly PM
2.5 mass concentration, as a summation of various components, were evaluated against available PM
2.5 observations at Chullora, Earlwood, Richmond, Liverpool and Wollongong.
Table 2 presents the quantitative performance statistics at each site for predicted hourly PM
2.5 concentration against observations, along with the mean and standard deviation of predictions and observations for the SPS1 and SPS2 periods. The PM
2.5 is generally overpredicted across Sydney in SPS1, where values of MB, NMB and MFB are 0.58, 11% and 23% at Earlwood; 0.92, 19% and 47% at Richmond; 0.98, 20% and 35% at Liverpool; and 0.56, 10% and 24% at Wollongong, respectively. The only exception is at Chullora, where PM
2.5 is underestimated and values of MB, NMB and MFB are −0.28, −4% and 58%, respectively. The values of ME, NME and MFE across these sites are in the range of 2.54–2.86, 65–79% and 46–69%, respectively. Relatively high correlation coefficients (0.50–0.51) and high IOA (0.66) can be found at both Earlwood and Liverpool, while the relatively low MFB and MFE are also found at Earlwood and Liverpool. The overprediction of PM
2.5 concentrations across Sydney is also obvious in SPS2, with the only exception at Liverpool. A lower MFB (compared to that in SPS1) is found at Chullora, Richmond and Liverpool, while a slightly higher MFB is found at Earlwood and Wollongong. A generally higher MFE is found in SPS2 across all sites, except at Chullora. The better correlation and agreement between hourly PM
2.5 predictions and observations in SPS2 is found at Chullora, with the values of R and IOA of 0.50 and 0.67. The worst correlation and agreement occur at Richmond, in terms of the lowest R (0.18) and lowest IOA (0.48).
The bugle plots for hourly PM
2.5 for all predicted/observed pairs for the five air quality monitoring stations during SPS1 and SPS2 are shown in
Figure 5. Model performance goal and criteria proposed by Boylan and Russell [
30] were used to benchmark PM
2.5 predictions in our study. In
Figure 5a, the PM
2.5 predictions for SPS1 (black star) at every site meet the performance criteria for MFB (±60%), and there are two sites (Earlwood and Wollongong) further complying with the performance goal for MFB (±30%). The hourly PM
2.5 predictions at five sites in SPS2 (green triangle) also meet the performance criteria for MFB, and there are four sites (Chullora, Earlwood, Richmond and Liverpool) further complying with the performance goal for MFB. In
Figure 5b, the PM
2.5 predictions at every site in SPS1 and SPS2 meet the performance criteria for MFE (75%). However, there are only two sites (Earlwood and Liverpool) that meet the performance goal for MFE (50%) during SPS1 (black star in
Figure 5b) and none of the sites in SPS2 comply with the performance goal for MFE.
Comparisons of predicted and observed PM
2.5 were further examined using Taylor diagrams, which provide a concise way to summarize some statistical metrics used for model evaluation. The Taylor diagram combining the correlation (i.e., R in
Table 2) along with the normalized standardized deviation and centred RMSE is presented in
Figure 6 for all sites in SPS1 and SPS2. Most of the sites are characterized with low correlations, along with high normalized standard deviation and high centred RMSE. This reflects one of the difficulties to correctly predict PM
2.5 concentrations without having well captured highly variable emission sources, such as anthropogenic sources including wood smoke, motor vehicles and coal-fired power stations and other industry point sources, as well natural sources sea salt, wind-blown dust and soil [
32,
33].
Figure 7 shows a time series of predicted and observed hourly PM
2.5 across five sites during the periods of SPS1 and SPS2. CCAM-CTM generally predicted the variations of PM
2.5 for most days at most sites. Overall, the model slightly overpredicted PM
2.5, however, moderate underpredictions of hourly PM
2.5 peak on the high PM
2.5 days can be found across most sites. For example, the model is not able to capture PM
2.5 peaks on 11 February (SPS1) at Chullora, Richmond and Liverpool; it also has difficulty predicting PM
2.5 peaks on 22 April (SPS2) at Chullora, Earlwood, Richmond, Liverpool, and Wollongong.
In a review of various studies [
34], a common trend is found where PM
2.5 is overestimated during the winter and underestimated during the summer. The wintertime overestimate of PM
2.5 total mass is contributed by overestimates of OC and nitrate, while sulphate and OC contribute most of the summertime PM
2.5 underestimate reported in the literature. When some species are overpredicted and some are underpredicted, the evaluation of speciated PM
2.5 may provide more insightful information of model performance in addition to an evaluation for total mass of predicted PM
2.5.
The spatial distribution of predicted hourly average PM
2.5 concentrations for SPS1 and SPS2 are shown in
Figure 8a,b, and the hourly maximum PM
2.5 concentrations for SPS1 and SPS2 are shown in
Figure 8c,d. The areas of elevated PM
2.5 concentrations are found to coincide with populated regions (Sydney, Wollongong and Newcastle) with an average concentration of 6 µg/m
3 and a maximum hourly value of 20 µg/m
3. Significantly elevated PM
2.5 is also found in the Upper Hunter region, about 200-km northwest of Sydney, which has a northwest–southeast oriented valley. The emissions from open-cut coal mines, coal-fired power stations and agriculture industries are found to have major contributions to high PM
2.5 in this region [
21,
35].
4.2. O3
The model predictions of hourly O
3 concentrations were evaluated against observations. Statistics were calculated separately across the 18 NSW OEH air quality monitoring stations shown in
Figure 4. However, to simplify the comparison, a regional average of statistical metrics (as suggested in [
12] and done in [
36]) was computed across sites in the Sydney East, Sydney Northwest (Sydney NW), Sydney Southwest (Sydney SW), Illawarra and Newcastle regions.
Table 3 presents statistical measures for the predicted hourly O
3 concentrations averaged over each region as defined above, along with the mean and standard deviation of predicted values and observations for the SPS1 and SPS2 periods. The CCAM-CTM shows a tendency to underpredict hourly O
3 concentrations in SPS1 (summer months of 2011) across all regions, with only one exception for the Illawarra region, where the O
3 is overpredicted. The values of MB, NMB and MFB for these regions are in the ranges of −2.31–0.52 ppb, −11–4% and −9–11%, respectively. The values of ME, NME and MFE are in the ranges of 5.57–6.99 ppb, 56–73% and 36–48%, respectively. The maximum RMSE across regions is 7.64 ppb. The highest values of correlation coefficient (0.75) and IOA (0.87) are both found in the Sydney NW region, while the lowest value of correlation coefficient (0.51) and IOA (0.64) are in Illawarra. In SPS2 (autumn months of 2012), CCAM-CTM demonstrates greater tendency to underpredict O
3 compared to the performance in SPS1. The hourly O
3 concentrations are generally underpredicted in the Sydney East, Sydney NW, Sydney SW and Illawarra regions, while O
3 is overpredicted in the Newcastle region. The values of MB, NMB and MFB for these regions are in the ranges of −3.07–3.55 ppb, −16–29% and −20–45%, respectively. The values of ME, NME and MFE are in the ranges of 6.04–6.98 ppb, 61–92% and 59–72%, respectively. The maximum RMSE across all regions is 6.23 ppb. The worst correlation between predictions and observations is found in the Illawarra region, with a correlation coefficient of 0.34 and IOA of 0.55.
Simon et al. [
34] reported RMSEs in the range of 15–20 ppb for hourly O
3 concentrations in most model validation studies, and the RMSEs in our studies are considerably lower. The US EPA recommended benchmarks of MFB and MFE are ±15% and 35% for ozone predictions [
12]. Accordingly, the hourly O
3 predictions in SPS1 across all regions comply with the benchmark of MFB of ±15%; however, none of the regions comply with the benchmark of MFE of 35%. In SPS2, only the O
3 predictions in Sydney East, Sydney NW and Illawarra regions comply with the benchmark of MFB of ±15%; none of the regions comply the benchmark of MFE of 35%. The implementation of cut-off values for background O
3 is suggested by US EPA [
12], which suggests that data under the cut-off values are discarded in the evaluation. Follow this guideline, MFB_15 and MFE_15 with cut-off value of 15 ppb for background O
3 in Sydney [
37] were calculated (results shown in
Table 3). The values of MFB_15 and MFE_15 across all regions in SPS1 (SPS2) are in the range from −17% to −26% (−9–−48%) and from 26% to 38% (27–53%). There is an increase in MFB_15 and a decrease in MFE_15 compared to the values of the original MFB and MFE. Based on the values of MFB_15, none of the regions would comply with the benchmark of MFB (±15%); however, based on the values of MFE_15, O
3 predictions in the Sydney East (SPS1), Illawarra (SPS1 and SPS2) and Newcastle (SPS1 and SPS2) regions comply with the benchmark of MFE (35%) and predictions in the Sydney NW (SPS1) and Sydney SW (SPS1) regions are very close.
The predicted and observed O
3 presented in the Taylor diagram (
Figure 9) generally shows a more homogeneous pattern across the sites compared to the PM
2.5 predictions shown in
Figure 6. The correlations range 0.4–0.7, however, the model performance in O
3 predictions for SPS1 (summer months) are slightly better than that in SPS2 (autumn months) due to higher correlations, lower normalized standard deviation and lower centred RMSE.
Figure 10 shows a time series of predicted and observed hourly O
3 concentrations at selected OEH sites that represent different regions. Chullora and Randwick both represent the Sydney East region; Richmond, Bringelly, Wollongong and Newcastle represent the Sydney NW, Sydney SW, Illawarra and Newcastle regions, respectively. CCAM-CTM generally captures the diurnal variations of O
3 for most days at most sites with a slight underestimation. Unlike what we found for predicted PM
2.5, predicted O
3 has a larger seasonal dependency. The higher O
3 concentrations due to increased photochemical production are seen in both observations and predictions in the summer months (SPS1). The model is able to reproduce most of the high ozone events in SPS1 and also captures the overall concentration variation in the autumn months (SPS2). However, the model tends to underpredict the daily maximum hourly ozone concentrations that occur at 15:00 (AEST) during most cases. The inaccurate predictions of peak O
3 concentrations are attributed to the combination of several factors, including the uncertainties in the NOx and VOCs emission estimations, which are the precursors of O
3 [
38], and the overpredicted surface wind speeds in the CCAM [
31] that may also contribute to the underpredicted O
3 concentrations.
The spatial distribution of predicted average hourly O
3 and average daily maximum O
3 concentrations for the SPS1 and SPS2 periods are shown in
Figure 11. In the summer months during SPS1 (
Figure 11a), areas of higher average ozone concentrations (18 ppb) are found over the Blue Mountains National Park (100 km west of Sydney) as well as the Wollemi National Park (200 km northwest of Sydney). In the autumn months during SPS2, generally lower than average ozone concentrations are found across Sydney (
Figure 11b) compared to that in SPS1. It should be noted that an area of elevated ozone is found over the ocean east of Sydney in both SPS1 and SPS2. There are no surface ozone observations over the ocean to compare with at this stage, however, the possible high ozone may be due to lack of local sources of NOx over the ocean, which leads to less surface deposition. The spatial distribution of O
3 daily maximum concentrations averaged for the period of SPS1 and SPS2 are illustrated in
Figure 11c,d. It clearly demonstrates the higher peak hourly ozone tend to occur in the Sydney Northwest region and the Wollemi National Park region during summer months (
Figure 11c). Peak hourly ozone significantly decreases during autumn months (
Figure 11d), however, the reginal maximum still can be found in the Sydney NW region.
4.3. NO2
The model predictions of hourly NO
2 concentrations were evaluated against observations.
Table 4 presents the regional averaged quantitative performance statistics for predicted hourly NO
2 concentrations against observations, along with the mean and standard deviation of predictions and observations for periods of SPS1 and SPS2 periods. Generally, the hourly NO
2 concentrations are underpredicted across all regions in both SPS1 and SPS2. In SPS1, the bias, in terms of MB, NMB and MFB, are smallest in the Sydney East region, with values of −1.11, −13% and −10%, respectively; and are largest in the Newcastle region, with values of −2.26, −46% and −54%, respectively. The values of ME, NME and MFE are in the ranges of 2.76–4.03 ppb, 60–87% and 60–82%, respectively. The maximum RMSE across all regions is 3.87 ppb. A better correlation between NO
2 predictions and observations is in the Sydney East, Sydney SW and Newcastle regions, in terms of higher correlation coefficients (0.49, 0.48 and 0.59, respectively) and higher IOA (0.71, 0.70, and 0.68, respectively). In SPS2, similar to what we found in SPS1, the smallest biases are seen in the Sydney East region with MB, NMB and MFB of −0.4, −1% and −6%, respectively; and the largest biases are found in the Newcastle region. The values of ME, NME and MFE are in the ranges of 4.22–6.63 ppb, 68–99% and 54–80%, respectively. The value of maximum RMSE across regions is 7.77 ppb. The best correlation between predictions and observations is seen in the Illawarra region, with correlation coefficient of 0.54 and an IOA of 0.72, while the worst correlation is found in the Sydney NW region, with correlation coefficient of 0.35 and an IOA of 0.62. The statistics for NO
2 show a larger bias and error compared to the statistics calculated for O
3 (
Table 3) due to the higher sensitivity of NO
2 predictions to uncertainties and errors in the emissions and meteorology, similar to [
38]. While there are no benchmarks available for NO
2 validation, the MFE and NME of NO
2 from our study fall well within similar ranges inferred from [
39].
Comparisons of predicted and observed NO
2 were further examined using Taylor diagram, as presented in
Figure 12. For all sites in both periods, the correlations fall within a range of 0.2–0.6. The higher correlations along with lower normalized standard deviation and lower centred RMSE indicates that CCAM-CTM tends to predict NO
2 in the summer months (SPS1) better than in the autumn months (SPS2).
Figure 13 shows a time series of predicted and observed hourly NO
2 at selected OEH sites, which represent different regions. The model generally captures the diurnal variations and magnitudes well at most sites most of time, with a tendency to underestimate NO
2 across all regions. However, the model tends to overpredict NO
2 peaks at Bringelly (Sydney SW region), Chullora and Randwick (Sydney East region) during SPS2 (
Figure 13g–i) and underpredict NO
2 peaks at Richmond (Sydney NW region), Wollongong (Illawarra region) and Newcastle (Newcastle region) (
Figure 13j–l). The inaccuracies in the model NO
2 predictions are assumed to be highly associated with the uncertainly in the NO
2 emission estimations, the main contributions being from on-road motor vehicles and industrial sources in the NSW GMR [
21].
The spatial distribution of predicted hourly average NO
2 concentrations in SPS1 and SPS2 periods are shown in
Figure 14. Significant elevated NO
2 concentrations are found over the populated Sydney East region as well as in the Upper Hunter during both periods. The model results also show higher NO
2 concentrations predicted in the autumn months in SPS2 (
Figure 14b). These spatial distributions are consistent with those seen in the Ozone Monitoring Instrument (OMI) satellite observations of tropospheric NO
2 columns during the periods of SPS1 and SPS2, as shown in
Figure 15.
4.4. PM2.5 Components
During the Sydney Particle Study, 60 samples were collected for the measurement of aerosol chemical composition, 30 of these in the mornings and 30 samples in the afternoons [
11]. This number of samples is too limited to support detailed performance statistics. Instead, model results for these periods were extracted and averaged over the full sampling season for comparison with the coinciding measurements, with a focus on the summer campaign (SPS1). Model performance is considered reasonable for modelled averages that are within a factor of two of the measured averages. The summer observation campaign identified sea salt (sodium, chloride and magnesium as marker species) and primary and secondary organic matter (organic carbon) as being the major components of PM
2.5, with secondary inorganic aerosol (sulphate, ammonium, and nitrate), soil and elemental carbon also present in significant amounts.
The contribution of modelled and observed PM
2.5 components are shown in
Figure 16 and concentrations provided in
Table 5. Only measured components of PM
2.5 that correspond to modelled species are shown. The model does not explicitly predict concentrations of all measured species, accounting for several species as lumped species (e.g., dust). The components shown are sodium, chloride and magnesium (all components of sea salt), ammonium, sulphate, nitrate, elemental carbon and organic matter.
Organic matter, sodium and the secondary inorganic nitrate and sulphate components are reasonably predicted. Elemental carbon, magnesium, chloride and ammonium are underpredicted, with the predicted mass approximating only about a third of the observed mass in most cases and less than this in the case of ammonium. Results indicate that that the model underestimates the fractional sea salt contribution, and that sources of elemental carbon and ammonia are not fully accounted for in the model.