1. Introduction
Based on an analysis of the 37,099 projects in the National River Restoration Science Synthesis (NRRSS) database, Bernhardt et al. [
1] estimated that annual expenditures on stream restoration in the United States exceeded one billion dollars and was increasing exponentially. They identified the most common goals of river restoration as enhancing water quality; managing riparian zones; and improving in-stream habitat and improving fish passage and stabilizing banks. A concern they identified a decade ago was that most small- to modest-sized projects had no post-construction monitoring or assessment of performance, therefore limiting our ability to know if our efforts approximated a river’s remaining natural potential and to learn from successes and failures. Since the mid-1990s, the controversial Natural Channel Design approach that was developed by Dave Rosgen has seen widespread application in the United States [
2,
3,
4]. The backbone of the approach is a classification system that depended on knowledge of bankfull discharges. Simon et al. [
5] argued that a fundamental problem with the approach was that it was based on channel form and did not adequately consider channel processes. We will not jump into this debate, but we concur with Simon et al. that identifying and measuring the bankfull geometry in order to ascertain bankfull discharges is often difficult in an unstable stream system and that it is desirable to collect geotechnical data to help evaluate bank stability. Channel processes include the knowledge of channel-forming or dominant discharges and are often the foundation of river restoration projects [
6]. In this manuscript, we use the term channel-forming discharge to designate either the bankfull (Q
b) or effective discharge (Q
e).
Wolman and Miller [
7] described the bankfull discharge as the stream-flow that filled the main channel and that began to spill onto the active floodplain; while the effective discharge was the discharge that transported the most sediment over time. Goodwin [
8] provided a useful study on what he called the
dominant discharge that determined the cross-sectional and planform characteristics of a channel, and he noted that the dominant discharge could be based on measurements, the effective discharge, or the recurrence interval (RI) of the dominant discharge. Doyle et al. [
6] stated that the bankfull discharge, or a discharge for a user-specified recurrence interval, was used widely in design procedures without consideration of the effective discharge. Although the bankfull and effective discharge were each considered to be channel-forming discharges, they are not identical. For example, Emmitt and Wolman [
9] reported that the ratio of the effective discharge to bankfull discharge, Q
e/Q
b, ranged from 0.98 to 1.31 for five gravel-bed streams in the northern Rocky Mountains, USA. Torizzo and Pitlick [
10], also in the Rocky Mountains, found that Q
e/Q
b ranged from 0.8 to 1.5 for 12 gravel-bed streams. Powell et al. [
11] found that the Q
e/Q
b ratio ranged from 0.4 to 2.5 for 10 large rivers in Ohio. Channel-forming discharges have often been associated with a fixed recurrence interval such as 1.5 or 1.58 years [
12], which were partially derived from the classic work of Leopold et al. [
13]; however, studies also warn against using a fixed recurrence interval [
14].
One method for determining the effective discharge is the Wolman–Miller Model [
15], which Powell et al. [
11] illustrated for the Maumee River in Ohio at a United States Geological Survey (USGS) gauge having a catchment area of 16,395 km
2 (
Figure 1). In this study, 43 years of discharge and suspended-sediment data were divided into 100-m
3/s discharge ranges (bins) starting with 0 to 100 m
3/s, 100+ to 200 m
3/s, 200+ to 300 m
3/s, and so forth until a final bin was made that contained all data for discharges larger than 2700 m
3/s. The total sediment load per bin (solid line) was the product of the two dashed lines (days in a particular bin multiplied by the average daily load for discharges in the bin). Low discharges were ineffective in transporting sediment, and extreme events had very high sediment transport rates but occurred infrequently so the total sediment load they carried over a period of many decades was not the bin with the largest average discharge.
For the result shown in
Figure 1, the effective discharge was about 700 m
3/s; however, there were two potential issues with this approach. First, at a particular gauge, there might be considerable variability in the calculated effective discharge, as it will vary with the flow range of the bin. Powell et al. [
11] illustrated this concept for the Maumee River by reporting larger estimates of the effective discharge, in part, because wash loads were excluded and perhaps because the bins were sized based on stage increments rather than discharge increments. Simon et al. [
16] stated that mean daily values of both flow and sediment loads, which were readily available from the USGS, tended to be biased towards lower flows, particularly in flashy basins. They noted that using data at shorter time increments was best but that type of measured data was seldom available. Second, in addition to the lack of a standard approach to determining bin sizes for an effective discharge analysis, the approach sometimes gave more than one peak (effective) discharge. This might be an anomaly of the analysis but also could be the geomorphic reality [
17].
Useful guidelines on determining the effective discharge have been published by Biedenharn et al. [
18]. Other approaches to determining the effective discharge included using a sediment transport threshold, such as the discharge that transported 50% of the sediment, to quantify the channel-forming discharge [
19,
20]. For large rivers in Ohio, Powell et al. [
11] found the ratio of the threshold discharge, associated with transport of 50% of the sediment, to the bankfull discharge was 0.5 to 1.8. Powell et al. [
11] reported that the effective discharge and bankfull discharge often had RIs of less than one year; however, they also found that even in rivers with RIs of at least a year the channel-forming discharges, or large discharges, occurred 1 to 24 days a year. Their results were not unique and were in contradiction not only with statistical expectations but also numerous studies in the literature [
5,
6,
21]. Nolan et al. [
22] found that discharges, which transported 50% to 90% of the sediment load, had recurrence intervals (RIs) of 0.27 to 16.1 years. Andrew and Nankervis [
15] and Sichingabula [
23] reported discharges that transported 50% to 80% of the sediment load to have flow durations of 11 to 80 days annually.
For headwater rivers in Idaho, Whiting et al. [
24] reported RIs of 1.0 to 2.8 years for the effective discharge. In a study in the northern Rocky Mountains, Emmett and Wolman [
9] reported RIs ranging from 1.5 to 1.7 years, while MacRae [
25] reported RIs of 1.6 to 10 years for a study in British Columbia. Andrews and Nankervis [
15] described the effective discharge as that which transports the largest portion of the mean annual bed-material load over time, and in the gravel bed rivers located in mountainous parts of Colorado, they found that 80% of the mean annual load was transported by flows between 0.8 to 1.6 times the bankfull discharge, which on average occurred as 15.6 events per year.
Gomez et al. [
26], in a study of a gravel-bed river in New Zealand, estimated that the average RI of the effective discharge was 4 ± 2 years but it was equaled or exceeded three times a year. In a study of 12 urban river channels in southern Ontario, Canada, Annable et al. [
27] reported that bankfull discharge occurred more than once a year in all of the rivers, often averaged from four to eight bankfull discharge or larger events per year, and in one case 18 events occurred in a year. Other examples of channel-forming discharges occurring many times annually include: Andrews [
28], who reported effective discharge flow duration values of 1.5 to 11 days annually, with corresponding recurrence intervals of 3.26 and 1.18 years, respectively; and Wolman and Miller [
7], who reported channel-forming discharge or larger of 6 to 11 days per year.
Numerous studies and methods have been used to develop relationships between discharge and RI, but much focus has been on applying these methods to extreme floods. Common approaches that are used to analyze river discharge data are based on: (1) an analysis of an annual series of peak values; (2) a partial duration or peaks over threshold approach; (3) an analysis of a probability density function; and (4) an annual frequency of exceedance approach. In the United States, the most commonly used method by state and federal agencies is the Log-Pearson Type 3 statistical method and procedures [
28,
29,
30,
31]. As the main purpose of the Log-Pearson Type 3 statistical method was to identify probable maximum discharges and flood elevations, it is usually used with an annual series of “instantaneous” peak values—a series of the highest discharge in each year that was recorded at an interval of 5 to 60 min depending on the river. The main disadvantage of using an annual series is that it excludes large floods when several occur in a single year [
32].
Using a partial duration series, also called peaks over threshold approach, to determine the RI has also seen much application [
33,
34,
35,
36,
37,
38,
39]. For frequent floods, it is the recommended method as it provides better estimates than the annual series of peaks approach [
38]. The main problem with the peaks over threshold approach is determining the threshold to be used.
Logic and methods based on sediment transport, such as the effective discharge or half-load discharge [
40], suggest that using a continuous series of discharges should be used to determine channel-forming discharges. If a continuous series of data is not available, such as discharge values every 15 min, an approach commonly used is to develop histograms and probability density functions (pdf) based on mean daily discharge data. There is much variability in how different scientists have developed strategies on how to perform this type of analysis. Doyle et al. [
41] divided a histogram into 25 bins to represent the pdf of the daily discharge. Crowder and Knapp [
42] used an iterative approach to reducing the number of bins from 25 until each bin had at least one discharge value. In addition to a lack of a standard approach to determining the bin size, the main drawback of a pdf approach is that the magnitude of the largest calculated RI is the number of years of available data. Determining the annual frequency of exceedance, for example, of specified mean daily discharge can be obtained from a histogram based on an annual series of mean daily discharges—or values associated with other time increments such as 15 min. In order to maximize the stability of channel reaches, channel-forming discharge is usually adopted as the design discharge in channel design and restoration [
6].
We have cited just a few of the many studies that consider the annual frequency of occurrence of channel-forming discharges. The problems with the approach and its usefulness in river restoration are in knowing that a channel-forming discharge or larger occurs a certain number of times annually. Quader and Guo [
43] warned that an inaccurate estimation of the design discharge in river restoration could adversely affect project performance. Amongst others, Doyle et al. [
6] also noted that one should not just rely on return intervals or bankfull discharges in river restoration designs.
The goal of our study was to address statistical inconsistencies in the range of channel-forming discharges using 150 case studies across six states in the North Central Region of the United States, and to propose a novel, alternative approach to determining discharge versus RIs based on daily values rather than peak values, which we call the Full Daily Distribution (FDD). Specific objectives of our study were to: (1) determine the annual frequency of occurrence of daily discharges ranging from 2 to 100 years with particular focus on discharges with a RI of 10 years or less by comparing a simplified Log-Pearson Type 3 method to the more sophisticated method commonly used for USGS gauged sites; (2) determine if our method of calculating annual frequency of occurrences using mean daily values was consistent with statistical probability expectations; and (3) evaluate the appropriateness of the use of the new FDD approach to determine discharge versus RI relationships.
4. Discussion
The evidence from this study suggests that the frequency of channel-forming discharges that have a recurrence interval less than five years are likely to occur much more frequently than statistical expectations. The calculated smallest number of occurrences in 100 years for the two-year recurrence interval daily discharge was 62 times for one of the catchments, while the highest number of occurrences for the two-year event was 1854 days in 100 years. On average, a two-year or larger discharge was calculated to occur about 2.5 days annually.
Based on an annual series of peak daily means, the ratio of the calculated to the expected mean number of occurrences were highly correlated to the recurrence interval with the exception of the results for Ohio (R-squared = 0.64, see
Figure 3). However, when the state-to-state or catchment-to-catchment variability was considered (
Figure 3) the correlation was too low for scientific purposes. The analysis based on discharges 25% larger or smaller than those obtained from the Log-Pearson Type 3 analysis indicated that the discharges were generally unlikely to be much smaller or larger than the calculated values.
The results obtained with the full daily distribution (FDD) approach were in general agreement with a study conducted by Endreny [
32] with data for 46 bankfull surveyed gauges in Maryland, North Carolina, and New York. Using a flow duration approach with a daily peak series he found that on average the bankfull discharge occurred 2.1 days annually and a maximum mean number of times of 13.2 days annually. He determined that by using either an annual instantaneous peak series or an annual daily peak series the recurrence interval of the bankfull discharge was about 1.4 years and concluded that use of a flow duration analysis helped to describe that bankfull discharges occur more than one time or one day annually. Navratil et al. [
50], in a study of 16 gravel-bed rivers in France, evaluated five different methods to identify bankfull and determined that the mean annual duration of occurrence varied from 4.5 days to 11.5 days depending on the method. One of their conclusions was that the annual maximum flood (AMF) analysis should be avoided.
Using the FDD method for each catchment the maximum recurrence interval was limited by the number of years in the period of record. For example, for the results reported in
Table 7, the maximum recurrence interval was 89 years.
The FDD results were a function of the K value of 20 used in the study. In
Table 7, calculated values are the number of exceedances and expected values are the number of occurrences (days) for a RI. The difference between the calculated exceedances and expected occurrences were the number of discharge values in the bin with the maximum discharge shown. For example, with K equal to 20, there were 146 discharge values larger than 380 m
3/s and six discharge values (152–146) in the bin containing discharges in the range 379.47 to 380 m
3/s. The expected value of 152 is 89 years/0.586 years (0.586 is round to 0.59 in
Table 7). Using a K value of 20, as shown in
Figure 7B, resulted in smaller bin discharge ranges than for a K value of 0.5 (
Figure 7A) and a better representation of the frequency distribution of different magnitude discharges. For example with K values of 20 and 0.5 the 4.94-year recurrence interval discharges are 598 m
3/s and 641 m
3/s, respectively. For the Sandusky River catchment, with a period of record of 89 years, a K value of 20 gave 1780 bins (20 times 89) while a K value of 0.5 only gave 45 bins. For a K value of 20, the average number of values in each bin was 18.3 for all the catchments while a K value of 0.5 had an average of 722 values.
The strength of the FDD approach was that the calculated number of occurrence were always in close agreement with the expected number of occurrences. The FDD approach does, therefore, provide a useful approach for discharge versus recurrence interval relationships in both the range of channel-forming discharges and also for extreme floods. However, it is less useful for extreme floods as the maximum recurrence interval that can be calculated by the method is the number of years in the period of record.
The number of occurrences of a discharge associated with a recurrence interval raises the question as to what is the meaning of the term channel-forming discharge. It might be more appropriately called a channel-forming threshold discharge. Discharges below the threshold are sufficient to transport sediment and to shape the channel geometry—geomorphic work associated with scour and deposition. The function of discharges above the threshold will depend on floodplain connectivity and the size of the floodplain. What is important to note is that, statistically, all discharges higher than the threshold are included in the statistical definition. Therefore, in mountain streams where the banks will generally not be flat and in locations where the floodplain is small, the functional usefulness of basing a restoration design just on a channel-forming discharge must be questionable. A number of approaches based on sediment transport, hydraulics, and dynamics of the system have more merit [
1,
51] than using an uncertain estimate of the channel-forming discharge.
Other implications of the statistical meaning and statistical uncertainty of channel-forming discharges for river restoration are unclear. For example, He and Wilkerson [
52] determined that in some cases regional bankfull geometry relationship can be better estimated as a function of the 2-year recurrence interval discharge or this discharge and the drainage area. If that approach is used it would be desirable to have a standard method for determining the 2-year recurrence interval discharge.
In the United States, a commonly used Internet tool for ungauged sites is the USGS StreamStats Method (
http://water.usgs.gov/osw/streamstats/) that, based on regional regression equations, can provide discharge versus recurrence interval estimates for any location in the more than 30 states where StreamStats is operational. A problem with the approach is that reported errors in a discharge estimate usually range from 30% to 50%. Like other prediction methods, it is necessary to calibrate the method with data from the nearest suitable river gauge [
33,
53]. Typically, this is done by developing calibration variables based on first using discharge data at the river gauge and then relating the results to a Log-Pearson Type 3 analysis of an annual series of maximum instantaneous peaks as that data are readily available from the USGS-NWIS state websites. The results of this study indicated that calibration with an FDD analysis of mean daily discharges might be more appropriate. However, this creates an inconsistency in methodology as hydrologic prediction methods usually estimate peak discharges not mean daily discharges.