Next Article in Journal
Statistical and Criticality Analysis of the Lower Ionosphere Prior to the 30 October 2020 Samos (Greece) Earthquake (M6.9), Based on VLF Electromagnetic Propagation Data as Recorded by a New VLF/LF Receiver Installed in Athens (Greece)
Next Article in Special Issue
Piecewise Modeling the Accumulated Daily Growth of COVID-19 Deaths: The Case of the State of São Paulo, Brazil
Previous Article in Journal
Gradient Profile Estimation Using Exponential Cubic Spline Smoothing in a Bayesian Framework
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State

1
Department of Mathematics and Institute for Systems Research, University of Maryland, College Park, MD 20742, USA
2
Public Health Dynamics Laboratory, Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
3
Health Analytics Network, Pittsburgh, PA 15237, USA
*
Authors to whom correspondence should be addressed.
Entropy 2021, 23(6), 675; https://doi.org/10.3390/e23060675
Submission received: 7 May 2021 / Revised: 23 May 2021 / Accepted: 23 May 2021 / Published: 27 May 2021
(This article belongs to the Special Issue Modeling and Forecasting of Rare and Extreme Events)

Abstract

:
In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such constraints, we describe a computational framework for the systematic fusion of observations from multiple sources to compute tail probabilities that could not be obtained otherwise due to a lack of lower or upper tail data. The estimation of multivariate lower and upper tail probabilities from a given small reference data set that lacks complete information about such tail data is addressed in terms of pertussis case count data. Fusion of data from multiple sources in conjunction with the density ratio model is used to give probability estimates that are non-obtainable from the empirical distribution. Based on a density ratio model with variable tilts, we first present a univariate fit and, subsequently, improve it with a multivariate extension. In the multivariate analysis, we selected the best model in terms of the Akaike Information Criterion (AIC). Regional prediction, in Washington state, of the number of pertussis cases is approached by providing joint probabilities using fused data from several relatively small samples following the selected density ratio model. The model is validated by a graphical goodness-of-fit plot comparing the estimated reference distribution obtained from the fused data with that of the empirical distribution obtained from the reference sample only.

1. Introduction

A challenging statistical problem is the estimation of lower and upper tail probabilities from a given small data set. Challenging as it is, the problem becomes even more arduous when the data set lacks information about lower or upper tail data to the extent that the use of the empirical distribution becomes problematic. This calls for additional data in some form. In this study, the fusion of data from multiple sources allows us to compute tail probabilities, which could not otherwise be obtained due to the lack of lower or upper tail data.
In particular, if the data from a certain source exceed a sufficiently high threshold, then information about lower values below the threshold can be obtained by fusion with other sources that do have data below the threshold. The same holds for sources with data below a given threshold. This necessitates fusion with data sources containing data above the threshold. Our approach is particularly useful when the sample sizes are relatively small and yet probabilities of unusual or extreme values are of interest [1]. Here, we present a multivariate extension of our methodology and demonstrate its application using small pertussis case count data sets.
Pertussis, or whooping cough, is an acute infectious disease of the respiratory tract caused by the gram-negative bacterium Bordetella pertussis. It is highly contagious and transmitted from infected to susceptible individuals by airborne droplets due to coughing and sneezing. Pertussis affects people of all ages but can be serious in infants less than 1 year of age and causes 195,000 infant deaths annually, mostly in developing countries. The global estimates in 2014 were 24.1 million cases and 160,700 deaths from the disease among children below five years of age [2]. Pertussis is endemic in all countries and tends to occur every two to five years in North America and Europe [3].
After widespread vaccination began in the U.S. in the 1940s, the number of new infections reduced to 10,000–40,000 cases of pertussis reported each year, resulting in a 100-fold reduction in the incidence of the disease, thereby making it a likely candidate for elimination. However, since the mid 1970s, pertussis incidence has steadily increased [4]. In 2012, 48,277 pertussis cases were reported in the U.S. (an incidence rate of 15.1 per 100,000), the largest number since 1955 [5]. In Washington state alone, more than 4600 pertussis cases were reported in 2012, mostly among infants aged less than 1 year and children aged 10 years [6]. The incidence of the disease among adolescents of age 13–14 years and adults has also increased, including those previously vaccinated, suggesting early waning of vaccine-acquired immunity.
While vaccination remains the most effective means of preventing illness, pertussis has re-emerged in countries that have sustained high vaccine coverage. In the U.S., pertussis has been a reportable disease since 1922, and case-based surveillance data are available through the National Notifiable Diseases Surveillance System (NNDSS) of the Centers for Disease Control and Prevention (CDC) and, additionally, the Enhanced Pertussis Surveillance (EPS) in seven states [7]. The reasons for this re-emergence are attributable to several factors including changes in diagnostic testing and reporting, increased awareness, mismatch of vaccine antigens and circulating strains, reduced duration of immunity from acellular pertussis (aP) vaccines that replaced whole-cell vaccines in the U.S. during the 1990s, and changes in the B. pertussis organism at the molecular level [7].
During the 2012 pertussis outbreak in Washington state, it was observed that the incidence was highest in infants of age <1 year and children of age 10, 13 and 14 years [6]. The statewide incidence rate was higher among Hispanics than non-Hispanics [6]. Household size [7] and vaccination coverage [8] have been considered among the risk factors of the disease. We have noted such risk factors in Table A1.
Apart from the analysis of factors that affect the resurgence of pertussis, forecasting upper and lower joint tail probabilities of high incidence in a given period of time is another key topic of interest to epidemiologists. While a variety of methods for modeling pertussis incidence have been proposed in recent years [9,10], here we present a method for the forecasting of both univariate as well as multivariate joint tail probabilities using the fusion of pertussis count data obtained from neighboring counties in Washington state. Our approach is based on the so-called density ratio model with variable tilts presented here with a multivariate extension, which is the novel contribution of this study.

2. Density Ratio Model

Given m + 1 independent p-dimensional multivariate random samples X k = { X k 1 , , X k n k } , k = 0 , , m , where n k ’s are the corresponding sample sizes. Suppose that X k has a density g k for k = 0 , , m , where the g k satisfy the density ratio structure
g k ( x ) g 0 ( x ) = exp ( α k + β k T h k ( x ) ) k = 1 , , m ,
where h k is referred to as a tilt functions or simply tilt. The sample X 0 is referred to as the reference sample and the rest of the samples are referred to as tilted samples.
Let α = ( α 1 , , α m ) T , β = ( β 1 T , , β m T ) T and θ = ( α T , β T ) T . Let w 0 ( · ; θ ) = 1 and w k ( · ; θ ) = exp ( α k + β k T h k ( · ) ) . Denote the combined sample by t = { t 1 , , t n } = { X 01 , , X 0 n 0 , , X m 1 , , X m n m } with the corresponding samples size n = n 0 + + n m . Let G 0 be the reference cumulative distribution function that corresponds to the density g 0 . The empirical likelihood function can be written as
L ( θ ; G 0 ) = i = 1 n p i k = 1 m j = 1 n k w k ( X k j ; θ )
with constraints
i = 1 n p i = 1 i = 1 n p i [ w k ( t i ; θ ) 1 ] = 0   k = 1 , , m ,
where p i is the jump of G 0 at t i . By profiling, the p i ’s that maximize the empirical likelihood are given by
p i = 1 k = 0 m n k w k ( t i ; θ ) .
Therefore, the likelihood becomes a function of θ only and we can find the estimator θ ˜ that maximizes the likelihood. Subsequently, the estimator of p i is obtained as
p ˜ i = 1 k = 0 m n k w k ( t i ; θ ˜ ) .
It can be shown that θ ˜ has the asymptotic normal distribution
n ( θ ˜ θ 0 ) d N ( 0 , Σ ( θ 0 ) )
as n . Details can be found in [11,12,13,14,15].
The estimated G 0 is obtained from the accumulation of the p ˜ i ’s,
G ˜ 0 ( ( x 1 , , x p ) ) = i = 1 n p ˜ i I [ t i 1 x 1 , , t i p x p ] .
In the above expression for G ˜ 0 , replacing p ˜ i by 1 / n we get the reference empirical distribution G ^ 0 .
The selection of the tilts h k ’s can be based on [16,17,18].
A flowchart in Figure 1 is provided to illustrate the steps in the data fusion analysis.

3. Application: County-level Pertussis Cases in Washington State

We collected Washington state county-level annual data of the number of pertussis cases from 1997–2018 (Washington Department of Health Website https://www.doh.wa.gov/ (accessed on 1 March 2021)). For each county, we have a sample of size 22. Without any distribution assumption, when county tail data are available we can estimate tail probabilities from the empirical distribution. However, such an estimation is not feasible if tail data are absent. For example, from Table 1 we see that no observation exceeds 30 in Jefferson county so that estimating the chance of exceeding the threshold of 30 from the empirical distribution is not viable.
Nevertheless, the estimation of this probability is possible via the density ratio model if we fuse the sample from Jefferson county with samples from the counties of Cowlitz and Snohomish for which sufficient amounts of data above 30 are available.

3.1. Univariate Analysis

The sample from 0-Jefferson is taken as the reference while the samples from 1-Cowlitz and 2-Snohomish are tilted with tilts h 1 ( x ) = h 2 ( x ) = x as suggested in [14]. Using the fused data from the three counties, and appealing to the density ratio model, tail probabilities for Jefferson County are given in Table 2 for thresholds 30, 40 and 50. As discussed above, these tail probabilities cannot be estimated by the empirical distribution for lack of tail data.
To validate the model, we used the graphical goodness-of-fit discussed in [15]. The idea is to see whether the points ( G ^ 0 , G ˜ 0 ) lie on or close to a 45°-line. From the goodness-of-fit graph in Figure 2, we see that some points lie not far from a 45°-line while others do not, pointing to a possible lack of fit. Moreover, little improvement has been observed by using different tilt functions. To resolve this issue as to the suitability of the density ratio model, we turn to the multivariate version of the model, where a somewhat richer class of possible tilts is used. This leads to, as we shall see in the next section, remarkable improvement in the fit.

3.2. Multivariate Analysis

We took 3-dimensional (that is p = 3 ) samples from three different regions: 0-(Grays Harbor, Jefferson, Clallam), 1-(Clark, Cowlitz, Lewis), 2-(King, Snohomish, Skagit). The order for each region is from the most to the least populated. Therefore, we obtained three 3-dimensional multivariate random samples with sample sizes all equal to 22 where the sample from (Grays Harbor, Jefferson, Clallam) was considered as the reference sample. The summary statistics of the nine counties are shown in Table 3.
We initiated tilt selection with h 1 ( x ) = h 2 ( x ) = ( x 1 , x 2 , x 3 ) T suggested in [14,15]. The tilts selected were h 1 ( x ) = ( x 1 , x 2 , x 3 ) T and h 2 ( x ) = ( x 1 , x 3 ) T giving the smallest AIC = 483.22 as shown in Table 4. The 45°-line formed by the pairs ( G ^ 0 , G ˜ 0 ) in Figure 3 indicating a good fit ( G ˜ 0 is closed to the empirical distribution G ^ 0 ).
We computed in Table 5 the estimates of several selected joint threshold probabilities, which can be regarded as predictions for a future year. It is worth noticing that the probabilities selected cannot be estimated by the empirical distribution G ^ 0 due to the lack of observations while this is made feasible by fusing data from the other two regions.

4. Discussion

Our data fusion approach allows us to combine information from multiple sources that can together describe dynamic and multifactorial phenomena more comprehensively than a single source alone. Infectious disease dynamics are ideally suited for such integrative modeling of an outbreak in which a county is usually affected by its neighboring counties, especially in populated areas, due to population mobility [19]. As the re-emergence of pertussis in the U.S. and Europe in recent years has shown, it is important to have the modeling capacity to predict the incidence of the disease even if the data are usually of small size, which are in themselves not adequate for the precise estimation of tail probabilities.
The multivariate density ratio model described in this study allowed us to examine the joint behavior of pertussis resurgence in adjacent counties. The model was validated by goodness-of-fit plots. Importantly, the observed support of the reference distribution of cases was enlarged by fusing the reference sample with data from nearby regions and applied to the density ratio model. While time series modeling of disease incidence is common in epidemiology, in the face of small or moderate data sources few methods can enhance their input to yield multivariate tail probabilities and confidence intervals, which are not possible to estimate otherwise.
In future work, we plan to further enrich our model with regional covariates to provide key insights for disease surveillance and public health researchers. For instance, the risk factors of pertussis cases that are studied in the U.S. include household size, vaccination coverage and demographics (see Table A1). Such factors are observed with regional variation that is often spatially clustered across neighboring counties [20]. Indeed, data fusion is well suited to the systematic modeling of regions with socioeconomic, political or cultural overlap (e.g., school districts) that are characterized by nonmedical vaccine exemptions, migration and vaccine refusal [21,22,23]. In times of increasingly common vaccine hesitancy, such applications could be very effective for public health.
While the world is currently seeing outbreaks of the COVID-19 pandemic, pertussis is, in comparison, an ancient disease, which was recognized even in the Middle Ages. While connections between these diseases have recently been considered [24], it is beyond the scope of the present study. However, the multivariate approach that we used for fusion of pertussis inter-county data could also be applied to other regionally transmissible diseases, including COVID-19. We leave this to future studies.

Author Contributions

All authors have contributed equally to the paper. This includes conceptualization, methodology, formal analysis, investigation, and writing. In addition, all authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Faculty-Student Research Award, University of Maryland, College Park.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available on Washington State Department of Health and U.S. Census Bureau Website.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Risk Factors of Pertussis Incidence

Table A1. County statistics and risk factors: Population Estimate, Average Household Size, Percent Hispanic, Pertussis Vaccine Coverage, Percent Population Below 5 years, Population Density, Rural/Urban, Socioeconomic Status (SES) as per 2017 estimates.
Table A1. County statistics and risk factors: Population Estimate, Average Household Size, Percent Hispanic, Pertussis Vaccine Coverage, Percent Population Below 5 years, Population Density, Rural/Urban, Socioeconomic Status (SES) as per 2017 estimates.
CountyPopulationHousehold%Hispanic%Vaccine%Below5DensityRural/UrbanSES
Grays Harbor72,4902.439.880.75.514.78Mostly RuralMid
Jefferson31,2102.073.780.82.96.70Mostly RuralMid
Clallam75,6372.255.887.14.716.74Mostly RuralMid
Clark474,3812.698.784.76.2290.74Semi-UrbanHigh
Cowlitz106,8052.528.494.16.236.13Semi-UrbanHigh
Lewis78,3202.529.791.55.912.56Mostly RuralMid
King2,203,8362.459.491.45.9400.75UrbanMid/High
Snohomish802,0892.689.790.76.4147.82Semi-UrbanHigh
Skagit125,8602.5517.890.46.128.03Semi-UrbanHigh

References

  1. Kedem, B.; Pyne, S. Estimation of Tail Probabilities by Repeated Augmented Reality. J. Stat. Theory Pract. 2021, 15, 1–16. [Google Scholar] [CrossRef] [PubMed]
  2. Yeung, K.H.T.; Duclos, P.; Nelson, E.A.S.; Hutubessy, R.C.W. An update of the global burden of pertussis in children younger than 5 years: A modelling study. Lancet Infect. Dis. 2017, 17, 974–980. [Google Scholar] [CrossRef]
  3. Broutin, H.; Guégan, J.F.; Elguero, E.; Simondon, F.; Cazelles, B. Large-scale comparative analysis of pertussis populations dynamics: Periodicity, synchrony and impact of vaccination. Am. J. Epidemiol. 2005, 161, 1–9. [Google Scholar] [CrossRef] [PubMed]
  4. Rohani, P.; Drake, J.M. The decline and resurgence of pertussis in the US. Epidemics 2011, 3, 183–188. [Google Scholar] [CrossRef] [PubMed]
  5. Van Panhuis, W.; Cross, A.; Burke, D. Counts of Pertussis reported in United States of America: 1888–2017; Project Tycho Data Release; University of Pittsburgh: Pittsburgh, PA, USA, 2018. [Google Scholar]
  6. Centers for Disease Control and Prevention (CDC). Pertussis epidemic—Washington, 2012. MMWR Morb. Mortal. Wkly. Rep. 2012, 61, 517–522. [Google Scholar]
  7. Skoff, T.H.; Baumbach, J.; Cieslak, P.R. Tracking pertussis and evaluating control measures through enhanced pertussis surveillance, Emerging Infections Program, United States. Emerg. Infect. Dis. 2015, 21, 1568. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Barlow, R.S.; Reynolds, L.E.; Cieslak, P.R.; Sullivan, A.D. Vaccinated children and adolescents with pertussis infections experience reduced illness severity and duration, Oregon, 2010–2012. Clin. Infect. Dis. 2014, 58, 1523–1529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Sanstead, E.; Kenyon, C.; Rowley, S.; Enns, E.; Miller, C.; Ehresmann, K.; Kulasingam, S. Understanding trends in pertussis incidence: An agent-based model approach. Am. J. Public Health 2015, 105, e42–e47. [Google Scholar] [CrossRef] [PubMed]
  10. Wang, Y.; Xu, C.; Wang, Z.; Zhang, S.; Zhu, Y.; Yuan, J. Time series modeling of pertussis incidence in China from 2004 to 2018 with a novel wavelet based SARIMA-NAR hybrid model. PLoS ONE 2018, 13, e0208404. [Google Scholar] [CrossRef] [PubMed]
  11. Kedem, B.; De Oliveira, V.; Sverchkov, M. Statistical Data Fusion; World Scientific: Singapore, 2017. [Google Scholar]
  12. Lu, G. Asymptotic Theory for Multiple-Sample Semiparametric Density Ratio Models and Its Application to Mortality Forecasting. Ph.D. Thesis, University of Maryland, College Park, MD, USA, 3 October 2007. [Google Scholar]
  13. Qin, J. Biased Sampling, Over-Identified Parameter Problems and Beyond; Springer: Singapore, 2017. [Google Scholar]
  14. Qin, J.; Zhang, B. A goodness-of-fit test for logistic regression models based on case-control data. Biometrika 1997, 84, 609–618. [Google Scholar] [CrossRef]
  15. Voulgaraki, A.; Kedem, B.; Graubard, B.I. Semiparametric regression in testicular germ cell data. Ann. Appl. Stat. 2012, 6, 1185–1208. [Google Scholar] [CrossRef] [Green Version]
  16. Fokianos, K. Density ratio model selection. J. Stat. Comput. Simul. 2007, 77, 805–819. [Google Scholar] [CrossRef]
  17. Fokianos, K.; Kaimi, I. On the effect of misspecifying the density ratio model. Ann. Inst. Stat. Math. 2006, 58, 475–497. [Google Scholar] [CrossRef]
  18. Zhang, X.; Pyne, S.; Kedem, B. Model selection in radon data fusion. Stat. Transit. New Ser. 2020, 21, 167–174. [Google Scholar] [CrossRef]
  19. Nielsen, F.; Marti, G.; Ray, S.; Pyne, S. Clustering patterns connecting COVID-19 dynamics and Human mobility using optimal transport. Sankhya B 2021, 83, 1–18. [Google Scholar] [CrossRef] [PubMed]
  20. Omer, S.B.; Enger, K.S.; Moulton, L.H.; Halsey, N.A.; Stokley, S.; Salmon, D.A. Geographic clustering of nonmedical exemptions to school immunization requirements and associations with geographic clustering of pertussis. Am. J. Epidemiol. 2008, 168, 1389–1396. [Google Scholar] [CrossRef] [PubMed]
  21. Omer, S.B.; Pan, W.K.; Halsey, N.A.; Stokley, S.; Moulton, L.H.; Navar, A.M.; Pierce, M.; Salmon, D.A. Nonmedical exemptions to school immunization requirements: Secular trends and association of state policies with pertussis incidence. JAMA 2006, 296, 1757–1763. [Google Scholar] [CrossRef] [PubMed]
  22. Phadke, V.K.; Bednarczyk, R.A.; Salmon, D.A.; Omer, S.B. Association between vaccine refusal and vaccine-preventable diseases in the United States: A review of measles and pertussis. JAMA 2016, 315, 1149–1158. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Wolf, E.; Rowhani-Rahbar, A.; Tasslimi, A.; Matheson, J.; DeBolt, C. Parental country of birth and childhood vaccination uptake in Washington State. Pediatrics 2016, 138, e20154544. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ismail, M.B.; Al Omari, S.; Rafei, R.; Dabboussi, F.; Hamze, M. COVID-19 in children: Could pertussis vaccine play the protective role? Med. Hypotheses 2020, 145, 110305. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowchart of the data fusion analysis.
Figure 1. Flowchart of the data fusion analysis.
Entropy 23 00675 g001
Figure 2. PP-plot for G ^ 0 vs. G ˜ 0 in the univariate case.
Figure 2. PP-plot for G ^ 0 vs. G ˜ 0 in the univariate case.
Entropy 23 00675 g002
Figure 3. PP-plot for G ^ 0 vs. G ˜ 0 in the univariate case.
Figure 3. PP-plot for G ^ 0 vs. G ˜ 0 in the univariate case.
Entropy 23 00675 g003
Table 1. Summary statistics of Jefferson, Cowlitz and Snohomish counties, WA. Q1 and Q3 are referred to 25th and 75th percentile respectively.
Table 1. Summary statistics of Jefferson, Cowlitz and Snohomish counties, WA. Q1 and Q3 are referred to 25th and 75th percentile respectively.
StatisticsMin.Q1MedianQ3Max.
County
Jefferson0.000.001.006.5030.00
Cowlitz0.003.008.0023.25108.00
Snohomish7.0036.2546.5054.75549.00
Table 2. Selected joint probability estimates non-obtainable from the empirical distribution and the corresponding 95% confidence intervals. Here, t represents the annual pertussis cases in Jefferson.
Table 2. Selected joint probability estimates non-obtainable from the empirical distribution and the corresponding 95% confidence intervals. Here, t represents the annual pertussis cases in Jefferson.
ProbabilityEstimate95% Confidence Interval
P ( t > 30 ) 0.0200(−0.0204, 0.0604)
P ( t > 40 ) 0.0084(−0.0124, 0.0292)
P ( t > 50 ) 0.0021(−0.0041, 0.0083)
Table 3. Summary statistics of each county used in the multivariate analysis. Q1 and Q3 are referred to 25th and 75th percentile respectively.
Table 3. Summary statistics of each county used in the multivariate analysis. Q1 and Q3 are referred to 25th and 75th percentile respectively.
StatisticsMin.Q1MedianQ3Max.
County
Grays Harbor0.001.002.504.7524.00
Jefferson0.000.001.006.5030.00
Clallam0.001.002.004.7525.00
Clark3.0020.2533.5085.00326.00
Cowlitz0.003.008.0023.25108.00
Lewis0.002.005.0010.7571.00
King38.00115.00141.00194.25785.00
Snohomish7.0036.2546.5054.75549.00
Skagit1.005.009.0017.75559.00
Table 4. AIC values for different choices of h 1 and h 2 . A hyphen “-” indicates that h k ( x ) 0 and therefore g 0 and g k are identical for k = 1 , 2 .
Table 4. AIC values for different choices of h 1 and h 2 . A hyphen “-” indicates that h k ( x ) 0 and therefore g 0 and g k are identical for k = 1 , 2 .
h 1 - x 1 x 2 x 3 ( x 1 , x 2 ) ( x 1 , x 3 ) ( x 2 , x 3 ) ( x 1 , x 3 , x 3 )
AIC
h 2
-553.03554.32552.37554.39554.19556.22554.22556.11
x 1 527.36487.62529.32526.98483.92489.53528.98485.89
x 2 525.03524.09516.98525.37518.98525.56518.94520.94
x 3 549.19551.19549.92547.88551.36549.45547.50549.45
( x 1 , x 2 ) 523.36485.04515.77522.57485.22487.03517.24487.17
( x 1 , x 3 ) 558.58489.07530.52528.38485.37486.05530.36483.22
( x 2 , x 3 ) 527.03526.08518.97526.34520.97526.85520.93522.92
( x 1 , x 2 , x 3 ) 524.91486.51517.25524.33486.71483.32519.19485.22
Table 5. Selected joint probability estimates non-obtainable from the empirical distribution and the corresponding 95% confidence intervals. Here, ( t 1 , t 2 , t 3 ) represents the number of annual pertussis cases in (Grays Harbor, Jefferson, Clallam) respectively.
Table 5. Selected joint probability estimates non-obtainable from the empirical distribution and the corresponding 95% confidence intervals. Here, ( t 1 , t 2 , t 3 ) represents the number of annual pertussis cases in (Grays Harbor, Jefferson, Clallam) respectively.
ProbabilityEstimate95% Confidence Interval
P ( t 1 > 20 , t 2 10 , t 3 10 ) 5.6511 × 10 3 ( 2.5641 × 10 2 , 3.6943 × 10 2 )
P ( t 1 > 10 , t 2 > 10 , t 3 10 ) 8.1231 × 10 3 ( 2.9312 × 10 2 , 4.5558 × 10 2 )
P ( t 1 > 15 , t 2 > 15 , t 3 15 ) 2.6609 × 10 3 ( 1.8843 × 10 2 , 2.4166 × 10 2 )
P ( t 1 > 25 , t 2 > 20 , t 3 > 10 ) 2.6517 × 10 7 ( 1.8969 × 10 4 , 1.9010 × 10 4 )
P ( t 1 > 15 , t 2 > 30 , t 3 > 10 ) 3.7789 × 10 9 ( 2.5683 × 10 5 , 2.5691 × 10 5 )
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, X.; Pyne, S.; Kedem, B. Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State. Entropy 2021, 23, 675. https://doi.org/10.3390/e23060675

AMA Style

Zhang X, Pyne S, Kedem B. Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State. Entropy. 2021; 23(6):675. https://doi.org/10.3390/e23060675

Chicago/Turabian Style

Zhang, Xuze, Saumyadipta Pyne, and Benjamin Kedem. 2021. "Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State" Entropy 23, no. 6: 675. https://doi.org/10.3390/e23060675

APA Style

Zhang, X., Pyne, S., & Kedem, B. (2021). Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State. Entropy, 23(6), 675. https://doi.org/10.3390/e23060675

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop