Next Article in Journal
Hatchability of Fascioloides magna Eggs in Cervids
Next Article in Special Issue
Space–Time Patterns of Poultry Pathogens in the USA: A Case Study of Ornithobacterium rhinotracheale and Pasteurella multocida in Turkey Populations
Previous Article in Journal
Kudoa septempunctata Spores Cause Acute Gastroenteric Symptoms in Mouse and Musk Shrew Models as Evidenced In Vitro in Human Colon Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mapping the Dynamics of Contemporary PRRSV-2 Evolution and Its Emergence and Spreading Hotspots in the U.S. Using Phylogeography

by
Nakarin Pamornchainavakul
1,
Igor A. D. Paploski
1,
Dennis N. Makau
1,
Mariana Kikuti
1,
Albert Rovira
1,2,
Samantha Lycett
3,
Cesar A. Corzo
1 and
Kimberly VanderWaal
1,*
1
Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN 55108, USA
2
Veterinary Diagnostic Laboratory, University of Minnesota, St. Paul, MN 55108, USA
3
Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, UK
*
Author to whom correspondence should be addressed.
Pathogens 2023, 12(5), 740; https://doi.org/10.3390/pathogens12050740
Submission received: 28 April 2023 / Revised: 16 May 2023 / Accepted: 19 May 2023 / Published: 20 May 2023
(This article belongs to the Special Issue Spatio-Temporal Analysis of Veterinary Infectious Diseases)

Abstract

:
The repeated emergence of new genetic variants of PRRSV-2, the virus that causes porcine reproductive and respiratory syndrome (PRRS), reflects its rapid evolution and the failure of previous control efforts. Understanding spatiotemporal heterogeneity in variant emergence and spread is critical for future outbreak prevention. Here, we investigate how the pace of evolution varies across time and space, identify the origins of sub-lineage emergence, and map the patterns of the inter-regional spread of PRRSV-2 Lineage 1 (L1)—the current dominant lineage in the U.S. We performed comparative phylogeographic analyses on subsets of 19,395 viral ORF5 sequences collected across the U.S. and Canada between 1991 and 2021. The discrete trait analysis of multiple spatiotemporally stratified sampled sets (n = 500 each) was used to infer the ancestral geographic region and dispersion of each sub-lineage. The robustness of the results was compared to that of other modeling methods and subsampling strategies. Generally, the spatial spread and population dynamics varied across sub-lineages, time, and space. The Upper Midwest was a main spreading hotspot for multiple sub-lineages, e.g., L1C and L1F, though one of the most recent emergence events (L1A(2)) spread outwards from the east. An understanding of historical patterns of emergence and spread can be used to strategize disease control and the containment of emerging variants.

1. Introduction

Porcine reproductive and respiratory syndrome (PRRS) is considered to be one of the most significant pig diseases in the U.S. and worldwide [1]. The predominant virus species that causes this disease in the U.S. is porcine reproductive and respiratory syndrome virus type 2 (PRRSV-2) [2,3], an RNA virus belonging to the family Arteriviridae in the Nidovirales order [4]. PRRSV-2 interrupts the function of the innate immune system [5], weakens adaptive immunity [6], induces cell death [7,8], and triggers inflammatory responses [9,10,11] in infected pigs, leading to various clinical presentations including late-term abortion and premature farrowing in sows, high mortality in piglets, and failure to thrive in growing pigs [12]. These clinical impacts during PRRS outbreaks directly diminish gross pig production and income on the level of individual farms [13] and at national scales [14,15].
PRRS first appeared in the U.S. in the states of North Carolina, Minnesota, and Iowa in 1987–1988 [16]. After more than 3 decades, 20–30% of sow farms throughout the U.S. pig industry still experience PRRSV-2 outbreaks each year [17]. The persistence of PRRSV-2 is characterized by the cyclical emergence of new genetic variants of the virus [18], the spread of which is then facilitated by the continuous movement of pigs between herds as part of multi-site, vertically integrated production systems. New variants typically emerge in a particular geographic area and then disseminate widely in the industry through routine animal movements [19,20,21,22]. The location of the emergence and the subsequent patterns of spread constitute crucial information, determining where prevention measures should be strengthened to limit pathogen dispersal. Phylogeography, which utilizes evolutionary relationships between viral genetic sequences to reconstruct ancestral locations and migrations, can be a useful tool to address these questions.
PRRSV-2 diversity has conventionally been quantified through variation in the open reading frame 5 (ORF5) gene [2,3,23]. This 603-nucleotide gene encodes the major envelope glycoprotein, in which several immune epitopes reside [24]. Its immunogenic importance, coupled with its high genetic variability between viruses [3,25,26], makes ORF5 a good marker for PRRSV-2 genetic diversity. Historical analysis of PRRSV-2 ORF5 sequences in the U.S. has shown that the PRRSV epidemiological landscape is characterized by the co-existence of multiple groups of genetically distinct viruses [3], referred to as phylogenetic lineages [3,18]. Pairwise genetic distances are typically <11% for sequences within the same lineage, typically with >10% divergence between lineages [18]. These lineages were originally defined by Shi et al., using data from up to 2009 [26]. Since then, PRRSV-2 lineage 1 (L1) has increased from <40% to >60% of sequenced viruses, while other PRRSV-2 lineages have been decreasing over time [3,27]. It is worth noting that the majority of non-L1 viruses detected through sequencing in the past decade (20–30% of sequenced viruses) are L5 and L8 [27], which have been widely used as commercial live-attenuated vaccines, whereas wild-type non-lineage 1 sequences account for <1% of sequences [27]. In addition, lineage 1 has diverged into several subpopulations, i.e., sub-lineages L1A to L1H (typical pairwise genetic distances <8.5% for sequences belonging to the same sub-lineage, and >9% divergence between sub-lineages [18]. The prevalence of different sub-lineages varies geographically and temporally, with cyclic population expansions and contractions of particular groups of viruses being continuously observed [18].
Although patterns of L1 sub-lineage emergence and turnover are well described [3,18], the geographic origin and spreading hotspots of contemporary L1 viruses have not been determined. Such information is crucial for PRRS prevention, management, and containment. The most recent analysis of PRRSV-2 phylogeography in the U.S. used sequences up to 2011 [28], prior to when L1 became the dominant lineage in the U.S. Thus, this earlier work might not be representative of the current epidemiologic situation. To fill these gaps, our objective is to provide an updated analysis of the spatiotemporal dynamics of the spread and population growth of PRRSV-2 L1 over the last three decades in the U.S., with a particular emphasis on understanding the spatiotemporal dynamics that underpin the continued diversification of L1 into numerous sub-lineages. We inferred the phylodynamics of each L1 sub-lineage and identified potential selection pressures associated with phylogenetic divergence that may help to explain the overall L1 phylogeography. Given the large size of the aggregated U.S. PRRSV ORF5 sequence dataset, we also evaluated the robustness of our results with reference to various sub-sampling strategies that aimed to generate spatial-temporally representative subsets for analysis. This study advances our understanding of the large-scale geographic expansions and evolutionary dynamics of the virus; it may help us to determine why PRRSV-2 persistently circulates within the U.S.

2. Materials and Methods

2.1. Data Sources

Three data sources, the National Center for Biotechnology Information GenBank (NCBI, 1991–2021), the University of Minnesota Veterinary Diagnostic Laboratory (UMN VDL, 2004–2021), and the Morrison Swine Health Monitoring Project (MSHMP, 2010–2021), were accessed in October 2021 to gather PRRSV-2 ORF5 genetic sequences collected in the U.S. Briefly, the UMN VDL has generated virus sequences from throughout the U.S. as part of services rendered to primarily U.S.-based industry clients. MSHMP is a voluntary program established in 2011 that archives a variety of swine disease data from over 35 participating production systems, capturing data from more than 50% of the U.S. breeding population [29]. Outbreaks identified and reported to MSHMP may have an accompanying ORF5 gene sequence. Sequences reported to MSHMP are typically produced by the VDLs of the University of Minnesota, Iowa State University, South Dakota State University, or Kansas State University. All available sequences from Canada (only found in the NCBI GenBank) were also included, given that the routine transport of hogs between the U.S. and Canada [30] might lead to transboundary PRRSV transmission.
The inclusion criteria for sequences were the completeness of the sequence (>580 nucleotides) and the availability of a sample collection date and location (i.e., the state in the U.S. where a sample was collected or, in the case of the UMN VDL, the state where the client who submitted the sample was located). Primary deduplication by sequence identification number was performed between the MSHMP and the UMN VDL datasets. Subsequently, the aggregated dataset of 40,365 ORF5 sequences were aligned using pairwise local alignment in MAFFT v.7.310 [31], and the alignment was used to build an approximately maximum-likelihood tree using FastTree v.2.1.10 [32]. PRRSV-1 sequences detected in the tree were excluded before realigning only the PRRSV-2 ORF5 sequences onto a PRRSV-2 reference (GenBank accession no. NC_038291.1) using pairwise codon alignment via VIRULIGN [33]. Repeated sequences from localized outbreaks caused by highly related viruses (defined by same exact collection date, collected within the same state, and with 100% nucleotide similarity) were deduplicated. After filtering out sequences containing ambiguous nucleotides (n = 4644), gaps (n = 530), or with signals of potential recombination (n = 4) consistently detected by all seven methods implemented in in RDP5 [34], wherein a fully exploratory (all sequences compared to all others) recombination analysis was performed using the RDP [35], GENECONV [36], MaxChi [37], BootScan [38], SiScan [39], Chimaera [40], and 3Seq [41] methods, the curated 29,554 PRRSV-2 ORF5 sequences were classified into (sub-)lineages by measuring the pairwise distance between a sequence to each (sub-)lineage’s anchors, as described elsewhere (Paploski, et al.) [3,18]. Non-L1 sequences were excluded from further analysis. Ultimately, an alignment of just 19,395 PRRSV-2 L1 ORF5 sequences was used for further analyses (Supplementary Figure S1).

2.2. Subsampling

The geographic regions utilized for discrete-space phylogeography were adapted from the Swine Health Information Center’s (SHIC) regions, with boundaries creating divisions between major pork-producing areas in the U.S. Due to the high number of available sequences, SHIC’s region 3 (Midwest) was subdivided into 2 regions according to the spatial distribution of the pig population [42,43]. These regions included the Southwest (SW, n = 1057 sequences), the Upper Midwest (UMW, n = 10,384 sequences), the Central Midwest (CMW, n = 827 sequences), the Northeast (NE, n = 473 sequences), the East (E, n = 6554 sequences), and Canada (CAD, n = 80 sequences) (Supplementary Table S1). The Western (W, n = 12 sequences) and Southeastern (SE, n = 8 sequences) U.S. were not included in the analysis since less than 20 sequences were available from those regions, likely reflecting low pig populations [43]. Given that one of the data sources is a Minnesota-based diagnostic lab, data availability was highly biased towards the Upper Midwest. Imbalances in the number of sequences per region can create analytical biases that influence ancestral state reconstructions produced by phylogeographic models, wherein the model is falsely confident that the overrepresented region is the ancestral region. To diminish this bias and for computational feasibility, the L1 alignment (n = 19,395) was sampled 5 times (500 sequences per subset) using a spatiotemporally stratified uniform sampling method with year and region strata. This approach aims to equalize the number of samples included from each region/year; as such, an equal number of sequences were drawn from the available sequences per region for each calendar year from 1991 to 2021, except for some years, in which a few (<5) or no sequences were available from particular regions (Figure 1). Using this approach, the median number of sequences/region/year was 3 (IQR = 0–4) and the total unique sequences utilized across the 5 random data sets was 1765, or 9.1% of all L1 sequences.
As there is some uncertainty about the most appropriate sub-sampling strategy for addressing sampling bias [44,45], two other subsampling approaches were also assessed five times each. For uniform spatial stratified sampling, sub-sampling was performed as described above but ignoring the year. For proportionate stratified sampling, the number of sequences included per region was set to be proportionate to the relative pig population in each region and state, as reported in the 2017 agricultural census [43]. A comparison of the spatiotemporal representations of all three subsampling approaches is shown in the Supplementary Data. (Supplementary Figures S3 and S4)

2.3. Phylogeographic Analyses

The origin and frequency of inter-regional spread of PRRSV-2 Lineage 1 and its sub-lineages were estimated from the subsampled sequences via Bayesian discrete phylogeography, which is also referred to as discrete trait analysis (DTA). This analysis treats the sampling location as a discrete trait and then estimates the ancestral locations and geographic migrations of the virus from the trait transitions across the viral evolutionary tree through continuous-time Markov chain (CTMC) modeling [46]. To achieve this, time-scaled phylogenetic trees with phylogeographic inference were reconstructed from each subsampled set using BEAST v.1.10.4 [47]. The models used for the analysis were as follows: the general time reversible with gamma plus invariant site heterogeneity (GTR + I + G) as the nucleotide substitution model, the uncorrelated relaxed clock [48] with log-normal distribution as the molecular clock model, and the non-reversible CTMC for the asymmetric discrete trait substitution model [46]. We additionally inferred the viral population dynamics by specifying the Bayesian Skygrid’s Gaussian Markov random field (GMRF) model [49] as a coalescent. Prior to the Bayesian analysis, the temporal signals of each subsampled set were checked using TempEst v.1.5.3 [50] by analyzing the root-to-tip distances in the maximum likelihood trees built by IQ-TREE [51]. Bayesian analyses were run with 300 million Markov chain Monte Carlo (MCMC)’s chain length. The first 10% of the samples from the MCMC chain were discarded as burn-ins, and the remaining trees were summarized as a maximum clade credibility (MCC) tree via TreeAnnotator v.1.10.4 [52] and visualized using the Nextstrain [53] platform. Phylogeographic and population dynamic outputs were visualized using the ggplot2 [54] package in R [55]. All of these approaches were repeated for each set of sequences belonging to each of the seven sub-lineages identified in the MCC trees, including L1A, L1A(2) (the secondary re-emergence of L1A viruses [18]), L1BG (a monophyletic clade comprising L1B and L1G), L1C, L1E, L1F, and L1H. In total, 50 DTA runs (15 runs from 3 subsampling strategies performed on L1 overall, plus 35 runs from the sub-lineage analyses, for which we only used spatiotemporal subsampling) were performed to infer the spatial-temporal dynamics of the virus. The key phylogeographic results were combined from 5 runs, e.g., the range of median times to the most recent common ancestor (tMRCA), the range of the 95% highest posterior densities (HPDs), and the range of probabilities in the ancestral region.
Maximum likelihood discrete-space phylogeography [56] and Bayesian structured coalescent approximation [57] were explored as alternative analytical approaches and were used to evaluate the sensitivity of our results to the utilized modeling platform. The first method, maximum-likelihood phylogeography, essentially infers ancestral traits across an evolutionary tree’s internal nodes by treating migration between discrete locations in the same way as genetic mutation, given a time-reversible model [56]; it is a non-Bayesian version of the DTA described above. Bayesian structured coalescent approximation extends the Bayesian coalescent model to allow for migration between subpopulations within a structured population and approximates the internal nodes’ trait probability [57,58]. The same five spatiotemporal stratified sampled sets used for DTA were reanalyzed using these methods via TreeTime v.0.8.5 [56] in the Nextstrain’s augur v.10.1.1 pipeline [53] (the maximum-likelihood approach), and the marginal approximation of the structured coalescent (MASCOT) v.2.1.2 [58] package in BEAST v.2.5.1 [59], respectively. Since TreeTime does not require significant amounts of computational power to run a large dataset, the full set of PRRSV-2 L1 ORF5 sequences (n = 19,395) was also analyzed by TreeTime. Detailed parameter settings are displayed in the Supplementary Data (Supplementary Table S2). Ancestral locations and their transitions through time from all phylogeographic analyses were summarized using the Babel v.0.4.0 package in BEAST v.2.5.1 [59].

2.4. Selective Pressure Analysis

Possible selective pressures throughout PRRSV-2 L1’s evolutionary history were identified using the branch-site test of positive selection [60]. We used a branch-site test because branches can be tied to an inferred geographic region, and we wanted to test the hypothesis that selection pressures vary between different regions. The analysis was performed on each of the five spatiotemporally subsampled MCC trees from DTA and their original sequence alignment. Implemented in aBSREL (adaptive branch-site random effects likelihood) v.2.3 software, all tree branches were tested using an exploratory analysis, in which the optimal ω (non-synonymous to synonymous substitutions ratio or dN/dS) of each branch was inferred by the small-sample Akaike Information Criterion (AICc) and then compared to the null model (ω ≤ 1) using the likelihood ratio test (LRT) [61]. If a branch has the inferred ω > 1 and LRT p-value < 0.05, the virus is estimated to evolve under episodic positive (diversifying) selection at that branch.

3. Results

3.1. Origin of PRRSV-2 L1 and Its Sub-Lineages

The results from five DTAs on different spatiotemporally stratified subsampled sets (n = 500 each) suggest that the PRRSV-2 L1 in the U.S. originated in Canada (100% posterior probability) during the late 1980s (median time to the most recent common ancestor (tMRCA) = 1986; (range of 95% highest posterior densities (HPDs) = 1981–1989)). Although the L1A and L1F sub-lineages concurrently diverged from the primitive L1 in the late 1990s (range of median L1A tMRCA = 1995–2000; (range of 95% HPDs = 1992–2002) and range of median L1F tMRCA = 1995–1998; (range of 95% HPDs = 1993–2000)), L1F was likely imported from Canada (93–100% probability), while L1A emerged within the Upper Midwestern U.S. (97–100% probability). Within a few years, two additional sub-lineages, L1C and L1BG, diverged (range of median L1C tMRCA = 1998–1999; (range of 95% HPDs = 1995–2002) and range of median L1BG tMRCA = 2000–2002; (range of 95% HPDs = 1997–2003)). L1C was a sister clade of L1, and L1BG was a sister of L1A on every MCC tree. Like their sisters, L1C and L1BG potentially originated in Canada (97–100% probability) and the Upper Midwest (93–100% probability), respectively. L1 further diverged into another two sub-lineages, L1E and L1H, in the 2000s. L1E was a relatively small clade, directly arising from the basal L1 around the mid-to-late 2000s (range of median L1E tMRCA = 2003–2007; (range of 95% HPDs = 2000–2009)) and was estimated to have emerged in the Upper Midwest (80–98% probability). On the contrary, L1H was a recent sister of L1C and L1F, and its approximated time and place of origin were the late 2000s (range of median L1H tMRCA = 2008; (range of 95% HPDs = 2006–2010)) in Canada (100% probability). The second and larger wave of L1A (L1A(2)) branched out from the original L1A in early 2010s (range medians L1A(2) tMRCA = 2009–2012; (range of 95% HPDs = 2007–2013)). Although the most likely origin of this emergence appears to be the Upper Midwest (52–99% probability across analyses based on 5 subsets of data), the probability that the source was the Eastern U.S. (4–47% probability) could not be ruled out (Figure 2).
When comparing the results generated by the different subsampling techniques, the estimations of both the time and location of origin from the spatially stratified subsampled sets were markedly similar to those from the spatiotemporally stratified subsampling. Unlike the first two techniques, estimations based on the proportionate stratified samples indicate that the L1 virus and all its sub-lineages likely originated in the Upper Midwestern U.S., the region with the proportionately largest pig population and thus the most sequences included in the analysis (48% of the sequences were from the Upper Midwest). Inferred tMRCAs from the population-based subsampling were less consistent compared to the spatially based subsamples (Supplementary Figure S5).
The virus phylogeography inferred by the DTA differed when using other modeling approaches, despite the same inputs being used (i.e., all models utilized the same subsets produced from spatiotemporally stratified sub-sampling). For instance, the origins of L1BG, L1E, and L1A(2) were more uncertain in the maximum-likelihood-based TreeTime analysis than in DTA; in some cases, three regions were considered almost equally likely to be the putative origin (sub-lineages L1A(2) and L1BG), or different runs produced different results (sub-lineage L1E). That said, the estimations from TreeTime were far more similar to the DTA than the results from MASCOT. In the latter, there were high levels of uncertainty regarding the region of origin for most sub-lineages, and the Eastern U.S. was frequently inferred to be a potential origin of both L1 overall and most sub-lineages. However, DTA, TreeTime, and MASCOT similarly estimated that the origin of L1H was in Canada (100% probability). Focusing on the time of emergence, estimates of tMRCAs and inferred nucleotide substitution rates were similar across all methods (Supplementary Figure S6).
Both the geographic region and the time of origin of the L1 virus estimated from the non-subsampled set (n = 19,395) using TreeTime corresponded well to the inference from the spatiotemporally stratified samples subjected to DTA. The only disagreement was the origin of the sub-lineage L1F; the ancestral location of the full dataset was inferred to be the Upper Midwest instead of Canada, possibly as a result of the overrepresentation of the Upper Midwest in this dataset (Figure 3 and Supplementary Figure S7).

3.2. Inter-Regional Spread and Spreading Hotspots

The source regions contributing to frequent inter-regional spread (based on the median number of transitions between regions across the five runs) inferred by the phylogeographic analysis were considered hotspots for inter-regional spread. DTA analysis of the spatiotemporally stratified samples suggests that the Upper Midwestern U.S. was the main hotspot for inter-regional spread and was the origin of frequent dissemination events to every region except for Canada. The common destinations of such events were the Central Midwest (~126 transitions since L1 emergence until 2021) and the Northeast (~111 transitions). Canada and the Eastern U.S. can also be considered hotspots, although they mainly spread the virus to only a single adjacent region, such as the Upper Midwest (~75 transitions) and the Northeast (~71 transitions), respectively (Figure 4A). Summarizing the proportion of branches inferred to exist in each region across time (Supplementary Figure S11), L1 was primarily circulating in Canada in the 1990s and was likely introduced to the U.S. through the Upper Midwest during the early 2000s. After around 2005, Canada gradually ceased to be a major player in U.S. inter-regional spread (although we have relatively few Canadian sequences in later years, preventing us from fully understanding its role in later periods). These phylogeographic patterns were relatively stable across different phylogeographic approaches and subsampling techniques, albeit with some variations. For instance, MASCOT approximated that the virus spread from the East to both the Northeast and the Upper Midwest, and that there was no apparent flow from Canada to the U.S., which was inconsistent with the patterns inferred by the other approaches (Supplementary Figures S8 and S9).
At the sub-lineage level, the direction and extent of inter-regional spread were unique for each viral subpopulation. L1C and L1A(2) were the predominant sub-lineages with the highest number of samples across all subsampled datasets (Figure 1, Figure 2 and Figure 3). Nevertheless, the inferred patterns of inter-regional spread moved in opposite directions. The DTA phylogeography estimated that the Upper Midwest was a spreading hotspot for L1C, which disseminated the virus mainly to the Northeast, Central Midwest, and East. L1C was also found to spread in the opposite direction, from the East to the Northeast to the Central Midwest. In contrast, L1A(2) displayed a more unidirectional northwesterly flow from the East to the Northeast to the Central Midwest to the Upper Midwest. With a smaller number of inferred transitions events due to its smaller population size, L1BG mostly circulated back and forth between two Midwestern regions, with some spillover to the Northeast, whereas L1F largely spread from the Upper Midwest to the Southwest, with some introductions to the Central Midwest (Figure 4B). Even though the extent of L1H’s spread was not comparable to the spread of other sub-lineages, all of the phylogenetic trees show that the L1H viruses circulated primarily within the Southwestern U.S., which was also its spreading hotspot according to the DTA inference. The smallest sub-lineages were L1A and L1E, for which no distinct patterns of inter-regional spread were discerned (Supplementary Figure S10)

3.3. Population Dynamics, Mutation, and Selection Pressure

PRRSV-2 L1 population dynamics in the U.S. were summarized from the median effective population sizes inferred by the Bayesian Skygrid analysis on different spatiotemporally stratified sets. The virus population rapidly grew within the first decade (1990–1998), after which the population dynamics were characterized by a wave-like pattern, wherein the effective population size slightly decreased and then increased to a new peak approximately every ~6 years until the present (Figure 5). Over 30 years, there were 4 peaks in the effective population size of L1, each of which was driven by different sub-lineages. The first L1 peak (1998) occurred prior to the peaks of any of the analyzed sub-lineages, and may have resulted from older sub-lineages, such as L1D, which were classified as primitive L1 in this study [18]. Subsequent peaks coincided with the epidemic-like wave of L1F, which appears to have pushed the overall L1 population to the second peak in 2005. The third wave, peaking in 2012, involved L1F, L1BG, and L1C, but increases associated with this peak appear to be most strongly driven by a rapid increase in L1C. The most recent peak (2017) was mainly driven by the sub-lineages L1A(2) and L1H, which emerged in the late 2000s but did not experience rapid growth until the mid 2010s (Figure 5). The Skygrid analyses of the inferred population sizes were remarkedly consistent across the different subsampling techniques, nearly completely overlaying on top of each other, except during the early time period (before 2000), when the size of the first peak in population size varied according to the subsampling strategy (Supplementary Figure S13).
The median nucleotide substitution rate for overall L1, which was computed from the spatiotemporal MCC trees, was 8.6 × 10−3 (IQR = 7.7 × 10−3 − 1.0 × 10−2) substitutions per nucleotide site per year (s/n/y). There was no significant difference between sub-lineages or geographic regions, nor was there a clear temporal pattern in branch-specific substitution rates (Supplementary Figures S14 and S15).
The selective pressure analysis of the spatiotemporal stratified sequences and trees estimated that ORF5 mostly evolved under near-neutrality, regardless of lineage or region (median and mode ω = 1, IQR = 0.38 − 2.11). Although the adaptive branch-site model’s ω showed that the ancestral viruses of several trees’ branches also evolved under both extremely high-purifying and positive selection, only three branches from three different subsampled trees can be considered to be positively selected (p-value < 0.05) according to the likelihood ratio test (LRT). These rare episodic positive selection events displayed no clear spatial or temporal patterns. One was identified at an internal branch of primitive L1 in 1992, prior to the virus being introduced from Canada; another was a terminal branch of primitive L1 in 2000 in the East, which originated from Canada, and a third was a terminal branch of sub-lineage L1C in 2014 in the East, without any associated inter-regional transition. The positive selected branches did not have higher evolutionary rates compared to the others (Supplementary Figures S16 and S17).

4. Discussion

In this work, we aggregated several large-scale PRRSV-2 L1 ORF5 sequence datasets from the U.S. and Canada to reconstruct the historical patterns of lineage emergence, inter-regional spread, and population dynamics using phylogeography. Using such a large dataset allowed us to apply different subsampling techniques and phylogeographic approaches and test the sensitivity of our inferences to the sub-sampling technique and modeling approach utilized [44,62]. We found evidence that L1 viruses overall, as well as the sub-lineages L1C, L1F, and L1H, potentially originated in Canada, while the sub-lineages L1A, L1A(2), L1BG, and L1E emerged within the Upper Midwestern U.S. However, the initial region of origin might not always be the hotspot that is most responsible for frequent inter-regional spread. Moreover, the hotspot and spreading patterns of each viral subpopulation varied. For example, the opposing directionality of the dissemination of the two currently predominant sub-lineages, L1C and L1A(2), demonstrated the varied phylogeographic histories of different subpopulations. This complexity was also found in the temporal demographic inference, which shows that the contribution of each co-existing sub-lineage to the overall L1 population dynamics has fluctuated through time.
The majority of our inferences, as well as a previous phylogeographic analysis published in 2013 [28], indicate that L1 viruses in the U.S. originated in Canada. This does not seem to be due to lack of sequence data in the U.S. before 1998 (the earliest detected L1 sequence in the U.S.) [63], given that there are abundant sequence data available (>230 sequences, all non-L1) from throughout the U.S. since 1989 [23,64,65,66,67]. The detection of L1 sequences as early as 1991 in Canada, but not in the U.S., [68,69,70,71] further supports the conclusions of our phylogeographic analysis. Many of the early Canadian sequences were not included in the analysis where sub-sampling was proportional to pig population size; the preponderance of Upper Midwestern sequences and the exclusion of early Canadian sequences in those sub-samples resulted in the Upper Midwestern U.S. being the inferred origin of L1 and all of its sub-lineages, but we believe this to be an analytical artefact of the overrepresentation of this region. Thus, we contend that spatiotemporally stratified sub-sampling was the best approach to overcome the sampling biases in our dataset.
Performing the analysis with the spatiotemporal sub-sampled sequences in MASCOT, which uses a forwards/backwards algorithm [72] with a structured coalescent model [57], resulted in drastically different results, in which all L1 viruses except L1H were inferred to originate in the East. Such a pattern would be possible only if the L1 virus did truly exist in the East undetected during or before the 1990s but had never been sampled or sequenced. In fact, according to our data, at least 40 samples from the Eastern region were sequenced between 1992 and 1999, but none of them was classified as L1. This further supports our conclusion that conducting DTA of the spatiotemporally stratified samples was the most reliable and robust approach for our data.
While the Canadian origin of the L1 lineage is well supported, the evidence for Canada’s potential role in subsequent inter-regional spread was inconclusive, given the available dataset. The introductions of the primitive L1, L1F (circa 1998–2001), L1C (circa 2000–2005), and L2H (circa 2009–2011) from Canada to the U.S. were highlighted, while reverse dissemination from the U.S. back to Canada was rare. This unidirectional pattern is not surprising, given that the U.S. imports millions of feeder and finishing pigs per year from Canada but exports only a few thousand pigs back to Canada annually [30]. The changes in the structures of the U.S. and Canada’s pig industries that led to this phenomenon have been described elsewhere [73,74], but it is worth noting that 90% of imported pigs are moved into the Upper Midwest [75]. L1H, the most recent sub-lineage originating from Canada, appears to have been introduced to the U.S. in the late 2000s when the Canada-to-U.S. pig imports were at their peak (8 to 10 million heads/year) [30]. Once the virus has been introduced to the U.S., the Upper Midwest appears to be the epicenter of inter-regional spread; this is likely because it is a hub for interstate pig shipments, given the number of harvest plants and the proximity to corn and soybean crops that are used in feed [76,77]. Notably, this general pattern of virus movement from Canada to the Upper Midwest, following by inter-regional spread within the U.S., mirrors the phenomenon observed for swine influenza [78,79].
The question of whether Canada continues to disseminate viruses into the U.S. cannot be fully answered due the reduced number of Canadian L1 ORF5 sequences available in the NCBI GenBank database in recent decades. Reductions in the number of PRRSV-2 sequences available in GenBank also occurred in the U.S. After 2010; when sequencing technology became more accessible and affordable, more ORF5 sequencing was likely requested by field veterinarians for diagnostic purposes, which could explain the reduced number of publicly available sequences. Thus, data sharing via GenBank, especially for the kind of sensitive metadata that we needed, i.e., sampling dates and locations (states), has been decreasing due to confidentiality concerns.
Phylogeographic and phylodynamic analyses of each particular L1 sub-lineage not only revealed varying patterns of inter-regional spread and population dynamics, but also connected the dots between the virus’s evolutionary history, historical PRRS outbreaks, and potential factors facilitating disease spread. Apart from the first emergence caused by the primitive L1 viruses (L1D), L1F was the earliest well-described sub-lineage and contributed to the second wave of the effective population size of L1, which peaked in the mid-2000s. A virus belonging to this sub-lineage was first isolated from a sample collected in 2001 from a severe PRRS case in southern Minnesota; this isolate is widely referred to as MN184 [80] according to the 1-8-4 ORF5 restriction fragment length polymorphism (RFLP) pattern [66]. L1F viruses were mainly detected in Midwestern regions, with multiple introductions to the Southwest [63]. Interstate swine movement records in 2001 further supported this finding, showing that the Southwest was the second-most frequent destination for pig movements from the Upper Midwest [76]. Currently, L1F appears almost extinct in the U.S., with fewer than two sequences detected each year since 2019 (Figure 3).
Between 2007 and 2008, two virulent PRRSV-2 strains that were responsible for regional outbreaks in Iowa and Minnesota were isolated. They were initially designated as 1-18-2 [81] and NADC-30 [82] strains, which were eventually classified as part of the sub-lineages L1BG and L1C, respectively. These coincided with the third wave of L1 population expansion in the early 2010s, which was influenced by the rise of those sub-lineages along with the sub-lineage L1F. Unlike the previous wave, the estimated inter-regional spread of L1C and L1BG during this wave was prevalent in the eastern part of the country as well as in the Upper Midwest. Patterns of spread completely shifted during the most recent wave in the late 2010s, to which L1A(2) and L1H were the primary contributors (Supplementary Video S1). L1A(2) was the infamous and virulent 1-7-4 strain first detected in 2014–2015 [83,84]. Our analysis confirmed unpublished reports that its spreading hotspot was the Eastern U.S., where the virus spread widely before spilling out into the Midwest [85].
In contrast, L1H (80% of the clade members were RFLP 1-8-4 [18]) appears to be an endemic virus that is mostly confined to the Southwest; we found few published epidemiological and virological characterizations of this particular group of viruses. The impact of recent and widespread outbreaks in the Upper Midwest in 2020–2021, caused by the novel L1C-1-4-4 [86] variant, was not captured by our analysis despite the availability of a number of ORF5 sequences. This may be because the novel L1C variant, as of 2021, was not yet so widely spread at the national scale that it significantly contributed to the virus’s overall population and spatiotemporal dynamics.
The inter-regional long-distance spread of PRRSV-2 has been an unavoidable consequence of the vertical integration of U.S. swine production systems, and L1 viruses have evolved in parallel with the expansion of U.S. swine production. After entering the country around the late 1990s, L1 viruses were almost impossible to contain in the Midwestern U.S. since hog and pig inventories were growing rapidly in new areas of the country, including the eastern state of North Carolina and several southwestern states (Oklahoma, Colorado, Texas, and Utah) during the same period [87]. In 2013, Shi et al. estimated that L1 viruses largely circulated only between Canada and the Midwestern regions (the Lake States, Corn Belt, and Northern Plains); meanwhile, the East (Appalachia) was one of the spreading hotspots for non-L1 lineages (Lineages 5–9) [28]. Over time, with increasing L1 prevalence, we captured the geographical shift in the spread of L1. The Eastern region became another spreading hotspot for particular sub-lineages (L1C and L1A(2)) during the third and the fourth waves of L1, likely because it is now the second largest swine producer after the Midwest [43] and the number of outgoing animal shipments is comparable to or sometimes higher than that in the Midwest [88]. These spatial dynamics of the spread may not be exclusive to PRRSV-2 but may be applicable to other swine diseases. Swine influenza, for example, appears to have been introduced into the Midwest from Canada, and new spreading hotspots in the East or other regions are also apparent in phylogeographic analyses of swine influenza in the U.S. [78,79]. Such a correspondence indicates that the dissemination patterns of multiple pathogens in the U.S. swine industry have common determinants, which are likely related to the structural and demographic organization of the industry.
The U.S. swine industry has been characterized by multi-site pig production for several decades [89,90] and animal movement is a key component that keeps the production flow uninterrupted [91]. The long-distance transport of live animals, cull hogs [92], feed, personnel, and equipment is driven by the uneven distribution of feed resources, production phases, and slaughtering facilities between geographic regions, and/or by contractual relationships between sites [91]. All of these logistical factors are potential risk factors that help to transmit PRRSV-2 via either direct or indirect contacts [20]. In addition, variation in protective measures, such as biosecurity practices and vaccinations, may cause unpredictable changes to the patterns of viral spread, though we were not able to assess these here. Further information relating to field samples, such as the production type of an infected farm, the immunization status, and farm-related transportations before an outbreak, could be useful factors to include in phylogeographic regressions; they can be employed within the DTA framework [93,94] to estimate factors associated with inter-regional transmission. The fraction of sequenced samples relative to the number of actual cases could improve sub-sampling techniques and prevent the occurrence of biases related to unequal sampling/sequencing efforts between regions [62].
We found no support for the hypothesis that selection pressures varied spatially. Episodic diversifying selection along the L1 virus phylogenetic tree was rarely detected. The few branches with evidence of episodic selection were related neither to the (re-)emergence of L1 sub-lineages nor to the expansion of any virus variant. This is not surprising, as such analyses are typically used to assess a virus’s adaptation to a different host species [95,96,97,98,99,100]. Accordingly, we concluded that there is little evidence for episodic selection in PRRSV-2 L1’s evolutionary population dynamics. The estimation of episodic selection takes into account overall genetic changes within ORF5 across each branch of the tree. Since many parts of ORF5 are strongly conserved, using this branch-site model potentially obscures positive selection that occurs in specific amino acid residues, e.g., antigenically important amino acid sites that have been detected using site-specific models [3,101].

5. Conclusions

Here, we used phylogeographic models to analyze the spread of PRRSV-2 L1 in the U.S. over 30 years, covering the time period from the first appearance of L1 in the U.S. to its current dominance. Canada was estimated to be the most likely origin of the virus, which was introduced to the U.S. in the late 1990s, potentially through imports of feeder and finisher animals. Since then, the virus has diverged into several sub-lineages, each displaying different spatiotemporal dynamics. Once L1 viruses were established in the U.S., the swine-dense Upper Midwest region became a major hotspot for inter-regional spread. The contribution of the Eastern region of the U.S. to inter-regional spread has increased in the last decade, potentially due to the expansion of hog inventories within that region. The national-scale overview of recent PRRSV spatiotemporal dynamics presented here provides a contextual framework to aid in the design of PRRS prevention and control strategies for pig farming and disease control.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/pathogens12050740/s1, Figure S1: ORF5 sequences gathering and filtering process.; Figure S2: Geographical distribution of pig farming (left) and ORF5 sequences from the full dataset (right) by region (top) and state (bottom).; Figure S3: Geographical distribution of ORF5 sequences from each subsampled dataset.; Figure S4: Temporal distribution of ORF5 sequences from each subsampled dataset colored by source, sampling region, and pre-identified sub-lineage.; Figure S5: Probability of region of origin (top) and Median tMRCA with 95% HPD interval (bottom) of each L1 sub-lineage and overall L1 from all runs compared between different subsampling techniques.; Figure S6: Probability of region of origin (top) and Median tMRCA with 95% HPD interval (bottom) of each L1 sub-lineage and overall L1 from all runs compared between different phylogeographic approaches.; Figure S7: Key results from TreeTime analysis on the full L1 dataset. (A) The time-scaled phylogenetic tree with tip colored by sampling region and internal branch colored by inferred ancestral region. (B) Probability (of region of origin of each L1 sub-lineage and overall L1. (C) The similar timed tree with tip colored by classified sub-lineage. (D) Median tMRCA with 95% HPD interval of L1 and its sub-lineages.; Figure S8: Comparison of inter-regional spread of PRRSV-2 L1 in the U.S. between different phylogeographic analyses and subsampling techniques in map and arrows format.; Figure S9: Comparison of inter-regional spread of PRRSV-2 L1 in the U.S. between different phylogeographic analyses and subsampling techniques in matrix format.; Figure S10: Median between-region transitions of each L1 sub-lineage estimated by DTA on spatio-temporal stratified sampled sets in matrix format.; Figure S11: Proportion of ancestral region (trait of branch) through time between different phylogeographic analyses and subsampling techniques.; Figure S12: Proportion of the region receiving the virus from another region (regional transition) through time between different phylogeographic analyses and subsampling techniques.; Figure S13: Comparison of L1 population dynamics estimated by Bayesian Skygrid analysis on datasets from different subsampling techniques. Thin lines in the background are median effective population size of each run. Thick lines with bands are LOESS smoothing curve with 95% probability interval of the median population sizes from five runs of each subsampling technique. Color of line and band on the plot represents subsampling technique.; Figure S14: Comparison of branch median substitution rate between sub-lineages (left) and temporal distribution of the rate between sub-lineages (right).; Figure S15: Comparison of branch median substitution rate between regions (left) and temporal distribution of the rate between regions (right).; Figure S16: Comparison of branch specific dN/dS ratio through time between sub-lineages (left) and between regions (right). Black circle locates a significant episodic positive selected branch.; Figure S17: Comparison of branch specific dN/dS ratio (y-axis) and substitution rate (x-axis) between sub-lineages (left) and between regions (right). Black circle locates a significant episodic positive selected branch.; Table S1: Designated geographic regions, pig inventory, and available PRRSV-2 L1 ORF5 sequences.; Table S2: Parameter settings in different phylogeographic approaches.; Video S1: Inter-regional spread of PRRSV-2 L1 in the U.S. through time inferred from the full L1 dataset (n = 19,395) using TreeTime in the Nextstrain platform.

Author Contributions

N.P. and K.V. conceived and designed the study. C.A.C., A.R. and M.K. provided the sequencing data and metadata. I.A.D.P. curated and analyzed some of the sequencing data. D.N.M. compartmentalized spatial data. K.V. and S.L. supervised the analysis and the interpretation of the results. N.P. performed the data analysis, interpreted the results, and wrote the first draft of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the joint US-UK NIFA-NSF-NIH-BBSRC Ecology and Evolution of Infectious Disease awards 2019-67015-29918 and BB/T004401/1, the University of Minnesota College of Veterinary Medicine Signature Programs, grant number MIN-62-133, and the Critical Agricultural Research and Extension Program, grant number 2018-68008-27890. This work was also partly funded by the University of Minnesota Swine Disease Eradication Center (SDEC) and the Swine Health Information Center (SHIC) as the funding agency for MSHMP. N.P. was supported by a Royal Thai Government Scholarship.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used for the analyses in this study may be made available upon request to the corresponding author (K.V.). Only some of the data are publicly available (the NCBI GenBank sequencing data), as most PRRSV-2 genetic sequences are confidential diagnostic data obtained from third parties (i.e., the participating swine production system and veterinarians who submitted samples for diagnosis to one of the universities’ VDLs).

Acknowledgments

The authors would like to thank the Morrison Swine Health Monitoring Project (MSHMP) participants and the Veterinary Diagnostic Laboratory, University of Minnesota, for sharing its PRRSV-2 genetic sequences, as well as the Swine health Information Center (SHIC) for funding the MSHMP.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lunney, J.K.; Benfield, D.A.; Rowland, R.R.R. Porcine reproductive and respiratory syndrome virus: An update on an emerging and re-emerging viral disease of swine. Virus Res. 2010, 154, 1–6. [Google Scholar] [CrossRef] [PubMed]
  2. Shi, M.; Lam, T.T.Y.; Hon, C.C.; Hui, R.K.H.; Faaberg, K.S.; Wennblom, T.; Murtaugh, M.P.; Stadejek, T.; Leung, F.C.C. Molecular epidemiology of PRRSV: A phylogenetic perspective. Virus Res. 2010, 154, 7–17. [Google Scholar] [CrossRef] [PubMed]
  3. Paploski, I.A.D.; Corzo, C.; Rovira, A.; Murtaugh, M.P.; Sanhueza, J.M.; Vilalta, C.; Schroeder, D.C.; VanderWaal, K. Temporal Dynamics of Co-circulating Lineages of Porcine Reproductive and Respiratory Syndrome Virus. Front. Microbiol. 2019, 10, 2486. [Google Scholar] [CrossRef]
  4. Walker, P.J.; Siddell, S.G.; Lefkowitz, E.J.; Mushegian, A.R.; Adriaenssens, E.M.; Alfenas-Zerbini, P.; Davison, A.J.; Dempsey, D.M.; Dutilh, B.E.; García, M.L.; et al. Changes to virus taxonomy and to the International Code of Virus Classification and Nomenclature ratified by the International Committee on Taxonomy of Viruses. Arch. Virol. 2021, 166, 2633–2648. [Google Scholar] [CrossRef]
  5. Calzada-Nova, G.; Schnitzlein, W.M.; Husmann, R.J.; Zuckermann, F.A. North American Porcine Reproductive and Respiratory Syndrome Viruses Inhibit Type I Interferon Production by Plasmacytoid Dendritic Cells. J. Virol. 2011, 85, 2703–2713. [Google Scholar] [CrossRef]
  6. Rahe, M.C.; Murtaugh, M.P. Mechanisms of adaptive immunity to porcine reproductive and respiratory syndrome virus. Viruses 2017, 9, 148. [Google Scholar] [CrossRef] [PubMed]
  7. Costers, S.; Lefebvre, D.J.; Delputte, P.L.; Nauwynck, H.J. Porcine reproductive and respiratory syndrome virus modulates apoptosis during replication in alveolar macrophages. Arch. Virol. 2008, 153, 1453–1465. [Google Scholar] [CrossRef] [PubMed]
  8. Malgarin, C.M.; Moser, F.; Pasternak, J.A.; Hamonic, G.; Detmer, S.E.; MacPhee, D.J.; Harding, J.C.S. Fetal hypoxia and apoptosis following maternal porcine reproductive and respiratory syndrome virus (PRRSV) infection. BMC Veter. Res. 2021, 17, 182. [Google Scholar] [CrossRef]
  9. Thanawongnuwech, R.; Thacker, B.; Halbur, P.; Thacker, E.L. Increased production of proinflammatory cytokines following infection with porcine reproductive and respiratory syndrome virus and Mycoplasma hyopneumoniae. Clin. Diagn. Lab. Immunol. 2004, 11, 901–908. [Google Scholar]
  10. Li, J.; Wang, S.; Li, C.; Wang, C.; Liu, Y.; Wang, G.; He, X.; Hu, L.; Liu, Y.; Cui, M.; et al. Secondary Haemophilus parasuis infection enhances highly pathogenic porcine reproductive and respiratory syndrome virus (HP-PRRSV) infection-mediated inflammatory responses. Veter. Microbiol. 2017, 204, 35–42. [Google Scholar] [CrossRef]
  11. Alex Pasternak, J.; MacPhee, D.J.; Harding, J.C.S. Fetal cytokine response to porcine reproductive and respiratory syndrome virus-2 infection. Cytokine 2020, 126, 154883. [Google Scholar] [CrossRef] [PubMed]
  12. The OIE AD HOC Group on Porcine Reproductive Respiratory Syndrome. PRRS: The Disease, Its Diagnosis, Prevention and Control. Available online: https://www.oie.int/fileadmin/Home/eng/Our_scientific_expertise/docs/pdf/PRRS_guide_web_bulletin.pdf (accessed on 15 October 2021).
  13. Valdes-Donoso, P.; Alvarez, J.; Jarvis, L.S.; Morrison, R.B.; Perez, A.M. Production losses from an endemic animal disease: Porcine reproductive and respiratory syndrome (PRRS) in selected Midwest US Sow Farms. Front. Veter. Sci. 2018, 5, 102. [Google Scholar] [CrossRef] [PubMed]
  14. Neumann, E.J.; Kliebenstein, J.B.; Johnson, C.D.; Mabry, J.W.; Bush, E.J.; Seitzinger, A.H.; Green, A.L.; Zimmerman, J.J. Assessment of the economic impact of porcine reproductive and respiratory syndrome on swine production in the United States. J. Am. Veter. Med. Assoc. 2005, 227, 385–392. [Google Scholar] [CrossRef] [PubMed]
  15. Holtkamp, D.J.; Kliebenstein, J.B.; Neumann, E.J.; Zimmerman, J.J.; Rotto, H.F.; Yoder, T.K.; Wang, C.; Yeske, P.E.; Mowrer, C.L.; Haley, C.A. Assessment of the economic impact of porcine reproductive and respiratory syndrome virus on United States pork producers. J. Swine Health Prod. 2013, 21, 72–84. [Google Scholar]
  16. Keffaber, K.K. Reproduction failure of unknown etiology. Am. Assoc. Swine Pr. Newsl. 1989, 1, 1–9. [Google Scholar]
  17. The Morrison Swine Health Monitoring Project. PRRS CUMULATIVE INCIDENCE. Available online: https://vetmed.umn.edu/centers-programs/swine-program/outreach-leman-mshmp/mshmp/mshmp-prrs-figures (accessed on 21 September 2022).
  18. Paploski, I.A.D.; Pamornchainavakul, N.; Makau, D.N.; Rovira, A.; Corzo, C.A.; Schroeder, D.C.; Cheeran, M.C.J.; Doeschl-Wilson, A.; Kao, R.R.; Lycett, S.; et al. Phylogenetic structure and sequential dominance of sub-lineages of prrsv type-2 lineage 1 in the United States. Vaccines 2021, 9, 608. [Google Scholar] [CrossRef]
  19. Lee, K.; Polson, D.; Lowe, E.; Main, R.; Holtkamp, D.; Martínez-López, B. Unraveling the contact patterns and network structure of pig shipments in the United States and its association with porcine reproductive and respiratory syndrome virus (PRRSV) outbreaks. Prev. Veter. Med. 2017, 138, 113–123. [Google Scholar] [CrossRef]
  20. Pileri, E.; Mateu, E. Review on the transmission porcine reproductive and respiratory syndrome virus between pigs and farms and impact on vaccination. Veter. Res. 2016, 47, 108. [Google Scholar] [CrossRef] [PubMed]
  21. VanderWaal, K.; Paploski, I.A.D.; Makau, D.N.; Corzo, C.A. Contrasting animal movement and spatial connectivity networks in shaping transmission pathways of a genetically diverse virus. Prev. Veter. Med. 2020, 178, 104977. [Google Scholar] [CrossRef]
  22. Makau, D.N.; Paploski, I.A.D.; Corzo, C.A.; VanderWaal, K. Dynamic network connectivity influences the spread of a sub-lineage of porcine reproductive and respiratory syndrome virus. Transbound. Emerg. Dis. 2022, 69, 524–537. [Google Scholar] [CrossRef]
  23. Wesley, R.D.; Mengeling, W.L.; Lager, K.M.; Clouser, D.F.; Landgraf, J.G.; Frey, M.L. Differentiation of a porcine reproductive and respiratory syndrome virus vaccine strain from North American field strains by restriction fragment length polymorphism analysis of ORF 5. J. Veter. Diagnostic Investig. 1998, 10, 140–144. [Google Scholar] [CrossRef]
  24. Wissink, E.H.J.; van Wijk, H.A.R.; Kroese, M.V.; Weiland, E.; Meulenberg, J.J.M.; Rottier, P.J.M.; van Rijn, P.A. The major envelope protein, GP5, of a European porcine reproductive and respiratory syndrome virus contains a neutralization epitope in its N-terminal ectodomain. J. Gen. Virol. 2003, 84, 1535–1543. [Google Scholar] [CrossRef]
  25. Kim, W.I.; Kim, J.J.; Cha, S.H.; Wu, W.H.; Cooper, V.; Evans, R.; Choi, E.J.; Yoon, K.J. Significance of genetic variation of PRRSV ORF5 in virus neutralization and molecular determinants corresponding to cross neutralization among PRRS viruses. Veter. Microbiol. 2013, 162, 10–22. [Google Scholar] [CrossRef] [PubMed]
  26. Shi, M.; Lam, T.T.-Y.; Hon, C.-C.; Murtaugh, M.P.; Davies, P.R.; Hui, R.K.-H.; Li, J.; Wong, L.T.-W.; Yip, C.-W.; Jiang, J.-W.; et al. Phylogeny-Based Evolutionary, Demographical, and Geographical Dissection of North American Type 2 Porcine Reproductive and Respiratory Syndrome Viruses. J. Virol. 2010, 84, 8700–8711. [Google Scholar] [CrossRef]
  27. Giovani Trevisan; Daniel Linhares Swine Disease Detection Dashboards. Available online: https://fieldepi.research.cvm.iastate.edu/domestic-swine-disease-monitoring-program/ (accessed on 20 September 2022).
  28. Shi, M.; Lemey, P.; Singh Brar, M.; Suchard, M.A.; Murtaugh, M.P.; Carman, S.; D’Allaire, S.; Delisle, B.; Lambert, M.È.; Gagnon, C.A.; et al. The spread of type 2 porcine reproductive and respiratory syndrome virus (prrsv) in North America: A phylogeographic approach. Virology 2013, 447, 146–154. [Google Scholar] [CrossRef] [PubMed]
  29. MSHMP History|College of Veterinary Medicine—University of Minnesota. Available online: https://vetmed.umn.edu/centers-programs/swine-program/outreach-leman-mshmp/mshmp-history (accessed on 28 January 2021).
  30. Economic Research Service, USDA. Hogs: Annual and Cumulative Year-to-Date U.S. Trade (Head). 2022. Available online: https://www.ers.usda.gov/webdocs/DataFiles/81475/Hog_YearlyFull.xlsx?v=7597.1 (accessed on 30 September 2022).
  31. Kazutaka, K.; Misakwa, K.; Kei-ichi, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar]
  32. Price, M.N.; Dehal, P.S.; Arkin, A.P. FastTree 2—Approximately maximum-likelihood trees for large alignments. PLoS ONE 2010, 5, e9490. [Google Scholar] [CrossRef]
  33. Libin, P.J.K.; Deforche, K.; Abecasis, A.B.; Theys, K. VIRULIGN: Fast codon-correct alignment and annotation of viral genomes. Bioinformatics 2019, 35, 1763–1765. [Google Scholar] [CrossRef]
  34. Martin, D.P.; Varsani, A.; Roumagnac, P.; Botha, G.; Maslamoney, S.; Schwab, T.; Kelz, Z.; Kumar, V.; Murrell, B. RDP5: A computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets. Virus Evol. 2021, 7, veaa087. [Google Scholar] [CrossRef]
  35. Martin, D.; Rybicki, E. RDP: Detection of recombination amongst aligned sequences. Bioinformatics 2000, 16, 562–563. [Google Scholar] [CrossRef]
  36. Padidam, M.; Sawyer, S.; Fauquet, C.M. Possible emergence of new geminiviruses by frequent recombination. Virology 1999, 265, 218–225. [Google Scholar] [CrossRef] [PubMed]
  37. Smith, J.M. Analyzing the mosaic structure of genes. J. Mol. Evol. 1992, 34, 126–129. [Google Scholar] [CrossRef] [PubMed]
  38. Martin, D.P.; Posada, D.; Crandall, K.A.; Williamson, C. A Modified Bootscan Algorithm for Automated Identification of Recombinant Sequences and Recombination Breakpoints. AIDS Res. Hum. Retrovir. 2005, 21, 98–102. [Google Scholar] [CrossRef]
  39. Gibbs, M.J.; Armstrong, J.S.; Gibbs, A.J. Sister-scanning: A Monte Carlo procedure for assessing signals in rebombinant sequences. Bioinformatics 2000, 16, 573–582. [Google Scholar] [CrossRef]
  40. Posada, D.; Crandall, K.A. Evaluation of methods for detecting recombination from DNA sequences: Computer simulations. Proc. Natl. Acad. Sci. USA 2001, 98, 13757–13762. [Google Scholar] [CrossRef]
  41. Lam, H.M.; Ratmann, O.; Boni, M.F. Improved Algorithmic Complexity for the 3SEQ Recombination Detection Algorithm. Mol. Biol. Evol. 2018, 35, 247–251. [Google Scholar] [CrossRef] [PubMed]
  42. The Swine Health Information Center The Swine Health Information Center’s Rapid Response Program. Available online: https://www.swinehealth.org/rapid-response-to-emerging-disease-program/ (accessed on 22 September 2022).
  43. National Agricultural Statistics Service. 2017 Census of Agriculture. 2019. Available online: https://www.nass.usda.gov/Publications/AgCensus/2017/Full_Report/Volume_1,_Chapter_1_US/usv1.pdf (accessed on 22 September 2022).
  44. De Maio, N.; Wu, C.H.; O’Reilly, K.M.; Wilson, D. New Routes to Phylogeography: A Bayesian Structured Coalescent Approximation. PLoS Genet. 2015, 11, e1005421. [Google Scholar] [CrossRef] [PubMed]
  45. Kalkauskas, A.; Perron, U.; Sun, Y.; Goldman, N.; Baele, G.; Guindon, S.; De Maio, N. Sampling bias and model choice in continuous phylogeography: Getting lost on a random walk. PLoS Comput. Biol. 2021, 17, e1008561. [Google Scholar] [CrossRef]
  46. Lemey, P.; Rambaut, A.; Drummond, A.J.; Suchard, M.A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 2009, 5, e1000520. [Google Scholar] [CrossRef]
  47. Suchard, M.A.; Lemey, P.; Baele, G.; Ayres, D.L.; Drummond, A.J.; Rambaut, A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018, 4, vey016. [Google Scholar] [CrossRef]
  48. Drummond, A.J.; Ho, S.Y.W.; Phillips, M.J.; Rambaut, A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4, e88. [Google Scholar] [CrossRef] [PubMed]
  49. Gill, M.S.; Lemey, P.; Faria, N.R.; Rambaut, A.; Shapiro, B.; Suchard, M.A. Improving bayesian population dynamics inference: A coalescent-based model for multiple loci. Mol. Biol. Evol. 2013, 30, 713–724. [Google Scholar] [CrossRef]
  50. Rambaut, A.; Lam, T.T.; Max Carvalho, L.; Pybus, O.G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016, 2, vew007. [Google Scholar] [CrossRef]
  51. Trifinopoulos, J.; Nguyen, L.T.; von Haeseler, A.; Minh, B.Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016, 44, W232–W235. [Google Scholar] [CrossRef] [PubMed]
  52. Drummond, A.J.; Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 2007, 7, 214. [Google Scholar] [CrossRef]
  53. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. NextStrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef]
  54. Ginestet, C. ggplot2: Elegant Graphics for Data Analysis. J. R. Stat. Soc. Ser. A Stat. Soc. 2011, 174, 245–246. [Google Scholar] [CrossRef]
  55. R Core Team R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. 2019. Available online: https://www.r-project.org/ (accessed on 25 October 2022).
  56. Sagulenko, P.; Puller, V.; Neher, R.A. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 2018, 4, vex042. [Google Scholar] [CrossRef]
  57. Müller, N.F.; Rasmussen, D.A.; Stadler, T. The structured coalescent and its approximations. Mol. Biol. Evol. 2017, 34, 2970–2981. [Google Scholar] [CrossRef]
  58. Müller, N.F.; Rasmussen, D.; Stadler, T. MASCOT: Parameter and state inference under the marginal structured coalescent approximation. Bioinformatics 2018, 34, 3843–3848. [Google Scholar] [CrossRef] [PubMed]
  59. Bouckaert, R.; Vaughan, T.G.; Barido-Sottani, J.; Duchêne, S.; Fourment, M.; Gavryushkina, A.; Heled, J.; Jones, G.; Kühnert, D.; De Maio, N.; et al. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 2019, 15, e1006650. [Google Scholar] [CrossRef] [PubMed]
  60. Yang, Z.; Nielsent, R. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol. 2002, 19, 908–917. [Google Scholar] [CrossRef] [PubMed]
  61. Smith, M.D.; Wertheim, J.O.; Weaver, S.; Murrell, B.; Scheffler, K.; Kosakovsky Pond, S.L. Less is more: An adaptive branch-site random effects model for efficient detection of episodic diversifying selection. Mol. Biol. Evol. 2015, 32, 1342–1353. [Google Scholar] [CrossRef] [PubMed]
  62. Liu, P.; Song, Y.; Colijn, C.; MacPherson, A. The impact of sampling bias on viral phylogeographic reconstruction. PLOS Glob. Public Health 2022, 2, e0000577. [Google Scholar] [CrossRef]
  63. Han, J.; Wang, Y.; Faaberg, K.S. Complete genome analysis of RFLP 184 isolates of porcine reproductive and respiratory syndrome virus. Virus Res. 2006, 122, 175–182. [Google Scholar] [CrossRef] [PubMed]
  64. Meulenberg, J.J.M.; Hulst, M.M.; De Meijer, E.J.; Moonen, P.L.J.M.; Den Besten, A.; De Kluyver, E.P.; Wensvoort, G.; Moormann, R.J.M. Lelystad virus, the causative agent of porcine epidemic abortion and respiratory syndrome (PEARS), is related to LDV and EAV. Virology 1993, 192, 62–72. [Google Scholar] [CrossRef]
  65. Meng, X.J.; Paul, P.S.; Halbur, P.G.; Morozov, I. Sequence comparison of open reading frames 2 to 5 of low and high virulence United States isolates of porcine reproductive and respiratory syndrome virus. J. Gen. Virol. 1995, 76, 3181–3188. [Google Scholar] [CrossRef]
  66. Kapur, V.; Elam, M.R.; Pawlovich, T.M.; Murtaugh, M.P. Genetic variation in porcine reproductive and respiratory syndrome virus isolates in the midwestern United States. J. Gen. Virol. 1996, 77, 1271–1276. [Google Scholar] [CrossRef]
  67. Andreyev, V.G.; Wesley, R.D.; Mengeling, W.L.; Vorwald, A.C.; Lager, K.M. Genetic variation and phylogenetic relationships of 22 porcine reproductive and respiratory syndrome virus (PRRSV) field strains based on sequence analysis of open reading frame 5. Arch. Virol. 1997, 142, 993–1001. [Google Scholar] [CrossRef]
  68. Mardassi, H.; Mounir, S.; Dea, S. Molecular analysis of the ORFs 3 to 7 of porcine reproductive and respiratory syndrome virus, Québec reference strain. Arch. Virol. 1995, 140, 1405–1418. [Google Scholar] [CrossRef]
  69. Rodriguez, M.J.; Sarraseca, J.; Garcia, J.; Sanz, A.; Plana-Durán, J.; Casal, J.I. Epitope mapping of the nucleocapsid protein of European and North American isolates of porcine reproductive and respiratory syndrome virus. J. Gen. Virol. 1997, 78, 2269–2278. [Google Scholar] [CrossRef]
  70. Pirzadeh, B.; Gagnon, C.A.; Dea, S. Genomic and Antigenic Variations of Porcine Reproductive and Respiratory Syndrome Virus Major Envelope GP5 Glycoprotein. Can. J. Veter. Res. 1998, 62, 170–177. [Google Scholar]
  71. Gagnon, C.A.; Dea, S. Differentiation between Porcine Reproductive and Respiratory Syndrome Virus Isolates by Restriction Fragment Length Polymorphism of Their ORFs 6 and 7 Genes. Can. J. Veter. Res. 1998, 62, 110–116. [Google Scholar]
  72. Pearl, J. Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach. Cogn. Syst. Lab. Sch. Eng. Appl. Sci. 2022, 129–138. [Google Scholar] [CrossRef]
  73. Haley, M.M. Market Integration in the North American Hog Industries; USDA, Economic Research Service: Washington, DC, USA, 2004. [Google Scholar]
  74. Brisson, Y. The changing face of the Canada hog industry. Stat. Canada 2014, 96-325-X. [Google Scholar]
  75. Whiting, T.L. Special welfare concerns in countries dependent on live animal trade: The real foreign animal disease emergency for Canada. J. Appl. Anim. Welf. Sci. 2008, 11, 149–164. [Google Scholar] [CrossRef] [PubMed]
  76. Shields, D.A.; Mathews, K.H. Interstate Livestock Movements; USDA, Economic Research Service: Washington, DC, USA, 2003. [Google Scholar]
  77. Cabezas, A.H.; Sanderson, M.W.; Lockhart, C.Y.; Riley, K.A.; Hanthorn, C.J. Spatial and network analysis of U.S. livestock movements based on Interstate Certificates of Veterinary Inspection. Prev. Veter. Med. 2021, 193, 105391. [Google Scholar] [CrossRef]
  78. Nelson, M.I.; Culhane, M.R.; Trovão, N.S.; Patnayak, D.P.; Halpin, R.A.; Lin, X.; Shilts, M.H.; Das, S.R.; Detmer, S.E. The emergence and evolution of influenza A (H1α) viruses in swine in Canada and the United States. J. Gen. Virol. 2017, 98, 2663–2675. [Google Scholar] [CrossRef]
  79. Scotch, M.; Mei, C. Phylogeography of swine influenza H3N2 in the United States: Translational public health for zoonotic disease surveillance. Infect. Genet. Evol. 2013, 13, 224–229. [Google Scholar] [CrossRef] [PubMed]
  80. Johnson, W.; Roof, M.; Vaughn, E.; Christopher-Hennings, J.; Johnson, C.R.; Murtaugh, M.P. Pathogenic and humoral immune responses to porcine reproductive and respiratory syndrome virus (PRRSV) are related to viral load in acute infection. Veter. Immunol. Immunopathol. 2004, 102, 233–247. [Google Scholar] [CrossRef]
  81. Yeske, P.; Murtaugh, M. Epidemiology of a new PRRS virus isolate and Outbreak. In Proceedings of the Allen D Leman Swine Conference; Smith, T., Ed.; University of Minnesota: St. Paul, MN, USA, 2008; pp. 11–15. Available online: https://conservancy.umn.edu/bitstream/handle/11299/140078/Yeske.pdf?sequence=1&isAllowed=y (accessed on 25 October 2022).
  82. Brockmeier, S.L.; Loving, C.L.; Vorwald, A.C.; Kehrli, M.E.; Baker, R.B.; Nicholson, T.L.; Lager, K.M.; Miller, L.C.; Faaberg, K.S. Genomic sequence and virulence comparison of four Type 2 porcine reproductive and respiratory syndrome virus strains. Virus Res. 2012, 169, 212–221. [Google Scholar] [CrossRef]
  83. Alkhamis, M.A.; Perez, A.M.; Murtaugh, M.P.; Wang, X.; Morrison, R.B. Applications of Bayesian phylodynamic methods in a recent U.S. porcine reproductive and respiratory syndrome virus outbreak. Front. Microbiol. 2016, 7, 67. [Google Scholar] [CrossRef] [PubMed]
  84. van Geelen, A.G.M.; Anderson, T.K.; Lager, K.M.; Das, P.B.; Otis, N.J.; Montiel, N.A.; Miller, L.C.; Kulshreshtha, V.; Buckley, A.C.; Brockmeier, S.L.; et al. Porcine reproductive and respiratory disease virus: Evolution and recombination yields distinct ORF5 RFLP 1-7-4 viruses with individual pathogenicity. Virology 2018, 513, 168–179. Available online: https://www.sciencedirect.com/science/article/pii/S0042682217303458 (accessed on 25 October 2022). [CrossRef] [PubMed]
  85. Morrison, R.B. PRRS RFLP 1-7-4 Summary. p. 1. Available online: https://www.vetmed.umn.edu/sites/vetmed.umn.edu/files/shmp_2014.43_prrs_rflp_1-7-4-science_page.pdf (accessed on 25 October 2022).
  86. Kikuti, M.; Paploski, I.A.D.; Pamornchainavakul, N.; Picasso-Risso, C.; Schwartz, M.; Yeske, P.; Leuwerke, B.; Bruner, L.; Murray, D.; Roggow, B.D.; et al. Emergence of a New Lineage 1C Variant of Porcine Reproductive and Respiratory Syndrome Virus 2 in the United States. Front. Veter. Sci. 2021, 8, 752938. [Google Scholar] [CrossRef] [PubMed]
  87. Key, N.; McBride, W.D. The Changing Economics of U.S. Hog Production. SSRN Electron. J. 2011. [Google Scholar] [CrossRef]
  88. Sellman, S.; Beck-Johnson, L.M.; Hallman, C.; Miller, R.S.; Owers Bonner, K.A.; Portacci, K.; Webb, C.T.; Lindström, T. Modeling nation-wide U.S. swine movement networks at the resolution of the individual premises. Epidemics 2022, 41, 100636. Available online: https://www.sciencedirect.com/science/article/pii/S1755436522000767 (accessed on 25 October 2022). [CrossRef] [PubMed]
  89. Harris, D.L. Multiple site production. In Proceedings of the Southeast Swine Practitioner Conference, Raleigh, NC, USA, 7–10 June 1992; pp. 1–19. [Google Scholar]
  90. Tokach, M.D.; Goodband, B.D.; O’Quinn, T.G. Performance-enhancing technologies in swine production. Anim. Front. 2016, 6, 15–21. [Google Scholar] [CrossRef]
  91. Ramirez, A.; Whitney, D.; Bickett-Weddle, D. Swine Industry Manual. FAD PReP: Foreign Animal Disease Preparedness & Response Plan/National Animal Health Emergency Management System. United States Dep. Agric. Anim. Plant Health Insp. Serv. 2011. [Google Scholar]
  92. Blair, B.; Lowe, J. Describing the cull sow market network in the US: A pilot project. Prev. Veter. Med. 2019, 162, 107–109. [Google Scholar] [CrossRef]
  93. Faria, N.R.; Suchard, M.A.; Rambaut, A.; Streicker, D.G.; Lemey, P. Simultaneously reconstructing viral crossspecies transmission history and identifying the underlying constraints. Philos. Trans. R. Soc. B Biol. Sci. 2013, 368, 20120196. [Google Scholar] [CrossRef]
  94. Lemey, P.; Rambaut, A.; Bedford, T.; Faria, N.; Bielejec, F.; Baele, G.; Russell, C.A.; Smith, D.J.; Pybus, O.G.; Brockmann, D.; et al. Unifying Viral Genetics and Human Transportation Data to Predict the Global Transmission Dynamics of Human Influenza H3N2. PLoS Pathog. 2014, 10, e1003932. [Google Scholar] [CrossRef]
  95. Spielman, S.J.; Weaver, S.; Shank, S.D.; Magalis, B.R.; Li, M.; Kosakovsky Pond, S.L. Evolution of viral genomes: Interplay between selection, recombination, and other forces. Methods Mol. Biol. 2019, 1910, 427–468. [Google Scholar] [PubMed]
  96. Benfield, C.T.O.; Hill, S.; Shatar, M.; Shiilegdamba, E.; Damdinjav, B.; Fine, A.; Willett, B.; Kock, R.; Bataille, A. Molecular epidemiology of peste des petits ruminants virus emergence in critically endangered Mongolian saiga antelope and other wild ungulates. Virus Evol. 2021, 7, veab062. [Google Scholar] [CrossRef]
  97. Caraballo, D.A.; Lema, C.; Novaro, L.; Gury-Dohmen, F.; Russo, S.; Beltrán, F.J.; Palacios, G.; Cisterna, D.M. A novel terrestrial rabies virus lineage occurring in south america: Origin, diversification, and evidence of contact between wild and domestic cycles. Viruses 2021, 13, 2484. [Google Scholar] [CrossRef] [PubMed]
  98. MacLean, O.A.; Lytras, S.; Weaver, S.; Singer, J.B.; Boni, M.F.; Lemey, P.; Kosakovsky Pond, S.L.; Robertson, D.L. Natural selection in the evolution of SARS-CoV-2 in bats created a generalist virus and highly capable human pathogen. PLoS Biol. 2021, 19, e3001115. [Google Scholar] [CrossRef] [PubMed]
  99. Cariou, M.; Picard, L.; Guéguen, L.; Jacquet, S.; Cimarelli, A.; Fregoso, O.I.; Molaro, A.; Navratil, V.; Etienne, L. Distinct evolutionary trajectories of SARS-CoV-2-interacting proteins in bats and primates identify important host determinants of COVID-19. Proc. Natl. Acad. Sci. USA 2022, 119, e2206610119. [Google Scholar] [CrossRef]
  100. Berry, I.M.; Eyase, F.; Pollett, S.; Konongoi, S.L.; Joyce, M.G.; Figueroa, K.; Ofula, V.; Koka, H.; Koskei, E.; Nyunja, A.; et al. Global Outbreaks and Origins of a Chikungunya Virus Variant Carrying Mutations Which May Increase Fitness for Aedes aegypti: Revelations from the 2016 Mandera, Kenya outbreak. Am. J. Trop. Med. Hyg. 2019, 100, 1249–1257. [Google Scholar] [CrossRef]
  101. Delisle, B.; Gagnon, C.A.; Lambert, M.-È; D’Allaire, S. Porcine reproductive and respiratory syndrome virus diversity of Eastern Canada swine herds in a large sequence dataset reveals two hypervariable regions under positive selection. Infect. Genet. Evol. 2012, 12, 1111–1119. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Temporal distribution of the full PRRSV-2 L1 dataset (left) and an example of a spatiotemporally stratified subsampled dataset (right) colored according to (A) source, (B) sampling location (region), and (C) pre-determined sub-lineage.
Figure 1. Temporal distribution of the full PRRSV-2 L1 dataset (left) and an example of a spatiotemporally stratified subsampled dataset (right) colored according to (A) source, (B) sampling location (region), and (C) pre-determined sub-lineage.
Pathogens 12 00740 g001
Figure 2. Key results from the DTA of spatiotemporally stratified sampled sets. (A) A time-scaled phylogenetic tree of one subsampled set with the tips colored according to the sampling region and the internal branches colored according to the inferred ancestral region. (B) Probability (0–1) of region of origin for each L1 sub-lineage and overall L1 from all runs. (C) The same timed-scaled tree with tips colored according to the classified sub-lineage. (D) Median tMRCA with a 95% HPD interval of L1 and its sub-lineages from the same runs.
Figure 2. Key results from the DTA of spatiotemporally stratified sampled sets. (A) A time-scaled phylogenetic tree of one subsampled set with the tips colored according to the sampling region and the internal branches colored according to the inferred ancestral region. (B) Probability (0–1) of region of origin for each L1 sub-lineage and overall L1 from all runs. (C) The same timed-scaled tree with tips colored according to the classified sub-lineage. (D) Median tMRCA with a 95% HPD interval of L1 and its sub-lineages from the same runs.
Pathogens 12 00740 g002
Figure 3. Maximum-likelihood time-scaled phylogenetic tree of the full L1 dataset (n = 19,395) estimated by TreeTime. Tips and branches are colored according to the sampling region and inferred ancestral region, respectively. The exterior ring is colored according to L1 sub-lineages based on phylogenetic grouping.
Figure 3. Maximum-likelihood time-scaled phylogenetic tree of the full L1 dataset (n = 19,395) estimated by TreeTime. Tips and branches are colored according to the sampling region and inferred ancestral region, respectively. The exterior ring is colored according to L1 sub-lineages based on phylogenetic grouping.
Pathogens 12 00740 g003
Figure 4. Inter-regional spread of PRRSV-2 L1 in the U.S. estimated by the DTA of spatiotemporally stratified sampled sets. (A) Median between-region transitions of the L1 lineage overall. The shade and thickness of the arrows represent the estimated number of transitions. (B) Median between-region transitions of each L1 sub-lineage. The color of the arrow represents the sub-lineage, whereas the thickness represents the number of transitions.
Figure 4. Inter-regional spread of PRRSV-2 L1 in the U.S. estimated by the DTA of spatiotemporally stratified sampled sets. (A) Median between-region transitions of the L1 lineage overall. The shade and thickness of the arrows represent the estimated number of transitions. (B) Median between-region transitions of each L1 sub-lineage. The color of the arrow represents the sub-lineage, whereas the thickness represents the number of transitions.
Pathogens 12 00740 g004
Figure 5. PRRSV-2 L1 population dynamics in the U.S. estimated by Bayesian Skygrid analyses on spatiotemporally stratified sampled sets. Lines with shaded bands are LOESS smoothing curves, with a 95% confidence interval, of the median log effective population sizes from 5 runs of each sub-lineage and overall L1. Lines are colored according to the L1 sub-lineage.
Figure 5. PRRSV-2 L1 population dynamics in the U.S. estimated by Bayesian Skygrid analyses on spatiotemporally stratified sampled sets. Lines with shaded bands are LOESS smoothing curves, with a 95% confidence interval, of the median log effective population sizes from 5 runs of each sub-lineage and overall L1. Lines are colored according to the L1 sub-lineage.
Pathogens 12 00740 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pamornchainavakul, N.; Paploski, I.A.D.; Makau, D.N.; Kikuti, M.; Rovira, A.; Lycett, S.; Corzo, C.A.; VanderWaal, K. Mapping the Dynamics of Contemporary PRRSV-2 Evolution and Its Emergence and Spreading Hotspots in the U.S. Using Phylogeography. Pathogens 2023, 12, 740. https://doi.org/10.3390/pathogens12050740

AMA Style

Pamornchainavakul N, Paploski IAD, Makau DN, Kikuti M, Rovira A, Lycett S, Corzo CA, VanderWaal K. Mapping the Dynamics of Contemporary PRRSV-2 Evolution and Its Emergence and Spreading Hotspots in the U.S. Using Phylogeography. Pathogens. 2023; 12(5):740. https://doi.org/10.3390/pathogens12050740

Chicago/Turabian Style

Pamornchainavakul, Nakarin, Igor A. D. Paploski, Dennis N. Makau, Mariana Kikuti, Albert Rovira, Samantha Lycett, Cesar A. Corzo, and Kimberly VanderWaal. 2023. "Mapping the Dynamics of Contemporary PRRSV-2 Evolution and Its Emergence and Spreading Hotspots in the U.S. Using Phylogeography" Pathogens 12, no. 5: 740. https://doi.org/10.3390/pathogens12050740

APA Style

Pamornchainavakul, N., Paploski, I. A. D., Makau, D. N., Kikuti, M., Rovira, A., Lycett, S., Corzo, C. A., & VanderWaal, K. (2023). Mapping the Dynamics of Contemporary PRRSV-2 Evolution and Its Emergence and Spreading Hotspots in the U.S. Using Phylogeography. Pathogens, 12(5), 740. https://doi.org/10.3390/pathogens12050740

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop