Next Article in Journal
Preeclampsia as a Study Model for Aging: The Klotho Gene Paradigm
Previous Article in Journal
Multicore, SDS-Based Polyelectrolyte Nanocapsules as Novel Nanocarriers for Paclitaxel to Reduce Cardiotoxicity by Protecting the Mitochondria
Previous Article in Special Issue
Expression of Network Medicine-Predicted Genes in Human Macrophages Infected with Leishmania major
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Co-Location of QTL for Vigor and Resistance to Three Diseases in Juglans microcarpa × J. regia Rootstocks

1
Department of Plant Sciences, University of California, Davis, CA 95616, USA
2
Department of Nematology, University of California, Riverside, CA 92521, USA
3
USDA-ARS Crops Pathology and Genetics Research Unit, Department of Plant Pathology, University of California, Davis, CA 95616, USA
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(3), 903; https://doi.org/10.3390/ijms26030903
Submission received: 27 November 2024 / Revised: 11 January 2025 / Accepted: 16 January 2025 / Published: 22 January 2025
(This article belongs to the Special Issue Molecular Biology of Host and Pathogen Interactions: 2nd Edition)

Abstract

:
A QTL on chromosome 4D of the Juglans microcarpa × J. regia genome that co-located resistance against Agrobacterium tumefaciens, Phytophthora pini, and Phytophthora cinnamomi disease scores was investigated for additional traits. Phenotypic data for Pratylenchus vulnus counts and tree height were analyzed in this study for the same hybrids previously used to identify this QTL. Using the same GBS genotype data, the same co-located QTL for A. tumefaciens and Phytophthora spp. disease scores were reproduced and the QTL for P. vulnus counts and tree height were co-located with resistance to A. tumefaciens and Phytophthora spp. Moreover, we found GBS genotype data to harbor additional genetic variation unrelated to any of the traits analyzed. Marker-assisted and genomic selection models were created and assessed for their performance in selection. The ability to predict traits using SNP data was strongest with two-year tree height, followed by A. tumefaciens disease score, three-year tree height, Phytophthora spp. disease score, and P. vulnus counts. These results suggest a shared mechanism of action that links disease to tree height. Moreover, deploying these selection models would assist efforts in walnut improvement for rootstock genotypes.

1. Introduction

Walnuts make a substantial contribution to the California economy, ranking 10th among all California commodities and valued at USD 1.022 billion in 2021 [1]. Crown gall (caused by Agrobacterium tumefaciens), Phytophthora crown and root rot (Phytophthora spp.), and root lesion nematode (Pratylenchus vulnus) are important rootstock diseases of California walnuts that reduce their value. Agrobacterium tumefaciens is a rod-shaped, Gram-negative soil bacterium, while Phytophthora spp. are soilborne oomycetes. These microbes obstruct the vascular tissues with tumors (A. tumefaciens) or kill root, crown, and trunk tissues directly (Phytophthora spp.). Both pathogens cause disease by inhibiting the flow of nutrients and water to the grafted scion, the upper portion of the plant, reducing crop productivity while imperiling plant health. Pratylenchus vulnus, an obligate migratory endoparasitic nematode, causes crop losses by damaging the root cortical tissue and consuming plant nutrients, which ultimately impedes the flow of nutrients and water to the scion [2]. The most recent estimate suggests that these root diseases collectively cost the California walnut industry USD 241 million (~20% of the total farm gate value) per year [3].
Quantitative Trait Locus (QTL) analysis provides a powerful approach to locate and understand the functional basis of important genetic traits in tree crops. QTL mapping studies in tree crops have identified numerous QTL for various growth-related traits, such as stem diameter, tree height, and the number of whorls, providing valuable insights into the genetic control of these traits. For example, a study on Hevea brasiliensis identified 24 QTL for stem diameter, 7 for tree height, and 7 for the number of whorls [4]. Similarly, a study on Populus deltoides and Populus simonii hybrids identified a total of 208 QTL for drought-related traits, leading to the development of molecular identification markers for drought tolerance [5]. Most relevantly, co-located QTL for resistance to Agrobacterium tumefaciens disease score, Phytophthora pini, and Phytophthora cinnamomi were found on chromosome 4D of the Juglans microcarpa × Juglans regia genome [6]. It is important to note that most traits in tree crops are influenced by a large number of underlying QTL, and more research is needed to capture a large proportion of the genetic variation [7]. Therefore, QTL analysis provides valuable information for understanding the genetic basis of important traits in tree crops and has the potential to complement functional genomics analyses for crop improvement [4,5,6,7,8].
Another beneficial outcome of QTL analysis is the foundation for marker-assisted selection (MAS). MAS has demonstrated significant efficacy in tree crops compared to no selection, particularly in expediting the breeding process and enhancing the accuracy of trait selection. Once a QTL has been identified, these markers are used to predict phenotypes in breeding lines with unknown phenotypes to pre-select for potentially beneficial accessions. This contrasts with traditional methods, where selection is based solely on the often laborious determination of phenotypic traits. Several studies have highlighted the advantages of MAS, such as its ability to improve the efficiency of plant breeding, especially in tree species where breeding is time-consuming and challenging due to the genetic complexity of traits [9]. The use of MAS led to higher genetic gain and cost efficiency compared to traditional selection methods when dealing with complex traits controlled by multiple genes [10,11]. In grapes, marker-assisted selection increased the selection efficiency for seedlessness in the fruit. The author’s in silico analysis started with only 52% of the 1012 plants being seedless, and after selection 92% of the 547 plants were seedless [10]. While genomic selection has gained prominence, MAS remains a valuable tool with traits controlled by a few or single loci.
Use of QTL analysis in conjunction with other omics approaches has several advantages over the use of QTL analysis alone, which results in uneven sampling of the genome and potentially missing critical markers [12]. Combining QTL analysis or GWAS with transcriptomic, proteomic, or metabolomic approaches is becoming increasingly popular and shows promise in identifying potential causal genetic-trait relationships [13]. A major QTL had been found in walnut rootstocks for resistance to A. tumefaciens (crown gall) and Phytophthora spp. in Juglans hybrids created by crossing J. microcarpa half-sib mother trees DJUG 31.01 (henceforth 31.01) and DJUG 31.09 (henceforth 31.09) with the cultivated J. regia cv. Serr (henceforth cv. Serr) [6]. A small subset of the hybrids from the breeding populations used previously were analyzed in this study using transcriptomics. The differentially expressed genes suggested that cell wall biogenesis was involved in resistance to these pathogens and tree height [2]. In the work reported here, the QTL for A. tumefaciens and Phytophthora spp. disease scores, and a novel QTL for tree height and P. vulnus counts, all were co-located in the same region. Furthermore, these QTL were employed for genomic selection and marker-assisted selection with exceptional selection accuracies. This research builds on the foundation of molecular biology research in walnuts by providing further clues to root disease mechanisms and adds value to the breeding efforts in the industry by reducing the ratio of work inputs to desirable genetic output.

2. Results

2.1. QTL Mapping and Marker Selection

To discover markers for P. vulnus counts and tree height at two and three years of tree age, the mapped markers of 31.01, 31.09, and cv. ‘Serr’ from seedlings of 31.01 × ‘Serr’ and 31.09 × ‘Serr’ were downloaded from Ramasamy et al. [6] (Supplementary Table S2 of the paper). See methods Section 4.3, “Phenotypic Analysis”, for detailed definitions of the traits analyzed in this study. For 31.01, no markers were removed by the minor allele frequency (MAF) threshold of 0.1, 52 markers were removed by the missing sample of 0.95, no samples were removed by the missing marker threshold of 0.5, and 2152 data points (0.64% of the data) were imputed. For 31.09, 2 markers were removed by the MAF threshold of 0.1, 5 markers were removed by the missing sample of 0.95, no samples were removed by the missing marker threshold of 0.5, and 417 data points (0.11% of the data) were imputed. For cv. ‘Serr’ from the cross with 31.01, no markers were removed by the MAF threshold of 0.1, 40 markers were removed by the missing sample of 0.95, 1 sample was removed by the missing marker threshold of 0.5, and 1514 data points (0.56% of the data) were imputed. For cv. ‘Serr’ from cross with 31.09, no markers were removed by MAF threshold of 0.1, 42 markers were removed by missing sample of 0.95, no samples were removed by missing marker threshold of 0.5, and 1177 data points (0.41% of the data) were imputed.
For the 31.01 haplotype, recursive feature elimination showed that markers 31.01_Jm4D_26168643, 31.01_Jm4D_26359154, and 31.01_Jm4D_26669075 were shared amongst the most important eight predictors by mean decrease in impurity (MDI) across two- and three-year tree heights, and two- and three-year P. vulnus counts, A. tumefaciens disease score, and Phytophthora spp. disease score (File S1). For the 31.09 haplotype, recursive feature elimination showed that markers X31.09_Jm4D_26359154, X31.09_Jm4D_25625822, and X31.09_Jm4D_25101968 were shared amongst the most important eight predictors across two- and three-year tree heights, and three-year P. vulnus counts, A. tumefaciens disease score, and Phytophthora spp. disease score but not two-year P. vulnus counts (File S2). Overall, most of the important predictors were markers from chromosome Jm4D. rrBLUP coefficients of these markers for each trait showed that they were amongst the largest negative effects of the markers selected by the rfe function and indicated that disease resistance and lower tree height were associated with the b allele (Files S3 and S4, Figures S1 and S2).
Moreover, logarithm of the odds (LOD) scores displayed major distinct peaks along 31.01 and 31.09 chromosomes 4D for each trait except for two-year P. vulnus counts in the 31.09 haplotype (Figure 1 and Figure 2). LOD significance thresholds at α = 0.05 for 31.01 × cv. Serr were 3.3, 3.1, 3.2, 3.2, 3.3, and 3.3 for two- and three-year P. vulnus counts, two- and three-year tree height, A. tumefaciens disease score, and Phytophthora spp. disease score, respectively. These thresholds resulted in 31, 17, 65, and 59 significant markers on Jm4D for two- and three-year P. vulnus counts, two-year tree height, and A. tumefaciens disease score, respectively (Table 1). The markers for three-year tree height and Phytophthora spp. disease score were split into 65 and 14, and 38 and 3 for chromosomes 4D and 6D, respectively. Furthermore, these markers spanned 12,109,447, 5,298,334, 30,660,994, and 30,060,179 base pairs on Jm4D for two- and three-year P. vulnus counts, two-year tree height, and A. tumefaciens disease score, respectively. The distances for three-year tree height and Phytophthora spp. disease score QTL were split into 30,660,994 and 3,493,793, and 21,190,695 and 3,720,218 base pairs for chromosomes 4D and 6D, respectively. The top markers by LOD score were 31.01_Jm4D_26669075 two- and three-year P. vulnus counts and two- and three-year tree height (Table 2). For A. tumefaciens and Phytophthora spp. disease scores, 31.01_Jm4D_26168643, 31.01_Jm4D_26359154, and 31.01_Jm4D_26669075 were shared as top makers because they had the same LOD and covered 500,432 base pairs on chromosome 4D.
For 31.01 × cv. Serr, these QTL explained 13.9%, 13.1%, 70.3%, and 45.4% of the variation in the phenotypic data for two- and three-year P. vulnus counts, two-year tree height, and A. tumefaciens disease score, respectively, all from Jm4D (Table 1). The variances explained for three-year tree height and Phytophthora spp. disease score were split into 41.4% and 4.8%, and 21.0% and 6.1% for chromosomes 4D and 6D, respectively. Conversely, just the top marker from each QTL explained 15.7%, 12.9%, 67.1%, 33.8%, 44.1%, and 18.9% for two- and three-year P. vulnus counts, two- and three-year tree height, A. tumefaciens disease score, and Phytophthora spp. disease score, respectively (Table 2).
LOD significance thresholds at α = 0.05 for 31.09 × cv. Serr were 3.2, 3.2, 3.4, 3.3, 3.2, and 3.3 for two- and three-year P. vulnus counts, two- and three-year tree height, A. tumefaciens disease score, and Phytophthora spp. disease score, respectively. These thresholds resulted in 31, 19, and 46 significant markers on Jm4D for two- and three-year tree height and Phytophthora spp. disease score, respectively (Table 3). For two-year P. vulnus counts, these markers were split into amounts of three and two for chromosomes Jm4D and Jm3D, respectively. For three-year P. vulnus counts, these markers were split into amounts of 14 and 39 for chromosomes Jm4D and Jm 1D, respectively. For A. tumefaciens disease score, these markers were split into amounts of 59 and 1 for chromosomes Jm4D and Jm7D, respectively.
These markers spanned 21,274,993, 10,480,961, and 29,549,284 bases on Jm4D for two- and three-year tree height, and Phytophthora spp. disease score, respectively (Table 3). For two-year P. vulnus counts, these distances were split into 54,916 and 372,371 base pairs for chromosomes Jm4D and Jm3D, respectively. For three-year P. vulnus counts, these distances were split into 9,378,085 and 24,916,299 base pairs for chromosomes Jm4D and Jm 1D, respectively. For A. tumefaciens disease score, these distances were split into 31,696,222 and 0 base pairs for chromosomes Jm4D and Jm7D, respectively.
The top marker(s) by LOD score were X31.09_Jm4D_23906200, X31.09_Jm4D_23960954, and X31.09_Jm4D_23961116 for two-year P. vulnus counts, X31.09_Jm4D_26359154 for three-year P. vulnus counts, 31.09_Jm4D_25101968 for two-year tree height, X31.09_Jm4D_26359154 for three-year tree height, X31.09_Jm4D_26359154 for A. tumefaciens disease score, and X31.09_Jm4D_23816262 and X31.09_Jm4D_24259264 for Phytophthora spp. disease score. Two-year P. vulnus counts had three top markers because the LOD scores were the same. These top markers spanned 54,916, 0, 0, 0, 0, and 443,002 base pairs for two- and three-year P. vulnus counts, two- and three-year tree height, A. tumefaciens disease score, and Phytophthora spp. disease score, respectively.
For 31.09 × cv. Serr, the variance explained in the phenotypic data for these QTL was 7.0% and 8.2% on chromosomes Jm3D and 4D for two-year P. vulnus counts, 8.5% and 16.7% on chromosomes Jm1D and 4D for three-year P. vulnus counts, 20.1% on chromosome Jm4D for two-year tree height, 17.2% on chromosome Jm4D for three-year tree height, 39.7% and 7% on chromosomes Jm4D and Jm7D for A. tumefaciens disease score, and 17.5% on chromosome Jm4D for Phytophthora spp. disease score (Table 3). Conversely, just the top marker from each QTL explained 8.2%, 17.3%, 23.5%, 17.1%, 36.9%, and 17.6% for two- and three-year P. vulnus counts, two- and three-year tree height, A. tumefaciens disease score, and Phytophthora spp. disease score, respectively (Table 4).
The cv. Serr haplotype showed no such QTL peaks nor did it have any significant markers for any trait in either cross (Figure 1 and Figure 2).
The top markers were highly correlated with each other, as were many other markers that tended to correlate by chromosome (Figure 3 and Figure 4). Moreover, PCA suggested that chromosome 4D is just one part of the total variation in the GBS data. In the 31.01 haplotype, chromosome 3S explained about as much variation in the data as 4D, and 5D and 1S explained about double the variation in the data as chromosome 4D (Figure 3). In the 31.09 haplotype, chromosome 4D explained a relatively minor amount of variation in the GBS data relative to the other chromosomes.

2.2. Prediction Accuracy of RF and rrBLUP Models to Discover Markers for P. vulnus

We used the rfe function from the caret package to estimate not which but how many markers to use in the marker-assisted selection (MAS) approach via random forest (RF) modeling. By default, the rfe function retained the top four markers for two-year tree height and two- and three-year P. vulnus counts and A. tumefaciens disease score. Conversely, for three-year tree height and Phytophthora spp. disease score, the rfe function retained all 946 markers. For our genomics selection (GS) approach, the ridge regression (rrBLUP) function uses all markers by default, albeit differently than standard regression.
To avoid model overfitting, we manually retained the top eight markers from the random forest function for three-year tree height and Phytophthora spp. disease score, which resulted in increased prediction accuracy over the default 946 markers selected. Using the top eight markers for modeling seemed to produce optimal prediction accuracies across all traits. Please see Methods Section 4.6. “Prediction and Selection Performance Estimation” for more details.
The predictive accuracy of the top markers for each trait was assessed by a machine learning approach and correlation analyses using the randomForest and rrBLUP packages in R. RF and rrBLUP machine learning models resulted in the greatest prediction accuracy for two-year tree height followed by A. tumefaciens disease score, three-year tree height, Phytophthora spp. disease score, two-year P. vulnus counts, and three-year P. vulnus counts (Figure 5 and Table 5). On average, the RF approach outperformed rrBLUP, and predictions made in the 31.01 haplotype outperformed predictions made in the 31.09 haplotype. The only scenarios where the average p-value fell below the significance threshold of 0.05 were two and three-year P. vulnus counts and two-year tree height with the rrBLUP modeling method in the 31.09 maternal haplotype and three-year P. vulnus counts with the rrBLUP method in the 31.01 maternal haplotype.
To assess prediction accuracy more practically, the null hypothesis that artificial selection had no effect on phenotypic values was tested. To do this, we arranged each data split’s actual values by their predicted values, then selected the top 25% of the data and extracted only the actual values. These data became the “selected values”. We then compared the selected values to all the actual values (no selection) using a Student’s t-test. This was repeated 15 times for each trait and method combination. As with the performance assessment by correlation, on average, the RF approach outperformed rrBLUP, and predictions made in the 31.01 haplotype outperformed predictions made in the 31.09 haplotype (Figure 6 and Table 6). In all traits, the use of selection resulted in more desirable phenotypic values. However, average p-values fell below the significance threshold of 0.05 for two and three-year P. vulnus counts for all instances of the rrBLUP modeling method and the RF method in just the two-year P. vulnus counts-female parent 31.09 instance. Moreover, the rrBLUP modeling method’s average p-values fell below the significance threshold for all traits under the 31.09 haplotype. Other selection analyses falling below the significance threshold included all instances of the rrBLUP method for Phytophthora spp. disease score. The scenarios with the highest significance were two-year tree height in the 31.01 haplotype regardless of the modeling method and A. tumefaciens disease score with RF regardless of the maternal haplotype. Selection for Phytophthora spp. disease score yielded high levels of significance with the RF approach, while that of two- and three-year P. vulnus counts had lower values.

3. Discussion

3.1. Co-Located QTL

Machine learning methods were successfully implemented in QTL analyses [14,15] and using non-traditional and traditional statistical methods reproduced the co-located QTL spanning, 31.01_Jm4D_26168643, 31.01_Jm4D_26359154, and 31.01_Jm4D_26669075, for A. tumefaciens disease score and Phytophthora spp. disease score discovered by [6]. Moreover, the QTL for two- and three-year tree height and two- and three-year P. vulnus counts mapped to the same region (Figure 1 and Figure 2). While the co-located QTL were also present in the 31.09 × cv. Serr progeny, the peaks were not as high, and their exact location varied slightly between traits. This result raises some fundamental questions. Why would tree height also be under this QTL? Do any gene(s) within these QTL have SNPs? What are these gene(s) and their function(s)? The co-location of this many QTL for disease resistance suggests a nonspecific and robust disease resistance mechanism, as a gene-for-gene or induced mechanism seems unlikely for such broad-spectrum resistance. Moreover, the co-location of two- and three-year tree height QTL, traits seemingly unrelated to pathogenesis, to the same loci supports the hypothesis that the resistance is due to some pre-formed factor potentially related to the tree’s height. One such pre-formed factor is the plant cell wall, which is known to affect both the growth and disease-resistance characteristics of plants [6,16,17,18,19,20]. A recent study analyzing transcriptomes of four hybrids from this study and three from another breeding population found that gene expression coding for cell wall biogenesis was a significant biological process correlated with tree height, A. tumefaciens root and crown gall size, and Phytophthora spp. root and crown rot [2]. Saxe et al. found that increasing the expression of cell wall biogenetic genes was correlated with increasing tree height, and susceptibility to A. tumefaciens and Phytophthora spp. Similarly, we found that all traits co-located to this QTL on chromosome 4D showed the same relationship. Increasing tree height and increasing susceptibility to all pathogens was associated with the ‘a’ allele and decreasing tree height and resistance to all pathogens was associated with the ‘b’ allele (Files S3 and S4). Given the size of the LOD peaks for the QTL, fine mapping with higher marker density or a multi-omics approach has a higher chance of increasing the understanding of what genes or genetic elements are involved in the causality of these traits. Fine mapping could also improve the resolution of said QTL and may further increase the accuracy of selection models. In addition to fine mapping using higher marker densities, one could also employ transcriptomics to reveal the impact of the SNPs on the functional part of the genome. As has been observed in most RNAseq studies, not just one gene is differentially expressed, but thousands are [2]. And these genes can be analyzed for enrichment in biological processes correlated with the trait, giving more than just gene-level insights.

3.2. MAS and GS Prediction Accuracy

While QTL analysis can help understand the genetic basis underlying phenotypes, it also offers the potential for high throughput selection. Co-located QTL have been reported in these J. microcarpa × J. regia hybrids [6]; however, their ability to predict phenotypes was not assessed. We found a wide range in the ability of the selected markers to predict these phenotypes. Given that the predictors for each trait were largely the same, it seems likely that there are varying degrees of environmental or technical variation skewing the phenotypic data. For example, tree height, A. tumefaciens disease score (root and crown gall size), and Phytophthora spp. disease score (root and crown rot) are direct measures of the plant or plant response. Conversely, P. vulnus counts were generated by counting the number of nematodes per gram of root, an indirect measure of the host response. Perhaps the relationship between root lesions caused by P. vulnus and these QTL would be greater than the nematode count. Another reason for the large differences in prediction accuracy could be the genomic sequencing approach. Genotyping-by-sequencing was used to generate SNPs for this population. While this method is a powerful tool for SNP discovery, it has the drawback of potentially sampling the genome unevenly [12], thus missing critical markers for some traits. We observed that predictions using a MAS approach with fewer markers were better than the those found with the GS approach, which used all available markers with regularization (Figure 5 and Figure 6, Table 5 and Table 6). This is likely due to the presence of such a major QTL in each cross as MAS tends to outperform GS in such cases, while complex traits regulated by many QTL favors the use of GS [21,22].
While the predictive models have varying accuracies, they can be deployed in the same breeding population from which they were built to increase the efficiency of the artificial selection of the traits analyzed in this study. For example, utilizing these models to enhance rootstock selection for A. tumefaciens disease score and tree height would be the most effective given that they yielded the highest accuracies (Figure 5 and Table 5), and that their simulated selection effectiveness was the most significant (Figure 6 and Table 6). While two-year tree height yielded the highest accuracy overall, the three-year height data were more relevant, realistic, and prudent, as the prediction accuracy decreased substantially from two years to three years. For example, relying on a model based on the two-year data may overestimate the true accuracy of the model, as these trees develop and live much longer than two years.

4. Materials and Methods

4.1. Plant Material

Juglans microcarpa trees DJUG 31.01 and DJUG 31.09 (USDA National Clonal Germplasm Repository, Davis, https://www.ars.usda.gov/pacific-west-area/davis-ca/natl-clonal-germplasm-rep-tree-fruit-nut-crops-grapes/, accessed on 10 January 2020) were used as mother trees in hybridization with J. regia cv. Serr in the springs of 2012–2015. Female flowers were sealed in pollen-impermeable semi-porous bags (PBS 10-1, PBS International, Crawley, UK) prior to opening, using non-absorbent cotton (Custom Hospital, Clackamas, OR, USA) to cushion and seal the bags around the branch after male flowers had been removed. When female flowers began opening (at the stigma separation stage), previously collected cv. Serr pollen was injected into the bags using a syringe. The bags were removed 3–4 weeks after pollination and immature nuts were tagged. The nuts were collected while still immature and with intact hulls (July–August), and stored refrigerated (up to several weeks). Embryos were extracted from nuts, germinated in vitro, and micropropagated to produce clones for disease resistance testing” [6].

4.2. GBS and SNP Discovery

“Genomic DNA was isolated with the CTAB plant genomic DNA isolation method from micro-propagated F1 hybrids and J. microcarpa and cv. Serr parents. DNA was diluted to the uniform concentration of 55 ng/µL. GBS involved complexity reduction by PstI restriction enzyme digestion. Groups of 96 samples including both parents were multiplexed using unique barcodes, and single-end sequenced with the Illumina HiSeq2000 platform at the Genomic Diversity Facility of Institute of Biotechnology, Cornell University. SNPs were called with Tassel-GBS pipeline v2.0 using default settings except that a minimum mapping quality of 2 was used in SAMToGBSdbPlugin. Illumina reads were deconvoluted using the barcodes, and reads were filtered and trimmed to 64 bp to construct unique tags. The J. microcarpa Jm31.01_v1.0 and J. regia JrSerr_v1.0 genome sequences were combined into a single fasta file and used as a reference. Unique tags were aligned onto the reference sequences with the BWA-aln aligner. High-quality SNPs were selected among SNPs obtained from the Tassel-GBS pipeline with VCFtools, BCFtools, and in-house awk and Python scripts. Non-segregating SNPs and calls below a threshold of five reads were removed.
The following convention was implemented in naming the markers. Each name was started with that of the mapping population followed by the abbreviated names of the genome and chromosome, and ended with the location of the SNP in the registry of the corresponding JrSerr_v1.0 or Jm31.01_v1.0 pseudomolecule” [6].

4.3. Phenotypic Analysis

The phenotyping of these hybrids for resistance to A. tumefaciens and Phytophthora spp. has been described in detail [6]. Briefly, the phenotype used for A. tumefaciens disease score was a visually assessed rank on a scale of one (no gall) to four (complete girdling of the plant from gall). The phenotype used for Phytophthora spp. disease score was a visually assessed percentage of crown or root rotted after pathogen inoculation of either P. cinnamomi or P. pini. The average percentages of crown and root rot for P. cinnamomi and P. pini were entered into the analysis and were then referred to as “Phytophthora spp.”.
To phenotype hybrids for P. vulnus counts, at least six clonal saplings of each hybrid and commercial comparatives RX1 and VX211 [23,24] were planted in a series of field experiments from 2014 to 2018. The hybrids used in this study were propagated from tissue cultures and developed into saplings in greenhouse culture. At the Kearney Agricultural Research and Extension Center (36.6015° N, 119.5109° W), saplings were planted in rows of 3.35 m distance at 1.65 m spacing within the row. About one month after planting, every tree was concomitantly inoculated with ~1000 vermiform P. vulnus and second-stage juveniles (J2) of Meloidogyne incognita by dispensing infested field soil from underneath infected perennial crops at the base of the tree. Selected trees of the genotype groups were chosen for root collections for nematode evaluations. A 20–25 cm-deep trench was dug next to the tree to collect young roots of the respective tree genotype avoiding suberized roots. Kept cool in plastic bags until processing within 48 h of collection, the roots were chopped into 1.2 cm pieces and 20 g portions placed on top of Baermann funnels. In a mist chamber apparatus, the roots were intermittently sprayed with water at 27 °C for five days. After this, the extracts were collected and P. vulnus were identified and counted [25]. Nematode numbers were recorded on a per 1 g basis. In each dormant season, the height of each of the trees in these nematode field screens was measured from the ground level perpendicularly to the maximum height of the trees with an extended ruler. Thus, the tree height was measured under nematode-infested conditions.

4.4. Data Cleaning and Preparation

The genotype data for 31.01 and 31.09 and cv. Serr “Serr” analysis were cleaned using the snpReady package in R [26]. The data used for QTL mapping and marker-assisted selection/genomic selection were converted from “a” to 1 and “b” to 2. Using the raw.data() function, SNP data from the genotype matrix were cleaned by removing markers with an MAF of 0.1, a sweep sample rate of 0.5 or higher, a call rate of 0.95 or higher, and imputing data points with the “k-nearest markers” method. See the raw.data() function from the snpReady package for more details. The data used for QTL mapping was not imputed and was converted back to a/b coding before analysis.

4.5. QTL Mapping

We performed QTL mapping using the “QTL” package in R [27]. After loading the data as a cross object, we calculated the genotype probabilities and then performed a single QTL scan using the marker regression (mr) method. To further refine the mapping step, we performed permutation testing via 1000 permutations to calculate a significance threshold at alpha = 0.05 for each marker’s Logarithm of the Odds (LOD) score. These LOD scores and significance thresholds were used to generate the LOD plots (Figure 1 and Figure 2). To estimate the total variation explained by each QTL, we built a multiple regression model in R with all significant markers within the QTL and reported the adjusted R2 of the full model. We performed the same analysis for the top three markers.

4.6. Prediction and Selection Performance Estimation

RF and rrBLUP approaches were used on each trait in this study to assess the markers’ predictive power. To assess model accuracy for RF and rrBLUP approaches while lowering the risk of overfitting, we used 70% of the data for training and 30% for testing.
After evaluating different numbers of top markers and their prediction accuracies with the rfe function from the caret package in R, we opted to utilize the top eight markers as described in the results. The rrBLUP package utilizes all markers for prediction in a ridge regression approach, with critical differences over standard regression or random forest models. Please see the package information for more details [28].
For the RF approach, a random forest model was built on the training data and its top eight markers were extracted by mean decrease in impurity (MDI). Then, these markers were used as predictors to train a new random forest model for predicting the phenotypes of the testing data.
Prediction accuracy was assessed by calculating the Pearson correlation coefficient of the predicted vs. actual values of the testing data with a one-tailed alternative hypothesis predicting whether the correlation coefficient would be positive rather than negative. Marker selection and accuracy estimation was repeated 15 independent times, and the average p-value of these tests was reported. For selection performance, we used the same 70/30 training and testing splits to produce the predicted values. The testing values used for prediction were then arranged by their accompanying predicted values, and the top 25% of that data were extracted and defined as the “selected values”. These selected values were then compared to a “no selection” scenario in which all the testing values were kept. We used a Student’s t-test with a one-tailed alternative hypothesis predicting that the predicted values would be less (if disease score was being analyzed) or more (if tree height was being analyzed). This was repeated 15 independent times, and the average p-value of these tests was reported (Figure 6 and Table 6).

4.7. R Code and Availability

The R statistical software R version 4.4.0 (2024-04-24 ucrt) was used in Rstudio version 2023.6.0.421 to facilitate all statistical analysis and visualization produced in this study [29,30]. The following R packages were used:
caret [31], data.table [32], dplyr [33], ggplot [34], ggprism [35], ggtext [36], kableExtra [37], knitr [38], OmicsAnalyst [39], openxlsx [40], qtl [27], randomForest [41], rmarkdown [42], rrBLUP [28], sjPlot [43], snpReady [26], stringr [44], tibble [45], tidytable [46], tidytext [47], tidyr [48], webshot2 [49].
A repository for the code used in this study can be found at [https://github.com/hsaxe/Walnut_Root_QTL] (accessed on 1 January 2025).

5. Conclusions

This study provided further insights into the genetics of walnut root diseases. The co-location of tree height and the root system disease traits of nematode count, Phytophthora disease score, and crown gall disease score to a single QTL is a novel finding worth further investigation. It begs the question of how a single QTL could regulate many disease-related traits and tree height. Moreover, we provided actionable knowledge by leveraging these QTL to predict these traits using just the SNPs. Such knowledge provides the potential to improve the breeding efficiency of walnut rootstocks. Furthermore, fine mapping of this region could yield further benefits to both outcomes of this study. Fine mapping also paves the way for an optimized MAS pipeline by narrowing down the best markers to use for advanced selection of future, unphenotyped progeny. Lastly, this GBS data contain other variation that is not explained by any of the traits reported here. Therefore, the GBS data can serve as a resource to be further mined for QTL for drought resistance, blackline, or other agronomic traits of interest in walnut roots.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26030903/s1.

Author Contributions

The study was designed by H.J.S. and A.M.D. H.J.S. performed QTL mapping and GS/MAS analysis. C.A.L. provided plant data, P.J.B. the SNP information, phenotypic data was provided by A.W., D.A.K. and G.T.B. H.J.S. and A.M.D. generated the first draft of the paper. All authors read, edited, and approved the final draft. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by USDA NIFA-SCRI No. 59-5306-2-333 and USDA NIFA-SCRI Grant No. 2018-51181-28437 and grants from the California Walnut Board.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Genotype data were obtained by downloading from Supplementary Table S2 of [6]. The phenotypic data used in this study are available as Supplementary Data (Files S5–S7).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. California Agricultural Statistics Review. California Agricultural Statistics Review 2020–2021. 2022. Available online: https://www.cdfa.ca.gov/Statistics/PDFs/2022_Ag_Stats_Review.pdf (accessed on 17 January 2025).
  2. Saxe, H.J.; Walawage, S.L.; Balan, B.; Leslie, C.A.; Brown, P.J.; Browne, G.T.; Kluepfel, D.A.; Westphal, A.; Dandekar, A.M. Transcriptomic Evidence of a Link between Cell Wall Biogenesis, Pathogenesis, and Vigor in Walnut Root and Trunk Diseases. Int. J. Mol. Sci. 2024, 25, 931. [Google Scholar] [CrossRef] [PubMed]
  3. Kluepfel, D.; McClean, J.; Leslie, C.A.; Aradhya, M.; Luo, M.; Brown, P.J.; Ramasamy, R.K.; Luo, M.C.; Haisey, D.; Jan, D.; et al. Putting Phenotypic and Genotypic Tools to Work for Improving Walnut Rootstocks; Walnut Research Reports 2019; California Walnut Board: Folsom, CA, USA, 2020; pp. 63–74. [Google Scholar]
  4. Conson, A.R.O.; Taniguti, C.H.; Amadeu, R.R.; Andreotti, I.A.A.; de Souza, L.M.; dos Santos, L.H.B.; Rosa, J.R.B.F.; Mantello, C.C.; da Silva, C.C.; Junior, E.J.S.; et al. High-Resolution Genetic Map and QTL Analysis of Growth-Related Traits of Hevea brasiliensis Cultivated Under Suboptimal Temperature and Humidity Conditions. Front. Plant Sci. 2018, 9, 1255. [Google Scholar] [CrossRef] [PubMed]
  5. Du, C.; Sun, P.; Cheng, X.; Zhang, L.; Wang, L.; Hu, J. QTL mapping of drought-related traits in the hybrids of Populus deltoides ‘Danhong’ × Populus simonii ‘Tongliao1’. BMC Plant Biol. 2022, 22, 238. [Google Scholar] [CrossRef]
  6. Ramasamy, R.K.; Luo, M.C.; Leslie, C.A.; Velasco, D.; Ott, N.; McClean, A.; Dandekar, A.M.; Aradhya, M.; Brown, P.J.; Browne, G.T.; et al. Co-located quantitative trait loci mediate resistance to Agrobacterium tumefaciens disease score, Phytophthora cinnamomi, and P. pini in Juglans microcarpa × J. regia hybrids. Hortic. Res. 2021, 8, 111. [Google Scholar] [CrossRef] [PubMed]
  7. Hall, D.; Hallingbäck, H.R.; Wu, H.X. Estimation of number and size of QTL effects in forest tree traits. Tree Genet. Genomes 2016, 12, 110. [Google Scholar] [CrossRef]
  8. Wang, D.; Yang, L.; Shi, C.; Li, S.; Tang, H.; He, C.; Cai, N.; Duan, A.; Gong, H. QTL mapping for growth-related traits by constructing the first genetic linkage map in Simao pine. BMC Plant Biol. 2022, 22, 48. [Google Scholar] [CrossRef] [PubMed]
  9. De Mori, G.; Cipriani, G. Marker-Assisted Selection in Breeding for Fruit Trait Improvement: A Review. Int. J. Mol. Sci. 2023, 24, 8984. [Google Scholar] [CrossRef] [PubMed]
  10. Karaagac, E.; Vargas, A.M.; de Andrés, M.T.; Carreño, I.; Ibáñez, J.; Carreño, J.; Miguel Martínez-Zapater, J.; Cabezas, J.A. Marker assisted selection for seedlessness in table grape breeding. Tree Genet. Genomes 2012, 8, 1003–1015. [Google Scholar] [CrossRef]
  11. Kumar, A.; Sandhu, N.; Dixit, S.; Yadav, S.; Swamy, B.P.M.; Shamsudin, N.A.A. Marker-assisted selection strategy to pyramid two or more QTLs for quantitative trait-grain yield under drought. Rice 2018, 11, 35. [Google Scholar] [CrossRef]
  12. Fu, Y.B.; Peterson, G.W.; Dong, Y. Increasing Genome Sampling and Improving SNP Genotyping for Genotyping-by-Sequencing with New Combinations of Restriction Enzymes. G3 2016, 6, 845–856. [Google Scholar] [CrossRef]
  13. Xiao, Y.; Liu, H.; Wu, L.; Warburton, M.; Yan, J. Genome-wide association studies in maize: Praise and stargaze. Mol. Plant 2017, 10, 359–374. [Google Scholar] [CrossRef] [PubMed]
  14. Bedo, J.; Wenzl, P.; Kowalczyk, A.; Kilian, A. Precision-mapping and statistical validation of quantitative trait loci by machine learning. BMC Genet. 2008, 9, 35. [Google Scholar] [CrossRef] [PubMed]
  15. Koc, A.; Odilbekov, F.; Alamrani, M.; Henriksson, T.; Chawade, A. Predicting yellow rust in wheat breeding trials by proximal phenotyping and machine learning. Plant Methods 2022, 18, 30. [Google Scholar] [CrossRef]
  16. Zhang, B.; Gao, Y.; Zhang, L.; Zhou, Y. The plant cell wall: Biosynthesis, construction, and functions. J. Integr. Plant Biol. 2021, 63, 251–272. [Google Scholar] [CrossRef] [PubMed]
  17. Menna, A.; Dora, S.; Sancho-Andrés, G.; Kashyap, A.; Meena, M.K.; Sklodowski, K.; Gasperini, D.; Coll, N.S.; Sánchez-Rodríguez, C. A primary cell wall cellulose-dependent defense mechanism against vascular pathogens revealed by time-resolved dual transcriptomics. BMC Biol. 2021, 19, 161. [Google Scholar] [CrossRef] [PubMed]
  18. Roudier, F.; Fernandez, A.G.; Fujita, M.; Himmelspach, R.; Borner, G.H.; Schindelman, G.; Song, S.; Baskin, T.I.; Dupree, P.; Wasteneys, G.O.; et al. COBRA, an Arabidopsis extracellular glycosyl-phosphatidyl inositol-anchored protein, specifically controls highly anisotropic expansion through its involvement in cellulose microfibril orientation. Plant Cell 2005, 17, 1749–1763. [Google Scholar] [CrossRef]
  19. Nicol, F.; His, I.; Jauneau, A.; Vernhettes, S.; Canut, H.; Höfte, H. A plasma membrane-bound putative endo-1,4-β-D-glucanase is required for normal wall assembly and cell elongation in Arabidopsis. EMBO J. 1998, 17, 5563–5576. [Google Scholar] [CrossRef]
  20. Hu, H.; Zhang, R.; Tao, Z.; Li, X.; Li, Y.; Huang, J.; Li, X.; Han, X.; Feng, S.; Zhang, G.; et al. Cellulose Synthase Mutants Distinctively Affect Cell Growth and Cell Wall Integrity for Plant Biomass Production in Arabidopsis. Plant Cell Physiol. 2018, 59, 1144–1157. [Google Scholar] [CrossRef]
  21. Zhao, Y.; Mette, M.; Gowda, M.; Longin, C.F.H.; Reif, J.C. Bridging the gap between marker-assisted and genomic selection of heading time and plant height in hybrid wheat. Heredity 2014, 112, 638–645. [Google Scholar] [CrossRef]
  22. Cerrudo, D.; Cao, S.; Yuan, Y.; Martinez, C.; Suarez, E.A.; Babu, R.; Zhang, X.; Trachsel, S. Genomic Selection Outperforms Marker Assisted Selection for Grain Yield and Physiological Traits in a Maize Doubled Haploid Population Across Water Treatments. Front. Plant Sci. 2018, 9, 366. [Google Scholar] [CrossRef]
  23. McGranahan, G.; Browne, G.; Leslie, C.; Hackett, W.; McKenna, M. Walnut Rootstock ‘RX1’. US Plant Patent No. 20,649, 19 January 2010. [Google Scholar]
  24. McGranahan, G.; Leslie, C.; Hackett, W.; Browne, G.; McKenna, J.; Buzo, T.; Kaku, S.; McKenry, M. Walnut Rootstock ‘VX211’. US Plant Patent No. 21,179, 3 August 2010. [Google Scholar]
  25. Buzo, T.; Mckenna, J.; Kaku, S.; Anwar, S.A.; McKenry, M.V. VX211, A vigorous new walnut hybrid clone with nematode tolerance and a useful resistance mechanism. J. Nematol. 2009, 41, 211–216. [Google Scholar] [PubMed]
  26. Granato, I.; Fritsche-Neto, R. snpReady: Preparing Genotypic Datasets in Order to Run Genomic Analysis. R Package Version 0.9.6, 2018. Available online: https://cran.r-project.org/web/packages/snpReady (accessed on 1 January 2024).
  27. Broman, K.W.; Wu, H.; Sen, Ś.; Churchill, G.A. R/qtl: QTL mapping in experimental crosses. Bioinformatics 2003, 19, 889–890. [Google Scholar] [CrossRef]
  28. Endelman, J.B. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 2011, 4, 255–258. [Google Scholar] [CrossRef]
  29. Posit Team. RStudio: Integrated Development Environment for R, Posit Software version 2023.6.0.421; PBC: Boston, MA, USA, 2023; Available online: http://www.posit.co/ (accessed on 1 January 2024).
  30. R Core Team. R: A Language and Environment for Statistical Computing R version 4.4.0 (2024-04-24 ucrt); R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: https://www.R-project.org/ (accessed on 2 July 2024).
  31. Kuhn, M. Building Predictive Models in R Using the caret Package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
  32. Dowle, M.; Srinivasan, A. data.table: Extension of data.frame. 2022. Available online: https://cran.r-project.org/web/packages/data.table (accessed on 2 July 2024).
  33. Wickham, H.; François, R.; Henry, L.; Müller, K. dplyr: A Grammar of Data Manipulation. 2022. Available online: https://cran.r-project.org/web/packages/dplyr (accessed on 2 July 2024).
  34. Wickham, H.; Chang, W.; Henry, L.; Pedersen, T.L.; Takahashi, K.; Wilke, C.; Woo, K.; Yutani, H.; Dunnington, D.; Brand, T.v.D. ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics. 2022. Available online: https://cran.r-project.org/web/packages/ggplot2 (accessed on 2 July 2024).
  35. Dawson, C. ggprism: A ‘ggplot2’ Extension Inspired by ‘GraphPad Prism’. 2024. Available online: https://csdaw.github.io/ggprism/ (accessed on 2 July 2024).
  36. Wilke, C.; Wiernik, B. ggtext: Improved Text Rendering Support for ’ggplot2’. R Package Version 0.1.2. 2022. Available online: https://cran.r-project.org/web/packages/ggtext (accessed on 2 July 2024).
  37. Zhu, H. kableExtra: Construct Complex Table with ‘kable’ and Pipe Syntax. R Package Version 1.4.0. 2024. Available online: https://cran.r-project.org/web/packages/kableExtra (accessed on 2 July 2024).
  38. Xie, Y. knitr: A General-Purpose Package for Dynamic Report Generation in R. 2022. Available online: https://yihui.org/knitr/ (accessed on 2 July 2024).
  39. Saxe, H. OmicsAnalyst: Functions to Facilitate the Statistical Analysis of Omics Data. 2023. Available online: https://github.com/hsaxe/Omics-Analyst (accessed on 2 July 2024).
  40. Schauberger, P.; Walker, A. openxlsx: Read, Write and Edit xlsx Files. R Package Version 4.2.5.2. 2023. Available online: https://cran.r-project.org/web/packages/openxlsx (accessed on 2 July 2024).
  41. Liaw, A.; Wiener, M. Classification and Regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  42. Allaire, J.; Xie, Y.; Dervieux, C.; McPherson, J.; Luraschi, J.; Ushey, K.; Atkins, A.; Wickham, H.; Cheng, J.; Chang, W. rmarkdown: Dynamic Documents for R. 2022. Available online: https://cran.r-project.org/web/packages/rmarkdown (accessed on 2 July 2024).
  43. Lüdecke, D. sjPlot: Data Visualization for Statistics in Social Science. 2022. Available online: https://strengejacke.github.io/sjPlot/ (accessed on 2 July 2024).
  44. Wickham, H. stringr: Simple, Consistent Wrappers for Common String Operations. 2022. Available online: https://cran.r-project.org/web/packages/stringr (accessed on 2 July 2024).
  45. Müller, K.; Wickham, H. tibble: Simple Data Frames. 2022. Available online: https://cran.r-project.org/web/packages/tibble (accessed on 2 July 2024).
  46. Fairbanks, M. tidytable: Tidy Interface to data.table. 2022. Available online: https://github.com/markfairbanks/tidytable (accessed on 2 July 2024).
  47. Robinson, D.; Silge, J. tidytext: Text Mining using dplyr, ggplot2, and Other Tidy Tools. 2022. Available online: https://github.com/juliasilge/tidytext (accessed on 2 July 2024).
  48. Wickham, H.; Girlich, M. tidyr: Tidy Messy Data. 2022. Available online: https://cran.r-project.org/web/packages/tidyr (accessed on 2 July 2024).
  49. Chang, W. webshot2: Take Screenshots of Web Pages. R Package Version 0.1.1. 2023. Available online: https://cran.r-project.org/web/packages/webshot2 (accessed on 2 July 2024).
Figure 1. LOD plots for each of the six traits mapped to 31.01 and cv. Serr haplotypes. Chromosome numbers are on the x-axis, and LOD score is on the y-axis. The horizontal dashed line indicates the threshold above which LOD scores are significant.
Figure 1. LOD plots for each of the six traits mapped to 31.01 and cv. Serr haplotypes. Chromosome numbers are on the x-axis, and LOD score is on the y-axis. The horizontal dashed line indicates the threshold above which LOD scores are significant.
Ijms 26 00903 g001
Figure 2. LOD plots for each of the six traits mapped to the 31.09 and cv. Serr haplotypes. Chromosome numbers are on the x-axis, and LOD score is on the y-axis. The horizontal dashed line indicates the threshold above which LOD scores are significant.
Figure 2. LOD plots for each of the six traits mapped to the 31.09 and cv. Serr haplotypes. Chromosome numbers are on the x-axis, and LOD score is on the y-axis. The horizontal dashed line indicates the threshold above which LOD scores are significant.
Ijms 26 00903 g002
Figure 3. Scatter plot of Principal Component Analysis (PCA) loadings of all Juglans microcarpa 31.01 × cv. Serr markers discovered by GBS. The plot is facetted by parental haplotype. Each point represents a marker. All points and ellipses are colored and labeled by their chromosome of origin. Ellipses are labeled by chromosome of origin in black.
Figure 3. Scatter plot of Principal Component Analysis (PCA) loadings of all Juglans microcarpa 31.01 × cv. Serr markers discovered by GBS. The plot is facetted by parental haplotype. Each point represents a marker. All points and ellipses are colored and labeled by their chromosome of origin. Ellipses are labeled by chromosome of origin in black.
Ijms 26 00903 g003
Figure 4. Scatter plot of Principal Component Analysis (PCA) loadings of all Juglans microcarpa 31.09 × cv. Serr markers discovered by GBS. The plot is facetted by parental haplotype. Each point represents a marker. All points and ellipses are colored and labeled by their chromosome of origin. Ellipses are labeled by chromosome of origin in black.
Figure 4. Scatter plot of Principal Component Analysis (PCA) loadings of all Juglans microcarpa 31.09 × cv. Serr markers discovered by GBS. The plot is facetted by parental haplotype. Each point represents a marker. All points and ellipses are colored and labeled by their chromosome of origin. Ellipses are labeled by chromosome of origin in black.
Ijms 26 00903 g004
Figure 5. Prediction accuracy (r) and their significance for each trait assessed by Pearson correlation of model-predicted vs. actual values. There are four boxplots per trait representing two selection methods by two mother parents. The two methods tested were random forest (RF) from the caret package in R and functions from the rrBLUP (rrBLUP) package also from R. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01.
Figure 5. Prediction accuracy (r) and their significance for each trait assessed by Pearson correlation of model-predicted vs. actual values. There are four boxplots per trait representing two selection methods by two mother parents. The two methods tested were random forest (RF) from the caret package in R and functions from the rrBLUP (rrBLUP) package also from R. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01.
Ijms 26 00903 g005
Figure 6. Selection performance by trait, method, and female parent. Boxplots represent phenotypic values (y-axis) of traits comparing t-test estimates of using selection vs. no selection. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01, “.” Average p-value between 0.01 and 0.1. Dots above and below boxplots represent outliers.
Figure 6. Selection performance by trait, method, and female parent. Boxplots represent phenotypic values (y-axis) of traits comparing t-test estimates of using selection vs. no selection. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01, “.” Average p-value between 0.01 and 0.1. Dots above and below boxplots represent outliers.
Ijms 26 00903 g006
Table 1. All significant markers from QTL mapping on the 31.01 × cv. Serr progeny with LOD threshold α = 0.05. This defines the QTL. Length is QTL length in base pairs. Markers are the number of unique markers within the QTL.
Table 1. All significant markers from QTL mapping on the 31.01 × cv. Serr progeny with LOD threshold α = 0.05. This defines the QTL. Length is QTL length in base pairs. Markers are the number of unique markers within the QTL.
TraitChromosomeUnique MarkersLength (bp)Percent Variance Explained
Two-year P. vulnus Counts4D3112,109,44713.9%
Three-year P. vulnus Counts4D175,298,33413.1%
Two-year Tree Height4D6530,660,99470.3%
Three-year Tree Height4D6530,660,99441.4%
Three-year Tree Height6D143,493,7934.8%
A. tumefaciens Disease Score4D5930,060,17945.4%
Phytophthora spp. Disease Score4D3821,190,69521.0%
Phytophthora spp. Disease Score6D33,720,2186.1%
Table 2. Top markers by LOD from QTL mapping on the 31.01 × cv. Serr progeny.
Table 2. Top markers by LOD from QTL mapping on the 31.01 × cv. Serr progeny.
TraitMarker(s)ChromosomeLODLength (bp)Percent Variance Explained
Two-year P. vulnus Counts31.01_Jm4D_266690754D9.8015.7%
Three-year P. vulnus Counts31.01_Jm4D_266690754D7.6012.9%
Two-year Tree Height31.01_Jm4D_266690754D44.9067.1%
Three-year Tree Height31.01_Jm4D_266690754D22.2033.8%
A. tumefaciens Disease Score31.01_Jm4D_261686434D28.4500,43244.1%
A. tumefaciens Disease Score31.01_Jm4D_263591544D28.4500,43244.1%
A. tumefaciens Disease Score31.01_Jm4D_266690754D28.4500,43244.1%
Phytophthora spp. Disease Score31.01_Jm4D_261686434D11.1500,43218.9%
Phytophthora spp. Disease Score31.01_Jm4D_263591544D11.1500,43218.9%
Phytophthora spp. Disease Score31.01_Jm4D_266690754D11.1500,43218.9%
Table 3. All significant markers from QTL mapping on the 31.09 × cv. Serr progeny with LOD threshold α = 0.05.
Table 3. All significant markers from QTL mapping on the 31.09 × cv. Serr progeny with LOD threshold α = 0.05.
TraitChromosomeUnique MarkersLength (bp)Percent Variance Explained
Two-year P. vulnus Counts3D2372,3717.0%
Two-year P. vulnus Counts4D354,9168.2%
Three-year P. vulnus Counts1D3924,916,2998.5%
Three-year P. vulnus Counts4D149,378,08516.7%
Two-year Tree Height4D3121,274,99320.1%
Three-year Tree Height4D1910,480,96117.2%
A. tumefaciens Disease Score4D5931,696,22239.7%
A. tumefaciens Disease Score7D107.0%
Phytophthora spp. Disease Score4D4629,549,28417.5%
Table 4. Top markers by LOD from QTL mapping on the 31.09 × cv. Serr progeny.
Table 4. Top markers by LOD from QTL mapping on the 31.09 × cv. Serr progeny.
TraitMarker(s)ChromosomeLODLength (bp)Percent Variance Explained
Two-year P. vulnus Counts31.09_Jm4D_239062004D3.954,9168.2%
Two-year P. vulnus Counts31.09_Jm4D_239609544D3.954,9168.2%
Two-year P. vulnus Counts31.09_Jm4D_239611164D3.954,9168.2%
Three-year P. vulnus Counts31.09_Jm4D_263591544D7.1017.3%
Two-year Tree Height31.09_Jm4D_251019684D6.2023.5%
Three-year Tree Height31.09_Jm4D_263591544D7.5017.1%
A. tumefaciens Disease Score31.09_Jm4D_263591544D18.9036.9%
Phytophthora spp. Disease Score31.09_Jm4D_238162624D11.0443,00217.6%
Phytophthora spp. Disease Score31.09_Jm4D_242592644D11.0443,00217.6%
Table 5. Prediction accuracies (r) and their significance for each trait assessed by Pearson correlation of model-predicted vs. actual values. Each value in the table is the average correlation coefficient representing prediction accuracy estimates. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01.
Table 5. Prediction accuracies (r) and their significance for each trait assessed by Pearson correlation of model-predicted vs. actual values. Each value in the table is the average correlation coefficient representing prediction accuracy estimates. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01.
MethodFemale ParentA. tumefaciens
Disease Score
Phytophthora spp.
Disease Score
Three-Year
Tree Height
Three-Year
P. vulnus
Counts
Two-Year
Tree Height
Two-Year
P. vulnus
Counts
RF31.010.67 ***0.49 **0.59 ***0.33 **0.84 ***0.35 **
RF31.090.58 ***0.38 **0.31 *0.34 *0.46 *0.19
rrBLUP31.010.46 ***0.36 **0.52 ***0.24 0.73 ***0.25 *
rrBLUP31.090.42 **0.25 *0.37 *0.31 *0.51 *0.19
Table 6. Selection performance by trait, method, and female parent. Each cell represents average phenotypic values of traits comparing t-test estimates of using selection vs. no. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01, “.” Average p-value between 0.01 and 0.1.
Table 6. Selection performance by trait, method, and female parent. Each cell represents average phenotypic values of traits comparing t-test estimates of using selection vs. no. Stars indicate the average p-value for each boxplot of correlations. “***” average p-value less than 0.001, “**” average p-value between 0.01 and 0.001, “*” average p-value between 0.05 and 0.01, “.” Average p-value between 0.01 and 0.1.
StatMethodFemale ParentA. tumefaciens
Disease Score
Phytophthora spp.
Disease Score
Three-Year
Tree Height
Three-Year
P. vulnus
Counts
Two-Year
Tree Height
Two-Year
P. vulnus
Counts
SelectionRF31.011.54 ***25.8 *134 .30.1 *127 ***5.02 .
No SelectionRF31.012.36 ***37.9 *112 .59.3 *89.3 ***12.5 .
SelectionRF31.091.5 ***16.5 *113 46.7 .98.4 *13.9
No SelectionRF31.092.32 ***26.4 *99.3 94.1 .70.6 *19.4
SelectionrrBLUP31.011.72 **31 143 *40.3 124 ***6.47
No SelectionrrBLUP31.012.36 **37.9 112 *59.3 89.3 ***12.8
SelectionrrBLUP31.091.84 .19.9 .120 .54.4 86.6 13.5
No SelectionrrBLUP31.092.32 .26.4 .99.3 .94.1 70.6 19.6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saxe, H.J.; Leslie, C.A.; Brown, P.J.; Westphal, A.; Kluepfel, D.A.; Browne, G.T.; Dandekar, A.M. Co-Location of QTL for Vigor and Resistance to Three Diseases in Juglans microcarpa × J. regia Rootstocks. Int. J. Mol. Sci. 2025, 26, 903. https://doi.org/10.3390/ijms26030903

AMA Style

Saxe HJ, Leslie CA, Brown PJ, Westphal A, Kluepfel DA, Browne GT, Dandekar AM. Co-Location of QTL for Vigor and Resistance to Three Diseases in Juglans microcarpa × J. regia Rootstocks. International Journal of Molecular Sciences. 2025; 26(3):903. https://doi.org/10.3390/ijms26030903

Chicago/Turabian Style

Saxe, Houston J., Charles A. Leslie, Patrick J. Brown, Andreas Westphal, Daniel A. Kluepfel, Gregory T. Browne, and Abhaya M. Dandekar. 2025. "Co-Location of QTL for Vigor and Resistance to Three Diseases in Juglans microcarpa × J. regia Rootstocks" International Journal of Molecular Sciences 26, no. 3: 903. https://doi.org/10.3390/ijms26030903

APA Style

Saxe, H. J., Leslie, C. A., Brown, P. J., Westphal, A., Kluepfel, D. A., Browne, G. T., & Dandekar, A. M. (2025). Co-Location of QTL for Vigor and Resistance to Three Diseases in Juglans microcarpa × J. regia Rootstocks. International Journal of Molecular Sciences, 26(3), 903. https://doi.org/10.3390/ijms26030903

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop