Next Article in Journal
Special Issue: Lipid Metabolism, Adipogenesis and Fat Tissue Metabolism: Gene Regulation
Previous Article in Journal
Association of Surfactant Protein D Single Nucleotide Polymorphisms rs721917, rs2243639, rs3088308 with Recurrent Aphthous Stomatitis in Pakistani Population
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeqTM DNA Signature Prep

by
José Alonso Aguilar-Velázquez
1,2,
Blanca Jeannete Llamas-de-Dios
3,
Miranda Fabiola Córdova-Mercado
4,
Carolina Elena Coronado-Ávila
4,
Orlando Salas-Salas
4,
Andrés López-Quintero
5,
Benito Ramos-González
4 and
Héctor Rangel-Villalobos
6,*
1
Instituto de Investigación en Ciencias Biomédicas, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
2
Departamento de Morfología, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
3
Licenciatura en Ciencias Forenses, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
4
Instituto de Criminalística y Servicios Periciales, Fiscalía General de Justicia del Estado de Nuevo León (FGJNL), Monterrey 64720, Nuevo León, Mexico
5
Instituto de Nutrigenética y Nutrigenómica Traslacional, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara (CUCS-UdeG), Guadalajara 44340, Jalisco, Mexico
6
Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara (CUCI-UdeG), Ocotlán 47820, Jalisco, Mexico
*
Author to whom correspondence should be addressed.
Genes 2023, 14(5), 1120; https://doi.org/10.3390/genes14051120
Submission received: 1 May 2023 / Revised: 14 May 2023 / Accepted: 18 May 2023 / Published: 22 May 2023
(This article belongs to the Section Human Genomics and Genetic Diseases)

Abstract

:
Forensic genomic systems allow simultaneously analyzing identity informative (iiSNPs), ancestry informative (aiSNPs), and phenotype informative (piSNPs) genetic markers. Among these kits, the ForenSeq DNA Signature prep (Verogen) analyzes identity STRs and SNPs as well as 24 piSNPs from the HIrisPlex system to predict the hair and eye color. We report herein these 24 piSNPs in 88 samples from Monterrey City (Northeast, Mexico) based on the ForenSeq DNA Signature prep. Phenotypes were predicted by genotype results with both Universal Analysis Software (UAS) and the web tool of the Erasmus Medical Center (EMC). We observed predominantly brown eyes (96.5%) and black hair (75%) phenotypes, whereas blue eyes, and blond and red hair were not observed. Both UAS and EMC showed high performance in eye color prediction (p ≥ 96.6%), but a lower accuracy was observed for hair color prediction. Overall, UAS hair color predictions showed better performance and robustness than those obtained with the EMC web tool (when hair shade is excluded). Although we employed a threshold (p > 70%), we suggest using the EMC enhanced approach to avoid the exclusion of a high number of samples. Finally, although our results are helpful to employ these genomic tools to predict eye color, caution is suggested for hair color prediction in Latin American (admixed) populations such as those studied herein, principally when no black color is predicted.

1. Introduction

In the last several decades, short tandem repeat (STR) loci have become the golden standard in DNA forensic casework. Nevertheless, STR systems can be ineffectual in some situations as in low-quality or quantity samples, resulting in a missing or incomplete STR profile for human identification (HID) purposes. Single nucleotide polymorphisms (SNPs) have been targeted for analysis to address these forensic issues, given that the shorter amplicon length generated with SNPs increases the analytical success of degraded DNA samples [1]. Additionally, SNPs provide a research tool to infer some individual traits, such as physical appearance and biogeographic origin. This is relevant in the context of decedents whose remains may offer a poor visual indication of their physical features, such as in cases of advanced decomposition or mass disasters [2], especially when DNA profiles are absent from available databases [1]. Thus, information derived from phenotype-informative SNPs (piSNPs) and ancestry-informative SNPs (aiSNPs) can be essential in cold cases for the HID process [3].
The first practical tool to infer eye color phenotypes (IrisPlex) was developed in 2011 [4] and, subsequently, it was expanded to infer both eye and hair coloration (HIrisPlex) [5]. The system showed >94% prediction accuracy for blue and brown eye colors, while for intermediate eye color was 77%; for hair color, the average prediction accuracy for blond, brown, red, and black were 69.5, 78.5, 80, and 87.5%, respectively [5]. Massively parallel sequencing (MPS) is an emerging tool in forensic science that is particularly beneficial for overcoming the shortfalls of STR profiling [6,7]. This technology accomplishes the simultaneous analysis of STRs, identity-informative SNPs (iiSNPs), phenotype informative (piSNPs) [8], and ancestry informative (aiSNPs) [9], as well as mitochondrial DNA analysis [10,11]. Among the forensic MPS systems, the ForenSeqTM DNA Signature Prep kit (Verogen, San Diego, CA, USA) contains two separated DNA primer sets: (1) DNA Primer Set-A (DPS-A), targets and amplifies 27 autosomal STRs (aSTRs), 7 X-chromosome STRs (X-STRs), 24 Y-chromosome STRs (Y-STRs), and 94 iiSNPs, and; (2) DNA Primer Set-B (DPS-B), targets all those markers included in DPS-A plus 56 aiSNPs and 22 piSNPs [12]. The piSNPs of the DPS-B are based on the markers included in the HIrisPlex.
The DPS-A of this forensic MPS system has been analyzed worldwide [13,14,15,16,17,18,19,20]. Conversely, scarce population studies based on DPS-B have been reported; they are limited to the Norwegian population [21], Yavapai ethnic group [22], and the four major ethnic groups from the USA [23]. To our best knowledge, this task has not been conducted in Latin American or Mestizo (admixed) populations, which is of great interest because of their complex population structure, resulting from ~500 years of admixture since the European colonization of the Americas. In the Mexican population, their major ancestral contributors involve Spaniards, Native Americans, and—to a lesser extent—African slaves. An ancestral gradient along the Mexican territory has been reported for Mestizos: the European ancestry shows a decreasing North-to-South gradient, whereas the Native American ancestry displays the contrary decreasing South-to-North pattern. Thus, the southeast of Mexico—also known as Mesoamerica in the pre-Columbian period—shows the highest Native American ancestry [24].
Although iris color prediction and ancestry inference has been carried out in European and South American populations [25], it is necessary to predict the accuracy of forensic MPS platforms in admixed populations with complex genetic structures. Particularly in Mexican Mestizos, where brown and black coloration predominates for both eye and hair phenotypes, respectively. In this paper, we assessed—for the first time in a Latin American population—the accuracy of eye and hair color based on the ForenSeq™ DNA Signature Prep DPS-B in the Mexican Mestizo population from Monterrey City (Northeast, Mexico), which is the second main economic, cultural, and political metropolis of this country.

2. Materials and Methods

2.1. Population Sample

Previously, we reported the forensic parameters of the genetic markers included in the DPS-A of 105 unrelated residents of Monterrey City in the Nuevo Leon state (Northeast, Mexico) [19]. In this study, most of these samples were analyzed with the DPS-B to infer eye and hair color (n = 88). Seventeen samples were excluded from the original sample set due to the impossibility of phenotype observation (individuals with dyed hair or baldness). For the 88 samples, reported information on the sample donor regarding hair color, eye color, birthplace, and ancestry was provided in the informed consent. All volunteers signed an informed consent form before their inclusion in the study, according to the Helsinki Declaration Ethical Guidelines. This work was approved by the Ethical Research Committee at the Institute of Criminalistics and Forensic Services of the Attorney General of the Nuevo Leon state (Project number assigned: IC-017-2019).

2.2. Eye and Hair Color Phenotype Observed

A high-resolution photograph was taken of all volunteers to establish confident eye and hair color phenotypes. For this purpose, we used a Canon Eos Rebel T6 camera. For qualitative determination of eye and hair colors, two individuals were separately asked to intuitively assign each photograph to one of the following phenotype categories: (1) eye color: blue, intermediate, or brown, where intermediate color was assigned to those neither blue nor brown; (2) hair color, according to the enhanced prediction method: black, dark brown/black, brown/dark brown, brown, dark blond/brown, blond/dark blond, blond, and red; and (3) hair color, according to the original HIrisPlex method: blond, red, brown, or black. The coincidence between interpreters to assign the volunteers’ phenotypes simplified to establish a final color designation in the total sample. The criteria of color categories were taken based on the IrisPlex and HIrisPlex models. We found predominantly brown and black colors for both eye and hair phenotypes as described below.

2.3. DNA Extraction Method

DNA was obtained from peripheral blood using the Prep-Filer Express BTA™ Forensic DNA extraction kit. For this purpose, the AutoMate Express DNA extraction system was used according to the supplier’s instructions (Applied Biosystems, Foster City, CA, USA). Next, DNA was quantified with the Quantifiler® Trio DNA quantification kit in a 7500 Applied Biosystems Real-Time PCR system (Applied Biosystems, Foster City, CA, USA).

2.4. Massive Parallel Sequencing (MPS) Method

Libraries were generated using the DPS-B of the ForenSeq™ DNA Signature Prep kit (Verogen®, San Diego, CA, USA). Library preparation involved PCR to amplify the DNA targets (piSNPs and aiSNPs) and incorporate dual-indexed adaptors [12]. The libraries were normalized and then pooled. A total of 12–32 samples were pooled on each run. The library pool was diluted, denatured, and then added to the MiSeq FGx™ reagent standard and micro kits for cluster generation on the flow cell according to manufacturer recommendations. Sequencing was conducted following the procedures outlined in the MiSeq FGx™ Instrument Reference Guide [14]. After each sequencing run was completed, a post-run wash was performed. We carried out five sequencing runs, two of them with 12 samples on the Microflow cell and three runs with 32 samples on the Standard Flow cell, including positive and negative amplification controls in each run. However, 17 samples were excluded as mentioned above. Sequencing results were analyzed with the Universal Analysis Software v1.3 (UAS) provided by the manufacturer and using its default parameters for variant calling and then retrieved and downloaded in an Excel sheet.

2.5. Data Analysis

Genotype data generated by the MiSeq FGx™ instrument was processed and analyzed using Verogen’s Universal Analysis Software v1.3 (UAS). Then, genetic data were downloaded in Microsoft Excel format. Allelic frequencies, observed heterozygosity (Ho), expected heterozygosity (He), Hardy–Weinberg equilibrium (HWE), and linkage disequilibrium (LD) were evaluated with the Excel complement GenALEx v6.5 [26] and GDA v1.1 software [27]. The piSNP data were input into the Erasmus Medical Center (EMC) web tool (https://hirisplex.erasmusmc.nl (accessed on 20 May 2023), Rotterdam, Netherlands). The R script supplied by the EMC web tool was employed to convert the Excel piSNP format—provided by the UAS—to the specific format needed for the web tool. Although UAS and EMC predictors use multinomial logistic regression (MLR) prediction models, UAS was only trained with a European reference dataset based on the original HIrisPlex System [5]. Conversely, the EMC web tool provides an enhanced prediction model that considers hair color and shade, and it was trained with an extended dataset comprising individuals with various continental origins, as follows: 80% from Europe, 16% from North America (including individuals of European, African, and Asian ancestry), and 4% from Africa and Oceania [4,5,28]. Predictions obtained with the UAS and the EMC web tool were compared among each other, as well as with the color phenotypes based on the volunteer’s photographs. Both the UAS and the EMC web tool included blue, intermediate, and brown eye color inference providing p-values for each phenotype. For hair color inference, UAS produced the outcomes black, brown, red, and blond, whereas the EMC web tool produced different outcomes: black, dark brown/black, brown/dark brown, brown, dark blond/brown, blond/dark blond, blond, and red. For hair color estimation, we employed the following methods: (1) the enhanced prediction algorithm (HirisPlex-S DNA Phenotyping Webtool User Manual 2.0 (2018), found at: https://hirisplex.erasmusmc.nl accessed on 20 May 2023) for the EMC predictions considering the hair color and shade (black, dark brown/black, brown/dark brown, brown, dark blond/brown, blond/dark blond, blond, and red) and (2) employing only the black, brown, blond, and red categories for comparison purposes between the two algorithms. For comparative and assessment purposes, two different levels of predicted p-values were taken into account, as follows: (1) considering the highest p-value (pMax) without restriction: where no threshold was considered as long as prediction difference ≥10% between the two higher values (i.e., brown = 0.49, blond = 0.39, thus we assumed the brown color prediction), whereas the result was considered inconclusive when the difference between probabilities was <10%; and (2) employing a threshold of probabilities ≥70% (i.e., brown = 0.71, blond = 0.29, thus we assumed the prediction of brown color), while we declared inconclusive when the probability of the prediction was <70% (i.e., brown = 0.69, blond = 0.31).

3. Results

3.1. Genetic Population Data

Genotypes of 24 piSNPs from 88 samples are available in Supplementary Materials S1. Allelic frequencies, Ho, and HWE results are available in Table 1. The higher Ho was observed in rs885479 (Ho = 0.523), while rs312262906, rs1805006, rs11547464, rs201326893, and rs1805008 were in monomorphic state in the Monterrey population (Ho = 0). We observed two loci (rs1042602 and rs1800407) out of Hardy–Weinberg expectations (p > 0.05). However, when the Bonferroni correction was applied (p > 0.002), all loci were in HWE (Table 1). We found 19 pairs of loci in linkage disequilibrium (LD), in which 2 loci stand out, the rs1042602 (TYR gene) and rs1800407 loci (OCA2 gene) because they were involved in 14 and 5 of these 19 pairwise LD cases, respectively. When Bonferroni correction was applied, only two pairs of loci were in LD, again both rs1042602 and rs1800407 (Supplementary Material S2). Due to the forensic purpose of this work, these findings will not be further discussed.

3.2. Eye color Prediction Performance

No dropouts were found in the sample set; therefore, all samples were predicted (n = 88) for both UAS and EMC systems. We observed eighty-five brown eyes (96.6%) and three intermediate eye colorations (3.4%) in the Monterrey City population, where no blue eyes were observed. On the other hand, the UAS and EMC systems without restrictions (pMax) predicted eighty-six brown eyes and one blue eye, plus one inconclusive result; nevertheless, only eighty-five brown eye assignments were correct (96.6%) (Table 2a). However, when we excluded those predictions with probabilities < 70% (three samples), the prediction increased to 98.8% for the two systems. Both UAS and EMC failed to predict accurately the three intermediate eye colors observed in the whole sample (Figure 1). In the first case, the observed eye color was green, but predictors results were inconclusive for both systems (p = ~0.3 for each color). The second case was hazelnut color, but predictors suggest blue color (p = 0.72 for UAS and p = 0.68 for EMC). For the last case, amber color was observed, but both predictors showed a brown eye color prediction (p = 0.94) (Figure 1). Although we reviewed the results of the principal component analysis (PCA) for these three samples, the same ancestry pattern of Mestizo individuals from northern Mexico was observed; thus, ancestry results did not provide additional relevant information.

3.3. Hair Color Prediction Performance

When the hair color of photographs was assigned according to the enhanced method, the following hair colors were observed: sixty-one black, twenty-four dark brown/black, two brown/dark brown, and one blond/dark brown were observed. No brown, blond, and red categories were observed. EMC web tool predicted 59 black hair, 25 dark brown/black hair, and 4 brown/dark brown hair, with an accuracy of 78.6, 54.2, and 50% of accuracy, respectively (Table 3).
When the hair colors of photographs were assigned employing the original HIrisPlex categories, 66 black (75%) and 22 brown (25%) hairs were observed in the population of Monterrey City. No blond or red hair colors were observed. For hair coloration, the variation of the results among predictors was greater than those obtained for eye color (Table 2). For the unrestricted data, UAS predicted seventy black and six brown hairs, of which fifty-seven and three were correct, respectively (accuracy = 68.2%).
On the other hand, EMC predicted fifty-four black and nineteen brown hairs, with 46 and 10 correct predictions, respectively (accuracy = 63.6%). When we filtered the results to those with probabilities >70%, prediction accuracy improved for both systems (UAS = 84%; EMC = 91%).
Interestingly, it is evident that better results were obtained with both predictors when the hair color was black, as both systems failed to predict brown hair with probabilities > 70%. It appears that UAS is more robust to predict hair coloration than EMC, when the p > 70% constraint is applied (42 vs. 20) (Table 2). Nevertheless, these comparisons were performed when the UAS color categories were employed (black, brown, blond, and red) and the black and brown hair color shades were omitted, which could cause a result bias in favor of UAS performance. Figure 1b shows three mismatch cases of prediction when the hair color was black. In the first case, both systems failed to assign one hair color but provided almost the same probability to brown and blond hair colors. In the second case, probabilities were 0.62 and 0.73 for the brown color in UAS and EMC, respectively. In the third case, UAS correctly predicted black color (p = 0.6), whereas EMC predicted brown color (p = 0.49) (Figure 1b).

4. Discussion

Although different studies have reported the prediction of externally visible characteristics in worldwide populations [21,29,30,31,32], the hair and eye color prediction based on the ForenSeq system has been scarcely reported. In the Americas, we only found two reports based on this forensic genomic system in US populations [22,23]. Therefore, the usefulness of this technology for admixed populations should be analyzed in depth. We report—for the first time in one Latin American population—the performance of the ForenSeq DNA Signature Prep kit to predict hair and eye coloration, specifically in Mexican Mestizos from Monterrey City.

4.1. Data Quality

As opposed to the EMC web tool, UAS is not able to provide phenotype estimations when a specific allele drop-out is present because it significantly affects the phenotype prediction [33,34]. Nevertheless, the 88 samples reported herein provided confident phenotype estimations with both predicting tools since the sequencing runs did not show significant allele drop-out (Supplementary Materials S1).

4.2. Population Characteristics

Mexican Mestizo populations are characterized by the presence of European, Native American, and African ancestries whose admixture processes took place since the European contact with the Americas around 500 years ago. However, the admixture pattern is not homogeneous throughout the country: northern populations show higher European ancestry while southern populations show more Native American ancestry; in general, the ancestry of Mexican Mestizo populations is related to the pre-Hispanic demography and recapitulates the Native American substructure [24,35]. Monterrey City is located in the northeast of Mexico, presenting one of the highest European ancestries in the country [24,36]. Interestingly, the origin of European conquers was principally Spain, and to a lesser extent Portugal and Italy [37]. Although Spain has a high frequency of blue and green/intermediate eyes [25], most of the studied population sample had brown eyes (85/88), and only three showed intermediate eye colors, one sample for each: green, hazelnut, and amber (Table 2, Figure 1a). Conversely, a relatively high frequency of blue and green eye color was reported in one Argentinean population sample [29]. Regarding hair color, we observed only brown and black related-shades. This is in line with a previous description from the Latin American population where blond hair was scarce and red hair was not observed [38].

4.3. Eye Color Prediction

The Monterrey City population showed 96.6% of brown eyes and 3.4% of intermediate eyes, whereas blue eyes were not found. This eye color distribution is in line with the data obtained based on the HGDP-CEPH Human Genome Diversity Cell Line Panel for Mexican populations [25]. Predictions obtained with both UAS and EMC systems were high with the pMax approach (accuracy = 96.6%) but improved with the p > 70% threshold (accuracy = 98.8%). Nevertheless, some considerations must be taken into account, for example, only brown eyes were correctly inferred by both systems, but the three cases of intermediate eye color assignments were incorrect. Although previous studies in European populations show high performance of the IrisPlex based-methods, in admixed populations, the blue eye color usually is scarce [25], with some exceptions such as the Argentineans [29]. We observed two eye colors but predominating brown color eyes; this skew in the color distribution causes an upward shift in the accuracy of the prediction, especially for those methods biased to the most frequent category that seems to offer better performance than they actually are [38]. Consequently, the high performance in eye color prediction seen in the Monterrey population could be biased by the brown eye color predominance, reflecting a better performance of this system to predict extreme brown and blue eye colors [4].
Although it was formerly thought that human eye color is a simple Mendelian phenotype [39], subsequent studies reported that it is a complex pattern of inheritance regulated by complex processes such as epistasis and incomplete dominance [40]. This could be one reason why the six SNPs included in the IrisPlex prediction-based methods show a low accuracy in the prediction of intermediate eye color, as was shown in our study in the Monterrey population, as well as in previous worldwide populations [21,29,30,31,33]. These findings highlight the necessity to include a greater number of loci in eye color prediction systems to better predict the intermediate eye color, especially for populations with high frequencies for this phenotype.

4.4. Hair Color Prediction

We observed a limited variety of hair colors (black, dark brown/black, brown/dark brown and blond/dark brown), with a predominance of black and dark brown/black shades. Blond and red hair colors were not observed in the Monterrey City population. This result is in line with a previous report on Latin American populations [38]. For Hirisplex, a prediction efficacy of 81–93% was initially reported for red, blond, brown, and black hair colors [41]. When we compared the predictions offered by UAS and EMC, employing the original hair colors of HIrisPlex, we obtained an accurate prediction of 68.2 and 63.6% based on pMax approach, respectively.
Although European populations showed better prediction results, they also show higher diversity in hair colors (black, brown, blond, and red hair) [21,30]. For both UAS and EMC, the black hair color was better predicted than brown, as previously described [21,30,33,38]. Thus, the low accuracy of hair prediction observed herein must be influenced by the presence of 25% of brown hair color. On the other hand, the low prediction of blond and brown hair colors may be influenced by the hair darkening that occurs in blond individuals from six to thirteen years old [42].
When we assessed the prediction results based on the EMC-enhanced method that considers the hair shade, a better prediction performance was observed (71.5%). Although this result is lower than the one obtained with the >70% threshold, the EMC-enhanced method allows hair color prediction without excluding samples with low probabilities values.

4.5. Predictor’s Performance

Both UAS and EMC showed the same prediction eye color performance, even employing the threshold criteria, as well as inconclusive results (p = ~0.30 for each color) in only one sample. When the shade was omitted for the hair color prediction, a higher number of inconclusive results were observed in the comparative analysis for both predictors (UAS = 11, EMC = 14), while no inconclusive predictions were observed when we applied the p > 70% threshold. Nevertheless, these accuracy results must be taken carefully. For instance, when we used the pMax approach, UAS showed 17 incorrect predictions (68.2% of accuracy) while EMC showed 18 incorrect predictions (63.6% of accuracy). Conversely, when the threshold was employed, UAS and EMC showed eight and two incorrect predictions, respectively, without inconclusive results. Although this seems good, a high number of predictions were excluded (UAS = 38 and EMC = 66) due to probabilities below the threshold (p < 70%). Thus, the higher accuracy predictions of the threshold (UAS = 84% and EMC = 91%) diminish considerably the number of samples (50 for UAS and 22 for EMC) because many are excluded.
Overall, both UAS and EMC had a similar accuracy precision for eye coloration, but for hair color prediction, UAS seems to be more accurate than EMC, both employing the pMax approach as well as the threshold. Conversely, previous studies in European populations showed a better performance of EMC for black and brown color hair, than those obtained by UAS [21]. This finding could be explained because in these studies, the enhanced method was employed; in fact, we obtained similar results with the enhanced method (accuracy of 71.5%). Moreover, the performance of hair prediction in Monterrey City could be biased by the almost exclusive presence of black hair color [38]. The same results will most likely apply to most of the Mexican Mestizo populations characterized by high Native American ancestry [24,35,36], which will be useful to evaluate future eye color prediction systems specifically designed to improve the prediction of intermediate eye colors. Finally, our prediction performance comparison in the Mexican population allows the recommendation of the enhanced method to predict hair color with the EMC web tool, instead of the threshold or regular pMax methods, which have the disadvantage of excluding samples from the prediction analysis.

5. Conclusions

We evaluated the 24 piSNPs included in the ForenSeq DNA Signature Prep kit to predict eye and hair color in Mexican Mestizos from the Monterrey City population. To our knowledge, this is the first report that predicts eye and hair color based on this forensic MPS technology in one Latin American population. Based on the UAS and EMC platforms, we observed a high-accuracy prediction for brown eye color. However, hair color prediction had a lower performance than in previous studies. Although employing a threshold of p > 70% improved the prediction accuracy, this causes the exclusion of a high number of results below the threshold; hence, the better option in forensic casework should be the pMax enhanced method, considering hair shade. Our results enable the reliable forensic use of this genomic platform to predict eye and hair color in this Mexican Mestizo population, principally in brown eyes and black hair colors, but with caution to predict hair phenotypes which are different from black hair. Future studies must be conducted with higher sample sizes in Mexican populations to evaluate the prediction performance of other possible eye and hair colors not observed in this study.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14051120/s1, Supplementary Material S1: SNP genotype database of the studied Mexican Mestizo population sample (n = 88). Supplementary Material S2: Results of the pairwise LD tests (p-values) between piSNPs studied in the Mexican Mestizo population sample (n = 88).

Author Contributions

Conceptualization: B.R.-G. and H.R.-V. Methodology: M.F.C.-M., C.E.C.-Á. and O.S.-S. Software, Validation, Formal analysis and Data curation: J.A.A.-V., B.J.L.-d.-D. and A.L.-Q. Project administration and Funding acquisition: B.R.-G. and H.R.-V. Writing—Original Draft Preparation: J.A.A.-V. and H.R.-V. Writing—Review and Editing: J.A.A.-V., B.R.-G. and H.R.-V. All authors have read and agreed to the published version of the manuscript.

Funding

Mexican government thought CONACYT (grant 286623) to H.R.-V.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and was approved by the Ethical Research Committee at the Institute of Criminalistics and Forensic Services of the Attorney General of Nuevo Leon (Project number assigned: IC-017-2019).

Informed Consent Statement

All the volunteers signed an informed consent form before their inclusion in the study, according to the Helsinki Declaration Ethical Guidelines. The anonymity of the participants will be preserved at all times.

Data Availability Statement

The online version contains Supplementary Materials available on the journal webpage.

Acknowledgments

The authors thank the volunteers who kindly accept to participate in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kayser, M.; de Knijff, P. Improving human forensics through advances in genetics, genomics and molecular biology. Nat. Rev. Genet. 2011, 12, 179–192. [Google Scholar] [CrossRef] [PubMed]
  2. Biesecker, L.G.; Bailey-Wilson, J.E.; Ballantyne, J.; Baum, H.; Bieber, F.R.; Brenner, C.; Budowle, B.; Butler, J.M.; Carmody, G.; Conneally, P.M.; et al. DNA Identifications After the 9/11 World Trade Center Attack. Science 2005, 310, 1122–1123. [Google Scholar] [CrossRef] [PubMed]
  3. Butler, J.M.; Coble, M.; Vallone, P. STRs vs. SNPs: Thoughts on the future of forensic DNA testing. Forensic Sci. Med. Pathol. 2007, 3, 200–205. [Google Scholar] [CrossRef] [PubMed]
  4. Walsh, S.; Liu, F.; Ballantyne, K.N.; van Oven, M.; Lao, O.; Kayser, M. IrisPlex: A sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information. Forensic Sci. Int. Genet. 2011, 5, 170–180. [Google Scholar] [CrossRef] [PubMed]
  5. Walsh, S.; Liu, F.; Wollstein, A.; Kovatsi, L.; Ralf, A.; Kosiniak-Kamysz, A.; Branicki, W.; Kayser, M. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci. Int. Genet. 2013, 7, 98–115. [Google Scholar] [CrossRef]
  6. de Knijff, P. From next generation sequencing to now generation sequencing in forensics. Forensic Sci. Int. Genet. 2019, 38, 175–180. [Google Scholar] [CrossRef]
  7. Børsting, C.; Morling, N. Next generation sequencing and its applications in forensic genetics. Forensic Sci. Int. Genet. 2015, 18, 78–89. [Google Scholar] [CrossRef]
  8. Jäger, A.C.; Alvarez, M.L.; Davis, C.P.; Guzmán, E.; Han, Y.; Way, L.; Walichiewicz, P.; Silva, D.; Pham, N.; Caves, G.; et al. Developmental validation of the MiSeq FGx Forensic Genomics System for Targeted Next Generation Sequencing in Forensic DNA Casework and Database Laboratories. Forensic Sci. Int. Genet. 2017, 28, 52–70. [Google Scholar] [CrossRef]
  9. Phillips, C. Forensic genetic analysis of bio-geographical ancestry. Forensic Sci. Int. Genet. 2015, 18, 49–65. [Google Scholar] [CrossRef]
  10. Strobl, C.; Eduardoff, M.; Bus, M.; Allen, M.; Parson, W. Evaluation of the precision ID whole MtDNA genome panel for forensic analyses. Forensic Sci. Int. Genet. 2018, 35, 21–25. [Google Scholar] [CrossRef]
  11. Holt, C.L.; Stephens, K.M.; Walichiewicz, P.; Fleming, K.D.; Forouzmand, E.; Wu, S.-F. Human Mitochondrial Control Region and mtGenome: Design and Forensic Validation of NGS Multiplexes, Sequencing and Analytical Software. Genes 2017, 12, 599. [Google Scholar] [CrossRef] [PubMed]
  12. Verogene. ForenSeq™ DNA Signature Prep Reference Guide. 2020. D2018005 Rev. C.; QIAGEN: San Diego, CA, USA, 2020. [Google Scholar]
  13. Wendt, F.R.; King, J.L.; Novroski, N.M.; Churchill, J.D.; Ng, J.; Oldt, R.F.; McCulloh, K.L.; Weise, J.A.; Smith, D.G.; Kanthaswamy, S.; et al. Flanking region variation of ForenSeq™ DNA Signature Prep Kit STR and SNP loci in Yavapai Native Americans. Forensic Sci. Int. Genet. 2017, 28, 146–154. [Google Scholar] [CrossRef] [PubMed]
  14. Devesse, L.; Ballard, D.; Davenport, L.; Riethorst, I.; Mason-Buck, G.; Court, D.S. Concordance of the ForenSeq™ system and characterisation of sequence-specific autosomal STR alleles across two major population groups. Forensic Sci. Int. Genet. 2018, 34, 57–61. [Google Scholar] [CrossRef] [PubMed]
  15. Phillips, C.; Devesse, L.; Ballard, D.; van Weert, L.; de la Puente, M.; Melis, S.; Iglesias, V.; Freire-Aradas, A.; Oldroyd, N.; Holt, C.; et al. Global patterns of STR sequence variation: Sequencing the CEPH human genome diversity panel for 58 forensic STRs using the Illumina ForenSeq DNA Signature Prep Kit. Electrophoresis 2018, 39, 2708–2724. [Google Scholar] [CrossRef]
  16. Hussing, C.; Bytyci, R.; Huber, C.; Morling, N.; Børsting, C. The Danish STR sequence database: Duplicate typing of 363 Danes with the ForenSeq™ DNA Signature Prep Kit. Int. J. Leg. Med. 2019, 133, 325–334. [Google Scholar] [CrossRef]
  17. Delest, A.; Godfrin, D.; Chantrel, Y.; Ulus, A.; Vannier, J.; Faivre, M.; Hollard, C.; Laurent, F.-X. Sequenced-based French population data from 169 unrelated individuals with Verogen’s ForenSeq DNA signature prep kit. Forensic Sci. Int. Genet. 2020, 47, 102304. [Google Scholar] [CrossRef]
  18. Guevara, E.K.; Palo, J.U.; King, J.L.; Buś, M.M.; Guillén, S.; Budowle, B.; Sajantila, A. Autosomal STR and SNP characterization of populations from the Northeastern Peruvian Andes with the ForenSeqTM DNA Signature Prep Kit. Forensic Sci. Int. Genet. 2021, 52, 102487. [Google Scholar] [CrossRef]
  19. Aguilar-Velázquez, J.A.; Duran-Salazar, M.; Córdoba-Mercado, M.F.; Coronado-Avila, C.E.; Salas-Salas, O.; Martinez-Cortés, G.; Casals, F.; Calafell, F.; Ramos-González, B.; Rangel-Villalobos, H. Characterization of 58 STRs and 94 SNPs with the ForenSeq™ DNA signature prep kit in Mexican-Mestizos from the Monterrey city (Northeast, Mexico). Mol. Biol. Rep. 2020, 49, 7601–7609. [Google Scholar] [CrossRef]
  20. Casals, F.; Rasal, R.; Anglada, R.; Tormo, M.; Bonet, N.; Rivas, N.; Vásquez, P.; Calafell, F. A forensic population database in El Salvador: 58 STRs and 94 SNPs. Forensic Sci. Int. Genet. 2022, 57, 102646. [Google Scholar] [CrossRef]
  21. Salvo, N.M.; Janssen, K.; Kirsebom, M.K.; Meyer, O.S.; Berg, T.; Olsen, G.-H. Predicting eye and hair colour in a Norwegian population using Verogen’s ForenSeq™ DNA signature prep kit. Forensic Sci. Int. Genet. 2022, 56, 102620. [Google Scholar] [CrossRef]
  22. Wendt, F.R.; Churchill, J.D.; Novroski, N.M.; King, J.L.; Ng, J.; Oldt, R.F.; McCulloh, K.L.; Weise, J.A.; Smith, D.G.; Kanthaswamy, S.; et al. Genetic analysis of the Yavapai Native Americans from West-Central Arizona using the Illumina MiSeq FGx™ forensic genomics system. Forensic Sci. Int. Genet. 2016, 24, 18–23. [Google Scholar] [CrossRef] [PubMed]
  23. Churchill, J.D.; Novroski, N.M.; King, J.L.; Seah, L.H.; Budowle, B. Population and performance analyses of four major populations with Illumina’s FGx Forensic Genomics System. Forensic Sci. Int. Genet. 2017, 30, 81–92. [Google Scholar] [CrossRef] [PubMed]
  24. Rubi-Castellanos, R.; Martínez-Cortés, G.; Muñoz-Valle, J.F.; González-Martín, A.; Cerda-Flores, R.M.; Anaya-Palafox, M.; Rangel-Villalobos, H. Pre-Hispanic Mesoamerican demography approximates the present-day ancestry of Mestizos throughout the territory of Mexico. Am. J. Phys. Anthr. 2009, 139, 284–294. [Google Scholar] [CrossRef] [PubMed]
  25. Freire-Aradas, A.; Ruiz, Y.; Phillips, C.; Maroñas, O.; Söchtig, J.; Tato, A.G.; Dios, J.; de Cal, M.C.; Silbiger, V.; Luchessi, A.; et al. Exploring iris colour prediction and ancestry inference in admixed populations of South America. Forensic Sci. Int. Genet. 2014, 13, 3–9. [Google Scholar] [CrossRef] [PubMed]
  26. Peakall, R.; Smouse, P.E. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinformatics 2012, 28, 2537–2539. [Google Scholar] [CrossRef] [PubMed]
  27. Lewis, P.O.; Zaykin, D. Genetic Data Analysis (GDA): Computer Program for the Analysis of Allelic Data; Version 1.1; Free Program Distributed by the Authors over the Internet; 2001; Available online: https://plewis.github.io/software/ (accessed on 1 May 2023).
  28. Chaitanya, L.; Breslin, K.; Zuñiga, S.; Wirken, L.; Pośpiech, E.; Kukla-Bartoszek, M.; Sijen, T.; de Knijff, P.; Liu, F.; Branicki, W.; et al. The HIrisPlex-S system for eye, hair and skin colour prediction from DNA: Introduction and forensic developmental validation. Forensic Sci. Int. Genet. 2018, 35, 123–135. [Google Scholar] [CrossRef]
  29. Hohl, D.M.; González, R.; Meztler, G.P.D.S.; Patiño-Rico, J.; Dejean, C.; Avena, S.; Gutiérrez, M.D.L.; Catanesi, C.I. Applicability of the IrisPlex system for eye color prediction in an admixed population from Argentina. Ann. Hum. Genet. 2022, 86, 297–327. [Google Scholar] [CrossRef]
  30. Sari, O.I.; Simsek, S.Z.; Filoglu, G.; Bulbul, O. Predicting Eye and Hair Color in a Turkish Population Using the HIrisPlex System. Genes 2022, 13, 2094. [Google Scholar] [CrossRef]
  31. Alghamdi, J.; Amoudi, M.; Kassab, A.C.; Al Mufarrej, M.; Al Ghamdi, S. Eye color prediction using single nucleotide polymorphisms in Saudi population. Saudi J. Biol. Sci. 2018, 26, 1607–1612. [Google Scholar] [CrossRef]
  32. Salvoro, C.; Faccinetto, C.; Zucchelli, L.; Porto, M.; Marino, A.; Occhi, G.; de Los Campos, G.; Vazza, G. Performance of four models for eye color prediction in an Italian population sample. Forensic Sci. Int. Genet. 2019, 40, 192–200. [Google Scholar] [CrossRef]
  33. Sharma, V.; Jani, K.; Khosla, P.; Butler, E.; Siegel, D.; Wurmbach, E. Evaluation of ForenSeq™ Signature Prep Kit B on predicting eye and hair coloration as well as biogeographical ancestry by using Universal Analysis Software (UAS) and available web-tools. Electrophoresis 2019, 40, 1353–1364. [Google Scholar] [CrossRef]
  34. Weisz, N.A.; Roberts, K.A.; Hardy, W.R. Reliability of phenotype estimation and extended classification of ancestry using decedent samples. Int. J. Leg. Med. 2021, 135, 2221–2233. [Google Scholar] [CrossRef] [PubMed]
  35. Moreno-Estrada, A.; Gignoux, C.R.; Fernández-López, J.C.; Zakharia, F.; Sikora, M.; Contreras, A.V.; Acuña-Alonzo, V.; Sandoval, K.; Eng, C.; Romero-Hidalgo, S.; et al. The genetics of Mexico recapitulates Native American substructure and affects biomedical traits. Science 2014, 344, 1280–1285. [Google Scholar] [CrossRef] [PubMed]
  36. Aguilar-Velázquez, J.A.; Martínez-Cortés, G.; Inclán-Sánchez, A.; Favela-Mendoza, A.F.; Velarde-Félix, J.S.; Rangel-Villalobos, H. Forensic parameters and admixture in Mestizos from five geographic regions of Mexico based on 20 autosomal STRs (Powerplex 21 system). Int. J. Leg. Med. 2018, 132, 1293–1296. [Google Scholar] [CrossRef] [PubMed]
  37. Grunberg, B. El universo de los conquistadores: Resultado de una investigación prosopográfica. Signos Históricos 2004, 012, 94–118. [Google Scholar]
  38. Palmal, S.; Adhikari, K.; Mendoza-Revilla, J.; Fuentes-Guajardo, M.; Silva de Cerqueira, C.C.; Bonfante, B.; Chacón-Duque, J.C.; Sohail, A.; Hurtado, M.; Villegas, V.; et al. Prediction of eye, hair and skin colour in Latin Americans. Forensic Sci. Int. Genet. 2021, 53, 102517. [Google Scholar] [CrossRef]
  39. Hurst, C. On the inheritance of eye-colour in man. Proc. R. Soc. London—Ser. B Contain. Pap. A Biol. Character 1908, 80, 85–96. [Google Scholar] [CrossRef]
  40. White, D.; Rabago-Smith, M. Genotype–phenotype associations and human eye color. J. Hum. Genet. 2011, 56, 5–7. [Google Scholar] [CrossRef]
  41. Walsh, S.; Chaitanya, L.; Clarisse, L.; Wirken, L.; Draus-Barini, J.; Kovatsi, L.; Maeda, H.; Ishikawa, T.; Sijen, T.; de Knijff, P.; et al. Developmental validation of the HIrisPlex system: DNA-based eye and hair colour prediction for forensic and anthropological usage. Forensic Sci. Int. Genet. 2014, 9, 150–161. [Google Scholar] [CrossRef]
  42. Kukla-Bartoszek, M.; Pośpiech, E.; Spólnicka, M.; Karłowska-Pik, J.; Strapagiel, D.; Żądzińska, E.; Rosset, I.; Sobalska-Kwapis, M.; Słomka, M.; Walsh, S.; et al. Investigating the impact of age-depended hair colour darkening during childhood on DNA-based hair colour prediction with the HIrisPlex system. Forensic Sci. Int. Genet. 2018, 36, 26–33. [Google Scholar] [CrossRef]
Figure 1. Mismatches between predicted and observed phenotypes for eye (a) and hair (b) colors in the Mestizo population sample from Monterrey City (Northeast, Mexico).
Figure 1. Mismatches between predicted and observed phenotypes for eye (a) and hair (b) colors in the Mestizo population sample from Monterrey City (Northeast, Mexico).
Genes 14 01120 g001
Table 1. Allele frequencies, heterozygosity and Hardy–Weinberg evaluation of the 24 piSNPs studied in the Mexican Mestizo population of Monterrey City (n = 88).
Table 1. Allele frequencies, heterozygosity and Hardy–Weinberg evaluation of the 24 piSNPs studied in the Mexican Mestizo population of Monterrey City (n = 88).
GeneSNPAllele FrequenciesHoHeHWE
p-Value
ATGC
SLC45A2rs287770.3636--0.63640.43180.46550.3663
IRF4rs12203592-0.0284-0.97160.03410.05550.0584
LOC105374875rs49592700.4830--0.51710.42050.50230.1428
TYRP1rs6830.4546--0.54550.47730.49870.8225
TYRrs10426020.2273--0.77270.25000.35320.0100 *
TYRrs13933500.1023-0.8977-0.18180.18471
KITLGrs12821256-0.9943-0.00570.01140.01141
LOC105370627rs12896399-0.28410.7159-0.38640.40910.6025
SLC24A4rs24021300.8409-0.1591-0.25000.26910.4478
OCA2rs18004070.0398-0.9602-0.05680.07680.0047 *
MC1Rrs312262906 --1.0000001
MC1Rrs1805005-0.07390.9261-0.12500.13760.3759
MC1Rrs1805006---1.0000001
MC1Rrs22284790.0341-0.9659-0.06820.06621
MC1Rrs11547464--1.0000-001
MC1Rrs1805007-0.0114-0.98860.02270.02261
MC1Rrs201326893 --1.0000001
MC1Rrs1110400-0.9886-0.01140.02270.02261
MC1Rrs1805008---1.0000001
MC1Rrs8854790.3864-0.6136-0.52270.47690.4991
TUBB3rs1805009--0.98860.01140.02270.02261
PIGUrs23782490.9489-0.0511-0.10230.09761
SLC45A2rs16891982--0.34660.65340.44320.45550.8178
HERC2rs129138320.8693-0.1307-0.23860.22851
* SNPs out of Hardy–Weinberg expectations (p > 0.05); in grey are highlighted the cells of genes that predict eye color.
Table 2. Predictive results and accuracy of (a) eye and (b) hair coloration in Mexican Mestizos from Monterrey City using the ForenSeq™ DNA Signature Prep.
Table 2. Predictive results and accuracy of (a) eye and (b) hair coloration in Mexican Mestizos from Monterrey City using the ForenSeq™ DNA Signature Prep.
(a) Eye Color Prediction
Without Restriction ThresholdThreshold > 70%
ColorObserved
Color
UAS Prediction
(CA)
EMC Prediction
(CA)
UAS Prediction
(CA)
EMC Prediction (CA)
Brown8586 (85)86 (85)85 (84)85 (84)
Intermediate30000
Blue01 (0)1 (0)1 (0)0
Inconclusive-1100
Excluded-0033
Correct/
Incorrect
-85/285/284/184/1
Percentage
accuracy
-96.6%96.6%98.8%98.8%
(b) Hair Color Prediction
Without Restriction Threshold Threshold > 70%
ColorObserved
Color
UAS Prediction
(CA)
EMC Prediction
(CA)
UAS Prediction
(CA)
EMC Prediction (CA)
Black6670 (57)54 (46)50 (42)21 (20)
Brown226 (3)19 (10)01 (0)
Blond00000
Red00000
Inconclusive *-111400
Excluded **-003866
Correct/
Incorrect
-60/1756/1842/820/2
Percentage
accuracy ***
-68.2%63.6%84%91%
CA: Correctly assigned; * Inconclusive: prediction results with <10% of difference between the higher probabilities; ** Excluded: samples excluded due to prediction probabilities < 70%; *** Percentage accuracy: taking into account the total number of predictions for this calculus in both UAS and EMC for each threshold.
Table 3. Hair color prediction results based on the enhanced model of EMC web tool.
Table 3. Hair color prediction results based on the enhanced model of EMC web tool.
ColorObserved
Color
EMC
Prediction
(CA)
Accuracy
(%)
Black6159 (48)78.6
D. Brown/Black2425 (13)54.2
Brown00 -
Brown/D. Brown24 (2)50
Blond/D. Brown100
Blond00-
Red00-
Correct/Incorrect-63/25-
Percentage accuracy *-71.5%-
CA, Correctly assigned. * Percentage accuracy: taking into account the total number of predictions for this calculus in both UAS and EMC for each threshold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Aguilar-Velázquez, J.A.; Llamas-de-Dios, B.J.; Córdova-Mercado, M.F.; Coronado-Ávila, C.E.; Salas-Salas, O.; López-Quintero, A.; Ramos-González, B.; Rangel-Villalobos, H. Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeqTM DNA Signature Prep. Genes 2023, 14, 1120. https://doi.org/10.3390/genes14051120

AMA Style

Aguilar-Velázquez JA, Llamas-de-Dios BJ, Córdova-Mercado MF, Coronado-Ávila CE, Salas-Salas O, López-Quintero A, Ramos-González B, Rangel-Villalobos H. Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeqTM DNA Signature Prep. Genes. 2023; 14(5):1120. https://doi.org/10.3390/genes14051120

Chicago/Turabian Style

Aguilar-Velázquez, José Alonso, Blanca Jeannete Llamas-de-Dios, Miranda Fabiola Córdova-Mercado, Carolina Elena Coronado-Ávila, Orlando Salas-Salas, Andrés López-Quintero, Benito Ramos-González, and Héctor Rangel-Villalobos. 2023. "Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeqTM DNA Signature Prep" Genes 14, no. 5: 1120. https://doi.org/10.3390/genes14051120

APA Style

Aguilar-Velázquez, J. A., Llamas-de-Dios, B. J., Córdova-Mercado, M. F., Coronado-Ávila, C. E., Salas-Salas, O., López-Quintero, A., Ramos-González, B., & Rangel-Villalobos, H. (2023). Accuracy of Eye and Hair Color Prediction in Mexican Mestizos from Monterrey City Based on ForenSeqTM DNA Signature Prep. Genes, 14(5), 1120. https://doi.org/10.3390/genes14051120

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop