Next Article in Journal
Validation of Endogenous Control Genes by Real-Time Quantitative Reverse Transcriptase Polymerase Chain Reaction for Acute Leukemia Gene Expression Studies
Next Article in Special Issue
A New Tool for Probabilistic Assessment of MPS Data Associated with mtDNA Mixtures
Previous Article in Journal
Time-Series Transcriptome of Cucumis melo Reveals Extensive Transcriptomic Differences with Different Maturity
Previous Article in Special Issue
Amplification Failure of the Amelogenin X Gene Caused by a Rare Mutation in the Primer-Binding Region
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Utilizing Massively Parallel Sequencing (MPS) of Human Leukocyte Antigen (HLA) Gene Polymorphism to Assess Relatedness in Deficiency Parentage Testing

by
Diamanto I. Kouniaki
1,
Konstantinos V. Fotopoulos
2,
Katerina Tarassi
1 and
Alexandra Tsirogianni
1,*
1
Immunology and Histocompatibility Department, Evangelismos General Hospital, 10676 Athens, Greece
2
School of Electrical and Computer Engineering, National Technical University of Athens (ECE-NTUA), 15772 Zografou, Greece
*
Author to whom correspondence should be addressed.
Genes 2024, 15(2), 150; https://doi.org/10.3390/genes15020150
Submission received: 21 December 2023 / Revised: 13 January 2024 / Accepted: 20 January 2024 / Published: 24 January 2024

Abstract

:
In the realm of DNA testing with legal implications, the reliability and precision of genetic markers play a pivotal role in confirming or negating paternity claims. This study aimed to assess the potential utility of human leukocyte antigen (HLA) gene polymorphism through massively parallel sequencing (MPS) technology as robust forensic markers for parentage testing involving genetic deficiencies. It sought to redefine the significance of HLA genes in this context. Data on autosomal short tandem repeat (aSTR) mutational events across 18 paternity cases involving 16 commonly employed microsatellite loci were presented. In instances where traditional aSTR analysis failed to establish statistical certainty, kinship determination was pursued via HLA genotyping, encompassing the amplification of 17 linked HLA loci. Within the framework of this investigation, phase-resolved genotypes for HLA genes were meticulously generated, resulting in the definition of 34 inherited HLA haplotypes. An impressive total of 274 unique HLA alleles, which were classified at either the field 3 or 4 level, were identified, including the discovery of four novel HLA alleles. Likelihood ratio (LR) values, which indicated the likelihood of the observed data under a true biological relationship versus no relationship, were subsequently calculated. The analysis of the LR values demonstrated that the HLA genes significantly enhanced kinship determination compared with the aSTR analysis. Combining LR values from aSTR markers and HLA loci yielded conclusive outcomes in duo paternity cases, showcasing the potential of HLA genes and MPS technology for deeper insights and diversity in genetic testing. Comprehensive reference databases and high-resolution HLA typing across diverse populations are essential. Reintegrating HLA alleles into forensic identification complements existing markers, creating a potent method for future forensic analysis.

1. Introduction

Human leukocyte antigens (HLAs) are part of the major histocompatibility complex (MHC) and play an important role in the regulation of the immune system, as well as in fundamental molecular and cellular processes [1]. The HLA system is one of the most polymorphic regions of the human genome and, to date, more than 36,000 HLA class I and II alleles have been identified according to the Immuno Polymorphism Database-ImMunoGeneTics project/HLA Database (IPD-IMGT/HLA database, v 3.54, 2023-03), which contains coding for more than 21,500 distinct functional proteins specialized in presenting antigenic peptides to the T-cell receptor (TCR) [2]. Due to their proximity on the short arm of chromosome 6 (6p21.3), a complete set of alleles of genes mapped in a row to the same chromosome is usually inherited as a haplotype in a Mendelian fashion from each parent [3]. During the 1980s to the 1990s, the determination of the HLA region served as the established procedure in forensic genetics [4]. The HLA system was considered highly suitable for parentage determination due to its low recombination rate, lack of mutation recording in family studies, and availability of HLA allele frequencies for diverse ethnic groups [5]. However, traditional HLA typing methods, which focus on the polymorphic regions responsible for encoding the antigen recognition site (ARS), namely, exons 2 and 3 for HLA-class I genes and exon 2 for HLA-class II genes, often provide ambiguous results, particularly in instances of heterozygosity. Multiple analysis patterns within the same exon coding for the HLA antigen were observed in such cases. The limited discriminatory power, which is influenced by linkage disequilibrium and prevalent alleles in certain ethnic groups, posed challenges in forensic genetics. As a result, advancements in autosomal short tandem repeat (aSTR) typing technology have largely overshadowed HLA typing in the past two decades, offering improved accuracy and discriminatory power [6,7,8,9,10].
In contemporary parentage investigations, microsatellites have emerged as the markers of choice due to their highly polymorphic nature, codominant inheritance, and wide distribution throughout the euchromatic genome [11]. Microsatellites, or autosomal short tandem repeats (aSTRs), consist of tandem repeats of short nucleotide sequences (1–6 bp repeat units) that form series with lengths of up to 100 nucleotides (nt) [12]. The Mendelian segregation of aSTR markers in families renders them ideal for personal identification systems in medical and forensic applications, where they are admissible as evidence in legal proceedings. However, certain factors can render these markers suboptimal for parentage analyses in some cases. In particular, the presence of undetectable alleles (null or silent) [13], genotyping errors, and mutational events [12,14] can lead to Mendelian-inconsistent genotypes (observed in equal different parent mismatches) between parents and offspring, potentially impacting the accuracy of genetic profiles. A null or silent aSTR allele refers to an allele at a specific genetic locus that does not produce a detectable product during the genotyping process. It may be a result of mutations or variations in the DNA sequence that impede the amplification or detection of the targeted region [13]. Genotyping errors encompass a range of inaccuracies that can occur during the process of determining an individual’s genetic composition. These errors may result from technical issues, such as equipment malfunctions, contamination, or human errors in sample handling [12,14]. Mutational events involve changes in the DNA sequence of a specific locus over time. These changes can include insertions, deletions, or substitutions of nucleotides [12,14]. Mutational events, which are primarily driven by DNA polymerase slippage, can cause variations in the microsatellite length [12,15,16]. Although the repeat length is typically maintained within a stable range during somatic or germinal replication through mismatch repair systems, sporadic gametic mutations can occur, thereby influencing the interpretation of tests. It is well established that certain alleles within a single STR locus are more prone to mutation than others in the human genome, with their genetic diversity ranging from around 10−6 to 10−2 nt per generation [12,17]. Therefore, understanding the mutation rate of locus-specific aSTRs is crucial in paternity or kinship assessments. In the context of genetic analysis, microsatellites may be considered insufficient in certain cases due to their limitations, prompting the use of alternative markers, such as hypervariable regions of mitochondrial DNA (mt-DNA) [18], single nucleotide polymorphisms (SNPs) [19], X/Y-STR analysis [20], or HLA genotyping [7], to enhance the precision and comprehensiveness of genetic profiling. Additionally, statistical approaches, which are exemplified by the formulas proposed by the AABB, offer another valuable avenue for resolving paternity tests. This comprehensive approach aims to minimize errors and ensure accurate parentage evaluation, providing a broader scope for forensic genetic analysis.
Massively parallel sequencing (MPS) technology or next-generation sequencing (NGS) technology is a transformative method in genomics that is revolutionizing HLA genotyping and enabling profound insights into genetic diversity. MPS allows for the in-depth analysis of HLA sequences, overcoming traditional method limitations [21,22]. It employs long-range PCR and phase-defined sequencing, generating extensive data for exploring unsequenced HLA regions, identifying novel alleles, and reducing per-sample costs through multiplexing [23]. This technology has widespread applications, contributing significantly to groundbreaking discoveries in various fields [24].
The present study provided information on mutational events identified in 16 aSTRs loci commonly employed in forensic and paternity testing. Particularly, within the dataset of 428 disputed parentage tests involving 1210 individuals, we observed a total of eighteen aSTR mutational events. These mutations were distributed across various loci, with five occurring on the SE33 locus; three on the vWA locus; three on the D12S391 locus; two on the D8S1179 locus; and one each on the D10S1248, D3S1358, FGA, D2S1338, and D19S433 loci. These mutational events were identified in an equivalent number of distinct parent–offspring allelic transfers, and notably, two cases involved motherless scenarios. These mutational events resulted in discrepancies in the inherited alleles between the parents and offspring, contributing to diminished residual paternity indexes (PIs) in instances where alleles at certain aSTR genetic loci are infrequent. To enhance the result reliability, an alternative approach was adopted to investigate paternity determination through HLA genotyping using MPS technology. This adjustment aimed to address potential challenges and explore additional avenues for a more comprehensive analysis. The principal objective of this research endeavor was to assess the potential of HLA gene polymorphism through the application of MPS technology, with a view toward establishing HLA genes as effective forensic markers in the realm of genetic deficiency parentage testing. This study sought to redefine the role of HLA genes within this specialized domain.

2. Materials and Methods

2.1. Sample Collection and DNA Extraction

Among 428 civil disputed parentage tests (1210 individuals) addressed to the Immunology and Histocompatibility Lab of Evangelismos General Hospital between January 2014 and May 2023, 18 aSTR mutation events were observed in an equal number of investigations without ruling out fatherhood. In all cases requested by private or court order and apart from the child in question, both parents were submitted for testing, except 74 of which were father–motherless ones (duo cases), to prove or disprove suspected hypotheses. Genomic DNA (gDNA) was isolated from peripheral blood samples collected in EDTA-containing tubes and/or oral cotton buccal swabs from all individuals using the Maxwell® 16 Blood DNA Purification kit and Maxwell® 16 Buccal Swab LEV DNA kit, respectively (Maxwell Promega, Madison, WI, USA). The extracted DNA was quantified and the purity (A260/280 ratio > 1.8) was confirmed using the Qubit 1X dsDNA High-Sensitivity Assay Kit (Thermo Fisher Scientific, San Francisco, CA, USA) before genotyping according to the manufacturer’s instructions.
The study was approved by the Institutional Review Board (Ethics Committee) of Evangelismos Hospital (protocol code 78 and date of approval 10 March 2023). All participants provided written informed consent for the genetic studies prior to the sample collection, in accordance with the Code of Ethics of the World Medical Association (Declaration of Helsinki). Written informed consent was also provided by a parent or legal guardian for the minor child in question.

2.2. CE-Based aSTR Genotyping

A total of 1210 individuals (354 trio/74 duo parentage cases) were typed in the genetic marker system composed of the 16 aSTR loci (D10S1248, vWA, D16S539, D2S1338, D8S1179, D21S11, D18S51, D22S1045, D19S433, TH01, FGA, D2S441, D3S1358, D1S1656, D12S391, SE33) plus amelogenin for gender identification, applying the AmpFlSTR® NGM SElect™ PCR amplification kit (Applied Biosystems, Waltham, MA, USA) according to the manufacturer’s instructions. Amplified aSTR fragments were separated via capillary electrophoresis (CE) on an ABI 3130 Genetic Analyzer (Applied Biosystems, Waltham, MA, USA). The raw sequencing data (.fsa file) were stored using Data Collection Software v.1.3 (Applied Biosystem, Waltham, MA, USA). Microsatellite fragment analysis and allele calling were automatically assigned using GeneMapper ID-X Software v.1.3 (Applied Biosystem, Waltham, MA, USA). To avoid possible influence caused by genotyping errors, all loci with suspicious alleles were re-genotyped. The aSTR genotyping was performed according to the revised guidelines from the scientific group on DNA analysis methods (SWGDAM) and the International Society for Forensic Genetics (ISFG) recommendations [25] concerning STR nomenclature and working practices.

2.3. NGS-Based HLA Genotyping

2.3.1. HLA Gene Amplification and Library Preparation

In order to check the paternity affiliation in 18 cases with observed mutation events, further analysis of 52 individuals via amplifying 17 linked HLA loci (classical HLA-A, -B, -C, -DRB1/3/4/5, -DQA1, -DQB1, -DPA1, and -DPB1; non-classical HLA-F, -G, -H, and -E genes; and MICA and MICB) at high-resolution (HR) level using MPS assay was performed. Particularly, library preparation was carried out using hybrid capture technology supplied by an AlloSeq Tx17 kit (CareDX, Stockholm, Sweden). The multiplex commercial locus-specific primers were designed to amplify the HLA class Ia (HLA-A, -B, -C) and HLA class Ib (HLA-F, -G, -H, -E) from the 5′ untranslated region (UTR) to the 3′UTR (full gene). The HLA class II genes (DRB1/3/4/5, -DQA1, -DQB1, -DPA1, and -DPB1) were sequenced from exon 1 to the 3′UTR region (full exon to the 3rd field) and the MICA and MICB genes full exon to the 2nd field (Figure 1). Libraries were quantified using the Qubit 1X dsDNA High-Sensitivity Assay Kit (Thermo Fisher Scientific, San Francisco, CA, USA). All steps, including target generation, library preparation, clonal amplification, sequencing, and data analysis, were performed according to the manufacturer’s recommendations (Illumina, San Diego, CA, USA).

2.3.2. HLA Sequencing

Sequencing was performed on an Illumina MiSeq platform (Illumina, San Diego, CA, USA). The obtained raw sequencing data (FASTQ files) were analyzed using AlloSeq Assign analysis software Tx17.1 v.1.0.4. (CareDX, Stockholm, Sweden) with reference to the IPD-IMGT/HLA database v3.51.0.0 (12 January 2023). The HLA genotyping was carried out by observing the quality metrics for each locus, including the depth of reading coverage threshold, the level of overlap to determine the phase, the coverage level of key exons, and the flag messages highlighted by the software. Alleles were described with the first 3 fields of HLA allele nomenclature, which represent the nucleotide level assignment, or the 4th field (full gene).
When a potentially novel allele was detected, confirmatory HLA typing was performed using a multiplex long-range PCR assay using a commercial NGSgo®-MX6-1 kit (GenDx, Utrecht, The Netherlands), and the sequencing was carried out on a MiSeq system (Illumina, San Diego, CA, USA). The obtained sequencing data were analyzed using NGSengine software v.2.29.0 (GenDx, Utrecht, The Netherlands) with reference to the IPD-IMGT/HLA Database v3.51.0.0 (12 January 2023).

2.4. Statistical Analysis

2.4.1. Data Analysis for aSTRs Markers

In this study, in order to establish the kinship relationship between the parents and the child in question using aSTRs, the statistical parameters of power of exclusion (PE), random man not excluded (RMNE) [26], paternity index (PI) value of each locus, cumulative paternity index (CPI) value for all loci, and the probability of paternity (W) were performed by applying Bayes’ theorem [27] and according to the guidelines and recommendations of the International Society for Forensic Genetics (ISFG) [25]. Additionally, in order to create pedigrees and calculate the CPI and W values, the “Familias” program (downloaded free from http://www.nr.no/familias, accessed on 1 May 2010) was used. The likelihood ratio calculations (LRs) were based upon aSTR allelic frequencies, as estimated from the Caucasian population database provided by Steffen C.R. et al. [28]. Then, the likelihood ratio (LR) values, which represent the ratio of the likelihood of the observed data under the hypothesis of a true biological relationship to the likelihood under the hypothesis of no relationship, were subsequently calculated.
According to the American Association of Blood Banks (AABB) guidelines, in order to avoid the risk of falling into a false exclusion of the biological father of the child, more than two mismatches are required to satisfy the principle for an unambiguous exclusion of paternity [29]. In cases where 1 or 2 isolated exclusions occur for PI computation, the AABB recommends employing the corresponding mutation rate (μ) and the average probability of exclusion (PE) for non-fathers within the given system, as expressed in the formula PI = μ/PE [25]. The mutation rate (μ) was estimated using the observed frequency of inferred mutations at that marker in casework triplets, as expressed in the formula μ = s/n, where n is the total number of meiosis events and s is the number of these events deemed to be mutations [30]. An estimation of the germ-line mutation at genetic loci can be achieved by comparing the genotypes of offspring to those of their parents, after discarding genotyping errors, and is typically recognized as a shift in allelic mobility. The combined PI was calculated by multiplying PIs based on the product rule. The formulas of LRs calculation for trio and motherless cases are shown in Table S1. In this study, a PI greater than 10,000 was considered as proof of a parent–offspring relationship, where W, which represented the LR, corresponds to the probability of paternity being equal to or greater than 99.99%, assuming a priori probability of paternity of 0.5 [29]. When PI ranged from 0.0001 to 10,000 (including the aSTR mutation loci), other genetic markers were added until it was sufficient to make the decisions.
Due to the fact that there are limited data on Greek population genetics, the aSTR locus-specific mutation rates were collected based on the Caucasian population database provided by Ge J. et al. [31] and the PE value provided by Steffen C.R. et al. [28]. The overall loci-specific mutation rates with 95% confidence intervals (CI) were calculated at http://statpages.org/confint.html, accessed on 25 May 2009.

2.4.2. Data Analysis for HLA Alleles

For the parentage investigations, incompatibilities in at least one HLA allele between the parents and offspring indicate exclusion. For inclusions, the statistical parameters paternity index (PI) and probability of paternity (W) were calculated using Essen–Möller values. The PI is the LR, which is calculated using the mathematical formula LR = 1/p (where p is the frequency of the HLA haplotype), assuming a prior probability of 0.5. The posterior probability of paternity, denoted as W by Essen–Möller, is then calculated using the W = LR/(LR + 1) formula [32]. The LR calculations were based on the HLA allele frequencies as estimated using the Caucasian population from the Allele Frequency Net Database (http://www.allelefrequencies.net/default.asp, accessed 2020). For the HLA genes, if no frequency data were available for the 3rd field, the frequency of the 2nd field was applied.

3. Results

3.1. aSTR Typing Results

Eighteen aSTR mutational events were observed and were distributed among the different loci as follows: five mutations on the SE33 locus, three mutations on the vWA locus; three mutations on the D12S391 locus; two mutations on the D8S1179 locus; and one mutation each on the D10S1248, D3S1358, FGA, D2S1338, and D19S433 loci. These mutations occurred in an equal number (18/428, 4.21%) of distinct parent–offspring allelic transfers, with two cases involving motherless scenarios. Notably, no mutations were detected in the other seven aSTR loci (refer to Table S2). The apparent mutation events were counted under approximately 708 meiotic transfers, resulting in 11 328 allele transfers in the parent–child duos, all of which were in the male germ line, which provided either a gain or loss of a single-step repeat unit. The ratio of repeat gains and losses was relatively balanced (5:9), while four mutations could not be assigned. The average paternal mutation rate estimated across all loci was 0.0016 (95% CI 0.0009–0.0025) per locus per gamete per generation, which is mostly in agreement with previous studies [33,34]. The genotype details of paternity inconsistencies that resulted from mutations in 16 autosomal microsatellite loci studied are described in Table 1 (unpublished data).
In 15 out of the 18 cases, the aSTR analysis of the results showed the alleged father (AF) could be determined as being the biological father with a probability value that ranged from 47,478 to Log10PI 3.98 × 109. Only in three cases, the probability (W) was lower than 0.9999. Particularly, in the 11th case, the total PI value (8473) represented a likelihood that the genetic data supported the hypothesis of parentage over the hypothesis of coincidental paternal obligate allele(s) (POAs) sharing (when the PI value is between 1000 and 10,000, the verbal equivalent is “strong support”). Additionally, in the 8th and 14th cases, which were motherless, the PI values 697 and 7035, respectively, were weak data to support the hypothesis of parentage and more genetic markers are required for confident paternity results. Accordingly, in all cases, the indicators combined PE, which depends on deducing, in each case, the POAs from the child (in a duo) or the child plus its mother (in a trio), ranged from 0.99999978 to 0.99999999, and the RMNE, which ranged from Log10RMNE 1.82 × 10−14 to 1.29 × 10−7 (reliable equations to determine the power of a genetic test to exclude a pair of individuals as parents), argued in favor of relatedness (Table 2).
The 18 actual included cases were restudied after omitting maternal genotypes (i.e., only types of the father–child pairs were considered) to assess the probability of false exclusion occurrence upon simulating them into motherless cases. In two simulated cases (cases 15 and 16), no mismatches were detected, increasing the probability of relatedness (log10PI: 1.64 × 105 and 5.25 × 106, respectively), and therefore, fatherhood was not ruled out. Additionally, six included simulated duos failed to meet the criteria for concluding and reporting paternity inclusion, with PIs below the threshold of 10,000 (Table 2).

3.2. HLA Typing Results

3.2.1. HLA Sequencing Metrics

In the above cases, in order to increase the strength of the genetic evidence, further analysis of samples using HLA typing on the Illumina MiSeq system was performed. In particular, 52 individuals were sequenced in two different runs (2 × 150 bp), and a total of 2109.3 MB of data was obtained. Τhe final pooled library concentrations were 15.2 ng/μL and 12.6 ng/μL (Figure 2). The run quality control (QC) metrics revealed a median cluster density of 715 K/mm2 (first run) and 990 K/mm2 (second run), with 94.1% and 87.7% passing filters (Figure 3a,b). The median quality ≥ Q30 scores were 93.6% and 89.1%, respectively; Figure 4a). The average depth of coverage was 189× for all HLA loci (Figure 4b) and ranged from 99 (HLA-DRB1) to 248 (HLA-B). As indicated by the average minor allele percentages, the HLA-DQB1 locus had the highest allele imbalance, while HLA-A and F had the lowest (Figure 4c).

3.2.2. HLA Haplotypes

The comprehensive analysis revealed a total of 34 HLA inherited haplotypes on the HLA-A, ~B, ~C, ~DRB1/3/4/5, ~DQA1, ~DQB1, ~DPA1, ~DPB1, ~F, ~G, ~H, ~E, MICA, and MICB loci. The HLA typing results for each sample are presented in Table S3. Furthermore, a total of 274 unique alleles at either the field 3 or 4 level were identified, out of which 270 were determined to possess sequences that conformed identically to those documented within the IMGT/HLA database (Table 3). The remaining four alleles were distinguished by the presence of single-nucleotide polymorphism (SNP) variants and considered as HLA novel alleles. Their full-length DNA sequences were deposited in GenBank (accession numbers: OQ357851, OQ885042, OQ885046, and OQ885045) and the IPDIMGT/HLA Database (submission numbers: HWS10065189, HWS10066175, HWS10066149, and HWS10066153) (Table 4). The names HLA-B*14:02:01:26, -B*35:580, -B*40:02:01:41, and -C*04:01:01:175 were officially assigned by the World Health Organization (WHO) Nomenclature Committee for Factors of the HLA System in May 2023. This follows the agreed policy that subject to the conditions stated in the most recent Nomenclature Report [35], names will be assigned to new sequences as they are identified. Lists of such new names will be published in the following WHO Nomenclature Report. One nucleotide substitution was observed in exon 5 of HLA-B, while the remaining mutations were located in the non-coding regions of the HLA-B (3′UTR) and HLA-B and -C (5′UTR) genes. The single nucleotide substitution in the novel allele HLA-B*35:580 resulted in an amino acid change from alanine (A) to valine (V) (non-synonymous mutations). All novel HLA alleles that appeared in parent–child pairs followed the heredity rule. To validate the putative novel alleles observed at the HLA-B and -C loci, HLA genotyping using a commercial NGSgo®-MX6-1 kit (GenDx, Utrecht, The Netherlands) was performed and the results confirmed all identified mutation sites.
Also, in our study, 22 ambiguities at an 8-digit level were detected. Their HLA sequences matched with several allele combinations that could not be excluded based on the sequence information obtained. HLA alleles that have identical nucleotide sequences across the exons that encode the peptide binding domains but may show polymorphisms outside it belong to the same HLA G group. The most ambiguities were observed in the HLA-DPB locus (11 HLA-DPB1 genotype combinations, for a total of 26 individuals), owing to the length limitation of PCR amplification, followed by the HLA-DQB (4 genotype combinations, for a total 5 individuals), -DRB1 (4), -DRB3 (1 genotype combination, 2 individuals), -A (1), and -C (1) loci. No ambiguities were found on the HLA-B, -DRB4/5, DQA, DPA, -F, -G, -H, -E, MICA, and MICB loci (Table S3).
Our results show that in the total of 34 meioses from the 18 parentage assessments, two discrepancy events on the HLA-E loci and one on HLA-DRB3 were observed (Table S3). Specifically, for the fourth case, no data were obtained for the HLA-H genetic locus in the child in question. The HLA typing was HLA-H*02:07:01:01 homozygous for the AF and HLA-H*02:05:01:03 homozygous for the biological mother. Subsequently, for the 12th case, the HLA typing was HLA-H*02:01:01:01 homozygous for the AF, HLA-H*02:03:02 and *02:05:01:01 heterozygous for the biological mother, and HLA-H*02:03:02 homozygous for the child in question. Additionally, for the 13th case, no data were obtained for the HLA-DRB3 genetic locus in the child in question (HLA-DRB3*02:02:01 for the biological mother and AF). All discrepancies observed were linked to PCR challenges, where certain alleles had minimal amplification that fell below our analysis program’s threshold and led to a “no call” outcome. This threshold, which was an internal parameter in our analysis pipeline, distinguished heterozygous from potentially homozygous samples. The remaining 15 HLA loci were inherited following Mendelian inheritance.

3.2.3. Distribution of the LR

The HLA genotyping revealed no mismatches between the child in question and the AF, giving a PI that ranged from 13,355 to Log10PI 8.06 × 108 and a W that ranged from 0.99992513 to 0.99999999 regarding the six classical HLA-A, -B, -C, -DRB1, -DQB1, and -DPB1 genes for the seventeen disputed cases (Table 2). For one case (case 2), HLA genotyping did not become sufficient to overcome the threshold of PI ≥ 10 000 (PI equal to 6 672) owing to the high frequency of HLA alleles transmitted by the AF (HLA-inherited haplotype ~A*29:02:01:01, ~B*35:01:01, ~C*04:01:01:06, ~DRB1*11:04:01G, ~DQB1*03:01:01:02, ~DPB1*04:01:01G). The genetic markers HLA-DRB3/4/5, -DQA1, -DPA1, -F, -G, -H, -E, MICA, and MICB were not included in the statistical analysis owing to limited haplotype frequency data in the Caucasian population. The HLA-inherited haplotypes transmitted by the AFs are shown in Table S3. Combining the LR values obtained from the two systems (16 aSTR markers and 6 linked HLA loci) and assuming a prior probability of 0.5, the resulting value of Log10CPI was more than 1.69 × 107 and W was more than 0.99999994, which were sufficient for the verbal predicate “paternity practically proven” for all duo simulated cases (Figure 5).

4. Discussion

The aSTR analysis has demonstrated itself to be one of the most reliable and cost-effective molecular tools in forensic casework. However, aSTRs may undergo evident variations in the copy number through a process known as dynamic or mutable mutation, resulting in incompatibility between parents and offspring. If mutation events are overlooked, it can have a significant impact on the genetic evidence of consanguinity, potentially leading to a result supporting an incorrect conclusion [36]. If mutations are not considered, LR values might lead to false exclusion [37]. Additionally, in cases of inclusion, the mean frequency of mutation in the relevant population should be considered, as well as its range. This factor could potentially alter the data [31].
The aSTR data from the present study showed that fatherhood was not ruled out for 15 out of 16 complete triplet cases with isolated exclusions by mutations, and a sufficient paternity probability was achieved even after the mutation calculation, with an average probability of 0.99999979, Log10PI: 4.71 × 106 (min. W: 0.99997894, PI 47,478; max. W 0.99999999, Log10PI: 3.98 × 109). In one triplet case, the W for the biological father was only 0.99988199 (PI 8 473) due to the inheritance of very common aSTR alleles. In addition, when all the above triplet cases were further analyzed without investigation of the mother’s genetic profile from the probability calculation, this displayed a remarkable decrease in the simulated duo cases below 0.999 compared with that of the trio cases, and the difference became even more significant when the aSTR POAs were presented with high frequency in the population study [17]. Also, while in the two triplet cases, one discrepancy was shown between the AF and the child in question, this discrepancy was eliminated when the DNA profile of the mother was missing from the analysis, which was an event observed in other studies [38]. Our results led to the conclusion that when simulating the duo from trio families, the mother’s genetic profile can hide additional mismatches, providing enough certainty to include the putative father [39]. Additionally, we emphasize the necessity for greater caution when dealing with motherless cases, especially in cases where mutation events occur. Therefore, the present study evaluated the effect of the availability of both parents in cases with observed mutation events. The 16 forensic loci were sufficient to provide positive proof (strong evidence) of paternity, offering high discriminating power in only 15 out of 16 triplet cases, and more genetic information and accurate statistical analyses for achieving confident results are required. According to García-Aceves et al., the implementation of ≥ 20 aSTRs as the routine battery of markers for paternity testing labs allows for obtaining sound conclusions to solve the large majority of motherless cases [33]. In addition, as most DNA-typing applications have legal and ethical implications and there is a particular need for high reliability and high discrimination power in cases where fatherhood cannot be confirmed or ruled out with statistical certainty, the focus should be shifted to the application of alternative and often more discriminative and polymorphic markers [40,41,42].
HLA genotyping using MPS should be considered in cases where the complexity of the involved subjects requires a deeper analysis. Nonetheless, even if the HLA system is one of the most extensively studied regions, this level of polymorphism remains a challenge when it comes to type HLA genes. To date, with existing short-read technologies, HLA genotyping ambiguities generated due either to failure to interrogate all polymorphic positions or when two or more different allele combinations produce identical sequences (cis/trans ambiguities) remain an issue. As it is well known, the HLA-DPB1 is the most susceptible to generating an ambiguous locus. In our study, almost all results, except HLA-DPB1, were obtained without any ambiguities at the third field level. Phasing of the large intron 2 of the -DPB1 locus is of significant importance and could reduce the rate of ambiguities reported, leading to a more accurate description of HLA diversity [43]. Also, the high accuracy percentage obtained using MPS indicates adequate coverage that allows for correct HLA variant calls. In this study, acceptable quality results were observed in all HLA loci without allele dropout events, except on HLA-E and HLA-DRB3 loci, where three discrepancy events due to low DNA concentration were observed. This is a known potential limitation of multiplex primers. However, in the case of failure or doubt in only one locus of a given sample, it is recommended to include the sample in another test battery or to perform additional testing [44].
HLA genotyping using MPS outside the core region (exon 2, 3, and 4 for HLA class I; exon 2 and 3 for HLA class II genes) gives more precise sequencing results and allows for the identification of rare and novel HLA alleles using the high-quality typing compared with conventional techniques [45,46,47]. As illustrated in the results section, four novel HLA alleles were identified for classical HLA class genes, leading to a more accurate HLA haplotype definition. It should be noticed that the median average Log10LRs of HLA genes were much higher than that of the aSTR loci in duo simulated cases, even when the rare alleles on the third field or novel alleles were excluded from the analysis, which led to reliable results similar to those achieved in complete triplet analysis using aSTRs.
In this report, the analysis primarily focused on classical HLA genes (HLA-A, -B, -C, -DRB1/3/4/5, -DQA1, -DQB1, -DPA1, -DPB1) and secondarily on non-classical HLA genes (HLA-F, -G, -H, -E), MICA, and MICB. Information about non-classical HLA genes allows for a more accurate description of HLA haplotype diversity but relatively few reference data exist on polymorphisms at the high-resolution level (third or fourth field) [48]. To accurately determine the relationship between individuals, it is crucial to have access to high-resolution HLA frequency data for different populations. While the number of newly discovered HLA alleles has increased significantly in recent years, the IMGT/HLA reference database does not have complete genomic sequences for all HLA class II alleles, which leads to the generation of ambiguous results. Ehrenberg et al. primarily attributed these challenges to factors such as inadequate reference data, the emergence of new SNPs in less-studied populations, and distinguishing SNPs within the untranslated region (UTR) [49]. In the present study, the LR calculations were based upon HLA allele frequencies, as estimated from the Caucasian population from the Allele Frequency Net Database, as our population has not been adequately standardized at the third field level at present. Our results demonstrate that even when no frequency data were available in the third field in most of the cases, the median average Log10LRs (min. W 0.99985014, PI 6672; max. W 0.99999999, Log10PI 8.06 × 108) of the HLA genes managed to reach definitive conclusions and improve the power of discrimination for kinship determination in comparison with the aSTR analysis in seventeen out of the eighteen parentage cases. Finally, combining the LR values obtained from the two systems (aSTR markers and six linked HLA loci), the arrived value of Log10PI was 1.69 × 107, which was sufficient for the verbal predicate “paternity practically proven” for the eighteenth simulated duo case.
The rapid advancement of MPS technology has significantly enhanced its clinical application in precision medicine, revolutionizing the research of genetic factors in structural disorders [50]. Especially, in immunogenetics, MPS has emerged as a powerful and versatile tool, offering efficient and cost-effective DNA/RNA sequencing while surpassing the limitations of traditional methods [21,51]. In forensic applications, HLA typing is highly recommended by parentage profiling laboratories for cases involving individuals without a known mother or father, especially when an aSTR mutation event is recorded. The utilization of MPS for HLA genotyping can augment the LR in identifying true parental relationships within a pedigree, while concurrently minimizing the risk of erroneous attributions. Furthermore, the unique genomic characteristics of HLA genes located in proximity on the same chromosome and inherited as haplotypes in a Mendelian manner render them exceptionally conducive to parentage testing. The intricate patterns of inheritance associated with HLA genes make them particularly pertinent for discerning relationships between individuals originating from the same lineage or for undertaking indirect kinship analyses [42,52].
To fully harness the potential of the HLA system in disputed paternity/maternity assessments, certain considerations need to be addressed. These include evaluating the relevance of the provided information, the ability to analyze the complete HLA gene across all genetic loci, and the availability of high-resolution HLA frequency data across diverse populations. While acknowledging the constraints imposed by current limitations in population haplotypic databases, it is essential to underscore the substantial contributions made by MPS in enhancing the specificity and discriminatory power of HLA gene analysis, thereby significantly fortifying the robustness of paternity testing. The selection of MPS technology for HLA gene analysis is underpinned by its capacity to provide a more exhaustive and intricate characterization of genetic variations. By adequately addressing these issues, we can facilitate a comprehensive reintegration of the HLA system, thereby reinforcing its application in disputed paternity/maternity assessments.
However, the incorporation of HLA alleles into the field of forensic identification should not be perceived as a substitution but rather as a supplementary tool to complement the well-established and extensively utilized markers already in place [53]. The integration of MPS for the analysis of HLA genes in paternity testing offers a complementary approach alongside conventional forensic kits, which primarily focus on STRs and SNPs. While traditional kits have demonstrated efficacy in forensic applications, the integration of MPS provides an additional layer of depth to the genetic analysis. MPS allows for a more comprehensive sequencing of HLA genes, enabling the identification of a broader spectrum of genetic variations, including indels, copy number variations, and complex structural variations. This simultaneous use of both methodologies harnesses the strengths of each approach. Forensic kits, with their known sequences of STR alleles, enhance the statistical values of results. On the other hand, MPS contributes to a finer understanding of genetic relationships, offering increased resolution and discrimination. The parallel application of these methodologies ensures a more robust and versatile paternity-testing framework that combines the strengths of traditional forensic approaches with the detailed insights provided by MPS technology.

5. Conclusions

In conclusion, despite their limitations, aSTRs remain the most reliable and widespread markers used in forensic identification. However, in instances where conventional methods prove insufficient in yielding satisfactory results, there arises a need for alternative or supplementary markers to facilitate the analysis. The inclusion of HLA genotyping using MPS technology presents a promising avenue to overcome these limitations. By harnessing the capabilities of MPS, HLA polymorphism analysis holds the potential to enhance the evaluation of relatedness in cases involving genetic deficiencies. These insights highlight the necessity for further exploration and optimization of HLA genotyping methods, ultimately leading to improved accuracy and reliability in paternity determination and relatedness assessment. Concurrently, the reintroduction of HLA alleles into the sphere of forensic identification must be perceived as an auxiliary tool to complement preexisting and widely accepted markers. This symbiotic union of HLA genes and established forensic markers holds the promise of engendering an exceptionally potent methodology for future applications within the domain of forensic analysis.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15020150/s1, Table S1: The formulas of likelihood ratio (LR) calculation for trio and motherless cases; Table S2: aSTR typing results; Table S3: Classical and non-classical HLA typing results.

Author Contributions

Methodology, D.I.K.; software, D.I.K. and K.V.F.; validation, D.I.K. and A.T.; formal analysis, D.I.K. and K.V.F.; data curation, D.I.K.; writing—original draft preparation, D.I.K.; writing—review and editing, D.I.K., K.V.F. and A.T.; visualization, D.I.K., K.T. and A.T.; supervision, A.T. All authors have read and agreed to the published version of the manuscript.

Funding

This study received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board (Ethics Committee) of Evangelismos Hospital (protocol code 78 and date of approval 10 March 2023).

Informed Consent Statement

All participants provided their written informed consent for the genetic studies prior to the sample collection.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions. Data on HLA-B*14:02:01:26, -B*35:580, -B*40:02:01:41, and -C*04:01:01:175 novel alleles are freely available in the GenBank Database (https://www.ncbi.nlm.nih.gov/genbank/) (accession numbers: OQ357851, OQ885042, OQ885046, and OQ885045, respectively) and the IPD-IMGT/HLA Database (https://www.ebi.ac.uk/ipd/imgt/hla/) (submission numbers: HWS10065189, HWS10066175, HWS10066149, and HWS10066153, respectively).

Acknowledgments

We are indebted to all individuals whose samples were used in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Medhasi, S.; Chantratita, N. Human Leukocyte Antigen (HLA) System: Genetics and Association with Bacterial and Viral Infections. J. Immunol. Res. 2022, 2022, 9710376. [Google Scholar] [CrossRef] [PubMed]
  2. Marsh, S.G.E. Nomenclature for factors of the HLA system, update April, May and June 2023. HLA 2023, 102, 401–430. [Google Scholar] [CrossRef] [PubMed]
  3. Tumer, G.; Simpson, B.; Roberts, T.K. Genetics, Human Major Histocompatibility Complex (MHC). In StatPearls; Stat Pearls Publishing: Treasure Island, FL, USA, 2023. [Google Scholar]
  4. Silver, H. Paternity testing. Crit. Rev. Clin. Lab. Sci. 1989, 27, 391–408. [Google Scholar] [CrossRef]
  5. Carrington, M. Recombination within the human MHC. Immunol. Rev. 1999, 167, 245–256. [Google Scholar] [CrossRef] [PubMed]
  6. Kereszturya, L.; Rajczya, K.; Lászikb, A.; Gyódia, E.; Pénzes, M.; Falus, A.; Petrányia, G.G. Combination of DNA-based and conventional methods to detect human leukocyte antigen polymorphism and its use for paternity testing. Am. J. Forensic Med. Pathol. 2002, 23, 57–62. [Google Scholar] [CrossRef]
  7. Toscanini, U.; Berardi, G.; Rodríguez-Diez, M.B.; Padros, K.; Raimondi, E. Molecular HLA typing is an invaluable tool in some complex paternity cases. Forensic Sci. Int. Genet. Suppl. Ser. 2008, 1, 531–532. [Google Scholar] [CrossRef]
  8. Grubic, Z.; Stingl, K.; Martinez, N.; Palfi, B.; Brkljacic-Kerhin, V.; Kastelan, A. STR and HLA analysis in paternity testing. Int. Congr. Ser. 2004, 1261, 535–537. [Google Scholar] [CrossRef]
  9. Ota, M.; Shimada, K.; Asamura, H.; Takayanag, K.; Katsuyama, Y.; Fukushima, H. Validation of sensitive human leukocyte antigen-sequence-specific primer and probe typing in forensic DNA examination. Leg. Med. 2006, 8, 203–209. [Google Scholar] [CrossRef]
  10. Dean, L.; Kwon, Y.J.; Philpott, M.K.; Stanciu, C.E.; Seashols-Williams, S.J.; Dawson Cruz, T.; Sturgill, J.; Ehrhardt, C.J. Separation of uncompromised whole blood mixtures for single source STR profiling using fluorescently-labeled human leukocyte antigen (HLA) probes and fluorescence activated cell sorting (FACS). Forensic Sci. Int. Genet. 2015, 17, 8–16. [Google Scholar] [CrossRef]
  11. Pokhriyal, B.; Thorat, K.; Limaye, D.A.; Joshi, Y.M.; Kadam, V.J.; Dubey, R. Microsatellite markers—A novel tool in molecular genetics. Int. J. Res. Pharm. Chem. 2012, 2, 397–412. [Google Scholar]
  12. Fan, H.; Chu, J.Y. A brief review of short tandem repeat mutation. Genom. Proteom. Bioinform. 2007, 5, 7–14. [Google Scholar] [CrossRef] [PubMed]
  13. Zhou, S.; Wang, H.; Wang, Q.K.; Wang, P.; Wang, F.; Xu, C. Loss of heterozygosity detected at three short tandem repeat locus commonly used for human DNA identification in a case of paternity testing. Leg. Med. 2017, 24, 7–11. [Google Scholar] [CrossRef]
  14. Dauber, E.M.; Kratzer, A.; Neuhuber, F.; Parson, W.; Klintschar, M.; Bär, W.; Mayr, W.R. Germline mutations of STR-alleles include multi-step mutations as defined by sequencing of repeat and flanking regions. Forensic Sci. Int. Genet. 2012, 6, 381–386. [Google Scholar] [CrossRef] [PubMed]
  15. Bhargava, A.; Fuentes, F.F. Mutational dynamics of microsatellites. Mol. Biotechnol. 2010, 44, 250–266. [Google Scholar] [CrossRef] [PubMed]
  16. Brinkmann, B.; Klintschar, M.; Neuhuber, F.; Hühne, J.; Rolf, B. Mutation rate in human microsatellites: Influence of the structure and length of the tandem repeat. Am. J. Hum. Genet. 1998, 62, 1408–1415. [Google Scholar] [CrossRef] [PubMed]
  17. Kouniaki, D.I.; Papasteriades, C.A.; Tsirogianni, A. Short Tandem Repeats Loci in Parentage Testing. Hosp. Chron. 2015, 10, 83–90. [Google Scholar]
  18. Chaitanya, L.; van Oven, M.; Brauer, S.; Zimmermann, B.; Huber, G.; Xavier, C.; Parson, W.; de Knijff, P.; Kayser, M. High-quality mtDNA control region sequences from 680 individuals sampled across the Netherlands to establish a national forensic mtDNA reference database. Forensic Sci. Int. Genet. 2016, 21, 158–167. [Google Scholar] [CrossRef]
  19. Zhang, Q.; Zhou, Z.; Wang, L.; Quan, C.; Liu, Q.; Tang, Z.; Liu, L.; Liu, Y.; Wang, S. Pairwise kinship testing with a combination of STR and SNP loci. Forensic Sci. Int. Genet. 2020, 46, 102265. [Google Scholar] [CrossRef]
  20. Kaitholia, K.; Dash, H.R.; Shrivastava, P.; Kumawat, R.K.; Dixit, S.; Chaubey, G. Forensic characterization and genetic evaluation in the Central Indian population using 27 Y-STRs. Int. J. Leg. Med. 2021, 135, 791–792. [Google Scholar] [CrossRef]
  21. Robinson, J.; Barker, D.J.; Georgiou, X.; Cooper, M.A.; Flicek, P.; Marsh, S.G.E. IPD-IMGT/HLA Database. Nucleic Acids Res. 2020, 48, D948–D955. [Google Scholar] [CrossRef]
  22. Creary, L.E.; Guerra, S.G.; Chong, W.; Brown, C.J.; Turner, T.R.; Robinson, J.; Bultitude, W.P.; Mayor, N.P.; Marsh, S.G.E.; Saito, K.; et al. Next-generation HLA typing of 382 International Histocompatibility Working Group reference B-lymphoblastoid cell lines: Report from the 17th International HLA and Immunogenetics Workshop. Hum. Immunol. 2019, 80, 449–460. [Google Scholar] [CrossRef] [PubMed]
  23. Tran, J.N.; Sherwood, K.R.; Mostafa, A.; Benedicto, R.V.; ElaAlim, A.; Greenshields, A.; Keown, P.; Liwski, R.; Lan, J.H. Novel alleles in the era of next-generation sequencing-based HLA typing calls for standardization and policy. Front. Genet. 2023, 14, 1282834. [Google Scholar] [CrossRef] [PubMed]
  24. Zhong, Y.; Xu, F.; Wu, J.; Schubert, J.; Li, M.M. Application of Next Generation Sequencing in Laboratory Medicine. Ann. Lab. Med. 2021, 41, 25–43. [Google Scholar] [CrossRef] [PubMed]
  25. Gjertson, D.W.; Brenner, C.H.; Baur, M.P.; Carracedo, A.; Guidet, F.; Luque, J.A.; Lessig, R.; Mayr, W.R.; Pascali, V.L.; Prinz, M.; et al. ISFG: Recommendations on biostatistics in paternity testing. Forensic Sci. Int. Genet. 2007, 1, 223–231. [Google Scholar] [CrossRef] [PubMed]
  26. Scarpetta, M.A.; Staub, R.W.; Einum, D.D. Assessing exclusionary power of a paternity test involving a pair of alleged grandparents. Transfusion 2007, 47, 335–340. [Google Scholar] [CrossRef] [PubMed]
  27. Giacalone, M.; Giannuzzi, M.R.; Panarello, D. DNA test to assess criminal responsibility: A Bayesian approach. Qual. Quant. 2018, 52, 2837–2853. [Google Scholar] [CrossRef]
  28. Steffen, C.R.; Coble, M.D.; Gettings, K.B.; Vallone, P.M. Corrigendum to ‘U.S. Population Data for 29 Autosomal STR Loci’. Forensic Sci. Int. Genet. 2013, 7, e82–e83, Erratum in Forensic Sci. Int. Genet. 2017, 31, e36–e40. [Google Scholar] [CrossRef]
  29. References AABB-American Association of Blood Banks. Annual Report Summary for Testing in 2008: Relationship Testing Program Unit. pp. 1–64. Available online: https://www.aabb.org/docs/default-source/default-document-library/accreditation/rtannrpt08.pdf?sfvrsn=982c3264_0 (accessed on 1 January 2010).
  30. Vicard, P.; Dawid, A.P.; Mortera, J.; Lauritzen, S.L. Estimating mutation rates from paternity casework. Forensic Sci. Int. Genet. 2008, 2, 9–18. [Google Scholar] [CrossRef]
  31. Ge, J.; Eisenberg, A.; Budowle, B. Developing criteria and data to determine best options for expanding the core CODIS loci. Investig. Genet. 2012, 3, 1. [Google Scholar] [CrossRef]
  32. Egeland, T.; Mostad, P.F. Statistical Genetics and Genetical Statistics: A Forensic Perspective. Scand. J. Stat. 2002, 29, 297–307. [Google Scholar] [CrossRef]
  33. García-Aceves, M.E.; Romero Rentería, O.; Díaz-Navarro, X.X.; Rangel-Villalobos, H. Paternity tests in Mexico: Results obtained in 3005 cases. J. Forensic Leg. Med. 2018, 55, 1–7. [Google Scholar] [CrossRef] [PubMed]
  34. Jin, B.; Su, Q.; Luo, H.; Li, Y.; Wu, J.; Yan, J.; Hou, Y.; Liang, W.; Zhang, L. Mutational analysis of 33 autosomal short tandem repeat (STR) loci in southwest Chinese Han population based on trio parentage testing. Forensic Sci. Int. Genet. 2016, 23, 86–90. [Google Scholar] [CrossRef] [PubMed]
  35. Marsh, S.G.E.; Albert, E.D.; Bodmer, W.F.; Bontrop, R.E.; Dupont, B.; Erlich, H.A.; Fernández-Viña, M.; Geraghty, D.E.; Holdsworth, R.; Hurley, C.K.; et al. Nomenclature for Factors of the HLA System, 2010. Tissue Antigens 2010, 75, 291–455. [Google Scholar] [CrossRef] [PubMed]
  36. González-Andrade, F.; Sánchez, D.; Penacino, G.; Martínez Jarreta, B. Two fathers for the same child: A deficient paternity case of false inclusion with autosomic STRs. Forensic Sci. Int. Genet. 2009, 3, 138–140. [Google Scholar] [CrossRef] [PubMed]
  37. Narkuti, V.; Vellanki, R.N.; Anubrolu, N.; Doddapaneni, K.K.; Gandhi Kaza, P.C.; Mangamoori, L.N. Single and double incompatibility at vWA and D8S1179/D21S11 loci between mother and child: Implications in kinship analysis. Clin. Chim. Acta 2008, 395, 162–165. [Google Scholar] [CrossRef]
  38. De Kock, A.; Kloppers, J.J. The impact of motherless paternity testing in a South African population. Afr. Health Sci. 2021, 21, 379–384. [Google Scholar] [CrossRef]
  39. Aguiar, V.R.C.; de Castro, A.M.; Pinto, L.M.; Ferreira, A.C.S.; Dos Santos, E.V.W.; Louro, I.D. Assessing false paternity risk in simulated motherless cases from more than 20 000 real exclusion trios. Transfusion 2021, 61, 678–681. [Google Scholar] [CrossRef]
  40. Li, R.; Li, H.; Peng, D.; Hao, B.; Wang, Z.; Huang, E.; Wu, R.; Sun, H. Improved pairwise kinship analysis using massively parallel sequencing. Forensic Sci. Int. Genet. 2019, 38, 77–85. [Google Scholar] [CrossRef]
  41. Butler, J. Recent advances in forensic biology and forensic DNA typing: INTERPOL review 2019–2022. Forensic Sci. Int. Synerg. 2022, 27, 100311. [Google Scholar] [CrossRef]
  42. Wu, R.; Li, H.; Peng, D.; Li, R.; Zhang, Y.; Hao, B.; Huang, E.; Zheng, C.; Sun, H. Revisiting the potential power of human leukocyte antigen (HLA) genes on relationship testing by massively parallel sequencing-based HLA typing in an extended family. J. Hum. Genet. 2019, 64, 29–38. [Google Scholar] [CrossRef]
  43. Duke, J.L.; Mosbruger, T.L.; Ferriola, D.; Chitnis, N.; Hu, T.; Tairis, N.; Margolis, D.J.; Monos, D.S. Resolving MiSeq-Generated Ambiguities in HLA-DPB1 Typing by Using the Oxford Nanopore Technology. J. Mol. Diagn. 2019, 21, 852–861. [Google Scholar] [CrossRef] [PubMed]
  44. Osoegawa, K.; Vayntrub, T.A.; Wenda, S.; De Santis, D.; Barsakis, K.; Ivanova, M.; Hsu, S.; Barone, J.; Holdsworth, R.; Diviney, M.; et al. Quality control project of NGS HLA genotyping for the 17th International HLA and Immunogenetics Workshop. Hum. Immunol. 2019, 80, 228–236. [Google Scholar] [CrossRef]
  45. Ingram, K.J.; O’Shields, E.F.; Kiger, D.F.; Gautreaux, M.D. NGS and HLA: The long road ahead. Hum. Immunol. 2020, 81, 280–284. [Google Scholar] [CrossRef] [PubMed]
  46. Kouniaki, D.; Tarassi, K.; Kitsiou, V.; Fotopoulos, K.; Tsirogianni, A. Identification of the novel HLA-DPB1*02:01:68 allele in a Greek individual. HLA 2023, 102, 550–552. [Google Scholar] [CrossRef] [PubMed]
  47. Kouniaki, D.; Athanassiades, T.; Fotopoulos, K.; Tsirogianni, A. Genomic sequence of the HLA-A*24:587 allele identified in a Greek individual. HLA 2023, 102, 513–515. [Google Scholar] [CrossRef] [PubMed]
  48. Paech, C.; Albrecht, V.; Putke, K.; Schöfl, G.; Schöne, B.; Schmidt, A.H.; Lange, V.; Klussmeier, A. HLA-E diversity unfolded: Identification and characterization of 170 novel HLA-E alleles. HLA 2021, 97, 389–398. [Google Scholar] [CrossRef] [PubMed]
  49. Ehrenberg, P.K.; Geretz, A.; Sindhu, R.K.; Vayntrub, T.; Fernández Viña, M.A.; Apps, R.; Michael, N.L.; Thomas, R. High-throughput next-generation sequencing to genotype six classical HLA loci from 96 donors in a single MiSeq run. HLA 2017, 90, 284–291. [Google Scholar] [CrossRef]
  50. Hu, T.; Chitnis, N.; Monos, D.; Dinh, A. Next-generation sequencing technologies: An overview. Hum. Immunol. 2021, 82, 801–811. [Google Scholar] [CrossRef]
  51. Bravo-Egana, V.; Monos, D. The impact of next-generation sequencing in immunogenetics: Current status and future directions. Curr. Opin. Organ Transpl. 2017, 22, 400–406. [Google Scholar] [CrossRef]
  52. Kouniaki, D.; Tsirogianni, A. Classical and Non-Classical HLA Alleles as Supplementary Markers in Indirect Kinship Parentage Testing. Immuno 2022, 2, 501–505. [Google Scholar] [CrossRef]
  53. Kuffel, A.; Gray, A.; Nic Daeid, N. Human Leukocyte Antigen alleles as an aid to STR in complex forensic DNA samples. Sci. Justice 2020, 60, 1–8. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Outline of targeted PCR regions in seventeen HLA loci. Yellow boxes indicate amplified exons. Gray boxes indicate non-amplified exons. Green boxes indicate upstream UTR or 5′ untranslated region and downstream UTR or 3′ untranslated region. The region between exons involves introns.
Figure 1. Outline of targeted PCR regions in seventeen HLA loci. Yellow boxes indicate amplified exons. Gray boxes indicate non-amplified exons. Green boxes indicate upstream UTR or 5′ untranslated region and downstream UTR or 3′ untranslated region. The region between exons involves introns.
Genes 15 00150 g001
Figure 2. The analysis pertained to the distribution of fragments (light blue line) that underwent shearing, as observed in the entirety of the library being studied. The uppermost section of the graph depicts the fragment size evaluated (bp). In addition to the peaks, the detected baseline (in red) and the threshold for peak detection above the baseline (in blue) are displayed in the electropherogram.
Figure 2. The analysis pertained to the distribution of fragments (light blue line) that underwent shearing, as observed in the entirety of the library being studied. The uppermost section of the graph depicts the fragment size evaluated (bp). In addition to the peaks, the detected baseline (in red) and the threshold for peak detection above the baseline (in blue) are displayed in the electropherogram.
Genes 15 00150 g002
Figure 3. Q-score Distribution Analysis: Capturing Q-score distribution-graphical representations unveil the quantity of reads categorized by their respective quality scores. Q-scores below 30 are depicted in blue, and those between 30 and 40 are represented in green. Quality Metrics for HLA Sequencing: (a) Cluster density (K/mm2) = 715 ± 6, and clusters passing filter (%) = 94.11 ± 2.08. (b) Cluster density (K/mm2) = 990 ± 26, and clusters passing filter (%) = 87.73 ± 4.13. The median quality ≥ Q30 scores were 93.6% and 89.1%, respectively.
Figure 3. Q-score Distribution Analysis: Capturing Q-score distribution-graphical representations unveil the quantity of reads categorized by their respective quality scores. Q-scores below 30 are depicted in blue, and those between 30 and 40 are represented in green. Quality Metrics for HLA Sequencing: (a) Cluster density (K/mm2) = 715 ± 6, and clusters passing filter (%) = 94.11 ± 2.08. (b) Cluster density (K/mm2) = 990 ± 26, and clusters passing filter (%) = 87.73 ± 4.13. The median quality ≥ Q30 scores were 93.6% and 89.1%, respectively.
Genes 15 00150 g003
Figure 4. Distribution of Q30, usable reads, and percent of minor allele among the 17 HLA loci: (a) The percentage of base calls with a quality score above Q30 for 17 HLA genes, as observed using boxplots. Each box indicates the median and the first and third quartiles of the data, except for -DRB4, where similar values were observed. (b) The distribution of usable reads among the 17 HLA loci sequenced. Whiskers correspond to the interquartile range, and outliers are plotted as dots. (c) The distribution of allele balance for all heterozygous 14 HLA loci sequenced. Notably, the genes HLA-DRB3/4/5 were excluded from this analysis due to the limited number of heterozygotes available for these genes. Outliers are also identified and plotted as dots.
Figure 4. Distribution of Q30, usable reads, and percent of minor allele among the 17 HLA loci: (a) The percentage of base calls with a quality score above Q30 for 17 HLA genes, as observed using boxplots. Each box indicates the median and the first and third quartiles of the data, except for -DRB4, where similar values were observed. (b) The distribution of usable reads among the 17 HLA loci sequenced. Whiskers correspond to the interquartile range, and outliers are plotted as dots. (c) The distribution of allele balance for all heterozygous 14 HLA loci sequenced. Notably, the genes HLA-DRB3/4/5 were excluded from this analysis due to the limited number of heterozygotes available for these genes. Outliers are also identified and plotted as dots.
Genes 15 00150 g004
Figure 5. In this study, a PI of more than 104 was considered proof of parent–offspring relationship. The median average Log10LRs of HLA genes were much higher than that of the aSTR loci in duo simulated cases, even when the HLA rare alleles on 3rd field or HLA novel alleles were not included in the analysis. Additionally, in all cases, the RMNE value illustrates the fact the genetic data supported the hypothesis of paternity.
Figure 5. In this study, a PI of more than 104 was considered proof of parent–offspring relationship. The median average Log10LRs of HLA genes were much higher than that of the aSTR loci in duo simulated cases, even when the HLA rare alleles on 3rd field or HLA novel alleles were not included in the analysis. Additionally, in all cases, the RMNE value illustrates the fact the genetic data supported the hypothesis of paternity.
Genes 15 00150 g005
Table 1. Eighteen aSTR mutations were observed in equal different parent/child allele transfers. Paternity not excluded.
Table 1. Eighteen aSTR mutations were observed in equal different parent/child allele transfers. Paternity not excluded.
aSTRs LociAlleged Father
Genotype
Mother
Genotype
Child
Genotype
POAPutative MutationGain/LossSex/Age
(Years)
Single/Double
Step
Case 01SE3318, 2020, 3419, 3418 or 20    18→19 οr
20→19
Gain or lossMale/45Single
Case 02SE3315, 3412, 1616, 161515→16GainMale/29Single
Case 03SE3317, 25.219, 28.219, 24.225.225.2→24.2LossMale/39Single
Case 04SE3328.2, 33.222.2, 28.228.2, 32.233.233.2→32.2LossMale/41Single
Case 05SE3320, 2016, 32.219, 32.22020→19LossMale/39Single
Case 06vWA17, 1916, 1818, 201919→20GainMale/41Single
Case 07vWA16, 1916, 1717, 181919→18LossMale/36Single
Case 08vWA17, 20NA18, 19UncertainUncertain Gain or loss Male/33Single
Case 09D12S39116, 2122, 2220, 222121→20LossMale/31Single
Case 10D12S39121, 2420, 2020, 202121→20LossMale/35Single
Case 11D12S39122, 2419.3, 2219.3, 2322 or 24     22→23 οr
24→23
Gain or lossMale/28Single
Case 12D8S117913, 1312, 1212, 141313→14GainMale/49Single
Case 13D8S117913, 158, 138, 161515→16GainMale/19Single
Case 14D10S124814, 17NA15, 16Uncertain Uncertain Gain or loss Male/45Single
Case 15D3S135815, 1818, 1818, 191818→19GainMale/41Single
Case 16FGA19, 2320, 2322, 232323→22LossMale/35Single
Case 17D2S133816, 2424, 2523, 252424→23LossMale/40Single
Case 18D19S43315, 1513, 14.213, 131515→13LossMale/27Double
NA, not available; POA, paternal obligate allele.
Table 2. Random man not excluded (RMNE), power of exclusion (PE), combined paternity index (PI), and probability of paternity (W) for autosomal STRs (aSTRs) and human leukocyte antigen (HLA) alleles.
Table 2. Random man not excluded (RMNE), power of exclusion (PE), combined paternity index (PI), and probability of paternity (W) for autosomal STRs (aSTRs) and human leukocyte antigen (HLA) alleles.
aSTR TrioaSTR Duo *HLAaSTR Duo and HLA
RMNEPEPIWPIWPIWPIW
Case 011.39 × 10−90.999999996.66 × 1050.9999984911,7320.9999147713,3550.999925131.57 × 1080.99999999
Case 027.82 × 10−100.999999991.22 × 1050.9999917925400.9996064566720.999850141.69 × 1070.99999994
Case 034.90 × 10−90.999999995.91 × 1050.999998302600.996168585.04 × 1060.999999801.31 × 1090.99999999
Case 045.54 × 10−100.999999993.78 × 1060.9999997313,9250.9999281916,1930.999938252.25 × 1080.99999999
Case 052.85 × 10−90.999999991.72 × 1050.9999941837690.999734759.63 × 1060.999999893.63 × 10100.99999999
Case 068.17 × 10−100.999999992.43 × 1070.9999999510,7980.9999074081,8670.999987788.83 × 1080.99999999
Case 074.81 × 10−130.999999993.98 × 1090.999999999.25 × 1070.999999993.48 × 1060.999999713.21 × 10140.99999999
Case 089.68 × 10−100.99999999NANA6970.9985673430,7690.999967502.14 × 1070.99999995
Case 092.14 × 10−100.999999991.29 × 1050.9999922424020.999583854.07 × 1050.999997549.77 × 1080.99999999
Case 101.82 × 10−140.999999994.79 × 1070.999999988.35 × 1050.999998808.72 × 1050.999998857.28 × 10110.99999999
Case 111.29 × 10−70.9999997884730.999881992100.995260661.03 × 1060.999999032.16 × 1080.99999999
Case 123.64 × 10−110.999999995.44 × 1060.9999998236310.999724672.53 × 1050.999996059.19 × 1080.99999999
Case 131.57 × 10−90.999999994.88 × 1050.9999979528640.999650968.06 × 1080.999999992.31 × 10120.99999999
Case 147.38 × 10−100.99999999NANA70350.999857876.34 × 1080.999999994.46 × 10120.99999999
Case 152.44 × 10−90.9999999951,4930.999980581.64 × 1050.999993911.72 × 1060.999999412.83 × 10110.99999999
Case 161.23 × 10−80.9999999885,4360.999988295.25 × 1060.999999812.19 × 1060.999999541.15 × 10130.99999999
Case 174.01 × 10−110.9999999947,4780.9999789411,0350.9999093921,8510.999954242.41 × 1080.99999999
Case 181.45 × 10−100.999999996.02 × 1070.9999999840,8860.999975541.23 × 1070.999999995.04 × 10120.99999999
* Duo simulated cases from the previous trio family.
Table 3. HLA alleles identified in this study.
Table 3. HLA alleles identified in this study.
HLA LocusABCEFGH
Alleles01:01:01:01
02:01:01
02:01:01:01
02:01:01:05
02:02:01:01
02:11:01:01
03:01:01:01
03:02:01:01
11:01:01
11:01:01:01
23:01:01:01
24:02:01:01
24:02:01:04
26:01:01:01
29:02:01:01
30:01:01:01
30:04:01:01
31:01:02:01
32:01:01:01
33:01:01:01
68:01:01:02
68:01:02:02
69:01:01:01
07:02:01
07:05:01
13:02:01:01
14:02:01:26 *
15:18:01:02
18:01:01
18:05:01:02
27:05:02:01
35:01:01
35:03:01
35:03:01:01
35:03:01:03
35:08:01
35:08:01:01
35:580 *
37:01:01
38:01:01
39:01:01:05
40:01:02
40:02:01:41 *
40:06:01:13
41:01:01:01
41:02:01:01
44:02:01
44:02:01:01
44:05:01:01
49:01:01:01
49:01:01:04
51:01:01
51:01:01:01
51:01:01:10
51:01:01:11
51:01:01:33
51:07:01
51:08:01:01
52:01:01:01
52:01:01:02
52:01:01:18
55:01:01:01
56:01:01
58:01:01:03
01:02:01
01:02:01:01
02:02:02:01
03:02:02
03:02:02:05
03:03:01:01
03:04:01:01
04:01:01:06
04:01:01:14
04:01:01:175 *
04:01:01:75
04:01:01:79
05:01:01
05:01:01:02
06:02:01:01
07:01:01
07:01:01:16
07:02:01
07:02:01:01
07:02:01:03
07:04:01:03
08:02:01:01
12:02:02
12:02:02:01
12:03:01:01
14:02:01:01
15:02:01
15:02:01:01
15:05:02
15:09:01:01
16:02:01:01
16:04:01:01
17:01:01:05
01:01:01
01:01:01:06
01:01:01:10
01:03:01:01
01:03:02
01:03:02:01
01:03:03
01:03:05:01
01:06:01:01
01:01:01
01:01:01:08
01:01:01:09
01:01:01:18
01:01:02
01:01:02:08
01:01:02:09
01:01:02:10
01:03:01
01:03:01:03
01:04:01:02
01:01:01
01:01:01:01
01:01:01:04
01:01:01:05
01:01:02
01:01:02:01
01:01:03:03
01:01:08
01:01:12
01:01:13
01:01:14
01:01:22:01
01:03:01
01:03:01:02
01:04:01:01
01:04:04
01:05N
01:06:01:01
01:01:01
01:01:01:01
01:01:02:01
01:03:01:01
02:01:01:01
02:02:01:01
02:03:01
02:03:02
02:04:01
02:05:01:01
02:05:01:03
02:07
02:07:01:01
02:09
02:12
02:23
HLA LocusDPA1DPB1DQA1DQB1DRB1DRB3DRB4DRB5MICAMICB
Alleles01:03:01
01:03:01:60
02:01:01
02:01:01:02
02:01:08
02:07:01
01:01:01G
02:01:02
02:01:02G
03:01:01G
04:01:01
04:01:01G
04:02:01
04:02:01G
06:01:01G
10:01:01G
104:01:01
13:01:01
13:01:01G
14:01:01G
23:01:01
01:01:01
01:01:02:01
01:02:01
01:02:02
01:03:01
01:03:01:01
01:04:01
01:04:02
01:05:01
02:01:01
02:02:02
03:01:01
03:03:01
05:01:01
05:01:01:03
05:05:01
06:01:01
06:01:01:03
02:01:01
02:02:01
02:02:01:02
03:01:01
03:01:01:02
03:01:01G
03:02:01
03:03:02
03:292
05:01:01
05:01:01G
05:02:01
05:03:01
05:03:01G
06:01:01
06:02:01
06:03:01
06:03:01G
06:04:01
06:04:01G
01:01:01
01:02:01:03
03:01:01
04:01:01
04:02:01
04:05:01
04:07:01
07:01:01
07:07:01:01
08:01:01
08:03:02
08:10
10:01:01
11:01:01
11:03:01
11:04:01
11:04:01G
12:01:01
13:01:01
13:01:01:07
13:02:01
14:01:01G
14:04:01
14:54:01
15:01:01
15:02:01
15:02:02
16:01:01
16:02:01G
01:01:02
01:01:02G
02:02:01
02:02:01G
03:01:01
01:01:01
01:03:01
01:03:01N
01:01:01
01:02:01
01:10N
02:02:01
002:01
004:01
006
007:01
008:01
008:02
008:04
009:01
011:01
012:01
012:03
016:01
018:01
019:01
027:01
049:01
19:01
002:01
004:01
005:01
005:02
005:03
008:01
014:01
* Novel allele; uppercase “G”, ambiguous allele; uppercase “N”, null allele.
Table 4. HLA novel alleles detected in 52 individuals.
Table 4. HLA novel alleles detected in 52 individuals.
HLA Novel AlleleMost Homologous
Allele
gDNA PositionExonCodon
Change
NCSNPaa ChangeType of
Mutation
GenBank
Accession
Number
IPD-IMGT/HLA Database Accession
Number
B*14:02:01:26B*14:02:01:01−260 5′UTRG to TNoSilentOQ357851HWS10065189
B*35:580B*35:01:01:0220235304 (GCT >> GTT) C to TYes
Ala (A) to Val (V)
Non-synonymousOQ885042HWS10066175
B*40:02:01:41B*40:02:01:013010:1
3020
3021
3′UTRC to Deletion (−)
C to T
C to T
NoSilentOQ885046HWS10066149
C*04:01:01:175C*04:01:01:01−249 5′UTRC to TNoSilentOQ885045HWS10066153
aa, amino acid; SNP, single-nucleotide polymorphism; NC, non-coding region.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kouniaki, D.I.; Fotopoulos, K.V.; Tarassi, K.; Tsirogianni, A. Utilizing Massively Parallel Sequencing (MPS) of Human Leukocyte Antigen (HLA) Gene Polymorphism to Assess Relatedness in Deficiency Parentage Testing. Genes 2024, 15, 150. https://doi.org/10.3390/genes15020150

AMA Style

Kouniaki DI, Fotopoulos KV, Tarassi K, Tsirogianni A. Utilizing Massively Parallel Sequencing (MPS) of Human Leukocyte Antigen (HLA) Gene Polymorphism to Assess Relatedness in Deficiency Parentage Testing. Genes. 2024; 15(2):150. https://doi.org/10.3390/genes15020150

Chicago/Turabian Style

Kouniaki, Diamanto I., Konstantinos V. Fotopoulos, Katerina Tarassi, and Alexandra Tsirogianni. 2024. "Utilizing Massively Parallel Sequencing (MPS) of Human Leukocyte Antigen (HLA) Gene Polymorphism to Assess Relatedness in Deficiency Parentage Testing" Genes 15, no. 2: 150. https://doi.org/10.3390/genes15020150

APA Style

Kouniaki, D. I., Fotopoulos, K. V., Tarassi, K., & Tsirogianni, A. (2024). Utilizing Massively Parallel Sequencing (MPS) of Human Leukocyte Antigen (HLA) Gene Polymorphism to Assess Relatedness in Deficiency Parentage Testing. Genes, 15(2), 150. https://doi.org/10.3390/genes15020150

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop