Next Article in Journal
Evaluation of the Efficacy and Safety of Silver Nanoparticles in the Treatment of Non-Neurological and Neurological Distemper in Dogs: A Randomized Clinical Trial
Next Article in Special Issue
Bayesian Molecular Dating Analyses Combined with Mutational Profiling Suggest an Independent Origin and Evolution of SARS-CoV-2 Omicron BA.1 and BA.2 Sub-Lineages
Previous Article in Journal
Pre-Existing HSV-1 Immunity Enhances Anticancer Efficacy of a Novel Immune-Stimulating Oncolytic Virus
Previous Article in Special Issue
Development of Single-Cell Transcriptomics and Its Application in COVID-19
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Immunoinformatics Identification of the Conserved and Cross-Reactive T-Cell Epitopes of SARS-CoV-2 with Human Common Cold Coronaviruses, SARS-CoV, MERS-CoV and Live Attenuated Vaccines Presented by HLA Alleles of Indonesian Population

1
Department of Biomedicine, School of Life Sciences, Indonesia International Institute for Life Sciences, Jl. Pulomas Barat Kav 88, Jakarta 13210, Indonesia
2
Department of Bioinformatics, School of Life Sciences, Indonesia International Institute for Life Sciences, Jl. Pulomas Barat Kav 88, Jakarta 13210, Indonesia
3
Department of Pulmonology and Respiratory Medicine, Faculty of Medicine University of Indonesia, Persahabatan Hospital, Jl. Persahabatan Raya 1, Jakarta 13230, Indonesia
*
Author to whom correspondence should be addressed.
Viruses 2022, 14(11), 2328; https://doi.org/10.3390/v14112328
Submission received: 22 September 2022 / Revised: 13 October 2022 / Accepted: 16 October 2022 / Published: 24 October 2022
(This article belongs to the Special Issue Bioinformatics Research on SARS-CoV-2)

Abstract

:
Reports on T-cell cross-reactivity against SARS-CoV-2 epitopes in unexposed individuals have been linked with prior exposure to the human common cold coronaviruses (HCCCs). Several studies suggested that cross-reactive T-cells response to live attenuated vaccines (LAVs) such as BCG (Bacillus Calmette–Guérin), OPV (Oral Polio Vaccine), and MMR (measles, mumps, and rubella) can limit the development and severity of COVID-19. This study aims to identify potential cross-reactivity between SARS-CoV-2, HCCCs, and LAVs in the context of T-cell epitopes peptides presented by HLA (Human Leukocyte Antigen) alleles of the Indonesian population. SARS-CoV-2 derived T-cell epitopes were predicted using immunoinformatics tools and assessed for their conservancy, variability, and population coverage. Two fully conserved epitopes with 100% similarity and nine heterologous epitopes with identical T-cell receptor (TCR) contact residues were identified from the ORF1ab fragment of SARS-CoV-2 and all HCCCs. Cross-reactive epitopes from various proteins of SARS-CoV-2 and LAVs were also identified (15 epitopes from BCG, 7 epitopes from MMR, but none from OPV). A majority of the identified epitopes were observed to belong to ORF1ab, further suggesting the vital role of ORF1ab in the coronaviruses family and suggesting it as a candidate for a potential universal coronavirus vaccine that protects against severe disease by inducing cell mediated immunity.

1. Introduction

Severe acute respiratory syndrome coronavirus (SARS-CoV-2) is a new variant of coronavirus that caused an ongoing pandemic of coronavirus disease 2019 (COVID-19) [1]. As of May 2022, approximately 519 million cases and 6.3 million deaths had been reported globally [2]. A variety of studies and research have been undertaken into both preventive and curative measures against the SARS-CoV-2 virus. SARS-CoV-2 is an RNA virus from betacoronavirus of Coronaviridae family. Its genome is a positive single-stranded RNA with a length of 26 to 32 kb, which encodes several structural and nonstructural proteins. SARS-CoV-2 is characterized by having spike proteins on the surface of the enveloped virus [3]. As of 18 May 2022, WHO reported a total of two currently circulating variants of concern (VOCs), three previously circulating VOCs, eight previously circulating variants of interest (VOI), and one variant under monitoring (VUMs) of SARS-CoV-2 [4]. Different variants manifest different symptoms with different levels of severity and transmissibility. Typical symptoms include cough, fever, and sore throat. It may also develop into respiratory tract disease, which now has become the biggest contributing factor to the mortality rate [5].
Belonging to the same Coronaviridae family, the human common cold coronaviruses have been going around for decades. However, unlike COVID-19, it is well-known that the common cold coronaviruses do not normally cause serious signs and symptoms. Some common cold coronaviruses infecting humans are 229E and NL63 from the alphacoronavirus, HKU1, and OC43 from the betacoronavirus [6].
Cross-reactivity refers to unanticipated reactivity towards another antigen that differs from the expected targets, possibly due to similar structural regions [7]. Several reports of SARS-CoV-2 cross-reactive T-cells in naïve individuals have been linked to prior exposure to HCCCs [8,9]. T-cell memory and reactivity might play some role in attenuating the disease severity [6,10]. Several studies have also suggested that LAVs, such as OPV, BCG, and MMR vaccines may offer nonspecific protection by strengthening trained innate immunity against other infections, including SARS-CoV-2 [11,12]. The protection provided by LAVs was also suggested at the adaptive immune level where there is similarity of potential immunogenic epitopes sequence with SARS-CoV-2 [7,13,14,15,16,17].
In order for T-cells to elicit responses, the T-cell receptor (TCR) has to recognize peptides that are presented by HLA molecules [18]. However, it is a well-known fact that HLA molecules are population specific. Thus, different populations may present or possess different sets of alleles [19]. To date, most of the T-cell epitope studies have originated from European and North American populations where the predominant HLA allotypes differ from Indonesian [20,21]. As we mentioned in our previous publication, the Indonesian HLA alleles are frequently neglected and not well-studied in research and published papers [22]. Thus, by primarily using HLA allotypes prevalent in Indonesia, this study hopes to gain more understanding and could be the starting point in identifying SARS-CoV-2 immunogenic epitopes and supporting cell-mediated immunity studies relevant for Indonesian populations. Moreover, in the process, identifying conserved T-cell epitopes throughout the coronavirus family might help generate a universal vaccine construct [23].

2. Materials and Methods

2.1. Coronavirus Proteome Sequence Retrieval

The protein sequences of coronaviruses were primarily retrieved from NCBI (https://www.ncbi.nlm.nih.gov) on 14 september 2021. The reference numbers of each sequence are as follows: NC_002645.1 (229E), NC_006577.2 (HKU1), NC_005831.2 (NL63), NC_006213.1 (OC43), and NC_045512.2 (SARS-CoV-2) [24]. From each sequence, the encoded proteins present in all strains, namely ORF1ab, S, E, M, and N proteins, were selected for the subsequent analysis. The reference sequence of SARS-CoV-2 (NC_045512.2) was primarily used for T-cell epitope prediction.
For additional analysis performed in this study, ORF1ab sequences of SARS-CoV (NC_004718.3), MERS-CoV (NC_019843.3) were retrieved through NCBI [24]. ORF1ab sequences of SARS-CoV-2 VOCs Alpha (B.1.1.7), Beta (B.1.351), Delta (B.1.617.2), and Gamma (P.1) were obtained from the previous study conducted by Gustiananda et al. (2021) [25] and used for the analysis. The sequence for omicron (B. 1.1.529) variants was retrieved through NCBI on 13 February 2022 from Asian countries only. The included sequence was filtered through several criteria, such as a complete sequence of 7093–7096 amino acids in length with no ambiguous character and human as the host. The FASTA file of the sequences of SARS-CoV-2 VOCs used in this study can be found in File S1 of Supplementary Materials.

2.2. HLA Alleles Predominant in Indonesia Identification

The HLA alleles predominant in the Indonesian population were identified through the Allele Frequency Net Database (http://allelefrequencies.net/) [26] which was accessed on 16 september 2021. The run was done using ‘HLA Classical Allele Freq Search’ options, with the country parameters set to ‘Indonesia’. The HLA Class I A, HLA Class I B, and HLA Class II DRB1 alleles with frequencies of more than 5% were chosen for further analysis.

2.3. Cytotoxic T Lymphocyte (CTL) Epitopes Prediction

Epitope binding to HLA Class I was predicted using netCTLpan 1.1 tools (https://services.healthtech.dtu.dk/service.php?NetCTLpan-1.1) which was accessed on 30 January 2022 [27]. Several parameters, including peptide length (9-mer), weight on C terminal cleavage (0.225), weight on TAP transport efficiency (0.025), and threshold for epitope identification (1.0), were set accordingly. The strong binder epitopes, denoted by ‘E’ with percent rank <1%, were selected. Promiscuous epitopes that bind to different HLA alleles were selected for further analysis.
The selected 9-mer CTL epitopes were then checked for their immunogenicity using the immunogenicity tool provided by IEDB (http://tools.iedb.org/immunogenicity/) which was accessed on 21 April 2022 [28]. Since the TCR contact residue of CTL epitopes is located in positions 3–8, residue numbers 1, 2, and C-terminal were selected to be masked in the available option. Peptides with positive immunogenicity were selected for further analysis.

2.4. Helper T Lymphocyte (HTL) Epitopes Prediction

Prediction of peptide binding to HLA Class II DRB1 was conducted using netMHCII-pan 4.0 tools (https://services.healthtech.dtu.dk/service.php?NetMHCIIpan-4.0) which was accessed on 3 November 2021 [29]. Several parameters, including peptide length (15-mer), the threshold for strong binder (1%), and the threshold for weak binder (5%), were set accordingly. In addition, the strong binder epitopes, denoted by ‘SB,’ were selected for further analysis. Like the CTL epitopes, promiscuous epitopes that bind to different HLA alleles were selected for further analysis.
The selected 15-mer HTL epitopes were checked for their IFN-γ inducing ability through the IFNepitope server (http://crdd.osdd.net/raghava/ifnepitope/predict.php) which was accessed on 29 April 2022 [30]. The HTL must be able to induce IFN-γ to ensure induction of an appropriate immune response. The algorithm chosen for the prediction is the Support Vector Machine (SVM) based, while the model is left as default as ‘IFN-gamma vs. Non-IFN-gamma.’ Peptides with positive IFN-γ scores were selected for further analysis.

2.5. IEDB Epitopes Validation

Experimentally known epitopes from different studies are cataloged and available to be checked through the Immune Epitope Database (IEDB) (http://www.iedb.org) which was accessed on 17 May 2022 [31]. The short-listed epitopes, both CTL from step 2.3 and HTL from step 2.4, were checked against the database as validation measures. The epitope source was specified as ‘SARS-CoV-2’, ‘Human’ as the host, and antigen according to each generated epitope (ORF1ab, S, E, M, and N). Peptides reported and validated in IEDB were usually prioritized to be included in subsequent studies.

2.6. Conservancy and Cross-Reactivity Analysis of SARS-CoV-2 T-Cell Epitopes with HCCCs

In order to analyze the cross-reactivity of SARS-CoV-2 with other human CCCs, the conservancy analysis was done using the IEDB epitope conservancy analysis tool (http://tools.iedb.org/conservancy/) which was accessed on 9 March 2022 [32]. The respective epitopes from each identified protein were run against the four human CCCs. As a result, the epitopes that share 100% similarity across all strains were identified. Information on the 100% conserved epitopes, such as location in the gene, proteins, and domains in all five different viruses, was obtained through NCBI (https://www.ncbi.nlm.nih.gov) which was accessed on 14 March 2022.
Additional conservancy analysis was performed against the fully conserved epitopes. For more defined and complete results, the conserved epitopes were again run against SARS-CoV, MERS-CoV, previous, and currently circulating SARS-CoV-2 VOCs. Information regarding the epitopes in different viruses was also gathered and compared.
Epitopes that possess different residues in non-critical positions still have the possibility of binding to the same HLA alleles, provided that the residues fulfill the HLA binding motif. The epitopes that have identical TCR contact residue might still be recognized by the same T-cell clone. The TCR contact residues of CTL epitopes are located in residue number 3–8, while HTL epitopes are located in residue number 2, 3, 5, 7, and 8. Therefore, epitopes that possessed identical TCR contact residues across five different viruses during the conservancy run were identified. The sequence identity cutoff value in IEDB epitope conservancy analysis tool was first set to 67% to ensure optimal recognition and binding [33]. Following that, epitopes with identical TCR contact residue were sorted. The identified epitopes were then subjected to another NetCTLpan 1.1 analysis (https://services.healthtech.dtu.dk/service.php?NetCTLpan-1.1) for CTL epitopes and NetMHCIIpan 4.0 analysis (https://services.healthtech.dtu.dk/service.php?NetMHCIIpan-4.0) for HTL epitopes [27,29]. Both servers were accessed on 2 May 2022. The parameters for each prediction were identical to steps 2.3 and 2.4. This step was conducted to ensure that the residue difference in the sequence indeed did not affect the epitopes’ binding ability to the HLA alleles.

2.7. Cross-Reactivity Analysis of SARS-CoV-2 T-Cell Epitopes with OPV, BCG, and MMR

The cross-reactivity analysis was done by using the Blastp (Protein BLAST) from the NCBI server (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) which was accessed on 23 April 2022 to identify similar amino acid sequences between SARS-CoV-2 T-cells epitopes with the epitope sequences from OPV, BCG, and MMR. The epitopes sequences were inputted into the “Query sequence” box. Meanwhile, the taxon ID from each live attenuated vaccine (OPV, BCG, MMR) was inputted into the “Organism” box. The taxon ID used in the epitope cross reactivity analysis are as follows: Mycobacterium tuberculosis variant bovis (1765), Human poliovirus 1 (12080), Human poliovirus 2 (12083), Human poliovirus 3 (12086), Measles virus strain Schwarz (132487), Rubella virus (11041), and Mumps orthorubulavirus (2560602). All of the options in the “exclude” section were selected. Some settings in the “Algorithm parameters” section were adjusted, such as the “Expect threshold” being 30000, the “Word size” being 2, and the “Matrix” being changed to PAM30; the “Compositional adjustments” was set to no adjustments, and the option in the “Filter” section was clicked.

2.8. Cross-Reactivity of Predicted Epitopes with Human Peptides

Blastp analysis (http://blast.ncbi.nlm.nih.gov/ Blast.cgi) was conducted to find SARS-CoV-2 epitopes that match with human peptides. The website was accessed on 2 May 2022. The method was adapted from previous studies [22,25]. The Blastp algorithm parameter was set as follows: expect threshold 30000, word size 2, matrix PAM30, gap cost set to existence = 9 and extension = 1; the compositional parameter was set to no adjustment, while the low complexity filter was disabled and automatically adjusted for short input sequences. The results from Blastp analysis were transferred into Microsoft Excel and were screened for the peptides which shared at least contiguously 7 identical amino acid residues with the human peptides with no gap and no mismatches residue.

2.9. Population Coverage Analysis

In order to assess whether the cross-reactive and short-listed epitopes are prevalent in Indonesia, population coverage analysis was performed through the IEDB population coverage analysis tools (http://tools.iedb.org/population/) which was accessed on 25 July 2022 [34]. The epitopes and corresponding HLA alleles that bind were inputted. Then, several parameters, including the population (Indonesia) and calculation options (Class I separate, Class II separate, and Class I and II combined), were set accordingly. The population coverage was then further analyzed.

2.10. Epitope Selection for Universal Coronavirus Vaccine Construct

Following different analyses, epitopes with attractive traits to be included in the coronavirus vaccine construct were further sorted. Criteria included conservancy, promiscuity, immunogenicity, IFNγ score, population coverage, no similarity with human peptides and IEDB reported peptides.

3. Results

3.1. HLA Allele Frequencies of Indonesian Population

Through the allele frequencies database, a total of 21 alleles from both HLA Class I and II with frequencies equal to or more than 5% were identified and used for the epitopes prediction (Figure 1). Out of 21, 15 of them belonged to HLA Class I alleles where six belonged to Class A and nine belonged to Class B. The highest frequencies from HLA Class I alleles were found to be HLA-A*24:07, with average frequencies of 23.1% coming from three different Indonesian populations. However, the highest frequency allele was located among the remaining six alleles that belonged to the HLA Class II DRB1 alleles. The HLA-DRB1*12:02 followed by HLA-DRB1*15:02 recorded an average frequency of 34.3% and 28.32%, coming from eight and six different Indonesian populations, respectively, making them the two highest frequency alleles of the HLA in the Indonesian population identified in this study.

3.2. Prediction of CTL Epitopes from SARS-CoV-2 Proteins

The prediction of CTL epitopes from SARS-CoV-2 protein with the HLA Class I alleles (%Rank <1%) resulted in 1599 bindings combined from 5 different SARS-CoV-2 proteins. In the process, it was found that HLA-B*15:21 and HLA-A*11:01 showed the most significant number of bindings, at 136 and 133 epitopes, respectively. Overall, as shown in Figure 2, each HLA Class I allele is predicted to bind to a large number of epitopes; the least records a total of 75 bindings, by HLA-B*44:03. Moreover, around 80% of the bindings from each allele were contributed by ORF1ab epitopes (Figure 2; blue color), followed by spike (S), membrane (M), nucleocapsid (N), and envelope (E) protein.
From the NetCTLpan 1.1 prediction, promiscuous peptides from SARS-CoV-2 (ORF1ab, S, E, M, and N) were further sorted and checked for their immunogenicity and population coverage. Promiscuous peptides here are defined as peptides that can bind with at least two different HLA Class I alleles. Out of 1599 predicted peptides, 414 of them are promiscuous. Highly promiscuous peptides that bind to at least five different HLA Class I alleles are listed in Table 1, while the complete list of peptides identified in this study can be found in Supplementary Materials (Table S1). Most of the peptides were observed to belong to the ORF1ab. With starting residue number 1767, the VMYMCTLSY peptide belonging to ORF1ab binds to 9 different peptides (HLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05, HLA-B*44:03, HLA-B*58:01), making it the most promiscuous CTL peptide among other residues and covering the largest proportion in the Indonesian population of up to 90.22%.
Besides binding affinity to the HLA molecules, immunogenicity is an essential determinant in peptide recognition by TCR. Highly immunogenic CTL epitopes with a score >0.35 are listed in Table 2. A total of 14 highly immunogenic CTL epitopes (score >0.35) were identified along with their HLA Class I that binds, population coverage, and previously reported IEDB validation. The DYVYNPFMI epitope from ORF1ab recorded the highest immunogenicity score of 0.56221, bound to two HLA Class I alleles, and covered 58.77% of the Indonesian population. The binding with HLA-A*24:02 was also experimentally proven by HLA ligand assay and reported in IEDB.

3.3. Prediction of HTL Epitopes from SARS-CoV-2 Proteins

Despite the number of HLA Class II alleles being almost one-third the number of HLA Class I alleles, they record a nearly similar total number of predicted epitope bindings (Figure 3). The HLA-DRB1*15:02 allele holds the highest record, with a total of 299 HTL epitopes predicted to bind to it. In contrast, the HLA-DRB1*11:01 recorded a total of 170 bindings, the lowest record in HLA Class II alleles. All of the HLA Class II DRB1 included in this prediction binds to many more peptides when compared to HLA Class I. Moreover, consistent with the HLA Class I, ORF1ab is still responsible for most of the bindings (Figure 3; blue color), approximately 80% of the total. This is followed, in order, by spike, nucleocapsid, membrane, and envelope protein.
Following the NetMHCIIpan 4.0 prediction, promiscuous HTL peptides from SARS-CoV-2 (ORF1ab, S, E, M, and N) were also sorted and further checked for their IFN-γ score and population coverage. Similar to class I, promiscuous peptides here are defined as peptides that can bind to at least two different HLA Class II alleles. Out of 1422 predicted peptides, 414 of them are found to be promiscuous. The highly promiscuous peptides that bind to at least 3 out of 6 HLA Class II alleles are listed in Table 3, while the complete list of HTL peptides identified in this study can be found in Supplementary Materials (Table S2). With starting residue number 6751, the 15-mer peptides of LDDFVEIIKSQDLSV belonging to ORF1ab bind to all six different HLA Class II alleles (HLA-DRB1*07:01, HLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02), making it the most promiscuous HTL peptide that covers 95.26% of the Indonesian population. The binding between the peptides and HLA alleles has also been experimentally proven by T-cell assay and reported in IEDB.
Apart from binding affinity, IFN-γ is an important cytokine produced by HTL to trigger immune responses. Therefore, HTL epitopes with good IFN-γ inducing ability reflected as IFN-γ score are more likely to produce immune responses. Thus, 15 top 15-mer HTL epitopes with the highest IFN-γ score are listed in Table 4, along with information on HLA Class II that binds, population coverage, and previous IEDB reports. The complete list can also be found in Supplementary Materials (Table S2). The highest IFN-γ score was IAQFAPSASAFFGMS epitope from ORF9 that binds to DRB1_0701 with a score of 1.06167. The binding has also been experimentally proven by B-cell assay and reported in IEDB.

3.4. Identification of Conserved Epitopes in SARS-CoV-2 and Human CCCs

Cross-reactivity analysis was performed through the IEDB Conservancy Analysis Tools to identify conserved epitopes throughout the SARS-CoV-2 and four other human CCCs. The analysis was performed using five proteins (ORF1ab, S, E, M, and N) conserved in all five viruses against HLA Class I and II alleles. It was found that only two 9-mer epitopes were conserved in all five viruses (Table 5). Both epitopes belong to the ORF1ab, specifically within the region of nsp12, which encodes for the RNA dependent RNA polymerase (RdRp) enzyme of coronaviruses. The two epitopes bind to a total of 4 different HLA Class I alleles, with a population coverage of 13.07%. Moreover, specifically for the ‘SLAIDAYPL’ epitope, its binding with HLA-A*02:01 has been experimentally proven by both the T-cell assay and HLA ligand assay reported in IEDB.
Additional analysis was performed with the two identified conserved epitopes. The conserved epitopes were run against the SARS-CoV-2 current and previously circulating VOCs. The current VOCs include delta and omicron variants with a total of 1157 and 884 isolates, respectively. The previously circulating VOCs include alpha, beta, and gamma variants with a total of 158, 374, and 9 isolates, respectively. The sequences of SARS-CoV-2 variants can be found in Supplementary Materials. From the conservancy run, it was found that all VOCs of SARS-CoV-2 possessed the two conserved epitope sequences (Table 6). These were slightly different in position but still belonging to the same SARS-CoV-like RdRp region and domain. The results show that the epitopes are fully conserved across SARS-CoV-2 variants despite different mutations that are responsible for emergence of different variants.
In addition to the VOCs, the epitopes were also run against the SARS-CoV and MERS-CoV from the same betacoronavirus family (Table 7). It was found that the SLAIDAYPL epitopes were conserved in both SARS-CoV and MERS-CoV. The HEFCSQHTM epitope was conserved 100% in SARS-CoV but not in MERS-CoV. One amino acid difference in the non-critical ninth residues was found in MERS-CoV, a change from methionine (M) to leucine (L). Despite the difference, the epitope from MERS-CoV still binds to the same HLA allele as SARS-CoV-2. Together with the SARS-CoV and MERS-CoV, the conservancy analysis indicated that the two identified epitopes are conserved across all coronaviruses that have infected humans.

3.5. Epitopes with Identical TCR Contact Residue between SARS-CoV-2 and Human CCC

Besides the fully conserved epitopes, epitopes with identical TCR contact residues between SARS-CoV-2 and four human CCCs were also identified. As previously indicated, epitopes with residue differences may still experience binding to the same HLA if the residues still fulfill the HLA binding motifs. The peptide which has an identical TCR contact residue might be recognized by the same T-cell clone. This study specifically used a sequence similarity cutoff value of 67% to ensure optimal recognition and binding [33]. Moreover, to ensure that the epitopes from human CCC with differing residues still indeed bind to the same HLA alleles with the original epitopes from SARS-CoV-2 reference sequence, another round of netCTLpan 1.1 analysis for 9-mer epitopes and netMHCIIpan 4.0 analysis for 15-mer epitopes was conducted.
During the process, a total of four 9-mer peptides (Table 8) and five 15-mer peptides (Table 9) from ORF1ab were identified. Almost all of them possess some residue difference in the non-TCR contact residue. Overall, however, more residue differences were observed in the 15-mer HTL epitopes. Data showed that not all listed peptides turn out to bind to the same alleles where the peptide from SARS-CoV-2 reference sequence binds, which could be due to the differences in the anchor residue of the peptide. The identified 9 peptides were found to mainly originate from nsp12, nsp13, and nsp14 of ORF1ab, which is suggested to play an essential role in the viral survival for it to retain some degree of conservancy across all five viruses. In addition, some of the bindings between the epitopes and HLA have also been reported in IEDB and validated through HLA ligand assay or B cell assay.

3.6. Cross-Reactive T-Cell Epitopes between SARS-CoV-2 and BCG

Potential cross-reactive CD8+ and CD4+ T-cells epitopes were discovered utilizing Blastp analysis and reinspected using NetCTLpan 1.1 and NetMHCIIpan-4.0 correspondingly. In total, the BCG vaccine contains 14 similar sequences as SARS-CoV-2 in ORF2, ORF3a, ORF4, ORF5, ORF6, ORF7b, ORF8, and ORF9 regions (Table 10). These similar sequences are not completely identical but have 2 to 4 amino acid substitutions. Despite that, these 14 epitopes are still able to bind to various HLA Class I alleles, especially HLA-A*02:01 and HLA-A*02:03. Moreover, several cross-reactive epitopes from BCG are also capable of binding different HLA alleles than the SARS-CoV-2 epitopes. Additionally, a few cross-reactive BCG epitopes also demonstrate stronger binding to HLA than SARS-CoV-2 epitopes according to the HLA binding affinity score.
Besides cross-reactive CTL epitopes, one cross-reactive HTL epitope was also identified through Blastp analysis. The cross-reactive HTL epitope from BCG is not identical to the SARS-CoV-2 epitope in the ORF7a region but has 2 amino acid substitutions in its core epitope. In spite of its difference, the cross-reactive HTL epitope is still able to bind HLA-DRB1*11:01 more strongly than SARS-CoV-2. In contrast, the cross-reactive HTL epitope has much lower Indonesian population coverage compared to the SARS-CoV-2 epitope (Table 11).

3.7. Cross-Reactive T-Cell Epitopes between SARS-CoV-2 and MMR

Similarly to BCG, several cross-reactive CTL epitopes were identified from the MMR vaccine, which is further divided into measles, mumps, and rubella. There were 7 similar sequences discovered from MMR compared with SARS-CoV-2 sequences in ORF2, ORF4, ORF7a, ORF7b, and ORF9 regions. These cross-reactive epitopes are able to bind to either one or two HLA Class I alleles, notably HLA-A*02:01 and HLA-A*02:03. However, only three cross-reactive epitopes from MMR demonstrate stronger HLA binding than SARS-CoV-2 epitopes, indicated by a lower HLA binding affinity score (Table 12).

3.8. Potential Epitope Set for Coronavirus Universal Vaccine Construct and Its Population Coverage

The conserved epitope from SARS-CoV-2 and HCCs identified in the preceding steps might be helpful in the universal vaccine construction. However, in regard to epitope selection for vaccines specifically, several additional criteria have to be considered, such as immunogenicity, IFN-γ inducing ability, population coverage, similarity with human peptides, and many more. Therefore, the selected epitopes were further characterized for the similarity with the human peptides and the entire epitope set was calculated for population coverage (Table 13).
Combining all the epitopes into a vaccine construct will result in 99.58% projected population coverage for Indonesia, with an average number of 4.25 epitope hits/HLA combinations recognized by the population, and a minimum number of 2.24 epitope hits/HLA combinations recognized by 90% of the population (Figure 4).

4. Discussion

T-cell responses to SARS-CoV-2 infection have been garnering attention lately due to potentially playing a vital role in measuring disease responses and progression, predicting disease recurrence, and control strategies [35]. Understanding T-cell responses can provide greater insights primarily on immune responses towards the disease, thus provide deeper understanding of its pathogenesis, vaccine development, responses, evaluation, disease severity, and much more [36]. There is strong evidence on T-cell cross-reactivity as the cause behind pre-existing immunity toward SARS-CoV-2 in the naïve population [8,9]. These cross-reactive immune responses reportedly correlate with shared homology between SARS-CoV-2 and the circulating HCoVs. Prior exposure to the human CCCs likely mediates the priming of SARS-CoV-2 responsive T-cells in the unexposed population [33,35,37,38].
This study discovered two fully conserved SARS-CoV-2 derived epitopes with 100% identity shared by all four common cold coronaviruses (HCoV-229E, HCoV-NL63, HCoV-OC43, HCoV-HKU1). They are presented by four major Indonesian HLA alleles (HLA-B*18:01, HLA-B*38:02, HLA-B*44:03 and HLA-A*02:01) with accumulated population coverage of 50.2%. The ‘SLAIDAYPL’ epitope binding with HLA-A*02:01, specifically, has been reported in IEDB. Moreover, the epitope has also reported to be conserved across CCCs in other population-specific HLA (UK and US) such as HLA-B*39 and HLA-B*62 [39].
In addition to the homologous epitopes, nine heterologous epitopes with identical TCR contact residue were also identified. They are all represented by at least one major Indonesian HLA allele and some of their bindings have been reported in IEDB. This result showed that cross-reactivity could indeed potentially arise due to sequence homology shared between the SARS-CoV-2 and human CCC. Previous studies demonstrated that T-cells, especially CD4+ T-cells that recognize human CCCs, exhibited cross-reactivity towards the homologous epitopes of SARS-CoV-2 [33]. It is important to note that the small number of epitopes identified in this study was most likely due to the analysis being performed across all four human CCC at once instead of towards each human CCC virus individually. More extensive epitope sequence similarity would be expected if the latter exercise was performed. A previous study identified as many as 13 fully conserved epitopes and 17 heterologous epitopes of SARS-CoV-2-derived epitopes when only compared to the betacoronavirus (HCoV-OC43 and HCoV-HKU1) [40]. Other studies have also consistently explored and reported a larger number of bindings when they conduct the analysis separately. Collectively, the data support the idea that cross-reactivity occurrence happens due to the shared homology of the HCoVs [39,41].
A further look into the fully conserved epitopes also showed that they are conserved in the previous and currently circulating SARS-CoV-2 VOCs, SARS-CoV and MERS-CoV. This result shows that there are low chances of these epitopes being responsible for the difference in disease outcomes and severity, given that their presence across the SARS-CoV-2 variants, SARS-CoV, MERS-CoV, and human CCCs is consistent. While data is still lacking to draw a conclusive claim, the information on the HLA alleles that bind to these conserved epitopes could possibly help in understanding different disease outcomes across the coronavirus family. The region where the conserved epitopes originated might play an essential role in the coronavirus family, for they are conserved across all the HCoVs.
Out of five proteins (ORF1ab, S, E, M, and N) used during the analysis, all of the identified shared epitopes in this study belong to the ORF1ab. This may indicate that ORF1ab possessed a high degree of conservancy in the coronavirus family. This finding is also aligned with our previous study which validated it through entropy analysis [25] and studies done by others using genome-wide screening technology [42]. Besides the shared epitopes, ORF1ab also ranks highest in the number of predicted epitopes when compared to the rest of the proteins such as S, E, M, and N. This is understandable given the number of epitopes highly correlated with the gene length, as ORF1ab length is almost two-thirds of the whole genome [40]. This data was corroborated by others which revealed that ORF1ab contains the highest number of experimentally proven immunodominant epitopes [42] and highlighted the role of ORF1ab as the region with the most cross-reactive T-cells that can elicit responses in individuals exposed to other HCoVs but naive to SARS-CoV-2 infection [38].
Moreover, ORF1ab also has a primary function as a replicase enzyme that is vital for viral survival [43]. The ORF1ab is further cleaved into 16 nsps, where several of them have an integral function during the viral life cycle [44]. This study specifically identified epitopes belonging to the nsp12, nsp13, and nsp14 regions. The nsp12 encodes for RdRp protein responsible for viral replication, transcription, and overall survival. The nsp13 belongs to the helicase superfamily, which possesses both helicase and NTPase activity [45]. They primarily function to unwind the DNA or RNA strands during replication; they also support and interact with other replication-transcription complexes [46]. Lastly, the nsp14 encodes for proteins necessary for proofreading ability, evading the host’s antiviral responses, and ensuring mRNA stability and translation [47]. The essential functions of each region in SARS-CoV-2 further support the idea of them being conserved across the whole coronavirus family. Previous studies have reported that nsp12-nsp14 are among the most conserved and accessible sites in the entire SARS-CoV-2 proteome, which is aligned with this study’s results [45,46]. As a conserved region, they also become an attractive target for intervention such as drug targets or vaccines [47].
The role of live attenuated vaccines such as BCG, OPV and MMR in attenuating COVID-19 diseases has been hypothesized [48] and investigated in some clinical trials [49,50]. In addition, a case-control study was also performed which investigated the MMR vaccination’s probable protective effect against SARS-CoV-2 [51]. Although most clinical and case-control studies focus on trained innate immunity, some studies showed that these LAVs also contain cross-reactive T-cells epitopes similar to epitopes of SARS-CoV-2 that may generate adaptive immunological memory [7,16,17]. These cross-reactive T-cells epitopes might be one of the main factors contributing to protective effects from LAVs apart from the trained innate immune system. The Blastp analysis conducted in this study discovered that BCG and MMR contain 15 and 7 similar yet not identical sequences with SARS-CoV-2, respectively. Despite the fact that these sequences have 2 to 4 amino acid substitution, these sequences are potentially able to induce cross-reactive CTL against SARS-CoV-2. Additionally, one sequence discovered from BCG also has the ability to give rise to cross-reactive HLT against SARS-CoV-2. However, the dissimilarity in terms of which HLAs bind to the cross-reactive epitopes suggests that not all people will benefit from the presence of cross-reactive epitopes. Hence, only people who have certain HLA alleles will be able to respond to the epitopes and gain benefit from the heterologous immunity.
In contrast to other studies [13,14] the Blastp analysis did not find any cross-reactive CTL and HTL epitopes from OPV specific to the Indonesian population. One of the factors that might lead to no cross-reactive epitopes between OPV and SARS-CoV-2 is the genome nature of poliovirus. Compared to other viruses, the genomic RNA of the poliovirus is only approximately 7500 nucleotides in length. Hence, the poliovirus is classified as a small virus in comparison to BCG and MMR, resulting in lower possibilities of similarity matches in Blastp analysis. However, the result of this study does not imply that OPV does not have beneficial cross-protection effects on the Indonesian population. There is still the possibility that OPV provides nonspecific protection through trained innate immunity, such as Toll Like Receptor (TLR) stimulation, notably the TLR3/TLR7, against SARS-CoV-2 which eventually leads to the early activation of innate immunity, such as macrophages and dendritic cells [52] as well as the potential cross-reactivity of the humoral immune responses [15].
Regarding the association of the T-cell cross-reactivity with the clinical outcomes, there are limitations on the conclusions that can be drawn from immunoinformatics analyses. The ideas discussed here require investigation and validation through laboratory experiments. Several studies, for instance, proved that cross-reactive memory T-cells indeed result in protection by measuring the IL-2 and IFN-γ secretion through fluorescence-linked immunospot (FLISpot) assay on PBMC [40]. An experiment using an epitope-mapping and flow cytometry activation-induced marker assay suggested that the presence of cross-reactive T-cells might be responsible for the different clinical outcomes and severity in COVID-19 patients [33].
It is still difficult and too early for now to conclude whether this cross-reactivity indeed results in cross-protection or instead results in enhanced pathogenicity. Apart from T-cell reactivity, the association of HLA alleles with COVID-19 severity has also been reported [53]. HLA genetic background is considered one of the contributing factors to different SARS-CoV-2 infection outcomes (transmissibility, susceptibility, progression, and severity), as it is directly related to an individual’s immunogenetic variation [21]. Each allele possesses a different level of binding affinity. This binding affinity, as mentioned previously, will directly affect and produce different immune responses such as neutralizing antibodies or pro-inflammatory cytokines. In general, most of the HLA Class II alleles are less significant when compared to HLA Class I in regard to severity [54].
Out of 21 HLA alleles prevalent in Indonesia used in this study (Figure 1), only some have been further analyzed for their contribution towards severity in different populations (not Indonesian). For example, HLA-A*11:01 was proven by different studies to be highly correlated with severe disease and outcomes in the European and Chinese populations [21,54]. One study implied that individuals who possessed HLA-A*11:01 and HLA-A*24:02 might produce better T-cell immune responses when compared to HLA*02:01 [55]. HLA-A*02:01 instead is associated with an increased risk of infection due to its lower ability to present SARS-CoV-2 antigen in multinational populations. Lastly, the DRB1*12:01 are found to display protective effects against SARS-CoV-2, while the DRB1*15:01 are found to be increased in asymptomatic patients in the European and Chinese population [20,54]. Findings on different populations may or may not be reflected in the Indonesian population. However, it can be seen that differences in the genetic profile and geographic background will affect the outcomes of infection along with other factors such as age, gender, ABO blood groups, and other comorbid conditions [20,41].
The COVID-19 pandemic has once again shown the ability of viruses from the coronavirus family to cause massive outbreaks [56,57]. Studies have highlighted SARS-CoV-2 potential to evolve, survive, and reemerge, owing to its efficient host-switching and genetic recombination ability [58,59]. The immediate example that can be witnessed is the emergence of numerous SARS-CoV-2 variants with different transmissibility and immune evasion ability, which influences the treatment efficiency, including for vaccines. These factors stress the need for a universal coronavirus vaccine in order to minimize disease burden or even prevent the next coronavirus outbreak [56].
This study suggested some epitopes that might be helpful in the universal coronavirus vaccine construction (Table 13). Based on a series of analyses conducted in this study, ORF1ab was specifically identified as an attractive candidate for a universal vaccine. Epitopes belonging to the ORF1ab family have some features of high conservancy across the coronavirus family, promiscuousness that covers a large population, immunodominance, and good IFN-γ inducing ability; these make them very promising as possible targets for universal coronavirus vaccines. Several studies that primarily utilized ORF1ab for their multi-epitope vaccines design have also professed interest in and highlighted ORF1ab qualities as a vaccine target [25,60]. The presence of pre-existing cross-reactive memory T-cells recognizing epitopes from the early proteins that were made by the infected cells such as the polymerase proteins have been associated with the abortive infection which is beneficial in protecting individual from SARS-CoV-2 infection [61]. This supports ORF1ab as a candidate for a universal coronavirus vaccine and shows the benefit of cross-reactive T-cells. Yet there are still reports of contradictory findings where it is believed exposure to human common cold coronavirus does not primarily explain the cross-reactivity [62]. Debates over the exact cause and mechanism are still circulating. Pre-existing immunity may result in either protection or increased susceptibility toward SARS-CoV-2 [24]. Regardless of different opinions and arguments, understanding T-cell responses and reactivity is believed to be the key to understanding the disease responses to ensure suitable and effective control strategies are implemented [35,36].
To date, no studies have conducted extensive research on the universal coronavirus vaccine. However, some essential points can be learned from the attempt to make a universal influenza vaccine [63]. It is vital to ensure that a universal vaccine offers broad and durable protection that is effective towards at least three-fourths of viruses from the family, despite the mutations that may arise over time [64,65]. It also must be suitable for all age groups and possess good immunogenicity, low allergenicity, and low homology with the human peptides [64]. A better understanding of the infection itself will also help improve and speed up the vaccine design process, assessing the vaccine responses and evaluation [64]. All in all, it may take a long time to actually produce an effective and safe universal vaccine, but scientists believe the idea of a universal coronavirus vaccine is possible with further exploration through research and analysis [56,57,65].

5. Conclusions

Through immunoinformatics approaches, two fully conserved SARS-CoV-2 derived epitopes with 100% sequence similarity shared by all four human CCCs, along with additional nine heterologous epitopes with identical TCR contact residues and cross-reactive epitopes with the live attenuated vaccines were identified. Each epitope was found to be represented by at least one major Indonesian HLA allele, covering a considerable population with good binding affinity. These findings overall support the idea that T-cell cross reactivity occurrence in SARS-CoV-2 naïve individuals could arise from shared sequence homology between SARS-CoV-2 and human CCCs. Additional analysis performed with SARS-CoV, MERS-CoV, and SARS-CoV-2 VOCs also indicated that the conserved epitopes are not responsible for differential outcomes in regards to the resulting signs, symptoms, and severity, as their presence is consistent across all assessed isolates. Moreover, with most predicted conserved epitopes belonging to the ORF1ab, this study also implies the vital role of ORF1ab in the coronavirus family. This further suggests its potential as a candidate for the universal coronavirus vaccine target.
The present study is limited to in-silico analysis which cannot confirm matters concerning T-cell reactivity and HLA association with clinical outcomes and severity. Therefore, further validation using laboratory experiments that allow measurement of immune system key player (i.e., IL-2, IFN-γ) levels might provide greater insights to resolve the issues raised.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/v14112328/s1. Table S1: Full list of predicted 9-mer CTL epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) with %Rank less than 1% identified from NetCTLpan 1.1 along with their immunogenicity score. Table S2. Full list of predicted 15-mer HTL epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) with %Rank less than 1% identified from NetMHCpan 4.0 along with their IFN-γ score. File S1 contains the FASTA files of the ORF1ab sequences from SARS-CoV-2 VOCs.

Author Contributions

Conceptualization, M.G., V.J. and A.H.; Data curation, M.G, V.J., A.H. and D.A.; Formal analysis, M.G., V.J., A.H. and D.A.; Funding acquisition, M.G., D.A. and S.A.; Investigation, M.G., V.J., A.H., G.G.F., R.H., L.K., E.S., R.S., D.A., S.A. and A.A.P.; Methodology, M.G, D.A. and A.A.P.; Writing—original draft, M.G., V.J. and A.H.; Writing—review and editing, M.G., V.J., A.H., G.G.F., R.H., L.K., E.S., R.S., D.A., S.A. and A.A.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Ministry of Education, Culture, Research and Technology, Republic of Indonesia, Southeast Asia—Europe Joint Funding Scheme for Research and Innovation, contract grant number 447/LL3/AK.04/2022.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Naqvi, A.A.T.; Fatima, K.; Mohammad, T.; Fatima, U.; Singh, I.K.; Singh, A.; Atif, S.M.; Hariprasad, G.; Hasan, G.M.; Hassan, I. Insights into SARS-CoV-2 genome, structure, evolution, pathogenesis and therapies: Structural genomics approach. Biochim. Biophys. Acta (BBA)–Mol. Basis Dis. 2020, 1866, 165878. [Google Scholar] [CrossRef] [PubMed]
  2. World Health Organization (WHO). Coronavirus COVID-19 Dashboard. Available online: https://covid19.who.int/ (accessed on 18 May 2022).
  3. Zheng, J. SARS-CoV-2: An Emerging Coronavirus that Causes a Global Threat. Int. J. Biol. Sci. 2020, 16, 1678–1685. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. World Health Organization Tracking Sars-COV-2 Variants. Available online: https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/ (accessed on 18 May 2021).
  5. Lim, Y.X.; Ng, Y.L.; Tam, J.P.; Liu, D.X. Human Coronaviruses: A Review of Virus–Host Interactions. Diseases 2016, 4, 26. [Google Scholar] [CrossRef] [Green Version]
  6. Poland, G.A.; Ovsyannikova, I.G.; Kennedy, R.B. SARS-CoV-2 immunity: Review and applications to phase 3 vaccine candidates. Lancet 2020, 396, 1595–1606. [Google Scholar] [CrossRef]
  7. Tomita, Y.; Sato, R.; Ikeda, T.; Sakagami, T. BCG vaccine may generate cross-reactive T cells against SARS-CoV-2: In silico analyses and a hypothesis. Vaccine 2020, 38, 6352–6356. [Google Scholar] [CrossRef] [PubMed]
  8. Altmann, D.M.; Boyton, R.J. SARS-CoV-2 T cell immunity: Specificity, function, durability, and role in protection. Sci. Immunol. 2020, 5, eabd6160. [Google Scholar] [CrossRef]
  9. Sette, A.; Crotty, S. Pre-existing immunity to SARS-CoV-2: The knowns and unknowns. Nat. Rev. Immunol. 2020, 20, 457–458. [Google Scholar] [CrossRef]
  10. Echeverría, G.; Guevara, Á.; Coloma, J.; Ruiz, A.M.; Vasquez, M.M.; Tejera, E.; de Waard, J.H. Pre-existing T-cell immunity to SARS-CoV-2 in unexposed healthy controls in Ecuador, as detected with a COVID-19 Interferon-Gamma Release Assay. Int. J. Infect. Dis. 2021, 105, 21–25. [Google Scholar] [CrossRef]
  11. Chumakov, K.; Benn, C.S.; Aaby, P.; Kottilil, S.; Gallo, R. Can existing live vaccines prevent COVID-19? Science 2020, 368, 1187–1188. [Google Scholar] [CrossRef]
  12. Chumakov, K.; Avidan, M.S.; Benn, C.S.; Bertozzi, S.M.; Blatt, L.; Chang, A.Y.; Jamison, D.T.; Khader, S.A.; Kottilil, S.; Netea, M.G.; et al. Old vaccines for new infections: Exploiting innate immunity to control COVID-19 and prevent future pandemics. Proc. Natl. Acad. Sci. USA 2021, 118, e2101718118. [Google Scholar] [CrossRef]
  13. Reche, P.A. Potential Cross-Reactive Immunity to SARS-CoV-2 From Common Human Pathogens and Vaccines. Front. Immunol. 2020, 11, 586984. [Google Scholar] [CrossRef] [PubMed]
  14. Haddad-Boubaker, S.; Othman, H.; Touati, R.; Ayouni, K.; Lakhal, M.; Ben Mustapha, I.; Ghedira, K.; Kharrat, M.; Triki, H. In silico comparative study of SARS-CoV-2 proteins and antigenic proteins in BCG, OPV, MMR and other vaccines: Evidence of a possible putative protective effect. BMC Bioinform. 2021, 22, 163. [Google Scholar] [CrossRef]
  15. Comunale, B.A.; Engineer, L.; Jiang, Y.; Andrews, J.C.; Liu, Q.; Ji, L.; Yurkovich, J.T.; Comunale, R.A.; Xie, Q. Poliovirus Vaccination Induces a Humoral Immune Response That Cross Reacts with SARS-CoV-2. Front. Med. 2021, 8, 1285. [Google Scholar] [CrossRef] [PubMed]
  16. Tarabini, R.F.; Rigo, M.M.; Fonseca, A.F.; Rubin, F.; Bellé, R.; Kavraki, L.E.; Ferreto, T.C.; Antunes, D.A.; de Souza, A.P.D. Large-Scale Structure-Based Screening of Potential T Cell Cross-Reactivities Involving Peptide-Targets From BCG Vaccine and SARS-CoV-2. Front. Immunol. 2022, 12, 5858. [Google Scholar] [CrossRef] [PubMed]
  17. Eggenhuizen, P.J.; Ng, B.H.; Chang, J.; Fell, A.L.; Cheong, R.M.Y.; Wong, W.Y.; Gan, P.-Y.; Holdsworth, S.R.; Ooi, J.D. BCG Vaccine Derived Peptides Induce SARS-CoV-2 T Cell Cross-Reactivity. Front. Immunol. 2021, 12, 3034. [Google Scholar] [CrossRef]
  18. Abbas, A.K.; Lichtman, A.H.; Pillai, S. Basic Immunology: Functions and Disorders of the Immune System; Elsevier: Amsterdam, The Netherlands, 2020. [Google Scholar]
  19. Wieczorek, M.; Abualrous, E.T.; Sticht, J.; Álvaro-Benito, M.; Stolzenberg, S.; Noé, F.; Freund, C. Major Histocompatibility Complex (MHC) Class I and MHC Class II Proteins: Conformational Plasticity in Antigen Presentation. Front. Immunol. 2017, 8, 292. [Google Scholar] [CrossRef] [Green Version]
  20. Langton, D.J.; Bourke, S.C.; Lie, B.A.; Reiff, G.; Natu, S.; Darlay, R.; Burn, J.; Echevarria, C. The influence of HLA genotype on the severity of COVID-19 infection. HLA 2021, 98, 14–22. [Google Scholar] [CrossRef]
  21. Migliorini, F.; Torsiello, E.; Spiezia, F.; Oliva, F.; Tingart, M.; Maffulli, N. Association between HLA genotypes and COVID-19 susceptibility, severity and progression: A comprehensive review of the literature. Eur. J. Med. Res. 2021, 26, 84. [Google Scholar] [CrossRef]
  22. Gustiananda, M. What do T cells see in SARS-CoV2? Indones. J. Life Sci. 2020, 2, 29–43. [Google Scholar] [CrossRef]
  23. Zhao, J.; Zhao, J.; Mangalam, A.K.; Channappanavar, R.; Fett, C.; Meyerholz, D.K.; Agnihothram, S.; Baric, R.S.; David, C.S.; Perlman, S. Airway Memory CD4+ T Cells Mediate Protective Immunity against Emerging Respiratory Coronaviruses. Immunity 2016, 44, 1379–1391. [Google Scholar] [CrossRef] [PubMed]
  24. Lee, C.H.; Pinho, M.P.; Buckley, P.R.; Woodhouse, I.B.; Ogg, G.; Simmons, A.; Napolitani, G.; Koohy, H. Potential CD8+ T Cell Cross-Reactivity against SARS-CoV-2 Conferred by Other Coronavirus Strains. Front. Immunol. 2020, 11, 579480. [Google Scholar] [CrossRef]
  25. Gustiananda, M.; Sulistyo, B.P.; Agustriawan, D.; Andarini, S. Immunoinformatics Analysis of SARS-CoV-2 ORF1ab Polyproteins to Identify Promiscuous and Highly Conserved T-Cell Epitopes to Formulate Vaccine for Indonesia and the World Population. Vaccines 2021, 9, 1459. [Google Scholar] [CrossRef] [PubMed]
  26. Gonzalez-Galarza, F.F.; McCabe, A.; Dos Santos, E.J.M.; Jones, J.; Takeshita, L.; Ortega-Rivera, N.D.; Del Cid-Pavon, G.M.; Ramsbottom, K.; Ghattaoraya, G.; Alfirevic, A.; et al. Allele frequency net database (AFND) 2020 update: Gold-standard data classification, open access genotype data and new query tools. Nucleic Acids Res. 2019, 48, D783–D788. [Google Scholar] [CrossRef] [PubMed]
  27. Stranzl, T.; Larsen, M.V.; Lundegaard, C.; Nielsen, M. NetCTLpan: Pan-specific MHC class I pathway epitope predictions. Immunogenetics 2010, 62, 357–368. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Calis, J.J.A.; Maybeno, M.; Greenbaum, J.A.; Weiskopf, D.; De Silva, A.D.; Sette, A.; Keşmir, C.; Peters, B. Properties of MHC Class I Presented Peptides That Enhance Immunogenicity. PLoS Comput. Biol. 2013, 9, e1003266. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Reynisson, B.; Barra, C.; Kaabinejadian, S.; Hildebrand, W.H.; Peters, B.; Nielsen, M. Improved Prediction of MHC II Antigen Presentation through Integration and Motif Deconvolution of Mass Spectrometry MHC Eluted Ligand Data. J. Proteome Res. 2020, 19, 2304–2315. [Google Scholar] [CrossRef]
  30. Dhanda, S.K.; Vir, P.; Raghava, G.P.S. Designing of interferon-gamma inducing MHC class-II binders. Biol. Direct 2013, 8, 30. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Vita, R.; Mahajan, S.; Overton, J.A.; Dhanda, S.K.; Martini, S.; Cantrell, J.R.; Wheeler, D.K.; Sette, A.; Peters, B. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 2019, 47, D339–D343. [Google Scholar] [CrossRef] [Green Version]
  32. Bui, H.-H.; Sidney, J.; Li, W.; Fusseder, N.; Sette, A. Development of an epitope conservancy analysis tool to facilitate the design of epitope-based diagnostics and vaccines. BMC Bioinform. 2007, 8, 361. [Google Scholar] [CrossRef] [Green Version]
  33. Mateus, J.; Grifoni, A.; Tarke, A.; Sidney, J.; Ramirez, S.I.; Dan, J.M.; Burger, Z.C.; Rawlings, S.A.; Smith, D.M.; Phillips, E.; et al. Selective and cross-reactive SARS-CoV-2 T cell epitopes in unexposed humans. Science 2020, 370, 89–94. [Google Scholar] [CrossRef]
  34. Bui, H.-H.; Sidney, J.; Dinh, K.; Southwood, S.; Newman, M.J.; Sette, A. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinform. 2006, 7, 153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Grifoni, A.; Weiskopf, D.; Ramirez, S.I.; Mateus, J.; Dan, J.M.; Moderbacher, C.R.; Rawlings, S.A.; Sutherland, A.; Premkumar, L.; Jadi, R.S.; et al. Targets of T Cell Responses to SARS-CoV-2 Coronavirus in Humans with COVID-19 Disease and Unexposed Individuals. Cell 2020, 181, 1489–1501.e15. [Google Scholar] [CrossRef] [PubMed]
  36. Vardhana, S.; Baldo, L.; Morice, W.G.; Wherry, E.J. Understanding T cell responses to COVID-19 is essential for informing public health strategies. Sci. Immunol. 2022, 7, eabo1303. [Google Scholar] [CrossRef] [PubMed]
  37. Braun, J.; Loyal, L.; Frentsch, M.; Wendisch, D.; Georg, P.; Kurth, F.; Hippenstiel, S.; Dingeldey, M.; Kruse, B.; Fauchere, F.; et al. SARS-CoV-2-reactive T cells in healthy donors and patients with COVID-19. Nature 2020, 587, 270–274. [Google Scholar] [CrossRef]
  38. Le Bert, N.; Tan, A.T.; Kunasegaran, K.; Tham, C.Y.L.; Hafezi, M.; Chia, A.; Chng, M.H.Y.; Lin, M.; Tan, N.; Linster, M.; et al. SARS-CoV-2-specific T cell immunity in cases of COVID-19 and SARS, and uninfected controls. Nature 2020, 584, 457–462. [Google Scholar] [CrossRef]
  39. Pacholczyk, M.; Rieske, P. In Silico Studies Suggest T-Cell Cross-Reactivity between SARS-COV-2 and Less Dangerous Coronaviruses. Res. Sq. 2020. [Google Scholar] [CrossRef]
  40. Kundu, R.; Narean, J.S.; Wang, L.; Fenn, J.; Pillay, T.; Fernandez, N.D.; Conibear, E.; Koycheva, A.; Davies, M.; Tolosa-Wright, M.; et al. Cross-reactive memory T cells associate with protection against SARS-CoV-2 infection in COVID-19 contacts. Nat. Commun. 2022, 13, 80. [Google Scholar] [CrossRef]
  41. Deng, H.; Yan, X.; Yuan, L. Human genetic basis of coronavirus disease 2019. Signal Transduct. Target. Ther. 2021, 6, 344. [Google Scholar] [CrossRef]
  42. Ferretti, A.P.; Kula, T.; Wang, Y.; Nguyen, D.M.; Weinheimer, A.; Dunlap, G.S.; Xu, Q.; Nabilsi, N.; Perullo, C.R.; Cristofaro, A.W.; et al. Unbiased Screens Show CD8+ T Cells of COVID-19 Patients Recognize Shared Epitopes in SARS-CoV-2 that Largely Reside outside the Spike Protein. Immunity 2020, 53, 1095–1107.e3. [Google Scholar] [CrossRef]
  43. Yoshimoto, F.K. The Proteins of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS CoV-2 or n-COV19), the Cause of COVID-19. J. Protein Chem. 2020, 39, 198–216. [Google Scholar] [CrossRef]
  44. Aftab, S.O.; Ghouri, M.Z.; Masood, M.U.; Haider, Z.; Khan, Z.; Ahmad, A.; Munawar, N. Analysis of SARS-CoV-2 RNA-dependent RNA polymerase as a potential therapeutic drug target using a computational approach. J. Transl. Med. 2020, 18, 275. [Google Scholar] [CrossRef] [PubMed]
  45. Shu, T.; Huang, M.; Wu, D.; Ren, Y.; Zhang, X.; Han, Y.; Mu, J.; Wang, R.; Qiu, Y.; Zhang, D.-Y.; et al. SARS-Coronavirus-2 Nsp13 Possesses NTPase and RNA Helicase Activities That Can Be Inhibited by Bismuth Salts. Virol. Sin. 2020, 35, 321–329. [Google Scholar] [CrossRef] [PubMed]
  46. Newman, J.A.; Douangamath, A.; Yadzani, S.; Yosaatmadja, Y.; Aimon, A.; Brandão-Neto, J.; Dunnett, L.; Gorrie-Stone, T.; Skyner, R.; Fearon, D.; et al. Structure, mechanism and crystallographic fragment screening of the SARS-CoV-2 NSP13 helicase. Nat. Commun. 2021, 12, 4848. [Google Scholar] [CrossRef] [PubMed]
  47. Hsu, J.C.-C.; Laurent-Rolle, M.; Pawlak, J.B.; Wilen, C.B.; Cresswell, P. Translational shutdown and evasion of the innate immune response by SARS-CoV-2 NSP14 protein. Proc. Natl. Acad. Sci. USA 2021, 118, e2101161118. [Google Scholar] [CrossRef] [PubMed]
  48. Gong, W.; Mao, Y.; Li, Y.; Qi, Y. BCG Vaccination: A potential tool against COVID-19 and COVID-19-like Black Swan incidents. Int. Immunopharmacol. 2022, 108, 108870. [Google Scholar] [CrossRef] [PubMed]
  49. Seo, S.-U.; Seong, B.-L. Prospects on Repurposing a Live Attenuated Vaccine for the Control of Unrelated Infections. Front. Immunol. 2022, 13, 877845. [Google Scholar] [CrossRef]
  50. Wang, J.; Zhang, Q.; Wang, H.; Gong, W. The Potential Roles of BCG Vaccine in the Prevention or Treatment of COVID-19. Front. Biosci. 2022, 27, 157. [Google Scholar] [CrossRef]
  51. Lundberg, L.; Bygdell, M.; von Feilitzen, G.S.; Woxenius, S.; Ohlsson, C.; Kindblom, J.M.; Leach, S. Recent MMR vaccination in health care workers and COVID-19: A test negative case-control study. Vaccine 2021, 39, 4414–4418. [Google Scholar] [CrossRef]
  52. Sanchez, M.M.; Saleeb, P.; Kottilil, S.; Mathur, P. Oral Polio Vaccine to Protect against COVID-19: Out of the Box Strategies? Open Forum Infect. Dis. 2021, 8, ofab367. [Google Scholar] [CrossRef]
  53. Buckley, P.R.; Lee, C.H.; Pinho, M.P.; Babu, R.O.; Woo, J.; Antanaviciute, A.; Simmons, A.; Ogg, G.; Koohy, H. HLA-dependent variation in SARS-CoV-2 CD8 + T cell cross-reactivity with human coronaviruses. Immunology 2022, 166, 78–103. [Google Scholar] [CrossRef]
  54. Wang, F.; Huang, S.; Gao, R.; Zhou, Y.; Lai, C.; Li, Z.; Xian, W.; Qian, X.; Li, Z.; Huang, Y.; et al. Initial whole-genome sequencing and analysis of the host genetic contribution to COVID-19 severity and susceptibility. Cell Discov. 2020, 6, 83. [Google Scholar] [CrossRef]
  55. Tomita, Y.; Ikeda, T.; Sato, R.; Sakagami, T. Association between HLA gene polymorphisms and mortality of COVID-19: An in silico analysis. Immun. Inflamm. Dis. 2020, 8, 684–694. [Google Scholar] [CrossRef] [PubMed]
  56. Koff, W.C.; Berkley, S.F. A universal coronavirus vaccine. Science 2021, 371, 759. [Google Scholar] [CrossRef] [PubMed]
  57. Morens, D.M.; Taubenberger, J.K.; Fauci, A.S. Universal Coronavirus Vaccines—An Urgent Need. N. Engl. J. Med. 2022, 386, 297–299. [Google Scholar] [CrossRef] [PubMed]
  58. Morens, D.M.; Taubenberger, J.K.; Fauci, A.S. A Centenary Tale of Two Pandemics: The 1918 Influenza Pandemic and COVID-19, Part I. Am. J. Public Health 2021, 111, 1086–1094. [Google Scholar] [CrossRef]
  59. Morens, D.M.; Taubenberger, J.K.; Fauci, A.S. A Centenary Tale of Two Pandemics: The 1918 Influenza Pandemic and COVID-19, Part II. Am. J. Public Health 2021, 111, 1267–1272. [Google Scholar] [CrossRef]
  60. Almofti, Y.A.; Abd-Elrahman, K.A.; Eltilib, E.E.M. Vaccinomic approach for novel multi epitopes vaccine against severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). BMC Immunol. 2021, 22, 22. [Google Scholar] [CrossRef]
  61. Swadling, L.; Diniz, M.O.; Schmidt, N.M.; Amin, O.E.; Chandran, A.; Shaw, E.; Pade, C.; Gibbons, J.M.; Le Bert, N.; Tan, A.T.; et al. Pre-existing polymerase-specific T cells expand in abortive seronegative SARS-CoV-2. Nature 2022, 601, 110–117. [Google Scholar] [CrossRef]
  62. Tan, C.C.; Owen, C.J.; Tham, C.Y.; Bertoletti, A.; van Dorp, L.; Balloux, F. Pre-existing T cell-mediated cross-reactivity to SARS-CoV-2 cannot solely be explained by prior exposure to endemic human coronaviruses. Infect. Genet. Evol. 2021, 95, 105075. [Google Scholar] [CrossRef]
  63. Beans, C. Researchers getting closer to a “universal” flu vaccine. Proc. Natl. Acad. Sci. USA 2022, 119, e2123477119. [Google Scholar] [CrossRef] [PubMed]
  64. Erbelding, E.J.; Post, D.J.; Stemmy, E.J.; Roberts, P.C.; Augustine, A.D.; Ferguson, S.; Paules, C.I.; Graham, B.S.; Fauci, A.S. A Universal Influenza Vaccine: The Strategic Plan for the National Institute of Allergy and Infectious Diseases. J. Infect. Dis. 2018, 218, 347–354. [Google Scholar] [CrossRef] [PubMed]
  65. Lawton, G. Sights set on universal vaccine. New Sci. 2021, 249, 8–9. [Google Scholar] [CrossRef]
Figure 1. HLA Allele of Indonesian population with frequency equal or more than 5% identified from Allele Frequency Net Database. A total of 21 alleles were identified: 6 HLA Class I A, 8 HLA Class I B, and 6 HLA Class II DRB1 alleles.
Figure 1. HLA Allele of Indonesian population with frequency equal or more than 5% identified from Allele Frequency Net Database. A total of 21 alleles were identified: 6 HLA Class I A, 8 HLA Class I B, and 6 HLA Class II DRB1 alleles.
Viruses 14 02328 g001
Figure 2. Number of predicted CTL epitopes with %Rank less than 1% from SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF4 (E), ORF5 (M), and ORF9 (N)) with previously identified HLA Class I alleles. A total of 1599 bindings were predicted through NetCTLpan 1.1 with the highest binding recorded by HLA-B*15:21 and the least by HLA-B*44:03, 136 and 75, respectively. ORF1ab is represented by blue color; ORF2 (S) is represented by red color; ORF4 (E) is represented by yellow color; ORF5 (M) is represented by green color; ORF9 (N) is represented by pink color.
Figure 2. Number of predicted CTL epitopes with %Rank less than 1% from SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF4 (E), ORF5 (M), and ORF9 (N)) with previously identified HLA Class I alleles. A total of 1599 bindings were predicted through NetCTLpan 1.1 with the highest binding recorded by HLA-B*15:21 and the least by HLA-B*44:03, 136 and 75, respectively. ORF1ab is represented by blue color; ORF2 (S) is represented by red color; ORF4 (E) is represented by yellow color; ORF5 (M) is represented by green color; ORF9 (N) is represented by pink color.
Viruses 14 02328 g002
Figure 3. Number of predicted HTL epitopes binding with %Rank less than 1% from SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF4 (E), ORF5 (M), and ORF9 (N)) with previously identified HLA Class II DRB1 alleles. A total of 1422 bindings were predicted through NetMHCpan 4.0 with the highest binding recorded by HLA-DRB1*15:02 and the least by HLA-DRB1*11:01, 299 and 170, respectively. ORF1ab is represented by blue color; ORF2 (S) is represented by red color; ORF4 (E) is represented by yellow color; ORF5 (M) is represented by green color; ORF9 (N) is represented by pink color.
Figure 3. Number of predicted HTL epitopes binding with %Rank less than 1% from SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF4 (E), ORF5 (M), and ORF9 (N)) with previously identified HLA Class II DRB1 alleles. A total of 1422 bindings were predicted through NetMHCpan 4.0 with the highest binding recorded by HLA-DRB1*15:02 and the least by HLA-DRB1*11:01, 299 and 170, respectively. ORF1ab is represented by blue color; ORF2 (S) is represented by red color; ORF4 (E) is represented by yellow color; ORF5 (M) is represented by green color; ORF9 (N) is represented by pink color.
Viruses 14 02328 g003
Figure 4. Projected coverage of the vaccine construct containing all epitopes in Table 10 for Indonesian, South East Asia, Europe, and the world population.
Figure 4. Projected coverage of the vaccine construct containing all epitopes in Table 10 for Indonesian, South East Asia, Europe, and the world population.
Viruses 14 02328 g004
Table 1. Highly promiscuous 9-mer CTL epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) that bind to at least five different HLA Class I alleles. Immunogenicity, population coverage, and previous report of IEDB finding of each peptide are also specified.
Table 1. Highly promiscuous 9-mer CTL epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) that bind to at least five different HLA Class I alleles. Immunogenicity, population coverage, and previous report of IEDB finding of each peptide are also specified.
ProteinStart ResiduePeptideHLA Class I AlleleImmunogenicityPopulation Coverage (%)
ORF1ab899WSMATYYLFHLA-A*24:02c,HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05, HLA-B*58:010.0074289.83
ORF1ab915LASHMYCSFHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.1258868.68
ORF1ab1269LVSDIDITFHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:010.228168.68
ORF1ab1581QVVDMSMTYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05−0.130667.15
ORF1ab1585MSMTYGQQFHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.2479168.68
ORF1ab1767VMYMGTLSYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05, HLA-B*44:03, HLA-B*58:01−0.0383190.22
ORF1ab1805MMSAPPAQYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01,0.0859368.68
ORF1ab1822CASEYTGNYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:050.1831967.15
ORF1ab2253MSNLGMPSYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:010.0514381.53
ORF1ab2272STNVTIATYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*58:01−0.0038475.56
ORF1ab2322LVAEWFLAYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05−0.2712681.9
ORF1ab2381RMYIFFASFHLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*38:02, HLA-B*58:010.0876189.49
ORF1ab2383YIFFASFYYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:050.334381.9
ORF1ab2593YVNTFSSTFHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:050.2201967.15
ORF1ab2936NVLEGSVAYHLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05−0.105961.73
ORF1ab3135VPFWITIAYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05−0.0593369.31
ORF1ab3372QTFSVLACYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*58:01−0.0676575.56
ORF1ab3645TVAYFNMVYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*35:05−0.0172368.69
ORF1ab3710LMNVLTLVYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.0171768.68
ORF1ab3945SEFSSLPSY bHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01b, HLA-B*44:03c0.1260269.92
ORF1ab3948SSLPSYAAFHLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.2289687.09
ORF1ab4263STVLSFCAFHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:010.1599268.68
ORF1ab4271FAVDAAKAYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01b, HLA-B*35:050.0341473.5
ORF1ab4757LSFKELLVYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:010.0825468.68
ORF1ab4904RLYYDSMSY aHLA-A*11:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:050.0615973.23
ORF1ab5057MVMCGGSLYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05−0.1234677.56
ORF1ab5245LMIERFVSLHLA-A*02:01c, HLA-A*02:03, HLA-B*15:02, HLA-B*15:21, HLA-B*38:02−0.1131454.17
ORF1ab5267EYADVFHLYHLA-A*24:02, HLA-A*24:07, HLA-A*33:03, HLA-A*34:01, HLA-B*18:010.0044685.01
ORF1ab5532VVYRGTTTY aHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.2153581.53
ORF1ab5678YVFCTVNAL aHLA-A*02:01, HLA-A*02:03, HLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*35:05, HLA-B*38:020.2459369.29
ORF1ab5980SMMGFKMNYHLA-A*11:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05−0.1146277.56
ORF1ab6153HSIGFDYVYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05, HLA-B*58:01−0.2143878.69
ORF1ab6424MMISAGFSL aHLA-A*02:01c, HLA-A*02:03, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*38:020.3889174.8
ORF1ab6433WVYKQFDTYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:050.0648667.15
ORF1ab7000EAFLIGCNYHLA-A*34:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:050.0778167.15
ORF1ab7019YVMHANYIFHLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05−0.1217189.49
ORF2 (S)83LPFNDGVYFHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:050.1176769.31
ORF2 (S)159YSSANNCTFHLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.0495480.83
ORF2 (S)161SANNCTFEYHLA-A*11:01, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:010.1327377.97
ORF2 (S)268YLQPRTFLL aHLA-A*02:01c, HLA-A*02:03, HLA-A*24:02a, HLA-A*24:07, HLA-B*38:020.130574.4
ORF2 (S)368YNSASFSTFHLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05−0.1821789.49
ORF2 (S)686VASQSIIAYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.070968.68
ORF2 (S)690SIIAYTMSL aHLA-A*02:01c, HLA-A*02:03, HLA-A*34:01, HLA-B*15:21, HLA-B*38:02−0.1293548.2
ORF2 (S)868MIAQYTSAL aHLA-A*02:03b, HLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*35:05−0.1876857.95
ORF2 (S)895IPFAMQMAYHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*18:01, HLA-B*35:05−0.3280169.31
ORF5169VATSRTLSY bHLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.1729568.68
ORF9265KAYNVTQAF bHLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:05, HLA-B*58:01−0.0058787.09
a The binding between the peptide and HLA has been experimentally proven by T-cell assay and reported in IEDB; b The binding between the peptide and HLA has been experimentally proven by HLA ligand assay and reported in IEDB; c The binding between the peptide and HLA has been experimentally proven by both T-cell assay and HLA ligand assay and reported in IEDB.
Table 2. Highly immunogenic (>0.35) 9-mer CTL epitopes identified from five different SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF5 (M), and ORF9 (N)).
Table 2. Highly immunogenic (>0.35) 9-mer CTL epitopes identified from five different SARS-CoV-2 proteins (ORF1ab, ORF2 (S), ORF5 (M), and ORF9 (N)).
ProteinStart ResiduePeptideHLA Class I AlleleImmunogenicityPopulation Coverage (%)
ORF1ab44HLKDGTCGLHLA-A*02:030.402347.68
ORF1ab1634YHTTDPSFLHLA-B*38:020.4428910.65
ORF1ab2001TYKPNTWCIHLA-A*24:02 b; HLA-A*24:070.557258.77
ORF1ab2789ITPVHVMSKHLA-A*11:01 b0.4322129.64
ORF1ab3541RTILGSALLHLA-B*58:010.4528511.11
ORF1ab3640LPSLATVAYHLA-B*15:02, HLA-B*15:21, HLA-B*18:01, HLA-B*35:050.3560955.67
ORF1ab4580TVQFCDAMRHLA-A*33:030.3721829.13
ORF1ab6158DYVYNPFMIHLA-A*24:02 b, HLA-A*24:070.5622158.77
ORF1ab6819LEKCDLQNYHLA-B*18:01, HLA-B*44:030.3710128.15
ORF2 (S)1098GTHWFVTQRHLA-A*11:01 c, HLA-A*33:03 b0.3513353.67
ORF2 (S)1208YIKWPWYIWHLA-B*58:010.4252411.11
ORF2 (S)1211WPWYIWLGFHLA-B*18:01, HLA-B*35:050.4167327.22
ORF5 (M)25FLFLTWICLHLA-A*02:01 a0.3539713.07
ORF9 (N)103LSPRWYFYYHLA-A*24:07, HLA-B*58:010.3573445.59
a The binding between the peptide and HLA has been experimentally proven by T-cell assay and reported in IEDB; b The binding between the peptide and HLA has been experimentally proven by HLA ligand assay and reported in IEDB; c The binding between the peptide and HLA has been experimentally proven by both T-cell assay and HLA ligand assay and reported in IEDB.
Table 3. Highly promiscuous 15-mer HTL Epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) that bind to at least three different HLA Class II alleles. IFN-γ score, population coverage, and previous report of IEDB finding of each peptide are also specified.
Table 3. Highly promiscuous 15-mer HTL Epitopes from five different SARS-CoV-2 proteins (ORF1ab, S, E, M, and N) that bind to at least three different HLA Class II alleles. IFN-γ score, population coverage, and previous report of IEDB finding of each peptide are also specified.
ProteinStart ResiduePeptideHLA Class II AlleleIFN-γ ScorePopulation Coverage (%)
ORF1ab457EKVNINIVGDFKLNEHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.07396111787.72
ORF1ab458KVNINIVGDFKLNEE cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1069632487.72
ORF1ab734KAPKEIIFLEGETLPHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2492991487.72
ORF1ab735APKEIIFLEGETLPTHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.08214937887.72
ORF1ab736PKEIIFLEGETLPTE cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.07706931487.72
ORF1ab1053KPTVVVNAANVYLKHHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4537678287.72
ORF1ab1171TNVYLAVFDKNLYDKHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3234233391.86
ORF1ab1349CKSAFYILPSIISNEHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:020.2897654786.38
ORF1ab1350KSAFYILPSIISNEK a,cHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:020.3378483486.38
ORF1ab1363EKQEILGTVSWNLREHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.009606891165.21
ORF1ab1419GARFYFYTSKTTVASHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4881456565.21
ORF1ab1799QQESPFVMMSAPPAQHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3160963386.38
ORF1ab1800QESPFVMMSAPPAQY cHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1547833186.38
ORF1ab1801ESPFVMMSAPPAQYE aHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2104028686.38
ORF1ab1802SPFVMMSAPPAQYELcHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1869164986.38
ORF1ab1908TEQPIDLVPNQPYPN cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1876231187.72
ORF1ab1909EQPIDLVPNQPYPNAHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1561377387.72
ORF1ab2211LEASFNYLKSPNFSK aHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1272225591.86
ORF1ab2212EASFNYLKSPNFSKL cHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3777334191.86
ORF1ab2518LSHFVNLDNLRANNT cHLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2935021586.38
ORF1ab2958DGSIIQFPNTYLEGSHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.08772074487.72
ORF1ab3150TKHFYWFFSNYLKRR cHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3458342465.21
ORF1ab3816STQEFRYMNSQGLLPHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2819859994.14
ORF1ab3817TQEFRYMNSQGLLPPHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4176087894.14
ORF1ab3818QEFRYMNSQGLLPPKHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2543790294.14
ORF1ab4559ENPDILRVYANLGERHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.01172562287.72
ORF1ab4560NPDILRVYANLGERVHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2299356687.72
ORF1ab4561PDILRVYANLGERVR aHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2616146987.72
ORF1ab5018MPNMLRIMASLVLAR cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.3201495387.72
ORF1ab5019PNMLRIMASLVLARK aHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.3913688387.72
ORF1ab5166GLVASIKNFKSVLYY aHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.4310614987.72
ORF1ab5167LVASIKNFKSVLYYQHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.09370702387.72
ORF1ab5168VASIKNFKSVLYYQNHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2982533787.72
ORF1ab5169ASIKNFKSVLYYQNNHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2577596187.72
ORF1ab5450CTERLKLFAAETLKAHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1092091487.72
ORF1ab5451TERLKLFAAETLKATHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.01334088394.14
ORF1ab5769PAEIVDTVSALVYDN cHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3128744465.21
ORF1ab5845VASKILGLPTQTVDSHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.01567835787.72
ORF1ab5846ASKILGLPTQTVDSSHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1214687987.72
ORF1ab6002EEAIRHVRAWIGFDV cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.4231216587.72
ORF1ab6039GVNLVAVPTGYVDTPHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4678467994.14
ORF1ab6087VRIKIVQMLSDTLKNHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.0884665487.72
ORF1ab6516KPVPEVKILNNLGVD cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.09281820987.72
ORF1ab6517PVPEVKILNNLGVDIHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.006884775687.72
ORF1ab6518VPEVKILNNLGVDIAHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2593766487.72
ORF1ab6519PEVKILNNLGVDIAAHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4126788187.72
ORF1ab6750LLDDFVEIIKSQDLS cHLA-DRB1*07:01, HLA-DRB1*11:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.2773323168.03
ORF1ab6751LDDFVEIIKSQDLSV aHLA-DRB1*07:01, HLA-DRB1*11:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.0484113795.26
ORF1ab6752DDFVEIIKSQDLSVV cHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2888269594.14
ORF2 (S)307TVEKGIYQTSNFRVQ cHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.4501787365.21
ORF2 (S)308VEKGIYQTSNFRVQP bHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.2082702665.21
ORF2 (S)309EKGIYQTSNFRVQPT b,cHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.4646757965.21
ORF2 (S)310KGIYQTSNFRVQPTE bHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.3616440265.21
ORF4 (E)58VYSRVKNLNSSRVPDHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1870522187.72
ORF4 (E)59YSRVKNLNSSRVPDL cHLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.4532966787.72
ORF5 (M)163DLPKEITVATSRTLSHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.3619309865.21
ORF5 (M)164LPKEITVATSRTLSYHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:02−0.1702099294.14
ORF5 (M)165PKEITVATSRTLSYY aHLA-DRB1*07:01, HLA-DRB1*12:02, HLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.212104794.14
a The binding between the peptide and HLA has been experimentally proven by T-cell assay and reported in IEDB; b The binding between the peptide and HLA has been experimentally proven by HLA ligand assay and reported in IEDB; c The binding between the peptide and HLA has been experimentally proven by B-cell assay and reported in IEDB.
Table 4. Top 15 of 15-mer HTL epitopes with highest IFN-γ score from different SARS-CoV-2 proteins with %Rank less than 1%.
Table 4. Top 15 of 15-mer HTL epitopes with highest IFN-γ score from different SARS-CoV-2 proteins with %Rank less than 1%.
ProteinStart ResiduePeptideHLA Class II AlleleIFN-γ ScorePopulation Coverage (%)
ORF9 (N)304IAQFAPSASAFFGMS bHLA-DRB1*07:011.061677720.49
ORF9 (N)309PSASAFFGMSRIGMEHLA-DRB1*11:010.948910094.83
ORF9 (N)303QIAQFAPSASAFFGMHLA-DRB1*07:010.8365524420.49
ORF1ab7076GRLIIRENNRVVISSHLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.7985306551.25
ORF9 (N)310SASAFFGMSRIGMEVHLA-DRB1*11:010.797075654.83
ORF1ab7082ENNRVVISSDVLVNN bHLA-DRB1*07:01, HLA-DRB1*15:01, HLA-DRB1*15:020.7923184561.9
ORF1ab7075KGRLIIRENNRVVISHLA-DRB1*15:01, HLA-DRB1*15:02, HLA-DRB1*16:020.7895114651.25
ORF1ab4124WPLIVTALRANSAVKHLA-DRB1*11:01, HLA-DRB1*12020.7856935860.59
ORF1ab1631AFEYYHTTDPSFLGRHLA-DRB1*07:010.7795656420.49
ORF1ab1630EAFEYYHTTDPSFLG bHLA-DRB1*07:010.747345420.49
ORF9 (N)302PQIAQFAPSASAFFGHLA-DRB1*07:010.7235886920.49
ORF2 (S)24LPPAYTNSFTRGVYYHLA-DRB1*07:010.7176004420.49
ORF2 (S)306FTVEKGIYQTSNFRV a,bHLA-DRB1*07:01 a0.7091900320.49
ORF2 (S)681PRRARSVASQSIIAY bHLA-DRB1*07:010.6769325620.49
ORF1ab391SGLKTILRKGGRTIAHLA-DRB1*11:010.674772914.83
a The binding between the peptide and HLA has been experimentally proven by T-cell assay and reported in IEDB; b The binding between the peptide and HLA has been experimentally proven by B-cell assay and reported in IEDB.
Table 5. List of epitopes that are fully conserved in SARS-CoV-2 ancestral sequence and four other Human CCCs. A total of two 9-mer epitopes originated from ORF1ab found to be 100% conserved in all five viruses.
Table 5. List of epitopes that are fully conserved in SARS-CoV-2 ancestral sequence and four other Human CCCs. A total of two 9-mer epitopes originated from ORF1ab found to be 100% conserved in all five viruses.
Epitope SequenceImmunogenicityIdentity (%)HLA that BindsPopulation Coverage (%)PositionLocation
HEFCSQHTM0.03852100HLA-B*18:01, HLA-B*38:02, HLA-B*44:0337.135202—5210
  • SARS-CoV-like RdRp [nsp12]
  • Conserved polymerase motif E
  • Putative RNA binding site
SLAIDAYPL0.0619100HLA-A*02:01 a13.075254—5262
  • SARS-CoV-like RdRp [nsp12]
  • Putative RNA binding site
a The binding between the peptide and HLA has been experimentally proven by both T-cell assay and HLA ligand assay and reported in IEDB.
Table 6. Conservancy analysis of conserved epitopes in SARS-CoV-2 reference sequence against SARS-CoV-2 currently circulating VOCs of delta and omicron variants.
Table 6. Conservancy analysis of conserved epitopes in SARS-CoV-2 reference sequence against SARS-CoV-2 currently circulating VOCs of delta and omicron variants.
StartEpitope
Sequence
Alpha B.1.1.7 (158)Beta B.1.351 (374)Delta B.1.617.2
(1157)
Gamma P.1 (9)Omicron
BA.1.1
(884)
5202HEFCSQHTM100.00% (111/111)100.00% (146/146)100.00% (458/458)100.00% (8/8)100.00%
(119/119)
5254SLAIDAYPL100.00% (111/111)100.00% (146/146)100.00% (458/458)100.00% (8/8)100.00%
(119/119)
Table 7. Conservancy analysis of conserved Epitopes in SARS-CoV-2 against SARS-CoV and MERS-CoV which belong to the same Betacoronavirus family. The different residue is underlined.
Table 7. Conservancy analysis of conserved Epitopes in SARS-CoV-2 against SARS-CoV and MERS-CoV which belong to the same Betacoronavirus family. The different residue is underlined.
SARS-CoV-2SARS-CoVMERS-CoV
5202HEFCSQHTM52105179HEFCSQHTM51875188HEFCSQHTL5196
5254SLAIDAYPL52625230SLAIDAYPL52385239SLAIDAYPL5247
Table 8. 9-mer epitopes originated from OR1ab with identical TCR contact residues between SARS-CoV-2 and Human CCC identified through IEDB Conservancy Analysis Tools, along with information on HLA that binds, population coverage, position and previous IEDB reports on the binding. The TCR contact residue of HLA Class I is located in position 3–8 (marked by underlined). The residue difference across the sequence is indicated with red color text.
Table 8. 9-mer epitopes originated from OR1ab with identical TCR contact residues between SARS-CoV-2 and Human CCC identified through IEDB Conservancy Analysis Tools, along with information on HLA that binds, population coverage, position and previous IEDB reports on the binding. The TCR contact residue of HLA Class I is located in position 3–8 (marked by underlined). The residue difference across the sequence is indicated with red color text.
Epitope Sequence
[Domain Function]
HLA that Binds
[Population Coverage]
229E
[Position]
NL63
[Position]
HKU1
[Position]
OC43
[Position]
5005HLMGWDYPK5013
[RdRp (nsp12), Catalytic residue, Putative inhibitor binding site, Conserved polymerase binding A]
HLA-A*11:01 a
HLA-A*33:03
HLA-A*34:01
(62.81)
KLMGWDYPc
(4676–4684)
MLMGWDYPb
(4651–4659)
VLMGWDYPc
(5066–5074)
VLMGWDYPc
(4978–4986)
5251FVSLAIDAY5259
[RdRp (nsp12), Putative RNA binding site]
HLA-A*34:01
HLA-B*15:02
HLA-B*15:21
HLA-B*35:05
(54.16)
YVSLAIDAb
(4922–4930)
YVSLAIDAb
(4897–4905)
FVSLAIDAb
(5312–5320)
FVSLAIDAb
(5224–5232)
6321SIVCRFDTR6329
(Nsp14, N7-Mtase Active site)
HLA-A*33:03
(29.13)
SIVCRFDTb
(5981–5989)
SIVCRFDTb
(5956–5964)
SIVCRFDTb
(6378–6386)
AVVCRFDTR
(6290–6298)
6844KYTQLCQYL6852
(Coronavirus nsp13 super family)
HLA-A*24:02 a
HLA-A*24:07
(58.77)
KYTQLCQYF  b
(6503–6511)
KYTQLCQYb
(6474–6482)
KYTQLCQYb
(6929–6937)
KYTQLCQYb
(6842–6850)
a The binding between the peptide and HLA has been experimentally proven by HLA ligand assay and reported in IEDB. b Still binds to exactly the same allele with SARS-CoV-2 original epitope sequence according to netCTLpan with %Rank less than 1%. c Still binds to at least one same allele with SARS-CoV-2 original epitope sequence according to netCTLpan with %Rank less than 1%.
Table 9. 15-mer epitopes (9-mer core epitope is highlighted in bold) originated from ORF1ab with identical TCR contact residue between SARS-CoV-2 and Human CCC identified through IEDB Conservancy Analysis Tools, along with information on HLA that binds, population coverage, position and previous IEDB report on the binding. The TCR contact residue (highlighted in bold and underlined) of HLA Class II is located in position 2, 3, 5, 7 and 8 within the 9 mer core. The residue difference across the sequence is indicated with red color font.
Table 9. 15-mer epitopes (9-mer core epitope is highlighted in bold) originated from ORF1ab with identical TCR contact residue between SARS-CoV-2 and Human CCC identified through IEDB Conservancy Analysis Tools, along with information on HLA that binds, population coverage, position and previous IEDB report on the binding. The TCR contact residue (highlighted in bold and underlined) of HLA Class II is located in position 2, 3, 5, 7 and 8 within the 9 mer core. The residue difference across the sequence is indicated with red color font.
Epitope Sequence
[Position]
[Domain Function]
HLA
[Population Coverage]
229E
[Position]
NL63
[Position]
HKU1
[Position]
OC43
[Position]
MNLKYAISAKNRART a
[4934–4948]
[RdRp (nsp12), Putative RNA binding site, Putative inhibitor binding site, Putative nsp7 interaction site, Conserved polymerase binding F]
DRB1*11:01
[4.83]
LNLKYAISGKERART b
[4605–4619]
LNLKYAISGKERART b
[4580–4594]
MNLKYAISAKNRART b
[4995–5009]
MNLKYAISAKNRART b
[4907–4921]
KAVFISPYNSQNAVA
[5832–5846]
[Nsp13, Helicase domain]
DRB1*15:01
DRB1*15:02
DRB1*16:02
[51.25]
KAVFISPYNSQNYVA c
[5504–5518]
KAVFISPYNSQNYVA c
[5479–5493]
SAVFISPYNSQNYVA c
[5892–5906]
KAVFISPYNSQNFAA c
[5804–5818]
AVFISPYNSQNAVAS
[5833–5847]
[Nsp13, Helicase domain]
DRB1*15:01
DRB1*15:02
DRB1*16:02
[51.25]
AVFISPYNSQNYVAAc
[5505–5519]
AVFISPYNSQNYVAS c
[5480–5494]
AVFISPYNSQNYVAK c
[5893–5907]
AVFISPYNSQNFAAK c
[5805–5819]
VFISPYNSQNAVASK a
[5834–5848]
[Nsp13, Helicase domain]
DRB1*15:01
DRB1*15:02
DRB1*16:02
[51.25]
VFISPYNSQNYVAAR
[5506–5520]
VFISPYNSQNYVASR
[5481–5495]
VFISPYNSQNYVAKRc
[5894–5908]
VFISPYNSQNFAAKRc
[5806–5820]
NCNVDRYPANSIVCR
[6311–6325]
[Nsp14, N7-Mtase active site]
DRB1*15:01
DRB1*15:02
[47.35]
NCNVDMYPEFSIVCR
[5971–5985]
NCNVDMYPEFSIVCR
[5946–5960]
NCNVDKYPSNSIVCR b
[6368–6382]
NCNVDKYPPNAVVCR
[6280–6294]
a The binding between the peptide and HLA has been experimentally proven by B cell assay (ELISA) and reported in IEDB. b Still binds to exactly the same allele with SARS-CoV-2 original epitope sequence according to netMHCIIpan and exhibiting strong binding level with %Rank less than 1%. c Still binds to at least one same allele with SARS-CoV-2 original epitope sequence according to netMHCIIpan and exhibiting strong binding level with %Rank less than 1%.
Table 10. Cross-reactive CTL epitopes between SARS-CoV2 and BCG predicted by Blastp. The amino acid substitution is indicated by underlining and TCR contact residue (residue 3–8) is indicated by bold font. n.b. indicates that peptide was predicted as a non-binder to HLA allele.
Table 10. Cross-reactive CTL epitopes between SARS-CoV2 and BCG predicted by Blastp. The amino acid substitution is indicated by underlining and TCR contact residue (residue 3–8) is indicated by bold font. n.b. indicates that peptide was predicted as a non-binder to HLA allele.
SARS-CoV-2 PeptideBCG PeptideProtein Name in SARS-CoV-2Protein Name in BCGHLA AllelePredicted HLA Binding Affinity a
SARS-CoV-2BCG
FVFLVLLPLFVFLALLPFORF2transmembrane ABC transporter ATP-binding proteinHLA-A*02:014.15n.b
HLA-A*34:0110.5411.56
HLA-B*15:02n.b3.92
HLA-B*15:13n.b4.93
HLA-B*15:21n.b4.98
HLA-B*18:01n.b9.84
HLA-B*35:05n.b4.91
WLIVGVALL WLMVGIAILORF3aNUDIX domain-containing proteinHLA-A*02:013.973.53
HLA-A*02:033.553.63
HLA-B*38:02n.b13.42
FLIVGIELLORF3ahypothetical protein N054_02474HLA-A*02:013.972.96
HLA-A*02:033.552.97
WLIAGVVFLORF3amembrane proteinHLA-A*02:013.972.78
HLA-A*02:033.553.06
WLIVGYFAVORF3aMFS transporterHLA-A*02:013.972.74
HLA-A*02:033.552.99
SVLLFLAFVYVLLFVSLVORF4cytochrome c biogenesis proteinHLA-A*02:013.343.72
KLLEQWNLVKLLEEWLAMORF5Isocitrate lyaseHLA-A*02:012.873.74
HLVDFQVTIHLVDFPVTLORF6RD1 region associated proteinHLA-A*02:014.404.41
SLVDSQVTVORF6transmembrane transport protein MmpL7HLA-A*02:014.403.99
HLA-A*02:03n.b3.52
FLAFLLFLVFLAFLDSTIORF7bMFS transporterHLA-A*02:012.343.68
HLA-A*02:032.482.97
FYSKWYIRVGYEKWRIRFORF8SAM-dependent methyltransferaseHLA-A*24:026.187.60
HLA-A*24:077.8310.09
LLLDRLNQLYLLDRLAELORF9alpha-keto acid decarboxylase family proteinHLA-A*02:013.062.29
HLA-A*02:033.502.45
LLLDRLPNLORF9cytochrome P450HLA-A*02:013.062.81
HLA-A*02:033.503.62
LLRDRLAQLORF9inositol monophosphataseHLA-A*02:013.06n.b
HLA-A*02:033.503.26
a The unit of predicted HLA binding affinity score displayed is nM times 10−5.
Table 11. Cross-reactive HTL epitopes between SARS-CoV2 and BCG. The amino acid substitution is indicated by underlining. The core epitope is highlighted in red color and bold font indicating the TCR contact residue.
Table 11. Cross-reactive HTL epitopes between SARS-CoV2 and BCG. The amino acid substitution is indicated by underlining. The core epitope is highlighted in red color and bold font indicating the TCR contact residue.
SARS-CoV-2BCGGene Name in SARS-CoV-2Protein Name in BCGHLA AllelePredicted HLA Binding Affinity *
SARS-CoV-2BCG
VKHVYQLRARSVSPKTDLLYQLASRSVSSTORF7aamidaseHLA-DRB1*11:013.642.89
HLA-DRB1*12:024.38-
* The unit of predicted HLA binding affinity score displayed is nM times 10−5.
Table 12. Cross-reactive CTL epitopes between SARS-CoV2 and MMR vaccine. The amino acid substitution is indicated by underlining and TCR contact residue is indicated by bold font.
Table 12. Cross-reactive CTL epitopes between SARS-CoV2 and MMR vaccine. The amino acid substitution is indicated by underlining and TCR contact residue is indicated by bold font.
MMRSARS-CoV-2 PeptidesMMR PeptidesProtein Name in SARS-CoV-2Protein Name in MMRHLA AllelePredicted HLA Binding Affinity a
SARS-CoV-2MMR
MeaslesSVLLFLAFVYVLLAVLFVORF4hemagglutininHLA-A*02:013.343.77
ILFLALITLVMFLSLIGLORF7ahemagglutininHLA-A*02:014.753.97
HLA-A*02:03-4.31
LLLDRLNQLRLLDRLVRLORF9nucleocapsid proteinHLA-A*02:013.963.66
HLA-A*02:033.50-
MumpsFVFLVLLPLFLLLILLPLORF2small hydrophobic proteinHLA-A*02:014.153.58
HLA-A*34:0110.54-
SVLLFLAFVSLLLFLLYLORF4small hydrophobic proteinHLA-A*02:013.344.67
FLAFLLFLVFLTFLLLIVORF7bsmall hydrophobic proteinHLA-A*02:012.343.82
HLA-A*02:032.483.73
RubellaLLLDRLNQLALLDRLRGVORF9nonstructural capsid protein, partialHLA-A*02:013.064.84
HLA-A*02:033.504.84
a The unit of predicted HLA binding affinity score displayed is nM times 10−5.
Table 13. Potential epitopes for universal vaccine construct. Listed are epitopes that are fully conserved across the coronavirus family and epitopes with identical TCR contact residue across SARS-CoV-2 and Human CCCs. Each of these epitopes was not similar to the human peptides.
Table 13. Potential epitopes for universal vaccine construct. Listed are epitopes that are fully conserved across the coronavirus family and epitopes with identical TCR contact residue across SARS-CoV-2 and Human CCCs. Each of these epitopes was not similar to the human peptides.
Epitope SequenceHLA that BindsImmunogenicityPopulation Coverage of Individual Epitope (%)Similarity to Human Peptides
HEFCSQHTMHLA-B*18:01, HLA-B*38:02, HLA-B*44:030.0385237.13None
SLAIDAYPLHLA-A*02:010.061913.07None
HLMGWDYPKHLA-A*11:01, HLA-A*33:03, HLA-A*34:010.105362.81None
FVSLAIDAYHLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*35:050.0081354.16None
KYTQLCQYLHLA-A*24:02, HLA-A*24:07−0.0686658.77None
MNLKYAISAKNRARTDRB1*11:010.401890094.83None
KAVFISPYNSQNAVADRB1*15:01, DRB1*15:02, DRB1*16:020.472096151.25None
AVFISPYNSQNAVASDRB1*15:01, DRB1*15:02, DRB1*16:020.5370454351.25None
VFISPYNSQNAVASKDRB1*15:01, DRB1*15:02, DRB1*16:020.2236335951.25None
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gustiananda, M.; Julietta, V.; Hermawan, A.; Febriana, G.G.; Hermantara, R.; Kristiani, L.; Sidhartha, E.; Sutejo, R.; Agustriawan, D.; Andarini, S.; et al. Immunoinformatics Identification of the Conserved and Cross-Reactive T-Cell Epitopes of SARS-CoV-2 with Human Common Cold Coronaviruses, SARS-CoV, MERS-CoV and Live Attenuated Vaccines Presented by HLA Alleles of Indonesian Population. Viruses 2022, 14, 2328. https://doi.org/10.3390/v14112328

AMA Style

Gustiananda M, Julietta V, Hermawan A, Febriana GG, Hermantara R, Kristiani L, Sidhartha E, Sutejo R, Agustriawan D, Andarini S, et al. Immunoinformatics Identification of the Conserved and Cross-Reactive T-Cell Epitopes of SARS-CoV-2 with Human Common Cold Coronaviruses, SARS-CoV, MERS-CoV and Live Attenuated Vaccines Presented by HLA Alleles of Indonesian Population. Viruses. 2022; 14(11):2328. https://doi.org/10.3390/v14112328

Chicago/Turabian Style

Gustiananda, Marsia, Vivi Julietta, Angelika Hermawan, Gabriella Gita Febriana, Rio Hermantara, Lidya Kristiani, Elizabeth Sidhartha, Richard Sutejo, David Agustriawan, Sita Andarini, and et al. 2022. "Immunoinformatics Identification of the Conserved and Cross-Reactive T-Cell Epitopes of SARS-CoV-2 with Human Common Cold Coronaviruses, SARS-CoV, MERS-CoV and Live Attenuated Vaccines Presented by HLA Alleles of Indonesian Population" Viruses 14, no. 11: 2328. https://doi.org/10.3390/v14112328

APA Style

Gustiananda, M., Julietta, V., Hermawan, A., Febriana, G. G., Hermantara, R., Kristiani, L., Sidhartha, E., Sutejo, R., Agustriawan, D., Andarini, S., & Parikesit, A. A. (2022). Immunoinformatics Identification of the Conserved and Cross-Reactive T-Cell Epitopes of SARS-CoV-2 with Human Common Cold Coronaviruses, SARS-CoV, MERS-CoV and Live Attenuated Vaccines Presented by HLA Alleles of Indonesian Population. Viruses, 14(11), 2328. https://doi.org/10.3390/v14112328

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop