Next Article in Journal
Anti-U11/U12 Antibodies as a Rare but Important Biomarker in Patients with Systemic Sclerosis: A Narrative Review
Next Article in Special Issue
Interest of Absolute Eosinopenia as a Marker of Influenza in Outpatients during the Fall-Winter Seasons 2016–2018 in the Greater Paris Area: The SUPERFLUOUS Study
Previous Article in Journal
Perfusion Computed Tomography May Help in Discriminating Gastrointestinal Tuberculosis and Crohn’s Disease
Previous Article in Special Issue
Head-to-Head Comparison of Nasopharyngeal, Oropharyngeal and Nasal Swabs for SARS-CoV-2 Molecular Testing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Brief Report

SARS-CoV-2 Variants of Concern: Presumptive Identification via Sanger Sequencing Analysis of the Receptor Binding Domain (RBD) Region of the S Gene

by
Grazielle Motta Rodrigues
1,2,
Fabiana Caroline Zempulski Volpato
3,4,
Priscila Lamb Wink
3,4,
Rodrigo Minuto Paiva
2,
Afonso Luís Barth
3,4 and
Fernanda de-Paris
1,2,3,*
1
Residência Multiprofissional em Saúde e em Área Profissional da Saúde do Hospital de Clínicas de Porto Alegre, Porto Alegre 90035-903, Rio Grande do Sul, Brazil
2
Serviço de Diagnóstico Laboratorial, Hospital de Clínicas de Porto Alegre, Porto Alegre 90035-903, Rio Grande do Sul, Brazil
3
LABRESIS–Laboratório de Pesquisa em Resistência Bacteriana, Hospital de Clínicas de Porto Alegre, Porto Alegre 90035-903, Rio Grande do Sul, Brazil
4
Programa de Pós-Graduação em Ciências Farmacêuticas, Universidade Federal do Rio Grande do Sul, Porto Alegre 90160-093, Rio Grande do Sul, Brazil
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(7), 1256; https://doi.org/10.3390/diagnostics13071256
Submission received: 26 December 2022 / Revised: 7 February 2023 / Accepted: 8 February 2023 / Published: 27 March 2023
(This article belongs to the Special Issue Diagnosis of Viral Respiratory Infections)

Abstract

:
Variants of concern (VOCs) of SARS-CoV-2 are viral strains that have mutations associated with increased transmissibility and/or increased virulence, and their main mutations are in the receptor binding domain (RBD) region of the viral spike. This study aimed to characterize SARS-CoV-2 VOCs via Sanger sequencing of the RBD region and compare the results with data obtained via whole genome sequencing (WGS). Clinical samples (oro/nasopharyngeal) with positive RT-qPCR results for SARS-CoV-2 were used in this study. The viral RNA from SARS-CoV-2 was extracted and a PCR fragment of 1006 base pairs was submitted for Sanger sequencing. The results of the Sanger sequencing were compared to the lineage assigned by WGS using next-generation sequencing (NGS) techniques. A total of 37 specimens were sequenced via WGS, and classified as: VOC gamma (8); delta (7); omicron (10), with 3 omicron specimens classified as the BQ.1 subvariant and 12 specimens classified as non-VOC variants. The results of the partial Sanger sequencing presented as 100% in agreement with the WGS. The Sanger protocol made it possible to characterize the main SARS-CoV-2 VOCs currently circulating in Brazil through partial Sanger sequencing of the RBD region of the viral spike. Therefore, the sequencing of the RBD region is a fast and cost-effective laboratory tool for clinical and epidemiological use in the genomic surveillance of SARS-CoV-2.

1. Introduction

SARS-CoV-2′s rapid worldwide spread has caused the emergence of new lineages of great importance for the pandemic scenario. The new lineages may present an accelerated rate of transmission which results in a continuous and rapid process of emergence of other new mutant variants [1]. Until now, there were five SARS-CoV-2 variants of concern (VOCs) as defined by the World Health Organization (WHO); these variants have mutations with functional significance in the S gene in common, responsible for expressing the glycoprotein spike [2,3].
Based on the sequences deposited in the National Genomics Data Center (NGDC) of China, more than 27,500 mutations in the S gene have already been identified and documented which may cause changes in some sites of the spike protein amino acid sequence. Considering these genomic alterations, more than 7000 mutations cause some alteration in the sequence of the receptor binding domain (RBD) of the spike protein [4]. The significance of the accumulation of mutations in the spike protein, especially in the RBD region, is due to the important role that this region plays in the main process of virus entrance into the host cells. In fact, the entrance process is mediated by the affinity of the SARS-CoV-2 RBD region with the angiotensin-converting enzyme 2 receptor (ACE2). The RBD is also important due to the fact that it is the main target of neutralizing antibodies. Therefore, the fixation of mutations in the spike protein allows an important advantage in the virus’s replication rate and is usually used for variant determination [5,6].
The evaluation of SARS-CoV-2 mutations and their lineage determination is carried out via whole genome sequencing (WGS) using next-generation sequencing (NGS) methods, which allows for phylogenetic assignment [1,7]. The NGS gives accurate information of genetic variability of the virus, as it is able to generate comprehensive genomic data which allows for tracing of the origin and spread of the virus, besides monitoring its evolution [8]. However, WGS is a time-consuming assay, the data analysis is complex, and it is an expensive technique, especially in resource-limited settings. In this sense, it is important to evaluate alternative techniques for identifying SARS-CoV-2 variants and carrying out genomic surveillance of VOCs [9]. The Sanger sequencing technique is considered the standard method for short nucleotide sequence determination, it is available to many labs, it is less expensive and faster than NGS, and it can run samples individually [10]. These features allow Sanger sequencing to be used as a screening method for detecting SARS-CoV-2 mutations and generating data of importance for public health and surveillance systems [10,11].
The VOCs carry their main mutations in the RBD region of the spike protein, which indicates that a technique capable of predicting VOCs based on the mutational profile of this region could contribute to genomic surveillance [5]. In this context, we proposed a simplified Sanger sequencing assay of the RBD region in the S gene to presumptively characterize all SARS-CoV-2 variants of concern described until now.

2. Materials and Methods

2.1. Clinical Specimens and Ethical Statement

Thirty-seven oro/nasopharyngeal swabs with positive results for SARS-CoV-2 according to the RT-qPCR protocol contained in the Centers for Disease Control and Prevention (CDC) guidelines were included in this study [12]. As part of a genomic surveillance research, all of the RNA sequences from the clinical specimens were submitted for whole genome sequencing (WGS). This study was approved by the Ethics Committee of the Hospital de Clínicas de Porto Alegre, CAAE number: 48879321000005327.

2.2. In Silico Analysis

In order to evaluate the capability for distinguishing between VOCs and non-VOCs sequences using only the RDB region analysis, we use an in silico approach. Sequences from alpha, beta, gamma, delta, and omicron VOCs and some omicron subvariants (BA.1, BA.2, BA.4, BA.5 and BQ.1) were downloaded from the GISAID Database. Sequences were selected based on whether their status was both “complete” (>29,000 nucleotides) and classified as “high coverage” (<0.05% of unique amino acid mutations). These sequences were aligned in the BioEdit and CodonCode Aligner software programs to the SARS-CoV-2 reference with the names NC_045512.2 (complete genome), NC_045512.2:21563-25384 (gene S), and NC_045512.2:22517-23522 (representing the 1006 bp fragment amplified via PCR).

2.3. RNA Extraction and cDNA Synthesis

The RNA was extracted from the clinical samples using the commercially available QIAamp Viral RNA Kit (Qiagen, Valencia, CA, USA) according to the manufacturer’s instructions. Reverse transcription was performed using GoScript™ Reverse Transcriptase (Promega, Madison, WI, USA) with an optimized half-reaction: 0.5 μL of a random primer was incubated with 2 μL of RNA at 70 °C for 5 min and was afterwards quickly chilled on ice for 5 min. This mixture was added to a reverse transcription mix containing 2 μL of GoScript™ 5X Reaction Buffer, 0.6 μL of MgCl2 (25 mM), 0.5 μL of PCR Nucleotide Mix, 3.65 μL of Nuclease-Free Water, 0.25 μL of Recombinant RNasin® Ribonuclease Inhibitor, and 0.5 of μL GoScript™ Reverse Transcriptase. The product, which had a final volume of 10 μL, was submitted to a temperature of 25 °C for 5 min (annealing), followed by 42 °C for 60 min (cDNA synthesis), and then was heated to 70 °C for 15 min (inactivation of reverse transcriptase).

2.4. RBD Polymerase Chain Reaction (PCR) Amplification and Sanger Sequencing

The PCR for the RBD region was performed using the primers 75L (5′-AGAGTCCAACCAACAGAATCTATTGT-3′) and 77R (5′-CAGCCCCTATTAAACAGCCTGC-3′) designed by ARTIC protocol [13]. The predicted PCR amplicon with these primers is a 1006 bp product flanking the RBD region of the spike protein of the SARS-CoV-2 virus. The PCR was prepared as described by Dorlas et al. [11], and the products were analyzed in 1% agarose gel electrophoresis (40 min at 110 v). The products were purified with ExoSAP-ITTM PCR Product Cleanup (Affymetrix, Santa Clara, CA, USA) according to the manufacturer’s protocol. The final product was used in the Sanger sequencing that was carried out with the BigDye™ Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA). The sequencing was processed in an ABI 3500 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA).

2.5. Limit of Detection (LoD) and Repeatability

To determine the limit of detection, we evaluated a serial dilution (from 1:1 to 1:10,000) of SARS-CoV-2 positive samples in a RT-qPCR assay and used a standard curve to quantify the viral load, as previously described by Wink et al. [14]. The same dilutions of SARS-CoV-2 positive specimens were submitted for RBD PCR. The final dilution determined as the LoD was submitted a series of 20 parallel PCRs to establish repeatability.

2.6. Bi-Directional Sanger Sequencing Analysis

The data obtained from Sanger sequencing were aligned with the SARS-CoV-2 reference sequence (NC_045512.2) and gene S (NC_045512.2:21563-25384) using the ClustalW multiple method with BioEdit Alignment Editor software v.7.2 and CodonCode Aligner v.10 software. The quality of the sequencing data was assessed using Sanger electropherograms of both forward and reverse sequences. The prediction of VOCs was evaluated according to the presence/absence of SNVs (single nucleotide variants) in comparison with the reference sequence (Table 1). The pairwise sequence alignment score was obtained using the ClustalW multiple method using BioEdit software. The mutated base quality was analyzed using the Phred quality values, calculated using CodonCode Aligner software, that represent the probability of error for each base call. The quality values analyzed considered the Phred score for both the forward and the reverse sequences and the consensus scores for the two sequences.

3. Results

3.1. In Silico Analysis

In the in silico analysis, all VOCs could be differentiated based on the 75L/77R 1006 bp fragment (Figure 1). Furthermore, the omicron subvariants BA.2 and BQ.1 could be distinguished from omicron BA.1. The BA.4 and BA.5 omicron subvariants could not be differentiated based on their 75L/77R fragments. This fragment comprises the mutations present between nucleotides 22,517 and 23,521 of the SARS-CoV-2 genome. All VOCs’ mutations and omicron subvariants’ mutations investigated in this nucleotide interval are listed in Table 1 and Table 2, respectively, along with the main established nomenclature systems for the VOCs.

3.2. Sanger Sequencing and WGS Results

We successfully sequenced the RBD region from the 37 clinical samples using the Sanger sequencing technique. It was possible to identify the gamma (8/37), delta (7/37), and omicron (10/37) VOCs, while 12 samples were classified as non-VOCs. Moreover, among the omicron variants, it was also possible to identify the subvariant BQ.1 (3/10). Among the non-VOCs sequenced, it was possible to differentiate the zeta (5/12) and lambda (1/12) variants of interest (VOI). It was not possible to distinguish the other non-VOC linages by sequencing the 75L/77R 1006 bp fragment. The prediction of SARS-CoV-2 VOC and non-VOCs using Sanger sequencing presented as 100% in concordance with the results generated from WGS. The SNVs identified and the presumptive categorization from the 37 clinical samples and the lineage assigned via WGS are shown in Table 3.

3.3. Sanger Sequencing Quality

We found that the 1006 bp amplicon was able to cover the entire RBD region with a high degree of quality in the Sanger sequencing results when we performed the bi-directional analysis. The protocol described in this study was able to generate fragments with an average of 886 and 867 bases of the forward and reverse sequences, respectively, with a high degree of quality. Analysis of the alignment with the reference sequence showed that the average percentage of matches was 95% for the forward sequence and 93% for the reverse sequence. The average of the quality of the base calls was 52 for the forward sequences and 56 for the reverse sequences, which means that there was only about a one in 100,000 chance that the base call was incorrect. When we analyzed the forward and reverse sequences together (bi-directional sequencing), the base call quality consensus increased to 88, which means that there was about a one in 100,000,000 chance that the base call was incorrect, demonstrating the high quality and fidelity of the base call. It was possible to obtain sequencing results using the Sanger technique with only the forward or the reverse sense for seven of the 37 specimens, even when the procedure was repeated. Although these samples gave rise to only one sequencing sense, it was possible to identify a VOC along the sequence with a high base call quality score.

3.4. Limit of Detection

The limit of detection for the RBD region generated via PCR was determined as shown in Table 4. The lower viral load limit for performing the RBD PCR was around 500,000 copies/uL (dilution 1:100). This viral load corresponds to a cycle threshold (Ct) of around 20 in the RT-qPCR from the CDC protocol. All specimens included in this study had a RT-qPCR Ct value, as defined in the CDC protocol for detecting SARS-CoV-2, of less than 20, and specimens that had Ct values higher than 20 were not amplified using the RBD PCR protocol. At these lower copy detection limits, a repeatability assay of 20 parallel RBD PCRs was performed and 95% (19/20) of these were amplified.

4. Discussion

In this study, we proposed an approach that allows for the performance of genomic surveillance of VOCs based on an analysis of the RBD region in the S gene of SARS-CoV-2 using partial and bi-directional Sanger sequencing. We found that the Sanger sequencing results of the 75L/77R 1006 bp fragment presented as 100% in agreement with WGS for lineage determination. This protocol was initially developed when only the alpha (B.1.1.7), beta (B.1.351), and gamma (P.1) variants were circulating around the world. Later on, the protocol was applied to also detect the delta (B.1.617.2) and omicron (B.1.1.529) variants, including the omicron subvariant BQ.1. In this study, we show that it is possible to identify these VOCs and differentiate between them and differentiate them from non-VOCs using only one PCR fragment.
The Sanger protocol was also able to identify some non-VOC variants by confirming the absence of the mutations of concern described in Table 1 or by confirming the presence of only D614G mutations in the analyzed region. The protocol was also able, without the need for extra adaptation, to detect additional VOCs not described when the technique was originally developed, showing that this method can be used as a generic approach to target specific mutations to distinguish other potential VOCs that may appear in the future. Although the VOCs share some identical mutations, each variant has a unique combination of mutations which generates a specific mutational profile in the RBD region [15,16].
The RBD region is composed of 749 nucleotides and the concentration of the lineage-defining mutations in this region allows for analysis via Sanger sequencing, which supports the generation of sequences up to 1000 bp [17,18]. In general, longer fragments are challenging to use due the difficulty of using Sanger sequencing to distinguish single base pair differences at the end of fragments up to 900 bp long and the loss of the first 15–40 bases due to primer binding [17]. Bi-directional sequencing helps to improve the analysis efficiency for longer fragments, such as 75L/77R, and enables the analysis of mutations that are located at the beginning or at the end of the fragment with a higher degree of quality. Among the mutations identified via Sanger sequencing in the RBD region, the G22578A mutation which leads to the spike G339D mutation in the omicron variant is located at the beginning of the fragment used in our study. In all omicron samples it was possible to identify this mutation, but the maximum base call quality score obtained for the consensus sequences was 47. Despite the fact that there is a low probability of an incorrect base call, this low score indicates that this mutation is located in a fragment region that has a lower quality of sequencing than other regions of the amplicon. However, this low score does not compromise the identification of omicron variant, as its RBD mutational profile is very different from the other VOCs [19].
The unique mutational profile of the omicron variant allows it to be easily distinguished from alpha, beta, gamma, delta, and non-VOC variants, but the emergence of omicron descendents’ lineages makes the differentiation between omicron variants and their subvariants via the RBD region difficult. However, a mutational profile change in the omicron subvariants allows for differentiation of the descendents’ lineages from omicron via the RBD region based on the presence of the A22688G, G22775A, and A22786C mutations (that lead to the T376A, D405N, and R408S spike mutations, respectively). These mutations are absent in the omicron BA.1 variant and are an indication that the sample is an omicron subvariant [20]. In this way, the BA.2 and BQ.1 omicron subvariants that have been circulating the most recently in Brazil can be distinguished via key mutations in the RBD region [21]. Nevertheless, the BA.4 and BA.5 variants cannot be differentiated via only the RBD analysis due their identical spike sequences, as shown in Table 2 [3,20]. In our study, we also sequenced the RBD region from the BQ.1 omicron subvariant, and differentiation between the other subvariants was possible due to the A22893C change (that leads to the spike K444T mutation), which is not present in the omicron variants or in other omicron subvariants.
Due to the large size of the fragment 75L/77R, a high viral load is required to provide reliable results from Sanger sequencing. Based on a comparative analysis, the LOD was determined as a minimal viral load of 500,000 copies/uL using an adapted RT-qPCR CDC protocol [14], corresponding to Ct values of up to 20, the same described in another study [22]. This viral load is necessary to generate a proper sequencing electropherogram that allows for an accurate analysis with no ambiguous base calls. The need for a high viral load seems not to pose a problem, as most of the Ct values in symptomatic SARS-CoV-2-positive patients are lower than 20, especially when the clinical samples are collected within 10 days since symptom onset [23,24]. Moreover, the high viral load does not prevent performance of the test, as greater risk of transmission, as in an outbreak scenario, is associated with increased viral load values and positive relationships between viral load and infectiousness [25].
The Sanger sequencing protocol proposed in this study is able to sequence a 1006 bp fragment using a bi-directional approach with high accuracy to distinguish among VOCs via analysis of the RBD region of the S gene from the SARS-CoV-2 virus, generating a result in 100% agreement with the lineage definition generated via WGS. This approach allows for the identification of mutational profiles of SARS-CoV-2 VOCs from an individual sample with a lower time burden and a lower cost in comparison with the WGS techniques. Of note is the fact that this is possible using a unique PCR fragment [11]. Considering the emergence of new VOCs with mutations in the RBD portion, this protocol can be applied to predict VOCs and discriminate amongst them. Hence, Sanger sequencing can be used as an important tool for the screening and identification of VOCs to provide data for the genomic surveillance of SARS-CoV-2. Furthermore, clinical assistance could be improved, as the rapid results of Sanger sequencing is useful for the differentiation between re-infection versus persistent infections in SARS-CoV-2-positive patients during prolonged periods, for example.

5. Conclusions

The fast and accessible determination of VOCs is an essential tool for SARS-CoV-2 genomic surveillance. In this study, Sanger sequencing of the RBD region was shown to be in agreement with SARS-Cov-2 lineage assigned via WGS. This analysis of the RBD Sanger sequencing was capable of detecting all five VOCs already described by the WHO. Therefore, Sanger sequencing of the RBD region is a potential applicable and cost-effective laboratory tool for clinical and epidemiological use in the genomic surveillance of SARS-CoV-2.

Author Contributions

Conceptualization, F.d.-P.; methodology and validation, G.M.R.; formal analysis, G.M.R. and F.d.-P.; resources, A.L.B. and R.M.P.; data curation, G.M.R. and F.d.-P.; writing—original draft preparation G.M.R.; writing—review and editing, F.d.-P., F.C.Z.V., P.L.W., R.M.P. and A.L.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fundo de Incentivo à Pesquisa e Eventos (FIPE) from the Hospital de Clínicas de Porto Alegre (HCPA), number 2021-0299.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved in 21 July 2021 by the Ethics Committee of the Hospital de Clínicas de Porto Alegre, CAAE number: 48879321000005327.

Informed Consent Statement

Patient consent was waived due to the retrospective design of this study.

Data Availability Statement

The raw data supporting the findings of this study are available from the corresponding author on reasonable request, including the raw sequencing data. All other data generated and analyzed during the realization of this study are included in this paper.

Acknowledgments

We would like to thank to Richard Steiner Salvato and Tatiana Shaffer Greguanini from Centro Estadual de Vigilância em Saúde do Rio Grande do Sul and the Genetica e Biologia Molecular laboratory from Hospital Moinhos de Vento for kindly providing specimens of the delta and omicron variants.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cascella, M.; Rajnik, A.; Aleem, S.C.; Dulebohn, R. Di Napoli. Features, Evaluation, and Treatment of Coronavirus (COVID-19). In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2022. Available online: http://www.ncbi.nlm.nih.gov/books/NBK554776/ (accessed on 12 August 2022).
  2. O’Toole, Á.; Scher, E.; Underwood, A.; Jackson, B.; Hill, V.; McCrone, J.T.; Colquhoun, R.; Ruis, C.; Abu-Dahab, K.; Taylor, B.; et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021, 7, veab064. [Google Scholar] [CrossRef] [PubMed]
  3. WHO. Tracking SARS-CoV-2 Variants. 2022. Available online: https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/ (accessed on 6 June 2022).
  4. Song, S.; Ma, L.; Zou, D.; Tian, D.; Li, C.; Zhu, J.; Chen, M.; Wang, A.; Ma, Y.; Li, M.; et al. The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR. Genom. Proteom. Bioinform. 2020, 18, 749–759. [Google Scholar] [CrossRef] [PubMed]
  5. Boehm, E.; Kronig, I.; Neher, R.A.; Eckerle, I.; Vetter, P.; Kaiser, L.; Geneva Center for Emerging Viral Diseases. Novel SARS-CoV-2 variants: The pandemics within the pandemic. Clin. Microbiol. Infect. Off. Publ. Eur. Soc. Clin. Microbiol. Infect. Dis. 2021, 27, 1109–1117. [Google Scholar] [CrossRef] [PubMed]
  6. Harrison, A.G.; Lin, T.; Wang, P. Mechanisms of SARS-CoV-2 Transmission and Pathogenesis. Trends Immunol. 2020, 41, 1100–1115. [Google Scholar] [CrossRef] [PubMed]
  7. Bechtold, P.; Wagner, P.; Hosch, S.; Siegrist, D.; Ruiz-Serrano, A.; Gregorini, M.; Mpina, M.; Ondó, F.A.; Obama, J.; Ayekaba, M.O.; et al. Rapid Identification of SARS-CoV-2 Variants of Concern Using a Portable peakPCR Platform. Anal. Chem. 2021, 93, 16350–16359. [Google Scholar] [CrossRef] [PubMed]
  8. Hu, B.; Guo, H.; Zhou, P.; Shi, Z.L. Characteristics of SARS-CoV-2 and COVID-19. Nat. Rev. Microbiol. 2021, 19, 141–154. [Google Scholar] [CrossRef] [PubMed]
  9. Chiara, M.; D’Erchia, A.M.; Gissi, C.; Manzari, C.; Parisi, A.; Resta, N.; Zambelli, F.; Picardi, E.; Pavesi, G.; Horner, D.S.; et al. Next generation sequencing of SARS-CoV-2 genomes: Challenges, applications and opportunities. Brief. Bioinform. 2021, 22, 616–630. [Google Scholar] [CrossRef] [PubMed]
  10. Hamilton, A.; Tétreault, M.; Dyment, D.A.; Zou, R.; Kernohan, K.; Geraghty, M.T.; FORGE Canada Consortium; Care4Rare Canada Consortium; Hartley, T.; Boycott, K.M. Concordance between whole-exome sequencing and clinical Sanger sequencing: Implications for patient care. Mol. Genet. Genom. Med. 2016, 4, 504–512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Dorlass, E.G.; Lourenço, K.L.; Magalhães, R.; Sato, H.; Fiorini, A.; Peixoto, R.; Coelho, H.P.; Telezynski, B.L.; Scagion, G.P.; Ometto, T.; et al. Survey of SARS-CoV-2 genetic diversity in two major Brazilian cities using a fast and affordable Sanger sequencing strategy. Genomics 2021, 113, 4109–4115. [Google Scholar] [CrossRef] [PubMed]
  12. CDC. 2019-Novel Coronavirus (2019-nCoV) Real-Time RT-PCR Diagnostic Panel. Centers for Disease Control and Prevention. Available online: https://www.fda.gov/media/134922/download (accessed on 23 October 2022).
  13. ARTICnetwork Project. Available online: https://github.com/artic-network/artic-ncov2019/blob/7e359dae37d894b40ae7e35c3582f14244ef4d36/primer_schemes/nCoV-2019/V3/nCoV-2019.tsv (accessed on 6 June 2021).
  14. Wink, P.L.; Volpato, F.; Lima-Morales, D.; Paiva, R.M.; Willig, J.B.; Bock, H.; Paris, F.; Barth, A.L. RT-qPCR half-reaction optimization for the detection of SARS-CoV-2. Rev. Soc. Bras. Med. Trop. 2021, 54, e03192020. [Google Scholar] [CrossRef] [PubMed]
  15. Lee, S.H. A Routine Sanger Sequencing Target Specific Mutation Assay for SARS-CoV-2 Variants of Concern and Interest. Viruses 2021, 13, 2386. [Google Scholar] [CrossRef] [PubMed]
  16. Papanikolaou, V.; Chrysovergis, A.; Ragos, V.; Tsiambas, E.; Katsinis, S.; Manoli, A.; Papouliakos, S.; Roukas, D.; Mastronikolis, S.; Peschos, D.; et al. From Delta to Omicron: S1-RBD/S2 mutation/deletion equilibrium in SARS-CoV-2 defined variants. Gene 2022, 814, 146134. [Google Scholar] [CrossRef] [PubMed]
  17. Crossley, B.M.; Bai, J.; Glaser, A.; Maes, R.; Porter, E.; Killian, M.L.; Clement, T.; Toohey-Kurth, K. Guidelines for Sanger sequencing and molecular assay monitoring. J. Vet. Diagn. Investig. 2020, 32, 767–775. [Google Scholar] [CrossRef] [PubMed]
  18. S Surface Glycoprotein of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). National Center for Biotechnology Information. Available online: https://www.ncbi.nlm.nih.gov/gene/43740568 (accessed on 6 October 2021).
  19. Kannan, S.; Shaik Syed Ali, P.; Sheeza, A. Omicron (B.1.1.529)-variant of concern-molecular profile and epidemiology: A mini review. Eur. Rev. Med. Pharmacol. Sci. 2021, 25, 8019–8022. [Google Scholar] [CrossRef] [PubMed]
  20. CoVariants. 2022. Available online: https://covariants.org/ (accessed on 10 December 2022).
  21. Boletim Genômico 14-Vigilância Genômica CEVS/SES-RS. Available online: https://coronavirus.rs.gov.br/upload/arquivos/202211/21171734-211122-informe-vigilancia-genomica.pdf (accessed on 10 December 2022).
  22. Daniels, R.S.; Harvey, R.; Ermetal, B.; Xiang, Z.; Galiano, M.; Adams, L.; McCauley, J.W. A Sanger sequencing protocol for SARS-CoV-2 S-gene. Influenza Other Respir. Viruses 2021, 15, 707–710. [Google Scholar] [CrossRef] [PubMed]
  23. Aranha, C.; Patel, V.; Bhor, V.; Gogoi, D. Cycle threshold values in RT-PCR to determine dynamics of SARS-CoV-2 viral load: An approach to reduce the isolation period for COVID-19 patients. J. Med. Virol. 2021, 93, 6794–6797. [Google Scholar] [CrossRef] [PubMed]
  24. Fox-Lewis, A.; Fox-Lewis, S.; Beaumont, J.; Drinković, D.; Harrower, J.; Howe, K.; Jackson, C.; Rahnama, F.; Shilton, B.; Qiao, H.; et al. SARS-CoV-2 viral load dynamics and real-time RT-PCR cycle threshold interpretation in symptomatic non-hospitalised individuals in New Zealand: A multicentre cross sectional observational study. Pathology 2021, 53, 530–535. [Google Scholar] [CrossRef] [PubMed]
  25. Gentile, A.; Juarez, M.; Lucion, M.F.; Pejito, M.N.; Alexay, S.; Orqueda, A.S.; Bollon, L.R.; Mistchenko, A. COVID-19 in Children: Correlation Between Epidemiologic, Clinical Characteristics, and RT-qPCR Cycle Threshold Values. Pediatr. Infect. Dis. J. 2022, 41, 666–670. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Genomic organization of SARS-CoV-2. The RBD region is located inside the S1 subunit from the S gene. The 75L/77R fragment comprises all mutations of concern from the VOCs along the RBD region. The wild type sequence refers to the SARS-CoV-2 reference sequence NC_045512.2.
Figure 1. Genomic organization of SARS-CoV-2. The RBD region is located inside the S1 subunit from the S gene. The 75L/77R fragment comprises all mutations of concern from the VOCs along the RBD region. The wild type sequence refers to the SARS-CoV-2 reference sequence NC_045512.2.
Diagnostics 13 01256 g001
Table 1. Summary of genomic annotation, amino acid changes, and the main established nomenclature systems for the SARS-CoV-2 variants of concern.
Table 1. Summary of genomic annotation, amino acid changes, and the main established nomenclature systems for the SARS-CoV-2 variants of concern.
WHO Variant of ConcernGenomic AnnotationAmino Acid ChangePangolineageNextstrain Clade
AlphaA23063TN501YB.1.1.720I (V1)
C23271AA570D
A23403GD614G
BetaG22813TK417NB.1.35120H (V2)
G23012AE484K
A23063TN501Y
A23403GD614G
GammaA22812CK417TP.120J (V3)
G23012AE484K
A23063TN501Y
A23403GD614G
DeltaT22917GL452RB.1.617.221A, 21I, 21J
C22995AT478K
A23403GD614G
OmicronG22578AG339DB.1.1.52921K, 21L, 21M
T22673CS371L
C22674T
T22679CS373P
C22686TS375F
G22813TK417N
T22882GN440K
G22898AG446S
G22992AS477N
C22995AT478K
A23013CE484A
A23040GQ493R
G23048AG496S
A23055GQ498R
A23063TN501Y
T23075CY505H
C23202AT547K
A23403GD614G
Table 2. Summary with genome annotation and amino acid changes in omicron subvariants BA.1, BA.2, BA.4, BA.5, and BQ.1.
Table 2. Summary with genome annotation and amino acid changes in omicron subvariants BA.1, BA.2, BA.4, BA.5, and BQ.1.
BA.1
Omicron Descendent Lineage
BA.2
Omicron Descendent Lineage
BA.4 and 5
Omicron Descendent Lineage
BQ.1
Omicron Descendent Lineage
Genome AnnotationAmino Acid ChangeGenome AnnotationAmino Acid ChangeGenome AnnotationAmino Acid ChangeGenome AnnotationAmino Acid Change
G22578AG339DG22578AG339DG22578AG339DG22578AG339D
-R346T absent-R346T absent-R346T absentG22599CR346T
-L368I absent-L368I absent-L368I absent-L368I absent
T22673C/
C22674T
S371LC22674TS371FC22674TS371FC22674TS371F
T22679CS373PT22679CS373PT22679CS373PT22679CS373P
C22686TS375FC22686TS375FC22686TS375FC22686TS375F
-T376A absentA22688GT376AA22688GT376AA22688GT376A
-D405N absentG22775AD405NG22775AD405NG22775AD405N
-R408S absentA22786CR408SA22786CR408SA22786CR408S
G22813TK417NG22813TK417NG22813TK417NG22813TK417N
T22882GN440KT22882GN440KT22882GN440KT22882GN440K
-K444T absent-K444T absent-K444T absentA22893CK444T
-V445P absent-V445P absent-V445P absent-V445P absent
G22898AG446S-G446S absent-G446S absent-G446S absent
-L452R/Q absent-L452R/Q absentT22917GL452RT22917GL452R
-N460K absent-N460K absent-N460K absentT22942AN460K
G22992AS477NG22992AS477NG22992AS477NG22992AS477N
C22995AT478KC22995AT478KC22995AT478KC22995AT478K
A23013CE484AA23013CE484AA23013CE484AA23013CE484A
-F486V/S absent-F486V/S absentT23018GF486VT23018GF486V
-F490S absent-F490S absent-F490S absent-F490S absent
A23040GQ493RA23040GQ493R-Q493R absent-Q493R absent
G23048AG496S-G496S absent-G496S absent-G496S absent
A23055GQ498RA23055GQ498RA23055GQ498RA23055GQ498R
A23063TN501YA23063TN501YA23063TN501YA23063TN501Y
T23075CY505HT23075CY505HT23075CY505HT23075CY505H
C23202AT547K-T547K absent-T547K absent-T547K absent
A23403GD614GA23403GD614GA23403GD614GA23403GD614G
Table 3. Clinical samples sequenced for presumptive variant categorization and the previous lineage assigned via WGS, Ct in the RT-qPCR, fragment size obtained including quality, and the pairwise alignment score compared to the reference sequence.
Table 3. Clinical samples sequenced for presumptive variant categorization and the previous lineage assigned via WGS, Ct in the RT-qPCR, fragment size obtained including quality, and the pairwise alignment score compared to the reference sequence.
CDC
RT-qPCR Ct
Fragment Size (bp)Pairwise Sequence Alignment Score SNVs Identified via Sanger SequencingSanger Presumptive IdentificationLineage Assigned via WGS
Sample IDN1N2FWDREVFWDREVPangoLineageWHO VOCs
114.9915.16922ND98%NDA22812C; G23012A; A23063T; A23403GgammaP.1gamma
414.8615.1790483895%89%A23403Gnon-VOCB.1.1.28non-VOC
516.6715.2885881792%92%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
615.0415.4889886897%93%G23012A; A23403Gnon-VOC/zetaP.2non-VOC/zeta
815.2614.1583778194%94%G23012A; A23403Gnon-VOC/zetaP.2non-VOC/zeta
1116.4816.5689884895%91%T22917A; T23031C; A23403Gnon-VOC/lambdaC.37non-VOC/lambda
1213.1813.8591086596%95%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
1314.9713.9494193297%99%T22917A; C22995A; A23403GdeltaB.1.617.2delta
1418.3218.5793692097%97%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
1512.0312.4193392198%98%A23403Gnão-VOCB.1.1.29non-VOC
2214.4415.2492993497%96%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
2318.7117.6591489696%94%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
3013.2912.292892098%95%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
3215.9415.189886897%93%G23012A; A23403Gnon-VOC/zetaP.2non-VOC/zeta
3312.1112.0388789095%95%G23012A; A23403Gnon-VOC/zetaP.2non-VOC/zeta
3418.1818.4888579295%89%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
3513.6513.4188685093%93%A23403Gnon-VOCB.1.1.28non-VOC
3920.1619.14897ND96%ND-non-VOCB.1.1.28non-VOC
4018.4719.7176849495%91%A23403Gnon-VOCB.1.1.28non-VOC
4218.6719.9982677194%93%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
4713.4814.1589389896%94%A22812C; G23012A; A23063T; A23403GgammaP.1gamma
511212.587781094%92%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
521314.387786792%93%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
5810.989.1887786795%95%G23012A; A23403Gnon-VOC/zetaP.2non-VOC/zeta
5919.1519.3270583792%94%A23403Gnon-VOCB.1.1.28non-VOC
6018.0819.3288688994%94%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
6120.8732.8285483792%92%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
6223.7929.0287936995%93%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
6319.8920.4ND692ND94%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
6419.8630.8188083193%92%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
6512.9712.8992890096%95%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
6617.3822.78839ND93%NDG22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
6816.0520.890187392%93%G22578A; T22673C; C22674T; T22679C; C22686T; G22813T; T22882G; G22898A; G22992A; C22995A; A23013C; A23040G; G23048A; A23055G; A23063T; T23075C; C23202A; A23403GomicronB.1.1.529omicron
6919.142987989095%93%T22917A; C22995A; A23403GdeltaB.617.2-likedelta
70NDND897ND93%NDG22599C, C22674T, T22679C, C22686T, A22688G, G22775A, A22786C, G22813T, T22882G, A22893C, T22917G, T22942A, G22992A, C22995A, A23013C, T23018G, A23055G, A23063T, T23075C, A23403Gomicron, sublinage BQ.1BQ.1omicron, sublinage BQ.1
73NDND876ND94%NDG22599C, C22674T, T22679C, C22686T, A22688G, G22775A, A22786C, G22813T, T22882G, A22893C, T22917G, T22942A, G22992A, C22995A, A23013C, T23018G, A23055G, A23063T, T23075C, A23403Gomicron, sublinage BQ.1BQ.1omicron, sublinage BQ.1
74NDND760ND91%NDG22599C, C22674T, T22679C, C22686T, A22688G, G22775A, A22786C, G22813T, T22882G, A22893C, T22917G, T22942A, G22992A, C22995A, A23013C, T23018G, A23055G, A23063T, T23075Comicron, sublinage BQ.1BQ.1omicron, sublinage BQ.1
Ct—cycle threshold; bp—base pairs; N1—nucleocapsid 1; N2—nucleocapsid 2; FWD—forward sense; REV—reverse sense; ND—not determined.
Table 4. Standard curve quantification for determining the sample minimum viral load limit of detection.
Table 4. Standard curve quantification for determining the sample minimum viral load limit of detection.
Serial DilutionTargetCt StandardVL Standard (Copies/uL)Ct SampleVLSample (Copies/uL)RBD PCR
Result
1:1N123.41100,00013.2860,740,000Positive
N223.56100,00015.0415,915,000
1:10N126.8610,00017.254,709,500Positive
N226.8710,00018.481,918,500
1:100N130.1100020.62540,550Positive
N230.53100022.4171,750
1:1000N133.7710025.8818,285Negative
N234.2910027.557252
1:10,000N138.181030.9742Negative
N238.571033.11245
Ct—cycle threshold; VL—viral load; N1—nucleocapsid 1; N2—nucleocapsid 2.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rodrigues, G.M.; Volpato, F.C.Z.; Wink, P.L.; Paiva, R.M.; Barth, A.L.; de-Paris, F. SARS-CoV-2 Variants of Concern: Presumptive Identification via Sanger Sequencing Analysis of the Receptor Binding Domain (RBD) Region of the S Gene. Diagnostics 2023, 13, 1256. https://doi.org/10.3390/diagnostics13071256

AMA Style

Rodrigues GM, Volpato FCZ, Wink PL, Paiva RM, Barth AL, de-Paris F. SARS-CoV-2 Variants of Concern: Presumptive Identification via Sanger Sequencing Analysis of the Receptor Binding Domain (RBD) Region of the S Gene. Diagnostics. 2023; 13(7):1256. https://doi.org/10.3390/diagnostics13071256

Chicago/Turabian Style

Rodrigues, Grazielle Motta, Fabiana Caroline Zempulski Volpato, Priscila Lamb Wink, Rodrigo Minuto Paiva, Afonso Luís Barth, and Fernanda de-Paris. 2023. "SARS-CoV-2 Variants of Concern: Presumptive Identification via Sanger Sequencing Analysis of the Receptor Binding Domain (RBD) Region of the S Gene" Diagnostics 13, no. 7: 1256. https://doi.org/10.3390/diagnostics13071256

APA Style

Rodrigues, G. M., Volpato, F. C. Z., Wink, P. L., Paiva, R. M., Barth, A. L., & de-Paris, F. (2023). SARS-CoV-2 Variants of Concern: Presumptive Identification via Sanger Sequencing Analysis of the Receptor Binding Domain (RBD) Region of the S Gene. Diagnostics, 13(7), 1256. https://doi.org/10.3390/diagnostics13071256

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop