Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades

Patro, L Ponoop Prasad and Sathyaseelan, Chakkarai and Uttamrao, Patil Pranita and Rathinavelan, Thenmalarchelvi (2021) Global variation in SARS-CoV-2 proteome and its implication in pre-lockdown emergence and dissemination of 5 dominant SARS-CoV-2 clades. Infection, Genetics and Evolution, 93 (104973). ISSN 15671348

[img] Text

Download (9MB)


SARS-CoV-2 is currently causing major havoc worldwide with its efficient transmission and propagation. To track the emergence as well as the persistence of mutations during the early stage of the pandemic, a comparative analysis of SARS-CoV-2 whole proteome sequences has been performed by considering manually curated 31,389 whole genome sequences from 84 countries. Among the 7 highly recurring (percentage frequency≥10%) mutations (Nsp2:T85I, Nsp6:L37F, Nsp12:P323L, Spike:D614G, ORF3a:Q57H, N protein:R203K and N protein:G204R), N protein:R203K and N protein: G204R are co-occurring (dependent) mutations. Nsp12:P323L and Spike:D614G often appear simultaneously. The highly recurring Spike:D614G, Nsp12:P323L and Nsp6:L37F as well as moderately recurring (percentage frequency between ≥1 and <10%) ORF3a:G251V and ORF8:L84S mutations have led to4 major clades in addition to a clade that lacks high recurring mutations. Further, the occurrence of ORF3a:Q57H&Nsp2:T85I, ORF3a:Q57H and N protein:R203K&G204R along with Nsp12:P323L&Spike:D614G has led to 3 additional sub-clades. Similarly, occurrence of Nsp6:L37F and ORF3a:G251V together has led to the emergence of a sub-clade. Nonetheless, ORF8:L84S does not occur along with ORF3a:G251V or Nsp6:L37F. Intriguingly, ORF3a:G251V and ORF8:L84S are found to occur independent of Nsp12:P323L and Spike:D614G mutations. These clades have evolved during the early stage of the pandemic and have disseminated across several countries. Further, Nsp10 is found to be highly resistant to mutations, thus, it can be exploited for drug/vaccine development and the corresponding gene sequence can be used for the diagnosis. Concisely, the study reports the SARS-CoV-2 antigens diversity across the globe during the early stage of the pandemic and facilitates the understanding of viral evolution.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Patro, L Ponoop PrasadUNSPECIFIED
Sathyaseelan, ChakkaraiUNSPECIFIED
Uttamrao, Patil PranitaUNSPECIFIED
Rathinavelan, Thenmalarchelvi
Item Type: Article
Uncontrolled Keywords: Highly recurring mutations; Moderately recurring mutations; Mutational susceptibility; Phyloproteomics; Proteome analysis; SARS-CoV-2 viromics
Subjects: Others > Biotechnology
Divisions: Department of Biotechnology
Depositing User: . LibTrainee 2021
Date Deposited: 05 Jul 2021 04:58
Last Modified: 05 Jul 2021 04:59
Publisher URL:
OA policy:
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 8081 Statistics for this ePrint Item