<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<?covid-19-tdm?>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">753440</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2021.753440</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Hotspot Mutations in SARS-CoV-2</article-title>
<alt-title alt-title-type="left-running-head">Saha et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Hotspot Mutations in SARS-CoV-2</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Saha</surname>
<given-names>Indrajit</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<xref ref-type="fn" rid="FN1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/559740/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ghosh</surname>
<given-names>Nimisha</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="fn" rid="FN1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/994634/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sharma&#x2009;</surname>
<given-names>Nikhil</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nandi</surname>
<given-names>Suman</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1506564/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Department of Computer Science and Engineering, National Institute of Technical Teachers&#x2019; Training and Research, <addr-line>Kolkata</addr-line>, <country>India</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha &#x2018;O&#x2019; Anusandhan (Deemed to be University), <addr-line>Bhubaneswar</addr-line>, <country>India</country>
</aff>
<aff id="aff3">
<label>
<sup>3</sup>
</label>Department of Electronics and Communication Engineering, Jaypee Institute of Information Technology, <addr-line>Noida</addr-line>, <country>India</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/29966/overview">Yang Zhang</ext-link>, University of Michigan, United&#x20;States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/988447/overview">Xiaoqiang Huang</ext-link>, University of Michigan, United&#x20;States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1443164/overview">Yavuz Oktay</ext-link>, Dokuz Eylul University, Turkey</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Indrajit Saha, <email>indrajit@nitttrkol.ac.in</email>
</corresp>
<fn fn-type="equal" id="FN1">
<label>
<sup>&#x2020;</sup>
</label>
<p>These authors have contributed equally to this&#x20;work</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>29</day>
<month>11</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>753440</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>08</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>07</day>
<month>10</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Saha, Ghosh, Sharma&#x2009; and Nandi.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Saha, Ghosh, Sharma&#x2009; and Nandi</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Since its emergence in Wuhan, China, severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has spread very rapidly around the world, resulting in a global pandemic. Though the vaccination process has started, the number of COVID-affected patients is still quite large. Hence, an analysis of hotspot mutations of the different evolving virus strains needs to be carried out. In this regard, multiple sequence alignment of 71,038 SARS-CoV-2 genomes of 98 countries over the period from January 2020 to June 2021 is performed using MAFFT followed by phylogenetic analysis in order to visualize the virus evolution. These steps resulted in the identification of hotspot mutations as deletions and substitutions in the coding regions based on entropy greater than or equal to 0.3, leading to a total of 45 unique hotspot mutations. Moreover, 10,286 Indian sequences are considered from 71,038 global SARS-CoV-2 sequences as a demonstrative example that gives 52 unique hotspot mutations. Furthermore, the evolution of the hotspot mutations along with the mutations in variants of concern is visualized, and their characteristics are discussed as well. Also, for all the non-synonymous substitutions (missense mutations), the functional consequences of amino acid changes in the respective protein structures are calculated using PolyPhen-2 and I-Mutant 2.0. In addition to this, SSIPe is used to report the binding affinity between the receptor-binding domain of Spike protein and human ACE2 protein by considering L452R, T478K, E484Q, and N501Y hotspot mutations in that region.</p>
</abstract>
<kwd-group>
<kwd>COVID-19</kwd>
<kwd>deletions</kwd>
<kwd>entropy</kwd>
<kwd>hotspot mutations</kwd>
<kwd>SARS-CoV-2 genomes</kwd>
<kwd>substitution</kwd>
</kwd-group>
<contract-num rid="cn001">CVD/2020/000991</contract-num>
<contract-sponsor id="cn001">Science and Engineering Research Board<named-content content-type="fundref-id">10.13039/501100001843</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>COVID-19 caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) was first identified in late December 2019 and has a high transmission rate (<xref ref-type="bibr" rid="B33">Zhu et&#x20;al., 2020</xref>). The WHO declared this outbreak as a pandemic on March 11, 2020 (<xref ref-type="bibr" rid="B8">Cucinotta and Vanelli, 2020</xref>). Like other coronaviruses, SARS-CoV-2 is also an enveloped single-stranded RNA virus containing nearly 30&#xa0;K nucleotide sequences (<xref ref-type="bibr" rid="B3">Alexandersen et&#x20;al., 2020</xref>). SARS-CoV-2 encompasses 11 codding regions, which include ORF1ab, Spike (S), ORF3a, Envelope (E), Membrane (M), ORF6, ORF7a, ORF7b, ORF8, Nucleocapsid (N), and ORF10.</p>
<p>Though the vaccination process has started, the virus is evolving and spreading all across the world, causing fresh waves every few months. Since the virus is mutating frequently, it creates new variant of the original virus. Among several variants, B.1.1.7 (Alpha), B.1.351 (Beta), P.1 (Gamma), and B.1.617.2 (Delta) are declared as variants of concern (<xref ref-type="bibr" rid="B23">Singh et&#x20;al., 2021</xref>). In this regard, the variant B.1.1.7 was first identified in the United&#x20;Kingdom, which contains E484K, N501Y, D614G, and P681H mutations in Spike glycoprotein (<xref ref-type="bibr" rid="B25">Tang et&#x20;al., 2020</xref>). In December 2020, the variant B.1.351 was first detected in South Africa, with mutations such as K417N, E484K, N501Y, D614G, and A701V (<xref ref-type="bibr" rid="B24">Tang et&#x20;al., 2021</xref>). The Brazilian variant P.1 also has almost the same mutations as the B.1.351 variant, but instead of A701V, the P.1 variant has H555Y mutation (<xref ref-type="bibr" rid="B9">Faria et&#x20;al., 2021</xref>). On the other hand, the variant B.1.617.2 was first identified in India with L452R, T478K, D614G, and P681R mutations in Spike glycoprotein (<xref ref-type="bibr" rid="B4">Bernal et&#x20;al., 2021</xref>).</p>
<p>To understand the new variants of SARS-CoV-2, <xref ref-type="bibr" rid="B26">Tiwari and Mishra (2021</xref>) have performed phylogenetic analysis of 591 SARS-CoV-2 genomes where they have found 43 synonymous and 57&#x20;non-synonymous mutations in 12 protein regions. They found the most prevalent mutations in the Spike protein, followed by NSP2, NSP3, and ORF9. They have also highlighted several distinct SARS-CoV-2 features as compared with other human-infecting viruses. <xref ref-type="bibr" rid="B31">Yuan et&#x20;al. (2020)</xref> have analyzed 11,183 global sequences where they have identified 119&#x20;single-nucleotide polymorphisms (SNPs) with 74&#x20;non-synonymous and 43 synonymous mutations. The mutational profiling shows that the highest mutation has occurred in Nucleocapsid, followed by NSP2, NSP3, and Spike. From China, India, the United&#x20;States, and Europe, 570 SARS-CoV-2 genomes are analyzed by <xref ref-type="bibr" rid="B27">Weber et&#x20;al. (2020</xref>), where they have identified 10 individual mutations where most of the mutations altered the amino acids in the replication-relevant proteins. <xref ref-type="bibr" rid="B22">Sarkar et&#x20;al. (2021)</xref> have performed a genome-wide analysis of 837 Indian SARS-CoV-2 genomes, where 33 unique mutations were observed, among which 18 mutations were identified in India in five protein regions (six in Spike, five in NSP3, four in RdRp, two in NSP2, and one in Nucleocapsid). The isolated Indian sequences were classified into 22 groups based on their coexisting mutations. This study highlights several mutations identified in various protein regions, which also help to identify the evolution of virus genome across various geographic locations of India. <xref ref-type="bibr" rid="B20">Saha et&#x20;al. (2020)</xref> have performed phylogenetic analysis of 566 Indian SARS-CoV-2 genomes to identify several mutations. As a result, 933 substitutions, 2,449 deletions, and two insertions have been identified from the aligned sequences. In another study, <xref ref-type="bibr" rid="B21">Saha et&#x20;al. (2021)</xref> have performed genomic analysis of 10,664 SARS-CoV-2 genomes, resulting in 7,209 substitutions, 11,700 deletions, 119 insertions, and 53&#x20;SNPs.</p>
<p>Motivated by the aforementioned analysis, in this work, we have performed multiple sequence alignment (MSA) of 71,038 SARS-CoV-2 genomes using MAFFT (<xref ref-type="bibr" rid="B15">Katoh et&#x20;al., 2002</xref>) followed by their phylogenetic analysis using Nextstrain (<xref ref-type="bibr" rid="B12">Hadfield et&#x20;al., 2018</xref>) to visualize the virus evolution. This led to the identification of hotspot mutations as deletions and substitutions in the coding regions based on entropy greater than or equal to 0.3. Furthermore, as a demonstrative example, 10,286 Indian sequences are considered from 71,038 global SARS-CoV-2 sequences. For all the non-synonymous substitutions (missense mutations), the functional consequences of amino acid changes in the respective protein structures are calculated using PolyPhen-2 and I-Mutant 2.0. Finally, SSIPe is used to report the binding affinity between the receptor-binding domain (RBD) of Spike protein and human ACE2 protein by considering the hotspot mutations in that region.</p>
</sec>
<sec id="s2">
<title>2 Methods</title>
<p>In this section, the dataset collection for the SARS-CoV-2 genomes is discussed along with the proposed pipeline.</p>
<sec id="s2-1">
<title>2.1 Data Preparation</title>
<p>For MSA and phylogenetic analysis, 71,038 global SARS-CoV-2 genomes are collected from Global Initiative on Sharing All Influenza Data (GISAID)<xref ref-type="fn" rid="FN2">
<sup>1</sup>
</xref>, and the Reference Genome (NC 045512.2)<xref ref-type="fn" rid="FN3">
<sup>2</sup>
</xref> is collected from the National Center for Biotechnology Information (NCBI). The SARS-CoV-2 sequences are mostly distributed from January 2020 to June 2021 globally. Moreover, to map the protein sequences and changes in the amino acid, Protein Data Bank (PDB) is collected from Zhang Lab<xref ref-type="fn" rid="FN4">
<sup>3</sup>
</xref> (<xref ref-type="bibr" rid="B32">Zhang et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B29">Wu et&#x20;al., 2021</xref>), and it is then used to show the structural changes. All these analyses are performed on the High Performance Computing facility of NITTTR, Kolkata; and for checking the amino acid changes, MATLAB R2019b is&#x20;used.</p>
</sec>
<sec id="s2-2">
<title>2.2 Pipeline of the Work</title>
<p>The pipeline of this work is provided in <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>. Initially, MSA of 71,038 global SARS-CoV-2 genomes is performed using MAFFT, which is followed by their phylogenetic analysis using Nextstrain. The corresponding phylogenetic tree is shown in <xref ref-type="fig" rid="F1">Figure&#x20;1B</xref>. MAFFT merges local and global algorithms for MSA, and it uses two different heuristic methods such as progressive (FFT-NS-2) and iterative refinement (FFT-NS-i). To create a provisional MSA, FFT-NS-2 calculates all-pairwise distances from which refined distances are calculated. Thereafter, FFT-NS-i is performed to get the final MSA. As MAFFT uses fast Fourier transform, it scores over other alignment techniques. On the other hand, Nextstrain is a collection of open-source tools, which is useful for understanding the evolution and spread of pathogen, particularly during an outbreak. By taking advantage of this tool, in this work, the evolution and geographic distribution of SARS-CoV-2 genomes are visualized by creating the metadata in our High Performance Computing environment.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Pipeline of the workflow.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g001.tif"/>
</fig>
<p>Once the alignment and the phylogenetic analysis are completed, hotspot mutations as deletions and substitutions are identified in the coding regions based on entropy greater than or equal to 0.3. Furthermore, 10,286 Indian sequences are considered as an example to identify such mutations as well. The corresponding phylogenetic tree for Indian sequences is shown in <xref ref-type="fig" rid="F1">Figure&#x20;1C</xref>. Moreover, using the codon table, amino acid changes in the SARS-CoV-2 proteins for the corresponding mutations are highlighted as well. The hotspot mutations are identified considering their entropy values, which are calculated as:<disp-formula id="e1">
<mml:math id="m1">
<mml:mi mathvariant="script">E</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>ln</mml:mi>
<mml:mspace width="0.3333em" class="nbsp"/>
<mml:mn>5</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mo>&#x2211;</mml:mo>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mspace width="0.3333em" class="nbsp"/>
<mml:mspace width="0.3333em" class="nbsp"/>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mspace width="0.3333em" class="nbsp"/>
<mml:mi>ln</mml:mi>
<mml:mspace width="0.3333em" class="nbsp"/>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mspace width="0.3333em" class="nbsp"/>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>where <inline-formula id="inf1">
<mml:math id="m2">
<mml:msubsup>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> represents the frequency of each residue <italic>&#x3b3;</italic> occurring at position <italic>&#x3b4;</italic> and 5 represents the four possible residues as nucleotides plus gap. Thereafter, the amino acid changes in the SARS-CoV-2 proteins for the non-synonymous deletions and substitutions for both global and Indian sequences are graphically visualized as shown in <xref ref-type="fig" rid="F1">Figure&#x20;1D</xref>. Finally, these changes are also used for the evaluation of their functional characteristics and are visualized in the respective protein structure as&#x20;well.</p>
</sec>
</sec>
<sec id="s3">
<title>3 Results</title>
<p>The experiments in this work are carried out according to the pipeline as given in <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>. Initially, MSA of 71,038 global SARS-CoV-2 genomes across 98 countries is carried out using MAFFT followed by their phylogenetic analysis using Nextstrain, which revealed five clades: 19A, 19B, 20A, 20B, and 20C. The number of sequences for each country is reported in <xref ref-type="sec" rid="s11">Supplementary Table S1</xref>. This resulted in the identification of hotspot mutation points as deletions and substitutions in the coding regions based on entropy. In this regard, only those hotspot mutations are considered whose entropy values are greater than or equal to 0.3. The entropy values for each of the genomic coordinates for both global and Indian sequences are provided in <xref ref-type="sec" rid="s11">Supplementary Table S2</xref>. The mutation statistics by considering different threshold values of entropy for each category are reported in <xref ref-type="table" rid="T1">Table&#x20;1</xref>. Based on the results in this table, the entropy value of 0.3 is considered as the threshold for choosing the hotspot mutations. It is to be noted that choosing a threshold value as either 0.2 or 0.1 will lead to a huge amount of hotspot mutations, which is not desired. As a consequence of choosing entropy threshold of 0.3, 45 unique hotspot mutations are identified, which resulted in 39&#x20;non-synonymous deletions and substitutions with nine unique deletions and 22 unique amino acid changes. Also, out of the 98 countries that are considered for global analysis, India with 10,286 sequences is taken as an example to demonstrate the mutations for a particular country as well. In this regard, 52 unique hotspot mutations provide 45&#x20;non-synonymous deletions and substitutions with five unique amino acid changes for deletions and 36 unique amino acid changes for substitutions. The analysis on other countries with the most number of sequences is provided in the Supplementary Material. The phylogenetic trees in radial and rectangular views considering global analysis are shown in <xref ref-type="fig" rid="F2">Figures 2A,B</xref>, respectively, while for Indian sequences, such views are provided in <xref ref-type="fig" rid="F2">Figures 2D,E</xref>, respectively. These phylogenetic trees respectively show the evolution of the global and Indian SARS-CoV-2 genomes over the months. For the benefit of the readers, it is important to mention that the number of sequences does not have any direct relationship with the number of hotspot mutations. The number of hotspots is based on the entropy value, which in turn depends on the frequency of mutations at a given genomic coordinate. So even with smaller number of sequences, if the frequency of mutations is higher than that with larger number of sequences, it will produce more hotspot mutations. Thus, with 71,038 global sequences, 45 unique hotspot mutations are identified, while for 10,286 Indian sequences, 52 such mutations are identified.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Mutation statistics of 71,038 global and 10,286 Indian SARS-CoV-2 genomes by considering different threshold values.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Threshold value</th>
<th colspan="26" align="center">Coding regions of global SARS-CoV-2 genomes</th>
</tr>
<tr>
<th align="center">NSP1</th>
<th align="center">NSP2</th>
<th align="center">NSP3</th>
<th align="center">NSP4</th>
<th align="center">3CL-Pro</th>
<th align="center">NSP6</th>
<th align="center">NSP7</th>
<th align="center">NSP8</th>
<th align="center">NSP9</th>
<th align="center">NSP10</th>
<th align="center">NSP11</th>
<th align="center">RdRp</th>
<th align="center">Helicase</th>
<th align="center">Exon</th>
<th align="center">endoRNAse</th>
<th align="center">NSP16</th>
<th align="center">Spike</th>
<th align="left">ORF3a</th>
<th align="left">Envelope</th>
<th align="left">Membrane</th>
<th align="left">ORF6</th>
<th align="left">ORF7a</th>
<th align="left">ORF7b</th>
<th align="left">ORF8</th>
<th align="left">Nucleocapsid</th>
<th align="left">ORF10</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">&#x3e; &#x3d;0.60</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.50 to &#x3c; 0.60</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.40 to &#x3c; 0.50</td>
<td align="center">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">4</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">8</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">13</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">4</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.30 to &#x3c; 0.40</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.20 to &#x3c; 0.30</td>
<td align="center">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">6</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">3</td>
<td align="char" char=".">1</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.10 to &#x3c; 0.20</td>
<td align="center">1</td>
<td align="char" char=".">3</td>
<td align="char" char=".">7</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">6</td>
<td align="char" char=".">3</td>
<td align="char" char=".">3</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">14</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">7</td>
<td align="char" char=".">8</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.05 to &#x3c; 0.10</td>
<td align="center">1</td>
<td align="char" char=".">3</td>
<td align="char" char=".">18</td>
<td align="char" char=".">3</td>
<td align="char" char=".">2</td>
<td align="char" char=".">2</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">5</td>
<td align="char" char=".">6</td>
<td align="char" char=".">2</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">25</td>
<td align="char" char=".">7</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">1</td>
<td align="char" char=".">2</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">5</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">
<bold>Threshold value</bold>
</td>
<td colspan="26" align="center">
<bold>Coding regions of Indian SARS-CoV-2 genomes</bold>
</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.60</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">4</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.50 to &#x3c; 0.60</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.40 to &#x3c; 0.50</td>
<td align="center">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">9</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.30 to &#x3c; 0.40</td>
<td align="center">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">6</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.20 to &#x3c; 0.30</td>
<td align="center">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">4</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">5</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">1</td>
<td align="char" char=".">16</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">4</td>
<td align="char" char=".">4</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.10 to &#x3c; 0.20</td>
<td align="center">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">12</td>
<td align="char" char=".">5</td>
<td align="char" char=".">0</td>
<td align="char" char=".">10</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">3</td>
<td align="char" char=".">2</td>
<td align="char" char=".">3</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">8</td>
<td align="char" char=".">2</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">2</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">2</td>
<td align="char" char=".">4</td>
<td align="char" char=".">0</td>
</tr>
<tr>
<td align="left">&#x3e; &#x3d;0.05 to &#x3c; 0.10</td>
<td align="center">0</td>
<td align="char" char=".">7</td>
<td align="char" char=".">11</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">5</td>
<td align="char" char=".">0</td>
<td align="char" char=".">1</td>
<td align="char" char=".">1</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">4</td>
<td align="char" char=".">1</td>
<td align="char" char=".">8</td>
<td align="char" char=".">1</td>
<td align="char" char=".">3</td>
<td align="char" char=".">53</td>
<td align="char" char=".">3</td>
<td align="char" char=".">0</td>
<td align="char" char=".">11</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0</td>
<td align="char" char=".">5</td>
<td align="char" char=".">7</td>
<td align="char" char=".">1</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Phylogenetic analysis of <bold>(A, B, C)</bold> global and <bold>(D, E, F)</bold> Indian SARS-CoV-2 genomes.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g002.tif"/>
</fig>
<p>The list of hotspot mutations for the global and Indian SARS-CoV-2 genomes along with their associated details is respectively provided in <xref ref-type="table" rid="T2">Tables 2</xref> and <xref ref-type="table" rid="T3">3</xref>. For example, in <xref ref-type="table" rid="T2">Table&#x20;2</xref>, genomic coordinate 28,881 in Nucleocapsid with nucleotide changes G &#x3e; A and G &#x3e; T has the highest entropy value of 0.773655. India also shows the same mutation but with an entropy value of 1.14807 as shown in <xref ref-type="table" rid="T3">Table&#x20;3</xref>. Please note that mutations like G28881A and G28883C may have an impact on antigenicity of Nucleocapsid protein (<xref ref-type="bibr" rid="B31">Yuan et&#x20;al., 2020</xref>). The entropy values for the corresponding nucleotide changes for global analysis are shown in <xref ref-type="fig" rid="F2">Figure&#x20;2C</xref>, while for India, the same is shown in <xref ref-type="fig" rid="F2">Figure&#x20;2F</xref>. It is to be noted that the total number of unique amino acid changes for deletions and substitutions is less than the number of non-synonymous deletions and substitutions. One of the reasons for this can be that if there are deletions at consecutive genomic coordinates, the corresponding amino acid changes are the same. For example, as can be seen from <xref ref-type="table" rid="T2">Table&#x20;2</xref>, at the three consecutive genomic coordinates 11,288, 11,289, and 11,290, deletion has occurred with the amino acid change as S106-. Thus, though the number of non-synonymous deletions is 3, the number of unique amino acid change is 1. This is true for other such changes as&#x20;well.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>List of hotspot mutations for 71,038 global SARS-CoV-2 genomes along with the protein change.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Genomic coordinate</th>
<th align="center">Overall entropy</th>
<th align="center">Nucleotide change</th>
<th align="center">Amino acid change</th>
<th align="center">Protein coordinate</th>
<th align="center">Gene</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">28,881</td>
<td align="char" char=".">0.773655</td>
<td align="left">G &#x3e; A, G &#x3e; T</td>
<td align="left">R &#x3e; K, R &#x3e; M</td>
<td align="char" char=".">203</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">28,883</td>
<td align="char" char=".">0.663399</td>
<td align="left">G &#x3e; C</td>
<td align="left">G &#x3e; R</td>
<td align="char" char=".">204</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">28,882</td>
<td align="char" char=".">0.663308</td>
<td align="left">G &#x3e; A</td>
<td align="left">R &#x3e; R</td>
<td align="char" char=".">203</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">23,604</td>
<td align="char" char=".">0.642160</td>
<td align="left">C &#x3e; A, C &#x3e; G</td>
<td align="left">P &#x3e; H, P &#x3e; R</td>
<td align="char" char=".">681</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">11,296</td>
<td align="char" char=".">0.502171</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">108</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">21,993</td>
<td align="char" char=".">0.500865</td>
<td align="left">A &#x3e; -</td>
<td align="left">Y &#x3e; -</td>
<td align="char" char=".">144</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">11,291</td>
<td align="char" char=".">0.499603</td>
<td align="left">G &#x3e; -</td>
<td align="left">G &#x3e; -</td>
<td align="char" char=".">107</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">28,280</td>
<td align="char" char=".">0.491543</td>
<td align="left">G &#x3e; C</td>
<td align="left">D &#x3e; H</td>
<td align="char" char=".">3</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">23,063</td>
<td align="char" char=".">0.484066</td>
<td align="left">A &#x3e; T</td>
<td align="left">N &#x3e; Y</td>
<td align="char" char=".">501</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,770</td>
<td align="char" char=".">0.476393</td>
<td align="left">G &#x3e; -</td>
<td align="left">V &#x3e; -</td>
<td align="char" char=".">70</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">3,267</td>
<td align="char" char=".">0.475810</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">183</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">11,288</td>
<td align="char" char=".">0.474924</td>
<td align="left">T &#x3e; -</td>
<td align="left">S &#x3e; -</td>
<td align="char" char=".">106</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">11,289</td>
<td align="char" char=".">0.472836</td>
<td align="left">C &#x3e; -</td>
<td align="left">S &#x3e; -</td>
<td align="char" char=".">106</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">21,765</td>
<td align="char" char=".">0.471435</td>
<td align="left">T &#x3e; -</td>
<td align="left">I &#x3e; -</td>
<td align="char" char=".">68</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,767</td>
<td align="char" char=".">0.469881</td>
<td align="left">C &#x3e; -</td>
<td align="left">H &#x3e; -</td>
<td align="char" char=".">69</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">11,290</td>
<td align="char" char=".">0.467890</td>
<td align="left">T &#x3e; -</td>
<td align="left">S &#x3e; -</td>
<td align="char" char=".">106</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">21,766</td>
<td align="char" char=".">0.467479</td>
<td align="left">A &#x3e; -</td>
<td align="left">I &#x3e; -</td>
<td align="char" char=".">68</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,768</td>
<td align="char" char=".">0.467116</td>
<td align="left">A &#x3e; -</td>
<td align="left">H &#x3e; -</td>
<td align="char" char=".">69</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,769</td>
<td align="char" char=".">0.466151</td>
<td align="left">T &#x3e; -</td>
<td align="left">H &#x3e; -</td>
<td align="char" char=".">69</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">11,293</td>
<td align="char" char=".">0.465319</td>
<td align="left">T &#x3e; -</td>
<td align="left">G &#x3e; -</td>
<td align="char" char=".">107</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">11,292</td>
<td align="char" char=".">0.464056</td>
<td align="left">G &#x3e; -</td>
<td align="left">G &#x3e; -</td>
<td align="char" char=".">107</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">11,294</td>
<td align="char" char=".">0.463926</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">108</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">24,914</td>
<td align="char" char=".">0.461770</td>
<td align="left">G &#x3e; C</td>
<td align="left">D &#x3e; H</td>
<td align="char" char=".">1118</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">6,954</td>
<td align="char" char=".">0.461746</td>
<td align="left">T &#x3e; C</td>
<td align="left">I &#x3e; T</td>
<td align="char" char=".">1412</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">28,977</td>
<td align="char" char=".">0.460661</td>
<td align="left">C &#x3e; T</td>
<td align="left">S &#x3e; F</td>
<td align="char" char=".">235</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">21,992</td>
<td align="char" char=".">0.460243</td>
<td align="left">T &#x3e; -</td>
<td align="left">Y &#x3e; -</td>
<td align="char" char=".">144</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">913</td>
<td align="char" char=".">0.460233</td>
<td align="left">C &#x3e; T</td>
<td align="left">S &#x3e; S</td>
<td align="char" char=".">36</td>
<td align="left">NSP2</td>
</tr>
<tr>
<td align="left">11,295</td>
<td align="char" char=".">0.459624</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">108</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">5,986</td>
<td align="char" char=".">0.459543</td>
<td align="left">C &#x3e; T</td>
<td align="left">F &#x3e; F</td>
<td align="char" char=".">1089</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">28,282</td>
<td align="char" char=".">0.459253</td>
<td align="left">T &#x3e; A</td>
<td align="left">D &#x3e; E</td>
<td align="char" char=".">3</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">28,048</td>
<td align="char" char=".">0.458864</td>
<td align="left">G &#x3e; T</td>
<td align="left">R &#x3e; I</td>
<td align="char" char=".">52</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">14,676</td>
<td align="char" char=".">0.458373</td>
<td align="left">C &#x3e; T</td>
<td align="left">P &#x3e; P</td>
<td align="char" char=".">412</td>
<td align="left">RdRp</td>
</tr>
<tr>
<td align="left">23,271</td>
<td align="char" char=".">0.458086</td>
<td align="left">C &#x3e; A</td>
<td align="left">A &#x3e; D</td>
<td align="char" char=".">570</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,281</td>
<td align="char" char=".">0.458038</td>
<td align="left">A &#x3e; T</td>
<td align="left">D &#x3e; V</td>
<td align="char" char=".">3</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">27,972</td>
<td align="char" char=".">0.457841</td>
<td align="left">C &#x3e; T</td>
<td align="left">Q &#x3e; &#x2a;</td>
<td align="char" char=".">27</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">5,388</td>
<td align="char" char=".">0.457761</td>
<td align="left">C &#x3e; A</td>
<td align="left">A &#x3e; D</td>
<td align="char" char=".">890</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">28,111</td>
<td align="char" char=".">0.457624</td>
<td align="left">A &#x3e; G</td>
<td align="left">Y &#x3e; C</td>
<td align="char" char=".">73</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">23,709</td>
<td align="char" char=".">0.456643</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">716</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">24,506</td>
<td align="char" char=".">0.455921</td>
<td align="left">T &#x3e; G</td>
<td align="left">S &#x3e; A</td>
<td align="char" char=".">982</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">15,279</td>
<td align="char" char=".">0.455884</td>
<td align="left">C &#x3e; T</td>
<td align="left">H &#x3e; H</td>
<td align="char" char=".">613</td>
<td align="left">RdRp</td>
</tr>
<tr>
<td align="left">16,176</td>
<td align="char" char=".">0.455573</td>
<td align="left">T &#x3e; C</td>
<td align="left">T &#x3e; T</td>
<td align="char" char=".">912</td>
<td align="left">RdRp</td>
</tr>
<tr>
<td align="left">21,991</td>
<td align="char" char=".">0.455314</td>
<td align="left">T &#x3e; -</td>
<td align="left">V &#x3e; -</td>
<td align="char" char=".">143</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">25,563</td>
<td align="char" char=".">0.442049</td>
<td align="left">G &#x3e; T</td>
<td align="left">Q &#x3e; H</td>
<td align="char" char=".">57</td>
<td align="left">ORF3a</td>
</tr>
<tr>
<td align="left">22,227</td>
<td align="char" char=".">0.310063</td>
<td align="left">C &#x3e; T</td>
<td align="left">A &#x3e; V</td>
<td align="char" char=".">222</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,253</td>
<td align="char" char=".">0.300528</td>
<td align="left">C &#x3e; T, C &#x3e; -</td>
<td align="left">F &#x3e; F, F &#x3e; -</td>
<td align="char" char=".">120</td>
<td align="left">ORF8</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>List of hotspot mutations for 10,286 Indian SARS-CoV-2 genomes along with the protein change.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Genomic coordinate</th>
<th align="center">Overall entropy</th>
<th align="center">Nucleotide change</th>
<th align="center">Amino acid change</th>
<th align="center">Protein coordinate</th>
<th align="center">Gene</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">28,881</td>
<td align="char" char=".">1.14807</td>
<td align="left">G &#x3e; A, G &#x3e; T</td>
<td align="left">R &#x3e; K, R &#x3e; M</td>
<td align="char" char=".">203</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">23,604</td>
<td align="char" char=".">0.8631</td>
<td align="left">C &#x3e; A, C &#x3e; G</td>
<td align="left">P &#x3e; H, P &#x3e; R</td>
<td align="char" char=".">681</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,882</td>
<td align="char" char=".">0.69019</td>
<td align="left">G &#x3e; A</td>
<td align="left">R &#x3e; R</td>
<td align="char" char=".">203</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">28,883</td>
<td align="char" char=".">0.68846</td>
<td align="left">G &#x3e; C</td>
<td align="left">G &#x3e; R</td>
<td align="char" char=".">204</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">26,767</td>
<td align="char" char=".">0.68419</td>
<td align="left">T &#x3e; C, T &#x3e; G</td>
<td align="left">I &#x3e; T, I &#x3e; S</td>
<td align="char" char=".">82</td>
<td align="left">Membrane</td>
</tr>
<tr>
<td align="left">28,253</td>
<td align="char" char=".">0.65534</td>
<td align="left">C &#x3e; T, C &#x3e; -</td>
<td align="left">F &#x3e; F, F &#x3e; -</td>
<td align="char" char=".">120</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">25,469</td>
<td align="char" char=".">0.6227</td>
<td align="left">C &#x3e; T</td>
<td align="left">S &#x3e; L</td>
<td align="char" char=".">26</td>
<td align="left">ORF3a</td>
</tr>
<tr>
<td align="left">29,402</td>
<td align="char" char=".">0.61955</td>
<td align="left">G &#x3e; T</td>
<td align="left">D &#x3e; Y</td>
<td align="char" char=".">377</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">22,917</td>
<td align="char" char=".">0.61006</td>
<td align="left">T &#x3e; G</td>
<td align="left">L &#x3e; R</td>
<td align="char" char=".">452</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">27,638</td>
<td align="char" char=".">0.60866</td>
<td align="left">T &#x3e; C</td>
<td align="left">V &#x3e; A</td>
<td align="char" char=".">82</td>
<td align="left">ORF7a</td>
</tr>
<tr>
<td align="left">25,563</td>
<td align="char" char=".">0.55354</td>
<td align="left">G &#x3e; T</td>
<td align="left">Q &#x3e; H</td>
<td align="char" char=".">57</td>
<td align="left">ORF3a</td>
</tr>
<tr>
<td align="left">22,444</td>
<td align="char" char=".">0.53665</td>
<td align="left">C &#x3e; T</td>
<td align="left">D &#x3e; D</td>
<td align="char" char=".">249</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">18,877</td>
<td align="char" char=".">0.52834</td>
<td align="left">C &#x3e; T</td>
<td align="left">L &#x3e; L</td>
<td align="char" char=".">280</td>
<td align="left">Exon</td>
</tr>
<tr>
<td align="left">26,735</td>
<td align="char" char=".">0.52715</td>
<td align="left">C &#x3e; T</td>
<td align="left">Y &#x3e; Y</td>
<td align="char" char=".">71</td>
<td align="left">Membrane</td>
</tr>
<tr>
<td align="left">28,854</td>
<td align="char" char=".">0.51198</td>
<td align="left">C &#x3e; T</td>
<td align="left">S &#x3e; L</td>
<td align="char" char=".">194</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">24,410</td>
<td align="char" char=".">0.49845</td>
<td align="left">G &#x3e; A</td>
<td align="left">D &#x3e; N</td>
<td align="char" char=".">950</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,987</td>
<td align="char" char=".">0.49717</td>
<td align="left">G &#x3e; A</td>
<td align="left">G &#x3e; D</td>
<td align="char" char=".">142</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,618</td>
<td align="char" char=".">0.48836</td>
<td align="left">C &#x3e; G</td>
<td align="left">T &#x3e; R</td>
<td align="char" char=".">19</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">27,752</td>
<td align="char" char=".">0.48264</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">120</td>
<td align="left">ORF7a</td>
</tr>
<tr>
<td align="left">22,034</td>
<td align="char" char=".">0.47915</td>
<td align="left">A &#x3e; -</td>
<td align="left">R &#x3e; -</td>
<td align="char" char=".">158</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">22,995</td>
<td align="char" char=".">0.47879</td>
<td align="left">C &#x3e; A</td>
<td align="left">T &#x3e; K</td>
<td align="char" char=".">478</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,461</td>
<td align="char" char=".">0.46436</td>
<td align="left">A &#x3e; G</td>
<td align="left">D &#x3e; G</td>
<td align="char" char=".">63</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">15,451</td>
<td align="char" char=".">0.44421</td>
<td align="left">G &#x3e; A</td>
<td align="left">G &#x3e; S</td>
<td align="char" char=".">671</td>
<td align="left">RdRp</td>
</tr>
<tr>
<td align="left">23,012</td>
<td align="char" char=".">0.44086</td>
<td align="left">G &#x3e; C</td>
<td align="left">E &#x3e; Q</td>
<td align="char" char=".">484</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">22,033</td>
<td align="char" char=".">0.4385</td>
<td align="left">C &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">157</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">16,466</td>
<td align="char" char=".">0.43082</td>
<td align="left">C &#x3e; T</td>
<td align="left">P &#x3e; L</td>
<td align="char" char=".">77</td>
<td align="left">Helicase</td>
</tr>
<tr>
<td align="left">22,032</td>
<td align="char" char=".">0.42673</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">157</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">11,201</td>
<td align="char" char=".">0.42554</td>
<td align="left">A &#x3e; G</td>
<td align="left">T &#x3e; A</td>
<td align="char" char=".">77</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">28,249</td>
<td align="char" char=".">0.41704</td>
<td align="left">A &#x3e; -</td>
<td align="left">D &#x3e; -</td>
<td align="char" char=".">119</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">5,184</td>
<td align="char" char=".">0.40139</td>
<td align="left">C &#x3e; T</td>
<td align="left">P &#x3e; L</td>
<td align="char" char=".">822</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">22,031</td>
<td align="char" char=".">0.40074</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">157</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">313</td>
<td align="char" char=".">0.39475</td>
<td align="left">C &#x3e; T</td>
<td align="left">L &#x3e; L</td>
<td align="char" char=".">16</td>
<td align="left">NSP1</td>
</tr>
<tr>
<td align="left">22,029</td>
<td align="char" char=".">0.38676</td>
<td align="left">A &#x3e; -</td>
<td align="left">E &#x3e; -</td>
<td align="char" char=".">156</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">5,700</td>
<td align="char" char=".">0.38604</td>
<td align="left">C &#x3e; A</td>
<td align="left">A &#x3e; D</td>
<td align="char" char=".">994</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">20,396</td>
<td align="char" char=".">0.38407</td>
<td align="left">A &#x3e; G</td>
<td align="left">K &#x3e; R</td>
<td align="char" char=".">259</td>
<td align="left">endoRNAse</td>
</tr>
<tr>
<td align="left">3,267</td>
<td align="char" char=".">0.37579</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">183</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">22,030</td>
<td align="char" char=".">0.3738</td>
<td align="left">G &#x3e; -</td>
<td align="left">E &#x3e; -</td>
<td align="char" char=".">156</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,251</td>
<td align="char" char=".">0.36694</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">120</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">28,248</td>
<td align="char" char=".">0.36497</td>
<td align="left">G &#x3e; -</td>
<td align="left">D &#x3e; -</td>
<td align="char" char=".">119</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">24,775</td>
<td align="char" char=".">0.36197</td>
<td align="left">A &#x3e; T</td>
<td align="left">Q &#x3e; H</td>
<td align="char" char=".">1071</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">21,895</td>
<td align="char" char=".">0.35931</td>
<td align="left">T &#x3e; C</td>
<td align="left">D &#x3e; D</td>
<td align="char" char=".">111</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">28,280</td>
<td align="char" char=".">0.35905</td>
<td align="left">G &#x3e; C</td>
<td align="left">D &#x3e; H</td>
<td align="char" char=".">3</td>
<td align="left">Nucleocapsid</td>
</tr>
<tr>
<td align="left">28,250</td>
<td align="char" char=".">0.35546</td>
<td align="left">T &#x3e; -</td>
<td align="left">D &#x3e; -</td>
<td align="char" char=".">119</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">28,252</td>
<td align="char" char=".">0.351</td>
<td align="left">T &#x3e; -</td>
<td align="left">F &#x3e; -</td>
<td align="char" char=".">120</td>
<td align="left">ORF8</td>
</tr>
<tr>
<td align="left">11,418</td>
<td align="char" char=".">0.34861</td>
<td align="left">T &#x3e; C</td>
<td align="left">V &#x3e; A</td>
<td align="char" char=".">149</td>
<td align="left">NSP6</td>
</tr>
<tr>
<td align="left">9,891</td>
<td align="char" char=".">0.34766</td>
<td align="left">C &#x3e; T</td>
<td align="left">A &#x3e; V</td>
<td align="char" char=".">446</td>
<td align="left">NSP4</td>
</tr>
<tr>
<td align="left">17,523</td>
<td align="char" char=".">0.33196</td>
<td align="left">G &#x3e; T</td>
<td align="left">M &#x3e; I</td>
<td align="char" char=".">429</td>
<td align="left">Helicase</td>
</tr>
<tr>
<td align="left">3,457</td>
<td align="char" char=".">0.3314</td>
<td align="left">C &#x3e; T</td>
<td align="left">Y &#x3e; Y</td>
<td align="char" char=".">246</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">4,965</td>
<td align="char" char=".">0.32981</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">749</td>
<td align="left">NSP3</td>
</tr>
<tr>
<td align="left">22,022</td>
<td align="char" char=".">0.31618</td>
<td align="left">G &#x3e; A</td>
<td align="left">E &#x3e; K</td>
<td align="char" char=".">154</td>
<td align="left">Spike</td>
</tr>
<tr>
<td align="left">1191</td>
<td align="char" char=".">0.30404</td>
<td align="left">C &#x3e; T</td>
<td align="left">P &#x3e; L</td>
<td align="char" char=".">129</td>
<td align="left">NSP2</td>
</tr>
<tr>
<td align="left">21,846</td>
<td align="char" char=".">0.30253</td>
<td align="left">C &#x3e; T</td>
<td align="left">T &#x3e; I</td>
<td align="char" char=".">95</td>
<td align="left">Spike</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The amino acid changes in protein for the non-synonymous deletions and substitutions as reported in <xref ref-type="table" rid="T2">Tables 2</xref> and <xref ref-type="table" rid="T3">3</xref> are visualized in <xref ref-type="fig" rid="F1">Figure&#x20;1D</xref>; <xref ref-type="sec" rid="s11">Supplementary Figure S1</xref>. All the amino acid changes in the protein for the non-synonymous substitutions or missense mutations for the global sequences are shown in <xref ref-type="fig" rid="F3">Figure&#x20;3</xref>, while the same for the Indian sequences are depicted in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>. The month-wise virus evolution in terms of entropy for both global and Indian genomic sequences is visualized respectively in <xref ref-type="fig" rid="F5">Figures 5</xref> and <xref ref-type="fig" rid="F6">6</xref>, while the corresponding entropy values are reported in <xref ref-type="sec" rid="s11">Supplementary Tables S3 and S4</xref>. For example, it can be seen from both the figures that both P681H and P681R, which are part of the variant of concerns Alpha or B.1.1.7 and Delta or B.1.617.2, have evolved over time globally and for India as well. It is to be noted that due to the lack of appropriate number of sequences, the data of January and February 2020 have been merged for the global analysis, while for India, such merging is for the months January to March 2020. Also, please note that since the calculation of entropy is performed on aligned sequences, only coding regions are considered for the identification of hotspot mutations, as the non-coding regions exhibit high entropy values and can be misleading while selecting such mutation points as hotspot mutations. Furthermore, the evolution of the mutation points for global SARS-CoV-2 genomes pertaining to the different variants of concern like Alpha, Beta, Gamma, and Delta as declared by the WHO is also reported respectively in <xref ref-type="fig" rid="F7">Figures 7A,B,C,D</xref>. It can be observed from the figures that the popular mutation D614G, which is common in all the variants though predominant in the earlier months of the pandemic, has waned over time. Also, the mutation T478K, which is unique to the Delta variant, is known to facilitate antibody escape (<xref ref-type="bibr" rid="B19">Planas et&#x20;al., 2021</xref>). Some important hotspot mutations like H69-, V70-, Y144-, A222V, N501Y, A570D, P681H, and P681R identified in this study are associated with the different SARS-CoV-2 variants of concern like Alpha, Beta, Gamma, and Delta.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Highlighted amino acid changes in the protein structures for the non-synonymous substitutions or missense hotspot mutations for global SARS-CoV-2 genomes in <bold>(A)</bold> NSP3, <bold>(B)</bold> ORF3a, <bold>(C)</bold> Spike, <bold>(D)</bold> ORF8, and <bold>(E)</bold> Nucleocapsid.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g003.tif"/>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Highlighted amino acid changes in the protein structures for the non-synonymous substitutions or missense hotspot mutations for Indian SARS-CoV-2 genomes in <bold>(A)</bold> NSP2, <bold>(B)</bold> NSP3, <bold>(C)</bold> NSP4, <bold>(D)</bold> NSP6, <bold>(E)</bold> RdRp, <bold>(F)</bold> helicase, <bold>(G)</bold> endoRNAse, <bold>(H)</bold> ORF3a, <bold>(I)</bold> Membrane, <bold>(J)</bold> Spike, <bold>(K)</bold> ORF7a, and <bold>(L)</bold> Nucleocapsid.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g004.tif"/>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Month-wise evolution of global SARS-CoV-2 genomes based on entropy.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Month-wise evolution of Indian SARS-CoV-2 genomes based on entropy.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g006.tif"/>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Month-wise evolution of <bold>(A)</bold> Alpha (B.1.1.7), <bold>(B)</bold> Beta (B.1.351), <bold>(C)</bold> Gamma (501.V3), and <bold>(D)</bold> Delta (B.1.617.2) variants in global SARS-CoV-2 genomes.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g007.tif"/>
</fig>
<p>The unique and common hotspot mutations between global and Indian sequences are represented in the form of Venn diagram in <xref ref-type="fig" rid="F8">Figures 8A,B</xref>, which shows the unique and common non-synonymous hotspot mutations, while the unique and common amino acid changes are shown in <xref ref-type="fig" rid="F8">Figure&#x20;8C</xref>. As shown in <xref ref-type="fig" rid="F8">Figure&#x20;8A</xref>, there are 37 and 44 unique mutations in global and Indian sequences, while eight are common in both. For non-synonymous hotspot deletions and substitutions, there are 32 and 38 unique mutations in each category, while the common number of such mutations is seven as reported in <xref ref-type="fig" rid="F8">Figure&#x20;8B</xref>. For amino acid changes, as shown in <xref ref-type="fig" rid="F8">Figure&#x20;8C</xref>, these statistics are 22, 32, and nine. The Venn diagram showing the common and unique hotspot mutations for global and Indian sequences with Alpha, Beta, Gamma, and Delta variants of SARS-CoV-2 is reported in <xref ref-type="sec" rid="s11">Supplementary Figure S2</xref>. For example, in <xref ref-type="sec" rid="s11">Supplementary Figure S2A</xref>, there are four unique mutations in both global sequences and Alpha variant, while there are nine mutations that are common to&#x20;both.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Venn diagrams of global and Indian SARS-CoV-2 genomes to represent common hotspot mutations.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g008.tif"/>
</fig>
</sec>
<sec id="s4">
<title>4 Discussion</title>
<p>There are spurts of new waves in almost every country around the globe. India has already gone through the massively catastrophic second wave, and according to the experts, a third wave is imminent. This can be attributed to the fact that the virus is evolving and new strains are getting identified, thereby making the study of this ever-evolving virus all the more important. The functional characteristics of some important mutations in the global and Indian SARS-CoV-2 genomic sequences are reported in <xref ref-type="table" rid="T4">Table&#x20;4</xref>.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Functional characteristics of some important mutations.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Mutations</th>
<th align="center">Functional characteristics</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">H69-</td>
<td align="left">Leads to conformational changes in Spike protein (<xref ref-type="bibr" rid="B16">Meng et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B18">McCarthy et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">V70-</td>
<td align="left">Leads to conformational changes in Spike protein (<xref ref-type="bibr" rid="B16">Meng et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B18">McCarthy et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">Y144-</td>
<td align="left">Reduces affinity of antibody binding (<xref ref-type="bibr" rid="B18">McCarthy et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">L452R</td>
<td align="left">Increases the binding ability of the ACE2 receptor and can also reduce the attaching capability of vaccine-simulated antibodies with Spike protein (<xref ref-type="bibr" rid="B10">Garcia-Beltran et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">T478K</td>
<td align="left">Facilitates antibody escape (<xref ref-type="bibr" rid="B19">Planas et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">E484Q</td>
<td align="left">Associated with reduced sera neutralization (<xref ref-type="bibr" rid="B11">Greaney et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">N501Y</td>
<td align="left">Highest binding affinity with human receptor cell hACE2 and resistant to neutralization (<xref ref-type="bibr" rid="B17">Luan et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">P681H</td>
<td align="left">Near furin cleavage site, may affect transmissibility of the virus (<xref ref-type="bibr" rid="B5">Boehm et&#x20;al., 2021</xref>)</td>
</tr>
<tr>
<td align="left">P681R</td>
<td align="left">Near furin cleavage site, may affect transmissibility of the virus (<xref ref-type="bibr" rid="B5">Boehm et&#x20;al., 2021</xref>)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Structural changes in amino acid residues may sometimes lead to functional instability in proteins due to change in protein translations. To judge their characteristics, these changes are demonstrated through sequence and structural homology-based prediction for the hotspot deletions and missense mutations for global and Indian sequences in <xref ref-type="table" rid="T5">Table&#x20;5</xref>. The tools used for these predictions are PolyPhen-2 (Polymorphism Phenotyping) (<xref ref-type="bibr" rid="B1">Adzhubei et&#x20;al., 2010</xref>) and I-Mutant 2.0 (<xref ref-type="bibr" rid="B6">Capriotti et&#x20;al., 2005</xref>). PolyPhen-2<xref ref-type="fn" rid="FN5">
<sup>4</sup>
</xref> works with sequence, structural, and phylogenetic information of missense mutations, while I-Mutant 2.0<xref ref-type="fn" rid="FN6">
<sup>5</sup>
</xref> uses support vector machine (SVM) for the automatic prediction of protein stability changes upon missense mutations. PolyPhen-2 is used to find the damaging hotspot mutations, and I-Mutant 2.0 determines protein stability. To determine if a mutation is damaging using PolyPhen-2, its score is considered, which lies between 0 and 1. If the score is close to 1, then a mutation is considered to be damaging. It can be concluded from <xref ref-type="table" rid="T5">Table&#x20;5</xref> that out of the 22 unique amino acid changes for substitutions in global sequences, 14 are damaging, while for Indian sequences, 24 are damaging out of 36 changes. It is important to note that in case of protein, damaging mostly defines instability. Generally, this is used for human proteins. As a consequence, if the human protein is damaging in nature because of mutations, then the human protein&#x2013;protein interactions may occur with high or low binding affinity. Now in case of virus, similar consequences may happen, which means that if the virus protein is damaged because of mutations, it may interact with human proteins with similar binding affinity. As a result, the virus may acquire characteristics like transmissibility and escaping antibodies (<xref ref-type="bibr" rid="B2">Alenquer et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B13">Harvey et&#x20;al., 2021</xref>).</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Sequence and structural homology-based prediction of non-synonymous substitution as hotspot mutations along with their protein structural stability for 71,038 global SARS-CoV-2 genomes.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Change in</th>
<th align="center">Change in</th>
<th align="center">Mapped with</th>
<th colspan="2" align="center">PolyPhen-2</th>
<th colspan="2" align="center">I-mutant 2.0</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Nucleotide</td>
<td align="left">Amino acid</td>
<td align="left">Coding regions</td>
<td align="left">Prediction</td>
<td align="left">Score</td>
<td align="left">Stability</td>
<td align="left">DDG (kcal/mol)</td>
</tr>
<tr>
<td align="left">G28881A</td>
<td align="left">R203&#xa0;K</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">0.969</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;2.26</td>
</tr>
<tr>
<td align="left">G28881T</td>
<td align="left">R203M</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">0.998</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.52</td>
</tr>
<tr>
<td align="left">G28883C</td>
<td align="left">G204R</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">No change</td>
<td align="center">0</td>
</tr>
<tr>
<td align="left">C23604A</td>
<td align="left">P681H</td>
<td align="left">Spike</td>
<td align="left">Not generated</td>
<td align="left">Not generated</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.92</td>
</tr>
<tr>
<td align="left">C23604G</td>
<td align="left">P681R</td>
<td align="left">Spike</td>
<td align="left">Not generated</td>
<td align="left">Not generated</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.79</td>
</tr>
<tr>
<td align="left">G28280C</td>
<td align="left">D3H</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Increase</td>
<td align="center">0.34</td>
</tr>
<tr>
<td align="left">A23063T</td>
<td align="left">N501Y</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.145</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.34</td>
</tr>
<tr>
<td align="left">C3267T</td>
<td align="left">T183I</td>
<td align="left">NSP3</td>
<td align="left">Not generated</td>
<td align="left">Not generated</td>
<td align="left">Decrease</td>
<td align="center">-0.1</td>
</tr>
<tr>
<td align="left">G24914C</td>
<td align="left">D1118H</td>
<td align="left">Spike</td>
<td align="left">Probably damaging</td>
<td align="left">0.998</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.1</td>
</tr>
<tr>
<td align="left">T6954C</td>
<td align="left">I1412T</td>
<td align="left">NSP3</td>
<td align="left">Benign</td>
<td align="left">0.026</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;2.78</td>
</tr>
<tr>
<td align="left">C28977T</td>
<td align="left">S235F</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">0.998</td>
<td align="left">Increase</td>
<td align="center">2.43</td>
</tr>
<tr>
<td align="left">T28282A</td>
<td align="left">D3E</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">0.997</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.02</td>
</tr>
<tr>
<td align="left">G28048T</td>
<td align="left">R52I</td>
<td align="left">ORF8</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.09</td>
</tr>
<tr>
<td align="left">C23271A</td>
<td align="left">A570D</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.031</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.32</td>
</tr>
<tr>
<td align="left">A28281T</td>
<td align="left">D3V</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.22</td>
</tr>
<tr>
<td align="left">C5388A</td>
<td align="left">A890D</td>
<td align="left">NSP3</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.09</td>
</tr>
<tr>
<td align="left">A28111G</td>
<td align="left">Y73C</td>
<td align="left">ORF8</td>
<td align="left">Probably damaging</td>
<td align="left">0.994</td>
<td align="left">Increase</td>
<td align="center">1.04</td>
</tr>
<tr>
<td align="left">C23709T</td>
<td align="left">T716I</td>
<td align="left">Spike</td>
<td align="left">Possibly damaging</td>
<td align="left">0.696</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.95</td>
</tr>
<tr>
<td align="left">T24506G</td>
<td align="left">S982A</td>
<td align="left">Spike</td>
<td align="left">Probably damaging</td>
<td align="left">0.996</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.36</td>
</tr>
<tr>
<td align="left">C22227T</td>
<td align="left">A222V</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.001</td>
<td align="left">Increase</td>
<td align="center">0.48</td>
</tr>
<tr>
<td align="left">T26767G</td>
<td align="left">I82S</td>
<td align="left">Membrane</td>
<td align="left">Possibly damaging</td>
<td align="left">0.951</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;2</td>
</tr>
<tr>
<td align="left">C25469T</td>
<td align="left">S26L</td>
<td align="left">ORF3a</td>
<td align="left">Benign</td>
<td align="left">0.017</td>
<td align="left">Increase</td>
<td align="center">0.92</td>
</tr>
<tr>
<td align="left">G29402T</td>
<td align="left">D377Y</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Increase</td>
<td align="center">0.51</td>
</tr>
<tr>
<td align="left">T22917G</td>
<td align="left">L452R</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.04</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.4</td>
</tr>
<tr>
<td align="left">T27638C</td>
<td align="left">V82A</td>
<td align="left">ORF7a</td>
<td align="left">Possibly damaging</td>
<td align="left">0.732</td>
<td align="left">Decrease</td>
<td align="center">-2.18</td>
</tr>
<tr>
<td align="left">G25563T</td>
<td align="left">Q57H</td>
<td align="left">ORF3a</td>
<td align="left">Probably damaging</td>
<td align="left">0.983</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.12</td>
</tr>
<tr>
<td align="left">C28854T</td>
<td align="left">S194L</td>
<td align="left">Nucleocapsid</td>
<td align="left">Probably damaging</td>
<td align="left">0.994</td>
<td align="left">Increase</td>
<td align="center">0.45</td>
</tr>
<tr>
<td align="left">G24410A</td>
<td align="left">D950N</td>
<td align="left">Spike</td>
<td align="left">Possibly damaging</td>
<td align="left">0.731</td>
<td align="left">Increase</td>
<td align="center">0.15</td>
</tr>
<tr>
<td align="left">G21987A</td>
<td align="left">G142D</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.051</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.17</td>
</tr>
<tr>
<td align="left">C21618G</td>
<td align="left">T19R</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0.004</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.12</td>
</tr>
<tr>
<td align="left">C27752T</td>
<td align="left">T120I</td>
<td align="left">ORF7a</td>
<td align="left">Possibly damaging</td>
<td align="left">0.915</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.26</td>
</tr>
<tr>
<td align="left">C22995A</td>
<td align="left">T478K</td>
<td align="left">Spike</td>
<td align="left">Benign</td>
<td align="left">0</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.09</td>
</tr>
<tr>
<td align="left">A28461G</td>
<td align="left">D63G</td>
<td align="left">Nucleocapsid</td>
<td align="left">Benign</td>
<td align="left">0</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.57</td>
</tr>
<tr>
<td align="left">G15451A</td>
<td align="left">G671S</td>
<td align="left">RdRp</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.29</td>
</tr>
<tr>
<td align="left">G23012C</td>
<td align="left">E484Q</td>
<td align="left">Spike</td>
<td align="left">Possibly damaging</td>
<td align="left">0.786</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.48</td>
</tr>
<tr>
<td align="left">C16466T</td>
<td align="left">P77L</td>
<td align="left">Helicase</td>
<td align="left">Probably damaging</td>
<td align="left">1</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.03</td>
</tr>
<tr>
<td align="left">A11201G</td>
<td align="left">T77A</td>
<td align="left">NSP6</td>
<td align="left">Possibly damaging</td>
<td align="left">0.577</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.7</td>
</tr>
<tr>
<td align="left">C5184T</td>
<td align="left">P822L</td>
<td align="left">NSP3</td>
<td align="left">Benign</td>
<td align="left">0.007</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.54</td>
</tr>
<tr>
<td align="left">C5700A</td>
<td align="left">A994D</td>
<td align="left">NSP3</td>
<td align="left">Probably damaging</td>
<td align="left">0.972</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.78</td>
</tr>
<tr>
<td align="left">A20396G</td>
<td align="left">K259R</td>
<td align="left">endoRNAse</td>
<td align="left">Benign</td>
<td align="left">0</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.49</td>
</tr>
<tr>
<td align="left">A24775T</td>
<td align="left">Q1071H</td>
<td align="left">Spike</td>
<td align="left">Possibly damaging</td>
<td align="left">0.998</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.19</td>
</tr>
<tr>
<td align="left">T11418C</td>
<td align="left">V149A</td>
<td align="left">NSP6</td>
<td align="left">Possibly damaging</td>
<td align="left">0.865</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;3.43</td>
</tr>
<tr>
<td align="left">C9891T</td>
<td align="left">A446V</td>
<td align="left">NSP4</td>
<td align="left">Probably damaging</td>
<td align="left">0.999</td>
<td align="left">Increase</td>
<td align="center">0.64</td>
</tr>
<tr>
<td align="left">G17523T</td>
<td align="left">M429I</td>
<td align="left">Helicase</td>
<td align="left">Possibly damaging</td>
<td align="left">0.649</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.26</td>
</tr>
<tr>
<td align="left">C4965T</td>
<td align="left">T749I</td>
<td align="left">NSP3</td>
<td align="left">Probably damaging</td>
<td align="left">0.996</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.92</td>
</tr>
<tr>
<td align="left">G22022A</td>
<td align="left">E154&#xa0;K</td>
<td align="left">Spike</td>
<td align="left">Not generated</td>
<td align="left">Not Generated</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.4</td>
</tr>
<tr>
<td align="left">C1191T</td>
<td align="left">P129L</td>
<td align="left">NSP2</td>
<td align="left">Possibly damaging</td>
<td align="left">0.888</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;0.53</td>
</tr>
<tr>
<td align="left">C21846T</td>
<td align="left">T95I</td>
<td align="left">Spike</td>
<td align="left">Probably damaging</td>
<td align="left">0.999</td>
<td align="left">Decrease</td>
<td align="center">&#x2212;1.8</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Another important parameter to judge the functional and structural activities of a protein is protein stability, which dictates the conformational structure of a protein. Any change in protein stability may cause misfolding, degradation, or aberrant conglomeration of proteins. I-Mutant 2.0 uses free energy change values (DDG (kcal/mol)) to predict the changes in the protein stability wherein a negative value of DDG indicates that the protein has a decreasing stability, while a positive value indicates an increase in stability. For example, the very low DDG value of G25563T shows that there is a decreased protein stability, thereby resulting in a reduction of virus virulence (<xref ref-type="bibr" rid="B7">Cheng et&#x20;al., 2021</xref>). The results from I-mutant 2.0 show that out of the 14 and 24 unique damaging changes for global and Indian sequences, 10 and 18 changes respectively decrease the stability of the protein structures. <xref ref-type="fig" rid="F9">Figure&#x20;9</xref> shows the binding affinity between the RBD of Spike protein and human ACE2 protein performed using SSIPe<xref ref-type="fn" rid="FN7">
<sup>6</sup>
</xref> (<xref ref-type="bibr" rid="B14">Huang et&#x20;al., 2019</xref>) for the four mutations of SARS-CoV-2, viz., L452R, T478K, E484Q, and N501Y, taking place in such domain. The region marked in red shows the exact positions (471&#x2013;492) where the binding takes place. To report the binding affinity using SSIPe, initially the RBD region of Spike protein (<xref ref-type="bibr" rid="B28">Woo et&#x20;al., 2020</xref>) is docked with human ACE2 protein<xref ref-type="fn" rid="FN8">
<sup>7</sup>
</xref> using PatchDock<xref ref-type="fn" rid="FN9">
<sup>8</sup>
</xref>. The best docked structure is then provided as an input to SSIPe. <xref ref-type="table" rid="T6">Table&#x20;6</xref> further reports the binding affinity values for the four mutations. A strongly favorable mutation is usually defined as the one that has DDG value &#x2264; &#x2212;1.5&#xa0;kcal/mol, while a strongly unfavorable mutation is the one that has DDG value &#x2265;1.5&#xa0;kcal/mol. The DDG value of &#x2212;0.769&#xa0;kcal/mol for E484Q indicates that this is a favorable mutation, while DDG values of 1.083, 1.248, and 0.236&#xa0;kcal/mol for L452R, T478K, and N501Y indicate that these mutations are somewhat unfavorable. These results corroborate our earlier explanation that because of mutation, virus&#x2013;human protein&#x2013;protein interactions may occur with high or low binding affinity.</p>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption>
<p>Binding between RBD region of Spike protein (specifically 471&#x2013;492 the region marked in red)) and human ACE2 protein. RBD, receptor-binding domain.</p>
</caption>
<graphic xlink:href="fgene-12-753440-g009.tif"/>
</fig>
<table-wrap id="T6" position="float">
<label>TABLE 6</label>
<caption>
<p>Binding affinity of the mutations in RBD region of Spike protein and human ACE2 protein.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Genomic coordinate</th>
<th align="center">Nucleotide change</th>
<th align="center">Amino acid change</th>
<th align="center">Protein coordinate</th>
<th align="center">DDG (kcal/mol)</th>
<th align="center">SSIPscore</th>
<th align="center">EvoEFscore</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">22,917</td>
<td align="left">T &#x3e; G</td>
<td align="left">L &#x3e; R</td>
<td align="char" char=".">452</td>
<td align="char" char=".">1.083</td>
<td align="char" char=".">2.083</td>
<td align="char" char=".">&#x2212;1.91</td>
</tr>
<tr>
<td align="left">22,995</td>
<td align="left">C &#x3e; A</td>
<td align="left">T &#x3e; K</td>
<td align="char" char=".">478</td>
<td align="char" char=".">1.248</td>
<td align="char" char=".">1.779</td>
<td align="char" char=".">&#x2212;0.77</td>
</tr>
<tr>
<td align="left">23,012</td>
<td align="left">G &#x3e; C</td>
<td align="left">E &#x3e; Q</td>
<td align="char" char=".">484</td>
<td align="char" char=".">&#x2212;0.769</td>
<td align="char" char=".">1.098</td>
<td align="char" char=".">&#x2212;5.22</td>
</tr>
<tr>
<td align="left">23,063</td>
<td align="left">A &#x3e; T</td>
<td align="left">N &#x3e; Y</td>
<td align="char" char=".">501</td>
<td align="char" char=".">0.236</td>
<td align="char" char=".">0</td>
<td align="char" char=".">0.09</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>Note. RBD, receptor-binding domain.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>
<xref ref-type="sec" rid="s11">Supplementary Figure S3</xref> shows the percentage of nucleotide change and frequency of nucleotide change for hotspot mutations for global and Indian sequences. For example, in <xref ref-type="sec" rid="s11">Supplementary Figure S3A</xref>, the occurrence of nucleotide change G &#x3e; A in 71,038 global sequences is almost 45%, while the number of times it occurs in 45 hotspot mutations is two, as is also evident from <xref ref-type="table" rid="T2">Table&#x20;2</xref>. It can also be seen from <xref ref-type="sec" rid="s11">Supplementary Figures S3B, S3D</xref> that 10 and 16 out of 39 and 45&#x20;non-synonymous mutations are from C to T, thereby representing abundant transition. This transition increases the frequency of codons for hydrophobic amino acids and provides evidence of potential antiviral editing mechanisms driven by host (<xref ref-type="bibr" rid="B31">Yuan et&#x20;al., 2020</xref>). Also, more C-to-T transition means less CpG abundance, indicating rapid adaptation of virus in host. This CpG deficiency, which leads to evasion of host antiviral defense mechanisms, is exhibited the most in SARS-CoV-2 virus (<xref ref-type="bibr" rid="B30">Xia, 2020</xref>).</p>
</sec>
<sec id="s5">
<title>5 Conclusion</title>
<p>With the imminent third wave, it is very crucial to understand the evolution of SARS-CoV-2. In this regard, MSA of 71,038 SARS-CoV-2 genomes of 98 countries over the period from January 2020 to June 2021 is performed using MAFFT followed by phylogenetic analysis to visualize the evolution of SARS-CoV-2. This resulted in the identification of hotspot mutations as deletions and substitutions in the coding regions based on entropy, which should be greater than or equal to 0.3. Consequently, a total of 45 unique hotspot mutations out of which 39&#x20;non-synonymous deletions and substitutions are identified with nine unique amino acid changes for deletions and 22 unique amino acid changes for substitutions. Moreover, 10,286 Indian sequences are considered from 71,038 global SARS-CoV-2 sequences as a demonstrative example, which gives 52 unique hotspot mutations, resulting in 45&#x20;non-synonymous deletions and substitutions with five unique amino acid changes for deletions and 36 unique amino acid changes for substitutions. Some important mutations in such sequences pertaining to the Delta variant of SARS-CoV-2 are T19R, G142D, E156-, F157-, L452R, T478K, and P681R. Furthermore, the evolution of the hotspot mutations along with the mutations in variants of concern is visualized, and their characteristics are also discussed. Moreover, for all the missense mutations, the functional consequences of amino acid changes in the respective protein structures are calculated using PolyPhen-2 and I-Mutant 2.0. Finally, SSIPe is used to report the binding affinity between the RBD of Spike protein and human ACE2 protein by considering L452R, T478K, E484Q, and N501Y hotspot mutations in that region.</p>
</sec>
</body>
<back>
<sec id="s6">
<title>Data Availability Statement</title>
<p>The aligned 71038 Global SARS-CoV-2 genomes with the reference sequence and the final results of this work are available at <ext-link ext-link-type="uri" xlink:href="http://www.nitttrkol.ac.in/indrajit/projects/COVID-Hotspot-Mutation-Global-71K/">http://www.nitttrkol.ac.in/indrajit/projects/COVID-Hotspot-Mutation-Global-71K/</ext-link>. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>IS and NG designed the research. IS, NG, NS, and SN analyzed the data and wrote the article. All the authors reviewed and approved the final version of the article.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>This work has been partially supported by CRG short-term research grant on COVID-19 (CVD/2020/000,991) from Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India. However, it does not provide any publication&#x20;fees.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>We thank all those who have contributed sequences to GISAID and NCBI databases.</p>
</ack>
<sec id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2021.753440/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2021.753440/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.pdf" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<fn-group>
<fn id="FN2">
<label>1</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://www.gisaid.org/">https://www.gisaid.org/</ext-link>
</p>
</fn>
<fn id="FN3">
<label>2</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/1798174254">https://www.ncbi.nlm.nih.gov/nuccore/1798174254</ext-link>
</p>
</fn>
<fn id="FN4">
<label>3</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://zhanglab.ccmb.med.umich.edu/COVID-19/">https://zhanglab.ccmb.med.umich.edu/COVID-19/</ext-link>
</p>
</fn>
<fn id="FN5">
<label>4</label>
<p>
<ext-link ext-link-type="uri" xlink:href="http://genetics.bwh.harvard.edu/pph2/">http://genetics.bwh.harvard.edu/pph2/</ext-link>
</p>
</fn>
<fn id="FN6">
<label>5</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://folding.biofold.org/i-mutant/i-mutant2.0.html">https://folding.biofold.org/i-mutant/i-mutant2.0.html</ext-link>
</p>
</fn>
<fn id="FN7">
<label>6</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://zhanggroup.org/SSIPe/">https://zhanggroup.org/SSIPe/</ext-link>
</p>
</fn>
<fn id="FN8">
<label>7</label>
<p>
<ext-link ext-link-type="uri" xlink:href="https://www.rcsb.org/structure/1R42">https://www.rcsb.org/structure/1R42</ext-link>
</p>
</fn>
<fn id="FN9">
<label>8</label>
<p>
<ext-link ext-link-type="uri" xlink:href="http://bioinfo3d.cs.tau.ac.il/PatchDock/patchdock.html">http://bioinfo3d.cs.tau.ac.il/PatchDock/patchdock.html</ext-link>
</p>
</fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adzhubei</surname>
<given-names>I. A.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Peshkin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ramensky</surname>
<given-names>V. E.</given-names>
</name>
<name>
<surname>Gerasimova</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bork</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>A Method and Server for Predicting Damaging Missense Mutations</article-title>. <source>Nat. Methods</source> <volume>7</volume>, <fpage>248</fpage>&#x2013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth0410-248</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alenquer</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ferreira</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Lousa</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Val&#xe9;rio</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Medina-Lopes</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bergman</surname>
<given-names>M.-L.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Signatures in SARS-CoV-2 Spike Protein Conferring Escape to Neutralizing Antibodies</article-title>. <source>Plos Pathog.</source> <volume>17</volume>, <fpage>e1009772</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1009772</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alexandersen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chamings</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bhatta</surname>
<given-names>T. R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>SARS-CoV-2 Genomic and Subgenomic RNAs in Diagnostic Samples Are Not an Indicator of Active Replication</article-title>. <source>Nat. Commun.</source> <volume>11</volume>, <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1038/s41467-020-19883-7</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bernal</surname>
<given-names>J.&#x20;L.</given-names>
</name>
<name>
<surname>Andrews</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Gower</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Gallagher</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Simmons</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Thelwall</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Effectiveness of Covid-19 Vaccines against the B.1.617.2 (Delta) Variant</article-title>. <source>N. Engl. J. Med.</source> <volume>385</volume> (<issue>7</issue>), <fpage>585</fpage>&#x2013;<lpage>594</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa2108891</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boehm</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kronig</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Neher</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Eckerle</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Vetter</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kaiser</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Novel SARS-CoV-2 Variants: the Pandemics within the Pandemic</article-title>. <source>Clin. Microbiol. Infect.</source> <volume>27</volume>, <fpage>1109</fpage>&#x2013;<lpage>1117</lpage>. <pub-id pub-id-type="doi">10.1016/j.cmi.2021.05.022</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Capriotti</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Fariselli</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Casadio</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>I-Mutant2.0: Predicting Stability Changes upon Mutation from the Protein Sequence or Structure</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume>, <fpage>W306</fpage>&#x2013;<lpage>W310</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki375</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Functional Alterations Caused by Mutations Reflect Evolutionary Trends of SARS-CoV-2</article-title>. <source>Brief. Bioinform.</source> <volume>22</volume>, <fpage>1442</fpage>&#x2013;<lpage>1450</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbab042</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cucinotta</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Vanelli</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>WHO Declares COVID-19 a Pandemic</article-title>. <source>Acta Biomed.</source> <volume>91</volume>, <fpage>157</fpage>&#x2013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.23750/abm.v91i1.9397</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Faria</surname>
<given-names>N. R.</given-names>
</name>
<name>
<surname>Mellan</surname>
<given-names>T. A.</given-names>
</name>
<name>
<surname>Whittaker</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Claro</surname>
<given-names>I. M.</given-names>
</name>
<name>
<surname>Candido</surname>
<given-names>D. D. S.</given-names>
</name>
<name>
<surname>Mishra</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Genomics and Epidemiology of the P.1 SARS-CoV-2 Lineage in Manaus, Brazil</article-title>. <source>Science</source> <volume>372</volume>, <fpage>815</fpage>&#x2013;<lpage>821</lpage>. <pub-id pub-id-type="doi">10.1126/science.abh2644</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garcia-Beltran</surname>
<given-names>W. F.</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>E. C.</given-names>
</name>
<name>
<surname>St. Denis</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Nitido</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Garcia</surname>
<given-names>Z. H.</given-names>
</name>
<name>
<surname>Hauser</surname>
<given-names>B. M.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Multiple SARS-CoV-2 Variants Escape Neutralization by Vaccine-Induced Humoral Immunity</article-title>. <source>Cell</source> <volume>184</volume>, <fpage>2372</fpage>&#x2013;<lpage>2383.e9</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2021.03.013</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Greaney</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Starr</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Gilchuk</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zost</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Binshtein</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Loes</surname>
<given-names>A. N.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Complete Mapping of Mutations to the SARS-CoV-2 Spike Receptor-Binding Domain that Escape Antibody Recognition</article-title>. <source>Cell Host &#x26; Microbe</source> <volume>29</volume>, <fpage>44</fpage>&#x2013;<lpage>57.e9</lpage>. <pub-id pub-id-type="doi">10.1016/j.chom.2020.11.007</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hadfield</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Megill</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Bell</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Huddleston</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Potter</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Callender</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Nextstrain: Real-Time Tracking of Pathogen Evolution</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>4121</fpage>&#x2013;<lpage>4123</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty407</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Harvey</surname>
<given-names>W. T.</given-names>
</name>
<name>
<surname>Carabelli</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Jackson</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>E. C.</given-names>
</name>
<name>
<surname>Harrison</surname>
<given-names>E. M.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>SARS-CoV-2 Variants, Spike Mutations and Immune Escape</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>19</volume>, <fpage>409</fpage>&#x2013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.1038/s41579-021-00573-0</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Pearce</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>SSIPe: Accurately Estimating Protein-Protein Binding Affinity Change upon Mutations Using Evolutionary Profiles in Combination with an Optimized Physical Energy Function</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>2429</fpage>&#x2013;<lpage>2437</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz926</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Katoh</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Misawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kuma</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Miyata</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>MAFFT: A Novel Method for Rapid Multiple Sequence Alignment Based on Fast Fourier Transform</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>3059</fpage>&#x2013;<lpage>3066</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkf436</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Luan</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Huynh</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Enhanced Binding of the N501Y&#x2010;mutated SARS&#x2010;CoV&#x2010;2 Spike Protein to the Human ACE2 Receptor: Insights from Molecular Dynamics Simulations</article-title>. <source>FEBS Lett.</source> <volume>595</volume>, <fpage>1454</fpage>&#x2013;<lpage>1461</lpage>. <pub-id pub-id-type="doi">10.1002/1873-3468.14076</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCarthy</surname>
<given-names>K. R.</given-names>
</name>
<name>
<surname>Rennick</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Nambulli</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Robinson-McCarthy</surname>
<given-names>L. R.</given-names>
</name>
<name>
<surname>Bain</surname>
<given-names>W. G.</given-names>
</name>
<name>
<surname>Haidar</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Recurrent Deletions in the SARS-CoV-2 Spike Glycoprotein Drive Antibody Escape</article-title>. <source>Science</source> <volume>371</volume>, <fpage>1139</fpage>&#x2013;<lpage>1142</lpage>. <pub-id pub-id-type="doi">10.1126/science.abf6950</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Kemp</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Papa</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Datir</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Ferreira</surname>
<given-names>I. A. T. M.</given-names>
</name>
<name>
<surname>Marelli</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Recurrent Emergence of SARS-CoV-2 Spike Deletion H69/V70 and Its Role in the Alpha Variant B.1.1.7</article-title>. <source>Cell Rep.</source> <volume>35</volume> (<issue>13</issue>), <fpage>109292</fpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2021.109292</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Planas</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Veyer</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Baidaliuk</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Staropoli</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Guivel-Benhassine</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Rajah</surname>
<given-names>M. M.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Reduced Sensitivity of SARS-CoV-2 Variant Delta to Antibody Neutralization</article-title>. <source>Nature</source> <volume>596</volume>, <fpage>276</fpage>&#x2013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-021-03777-9</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saha</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Ghosh</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Maity</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Sarkar</surname>
<given-names>J.&#x20;P.</given-names>
</name>
<name>
<surname>Mitra</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Genome-wide Analysis of Indian SARS-CoV-2 Genomes for the Identification of Genetic Mutation and SNP</article-title>. <source>Infect. Genet. Evol.</source> <volume>85</volume>, <fpage>104457</fpage>. <pub-id pub-id-type="doi">10.1016/j.meegid.2020.104457</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saha</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Ghosh</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Pradhan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Maity</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Mitra</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Whole Genome Analysis of More Than 10&#x20;000 SARS-CoV-2 Virus Unveils Global Genetic Diversity and Target Region of NSP6</article-title>. <source>Brief. Bioinform.</source> <volume>22</volume>, <fpage>1106</fpage>&#x2013;<lpage>1121</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbab025</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sarkar</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Mitra</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chandra</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Saha</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Banerjee</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dutta</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Comprehensive Analysis of Genomic Diversity of SARS-CoV-2 in Different Geographic Regions of India: an Endeavour to Classify Indian SARS-CoV-2 Strains on the Basis of Co-existing Mutations</article-title>. <source>Arch. Virol.</source> <volume>166</volume>, <fpage>801</fpage>&#x2013;<lpage>812</lpage>. <pub-id pub-id-type="doi">10.1007/s00705-020-04911-0</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Rahman</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Ehtesham</surname>
<given-names>N. Z.</given-names>
</name>
<name>
<surname>Hira</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Hasnain</surname>
<given-names>S. E.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>SARS-CoV-2 Variants of Concern Are Emerging in India</article-title>. <source>Nat. Med.</source> <volume>27</volume>, <fpage>1131</fpage>. <pub-id pub-id-type="doi">10.1038/s41591-021-01397-4</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Toovey</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Harvey</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Huic</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Introduction of the South African SARS-CoV-2 Variant 501Y.V2 into the UK</article-title>. <source>J.&#x20;Infect.</source> <volume>82</volume>, <fpage>e8</fpage>. <pub-id pub-id-type="doi">10.1016/j.jinf.2021.01.007</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>J.&#x20;W.</given-names>
</name>
<name>
<surname>Tambyah</surname>
<given-names>P. A.</given-names>
</name>
<name>
<surname>Hui</surname>
<given-names>D. S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Emergence of a New SARS-CoV-2 Variant in the UK</article-title>. <source>J.&#x20;Infect.</source> <volume>82</volume>, <fpage>e27</fpage>&#x2013;<lpage>e28</lpage>. <pub-id pub-id-type="doi">10.1016/j.jinf.2020.12.024</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tiwari</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mishra</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Investigating the Genomic Landscape of Novel Coronavirus (2019-nCoV) to Identify Non-synonymous Mutations for Use in Diagnosis and Drug Design</article-title>. <source>J.&#x20;Clin. Virol.</source> <volume>128</volume>, <fpage>104441</fpage>. <pub-id pub-id-type="doi">10.1016/j.jcv.2020.104441</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weber</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ramirez</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Doerfler</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Signal Hotspot Mutations in SARS-CoV-2 Genomes Evolve as the Virus Spreads and Actively Replicates in Different Parts of the World</article-title>. <source>Virus. Res.</source> <volume>289</volume>, <fpage>198170</fpage>. <pub-id pub-id-type="doi">10.1016/j.virusres.2020.198170</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Woo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>S.-J.</given-names>
</name>
<name>
<surname>Choi</surname>
<given-names>Y. K.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tanveer</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Developing a Fully Glycosylated Full-Length SARS-CoV-2 Spike Protein Model in a Viral Membrane</article-title>. <source>J.&#x20;Phys. Chem. B</source> <volume>124</volume>, <fpage>7128</fpage>&#x2013;<lpage>7137</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jpcb.0c04553</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Effects of SARS&#x2010;CoV&#x2010;2 Mutations on Protein Structures and Intraviral Protein-Protein Interactions</article-title>. <source>J.&#x20;Med. Virol.</source> <volume>93</volume>, <fpage>2132</fpage>&#x2013;<lpage>2140</lpage>. <pub-id pub-id-type="doi">10.1002/jmv.26597</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xia</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Extreme Genomic CpG Deficiency in SARS-CoV-2 and Evasion of Host Antiviral Defense</article-title>. <source>Mol. Biol. Evol.</source> <volume>37</volume>, <fpage>2699</fpage>&#x2013;<lpage>2705</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msaa094</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Global SNP Analysis of 11,183 SARS&#x2010;CoV&#x2010;2 Strains Reveals High Genetic Diversity</article-title>. <source>Transbound. Emerg. Dis.</source> <pub-id pub-id-type="doi">10.1111/tbed.13931</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Bell</surname>
<given-names>E. W.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Protein Structure and Sequence Reanalysis of 2019-nCoV Genome Refutes Snakes as its Intermediate Host and the Unique Similarity between its Spike Protein Insertions and HIV-1</article-title>. <source>J.&#x20;Proteome Res.</source> <volume>19</volume>, <fpage>1351</fpage>&#x2013;<lpage>1360</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.0c00129</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>A Novel Coronavirus from Patients with Pneumonia in China, 2019</article-title>. <source>N. Engl. J.&#x20;Med.</source> <volume>382</volume>, <fpage>727</fpage>&#x2013;<lpage>733</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa2001017</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>