<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2016.01735</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Overlapping Regions in HIV-1 Genome Act as Potential Sites for Host&#x2013;Virus Interaction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Saha</surname> <given-names>Deeya</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Podder</surname> <given-names>Soumita</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ghosh</surname> <given-names>Tapash C.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/359661/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Bioinformatics Centre, Bose Institute</institution> <country>Kolkata, India</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Microbiology, Raiganj University</institution> <country>Raiganj, India</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Akio Adachi, Tokushima University, Japan</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Hirotaka Ode, National Hospital Organization Nagoya Medical Center, Japan; Kei Sato, Kyoto University, Japan</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>Tapash C. Ghosh, <email>tapash@jcbose.ac.in</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Virology, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>04</day>
<month>11</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>7</volume>
<elocation-id>1735</elocation-id>
<history>
<date date-type="received">
<day>05</day>
<month>07</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>10</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2016 Saha, Podder and Ghosh.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>Saha, Podder and Ghosh</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>More than a decade, overlapping genes in RNA viruses became a subject of research which has explored various effect of gene overlapping on the evolution and function of viral genomes like genome size compaction. Additionally, overlapping regions (OVRs) are also reported to encode elevated degree of protein intrinsic disorder (PID) in unspliced RNA viruses. With the aim to explore the roles of OVRs in HIV-1 pathogenesis, we have carried out an in-depth analysis on the association of gene overlapping with PID in 35 HIV1- M subtypes. Our study reveals an over representation of PID in OVR of HIV-1 genomes. These disordered residues endure several vital, structural features like short linear motifs (SLiMs) and protein phosphorylation (PP) sites which are previously shown to be involved in massive host&#x2013;virus interaction. Moreover, SLiMs in OVRs are noticed to be more functionally potential as compared to that of non-overlapping region. Although, density of experimentally verified SLiMs, resided in 9 HIV-1 genes, involved in host&#x2013;virus interaction do not show any bias toward clustering into OVR, <italic>tat</italic> and <italic>rev</italic> two important proteins mediates host&#x2013;pathogen interaction by their experimentally verified SLiMs, which are mostly localized in OVR. Finally, our analysis suggests that the acquisition of SLiMs in OVR is mutually exclusive of the occurrence of disordered residues, while the enrichment of PPs in OVR is solely dependent on PID and not on overlapping coding frames. Thus, OVRs of HIV-1 genomes could be demarcated as potential molecular recognition sites during host&#x2013;virus interaction.</p>
</abstract>
<kwd-group>
<kwd>structural disorder</kwd>
<kwd>short linear motifs</kwd>
<kwd>gene overlapping</kwd>
<kwd>host&#x2013;pathogen interaction</kwd>
<kwd>HIV-1</kwd>
<kwd>protein phosphorylation</kwd>
</kwd-group>
<contract-sponsor id="cn001">Department of Science and Technology, Ministry of Science and Technology<named-content content-type="fundref-id">10.13039/501100001409</named-content></contract-sponsor>
<counts>
<fig-count count="5"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="62"/>
<page-count count="10"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>Overlapping regions (OVR) where one DNA sequence codes for multiple proteins with different reading frames, are omnipresent in diverse life forms, for instance, it has been observed in the genomes of acellular obligate parasites like virus (<xref ref-type="bibr" rid="B1">Barrell et al., 1976</xref>; <xref ref-type="bibr" rid="B46">Pavesi, 2006</xref>; <xref ref-type="bibr" rid="B8">Chirico et al., 2010</xref>; <xref ref-type="bibr" rid="B59">Simon-Loriere et al., 2013</xref>), prokaryotes including archaea (<xref ref-type="bibr" rid="B54">Saha et al., 2016</xref>) and eubacteria (<xref ref-type="bibr" rid="B54">Saha et al., 2016</xref>) and subsequently in complex eukaryotes such as human (<xref ref-type="bibr" rid="B61">Veeramachaneni et al., 2004</xref>; <xref ref-type="bibr" rid="B40">Makalowska et al., 2007</xref>; <xref ref-type="bibr" rid="B56">Sanna et al., 2008</xref>). Overlapping genes had been widely studied as a mechanism to minimize genome size in prokaryotes (<xref ref-type="bibr" rid="B55">Sakharkar et al., 2005</xref>; <xref ref-type="bibr" rid="B51">Sabath et al., 2013</xref>; <xref ref-type="bibr" rid="B53">Saha et al., 2015</xref>, <xref ref-type="bibr" rid="B54">2016</xref>). Genomesize minimization again was attributed to environmental stimuli such as a rise in temperature (<xref ref-type="bibr" rid="B53">Saha et al., 2015</xref>) or in response to life history traits such as faster growth rates (<xref ref-type="bibr" rid="B54">Saha et al., 2016</xref>) in prokaryotes. Similar to the prokaryotes, recent studies have widely employed gene overlapping as a phenomenon of generating genetic novelty in RNA viruses (<xref ref-type="bibr" rid="B46">Pavesi, 2006</xref>). Moreover, the low fidelity of replication in RNA viruses compelled them to execute compact genomic structure since the longer genome size increases the chances of higher mutational load (<xref ref-type="bibr" rid="B27">Holland et al., 1982</xref>; <xref ref-type="bibr" rid="B12">Domingo, 1997</xref>). Hence, gene overlapping act as an effective mechanism of compacting genome size (<xref ref-type="bibr" rid="B8">Chirico et al., 2010</xref>). <xref ref-type="bibr" rid="B59">Simon-Loriere et al. (2013)</xref> showed that OVRs in RNA viruses possessed lower rates of evolution as compared to non-overlapping counterparts. This may in turn constrain the adaptive potential of RNA viruses (<xref ref-type="bibr" rid="B17">Elena et al., 2006</xref>; <xref ref-type="bibr" rid="B3">Belshaw et al., 2008</xref>). In addition to that OVRs also could encode proteins with novel features. In another study, <xref ref-type="bibr" rid="B49">Rancurel et al. (2009)</xref> reported that OVRs in unspliced RNA virus genomes encode significant amount of structural disorder. These intrinsically disordered proteins play an essential part in viral proteome as they are typically enriched with Short linear motifs (SLiMs) (<xref ref-type="bibr" rid="B21">Fuxreiter et al., 2007</xref>), post-translational modification sites (PTMs; <xref ref-type="bibr" rid="B22">Gao et al., 2013</xref>; <xref ref-type="bibr" rid="B37">Kurotani et al., 2014</xref>), and proteolytic cleavage sites (<xref ref-type="bibr" rid="B19">Fan et al., 2014</xref>). Viruses extensively use these sites to interact and hijack host cellular machinery and thus intrinsically disordered proteins help in viral spread and invasion in host system (<xref ref-type="bibr" rid="B25">Hagai et al., 2014</xref>).</p>
<p>Human Immunodeficiency Virus (HIV) is a retrovirus that has evolved from simian immunodeficiency virus (SIV; <xref ref-type="bibr" rid="B58">Sharp and Hahn, 2011</xref>) causing acquired immunodeficiency syndrome (AIDS) in human. Though HIV-1 encodes only eight proteins, yet is a potential human pathogen hijacking the entire human cellular machinery. Significant volume of research concentrates to explore how do retroviruses like HIV-1 with such small genome can evade the host&#x2019;s complex immune system (<xref ref-type="bibr" rid="B34">Johnson and Desrosiers, 2002</xref>; <xref ref-type="bibr" rid="B6">Castiglione and Bernaschi, 2005</xref>). Studies on human HIV-1 interactome revealed that HIV-1 generally targets host hub like proteins (<xref ref-type="bibr" rid="B18">Evans et al., 2009</xref>; <xref ref-type="bibr" rid="B26">Halehalli and Nagarajaram, 2015</xref>). Targeting host hubs could intervene with many other interactions essential for cellular maintenance and survival. Although, it is known that HIV-1 proteins can interact and regulate a number of host proteins, there is a dearth of research explaining the exact structural properties of HIV-1 proteins that help the virus to fascinate multiple interactions with host machinery. It has been previously explored that HIV-1 genome contains considerable numbers of coding regions that are overlapping in nature (<xref ref-type="bibr" rid="B42">Mayrose et al., 2013</xref>). However, if there is any structural specialty of proteins encoded by these OVRs which in turn facilitates HIV-1 interaction with host machinery remains to be elucidated. Hence, our study aims to explore whether OVRs in HIV-1 genome encode intrinsically disordered proteins or not, if yes, then how these disordered regions are utilized to successfully invade host cellular machinery.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Preparation of Overlapping Gene Dataset from Genomes of HIV-1 Group M Subtype</title>
<p>We chose HIV-1 group M as our model system of study because the group M subtype is the most predominant HIV-1 strains which are responsible for AIDS pandemic. HIV-1 group M reference sequences were downloaded from <ext-link ext-link-type="uri" xlink:href="http://www.hiv.lanl.gov">http://www.hiv.lanl.gov</ext-link>. The list of HIV-1 group M subtype genomes used in this study is provided in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref></bold>. All of these group M subtype genomes were of non-recombinant types. The genomic coordinates of each gene from each of the HIV-1 genomes were downloaded from GenBank. From these genomic coordinates, we have identified regions of gene to gene overlap in the respective genomes (<xref ref-type="bibr" rid="B52">Sabath et al., 2008</xref>). We considered those regions as true overlaps where a two adjacent genes involved in overlapping are in different reading frames. The details of overlapping gene pairs of each HIV-1 genome used in this study, along with their genomic coordinates and reading frames are enlisted in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref></bold>.</p>
</sec>
<sec><title>Prediction of Protein Intrinsic Disorder in HIV-1 Proteome</title>
<p>The residue wise disorder score for each protein was predicted by IUPred algorithm (<xref ref-type="bibr" rid="B13">Dosztanyi et al., 2005a</xref>) using the option to predict long disordered regions. We choose IUPred algorithm for disorder prediction as IUPred didn&#x2019;t train on any specific dataset and hence it gives an unbiased estimate of disorder score (<xref ref-type="bibr" rid="B15">Dosztanyi et al., 2007</xref>). Moreover, this algorithm has been validated to reflect truly the structural status of proteins in viral genomes (<xref ref-type="bibr" rid="B25">Hagai et al., 2014</xref>). Based on the pairwise interaction energy score, IUPred assigns disorder scores for each amino acid (<xref ref-type="bibr" rid="B14">Dosztanyi et al., 2005b</xref>). Residues having a disorder score above threshold value 0.5 were considered as disordered. From these predicted disorder scores, we calculated several measures of protein disorder content. We have measured the percentage of disordered residues by calculating the total disordered residues to the total amino acid residues of the proteins. Next, the percentages of disordered residues were averaged for all proteins in the proteome.</p>
</sec>
<sec><title>Prediction of Short Linear Motif (SLiMs)</title>
<p>The short linear motif patterns were downloaded from ELMdb<sup><xref ref-type="fn" rid="fn01">1</xref></sup> (<xref ref-type="bibr" rid="B11">Dinkel et al., 2012</xref>). The prediction of SLiMs on HIV-1 proteome was done using in-house perl scripts. In order to predict true SLiM in OVR as well as in non-overlapping region (NOVR) we took an approach similar to <xref ref-type="bibr" rid="B25">Hagai et al. (2014)</xref> where, we have shuffled the entire protein sequence of each gene 1000 times and predicted SLiMs in the shuffled set of sequences. The SLiMs that were retained in both the shuffled set of sequences as well as the original type were referred to be false positives, while the SLiMs that occurred in original sequences but disappeared upon shuffling were referred to as the original or true SLiMs. We scored each SLiM according to their occurrences in the randomly shuffled sequence set, for instance if a SLiM occur 10 times out of 1000, then the probability of occurrence of that SLiM in shuffled set is 0.01. Next, we classified those SLiMs as true SLiMs which has a probability of occurrence in the shuffled set is less than 0.1. The sequences were shuffled using the Ushuffle program (<xref ref-type="bibr" rid="B33">Jiang et al., 2008</xref>) with seed 10<sup>-4</sup>. Experimentally validated SLiMs that are involved in interaction with host proteins were retrieved from the published dataset of <xref ref-type="bibr" rid="B9">Davey et al. (2011)</xref> and <xref ref-type="bibr" rid="B26">Halehalli and Nagarajaram (2015)</xref>.</p>
<p>We calculated SLiMs which are present within intrinsically disordered residues, since they may act as potential molecular recognition features (<xref ref-type="bibr" rid="B44">Mohan et al., 2006</xref>). We identified true SLiMs whose positions coincided with intrinsically disordered residues and termed them as functionally potential SLiMs.</p>
</sec>
<sec><title>Estimation of SLiM Conservation Level</title>
<p>Conservation levels of the respective true SLiM positions were estimated where a conservation score was assigned to each of the SLiMs identified. The conservation score was calculated as the ratio of the number of HIV-1 genome in which a given SLiM was retained in the same position in all orthologous genes of the respective genomes and the total number of HIV-1 genomes studied.</p>
</sec>
<sec><title>Enrichment of Protein Phosphorylation Sites in HIV-1 Proteome</title>
<p>Prediction of protein phosphorylation (PP) sites were done using NetPhos 2.0 (<xref ref-type="bibr" rid="B4">Blom et al., 1999</xref>) which gives accurate predictions using neural network for serine, threonine and tyrosine residues in a given protein. It assigns a score to each serine, threonine or tyrosine residues indicating the probability of these residues of getting phosphorylated. Residues with scores greater than 0.50, indicate potential PP sites. List of experimentally validated phosphorylation sites were obtained from (<xref ref-type="bibr" rid="B57">Schwartz and Church, 2010</xref>).</p>
</sec>
<sec><title>Statistical Analyses</title>
<p>Mann&#x2013;Whitney <italic>U</italic> test was used for pairwise comparison of different factors used in this study. Spearman&#x2019;s bivariate correlations were used to find out the association between factors. All of these statistical analyses were done using SPSS 13.0 package.</p>
</sec>
</sec>
<sec><title>Results</title>
<sec><title>OVRs of HIV-1 Genome Tend to Encode Intrinsically Disordered Residues</title>
<p>Previously, it was reported that the HIV-1 genome contains overlapping genesthat contribute to synonymous site conservation in the virus (<xref ref-type="bibr" rid="B42">Mayrose et al., 2013</xref>). Here, we identified 235 OVRs of varying lengths across 35 HIV-1 group M subtype genomes (<bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref></bold>). The proportion of OVRs of a given genome was then calculated by summing up all the lengths of OVRs present in that genome, and then dividing it by the total length of the coding sequence of that genome. We observed that proportion of overlap varies from 4.8 to 8.0% of the total coding regions (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>; <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref></bold>). The density plot (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>) demonstrates that majority of the genomes have proportion of OVR of 8% of the total coding regions.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Density distribution of proportion of OVR (<italic>P<sub>ov</sub></italic>) across 35 HIV-1 subtypes showing the minimum and maximum values of <italic>P<sub>ov</sub></italic></bold>.</p></caption>
<graphic xlink:href="fmicb-07-01735-g001.tif"/>
</fig>
<p>It was reported that viruses of eukaryotes, often encode for proteins with unstructured regions those are less constrained and more likely to accumulate mutations (<xref ref-type="bibr" rid="B47">Pushker et al., 2013</xref>). This may in turn expedite the advent of novel functions during the evolution of the virus (<xref ref-type="bibr" rid="B23">Gitlin et al., 2014</xref>). Previously, it was shown that OVR in unspliced RNA viruses encode an elevated degree of protein intrinsic disorder (PID; <xref ref-type="bibr" rid="B49">Rancurel et al., 2009</xref>). So, we intended to investigate whether the OVRs of HIV-1 genome have the potency to encode PID. In calculating the proportion of disordered residues within the OVRs and NOVRs of all the genes of 35 HIV-1 group M subtype genomes, it was noticed that disordered residues are significantly (<italic>P</italic> = 0.003) overrepresented in OVR than NOVR (<bold>Figure <xref ref-type="fig" rid="F2">2A</xref></bold>). To study the effect of gene overlapping on PID more explicitly, we decided to split the data into eight HIV-1 genes (<italic>tat, rev, gag, pol, vif, vpr, env</italic>, and <italic>vpu</italic>). It was observed that except for <italic>vpu</italic> (which has a very ordered and stable secondary structure and a very little number of disordered residues in OVR), all other proteins encode considerable amount of PID in OVR as compared to that in NOVR (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>). In addition, we calculated the proportion of intrinsically disordered residues for each gene of 35 HIV-1 subtype genomes. It was observed that the proportion of OVRs of each gene shared a strong, significant positive correlation with the proportion of predicted intrinsically disordered residues in the protein encoded by the corresponding genes (<bold>Figure <xref ref-type="fig" rid="F2">2C</xref></bold>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>(A)</bold> Boxplot showing difference in distribution of proportion of intrinsically disordered residues in overlapping region (OVR) and non-overlapping region (NOVR) of 35 HIV-1 subtype genomes <bold>(B)</bold> Graphical representation of disorder tendency across the entire protein length of 8 proteins of HIV-1group M genome. The analysis was carried out with <italic>gag</italic> (BAF31333.1), <italic>env</italic> (BAF31340.1), <italic>tat</italic> (BAF31337.1), <italic>rev</italic> (BAF31338.1), <italic>vif</italic> (BAF31335.1), <italic>vpu</italic> (BAF31339.1), <italic>vpr</italic> (BAF31336.1), and <italic>pol</italic> (BAF31334.1) genes of a Rwandan subtype, AB253421. The four catalytic domains of <italic>pol</italic> gene have been pointed. The OVRs are highlighted in gray. <bold>(C)</bold> Scattered plot showing the correlation between proportion of OVR and proportion of disordered residues across the different genes of HIV-1 subtype genomes <bold>(D)</bold> Protein Databank (PDB) structure of <italic>vpr</italic> protein (PDB ID &#x2013; 1M8L) with OVR highlighted in red <bold>(E)</bold> PDB structure of <italic>vpu</italic> protein (PDB ID &#x2013; 1VPU) with OVR highlighted in red.</p></caption>
<graphic xlink:href="fmicb-07-01735-g002.tif"/>
</fig>
<p>Moreover, OVR stretches are short and terminal in nature, and short sequences are predicted to be highly disordered. We next decided to investigate whether the increase of structural disorder in OVR is independent of the length of the protein. We did a linear regression analysis taking total disordered residues in a given protein and its length as independent covariates and OVR length as dependent variable. The result of the multiple linear regression analysis is represented in <bold>Table <xref ref-type="table" rid="T1">1</xref></bold>. It could be interpreted from the given multivariate linear regression that increase of intrinsic disorder in OVR is independent of protein length.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Multivariate linear regression analysis between length of overlapping regions (OVRs) (independent variable) and disordered residues in OVR, total protein length.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Covariates</th>
<th valign="top" align="center">&#x03B2;</th>
<th valign="top" align="center"><italic>P</italic>-value</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Number of disordered residues</td>
<td valign="top" align="center">3.82</td>
<td valign="top" align="center">1.6 &#x00D7; 10<sup>-4</sup></td>
</tr>
<tr>
<td valign="top" align="left">Total protein length</td>
<td valign="top" align="center">4.99</td>
<td valign="top" align="center">10<sup>-6</sup></td>
</tr>
<tr>
<td valign="top" align="left"></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Next, we planned to validate whether regions predicted by IUPRED is actually disordered or not. For this purpose we retrieved the structure of individual HIV-1 proteins from Protein Databank (PDB) and tried to ascertain whether the OVR in individual proteins are actually disordered. It was previously shown by <xref ref-type="bibr" rid="B38">Le Gall et al. (2007)</xref> that the missing residues (unobserved residues) in X ray crystallographic structures could be structurally disordered, due to their highly dynamic atom positions which vary significantly over time. Taking this into consideration, we have mapped the OVR in PDB structure of HIV-1 proteins, the result of which are delineated in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM2">2</xref></bold>. It was observed that OVRs are overrepresented in the &#x201C;unobserved&#x201D; or missing residue category of PDB structure. Thus, it is reasonable to assume that OVR residues could be highly dynamic and its structure could not be deciphered using standard X-Ray crystallographic protocols. Moreover, the structure of <italic>vpr</italic> and <italic>vpu</italic> are delineated in <bold>Figures <xref ref-type="fig" rid="F2">2D,E</xref></bold> with their OVR highlighted in red. It could be observed that the highlighted region, i.e., OVR are structurally more flexible.</p>
</sec>
<sec><title>OVRs Facilitate Enrichment of SLiMs and PPs in HIV-1 Genome</title>
<p>In the previous section, we observed that OVR encodes significant amount of PID. It has been previously reported that PID plays pivotal role in protein&#x2013;protein interactions (<xref ref-type="bibr" rid="B60">Uversky, 2013</xref>). Therefore, we were curious to investigate whether OVRs play any role in virus&#x2013;host interaction or not. In the recent years, there are considerable volume of researches that discovered a profound role of SLiMs in viral proteome that readily facilitate virus host protein&#x2013;protein interaction (<xref ref-type="bibr" rid="B25">Hagai et al., 2014</xref>). Thus, preponderance of these types of motifs in a given viral sequence often acts as an indicator of its involvement in protein&#x2013;protein interaction. Henceforth, we tried to ascertain whether OVRs in HIV-1 genome contains SLiMs to initiate virus&#x2013;host interaction. In this context, we intended to find out whether OVRs, apart from encoding structural disorder are also involved in encoding SLiMs that could be essential for host&#x2013;viral interactions. We scanned all the genes involved in overlapping across 35 HIV-1 group M genomes and figured out the position of different SLiMs along the entire protein length. Similarly, we shuffled individual protein sequences and figured out the probability of occurrence of SLiMs in the shuffled set. The SLiMs that occurred at a low frequency or does not occur at all in shuffled set were considered as true SLiMs. It was observed that OVRs are enriched in true SLiMs as compared to NOVR which has a depleted density of true SLiMs. (<bold>Figure <xref ref-type="fig" rid="F3">3A</xref></bold>). Moreover, it was observed that the proportion of OVRs of a gene shares a significantly positive correlation with the SLiM density of the concerned genes (<bold>Figure <xref ref-type="fig" rid="F3">3B</xref></bold>). We next asked whether SLiMs residing in OVRs were functionally more important than the ones residing in NOVRs. Previously, <xref ref-type="bibr" rid="B25">Hagai et al. (2014)</xref> determined the functional potency of SLiMs by calculating the number of SLiMs residing in disordered region of proteins. They have proposed that SLiMs, residing within locally flexible disordered regions are more suitable to participate in protein&#x2013;protein interactions and thus these SLiMs are termed as functionally potential SLiMs. Similar to their analysis, we also computed densities of functionally potential SLiMs (SLiMs localizing within structurally disordered residues of proteins) in OVR as well as in NOVRs of HIV-1 genomes. It was observed that densities of functionally potential SLiMs were far higher in the OVR as compared to the NOVR (<bold>Figure <xref ref-type="fig" rid="F3">3C</xref></bold>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>(A)</bold> Boxplot showing the distribution difference of true short linear motifs (SLiM) density between OVR and NOVR along with significance level indicated as <italic>P</italic>-value of Mann&#x2013;Whitney <italic>U</italic> test. <bold>(B)</bold> Scattered plot showing the correlation between proportion of OVR and true SLiM density across the different genes of HIV-1 subtype genomes <bold>(C)</bold> Boxplot showing the distribution difference of functionally potential SLiM density between OVR and NOVR along with significance level indicated as <italic>P</italic>-value of Mann&#x2013;Whitney <italic>U</italic> test. <bold>(D)</bold> Boxplot showing no significant distribution difference of conservation score of SLiM between OVR and NOVR along with significance level indicated as <italic>P</italic>-value of Mann&#x2013;Whitney <italic>U</italic> test.</p></caption>
<graphic xlink:href="fmicb-07-01735-g003.tif"/>
</fig>
<p>We further compiled a dataset of experimentally validated interaction SLiMs that are reported to interact with host proteome. We retrieved the list of genes and their corresponding interaction SLiMs <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3a</xref></bold>), and found out whether these SLiMs are located within OVR or NOVR of that gene. Out of 71 SLiM-gene pair only 21 unique SLiM-gene pairs were observed to be matching with our true SLiM dataset. It was observed that interacting SLiMs were enriched in OVR for <italic>tat</italic> and <italic>rev</italic> proteins, <italic>gag</italic> protein also contained a few experimentally validated SLiMs in their OVR. In contrast, <italic>vif, vpu</italic> and <italic>vpr</italic>, and <italic>env</italic> have all their experimentally validated SLiMs in NOVR (<bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3b</xref></bold>).</p>
<p>Next we intended to find out whether the SLiMs discovered in the OVR of HIV-1 proteins are well conserved in their position across all subtypes of HIV-1 genome. For this, we have estimated the conservation level of each SLiMs located within OVRs as well as in the NOVRs. It was observed that there was no significant difference of conservation level between SLiMs located in the OVR to that in the NOVR <bold>Figure <xref ref-type="fig" rid="F3">3D</xref></bold>. This may be due to higher intrinsic disorder content of OVR which relaxes the level of purifying selection on these SLiMs.</p>
<p>Another potential indicator of virus&#x2013;host interaction is PTMs; <xref ref-type="bibr" rid="B29">Hundt et al., 2013</xref>). Precisely, among the different PTMs that take place <italic>in vivo</italic>, PP is one of the most important and well studied types of PTM. In previous works, PP has been shown to play a pivotal role in HIV-1 interactome (<xref ref-type="bibr" rid="B20">Francis et al., 2011</xref>). Therefore, we were interested in exploring the role of gene overlapping in effecting PP across the protein length. In order to ascertain this we have mapped the OVR of each gene and calculated the PP density of OVRs as well NOVRs. It was observed that the PP densities of OVRs were far greater than that of NOVRs (<bold>Figure <xref ref-type="fig" rid="F4">4A</xref></bold>). It was also observed that the proportion of gene overlapping correlated with a fraction of PP density of a given gene (<bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>). We have confirmed our results using experimentally validated PP sites, the results of which are delineated in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM4">4</xref></bold>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>(A)</bold> Boxplot showing the distribution difference of protein phosphorylation (PP) density between OVR and NOVR along with significance level indicated as <italic>P</italic>-value of Mann&#x2013;Whitney <italic>U</italic> test. <bold>(B)</bold> Scattered plot showing the correlation between proportion of OVR and PP density across the different genes of HIV-1 subtype genomes.</p></caption>
<graphic xlink:href="fmicb-07-01735-g004.tif"/>
</fig>
</sec>
<sec><title>Role of PID and OVR in Acquisition of SLiMs and PPs HIV-1 Genome</title>
<p>We have found that OVRs attributed to the overrepresentation of SLiMs and PPs in HIV-1 genome. Now, it was also reported that SLiMs and PP sites encompass affluence of disordered residues (<xref ref-type="bibr" rid="B30">Iakoucheva et al., 2004</xref>; <xref ref-type="bibr" rid="B43">Meszaros et al., 2012</xref>). Thus, we have asked whether the enrichment of SLiMs and PP sites are by-products of an abundance of intrinsic disorder in the respective OVRs or OVRs by virtue harbors more SLiMs and PP sites as compared to NOVRs. In other words, we were interested to find out whether the overrepresentation of SLiMs and PPs in OVRs are independent of PID. For this, divided each HIV-1 subtype proteome into three categories- (i) OVRs with 80% or more of its residues in disordered state (DO) (ii) OVRs with 80% of its residue in the structured state (SO) (iii) NOVR with 80% or more of its residue in structured state PID (SN). Our hypothesis, in this context was that DO would entail elevated degree of SLiMs and PPs as compared to SO and SN, since it is highly disordered as well as overlapping in nature. In addition, SO would entail an elevated density of PP and SLiM as compared to SN if the occurrences of PPs and SLiMs are mutually exclusive of PID. This is because the phenomenon of gene overlapping will itself be responsible for enrichment SLiMs and PPs in SO, whereas SN (where there is no genetic overlap) would definitely confer depleted density of SLiMs and PPs. On the other hand, the absence of significant difference of PP and SLiM density between SO and SN would reflect that PID is the reason behind the elevated densities of PPs and SLiMs in the OVR and OVR doesn&#x2019;t have any significant impact over PP and SLiM density. Finally, we estimated the SLiM and PP density for these three categories separately, the result of which are delineated in (<bold>Figure <xref ref-type="fig" rid="F5">5</xref></bold>). It could be observed that SLiM content is higher in SO as compared to SN which reflects that acquisition of elevated frequency of SLiMs in the OVRs is mutually exclusive of PID. On the contrary occurrence of PP was not mutually exclusive of PID since there was no significant difference of PP density between SO and SN groups. Hence, it could be interpreted that although acquisition of SLiMs in OVR is exclusive of PID, but enrichment of PP in OVR was mainly due to increased disorder content of OVRs.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p><bold>The distribution difference of PP and SLiM Density between DO, SN, and SO group with <italic>P-values</italic> indicating the level of significance of Mann-Whitney <italic>U</italic> test</bold>.</p></caption>
<graphic xlink:href="fmicb-07-01735-g005.tif"/>
</fig>
</sec>
</sec>
<sec><title>Discussion</title>
<p>Gene overlapping is a common phenomenon in viral genomes and has several important implications and significances in viral evolution (<xref ref-type="bibr" rid="B28">Hughes et al., 2001</xref>; <xref ref-type="bibr" rid="B24">Guyader and Ducray, 2002</xref>; <xref ref-type="bibr" rid="B46">Pavesi, 2006</xref>; <xref ref-type="bibr" rid="B62">Zhao et al., 2007</xref>). Other than genome size evolution, gene overlapping might increase selective constraints at synonymous sites of viral genes (<xref ref-type="bibr" rid="B45">Ngandu et al., 2008</xref>). Our study is focused to investigate the role of OVR in viral pathogenicity. In order to do so, we have analyzed the properties of OVR encoded proteins and noticed that they encode a considerable amount of structural disorder as compared to NOVRs. This observation is in agreement with a previous report by <xref ref-type="bibr" rid="B49">Rancurel et al. (2009)</xref>, where it has been observed that OVRs of unspliced RNA viruses encodes an elevated degree of PID. Their work put forward a noble explanation behind such an observation. Since disordered proteins are under weaker selection pressure than the ordered ones (<xref ref-type="bibr" rid="B5">Brown et al., 2002</xref>), they proposed that PID encoded by OVRs might balance out the excessive amount of evolutionary constraints imposed over them due to their dual coding natures. As a result, the protein portions encoded by OVRs are structurally flexible in order to shed off their structural complexities without losing their key functions. Earlier, studies on amino acid compositions of OVRs of viral proteins revealed a strong bias toward highly degenerate codons which are found to encode PID (<xref ref-type="bibr" rid="B49">Rancurel et al., 2009</xref>). However, OVRs were previously shown to be evolutionarily conserved as compared to the NOVRs (<xref ref-type="bibr" rid="B59">Simon-Loriere et al., 2013</xref>) and on the contrary disordered residues have been previously shown to evolve faster and are under weaker selection pressure (<xref ref-type="bibr" rid="B23">Gitlin et al., 2014</xref>). Thus, apparently it is surprising that these evolutionarily conserved regions encode sufficient amount of PID. PID often mediates massive protein&#x2013;protein interaction that could add functional constrain to the evolution of regions encoding PID (<xref ref-type="bibr" rid="B5">Brown et al., 2002</xref>; <xref ref-type="bibr" rid="B7">Chen et al., 2006</xref>; <xref ref-type="bibr" rid="B2">Bellay et al., 2011</xref>). Moreover, enrichment of phosphorylation sites in intrinsically disordered regions could also make these regions evolutionarily conserved as site of PP may involve in many different molecular functions (<xref ref-type="bibr" rid="B39">Maathuis, 2008</xref>). Thus, intrinsically disordered regions sometimes show higher level of conservation as compared to structured regions and so OVRs in spite of encoding elevated degree of protein intrinsic disorder could be evolutionarily constrained. In order to investigate whether PID encoded by OVR are indeed responsible for massive protein&#x2013;protein interaction, we decided to explore whether there is (i) over-representation of SLiMs (ii) abundance of PP sites in OVRs. In agreement to our hypothesis we indeed noticed a significant over-representation of PP and SLiMs in the OVRs.</p>
<p>It is already known that viruses with extremely diminished genome size encode proteins of short length. Thus, structured domains are seldom in viral proteome (<xref ref-type="bibr" rid="B50">Rappoport and Linial, 2012</xref>). Rather, extensive usage of SLiMs and in viruses facilitate hijacking of host cellular machineries and evade host immune system (<xref ref-type="bibr" rid="B25">Hagai et al., 2014</xref>). Again, interaction motifs are observed to be rapidly evolving and thus facilitate quick rewiring of virus&#x2013;host interaction network (<xref ref-type="bibr" rid="B32">Jackson et al., 2003</xref>). Given the fact that RNA viruses accumulate more mutations than DNA viruses, tolerability of SLiMs to mutations thus facilitate a lesser extent of disruption of important interactions with the host (<xref ref-type="bibr" rid="B25">Hagai et al., 2014</xref>). In other words, SLiMs have shorter interfaces as compared to globular domains and are involved in transient or conditional interactions with the host (<xref ref-type="bibr" rid="B10">Davey et al., 2012</xref>). Henceforth they are more suitable to mimic multiple host peptides by virtue of their versatile interaction modes (<xref ref-type="bibr" rid="B16">Duro et al., 2015</xref>). This promiscuously is highly favored by occurrences of conformational flexibility of regions surrounded by these peptide motifs (<xref ref-type="bibr" rid="B21">Fuxreiter et al., 2007</xref>). As a result SLiMs in viruses are often localized into short stretches of disordered residues. PID thus imparts additional functional potency to these peptide motifs. Hence, we have determined the distribution of functionally potent SLiMs in OVRs and NOVRs of HIV-1 genome. Consequently, by analyzing distribution of predicted SLiMs, it was observed that SLiMs in the OVR tend to be more functionally potential (Beauclair et al., 2013). In addition to SLiMs, OVRs was also observed to contain significant amount of predicted PP sites compared to NOVRs. Over-representation of PP sites in OVRs further reflects the importance of gene overlapping in HIV-1 genomes. Previously, there has been range of studies focusing on the role of PP in the development of productive infection cycle by viruses within their host (<xref ref-type="bibr" rid="B35">Keck et al., 2015</xref>). PPs of HIV-1 proteins have been found to be associated with nuclear transport, replication, repair, and transcription (<xref ref-type="bibr" rid="B31">Idriss et al., 1999</xref>; <xref ref-type="bibr" rid="B20">Francis et al., 2011</xref>; <xref ref-type="bibr" rid="B36">Kudoh et al., 2014</xref>). For instance, HIV-1 integrase has been shown to be directly phosphorylated by host kinases in order to integrate into the host genome efficiently (<xref ref-type="bibr" rid="B41">Manganaro et al., 2010</xref>). On the other hand extensive phosphorylation of viral proteins might in favor of the pathogen and increase the affinity of a given protein to participate in protein&#x2013;protein interactions with the host (<xref ref-type="bibr" rid="B35">Keck et al., 2015</xref>). Thereby, accumulation of PP residues within OVRs also suggests its importance in development of viral infection and increasing viral pathogenicity.</p>
<p>Our study also sought to address whether OVRs by virtue accumulate SLiMs and PPs or it is the conformational flexibility of intrinsically disordered regions within OVRs that drives the increased frequency of SLiMs and PP residues in these regions. Our observation undoubtedly reveals that the impacts of PID and OVR in increasing SLiM density over these regions are mutually exclusive. However, the enrichment of PP sites in OVR is a byproduct of elevated structural disorder in these regions. This finding adds another interesting dimension to our study.</p>
<p>Hence, our work sheds light on the significance of overlapping genes in the virulence of HIV-1. Thus, it paves inroads for future studies to explore the therapeutic tools against these culprit regions to overcome AIDs.</p>
</sec>
<sec><title>Author Contributions</title>
<p>All authors contributed to the design of experiments, the interpretation of data, and drafting of the manuscript. Experiments were performed by DS.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> The study was supported by Department of Science and Technology, Government of India.</p>
</fn>
</fn-group>
<ack>
<p>We acknowledge Department of Biotechnology, Government of India for financial help. Authors are thankful to four referees for their critical and constructive comments in improving the manuscript. We are thankful to Dr. Kamalika Sen for critical reading of the manuscript. We also thank Mr. S. K. Gupta for his technical help.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fmicb.2016.01735/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fmicb.2016.01735/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.XLS" id="SM1" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
<supplementary-material xlink:href="Table_2.XLS" id="SM2" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
<supplementary-material xlink:href="Table_3.XLS" id="SM3" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
<supplementary-material xlink:href="Table_4.XLS" id="SM4" mimetype="application/vnd.ms-excel" xmlns:xlink="http://www.w3.org/1999/xlink">
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barrell</surname> <given-names>B. G.</given-names></name> <name><surname>Air</surname> <given-names>G. M.</given-names></name> <name><surname>Hutchison</surname> <given-names>C. A.</given-names></name></person-group> (<year>1976</year>). <article-title>Overlapping genes in bacteriophage phiX174.</article-title> <source><italic>Nature</italic></source> <volume>264</volume> <fpage>34</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1038/264034a0</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bellay</surname> <given-names>J.</given-names></name> <name><surname>Han</surname> <given-names>S.</given-names></name> <name><surname>Michaut</surname> <given-names>M.</given-names></name> <name><surname>Kim</surname> <given-names>T.</given-names></name> <name><surname>Costanzo</surname> <given-names>M.</given-names></name> <name><surname>Andrews</surname> <given-names>B. J.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Bringing order to protein disorder through comparative genomics and genetic interactions.</article-title> <source><italic>Genome Biol.</italic></source> <volume>12</volume>:<issue>R14</issue>. <pub-id pub-id-type="doi">10.1186/gb-2011-12-2-r14</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belshaw</surname> <given-names>R.</given-names></name> <name><surname>Gardner</surname> <given-names>A.</given-names></name> <name><surname>Rarnbaut</surname> <given-names>A.</given-names></name> <name><surname>Pybus</surname> <given-names>O. G.</given-names></name></person-group> (<year>2008</year>). <article-title>Pacing a small cage: mutation and RNA viruses.</article-title> <source><italic>Trends Ecol. Evol.</italic></source> <volume>23</volume> <fpage>188</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1016/j.tree.2007.11.010</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blom</surname> <given-names>N.</given-names></name> <name><surname>Gammeltoft</surname> <given-names>S.</given-names></name> <name><surname>Brunak</surname> <given-names>S.</given-names></name></person-group> (<year>1999</year>). <article-title>Sequence and structure-based prediction of eukaryotic protein phosphorylation sites.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>294</volume> <fpage>1351</fpage>&#x2013;<lpage>1362</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.1999.3310</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. J.</given-names></name> <name><surname>Takayama</surname> <given-names>S.</given-names></name> <name><surname>Campen</surname> <given-names>A. M.</given-names></name> <name><surname>Vise</surname> <given-names>P.</given-names></name> <name><surname>Marshall</surname> <given-names>T. W.</given-names></name> <name><surname>Oldfield</surname> <given-names>C. J.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>Evolutionary rate heterogeneity in proteins with long disordered regions.</article-title> <source><italic>J. Mol. Evol.</italic></source> <volume>55</volume> <fpage>104</fpage>&#x2013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1007/s00239-001-2309-6</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castiglione</surname> <given-names>F.</given-names></name> <name><surname>Bernaschi</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>HIV-1 strategies of immune evasion.</article-title> <source><italic>Int. J. Mod. Phys. C</italic></source> <volume>16</volume> <fpage>1869</fpage>&#x2013;<lpage>1878</lpage>. <pub-id pub-id-type="doi">10.1142/S0129183105008394</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>J. W.</given-names></name> <name><surname>Romero</surname> <given-names>P.</given-names></name> <name><surname>Uversky</surname> <given-names>V. N.</given-names></name> <name><surname>Dunker</surname> <given-names>A. K.</given-names></name></person-group> (<year>2006</year>). <article-title>Conservation of intrinsic disorder in protein domains and families: II. functions of conserved disorder.</article-title> <source><italic>J. Proteome Res.</italic></source> <volume>5</volume> <fpage>888</fpage>&#x2013;<lpage>898</lpage>. <pub-id pub-id-type="doi">10.1021/pr060049p</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chirico</surname> <given-names>N.</given-names></name> <name><surname>Vianelli</surname> <given-names>A.</given-names></name> <name><surname>Belshaw</surname> <given-names>R.</given-names></name></person-group> (<year>2010</year>). <article-title>Why genes overlap in viruses.</article-title> <source><italic>Proc. Biol. Sci.</italic></source> <volume>277</volume> <fpage>3809</fpage>&#x2013;<lpage>3817</lpage>. <pub-id pub-id-type="doi">10.1098/rspb.2010.1052</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davey</surname> <given-names>N. E.</given-names></name> <name><surname>Trave</surname> <given-names>G.</given-names></name> <name><surname>Gibson</surname> <given-names>T. J.</given-names></name></person-group> (<year>2011</year>). <article-title>How viruses hijack cell regulation.</article-title> <source><italic>Trends Biochem. Sci.</italic></source> <volume>36</volume> <fpage>159</fpage>&#x2013;<lpage>169</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibs.2010.10.002</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davey</surname> <given-names>N. E.</given-names></name> <name><surname>Van Roey</surname> <given-names>K.</given-names></name> <name><surname>Weatheritt</surname> <given-names>R. J.</given-names></name> <name><surname>Toedt</surname> <given-names>G.</given-names></name> <name><surname>Uyar</surname> <given-names>B.</given-names></name> <name><surname>Altenberg</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Attributes of short linear motifs.</article-title> <source><italic>Mol. Biosyst.</italic></source> <volume>8</volume> <fpage>268</fpage>&#x2013;<lpage>281</lpage>. <pub-id pub-id-type="doi">10.1039/c1mb05231d</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dinkel</surname> <given-names>H.</given-names></name> <name><surname>Michael</surname> <given-names>S.</given-names></name> <name><surname>Weatheritt</surname> <given-names>R. J.</given-names></name> <name><surname>Davey</surname> <given-names>N. E.</given-names></name> <name><surname>Van Roey</surname> <given-names>K.</given-names></name> <name><surname>Altenberg</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>ELM-the database of eukaryotic linear motifs.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <fpage>D242</fpage>&#x2013;<lpage>D251</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1064</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Domingo</surname> <given-names>E.</given-names></name></person-group> (<year>1997</year>). <article-title>Rapid evolution of viral RNA genomes.</article-title> <source><italic>J. Nutr.</italic></source> <volume>127</volume> <fpage>S958</fpage>&#x2013;<lpage>S961</lpage>.</citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dosztanyi</surname> <given-names>Z.</given-names></name> <name><surname>Csizmok</surname> <given-names>V.</given-names></name> <name><surname>Tompa</surname> <given-names>P.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2005a</year>). <article-title>IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content.</article-title> <source><italic>Bioinformatics</italic></source> <volume>21</volume> <fpage>3433</fpage>&#x2013;<lpage>3434</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti541</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dosztanyi</surname> <given-names>Z.</given-names></name> <name><surname>Csizmok</surname> <given-names>V.</given-names></name> <name><surname>Tompa</surname> <given-names>P.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2005b</year>). <article-title>The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins.</article-title> <source><italic>FEBS J.</italic></source> <volume>272</volume> <fpage>360</fpage>&#x2013;<lpage>360</lpage>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dosztanyi</surname> <given-names>Z.</given-names></name> <name><surname>Sandor</surname> <given-names>M.</given-names></name> <name><surname>Tompa</surname> <given-names>P.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2007</year>). <article-title>Prediction of protein disorder at the domain level.</article-title> <source><italic>Curr. Protein Pept. Sci.</italic></source> <volume>8</volume> <fpage>161</fpage>&#x2013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.2174/138920307780363406</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Duro</surname> <given-names>N.</given-names></name> <name><surname>Miskei</surname> <given-names>M.</given-names></name> <name><surname>Fuxreiter</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Fuzziness endows viral motif-mimicry.</article-title> <source><italic>Mol. Biosyst.</italic></source> <volume>11</volume> <fpage>2821</fpage>&#x2013;<lpage>2829</lpage>. <pub-id pub-id-type="doi">10.1039/c5mb00301f</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Elena</surname> <given-names>S. F.</given-names></name> <name><surname>Carrasco</surname> <given-names>P.</given-names></name> <name><surname>Daros</surname> <given-names>J. A.</given-names></name> <name><surname>Sanjuan</surname> <given-names>R.</given-names></name></person-group> (<year>2006</year>). <article-title>Mechanisms of genetic robustness in RNA viruses.</article-title> <source><italic>EMBO Rep.</italic></source> <volume>7</volume> <fpage>168</fpage>&#x2013;<lpage>173</lpage>. <pub-id pub-id-type="doi">10.1038/sj.embor.7400636</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evans</surname> <given-names>P.</given-names></name> <name><surname>Dampier</surname> <given-names>W.</given-names></name> <name><surname>Ungar</surname> <given-names>L.</given-names></name> <name><surname>Tozeren</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs.</article-title> <source><italic>BMC Med. Genomics</italic></source> <volume>2</volume>:<issue>27</issue>. <pub-id pub-id-type="doi">10.1186/1755-8794-2-27</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fan</surname> <given-names>X.</given-names></name> <name><surname>Xue</surname> <given-names>B.</given-names></name> <name><surname>Dolan</surname> <given-names>P. T.</given-names></name> <name><surname>Lacount</surname> <given-names>D. J.</given-names></name> <name><surname>Kurgan</surname> <given-names>L.</given-names></name> <name><surname>Uversky</surname> <given-names>V. N.</given-names></name></person-group> (<year>2014</year>). <article-title>The intrinsic disorder status of the human hepatitis C virus proteome.</article-title> <source><italic>Mol. Biosyst.</italic></source> <volume>10</volume> <fpage>1345</fpage>&#x2013;<lpage>1363</lpage>. <pub-id pub-id-type="doi">10.1039/c4mb00027g</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Francis</surname> <given-names>A. C.</given-names></name> <name><surname>Di Primio</surname> <given-names>C.</given-names></name> <name><surname>Allouch</surname> <given-names>A.</given-names></name> <name><surname>Cereseto</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Role of phosphorylation in the nuclear biology of HIV-1.</article-title> <source><italic>Curr. Med. Chem.</italic></source> <volume>18</volume> <fpage>2904</fpage>&#x2013;<lpage>2912</lpage>. <pub-id pub-id-type="doi">10.2174/092986711796150478</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fuxreiter</surname> <given-names>M.</given-names></name> <name><surname>Tompa</surname> <given-names>P.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2007</year>). <article-title>Local structural disorder imparts plasticity on linear motifs.</article-title> <source><italic>Bioinformatics</italic></source> <volume>23</volume> <fpage>950</fpage>&#x2013;<lpage>956</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm035</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>F.</given-names></name> <name><surname>Luo</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>C.-T.</given-names></name></person-group> (<year>2013</year>). <article-title>DoriC 5.0: an updated database of oriC regions in both bacterial and archaeal genomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>41</volume> <fpage>D90</fpage>&#x2013;<lpage>D93</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks990</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gitlin</surname> <given-names>L.</given-names></name> <name><surname>Hagai</surname> <given-names>T.</given-names></name> <name><surname>Labarbera</surname> <given-names>A.</given-names></name> <name><surname>Solovey</surname> <given-names>M.</given-names></name> <name><surname>Andino</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>Rapid evolution of virus sequences in intrinsically disordered protein regions.</article-title> <source><italic>Plos Pathog.</italic></source> <volume>10</volume>:<issue>e1004529</issue>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1004529</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guyader</surname> <given-names>S.</given-names></name> <name><surname>Ducray</surname> <given-names>D. G.</given-names></name></person-group> (<year>2002</year>). <article-title>Sequence analysis of <italic>Potato leafroll virus</italic> isolates reveals genetic stability, major evolutionary events and differential selection pressure between overlapping reading frame products.</article-title> <source><italic>J Gen. Virol.</italic></source> <volume>83</volume> <fpage>1799</fpage>&#x2013;<lpage>1807</lpage>. <pub-id pub-id-type="doi">10.1099/0022-1317-83-7-1799</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagai</surname> <given-names>T.</given-names></name> <name><surname>Azia</surname> <given-names>A.</given-names></name> <name><surname>Babu</surname> <given-names>M. M.</given-names></name> <name><surname>Andino</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>Use of host-like peptide motifs in viral proteins is a prevalent strategy in host-virus interactions.</article-title> <source><italic>Cell Rep.</italic></source> <volume>7</volume> <fpage>1729</fpage>&#x2013;<lpage>1739</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2014.04.052</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halehalli</surname> <given-names>R. R.</given-names></name> <name><surname>Nagarajaram</surname> <given-names>H. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Molecular principles of human virus protein-protein interactions.</article-title> <source><italic>Bioinformatics</italic></source> <volume>31</volume> <fpage>1025</fpage>&#x2013;<lpage>1033</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu763</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holland</surname> <given-names>J.</given-names></name> <name><surname>Spindler</surname> <given-names>K.</given-names></name> <name><surname>Horodyski</surname> <given-names>F.</given-names></name> <name><surname>Grabau</surname> <given-names>E.</given-names></name> <name><surname>Nichol</surname> <given-names>S.</given-names></name> <name><surname>Vandepol</surname> <given-names>S.</given-names></name></person-group> (<year>1982</year>). <article-title>Rapid evolution of RNA genomes.</article-title> <source><italic>Science</italic></source> <volume>215</volume> <fpage>1577</fpage>&#x2013;<lpage>1585</lpage>. <pub-id pub-id-type="doi">10.1126/science.7041255</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hughes</surname> <given-names>A. L.</given-names></name> <name><surname>Westover</surname> <given-names>K.</given-names></name> <name><surname>Da Silva</surname> <given-names>J.</given-names></name> <name><surname>O&#x2019;connor</surname> <given-names>D. H.</given-names></name> <name><surname>Watkins</surname> <given-names>D. I.</given-names></name></person-group> (<year>2001</year>). <article-title>Simultaneous positive and purifying selection on overlapping reading frames of the tat and vpr genes of simian immunodeficiency virus.</article-title> <source><italic>J. Virol.</italic></source> <volume>75</volume> <fpage>7966</fpage>&#x2013;<lpage>7972</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.75.17.7966-7972.2001</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hundt</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Z. B.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name></person-group> (<year>2013</year>). <article-title>Post-translational modifications of hepatitis C viral proteins and their biological significance.</article-title> <source><italic>World J. Gastroenterol.</italic></source> <volume>19</volume> <fpage>8929</fpage>&#x2013;<lpage>8939</lpage>. <pub-id pub-id-type="doi">10.3748/wjg.v19.i47.8929</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iakoucheva</surname> <given-names>L. M.</given-names></name> <name><surname>Radivojac</surname> <given-names>P.</given-names></name> <name><surname>Brown</surname> <given-names>C. J.</given-names></name> <name><surname>O&#x2019;connor</surname> <given-names>T. R.</given-names></name> <name><surname>Sikes</surname> <given-names>J. G.</given-names></name> <name><surname>Obradovic</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>The importance of intrinsic disorder for protein phosphorylation.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>32</volume> <fpage>1037</fpage>&#x2013;<lpage>1049</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh253</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Idriss</surname> <given-names>H.</given-names></name> <name><surname>Kawa</surname> <given-names>S.</given-names></name> <name><surname>Damuni</surname> <given-names>Z.</given-names></name> <name><surname>Thompson</surname> <given-names>E. B.</given-names></name> <name><surname>Wilson</surname> <given-names>S. H.</given-names></name></person-group> (<year>1999</year>). <article-title>HIV-1 reverse transcriptase is phosphorylated in vitro and in a cellular system.</article-title> <source><italic>Int. J. Biochem. Cell Biol.</italic></source> <volume>31</volume> <fpage>1443</fpage>&#x2013;<lpage>1452</lpage>. <pub-id pub-id-type="doi">10.1016/S1357-2725(99)00097-7</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jackson</surname> <given-names>T.</given-names></name> <name><surname>King</surname> <given-names>A. M. Q.</given-names></name> <name><surname>Stuart</surname> <given-names>D. I.</given-names></name> <name><surname>Fry</surname> <given-names>E.</given-names></name></person-group> (<year>2003</year>). <article-title>Structure and receptor binding.</article-title> <source><italic>Virus Res.</italic></source> <volume>91</volume> <fpage>33</fpage>&#x2013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-1702(02)00258-7</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>M. H.</given-names></name> <name><surname>Anderson</surname> <given-names>J.</given-names></name> <name><surname>Gillespie</surname> <given-names>J.</given-names></name> <name><surname>Mayne</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>uShuffle: a useful tool for shuffling biological sequences while preserving the k-let counts.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>9</volume>:<issue>192</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-9-192</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>W. E.</given-names></name> <name><surname>Desrosiers</surname> <given-names>R. C.</given-names></name></person-group> (<year>2002</year>). <article-title>Viral persistence: HIV&#x2019;s strategies of immune system evasion.</article-title> <source><italic>Annu. Rev. Med.</italic></source> <volume>53</volume> <fpage>499</fpage>&#x2013;<lpage>518</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.med.53.082901.104053</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keck</surname> <given-names>F.</given-names></name> <name><surname>Ataey</surname> <given-names>P.</given-names></name> <name><surname>Amaya</surname> <given-names>M.</given-names></name> <name><surname>Bailey</surname> <given-names>C.</given-names></name> <name><surname>Narayanan</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>Phosphorylation of single stranded RNA virus proteins and potential for novel therapeutic strategies.</article-title> <source><italic>Viruses</italic></source> <volume>7</volume> <fpage>5257</fpage>&#x2013;<lpage>5273</lpage>. <pub-id pub-id-type="doi">10.3390/v7102872</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kudoh</surname> <given-names>A.</given-names></name> <name><surname>Takahama</surname> <given-names>S.</given-names></name> <name><surname>Sawasaki</surname> <given-names>T.</given-names></name> <name><surname>Ode</surname> <given-names>H.</given-names></name> <name><surname>Yokoyama</surname> <given-names>M.</given-names></name> <name><surname>Okayama</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>The phosphorylation of HIV-1 Gag by atypical protein kinase C facilitates viral infectivity by promoting Vpr incorporation into virions.</article-title> <source><italic>Retrovirology</italic></source> <volume>11</volume>:<issue>9</issue>. <pub-id pub-id-type="doi">10.1186/1742-4690-11-9</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kurotani</surname> <given-names>A.</given-names></name> <name><surname>Tokmakov</surname> <given-names>A. A.</given-names></name> <name><surname>Kuroda</surname> <given-names>Y.</given-names></name> <name><surname>Fukami</surname> <given-names>Y.</given-names></name> <name><surname>Shinozaki</surname> <given-names>K.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Correlations between predicted protein disorder and post-translational modifications in plants.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>1095</fpage>&#x2013;<lpage>1103</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt762</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le Gall</surname> <given-names>T.</given-names></name> <name><surname>Romero</surname> <given-names>P. R.</given-names></name> <name><surname>Cortese</surname> <given-names>M. S.</given-names></name> <name><surname>Uversky</surname> <given-names>V. N.</given-names></name> <name><surname>Dunker</surname> <given-names>A. K.</given-names></name></person-group> (<year>2007</year>). <article-title>Intrinsic disorder in the Protein Data Bank.</article-title> <source><italic>J. Biomol. Struct. Dyn.</italic></source> <volume>24</volume> <fpage>325</fpage>&#x2013;<lpage>342</lpage>. <pub-id pub-id-type="doi">10.1080/07391102.2007.10507123</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maathuis</surname> <given-names>F. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Conservation of protein phosphorylation sites within gene families and across species.</article-title> <source><italic>Plant Signal. Behav.</italic></source> <volume>3</volume> <fpage>1011</fpage>&#x2013;<lpage>1013</lpage>. <pub-id pub-id-type="doi">10.4161/psb.6721</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Makalowska</surname> <given-names>I.</given-names></name> <name><surname>Lin</surname> <given-names>C.-F.</given-names></name> <name><surname>Hernandez</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>Birth and death of gene overlaps in vertebrates.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>7</volume>:<issue>193</issue>. <pub-id pub-id-type="doi">10.1186/1471-2148-7-193</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manganaro</surname> <given-names>L.</given-names></name> <name><surname>Lusic</surname> <given-names>M.</given-names></name> <name><surname>Gutierrez</surname> <given-names>M. I.</given-names></name> <name><surname>Cereseto</surname> <given-names>A.</given-names></name> <name><surname>Del Sal</surname> <given-names>G.</given-names></name> <name><surname>Giacca</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Concerted action of cellular JNK and Pin1 restricts HIV-1 genome integration to activated CD4(+) T lymphocytes.</article-title> <source><italic>Nat. Med.</italic></source> <volume>16</volume> <fpage>329</fpage>&#x2013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.1038/nm.2102</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mayrose</surname> <given-names>I.</given-names></name> <name><surname>Stern</surname> <given-names>A.</given-names></name> <name><surname>Burdelova</surname> <given-names>E. O.</given-names></name> <name><surname>Sabo</surname> <given-names>Y.</given-names></name> <name><surname>Laham-Karam</surname> <given-names>N.</given-names></name> <name><surname>Zamostiano</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Synonymous site conservation in the HIV-1 genome.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>13</volume>:<issue>164</issue>. <pub-id pub-id-type="doi">10.1186/1471-2148-13-164</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meszaros</surname> <given-names>B.</given-names></name> <name><surname>Dosztanyi</surname> <given-names>Z.</given-names></name> <name><surname>Simon</surname> <given-names>I.</given-names></name></person-group> (<year>2012</year>). <article-title>Disordered binding regions and linear motifs-bridging the gap between two models of molecular recognition.</article-title> <source><italic>PLoS ONE</italic></source> <volume>7</volume>:<issue>e46829</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0046829</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mohan</surname> <given-names>A.</given-names></name> <name><surname>Oldfield</surname> <given-names>C. J.</given-names></name> <name><surname>Radivojac</surname> <given-names>P.</given-names></name> <name><surname>Vacic</surname> <given-names>V.</given-names></name> <name><surname>Cortese</surname> <given-names>M. S.</given-names></name> <name><surname>Dunker</surname> <given-names>A. K.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Analysis of molecular recognition features (MoRFs).</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>362</volume> <fpage>1043</fpage>&#x2013;<lpage>1059</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2006.07.087</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ngandu</surname> <given-names>N. K.</given-names></name> <name><surname>Scheffler</surname> <given-names>K.</given-names></name> <name><surname>Moore</surname> <given-names>P.</given-names></name> <name><surname>Woodman</surname> <given-names>Z.</given-names></name> <name><surname>Martin</surname> <given-names>D.</given-names></name> <name><surname>Seoighe</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>Extensive purifying selection acting on synonymous sites in HIV-1 Group M sequences.</article-title> <source><italic>Virol. J.</italic></source> <volume>5</volume>:<issue>160</issue>. <pub-id pub-id-type="doi">10.1186/1743-422X-5-160</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pavesi</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Origin and evolution of overlapping genes in the family Microviridae.</article-title> <source><italic>J. Gen. Virol.</italic></source> <volume>87</volume> <fpage>1013</fpage>&#x2013;<lpage>1017</lpage>. <pub-id pub-id-type="doi">10.1099/vir.0.81375-0</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pushker</surname> <given-names>R.</given-names></name> <name><surname>Mooney</surname> <given-names>C.</given-names></name> <name><surname>Davey</surname> <given-names>N. E.</given-names></name> <name><surname>Jacque</surname> <given-names>J. M.</given-names></name> <name><surname>Shields</surname> <given-names>D. C.</given-names></name></person-group> (<year>2013</year>). <article-title>Marked variability in the extent of protein disorder within and between viral families.</article-title> <source><italic>PLoS ONE</italic></source> <volume>8</volume>:<issue>e60724</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0060724</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rachita</surname> <given-names>H. R.</given-names></name> <name><surname>Nagarajaram</surname> <given-names>H. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Viral proteins that bridge unconnected proteins and components in the human PPI network.</article-title> <source><italic>Mol. Biosyst.</italic></source> <volume>10</volume> <fpage>2448</fpage>&#x2013;<lpage>2458</lpage>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rancurel</surname> <given-names>C.</given-names></name> <name><surname>Khosravi</surname> <given-names>M.</given-names></name> <name><surname>Dunker</surname> <given-names>A. K.</given-names></name> <name><surname>Romero</surname> <given-names>P. R.</given-names></name> <name><surname>Karlin</surname> <given-names>D.</given-names></name></person-group> (<year>2009</year>). <article-title>Overlapping genes produce proteins with unusual sequence properties and offer insight into de novo protein creation.</article-title> <source><italic>J. Virol.</italic></source> <volume>83</volume> <fpage>10719</fpage>&#x2013;<lpage>10736</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.00595-09</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rappoport</surname> <given-names>N.</given-names></name> <name><surname>Linial</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Viral proteins acquired from a host converge to simplified domain architectures.</article-title> <source><italic>PloS Comput. Biol.</italic></source> <volume>8</volume>:<issue>e1002364</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002364</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabath</surname> <given-names>N.</given-names></name> <name><surname>Ferrada</surname> <given-names>E.</given-names></name> <name><surname>Barve</surname> <given-names>A.</given-names></name> <name><surname>Wagner</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Growth temperature and genome size in bacteria are negatively correlated, suggesting genomic streamlining during thermal adaptation.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>5</volume> <fpage>966</fpage>&#x2013;<lpage>977</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evt050</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabath</surname> <given-names>N.</given-names></name> <name><surname>Graur</surname> <given-names>D.</given-names></name> <name><surname>Landan</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Same-strand overlapping genes in bacteria: compositional determinants of phase bias.</article-title> <source><italic>Biol. Direct.</italic></source> <volume>3</volume>:<issue>36</issue>. <pub-id pub-id-type="doi">10.1186/1745-6150-3-36</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saha</surname> <given-names>D.</given-names></name> <name><surname>Panda</surname> <given-names>A.</given-names></name> <name><surname>Podder</surname> <given-names>S.</given-names></name> <name><surname>Ghosh</surname> <given-names>T. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Overlapping genes: a new strategy of thermophilic stress tolerance in prokaryotes.</article-title> <source><italic>Extremophiles</italic></source> <volume>19</volume> <fpage>345</fpage>&#x2013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1007/s00792-014-0720-3</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saha</surname> <given-names>D.</given-names></name> <name><surname>Podder</surname> <given-names>S.</given-names></name> <name><surname>Panda</surname> <given-names>A.</given-names></name> <name><surname>Ghosh</surname> <given-names>T. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Overlapping genes: a significant genomic correlate of prokaryotic growth rates.</article-title> <source><italic>Gene</italic></source> <volume>582</volume> <fpage>143</fpage>&#x2013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1016/j.gene.2016.02.002</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sakharkar</surname> <given-names>K. R.</given-names></name> <name><surname>Sakharkar</surname> <given-names>M. K.</given-names></name> <name><surname>Verma</surname> <given-names>C.</given-names></name> <name><surname>Chow</surname> <given-names>V. T. K.</given-names></name></person-group> (<year>2005</year>). <article-title>Comparative study of overlapping genes in bacteria, with special reference to <italic>Rickettsia prowazekii</italic> and <italic>Rickettsia conorii</italic>.</article-title> <source><italic>Int. J. Syst. Evol. Microbiol.</italic></source> <volume>55(Pt 3)</volume>, <fpage>1205</fpage>&#x2013;<lpage>1209</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.63446-0</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanna</surname> <given-names>C. R.</given-names></name> <name><surname>Li</surname> <given-names>W.-H.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name></person-group> (<year>2008</year>). <article-title>Overlapping genes in the human and mouse genomes.</article-title> <source><italic>BMC Genomics</italic></source> <volume>9</volume>:<issue>169</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-9-169</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartz</surname> <given-names>D.</given-names></name> <name><surname>Church</surname> <given-names>G. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Collection and motif-based prediction of phosphorylation sites in human viruses.</article-title> <source><italic>Sci. Signal.</italic></source> <volume>3</volume>:<issue>rs2</issue>. <pub-id pub-id-type="doi">10.1126/scisignal.2001099</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname> <given-names>P. M.</given-names></name> <name><surname>Hahn</surname> <given-names>B. H.</given-names></name></person-group> (<year>2011</year>). <article-title>Origins of HIV and the AIDS Pandemic.</article-title> <source><italic>Cold Spring Harb. Perspect. Med.</italic></source> <volume>1</volume>:<issue>a006841</issue>. <pub-id pub-id-type="doi">10.1101/cshperspect.a006841</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simon-Loriere</surname> <given-names>E.</given-names></name> <name><surname>Holmes</surname> <given-names>E. C.</given-names></name> <name><surname>Pagan</surname> <given-names>I.</given-names></name></person-group> (<year>2013</year>). <article-title>The effect of gene overlapping on the rate of RNA virus evolution.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>30</volume> <fpage>1916</fpage>&#x2013;<lpage>1928</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst094</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uversky</surname> <given-names>V. N.</given-names></name></person-group> (<year>2013</year>). <article-title>Intrinsic Disorder-based Protein Interactions and their Modulators.</article-title> <source><italic>Curr. Pharm. Des.</italic></source> <volume>19</volume> <fpage>4191</fpage>&#x2013;<lpage>4213</lpage>. <pub-id pub-id-type="doi">10.2174/1381612811319230005</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Veeramachaneni</surname> <given-names>V.</given-names></name> <name><surname>Makalowski</surname> <given-names>W.</given-names></name> <name><surname>Galdzicki</surname> <given-names>M.</given-names></name> <name><surname>Sood</surname> <given-names>R.</given-names></name> <name><surname>Makalowska</surname> <given-names>I.</given-names></name></person-group> (<year>2004</year>). <article-title>Mammalian overlapping genes: the comparative perspective.</article-title> <source><italic>Genome Res.</italic></source> <volume>14</volume> <fpage>280</fpage>&#x2013;<lpage>286</lpage>. <pub-id pub-id-type="doi">10.1101/gr.1590904</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Mcgirr</surname> <given-names>K. M.</given-names></name> <name><surname>Buehring</surname> <given-names>G. C.</given-names></name></person-group> (<year>2007</year>). <article-title>Potential evolutionary influences on overlapping reading frames in the bovine leukemia virus pXBL region.</article-title> <source><italic>Genomics</italic></source> <volume>89</volume> <fpage>502</fpage>&#x2013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2006.12.007</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01"><label>1</label><p><ext-link ext-link-type="uri" xlink:href="http://elm.eu.org">http://elm.eu.org</ext-link></p></fn>
</fn-group>
</back>
</article>