<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="review-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">643752</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2021.643752</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Mini Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Recent Advances in Protein Homology Detection Propelled by Inter-Residue Interaction Map Threading</article-title>
<alt-title alt-title-type="left-running-head">Bhattacharya et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Protein Interaction Map Threading</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Bhattacharya</surname>
<given-names>Sutanu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1231257/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Roche</surname>
<given-names>Rahmatullah</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1327387/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shuvo</surname>
<given-names>Md Hossain</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1231274/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Bhattacharya</surname>
<given-names>Debswapna</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1024811/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Department of Computer Science and Software Engineering, Auburn University, <addr-line>Auburn</addr-line>, <addr-line>AL</addr-line>, <country>United&#x20;States</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>Department of Biological Sciences, Auburn University, <addr-line>Auburn</addr-line>, <addr-line>AL</addr-line>, <country>United&#x20;States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/676789/overview">Paolo Marcatili</ext-link>, Technical University of Denmark, Denmark</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/686837/overview">Dimitrios P. Vlachakis</ext-link>, Agricultural University of Athens, Greece</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/778347/overview">Kresten Lindorff-Larsen</ext-link>, University of Copenhagen, Denmark</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Debswapna Bhattacharya, <email>bhattacharyad@auburn.edu</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Structural Biology, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>11</day>
<month>05</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>8</volume>
<elocation-id>643752</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>12</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>04</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Bhattacharya, Roche, Shuvo and Bhattacharya.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Bhattacharya, Roche, Shuvo and Bhattacharya</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Sequence-based protein homology detection has emerged as one of the most sensitive and accurate approaches to protein structure prediction. Despite the success, homology detection remains very challenging for weakly homologous proteins with divergent evolutionary profile. Very recently, deep neural network architectures have shown promising progress in mining the coevolutionary signal encoded in multiple sequence alignments, leading to reasonably accurate estimation of inter-residue interaction maps, which serve as a rich source of additional information for improved homology detection. Here, we summarize the latest developments in protein homology detection driven by inter-residue interaction map threading. We highlight the emerging trends in distant-homology protein threading through the alignment of predicted interaction maps at various granularities ranging from binary contact maps to finer-grained distance and orientation maps as well as their combination. We also discuss some of the current limitations and possible future avenues to further enhance the sensitivity of protein homology detection.</p>
</abstract>
<kwd-group>
<kwd>protein homology</kwd>
<kwd>inter-residue interaction map</kwd>
<kwd>protein threading</kwd>
<kwd>homology modeling</kwd>
<kwd>protein structure prediction</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>The development of computational approaches for accurately predicting the protein three-dimensional (3D) structure directly from the sequence information is of central importance in structural biology (<xref ref-type="bibr" rid="B42">Jones et&#x20;al., 1992</xref>; <xref ref-type="bibr" rid="B8">Baker and Sali, 2001</xref>; <xref ref-type="bibr" rid="B28">Dill and MacCallum, 2012</xref>). While <italic>ab initio</italic> modeling aims to predict the 3D structure purely from the sequence information (<xref ref-type="bibr" rid="B55">Marks et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B4">Adhikari et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B86">Wang et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B3">Adhikari and Cheng, 2018</xref>; <xref ref-type="bibr" rid="B34">Greener et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B71">Senior et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B93">Xu, 2019</xref>; <xref ref-type="bibr" rid="B97">Yang et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B69">Roche et&#x20;al., 2021</xref>), many protein targets have evolutionary-related (homologous) structures, also known as homologous templates, already available in the Protein Data Bank (PDB) (<xref ref-type="bibr" rid="B11">Berman et&#x20;al., 2000</xref>). Correctly identifying these templates given the sequence of a query protein and building 3D models by performing query&#x2013;template alignment, a technique broadly known as homology modeling (<xref ref-type="bibr" rid="B7">Altschul et&#x20;al., 1997</xref>; <xref ref-type="bibr" rid="B94">Xu et&#x20;al., 2003</xref>; <xref ref-type="bibr" rid="B90">Wu and Zhang, 2008</xref>; <xref ref-type="bibr" rid="B50">Lobley et&#x20;al., 2009</xref>; <xref ref-type="bibr" rid="B91">Wu and Zhang, 2010</xref>; <xref ref-type="bibr" rid="B43">K&#xe4;llberg et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B51">Ma et&#x20;al., 2014</xref>) often results in highly accurate predicted structural models (<xref ref-type="bibr" rid="B1">Abeln et&#x20;al., 2017</xref>). As such, the success of homology modeling critically depends on the ability to identify the closely homologous template on the basis of sequence similarity and generate accurate query&#x2013;template alignment. Intuitively, the performance of these methods sharply deteriorates when the direct evolutionary relationship between the query and templates becomes very low, typically when the sequence similarity falls below 30%, the so-called distant-homology modeling scenarios (<xref ref-type="bibr" rid="B21">Bowie et&#x20;al., 1991</xref>; <xref ref-type="bibr" rid="B67">Petrey and Honig, 2005</xref>). Protein threading, the most widely used distant-homology modeling technique, aims to address the challenge by leveraging multiple sources of information by mining the evolutionary profile of the query and templates to reveal potential distant homology and perform distant-homology modeling to predict the 3D structure of the query protein.</p>
<p>Existing threading methods exploit a wide range of techniques ranging from dynamic programming to profile-based comparison to machine learning (<xref ref-type="bibr" rid="B40">Jones, 1999</xref>; <xref ref-type="bibr" rid="B70">Rychlewski et&#x20;al., 2000</xref>; <xref ref-type="bibr" rid="B96">Xu and Xu, 2000</xref>; <xref ref-type="bibr" rid="B74">Skolnick and Kihara, 2001</xref>; <xref ref-type="bibr" rid="B32">Ginalski et&#x20;al., 2003</xref>; <xref ref-type="bibr" rid="B56">Marti et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B39">Jaroszewski et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B76">S&#xf6;ding, 2005</xref>; <xref ref-type="bibr" rid="B103">Zhou and Zhou, 2005</xref>; <xref ref-type="bibr" rid="B26">Cheng and Baldi, 2006</xref>; <xref ref-type="bibr" rid="B64">Peng and Xu, 2009</xref>; <xref ref-type="bibr" rid="B47">Lee and Skolnick, 2010</xref>; <xref ref-type="bibr" rid="B65">Peng and Xu, 2010</xref>; <xref ref-type="bibr" rid="B98">Yang et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B52">Ma et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B53">Ma et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B33">Gniewek et&#x20;al., 2014</xref>). The recent advancement in predicting the inter-residue interaction maps using sequence coevolution and deep learning (<xref ref-type="bibr" rid="B61">Morcos et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B37">He et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B85">Wang et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B5">Adhikari et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B36">Hanson et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B45">Kandathil et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B97">Yang et&#x20;al., 2020</xref>) has opened new possibilities to further improve the sensitivity of distant-homology protein threading by incorporating the predicted inter-residue interaction information. Fueled by this, several efforts have been made in the recent past to integrate interaction maps into threading. For instance, EigenTHREADER (<xref ref-type="bibr" rid="B23">Buchan and Jones, 2017</xref>), map_align (<xref ref-type="bibr" rid="B62">Ovchinnikov et&#x20;al., 2017</xref>), CEthreader (<xref ref-type="bibr" rid="B101">Zheng et&#x20;al., 2019a</xref>), CATHER (<xref ref-type="bibr" rid="B30">Du et&#x20;al., 2020</xref>), and ThreaderAI (<xref ref-type="bibr" rid="B100">Zhang and Shen, 2020</xref>) have utilized predicted contact maps in protein threading. DeepThreader (<xref ref-type="bibr" rid="B104">Zhu et&#x20;al., 2018</xref>) has exploited finer-grained distance maps for query proteins instead of using binary contacts to improve threading template selection and alignment. DisCovER (<xref ref-type="bibr" rid="B19">Bhattacharya et&#x20;al., 2020</xref>) goes one step further by incorporating inter-residue orientation along with distance information together with topological network neighborhood (<xref ref-type="bibr" rid="B24">Chen et&#x20;al., 2019</xref>) of query&#x2013;template alignment to further improve threading performance. Here, we provide an overview of the latest advances in protein homology detection propelled by inter-residue interaction map threading.</p>
</sec>
<sec id="s2">
<title>Granularities of Protein Inter-Residue Interaction Maps</title>
<p>Protein inter-residue interaction maps are predicted at various resolutions ranging from binary contact maps to finer-grained distance and orientation maps as well as their combination. A low-resolution version of inter-residue interaction is a contact map, which is a square, symmetric matrix with binary entries, where a contact indicates the spatial proximity of a residue pair at a given cutoff distance, typically set to 8&#xc5; between the C<sub>&#x3b1;</sub> or C<sub>&#x3b2;</sub> carbons of the interacting residue pairs. Inter-residue distance map is finer-grained in that it captures the distribution of real-valued inter-residue spatial proximity information rather than the binary contacts at a fixed cutoff distance. Recent studies (<xref ref-type="bibr" rid="B95">Xu and Wang, 2019</xref>; <xref ref-type="bibr" rid="B93">Xu, 2019</xref>) have demonstrated the advantage of using distance maps in protein structure prediction over binary contacts as distances carry more physical constraint information of protein structures than contacts. The granularities of predicted distance maps vary from distance histograms to real-valued distances (<xref ref-type="bibr" rid="B34">Greener et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B2">Adhikari, 2020</xref>; <xref ref-type="bibr" rid="B29">Ding and Gong, 2020</xref>; <xref ref-type="bibr" rid="B48">Li and Xu, 2020</xref>; <xref ref-type="bibr" rid="B92">Wu et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B97">Yang et&#x20;al., 2020</xref>). Very recently, trRosetta (<xref ref-type="bibr" rid="B97">Yang et&#x20;al., 2020</xref>) has introduced inter-residue orientations in addition to distances to capture not only the spatial proximity information of the interacting pairs but also their relative angles and dihedrals. Collectively, inter-residue distances and orientations encapsulate the spatial positioning of the interacting pairs much better than only distances let&#x20;alone binary contacts.</p>
</sec>
<sec id="s3">
<title>Inter-Residue Interaction Map Threading</title>
<p>
<xref ref-type="fig" rid="F1">Figure&#x20;1</xref> shows an overview of an interaction map threading of a query protein. Generally, threading has four components: (1) an effective scoring function to evaluate the fitness of query&#x2013;template alignment; (2) efficient template searching or homology detection strategy; (3) optimal query&#x2013;template alignments; and (4) building 3D models of query proteins based on alignments. One of the most important components of threading approaches is the scoring function, which is composed of standard threading features ranging from sequential features such as secondary structures, solvent accessibility, and sequence profiles to nonlinear features such as pairwise potentials (<xref ref-type="bibr" rid="B20">Bienkowska and Lathrop, 2005</xref>; <xref ref-type="bibr" rid="B22">Brylinski and Skolnick, 2010</xref>). Weights control the relative importance of different terms. An efficient scoring function should reliably differentiate a homologous template from the alternatives because the accuracy of the predicted model significantly depends on the evolutionary relatedness of the identified template. The inter-residue interaction map helps to improve the sensitivity of the threading scoring function by augmenting the standard scoring terms with additional contributions from the predicted interactions. Specifically, the score to align the <inline-formula id="inf1">
<mml:math id="m1">
<mml:mi>i</mml:mi>
</mml:math>
</inline-formula> th residue of the query protein to the <inline-formula id="inf2">
<mml:math id="m2">
<mml:mi>j</mml:mi>
</mml:math>
</inline-formula> th residue of the template can be defined as:<disp-formula id="equ1">
<mml:math id="m3">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:msubsup>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:munder>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>s</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>d</mml:mi>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>g</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>f</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:munder>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi>E</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>where the first term accounts for the contribution of the interaction map and the second term accounts for the standard threading features with <inline-formula id="inf3">
<mml:math id="m4">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> being their relative weights. Typically, the similarity between the predicted inter-residue interaction map of the query protein and that derived from the template structure informs the interaction map term in the threading scoring function. It is worth noting here that the raw alignment score is biased to protein length (<xref ref-type="bibr" rid="B94">Xu et&#x20;al., 2003</xref>). As such, most threading methods use a normalized alignment score in standard deviation units relative to the mean score of all templates in the template library for homology detection&#x2014;detecting best-fit templates from the&#x20;PDB.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Illustration of protein interaction map threading.</p>
</caption>
<graphic xlink:href="fmolb-08-643752-g001.tif"/>
</fig>
</sec>
<sec id="s4">
<title>Emerging Trends in Protein Homology Detection by Interaction Map Threading</title>
<p>With the recent advancement in contact prediction mediated by sequence coevolution and deep learning, significant research efforts have been made in the recent past to incorporate contact information as an additional scoring term into the threading scoring function for protein homology detection. For instance, Jones and coworkers developed EigenTHREADER (<xref ref-type="bibr" rid="B23">Buchan and Jones, 2017</xref>) that uses eigen-decomposition (<xref ref-type="bibr" rid="B27">Di Lena et&#x20;al., 2010</xref>) of contact maps predicted using classical neural network&#x2013;based predictor MetaPSICOV (<xref ref-type="bibr" rid="B41">Jones et&#x20;al., 2015</xref>) to search a library of template contact maps for contact map threading. Baker and coworkers developed map_align (<xref ref-type="bibr" rid="B62">Ovchinnikov et&#x20;al., 2017</xref>) that employs an iterative double dynamic programming framework (<xref ref-type="bibr" rid="B80">Taylor, 1999</xref>) for homology detection. map_align takes advantage of metagenomics sequence databases of microbial DNA (<xref ref-type="bibr" rid="B75">S&#xf6;ding, 2017</xref>) and uses contact maps predicted by coevolutionary contact predictor GREMLIN (<xref ref-type="bibr" rid="B9">Balakrishnan et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B44">Kamisetty et&#x20;al., 2013</xref>) to perform contact map threading by maximizing the number of overlapping contacts and minimizing the number of gaps. Recently, Zhang and coworkers developed CEthreader (<xref ref-type="bibr" rid="B101">Zheng et&#x20;al., 2019a</xref>) using contact maps predicted by deep learning&#x2013;based contact map predictor ResPRE (<xref ref-type="bibr" rid="B49">Li et&#x20;al., 2019</xref>). CEthreader also relies on eigen-decomposition and performs contact map threading through dynamic programming using a dot-product scoring function by integrating contacts as well as secondary structures and sequence profiles. Alongside, we developed a contact-assisted threading method (<xref ref-type="bibr" rid="B17">Bhattacharya and Bhattacharya, 2019</xref>) that incorporates contact information, predicted by deep learning&#x2013;based predictor RaptorX (<xref ref-type="bibr" rid="B85">Wang et&#x20;al., 2017</xref>), into threading using a two-stage approach. After selecting a subset of top templates from the template library using a standard profile-based threading technique in the first stage, our method subsequently uses eigen-decomposition of the contact information along with the profile-based alignment score to select the best-fit template. We further analyze the impact of contact map quality on threading performance (<xref ref-type="bibr" rid="B18">Bhattacharya and Bhattacharya, 2020</xref>), which reveals that incorporating high-quality contact maps having the Matthews correlation coefficient (MCC) &#x2265; 0.5 improves the threading performance for &#x223c; 30% cases in comparison to a baseline contact-free threading used as a control, while incorporating low-quality contacts with MCC &#x3c;0.35 deteriorates the performance for 50% cases. Yang and coworkers developed CATHER (<xref ref-type="bibr" rid="B30">Du et&#x20;al., 2020</xref>) by incorporating contact maps predicted by deep learning&#x2013;based predictor MapPred (<xref ref-type="bibr" rid="B88">Wu et&#x20;al., 2020</xref>) along with standard sequential information in the threading scoring function. Very recently, Shen and coworkers have developed ThreaderAI (<xref ref-type="bibr" rid="B100">Zhang and Shen, 2020</xref>) that implements a neural network for predicting alignments by incorporating deep learning&#x2013;based contact information with conventional sequential and structural features into the scoring function.</p>
<p>Building on the successes of contact-assisted threading methods, Xu and coworkers developed a distance-based threading method called DeepThreader (<xref ref-type="bibr" rid="B104">Zhu et&#x20;al., 2018</xref>). The method predicts distance maps by employing deep learning and then incorporates the predicted inter-residue distance information along with sequential features into threading through alternating direction method of multipliers (ADMM) algorithm. The inter-residue distance is binned into 12 bins: &#x3c;5&#xc5;, 5&#x2013;6&#xc5;, .., 14&#x2013;15&#xc5;, and &#x3e;15&#xc5;. Based on their reported results as well as the performance evaluation in the 13th Critical Assessment of protein Structure Prediction (CASP13), incorporating distance information boosts threading performance, particularly for distant-homology targets, outperforming contact-assisted threading methods by a large margin (<xref ref-type="bibr" rid="B95">Xu and Wang, 2019</xref>, 13). Zhang and coworkers have recently extended CEthreader to develop a distance-assisted threading method DEthreader introduced during the recently concluded CASP14 experiment by incorporating a distance-based scoring term into the scoring function. The method uses the C<sub>&#x3b1;</sub>&#x2013;C<sub>&#x3b1;</sub> and C<sub>&#x3b2;</sub>&#x2013;C<sub>&#x3b2;</sub> distance distribution, both are binned into 38 bins: 1 bin of &#x3c;2&#xc5;, 36 bins of 2&#x2013;20&#xc5; with a width of 0.5&#xc5;, and 1 bin of &#x2265;20&#xc5;. Similarly, Yang and coworkers have extended CATHER into a distance-based threading approach by replacing contacts with distances in CASP14.</p>
<p>Powered by the development of the recent deep learning&#x2013;based trRosetta method (<xref ref-type="bibr" rid="B97">Yang et&#x20;al., 2020</xref>) for the prediction of inter-residue orientations and distances, our recent method DisCovER (<xref ref-type="bibr" rid="B19">Bhattacharya et&#x20;al., 2020</xref>) goes one step further by incorporating predicted inter-residue orientations in addition to distances together with the neighborhood effect of the query&#x2013;template alignment using an iterative double dynamic programming framework. The predicted distances are binned into 9 bins with a bin size of 1&#xc5;: &#x3c;6&#xc5; to &#x3c;14&#xc5; by summing up the likelihoods for distance bins below a distance threshold. The two orientation dihedrals (&#x3c9;, &#x3b8;) are binned into 24 bins with a width&#x20;of 15&#xb0;, and the orientation angle (&#x3d5;) is binned into 12 bins with a width of 15&#xb0;. Experimental results demonstrate the improved threading performance of DisCovER over the other state-of-the-art threading approaches on multiple benchmark datasets across various target categories, especially for distantly homologous proteins. Representative examples on CAMEO targets 6D2S_A and 6CP8_D provide some insights into the origin of the improved performance. <xref ref-type="fig" rid="F2">Figure&#x20;2</xref> shows our recent method DisCovER predicts correct folds (TM-score &#x3e; 0.5) for both the targets 6D2S_A and 6CP8_D with a TM-score of 0.76 and 0.69, respectively, significantly better than the others. While the pure profile-based threading method CNFpred (<xref ref-type="bibr" rid="B52">Ma et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B53">Ma et&#x20;al., 2013</xref>) and the recent contact-assisted threading method CEthreader fail to predict the correct fold for the target 6D2S_A, DisCovER and the CAMEO server RaptorX (<xref ref-type="bibr" rid="B43">K&#xe4;llberg et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B104">Zhu et&#x20;al., 2018</xref>), employing the distance-based threading method DeepThreader (<xref ref-type="bibr" rid="B35">Haas et&#x20;al., 2019</xref>), effectively predict the correct fold, with noticeably better performance by DisCovER (an improvement of 0.2&#x20;TM-score points) than the next best RaptorX. We also notice the superior performance of DisCovER for the target 6CP8_D where DisCovER significantly outperforms the other competing methods including the next best CEthreader by 0.18&#x20;TM-score points. It is worth mentioning both the targets are officially classified as &#x201c;hard&#x201d; by CAMEO (<xref ref-type="bibr" rid="B35">Haas et&#x20;al., 2019</xref>), which warrants a distantly homologous nature in which current threading methods have limitations. Overall, the results show that the integration of the orientation information and the neighborhood effect in DisCovER results in improved threading, attaining state-of-the-art performance in (distant) homology detection.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Structural superposition between predicted models using various threading methods (in violet) and the corresponding experimental structures (in gray) for representative CAMEO targets 6D2S_A of length 289 residues and 6CP8_D of length 164 residues.</p>
</caption>
<graphic xlink:href="fmolb-08-643752-g002.tif"/>
</fig>
</sec>
<sec id="s5">
<title>The Role of Sequence Databases in Interaction Map Threading</title>
<p>The prediction of inter-residue interaction maps depends heavily on the availability of homologous sequences. As such, the role of the sequence databases is becoming increasingly important in protein homology detection via interaction map threading. In addition to the well-established whole-genome sequence databases such as the nr database from the National Center for Biotechnology Information (NCBI), UniRef (<xref ref-type="bibr" rid="B79">Suzek et&#x20;al., 2015</xref>), UniProt (<xref ref-type="bibr" rid="B81">The UniProt Consortium, 2019</xref>), and Uniclust (<xref ref-type="bibr" rid="B58">Mirdita et&#x20;al., 2017</xref>); emerging metagenome sequence databases from the European Bioinformatics Institute (EBI) Metagenomics (<xref ref-type="bibr" rid="B54">Markowitz et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B60">Mitchell et&#x20;al., 2018</xref>) and Metaclust (<xref ref-type="bibr" rid="B77">Steinegger and S&#xf6;ding, 2018</xref>) are playing a prominent role. For example, <xref ref-type="bibr" rid="B87">Wang et&#x20;al. (2019</xref>) have demonstrated the applications of marine metagenomics for improved protein structure prediction. map_align uses the Integrated Microbial Genomes (IMG) database (<xref ref-type="bibr" rid="B54">Markowitz et&#x20;al., 2014</xref>), containing around 4 million unique protein sequences, to reliably predict high-quality models for distant-homology Pfam families of unknown structures. Another recent method for generating protein multiple sequence alignments, DeepMSA (<xref ref-type="bibr" rid="B99">Zhang et&#x20;al., 2020</xref>), combines whole-genome and metagenome sequence databases and reports improved threading performance, particularly for distant-homology proteins. Newer sequence databases are getting larger and diverse. For example, BFD (<xref ref-type="bibr" rid="B78">Steinegger et&#x20;al., 2019</xref>), a recent sequence database, is one of the largest sequence databases containing 2 billion protein sequences from soil samples and 292 million sequences of marine samples. Another very recent sequence database MGnify (<xref ref-type="bibr" rid="B59">Mitchell et&#x20;al., 2020</xref>) contains around 1 billion nonredundant protein sequences. As such, the availability of evolutionary information of distant-homology proteins is getting enriched, likely leading to improved prediction accuracy of inter-residue interaction maps and hence more accurate interaction map threading for distant-homology protein modeling.</p>
</sec>
<sec sec-type="discussion" id="s6">
<title>Discussion</title>
<p>While the use of interaction maps is the main driving force behind the improved threading performance, the optimal granularity and information content of the predicted interaction maps remain elusive. Existing works consider various distance bins (<xref ref-type="bibr" rid="B104">Zhu et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B19">Bhattacharya et&#x20;al., 2020</xref>) and subsets of predicted interactions either based on top predicted pairs sorted based on their likelihood values or using arbitrary likelihood cutoffs (<xref ref-type="bibr" rid="B17">Bhattacharya and Bhattacharya, 2019</xref>; <xref ref-type="bibr" rid="B101">Zheng et&#x20;al., 2019a</xref>). A robust mechanism for defining and selecting interacting residue pairs can be beneficial to existing threading methods. Another challenge is how to integrate heterogeneous sources of available information from multiple interaction map predictors and/or sequence databases in a singular framework for unified interaction map threading. Finally, the use of multiple templates (<xref ref-type="bibr" rid="B25">Cheng, 2008</xref>; <xref ref-type="bibr" rid="B66">Peng and Xu, 2011</xref>; <xref ref-type="bibr" rid="B57">Meier and S&#xf6;ding, 2015</xref>) and meta-approaches (<xref ref-type="bibr" rid="B89">Wu and Zhang, 2007</xref>; <xref ref-type="bibr" rid="B102">Zheng et&#x20;al., 2019b</xref>) possibly coupled with model quality assessment methods (<xref ref-type="bibr" rid="B68">Ray et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B83">Uziela et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B82">Uziela et&#x20;al., 2017</xref>; 3; <xref ref-type="bibr" rid="B6">Alapati and Bhattacharya, 2018</xref>; <xref ref-type="bibr" rid="B46">Karasikov et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B10">Baldassarre et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B31">Eismann et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B73">Shuvo et&#x20;al., 2020</xref>) and potentially aided by structure refinement (<xref ref-type="bibr" rid="B12">Bhattacharya and Cheng, 2013a</xref>; <xref ref-type="bibr" rid="B13">Bhattacharya and Cheng, 2013b</xref>; <xref ref-type="bibr" rid="B14">Bhattacharya and Cheng, 2013c</xref>; <xref ref-type="bibr" rid="B15">Bhattacharya et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B16">Bhattacharya, 2019</xref>; <xref ref-type="bibr" rid="B84">Wang et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B38">Heo and Feig, 2020</xref>) can collectively improve the accuracy of distant-homology protein modeling.</p>
<p>Recent CASP experiments have witnessed dramatic recent advances by DeepMind&#x2019;s AlphaFold series (<xref ref-type="bibr" rid="B71">Senior et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B72">Senior et&#x20;al., 2020</xref>) in <italic>ab initio</italic> protein structure prediction, significantly outperforming the other groups. The success of AlphaFold series is primarily attributed to the successful application of deep neural networks for accurately predicting inter-residue spatial proximity information coupled with end-to-end training, significantly improving the accuracy of protein structure prediction (<xref ref-type="bibr" rid="B63">Pearce and Zhang, 2021</xref>). The integration of deep learning into various stages of protein modeling represents an exciting future direction that shall have a transformative impact on distant-homology protein modeling via interaction map threading, complementing and supplementing <italic>ab initio</italic> protein structure prediction methods developed by DeepMind.</p>
</sec>
</body>
<back>
<sec id="s7">
<title>Author Contributions</title>
<p>All authors contributed in writing and revising the manuscript under the supervision of&#x20;DB.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>This work was partially supported by the National Science Foundation CAREER Award DBI-1942692 to DB, the National Science Foundation grant IIS-2030722 to DB, and the National Institute of General Medical Sciences Maximizing Investigators&#x27; Research Award (MIRA) R35GM138146 to&#x20;DB.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<ack>
<p>This work was made possible in part by Auburn University Early Career Development grant to&#x20;DB.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Abeln</surname>
<given-names>S.</given-names>
</name>
</person-group>, <person-group person-group-type="author">
<name>
<surname>Heringa</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Anton Feenstra</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <source>Introduction to protein structure prediction. arXiv [arXiv:1712.00407]</source>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1712.00407v1">https://arxiv.org/abs/1712.00407v1</ext-link>
</comment>.</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adhikari</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A Fully Open-Source Framework for Deep Learning Protein Real-Valued Distances</article-title>. <source>Scientific Rep.</source> <volume>10</volume> (<issue>1</issue>), <fpage>13374</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-70181-0</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adhikari</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>CONFOLD2: Improved Contact-Driven Ab Initio Protein Structure Modeling</article-title>. <source>BMC Bioinformatics</source> <volume>19</volume> (<issue>1</issue>), <fpage>22</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-018-2032-6</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adhikari</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>CONFOLD: Residue-Residue Contact-Guidedab Initioprotein Folding</article-title>. <source>Proteins</source> <volume>83</volume> (<issue>8</issue>), <fpage>1436</fpage>&#x2013;<lpage>1449</lpage>. <pub-id pub-id-type="doi">10.1002/prot.24829</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adhikari</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>DNCON2: Improved Protein Contact Prediction Using Two-Level Deep Convolutional Neural Networks</article-title>. <source>Bioinformatics</source> <volume>34</volume> (<issue>9</issue>), <fpage>1466</fpage>&#x2013;<lpage>1472</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx781</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Alapati</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>ClustQ: Efficient Protein Decoy Clustering Using Superposition-free Weighted Internal Distance Comparisons</article-title>,&#x201d; In <conf-name>Proceedings Of the 2018 ACM International Conference On Bioinformatics, Computational Biology, and Health Informatics</conf-name>. <publisher-loc>New York, NY, USA</publisher-loc>: <publisher-name>Association for Computing Machinery</publisher-name>. <comment>BCB &#x2019;18</comment>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/3233547.3233570">10.1145/3233547.3233570</ext-link> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>T. L.</given-names>
</name>
<name>
<surname>Sch&#xe4;ffer</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Lipman</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs</article-title>. <source>Nucleic Acids Res.</source> <volume>25</volume> (<issue>17</issue>), <fpage>3389</fpage>&#x2013;<lpage>3402</lpage>. <pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sali</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Protein Structure Prediction and Structural Genomics</article-title>. <source>Science</source> <volume>294</volume> (<issue>5540</issue>), <fpage>93</fpage>&#x2013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1126/science.1065659</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Balakrishnan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kamisetty</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Carbonell</surname>
<given-names>J.&#x20;G.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.-I.</given-names>
</name>
<name>
<surname>Langmead</surname>
<given-names>C. J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Carbonell, Su-In Lee, and Christopher James LangmeadLearning Generative Models for Protein Fold Families</article-title>. <source>Proteins</source> <volume>79</volume> (<issue>4</issue>), <fpage>1061</fpage>&#x2013;<lpage>1078</lpage>. <pub-id pub-id-type="doi">10.1002/prot.22934</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baldassarre</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Hurtado</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Elofsson</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Azizpour</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>GraphQA: Protein Model Quality Assessment Using Graph Convolutional Networks</article-title>. <source>Bioinformatics</source>, <volume>37</volume> (<issue>3</issue>), <fpage>360</fpage>-<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa714</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berman</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Westbrook</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Gilliland</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Bhat</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Weissig</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2000</year>). <article-title>The Protein Data Bank</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume> (<issue>1</issue>), <fpage>235</fpage>&#x2013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1093/nar/28.1.235</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2013a</year>). <article-title>3Drefine: Consistent Protein Structure Refinement by Optimizing Hydrogen Bonding Network and Atomic-Level Energy Minimization</article-title>. <source>Proteins</source> <volume>81</volume> (<issue>1</issue>), <fpage>119</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1002/prot.24167</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2013b</year>). <article-title>I3Drefine Software for Protein 3D Structure Refinement and its Assessment in CASP10</article-title>. <source>PLOS ONE</source> <volume>8</volume> (<issue>7</issue>), <fpage>e69648</fpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0069648">10.1371/journal.pone.0069648</ext-link> </citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2013c</year>). &#x201c;<article-title>Protein Structure Refinement by Iterative Fragment Exchange</article-title>,&#x201d; in <conf-name>Proceedings Of the International Conference On Bioinformatics, Computational Biology And Biomedical Informatics</conf-name>. <publisher-loc>New York, NY, USA</publisher-loc>: <publisher-name>BCB&#x2019;13 Association for Computing Machinery</publisher-name>. </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Nowotny</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>3Drefine: An Interactive Web Server for Efficient Protein Structure Refinement</article-title>. <source>Nucleic Acids Res.</source> <volume>44</volume> (<issue>W1</issue>), <fpage>W406</fpage>&#x2013;<lpage>W409</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw336</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>RefineD: Improved Protein Structure Refinement Using Machine Learning Based Restrained Relaxation</article-title>. <source>Bioinformatics</source> <volume>35</volume> (<issue>18</issue>), <fpage>3320</fpage>&#x2013;<lpage>3328</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz101</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Does Inclusion of Residue&#x2010;residue Contact Information Boost Protein Threading?</article-title> <source>Proteins</source> <volume>87</volume> (<issue>7</issue>), <fpage>596</fpage>&#x2013;<lpage>606</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25684</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Evaluating the Significance of Contact Maps in Low-Homology Protein Modeling Using Contact-Assisted Threading</article-title>. <source>Scientific Rep.</source> <volume>10</volume> (<issue>1</issue>), <fpage>2908</fpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/s41598-020-59834-2">10.1038/s41598-020-59834-2</ext-link> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Roche</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>DisCovER: Distance- and Orientation-Based Covariational Threading for Weakly Homologous Proteins</article-title>. <source>BioRxiv</source>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1101/2020.01.31.923409">10.1101/2020.01.31.923409</ext-link> </citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Bienkowska</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lathrop</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). &#x201c;<article-title>Threading Algorithms</article-title>,&#x201d; in <source>Encyclopedia Of Genetics, Genomics, Proteomics and Bioinformatics</source>. <comment>Editors</comment> <person-group person-group-type="editor">
<name>
<surname>Jorde</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Little</surname>
<given-names>P. F. R.</given-names>
</name>
<name>
<surname>Dunn</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Subramaniam</surname>
<given-names>S.</given-names>
</name>
</person-group> (<publisher-name>American Cancer Society</publisher-name>). <pub-id pub-id-type="doi">10.1002/047001153X.g409202</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bowie</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Luthy</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Eisenberg</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>A Method to Identify Protein Sequences that Fold into a Known Three-Dimensional Structure</article-title>. <source>Science</source> <volume>253</volume> (<issue>5016</issue>), <fpage>164</fpage>&#x2013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.1126/science.1853201</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brylinski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Skolnick</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Comparison of Structure-Based and Threading-Based Approaches to Protein Functional Annotation</article-title>. <source>Proteins</source> <volume>78</volume> (<issue>1</issue>), <fpage>18</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1002/prot.22566</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buchan</surname>
<given-names>D. W. A.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>EigenTHREADER: Analogous Protein Fold Recognition by Efficient Contact Map Threading</article-title>. <source>Bioinformatics</source> <volume>33</volume> (<issue>17</issue>), <fpage>2684</fpage>&#x2013;<lpage>2690</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx217</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>C.-C.</given-names>
</name>
<name>
<surname>Jeong</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Qian</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yoon</surname>
<given-names>B.-J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>TOPAS: Network-Based Structural Alignment of RNA Sequences</article-title>. <source>Bioinformatics</source> <volume>35</volume> (<issue>17</issue>), <fpage>2941</fpage>&#x2013;<lpage>2948</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz001</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>A Multi-Template Combination Algorithm for Protein Comparative Modeling</article-title>. <source>BMC Struct. Biol.</source> <volume>8</volume> (<issue>1</issue>), <fpage>18</fpage>. <pub-id pub-id-type="doi">10.1186/1472-6807-8-18</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Baldi</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>A Machine Learning Information Retrieval Approach to Protein Fold Recognition</article-title>. <source>Bioinformatics</source> <volume>22</volume> (<issue>12</issue>), <fpage>1456</fpage>&#x2013;<lpage>1463</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btl102</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Di Lena</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Fariselli</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Margara</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Vassura</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Casadio</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Fast Overlapping of Protein Contact Maps by Alignment of Eigenvectors</article-title>. <source>Bioinformatics</source> <volume>26</volume> (<issue>18</issue>), <fpage>2250</fpage>&#x2013;<lpage>2258</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq402</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dill</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>MacCallum</surname>
<given-names>J.&#x20;L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>The Protein-Folding Problem, 50&#x20;Years on</article-title>. <source>Science</source> <volume>338</volume> (<issue>6110</issue>), <fpage>1042</fpage>&#x2013;<lpage>1046</lpage>. <pub-id pub-id-type="doi">10.1126/science.1219021</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ding</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Gong</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Predicting the Real&#x2010;Valued Inter&#x2010;Residue Distances for Proteins</article-title>. <source>Adv. Sci.</source> <volume>7</volume> (<issue>19</issue>), <fpage>2001314</fpage>. <pub-id pub-id-type="doi">10.1002/advs.202001314</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Du</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>CATHER: A Novel Threading Algorithm with Predicted Contacts</article-title>. <source>Bioinformatics</source> <volume>36</volume> (<issue>7</issue>), <fpage>2119</fpage>&#x2013;<lpage>2125</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz876</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Eismann</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Suriana</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Jing</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Raphael</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Townshend</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Dror</surname>
<given-names>Ron. O.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Protein Model Quality Assessment Using Rotation-Equivariant, Hierarchical Neural Networks</article-title> [<comment>arXiv. 2011.13557</comment>]. <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/2011.13557">http://arxiv.org/abs/2011.13557</ext-link>. </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ginalski</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Pas</surname>
<given-names>Jakub.</given-names>
</name>
<name>
<surname>Wyrwicz</surname>
<given-names>L. S.</given-names>
</name>
<name>
<surname>von Grotthuss</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bujnicki</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Rychlewski</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>ORFeus: Detection of Distant Homology Using Sequence Profiles and Predicted Secondary Structure</article-title>. <source>Nucleic Acids Res.</source> <volume>31</volume> (<issue>13</issue>), <fpage>3804</fpage>&#x2013;<lpage>3807</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkg504</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gniewek</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kolinski</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kloczkowski</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gront</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>BioShell-Threading: Versatile Monte Carlo Package for Protein 3D Threading</article-title>. <source>BMC Bioinformatics</source> <volume>15</volume> (<issue>1</issue>), <fpage>22</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-15-22</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Greener</surname>
<given-names>J.&#x20;G.</given-names>
</name>
<name>
<surname>Kandathil</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>David. T.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Deep Learning Extends De Novo Protein Modelling Coverage of Genomes Using Iteratively Predicted Structural Constraints</article-title>. <source>Nat. Commun.</source> <volume>10</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-11994-0</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haas</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Gumienny</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Barbato</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ackermann</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Tauriello</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Bertoni</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Introducing &#x201C;best Single Template&#x201D; Models as Reference Baseline for the Continuous Automated Model Evaluation (CAMEO)</article-title>. <source>Proteins</source> <volume>87</volume> (<issue>12</issue>), <fpage>1378</fpage>&#x2013;<lpage>1387</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25815</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hanson</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Paliwal</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Litfin</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Accurate Prediction of Protein Contact Maps by Coupling Residual Two-Dimensional Bidirectional Long Short-Term Memory with Convolutional Neural Networks</article-title>. <source>Bioinformatics</source> <volume>34</volume> (<issue>23</issue>), <fpage>4039</fpage>&#x2013;<lpage>4045</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty481</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Mortuza</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>H.-B.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>NeBcon: Protein Contact Map Prediction Using Neural Network Training Coupled with Na&#xef;ve Bayes Classifiers</article-title>. <source>Bioinformatics</source> <volume>33</volume> (<issue>15</issue>), <fpage>2296</fpage>&#x2013;<lpage>2306</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx164</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heo</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Feig</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>High&#x2010;accuracy Protein Structures by Combining Machine&#x2010;learning with Physics&#x2010;based Refinement</article-title>. <source>Proteins</source> <volume>88</volume> (<issue>5</issue>), <fpage>637</fpage>&#x2013;<lpage>642</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25847</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jaroszewski</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rychlewski</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Godzik</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>FFAS03: a Server for Profile-Profile Sequence Alignments</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume> (<issue>Suppl. l_2</issue>), <fpage>W284</fpage>&#x2013;<lpage>W288</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki418</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>GenTHREADER: an Efficient and Reliable Protein Fold Recognition Method for Genomic Sequences</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>287</volume> (<issue>4</issue>), <fpage>797</fpage>&#x2013;<lpage>815</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.1999.2583</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kosciolek</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tetchner</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>MetaPSICOV: Combining Coevolution Methods for Accurate Prediction of Contacts and Long Range Hydrogen Bonding in Proteins</article-title>. <source>Bioinformatics</source> <volume>31</volume> (<issue>7</issue>), <fpage>999</fpage>&#x2013;<lpage>1006</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu791</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
<name>
<surname>Taylort</surname>
<given-names>W. R.</given-names>
</name>
<name>
<surname>Thornton</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>A New Approach to Protein Fold Recognition</article-title>. <source>Nature</source> <volume>358</volume> (<issue>6381</issue>), <fpage>86</fpage>&#x2013;<lpage>89</lpage>. <pub-id pub-id-type="doi">10.1038/358086a0</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>K&#xe4;llberg</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Template-Based Protein Structure Modeling Using the RaptorX Web Server</article-title>. <source>Nat. Protoc.</source> <volume>7</volume> (<issue>8</issue>), <fpage>1511</fpage>&#x2013;<lpage>1522</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2012.085</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kamisetty</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ovchinnikov</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Assessing the Utility of Coevolution-Based Residue-Residue Contact Predictions in a Sequence- and Structure-Rich Era</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>110</volume> (<issue>39</issue>), <fpage>15674</fpage>&#x2013;<lpage>15679</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1314045110</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kandathil</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Greener</surname>
<given-names>J.&#x20;G.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Prediction of Interresidue Contacts with DeepMetaPSICOV in CASP13</article-title>. <source>Proteins</source> <volume>87</volume> (<issue>12</issue>), <fpage>1092</fpage>&#x2013;<lpage>1099</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25779</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karasikov</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Pag&#xe8;s</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Grudinin</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Smooth Orientation-dependent Scoring Function for Coarse-Grained Protein Quality Assessment</article-title>. <source>Bioinformatics</source> <volume>35</volume> (<issue>16</issue>), <fpage>2801</fpage>&#x2013;<lpage>2808</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty1037</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>S. Y.</given-names>
</name>
<name>
<surname>Skolnick</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>TASSER_WT: A Protein Structure Prediction Algorithm with Accurate Predicted Contact Restraints for Difficult Protein Targets</article-title>. <source>Biophysical J.</source> <volume>99</volume> (<issue>9</issue>), <fpage>3066</fpage>&#x2013;<lpage>3075</lpage>. <pub-id pub-id-type="doi">10.1016/j.bpj.2010.09.007</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>&#x201c;Study of Real-Valued Distance Prediction for Protein Structure Prediction with Deep Learning&#x201d;</article-title> <source>BioRxiv</source> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1101/2020.11.26.400523">10.1101/2020.11.26.400523</ext-link> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>D.-J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>ResPRE: High-Accuracy Protein Contact Prediction by Coupling Precision Matrix with Deep Residual Neural Networks</article-title>. <source>Bioinformatics</source> <volume>35</volume> (<issue>22</issue>), <fpage>4647</fpage>&#x2013;<lpage>4655</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz291</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lobley</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sadowski</surname>
<given-names>M. I.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>D. T.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>PGenTHREADER and PDomTHREADER: New Methods for Improved Protein Fold Recognition and Superfamily Discrimination</article-title>. <source>Bioinformatics</source> <volume>25</volume> (<issue>14</issue>), <fpage>1761</fpage>&#x2013;<lpage>1767</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp302</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>Jianzhu.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Sheng.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Zhiyong.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Jinbo.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>MRFalign: Protein Homology Detection through Alignment of Markov Random Fields</article-title>. <source>PLOS Comput. Biol.</source> <volume>10</volume> (<issue>3</issue>), <fpage>e1003500</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1003500</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>A Conditional Neural Fields Model for Protein Threading</article-title>. <source>Bioinformatics</source> <volume>28</volume> (<issue>12</issue>), <fpage>i59</fpage>&#x2013;<lpage>i66</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts213</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Protein Threading Using Context-specific Alignment Potential</article-title>. <source>Bioinformatics</source> <volume>29</volume> (<issue>13</issue>), <fpage>i257</fpage>&#x2013;<lpage>i265</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt210</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Markowitz</surname>
<given-names>V. M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>I.-M. A.</given-names>
</name>
<name>
<surname>Chu</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Szeto</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Palaniappan</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Pillay</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>IMG/M 4 Version of the Integrated Metagenome Comparative Analysis System</article-title>. <source>Nucl. Acids Res.</source> <volume>42</volume> (<issue>D1</issue>), <fpage>D568</fpage>&#x2013;<lpage>D573</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt919</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marks</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Robert</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Hopf</surname>
<given-names>Ts. A.</given-names>
</name>
<name>
<surname>Pagnani</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zecchina</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Sander</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Protein 3D Structure Computed from Evolutionary Sequence Variation</article-title>. <source>PLOS ONE</source> <volume>6</volume> (<issue>12</issue>), <fpage>e28766</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0028766</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marti</surname>
<given-names>R. M. A.</given-names>
</name>
<name>
<surname>Madhusudhan</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Sali</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Alignment of Protein Sequences by Their Profiles</article-title>. <source>Protein Sci.</source> <volume>13</volume> (<issue>4</issue>), <fpage>1071</fpage>&#x2013;<lpage>1087</lpage>. <pub-id pub-id-type="doi">10.1110/ps.03379804</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meier</surname>
<given-names>Armin.</given-names>
</name>
<name>
<surname>S&#xf6;ding</surname>
<given-names>Johannes.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Automatic Prediction of Protein 3D Structures by Probabilistic Multi-Template Homology Modeling</article-title>. <source>PLOS Comput. Biol.</source> <volume>11</volume> (<issue>10</issue>), <fpage>e1004343</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1004343</pub-id> </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mirdita</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>von&#xa0;den&#xa0;Driesch</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Galiez</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>S&#xf6;ding</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Steinegger</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Uniclust Databases of Clustered and Deeply Annotated Protein Sequences and Alignments</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume> (<issue>D1</issue>), <fpage>D170</fpage>&#x2013;<lpage>D176</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1081</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mitchell</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Almeida</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Beracochea</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Boland</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Burgin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cochrane</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>MGnify: The Microbiome Analysis Resource in 2020</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume> (<issue>D1</issue>), <fpage>D570</fpage>&#x2013;<lpage>D578</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz1035</pub-id> </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mitchell</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Scheremetjew</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Denise</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Potter</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Tarkowska</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Qureshi</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>EBI Metagenomics in 2017: Enriching the Analysis of Microbial Communities, from Sequence Reads to Assemblies</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume> (<issue>D1</issue>), <fpage>D726</fpage>&#x2013;<lpage>D735</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx967</pub-id> </citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morcos</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Pagnani</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lunt</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bertolino</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Marks</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Sander</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Marks, Chris Sander, Riccardo Zecchina, Jos&#xe9; N. Onuchic, Terence Hwa, and Martin Weigt.Direct-Coupling Analysis of Residue Coevolution Captures Native Contacts across Many Protein Families</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>108</volume> (<issue>49</issue>), <fpage>E1293</fpage>&#x2013;<lpage>E1301</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1111471108</pub-id> </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ovchinnikov</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Varghese</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>P.-S.</given-names>
</name>
<name>
<surname>Pavlopoulos</surname>
<given-names>G. A.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>D. E.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Protein Structure Determination Using Metagenome Sequence Data</article-title>. <source>Science</source> <volume>355</volume> (<issue>6322</issue>), <fpage>294</fpage>&#x2013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1126/science.aah4043</pub-id> </citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pearce</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Deep Learning Techniques Have Significantly Impacted Protein Structure Prediction and Protein Design</article-title>. <source>Curr. Opin. Struct. Biol.</source> <volume>68</volume> (<issue>June</issue>), <fpage>194</fpage>&#x2013;<lpage>207</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbi.2021.01.007</pub-id> </citation>
</ref>
<ref id="B64">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2009</year>). &#x201c;<article-title>Boosting Protein Threading Accuracy</article-title>,&#x201d; in <source>In <italic>Research In Computational Molecular Biology</italic>
</source>, Editor <person-group person-group-type="editor">
<name>
<surname>Batzoglou</surname>
<given-names>S</given-names>
</name>
</person-group>, <fpage>31</fpage>&#x2013;<lpage>45</lpage>. <comment>Lecture Notes in Computer Science</comment> (<publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>). <pub-id pub-id-type="doi">10.1007/978-3-642-02008-7_3</pub-id> </citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Low-Homology Protein Threading</article-title>. <source>Bioinformatics</source> <volume>26</volume> (<issue>12</issue>), <fpage>i294</fpage>&#x2013;<lpage>300</lpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btq192">10.1093/bioinformatics/btq192</ext-link> </citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>A Multiple-Template Approach to Protein Threading</article-title>. <source>Proteins: Struct. Funct. Bioinformatics</source> <volume>79</volume> (<issue>6</issue>), <fpage>1930</fpage>&#x2013;<lpage>1939</lpage>. <pub-id pub-id-type="doi">10.1002/prot.23016</pub-id> </citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Petrey</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Honig</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Protein Structure Prediction: Inroads to Biology</article-title>. <source>Mol. Cel</source> <volume>20</volume> (<issue>6</issue>), <fpage>811</fpage>&#x2013;<lpage>819</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2005.12.005</pub-id> </citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ray</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lindahl</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Wallner</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Improved Model Quality Assessment Using ProQ2</article-title>. <source>BMC Bioinformatics</source> <volume>13</volume> (<issue>1</issue>), <fpage>224</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-13-224</pub-id> </citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roche</surname>
<given-names>Rahmatullah.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>Sutanu.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>Debswapna.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Hybridized Distance- and Contact-Based Hierarchical Structure Modeling for Folding Soluble and Membrane Proteins</article-title>. <source>PLOS Comput. Biol.</source> <volume>17</volume> (<issue>2</issue>), <fpage>e1008753</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1008753</pub-id> </citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rychlewski</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Jaroszewski</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Godzik</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Comparison of Sequence Profiles. Strategies for Structural Predictions Using Sequence Information</article-title>. <source>Protein Sci.</source> <volume>9</volume> (<issue>2</issue>), <fpage>232</fpage>&#x2013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1110/ps.9.2.232</pub-id> </citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Senior</surname>
<given-names>A. W.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Jumper</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kirkpatrick</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sifre</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Protein Structure Prediction Using Multiple Deep Neural Networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13)</article-title>. <source>Proteins</source> <volume>87</volume> (<issue>12</issue>), <fpage>1141</fpage>&#x2013;<lpage>1148</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25834</pub-id> </citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Senior</surname>
<given-names>A. W.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Jumper</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kirkpatrick</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sifre</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Improved Protein Structure Prediction Using Potentials from Deep Learning</article-title>. <source>Nature</source> <volume>577</volume> (<issue>7792</issue>), <fpage>706</fpage>&#x2013;<lpage>710</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-019-1923-7</pub-id> </citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shuvo</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bhattacharya</surname>
<given-names>Debswapna.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>QDeep: Distance-Based Protein Model Quality Estimation by Residue-Level Ensemble Error Classifications Using Stacked Deep Residual Neural Networks</article-title>. <source>Bioinformatics</source> <volume>36</volume> (<issue>Suppl. ment_1</issue>), <fpage>i285</fpage>&#x2013;<lpage>i291</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa455</pub-id> </citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Skolnick</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kihara</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Defrosting the Frozen Approximation: PROSPECTOR? A New Approach to Threading</article-title>. <source>Proteins</source> <volume>42</volume> (<issue>3</issue>), <fpage>319</fpage>&#x2013;<lpage>331</lpage>. <pub-id pub-id-type="doi">10.1002/1097-0134(20010215)42:3&#x3c;319::aid-prot30&#x3e;3.0.co;2-a</pub-id> </citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>S&#xf6;ding</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Big-Data Approaches to Protein Structure Prediction</article-title>. <source>Science</source> <volume>355</volume> (<issue>6322</issue>), <fpage>248</fpage>&#x2013;<lpage>249</lpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1126/science.aal4512">10.1126/science.aal4512</ext-link> </citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>S&#xf6;ding</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Protein Homology Detection by HMM-HMM Comparison</article-title>. <source>Bioinformatics</source> <volume>21</volume> (<issue>7</issue>), <fpage>951</fpage>&#x2013;<lpage>960</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti125</pub-id> </citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Steinegger</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>S&#xf6;ding</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Clustering Huge Protein Sequence Sets in Linear Time</article-title>. <source>Nat. Commun.</source> <volume>9</volume> (<issue>1</issue>), <fpage>2542</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-018-04964-5</pub-id> </citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Steinegger</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mirdita</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>S&#xf6;ding</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Protein-Level Assembly Increases Protein Sequence Recovery from Metagenomic Samples Manyfold</article-title>. <source>Nat. Methods</source> <volume>16</volume> (<issue>7</issue>), <fpage>603</fpage>&#x2013;<lpage>606</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-019-0437-</pub-id> </citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Suzek</surname>
<given-names>B. E.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>McGarvey</surname>
<given-names>P. B.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>C. H.</given-names>
</name>
</person-group>
<collab>, and the UniProt Consortium</collab> (<year>2015</year>). <article-title>UniRef Clusters: A Comprehensive and Scalable Alternative for Improving Sequence Similarity Searches</article-title>. <source>Bioinformatics</source> <volume>31</volume> (<issue>6</issue>), <fpage>926</fpage>&#x2013;<lpage>932</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu739</pub-id> </citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taylor</surname>
<given-names>William. R.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Protein Structure Comparison Using Iterated Double Dynamic Programming</article-title>. <source>Protein Sci.</source> <volume>8</volume> (<issue>3</issue>), <fpage>654</fpage>&#x2013;<lpage>665</lpage>. <pub-id pub-id-type="doi">10.1110/ps.8.3.654</pub-id> </citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<collab>The UniProt Consortium</collab> (<year>2019</year>). <article-title>UniProt: A Worldwide Hub of Protein Knowledge</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume> (<issue>D1</issue>), <fpage>D506</fpage>&#x2013;<lpage>D515</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1049</pub-id> </citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Uziela</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Men&#xe9;ndez Hurtado</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Wallner</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Elofsson</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>ProQ3D: Improved Model Quality Assessments Using Deep Learning</article-title>. <source>Bioinformatics</source> <volume>33</volume> (<issue>10</issue>), <fpage>1578</fpage>&#x2013;<lpage>1580</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw819</pub-id> </citation>
</ref>
<ref id="B83">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Uziela</surname>
<given-names>Karolis.</given-names>
</name>
<name>
<surname>Shu</surname>
<given-names>Nanjiang.</given-names>
</name>
<name>
<surname>Wallner</surname>
<given-names>Bj&#xf6;rn.</given-names>
</name>
<name>
<surname>Elofsson</surname>
<given-names>Arne.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>ProQ3: Improved Model Quality Assessments Using Rosetta Energy Terms</article-title>. <source>Scientific Rep.</source> <volume>6</volume> (<issue>1</issue>), <fpage>33509</fpage>. <pub-id pub-id-type="doi">10.1038/srep33509</pub-id> </citation>
</ref>
<ref id="B84">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Geng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y.-J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Artificial Intelligence-Based Multi-Objective Optimization Protocol for Protein Structure Refinement</article-title>. <source>Bioinformatics</source> <volume>36</volume> (<issue>2</issue>), <fpage>437</fpage>&#x2013;<lpage>448</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz544</pub-id> </citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Sheng.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>Siqi.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Zhen.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Renyu.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Jinbo.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Accurate De Novo Prediction of Protein Contact Map by Ultra-deep Learning Model</article-title>. <source>PLOS Comput. Biol.</source> <volume>13</volume> (<issue>1</issue>), <fpage>e1005324</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005324</pub-id> </citation>
</ref>
<ref id="B86">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>CoinFold: A Web Server for Protein Contact Prediction and Contact-Assisted Protein Folding</article-title>. <source>Nucleic Acids Res.</source> <volume>44</volume> (<issue>W1</issue>), <fpage>W361</fpage>&#x2013;<lpage>W366</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw307</pub-id> </citation>
</ref>
<ref id="B87">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Mortuza</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Fueling Ab Initio Folding with Marine Metagenomics Enables Structure and Function Predictions of New Protein Families</article-title>. <source>Genome Biol.</source> <volume>20</volume> (<issue>1</issue>), <fpage>229</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-019-1823-z</pub-id> </citation>
</ref>
<ref id="B88">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Anishchenko</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Cong</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Protein Contact Prediction Using Metagenome Sequence Data and Residual Neural Networks</article-title>. <source>Bioinformatics</source> <volume>36</volume> (<issue>1</issue>), <fpage>41</fpage>&#x2013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz477</pub-id> </citation>
</ref>
<ref id="B89">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>LOMETS: A Local Meta-Threading-Server for Protein Structure Prediction</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume> (<issue>10</issue>), <fpage>3375</fpage>&#x2013;<lpage>3382</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkm251</pub-id> </citation>
</ref>
<ref id="B90">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>&#x201c;MUSTER: Improving Protein Sequence Profile&#x2013;Profile Alignments by Using Multiple Sources of Structure Information</article-title>. <source>Proteins: Struct. Funct. Bioinformatics</source> <volume>72</volume> (<issue>2</issue>), <fpage>547</fpage>&#x2013;<lpage>556</lpage>. <pub-id pub-id-type="doi">10.1002/prot.21945</pub-id> </citation>
</ref>
<ref id="B91">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Recognizing Protein Substructure Similarity Using Segmental Threading</article-title>. <source>Structure</source> <volume>18</volume> (<issue>7</issue>), <fpage>858</fpage>&#x2013;<lpage>867</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2010.04.007</pub-id> </citation>
</ref>
<ref id="B92">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Tianqi.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Zhiye.</given-names>
</name>
<name>
<surname>Hou</surname>
<given-names>Jie.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>Jianlin.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>DeepDist: Real-Value Inter-residue Distance Prediction with Deep Residual Convolutional Network</article-title>. <source>BMC Bioinformatics</source> <volume>22</volume> (<issue>1</issue>), <fpage>30</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-021-03960-9</pub-id> </citation>
</ref>
<ref id="B93">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Distance-Based Protein Folding Powered by Deep Learning</article-title>. <source>Proc. Natl. Acad. Sci. USA</source> <volume>116</volume> (<issue>34</issue>), <fpage>16856</fpage>&#x2013;<lpage>16865</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1821309116</pub-id> </citation>
</ref>
<ref id="B94">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Raptor: Optimal Protein Threading by Linear Programming</article-title>. <source>J.&#x20;Bioinform. Comput. Biol.</source> <volume>01</volume> (<issue>01</issue>), <fpage>95</fpage>&#x2013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1142/s0219720003000186</pub-id> </citation>
</ref>
<ref id="B95">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Analysis of Distance&#x2010;based Protein Structure Prediction by Deep Learning in CASP13</article-title>. <source>Proteins</source> <volume>87</volume> (<issue>12</issue>), <fpage>1069</fpage>&#x2013;<lpage>1081</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25810</pub-id> </citation>
</ref>
<ref id="B96">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Protein Threading Using PROSPECT: Design and Evaluation</article-title>. <source>Proteins</source> <volume>40</volume> (<issue>3</issue>), <fpage>343</fpage>&#x2013;<lpage>354</lpage>. <pub-id pub-id-type="doi">10.1002/1097-0134(20000815)40:3&#x3c;343::aid-prot10&#x3e;3.0.co;2-s</pub-id> </citation>
</ref>
<ref id="B97">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Anishchenko</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Ovchinnikov</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Improved Protein Structure Prediction Using Predicted Interresidue Orientations</article-title>. <source>Proc. Natl. Acad. Sci. USA</source> <volume>117</volume> (<issue>3</issue>), <fpage>1496</fpage>&#x2013;<lpage>1503</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1914677117</pub-id> </citation>
</ref>
<ref id="B98">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Faraggi</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Improving Protein Fold Recognition and Template-Based Modeling by Employing Probabilistic-Based Matching between Predicted One-Dimensional Structural Properties of Query and Corresponding Native Properties of Templates</article-title>. <source>Bioinformatics</source> <volume>27</volume> (<issue>15</issue>), <fpage>2076</fpage>&#x2013;<lpage>2082</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr350</pub-id> </citation>
</ref>
<ref id="B99">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Mortuza</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>DeepMSA: Constructing Deep Multiple Sequence Alignment to Improve Contact Prediction and Fold-Recognition for Distant-Homology Proteins</article-title>. <source>Bioinformatics</source> <volume>36</volume> (<issue>7</issue>), <fpage>2105</fpage>&#x2013;<lpage>2112</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz863</pub-id> </citation>
</ref>
<ref id="B100">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Hg.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Template-Based Prediction of Protein Structure with Deep Learning</article-title>. <source>BMC Genomics</source> <volume>21</volume> (<issue>11</issue>), <fpage>878</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-020-07249-8</pub-id> </citation>
</ref>
<ref id="B101">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wuyun</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Li.</given-names>
</name>
<name>
<surname>Mortuza</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Pearce</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2019a</year>). <article-title>Detecting Distant-Homology Protein Structures by Aligning Deep Neural-Network Based Contact Maps</article-title>. <source>PLOS Comput. Biol.</source> <volume>15</volume> (<issue>10</issue>), <fpage>e1007411</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1007411</pub-id> </citation>
</ref>
<ref id="B102">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wuyun</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Pearce</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>LOMETS2: Improved Meta-Threading Server for Fold-Recognition and Structure-Based Function Annotation for Distant-Homology Proteins</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume> (<issue>W1</issue>), <fpage>W429</fpage>&#x2013;<lpage>W436</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz384</pub-id> </citation>
</ref>
<ref id="B103">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Fold Recognition by Combining Sequence Profiles Derived from Evolution and from Depth-dependent Structural Alignment of Fragments</article-title>. <source>Proteins</source> <volume>58</volume> (<issue>2</issue>), <fpage>321</fpage>&#x2013;<lpage>328</lpage>. <pub-id pub-id-type="doi">10.1002/prot.20308</pub-id>
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/prot.20308"/> </citation>
</ref>
<ref id="B104">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Protein Threading Using Residue Co-variation and Deep Learning</article-title>. <source>Bioinformatics</source> <volume>34</volume> (<issue>13</issue>), <fpage>i263</fpage>&#x2013;<lpage>i273</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty278</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>