<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="review-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Immunol.</journal-id>
<journal-title>Frontiers in Immunology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Immunol.</abbrev-journal-title>
<issn pub-type="epub">1664-3224</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fimmu.2024.1463931</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Immunology</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Integrating machine learning to advance epitope mapping</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Grewal</surname>
<given-names>Simranjit</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2541649"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Hegde</surname>
<given-names>Nidhi</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2791200"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Yanow</surname>
<given-names>Stephanie K.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/707343"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/project-administration/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Medical Microbiology and Immunology, University of Alberta</institution>, <addr-line>Edmonton, AB</addr-line>, <country>Canada</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Computing Science, University of Alberta</institution>, <addr-line>Edmonton, AB</addr-line>, <country>Canada</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>School of Public Health, University of Alberta</institution>, <addr-line>Edmonton, AB</addr-line>, <country>Canada</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Anastas Dimitrov Pashov, Bulgarian Academy of Sciences (BAS), Bulgaria</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Anjali Dhall, National Cancer Institute (NIH), United States</p>
<p>Minh Nguyen, Bioinformatics Institute (A*STAR), Singapore</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Stephanie K. Yanow, <email xlink:href="mailto:yanow@ualberta.ca">yanow@ualberta.ca</email>
</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>30</day>
<month>09</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>15</volume>
<elocation-id>1463931</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>07</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>09</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2024 Grewal, Hegde and Yanow</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Grewal, Hegde and Yanow</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Identifying epitopes, or the segments of a protein that bind to antibodies, is critical for the development of a variety of immunotherapeutics and diagnostics. In vaccine design, the intent is to identify the minimal epitope of an antigen that can elicit an immune response and avoid off-target effects. For prognostics and diagnostics, the epitope-antibody interaction is exploited to measure antigens associated with disease outcomes. Experimental methods such as X-ray crystallography, cryo-electron microscopy, and peptide arrays are used widely to map epitopes but vary in accuracy, throughput, cost, and feasibility. By comparing machine learning epitope mapping tools, we discuss the importance of data selection, feature design, and algorithm choice in determining the specificity and prediction accuracy of an algorithm. This review discusses limitations of current methods and the potential for machine learning to deepen interpretation and increase feasibility of these methods. We also propose how machine learning can be employed to refine epitope prediction to address the apparent promiscuity of polyreactive antibodies and the challenge of defining conformational epitopes. We highlight the impact of machine learning on our current understanding of epitopes and its potential to guide the design of therapeutic interventions with more predictable outcomes.</p>
</abstract>
<kwd-group>
<kwd>machine learning</kwd>
<kwd>epitope</kwd>
<kwd>B-cell</kwd>
<kwd>algorithm</kwd>
<kwd>features</kwd>
<kwd>databases</kwd>
<kwd>toolboxes</kwd>
<kwd>vaccine</kwd>
</kwd-group>
<contract-num rid="cn002">R01AI150944</contract-num>
<contract-sponsor id="cn001">Canadian Institutes of Health Research<named-content content-type="fundref-id">10.13039/501100000024</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">National Institute of Allergy and Infectious Diseases<named-content content-type="fundref-id">10.13039/100000060</named-content>
</contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="6"/>
<equation-count count="0"/>
<ref-count count="152"/>
<page-count count="14"/>
<word-count count="7126"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-in-acceptance</meta-name>
<meta-value>Vaccines and Molecular Therapeutics</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>Vaccines are among the most successful and cost-effective public health interventions, particularly to protect against infectious diseases. This was never more evident than during the COVID-19 pandemic where vaccines were the most valuable intervention to protect vulnerable populations from hospitalization and death (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>). All the SARS-CoV-2 vaccines were based on the spike protein as the vaccine antigen, either expressed from DNA, mRNA, or as a recombinant protein, and elicited immune responses against dominant epitopes (or segments) within the protein. In response, new variants of the virus emerged with different amino acid sequences in these epitopes, impairing the efficacy of the first-generation vaccines and requiring the design of new variant-specific ones. The ongoing management of SARS-CoV-2 depends on our preparedness against emerging variants; this can be facilitated by designing vaccines that focus immune responses on highly conserved epitopes.</p>
<p>Precise mapping of B-cell epitopes is critical beyond vaccine development for other antibody-based interventions such as immunotherapeutics and diagnostics (<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B4">4</xref>). Likewise, mapping T-cell epitopes can improve our understanding of immune responses to infectious and autoimmune diseases (<xref ref-type="bibr" rid="B5">5</xref>) and support the development of immunotherapeutics with an increased safety profile by reducing off-target effects (<xref ref-type="bibr" rid="B6">6</xref>). The challenge is how to identify or map these epitopes.</p>
<p>Machine learning algorithms targeted at epitope mapping are undergoing continual development and immense growth. These algorithms are improving upon existing <italic>in vitro</italic> methods by exploiting the vast reservoir of published experimental data to find patterns and predict regions of a protein likely to be a part of an epitope. Here, we discuss how current methods have benefited from integrating machine learning and explore future applications to further refine epitope mapping.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>
<italic>In vitro</italic> epitope mapping methods</title>
<p>What constitutes an epitope varies immensely between two major types of immune cells in the body: T-cells and B-cells. T-cell epitopes consist of antigens processed into 8-10 amino acid linear segments that are recognized by major histocompatibility complex (MHC) class I molecules and 13-17 amino acid segments for MHC class II molecules (<xref ref-type="bibr" rid="B7">7</xref>). In contrast, B-cell epitopes are typically (90%) conformational, involving amino acids that are spatially close to one another due to secondary structure, tertiary structure, or quaternary structure (<xref ref-type="bibr" rid="B8">8</xref>&#x2013;<xref ref-type="bibr" rid="B10">10</xref>). This makes them variable in length and structure. Further, while antibodies typically interact with 15 to 22 amino acids on the surface of an antigen, approximately 5 amino acids contribute most significantly to stabilizing the antibody-antigen complex (<xref ref-type="bibr" rid="B11">11</xref>).</p>
<p>Several experimental methods are used to map epitopes, each with pros and cons related to accuracy, throughput, and cost (<xref ref-type="bibr" rid="B12">12</xref>). X-ray crystallography, for example, provides accurate information about the three-dimensional complex between an antibody and the antigen that can encompass conformational changes and even highlight dynamics by comparing bound and unbound states. Epitopes are mapped based on their proximity to crystallized residues within the paratope of the antibody (<xref ref-type="bibr" rid="B13">13</xref>). Analysis tools, such as PDBePISA, provide information on the interactions of residues between an epitope and the corresponding paratope based on distance, participating residues, and orientation (<xref ref-type="bibr" rid="B14">14</xref>). Challenges with this approach are the stability, size, and solubility of the proteins to form a well-ordered lattice. Post-translational modifications can also impede crystal formation. Further, there can be several possible antibody-antigen complexes depending on the physical parameters used for crystallization (<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>Cryo-electron microscopy (cryo-EM) is another similar biophysical method for mapping epitopes. Freezing epitopes bound and unbound to their paratope yields density maps that compare the two states. These density maps are made by compositing 2D images from various angles into a 3D map of the molecule and the differences between the two density maps indicate the residues of an epitope (<xref ref-type="bibr" rid="B16">16</xref>).</p>
<p>On the other hand, screening peptide arrays provides much higher throughput, but at the cost of accuracy; the resolution of an epitope by a peptide array is not as high as the resolution attained from cryo-EM or X-ray crystallography. Peptide arrays consist of libraries of synthetic, overlapping peptides (usually 15-20 amino acids in length) that are screened with an antibody of interest (or serum) to identify those peptides that bind strongly (<xref ref-type="bibr" rid="B17">17</xref>). Thousands of peptides can be screened at once. The primary antibody bound to peptides is detected by a secondary antibody that emits a luminescent signal. Despite epitopes being typically conformational, peptide arrays are useful for several reasons (<xref ref-type="bibr" rid="B9">9</xref>). They are ideal for mapping linear epitopes and the extent of overlap among the peptides can reveal residues that are important components of the epitope. These can be specifically mutated (e.g. by alanine walking) to confirm their contribution to antibody binding (<xref ref-type="bibr" rid="B18">18</xref>). Thus, it is possible to determine key residues that dominate the antibody&#x2019;s affinity for the linear epitope (<xref ref-type="bibr" rid="B19">19</xref>). However, peptide arrays are less informative to identify conformational epitopes. When screening reveals multiple peptides recognized by an antibody, the interpretation of these data relies on 3D models of the protein to map the binding sites. If the reactive peptides lie in proximity within the 3D protein structure, this could indicate a conformational epitope (<xref ref-type="bibr" rid="B20">20</xref>).</p>
</sec>
<sec id="s3">
<label>3</label>
<title>Machine learning</title>
<p>Machine learning is a powerful tool that can be used to address the complexity of data from epitope mapping studies. It is effectively a subset of artificial intelligence in which previous outcomes of a task provide an algorithm with experience that allows it to improve in the same task applied to new data (<xref ref-type="bibr" rid="B21">21</xref>). &#x2018;Surviving the Titanic&#x2019; is a common example of machine learning; this problem involves a dataset containing passenger information that is used to predict the likelihood of survival of specific passengers (<xref ref-type="bibr" rid="B22">22</xref>). Information on gender, age, cabin class, etc. are studied and used to predict whether a passenger survived the Titanic disaster. The algorithm identifies patterns in the data and correlates them to what it is tasked with predicting (in this case survival). The &#x2018;Surviving the Titanic&#x2019; problem is a valuable practice problem to learn the basics of machine learning and includes many resources and tutorials.</p>
<sec id="s3_1">
<label>3.1</label>
<title>Datasets &#x2013; training and testing</title>
<p>Importantly, the performance of an algorithm directly relates to the breadth and complexity of the datasets it learns from. The more novel cases and rare &#x2018;edge&#x2019; cases in the dataset, the more the algorithm will learn how to predict similar scenarios in the future. In this way, machine learning can be used to make accurate predictions on a variety of tasks. In general, the algorithm benefits from more data but it is also important to recognize that curated data may be valuable for developing specialized algorithms. For example, if a machine learning algorithm is used specifically to predict cancer epitopes, it should be trained on a specialized cancer database, such as CEDAR (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>) (<xref ref-type="bibr" rid="B23">23</xref>). Adding data for the prediction of different types of epitopes may improve a machine learning algorithm&#x2019;s generalizability but could also mask biological patterns unique to cancer epitopes. Additionally, data that are very similar to existing training data do not &#x2018;teach&#x2019; an algorithm anything new.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>T-cell epitope associated databases<sup>a</sup>.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="left">#</th>
<th valign="middle" align="left">Name</th>
<th valign="middle" align="left">Scope</th>
<th valign="middle" align="left">Approximate size</th>
<th valign="middle" align="left">Link</th>
<th valign="middle" align="left">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">IEDB-3D 2.0</td>
<td valign="top" align="left">Structural T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">410 assays<break/>(T-cell)<break/>1,446 assays<break/>(MHC binding)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.iedb.org/">https://www.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B81">81</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">HIV molecular immunology database</td>
<td valign="top" align="left">HIV-associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">13,700 entries<break/>(T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.hiv.lanl.gov/content/immunology/index.html">https://www.hiv.lanl.gov/content/immunology/index.html</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B82">82</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">CEDAR</td>
<td valign="top" align="left">Cancer-associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">224,000 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://cedar.iedb.org/">https://cedar.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B23">23</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">IEDB</td>
<td valign="top" align="left">T- cell &amp; B-cell epitopes</td>
<td valign="top" align="left">1,610,000 entries (B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.iedb.org/">https://www.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B83">83</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">TANTIGEN 2.0</td>
<td valign="top" align="left">Cancer-associated T-cell epitopes</td>
<td valign="top" align="left">1,000 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://projects.met-hilab.org/tadb/">http://projects.met-hilab.org/tadb/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B84">84</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">Protegen</td>
<td valign="top" align="left">Protective antigens</td>
<td valign="top" align="left">1600 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://violinet.org/protegen/">https://violinet.org/protegen/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B85">85</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">FLAVIdB<sup>b</sup>
</td>
<td valign="top" align="left">Flavivirus associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">12,800 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://cvc.dfci.harvard.edu/flavi/">http://cvc.dfci.harvard.edu/flavi/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B86">86</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">MHCBN 4.0</td>
<td valign="top" align="left">Peptides associated with TAP and MHC</td>
<td valign="top" align="left">25,000 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/mhcbn/">https://webs.iiitd.edu.in/raghava/mhcbn/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B87">87</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">EPIMHC</td>
<td valign="top" align="left">MHC-restricted peptide ligands and epitopes</td>
<td valign="top" align="left">4800 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://bio.med.ucm.es/epimhc/">http://bio.med.ucm.es/epimhc/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B88">88</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">SYFPEITHI</td>
<td valign="top" align="left">Peptide binding MHC I or MHC II</td>
<td valign="top" align="left">7,000 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.syfpeithi.de/">http://www.syfpeithi.de/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B89">89</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>a</sup>Accessibility evaluated on 12-06-2024.</p>
</fn>
<fn>
<p>
<sup>b</sup>This database was found to be inaccessible as of 12-06-2024. Its inclusion depends on two factors: links often migrate or are repaired, and these databases have been used in the development of tools in the past. The link provided is the last known link used to access the database.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Many publicly accessible databases are available as sources of datasets for either T-cell or B-cell epitopes (<xref ref-type="table" rid="T1">
<bold>Tables&#xa0;1</bold>
</xref>, <xref ref-type="table" rid="T2">
<bold>2</bold>
</xref>). These databases provide large, specialized sources of data that algorithms can study and use for predictions. RCSB PDB, for example, provides primarily 3D structures of various proteins and multimolecular complexes, and information relating to their molecular composition, position, length, chains, etc. AntigenDB, on the other hand, provides sequence, structure, classifications, etc., of various experimentally validated antigens. AntiJen contains data on many topics but, notably, it is a valuable source of data for both B-cell and T-cell epitopes.</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>B-cell epitope associated databases<sup>a</sup>.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="left">#</th>
<th valign="middle" align="left">Name</th>
<th valign="middle" align="left">Scope</th>
<th valign="middle" align="left">Approximate size</th>
<th valign="middle" align="left">Link</th>
<th valign="middle" align="left">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">IEDB-3D 2.0</td>
<td valign="top" align="left">Structural T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">4,859 assays<break/>(B-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.iedb.org/">https://www.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B81">81</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">SDAP 2.0</td>
<td valign="top" align="left">Allergen-associated B-cell epitopes</td>
<td valign="top" align="left">4,000 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://fermi.utmb.edu/SDAP/sdap_fas.html">https://fermi.utmb.edu/SDAP/sdap_fas.html</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B90">90</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">HIV molecular immunology database</td>
<td valign="top" align="left">HIV-associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">4,200 entries<break/>(B-cells)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.hiv.lanl.gov/content/immunology/index.html">https://www.hiv.lanl.gov/content/immunology/index.html</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B82">82</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">CEDAR</td>
<td valign="top" align="left">Cancer-associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">224,000 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://cedar.iedb.org/">https://cedar.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B23">23</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="left">IEDB</td>
<td valign="top" align="left">T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">1,610,000 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.iedb.org/">https://www.iedb.org/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B83">83</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="left">FLAVIdB<sup>b</sup>
</td>
<td valign="top" align="left">Flavivirus-associated T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">12,858 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://cvc.dfci.harvard.edu/flavi/">http://cvc.dfci.harvard.edu/flavi/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B86">86</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="left">Protegen</td>
<td valign="top" align="left">Protective antigens</td>
<td valign="top" align="left">1600 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://violinet.org/protegen/">https://violinet.org/protegen/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B85">85</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="left">CED<sup>b</sup>
</td>
<td valign="top" align="left">Conformational B-cell epitopes</td>
<td valign="top" align="left">225 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://immunet.cn/ced/">http://immunet.cn/ced/</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B91">91</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="left">EPITOME<sup>b</sup>
</td>
<td valign="top" align="left">B-cell epitopes</td>
<td valign="top" align="left">142 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.rostlab.org/services/epitome/submit.php">https://www.rostlab.org/services/epitome/submit.php</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B92">92</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="left">BciPep</td>
<td valign="top" align="left">B-cell epitopes</td>
<td valign="top" align="left">3,031 entries</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://crdd.osdd.net/raghava/bcipep/pep_src.html">http://crdd.osdd.net/raghava/bcipep/pep_src.html</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B28">28</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="left">AntiJen</td>
<td valign="top" align="left">T-cell &amp; B-cell epitopes</td>
<td valign="top" align="left">24,000 entries<break/>(B-cell &amp; T-cell)</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.ddg-pharmfac.net/antijen/AntiJen/antijenhomepage.htm">http://www.ddg-pharmfac.net/antijen/AntiJen/antijenhomepage.htm</ext-link>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B93">93</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>a</sup>Accessibility evaluated on 12-06-2024.</p>
</fn>
<fn>
<p>
<sup>b</sup>This database was found to be inaccessible as of 12-06-2024. The choice to include it depended on two factors: links often migrate or get repaired and these databases have been used in the development of tools in the past. The link provided is the last known link used to access the database.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>In essence, a machine learning algorithm studies databases to find patterns that inform the analysis of novel data. The quality of the algorithm requires careful selection of the appropriate database. For example, EPSVR is a B-cell epitope prediction algorithm focused on conformational epitopes and the type of data needed for accurate conformational epitope prediction is structural (<xref ref-type="bibr" rid="B24">24</xref>). Their training set consisted of structures of curated antigen-antibody complexes and was tested on a sampling of structures from the Conformation Epitope Database (CED) (<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>). By limiting their datasets to a particular type of epitope, the algorithm becomes much more accurate at predicting that type of epitope. In the case of EPSVR, linear epitopes were excluded because of their poor correlation to structural data. If linear epitopes were included, they may affect how the algorithm evaluates structural data and decrease prediction accuracy on conformational epitopes. For LBTOPE, a model focused on linear epitope prediction, only the primary sequences of epitopes for B-cell receptors and non B-cell epitopes are needed (<xref ref-type="bibr" rid="B27">27</xref>). Since these data are more available and less complex, it is easier to generate a dataset of tens of thousands of entries. In another example, the BCIpep database contains many linear epitopes and was used to develop ABCpred to support peptide-based vaccine design and allergy research (<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B29">29</xref>). Training the dataset specifically on linear epitopes resulted in a model that can strongly predict these types of epitopes.</p>
<p>Finally, datasets are further divided into &#x2018;training sets&#x2019;, &#x2018;validation sets&#x2019;, and &#x2018;test sets&#x2019;. Machine learning models are built from training sets; a validation sets help determine the optimal parameters or models for a given problem; and test sets are used to evaluate the machine learning models. These sets are mutually exclusive.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Features and labels &#x2013; how data are encoded</title>
<p>Even when an algorithm is provided a database robust enough to characterize the different patterns present in both linear and conformational epitopes, it still requires the tools to make use of those data. These databases are assembled into datasets with each datapoint referred to by the term &#x2018;example&#x2019;. Within each example there is information relating to &#x2018;labels&#x2019; and &#x2018;features&#x2019;. Labels and features are the tools an algorithm uses to analyze examples.</p>
<p>Labels are what the algorithm learns to predict (the outcome). For instance, the label may refer to the strength of peptide-antibody binding. Machine learning can predict labels based on classification or regression analysis (<xref ref-type="bibr" rid="B30">30</xref>); for B-cell epitope prediction, classification involves classifying a peptide as a B-cell epitope or not, while regression analysis would assign a continuous value to the likelihood of the binding.</p>
<p>Features contain descriptive information about the peptide (sequence, structure, physico-chemical properties, etc.) or the parent protein it is derived from. The learning algorithm effectively learns patterns in the features that relate to a specific label value. More specifically, several algorithms use surface accessibility as the feature to predict whether an epitope is likely to be recognized by B-cells as this feature correlates well with binding strength (the label) (<xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B32">32</xref>). Developing accurate machine learning models begins with selecting features that correlate strongly to a label (<xref ref-type="bibr" rid="B30">30</xref>). The features that are important depend entirely on the nature of the question being asked and the success of an algorithm depends on the design of features (<xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>). A good feature set uses all data that correlate well with the label (what is being predicted). However, there are many ways to encode the same feature that are more effective for specific machine learning algorithms (<xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Visualization of protein analysis for feature design. The protein structure of leporine serum albumin was visualized in &#x2018;new cartoon&#x2019; with VMD (<xref ref-type="bibr" rid="B33">33</xref>). Coil and loop regions of the secondary structure are colored yellow and helical regions are colored purple. Certain features are derived from the analysis of the whole protein, smaller subsections, and sequence-based analysis. Features can broadly be categorized as structural, physicochemical, or sequence-based. The features provided below each of the categories are select examples that belong to each category. The secondary structure used as referenced PDB ID 4F5V (<xref ref-type="bibr" rid="B34">34</xref>).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-15-1463931-g001.tif"/>
</fig>
<table-wrap id="T3" position="float">
<label>Table&#xa0;3</label>
<caption>
<p>Features employed by machine learning-based B-cell epitope prediction tools.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Machine learning tool</th>
<th valign="top" align="left">Feature types</th>
<th valign="top" align="left">Year</th>
<th valign="top" align="left">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">ABCpred</td>
<td valign="top" align="left">Amino acid composition and sequence</td>
<td valign="top" align="left">2006</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B29">29</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">COBEPRO</td>
<td valign="top" align="left">Similarity to other epitopes</td>
<td valign="top" align="left">2009</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B94">94</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">EPITOPIA</td>
<td valign="top" align="left">Amino acid preference, secondary structure preference, surface accessibility, surface structure, evolution rate, polarity scale, flexibility scale, antigenicity scale, hydrophilicity scale</td>
<td valign="top" align="left">2009</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B37">37</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">CBTOPE</td>
<td valign="top" align="left">Amino acid composition, polarity, flexibility, antigenicity, hydrophobicity, sequence, similarity to other epitopes</td>
<td valign="top" align="left">2010</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B36">36</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">LBtope</td>
<td valign="top" align="left">Amino acid composition, sequence, similarity to other epitopes, variable epitope length control</td>
<td valign="top" align="left">2013</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B27">27</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">SEPPA 1.0 (*)<break/>SEPPA 2.0 (**)<break/>SEPPA 3.0 (***)</td>
<td valign="top" align="left">Amino acid propensity*, sequence combined with structure*, solvent accessible surface areas*, antigenicity combined with structure **, glycosylation combined with structure***</td>
<td valign="top" align="left">2019</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B95">95</xref>&#x2013;<xref ref-type="bibr" rid="B97">97</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">EPCES/EPSVR</td>
<td valign="top" align="left">Amino acid, side-chain energy score, surface exposure, antigenicity combined with surface structure, and secondary structure</td>
<td valign="top" align="left">2020</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B24">24</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">SCANNET</td>
<td valign="top" align="left">Amino acid composition, secondary structure, accessible surface area, coordination number (van der Waals interaction), 2D solvent exposure, backbone and sidechain depth (distance from surface), surface convexity index, amino acid conservation</td>
<td valign="top" align="left">2022</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B32">32</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>Different versions of the SEPPA program have new features added to them, features with '*' were first included in SEPPA 1.0, while features with '**' and '***' were first included in SEPPA 2.0 and SEPPA3.0 respectively.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>For example, algorithms EPITOPIA and CBTOPE both use Grantham polarity and Ponnuswami polarity index to calculate polarity as a feature, while LBtope cites CBTOPE&#x2019;s feature set but excludes polarity entirely (<xref ref-type="bibr" rid="B35">35</xref>&#x2013;<xref ref-type="bibr" rid="B37">37</xref>). This may be because LBtope is trying to predict B-cell epitopes of variable lengths and applies a feature set called Composition-Transition-Distribution (CTD) which allows for the comparison of peptides of variable length by simplifying the residues into categories (<xref ref-type="bibr" rid="B38">38</xref>). The CTD divisions used by LBtope is based on a set proposed by Chinnasamy et&#xa0;al. that places each amino acid into groups (either 1, 2, or 3) based on certain physicochemical properties: hydrophobicity, polarizability, polarity, and Van der Waal&#x2019;s volume (<xref ref-type="bibr" rid="B39">39</xref>). CTD characterizes the percent frequency of these groups (Composition), their spatial relationship to one another (Transition), and distribution of each group across the peptide (Distribution). The CTD feature set exemplifies an alternate way to encode certain physical features to allow for expanded function of an algorithm by enabling predictions on variable length peptides.</p>
<p>However, the difference in encoding methodologies between features can be much simpler than CTD; hydrophilicity can be calculated using a hydropathy table or a hydrophobicity index for proximal amino acids (<xref ref-type="bibr" rid="B40">40</xref>, <xref ref-type="bibr" rid="B41">41</xref>). Hydropathy plots are frequently used to estimate hydrophobic and hydrophilic properties over a 20 amino acid window and, as such, may be an appropriate feature when an algorithm is focusing on predictions of peptides of a similar length (<xref ref-type="bibr" rid="B42">42</xref>, <xref ref-type="bibr" rid="B43">43</xref>). Further, there is some variability between experimentally determined hydropathy tables, and pH will affect the hydrophilicity calculations of some amino acids.</p>
<p>Features that describe protein structure may be better at predicting conformational epitopes but may be unimportant when the algorithm is predicting a linear epitope. For example, surface accessibility and flexibility are more impactful features for B-cell epitope prediction software specializing in conformational epitopes. Sequence-based features, on the other hand, may be more beneficial in predicting linear epitopes. Physicochemical features such as hydrophobicity and polarity would be beneficial to both types of predictions and may not benefit one type of prediction more than another. In general, adding more features that provide new information about the data will improve accuracy but with certain limitations. If every feature represents a decision that can be made by a machine learning algorithm, adding noisy features increases the likelihood that a machine learning model uses a feature that appears effective in the training set but is inaccurate once extrapolated to the test set or new cases. This is because every machine learning model assigns a relative weight to different features. When a poorly correlative feature is used to make a prediction, it will likely increase the error. Typically, this is not a major concern unless the dataset is too small; too many features relative to the size of the dataset may impact the predictive accuracy of a model. Further, information provided by databases may limit the features that will be used to predict the label.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Algorithms &#x2013; how data are processed</title>
<p>Another equally important aspect of a machine learning algorithm is selecting the algorithm itself. While algorithms can either be classifiers or regressors, there is much more variety to algorithms than just this single trait. We consider four common machine learning algorithms: support vector machine (SVM), neural networks, decision trees, and language models.</p>
<p>SVM or support vector machine is a classifier model that is accurate, simple, common, and elegant; it delineates a boundary in N-dimensional space in which data on one side of the boundary fall into one classification while data on the other side fall into a different classification. For example, if flexibility and net charge are the features for this model, every datapoint is plotted onto a grid using the flexibility score on one axis and net charge on the other. After training on the dataset, SVM algorithms will produce a line that bisects the data. If the algorithm is intended to predict B-cell epitopes, the datapoints would be classified as either &#x201c;B-cell epitopes&#x201d; or &#x201c;non B-cell epitopes&#x201d; based on this line. SVM is relatively simple computationally but is sensitive to noisy data and too many features relative to the dataset size (<xref ref-type="bibr" rid="B44">44</xref>, <xref ref-type="bibr" rid="B45">45</xref>).</p>
<p>Neural networks are another common machine learning algorithm. They process data in a way that mimics how the human brain functions. Neural networks consist of a series of layers: input layer, hidden layer(s), and output layer. The input layer consists of neurons equal to the number of features and the output layer is the prediction made by the algorithm. The hidden layer(s) is defined by the programmer and is characterized by a series of weight values. The sum of all inputs multiplied by the weight values will determine whether the &#x2018;neuron&#x2019; in the hidden layer will activate. Activated neurons will propagate data to the output layer in the same process; the hidden layer inputs are multiplied by weight values between the hidden layer and output layer. The activations in the output layer will determine the prediction of the algorithm. During training, a neural network adjusts weight values between layers to strengthen some connections and weaken others similar to how neural pathways in our brain can be reinforced or weakened. Parameters like the optimal size of the hidden layer and number of hidden layers are generally found by trail-and-error. The reason why it is difficult to predict the correct parameters is there is no way to interpret the analysis performed in these hidden layers, a black box. Two common neural networks in biological research are deep neural networks (neural networks with more than four layers) and convolutional neural networks (a neural network architecture that is effective at&#xa0;image processing). Convolutional neural networks can be &#x2018;deep&#x2019;.&#xa0;Also, neural networks rely on very large datasets and as such, trial-and-error optimization of hidden layers is very demanding computationally. While neural networks struggle with interpretability, they are highly accurate once optimized (<xref ref-type="bibr" rid="B46">46</xref>).</p>
<p>Unlike neural networks, decision trees are defined by their interpretability (<xref ref-type="bibr" rid="B47">47</xref>). Decision trees involve a series of nodes that each split into two &#x2018;child&#x2019; nodes continually until they reach their pre-defined depth limit forming a pyramid shape. At each node, the algorithm imposes a criterion that splits the data into one of the two subsequent nodes. For example, if a peptide is being evaluated by the algorithm, a node could be &#x201c;net charge &#x2265;3&#x201d; where all the datapoints that satisfy this criterion go to one of the child nodes and all the other datapoints go to the other child node. During training, the features at each node and the criterion are adjusted to sort the data accurately. Commonly, decision trees are used in random forests that consist of many decision trees where the output is a consensus of a majority of trees (classification), or the mean of tree outputs (regression).</p>
<p>Lastly, language models function by studying patterns in speech and language to predict what comes next probabilistically. These language models are also applied to analyze patterns in protein and genetic sequences to find patterns common to specific types of epitopes and produce prediction tools. Language models accurately represent the data on which they are trained but are very computationally demanding and require large datasets (<xref ref-type="bibr" rid="B48">48</xref>).</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Applications of machine learning to epitope mapping</title>
<sec id="s4_1">
<label>4.1</label>
<title>Machine learning integrated into prediction of T-cell and B-cell epitopes</title>
<p>Since the early 2000s, the integration of machine learning into epitope prediction has increased the accuracy of those predictions and, currently, most modern algorithms use machine learning methods (<xref ref-type="table" rid="T4">
<bold>Tables&#xa0;4</bold>
</xref>, <xref ref-type="table" rid="T5">
<bold>5</bold>
</xref>). These algorithms are provided with sequence and/or structural information about a protein, and they use machine learning to predict which epitopes will be recognized by receptors on B-cells or T-cells (<xref ref-type="bibr" rid="B49">49</xref>).</p>
<table-wrap id="T4" position="float">
<label>Table&#xa0;4</label>
<caption>
<p>B-cell epitope prediction tools<sup>a</sup>.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">Year</th>
<th valign="top" align="left">Name</th>
<th valign="middle" align="left">Prediction specialization</th>
<th valign="top" align="left">Link</th>
<th valign="middle" align="left">Machine learning</th>
<th valign="middle" align="left">Algorithm type</th>
<th valign="top" align="left">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<th valign="middle" colspan="7" align="left">Require structural data</th>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">Discotope 3.0</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/DiscoTope-3.0/">https://services.healthtech.dtu.dk/services/DiscoTope-3.0/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">XGBOOST</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B98">98</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">SEMA 2.0 3D</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://sema.airi.net/prediction_analysis">https://sema.airi.net/prediction_analysis</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">LM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B51">51</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2022</td>
<td valign="top" align="left">SCANNET</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://bioinfo3d.cs.tau.ac.il/ScanNet/download.html">http://bioinfo3d.cs.tau.ac.il/ScanNet/download.html</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Deep learning</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B32">32</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2022</td>
<td valign="top" align="left">Epitope3D</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://biosig.lab.uq.edu.au/epitope3d/prediction">https://biosig.lab.uq.edu.au/epitope3d/prediction</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Adaboost Classifier</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B99">99</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2020</td>
<td valign="top" align="left">EPCES</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://sysbio.unl.edu/EPCES/">http://sysbio.unl.edu/EPCES/</ext-link>
</td>
<td valign="top" align="left">No</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B24">24</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2019</td>
<td valign="top" align="left">SEPPA3.0</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.badd-cao.net/seppa3/submission.html">http://www.badd-cao.net/seppa3/submission.html</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Logistic Regression</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B97">97</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2014</td>
<td valign="top" align="left">Epipred</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://opig.stats.ox.ac.uk/webapps/sabdab-sabpred/sabpred/more#EpiPred">https://opig.stats.ox.ac.uk/webapps/sabdab-sabpred/sabpred/more#EpiPred</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">bespoke</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B100">100</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2010</td>
<td valign="top" align="left">EPSVR</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://sysbio.unl.edu/EPSVR/">http://sysbio.unl.edu/EPSVR/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Support Vector Regression</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B24">24</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2008</td>
<td valign="top" align="left">Ellipro</td>
<td valign="top" align="left">Conformational</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://tools.iedb.org/ellipro/">http://tools.iedb.org/ellipro/</ext-link>
</td>
<td valign="top" align="left">No</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B101">101</xref>)</td>
</tr>
<tr>
<th valign="middle" colspan="7" align="left">Require sequence data</th>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">LBCE-BERT</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/Lfang111/LBCE-BERT">https://github.com/Lfang111/LBCE-BERT</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Language model</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B102">102</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">CLBtope</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/clbtope/">https://webs.iiitd.edu.in/raghava/clbtope/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Random Forest</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B103">103</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">SEMA 2.0 1D</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://sema.airi.net/prediction_analysis">https://sema.airi.net/prediction_analysis</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Language model</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B51">51</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2023</td>
<td valign="top" align="left">Epitope1D</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://biosig.lab.uq.edu.au/epitope1d/prediction">https://biosig.lab.uq.edu.au/epitope1d/prediction</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Explainable Boost Model</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B104">104</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2022</td>
<td valign="top" align="left">BepiPred 3.0</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/service.php?BepiPred-3.0">https://services.healthtech.dtu.dk/service.php?BepiPred-3.0</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Language model</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B52">52</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2021</td>
<td valign="top" align="left">BCEPS</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://imath.med.ucm.es/bceps/">http://imath.med.ucm.es/bceps/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B105">105</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2021</td>
<td valign="top" align="left">EpiDope</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/flomock/EpiDope">https://github.com/flomock/EpiDope</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">DNN</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B106">106</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2020</td>
<td valign="top" align="left">DLBepitope</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://bio.tools/dlbepitope">https://bio.tools/dlbepitope</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">DNN</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B107">107</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2020</td>
<td valign="top" align="left">AAPpred</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://www.bioinf.ru/aappred/predict">https://www.bioinf.ru/aappred/predict</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B108">108</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2013</td>
<td valign="top" align="left">Lbtope</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/lbtope/protein.php">https://webs.iiitd.edu.in/raghava/lbtope/protein.php</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B27">27</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2012</td>
<td valign="top" align="left">SVMTriP</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://sysbio.unl.edu/SVMTriP/prediction.php">http://sysbio.unl.edu/SVMTriP/prediction.php</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B109">109</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2010</td>
<td valign="top" align="left">CBTOPE</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/cbtope/submit.php">https://webs.iiitd.edu.in/raghava/cbtope/submit.php</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B36">36</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2009</td>
<td valign="top" align="left">COBEPRO</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://scratch.proteomics.ics.uci.edu/">https://scratch.proteomics.ics.uci.edu/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B94">94</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2008</td>
<td valign="top" align="left">BCPREDS</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://ailab-projects2.ist.psu.edu/bcpred/predict.html">http://ailab-projects2.ist.psu.edu/bcpred/predict.html</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B110">110</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2006</td>
<td valign="top" align="left">ABCpred</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/abcpred/ABC_submission.html">https://webs.iiitd.edu.in/raghava/abcpred/ABC_submission.html</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Neural Network</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B29">29</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2004</td>
<td valign="top" align="left">BCEpred</td>
<td valign="top" align="left">Linear</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://webs.iiitd.edu.in/raghava/bcepred/bcepred_submission.html">https://webs.iiitd.edu.in/raghava/bcepred/bcepred_submission.html</ext-link>
</td>
<td valign="top" align="left">No</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B111">111</xref>)</td>
</tr>
<tr>
<th valign="middle" colspan="7" align="left">Can use either sequence or structural data</th>
</tr>
<tr>
<td valign="top" align="left">2024</td>
<td valign="top" align="left">CALIBER</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="https://caliber.math.biu.ac.il/">https://caliber.math.biu.ac.il/</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Multiple <sup>c</sup>
</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B112">112</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">2009</td>
<td valign="top" align="left">EPITOPIA<sup>b</sup>
</td>
<td valign="top" align="left">N/A</td>
<td valign="top" align="left">
<ext-link ext-link-type="uri" xlink:href="http://epitopia.tau.ac.il">http://epitopia.tau.ac.il</ext-link>
</td>
<td valign="top" align="left">Yes</td>
<td valign="top" align="left">Na&#xef;ve Bayes classifier</td>
<td valign="top" align="left">(<xref ref-type="bibr" rid="B37">37</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>a</sup>Accessibility evaluated on 12-06-2024.</p>
</fn>
<fn>
<p>
<sup>b</sup>Epitopia is inaccessible but included because its design is discussed in this review.</p>
</fn>
<fn>
<p>
<sup>c</sup>Users of CALIBER must select a model of Recurrent Neural Network, Graph Convolution Network, or Boosting (a combination of both).</p>
</fn>
<fn>
<p>N/A, Not applicable.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T5" position="float">
<label>Table&#xa0;5</label>
<caption>
<p>T-cell epitope prediction tools<sup>a</sup>.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="left">Year</th>
<th valign="middle" align="left">Name</th>
<th valign="middle" align="left">Link</th>
<th valign="middle" align="left">Machine learning</th>
<th valign="middle" align="left">Algorithm type</th>
<th valign="middle" align="left">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<th valign="middle" colspan="6" align="left">MHC I predictor</th>
</tr>
<tr>
<td valign="middle" align="left">2024</td>
<td valign="middle" align="left">TEPCAM</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/Chenjw99/TEPCAM">https://github.com/Chenjw99/TEPCAM</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Deep Learning</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B113">113</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2023</td>
<td valign="middle" align="left">PANPEP</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/bm2-lab/PanPep">https://github.com/bm2-lab/PanPep</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B114">114</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2023</td>
<td valign="middle" align="left">TEINET</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/jiangdada1221/TEINet">https://github.com/jiangdada1221/TEINet</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B115">115</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2022</td>
<td valign="middle" align="left">ATM-TCR</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/Lee-CBG/ATM-TCR">https://github.com/Lee-CBG/ATM-TCR</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Multi-head Self-attention model</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B116">116</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2021</td>
<td valign="middle" align="left">TITAN</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/PaccMann/TITAN">https://github.com/PaccMann/TITAN</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Nearest Neighbour</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B117">117</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2020</td>
<td valign="middle" align="left">NETMHCpan 4.1</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/NetMHCpan-4.1/">https://services.healthtech.dtu.dk/services/NetMHCpan-4.1/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network <sup>b</sup>
</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B118">118</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2020</td>
<td valign="middle" align="left">MHCflurry 2.0</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/openvax/mhcflurry">https://github.com/openvax/mhcflurry</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Logistic Regression</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B119">119</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2019</td>
<td valign="middle" align="left">ACME</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/HYsxe/ACME">https://github.com/HYsxe/ACME</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">DNN</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B120">120</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2013</td>
<td valign="middle" align="left">EPISOPT</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://bio.med.ucm.es/episopt.htmL">http://bio.med.ucm.es/episopt.htmL</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B121">121</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2009</td>
<td valign="middle" align="left">PickPocket</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/PickPocket-1.1/">https://services.healthtech.dtu.dk/services/PickPocket-1.1/</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B122">122</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2007</td>
<td valign="middle" align="left">NetCTL</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/NetCTL-1.2/">https://services.healthtech.dtu.dk/services/NetCTL-1.2/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">ANN</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B123">123</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2006</td>
<td valign="middle" align="left">EpiJen</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.ddg-pharmfac.net/epijen/EpiJen/EpiJen.htm">http://www.ddg-pharmfac.net/epijen/EpiJen/EpiJen.htm</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B124">124</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2005</td>
<td valign="middle" align="left">PEPVAC</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://imed.med.ucm.es/PEPVAC/">http://imed.med.ucm.es/PEPVAC/</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B125">125</xref>)</td>
</tr>
<tr>
<th valign="middle" colspan="6" align="left">MHC II predictor</th>
</tr>
<tr>
<td valign="middle" align="left">2023</td>
<td valign="middle" align="left">MixMHX2pred</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/GfellerLab/MixMHC2pred">https://github.com/GfellerLab/MixMHC2pred</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Networks</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B126">126</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2020</td>
<td valign="middle" align="left">NETMHCIIpan4.0</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/NetMHCIIpan-4.0/">https://services.healthtech.dtu.dk/services/NetMHCIIpan-4.0/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network <sup>b</sup>
</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B118">118</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2019</td>
<td valign="middle" align="left">MARIA</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://maria.stanford.edu/about.php">https://maria.stanford.edu/about.php</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B127">127</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2019</td>
<td valign="middle" align="left">NeonMHC2</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://neonmhc2.org/neonmhc2/neonmhc2_main/">https://neonmhc2.org/neonmhc2/neonmhc2_main/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Neural Network</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B128">128</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2017</td>
<td valign="middle" align="left">NNAlign 2.0</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/services/NNAlign-2.0/">https://services.healthtech.dtu.dk/services/NNAlign-2.0/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">ANN</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B129">129</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2013</td>
<td valign="middle" align="left">EpiDOCK</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.ddg-pharmfac.net/epidock/index.html">http://www.ddg-pharmfac.net/epidock/index.html</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B130">130</xref>)</td>
</tr>
<tr>
<th valign="middle" colspan="6" align="left">MHC I &amp; MHC II predictor</th>
</tr>
<tr>
<td valign="middle" align="left">2024</td>
<td valign="middle" align="left">TCRen</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/antigenomics/tcren-ms">https://github.com/antigenomics/tcren-ms</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B131">131</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2021</td>
<td valign="middle" align="left">ERGO-II</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://github.com/IdoSpringer/ERGO-II">https://github.com/IdoSpringer/ERGO-II</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Language Model</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B132">132</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2021</td>
<td valign="middle" align="left">Vaxign2</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="https://violinet.org/vaxign/">https://violinet.org/vaxign/</ext-link>
</td>
<td valign="middle" align="left">Yes</td>
<td valign="middle" align="left">Custom</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B133">133</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">2004</td>
<td valign="middle" align="left">Rankpep</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://imed.med.ucm.es/Tools/rankpep.html">http://imed.med.ucm.es/Tools/rankpep.html</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B134">134</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">1999</td>
<td valign="middle" align="left">SYFPEITHI</td>
<td valign="middle" align="left">
<ext-link ext-link-type="uri" xlink:href="http://www.syfpeithi.de/bin/MHCServer.dll/EpitopePrediction.htm">http://www.syfpeithi.de/bin/MHCServer.dll/EpitopePrediction.htm</ext-link>
</td>
<td valign="middle" align="left">No</td>
<td valign="middle" align="left">N/A</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B89">89</xref>)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>
<sup>a</sup>Accessibility evaluated on 12-06-2024.</p>
</fn>
<fn>
<p>N/A, Not applicable.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Non-machine learning methods use scoring methods to predict epitopes. In the case of EpiJen, a 4-step process produces a score and eliminates datapoints in a stepwise manner using quantitative matrices. It first determines whether proteasome cleavage would occur, then whether TAP binding would occur, then whether MHC binding would occur, and finally whether T-cell recognition would occur. The output would be a small subsection of the data.</p>
<p>The early B-cell epitope prediction tools focused primarily on the prediction of linear epitopes despite linear epitopes making up only a small portion of B-cell epitopes (<xref ref-type="bibr" rid="B10">10</xref>). Rapidly, new tools were developed that boasted higher accuracy on a benchmark dataset or pioneered new methods of analysis; for example, ABCpred was the first B-cell prediction server based on recurrent neural networks (<xref ref-type="bibr" rid="B29">29</xref>). Commonly, tools specialized for linear prediction take sequence files as their input and conformational predictions require structural inputs, but more recently, tools focus on predicting both linear and conformational epitopes. In 2017, BepiPred 2.0 became a benchmark algorithm for the comparison of newer ones based on its accuracy (for 2017 standards) and capability to predict both conformational and linear epitopes (<xref ref-type="bibr" rid="B50">50</xref>). Other tools were released since that predict both types of epitopes, including BepiPred 3.0, SEMA-1D, and SEMA-3D (<xref ref-type="bibr" rid="B51">51</xref>, <xref ref-type="bibr" rid="B52">52</xref>). Ideally, the best tool is one that is specialized to address a given problem and trained on a dataset containing relevant cases.</p>
<p>The value in predicting T-cell epitopes lies in the characterization of potential immune responses prior to treatment to avoid side effects against self-peptides (<xref ref-type="bibr" rid="B53">53</xref>). In T-cell epitope prediction, we encounter the dichotomy of algorithms focused primarily on either MHC class I or MHC class II binding prediction (<xref ref-type="table" rid="T5">
<bold>Table&#xa0;5</bold>
</xref>). The open binding groove of MHC class II molecules makes predictions much more complex compared to MHC class I. Specifically, this means that the ligand does not fit cleanly inside the binding groove; there is a structural element introduced and MHC class II predictive algorithms have lower accuracy compared to MHC class I algorithms. There exist several accessible online machine learning tools for predicting T-cell epitopes and/or MHC class I and class II peptide binding (<xref ref-type="table" rid="T5">
<bold>Table&#xa0;5</bold>
</xref>). The review by Peters et&#xa0;al. (<xref ref-type="bibr" rid="B54">54</xref>) explores T-cell epitope prediction and algorithm design more thoroughly.</p>
<p>Since features, datasets, and algorithm selection all define the specificity of a machine learning model and the type of data it is predicting, it is difficult to meaningfully compare different T-cell and B-cell epitope prediction tools. These tools will often compare themselves to similar contemporary tools based on performance to benchmark datasets, but there are issues with generalizing the accuracy of these tools. The first issue is that the test sets used to compare algorithms to one another are not the same and often tailored to the specific comparison. For example, comparing linear B-cell epitope prediction software to conformational software would be affected by whether the benchmark dataset contained primarily linear or conformational data. If the benchmark set is small, neural networks may perform worse and if it is noisy then SVM algorithms will be disadvantaged. While this makes it difficult to easily select a single &#x2018;best&#x2019; tool, it is important to appreciate that certain tools are highly specialized to address particular problems, and the breadth of tools available should be considered in selecting which are most appropriate.</p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Machine learning integrated into experimental epitope mapping methods</title>
<p>A significant advance in epitope mapping is the integration of machine learning to improve the analysis of peptide arrays (<xref ref-type="table" rid="T6">
<bold>Table&#xa0;6</bold>
</xref>). For example, Xue et&#xa0;al. demonstrated the utility of machine learning algorithms to address a common issue for peptide arrays: certain peptides result in high noise relative to signal in the output data (low signal-to-noise ratio). This noise ratio results in difficulties for interpreting array data and affects its utility in defining the boundaries of an epitope. Often, array data is simplified to streamline interpretation; a threshold-based approach is used to convert data into a binary compatible with classification-type algorithms (<xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B55">55</xref>&#x2013;<xref ref-type="bibr" rid="B57">57</xref>). The data are usually sorted into either binding or not binding. However, machine learning can predict which peptides will result in a low signal to noise ratio. By training an algorithm on a small subset of the intended peptides, Xue et&#xa0;al. demonstrated that their machine learning program succeeded in accurately predicting which peptides in the larger peptide set would result in a low signal-to-noise ratio (<xref ref-type="bibr" rid="B58">58</xref>). There are several possible ways to consider predictions of which peptides will result in significant noise: &#x2018;noisy&#x2019; peptides could be eliminated from training sets for future algorithm development, excluded from testing all together, or the predictions could be used in interpreting results.</p>
<table-wrap id="T6" position="float">
<label>Table&#xa0;6</label>
<caption>
<p>Machine learning applied to <italic>in vitro</italic> epitope mapping methods.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="center">Method</th>
<th valign="middle" align="center">Brief summary</th>
<th valign="middle" align="center">Ref.</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" rowspan="3" align="left">Peptide array</td>
<td valign="middle" align="left">
<italic>De novo</italic> binding prediction</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B135">135</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Removal of systematic effects, unreliable measurements, and non-specific secondary antibody responses</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B136">136</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">
<italic>De novo</italic> prediction of signal-to-noise ratios</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B58">58</xref>)</td>
</tr>
<tr>
<td valign="middle" rowspan="9" align="left">Cryo-EM</td>
<td valign="middle" align="left">Automated particle selection</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B137">137</xref>, <xref ref-type="bibr" rid="B138">138</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Denoising of images</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B139">139</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">False positive pruning in single particle analysis</td>
<td valign="bottom" align="left">(<xref ref-type="bibr" rid="B140">140</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Data pre-processing</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B141">141</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Ice thickness determination</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B142">142</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Resolution estimation</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B143">143</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Atom structure determination</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B144">144</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">3D model building of protein backbone</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B145">145</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Resolution determination improved by Alphafold 2</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B146">146</xref>)</td>
</tr>
<tr>
<td valign="middle" rowspan="5" align="left">X-ray crystallography</td>
<td valign="middle" align="left">Prediction of crystallizability of a protein</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B147">147</xref>, <xref ref-type="bibr" rid="B148">148</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Crystallization outcome classification</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B149">149</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Model correctness</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B150">150</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Protein solubility prediction</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B151">151</xref>)</td>
</tr>
<tr>
<td valign="middle" align="left">Data reduction</td>
<td valign="middle" align="left">(<xref ref-type="bibr" rid="B152">152</xref>)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Machine learning is also used in docking software to predict the discrete interactions between a monoclonal antibody and a protein. Non-machine learning algorithms like Zdock and Rosetta are relatively effective and accurate antibody docking software programs, but they specialize in &#x201c;local docking&#x201d; which necessitates partial epitope knowledge, like relative location of the epitope within approximately 8 &#xc5; (<xref ref-type="bibr" rid="B59">59</xref>, <xref ref-type="bibr" rid="B60">60</xref>). Machine learning allows programs like Mabtope to analyze millions of docking poses and identify those that are optimal. When compared to other methods like FRODOCK (another non-machine learning method), Mabtope reports are more accurate at predicting whether a specific residue participates in an epitope (80% accuracy compared to 35% accuracy) (<xref ref-type="bibr" rid="B61">61</xref>, <xref ref-type="bibr" rid="B62">62</xref>). The impacts of machine learning on protein-ligand docking are discussed in a review by Yang et&#xa0;al. (<xref ref-type="bibr" rid="B63">63</xref>). Docking-based approaches provide the ability to explore specific antibody-protein interactions <italic>in silico</italic>.</p>
<p>In X-ray crystallography, machine learning can address the difficulty with performing this method on large proteins. When analyzing large proteins, a pre-experimental step can use B-cell mapping algorithms to focus on specific regions of a protein; testing a smaller discrete region of a protein would help address the size issue and improve the throughput. Machine learning can also predict protein crystallizability which saves time and resources (<xref ref-type="bibr" rid="B64">64</xref>, <xref ref-type="bibr" rid="B65">65</xref>). Moreover, the study of highly variable regions of a protein is facilitated by algorithms that can predict &#x2018;correctness&#x2019; scores (a per-residue estimate of how reliable the crystal structure is) for residue main chains and side chains to improve interpretability of crystallographic protein structures (<xref ref-type="table" rid="T6">
<bold>Table&#xa0;6</bold>
</xref>).</p>
<p>Cryo-EM depends on machine learning integration where a cryogenically frozen molecule is subjected to electron microscopy to produce and assemble a 3D reconstruction. Machine learning has automated particle selection, improved post-processing resolution, and map reconstruction (<xref ref-type="table" rid="T6">
<bold>Table&#xa0;6</bold>
</xref>).</p>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Case studies</title>
<p>Several studies of SARS-CoV-2 highlight applications of machine learning to probe the specificity of immune responses to the virus. As an example, Hotop et&#xa0;al. applied machine learning to analyze data from peptide microarrays screened with sera from patients with disease outcomes of varying severity (<xref ref-type="bibr" rid="B55">55</xref>). The analysis revealed important differences in the antibody responses to SARS-CoV-2 that correlated with improved clearance of the infection. Unlike conventional B-cell epitope mapping algorithms that use regression analysis (providing a continuous variable as an output), this application applied a random forest in classification analysis providing an output as a &#x2018;1&#x2019; or a &#x2018;0&#x2019;. More specifically, the machine learning algorithms labeled each peptide as either correlated (&#x2018;1&#x2019;) or non-correlated (&#x2018;0&#x2019;) with better disease outcomes. Five different algorithms were used to select 10 peptides (out of 648) that were recognized predominantly by sera from patients with milder infections. This information can now be applied to vaccine design to focus on neutralizing epitopes. It could also be used as a prognostic to predict the severity of disease outcomes.</p>
<p>A second example is the application of the algorithm ScanNet (<xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>) (<xref ref-type="bibr" rid="B32">32</xref>) to study the immune response against the receptor binding site of the SARS-CoV-2 Omicron variant (<xref ref-type="bibr" rid="B66">66</xref>). ScanNet is a geometric model that characterizes the binding surface of a protein (or protein segment in this case) based on its physical and chemical properties: shape, charge, depth, surface accessibility, etc. The algorithm analyzed antigenicity (the likelihood of antibody recognition) residue by residue of the SARS-CoV-2 receptor binding domain (RBD) from the original strain compared to the same region on each of the Delta, Alpha, Beta and Omicron variants. This analysis suggested that mutations in the RBD reduced antibody recognition, which could explain why immunity to previous variants of SARS-CoV-2 may not be as protective against the Omicron strain. These results were corroborated by competition ELISAs that showed reduced serological recognition of the Omicron RBD and supported the need for an Omicron-specific vaccine. ScanNet&#x2019;s antigenicity predictions for the variants further supported the need to design Omicron-specific vaccines.</p>
</sec>
<sec id="s6">
<label>6</label>
<title>Future applications of machine learning to epitope mapping and immunology</title>
<p>Although B-cell epitope prediction algorithms are a common tool, there is room for tremendous growth in the application of machine learning to further refine epitope predictions. This is especially relevant in the interpretation of antibody-binding data that point to multiple epitopes; rather than recognizing a single discrete epitope, some antibodies are promiscuous or polyreactive (<xref ref-type="bibr" rid="B67">67</xref>, <xref ref-type="bibr" rid="B68">68</xref>). Applying feature analysis approaches to these types of antibody binding data could identify features that are shared across epitopes within the same or related protein. An exciting opportunity is to use machine learning to extract -with high resolution - the specific features of these epitopes that are required for an antibody to bind. This information can then inform&#xa0;diagnostics, immunotherapeutics or vaccine design. In the example of SARS-CoV-2, peptide vaccines could be designed based on the features of an epitope that are conserved across virus variants to elicit broadly neutralizing antibodies.</p>
<p>Another application of machine learning is to improve prediction of conformational epitopes. High throughput mapping of linear peptide arrays could be integrated with <italic>in silico</italic> protein modeling, like AlphaFold, to increase the interpretability of the data (<xref ref-type="bibr" rid="B69">69</xref>). Recently, the release of AlphaFold 3 further improved the accuracy of predicting biomolecular complexes and overhauled its training procedure (<xref ref-type="bibr" rid="B70">70</xref>). Unfortunately, AlphaFold 3 is closed-source, which may limit the potential for specialized derivative models to be developed. For example, AlphaFold-Multimer was derived from the open-source code of Alphafold 2 and improved on&#xa0;its ability to predict large protein complexes (<xref ref-type="bibr" rid="B71">71</xref>). As discussed previously, peptide arrays focus on linear epitopes and are considered less informative with regards to conformational epitopes. However, mapping binding regions from an array to a predicted structure may highlight binding clusters that indicate an epitope. A more granular approach to the same methods could be to use the large peptide array datasets and train a machine learning algorithm from which feature data can be extracted. The feature data would encapsulate physical and chemical characteristics associated with binding regions (i.e. charge, secondary structure, etc.) which could also be applied to the 3D structures to identify binding sites. The impact this would have on vaccine development and therapeutics is profound. Success of these algorithms may improve peptide-based vaccines through scaffolded or stapled peptides that can recapitulate 3D structural components. It would also provide more accurate characterization of off-target binding.</p>
<p>A paratope-based approach may also have potential to predict epitopes and play an important role in profiling the safety and specificity of monoclonal antibodies now widely used as therapeutics. Sequences of antigen binding sites and/or structural information could be used to predict the features of a compatible epitope. In this case, the features would be structural, physicochemical, and sequence-based descriptions of the binding region of an antibody and the label(s) would correspond to aspects of an epitope, such as structure, charge, polarity, etc. Just like B-cell epitope prediction software, there are promising predictive models for paratope regions on an antibody (<xref ref-type="bibr" rid="B72">72</xref>&#x2013;<xref ref-type="bibr" rid="B74">74</xref>). Additionally, programs like AlphaFold, or faster and higher-throughput derivatives, can be applied to design and inform structural features (<xref ref-type="bibr" rid="B69">69</xref>, <xref ref-type="bibr" rid="B75">75</xref>, <xref ref-type="bibr" rid="B76">76</xref>). The burgeoning field of paratope-focused analysis could yield a better understanding of off-target binding, and potentially increase accuracy and safety. Focusing on both the paratope and epitope may result in more predictable outcomes for vaccine design and other therapeutics.</p>
<p>Importantly, machine learning has been integrated into research beyond epitope mapping, with impacts in the broader field of immunology. For example, the tool AllerCatPro 2.0 uses machine learning to predict allergenicity of proteins by predicting potential immune epitopes (<xref ref-type="bibr" rid="B77">77</xref>) and, clinically, machine learning has improved allergic disease diagnostics (<xref ref-type="bibr" rid="B78">78</xref>). To improve resource management and streamline clinical processes, machine learning-enhanced prognostic tools are being rapidly developed to analyze patient data. Specifically, plasma from SARS-CoV-2 infected patients was used to produce a cytokine panel that can accurately predict disease severity (<xref ref-type="bibr" rid="B79">79</xref>). Conditions that are difficult to diagnose, such as auto immune diseases, have benefited significantly from the incorporation of machine learning models to analyze patient data (<xref ref-type="bibr" rid="B80">80</xref>). In future, machine learning directed to these clinical applications has the potential to transform personalized medicine.</p>
</sec>
</body>
<back>
<sec id="s7" sec-type="author-contributions">
<title>Author contributions</title>
<p>SG: Writing &#x2013; original draft, Writing &#x2013; review &amp; editing. NH: Conceptualization, Writing &#x2013; review &amp; editing. SY: Conceptualization, Writing &#x2013; review &amp; editing, Funding acquisition, Project administration, Supervision.</p>
</sec>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by a Project Grant (RN399320) from the Canadian Institutes of Health Research, and the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number R01AI150944.</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>The authors thank D. Vinals, U. Iyamu, and M. Good for comments and suggestions.</p>
</ack>
<sec id="s9" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
<p>The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.</p>
</sec>
<sec id="s10" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11" sec-type="disclaimer">
<title>Author disclaimer</title>
<p>The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vasireddy</surname> <given-names>D</given-names>
</name>
<name>
<surname>Vanaparthy</surname> <given-names>R</given-names>
</name>
<name>
<surname>Mohan</surname> <given-names>G</given-names>
</name>
<name>
<surname>Malayala</surname> <given-names>SV</given-names>
</name>
<name>
<surname>Atluri</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>Review of COVID-19 variants and COVID-19 vaccine efficacy: what the clinician should know</article-title>? <source>J Clin Med Res</source>. (<year>2021</year>) <volume>13</volume>:<page-range>317&#x2013;25</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.14740/jocmr4518</pub-id>
</citation>
</ref>
<ref id="B2">
<label>2</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xing</surname> <given-names>K</given-names>
</name>
<name>
<surname>Tu</surname> <given-names>XY</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>M</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>ZW</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>JN</given-names>
</name>
<name>
<surname>Li</surname> <given-names>JJ</given-names>
</name>
<etal/>
</person-group>. <article-title>Efficacy and safety of COVID-19 vaccines: a systematic review</article-title>. <source>Zhongguo Dang Dai Er Ke Za Zhi</source>. (<year>2021</year>) <volume>23</volume>:<page-range>221&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.7499/j.issn.1008-8830.2101133</pub-id>
</citation>
</ref>
<ref id="B3">
<label>3</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pollard</surname> <given-names>AJ</given-names>
</name>
<name>
<surname>Bijker</surname> <given-names>EM</given-names>
</name>
</person-group>. <article-title>A guide to vaccinology: from basic principles to new developments</article-title>. <source>Nat Rev Immunol</source>. (<year>2021</year>) <volume>21</volume>:<fpage>83</fpage>&#x2013;<lpage>100</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41577-020-00479-7</pub-id>
</citation>
</ref>
<ref id="B4">
<label>4</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghotloo</surname> <given-names>S</given-names>
</name>
<name>
<surname>Maghsood</surname> <given-names>F</given-names>
</name>
<name>
<surname>Golsaz-Shirazi</surname> <given-names>F</given-names>
</name>
<name>
<surname>Amiri</surname> <given-names>MM</given-names>
</name>
<name>
<surname>Moog</surname> <given-names>C</given-names>
</name>
<name>
<surname>Shokri</surname> <given-names>F</given-names>
</name>
</person-group>. <article-title>Epitope mapping of neutralising anti-SARS-CoV-2 monoclonal antibodies: Implications for immunotherapy and vaccine design</article-title>. <source>Rev Med Virol</source>. (<year>2022</year>) <volume>32</volume>:<elocation-id>e2347</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/rmv.v32.5</pub-id>
</citation>
</ref>
<ref id="B5">
<label>5</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hudson</surname> <given-names>D</given-names>
</name>
<name>
<surname>Fernandes</surname> <given-names>RA</given-names>
</name>
<name>
<surname>Basham</surname> <given-names>M</given-names>
</name>
<name>
<surname>Ogg</surname> <given-names>G</given-names>
</name>
<name>
<surname>Koohy</surname> <given-names>H</given-names>
</name>
</person-group>. <article-title>Can we predict T cell specificity with digital biology and machine learning</article-title>? <source>Nat Rev Immunol</source>. (<year>2023</year>) <volume>23</volume>:<page-range>511&#x2013;21</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41577-023-00835-3</pub-id>
</citation>
</ref>
<ref id="B6">
<label>6</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Becker</surname> <given-names>JP</given-names>
</name>
<name>
<surname>Riemer</surname> <given-names>AB</given-names>
</name>
</person-group>. <article-title>The importance of being presented: target validation by immunopeptidomics for epitope-specific immunotherapies</article-title>. <source>Front Immunol</source>. (<year>2022</year>) <volume>13</volume>:<elocation-id>883989</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2022.883989</pub-id>
</citation>
</ref>
<ref id="B7">
<label>7</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sidney</surname> <given-names>J</given-names>
</name>
<name>
<surname>Peters</surname> <given-names>B</given-names>
</name>
<name>
<surname>Sette</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Epitope prediction and identification- adaptive T cell responses in humans</article-title>. <source>Semin Immunol</source>. (<year>2020</year>) <volume>50</volume>:<fpage>101418</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.smim.2020.101418</pub-id>
</citation>
</ref>
<ref id="B8">
<label>8</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barlow</surname> <given-names>DJ</given-names>
</name>
<name>
<surname>Edwards</surname> <given-names>MS</given-names>
</name>
<name>
<surname>Thornton</surname> <given-names>JM</given-names>
</name>
</person-group>. <article-title>Continuous and discontinuous protein antigenic determinants</article-title>. <source>Nature</source>. (<year>1986</year>) <volume>322</volume>:<page-range>747&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/322747a0</pub-id>
</citation>
</ref>
<ref id="B9">
<label>9</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Regenmortel</surname> <given-names>MH</given-names>
</name>
</person-group>. <article-title>What is a B-cell epitope</article-title>? <source>Methods Mol Biol</source>. (<year>2009</year>) <volume>524</volume>:<fpage>3</fpage>&#x2013;<lpage>20</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-59745-450-6_1</pub-id>
</citation>
</ref>
<ref id="B10">
<label>10</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kringelum</surname> <given-names>JV</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Padkj&#xe6;r</surname> <given-names>SB</given-names>
</name>
<name>
<surname>Lund</surname> <given-names>O</given-names>
</name>
</person-group>. <article-title>Structural analysis of B-cell epitopes in antibody:protein complexes</article-title>. <source>Mol Immunol</source>. (<year>2013</year>) <volume>53</volume>:<fpage>24</fpage>&#x2013;<lpage>34</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.molimm.2012.06.001</pub-id>
</citation>
</ref>
<ref id="B11">
<label>11</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rahman Kh</surname> <given-names>S</given-names>
</name>
<name>
<surname>Chowdhury</surname> <given-names>EU</given-names>
</name>
<name>
<surname>Sachse</surname> <given-names>K</given-names>
</name>
<name>
<surname>Kaltenboeck</surname> <given-names>B</given-names>
</name>
</person-group>. <article-title>Inadequate reference datasets biased toward short non-epitopes confound B-cell epitope prediction</article-title>. <source>J Biol Chem</source>. (<year>2016</year>) <volume>291</volume>:<page-range>14585&#x2013;99</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1074/jbc.M116.729020</pub-id>
</citation>
</ref>
<ref id="B12">
<label>12</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Potocnakova</surname> <given-names>L</given-names>
</name>
<name>
<surname>Bhide</surname> <given-names>M</given-names>
</name>
<name>
<surname>Pulzova</surname> <given-names>LB</given-names>
</name>
</person-group>. <article-title>An introduction to B-cell epitope mapping and in silico epitope prediction</article-title>. <source>J Immunol Res</source>. (<year>2016</year>) <volume>2016</volume>:<fpage>6760830</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1155/2016/6760830</pub-id>
</citation>
</ref>
<ref id="B13">
<label>13</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Toride King</surname> <given-names>M</given-names>
</name>
<name>
<surname>Brooks</surname> <given-names>CL</given-names>
</name>
</person-group>. <article-title>Epitope mapping of antibody-antigen interactions with X-ray crystallography</article-title>. <source>Methods Mol Biol</source>. (<year>2018</year>) <volume>1785</volume>:<fpage>13</fpage>&#x2013;<lpage>27</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-4939-7841-0_2</pub-id>
</citation>
</ref>
<ref id="B14">
<label>14</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krissinel</surname> <given-names>E</given-names>
</name>
<name>
<surname>Henrick</surname> <given-names>K</given-names>
</name>
</person-group>. <article-title>Inference of macromolecular assemblies from crystalline state</article-title>. <source>J Mol Biol</source>. (<year>2007</year>) <volume>372</volume>:<page-range>774&#x2013;97</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jmb.2007.05.022</pub-id>
</citation>
</ref>
<ref id="B15">
<label>15</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smyth</surname> <given-names>MS</given-names>
</name>
<name>
<surname>Martin</surname> <given-names>JH</given-names>
</name>
</person-group>. <article-title>x ray crystallography</article-title>. <source>Mol Pathol</source>. (<year>2000</year>) <volume>53</volume>:<fpage>8</fpage>&#x2013;<lpage>14</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1136/mp.53.1.8</pub-id>
</citation>
</ref>
<ref id="B16">
<label>16</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wigge</surname> <given-names>C</given-names>
</name>
<name>
<surname>Stefanovic</surname> <given-names>A</given-names>
</name>
<name>
<surname>Radjainia</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>The rapidly evolving role of cryo-EM in drug design</article-title>. <source>Drug Discovery Today Technol</source>. (<year>2020</year>) <volume>38</volume>:<fpage>91</fpage>&#x2013;<lpage>102</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ddtec.2020.12.003</pub-id>
</citation>
</ref>
<ref id="B17">
<label>17</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amartely</surname> <given-names>H</given-names>
</name>
<name>
<surname>Iosub-Amir</surname> <given-names>A</given-names>
</name>
<name>
<surname>Friedler</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Identifying protein-protein interaction sites using peptide arrays</article-title>. <source>J Vis Exp</source>. (<year>2014</year>) <volume>93</volume>:<elocation-id>e52097</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3791/52097</pub-id>
</citation>
</ref>
<ref id="B18">
<label>18</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morrison</surname> <given-names>KL</given-names>
</name>
<name>
<surname>Weiss</surname> <given-names>GA</given-names>
</name>
</person-group>. <article-title>Combinatorial alanine-scanning</article-title>. <source>Curr Opin Chem Biol</source>. (<year>2001</year>) <volume>5</volume>:<page-range>302&#x2013;7</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S1367-5931(00)00206-4</pub-id>
</citation>
</ref>
<ref id="B19">
<label>19</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szymczak</surname> <given-names>LC</given-names>
</name>
<name>
<surname>Kuo</surname> <given-names>HY</given-names>
</name>
<name>
<surname>Mrksich</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>Peptide arrays: development and application</article-title>. <source>Anal Chem</source>. (<year>2018</year>) <volume>90</volume>:<page-range>266&#x2013;82</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1021/acs.analchem.7b04380</pub-id>
</citation>
</ref>
<ref id="B20">
<label>20</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname> <given-names>M</given-names>
</name>
<name>
<surname>Dou</surname> <given-names>X</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>X</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>M</given-names>
</name>
<name>
<surname>Xiong</surname> <given-names>D</given-names>
</name>
<name>
<surname>Jiang</surname> <given-names>R</given-names>
</name>
<etal/>
</person-group>. <article-title>Protective antigenic epitopes revealed by immunosignatures after three doses of inactivated SARS-CoV-2 vaccine</article-title>. <source>Front Immunol</source>. (<year>2022</year>) <volume>13</volume>:<elocation-id>938378</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2022.938378</pub-id>
</citation>
</ref>
<ref id="B21">
<label>21</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Goodfellow</surname> <given-names>I</given-names>
</name>
<name>
<surname>Bengio</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Courville</surname> <given-names>A</given-names>
</name>
</person-group>. <source>Deep Learning</source>. <publisher-loc>Cambridge, Massachusetts</publisher-loc>: <publisher-name>The MIT Press</publisher-name> (<year>2016</year>).</citation>
</ref>
<ref id="B22">
<label>22</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Cukierski</surname> <given-names>W</given-names>
</name>
</person-group>. <source>Titanic - machine learning from disaster</source>. <publisher-loc>Delaware, USA</publisher-loc>: <publisher-name>Kaggle</publisher-name> (<year>2012</year>). Available at: <uri xlink:href="https://www.kaggle.com/competitions/titanic">https://www.kaggle.com/competitions/titanic</uri>.</citation>
</ref>
<ref id="B23">
<label>23</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kosaloglu-Yalcin</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Blazeska</surname> <given-names>N</given-names>
</name>
<name>
<surname>Vita</surname> <given-names>R</given-names>
</name>
<name>
<surname>Carter</surname> <given-names>H</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Schoenberger</surname> <given-names>S</given-names>
</name>
<etal/>
</person-group>. <article-title>The cancer epitope database and analysis resource (CEDAR)</article-title>. <source>Nucleic Acids Res</source>. (<year>2023</year>) <volume>51</volume>:<page-range>D845&#x2013;D52</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkac902</pub-id>
</citation>
</ref>
<ref id="B24">
<label>24</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liang</surname> <given-names>S</given-names>
</name>
<name>
<surname>Zheng</surname> <given-names>D</given-names>
</name>
<name>
<surname>Yao</surname> <given-names>B</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>EPCES and EPSVR: prediction of B-cell antigenic epitopes on protein surfaces with conformational information</article-title>. <source>Methods Mol Biol</source>. (<year>2020</year>) <volume>2131</volume>:<page-range>289&#x2013;97</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-0716-0389-5_16</pub-id>
</citation>
</ref>
<ref id="B25">
<label>25</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ponomarenko</surname> <given-names>JV</given-names>
</name>
<name>
<surname>Bourne</surname> <given-names>PE</given-names>
</name>
</person-group>. <article-title>Antibody-protein interactions: benchmark datasets and prediction tools evaluation</article-title>. <source>BMC Struct Biol</source>. (<year>2007</year>) <volume>7</volume>:<fpage>64</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1472-6807-7-64</pub-id>
</citation>
</ref>
<ref id="B26">
<label>26</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mintseris</surname> <given-names>J</given-names>
</name>
<name>
<surname>Wiehe</surname> <given-names>K</given-names>
</name>
<name>
<surname>Pierce</surname> <given-names>B</given-names>
</name>
<name>
<surname>Anderson</surname> <given-names>R</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>R</given-names>
</name>
<name>
<surname>Janin</surname> <given-names>J</given-names>
</name>
<etal/>
</person-group>. <article-title>Protein-protein docking benchmark 2.0: an update</article-title>. <source>Proteins</source>. (<year>2005</year>) <volume>60</volume>:<page-range>214&#x2013;6</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/prot.20560</pub-id>
</citation>
</ref>
<ref id="B27">
<label>27</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname> <given-names>H</given-names>
</name>
<name>
<surname>Ansari</surname> <given-names>HR</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. <article-title>Improved method for linear B-cell epitope prediction using antigen's primary sequence</article-title>. <source>PLoS One</source>. (<year>2013</year>) <volume>8</volume>:<elocation-id>e62216</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0062216</pub-id>
</citation>
</ref>
<ref id="B28">
<label>28</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saha</surname> <given-names>S</given-names>
</name>
<name>
<surname>Bhasin</surname> <given-names>M</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. <article-title>Bcipep: a database of B-cell epitopes</article-title>. <source>BMC Genomics</source>. (<year>2005</year>) <volume>6</volume>:<fpage>79</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2164-6-79</pub-id>
</citation>
</ref>
<ref id="B29">
<label>29</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saha</surname> <given-names>S</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. <article-title>Prediction of continuous B-cell epitopes in an antigen using recurrent neural network</article-title>. <source>Proteins</source>. (<year>2006</year>) <volume>65</volume>:<page-range>40&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/prot.21078</pub-id>
</citation>
</ref>
<ref id="B30">
<label>30</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sarker</surname> <given-names>IH</given-names>
</name>
</person-group>. <article-title>Machine learning: algorithms, real-world applications and research directions</article-title>. <source>SN Comput Sci</source>. (<year>2021</year>) <volume>2</volume>:<fpage>160</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s42979-021-00592-x</pub-id>
</citation>
</ref>
<ref id="B31">
<label>31</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kulkarni-Kale</surname> <given-names>U</given-names>
</name>
<name>
<surname>Bhosle</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kolaskar</surname> <given-names>AS</given-names>
</name>
</person-group>. <article-title>CEP: a conformational epitope prediction server</article-title>. <source>Nucleic Acids Res</source>. (<year>2005</year>) <volume>33</volume>:<page-range>W168&#x2013;71</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gki460</pub-id>
</citation>
</ref>
<ref id="B32">
<label>32</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tubiana</surname> <given-names>J</given-names>
</name>
<name>
<surname>Schneidman-Duhovny</surname> <given-names>D</given-names>
</name>
<name>
<surname>Wolfson</surname> <given-names>HJ</given-names>
</name>
</person-group>. <article-title>ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction</article-title>. <source>Nat Methods</source>. (<year>2022</year>) <volume>19</volume>:<page-range>730&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41592-022-01490-7</pub-id>
</citation>
</ref>
<ref id="B33">
<label>33</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Humphrey</surname> <given-names>W</given-names>
</name>
<name>
<surname>Dalke</surname> <given-names>A</given-names>
</name>
<name>
<surname>Schulten</surname> <given-names>K</given-names>
</name>
</person-group>. <article-title>VMD: Visual molecular dynamics</article-title>. <source>J Mol Graph Model</source>. (<year>1996</year>) <volume>14</volume>:<page-range>33&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0263-7855(96)00018-5</pub-id>
</citation>
</ref>
<ref id="B34">
<label>34</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bujacz</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Structures of bovine, equine and leporine serum albumin</article-title>. <source>Acta Crystallogr D Biol Crystallogr</source>. (<year>2012</year>) <volume>68</volume>:<page-range>1278&#x2013;89</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1107/S0907444912027047</pub-id>
</citation>
</ref>
<ref id="B35">
<label>35</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rubinstein</surname> <given-names>ND</given-names>
</name>
<name>
<surname>Mayrose</surname> <given-names>I</given-names>
</name>
<name>
<surname>Pupko</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>A machine-learning approach for predicting B-cell epitopes</article-title>. <source>Mol Immunol</source>. (<year>2009</year>) <volume>46</volume>:<page-range>840&#x2013;7</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.molimm.2008.09.009</pub-id>
</citation>
</ref>
<ref id="B36">
<label>36</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ansari</surname> <given-names>HR</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. <article-title>Identification of conformational B-cell Epitopes in an antigen from its primary sequence</article-title>. <source>Immunome Res</source>. (<year>2010</year>) <volume>6</volume>:<fpage>6</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1745-7580-6-6</pub-id>
</citation>
</ref>
<ref id="B37">
<label>37</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rubinstein</surname> <given-names>ND</given-names>
</name>
<name>
<surname>Mayrose</surname> <given-names>I</given-names>
</name>
<name>
<surname>Martz</surname> <given-names>E</given-names>
</name>
<name>
<surname>Pupko</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Epitopia: a web-server for predicting B-cell epitopes</article-title>. <source>BMC Bioinf</source>. (<year>2009</year>) <volume>10</volume>:<fpage>287</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-10-287</pub-id>
</citation>
</ref>
<ref id="B38">
<label>38</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>El-Manzalawy</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Dobbs</surname> <given-names>D</given-names>
</name>
<name>
<surname>Honavar</surname> <given-names>V</given-names>
</name>
</person-group>. <article-title>Predicting flexible length linear B-cell epitopes</article-title>. <source>Comput Syst Bioinf Conf</source>. (<year>2008</year>) <volume>7</volume>:<page-range>121&#x2013;32</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1142/p585</pub-id>
</citation>
</ref>
<ref id="B39">
<label>39</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chinnasamy</surname> <given-names>A</given-names>
</name>
<name>
<surname>Sung</surname> <given-names>WK</given-names>
</name>
<name>
<surname>Mittal</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Protein structure and fold prediction using tree-augmented naive Bayesian classifier</article-title>. <source>Pac Symp Biocomput</source>. (<year>2004</year>), <page-range>387&#x2013;98</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1142/9789812704856_0037</pub-id>
</citation>
</ref>
<ref id="B40">
<label>40</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Di Rienzo</surname> <given-names>L</given-names>
</name>
<name>
<surname>Miotto</surname> <given-names>M</given-names>
</name>
<name>
<surname>Bo</surname> <given-names>L</given-names>
</name>
<name>
<surname>Ruocco</surname> <given-names>G</given-names>
</name>
<name>
<surname>Raimondo</surname> <given-names>D</given-names>
</name>
<name>
<surname>Milanetti</surname> <given-names>E</given-names>
</name>
</person-group>. <article-title>Characterizing hydropathy of amino acid side chain in a protein environment by investigating the structural changes of water molecules network</article-title>. <source>Front Mol Biosci</source>. (<year>2021</year>) <volume>8</volume>:<elocation-id>626837</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmolb.2021.626837</pub-id>
</citation>
</ref>
<ref id="B41">
<label>41</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mant</surname> <given-names>CT</given-names>
</name>
<name>
<surname>Kovacs</surname> <given-names>JM</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>HM</given-names>
</name>
<name>
<surname>Pollock</surname> <given-names>DD</given-names>
</name>
<name>
<surname>Hodges</surname> <given-names>RS</given-names>
</name>
</person-group>. <article-title>Intrinsic amino acid side-chain hydrophilicity/hydrophobicity coefficients determined by reversed-phase high-performance liquid chromatography of model peptides: comparison with other hydrophilicity/hydrophobicity scales</article-title>. <source>Biopolymers</source>. (<year>2009</year>) <volume>92</volume>:<page-range>573&#x2013;95</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/bip.21316</pub-id>
</citation>
</ref>
<ref id="B42">
<label>42</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>T</given-names>
</name>
<name>
<surname>Cheng</surname> <given-names>L</given-names>
</name>
<name>
<surname>Zang</surname> <given-names>T</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>Peptide-major histocompatibility complex class I binding prediction based on deep learning with novel feature</article-title>. <source>Front Genet</source>. (<year>2019</year>) <volume>10</volume>:<elocation-id>1191</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fgene.2019.01191</pub-id>
</citation>
</ref>
<ref id="B43">
<label>43</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eisenberg</surname> <given-names>D</given-names>
</name>
</person-group>. <article-title>Three-dimensional structure of membrane and surface proteins</article-title>. <source>Annu Rev Biochem</source>. (<year>1984</year>) <volume>53</volume>:<fpage>595</fpage>&#x2013;<lpage>623</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1146/annurev.bi.53.070184.003115</pub-id>
</citation>
</ref>
<ref id="B44">
<label>44</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>HX</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>JL</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>G</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>B</given-names>
</name>
</person-group>. <article-title>Probabilistic support vector machines for classification of noise affected data</article-title>. <source>Inform Sci</source>. (<year>2013</year>) <volume>221</volume>:<fpage>60</fpage>&#x2013;<lpage>71</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ins.2012.09.041</pub-id>
</citation>
</ref>
<ref id="B45">
<label>45</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cervantes</surname> <given-names>J</given-names>
</name>
<name>
<surname>Garcia-Lamont</surname> <given-names>F</given-names>
</name>
<name>
<surname>Rodr&#xed;guez-Mazahua</surname> <given-names>L</given-names>
</name>
<name>
<surname>Lopez</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>A comprehensive survey on support vector machine classification: Applications, challenges and trends</article-title>. <source>Neurocomputing</source>. (<year>2020</year>) <volume>408</volume>:<fpage>189</fpage>&#x2013;<lpage>215</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.neucom.2019.10.118</pub-id>
</citation>
</ref>
<ref id="B46">
<label>46</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Tino</surname> <given-names>P</given-names>
</name>
<name>
<surname>Leonardis</surname> <given-names>A</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>K</given-names>
</name>
</person-group>. <article-title>A survey on neural network interpretability</article-title>. <source>IEEE T Em Top Comp I</source>. (<year>2021</year>) <volume>5</volume>:<page-range>726&#x2013;42</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1109/TETCI.2021.3100641</pub-id>
</citation>
</ref>
<ref id="B47">
<label>47</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Costa</surname> <given-names>VG</given-names>
</name>
<name>
<surname>Pedreira</surname> <given-names>CE</given-names>
</name>
</person-group>. <article-title>Recent advances in decision trees: an updated survey</article-title>. <source>Artif Intell Rev</source>. (<year>2023</year>) <volume>56</volume>:<page-range>4765&#x2013;800</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s10462-022-10275-5</pub-id>
</citation>
</ref>
<ref id="B48">
<label>48</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ofer</surname> <given-names>D</given-names>
</name>
<name>
<surname>Brandes</surname> <given-names>N</given-names>
</name>
<name>
<surname>Linial</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>The language of proteins: NLP, machine learning &amp; protein sequences</article-title>. <source>Comput Struct Biotec</source>. (<year>2021</year>) <volume>19</volume>:<page-range>1750&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.csbj.2021.03.022</pub-id>
</citation>
</ref>
<ref id="B49">
<label>49</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blythe</surname> <given-names>MJ</given-names>
</name>
<name>
<surname>Flower</surname> <given-names>DR</given-names>
</name>
</person-group>. <article-title>Benchmarking B cell epitope prediction: underperformance of existing methods</article-title>. <source>Protein Sci</source>. (<year>2005</year>) <volume>14</volume>:<page-range>246&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1110/ps.041059505</pub-id>
</citation>
</ref>
<ref id="B50">
<label>50</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jespersen</surname> <given-names>MC</given-names>
</name>
<name>
<surname>Peters</surname> <given-names>B</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Marcatili</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes</article-title>. <source>Nucleic Acids Res</source>. (<year>2017</year>) <volume>45</volume>:<page-range>W24&#x2013;W9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkx346</pub-id>
</citation>
</ref>
<ref id="B51">
<label>51</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ivanisenko</surname> <given-names>NV</given-names>
</name>
<name>
<surname>Shashkova</surname> <given-names>TI</given-names>
</name>
<name>
<surname>Shevtsov</surname> <given-names>A</given-names>
</name>
<name>
<surname>Sindeeva</surname> <given-names>M</given-names>
</name>
<name>
<surname>Umerenkov</surname> <given-names>D</given-names>
</name>
<name>
<surname>Kardymon</surname> <given-names>O</given-names>
</name>
</person-group>. <article-title>SEMA 2.0: web-platform for B-cell conformational epitopes prediction using artificial intelligence</article-title>. <source>Nucleic Acids Res</source>. (<year>2024</year>) <volume>52</volume>:<page-range>W533&#x2013;W9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkae386</pub-id>
</citation>
</ref>
<ref id="B52">
<label>52</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Clifford</surname> <given-names>JN</given-names>
</name>
<name>
<surname>Hoie</surname> <given-names>MH</given-names>
</name>
<name>
<surname>Deleuran</surname> <given-names>S</given-names>
</name>
<name>
<surname>Peters</surname> <given-names>B</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Marcatili</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>BepiPred-3.0: Improved B-cell epitope prediction using protein language models</article-title>. <source>Protein Sci</source>. (<year>2022</year>) <volume>31</volume>:<elocation-id>e4497</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/pro.4497</pub-id>
</citation>
</ref>
<ref id="B53">
<label>53</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Koren</surname> <given-names>E</given-names>
</name>
<name>
<surname>De Groot</surname> <given-names>AS</given-names>
</name>
<name>
<surname>Jawa</surname> <given-names>V</given-names>
</name>
<name>
<surname>Beck</surname> <given-names>KD</given-names>
</name>
<name>
<surname>Boone</surname> <given-names>T</given-names>
</name>
<name>
<surname>Rivera</surname> <given-names>D</given-names>
</name>
<etal/>
</person-group>. <article-title>Clinical validation of the "in silico" prediction of immunogenicity of a human recombinant therapeutic protein</article-title>. <source>Clin Immunol</source>. (<year>2007</year>) <volume>124</volume>:<fpage>26</fpage>&#x2013;<lpage>32</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.clim.2007.03.544</pub-id>
</citation>
</ref>
<ref id="B54">
<label>54</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peters</surname> <given-names>B</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Sette</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>T cell epitope predictions</article-title>. <source>Annu Rev Immunol</source>. (<year>2020</year>) <volume>38</volume>:<page-range>123&#x2013;45</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1146/annurev-immunol-082119-124838</pub-id>
</citation>
</ref>
<ref id="B55">
<label>55</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hotop</surname> <given-names>SK</given-names>
</name>
<name>
<surname>Reimering</surname> <given-names>S</given-names>
</name>
<name>
<surname>Shekhar</surname> <given-names>A</given-names>
</name>
<name>
<surname>Asgari</surname> <given-names>E</given-names>
</name>
<name>
<surname>Beutling</surname> <given-names>U</given-names>
</name>
<name>
<surname>Dahlke</surname> <given-names>C</given-names>
</name>
<etal/>
</person-group>. <article-title>Peptide microarrays coupled to machine learning reveal individual epitopes from human antibody responses with neutralizing capabilities against SARS-CoV-2</article-title>. <source>Emerg Microbes Infect</source>. (<year>2022</year>) <volume>11</volume>:<page-range>1037&#x2013;48</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1080/22221751.2022.2057874</pub-id>
</citation>
</ref>
<ref id="B56">
<label>56</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hada-Neeman</surname> <given-names>S</given-names>
</name>
<name>
<surname>Weiss-Ottolenghi</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Wagner</surname> <given-names>N</given-names>
</name>
<name>
<surname>Avram</surname> <given-names>O</given-names>
</name>
<name>
<surname>Ashkenazy</surname> <given-names>H</given-names>
</name>
<name>
<surname>Maor</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>Domain-scan: combinatorial sero-diagnosis of infectious diseases using machine learning</article-title>. <source>Front Immunol</source>. (<year>2020</year>) <volume>11</volume>:<elocation-id>619896</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2020.619896</pub-id>
</citation>
</ref>
<ref id="B57">
<label>57</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maguy</surname> <given-names>A</given-names>
</name>
<name>
<surname>Tardif</surname> <given-names>JC</given-names>
</name>
<name>
<surname>Busseuil</surname> <given-names>D</given-names>
</name>
<name>
<surname>Ribi</surname> <given-names>C</given-names>
</name>
<name>
<surname>Li</surname> <given-names>J</given-names>
</name>
</person-group>. <article-title>Autoantibody signature in cardiac arrest</article-title>. <source>Circulation</source>. (<year>2020</year>) <volume>141</volume>:<page-range>1764&#x2013;74</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.119.044408</pub-id>
</citation>
</ref>
<ref id="B58">
<label>58</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xue</surname> <given-names>AY</given-names>
</name>
<name>
<surname>Szymczak</surname> <given-names>LC</given-names>
</name>
<name>
<surname>Mrksich</surname> <given-names>M</given-names>
</name>
<name>
<surname>Bagheri</surname> <given-names>N</given-names>
</name>
</person-group>. <article-title>Machine learning on signal-to-noise ratios improves peptide array design in SAMDI mass spectrometry</article-title>. <source>Anal Chem</source>. (<year>2017</year>) <volume>89</volume>:<page-range>9039&#x2013;47</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1021/acs.analchem.7b01728</pub-id>
</citation>
</ref>
<ref id="B59">
<label>59</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krawczyk</surname> <given-names>K</given-names>
</name>
<name>
<surname>Baker</surname> <given-names>T</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>J</given-names>
</name>
<name>
<surname>Deane</surname> <given-names>CM</given-names>
</name>
</person-group>. <article-title>Antibody i-Patch prediction of the antibody binding site improves rigid local antibody-antigen docking</article-title>. <source>Protein Eng Des Sel</source>. (<year>2013</year>) <volume>26</volume>:<page-range>621&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/protein/gzt043</pub-id>
</citation>
</ref>
<ref id="B60">
<label>60</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weitzner</surname> <given-names>BD</given-names>
</name>
<name>
<surname>Jeliazkov</surname> <given-names>JR</given-names>
</name>
<name>
<surname>Lyskov</surname> <given-names>S</given-names>
</name>
<name>
<surname>Marze</surname> <given-names>N</given-names>
</name>
<name>
<surname>Kuroda</surname> <given-names>D</given-names>
</name>
<name>
<surname>Frick</surname> <given-names>R</given-names>
</name>
<etal/>
</person-group>. <article-title>Modeling and docking of antibody structures with Rosetta</article-title>. <source>Nat Protoc</source>. (<year>2017</year>) <volume>12</volume>:<page-range>401&#x2013;16</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nprot.2016.180</pub-id>
</citation>
</ref>
<ref id="B61">
<label>61</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garzon</surname> <given-names>JI</given-names>
</name>
<name>
<surname>Lopez-Blanco</surname> <given-names>JR</given-names>
</name>
<name>
<surname>Pons</surname> <given-names>C</given-names>
</name>
<name>
<surname>Kovacs</surname> <given-names>J</given-names>
</name>
<name>
<surname>Abagyan</surname> <given-names>R</given-names>
</name>
<name>
<surname>Fernandez-Recio</surname> <given-names>J</given-names>
</name>
<etal/>
</person-group>. <article-title>FRODOCK: a new approach for fast rotational protein-protein docking</article-title>. <source>Bioinformatics</source>. (<year>2009</year>) <volume>25</volume>:<page-range>2544&#x2013;51</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btp447</pub-id>
</citation>
</ref>
<ref id="B62">
<label>62</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bourquard</surname> <given-names>T</given-names>
</name>
<name>
<surname>Musnier</surname> <given-names>A</given-names>
</name>
<name>
<surname>Puard</surname> <given-names>V</given-names>
</name>
<name>
<surname>Tahir</surname> <given-names>S</given-names>
</name>
<name>
<surname>Ayoub</surname> <given-names>MA</given-names>
</name>
<name>
<surname>Jullian</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>MAbTope: A method for improved epitope mapping</article-title>. <source>J Immunol</source>. (<year>2018</year>) <volume>201</volume>:<page-range>3096&#x2013;105</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.4049/jimmunol.1701722</pub-id>
</citation>
</ref>
<ref id="B63">
<label>63</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>C</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>EA</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>Protein-ligand docking in the machine-learning era</article-title>. <source>Molecules</source>. (<year>2022</year>) <volume>27</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/molecules27144568</pub-id>
</citation>
</ref>
<ref id="B64">
<label>64</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>HL</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>MJ</given-names>
</name>
<name>
<surname>Tan</surname> <given-names>H</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>ZD</given-names>
</name>
<name>
<surname>Song</surname> <given-names>JN</given-names>
</name>
</person-group>. <article-title>PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection</article-title>. <source>PLoS One</source>. (<year>2014</year>) <volume>9</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0105902</pub-id>
</citation>
</ref>
<ref id="B65">
<label>65</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Slabinski</surname> <given-names>L</given-names>
</name>
<name>
<surname>Jaroszewski</surname> <given-names>L</given-names>
</name>
<name>
<surname>Rychlewski</surname> <given-names>L</given-names>
</name>
<name>
<surname>Wilson</surname> <given-names>IA</given-names>
</name>
<name>
<surname>Lesley</surname> <given-names>SA</given-names>
</name>
<name>
<surname>Godzik</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>XtalPred: a web server for prediction of protein crystallizability</article-title>. <source>Bioinformatics</source>. (<year>2007</year>) <volume>23</volume>:<page-range>3403&#x2013;5</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btm477</pub-id>
</citation>
</ref>
<ref id="B66">
<label>66</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tubiana</surname> <given-names>J</given-names>
</name>
<name>
<surname>Xiang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>L</given-names>
</name>
<name>
<surname>Wolfson</surname> <given-names>HJ</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>K</given-names>
</name>
<name>
<surname>Schneidman-Duhovny</surname> <given-names>D</given-names>
</name>
<etal/>
</person-group>. <article-title>Reduced B cell antigenicity of Omicron lowers host serologic response</article-title>. <source>Cell Rep</source>. (<year>2022</year>) <volume>41</volume>:<fpage>111512</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.celrep.2022.111512</pub-id>
</citation>
</ref>
<ref id="B67">
<label>67</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname> <given-names>D</given-names>
</name>
<name>
<surname>Salunke</surname> <given-names>DM</given-names>
</name>
</person-group>. <article-title>Antibody specificity and promiscuity</article-title>. <source>Biochem J</source>. (<year>2019</year>) <volume>476</volume>:<page-range>433&#x2013;47</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1042/BCJ20180670</pub-id>
</citation>
</ref>
<ref id="B68">
<label>68</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gunti</surname> <given-names>S</given-names>
</name>
<name>
<surname>Notkins</surname> <given-names>AL</given-names>
</name>
</person-group>. <article-title>Polyreactive antibodies: function and quantification</article-title>. <source>J&#xa0;Infect Dis</source>. (<year>2015</year>) <volume>212 Suppl 1</volume>:<page-range>S42&#x2013;6</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/infdis/jiu512</pub-id>
</citation>
</ref>
<ref id="B69">
<label>69</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jumper</surname> <given-names>J</given-names>
</name>
<name>
<surname>Evans</surname> <given-names>R</given-names>
</name>
<name>
<surname>Pritzel</surname> <given-names>A</given-names>
</name>
<name>
<surname>Green</surname> <given-names>T</given-names>
</name>
<name>
<surname>Figurnov</surname> <given-names>M</given-names>
</name>
<name>
<surname>Ronneberger</surname> <given-names>O</given-names>
</name>
<etal/>
</person-group>. <article-title>Highly accurate protein structure prediction with AlphaFold</article-title>. <source>Nature</source>. (<year>2021</year>) <volume>596</volume>:<page-range>583&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41586-021-03819-2</pub-id>
</citation>
</ref>
<ref id="B70">
<label>70</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abramson</surname> <given-names>J</given-names>
</name>
<name>
<surname>Adler</surname> <given-names>J</given-names>
</name>
<name>
<surname>Dunger</surname> <given-names>J</given-names>
</name>
<name>
<surname>Evans</surname> <given-names>R</given-names>
</name>
<name>
<surname>Green</surname> <given-names>T</given-names>
</name>
<name>
<surname>Pritzel</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>Accurate structure prediction of biomolecular interactions with AlphaFold 3</article-title>. <source>Nature</source>. (<year>2024</year>) <volume>630</volume>, <page-range>493&#x2013;500</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41586-024-07487-w</pub-id>
</citation>
</ref>
<ref id="B71">
<label>71</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>T</given-names>
</name>
<name>
<surname>Roy</surname> <given-names>RS</given-names>
</name>
<name>
<surname>Quadir</surname> <given-names>F</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>C</given-names>
</name>
<etal/>
</person-group>. <article-title>Enhancing alphafold-multimer-based protein complex structure prediction with MULTICOM in CASP15</article-title>. <source>Commun Biol</source>. (<year>2023</year>) <volume>6</volume>:<fpage>1140</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s42003-023-05525-3</pub-id>
</citation>
</ref>
<ref id="B72">
<label>72</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deac</surname> <given-names>A</given-names>
</name>
<name>
<surname>VeliCkovic</surname> <given-names>P</given-names>
</name>
<name>
<surname>Sormanni</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>Attentive cross-modal paratope prediction</article-title>. <source>J&#xa0;Comput Biol</source>. (<year>2019</year>) <volume>26</volume>:<page-range>536&#x2013;45</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1089/cmb.2018.0175</pub-id>
</citation>
</ref>
<ref id="B73">
<label>73</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chinery</surname> <given-names>L</given-names>
</name>
<name>
<surname>Wahome</surname> <given-names>N</given-names>
</name>
<name>
<surname>Moal</surname> <given-names>I</given-names>
</name>
<name>
<surname>Deane</surname> <given-names>CM</given-names>
</name>
</person-group>. <article-title>Paragraph-antibody paratope prediction using graph neural networks with minimal feature vectors</article-title>. <source>Bioinformatics</source>. (<year>2023</year>) <volume>39</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btac732</pub-id>
</citation>
</ref>
<ref id="B74">
<label>74</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liberis</surname> <given-names>E</given-names>
</name>
<name>
<surname>Velickovic</surname> <given-names>P</given-names>
</name>
<name>
<surname>Sormanni</surname> <given-names>P</given-names>
</name>
<name>
<surname>Vendruscolo</surname> <given-names>M</given-names>
</name>
<name>
<surname>Lio</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>Parapred: antibody paratope prediction using convolutional and recurrent neural networks</article-title>. <source>Bioinformatics</source>. (<year>2018</year>) <volume>34</volume>:<page-range>2944&#x2013;50</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/bty305</pub-id>
</citation>
</ref>
<ref id="B75">
<label>75</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Varadi</surname> <given-names>M</given-names>
</name>
<name>
<surname>Anyango</surname> <given-names>S</given-names>
</name>
<name>
<surname>Deshpande</surname> <given-names>M</given-names>
</name>
<name>
<surname>Nair</surname> <given-names>S</given-names>
</name>
<name>
<surname>Natassia</surname> <given-names>C</given-names>
</name>
<name>
<surname>Yordanova</surname> <given-names>G</given-names>
</name>
<etal/>
</person-group>. <article-title>AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models</article-title>. <source>Nucleic Acids Res</source>. (<year>2022</year>) <volume>50</volume>:<page-range>D439&#x2013;D44</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkab1061</pub-id>
</citation>
</ref>
<ref id="B76">
<label>76</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mirdita</surname> <given-names>M</given-names>
</name>
<name>
<surname>Schutze</surname> <given-names>K</given-names>
</name>
<name>
<surname>Moriwaki</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Heo</surname> <given-names>L</given-names>
</name>
<name>
<surname>Ovchinnikov</surname> <given-names>S</given-names>
</name>
<name>
<surname>Steinegger</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>ColabFold: making protein folding accessible to all</article-title>. <source>Nat Methods</source>. (<year>2022</year>) <volume>19</volume>:<page-range>679&#x2013;82</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41592-022-01488-1</pub-id>
</citation>
</ref>
<ref id="B77">
<label>77</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nguyen</surname> <given-names>MN</given-names>
</name>
<name>
<surname>Krutz</surname> <given-names>NL</given-names>
</name>
<name>
<surname>Limviphuvadh</surname> <given-names>V</given-names>
</name>
<name>
<surname>Lopata</surname> <given-names>AL</given-names>
</name>
<name>
<surname>Gerberick</surname> <given-names>GF</given-names>
</name>
<name>
<surname>Maurer-Stroh</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>AllerCatPro 2.0: a web server for predicting protein allergenicity potential</article-title>. <source>Nucleic Acids Res</source>. (<year>2022</year>) <volume>50</volume>:<page-range>W36&#x2013;43</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkac446</pub-id>
</citation>
</ref>
<ref id="B78">
<label>78</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ferrante</surname> <given-names>G</given-names>
</name>
<name>
<surname>Licari</surname> <given-names>A</given-names>
</name>
<name>
<surname>Fasola</surname> <given-names>S</given-names>
</name>
<name>
<surname>Marseglia</surname> <given-names>GL</given-names>
</name>
<name>
<surname>La Grutta</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>Artificial intelligence in the diagnosis of pediatric allergic diseases</article-title>. <source>Pediatr Allergy Immunol</source>. (<year>2021</year>) <volume>32</volume>:<page-range>405&#x2013;13</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/pai.13419</pub-id>
</citation>
</ref>
<ref id="B79">
<label>79</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Patterson</surname> <given-names>BK</given-names>
</name>
<name>
<surname>Guevara-Coto</surname> <given-names>J</given-names>
</name>
<name>
<surname>Yogendra</surname> <given-names>R</given-names>
</name>
<name>
<surname>Francisco</surname> <given-names>EB</given-names>
</name>
<name>
<surname>Long</surname> <given-names>E</given-names>
</name>
<name>
<surname>Pise</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>Immune-based prediction of COVID-19 severity and chronicity decoded using machine learning</article-title>. <source>Front Immunol</source>. (<year>2021</year>) <volume>12</volume>:<elocation-id>700782</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2021.700782</pub-id>
</citation>
</ref>
<ref id="B80">
<label>80</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stafford</surname> <given-names>IS</given-names>
</name>
<name>
<surname>Kellermann</surname> <given-names>M</given-names>
</name>
<name>
<surname>Mossotto</surname> <given-names>E</given-names>
</name>
<name>
<surname>Beattie</surname> <given-names>RM</given-names>
</name>
<name>
<surname>MacArthur</surname> <given-names>BD</given-names>
</name>
<name>
<surname>Ennis</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases</article-title>. <source>NPJ Digit Med</source>. (<year>2020</year>) <volume>3</volume>:<fpage>30</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41746-020-0229-3</pub-id>
</citation>
</ref>
<ref id="B81">
<label>81</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mendes</surname> <given-names>M</given-names>
</name>
<name>
<surname>Mahita</surname> <given-names>J</given-names>
</name>
<name>
<surname>Blazeska</surname> <given-names>N</given-names>
</name>
<name>
<surname>Greenbaum</surname> <given-names>J</given-names>
</name>
<name>
<surname>Ha</surname> <given-names>B</given-names>
</name>
<name>
<surname>Wheeler</surname> <given-names>K</given-names>
</name>
<etal/>
</person-group>. <article-title>IEDB-3D 2.0: Structural data analysis within the Immune Epitope Database</article-title>. <source>Protein Sci</source>. (<year>2023</year>) <volume>32</volume>:<elocation-id>e4605</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/pro.4605</pub-id>
</citation>
</ref>
<ref id="B82">
<label>82</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Mamrosh E-SD-F</surname> <given-names>JL</given-names>
</name>
<name>
<surname>Korber</surname> <given-names>BTM</given-names>
</name>
<name>
<surname>Brander</surname> <given-names>C</given-names>
</name>
<name>
<surname>Barouch</surname> <given-names>D</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>R</given-names>
</name>
<name>
<surname>Haynes</surname> <given-names>BF</given-names>
</name>
<etal/>
</person-group>. <source>HIV Molecular Immunology</source>. <edition>2024 ed</edition>. <publisher-loc>Los Alamos, New Mexico</publisher-loc>: <publisher-name>Los Alamos National Laboratory, Theoretical Biology and Biophysics</publisher-name> (<year>2023</year>).</citation>
</ref>
<ref id="B83">
<label>83</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vita</surname> <given-names>R</given-names>
</name>
<name>
<surname>Overton</surname> <given-names>JA</given-names>
</name>
<name>
<surname>Greenbaum</surname> <given-names>JA</given-names>
</name>
<name>
<surname>Ponomarenko</surname> <given-names>J</given-names>
</name>
<name>
<surname>Clark</surname> <given-names>JD</given-names>
</name>
<name>
<surname>Cantrell</surname> <given-names>JR</given-names>
</name>
<etal/>
</person-group>. <article-title>The immune epitope database (IEDB) 3.0</article-title>. <source>Nucleic Acids Res</source>. (<year>2015</year>) <volume>43</volume>:<page-range>D405&#x2013;12</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gku938</pub-id>
</citation>
</ref>
<ref id="B84">
<label>84</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olsen</surname> <given-names>LR</given-names>
</name>
<name>
<surname>Tongchusak</surname> <given-names>S</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>H</given-names>
</name>
<name>
<surname>Reinherz</surname> <given-names>EL</given-names>
</name>
<name>
<surname>Brusic</surname> <given-names>V</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>GL</given-names>
</name>
</person-group>. <article-title>TANTIGEN: a comprehensive database of tumor T cell antigens</article-title>. <source>Cancer Immunol Immunother</source>. (<year>2017</year>) <volume>66</volume>:<page-range>731&#x2013;5</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s00262-017-1978-y</pub-id>
</citation>
</ref>
<ref id="B85">
<label>85</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname> <given-names>B</given-names>
</name>
<name>
<surname>Sayers</surname> <given-names>S</given-names>
</name>
<name>
<surname>Xiang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>He</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>Protegen: a web-based protective antigen database and analysis system</article-title>. <source>Nucleic Acids Res</source>. (<year>2011</year>) <volume>39</volume>:<page-range>D1073&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkq944</pub-id>
</citation>
</ref>
<ref id="B86">
<label>86</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olsen</surname> <given-names>LR</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>GL</given-names>
</name>
<name>
<surname>Reinherz</surname> <given-names>EL</given-names>
</name>
<name>
<surname>Brusic</surname> <given-names>V</given-names>
</name>
</person-group>. <article-title>FLAVIdB: A data mining system for knowledge discovery in flaviviruses with direct applications in immunology and vaccinology</article-title>. <source>Immunome Res</source>. (<year>2011</year>) <volume>7</volume>.</citation>
</ref>
<ref id="B87">
<label>87</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lata</surname> <given-names>S</given-names>
</name>
<name>
<surname>Bhasin</surname> <given-names>M</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. <article-title>MHCBN 4.0: A database of MHC/TAP binding peptides and T-cell epitopes</article-title>. <source>BMC Res Notes</source>. (<year>2009</year>) <volume>2</volume>:<fpage>61</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1756-0500-2-61</pub-id>
</citation>
</ref>
<ref id="B88">
<label>88</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reche</surname> <given-names>PA</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>H</given-names>
</name>
<name>
<surname>Glutting</surname> <given-names>JP</given-names>
</name>
<name>
<surname>Reinherz</surname> <given-names>EL</given-names>
</name>
</person-group>. <article-title>EPIMHC: a curated database of MHC-binding peptides for customized computational vaccinology</article-title>. <source>Bioinformatics</source>. (<year>2005</year>) <volume>21</volume>:<page-range>2140&#x2013;1</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/bti269</pub-id>
</citation>
</ref>
<ref id="B89">
<label>89</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schuler</surname> <given-names>MM</given-names>
</name>
<name>
<surname>Nastke</surname> <given-names>MD</given-names>
</name>
<name>
<surname>Stevanovikc</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>SYFPEITHI: database for searching and T-cell epitope prediction</article-title>. <source>Methods Mol Biol</source>. (<year>2007</year>) <volume>409</volume>:<fpage>75</fpage>&#x2013;<lpage>93</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-60327-118-9_5</pub-id>
</citation>
</ref>
<ref id="B90">
<label>90</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Negi</surname> <given-names>SS</given-names>
</name>
<name>
<surname>Schein</surname> <given-names>CH</given-names>
</name>
<name>
<surname>Braun</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>The updated Structural Database of Allergenic Proteins (SDAP 2.0) provides 3D models for allergens and incorporated bioinformatics tools</article-title>. <source>J Allergy Clin Immunol Glob</source>. (<year>2023</year>) <volume>2</volume>:<fpage>100162</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jacig.2023.100162</pub-id>
</citation>
</ref>
<ref id="B91">
<label>91</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname> <given-names>J</given-names>
</name>
<name>
<surname>Honda</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>CED: a conformational epitope database</article-title>. <source>BMC Immunol</source>. (<year>2006</year>) <volume>7</volume>:<fpage>7</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2172-7-7</pub-id>
</citation>
</ref>
<ref id="B92">
<label>92</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schlessinger</surname> <given-names>A</given-names>
</name>
<name>
<surname>Ofran</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Yachdav</surname> <given-names>G</given-names>
</name>
<name>
<surname>Rost</surname> <given-names>B</given-names>
</name>
</person-group>. <article-title>Epitome: database of structure-inferred antigenic epitopes</article-title>. <source>Nucleic Acids Res</source>. (<year>2006</year>) <volume>34</volume>:<page-range>D777&#x2013;80</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkj053</pub-id>
</citation>
</ref>
<ref id="B93">
<label>93</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blythe</surname> <given-names>MJ</given-names>
</name>
<name>
<surname>Doytchinova</surname> <given-names>IA</given-names>
</name>
<name>
<surname>Flower</surname> <given-names>DR</given-names>
</name>
</person-group>. <article-title>JenPep: a database of quantitative functional peptide data for immunology</article-title>. <source>Bioinformatics</source>. (<year>2002</year>) <volume>18</volume>:<page-range>434&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/18.3.434</pub-id>
</citation>
</ref>
<ref id="B94">
<label>94</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sweredoski</surname> <given-names>MJ</given-names>
</name>
<name>
<surname>Baldi</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>COBEpro: a novel system for predicting continuous B-cell epitopes</article-title>. <source>Protein Eng Des Sel</source>. (<year>2009</year>) <volume>22</volume>:<page-range>113&#x2013;20</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/protein/gzn075</pub-id>
</citation>
</ref>
<ref id="B95">
<label>95</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname> <given-names>J</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>D</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>T</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>X</given-names>
</name>
<name>
<surname>Tao</surname> <given-names>L</given-names>
</name>
<etal/>
</person-group>. <article-title>SEPPA: a computational server for spatial epitope prediction of protein antigens</article-title>. <source>Nucleic Acids Res</source>. (<year>2009</year>) <volume>37</volume>:<page-range>W612&#x2013;6</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkp417</pub-id>
</citation>
</ref>
<ref id="B96">
<label>96</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qi</surname> <given-names>T</given-names>
</name>
<name>
<surname>Qiu</surname> <given-names>T</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Q</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>K</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Qiu</surname> <given-names>J</given-names>
</name>
<etal/>
</person-group>. <article-title>SEPPA 2.0&#x2013;more refined server to predict spatial epitope considering species of immune host and subcellular localization of protein antigen</article-title>. <source>Nucleic Acids Res</source>. (<year>2014</year>) <volume>42</volume>:<page-range>W59&#x2013;63</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gku395</pub-id>
</citation>
</ref>
<ref id="B97">
<label>97</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname> <given-names>C</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>L</given-names>
</name>
<name>
<surname>Yan</surname> <given-names>D</given-names>
</name>
<name>
<surname>Mao</surname> <given-names>T</given-names>
</name>
<name>
<surname>Tang</surname> <given-names>K</given-names>
</name>
<etal/>
</person-group>. <article-title>SEPPA 3.0-enhanced spatial epitope prediction enabling glycoprotein antigens</article-title>. <source>Nucleic Acids Res</source>. (<year>2019</year>) <volume>47</volume>:<page-range>W388&#x2013;W94</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkz413</pub-id>
</citation>
</ref>
<ref id="B98">
<label>98</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoie</surname> <given-names>MH</given-names>
</name>
<name>
<surname>Gade</surname> <given-names>FS</given-names>
</name>
<name>
<surname>Johansen</surname> <given-names>JM</given-names>
</name>
<name>
<surname>Wurtzen</surname> <given-names>C</given-names>
</name>
<name>
<surname>Winther</surname> <given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<etal/>
</person-group>. <article-title>DiscoTope-3.0: improved B-cell epitope prediction using inverse folding latent representations</article-title>. <source>Front Immunol</source>. (<year>2024</year>) <volume>15</volume>:<elocation-id>1322712</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2024.1322712</pub-id>
</citation>
</ref>
<ref id="B99">
<label>99</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>da Silva</surname> <given-names>BM</given-names>
</name>
<name>
<surname>Myung</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Ascher</surname> <given-names>DB</given-names>
</name>
<name>
<surname>Pires</surname> <given-names>DEV</given-names>
</name>
</person-group>. <article-title>epitope3D: a machine learning method for conformational B-cell epitope prediction</article-title>. <source>Brief Bioinform</source>. (<year>2022</year>) <volume>23</volume>:<fpage>bbab423</fpage>5. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bib/bbab423</pub-id>
</citation>
</ref>
<ref id="B100">
<label>100</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krawczyk</surname> <given-names>K</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>X</given-names>
</name>
<name>
<surname>Baker</surname> <given-names>T</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>J</given-names>
</name>
<name>
<surname>Deane</surname> <given-names>CM</given-names>
</name>
</person-group>. <article-title>Improving B-cell epitope prediction and its application to global antibody-antigen docking</article-title>. <source>Bioinformatics</source>. (<year>2014</year>) <volume>30</volume>:<page-range>2288&#x2013;94</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btu190</pub-id>
</citation>
</ref>
<ref id="B101">
<label>101</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ponomarenko</surname> <given-names>J</given-names>
</name>
<name>
<surname>Bui</surname> <given-names>HH</given-names>
</name>
<name>
<surname>Li</surname> <given-names>W</given-names>
</name>
<name>
<surname>Fusseder</surname> <given-names>N</given-names>
</name>
<name>
<surname>Bourne</surname> <given-names>PE</given-names>
</name>
<name>
<surname>Sette</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>ElliPro: a new structure-based tool for the prediction of antibody epitopes</article-title>. <source>BMC Bioinf</source>. (<year>2008</year>) <volume>9</volume>:<fpage>514</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-9-514</pub-id>
</citation>
</ref>
<ref id="B102">
<label>102</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>F</given-names>
</name>
<name>
<surname>Yuan</surname> <given-names>C</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>H</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>F</given-names>
</name>
</person-group>. <article-title>Prediction of linear B-cell epitopes based on protein sequence features and BERT embeddings</article-title>. <source>Sci Rep</source>. (<year>2024</year>) <volume>14</volume>:<fpage>2464</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-024-53028-w</pub-id>
</citation>
</ref>
<ref id="B103">
<label>103</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname> <given-names>N</given-names>
</name>
<name>
<surname>Tripathi</surname> <given-names>S</given-names>
</name>
<name>
<surname>Sharma</surname> <given-names>N</given-names>
</name>
<name>
<surname>Patiyal</surname> <given-names>S</given-names>
</name>
<name>
<surname>Devi</surname> <given-names>NL</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GPS</given-names>
</name>
</person-group>. <article-title>A method for predicting linear and conformational B-cell epitopes in an antigen from its primary sequence</article-title>. <source>Comput Biol Med</source>. (<year>2024</year>) <volume>170</volume>:<fpage>108083</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.compbiomed.2024.108083</pub-id>
</citation>
</ref>
<ref id="B104">
<label>104</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>da Silva</surname> <given-names>BM</given-names>
</name>
<name>
<surname>Ascher</surname> <given-names>DB</given-names>
</name>
<name>
<surname>Pires</surname> <given-names>DEV</given-names>
</name>
</person-group>. <article-title>epitope1D: accurate taxonomy-aware B-cell linear epitope prediction</article-title>. <source>Brief Bioinform</source>. (<year>2023</year>) <volume>24</volume>:<fpage>bbad114</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bib/bbad114</pub-id>
</citation>
</ref>
<ref id="B105">
<label>105</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ras-Carmona</surname> <given-names>A</given-names>
</name>
<name>
<surname>Pelaez-Prestel</surname> <given-names>HF</given-names>
</name>
<name>
<surname>Lafuente</surname> <given-names>EM</given-names>
</name>
<name>
<surname>Reche</surname> <given-names>PA</given-names>
</name>
</person-group>. <article-title>BCEPS: A web server to predict linear B cell epitopes with enhanced immunogenicity and cross-reactivity</article-title>. <source>Cells</source>. (<year>2021</year>) <volume>10</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/cells10102744</pub-id>
</citation>
</ref>
<ref id="B106">
<label>106</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Collatz</surname> <given-names>M</given-names>
</name>
<name>
<surname>Mock</surname> <given-names>F</given-names>
</name>
<name>
<surname>Barth</surname> <given-names>E</given-names>
</name>
<name>
<surname>Holzer</surname> <given-names>M</given-names>
</name>
<name>
<surname>Sachse</surname> <given-names>K</given-names>
</name>
<name>
<surname>Marz</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>EpiDope: a deep neural network for linear B-cell epitope prediction</article-title>. <source>Bioinformatics</source>. (<year>2021</year>) <volume>37</volume>:<page-range>448&#x2013;55</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btaa773</pub-id>
</citation>
</ref>
<ref id="B107">
<label>107</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname> <given-names>T</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>K</given-names>
</name>
<name>
<surname>Li</surname> <given-names>W</given-names>
</name>
</person-group>. <article-title>Deep learning methods improve linear B-cell epitope prediction</article-title>. <source>BioData Min</source>. (<year>2020</year>) <volume>13</volume>:<fpage>1</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s13040-020-00211-0</pub-id>
</citation>
</ref>
<ref id="B108">
<label>108</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ya.I. Davydov</surname> <given-names>AGT</given-names>
</name>
</person-group>. <article-title>Prediction of linear B-cell epitopes</article-title>. <source>Molekulyarnaya Biologiya</source>. (<year>2009</year>) <volume>43</volume>:<page-range>166&#x2013;74</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1134/S0026893309010208</pub-id>
</citation>
</ref>
<ref id="B109">
<label>109</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yao</surname> <given-names>B</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>L</given-names>
</name>
<name>
<surname>Liang</surname> <given-names>S</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>SVMTriP: a method to predict antigenic epitopes using support vector machine to integrate tri-peptide similarity and propensity</article-title>. <source>PloS One</source>. (<year>2012</year>) <volume>7</volume>:<elocation-id>e45152</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0045152</pub-id>
</citation>
</ref>
<ref id="B110">
<label>110</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>El-Manzalawy</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Dobbs</surname> <given-names>D</given-names>
</name>
<name>
<surname>Honavar</surname> <given-names>V</given-names>
</name>
</person-group>. <article-title>Predicting linear B-cell epitopes using string kernels</article-title>. <source>J Mol Recognit</source>. (<year>2008</year>) <volume>21</volume>:<page-range>243&#x2013;55</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/jmr.893</pub-id>
</citation>
</ref>
<ref id="B111">
<label>111</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Saha</surname> <given-names>S</given-names>
</name>
<name>
<surname>Raghava</surname> <given-names>GP</given-names>
</name>
</person-group>. (<year>2004</year>). &#x2018;<article-title>BCEPRED: Prediction of continuous B-cell epitopes in antigenic sequences using physico-chemical properties</article-title>&#x2019;, <source>Lecture Notes in Computer Science</source>, vol <volume>3239</volume>. <publisher-name>Berlin, Heidelberg</publisher-name>: <publisher-loc>Springer</publisher-loc>. pp. <page-range>197&#x2013;204</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-3-540-30220-9_16</pub-id>
</citation>
</ref>
<ref id="B112">
<label>112</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Israeli</surname> <given-names>S</given-names>
</name>
<name>
<surname>Louzoun</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>Single-residue linear and conformational B cell epitopes prediction using random and ESM-2 based projections</article-title>. <source>Brief Bioinform</source>. (<year>2024</year>) <volume>25</volume>:<fpage>bbae084</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bib/bbae084</pub-id>
</citation>
</ref>
<ref id="B113">
<label>113</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>J</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>B</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>S</given-names>
</name>
<name>
<surname>Sun</surname> <given-names>H</given-names>
</name>
<name>
<surname>Mao</surname> <given-names>X</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>M</given-names>
</name>
<etal/>
</person-group>. <article-title>TEPCAM: Prediction of T-cell receptor-epitope binding specificity via interpretable deep learning</article-title>. <source>Protein Sci</source>. (<year>2024</year>) <volume>33</volume>:<elocation-id>e4841</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/pro.v33.1</pub-id>
</citation>
</ref>
<ref id="B114">
<label>114</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname> <given-names>YC</given-names>
</name>
<name>
<surname>Gao</surname> <given-names>YL</given-names>
</name>
<name>
<surname>Fan</surname> <given-names>YX</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>CY</given-names>
</name>
<name>
<surname>Wei</surname> <given-names>ZT</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>C</given-names>
</name>
<etal/>
</person-group>. <article-title>Pan-Peptide Meta Learning for T-cell receptor-antigen binding recognition</article-title>. <source>Nat Mach Intell</source>. (<year>2023</year>) <volume>5</volume>:<page-range>236&#x2013;49</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s42256-023-00619-3</pub-id>
</citation>
</ref>
<ref id="B115">
<label>115</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Huo</surname> <given-names>M</given-names>
</name>
<name>
<surname>Cheng Li</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>TEINet: a deep learning framework for prediction of TCR-epitope binding specificity</article-title>. <source>Brief Bioinform</source>. (<year>2023</year>) <volume>24</volume>:<fpage>bbad086</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bib/bbad086</pub-id>
</citation>
</ref>
<ref id="B116">
<label>116</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cai</surname> <given-names>M</given-names>
</name>
<name>
<surname>Bang</surname> <given-names>S</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>P</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>H</given-names>
</name>
</person-group>. <article-title>ATM-TCR: TCR-epitope binding affinity prediction using a multi-head self-attention model</article-title>. <source>Front Immunol</source>. (<year>2022</year>) <volume>13</volume>:<elocation-id>893247</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2022.893247</pub-id>
</citation>
</ref>
<ref id="B117">
<label>117</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weber</surname> <given-names>A</given-names>
</name>
<name>
<surname>Born</surname> <given-names>J</given-names>
</name>
<name>
<surname>Rodriguez Martinez</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>TITAN: T-cell receptor specificity prediction with bimodal attention networks</article-title>. <source>Bioinformatics</source>. (<year>2021</year>) <volume>37</volume>:<page-range>i237&#x2013;i44</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btab294</pub-id>
</citation>
</ref>
<ref id="B118">
<label>118</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reynisson</surname> <given-names>B</given-names>
</name>
<name>
<surname>Alvarez</surname> <given-names>B</given-names>
</name>
<name>
<surname>Paul</surname> <given-names>S</given-names>
</name>
<name>
<surname>Peters</surname> <given-names>B</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data</article-title>. <source>Nucleic Acids Res</source>. (<year>2020</year>) <volume>48</volume>:<page-range>W449&#x2013;W54</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkaa379</pub-id>
</citation>
</ref>
<ref id="B119">
<label>119</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O'Donnell</surname> <given-names>TJ</given-names>
</name>
<name>
<surname>Rubinsteyn</surname> <given-names>A</given-names>
</name>
<name>
<surname>Laserson</surname> <given-names>U</given-names>
</name>
</person-group>. <article-title>MHCflurry 2.0: improved pan-allele prediction of MHC class I-presented peptides by incorporating antigen processing</article-title>. <source>Cell Syst</source>. (<year>2020</year>) <volume>11</volume>:<page-range>418&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.cels.2020.09.001</pub-id>
</citation>
</ref>
<ref id="B120">
<label>120</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>H</given-names>
</name>
<name>
<surname>Wan</surname> <given-names>F</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>L</given-names>
</name>
<name>
<surname>Xiong</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>ACME: pan-specific peptide-MHC class I binding prediction through attention-based deep neural networks</article-title>. <source>Bioinformatics</source>. (<year>2019</year>) <volume>35</volume>:<page-range>4946&#x2013;54</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btz427</pub-id>
</citation>
</ref>
<ref id="B121">
<label>121</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Molero-Abraham</surname> <given-names>M</given-names>
</name>
<name>
<surname>Lafuente</surname> <given-names>EM</given-names>
</name>
<name>
<surname>Flower</surname> <given-names>DR</given-names>
</name>
<name>
<surname>Reche</surname> <given-names>PA</given-names>
</name>
</person-group>. <article-title>Selection of conserved epitopes from hepatitis C virus for pan-populational stimulation of T-cell responses</article-title>. <source>Clin Dev Immunol</source>. (<year>2013</year>) <volume>2013</volume>:<fpage>601943</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1155/2013/601943</pub-id>
</citation>
</ref>
<ref id="B122">
<label>122</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>H</given-names>
</name>
<name>
<surname>Lund</surname> <given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding</article-title>. <source>Bioinformatics</source>. (<year>2009</year>) <volume>25</volume>:<page-range>1293&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btp137</pub-id>
</citation>
</ref>
<ref id="B123">
<label>123</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stranzl</surname> <given-names>T</given-names>
</name>
<name>
<surname>Larsen</surname> <given-names>MV</given-names>
</name>
<name>
<surname>Lundegaard</surname> <given-names>C</given-names>
</name>
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>NetCTLpan: pan-specific MHC class I pathway epitope predictions</article-title>. <source>Immunogenetics</source>. (<year>2010</year>) <volume>62</volume>:<page-range>357&#x2013;68</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s00251-010-0441-4</pub-id>
</citation>
</ref>
<ref id="B124">
<label>124</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doytchinova</surname> <given-names>IA</given-names>
</name>
<name>
<surname>Guan</surname> <given-names>P</given-names>
</name>
<name>
<surname>Flower</surname> <given-names>DR</given-names>
</name>
</person-group>. <article-title>EpiJen: a server for multistep T cell epitope prediction</article-title>. <source>BMC Bioinf</source>. (<year>2006</year>) <volume>7</volume>:<fpage>131</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-7-131</pub-id>
</citation>
</ref>
<ref id="B125">
<label>125</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reche</surname> <given-names>PA</given-names>
</name>
<name>
<surname>Reinherz</surname> <given-names>EL</given-names>
</name>
</person-group>. <article-title>PEPVAC: a web server for multi-epitope vaccine development based on the prediction of supertypic MHC ligands</article-title>. <source>Nucleic Acids Res</source>. (<year>2005</year>) <volume>33</volume>:<page-range>W138&#x2013;42</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gki357</pub-id>
</citation>
</ref>
<ref id="B126">
<label>126</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Racle</surname> <given-names>J</given-names>
</name>
<name>
<surname>Guillaume</surname> <given-names>P</given-names>
</name>
<name>
<surname>Schmidt</surname> <given-names>J</given-names>
</name>
<name>
<surname>Michaux</surname> <given-names>J</given-names>
</name>
<name>
<surname>Larabi</surname> <given-names>A</given-names>
</name>
<name>
<surname>Lau</surname> <given-names>K</given-names>
</name>
<etal/>
</person-group>. <article-title>Machine learning predictions of MHC-II specificities reveal alternative binding mode of class II epitopes</article-title>. <source>Immunity</source>. (<year>2023</year>) <volume>56</volume>:<fpage>1359</fpage>&#x2013;<lpage>75.e13</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.immuni.2023.03.009</pub-id>
</citation>
</ref>
<ref id="B127">
<label>127</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>B</given-names>
</name>
<name>
<surname>Khodadoust</surname> <given-names>MS</given-names>
</name>
<name>
<surname>Olsson</surname> <given-names>N</given-names>
</name>
<name>
<surname>Wagar</surname> <given-names>LE</given-names>
</name>
<name>
<surname>Fast</surname> <given-names>E</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>CL</given-names>
</name>
<etal/>
</person-group>. <article-title>Predicting HLA class II antigen presentation through integrated deep learning</article-title>. <source>Nat Biotechnol</source>. (<year>2019</year>) <volume>37</volume>:<page-range>1332&#x2013;43</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41587-019-0280-2</pub-id>
</citation>
</ref>
<ref id="B128">
<label>128</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abelin</surname> <given-names>JG</given-names>
</name>
<name>
<surname>Harjanto</surname> <given-names>D</given-names>
</name>
<name>
<surname>Malloy</surname> <given-names>M</given-names>
</name>
<name>
<surname>Suri</surname> <given-names>P</given-names>
</name>
<name>
<surname>Colson</surname> <given-names>T</given-names>
</name>
<name>
<surname>Goulding</surname> <given-names>SP</given-names>
</name>
<etal/>
</person-group>. <article-title>Defining HLA-II ligand processing and binding rules with mass spectrometry enhances cancer epitope prediction</article-title>. <source>Immunity</source>. (<year>2019</year>) <volume>51</volume>:<fpage>766</fpage>&#x2013;<lpage>79.e17</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.immuni.2019.08.012</pub-id>
</citation>
</ref>
<ref id="B129">
<label>129</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nielsen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Andreatta</surname> <given-names>M</given-names>
</name>
</person-group>. <article-title>NNAlign: a platform to construct and evaluate artificial neural network models of receptor-ligand interactions</article-title>. <source>Nucleic Acids Res</source>. (<year>2017</year>) <volume>45</volume>:<page-range>W344&#x2013;W9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkx276</pub-id>
</citation>
</ref>
<ref id="B130">
<label>130</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Atanasova</surname> <given-names>M</given-names>
</name>
<name>
<surname>Patronov</surname> <given-names>A</given-names>
</name>
<name>
<surname>Dimitrov</surname> <given-names>I</given-names>
</name>
<name>
<surname>Flower</surname> <given-names>DR</given-names>
</name>
<name>
<surname>Doytchinova</surname> <given-names>I</given-names>
</name>
</person-group>. <article-title>EpiDOCK: a molecular docking-based tool for MHC class II binding prediction</article-title>. <source>Protein Eng Des Sel</source>. (<year>2013</year>) <volume>26</volume>:<page-range>631&#x2013;4</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/protein/gzt018</pub-id>
</citation>
</ref>
<ref id="B131">
<label>131</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karnaukhov</surname> <given-names>VK</given-names>
</name>
<name>
<surname>Shcherbinin</surname> <given-names>DS</given-names>
</name>
<name>
<surname>Chugunov</surname> <given-names>AO</given-names>
</name>
<name>
<surname>Chudakov</surname> <given-names>DM</given-names>
</name>
<name>
<surname>Efremov</surname> <given-names>RG</given-names>
</name>
<name>
<surname>Zvyagin</surname> <given-names>IV</given-names>
</name>
<etal/>
</person-group>. <article-title>Structure-based prediction of T cell receptor recognition of unseen epitopes using TCRen</article-title>. <source>Nat Comput Sci</source>. (<year>2024</year>) <volume>4</volume>:<page-range>510&#x2013;21</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s43588-024-00653-0</pub-id>
</citation>
</ref>
<ref id="B132">
<label>132</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Springer</surname> <given-names>I</given-names>
</name>
<name>
<surname>Tickotsky</surname> <given-names>N</given-names>
</name>
<name>
<surname>Louzoun</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>Contribution of T cell receptor alpha and beta CDR3, MHC typing, V and J genes to peptide binding prediction</article-title>. <source>Front Immunol</source>. (<year>2021</year>) <volume>12</volume>:<elocation-id>664514</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2021.664514</pub-id>
</citation>
</ref>
<ref id="B133">
<label>133</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ong</surname> <given-names>E</given-names>
</name>
<name>
<surname>Cooke</surname> <given-names>MF</given-names>
</name>
<name>
<surname>Huffman</surname> <given-names>A</given-names>
</name>
<name>
<surname>Xiang</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Wong</surname> <given-names>MU</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>H</given-names>
</name>
<etal/>
</person-group>. <article-title>Vaxign2: the second generation of the first Web-based vaccine design program using reverse vaccinology and machine learning</article-title>. <source>Nucleic Acids Res</source>. (<year>2021</year>) <volume>49</volume>:<page-range>W671&#x2013;W8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkab279</pub-id>
</citation>
</ref>
<ref id="B134">
<label>134</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reche</surname> <given-names>PA</given-names>
</name>
<name>
<surname>Glutting</surname> <given-names>JP</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>H</given-names>
</name>
<name>
<surname>Reinherz</surname> <given-names>EL</given-names>
</name>
</person-group>. <article-title>Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles</article-title>. <source>Immunogenetics</source>. (<year>2004</year>) <volume>56</volume>:<page-range>405&#x2013;19</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s00251-004-0709-7</pub-id>
</citation>
</ref>
<ref id="B135">
<label>135</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taguchi</surname> <given-names>AT</given-names>
</name>
<name>
<surname>Boyd</surname> <given-names>J</given-names>
</name>
<name>
<surname>Diehnelt</surname> <given-names>CW</given-names>
</name>
<name>
<surname>Legutki</surname> <given-names>JB</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>ZG</given-names>
</name>
<name>
<surname>Woodbury</surname> <given-names>NW</given-names>
</name>
</person-group>. <article-title>Comprehensive prediction of molecular recognition in a combinatorial chemical space using machine learning</article-title>. <source>ACS Comb Sci</source>. (<year>2020</year>) <volume>22</volume>:<page-range>500&#x2013;8</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1021/acscombsci.0c00003</pub-id>
</citation>
</ref>
<ref id="B136">
<label>136</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Renard</surname> <given-names>BY</given-names>
</name>
<name>
<surname>Lower</surname> <given-names>M</given-names>
</name>
<name>
<surname>Kuhne</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Reimer</surname> <given-names>U</given-names>
</name>
<name>
<surname>Rothermel</surname> <given-names>A</given-names>
</name>
<name>
<surname>Tureci</surname> <given-names>O</given-names>
</name>
<etal/>
</person-group>. <article-title>rapmad: Robust analysis of peptide microarray data</article-title>. <source>BMC Bioinf</source>. (<year>2011</year>) <volume>12</volume>:<fpage>324</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-12-324</pub-id>
</citation>
</ref>
<ref id="B137">
<label>137</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ogura</surname> <given-names>T</given-names>
</name>
<name>
<surname>Sato</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>An automatic particle pickup method using a neural network applicable to low-contrast electron micrographs</article-title>. <source>J Struct Biol</source>. (<year>2001</year>) <volume>136</volume>:<page-range>227&#x2013;38</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1006/jsbi.2002.4442</pub-id>
</citation>
</ref>
<ref id="B138">
<label>138</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ogura</surname> <given-names>T</given-names>
</name>
<name>
<surname>Sato</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>Automatic particle pickup method using a neural network has high accuracy by applying an initial weight derived from eigenimages: a new reference free method for single-particle analysis</article-title>. <source>J Struct Biol</source>. (<year>2004</year>) <volume>145</volume>:<fpage>63</fpage>&#x2013;<lpage>75</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S1047-8477(03)00139-4</pub-id>
</citation>
</ref>
<ref id="B139">
<label>139</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lei</surname> <given-names>H</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>Y</given-names>
</name>
</person-group>. <article-title>CDAE: A cascade of denoising autoencoders for noise reduction in the clustering of single-particle cryo-EM images</article-title>. <source>Front Genet</source>. (<year>2020</year>) <volume>11</volume>:<elocation-id>627746</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fgene.2020.627746</pub-id>
</citation>
</ref>
<ref id="B140">
<label>140</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sanchez-Garcia</surname> <given-names>R</given-names>
</name>
<name>
<surname>Segura</surname> <given-names>J</given-names>
</name>
<name>
<surname>Maluenda</surname> <given-names>D</given-names>
</name>
<name>
<surname>Carazo</surname> <given-names>JM</given-names>
</name>
<name>
<surname>Sorzano</surname> <given-names>COS</given-names>
</name>
</person-group>. <article-title>Deep Consensus, a deep learning-based approach for particle pruning in cryo-electron microscopy</article-title>. <source>IUCrJ</source>. (<year>2018</year>) <volume>5</volume>:<page-range>854&#x2013;65</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1107/S2052252518014392</pub-id>
</citation>
</ref>
<ref id="B141">
<label>141</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tegunov</surname> <given-names>D</given-names>
</name>
<name>
<surname>Cramer</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>Real-time cryo-electron microscopy data preprocessing with Warp</article-title>. <source>Nat Methods</source>. (<year>2019</year>) <volume>16</volume>:<page-range>1146&#x2013;52</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41592-019-0580-y</pub-id>
</citation>
</ref>
<ref id="B142">
<label>142</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yokoyama</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Terada</surname> <given-names>T</given-names>
</name>
<name>
<surname>Shimizu</surname> <given-names>K</given-names>
</name>
<name>
<surname>Nishikawa</surname> <given-names>K</given-names>
</name>
<name>
<surname>Kozai</surname> <given-names>D</given-names>
</name>
<name>
<surname>Shimada</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>Development of a deep learning-based method to identify "good" regions of a cryo-electron microscopy grid</article-title>. <source>Biophys Rev</source>. (<year>2020</year>) <volume>12</volume>:<page-range>349&#x2013;54</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s12551-020-00669-6</pub-id>
</citation>
</ref>
<ref id="B143">
<label>143</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Avramov</surname> <given-names>TK</given-names>
</name>
<name>
<surname>Vyenielo</surname> <given-names>D</given-names>
</name>
<name>
<surname>Gomez-Blanco</surname> <given-names>J</given-names>
</name>
<name>
<surname>Adinarayanan</surname> <given-names>S</given-names>
</name>
<name>
<surname>Vargas</surname> <given-names>J</given-names>
</name>
<name>
<surname>Si</surname> <given-names>D</given-names>
</name>
</person-group>. <article-title>Deep learning for validating and estimating resolution of cryo-electron microscopy density maps (dagger)</article-title>. <source>Molecules</source>. (<year>2019</year>) <volume>24</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/molecules24061181</pub-id>
</citation>
</ref>
<ref id="B144">
<label>144</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pfab</surname> <given-names>J</given-names>
</name>
<name>
<surname>Phan</surname> <given-names>NM</given-names>
</name>
<name>
<surname>Si</surname> <given-names>D</given-names>
</name>
</person-group>. <article-title>DeepTracer for fast <italic>de novo</italic> cryo-EM protein structure modeling and special studies on CoV-related complexes</article-title>. <source>Proc Natl Acad Sci U.S.A</source>. (<year>2021</year>) <volume>118</volume>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.2017525118</pub-id>
</citation>
</ref>
<ref id="B145">
<label>145</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Si</surname> <given-names>D</given-names>
</name>
<name>
<surname>Moritz</surname> <given-names>SA</given-names>
</name>
<name>
<surname>Pfab</surname> <given-names>J</given-names>
</name>
<name>
<surname>Hou</surname> <given-names>J</given-names>
</name>
<name>
<surname>Cao</surname> <given-names>R</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>L</given-names>
</name>
<etal/>
</person-group>. <article-title>Deep learning to predict protein backbone structure from high-resolution cryo-EM density maps</article-title>. <source>Sci Rep</source>. (<year>2020</year>) <volume>10</volume>:<fpage>4282</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-020-60598-y</pub-id>
</citation>
</ref>
<ref id="B146">
<label>146</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hryc</surname> <given-names>CF</given-names>
</name>
<name>
<surname>Baker</surname> <given-names>ML</given-names>
</name>
</person-group>. <article-title>AlphaFold2 and CryoEM: Revisiting CryoEM modeling in near-atomic resolution density maps</article-title>. <source>iScience</source>. (<year>2022</year>) <volume>25</volume>:<fpage>104496</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.isci.2022.104496</pub-id>
</citation>
</ref>
<ref id="B147">
<label>147</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname> <given-names>YH</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>J</given-names>
</name>
<name>
<surname>Ge</surname> <given-names>F</given-names>
</name>
<name>
<surname>Li</surname> <given-names>F</given-names>
</name>
<name>
<surname>Song</surname> <given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Y</given-names>
</name>
<etal/>
</person-group>. <article-title>Accurate multistage prediction of protein crystallization propensity using deep-cascade forest with sequence-based features</article-title>. <source>Brief Bioinform</source>. (<year>2021</year>) <volume>22</volume>:<fpage>bbaa076</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bib/bbaa076</pub-id>
</citation>
</ref>
<ref id="B148">
<label>148</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname> <given-names>PH</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>YH</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>X</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>DJ</given-names>
</name>
</person-group>. <article-title>GCmapCrys: Integrating graph attention network with predicted contact map for multi-stage protein crystallization propensity prediction</article-title>. <source>Anal Biochem</source>. (<year>2023</year>) <volume>663</volume>:<fpage>115020</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ab.2022.115020</pub-id>
</citation>
</ref>
<ref id="B149">
<label>149</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bruno</surname> <given-names>AE</given-names>
</name>
<name>
<surname>Charbonneau</surname> <given-names>P</given-names>
</name>
<name>
<surname>Newman</surname> <given-names>J</given-names>
</name>
<name>
<surname>Snell</surname> <given-names>EH</given-names>
</name>
<name>
<surname>So</surname> <given-names>DR</given-names>
</name>
<name>
<surname>Vanhoucke</surname> <given-names>V</given-names>
</name>
<etal/>
</person-group>. <article-title>Classification of crystallization outcomes using deep convolutional neural networks</article-title>. <source>PLoS One</source>. (<year>2018</year>) <volume>13</volume>:<elocation-id>e0198883</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0198883</pub-id>
</citation>
</ref>
<ref id="B150">
<label>150</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bond</surname> <given-names>PS</given-names>
</name>
<name>
<surname>Wilson</surname> <given-names>KS</given-names>
</name>
<name>
<surname>Cowtan</surname> <given-names>KD</given-names>
</name>
</person-group>. <article-title>Predicting protein model correctness in Coot using machine learning</article-title>. <source>Acta Crystallogr D Struct Biol</source>. (<year>2020</year>) <volume>76</volume>:<page-range>713&#x2013;23</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1107/S2059798320009080</pub-id>
</citation>
</ref>
<ref id="B151">
<label>151</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khurana</surname> <given-names>S</given-names>
</name>
<name>
<surname>Rawi</surname> <given-names>R</given-names>
</name>
<name>
<surname>Kunji</surname> <given-names>K</given-names>
</name>
<name>
<surname>Chuang</surname> <given-names>GY</given-names>
</name>
<name>
<surname>Bensmail</surname> <given-names>H</given-names>
</name>
<name>
<surname>Mall</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>DeepSol: a deep learning framework for sequence-based protein solubility prediction</article-title>. <source>Bioinformatics</source>. (<year>2018</year>) <volume>34</volume>:<page-range>2605&#x2013;13</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/bty166</pub-id>
</citation>
</ref>
<ref id="B152">
<label>152</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rahmani</surname> <given-names>V</given-names>
</name>
<name>
<surname>Nawaz</surname> <given-names>S</given-names>
</name>
<name>
<surname>Pennicard</surname> <given-names>D</given-names>
</name>
<name>
<surname>Setty</surname> <given-names>SPR</given-names>
</name>
<name>
<surname>Graafsma</surname> <given-names>H</given-names>
</name>
</person-group>. <article-title>Data reduction for X-ray serial crystallography using machine learning</article-title>. <source>J Appl Crystallogr</source>. (<year>2023</year>) <volume>56</volume>:<page-range>200&#x2013;13</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1107/S1600576722011748</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>