<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Immunol.</journal-id>
<journal-title>Frontiers in Immunology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Immunol.</abbrev-journal-title>
<issn pub-type="epub">1664-3224</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fimmu.2013.00357</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Immunology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Germline Amino Acid Diversity in B Cell Receptors is a Good Predictor of Somatic Selection Pressures</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Schwartz</surname> <given-names>Gregory W.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/118498"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Hershberg</surname> <given-names>Uri</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/105693"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Systems Immunology Laboratory, School of Biomedical Engineering, Science, and Health Systems, Drexel University</institution>, <addr-line>Philadelphia, PA</addr-line>, <country>USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Microbiology and Immunology, College of Medicine, Drexel University</institution>, <addr-line>Philadelphia, PA</addr-line>, <country>USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Michal Or-Guil, Humboldt University Berlin, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Andrew M. Collins, University of New South Wales, Australia; Nicole Wittenbrink, Humboldt University Berlin, Germany</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Uri Hershberg, Systems Immunology Laboratory, School of Biomedical Engineering, Science, and Health Systems, Drexel University, 3141 Chestnut Street, Philadelphia, PA 19104, USA e-mail: <email>uri.hershberg&#x00040;drexel.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to B Cell Biology, a section of the journal Frontiers in Immunology.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>12</day>
<month>09</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="epub">
<day>08</day>
<month>11</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="collection">
<year>2013</year>
</pub-date><volume>4</volume>
<elocation-id>357</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>09</month>
<year>2013</year>
</date>
<date date-type="accepted">
<day>21</day>
<month>10</month>
<year>2013</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2013 Schwartz and Hershberg.</copyright-statement>
<copyright-year>2013</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The diversity of the immune repertoire is important for the adaptive immune system&#x02019;s ability to detect pathogens. Much of this diversity is generated in two steps, first through the recombination of germline gene segments and second through hypermutation during an immune response. While both steps are to some extent based on the germline level repertoire of genes, the final structure and selection of specific receptors is at the somatic level. How germline diversity and selection relate to somatic diversity and selection has not been clear. To investigate how germline diversity relates to somatic diversity and selection, we considered the published repertoire of Ig heavy chain V genes taken from the blood of 12 individuals, post-vaccination against influenza, sequenced by 454 high-throughput sequencing. We here show that when we consider individual amino acid positions in the heavy chain V gene sequence, there exists a strong correlation between the diversity of the germline repertoire at a position and the number of B cell clones that change amino acids at that position. At the same time, we find that the diversity of amino acids used in the mutated positions is greater than in the germline, albeit still correlated to germline diversity. From these findings, we propose that while germline diversity and germline amino acid usage at a given position do not fully specify the amino acid mutant needed to promote survival of specific clones, germline diversity at a given position is a good indicator for the potential to survive after somatic mutation at that position. We would therefore suggest that germline diversity at each specific position is the better <italic>a priori</italic> model for the effects of somatic mutation and selection, than simply the division into complementarity determining and framework regions.</p>
</abstract>
<kwd-group>
<kwd>B cells</kwd>
<kwd>somatic hypermutation</kwd>
<kwd>selection</kwd>
<kwd>diversity</kwd>
<kwd>evolution</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="1"/>
<equation-count count="2"/>
<ref-count count="19"/>
<page-count count="7"/>
<word-count count="5018"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<label>1</label> <title>Introduction</title>
<p>The adaptive immune system&#x02019;s ability to react to disease is based on the diversity of its immune repertoire. In the case of B cells, this diversity is generated in two rounds: the recombination of germline gene segments (V, D, and J for heavy chains, V and J for light chains) to create the B cell receptor (BCR), (<xref ref-type="bibr" rid="B1">1</xref>&#x02013;<xref ref-type="bibr" rid="B3">3</xref>) as well as somatic hypermutation during an immune response (<xref ref-type="bibr" rid="B4">4</xref>&#x02013;<xref ref-type="bibr" rid="B6">6</xref>). In both cases, these diversification processes are coupled with stringent somatic selection based on the binding affinity of the BCR (<xref ref-type="bibr" rid="B7">7</xref>). Thus, while the initial state of the BCR is at least somewhat based on an individual&#x02019;s germline genes, the final structure of specific BCR mutants is based on somatic selection processes related to the binding affinity of the BCR. It remains unclear how selection and diversity at the germline level relate to selection at the somatic level. In this analysis, we demonstrate a link between the diversity and selection at the germline and somatic levels for V genes.</p>
<p>The germline genes encoding the different regions of the BCR are themselves diverse, even before considering the diversification produced from the recombination of different gene segments. Specifically in V genes, this diversity is non-uniformly spread across the gene sequence. Some positions always utilize the same amino acid in all V genes while others can utilize many different amino acids. This differential diversity is considered an indicator of the functional role of each position in the eventual tertiary structure of the receptor. Variable regions of the V gene sequences, called complementarity determining regions (CDR), are thought to be those that encode regions which interact with antigens, while the more invariant positions, called framework regions (FR), are proposed to be involved in the backbone of the receptor (<xref ref-type="bibr" rid="B8">8</xref>). It has been generally thought that somatic selection segregates along these two regions. Positive selection is thought to occur in the CDR, while mutations in the FR were mostly debilitating to affinity and lethal to the cell (<xref ref-type="bibr" rid="B5">5</xref>). It is now clear that this segregation is not strictly true &#x02013; positively selected key mutations can be found in the FR (<xref ref-type="bibr" rid="B9">9</xref>) and negative selection can be seen in mutations throughout the sequence (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>Previously, the diversity measurements of the receptors were based on differing diversity indexes with varying appropriateness and on partial sets of germline and mutant sequences (<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B11">11</xref>). We directly measured the &#x0201C;true&#x0201D; diversity of light chain V genes and heavy chain V genes (<italic>V<sub>H</sub></italic>) based on the entire known BCR germline repertoire as found in the IMGT Ig gene database (<xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B13">13</xref>). We demonstrated that the pattern of diversity in all V genes is non-uniform, with most positions showing a low level of diversity (2&#x02013;5 relevant amino acids) and a few exhibiting higher levels of diversity (up to 10 relevant amino acids). If we rank all the positions in the sequence by their diversity, we can explicitly show that while the CDR is enriched for high diversity positions, many CDR positions have diversities as low as those found in FR and some FR positions have quite a high diversity (<xref ref-type="bibr" rid="B12">12</xref>). We previously suggested that it is the diversity of positions, not solely their association with the contiguous CDR or FR positions, which determines their functional role and the consequence of mutation.</p>
<p>The diversity of positions in the germline repertoire of V genes is the result of evolutionary selection of individuals and their progeny. The process of affinity maturation is based on somatic mutation and selection. It has thus far been unclear how these two processes of selection are related and if they can be connected at the V gene sequence level. To study this possible relationship, we considered a published dataset of &#x0223C;17,000 recombined BCR <italic>V<sub>H</sub></italic> gene sequences from 12 individuals (<xref ref-type="bibr" rid="B14">14</xref>). We divided sequences by their clonotypes, identifying the clonal origin for each recombined sequence. In this way we could now count how many times each position was mutated in the repertoire. Comparing the number of individual times a position was mutated to its germline diversity (<xref ref-type="bibr" rid="B12">12</xref>), we found that while synonymous mutations were evenly spread across all positions, there was a clear positive correlation between the number of times a position had a mutated amino acid and that position&#x02019;s diversity in the germline repertoire. From this we conclude that the diversity at the germline level is an indication for the potential for somatic harm as a result of mutation. The diversity of each specific position is a more direct measure of the functional consequence of mutation and selection at the somatic level than a mere division into CDR and FR.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<label>2</label> <title>Materials and Methods</title>
<sec id="S2-1">
<label>2.1</label> <title>Sequences analyzed</title>
<p>We analyzed the amino acid and nucleotide sequences of <italic>Homo sapiens</italic> BCR recombined <italic>V<sub>H</sub></italic> genes (<xref ref-type="bibr" rid="B14">14</xref>). The sequences came from twelve healthy individuals post-vaccination against influenza (<xref ref-type="bibr" rid="B14">14</xref>). The individuals came from two age cohorts: 6 young (age range 19&#x02013;45) and 6 old (age range 70&#x02013;89). Sequences were acquired at days 0, 7, and 28 post-vaccination and included both IgG, IgM, and IgA class switched receptors. We divided the sequences into clones by fully aligning their nucleotide sequences to the germline V, D, and J genes from the IMGT Ig database (<xref ref-type="bibr" rid="B13">13</xref>). All sequences that shared the same germline source (V, J, and CDR3 length) were considered to be from the same clone. We filtered out sequences with &#x02265;30&#x02009;nt point mutations from the germline. This alignment resulted in the identification of 17,553 sequences divided into 9482 clones. Due to sequencing issues in the original dataset, we only analyzed the sequences from position 25 and on. IMGT numbering leaves gaps in order to remain consistent with all V genes. Also, the length of V genes is not always identical. Therefore, we only calculated germline diversity for amino acid positions 25 &#x02212; 30, 35 &#x02212; 59, 63 &#x02212; 72, and 74 &#x02212; 106, leaving us with 74 positions in the analysis. These positions were verified for adequate sampling by the use of rarefaction curves at each position (<xref ref-type="bibr" rid="B15">15</xref>). We considered a position viable if more than 99% of the curve consisted of a richness of &#x02265;95% of the height of the curve (<xref ref-type="bibr" rid="B12">12</xref>). These curves rule out the possibility of having too many gaps in the germline repertoire.</p>
<p>We calculated the germline diversity per amino acid position using BCR <italic>V<sub>H</sub></italic> genes collected from IMGT as in Ref. (<xref ref-type="bibr" rid="B12">12</xref>). We filtered out non-functional, partial, and duplicate sequences for the analysis. All sequences were numbered according to the IMGT unique numbering system based off of the universal alignment provided by IMGT (<xref ref-type="bibr" rid="B13">13</xref>). We defined CDR and FR positions as in Ref. (<xref ref-type="bibr" rid="B16">16</xref>).</p>
</sec>
<sec id="S2-2">
<label>2.2</label> <title>Diversity measures</title>
<p>We measured the diversity of amino acids per position as in Ref. (<xref ref-type="bibr" rid="B12">12</xref>) with an order of diversity equal to 1. The process of measuring diversity is dependent on the order, or &#x0201C;Hill number&#x0201D; (<xref ref-type="bibr" rid="B17">17</xref>), we use during calculations. While measuring the effective number of species, the order affects the influence of the sample abundances. An order of 0 does not consider abundances, thus all types are considered equally (this is equivalent to the number of different types, also called &#x0201C;richness&#x0201D;). An order gives greater weight to rare species, while an order &#x0003E;1 gives greater weight to common species. When the order is 1, the effective diversity is determined without any bias (<xref ref-type="bibr" rid="B18">18</xref>). We previously described the result of analyzing the diversity of the different amino acid positions in the V gene germline repertoire at different orders of diversity (<xref ref-type="bibr" rid="B12">12</xref>). We decided here to focus on the order of 1 as we found no <italic>a priori</italic> reason to bias toward either the more commonly used amino acids at each position or toward the rare amino acids.</p>
<p>At each position <italic>p</italic>, the number of amino acids at that position was <italic>N<sub>p</sub></italic> and the richness of the amino acids at that position was <italic>R<sub>p</sub></italic>.</p>
<p>The measure of diversity used for these positions was &#x0201C;true&#x0201D; diversity <italic><sup>q</sup>D<sub>p</sub></italic>, where
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mrow><mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mi>q</mml:mi></mml:msup><mml:msub><mml:mi>D</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow><mml:mo class="MathClass-rel">&#x02261;</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo class="MathClass-op">&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo class="MathClass-rel">&#x0003D;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover><mml:msubsup><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mn>1</mml:mn><mml:mo class="MathClass-bin">&#x02215;</mml:mo><mml:mn>1</mml:mn><mml:mo class="MathClass-bin">&#x02212;</mml:mo><mml:mi>q</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:msup></mml:math></disp-formula>
and <italic>q</italic> is the order of diversity and <italic>p<sub>i</sub></italic> is the frequency of amino acid <italic>i</italic> (<xref ref-type="bibr" rid="B17">17</xref>, <xref ref-type="bibr" rid="B18">18</xref>). At <italic>q</italic>&#x02009;&#x0003D;&#x02009;1, equation (1) does not exist, however the limit as <italic>q</italic> approaches 1 is
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mrow><mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mi>1</mml:mi></mml:msup><mml:msub><mml:mi>D</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow><mml:mo class="MathClass-rel">&#x02261;</mml:mo><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:mi>p</mml:mi><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mo class="MathClass-bin">&#x02212;</mml:mo><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo class="MathClass-op">&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo class="MathClass-rel">&#x0003D;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">ln</mml:mi><mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></disp-formula></p>
</sec>
<sec id="S2-3">
<label>2.3</label> <title>Definition of position categories</title>
<p>For every amino acid position, we counted &#x02013; across all clones from any time point and person &#x02013; the number of times a position changed amino acids and how many times that position maintained its amino acid from the germline. If a position was found to change into several amino acids in a single clone, that position was counted once for each different amino acid. The cases where amino acids were maintained relative to the germline were in some cases further divided into non-mutated and synonymous mutations. The amino acids collected in each category (changed, maintained, or synonymous mutation) were then further analyzed for their diversity and amino acid composition tendencies.</p>
</sec>
<sec id="S2-4">
<label>2.4</label> <title>Correlations of diversity in germline positions versus changed or maintained amino acid position categories</title>
<p>Using a two sided Spearman&#x02019;s rank correlation test, we assessed the correlation of germline diversity of human <italic>V<sub>H</sub></italic> genes, as calculated in Ref. (<xref ref-type="bibr" rid="B12">12</xref>), with the counts and diversities of the three categories (changed amino acid, maintained amino acid, and synonymous mutation) described above.</p>
</sec>
<sec id="S2-5">
<label>2.5</label> <title>Amino acid usage analysis</title>
<p>We assessed if position categories were biased toward specific amino acid usage types. Following our definitions of Ig relevant amino acid categorization by hydrophobicity and tendency to be found on the surface of the receptor (<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B19">19</xref>), we categorized amino acids as hydrophobic (IVLFCMW), neutral (AGTSYPH), and hydrophilic (NDQEKR) (<xref ref-type="bibr" rid="B12">12</xref>). We then categorized a position by how biased that position was to using amino acids from only one of these categories. If a position used only amino acids from one category, that position was considered to be of that type (i.e., a hydrophobic, a neutral, or a hydrophilic position). If the position had both neutral and one other category of amino acids, that position would be considered a &#x0201C;weak&#x0201D; version of that category (i.e., weak hydrophobic or weak hydrophilic). If there were amino acids in all categories, then that position was considered indeterminate. In all instances, if a position had a single amino acid in one category, and three or more in another category, the single amino acid category was ignored (<xref ref-type="bibr" rid="B12">12</xref>).</p>
</sec>
</sec>
<sec id="S3">
<label>3</label> <title>Results</title>
<sec id="S3-6">
<label>3.1</label> <title>Correlation of germline diversity to number of changed amino acids per position</title>
<p>When comparing the germline diversity at each position &#x02013; as calculated from the prototypical IMGT database (<xref ref-type="bibr" rid="B12">12</xref>) &#x02013; and the number of unique changed amino acids at each position, we find that these two properties are highly positively correlated (&#x003C1;&#x02009;&#x0003D;&#x02009;0.710, <italic>p</italic>&#x02009;&#x0003D;&#x02009;1.41&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;12</sup>). This correlation holds true if we consider CDR (&#x003C1;&#x02009;&#x0003D;&#x02009;0.676, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.18&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;4</sup>) and FR (&#x003C1;&#x02009;&#x0003D;&#x02009;0.668, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.17&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;7</sup>) positions separately and if we consider all positions as a whole (Figure <xref ref-type="fig" rid="F1">1</xref>). While this correlation is monotonic, it is by no means strictly linear as the linear model explains only &#x0223C;43% of variation in the extent of amino acid exchanges at the different positions (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.433). Interestingly, while in general the CDR positions with similar diversity have more changed amino acids than most FR positions of similar germline diversity and the linear fits to CDR and FR are distinct, FR and CDR positions are not clearly separated in this plane (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption>
<p><bold>The number of unique changed amino acids versus the diversity of order 1 of the germline sequences per position</bold>. The points are labeled by their IMGT sequence position number and if they are found in the CDR (red) or the FR (blue). The lines represent linear regressions for all positions (black, <italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.433), for FR positions (dashed blue, <italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.289), and for CDR positions (dotted red, <italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.349). We found a significant positive correlation for all positions (&#x003C1;&#x02009;&#x0003D;&#x02009;0.710, <italic>p</italic>&#x02009;&#x0003D;&#x02009;1.41&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;12</sup>). By correlating the positions based on FR and CDR, we found a significant positive correlation for both FR (&#x003C1;&#x02009;&#x0003D;&#x02009;0.668, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.17&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;7</sup>) and CDR (&#x003C1;&#x02009;&#x0003D;&#x02009;0.676, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.18&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;4</sup>).</p></caption>
<graphic xlink:href="fimmu-04-00357-g001.tif"/>
</fig>
<p>The analysis with synonymous mutations shows no correlation (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;1.87&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;3</sup>, &#x003C1;&#x02009;&#x0003D;&#x02009;0.232, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.0468) and similar mutation levels across the entire range of germline position diversities and no difference between CDR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0322, &#x003C1;&#x02009;&#x0003D;&#x02009;&#x02212;&#x02009;0.223, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.273) and FR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0127, &#x003C1;&#x02009;&#x0003D;&#x02009;0.289, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.0466) positions (Figure <xref ref-type="fig" rid="F2">2</xref>). The results found using the diversity of the germline repertoire were at the whole repertoire level. No division into certain germlines was necessary and so the possibility for misidentification of the germlines by IMGT would have little to no impact on the diversity at the repertoire level. Moreover, when splitting up the analysis of clones by the germline they aligned with, there was no real difference in between different germlines and at the repertoire level (results now shown).</p>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption>
<p><bold>The number of synonymous mutations versus the diversity of order 1 of the germline sequences per position</bold>. We found no positive correlation with a flat trend (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;1.87&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;3</sup>, &#x003C1;&#x02009;&#x0003D;&#x02009;0.232, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.0468). When splitting by region, we found no correlation for FR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0127, &#x003C1;&#x02009;&#x0003D;&#x02009;0.289, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.0466) and CDR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0322, &#x003C1;&#x02009;&#x0003D;&#x02009;&#x02212;&#x02009;0.223, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.273).</p></caption>
<graphic xlink:href="fimmu-04-00357-g002.tif"/>
</fig>
</sec>
<sec id="S3-7">
<label>3.2</label> <title>Correlation of germline diversity to changed or maintained diversity per position</title>
<p>We next looked to see how the actual amino acid diversity of the mutant repertoire at the different positions related to the germline diversity. We found that the maintained positions had a diversity that was essentially identical to that found in the IMGT based germline repertoire (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.947, &#x003C1;&#x02009;&#x0003D;&#x02009;0.961, <italic>p</italic>&#x02009;&#x0003D;&#x02009;7.52&#x02009;&#x000D7;&#x02009;10<sup>4</sup>2) (Figure <xref ref-type="fig" rid="F3">3</xref>A). In the changed positions a more complex pattern emerges. While overall we find again that there is a positive correlation between germline diversity and the diversity of the changed amino acids (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.284, &#x003C1;&#x02009;&#x0003D;&#x02009;0.359, <italic>p</italic>&#x02009;&#x0003D;&#x02009;1.70&#x02009;&#x000D7;&#x02009;10<sup>3</sup>), the range of diversity is much greater in the changed positions (Figure <xref ref-type="fig" rid="F3">3</xref>B). This greater range of diversity is present in both CDR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.419, &#x003C1;&#x02009;&#x0003D;&#x02009;0.580, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.27&#x02009;&#x000D7;&#x02009;10<sup>3</sup>) and FR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0347, &#x003C1;&#x02009;&#x0003D;&#x02009;0.132, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.371). However, when the FR is considered on its own this leads to a lack of significant correlation with germline diversity.</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption>
<p><bold>The diversity of order 1 of the amino acids versus the diversity of order 1 for the germline sequences per position</bold>. <bold>(A)</bold> The diversity of order 1 amino acid usage in amino acids positions. We found a significant positive correlation (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.947, &#x003C1;&#x02009;&#x0003D;&#x02009;0.961, <italic>p</italic>&#x02009;&#x0003D;&#x02009;7.52&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;42</sup>). We also found positive correlations when the positions were split by region for FR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.970, &#x003C1;&#x02009;&#x0003D;&#x02009;0.968, <italic>p</italic>&#x02009;&#x0003D;&#x02009;3.67&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;29</sup>) and CDR (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.937, &#x003C1;&#x02009;&#x0003D;&#x02009;0.880, <italic>p</italic>&#x02009;&#x0003D;&#x02009;1.76&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;6</sup>). <bold>(B)</bold> The diversity of order 1 amino acid usage of unique changed amino acids. We found a positive correlation (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.284, &#x003C1;&#x02009;&#x0003D;&#x02009;0.359, <italic>p</italic>&#x02009;&#x0003D;&#x02009;1.70&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;3</sup>). When splitting the positions by regions, however, FR did not correlate with the germline diversities (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.0347, &#x003C1;&#x02009;&#x0003D;&#x02009;0.132, <italic>p</italic>&#x02009;&#x0003D;&#x02009;0.371) while CDR did have a positive correlation (<italic>r</italic><sup>2</sup>&#x02009;&#x0003D;&#x02009;0.419, &#x003C1;&#x02009;&#x0003D;&#x02009;0.580, <italic>p</italic>&#x02009;&#x0003D;&#x02009;2.27&#x02009;&#x000D7;&#x02009;10<sup>&#x02212;3</sup>).</p></caption>
<graphic xlink:href="fimmu-04-00357-g003.tif"/>
</fig>
</sec>
<sec id="S3-8">
<label>3.3</label> <title>Changes in amino acid usage pattern</title>
<p>We categorized the amino acid usage patterns for each position. We found in the maintained amino acid positions the biases toward using specific amino acid types are maintained. This was especially true for the positions in the germline that had stricter categories of amino acids usage bias. 13 out of 14 hydrophobic positions, 17 out of 19 neutral positions, and 8 out of 10 hydrophilic positions retain the same bias in the maintained positions as in the germline (Table <xref ref-type="table" rid="T1">1</xref>). The positions with the more intermediate biases (weak hydrophobic and weak hydrophilic) in the germline did not adhere as strictly to the same bias category but tend to have changed to one of the neighboring biases. Weak hydrophobic becomes either hydrophobic or neutral. Weak hydrophilic becomes either hydrophilic or neutral (Table <xref ref-type="table" rid="T1">1</xref>). Looking now at the changed position we see that biases change much more (Figure <xref ref-type="fig" rid="F4">4</xref>). Most positions simply become indeterminate (i.e., have no clear bias). However, it is interesting to note that those positions that do have some bias exhibit either exactly the same bias as they have in the germline repertoire or one that is similar (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption>
<p><bold>Number of positions for each amino acid usage bias</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Category</th>
<th align="center">Germline</th>
<th align="center" colspan="7">Maintained</th>
<th align="center" colspan="7">Changed</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Hydrophobic</td>
<td align="center">14</td>
<td align="right">16:</td>
<td align="right">(13,</td>
<td align="right">3,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
<td align="right">7:</td>
<td align="right">(6,</td>
<td align="right">1,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
</tr>
<tr>
<td align="left">Weak hydrophobic</td>
<td align="center">12</td>
<td align="right">0:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
<td align="right">8:</td>
<td align="right">(5,</td>
<td align="right">2,</td>
<td align="right">1,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
</tr>
<tr>
<td align="left">Neutral</td>
<td align="center">19</td>
<td align="right">27:</td>
<td align="right">(0,</td>
<td align="right">6,</td>
<td align="right">17,</td>
<td align="right">4,</td>
<td align="right">0,</td>
<td align="right">0)</td>
<td align="right">2:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">2,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
</tr>
<tr>
<td align="left">Weak hydrophilic</td>
<td align="center">13</td>
<td align="right">3:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">1,</td>
<td align="right">2,</td>
<td align="right">0,</td>
<td align="right">0)</td>
<td align="right">5:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">1,</td>
<td align="right">3,</td>
<td align="right">1,</td>
<td align="right">0)</td>
</tr>
<tr>
<td align="left">Hydrophilic</td>
<td align="center">11</td>
<td align="right">10:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">1,</td>
<td align="right">8,</td>
<td align="right">1)</td>
<td align="right">0:</td>
<td align="right">(0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0,</td>
<td align="right">0)</td>
</tr>
<tr>
<td align="left">Indeterminate</td>
<td align="center">5</td>
<td align="right">18:</td>
<td align="right">(1,</td>
<td align="right">3,</td>
<td align="right">1,</td>
<td align="right">6,</td>
<td align="right">3,</td>
<td align="right">4)</td>
<td align="right">52:</td>
<td align="right">(3,</td>
<td align="right">9,</td>
<td align="right">15,</td>
<td align="right">10,</td>
<td align="right">10,</td>
<td align="right">5)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The numbers in parentheses after the colon signify the classifications of the respective positions in the germline repertoire: hydrophobic, weak hydrophobic, neutral, weak hydrophilic, hydrophilic, indeterminate.</italic></p>
</table-wrap-foot>
</table-wrap>
<fig position="float" id="F4">
<label>Figure 4</label>
<caption>
<p><bold>Copy of Figure <xref ref-type="fig" rid="F3">3</xref>B annotated for amino acid usage bias</bold>. The coloring for the labels signifies whether that position is hydrophobic (red), weak hydrophobic (light red), neutral (purple), weak hydrophilic (light blue), hydrophilic (blue), or indeterminate (black), while a circle around the position indicates that the position shares the same hydrophobicity bias as the equivalent germline position. All amino acids are changed.</p></caption>
<graphic xlink:href="fimmu-04-00357-g004.tif"/>
</fig>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<label>4</label> <title>Discussion</title>
<p>The specificity of B cell and T cell receptors, while based on genes in the germline, is ultimately not of the germline template. Due to the imprecise nature of V(D)J recombination and, in B cells, somatic mutation, the final affinity of each immune receptor is neither inherited nor heritable. For this reason it is difficult to assess how germline diversity and its selection relate to selection during an immune response and specifically how they relate to the anticipated outcome of somatic mutation during an immune response. We have previously shown that the diversity of the germline V gene repertoire can be characterized by looking at the amino acid diversity of individual positions in the V gene sequence (<xref ref-type="bibr" rid="B12">12</xref>). The distribution of diversity is non-uniform with most, but not all highly diverse positions being found in the CDR. Furthermore, different positions show different biases toward the use of hydrophobic or hydrophilic amino acids (<xref ref-type="bibr" rid="B12">12</xref>). To contrast this picture of germline diversity with somatic changes, we have taken a published sample of the human peripheral B cell repertoire following influenza vaccination. We divided all of the sequences in this dataset into their respective clones and counted the number of times each position in the V gene sequence changed or maintained the amino acid found in the germline origin of its clone. By doing so, we could compare for each position how it contributes to repertoire diversity and its selection when changed from its germline and when it remained the same. Analyzing the maintained positions and their diversity allows us to ask to what extent clonal shift changes the diversity of the repertoire from the germline while analyzing the changed positions describes the effects of selection.</p>
<p>Starting with the maintained positions, we see that the germline diversity exhibited in the prototypic repertoire in the IMGT database, which does not assume any specific biases in <italic>V<sub>H</sub></italic> usage, is recapitulated in even the small and clonally shifted snapshot of the immune repertoire analyzed here. We find a significant linear correlation between germline diversity and that of positions with maintained amino acids (Figure <xref ref-type="fig" rid="F3">3</xref>A) and also clear conservation of amino acid usage biases (Table <xref ref-type="table" rid="T1">1</xref>). Thus, despite the fact that we analyzed &#x0223C;1000 clones per person (out of potentially 10<sup>11</sup> clones) with a significant shift toward certain <italic>V<sub>H</sub></italic> genes, we still identify more or less exactly the same diversity and amino acid usage as described by the IMGT database. This suggests to us that clonal shift does not change the make up of amino acid diversity in the B cell repertoire. Furthermore, the existing IMGT database of human V genes represents this positional diversity well.</p>
<p>With regard to the changed positions, we find that there is a significant positive correlation between the level of diversity in the germline at a specific position and the survival of clones with changed amino acids at that position (Figure <xref ref-type="fig" rid="F1">1</xref>). Such a correlation suggests that there is a relationship between the tendency to diversify a position at the germline level over evolutionary time and the likelihood of mutants at those positions to survive somatic mutation and selection. We do not find any kind of correlation between germline diversity and synonymous mutation level (Figure <xref ref-type="fig" rid="F2">2</xref>). For this reason, while the exact observed levels of mutations and surviving mutants with specific amino acid changes may have also been influenced by biases in somatic mutation targeting or sequencing error, these explanations could not be the only reasons for our results. It would be unreasonable to think that mutation bias and sequencing error would only influence non-synonymous mutation rates and so it is thus quite clear that selection is causing this skew in mutant numbers. While assessing the exact rate of selection is beyond the scope of this paper, we can attempt to use these levels of synonymous mutations to estimate some ballpark level of expected non-synonymous mutations, which under neutral conditions we would assume to be three times as high. We can then see that all the positions with lowest germline diversity must be undergoing quite stringent negative selection and that once germline diversity gets higher (&#x0003E;2) there are some positions that appear to also be undergoing some positive selection. The positions with a germline diversity value &#x0003E;5 show rates of non-synonymous mutations 10- to 20-fold greater than synonymous mutations &#x02013; a clear indicator of strong positive selection.</p>
<p>Another indication that specific positive selection has great influence on the final level of amino acid changed at each position is that the diversity of changed positions is much higher than the germline diversity at those positions (Figure <xref ref-type="fig" rid="F3">3</xref>B). Furthermore, in most cases their bias in usage is indeterminate (Figure <xref ref-type="fig" rid="F4">4</xref>). Thus, while the likelihood of survival is related to germline diversity, the specific change in amino acid that is needed to save the clone is also determined by the specific selection interactions in which that change was positively selected. However, it is worth noting that positions that can be classified (i.e., are not indeterminate) in the mutants all exhibit the same general amino acid bias as the germline repertoire (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<p>Taking all of these findings into account, we propose that germline diversity is a good indicator of the likelihood of survival following mutation but cannot account for the specific amino acid whose usage accounted for survival of a specific clone, although this usage can be approximated. This usage is based on the specific affinity maturation event and immune response that leads to the formation of the clone. We would further conclude that while CDR and FR do roughly segregate the sequence, a better measure of potential selection force is the specific germline diversity of each position. This is especially true for positions with &#x0003C;5 diversity in their germline amino acids. In such positions, while diversity indicates a range of possible levels of surviving mutants, there is no clear distinction between positions in the CDR and the FR. Indeed, the only reason one exists beyond diversity of 5 is that no FR positions have such high germline diversities.</p>
</sec>
<sec id="S5">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>The authors would like to thank Deborah Dunn Walters for giving us access to her data from her experiments and with several interesting conversations on the meaning of its behavior. Funding: Gregory W. Schwartz is funded by the Graduate Assistance in Areas of National Need (GAANN) program.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alt</surname> <given-names>FW</given-names></name> <name><surname>Baltimore</surname> <given-names>D</given-names></name></person-group>. <article-title>Joining of immunoglobulin heavy chain gene segments: implications from a chromosome with evidence of three D-JH fusions</article-title>. <source>Proc Natl Acad Sci U S A</source> (<year>1982</year>) <volume>79</volume>(<issue>13</issue>):<fpage>4118</fpage>&#x02013;<lpage>22</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.79.13.4118</pub-id><pub-id pub-id-type="pmid">6287467</pub-id></citation></ref>
<ref id="B2"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sakano</surname> <given-names>H</given-names></name> <name><surname>Huppi</surname> <given-names>K</given-names></name> <name><surname>Heinrich</surname> <given-names>G</given-names></name> <name><surname>Tonegawa</surname> <given-names>S</given-names></name></person-group>. <article-title>Sequences at the somatic recombination sites of immunoglobulin light-chain genes</article-title>. <source>Nature</source> (<year>1979</year>) <volume>280</volume>(<issue>5720</issue>):<fpage>288</fpage>&#x02013;<lpage>94</lpage>.<pub-id pub-id-type="doi">10.1038/280288a0</pub-id><pub-id pub-id-type="pmid">111144</pub-id></citation></ref>
<ref id="B3"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weigert</surname> <given-names>M</given-names></name> <name><surname>Gatmaitan</surname> <given-names>L</given-names></name> <name><surname>Loh</surname> <given-names>E</given-names></name> <name><surname>Schilling</surname> <given-names>J</given-names></name> <name><surname>Hood</surname> <given-names>L</given-names></name></person-group>. <article-title>Rearrangement of genetic information may produce immunoglobulin diversity</article-title>. <source>Nature</source> (<year>1978</year>) <volume>276</volume>(<issue>5690</issue>):<fpage>785</fpage>&#x02013;<lpage>90</lpage>.<pub-id pub-id-type="doi">10.1038/276785a0</pub-id><pub-id pub-id-type="pmid">103003</pub-id></citation></ref>
<ref id="B4"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>S</given-names></name> <name><surname>Davis</surname> <given-names>M</given-names></name> <name><surname>Sinn</surname> <given-names>E</given-names></name> <name><surname>Patten</surname> <given-names>P</given-names></name> <name><surname>Hood</surname> <given-names>L</given-names></name></person-group>. <article-title>Antibody diversity: somatic hypermutation of rearranged VH genes</article-title>. <source>Cell</source> (<year>1981</year>) <volume>27</volume>(<issue>3</issue>):<fpage>573</fpage>&#x02013;<lpage>81</lpage>.<pub-id pub-id-type="doi">10.1016/0092-8674(81)90399-8</pub-id><pub-id pub-id-type="pmid">6101208</pub-id></citation></ref>
<ref id="B5"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shlomchik</surname> <given-names>MJ</given-names></name> <name><surname>Marshak-Rothstein</surname> <given-names>A</given-names></name> <name><surname>Wolfowicz</surname> <given-names>CB</given-names></name> <name><surname>Rothstein</surname> <given-names>TL</given-names></name> <name><surname>Weigert</surname> <given-names>MG</given-names></name></person-group>. <article-title>The role of clonal selection and somatic mutation in autoimmunity</article-title>. <source>Nature</source> (<year>1987</year>) <volume>328</volume>(<issue>6133</issue>):<fpage>805</fpage>&#x02013;<lpage>11</lpage>.<pub-id pub-id-type="doi">10.1038/328805a0</pub-id></citation></ref>
<ref id="B6"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caton</surname> <given-names>AJ</given-names></name> <name><surname>Swartzentruber</surname> <given-names>JR</given-names></name> <name><surname>Kuhl</surname> <given-names>AL</given-names></name> <name><surname>Carding</surname> <given-names>SR</given-names></name> <name><surname>Stark</surname> <given-names>SE</given-names></name></person-group>. <article-title>Activation and negative selection of functionally distinct subsets of antibody-secreting cells by influenza hemagglutinin as a viral and a neo-self antigen</article-title>. <source>J Exp Med</source> (<year>1996</year>) <volume>183</volume>(<issue>1</issue>):<fpage>13</fpage>&#x02013;<lpage>26</lpage>.<pub-id pub-id-type="doi">10.1084/jem.183.1.13</pub-id><pub-id pub-id-type="pmid">8551216</pub-id></citation></ref>
<ref id="B7"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Casola</surname> <given-names>S</given-names></name> <name><surname>Otipoby</surname> <given-names>KL</given-names></name> <name><surname>Alimzhanov</surname> <given-names>M</given-names></name> <name><surname>Humme</surname> <given-names>S</given-names></name> <name><surname>Uyttersprot</surname> <given-names>N</given-names></name> <name><surname>Kutok</surname> <given-names>JL</given-names></name> <etal/></person-group> <article-title>B cell receptor signal strength determines b cell fate</article-title>. <source>Nat Immunol</source> (<year>2004</year>) <volume>5</volume>(<issue>3</issue>):<fpage>317</fpage>&#x02013;<lpage>27</lpage>.<pub-id pub-id-type="doi">10.1038/ni1036</pub-id><pub-id pub-id-type="pmid">14758357</pub-id></citation></ref>
<ref id="B8"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>TT</given-names></name> <name><surname>Kabat</surname> <given-names>EA</given-names></name></person-group>. <article-title>An analysis of the sequences of the variable regions of Bence Jones proteins and myeloma light chains and their implications for antibody complementarity</article-title>. <source>J Exp Med</source> (<year>1970</year>) <volume>132</volume>(<issue>2</issue>):<fpage>211</fpage>&#x02013;<lpage>50</lpage>.<pub-id pub-id-type="doi">10.1084/jem.132.2.211</pub-id></citation></ref>
<ref id="B9"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>SM</given-names></name> <name><surname>Khalil</surname> <given-names>A</given-names></name> <name><surname>Uduman</surname> <given-names>M</given-names></name> <name><surname>Hershberg</surname> <given-names>U</given-names></name> <name><surname>Louzoun</surname> <given-names>Y</given-names></name> <name><surname>Haberman</surname> <given-names>AM</given-names></name> <etal/></person-group> <article-title>Taking advantage: high-affinity B cells in the germinal center have lower death rates, but similar rates of division, compared to low-affinity cells</article-title>. <source>J Immunol</source> (<year>2009</year>) <volume>183</volume>(<issue>11</issue>):<fpage>7314</fpage>&#x02013;<lpage>25</lpage>.<pub-id pub-id-type="doi">10.4049/jimmunol.0902452</pub-id><pub-id pub-id-type="pmid">19917681</pub-id></citation></ref>
<ref id="B10"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hershberg</surname> <given-names>U</given-names></name> <name><surname>Uduman</surname> <given-names>M</given-names></name> <name><surname>Shlomchik</surname> <given-names>MJ</given-names></name> <name><surname>Kleinstein</surname> <given-names>SH</given-names></name></person-group>. <article-title>Improved methods for detecting selection by mutation analysis of Ig V region sequences</article-title>. <source>Int Immunol</source> (<year>2008</year>) <volume>20</volume>(<issue>5</issue>):<fpage>683</fpage>&#x02013;<lpage>94</lpage>.<pub-id pub-id-type="doi">10.1093/intimm/dxn045</pub-id><pub-id pub-id-type="pmid">18397909</pub-id></citation></ref>
<ref id="B11"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stewart</surname> <given-names>JJ</given-names></name> <name><surname>Lee</surname> <given-names>CY</given-names></name> <name><surname>Ibrahim</surname> <given-names>S</given-names></name> <name><surname>Watts</surname> <given-names>P</given-names></name> <name><surname>Shlomchik</surname> <given-names>M</given-names></name> <name><surname>Weigert</surname> <given-names>M</given-names></name> <etal/></person-group> <article-title>A Shannon entropy analysis of immunoglobulin and t cell receptor</article-title>. <source>Mol Immunol</source> (<year>1997</year>) <volume>34</volume>(<issue>15</issue>):<fpage>1067</fpage>&#x02013;<lpage>82</lpage>.<pub-id pub-id-type="doi">10.1016/S0161-5890(97)00130-2</pub-id><pub-id pub-id-type="pmid">9519765</pub-id></citation></ref>
<ref id="B12"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartz</surname> <given-names>GW</given-names></name> <name><surname>Hershberg</surname> <given-names>U</given-names></name></person-group>. <article-title>Conserved variation: identifying patterns of stability and variability in BCR and TCR V genes with different diversity and richness metrics</article-title>. <source>Phys Biol</source> (<year>2013</year>) <volume>10</volume>(<issue>3</issue>):<fpage>1</fpage>&#x02013;<lpage>10</lpage>.<pub-id pub-id-type="doi">10.1088/1478-3975/10/3/035005</pub-id><pub-id pub-id-type="pmid">23735612</pub-id></citation></ref>
<ref id="B13"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lefranc</surname> <given-names>M-P</given-names></name></person-group>. <article-title>IMGT&#x000C2;<sup>&#x000AE;</sup>, the international ImMunoGeneTics information system&#x000C2;<sup>&#x000AE;</sup> for immunoinformatics. Methods for querying IMGT&#x000C2;<sup>&#x000AE;</sup> databases, tools and webresources in the context of immunoinformatics</article-title>. <source>Mol Biotechnol</source> (<year>2008</year>) <volume>40</volume>(<issue>1</issue>):<fpage>101</fpage>&#x02013;<lpage>11</lpage>.<pub-id pub-id-type="doi">10.1007/s12033-008-9062-7</pub-id></citation></ref>
<ref id="B14"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Y</given-names></name> <name><surname>Kipling</surname> <given-names>D</given-names></name> <name><surname>Dunn-Walters</surname> <given-names>DK</given-names></name></person-group>. <article-title>Age-related changes in human peripheral blood IGH repertoire following vaccination</article-title>. <source>Front Immunol</source> (<year>2012</year>) <volume>3</volume>:<fpage>193</fpage>.<pub-id pub-id-type="doi">10.3389/fimmu.2012.00193</pub-id><pub-id pub-id-type="pmid">22787463</pub-id></citation></ref>
<ref id="B15"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heck</surname> <given-names>LK</given-names></name> <name><surname>van Belle</surname> <given-names>G</given-names></name> <name><surname>Simberloff</surname> <given-names>D</given-names></name></person-group>. <article-title>Explicit calculation of the rarefaction diversity measurement and the determination of sufficient sample size</article-title>. <source>Ecology</source> (<year>1975</year>) <volume>56</volume>(<issue>6</issue>):<fpage>1459</fpage>&#x02013;<lpage>61</lpage>.<pub-id pub-id-type="doi">10.2307/1934716</pub-id></citation></ref>
<ref id="B16"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hershberg</surname> <given-names>U</given-names></name> <name><surname>Shlomchik</surname> <given-names>MJ</given-names></name></person-group>. <article-title>Differences in potential for amino acid change following mutation reveals distinct strategies for kappa and lambda light-chain variation</article-title>. <source>Proc Natl Acad Sci U S A</source> (<year>2006</year>) <volume>103</volume>(<issue>43</issue>):<fpage>15963</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1073/pnas.0607581103</pub-id><pub-id pub-id-type="pmid">17038496</pub-id></citation></ref>
<ref id="B17"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname> <given-names>M</given-names></name></person-group>. <article-title>Diversity and evenness: a unifying notation and its consequences</article-title>. <source>Ecology</source> (<year>1973</year>) <volume>54</volume>(<issue>2</issue>):<fpage>427</fpage>&#x02013;<lpage>32</lpage>.<pub-id pub-id-type="doi">10.2307/1934352</pub-id></citation></ref>
<ref id="B18"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jost</surname> <given-names>L</given-names></name></person-group>. <article-title>Entropy and diversity</article-title>. <source>Oikos</source> (<year>2006</year>) <volume>113</volume>(<issue>2</issue>):<fpage>363</fpage>&#x02013;<lpage>75</lpage>.<pub-id pub-id-type="doi">10.1111/j.2006.0030-1299.14714.x</pub-id></citation></ref>
<ref id="B19"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chothia</surname> <given-names>C</given-names></name> <name><surname>Gelfand</surname> <given-names>I</given-names></name> <name><surname>Kister</surname> <given-names>A</given-names></name></person-group>. <article-title>Structural determinants in the sequences of immunoglobulin variable domain</article-title>. <source>J Mol Biol</source> (<year>1998</year>) <volume>278</volume>(<issue>2</issue>):<fpage>457</fpage>&#x02013;<lpage>79</lpage>.<pub-id pub-id-type="doi">10.1006/jmbi.1998.1653</pub-id><pub-id pub-id-type="pmid">9571064</pub-id></citation></ref>
</ref-list>
</back>
</article>