<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Immunol.</journal-id>
<journal-title>Frontiers in Immunology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Immunol.</abbrev-journal-title>
<issn pub-type="epub">1664-3224</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fimmu.2021.735135</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Immunology</subject>
<subj-group>
<subject>Hypothesis and Theory</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>How Naive T-Cell Clone Counts Are Shaped By Heterogeneous Thymic Output and Homeostatic Proliferation</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Dessalles</surname>
<given-names>Renaud</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Pan</surname>
<given-names>Yunbei</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Xia</surname>
<given-names>Mingtao</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1045451"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Maestrini</surname>
<given-names>Davide</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1555782"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>D&#x2019;Orsogna</surname>
<given-names>Maria R.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1376780"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Chou</surname>
<given-names>Tom</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1315665"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Computational Medicine, University of California at Los Angeles (UCLA)</institution>, <addr-line>Los Angeles, CA</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Mathematics, California State University at Northridge</institution>, <addr-line>Los Angeles, CA</addr-line>, <country>United States</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Department of Mathematics, University of California at Los Angeles (UCLA)</institution>, <addr-line>Los Angeles, CA</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Gr&#xe9;goire Altan-Bonnet, Division of Cancer Biology (NCI), United States</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Carmen Molina-paris, University of Leeds, United Kingdom; Antoine Toubert, Universit&#xe9; Paris Diderot, France; Meriem Bensouda Koraichi, &#xc9;cole Normale Sup&#xe9;rieure, France</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Tom Chou, <email xlink:href="mailto:tomchou@ucla.edu">tomchou@ucla.edu</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Systems Immunology, a section of the journal Frontiers in Immunology</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>17</day>
<month>02</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>735135</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>07</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>12</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Dessalles, Pan, Xia, Maestrini, D&#x2019;Orsogna and Chou</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Dessalles, Pan, Xia, Maestrini, D&#x2019;Orsogna and Chou</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The specificity of T cells is that each T cell has only one T cell receptor (TCR). A T cell clone represents a collection of T cells with the same TCR sequence. Thus, the number of different T cell clones in an organism reflects the number of different T cell receptors (TCRs) that arise from recombination of the V(D)J gene segments during T cell development in the thymus. TCR diversity and more specifically, the clone abundance distribution, are important factors in immune functions. Specific recombination patterns occur more frequently than others while subsequent interactions between TCRs and self-antigens are known to trigger proliferation and sustain naive T cell survival. These processes are TCR-dependent, leading to clone-dependent thymic export and naive T cell proliferation rates. We describe the heterogeneous steady-state population of naive T cells (those that have not yet been antigenically triggered) by using a mean-field model of a regulated birth-death-immigration process. After accounting for random sampling, we investigate how TCR-dependent heterogeneities in immigration and proliferation rates affect the shape of clone abundance distributions (the number of different clones that are represented by a specific number of cells, or &#x201c;clone counts&#x201d;). By using reasonable physiological parameter values and fitting predicted clone counts to experimentally sampled clone abundances, we show that realistic levels of heterogeneity in immigration rates cause very little change to predicted clone-counts, but that modest heterogeneity in proliferation rates can generate the observed clone abundances. Our analysis provides constraints among physiological parameters that are necessary to yield predictions that qualitatively match the data. Assumptions of the model and potentially other important mechanistic factors are discussed.</p>
</abstract>
<kwd-group>
<kwd>naive T cells</kwd>
<kwd>T-cell receptor</kwd>
<kwd>repertoire diversity</kwd>
<kwd>clone-count distributions</kwd>
<kwd>mathematical modeling</kwd>
<kwd>immigration-proliferation model</kwd>
<kwd>heterogeneity</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Institutes of Health<named-content content-type="fundref-id">10.13039/100000002</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content>
</contract-sponsor>
<contract-sponsor id="cn003">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content>
</contract-sponsor>
<counts>
<fig-count count="9"/>
<table-count count="2"/>
<equation-count count="23"/>
<ref-count count="64"/>
<page-count count="16"/>
<word-count count="12034"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<title>Introduction</title>
<p>Naive T cells play a crucial role in the immune system&#x2019;s response to pathogens, tumors, and other infectious agents. These cells are produced in the bone marrow, mature in the thymus, circulate through the blood, and migrate to the lymph nodes where they may be presented with different antigen proteins from various pathogens. Naive T cells mature in the thymus where the so-called V, D, and J segments of genes that code T cell receptors undergo rearrangement. Most T cell receptors (TCRs) are comprised of an alpha chain and a beta chain that are formed after VJ segment and VDJ segment recombination, respectively. The number of possible TCR gene sequences is extremely large, but while recombination is a nearly random process, not all TCRs are formed with the same probability.</p>
<p>The unique receptors expressed on the cell surface of circulating TCRs enable them to recognize specific antigens; well-known examples include the naive forms of helper T cells (CD4+) and cytotoxic T cells (CD8+). The set of naive T cells that express the same TCR are said to belong to the same T cell clone. Upon encountering the antigens that activate their TCRs, naive T cells turn into effector cells that assist in eliminating infected cells. Effector cells die after pathogen clearance, but some develop into memory T cells. Because of the large number of unknown pathogens, TCR clonal diversity is a key factor for mounting an effective immune response. Recent studies also reveal that human TCR clonal diversity is implicated in healthy aging, neonatal immunity, vaccination response and T cell reconstitution following haematopoietic stem cell transplantation (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>). Despite the central role of the naive T cell pool in host defense, and broadly speaking in health and disease, TCR diversity is difficult to quantify. For example, the human body hosts a large repertoire of T cell clones, however the actual distribution of clone sizes is not precisely known (<xref ref-type="bibr" rid="B3">3</xref>). Only recently have experimental and theoretical efforts been devoted to understanding the mechanistic origins of TCR diversity (<xref ref-type="bibr" rid="B4">4</xref>&#x2013;<xref ref-type="bibr" rid="B9">9</xref>). The goal of this work is to formulate a realistic mathematical model that incorporates heterogeneity in naive T cell generation and reproduction. Model predictions are compared with T cell clone data to estimate reasonable and realistic parameter values.</p>
<p>One way to describe the TCR repertoire is by tallying the population <italic>n<sub>i</sub>
</italic> of T cells carrying receptor <italic>i</italic>. Another is to use the clone abundance distribution or &#x201c;clone count&#x201d; that measures the number of distinct clones composed of exactly <italic>k</italic> T cells, <inline-formula>
<mml:math display="inline" id="im1">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mi>:</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>&#x3a3;</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> <inline-graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-i001.tif"/>(<italic>n<sub>i</sub>, k</italic>), where the indicator function <inline-graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-i001.tif"/>(<italic>n</italic>, <italic>k</italic>) = 1 if <italic>n</italic> = <italic>k</italic> and 0 otherwise. Clone counts <inline-formula>
<mml:math display="inline" id="im2">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> do not carry TCR identity information as <italic>n<sub>i</sub>
</italic> does, however, they can be used to construct other summary indices for T cell diversity such as Shannon&#x2019;s entropy, Simpson&#x2019;s index, or the whole population richness <inline-formula>
<mml:math display="inline" id="im3">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>:</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>&#x3a3;</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>Clone counts <inline-formula>
<mml:math display="inline" id="im4">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and the total number of circulating naive T cells are difficult to measure in humans. Nonetheless, high-throughput DNA sequencing on samples of peripheral blood containing T cells (<xref ref-type="bibr" rid="B11">11</xref>&#x2013;<xref ref-type="bibr" rid="B14">14</xref>) have provided some insight into TCR diversity. A commonly invoked model is that clone counts <inline-formula>
<mml:math display="inline" id="im5">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> exhibit a power-law distribution (<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B15">15</xref>&#x2013;<xref ref-type="bibr" rid="B17">17</xref>) in the clone abundance <italic>k</italic>. Several models have been developed to explain observed features of clone counts (<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B19">19</xref>), including the apparent power-law behavior. One proposal is that T cells in different clones have TCRs that have different affinities for self-ligands that are necessary for peripheral proliferation (<xref ref-type="bibr" rid="B4">4</xref>&#x2013;<xref ref-type="bibr" rid="B6">6</xref>), leading to clone specific replication rates. An alternative hypothesis (<xref ref-type="bibr" rid="B7">7</xref>) is that specific TCR sequences are more likely to arise in the V(D)J recombination process in the thymus (<xref ref-type="bibr" rid="B20">20</xref>) leading to a higher probability that these TCRs are produced. De Greef et al. (<xref ref-type="bibr" rid="B7">7</xref>) estimated the probability of production of a given TCR sequence by using the Inference and Generation of Repertoires (IGoR) simulation tool that quantitatively characterizes the statistics of receptor generation from both cDNA and gDNA data (<xref ref-type="bibr" rid="B20">20</xref>).</p>
<p>Although power-law models have been motivated, this behavior has been observed across only about two decades of clone sizes <italic>k</italic>, as shown in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>. Moreover, the above models have not systematically incorporated and compared heterogeneity in both immigration and replication rates, and/or fitted models to measured TCR clone abundance distributions. Finally, some of them have not taken into account subsampling in measurements, which will affect the predicted clone counts, especially for small clone sizes <italic>k</italic> which can be missed in small samples. In this paper, we analyze the effects of heterogeneity and sampling within a dynamic mean-field model based on a stochastic clone-dependent birth-death-immigration (BDI) process that includes (i)&#xa0;immigration representing the arrival of new clones from the thymus, (ii)&#xa0;birth during homeostatic proliferation of naive T cells that yield newborn naive T cells with the same TCR as their parent, and (iii)&#xa0;death representing cell apoptosis&#xa0; (<xref ref-type="bibr" rid="B10">10</xref>). We also include a regulating &#x201c;carrying capacity&#x201d; mechanism through a total population-dependent death rate which may represent the global competition for cytokines, such as Interleukin-7 (<xref ref-type="bibr" rid="B21">21</xref>&#x2013;<xref ref-type="bibr" rid="B25">25</xref>), needed for naive T cell survival and homeostasis (<xref ref-type="bibr" rid="B26">26</xref>, <xref ref-type="bibr" rid="B27">27</xref>). Since these cytokine signals are TCR-independent, the regulatory interaction, which ensures a finite homeostatic naive T cell population, is clone-independent (<xref ref-type="bibr" rid="B23">23</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Normalized naive T cell clone count data from one patient in Oakes et al. (<xref ref-type="bibr" rid="B12">12</xref>) plotted on a log-log scale. Values of the normalized clone counts along the vertical axis are the average of three samples among CD4 and CD8 cell subgroups. Clones are defined by different nucleotide sequences associated with different alpha or beta chains of the TCR.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g001.tif"/>
</fig>
<p>We derive analytic expressions for the steady state clone counts in the entire organism and show that the predicted distributions are negative binomials. However, since T cell clone populations are measured in small blood subsamples extracted from an organism, we modify our predictions to include the effects of random subsampling and find that the negative binomial structure is preserved. Finally, the subsampled prediction will be averaged over distributions of TCR generation (thymic output) and homeostatic proliferation rates. The distribution of TCR generation rates are extracted from new computational tools: Inference and Generation of Repertoires (IGoR) (<xref ref-type="bibr" rid="B20">20</xref>) and Optimized Likelihood estimate of immunoGlobulin Amino-acid sequences (OLGA) (<xref ref-type="bibr" rid="B28">28</xref>). Since there are no equivalent tools that measure proliferation rates, we will assume simple functions for the distribution of homeostatic proliferation rates. These model-derived results depend on the rate parameters of the model and the hyperparameters defining the probability distributions over these T cell production and proliferation rates (see <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>).</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Model parameters <italic>&#x3b8;</italic> and hyperparameters <italic>&#x3b8;</italic>
<sub>0</sub>.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">(Hyper) Parameters</th>
<th valign="top" align="center">definition</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">
<italic>&#x3b1; &#x2208; &#x211d;<sup>+</sup>
</italic>
</td>
<td valign="top" align="left">naive T cell production rate</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im6">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">mean production rate across all possible <italic>Q</italic> TCRs</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>r &#x2208;</italic> [0, <italic>R</italic>]</td>
<td valign="top" align="left">naive T cell proliferation rate</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im7">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">mean proliferation rate across all possible <italic>Q</italic> TCRs</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>R &#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">maximum proliferation rate of all possible <italic>Q</italic> TCRs</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>w &#x2208;</italic> [0, 1]</td>
<td valign="top" align="left">dimensionless width of box distribution of <italic>r</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>&#xb5;</italic>
<sup>&#x2217;</sup> &gt; <italic>R</italic>
</td>
<td valign="top" align="left">naive T cell death rate at steady state</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>&#x3b7; &#x2208;</italic> [0,1]</td>
<td valign="top" align="left">blood subsampling fraction</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The dimensional parameters associated with our mechanistic population model. Hyperparameters such as <inline-formula>
<mml:math display="inline" id="im8">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, r, R, w define the probability distribution or heterogeneity in the underling rate parameters <inline-formula>
<mml:math display="inline" id="im8a">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> and r. In our analyses, we typically nondimensionalize by normalizing all rates by R, the maximum proliferation rate across all clones.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>Our results are then compared to the data shown in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref> and used to estimate hyperparameters associated with the heterogeneity in the TCR-specific immigration and proliferation rates. Specifically, we quantify how the width of a simple uniform proliferation rate distribution and the heterogeneity of immigration rates from a generative model affect the predicted clone counts. Our analysis explicitly shows that within reasonable physiological parameter ranges, heterogeneity in the thymic immigration rate cannot significantly change clone count distributions. However, clone counts <italic>are</italic> sensitive to heterogeneity in T cell proliferation rates. Thus, different levels of heterogeneity in proliferation rates can give rise to qualitatively different clone count distributions. This finding of the dominance of proliferation in shaping clone count distributions is consistent with the observation that in older humans with severely reduced thymic output a broad clone count distribution is still maintained (<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B29">29</xref>).</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<title>Materials and Methods</title>
<p>To understand the observed clone counts, we focus on the clone count distribution <inline-formula>
<mml:math display="inline" id="im9">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> associated only with naive T cells, the first type of cells produced by the thymus that have not yet been activated by any antigen. Antigen-mediated activation initiates a largely irreversible cascade of differentiation into effector and memory T cells that we can subsume into a death rate. Thus, we limit our analysis to birth, death, and immigration within the naive T cell compartment. Here, we first present the mathematical framework of the BDI process to provide an initial qualitative understanding for clone counts.</p>
<sec id="s2_1">
<title>Heterogeneous Birth-Death-Immigration Model</title>
<p>The multiclone BDI process is depicted in <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>. We define <italic>Q</italic> to be the theoretical number of all possible functional naive T cell receptor clones that can be generated by V(D)J recombination in the thymus which is estimated to be <italic>Q</italic> ~ 10<sup>13</sup> &#x2013; 10<sup>18</sup> (<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B28">28</xref>). As we will later show, results of our model will not depend on the explicit value of <italic>Q</italic> as long as <italic>Q</italic> &#x226b; 1. Due to naive T cell death or removal from the sampling-accessible pool, not all possible clone types will be presented in the organism, so we denote the number of clones actually present in the body (or &#x201c;richness&#x201d;) by <inline-formula>
<mml:math display="inline" id="im14">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>&#x226a;</mml:mo>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, where estimates of <inline-formula>
<mml:math display="inline" id="im15">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> range from ~ 10<sup>6</sup> &#x2013; 10<sup>8</sup> in mice and humans (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B32">32</xref>, <xref ref-type="bibr" rid="B33">33</xref>, <xref ref-type="bibr" rid="B35">35</xref>, <xref ref-type="bibr" rid="B36">36</xref>).</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Schematic of a multiclone birth-death-immigration process. Clones are defined by distinct TCR sequences <italic>i</italic>. Each clone carries its own thymic output and peripheral proliferation rates, <italic>&#x3b1;<sub>i</sub>
</italic> and <italic>r<sub>i</sub>
</italic>, respectively. We assume all clones have the same population-dependent death rate <italic>&#x3bc;</italic>(<italic/>), where <italic/> is the total number of cells in the organism that influence the death rate. Since <italic>Q</italic> &#x226b; 1, we impose a continuous distribution over the rates <italic>&#x3b1;</italic> and <italic>r</italic>. Theoretically, there may be <italic>Q</italic> <bold>&#x2273;</bold> 10<sup>15</sup> (<xref ref-type="bibr" rid="B6">6</xref>) or more (<xref ref-type="bibr" rid="B30">30</xref>, <xref ref-type="bibr" rid="B31">31</xref>) possible viable V(D)J recombinations. The actual, effective number of different selected TCRs sequences is expected to be much less since extremely low probability sequences may never be formed during the organism&#x2019;s lifetime. A strict lower bound on <italic>Q</italic> is the actual number of distinct clones <italic>&#x108;</italic> in an entire organism [<italic>&#x108;</italic> &#x223c; 10<sup>6</sup> &#x2013; 10<sup>8</sup> for humans (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B32">32</xref>&#x2013;<xref ref-type="bibr" rid="B34">34</xref>)].</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g002.tif"/>
</fig>
<p>Although naive T cells are difficult to distinguish from the entire T cell population, the total number of naive T cells (across all clones present) in humans has been estimated to be about <inline-formula>
<mml:math display="inline" id="im16">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>11</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. Circulating naive T cells number approximately 10<sup>9</sup> (<xref ref-type="bibr" rid="B37">37</xref>) but can exchange, at different time scales, with those that reside in peripheral tissue, which may carry their own proliferation and death rates. The <italic>effective</italic> pool that is ultimately sampled is thus difficult to estimate, but measurements show that the theoretical number of different clones is much larger than the total number of naive T cells, which is in turn much greater that the total number of different T cell clones actually in the body <inline-formula>
<mml:math display="inline" id="im17">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mo>&#x226b;</mml:mo>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>&#x226b;</mml:mo>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. Regardless of the precise values of the discrete quantities <inline-formula>
<mml:math display="inline" id="im18">
<mml:mrow>
<mml:mi>Q</mml:mi>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, they are related to the discrete clone counts <inline-formula>
<mml:math display="inline" id="im19">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> via</p>
<disp-formula>
<label>(1)</label>
<mml:math display="block" id="M1">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#x226a;</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mtext>&#xa0;and&#xa0;</mml:mtext>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>As depicted in <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>, each distinct clone <italic>i</italic> (with 1 &#x2264; <italic>i</italic> &#x2264; <italic>Q</italic>) is characterized by an immigration rate <italic>&#x3b1;<sub>i</sub>
</italic> and a per cell replication rate <italic>r<sub>i</sub>
</italic>. The immigration rate <italic>&#x3b1;<sub>i</sub>
</italic> is clone-specific because it depends on the preferential V(D)J recombination process; the replication rate <italic>r<sub>i</sub>
</italic> is also clone-specific due to the different interactions with self-peptides that trigger proliferation. Since both the numbers of theoretically possible <inline-formula>
<mml:math display="inline" id="im20">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mo>&#x226b;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and observed <inline-formula>
<mml:math display="inline" id="im21">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>&#x226b;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> clones are extremely large, we can define a continuous, normalized probability density &#x3c0;(<italic>&#x3b1;</italic>, <italic>r</italic>) from which immigration and proliferation rates <italic>&#x3b1;</italic> and <italic>r</italic> of a randomly chosen clone are drawn. This means that the probability that a randomly chosen clone has an immigration rate between <italic>&#x3b1;</italic> and <italic>&#x3b1;</italic> + d<italic>&#x3b1;</italic> and replication rate between <italic>r</italic> and <italic>r</italic> + d<italic>r</italic> is <italic>&#x3c0;</italic>(<italic>&#x3b1;</italic>, <italic>r</italic>)d<italic>&#x3b1;</italic>d<italic>r</italic>, and <inline-formula>
<mml:math display="inline" id="im22">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>Since <italic>Q</italic> is finite and countable, there will exist maximum values <italic>A</italic> and <italic>R</italic> for the immigration and proliferation rates, respectively, such that <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) = 0 for <italic>&#x3b1; &gt; A</italic> or <italic>r &gt; R</italic>. In the BDI process, the upper bound <italic>R</italic> on the proliferation rate prevents unbounded numbers of naive T cells and is necessary for a self-consistent solution. The heterogeneity in the immigration and replication rates allows us to go beyond typical &#x201c;neutral&#x201d; BDI models, where both rates are fixed to a specific value for all clones, <italic>&#x3b1;<sub>i</sub> = &#x3b1;</italic> and <italic>r<sub>i</sub> = r</italic> for all <italic>i</italic>.</p>
<p>Finally, we assume the per cell death rate <inline-formula>
<mml:math display="inline" id="im23">
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> is clone-independent but a function of the total population <inline-formula>
<mml:math display="inline" id="im24">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>. This dependence represents the competition among all naive T cells for a common resource (such as cytokines), which effectively imposes a carrying capacity on the population (<xref ref-type="bibr" rid="B24">24</xref>, <xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B38">38</xref>). The specific form of the regulation will not qualitatively affect our findings since we will ultimately be interested in only its value <italic>&#x3bc;</italic>(<italic>N</italic>
<sup>&#x2217;</sup>) <italic>&#x2261; &#x3bc;</italic>
<sup>&#x2217;</sup> at the mean steady state population <italic>N</italic>
<sup>&#x2217;</sup>.</p>
</sec>
<sec id="s2_2">
<title>Mean-Field Approximation of the BDI Process</title>
<p>The exact steady-state probabilities of configurations of the discrete abundances <inline-formula>
<mml:math display="inline" id="im25">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> for a fully stochastic neutral BDI model with regulated death rate <inline-formula>
<mml:math display="inline" id="im26">
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> were recently derived (<xref ref-type="bibr" rid="B10">10</xref>). In Dessalles et al. (<xref ref-type="bibr" rid="B10">10</xref>) exact results were derived for the steady-state probability <inline-formula>
<mml:math display="inline" id="im27">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> under uniform immigration, proliferation, and death rates &#x3b1;, r, and &#x3bc;, respectively. The significant contribution of this paper is that we go beyond the neutral model (equal immigration, proliferation, and death rates for all clones) by allowing for heterogeneous distributions of these rates. To incorporate TCR-dependent immigration and replication rates in a non-neutral model, we must consider distinct values of <italic>&#x3b1;<sub>i</sub>
</italic> and <italic>r<sub>i</sub>
</italic> for each clone <italic>i</italic>. In this case, an analytic solution for the probability distribution over <inline-formula>
<mml:math display="inline" id="im28">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, even at steady state, cannot be expressed in an explicit form. However, since the effective number of naive T cells (<inline-formula>
<mml:math display="inline" id="im29">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mn>9</mml:mn>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>11</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B35">35</xref>)) is large, we can exploit a mean-field approximation to the non-neutral BDI model and derive expressions for the mean values of the discrete clone counts <inline-formula>
<mml:math display="inline" id="im30">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. We will show later that under realistic parameter regimes, the mean-field approximation is quantitatively accurate. Breakdown of the mean field approximation has been carefully analyzed in other studies (<xref ref-type="bibr" rid="B39">39</xref>).</p>
<sec id="s2_2_1">
<title>i) Deterministic Approximation for the Total Population and the Effective Death Rate</title>
<p>To implement the mean-field approximation in the presence of a general regulated death rate <inline-formula>
<mml:math display="inline" id="im31">
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">^</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, we start by writing the deterministic, &#x201c;mass-action&#x201d; ODE for the mean number of cells <italic>n<sub>&#x3b1;,r</sub>
</italic>(<italic>t</italic>) with a realized immigration rate <italic>&#x3b1;</italic> and proliferation rate <italic>r</italic> in a BDI process</p>
<disp-formula>
<label>(2)</label>
<mml:math display="block" id="M2">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>r</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Next, we define and exploit the density of realized values of <italic>&#x3b1;</italic> and <italic>r</italic>. Since <italic>Q</italic> &#x226b; 1, the number of TCRs that are associated with immigration rate between <italic>&#x3b1;</italic> and <italic>&#x3b1;</italic> + d<italic>&#x3b1;</italic> and a replication rate between <italic>r</italic> and <italic>r</italic> + d<italic>r</italic> is denoted <italic>Q&#x3c0;</italic>(<italic>&#x3b1;, r</italic>)d<italic>&#x3b1;</italic>d<italic>r</italic>, where <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) is a normalized density that describes how these realized values of <italic>&#x3b1;</italic> and <italic>r</italic> are distributed. Our model for the total mean number <italic>N</italic>(<italic>t</italic>) of naive T cells can then be estimated as a weighted integral over all <italic>n<sub>&#x3b1;,r</sub>
</italic>(<italic>t</italic>)</p>
<disp-formula>
<label>(3)</label>
<mml:math display="block" id="M3">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>A</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>R</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
<mml:mtext>&#x00A0;</mml:mtext>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>&#x3c0;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Note that the limits of the integration above can equivalently be taken as <italic>A, R</italic>&#x2192;&#x221e; as long as <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) = 0 when <italic>&#x3b1; &gt; A</italic> or <italic>r &gt; R</italic>. At steady-state, the solution to Eq.&#xa0;2 can be simply expressed as</p>
<disp-formula>
<label>(4)</label>
<mml:math display="block" id="M4">
<mml:mrow>
<mml:msubsup>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>&#x2217;</mml:mo>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mi>&#x3b1;</mml:mi>
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>in which <italic>N</italic>
<sup>&#x2217;</sup> is the predicted steady-state value of <italic>N</italic>(<italic>t</italic>) as <italic>t</italic> &#x2192; &#x221e;. Thus, upon weighting Eq.&#xa0;4 over all possible values of <italic>&#x3b1;</italic> and <italic>r</italic>, we find</p>
<disp-formula>
<label>(5)</label>
<mml:math display="block" id="M5">
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>R</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mfrac>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2019;</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>a self-consistent equation for <italic>N</italic>
<sup>&#x2217;</sup> which depends implicitly on the parameters that define the distribution <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>). Eq.&#xa0;5 clearly shows why a finite cutoff <italic>&#x3c0;</italic>(<italic>&#x3b1;, r &gt; R</italic>) = 0, <italic>R</italic> &lt; <italic>&#x3bc;</italic>(<italic>N</italic>
<sup>&#x2217;</sup>) is required since the integral diverges if <italic>&#x3c0;</italic>(<italic>&#x3b1;, r &#x2265; &#x3bc;</italic>(<italic>N</italic>
<sup>&#x2217;</sup>)) &gt; 0. However, as long as <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) decays faster than 1/<italic>&#x3b1;</italic>
<sup>2</sup>, the <italic>&#x3b1;</italic>-integration converges with an explicit cutoff <italic>A</italic>.</p>
<p>We will first assume that <italic>&#x3b1;</italic> and <italic>r</italic> are uncorrelated and that the distribution factorises: <italic>&#x3c0;</italic>(<italic>&#x3b1;,r</italic>) = <italic>&#x3c0;<sub>&#x3b1;</sub>
</italic>(<italic>&#x3b1;</italic>)<italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>). Then, the self-consistent effective steady state death rate <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> <italic>&#x2261; &#x3bc;</italic>(<italic>N</italic>
<sup>&#x2217;</sup>) depends only on the combination</p>
<disp-formula>
<mml:math display="block" id="M100">
<mml:mrow>
<mml:mfrac>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>R</mml:mi>
</mml:msubsup>
<mml:mfrac>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mstyle>
<mml:mi>,</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where</p>
<disp-formula>
<mml:math display="block" id="M101">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x2261;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>A</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</disp-formula>
<p>is the mean immigration rate across all possible clones. To simplify subsequent notation, we normalize all rates by the maximum proliferation rate <italic>R</italic>. To avoid population blow-up, we impose that the maximum proliferation is smaller than the steady-state death rate <italic>R &lt; &#x3bc;</italic>
<sup>&#x2217;</sup>. By measuring time in units of 1/<italic>R</italic>, we redefine <italic>r/R</italic> &#x2192; <italic>r</italic> &#x2264; 1, <italic>&#x3b1;/R</italic> &#x2192; <italic>&#x3b1;</italic>, <inline-formula>
<mml:math display="inline" id="im34">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>R</mml:mi>
<mml:mo>&#x2192;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>/<italic>R</italic> &#x2192; <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>, and <italic>R</italic>
<sup>2</sup>
<italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) &#x2192; <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) so that these quantities are now dimensionless, unless otherwise explicitly stated. The steady-state self-consistent condition becomes</p>
<disp-formula>
<label>(6)</label>
<mml:math display="block" id="M6">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2261;</mml:mo>
<mml:mfrac>
<mml:mi>&#x3bb;</mml:mi>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mi>.</mml:mi>
</mml:math>
</disp-formula>
<p>Since the effective <italic>Q</italic> is a large, uncertain number, we parameterize our model in terms of <italic>&#x3bb;</italic> &#x2261; <italic>N</italic>
<sup>&#x2217;</sup>/<italic>Q</italic>, the total steady state naive T cell population normalized by the total possible number of clones <italic>Q</italic>. It is sometimes deemed a measure of the &#x201c;coverage&#x201d; of the entire repertoire (6). Values of <italic>N</italic>
<sup>&#x2217;</sup> and <italic>Q</italic> that are consistent with measurements and physiologic expectations give <italic>&#x3bb;</italic> &#x226a; 1. Once <inline-formula>
<mml:math display="inline" id="im35">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>) are estimated, we can self-consistently determine <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> from Eq.&#xa0;6. Besides <inline-formula>
<mml:math display="inline" id="im36">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, the self-consistent value of <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> will also depend on the function <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>). Note from the form of Eq.&#xa0;6, the self-consistent <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> is inversely related to <italic>&#x3bb;</italic>.</p>
</sec>
<sec id="s2_2_2">
<title>ii) Mean-Field Model of Clone Counts</title>
<p>Given a relationship such as Eq.&#xa0;6 that determines <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>, we can explicitly develop a model that quantifies naive T cell subpopulations according to their immigration and proliferation rates <italic>&#x3b1;</italic> and <italic>r</italic>. For a given, realized value of <italic>&#x3b1;</italic> and <italic>r</italic>, we denote the expected number of clones of size <italic>k</italic> with these immigration and proliferation rates by <inline-formula>
<mml:math display="inline" id="im37">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. The mean-field equations for the dynamics of these mean clone counts in the neutral model were derived in (<xref ref-type="bibr" rid="B39">39</xref>, <xref ref-type="bibr" rid="B40">40</xref>) and are reviewed in Section 1 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>. In a neutral model, we assume that all clones <italic>Q</italic> carry the same rates <italic>&#x3b1;</italic> and <italic>r</italic> so that the mean field evolution equation for <inline-formula>
<mml:math display="inline" id="im38">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is given by solving (<xref ref-type="bibr" rid="B38">38</xref>, <xref ref-type="bibr" rid="B39">39</xref>)</p>
<disp-formula>
<label>(7)</label>
<mml:math display="block" id="M7">
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>k</mml:mi>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>+</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>k</mml:mi>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>]</mml:mo>
<mml:mi>,</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>along with the constraint <inline-formula>
<mml:math display="inline" id="im39">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>. Note that <inline-formula>
<mml:math display="inline" id="im40">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>n<sub>&#x3b1;,r</sub>
</italic> are related via <inline-formula>
<mml:math display="inline" id="im41">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>. We use the notation <italic>c<sub>k</sub>
</italic> to denote the predicted clone counts derived from our mathematical model to distinguish them from <italic>measured</italic> clone counts <inline-formula>
<mml:math display="inline" id="im42">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Equation&#xa0;7 assumes that both <inline-formula>
<mml:math display="inline" id="im43">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>N</italic> are uncorrelated, allowing us to write the last term as a product of functions of the mean population <inline-formula>
<mml:math display="inline" id="im44">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im45">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Under steady-state, we approximate <italic>&#x3bc;</italic>(<italic>N</italic>) by <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> found by solving Eq.&#xa0;6 as a function of <inline-formula>
<mml:math display="inline" id="im46">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, and the hyperparameters defining <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>). The steady-state solution of Eq.&#xa0;7 follows a negative binomial distribution with parameters &#x3b1;/<italic>r</italic> and <italic>r</italic>/<italic>&#x3bc;</italic>
<sup>&#x2217;</sup> &lt; 1 (<xref ref-type="bibr" rid="B10">10</xref>, <xref ref-type="bibr" rid="B39">39</xref>)</p>
<disp-formula>
<label>(8)</label>
<mml:math display="block" id="M8">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msup>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>!</mml:mo>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x220f;</mml:mo>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>r</mml:mi>
</mml:mfrac>
<mml:mo>+</mml:mo>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mi>,</mml:mi>
</mml:math>
</disp-formula>
<p>The predicted number of absent clones is <inline-formula>
<mml:math display="inline" id="im47">
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>. The solution 8 depends implicitly on the parameter <inline-formula>
<mml:math display="inline" id="im48">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> through <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> determined by Eq.&#xa0;6. Although <italic>c<sub>k</sub>
</italic>(<italic>&#x3b1;</italic>, <italic>r</italic>, <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>) has not yet been averaged over <italic>&#x3b1;, r</italic>, it also implicitly depends on <italic>&#x3bb;</italic> and the parameters that define <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>) through <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> and Eq.&#xa0;6. Specifically, larger <italic>&#x3bb;</italic> leading to smaller <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> results in a more slowly decaying <italic>c<sub>k</sub>
</italic>(<italic>&#x3b1;, r, &#x3bc;</italic>
<sup>&#x2217;</sup>) as a function of <italic>k</italic>. This behavior will be propagated through subsampling and averaging over <italic>&#x3b1;</italic> and <italic>r</italic>.</p>
</sec>
</sec>
<sec id="s2_3">
<title>Subsampling</title>
<p>Unless an animal is sacked and its entire naive T cell population is sequenced, TCR clone distributions are typically measured from sequencing TCRs in a small blood sample. In such samples, low population clones may be missed. In order to compare our predictions with measured clone abundance distributions, we must revise our predictions to allow for random cell sampling. We define <italic>&#x3b7;</italic> as the fraction of naive T cells in an organism that is drawn in a sample and assume that all naive T cells in the organism have the same probability <italic>&#x3b7;</italic> of being sampled. This is true only if naive T cells carrying different TCRs are not preferentially partitioned into different tissues and are uniformly distributed within an animal. Let us assume that a specific clone is represented by <inline-formula>
<mml:math display="inline" id="im49">
<mml:mi>&#x2113;</mml:mi>
</mml:math>
</inline-formula> cells in an organism. If <inline-formula>
<mml:math display="inline" id="im50">
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mi>&#x3b7;</mml:mi>
<mml:mo>&#x226b;</mml:mo>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, the probability that <italic>k</italic> cells are randomly sampled from the same clone approximately follows a binomial distribution with parameters <inline-formula>
<mml:math display="inline" id="im200">
<mml:mi>&#x2113;</mml:mi>
</mml:math>
</inline-formula> and <italic>&#x3b7;</italic> (<xref ref-type="bibr" rid="B40">40</xref>&#x2013;<xref ref-type="bibr" rid="B44">44</xref>)</p>
<disp-formula>
<label>(9)</label>
<mml:math display="block" id="M9">
<mml:mrow>
<mml:mi>&#x2119;</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mo>&#x2248;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow/>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
</mml:mstyle>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mi>&#x3b7;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2009;</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
<mml:mi>.</mml:mi>
</mml:math>
</disp-formula>
<p>The associated mean <italic>sampled</italic> clone count <inline-formula>
<mml:math display="inline" id="im51">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> depends on the predicted whole-organism clone count and &#x2119;[<italic>k</italic>|<italic>&#x2113;</italic>] via the formula</p>
<disp-formula>
<label>(10)</label>
<mml:math display="block" id="M10">
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2248;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>&#x2113;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mi>&#x2119;</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mstyle displaystyle="true">
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>&#x2113;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow/>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
</mml:mrow>
</mml:mstyle>
</mml:msubsup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mi>&#x3b7;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mi>.</mml:mi>
</mml:mstyle>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>where <italic>c<sub>&#x2113;</sub>
</italic>(&#x3b1;, <italic>r</italic>, <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>) is determined by Eq.&#xa0;8. Explicitly performing the sum in Eq.&#xa0;10 yields the sampled clone count</p>
<disp-formula>
<label>(11)</label>
<mml:math display="block" id="M11">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>!</mml:mo>
</mml:mrow>
</mml:mfrac>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>&#x3b7;</mml:mi>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo> </mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mfrac>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>r</mml:mi>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x220f;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>r</mml:mi>
</mml:mfrac>
<mml:mo>+</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
</disp-formula>
<p>The total expected number of clones in the sample (the richness) can be found via direct summation:</p>
<disp-formula>
<label>(12)</label>
<mml:math display="block" id="M12">
<mml:mrow>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
</disp-formula>
<p>As shown in <xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>, random subsampling greatly affects the observed clone counts, with small sampling fractions <italic>&#x3b7;</italic> leading to fast decay in <italic>k</italic> of <inline-formula>
<mml:math display="inline" id="im54">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and shifting <italic>c<sub>k</sub>
</italic> at large <italic>k</italic> to much smaller values of <italic>k</italic> while reducing the values of <italic>c<sub>k</sub>
</italic> for small <italic>k</italic> (<xref ref-type="bibr" rid="B42">42</xref>). Note that setting <italic>&#x3b7;</italic> = 1 in Eq.&#xa0;11 leads to Eq.&#xa0;8, the whole-body clone count. In <xref ref-type="fig" rid="f3">
<bold>Figures&#xa0;3A, B</bold>
</xref> we plot results from our model using two very different dimensionless parameter sets, <italic>&#x3b1;</italic> = 10<sup>-5</sup>, <italic>r</italic> = 1/2, <italic>&#x3bb;</italic> = 0.01, and <italic>&#x3b1; = &#x3bb;</italic> =10, <italic>r</italic> = 1/2, to generate two qualitatively different patterns of neutral model clone counts <italic>c<sub>k</sub>
</italic>. If the subsampling <italic>&#x3b7;</italic> &#x226a; 1 is sufficiently small, the resulting <inline-formula>
<mml:math display="inline" id="im55">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> corresponding to the two qualitatively different <italic>c<sub>k</sub>
</italic> can appear similar. This implies that small sampling fractions make the estimation of whole-body clone counts from sampled data somewhat ill-conditioned, <italic>i.e.</italic>, different whole-body clone counts, upon sampling, may yield similar sampled clone counts. Although sampling can strongly affect the inference of <italic>c<sub>k</sub>
</italic>, immigration and proliferation rate distributions may also affect the observed clone count as we investigate below.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>The effects of sampling on two different neutral-model relative clone counts <inline-formula>
<mml:math display="inline" id="im52">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> plotted using the dimensionless proliferation rate <italic>r</italic> = 1/2 in Eqs.&#xa0;11 and 12 or Eq.&#xa0;10 and S9 from Section 2 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>. In <bold>(A)</bold>, we used <italic>&#x3b1;</italic> = 10<sup>-5</sup>, <italic>&#x3bb;</italic> =0.01. The effect of sampling is illustrated for <italic>&#x3b7;</italic> = 1 (no subsampling), 10<sup>-3</sup>, 10<sup>-4</sup>, 10<sup>-5</sup>, and 10<sup>-6</sup>. All clone counts are qualitatively similar, with subsampling increasing the exponential decay in <italic>c<sub>k</sub>
</italic>. In <bold>(B)</bold>, we use a physiologically unrealistic set of parameters, <italic>&#x3b1;</italic> = <italic>&#x3bb;</italic> = 10, which leads to a qualitatively different unsampled clone count pattern that exhibits a peak. However, under small subsampling fractions <italic>&#x3b7;</italic>, the clone count loses its peak as it shifts to a rapidly decreasing patterns <inline-formula>
<mml:math display="inline" id="im53">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> that are not significantly different from sampled clone counts predicted using the parameters <italic>&#x3b1;</italic> and <italic>&#x3bb;</italic> in <bold>(A)</bold>. This indicates inferring parameters using clone counts is ill-conditioned (rather insensitive to parameters) if <italic>&#x3b7;</italic> is too small.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g003.tif"/>
</fig>
</sec>
<sec id="s2_4">
<title>Heterogeneity and Determination of <italic>&#x3c0;</italic> (<italic>&#x3b1;</italic>, <italic>r | &#x19f;</italic>
<sub>0</sub>)</title>
<p>The fundamental result given in Eq.&#xa0;11 applies only to the clone count density in a neutral model in which the immigration and proliferation rates are <italic>&#x3b1;</italic> and <italic>r</italic> for all clones. We now average the sampled clone counts <inline-formula>
<mml:math display="inline" id="im56">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> (Eq.&#xa0;11) and the richness <italic>C<sup>s</sup>
</italic>(<italic>&#x3b1;</italic>, <italic>r</italic>, <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>, <italic>&#x3b7;</italic>) (Eq.&#xa0;12) over a distribution of immigration and proliferation rates <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) to capture the heterogeneity across TCR clones. This final result can then be compared with experimentally measured clone counts. Recall that <italic>&#x3c0;</italic>(<italic>&#x3b1;, r</italic>) can depend on hyperparameters <italic>&#x3b8;</italic>
<sub>0</sub> that define the shape of <italic>&#x3c0;</italic>. We then explicitly denote the distribution by <italic>&#x3c0;</italic>(<italic>&#x3b1;</italic>, <italic>r|&#x3b8;</italic>
<sub>0</sub>).</p>
<p>Once <italic>&#x3c0;</italic>(<italic>&#x3b1;</italic>, <italic>r|&#x3b8;</italic>
<sub>0</sub>) is defined, we can weight sampled clone counts accordingly. For example, one may assume <inline-formula>
<mml:math display="inline" id="im57">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, with each of the two hyperparameters defining <inline-formula>
<mml:math display="inline" id="im58">
<mml:mrow>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, leading to</p>
<disp-formula>
<mml:math display="block" id="M102">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
</disp-formula>
<sec id="s2_4_1">
<title>i) Proliferation Rate Heterogeneity</title>
<p>First, we consider a distribution of TCR sequence-dependent proliferation rates. Since TCR-antigen affinity depends on the receptor amino-acid sequence, the rate of T cell activation and subsequent proliferation can be clone-specific (<xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B45">45</xref>). Thus, the specific interactions between TCRs and low-affinity MHC/self-peptide complexes maps to a distribution of proliferation rates among all the <italic>Q</italic> possible clones. Since there are no data (known to us) that can be used to infer this mapping or the specific shape of <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>), we assume, for simplicity, a simple uniform &#x201c;box&#x201d; distribution centered about a mean value <inline-formula>
<mml:math display="inline" id="im60">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>:</p>
<disp-formula>
<label>(13)</label>
<mml:math display="block" id="M13">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow/>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>&#x2009;</mml:mo>
<mml:mtext>otherwise</mml:mtext>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>&#x2009;</mml:mo>
<mml:mtext>if</mml:mtext>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mo>|</mml:mo>
</mml:mrow>
<mml:mo>&lt;</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mstyle>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where 0 &#x2264; <italic>w</italic> &#x2264; 1 represents the relative width of the uniform box distribution. The minimum and maximum dimensionless proliferation rates in this distribution are then 1/2-<italic>w</italic>/2 and 1/2+<italic>w</italic>/2, respectively. The dimensionless self-consistency condition (Eq.&#xa0;6) thus yields</p>
<disp-formula>
<label>(14)</label>
<mml:math display="block" id="M14">
<mml:mrow>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
<mml:mo>+</mml:mo>
<mml:mfrac>
<mml:mi>w</mml:mi>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>w</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mi>w</mml:mi>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mi>w</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<p>To understand the effects of proliferation rate heterogeneity we begin by considering it effects on whole-organism (<italic>&#x3b7;</italic> = 1) clone counts. Since the function <italic>c<sub>k</sub>
</italic>(<italic>&#x3b1;</italic>, <italic>r</italic>, <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>) defined by Eq.&#xa0;8 contains the exponentially decaying term (<italic>r</italic>/<italic>&#x3bc;</italic>
<sup>&#x2217;</sup>)<italic>
<sup>k</sup>
</italic>, a fixed dimensionless value of <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> and <italic>r</italic> = 1/2 leads to an exponential decay in <italic>c<sub>k</sub>
</italic> in <italic>k</italic>. However, if <italic>w</italic> &gt; 0, different values of <italic>r</italic> and <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> contribute to this decay term, yielding nontrivial behavior and a much slower decay as seen in <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref> for <inline-formula>
<mml:math display="inline" id="im65">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>8</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>80</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and different values of <italic>w</italic>.</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>An exploration of the effects of proliferation rate heterogeneity on the mean clone counts <italic>c<sub>k</sub>
</italic> with Q = 10<sup>13</sup>. <bold>(A)</bold> Various box distributions <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>) for <italic>w</italic> = 0, 0.2, 0.4, 0.6, 0.8, and 1. <bold>(B)</bold> Using Eq.&#xa0;14 and the dimensionless values <inline-formula>
<mml:math display="inline" id="im61">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>8</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> such that <inline-formula>
<mml:math display="inline" id="im62">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>8</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, we plot, using the same color spectrum as <bold>(A)</bold>, the corresponding clone counts <italic>C<sub>k</sub>
</italic> and show that wider distributions typically generate longer-tailed <italic>c<sub>k</sub>
</italic>. However, if <italic>&#x3bb;</italic> is set even larger such that <inline-formula>
<mml:math display="inline" id="im63">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>80</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> even modest values of <italic>w</italic> can generate a very long-tailed <italic>c<sub>k</sub>
</italic>, as shown in <bold>(C)</bold>. The color spectrum in <bold>(C)</bold> is for visualization only and not associated with that in <bold>(A, B)</bold>. In the limit of very large <inline-formula>
<mml:math display="inline" id="im64">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, the effects of heterogeneous proliferation saturate at very small <italic>w</italic> beyond which it has negligible effect in further extending the tail.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g004.tif"/>
</fig>
</sec>
<sec id="s2_4_2">
<title>ii) Immigration Rate Heterogeneity</title>
<p>Next, we use previous studies that predict V(D)J recombination frequencies associated with each TCR sequence to construct a distribution <italic>&#x3c0;</italic>
<sub>&#x3b1;</sub>(&#x3b1;) for the TCR sequence-dependent thymic output. A statistical model for differential V(D)J recombination in humans is implemented in the Optimized Likelihood estimate of immunoGlobulin Amino-acid sequences (OLGA) software (<xref ref-type="bibr" rid="B28">28</xref>), which is an updated version of the Inference and Generation of Repertoires (IGoR) software (<xref ref-type="bibr" rid="B20">20</xref>). Below, we estimate <inline-formula>
<mml:math display="inline" id="im66">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> by sampling a large number of TCRs from OLGA that draws sequences according to their generation probability. Our working assumption is that thymic selection is uncorrelated with V(D)J recombination so the relative probabilities of forming different TCRs provide an accurate representation of the ratios of the TCRs exported into the periphery.</p>
<p>Both IGoR and OLGA can be used to generate the probabilities corresponding to each drawn sequence but this requires significant computational time and memory. Equivalently, since the sequence draws are proportional to the underlying probabilities, we simply drew <italic>N<sub>*</sub>
</italic> sequences and counted the frequencies of each amino acid sequence. Out of <italic>N<sub>*</sub>
</italic> sequence draws from IGoR or OLGA, there will be <italic>C<sub>*</sub>
</italic> distinct amino sequences (the richness of the drawn sequences). Since some sequences are drawn <italic>j</italic>&gt;1 times, <italic>C<sub>*</sub>
</italic> &#x2264; <italic>N<sub>*</sub>
</italic>. If <italic>b<sub>j</sub>
</italic> distinct sequences are drawn <italic>j</italic> times, and the maximum observed frequency max{<italic>j</italic>} &#x2261; <italic>J</italic>, <inline-formula>
<mml:math display="inline" id="im67">
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>J</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math display="inline" id="im68">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>J</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>, while <italic>b<sub>j</sub>
</italic>/<italic>C<sub>*</sub>
</italic> is the fraction of all drawn sequences that appear <italic>j</italic> times. For <italic>N<sub>*</sub>
</italic> = 10<sup>9</sup>, we found <italic>C<sub>*</sub>
</italic> = 372,806,648 &#x2248; 3.72 &#xd7; 10<sup>8</sup> and a maximum observed frequency max{<italic>j</italic>} = <italic>J</italic> = 52,294 for the alpha chain and <italic>C<sub>*</sub>
</italic> = 875,920,705 &#x2248; 8.76 &#xd7; 10<sup>8</sup> and <italic>J</italic> = 6430 for the beta chain.</p>
<p>We model the effective immigration rate of a TCR sequence drawn <italic>j</italic> times to be proportional to <italic>j</italic> so that <italic>&#x3b1;<sub>j</sub>
</italic> &#x2261; <italic>&#x3b1;<sub>*</sub>j</italic>. To fix the proportionality <italic>&#x3b1;</italic>
<sub>*</sub>, we identify the mean immigration rate averaged across the <italic>C</italic>
<sub>*</sub> observed sequences with the mean physiological rate <inline-formula>
<mml:math display="inline" id="im70">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula></p>
<disp-formula>
<label>(15)</label>
<mml:math display="block" id="M103">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>j</mml:mi>
</mml:munder>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#x2248;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</disp-formula>
<p>to find <inline-formula>
<mml:math display="inline" id="im72">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo stretchy="false">/</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and thus</p>
<disp-formula>
<label>(16)</label>
<mml:math display="block" id="M15">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:mo stretchy="false">/</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
</disp-formula>
<p>The frequencies <italic>j</italic> of the drawn realization of clones are plotted in decreasing order against the <italic>C</italic>
<sub>*</sub> distinct sequences in <xref ref-type="fig" rid="f5">
<bold>Figures&#xa0;5A, B</bold>
</xref>. From these frequencies <italic>j</italic> and the number of sequences <italic>b<sub>j</sub>
</italic> exhibiting them, we approximate averages of any function <italic>y</italic>(<italic>&#x3b1;</italic>) over <inline-formula>
<mml:math display="inline" id="im75">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> by taking a sum over the values <italic>&#x3b1;<sub>j</sub>
</italic>:</p>
<disp-formula>
<label>(17)</label>
<mml:math display="block" id="M16">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2248;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>J</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mi>y</mml:mi>
</mml:mrow>
</mml:mstyle>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
</disp-formula>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Ordered integer-valued frequencies <italic>j</italic>, plotted on a log-log scale, of the <italic>C<sub>*</sub>
</italic> distinct <bold>(A)</bold> alpha and <bold>(B)</bold> beta chains drawn using OLGA. The index 1 &#x2264; <italic>i</italic> &#x2264; <italic>C<sub>*</sub>
</italic> &lt; <italic>N<sub>*</sub>
</italic> labels the distinct sequences drawn while <italic>b<sub>j</sub>
</italic> is defined as the number these sequences that exhibit the specific frequency <italic>j</italic> [<italic>b</italic>
<sub>1</sub> and <italic>b</italic>
<sub>2</sub> are explicitly indicated in <bold>(B)</bold>]. The highest frequency clone appears <italic>J</italic> times such that <italic>b<sub>j&gt;J</sub>
</italic> = 0. Since <italic>C<sub>*</sub>
</italic> is comparable to <italic>N<sub>*</sub>
</italic>, the drawn sequences are dominated by the low probability ones that appear only once. The insets display the frequencies on a linear scale and indicate the long-tailed behavior of the frequencies. The shape of the frequency spectra is self-similar once <italic>N<sub>*</sub>
</italic> &#x2273; 10<sup>7</sup>, allowing us to use this sampling procedure to reliably estimate <inline-formula>
<mml:math display="inline" id="im74">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g005.tif"/>
</fig>
<p>Alternatively, when drawing sequences IGoR and OLGA (using the Pgen feature) one can also directly output their probabilities <italic>p<sub>i</sub>
</italic>, whose values would be proportional to the frequency <italic>j</italic> if large numbers of sequences are drawn as described above. We can use these countable sequences and probabilities to construct <italic>&#x3b1;</italic> and <italic>&#x3c0;<sub>&#x3b1;</sub>
</italic>(<italic>&#x3b1;</italic>) by defining <inline-formula>
<mml:math display="inline" id="im76">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="false">/</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>T</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> where <inline-formula>
<mml:math display="inline" id="im77">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>T</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</inline-formula>. By plotting the values of <italic>p<sub>i</sub>
</italic>, we arrive at a distribution similar to that shown in <xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>. In this case too, we find that a large number of low-probability sequences dominates the averaging of clone counts using the distribution of immigration rates constructed using IGoR/OLGA.</p>
<p>Now that we have specified the distributions for <inline-formula>
<mml:math display="inline" id="im78">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>), we can compute the mean, sampled, immigration- and proliferation-averaged clone counts and compare them with measurements. The full formula for the immigration and proliferation rate-averaged clone counts under subsampling is</p>
<disp-formula>
<label>(18)</label>
<mml:math display="block" id="M17">
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>!</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>J</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mo>*</mml:mo>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x222b;</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>+</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mtext>d</mml:mtext>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mi>w</mml:mi>
</mml:mfrac> </mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mstyle>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>&#x3b7;</mml:mi>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo> </mml:msup> </mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msup>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo> </mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup> </mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>r</mml:mi>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x220f;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>r</mml:mi>
</mml:mfrac>
<mml:mo>+</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>where <italic>&#x3b1;<sub>j</sub>
</italic> is given by Eq.&#xa0;16 and <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> is given by Eq.&#xa0;14. Eq.&#xa0;18 is our &#x201c;full model&#x201d; from which we make predictions of clones count-related quantities and compare them with data. Using this expression, we can mathematically study the importance of the heterogeneities in <italic>&#x3b1;</italic> and <italic>r</italic> by comparing predictions from simple forms of <inline-formula>
<mml:math display="inline" id="im79">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>), as presented in Section 2 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref> to those derived from <inline-formula>
<mml:math display="inline" id="im80">
<mml:mrow>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> of the neutral model.</p>
<p>From <xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>, observe that <inline-formula>
<mml:math display="inline" id="im81">
<mml:mrow>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x226b;</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&gt;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. In fact, a majority of the naive T cell population is comprised of clones that are produced only once. The linear-scale insets also show a long tail indicating a large number of clones that are generated few times. Thus, for sufficiently small <inline-formula>
<mml:math display="inline" id="im82">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>, our formulae for <italic>c<sub>k</sub>
</italic> and all subsequent quantities can be approximated by taking the <inline-formula>
<mml:math display="inline" id="im83">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x226a;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> limit. As we show in Section 3 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>, such a simpler expression remains highly accurate, provided the dimensionless <inline-formula>
<mml:math display="inline" id="im84">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&lt;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, and allows efficient computation. This implies that the full result arising from averaging <inline-formula>
<mml:math display="inline" id="im85">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> over <inline-formula>
<mml:math display="inline" id="im86">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> can also be approximated by using a single effective value <inline-formula>
<mml:math display="inline" id="im87">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x3bc;</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, supporting our overall conclusion that predicted heterogeneity in human T cell immigration rates do not appreciably influence clone count distributions. While physiological distributions <inline-formula>
<mml:math display="inline" id="im88">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> do not yield clone counts appreciably different from those of a neutral immigration model, small changes in proliferation rate heterogeneity <italic>w</italic> can significantly affect the clone count structure <inline-formula>
<mml:math display="inline" id="im89">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>. Nonetheless, for completeness, we will perform the full summation over <italic>&#x3b1;<sub>j</sub>
</italic> (Eq.&#xa0;18). All parameters, hyperparameters, and variables used in our modeling and data analysis are listed in <xref ref-type="table" rid="T1">
<bold>Tables&#xa0;1</bold>
</xref>, <xref ref-type="table" rid="T2">
<bold>2</bold>
</xref>.</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>Model variables and their definitions.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top" align="left">variables</th>
<th valign="top" align="center">definition</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">
<italic>Q &#x2208; &#x2124;</italic>
<sup>+</sup> <italic> </italic>
</td>
<td valign="top" align="left">theoretical number of possible TCRs ~10<sup>13</sup> &#x2013; 10<sup>18</sup> (<xref ref-type="bibr" rid="B36">36</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im90">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>N</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x2124;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">number of naive T cells in organism ~10<sup>10</sup> &#x2013; 10<sup>11</sup> (<xref ref-type="bibr" rid="B5">5</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>N</italic>(<italic>t</italic>) <italic>&#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">number of naive T cells from model</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>N</italic>
<sup>&#x2217;</sup> <italic>&#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">steady-state number of naive T cells from model</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>N<sup>s</sup>
</italic> &#x2261; <italic>&#x3b7;N</italic>
<sup>&#x2217;</sup> <italic>&#x2208;</italic> &#x211d;<sup>+ </sup>
</td>
<td valign="top" align="left">subsampled number of naive cells from model</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>N<sub>*</sub> &#x2208;</italic> &#x2124;<sup>+</sup>
</td>
<td valign="top" align="left">number of draws from IGoR/OLGA</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im91">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x2124;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">total number of clones in organism (richness) ~10<sup>6</sup> &#x2013; 10<sup>8</sup> (<xref ref-type="bibr" rid="B36">36</xref>)</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im92">
<mml:mrow>
<mml:msup>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>s</mml:mi>
</mml:msup>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x2124;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">total number of sampled clones (sampled richness)</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>C</italic>(<italic>&#x3b8;</italic>) <italic>&#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">total number of clones in organism from model</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>C<sup>s</sup>
</italic>(<italic>&#x3b8;</italic>,<italic>&#x3b7;</italic>) <italic>&#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">total number of sampled clones from model</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>C<sub>*</sub> &#x2208;</italic> &#x2124;<sup>+</sup>
</td>
<td valign="top" align="left">number of different sequences drawn from IGoR/OLGA</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im93">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x2124;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">discrete number of clones of size <italic>k</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<italic>c<sub>k</sub>
</italic>(<italic>&#x3b8;</italic>) <italic>&#x2208;</italic> &#x211d;<sup>+</sup>
</td>
<td valign="top" align="left">model of number of clones containing <italic>k</italic> cells</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im94">
<mml:mrow>
<mml:msubsup>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x2124;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">discrete number clones of size <italic>k</italic> in sample</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im95">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mi>&#x211d;</mml:mi>
<mml:mo>+</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">modeled number of sampled clones containing <italic>k</italic> cells</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im96">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msubsup>
<mml:mover accent="true">
<mml:mi>c</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mover accent="true">
<mml:mi>C</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2208;</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">fraction of all sampled cells in clones of size <italic>k</italic>
</td>
</tr>
<tr>
<td valign="top" align="left">
<inline-formula>
<mml:math display="inline" id="im97">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2208;</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>
</td>
<td valign="top" align="left">modeled fraction of all sampled cells in clones of size <italic>k</italic>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The variables with <inline-formula>
<mml:math display="inline" id="im98">
<mml:mover accent="true">
<mml:mo>.</mml:mo>
<mml:mo>^</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> denote measured numbers, while populations written as functions of parameters &#x3b8; are those predicted from our model (the dimensionless parameters used in our model are &#x3b8; = {&#x3b1;,r}). The probability distributions <italic>&#x3c0;</italic>(&#x3b1;,r|&#x3b8;<sub>0</sub>) are defined by hyperparameters &#x3b8;<sub>0</sub> (the dimensionless hyperparameters used in this study are <inline-formula>
<mml:math display="inline" id="im99">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>). Upon averaging predicted quantities such as <inline-formula>
<mml:math display="inline" id="im100">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> over &#x3c0;(&#x3b1;, r|&#x3b8;<sub>0</sub>) we find <inline-formula>
<mml:math display="inline" id="im100a">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
</sec>
<sec id="s3" sec-type="results">
<title>Results and Analysis</title>
<p>Before performing a quantitative comparison with measured clone counts from Oakes et al. (<xref ref-type="bibr" rid="B12">12</xref>), we discuss the qualitative features of our model and typical physiological parameter ranges. While even the basic model parameters are difficult to measure, our nondimensionalized model unifies the mechanisms and concepts common to the maintenance of diversity in the T cell repertoire across different organisms.</p>
<p>When considering the data, we observe that even after significant subsampling, there are appreciable clone counts at reasonably large clone sizes <italic>k</italic>, whereas the unsampled clone counts decay exponentially in <italic>k</italic> with rate log(<italic>&#x3bc;</italic>
<sup>&#x2217;</sup>/<italic>r</italic>). Even though <italic>r</italic> may take on a range of values, as determined by &#x3c0;<italic>
<sub>r</sub>
</italic>(<italic>r</italic>), the slowest decay of <italic>c<sub>k</sub>
</italic> arises from the largest possible values of <italic>r</italic>. Thus, a larger proliferation rate heterogeneity <italic>w</italic> will generally yield a longer-tailed <italic>c<sub>k</sub>
</italic>, as illustrated in <xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>. Since the data we analyze are derived from human samples, we will use the following arguments as a rough guide to the relevant range of parameters:</p>
<list list-type="bullet">
<list-item>
<p>The average total number of naive T cells is not completely known but is estimated to be about <italic>N</italic>
<sup>&#x2217;</sup>~ 10<sup>11</sup> (<xref ref-type="bibr" rid="B35">35</xref>). However, the circulating population in the peripheral blood is approximately two orders of magnitude smaller. These circulating naive T cells nonetheless exchange with those in the much larger population in the lymph and other tissues. The timescale of this exchange (relative to the age of the organism being sampled or the intersample times) will determine the effective statistically accessible <italic>N</italic>
<sup>&#x2217;</sup> relevant for sampling clone counts <inline-formula>
<mml:math display="inline" id="im101">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>. We will use an order-of-magnitude estimate on the lower range of measurements and estimate <italic>N</italic>
<sup>&#x2217;</sup>~10<sup>10</sup>&#x2212;10<sup>11</sup>.</p>
</list-item>
<list-item>
<p>The theoretical total possible number <italic>Q</italic> of TCRs of either alpha or beta chains may be in the range 10<sup>13</sup>&#x2212;10<sup>18</sup> (<xref ref-type="bibr" rid="B46">46</xref>), but the actual number of clones with immigration rate <italic>&#x3b1;<sub>i</sub>
</italic> that allows it to be produced even once in a lifetime is more relevant and probably much smaller. Thus, the effective value of <italic>Q</italic> may reside at the lower range, leading to &#x3bb; &#x2261; <italic>N</italic>
<sup>&#x2217;</sup>/<italic>Q</italic> ~ 10<sup>-4</sup>&#x2212;10<sup>-2</sup>.</p>
</list-item>
<list-item>
<p>The average (dimensional) immigration rate per clone <inline-formula>
<mml:math display="inline" id="im102">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> can be deduced from the total thymic output of all clones <inline-formula>
<mml:math display="inline" id="im103">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, which has been estimated across a wide range of values <inline-formula>
<mml:math display="inline" id="im104">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mn>7</mml:mn>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mn>8</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>/day (<xref ref-type="bibr" rid="B29">29</xref>, <xref ref-type="bibr" rid="B47">47</xref>&#x2013;<xref ref-type="bibr" rid="B50">50</xref>). If we use an effective repertoire size of <italic>Q</italic> ~ 10<sup>13</sup>&#x2212;10<sup>14</sup>, the average per clone immigration rate becomes <inline-formula>
<mml:math display="inline" id="im105">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>7</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>/day.</p>
</list-item>
<list-item>
<p>The mean proliferation rate <italic>r</italic> is difficult to measure but has been estimated to be on the order of <inline-formula>
<mml:math display="inline" id="im106">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>/day (<xref ref-type="bibr" rid="B29">29</xref>). If we nondimensionalize using <inline-formula>
<mml:math display="inline" id="im107">
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, the <italic>dimensionless</italic> <inline-formula>
<mml:math display="inline" id="im108">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>The sampling fraction <italic>&#x3b7;</italic>, although in principle determined experimentally, is also hard to quantify due to the uncertainty in <italic>N</italic>
<sup>&#x2217;</sup>. Blood sampling volume fractions from humans are typically <italic>&#x3b7;</italic> ~ 10<sup>-3</sup>; however, in recent experiments (<xref ref-type="bibr" rid="B12">12</xref>) the number of enumerated sequences ~10<sup>5</sup>, which, given rough estimates of effective <italic>N</italic>
<sup>&#x2217;</sup> ~ 10<sup>10</sup>-10<sup>11</sup>, yield <italic>&#x3b7;</italic> ~ 10<sup>-6</sup> - 10<sup>-4</sup>. Due to this uncertainty in <italic>&#x3b7;</italic>, we will explore different fixed values of <italic>&#x3b7;</italic> around 10<sup>-5</sup>.</p>
</list-item>
</list>
<p>Using the above guide for reasonable parameter ranges, we now consider fitting our results in Eqs.&#xa0;18, S9-S14 to some of the available data (<xref ref-type="bibr" rid="B12">12</xref>). Before doing so, note that although the log-log plots shown in <xref ref-type="fig" rid="f1">
<bold>Figures&#xa0;1A, B</bold>
</xref> provide a simple visual for <inline-formula>
<mml:math display="inline" id="im109">
<mml:mrow>
<mml:mi>log</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula>
<mml:math display="inline" id="im110">
<mml:mrow>
<mml:mi>log</mml:mi>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, fitting must be performed on the linear scale. The measured data includes data at values of <italic>k</italic> for which no clones were detected so that <inline-formula>
<mml:math display="inline" id="im111">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. These data points nonetheless should be included in the fitting as they represent realizations of the system. However, on the log scale these zero data points translate to <inline-formula>
<mml:math display="inline" id="im112">
<mml:mrow>
<mml:mi>log</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>&#x2192;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x221e;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> so numerical fitting on the log-log scale could be misleading once a value of <inline-formula>
<mml:math display="inline" id="im113">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> is encountered. Thus, we will fit our mean-field model on the linear scale to the <italic>fraction</italic> <inline-formula>
<mml:math display="inline" id="im114">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> of the total number of sampled cells that are in clones of size <italic>k</italic></p>
<disp-formula>
<label>(19)</label>
<mml:math display="block" id="im115">
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2261;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>&#x2113;</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mfrac>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>Q</mml:mi>
<mml:mi>&#x3b7;</mml:mi>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>where the denominator <italic>Q&#x3b7;&#x3bb;</italic> comes directly from the definition <inline-formula>
<mml:math display="inline" id="im116">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mi>&#x2113;</mml:mi>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>&#x2113;</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mstyle>
<mml:mo>&#x2261;</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, the sampling relation <italic>N<sup>s</sup>
</italic> = <italic>&#x3b7;N</italic>
<sup>&#x2217;</sup>, and Eq.&#xa0;6. Note that we have switched the dependence from <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> to <italic>&#x3bb;</italic> (see Eq.&#xa0;14). Rather than using <italic>N<sup>s</sup>
</italic> directly from the number of reads in an experimental sample, equivalently, we use the model expression <italic>N<sup>s</sup> </italic>= <italic>Q&#x3b7;&#x3bb;</italic> to arrive at the last equality in Eq.&#xa0;19. This form ensures strict normalization and is independent of the unknown repertoire size <italic>Q</italic> since <inline-formula>
<mml:math display="inline" id="im117">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is proportional to <italic>Q</italic>. The implicit factor of <italic>Q</italic> in <inline-formula>
<mml:math display="inline" id="im118">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> from Eq.&#xa0;11 cancels the explicit <italic>Q</italic> in the denominator of Eq.&#xa0;19 so that <inline-formula>
<mml:math display="inline" id="im119">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> as well as <inline-formula>
<mml:math display="inline" id="im120">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo stretchy="false">/</mml:mo>
<mml:msup>
<mml:mi>C</mml:mi>
<mml:mi>s</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> depend on <italic>Q</italic> only through the determination of <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> through <italic>&#x3bb; &#x2261; N</italic>
<sup>&#x2217;</sup>/<italic>Q</italic> in Eq.&#xa0;6.</p>
<p>Our mathematical framework provides only <italic>mean</italic> sampled clone counts while each sample of the data represents one realization. Large sample-to-sample variations in the clone counts would render the fitting less informative, but these large variations were not seen in the triplicate samples in Oakes et al. (<xref ref-type="bibr" rid="B12">12</xref>). Mechanistically, we expect that for large <italic>k</italic> the number of cells contributing to <inline-formula>
<mml:math display="inline" id="im121">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is also large so demographic stochasticity is relatively small and results in small uncertainties in the value of <italic>k</italic>, and not in the magnitude of <inline-formula>
<mml:math display="inline" id="im122">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>. Large clones are also likely to include memory T cells that have been produced after antigen stimulation of specific clones. Memory T cells are difficult to accurately distinguish from naive T cells (<xref ref-type="bibr" rid="B12">12</xref>) but we will see that large <italic>k</italic> components of <inline-formula>
<mml:math display="inline" id="im123">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> negligibly influence the fitting. We can now compare our model <inline-formula>
<mml:math display="inline" id="im124">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> with the data <inline-formula>
<mml:math display="inline" id="im125">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> (data) by constructing the error</p>
<disp-formula>
<label>(20)</label>
<mml:math display="block" id="M18">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>data</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mo>|</mml:mo>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
</disp-formula>
<p>and exploring how it depends on the parameters <inline-formula>
<mml:math display="inline" id="im126">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and sampling fraction &#x3b7;. Our goal is to find relationships among the parameters <inline-formula>
<mml:math display="inline" id="im127">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, and <italic>w</italic> that minimize <inline-formula>
<mml:math display="inline" id="im128">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>In <xref ref-type="fig" rid="f6">
<bold>Figures&#xa0;6A&#x2013;C</bold>
</xref> the data <inline-formula>
<mml:math display="inline" id="im136">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) were derived from the average of three samples of beta chain CD4 sequences from one patient (<xref ref-type="bibr" rid="B12">12</xref>). These data, were used to compute and plot the error <inline-formula>
<mml:math display="inline" id="im137">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> as a function of <italic>&#x3bb;</italic> for various values of <inline-formula>
<mml:math display="inline" id="im138">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> using the neutral model (<italic>w</italic> = 0, Eq.&#xa0;S9 in Section 2 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>). For reasonable values of dimensionless <inline-formula>
<mml:math display="inline" id="im139">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x2248;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and sampling fractions <italic>&#x3b7;</italic> = 10<sup>-4</sup>, 10<sup>-5</sup>, and 10<sup>-6</sup>, we find that the value of <italic>&#x3bb;</italic> that minimizes <inline-formula>
<mml:math display="inline" id="im140">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, <italic>&#x3bb;</italic>
<sub>min</sub>, is typically &#x1d4aa;(1) or larger. In <xref ref-type="fig" rid="f6">
<bold>Figures&#xa0;6D&#x2013;F</bold>
</xref> we use the full-width distribution <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic> = 1) to show the error for the same data using the same sampling fractions <italic>&#x3b7;</italic> = 10<sup>-4</sup>, 10<sup>-5</sup>, 10<sup>-6</sup>. Note that the values of <italic>&#x3bb;</italic>
<sub>min</sub> are significantly smaller than those in found using <italic>w</italic> = 0 in <xref ref-type="fig" rid="f6">
<bold>Figures&#xa0;6A&#x2013;C</bold>
</xref> and that the results are rather insensitive to the sampling fraction <italic>&#x3b7;</italic>. These smaller values of <italic>&#x3bb;</italic>
<sub>min</sub> are more consistent with known physiological understanding. Thus, the distributed proliferation rate model provides a much more self-consistent fit to the data than the fixed proliferation rate neutral model. <xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref> also reveals that the values of <italic>H</italic> along the minimum valley are nearly constant, only slightly decreasing as <inline-formula>
<mml:math display="inline" id="im141">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x2192;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. For each value of <italic>&#x3b1;</italic> we can identify the corresponding <italic>&#x3bb;</italic>
<sub>min</sub> that minimizes <italic>H</italic>. However since the values of <inline-formula>
<mml:math display="inline" id="im142">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> for each <inline-formula>
<mml:math display="inline" id="im143">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> pair do not change appreciably, we cannot independently determine both.</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>The error <inline-formula>
<mml:math display="inline" id="im129">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> plotted as a function of <italic>&#x3b1;</italic> (on a log<sub>10</sub> scale) and <italic>&#x3bb;</italic>. Darker colors represent smaller values of error as shown by the scale bar on the right. The data used are the clone counts of beta chain sequences of naive CD4 cells from one patient, averaged over three samples. Panels <bold>(A&#x2013;C)</bold> use the simple neutral model (Eqs.&#xa0;S9 and S10) and sampling fractions &#x3b7; = 10<sup>-4</sup>, 10<sup>-5</sup>, and 10<sup>-6</sup>, respectively. Since <inline-formula>
<mml:math display="inline" id="im130">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is on a log scale, the error is minimal along a line <inline-formula>
<mml:math display="inline" id="im131">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x221d;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>; the error does not change appreciably along this path and only slightly decreases as <italic>&#x3bb;</italic> and <inline-formula>
<mml:math display="inline" id="im132">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> become smaller. For the neutral model (w = 0), the error is very sensitive to the sampling fraction <italic>&#x3b7;</italic>. Here, a fixed, physiologically reasonable value of <inline-formula>
<mml:math display="inline" id="im133">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> results in a minimizing <italic>&#x3bb;</italic>
<sub>min</sub> that is unreasonably large, in excess of one and that does not agree well with our expectations of <inline-formula>
<mml:math display="inline" id="im134">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
<mml:mo stretchy="false">/</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mo>&#x226a;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Panels <bold>(D&#x2013;F)</bold> show results for the distributed proliferation rate model at full width (<italic>w</italic> = 1). In this case, the errors are insensitive to the specific choice of &#x3b7; and the minimizing <italic>&#x3bb;</italic>
<sub>min</sub> values are much smaller, consistent with our estimates of <italic>N</italic>
<sup>&#x2217;</sup> and repertoire size. For <italic>w</italic> = 1, the values of the errors <italic>H</italic> are also smaller along the <inline-formula>
<mml:math display="inline" id="im135">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> minimum valley.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g006.tif"/>
</fig>
<p>An alternate representation is shown in <xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref> where the relationship between <inline-formula>
<mml:math display="inline" id="im140a">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> and <italic>&#x3bb;</italic>
<sub>min</sub> is seen to be approximately linear for both the neutral model (<italic>w</italic> = 0) and the heterogeneous, full-width model (<italic>w</italic> = 1). The color shading represents the corresponding value of <inline-formula>
<mml:math display="inline" id="im145">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. One major observation is that the full-width case yields values of <inline-formula>
<mml:math display="inline" id="im146">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> that are closer to measured and expected physiological values and that these results are also less sensitive to <italic>&#x3b7;</italic> compared to those of the neutral case. On the other hand, although the variation in <italic>H</italic> is negligible across <inline-formula>
<mml:math display="inline" id="im147">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> in both cases, the fully heterogeneous model (<italic>w</italic> = 1) carries a slightly larger error than the neutral one (w = 0). This is solely a consequence of our use of <inline-formula>
<mml:math display="inline" id="im148">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> which weights the small <italic>k</italic> values significantly more in the fitting.</p>
<fig id="f7" position="float">
<label>Figure&#xa0;7</label>
<caption>
<p>Log-log plots of <italic>&#x3bb;</italic>
<sub>min</sub> values as functions of <italic>&#x3b1;</italic> for <italic>&#x3b7;</italic> = 10<sup>-4</sup>, 10<sup>-5</sup>, 10<sup>-5</sup>, and 10<sup>-6</sup> for <bold>(A)</bold> the neutral model, <italic>w</italic> = 0, and <bold>(B)</bold> the full-width distributed proliferation rate model, <italic>w</italic> = 1. These curves trace the values of <italic>&#x3bb;</italic>
<sub>min</sub> along the minimum valley in and show the relative insensitivity of the distributed proliferation rate model to the subsampling fraction <italic>&#x3b7;</italic>. In both <bold>(A, B)</bold>, the minimum line slopes are near one, with <bold>(B)</bold> showing a slightly greater slope, indicating <italic>&#x3bb;</italic>
<sub>min</sub> is approximately proportional to <inline-formula>
<mml:math display="inline" id="im144a">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> over the entire range of <italic>w</italic>. The color intensity along the lines in <bold>(A, B)</bold> indicates variation in the total error along the minimum valley; their uniformity shows that the errors are nearly constant along each line. <bold>(C)</bold> Log-linear plot of <italic>&#x3bb;</italic>
<sub>min</sub> as a function of proliferation rate heterogeneity <italic>w</italic> for <inline-formula>
<mml:math display="inline" id="im144">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. The lower darker curves in each pair correspond to <italic>&#x3b7;</italic> = 10<sup>-4</sup> while the lighter curves correspond to <italic>&#x3b7;</italic> = 10<sup>-6</sup>. The curves show that even a small heterogeneity <italic>w</italic> quickly reduces <italic>&#x3bb;</italic>
<sub>min</sub> to below one; however, if <italic>&#x3bb;</italic> is forced to be even smaller, the required heterogeneity <italic>w</italic> increases.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g007.tif"/>
</fig>
<p>Since experimentally we expect small <italic>&#x3bb;</italic>, we also investigate whether small errors <italic>H</italic> emerge for values of <inline-formula>
<mml:math display="inline" id="im149">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3bb;</mml:mi>
<mml:mrow>
<mml:mi>min</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x226a;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> at intermediate 0 &lt; <italic>w</italic> &lt; 1. In <xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7C</bold>
</xref>, we plot <italic>&#x3bb;</italic>
<sub>min</sub> as a function of <italic>w</italic> for various values of <inline-formula>
<mml:math display="inline" id="im150">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>. Note that even small <italic>w</italic> significantly reduces, relative to the neutral case, the corresponding <italic>&#x3bb;</italic>
<sub>min</sub>. However, if our target is <italic>&#x3bb;</italic>
<sub>min</sub> &#x223c; 10<sup>-4</sup>-10<sup>-3</sup>, the required <italic>w</italic> can become quite large. These results indicate that more heterogeneity is associated with more realistic values of the experimentally observed values of <italic>N</italic>
<sup>&#x2217;</sup>/<italic>Q</italic>.</p>
<p>Finally, to explore the dependence of the error on the proliferation rate heterogeneity <italic>w</italic>, we fix <inline-formula>
<mml:math display="inline" id="im151">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and <italic>&#x3b7;</italic>, and plot <inline-formula>
<mml:math display="inline" id="im152">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> as a function of <italic>w</italic>. <xref ref-type="fig" rid="f8">
<bold>Figure&#xa0;8</bold>
</xref> shows that the <italic>H</italic>-minimizing <italic>w</italic> is very sensitive to <inline-formula>
<mml:math display="inline" id="im156">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>: for fixed <italic>&#x3b7;</italic>, as <inline-formula>
<mml:math display="inline" id="im157">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> is decreased the error is lowest for larger proliferation heterogeneity <italic>w</italic>. The minimum value of <inline-formula>
<mml:math display="inline" id="im158">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, however, is rather insensitive to <inline-formula>
<mml:math display="inline" id="im159">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> for all chosen <italic>&#x3b7;</italic>. Hence, near-optimal solutions with <inline-formula>
<mml:math display="inline" id="im160">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x226a;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> can be found when the proliferation rate heterogeneity <italic>w</italic> is appreciable. Using the parameters associated with the minima in <xref ref-type="fig" rid="f8">
<bold>Figure&#xa0;8A</bold>
</xref> (<italic>&#x3b7;</italic> = 10<sup>-4</sup>), we plot our predicted <inline-formula>
<mml:math display="inline" id="im161">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> against the data <inline-formula>
<mml:math display="inline" id="im162">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) in <xref ref-type="fig" rid="f9">
<bold>Figure&#xa0;9</bold>
</xref>. As can be seen, when proliferation rate heterogeneity is allowed, the best-fits have small error and are found using realistic values, <inline-formula>
<mml:math display="inline" id="im167">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x226a;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Note that most of the information in the data lies in how <inline-formula>
<mml:math display="inline" id="im168">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) decreases over the first few values of <italic>k</italic>. The neutral model (<italic>w</italic> = 0) fits best for small values of <italic>k</italic>, but the corresponding values of <italic>&#x3bb;</italic> and <inline-formula>
<mml:math display="inline" id="im169">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> are too large and small, respectively. The goodness of fit of our model to the data depends mostly on the predicted initial decreases in <inline-formula>
<mml:math display="inline" id="im170">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. The constraints among the parameters <inline-formula>
<mml:math display="inline" id="im171">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and <italic>&#x3b7;</italic> derived from our model and can be applied to different clone counts such as the data shown in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>. However, due to the ill-conditioning when <italic>&#x3b7;</italic> &#x226a; 1, the differences in these constraints across different data sets do not vary appreciably are only quantitatively different. Generally, the more rapidly decaying a clone count, the smaller the <italic>w</italic>, the smaller the <italic>&#x3b7;</italic>, of the larger the <italic>&#x3bb;</italic>, all else being equal.</p>
<fig id="f8" position="float">
<label>Figure&#xa0;8</label>
<caption>
<p>The error <inline-formula>
<mml:math display="inline" id="im153">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>,</mml:mo>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>&#x3b7;</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> using CD4 alpha data from Oakes et&#xa0;al. (<xref ref-type="bibr" rid="B12">12</xref>) plotted as a function of <italic>w</italic> for various <inline-formula>
<mml:math display="inline" id="im154">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>. We fixed <italic>&#x3bb;</italic> = 10<sup>-3</sup> and varied, from left to right, <inline-formula>
<mml:math display="inline" id="im155">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> (red), 6 &#xd7; 10<sup>-5</sup> (green), 10<sup>-4</sup> (blue) and 1.4 &#xd7; 10<sup>-4</sup> (black). From <bold>(A&#x2013;C)</bold>, <italic>&#x3b7;</italic> = 10<sup>-4</sup>, 10<sup>-5</sup>, and 10<sup>-6</sup>. Smaller values of <inline-formula>
<mml:math display="inline" id="im201">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> result in larger best-fit values of <italic>w</italic>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g008.tif"/>
</fig>
<fig id="f9" position="float">
<label>Figure&#xa0;9</label>
<caption>
<p>Plots of the representative optimal solutions of clone counts <inline-formula>
<mml:math display="inline" id="im163">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> from Eq.&#xa0;19 (using <italic>&#x3b7;</italic> = 10<sup>-4</sup> and &#x3bb; = 10<sup>-3</sup> unless otherwise indicated) plotted along side the shown data from Oakes et al. (<xref ref-type="bibr" rid="B12">12</xref>). The model predictions and CD4 beta chain data are shown in both <bold>(A)</bold> log-log and <bold>(B)</bold> linear scales (there are no zero-values clone counts in this dataset). In <bold>(A)</bold>, the best fit model for the neutral model (<italic>w</italic> = 0 and <inline-formula>
<mml:math display="inline" id="im164">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>) using <inline-formula>
<mml:math display="inline" id="im165">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is given by &#x3bb; &#x2248; 3 shown by the solid black curve. The dashed curves represents best-fit curves using the values associated with the error minima in, where <inline-formula>
<mml:math display="inline" id="im166">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mn>0</mml:mn>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, <italic>w</italic> &#x2248; 0.09 (red), 6 &#xd7; 10<sup>-5</sup>, <italic>w</italic> &#x2248; 0.3 (green), 10<sup>-4</sup>, <italic>w</italic> &#x2248; 0.53 (blue) and 1.4 &#xd7; 10<sup>-4</sup>, <italic>w</italic> &#x2248; 0.76 (black). Note that the neutral model fits well for only the first 2-3 <italic>k</italic>-points, while the heterogeneous model (<italic>w &gt; 0</italic>) fits better at larger <italic>k</italic>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fimmu-12-735135-g009.tif"/>
</fig>
</sec>
<sec id="s4" sec-type="discussion">
<title>Discussion</title>
<p>Here, we review and justify a number of critical biological assumptions and mathematical approximations used in our analysis. The effects of relaxing our approximations are also discussed.</p>
<sec id="s4_1">
<title>Distinct T Cell Components</title>
<p>It is known that naive T cells can change in time, with recent thymic emigrants evolving into mature naive T cells that carry different proliferation and death rates (<xref ref-type="bibr" rid="B51">51</xref>). For simplicity, we have assumed a single naive T cell compartment. To incorporate naive T cell evolution, we can allow the distribution <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>) to evolve in time to reflect the relative abundances of T cell subpopulations, or, one can explicitly include multiple compartments, with cells from a recent emigrant compartment transitioning into a mature compartment. Each compartment would be described by its own steady-state death rates, clone counts, and distributions of proliferation rates. An analysis of a related sequential cell state transition model has been developed for clonal tracking in hematopoiesis (<xref ref-type="bibr" rid="B41">41</xref>).</p>
</sec>
<sec id="s4_2">
<title>Factorization of &#x3c0; (&#x3b1;, r)</title>
<p>For mathematical tractability, we have assumed <inline-formula>
<mml:math display="inline" id="im172">
<mml:mrow>
<mml:mi>&#x3c0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>&#x3b8;</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>r</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>|</mml:mo>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Given the typical physiological values of <inline-formula>
<mml:math display="inline" id="im173">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>, the clone count formulae derived from our model can be accurately approximated by a single value of <inline-formula>
<mml:math display="inline" id="im174">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>. Thus, we expect that the immigration rate distribution can be approximated by <inline-formula>
<mml:math display="inline" id="im175">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>&#x3b4;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. This allows further approximation of our formulae as shown in Section 3 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>. In Section 4 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>, we explicitly show that factorisation is an accurate approximation.</p>
<p>We have also assumed that selection is uncorrelated with the generation probabilities of the TCR nucleotide sequences encoded in IGoR/OLGA. The assumption is that the recombination statistics are uncorrelated with the statistics of thymic selection, a process that is based on TCR amino acid sequences. However, we note that it has been suggested that selection pressure may induce a correlation between TCRs generated and selected (<xref ref-type="bibr" rid="B52">52</xref>). The corresponding statistics of the frequencies of <italic>selected</italic> TCRs would be modified from those of the <italic>generated</italic> TCRs shown in <xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>. Nonetheless, we assume that the resulting distribution can still be approximated by a single-<italic>&#x3b1;</italic> model which will not qualitatively alter our conclusions.</p>
</sec>
<sec id="s4_3">
<title>Mean-Field Approximation</title>
<p>Our mean-field approximation for the mean clone count <italic>c<sub>k</sub>
</italic> is embodied in Eq.&#xa0;7, where correlations between fluctuations in the total population <inline-formula>
<mml:math display="inline" id="im176">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>=</mml:mo>
<mml:mstyle displaystyle="true">
<mml:msub>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mi>k</mml:mi>
</mml:mstyle>
<mml:msub>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> in the regulation term <italic>&#x3bc;</italic>(<italic>N</italic>) and the explicit <italic>c<sub>k</sub>
</italic> terms are neglected. This approximation has been shown to be accurate for <italic>k</italic> &#x2272; <italic>N</italic>
<sup>&#x2217;</sup> when <inline-formula>
<mml:math display="inline" id="im177">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:msup>
<mml:mi>Q</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&gt;</mml:mo>
<mml:mi>&#x3bc;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>N</mml:mi>
<mml:mo>&#x2217;</mml:mo>
</mml:msup>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B39">39</xref>). The mean-field results overestimate the clone counts for <italic>k</italic> &#x2272; <italic>N</italic>
<sup>&#x2217;</sup>. Moreover, when the total steady-state T cell immigration rate is extremely small, the effects of competitive exclusion dominate and a single large clone arises (<xref ref-type="bibr" rid="B39">39</xref>, <xref ref-type="bibr" rid="B53">53</xref>, <xref ref-type="bibr" rid="B54">54</xref>). Nonetheless, an accurate approximation for the steady-state clone abundance <italic>c<sub>k</sub>
</italic> can be obtained using a variation of the two-species Moran model as shown in (<xref ref-type="bibr" rid="B39">39</xref>). For the naive T cell system, because <italic>Q</italic> is so large, the mean immigration rate <inline-formula>
<mml:math display="inline" id="im178">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is such that competitive exclusion is not a dominant feature. Moreover, since <italic>N</italic>
<sup>&#x2217;</sup> &#x2273; 10<sup>10</sup>, clones counts at comparable sizes are not observed and predicted to be negligible in all models. Since the values of <inline-formula>
<mml:math display="inline" id="im179">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) become exponentially smaller for large <italic>k</italic>, our inference is most sensitive to the values of <inline-formula>
<mml:math display="inline" id="im180">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) for small to modest <italic>k</italic>. The information in the data is primarily manifested by how the <inline-formula>
<mml:math display="inline" id="im181">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) decays in <italic>k</italic>, we before the mean-field approximation deviates from the exact solution. Thus, the parameters associated with the human adaptive immune system satisfy the conditions for the mean-field approximation to be accurate, justifying its use in the BDI model.</p>
</sec>
<sec id="s4_4">
<title>Steady State Assumption</title>
<p>In this study, we only considered the steady state of our birth-death-immigration model in Eq. 8 because this limit allowed relatively easy derivations of analytical results. This was also the strategy for previous modeling work (<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B38">38</xref>, <xref ref-type="bibr" rid="B39">39</xref>). However, the per-clone immigration and proliferation times may be on the order of months or years, a time scale over which thymic output diminishes as an individual ages (<xref ref-type="bibr" rid="B29">29</xref>, <xref ref-type="bibr" rid="B55">55</xref>&#x2013;<xref ref-type="bibr" rid="B57">57</xref>). Indeed, clone abundance distributions have been shown to show specific patterns as a function of age (<xref ref-type="bibr" rid="B58">58</xref>&#x2013;<xref ref-type="bibr" rid="B60">60</xref>). Although <italic>N</italic>(<italic>t</italic>), with fixed <inline-formula>
<mml:math display="inline" id="im182">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math display="inline" id="im183">
<mml:mover accent="true">
<mml:mi>r</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> relaxes to steady-state quickly, on a timescale of months, the different subpopulations of specific sizes described by their number <italic>c<sub>k</sub>
</italic> relax to quasi-steady-state across a spectrum of time scales depending on the clone sizes <italic>k</italic> (<xref ref-type="bibr" rid="B39">39</xref>, <xref ref-type="bibr" rid="B61">61</xref>). The timescales of relaxation of the largest clones can be estimated from the eigenvalues of the linearized system (Eqs.&#xa0;7) and are found to be ~ 10 years. Thymic involution could be modeled by using a time-dependent <italic>&#x3b1;</italic>(<italic>t</italic>) that slowly decreases with age (<xref ref-type="bibr" rid="B57">57</xref>). Although T cells are thought to be primarily maintained through proliferation, thymic regeneration has also been shown to affect the naive T cell pool many years after thymectomy in infants. Here, a time dependent increase in <italic>&#x3b1;</italic>(<italic>t</italic>) after early thymectomy could be used. Indeed, the clone counts may be determined in early life (<xref ref-type="bibr" rid="B17">17</xref>) suggesting the dynamics of certain clones may be very slow, precluding a strict steady-state analysis for the entire repertoire.</p>
<p>In addition to time-dependent changes in <italic>&#x3b1;</italic>, more subtle time-inhomogeneities such as changes in proliferation and death rates have been demonstrated (<xref ref-type="bibr" rid="B55">55</xref>, <xref ref-type="bibr" rid="B56">56</xref>, <xref ref-type="bibr" rid="B62">62</xref>). Thus, our steady-state assumption could be relaxed by incorporation of time-dependent perturbations to the model parameters <italic>&#x3bc;</italic>
<sup>&#x2217;</sup> and/or &#x3c0;(<italic>&#x3b1;, r</italic>). Longitudinal measurements of clone abundances or experiments involving time-dependent perturbations would provide significant insight into the overall dynamics of clone abundances. The timescales required to reach steady state fall between 1/(<inline-formula>
<mml:math display="inline" id="im184">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>Q</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>) and <inline-formula>
<mml:math display="inline" id="im185">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">/</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>. Thus, it is possible that some components of <italic>c<sub>k</sub>
</italic> does not reach steady state in an organism&#x2019;s lifetime and our steady state model might not be be valid for all values of <italic>c<sub>k</sub>
</italic> (<xref ref-type="bibr" rid="B57">57</xref>, <xref ref-type="bibr" rid="B61">61</xref>) and a dynamic approach must be taken.</p>
</sec>
<sec id="s4_5">
<title>Clustered Immigration</title>
<p>Our mean field model assumed that each immigration event introduced a single naive T cell in the immune system. However, T cells can divide before leaving the thymus and reach a homeostatic state in the periphery. This process can be described by the simultaneous immigration of more than one naive T cell with the same TCR. Clustered immigration of <italic>q</italic> cells can be implemented in the core model for <italic>c<sub>k</sub>
</italic> (Eq.&#xa0;7) via an immigration term of the form <italic>&#x3b1;<sub>q</sub>
</italic>(<italic>c<sub>k-q</sub>
</italic>(<italic>&#x3b1;<sub>q</sub>
</italic>, <italic>r</italic>)-<italic>c<sub>k</sub>
</italic>(<italic>&#x3b1;<sub>q</sub>
</italic>, r)), where <italic>c<sub>k-q</sub> </italic>= 0 for <italic>k-q &lt; 0</italic> (see Section 5 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>). For <italic>q</italic> &gt; 1, an informative analytic expression for <italic>c<sub>k</sub>
</italic> is not available. In Figure S2 of the Section 5 of the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>, we show the predicted clone abundance <italic>c<sub>k</sub>
</italic> for a neutral model in which <italic>q</italic> = 5. When compared to the case where there is only one cell per immigration, the clone abundance <italic>c<sub>k</sub>
</italic> will have a larger slope for <italic>k</italic> &#x2a85; <italic>q</italic>, making it kink more downward near <italic>k &#x2248; q</italic>. Thus, from <xref ref-type="supplementary-material" rid="SM1">
<bold>Figures S2</bold>
</xref> and <xref ref-type="fig" rid="f9">
<bold>9A</bold>
</xref>, we can see that paired immigration (<italic>q</italic> = 2) would increase <inline-formula>
<mml:math display="inline" id="im186">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> for <italic>k</italic> = 2, providing an improved fitting to data over single copy immigration (<italic>q</italic> = 1).</p>
<p>Thus, in addition to appreciable sensitivity of the predicted clone counts to &#x3c0;<italic>
<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>), we also expect clustered immigration defined through the immigration rates &#x3b1;<italic>
<sub>q</sub>
</italic>, q &gt; 1 to control the goodness of fit to data. Indeed, <xref ref-type="supplementary-material" rid="SM1">
<bold>Figure S2</bold>
</xref> suggests that the distribution of immigration cluster sizes <italic>q</italic>, in addition to the proliferation rate heterogeneity <italic>w</italic>, is an important determinant of measured clone counts and that <italic>&#x3b1;<sub>q</sub>
</italic> may be constrained by data. We leave this for future investigation.</p>
</sec>
<sec id="s4_6">
<title>General Conclusions</title>
<p>We developed a heterogeneous multispecies birth-death-immigration model and analyzed it in the context of T cell clonal heterogeneity; the clone abundance distribution is derived in the mean-field limit. Unlike previous studies (<xref ref-type="bibr" rid="B4">4</xref>), our modeling approach incorporated sampling statistics and provided simple formulae, allowing us to predict clone abundances under different rate distributions for arbitrarily large systems (<italic>N</italic>
<sup>&#x2217;</sup> &#x223c; 10<sup>10</sup> - 10<sup>11</sup>), without the need for simulation. The properties of the BDI model and the overall shape of the sampled clone count data renders the first few <italic>k</italic>-values of <inline-formula>
<mml:math display="inline" id="im187">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula>
<mml:math display="inline" id="im188">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> the most important for determining the constraints among the model parameters. In other words, only the initial rate of the decrease in <inline-formula>
<mml:math display="inline" id="im189">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>(data) for small <italic>k</italic> governs the quality of fitting to the model, and one should not expect to be able to explicitly infer more than one or two free parameters.</p>
<p>Our heterogeneous BDI model produced mean sampled clone count distributions that we could directly compare with measured clone counts. The unsampled clone counts <italic>c<sub>k</sub>
</italic> of the neutral model (homogeneous <italic>&#x3b1;</italic> and <italic>r</italic>) follow a negative binomial distribution which is further modified upon sampling and distribution over the heterogeneous immigration and proliferation rates. Although we determined <inline-formula>
<mml:math display="inline" id="im190">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> through a code that implemented recombination statistics inferred from cDNA and gDNA sequences (<xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B28">28</xref>), we found that the behavior of the model is rather insensitive to distributions <inline-formula>
<mml:math display="inline" id="im191">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>&#x3b1;</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> with mean values <inline-formula>
<mml:math display="inline" id="im192">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> much smaller than the largest proliferation rates <italic>r</italic>. The model results are dominated by many low immigration-rate clones and a model that replaces <italic>&#x3b1;</italic> with its mean value <inline-formula>
<mml:math display="inline" id="im193">
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is sufficient.</p>
<p>Conversely, we find that the shape of the clone count profiles <italic>c<sub>k</sub>
</italic> are quite sensitive to the proliferation rate heterogeneity <italic>w</italic>. A small amount of heterogeneity quickly reduces the best-fit values of <italic>&#x3bb;</italic> to reasonable values. For estimated values <italic>&#x3b7;</italic> ~ 10<sup>-6</sup> &#x2013; 10<sup>-4</sup>, <inline-formula>
<mml:math display="inline" id="im194">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mo>&#x223c;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, and small values of <italic>&#x3bb;</italic> = <italic>N</italic>
<sup>&#x2217;</sup>/<italic>Q</italic> &#x2272; 10<sup>-3</sup>, requires a best-fit width <italic>w</italic> &#x2248; 1. Heterogeneity is needed to generate clones of sufficiently large size that persist after sampling. Although the number of TCR clones with large proliferation rates <italic>r</italic> may be small, such clones proliferate more rapidly contributing to higher clone counts at larger sizes. In particular, we found that the shape of expected clone abundance is sensitive to the behavior of the proliferation rate distribution near the maximum dimensional proliferation rate <italic>R, &#x3c0;<sub>r</sub>
</italic>(<italic>r</italic> &#x2248; <italic>R</italic>). The predicted clone counts are also modestly sensitive to the distribution of immigration cluster sizes <italic>q</italic> (representing transient proliferation just before thymic output). When <italic>q</italic> &gt; 1 cells of a clone are simultaneously exported by the thymus, the predicted mean clone counts decay much more slowly for small <italic>k</italic> &#x2272; <italic>q</italic> (see <xref ref-type="supplementary-material" rid="SM1">
<bold>Figure S2</bold>
</xref>). This modification will allow for better fitting since clustered immigration increases the predicted clone counts for larger <inline-formula>
<mml:math display="inline" id="im195">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn>
<mml:mi>s</mml:mi>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mn>3</mml:mn>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, etc., and eventually <inline-formula>
<mml:math display="inline" id="im196">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mn>2</mml:mn>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math display="inline" id="im197">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mn>3</mml:mn>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>, etc. Thus, we expect that a model containing multiple clustered immigration rates <italic>&#x3b1;<sub>q&#x2265;1</sub>
</italic> will lower the error and provide better fitting, particularly at larger <italic>w</italic>. Additional analysis using a distribution of immigration cluster sizes may allow this type of clone count data to reveal more information about the physiological mechanism of naive T cell maintenance.</p>
<p>Even assuming modest heterogeneity, our work leads to the conclusion that the typical immigration heterogeneity is not enough to influence measured clone counts and that varying levels of proliferation heterogeneity is needed to shape <inline-formula>
<mml:math display="inline" id="im198">
<mml:mrow>
<mml:msubsup>
<mml:mi>c</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> (and <inline-formula>
<mml:math display="inline" id="im199">
<mml:mrow>
<mml:msubsup>
<mml:mi>f</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>s</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>) (<xref ref-type="bibr" rid="B12">12</xref>). These results are consistent with the finding that naive T cells in humans are maintained by proliferation rather than thymic output (<xref ref-type="bibr" rid="B9">9</xref>). Since we have only investigated the effects of a uniform distribution for <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>|<italic>w</italic>), further studies using more complex shapes of <italic>&#x3c0;</italic>(<italic>&#x3b1;</italic>, <italic>r</italic>|<italic>&#x3b8;</italic>
<sub>0</sub>) can be easily explored numerically using our modeling framework. Different parameter values and rate distributions appropriate for mice, in which naive T cells are maintained by thymic output, should also be explored within our modeling framework. Finally, it will be important to extend our steady-state model to allow <italic>&#x3b1;</italic>(t), <italic>&#x3c0;<sub>r</sub>
</italic>(<italic>r</italic>,<italic>t</italic>), and <italic>&#x3bc;</italic>
<sup>&#x2217;</sup>(<italic>t</italic>) to be functions of time in order to predict clone abundances in the presence of thymic involution and reduced proliferation with age (<xref ref-type="bibr" rid="B62">62</xref>, <xref ref-type="bibr" rid="B63">63</xref>), which can even arise differentially in different compartments (<xref ref-type="bibr" rid="B64">64</xref>).</p>
</sec>
</sec>
<sec id="s5" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>Publicly available datasets were analyzed in this study. This data can be found here: <uri xlink:href="https://www.frontiersin.org/articles/10.3389/fimmu.2017.01267/full#">https://www.frontiersin.org/articles/10.3389/fimmu.2017.01267/full</uri>.</p>
</sec>
<sec id="s6" sec-type="author-contributions">
<title>Author Contributions</title>
<p>RD, TC, and MRD developed and analyzed the model and wrote the manuscript. YP organized published data, and DM assisted in sorting and organizing generated data. TC, YP, and MX performed numerical analyses and data fitting. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s7" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by grants from the NIH through grant R01HL146552 (TC), the Army Research Office through grant W911NF-18-1-0345 (MRD), the NSF through grants DMS-1814364 (TC) and DMS-1814090 (MRD). The authors also thank the Collaboratory in Institute for Quantitative and Computational Biosciences at UCLA for support to RD.</p>
</sec>
<sec id="s8" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="s10" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fimmu.2021.735135/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fimmu.2021.735135/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Presentation_1.pdf" id="SM1" mimetype="application/pdf"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qi</surname> <given-names>Q</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Cheng</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Glanville</surname> <given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>D</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>JY</given-names>
</name>
<etal/>
</person-group>. <article-title>Diversity and Clonal Selection in the Human T-Cell Repertoire</article-title>. <source>Proc Natl Acad Sci (USA)</source> (<year>2014</year>) <volume>111</volume>:<page-range>13139&#x2013;44</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1409155111</pub-id>
</citation>
</ref>
<ref id="B2">
<label>2</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van den Broek</surname> <given-names>T</given-names>
</name>
<name>
<surname>Borghans</surname> <given-names>JAM</given-names>
</name>
<name>
<surname>van Wijk</surname> <given-names>F</given-names>
</name>
</person-group>. <article-title>The Full Spectrum of Human Naive T Cells</article-title>. <source>Nat Rev Immunol</source> (<year>2018</year>) <volume>18</volume>:<page-range>363&#x2013;73</page-range>. doi: <pub-id pub-id-type="doi">10.1038/s41577-018-0001-y</pub-id>
</citation>
</ref>
<ref id="B3">
<label>3</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Laydon</surname> <given-names>DJ</given-names>
</name>
<name>
<surname>Bangham</surname> <given-names>CRM</given-names>
</name>
<name>
<surname>Asquith</surname> <given-names>B</given-names>
</name>
</person-group>. <article-title>Estimating T-Cell Repertoire Diversity: Limitations of Classical Estimators and a New Approach</article-title>. <source>Phil Trans R Soc B</source> (<year>2015</year>) <volume>370</volume>:<fpage>20140291</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1098/rstb.2014.0291</pub-id>
</citation>
</ref>
<ref id="B4">
<label>4</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desponds</surname> <given-names>J</given-names>
</name>
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
</person-group>. <article-title>Fluctuating Fitness Shapes the Clone-Size Distribution of Immune Repertoires</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2016</year>) <volume>113</volume>:<page-range>274&#x2013;9</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.1512977112</pub-id>
</citation>
</ref>
<ref id="B5">
<label>5</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Desponds</surname> <given-names>J</given-names>
</name>
<name>
<surname>Mayer</surname> <given-names>A</given-names>
</name>
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
</person-group>. <article-title>Population Dynamics of Immune Repertoires</article-title>. In: <person-group person-group-type="author">
<name>
<surname>Molina-Par&#xed;s</surname> <given-names>C</given-names>
</name>
<name>
<surname>Lythe</surname> <given-names>G</given-names>
</name>
</person-group>, editors. <source>Mathematical, Computational and Experimental T Cell Immunology</source>. <publisher-loc>Cham, Switzerland</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2021</year>). p. <fpage>67</fpage>&#x2013;<lpage>79</lpage>.</citation>
</ref>
<ref id="B6">
<label>6</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lythe</surname> <given-names>G</given-names>
</name>
<name>
<surname>Callard</surname> <given-names>RE</given-names>
</name>
<name>
<surname>Hoare</surname> <given-names>RL</given-names>
</name>
<name>
<surname>Molina-Par&#xed;s</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>How Many TCR Clonotypes Does a Body Maintain</article-title>? <source>J Theor Biol</source> (<year>2016</year>) <volume>389</volume>:<page-range>214&#x2013;24</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.jtbi.2015.10.016</pub-id>
</citation>
</ref>
<ref id="B7">
<label>7</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>de Greef</surname> <given-names>PC</given-names>
</name>
<name>
<surname>Oakes</surname> <given-names>T</given-names>
</name>
<name>
<surname>Gerritsen</surname> <given-names>B</given-names>
</name>
<name>
<surname>Ismail</surname> <given-names>M</given-names>
</name>
<name>
<surname>Heather</surname> <given-names>JM</given-names>
</name>
<name>
<surname>Hermsen</surname> <given-names>R</given-names>
</name>
<etal/>
</person-group>. <article-title>The Naive T-Cell Receptor Repertoire has an Extremely Broad Distribution of Clone Sizes</article-title>. <source>eLife</source> (<year>2020</year>) <volume>9</volume>:<fpage>e49900</fpage>. doi: <pub-id pub-id-type="doi">10.7554/eLife.49900</pub-id>
</citation>
</ref>
<ref id="B8">
<label>8</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Koch</surname> <given-names>H</given-names>
</name>
<name>
<surname>Starenki</surname> <given-names>D</given-names>
</name>
<name>
<surname>Cooper</surname> <given-names>SJ</given-names>
</name>
<name>
<surname>Myers</surname> <given-names>RM</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Q</given-names>
</name>
</person-group>. <article-title>powerTCR: A Model-Based Approach to Comparative Analysis of the Clone Size Distribution of the T Cell Receptor Repertoire</article-title>. <source>PloS Comput Biol</source> (<year>2018</year>) <volume>14</volume>:<elocation-id>e1006571</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pcbi.1006571</pub-id>
</citation>
</ref>
<ref id="B9">
<label>9</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>den Braber</surname> <given-names>I</given-names>
</name>
<name>
<surname>Mugwagwa</surname> <given-names>T</given-names>
</name>
<name>
<surname>Vrisekoop</surname> <given-names>N</given-names>
</name>
<name>
<surname>Westera</surname> <given-names>L</given-names>
</name>
<name>
<surname>M&#xf6;gling</surname> <given-names>R</given-names>
</name>
<name>
<surname>Bregje de Boer</surname> <given-names>A</given-names>
</name>
<etal/>
</person-group>. <article-title>Maintenance of Peripheral Naive T Cells Is Sustained by Thymus Output in Mice But Not Humans</article-title>. <source>Immunity</source> (<year>2012</year>) <volume>36</volume>:<page-range>288&#x2013;97</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.immuni.2012.02.006</pub-id>
</citation>
</ref>
<ref id="B10">
<label>10</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dessalles</surname> <given-names>R</given-names>
</name>
<name>
<surname>D&#x2019;Orsogna</surname> <given-names>MR</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Exact Steady-State Distributions of Multispecies Birth-Death-Immigration Processes: Effects of Mutations and Carrying Capacity on Diversity</article-title>. <source>J Stat Phys</source> (<year>2018</year>) <volume>173</volume>:<fpage>182</fpage>&#x2013;<lpage>221</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10955-018-2128-4</pub-id>
</citation>
</ref>
<ref id="B11">
<label>11</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
<name>
<surname>Bialek</surname> <given-names>W</given-names>
</name>
<name>
<surname>Callan</surname> <given-names>CG</given-names>
</name>
</person-group>. <article-title>Maximum Entropy Models for Antibody Diversity</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2010</year>) <volume>107</volume>:<page-range>5405&#x2013;10</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1001705107</pub-id>
</citation>
</ref>
<ref id="B12">
<label>12</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oakes</surname> <given-names>T</given-names>
</name>
<name>
<surname>Heather</surname> <given-names>JM</given-names>
</name>
<name>
<surname>Best</surname> <given-names>K</given-names>
</name>
<name>
<surname>Byng-Maddick</surname> <given-names>R</given-names>
</name>
<name>
<surname>Husovsky</surname> <given-names>C</given-names>
</name>
<name>
<surname>Ismail</surname> <given-names>M</given-names>
</name>
<etal/>
</person-group>. <article-title>Quantitative Characterization of the T Cell Receptor Repertoire of Na&#xef;ve and Memory Subsets Using an Integrated Experimental and Computational Pipeline Which Is Robust, Economical, and Versatile</article-title>. <source>Front Immunol</source> (<year>2017</year>) <volume>8</volume>:<elocation-id>1267</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fimmu.2017.01267</pub-id>
</citation>
</ref>
<ref id="B13">
<label>13</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aguilera-Sandoval</surname> <given-names>CR</given-names>
</name>
<name>
<surname>OYang</surname> <given-names>O</given-names>
</name>
<name>
<surname>Jojic</surname> <given-names>N</given-names>
</name>
<name>
<surname>Lovato</surname> <given-names>P</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>DY</given-names>
</name>
<name>
<surname>Boechat</surname> <given-names>MI</given-names>
</name>
<etal/>
</person-group>. <article-title>Supranormal Thymic Output Up to Two Decades After HIV-1 Infection</article-title>. <source>AIDS (Lond Engl)</source> (<year>2016</year>) <volume>30</volume>:<page-range>701&#x2013;11</page-range>. doi: <pub-id pub-id-type="doi">10.1097/QAD.0000000000001010</pub-id>
</citation>
</ref>
<ref id="B14">
<label>14</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gerritsen</surname> <given-names>B</given-names>
</name>
<name>
<surname>Pandit</surname> <given-names>A</given-names>
</name>
<name>
<surname>Andeweg</surname> <given-names>AC</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>RJ</given-names>
</name>
</person-group>. <article-title>RTCR: A Pipeline for Complete and Accurate Recovery of T Cell Repertoires From High Throughput Sequencing Data</article-title>. <source>Bioinformatics</source> (<year>2016</year>) <volume>32</volume>:<page-range>3098&#x2013;106</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btw339</pub-id>
</citation>
</ref>
<ref id="B15">
<label>15</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Naumov</surname> <given-names>YN</given-names>
</name>
<name>
<surname>Naumova</surname> <given-names>EN</given-names>
</name>
<name>
<surname>Hogan</surname> <given-names>KT</given-names>
</name>
<name>
<surname>Selin</surname> <given-names>LK</given-names>
</name>
<name>
<surname>Gorski</surname> <given-names>J</given-names>
</name>
</person-group>. <article-title>A Fractal Clonotype Distribution in the CD8+ Memory T Cell Repertoire Could Optimize Potential for Immune Responses</article-title>. <source>J Immunol</source> (<year>2003</year>) <volume>170</volume>:<fpage>3994</fpage>&#x2013;<lpage>4001</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.4049/jimmunol.170.8.3994</pub-id>
</citation>
</ref>
<ref id="B16">
<label>16</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meier</surname> <given-names>J</given-names>
</name>
<name>
<surname>Roberts</surname> <given-names>C</given-names>
</name>
<name>
<surname>Avent</surname> <given-names>K</given-names>
</name>
<name>
<surname>Archer</surname> <given-names>KJ</given-names>
</name>
<name>
<surname>Manjili</surname> <given-names>MH</given-names>
</name>
<name>
<surname>Toor</surname> <given-names>AA</given-names>
</name>
</person-group>. <article-title>Fractal Organization of the Human T Cell Repertoire in Health and After Stem Cell Transplantation</article-title>. <source>Biol Blood Marrow Transplant</source> (<year>2013</year>) <volume>19</volume>:<page-range>366&#x2013;77</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.bbmt.2012.12.004</pub-id>
</citation>
</ref>
<ref id="B17">
<label>17</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gaimann</surname> <given-names>MU</given-names>
</name>
<name>
<surname>Nguyen</surname> <given-names>M</given-names>
</name>
<name>
<surname>Desponds</surname> <given-names>J</given-names>
</name>
<name>
<surname>Mayer</surname> <given-names>A</given-names>
</name>
</person-group>. <article-title>Early Life Imprints the Hierarchy of T Cell Clone Sizes</article-title>. <source>eLife</source> (<year>2020</year>) <volume>9</volume>:<fpage>e61639</fpage>. doi: <pub-id pub-id-type="doi">10.7554/eLife.61639</pub-id>
</citation>
</ref>
<ref id="B18">
<label>18</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Burgos</surname> <given-names>JD</given-names>
</name>
<name>
<surname>Moreno-Tovar</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>Zipf-Scaling Behavior in the Immune System</article-title>. <source>Biosystems</source> (<year>1996</year>) <volume>39</volume>:<page-range>227&#x2013;32</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0303-2647(96)01618-8</pub-id>
</citation>
</ref>
<ref id="B19">
<label>19</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weinstein</surname> <given-names>JA</given-names>
</name>
<name>
<surname>Jiang</surname> <given-names>N</given-names>
</name>
<name>
<surname>White</surname> <given-names>RA</given-names>
</name>
<name>
<surname>Fisher</surname> <given-names>DS</given-names>
</name>
<name>
<surname>Quake</surname> <given-names>SR</given-names>
</name>
</person-group>. <article-title>High-Throughput Sequencing of the Zebrafish Antibody Repertoire</article-title>. <source>Science</source> (<year>2009</year>) <volume>324</volume>:<page-range>807&#x2013;10</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1126/science.1170020</pub-id>
</citation>
</ref>
<ref id="B20">
<label>20</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marcou</surname> <given-names>Q</given-names>
</name>
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
</person-group>. <article-title>High-Throughput Immune Repertoire Analysis With IGoR</article-title>. <source>Nat Commun</source> (<year>2018</year>) <volume>9</volume>:<fpage>561</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s41467-018-02832-w</pub-id>
</citation>
</ref>
<ref id="B21">
<label>21</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tan</surname> <given-names>JT</given-names>
</name>
<name>
<surname>Dudl</surname> <given-names>E</given-names>
</name>
<name>
<surname>LeRoy</surname> <given-names>E</given-names>
</name>
<name>
<surname>Murray</surname> <given-names>R</given-names>
</name>
<name>
<surname>Sprent</surname> <given-names>J</given-names>
</name>
<name>
<surname>Weinberg</surname> <given-names>KI</given-names>
</name>
<etal/>
</person-group>. <article-title>IL-7 Is Critical for Homeostatic Proliferation and Survival of Na&#xef;ve T Cells</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2001</year>) <volume>98</volume>:<page-range>8732&#x2013;7</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.161126098</pub-id>
</citation>
</ref>
<ref id="B22">
<label>22</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schluns</surname> <given-names>KS</given-names>
</name>
<name>
<surname>Kieper</surname> <given-names>WC</given-names>
</name>
<name>
<surname>Jameson</surname> <given-names>SC</given-names>
</name>
<name>
<surname>Lefran&#xe7;ois</surname> <given-names>L</given-names>
</name>
</person-group>. <article-title>Interleukin-7 Mediates the Homeostasis of Na&#xef;ve and Memory CD8 T Cells <italic>In Vivo</italic>
</article-title>. <source>Nat Immunol</source> (<year>2000</year>) <volume>1</volume>:<page-range>426&#x2013;32</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/80868</pub-id>
</citation>
</ref>
<ref id="B23">
<label>23</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ciupe</surname> <given-names>SM</given-names>
</name>
<name>
<surname>Devlin</surname> <given-names>BH</given-names>
</name>
<name>
<surname>Markert</surname> <given-names>ML</given-names>
</name>
<name>
<surname>Kepler</surname> <given-names>TB</given-names>
</name>
</person-group>. <article-title>The Dynamics of T-Cell Receptor Repertoire Diversity Following Thymus Transplantation for DiGeorge Anomaly</article-title>. <source>PloS Comput Biol</source> (<year>2009</year>) <volume>5</volume>:<elocation-id>e1000396</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pcbi.1000396</pub-id>
</citation>
</ref>
<ref id="B24">
<label>24</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reynolds</surname> <given-names>J</given-names>
</name>
<name>
<surname>Coles</surname> <given-names>M</given-names>
</name>
<name>
<surname>Lythe</surname> <given-names>G</given-names>
</name>
<name>
<surname>Molina-Par&#xed;s</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>Mathematical Model of Naive T Cell Division and Survival IL-7 Thresholds</article-title>. <source>Front Immunol</source> (<year>2013</year>) <volume>4</volume>:<elocation-id>434</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fimmu.2013.00434</pub-id>
</citation>
</ref>
<ref id="B25">
<label>25</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Silva</surname> <given-names>SL</given-names>
</name>
<name>
<surname>Sousa</surname> <given-names>AE</given-names>
</name>
</person-group>. <article-title>Establishment and Maintenance of the Human Na&#xef;ve Cd4+ T-Cell Compartment</article-title>. <source>Front Pediatr</source> (<year>2016</year>) <volume>4</volume>:<elocation-id>119</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fped.2016.00119</pub-id>
</citation>
</ref>
<ref id="B26">
<label>26</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Surh</surname> <given-names>CD</given-names>
</name>
<name>
<surname>Sprent</surname> <given-names>J</given-names>
</name>
</person-group>. <article-title>Homeostasis of Naive and Memory T Cells</article-title>. <source>Immunity</source> (<year>2008</year>) <volume>29</volume>:<page-range>848&#x2013;62</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.immuni.2008.11.002</pub-id>
</citation>
</ref>
<ref id="B27">
<label>27</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Farber</surname> <given-names>DL</given-names>
</name>
<name>
<surname>Yudanin</surname> <given-names>NA</given-names>
</name>
<name>
<surname>Restifo</surname> <given-names>NP</given-names>
</name>
</person-group>. <article-title>Human Memory T Cells: Generation, Compartmentalization and Homeostasis</article-title>. <source>Nat Rev Immunol</source> (<year>2014</year>) <volume>14</volume>:<fpage>24</fpage>&#x2013;<lpage>35</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nri3567</pub-id>
</citation>
</ref>
<ref id="B28">
<label>28</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sethna</surname> <given-names>Z</given-names>
</name>
<name>
<surname>Elhanati</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Callan</surname> <given-names>CG</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>OLGA: Fast Computation of Generation Probabilities of B- and T-Cell Receptor Amino Acid Sequences and Motifs</article-title>. <source>Bioinformatics</source> (<year>2019</year>) <volume>35</volume>:<page-range>2974&#x2013;81</page-range>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btz035</pub-id>
</citation>
</ref>
<ref id="B29">
<label>29</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Westera</surname> <given-names>L</given-names>
</name>
<name>
<surname>van Hoeven</surname> <given-names>V</given-names>
</name>
<name>
<surname>Drylewicz</surname> <given-names>J</given-names>
</name>
<name>
<surname>Spierenburg</surname> <given-names>G</given-names>
</name>
<name>
<surname>van Velzen</surname> <given-names>JF</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>RJ</given-names>
</name>
<etal/>
</person-group>. <article-title>Lymphocyte Maintenance During Healthy Aging Requires No Substantial Alterations in Cellular Turnover</article-title>. <source>Aging Cell</source> (<year>2015</year>) <volume>14</volume>:<page-range>219&#x2013;27</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/acel.12311</pub-id>
</citation>
</ref>
<ref id="B30">
<label>30</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robins</surname> <given-names>HS</given-names>
</name>
<name>
<surname>Campregher</surname> <given-names>PV</given-names>
</name>
<name>
<surname>Srivastava</surname> <given-names>SK</given-names>
</name>
<name>
<surname>Wacher</surname> <given-names>A</given-names>
</name>
<name>
<surname>Turtle</surname> <given-names>CJ</given-names>
</name>
<name>
<surname>Kahsai</surname> <given-names>O</given-names>
</name>
<etal/>
</person-group>. <article-title>Comprehensive Assessment of T-Cell Receptor <italic>&#x3b2;.</italic>-Chain Diversity in <italic>&#x3b1;&#x3b2;</italic> T Cells</article-title>. <source>Blood</source> (<year>2009</year>) <volume>114</volume>:<page-range>4099&#x2013;107</page-range>. doi: <pub-id pub-id-type="doi">10.1182/blood-2009-04-217604</pub-id>
</citation>
</ref>
<ref id="B31">
<label>31</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mayer</surname> <given-names>A</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Perelson</surname> <given-names>AS</given-names>
</name>
<name>
<surname>Wingreen</surname> <given-names>NS</given-names>
</name>
</person-group>. <article-title>Regulation of T Cell Expansion by Antigen Presentation Dynamics</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2019</year>) <volume>116</volume>:<page-range>5914&#x2013;9</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1812800116</pub-id>
</citation>
</ref>
<ref id="B32">
<label>32</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Arstila</surname> <given-names>TP</given-names>
</name>
<name>
<surname>Casrouge</surname> <given-names>A</given-names>
</name>
<name>
<surname>Baron</surname> <given-names>V</given-names>
</name>
<name>
<surname>Even</surname> <given-names>J</given-names>
</name>
<name>
<surname>Kanellopoulos</surname> <given-names>J</given-names>
</name>
<name>
<surname>Kourilsky</surname> <given-names>P</given-names>
</name>
</person-group>. <article-title>A Direct Estimate of the Human <italic>&#x3b1;&#x3b2;</italic> T Cell Receptor Diversity</article-title>. <source>Science</source> (<year>1999</year>) <volume>286</volume>:<page-range>958&#x2013;61</page-range>. doi: <pub-id pub-id-type="doi">10.1126/science.286.5441.958</pub-id>
</citation>
</ref>
<ref id="B33">
<label>33</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Warren</surname> <given-names>RL</given-names>
</name>
<name>
<surname>Freeman</surname> <given-names>JD</given-names>
</name>
<name>
<surname>Zeng</surname> <given-names>T</given-names>
</name>
<name>
<surname>Choe</surname> <given-names>G</given-names>
</name>
<name>
<surname>Munro</surname> <given-names>S</given-names>
</name>
<name>
<surname>Moore</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>Exhaustive T-Cell Repertoire Sequencing of Human Peripheral Blood Samples Reveals Signatures of Antigen Selection and a Directly Measured Repertoire Size of at Least 1 Million Clonotypes</article-title>. <source>Genome Res</source> (<year>2011</year>) <volume>21</volume>:<page-range>790&#x2013;7</page-range>. doi: <pub-id pub-id-type="doi">10.1101/gr.115428.110</pub-id>
</citation>
</ref>
<ref id="B34">
<label>34</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zarnitsyna</surname> <given-names>V</given-names>
</name>
<name>
<surname>Evavold</surname> <given-names>B</given-names>
</name>
<name>
<surname>Schoettle</surname> <given-names>L</given-names>
</name>
<name>
<surname>Blattman</surname> <given-names>J</given-names>
</name>
<name>
<surname>Antia</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>Estimating the Diversity, Completeness, and Cross-Reactivity of the T Cell Repertoire</article-title>. <source>Front Immunol</source> (<year>2013</year>) <volume>4</volume>:<elocation-id>485</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fimmu.2013.00485</pub-id>
</citation>
</ref>
<ref id="B35">
<label>35</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jenkins</surname> <given-names>MK</given-names>
</name>
<name>
<surname>Chu</surname> <given-names>HH</given-names>
</name>
<name>
<surname>McLachlan</surname> <given-names>JB</given-names>
</name>
<name>
<surname>Moon</surname> <given-names>JJ</given-names>
</name>
</person-group>. <article-title>On the Composition of the Preimmune Repertoire of T Cells Specific for Peptide&#x2013;Major Histocompatibility Complex Ligands</article-title>. <source>Annu Rev Immunol</source> (<year>2010</year>) <volume>28</volume>:<page-range>275&#x2013;94</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.1146/annurev-immunol-030409-101253</pub-id>
</citation>
</ref>
<ref id="B36">
<label>36</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
</person-group>. <article-title>How Many Different Clonotypes do Immune Repertoires Contain</article-title>? <source>Curr Opin Syst Biol</source> (<year>2019</year>) <volume>18</volume>:<page-range>104&#x2013;10</page-range>. doi: <pub-id pub-id-type="doi">10.1016/j.coisb.2019.10.001</pub-id>
</citation>
</ref>
<ref id="B37">
<label>37</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Murphy</surname> <given-names>K</given-names>
</name>
<name>
<surname>Weaver</surname> <given-names>C</given-names>
</name>
</person-group>. <source>Janeway&#x2019;s Immunobiology</source>. <publisher-loc>New York NY</publisher-loc>: <publisher-name>Garland Science/Taylor &amp; Francis</publisher-name> (<year>2016</year>).</citation>
</ref>
<ref id="B38">
<label>38</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goyal</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>S</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>ISY</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Mechanisms of Blood Homeostasis: Lineage Tracking and a Neutral Model of Cell Populations in Rhesus Macaques</article-title>. <source>BMC Biol</source> (<year>2015</year>) <volume>13</volume>:<fpage>85</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12915-015-0191-8</pub-id>
</citation>
</ref>
<ref id="B39">
<label>39</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname> <given-names>S</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Immigration-Induced Phase Transition in a Regulated Multispecies Birth-Death Process</article-title>. <source>J Phys A: Math Theor</source> (<year>2018</year>) <volume>51</volume>:<fpage>425602</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1088/1751-8121/aadcb4</pub-id>
</citation>
</ref>
<ref id="B40">
<label>40</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wiuf</surname> <given-names>C</given-names>
</name>
<name>
<surname>Stumpf</surname> <given-names>MPH</given-names>
</name>
</person-group>. <article-title>Binomial Subsampling</article-title>. <source>Proc R Soc A</source> (<year>2006</year>) <volume>462</volume>:<page-range>1181&#x2013;95</page-range>. doi: <pub-id pub-id-type="doi">10.1098/rspa.2005.1622</pub-id>
</citation>
</ref>
<ref id="B41">
<label>41</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname> <given-names>S</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>S</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>ISY</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Modeling Large Fluctuations of Thousands of Clones During Hematopoiesis: The Role of Stem Cell Self-Renewal and Bursty Progenitor Dynamics in Rhesus Macaque</article-title>. <source>PloS Comput Biol</source> (<year>2018</year>) <volume>14</volume>:<fpage>e1006489</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pcbi.1006489</pub-id>
</citation>
</ref>
<ref id="B42">
<label>42</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levina</surname> <given-names>A</given-names>
</name>
<name>
<surname>Priesemann</surname> <given-names>V</given-names>
</name>
</person-group>. <article-title>Subsampling Scaling</article-title>. <source>Nat Commun</source> (<year>2017</year>) <volume>8</volume>:<fpage>15140</fpage>. doi: <pub-id pub-id-type="doi">10.1038/ncomms15140</pub-id>
</citation>
</ref>
<ref id="B43">
<label>43</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Ferrarini</surname> <given-names>M</given-names>
</name>
<name>
<surname>Molina-Par&#xed;s</surname> <given-names>C</given-names>
</name>
<name>
<surname>Lythe</surname> <given-names>G</given-names>
</name>
</person-group>. <article-title>Sampling From T Cell Receptor Repertoires</article-title>. In: <person-group person-group-type="author">
<name>
<surname>Graw</surname> <given-names>F</given-names>
</name>
<name>
<surname>Franziska Matth&#xe4;us</surname> <given-names>JP</given-names>
</name>
</person-group>, editors. <source>Modeling Cellular Systems</source>. <publisher-loc>Cham, Switzerland</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name> (<year>2017</year>). p. <fpage>67</fpage>&#x2013;<lpage>79</lpage>.</citation>
</ref>
<ref id="B44">
<label>44</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lythe</surname> <given-names>G</given-names>
</name>
<name>
<surname>Molina-Par&#xed;s</surname> <given-names>C</given-names>
</name>
</person-group>. <article-title>Some Deterministic and Stochastic Mathematical Models of Naive T-Cell Homeostasis</article-title>. <source>Immunol Rev</source> (<year>2018</year>) <volume>285</volume>:<page-range>206&#x2013;17</page-range>. doi: <pub-id pub-id-type="doi">10.1111/imr.12696</pub-id>
</citation>
</ref>
<ref id="B45">
<label>45</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Min</surname> <given-names>B</given-names>
</name>
<name>
<surname>Foucras</surname> <given-names>G</given-names>
</name>
<name>
<surname>Meier-Schellersheim</surname> <given-names>M</given-names>
</name>
<name>
<surname>Paul</surname> <given-names>WE</given-names>
</name>
</person-group>. <article-title>Spontaneous Proliferation, a Response of Na&#xef;ve CD4 T Cells Determined by the Diversity of the Memory Cell Repertoire</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2004</year>) <volume>101</volume>:<page-range>3874&#x2013;9</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.0400606101</pub-id>
</citation>
</ref>
<ref id="B46">
<label>46</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davis</surname> <given-names>MM</given-names>
</name>
<name>
<surname>Bjorkman</surname> <given-names>PJ</given-names>
</name>
</person-group>. <article-title>T-Cell Antigen Receptor Genes and T-Cell Recognition</article-title>. <source>Nature</source> (<year>1988</year>) <volume>334</volume>:<fpage>395</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/334395a0</pub-id>
</citation>
</ref>
<ref id="B47">
<label>47</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Clark</surname> <given-names>DR</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>RJ</given-names>
</name>
<name>
<surname>Wolthers</surname> <given-names>KC</given-names>
</name>
<name>
<surname>Miedema</surname> <given-names>F</given-names>
</name>
</person-group>. <article-title>T Cell Dynamics in HIV-1 Infection</article-title>. <source>Adv Immunol</source> (<year>1999</year>) <volume>73</volume>:<page-range>301&#x2013;27</page-range>. doi: <pub-id pub-id-type="doi">10.1016/S0065-2776(08)60789-0</pub-id>
</citation>
</ref>
<ref id="B48">
<label>48</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ye</surname> <given-names>P</given-names>
</name>
<name>
<surname>Kirschner</surname> <given-names>DE</given-names>
</name>
</person-group>. <article-title>Reevaluation of T Cell Receptor Excision Circles as a Measureof Human Recent Thymic Emigrants</article-title>. <source>J Immunol</source> (<year>2002</year>) <volume>168</volume>:<page-range>4968&#x2013;79</page-range>. doi: <pub-id pub-id-type="doi">10.4049/jimmunol.168.10.4968</pub-id>
</citation>
</ref>
<ref id="B49">
<label>49</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hazenberg</surname> <given-names>MD</given-names>
</name>
<name>
<surname>Borghans</surname> <given-names>JAM</given-names>
</name>
<name>
<surname>de Boer</surname> <given-names>RJ</given-names>
</name>
<name>
<surname>Miedema</surname> <given-names>F</given-names>
</name>
</person-group>. <article-title>Thymic Output: A Bad TREC Record</article-title>. <source>Nat Immunol</source> (<year>2003</year>) <volume>4</volume>:<page-range>97&#x2013;9</page-range>. doi: <pub-id pub-id-type="doi">10.1038/ni0203-97</pub-id>
</citation>
</ref>
<ref id="B50">
<label>50</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bains</surname> <given-names>I</given-names>
</name>
<name>
<surname>Thi&#xe9;baut</surname> <given-names>R</given-names>
</name>
<name>
<surname>Yates</surname> <given-names>AJ</given-names>
</name>
<name>
<surname>Callard</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>Quantifying Thymic Export: Combining Models of Naive T Cell Proliferation and TCR Excision Circle Dynamics Gives an Explicit Measure of Thymic Output</article-title>. <source>J Immunol</source> (<year>2009</year>) <volume>183</volume>:<page-range>4329&#x2013;36</page-range>. doi: <pub-id pub-id-type="doi">10.4049/jimmunol.0900743</pub-id>
</citation>
</ref>
<ref id="B51">
<label>51</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cunningham</surname> <given-names>CA</given-names>
</name>
<name>
<surname>Helm</surname> <given-names>EY</given-names>
</name>
<name>
<surname>Fink</surname> <given-names>PJ</given-names>
</name>
</person-group>. <article-title>Reinterpreting Recent Thymic Emigrant Function: Defective or Adaptive</article-title>? <source>Curr Opin Immunol</source> (<year>2018</year>) <volume>51</volume>:<fpage>1</fpage>&#x2013;<lpage>6</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.coi.2017.12.006</pub-id>
</citation>
</ref>
<ref id="B52">
<label>52</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elhanati</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Murugan</surname> <given-names>A</given-names>
</name>
<name>
<surname>Callan</surname> <given-names>CG</given-names> <suffix>Jr</suffix>
</name>
<name>
<surname>Mora</surname> <given-names>T</given-names>
</name>
<name>
<surname>Walczak</surname> <given-names>AM</given-names>
</name>
</person-group>. <article-title>Quantifying Selection in Immune Receptor Repertoires</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2014</year>) <volume>111</volume>:<page-range>9875&#x2013;80</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1409572111</pub-id>
</citation>
</ref>
<ref id="B53">
<label>53</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hardin</surname> <given-names>G</given-names>
</name>
</person-group>. <article-title>The Competitive Exclusion Principle</article-title>. <source>Science</source> (<year>1960</year>) <volume>131</volume>:<page-range>1292&#x2013;7</page-range>. doi: <pub-id pub-id-type="doi">10.1126/science.131.3409.1292</pub-id>
</citation>
</ref>
<ref id="B54">
<label>54</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hutchinson</surname> <given-names>GE</given-names>
</name>
</person-group>. <article-title>The Paradox of the Plankton</article-title>. <source>Am Nat</source> (<year>1961</year>) <volume>95</volume>:<page-range>137&#x2013;45</page-range>. doi: <pub-id pub-id-type="doi">10.1086/282171</pub-id>
</citation>
</ref>
<ref id="B55">
<label>55</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hogan</surname> <given-names>T</given-names>
</name>
<name>
<surname>Gossel</surname> <given-names>G</given-names>
</name>
<name>
<surname>Yates</surname> <given-names>AJ</given-names>
</name>
<name>
<surname>Seddon</surname> <given-names>S</given-names>
</name>
</person-group>. <article-title>Temporal Fate Mapping Reveals Age-Structured Heterogeneity in Naive CD4 and CD8 T Lymphocyte Populations in Mice</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2015</year>) <volume>112</volume>:<page-range>E6917&#x2013;26</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1517246112</pub-id>
</citation>
</ref>
<ref id="B56">
<label>56</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rane</surname> <given-names>S</given-names>
</name>
<name>
<surname>Hogan</surname> <given-names>T</given-names>
</name>
<name>
<surname>Seddon</surname> <given-names>B</given-names>
</name>
<name>
<surname>Yates</surname> <given-names>AJ</given-names>
</name>
</person-group>. <article-title>Age Is Not Just a Number: Naive T Cells Increase Their Ability to Persist in the Circulation Over Time</article-title>. <source>PloS Comput Biol</source> (<year>2018</year>) <volume>16</volume>:<fpage>e2003949</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pbio.2003949</pub-id>
</citation>
</ref>
<ref id="B57">
<label>57</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lewkiewicz</surname> <given-names>S</given-names>
</name>
<name>
<surname>Chuang</surname> <given-names>YL</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>A Mathematical Model of the Effects of Aging on Naive T Cell Populations and Diversity</article-title>. <source>Bull Math Biol</source> (<year>2019</year>) <volume>81</volume>:<page-range>2783&#x2013;817</page-range>. doi: <pub-id pub-id-type="doi">10.1007/s11538-019-00630-z</pub-id>
</citation>
</ref>
<ref id="B58">
<label>58</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname> <given-names>P</given-names>
</name>
<name>
<surname>Yates</surname> <given-names>A</given-names>
</name>
<name>
<surname>Goronzy</surname> <given-names>J</given-names>
</name>
<name>
<surname>Antia</surname> <given-names>R</given-names>
</name>
</person-group>. <article-title>Peripheral Selection Rather Than Thymic Involution Explains Sudden Contraction in Naive CD4 T-Cell Diversity With Age</article-title>. <source>Proc Natl Acad Sci USA</source> (<year>2012</year>) <volume>109</volume>:<page-range>21432&#x2013;7</page-range>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1209283110</pub-id>
</citation>
</ref>
<ref id="B59">
<label>59</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Britanova</surname> <given-names>OV</given-names>
</name>
<name>
<surname>Shugay</surname> <given-names>M</given-names>
</name>
<name>
<surname>Merzlyak</surname> <given-names>EM</given-names>
</name>
<name>
<surname>Staroverov</surname> <given-names>DB</given-names>
</name>
<name>
<surname>Putintseva</surname> <given-names>EV</given-names>
</name>
<name>
<surname>Turchaninova</surname> <given-names>MA</given-names>
</name>
<etal/>
</person-group>. <article-title>Dynamics of Individual T Cell Repertoires: From Cord Blood to Centenarians</article-title>. <source>J Immunol</source> (<year>2016</year>) <volume>196</volume>:<page-range>5005&#x2013;13</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.4049/jimmunol.1600005</pub-id>
</citation>
</ref>
<ref id="B60">
<label>60</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Egorov</surname> <given-names>ES</given-names>
</name>
<name>
<surname>Kasatskaya</surname> <given-names>SA</given-names>
</name>
<name>
<surname>Zubov</surname> <given-names>VN</given-names>
</name>
<name>
<surname>Izraelson</surname> <given-names>M</given-names>
</name>
<name>
<surname>Nakonechnaya</surname> <given-names>TO</given-names>
</name>
<name>
<surname>Staroverov</surname> <given-names>DB</given-names>
</name>
<etal/>
</person-group>. <article-title>The Changing Landscape of Naive T Cell Receptor Repertoire With Human Aging</article-title>. <source>Front Immunol</source> (<year>2018</year>) <volume>9</volume>:<elocation-id>1618</elocation-id>. doi: <pub-id pub-id-type="doi">10.3389/fimmu.2018.01618</pub-id>
</citation>
</ref>
<ref id="B61">
<label>61</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lewkiewicz</surname> <given-names>SM</given-names>
</name>
<name>
<surname>Chuang</surname> <given-names>YL</given-names>
</name>
<name>
<surname>Chou</surname> <given-names>T</given-names>
</name>
</person-group>. <article-title>Dynamics of T Cell Receptor Distributions Following Acute Thymic Atrophy and Resumption</article-title>. <source>Math Biosci Eng</source> (<year>2020</year>) <volume>17</volume>:<fpage>28</fpage>&#x2013;<lpage>55</lpage>. doi: <pub-id pub-id-type="doi">10.3934/mbe.2020002</pub-id>
</citation>
</ref>
<ref id="B62">
<label>62</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mold</surname> <given-names>JE</given-names>
</name>
<name>
<surname>R&#xe9;u</surname> <given-names>P</given-names>
</name>
<name>
<surname>Olin</surname> <given-names>A</given-names>
</name>
<name>
<surname>Bernard</surname> <given-names>S</given-names>
</name>
<name>
<surname>Micha&#xeb;lsson</surname> <given-names>J</given-names>
</name>
<name>
<surname>Rane</surname> <given-names>S</given-names>
</name>
<etal/>
</person-group>. <article-title>Cell Generation Dynamics Underlying Naive T-Cell Homeostasis in Adult Humans</article-title>. <source>PloS Biol</source> (<year>2019</year>) <volume>17</volume>:<fpage>e3000383</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pbio.3000383</pub-id>
</citation>
</ref>
<ref id="B63">
<label>63</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van den Broek</surname> <given-names>T</given-names>
</name>
<name>
<surname>Delemarre</surname> <given-names>EM</given-names>
</name>
<name>
<surname>Janssen</surname> <given-names>WJ</given-names>
</name>
<name>
<surname>Nievelstein</surname> <given-names>RA</given-names>
</name>
<name>
<surname>Broen</surname> <given-names>JC</given-names>
</name>
<name>
<surname>Tesselaar</surname> <given-names>K</given-names>
</name>
<etal/>
</person-group>. <article-title>Neonatal Thymectomy Reveals Differentiation and Plasticity Within Human Naive T Cells</article-title>. <source>J Clin Invest</source> (<year>2016</year>) <volume>126</volume>:<page-range>1126&#x2013;36</page-range>. doi: <pub-id pub-id-type="doi">10.1172/JCI84997</pub-id>
</citation>
</ref>
<ref id="B64">
<label>64</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thome</surname> <given-names>JJC</given-names>
</name>
<name>
<surname>Grinshpun</surname> <given-names>B</given-names>
</name>
<name>
<surname>Kumar</surname> <given-names>BV</given-names>
</name>
<name>
<surname>Kubota</surname> <given-names>M</given-names>
</name>
<name>
<surname>Ohmura</surname> <given-names>Y</given-names>
</name>
<name>
<surname>Lerner</surname> <given-names>H</given-names>
</name>
<etal/>
</person-group>. <article-title>Long-Term Maintenance of Human Naive T Cells Through <italic>In Situ</italic> Homeostasis in Lymphoid Tissue Sites</article-title>. <source>Sci Immunol</source> (<year>2016</year>) <volume>1</volume>:<fpage>eaah6506</fpage>. doi: <pub-id pub-id-type="doi">10.1126/sciimmunol.aah6506</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>