<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2017.01418</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Human Retrotransposon Insertion Polymorphisms Are Associated with Health and Disease via Gene Regulatory Phenotypes</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Lu</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/390646/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Norris</surname> <given-names>Emily T.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/461132/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Jordan</surname> <given-names>I. K.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/372301/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Biological Sciences, Georgia Institute of Technology, Atlanta</institution> <country>GA, United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>PanAmerican Bioinformatics Institute</institution> <country>Cali, Colombia</country></aff>
<aff id="aff3"><sup>3</sup><institution>Applied Bioinformatics Laboratory, Atlanta</institution> <country>GA, United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Guenther Witzany, Telos - Philosophische Praxis, Austria</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Dixie Mager, BC Cancer Agency, Canada; Jianwei Wang, China Academy of Chinese Medical Sciences, China</italic></p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x002A;Correspondence: <italic>I. K. Jordan, <email>king.jordan@biology.gatech.edu</email></italic></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Virology, a section of the journal Frontiers in Microbiology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>08</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>1418</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>05</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>07</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2017 Wang, Norris and Jordan.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Wang, Norris and Jordan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The human genome hosts several active families of transposable elements (TEs), including the Alu, LINE-1, and SVA retrotransposons that are mobilized via reverse transcription of RNA intermediates. We evaluated how insertion polymorphisms generated by human retrotransposon activity may be related to common health and disease phenotypes that have been previously interrogated through genome-wide association studies (GWAS). To address this question, we performed a genome-wide screen for retrotransposon polymorphism disease associations that are linked to TE induced gene regulatory changes. Our screen first identified polymorphic retrotransposon insertions found in linkage disequilibrium (LD) with single nucleotide polymorphisms that were previously associated with common complex diseases by GWAS. We further narrowed this set of candidate disease associated retrotransposon polymorphisms by identifying insertions that are located within tissue-specific enhancer elements. We then performed expression quantitative trait loci analysis on the remaining set of candidates in order to identify polymorphic retrotransposon insertions that are associated with gene expression changes in B-cells of the human immune system. This progressive and stringent screen yielded a list of six retrotransposon insertions as the strongest candidates for TE polymorphisms that lead to disease via enhancer-mediated changes in gene regulation. For example, we found an SVA insertion within a cell-type specific enhancer located in the second intron of the <italic>B4GALT1</italic> gene. <italic>B4GALT1</italic> encodes a glycosyltransferase that functions in the glycosylation of the Immunoglobulin G (IgG) antibody in such a way as to convert its activity from pro- to anti-inflammatory. The disruption of the <italic>B4GALT1</italic> enhancer by the SVA insertion is associated with down-regulation of the gene in B-cells, which would serve to keep the IgG molecule in a pro-inflammatory state. Consistent with this idea, the <italic>B4GALT1</italic> enhancer SVA insertion is linked to a genomic region implicated by GWAS in both inflammatory conditions and autoimmune diseases, such as systemic lupus erythematosus and Crohn&#x2019;s disease. We explore this example and the other cases uncovered by our genome-wide screen in an effort to illuminate how retrotransposon insertion polymorphisms can impact human health and disease by causing changes in gene expression.</p>
</abstract>
<kwd-group>
<kwd>transposable elements</kwd>
<kwd>retrotransposons</kwd>
<kwd>Alu</kwd>
<kwd>L1</kwd>
<kwd>SVA</kwd>
<kwd>gene expression</kwd>
<kwd>gene regulation</kwd>
<kwd>GWAS</kwd>
</kwd-group>
<counts>
<fig-count count="7"/>
<table-count count="1"/>
<equation-count count="1"/>
<ref-count count="56"/>
<page-count count="13"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>At least one half of the human genome sequence is derived from the replication and insertion of retrotransposons &#x2013; RNA agents that transpose among chromosomal locations via the reverse transcription of RNA intermediates (<xref ref-type="bibr" rid="B26">Lander et al., 2001</xref>; <xref ref-type="bibr" rid="B14">de Koning et al., 2011</xref>). The vast majority of retrotransposon-related sequences in the human genome are derived from ancient insertion events and are no longer capable of transposition. Nevertheless, there are several families of human retrotransposons that remain active. The most abundant active families of human retrotransposons are the Alu (<xref ref-type="bibr" rid="B4">Batzer and Deininger, 1991</xref>; <xref ref-type="bibr" rid="B5">Batzer et al., 1991</xref>), LINE-1 (L1) (<xref ref-type="bibr" rid="B24">Kazazian et al., 1988</xref>; <xref ref-type="bibr" rid="B7">Brouha et al., 2003</xref>), and SVA (<xref ref-type="bibr" rid="B33">Ostertag et al., 2003</xref>; <xref ref-type="bibr" rid="B50">Wang et al., 2005</xref>) retrotransposons; recent evidence indicates that a smaller number of HERV-K endogenous retroviruses also remain capable of transposition (<xref ref-type="bibr" rid="B55">Wildschutte et al., 2016</xref>).</p>
<p>Sequences from active retrotransposon families generate insertional polymorphisms within and between human populations by means of germline transposition events. In this way, ongoing retrotranspositional activity of these RNA agents serves as an important source of human genetic variation. Retrotransposons are further distinguished by the fact that they are known to impact the regulation of human genes in a number of different ways (<xref ref-type="bibr" rid="B16">Feschotte, 2008</xref>; <xref ref-type="bibr" rid="B38">Rebollo et al., 2012</xref>; <xref ref-type="bibr" rid="B9">Chuong et al., 2017</xref>). Nevertheless, the joint phenotypic implications of retrotransposon generated human genetic variation, coupled with their capacity for genome regulation, have yet to be fully explored. We previously studied the implications of somatic retrotransposition for the etiology of cancer vis a vis retrotransposon induced regulatory changes in tumor suppressor genes (<xref ref-type="bibr" rid="B11">Clayton et al., 2016</xref>). For the current study, we were curious to understand how insertion polymorphisms generated by human retrotransposon activity may be related to commonly expressed health and disease phenotypes.</p>
<p>In one sense, a link between retrotransposon activity and disease is already well established. Active human retrotransposons were originally discovered due to the deleterious effects of element insertions (<xref ref-type="bibr" rid="B24">Kazazian et al., 1988</xref>). There are 124 genetic diseases that have been demonstrated to be caused by retrotransposon insertions, including cystic fibrosis (Alu), hemophilia A (L1) and X-linked dystonia-parkinsonism (SVA) (<xref ref-type="bibr" rid="B19">Hancks and Kazazian, 2012</xref>; <xref ref-type="bibr" rid="B18">Hancks and Kazazian, 2016</xref>). However, these cases represent so-called Mendelian diseases caused by very deleterious mutations that are expressed with high penetrance. Disease causing mutations of this kind are extremely rare and do not segregate among populations as common genetic polymorphisms. Complex multi-factorial diseases, on the other hand, are associated with more common genetic variants that exert their effects in a probabilistic as opposed to a deterministic manner. The contribution of common retrotransposon polymorphisms to complex health and disease related phenotypes has yet to be systematically explored.</p>
<p>Given the known connection between retrotransposon activity and genetic disease, we hypothesized that retrotransposon insertion polymorphisms may also contribute to inter-individual phenotypic differences that are associated with common diseases that have complex, multi-factorial genetic etiology. Since we previously showed that retrotransposon insertions contribute to inter-individual and population-specific differences in human gene regulation (<xref ref-type="bibr" rid="B52">Wang L. et al., 2016</xref>), we also hypothesized that the impact of retrotransposon insertion polymorphisms on human health could be mediated by gene regulatory effects.</p>
<p>Previously, it has only been possible to investigate the impact of retrotransposon polymorphisms on disease phenotypes for a limited number of individuals owing to the number of genomes that were available (<xref ref-type="bibr" rid="B41">Rishishwar et al., 2017</xref>). For the current study, we leveraged the accumulation of whole genome sequence and expression datasets, along with data on single nucleotide polymorphism (SNP) disease-associations, in order to perform a population level genome-wide screen for retrotransposon polymorphisms that are linked to complex health- and disease-related phenotypes.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Polymorphic Transposable Element (PolyTE) and SNP Genotypes</title>
<p>Human polymorphic TE (polyTE) insertion presence/absence genotypes for whole genome sequences of 445 individuals from five human populations were accessed from the phase 3 variant release of the 1000 Genomes Project (1KGP) (<xref ref-type="bibr" rid="B2">Altshuler et al., 2015</xref>). Whole genome SNP genotypes were taken for the same set of individuals. The phase 3 variant release corresponds to the human genome reference sequence build GRCh37/hg19, and the 5 human populations are YRI: Yoruba in Ibadan, Nigeria from Africa, CEU: Utah Residents (CEPH) with Northern and Western European Ancestry, FIN: Finnish in Finland, GBR: British in England and Scotland and TSI: Toscani in Italia from Europe. We chose these genome sequence datasets because they have matching RNA-seq data for the same individuals [see Expression Quantitative Trait Locus (eQTL) Analysis section]. The YRI population was taken to represent the African continental population group (AFR), and the four populations from Europe (CEU, FIN, GBR, and TSI) were grouped together as the European (EUR) continental population group for downstream analysis (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>). PolyTE insertion genotypes were characterized by the 1KGP Structural Variation Group using the program MELT as previously described (<xref ref-type="bibr" rid="B47">Sudmant et al., 2015</xref>). Previously, we performed an independent validation the performance of this program for human polyTE insertion variant calling from whole genome sequences (<xref ref-type="bibr" rid="B39">Rishishwar et al., 2016</xref>). The polyTE genotype data were downloaded via the 1000 Genomes Project ftp hosted by the NCBI: <ext-link ext-link-type="uri" xlink:href="http://ftp-trace.ncbi.nlm.nih.gov/1000genomes/ftp/release/20130502/">http://ftp-trace.ncbi.nlm.nih.gov/1000genomes/ftp/release/20130502/</ext-link> For a given polyTE insertion site in the genome, there are three possible presence/absence genotype values for an individual genome: 0-no polyTE insertion (homozygote absent), 1-a single polyTE insertion (heterozygote), and 2-two polyTE insertions (homozygote present). PolyTE genotypes were used for eQTL analysis as described in section &#x201C;Expression Quantitative Trait Locus (eQTL) Analysis&#x201D;. For each of the two continental population groups, only polyTE insertions and SNPs with greater than 5% minor allele frequencies (MAF) were used for the downstream analysis to ensure both the confidence of genotype calls and the reliability of the association analyses. Minor alleles for TEs are assumed to be the insertion present allele, since the ancestral state for any polyTE insertion site corresponds to the absence of an insertion (<xref ref-type="bibr" rid="B40">Rishishwar et al., 2015</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Integrative data analysis used to screen for polyTE disease-associations. Matched DNA-seq and RNA-seq samples were analyzed for 445 individuals, 87 Africans (AFR &#x2013; blue) and 358 Europeans (EUR &#x2013; orange). Genome-wide genotypes were characterized for polyTE insertions (presence/absence) and SNPs, and the linkage disequilibrium (LD) structure for polyTE insertions and SNP alleles was characterized for all samples. The NHGRI-EBI GWAS database was mined to extract SNP disease-associations and related information on diseases, phenotypes, genes and SNP genomic locations. A series of filters was applied to screen for a set of high-confidence polyTE disease-associations. PolyTEs were evaluated for: (1) minor allele frequency (MAF), (2) linkage to disease-associated SNPS (i.e., GWAS hits), (3) overlap with tissue-specific enhancers, (4) associations with gene expression, and (5) functional relevance for blood- and immune system-related diseases.</p></caption>
<graphic xlink:href="fmicb-08-01418-g001.tif"/>
</fig>
</sec>
<sec><title>PolyTE-SNP Linkage Analysis</title>
<p>The GCTA program (version 1.25.0) was used to estimate the linkage disequilibrium (LD) structure for polyTEs and SNPs in genomic regions centered at each polyTE insertion site. For each polyTE insertion site, pairwise correlations (<italic>r</italic>) between the target polyTE insertion alleles and all SNP alleles in the same LD block were computed across all individual genome samples. A correlation (<italic>r</italic>) significance <italic>P</italic>-value threshold of 0.05 was used to identify all SNPs considered to be in LD with each polyTE insertion. Pairwise distances between polyTE insertion sites and linked SNPs were calculated as the number of base pairs between each polyTE insertion site and all linked SNP locations.</p>
</sec>
<sec><title>Genome Wide Association Studies (GWAS) for Disease</title>
<p>Associations between human genetic variants (SNPs) and health- or disease-related phenotypes were explored using the NHGRI-EBI GWAS database (<xref ref-type="bibr" rid="B29">MacArthur et al., 2017</xref>). GWAS database SNPs with genome-wide association values of <italic>P</italic> &#x003C; 10<sup>-5</sup> were taken for analysis, and the genomic location, specific health- or disease-related phenotype, identity of the risk allele and original reporting publications were recorded for each associated SNP. The GWAS SNPs were screened for LD with polyTE insertions as described in section &#x201C;PolyTE-SNP Linkage Analysis&#x201D; to yield a set of candidate disease-linked polyTE insertions for further analysis.</p>
</sec>
<sec><title>Evaluating polyTE Regulatory Potential</title>
<p>The regulatory potential for polyTE insertions was evaluated by considering their co-location with known enhancer sequences. Active enhancers for 127 cell-types and tissues were characterized by the Roadmap Epigenomics Project using the ChromHMM program (<xref ref-type="bibr" rid="B15">Ernst and Kellis, 2012</xref>; <xref ref-type="bibr" rid="B43">Roadmap Epigenomics et al., 2015</xref>). ChromHMM integrates multiple genome-wide chromatin datasets (i.e., epigenomes), such as ChIP-seq of various histone modifications, using a multivariate Hidden Markov Model to identify the locations of tissue-specific enhancers based on their characteristic chromatin states. The data files with genomic locations for enhancers across all 127 epigenomes were accessed through the project website at <ext-link ext-link-type="uri" xlink:href="http://mitra.stanford.edu/kundaje/leepc12/web_portal/chr_state_learning.html">http://mitra.stanford.edu/kundaje/leepc12/web_portal/chr_state_learning.html</ext-link>. The genomic locations of polyTE insertions that are in LD with disease-associated SNPs were compared with the genomic locations for enhancers from the 127 epigenomes, and polyTE insertions found to be located within active enhancer elements were considered to have regulatory potential. A subset of 27 epigenomes characterized for cells and tissues related to blood and the immune system &#x2013; such as T cells, B cells and hematopoietic stem cells &#x2013; were selected for downstream eQTL analysis [see Expression Quantitative Trait Locus (eQTL) Analysis].</p>
<p>The overall relative regulatory potential of polyTE insertions in a given epigenome <italic>i</italic> is quantified as:</p>
<disp-formula id="E1"><mml:math id="M1"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mtext>i</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mtext>i</mml:mtext></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mtext>i</mml:mtext></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>where t<sub>i</sub> is the proportion of polyTEs that are co-located with an enhancer element in a given epigenome <italic>i</italic>, and s<sub>i</sub> is the proportion of SNPs from polyTE LD blocks that overlap with an enhancer element in the same epigenome <italic>i</italic>.</p>
<p>Physical associations between TE-enhancer insertions and nearby gene promoter regions were evaluated with chromatin-chromatin interaction map data, based on several different data sources, including 4C, 5C, ChIA-PET, and Hi-C, using the Chromatin Chromatin Space Interaction (CCSI) database at <ext-link ext-link-type="uri" xlink:href="http://songyanglab.sysu.edu.cn/ccsi/">http://songyanglab.sysu.edu.cn/ccsi/</ext-link> (<xref ref-type="bibr" rid="B56">Xie et al., 2016</xref>).</p>
</sec>
<sec><title>Expression Quantitative Trait Locus (eQTL) Analysis</title>
<p>Associations between polyTE insertion genotypes and tissue-specific gene expression levels were characterized using eQTL analysis (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>). PolyTE insertion presence/absence genotypes were characterized as described in section &#x201C;Polymorphic Transposable Element (polyTE) and SNP Genotypes&#x201D;. RNA-seq gene expression data for the same 445 individual genome samples used for polyTE genotype characterization were taken from the GEUVADIS RNA sequencing project. Genome-wide expression levels were measured for the same lymphoblastoid cell lines, i.e., Epstein&#x2013;Barr virus (EBV) transformed B-lymphocytes (B cells), as used for DNA-seq analysis in the 1KGP. RNA isolation, library preparation, sequencing and read-to-genome mapping were performed as previously described (<xref ref-type="bibr" rid="B27">Lappalainen et al., 2013</xref>). As with the polyTE genotype data, the RNA-seq reads were mapped to the human genome build GRCh37/hg19. The process of gene expression normalization and quantification based on these RNA-seq data has been extensively validated as part of the GEUVEDIS project (<xref ref-type="bibr" rid="B49">t Hoen et al., 2013</xref>). The GEUVADIS RNA-seq data were used to compute gene expression levels for ENSEMBL gene models as previously described (<xref ref-type="bibr" rid="B17">Flicek et al., 2013</xref>). Normalization of gene expression levels was done using a combination of a modified reads per kilobase per million mapped reads (RPKM) approach followed by the probabilistic estimation of expression residuals (PEER) method as previously described (<xref ref-type="bibr" rid="B46">Stegle et al., 2012</xref>). This procedure has been shown to eliminate batch effects among different RNA-seq samples and to reduce the overall variance across samples, thereby ensuring the most accurate and comparable gene expression level inferences among samples. The normalized gene expression levels were accessed from the GUEVADIS project ftp server hosted at the EBI: <ext-link ext-link-type="uri" xlink:href="ftp://ftp.ebi.ac.uk/pub/databases/microarray/data/experiment/GEUV/E-GEUV-1/analysis_results/">ftp://ftp.ebi.ac.uk/pub/databases/microarray/data/experiment/GEUV/E-GEUV-1/analysis_results/</ext-link>.</p>
<p>PolyTE insertions that are (1) linked to at least one disease-associated SNP, and (2) located within a blood- or immune system-related enhancer were taken as a candidate set for eQTL analysis with the lymphoblastoid cell line RNA-seq data. PolyTE insertion presence/absence genotypes were regressed against gene expression levels to identify eQTLs (TE-eQTLs) using the program Matrix eQTL (<xref ref-type="bibr" rid="B45">Shabalin, 2012</xref>). Matrix eQTL was run using the additive linear (least squares model) option with gender and population used as covariates. This was done for all possible pairs of polyTE insertion sites from the candidate set and all genes. <italic>Cis</italic> vs. <italic>trans</italic> TE-eQTLs were defined later as polyTE insertion sites that fall inside (<italic>cis</italic>) or outside (<italic>trans</italic>) 1 megabase from gene boundaries. <italic>P</italic>-values were calculated for all TE-eQTL associations, and FDR <italic>q</italic>-values were then calculated to correct for multiple statistical tests. The genome-wide significant TE-eQTL association threshold was set at FDR <italic>q</italic> &#x003C; 0.05, corresponding to <italic>P</italic> = 4.7 &#x00D7; 10<sup>-7</sup> (AFR) and <italic>P</italic> = 2.6 &#x00D7; 10<sup>-7</sup> (EUR).</p>
</sec>
<sec><title>Interrogation of Disease-Associated Gene Function and Association Consistency</title>
<p>The potential functional impacts of disease-associated TE-eQTL were evaluated via comparison of annotated gene functions and reported GWAS phenotypes for polyTE-linked SNPs. Gene functions were taken from the NCBI Entrez gene summaries, and GWAS phenotypes were taken from the original literature where the associations were reported. Genes that were found to be functionally related to GWAS reported health- or disease-related phenotypes were further checked for the direction of association. If the GWAS SNP-gene pair shows the same direction of association as the polyTE-gene pair, then the pair was included in the final set of significant gene-polyTEs association pairs (<bold>Table <xref ref-type="table" rid="T1">1</xref></bold>). For each gene in the final set, its tissue-specific expression levels across 18 tissues, including 4 blood- and immune-related tissues, were taken from the Illumina BodyMap and GTEx projects (<xref ref-type="bibr" rid="B17">Flicek et al., 2013</xref>; <xref ref-type="bibr" rid="B31">Mele et al., 2015</xref>; <xref ref-type="bibr" rid="B42">Rivas et al., 2015</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>PolyTE disease associations.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">PolyTE<sup>a</sup></th>
<th valign="top" align="left">Chr<sup>b</sup></th>
<th valign="top" align="left">Pos<sup>b</sup></th>
<th valign="top" align="left">GWAS SNP<sup>c</sup></th>
<th valign="top" align="left">GWAS Phenotype<sup>c</sup></th>
<th valign="top" align="left">GWAS gene<sup>c</sup></th>
<th valign="top" align="left">GWAS <italic>P</italic>-value<sup>c</sup></th>
<th valign="top" align="left">#Enhancer Overlaps<sup>d</sup></th>
<th valign="top" align="left">eGene<sup>d</sup></th>
<th valign="top" align="left">eQTL <italic>P</italic>-value<sup>d</sup></th>
<th valign="top" align="left">eQTL Type<sup>d</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Alu-2829</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">154966214</td>
<td valign="top" align="left">rs13064954</td>
<td valign="top" align="left">Diabetic retinopathy</td>
<td valign="top" align="left"><italic>LINC00881, CCNL1</italic></td>
<td valign="top" align="left">7.00E-07</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left"><italic>LILRA1</italic></td>
<td valign="top" align="left">5.94e-10</td>
<td valign="top" align="left"><italic>Trans</italic></td>
</tr>
<tr>
<td valign="top" align="left">Alu-5072</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">32589834</td>
<td valign="top" align="left">rs4530903</td>
<td valign="top" align="left">Lymphoma</td>
<td valign="top" align="left"><italic>TRNAI25</italic></td>
<td valign="top" align="left">2.00E-08</td>
<td valign="top" align="left">20</td>
<td valign="top" align="left"><italic>HLA-DRB5</italic></td>
<td valign="top" align="left">8.49e-13</td>
<td valign="top" align="left"><italic>Cis</italic></td>
</tr>
<tr>
<td valign="top" align="left">Alu-5075</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">32657952</td>
<td valign="top" align="left">rs2858870</td>
<td valign="top" align="left">Nodular sclerosis Hodgkin lymphoma</td>
<td valign="top" align="left"><italic>TRNAI25</italic></td>
<td valign="top" align="left">8.00E-18</td>
<td valign="top" align="left">15</td>
<td valign="top" align="left"><italic>HLA-DQB1-AS1</italic></td>
<td valign="top" align="left">1.36e-11</td>
<td valign="top" align="left"><italic>Cis</italic></td>
</tr>
<tr>
<td valign="top" align="left">SVA-282</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">33030313</td>
<td valign="top" align="left">rs3077</td>
<td valign="top" align="left">Chronic hepatitis B infection</td>
<td valign="top" align="left"><italic>HLA-DPA1</italic></td>
<td valign="top" align="left">5.00E-39</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left"><italic>HLA-DPB2</italic></td>
<td valign="top" align="left">1.05e-13</td>
<td valign="top" align="left"><italic>Cis</italic></td>
</tr>
<tr>
<td valign="top" align="left">SVA-401</td>
<td valign="top" align="left">9</td>
<td valign="top" align="left">33130564</td>
<td valign="top" align="left">rs10758189</td>
<td valign="top" align="left">IgG glycosylation</td>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="left">2.00E-06</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left"><italic>B4GALT1</italic></td>
<td valign="top" align="left">4.47e-20</td>
<td valign="top" align="left"><italic>Cis</italic></td>
</tr>
<tr>
<td valign="top" align="left">SVA-438</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">17712792</td>
<td valign="top" align="left">rs6602203</td>
<td valign="top" align="left">Glucose homeostasis traits</td>
<td valign="top" align="left"><italic>ST8SIA6-AS1, PRPF38AP2</italic></td>
<td valign="top" align="left">5.00E-06</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left"><italic>TMEM236</italic></td>
<td valign="top" align="left">1.30e-07</td>
<td valign="top" align="left"><italic>Cis</italic></td></tr>
</tbody></table>
<table-wrap-foot>
<attrib><italic><sup>a</sup>PolyTE insertion identifier following the nomenclature of the 1KGP structural variation group. <sup>b</sup>Genomic location of the polyTE. <sup>c</sup>Information on the linked disease-associated GWAS SNP, disease phenotype and gene. <sup>d</sup>Information on the regulatory potential of the polyTE insertion based on co-location with enhancers and eQTL analysis.</italic></attrib>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec><title>Results</title>
<p>We used a genome-scale data analysis approach to explore the potential impact of human genetic variation generated by the activity of TEs on health and disease (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>). This approach entailed an integrative analysis of (1) TE insertion polymorphisms, (2) SNPs, (3) SNP-disease associations, (4) tissue-specific enhancers, (5) eQTL, and (6) gene function/expression profiles. The rationale behind this approach was to employ a series of successive genome-wide filters, which would converge on a set of high-confidence TE insertion polymorphisms that are most likely to impact health- or disease-related phenotypes. Our analysis started with 5,845 polyTE insertions, with MAF > 0.05 for two continental population groups (European and African), and converged on a final set of seven high-confidence TE disease-association candidates (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>). The final set of seven health/disease-implicated TE insertion polymorphisms that we found are distinguished by their linkage to disease-associated SNPs as well as their regulatory and functional properties. We describe the results and implications for each step in our TE disease-association screen in the sections below.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Results of the genome-wide screen for polyTE disease-associations. As illustrated in <bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>, a series of filters was applied to screen for a final set of high-confidence polyTE disease-associations. The number of polyTE insertions that remain after the application of each successive filter is shown for the African (AFR &#x2013; blue) and European (EUR &#x2013; orange) population groups.</p></caption>
<graphic xlink:href="fmicb-08-01418-g002.tif"/>
</fig>
<sec><title>Linkage Disequilibrium for PolyTEs and Disease-Associated SNPs</title>
<p>The genomic locations of polyTE insertions were characterized for 445 individuals from one African (AFR) and four European (EUR) populations as described in section &#x201C;Polymorphic Transposable Element (polyTE) and SNP Genotypes&#x201D; of the Materials and Methods. This was done for the most common families of active human TEs: Alu, L1, and SVA. For each polyTE insertion location, individual genotypes were characterized as homozygous absent (0), heterozygous (1), or homozygous present (2). The distributions of polyTE insertion genotypes among individuals from each population were used to screen for polyTEs that are found at relatively high MAF > 0.05. The LD structure of the resulting common polyTE insertions, with adjacent common SNPs (also at MAF > 0.05), was then defined using correlation analysis across individual genome samples (see Materials and Methods section PolyTE-SNP Linkage Analysis). In addition, the genomic locations of common polyTE insertion variants and their linked SNPs were compared to the locations of disease-associated SNPs reported in the NHGRI-EBI GWAS database [see Materials and Methods Genome Wide Association Studies (GWAS) for Disease]. Linkage correlation coefficients between all polyTE insertions analyzed here and GWAS SNPs are shown in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref></bold>.</p>
<p>Distributions of LD correlations between polyTEs and adjacent SNPs were compared separately for non-disease-associated vs. disease-associated SNPs. For all three families of active human TEs, in both the AFR and EUR population groups, polyTEs are found in significantly higher LD with disease-associated SNPs compared to non-disease-associated SNPs (<bold>Figure <xref ref-type="fig" rid="F3">3A</xref></bold>). In addition, polyTE variants are located closer to disease-associated SNPs than non-disease-associated SNPs for the EUR population group (<bold>Figure <xref ref-type="fig" rid="F3">3B</xref></bold>). A similar enrichment was not seen for the AFR population group, which may be attributed to the lower number of samples available for analysis for this group (AFR = 87 vs. EUR = 358). Indeed, when a larger number of AFR samples, which do not have matched RNA-seq data, were used for the same linkage analysis, the results were qualitatively identical to those seen for the EUR samples analyzed here. Taken together, these results indicate that polyTEs are more likely to be tightly linked to disease-associated SNPs compared to adjacent linked SNPs from the same LD blocks, suggesting a possible role in disease etiology for some TE variants. This is notable in light of the facts that (1) the TE genotypes were not considered in the initial association studies, and (2) TE insertions entail substantially larger-scale genetic variants than SNPs. Thus, polyTEs found on the same haplotypes as disease-associated SNPs may be expected to have an even greater impact on health- and disease-outcomes in some cases.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Linkage between polyTE insertions and SNP disease-associations from GWAS. <bold>(A)</bold> Distributions of the allele correlation coefficients (<italic>r</italic>) are shown for (1) polyTE insertions and non-GWAS SNPs, and (2) polyTE insertions and GWAS SNPs. Higher correlation coefficients indicate tighter linkage. The significance of the differences for the non-GWAS SNP vs. the GWAS SNP correlation coefficient distributions are indicated (Kolmogorov&#x2013;Smirnov test; <sup>&#x2217;&#x2217;&#x2217;</sup><italic>P</italic> &#x003C; 2.4 &#x00D7; 10<sup>-5</sup>). <bold>(B)</bold> Density distributions of the distance between polyTE insertions and SNPs to the nearest GWAS disease-associated SNP. Correlation coefficient <bold>(A)</bold> and density <bold>(B)</bold> distributions are shown separately for the Alu (red), L1 (blue), and SVA (green) TE families in the African (AFR &#x2013; blue) and European (EUR &#x2013; orange) population groups.</p></caption>
<graphic xlink:href="fmicb-08-01418-g003.tif"/>
</fig>
</sec>
<sec><title>Co-location of Disease-Linked PolyTEs with Tissue-Specific Enhancers</title>
<p>Given the fact that TEs are known to participate in human gene regulation via a wide variety of mechanisms (<xref ref-type="bibr" rid="B16">Feschotte, 2008</xref>; <xref ref-type="bibr" rid="B38">Rebollo et al., 2012</xref>; <xref ref-type="bibr" rid="B9">Chuong et al., 2017</xref>), we hypothesized that polyTEs may impact disease by virtue of gene regulatory effects. The regulatory potential of polyTEs linked to disease-associated SNPs was first evaluated by searching for insertions that are co-located with tissue-specific enhancers. The locations of enhancers were characterized for 127 cell- and tissue-types based on their chromatin signatures as described in section &#x201C;Evaluating polyTE Regulatory Potential&#x201D; of the Materials and Methods. An example of an enhancer co-located with a disease-linked polyTE is shown for an Alu element that is inserted 5&#x2032; to the Immunoglobulin Heavy Variable 2-26 (<italic>IGHV2-26</italic>) encoding gene (<bold>Figure <xref ref-type="fig" rid="F4">4A</xref></bold>). We found a total 607 disease-linked polyTEs co-located with enhancers in the AFR population group and 437 in EUR group; 391 (AFR) and 336 (EUR) of those enhancers correspond to blood- or immune-related tissues (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>). Details on the co-localization of polyTE insertions and the enhancers characterized for each epigenome are shown in <bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM2">S2</xref></bold>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Regulatory potential of disease-linked polyTE insertions. PolyTE insertions linked to disease-associated SNPs were evaluated for their co-location with tissue-specific enhancers. <bold>(A)</bold> UCSC Genome Browser screen capture showing an example of a polyTE insertion &#x2013; Alu-10537 &#x2013; that overlaps with a number of tissue specific enhancers. The genomic location of the Alu insertion on chromosome 14, downstream of the <italic>IGHV2-26</italic> gene (blue gene model), is indicated with a red arrow. The genomic locations of co-located enhancers, characterized based on chromatin signatures from a variety of tissue-specific epigenomes, are indicated with yellow bars. <bold>(B)</bold> Heatmap showing the relative regulatory potential (see Materials and Methods section Evaluating polyTE Regulatory Potential) of polyTE insertions for a variety of tissue-specific epigenomes. Blood- and immune-related tissues are shown separately from examples of the other tissue types analyzed here. <bold>(C)</bold> Developmental lineage of immune-related cells for which enhancer genomic locations were characterized. <bold>(D)</bold> The mean relative regulatory potential for disease-linked polyTE insertions is shown for blood- and immune-related tissues compared to all other tissue-types analyzed here. Values for the African (AFR &#x2013; blue) and European (EUR &#x2013; orange) population groups are shown separately.</p></caption>
<graphic xlink:href="fmicb-08-01418-g004.tif"/>
</fig>
<p>We estimated the overall regulatory potential for disease-linked polyTEs in all cell- and tissue-types by computing the relative ratio of enhancer co-located insertions as described in section &#x201C;Evaluating polyTE Regulatory Potential&#x201D; of the Materials and Methods. The results of this analysis are considered separately for the blood- and immune-related tissues (<bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>) and all other tissues from which enhancers were characterized. Enhancer co-located disease-linked polyTEs from blood/immune cell-types show higher overall regulatory potential than ones that are co-located with enhancers characterized for the other tissue-types (<bold>Figures <xref ref-type="fig" rid="F4">4C,D</xref></bold>). These results suggest that the set of disease-linked polyTEs studied here may have a disproportionate impact on immune-related diseases, and we focused our subsequent efforts on this subset of health conditions.</p>
</sec>
<sec><title>Expression Associations for Disease-Linked and Enhancer Co-located PolyTEs</title>
<p>We further evaluated the regulatory potential of the polyTEs that were found to be both disease-linked and co-located with blood- and immune-related enhancers using an eQTL approach [see Materials and Methods section Expression Quantitative Trait Locus (eQTL) analysis]. Genotypes for this subset of polyTEs from the 445 genome samples analyzed here were regressed against gene expression levels characterized from lymphoblastoid cell lines for the same individuals. Quantile&#x2013;quantile (Q&#x2013;Q) plots comparing the observed vs. expected <italic>P</italic>-values for the e-QTL analysis in the AFR and EUR population groups are shown in <bold>Figure <xref ref-type="fig" rid="F5">5A</xref></bold>, revealing a number of statistically significant associations that are likely to be true-positives. There are 83 (AFR) and 42 (EUR) genome-wide significant TE-eQTL (<bold>Supplementary Table <xref ref-type="supplementary-material" rid="SM3">S3</xref></bold>), and they are enriched in genomic regions that encode immune-related genes (<bold>Figure <xref ref-type="fig" rid="F5">5B</xref></bold>). We narrowed this list further by selecting the strongest TE-eQTL association for each individual polyTE, resulting in a final list of 37 (AFR) and 24 (EUR) immune-related TE-eQTL (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Expression quantitative trait (eQTL) analysis for disease-linked polyTEs. eQTL analysis was performed by regressing lymphoblastoid gene expression levels against polyTE insertion genotypes for the for the African (AFR &#x2013; blue) and European (EUR &#x2013; orange) individuals analyzed here. <bold>(A)</bold> Quantile&#x2013;quantile (Q&#x2013;Q) plots relating the observed (<italic>y</italic>-axis) to the expected (<italic>x</italic>-axis) TE-eQTL log transformed <italic>P</italic>-values. <bold>(B)</bold> Manhattan plots showing the genomic distributions of TE-eQTL log transformed <italic>P</italic>-values. The dashed line corresponds to a false discovery rate (FDR) threshold of <italic>q</italic> &#x003C; 0.05, corresponding to <italic>P</italic> = 4.7 &#x00D7; 10<sup>-7</sup> (AFR) and <italic>P</italic> = 2.6 &#x00D7; 10<sup>-7</sup> (EUR).</p></caption>
<graphic xlink:href="fmicb-08-01418-g005.tif"/>
</fig>
<p>The results of the TE-eQTL analysis further underscore the regulatory potential of the disease-linked polyTEs characterized here and also allowed us to narrow down the list of candidate insertions. Starting with the list of TE-eQTL, we searched for &#x2018;consistent&#x2019; examples where the disease-linked polyTE is associated with the expression of a gene that is functionally related to the annotated disease phenotype. This allowed us to converge on a final set of seven high-confidence disease-associated TE insertion polymorphisms (<bold>Figure <xref ref-type="fig" rid="F2">2</xref></bold> and <bold>Table <xref ref-type="table" rid="T1">1</xref></bold>). Four of these disease-associated polyTEs are illustrated in <bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold>, and we provide additional information on two examples in the following section &#x201C;Effects of polyTE Insertions on Immune- and Blood-Related Conditions.&#x201D; The four examples shown in <bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold> all correspond to polyTEs that are linked to disease-associated SNPs and co-located with enhancers characterized from blood- or immune system-related tissues; in addition, the genes that these polyTE insertions regulate are all known to function in the immune system.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Gene expression profiles and eQTL results for disease-associated polyTE insertions. Bar-charts of tissue-specific expression levels and box-pots of eQTL analyses are shown for four examples of TE-eQTLs corresponding to disease-linked and enhancer co-located polyTE insertions that regulate immune-related genes: <bold>(A)</bold> Alu-2829 and <italic>LILRA1</italic>, <bold>(B)</bold> Alu-5072 and <italic>HLA-DRB5</italic>, <bold>(C)</bold> Alu-5075 and <italic>HLA-DQB1-AS1</italic>, and <bold>(D)</bold> SVA-282 and <italic>HLA-DPB2</italic>. Bar-charts show tissue-specific expression levels as normalized RPKM values (green). The inset eQTL box-plots show individual sample gene expression levels (<italic>y</italic>-axis) regressed against polyTE insertion presence/absence genotypes (<italic>x</italic>-axis): 0-homozygous absent, 1-heterozygous, 2-homozygous present. Each dot represents a single individual from the African (AFR &#x2013; blue) and/or European (EUR &#x2013; orange) population groups.</p></caption>
<graphic xlink:href="fmicb-08-01418-g006.tif"/>
</fig>
<p>Six of the seven disease-associated polyTE insertions are considered to be population-specific, based on significant eQTL results in only one population, whereas a single case is shared between both the AFR and EUR population groups (<bold>Figure <xref ref-type="fig" rid="F6">6C</xref></bold>). However, two of the six cases considered to be population-specific using the eQTL criterion do show consistent trends across populations but failed to reach genome-wide significance when controls for multiple statistical tests were implemented (<bold>Figures <xref ref-type="fig" rid="F6">6D</xref>, <xref ref-type="fig" rid="F7">7B</xref></bold>).</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>PolyTE insertions associated with immune- and blood-related conditions. <bold>(A)</bold> UCSC Genome Browser screen capture showing the location of the SVA-401 insertion (red arrow) on chromosome 19 within the second exon of the <italic>B4GALT1</italic> gene. The inset shows the genomic locations of co-located enhancers, characterized based on chromatin signatures from a variety of tissue-specific epigenomes locations, as yellow bars. The bar-chart shows <italic>B4GALT1</italic> tissue-specific expression levels as normalized RPKM values (green). <bold>(B)</bold> eQTL box-plots show individual sample gene expression levels (<italic>y</italic>-axis) regressed against SVA-401 insertion presence/absence genotypes (<italic>x</italic>-axis): 0-homozygous absent, 1-heterozygous, 2-homozygous present. Each dot represents a single individual from the African (AFR &#x2013; blue) or European (EUR &#x2013; orange) population groups. <bold>(C)</bold> <italic>B4GALT1</italic> catalyzed glycosylation of the Immunoglobulin G (IgG) antibody, resulting in conversion from pro- to anti-inflammatory activity. <bold>(D)</bold> UCSC Genome Browser screen capture showing the location of the SVA-438 insertion (red arrow) on chromosome 10 within the first exon of the <italic>STAM</italic> gene, upstream of the regulated <italic>TMEM236</italic> gene. The inset shows the genomic locations of co-located enhancers (yellow bars). <bold>(E)</bold> Bar-chart of <italic>TMEM236</italic> tissue-specific expression levels and box-pot of the SVA-438 <italic>TMEM236</italic> eQTL analyses. <bold>(F)</bold> Functional role and cell-type specific expression profile for <italic>TMEM236</italic>.</p></caption>
<graphic xlink:href="fmicb-08-01418-g007.tif"/>
</fig>
</sec>
<sec><title>Effects of PolyTE Insertions on Immune- and Blood-Related Conditions</title>
<p>Here, we described two specific examples of the effects that polyTE insertions can exert on immune- and blood-related disease phenotypes. <bold>Figure <xref ref-type="fig" rid="F7">7A</xref></bold> shows the SVA-401 insertion that is co-located with a cell-type specific enhancer found in the second intron of the Beta-1,4-Galactosyltransferase 1 (<italic>B4GALT1</italic>) encoding gene, which is normally expressed at high levels in immune-related tissues. Chromatin interaction maps characterized for several different cell types &#x2013; CD34, GM12878, and Mcf7 &#x2013; show that this <italic>B4GALT1</italic> intronic enhancer physically associates with the gene&#x2019;s promoter region. The disruption of the <italic>B4GALT1</italic> enhancer by the SVA insertion is associated with down-regulation of the gene in B-cells, for both AFR and EUR population groups (<bold>Figure <xref ref-type="fig" rid="F7">7B</xref></bold>). <italic>B4GALT1</italic> encodes a glycosyltransferase that functions in the glycosylation of the Immunoglobulin G (IgG) antibody in such a way as to convert its activity from pro- to anti-inflammatory (<bold>Figure <xref ref-type="fig" rid="F7">7C</xref></bold>) (<xref ref-type="bibr" rid="B22">Kaneko et al., 2006</xref>; <xref ref-type="bibr" rid="B3">Anthony and Ravetch, 2010</xref>; <xref ref-type="bibr" rid="B28">Lauc et al., 2013</xref>). Down-regulation of this gene in individuals with the enhancer SVA insertion should thereby serve to keep the IgG molecule in a pro-inflammatory state. Consistent with this idea, the <italic>B4GALT1</italic> enhancer SVA insertion is linked to a genomic region implicated by GWAS in both inflammatory conditions and autoimmune diseases such as systemic lupus erythematosus and Crohn&#x2019;s disease (<xref ref-type="bibr" rid="B28">Lauc et al., 2013</xref>).</p>
<p>Another example of an SVA insertion into an enhancer element is shown for the adjacently located Signal Transducing Adaptor Molecule (<italic>STAM</italic>) and Transmembrane Protein 236 (<italic>TMEM236</italic>) encoding genes. The SVA-438 insertion is co-located with an enhancer in the first intron of the <italic>STAM</italic> gene (<bold>Figure <xref ref-type="fig" rid="F7">7D</xref></bold>), but its presence is associated with changes in expression of the nearby <italic>TMEM236</italic> gene (<bold>Figure <xref ref-type="fig" rid="F7">7E</xref></bold>). <italic>TMEM236</italic> is located &#x223C;100 kbp downstream of the SVA-438 insertion and is most highly expressed in pancreatic islet &#x03B1;-cells (<bold>Figure <xref ref-type="fig" rid="F7">7F</xref></bold>) (<xref ref-type="bibr" rid="B1">Ackermann et al., 2016</xref>; <xref ref-type="bibr" rid="B53">Wang Y.J. et al., 2016</xref>). Islet &#x03B1;-cells function to secrete glucagon, a peptide hormone that elevates glucose levels in the blood (<xref ref-type="bibr" rid="B37">Quesada et al., 2008</xref>). The SVA-438 insertion is associated with increased expression of <italic>TMEM236</italic>, which would be expected to lead to increased blood glucose levels. This expectation is consistent with the fact that the SVA-438 insertion is also linked to the risk allele (T) of the SNP rs6602203, which is associated with a reduced metabolic clearance rate of insulin (MCRI), an endophenotype that is associated with the risk of type 2 diabetes (<xref ref-type="bibr" rid="B34">Palmer et al., 2015</xref>). In other words, up-regulation of <italic>TMEM236</italic> by the SVA-438 insertion may be mechanistically linked to insulin resistance by virtue of increasing blood sugar and decreasing insulin clearance.</p>
</sec>
</sec>
<sec><title>Discussion</title>
<p>The results reported here underscore the influence that retrotransposon insertion polymorphisms can exert on human health- and disease-related phenotypes. The integrative data analysis approach that we took for this study also revealed how polyTE disease-associations are mediated by the gene regulatory properties of retrotransposon insertions. We adopted a conservative approach to screen for the potential regulatory effects of retrotransposon insertions by choosing candidate elements as those that were inserted into regions previously defined as tissue-specific enhancers in blood/immune cells. Retrotransposons that insert into enhancer sequences could entail loss-of-function mutants by virtue of disrupting enhancer sequences, or they could serve as gain-of-function mutants by altering enhancer activity. Our results can be considered to show instances of both loss- and gain-of-function enhancer mutations with respect to the decrease or increase, respectively, of gene expression levels that are associated with element insertion genotypes (<bold>Figures <xref ref-type="fig" rid="F6">6</xref>, <xref ref-type="fig" rid="F7">7</xref></bold>). Nevertheless, it is worth noting that our conservative approach could be prone to false negatives as it would not uncover novel enhancer activity provided by element insertions at new locations in the genome.</p>
<p>The TE regulatory findings that we report here are consistent with previous studies showing that TE-derived sequences have contributed a wide variety of gene regulatory elements to the human genome (<xref ref-type="bibr" rid="B16">Feschotte, 2008</xref>; <xref ref-type="bibr" rid="B38">Rebollo et al., 2012</xref>; <xref ref-type="bibr" rid="B9">Chuong et al., 2017</xref>), including promoters (<xref ref-type="bibr" rid="B21">Jordan et al., 2003</xref>; <xref ref-type="bibr" rid="B30">Marino-Ramirez et al., 2005</xref>; <xref ref-type="bibr" rid="B13">Conley et al., 2008</xref>), enhancers (<xref ref-type="bibr" rid="B6">Bejerano et al., 2006</xref>; <xref ref-type="bibr" rid="B25">Kunarso et al., 2010</xref>; <xref ref-type="bibr" rid="B10">Chuong et al., 2013</xref>, <xref ref-type="bibr" rid="B8">2016</xref>; <xref ref-type="bibr" rid="B32">Notwell et al., 2015</xref>), transcription terminators (<xref ref-type="bibr" rid="B12">Conley and Jordan, 2012</xref>) and several classes of small RNAs (<xref ref-type="bibr" rid="B54">Weber, 2006</xref>; <xref ref-type="bibr" rid="B36">Piriyapongsa et al., 2007</xref>; <xref ref-type="bibr" rid="B23">Kapusta et al., 2013</xref>). Human TEs can also influence gene regulation by modulating various aspects of chromatin structure throughout the genome (<xref ref-type="bibr" rid="B26">Lander et al., 2001</xref>; <xref ref-type="bibr" rid="B35">Pavlicek et al., 2001</xref>; <xref ref-type="bibr" rid="B44">Schmidt et al., 2012</xref>; <xref ref-type="bibr" rid="B20">Jacques et al., 2013</xref>; <xref ref-type="bibr" rid="B48">Sundaram et al., 2014</xref>; <xref ref-type="bibr" rid="B51">Wang et al., 2015</xref>).</p>
<p>It is important to note that the research efforts which have uncovered the regulatory properties of human TEs, including a number of our own studies, have dealt exclusively with sequences derived from relatively ancient insertion events. These ancient TE insertions are present at the same (fixed) locations in the genome sequences of all human individuals. In other words, previously described TE-derived regulatory sequences are uniformly present among individual human genomes and thereby do not represent a source of structural genetic variation. Such fixed TE-derived regulatory sequences may not be expected to provide for gene regulatory variation among individuals or for that matter to contribute to inter-individual differences related to health and disease.</p>
<p>Nevertheless, we recently showed that TE insertion polymorphisms also exert regulatory effects on the human genome (<xref ref-type="bibr" rid="B52">Wang L. et al., 2016</xref>). Specifically, polyTE insertions were shown to contribute to both inter-individual and population-specific differences in gene expression and to facilitate the re-wiring of transcriptional networks. The results reported here extend those findings up the hierarchy of human biological organization by revealing potential mechanistic links between polyTE-induced gene regulatory changes and the endophenotypes that underlie human health and disease.</p>
</sec>
<sec><title>Author Contributions</title>
<p>LW performed all of the analyses described in the study. EN contributed all GWAS data used in the study. IJ conceived of, designed and supervised the study. All authors contributed to the drafting and revision of the manuscript.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> LW was supported by the Georgia Tech Bioinformatics Graduate Program. EN and IJ were supported by the IHRC-Georgia Tech Applied Bioinformatics Laboratory (ABiL).</p></fn>
</fn-group>
<ack>
<p>The authors would like to thank Jianrong Wang for his advice on the analysis of tissue-specific enhancer elements.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fmicb.2017.01418/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fmicb.2017.01418/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S1</label>
<caption><p>Linkage disequilibrium (LD) structure between polyTEs and GWAS disease-associated SNPs.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_1.XLSX" id="SM5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S2</label>
<caption><p>Genomic co-localization of polyTEs with tissue-specific enhancers.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_2.XLSX" id="SM6" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.XLSX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>TABLE S3</label>
<caption><p>Expression quantitative trait loci (eQTL) results for disease-linked and enhancer co-located polyTEs.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_3.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ackermann</surname> <given-names>A. M.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Schug</surname> <given-names>J.</given-names></name> <name><surname>Naji</surname> <given-names>A.</given-names></name> <name><surname>Kaestner</surname> <given-names>K. H.</given-names></name></person-group> (<year>2016</year>). <article-title>Integration of ATAC-seq and RNA-seq identifies human alpha cell and beta cell signature genes.</article-title> <source><italic>Mol. Metab.</italic></source> <volume>5</volume> <fpage>233</fpage>&#x2013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.1016/j.molmet.2016.01.002</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altshuler</surname> <given-names>D. M.</given-names></name> <name><surname>Durbin</surname> <given-names>R. M.</given-names></name> <name><surname>Abecasis</surname> <given-names>G. R.</given-names></name> <name><surname>Bentley</surname> <given-names>D. R.</given-names></name> <name><surname>Chakravarti</surname> <given-names>A.</given-names></name> <name><surname>Clark</surname> <given-names>A. G.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>A global reference for human genetic variation.</article-title> <source><italic>Nature</italic></source> <volume>526</volume> <fpage>68</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1038/nature15393</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anthony</surname> <given-names>R. M.</given-names></name> <name><surname>Ravetch</surname> <given-names>J. V.</given-names></name></person-group> (<year>2010</year>). <article-title>A novel role for the IgG Fc glycan: the anti-inflammatory activity of sialylated IgG Fcs.</article-title> <source><italic>J. Clin. Immunol.</italic></source> <volume>30(Suppl. 1)</volume> <fpage>S9</fpage>&#x2013;<lpage>S14</lpage>. <pub-id pub-id-type="doi">10.1007/s10875-010-9405-6</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Batzer</surname> <given-names>M. A.</given-names></name> <name><surname>Deininger</surname> <given-names>P. L.</given-names></name></person-group> (<year>1991</year>). <article-title>A human-specific subfamily of Alu sequences.</article-title> <source><italic>Genomics</italic></source> <volume>9</volume> <fpage>481</fpage>&#x2013;<lpage>487</lpage>. <pub-id pub-id-type="doi">10.1016/0888-7543(91)90414-A</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Batzer</surname> <given-names>M. A.</given-names></name> <name><surname>Gudi</surname> <given-names>V. A.</given-names></name> <name><surname>Mena</surname> <given-names>J. C.</given-names></name> <name><surname>Foltz</surname> <given-names>D. W.</given-names></name> <name><surname>Herrera</surname> <given-names>R. J.</given-names></name> <name><surname>Deininger</surname> <given-names>P. L.</given-names></name></person-group> (<year>1991</year>). <article-title>Amplification dynamics of human-specific (HS) Alu family members.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>19</volume> <fpage>3619</fpage>&#x2013;<lpage>3623</lpage>. <pub-id pub-id-type="doi">10.1093/nar/19.13.3619</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bejerano</surname> <given-names>G.</given-names></name> <name><surname>Lowe</surname> <given-names>C. B.</given-names></name> <name><surname>Ahituv</surname> <given-names>N.</given-names></name> <name><surname>King</surname> <given-names>B.</given-names></name> <name><surname>Siepel</surname> <given-names>A.</given-names></name> <name><surname>Salama</surname> <given-names>S. R.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>A distal enhancer and an ultraconserved exon are derived from a novel retroposon.</article-title> <source><italic>Nature</italic></source> <volume>441</volume> <fpage>87</fpage>&#x2013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1038/nature04696</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brouha</surname> <given-names>B.</given-names></name> <name><surname>Schustak</surname> <given-names>J.</given-names></name> <name><surname>Badge</surname> <given-names>R. M.</given-names></name> <name><surname>Lutz-Prigge</surname> <given-names>S.</given-names></name> <name><surname>Farley</surname> <given-names>A. H.</given-names></name> <name><surname>Moran</surname> <given-names>J. V.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Hot L1s account for the bulk of retrotransposition in the human population.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>100</volume> <fpage>5280</fpage>&#x2013;<lpage>5285</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0831042100</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuong</surname> <given-names>E. B.</given-names></name> <name><surname>Elde</surname> <given-names>N. C.</given-names></name> <name><surname>Feschotte</surname> <given-names>C.</given-names></name></person-group> (<year>2016</year>). <article-title>Regulatory evolution of innate immunity through co-option of endogenous retroviruses.</article-title> <source><italic>Science</italic></source> <volume>351</volume> <fpage>1083</fpage>&#x2013;<lpage>1087</lpage>. <pub-id pub-id-type="doi">10.1126/science.aad5497</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuong</surname> <given-names>E. B.</given-names></name> <name><surname>Elde</surname> <given-names>N. C.</given-names></name> <name><surname>Feschotte</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>Regulatory activities of transposable elements: from conflicts to benefits.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>18</volume> <fpage>71</fpage>&#x2013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1038/nrg.2016.139</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuong</surname> <given-names>E. B.</given-names></name> <name><surname>Rumi</surname> <given-names>M. A.</given-names></name> <name><surname>Soares</surname> <given-names>M. J.</given-names></name> <name><surname>Baker</surname> <given-names>J. C.</given-names></name></person-group> (<year>2013</year>). <article-title>Endogenous retroviruses function as species-specific enhancer elements in the placenta.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>45</volume> <fpage>325</fpage>&#x2013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.1038/ng.2553</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clayton</surname> <given-names>E. A.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Rishishwar</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>McDonald</surname> <given-names>J. F.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2016</year>). <article-title>Patterns of transposable element expression and insertion in cancer.</article-title> <source><italic>Front. Mol. Biosci.</italic></source> <volume>3</volume>:<issue>76</issue>. <pub-id pub-id-type="doi">10.3389/fmolb.2016.00076</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conley</surname> <given-names>A. B.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2012</year>). <article-title>Cell type-specific termination of transcription by transposable element sequences.</article-title> <source><italic>Mob. DNA</italic></source> <volume>3</volume>:<issue>15</issue>. <pub-id pub-id-type="doi">10.1186/1759-8753-3-15</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conley</surname> <given-names>A. B.</given-names></name> <name><surname>Piriyapongsa</surname> <given-names>J.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2008</year>). <article-title>Retroviral promoters in the human genome.</article-title> <source><italic>Bioinformatics</italic></source> <volume>24</volume> <fpage>1563</fpage>&#x2013;<lpage>1567</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btn243</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Koning</surname> <given-names>A. P. J.</given-names></name> <name><surname>Gu</surname> <given-names>W. J.</given-names></name> <name><surname>Castoe</surname> <given-names>T. A.</given-names></name> <name><surname>Batzer</surname> <given-names>M. A.</given-names></name> <name><surname>Pollock</surname> <given-names>D. D.</given-names></name></person-group> (<year>2011</year>). <article-title>Repetitive elements may comprise over two-thirds of the human genome.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>7</volume>:<issue>e1002384</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002384</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ernst</surname> <given-names>J.</given-names></name> <name><surname>Kellis</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>ChromHMM: automating chromatin-state discovery and characterization.</article-title> <source><italic>Nat. Methods</italic></source> <volume>9</volume> <fpage>215</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1906</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feschotte</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>Transposable elements and the evolution of regulatory networks.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>9</volume> <fpage>397</fpage>&#x2013;<lpage>405</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2337</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flicek</surname> <given-names>P.</given-names></name> <name><surname>Ahmed</surname> <given-names>I.</given-names></name> <name><surname>Amode</surname> <given-names>M. R.</given-names></name> <name><surname>Barrell</surname> <given-names>D.</given-names></name> <name><surname>Beal</surname> <given-names>K.</given-names></name> <name><surname>Brent</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Ensembl 2013.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>41</volume> <fpage>D48</fpage>&#x2013;<lpage>D55</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks1236</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hancks</surname> <given-names>D. C.</given-names></name> <name><surname>Kazazian</surname> <given-names>H. H.</given-names> <suffix>Jr.</suffix></name></person-group> (<year>2016</year>). <article-title>Roles for retrotransposon insertions in human disease.</article-title> <source><italic>Mob. DNA</italic></source> <volume>7</volume>:<issue>9</issue>. <pub-id pub-id-type="doi">10.1186/s13100-016-0065-9</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hancks</surname> <given-names>D. C.</given-names></name> <name><surname>Kazazian</surname> <given-names>H. H.</given-names></name></person-group> (<year>2012</year>). <article-title>Active human retrotransposons: variation and disease.</article-title> <source><italic>Curr. Opin. Genet. Dev.</italic></source> <volume>22</volume> <fpage>191</fpage>&#x2013;<lpage>203</lpage>. <pub-id pub-id-type="doi">10.1016/j.gde.2012.02.006</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jacques</surname> <given-names>P. E.</given-names></name> <name><surname>Jeyakani</surname> <given-names>J.</given-names></name> <name><surname>Bourque</surname> <given-names>G.</given-names></name></person-group> (<year>2013</year>). <article-title>The majority of primate-specific regulatory sequences are derived from transposable elements.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>9</volume>:<issue>e1003504</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003504</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jordan</surname> <given-names>I. K.</given-names></name> <name><surname>Rogozin</surname> <given-names>I. B.</given-names></name> <name><surname>Glazko</surname> <given-names>G. V.</given-names></name> <name><surname>Koonin</surname> <given-names>E. V.</given-names></name></person-group> (<year>2003</year>). <article-title>Origin of a substantial fraction of human regulatory sequences from transposable elements.</article-title> <source><italic>Trends Genet.</italic></source> <volume>19</volume> <fpage>68</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-9525(02)00006-9</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaneko</surname> <given-names>Y.</given-names></name> <name><surname>Nimmerjahn</surname> <given-names>F.</given-names></name> <name><surname>Ravetch</surname> <given-names>J. V.</given-names></name></person-group> (<year>2006</year>). <article-title>Anti-inflammatory activity of immunoglobulin G resulting from Fc sialylation.</article-title> <source><italic>Science</italic></source> <volume>313</volume> <fpage>670</fpage>&#x2013;<lpage>673</lpage>. <pub-id pub-id-type="doi">10.1126/science.1129594</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kapusta</surname> <given-names>A.</given-names></name> <name><surname>Kronenberg</surname> <given-names>Z.</given-names></name> <name><surname>Lynch</surname> <given-names>V. J.</given-names></name> <name><surname>Zhuo</surname> <given-names>X. Y.</given-names></name> <name><surname>Ramsay</surname> <given-names>L.</given-names></name> <name><surname>Bourque</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>9</volume>:<issue>e1003470</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003470</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kazazian</surname> <given-names>H. H.</given-names> <suffix>Jr.</suffix></name> <name><surname>Wong</surname> <given-names>C.</given-names></name> <name><surname>Youssoufian</surname> <given-names>H.</given-names></name> <name><surname>Scott</surname> <given-names>A. F.</given-names></name> <name><surname>Phillips</surname> <given-names>D. G.</given-names></name> <name><surname>Antonarakis</surname> <given-names>S. E.</given-names></name></person-group> (<year>1988</year>). <article-title>Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man.</article-title> <source><italic>Nature</italic></source> <volume>332</volume> <fpage>164</fpage>&#x2013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1038/332164a0</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kunarso</surname> <given-names>G.</given-names></name> <name><surname>Chia</surname> <given-names>N. Y.</given-names></name> <name><surname>Jeyakani</surname> <given-names>J.</given-names></name> <name><surname>Hwang</surname> <given-names>C.</given-names></name> <name><surname>Lu</surname> <given-names>X.</given-names></name> <name><surname>Chan</surname> <given-names>Y. S.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Transposable elements have rewired the core regulatory network of human embryonic stem cells.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>42</volume> <fpage>631</fpage>&#x2013;<lpage>634</lpage>. <pub-id pub-id-type="doi">10.1038/ng.600</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Linton</surname> <given-names>L. M.</given-names></name> <name><surname>Birren</surname> <given-names>B.</given-names></name> <name><surname>Nusbaum</surname> <given-names>C.</given-names></name> <name><surname>Zody</surname> <given-names>M. C.</given-names></name> <name><surname>Baldwin</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2001</year>). <article-title>Initial sequencing and analysis of the human genome.</article-title> <source><italic>Nature</italic></source> <volume>409</volume> <fpage>860</fpage>&#x2013;<lpage>921</lpage>. <pub-id pub-id-type="doi">10.1038/35057062</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lappalainen</surname> <given-names>T.</given-names></name> <name><surname>Sammeth</surname> <given-names>M.</given-names></name> <name><surname>Friedlander</surname> <given-names>M. R.</given-names></name> <name><surname>t Hoen</surname> <given-names>P. A. C.</given-names></name> <name><surname>Monlong</surname> <given-names>J.</given-names></name> <name><surname>Rivas</surname> <given-names>M. A.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Transcriptome and genome sequencing uncovers functional variation in humans.</article-title> <source><italic>Nature</italic></source> <volume>501</volume> <fpage>506</fpage>&#x2013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1038/nature12531</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lauc</surname> <given-names>G.</given-names></name> <name><surname>Huffman</surname> <given-names>J. E.</given-names></name> <name><surname>Pucic</surname> <given-names>M.</given-names></name> <name><surname>Zgaga</surname> <given-names>L.</given-names></name> <name><surname>Adamczyk</surname> <given-names>B.</given-names></name> <name><surname>Muzinic</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Loci associated with N-glycosylation of human immunoglobulin G show pleiotropy with autoimmune diseases and haematological cancers.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>9</volume>:<issue>e1003225</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1003225</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>MacArthur</surname> <given-names>J.</given-names></name> <name><surname>Bowler</surname> <given-names>E.</given-names></name> <name><surname>Cerezo</surname> <given-names>M.</given-names></name> <name><surname>Gil</surname> <given-names>L.</given-names></name> <name><surname>Hall</surname> <given-names>P.</given-names></name> <name><surname>Hastings</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog).</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>D896</fpage>&#x2013;<lpage>D901</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1133</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marino-Ramirez</surname> <given-names>L.</given-names></name> <name><surname>Lewis</surname> <given-names>K. C.</given-names></name> <name><surname>Landsman</surname> <given-names>D.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2005</year>). <article-title>Transposable elements donate lineage-specific regulatory sequences to host genomes.</article-title> <source><italic>Cytogenet. Genome Res.</italic></source> <volume>110</volume> <fpage>333</fpage>&#x2013;<lpage>341</lpage>. <pub-id pub-id-type="doi">10.1159/000084965</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mele</surname> <given-names>M.</given-names></name> <name><surname>Ferreira</surname> <given-names>P. G.</given-names></name> <name><surname>Reverter</surname> <given-names>F.</given-names></name> <name><surname>DeLuca</surname> <given-names>D. S.</given-names></name> <name><surname>Monlong</surname> <given-names>J.</given-names></name> <name><surname>Sammeth</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Human genomics. The human transcriptome across tissues and individuals.</article-title> <source><italic>Science</italic></source> <volume>348</volume> <fpage>660</fpage>&#x2013;<lpage>665</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaa0355</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Notwell</surname> <given-names>J. H.</given-names></name> <name><surname>Chung</surname> <given-names>T.</given-names></name> <name><surname>Heavner</surname> <given-names>W.</given-names></name> <name><surname>Bejerano</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>A family of transposable elements co-opted into developmental enhancers in the mouse neocortex.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>6</volume>:<issue>6644</issue>. <pub-id pub-id-type="doi">10.1038/ncomms7644</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ostertag</surname> <given-names>E. M.</given-names></name> <name><surname>Goodier</surname> <given-names>J. L.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Kazazian</surname> <given-names>H. H.</given-names> <suffix>Jr.</suffix></name></person-group> (<year>2003</year>). <article-title>SVA elements are nonautonomous retrotransposons that cause disease in humans.</article-title> <source><italic>Am. J. Hum. Genet.</italic></source> <volume>73</volume> <fpage>1444</fpage>&#x2013;<lpage>1451</lpage>. <pub-id pub-id-type="doi">10.1086/380207</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palmer</surname> <given-names>N. D.</given-names></name> <name><surname>Goodarzi</surname> <given-names>M. O.</given-names></name> <name><surname>Langefeld</surname> <given-names>C. D.</given-names></name> <name><surname>Wang</surname> <given-names>N.</given-names></name> <name><surname>Guo</surname> <given-names>X.</given-names></name> <name><surname>Taylor</surname> <given-names>K. D.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Genetic variants associated with quantitative glucose homeostasis traits translate to type 2 diabetes in Mexican Americans: the GUARDIAN (genetics underlying diabetes in Hispanics) consortium.</article-title> <source><italic>Diabetes</italic></source> <volume>64</volume> <fpage>1853</fpage>&#x2013;<lpage>1866</lpage>. <pub-id pub-id-type="doi">10.2337/db14-0732</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pavlicek</surname> <given-names>A.</given-names></name> <name><surname>Jabbari</surname> <given-names>K.</given-names></name> <name><surname>Paces</surname> <given-names>J.</given-names></name> <name><surname>Paces</surname> <given-names>V.</given-names></name> <name><surname>Hejnar</surname> <given-names>J. V.</given-names></name> <name><surname>Bernardi</surname> <given-names>G.</given-names></name></person-group> (<year>2001</year>). <article-title>Similar integration but different stability of Alus and LINEs in the human genome.</article-title> <source><italic>Gene</italic></source> <volume>276</volume> <fpage>39</fpage>&#x2013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1016/S0378-1119(01)00645-X</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piriyapongsa</surname> <given-names>J.</given-names></name> <name><surname>Marino-Ramirez</surname> <given-names>L.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2007</year>). <article-title>Origin and evolution of human microRNAs from transposable elements.</article-title> <source><italic>Genetics</italic></source> <volume>176</volume> <fpage>1323</fpage>&#x2013;<lpage>1337</lpage>. <pub-id pub-id-type="doi">10.1534/genetics.107.072553</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quesada</surname> <given-names>I.</given-names></name> <name><surname>Tuduri</surname> <given-names>E.</given-names></name> <name><surname>Ripoll</surname> <given-names>C.</given-names></name> <name><surname>Nadal</surname> <given-names>A.</given-names></name></person-group> (<year>2008</year>). <article-title>Physiology of the pancreatic alpha-cell and glucagon secretion: role in glucose homeostasis and diabetes.</article-title> <source><italic>J. Endocrinol.</italic></source> <volume>199</volume> <fpage>5</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1677/JOE-08-0290</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rebollo</surname> <given-names>R.</given-names></name> <name><surname>Romanish</surname> <given-names>M. T.</given-names></name> <name><surname>Mager</surname> <given-names>D. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Transposable elements: an abundant and natural source of regulatory sequences for host genes.</article-title> <source><italic>Annu. Rev. Genet.</italic></source> <volume>46</volume> <fpage>21</fpage>&#x2013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genet-110711-155621</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rishishwar</surname> <given-names>L.</given-names></name> <name><surname>Marino-Ramirez</surname> <given-names>L.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2016</year>). <article-title>Benchmarking computational tools for polymorphic transposable element detection.</article-title> <source><italic>Brief. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1093/bib/bbw072</pub-id> <comment>[Epub ahead of print]</comment>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rishishwar</surname> <given-names>L.</given-names></name> <name><surname>Tellez Villa</surname> <given-names>C. E.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2015</year>). <article-title>Transposable element polymorphisms recapitulate human evolution.</article-title> <source><italic>Mob. DNA</italic></source> <volume>6</volume>:<issue>21</issue>. <pub-id pub-id-type="doi">10.1186/s13100-015-0052-6</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rishishwar</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Clayton</surname> <given-names>E. A.</given-names></name> <name><surname>Marino-Ramirez</surname> <given-names>L.</given-names></name> <name><surname>McDonald</surname> <given-names>J. F.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2017</year>). <article-title>Population and clinical genetics of human transposable elements in the (post) genomic era.</article-title> <source><italic>Mob. Genet. Elements</italic></source> <volume>7</volume> <fpage>1</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1080/2159256X.2017.1280116</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rivas</surname> <given-names>M. A.</given-names></name> <name><surname>Pirinen</surname> <given-names>M.</given-names></name> <name><surname>Conrad</surname> <given-names>D. F.</given-names></name> <name><surname>Lek</surname> <given-names>M.</given-names></name> <name><surname>Tsang</surname> <given-names>E. K.</given-names></name> <name><surname>Karczewski</surname> <given-names>K. J.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Human genomics. Effect of predicted protein-truncating genetic variants on the human transcriptome.</article-title> <source><italic>Science</italic></source> <volume>348</volume> <fpage>666</fpage>&#x2013;<lpage>669</lpage>. <pub-id pub-id-type="doi">10.1126/science.1261877</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roadmap Epigenomics</surname> <given-names>C.</given-names></name> <name><surname>Kundaje</surname> <given-names>A.</given-names></name> <name><surname>Meuleman</surname> <given-names>W.</given-names></name> <name><surname>Ernst</surname> <given-names>J.</given-names></name> <name><surname>Bilenky</surname> <given-names>M.</given-names></name> <name><surname>Yen</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Integrative analysis of 111 reference human epigenomes.</article-title> <source><italic>Nature</italic></source> <volume>518</volume> <fpage>317</fpage>&#x2013;<lpage>330</lpage>. <pub-id pub-id-type="doi">10.1038/nature14248</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>D.</given-names></name> <name><surname>Schwalie</surname> <given-names>P. C.</given-names></name> <name><surname>Wilson</surname> <given-names>M. D.</given-names></name> <name><surname>Ballester</surname> <given-names>B.</given-names></name> <name><surname>Goncalves</surname> <given-names>A.</given-names></name> <name><surname>Kutter</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages.</article-title> <source><italic>Cell</italic></source> <volume>148</volume> <fpage>335</fpage>&#x2013;<lpage>348</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2011.11.058</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shabalin</surname> <given-names>A. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Matrix eQTL: ultra fast eQTL analysis via large matrix operations.</article-title> <source><italic>Bioinformatics</italic></source> <volume>28</volume> <fpage>1353</fpage>&#x2013;<lpage>1358</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts163</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stegle</surname> <given-names>O.</given-names></name> <name><surname>Parts</surname> <given-names>L.</given-names></name> <name><surname>Piipari</surname> <given-names>M.</given-names></name> <name><surname>Winn</surname> <given-names>J.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>7</volume> <fpage>500</fpage>&#x2013;<lpage>507</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2011.457</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sudmant</surname> <given-names>P. H.</given-names></name> <name><surname>Rausch</surname> <given-names>T.</given-names></name> <name><surname>Gardner</surname> <given-names>E. J.</given-names></name> <name><surname>Handsaker</surname> <given-names>R. E.</given-names></name> <name><surname>Abyzov</surname> <given-names>A.</given-names></name> <name><surname>Huddleston</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>An integrated map of structural variation in 2,504 human genomes.</article-title> <source><italic>Nature</italic></source> <volume>526</volume> <fpage>75</fpage>&#x2013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1038/nature15394</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sundaram</surname> <given-names>V.</given-names></name> <name><surname>Cheng</surname> <given-names>Y.</given-names></name> <name><surname>Ma</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Xing</surname> <given-names>X.</given-names></name> <name><surname>Edge</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Widespread contribution of transposable elements to the innovation of gene regulatory networks.</article-title> <source><italic>Genome Res.</italic></source> <volume>24</volume> <fpage>1963</fpage>&#x2013;<lpage>1976</lpage>. <pub-id pub-id-type="doi">10.1101/gr.168872.113</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>t Hoen</surname> <given-names>P. A.</given-names></name> <name><surname>Friedlander</surname> <given-names>M. R.</given-names></name> <name><surname>Almlof</surname> <given-names>J.</given-names></name> <name><surname>Sammeth</surname> <given-names>M.</given-names></name> <name><surname>Pulyakhina</surname> <given-names>I.</given-names></name> <name><surname>Anvar</surname> <given-names>S. Y.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>31</volume> <fpage>1015</fpage>&#x2013;<lpage>1022</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2702</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Xing</surname> <given-names>J.</given-names></name> <name><surname>Grover</surname> <given-names>D.</given-names></name> <name><surname>Hedges</surname> <given-names>D. J.</given-names></name> <name><surname>Han</surname> <given-names>K.</given-names></name> <name><surname>Walker</surname> <given-names>J. A.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>SVA elements: a hominid-specific retroposon family.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>354</volume> <fpage>994</fpage>&#x2013;<lpage>1007</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2005.09.085</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Vicente-Garcia</surname> <given-names>C.</given-names></name> <name><surname>Seruggia</surname> <given-names>D.</given-names></name> <name><surname>Molto</surname> <given-names>E.</given-names></name> <name><surname>Fernandez-Minan</surname> <given-names>A.</given-names></name> <name><surname>Neto</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>MIR retrotransposon sequences provide insulators to the human genome.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>112</volume> <fpage>E4428</fpage>&#x2013;<lpage>E4437</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1507253112</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Rishishwar</surname> <given-names>L.</given-names></name> <name><surname>Marino-Ramirez</surname> <given-names>L.</given-names></name> <name><surname>Jordan</surname> <given-names>I. K.</given-names></name></person-group> (<year>2016</year>). <article-title>Human population-specific gene expression and transcriptional network modification with polymorphic transposable elements.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>2318</fpage>&#x2013;<lpage>2328</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1286</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y. J.</given-names></name> <name><surname>Schug</surname> <given-names>J.</given-names></name> <name><surname>Won</surname> <given-names>K. J.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <name><surname>Naji</surname> <given-names>A.</given-names></name> <name><surname>Avrahami</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Single-Cell transcriptomics of the human endocrine pancreas.</article-title> <source><italic>Diabetes</italic></source> <volume>65</volume> <fpage>3028</fpage>&#x2013;<lpage>3038</lpage>. <pub-id pub-id-type="doi">10.2337/db16-0405</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weber</surname> <given-names>M. J.</given-names></name></person-group> (<year>2006</year>). <article-title>Mammalian small nucleolar RNAs are mobile genetic elements.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>2</volume>:<issue>e205</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.0020205</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wildschutte</surname> <given-names>J. H.</given-names></name> <name><surname>Williams</surname> <given-names>Z. H.</given-names></name> <name><surname>Montesion</surname> <given-names>M.</given-names></name> <name><surname>Subramanian</surname> <given-names>R. P.</given-names></name> <name><surname>Kidd</surname> <given-names>J. M.</given-names></name> <name><surname>Coffin</surname> <given-names>J. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Discovery of unfixed endogenous retrovirus insertions in diverse human populations.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>113</volume> <fpage>E2326</fpage>&#x2013;<lpage>E2334</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1602336113</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname> <given-names>X.</given-names></name> <name><surname>Ma</surname> <given-names>W.</given-names></name> <name><surname>Songyang</surname> <given-names>Z.</given-names></name> <name><surname>Luo</surname> <given-names>Z.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Dai</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>CCSI: a database providing chromatin-chromatin spatial interaction information.</article-title> <source><italic>Database</italic></source> <volume>2016</volume>:<issue>bav124</issue>. <pub-id pub-id-type="doi">10.1093/database/bav124</pub-id></citation></ref>
</ref-list>
</back>
</article>