<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2016.00207</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Evolutionary and Functional Features of Copy Number Variation in the Cattle Genome</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes"><name><surname>Keel</surname> <given-names>Brittney N.</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/348110/overview"/></contrib>
<contrib contrib-type="author"><name><surname>Lindholm-Perry</surname> <given-names>Amanda K.</given-names></name><uri xlink:href="http://loop.frontiersin.org/people/43539/overview"/></contrib>
<contrib contrib-type="author"><name><surname>Snelling</surname> <given-names>Warren M.</given-names></name><uri xlink:href="http://loop.frontiersin.org/people/67954/overview"/></contrib>
</contrib-group>
<aff><institution>Agricultural Research Service (USDA), Meat Animal Research Center</institution> <country>Clay Center, NE, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Johann S&#x000F6;lkner, University of Natural Resources and Life Sciences, Vienna, Austria</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Joanna Szyda, Wroclaw University of Environmental and Life Sciences, Poland; Alessandro Bagnato, University of Milan, Italy</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Brittney N. Keel <email>brittney.keel&#x00040;ars.usda.gov</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>22</day>
<month>11</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>7</volume>
<elocation-id>207</elocation-id>
<history>
<date date-type="received">
<day>03</day>
<month>08</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>11</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2016 Keel, Lindholm-Perry and Snelling.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>Keel, Lindholm-Perry and Snelling</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Genomic structural variations are an important source of genetic diversity. Copy number variations (CNVs), gains and losses of large regions of genomic sequence between individuals of a species, have been associated with a wide variety of phenotypic traits. However, in cattle, as well as many other species, relatively little is understood about CNV, including frequency of CNVs in the genome, sizes, and locations, chromosomal properties, and evolutionary processes acting to shape CNV. In this work, we focused on copy number variation in the bovine genome, with the aim to detect CNVs in <italic>Bos taurus</italic> coding sequence and explore potential evolutionary mechanisms shaping these CNV. We identified and characterized CNV regions by utilizing exome sequence from 175 influential sires used in the Germplasm Evaluation project, representing 10 breeds. We examined various evolutionary and functional aspects of these CNVs, including selective constraint on CNV-overlapped genes, centrality of CNV genes in protein-protein interaction networks, and tissue-specific expression of CNV genes. Patterns of CNV in the <italic>Bos taurus</italic> genome reveal that reduced functional constraint and mutational bias may play a prominent role in shaping this type of structural variation.</p></abstract>
<kwd-group>
<kwd>copy number variation</kwd>
<kwd>cattle genome</kwd>
<kwd>next-generation sequencing</kwd>
<kwd>deletion</kwd>
<kwd>duplication</kwd>
<kwd>gene expression</kwd>
<kwd>network centrality</kwd>
<kwd>selective constraint</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="5"/>
<equation-count count="0"/>
<ref-count count="74"/>
<page-count count="11"/>
<word-count count="8901"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1"><title>Introduction</title>
<p>Copy number variations (CNVs) are gains and losses of large regions of genomic sequence between individuals of a species (Mills et al., <xref ref-type="bibr" rid="B45">2011</xref>). CNVs have been well-studied and linked to various phenotypic traits and diseases in humans and rodents (Cook and Scherer, <xref ref-type="bibr" rid="B12">2008</xref>; Almal and Padh, <xref ref-type="bibr" rid="B2">2012</xref>; Girirajan et al., <xref ref-type="bibr" rid="B21">2013</xref>). Initial CNV studies have been performed in a number of domesticated animals: dog (Nicholas et al., <xref ref-type="bibr" rid="B47">2011</xref>; Alvarez and Akey, <xref ref-type="bibr" rid="B3">2012</xref>; Berglund et al., <xref ref-type="bibr" rid="B7">2012</xref>), sheep (Fontanesi et al., <xref ref-type="bibr" rid="B18">2011</xref>; Liu et al., <xref ref-type="bibr" rid="B41">2013</xref>), pig (Fadista et al., <xref ref-type="bibr" rid="B16">2008</xref>; Ramayo-Caldas et al., <xref ref-type="bibr" rid="B55">2010</xref>; Chen et al., <xref ref-type="bibr" rid="B10">2012</xref>; Paudel et al., <xref ref-type="bibr" rid="B50">2013</xref>, <xref ref-type="bibr" rid="B51">2015</xref>), chicken (Crooijmans et al., <xref ref-type="bibr" rid="B13">2013</xref>; Yi et al., <xref ref-type="bibr" rid="B71">2014</xref>), and goat (Fontanesi et al., <xref ref-type="bibr" rid="B19">2010</xref>). These studies have linked many phenotypic traits to CNV, including chicken pea-comb phenotype (Wright et al., <xref ref-type="bibr" rid="B68">2009</xref>) and white coat color in pigs and sheep (Johansson Moller et al., <xref ref-type="bibr" rid="B30">1996</xref>; Norris and Whan, <xref ref-type="bibr" rid="B48">2008</xref>).</p>
<p>Several studies have investigated CNV in the bovine genome. Cattle CNVs have been reported using a variety of platforms, including comparative genomic hybridization arrays (Liu et al., <xref ref-type="bibr" rid="B40">2008</xref>, <xref ref-type="bibr" rid="B39">2010</xref>; Fadista et al., <xref ref-type="bibr" rid="B17">2010</xref>), the Illumina BovineHD BeadChip (Hou et al., <xref ref-type="bibr" rid="B26">2012a</xref>; Wu et al., <xref ref-type="bibr" rid="B69">2015</xref>; Aguilar et al., <xref ref-type="bibr" rid="B1">2016</xref>; Prinsen et al., <xref ref-type="bibr" rid="B54">2016</xref>; Xu et al., <xref ref-type="bibr" rid="B70">2016</xref>), the Illumina BovineSNP50 BeadChip (Matukumalli et al., <xref ref-type="bibr" rid="B43">2009</xref>; Bae et al., <xref ref-type="bibr" rid="B4">2010</xref>; Hou et al., <xref ref-type="bibr" rid="B27">2011</xref>, <xref ref-type="bibr" rid="B28">2012b</xref>; Jiang et al., <xref ref-type="bibr" rid="B29">2012</xref>; Bagnato et al., <xref ref-type="bibr" rid="B5">2015</xref>; Ben Sassi et al., <xref ref-type="bibr" rid="B6">2016</xref>), and next-generation sequencing (NGS) (Stothard et al., <xref ref-type="bibr" rid="B62">2011</xref>; Zhan et al., <xref ref-type="bibr" rid="B73">2011</xref>; Bickhart et al., <xref ref-type="bibr" rid="B8">2012</xref>; Choi et al., <xref ref-type="bibr" rid="B11">2013</xref>; Keel et al., <xref ref-type="bibr" rid="B31">2016</xref>; Ben Sassi et al., <xref ref-type="bibr" rid="B6">2016</xref>). In these studies, it is reported that copy number variable regions comprise &#x0007E;2&#x02013;7% of the cattle genome.</p>
<p>In cattle, as well as many other species, relatively little is known about the properties and dynamics of CNVs. Open questions remain about the frequency of CNVs in the genome, sizes, and locations, and chromosomal properties. In addition, the extent to which CNV affect phenotype is not well understood. In humans, it has been observed that two unrelated, healthy individuals can differ from one another in gene copy number across their genomes (Sabat et al., <xref ref-type="bibr" rid="B57">2004</xref>), which raises uncertainty about the existence of a characteristic number of copies of any one gene. Of all of the topics related to CNVs, our knowledge of the functional and evolutionary impact of CNVs is the most limited.</p>
<p>Whole genome sequence (WGS) is often used in CNV discovery. However, until sequencing costs drop dramatically, it is simply not feasible to generate the high coverage (&#x0003E; 10x) whole genome sequence, suggested for CNV detection, on large numbers of animals. Due to its cost-effectiveness, WES is routinely used for the detection of coding sequence variation (Guo et al., <xref ref-type="bibr" rid="B23">2013</xref>). In humans, the exome comprises approximately 1&#x02013;3% of the genome, but accounts for over 85% of all mutations identified in Mendelian disorders (Ng et al., <xref ref-type="bibr" rid="B46">2010</xref>), making it a desirable and practical approach for investigating variations in coding sequence.</p>
<p>In this study, we investigated some evolutionary and functional aspects of coding sequence copy number variation in the bovine genome. We first characterized CNV regions detected in whole exome sequence from 175 influential sires used in the USMARC Germplasm Evaluation project and identified genes overlapping with CNVRs. We then examined selective constraint on CNV genes to test the hypothesis that genes affected by CNV are subject to accelerated sequence evolution compared to copy number neutral genes. In addition, we utilized gene expression data and protein-protein interaction networks to investigate network centrality and tissue-specific expression patterns of CNV genes.</p>
</sec>
<sec sec-type="materials and methods" id="s2"><title>Materials and methods</title>
<p>The DNA samples sequenced for this study were extracted from semen collected by commercial AI services and from blood archived under standard operating procedures for the U.S. Meat Animal Research Center tissue repository. The research did not involve experimentation on animals requiring IACUC approval.</p>
<sec><title>Sequencing and data acquisition of GPE sires</title>
<p>CNV were detected from whole exome sequence of 175 bulls used in Cycle VII of the USMARC germplasm evaluation (GPE) project. This included 122 purebred AI sires representing 10 different breeds, and 53 F<sub>1</sub> natural service sires representing 10 different crosses of 7 breeds (Table <xref ref-type="table" rid="T1">1</xref>). Bulls were selected for sequencing according to their influence on the GPE project (see Snelling et al., <xref ref-type="bibr" rid="B60">2015</xref> for full details). Exome sequence is available for download from the National Center for Biotechnology Information Sequence Read Archive (SRA) with Accession Number <ext-link ext-link-type="NCBI:sra" xlink:href="SRP076471">SRP076471</ext-link>.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Breeds of sequenced bulls from the USMARC germplasm evaluation (GPE) population used in this study</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Breed</bold></th>
<th valign="top" align="center"><bold>Number of bulls</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Hereford</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">Angus</td>
<td valign="top" align="center">19</td>
</tr>
<tr>
<td valign="top" align="left">Simmental</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td valign="top" align="left">Limousin</td>
<td valign="top" align="center">15</td>
</tr>
<tr>
<td valign="top" align="left">Charolais</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">Gelbvieh (German Yellow)</td>
<td valign="top" align="center">17</td>
</tr>
<tr>
<td valign="top" align="left">Red Angus</td>
<td valign="top" align="center">15</td>
</tr>
<tr>
<td valign="top" align="left">Shorthorn</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">Braunvieh (Brown Swiss)</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Brahman</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left">Charolais &#x000D7; Angus</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left">Gelbvieh &#x000D7; Hereford</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">Simmental &#x000D7; Hereford</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">Simmental &#x000D7; Angus</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left">Hereford &#x000D7; Angus</td>
<td valign="top" align="center">13</td>
</tr>
<tr>
<td valign="top" align="left">Limousin &#x000D7; Hereford</td>
<td valign="top" align="center">5</td>
</tr>
<tr>
<td valign="top" align="left">Gelbvieh &#x000D7; Angus</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">Red Angus &#x000D7; Hereford</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">Charolais &#x000D7; Hereford</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">Limousin &#x000D7; Angus</td>
<td valign="top" align="center">4</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Exome sequencing was previously described by Snelling et al. (<xref ref-type="bibr" rid="B60">2015</xref>). Briefly, genomic DNA was extracted from semen and blood using standard DNA extraction protocols (phenol-choloroform extraction for semen and QIAamp DNA Mini Kit for blood), and sheared to an average size of 300 bp. Indexing adapters were added to allow identification of individual DNA samples from pools of 8 samples. The Agilent SureSelect Target Enrichment System Kit I and Kit II (Agilent Technologies, Inc., Santa Clara, CA) were used to generate a DNA library for each sample. Equal quantities of each indexed DNA library were pooled into groups of 8 for exome capture using the Agilent SureSelect XT Bovine capture reagent (Agilent Technologies, Inc., Santa Clara, CA). Exome capture libraries were then sequenced with the Illumina MiSeq technology (MiSeq Reagent Kit V2 and V3 chemistry; Illumina, San Diego, CA) to obtain a mean 20x coverage of targeted intervals.</p>
<p>Processing of the FASTQ files was done using the best practices established for the Genome Analysis Toolkit (GATK, Van der Auwera et al., <xref ref-type="bibr" rid="B67">2013</xref>). Reads were removed if overall quality score was less than 20, if they contained more than three uncalled bases, or if they failed the Illumina chastity filter. TrimmomaticPE (Bolger et al., <xref ref-type="bibr" rid="B9">2014</xref>) was used to trim Illumina adaptor sequences and low quality bases from the reads. The bowtie2 (Langmead and Salzberg, <xref ref-type="bibr" rid="B36">2012</xref>) program was then used to map the reads to the UMD 3.1 genome assembly (Zimin et al., <xref ref-type="bibr" rid="B74">2009</xref>).</p>
</sec>
<sec><title>CNV detection and defining CNVRs</title>
<p>The cn.MOPS algorithm (Klambauer et al., <xref ref-type="bibr" rid="B32">2012</xref>) was used to identify putative CNVs in the exome sequence of the 175 bulls. cn.MOPs is a multiple sample read depth CNV detection method that applies a Bayesian approach to decompose read variations across multiple samples into integer copy numbers and noise by its mixture components and Poisson distributions, respectively. cn.MOPS avoids read count biases along the chromosomes by modeling the depth of coverage across all samples at each genomic position. The exome version of the cn.MOPS program was run using the default parameters.</p>
<p>CNVs were then used to construct a set of copy number variable regions (CNVRs). A CNVR was constructed by merging CNVs across samples that exhibited at least 50% pairwise reciprocal overlap in their genomic coordinates. For example, suppose we have two CNVs, CNV1 beginning at position <italic>a</italic> and ending at position <italic>b</italic> and CNV2 running from <italic>c</italic> to <italic>d</italic> with <italic>a</italic> &#x0003C; <italic>c</italic> &#x0003C; <italic>b</italic> &#x0003C; <italic>d</italic>. If the reciprocal overlap between the two CNVs is at least 50% then they are merged into a CNVR which runs from <italic>a</italic> to <italic>d</italic> on the genome.</p>
</sec>
<sec><title>Gene content and gene ontology</title>
<p>We identified genes from the Ensembl (Version 80; Cunningham et al., <xref ref-type="bibr" rid="B14">2015</xref>) annotation of UMD 3.1 overlapping (both completely and partially) with detected CNVRs. Functions of protein-coding CNV genes were determined using the PANTHER classification system (Version 10.0, Mi et al., <xref ref-type="bibr" rid="B44">2013</xref>).</p>
<p>Enrichment analysis of gene function was performed using PANTHER&#x00027;s implementation of the binomial test of overrepresentation. Significance of gene ontology (GO) terms was assessed using the default Ensembl <italic>Bos taurus</italic> GO annotation as the reference set for the enrichment analysis, and data was considered statistically significant at a Bonferroni corrected <italic>P</italic> &#x0003C; 0.05.</p>
</sec>
<sec><title>Analysis of selective constraint in CNV genes</title>
<p>Pairs of orthologous genes between <italic>Bos taurus</italic> and <italic>Homo sapiens</italic> were identified using Biomart (Guberman et al., <xref ref-type="bibr" rid="B22">2011</xref>). dN/dS ratios were then computed in MATLAB (<xref ref-type="bibr" rid="B42">2015</xref>) using the suggested protocol. Briefly, for each ortholog pair the nucleotide sequences were translated to amino acid sequences, which were then aligned using the BLOSUM50 scoring matrix. The gaps from the aligned amino acid sequences were then copied to their corresponding nucleotide sequences, producing a codon-aligned pair of nucleotide sequences. Lastly, the synonymous (dS) and nonsynonymous (dN) substitution rates of the codon-aligned sequences were computed using the <italic>dnds</italic> function. Pairs of input sequences that were too divergent, i.e., pairs exhibiting saturation of substitutions, were removed from further analysis because a sensible dN/dS ratio could not be computed. <italic>P</italic>-values from a one-tailed Wilcoxon rank-sum test were used to test the hypothesis that dN/dS ratios of cattle genes overlapped by CNV were significantly shifted toward higher values than those of non-overlapped genes, i.e., that selection pressure is relaxed for CNV genes.</p>
</sec>
<sec><title>Tissue specificity analysis</title>
<p>Tissue specificity of genes overlapped by CNVRs was assessed using two types of expression data, microarray and RNA sequencing, encompassing 22 different tissues (Table <xref ref-type="table" rid="T2">2</xref>). Raw data sets for experiments <ext-link ext-link-type="NCBI:geo" xlink:href="GSE41637">GSE41637</ext-link>, <ext-link ext-link-type="NCBI:geo" xlink:href="GSE55435">GSE55435</ext-link>, <ext-link ext-link-type="NCBI:geo" xlink:href="GSE71153">GSE71153</ext-link>, <ext-link ext-link-type="NCBI:geo" xlink:href="GSE73699">GSE73699</ext-link>, <ext-link ext-link-type="NCBI:geo" xlink:href="GSE73261">GSE73261</ext-link>, and <ext-link ext-link-type="NCBI:geo" xlink:href="GSE73159">GSE73159</ext-link> were downloaded from NCBI&#x00027;s Gene Expression Omnibus (<ext-link ext-link-type="uri" xlink:href="http://www.ncbi.nlm.nih.gov/geo">www.ncbi.nlm.nih.gov/geo</ext-link>), and the raw data for experiment <ext-link ext-link-type="EBI:ena" xlink:href="ERP005899">ERP005899</ext-link> was downloaded from EMBL-EBI&#x00027;s European Nucleotide Archive (<ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/ena">http://www.ebi.ac.uk/ena</ext-link>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Gene expression data sets</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Study</bold></th>
<th valign="top" align="left"><bold>Tissue</bold></th>
<th valign="top" align="left"><bold>Data type</bold></th>
<th valign="top" align="center"><bold>Number of samples</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">GSE73699</td>
<td valign="top" align="left">Mesenteric fat</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="center">15</td>
</tr>
<tr>
<td valign="top" align="left">GSE73261</td>
<td valign="top" align="left">Spleen<sup>&#x0002A;</sup></td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td valign="top" align="left">GSE73159</td>
<td valign="top" align="left">Duodenum</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Jejunum</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Ileum</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td valign="top" align="left">GSE41637</td>
<td valign="top" align="left">Brain</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Colon</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Heart</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Kidney<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Liver<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Lung<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Skeletal muscle</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Spleen<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Testes</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left">GSE55435</td>
<td valign="top" align="left">Hypothalamus<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Pituitary gland</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Uterus</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Endometrium</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Ovary</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Subcataneous fat</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Liver<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Longissimus dorsi muscle</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left">GSE71153</td>
<td valign="top" align="left">Rumen</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">16</td>
</tr>
<tr>
<td valign="top" align="left">ERP005899</td>
<td valign="top" align="left">Adipose</td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7&#x0007E;14 pooled</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Duodenum<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7&#x0007E;14 pooled</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Hypothalamus<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7&#x0007E;14 pooled</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Kidney<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7&#x0007E;14 pooled</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Lung<sup>&#x0002A;</sup></td>
<td valign="top" align="left">RNAseq</td>
<td valign="top" align="center">7&#x0007E;14 pooled</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Tissues marked with <sup>&#x0002A;</sup>were present in multiple studies.</italic></p>
</table-wrap-foot>
</table-wrap>
<p>The microarray data (experiments GSE73699, GSE73261, and GSE73159) was processed as follows. Individual CEL files were processed using the UPC function from the SCAN.UPC package in R (Piccolo et al., <xref ref-type="bibr" rid="B52">2012</xref>, <xref ref-type="bibr" rid="B53">2013</xref>). UPC is a quantitative approach for normalizing gene expression data that produces standardized expression values that estimate whether a gene is &#x0201C;active&#x0201D; in a given sample. The program outputs for each gene in a given sample a universal expression code (UPC), a number between 0 and 1 where larger values suggest a greater likelihood that the gene is expressed in the sample. The UPC function was run using the default parameters, and for each tissue a gene was considered to be expressed in the tissue if it had a UPC &#x0003E; 0.5 in at least one sample.</p>
<p>The RNA sequencing data (experiments GSE41637, GSE55435, GSE71153, and ERP005899) was processed as follows. Raw sequence reads in individual fastq files were first mapped to the UMD 3.1 genome assembly using Tophat (Version 2.0.1; Trapnell et al., <xref ref-type="bibr" rid="B64">2009</xref>). The Cufflinks software (Version 2.2; Roberts et al., <xref ref-type="bibr" rid="B56">2011</xref>) was then used to compute the fragments per kilobase of transcript per million mapped reads (FPKM) for paired-end reads and the analogous reads per kilobase of transcript per million mapped reads (RPKM) for single-end reads. Both software packages were run using the default parameters, and for each tissue a gene was considered expressed in the tissue if it had FPKM or RPKM &#x0003E; 1.0 in at least one sample. Note that some tissues, including duodenum, hypothalamus, kidney, liver, lung, and spleen, were included in two of the experiments. For these tissues, a gene was considered expressed if it passed the expression criterion in at least one of the two experiments. Genes belonging to both the set of expressed genes and our CNV gene set were classified as expressed CNV genes, while genes that were expressed but not overlapped by CNVs were classified as expressed neutral genes. The <italic>P</italic>-values from a one-tailed Wilcoxon rank-sum test were used to test the hypothesis that expressed CNV genes in cattle are expressed in fewer tissues than expressed neutral genes.</p>
</sec>
<sec><title>Analysis of network centrality</title>
<p>Centrality of CNV overlapped genes in protein-protein interaction (PPI) networks was assessed using the <italic>Bos taurus</italic> interaction dataset from the STRING database (Franceschini et al., <xref ref-type="bibr" rid="B20">2013</xref>). This dataset consisted of 3,904,694 interactions for 19,032 unique genes. Network centrality was measured by computing the degree of the representative node in the PPI network for each gene. <italic>P</italic>-values from a one-tailed Wilcoxon rank-sum test were used to test the hypothesis that node degrees of cattle genes overlapped by CNV were significantly shifted toward lower values than those of non-overlapped genes, i.e., that CNV genes are less central in PPI networks.</p>
</sec>
</sec>
<sec id="s3"><title>Results and discussion</title>
<sec><title>CNVR discovery and statistics in the GPE bulls</title>
<p>Putative CNVs across the population of 175 bulls were identified using the exome cn.MOPS software package (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1B</xref>). We chose to use the cn.MOPS package since it has been shown to have a lower false-positive rate than other exome CNV detection methods (Guo et al., <xref ref-type="bibr" rid="B23">2013</xref>). CNVs were then merged across samples into CNVRs. In this work, we aimed to study common coding sequence CNVs across the <italic>Bos taurus</italic> genome. In an attempt to filter out possible false-positive and rare CNVs, CNVRs were filtered out if they were not present in at least 3 samples (&#x0003E;2% of the population). Note that the 2% threshold was chosen arbitrarily. A total of 74 CNVRs were filtered out in this step. The final set of CNVRs consisted of 57 CNVRs (48 on the autosomes and 9 on the X chromosome).</p>
<p>Sizes of the CNVRs ranged from 0.0018 to 1.56 Mb, with an average of 0.1419 Mb and a median of 0.0567 Mb. The CNVRs occupied a total of 5.27 unique Mb or 0.19% of the UMD 3.1 <italic>Bos taurus</italic> genome. Among the CNVRs, 30 showed copy number loss, 16 showed copy number gain, and 11 showed a mix of copy number loss and gain from different individuals. A full list of the CNVRs can be found in Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1A</xref>.</p>
<p>The distribution of CNVRs along each of the chromosomes is shown in Figure <xref ref-type="fig" rid="F1">1</xref>. Many CNVRs were present in a small number of bulls (24 of 57 were present in at most 5 bulls). One CNVR [CNVR 4 in Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1A</xref>] was present in 36% of the bulls. We observed some variation in the number of CNVRs between breeds. The greatest numbers of CNVRs were seen in Hereford (70), Angus (82), Simmental (72), and Red Angus (70), while the smallest numbers were seen in Braunveih (4) and Charolais &#x000D7; Angus (7). None of the CNVRs were breed-specific.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>CNVRs in GPE bulls</bold>. Plot shows the CNVRs identified from the 175 sequenced GPE bull genomes in Circos format (Krzywinski et al., <xref ref-type="bibr" rid="B35">2009</xref>). The outer ideogram runs clockwise from chromosome 1 to chromosome X with labels in Mb of physical distance. The copy number data is represented in the inner tracks. The two innermost tracks show scatter plots of the CNVRs, where the red track shows copy number loss and the green track shows copy number gain. The size of the dot in the scatter plot is proportional to the number of samples containing the CNVR. The other track shows a heat map which indicates the parts of the genome that contain copy number gain and loss. This plot simply collapses the scatter plot values onto a single radial position.</p></caption>
<graphic xlink:href="fgene-07-00207-g0001.tif"/>
</fig>
</sec>
<sec><title>Comparison of CNVRs with previous studies</title>
<p>Comparison of our results with autosomal CNVRs identified in several previous cattle studies showed varying levels of overlapping CNVRs between studies (Supplementary Table <xref ref-type="supplementary-material" rid="SM2">2</xref>). In this analysis we used a much less stringent definition of overlapped CNVRs than in the rest of this work, where two CNVRs were considered overlapped as long as they shared at least one base. In order to compare some of the data sets to our results, we first had to map coordinates from the Btau 4.0 genome assembly to the UMD 3.1 assembly. This was done using the UCSC <italic>liftover</italic> tool (<ext-link ext-link-type="uri" xlink:href="https://genome.ucsc.edu/util.html">https://genome.ucsc.edu/util.html</ext-link>).</p>
<p>Array CGH with approximately 385,000 probes was used by Liu et al. (<xref ref-type="bibr" rid="B39">2010</xref>) to identify 200 CNVRs from 90 samples representing 11 different breeds, while Fadista et al. (<xref ref-type="bibr" rid="B17">2010</xref>) utilized the same technology with approximately 6.3 million probes to detect 254 CNVRs in 20 individuals from 4 breeds. The percentage of CNVRs from our results overlapping with these data sets was 18.8 and 70.8%, respectively.</p>
<p>A large variation in the number of detected CNVRs was seen in the SNP array-based studies. The number of CNVRs identified using the Illumina BovineSNP50 BeadChip ranged from 101 to 811. The two studies utilizing the Illumina BovineHD BeadChip had an even greater discrepancy in number of CNVRs, with 3438 CNVRs reported by Hou et al. (<xref ref-type="bibr" rid="B26">2012a</xref>) and only 247 CNVRs reported by Wu et al. (<xref ref-type="bibr" rid="B69">2015</xref>). The overlap of our results with these studies ranged from 0% in the BovineSNP50 chip studies of Hou et al. (<xref ref-type="bibr" rid="B28">2012b</xref>) and Jiang et al. (<xref ref-type="bibr" rid="B29">2012</xref>) to 79.2% in the BovineHD chip study of Hou et al. (<xref ref-type="bibr" rid="B26">2012a</xref>).</p>
<p>Comparing our results to other cattle CNVR sets generated from NGS we saw lower percentages of overlap. The study of Bickhart et al. (<xref ref-type="bibr" rid="B8">2012</xref>) identified 1265 CNVRs in the Btau 4.0 genome assembly. Their data consisted of WGS from 5 individuals representing 3 breeds, along with simulated NGS reads from the sequenced Hereford cow, L1 Dominette 01449. Only 2 of the CNVRs in our set overlapped with their data. Another NGS-based study, investigated copy number variation between one Holstein and one Black Angus bull (Stothard et al., <xref ref-type="bibr" rid="B62">2011</xref>). A total of 790 CNVRs were identified in this study, and only 4 CNVRs from our set were found to be overlapping. In the NGS study of Zhan et al. (<xref ref-type="bibr" rid="B73">2011</xref>), 520 CNVRs were identified on the genome of one Holstein-Friesian bull when comparing the sequence reads against a Fleckvieh bull. A total of 7 of our CNVRs overlapped with this set. In a previous CNV study, we detected CNVRs from low coverage WGS of 154 pure bred bulls from 7 breeds used in the GPE project (Keel et al., <xref ref-type="bibr" rid="B31">2016</xref>). The exome sequence of 117 of these bulls was used in the current study. Thirty one of our 57 CNVRs (64.6%) were overlapped by CNVRs from our previous study.</p>
<p>Generally speaking, percentages of overlap in CNV events identified between our study and previous studies were low, with an average of 30.9% of our CNVRs being overlapped by CNVRs in a previous study. This is similar to what we see when we compare previous studies (&#x0003C;40% overlap). These discrepancies are likely driven by many technical aspects, including vastly different sample sizes, differences in breeds and the number of breeds represented, detection platform (array-based vs. NGS), and CNV detection algorithms. The current study is one of the largest sequence-based cattle CNV studies to date, utilizing a larger sample size (175 samples) than previous NGS CNV studies, as well as samples from multiple breeds (10 breeds). It should be noted that the studies that had the highest percentage of overlap with the current study were those that had the largest numbers of breeds represented. This suggests that the inclusion of more breeds into CNV analyses may be crucial in identifying common CNVs across the <italic>Bos taurus</italic> genome and constructing a more comprehensive CNV map.</p>
</sec>
<sec><title>Function of CNV genes</title>
<p>A total of 110 Ensembl genes from the UMD 3.1 assembly were identified to be CNV genes, overlapping (either completely or partially) with our detected CNVRs (Supplementary Table <xref ref-type="supplementary-material" rid="SM4">4</xref>). These genes included 96 protein-coding genes, 7 snRNA, 6 pseudogenes, and 1 rRNA. Using PANTHER&#x00027;s functional annotation tool to inspect GO slim terms mapping to protein-coding CNV genes, we identified that many of these genes were involved in binding (35%), catalytic activity (23%), receptor activity (39%), signal transducer activity (36%), biological regulation (38%), cellular process (35%), and response to stimulus (48%).</p>
<p>Enrichment analysis was performed, using both the full <italic>Bos taurus</italic> GO database and the GO slim database, to identify GO terms that were significantly over- and underrepresented in our gene set. GO slim terms are a subset of the terms in the entire GO that give a broad overview of the ontology content. GO slim enrichment analysis showed that the terms extracellular transport, response to toxic substance, response to stimulus, response to interferon-gamma, amino acid transport, sensory perception of smell, G-protein coupled receptor signaling pathway, regulation of biological process, MHC protein complex, heterotrimeric G-protein complex, and plasma membrane were significantly overrepresented in the protein-coding genes overlapped by CNVRs (Bonferroni-corrected <italic>P</italic> &#x0003C; 0.05; Table <xref ref-type="table" rid="T3">3</xref>). Results from the full GO database analysis are shown in Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Significantly over- and underrepresented GO slim terms in the set of CNV genes</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Ontology Term</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Gene Set (n genes)</bold></th>
<th/>
<th/>
<th/>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Annotated genes<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref> (19879)</bold></th>
<th valign="top" align="center"><bold>CNV genes<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref> (89)</bold></th>
<th valign="top" align="center"><bold>CNV genes expected</bold></th>
<th valign="top" align="center"><bold>Over (&#x0002B;) or Under (&#x02212;)</bold></th>
<th valign="top" align="center"><bold><italic>P</italic>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="6" style="background-color:#bbbdc0"><bold>BIOLOGICAL PROCESS</bold></td>
</tr>
<tr>
<td valign="top" align="left">Extracellular transport</td>
<td valign="top" align="center">53</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0.24</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">4.16E-05</td>
</tr>
<tr>
<td valign="top" align="left">Response to toxic substance</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.20</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">5.08E-04</td>
</tr>
<tr>
<td valign="top" align="left">Response to stimulus</td>
<td valign="top" align="center">2880</td>
<td valign="top" align="center">43</td>
<td valign="top" align="center">12.89</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">9.08E-12</td>
</tr>
<tr>
<td valign="top" align="left">Response to interferon-gamma</td>
<td valign="top" align="center">58</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">3.50E-02</td>
</tr>
<tr>
<td valign="top" align="left">Amino acid transport</td>
<td valign="top" align="center">81</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">8.45E-03</td>
</tr>
<tr>
<td valign="top" align="left">Macrophage activation</td>
<td valign="top" align="center">131</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0.59</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">7.18E-03</td>
</tr>
<tr>
<td valign="top" align="left">Sensory perception of smell</td>
<td valign="top" align="center">667</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">2.99</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">9.14E-20</td>
</tr>
<tr>
<td valign="top" align="left">Sensory perception of chemical stimulus</td>
<td valign="top" align="center">875</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">3.92</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.23E-18</td>
</tr>
<tr>
<td valign="top" align="left">Sensory perception</td>
<td valign="top" align="center">1108</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">4.96</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.19E-15</td>
</tr>
<tr>
<td valign="top" align="left">Neurological system process</td>
<td valign="top" align="center">1593</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">7.13</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.18E-14</td>
</tr>
<tr>
<td valign="top" align="left">System process</td>
<td valign="top" align="center">1809</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">8.10</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">6.23E-13</td>
</tr>
<tr>
<td valign="top" align="left">Single-multicellular organism process</td>
<td valign="top" align="center">2189</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">9.80</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">2.01E-10</td>
</tr>
<tr>
<td valign="top" align="left">Multicellular organismal process</td>
<td valign="top" align="center">2199</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">9.85</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">2.30E-10</td>
</tr>
<tr>
<td valign="top" align="left">G-protein coupled receptor signaling pathway</td>
<td valign="top" align="center">789</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">3.53</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.21E-02</td>
</tr>
<tr>
<td valign="top" align="left">Regulation of biological process</td>
<td valign="top" align="center">2260</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">10.12</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.36E-08</td>
</tr>
<tr>
<td valign="top" align="left">Biological regulation</td>
<td valign="top" align="center">2636</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">11.80</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">8.17E-07</td>
</tr>
<tr>
<td valign="top" align="left">Metabolic process</td>
<td valign="top" align="center">6613</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">29.61</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">3.88E-02</td>
</tr>
<tr>
<td valign="top" align="left" colspan="6" style="background-color:#bbbdc0"><bold>MOLECULAR FUNCTION</bold></td>
</tr>
<tr>
<td valign="top" align="center" colspan="6">n/a</td>
</tr>
<tr>
<td valign="top" align="left" colspan="6" style="background-color:#bbbdc0"><bold>CELLULAR COMPONENT</bold></td>
</tr>
<tr>
<td valign="top" align="left">MHC protein complex</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.09</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">5.59E-03</td>
</tr>
<tr>
<td valign="top" align="left">Heterotrimeric G-protein complex</td>
<td valign="top" align="center">38</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.72E-03</td>
</tr>
<tr>
<td valign="top" align="left">Integral to membrane</td>
<td valign="top" align="center">1478</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">6.62</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">3.11E-17</td>
</tr>
<tr>
<td valign="top" align="left">Membrane</td>
<td valign="top" align="center">2433</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">10.89</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">2.19E-10</td>
</tr>
<tr>
<td valign="top" align="left">Plasma membrane</td>
<td valign="top" align="center">1458</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">6.53</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.01E-06</td>
</tr>
<tr>
<td valign="top" align="left">Cell part</td>
<td valign="top" align="center">4063</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">18.19</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">1.97E-02</td>
</tr>
<tr>
<td valign="top" align="left">Intracellular</td>
<td valign="top" align="center">3993</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">17.88</td>
<td valign="top" align="center">&#x02212;</td>
<td valign="top" align="center">2.58E-02</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1">
<label>a</label>
<p><italic>Number of genes in the background Bos taurus GO slim annotation set with given GO term. Total number of annotated genes is shown in parentheses.</italic></p></fn>
<fn id="TN2">
<label>b</label>
<p><italic>Number of CNV genes with given GO term. Total number of CNV genes with annotations in the background Bos taurus GO slim annotation set is shown in parentheses.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>In addition, CNV genes were separated into three categories, duplication genes (genes overlapped by gain CNVs), deletion genes (genes overlapped by deletion CNVs), and mixed genes (genes overlapped by mixed CNVs) (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>), and enrichment analysis was performed separately for each group. GO slim terms antigen processing and presentation of peptide or polysaccharide antigen via MHC class II, antigen processing and presentation, immune system process, and MHC protein complex were significantly overrepresented in the set of 25 genes overlapped by gain CNVs. For the 38 genes overlapped by deletion CNVs the terms response to toxic substance, response to stimulus, extracellular transport, sensory perception of smell, neurological system process, and regulation of biological process were significantly overrepresented. Genes overlapped by mixed CNVs had overrepresentation of GO terms response to interferon gamma, response to stimulus, response to toxic substance, sensory perception of smell, neurological system process, and regulation of biological process.</p>
<p>Several of the biological process categories identified for our cattle CNV have also been identified in other species. For example, MHC class II genes, olfactory receptors (OR), and amino acid transporters have been identified within CNV regions in humans (Schmidt et al., <xref ref-type="bibr" rid="B58">2003</xref>; Traherne, <xref ref-type="bibr" rid="B63">2008</xref>; Young et al., <xref ref-type="bibr" rid="B72">2008</xref>). Human MHC class II and class III genes lie within CNVR in humans, and some of these have been linked to phenotypic variation like congenital hyperplasia, systemic lupus erythematosus disease risk, and host control of HIV-1 (Traherne, <xref ref-type="bibr" rid="B63">2008</xref>). Olfactory receptors are G-protein coupled receptors involved in signal transduction. Young et al. (<xref ref-type="bibr" rid="B72">2008</xref>) showed that 18 OR and OR psuedogenes displayed varying copy numbers among 50 people. This variation may play a role in olfactory ability and sensitivity. Olfactory receptors may also play a chemosensory role as they are expressed on sperm and thought to direct them to the egg via chemotaxis (Spehr et al., <xref ref-type="bibr" rid="B61">2006</xref>). Across several subspecies of the <italic>Sus</italic> genus, OR genes were also over-represented among CNVR (Paudel et al., <xref ref-type="bibr" rid="B51">2015</xref>). These genes may have been important components of swine evolution, as scent would have been critical for foraging for food, avoiding predators, and finding a mate.</p>
</sec>
<sec><title>Selective constraint on CNV genes</title>
<p>A central question in biology is how genomes evolve with respect to size and gene content and which factors affect and constrain this evolution. Intuitively, CNVs are likely to be subjected to selective pressure since large variants, in contrast with SNPs and other small variants, often affect entire protein-coding genes and substantial amounts of flanking DNA sequence.</p>
<p>It has long been hypothesized that gene duplications are drivers of both genome and gene function evolution. As described by Ohno (<xref ref-type="bibr" rid="B49">2013</xref>), when a gene duplication event first occurs, the two copies of the gene are assumed to be functionally redundant. It is believed that in most instances one copy of the gene will eventually be lost (pseudogenization or nonfunctionalization). However, as natural selection does not &#x0201C;know&#x0201D; which copy of the duplicated gene should be under selection and which should be free of selective constraint, both paralogs experience a period of relaxed selection. During this stage, it is possible that some divergence may be allowed and occasionally one copy may acquire a new function and subsequently be maintained by natural selection.</p>
<p>Rates of molecular evolution can be used to understand the selection constraints experienced by genes. In particular, contrasting the rate of protein-changing (non-synonymous) substitution and the rate of silent (synonymous) substitution at the nucleotide level allows us to identify the type of selection acting on individual genes. We measured selective constraint on cattle genes by using the dN/dS ratio. Here, dS denotes the synonymous substitution rate, and dN denotes the nonsynonymous substitution rate. When computed using sequences from divergent species, the dN/dS ratio is a measure of adaptive evolution in protein-coding sequences (Kryazhimskiy and Plotkin, <xref ref-type="bibr" rid="B34">2008</xref>). For this reason we chose to use <italic>Homo sapiens</italic> as the comparison species since it is a well-studied organism, divergent from cattle.</p>
<p>Generally dN/dS ratios are interpreted as follows. dN/dS &#x0003D; 1 implies equal numbers of synonymous and nonsynonymous substitutions. This means that most variation is not caused by natural selection, but by random drift of mutant alleles that are neutral. dN/dS &#x0003E; 1 implies more nonsynonymous changes than synonymous. This means that there has been evolutionary pressure to escape the ancestral state, i.e., positive selection. Similarly, dN/dS &#x0003C; 1 implies a larger number of synonymous changes compared to nonsynonymous, meaning that there has been evolutionary pressure to conserve the ancestral state, i.e., negative selection.</p>
<p>dN/dS ratios were computed for orthologous pairs of genes (both CNV and neutral genes) between cattle and human (Supplementary Table <xref ref-type="supplementary-material" rid="SM5">5</xref>). We first tested the hypothesis that, in general, compared to copy number neutral genes, CNV genes tend to be under relaxed selective pressure. This was done using a one-tailed Wilcoxon rank sum test, to test whether the median dN/dS ratio of all CNV genes was significantly higher that the median dN/dS ratio of neutral genes. We found that dN/dS ratios of CNV genes were significantly shifted toward higher values than neutral genes (Table <xref ref-type="table" rid="T4">4</xref>), suggesting that CNV genes are subject to reduced selective constraint. This finding is consistent with previous results in both cattle and pigs (Fadista et al., <xref ref-type="bibr" rid="B17">2010</xref>; Li et al., <xref ref-type="bibr" rid="B38">2012</xref>).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p><bold>dN/dS analysis</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>dN</bold></th>
<th valign="top" align="center"><bold><italic>P</italic>-value</bold></th>
<th valign="top" align="center"><bold>dS</bold></th>
<th valign="top" align="center"><bold><italic>P</italic>-value</bold></th>
<th valign="top" align="center"><bold>dN/dS</bold></th>
<th valign="top" align="center"><bold><italic>P</italic>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">All CNV genes</td>
<td valign="top" align="center">0.1418</td>
<td valign="top" align="center">2.29E-09</td>
<td valign="top" align="center">0.5589</td>
<td valign="top" align="center">1.51E-07</td>
<td valign="top" align="center">0.2813</td>
<td valign="top" align="center">2.81E-06</td>
</tr>
<tr>
<td valign="top" align="left">Duplication genes</td>
<td valign="top" align="center">0.1601</td>
<td valign="top" align="center">1.45E-05</td>
<td valign="top" align="center">0.5135</td>
<td valign="top" align="center">0.0072</td>
<td valign="top" align="center">0.3151</td>
<td valign="top" align="center">3.01E-05</td>
</tr>
<tr>
<td valign="top" align="left">Deletion genes</td>
<td valign="top" align="center">0.1308</td>
<td valign="top" align="center">0.0142</td>
<td valign="top" align="center">0.5814</td>
<td valign="top" align="center">0.0083</td>
<td valign="top" align="center">0.2308</td>
<td valign="top" align="center">0.1068</td>
</tr>
<tr>
<td valign="top" align="left">Mixed genes</td>
<td valign="top" align="center">0.1235</td>
<td valign="top" align="center">1.36E-04</td>
<td valign="top" align="center">0.5681</td>
<td valign="top" align="center">4.79E-05</td>
<td valign="top" align="center">0.2702</td>
<td valign="top" align="center">0.0068</td>
</tr>
<tr>
<td valign="top" align="left">Neutral genes</td>
<td valign="top" align="center">0.0793</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">0.4288</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">0.1843</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Median nonsynonymous (dN), synonymous (dS), and dN/dS rates are shown. P-values compare copy number variable genes with copy number neutral genes using a one-tailed Wilcoxon rank-sum test.</italic></p>
</table-wrap-foot>
</table-wrap>
<p>We also tested, individually, if duplication genes, deletion genes, and mixed genes tended to be under relaxed selective constraint compared to neutral genes. Both duplication and mixed genes were shown to have significantly higher dN/dS ratios than neutral genes, while dN/dS ratios of deletion genes did not differ significantly from those of neutral genes. The reduction in selective constraint observed in duplication and mixed genes follows Ohno&#x00027;s hypothesis that in a gene duplication event, one or both duplicates should experience relaxed selective constraint resulting in elevated rates of sequence evolution.</p>
</sec>
<sec><title>Tissue specificity of CNV genes</title>
<p>Previous studies in fly (Dopman and Hartl, <xref ref-type="bibr" rid="B15">2007</xref>) and mouse (Henrichsen et al., <xref ref-type="bibr" rid="B25">2009</xref>) have shown that CNV genes tend to be more specific in their tissue expression patterns. We investigated this phenomenon in cattle using gene expression data from 22 different tissues (Table <xref ref-type="table" rid="T2">2</xref>). Expressed CNV genes were expressed in fewer tissues (median &#x0003D; 2) than expressed neutral genes (median &#x0003D; 10) (one-tailed Wilcoxon rank-sum test, <italic>P</italic> &#x0003C; 0.00001). This is consistent with results from a similar study in fly (Dopman and Hartl, <xref ref-type="bibr" rid="B15">2007</xref>), suggesting that CNVs occur more often in genes with tissue-specific expression than widely expressed genes that may have housekeeping functions.</p>
<p>A total of 6 CNV genes were identified to be tissue-specific in their expression (Table <xref ref-type="table" rid="T5">5</xref>). Most of these genes (67%) were found in the testes. The most abundant gene family represented in this set, including 2 of the 4 genes, was the neuroblastoma breakpoint family (<italic>NBPF</italic>). Genes belonging to this family are involved in transporting RNA between the cell nucleus and the cytoplasm. <italic>NBPF</italic> genes have been shown to be copy number variable in humans and other primates (Vandepoele et al., <xref ref-type="bibr" rid="B66">2005</xref>). This gene family has been shown to be expressed in the testes of humans (Vandepoele and van Roy, <xref ref-type="bibr" rid="B65">2007</xref>) and is hypothesized to play a role in male reproduction (Vandepoele et al., <xref ref-type="bibr" rid="B66">2005</xref>). The testis is a tissue that has a high level of interaction with the environment. Environmental factors, such as interference with testicular cooling and endocrine disruptors, are known to influence the development and function of the testes (Sharpe and Franks, <xref ref-type="bibr" rid="B59">2002</xref>). Our finding tissue-specific CNV genes in the testes is perhaps not coincidental. It has been argued in previous studies that copy number variation is the result of positive selection for a diverse set of proteins that can meet the challenges of a constantly changing environment (Kondrashov and Kondrashov, <xref ref-type="bibr" rid="B33">2006</xref>).</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p><bold>Number of tissue-specific genes with copy number variation</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Tissue</bold></th>
<th valign="top" align="center"><bold>Number of tissue-specific genes</bold></th>
<th valign="top" align="center"><bold>Number of tissue-specific CNV genes</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Testes</td>
<td valign="top" align="center">531</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left">Brain</td>
<td valign="top" align="center">318</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Spleen</td>
<td valign="top" align="center">81</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Duodenum</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Colon</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Liver</td>
<td valign="top" align="center">69</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Lung</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Kidney</td>
<td valign="top" align="center">75</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Ovary</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Endometrium</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Uterus</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Rumen</td>
<td valign="top" align="center">117</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Mesenteric fat</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Adipose</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Hypothalamus</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Heart</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Skeletal muscle</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Pituitary gland</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Subcutaneous fat</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Longissimus dorsi muscle</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Jejunum</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Ileum</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">0</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>It should be noted that the tissues used in this analysis were downloaded from the NCBI database and did not originate from the same samples in which our CNV were detected. This is a major limitation in our tissue specificity analysis. As mentioned before, concordance between individual cattle CNV studies tends to be quite low. Cattle CNVs have also been shown to be lineage-differentiated (Xu et al., <xref ref-type="bibr" rid="B70">2016</xref>). Therefore, it is quite possible that CNVs in the samples used for RNA sequencing could be quite different from those identified in this study. Hence, the tissue-specific expression patterns of CNV genes warrants further investigation using a dataset that includes whole-genome sequence as well RNA sequence from multiple tissues in the same set of samples.</p>
</sec>
<sec><title>Network centrality of CNV genes</title>
<p>Protein centrality in PPI networks has been correlated with evolutionary rate and essentiality of genes in several species (Hahn and Kern, <xref ref-type="bibr" rid="B24">2005</xref>). Proteins that are more central in PPI networks tend to evolve more slowly and be more essential. As shown above, CNV genes show a tendency to evolve more rapidly and are under reduced selective constraint. Therefore, it follows that the products of genes overlapped by CNV may be less central in PPI networks.</p>
<p>We tested this hypothesis using PPI data from the STRING database. We found that, in general, the number of interactors (i.e., the network node degree) for all CNV genes with &#x02265; 1 interaction was not significantly lower compared to neutral genes with &#x02265; 1 interaction (one-tailed Wilcoxon rank-sum test, <italic>P</italic> &#x0003D; 0.9137). Taking a closer look, we found that duplication genes did have significantly smaller numbers of interactors compared to neutral genes (one-tailed Wilcoxon rank-sum test, <italic>P</italic> &#x0003D; 0.0208), while deletion genes and mixed genes did not exhibit significantly lower numbers of interactors (<italic>P</italic> &#x0003D; 0.99 and <italic>P</italic> &#x0003D; 0.62, respectively). This finding is consistent with results in fly (Dopman and Hartl, <xref ref-type="bibr" rid="B15">2007</xref>) and yeast (Li et al., <xref ref-type="bibr" rid="B37">2006</xref>) in which products of duplicated genes were shown to have reduced network connectivity.</p>
<p>It is possible that a gene&#x00027;s copy number status may reveal information about its essentiality in PPI networks. The results above suggest that genes with lower network centrality may be more likely to have duplicates that are retained during evolution. We have shown that duplication genes are subject to reduced selective constraint, and as a result, they tend to undergo more rapid sequence evolution. Genes with high centrality in PPI networks may be more evolutionarily constrained since changes in protein coding could hinder the ability of the resulting protein to form interactions with other proteins in the network. Therefore, as hypothesized by Dopman and Hartl (<xref ref-type="bibr" rid="B15">2007</xref>), the set of genes with low numbers of interactions in PPI networks, populated by duplication genes in cattle, fly, and yeast, may experience reduced pleiotropy, and consequently be robust to structural mutations as well as less constrained during evolution.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s4"><title>Conclusion</title>
<p>In recent years, copy number variation has gained considerable interest as a source of genetic variation that likely plays a role in phenotypic diversity. Much of the effort in studying copy number variation has been allocated to identification and validation of CNVs in several different organisms. Genome wide association studies have even linked changes in copy number to complex diseases. However, the evolutionary and functional impact of copy number variation is not well understood.</p>
<p>Cattle CNV research has made significant progress in the last 5 years. Genome-wide CNV maps have been generated using a variety of platforms and detection algorithms. However, the overlap between results from these studies is quite low. As mentioned earlier, these discrepancies may be due to differences in breeds, sample size, platform, and detection algorithm. In attempt to capture a larger portion of coding sequence copy number variation in the bovine genome, we chose to use a larger sample size (175 samples) than previous NGS CNV studies, as well as samples from multiple breeds (10 breeds). Additional copy number variation may be detected by including broader sampling from each from each breed and will likely be more effective in capturing breed-specific differences in CNV.</p>
<p>The evolutionary and functional patterns identified in this work for <italic>Bos taurus</italic> and in other studies for other species support a partial adaptive explanation for copy number diversity. We have shown that the dominant evolutionary forces that shape CNV are likely reduced functional (selective) constraint and mutational bias. Genomics research has traditionally concentrated on single-nucleotide polymorphisms as the most relevant source of structural variation in the genome. However, it is becoming progressively clear that CNVs may have considerable functional and evolutionary consequences. Understanding the role that CNVs play in reshaping gene structure, modulating gene expression, and ultimately contributing to phenotypic variation represent major future goals for the population genetics of structural variation.</p>
</sec>
<sec id="s5"><title>Author contributions</title>
<p>BK conceived of the study, and BK, AL-P, and WS participated in its design and coordination. WS mapped the exome sequence data, and BK performed all subsequent data analysis. BK drafted the manuscript, and all authors read and approved the final manuscript.</p>
</sec>
<sec>
<title>Disclosure</title>
<p>Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. The U.S. Department of Agriculture (USDA) prohibits discrimination in all its programs and activities on the basis of race, color, national origin, age, disability, and where applicable, sex, marital status, familial status, parental status, religion, sexual orientation, genetic information, political beliefs, reprisal, or because all or part of an individual&#x00027;s income is derived from any public assistance program. (Not all prohibited bases apply to all programs.) Persons with disabilities who require alternative means for communication of program information (Braille, large print, audiotape, etc.) should contact USDA&#x00027;s TARGET Center at (202) 720&#x02013;2600 (voice and TDD). To file a complaint of discrimination, write to USDA, Director, Office of Civil Rights, 1400 Independence Avenue, S. W., Washington, D. C. 20250-9410, or call (800) 795&#x02013;3272 (voice) or (202) 720&#x02013;6382 (TDD). USDA is an equal opportunity provider and employer.</p>
<sec><title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<sec sec-type="supplementary-material" id="s6"><title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fgene.2016.00207/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fgene.2016.00207/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table2.DOCX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table3.DOCX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table4.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table5.XLSX" id="SM5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.DOCX" id="SM6" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aguilar</surname> <given-names>M. D.</given-names></name> <name><surname>Rom&#x000E1;n Ponce</surname> <given-names>S. I.</given-names></name> <name><surname>Ruiz L&#x000F3;pez</surname> <given-names>F. J.</given-names></name> <name><surname>Gonz&#x000E1;lez Padilla</surname> <given-names>E.</given-names></name> <name><surname>V&#x000E1;squez Pel&#x000E1;ez</surname> <given-names>C. G.</given-names></name> <name><surname>Bagnato</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Genome-wide association study for milk somatic cell score in Holstein cattle using copy number variation as markers</article-title>. <source>J. Anim. Breed. Genet</source>. <pub-id pub-id-type="doi">10.1111/jbg.12238</pub-id> [Epub ahead of print]. <pub-id pub-id-type="pmid">27578198</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Almal</surname> <given-names>S. H.</given-names></name> <name><surname>Padh</surname> <given-names>H.</given-names></name></person-group> (<year>2012</year>). <article-title>Implications of gene copy-number variation in health and diseases</article-title>. <source>J. Hum. Genet.</source> <volume>57</volume>, <fpage>6</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1038/jhg.2011.108</pub-id><pub-id pub-id-type="pmid">21956041</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alvarez</surname> <given-names>C. E.</given-names></name> <name><surname>Akey</surname> <given-names>J. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Copy number variation in the domestic dog</article-title>. <source>Mamm. Genome</source> <volume>23</volume>, <fpage>144</fpage>&#x02013;<lpage>163</lpage>. <pub-id pub-id-type="doi">10.1007/s00335-011-9369-8</pub-id><pub-id pub-id-type="pmid">22138850</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bae</surname> <given-names>J. S.</given-names></name> <name><surname>Cheong</surname> <given-names>H. S.</given-names></name> <name><surname>Kim</surname> <given-names>L. H.</given-names></name> <name><surname>NamGung</surname> <given-names>S.</given-names></name> <name><surname>Park</surname> <given-names>T. J.</given-names></name> <name><surname>Chun</surname> <given-names>J.-Y.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Identification of copy number variations and common deletion polymorphisms in cattle</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>232</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-232</pub-id><pub-id pub-id-type="pmid">20377913</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bagnato</surname> <given-names>A.</given-names></name> <name><surname>Strillacci</surname> <given-names>M. G.</given-names></name> <name><surname>Pellegrino</surname> <given-names>L.</given-names></name> <name><surname>Schiavini</surname> <given-names>F.</given-names></name> <name><surname>Frigo</surname> <given-names>E.</given-names></name> <name><surname>Rossoni</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Identification and validation of copy number variants in Italian Brown Swiss dairy cattle using Illumina Bovine SNP50 Beadchip&#x000AE;</article-title>. <source>Ital. J. Anim. Sci.</source> <volume>14</volume>:<fpage>3900</fpage>. <pub-id pub-id-type="doi">10.4081/ijas.2015.3900</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben Sassi</surname> <given-names>N.</given-names></name> <name><surname>Gonz&#x000E1;lez-Reci&#x000F3;</surname> <given-names>&#x000D3;.</given-names></name> <name><surname>de Paz-del R&#x000ED;o</surname> <given-names>R.</given-names></name> <name><surname>Rodr&#x000ED;guez-Ramilo</surname> <given-names>S. T.</given-names></name> <name><surname>Fern&#x000E1;ndez</surname> <given-names>A. I.</given-names></name></person-group> (<year>2016</year>). <article-title>Associated effects of copy number variants on economically important traits in Spanish Holstein dairy cattle</article-title>. <source>J. Dairy Sci.</source> <volume>99</volume>, <fpage>6371</fpage>&#x02013;<lpage>6380</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2015-10487</pub-id><pub-id pub-id-type="pmid">27209136</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berglund</surname> <given-names>J.</given-names></name> <name><surname>Nevalainen</surname> <given-names>E. M.</given-names></name> <name><surname>Molin</surname> <given-names>A. M.</given-names></name> <name><surname>Perloski</surname> <given-names>M.</given-names></name> <name><surname>The LUPA Consortium</surname> <given-names>Andr&#x000E9;, C.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Novel origins of copy number variation in the dog genome</article-title>. <source>Genome Biol.</source> <volume>13</volume>:<fpage>R73</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2012-13-8-r73</pub-id><pub-id pub-id-type="pmid">22916802</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bickhart</surname> <given-names>D. M.</given-names></name> <name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Schroeder</surname> <given-names>S. G.</given-names></name> <name><surname>Alkan</surname> <given-names>C.</given-names></name> <name><surname>Cardone</surname> <given-names>M. F.</given-names></name> <name><surname>Matukumalli</surname> <given-names>L. K.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Copy number variation of individual cattle genomes using next-generation sequencing</article-title>. <source>Genome Res.</source> <volume>22</volume>, <fpage>778</fpage>&#x02013;<lpage>790</lpage>. <pub-id pub-id-type="doi">10.1101/gr.133967.111</pub-id><pub-id pub-id-type="pmid">22300768</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolger</surname> <given-names>A. M.</given-names></name> <name><surname>Lohse</surname> <given-names>M.</given-names></name> <name><surname>Usadel</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illumina sequence data</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2114</fpage>&#x02013;<lpage>2120</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id><pub-id pub-id-type="pmid">24695404</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>C.</given-names></name> <name><surname>Qiao</surname> <given-names>R.</given-names></name> <name><surname>Wei</surname> <given-names>R.</given-names></name> <name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Ai</surname> <given-names>H.</given-names></name> <name><surname>Ma</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>A comprehensive survey of copy number variation in 18 diverse pig populations and identification of candidate copy number variable genes associated with complex traits</article-title>. <source>BMC Genomics</source> <volume>13</volume>:<fpage>733</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-13-733</pub-id><pub-id pub-id-type="pmid">23270433</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname> <given-names>J.-W.</given-names></name> <name><surname>Lee</surname> <given-names>K.-T.</given-names></name> <name><surname>Liao</surname> <given-names>X.</given-names></name> <name><surname>Stothard</surname> <given-names>P.</given-names></name> <name><surname>An</surname> <given-names>H.-S.</given-names></name> <name><surname>Ahn</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Genome-wide copy number variation in Hanwoo, Black Angus, and Holstein cattle</article-title>. <source>Mamm. Genome</source> <volume>24</volume>, <fpage>151</fpage>&#x02013;<lpage>163</lpage>. <pub-id pub-id-type="doi">10.1007/s00335-013-9449-z</pub-id><pub-id pub-id-type="pmid">23543395</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cook</surname> <given-names>E. H.</given-names> <suffix>Jr.</suffix></name> <name><surname>Scherer</surname> <given-names>S. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Copy-number variations associated with neuropsychiatric conditions</article-title>. <source>Nature</source> <volume>455</volume>, <fpage>919</fpage>&#x02013;<lpage>923</lpage>. <pub-id pub-id-type="doi">10.1038/nature07458</pub-id><pub-id pub-id-type="pmid">18923514</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crooijmans</surname> <given-names>R. P.</given-names></name> <name><surname>Fife</surname> <given-names>M. S.</given-names></name> <name><surname>Fitzgerald</surname> <given-names>T. W.</given-names></name> <name><surname>Strickland</surname> <given-names>S.</given-names></name> <name><surname>Cheng</surname> <given-names>H. H.</given-names></name> <name><surname>Kaiser</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Large scale variation in DNA copy number in chicken breeds</article-title>. <source>BMC Genomics</source> <volume>14</volume>:<fpage>398</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-398</pub-id><pub-id pub-id-type="pmid">23763846</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cunningham</surname> <given-names>F.</given-names></name> <name><surname>Amode</surname> <given-names>M. R.</given-names></name> <name><surname>Barrell</surname> <given-names>D.</given-names></name> <name><surname>Beal</surname> <given-names>K.</given-names></name> <name><surname>Billis</surname> <given-names>K.</given-names></name> <name><surname>Brent</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Ensembl 2015</article-title>. <source>Nucleic Acids Res.</source> <volume>43</volume>, <fpage>D662</fpage>&#x02013;<lpage>D669</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku1010</pub-id><pub-id pub-id-type="pmid">25352552</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dopman</surname> <given-names>E. B.</given-names></name> <name><surname>Hartl</surname> <given-names>D. L.</given-names></name></person-group> (<year>2007</year>). <article-title>A portrait of copy-number polymorphism in <italic>Drosophila melanogaster</italic></article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>104</volume>, <fpage>19920</fpage>&#x02013;<lpage>19925</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0709888104</pub-id><pub-id pub-id-type="pmid">18056801</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fadista</surname> <given-names>J.</given-names></name> <name><surname>Nygaard</surname> <given-names>M.</given-names></name> <name><surname>Holm</surname> <given-names>L.-E.</given-names></name> <name><surname>Thomsen</surname> <given-names>B.</given-names></name> <name><surname>Bendixen</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>A snapshot of CNVs in the pig genome</article-title>. <source>PLoS ONE</source> <volume>3</volume>:<fpage>e3916</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0003916</pub-id><pub-id pub-id-type="pmid">19079605</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fadista</surname> <given-names>J.</given-names></name> <name><surname>Thomsen</surname> <given-names>B.</given-names></name> <name><surname>Holm</surname> <given-names>L.-E.</given-names></name> <name><surname>Bendixen</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>Copy number variation in the bovine genome</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>284</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-284</pub-id><pub-id pub-id-type="pmid">20459598</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fontanesi</surname> <given-names>L.</given-names></name> <name><surname>Beretti</surname> <given-names>F.</given-names></name> <name><surname>Martelli</surname> <given-names>P. L.</given-names></name> <name><surname>Colombo</surname> <given-names>M.</given-names></name> <name><surname>Dall&#x00027;Olio</surname> <given-names>S.</given-names></name> <name><surname>Occidente</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>A first comparative map of copy number variations in the sheep genome</article-title>. <source>Genomics</source> <volume>97</volume>, <fpage>158</fpage>&#x02013;<lpage>165</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2010.11.005</pub-id><pub-id pub-id-type="pmid">21111040</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fontanesi</surname> <given-names>L.</given-names></name> <name><surname>Martelli</surname> <given-names>P. L.</given-names></name> <name><surname>Beretti</surname> <given-names>F.</given-names></name> <name><surname>Riggio</surname> <given-names>V.</given-names></name> <name><surname>Dall&#x00027;Olio</surname> <given-names>S.</given-names></name> <name><surname>Colombo</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>An initial comparative map of copy number variations in the goat (<italic>Capra hircus</italic>) genome</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>639</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-639</pub-id><pub-id pub-id-type="pmid">21083884</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franceschini</surname> <given-names>A.</given-names></name> <name><surname>Szklarczyk</surname> <given-names>D.</given-names></name> <name><surname>Frankild</surname> <given-names>S.</given-names></name> <name><surname>Kuhn</surname> <given-names>M.</given-names></name> <name><surname>Simonovic</surname> <given-names>M.</given-names></name> <name><surname>Roth</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>STRING v9.1: protein-protein interaction networks, with increased coverage and integration</article-title>. <source>Nucleic Acids Res.</source> <volume>41</volume>, <fpage>D808</fpage>&#x02013;<lpage>D815</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks1094</pub-id><pub-id pub-id-type="pmid">23203871</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Girirajan</surname> <given-names>S.</given-names></name> <name><surname>Dennis</surname> <given-names>M. Y.</given-names></name> <name><surname>Baker</surname> <given-names>C.</given-names></name> <name><surname>Malig</surname> <given-names>M.</given-names></name> <name><surname>Coe</surname> <given-names>B. P.</given-names></name> <name><surname>Campbell</surname> <given-names>C. D.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Refinement and discovery of new hotspots of copy-number variation associated with autism spectrum disorder</article-title>. <source>Am. J. Hum. Genet.</source> <volume>92</volume>, <fpage>221</fpage>&#x02013;<lpage>237</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2012.12.016</pub-id><pub-id pub-id-type="pmid">23375656</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guberman</surname> <given-names>J. M.</given-names></name> <name><surname>Ai</surname> <given-names>J.</given-names></name> <name><surname>Arnaiz</surname> <given-names>O.</given-names></name> <name><surname>Baran</surname> <given-names>J.</given-names></name> <name><surname>Blake</surname> <given-names>A.</given-names></name> <name><surname>Baldock</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>BioMart Central Portal: an open database network for the biological community</article-title>. <source>Database</source> <volume>2011</volume>:<fpage>bar041</fpage>. <pub-id pub-id-type="doi">10.1093/database/bar041</pub-id><pub-id pub-id-type="pmid">21930507</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>Y.</given-names></name> <name><surname>Sheng</surname> <given-names>Q.</given-names></name> <name><surname>Samuels</surname> <given-names>D. C.</given-names></name> <name><surname>Lehmann</surname> <given-names>B.</given-names></name> <name><surname>Bauer</surname> <given-names>J. A.</given-names></name> <name><surname>Pietenpol</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Comparative study of exome copy number variation estimation tools using array comparative genomic hybridization as control</article-title>. <source>Bio Med Res. Int.</source> <volume>2013</volume>:<fpage>915636</fpage>. <pub-id pub-id-type="doi">10.1155/2013/915636</pub-id><pub-id pub-id-type="pmid">24303503</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hahn</surname> <given-names>M. W.</given-names></name> <name><surname>Kern</surname> <given-names>A. D.</given-names></name></person-group> (<year>2005</year>). <article-title>Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks</article-title>. <source>Mol. Biol. Evol.</source> <volume>22</volume>, <fpage>803</fpage>&#x02013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msi072</pub-id><pub-id pub-id-type="pmid">15616139</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Henrichsen</surname> <given-names>C. N.</given-names></name> <name><surname>Vinckenbosch</surname> <given-names>N.</given-names></name> <name><surname>Z&#x000F6;llner</surname> <given-names>S.</given-names></name> <name><surname>Chaignat</surname> <given-names>E.</given-names></name> <name><surname>Pradervand</surname> <given-names>S.</given-names></name> <name><surname>Sch&#x000FC;tz</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Segmental copy number variation shapes tissue transcriptomes</article-title>. <source>Nat. Genet.</source> <volume>41</volume>, <fpage>424</fpage>&#x02013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.1038/ng.345</pub-id><pub-id pub-id-type="pmid">19270705</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Bickhart</surname> <given-names>D. M.</given-names></name> <name><surname>Hvinden</surname> <given-names>M. L.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Song</surname> <given-names>J.</given-names></name> <name><surname>Boichard</surname> <given-names>D. A.</given-names></name> <etal/></person-group>. (<year>2012a</year>). <article-title>Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array</article-title>. <source>BMC Genomics</source> <volume>13</volume>:<fpage>376</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-13-376</pub-id><pub-id pub-id-type="pmid">22866901</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>G. E.</given-names></name> <name><surname>Bickhart</surname> <given-names>D. M.</given-names></name> <name><surname>Cardone</surname> <given-names>M. F.</given-names></name> <name><surname>Wang</surname> <given-names>K.</given-names></name> <name><surname>Kim</surname> <given-names>E.-S.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Genomic characteristics of cattle copy number variations</article-title>. <source>BMC Genomics</source> <volume>12</volume>:<fpage>127</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-12-127</pub-id><pub-id pub-id-type="pmid">21345189</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>G. E.</given-names></name> <name><surname>Bickhart</surname> <given-names>D. M.</given-names></name> <name><surname>Matukumalli</surname> <given-names>L. K.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Song</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2012b</year>). <article-title>Genomic regions showing copy number variations associate with resistance or susceptibility to gastrointestinal nematodes in <italic>Angus cattle</italic></article-title>. <source>Funct. Integr. Genomics</source> <volume>12</volume>, <fpage>81</fpage>&#x02013;<lpage>92</lpage>. <pub-id pub-id-type="doi">10.1007/s10142-011-0252-1</pub-id><pub-id pub-id-type="pmid">21928070</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>L.</given-names></name> <name><surname>Jiang</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Ding</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Q.</given-names></name></person-group> (<year>2012</year>). <article-title>Genome-wide identification of copy number variations in Chinese Holstein</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e48732</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0048732</pub-id><pub-id pub-id-type="pmid">23144949</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johansson Moller</surname> <given-names>M.</given-names></name> <name><surname>Chaudhary</surname> <given-names>R.</given-names></name> <name><surname>Hellm&#x000E9;n</surname> <given-names>E.</given-names></name> <name><surname>H&#x000F6;yheim</surname> <given-names>B.</given-names></name> <name><surname>Chowdhary</surname> <given-names>B.</given-names></name> <name><surname>Andersson</surname> <given-names>L.</given-names></name></person-group> (<year>1996</year>). <article-title>Pigs with the dominant white coat color phenotype carry a duplication of the <italic>KIT</italic> gene encoding the mast/stem cell growth factor receptor</article-title>. <source>Mamm. Genome</source> <volume>7</volume>, <fpage>822</fpage>&#x02013;<lpage>830</lpage>. <pub-id pub-id-type="doi">10.1007/s003359900244</pub-id><pub-id pub-id-type="pmid">8875890</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keel</surname> <given-names>B. N.</given-names></name> <name><surname>Keele</surname> <given-names>J. W.</given-names></name> <name><surname>Snelling</surname> <given-names>W. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Genome-wide copy number variation in the bovine genome detected using low coverage sequence of popular beef breeds</article-title>. <source>Anim. Genet.</source>. <pub-id pub-id-type="doi">10.1111/age.12519</pub-id> [Epub ahead of print]. <pub-id pub-id-type="pmid">27775157</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klambauer</surname> <given-names>G.</given-names></name> <name><surname>Schwarzbauer</surname> <given-names>K.</given-names></name> <name><surname>Mayr</surname> <given-names>A.</given-names></name> <name><surname>Clevert</surname> <given-names>D.-A.</given-names></name> <name><surname>Mitterecker</surname> <given-names>A.</given-names></name> <name><surname>Bodenhofer</surname> <given-names>U.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <fpage>e69</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gks003</pub-id><pub-id pub-id-type="pmid">22302147</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kondrashov</surname> <given-names>F. A.</given-names></name> <name><surname>Kondrashov</surname> <given-names>A. S.</given-names></name></person-group> (<year>2006</year>). <article-title>Role of selection in fixation of gene duplications</article-title>. <source>J. Theor. Biol.</source> <volume>239</volume>, <fpage>141</fpage>&#x02013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1016/j.jtbi.2005.08.033</pub-id><pub-id pub-id-type="pmid">16242725</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kryazhimskiy</surname> <given-names>S.</given-names></name> <name><surname>Plotkin</surname> <given-names>J. B.</given-names></name></person-group> (<year>2008</year>). <article-title>The population genetics of dN/dS</article-title>. <source>PLoS Genet.</source> <volume>4</volume>:<fpage>e1000304</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1000304</pub-id><pub-id pub-id-type="pmid">19081788</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krzywinski</surname> <given-names>M.</given-names></name> <name><surname>Schein</surname> <given-names>J.</given-names></name> <name><surname>Birol</surname> <given-names>I.</given-names></name> <name><surname>Connors</surname> <given-names>J.</given-names></name> <name><surname>Gascoyne</surname> <given-names>R.</given-names></name> <name><surname>Horsman</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Circos: an information aesthetic for comparative genomics</article-title>. <source>Genome Res.</source> <volume>19</volume>, <fpage>1639</fpage>&#x02013;<lpage>1645</lpage>. <pub-id pub-id-type="doi">10.1101/gr.092759.109</pub-id><pub-id pub-id-type="pmid">19541911</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname> <given-names>B.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with Bowtie 2</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>357</fpage>&#x02013;<lpage>359</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id><pub-id pub-id-type="pmid">22388286</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Huang</surname> <given-names>Y.</given-names></name> <name><surname>Xia</surname> <given-names>X.</given-names></name> <name><surname>Sun</surname> <given-names>Z.</given-names></name></person-group> (<year>2006</year>). <article-title>Preferential duplication in the sparse part of yeast protein interaction network</article-title>. <source>Mol. Biol. Evol.</source> <volume>23</volume>, <fpage>2467</fpage>&#x02013;<lpage>2473</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msl121</pub-id><pub-id pub-id-type="pmid">16980576</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Mei</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Peng</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Tao</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Identification of genome-wide copy number variations among diverse pig breeds by array CGH</article-title>. <source>BMC Genomics</source> <volume>13</volume>:<fpage>725</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-13-725</pub-id><pub-id pub-id-type="pmid">23265576</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>G. E.</given-names></name> <name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Zhu</surname> <given-names>B.</given-names></name> <name><surname>Cardone</surname> <given-names>M. F.</given-names></name> <name><surname>Jiang</surname> <given-names>L.</given-names></name> <name><surname>Cellamare</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Analysis of copy number variations among diverse cattle breeds</article-title>. <source>Genome Res.</source> <volume>20</volume>, <fpage>693</fpage>&#x02013;<lpage>703</lpage>. <pub-id pub-id-type="doi">10.1101/gr.105403.110</pub-id><pub-id pub-id-type="pmid">20212021</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>G. E.</given-names></name> <name><surname>Van Tassel</surname> <given-names>C. P.</given-names></name> <name><surname>Sonstegard</surname> <given-names>T. S.</given-names></name> <name><surname>Li</surname> <given-names>R. W.</given-names></name> <name><surname>Alexander</surname> <given-names>L. J.</given-names></name> <name><surname>Keele</surname> <given-names>J. W.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Detection of germline and somatic copy number variations in cattle</article-title>. <source>Dev. Biol. (Basel)</source> <volume>132</volume>, <fpage>231</fpage>&#x02013;<lpage>237</lpage>. <pub-id pub-id-type="doi">10.1159/000317165</pub-id><pub-id pub-id-type="pmid">18817307</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Ren</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Analysis of copy number variations in the sheep genome using 50K SNP BeadChip array</article-title>. <source>BMC Genomics</source> <volume>14</volume>:<fpage>229</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-229</pub-id><pub-id pub-id-type="pmid">23565757</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><collab>MATLAB</collab></person-group> (<year>2015</year>). <source>Bioinformatics Toolbox Release.</source> <publisher-loc>Natick, MA</publisher-loc>: <publisher-name>The MathWorks, Inc</publisher-name>.</citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matukumalli</surname> <given-names>L. K.</given-names></name> <name><surname>Lawley</surname> <given-names>C. T.</given-names></name> <name><surname>Schnabel</surname> <given-names>R. D.</given-names></name> <name><surname>Taylor</surname> <given-names>J. F.</given-names></name> <name><surname>Allan</surname> <given-names>M. F.</given-names></name> <name><surname>Heaton</surname> <given-names>M. P.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Development and characterization of a high density SNP genotyping assay for cattle</article-title>. <source>PLoS ONE</source> <volume>4</volume>:<fpage>e5350</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0005350</pub-id><pub-id pub-id-type="pmid">19390634</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mi</surname> <given-names>H.</given-names></name> <name><surname>Muruganujan</surname> <given-names>A.</given-names></name> <name><surname>Casagrande</surname> <given-names>J. T.</given-names></name> <name><surname>Thomas</surname> <given-names>P. D.</given-names></name></person-group> (<year>2013</year>). <article-title>Large-scale gene function analysis with the PANTHER classification system</article-title>. <source>Nat. Protoc.</source> <volume>8</volume>, <fpage>1551</fpage>&#x02013;<lpage>1566</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2013.092</pub-id><pub-id pub-id-type="pmid">23868073</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mills</surname> <given-names>R. E.</given-names></name> <name><surname>Walter</surname> <given-names>K.</given-names></name> <name><surname>Stewart</surname> <given-names>C.</given-names></name> <name><surname>Handsaker</surname> <given-names>R. E.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Alkan</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Mapping copy number variation by population-scale genome sequencing</article-title>. <source>Nature</source> <volume>470</volume>, <fpage>59</fpage>&#x02013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1038/nature09708</pub-id><pub-id pub-id-type="pmid">21293372</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>S. B.</given-names></name> <name><surname>Buckingham</surname> <given-names>K. J.</given-names></name> <name><surname>Lee</surname> <given-names>C.</given-names></name> <name><surname>Bigham</surname> <given-names>A. W.</given-names></name> <name><surname>Tabor</surname> <given-names>H. K.</given-names></name> <name><surname>Dent</surname> <given-names>K. M.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Exome sequencing identifies the cause of a mendelian disorder</article-title>. <source>Nat. Genet.</source> <volume>42</volume>, <fpage>30</fpage>&#x02013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.1038/ng.499</pub-id><pub-id pub-id-type="pmid">19915526</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicholas</surname> <given-names>T. J.</given-names></name> <name><surname>Baker</surname> <given-names>C.</given-names></name> <name><surname>Eichler</surname> <given-names>E. E.</given-names></name> <name><surname>Akey</surname> <given-names>J. M.</given-names></name></person-group> (<year>2011</year>). <article-title>A high-resolution integrated map of copy number polymorphisms within and between breeds of the modern domesticated dog</article-title>. <source>BMC Genomics</source> <volume>12</volume>:<fpage>414</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-12-414</pub-id><pub-id pub-id-type="pmid">21846351</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Norris</surname> <given-names>B. J.</given-names></name> <name><surname>Whan</surname> <given-names>V. A.</given-names></name></person-group> (<year>2008</year>). <article-title>A gene duplication affecting expression of the ovine <italic>ASIP</italic> gene is responsible for white and black sheep</article-title>. <source>Genome Res.</source> <volume>18</volume>, <fpage>1282</fpage>&#x02013;<lpage>1293</lpage>. <pub-id pub-id-type="doi">10.1101/gr.072090.107</pub-id><pub-id pub-id-type="pmid">18493018</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ohno</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <source>Evolution by Gene Duplication</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer Science &#x00026; Business Media</publisher-name>.</citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paudel</surname> <given-names>Y.</given-names></name> <name><surname>Madsen</surname> <given-names>O.</given-names></name> <name><surname>Megens</surname> <given-names>H.-J.</given-names></name> <name><surname>Frantz</surname> <given-names>L. A.</given-names></name> <name><surname>Bosse</surname> <given-names>M.</given-names></name> <name><surname>Bastiaansen</surname> <given-names>J. W.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Evolutionary dynamics of copy number variation in pig genomes in the context of adaptation and domestication</article-title>. <source>BMC Genomics</source> <volume>14</volume>:<fpage>449</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-449</pub-id><pub-id pub-id-type="pmid">23829399</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paudel</surname> <given-names>Y.</given-names></name> <name><surname>Madsen</surname> <given-names>O.</given-names></name> <name><surname>Megens</surname> <given-names>H.-J.</given-names></name> <name><surname>Frantz</surname> <given-names>L. A. F.</given-names></name> <name><surname>Bosse</surname> <given-names>M.</given-names></name> <name><surname>Crooijmans</surname> <given-names>R. P. M. A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Copy number variation in the speciation of pigs: a possible prominent role for olfactory receptors</article-title>. <source>BMC Genomics</source> <volume>16</volume>:<fpage>330</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-015-1449-9</pub-id><pub-id pub-id-type="pmid">25896665</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piccolo</surname> <given-names>S. R.</given-names></name> <name><surname>Sun</surname> <given-names>Y.</given-names></name> <name><surname>Campbell</surname> <given-names>J. D.</given-names></name> <name><surname>Lenburg</surname> <given-names>M. E.</given-names></name> <name><surname>Bild</surname> <given-names>A. H.</given-names></name> <name><surname>Johnson</surname> <given-names>W. E.</given-names></name></person-group> (<year>2012</year>). <article-title>A single-sample microarray normalization method to facilitate personalized-medicine workflows</article-title>. <source>Genomics</source> <volume>100</volume>, <fpage>337</fpage>&#x02013;<lpage>344</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2012.08.003</pub-id><pub-id pub-id-type="pmid">22959562</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piccolo</surname> <given-names>S. R.</given-names></name> <name><surname>Withers</surname> <given-names>M. R.</given-names></name> <name><surname>Francis</surname> <given-names>O. E.</given-names></name> <name><surname>Bild</surname> <given-names>A. H.</given-names></name> <name><surname>Johnson</surname> <given-names>W. E.</given-names></name></person-group> (<year>2013</year>). <article-title>Multiplatform single-sample estimates of transcriptional activation</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>110</volume>, <fpage>17778</fpage>&#x02013;<lpage>17783</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1305823110</pub-id><pub-id pub-id-type="pmid">24128763</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prinsen</surname> <given-names>R. T. M. M.</given-names></name> <name><surname>Strillaci</surname> <given-names>M. G.</given-names></name> <name><surname>Schiavini</surname> <given-names>F.</given-names></name> <name><surname>Santus</surname> <given-names>E.</given-names></name> <name><surname>Rossoni</surname> <given-names>A.</given-names></name> <name><surname>Maurer</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>A genome-wide scan of copy number variants using high-density SNPs in Brown Swiss dairy cattle</article-title>. <source>Livestock Sci.</source> <volume>191</volume>, <fpage>153</fpage>&#x02013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1016/j.livsci.2016.08.006</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramayo-Caldas</surname> <given-names>Y.</given-names></name> <name><surname>Castell&#x000F3;</surname> <given-names>A.</given-names></name> <name><surname>Pena</surname> <given-names>R. N.</given-names></name> <name><surname>Alves</surname> <given-names>E.</given-names></name> <name><surname>Mercad&#x000E9;</surname> <given-names>A.</given-names></name> <name><surname>Souza</surname> <given-names>C. A.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>593</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-593</pub-id><pub-id pub-id-type="pmid">20969757</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname> <given-names>A.</given-names></name> <name><surname>Pimentel</surname> <given-names>H.</given-names></name> <name><surname>Trapnell</surname> <given-names>C.</given-names></name> <name><surname>Pachter</surname> <given-names>L.</given-names></name></person-group> (<year>2011</year>). <article-title>Identification of novel transcripts in annotated genomes using RNA-Seq</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>2325</fpage>&#x02013;<lpage>2329</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr355</pub-id><pub-id pub-id-type="pmid">21697122</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabat</surname> <given-names>J.</given-names></name> <name><surname>Lakshmi</surname> <given-names>B.</given-names></name> <name><surname>Troge</surname> <given-names>J.</given-names></name> <name><surname>Alexander</surname> <given-names>J.</given-names></name> <name><surname>Young</surname> <given-names>J.</given-names></name> <name><surname>Lundin</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Large-scale polymorphism in the human genome</article-title>. <source>Science</source> <volume>305</volume>, <fpage>525</fpage>&#x02013;<lpage>528</lpage>. <pub-id pub-id-type="doi">10.1126/science.1098918</pub-id><pub-id pub-id-type="pmid">15273396</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>C.</given-names></name> <name><surname>Vester</surname> <given-names>U.</given-names></name> <name><surname>Wagner</surname> <given-names>C. A.</given-names></name> <name><surname>Lahme</surname> <given-names>S.</given-names></name> <name><surname>Hesse</surname> <given-names>A.</given-names></name> <name><surname>Hoyer</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>Significant contribution of genomic rearrangements in <italic>SLC3A1</italic> and <italic>SLC7A9</italic> to the etiology of cystinuria</article-title>. <source>Kidney Int.</source> <volume>64</volume>, <fpage>1564</fpage>&#x02013;<lpage>1572</lpage>. <pub-id pub-id-type="doi">10.1046/j.1523-1755.2003.00250.x</pub-id><pub-id pub-id-type="pmid">14531788</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharpe</surname> <given-names>R. M.</given-names></name> <name><surname>Franks</surname> <given-names>S.</given-names></name></person-group> (<year>2002</year>). <article-title>Environment, lifestyle and infertility &#x02013; an intergenerational issue</article-title>. <source>Nat. Cell Biol.</source> <volume>4</volume>, <fpage>S33</fpage>&#x02013;<lpage>S40</lpage>. <pub-id pub-id-type="doi">10.1038/ncb-nm-fertilityS33</pub-id><pub-id pub-id-type="pmid">12479613</pub-id></citation>
</ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Snelling</surname> <given-names>W. M.</given-names></name> <name><surname>Bennett</surname> <given-names>G. L.</given-names></name> <name><surname>Keele</surname> <given-names>J. W.</given-names></name> <name><surname>Kuehn</surname> <given-names>L. A.</given-names></name> <name><surname>McDaneld</surname> <given-names>T. G.</given-names></name> <name><surname>Smith</surname> <given-names>T. P.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>A survey of polymorphisms detected from sequences of popular beef breeds</article-title>. <source>J. Anim. Sci.</source> <volume>93</volume>, <fpage>5128</fpage>&#x02013;<lpage>5143</lpage>. <pub-id pub-id-type="doi">10.2527/jas.2015-9356</pub-id><pub-id pub-id-type="pmid">26641033</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spehr</surname> <given-names>M.</given-names></name> <name><surname>Schwane</surname> <given-names>K.</given-names></name> <name><surname>Riffell</surname> <given-names>J. A.</given-names></name> <name><surname>Zimmer</surname> <given-names>R. K.</given-names></name> <name><surname>Hatt</surname> <given-names>H.</given-names></name></person-group> (<year>2006</year>). <article-title>Odorant receptors and olfactory-like signaling mechanisms in mammalian sperm</article-title>. <source>Mol. Cell. Endocrinol.</source> <volume>250</volume>, <fpage>128</fpage>&#x02013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.1016/j.mce.2005.12.035</pub-id><pub-id pub-id-type="pmid">16413109</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stothard</surname> <given-names>P.</given-names></name> <name><surname>Choi</surname> <given-names>J.-W.</given-names></name> <name><surname>Basu</surname> <given-names>U.</given-names></name> <name><surname>Sumner-Thomson</surname> <given-names>J. M.</given-names></name> <name><surname>Meng</surname> <given-names>Y.</given-names></name> <name><surname>Liao</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Whole genome resequencing of Black Angus and Holstein cattle for SNP and CNV discovery</article-title>. <source>BMC Genomics</source> <volume>12</volume>:<fpage>559</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-12-559</pub-id><pub-id pub-id-type="pmid">22085807</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Traherne</surname> <given-names>J. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Human MHC architecture and evolution: implications for disease association studies</article-title>. <source>Int. J. Immunogenet.</source> <volume>35</volume>, <fpage>179</fpage>&#x02013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1111/j.1744-313X.2008.00765.x</pub-id><pub-id pub-id-type="pmid">18397301</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trapnell</surname> <given-names>C.</given-names></name> <name><surname>Pachter</surname> <given-names>L.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2009</year>). <article-title>TopHat: discovering splice junctions with RNA-Seq</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>1105</fpage>&#x02013;<lpage>1111</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp120</pub-id><pub-id pub-id-type="pmid">19289445</pub-id></citation>
</ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vandepoele</surname> <given-names>K.</given-names></name> <name><surname>van Roy</surname> <given-names>F.</given-names></name></person-group> (<year>2007</year>). <article-title>Insertion of an HERV(K) LTR in the intron of NBPF3 is not required for transcriptional activity</article-title>. <source>Virology</source> <volume>1</volume>, <fpage>1</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2007.01.044</pub-id></citation>
</ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vandepoele</surname> <given-names>K.</given-names></name> <name><surname>van Roy</surname> <given-names>N.</given-names></name> <name><surname>Staes</surname> <given-names>K.</given-names></name> <name><surname>Speleman</surname> <given-names>F.</given-names></name> <name><surname>van Roy</surname> <given-names>F.</given-names></name></person-group> (<year>2005</year>). <article-title>A novel gene family NBPF: intricate structure generated by gene duplications during primate evolution</article-title>. <source>Mol. Biol. Evol.</source> <volume>22</volume>, <fpage>2265</fpage>&#x02013;<lpage>2274</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msi222</pub-id><pub-id pub-id-type="pmid">16079250</pub-id></citation>
</ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van der Auwera</surname> <given-names>G. A.</given-names></name> <name><surname>Carneiro</surname> <given-names>M. O.</given-names></name> <name><surname>Hartl</surname> <given-names>C.</given-names></name> <name><surname>Poplin</surname> <given-names>R.</given-names></name> <name><surname>Del Angel</surname> <given-names>G.</given-names></name> <name><surname>Levy-Moonshine</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>From FastQ data to high-confidence variant calls: the genome analysis toolkit best practices pipeline</article-title>. <source>Curr. Protoc. Bioinform</source>. <volume>43</volume>:<fpage>11</fpage>.10.1-33. <pub-id pub-id-type="doi">10.1002/0471250953.bi1110s43</pub-id><pub-id pub-id-type="pmid">25431634</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>D.</given-names></name> <name><surname>Boije</surname> <given-names>H.</given-names></name> <name><surname>Meadows</surname> <given-names>J. R.</given-names></name> <name><surname>Bed&#x00027;hom</surname> <given-names>B.</given-names></name> <name><surname>Gourichon</surname> <given-names>D.</given-names></name> <name><surname>Vieaud</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Copy number variation in intron 1 of <italic>SOX5</italic> causes the pea-comb phenotype in chickens</article-title>. <source>PLoS Genet.</source> <volume>5</volume>:<fpage>e1000512</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1000512</pub-id><pub-id pub-id-type="pmid">19521496</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Fan</surname> <given-names>H.</given-names></name> <name><surname>Jing</surname> <given-names>S.</given-names></name> <name><surname>Xia</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>A genome-wide scan for copy number variations using high-density single nucleotide polymorphism array in <italic>Simmental cattle</italic></article-title>. <source>Anim. Genet.</source> <volume>46</volume>, <fpage>289</fpage>&#x02013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1111/age.12288</pub-id><pub-id pub-id-type="pmid">25917301</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Bickhart</surname> <given-names>D. M.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Hay</surname> <given-names>E. H.</given-names></name> <name><surname>Song</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Population-genetic properties of differentiated copy number variations in cattle</article-title>. <source>Sci. Rep.</source> <volume>6</volume>:<fpage>23161</fpage>. <pub-id pub-id-type="doi">10.1038/srep23161</pub-id><pub-id pub-id-type="pmid">27005566</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yi</surname> <given-names>G.</given-names></name> <name><surname>Qu</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Yan</surname> <given-names>Y.</given-names></name> <name><surname>Xu</surname> <given-names>G.</given-names></name> <name><surname>Yang</surname> <given-names>N.</given-names></name></person-group> (<year>2014</year>). <article-title>Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing</article-title>. <source>BMC Genomics</source> <volume>15</volume>:<fpage>962</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-15-962</pub-id><pub-id pub-id-type="pmid">25378104</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>J. M.</given-names></name> <name><surname>Endicott</surname> <given-names>R. M.</given-names></name> <name><surname>Parghi</surname> <given-names>S. S.</given-names></name> <name><surname>Walker</surname> <given-names>M.</given-names></name> <name><surname>Kidd</surname> <given-names>J. M.</given-names></name> <name><surname>Trask</surname> <given-names>B. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Extensive copy-number variation of the human olfactory receptor gene family</article-title>. <source>Am. J. Hum. Genet.</source> <volume>83</volume>, <fpage>228</fpage>&#x02013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2008.07.005</pub-id><pub-id pub-id-type="pmid">18674749</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhan</surname> <given-names>B.</given-names></name> <name><surname>Fadista</surname> <given-names>J.</given-names></name> <name><surname>Thomsen</surname> <given-names>B.</given-names></name> <name><surname>Hedegaard</surname> <given-names>J.</given-names></name> <name><surname>Panitz</surname> <given-names>F.</given-names></name> <name><surname>Bendixen</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping</article-title>. <source>BMC Genomics</source> <volume>12</volume>:<fpage>557</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-12-557</pub-id><pub-id pub-id-type="pmid">22082336</pub-id></citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zimin</surname> <given-names>A. V.</given-names></name> <name><surname>Delcher</surname> <given-names>A. L.</given-names></name> <name><surname>Florea</surname> <given-names>L.</given-names></name> <name><surname>Kelley</surname> <given-names>D. R.</given-names></name> <name><surname>Schatz</surname> <given-names>M. C.</given-names></name> <name><surname>Puiu</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>A whole-genome assembly of the domestic cow, <italic>Bos taurus</italic></article-title>. <source>Genome Biol.</source> <volume>10</volume>:<fpage>R42</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2009-10-4-r42</pub-id><pub-id pub-id-type="pmid">19393038</pub-id></citation>
</ref>
</ref-list>



</back>
</article>