<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1359591</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2024.1359591</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts</article-title>
<alt-title alt-title-type="left-running-head">Zhu et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2024.1359591">10.3389/fgene.2024.1359591</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Zhu</surname>
<given-names>Lirong</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2608420/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Shuanglin</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Sha</surname>
<given-names>Qiuying</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/24584/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
</contrib-group>
<aff>
<institution>Department of Mathematical Sciences</institution>, <institution>Michigan Technological University</institution>, <addr-line>Houghton</addr-line>, <addr-line>MI</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2748218/overview">Xihao Li</ext-link>, University of North Carolina at Chapel Hill, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/115107/overview">Lin Hou</ext-link>, Tsinghua University, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1378686/overview">Shibo Wang</ext-link>, University of California, Riverside, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1596930/overview">Diptavo Dutta</ext-link>, National Cancer Institute (NIH), United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Qiuying Sha, <email>qsha@mtu.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>09</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>15</volume>
<elocation-id>1359591</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>12</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>08</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2024 Zhu, Zhang and Sha.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Zhu, Zhang and Sha</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Genome-wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with complex diseases. Standard analysis of a GWAS involves assessing the association between each variant and a disease. However, this approach suffers from limited reproducibility and difficulties in detecting multi-variant and pleiotropic effects. Although joint analysis of multiple phenotypes for GWAS can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits, most of the multiple phenotype association tests are designed for a single variant, resulting in much lower power, especially when their effect sizes are small and only their cumulative effect is associated with multiple phenotypes. To overcome these limitations, set-based multiple phenotype association tests have been developed to enhance statistical power and facilitate the identification and interpretation of pleiotropic regions. In this research, we propose a new method, named Meta-TOW-S, which conducts joint association tests between multiple phenotypes and a set of variants (such as variants in a gene) utilizing GWAS summary statistics from different cohorts. Our approach applies the set-based method that Tests for the effect of an Optimal Weighted combination of variants in a gene (TOW) and accounts for sample size differences across GWAS cohorts by employing the Cauchy combination method. Meta-TOW-S combines the advantages of set-based tests and multi-phenotype association tests, exhibiting computational efficiency and enabling analysis across multiple phenotypes while accommodating overlapping samples from different GWAS cohorts. To assess the performance of Meta-TOW-S, we develop a phenotype simulator package that encompasses a comprehensive simulation scheme capable of modeling multiple phenotypes and multiple variants, including noise structures and diverse correlation patterns among phenotypes. Simulation studies validate that Meta-TOW-S maintains a desirable Type I error rate. Further simulation under different scenarios shows that Meta-TOW-S can improve power compared with other existing meta-analysis methods. When applied to four psychiatric disorders summary data, Meta-TOW-S detects a greater number of significant genes.</p>
</abstract>
<kwd-group>
<kwd>meta-analysis</kwd>
<kwd>joint analyses of multiple phenotypes</kwd>
<kwd>set-based association tests</kwd>
<kwd>GWAS summary statistics</kwd>
<kwd>phenotype simulator</kwd>
<kwd>multiple GWAS cohorts</kwd>
</kwd-group>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Statistical Genetics and Methodology</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Genome-wide association study (GWAS) is typically employed to identify an individual genetic variant associated with a specific phenotype. However, in cases where causal variants have weak effects on the trait and it is challenging to detect these variants, set-based tests are employed to identify the joint effects of multiple variants (multi-variants) on a particular phenotype (<xref ref-type="bibr" rid="B17">Lin et al., 2022</xref>; <xref ref-type="bibr" rid="B11">Huang et al., 2011</xref>; <xref ref-type="bibr" rid="B20">Liu et al., 2010</xref>; <xref ref-type="bibr" rid="B30">Neale and Sham, 2004</xref>). Compared to single-variant approaches, set-based tests can help to reduce the number of genome-wide association tests and are effective when a causal variant is unobserved or multiple causal variants are present (<xref ref-type="bibr" rid="B17">Lin et al., 2022</xref>). Consequently, set-based tests have been applied in gene-set analyses of common variants and rare variants (<xref ref-type="bibr" rid="B19">Liu et al., 2014</xref>). For instance, both weighted and unweighted burden tests have a good performance when variants in a gene affect a phenotype in the same direction (<xref ref-type="bibr" rid="B29">Morris and Zeggini, 2010</xref>). The sequence kernel association test (SKAT) is a score-based variance-component test that accommodates variants in a gene with opposite effects on a phenotype (<xref ref-type="bibr" rid="B43">Wu et al., 2011</xref>). However, the performance of the approaches mentioned above depends on the weighting scheme, such as the MAF-based weighting scheme that up-weights the contribution of rare variants and down-weights that of common variants. This weighting scheme may lead to a loss in power when common variants in the region are associated with the phenotype (<xref ref-type="bibr" rid="B8">Dutta et al., 2019b</xref>; <xref ref-type="bibr" rid="B7">Dutta et al., 2019a</xref>). The optimal strategy for grouping or weighting genetic variants depends on the unknown genetic architecture of each phenotype and each variant. TOW for Testing the effect of an Optimal Weighted combination of variants in a set is a powerful method to increase power by assigning optimal weights to genetic variants (<xref ref-type="bibr" rid="B36">Sha et al., 2012</xref>).</p>
<p>Many complex phenotypes are influenced by multiple genetic variants, each with a small effect size. Set-based tests that consider a single phenotype might not capture the collective effects of these variants, leading to reduced power. Alternatively, cross-phenotype tests that aggregate associations in multiple phenotypes can substantially improve power over single phenotype-based methods (<xref ref-type="bibr" rid="B8">Dutta et al., 2019b</xref>; <xref ref-type="bibr" rid="B7">Dutta et al., 2019a</xref>). Meta-analysis of multiple phenotypes, using GWAS summary statistics, is a practical approach to increase power by increasing sample sizes and aggregating variants with small effect sizes to detect more significant pleiotropic genes (<xref ref-type="bibr" rid="B3">Cirulli et al., 2020</xref>; <xref ref-type="bibr" rid="B31">Panagiotou et al., 2013</xref>). Detecting the pleiotropic genes can provide insights into biological mechanisms influencing complex human phenotypes. The challenge in meta-analysis is that there is no uniformly most powerful (UMP) test. The power depends on signal directions and between-phenotype correlation. To boost analysis power, several methods have been proposed for GWAS multiple phenotype analysis. For instance, Fisher&#x2019;s method of combining independent <italic>p</italic>-values has been extended to dependent univariate tests (<xref ref-type="bibr" rid="B18">Li et al., 2014</xref>). However, the <italic>p</italic>-value approximations of these tests are not accurate for small significance levels often required by GWASs. The minimum of the <italic>p</italic>-values (MinP) of multiple phenotypes has been proposed as a testing statistic (<xref ref-type="bibr" rid="B4">Conneely and Boehnke, 2007</xref>). And this test is powerful when a gene affects only a very small number of multiple phenotypes, but is less powerful in the presence of denser signals (<xref ref-type="bibr" rid="B24">Liu and Lin, 2018</xref>). The aggregated Cauchy association test (ACAT) is a flexible and computationally efficient <italic>p</italic>-value combination method that boosts power under various genetic architectures (<xref ref-type="bibr" rid="B23">Liu and Xie, 2019</xref>; <xref ref-type="bibr" rid="B22">Liu et al., 2019</xref>). Thus, in this article, we applied this approach to combine <italic>p</italic>-values of set-based tests from multiple cohort studies. Most importantly, the Cauchy combination is an extremely fast omnibus testing procedure that performs multiple testing adjustments analytically and applies to the combination of any tests (<xref ref-type="bibr" rid="B26">Li et al., 2023</xref>; <xref ref-type="bibr" rid="B28">Li et al., 2022</xref>; <xref ref-type="bibr" rid="B25">Li et al., 2020</xref>).</p>
<p>Meta-analysis is a promising approach to detecting a series of gene-phenotype associations that would have remained undetected by a participating cohort alone. In addition, a meta-analysis based on GWAS summary statistics simplifies data sharing, keeping sensitive individual data at the cohort level and sharing only non-sensitive summary data (<xref ref-type="bibr" rid="B17">Lin et al., 2022</xref>). For instance, Meta-MultiSKAT performs a variance component test and uses summary statistics to test for association between multiple continuous phenotypes and variants in a region. However, the <italic>p</italic>-value calculation of Meta-MultiSKAT relies on the normality assumption of the score vector and this assumption may be violated in the meta-analysis (<xref ref-type="bibr" rid="B8">Dutta et al., 2019b</xref>; <xref ref-type="bibr" rid="B7">Dutta et al., 2019a</xref>). MetaUSAT is a novel unified association test of multiple traits with only a single genetic variant, and the test statistic is dominated by a predefined parameter weight (<xref ref-type="bibr" rid="B35">Ray and Boehnke, 2018</xref>). In the cross meta-analysis, the overlapping subjects can induce a correlation between the summary statistics and inflate the false discovery rate of meta-analyses. Method FOLD is proposed to account for overlapping subjects at the summary statistics level using a split prior which categorizes subjects based on their contributions to the final statistic (<xref ref-type="bibr" rid="B13">Kim et al., 2017</xref>). However, it is designed for qualitative phenotypes and is difficult to obtain splitting prior if the numbers of cases and controls in any of the GWAS cohorts are missing. Here we propose a new approach, Meta-TOW-S, which conducts joint association tests between multiple phenotypes and genetic variants within a gene, utilizing GWAS summary statistics from diverse GWAS. Our approach applies set-based tests using an optimal weighted combination of variants and accounts for sample size differences across different GWAS by employing the Cauchy combination method. Meta-TOW-S combines the advantages of set-based tests and multi-phenotype modeling, exhibiting computational efficiency and enabling analysis across multiple phenotypes while accommodating overlapping samples from different cohorts.</p>
<p>To evaluate the performance of Meta-TOW-S, we need to mimic GWAS summary data from different cohorts. Thus, we also develop a phenotype simulator package that encompasses a comprehensive simulation scheme capable of modeling multiple phenotypes with multiple underlying genetic loci, intricate noise structures, and diverse correlation patterns among the phenotypes. The R package for the phenotype simulator is available on GitHub (<ext-link ext-link-type="uri" xlink:href="https://github.com/Julia-lirong/PheGen">https://github.com/Julia-lirong/PheGen</ext-link>). Furthermore, we evaluate the performance of our method using simulation studies and compare the power of our method with the power of three existing methods which integrate Burden (<xref ref-type="bibr" rid="B29">Morris and Zeggini, 2010</xref>), SKAT (<xref ref-type="bibr" rid="B43">Wu et al., 2011</xref>), and VEGAS (<xref ref-type="bibr" rid="B20">Liu et al., 2010</xref>) with Cauchy combination (<xref ref-type="bibr" rid="B23">Liu and Xie, 2019</xref>) to detect pleiotropic effects. Our simulation studies validate that Meta-TOW-S maintains a desirable Type I error rate and enhances power across various simulation scenarios compared with other existing meta-analysis methods. We also apply Meta-TOW-S to four psychiatric disorders summary data which are available from the Psychiatric Genomics Consortium (PGC) (<xref ref-type="bibr" rid="B40">Sullivan et al., 2018</xref>). The real data analyses demonstrate that Meta-TOW-S outperforms other comparison methods by detecting a greater number of significant genes.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>2 Materials and methods</title>
<p>Consider <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> phenotypes from <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> GWAS cohorts with sample sizes <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> that are subject to <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mover accent="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mover>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munder>
</mml:mstyle>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. For the <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort, suppose that the GWAS summary statistics <inline-formula id="inf6">
<mml:math id="m6">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msubsup>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>M</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> are the Z-scores of <inline-formula id="inf7">
<mml:math id="m7">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> genetic variants in a genomic region. We assume all cohorts share the same genetic variants in the specific region. <inline-formula id="inf8">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the number of overlapping subjects among all cohorts, where <inline-formula id="inf9">
<mml:math id="m9">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mo>&#x2061;</mml:mo>
<mml:mi mathvariant="italic">min</mml:mi>
<mml:mrow>
<mml:mfenced open="{" close="}" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<sec id="s2-1">
<title>2.1 Meta-TOW-S</title>
<p>For the <inline-formula id="inf10">
<mml:math id="m10">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort, we suppose that <inline-formula id="inf11">
<mml:math id="m11">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf12">
<mml:math id="m12">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix of genotypes in the interested genomic region (gene or pathway), and <inline-formula id="inf13">
<mml:math id="m13">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf14">
<mml:math id="m14">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> vector of phenotypes (either a quantitative or qualitative phenotype). In TOW (<xref ref-type="bibr" rid="B36">Sha et al., 2012</xref>), the generalized linear regression model with the fixed effect is used to link the phenotype and genotypes. The statistic model is defined as <inline-formula id="inf15">
<mml:math id="m15">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x7c;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf16">
<mml:math id="m16">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of weights for the <inline-formula id="inf17">
<mml:math id="m17">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> genetic variants; <inline-formula id="inf18">
<mml:math id="m18">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf19">
<mml:math id="m19">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the weight assigned to the <inline-formula id="inf20">
<mml:math id="m20">
<mml:mrow>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> variant in the <inline-formula id="inf21">
<mml:math id="m21">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort. <inline-formula id="inf22">
<mml:math id="m22">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the effect size of the weighted combination of genetic variants <inline-formula id="inf23">
<mml:math id="m23">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> on the phenotype <inline-formula id="inf24">
<mml:math id="m24">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Under the null hypothesis of no association between the variants in the region and the <inline-formula id="inf25">
<mml:math id="m25">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> phenotype, we test <inline-formula id="inf26">
<mml:math id="m26">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msub>
<mml:mi>&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The score test statistic is given by <inline-formula id="inf27">
<mml:math id="m27">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi mathvariant="bold-italic">k</mml:mi>
<mml:mi mathvariant="bold-italic">T</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
</mml:mrow>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf28">
<mml:math id="m28">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:msub>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:msub>
<mml:msubsup>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mi>T</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf29">
<mml:math id="m29">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:math>
</inline-formula> , <inline-formula id="inf30">
<mml:math id="m30">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf31">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> represents a <inline-formula id="inf32">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> vector containing all ones, and <inline-formula id="inf33">
<mml:math id="m33">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf34">
<mml:math id="m34">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> identity matrix. Under the null hypothesis, <inline-formula id="inf35">
<mml:math id="m35">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> follows a multivariate normal distribution with mean vector <inline-formula id="inf36">
<mml:math id="m36">
<mml:mrow>
<mml:mn mathvariant="bold">0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and covariance matrix <inline-formula id="inf37">
<mml:math id="m37">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. The TOW method obtains the optimal weights <inline-formula id="inf38">
<mml:math id="m38">
<mml:mrow>
<mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2a;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> by maximizing the score test statistic <inline-formula id="inf39">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> using the Cauchy-Schwartz inequality. Specifically, we have the form <inline-formula id="inf40">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="bold">w</mml:mi>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x2264;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mfenced open="&#x2329;" close="&#x232a;" separators="&#x7c;">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="&#x2329;" close="&#x232a;" separators="&#x7c;">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn mathvariant="bold">2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:mi>E</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, with the equality when <inline-formula id="inf41">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x221d;</mml:mo>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> , where <inline-formula id="inf42">
<mml:math id="m42">
<mml:mrow>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is any <inline-formula id="inf43">
<mml:math id="m43">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> positive definite matrix, and <inline-formula id="inf44">
<mml:math id="m44">
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:mi>E</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> is a constant. Based on Yan&#x2019;s work (<xref ref-type="bibr" rid="B44">Yan, 2022</xref>), <inline-formula id="inf45">
<mml:math id="m45">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>c</mml:mi>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi mathvariant="bold-italic">E</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a quadric term with the asymptotical distribution of weighted sum of Chi-squares. We assume <inline-formula id="inf46">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is full rank and let <inline-formula id="inf47">
<mml:math id="m47">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Then the optimal weight is obtained with the form <inline-formula id="inf48">
<mml:math id="m48">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">w</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2a;</mml:mo>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn mathvariant="bold">1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Using the optimal weights, the test statistic of TOW is given by <inline-formula id="inf49">
<mml:math id="m49">
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">u</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> . Under the null hypothesis, <inline-formula id="inf50">
<mml:math id="m50">
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> asymptotically follows a Chi-square distribution with a degree of freedom <inline-formula id="inf51">
<mml:math id="m51">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, that is <inline-formula id="inf52">
<mml:math id="m52">
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x223c;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c7;</mml:mi>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B44">Yan, 2022</xref>; <xref ref-type="bibr" rid="B36">Sha et al., 2012</xref>).</p>
<p>Inspired by the idea proposed in this project (<xref ref-type="bibr" rid="B41">Svishcheva et al., 2019</xref>), <xref ref-type="bibr" rid="B44">Yan (2022)</xref> rewrite the test statistic <inline-formula id="inf53">
<mml:math id="m53">
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> using GWAS summary statistics. For the <inline-formula id="inf54">
<mml:math id="m54">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort, the Z score of the <inline-formula id="inf55">
<mml:math id="m55">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> genetic variants in a region can be written as <inline-formula id="inf56">
<mml:math id="m56">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msubsup>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>1</mml:mn>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>Z</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>M</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">V</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:msqrt>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf57">
<mml:math id="m57">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">V</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is an <inline-formula id="inf58">
<mml:math id="m58">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> diagonal matrix of the square roots of genotypic variances of the <inline-formula id="inf59">
<mml:math id="m59">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> variants. We assume that under the null hypothesis, <inline-formula id="inf60">
<mml:math id="m60">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> follows the multivariate normal distribution <inline-formula id="inf61">
<mml:math id="m61">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn mathvariant="bold">0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">U</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf62">
<mml:math id="m62">
<mml:mrow>
<mml:mi mathvariant="bold">U</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is an <inline-formula id="inf63">
<mml:math id="m63">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix of correlations between the genotypes of these variant and <inline-formula id="inf64">
<mml:math id="m64">
<mml:mrow>
<mml:mi mathvariant="bold">U</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">V</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msubsup>
<mml:mi mathvariant="bold-italic">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">P</mml:mi>
<mml:mn mathvariant="bold">0</mml:mn>
</mml:msub>
<mml:mi mathvariant="bold-italic">G</mml:mi>
</mml:mrow>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msup>
<mml:mi mathvariant="bold-italic">V</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula>. Then the test statistic of TOW based on individual data can be written as <inline-formula id="inf65">
<mml:math id="m65">
<mml:mrow>
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msubsup>
<mml:mi mathvariant="normal">&#x3a3;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msubsup>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">P</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msup>
<mml:mi mathvariant="bold">U</mml:mi>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. This test statistic is called <inline-formula id="inf66">
<mml:math id="m66">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf67">
<mml:math id="m67">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x223c;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c7;</mml:mi>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> asymptotically, where <inline-formula id="inf68">
<mml:math id="m68">
<mml:mrow>
<mml:msup>
<mml:mi>M</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> is the number of variants left after correlation pruning (<xref ref-type="bibr" rid="B41">Svishcheva et al., 2019</xref>). We can estimate <inline-formula id="inf69">
<mml:math id="m69">
<mml:mrow>
<mml:mi mathvariant="bold">U</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> using a reference sample of genotypes from the same population, such as 1,000 Genome phase 3 if individual genotype data are not available.</p>
<p>Consider <inline-formula id="inf70">
<mml:math id="m70">
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> phenotypes from different GWAS cohorts, we denote <inline-formula id="inf71">
<mml:math id="m71">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> as the <italic>p</italic>-value of <inline-formula id="inf72">
<mml:math id="m72">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>O</mml:mi>
<mml:mi>W</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> for the <inline-formula id="inf73">
<mml:math id="m73">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> GWAS cohort, where <inline-formula id="inf74">
<mml:math id="m74">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. Then to detect the association between genetic variants in this region and multiple phenotypes, we define the Cauchy combination test statistic as <inline-formula id="inf75">
<mml:math id="m75">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mover accent="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mi>K</mml:mi>
</mml:mover>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munder>
</mml:mstyle>
<mml:msub>
<mml:mi>&#x3bd;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2061;</mml:mo>
<mml:mi mathvariant="italic">tan</mml:mi>
<mml:mrow>
<mml:mfenced open="{" close="}" separators="&#x7c;">
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>0.5</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>&#x3c0;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where the weight is defined as <inline-formula id="inf76">
<mml:math id="m76">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bd;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf77">
<mml:math id="m77">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>C</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> has a standard Cauchy distribution under the null (<xref ref-type="bibr" rid="B23">Liu and Xie, 2019</xref>). Here we assigned more weight to a large GWAS cohort because a GWAS cohort with a large sample size carries more information than a smaller GWAS cohort (<xref ref-type="bibr" rid="B47">Zhu et al., 2015</xref>).</p>
</sec>
<sec id="s2-2">
<title>2.2 Comparison with other set-based association tests</title>
<p>Versatile set-based association study (VEGAS): For a specific region with <inline-formula id="inf78">
<mml:math id="m78">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> genetic variants in the <inline-formula id="inf79">
<mml:math id="m79">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> GWAS summary study, the test statistic of VEGAS is the sum of all squared Z-scores which is <inline-formula id="inf80">
<mml:math id="m80">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B20">Liu et al., 2010</xref>). Under the null hypothesis, <inline-formula id="inf81">
<mml:math id="m81">
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> asymptotically follows a mixture of chi-square distribution. To obtain the <italic>p</italic>-value of VEGAS, several methods have been proposed, such as numerical inversion of the characteristic function (<xref ref-type="bibr" rid="B21">Liu et al., 2009</xref>), Davies method (<xref ref-type="bibr" rid="B5">Davies, 1980</xref>), or Saddlepoint approximation (<xref ref-type="bibr" rid="B14">Kuonen, 1999</xref>).</p>
<p>SKAT and Burden: For the <inline-formula id="inf82">
<mml:math id="m82">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort, we consider the generalized linear model with the random effect, we have <inline-formula id="inf83">
<mml:math id="m83">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x7c;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf84">
<mml:math id="m84">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a vector of effect size of the <inline-formula id="inf85">
<mml:math id="m85">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> variants which is assumed to follow a normal distribution <inline-formula id="inf86">
<mml:math id="m86">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c4;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> under the null hypothesis, where <inline-formula id="inf87">
<mml:math id="m87">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x3c4;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the variance component of the phenotype explained by the <inline-formula id="inf88">
<mml:math id="m88">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> variants. To test the association between the <inline-formula id="inf89">
<mml:math id="m89">
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> genetic variants in a region and the phenotype, Burden and SKAT test the null hypothesis <inline-formula id="inf90">
<mml:math id="m90">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c4;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> against <inline-formula id="inf91">
<mml:math id="m91">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mi>a</mml:mi>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msubsup>
<mml:mi>&#x3c4;</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3e;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B41">Svishcheva et al., 2019</xref>). For the <inline-formula id="inf92">
<mml:math id="m92">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> GWAS cohort, the burden test statistic using GWAS summary statistics can be written as <inline-formula id="inf93">
<mml:math id="m93">
<mml:mrow>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi mathvariant="normal">k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:msub>
<mml:mn mathvariant="bold">1</mml:mn>
<mml:mi>M</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> , where <inline-formula id="inf94">
<mml:math id="m94">
<mml:mrow>
<mml:mi mathvariant="bold">W</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf95">
<mml:math id="m95">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> diagonal matrix with <inline-formula id="inf96">
<mml:math id="m96">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf97">
<mml:math id="m97">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the minor allele frequency for the <inline-formula id="inf98">
<mml:math id="m98">
<mml:mrow>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> variant (<xref ref-type="bibr" rid="B16">Lee et al., 2013</xref>). Under the null hypothesis, <inline-formula id="inf99">
<mml:math id="m99">
<mml:mrow>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> follows a scaled Chi-square distribution with one degree of freedom. In SKAT, the test statistic based on GWAS summary statistic is defined as <inline-formula id="inf100">
<mml:math id="m100">
<mml:mrow>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>K</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi mathvariant="normal">k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi>k</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
<mml:mtext>WW</mml:mtext>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
<mml:msub>
<mml:mi mathvariant="bold">S</mml:mi>
<mml:mi mathvariant="normal">k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf101">
<mml:math id="m101">
<mml:mrow>
<mml:mi mathvariant="bold">W</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf102">
<mml:math id="m102">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> diagonal matrix of weights with <inline-formula id="inf103">
<mml:math id="m103">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>B</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. The beta distribution density function has pre-specified parameters <inline-formula id="inf104">
<mml:math id="m104">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf105">
<mml:math id="m105">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf106">
<mml:math id="m106">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. To obtain the <italic>p</italic>-value of Burden and SKAT, several methods have been proposed, such as numerical inversion of the characteristic function (<xref ref-type="bibr" rid="B21">Liu et al., 2009</xref>), Davies method (<xref ref-type="bibr" rid="B5">Davies, 1980</xref>), or Saddlepoint approximation (<xref ref-type="bibr" rid="B14">Kuonen, 1999</xref>).</p>
<p>Let <inline-formula id="inf107">
<mml:math id="m107">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> be the <italic>p</italic>-value of the <inline-formula id="inf108">
<mml:math id="m108">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort study for <inline-formula id="inf109">
<mml:math id="m109">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> based on the three comparison methods Vegas, Burden, and SKAT. We use the same strategy to detect the association between multiple phenotypes in different GWAS cohorts and genetic variants in a region by employing the Cauchy combination (<xref ref-type="bibr" rid="B23">Liu and Xie, 2019</xref>). And we designate these three methods as Meta-Vegas, Meta-Burden, and Meta-SKAT, respectively.</p>
</sec>
</sec>
<sec id="s3">
<title>3 Simulation studies</title>
<sec id="s3-1">
<title>3.1 Phenotype simulator</title>
<p>Suppose we have <inline-formula id="inf110">
<mml:math id="m110">
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> cohorts with sample sizes <inline-formula id="inf111">
<mml:math id="m111">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, respectively. For the <inline-formula id="inf112">
<mml:math id="m112">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort, the phenotype is modeled by a linear model <inline-formula id="inf113">
<mml:math id="m113">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold-italic">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf114">
<mml:math id="m114">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> are matrices of genotypes with columns standardized to mean zero and variance 1, with dimension <inline-formula id="inf115">
<mml:math id="m115">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. <inline-formula id="inf116">
<mml:math id="m116">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is a <inline-formula id="inf117">
<mml:math id="m117">
<mml:mrow>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> vector of standardized phenotypes with mean zero and variance 1. <inline-formula id="inf118">
<mml:math id="m118">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of genotypes effect sizes in the specific gene and <inline-formula id="inf119">
<mml:math id="m119">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the vector of residuals representing environmental effects and non-additive genetic effects for the <inline-formula id="inf120">
<mml:math id="m120">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> cohort. For each cohort, we assume that all genotype effect sizes are drawn with equal variance for all causal variants in a gene (<xref ref-type="bibr" rid="B15">Lee et al., 2014</xref>). Then for all <inline-formula id="inf121">
<mml:math id="m121">
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> cohorts, we suppose that the matrix of effect size <inline-formula id="inf122">
<mml:math id="m122">
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula> has mean zero and covariance matrix <inline-formula id="inf123">
<mml:math id="m123">
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="&#x7c;">
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="&#x7c;">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mtext>&#x2002;</mml:mtext>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mn>12</mml:mn>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mn>12</mml:mn>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mi>K</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf124">
<mml:math id="m124">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the genetic covariance on two phenotypes <inline-formula id="inf125">
<mml:math id="m125">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf126">
<mml:math id="m126">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> on shared individuals, <inline-formula id="inf127">
<mml:math id="m127">
<mml:mrow>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> is the heritability explained by the variants in a region for the <inline-formula id="inf128">
<mml:math id="m128">
<mml:mrow>
<mml:msup>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> phenotype. The residuals <inline-formula id="inf129">
<mml:math id="m129">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> has mean zero and covariance matrix <inline-formula id="inf130">
<mml:math id="m130">
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="&#x7c;">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mfenced open="[" close="]" separators="&#x7c;">
<mml:mrow>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>12</mml:mn>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>12</mml:mn>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>h</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
<mml:mtd>
<mml:mtable columnalign="center">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mi>K</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi mathvariant="bold">I</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <inline-formula id="inf131">
<mml:math id="m131">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the covariance of non-genetic effects on the <inline-formula id="inf132">
<mml:math id="m132">
<mml:mrow>
<mml:msup>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf133">
<mml:math id="m133">
<mml:mrow>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> phenotypes among shared individuals. Then, the phenotypic correlation for cohort <inline-formula id="inf134">
<mml:math id="m134">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and cohort <inline-formula id="inf135">
<mml:math id="m135">
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> among the <inline-formula id="inf136">
<mml:math id="m136">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> overlapping samples is <inline-formula id="inf137">
<mml:math id="m137">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi>&#x3c1;</mml:mi>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (Lemma). For these <inline-formula id="inf138">
<mml:math id="m138">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> overlapping individuals, all cohorts share the same genotype data.</p>
<p>Next, to generate genotypes for individuals in a cohort, we employ the calibration coalescent model (COSI) to generate 10,000 haplotypes for a region of approximately 200&#xa0;kbps, mimicking the LD structure found in individuals of European ancestry (<xref ref-type="bibr" rid="B10">He et al., 2017</xref>). We randomly select regions of 10&#xa0;kbps in length, which encompass approximately 100 genetic variants, and utilize the simulated haplotypes to create genotypes for the variant sets. Among these genetic variants, we specifically designate 10% as causal variants, comprising 60% rare variants and 40% common variants, respectively. Subsequently, we generate the genetic component <inline-formula id="inf139">
<mml:math id="m139">
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
</inline-formula> and non-genetic component <inline-formula id="inf140">
<mml:math id="m140">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> based on the distribution described above. Specifically, we fix the phenotypic correlation between each pair of phenotypes. The phenotype correlation among the overlapping individuals is influenced by both genetic and non-genetic covariance, as proven in Lemma in the <xref ref-type="sec" rid="s12">Supplementary Material</xref>. We allocate 80% of the phenotypic correlation to genetic covariance and 20% to non-genetic covariance. Finally, we generate K quantitative phenotypes in different cohorts using the additive model <inline-formula id="inf141">
<mml:math id="m141">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="bold">y</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">G</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b2;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="bold">&#x3b5;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf142">
<mml:math id="m142">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mfenced open="{" close="}" separators="&#x7c;">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="s3-2">
<title>3.2 Simulation</title>
<p>To set up a multi-cohort scenario, we generated individuals for multiple cohorts with different sample sizes but the same overlapping sample size for simplicity. To achieve a normally distributed input for the association test between gene and phenotype within each cohort, a rank-based inverse-normal transformation to the residuals of each phenotype was performed. In simulations, we access the performance of Meta-TOW-S with compared methods Meta-Vegas, Meta-Burden, and Meta-SKAT. We design different patterns of phenotypes by varying the correlation of phenotypes, sample sizes, and overlapping samples.</p>
</sec>
</sec>
<sec sec-type="results" id="s4">
<title>4 Results</title>
<sec id="s4-1">
<title>4.1 Type I error rates</title>
<p>To evaluate the Type I error rates of Meta-TOW-S, we first simulate <inline-formula id="inf143">
<mml:math id="m143">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>000</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> regions, each encompassing approximately 100 genetic variants, reflecting the LD structure observed in individuals of European ancestry. We then replicate <inline-formula id="inf144">
<mml:math id="m144">
<mml:mrow>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mn>5</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> times to generate the phenotypes under the null hypothesis of no genetic contribution to any of the three traits, that is <inline-formula id="inf145">
<mml:math id="m145">
<mml:mrow>
<mml:mi>&#x3b2;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Then we simulate <inline-formula id="inf146">
<mml:math id="m146">
<mml:mrow>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mn>8</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> datasets to estimate the Type I error rates at nominal significance levels <inline-formula id="inf147">
<mml:math id="m147">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#x2a;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. We generate three phenotypes with different sample sizes. In the first situation, the sample sizes for the three phenotypes are equal, with a ratio of <inline-formula id="inf148">
<mml:math id="m148">
<mml:mrow>
<mml:mn>1000</mml:mn>
<mml:mo>:</mml:mo>
<mml:mn>1000</mml:mn>
<mml:mo>:</mml:mo>
<mml:mn>1000</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. In the second scenario, the sample sizes are unequal, with a ratio of <inline-formula id="inf149">
<mml:math id="m149">
<mml:mrow>
<mml:mn>2000</mml:mn>
<mml:mo>:</mml:mo>
<mml:mn>1000</mml:mn>
<mml:mo>:</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. For simplicity, the number of overlapping individuals between any two phenotypes is kept constant at <inline-formula id="inf150">
<mml:math id="m150">
<mml:mrow>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The phenotypic correlation between any two phenotypes among the overlapping individuals is set to be either <inline-formula id="inf151">
<mml:math id="m151">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula id="inf152">
<mml:math id="m152">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. A correlation of 1 indicates simulation of mimicking one phenotype but form different cohorts</p>
<p>For Type I error evaluations, we use the significance levels <inline-formula id="inf153">
<mml:math id="m153">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#x2a;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. <xref ref-type="table" rid="T1">Table 1</xref> summarizes the estimated Type I error rates of the four tests based on different settings. We can see from <xref ref-type="table" rid="T1">Table 1</xref>, that the Type I error rates of all tests are all within the estimated 95% confidence intervals in most situations indicating that the Type I error rates of the four tests are well controlled at the nominal significance levels.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Type I error estimates of the four tests. Each entry represents the Type I error rate estimated by the proportions of <italic>p</italic>-values less than &#x3b1; with <inline-formula id="inf154">
<mml:math id="m154">
<mml:mrow>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mn>8</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> simulations. The phenotypic correlation between any pair of phenotypes is set to be either <inline-formula id="inf155">
<mml:math id="m155">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula id="inf156">
<mml:math id="m156">
<mml:mrow>
<mml:mi>&#x3c1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The sample sizes of the three cohorts are <inline-formula id="inf157">
<mml:math id="m157">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1000</mml:mn>
<mml:mo>&#x2236;</mml:mo>
<mml:mn>1000</mml:mn>
<mml:mo>&#x2236;</mml:mo>
<mml:mn>1000</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf158">
<mml:math id="m158">
<mml:mrow>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>&#x2236;</mml:mo>
<mml:msub>
<mml:mi>N</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2000</mml:mn>
<mml:mo>&#x2236;</mml:mo>
<mml:mn>1000</mml:mn>
<mml:mo>&#x2236;</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Correlation between phenotypes</th>
<th align="left">Sample size in each cohort</th>
<th align="left">Number of overlapped samples</th>
<th align="left">Significance level</th>
<th align="left">Meta-TOW-S</th>
<th align="left">Meta-Vegas</th>
<th align="left">Meta-SKAT</th>
<th align="left">Meta-burden</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">0.5</td>
<td rowspan="3" align="left">1,000:1,000:1,000</td>
<td rowspan="3" align="left">500</td>
<td align="left">1.00E-04</td>
<td align="left">1.03E-04</td>
<td align="left">9.21E-05</td>
<td align="left">8.81E-05</td>
<td align="left">1.02E-04</td>
</tr>
<tr>
<td align="left">1.00E-05</td>
<td align="left">9.66E-06</td>
<td align="left">9.53E-06</td>
<td align="left">7.88E-06</td>
<td align="left">1.03E-05</td>
</tr>
<tr>
<td align="left">2.5E-06</td>
<td align="left">2.27E-06</td>
<td align="left">2.74E-06</td>
<td align="left">2.21E-06</td>
<td align="left">2.33E-06</td>
</tr>
<tr>
<td rowspan="3" align="left">0.5</td>
<td rowspan="3" align="left">2,000:1,000:500</td>
<td rowspan="3" align="left">500</td>
<td align="left">1.00E-04</td>
<td align="left">9.94E-05</td>
<td align="left">9.15E-05</td>
<td align="left">9.04E-05</td>
<td align="left">9.80E-05</td>
</tr>
<tr>
<td align="left">1.00E-05</td>
<td align="left">9.45E-06</td>
<td align="left">8.63E-06</td>
<td align="left">8.47E-06</td>
<td align="left">9.92E-06</td>
</tr>
<tr>
<td align="left">2.5E-06</td>
<td align="left">3.05E-06</td>
<td align="left">2.42E-06</td>
<td align="left">2.32E-06</td>
<td align="left">2.67E-06</td>
</tr>
<tr>
<td rowspan="3" align="left">1</td>
<td rowspan="3" align="left">1,000:1,000:1,000</td>
<td rowspan="3" align="left">500</td>
<td align="left">1.00E-04</td>
<td align="left">1.11E-04</td>
<td align="left">9.09E-05</td>
<td align="left">8.72E-05</td>
<td align="left">1.00E-04</td>
</tr>
<tr>
<td align="left">1.00E-05</td>
<td align="left">9.82E-06</td>
<td align="left">8.31E-06</td>
<td align="left">7.69E-06</td>
<td align="left">1.02E-05</td>
</tr>
<tr>
<td align="left">2.5E-06</td>
<td align="left">2.40E-06</td>
<td align="left">2.23E-06</td>
<td align="left">1.74E-06</td>
<td align="left">2.64E-06</td>
</tr>
<tr>
<td rowspan="3" align="left">1</td>
<td rowspan="3" align="left">2,000:1,000:500</td>
<td rowspan="3" align="left">500</td>
<td align="left">1.00E-04</td>
<td align="left">8.97E-05</td>
<td align="left">9.10E-05</td>
<td align="left">8.79E-05</td>
<td align="left">1.02E-04</td>
</tr>
<tr>
<td align="left">1.00E-05</td>
<td align="left">8.17E-06</td>
<td align="left">8.36E-06</td>
<td align="left">7.92E-06</td>
<td align="left">1.02E-05</td>
</tr>
<tr>
<td align="left">2.5E-06</td>
<td align="left">1.93E-06</td>
<td align="left">2.45E-06</td>
<td align="left">1.93E-06</td>
<td align="left">2.61E-06</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-2">
<title>4.2 Power comparisons</title>
<p>We compare the empirical power of Meta-TOW-S with Meta-Vegas, Meta-SKAT, and Meta-Burden. The power is defined as the proportion of test statistics with <italic>p</italic>-values less than the nominal significance level and we evaluate power at the nominal significance level <inline-formula id="inf159">
<mml:math id="m159">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> after Bonferroni correction. For power comparisons, we generate phenotypes under the alternative hypothesis where the genetic effect is added correspondingly. Under each simulation setting, we generate 10,000 datasets to evaluate power at the nominal significance level <inline-formula id="inf160">
<mml:math id="m160">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. For the overlapping individuals, the phenotypic correlation between any pair of phenotypes across four scenarios: <inline-formula id="inf161">
<mml:math id="m161">
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf162">
<mml:math id="m162">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. A correlation of 1 indicates a simulation of mimicking one phenotype but from different cohorts. The primary simulations explore four distinct schemes. The first scheme compares the set-based association test for multiple phenotypes with that for a single phenotype. The second scheme evaluates the set-based association test for multiple phenotypes across varying numbers of traits, which are <inline-formula id="inf163">
<mml:math id="m163">
<mml:mrow>
<mml:mi>K</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf164">
<mml:math id="m164">
<mml:mrow>
<mml:mi>K</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>10</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The third scheme tests the performance of a weighting scheme we designed within the Cauchy combination. The final scheme evaluates the set-based multiple phenotypes association test under different degrees of sample overlap. For simplicity, we set the heritability for each phenotype in the above scheme to be either <inline-formula id="inf165">
<mml:math id="m165">
<mml:mrow>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> or <inline-formula id="inf166">
<mml:math id="m166">
<mml:mrow>
<mml:msubsup>
<mml:mi>h</mml:mi>
<mml:mi>k</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> for <inline-formula id="inf167">
<mml:math id="m167">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>. The simulation results are depicted in <xref ref-type="fig" rid="F1">Figures 1</xref>&#x2013;<xref ref-type="fig" rid="F4">4</xref> for a heritability of <inline-formula id="inf168">
<mml:math id="m168">
<mml:mrow>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, while results for a heritability of <inline-formula id="inf169">
<mml:math id="m169">
<mml:mrow>
<mml:mn>0.3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> are presented in the <xref ref-type="sec" rid="s12">Supplementary Material</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Power comparison of Meta-TOW-S, Meta-Vegas, Meta-SKAT, and Meta-Burden with single phenotype-based set-level tests at the significance level <inline-formula id="inf170">
<mml:math id="m170">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. Three phenotypes are generated from different cohorts with different sample sizes <inline-formula id="inf171">
<mml:math id="m171">
<mml:mrow>
<mml:mn>1000</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf172">
<mml:math id="m172">
<mml:mrow>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf173">
<mml:math id="m173">
<mml:mrow>
<mml:mn>100</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, respectively. The heritability for each phenotype is <inline-formula id="inf174">
<mml:math id="m174">
<mml:mrow>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The overlapping individuals in all cohorts were <inline-formula id="inf175">
<mml:math id="m175">
<mml:mrow>
<mml:mn>50</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Four simulated scenarios are created: in scenario <bold>(A)</bold>, the phenotype correlation between any pair of phenotypes is set at <inline-formula id="inf176">
<mml:math id="m176">
<mml:mrow>
<mml:mn>0.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>; in scenario <bold>(B)</bold>, it is raised to <inline-formula id="inf177">
<mml:math id="m177">
<mml:mrow>
<mml:mn>0.2</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>; Scenario <bold>(C)</bold> features a phenotype correlation of <inline-formula id="inf178">
<mml:math id="m178">
<mml:mrow>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>; and in scenario <bold>(D)</bold>, the phenotype correlation is set at <inline-formula id="inf179">
<mml:math id="m179">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Power comparison of meta-analysis with different numbers of cohorts. <inline-formula id="inf180">
<mml:math id="m180">
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf181">
<mml:math id="m181">
<mml:mrow>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf182">
<mml:math id="m182">
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> phenotypes from different cohorts are generated, and the genetic heritability is fixed at <inline-formula id="inf183">
<mml:math id="m183">
<mml:mrow>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and is equally distributed among all causal variants in a region. All cohorts have the same sample size <inline-formula id="inf184">
<mml:math id="m184">
<mml:mrow>
<mml:mn>200</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The overlapping sample size is <inline-formula id="inf185">
<mml:math id="m185">
<mml:mrow>
<mml:mn>100</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> for all cohorts and the correlations between any pair of phenotypes in <bold>(A&#x2013;D)</bold> are <inline-formula id="inf186">
<mml:math id="m186">
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf187">
<mml:math id="m187">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, respectively.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g002.tif"/>
</fig>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Power comparison of meta-analysis in the presence of weighting and unweighting scheme in Cauchy combination. Three phenotypes from three cohorts are generated. The sample sizes of these three cohorts are <inline-formula id="inf188">
<mml:math id="m188">
<mml:mrow>
<mml:mn>2000</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf189">
<mml:math id="m189">
<mml:mrow>
<mml:mn>100</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, respectively. The heritability is fixed at <inline-formula id="inf190">
<mml:math id="m190">
<mml:mrow>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and is equally distributed among all causal variants in a region. The overlapping sample size is 10 among all cohorts and the correlations between any pair of phenotypes in <bold>(A&#x2013;D)</bold> are <inline-formula id="inf191">
<mml:math id="m191">
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf192">
<mml:math id="m192">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, respectively.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g003.tif"/>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Power comparison of meta-analysis in the presence of the same sample size in each cohort but different overlapping individuals. The sample size of each cohort is <inline-formula id="inf193">
<mml:math id="m193">
<mml:mrow>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and the heritability for each phenotype is <inline-formula id="inf194">
<mml:math id="m194">
<mml:mrow>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. The overlapping sample sizes are <inline-formula id="inf195">
<mml:math id="m195">
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf196">
<mml:math id="m196">
<mml:mrow>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and the correlations between any pair of phenotypes in <bold>(A&#x2013;D)</bold> are <inline-formula id="inf197">
<mml:math id="m197">
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf198">
<mml:math id="m198">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, respectively.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g004.tif"/>
</fig>
<p>We first compare the meta-analysis with single phenotype-based set-level tests. It shows that the integration of different cohorts to detect the genetic variants in a region associated with at least one phenotype could boost power (<xref ref-type="fig" rid="F1">Figure 1</xref>; <xref ref-type="sec" rid="s12">Supplementary Figure S1</xref>). Our proposed Meta-TOW-S has slightly better performance compared with Meta-Vegas, and Meta-SKAT, and is better performed than Meta-Burden since the Burden-based method has poor performance if the effects of causal variants are in different directions. We also found the increase in correlation between pairs of phenotypes will boost power as well, indicating that the integration of phenotypes will benefit to detect more pleiotropic genes.</p>
<p>Next, we vary the number of cohorts where each cohort has the same sample size <inline-formula id="inf199">
<mml:math id="m199">
<mml:mrow>
<mml:mn>200</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and genetic heritability <inline-formula id="inf200">
<mml:math id="m200">
<mml:mrow>
<mml:msup>
<mml:mi>h</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.01</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. Specifically, we create datasets with <inline-formula id="inf201">
<mml:math id="m201">
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf202">
<mml:math id="m202">
<mml:mrow>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf203">
<mml:math id="m203">
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> cohorts and assess the effect of incorporating multiple cohorts in the meta-analysis for the identification of pleiotropic genes. As illustrated in <xref ref-type="fig" rid="F2">Figure 2</xref>; <xref ref-type="sec" rid="s12">Supplementary Figure S2</xref>, Meta-TOW-S outperforms or performs equivalently well as the other three methods. It also shows that the increase in the number of cohorts boosts the power of all methods.</p>
<p>In Meta-TOW-S, Meta-Vegas, Meta-SKAT, and Meta-Burden, more weight is assigned to a large study, and a small weight is assigned to a small study in the Cauchy combination. We compare those four methods with the unweighting scheme where the weight is the same among all studies in the Cauchy combination. <xref ref-type="fig" rid="F3">Figure 3</xref>; <xref ref-type="sec" rid="s12">Supplementary Figure S3</xref> shows that the weighting scheme-based meta-analysis has a higher power compared with an unweighting scheme. Specifically, when the correlation between any pair of phenotypes is low, the weighting scheme meta-analysis exhibits significantly enhanced statistical power. Conversely, when the correlation between any pair of phenotypes is high, there is only a slight improvement in power. It indicates that the weighting scheme has a slight improvement when the genetic effect contributes to all three studies equivalently if the phenotype correlation is <inline-formula id="inf204">
<mml:math id="m204">
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, which represents homogeneity across cohorts for the same phenotype.</p>
<p>Last, we consider the situation in which the sample sizes are the same in each cohort but different proportions of overlapping individuals are shared among all cohorts. As expected in <xref ref-type="fig" rid="F4">Figure 4</xref>; <xref ref-type="sec" rid="s12">Supplementary Figure S4</xref>, Meta-TOW-S outperforms the other three methods, and Meta-Vegas and Meta-SKAT have comparable performance. It shows that the power is further improved when there are larger overlapping samples between studies if phenotypes are highly correlated, which could be attributed to reduced heterogeneity across cohorts.</p>
</sec>
<sec id="s4-3">
<title>4.3 Real data analysis</title>
<p>We apply these four methods to GWAS summary statistics for four psychiatric disorders available from the Psychiatric Genomics Consortium (PGC) (<xref ref-type="bibr" rid="B40">Sullivan et al., 2018</xref>). These phenotypes are attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder (ASD), bipolar disorder (BD), and schizophrenia (SCZ) (<xref ref-type="bibr" rid="B46">Zhang et al., 2021</xref>). The sample sizes for these four traits range from <inline-formula id="inf205">
<mml:math id="m205">
<mml:mrow>
<mml:mn>46</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>351</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> to <inline-formula id="inf206">
<mml:math id="m206">
<mml:mrow>
<mml:mn>105</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>318</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, with all individuals of European ancestry. We utilize the LD structure data from the 1,000 Genome Project Phase III European population as the reference in set-based multiple phenotype association test. The details of these four GWAS summary statistics are summarized in <xref ref-type="sec" rid="s12">Supplementary Table S1</xref>. The analysis of these four methods for the four psychiatric disorders is summarized using the UpSet plot shown in <xref ref-type="fig" rid="F5">Figure 5</xref>. We use the gene-based GWAS significance level <inline-formula id="inf207">
<mml:math id="m207">
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:msup>
<mml:mn>10</mml:mn>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> in the analysis. For Meta-Burden and Meta-SKAT, the weight in the gene-based tests are defined in relation to the minor allele frequency (MAF) of genetic varaints. Specifically, we use the default weight <inline-formula id="inf208">
<mml:math id="m208">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> for Meta-Burden and <inline-formula id="inf209">
<mml:math id="m209">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>B</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>t</mml:mi>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> for Meta-SKAT, where the beta distribution density function has pre-specified parameters <inline-formula id="inf210">
<mml:math id="m210">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf211">
<mml:math id="m211">
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf212">
<mml:math id="m212">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>A</mml:mi>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> of the <inline-formula id="inf213">
<mml:math id="m213">
<mml:mrow>
<mml:msup>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> variant. As a result, there are 557 genes detected by Meta-TOW-S, 517 genes detected by Meta-Vegas, 83 genes detected by Meta-SKAT, and 62 genes detected by Meta-Burden. These results indicate that Meta-TOW-S identifies more significant genes compared to Meta-Vegas, Meta-SKAT, and Meta-Burden.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Upset plot showing the number of overlapping detected genes between Meta-TOW-S, Meta-Vegas, Meta-SKAT, and Meta-Burden.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g005.tif"/>
</fig>
</sec>
<sec id="s4-4">
<title>4.4 Enrichment analysis</title>
<p>We implement the Gene Set Enrichment Analysis (GSEA) to analyze the significant genes identified by Meta-TOW-S that are enriched toward the top list of genes that are associated with specific biological pathways, processes, functions, or diseases (<xref ref-type="bibr" rid="B39">Subramanian et al., 2005</xref>). Gene Ontology (GO) is a community-based bioinformatics resource that employs ontologies to represent biological knowledge and describes information about gene and gene product information (<xref ref-type="bibr" rid="B33">Peng et al., 2017</xref>). Go is widely used to infer functional information for gene products, such as gene function enrichment, protein function prediction, and disease association analysis. And Go contains three categories: cellular component (CC), molecular function (MF: the biological function of the gene), and biological process (BP: pathways or larger processes that multiple gene products are involved in). KEGG (Kyoto Encyclopedia of Genes and Genomes) is a database resource for understanding high-level functions and utilities of the biological system. KEGG is used to search for the pathways associated with the identified genes. Detection of KEGG pathway database over-representation against a universal Homo Sapien background is assessed by hypergeometric tests (<xref ref-type="bibr" rid="B37">Solomon et al., 2022</xref>; <xref ref-type="bibr" rid="B12">Kanehisa et al., 2016</xref>). DisGeNET is an integrative and comprehensive resource of gene-disease associations from several public data sources and the literature (<xref ref-type="bibr" rid="B34">Pi&#xf1;ero et al., 2015</xref>; <xref ref-type="bibr" rid="B45">Yu et al., 2014</xref>). It contains gene-disease associations and variant-gene-disease associations. The disease enrichment analysis is used to assess whether the genes associated with multiple phenotypes in meta-analysis are overrepresented in specific gene sets. A Bonferroni corrected cutoff of 0.05 was used for the significance of the pathway and disease. For genes annotation, we employed the org.Hs.e.g.,.db package in R. This package offers an extensive set of annotations for the human genome, including mappings between different gene identifiers and detailed genomic features.</p>
<p>We use the 557 significant genes identified by Meta-TOW-S, which are associated with four psychiatric disorders, for the enrichment analysis. We perform enrichment analyses at two different levels: pathways and diseases (<xref ref-type="fig" rid="F6">Figures 6</xref> and <xref ref-type="fig" rid="F7">7</xref>; <xref ref-type="sec" rid="s12">Supplementary Figure S5</xref>). The top 20 enriched KEGG pathways are summarised in <xref ref-type="fig" rid="F6">Figure 6</xref> with results sorted from lowest to highest <italic>p</italic>-values. Of the 280 KEGG pathways tested, 37 were statistically significant after adjusting for multiple testing. The top KEGG pathways predominantly belong to groups associated with Human T-cell Leukemia virus 1 infection, Epstein-Barr virus infection, Antigen processing and presentation, etc. In the disease enrichment analysis, of the 5,496 disease tests, 455 had Bonferroni-corrected enrichment <italic>p</italic>-values lower than 0.05. The implicated genes are involved in Child Development Disorders Pervasive, Myasthenia Gravis, Vitiligo, Sarcoidosis, etc. (<xref ref-type="fig" rid="F7">Figure 7</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>KEGG analysis is performed using the DOSE package. The significant genes are detected by Meta-TOW-S. The size of the circles represents the number of differential genes in a pathway. GeneRatio is the ratio of the number of differentially expressed genes annotated in a pathway to the number of all genes annotated in these pathways.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g006.tif"/>
</fig>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Disease enrichment analysis is performed using DisGeNET. The significant genes are detected by Meta-TOW-S.</p>
</caption>
<graphic xlink:href="fgene-15-1359591-g007.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s5">
<title>5 Discussion</title>
<p>Much research suggests that many genes are associated with multiple correlated or even distinct phenotypes, and such associations have been termed cross-phenotype associations, which is relevant to pleiotropy in complex phenotypes (<xref ref-type="bibr" rid="B27">Li and Zhu, 2017</xref>). We propose a new method, Meta-TOW-S, which integrates association evidence of multiple phenotypes from study-specific GWAS summary statistics and thus detects the significant pleiotropic genes. This method Meta-TOW-S is based on the set-based test which uses the weights that maximize the score test statistic to increase power. To combine the test statistics from multiple GWAS cohorts, Meta-TOW-S uses the Cauchy combination by assigning more weight to a large GWAS to account for more biological information. In addition, Meta-TOW-S enables analysis across multiple phenotypes by accommodating overlapping samples from different cohorts.</p>
<p>To mimic real multiple comprehensive study-specific phenotypes, we develop a phenotype simulator that encompasses a simulation scheme capable of modeling multiple phenotypes with multiple underlying genetic loci, intricating noise structures, and different correlation patterns among the phenotypes with overlapped samples across different cohort studies. Our simulations show that the Type I error rates of Meta-TOW-S are well maintained under different conditions of phenotype correlation structures and overlapping samples and are more powerful than the other three comparison methods under most scenarios. We also find that the power of meta-analysis is significantly increased compared to the single phenotype set-based tests. A higher phenotype correlation, larger overlapping samples across multiple cohort studies, and more cohorts can increase power as well. We apply Meta-TOW-S to the summary statistics of four psychiatric disorders provided by the Psychiatric Genomics Consortium (PGC): attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder (ASD), bipolar disorder (BD), and schizophrenia (SCZ). As a result, 557 significant cross-phenotype associations are identified by Meta-TOW-S which is more than the number of genes identified by the other three methods. In the enrichment analysis, the gene sets of Child Development Disorders Pervasive are more enriched for genes associated with these four psychiatric disorders. In the KEGG pathway analysis, the significant genes identified by Meta-TOW-S showed notable enrichment in immune-related pathways rather than neurological processes. However, it is important to recognize that there is substantial evidence linking immune system involvement to neurological disorders. For instance, research has indicated that developmental disorders, such as Autism Spectrum Disorders (ASDs), can involve significant immune activity, including neuroinflammation, which plays a crucial role in the pathophysiology of these conditions (<xref ref-type="bibr" rid="B42">Vargas et al., 2005</xref>; <xref ref-type="bibr" rid="B1">Ashwood and Van de Water, 2004</xref>). Additionally, studies have demonstrated that elevated levels of regulatory T cells are associated with an increased risk of Attention-Deficit/Hyperactivity Disorder (ADHD) (<xref ref-type="bibr" rid="B2">&#xc7;etin et al., 2022</xref>), suggesting that immune dysregulation may contribute to the manifestation of neurological symptoms in certain contexts.</p>
<p>Meta-TOW-S has multiple advantages for identifying cross-phenotype associations. First of all, Meta-TOW-S can integrate information from multiple cohort studies to increase power and has the potential to detect more pleiotropic genes. Secondly, the test statistic has a standard Cauchy distribution under the null hypothesis and greatly reduces the computing time. Thirdly, this method is based on GWAS summary statistics from different cohort studies and GWAS summary statistics are more accessible than individual-level phenotype and genotype data The last point is that Meta-TOW-S gives more weight to the study with a larger sample size. Meanwhile, the developed phenotypes simulator can mimic complex structures in a meta-analysis which can be applied in other cross-phenotype analyses. However, Meta-TOW-S leverages information from correlated phenotypes to enhance its statistical power. Simulation results demonstrate that Meta-TOW-S surpasses other methods in power performance when phenotypic correlations are strong. However, when the correlation between phenotypes is weak, Meta-TOW-S may not outperform other methods, as it does not rely heavily on borrowing information from other phenotypes in such scenarios.</p>
<p>Currently, the framework of Meta-TOW-S needs to estimate the correlation matrix from a reference panel due to the unavailability of individual genotype data. However, the choice of the reference panel may influence the performance of Meta-TOW-S. The second challenge is that we use a correlation pruning procedure to ensure that the correlation matrix is fully ranked, which may drop some correlated variants in a gene.</p>
<p>In summary, the Meta-TOW-S method is a very useful method for detecting gene associations of multiple phenotypes from different GWAS cohorts. Meta-TOW-S has robust power and can handle different scenarios such as diverse phenotype correlation, and intricating cohort studies. The computational efficiency of Meta-TOW-S can also improve genetic discovery for hundreds of phenotypes across multiple GWAS cohorts in compliance with data privacy.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s12">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Ethics statement</title>
<p>Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants&#x2019;; legal guardians/next of kin in accordance with the national legislation and the institutional requirements.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>LZ: Formal Analysis, Methodology, Writing&#x2013;review and editing, Data curation, Writing&#x2013;original draft. SZ: Formal Analysis, Methodology, Writing&#x2013;original draft, Supervision. QS: Formal Analysis, Methodology, Writing&#x2013;review and editing.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.</p>
</sec>
<ack>
<p>Part of this research has been conducted using the UK Biobank Resource under application number 102999 and the NHGRI-EBI GWAS Catalog.</p>
</ack>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s12">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2024.1359591/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2024.1359591/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet2.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ashwood</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Van de Water</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Is autism an autoimmune disease?</article-title> <source>Autoimmun. Rev.</source> <volume>3</volume>, <fpage>557</fpage>&#x2013;<lpage>562</lpage>. <pub-id pub-id-type="doi">10.1016/j.autrev.2004.07.036</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>&#xc7;etin</surname>
<given-names>F. H.</given-names>
</name>
<name>
<surname>U&#xe7;ary&#x131;LMAZ</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>U&#xe7;ar</surname>
<given-names>H. N.</given-names>
</name>
<name>
<surname>Arta&#xe7;</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>G&#xfc;ler</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Duran</surname>
<given-names>S. A.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Regulatory T cells in children with attention deficit hyperactivity disorder: a case-control study</article-title>. <source>J. Neuroimmunol.</source> <volume>367</volume>, <fpage>577848</fpage>. <pub-id pub-id-type="doi">10.1016/j.jneuroim.2022.577848</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cirulli</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>White</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Read</surname>
<given-names>R. W.</given-names>
</name>
<name>
<surname>Elhanan</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Metcalf</surname>
<given-names>W. J.</given-names>
</name>
<name>
<surname>Tanudjaja</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Genome-wide rare variant analysis for thousands of phenotypes in over 70,000 exomes from two cohorts</article-title>. <source>Nat. Commun.</source> <volume>11</volume>, <fpage>542</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-020-14288-y</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Conneely</surname>
<given-names>K. N.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests</article-title>. <source>Am. J. Hum. Genet.</source> <volume>81</volume>, <fpage>1158</fpage>&#x2013;<lpage>1168</lpage>. <pub-id pub-id-type="doi">10.1086/522036</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davies</surname>
<given-names>R. B.</given-names>
</name>
</person-group> (<year>1980</year>). <article-title>Algorithm AS 155: the distribution of a linear combination of &#x3c7; 2 random variables</article-title>. <source>J. R. Stat. Soc. Ser. C Appl. Statistics</source> <volume>29</volume>, <fpage>323</fpage>&#x2013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.2307/2346911</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Demontis</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Walters</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mattheisen</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Als</surname>
<given-names>T. D.</given-names>
</name>
<name>
<surname>Agerbo</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder</article-title>. <source>Nat. Genet.</source> <volume>51</volume>, <fpage>63</fpage>&#x2013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0269-7</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dutta</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gagliano Taliun</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Weinstock</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Zawistowski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sidore</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Fritsche</surname>
<given-names>L. G.</given-names>
</name>
<etal/>
</person-group> (<year>2019a</year>). <article-title>Meta&#x2010;MultiSKAT: multiple phenotype meta&#x2010;analysis for region&#x2010;based association test</article-title>. <source>Genet. Epidemiol.</source> <volume>43</volume>, <fpage>800</fpage>&#x2013;<lpage>814</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22248</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dutta</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>Multi&#x2010;SKAT: general framework to test for rare&#x2010;variant association with multiple phenotypes</article-title>. <source>Genet. Epidemiol.</source> <volume>43</volume>, <fpage>4</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22156</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grove</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ripke</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Als</surname>
<given-names>T. D.</given-names>
</name>
<name>
<surname>Mattheisen</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Walters</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Won</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Identification of common genetic risk variants for autism spectrum disorder</article-title>. <source>Nat. Genet.</source> <volume>51</volume>, <fpage>431</fpage>&#x2013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0344-8</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ionita-Laza</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Unified sequence-based association tests allowing for multiple functional annotations and meta-analysis of noncoding variation in metabochip data</article-title>. <source>Am. J. Hum. Genet.</source> <volume>101</volume>, <fpage>340</fpage>&#x2013;<lpage>352</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2017.07.011</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chanda</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Alonso</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bader</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Arking</surname>
<given-names>D. E.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Gene-based tests of association</article-title>. <source>PLoS Genet.</source> <volume>7</volume>, <fpage>e1002177</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002177</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanehisa</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sato</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Kawashima</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Furumichi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tanabe</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>KEGG as a reference resource for gene and protein annotation</article-title>. <source>Nucleic acids Res.</source> <volume>44</volume>, <fpage>D457</fpage>&#x2013;<lpage>D462</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1070</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>E. E.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>C. H.</given-names>
</name>
<name>
<surname>Oh</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>FOLD: a method to optimize power in meta-analysis of genetic association studies with overlapping subjects</article-title>. <source>Bioinformatics</source> <volume>33</volume>, <fpage>3947</fpage>&#x2013;<lpage>3954</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx463</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kuonen</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Miscellanea. Saddlepoint approximations for distributions of quadratic forms in normal variables</article-title>. <source>Biometrika</source> <volume>86</volume>, <fpage>929</fpage>&#x2013;<lpage>935</lpage>. <pub-id pub-id-type="doi">10.1093/biomet/86.4.929</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Abecasis</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Rare-variant association analysis: study designs and statistical tests</article-title>. <source>Am. J. Hum. Genet.</source> <volume>95</volume>, <fpage>5</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2014.06.009</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Teslovich</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>General framework for meta-analysis of rare variants in sequencing association studies</article-title>. <source>Am. J. Hum. Genet.</source> <volume>93</volume>, <fpage>42</fpage>&#x2013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2013.05.010</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Shadrin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Van der Meer</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Hindley</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>S&#xf8;nderby</surname>
<given-names>I. E.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Efficient meta-analysis of multivariate genome-wide association studies with Meta-MOSTest</article-title>. <source>bioRxiv</source>. </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Fisher&#x27;s method of combining dependent statistics using generalizations of the gamma distribution with applications to genetic pleiotropic associations</article-title>. <source>Biostatistics</source> <volume>15</volume>, <fpage>284</fpage>&#x2013;<lpage>295</lpage>. <pub-id pub-id-type="doi">10.1093/biostatistics/kxt045</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Peloso</surname>
<given-names>G. M.</given-names>
</name>
<name>
<surname>Zhan</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Holmen</surname>
<given-names>O. L.</given-names>
</name>
<name>
<surname>Zawistowski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Meta-analysis of gene-level tests for rare variant association</article-title>. <source>Nat. Genet.</source> <volume>46</volume>, <fpage>200</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1038/ng.2852</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>J. Z.</given-names>
</name>
<name>
<surname>Mcrae</surname>
<given-names>A. F.</given-names>
</name>
<name>
<surname>Nyholt</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>Medland</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Wray</surname>
<given-names>N. R.</given-names>
</name>
<name>
<surname>Brown</surname>
<given-names>K. M.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>A versatile gene-based test for genome-wide association studies</article-title>. <source>Am. J. Hum. Genet.</source> <volume>87</volume>, <fpage>139</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2010.06.009</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H. H.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>A new chi-square approximation to the distribution of non-negative definite quadratic forms in non-central normal variables</article-title>. <source>Comput. Statistics &#x26; Data Analysis</source> <volume>53</volume>, <fpage>853</fpage>&#x2013;<lpage>856</lpage>. <pub-id pub-id-type="doi">10.1016/j.csda.2008.11.025</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Morrison</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Boerwinkle</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>ACAT: a fast and powerful p value combination method for rare-variant analysis in sequencing studies</article-title>. <source>Am. J. Hum. Genet.</source> <volume>104</volume>, <fpage>410</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2019.01.002</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>115</volume>, <fpage>393</fpage>&#x2013;<lpage>402</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.2018.1554485</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Multiple phenotype association tests using summary statistics in genome&#x2010;wide association studies</article-title>. <source>Biometrics</source> <volume>74</volume>, <fpage>165</fpage>&#x2013;<lpage>175</lpage>. <pub-id pub-id-type="doi">10.1111/biom.12735</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gaynor</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Dynamic incorporation of multiple <italic>in silico</italic> functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale</article-title>. <source>Nat. Genet.</source> <volume>52</volume>, <fpage>969</fpage>&#x2013;<lpage>983</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-020-0676-4</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Quick</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gaynor</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Powerful, scalable and resource-efficient meta-analysis of rare variant associations in large whole genome sequencing studies</article-title>. <source>Nat. Genet.</source> <volume>55</volume>, <fpage>154</fpage>&#x2013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-022-01225-6</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Cross-phenotype association analysis using summary statistics from GWAS</article-title>. <source>Stat. Hum. Genet. Methods Protoc.</source> <volume>1666</volume>, <fpage>455</fpage>&#x2013;<lpage>467</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-7274-6_22</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gaynor</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Selvaraj</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Arapoglou</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies</article-title>. <source>Nat. methods</source> <volume>19</volume>, <fpage>1599</fpage>&#x2013;<lpage>1611</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-022-01640-x</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morris</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Zeggini</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>An evaluation of statistical approaches to rare variant analysis in genetic association studies</article-title>. <source>Genet. Epidemiol.</source> <volume>34</volume>, <fpage>188</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.20450</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Neale</surname>
<given-names>B. M.</given-names>
</name>
<name>
<surname>Sham</surname>
<given-names>P. C.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>The future of association studies: gene-based analysis and replication</article-title>. <source>Am. J. Hum. Genet.</source> <volume>75</volume>, <fpage>353</fpage>&#x2013;<lpage>362</lpage>. <pub-id pub-id-type="doi">10.1086/423901</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Panagiotou</surname>
<given-names>O. A.</given-names>
</name>
<name>
<surname>Willer</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Hirschhorn</surname>
<given-names>J. N.</given-names>
</name>
<name>
<surname>Ioannidis</surname>
<given-names>J. P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>The power of meta-analysis in genome-wide association studies</article-title>. <source>Annu. Rev. genomics Hum. Genet.</source> <volume>14</volume>, <fpage>441</fpage>&#x2013;<lpage>465</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-091212-153520</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pardi&#xf1;as</surname>
<given-names>A. F.</given-names>
</name>
<name>
<surname>Holmans</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Pocklington</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Escott-Price</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Ripke</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Carrera</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection</article-title>. <source>Nat. Genet.</source> <volume>50</volume>, <fpage>381</fpage>&#x2013;<lpage>389</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0059-2</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hui</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shang</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Identifying term relations cross different gene ontology categories</article-title>. <source>BMC Bioinforma.</source> <volume>18</volume>, <fpage>573</fpage>&#x2013;<lpage>574</lpage>. <pub-id pub-id-type="doi">10.1186/s12859-017-1959-3</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pi&#xf1;ero</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Queralt-Rosinach</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Bravo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Deu-Pons</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bauer-Mehren</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Baron</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes</article-title>. <source>Database</source> <volume>2015</volume>, <fpage>bav028</fpage>. <pub-id pub-id-type="doi">10.1093/database/bav028</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ray</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Methods for meta&#x2010;analysis of multiple traits using GWAS summary statistics</article-title>. <source>Genet. Epidemiol.</source> <volume>42</volume>, <fpage>134</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22105</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sha</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Detecting association of rare and common variants by testing an optimally weighted combination of variants</article-title>. <source>Genet. Epidemiol.</source> <volume>36</volume>, <fpage>561</fpage>&#x2013;<lpage>571</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.21649</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Solomon</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Huen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Yousefi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>K&#xfc;pers</surname>
<given-names>L. K.</given-names>
</name>
<name>
<surname>Gonz&#xe1;lez</surname>
<given-names>J. R.</given-names>
</name>
<name>
<surname>Suderman</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>Meta-analysis of epigenome-wide association studies in newborns and children show widespread sex differences in blood DNA methylation</article-title>. <source>Mutat. Research/Reviews Mutat. Res.</source> <volume>789</volume>, <fpage>108415</fpage>. <pub-id pub-id-type="doi">10.1016/j.mrrev.2022.108415</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stahl</surname>
<given-names>E. A.</given-names>
</name>
<name>
<surname>Breen</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Forstner</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Mcquillin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ripke</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Trubetskoy</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Genome-wide association study identifies 30 loci associated with bipolar disorder</article-title>. <source>Nat. Genet.</source> <volume>51</volume>, <fpage>793</fpage>&#x2013;<lpage>803</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0397-8</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Subramanian</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tamayo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Mootha</surname>
<given-names>V. K.</given-names>
</name>
<name>
<surname>Mukherjee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ebert</surname>
<given-names>B. L.</given-names>
</name>
<name>
<surname>Gillette</surname>
<given-names>M. A.</given-names>
</name>
<etal/>
</person-group> <year>2005</year>. <article-title>Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles</article-title>, <source>Proc. Natl. Acad. Sci. U. S. A.</source>
<italic>,</italic> <volume>102</volume>, <fpage>15545</fpage>&#x2013;<lpage>15550</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0506580102</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sullivan</surname>
<given-names>P. F.</given-names>
</name>
<name>
<surname>Agrawal</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bulik</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Andreassen</surname>
<given-names>O. A.</given-names>
</name>
<name>
<surname>B&#xf8;rglum</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Breen</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Psychiatric genomics: an update and an agenda</article-title>. <source>Am. J. Psychiatry</source> <volume>175</volume>, <fpage>15</fpage>&#x2013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1176/appi.ajp.2017.17030283</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Svishcheva</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Belonogova</surname>
<given-names>N. M.</given-names>
</name>
<name>
<surname>Zorkoltseva</surname>
<given-names>I. V.</given-names>
</name>
<name>
<surname>Kirichenko</surname>
<given-names>A. V.</given-names>
</name>
<name>
<surname>Axenovich</surname>
<given-names>T. I.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Gene-based association tests using GWAS summary statistics</article-title>. <source>Bioinformatics</source> <volume>35</volume>, <fpage>3701</fpage>&#x2013;<lpage>3708</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz172</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vargas</surname>
<given-names>D. L.</given-names>
</name>
<name>
<surname>Nascimbene</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Krishnan</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zimmerman</surname>
<given-names>A. W.</given-names>
</name>
<name>
<surname>Pardo</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Neuroglial activation and neuroinflammation in the brain of patients with autism</article-title>. <source>Ann. Neurology Official J. Am. Neurological Assoc. Child Neurology Soc.</source> <volume>57</volume>, <fpage>67</fpage>&#x2013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1002/ana.20315</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Boehnke</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Rare-variant association testing for sequencing data with the sequence kernel association test</article-title>. <source>Am. J. Hum. Genet.</source> <volume>89</volume>, <fpage>82</fpage>&#x2013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2011.05.029</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2022</year>). <source>Statistical methods for controlling population stratification and gene-based association studies</source>. <publisher-name>Michigan Technological University</publisher-name>.</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.-G.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>G.-R.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Q.-Y.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>DOSE: an R/Bioconductor package for disease ontology semantic and enrichment analysis</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>608</fpage>&#x2013;<lpage>609</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu684</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traits</article-title>. <source>Genome Biol.</source> <volume>22</volume>, <fpage>262</fpage>&#x2013;<lpage>330</lpage>. <pub-id pub-id-type="doi">10.1186/s13059-021-02478-w</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tayo</surname>
<given-names>B. O.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Young</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Franceschini</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension</article-title>. <source>Am. J. Hum. Genet.</source> <volume>96</volume>, <fpage>21</fpage>&#x2013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2014.11.011</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>