<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Appl. Math. Stat.</journal-id>
<journal-title>Frontiers in Applied Mathematics and Statistics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Appl. Math. Stat.</abbrev-journal-title>
<issn pub-type="epub">2297-4687</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fams.2022.873746</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Applied Mathematics and Statistics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Novel Correction for the Adjusted Box-Pierce Test</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Danioko</surname> <given-names>Sidy</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1492648/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zheng</surname> <given-names>Jianwei</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1203701/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Anderson</surname> <given-names>Kyle</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Barrett</surname> <given-names>Alexander</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Rakovski</surname> <given-names>Cyril S.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1003746/overview"/>
</contrib>
</contrib-group>
<aff><institution>Schmid College of Science and Technology, Chapman University</institution>, <addr-line>Orange, CA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Avner Bar-Hen, Conservatoire National des Arts et M&#x000E9;tiers (CNAM), France</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Hossein Hassani, University of Tehran, Iran; Christian Derquenne, Electricit&#x000E9; de France, France</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Jianwei Zheng <email>zheng120&#x00040;mail.chapman.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Statistics and Probability, a section of the journal Frontiers in Applied Mathematics and Statistics</p></fn></author-notes>
<pub-date pub-type="epub">
<day>19</day>
<month>05</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>8</volume>
<elocation-id>873746</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>04</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Danioko, Zheng, Anderson, Barrett and Rakovski.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Danioko, Zheng, Anderson, Barrett and Rakovski</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The classical Box-Pierce and Ljung-Box tests for auto-correlation of residuals possess severe deviations from nominal type I error rates. Previous studies have attempted to address this issue by either revising existing tests or designing new techniques. The Adjusted Box-Pierce achieves the best results with respect to attaining type I error rates closer to nominal values. This research paper proposes a further correction to the adjusted Box-Pierce test that possesses near perfect type I error rates. The approach is based on an inflation of the rejection region for all sample sizes and lags calculated <italic>via</italic> a linear model applied to simulated data that encompasses a large range of data scenarios. Our results show that the new approach possesses the best type I error rates of all goodness-of-fit time series statistics.</p></abstract>
<kwd-group>
<kwd>model selection</kwd>
<kwd>residuals</kwd>
<kwd>auto-correlation</kwd>
<kwd>type I error</kwd>
<kwd>diagnostic test</kwd>
<kwd>portmanteau Q statistic</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="7"/>
<equation-count count="10"/>
<ref-count count="19"/>
<page-count count="7"/>
<word-count count="3610"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>The Box-Jenkins algorithm is a1 general systematic approach for model checking of a time series model. Examples of the approach can be found in [<xref ref-type="bibr" rid="B1">1</xref>&#x02013;<xref ref-type="bibr" rid="B3">3</xref>]. A well-fitting model produces residuals that are free of correlation. Thus, standard goodness-of-fit approaches are in essence global tests for absence of correlation among estimated residuals. Accordingly, many statistical techniques have been designed to assess the absence of correlation among the time series model residuals.</p>
<p>Following classical notation, let {<italic>X</italic><sub><italic>t</italic></sub>} be an observed time series generated by a stationary and invertible ARMA(<italic>p,q</italic>) process &#x003D5;(<italic>B</italic>)<italic>X</italic><sub><italic>t</italic></sub> = &#x003B8;(<italic>B</italic>)&#x003F5;<sub><italic>t</italic></sub>, where &#x003D5;(<italic>B</italic>) and &#x003B8;(<italic>B</italic>) are the autoregressive and moving average characteristic polynomial and <inline-formula><mml:math id="M1"><mml:msup><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">B</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the backshift operator. The desired parameters, &#x003D5;<sub><italic>i</italic></sub> and &#x003B8;<sub><italic>i</italic></sub> are estimated using maximum likelihood or least squares methods to obtain <inline-formula><mml:math id="M2"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003D5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M3"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003B8;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, the residuals are calculated <italic>via</italic> <inline-formula><mml:math id="M4"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003F5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003B8;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>B</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003D5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>B</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">X</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and the sample auto-correlation coefficients are in turn obtained from <inline-formula><mml:math id="M5"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003F5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003F5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003F5;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>.</p>
<p>In recent years, many techniques have been employed to test the global hypothesis of all autocorrelations up to a certain lag, <bold>H</bold><sub>0</sub> : <italic>r</italic><sub>1</sub> &#x0003D; <italic>r</italic><sub>2</sub> &#x0003D; &#x02026; &#x0003D; <italic>r</italic><sub><italic>m</italic></sub> &#x0003D; 0. In general, these techniques are designed as weighted sums of squares of the estimated autocorrelations and they can produce misleading conclusions due to deviations from the asymptotic limiting distribution in moderate size samples [<xref ref-type="bibr" rid="B4">4</xref>&#x02013;<xref ref-type="bibr" rid="B6">6</xref>]. Thus, a new and more robust test is proposed in this research that attains precise type I error rates for all sample sizes.</p>
<p>The history of portmanteau tests traces its roots back to the Box-Pierce diagnostic test defined as [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>]:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M6"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>B</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mrow><mml:msubsup><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where <italic>n</italic>, <italic>m</italic>, and <inline-formula><mml:math id="M7"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> represent the sample size, number of lags being tested and the sample auto-correlation of order <italic>k</italic> of the residuals, respectively. The authors showed that the asymptotic distribution of <italic>Q</italic><sub><italic>BP</italic></sub> is approximately &#x003C7;<sup>2</sup>(<italic>m-p-q</italic>) but considerable deviations for moderate sample sizes have been observed [<xref ref-type="bibr" rid="B7">7</xref>&#x02013;<xref ref-type="bibr" rid="B9">9</xref>]. That deficiency entails imperfections of type I error rates and prompted the design of a weighted and improved versions of the test. In their stimulation studies, Ray and Xiaolou [<xref ref-type="bibr" rid="B4">4</xref>] focused on investigating the type I errors in the <inline-formula><mml:math id="M8"><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula> setting. They remarked that the Box-Pierce test has imperfect type I error rates for most sample size and lag values.</p>
<p>Ljung and Box [<xref ref-type="bibr" rid="B7">7</xref>] were the first ones to propose a design that assigns larger weights to residuals estimated with more data:</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M9"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>B</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mo stretchy='false'>)</mml:mo><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mrow><mml:mfrac><mml:mrow><mml:msubsup><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mrow><mml:mfrac><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle><mml:msubsup><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>k</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
<p>The Box-Pierce and Ljung-Box tests are asymptotically equivalent. The Ljung-Box test has been shown to overcorrect in moderate samples [<xref ref-type="bibr" rid="B4">4</xref>]. They also realized that Ljung-Box inflates the test statistic using a variance estimate of the residuals. They further showed that on moderate sized data, <italic>Q</italic><sub><italic>LB</italic></sub> rejects too often because the test statistic is too small.</p>
<p>Li and McLeod [<xref ref-type="bibr" rid="B9">9</xref>] refined the <italic>Q</italic><sub><italic>BP</italic></sub> test by proposing the following statistic,</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M10"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>B</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>B</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mfrac><mml:mrow><mml:mi>m</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>n</mml:mi></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>m</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>n</mml:mi></mml:mrow></mml:mfrac><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<p>This approach only corrects the mean of the Box-Pierce statistic and consequently fails to properly adjust the type I error rates.</p>
<p>Monti [<xref ref-type="bibr" rid="B10">10</xref>] proposed a portmanteau test based on the residual partial autocorrelations. The test is defined as,</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M11"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mtext class="textit" mathvariant="italic">Q</mml:mtext></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mfrac><mml:mrow><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003C0;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Monti [<xref ref-type="bibr" rid="B10">10</xref>] showed <italic>via</italic> simulations that the performance of <italic>Q</italic><sub><italic>M</italic></sub> is comparable to that <italic>Q</italic><sub><italic>LB</italic></sub>. In addition, he concluded that in certain scenarios, <italic>Q</italic><sub><italic>LB</italic></sub> outperforms <italic>Q</italic><sub><italic>M</italic></sub>.</p>
<p>Pe&#x000F1;a and Rodr&#x000ED;guez [<xref ref-type="bibr" rid="B11">11</xref>] proposed a test based on a different measure of dependence of the residual autocorrelations,</p>
<disp-formula id="E5"><label>(5)</label><mml:math id="M12"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:mtext class="textit" mathvariant="italic">D</mml:mtext><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M13"><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>R</mml:mi><mml:mo>&#x002DC;</mml:mo></mml:mover><mml:mi>m</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x02026;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x02026;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022F1;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x022EE;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mover accent='true'><mml:mi>r</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x02026;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>In their work, the authors showed that under particular conditions, their test greatly outperformed <italic>Q</italic><sub><italic>LB</italic></sub> test. Furthermore, they demonstrated that the test had an advantage over the McLeod and Li&#x00027;s test regardless of sample size. However, the convergence of the asymptotic distribution of the test developed by Pe&#x000F1;a and Rodr&#x000ED;guez is very slow [<xref ref-type="bibr" rid="B12">12</xref>].</p>
<p>Fisher proposed new weighted versions of the Box-Pierce and Monti&#x00027;s tests, the Q statistic [<xref ref-type="bibr" rid="B5">5</xref>]:</p>
<disp-formula id="E7"><label>(7)</label><mml:math id="M14"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>W</mml:mi><mml:mi>L</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mfrac><mml:mrow><mml:mi>m</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>and</p>
<disp-formula id="E8"><label>(8)</label><mml:math id="M15"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>Q</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>W</mml:mi><mml:mi>M</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover></mml:mstyle><mml:mfrac><mml:mrow><mml:mi>m</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003C0;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>A comparison simulation study by Safi and Al-Reqep [<xref ref-type="bibr" rid="B13">13</xref>] showed that for small sample size and <italic>m</italic> values <italic>Q</italic><sub><italic>WL</italic></sub> performs better than <italic>Q</italic><sub><italic>LB</italic></sub>. For moderate sample sized data, they also found that <italic>Q</italic><sub><italic>WL</italic></sub> does better than <italic>Q</italic><sub><italic>LB</italic></sub> and <italic>Q</italic><sub><italic>WM</italic></sub> outperforms <italic>Q</italic><sub><italic>M</italic></sub>.</p>
<p>To remedy some of the shortcomings of all previously existing tests, Kan and Wang [<xref ref-type="bibr" rid="B4">4</xref>] proposed a new modification of the portmanteau test, widely called the adjusted Box-Pierce test. They defined their statistic as,</p>
<disp-formula id="E9"><label>(9)</label><mml:math id="M16"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mtext class="textit" mathvariant="italic">Q</mml:mtext></mml:mrow><mml:mrow><mml:mi>B</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mtext class="textit" mathvariant="italic">m</mml:mtext><mml:mo>&#x0002B;</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mtext class="textit" mathvariant="italic">Q</mml:mtext></mml:mrow><mml:mrow><mml:mtext class="textit" mathvariant="italic">BP</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:msqrt><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mtext class="textit" mathvariant="italic">Q</mml:mtext></mml:mrow><mml:mrow><mml:mi>B</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mtext class="textit" mathvariant="italic">E</mml:mtext><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mtext class="textit" mathvariant="italic">Q</mml:mtext></mml:mrow><mml:mrow><mml:mi>B</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The authors conducted an evaluation of various tests including Box-Pierce and Ljung-Box. The design of the adjusted Box-Pierce statistic (9) explicitly recenters and rescales <italic>Q</italic><sub><italic>BP</italic></sub> to attain the mean and variance of a &#x003C7;<sup>2</sup>(<italic>m</italic>) variable. The authors showed through simulations that the test possesses very good adherence to nominal type I error rates. In their comparison study, they found that both the distributions of <italic>Q</italic><sub><italic>BP</italic></sub> and <italic>Q</italic><sub><italic>LB</italic></sub> deviate from the expected variance of &#x003C7;<sup>2</sup>(<italic>m</italic>) distribution for small and moderate sample sizes and almost all choices for the value of <italic>m</italic>.</p>
<p>All of the above-mentioned tests exhibit deviations from the nominal type I error rates that compromise their performance. Hassani and Yeganegi [<xref ref-type="bibr" rid="B14">14</xref>, <xref ref-type="bibr" rid="B15">15</xref>] conducted simulation studies to evaluate the optimal lag value for the Ljung-Box test. They found that the optimal number of lags not only depends on the length of the time series, but also on the significance level of the test. Thus, a new approach is proposed which aims at correcting the rejection region instead of redesigning the test statistic itself. This technique was introduced by Bernard in his effort to construct a more powerful alternative to Fisher&#x00027;s exact test [<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B17">17</xref>] and later by Boschloo [<xref ref-type="bibr" rid="B18">18</xref>]. The same idea of rejection region correction has been recently employed by Ehwerhemuepha et al. [<xref ref-type="bibr" rid="B19">19</xref>] to produce the best performing test for homogeneity for multinational distributions.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>2. Methods</title>
<p>A model based correction of the rejection region of the adjusted Box-Pierce test was designed. A large scale simulation study was then conducted to not only estimate the correction, but to also assess the performance advantages (defined as adherence to the nominal type I error rates for all scenarios) of the proposed corrected method.</p>
<sec>
<title>2.1. Simulation Study</title>
<p>For sample size values of <italic>n</italic> &#x0003D; 40, 50, &#x02026;, 300, we simulated 10<sup>6</sup> white noise samples, <inline-formula><mml:math id="M17"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mn>1</mml:mn><mml:msup><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>&#x0007E;</mml:mo><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>I</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>. These mimic the behavior of residuals of a well-fitting time series model (under the null). Next, the adjusted Box-Pierce test was applied to every sample and for all possible lags, <italic>m</italic> (2 &#x02264; <italic>m</italic> &#x02264; <italic>n</italic>&#x02212;1) and the corresponding <italic>p</italic>-values, <inline-formula><mml:math id="M18"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>m</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>m</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>m</mml:mi><mml:mn>1</mml:mn><mml:msup><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math></inline-formula> were obtained. For each pair (<italic>n, m</italic>), the estimated the type I error rate of the adjusted Box-Pierce test at alpha level of 0.05 was empirically estimated by <inline-formula><mml:math id="M19"><mml:msubsup><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>05</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:msup><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:munderover><mml:mi>I</mml:mi><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003C;</mml:mo><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>05</mml:mn></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>/</mml:mo><mml:mn>1</mml:mn><mml:msup><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Thus, for each sample size <italic>n</italic>, <italic>n</italic>&#x02212;2 empirically estimated type I error rates yielding a dataset with three columns, <italic>n</italic>, <italic>m</italic>, and <inline-formula><mml:math id="M20"><mml:msubsup><mml:mrow><mml:mstyle class="text"><mml:mtext class="textit" mathvariant="italic">P</mml:mtext></mml:mstyle></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>05</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>. Further, these datasets obtained from all individual sample sizes <italic>n</italic> were stacked to get an aggregated dataset with number of rows <inline-formula><mml:math id="M21"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>4</mml:mn></mml:mrow><mml:mrow><mml:mn>30</mml:mn></mml:mrow></mml:munderover><mml:mn>10</mml:mn><mml:mi>n</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mn>2</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>934</mml:mn><mml:mo>,</mml:mo><mml:mn>920</mml:mn></mml:math></inline-formula>.</p>
</sec>
<sec>
<title>2.2. Linear Model</title>
<p>The primary idea of this study was to provide a model-based correction to the rejection region of the adjusted Box-Pierce test in order to attain improved type I error rates for all sample sizes and lags. We created six linear regression models trained on the simulated data described in the section above. These six models were trained on different subsets of the data split into sample size intervals [0, 50], [51, 70], [71, 90], [91, 120], [121, 200], and [201, 300]. The difference in the type I error rate patterns for distinct sample seizes (shown in <xref ref-type="fig" rid="F1">Figure 1</xref>) necessitated the use of separate models to achieve the desired level of fit. These linear models are complex as they encompass different powers of <italic>n</italic>, <italic>m</italic>, and their 2-way interactions. The general formula adopted for the models was,</p>
<disp-formula id="E10"><label>(10)</label><mml:math id="M22"><mml:mtable class="eqnarray" columnalign="right center left"><mml:mtr><mml:mtd><mml:mtable style="text-align:axis;" equalrows="false" columnlines="none" equalcolumns="false" class="array"><mml:mtr><mml:mtd><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mn>0</mml:mn><mml:mo>.</mml:mo><mml:mn>05</mml:mn><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msup><mml:mo>*</mml:mo><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>s</mml:mi></mml:mrow></mml:msup><mml:mo>*</mml:mo><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>p</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>s</mml:mi></mml:mrow></mml:msup></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>s</mml:mi></mml:mrow></mml:msup><mml:mo>*</mml:mo><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>p</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>s</mml:mi></mml:mrow></mml:msup><mml:mo>*</mml:mo><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>p</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn><mml:mi>p</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn><mml:mi>p</mml:mi></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Further, within the general form (10) an extensive grid search to find the best values of the power transformation parameters <italic>s</italic> and <italic>p</italic> was performed. The type I error rates from the selected best models are presented in <xref ref-type="table" rid="T1">Table 1</xref>. The rates were calculated using validation data with sample sizes of <italic>n</italic><sub><italic>val</italic></sub> &#x0003D; 45, 65, 85, 100, 250.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Parametric correction to the rejection region for sample sizes 50, 70, 90, 130, 200, and 300.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-873746-g0001.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Performance summary of the correction to the Adjusted Box-Pierce.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Sample size</bold></th>
<th valign="top" align="center"><bold>s</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic></th>
<th valign="top" align="center"><bold>AdjBoxPierce</bold></th>
<th valign="top" align="center"><bold>Corrected version</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic> = 45</td>
<td valign="top" align="center">0.2</td>
<td valign="top" align="center">0.3</td>
<td valign="top" align="center">0.04868907</td>
<td valign="top" align="center">0.05001953</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic> = 65</td>
<td valign="top" align="center">10.0</td>
<td valign="top" align="center">1.0</td>
<td valign="top" align="center">0.05163921</td>
<td valign="top" align="center">0.05002905</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic> = 85</td>
<td valign="top" align="center">7.0</td>
<td valign="top" align="center">2.0</td>
<td valign="top" align="center">0.05305157</td>
<td valign="top" align="center">0.05045904</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic> = 100</td>
<td valign="top" align="center">1.3</td>
<td valign="top" align="center">1.7</td>
<td valign="top" align="center">0.05447408</td>
<td valign="top" align="center">0.05020469</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic> = 160</td>
<td valign="top" align="center">0.8</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">0.05629981</td>
<td valign="top" align="center">0.04987525</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic> = 250</td>
<td valign="top" align="center">1.9</td>
<td valign="top" align="center">0.8</td>
<td valign="top" align="center">0.05813593</td>
<td valign="top" align="center">0.05037286</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>3. Results</title>
<p>Noticeable differences between the patterns of type I error rates across the analyzed sample sizes (40&#x02013;300) were discovered. Therefore, sample-size specific models (0&#x02013;50, 51&#x02013;70, 71&#x02013;90, 91&#x02013;120, 120&#x02013;200, 201&#x02013;300) were constructed to capture the exact pattern for that particular scenario. <xref ref-type="table" rid="T1">Table 1</xref> displays a condensed form of the comparative study between revised version of Box-Pierce, which to the best of our knowledge is the last version, and the correction that we have brought into the study. For different time series lengths, the corresponding <italic>s</italic>- and <italic>p</italic>-values along with the type I error rates for the adjusted Box-Pierce and those of the corrected version that we designed. It is important to realize that the results from the implementation of these models show that in all settings, the proposed regression-based correction provided almost perfect type I error rates. In particular, the adjusted type I error rates after the correction to the rejection regions were exactly 0.05 with detailed results.</p>
<p><xref ref-type="table" rid="T2">Tables 2</xref>&#x02013;<xref ref-type="table" rid="T7">7</xref> show detailed summary from the sample-size specific model fits. These models provide a parametric correction of the type I error rates. Graphical representation of results from the implementation of these models for several scenarios are shown in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Summary statistics for selected variables in interval sample size &#x0003C;50.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">0.425295</td>
<td valign="top" align="center">0.251604</td>
<td valign="top" align="center">1.690</td>
<td valign="top" align="center">0.095008</td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.353900</td>
<td valign="top" align="center">0.793110</td>
<td valign="top" align="center">&#x02212;1.707</td>
<td valign="top" align="center">0.091837</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">0.593460</td>
<td valign="top" align="center">0.396921</td>
<td valign="top" align="center">1.495</td>
<td valign="top" align="center">0.138960</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">0.149028</td>
<td valign="top" align="center">0.056476</td>
<td valign="top" align="center">2.639</td>
<td valign="top" align="center">0.010065<sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;0.183531</td>
<td valign="top" align="center">0.122355</td>
<td valign="top" align="center">&#x02212;1.500</td>
<td valign="top" align="center">0.137706</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;0.070355</td>
<td valign="top" align="center">0.030893</td>
<td valign="top" align="center">&#x02212;2.277</td>
<td valign="top" align="center">0.025539<sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">0.004419</td>
<td valign="top" align="center">0.002064</td>
<td valign="top" align="center">2.141</td>
<td valign="top" align="center">0.035436<sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;0.017762</td>
<td valign="top" align="center">0.004355</td>
<td valign="top" align="center">&#x02212;4.079</td>
<td valign="top" align="center">0.000109<xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>5<italic>p</italic></sup></td>
<td valign="top" align="center">0.002106</td>
<td valign="top" align="center">0.000461</td>
<td valign="top" align="center">4.570</td>
<td valign="top" align="center">1.83e-05<xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbols &#x0002A;,</italic></p>
<fn id="TN1"><label>&#x0002A;&#x0002A;&#x0002A;</label><p><italic>designate the statistical significance level of the variables in a given model</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Summary statistics for selected variables in finite sample size between 51 and 70.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;2.652e-06</td>
<td valign="top" align="center">8.296e-07</td>
<td valign="top" align="center">&#x02212;3.196</td>
<td valign="top" align="center">0.00179 <sup>&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">1.209e-03</td>
<td valign="top" align="center">2.984e-04</td>
<td valign="top" align="center">4.053</td>
<td valign="top" align="center">9.12e-05 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;2.283e-07</td>
<td valign="top" align="center">7.347e-08</td>
<td valign="top" align="center">&#x02212;3.108</td>
<td valign="top" align="center">0.00237 <sup>&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;2.068e-12</td>
<td valign="top" align="center">3.852e-13</td>
<td valign="top" align="center">&#x02212;5.369</td>
<td valign="top" align="center">4.07e-07 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">4.910e-10</td>
<td valign="top" align="center">1.869e-10</td>
<td valign="top" align="center">2.627</td>
<td valign="top" align="center">0.00977 <sup>&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">4.637e-16</td>
<td valign="top" align="center">8.877e-17</td>
<td valign="top" align="center">5.223</td>
<td valign="top" align="center">7.75e-07 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.167e-18</td>
<td valign="top" align="center">2.439e-19</td>
<td valign="top" align="center">&#x02212;4.784</td>
<td valign="top" align="center">5.05e-06 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4<italic>p</italic></sup></td>
<td valign="top" align="center">6.138e-10</td>
<td valign="top" align="center">2.856e-10</td>
<td valign="top" align="center">2.150</td>
<td valign="top" align="center">0.03364 <sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>5<italic>p</italic></sup></td>
<td valign="top" align="center">2.552e-12</td>
<td valign="top" align="center">1.811e-12</td>
<td valign="top" align="center">1.409</td>
<td valign="top" align="center">0.16150</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbols &#x0002A;, &#x0002A;&#x0002A;, &#x0002A;&#x0002A;&#x0002A; designate the statistical significance level of the variables in a given model</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Summary statistics for selected variables in finite sample size between 71 and 90.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">3.214e-17</td>
<td valign="top" align="center">2.901e-17</td>
<td valign="top" align="center">1.108</td>
<td valign="top" align="center">0.269585</td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">3.833e-06</td>
<td valign="top" align="center">1.130e-06</td>
<td valign="top" align="center">3.392</td>
<td valign="top" align="center">0.000877 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.392e-20</td>
<td valign="top" align="center">3.309e-20</td>
<td valign="top" align="center">&#x02212;0.421</td>
<td valign="top" align="center">0.674609</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;4.627e-36</td>
<td valign="top" align="center">6.406e-37</td>
<td valign="top" align="center">&#x02212;7.224</td>
<td valign="top" align="center">2.02e-11 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;6.756e-31</td>
<td valign="top" align="center">6.616e-31</td>
<td valign="top" align="center">&#x02212;1.021</td>
<td valign="top" align="center">0.308740</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">9.423e-50</td>
<td valign="top" align="center">1.523e-50</td>
<td valign="top" align="center">6.189</td>
<td valign="top" align="center">5.00e-09 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.759e-54</td>
<td valign="top" align="center">4.077e-55</td>
<td valign="top" align="center">&#x02212;4.315</td>
<td valign="top" align="center">2.80e-05 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4<italic>p</italic></sup></td>
<td valign="top" align="center">2.816e-17</td>
<td valign="top" align="center">2.774e-18</td>
<td valign="top" align="center">10.153</td>
<td valign="top" align="center">&#x0003C;2e-16 <sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbol &#x0002A;&#x0002A;&#x0002A; designates the statistical significance level of the variables in a given model</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>Summary statistics for selected variables in finite sample size between 91 and 120.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">5.169e-06</td>
<td valign="top" align="center">3.434e-06</td>
<td valign="top" align="center">1.505</td>
<td valign="top" align="center">0.133211</td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">1.266e-05</td>
<td valign="top" align="center">3.809e-06</td>
<td valign="top" align="center">3.323</td>
<td valign="top" align="center">0.000994<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.569e-09</td>
<td valign="top" align="center">9.362e-09</td>
<td valign="top" align="center">&#x02212;0.168</td>
<td valign="top" align="center">0.867045</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;2.021e-13</td>
<td valign="top" align="center">1.482e-14</td>
<td valign="top" align="center">&#x02212;13.641</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;1.216e-08</td>
<td valign="top" align="center">7.488e-09</td>
<td valign="top" align="center">&#x02212;1.624</td>
<td valign="top" align="center">0.105408</td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">3.782e-16</td>
<td valign="top" align="center">3.539e-17</td>
<td valign="top" align="center">10.687</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;4.778e-20</td>
<td valign="top" align="center">4.874e-21</td>
<td valign="top" align="center">&#x02212;9.804</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4&#x0002A;<italic>p</italic></sup></td>
<td valign="top" align="center">3.367e-15</td>
<td valign="top" align="center">1.792e-16</td>
<td valign="top" align="center">18.793</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>5<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;4.058e-19</td>
<td valign="top" align="center">3.561e-20</td>
<td valign="top" align="center">&#x02212;11.397</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbol &#x0002A;&#x0002A;&#x0002A; designates the statistical significance level of the variables in a given model</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p>Summary statistics for selected variables in finite sample size between 121 and 200.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">5.966e-05</td>
<td valign="top" align="center">2.343e-05</td>
<td valign="top" align="center">2.546</td>
<td valign="top" align="center">0.01102<sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">8.195e-04</td>
<td valign="top" align="center">5.830e-05</td>
<td valign="top" align="center">14.056</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.227e-05</td>
<td valign="top" align="center">1.336e-06</td>
<td valign="top" align="center">&#x02212;9.181</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;8.989e-09</td>
<td valign="top" align="center">3.701e-10</td>
<td valign="top" align="center">&#x02212;24.290</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;1.271e-06</td>
<td valign="top" align="center">3.925e-07</td>
<td valign="top" align="center">&#x02212;3.237</td>
<td valign="top" align="center">0.00124<sup>&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">1.864e-10</td>
<td valign="top" align="center">5.775e-12</td>
<td valign="top" align="center">32.280</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;1.079e-12</td>
<td valign="top" align="center">2.925e-14</td>
<td valign="top" align="center">&#x02212;36.873</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4<italic>p</italic></sup></td>
<td valign="top" align="center">1.233e-09</td>
<td valign="top" align="center">8.712e-11</td>
<td valign="top" align="center">14.147</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>5<italic>p</italic></sup></td>
<td valign="top" align="center">6.308e-12</td>
<td valign="top" align="center">6.042e-13</td>
<td valign="top" align="center">10.440</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbols &#x0002A;, &#x0002A;&#x0002A;, &#x0002A;&#x0002A;&#x0002A; designate the statistical significance level of the variables in a given model</italic>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T7">
<label>Table 7</label>
<caption><p>Summary statistics for selected variables in finite sample size between 201 and 300.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="center"><bold>Estimate</bold></th>
<th valign="top" align="center"><bold>Std.Error</bold></th>
<th valign="top" align="center"><italic><bold>t</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup></td>
<td valign="top" align="center">1.740e-07</td>
<td valign="top" align="center">5.213e-08</td>
<td valign="top" align="center">3.338</td>
<td valign="top" align="center">0.000868<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">2.056e-04</td>
<td valign="top" align="center">5.313e-05</td>
<td valign="top" align="center">3.870</td>
<td valign="top" align="center">0.000114<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup><italic>s</italic></sup>&#x0002A;<italic>m</italic><sup><italic>p</italic></sup></td>
<td valign="top" align="center">1.206e-08</td>
<td valign="top" align="center">5.327e-09</td>
<td valign="top" align="center">2.263</td>
<td valign="top" align="center">0.023777<sup>&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;9.680e-14</td>
<td valign="top" align="center">6.970e-15</td>
<td valign="top" align="center">&#x02212;13.889</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>2<italic>s</italic></sup></td>
<td valign="top" align="center">&#x02212;1.845e-11</td>
<td valign="top" align="center">2.884e-12</td>
<td valign="top" align="center">&#x02212;6.396</td>
<td valign="top" align="center">2.22e-10<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>2<italic>p</italic></sup></td>
<td valign="top" align="center">5.841e-18</td>
<td valign="top" align="center">1.928e-19</td>
<td valign="top" align="center">30.295</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>n</italic><sup>3<italic>s</italic></sup>&#x0002A;<italic>m</italic><sup>3<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;5.966e-20</td>
<td valign="top" align="center">2.469e-21</td>
<td valign="top" align="center">&#x02212;24.161</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>4<italic>p</italic></sup></td>
<td valign="top" align="center">&#x02212;4.111e-09</td>
<td valign="top" align="center">5.612e-10</td>
<td valign="top" align="center">&#x02212;7.326</td>
<td valign="top" align="center">4.14e-13<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
<tr>
<td valign="top" align="left"><italic>m</italic><sup>5<italic>p</italic></sup></td>
<td valign="top" align="center">1.660e-10</td>
<td valign="top" align="center">7.322e-12</td>
<td valign="top" align="center">22.678</td>
<td valign="top" align="center">&#x0003C;2e-16<sup>&#x0002A;&#x0002A;&#x0002A;</sup></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The symbols &#x0002A;, &#x0002A;&#x0002A;&#x0002A; designate the statistical significance level of the variables in a given model</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Form left-to-right-up-to-down the fitting curves with appropriately found models in cases where (<italic>n</italic> &#x0003D; 50, 70, 90, 120, 300) can be viewed. Empirically, it can be seen that the models that best fit the specific curve in a given data were found.</p>
</sec>
<sec id="s4">
<title>4. Data Example</title>
<p>An application of our corrected version of the adjusted Box-Pierce test was performed using S&#x00026;P 500 stock data. We provide instances of both false positive and false negative results obtained by the standard adjusted Box-Pierce test using EQT Corporation stock. This corporation created in 1884 and headquartered in Pittsburg is one of the leading companies extensively devoted to the exploration and transportation of hydrocarbon (Petroleum, natural gas, natural gas liquid). The average daily price of the EQT Corporation was calculated by collecting its opening and closing prices over a period over 8 years (2010&#x02013;2018). For a window size of 50, numerous false negative and false positive points were found at different lags. In this case, instead of a critical value we have a critical boundary or curve exists. In this setting, the same rejection conditions are the same as in the normal case.</p>
<p>In <xref ref-type="fig" rid="F2">Figure 2</xref>, instances of a false positive rejection at lag 26 are shown where the adjusted Box-Pierce test obtains a <italic>p</italic>-value of 0.0504 but the proposed model correction inflates the rejection region to start at 0.058. The graph also shows a false negative results with <italic>p</italic>-value of 0.046 at lag 47. However, the proposed correction shrinks the rejection region to start at 0.045.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Parametric correction to the rejection region for the real EQT Corporation data is size 50.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-08-873746-g0002.tif"/>
</fig>
</sec>
<sec sec-type="discussion" id="s5">
<title>5. Discussion</title>
<p>In this work a new approach for correction of adjusted Box-Pierce test recently developed by Kan and Wang [<xref ref-type="bibr" rid="B4">4</xref>]. Conceptually, the rejection region correction idea is similar to the ones successfully employed in the work of [<xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B19">19</xref>] to counterbalance the conservativeness of exact homogeneity tests. The provided method combines large scale simulations with subsequent scenario-specific regression modeling that includes complex interaction terms to achieve exceptionally good fit that entails nominal type I error rates for all sample sizes and lags used in the test statistic. The regression models that were constructed depend on the length of the series (<italic>n</italic>) and the lag order (<italic>m</italic>). The exponents (<italic>s</italic>) and (<italic>p</italic>) of different variables present in the models are treated as hyperparameters in order to control the learning process. To obtain optimal values for those hyperparameters an extensive search through chosen subset values for (<italic>s</italic>) and (<italic>p</italic>) was conducted. The simulation study showed that the test outperforms all existing competing goodness-of-fit approaches for sample sizes up to 300.</p>
<p>It shall be noted that, in this study, we are not developing any new statistic but improving the best test among the current goodness-of-fit methods for time series. Our contribution is the introduction of a completely new idea to time series diagnostics, a rejection region correction <italic>via</italic> a range of parametric regressions fitted to large sample simulation data. Our study is an extension of the Adjusted Box-Pierce, as presented earlier.</p>
<p>The merit to the novel correction to the adjusted Box-Pierce proposed in this study is that it allows to find a test with vastly improved type I error rates for all sample size and lag values. This proposed technique of rejection region correction has direct implication on precise decision making by investors and financial institutions. The same technique can be easily extended to larger sample sizes.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>SD conducted the simulation study, drafted, and revised the manuscript. JZ participated in interpreting the simulation results. KA reviewed and revised the manuscript, and prepared the final draft. AB contributed to the model building and the study design. CR conceived and designed the study, contributed to drafting, reviewing, and revising the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Cryer</surname> <given-names>JD</given-names></name> <name><surname>Chan</surname> <given-names>K-S</given-names></name></person-group>. <source>Time Series Analysis: With Applications in R</source>, Vol. <volume>2</volume>. <publisher-name>Springer</publisher-name> (<year>2008</year>).</citation>
</ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rachev</surname> <given-names>ST</given-names></name> <name><surname>Mittnik</surname> <given-names>S</given-names></name> <name><surname>Fabozzi</surname> <given-names>FJ</given-names></name> <name><surname>Focardi</surname> <given-names>SM</given-names></name> <name><surname>Ja&#x00161;i&#x00107;</surname> <given-names>T</given-names></name></person-group>. <source>Financial Econometrics: From Basics to Advanced Modeling Techniques</source>. Vol. <volume>150</volume>. <publisher-name>John Wiley &#x00026; Sons</publisher-name> (<year>2007</year>).</citation>
</ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindberg</surname> <given-names>J</given-names></name></person-group>. <source>Autoregressive Conditional Density</source>. (<year>2016</year>). <pub-id pub-id-type="doi">10.2139/ssrn.2785499</pub-id></citation>
</ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kan</surname> <given-names>R</given-names></name> <name><surname>Wang</surname> <given-names>X</given-names></name></person-group>. <article-title>On the distribution of the sample autocorrelation coefficients</article-title>. <source>J Econ</source>. (<year>2010</year>) <volume>154</volume>:<fpage>101</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1016/j.jeconom.2009.06.010</pub-id></citation>
</ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname> <given-names>TJ</given-names></name></person-group>. <article-title>Testing adequacy of arma models using a weighted portmanteau test on the residual autocorrelations</article-title>. In: <source>Contributed Paper to 2011 SAS Global Forum.</source> (<year>2011</year>) <fpage>327</fpage>.</citation>
</ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Box</surname> <given-names>GE</given-names></name> <name><surname>Pierce</surname> <given-names>DA</given-names></name></person-group>. <article-title>Distribution of residual autocorrelations in autoregressive-integrated moving average time series models</article-title>. <source>J Am Stat Assoc</source>. (<year>1970</year>) <volume>65</volume>:<fpage>1509</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1970.10481180</pub-id></citation>
</ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ljung</surname> <given-names>GM</given-names></name> <name><surname>Box</surname> <given-names>GE</given-names></name></person-group>. <article-title>On a measure of lack of fit in time series models</article-title>. <source>Biometrika</source>. (<year>1978</year>) <volume>65</volume>:<fpage>297</fpage>&#x02013;<lpage>303</lpage>. <pub-id pub-id-type="doi">10.1093/biomet/65.2.297</pub-id></citation>
</ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arranz</surname> <given-names>MA</given-names></name></person-group>. <article-title>Tol-Project Portmanteau test statistics in time series</article-title>. <source>Time Orientated Lang</source>. (<year>2005</year>) <fpage>1</fpage>&#x02013;<lpage>8</lpage>.</citation>
</ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y</given-names></name></person-group>. <article-title>On the robustness of Ljung- Box and McLeod- Li Q tests: a simulation study</article-title>. <source>Econ Bull</source>. (<year>2002</year>) <volume>3</volume>:<fpage>1</fpage>&#x02013;<lpage>10</lpage>.</citation>
</ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monti</surname> <given-names>AC</given-names></name></person-group>. <article-title>A proposal for a residual autocorrelation test in linear models</article-title>. <source>Biometrika</source>. (<year>1994</year>) <volume>81</volume>:<fpage>776</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1093/biomet/81.4.776</pub-id></citation>
</ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pe&#x000F1;a</surname> <given-names>D</given-names></name> <name><surname>Rodr&#x000ED;guez</surname> <given-names>J</given-names></name></person-group>. <article-title>A powerful portmanteau test of lack of fit for time series</article-title>. <source>J Am Stat Assoc</source>. (<year>2002</year>) <volume>97</volume>:<fpage>601</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1198/016214502760047122</pub-id></citation>
</ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>JW</given-names></name> <name><surname>McLeod</surname> <given-names>AI</given-names></name></person-group>. <article-title>Improved Pe&#x00148;a-Rodriguez portmanteau test</article-title>. <source>Comput Stat Data Anal</source>. (<year>2006</year>) <volume>51</volume>:<fpage>1731</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1016/j.csda.2006.06.010</pub-id></citation>
</ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Safi</surname> <given-names>SK</given-names></name> <name><surname>Al-Reqep</surname> <given-names>AA</given-names></name></person-group>. <article-title>Comparative study of portmanteau tests for the residuals autocorrelation in ARMA models</article-title>. <source>Sci J Appl Math Stat</source>. (<year>2014</year>) <volume>2</volume>:<fpage>1</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.11648/j.sjams.20140201.11</pub-id></citation>
</ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hassani</surname> <given-names>H</given-names></name> <name><surname>Yeganegi</surname> <given-names>MR</given-names></name></person-group>. <article-title>Selecting optimal lag order in Ljung-Box test</article-title>. <source>Phys A</source>. (<year>2020</year>) <volume>541</volume>:<fpage>123700</fpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2019.123700</pub-id></citation>
</ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hassani</surname> <given-names>H</given-names></name> <name><surname>Yeganegi</surname> <given-names>MR</given-names></name></person-group>. <article-title>Sum of squared ACF and the Ljung-Box statistics</article-title>. <source>Phys A</source>. (<year>2019</year>) <volume>520</volume>:<fpage>81</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1016/j.physa.2018.12.028</pub-id></citation>
</ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barnard</surname> <given-names>G</given-names></name></person-group>. <article-title>A new test for 2&#x000D7; 2 tables</article-title>. <source>Nature</source>. (<year>1945</year>) <volume>156</volume>:<fpage>177</fpage>. <pub-id pub-id-type="doi">10.1038/156177a0</pub-id></citation>
</ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname> <given-names>RA</given-names></name></person-group>. <article-title>On the interpretation of &#x003C7; 2 from contingency tables, and the calculation of P</article-title>. <source>J R Stat Soc</source>. (<year>1922</year>) <volume>85</volume>:<fpage>87</fpage>&#x02013;<lpage>94</lpage>. <pub-id pub-id-type="doi">10.2307/2340521</pub-id></citation>
</ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boschloo</surname> <given-names>R</given-names></name></person-group>. <article-title>Raised conditional level of significance for the 2&#x000D7; 2-table when testing the equality of two probabilities</article-title>. <source>Stat Neerland</source>. (<year>1970</year>) <volume>24</volume>:<fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9574.1970.tb00104.x</pub-id></citation>
</ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ehwerhemuepha</surname> <given-names>L</given-names></name> <name><surname>Sok</surname> <given-names>H</given-names></name> <name><surname>Rakovski</surname> <given-names>C</given-names></name></person-group>. <article-title>A more powerful unconditional exact test of homogeneity for 2&#x000D7; c contingency table analysis</article-title>. <source>J Appl Stat</source>. (<year>2019</year>) <volume>46</volume>:<fpage>2572</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1080/02664763.2019.1601689</pub-id></citation>
</ref>
</ref-list> 
</back>
</article>