<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Sports Act. Living</journal-id>
<journal-title>Frontiers in Sports and Active Living</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Sports Act. Living</abbrev-journal-title>
<issn pub-type="epub">2624-9367</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fspor.2022.896828</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Sports and Active Living</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Monitoring Variables Influence on Random Forest Models to Forecast Injuries in Short-Track Speed Skating</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Briand</surname> <given-names>J&#x000E9;r&#x000E9;my</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1677663/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Deguire</surname> <given-names>Simon</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1776946/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Gaudet</surname> <given-names>Sylvain</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1762127/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Bieuzen</surname> <given-names>Fran&#x000E7;ois</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/337671/overview"/>
</contrib>
</contrib-group>
<aff><institution>Institut National du Sport du Qu&#x000E9;bec</institution>, <addr-line>Montr&#x000E9;al, QC</addr-line>, <country>Canada</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Pascal Edouard, Universit&#x000E9; Jean Monnet, France</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Christophe Ley, University of Luxembourg, Luxembourg; Alexander Gerharz, Technical University Dortmund, Germany</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Fran&#x000E7;ois Bieuzen <email>fbieuzen&#x00040;insquebec.org</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Injury Prevention and Rehabilitation, a section of the journal Frontiers in Sports and Active Living</p></fn></author-notes>
<pub-date pub-type="epub">
<day>14</day>
<month>07</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>4</volume>
<elocation-id>896828</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>06</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Briand, Deguire, Gaudet and Bieuzen.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Briand, Deguire, Gaudet and Bieuzen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Injuries limit the athletes&#x00027; ability to participate fully in their training and competitive process. They are detrimental to performance, affecting the athletes psychologically while limiting physiological adaptations and long-term development. This study aims to present a framework for developing random forest classifier models, forecasting injuries in the upcoming 1 to 7 days, to assist the performance support staff in reducing injuries and maximizing performance within the Canadian National Female Short-Track Speed Skating Program. Forty different variables monitored daily over two seasons (2018&#x02013;2019 and 2019&#x02013;2020) were used to develop two sets of forecasting models. One includes only training load variables (TL), and a second (ALL) combines a wide array of monitored variables (neuromuscular function, heart rate variability, training load, psychological wellbeing, past injury type, and location). The sensitivity (ALL: 0.35 &#x000B1; 0.19, TL: 0.23 &#x000B1; 0.03), specificity (ALL: 0.81 &#x000B1; 0.05, TL: 0.74 &#x000B1; 0.03) and Matthews Correlation Coefficients (MCC) (ALL: 0.13 &#x000B1; 0.05, TL: &#x02212;0.02 &#x000B1; 0.02) were computed. Paired <italic>T</italic>-test on the MCC revealed statistically significant (<italic>p</italic> &#x0003C; 0.01) and large positive effects (Cohen d &#x0003E; 1) for the ALL forecasting models&#x00027; MCC over every forecasting window (1 to 7 days). These models were highly determined by the athletes&#x00027; training completion, lower limb and trunk/lumbar injury history, as well as sFatigue, a training load marker. The TL forecasting models&#x00027; MCC suggests they do not bring any added value to forecast injuries. Combining a wide array of monitored variables and quantifying the injury etiology conceptual components significantly improve the injury forecasting performance of random forest models. The ALL forecasting models&#x00027; performances are promising, especially on one time windows of one or two days, with sensitivities and specificities being respectively above 0.5 and 0.7. They could add value to the decision-making process for the support staff in order to assist the Canadian National Female Team Short-Track Speed Skating program in reducing the number of incomplete training days, which could potentially increase performance. On longer forecasting time windows, ALL forecasting models&#x00027; sensitivity and MCC decrease gradually. Further work is needed to determine if such models could be useful for forecasting injuries over three days or longer.</p></abstract>
<kwd-group>
<kwd>data mining</kwd>
<kwd>machine learning</kwd>
<kwd>high performance</kwd>
<kwd>sport injury prevention</kwd>
<kwd>modeling</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="2"/>
<equation-count count="3"/>
<ref-count count="72"/>
<page-count count="14"/>
<word-count count="9738"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Sports performance is complex and influenced by many factors such as genetic heritage, training level, trainability (Issurin et al., <xref ref-type="bibr" rid="B36">2005</xref>), sportive technique, nutrition, psychological state, and health status (Gould et al., <xref ref-type="bibr" rid="B29">1999</xref>; D&#x00027;Isanto, <xref ref-type="bibr" rid="B20">2019</xref>). Injuries, which we define as any physical or mental complaint limiting the athletes&#x00027; ability to participate fully in training and competition (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Clarsen et al., <xref ref-type="bibr" rid="B16">2013</xref>), are part of the factors affecting sports performance (Renstr&#x000F6;m and Johnson, <xref ref-type="bibr" rid="B53">1985</xref>; Clarsen et al., <xref ref-type="bibr" rid="B16">2013</xref>, <xref ref-type="bibr" rid="B17">2021</xref>; D&#x00027;Isanto, <xref ref-type="bibr" rid="B20">2019</xref>). Within high-performance systems, injuries are common (Lian et al., <xref ref-type="bibr" rid="B43">2005</xref>; Wald&#x000E9;n et al., <xref ref-type="bibr" rid="B69">2005</xref>; Clarsen et al., <xref ref-type="bibr" rid="B15">2010</xref>), potentially because of the high demands of competitive sports, requiring the athletes to push their body to its limit (Soligard et al., <xref ref-type="bibr" rid="B62">2016</xref>). They restrict the athletes&#x00027; ability to progress, affecting performance, long-term development, and athlete retention while adding financial costs (Bahr and Krosshaug, <xref ref-type="bibr" rid="B6">2005</xref>; Bahr et al., <xref ref-type="bibr" rid="B5">2018</xref>).</p>
<p>Injuries are complex and influenced by many interconnected factors (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Halson, <xref ref-type="bibr" rid="B31">2014</xref>; Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>). We can regroup these factors into three main categories. The first one refers to the stress applied to the athlete. Previous studies extensively explored the impact of external and internal training load (stress imposed on and perceived by the athlete) (Foster et al., <xref ref-type="bibr" rid="B22">2017</xref>) on injuries, often using acute to chronic work ratio (ACWR) (Hulin et al., <xref ref-type="bibr" rid="B35">2014</xref>; Stares et al., <xref ref-type="bibr" rid="B63">2017</xref>). The second category refers to the athletes&#x00027; predispositions that will affect the stress response, such as their physical and psychological health status, age, injury history, flexibility level, fatigue, muscle strength, and imbalances (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>). The last category is the athletes&#x00027; stress response. According to Selye&#x00027;s general adaptation syndrome (Selye, <xref ref-type="bibr" rid="B59">1950</xref>), the stress response can lead to positive adaptations (Eustress), such as sports performance, or negative adaptations (distress), such as injuries. This stress response (eustress or distress) can, in turn, modify the athletes&#x00027; predisposition and influence their future response to stress exposure. We hypothesize that proper monitoring of various quantitative variables from each of the three categories of factors influencing injuries and sports performance, combined with efficient association and observational techniques, could help predict the athletes&#x00027; future injuries.</p>
<p>Many interrelated variables influence injuries, which complexifies their association with risk factors (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>; Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). To tackle this problem, studies have started using machine learning algorithms (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). Machine learning is a tool that uses computers to &#x0201C;learn&#x0201D; complex relationships from empirical data and that establishes a mathematical link between a large number of covariates and a target variable of interest (Cabitza et al., <xref ref-type="bibr" rid="B13">2018</xref>). Among the studies stated in Van Eetvelde&#x00027;s (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>) systematic review of machine learning applications to sports injuries, different types of variables were monitored: (1) sports background (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ayala et al., <xref ref-type="bibr" rid="B4">2019</xref>); (2) psychological measurements (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ayala et al., <xref ref-type="bibr" rid="B4">2019</xref>); (3) neuromuscular measurements (L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ayala et al., <xref ref-type="bibr" rid="B4">2019</xref>); (4) workloads (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; Whiteside et al., <xref ref-type="bibr" rid="B71">2016</xref>; Rossi et al., <xref ref-type="bibr" rid="B57">2017</xref>; Thornton et al., <xref ref-type="bibr" rid="B66">2017</xref>; Carey et al., <xref ref-type="bibr" rid="B14">2018</xref>); (5) previous injury background and health status (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; Rossi et al., <xref ref-type="bibr" rid="B57">2017</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ruddy et al., <xref ref-type="bibr" rid="B58">2018</xref>); (6) genetic markers (Rodas et al., <xref ref-type="bibr" rid="B55">2019</xref>); and (7) personal data (age, body mass, anthropometric measurements, demographic) (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; Whiteside et al., <xref ref-type="bibr" rid="B71">2016</xref>; Ruddy et al., <xref ref-type="bibr" rid="B58">2018</xref>; Oliver et al., <xref ref-type="bibr" rid="B51">2020</xref>; Rommers et al., <xref ref-type="bibr" rid="B56">2020</xref>). Each type of variable helps quantify one of the three categories of factors influencing injuries. Thus far, only the McCullagh and Whitfort (<xref ref-type="bibr" rid="B46">2013</xref>) study has regrouped predictors from the three categories of factors influencing injuries and sports performance.</p>
<p>Short-track speed skating is a highly interactive middle distance sport, which consists in performing multiple laps on the ice at high speed, including tight corners and up to nine athletes in a single race (Hesford et al., <xref ref-type="bibr" rid="B33">2012</xref>; Menting et al., <xref ref-type="bibr" rid="B49">2019</xref>; Konieczny et al., <xref ref-type="bibr" rid="B40">2020</xref>). It has been studied mostly for its biomechanical aspect (Hesford et al., <xref ref-type="bibr" rid="B33">2012</xref>; Kim et al., <xref ref-type="bibr" rid="B39">2019</xref>; Konieczny et al., <xref ref-type="bibr" rid="B40">2020</xref>), as well as its strategic and pacing dimensions (Haug et al., <xref ref-type="bibr" rid="B32">2015</xref>; Konings et al., <xref ref-type="bibr" rid="B42">2016</xref>; Konings and Hettinga, <xref ref-type="bibr" rid="B41">2018</xref>; Menting et al., <xref ref-type="bibr" rid="B49">2019</xref>). This sport induces important leg asymmetries (Hesford et al., <xref ref-type="bibr" rid="B33">2012</xref>; Konieczny et al., <xref ref-type="bibr" rid="B40">2020</xref>), which are suspected to put the athletes particularly at risk of sustaining added injuries in the lower limb regions (Konieczny et al., <xref ref-type="bibr" rid="B40">2020</xref>). Within the Canadian National Female Short-Track Speed Skating Program, over the 2018-2019 and 2019-2020 seasons, athletes have lost on average 75 &#x000B1; 45 training or competition days due to injuries, which represents 12 &#x000B1; 7% of lost training and/or competition opportunities per athlete, over two seasons. On average, 2 days were missed per injury. Implementing a framework helping the performance support staff detect and prevent injuries and their risk factors is required to maximize performance within the Canadian National Female Short-Track Speed Skating Program.</p>
<p>Using a random forest algorithm, shown to be efficient on multiple features classification problems, such as injury prediction (Breiman, <xref ref-type="bibr" rid="B12">2001</xref>), the current study aims to: (1) provide a narrative on a data mining framework (Adriaans and Zantinge, <xref ref-type="bibr" rid="B1">1997</xref>) within the Canadian National Female Short-Track Speed Skating Program; (2) demonstrate that random forest classifiers forecasting injuries could benefit the performance support staff; and (3) highlight the variables of importance in the injury forecasting process. Finally, the study aims to (4) demonstrate how the combination of monitored variables from each of the three categories of factors influencing injuries improves the ability to forecast potential injuries compared to models only built using training load variables.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and Methods</title>
<sec>
<title>Forecasting</title>
<p>Van Eetvelde et al.&#x00027;s review (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>) presents mostly prospective case-control studies (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; Rossi et al., <xref ref-type="bibr" rid="B57">2017</xref>; Thornton et al., <xref ref-type="bibr" rid="B66">2017</xref>; Carey et al., <xref ref-type="bibr" rid="B14">2018</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ruddy et al., <xref ref-type="bibr" rid="B58">2018</xref>; Ayala et al., <xref ref-type="bibr" rid="B4">2019</xref>; Oliver et al., <xref ref-type="bibr" rid="B51">2020</xref>; Rommers et al., <xref ref-type="bibr" rid="B56">2020</xref>), analyzing data retrospectively and predicting injuries based on the athletes&#x00027; features value on a given day. In this prospective case-control study, we predicted the athletes&#x00027; injury status in the upcoming 1 to 7 days. This will help the performance support staff anticipate athletes&#x00027; upcoming injuries over the next training micro-cycle, which corresponds to 7 days within the Canadian National Female Short-Track Speed Skating Program.</p>
</sec>
<sec>
<title>Data Mining Approach</title>
<p>We put a data mining system (Adriaans and Zantinge, <xref ref-type="bibr" rid="B1">1997</xref>) in place within the Canadian National Female Short-Track Speed Skating Program to train and evaluate the random forest injury prediction models. <xref ref-type="fig" rid="F1">Figure 1</xref> provides a schematic of the different steps of data acquisition, preprocessing, feature engineering, and model training and evaluation. Each step presented in <xref ref-type="fig" rid="F1">Figure 1</xref> will be explained in the following subsections.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Overview of the data mining framework to develop and evaluate random forest forecasting models.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fspor-04-896828-g0001.tif"/>
</fig></sec>
<sec>
<title>Athletes</title>
<p>We collected data over the 2018&#x02013;2019 and 2019&#x02013;2020 seasons. Eleven women, members of the Canadian Short-Track Speed Skating National Team, consented to take part in the study. Every athlete signed an &#x0201C;athlete consent form&#x0201D; allowing the use of their data for research purpose and program evaluation. The consent form was approved according to provincial legislations and ethical recommendations. Sensitive information that can directly identify the athletes (e.g. names, addresses) were safeguarded and maintained under controlled conditions according to provincial laws. The athletes went through a familiarization and education process to ensure data were reported in a standard and appropriate way. <xref ref-type="table" rid="T1">Table 1</xref> describes the characteristics of the athletes who participated in the study.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Athletes characteristics summary.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Athletes characteristics</bold></th>
<th valign="top" align="center"><bold>Mean &#x000B1;standard</bold><break/> <bold>deviation</bold></th>
<th valign="top" align="center"><bold>Range</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Age (years)</td>
<td valign="top" align="center">21 &#x000B1; 2</td>
<td valign="top" align="center">18&#x02013;24</td>
</tr>
<tr>
<td valign="top" align="left">World ranking</td>
<td valign="top" align="center">27 &#x000B1; 19</td>
<td valign="top" align="center">2&#x02014;Not ranked</td>
</tr>
<tr>
<td valign="top" align="left">Experience on the national team (years)</td>
<td valign="top" align="center">4 &#x000B1; 2</td>
<td valign="top" align="center">1&#x02013;9</td>
</tr>
</tbody>
</table>
</table-wrap></sec>
<sec>
<title>Conceptual Model of Injuries and Monitored Data</title>
<p>The conceptual model of injuries and sports performance, presented in <xref ref-type="fig" rid="F2">Figure 2</xref>, was created to guide our interpretation of injuries and their underlying mechanisms and provide adequate perspective on the influence of the three categories of factors on injuries. It is inspired by Selye&#x00027;s general adaptation syndrome (Selye, <xref ref-type="bibr" rid="B59">1950</xref>), Meeuwisse et al. (<xref ref-type="bibr" rid="B48">2007</xref>) dynamic model of sports injury etiology, and D&#x00027;Isanto (<xref ref-type="bibr" rid="B20">2019</xref>) factors influencing sports performance. The model (<xref ref-type="fig" rid="F2">Figure 2</xref>) guided the choice of variables to monitor. The amount of stress imposed on the athlete was quantified by monitoring the external (training stress imposed on the athlete) and internal training load (training stress perceived by the athlete) (Foster et al., <xref ref-type="bibr" rid="B22">2017</xref>). Factors influencing stress response were monitored through psychological wellness questionnaire scores (Junge, <xref ref-type="bibr" rid="B38">2000</xref>; Shrier and Hall&#x000E9;, <xref ref-type="bibr" rid="B61">2011</xref>), tracking of athletes&#x00027; injury type and location history, and monitoring of their history of training completion (ability to complete the prescribed training). The stress response was quantified through the tracking of the athletes&#x00027; heart rate variability (HRV) (Goessl et al., <xref ref-type="bibr" rid="B28">2017</xref>) and neuromuscular function (Gathercole et al., <xref ref-type="bibr" rid="B26">2015</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Conceptual framework of the factors influencing sport performance and injuries and their inter-relation.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fspor-04-896828-g0002.tif"/>
</fig>
<p>We tracked 40 different variables through the monitoring process, grouped into eight subsets, which are described in <xref ref-type="table" rid="T2">Table 2</xref>. Combining the data from the 11 athletes over the two seasons provided a total cumulative dataset of 7,370 daily observations (a daily observation regroups the 40 monitored variable values with the associated injury outcome). The monitored data were stored in a database.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Description of the different variables, their measurement frequency, and the strategies used to replace missing values.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="left"><bold>Description and measurements</bold></th>
<th valign="top" align="left"><bold>Measurement frequency</bold></th>
<th valign="top" align="left"><bold>Missing values replacement strategy</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">External training load</td>
<td valign="top" align="left">Number of laps on the ice rink performed for each training by the athletes in different intensity zones.</td>
<td valign="top" align="left">Every training on ice</td>
<td valign="top" align="left">Main cause: Athletes did not train. Missing values replaced with a zero.</td>
</tr>
<tr>
<td valign="top" align="left">Internal training load</td>
<td valign="top" align="left">Athletes qualitatively assigned fatigue perception on a scale of 0 to 10, which was multiplied by the session duration, in minutes, to provide an internal load score referred to as sFatigue (Dunbar et al., <xref ref-type="bibr" rid="B21">1992</xref>)</td>
<td valign="top" align="left">After every training session</td>
<td valign="top" align="left">Same as external training load.</td>
</tr>
<tr>
<td valign="top" align="left">Psychological wellbeing metrics</td>
<td valign="top" align="left">Athletes provided an assessment, on a scale of 0 to 100, of their levels of stress, energy, happiness, mood, motivation, performance stress, and sleep quality over the three previous days (Junge, <xref ref-type="bibr" rid="B38">2000</xref>; Shrier and Hall&#x000E9;, <xref ref-type="bibr" rid="B61">2011</xref>).</td>
<td valign="top" align="left">3 times a week</td>
<td valign="top" align="left">Main Cause: non-daily measurement frequencies. Replaced with the most recent measurement.</td>
</tr>
<tr>
<td valign="top" align="left">Heart rate variability</td>
<td valign="top" align="left">Resting heart rate variability (HRV) taken using a heart rate monitoring belt (Polar H10, Finland) connected HRV4Training (Altini et al., <xref ref-type="bibr" rid="B2">2017</xref>). Variables measured: resting heart rate, high frequency power (hf), low frequency power (lf), pnn50 (Seyd et al., <xref ref-type="bibr" rid="B60">2008</xref>), RMSSD (Wang and Huang, <xref ref-type="bibr" rid="B70">2012</xref>), sdnn (Wang and Huang, <xref ref-type="bibr" rid="B70">2012</xref>; Goessl et al., <xref ref-type="bibr" rid="B28">2017</xref>).</td>
<td valign="top" align="left">Every 3 days</td>
<td valign="top" align="left">Same as psychological wellbeing metrics.</td>
</tr>
<tr>
<td valign="top" align="left">Neuromuscular function</td>
<td valign="top" align="left">Counter movement jumps (CMJ) performed on force plates. Variables measured: contraction time, flight time to contraction time ratio, jump height (from flight time and impulsion), takeoff velocity, flight time to contraction time ratio, height (from impulsion), height (from flight time) and flight velocity (Gathercole et al., <xref ref-type="bibr" rid="B26">2015</xref>).</td>
<td valign="top" align="left">3 times a week</td>
<td valign="top" align="left">Same as psychological wellbeing metrics.</td>
</tr>
<tr>
<td valign="top" align="left">Injury type and location</td>
<td valign="top" align="left">Each time athletes reported injuries they specified injury body location (head/neck, trunk, trunk/lumbar, lower limb, upper limb) and the type (bone, muscle and tendon, joint and ligament, skin, brain/spinal cord/peripheral nervous system, other).</td>
<td valign="top" align="left">Each time an injury was reported</td>
<td valign="top" align="left">Main cause: Athlete was not injured. Replaced with a zero.</td>
</tr>
<tr>
<td valign="top" align="left">Training completion</td>
<td valign="top" align="left">The athletes ranked the level of training completion according to four factors: 0: training completed without injury/illness, 1: training completed with injury/illness, 2: training could not be completed because of injury/illness and 3: The athlete could not train at all because of injury/illness.</td>
<td valign="top" align="left">After every training session</td>
<td valign="top" align="left">Main cause: Athlete did not train. Missing values replaced with a zero. In the cases where athletes could not train because of injury or illness, the corresponding health status was validated by the medical team.</td>
</tr>
<tr>
<td valign="top" align="left">Injury (Target variable: Forecast 1 to 7 days)</td>
<td valign="top" align="left">The injury status, by definition (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Clarsen et al., <xref ref-type="bibr" rid="B16">2013</xref>) refers to the <italic>Training completion</italic> variable value in a future time window of 1 to 7 days. A <italic>Training completion</italic> of 0 was labeled as a &#x0201C;non-injured&#x0201D; athlete, while a <italic>Training completion</italic> of 1, 2 and 3 was labeled as an &#x0201C;injured&#x0201D; athlete. Models were trained to predict this variable.</td>
<td valign="top" align="left">Every day</td>
<td valign="top" align="left">Variable derived from the <italic>Training completion</italic> When the Injury variable is defined there are no missing values left.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>For the missing values and replacement strategies, the main cause of missing values is reported in the third column of the table. A close follow-up was made with the performance support staff to verify all information for specific situations, such as athletes omitting to report due to injuries</italic>.</p>
</table-wrap-foot>
</table-wrap></sec>
<sec>
<title>Data Preprocessing: Missing Values Imputation and Feature Engineering</title>
<p>The monitoring process implies inevitable missing values (Benson et al., <xref ref-type="bibr" rid="B8">2021</xref>). We observed three causes for missing values in the dataset: (1) the athletes did not train on a given day; (2) some variables were not measured daily; and (3) the athletes omitted to report data. Even if all athletes who participated in the study had participation rates ranging from 77 to 100% and averaging 90%, some daily instances were incomplete and displayed, for some variables, missing values. Imputation strategies had to be employed and are reported, for each monitored variable subset, in <xref ref-type="table" rid="T2">Table 2</xref>. Replacing missing values by the most recent measurement was the preferred imputation strategy for the psychological wellbeing metrics, neuromuscular function, and HRV variables, which were measured every 2 to 3 days. Although not perfect, the imputation strategy is easy to implement, avoids information leakage (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>), and provides indicators of the stress response variables over a short time frame. It is the most simplistic single imputation technique described by Benson et al. (<xref ref-type="bibr" rid="B8">2021</xref>).</p>
<p>Injuries were forecasted through a feature-based approach. New variables were computed from the athlete&#x00027;s monitored variables history, stored in the database, to reflect injuries&#x00027; time dependence (Clarsen et al., <xref ref-type="bibr" rid="B16">2013</xref>; Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>) and improve the models&#x00027; forecasting performances. We calculated the exponentially weighted moving average (EWMA) over the last 7 and 28 days (referred to as the acute and chronic indices) for each monitored variable. These time frame periods were chosen based on previous studies exploring the association between these indices and injuries (Gabbett and Jenkins, <xref ref-type="bibr" rid="B25">2011</xref>; Hulin et al., <xref ref-type="bibr" rid="B35">2014</xref>; Gabbett et al., <xref ref-type="bibr" rid="B24">2016</xref>). It also corresponds to the microcycles (7 days) and mesocycles (28 days) used for the athletes&#x00027; global periodization. The acute to chronic work ratio (ACR) (Gabbett and Jenkins, <xref ref-type="bibr" rid="B25">2011</xref>; Hulin et al., <xref ref-type="bibr" rid="B35">2014</xref>; Gabbett et al., <xref ref-type="bibr" rid="B24">2016</xref>) is an additional feature engineered for all the variables. The EWMA ratios are a standard forecasting practice to improve the forecasting models&#x00027; performances (Holt, <xref ref-type="bibr" rid="B34">2004</xref>). In addition, we computed the acute to chronic difference (ACD), inspired by Banister&#x00027;s training impulse concept (Banister et al., <xref ref-type="bibr" rid="B7">1975</xref>; Jovanovic, <xref ref-type="bibr" rid="B37">2018</xref>).</p>
<p>For each variable, to quantify the influence of the past mesocycles and microcycles, we computed rolling means, standard deviations, and maximums over time windows of 7, 14, 21, and 28 days. At the end of the feature engineering process, a single daily observation is constituted of 1014 features associated with the athletes&#x00027; injury status (injured or not injured).</p>
</sec>
<sec>
<title>Training and Test Sets</title>
<p>To make predictions, the random forest algorithms were &#x0201C;trained&#x0201D; on available data and &#x0201C;learned&#x0201D; a set of rules that allow appropriate classification (injured or not injured), given the set of features provided as input. This set of features was composed of: (1) monitored variables on any given day; (2) imputed variables (if some monitored variables were missing); (3) engineered features based on the athletes&#x00027; monitored variables history stored in the database.</p>
<p>To evaluate the classifiers&#x00027; ability to replicate their predicting performances, they were evaluated on a test set, <italic>i.e</italic>., a set of unseen data (not used in the models&#x00027; training process). Two sets of forecasting models were developed. One including only training load variables (TL), and a second (ALL) combining a wide array of monitored variables (neuromuscular function, heart rate variability, training load, psychological wellbeing, past injury type, and location). From the 7,370 available daily instances, we created two test sets of 1,100 instances, composed of each of the 11 athletes&#x00027; last (chronologically) 100 daily instances. The ALL-Test-Set was composed of each of the 40 monitored variables and their respective engineered features, based on each athlete&#x00027;s complete data history, for a total of 1,014 features, while the TL-Test-Set was composed only of internal and external training load monitored variables and their corresponding engineered features, for a total of 182 features. With the remaining 6,270 daily instances, we created two training sets: the ALL-Training-Set composed of the same features as the ALL-Test-Set and the TL-Training set composed of the same features as the TL-Test-Set. As depicted in <xref ref-type="fig" rid="F1">Figure 1</xref>, the test sets feature engineering was performed on the athletes&#x00027; complete data history. The training sets feature engineering was only performed on the remaining 6,270 daily observations. This process replicates the way data will be handled to make predictions once the models are implemented in the training environment and avoid data leakage, where information of the test set would leak in the training set, leading to overfitting and biased interpretation of the models&#x00027; evaluation (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). Two distinct data sets (ALL and TL) were created to assess the impact of building forecasting models composed of variables from each of the three components of the injury conceptual model (ALL) in comparison to models composed of variables only reflecting the stress applied to the athlete (TL).</p>
</sec>
<sec>
<title>Models Training</title>
<p>Two sets of models were trained using the <italic>Waikaito Environment for Knowledge Analysis (WEKA)</italic> (Hall et al., <xref ref-type="bibr" rid="B30">2009</xref>). WEKA provides built-in functions to facilitate classifications of imbalanced datasets, such as the one in this study, where non-injured outweighs injured daily observations. To improve the random forest injury classifier, the injured and non-injured instances of the test set were balanced using WEKA&#x00027;s <italic>class balancer</italic> filter, which applies a weight function to the instances such that injured and non-injured instances display a similar total sum of weight (Hall et al., <xref ref-type="bibr" rid="B30">2009</xref>). A cost function was also introduced using WEKA&#x00027;s <italic>cost-sensitive classifier</italic> function. The function penalizes injury misclassifications through the random forests training process, which improves injury forecasting performances (Hall et al., <xref ref-type="bibr" rid="B30">2009</xref>). Default setups were used for all of WEKA&#x00027;s built-in functions. The ALL forecasting models were trained on the ALL Training Set, and the TL forecasting models were trained on the TL Training Set. For each set, seven classifiers were trained to predict whether the athletes would be injured in an upcoming time frame of 1 to 7 days, respectively. The random forest algorithm introduces randomness, notably through the bootstrap aggregating process (Breiman, <xref ref-type="bibr" rid="B12">2001</xref>). To account for the classifiers&#x00027; random factors, the training of each classifier was performed 30 times for each of the seven forecasting time frames. At the end of the training process, we obtained, for both sets of models (ALL and TL), 30 trained random forest classifiers for each forecasting time window (1 to 7 days).</p>
</sec>
<sec>
<title>Models Evaluation</title>
<p>Each instance of the test-set was provided as input to the forecasting models, and the trained classifier generated predictions on the upcoming injury status (on a time frame of 1 to 7 days depending on the classifier). To obtain a confusion matrix, each prediction was compared to the true outcome, which is already known. Confusion matrices compile True Positives (TP), False Positives (FP), False Negatives (FN), and True Negatives (TN) (Visa et al., <xref ref-type="bibr" rid="B68">2011</xref>). We computed the following three metrics from confusion matrices that summarized each of the 30 models&#x00027; predictive performances for every forecasting time window (1 to 7 days).</p>
<p>The sensitivity.</p>
<disp-formula id="E1"><mml:math id="M1"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold-italic"><mml:mi>S</mml:mi></mml:mstyle><mml:mstyle mathvariant="bold-italic"><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:mstyle mathvariant="bold-italic"><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The specificity</p>
<disp-formula id="E2"><mml:math id="M2"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold-italic"><mml:mi>S</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>i</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mo>=</mml:mo></mml:mstyle><mml:mfrac><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mstyle><mml:mo>&#x0002B;</mml:mo><mml:mstyle mathvariant="bold-italic"><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mstyle></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>the Matthews Correlation Coefficient (MCC) (Matthews, <xref ref-type="bibr" rid="B45">1975</xref>).</p>
<disp-formula id="E3"><mml:math id="M3"><mml:mrow><mml:mstyle mathvariant="bold-italic"><mml:mi>M</mml:mi><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x000B7;</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>&#x000B7;</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p>The sensitivity and specificity are commonly used evaluation metrics and were computed to compare our models with existing sports injury prediction models (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). The MCC (Matthews, <xref ref-type="bibr" rid="B45">1975</xref>) is considered one of the best methods to evaluate binary classifiers (Powers, <xref ref-type="bibr" rid="B52">2020</xref>). It is a wellbalanced metric that can be used even if the two classes are unbalanced (Boughorbel et al., <xref ref-type="bibr" rid="B11">2017</xref>). An MCC of 1 indicates complete agreement between the predictions and the observation, and inversely, an MCC of &#x02212;1 denotes complete disagreement (Boughorbel et al., <xref ref-type="bibr" rid="B11">2017</xref>). An MCC of 0 would be considered a model where the predictions were made randomly (Boughorbel et al., <xref ref-type="bibr" rid="B11">2017</xref>). The unbalanced nature of the data set is the reason why accuracy (Yin et al., <xref ref-type="bibr" rid="B72">2019</xref>) was not used as an evaluation metric. In our case, out of the total 7,370 daily observations, only 12% constitute injuries. Therefore, even if the models were always to predict non-injured athletes, the accuracy would remain relatively high (&#x0007E;0.88) and would not provide valuable information to the practitioners. We computed the mean and standard deviation of the 30 sensitivity scores, 30 specificity scores, and 30 MCC of every forecasting time window. At the end of the evaluation process, we obtained three evaluation metrics (mean &#x000B1; standard deviation) for each model and each forecast window (1 to 7 days) for both the ALL and TL forecasting models.</p>
</sec>
<sec>
<title>Models Comparison</title>
<p>To analyse the effect of the forecasting model types (ALL or TL) and forecasting time windows (1 to 7 days) on each evaluation metric (sensitivity, specificity, MCC), two-way ANOVAs were performed on each metric. In addition to ANOVAs, paired <italic>T</italic>-tests (Meek et al., <xref ref-type="bibr" rid="B47">1987</xref>) were performed on each separate forecasting window to evaluate the effect of the model type on the evaluation metrics, with the null hypothesis that both forecasting models (ALL and TL) evaluation metrics were equal at a significance level of <italic>p</italic> &#x0003C; 0.01. In addition, we computed Cohen effect sizes (Cohen, <xref ref-type="bibr" rid="B18">1994</xref>; Fritz et al., <xref ref-type="bibr" rid="B23">2012</xref>) to quantify the difference between the two types of model evaluation metrics.</p>
</sec>
<sec>
<title>Variables of Importance</title>
<p>Random forest classifiers enable highlighting variables of importance, which have a practical application for the performance support staff, trying to understand better the causes and mechanisms of injuries (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>; Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). The higher the random forests&#x00027; minimum GINI decrease coefficient, the greater the importance of the variable in the model (Archer and Kimes, <xref ref-type="bibr" rid="B3">2008</xref>).</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<p>Each graph in <xref ref-type="fig" rid="F3">Figure 3</xref> compares one of the three evaluation metrics derived from the evaluation of the ALL forecasting models and TL forecasting models for each forecasting time window (1&#x02013;7 days). Two-way ANOVAs performed on sensitivity, MCC and specificity indicated that there was a statistically significant interaction between the models type and the forecasting window (<italic>F</italic><sub>(6, 406)</sub> = 351.8, <italic>p</italic> &#x0003C; 0.000 for sensitivity; <italic>F</italic><sub>(6, 406)</sub> = 71.38, <italic>p</italic> &#x0003C; 0.000 for specificity; <italic>F</italic><sub>(6, 406)</sub> = 186.3, <italic>p</italic> &#x0003C; 0.000 for MCC). <xref ref-type="fig" rid="F3">Figure 3A</xref> compares the two types of model sensitivity, <italic>i.e.</italic>, the number of injury outcomes that the model correctly predicted. On average, the sensitivity was 0.35 &#x000B1; 0.19 for the ALL forecasting model and 0.23 &#x000B1; 0.03 for the TL forecasting model. Paired <italic>T</italic>-test performed on each respective forecasting window showed a statistically significant positive effect (<italic>p</italic> &#x0003C; 0.01) on the ALL forecasting models&#x00027; sensitivity, with large Cohen effect sizes (d &#x0003E; 1) for forecasting windows of 1&#x02013;3 days. As the forecasting windows got larger, the sensitivity gradually dropped from 0.70 &#x000B1; 0.03 for a one-day forecasting window to 0.14 &#x000B1; 0.03 for seven days forecasting window, for the ALL forecasting model. The sensitivity was above 0.5 only for 1 and 2 days forecasting windows. The TL forecasting models&#x00027; sensitivity remained relatively stable, oscillating between 0.26 and 0.20.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Evaluation metrics scores for both the ALL forecasting models (red dots) and TL forecasting models (blue dots). The x-axis displays the model forecasting window (from 1 to 7 days), while the y-axis presents in <bold>(A)</bold> the sensitivity, <bold>(B)</bold> Matthews Correlation Coefficient (MCC), and <bold>(C)</bold> specificity. The (&#x0002A;) symbol indicates a significant positive difference for the ALL forecasting models compared to the TL forecasting models (<italic>p</italic> &#x0003C; 0.01), as determined by paired <italic>T</italic>-test performed on each individual forecasting window, while the (&#x00023;) symbol indicates positive large Cohen effect sizes for the ALL forecasting models compared to the TL forecasting models (d &#x0003E; 1). The red dotted line on graph <bold>(B)</bold> indicates a MCC of zero, corresponding to the performance of a model making predictions randomly. Note that the y-axis of each graph are all on different scales to better appreciate the difference between the two types of models for every metric.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fspor-04-896828-g0003.tif"/>
</fig>
<p><xref ref-type="fig" rid="F3">Figure 3B</xref> compares the MCC, <italic>i.e</italic>., an index of the models&#x00027; predictions correlation with real observations. The MCC was on average 0.13 &#x000B1; 0.11 for the ALL forecasting model and &#x02212;0.02 &#x000B1; 0.02 for the TL forecasting model. Paired <italic>T</italic>-test performed on each respective forecasting window showed a statistically significant positive effect (<italic>p</italic> &#x0003C; 0.01) on the ALL forecasting model&#x00027;s MCC, with large Cohen effect sizes (d &#x0003E; 1) for each forecasting window. Every TL forecasting model MCC was below zero, suggesting the model&#x00027;s predictions were slightly worse than if predicted randomly. The ALL forecasting window&#x00027;s MCC decreased as the forecasting windows got larger, reaching 0.00 &#x000B1; 0.02 for seven-day forecasting windows.</p>
<p><xref ref-type="fig" rid="F3">Figure 3C</xref> compares the two types of model specificity, <italic>i.e.</italic>, the number of non-injured outcomes that the models predicted correctly. On average, the specificity was 0.81 &#x000B1; 0.05 for the ALL forecasting model and 0.74 &#x000B1; 0.03 for the TL forecasting model. Paired <italic>T</italic>-test performed on each forecasting window revealed a significant positive effect (<italic>p</italic> &#x0003C; 0.01) on the ALL forecasting model&#x00027;s specificity with large Cohen effect sizes (d &#x0003E; 1) for forecasting windows of four to seven days. The ALL forecasting model&#x00027;s specificity increased as the forecasting window got larger, reaching a maximum of 0.86 &#x000B1; 0.02 for seven-day forecasting windows. The TL forecasting model&#x00027;s specificity remained relatively stable, oscillating between 0.76 and 0.81.</p>
<p>The ALL forecasting models displayed a significantly higher injury prediction rate and MCC, suggesting they could bring added information to the performance support staff, especially over shorter forecasting time windows of 1 or 2 days, where the MCCs suggest low to moderate agreement between the models&#x00027; predictions and the observations (MCCs of respectively 0.32 &#x000B1; 0.03 for a one-day forecast and 0.22 &#x000B1; 0.04 for a two-day forecast). These models are composed of multiple variables from each of the three components of the injury model (<xref ref-type="fig" rid="F2">Figure 2</xref>). To understand better which categories of variables influence these models, the variables of importance were computed. <xref ref-type="fig" rid="F4">Figure 4</xref> presents the five most important variables retained from the ALL forecasting models of each forecasting window (1 to 7 days). Every variable in <xref ref-type="fig" rid="F4">Figure 4</xref> is derived from three monitored variables: athlete training completion, lower limb and trunk/lumbar injury, and sFatigue, an internal training load marker. As the forecasting window gets larger, the minimum GINI decrease coefficient gets smaller.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>The 5 most important variables for each forecasting window (1 to 7 days) of the ALL forecasting models. The selected variables of importance were the ones with the highest minimum GINI decrease coefficient. The variables are numbered and colored according to their corresponding forecasting time period. ACD, Acute Chronic difference. ACR, Acute to Chronic Ratio.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fspor-04-896828-g0004.tif"/>
</fig></sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The results demonstrate that combining a wide array of monitored variables and quantifying the injury etiology conceptual components significantly improve the injury forecasting performance of random forest models. The ALL forecasting models displayed higher MCC than the TL forecasting models. The TL forecasting models&#x00027; MCC suggests they do not bring any added value to forecast injuries. The ALL forecasting models are particularly promising for injury prediction over short time frames of 1 or 2 days, with sensitivities higher than 0.5 and specificities above 0.7 in both cases. These forecasting models were highly determined by the athletes&#x00027; training completion, lower limb and trunk/lumbar injury history, as well as sFatigue, a training load marker.</p>
<sec>
<title>Machine Learning Framework</title>
<p>The conceptual model of injuries presented in <xref ref-type="fig" rid="F2">Figure 2</xref> is essential for developing an injury prediction machine learning framework. Practitioners who want to elaborate a data mining system need to have in mind the three conceptual components of injuries, <italic>i.e.</italic>, the stress applied to the athlete, the factors influencing the athletes&#x00027; stress response, and the stress response itself, which can lead to positive adaptations, or negative outcomes like injuries. They should define key monitoring variables reflecting each component of the conceptual model of injuries, which could vary depending on the context (exercise modality, athletes&#x00027; level, age, financial resources, etc.) and should be adapted to match the practitioners&#x00027; needs (Halson, <xref ref-type="bibr" rid="B31">2014</xref>; Starling et al., <xref ref-type="bibr" rid="B65">2020</xref>; Clarsen et al., <xref ref-type="bibr" rid="B17">2021</xref>). Monitored data should be stored in a database, as engineered features based on the data history are essential to reflect the time-dependent nature of injuries (Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>). Handling the database is easier when a problem conceptualization has been made (<xref ref-type="fig" rid="F2">Figure 2</xref>), and a data-mining plan (<xref ref-type="fig" rid="F1">Figure 1</xref>) has been established. This planning provides perspective on the problem and ensures data are monitored and stored purposefully.</p>
<p>Missing data have to be handled (Benson et al., <xref ref-type="bibr" rid="B8">2021</xref>). In this study, we used the simplest imputation form, which replaces the missing data with the most recent measurement. The imputation strategy does not minimize the errors and introduces information loss that could potentially be detrimental to the models&#x00027; predictive performances. Other imputation strategies that use mean values or regression techniques minimize the models&#x00027; error prediction (Benson et al., <xref ref-type="bibr" rid="B8">2021</xref>). However, these strategies are less optimal in the perspective of the models&#x00027; implementation and could introduce higher risks of information leakage and overfitting (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). To minimize the missing values&#x00027; imputation detrimental effects on future models&#x00027; performance, we will be looking at minimizing missing values, by measuring certain variables more frequently. It is a challenge to maximize measurement frequency without overwhelming the athletes (Starling and Lambert, <xref ref-type="bibr" rid="B64">2018</xref>). New monitoring technology to track the athletes&#x00027; HRV and neuromuscular function may represent an interesting avenue for that purpose (de Zambotti et al., <xref ref-type="bibr" rid="B19">2019</xref>; Roberts et al., <xref ref-type="bibr" rid="B54">2020</xref>). Practitioners are free to use the imputation strategies that better suit their needs. However, they should always keep in mind the models practical value and overfitting risks in their decision-making.</p>
<p>The ALL forecasting models show promising results, especially over short time windows of 1 and 2 days, where more than half the future injury days and more than 70% of future injury-free days can be predicted correctly by the models. Injuries can be compared to extreme meteorological events, which are particularly difficult to forecast (Ghil et al., <xref ref-type="bibr" rid="B27">2011</xref>; Nayak and Ghosh, <xref ref-type="bibr" rid="B50">2013</xref>; Boers et al., <xref ref-type="bibr" rid="B10">2014</xref>). Although modest, the results suggest that, in short time frames, the models could bring added information and help the performance support staff, providing better reaction time and facilitating their injury management decisions. In previous studies, reported sensitivity scores usually ranged between 0.65 and 0.85, while reported specificity scores ranged between 75 and 95% (McCullagh and Whitfort, <xref ref-type="bibr" rid="B46">2013</xref>; Whiteside et al., <xref ref-type="bibr" rid="B71">2016</xref>; Rossi et al., <xref ref-type="bibr" rid="B57">2017</xref>; L&#x000F3;pez-Valenciano et al., <xref ref-type="bibr" rid="B44">2018</xref>; Ayala et al., <xref ref-type="bibr" rid="B4">2019</xref>; Rommers et al., <xref ref-type="bibr" rid="B56">2020</xref>); every study differs in context and methodology.</p>
<p>In this study, methodological choices were made regarding to the models&#x00027; implementation in the speed skating environment, which could explain the lower sensitivities and specificities. We brought a novel approach that predicts the athletes&#x00027; health status in upcoming 1 to 7-day time windows. Inevitably, forecasting implies more uncertainty and diminished performance. Injury forecasting was preferred as it has more potential to help and assist the performance support staff. In the experimental setup, we replicated the models&#x00027; implementation process in the training environment, where models will be trained with all the available information and data history at hand and where new data entries, to predict future outcomes will be provided by the same athletes who participated in the models&#x00027; training process. Therefore, the test sets data was collected on the same athletes that contributed to train the models. We did not test the models&#x00027; ability to predict injuries from athletes that didn&#x00027;t contribute to train the models. The training sets&#x00027; engineered features were computed separately and are not influenced by the test set data. These choices introduce some form of overfitting and information leakage (Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>), because new data entries for injury predictions will inevitably be influenced by the athletes&#x00027; data history used to train the models. However, this process must be seen as a calibration process rather than a major hurdle or methodological issue. The models are calibrated to predict the upcoming injuries of the athletes within the Canadian National Female Short-Track Speed Skating Program and could only work in that particular context, with the specific athletes who took part in the data-mining process. Practitioners should adapt the framework to work in their specific environment and context, and with the athletes participating in their data-mining framework.</p>
</sec>
<sec>
<title>Comparing the Two Types of Models</title>
<p>This study appreciates the significant impact of introducing variables from all three components of the injury conceptual model on the injury forecasting performances. Keeping the methodological aspect constant, the forecasting models, composed of variables from the three components of the injury conceptual model (ALL forecasting models), have displayed significantly higher MCC (<italic>p</italic> &#x0003C; 0.01), with large Cohen effect sizes (d &#x0003E;1) on each of the seven forecasting windows, compared to models built only using training load (TL forecasting models). In fact, the TL forecasting models all had an MCC below zero. Therefore, they behave similarly to a model making predictions randomly and would not be helpful for the performance support staff.</p>
<p>The ALL forecasting models all have an MCC above zero. The MCC values gradually decrease to near zero (0.00 &#x000B1; 0.02) on 7-day forecasting windows. As the forecasting window gets larger, the ALL forecasting models&#x00027; ability to predict injury occurrences from the test set decreases. This is expected, especially for complex problems such as injuries, which are influenced by many interrelated variables and some random factors. For instance, injuries can occur accidentally by falling on the ice. These types of injuries are unfortunately impossible to predict. As the forecasting windows get larger, the influence of the different variables and random factors are multiplied, thus explaining the decrease observed in ALL forecasting models&#x00027; sensitivity and MCC. As for the ALL forecasting models&#x00027; specificity increase with larger forecasting windows, there are two main reasons. The first one is that patterns leading to an athlete staying healthy are probably better defined and predictable by the models. There are more cases where athletes are healthy, which facilitates pattern recognition. While injuries may be compared to extreme meteorological events, injury-free days resemble day-to-day weather, which is relatively predictable. The second reason is the cost function for injury misclassification, which ties the sensitivity and specificity together. As injuries get harder to predict, the only way to minimize the cost function with larger forecasting windows is to maximize the models&#x00027; specificity. We applied the same cost function for misclassification on every forecasting window. It would be interesting to test different cost functions depending on the forecasting windows.</p>
</sec>
<sec>
<title>Variables of Importance</title>
<p>It is crucial for the performance support staff to understand the variables that have the strongest influence on athletes&#x00027; injuries. These variables can provide valuable information on injury risk factors and guide their interventions (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Clarsen et al., <xref ref-type="bibr" rid="B17">2021</xref>; Van Eetvelde et al., <xref ref-type="bibr" rid="B67">2021</xref>). The random forest algorithms were useful to provide the most important variables [maximizing the minimum GINI decrease (Archer and Kimes, <xref ref-type="bibr" rid="B3">2008</xref>)] of the ALL forecasting models to forecast injuries over the seven time windows. The most important variables are derived from three main features: athletes&#x00027; ability to complete their training (training completion); lower limb and trunk/lumbar injuries; and sFatigue, an internal training load marker. The training completion variable is intimately tied to our definition of injury (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Clarsen et al., <xref ref-type="bibr" rid="B17">2021</xref>). Injuries impede the athletes&#x00027; ability to complete their training because of physical or psychological complaints. Tracking the athletes&#x00027; ability to complete their training sessions over time is crucial to identify future injuries. Lower limb injuries are another variable worth tracking in short-track speed skating. Lower limb asymmetries have already been pointed out as potential injury risk factors in this sport (Konieczny et al., <xref ref-type="bibr" rid="B40">2020</xref>). In addition, the athletes skate in a crouched posture, which could be responsible for lower back and lumbar discomfort (Hesford et al., <xref ref-type="bibr" rid="B33">2012</xref>). Both variables (training completion, as well as lower limbs and trunk/lumbar injuries) provide indices of the athletes&#x00027; health status, a factor influencing the athletes&#x00027; future stress response. The other monitored variables may not be as important for the ALL forecasting models&#x00027; predictions, but they certainly play a role in the forecasting process. As a matter of fact, sFatigue, an internal training load marker, was among the most important variables for five-day forecasting windows. In addition, as the forecasting windows got larger, the minimum GINI decrease coefficients of the important variables got smaller, potentially linked to the ALL forecasting models&#x00027; MCC and sensitivity decrease. Practitioners building models in different sports and contexts will necessarily observe different variables of importance.</p>
</sec>
<sec>
<title>Limitations and Future Perspectives</title>
<p>The injury conceptual model presented in <xref ref-type="fig" rid="F2">Figure 2</xref>, although useful to understand the interactions of different categories of variables and their influence, remains simplistic. The etiology of injuries remains much more complex (Meeuwisse et al., <xref ref-type="bibr" rid="B48">2007</xref>; Bittencourt et al., <xref ref-type="bibr" rid="B9">2016</xref>). We will keep improving the conceptual model of injuries, making it more robust and accurate, based on the continuous observations provided by the performance support staff. Improvements to the conceptual model will help orient the data mining framework and future variable monitoring process. Thus far, we have focused on the negative stress aspect of the conceptual model presented in <xref ref-type="fig" rid="F2">Figure 2</xref> (injuries). However, in the future, basing our work on the same conceptual model, we could explore the positive stress adaptations to predict performance through a similar framework.</p>
<p>The random forest forecasting models act as black boxes. Even if they provide some indicators of the variables of importance, they cannot assist the performance support staff in understanding how each variable influences the injury conceptual model. Nonetheless, the ALL forecasting windows on short time windows (1 or 2 days) are promising, and we believe they could add value to the performance support staff. We plan to implement the models similarly to meteorological models, assisting the performance support staff and providing injury warnings in the upcoming 1 to 7 days. Doing so could facilitate the performance support staff interventions. Much like meteorological models are used in real life, they do not override human judgment. Ultimately, decisions are based on all available information, some of which is not necessarily taken into consideration by the forecasting models. The implementation of the models will be closely documented to help validate and improve our framework.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>In conclusion, the framework presented in this study is a work in progress. It shows promising results for the use of machine learning techniques to forecast sports injury over short forecast time frames of 1 and 2 days. Further work is needed to determine if improved models could help better forecast injuries in time frames of 3 days or longer. The framework is in its early development stages. It will be refined and improved by monitoring additional variables reflecting the conceptual model of injury (<xref ref-type="fig" rid="F2">Figure 2</xref>), gathering more data, and fine-tuning the models&#x00027; hyperparameters. In addition, the forecasting models were built from the combination of collected data from 11 individual athletes. It would be valuable to individualize the models, as each athlete is different. The variables influencing the conceptual model of injuries, their interaction, and their importance may vary from one athlete to another.</p>
</sec>
<sec id="s6">
<title>Practical Applications</title>
<list list-type="simple">
<list-item><p>- This study presents a novel injury conceptual model (<xref ref-type="fig" rid="F2">Figure 2</xref>), essential to orient variable monitoring and guide data mining projects wishing to predict sports injuries.</p></list-item>
<list-item><p>- Monitoring different types of variables, quantifying the three components of the conceptual injury model, <italic>i.e.</italic>, the stress applied to the athletes, the factors influencing the stress response, and the stress response itself, improves the injury models&#x00027; forecasting performances.</p></list-item>
<list-item><p>- Specifically for athletes within the Canadian National Female Short-Track Speed Skating program who took part in the elaboration of this study&#x00027;s forecasting models, training completion, as well as lower limbs and trunk/lumbar injuries and sFatigue appear particularly important to predict future injuries.</p></list-item>
<list-item><p>- This study displays promising results from the perspective of forecasting models implementation to assist the Canadian National Female Short-Track Speed Skating program injury prevention process.</p></list-item>
</list>
</sec>
<sec sec-type="data-availability" id="s7">
<title>Data Availability Statement</title>
<p>An anonymised version of the raw data (to respect the athletes&#x00027; privacy) supporting the conclusions of this article will be made available by the authors, without undue reservation</p>
</sec>
<sec id="s8">
<title>Ethics Statement</title>
<p>Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was provided by the participants&#x00027; legal guardian/next of kin.</p>
</sec>
<sec id="s9">
<title>Author Contributions</title>
<p>JB worked on the framework elaboration, model development, conceptualization of the project, and manuscript writing. SD worked on data acquisition, communication with the performance support staff, writing of the manuscript, and conceptualization. SG worked on data acquisition and communication with the performance support staff. FB is the supervisor of the project at the Institut National du Sport du Qu&#x000E9;bec and overviewed each step of the project. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="s10">
<title>Funding</title>
<p>This project was funded by the Institut National du Sport du Qu&#x000E9;bec through the <italic>Programme de recherche, d&#x00027;innovation et de diffusion de l&#x00027;information</italic> (PRIDI &#x00023;2).</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>We would like to thank Speed Skating Canada, the coaches of the national training centre, and the athletes.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Adriaans</surname> <given-names>P.</given-names></name> <name><surname>Zantinge</surname> <given-names>D.</given-names></name></person-group> (<year>1997</year>). <source>Data Mining</source>. <publisher-loc>USA</publisher-loc>: <publisher-name>Addison-Wesley Longman Publishing Co., Inc</publisher-name>.</citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altini</surname> <given-names>M.</given-names></name> <name><surname>Van Hoof</surname> <given-names>C.</given-names></name> <name><surname>Amft</surname> <given-names>O.</given-names></name></person-group> (<year>2017</year>). <article-title>&#x0201C;Relation between estimated cardiorespiratory fitness and running performance in free-living: an analysis of HRV4Training data,&#x0201D;</article-title> in <source>2017 IEEE EMBS International Conference on Biomedical Health Informatics (BHI). ieeexplore.ieee.org</source>, p. 249&#x02013;52. <pub-id pub-id-type="doi">10.1109/BHI.2017.7897252</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Archer</surname> <given-names>K. J.</given-names></name> <name><surname>Kimes</surname> <given-names>R. V.</given-names></name></person-group> (<year>2008</year>). <article-title>Empirical characterization of random forest variable importance measures</article-title>. <source>Comput. Stat. Data Anal</source>. <volume>52</volume>, <fpage>2249</fpage>&#x02013;<lpage>2260</lpage>. <pub-id pub-id-type="doi">10.1016/j.csda.2007.08.015</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ayala</surname> <given-names>F.</given-names></name> <name><surname>L&#x000F3;pez-Valenciano</surname> <given-names>A.</given-names></name> <name><surname>G&#x000E1;mez Mart&#x000ED;n</surname> <given-names>J. A.</given-names></name> <name><surname>De Ste Croix</surname> <given-names>M.</given-names></name> <name><surname>Vera-Garcia</surname> <given-names>F. J.</given-names></name> <name><surname>Garc&#x000ED;a-Vaquero</surname> <given-names>M. D. P.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>A preventive model for hamstring injuries in professional soccer: learning algorithms</article-title>. <source>Int. J. Sports Med</source>. <volume>40</volume>, <fpage>344</fpage>&#x02013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1055/a-0826-1955</pub-id><pub-id pub-id-type="pmid">30873572</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bahr</surname> <given-names>R.</given-names></name> <name><surname>Clarsen</surname> <given-names>B.</given-names></name> <name><surname>Ekstrand</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Why we should focus on the burden of injuries and illnesses, not just their incidence</article-title>. <source>Br. J. Sports Med</source>. <volume>52</volume>, <fpage>1018</fpage>&#x02013;<lpage>1021</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2017-098160</pub-id><pub-id pub-id-type="pmid">29021247</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bahr</surname> <given-names>R.</given-names></name> <name><surname>Krosshaug</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>Understanding injury mechanisms: a key component of preventing injuries in sport</article-title>. <source>Br. J. Sports Med</source>. <volume>39</volume>, <fpage>324</fpage>&#x02013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.1136/bjsm.2005.018341</pub-id><pub-id pub-id-type="pmid">15911600</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banister</surname> <given-names>E. W.</given-names></name> <name><surname>Calvert</surname> <given-names>T. W.</given-names></name> <name><surname>Savage</surname> <given-names>M. V.</given-names></name> <name><surname>Bach</surname> <given-names>T. A.</given-names></name></person-group> (<year>1975</year>). <article-title>systems model of training for athletic performance</article-title>. <source>J. Sci. Med. Sport</source>. <volume>7</volume>, <fpage>57</fpage>&#x02013;<lpage>61</lpage>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benson</surname> <given-names>L. C.</given-names></name> <name><surname>Stilling</surname> <given-names>C.</given-names></name> <name><surname>Owoeye</surname> <given-names>O. B. A.</given-names></name> <name><surname>Emery</surname> <given-names>C. A.</given-names></name></person-group> (<year>2021</year>). <article-title>Evaluating methods for imputing missing data from longitudinal monitoring of athlete workload</article-title>. <source>J. Sports Sci. Med</source>. <volume>20</volume>, <fpage>187</fpage>&#x02013;<lpage>195</lpage>. <pub-id pub-id-type="doi">10.52082/jssm.2021.188</pub-id><pub-id pub-id-type="pmid">33948096</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bittencourt</surname> <given-names>N. F. N.</given-names></name> <name><surname>Meeuwisse</surname> <given-names>W. H.</given-names></name> <name><surname>Mendon&#x000E7;a</surname> <given-names>L. D.</given-names></name> <name><surname>Nettel-Aguirre</surname> <given-names>A.</given-names></name> <name><surname>Ocarino</surname> <given-names>J. M.</given-names></name> <name><surname>Fonseca</surname> <given-names>S. T.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Complex systems approach for sports injuries: moving from risk factor identification to injury pattern recognition&#x02014;narrative review and new concept</article-title>. <source>Br. J. Sports Med</source>. <volume>50</volume>, <fpage>1309</fpage>&#x02013;<lpage>1314</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2015-095850</pub-id><pub-id pub-id-type="pmid">27445362</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boers</surname> <given-names>N.</given-names></name> <name><surname>Bookhagen</surname> <given-names>B.</given-names></name> <name><surname>Barbosa</surname> <given-names>H. M. J.</given-names></name> <name><surname>Marwan</surname> <given-names>N.</given-names></name> <name><surname>Kurths</surname> <given-names>J.</given-names></name> <name><surname>Marengo</surname> <given-names>J. A.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Prediction of extreme floods in the eastern Central Andes based on a complex networks approach</article-title>. <source>Nat. Commun</source>. 5, 5199. <pub-id pub-id-type="doi">10.1038/ncomms6199</pub-id><pub-id pub-id-type="pmid">25310906</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boughorbel</surname> <given-names>S.</given-names></name> <name><surname>Jarray</surname> <given-names>F.</given-names></name> <name><surname>El-Anbari</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Optimal classifier for imbalanced data using matthews correlation coefficient metric</article-title>. <source>PLoS One</source>. 12, e0177678. <pub-id pub-id-type="doi">10.1371/journal.pone.0177678</pub-id><pub-id pub-id-type="pmid">28574989</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname> <given-names>L</given-names></name></person-group>. (<year>2001</year>). <article-title>Random forests</article-title>. <source>Mach. Learn</source>. <volume>45</volume>, <fpage>5</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cabitza</surname> <given-names>F.</given-names></name> <name><surname>Locoro</surname> <given-names>A.</given-names></name> <name><surname>Banfi</surname> <given-names>G.</given-names></name></person-group> (<year>2018</year>). <article-title>Machine learning in orthopedics: a literature review</article-title>. <source>Front. Bioeng. Biotechnol</source>. 6, 75. <pub-id pub-id-type="doi">10.3389/fbioe.2018.00075</pub-id><pub-id pub-id-type="pmid">29998104</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carey</surname> <given-names>D. L.</given-names></name> <name><surname>Ong</surname> <given-names>K.</given-names></name> <name><surname>Whiteley</surname> <given-names>R.</given-names></name> <name><surname>Crossley</surname> <given-names>K. M.</given-names></name> <name><surname>Crow</surname> <given-names>J.</given-names></name> <name><surname>Morris</surname> <given-names>M. E.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Predictive modelling of training loads and injury in australian football</article-title>. <source>Int. J. Comput. Sci. Sport</source>. <volume>17</volume>, <fpage>49</fpage>&#x02013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.2478/ijcss-2018-0002</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clarsen</surname> <given-names>B.</given-names></name> <name><surname>Krosshaug</surname> <given-names>T.</given-names></name> <name><surname>Bahr</surname> <given-names>R.</given-names></name></person-group> (<year>2010</year>). <article-title>Overuse injuries in professional road cyclists</article-title>. <source>Am. J. Sports Med</source>. <volume>38</volume>, <fpage>2494</fpage>&#x02013;<lpage>2501</lpage>. <pub-id pub-id-type="doi">10.1177/0363546510376816</pub-id><pub-id pub-id-type="pmid">20847225</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clarsen</surname> <given-names>B.</given-names></name> <name><surname>Myklebust</surname> <given-names>G.</given-names></name> <name><surname>Bahr</surname> <given-names>R.</given-names></name></person-group> (<year>2013</year>). <article-title>Development and validation of a new method for the registration of overuse injuries in sports injury epidemiology: the Oslo Sports Trauma Research Centre (OSTRC) overuse injury questionnaire</article-title>. <source>Br. J. Sports Med</source>. <volume>47</volume>, <fpage>495</fpage>&#x02013;<lpage>502</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2012-091524</pub-id><pub-id pub-id-type="pmid">23038786</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clarsen</surname> <given-names>B.</given-names></name> <name><surname>Steffen</surname> <given-names>K.</given-names></name> <name><surname>Berge</surname> <given-names>H. M.</given-names></name> <name><surname>Bendiksen</surname> <given-names>F.</given-names></name> <name><surname>Fossan</surname> <given-names>B.</given-names></name> <name><surname>Fredriksen</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>). <article-title>Methods, challenges benefits of a health monitoring programme for Norwegian, Olympic, Paralympic athletes: the road from London 2012 to Tokyo 2020</article-title>. <source>Br. J. Sports Med.</source> <volume>21</volume>, <fpage>103717</fpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2020-103717</pub-id><pub-id pub-id-type="pmid">34039584</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>J</given-names></name></person-group>. (<year>1994</year>). <article-title>The earth is round (p &#x0003C;0.05)</article-title>. <source>Am Psychol</source>. <volume>49</volume>, <fpage>997</fpage>&#x02013;<lpage>1003</lpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.49.12.997</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Zambotti</surname> <given-names>M.</given-names></name> <name><surname>Rosas</surname> <given-names>L.</given-names></name> <name><surname>Colrain</surname> <given-names>I. M.</given-names></name> <name><surname>Baker</surname> <given-names>F. C.</given-names></name></person-group> (<year>2019</year>). <article-title>The Sleep of the ring: comparison of the OURA sleep tracker against polysomnography</article-title>. <source>Behav. Sleep. Med</source>. <volume>17</volume>, <fpage>124</fpage>&#x02013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.1080/15402002.2017.1300587</pub-id><pub-id pub-id-type="pmid">28323455</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>D&#x00027;Isanto</surname> <given-names>T</given-names></name></person-group>. (<year>2019</year>). <article-title>Campania MIUR Italy, D&#x00027;Elia F, Raiola G, Altavilla G, University of Salerno, Department of Human, Philosophical and Education, Salerno, Italy, Assessment of sport performance: theoretical aspects and practical indications</article-title>. <source>Sport Mont</source>. <volume>17</volume>, <fpage>79</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.26773/smj.190214</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunbar</surname> <given-names>C. C.</given-names></name> <name><surname>Robertson</surname> <given-names>R. J.</given-names></name> <name><surname>Baun</surname> <given-names>R.</given-names></name> <name><surname>Blandin</surname> <given-names>M. F.</given-names></name> <name><surname>Metz</surname> <given-names>K.</given-names></name> <name><surname>Burdett</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>1992</year>). <article-title>The validity of regulating exercise intensity by ratings of perceived exertion</article-title>. <source>Med. Sci. Sports. Exerc</source>. <volume>24</volume>, <fpage>94</fpage>&#x02013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1249/00005768-199201000-00016</pub-id><pub-id pub-id-type="pmid">17313260</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foster</surname> <given-names>C.</given-names></name> <name><surname>Rodriguez-Marroyo</surname> <given-names>J. A.</given-names></name> <name><surname>Koning</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <article-title>Monitoring training loads: the past, the present, and the future</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>12</volume>, <fpage>S22</fpage>&#x02013;<lpage>S28</lpage>. <pub-id pub-id-type="doi">10.1123/IJSPP.2016-0388</pub-id><pub-id pub-id-type="pmid">28253038</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fritz</surname> <given-names>C. O.</given-names></name> <name><surname>Morris</surname> <given-names>P. E.</given-names></name> <name><surname>Richler</surname> <given-names>J. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Effect size estimates: current use, calculations, and interpretation</article-title>. <source>J. Exp. Psychol. Gen</source>. <volume>141</volume>, <fpage>2</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1037/a0024338</pub-id><pub-id pub-id-type="pmid">21823805</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gabbett</surname> <given-names>T. J.</given-names></name> <name><surname>Hulin</surname> <given-names>B. T.</given-names></name> <name><surname>Blanch</surname> <given-names>P.</given-names></name> <name><surname>Whiteley</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>High training workloads alone do not cause sports injuries: how you get there is the real issue</article-title>. <source>Br. J. Sports Med</source>. <volume>50</volume>, <fpage>444</fpage>&#x02013;<lpage>445</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2015-095567</pub-id><pub-id pub-id-type="pmid">26795610</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gabbett</surname> <given-names>T. J.</given-names></name> <name><surname>Jenkins</surname> <given-names>D. G.</given-names></name></person-group> (<year>2011</year>). <article-title>Relationship between training load and injury in professional rugby league players</article-title>. <source>J. Sci. Med. Sport</source>. <volume>14</volume>, <fpage>204</fpage>&#x02013;<lpage>209</lpage>. <pub-id pub-id-type="doi">10.1016/j.jsams.2010.12.002</pub-id><pub-id pub-id-type="pmid">21256078</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gathercole</surname> <given-names>R.</given-names></name> <name><surname>Sporer</surname> <given-names>B.</given-names></name> <name><surname>Stellingwerff</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <article-title>Countermovement jump performance with increased training loads in elite female rugby athletes</article-title>. <source>Int. J. Sports Med</source>. <volume>36</volume>, <fpage>722</fpage>&#x02013;<lpage>728</lpage>. <pub-id pub-id-type="doi">10.1055/s-0035-1547262</pub-id><pub-id pub-id-type="pmid">25831403</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghil</surname> <given-names>M.</given-names></name> <name><surname>Yiou</surname> <given-names>P.</given-names></name> <name><surname>Hallegatte</surname> <given-names>S.</given-names></name> <name><surname>Malamud</surname> <given-names>B. D.</given-names></name> <name><surname>Naveau</surname> <given-names>P.</given-names></name> <name><surname>Soloviev</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Extreme events: dynamics, statistics and prediction</article-title>. <source>Nonlinear Process. Geophys</source>. <volume>18</volume>, <fpage>295</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.5194/npg-18-295-2011</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goessl</surname> <given-names>V. C.</given-names></name> <name><surname>Curtiss</surname> <given-names>J. E.</given-names></name> <name><surname>Hofmann</surname> <given-names>S. G.</given-names></name></person-group> (<year>2017</year>). <article-title>The effect of heart rate variability biofeedback training on stress and anxiety: a meta-analysis</article-title>. <source>Psychol. Med</source>. <volume>47</volume>, <fpage>2578</fpage>&#x02013;<lpage>2586</lpage>. <pub-id pub-id-type="doi">10.1017/S0033291717001003</pub-id><pub-id pub-id-type="pmid">28478782</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gould</surname> <given-names>D.</given-names></name> <name><surname>Guinan</surname> <given-names>D.</given-names></name> <name><surname>Greenleaf</surname> <given-names>C.</given-names></name> <name><surname>Medbery</surname> <given-names>R.</given-names></name> <name><surname>Peterson</surname> <given-names>K.</given-names></name></person-group> (<year>1999</year>). <article-title>Factors affecting olympic performance: perceptions of athletes and coaches from more and less successful teams</article-title>. <source>Sport Psychol</source>. <volume>13</volume>, <fpage>371</fpage>&#x02013;<lpage>394</lpage>. <pub-id pub-id-type="doi">10.1123/tsp.13.4.371</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hall</surname> <given-names>M.</given-names></name> <name><surname>Frank</surname> <given-names>E.</given-names></name> <name><surname>Holmes</surname> <given-names>G.</given-names></name> <name><surname>Pfahringer</surname> <given-names>B.</given-names></name> <name><surname>Reutemann</surname> <given-names>P.</given-names></name> <name><surname>Witten</surname> <given-names>I. H.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The WEKA data mining software: an update</article-title>. <source>SIGKDD Explor Newsl</source>. <volume>11</volume>, <fpage>10</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1145/1656274.1656278</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halson</surname> <given-names>S. L</given-names></name></person-group>. (<year>2014</year>). <article-title>Monitoring training load to understand fatigue in athletes</article-title>. <source>Sports Med</source>. <volume>2</volume>, <fpage>S139</fpage>&#x02013;<lpage>S147</lpage>. <pub-id pub-id-type="doi">10.1007/s40279-014-0253-z</pub-id><pub-id pub-id-type="pmid">25200666</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haug</surname> <given-names>W. B.</given-names></name> <name><surname>Drinkwater</surname> <given-names>E. J.</given-names></name> <name><surname>Mitchell</surname> <given-names>L. J.</given-names></name> <name><surname>Chapman</surname> <given-names>D. W.</given-names></name></person-group> (<year>2015</year>). <article-title>The relationship between start performance and race outcome in elite 500-m short-track speed skating</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>10</volume>, <fpage>902</fpage>&#x02013;<lpage>906</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2014-0504</pub-id><pub-id pub-id-type="pmid">25710521</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hesford</surname> <given-names>C. M.</given-names></name> <name><surname>Laing</surname> <given-names>S. J.</given-names></name> <name><surname>Cardinale</surname> <given-names>M.</given-names></name> <name><surname>Cooper</surname> <given-names>C. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Asymmetry of quadriceps muscle oxygenation during elite short-track speed skating</article-title>. <source>Med. Sci. Sports Exerc</source>. <volume>44</volume>, <fpage>501</fpage>&#x02013;<lpage>508</lpage>. <pub-id pub-id-type="doi">10.1249/MSS.0b013e31822f8942</pub-id><pub-id pub-id-type="pmid">21900848</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname> <given-names>C. C</given-names></name></person-group>. (<year>2004</year>). <article-title>Forecasting seasonals and trends by exponentially weighted moving averages</article-title>. <source>Int. J. Forecast</source>. <volume>20</volume>, <fpage>5</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijforecast.2003.09.015</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hulin</surname> <given-names>B. T.</given-names></name> <name><surname>Gabbett</surname> <given-names>T. J.</given-names></name> <name><surname>Blanch</surname> <given-names>P.</given-names></name> <name><surname>Chapman</surname> <given-names>P.</given-names></name> <name><surname>Bailey</surname> <given-names>D.</given-names></name> <name><surname>Orchard</surname> <given-names>J. W.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Spikes in acute workload are associated with increased injury risk in elite cricket fast bowlers</article-title>. <source>Br. J. Sports Med</source>. <volume>48</volume>, <fpage>708</fpage>&#x02013;<lpage>712</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2013-092524</pub-id><pub-id pub-id-type="pmid">23962877</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Issurin</surname> <given-names>V.</given-names></name> <name><surname>Lustig</surname> <given-names>G.</given-names></name> <name><surname>Szopa</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Ereditariet&#x000E0; e allenabilit&#x000E0;</article-title>. <source>SDS</source>. 65, 43. <pub-id pub-id-type="doi">10.2165/11319770-000000000-00000</pub-id><pub-id pub-id-type="pmid">20199119</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Jovanovic</surname> <given-names>M</given-names></name></person-group>. (<year>2018</year>). <source>Predicting non-contact hamstring injuries by using training load data and ma-chine learning methods, p. 2</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.complementarytraining/net/predicting/hamstring/injuries">www.complementarytraining/net/predicting/hamstring/injuries</ext-link> (accessed March 13, 2022).</citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Junge</surname> <given-names>A</given-names></name></person-group>. (<year>2000</year>). <article-title>The influence of psychological factors on sports injuries</article-title>. <source>Am. J. Sports Med.</source> <volume>28</volume>, <fpage>10</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1177/28.suppl_5.s-10</pub-id><pub-id pub-id-type="pmid">11032102</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>K.</given-names></name> <name><surname>Kim</surname> <given-names>J. S.</given-names></name> <name><surname>Purevsuren</surname> <given-names>T.</given-names></name> <name><surname>Khuyagbaatar</surname> <given-names>B.</given-names></name> <name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Kim</surname> <given-names>Y. H.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>New method to evaluate three-dimensional push-off angle during short-track speed skating using wearable inertial measurement unit sensors</article-title>. <source>Proc. Inst. Mech. Eng. H</source>. <volume>233</volume>, <fpage>476</fpage>&#x02013;<lpage>480</lpage>. <pub-id pub-id-type="doi">10.1177/0954411919831309</pub-id><pub-id pub-id-type="pmid">30773989</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konieczny</surname> <given-names>M.</given-names></name> <name><surname>Pakosz</surname> <given-names>P.</given-names></name> <name><surname>Witkowski</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Asymmetrical fatiguing of the gluteus maximus muscles in the elite short-track female skaters</article-title>. <source>BMC Sports Sci. Med. Rehabil</source>. 12, 48. <pub-id pub-id-type="doi">10.1186/s13102-020-00193-w</pub-id><pub-id pub-id-type="pmid">32818057</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konings</surname> <given-names>M. J.</given-names></name> <name><surname>Hettinga</surname> <given-names>F. J.</given-names></name></person-group> (<year>2018</year>). <article-title>The effect of preceding race efforts on pacing and short-track speed skating performance</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>13</volume>, <fpage>970</fpage>&#x02013;<lpage>976</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2017-0637</pub-id><pub-id pub-id-type="pmid">29345552</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konings</surname> <given-names>M. J.</given-names></name> <name><surname>Noorbergen</surname> <given-names>O. S.</given-names></name> <name><surname>Parry</surname> <given-names>D.</given-names></name> <name><surname>Hettinga</surname> <given-names>F. J.</given-names></name></person-group> (<year>2016</year>). <article-title>Pacing behavior and tactical positioning in 1500-m short-track speed skating</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>11</volume>, <fpage>122</fpage>&#x02013;<lpage>129</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2015-0137</pub-id><pub-id pub-id-type="pmid">26062042</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lian</surname> <given-names>O. B.</given-names></name> <name><surname>Engebretsen</surname> <given-names>L.</given-names></name> <name><surname>Bahr</surname> <given-names>R.</given-names></name></person-group> (<year>2005</year>). <article-title>Prevalence of jumper&#x00027;s knee among elite athletes from different sports: a cross-sectional study</article-title>. <source>Am. J. Sports Med</source>. <volume>33</volume>, <fpage>561</fpage>&#x02013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.1177/0363546504270454</pub-id><pub-id pub-id-type="pmid">15722279</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x000F3;pez-Valenciano</surname> <given-names>A.</given-names></name> <name><surname>Ayala</surname> <given-names>F.</given-names></name> <name><surname>Puerta</surname> <given-names>J. M.</given-names></name> <name><surname>Ste Croix</surname> <given-names>D. E.</given-names></name></person-group> (<year>2018</year>). <article-title>A preventive model for muscle injuries: a novel approach based on learning algorithms</article-title>. <source>Med. Sci. Sports Exerc</source>. <volume>50</volume>, <fpage>915</fpage>&#x02013;<lpage>927</lpage>. <pub-id pub-id-type="doi">10.1249/MSS.0000000000001535</pub-id><pub-id pub-id-type="pmid">29283933</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matthews</surname> <given-names>B. W</given-names></name></person-group>. (<year>1975</year>). <article-title>Comparison of the predicted and observed secondary structure of T4 phage lysozyme</article-title>. <source>Biochim. Biophys. Acta</source>. <volume>405</volume>, <fpage>442</fpage>&#x02013;<lpage>451</lpage>. <pub-id pub-id-type="doi">10.1016/0005-2795(75)90109-9</pub-id><pub-id pub-id-type="pmid">1180967</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCullagh</surname> <given-names>J.</given-names></name> <name><surname>Whitfort</surname> <given-names>T.</given-names></name></person-group> (<year>2013</year>). <article-title>An investigation into the application of artificial neural networks to the prediction of injuries in sport</article-title>. <source>Int. J. Sport Health Sci</source>. <volume>7</volume>, <fpage>356</fpage>&#x02013;<lpage>360</lpage>. <pub-id pub-id-type="doi">10.5281/zenodo.1086967</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meek</surname> <given-names>G. E.</given-names></name> <name><surname>Mendenhall</surname> <given-names>W.</given-names></name> <name><surname>Scheaffer</surname> <given-names>R. L.</given-names></name> <name><surname>Wackerly</surname> <given-names>D. D.</given-names></name></person-group> (<year>1987</year>). <article-title>Mathematical statistics with applications</article-title>. <source>Technometrics</source>. 29, 384. <pub-id pub-id-type="doi">10.1080/00401706.1987.10488256</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meeuwisse</surname> <given-names>W. H.</given-names></name> <name><surname>Tyreman</surname> <given-names>H.</given-names></name> <name><surname>Hagel</surname> <given-names>B.</given-names></name> <name><surname>Emery</surname> <given-names>C. A.</given-names></name></person-group> (<year>2007</year>). <article-title>dynamic model of aetiology in sport injury: the recursive nature of risk and causation</article-title>. <source>Clin. J. Sport Med</source>. <volume>17</volume>, <fpage>215</fpage>&#x02013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.1097/JSM.0b013e3180592a48</pub-id><pub-id pub-id-type="pmid">17513916</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menting</surname> <given-names>S. G. P.</given-names></name> <name><surname>Konings</surname> <given-names>M. J.</given-names></name> <name><surname>Elferink-Gemser</surname> <given-names>M. T.</given-names></name> <name><surname>Hettinga</surname> <given-names>F. J.</given-names></name></person-group> (<year>2019</year>). <article-title>Pacing behavior of elite youth athletes: analyzing 1500-m short-track speed skating</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>14</volume>, <fpage>222</fpage>&#x02013;<lpage>231</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2018-0285</pub-id><pub-id pub-id-type="pmid">30039992</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nayak</surname> <given-names>M. A.</given-names></name> <name><surname>Ghosh</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Prediction of extreme rainfall event using weather pattern recognition and support vector machine classifier</article-title>. <source>Theor. Appl. Climatol</source>. <volume>114</volume>, <fpage>583</fpage>&#x02013;<lpage>603</lpage>. <pub-id pub-id-type="doi">10.1007/s00704-013-0867-3</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oliver</surname> <given-names>J. L.</given-names></name> <name><surname>Ayala</surname> <given-names>F.</given-names></name> <name><surname>De Ste Croix</surname> <given-names>M. B. A.</given-names></name> <name><surname>Lloyd</surname> <given-names>R. S.</given-names></name> <name><surname>Myer</surname> <given-names>G. D.</given-names></name> <name><surname>Read</surname> <given-names>P. J.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Using machine learning to improve our understanding of injury risk and prediction in elite male youth football players</article-title>. <source>J. Sci. Med. Sport</source> [Internet]. 04, 021. <pub-id pub-id-type="doi">10.1016/j.jsams.2020.04.021</pub-id><pub-id pub-id-type="pmid">32482610</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Powers</surname> <given-names>D. M. W</given-names></name></person-group>. (<year>2020</year>). <source>Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation [Internet]. arXiv [cs.LG]. 2020</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/2010">http://arxiv.org/abs/2010</ext-link>, p. 16061</citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Renstr&#x000F6;m</surname> <given-names>P.</given-names></name> <name><surname>Johnson</surname> <given-names>R. J.</given-names></name></person-group> (<year>1985</year>). <article-title>Overuse injuries in Sports</article-title>. <source>Sports Med</source>. <volume>2</volume>, <fpage>316</fpage>&#x02013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.2165/00007256-198502050-00002</pub-id><pub-id pub-id-type="pmid">3901173</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname> <given-names>D. M.</given-names></name> <name><surname>Schade</surname> <given-names>M. M.</given-names></name> <name><surname>Mathew</surname> <given-names>G. M.</given-names></name> <name><surname>Gartenberg</surname> <given-names>D.</given-names></name> <name><surname>Buxton</surname> <given-names>O. M.</given-names></name></person-group> (<year>2020</year>). <article-title>Detecting sleep using heart rate and motion data from multisensor consumer-grade wearables, relative to wrist actigraphy and polysomnography</article-title>. <source>Sleep</source> [Internet]. 43, 97. <pub-id pub-id-type="doi">10.1093/sleep/zsaa045</pub-id><pub-id pub-id-type="pmid">32215550</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rodas</surname> <given-names>G.</given-names></name> <name><surname>Osaba</surname> <given-names>L.</given-names></name> <name><surname>Arteta</surname> <given-names>D.</given-names></name> <name><surname>Pruna</surname> <given-names>R.</given-names></name> <name><surname>Fern&#x000E1;ndez</surname> <given-names>D.</given-names></name> <name><surname>Lucia</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Genomic prediction of tendinopathy risk in Elite Team, Sports</article-title>. <source>Int. J. Sports Physiol. Perform.</source> <volume>9</volume>, <fpage>1</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2019-0431</pub-id><pub-id pub-id-type="pmid">31615970</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rommers</surname> <given-names>N.</given-names></name> <name><surname>R&#x000D6;ssler</surname> <given-names>R.</given-names></name> <name><surname>Verhagen</surname> <given-names>E.</given-names></name> <name><surname>Vandecasteele</surname> <given-names>F.</given-names></name> <name><surname>Verstockt</surname> <given-names>S.</given-names></name> <name><surname>Vaeyens</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>A machine learning approach to assess injury risk in elite youth football players</article-title>. <source>Med. Sci. Sport. Exerc</source>. <volume>52</volume>, <fpage>1745</fpage>&#x02013;<lpage>1751</lpage>. <pub-id pub-id-type="doi">10.1249/MSS.0000000000002305</pub-id><pub-id pub-id-type="pmid">32079917</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Rossi</surname> <given-names>A.</given-names></name> <name><surname>Pappalardo</surname> <given-names>L.</given-names></name> <name><surname>Cintia</surname> <given-names>P.</given-names></name> <name><surname>Iaia</surname> <given-names>M.</given-names></name> <name><surname>Fernandez</surname> <given-names>J.</given-names></name> <name><surname>Medina</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Effective injury forecasting in soccer with GPS training data and machine learning [Internet]</article-title>. <source>arXiv [stat.ML]</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/170508079">http://arxiv.org/abs/170508079</ext-link><pub-id pub-id-type="pmid">30044858</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruddy</surname> <given-names>J. D.</given-names></name> <name><surname>Shield</surname> <given-names>A. J.</given-names></name> <name><surname>Maniar</surname> <given-names>N.</given-names></name> <name><surname>Williams</surname> <given-names>M. D.</given-names></name> <name><surname>Duhig</surname> <given-names>S.</given-names></name> <name><surname>Timmins</surname> <given-names>R. G.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Predictive modeling of hamstring strain injuries in elite australian footballers</article-title>. <source>Med. Sci. Sports Exerc</source>. <volume>50</volume>, <fpage>906</fpage>&#x02013;<lpage>914</lpage>. <pub-id pub-id-type="doi">10.1249/MSS.0000000000001527</pub-id><pub-id pub-id-type="pmid">29266094</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selye</surname> <given-names>H</given-names></name></person-group>. (<year>1950</year>). <article-title>Stress and the general adaptation syndrome</article-title>. <source>Br Med J</source>. <volume>1</volume>, <fpage>1383</fpage>&#x02013;<lpage>1392</lpage>. <pub-id pub-id-type="doi">10.1136/bmj.1.4667.1383</pub-id><pub-id pub-id-type="pmid">15426759</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Seyd</surname> <given-names>P. T. A.</given-names></name> <name><surname>Ahamed</surname> <given-names>V. I. T.</given-names></name> <name><surname>Jacob</surname> <given-names>J.</given-names></name> <name><surname>Joseph</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Time and frequency domain analysis of heart rate variability and their correlations in diabetes mellitus</article-title>. <source>Int. J. Biologic. Life Sci</source>. <volume>4</volume>, <fpage>24</fpage>&#x02013;<lpage>27</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.307.6260&#x00026;rep=rep1&#x00026;type=pdf">https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.307.6260&#x00026;rep=rep1&#x00026;type=pdf</ext-link><pub-id pub-id-type="pmid">9359550</pub-id></citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shrier</surname> <given-names>I.</given-names></name> <name><surname>Hall&#x000E9;</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Psychological predictors of injuries in circus artists: an exploratory study</article-title>. <source>Br. J. Sports Med</source>. <volume>45</volume>, <fpage>433</fpage>&#x02013;<lpage>436</lpage>. <pub-id pub-id-type="doi">10.1136/bjsm.2009.067751</pub-id><pub-id pub-id-type="pmid">21047839</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soligard</surname> <given-names>T.</given-names></name> <name><surname>Schwellnus</surname> <given-names>M.</given-names></name> <name><surname>Alonso</surname> <given-names>J. M</given-names></name> <name><surname>Bahr</surname> <given-names>R.</given-names></name> <name><surname>Clarsen</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>How much is too much? (Part 1) International Olympic Committee consensus statement on load in sport and risk of injury</article-title>. <source>Br. J. Sports Med</source>. <volume>50</volume>, <fpage>1030</fpage>&#x02013;<lpage>1041</lpage>. <pub-id pub-id-type="doi">10.1136/bjsports-2016-096581</pub-id><pub-id pub-id-type="pmid">27535989</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stares</surname> <given-names>J.</given-names></name> <name><surname>Dawson</surname> <given-names>B.</given-names></name> <name><surname>Peeling</surname> <given-names>P.</given-names></name> <name><surname>Heasman</surname> <given-names>J.</given-names></name> <name><surname>Rogalski</surname> <given-names>B.</given-names></name> <name><surname>Drew</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Identifying high risk loading conditions for in-season injury in elite Australian football players</article-title>. <source>J. Sci. Med. Sport</source> [Internet]. 5, 12. <pub-id pub-id-type="doi">10.1016/j.jsams.2017.05.012</pub-id><pub-id pub-id-type="pmid">28601588</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Starling</surname> <given-names>L. T.</given-names></name> <name><surname>Lambert</surname> <given-names>M. I.</given-names></name></person-group> (<year>2018</year>). <article-title>Monitoring rugby players for fitness and fatigue: what do coaches want?</article-title> <source>Int. J. Sports Physiol. Perform</source>. <volume>13</volume>, <fpage>777</fpage>&#x02013;<lpage>782</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2017-0416</pub-id><pub-id pub-id-type="pmid">29140145</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Starling</surname> <given-names>L. T.</given-names></name> <name><surname>Nellemann</surname> <given-names>S.</given-names></name> <name><surname>Parkes</surname> <given-names>A.</given-names></name> <name><surname>Lambert</surname> <given-names>M. I.</given-names></name></person-group> (<year>2020</year>). <article-title>The fatigue and fitness test for teams (FFITT): a practical option for monitoring athletes in a team as individuals</article-title>. <source>EJSS</source>. <volume>20</volume>, <fpage>106</fpage>&#x02013;<lpage>114</lpage>. <pub-id pub-id-type="doi">10.1080/17461391.2019.1612951</pub-id><pub-id pub-id-type="pmid">31109238</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thornton</surname> <given-names>H. R.</given-names></name> <name><surname>Delaney</surname> <given-names>J. A.</given-names></name> <name><surname>Duthie</surname> <given-names>G. M.</given-names></name> <name><surname>Dascombe</surname> <given-names>B. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Importance of various training-load measures in injury incidence of professional rugby league athletes</article-title>. <source>Int. J. Sports Physiol. Perform</source>. <volume>12</volume>, <fpage>819</fpage>&#x02013;<lpage>824</lpage>. <pub-id pub-id-type="doi">10.1123/ijspp.2016-0326</pub-id><pub-id pub-id-type="pmid">27918659</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Eetvelde</surname> <given-names>H.</given-names></name> <name><surname>Mendon&#x000E7;a</surname> <given-names>L. D.</given-names></name> <name><surname>Ley</surname> <given-names>C.</given-names></name> <name><surname>Seil</surname> <given-names>R.</given-names></name> <name><surname>Tischer</surname> <given-names>T.</given-names></name></person-group> (<year>2021</year>). <article-title>Machine learning methods in sport injury prediction and prevention: a systematic review</article-title>. <source>J. Exp. Orthop</source>. 8, 27. <pub-id pub-id-type="doi">10.1186/s40634-021-00346-x</pub-id><pub-id pub-id-type="pmid">33855647</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Visa</surname> <given-names>S.</given-names></name> <name><surname>Ramsay</surname> <given-names>B.</given-names></name> <name><surname>Ralescu</surname> <given-names>A. L.</given-names></name> <name><surname>Van Der Knaap</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Confusion matrix-based feature selection</article-title>. <source>MAICS</source>. <volume>710</volume>, <fpage>120</fpage>&#x02013;<lpage>127</lpage>.</citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wald&#x000E9;n</surname> <given-names>M.</given-names></name> <name><surname>H&#x000E4;gglund</surname> <given-names>M.</given-names></name> <name><surname>Ekstrand</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>Injuries in Swedish elite football&#x02013;a prospective study on injury definitions, risk for injury and injury pattern during 2001</article-title>. <source>Scand. J. Med. Sci. Sports</source>. <volume>15</volume>, <fpage>118</fpage>&#x02013;<lpage>125</lpage>. <pub-id pub-id-type="doi">10.1111/j.1600-0838.2004.00393.x</pub-id><pub-id pub-id-type="pmid">15773867</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H. M.</given-names></name> <name><surname>Huang</surname> <given-names>S. C.</given-names></name></person-group> (<year>2012</year>). <article-title>SDNN/RMSSD as a surrogate for LF/HF: a revised investigation</article-title>. <source>Model Simul. Eng</source>. <volume>12</volume>, <fpage>1</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1155/2012/931943</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whiteside</surname> <given-names>D.</given-names></name> <name><surname>Martini</surname> <given-names>D. N.</given-names></name> <name><surname>Lepley</surname> <given-names>A. S.</given-names></name> <name><surname>Zernicke</surname> <given-names>R. F.</given-names></name> <name><surname>Goulet</surname> <given-names>G. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Predictors of ulnar collateral ligament reconstruction in major league baseball pitchers</article-title>. <source>Am. J. Sports Med</source>. <volume>44</volume>, <fpage>2202</fpage>&#x02013;<lpage>2209</lpage>. <pub-id pub-id-type="doi">10.1177/0363546516643812</pub-id><pub-id pub-id-type="pmid">27159303</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>M.</given-names></name> <name><surname>Wortman Vaughan</surname> <given-names>J.</given-names></name> <name><surname>Wallach</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Understanding the effect of accuracy on trust in machine learning models,&#x0201D;</article-title> in <source>Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Association for Computing Machinery</publisher-name>), p. 1&#x02013;12. <pub-id pub-id-type="doi">10.1145/3290605.3300509</pub-id></citation>
</ref>
</ref-list>
</back>
</article>