<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurol.</journal-id>
<journal-title>Frontiers in Neurology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurol.</abbrev-journal-title>
<issn pub-type="epub">1664-2295</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fneur.2023.1234046</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neurology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Interpretable machine learning for prediction of clinical outcomes in acute ischemic stroke</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Lee</surname> <given-names>Joonwon</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2423212/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Park</surname> <given-names>Kang Min</given-names></name>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1306925/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Park</surname> <given-names>Seongho</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2334518/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Neurology, Haeundae Paik Hospital, Inje University College of Medicine</institution>, <addr-line>Busan</addr-line>, <country>Republic of Korea</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Ana Catarina Fonseca, University of Lisbon, Portugal</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Shubham Misra, Yale University, United States; Jiang Li, Geisinger Medical Center, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Seongho Park <email>risepsh&#x00040;gmail.com</email></corresp>
<fn fn-type="other" id="fn001"><p>&#x02020;ORCID: Joonwon Lee <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-0496-2392">orcid.org/0000-0002-0496-2392</ext-link></p></fn>
<fn fn-type="other" id="fn002"><p>Kang Min Park <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-4229-7741">orcid.org/0000-0002-4229-7741</ext-link></p></fn>
<fn fn-type="other" id="fn003"><p>Seongho Park <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-0504-2539">orcid.org/0000-0002-0504-2539</ext-link></p></fn></author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>09</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1234046</elocation-id>
<history>
<date date-type="received">
<day>03</day>
<month>06</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>08</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2023 Lee, Park and Park.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Lee, Park and Park</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<sec>
<title>Background and aims</title>
<p>Predicting the prognosis of acute ischemic stroke (AIS) is crucial in a clinical setting for establishing suitable treatment plans. This study aimed to develop and validate a machine learning (ML) model that predicts the functional outcome of AIS patients and provides interpretable insights.</p></sec>
<sec>
<title>Methods</title>
<p>We included AIS patients from a multicenter stroke registry in this prognostic study. ML-based methods were utilized to predict 3-month functional outcomes, which were categorized as either favorable [modified Rankin Scale (mRS) &#x02264; 2] or unfavorable (mRS &#x02265; 3). The SHapley Additive exPlanations (SHAP) method was employed to identify significant features and interpret their contributions to the predictions of the model.</p></sec>
<sec>
<title>Results</title>
<p>The dataset comprised a derivation set of 3,687 patients and two external validation sets totaling 250 and 110 patients each. Among them, the number of unfavorable outcomes was 1,123 (30.4%) in the derivation set, and 93 (37.2%) and 32 (29.1%) in external sets A and B, respectively. Among the ML models used, the eXtreme Gradient Boosting model demonstrated the best performance. It achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.790 (95% CI: 0.775&#x02013;0.806) on the internal test set and 0.791 (95% CI: 0.733&#x02013;0.848) and 0.873 (95% CI: 0.798&#x02013;0.948) on the two external test sets, respectively. The key features for predicting functional outcomes were the initial NIHSS, early neurologic deterioration (END), age, and white blood cell count. The END displayed noticeable interactions with several other features.</p></sec>
<sec>
<title>Conclusion</title>
<p>ML algorithms demonstrated proficient prediction for the 3-month functional outcome in AIS patients. With the aid of the SHAP method, we can attain an in-depth understanding of how critical features contribute to model predictions and how changes in these features influence such predictions.</p></sec></abstract>
<kwd-group>
<kwd>machine learning</kwd>
<kwd>stroke</kwd>
<kwd>cerebral infarction</kwd>
<kwd>explainable artificial intelligence</kwd>
<kwd>prediction</kwd>
<kwd>functional outcome</kwd>
<kwd>modified Rankin Scale</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="33"/>
<page-count count="10"/>
<word-count count="6293"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Stroke</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introductions</title>
<p>Physicians are concerned about the prognosis following a diagnosis of acute ischemic stroke (AIS) and must establish a long-term treatment plan (<xref ref-type="bibr" rid="B1">1</xref>). Traditional risk-scoring models have been used to predict clinical outcomes after AIS (<xref ref-type="bibr" rid="B2">2</xref>&#x02013;<xref ref-type="bibr" rid="B4">4</xref>), but these models face challenges when learning the input of complex multi-dimensional functions. The optimal weight for one input can easily change depending on other input values (<xref ref-type="bibr" rid="B5">5</xref>, <xref ref-type="bibr" rid="B6">6</xref>). On the other hand, machine learning (ML) algorithms have the advantage of being able to maximize information to enhance predictive accuracy while predicting complex clinical outcomes influenced by various situations and conditions.</p>
<p>However, the clinical application of ML models has been hindered by the interpretability challenge due to the black box problem (<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>). In medicine, the lack of interpretability can make it difficult for clinicians to trust the predictions of the model and incorporate them into their decision-making (<xref ref-type="bibr" rid="B9">9</xref>). The SHapley Additive exPlanations (SHAP) is an attribution method used to provide interpretation and an intuitive understanding of the contribution of each variable to the outcome (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>In this study, we aimed to develop and validate an ML-based model that predicts the functional outcome of patients with AIS. Furthermore, we aimed to provide interpretation and an intuitive understanding of the model using model-agnostic attribution methods.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Study design and source of data</title>
<p>This study is a prognostic accuracy cohort study utilizing prospectively collected multicenter hospital-based stroke registries. The objective of this study is to develop, validate, and interpret a machine learning system for predicting 3-month functional outcomes after stroke. We utilized a derivation dataset for model training and internal validation, while external datasets were used for external validation.</p>
<p>The dataset from the National Information Society Agency (NIA) was used as the derivation set (<ext-link ext-link-type="uri" xlink:href="https://aihub.or.kr/">https://aihub.or.kr/</ext-link>). This is a dataset prepared for developing an artificial intelligence model from a multicenter prospective stroke registry. It contains clinical data of patients diagnosed with AIS between January 2011 and March 2019 at the stroke centers of three university hospitals in the Republic of Korea. A diagnosis of AIS was confirmed when there was diffusion restriction on MRI. In the dataset of 6,000 patients, data for 1,717 patients for whom clinical data were unavailable were discarded. Of the 4,283 eligible patients, patients under the age of 18 years, pre-stroke modified Rankin scale (mRS) of three points or more, and who were not evaluated with follow-up mRS were excluded (<xref ref-type="bibr" rid="B11">11</xref>). Additional external sets were collected from a stroke center, where acute ischemic stroke patients who visited each center from September 2019 to August 2021 were screened. Ultimately, the derivation set was composed of datasets from two hospitals from the dataset of the NIA, and the remaining one dataset of the hospital along with an additional externally collected dataset became two separate external validation sets. We have selected 16 clinical features from the 31 common clinical features of the NIA dataset, based on both a feature importance algorithm and at the clinicians&#x00027; discretion (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S1</xref> for further details). These selected features include initial NIHSS, END, age, BUN, WBC, prothrombin time, previous stroke, serum glucose, hemoglobins, atrial fibrillation, hypertension, diabetes mellitus, BMI, onset-to-door time, EVT, and tPA.</p>
<p>This study followed the TRIPOD and CLAIM reporting guidelines (<xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B13">13</xref>). All data were anonymized using the de-identification method of the data provider (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S1</xref>).</p>
</sec>
<sec>
<title>Definition of functional outcome</title>
<p>A functional outcome was a binary label, defined as an unfavorable outcome if the mRS is 3 or more, and a favorable outcome if &#x0003C; 3 (<xref ref-type="bibr" rid="B11">11</xref>). The functional outcome was evaluated in-person by a neurology specialist when patients revisited the outpatient department 3 months post-ischemic stroke. For a minority of patients who could not attend the outpatient clinic, evaluations were conducted over the phone by a well-trained clinical nurse specialist.</p>
</sec>
<sec>
<title>Data preparation</title>
<p>We performed outlier detection and imputation for missing values &#x0003C; 10% on the clinical data of the entire dataset. Two experienced clinicians, KMP and JL, checked the distributions of the values of 16 common variables and looked for levels that were not clinically feasible. Outlier candidates were replaced with NAs by clinician consensus. We then conducted multivariate imputation chained equations (MICE) on both the NA-replaced outliers and the missing values of the original data (<xref ref-type="bibr" rid="B14">14</xref>). After these procedures, we normalized all the data using the MinMaxScaler.</p>
</sec>
<sec>
<title>Model development and evaluation</title>
<p>We implemented a supervised machine learning approach to develop a predictive model for the 3-month functional outcome of acute ischemic stroke. The modeling process engaged three tree-based machine learning algorithms: random forest, eXtreme Gradient Boosting (XGBoost), and light gradient-boosting machine (LGBM) (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table S2</xref>). These algorithms were chosen due to their proven efficacy in calculating interaction effects among features using TreeSHAP (<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>To validate the performance of the model, we performed not only internal validation but also external validation using two separate external datasets. Initially, we randomly allocated 20% of the entire dataset as an internal test set and assigned the remaining 80% of the data to a training set. We then used a 5-fold cross-validation method to generate each model. This procedure resulted in five versions of each machine learning model, with their performance being evaluated on the internal test set. The performance of each model was determined using the mean value of the five derived probabilities (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S2</xref>). Finally, we conducted validation using the developed models on the two external datasets. The primary performance metric we employed to assess the effectiveness of the model was the area under the receiver operating characteristic curve (AUC-ROC). All analyses were performed using Python 3.10.6 and the scikit-learn 1.0.2 library.</p>
</sec>
<sec>
<title>Interpretation of the developed models</title>
<p>We employed the SHAP method for interpreting the developed model. First, we calculated the SHAP values for each feature in the model that performed the best. These SHAP values were used to explain the contribution of each feature in the prediction of the model. By plotting the SHAP summary plot, we visualized the top 16 features that were most impactful in the output of the model. Second, we produced a SHAP dependency plot. This plot primarily displays two aspects. One, it shows how SHAP values change as the values of each feature change, essentially demonstrating the relationship between the features and the predictions. The other is it reveals the interactions between features. Through these two aspects, we were able to ascertain the contribution of each feature toward an unfavorable outcome and the interactions between the features.</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>Values are presented as the mean &#x000B1; standard deviation, median (interquartile range) for continuous variables, or as the number (%) of subjects for categorical variables, as appropriate. Comparisons of the characteristics between the two groups were performed by the chi-square test, Fisher&#x00027;s exact test, Wilcoxon Signed Rank test, or Cochran&#x02013;Armitage trend test according to the type of the variable. To evaluate the performance of the models for discriminating 3-month functional outcome, we plotted receiver operating characteristic (ROC) curves and calculated the AUC and 95% confidence interval (CI) for each model. Differences between AUCs were compared by DeLong&#x00027;s test (<xref ref-type="bibr" rid="B16">16</xref>). Additionally, recall (sensitivity), precision (positive predictive value), accuracy, and Brier score were calculated as secondary outcome metrics. We also calculated the sensitivity and specificity value for the threshold determined by Youden&#x00027;s index J (J = sensitivity &#x0002B; specificity-1) if necessary. Statistical analyses were performed using the R package version 4.1.2. The statistical significance was defined as a two-tailed <italic>P</italic>-value of &#x0003C; 0.05.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<p>A total of 3,687 patients were included in the derivation dataset, of which 1,123 (30.4%) had an unfavorable outcome. The external validation dataset had 360 patients: 250 in external dataset A and 110 in dataset B, with 93 (37.2%) and 32 (29.1%) unfavorable outcomes, respectively (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S3</xref>). All missing values for the derivation dataset common variables were within 10% (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S4</xref>). Hence, multivariate imputation-chained equations were applied to all variables (<xref ref-type="bibr" rid="B14">14</xref>). Baseline demographics and clinical characteristics according to the favorable and unfavorable outcomes of the dataset on which the imputation was performed are listed in <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S3</xref>.</p>
<sec>
<title>Model performances</title>
<p><xref ref-type="table" rid="T1">Table 1</xref> presents the performance estimates of the ML algorithm. Among the three tree-based ML algorithms, the performance was quite similar, but XGB demonstrated the highest performance. As a result, we chose XGB as the representative ML algorithm. The area under the receiver operating characteristic curve (AUROC) values for the internal test set and external validation sets A and B of the model were 0.790 (95% confidence interval [95% CI]: 0.775&#x02013;0.806), 0.791 (95% CI: 0.733&#x02013;0.848), and 0.873 (95% CI: 0.798&#x02013;0.948), respectively (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>). The Brier scores for the internal validation set and external validation sets A and B of the model were 0.172, 0.202, and 0.141, respectively. Model calibration was performed to assess the likelihood that a given new observation belongs to each of the known classes. The calibration slopes showed a minimal difference between the predicted and observed probability of unfavorable outcomes, indicating a good model fit (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S6</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Comparison of performance of the models predicting 3-month functional outcomes.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th/>
<th valign="top" align="center"><bold>RF</bold></th>
<th valign="top" align="center"><bold>LGBM</bold></th>
<th valign="top" align="center"><bold>XGB</bold></th>
</tr>
</thead>
<tbody>
<tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>Internal test set (</bold><italic><bold>n</bold> =</italic> <bold>737)</bold></td>
</tr> <tr>
<td valign="top" align="left">AUROC (95% CI)</td>
<td valign="top" align="center">0.789 (0.773&#x02013;0.804)</td>
<td valign="top" align="center">0.785 (0.769&#x02013;0.801)</td>
<td valign="top" align="center">0.790 (0.775&#x02013;0.806)</td>
</tr> <tr>
<td valign="top" align="left">Recall (95% CI)</td>
<td valign="top" align="center">0.695 (0.668&#x02013;0.722)</td>
<td valign="top" align="center">0.705 (0.679&#x02013;0.732)</td>
<td valign="top" align="center">0.756 (0.731&#x02013;0.781)</td>
</tr> <tr>
<td valign="top" align="left">Precision (95% CI)</td>
<td valign="top" align="center">0.553 (0.527&#x02013;0.579)</td>
<td valign="top" align="center">0.540 (0.514&#x02013;0.565)</td>
<td valign="top" align="center">0.525 (0.500&#x02013;0.549)</td>
</tr> <tr>
<td valign="top" align="left">Accuracy (95% CI)</td>
<td valign="top" align="center">0.736 (0.736&#x02013;0.736)</td>
<td valign="top" align="center">0.727 (0.727&#x02013;0.727)</td>
<td valign="top" align="center">0.717 (0.717&#x02013;0.717)</td>
</tr> <tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>External validation set A (</bold><italic><bold>n</bold> =</italic> <bold>250)</bold></td>
</tr> <tr>
<td valign="top" align="left">AUROC (95% CI)</td>
<td valign="top" align="center">0.794 (0.737&#x02013;0.852)</td>
<td valign="top" align="center">0.787 (0.729&#x02013;0.845)</td>
<td valign="top" align="center">0.791 (0.733&#x02013;0.848)</td>
</tr> <tr>
<td valign="top" align="left">Recall (95% CI)</td>
<td valign="top" align="center">0.677 (0.582&#x02013;0.772)</td>
<td valign="top" align="center">0.806 (0.726&#x02013;0.887)</td>
<td valign="top" align="center">0.656 (0.559&#x02013;0.752)</td>
</tr> <tr>
<td valign="top" align="left">Precision (95% CI)</td>
<td valign="top" align="center">0.677 (0.582&#x02013;0.772)</td>
<td valign="top" align="center">0.551 (0.468&#x02013;0.635)</td>
<td valign="top" align="center">0.663 (0.566&#x02013;0.760)</td>
</tr> <tr>
<td valign="top" align="left">Accuracy (95% CI)</td>
<td valign="top" align="center">0.760 (0.759&#x02013;0.761)</td>
<td valign="top" align="center">0.684 (0.682&#x02013;0.686)</td>
<td valign="top" align="center">0.748 (0.747&#x02013;0.749)</td>
</tr> <tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>External validation set B (</bold><italic><bold>n</bold> =</italic> <bold>110)</bold></td>
</tr> <tr>
<td valign="top" align="left">AUROC (95% CI)</td>
<td valign="top" align="center">0.860 (0.788&#x02013;0.940)</td>
<td valign="top" align="center">0.869 (0.795&#x02013;0.943)</td>
<td valign="top" align="center">0.873 (0.798&#x02013;0.948)</td>
</tr> <tr>
<td valign="top" align="left">Recall (95% CI)</td>
<td valign="top" align="center">0.875 (0.760&#x02013;0.990)</td>
<td valign="top" align="center">0.844 (0.718&#x02013;0.970)</td>
<td valign="top" align="center">0.875 (0.760&#x02013;0.990)</td>
</tr> <tr>
<td valign="top" align="left">Precision (95% CI)</td>
<td valign="top" align="center">0.609 (0.468&#x02013;0.750)</td>
<td valign="top" align="center">0.643 (0.498&#x02013;0.788)</td>
<td valign="top" align="center">0.667 (0.524&#x02013;0.809)</td>
</tr> <tr>
<td valign="top" align="left">Accuracy (95% CI)</td>
<td valign="top" align="center">0.800 (0.797&#x02013;0.803)</td>
<td valign="top" align="center">0.818 (0.816&#x02013;0.821)</td>
<td valign="top" align="center">0.836 (0.834&#x02013;0.839)</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>AUROC, area under the receiver operating characteristic curve; RF, random forest; LGBM, light gradient boosting model; XGB, eXtreme Gradient Boosting.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Identification of important features</title>
<p>The SHAP summary plot illustrated the influence of each feature on an unfavorable outcome (<xref ref-type="fig" rid="F1">Figure 1</xref>). In the descending order, the risk factors contributing most significantly to the prediction of an unfavorable outcome were initial NIHSS, early neurologic deterioration, age, WBC, BUN, hemoglobins, PT, use of tPA, and previous stroke. The initial NIHSS, early neurologic deterioration, and age provided notably larger contributions to the unfavorable outcome prediction compared to the other risk factors. The variables hemoglobins and the use of tPA showed a negative correlation with the incidence of an unfavorable outcome.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Feature importance of the gradient boosting model using SHAP. In this summary plot, the redder the value, the larger the value of the feature, and the bluer the value, the smaller the value. In addition, if the SHAP value increases in the positive direction based on 0, the probability of predicting positive increases, and if the SHAP value increases in the negative direction, the probability of predicting negative increases. The higher the initial NIHSS score, the higher the probability of predicting positive for unfavorable outcomes. However, when the hemoglobin level is high or tPA is used, the probability of predicting a negative outcome increases. We obtained information on the importance and contribution of each patient through SHAP. This helps in understanding how much each feature contributes to the model prediction and how that contribution changes. This information enhances the interpretability of the model and can provide insightful guidance for feature selection or model tuning. NIHSS, National Institutes of Health Stroke Scale; END, early neurologic deterioration; WBC, white blood cell; BUN, blood urea nitrogen; tPA, tissue plasminogen activator; BMI, body mass index; EVT, endovascular treatment; SHAP, SHapley Additive exPlanations.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-14-1234046-g0001.tif"/>
</fig>
<p>The SHAP dependency plot showed how the SHAP values change as each feature value changes (<xref ref-type="fig" rid="F2">Figure 2</xref>). Initial NIHSS, END, age, BUN, WBC, PT, previous stroke, hypertension, and diabetes mellitus indicated a positive correlation with the unfavorable outcome when the feature values were positive or increasing. On the contrary, hemoglobins, atrial fibrillation, and the use of tPA exhibited an opposite influence.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>SHAP dependency plots of features. In this graph, the x-axis represents the values of a specific feature, while the y-axis represents the SHAP values of that feature. Each point signifies an individual data point, and its position denotes the value of the feature and the impact that this value has on the prediction. For instance, if the SHAP value consistently increases as the feature value rises, it can be interpreted that the feature has a positive influence on the prediction of the model. Conversely, if the SHAP value decreases as the feature value increases, the feature can be understood to negatively affect the prediction. Furthermore, the SHAP dependency plot is useful for visualizing non-linear relationships. For example, if the plot exhibits a U or S shape, it indicates a non-linear relationship between the feature and the prediction.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-14-1234046-g0002.tif"/>
</fig>
<p>We identified certain features exhibiting non-linear relationships with the outcome. Age had a progressively greater positive influence as it increased, whereas the initial NIHSS score followed a sigmoid curve with a positive correlation. BUN, serum glucose, and BMI showed a U-shaped or V-shaped curve, indicating an increase in the contribution to an unfavorable outcome when the values deviated from what is known to be a normal range.</p>
</sec>
<sec>
<title>Identification of interaction between features</title>
<p>The SHAP dependency plot can depict the effect of a specific feature on the prediction by using different colors to represent the values of other features. By observing these color patterns, we can explore the interactions between two features that influence the outcome (<xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Interaction plots of early neurologic deterioration (END) with <bold>(A)</bold> Initial NIHSS, <bold>(B)</bold> Age, <bold>(C)</bold> White Blood Cell (WBC) count, and <bold>(D)</bold> Duration of symptom onset to admission. Interactions in SHAP dependency plots visually represent how the SHAP values of one feature change as the values of another feature change. The left column of the plot depicts both the main effects and the interaction effects, while the right column isolates and displays only the interaction effects. In the left column, each dot represents an individual data point, with its position indicating the value of a specific feature and the corresponding SHAP value reflecting its impact on the prediction. These dots are colored according to the values of another feature, illustrating how the relationship between a particular feature and the prediction depends on the values of another feature. If these patterns change depending on the color of the dots (i.e., the values of the other feature), it indicates an interaction between the two features. For example, if dots of a certain color tend to show an increase in SHAP values as the feature value increases, but dots of other colors show a stronger, weaker, or even opposite trend, and this suggests an interaction between the two features. The plot in the right column, drawn using only interaction values, emphasizes the interaction effects more clearly, which may not have been as evident in the left column plot. The larger the absolute value of the interaction (indicated on the y-axis), the stronger the interaction between the two features, indicating that these two features are more closely related to the prediction of the model.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-14-1234046-g0003.tif"/>
</fig>
<p>END exhibited clear interactions with several features. When the initial NIHSS was &#x0003C; 3, the absence of END had a negative association with the unfavorable outcome compared to its presence. However, for NIHSS scores of 3 or higher, the absence of END contributed more to an unfavorable outcome than its presence (<xref ref-type="fig" rid="F3">Figure 3A</xref>). In patients aged 70 years or above, the occurrence of END contributed less to an unfavorable outcome compared to those under 70 years (<xref ref-type="fig" rid="F3">Figure 3B</xref>). When WBC was outside the normal range (6,000&#x02013;9,000/mL), the presence of END showed a positive interaction with the unfavorable outcome (<xref ref-type="fig" rid="F3">Figure 3C</xref>). Patients who arrived within 12 h from symptom onset exhibited interactions with END (<xref ref-type="fig" rid="F3">Figure 3D</xref>). Additional examples of interaction effects between features, other than END, can be found in the <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S7</xref>.</p>
</sec>
<sec>
<title>Individual patient outcome interpretations</title>
<p>SHAP allows for not only global interpretations of a feature&#x00027;s contribution to the outcome across the entire dataset but also local interpretations for individual patients. As illustrated in <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S8</xref>, the patient was predicted to not have an unfavorable 3-month functional outcome. The history of a previous stroke, an age of 74 years, and a WBC count of 8,600/mL all contributed to an unfavorable outcome. However, a lower NIHSS score of 1, the absence of END, and a hemoglobin level of 13.9 had greater contributions to a more favorable outcome. These local interpretations allow us to comprehend why the model predicted a specific outcome for this particular patient.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this study, we demonstrated the application of machine learning (ML) algorithms to predict the functional outcomes following an acute ischemic stroke. Our model provided an explanation not only based on individual patient data but also utilized the entire dataset to visualize the contribution and directionality of specific input features through SHAP.</p>
<p>The features that contributed most significantly to our study were initial NIHSS, END, and age, which are well-known risk factors for stroke functional outcomes and have been consistently reported in previous studies (<xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B17">17</xref>&#x02013;<xref ref-type="bibr" rid="B20">20</xref>). In the case of a strong linear relationship between two variables where the outcome variable is binary, the predicted probabilities can manifest as a sigmoid pattern (<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B22">22</xref>). Interestingly, in our study, initial NIHSS showed a sigmoid curve relative to the 3-month binary functional outcome (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<p>In the prediction of unfavorable outcomes, END exhibited clear interactions with most variables. When the initial NIHSS was &#x0003C; 3 points, the absence of END indicated a negative association with unfavorable outcomes. However, in cases where NIHSS was 3 or higher, the absence of END contributed more to unfavorable outcomes. While this finding may seem counterintuitive, it can be understood when considering that the most common cause of END is a progressive stroke, such as lacunar infarction (<xref ref-type="bibr" rid="B23">23</xref>). The mechanism of this progression may well be related to the concept of branch atheromatous disease, suggesting that a hypoperfused area in the perforating arteriole region may deteriorate after admission, leading to the occurrence of END, particularly in patients who were admitted with lower NIHSS scores.</p>
<p>This notion aligns with the results from a previous cohort study conducted on patients with lacunar infarction, where END was associated with worse functional results at 90 days, and low NIHSS scores at admission along with low perfusion lesions predicted END (<xref ref-type="bibr" rid="B18">18</xref>). Our findings corroborate these results. Furthermore, the same cohort study suggested that patients who received IVT treatment demonstrated improved functional outcomes at 90 days, and a conclusion is also reflected in our study.</p>
<p>As shown in <xref ref-type="fig" rid="F3">Figure 3D</xref>, the presence of END within 12 h of admission demonstrated a positive interaction with unfavorable outcomes, whereas it exhibited a negative interaction in patients who arrived later. This discrepancy could be attributed to the fact that patients arriving at the hospital quickly are more likely to have END identified. However, for those arriving later, even if END has occurred, it may not be recorded as such in the data if the event happened outside the hospital, thus potentially explaining this observed interaction.</p>
<p>The other features, while not as widely recognized as risk factors for 3-month functional outcomes as the ones previously mentioned, showed consistency in our model interpretation with existing research results (<xref ref-type="fig" rid="F2">Figure 2</xref>). The <italic>post-hoc</italic> analysis of the Enhanced Control of Hypertension and Thrombolysis Stroke Study (ENCHANTED), which indicated that hyperglycemia and increased WBC count independently relate to poor functional outcomes (<xref ref-type="bibr" rid="B24">24</xref>), was reproduced in our study. Interestingly, lower WBC counts at admission were associated with better functional outcomes regardless of treatment (<xref ref-type="bibr" rid="B25">25</xref>), and the previously observed association of increased WBC with poor functional outcomes seemed to be reproduced in our results as a U-shaped impact pattern.</p>
<p>Consistent with previous studies suggesting a potential link between increased BUN and unfavorable outcomes (<xref ref-type="bibr" rid="B26">26</xref>), our study also found an association between increased BUN and unfavorable outcomes. BUN exhibited a V-shaped pattern, which seems to be a plausible non-linear result when combined with previous findings that a decrease in BUN is associated with better neurological improvement (<xref ref-type="bibr" rid="B27">27</xref>).</p>
<p>Our study found that a decrease in hemoglobin contributed significantly to unfavorable outcomes. Previous studies have also found significant associations between abnormal hemoglobin levels and poor outcomes (<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B29">29</xref>).</p>
<p>The duration of symptoms to hospital admission within 3 h tended to decrease the likelihood of unfavorable outcomes but showed a positive association thereafter. However, this influence slightly decreased as the duration increased. This likely reflects the impact of being able to use rtPA for those who arrived within 3 h.</p>
<p>In <xref ref-type="fig" rid="F1">Figure 1</xref>, EVT showed relatively low importance for the model, but the dependency plot revealed its association with a favorable outcome in patients with an initial NIHSS value of 6 or more (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S7c</xref>). This aligns with guidelines suggesting that performing EVT on patients with an initial NIHSS score of 6 or higher correlates with better prognoses (<xref ref-type="bibr" rid="B30">30</xref>). However, it is necessary to interpret with caution, considering that features with low contributions might have limited clinical significance for the model.</p>
<p>We looked at the interpretation between the classical generalized linear models and the interpretation of ML-based SHAP. Referring to the estimates from the multivariable logistic regression model based on the dataset used in this study, there was no considerable difference in the contributions of major features derived from the SHAP of the XGB model (<xref ref-type="fig" rid="F1">Figure 1</xref>, <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S9</xref>). The overall consistency in the results from both approaches in our dataset can be attributed to the following reasons. As shown in <xref ref-type="fig" rid="F2">Figure 2</xref>, variables considered crucial in this study (initial NIHSS, Age, and END) did not exhibit complex non-linear or multi-dimensional relationships. Datasets with such simple inter-variable relationships are likely to result in no substantial difference in predictive performance between traditional and ML models. Indeed, the AUC of the logistic regression model was not notably different from the XGB model. The AUC for the logistic regression was 0.792 (0.776&#x02013;0.807) for the internal test set, 0.795 (0.737&#x02013;0.853) for external validation set A, and 0.893 (0.829&#x02013;0.956) for set B. These AUC values from the logistic regression did not show a statistically significant difference when compared to the AUC of the XGB model (<italic>p</italic> = 0.793, 0.738, and 0.247 for the internal test set, external validation sets A and B, respectively). However, in datasets with more complex relationships and interactions between variables, the interpretation results and performance of these two models might differ. Ultimately, understanding the characteristics and structure of the data and choosing the appropriate model may be pivotal in clinical research and practical decision-making.</p>
<p>Nevertheless, interpretable methods such as SHAP provides added value. Through the use of SHAP, we can visually understand not only the intensity of the contribution of risk factors to outcomes but also the non-linear contribution of specific risk factors and the interactions between variables. These are insights that were not discernible in previous traditional statistical methods. Through such analysis, we gain a deep understanding of how key features contribute to model predictions and how changes in their values impact those predictions. For example, we can not only reproduce the results of previous studies but also decipher hidden implications within those results. Furthermore, we can identify lesser-known risk factors. In this way, interpretable methods can provide insights about characteristics observed in clinical settings, potentially improving model performance and facilitating its use in medical practice.</p>
<p>Our research has several distinct advantages over previous studies. Earlier research efforts particularly showed limitations in their generalizability. This was mainly due to the limited amount of data relative to the number of features and the absence of external validation (<xref ref-type="bibr" rid="B31">31</xref>&#x02013;<xref ref-type="bibr" rid="B33">33</xref>). However, our study overcame these limitations by utilizing a much broader sample and conducting both internal cross-validation and external validations.</p>
<p>Furthermore, while past research lacked clear methodologies or approaches to the interpretability of ML models, our research sought to address this gap by employing SHAP. Through this, we were able to delve deeper into the interactions and relationships among the variables used in our study. We believe that our research will significantly contribute to expanding the current knowledge on post-stroke management in clinical decision support systems powered by AI.</p>
<p>Our study has several limitations. First, due to the multicenter retrospective nature of the investigation, the performance results may be inadequate for determining the robustness of the ML model for clinical utility. Well-designed prospective cohort studies are required to provide clear evidence for clinical use.</p>
<p>Second, the performance of our model in some external datasets was superior to that in internal validation. There could be multiple explanations for this phenomenon: (1) If the training data contain many noises or outliers, the model may overfit by learning these noises. In this case, if the test data are cleaner or better reflect the general pattern, the performance on these datasets may be better; (2) The performance results could also vary depending on the evaluation metric used. Particularly, if the distribution of training data and test data differ, the same model can have varying performance depending on the dataset. In such a case, the test dataset could receive a higher score in a particular evaluation metric; and (3) If the model is robust, that is, if it can capture specific patterns or trends not present in the training data, it can enhance performance in the test data. This happens because the model can respond well to new patterns that were not observed in the training data. If the model is not overfit and is appropriately generalized, it can have a high predictive capability for new data.</p>
<p>Third, our study primarily employed a cross-sectional research design, focusing on predicting short-term outcomes. While this offers immediate insights, it captures data at a single point in time, potentially limiting our ability to track changes over time. Additionally, this approach might not provide a comprehensive perspective on how these outcomes might evolve or persist in the long run. A longitudinal study on the long-term outcomes of stroke should be undertaken as it could shed light on the temporal patterns and variations, granting deeper insights into the cause-and-effect relationships.</p>
<p>Fourth, while SHAP helps infer the importance of features for a given model, there may be inherent problems in the SHAP inferences if the model is poorly developed or trained. However, we developed our model using a large dataset and also carried out external validation.</p>
<p>In conclusion, machine learning algorithms, specifically the tree-based model, can be used to predict the 3-month functional outcome in patients with acute ischemic stroke. Through the utilization of the SHAP method, we can gain an in-depth understanding of how critical features contribute to model predictions and how their changes influence these predictions. Future work should focus on refining the model, exploring additional predictive features, and validating these findings in prospective cohort studies.</p>
</sec>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The data analyzed in this study is subject to the following licenses/restrictions: access to the data from the National Information Society Agency (<ext-link ext-link-type="uri" xlink:href="https://aihub.or.kr/">https://aihub.or.kr/</ext-link>) has been granted, but the data is only available for review by domestic residents who have received approval. Requests to access these datasets should be directed to SP, <email>risepsh&#x00040;gmail.com</email>.</p>
</sec>
<sec sec-type="ethics-statement" id="s6">
<title>Ethics statement</title>
<p>The studies involving humans were approved by Institutional Review Board (IRB No. 2021-09-025-006) of Haeundae Paik Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The Ethics Committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants&#x00027; legal guardians/next of kin because the design of the study was retrospective.</p>
</sec>
<sec sec-type="author-contributions" id="s7">
<title>Author contributions</title>
<p>JL: data curation and manuscript editing. KMP: data curation. SP: study design and concept, data curation and analysis, and manuscript editing. All authors contributed to this article and approved the final manuscript.</p>
</sec>
</body>
<back>
<ack><p>This study was supported by the 2021 Inje University research grant.</p>
</ack>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="s9">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fneur.2023.1234046/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fneur.2023.1234046/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hung</surname> <given-names>M-C</given-names></name> <name><surname>Hsieh</surname> <given-names>C-L</given-names></name> <name><surname>Hwang</surname> <given-names>J-S</given-names></name> <name><surname>Jeng</surname> <given-names>J-S</given-names></name> <name><surname>Wang</surname> <given-names>J-D</given-names></name></person-group>. <article-title>Estimation of the long-term care needs of stroke patients by integrating functional disability and survival</article-title>. <source>PLoS ONE.</source> (<year>2013</year>) <volume>8</volume>:<fpage>e75605</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0075605</pub-id><pub-id pub-id-type="pmid">24124500</pub-id></citation></ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ntaios</surname> <given-names>G</given-names></name> <name><surname>Faouzi</surname> <given-names>M</given-names></name> <name><surname>Ferrari</surname> <given-names>J</given-names></name> <name><surname>Lang</surname> <given-names>W</given-names></name> <name><surname>Vemmos</surname> <given-names>K</given-names></name> <name><surname>Michel</surname> <given-names>P</given-names></name></person-group>. <article-title>An integer-based score to predict functional outcome in acute ischemic stroke: the ASTRAL score</article-title>. <source>Neurology.</source> (<year>2012</year>) <volume>78</volume>:<fpage>1916</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0b013e318259e221</pub-id><pub-id pub-id-type="pmid">23212191</pub-id></citation></ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hallevi</surname> <given-names>H</given-names></name> <name><surname>Barreto</surname> <given-names>AD</given-names></name> <name><surname>Liebeskind</surname> <given-names>DS</given-names></name> <name><surname>Morales</surname> <given-names>MM</given-names></name> <name><surname>Martin-Schild</surname> <given-names>SB</given-names></name> <name><surname>Abraham</surname> <given-names>AT</given-names></name> <etal/></person-group>. <article-title>Identifying patients at high risk for poor outcome after intra-arterial therapy for acute ischemic stroke</article-title>. <source>Stroke.</source> (<year>2009</year>) <volume>40</volume>:<fpage>1780</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.108.535146</pub-id><pub-id pub-id-type="pmid">19359652</pub-id></citation></ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flint</surname> <given-names>AC</given-names></name> <name><surname>Cullen</surname> <given-names>S</given-names></name> <name><surname>Faigeles</surname> <given-names>B</given-names></name> <name><surname>Rao</surname> <given-names>V</given-names></name></person-group>. <article-title>Predicting long-term outcome after endovascular stroke treatment: the totaled health risks in vascular events score</article-title>. <source>Am J Neuroradiol.</source> (<year>2010</year>) <volume>31</volume>:<fpage>1192</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.3174/ajnr.A2050</pub-id><pub-id pub-id-type="pmid">20223889</pub-id></citation></ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heitman</surname> <given-names>A</given-names></name> <name><surname>Brackbill</surname> <given-names>N</given-names></name> <name><surname>Greschner</surname> <given-names>M</given-names></name> <name><surname>Sher</surname> <given-names>A</given-names></name> <name><surname>Litke</surname> <given-names>AM</given-names></name> <name><surname>Chichilnisky</surname> <given-names>E</given-names></name></person-group>. <article-title>Testing pseudo-linear models of responses to natural scenes in primate retina</article-title>. <source>BioRxiv 2016:045336</source>. <pub-id pub-id-type="doi">10.1101/045336</pub-id></citation>
</ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benjamin</surname> <given-names>AS</given-names></name> <name><surname>Fernandes</surname> <given-names>HL</given-names></name> <name><surname>Tomlinson</surname> <given-names>T</given-names></name> <name><surname>Ramkumar</surname> <given-names>P</given-names></name> <name><surname>VerSteeg</surname> <given-names>C</given-names></name> <name><surname>Chowdhury</surname> <given-names>RH</given-names></name> <etal/></person-group>. <article-title>Modern machine learning as a benchmark for fitting neural responses</article-title>. <source>Front Comput Neurosci.</source> (<year>2018</year>) <volume>12</volume>:<fpage>56</fpage>. <pub-id pub-id-type="doi">10.3389/fncom.2018.00056</pub-id><pub-id pub-id-type="pmid">30072887</pub-id></citation></ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>E-J</given-names></name> <name><surname>Kim</surname> <given-names>Y-H</given-names></name> <name><surname>Kim</surname> <given-names>N</given-names></name> <name><surname>Kang</surname> <given-names>D-W</given-names></name></person-group>. <article-title>Deep into the brain: artificial intelligence in stroke imaging</article-title>. <source>J Stroke.</source> (<year>2017</year>) <volume>19</volume>:<fpage>277</fpage>. <pub-id pub-id-type="doi">10.5853/jos.2017.02054</pub-id><pub-id pub-id-type="pmid">29037014</pub-id></citation></ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michael</surname> <given-names>KY</given-names></name> <name><surname>Ma</surname> <given-names>J</given-names></name> <name><surname>Fisher</surname> <given-names>J</given-names></name> <name><surname>Kreisberg</surname> <given-names>JF</given-names></name> <name><surname>Raphael</surname> <given-names>BJ</given-names></name> <name><surname>Ideker</surname> <given-names>T</given-names></name></person-group>. <article-title>Visible machine learning for biomedicine</article-title>. <source>Cell.</source> (<year>2018</year>) <volume>173</volume>:<fpage>1562</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.05.056</pub-id><pub-id pub-id-type="pmid">29906441</pub-id></citation></ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gunning</surname> <given-names>D</given-names></name> <name><surname>Stefik</surname> <given-names>M</given-names></name> <name><surname>Choi</surname> <given-names>J</given-names></name> <name><surname>Miller</surname> <given-names>T</given-names></name> <name><surname>Stumpf</surname> <given-names>S</given-names></name> <name><surname>Yang</surname> <given-names>G-Z</given-names></name></person-group>. <article-title>XAI&#x02014;Explainable artificial intelligence</article-title>. <source>Sci Robot.</source> (<year>2019</year>) <volume>4</volume>:<fpage>eaay7120</fpage>. <pub-id pub-id-type="doi">10.1126/scirobotics.aay7120</pub-id><pub-id pub-id-type="pmid">33137719</pub-id></citation></ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ribeiro</surname> <given-names>MT</given-names></name> <name><surname>Singh</surname> <given-names>S</given-names></name> <name><surname>Guestrin</surname> <given-names>C</given-names></name></person-group>. <article-title>&#x0201C;Why should i trust you?</article-title>&#x0201D; Explaining the predictions of any classifier. In <italic>Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining</italic>. (<year>2016</year>). p. <fpage>1135</fpage>&#x02013;<lpage>1144</lpage>. <pub-id pub-id-type="doi">10.1145/2939672.2939778</pub-id></citation>
</ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banks</surname> <given-names>JL</given-names></name> <name><surname>Marotta</surname> <given-names>CA</given-names></name></person-group>. <article-title>Outcomes validity and reliability of the modified Rankin scale: implications for stroke clinical trials: a literature review and synthesis</article-title>. <source>Stroke.</source> (<year>2007</year>) <volume>38</volume>:<fpage>1091</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.0000258355.23810.c6</pub-id><pub-id pub-id-type="pmid">17272767</pub-id></citation></ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mongan</surname> <given-names>J</given-names></name> <name><surname>Moy</surname> <given-names>L</given-names></name> <name><surname>Kahn</surname> <given-names>CE</given-names></name></person-group>. <article-title>Checklist for artificial intelligence in medical imaging (CLAIM): a guide for authors and reviewers</article-title>. <source>Radiol Artif Intell.</source> (<year>2020</year>) <volume>2</volume>:<fpage>e200029</fpage>. <pub-id pub-id-type="doi">10.1148/ryai.2020200029</pub-id><pub-id pub-id-type="pmid">33937821</pub-id></citation></ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collins</surname> <given-names>GS</given-names></name> <name><surname>Reitsma</surname> <given-names>JB</given-names></name> <name><surname>Altman</surname> <given-names>DG</given-names></name> <name><surname>Moons</surname> <given-names>KG</given-names></name></person-group>. <article-title>Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement</article-title>. <source>J Br Surg.</source> (<year>2015</year>) <volume>102</volume>:<fpage>148</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1002/bjs.9736</pub-id><pub-id pub-id-type="pmid">25627261</pub-id></citation></ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Buuren</surname> <given-names>S</given-names></name> <name><surname>Groothuis-Oudshoorn</surname> <given-names>K</given-names></name></person-group>. <article-title>mice: Multivariate imputation by chained equations in R</article-title>. <source>J Stat Softw.</source> (<year>2011</year>) <volume>45</volume>:<fpage>1</fpage>&#x02013;<lpage>67</lpage>. <pub-id pub-id-type="doi">10.18637/jss.v045.i03</pub-id></citation>
</ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lundberg</surname> <given-names>SM</given-names></name> <name><surname>Erion</surname> <given-names>GG</given-names></name> <name><surname>Lee</surname> <given-names>S-I</given-names></name></person-group>. <article-title>Consistent individualized feature attribution for tree ensembles</article-title>. <source>arXiv preprint arXiv:180203888</source> (<year>2018</year>).</citation>
</ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeLong</surname> <given-names>ER</given-names></name> <name><surname>DeLong</surname> <given-names>DM</given-names></name> <name><surname>Clarke-Pearson</surname> <given-names>DL</given-names></name></person-group>. <article-title>Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach</article-title>. <source>Biometrics</source>. (<year>1988</year>) <volume>44</volume>:<fpage>837</fpage>&#x02013;<lpage>845</lpage>. <pub-id pub-id-type="doi">10.2307/2531595</pub-id><pub-id pub-id-type="pmid">3203132</pub-id></citation></ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thijs</surname> <given-names>VN</given-names></name> <name><surname>Lansberg</surname> <given-names>MG</given-names></name> <name><surname>Beaulieu</surname> <given-names>C</given-names></name> <name><surname>Marks</surname> <given-names>MP</given-names></name> <name><surname>Moseley</surname> <given-names>ME</given-names></name> <name><surname>Albers</surname> <given-names>GW</given-names></name></person-group>. <article-title>Is early ischemic lesion volume on diffusion-weighted imaging an independent predictor of stroke outcome? A multivariable analysis</article-title>. <source>Stroke.</source> (<year>2000</year>) <volume>31</volume>:<fpage>2597</fpage>&#x02013;<lpage>602</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.31.11.2597</pub-id><pub-id pub-id-type="pmid">11062281</pub-id></citation></ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vynckier</surname> <given-names>J</given-names></name> <name><surname>Maamari</surname> <given-names>B</given-names></name> <name><surname>Grunder</surname> <given-names>L</given-names></name> <name><surname>Goeldlin</surname> <given-names>MB</given-names></name> <name><surname>Meinel</surname> <given-names>TR</given-names></name> <name><surname>Kaesmacher</surname> <given-names>J</given-names></name> <etal/></person-group>. <article-title>Early neurologic deterioration in lacunar stroke: clinical and imaging predictors and association with long-term outcome</article-title>. <source>Neurology.</source> (<year>2021</year>) <volume>97</volume>:<fpage>e1437</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0000000000012661</pub-id><pub-id pub-id-type="pmid">34400585</pub-id></citation></ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Geng</surname> <given-names>H-H</given-names></name> <name><surname>Wang</surname> <given-names>Q</given-names></name> <name><surname>Li</surname> <given-names>B</given-names></name> <name><surname>Cui</surname> <given-names>B-B</given-names></name> <name><surname>Jin</surname> <given-names>Y-P</given-names></name> <name><surname>Fu</surname> <given-names>R-L</given-names></name> <etal/></person-group>. <article-title>Early neurological deterioration during the acute phase as a predictor of long-term outcome after first-ever ischemic stroke</article-title>. <source>Medicine</source>. (<year>2017</year>) <volume>96</volume>:<fpage>e9068</fpage>. <pub-id pub-id-type="doi">10.1097/MD.0000000000009068</pub-id><pub-id pub-id-type="pmid">29390435</pub-id></citation></ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ligot</surname> <given-names>N</given-names></name> <name><surname>Elands</surname> <given-names>S</given-names></name> <name><surname>Damien</surname> <given-names>C</given-names></name> <name><surname>Jodaitis</surname> <given-names>L</given-names></name> <name><surname>Meibodi</surname> <given-names>NS</given-names></name> <name><surname>Mine</surname> <given-names>B</given-names></name> <etal/></person-group>. <article-title>Stroke Core Volume weighs more than recanalization time for predicting outcome in large vessel occlusion recanalized within 6 h of symptoms onset</article-title>. <source>Front Neurol.</source> (<year>2022</year>) <volume>13</volume>:<fpage>838192</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2022.838192</pub-id><pub-id pub-id-type="pmid">35265032</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pilato</surname> <given-names>F</given-names></name> <name><surname>Silva</surname> <given-names>S</given-names></name> <name><surname>Valente</surname> <given-names>I</given-names></name> <name><surname>Distefano</surname> <given-names>M</given-names></name> <name><surname>Broccolini</surname> <given-names>A</given-names></name> <name><surname>Brunetti</surname> <given-names>V</given-names></name> <etal/></person-group>. <article-title>Predicting factors of functional outcome in patients with acute ischemic stroke admitted to neuro-intensive care unit&#x02014;A prospective cohort study</article-title>. <source>Brain Sci.</source> (<year>2020</year>) <volume>10</volume>:<fpage>911</fpage>. <pub-id pub-id-type="doi">10.3390/brainsci10120911</pub-id><pub-id pub-id-type="pmid">33256264</pub-id></citation></ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hosmer</surname> <given-names>DW</given-names></name> <name><surname>Lemeshow</surname> <given-names>S</given-names></name> <name><surname>Sturdivant</surname> <given-names>RX</given-names></name></person-group>. <source>Applied Logistic Regression.</source> <publisher-loc>London</publisher-loc>: <publisher-name>John Wiley &#x00026; Sons</publisher-name>, (<year>2013</year>). <pub-id pub-id-type="doi">10.1002/9781118548387</pub-id></citation>
</ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siegler</surname> <given-names>JE</given-names></name> <name><surname>Samai</surname> <given-names>A</given-names></name> <name><surname>Semmes</surname> <given-names>E</given-names></name> <name><surname>Martin-Schild</surname> <given-names>S</given-names></name></person-group>. <article-title>Early neurologic deterioration after stroke depends on vascular territory and stroke etiology</article-title>. <source>J Stroke.</source> (<year>2016</year>) <volume>18</volume>:<fpage>203</fpage>. <pub-id pub-id-type="doi">10.5853/jos.2016.00073</pub-id><pub-id pub-id-type="pmid">27283280</pub-id></citation></ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xia</surname> <given-names>C</given-names></name> <name><surname>Wang</surname> <given-names>X</given-names></name> <name><surname>Lindley</surname> <given-names>RI</given-names></name> <name><surname>Delcourt</surname> <given-names>C</given-names></name> <name><surname>Zhou</surname> <given-names>Z</given-names></name> <name><surname>Chen</surname> <given-names>X</given-names></name> <etal/></person-group>. <article-title>Combined utility of blood glucose and white blood cell in predicting outcome after acute ischemic stroke: The ENCHANTED trial</article-title>. <source>Clin Neurol Neurosurg.</source> (<year>2020</year>) <volume>198</volume>:<fpage>106254</fpage>. <pub-id pub-id-type="doi">10.1016/j.clineuro.2020.106254</pub-id><pub-id pub-id-type="pmid">33011482</pub-id></citation></ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barow</surname> <given-names>E</given-names></name> <name><surname>Quandt</surname> <given-names>F</given-names></name> <name><surname>Cheng</surname> <given-names>B</given-names></name> <name><surname>Gelderblom</surname> <given-names>M</given-names></name> <name><surname>Jensen</surname> <given-names>M</given-names></name> <name><surname>K&#x000F6;nigsberg</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Association of white blood cell count with clinical outcome independent of treatment with alteplase in acute ischemic stroke</article-title>. <source>Front Neurol.</source> (<year>2022</year>) <volume>13</volume>:<fpage>877367</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2022.877367</pub-id><pub-id pub-id-type="pmid">35769368</pub-id></citation></ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schrock</surname> <given-names>JW</given-names></name> <name><surname>Glasenapp</surname> <given-names>M</given-names></name> <name><surname>Drogell</surname> <given-names>K</given-names></name></person-group>. <article-title>Elevated blood urea nitrogen/creatinine ratio is associated with poor outcome in patients with ischemic stroke</article-title>. <source>Clin Neurol Neurosurg.</source> (<year>2012</year>) <volume>114</volume>:<fpage>881</fpage>&#x02013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1016/j.clineuro.2012.01.031</pub-id><pub-id pub-id-type="pmid">22333035</pub-id></citation></ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>W-F</given-names></name> <name><surname>Deng</surname> <given-names>M-L</given-names></name></person-group>. <article-title>Prognostic impact of blood urea nitrogen/creatinine ratio changes in patients with acute ischemic stroke</article-title>. <source>Clin Neurol Neurosurg.</source> (<year>2022</year>) <volume>215</volume>:<fpage>107204</fpage>. <pub-id pub-id-type="doi">10.1016/j.clineuro.2022.107204</pub-id><pub-id pub-id-type="pmid">35299071</pub-id></citation></ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abe</surname> <given-names>A</given-names></name> <name><surname>Sakamoto</surname> <given-names>Y</given-names></name> <name><surname>Nishiyama</surname> <given-names>Y</given-names></name> <name><surname>Suda</surname> <given-names>S</given-names></name> <name><surname>Suzuki</surname> <given-names>K</given-names></name> <name><surname>Aoki</surname> <given-names>J</given-names></name> <etal/></person-group>. <article-title>Decline in hemoglobin during hospitalization may be associated with poor outcome in acute stroke patients</article-title>. <source>J Stroke Cerebrov Dis.</source> (<year>2018</year>) <volume>27</volume>:<fpage>1646</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2018.01.026</pub-id><pub-id pub-id-type="pmid">29478940</pub-id></citation></ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>R</given-names></name> <name><surname>Xu</surname> <given-names>Q</given-names></name> <name><surname>Wang</surname> <given-names>A</given-names></name> <name><surname>Jiang</surname> <given-names>Y</given-names></name> <name><surname>Meng</surname> <given-names>X</given-names></name> <name><surname>Zhou</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Hemoglobin concentration and clinical outcomes after acute ischemic stroke or transient ischemic attack</article-title>. <source>J Am Heart Assoc.</source> (<year>2021</year>) <volume>10</volume>:<fpage>e022547</fpage>. <pub-id pub-id-type="doi">10.1161/JAHA.121.022547</pub-id><pub-id pub-id-type="pmid">34845923</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Powers</surname> <given-names>WJ</given-names></name> <name><surname>Rabinstein</surname> <given-names>AA</given-names></name> <name><surname>Ackerson</surname> <given-names>T</given-names></name> <name><surname>Adeoye</surname> <given-names>OM</given-names></name> <name><surname>Bambakidis</surname> <given-names>NC</given-names></name> <name><surname>Becker</surname> <given-names>K</given-names></name> <etal/></person-group>. <article-title>Guidelines for the early management of patients with acute ischemic stroke: 2019 update to the 2018 guidelines for the early management of acute ischemic stroke: a guideline for healthcare professionals from the American Heart Association/American Stroke Association</article-title>. <source>Stroke.</source> (<year>2019</year>) <volume>50</volume>:<fpage>e344</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1161/STR.0000000000000211</pub-id><pub-id pub-id-type="pmid">31765293</pub-id></citation></ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heo</surname> <given-names>J</given-names></name> <name><surname>Yoon</surname> <given-names>JG</given-names></name> <name><surname>Park</surname> <given-names>H</given-names></name> <name><surname>Kim</surname> <given-names>YD</given-names></name> <name><surname>Nam</surname> <given-names>HS</given-names></name> <name><surname>Heo</surname> <given-names>JH</given-names></name></person-group>. <article-title>Machine learning&#x02013;based model for prediction of outcomes in acute stroke</article-title>. <source>Stroke.</source> (<year>2019</year>) <volume>50</volume>:<fpage>1263</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.118.024293</pub-id><pub-id pub-id-type="pmid">30890116</pub-id></citation></ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bacchi</surname> <given-names>S</given-names></name> <name><surname>Zerner</surname> <given-names>T</given-names></name> <name><surname>Oakden-Rayner</surname> <given-names>L</given-names></name> <name><surname>Kleinig</surname> <given-names>T</given-names></name> <name><surname>Patel</surname> <given-names>S</given-names></name> <name><surname>Jannes</surname> <given-names>J</given-names></name></person-group>. <article-title>Deep learning in the prediction of ischaemic stroke thrombolysis functional outcomes: a pilot study</article-title>. <source>Acad Radiol.</source> (<year>2020</year>) <volume>27</volume>:<fpage>e19</fpage>&#x02013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1016/j.acra.2019.03.015</pub-id><pub-id pub-id-type="pmid">31053480</pub-id></citation></ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinto</surname> <given-names>A</given-names></name> <name><surname>Mckinley</surname> <given-names>R</given-names></name> <name><surname>Alves</surname> <given-names>V</given-names></name> <name><surname>Wiest</surname> <given-names>R</given-names></name> <name><surname>Silva</surname> <given-names>CA</given-names></name> <name><surname>Reyes</surname> <given-names>M</given-names></name></person-group>. <article-title>Stroke lesion outcome prediction based on MRI imaging combined with clinical information</article-title>. <source>Front Neurol.</source> (<year>2018</year>) <volume>9</volume>:<fpage>1060</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2018.01060</pub-id><pub-id pub-id-type="pmid">30568631</pub-id></citation></ref>
</ref-list> 
</back>
</article> 