<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurol.</journal-id>
<journal-title>Frontiers in Neurology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurol.</abbrev-journal-title>
<issn pub-type="epub">1664-2295</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fneur.2025.1603536</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neurology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lehmann</surname> <given-names>Sebastian</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/3021716/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/project-administration/"/>
<role content-type="https://credit.niso.org/contributor-roles/software/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Vychopen</surname> <given-names>Martin</given-names></name>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>G&#x000FC;resir</surname> <given-names>Erdem</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1124154/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wach</surname> <given-names>Johannes</given-names></name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff><institution>Department of Neurosurgery, University Hospital Leipzig</institution>, <addr-line>Leipzig</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Marc Hohenhaus, University of Freiburg Medical Center, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jacek Szczygielski, University of Rzeszow, Poland</p>
<p>Kersten Villringer, Charit&#x000E9; University Medicine Berlin, Germany</p>
<p>Luisa Mona Kraus, Technical University of Munich, Germany</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Sebastian Lehmann <email>Sebastian.lehmann&#x00040;medizin.uni-leipzig.de</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>07</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>16</volume>
<elocation-id>1603536</elocation-id>
<history>
<date date-type="received">
<day>31</day>
<month>03</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>06</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2025 Lehmann, Vychopen, G&#x000FC;resir and Wach.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Lehmann, Vychopen, G&#x000FC;resir and Wach</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<sec>
<title>Introduction</title>
<p>An analysis of the prognostic ability of the large language model (LLM) Generative Pre-trained Transformer (GPT) to predict short-term survival and functional outcomes in patients with malignant middle cerebral artery (MCA) infarction following decompressive hemicraniectomy.</p></sec>
<sec>
<title>Methods</title>
<p>This retrospective study included 100 patients with malignant MCA infarction who underwent decompressive craniectomy (DC). GPT-4 and GPT-4 Omni were used to predict patient outcomes based on 20 patient-specific factors. Each version of GPT was tested with and without context enrichment (CE). CE versions were provided with the current AHA/ASA 2019 guidelines and meta-analyses of RCTs to inform decision-making. The real-life outcome of the patients, measured by the modified Rankin Scale (mRS), served as a reference. The following endpoints were evaluated: survival during inpatient stay, achievement of a functional status of mRS 0&#x02013;4 at discharge, and at 3-, 6-, and 12-months post-discharge. We analyzed the prognostic prediction of GPT by calculating the area under the curve (AUC) and determining the optimal cutoff using the Youden index for divergent prediction outcomes. After dichotomization according to the cutoff set, a chi-squared test (two-sided) was performed.</p></sec>
<sec>
<title>Results</title>
<p>GPT-4 and GPT-4 Omni demonstrated the ability to estimate survival during in-hospital stay. In both versions, the CE GPT outperformed the non-CE versions. GPT-4 Omni (CE) achieved an AUC of 0.67 (95% CI: 0.54&#x02013;0.79; <italic>p</italic> = 0.002), while GPT-4 (CE) reached an AUC of 0.70 (95% CI: 0.57&#x02013;0.82; <italic>p</italic> = 0.018). GPT-4 also achieved statistical significance even without CE (AUC of 0.66; 95% CI: 0.53&#x02013;0.78; <italic>p</italic> = 0.018). In contrast, the non-CE version of GPT-4 Omni did not reach significance in predicting the survival of hospitalization (AUC of 0.60; 95% CI: 0.48&#x02013;0.73; <italic>p</italic> = 0.07). For questions regarding the functional outcome of patients, neither version of GPT was able to make a sufficient prognostic prediction. However, when provided with the pre-stroke mRS, GPT-4 Omni was able to predict the mRS at discharge (<italic>p</italic> = 0.01; Pearson&#x00027;s correlation coefficient = 0.696).</p></sec>
<sec>
<title>Conclusion</title>
<p>The study shows the already existing high potential of AI in predicting short-term outcomes. It also shows the existing limitations for the evaluation of more complex questions, such as functional outcomes.</p></sec></abstract>
<kwd-group>
<kwd>decompressive hemicraniectomy</kwd>
<kwd>middle cerebral artery infarction</kwd>
<kwd>artificial intelligence</kwd>
<kwd>GPT</kwd>
<kwd>survival</kwd>
<kwd>functional outcome</kwd>
<kwd>prediction</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="30"/>
<page-count count="7"/>
<word-count count="5082"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Neuro-Oncology and Neurosurgical Oncology</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>The use of artificial intelligence (AI) is becoming increasingly important for medical use. Regarding prognostic abilities, there are already publications suggesting that AI-based image morphological recognition of stroke extent has potential comparable to that of an experienced neuroanatomist (<xref ref-type="bibr" rid="B1">1</xref>). In the significantly more complex detection of acute common and severe diseases based on clinical data, Levine et al. (<xref ref-type="bibr" rid="B1">1</xref>) demonstrated that Chat GPT 3 outperformed non-medically trained individuals, but not physicians. Bentley et al. (<xref ref-type="bibr" rid="B2">2</xref>) used machine-learning-based image recognition software to predict hemorrhagic transformation after intravenous thrombolysis in ischemic stroke. Using supervised machine learning algorithms, another research group was able to predict the outcome of patients with ischemic stroke after intra-arterial therapy with an accuracy of &#x0007E;70% (<xref ref-type="bibr" rid="B3">3</xref>). Despite these promising results, GPT has not yet been shown to predict the 6-month outcome after traumatic brain injury due to insufficient specificity (<xref ref-type="bibr" rid="B4">4</xref>). However, promising results have been reported regarding GPT&#x00027;s potential for outcome prediction in aneurysmal subarachnoid hemorrhage (<xref ref-type="bibr" rid="B5">5</xref>).</p>
<p>MCA infarction is a severe condition with high mortality and a major impact on the patient&#x00027;s quality of life in cases of survival. In malignant MCA Infarction, decompressive hemicraniectomy is the ultima ratio for preserving the patient&#x00027;s life (<xref ref-type="bibr" rid="B6">6</xref>). To date, the prediction of the prognosis for these patients remains extremely difficult.</p>
<p>The functionality of modern AI is based on complex digital neural networks, which are created based on real data and are capable of processing complex tasks using deep learning techniques (<xref ref-type="bibr" rid="B7">7</xref>). One of the most advanced AI-based applications currently available for public use is GPT. GPT is a language model that was developed and trained by the company OpenAI to generate answers that are as human-like as possible (<xref ref-type="bibr" rid="B8">8</xref>). GPT processes data and relates them to each other within a network and creates a so-called &#x0201C;transformer architecture&#x0201D; to enable precise categorization within the respective context (<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B9">9</xref>).</p>
<p>The present study is the first to investigate GPT&#x00027;s current ability to process complex real-life patient data into prognostic estimation of the patient&#x00027;s clinical outcome.</p></sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<p>This retrospective analysis investigated the capability of deriving a prognosis using a single data modality input from patient data. Data were collected from patients admitted to the hospital with malignant MCA infarction who underwent emergency decompressive hemicraniectomy. To further enhance prognostic assessment, the study investigated whether providing context for decision-making can contribute to improving predictive accuracy. Data from 100 patients who underwent decompressive hemicraniectomy for MCA infarction at Leipzig University Hospital between 2016 and 2023 were assessed. To provide the large language model (LLM) with comprehensive input, patient-specific parameters (age, gender, previous cardiac diseases, intake of blood-thinning medication, laboratory parameters such as leukocytes, platelets, CRP, and preoperative pTT), disease-specific parameters [infarct size, hemorrhagic transformation, pupil status, mRS, and Glasgow Coma Scale (GCS)], and therapy-specific parameters (volume and diameter of the decompression and hemoglobin levels before and after the procedure) were sampled and provided to the AI anonymously. These parameters have largely already been associated with the prognosis after decompressive hemicraniectomy in previous studies (<xref ref-type="bibr" rid="B10">10</xref>&#x02013;<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>The infarct volume was calculated using the Brainlab Suite&#x00027;s volumetric function (Brainlab, Feldkirchen, Germany) (<xref ref-type="bibr" rid="B16">16</xref>). To depict the extent of the decompressive hemicraniectomy, both the AP diameter usually given in the literature and the surface area of the decompressed area were specified according to the formula As = &#x003C0;[(d/2)<sup>2</sup> &#x0002B; h<sup>2</sup>] (<xref ref-type="bibr" rid="B17">17</xref>).</p>
<p>The neurological outcome of the patients was assessed using the modified Rankin Scale (mRS) (<xref ref-type="bibr" rid="B18">18</xref>). In accordance with the existing prospective randomized studies&#x02014;DECIMAL (<xref ref-type="bibr" rid="B19">19</xref>), HAMLET (<xref ref-type="bibr" rid="B20">20</xref>), DESTINY (<xref ref-type="bibr" rid="B21">21</xref>), and DESTINY II (<xref ref-type="bibr" rid="B22">22</xref>)&#x02014;the mRS was also included in the analysis at the time of discharge, and after 3 months, 6 months, and 1 year.</p>
<p>The data were provided to ChatGPT in a standardized chat prompt. For our investigation, we utilized two versions of GPT: GPT-4, released in March 2023, and the advanced version, GPT-4 Omni, released in May 2024 (<xref ref-type="bibr" rid="B23">23</xref>). A total of five questions were formulated, each of which had to be answered with a yes/no response. Each chat prompt was offered to both versions of GPT with and without context-enrichment (CE) to provide the LLM a defined base for reasoning. As CE, we chose the current 2019 ASA/AHA guideline (<xref ref-type="bibr" rid="B6">6</xref>) as well as a meta-analysis of the prospective randomized studies (<xref ref-type="bibr" rid="B24">24</xref>) in patients under 60 years of age, and the prospective randomized study DESTINY II (<xref ref-type="bibr" rid="B22">22</xref>) in patients over 60 years of age. Each question was asked a total of 3 times to consider divergent answers. The mean of the given answers was documented. The answers were scored as follows: three times &#x0201C;no&#x0201D; (score: 0), two times &#x0201C;no&#x0201D; (score: 0.33), one time &#x0201C;no&#x0201D; (score: 0.66), and three times &#x0201C;yes&#x0201D; (score: 1.00). GPT was asked to evaluate the survival during the in-hospital stay, as well as the functional outcome at discharge, 3, 6, and 12 months in a yes or no answer. Favorable (mRS 0&#x02013;4) and non-favorable outcomes (mRS 5&#x02013;6) were dichotomized as defined in the prospective randomized studies (<xref ref-type="bibr" rid="B24">24</xref>). An exemplary chat prompt is provided in <xref ref-type="supplementary-material" rid="SM1">Supplementary Figures 1</xref>, <xref ref-type="supplementary-material" rid="SM1">2</xref>.</p>
<p>Data were entered into an anonymized database, and this database was analyzed with SPSS (IBM Corp., Released 2023. IBM SPSS Statistics for Windows, Version 29.0.2.0, Armonk, NY: IBM Corp). First, we performed a the descriptive analysis of our cohort (<xref ref-type="table" rid="T1">Table 1</xref>). GPT&#x00027;s answers were subjected to a receiver operating characteristic analysis (ROC) to determine the area under the curve (AUC), sensitivity, and specificity stated with the 95% confidence interval (CI) (<xref ref-type="fig" rid="F1">Figure 1</xref>). The Youden index was calculated to define the optimal cutoff in the case of divergent answers (<xref ref-type="table" rid="T2">Table 2</xref>). After dichotomization according to the determined cutoff, GPT&#x00027;s answers were tested for significance using a chi-squared test.</p>






<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Comparison of the Leipzig cohort with the cohorts of the randomized studies DECIMAL, DESTINY I/II, and HAMLET.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#727779;color:#ffffff">
<th valign="top" align="left"><bold>Cohort</bold></th>
<th valign="top" align="center"><bold>Age</bold></th>
<th valign="top" align="center"><bold>Male</bold></th>
<th valign="top" align="center"><bold>Death at one year</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Study group Leipzig</td>
<td valign="top" align="center">59.3</td>
<td valign="top" align="center">68%</td>
<td valign="top" align="center">39.8%</td>
</tr> <tr>
<td valign="top" align="left">Subgroup &#x02264; 60 years</td>
<td valign="top" align="center">52</td>
<td valign="top" align="center">72.4%</td>
<td valign="top" align="center">27.1%</td>
</tr> <tr>
<td valign="top" align="left">DECIMAL (surgery group)</td>
<td valign="top" align="center">43.5</td>
<td valign="top" align="center">45%</td>
<td valign="top" align="center">20.0%</td>
</tr> <tr>
<td valign="top" align="left">DESTINY (surgery group)</td>
<td valign="top" align="center">43.7</td>
<td valign="top" align="center">47%</td>
<td valign="top" align="center">17.6%</td>
</tr> <tr>
<td valign="top" align="left">HAMLET (surgery group)</td>
<td valign="top" align="center">50</td>
<td valign="top" align="center">63%</td>
<td valign="top" align="center">22%</td>
</tr> <tr>
<td valign="top" align="left">Subgroup &#x02265;61 years</td>
<td valign="top" align="center">68.3</td>
<td valign="top" align="center">59.6%</td>
<td valign="top" align="center">55%</td>
</tr> <tr>
<td valign="top" align="left">DESTINY II (surgery group)</td>
<td valign="top" align="center">70</td>
<td valign="top" align="center">51%</td>
<td valign="top" align="center">43%</td>
</tr></tbody>
</table>
</table-wrap>

<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>ROC analysis of the response variability of the multiple responses for survival at discharge. The line indicates the highest Youden index.</p></caption>
<alt-text>Receiver Operating Characteristic (ROC) curve comparing sensitivity vs. 1-specificity for four models: GPT 4.0 Omni, GPT 4.0 Omni (CE), GPT 4.0, and GPT 4.0 (CE). Each model is represented by a different colored line.</alt-text>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-16-1603536-g0001.tif"/>
</fig>



<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Results of the ROC analyses of divergent answers for survival at discharge (Question 1) for the different versions of GPT, shown with 95% CI and asymptotic significance level and the optimal cutoff for dichotmisation determined by the highest Youden Index highlighted in bold.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#727779;color:#ffffff">
<th valign="top" align="left"><bold>Question 1</bold></th>
<th valign="top" align="center"><bold>AUC</bold></th>
<th valign="top" align="center"><bold>Significance</bold></th>
<th valign="top" align="center"><bold>95% CI upper limit</bold></th>
<th valign="top" align="center"><bold>95% CI bottom limit</bold></th>
<th valign="top" align="center"><bold>Highest youden index</bold></th>
<th valign="top" align="center"><bold>Determined cutoff</bold></th>
<th valign="top" align="center"><bold>Sensitivity at cutoff</bold></th>
<th valign="top" align="center"><bold>Specificity at cutoff</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">GPT-4 Omni</td>
<td valign="top" align="center">0.60</td>
<td valign="top" align="center">0.120</td>
<td valign="top" align="center">0.48</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.20</td>
<td valign="top" align="center"><bold>0.17</bold></td>
<td valign="top" align="center">0.39</td>
<td valign="top" align="center">0.8080</td>
</tr> <tr>
<td valign="top" align="left">GPT-4 Omni (CE)</td>
<td valign="top" align="center">0.67</td>
<td valign="top" align="center">0.010</td>
<td valign="top" align="center">0.54</td>
<td valign="top" align="center">0.80</td>
<td valign="top" align="center">0.37</td>
<td valign="top" align="center"><bold>0.32</bold></td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.65</td>
</tr> <tr>
<td valign="top" align="left">GPT-4</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.016</td>
<td valign="top" align="center">0.53</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">0.31</td>
<td valign="top" align="center"><bold>0.83</bold></td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.58</td>
</tr> <tr>
<td valign="top" align="left">GPT-4 (CE)</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.821</td>
<td valign="top" align="center">0.411</td>
<td valign="top" align="center"><bold>0.83</bold></td>
<td valign="top" align="center">0.76</td>
<td valign="top" align="center">0.65</td>
</tr></tbody>
</table>
</table-wrap>




<p>In an additional prompt, the pre-stroke mRS was included to refine the mode (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 4</xref>). GPT-4 could not be included in the following analysis as it had been replaced by OpenAI with a more recent version. GPT-4 Omni was asked to predict the mRS at the time of discharge. A delta (&#x00394;) between the pre-stroke mRS and the mRS at the time of discharge was calculated for GPT&#x00027;s estimation, as well as the real mRS (<xref ref-type="fig" rid="F2">Figure 2</xref>). Subsequently, the &#x00394;mRS was assessed by Pearson&#x00027;s correlation coefficient.</p>


<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Scatterplot for GPT-4 Omni&#x00027;s responses to the question about mRS at discharge, given pre-stroke mRS vs. real outcomes.</p></caption>
<alt-text>Scatter plot showing a positive correlation between variables X and Y, with yellow data points. A red trend line indicates a Pearson correlation coefficient of 0.70 and a p-value of 0.01. A light red shaded area represents confidence intervals.</alt-text>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-16-1603536-g0002.tif"/>
</fig>

</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Patient characteristics</title>
<p>In our patient cohort, 68% were male, with a median age of 59 years. The median GCS score prior to surgery was 10. According to the parameters in the HAMLET, DESTINY, DESTINY II, and DECIMAL studies, the cohort was divided into patients aged over 61 years and those aged &#x02264; 60 years. In the younger cohort, the median age of onset was 53 years, and 72.4% were male (<xref ref-type="bibr" rid="B19">19</xref>&#x02013;<xref ref-type="bibr" rid="B22">22</xref>, <xref ref-type="bibr" rid="B24">24</xref>). Among patients 61 years or older, the median age was 68 years, with 59.6% male patients. The 1-year mortality rate across all ages was 39.8%. Of these, the cohort of &#x0003E;61-year-olds accounted for the largest proportion, with 55% of patients dying after 1 year (<xref ref-type="table" rid="T1">Table 1</xref>).</p></sec>
<sec>
<title>GPT&#x00027;s performance in the estimation of survival</title>
<p>During the 3-fold presentation of each individual patient to GPT, it was shown that GPT could show divergent answers to the same question, regardless of the version used. For the analysis of survival estimation at the time of discharge, the rate of divergent answers varied from 18 to 30%, with GPT-4.0 showing less divergence than GPT-4Omni [GPT-4.0 18%, GPT-4.0 (CE) 20%, GPT-4 Omni 24%, and GPT-4 Omni (CE) 30%].</p>
<p>In the ROC-analysis (<xref ref-type="fig" rid="F1">Figure 1</xref>) to determine the optimal cutoff for a positive answer in cases of divergent answers regarding survival at discharge, the highest Youden index was achieved with &#x02265;2 positive answers for GPT-4 (&#x0003E;0.66), and &#x02265;1 positive answer for GPT-4 Omni (&#x0003E;0.33). The AUC values of the LLMs ranged from 0.60 to 0.70, with the CE versions outperforming the non-CE-GPT versions in the overall analysis (AUC: GPT-4 Omni non-CE = 0.60, GPT-4 Omni CE = 0.67; GPT-4 non-CE = 0.66, GPT-4 CE = 0.70). In the subgroup analysis, GPT-4 showed weaker results in patients &#x02265;61 years, where GPT-4 Omni outperformed both CE and non-CE versions. GPT-4 CE performed the worst (AUC: GPT-4. Omni non-CE = 0.61, GPT-4 Omni CE=0.68; GPT-4 non-CE = 0.61, GPT-4 CE = 0.59). Significant diagnostic correlations between survival at time of discharge and the estimations of GPT-4 Omni (CE) (<italic>p</italic> = 0.01, 95% CI 0.54&#x02013;0.79), GPT-4 (CE) (<italic>p</italic> = 0.002, 95% CI 0.57&#x02013;0.82), and non-CE (<italic>p</italic> = 0.016, 95% CI 0.53&#x02013;0.78) were observed (<xref ref-type="table" rid="T2">Table 2</xref>). The answers were dichotomized according to the cutoff set by the Youden index (<xref ref-type="table" rid="T2">Table 2</xref>). According to the highest Youden index calculated based on the ROC curve analysis, the cutoff for GPT-4 was set at &#x02265;1/3 positive answers, and for GPT-4 Omni at 2/3 positive answers. Subsequently, GPT&#x00027;s prognoses were compared to real outcomes using cross-tabulation. In the chi-squared test for survival during in-hospital stay (Question 1), GPT significantly predicted patient survival with GPT-4 Omni (CE) (<italic>p</italic> = 0.002), GPT-4 (CE) (<italic>p</italic> = 0.018), and non-CE (<italic>p</italic> = 0.018). GPT4 Omni non-CE narrowly missed statistical significance (<italic>p</italic> = 0.07) and showed considerably reduced sensitivity (<xref ref-type="table" rid="T3">Table 3</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Cross-table depiction of GPT&#x00027;s answers compared to real outcome after cutoff-based dichotomization; The first value describes the prognosis by GPT, the second value represents the real outcome; Chi-squared test and <italic>p-value</italic> for survival (mRS &#x0003C;6) at discharge A: GPT-4 Omni, B: GPT-4 Omni with context enrichment (CE), C: GPT-4, D: GPT-4 with context enrichment (CE).</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#727779;color:#ffffff">
<th valign="top" align="left"><bold>GPT-4 Omni</bold> <break/><bold>cutoff 1/3 answers</bold></th>
<th valign="top" align="center"><bold>GPT survival at discharge</bold></th>
<th valign="top" align="center"><bold>GPT no survival discharge</bold></th>
<th valign="top" align="center"><bold>p-value</bold></th>
</tr>
</thead>
<tbody>
<tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>A</bold></td>
</tr> <tr>
<td valign="top" align="left">mRS 6</td>
<td valign="top" align="center">21/26 (81%)</td>
<td valign="top" align="center">5/26 (19%)</td>
<td valign="top" align="center">0.07</td>
</tr> <tr>
<td valign="top" align="left">mRS 0&#x02013;5</td>
<td valign="top" align="center">45/74 (61%)</td>
<td valign="top" align="center">29/74 (39%)</td>
<td/>
</tr> <tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">66/100</td>
<td valign="top" align="center">34/100</td>
<td/>
</tr> <tr>
<td valign="top" align="left"><bold>GPT-4 Omni (CE) cutoff 1/3 answers</bold></td>
<td valign="top" align="center"><bold>GPT survival at discharge</bold></td>
<td valign="top" align="center"><bold>GPT no survival discharge</bold></td>
<td valign="top" align="center"><bold>p-value</bold></td>
</tr> <tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>B</bold></td>
</tr> <tr>
<td valign="top" align="left">mRS 6</td>
<td valign="top" align="center">16/26 (62%)</td>
<td valign="top" align="center">10/26 (38%)</td>
<td valign="top" align="center">0.002</td>
</tr> <tr>
<td valign="top" align="left">mRS 0&#x02013;5</td>
<td valign="top" align="center">20/74 (27%)</td>
<td valign="top" align="center">54/74 (73%)</td>
<td/>
</tr> <tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">36/100</td>
<td valign="top" align="center">64/100</td>
<td/>
</tr> <tr>
<td valign="top" align="left"><bold>GPT-4 cutoff 2/3 answers</bold></td>
<td valign="top" align="center"><bold>GPT survival at discharge</bold></td>
<td valign="top" align="center"><bold>GPT no survival discharge</bold></td>
<td valign="top" align="center"><bold>p-value</bold></td>
</tr> <tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>C</bold></td>
</tr>
<tr>
<td valign="top" align="left">mRS 6</td>
<td valign="top" align="center">11/26 (42%)</td>
<td valign="top" align="center">15/26 (58%)</td>
<td valign="top" align="center">0.018</td>
</tr> <tr>
<td valign="top" align="left">mRS 0&#x02013;5</td>
<td valign="top" align="center">14/74 (19%)</td>
<td valign="top" align="center">60/74 (81%)</td>
<td/>
</tr> <tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">36/100</td>
<td valign="top" align="center">64/100</td>
<td/>
</tr> <tr>
<td valign="top" align="left"><bold>GPT-4 (CE) cutoff 2/3 answers</bold></td>
<td valign="top" align="center"><bold>GPT survival at discharge</bold></td>
<td valign="top" align="center"><bold>GPT no survival discharge</bold></td>
<td valign="top" align="center"><bold>p-value</bold></td>
</tr> <tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>D</bold></td>
</tr> <tr>
<td valign="top" align="left">mRS 6</td>
<td valign="top" align="center">10/26 (38%)</td>
<td valign="top" align="center">16/26 (62%)</td>
<td valign="top" align="center">0.018</td>
</tr> <tr>
<td valign="top" align="left">mRS 0&#x02013;5</td>
<td valign="top" align="center">12/74 (16%)</td>
<td valign="top" align="center">62/74 (84%)</td>
<td/>
</tr> <tr>
<td valign="top" align="left">Total</td>
<td valign="top" align="center">22/100</td>
<td valign="top" align="center">78/100</td>
<td/>
</tr></tbody>
</table>
</table-wrap>


<p>In the subgroup analyses regarding the prognosis for survival at discharge in groups &#x02265;61-year-old patients and &#x0003C;61-year-old patients, GPT-4 Omni (CE) achieved significance for both groups (&#x02265;61 years, <italic>p</italic> = 0.014; &#x0003C; 61 years, <italic>p</italic> = 0.034). For the other models, only GPT-4 reached significance (<italic>p</italic> = 0.036) in &#x02265; 61-year-olds (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 3</xref>, <xref ref-type="supplementary-material" rid="SM1">Supplementary Tables 1</xref>&#x02013;<xref ref-type="supplementary-material" rid="SM1">3</xref>).</p></sec>
<sec>
<title>GPT&#x00027;s performance in the estimation of functional outcomes</title>
<p>For the questions on the functional outcome (Questions 2&#x02013;5), GPT provided almost exclusively negative answers (87%&#x02212;100%). Resulting from ROC curve analysis and Youden index calculation, the cutoff was set to 2/3 positive answers for GPT-4 and 3/3 positive answers for GPT-4 Omni (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 5</xref>, <xref ref-type="supplementary-material" rid="SM1">Supplementary Table 4</xref>). There was no significance for any of the questions across all tested GPT versions with and without CE, with only minimal differences between the versions and questions (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 5</xref>).</p>
<p>The prompt including the pre-stroke mRS, provided to GPT-4 Omni, resulted in usable mRS estimations at the time of discharge by the LLM. Pearson&#x00027;s correlation coefficient showed a significant correlation (<italic>p</italic> = 0.01) with a strong to very strong positive correlation (Pearson&#x00027;s correlation coefficient: 0.696, <xref ref-type="fig" rid="F2">Figure 2</xref>, <xref ref-type="table" rid="T4">Table 4</xref>).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Pearson&#x00027;s correlation coefficient of mRS estimation given the pre-stroke mRS for GPT-4 Omni.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#727779;color:#ffffff">
<th valign="top" align="left" colspan="2"></th>
<th valign="top" align="center"><bold>Delta_mRS(GPT-4 Omni)</bold></th>
<th valign="top" align="center"><bold>Delta_mRSReality</bold></th>
</tr>
</thead>
<tbody>
<tr style="background-color:#dee1e1">
<td valign="top" align="left" colspan="4"><bold>Correlation</bold></td>
</tr> <tr>
<td valign="top" align="left">Delta_mRS (GPT-4 Omni)</td>
<td valign="top" align="center">Pearson&#x00027;s correlation coefficient</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.696</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">Sig. (two-sided</td>
<td/>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td/>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr> <tr>
<td valign="top" align="left">Delta_mRSReal</td>
<td valign="top" align="center">Pearson&#x00027;s correlation coefficient</td>
<td valign="top" align="center">0.696</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">Sig. (two-sided)</td>
<td valign="top" align="center">&#x0003C;0.001</td>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">100</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>The calculation was based on the delta of the mRS from the time points of pre-stroke to discharge.</p>
</table-wrap-foot>
</table-wrap>

</sec></sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Our study shows that GPT can estimate prognosis for patients with malignant infarcts of the middle cerebral artery who have undergone decompressive hemicraniectomy based on patient profiles.</p>
<p>After dividing the patients into cohorts of over and under 60 years of age, analogous to the inclusion criteria of the randomized studies [HAMLET (<xref ref-type="bibr" rid="B20">20</xref>), DESTINY/II (<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B22">22</xref>), and DECIMAL (<xref ref-type="bibr" rid="B19">19</xref>)], our study group included a higher percentage of male patients. Additionally, patients in our cohort were older at the time of the event. The 1-year mortality rate exceeded the mortality rate stated in the randomized studies, especially in the group of patients over 60 years of age. The differences in mean patient age, gender, and long-term survival are likely due to a complex combination of factors in the population groups, healthcare systems, and possibly due to selection bias in the respective study designs.</p>
<p>Despite the differences in the patient cohort, the CE versions GPT4.0 and GPT-4 Omni are able to predict the patient&#x00027;s survival with robust accuracy. Interestingly, the earlier version GPT-4 reaches a higher AUC than GPT-4 Omni. GPT-4 Omni, in turn, achieves the highest statistical significance in the chi-squared analysis after Youden index-based dichotomization of multiple answers. Non-CE GPT versions only reach insufficient AUCs. The present results suggest that CE GPTs may be more capable of estimating survival outcomes.</p>
<p>In the subgroup analysis, significance was achieved for &#x02265;61-year-old patients and &#x0003C;61-year-old patients by GPT-4 Omni (CE), but only for &#x02265;61-year-olds in the model GPT-4. The significance in GPT-4 Omni (CE) is consistent with the main analysis and indicates that the results are valid regardless of the investigated age groups, underlining the advantage of CE. However, the implications of the results from the more recently developed GPT-4 remain unclear, though they may reflect progress in source-based reasoning abilities seen in GPT-4 Omni.</p>
<p>In addition to the question of survival, the LLM was also asked to predict functional outcomes as measured by the mRS. GPT was unable to provide sufficient answers, regardless of the version used. In a further series of tests, GPT4 Omni (CE) was provided with the pre-stroke mRS as a baseline functional status for each individual patient. Here, the functional outcome could also be predicted with a significant correlation by the GPT. The results suggest that a baseline might be vital for the LLM&#x00027;s reasoning process when making predictive estimations of manageable complexity. This additional input supports the theory that initial insufficient answers may be seen as the expression of hallucinations. This &#x0201C;data hallucination&#x0201D; can occur when an AI is working on a topic on which it has not been explicitly trained. As a result, fictitious answers may be generated without a founded basis for reasoning (<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>). Another aspect is that GPT seems not to be able to adequately include the concept and perception of time into the calculations, adding another layer of complexity to the question of time-dependent functional recovery (<xref ref-type="bibr" rid="B27">27</xref>).</p>
<p>To understand the limitations of AI, its basic functioning must first be understood. In Order to calculate the propability of the next correct word, each word is related to the previous one and to each other. This highly complex calculation approach is beyond human control and monitoring, making it impossible to understand the rationale behind a calculation. This bears the danger of arbitrary surrogate parameters being used for calculation (<xref ref-type="bibr" rid="B25">25</xref>).</p>
<p>AI is fundamentally limited by the data on which it is trained. An existing bias (ethnic group, patient selection, infrastructural characteristics, etc.) is continued by the AI and can produce a result that does not correspond to existing reality. The differences in the patient population in the existing studies and our collective alone, therefore, inevitably lead to inaccuracies. It is all the more remarkable that, despite these differences, a robust association with CE-GPT&#x00027;s prediction of short-term outcome was achieved.</p>
<p>Another aspect that must always be considered when using AI-based systems is that of ethics. When we weigh up a prognostic decision as treating doctors, we include hard and soft data and factors in our decision-making. Similarly, especially soft factors such as contextual or environmental conditions will not be represented in AI evaluations. The extent to which AI is involved in this decision-making process is a very delicate question and must always remain the subject of controversial debate.</p>
<p>Additionally, access to AI as a source of medical information is not limited to medically trained professionals. Non-medical users have equal access to the AI tool through interfaces such as chatbots. Unlike medical professionals, however, they lack the ability to critically assess, contextualize, and interpret the AI&#x00027;s response. Going forward, great emphasis should be placed on guiding non-medically trained persons to prevent harm by misinterpretation or false conclusions. There are approaches to implement AI-based, machine learning driven prediction models (<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B29">29</xref>). However, such models are less prone to hallucinations due to their targeted use of validated parameters, yet have not been integrated into LLMs.</p></sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>At the present time, the AI-based language model GPT, in versions GPT-4 and GPT-4 Omni, is able to predict the short-term outcome of patients with decompressive hemicraniectomy after malignant MCA infarction with a significant degree of certainty based on freely available data. However, the question of time-dependent functional outcome appears more complex and does not yield any meaningful results, with a high risk of producing data hallucinations. Future studies should focus on two specific objectives: first, identifying ways to further improve GPT&#x00027;s prognostic abilities; second, understanding AI decision paths to decipher the black box of decision-making before implementing AI-based decision-making in practical healthcare.</p></sec>
</body>
<back>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="ethics-statement" id="s7">
<title>Ethics statement</title>
<p>The studies involving humans were approved by the Medizinische Fakult&#x000E4;t Ethik Kommission (Approval number: 038/25-ek). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants&#x00027; legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.</p>
</sec>
<sec sec-type="author-contributions" id="s8">
<title>Author contributions</title>
<p>SL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing. MV: Investigation, Writing &#x02013; review &#x00026; editing. EG: Supervision, Writing &#x02013; review &#x00026; editing. JW: Conceptualization, Investigation, Supervision, Visualization, Writing &#x02013; review &#x00026; editing.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>The author(s) declare that no financial support was received for the research and/or publication of this article.</p>
</sec>
<ack><p>The illustration in the visual abstract was generated by GPT-4 Omni (<xref ref-type="bibr" rid="B30">30</xref>), edited by Sebastian Lehmann.</p>
</ack>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s10">
<title>Generative AI statement</title>
<p>The author(s) declare that Gen AI was used in the creation of this manuscript. The large language model GPT was used to generate OPS-Codes that were analyzed and compared to human coders. Additionally, GPT 4.Omni was used to generate the Immage included in the visual abstract. Despite the stated tascs, no AI was used in creation of the manuscript itself.</p></sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="s12">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fneur.2025.1603536/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fneur.2025.1603536/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Supplementary_file_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_1.jpeg" mimetype="image/jpeg" xmlns:xlink="http://www.w3.org/1999/xlink"/></sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levine</surname> <given-names>DM</given-names></name> <name><surname>Tuwani</surname> <given-names>R</given-names></name> <name><surname>Kompa</surname> <given-names>B</given-names></name> <name><surname>Varma</surname> <given-names>A</given-names></name> <name><surname>Finlayson</surname> <given-names>SG</given-names></name> <name><surname>Mehrotra</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>The diagnostic and triage accuracy of the GPT-3 artificial intelligence model</article-title>. <source>medRxiv.</source> (<year>2023</year>) <volume>1</volume>:<fpage>2023</fpage>.01.30.23285067. <pub-id pub-id-type="doi">10.1101/2023.01.30.23285067</pub-id><pub-id pub-id-type="pmid">36778449</pub-id></citation></ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bentley</surname> <given-names>P</given-names></name> <name><surname>Ganesalingam</surname> <given-names>J</given-names></name> <name><surname>Carlton Jones</surname> <given-names>AL</given-names></name> <name><surname>Mahady</surname> <given-names>K</given-names></name> <name><surname>Epton</surname> <given-names>S</given-names></name> <name><surname>Rinne</surname> <given-names>P</given-names></name> <etal/></person-group>. <article-title>Prediction of stroke thrombolysis outcome using CT brain machine learning</article-title>. <source>Neuroimage Clin.</source> (<year>2014</year>) <volume>4</volume>:<fpage>635</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1016/j.nicl.2014.02.003</pub-id><pub-id pub-id-type="pmid">24936414</pub-id></citation></ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Asadi</surname> <given-names>H</given-names></name> <name><surname>Dowling</surname> <given-names>R</given-names></name> <name><surname>Yan</surname> <given-names>B</given-names></name> <name><surname>Mitchell</surname> <given-names>P</given-names></name></person-group>. <article-title>Machine learning for outcome prediction of acute ischemic stroke post intra-arterial therapy</article-title>. <source>PLoS ONE.</source> (<year>2014</year>) <volume>9</volume>:<fpage>e88225</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0088225</pub-id><pub-id pub-id-type="pmid">24520356</pub-id></citation></ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gakuba</surname> <given-names>C</given-names></name> <name><surname>Le Barbey</surname> <given-names>C</given-names></name> <name><surname>Sar</surname> <given-names>A</given-names></name> <name><surname>Bonnet</surname> <given-names>G</given-names></name> <name><surname>Cerasuolo</surname> <given-names>D</given-names></name> <name><surname>Giabicani</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Evaluation of ChatGPT in predicting 6-month outcomes after traumatic brain injury</article-title>. <source>Crit Care Med.</source> (<year>2024</year>) <volume>52</volume>:<fpage>942</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1097/CCM.0000000000006236</pub-id><pub-id pub-id-type="pmid">38445975</pub-id></citation></ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Basaran</surname> <given-names>AE</given-names></name> <name><surname>G&#x000FC;resir</surname> <given-names>A</given-names></name> <name><surname>Knoch</surname> <given-names>H</given-names></name> <name><surname>Vychopen</surname> <given-names>M</given-names></name> <name><surname>G&#x000FC;resir</surname> <given-names>E</given-names></name> <name><surname>Wach</surname> <given-names>J</given-names></name></person-group>. <article-title>Beyond traditional prognostics: integrating RAG-enhanced AtlasGPT and ChatGPT 4.0 into aneurysmal subarachnoid hemorrhage outcome prediction</article-title>. <source>Neurosurg Rev.</source> (<year>2024</year>) <volume>48</volume>:<fpage>40</fpage>. <pub-id pub-id-type="doi">10.1007/s10143-025-03194-w</pub-id><pub-id pub-id-type="pmid">39794551</pub-id></citation></ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Warner</surname> <given-names>JJ</given-names></name> <name><surname>Harrington</surname> <given-names>RA</given-names></name> <name><surname>Sacco</surname> <given-names>RL</given-names></name> <name><surname>Elkind</surname> <given-names>MSV</given-names></name></person-group>. <article-title>Guidelines for the early management of patients with acute ischemic stroke: 2019 update to the 2018 guidelines for the early management of acute ischemic stroke</article-title>. <source>Stroke.</source> (<year>2019</year>) <volume>50</volume>:<fpage>3331</fpage>&#x02013;<lpage>2</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.119.027708</pub-id><pub-id pub-id-type="pmid">31662117</pub-id></citation></ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidhuber</surname> <given-names>J</given-names></name></person-group>. <article-title>Deep learning in neural networks: an overview</article-title>. <source>Neural Netw.</source> (<year>2015</year>) <volume>61</volume>:<fpage>85</fpage>&#x02013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2014.09.003</pub-id><pub-id pub-id-type="pmid">25462637</pub-id></citation></ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhattacharya</surname> <given-names>K</given-names></name> <name><surname>Bhattacharya</surname> <given-names>AS</given-names></name> <name><surname>Bhattacharya</surname> <given-names>N</given-names></name> <name><surname>Yagnik</surname> <given-names>VD</given-names></name> <name><surname>Garg</surname> <given-names>P</given-names></name> <name><surname>Kumar</surname> <given-names>S</given-names></name></person-group>. <article-title>ChatGPT in surgical practice&#x02014;a new kid on the block</article-title>. <source>Indian J Surg.</source> (<year>2023</year>) <volume>85</volume>:<fpage>1346</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/s12262-023-03727-x</pub-id></citation>
</ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xue</surname> <given-names>VW</given-names></name> <name><surname>Lei</surname> <given-names>P</given-names></name> <name><surname>Cho</surname> <given-names>WC</given-names></name></person-group>. <article-title>The potential impact of ChatGPT in clinical and translational medicine</article-title>. <source>Clin Transl Med.</source> (<year>2023</year>) <volume>13</volume>:<fpage>e1216</fpage>. <pub-id pub-id-type="doi">10.1002/ctm2.1216</pub-id><pub-id pub-id-type="pmid">36856370</pub-id></citation></ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>S</given-names></name> <name><surname>Schnippering</surname> <given-names>H</given-names></name> <name><surname>Aschoff</surname> <given-names>A</given-names></name> <name><surname>Koziol</surname> <given-names>JA</given-names></name> <name><surname>Schwab</surname> <given-names>S</given-names></name> <name><surname>Steiner</surname> <given-names>T</given-names></name></person-group>. <article-title>Suboptimum hemicraniectomy as a cause of additional cerebral lesions in patients with malignant infarction of the middle cerebral artery</article-title>. <source>J Neurosurg.</source> (<year>2001</year>) <volume>94</volume>:<fpage>693</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.3171/jns.2001.94.5.0693</pub-id><pub-id pub-id-type="pmid">11354398</pub-id></citation></ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bian</surname> <given-names>J</given-names></name> <name><surname>Guo</surname> <given-names>S</given-names></name> <name><surname>Huang</surname> <given-names>T</given-names></name> <etal/></person-group>. <article-title>CRP as a potential predictor of outcome in acute ischemic stroke</article-title>. <source>Biomed Rep.</source> (<year>2023</year>) <volume>18</volume>:<fpage>17</fpage>. <pub-id pub-id-type="doi">10.3892/br.2023.1599</pub-id><pub-id pub-id-type="pmid">36776580</pub-id></citation></ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kellert</surname> <given-names>L</given-names></name> <name><surname>Schrader</surname> <given-names>F</given-names></name> <name><surname>Ringleb</surname> <given-names>P</given-names></name> <name><surname>Steiner</surname> <given-names>T</given-names></name> <name><surname>B&#x000F6;sel</surname> <given-names>J</given-names></name></person-group>. <article-title>The impact of low hemoglobin levels and transfusion on critical care patients with severe ischemic stroke: STroke: RelevAnt impact of HemoGlobin, Hematocrit and Transfusion (STRAIGHT)&#x02013;an observational study</article-title>. <source>J Crit Care.</source> (<year>2014</year>) <volume>29</volume>:<fpage>236</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1016/j.jcrc.2013.11.008</pub-id><pub-id pub-id-type="pmid">24332995</pub-id></citation></ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hecht</surname> <given-names>N</given-names></name> <name><surname>Neugebauer</surname> <given-names>H</given-names></name> <name><surname>Fiss</surname> <given-names>I</given-names></name> <etal/></person-group>. <article-title>Infarct volume predicts outcome after decompressive hemicraniectomy for malignant hemispheric stroke</article-title>. <source>J Cereb Blood Flow Metab.</source> (<year>2018</year>) <volume>38</volume>:<fpage>1096</fpage>&#x02013;<lpage>103</lpage>. <pub-id pub-id-type="doi">10.1177/0271678X17718693</pub-id><pub-id pub-id-type="pmid">28665171</pub-id></citation></ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Semerano</surname> <given-names>A</given-names></name> <name><surname>Strambo</surname> <given-names>D</given-names></name> <name><surname>Martino</surname> <given-names>G</given-names></name> <name><surname>Comi</surname> <given-names>G</given-names></name> <name><surname>Filippi</surname> <given-names>M</given-names></name> <name><surname>Roveri</surname> <given-names>L</given-names></name> <etal/></person-group>. <article-title>Leukocyte counts and ratios are predictive of stroke outcome and hemorrhagic complications independently of infections</article-title>. <source>Front Neurol.</source> (<year>2020</year>) <volume>11</volume>:<fpage>201</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2020.00201</pub-id><pub-id pub-id-type="pmid">32308640</pub-id></citation></ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sadeghi</surname> <given-names>F</given-names></name> <name><surname>Kov&#x000E1;cs</surname> <given-names>S</given-names></name> <name><surname>Zs&#x000F3;ri</surname> <given-names>KS</given-names></name> <name><surname>Csiki</surname> <given-names>Z</given-names></name> <name><surname>Bereczky</surname> <given-names>Z</given-names></name> <name><surname>Shemirani</surname> <given-names>AH</given-names></name></person-group>. <article-title>Platelet count and mean volume in acute stroke: a systematic review and meta-analysis</article-title>. <source>Platelets.</source> (<year>2020</year>) <volume>31</volume>:<fpage>731</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1080/09537104.2019.1680826</pub-id><pub-id pub-id-type="pmid">31657263</pub-id></citation></ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="web"><person-group person-group-type="author"><collab>Cranial Planning</collab></person-group> (<year>2025</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.brainlab.com/surgery-products/overview-neurosurgery-products/cranial-planning/">https://www.brainlab.com/surgery-products/overview-neurosurgery-products/cranial-planning/</ext-link> (Accessed May 21, 2025).</citation>
</ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ho</surname> <given-names>M-Y</given-names></name> <name><surname>Tseng</surname> <given-names>W-L</given-names></name> <name><surname>Xiao</surname> <given-names>F</given-names></name></person-group>. <article-title>Estimation of the craniectomy surface area by using postoperative images</article-title>. <source>Int J Biomed Imaging.</source> (<year>2018</year>) <volume>2018</volume>:<fpage>5237693</fpage>. <pub-id pub-id-type="doi">10.1155/2018/5237693</pub-id><pub-id pub-id-type="pmid">29971096</pub-id></citation></ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saver</surname> <given-names>JL</given-names></name> <name><surname>Chaisinanunkul</surname> <given-names>N</given-names></name> <name><surname>Campbell</surname> <given-names>BCV</given-names></name> <name><surname>Grotta</surname> <given-names>JC</given-names></name> <name><surname>Hill</surname> <given-names>MD</given-names></name> <name><surname>Khatri</surname> <given-names>P</given-names></name> <etal/></person-group>. <article-title>Standardized nomenclature for modified rankin scale global disability outcomes: consensus recommendations from stroke therapy academic industry roundtable XI</article-title>. <source>Stroke.</source> (<year>2021</year>) <volume>52</volume>:<fpage>3054</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.121.034480</pub-id><pub-id pub-id-type="pmid">34320814</pub-id></citation></ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vahedi</surname> <given-names>K</given-names></name> <name><surname>Vicaut</surname> <given-names>E</given-names></name> <name><surname>Mateo</surname> <given-names>J</given-names></name> <name><surname>Kurtz</surname> <given-names>A</given-names></name> <name><surname>Orabi</surname> <given-names>M</given-names></name> <name><surname>Guichard</surname> <given-names>JP</given-names></name> <etal/></person-group>. <article-title>Sequential-design, multicenter, randomized, controlled trial of early decompressive craniectomy in malignant middle cerebral artery infarction (DECIMAL Trial)</article-title>. <source>Stroke.</source> (<year>2007</year>) <volume>38</volume>:<fpage>2506</fpage>&#x02013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.107.485235</pub-id><pub-id pub-id-type="pmid">17690311</pub-id></citation></ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hofmeijer</surname> <given-names>J</given-names></name> <name><surname>Kappelle</surname> <given-names>LJ</given-names></name> <name><surname>Algra</surname> <given-names>A</given-names></name> <name><surname>Amelink</surname> <given-names>GJ</given-names></name> <name><surname>van Gijn</surname> <given-names>J</given-names></name> <name><surname>van der Worp</surname> <given-names>HB</given-names></name></person-group>. <article-title>Surgical decompression for space-occupying cerebral infarction (the hemicraniectomy after middle cerebral artery infarction with life-threatening edema trial HAMLET): a multicentre, open, randomised trial</article-title>. <source>Lancet Neurol.</source> (<year>2009</year>) <volume>8</volume>:<fpage>326</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1016/S1474-4422(09)70047-X</pub-id><pub-id pub-id-type="pmid">19269254</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>J&#x000FC;ttler</surname> <given-names>E</given-names></name> <name><surname>Schwab</surname> <given-names>S</given-names></name> <name><surname>Schmiedek</surname> <given-names>P</given-names></name> <name><surname>Unterberg</surname> <given-names>A</given-names></name> <name><surname>Hennerici</surname> <given-names>M</given-names></name> <name><surname>Woitzik</surname> <given-names>J</given-names></name> <etal/></person-group>. <article-title>Decompressive surgery for the treatment of malignant infarction of the middle cerebral artery (DESTINY): a randomized, controlled trial</article-title>. <source>Stroke.</source> (<year>2007</year>) <volume>38</volume>:<fpage>2518</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.107.485649</pub-id><pub-id pub-id-type="pmid">17690310</pub-id></citation></ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>J&#x000FC;ttler</surname> <given-names>E</given-names></name> <name><surname>B&#x000F6;sel</surname> <given-names>J</given-names></name> <name><surname>Amiri</surname> <given-names>H</given-names></name> <name><surname>Schiller</surname> <given-names>P</given-names></name> <name><surname>Limprecht</surname> <given-names>R</given-names></name> <name><surname>Hacke</surname> <given-names>W</given-names></name> <etal/></person-group>. <article-title>DESTINY II: DEcompressive surgery for the treatment of malignant INfarction of the middle cerebral arterY II</article-title>. <source>Int J Stroke.</source> (<year>2011</year>) <volume>6</volume>:<fpage>79</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1111/j.1747-4949.2010.00544.x</pub-id><pub-id pub-id-type="pmid">21205246</pub-id></citation></ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>D</given-names></name> <name><surname>Liu</surname> <given-names>M</given-names></name> <name><surname>Yu</surname> <given-names>R</given-names></name> <name><surname>Liu</surname> <given-names>Y</given-names></name> <name><surname>Jiang</surname> <given-names>W</given-names></name> <name><surname>Fan</surname> <given-names>Q</given-names></name> <etal/></person-group>. <article-title>Evaluating the performance of GPT-3.5, GPT-4, and GPT-4o in the Chinese national medical licensing examination</article-title>. <source>Sci Rep</source>. (<year>2025</year>) <volume>15</volume>:<fpage>14119</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-025-98949-2</pub-id><pub-id pub-id-type="pmid">40269046</pub-id></citation></ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vahedi</surname> <given-names>K</given-names></name> <name><surname>Hofmeijer</surname> <given-names>J</given-names></name> <name><surname>Juettler</surname> <given-names>E</given-names></name> <name><surname>Vicaut</surname> <given-names>E</given-names></name> <name><surname>George</surname> <given-names>B</given-names></name> <name><surname>Algra</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Early decompressive surgery in malignant infarction of the middle cerebral artery: a pooled analysis of three randomised controlled trials</article-title>. <source>Lancet Neurol.</source> (<year>2007</year>) <volume>6</volume>:<fpage>215</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1016/S1474-4422(07)70036-4</pub-id><pub-id pub-id-type="pmid">17303527</pub-id></citation></ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arshad</surname> <given-names>HB</given-names></name> <name><surname>Butt</surname> <given-names>SA</given-names></name> <name><surname>Khan</surname> <given-names>SU</given-names></name> <name><surname>Javed</surname> <given-names>Z</given-names></name> <name><surname>Nasir</surname> <given-names>K</given-names></name></person-group>. <article-title>ChatGPT and artificial intelligence in hospital level research: potential, precautions, and prospects</article-title>. <source>Methodist Debakey Cardiovasc J.</source> (<year>2023</year>) <volume>19</volume>:<fpage>77</fpage>&#x02013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.14797/mdcvj.1290</pub-id><pub-id pub-id-type="pmid">38028967</pub-id></citation></ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Athaluri</surname> <given-names>SA</given-names></name> <name><surname>Manthena</surname> <given-names>SV</given-names></name> <name><surname>Kesapragada</surname> <given-names>VSRKM</given-names></name> <name><surname>Yarlagadda</surname> <given-names>V</given-names></name> <name><surname>Dave</surname> <given-names>T</given-names></name> <name><surname>Duddumpudi</surname> <given-names>RTS</given-names></name></person-group>. <article-title>Exploring the boundaries of reality: investigating the phenomenon of artificial intelligence hallucination in scientific writing through ChatGPT references</article-title>. <source>Cureus.</source> (<year>2023</year>) <volume>15</volume>:<fpage>e37432</fpage>. <pub-id pub-id-type="doi">10.7759/cureus.37432</pub-id><pub-id pub-id-type="pmid">37182055</pub-id></citation></ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kozachek</surname> <given-names>D</given-names></name></person-group>. <article-title>Investigating the perception of the future in GPT-3,&#x02212;3.5 and GPT-4</article-title>. In: <italic>Creativity and Cognition</italic>. New York, NY: Association for Computing Machinery (<year>2023</year>). p. <fpage>282</fpage>&#x02013;<lpage>287</lpage>.</citation>
</ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Torrente</surname> <given-names>M</given-names></name> <name><surname>Sousa</surname> <given-names>PA</given-names></name> <name><surname>Hern&#x000E1;ndez</surname> <given-names>R</given-names></name> <name><surname>Blanco</surname> <given-names>M</given-names></name> <name><surname>Calvo</surname> <given-names>V</given-names></name> <name><surname>Collazo</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>An artificial intelligence-based tool for data analysis and prognosis in cancer patients: results from the clarify study</article-title>. <source>Cancers</source>. (<year>2022</year>) <volume>14</volume>:<fpage>4041</fpage>. <pub-id pub-id-type="doi">10.3390/cancers14164041</pub-id><pub-id pub-id-type="pmid">36011034</pub-id></citation></ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuo</surname> <given-names>CC</given-names></name> <name><surname>Monteiro</surname> <given-names>A</given-names></name> <name><surname>Lim</surname> <given-names>J</given-names></name> <name><surname>Brown</surname> <given-names>NJ</given-names></name> <name><surname>Recker</surname> <given-names>MJ</given-names></name> <name><surname>Ghannam</surname> <given-names>MM</given-names></name> <etal/></person-group>. <article-title>An online calculator using machine learning for predicting survival in pediatric patients with medulloblastoma</article-title>. <source>J Neurosurg Pediatr.</source> (<year>2024</year>) <volume>33</volume>:<fpage>85</fpage>&#x02013;<lpage>94</lpage>. <pub-id pub-id-type="doi">10.3171/2023.8.PEDS2352</pub-id><pub-id pub-id-type="pmid">37922543</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="web"><person-group person-group-type="author"><collab>ChatGPT</collab></person-group> (<year>2025</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://chatgpt.com/c/675b4cf6-e9d0-800f-b437-12ba18389a58">https://chatgpt.com/c/675b4cf6-e9d0-800f-b437-12ba18389a58</ext-link> (Accessed March 31, 2025).</citation>
</ref>
</ref-list>
</back>
</article>