<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" dtd-version="1.3" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Appl. Math. Stat.</journal-id>
<journal-title-group>
<journal-title>Frontiers in Applied Mathematics and Statistics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Appl. Math. Stat.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2297-4687</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fams.2026.1732313</article-id>
<article-version article-version-type="Version of Record" vocab="NISO-RP-8-2008"/>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Original Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Fuzzy decision-tree regression model and its application to measure some climate change factors</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Habeeb</surname> <given-names>Ali Salman</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &#x00026; editing</role>
<uri xlink:href="https://loop.frontiersin.org/people/3255438"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Hasan</surname> <given-names>Hussein A.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &#x00026; editing</role>
<uri xlink:href="https://loop.frontiersin.org/people/3353491"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Al-Sinjary</surname> <given-names>Adnan M.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; original draft" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing &#x2013; original draft</role>
<role vocab="credit" vocab-identifier="https://credit.niso.org/" vocab-term="Writing &#x2013; review &amp; editing" vocab-term-identifier="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing &#x2013; review &#x00026; editing</role>
</contrib>
</contrib-group>
<aff id="aff1"><label>1</label><institution>Department of Statistics, University of Sumer</institution>, <city>Rifai</city>, <country country="iq">Iraq</country></aff>
<aff id="aff2"><label>2</label><institution>Department of Statistics and Informatics Techniques, Northern Technical University</institution>, <city>Mosul</city>, <country country="iq">Iraq</country></aff>
<author-notes>
<corresp id="c001"><label>&#x0002A;</label>Correspondence: Ali Salman Habeeb, <email xlink:href="mailto:asalhabeeb@gmail.com">asalhabeeb@gmail.com</email></corresp>
</author-notes>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2026-01-30">
<day>30</day>
<month>01</month>
<year>2026</year>
</pub-date>
<pub-date publication-format="electronic" date-type="collection">
<year>2026</year>
</pub-date>
<volume>12</volume>
<elocation-id>1732313</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>10</month>
<year>2025</year>
</date>
<date date-type="rev-recd">
<day>06</day>
<month>01</month>
<year>2026</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>01</month>
<year>2026</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2026 Habeeb, Hasan and Al-Sinjary.</copyright-statement>
<copyright-year>2026</copyright-year>
<copyright-holder>Habeeb, Hasan and Al-Sinjary</copyright-holder>
<license>
<ali:license_ref start_date="2026-01-30">https://creativecommons.org/licenses/by/4.0/</ali:license_ref>
<license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License (CC BY)</ext-link>. The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<p>This study examines climate change using data on two important variables: temperature and wind speed. It employs a decision-making method based on fuzzy logic to address ambiguity and uncertainty. We propose an analytical framework for the phenomenon, with the aim of reaching more accurate decisions about its risks and taking appropriate precautions, in the near and distant future, against a natural emergency that is intensifying over time. We describe how to implement the GUIDE regression tree algorithm as the main tool for analyzing fuzzy sets, using the triangular membership function to fuzzify the data and obtain more accurate fuzzy subsets, which are then analyzed through chi-square tables to reach a decision under a suitable hypothesis. The proposed method was applied to a sample of 425 daily observations from Dhi-Qar Governorate, Iraq, for the period from December 2024 to February 2025. The analysis was carried out with custom code written in R. The results indicate that the two variables (temperature and wind speed) have a fundamental influence on the speed of climate change.</p></abstract>
<kwd-group>
<kwd>algorithm GUIDE</kwd>
<kwd>chi-square test</kwd>
<kwd>fuzzy decision tree</kwd>
<kwd>fuzzy regression</kwd>
<kwd>triangular membership function</kwd>
</kwd-group>
<funding-group>
  <funding-statement>The author(s) declared that financial support was not received for this work and/or its publication.</funding-statement>
</funding-group>
<counts>
<fig-count count="8"/>
<table-count count="4"/>
<equation-count count="17"/>
<ref-count count="35"/>
<page-count count="14"/>
<word-count count="8604"/>
</counts>
<custom-meta-group>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Mathematics of Computation and Data Science</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<label>1</label>
<title>Introduction</title>
<p>Statistical decision-making has attracted the attention of many researchers, who typically aim to select the best decision from a regression model fitted to a set of crisp data. One of the main scientific challenges confronting decision-making is ambiguity and uncertainty. Regression trees are widely used in the statistical literature as one of the most important methods in this area. Fuzzy set theory, in turn, is one of the most important tools for handling ambiguous data, particularly when the variables can be described as categorical (that is, read as categorical groups). Climate change phenomena, such as changes in temperature and wind speed, are encountered daily and affect our lives. We used fuzzy set theory to represent the phenomenon under study, describing certain of its variables both qualitatively and quantitatively. In this context, we focus on both aspects in order to develop an applied model that represents the two types of variables within the dataset, ultimately supporting decision-making with fuzzy decision-tree regression. We focus on two analytical approaches in order to reach a more accurate interpretation based on an understanding of how the methods relate. The first tests hypotheses (acceptance or rejection) to assess the significance of the explanatory variables for making correct decisions, based on the tree structure appropriate to the studied phenomenon. The second models the phenomenon through regression analysis, which is based on constructing cross-tabulations to analyze the relationship between categorical variables. Many researchers have contributed to this literature. 
Suarez and Lutsko [<xref ref-type="bibr" rid="B1">1</xref>] presented a study on constructing fuzzy decision trees for regression and classification, in which fuzziness is introduced by integrating fuzzy logic with decision trees of the CART type. A training rule for fuzzy decision trees was developed, resembling the backpropagation algorithm used in neural networks. This rule is compatible with a high-quality optimization algorithm designed to determine the parameters of the fuzzy partitions. Loh [<xref ref-type="bibr" rid="B2">2</xref>] presented a comprehensive review of classification and regression trees, examining several available algorithms and comparing their capabilities, strengths, and weaknesses on two examples. Classification trees are designed for dependent variables that take a finite number of unordered values, with prediction error measured by the cost of misclassification. Regression trees, in contrast, are designed for dependent variables that take continuous or ordered discrete values, with prediction error typically measured by the squared difference between the predicted and actual values. Segatori et al. [<xref ref-type="bibr" rid="B3">3</xref>] proposed a distributed learning model for fuzzy decision trees based on the MapReduce framework, aimed at constructing both binary-split and multi-split trees from big data. The proposed model is based on a distributed fuzzy discretizer, which generates strong fuzzy partitions for each continuous attribute using fuzzy information entropy. These partitions are then used as inputs for the fuzzy decision tree learning algorithm, which selects the most appropriate features at decision nodes based on fuzzy information gain. 
The results demonstrated that the proposed approach outperforms existing methods by achieving high performance while reducing computational complexity, making it an effective solution for big data classification using distributed fuzzy decision trees. Mohammadiun et al. [<xref ref-type="bibr" rid="B4">4</xref>] developed a framework for designing and optimizing Fuzzy Decision Tree Regression (FDTR) models, aimed at selecting the most suitable response strategies for oil spill incidents in the harsh Arctic environment. The study employed three types of regression analysis: linear, non-linear, and Gaussian Process Regression (GPR). Additionally, four information evaluation metrics were used for decision tree splitting: information gain, deviance, Gini impurity, and misclassification error. To enhance the predictive performance of the FDTR models, the Non-dominated Self-adaptive Differential Evolution (NSDE) algorithm was applied. When tested on oil spill data, the results showed a 14% improvement in prediction accuracy and a 57% reduction in the number of rules, thereby enhancing the efficiency and robustness of the model. Nikolaidis [<xref ref-type="bibr" rid="B5">5</xref>] conducted a study based on real-world data on energy demand and wind power generation, in which regression trees were used to forecast future renewable energy production. The following climatic factors were used as inputs in distribution networks across different regions: wind speed and direction, ambient temperature, relative humidity, renewable energy capacity, and curtailed renewable energy output. One of the key findings of the study was that, in future low-carbon energy systems, the curtailment of renewable energy production will play a significant role in intelligent forecasting systems. Therefore, accurately modeling the relationship between inputs and outputs is essential.</p></sec>
<sec id="s2">
<label>2</label>
<title>The concept of fuzzy logic</title>
<p>Fuzzy logic grew out of the theory of fuzzy sets introduced in 1965 by Lotfi Zadeh, a scientist of Azerbaijani origin at the University of California, Berkeley, as a better way of handling imprecise data. It is a logical system that generalizes classical logic [<xref ref-type="bibr" rid="B6">6</xref>]; that is, it encompasses theories and techniques that use fuzzy sets, which are sets without crisp boundaries (i.e., boundaries that are unknown, undefined, or ambiguous) [<xref ref-type="bibr" rid="B7">7</xref>]. Fuzzy logic provides a simple way to describe and represent human expertise. Moreover, it offers practical solutions to real-world problems that are cost-effective and reasonable compared with those offered by other techniques [<xref ref-type="bibr" rid="B8">8</xref>].</p></sec>
<sec id="s3">
<label>3</label>
<title>The basic definitions</title>
<p>A classical (crisp) set <italic>A</italic>&#x02286;<italic>X</italic> is a collection of elements or objects <italic>x</italic>&#x02208;<italic>X</italic>, which may be countable or uncountable, where each element either belongs to <italic>A</italic> or does not. Hence <italic>A</italic> can be characterized by the indicator function &#x003BC;<sub><italic>A</italic></sub>(<italic>x</italic>), which equals 1 if <italic>x</italic> belongs to <italic>A</italic> and 0 otherwise [<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B35">35</xref>].</p>
<p>Definition 1: If <italic>X</italic> is a set of objects generally denoted by <italic>x</italic>, then a fuzzy set &#x000C3; in <italic>X</italic> is defined as a set of ordered pairs [<xref ref-type="bibr" rid="B10">10</xref>]:</p>
<disp-formula id="EQ1"><mml:math id="M1"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mtext>&#x000C3;</mml:mtext><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msub><mml:mrow><mml:mtext>M</mml:mtext></mml:mrow><mml:mrow><mml:mtext>&#x000C3;</mml:mtext></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mo>|</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>x</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>X</mml:mi></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(1)</label></disp-formula>
<p>where M<sub>&#x000C3;</sub>(<italic>x</italic>) is called the membership function; it gives the degree of membership of <italic>x</italic> in the set &#x000C3;. When M<sub>&#x000C3;</sub>(<italic>x</italic>) takes only the two values 0 and 1, the set &#x000C3; reduces to a classical (non-fuzzy) set. The range of the membership function is a subset of the non-negative real numbers. Generally, elements with a membership degree of 0 are not considered part of the fuzzy set [<xref ref-type="bibr" rid="B11">11</xref>].</p>
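<p>As a minimal illustration of Definition 1, the following sketch stores a discrete fuzzy set as a mapping from each element to its membership degree and lists it as ordered pairs. The example universe, the membership values, and the helper names <monospace>fuzzy_pairs</monospace> and <monospace>is_crisp</monospace> are hypothetical and chosen only for illustration; Python is used for convenience and is not the language of the analysis code used in this study.</p>
<code language="python"><![CDATA[
```python
# Illustrative sketch of Definition 1 (all values below are hypothetical).
# A discrete fuzzy set is stored as {element: membership degree}.

def fuzzy_pairs(membership):
    """Return the fuzzy set as a sorted list of ordered pairs (x, M(x))."""
    return sorted(membership.items())

def is_crisp(membership):
    """A fuzzy set reduces to a classical set when every degree is 0 or 1."""
    return all(degree in (0, 1) for degree in membership.values())

# Hypothetical fuzzy set "warm day" over a few temperatures in degrees Celsius.
warm = {10: 0.0, 15: 0.25, 20: 0.75, 25: 1.0}
# Hypothetical crisp counterpart: a day is either warm or not.
warm_crisp = {10: 0, 15: 0, 20: 1, 25: 1}

print(fuzzy_pairs(warm))
print(is_crisp(warm), is_crisp(warm_crisp))
```
]]></code>
<p>The second set in the sketch shows the special case mentioned above: when every membership degree is 0 or 1, the fuzzy set behaves as a classical set.</p>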
<p>Definition 2: Let &#x003A9; be a set. A fuzzy subset &#x000C3; of &#x003A9; is defined by its membership function, written &#x000C3;(<italic>x</italic>), which takes values in [0, 1] for all <italic>x</italic> in &#x003A9;; that is, &#x000C3; is a function mapping &#x003A9; into [0, 1].</p>
<p>Note that if &#x000C3;(<italic>x</italic><sub>0</sub>) &#x0003D; 1, we say that <italic>x</italic><sub>0</sub> belongs to &#x000C3;; if &#x000C3;(<italic>x</italic><sub>1</sub>) &#x0003D; 0, we say that <italic>x</italic><sub>1</sub> does not belong to &#x000C3;; and if &#x000C3;(<italic>x</italic><sub>2</sub>) &#x0003D; 0.6, we say that the membership value of <italic>x</italic><sub>2</sub> in &#x000C3; is 0.6 [<xref ref-type="bibr" rid="B12">12</xref>].</p></sec>
<sec id="s4">
<label>4</label>
<title>Fuzzy number</title>
<p>Definition 3: A fuzzy number, denoted by <inline-formula><mml:math id="M2"><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:math></inline-formula>, is a fuzzy subset of the set of real numbers <italic>R</italic> that is characterized by a membership function <inline-formula><mml:math id="M3"><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> satisfying the following conditions [<xref ref-type="bibr" rid="B13">13</xref>]:</p>
<list list-type="simple">
<list-item><p>(1) <inline-formula><mml:math id="M4"><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi>R</mml:mi><mml:mo>&#x02192;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> is Borel-measurable.</p></list-item>
<list-item><p>(2) <inline-formula><mml:math id="M5"><mml:mo>&#x02203;</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mi>R</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mo>:</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>.</p></list-item>
</list>
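<p>Then <inline-formula><mml:math><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:math></inline-formula> is called a fuzzy number and <inline-formula><mml:math><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow></mml:msub></mml:math></inline-formula> is called its membership function [<xref ref-type="bibr" rid="B14">14</xref>].</p></sec>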
<sec id="s5">
<label>5</label>
<title>Fuzzy number membership functions</title>
<p>The membership function plays an essential role in fuzzy set theory, as it constitutes one component of the ordered pair that defines a fuzzy set [<xref ref-type="bibr" rid="B15">15</xref>]. Membership functions (also referred to in some sources as fuzzification functions) determine the degree to which an element belongs to a fuzzy set: in a fuzzy set <italic>A</italic>, an element <italic>x</italic> belongs to the set partially, according to a specific membership function &#x003BC;<sub><italic>A</italic></sub>(<italic>x</italic>). The fundamental requirement for such a function is that its range lies within the interval [0, 1] [<xref ref-type="bibr" rid="B16">16</xref>]. There are various types of membership functions, each suited to a particular phenomenon depending on its nature, where the data of the phenomenon are represented as fuzzy sets [<xref ref-type="bibr" rid="B17">17</xref>]. Two main approaches are commonly used to determine the appropriate membership function, followed by the approach proposed in this study:</p>
<list list-type="simple">
<list-item><p>(1) Human expertise: fuzzy sets are often used to represent and formalize human knowledge, and the membership functions constitute part of that knowledge.</p></list-item>
<list-item><p>(2) Collected data: the structure (form) of the membership function is specified first, and the parameters of the function are then fine-tuned on the observed data [<xref ref-type="bibr" rid="B18">18</xref>].</p></list-item>
<list-item><p>(3) Proposed in this study: sample quartiles are used to determine the parameters of the membership functions.</p></list-item>
</list>
<p>Quartiles are three statistical measures that divide an ordered dataset into four approximately equal parts. They are denoted by <italic>q</italic><sub><italic>i</italic></sub>, where <italic>i</italic> = 1, 2, 3 [<xref ref-type="bibr" rid="B16">16</xref>].</p>
<p><italic>q</italic><sub>1</sub>: The first quartile is the 25th percentile; the lowest 25% of the data lie below this point.</p>
<p><italic>q</italic><sub>2</sub>: The second quartile is the 50th percentile; the lowest 50% of the data lie below this point. It is also called the median.</p>
<p><italic>q</italic><sub>3</sub>: The third quartile is the 75th percentile; the lowest 75% of the data lie below this point. It is also known as the upper quartile. Each quartile can be calculated by arranging the data in ascending order and then computing the quartile rank (position) as in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>.</p>
<disp-formula id="EQ2"><mml:math id="M6"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>N</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(2)</label></disp-formula>
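<p>where <italic>i</italic> is the index of the quartile to be calculated, <italic>N</italic> is the number of observations, and <italic>C</italic><sub><italic>i</italic></sub> is the position (rank) of that quartile in the ordered data [<xref ref-type="bibr" rid="B19">19</xref>].</p>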
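<p>The quartile position in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref> can be sketched as follows. The function names, the sample values, and in particular the linear interpolation used when the position is not a whole number are assumptions made for illustration; the text does not state how fractional positions are handled, and statistical packages use several different conventions.</p>
<code language="python"><![CDATA[
```python
# Sketch of Equation 2: quartile position C_i = N * i / 4 in the ordered data.
# The interpolation rule below is an assumption; the source does not state one.

def quartile_position(n, i):
    """Position (1-based rank) of the i-th quartile among n ordered values."""
    return n * i / 4

def quartile_value(data, i):
    """Value at the quartile position, interpolating between neighbouring ranks."""
    ordered = sorted(data)
    n = len(ordered)
    pos = quartile_position(n, i)
    lower = max(int(pos), 1)          # rank just below (or at) the position
    upper = min(lower + 1, n)         # next rank, capped at the last value
    frac = pos - int(pos)             # fractional part of the position
    return ordered[lower - 1] + frac * (ordered[upper - 1] - ordered[lower - 1])

# Hypothetical temperatures in degrees Celsius (not the study's data).
temps = [12, 14, 15, 17, 18, 20, 21, 23]
print([quartile_position(len(temps), i) for i in (1, 2, 3)])
print([quartile_value(temps, i) for i in (1, 2, 3)])
```
]]></code>
<p>For a sample of 425 observations, the same formula places the first quartile at position 425 &#x000D7; 1/4 = 106.25 in the ordered data.</p>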
<p>Membership functions take various forms, the most well-known of which include the following.</p>
<sec>
<label>5.1</label>
<title>Triangular membership function</title>
<p>The membership values of elements belonging to a fuzzy set can be represented by straight-line segments, that is, by a piecewise-linear function. The triangular function is characterized by three main parameters (boundaries), <italic>a</italic>, <italic>b</italic>, and <italic>c</italic>, and is defined in <xref ref-type="disp-formula" rid="EQ3">Equation 3</xref>.</p>
<disp-formula id="EQ3"><mml:math id="M7"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>a</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>c</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign="left left" class="array"><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mi>x</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>a</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mo>-</mml:mo><mml:mi>a</mml:mi></mml:mrow></mml:mfrac></mml:mtd><mml:mtd><mml:mi>a</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>b</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>c</mml:mi><mml:mo>-</mml:mo><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mo>-</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:mfrac></mml:mtd><mml:mtd><mml:mi>b</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>c</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mi>x</mml:mi><mml:mo>&#x02265;</mml:mo><mml:mi>c</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(3)</label></disp-formula>
<p>where <italic>a</italic>&#x0003C;<italic>b</italic>&#x0003C;<italic>c</italic> and <italic>a</italic>, <italic>b</italic>, <italic>c</italic>&#x02208;<italic>R</italic>. <xref ref-type="fig" rid="F1">Figure 1a</xref> illustrates the graph of the triangular membership function (see Hasan and Mohammad [<xref ref-type="bibr" rid="B15">15</xref>]).</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption><p>Shapes of the membership functions: <bold>(a)</bold> triangular; <bold>(b)</bold> trapezoidal.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0001.tif">
<alt-text content-type="machine-generated">Graph (a) depicts a triangular membership function with points a, b, and c on the x-axis, having its peak at b with a value of one. Graph (b) shows a trapezoidal membership function with points a, b, c, and d on the x-axis, maintaining a flat top between b and c at a value of one. Both graphs have &#x000B5;_A(x) on the y-axis.</alt-text>
</graphic>
</fig>
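<p>A minimal sketch of the triangular membership function in <xref ref-type="disp-formula" rid="EQ3">Equation 3</xref> is given below. The function name <monospace>triangular</monospace> and the parameter values are hypothetical; the values are chosen only to show the rising edge, the peak at <italic>b</italic>, and the falling edge, and they are not taken from the data of this study.</p>
<code language="python"><![CDATA[
```python
# Sketch of the triangular membership function in Equation 3.
# The parameter values used in the example are hypothetical.

def triangular(x, a, b, c):
    """Membership degree of x for a triangle with feet a, c and peak b (a < b < c)."""
    if not a < b < c:
        raise ValueError("parameters must satisfy a < b < c")
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)      # rising edge, a < x <= b
    return (c - x) / (c - b)          # falling edge, b < x < c

# Hypothetical parameters, e.g. three quartiles of a temperature sample.
a, b, c = 14.0, 17.0, 20.0
for x in (13.0, 15.5, 17.0, 18.5, 21.0):
    print(x, triangular(x, a, b, c))
```
]]></code>
<p>Checking the ordering of the parameters before evaluating the function avoids a division by zero when two boundaries coincide.</p>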
</sec>
<sec>
<label>5.2</label>
<title>Trapezoidal membership function</title>
<p>The trapezoidal function is also piecewise linear, and it is distinguished from other membership functions by having four parameters (boundaries), <italic>a</italic>, <italic>b</italic>, <italic>c</italic>, and <italic>d</italic>. It is defined in <xref ref-type="disp-formula" rid="EQ4">Equation 4</xref>:</p>
<disp-formula id="EQ4"><mml:math id="M8"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>a</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>c</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>d</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mtable columnalign="left left" class="array"><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>x</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:mi>a</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mo>-</mml:mo><mml:mi>a</mml:mi></mml:mrow></mml:mfrac></mml:mtd><mml:mtd><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>a</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:mi>b</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd><mml:mtd><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>b</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:mi>c</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>d</mml:mi><mml:mo>-</mml:mo><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mo>-</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:mfrac></mml:mtd><mml:mtd><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>c</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x02264;</mml:mo><mml:mi>d</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mtext>if&#x000A0;</mml:mtext><mml:mi>x</mml:mi><mml:mo>&#x0003E;</mml:mo><mml:mi>d</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(4)</label></disp-formula>
<p>where <italic>a</italic>&#x0003C;<italic>b</italic>&#x0003C;<italic>c</italic>&#x0003C;<italic>d</italic> and <italic>a</italic>, <italic>b</italic>, <italic>c</italic>, <italic>d</italic>&#x02208;<italic>R</italic>. <xref ref-type="fig" rid="F1">Figure 1b</xref> illustrates the graph of the trapezoidal membership function (see Alavala [<xref ref-type="bibr" rid="B20">20</xref>]).</p></sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Fuzzy linear regression model</title>
<p>Uncertain formulations can be used to model phenomena characterized by ambiguity through the use of fuzzy regression models, which are descriptive in nature and involve linguistic variables [<xref ref-type="bibr" rid="B21">21</xref>].</p>
<p><xref ref-type="fig" rid="F2">Figure 2</xref> shows how a variable is divided into fuzzy categorical subdivisions, represented by membership functions, and how those subdivisions overlap as linguistic values of a single fuzzy variable in the regression model.</p>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption><p>One-dimensional fuzzy linear regression model.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0002.tif">
<alt-text content-type="machine-generated">Flowchart illustrating a zigzag pattern within a triangular region on a graph. The Y-axis is labeled \(Y^{*}\) and the X-axis has vertical dotted lines intersecting the zigzag. Key points marked as \(Y_{0}^{*}\), \(Y_{1}^{*}\), \(Y_{2}^{*}\), and \(Y_{3}^{*}\) show progression along the graph. Arrows extending from points to lines indicate iterative steps or phases labeled \(I/(\lambda)^{n}\).</alt-text>
</graphic>
</fig>
</sec>
<sec id="s7">
<label>7</label>
<title>Reasons for using the fuzzy linear regression model</title>
<p>There are several reasons for using the linear regression model within the fuzzy logic literature, the most important of which include the following [<xref ref-type="bibr" rid="B22">22</xref>]:</p>
<list list-type="simple">
<list-item><p>(i) When the linear relationship is not well-defined.</p></list-item>
<list-item><p>(ii) The assumptions of classical regression models are often strict, particularly in terms of specifying the distribution of the random error term and the relationships among explanatory variables.</p></list-item>
<list-item><p>(iii) When the data exhibit an unclear or ambiguous linear trend.</p></list-item>
<list-item><p>(iv) When the number of observations for the studied phenomenon is limited, providing insufficient information.</p></list-item>
<list-item><p>(v) Inaccuracy in results due to uncertainty and imprecision, which are associated with vague or linguistic variables [<xref ref-type="bibr" rid="B21">21</xref>].</p></list-item>
</list></sec>
<sec id="s8">
<label>8</label>
<title>Description of the decision tree</title>
<p>A fundamental feature of machine learning methods is that they require a set of numerical values known as input data. An appropriate machine learning algorithm, typically featuring a backfitting mechanism, is then applied, producing a set of values referred to as output data [<xref ref-type="bibr" rid="B23">23</xref>]. Applying the decision tree regression method requires a precise description of the tree structure relevant to the study. The data are typically divided into two subsets: the training set and the testing set [<xref ref-type="bibr" rid="B24">24</xref>]. Decision trees exist in various forms, and how a tree operates depends on the type selected. This study focuses on a binary-split decision tree (i.e., with left and right nodes), which is characterized by three types of nodes: a root node, internal split nodes, and terminal leaf nodes. <xref ref-type="fig" rid="F3">Figure 3</xref> illustrates the specific type of decision tree used in this paper; it is one of the types used in previous studies as non-parametric regression tools to examine the impact of explanatory variables on the dependent variable [<xref ref-type="bibr" rid="B25">25</xref>]. Some researchers have adopted decision trees within the context of machine learning as a decision-making tool, employing established algorithms such as CART, GUIDE, and M5 (Alberto). In the present study, the GUIDE algorithm is employed to implement the decision tree framework, as illustrated in <xref ref-type="fig" rid="F3">Figure 3</xref> [<xref ref-type="bibr" rid="B2">2</xref>].</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption><p>Decision tree nodes, root, interior and leaf [<xref ref-type="bibr" rid="B33">33</xref>].</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0003.tif">
<alt-text content-type="machine-generated">Diagram illustrating a tree structure. An orange oval represents the root node, branching into gray circles labeled as interior nodes. These further branch into green rectangles labeled as leaf nodes.</alt-text>
</graphic>
</fig>
<sec>
<label>8.1</label>
<title>Generalized unbiased interaction detection and estimation (GUIDE)</title>
<p>GUIDE is one of the decision-making algorithms extensively studied in the literature for building linear regression models as an important tool in the decision-making process [<xref ref-type="bibr" rid="B26">26</xref>]. The algorithm was designed to eliminate bias in selecting the most significant variables when modeling phenomena in regression analysis, thereby providing a good fit to the relevant experimental data [<xref ref-type="bibr" rid="B24">24</xref>]. Its operation is based on the chi-square test applied to residuals. The algorithm is constructed within the framework of piecewise constant linear regression models with univariate splits [<xref ref-type="bibr" rid="B27">27</xref>].</p>
<p>At each terminal node, the sample mean is calculated to serve as the estimate, and the residuals are then computed. The observations in the node are split into two groups: the first contains positive residuals, while the second contains non-positive residuals. The idea behind this division is to detect non-random patterns in the residuals at each node by examining their signs. A chi-square test can be employed to examine the association between the residual signs, represented as rows, and the groups of predictor values, represented as c columns [<xref ref-type="bibr" rid="B28">28</xref>]. If the predictor variable is categorical with c categories, a 2 &#x000D7; c contingency table is constructed, where the two rows correspond to the two residual groups and the c columns correspond to the categories [<xref ref-type="bibr" rid="B29">29</xref>]. If the predictor is a quantitative variable, its values can be divided into columns according to a specific scheme, commonly four groups bounded by the sample quartiles. This results in a 2 &#x000D7; 4 contingency table [<xref ref-type="bibr" rid="B30">30</xref>].</p>
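<p>The residual-sign test described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the GUIDE software: the function names <monospace>sign_quartile_table</monospace> and <monospace>chi_square_statistic</monospace> are hypothetical, the node model is taken to be the sample mean, and the p-value (which would come from a chi-square distribution with three degrees of freedom for a full 2 &#x000D7; 4 table) is left out.</p>
<code language="python"><![CDATA[
```python
from statistics import mean, quantiles


def sign_quartile_table(x, y):
    """Build the 2 x 4 table: rows = residual sign, columns = quartile group of x."""
    y_bar = mean(y)                       # constant (node mean) fit
    q1, q2, q3 = quantiles(x, n=4)        # sample quartiles of the predictor
    table = [[0, 0, 0, 0], [0, 0, 0, 0]]
    for xi, yi in zip(x, y):
        row = 0 if yi - y_bar > 0 else 1  # positive vs non-positive residual
        col = (xi > q1) + (xi > q2) + (xi > q3)
        table[row][col] += 1
    return table


def chi_square_statistic(table):
    """Pearson chi-square statistic for a contingency table of counts."""
    row_tot = [sum(r) for r in table]
    col_tot = [sum(c) for c in zip(*table)]
    n = sum(row_tot)
    stat = 0.0
    for i, r in enumerate(row_tot):
        for j, c in enumerate(col_tot):
            expected = r * c / n
            if expected > 0:              # skip empty rows or columns
                stat += (table[i][j] - expected) ** 2 / expected
    return stat


# Hypothetical data: a perfectly monotone relationship between x and y.
x = list(range(1, 17))
y = [float(v) for v in x]
table = sign_quartile_table(x, y)
print(table, chi_square_statistic(table))
```
]]></code>
<p>For this toy input all positive residuals fall in the two upper quartile groups, so the statistic is large; a predictor unrelated to the residual signs would give counts close to the expected values and a small statistic.</p>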
</sec>
<sec>
<label>8.2</label>
<title>Fuzzy decision tree regression structure</title>
<p>Binary decision trees are used to estimate the parameters of non-parametric regression models in the context of fuzzy set theory. Suppose that the function &#x003BC;<sub><italic>A</italic></sub>(<italic>x</italic>) represents the membership function of an element <italic>x</italic> to the fuzzy set A, and we attempt to find a solution by designing a decision tree with outcomes {<italic>either x</italic> &#x02208; <italic>A or x</italic> &#x02209; <italic>A</italic>}, where the value 1 represents membership of the element <italic>x</italic> to the set A, and the value 0 represents non-membership of the element <italic>x</italic> to the set A [<xref ref-type="bibr" rid="B23">23</xref>].</p>
<p>The degree of membership of the element <italic>x</italic> to the fuzzy set A allows for any real value between 0 and 1, which means the following [<xref ref-type="bibr" rid="B31">31</xref>]:</p>
<p>&#x003BC;<sub><italic>A</italic></sub>(<italic>x</italic>) &#x02208; [0, 1].</p>
<p>By using partial membership in a fuzzy set, the strict logical constraints of set membership can be relaxed. This makes decision trees in regression models more flexible and robust [<xref ref-type="bibr" rid="B3">3</xref>].</p>
<p>Based on the literature of crisp decision trees, the GUIDE method can be used to estimate the regression model. Suppose there are P explanatory (independent) variables.</p>
<p>Let <italic>x</italic><sub><italic>i</italic></sub> &#x0003D; (<italic>x</italic><sub>1</sub>, <italic>x</italic><sub>2</sub>, ..., <italic>x</italic><sub><italic>P</italic></sub>) <italic>i</italic> &#x0003D; 1, ..., <italic>p</italic></p>
<p>Categorical variables do not pose a significant challenge, except in the case of fuzzy partitions of categorical variables. The values of the response variable represent the prediction target and can be either categorical (for classification) or real-valued (for regression) [<xref ref-type="bibr" rid="B32">32</xref>].</p>
<p>According to the definition of a fuzzy decision tree, all nodes of the tree are connected to the root node <italic>t</italic><sub>0</sub>, where the decision tree is constructed using a hierarchical splitting strategy. The feature space is divided through a hierarchical sequence of logical tests into a set of non-overlapping regions, making the decision-making process straightforward. Each internal node in the decision tree corresponds to a test in the hierarchical structure used to construct the decision tree of interest [<xref ref-type="bibr" rid="B2">2</xref>].</p>
<p>Suppose the decision tree has been constructed up to a certain level, and the binary (terminal) nodes <italic>t</italic><sub><italic>i</italic></sub> are characterized by the membership function &#x003BC;<sub><italic>i</italic></sub>(<italic>x</italic>). Each node is split into two branches: one representing the value 1, which satisfies the logical test, and the other representing the value 0, which does not satisfy the logical test, as defined by <xref ref-type="disp-formula" rid="EQ5">Equation 5</xref> [<xref ref-type="bibr" rid="B1">1</xref>].</p>
<disp-formula id="EQ5"><mml:math id="M9"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>&#x003B1;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x000A0;&#x000A0;</mml:mtext><mml:mi>&#x003B1;</mml:mi><mml:mo>=</mml:mo><mml:mi>L</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>R</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(5)</label></disp-formula>
<p>The number of training examples in the terminal node <italic>t</italic><sub><italic>i</italic></sub>, weighted by their absolute membership degrees, is given by <xref ref-type="disp-formula" rid="EQ6">Equation 6</xref>.</p>
<disp-formula id="EQ6"><mml:math id="M10"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(6)</label></disp-formula>
<p>In a regression problem, the node <italic>t</italic><sub><italic>i</italic></sub> provides a prediction for the value of the response variable, which is equal to the membership-weighted mean of the response variable y for the training samples associated with <italic>t</italic><sub><italic>i</italic></sub>, as given by <xref ref-type="disp-formula" rid="EQ7">Equation 7</xref> [<xref ref-type="bibr" rid="B33">33</xref>].</p>
<disp-formula id="EQ7"><mml:math id="M11"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(7)</label></disp-formula>
<p>In the regression model, the binary (terminal) node <italic>t</italic><sub><italic>i</italic></sub> represents the predicted value of the response variable. This value corresponds to the average of the response variable y calculated from the training samples assigned to node <italic>t</italic><sub><italic>i</italic></sub>. The mathematical formula for computing this average is given by <xref ref-type="disp-formula" rid="EQ8">Equation 8</xref> [<xref ref-type="bibr" rid="B2">2</xref>].</p>
<disp-formula id="EQ8"><mml:math id="M12"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(8)</label></disp-formula>
<p>where</p>
<p><italic>N</italic><sub><italic>train</italic></sub> is the number of training samples, indexed by <italic>n</italic> &#x0003D; 1, ..., <italic>N</italic><sub><italic>train</italic></sub>.</p>
<p>&#x003BC;<sub><italic>i</italic></sub>(.) is the membership function of node <italic>t</italic><sub><italic>i</italic></sub>, <italic>i</italic> &#x0003D; 0, 1, 2, ..., where <italic>i</italic> indexes the nodes of the tree.</p>
<p><italic>y</italic><sub><italic>n</italic></sub>, <italic>n</italic> &#x0003D; 1, ..., <italic>N</italic><sub><italic>train</italic></sub>, are the values of the response variable in the training set.</p>
<p><italic>x</italic><sub><italic>n</italic></sub>, <italic>n</italic> &#x0003D; 1, ..., <italic>N</italic><sub><italic>train</italic></sub>, are the values of the explanatory variables in the training set.</p>
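<p>As a small numerical illustration of Equations 6 and 7, the following sketch computes the fuzzy sample count and the membership-weighted mean of one node. The function names and the four membership degrees are hypothetical and chosen only to make the arithmetic easy to follow.</p>
<code language="python"><![CDATA[
```python
def fuzzy_node_count(mu):
    """Equation 6: N_i as the sum of membership degrees over the training set."""
    return sum(mu)


def fuzzy_node_mean(mu, y):
    """Equation 7: membership-weighted mean of the response in node t_i."""
    n_i = fuzzy_node_count(mu)
    if n_i == 0:
        raise ValueError("node has zero total membership")
    return sum(m * v for m, v in zip(mu, y)) / n_i


# Hypothetical membership degrees mu_i(x_n) and responses y_n for four samples.
mu = [1.0, 0.5, 0.5, 0.0]
y = [10.0, 20.0, 30.0, 40.0]

n_i = fuzzy_node_count(mu)       # 1.0 + 0.5 + 0.5 + 0.0
y_bar_i = fuzzy_node_mean(mu, y) # (10 + 10 + 15 + 0) / 2
print(n_i, y_bar_i)
```
]]></code>
<p>With memberships restricted to 0 and 1 the same two functions reduce to the ordinary sample count and sample mean of a crisp node.</p>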
<p><xref ref-type="disp-formula" rid="EQ2">Equation 2</xref> can be expressed through building the tree as given in <xref ref-type="disp-formula" rid="EQ9">Equations 9</xref>, <xref ref-type="disp-formula" rid="EQ10">10</xref>:</p>
<disp-formula id="EQ9"><mml:math id="M13"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(9)</label></disp-formula>
<disp-formula id="EQ10"><mml:math id="M14"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(10)</label></disp-formula>
<p>The set of terminal (leaf) nodes over all branches of the tree is denoted by <inline-formula><mml:math id="M15"><mml:mover accent="true"><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:math></inline-formula>, so that the total number of nodes in the binary tree is given by the following formula [<xref ref-type="bibr" rid="B28">28</xref>].</p>
<disp-formula id="E11"><mml:math id="M16"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mstyle mathvariant="bold"><mml:mn>2</mml:mn></mml:mstyle><mml:mo>|</mml:mo><mml:mover accent="true"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext>T</mml:mtext></mml:mstyle></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover><mml:mo>|</mml:mo><mml:mo>-</mml:mo><mml:mstyle mathvariant="bold"><mml:mn>1</mml:mn></mml:mstyle><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Similarly, the number of internal (branch) nodes of the tree can be determined according to the following formula [<xref ref-type="bibr" rid="B26">26</xref>]:</p>
<disp-formula id="E12"><mml:math id="M17"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mo>|</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover><mml:mo>|</mml:mo><mml:mo>-</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mn>1</mml:mn><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
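<p>The two counting formulas can be checked on a small example. The sketch below assumes a binary tree written as nested 2-tuples, with any non-tuple value standing for a leaf; this representation and the name <monospace>count_nodes</monospace> are illustrative choices, not part of the method.</p>
<code language="python"><![CDATA[
```python
def count_nodes(tree):
    """Return (leaves, internal) for a binary tree given as nested 2-tuples."""
    if not isinstance(tree, tuple):
        return 1, 0                       # a leaf contributes one terminal node
    left_leaves, left_internal = count_nodes(tree[0])
    right_leaves, right_internal = count_nodes(tree[1])
    return left_leaves + right_leaves, left_internal + right_internal + 1


# Hypothetical tree with five leaves labelled "a" to "e".
tree = (("a", "b"), ("c", ("d", "e")))
leaves, internal = count_nodes(tree)
total = leaves + internal
print(leaves, internal, total)
```
]]></code>
<p>Here the five leaves give 5 - 1 = 4 internal nodes and 2 * 5 - 1 = 9 nodes in total, in agreement with the formulas.</p>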
<p>The terminal-node means <inline-formula><mml:math id="M18"><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> serve as the predicted values: the average value of the dependent variable within each terminal node is calculated from the observations that satisfy the tests leading to that node, as defined in the formulas above [<xref ref-type="bibr" rid="B31">31</xref>]. When applying the decision tree to a regression problem, it is essential to understand the membership relationship between each terminal node and the root node through the absolute degree of membership. On this basis, <xref ref-type="disp-formula" rid="EQ5">Equation 5</xref> can be restated as follows.</p>
<disp-formula id="EQ13"><mml:math id="M19"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>&#x003B1;</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003B1;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x000A0;&#x000A0;</mml:mtext><mml:mi>&#x003B1;</mml:mi><mml:mo>=</mml:mo><mml:mi>L</mml:mi><mml:mo>,</mml:mo><mml:mi>R</mml:mi><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(11)</label></disp-formula>
<p>Where R denotes the right node, L denotes the left node, and &#x003BC;<sub><italic>i</italic></sub>(<italic>x</italic>) is the absolute degree of membership in the parent node <italic>t</italic><sub><italic>i</italic></sub>. The absolute degree of membership for any node <italic>t</italic><sub><italic>i</italic></sub> can be calculated by repeatedly applying <xref ref-type="disp-formula" rid="EQ13">Equation 11</xref> back up to the root node. Because all points belong to the root node, <xref ref-type="disp-formula" rid="EQ14">Equation 12</xref> holds [<xref ref-type="bibr" rid="B24">24</xref>]:</p>
<disp-formula id="EQ14"><mml:math id="M20"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mtext>&#x000A0;</mml:mtext><mml:mo>&#x02200;</mml:mo><mml:mi>x</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(12)</label></disp-formula>
<p>All successful splits originating from the root node, repeated recursively, represent the sequence of connected logical tests from the root node to the terminal node <italic>t</italic><sub><italic>i</italic></sub>. This implies that the estimate (mean) at each terminal node corresponds to the outcome of the test at that terminal node. The sum of these estimates across all terminal nodes provides the overall estimate (global mean) of all decision nodes in the regression tree, which corresponds to the overall mean in a general regression model. Therefore, the following <xref ref-type="disp-formula" rid="EQ15">Equation 13</xref> must hold at every internal (splitting) node to ensure the consistency and accuracy of the model [<xref ref-type="bibr" rid="B33">33</xref>].</p>
<disp-formula id="EQ15"><mml:math id="M21"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(13)</label></disp-formula>
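<p>Equations 11 to 13 can be illustrated by propagating membership degrees from the root to the leaves. The sketch below is one possible reading, not the authors' implementation: it assumes a logistic soft test with a hypothetical <monospace>threshold</monospace> and <monospace>width</monospace> at each internal node, and a dictionary-based tree whose keys are invented for the example.</p>
<code language="python"><![CDATA[
```python
import math


def left_membership(x, threshold, width):
    """Hypothetical soft test: degree to which x satisfies 'x <= threshold'."""
    return 1.0 / (1.0 + math.exp((x - threshold) / width))


def leaf_memberships(node, x, mu=1.0, out=None):
    """Apply Equation 11 recursively, starting from mu_0(x) = 1 at the root."""
    if out is None:
        out = {}
    if "leaf" in node:
        out[node["leaf"]] = mu            # absolute membership of this leaf
        return out
    mu_left = left_membership(x, node["threshold"], node["width"])
    mu_right = 1.0 - mu_left              # Equation 13: mu_L + mu_R = 1
    leaf_memberships(node["left"], x, mu * mu_left, out)
    leaf_memberships(node["right"], x, mu * mu_right, out)
    return out


# Hypothetical two-level tree on a single explanatory variable.
tree = {
    "threshold": 20.0, "width": 2.0,
    "left": {"leaf": "t1"},
    "right": {"threshold": 30.0, "width": 2.0,
              "left": {"leaf": "t2"}, "right": {"leaf": "t3"}},
}
memberships = leaf_memberships(tree, x=20.0)
print(memberships, sum(memberships.values()))
```
]]></code>
<p>Because each split divides the parent membership into two parts that add up to it, the absolute memberships of all leaves sum to 1 for any input, which is the consistency property that Equation 13 enforces.</p>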
<p>In the regression problem, the predicted value y given by the tree for a specific input feature vector <italic>x</italic><sub><italic>test</italic></sub> is given by <xref ref-type="disp-formula" rid="EQ16">Equation 14</xref>:</p>
<disp-formula id="EQ16"><mml:math id="M22"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow></mml:munder></mml:mstyle><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mtext>&#x000A0;</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(14)</label></disp-formula>
<p>Where l indexes the terminal nodes of the tree, so that the sum runs over every terminal node <italic>t</italic><sub><italic>l</italic></sub> in the set of terminal nodes.</p>
<p><inline-formula><mml:math id="M23"><mml:msub><mml:mrow><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is given in <xref ref-type="disp-formula" rid="EQ7">Equation 7</xref> such that, by construction, only one of the membership values <inline-formula><mml:math id="M24"><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003BC;</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>;</mml:mo><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mo>&#x0007E;</mml:mo></mml:mover></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> equals 1, while the rest are zero. The error rate of the tree based on the training set is given by <xref ref-type="disp-formula" rid="EQ17">Equation 15</xref> [<xref ref-type="bibr" rid="B1">1</xref>].</p>
<disp-formula id="EQ17"><mml:math id="M25"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover></mml:mstyle><mml:msup><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mover accent="false" class="mml-overline"><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo accent="true">&#x000AF;</mml:mo></mml:mover><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:math><label>(15)</label></disp-formula>
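<p>Equations 14 and 15 translate directly into code. The following sketch uses hypothetical leaf names, leaf means and membership values; it shows the crisp case, where exactly one membership equals 1, next to a soft case with partial memberships, and then evaluates the training error for a short vector of predictions.</p>
<code language="python"><![CDATA[
```python
def predict(memberships, leaf_means):
    """Equation 14: sum over terminal nodes of mu_l(x_test) times the leaf mean."""
    return sum(memberships[leaf] * leaf_means[leaf] for leaf in leaf_means)


def training_error(y, y_hat):
    """Equation 15: mean squared error over the training set."""
    return sum((a - b) ** 2 for a, b in zip(y, y_hat)) / len(y)


# Hypothetical leaf means and membership degrees for one test point.
leaf_means = {"t1": 10.0, "t2": 20.0}
crisp = {"t1": 1.0, "t2": 0.0}    # exactly one leaf receives the point
fuzzy = {"t1": 0.25, "t2": 0.75}  # partial membership in both leaves

y_crisp = predict(crisp, leaf_means)
y_fuzzy = predict(fuzzy, leaf_means)
r_train = training_error([10.0, 20.0], [10.0, 18.0])
print(y_crisp, y_fuzzy, r_train)
```
]]></code>
<p>In the crisp case the prediction is simply the mean of the single leaf that contains the point, whereas partial memberships give a value interpolated between the leaf means.</p>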
<p>Unless a stopping criterion is specified, the decision tree continues to grow until <italic>R</italic><sub><italic>train</italic></sub>(<italic>T</italic>) &#x0003D; 0.</p>
<p>However, there exists an optimal-sized tree beyond which, despite the fact that <italic>R</italic><sub><italic>train</italic></sub>(<italic>T</italic>) continues to decrease monotonically, the predictive performance deteriorates (i.e., the true error rate or the unbiased estimation error increases). To avoid the problem of overfitting in the decision tree built from the training set and to obtain a tree of optimal size, several strategies have been proposed.</p>
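<p>The behavior of the training error can be illustrated with a minimal Python sketch of <xref ref-type="disp-formula" rid="EQ17">Equation 15</xref>. The sketch assumes a one-dimensional regression tree represented only by its sorted split points, which is a hypothetical representation chosen for illustration and not the implementation used in this study; each leaf predicts the mean response of its training observations, and the data values are invented for the example.</p>
<code language="python"><![CDATA[
```python
from bisect import bisect_right
from statistics import mean


def leaf_index(x, splits):
    """Return the index of the leaf (interval between split points) containing x."""
    return bisect_right(splits, x)


def training_error(xs, ys, splits):
    """Mean squared training error R_train(T) of a piecewise-constant tree."""
    # Group the responses by the leaf their predictor value falls into.
    leaves = {}
    for x, y in zip(xs, ys):
        leaves.setdefault(leaf_index(x, splits), []).append(y)
    # Each leaf predicts the mean response of its training observations.
    leaf_mean = {k: mean(v) for k, v in leaves.items()}
    squared = [(y - leaf_mean[leaf_index(x, splits)]) ** 2 for x, y in zip(xs, ys)]
    return sum(squared) / len(squared)


# Hypothetical toy data (not taken from the study).
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [2.0, 2.5, 7.0, 7.5, 4.0, 4.5]

# Nested trees: each list of split points refines the previous partition.
for splits in ([], [2.5], [2.5, 4.5], [1.5, 2.5, 3.5, 4.5, 5.5]):
    print(len(splits) + 1, "leaves:", round(training_error(xs, ys, splits), 4))
```
]]></code>
<p>In this toy setting, refining the partition never increases the training error, and once every observation sits in its own leaf the error is zero. This is why the training error alone cannot indicate the optimal tree size.</p>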
<p>One of the most important methods to address excessive branching in tree growth is pruning, which involves removing branches that contain insignificant splits [<xref ref-type="bibr" rid="B5">5</xref>].</p>
</sec>
<sec>
<label>8.3</label>
<title>Chi-square tests with GUIDE algorithm</title>
<p>In its implementation, the GUIDE algorithm distinguishes four types of variables (see Loh [30, p. 12]):</p>
<list list-type="simple">
<list-item><p>(1) <italic>N</italic>-variable: A numerical variable used for both estimation and node splitting.</p></list-item>
<list-item><p>(2) <italic>F</italic>-variable: A numerical variable used only for estimation and not for node splitting.</p></list-item>
<list-item><p>(3) <italic>S</italic>-variable: A numerical variable used only for node splitting and not for estimation.</p></list-item>
<list-item><p>(4) <italic>C</italic>-variable: A categorical variable used only for node splitting and not for estimation [<xref ref-type="bibr" rid="B1">1</xref>].</p></list-item>
</list>
<p>At each node, the split variable is selected using the following chi-square tests:</p>
<list list-type="simple">
<list-item><p>(i) Obtain the residuals from fitting a constant model to the <italic>Y</italic> data.</p></list-item>
<list-item><p>(ii) For each numerical-valued variable, divide the data into four groups at the sample quartiles; construct a 2 &#x000D7; 4 contingency table with the groups as columns and the signs of the residuals (positive vs. non-positive) as rows; count the number of observations in each cell and compute the &#x003C7;<sup>2</sup> statistic and its theoretical <italic>p</italic>-value from a &#x003C7;<sup>2</sup> distribution. Columns with zero column totals are omitted. We refer to this as a curvature test.</p></list-item>
<list-item><p>(iii) To detect interactions between a pair of numerical-valued variables (<italic>X</italic><sub><italic>i</italic></sub>, <italic>X</italic><sub><italic>j</italic></sub>), divide the (<italic>X</italic><sub><italic>i</italic></sub>, <italic>X</italic><sub><italic>j</italic></sub>)-space into four quadrants by splitting the range of each variable into two halves at the sample median; construct a 2 &#x000D7; 4 contingency table using the residual signs as rows and the quadrants as columns; compute the &#x003C7;<sup>2</sup> statistic and <italic>p</italic>-value. Again, columns with zero column totals are omitted. We refer to this as an interaction test.</p></list-item>
<list-item><p>(iv) For each pair of variables (<italic>X</italic><sub><italic>i</italic></sub>, <italic>X</italic><sub><italic>j</italic></sub>), where <italic>X</italic><sub><italic>i</italic></sub> is numerical-valued and <italic>X</italic><sub><italic>j</italic></sub> is categorical, divide the <italic>X</italic><sub><italic>i</italic></sub>-space into two at the sample median and the <italic>X</italic><sub><italic>j</italic></sub>-space into as many sets as the number of categories in its range [if <italic>X</italic><sub><italic>j</italic></sub> has <italic>c</italic> categories, this splits the (<italic>X</italic><sub><italic>i</italic></sub>, <italic>X</italic><sub><italic>j</italic></sub>)-space into 2<italic>c</italic> subsets]; construct a 2 &#x000D7; 2<italic>c</italic> contingency table with the subsets as columns and the signs of the residuals as rows; compute the &#x003C7;<sup>2</sup> statistic and <italic>p</italic>-value for the table after omitting columns with zero totals.</p></list-item>
</list>
<p>If the smallest <italic>p</italic>-value is from a curvature test, it is natural to select the associated <italic>X</italic> variable to split the node. If the smallest <italic>p</italic>-value is from an interaction test, we need to select one of the two interacting variables. We could choose on the basis of the curvature <italic>p</italic>-values of the two variables, but because the goal is to fit a constant model in each node, we base the choice on the reduction in the sum of squared errors (SSE) [<xref ref-type="bibr" rid="B2">2</xref>]. This study relies on the fourth variable type (a categorical variable used only for node splitting and not for estimation).</p>
</sec>
</sec>
<sec id="s9">
<label>9</label>
<title>Applied real data</title>
<p>One of the most prominent indicators of climate change is the rise in temperatures, a phenomenon that has had a clear impact in Iraq due to its geographical location between latitudes 29&#x000B0; and 37&#x000B0;N and longitudes 39&#x000B0; and 48&#x000B0;E. The country is characterized by a hot, semi-arid climate with generally moderate winds throughout the year [<xref ref-type="bibr" rid="B24">24</xref>].</p>
<p>Dhi Qar Governorate is located in southern Iraq and is therefore among the regions most affected by rising temperatures, which range between 14 and 51.1 &#x000B0;C. It also experiences relatively higher humidity than other governorates. Wind, meanwhile, is a naturally available energy source that has become increasingly important in recent years because it is classified as a clean energy source [<xref ref-type="bibr" rid="B34">34</xref>]. It is also a renewable resource that supports environmental preservation and is used in fields such as electricity generation, sailing, and transportation. In addition, wind speed is influenced by the climate change occurring in the environment [<xref ref-type="bibr" rid="B33">33</xref>].</p>
<p>Notably, the highest average wind speed was recorded in 2007 at 3.1 m/s, while in 2025, the highest average reached 14.6 m/s. Temperature and wind speed data were obtained as a sample of 425 daily observations in Dhi Qar Governorate for the period December 2024&#x02013;February 2025. <xref ref-type="table" rid="T1">Table 1</xref> shows the descriptive statistics of the sample.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Descriptive statistics for the actual data of the two variables (temperature and wind speed).</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th valign="top" align="left"><bold>Variables</bold></th>
<th valign="top" align="center"><bold>No</bold>.</th>
<th valign="top" align="center"><bold>Mean</bold></th>
<th valign="top" align="center"><bold>Min</bold></th>
<th valign="top" align="center"><bold>Max</bold></th>
<th valign="top" align="center"><bold>First quartile (<italic>q</italic><sub>1</sub>)</bold></th>
<th valign="top" align="center"><bold>Second quartile (<italic>q</italic><sub>2</sub>)</bold></th>
<th valign="top" align="center"><bold>Third quartile (<italic>q</italic><sub>3</sub>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Temperature (<italic>X</italic><sub>1</sub>)</td>
<td valign="top" align="center">425</td>
<td valign="top" align="center">32.4934</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">51.1</td>
<td valign="top" align="center">22.3</td>
<td valign="top" align="center">29.5</td>
<td valign="top" align="center">43.7</td>
</tr>
<tr>
<td valign="top" align="left">Wind speed (<italic>X</italic><sub>2</sub>)</td>
<td valign="top" align="center">425</td>
<td valign="top" align="center">7.5256</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">14.6</td>
<td valign="top" align="center">5.8</td>
<td valign="top" align="center">7.4</td>
<td valign="top" align="center">9.15</td>
</tr></tbody>
</table>
</table-wrap>
<p><xref ref-type="table" rid="T1">Table 1</xref> shows that the mean temperature was 32.4934 &#x000B0;C, with a minimum of 14 &#x000B0;C and a maximum of 51.1 &#x000B0;C over the studied period, which is relatively high. For the wind speed variable, the mean was 7.5256 m/s and the highest recorded speed was 14.6 m/s. The minimum was 0 m/s, meaning that no wind was recorded on some days; even at its minimum, this variable has a significant effect on the climate.</p>
<sec>
<label>9.1</label>
<title>Fuzzy logic description of the sample</title>
<p>This paper presents a new statistical method for measuring and analyzing climate change in a specific geographical area over a specific period of time. The method relies on data for two important variables (temperature and wind speed) in order to identify the risks of this phenomenon and to take appropriate precautions, in the near and distant future, against a natural emergency that is increasing over time. The tool of this study, the decision-tree regression method, is applied here as a new extension to the phenomenon of climate change; it relies on the two variables and uses fuzzy logic to divide the data into groups that accurately describe the climate situation.</p>
<p>Emerging phenomena must be studied and analyzed in light of the reference literature so that those interested in the field can examine them in all their aspects and relationships. We therefore employed the decision-tree approach as the primary tool for constructing a regression model for the fuzzy data of both study variables. The quantitative and fuzzy descriptions of the two variables are given in <xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T2">2</xref>.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Linguistic attributes of the fuzzy sets for the two variables (temperature and wind speed).</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th valign="top" align="left"><bold>Variables</bold></th>
<th valign="top" align="center"><inline-graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-i0001.tif"/></th>
<th valign="top" align="center"><bold>Set 1</bold></th>
<th valign="top" align="center"><bold>Set 2</bold></th>
<th valign="top" align="center"><bold>Set 3</bold></th>
<th valign="top" align="center"><bold>Set 4</bold></th>
<th valign="top" align="center"><bold>Set 5</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Temperature (<italic>X</italic><sub>1</sub>)</td>
<td valign="top" align="left" rowspan="2">(Linguistic variables in fuzzy logic)</td>
<td valign="top" align="center">Very cold</td>
<td valign="top" align="center">Cold</td>
<td valign="top" align="center">Mild</td>
<td valign="top" align="center">Hot</td>
<td valign="top" align="center">Very hot</td>
</tr>
 <tr>
<td valign="top" align="left">Wind speed (<italic>X</italic><sub>2</sub>)</td>
<td valign="top" align="center">Very slow</td>
<td valign="top" align="center">Slow</td>
<td valign="top" align="center">Mild</td>
<td valign="top" align="center">Strong</td>
<td valign="top" align="center">Very strong</td>
</tr></tbody>
</table>
</table-wrap>
<p>We used fuzzy logic to transform the quantitative, real-valued data of the two variables (temperature and wind speed), which are characterized by ambiguity and uncertainty, into categorical variables represented by clearer attributes and partitions. A transformed observation may belong to two of the groups listed in <xref ref-type="table" rid="T2">Table 2</xref> at the same time. <xref ref-type="fig" rid="F4">Figure 4</xref> illustrates the partitions (attributes) of the temperature variable described in <xref ref-type="table" rid="T2">Table 2</xref>; the corresponding triangular membership functions are shown in <xref ref-type="fig" rid="F8">Figure 8a</xref>.</p>
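<p>How a crisp temperature can belong to two groups at once may be sketched with triangular membership functions in Python. The sketch assumes five evenly spaced triangles over the observed range of 14 to 51.1 &#x000B0;C; the breakpoints actually used in the study are not stated in the text, so the peak positions and function names below are hypothetical.</p>
<code language="python"><![CDATA[
```python
def triangular(x, a, b, c):
    """Membership of x in a triangle with feet a and c and peak b (a < b < c)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def fuzzy_partition(lo, hi, labels):
    """Evenly spaced triangles whose feet sit on the neighboring peaks (an assumption)."""
    step = (hi - lo) / (len(labels) - 1)
    return {name: (lo + (i - 1) * step, lo + i * step, lo + (i + 1) * step)
            for i, name in enumerate(labels)}


def memberships(x, sets):
    """Degree of membership of x in every fuzzy set."""
    return {name: triangular(x, *abc) for name, abc in sets.items()}


labels = ["very cold", "cold", "mild", "hot", "very hot"]
temperature_sets = fuzzy_partition(14.0, 51.1, labels)  # range taken from Table 1

mu = memberships(26.0, temperature_sets)
active = {name: round(v, 3) for name, v in mu.items() if v > 0}
print(active)
```
]]></code>
<p>Under this evenly spaced layout, neighboring triangles overlap so that an interior value has non-zero membership in at most two adjacent sets, and the two degrees sum to one.</p>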
<fig position="float" id="F4">
<label>Figure 4</label>
<caption><p>Attributes of the temperature variable (very cold, cold, mild, hot, very hot).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0004.tif">
<alt-text content-type="machine-generated">Flowchart depicting temperature categories. A red rectangle labeled &#x0201C;Temperature&#x0201D; connects to five gray ovals labeled, from left to right, as &#x0201C;Very Cold,&#x0201D; &#x0201C;Cold,&#x0201D; &#x0201C;Mild,&#x0201D; &#x0201C;Hot,&#x0201D; and &#x0201C;Very Hot.&#x0201D; </alt-text>
</graphic>
</fig>
</sec>
<sec>
<label>9.2</label>
<title>Fuzzy regression tree description</title>
<p>In this section, in line with the objectives of the research, the two variables are examined separately and in detail. <xref ref-type="fig" rid="F4">Figures 4</xref>, <xref ref-type="fig" rid="F5">5</xref> present the partitions of each variable at the respective root nodes adopted in this study. <xref ref-type="fig" rid="F6">Figures 6a</xref>&#x02013;<xref ref-type="fig" rid="F6">e</xref>, <xref ref-type="fig" rid="F7">7a</xref>&#x02013;<xref ref-type="fig" rid="F7">e</xref> illustrate the hierarchical branching structure at each decision node of the decision-tree model constructed from the real data for the two variables, respectively. The trees are of the binary-division type: for the purpose of the chi-square test, the data set is divided into a negative-sign set and a positive-sign set. Each node of the tree represents a specific fuzzy partition defined by fuzzy (linguistic) rules. The membership functions transform the crisp input data into fuzzy sets, which act as splitting nodes, so that the root branches into five child nodes. Following the hierarchical structure of the tree, each of these is partitioned into subgroups based on the quartiles, and each subgroup is in turn divided into two terminal (leaf) nodes corresponding to the positive and negative error signs. Decisions are then made from the calculated test statistic using contingency tables for the chi-square test. The related tables and the following text explain this in detail.</p>
<fig position="float" id="F5">
<label>Figure 5</label>
<caption><p>Attributes of the wind speed variable (very slow, slow, mild, strong, very strong).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0005.tif">
<alt-text content-type="machine-generated">Flowchart illustrating wind speed categories. A red rectangle labeled &#x0201C;Wind Speed&#x0201D; connects to five gray ellipses representing categories: &#x0201C;Very slow,&#x0201D; &#x0201C;slow,&#x0201D; &#x0201C;mild,&#x0201D; &#x0201C;strong,&#x0201D; and &#x0201C;Very strong.&#x0201D; </alt-text>
</graphic>
</fig>
<fig position="float" id="F6">
<label>Figure 6</label>
<caption><p>Fuzzy regression tree models for the attributes of the temperature variable in the actual data set. <bold>(a)</bold> Internal nodes of the very cold category, <bold>(b)</bold> Internal nodes of the cold category, <bold>(c)</bold> Internal nodes of the mild category, <bold>(d)</bold> Internal nodes of the hot category, <bold>(e)</bold> Internal nodes of the very hot category.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0006.tif">
<alt-text content-type="machine-generated">Five decision tree diagrams categorize temperature into &#x0201C;Very Cold,&#x0201D; &#x0201C;Cold,&#x0201D; &#x0201C;Mild,&#x0201D; &#x0201C;Hot,&#x0201D; and &#x0201C;Very Hot,&#x0201D; with each category branching into conditions labeled \( r \geq q_2 \), \( q_3 \leq r &#x0003C; q_2 \), \( q_1 \leq r &#x0003C; q_3 \), and \( r &#x0003C; q_1 \). Each branch ends with numerical values in green triangles, indicating outcomes for the given conditions.</alt-text>
</graphic>
</fig>
<fig position="float" id="F7">
<label>Figure 7</label>
<caption><p>Fuzzy regression tree models for the attributes of the wind speed variable in the actual data set. <bold>(a)</bold> Internal nodes of the very slow category, <bold>(b)</bold> Internal nodes of the slow category, <bold>(c)</bold> Internal nodes of the mild category, <bold>(d)</bold> Internal nodes of the strong category, <bold>(e)</bold> Internal nodes of the very strong category.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0007.tif">
<alt-text content-type="machine-generated">Flowcharts illustrating different wind speed categories: (a) Very Slow, with values ranging from +106 to -86; (b) Slow, with values from +7 to +108; (c) Mild, with values from +106 to +108; (d) Strong, with values from +41 to -103; (e) Very Strong, ranging from +18 to +108. Each chart shows different conditions for comparison.</alt-text>
</graphic>
</fig>
<p>To apply the steps of the GUIDE algorithm discussed in Section 8.1 to the data of the climate change variables (temperature and wind speed), we construct 2 &#x000D7; 4 contingency tables for the chi-square test of whether the sign of the error between the predicted and actual values is independent of the partition group. The tables give both the observed and the expected frequencies across the four fuzzy partition groups (columns). We then calculate the chi-square test statistic and compare it with the tabulated critical value <inline-formula><mml:math id="M26"><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>0.05</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>7.8147</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> under the following hypotheses: the null hypothesis states that the levels of the error sign are independent of the levels of the fuzzy partition groups, whereas the alternative hypothesis states that they are dependent. <xref ref-type="fig" rid="F8">Figures 8a</xref>, <xref ref-type="fig" rid="F8">b</xref> describe the fuzzy partition groups.</p>
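<p>The test described above can be sketched in plain Python. The observed counts are those reported for the &#x0201C;Very cold&#x0201D; category in <xref ref-type="table" rid="T3">Table 3</xref>, and the critical value 7.8147 is the one quoted in the text; the function names are illustrative, and the code is a sketch under these assumptions rather than the implementation used in this study.</p>
<code language="python"><![CDATA[
```python
def expected_counts(observed):
    """Expected cell counts under independence: row total * column total / N."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square(observed):
    """Pearson chi-square statistic and degrees of freedom of a contingency table."""
    # Omit columns with zero totals, as in the GUIDE tests.
    cols = [col for col in zip(*observed) if sum(col) > 0]
    table = [list(row) for row in zip(*cols)]
    expected = expected_counts(table)
    stat = sum((o - e) ** 2 / e
               for o_row, e_row in zip(table, expected)
               for o, e in zip(o_row, e_row))
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Rows: error sign (+, -); columns: the four quartile groups (Table 3, "Very cold").
observed = [[22, 106, 105, 106],
            [86, 0, 0, 0]]

CRITICAL_3_005 = 7.8147  # critical value for 3 degrees of freedom at the 0.05 level
stat, dof = chi_square(observed)
print("chi-square =", round(stat, 3), "df =", dof)
print("reject independence:", stat > CRITICAL_3_005)
```
]]></code>
<p>Because the table has two rows and four non-empty columns, the statistic is referred to a chi-square distribution with 3 degrees of freedom, which matches the critical value used in the text.</p>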
<fig position="float" id="F8">
<label>Figure 8</label>
<caption><p>Triangular membership functions for the attributes of the actual data set. <bold>(a)</bold> Temperature (very cold, cold, mild, hot, very hot), <bold>(b)</bold> Wind speed (very slow, slow, mild, strong, very strong).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fams-12-1732313-g0008.tif">
<alt-text content-type="machine-generated">Two graphs show membership functions. Image (c) plots temperature with categories: very cold, cold, mild, hot, very hot on a scale from 14 to 51.1. Image (d) plots speed with categories: very slow, slow, mild, strong, very strong on a scale from 0 to 14.6. Both graphs use overlapping colored triangles to represent transitions between categories.</alt-text>
</graphic>
</fig>
<p><xref ref-type="fig" rid="F8">Figure 8a</xref> illustrates the triangular membership functions for the temperature variable, and <xref ref-type="fig" rid="F8">Figure 8b</xref> illustrates those for the wind speed variable; both variables are used to measure climate change factors during the study period in Dhi Qar Governorate. The figures show the characteristics of the fuzzy sets used to classify the temperature variable into five fuzzy groups (very cold, cold, mild, hot, and very hot) and the wind speed variable into five fuzzy groups (very slow, slow, mild, strong, and very strong). The aim is to give a clear and accurate picture of the weather conditions, replacing an ambiguous general description with a more precise one that is closer to measurement.</p>
<p><xref ref-type="fig" rid="F4">Figure 4</xref> shows the definition of the root node of the temperature variable, and <xref ref-type="fig" rid="F5">Figure 5</xref> shows that of the wind speed variable. <xref ref-type="fig" rid="F6">Figures 6a</xref>&#x02013;<xref ref-type="fig" rid="F6">e</xref> show the branches of the regression tree of the temperature variable for the fuzzy partitions presented in <xref ref-type="fig" rid="F8">Figure 8a</xref>; the calculations follow the steps of the GUIDE algorithm and rely on the analysis in <xref ref-type="table" rid="T3">Table 3</xref>. In <xref ref-type="table" rid="T3">Table 3</xref>, <italic>r</italic> denotes the frequency in each cell, observed or expected, for a given error sign and the corresponding subset, where the subsets are obtained by dividing the data at the three quartile values (<italic>q</italic><sub>1</sub> &#x0003D; 22.3, <italic>q</italic><sub>2</sub> &#x0003D; 29.5, <italic>q</italic><sub>3</sub> &#x0003D; 43.7). <xref ref-type="fig" rid="F7">Figures 7a&#x02013;e</xref> show the branches of the regression tree for the wind speed variable for the fuzzy partitions presented in <xref ref-type="fig" rid="F8">Figure 8b</xref>; the calculations again follow the steps of the GUIDE algorithm and rely on the analysis in <xref ref-type="table" rid="T4">Table 4</xref>. In <xref ref-type="table" rid="T4">Table 4</xref>, <italic>r</italic> is defined in the same way, with the subsets obtained from the three quartile values (<italic>q</italic><sub>1</sub> &#x0003D; 5.8, <italic>q</italic><sub>2</sub> &#x0003D; 7.4, <italic>q</italic><sub>3</sub> &#x0003D; 9.15).</p>
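<p>How a 2 &#x000D7; 4 table of this kind can be assembled from raw observations may be sketched as follows. The sketch assumes that the error sign is the sign of the residual from a constant (mean) fit and uses the temperature quartiles quoted above as column boundaries; the sample values are hypothetical and the helper names are illustrative.</p>
<code language="python"><![CDATA[
```python
from statistics import mean


def quartile_group(value, q1, q2, q3):
    """Column index 0..3 for r < q1, q1 <= r < q2, q2 <= r < q3, r >= q3."""
    return sum(value >= q for q in (q1, q2, q3))


def sign_table(x, y, q1, q2, q3):
    """2 x 4 table: rows = residual sign (+, non-positive), columns = quartile group of x."""
    y_bar = mean(y)  # constant model fitted to the response
    table = [[0, 0, 0, 0], [0, 0, 0, 0]]
    for xi, yi in zip(x, y):
        row = 0 if yi - y_bar > 0 else 1
        table[row][quartile_group(xi, q1, q2, q3)] += 1
    return table


# Hypothetical observations (not the study data).
x = [15.0, 21.0, 24.0, 28.0, 30.0, 40.0, 45.0, 50.0]
y = [0.1, 0.2, 0.4, 0.5, 0.6, 0.7, 0.9, 1.0]

Q1, Q2, Q3 = 22.3, 29.5, 43.7  # temperature quartiles from Table 1
table = sign_table(x, y, Q1, Q2, Q3)
print(table)
```
]]></code>
<p>The resulting table has the same layout as each observed-count block in the tables below, with the error sign in the rows and the quartile subsets in the columns.</p>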
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Observed and expected counts, by error sign and quartile group, for the fuzzy sets of temperature.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th/>
<th valign="top" align="left"><bold>Temperature</bold></th>
<th valign="top" align="center"><bold>Sign error</bold></th>
<th valign="top" align="left"><bold><italic>r</italic>&#x0003C;<italic>q</italic><sub>1</sub></bold></th>
<th valign="top" align="center"><bold><italic>q</italic><sub>1</sub> &#x02264; <italic>r</italic>&#x0003C;<italic>q</italic><sub>2</sub></bold></th>
<th valign="top" align="center"><bold><italic>q</italic><sub>2</sub> &#x02264; <italic>r</italic>&#x0003C;<italic>q</italic><sub>3</sub></bold></th>
<th valign="top" align="left"><bold><italic>r</italic>&#x02265;<italic>q</italic><sub>3</sub></bold></th>
<th valign="top" align="center"><bold>&#x02211;Count &#x0003D; 425</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Very Cold</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">22</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">105</td>
<td valign="top" align="left">106</td>
<td valign="top" align="center">339</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">86</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">86</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">86.146</td>
<td valign="top" align="center">84.551</td>
<td valign="top" align="center">83.753</td>
<td valign="top" align="left">84.551</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">21.854</td>
<td valign="top" align="center">21.449</td>
<td valign="top" align="center">21.247</td>
<td valign="top" align="left">21.449</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Cold</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">11</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">105</td>
<td valign="top" align="left">106</td>
<td valign="top" align="center">246</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">97</td>
<td valign="top" align="center">82</td>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">179</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">62.513</td>
<td valign="top" align="center">61.355</td>
<td valign="top" align="center">60.776</td>
<td valign="top" align="left">61.355</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">45.487</td>
<td valign="top" align="center">44.645</td>
<td valign="top" align="center">44.124</td>
<td valign="top" align="left">44.645</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Mild</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">108</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">29</td>
<td valign="top" align="left">106</td>
<td valign="top" align="center">274</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">75</td>
<td valign="top" align="center">76</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">151</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">69.628</td>
<td valign="top" align="center">68.339</td>
<td valign="top" align="center">67.694</td>
<td valign="top" align="left">68.339</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">38.372</td>
<td valign="top" align="center">37.661</td>
<td valign="top" align="center">37.306</td>
<td valign="top" align="left">37.661</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Hot</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">108</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">23</td>
<td valign="top" align="left">22</td>
<td valign="top" align="center">259</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">82</td>
<td valign="top" align="left">84</td>
<td valign="top" align="center">166</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">65.816</td>
<td valign="top" align="center">64.598</td>
<td valign="top" align="center">63.988</td>
<td valign="top" align="left">64.598</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">42.184</td>
<td valign="top" align="center">41.402</td>
<td valign="top" align="center">41.012</td>
<td valign="top" align="left">41.402</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Very Hot</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">108</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">105</td>
<td valign="top" align="left">9</td>
<td valign="top" align="center">328</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="left">97</td>
<td valign="top" align="center">97</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="left">83.351</td>
<td valign="top" align="center">81.807</td>
<td valign="top" align="center">81.053</td>
<td valign="top" align="left">81.807</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="left">24.649</td>
<td valign="top" align="center">24.193</td>
<td valign="top" align="center">23.965</td>
<td valign="top" align="left">24.193</td>
<td valign="top" align="center">&#x02013;</td>
</tr></tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Observed and expected counts, by error sign and quartile group, for the fuzzy sets of wind speed.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th/>
<th valign="top" align="left"><bold>Wind speed</bold></th>
<th valign="top" align="center"><bold>Sign error</bold></th>
<th valign="top" align="left"><bold><italic>r</italic>&#x0003C;<italic>q</italic><sub>1</sub></bold></th>
<th valign="top" align="center"><bold><italic>q</italic><sub>1</sub> &#x02264; <italic>r</italic>&#x0003C;<italic>q</italic><sub>2</sub></bold></th>
<th valign="top" align="center"><bold><italic>q</italic><sub>2</sub> &#x02264; <italic>r</italic>&#x0003C;<italic>q</italic><sub>3</sub></bold></th>
<th valign="top" align="center"><bold><italic>r</italic>&#x02265;<italic>q</italic><sub>3</sub></bold></th>
<th valign="top" align="center"><bold>&#x02211;Count &#x0003D; 425</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Very Slow</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">104</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">339</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">86</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">86</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">86.146</td>
<td valign="top" align="center">85.348</td>
<td valign="top" align="center">82.955</td>
<td valign="top" align="center">84.551</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">21.854</td>
<td valign="top" align="center">21.652</td>
<td valign="top" align="center">21.045</td>
<td valign="top" align="center">21.449</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Slow</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">256</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">70</td>
<td valign="top" align="center">99</td>
<td valign="top" align="center">169</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">65.054</td>
<td valign="top" align="center">64.452</td>
<td valign="top" align="center">62.645</td>
<td valign="top" align="center">63.849</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">42.946</td>
<td valign="top" align="center">42.548</td>
<td valign="top" align="center">41.355</td>
<td valign="top" align="center">42.151</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Mild</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">264</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">77</td>
<td valign="top" align="center">84</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">161</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">69.628</td>
<td valign="top" align="center">66.466</td>
<td valign="top" align="center">64.602</td>
<td valign="top" align="center">65.845</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">38.372</td>
<td valign="top" align="center">40.534</td>
<td valign="top" align="center">39.398</td>
<td valign="top" align="center">40.155</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Strong</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">104</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">256</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">103</td>
<td valign="top" align="center">66</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">169</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">65.054</td>
<td valign="top" align="center">64.452</td>
<td valign="top" align="center">62.645</td>
<td valign="top" align="center">63.849</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">42.946</td>
<td valign="top" align="center">42.548</td>
<td valign="top" align="center">41.355</td>
<td valign="top" align="center">42.151</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left" rowspan="2">Observation count</td>
<td valign="top" align="left" rowspan="4">Very Strong</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">104</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">337</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">88</td>
<td valign="top" align="center">88</td>
</tr>
 <tr>
<td valign="top" align="left" rowspan="2">Expected count</td>
<td valign="top" align="center">&#x0002B;</td>
<td valign="top" align="center">85.638</td>
<td valign="top" align="center">84.845</td>
<td valign="top" align="center">82.466</td>
<td valign="top" align="center">84.052</td>
<td valign="top" align="center">&#x02013;</td>
</tr>
 <tr>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">22.362</td>
<td valign="top" align="center">22.155</td>
<td valign="top" align="center">21.534</td>
<td valign="top" align="center">21.948</td>
<td valign="top" align="center">&#x02013;</td>
</tr></tbody>
</table>
</table-wrap>
<p><xref ref-type="table" rid="T3">Tables 3</xref>, <xref ref-type="table" rid="T4">4</xref> present the classification outcomes obtained from the fuzzy dataset. These results were validated using two independent tree-based models. <xref ref-type="fig" rid="F6">Figures 6a</xref>&#x02013;<xref ref-type="fig" rid="F6">e</xref>, <xref ref-type="fig" rid="F7">7a</xref>&#x02013;<xref ref-type="fig" rid="F7">e</xref> illustrate the underlying decision-making process, which exploits the tree structure within fuzzy regression. The process divides the sample space into quartiles in order to describe the states of each variable. For the temperature variable, the results are given in <xref ref-type="table" rid="T3">Table 3</xref>, where the second and third columns show the fuzzy group divisions and the error signal for each division. The remaining columns give the branching results for the variable according to the fuzzy categorical division of the error signal. The observed frequencies in the cells of the chi-square table can therefore be interpreted as follows. The first division, very cold weather, has a main influence and is the focus of our attention: 339 observations (80%) corresponded to descriptions of cold weather (the error sign is positive), compared with 20% for very cold weather. In the first quartile, 22 observations (20%) were classified as cold weather, while 86 observations (80%) were classified as very cold weather. In the remaining quartiles of this division, there were 106 and 105 observations, with rates of 100% for cold and 0% for very cold, respectively.
The remaining categories (cold, intermediate, and hot) can be analyzed in the same way; they correspond to moderate or mild climate divisions and are not considered to have a main influence. The fifth division, very hot, has a main influence and is the focus of our attention. We recorded 328 observations (77% of the total), compared with 23% that described the weather as very hot. In the fourth quartile, 9 observations (8.5%) described the weather as hot, while 97 observations (91.5%) described it as very hot. For <xref ref-type="table" rid="T3">Table 3</xref>, the calculated chi-square value for the very cold set of the temperature variable was 316.4632. The corresponding tabular chi-square value was <inline-formula><mml:math id="M27"><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>0.05</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>7.8147</mml:mn></mml:math></inline-formula>. Because the calculated value is greater than the tabular value, the null hypothesis is rejected and the alternative hypothesis, that the levels of the residual signal are not independent of the partial sums, is accepted. This means that the first fuzzy set for the temperature variable affects climate change.
Similarly, the calculated chi-square value for the very hot set of the temperature variable was 378.2436. The corresponding tabular chi-square value was <inline-formula><mml:math id="M28"><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>0.05</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>7.8147</mml:mn></mml:math></inline-formula>. Because the calculated value is greater than the tabular value, the null hypothesis is rejected and the alternative hypothesis, that the levels of the residual signal are not independent of the partial sums, is accepted. This means that the fifth fuzzy set for the temperature variable affects climate change.</p>
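<p>The comparison with the tabular value can be checked numerically. The following is a minimal Python sketch, not taken from the study, which assumes 3 degrees of freedom and a 0.05 significance level as quoted above. The helper name <monospace>chi2_sf_df3</monospace> is hypothetical, and the closed form it uses for the upper-tail probability holds only for 3 degrees of freedom.</p>

```python
import math


def chi2_sf_df3(x: float) -> float:
    """Upper-tail probability P(X > x) for a chi-square variable with 3 degrees of freedom."""
    # Closed form valid only for df = 3 (an assumption of this sketch):
    # SF(x) = erfc(sqrt(x / 2)) + sqrt(2 * x / pi) * exp(-x / 2)
    return math.erfc(math.sqrt(x / 2.0)) + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0)


# Tabular value and calculated statistics as quoted in the text for Table 3.
critical_value = 7.8147
calculated = {"very cold": 316.4632, "very hot": 378.2436}

# Tail probability at the tabular value; it should sit near the 0.05 level.
tail_at_critical = chi2_sf_df3(critical_value)
print(f"P(X > {critical_value}) = {tail_at_critical:.4f}")

# Decision rule: reject the null hypothesis when the statistic exceeds the tabular value.
decisions = {label: stat > critical_value for label, stat in calculated.items()}
for label, reject in decisions.items():
    print(f"{label}: statistic = {calculated[label]}, reject H0 = {reject}")
```

<p>A printed tail probability close to 0.05 indicates that the quoted tabular value is consistent with the stated degrees of freedom and significance level.</p>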
<p>We conclude from <xref ref-type="table" rid="T3">Table 3</xref> that the fuzzy classifications described temperatures more accurately when the fuzzy regression tree structure was based on the selected categorical classifications provided by fuzzy logic. This improves the accuracy of the results shown in <xref ref-type="table" rid="T3">Table 3</xref> and gives a clear picture both to members of the general public in Dhi-Qar Governorate who are interested in weather conditions and to researchers studying climate change. It also enables decision-makers to take immediate and future action to mitigate and prevent temperature increases, and to make use of temperatures in electricity generation by developing alternative plans and environmentally friendly policies.</p>
<p>For the wind speed variable in <xref ref-type="table" rid="T4">Table 4</xref>, the second and third columns show the fuzzy group classifications and the error signal for each classification. The remaining columns give the branching results for the variable according to the fuzzy categorical division of the error signal. The observed frequencies in the cells of the chi-square table can therefore be interpreted as follows. In the sample, 256 observations (60%) described the weather as having strong winds, compared with 40% that described it as stormy. In the first and second quartiles, 46 observations (21%) described the weather as having strong winds, while 169 observations (79%) described it as stormy. In addition, 337 observations (79%) described the weather as stormy, compared with 21% that described it as very stormy. In the fourth quartile, 18 observations (17%) described the weather as stormy, while 88 observations (83%) described it as very stormy.
For <xref ref-type="table" rid="T4">Table 4</xref>, the calculated chi-square value for the strong set of the wind speed variable was 302.1632. The corresponding tabular chi-square value was <inline-formula><mml:math id="M29"><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>0.05</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>7.8147</mml:mn></mml:math></inline-formula>. Because the calculated value is greater than the tabular value, the null hypothesis is rejected and the alternative hypothesis, that the levels of the residual signal are not independent of the partial sums, is accepted. This means that the fourth fuzzy set for the wind speed variable affects climate change. Similarly, the calculated chi-square value for the very strong set of the wind speed variable was 333.9847. The corresponding tabular chi-square value was <inline-formula><mml:math id="M30"><mml:msubsup><mml:mrow><mml:mi>&#x003C7;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>0.05</mml:mn></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>7.8147</mml:mn></mml:math></inline-formula>. Because the calculated value is again greater than the tabular value, the null hypothesis is rejected and the alternative hypothesis is accepted. This means that the fifth fuzzy set for the wind speed variable affects climate change.</p>
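<p>The chi-square comparison for the wind speed variable can be illustrated with the observed counts of the Strong set in <xref ref-type="table" rid="T4">Table 4</xref>. The sketch below is illustrative Python and assumes that the statistic is the usual Pearson chi-square for a 2 x 4 contingency table. The function names are hypothetical, and because the exact computation used in the study is not shown, the printed statistic need not coincide with the reported value of 302.1632.</p>

```python
def expected_counts(observed):
    """Expected cell counts under independence: row_total * column_total / grand_total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square_statistic(observed):
    """Pearson chi-square statistic: sum over all cells of (O - E)^2 / E."""
    expected = expected_counts(observed)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )


# Observed counts for the Strong wind-speed set as listed in Table 4:
# rows are the error sign (+, -), columns are the four quartile groups.
strong_observed = [
    [5, 41, 104, 106],
    [103, 66, 0, 0],
]

expected = expected_counts(strong_observed)
statistic = chi_square_statistic(strong_observed)

# Degrees of freedom for an r x c table: (r - 1) * (c - 1).
degrees_of_freedom = (len(strong_observed) - 1) * (len(strong_observed[0]) - 1)

print([round(e, 3) for e in expected[0]])
print([round(e, 3) for e in expected[1]])
print(f"chi-square = {statistic:.4f}, df = {degrees_of_freedom}")
```

<p>The expected counts printed by the sketch can be compared with the expected-count rows of the table, and the statistic can be compared with the tabular value of 7.8147 for 3 degrees of freedom.</p>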
<p>We conclude from <xref ref-type="table" rid="T4">Table 4</xref> that the structure of the fuzzy regression tree, based on the selected categorical classifications provided by fuzzy logic, played a significant role in monitoring weather. The results presented in <xref ref-type="table" rid="T4">Table 4</xref> provide information for both the general public in Dhi-Qar Governorate who are interested in weather and researchers focused on climate change and alternative energy. These findings can assist decision-makers in utilizing wind speed as a source of clean and renewable energy. Wind energy can be harnessed across various fields, one of the most important being the generation of electricity in an environmentally friendly manner.</p></sec>
</sec>
<sec id="s10">
<label>10</label>
<title>Conclusions</title>
<p>The GUIDE decision tree algorithm has been widely applied in the literature to analyze crisp datasets. The main contribution of the present study is the extension of its application to the framework of fuzzy logic theory, which clearly differentiates this work from existing studies. We conclude that fuzzy classifications were more accurate in describing temperature and wind speed variables when using the fuzzy regression tree structure based on the selected categorical classifications provided by fuzzy logic. In addition, these variables played a significant role in monitoring weather conditions. In other words, fuzzy logic offers a more accurate description and provides a clearer picture of the significant and rapid changes in climate and their impact on the environment. It has become evident that regression tree tools are among the most efficient and accurate methods for quantitative analysis, aiding in making correct and precise decisions. There is clear evidence of climate change caused by the combined influence of rising temperatures and wind speed in the southern regions of Iraq, particularly in Dhi-Qar Governorate.</p></sec>
</body>
<back>
<sec sec-type="data-availability" id="s11">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="author-contributions" id="s12">
<title>Author contributions</title>
<p>AH: Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing. HH: Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing. AA-S: Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s14">
<title>Generative AI statement</title>
<p>The author(s) declared that generative AI was not used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p></sec>
<sec sec-type="disclaimer" id="s15">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Su&#x000E1;rez</surname> <given-names>A</given-names></name> <name><surname>Lutsko</surname> <given-names>JF</given-names></name></person-group>. <article-title>Globally optimal fuzzy decision trees for classification and regression</article-title>. <source>IEEE Trans Pattern Anal Mach Intell.</source> (<year>1999</year>) <volume>21</volume>:<fpage>1297</fpage>&#x02013;<lpage>311</lpage>. doi: <pub-id pub-id-type="doi">10.1109/34.817409</pub-id></mixed-citation>
</ref>
<ref id="B2">
<label>2.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Loh</surname> <given-names>WY</given-names></name></person-group>. <article-title>Classification and regression trees</article-title>. <source>Wiley Interdiscip Rev Data Mining Knowl Discov.</source> (<year>2011</year>) <volume>1</volume>:<fpage>14</fpage>&#x02013;<lpage>23</lpage>. doi: <pub-id pub-id-type="doi">10.1002/widm.8</pub-id></mixed-citation>
</ref>
<ref id="B3">
<label>3.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Segatori</surname> <given-names>A</given-names></name> <name><surname>Marcelloni</surname> <given-names>F</given-names></name> <name><surname>Pedrycz</surname> <given-names>W</given-names></name></person-group>. <article-title>On distributed fuzzy decision trees for big data</article-title>. <source>IEEE Transac Fuzzy Syst.</source> (<year>2017</year>) <volume>26</volume>:<fpage>174</fpage>&#x02013;<lpage>92</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TFUZZ.2016.2646746</pub-id></mixed-citation>
</ref>
<ref id="B4">
<label>4.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mohammadiun</surname> <given-names>S</given-names></name> <name><surname>Hu</surname> <given-names>G</given-names></name> <name><surname>Gharahbagh</surname> <given-names>AA</given-names></name> <name><surname>Mirshahi</surname> <given-names>R</given-names></name> <name><surname>Li</surname> <given-names>J</given-names></name> <name><surname>Hewage</surname> <given-names>K</given-names></name> <etal/></person-group>. <article-title>Optimization of integrated fuzzy decision tree and regression models for selection of oil spill response method in the Arctic</article-title>. <source>Knowl Based Syst.</source> (<year>2021</year>) <volume>213</volume>:<fpage>106676</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.knosys.2020.106676</pub-id></mixed-citation>
</ref>
<ref id="B5">
<label>5.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nikolaidis</surname> <given-names>P</given-names></name></person-group>. <article-title>Wind power forecasting in distribution networks using non-parametric models and regression trees</article-title>. <source>Discover Energy.</source> (<year>2022</year>) <volume>2</volume>:<fpage>6</fpage>. doi: <pub-id pub-id-type="doi">10.1007/s43937-022-00011-z</pub-id></mixed-citation>
</ref>
<ref id="B6">
<label>6.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Zimmermann</surname> <given-names>HJ</given-names></name></person-group>. <source>Fuzzy Set Theory&#x02014;and Its Applications</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer Science and Business Media</publisher-name> (<year>2011</year>).</mixed-citation>
</ref>
<ref id="B7">
<label>7.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Habeeb</surname> <given-names>AS</given-names></name></person-group>. <article-title>Estimating the parameters of the odd Lomax exponential distribution</article-title>. <source>Stat Optim Inform Comput.</source> (<year>2025</year>) <volume>13</volume>:<fpage>694</fpage>&#x02013;<lpage>715</lpage>. doi: <pub-id pub-id-type="doi">10.19139/soic-2310-5070-2121</pub-id></mixed-citation>
</ref>
<ref id="B8">
<label>8.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Spolaor</surname> <given-names>S</given-names></name></person-group>. <source>Fuzzy Logic for the Modeling and Simulation of Complex Systems</source>. <publisher-loc>Milan</publisher-loc>: <publisher-name>University of Milano-Bicocca</publisher-name> (<year>2020</year>).</mixed-citation>
</ref>
<ref id="B9">
<label>9.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Buckley</surname> <given-names>JJ</given-names></name> <name><surname>Eslami</surname> <given-names>E</given-names></name></person-group>. <source>An Introduction to Fuzzy Logic and Fuzzy Sets, Vol. 13</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer Science and Business Media</publisher-name> (<year>2002</year>). doi: <pub-id pub-id-type="doi">10.1007/978-3-7908-1799-7</pub-id></mixed-citation>
</ref>
<ref id="B10">
<label>10.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasan</surname> <given-names>HA</given-names></name> <name><surname>Mohammad</surname> <given-names>MJ</given-names></name></person-group>. <article-title>Classify the nutritional status of Iraqi children under five years using fuzzy classification</article-title>. <source>Sumer J. Pure Sci.</source> (<year>2024</year>) <volume>29</volume>:<fpage>161</fpage>&#x02013;<lpage>71</lpage>. doi: <pub-id pub-id-type="doi">10.33095/jeas.v29i138.3046</pub-id></mixed-citation>
</ref>
<ref id="B11">
<label>11.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Hellmann</surname> <given-names>M</given-names></name></person-group>. <source>Fuzzy Logic Introduction</source>. <publisher-loc>Rennes</publisher-loc>: <publisher-name>Universit&#x000E9; de Rennes</publisher-name> (<year>2001</year>), 1.</mixed-citation>
</ref>
<ref id="B12">
<label>12.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>H</given-names></name> <name><surname>Liu</surname> <given-names>D</given-names></name></person-group>. <source>Fuzzy Modeling and Fuzzy Control</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer Science and Business Media</publisher-name> (<year>2006</year>).</mixed-citation>
</ref>
<ref id="B13">
<label>13.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Bojadziev</surname> <given-names>G</given-names></name> <name><surname>Bojadziev</surname> <given-names>M</given-names></name></person-group>. <source>Fuzzy Logic for Business, Finance, and Management (2nd Ed.).</source> <publisher-loc>Singapore</publisher-loc>: <publisher-name>World Scientific Publishing Co. Pte. Ltd.</publisher-name> (<year>2007</year>).</mixed-citation>
</ref>
<ref id="B14">
<label>14.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Hooda</surname> <given-names>DS</given-names></name> <name><surname>Raich</surname> <given-names>V</given-names></name></person-group>. <source>Fuzzy Logic Models and Fuzzy Control: An Introduction.</source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Alpha Science International Ltd.</publisher-name> (<year>2017</year>).</mixed-citation>
</ref>
<ref id="B15">
<label>15.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasan</surname> <given-names>HA</given-names></name> <name><surname>Mohammad</surname> <given-names>MJ</given-names></name></person-group>. <article-title>Classification of Iraqi children according to their nutritional status using fuzzy logic</article-title>. <source>J. Econ. Administr. Sci.</source> (<year>2023</year>) <volume>29</volume>:<fpage>161</fpage>&#x02013;<lpage>71</lpage>. doi: <pub-id pub-id-type="doi">10.33095/jeas.v29i138.3046</pub-id></mixed-citation>
</ref>
<ref id="B16">
<label>16.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Ross</surname> <given-names>TJ</given-names></name></person-group>. <source>Fuzzy Logic with Engineering Applications</source>. <publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>John Wiley and Sons</publisher-name> (<year>2009</year>).</mixed-citation>
</ref>
<ref id="B17">
<label>17.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Suganthi</surname> <given-names>L</given-names></name> <name><surname>Iniyan</surname> <given-names>S</given-names></name> <name><surname>Samuel</surname> <given-names>AA</given-names></name></person-group>. <article-title>Applications of fuzzy logic in renewable energy systems&#x02014;a review</article-title>. <source>Renew Sustain Energy Rev.</source> (<year>2015</year>) <volume>48</volume>:<fpage>585</fpage>&#x02013;<lpage>607</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.rser.2015.04.037</pub-id></mixed-citation>
</ref>
<ref id="B18">
<label>18.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Zimmermann</surname> <given-names>HJ</given-names></name> <name><surname>Zadeh</surname> <given-names>LA</given-names></name> <name><surname>Gaines</surname> <given-names>BR</given-names></name></person-group>. <article-title>Fuzzy sets decision analysis</article-title>. <source>Fuzzy Sets Syst.</source> (<year>1985</year>) <volume>1</volume>:<fpage>45</fpage>&#x02013;<lpage>65</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0165-0114(78)90031-3</pub-id></mixed-citation>
</ref>
<ref id="B19">
<label>19.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Mann</surname> <given-names>PS</given-names></name></person-group>. <source>Introductory Statistics</source>. <publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>John Wiley and Sons</publisher-name> (<year>2010</year>).</mixed-citation>
</ref>
<ref id="B20">
<label>20.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Alavala</surname> <given-names>CR</given-names></name></person-group>. <source>Fuzzy Logic and Neural Networks: Basic Concepts and Applications</source>. <publisher-loc>Bengaluru</publisher-loc>: <publisher-name>New Age International Publisher</publisher-name> (<year>2008</year>).</mixed-citation>
</ref>
<ref id="B21">
<label>21.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nowakov&#x000E1;</surname> <given-names>J</given-names></name> <name><surname>Pokorn&#x000FD;</surname> <given-names>M</given-names></name></person-group>. <article-title>Fuzzy linear regression analysis</article-title>. <source>IFAC Proc.</source> (<year>2013</year>) <volume>46</volume>:<fpage>245</fpage>&#x02013;<lpage>9</lpage>. doi: <pub-id pub-id-type="doi">10.3182/20130925-3-CZ-3023.00079</pub-id></mixed-citation>
</ref>
<ref id="B22">
<label>22.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shapiro</surname> <given-names>AF</given-names></name></person-group>. <article-title>Fuzzy regression models</article-title>. <source>J Optimiz Theor Appl.</source> (<year>2005</year>) <volume>102</volume>:<fpage>373</fpage>&#x02013;<lpage>83</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1021706631165</pub-id></mixed-citation>
</ref>
<ref id="B23">
<label>23.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hasan</surname> <given-names>HA</given-names></name> <name><surname>Mohammad</surname> <given-names>MJ</given-names></name></person-group>. <article-title>Classification of Iraqi children according to nutritional status using fuzzy decision tree</article-title>. <source>J Al-Rafidain Univ Coll Sci.</source> (<year>2025</year>) <volume>56</volume>:<fpage>468</fpage>&#x02013;<lpage>80</lpage>. doi: <pub-id pub-id-type="doi">10.55562/jrucs.v56i1.42</pub-id></mixed-citation>
</ref>
<ref id="B24">
<label>24.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Magesh</surname> <given-names>T</given-names></name> <name><surname>Thiyagesan</surname> <given-names>M</given-names></name></person-group>. <article-title>Machine learning-driven wind energy forecasting for sustainable development</article-title>. <source>MATEC Web Conf.</source> (<year>2024</year>) <volume>393</volume>:<fpage>02003</fpage>. doi: <pub-id pub-id-type="doi">10.1051/matecconf/202439302003</pub-id></mixed-citation>
</ref>
<ref id="B25">
<label>25.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Marsala</surname> <given-names>C</given-names></name></person-group>. <article-title>Fuzzy decision trees for dynamic data</article-title>. In: <source>2013 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS)</source> (<year>2013</year>). p. <fpage>17</fpage>&#x02013;<lpage>24</lpage>.</mixed-citation>
</ref>
<ref id="B26">
<label>26.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yuanyuan</surname> <given-names>Z</given-names></name></person-group>. <article-title>MOOC teaching model of basic education based on fuzzy decision tree algorithm</article-title>. <source>Comput Intell Neurosci.</source> (<year>2022</year>) <volume>2022</volume>:<fpage>3175028</fpage>. doi: <pub-id pub-id-type="doi">10.1155/2022/3175028</pub-id><pub-id pub-id-type="pmid">35720933</pub-id></mixed-citation>
</ref>
<ref id="B27">
<label>27.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Loh</surname> <given-names>WY</given-names></name></person-group>. <article-title>Improving the precision of classification trees</article-title>. <source>Ann Appl Stat.</source> (<year>2009</year>) <volume>3</volume>:<fpage>1710</fpage>&#x02013;<lpage>37</lpage>. doi: <pub-id pub-id-type="doi">10.1214/09-AOAS260</pub-id></mixed-citation>
</ref>
<ref id="B28">
<label>28.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>H</given-names></name> <name><surname>Lu</surname> <given-names>J</given-names></name> <name><surname>Zhang</surname> <given-names>G</given-names></name></person-group>. <article-title>Learning a fuzzy decision tree from uncertain data</article-title>. In: <source>2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)</source> (<year>2017</year>). p. <fpage>1</fpage>&#x02013;<lpage>7</lpage>.</mixed-citation>
</ref>
<ref id="B29">
<label>29.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>Y</given-names></name></person-group>. <article-title>A comparison of neural net classifiers and linear tree classifiers: their similarities and differences</article-title>. <source>Pattern Recognit.</source> (<year>1994</year>) <volume>27</volume>:<fpage>1493</fpage>&#x02013;<lpage>503</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0031-3203(94)90127-9</pub-id></mixed-citation>
</ref>
<ref id="B30">
<label>30.</label>
<mixed-citation publication-type="web"><person-group person-group-type="author"><name><surname>Loh</surname> <given-names>WY</given-names></name></person-group>. <article-title>Regression trees with unbiased variable selection and interaction detection</article-title>. <source>Stat Sin.</source> (<year>2002</year>) <volume>12</volume>:<fpage>361</fpage>&#x02013;<lpage>86</lpage>. <ext-link ext-link-type="uri" xlink:href="https://www.jstor.org/stable/24306967">https://www.jstor.org/stable/24306967</ext-link></mixed-citation>
</ref>
<ref id="B31">
<label>31.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fathima</surname> <given-names>TH</given-names></name> <name><surname>Kovoor</surname> <given-names>BC</given-names></name> <name><surname>Ku</surname> <given-names>J</given-names></name></person-group>. <article-title>Big data classification based on distributed fuzzy decision trees</article-title>. In: <source>Proceedings of ICAEEC-2019, IIIT Allahabad India, 31st May-1st June, 2019</source> (<year>2019</year>). doi: <pub-id pub-id-type="doi">10.2139/ssrn.3576492</pub-id></mixed-citation>
</ref>
<ref id="B32">
<label>32.</label>
<mixed-citation publication-type="book"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>CH</given-names></name> <name><surname>H&#x000E4;rdle</surname> <given-names>W</given-names></name> <name><surname>Unwin</surname> <given-names>A</given-names></name> <name><surname>Loh</surname> <given-names>WY</given-names></name></person-group>. <article-title>Regression by parts: fitting visually interpretable models with GUIDE</article-title>. In: <source>Handbook of Data Visualization</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2008</year>). p. <fpage>447</fpage>&#x02013;<lpage>69</lpage>.</mixed-citation>
</ref>
<ref id="B33">
<label>33.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kassim</surname> <given-names>NM</given-names></name> <name><surname>Santhiran</surname> <given-names>S</given-names></name> <name><surname>Alkahtani</surname> <given-names>AA</given-names></name> <name><surname>Islam</surname> <given-names>MA</given-names></name> <name><surname>Tiong</surname> <given-names>SK</given-names></name> <name><surname>Mohd Yusof</surname> <given-names>MY</given-names></name> <etal/></person-group>. <article-title>An adaptive decision tree regression modeling for the output power of large-scale solar (LSS) farm forecasting</article-title>. <source>Sustainability</source>. (<year>2023</year>) <volume>15</volume>:<fpage>13521</fpage>. doi: <pub-id pub-id-type="doi">10.3390/su151813521</pub-id></mixed-citation>
</ref>
<ref id="B34">
<label>34.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Y&#x000FC;rek</surname> <given-names>&#x000D6;</given-names></name> <name><surname>Birant</surname> <given-names>D</given-names></name> <name><surname>Y&#x000FC;rek</surname> <given-names>I</given-names></name></person-group>. <article-title>Wind power generation prediction using machine learning algorithms</article-title>. <source>Dokuz Eyl&#x000FC;l &#x000DC;niversitesi M&#x000FC;hendislik Fak&#x000FC;ltesi Fen ve M&#x000FC;hendislik Dergisi.</source> (<year>2021</year>) <volume>23</volume>:<fpage>107</fpage>&#x02013;<lpage>19</lpage>. doi: <pub-id pub-id-type="doi">10.21205/deufmd.2021236709</pub-id></mixed-citation>
</ref>
<ref id="B35">
<label>35.</label>
<mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ross</surname> <given-names>S</given-names></name></person-group>. <source>Probability and Statistics for Engineers and Scientists</source>. <publisher-loc>New Delhi</publisher-loc>: <publisher-name>Elsevier</publisher-name> (<year>2010</year>).</mixed-citation>
</ref>
</ref-list>
<fn-group>
<fn fn-type="custom" custom-type="edited-by" id="fn0001">
<p>Edited by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/453438/overview">Firdous A. Shah</ext-link>, University of Kashmir, India</p>
</fn>
<fn fn-type="custom" custom-type="reviewed-by" id="fn0002">
<p>Reviewed by: <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1169351/overview">Mohd Tahir Ismail</ext-link>, University of Science Malaysia (USM), Malaysia</p>
<p><ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/3269741/overview">Hamdi Akhsan</ext-link>, Sriwijaya University, Indonesia</p>
</fn>
</fn-group>
</back>
</article>