<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Built Environ.</journal-id>
<journal-title>Frontiers in Built Environment</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Built Environ.</abbrev-journal-title>
<issn pub-type="epub">2297-3362</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1207193</article-id>
<article-id pub-id-type="doi">10.3389/fbuil.2023.1207193</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Built Environment</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Classification prediction model of indoor PM<sub>2.5</sub> concentration using CatBoost algorithm</article-title>
<alt-title alt-title-type="left-running-head">Guo et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fbuil.2023.1207193">10.3389/fbuil.2023.1207193</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Guo</surname>
<given-names>Zhenwei</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Wang</surname>
<given-names>Xinyu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2281598/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ge</surname>
<given-names>Liang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Chinese Society for Urban Studies</institution>, <addr-line>Beijing</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>National Engineering Research Center of Building Technology</institution>, <addr-line>Beijing</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1925922/overview">Joanna Ferdyn-Grygierek</ext-link>, Silesian University of Technology, Poland</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/90621/overview">Mohamed F. Yassin</ext-link>, Kuwait Institute for Scientific Research, Kuwait</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2360454/overview">Xuanye Cai</ext-link>, Central University of Finance and Economics, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Xinyu Wang, <email>wangxinyu356533529@126.com</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>31</day>
<month>07</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>9</volume>
<elocation-id>1207193</elocation-id>
<history>
<date date-type="received">
<day>17</day>
<month>04</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>07</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Guo, Wang and Ge.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Guo, Wang and Ge</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>It is increasingly important to create a healthier indoor environment for office buildings. Accurate and reliable prediction of PM<sub>2.5</sub> concentration can effectively alleviate the delay problem of indoor air quality control system. The rapid development of machine learning has provided a research basis for the indoor air quality system to control the PM<sub>2.5</sub> concentration. One approach is to introduce the CatBoost algorithm based on rank lifting training into the classification and prediction of indoor PM<sub>2.5</sub> concentration. Using actual monitoring data from office building, we consider previous indoor PM<sub>2.5</sub> concentration, indoor temperature, relative humidity, CO<sub>2</sub> concentration, and illumination as input variables, with the output indicating whether indoor PM<sub>2.5</sub> concentration exceeds 25&#xa0;&#x3bc;g/m<sup>3</sup>. Based on the CatBoost algorithm, we construct an intelligent classification prediction model for indoor PM<sub>2.5</sub> concentration. The model is evaluated using actual data and compared with the multilayer perceptron (MLP), gradientboosting decision tree (GBDT), logistic regression (LR), decision tree (DT), and k-nearest neighbors (KNN) models. The CatBoost algorithm demonstrates outstanding predictive performance, achieving an impressive area under the ROC curve (AUC) of 0.949 after hyperparameters optimition. Furthermore, when considering the five input variables, the feature importance is ranked as follows: previous indoor PM<sub>2.5</sub> concentration, relative humidity, CO<sub>2</sub>, indoor temperature, and illuminance. Through verification, the prediction model based on CatBoost algorithm can accurately predict the indoor PM<sub>2.5</sub> concentration level. The model can be used to predict whether the indoor concentration of PM<sub>2.5</sub> exceeds the standard in advance and guide the air quality control system to regulate.</p>
</abstract>
<kwd-group>
<kwd>indoor environment</kwd>
<kwd>PM<sub>2.5</sub> limit</kwd>
<kwd>CatBoost model</kwd>
<kwd>classification prediction</kwd>
<kwd>machine learning</kwd>
</kwd-group>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Indoor Environment</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<sec id="s1-1">
<title>1.1 Motivation</title>
<p>More than 80% of people&#x2019;s time is spent indoors, and in recent years, with the prevalence of respiratory infectious diseases and increasingly severe outdoor environmental pollution, this proportion will continue to rise. Office buildings are the main places where people work, and existing research shows that indoor air quality (IAQ) directly affects the physical and mental health of indoor personnel. High-quality indoor environmental quality can promote work efficiency and health (<xref ref-type="bibr" rid="B18">Fisk, 2000</xref>; <xref ref-type="bibr" rid="B51">Newsham et al., 2008</xref>; <xref ref-type="bibr" rid="B60">Thayer Julian et al., 2010</xref>; <xref ref-type="bibr" rid="B24">Horr et al., 2016</xref>). In the era of the COVID-19 pandemic, compared to outdoor, commercial, and hotel buildings, the requirements for wearing masks indoors in office buildings are relatively low. Therefore, IAQ in office buildings is extremely important.</p>
<p>PM<sub>2.5</sub> is particle size &#x2264; 2.5&#xa0;&#x3bc;m fine particles, which are composed of many chemical components, are easy to absorb harmful substances, and are the main pollutants affecting the IAQ. Particulate pollution mainly harms the respiratory system of human body, can cause tracheitis, pneumonia and fever, and may also cause eye and nasal allergy, or even death (<xref ref-type="bibr" rid="B6">BellMichelle and Davis, 2001</xref>; <xref ref-type="bibr" rid="B28">Jacobs et al., 2018</xref>; <xref ref-type="bibr" rid="B50">Mo, 2019</xref>). As the largest developing country in the world and one of the largest anthropogenic air pollution emitters in the world, China is currently facing serious air pollution problems, mainly PM<sub>2.5</sub>. The number of PM<sub>2.5</sub>-related deaths in China in 2017 was 1.8 million, an increase of 30% over 2005 (<xref ref-type="bibr" rid="B49">Ming et al., 2021</xref>). The hazard of indoor PM<sub>2.5</sub> has gradually attracted people&#x2019;s attention, and it has been included in the Standard for Health Building Assessment as a control item (<xref ref-type="bibr" rid="B21">GB, 2002</xref>). Research has revealed that indoor PM<sub>2.5</sub> concentrations occasionally surpass 75&#xa0;&#x3bc;g/m<sup>3</sup> on a daily basis, exceeding the guidelines set by the World Health Organization for IAQ (<xref ref-type="bibr" rid="B74">Zhao et al., 2015</xref>; <xref ref-type="bibr" rid="B17">Fan et al., 2018</xref>; <xref ref-type="bibr" rid="B66">XueWangLiu and Dong, 2020</xref>). To foster a healthy indoor environment, certain indoor office spaces have adopted the use of IAQ improvement devices to ensure the provision of clean air. Fresh air system is the main method for HVAC to control indoor air pollution, which can significantly reduce the concentration of PM<sub>2.5</sub> (<xref ref-type="bibr" rid="B75">ZhaoLiuRen, 2018</xref>; <xref ref-type="bibr" rid="B25">Huang et al., 2020</xref>). While the fresh air systems effectively lower indoor PM<sub>2.5</sub> concentrations to satisfactory levels, a notable concern is that the majority of these systems do not adapt their operation in response to the IAQ conditions, often remaining fully operational at all times (<xref ref-type="bibr" rid="B37">Lai et al., 2018</xref>). This lack of responsiveness is both unnecessary and energy-inefficient.</p>
<p>The application of artificial intelligence (AI) and machine learning (ML) has emerged as a promising solution to tackle the aforementioned challenges. With the gradual maturity of computer technology and automatic control theory, intelligent control technology based on ML has been widely used in the field of HVAC regulation. By combining intelligent predictive models with IAQ control devices, it becomes possible to accurately forecast PM<sub>2.5</sub> concentrations and translate them into control signals, thereby guiding the regulation process. This integration not only facilitates the creation of a healthy indoor environment but also minimizes unnecessary energy consumption within the systems.</p>
</sec>
<sec id="s1-2">
<title>1.2 The application of ML in IAQ prediction</title>
<p>Over the past decade, remarkable advancements have been made in applying AI/ML and Internet of Things (IoT) technologies to monitor and predict the physical environment of buildings. Specifically, ML prediction models for IAQ have been established based on historical data collected from sensors. Various methods have been employed to enhance the accuracy of IAQ prediction, including multivariate linear regression (MLR) (<xref ref-type="bibr" rid="B48">MengSpectorColome and Turpin, 2009</xref>; <xref ref-type="bibr" rid="B46">Martin and &#x160;afr&#xe1;nek, 2011</xref>; <xref ref-type="bibr" rid="B45">Maher Nor et al., 2015</xref>), artificial neural networks (ANN) (<xref ref-type="bibr" rid="B57">Sofuoglu, 2008</xref>; <xref ref-type="bibr" rid="B63">Xie et al., 2009</xref>; <xref ref-type="bibr" rid="B56">Sk&#xf6;n et al., 2012</xref>), random forests (RF) (<xref ref-type="bibr" rid="B35">Kropat et al., 2015</xref>; <xref ref-type="bibr" rid="B70">Yuchi, 2017</xref>; <xref ref-type="bibr" rid="B71">Yuchi et al., 2019</xref>), partial least squares (PLS) (<xref ref-type="bibr" rid="B34">KimSankararaoKang et al., 2012</xref>; <xref ref-type="bibr" rid="B41">Lim et al., 2012</xref>; <xref ref-type="bibr" rid="B39">LeeKimKim and Yoo, 2015</xref>), decision trees (DT) (<xref ref-type="bibr" rid="B35">Kropat et al., 2015</xref>; <xref ref-type="bibr" rid="B10">Choi et al., 2017</xref>; <xref ref-type="bibr" rid="B71">Yuchi et al., 2019</xref>), among others. Additionally, less commonly used ML models have been explored for IAQ prediction. For instance, <xref ref-type="bibr" rid="B30">Justin et al. (2023)</xref> developed a Long Short-Term Memory prediction model using physical data observed by IAQ sensors, achieving an accuracy of approximately 60%&#x2013;80% in determining real-time and near-term concentrations of indoor bioaerosols and PM, surpassing regression models with an accuracy of around 90%. <xref ref-type="bibr" rid="B69">YeganehMotlaghRashidi and Kamalan (2012)</xref> combined PLS with support vector machines (SVM) to predict the daily average value of CO, resulting in satisfactory outcomes. <xref ref-type="bibr" rid="B7">Carlos et al. (2018)</xref> used a kernel regression model to forecast CO<sub>2</sub> concentration, leveraging continuous data obtained through the Internet, which yielded favorable predictive results. These findings underscore the potential of ML in IAQ prediction.</p>
</sec>
<sec id="s1-3">
<title>1.3 The application of ML in PM<sub>2.5</sub> prediction</title>
<p>PM<sub>2.5</sub>, being a significant pollutant influencing IAQ, has garnered considerable attention. Initially, mechanical models were employed for PM<sub>2.5</sub> prediction. However, these models lacked convenience as they necessitated detailed inputs, including indoor sources and sinks of PM<sub>2.5</sub>, building envelope structures, ventilation conditions, and outdoor concentrations (<xref ref-type="bibr" rid="B61">Wei et al., 2019</xref>). When the prerequisites for constructing mechanical models are unavailable, data-driven ML models have emerged as a favored approach for prediction. Feng et al. used ANN to predict PM<sub>2.5</sub> (<xref ref-type="bibr" rid="B62">XiaoQiZhu et al., 2015</xref>). <xref ref-type="bibr" rid="B8">Cheng et al. (2019)</xref> proposed a PM<sub>2.5</sub> prediction method based on multiple example genetic neural networks for hospitals. <xref ref-type="bibr" rid="B33">Kim et al. (2009)</xref> used a recurrent neural network (RNN) to predict the daily indoor PM<sub>2.5</sub> concentration in subway stations, achieving a root mean square error (RMSE) of 17.8&#xa0;&#x3bc;g/m<sup>3</sup>. The RNN model exhibited superior performance with lower RMSE values and higher accuracy compared to other prediction models. <xref ref-type="bibr" rid="B45">Maher Nor et al. (2015)</xref> evaluated indoor PM<sub>2.5</sub> concentrations in naturally ventilated school buildings using MLR and feed-forward backpropagation (FFBP). The FFBP model outperformed the MLR model in determining indoor PM<sub>2.5</sub>. <xref ref-type="bibr" rid="B71">Yuchi et al. (2019)</xref> applied the RF method to model indoor PM<sub>2.5</sub> concentrations in Mongolian apartments, demonstrating its superiority over the MLR model, but showing comparable performance in cross-validation. <xref ref-type="bibr" rid="B67">Xu et al. (2020)</xref> estimated indoor PM<sub>2.5</sub> concentrations in 66 apartments in China using the RF method, achieving an RMSE of approximately 20&#xa0;&#x3bc;g/m<sup>3</sup> in 10-fold cross-validation. <xref ref-type="bibr" rid="B42">Li et al. (2020)</xref> also employed the RF method and successfully estimated indoor PM<sub>2.5</sub> concentrations with a normalized RMSE of 15% in 10-fold cross-validation.</p>
<p>Although ML algorithms have been utilized to predict indoor PM<sub>2.5</sub> concentrations to some extent, there is limited research specifically focused on predicting PM<sub>2.5</sub> concentrations in office buildings (<xref ref-type="bibr" rid="B61">Wei et al., 2019</xref>; <xref ref-type="bibr" rid="B36">LagesseWangLarson and Kim, 2020</xref>). Moreover, these studies have primarily targeted continuous variables representing PM<sub>2.5</sub> concentrations. According to the Chinese standard &#x201c;Healthy Building Evaluation Standard&#x201d; (T/ASC 02-2021), it is desirable to maintain indoor PM<sub>2.5</sub> concentrations below 25&#xa0;&#x3bc;g/m<sup>3</sup> (<xref ref-type="bibr" rid="B59">T/ASC 02-2021, 2021</xref>). When the indoor PM<sub>2.5</sub> concentration exceeds this threshold, there is a potential health risk, requiring intervention to improve IAQ. Therefore, it is crucial to predict whether the indoor PM<sub>2.5</sub> concentration exceeds the standard. However, there is currently a lack of research on the classification prediction of PM<sub>2.5</sub> concentrations.</p>
<p>Considering the practical problem of classification prediction, the Boosting algorithm offers an effective solution approach. Boosting algorithm is an integrated learning idea that converts weak learners into strong learners by adding iterations, which can solve the supervised learning classification problem (<xref ref-type="bibr" rid="B31">Susnjak et al., 2012</xref>; <xref ref-type="bibr" rid="B58">Sun and Zhou, 2014</xref>). Currently, Boosting algorithm is widely used in the photovoltaic power generation prediction field (<xref ref-type="bibr" rid="B27">Imran, 2021</xref>; <xref ref-type="bibr" rid="B43">Liu et al., 2021</xref>; <xref ref-type="bibr" rid="B68">Yamamoto et al., 2022</xref>), business forecasting (<xref ref-type="bibr" rid="B32">Kiki and Vinasetan, 2020</xref>; <xref ref-type="bibr" rid="B64">Xie et al., 2021</xref>), and medical and healthcare (<xref ref-type="bibr" rid="B5">Amy Isabella et al., 2022</xref>; <xref ref-type="bibr" rid="B65">Xue, 2022</xref>). And it has been applied in strength prediction of building materials (<xref ref-type="bibr" rid="B38">Lee et al., 2021</xref>; <xref ref-type="bibr" rid="B72">Zakir et al., 2022</xref>) and accident early warning (<xref ref-type="bibr" rid="B76">Zhou et al., 2021</xref>; <xref ref-type="bibr" rid="B23">Guo et al., 2022</xref>). CatBoost is one of the main algorithms of the Boosting family of algorithms, with strong robustness and versatility, as well as strong platform applicability and prediction speed (<xref ref-type="bibr" rid="B14">Dorogush et al., 2018</xref>). The CatBoost algorithm is based on a gradient boosting decision tree (GBDT) improved efficient ensemble learning idea that uses sorting lifting and symmetric decision trees as weak classifiers (<xref ref-type="bibr" rid="B26">Huang et al., 2019</xref>). Through sorting enhancement, the CatBoost algorithm builds an independent integrated model for each sample, avoiding prediction bias caused by information leakage during the training process, and improving prediction accuracy; Through the structural characteristics of symmetric decision trees, the CatBoost algorithm has smaller degrees of freedom, effectively reducing the probability of model overfitting, and significantly improving the prediction speed.</p>
</sec>
<sec id="s1-4">
<title>1.4 Contribution</title>
<p>Considering that predicting whether the PM<sub>2.5</sub> concentration will exceed the standard in the next moment is beneficial for regulating indoor air quality. However, there is currently a scarcity of research on utilizing ML algorithms to forecast and classify PM<sub>2.5</sub> concentration. This study aims to develop an intelligent prediction model for PM<sub>2.5</sub> concentration using the CatBoost algorithm. The accuracy and effectiveness of the model are established and validated through the utilization of real monitoring data from office buildings. Additionally, the study aims to assess the efficiency and superiority of the CatBoost algorithm by comparing it with other commonly employed algorithms.</p>
</sec>
</sec>
<sec sec-type="methods" id="s2">
<title>2 Methodology</title>
<sec id="s2-1">
<title>2.1 Data acquisition and processing</title>
<p>The data used in this paper comes from indoor air quality monitoring platforms in an office of Beijing, China. The monitoring content includes indoor temperature, relative humidity, CO<sub>2</sub> concentration, illuminance, and PM<sub>2.5</sub> concentration. The environmental data is continuously transmitted to the monitoring platform via sensors using a wireless network. The data is sent every 5&#xa0;min and stored in individual databases for each measurement. The testing range and measurement accuracy of the monitoring instruments are shown in <xref ref-type="table" rid="T1">Table 1</xref>. The monitoring record interval is 5&#xa0;min, and each measurement data is stored in a separate database. We collected measurement data between January 18, 2022, and March 29, 2022.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Test scope and accuracy of indoor environmental testing indicators.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th colspan="2" align="center">Indicators</th>
<th align="center">Range</th>
<th align="center">Accuracy</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">indoor temperature</td>
<td align="center">
<italic>t</italic>
</td>
<td align="center">&#x2212;40&#x2013;80&#xb0;C</td>
<td align="center">&#xb1;0.3&#xb0;C</td>
</tr>
<tr>
<td align="center">relative humidity</td>
<td align="center">
<italic>d</italic>
</td>
<td align="center">0&#x2013;99.9%</td>
<td align="center">&#xb1;5%</td>
</tr>
<tr>
<td align="center">CO<sub>2</sub> concentration</td>
<td align="center">
<italic>C</italic>
</td>
<td align="center">400&#x2013;5 000ppm</td>
<td align="center">&#xb1;5%</td>
</tr>
<tr>
<td align="center">illuminance</td>
<td align="center">
<italic>L</italic>
</td>
<td align="center">0&#x2013;20 000lx</td>
<td align="center">&#xb1;10%</td>
</tr>
<tr>
<td align="center">PM<sub>2.5</sub> concentration</td>
<td align="center">
<italic>P</italic>
</td>
<td align="center">0&#x2013;1 000&#xa0;&#x3bc;g/m<sup>3</sup>
</td>
<td align="center">&#xb1;10&#xa0;&#x3bc;g/m<sup>3</sup>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The relationship between input and output in this study is to predict the classification of PM<sub>2.5</sub> concentration at the current time using previously sampled monitoring data. Based on existing research, we found that when using ML to predict PM<sub>2.5</sub>, indoor temperature (<xref ref-type="bibr" rid="B33">Kim et al., 2009</xref>; <xref ref-type="bibr" rid="B12">Das et al., 2014</xref>; <xref ref-type="bibr" rid="B15">Elbayoumi et al., 2014</xref>; <xref ref-type="bibr" rid="B16">Elbayoumi et al., 2015</xref>; <xref ref-type="bibr" rid="B44">LiuYoo, 2016</xref>; <xref ref-type="bibr" rid="B13">Deepti and Suresh, 2019</xref>; <xref ref-type="bibr" rid="B11">DaiLiuLi, 2021</xref>), indoor relative humidity (<xref ref-type="bibr" rid="B33">Kim et al., 2009</xref>; <xref ref-type="bibr" rid="B16">Elbayoumi et al., 2015</xref>; <xref ref-type="bibr" rid="B44">LiuYoo, 2016</xref>; <xref ref-type="bibr" rid="B13">Deepti and Suresh, 2019</xref>; <xref ref-type="bibr" rid="B11">DaiLiuLi, 2021</xref>), CO<sub>2</sub> concentration (<xref ref-type="bibr" rid="B33">Kim et al., 2009</xref>; <xref ref-type="bibr" rid="B16">Elbayoumi et al., 2015</xref>; <xref ref-type="bibr" rid="B44">LiuYoo, 2016</xref>; <xref ref-type="bibr" rid="B13">Deepti and Suresh, 2019</xref>; <xref ref-type="bibr" rid="B11">DaiLiuLi, 2021</xref>), previous PM<sub>2.5</sub> (<xref ref-type="bibr" rid="B33">Kim et al., 2009</xref>; <xref ref-type="bibr" rid="B41">Lim et al., 2012</xref>; <xref ref-type="bibr" rid="B29">Jorge et al., 2018</xref>; <xref ref-type="bibr" rid="B52">Hyun et al., 2018</xref>; <xref ref-type="bibr" rid="B13">Deepti and Suresh, 2019</xref>; <xref ref-type="bibr" rid="B11">DaiLiuLi, 2021</xref>), were frequently used as input variables. Infer that these parameters are related to indoor PM<sub>2.5</sub>. In addition, <xref ref-type="bibr" rid="B3">Ahn et al., 2017</xref> included the influence of light when using deep learning methods to predict IAQ. The concentration of particulate matter may be related to light. Therefore, we will use the previous indoor temperature (<italic>t</italic>
<sub>
<italic>h</italic>-1</sub>), relative humidity (<italic>d</italic>
<sub>
<italic>h</italic>-1</sub>), CO<sub>2</sub> concentration (<italic>C</italic>
<sub>
<italic>h</italic>-1</sub>), light intensity (<italic>L</italic>
<sub>
<italic>h</italic>-1</sub>), and PM<sub>2.5</sub> concentration (<italic>P</italic>
<sub>
<italic>h</italic>-1</sub>) as input variables for the models.</p>
<p>The indoor PM<sub>2.5</sub> concentration is chaotic and time-varying, strongly correlated with human activities, and it is unrealistic to accurately predict the dynamic concentration of PM<sub>2.5</sub>. Real-time and accurate prediction of the PM<sub>2.5</sub> concentration range is an important aspect of creating an indoor environment. Therefore, this paper chooses the PM<sub>2.5</sub> concentration range as the output of the classification model. The Chinese standard &#x201c;Assessment Standard for Healthy Building" (T/ASC 02-2021) has requirements for the indoor PM<sub>2.5</sub> concentration limit, and in the &#x201c;Air&#x201d; section of the control items, it is stipulated that the annual average concentration of indoor PM<sub>2.5</sub> should not be higher than 25&#xa0;&#x3bc;g/m<sup>3</sup>. In the scoring items, it is proposed that the annual average concentration of PM<sub>2.5</sub> should not be higher than 15&#xa0;&#x3bc;g/m<sup>3</sup>, and the daily average concentration of PM<sub>2.5</sub> should not be higher than 35&#xa0;&#x3bc;g/m<sup>3</sup>, and in the &#x201c;Improvement and Innovation&#x201d; section of the bonus points, it is proposed that the daily average concentration of PM<sub>2.5</sub> should not be higher than 25&#xa0;&#x3bc;g/m<sup>3</sup> (<xref ref-type="bibr" rid="B59">T/ASC 02-2021, 2021</xref>). Based on the above, this paper determines the PM<sub>2.5</sub> concentration as follows: When PM<sub>2.5</sub> &#x2264; 25&#xa0;&#x3bc;g/m<sup>3</sup>, it is determined that indoor air PM<sub>2.5</sub> pollution is relatively low and the current situation is maintained; When PM<sub>2.5</sub> &#x3e; 25&#xa0;&#x3bc;g/m<sup>3</sup>, it is determined that indoor air PM<sub>2.5</sub> pollution is significant and requires purification, as shown in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Determination of indoor PM2.5 concentration.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">PM<sub>2.5</sub> concentration</th>
<th align="center">Action</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">&#x2264;25&#xa0;&#x3bc;g/m<sup>3</sup>
</td>
<td align="center">Maintain the <italic>status quo</italic>
</td>
</tr>
<tr>
<td align="center">&#x3e;25&#xa0;&#x3bc;g/m<sup>3</sup>
</td>
<td align="center">Need purification</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2-2">
<title>2.2 Data preprocessing</title>
<p>Affected by power supply, signal transmission, network and other factors, monitoring equipment will have data quality problems such as missing values and outlier. To avoid interference with the data model, invalid data needs to be cleaned.</p>
<p>The data quality problems of the dataset used in this paper are missing values and outlier, and the specific processing methods are as follows: 1) Variables with a missing rate of more than 30% are regarded as invalid variables, and those with a missing rate of less than 5% are filled forward. For data with missing values of 5%&#x2013;30%, random forest Multiple Interpolation Model (RFMICE) is used to fill in; 2) The oversize and undersize outlier are identified with 3Sigma criterion and filled forward; For 12 consecutive groups of samples, repeated outlier are regarded as data collection abnormalities and eliminated directly. In order to minimize the impact of outdoor environment and abnormal use on the model, the indoor temperature and CO<sub>2</sub> concentration variables are limited to a reasonable range. Referring to the Chinese standard &#x201c;Design Code for Heating Ventilation and Air Conditioning of Civil Buildings&#x201d; (GB 50736-2012), the indoor temperature is higher than or equal to 16&#xb0;C (<xref ref-type="bibr" rid="B19">GB, 2012</xref>). Referring to the Chinese standard &#x201c;Hygienic Standard for Carbon Dioxide in Indoor Air&#x201d; (GB/T 17094-1997), indoor CO<sub>2</sub> ranges from 0 to 2000&#xa0;mg/m<sup>3</sup> (<xref ref-type="bibr" rid="B20">GB, 2021</xref>).</p>
</sec>
</sec>
<sec id="s3">
<title>3 Prediction model</title>
<p>This article establishes a PM<sub>2.5</sub> concentration range classification prediction model based on the CatBoost model, as shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. The specific process is as follows:<list list-type="simple">
<list-item>
<p>(1) Data acquisition and preprocessing. Data is obtained from the platform and preprocessed.</p>
</list-item>
<list-item>
<p>(2) Data Splitting. The preprocessed data is divided into training set, validation set, and test set in the proportions of 70%, 15%, and 15% respectively.</p>
</list-item>
<list-item>
<p>(3) Hyperparameter optimization (HPO). The criteria for hyperparameters selection are determined through an evaluation of their impact on the model&#x2019;s performance. In addition, based on our comprehensive analysis of previous literature and our own experience (<xref ref-type="bibr" rid="B73">Zhao et al., 2021</xref>; <xref ref-type="bibr" rid="B53">Peng et al., 2022</xref>), we have observed satisfactory performance of CatBoost model after optimizing the model using the hyperparameters including learning_rate, depth, min_data_in_leaf, bagging_temperature and reg_lambda. To visualize the process of HPO, we utilized the Optuna package (version 2.10.0) (<xref ref-type="bibr" rid="B4">Akiba et al., 2019</xref>), an open-source optimization framework. This framework enables us to easily and efficiently implement complex machine learning experiments and perform HPO using Hyperband methods (<xref ref-type="bibr" rid="B40">Li et al., 2016</xref>). With Optuna, we can dynamically test various combinations of hyperparameters, allowing for an effective and systematic exploration of the hyperparameter space. This aids in finding the optimal configuration for our machine learning models, enhancing their performance and accuracy.</p>
</list-item>
<list-item>
<p>(4) The best hyperparameter combination is used to predict the PM<sub>2.5</sub> concentration range, and the performance of the model is further demonstrated through time series cross validation.</p>
</list-item>
</list>
</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>PM<sub>2.5</sub> concentration range classification prediction model.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g001.tif"/>
</fig>
</sec>
<sec sec-type="results" id="s4">
<title>4 Result</title>
<sec id="s4-1">
<title>4.1 Data statistics</title>
<p>After processing the dataset, there were 14570 remaining samples for model analysis, as shown in <xref ref-type="table" rid="T3">Table 3</xref>. The distribution of data for each variable is shown in <xref ref-type="fig" rid="F2">Figure 2</xref>. In the dataset, there are 5985 samples with <italic>p &#x3e;</italic> 25&#xa0;&#x3bc;g/m<sup>3</sup>, and 8585 samples with <italic>p</italic> &#x2264; 25&#xa0;&#x3bc;g/m<sup>3</sup>.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Data description for the model.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th colspan="2" align="center">Input variable</th>
<th align="center">
<italic>t</italic>
<sub>
<italic>h</italic>-1</sub>
</th>
<th align="center">
<italic>d</italic>
<sub>
<italic>h</italic>-1</sub>
</th>
<th align="center">
<italic>C</italic>
<sub>
<italic>h</italic>-1</sub>
</th>
<th align="center">
<italic>L</italic>
<sub>
<italic>h</italic>-1</sub>
</th>
<th align="center">
<italic>P</italic>
<sub>
<italic>h</italic>-1</sub>
</th>
</tr>
<tr>
<th colspan="2" align="center">unit</th>
<th align="center">&#xb0;C</th>
<th align="center">%</th>
<th align="center">ppm</th>
<th align="center">lx</th>
<th align="center">&#x3bc;g/m<sup>3</sup>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="center">Training dataset</td>
<td align="center">Scope</td>
<td align="center">16.0&#x2013;25.3</td>
<td align="center">13.1&#x2013;65.4</td>
<td align="center">406.0&#x2013;1999.0</td>
<td align="center">0.0&#x2013;709.0</td>
<td align="center">1.0&#x2013;124.0</td>
</tr>
<tr>
<td align="center">Average</td>
<td align="center">22.1 &#xb1; 1.9</td>
<td align="center">27.1 &#xb1; 10.4</td>
<td align="center">607.3 &#xb1; 250.0</td>
<td align="center">98.7 &#xb1; 161.3</td>
<td align="center">24.2 &#xb1; 17.8</td>
</tr>
<tr>
<td align="center">Median</td>
<td align="center">22.7</td>
<td align="center">23.7</td>
<td align="center">519.0</td>
<td align="center">2.0</td>
<td align="center">20.0</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Data distribution of various variables in the training dataset.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g002.tif"/>
</fig>
</sec>
<sec id="s4-2">
<title>4.2 Hyperparameter optimization</title>
<p>The model underwent HPO to improve the performance of the prediction model. The CatBoost model with maximum AUC was obtained using Optuna. The search domain and set values for the hyperparameters of the CatBoost model are shown in <xref ref-type="table" rid="T4">Table 4</xref>. The search domain and set values for the hyperparameters of the competition models are also shown in <xref ref-type="table" rid="T4">Table 4</xref>. As shown in the <xref ref-type="fig" rid="F3">Figure 3</xref>, after HPO, the performance of the CatBoost model was improved. The AUC value of the model after HPO is 0.949.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Search domain and optimal combination of the main hyperparameters.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Model</th>
<th align="center">Hyperparameter</th>
<th align="center">Search domain</th>
<th align="center">Set value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="5" align="center">CatBoost</td>
<td align="center">Learning_rate</td>
<td align="center">[0.01, 0.3]</td>
<td align="center">0.013</td>
</tr>
<tr>
<td align="center">Depth</td>
<td align="center">{4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">4</td>
</tr>
<tr>
<td align="center">reg_lambda</td>
<td align="center">[0.01, 10]</td>
<td align="center">9.704</td>
</tr>
<tr>
<td align="center">Bagging_temperature</td>
<td align="center">[0, 1]</td>
<td align="center">0.299</td>
</tr>
<tr>
<td align="center">Min_data_in_leaf</td>
<td align="center">{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">1</td>
</tr>
<tr>
<td rowspan="4" align="center">MLP</td>
<td align="center">Hidden_layer_sizes</td>
<td align="center">{(32), (64), (128), (64, 32), (128, 64)}</td>
<td align="center">(32)</td>
</tr>
<tr>
<td align="center">Activation</td>
<td align="center">{identity, logistic, tanh, relu}</td>
<td align="center">tanh</td>
</tr>
<tr>
<td align="center">Solver</td>
<td align="center">{lbfgs, sgd, adam}</td>
<td align="center">adam</td>
</tr>
<tr>
<td align="center">Alpha</td>
<td align="center">[1e-5, 1e-1]</td>
<td align="center">0.002</td>
</tr>
<tr>
<td rowspan="5" align="center">GBDT</td>
<td align="center">learning_rate</td>
<td align="center">[0.01, 0.1]</td>
<td align="center">0.033</td>
</tr>
<tr>
<td align="center">n_estimators</td>
<td align="center">{50, 51, 52, <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>,200}</td>
<td align="center">129</td>
</tr>
<tr>
<td align="center">max_depth</td>
<td align="center">{2, 3, 4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">2</td>
</tr>
<tr>
<td align="center">min_samples_split</td>
<td align="center">{2, 3, 4, <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, 20}</td>
<td align="center">7</td>
</tr>
<tr>
<td align="center">min_samples_leaf</td>
<td align="center">{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">7</td>
</tr>
<tr>
<td rowspan="2" align="center">LR</td>
<td align="center">penalty</td>
<td align="center">{l1, l2}</td>
<td align="center">l1</td>
</tr>
<tr>
<td align="center">C</td>
<td align="center">[0.001, 10]</td>
<td align="center">0.001</td>
</tr>
<tr>
<td rowspan="3" align="center">DT</td>
<td align="center">max_depth</td>
<td align="center">{2, 3, 4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">3</td>
</tr>
<tr>
<td align="center">min_samples_split</td>
<td align="center">{2, 3, 4, <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, 20}</td>
<td align="center">7</td>
</tr>
<tr>
<td align="center">min_samples_leaf</td>
<td align="center">{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}</td>
<td align="center">5</td>
</tr>
<tr>
<td rowspan="3" align="center">KNN</td>
<td align="center">n_neighbors</td>
<td align="center">{1, 2, 3, <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:mo>&#x2026;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, 20}</td>
<td align="center">20</td>
</tr>
<tr>
<td align="center">weights</td>
<td align="center">{uniform, distance}</td>
<td align="center">distance</td>
</tr>
<tr>
<td align="center">p</td>
<td align="center">{1, 2}</td>
<td align="center">1</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Optimization history of HPO.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g003.tif"/>
</fig>
</sec>
<sec id="s4-3">
<title>4.3 Cross validation</title>
<p>In order to further validate the robustness and generalizability of the CatBoost model, we performed a Time Series Split Cross Validation. We have employed the Rolling Window approach to partition the time series data into several combinations of training and validation sets. This method involves sequentially splitting the data in accordance with the chronological order, where the training set comprises past data and the validation set contains future data. By adopting this approach, we can more effectively simulate the model&#x2019;s performance on future data. The result of cross validation was shown in <xref ref-type="fig" rid="F4">Figure 4</xref>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Cross-validation.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g004.tif"/>
</fig>
</sec>
<sec id="s4-4">
<title>4.4 Model comparison</title>
<p>Further validate the performance of the model by comparing it with other commonly used classification algorithms and CatBoost algorithm. Five other classification algorithms were selected, including multilayer perceptron (MLP) model, GBDT model, Logistic Regression (LR) model, DT model, and k-nearest neighbors (KNN) model. The prediction results are shown in <xref ref-type="fig" rid="F5">Figure 5</xref>. Among the models evaluated, the CatBoost algorithm demonstrates the highest predictive performance, achieving an AUC value of 0.949. The MLP model, GBDT model, DT model, and KNN model show similar AUC values, with scores of 0.917, 0.938, 0.927, and 0.926, respectively. In comparison, the LR model exhibits lower AUC value of 0.888. The Precision-Recall (P-R) curve is a graphical representation that illustrates the trade-off between precision and recall at various classification thresholds. By plotting the P-R curve, we can observe how the model&#x2019;s precision and recall change as the threshold varies. The PR curves of the models are shown in <xref ref-type="fig" rid="F6">Figure 6</xref>. The F1 score is a single metric that combines precision and recall into a balanced measure of a model&#x2019;s performance. The F1 score is particularly useful in cases where we want to consider both precision and recall equally important. The F1 scores of the models are shown in <xref ref-type="table" rid="T5">Table 5</xref>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>ROC curves of 6 different models.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>P-R curves of 6 different models.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g006.tif"/>
</fig>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>Performance of models.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Models</th>
<th align="center">AUC-ROC</th>
<th align="center">AUC-PRC</th>
<th align="center">F1-score</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">CatBoost</td>
<td align="center">0.949</td>
<td align="center">0.928</td>
<td align="center">0.883</td>
</tr>
<tr>
<td align="center">MLP</td>
<td align="center">0.917</td>
<td align="center">0.918</td>
<td align="center">0.835</td>
</tr>
<tr>
<td align="center">GBDT</td>
<td align="center">0.938</td>
<td align="center">0.890</td>
<td align="center">0.880</td>
</tr>
<tr>
<td align="center">LR</td>
<td align="center">0.888</td>
<td align="center">0.919</td>
<td align="center">0.844</td>
</tr>
<tr>
<td align="center">DT</td>
<td align="center">0.927</td>
<td align="center">0.887</td>
<td align="center">0.880</td>
</tr>
<tr>
<td align="center">KNN</td>
<td align="center">0.926</td>
<td align="center">0.924</td>
<td align="center">0.788</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-5">
<title>4.5 Importance analysis</title>
<p>The importance of model included features was analized through the SHAP (Shapley Additive exPlans) (<xref ref-type="bibr" rid="B55">Shapley, 1953</xref>). <xref ref-type="fig" rid="F7">Figure 7</xref> shows the SHAP value ranking of each feature and the specific impact of each feature on the output variable. The top feature value has the greatest importance for the output feature, and decreases in order from top to bottom. For identifying whether the indoor PM<sub>2.5</sub> concentration exceeds the standard, the <italic>P</italic>
<sub>
<italic>h</italic>-1</sub> is the most important feature, followed by the <italic>d</italic>
<sub>
<italic>h</italic>-1</sub>, the <italic>C</italic>
<sub>
<italic>h</italic>-1</sub> and the <italic>t</italic>
<sub>
<italic>h</italic>-1</sub>. The <italic>L</italic>
<sub>
<italic>h</italic>-1</sub> has the least importance. The blue to red color represents the feature value (red high, blue low). The x-axis measures the impacts on the model output (right positive, left negative). From the figure, it can be seen that the greater the <italic>P</italic>
<sub>
<italic>h</italic>-1</sub>, the higher the risk of exceeding the indoor PM<sub>2.5</sub> concentration; the lower the <italic>d</italic>
<sub>
<italic>h</italic>-1</sub>, the greater the risk of exceeding the indoor PM<sub>2.5</sub> concentration, and <italic>vice versa</italic>, the smaller the risk; the higher the <italic>C</italic>
<sub>
<italic>h</italic>-1</sub>, the greater the risk of exceeding the indoor PM<sub>2.5</sub> concentration. Conversely, the smaller the risk; The lower the <italic>t</italic>
<sub>
<italic>h</italic>-1</sub>, the greater the risk of exceeding the indoor PM<sub>2.5</sub> concentration. Conversely, the lower the risk.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Summary of model SHAP features.</p>
</caption>
<graphic xlink:href="fbuil-09-1207193-g007.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s5">
<title>5 Discussion</title>
<p>This paper presents the development of a predictive model based on ML algorithms that accurately predicts indoor PM<sub>2.5</sub> concentration levels. The CatBoost model demonstrates significantly better predictive performance compared to the MLP, GBDT, LR, DT, and KNN models. Moreover, this study identified <italic>P</italic>
<sub>
<italic>h</italic>-1</sub> and <italic>d</italic>
<sub>
<italic>h</italic>-1</sub> as the two most important predictive factors.</p>
<p>Emerging technologies such as the IoT, AI, and ML have shown tremendous potential in monitoring indoor environmental quality and facilitating timely intervention (<xref ref-type="bibr" rid="B1">Adeleke et al., 2017</xref>). The remarkable predictive capabilities of ML algorithms make them highly attractive when combined with on-site data monitoring systems to effectively determine the constantly changing levels of indoor pollutants (<xref ref-type="bibr" rid="B2">AdityaSharma and Gupta, 2018</xref>; <xref ref-type="bibr" rid="B54">Saini et al., 2022</xref>). Previous studies conducted by Elbayoumi and Yuchi have confirmed the value of ML methods in predicting PM<sub>2.5</sub> concentrations (<xref ref-type="bibr" rid="B16">Elbayoumi et al., 2015</xref>; <xref ref-type="bibr" rid="B71">Yuchi et al., 2019</xref>). These findings align with our research, further validating the efficiency and superiority of ML models. By comparing CatBoost with other commonly used models, we highlight the superior predictive performance of CatBoost, providing valuable insights into its application for indoor air quality prediction.</p>
<p>Given the severity of global air pollution exceeding health thresholds, the impact of air quality on human health has garnered significant attention (<xref ref-type="bibr" rid="B47">Massey et al., 2012</xref>). As mentioned in the introduction, PM<sub>2.5</sub> poses a serious threat to human health, particularly due to the potential attachment of harmful microorganisms to particulate matter. Stressing the importance of maintaining good indoor air quality in office buildings cannot be overstated, as it directly affects the health and wellbeing of occupants. Predictability plays a crucial role in controlling PM<sub>2.5</sub> concentrations since indoor air quality improvement systems often exhibit inherent delays. Accurately predicting PM<sub>2.5</sub> concentrations poses a considerable challenge. This study constructs a PM<sub>2.5</sub> concentration classification prediction model using ML algorithms, enabling accurate predictions of whether PM<sub>2.5</sub> concentrations exceed the standard and serving as a risk warning model. The model&#x2019;s predicted results can be utilized as control signals in the operation and regulation of indoor air quality improvement equipment, such as fresh air systems. By forecasting in advance whether indoor PM<sub>2.5</sub> concentrations will exceed the standard, the model helps determine whether the <italic>status quo</italic> can be maintained or if purification measures are required. Implementing the model&#x2019;s predicted results as control signals guides the control system, achieving improved indoor environmental conditions while minimizing unnecessary energy consumption.</p>
<p>Through variable importance ranking analysis, we have determined that previous PM<sub>2.5</sub> concentration is the most significant influencing factor in predicting whether the PM<sub>2.5</sub> concentration exceeds the standard, as our expected. Furthermore, we observed a negative correlation between indoor humidity and the risk of PM<sub>2.5</sub> concentration exceeding the standard. Humidity plays a vital role in the nucleation, condensation, and volatilization of particles, thereby influencing their diffusion process and altering the concentration of PM<sub>2.5</sub> (<xref ref-type="bibr" rid="B9">Chithra and Nagendra, 2014</xref>). A study conducted by Yang et al. monitored indoor PM concentrations in a primary school classroom in North China and assessed the contributions of various influencing factors, the findings highlighted the critical role of indoor humidity in managing indoor PM<sub>2.5</sub> concentration (<xref ref-type="bibr" rid="B22">Guangfei and YuheBing, 2023</xref>). The research results of this article align with the aforementioned studies, providing further evidence of the close relationship between indoor humidity and the likelihood of PM<sub>2.5</sub> concentration exceeding the standard. In practical applications, this research provides valuable references for real-time assessment and management of indoor air quality. This study reveals that lower humidity increases the risk of indoor PM<sub>2.5</sub> concentrations surpassing the standard in the next time interval, emphasizing the significance of humidity control in improving air quality.</p>
</sec>
<sec sec-type="conclusion" id="s6">
<title>6 Conclusion</title>
<p>In comparison to the MLP, GBDT, LR, DT, and KNN models, the CatBoost model demonstrates notable advantages in predicting whether the indoor PM<sub>2.5</sub> concentration exceeds the standard. Through HPO, the model&#x2019;s predictive performance can be further enhanced. Additionally, this study identifies the previous PM<sub>2.5</sub> concentration and relative humidity as the two most influential factors for prediction.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s7">
<title>Data availability statement</title>
<p>The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>XYW is mainly responsible for the initial writing of articles, data analysis, and so on. ZWG is mainly responsible for reviewing articles, project management, and so on. LG is mainly responsible for data collection, article polishing, and so on. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s9">
<title>Funding</title>
<p>This study was supported by the Opening Funds of State Key Laboratory of Building Safety and Built Environment and National Engineering Research Center of Building Technology and the key special project of the National Key R&#x0026;D Plan &#x201c;Intergovernmental International Science and Technology Innovation Cooperation&#x201d; titled &#x201c;Research on Improving Energy Efficiency and Health Performance of Building Operation Based on Full Life Cycle Carbon Reduction&#x201d; (2018YFE0106100).</p>
</sec>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adeleke</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Moodley</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rens</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Adewumi</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Integrating statistical machine learning in a semantic sensor web for proactive monitoring and control</article-title>. <source>Sensors</source> <volume>17</volume> (<issue>4</issue>), <fpage>807</fpage>. <pub-id pub-id-type="doi">10.3390/s17040807</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Aditya</surname>
</name>
<name>
<surname>Sharma</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>S. C.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>An Internet of Things based smart surveillance and monitoring system using arduino[C]</article-title>,&#x201d; in <conf-name>2018 International Conference on Advances in Computing and Communication Engineering (ICACCE)</conf-name> (<publisher-name>IEEE</publisher-name>).</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahn</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Shin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Indoor air quality analysis using deep learning with sensor data</article-title>. <source>Sensors</source> <volume>17</volume> (<issue>11</issue>), <fpage>2476</fpage>. <pub-id pub-id-type="doi">10.3390/s17112476</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Akiba</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Sano</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yanase</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ohta</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Koyama</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Optuna</surname>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>A next-generation hyperparameter optimization framework</article-title>,&#x201d; in <conf-name>Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery &#x26; data mining</conf-name>, <fpage>2623</fpage>&#x2013;<lpage>2631</lpage>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amy Isabella</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Javier</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Peter</surname>
<given-names>J. G.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Integrating multiple brain imaging modalities does not boost prediction of subclinical atherosclerosis in midlife adults[J]</article-title>. <source>NeuroImage Clin.</source> <volume>35</volume>, <fpage>103134</fpage>. <pub-id pub-id-type="doi">10.1016/j.nicl.2022.103134</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>BellMichelle</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Davis</surname>
<given-names>D. L.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Reassessment of the lethal london fog of 1952: Novel Indicators of acute and chronic consequences of acute exposure to air pollution</article-title>. <source>J. Environ. Health Perspect. Suppl.</source> <volume>109</volume>, <fpage>389</fpage>. <pub-id pub-id-type="doi">10.2307/3434786</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carlos</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Valeria</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Guillermo</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Use of non&#x2010;industrial environmental sensors and machine learning techniques in telemetry for indoor air pollution</article-title>. <source>ARPN J. Eng. Appl. Sci.</source> <volume>13</volume>, <fpage>2702&#x2010;2712</fpage>.</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Indoor PM2.5 prediction based on multi-instance genetic neural network[J]</article-title>. <source>Comput. Appl. Softw.</source> <volume>36</volume> (<issue>5</issue>), <fpage>7</fpage>. <comment>(in Chinese)</comment>. <pub-id pub-id-type="doi">10.3969/j.issn.1000-386x.2019.05.041</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chithra</surname>
<given-names>V. S.</given-names>
</name>
<name>
<surname>Nagendra</surname>
<given-names>S. S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Impact of outdoor meteorology on indoor PM10, PM2.5 and PM1 concentrations in a naturally ventilated classroom</article-title>. <source>Urban Clim.</source> <volume>10</volume>, <fpage>77e91</fpage>. <pub-id pub-id-type="doi">10.1016/j.uclim.2014.10.001</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Choi</surname>
<given-names>M. L.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Kwon</surname>
<given-names>Y. M.</given-names>
</name>
<name>
<surname>Kwon</surname>
<given-names>Y. M.</given-names>
</name>
<name>
<surname>Chung</surname>
<given-names>D. K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A study on the prediction method of Emergency Room (ER) pollution level based on deep learning using scattering sensor[J]</article-title>. <source>J. Eng. Appl. Sci.</source> <volume>12</volume> (<issue>10</issue>), <fpage>2560</fpage>&#x2013;<lpage>2564</lpage>. <pub-id pub-id-type="doi">10.3923/jeasci.2017.2560.2564</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>DaiLiuLi</surname>
<given-names>X. J. Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A recurrent neural network using historical data to predict time series indoor PM2.5 concentrations for residential buildings</article-title>. <source>Indoor Air</source> <volume>31</volume>, <fpage>1228</fpage>&#x2013;<lpage>1237</lpage>. <pub-id pub-id-type="doi">10.1111/ina.12794</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Das</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Shrubsole</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hamilton</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Chalabi</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Davies</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Using probabilistic sampling-based sensitivity analyses for indoor air quality modelling</article-title>. <source>Build. Environ.</source> <volume>78</volume> (<issue>AUG</issue>), <fpage>171</fpage>&#x2013;<lpage>182</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2014.04.017</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deepti</surname>
<given-names>S.</given-names>
</name>
</person-group>, and <person-group person-group-type="author">
<name>
<surname>Suresh</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Impact of intervention of biomass cookstove technologies and kitchen characteristics on indoor air quality and human exposure in rural settings of India</article-title>. <source>Environ. Int.</source> <volume>123</volume>, <fpage>240</fpage>&#x2013;<lpage>255</lpage>. <pub-id pub-id-type="doi">10.1016/j.envint.2018.11.059</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Dorogush</surname>
<given-names>A. V.</given-names>
</name>
<name>
<surname>Ershov</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Gulin</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <source>CatBoost: Gradient boosting with categorical features support[J]</source>.</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elbayoumi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ramli</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Md Yusof</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Yahaya</surname>
<given-names>A. S. B.</given-names>
</name>
<name>
<surname>Al Madhoun</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ul-Saufie</surname>
<given-names>A. Z.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Multivariate methods for indoor PM10 and PM2.5 modelling in naturally ventilated schools buildings</article-title>. <source>Atmos. Environ.</source> <volume>94</volume>, <fpage>11</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2014.05.007</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elbayoumi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ramli</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Yusof</surname>
<given-names>N. F. F. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Development and comparison of regression models and feedforward backpropagation neural network models to predict seasonal indoor PM2 5e10 and PM2.5 concentrations in naturally ventilated schools</article-title>. <source>Atmos. Pollut. Res.</source> <volume>6</volume> (<issue>6</issue>), <fpage>1013e1023</fpage>. <pub-id pub-id-type="doi">10.1016/j.apr.2015.09.001</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fan</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yoshino</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Yanagi</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Hasegawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kagi</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Indoor environmental conditions in urban and rural homes with older people during heating season: A case in cold region, China[J]</article-title>. <source>Energy &#x26; Build</source>.</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fisk</surname>
<given-names>W. J.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Health and productivity gains from better indoor environments and their relationship with building energy efficiency[j]</article-title>. <source>Annu. Rev. Energy Environ.</source> <volume>25</volume>. <pub-id pub-id-type="doi">10.1146/annurev.energy.25.1.537</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="book">
<collab>GB</collab> (<year>2012</year>). &#x201c;<article-title>Design Code for heating ventilation and air conditioning of Civil buildings (in Chinese)</article-title>,&#x201d; in <source>Ministry of housing and urban-rural development of the people&#x27;s Republic of China, general administration of quality supervision, inspection and quarantine of the people&#x27;s Republic of China</source> (<publisher-loc>Beijing</publisher-loc>: <publisher-name>China Architecture &#x26; Building Press</publisher-name>).</citation>
</ref>
<ref id="B20">
<citation citation-type="book">
<collab>GB</collab> (<year>2021</year>). <source>Hygienic standard for carbon dioxide in indoor air (in Chinese)</source>. <publisher-name>National Health Commission of the People&#x27;s Republic of China</publisher-name>.</citation>
</ref>
<ref id="B21">
<citation citation-type="book">
<collab>GB</collab> (<year>2002</year>). &#x201c;<article-title>Indoor air quality standard (in Chinese)</article-title>,&#x201d; in <source>The general administration of quality supervision, inspection and quarantine of the People&#x2019;s Republic of China</source> (<publisher-loc>Beijing</publisher-loc>: <publisher-name>General Administration of Quality Supervision</publisher-name>).</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guangfei</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>YuheBing</surname>
<given-names>Z. Y.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Contribution of influential factors on PM2.5 concentrations in classrooms of a primary school in North China: A machine discovery approach</article-title>. <source>Energy Build.</source> <volume>283</volume>, <fpage>112787</fpage>. <pub-id pub-id-type="doi">10.1016/j.enbuild.2023.112787</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Quan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Construction of rapid early warning and comprehensive analysis models for urban waterlogging based on AutoML and comparison of the other three machine learning algorithms[J]</article-title>. <source>J. Hydrology</source>, <fpage>605</fpage>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Horr</surname>
<given-names>Yousef Al</given-names>
</name>
<name>
<surname>Arif</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kaushik</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Mazroei</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Katafygiotou</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Elsarrag</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Occupant productivity and office indoor environment quality: A review of the literature</article-title>. <source>[J]. Build. Environ.</source> <volume>105</volume>, <fpage>369</fpage>&#x2013;<lpage>389</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2016.06.001</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Indoor air quality analysis of 8 mechanically ventilated residential buildings in northeast China based on long-term monitoring</article-title>. <source>Sustain. Cities Soc.</source> <volume>54</volume> (<issue>C</issue>), <fpage>101947</fpage>. <pub-id pub-id-type="doi">10.1016/j.scs.2019.101947</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions</article-title>. <source>J. Hydrology</source> <volume>574</volume>, <fpage>1029</fpage>&#x2013;<lpage>1041</lpage>. <pub-id pub-id-type="doi">10.1016/j.jhydrol.2019.04.085</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hyun</surname>
<given-names>P. S.</given-names>
</name>
<name>
<surname>Soo</surname>
<given-names>S. H.</given-names>
</name>
<name>
<surname>Nam</surname>
<given-names>P. S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A novel pH-responsive hydrogel based on carboxymethyl cellulose/2-hydroxyethyl acrylate for transdermal delivery of naringenin</article-title>. <source>J. Hazard. Mater.</source> <volume>200</volume> (<issue>Jan.5</issue>), <fpage>341</fpage>&#x2013;<lpage>352</lpage>. <pub-id pub-id-type="doi">10.1016/j.carbpol.2018.08.011</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Imran</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Solar power system assessments using ANN and hybrid boost converter based MPPT algorithm[J]</article-title>. <source>Appl. Sci.</source> <volume>11</volume> (<issue>23</issue>), <fpage>11332</fpage>. <pub-id pub-id-type="doi">10.3390/app112311332</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jacobs</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Burgess</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Abbott</surname>
<given-names>M. B.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>The donora smog revisited: 70 Years after the event that inspired the clean air act</article-title>. <source>Am. J. Public Health</source> <volume>108</volume> (<issue>S2</issue>), <fpage>S85</fpage>&#x2013;<lpage>S88</lpage>. <pub-id pub-id-type="doi">10.2105/ajph.2017.304219</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jorge</surname>
<given-names>L-B.</given-names>
</name>
<name>
<surname>Paulina</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Qian</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>ChangKyoo</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Sequential prediction of quantitative health risk assessment for the fine particulate matter in an underground facility using deep recurrent neural networks</article-title>. <source>[J].Ecotoxicology Environ. Saf.</source> <volume>169</volume>, <fpage>316</fpage>&#x2013;<lpage>324</lpage>. <pub-id pub-id-type="doi">10.1016/j.ecoenv.2018.11.024</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Justin</surname>
<given-names>Y. Y. L.</given-names>
</name>
<name>
<surname>Yanhao</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ricky</surname>
<given-names>L. T. C.</given-names>
</name>
<name>
<surname>Mark</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Patrick</surname>
<given-names>K. H. L.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Artificial intelligence-based prediction of indoor bioaerosol concentrations from indoor air quality sensor data[J]</article-title>. <source>Environ. Int</source>.</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kiki</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Vinasetan</surname>
<given-names>R. H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Prediction of the purchase intention of users on ECommerce platforms using gradient boosting</article-title>. <source>Int. J. Eng. Adv. Technol. (IJEAT)</source> <volume>10</volume> (<issue>1</issue>), <fpage>446</fpage>&#x2013;<lpage>450</lpage>. <pub-id pub-id-type="doi">10.35940/ijeat.a1929.1010120</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>Y. S.</given-names>
</name>
<name>
<surname>Sung</surname>
<given-names>S. W.</given-names>
</name>
<name>
<surname>Yoo</surname>
<given-names>C. K.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Data-driven prediction model of indoor air quality by the preprocessed recurrent neural networks[C]</article-title>. In <conf-name>Proceeding of the Iccas-sice</conf-name>. <publisher-name>IEEE</publisher-name>.</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>KimSankararaoKang</surname>
<given-names>M. J. B. O. Y.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yoo</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Monitoring and prediction of indoor air quality (IAQ) in subway or metro systems using season dependent models</article-title>. <source>Energy Build.</source> <volume>46</volume> (<issue>Mar</issue>), <fpage>48</fpage>&#x2013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1016/j.enbuild.2011.10.047</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kropat</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Bochud</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Jaboyedoff</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Laedermann</surname>
<given-names>J. P.</given-names>
</name>
<name>
<surname>Murith</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Palacios</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Improved predictive mapping of indoor radon concentrations using ensemble regression trees based on automatic clustering of geological units</article-title>. <source>J. Environ. Radioact.</source> <volume>147</volume>, <fpage>51</fpage>&#x2013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1016/j.jenvrad.2015.05.006</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>LagesseWangLarson</surname>
<given-names>B. S. T. V.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>A. A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Predicting PM<sub>2.5</sub> in well-mixed indoor air for a large office building using regression and artificial neural network models</article-title>. <source>Environ. Sci. Technol.</source> <volume>54</volume> (<issue>23</issue>), <fpage>15320</fpage>&#x2013;<lpage>15328</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.0c02549</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lai</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Dai</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Ventilation behavior in residential buildings with mechanical ventilation systems across different climate zones in China</article-title>. <source>Build. Environ.</source> <volume>143</volume>, <fpage>679</fpage>&#x2013;<lpage>690</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2018.08.006</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Vo</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Thai</surname>
<given-names>T. H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Strength prediction of concrete-filled steel tubular columns using Categorical Gradient Boosting algorithm[J]</article-title>. <source>Eng. Struct.</source>, <fpage>238</fpage>. <pub-id pub-id-type="doi">10.1016/j.engstruct.2021.112109</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>LeeKimKim</surname>
<given-names>S. M. J. J. T.</given-names>
</name>
<name>
<surname>Yoo</surname>
<given-names>C. K.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>In search for modeling predictive control of indoor air quality and ventilation energy demand in subway station</article-title>. <source>Energy &#x26; Build.</source> <volume>98</volume> (<issue>jul</issue>), <fpage>56</fpage>&#x2013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1016/j.enbuild.2014.10.082</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Jamieson</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Desalvo</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Rostamizadeh</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Talwalkar</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Hyperband: A novel bandit-based approach to hyperparameter optimization</article-title>. <source>J. Mach. Learn. Res.</source> <volume>18</volume> (<issue>1-52</issue>). <pub-id pub-id-type="doi">10.48550/arXiv.1603.06560</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lim</surname>
<given-names>J. J.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>Y. S.</given-names>
</name>
<name>
<surname>Oh</surname>
<given-names>T. S.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Kang</surname>
<given-names>O. Y.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J. T.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Analysis and prediction of indoor air pollutants in a subway station using a new key variable selection method</article-title>. <source>Korean J. Chem. Eng.</source> <volume>29</volume>, <fpage>994</fpage>&#x2013;<lpage>1003</lpage>. <pub-id pub-id-type="doi">10.1007/s11814-011-0278-z</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Tong</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Man</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ho</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Yim</surname>
<given-names>S. H. L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods</article-title>. <source>Chemosphere</source> <volume>265</volume> (<issue>2</issue>), <fpage>129140</fpage>. <pub-id pub-id-type="doi">10.1016/j.chemosphere.2020.129140</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Yun</given-names>
</name>
<name>
<surname>Heidari</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Boosting slime mould algorithm for parameter identification of photovoltaic models</article-title>. <source>J. Energy</source> <volume>234</volume>, <fpage>121164</fpage>. <pub-id pub-id-type="doi">10.1016/j.energy.2021.121164</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>LiuYoo</surname>
<given-names>H. C. K.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>A robust localized soft sensor for particulate matter modeling in Seoul metro systems</article-title>. <source>J. Hazard. Mater.</source> <volume>305</volume> (<issue>15</issue>), <fpage>209</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1016/j.jhazmat.2015.11.051</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maher</surname>
</name>
<name>
<surname>Elbayoumi</surname>
</name>
<name>
<surname>Nor</surname>
</name>
<name>
<surname>Azam</surname>
</name>
<name>
<surname>Ramli</surname>
</name>
<name>
<surname>Noor</surname>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Development and comparison of regression models and feedforward backpropagation neural network models to predict seasonal indoor PM2.5&#x2013;10 and PM2.5 concentrations in naturally ventilated schools[J]</article-title>. <source>Atmos. Pollut. Res</source>.</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martin</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>&#x160;afr&#xe1;nek</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Characterization of coarse particulate matter in school gyms[J]</article-title>. <source>Environ. Res</source>.</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Massey</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kulshrestha</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Masih</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Taneja</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Seasonal trends of PM10, PM5.0, PM2.5 &#x26; PM1.0 in indoor and outdoor environments of residential homes located in North-Central India</article-title>. <source>indoor outdoor Environ. Resid. homes located north-central India, Build. Environ.</source> <volume>47</volume> (<issue>none</issue>), <fpage>223</fpage>&#x2013;<lpage>231</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2011.07.018</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>MengSpectorColome</surname>
<given-names>Q. Y. D. S.</given-names>
</name>
<name>
<surname>Turpin</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Determinants of indoor and personal exposure to PM2.5 of indoor and outdoor origin during the RIOPA study</article-title>. <source>Atmos. Environ.</source> <volume>43</volume> (<issue>36</issue>), <fpage>5750</fpage>&#x2013;<lpage>5758</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2009.07.066</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ming</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rksa</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gza</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Recent trends in premature mortality and health disparities attributable to ambient PM 2.5 exposure in China: 2005&#x2013;2017[J]</article-title>. <source>Environ. Pollut.</source> <volume>279</volume>. <pub-id pub-id-type="doi">10.1016/j.envpol.2021.116882</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mo</surname>
<given-names>J-H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Association of particulate matter with ENT diseases</article-title>. <source>Clin. Exp. Otorhinolaryngology</source> <volume>12</volume> (<issue>3</issue>), <fpage>237</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.21053/ceo.2019.00752</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Newsham</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Veitch</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Charles</surname>
<given-names>K. E.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Risk factors for dissatisfaction with the indoor environment in open-plan offices: An analysis of COPE field study data</article-title>. <source>J. Indoor air</source> <volume>18</volume> (<issue>4</issue>), <fpage>271</fpage>&#x2013;<lpage>282</lpage>. <pub-id pub-id-type="doi">10.1111/j.1600-0668.2008.00525.x</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>A machine learning-based prediction model for acute kidney injury in patients with congestive heart failure</article-title>. <source>Front. Cardiovasc Med.</source> <volume>9</volume>, <fpage>842873</fpage>. <pub-id pub-id-type="doi">10.3389/fcvm.2022.842873</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saini</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Dutta</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Marques</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Modeling indoor PM2.5 using adaptive dynamic fuzzy inference system tree (ADFIST) on Internet of things-based sensor network data</article-title>. <source>Internet Things</source> <volume>20</volume>, <fpage>100628</fpage>. <pub-id pub-id-type="doi">10.1016/j.iot.2022.100628</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shapley</surname>
<given-names>L. S.</given-names>
</name>
</person-group> (<year>1953</year>). <article-title>A value for n-persons games[J]</article-title>. <source>Ann. Math. Stud.</source> <volume>28</volume> (<issue>7</issue>), <fpage>307</fpage>&#x2013;<lpage>318</lpage>.</citation>
</ref>
<ref id="B56">
<citation citation-type=" confproc">
<person-group person-group-type="author">
<name>
<surname>Sk&#xf6;n</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Johansson</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Raatikainen</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Leivisk</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kolehmainen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2012</year>). &#x201c;<article-title>Modelling indoor air carbon dioxide (CO2) concentration using neural network</article-title>,&#x201d; in <conf-name>World Acad Sci Eng Technol Int Sci Index</conf-name> <volume>6</volume> <fpage>737</fpage>&#x2013;<lpage>741</lpage>.</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sofuoglu</surname>
<given-names>S. C .</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Application of artificial neural networks to predict prevalence of building-related symptoms in office buildings</article-title>. <source>Build. Environ.</source> <volume>43</volume> (<issue>6</issue>), <fpage>1121</fpage>&#x2013;<lpage>1126</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2007.03.003</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>X. W.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H. B.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Research on applied technology in experiments with three boosting algorithms</article-title>. <source>Adv. Mater. Res.</source> <volume>3103</volume> (<issue>908-908</issue>), <fpage>513</fpage>&#x2013;<lpage>516</lpage>. <pub-id pub-id-type="doi">10.4028/www.scientific.net/amr.908.513</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Susnjak</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Barczak</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Reyes</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Hawick</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Multiclass cascades for ensemble-based boosting algorithms[J]</article-title>. <source>Front. Artif. Intell. Appl.</source> <volume>241</volume>, <fpage>330</fpage>&#x2013;<lpage>335</lpage>. <pub-id pub-id-type="doi">10.3233/978-1-61499-096-3-330</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="book">
<collab>T/Asc 02-2021</collab> (<year>2021</year>). &#x201c;<article-title>Assessment standard for healthy building (in Chinese)</article-title>,&#x201d; in <source>The architectural society of China</source> (<publisher-loc>Beijing</publisher-loc>: <publisher-name>China Architecture &#x26; Building Press</publisher-name>).</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thayer Julian</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Verkuil</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Brosschotj</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Kevin</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>West</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sterling</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>Effects of the physical work environment on physiological measures of stress</article-title>. <source>J. Eur. J. Cardiovasc Prev. Rehabil.</source> <volume>17</volume>, <fpage>431</fpage>&#x2013;<lpage>439</lpage>. <pub-id pub-id-type="doi">10.1097/hjr.0b013e328336923a</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wei</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ramalho</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Malingre</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Sivanantham</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Little</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Mandin</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Machine learning and statistical models for predicting indoor air quality</article-title>. <source>Indoor Air</source> <volume>29</volume> (<issue>5</issue>), <fpage>704</fpage>&#x2013;<lpage>726</lpage>. <pub-id pub-id-type="doi">10.1111/ina.12580</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>XiaoQiZhu</surname>
<given-names>F. L. Y.</given-names>
</name>
<name>
<surname>Hou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Artificial neural networks forecasting of PM2.5 pollution using air mass trajectory based geographic model and wavelet transformation</article-title>. <source>Atmos. Environ.</source> <volume>107</volume>, <fpage>118</fpage>&#x2013;<lpage>128</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2015.02.030</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Xie</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2009</year>). &#x201c;<article-title>Prediction of indoor air quality using artificial neural networks[C]</article-title>,&#x201d; in <conf-name>International Conference on Natural Computation</conf-name> (<publisher-name>IEEE Computer Society</publisher-name>).</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xie</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Xiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Factors influencing hypertrophic scarring after thyroidectomy</article-title>. <source>Electr. Eng.</source> <volume>34</volume>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1097/01.ASW.0000775924.92065.78</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xue</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Marppi: Boosting prediction of protein-protein interactions with multi-scale architecture residual network[J]</article-title>. <source>Briefings Bioinforma</source>.</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>XueWangLiu</surname>
<given-names>Q. Z. J.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Indoor PM2.5 concentrations during winter in a severe cold region of China: A comparison of passive and conventional residential buildings</article-title>. <source>Build. Environ.</source> <volume>180</volume>, <fpage>106857</fpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2020.106857</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Estimating hourly average indoor PM2.5 using the random forest approach in two megacities, China</article-title>. <source>Build. Environ.</source> <volume>180</volume>, <fpage>107025</fpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2020.107025</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yamamoto</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Kure</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kondoh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kodaira</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Multi-point forecasting of photovoltaic power generation by light gradient boosting machine[J]</article-title>. <source>Grand. Renew. Energy Proc.</source> <volume>2</volume>, <fpage>9</fpage>. <pub-id pub-id-type="doi">10.24752/gre.2.0_9</pub-id>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>YeganehMotlaghRashidi</surname>
<given-names>B. M. Y.</given-names>
</name>
<name>
<surname>Kamalan</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Prediction of CO concentrations based on a hybrid partial least square and support vector machine model</article-title>. <source>Atmos. Environ.</source> <volume>55</volume> (<issue>none</issue>), <fpage>357</fpage>&#x2013;<lpage>365</lpage>. <pub-id pub-id-type="doi">10.1016/j.atmosenv.2012.02.092</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Yuchi</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2017</year>). <source>Modelling fine particulate matter concentrations inside the homes of pregnant women in ulaanbaatar, Mongolia[J]</source>.</citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuchi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Gombojav</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Boldbaatar</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Galsuren</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Allen</surname>
<given-names>R. W.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city</article-title>. <source>] Environ. Pollut.</source> <volume>245</volume>, <fpage>746</fpage>&#x2013;<lpage>753</lpage>. <pub-id pub-id-type="doi">10.1016/j.envpol.2018.11.034</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zakir</surname>
<given-names>S. S.</given-names>
</name>
<name>
<surname>Khondaker</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nafiz</surname>
<given-names>I. K.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Machine learning-based failure mode identification of double shear bolted connections in structural steel[J]</article-title>. <source>Eng. Fail. Anal.</source>, <fpage>139</fpage>. <pub-id pub-id-type="doi">10.1016/j.engfailanal.2022.106471</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Q. Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>L. P.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>S. J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Development and validation of a machine-learning model for prediction of extubation failure in intensive care units</article-title>. <source>Front. Med. (Lausanne).</source> <volume>8</volume>, <fpage>676343</fpage>. <pub-id pub-id-type="doi">10.3389/fmed.2021.676343</pub-id>
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Cao</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Influence of atmospheric fine particulate matter (PM 2.5) pollution on indoor environment during winter in Beijing</article-title>. <source>Build. Environ.</source> <volume>87</volume> (<issue>5</issue>), <fpage>283</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2015.02.008</pub-id>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>ZhaoLiuRen</surname>
<given-names>L. J. J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Impact of various ventilation modes on IAQ and energy consumption in Chinese dwellings: First long-term monitoring study in Tianjin, China</article-title>. <source>Build. Environ.</source> <volume>143</volume> (<issue>OCT</issue>), <fpage>99</fpage>&#x2013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1016/j.buildenv.2018.06.057</pub-id>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Fire prediction based on CatBoost algorithm[J]</article-title>. <source>Math. Problems Eng.</source>, <fpage>2021</fpage>.</citation>
</ref>
</ref-list>
</back>
</article>