<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Public Health</journal-id>
<journal-title>Frontiers in Public Health</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Public Health</abbrev-journal-title>
<issn pub-type="epub">2296-2565</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpubh.2022.876949</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Public Health</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Machine learning in the loop for tuberculosis diagnosis support</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Orjuela-Ca&#x000F1;&#x000F3;n</surname> <given-names>Alvaro D.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1247010/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Jutinico</surname> <given-names>Andr&#x000E9;s L.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1893418/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Awad</surname> <given-names>Carlos</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Vergara</surname> <given-names>Erika</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Palencia</surname> <given-names>Ang&#x000E9;lica</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Medicine and Health Sciences, Universidad del Rosario</institution>, <addr-line>Bogot&#x000E1;</addr-line>, <country>Colombia</country></aff>
<aff id="aff2"><sup>2</sup><institution>Biomedical Engineering, Universidad Antonio Nari&#x000F1;o</institution>, <addr-line>Bogot&#x000E1;</addr-line>, <country>Colombia</country></aff>
<aff id="aff3"><sup>3</sup><institution>Subred Integrada de Servicios de Salud Centro Oriente E.S.E</institution>, <addr-line>Bogot&#x000E1;</addr-line>, <country>Colombia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: ZhiMin Xiao, University of Essex, United Kingdom</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Ivan Miguel Pires, Universidade da Beira Interior, Portugal; Nejat Yumu&#x0015F;ak, Sakarya University, Turkey</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Alvaro D. Orjuela-Ca&#x000F1;&#x000F3;n <email>alvaro.orjuela&#x00040;urosario.edu.co</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Infectious Diseases &#x02013; Surveillance, Prevention and Treatment, a section of the journal Frontiers in Public Health</p></fn></author-notes>
<pub-date pub-type="epub">
<day>26</day>
<month>07</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>10</volume>
<elocation-id>876949</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>06</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Orjuela-Ca&#x000F1;&#x000F3;n, Jutinico, Awad, Vergara and Palencia.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Orjuela-Ca&#x000F1;&#x000F3;n, Jutinico, Awad, Vergara and Palencia</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The use of machine learning (ML) for diagnosis support has advanced in the field of health. In the present paper, the results of studying ML techniques in a tuberculosis diagnosis loop in a scenario of limited resources are presented. Data are analyzed using a tuberculosis (TB) therapy program at a health institution in a main city of a developing country using five ML models. Logistic regression, classification trees, random forest, support vector machines, and artificial neural networks are trained under physician supervision following physicians&#x00027; typical daily work. The models are trained on seven main variables collected when patients arrive at the facility. Additionally, the variables applied to train the models are analyzed, and the models&#x00027; advantages and limitations are discussed in the context of the automated ML techniques. The results show that artificial neural networks obtain the best results in terms of accuracy, sensitivity, and area under the receiver operating curve. These results represent an improvement over smear microscopy, which is commonly used techniques to detect TB for special cases. Findings demonstrate that ML in the TB diagnosis loop can be reinforced with available data to serve as an alternative diagnosis tool based on data processing in places where the health infrastructure is limited.</p></abstract>
<kwd-group>
<kwd>tuberculosis diagnosis</kwd>
<kwd>machine learning</kwd>
<kwd>relevance analysis</kwd>
<kwd>machine learning in the loop</kwd>
<kwd>diagnosis support systems</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="7"/>
<equation-count count="0"/>
<ref-count count="62"/>
<page-count count="0"/>
<word-count count="6947"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Artificial intelligence (AI) is a set of bioinspired algorithms that are used to solve problems in different applications. Within this wide area, machine learning (ML) is a common subfield in which models learn from examples of data, taking advantage of the idea of adjusting parameters in classification or regression tasks (<xref ref-type="bibr" rid="B1">1</xref>). There are several different ML models according to the fundamental concepts for adapting the parameters, with diverse examples including naive Bayes, decision or classification trees, support vector machines (SVM), and artificial neural networks (ANNs), which emulate the behavior of the brain through connectionist models. Besides these and other ML models, new models are continuously being proposed (<xref ref-type="bibr" rid="B2">2</xref>).</p>
<p>Tuberculosis (TB) is a disease caused by the <italic>Mycobacterium tuberculosis</italic> bacillus, and the World Health Organization still considers it a global emergency because of its high estimate of more than 1.4 million fatalities in the last 3 years (<xref ref-type="bibr" rid="B3">3</xref>). In developing countries, TB incidence is as high as 282,000 new cases in recent years with a mortality rate of 2.4 per 100,000 populations. In one specific place, Colombia, the reported TB incidence was 33, the prevalence was 48, and the mortality was 1.6 per 100,000 populations. Given these numbers, any contribution to decreasing TB fatalities is welcomed. <italic>M. tuberculosis</italic> is slow-growing and replicates itself every 24 h, an important fact that determines subacute symptoms. Additionally, the main organ affected by TB is the lung, and because of this, the main signs of the disease are respiratory-related (<xref ref-type="bibr" rid="B3">3</xref>). Coughing and expectoration allow for assessing the probability of TB by studying sputum; however, because TB is an infectious disease, the accurate diagnosis is microbiological (<xref ref-type="bibr" rid="B4">4</xref>).</p>
<p>In the health area, AI has been applied to solve problems in public health, medical images analysis, and diagnosis support systems (<xref ref-type="bibr" rid="B5">5</xref>&#x02013;<xref ref-type="bibr" rid="B8">8</xref>). For TB, different approaches have been proposed since 1999 with the work of El-Solh et al. (<xref ref-type="bibr" rid="B9">9</xref>), for whom medical images were the main source of information. Advances in this field have allowed for better detecting thoracic diseases including TB, pneumonia, asthma, and cancer (<xref ref-type="bibr" rid="B10">10</xref>, <xref ref-type="bibr" rid="B11">11</xref>). Investigators have widely used specific ML models in health systems to contribute to improving TB diagnosis by taking advantage of available meaningful data (<xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B13">13</xref>), such as data from clinical information (<xref ref-type="bibr" rid="B14">14</xref>&#x02013;<xref ref-type="bibr" rid="B16">16</xref>), or molecular biology (<xref ref-type="bibr" rid="B17">17</xref>, <xref ref-type="bibr" rid="B18">18</xref>).</p>
<p>ANNs have been particularly valuable in incorporating ML into TB diagnosis through different architectures such as multilayer perceptrons (MLP), self-organizing maps, and adaptive resonance theory (ART) joined to fuzzy models in the Fuzzy-ART approach to support detection and clustering in risk groups for pulmonary TB (<xref ref-type="bibr" rid="B19">19</xref>&#x02013;<xref ref-type="bibr" rid="B21">21</xref>) and pleural TB (<xref ref-type="bibr" rid="B22">22</xref>&#x02013;<xref ref-type="bibr" rid="B24">24</xref>). Researchers have used different data sources to support health professionals in daily tasks such as collecting breathing acoustic signals (<xref ref-type="bibr" rid="B25">25</xref>) and other clinical variables (<xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B26">26</xref>).</p>
<p>Finally, TB researchers have used deep learning (DL) architecture using vast data sets to provide scenarios based on images (<xref ref-type="bibr" rid="B27">27</xref>&#x02013;<xref ref-type="bibr" rid="B29">29</xref>). For instance, one important task was establishing the <italic>ImageCLEF</italic> data set, which allowed users to determine TB type and treatment resistance using coaxial tomography images (<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B30">30</xref>); researchers have also used images from radiography to support health professionals&#x00027; decision making (<xref ref-type="bibr" rid="B31">31</xref>&#x02013;<xref ref-type="bibr" rid="B33">33</xref>). Generally, DL has been widely applied in assisting with medical diagnosis, utilizing radiography images, and obtaining highlight results (<xref ref-type="bibr" rid="B34">34</xref>, <xref ref-type="bibr" rid="B35">35</xref>). Additionally, one DL subfield, transfer learning, entails refining large pretrained models with new data, and several researchers have applied transfer learning to the same kinds of medical images (<xref ref-type="bibr" rid="B27">27</xref>, <xref ref-type="bibr" rid="B36">36</xref>).</p>
<p>Nevertheless, despite its demonstrable benefits, ML&#x00027;s effectiveness can be limited by data availability constraints related to inadequate information technology infrastructure. Precarious health systems that cannot or do not collect radiographic information or conduct specialized testing significantly complicate the implementation of ML models. Researchers have analyzed these characteristics and proposed infrastructure for developing regions that can accommodate few variables and poor information systems have been treated for developing regions (<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B21">21</xref>).</p>
<p>The present work proposes ML techniques as a tool in the loop of TB diagnosis, where health professionals make decisions but with extra help based on limited available data. This scenario is studied for using ML in situations with limited infrastructure for application within the complete TB diagnosis protocol.</p>
</sec>
<sec id="s2">
<title>Machine learning in the loop</title>
<p>The concept of the &#x0201C;algorithm-in-the-loop&#x0201D; is related to the use of ML models to support decision making and improve both human&#x02013;computer interactions and human performance (<xref ref-type="bibr" rid="B37">37</xref>). Interaction between the model and users in a loop is not limited to simple representations of performance such as numbers but extends to a global idea that articulates ethics, policies, and standards (<xref ref-type="bibr" rid="B38">38</xref>). Including AI and ML stages in the clinical decision making support workflow can ultimately improve patient experiences and outcomes and optimize health system performance (<xref ref-type="bibr" rid="B8">8</xref>). Interactive ML is another term for when algorithms and humans work together to improve the results in terms of metrics, understandability, and outcomes (<xref ref-type="bibr" rid="B39">39</xref>).</p>
<p>For the case of TB, diagnosis was long based on respiratory symptoms followed by testing suspicious patients with a serial sputum smear; however, although this test is simple, it is necessary to consider some aspects in determining its usefulness. Smear microscopy is performed using sputum smear and staining that allows direct microscopic visualization of the bacillus. However, diagnostic sensitivity is low, around 60%, because a high number of microorganisms per cubic millimeter of a sample is required to obtain results (<xref ref-type="bibr" rid="B40">40</xref>). Indeed, a high percentage of people with the disease cannot be diagnosed using this method, and furthermore, detected bacillus could be a non-TB <italic>mycobacterium</italic>. A more sensitive assay is a culture in either solid or liquid medium, which needs at least 2 weeks to obtain results (<xref ref-type="bibr" rid="B41">41</xref>). Following more recent advances, molecular testing is now available: Polymerase chain reaction (PCR) identifies the TB bacillus with high sensitivity and in approximately 2 h (<xref ref-type="bibr" rid="B42">42</xref>). However, the infrastructure for this technology is limited in developing countries such as Colombia.</p>
<p>From the ML point of view, different applications have particular characteristics such as requiring biomedical data that have high uncertainty and incompleteness (<xref ref-type="bibr" rid="B43">43</xref>), and strategies beyond straightforward ML are sometimes demanded. For the present study, ML in the loop (MLL) is investigated; this strategy depends on how the ML tool will be used. Researchers have analyzed the necessary workflows to improve results (<xref ref-type="bibr" rid="B44">44</xref>), but in medicine, where health professionals play an indispensable role, other investigators have studied the doctor-in-the-loop in terms of system performance (<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B46">46</xref>). Today, how ML models perform is no longer the sole concern; models&#x00027; generalizability and functionality during human interaction are also important. Assessing these broader aspects of performance allows for understanding important aspects of decision making and operation that must be considered in system designs (<xref ref-type="bibr" rid="B47">47</xref>).</p>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> depicts the MLL process for TB diagnosis support that was studied for the present work. First, a subject with respiratory disease symptoms arrives at the medical center for either a consultation or an emergency. There, a member of the medical staff examines the possible patient and then sends the patient to internal medicine for a more detailed examination. After this deeper analysis, if the patient&#x00027;s respiratory symptoms continue, medical staff request three main exams to detect pulmonary TB: sputum smear microscopy, sputum culture, and molecular assay (GenXpert&#x000AE;). If results from these three exams indicate infection, the patient begins antituberculosis therapy. Meanwhile the results are definitive, there is no positive diagnosis. However, the patient initialize the antituberculosis treatment. It is at this point where ML was applied to assist the medical staff members in diagnosis.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Schematic of using ML in TB diagnosis. During the TB diagnosis, ML tools are employed to support the decision about the antituberculosis therapy beginning.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpubh-10-876949-g0001.tif"/>
</fig>
<p>At the study facility, the health care workers are responsible for acquiring basic patient information equivalent to the medical records obtained in other stages. This information is input into a registry for the use of the institution&#x00027;s TB program; the protocol to detect TB can be time-consuming, and using ML with this registry could expedite diagnosis. This study proposed to apply MLL searches to support health care workers during the time the test results take. This allows staff to efficiently manage patient treatment according to the need for isolation, hospital capacity, and necessary medications.</p>
</sec>
<sec sec-type="materials and methods" id="s3">
<title>Materials and Methods</title>
<sec>
<title>Data set</title>
<p>Data were acquired through the TB program at <italic>Hospital Santa Clara</italic> (HSC) in Bogot&#x000E1; D.C., Colombia. The HSC is an important public institution associated with the <italic>Subred Integrada de Servicios de Salud Centro Oriente</italic> (SCO, Middle East Subnetwork of Health Services) that treats vulnerable populations with low socioeconomic status or high risk of sexually transmitted infections as well as persons who live in overcrowded conditions.</p>
<p>As explained earlier, the data were collected within the hospital&#x00027;s traditional TB diagnosis process. Information was considered from 233 clinical suspected pulmonary TB subjects whose data had been acquired in the period from January 2017 to December 2019. From this set, 184 subjects (79%) had TB confirmed and 36 subjects (15%) were determined to be disease-free based on smear microscopy, culture, and molecular examination following the national protocol to diagnose TB (<xref ref-type="bibr" rid="B48">48</xref>). Thirteen subjects were not considered because they had no available information on their TB status. The Ethics and Research Committee of the SCO approved this study on the basis of the use of anonymous data with only population-related variables that posed no risks to subjects. Informed consent was not required because all data were retrospective and anonymous.</p>
<p>At the HSC, electronic health records are used, but they are not standardized across the country; records can include diagnoses and symptoms of medical conditions such as diabetes, chronic kidney disease, and immunosuppression such as by the human immunodeficiency virus (HIV). Sociodemographic variables are also important for TB diagnosis (<xref ref-type="bibr" rid="B49">49</xref>), and the SCO commonly treats vulnerable populations such as persons who are indigenous, homeless, migrants, or refugees for TB. Although some of the data are available, the different information systems do not always communicate with each other. For this reason, only the variables that were available at the beginning of the TB program were applied for this study, as specified above. Using only these data allowed for simulating a scenario with limited information.</p>
<p>Health care workers at this point of TB diagnosis collect only seven variables, which were the ones considered in the present work: sex, age, type of population, city location, HIV/AIDS (acquired immunodeficiency syndrome) status, antiretroviral treatment status, and the number of days since treatment onset (see <xref ref-type="table" rid="T1">Table 1</xref>). Age and number of days were discrete numeric variables that were normalized by maximum of 100 and 15, respectively. Sex was a binary variable where a patient was either male or female, and this variable was set at 00 when no data were available. HIV and antiretroviral treatment status could take either of three possible values: positive, negative, or unknown. Finally, the type of population and city location were, respectively, coded with zeros and ones to reflect if a clinic visitor was a member of a specific vulnerable group and where in Bogot&#x000E1; City the client resided based on established geographic divisions.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Variables collected.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="left"><bold>Values</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Sex</td>
<td valign="top" align="left">Male</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Female</td>
</tr>
<tr>
<td valign="top" align="left">Age</td>
<td valign="top" align="left">Numeric: 0&#x02013;100</td>
</tr>
<tr>
<td valign="top" align="left">Type of population</td>
<td valign="top" align="left">Homeless</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Native</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Exile</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Immigrant</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Prison</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Violence Victim</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Other</td>
</tr>
<tr>
<td valign="top" align="left">City location</td>
<td valign="top" align="left">Antonio Nari&#x000F1;o</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Barrios Unidos</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Bosa</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Chapinero</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Ciudad Bol&#x000ED;var</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Engativ&#x000E1;</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Fontib&#x000F3;n</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Kennedy</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">La Candelaria</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Los M&#x000E1;rtires</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Puente Aranda</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Rafael Uribe Uribe</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">San Crist&#x000F3;bal</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Santa Fe</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Suba</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Teusaquillo</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Tunjuelito</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Usaqu&#x000E9;n</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Usme</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Out of Bogot&#x000E1; City</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Unknown</td>
</tr>
<tr>
<td valign="top" align="left">HIV/AIDS status</td>
<td valign="top" align="left">Yes</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">No</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Unknown</td>
</tr>
<tr>
<td valign="top" align="left">Antiretroviral treatment status</td>
<td valign="top" align="left">Yes</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">No</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Unknown</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Machine learning models</title>
<p>ML models are a set of algorithms that learn from data (<xref ref-type="bibr" rid="B50">50</xref>). For the present study, four MLL models were compared for their usefulness to health professionals and for the interactions between available features in the TB decision making process. In health sciences, logistic regression (LR) algorithms are widely applied to associate predictors or input variables to an output that represents a detection or estimation of the illness (<xref ref-type="bibr" rid="B41">41</xref>, <xref ref-type="bibr" rid="B51">51</xref>). To evaluate the present scenario, LR was the fifth model considered to determine the possible contribution of traditional tools. The optimization algorithm was based on a quasi-Newton method, the Broden&#x02013;Fletcher&#x02013;Goldfarb&#x02013;Shanno (<italic>lbfgs</italic>) approximation; additionally, penalization was used with a maximum of 100 iterations.</p>
<p>Classification or decision tree (DT) algorithms are trained through supervised learning and are considered a non-parametric method for classification or regression (<xref ref-type="bibr" rid="B52">52</xref>). DT structure is based on nodes and leaves, where each node is represented by a function that divides the information flow into two or more classes according to the function&#x00027;s output. For the present case, this function was based on the Gini coefficient. A notable advantage of this ML model is that it allows for visually determining the conditions for the input variables and the leaves. Random forest (RF) is a special DT model, in which more tree structures are analyzed and tested (<xref ref-type="bibr" rid="B53">53</xref>, <xref ref-type="bibr" rid="B54">54</xref>). Then, the best configuration of trees is selected for the classification or regression, according to a sample from the data set and avoiding model overfitting.</p>
<p>SVMs deal with the boundary between hyperplanes that divides the data classes from input variables represented in a features space (<xref ref-type="bibr" rid="B55">55</xref>, <xref ref-type="bibr" rid="B56">56</xref>). The hyperplanes are built from support vectors obtained from the training data and optimized according to the support vectors with the best performance. This model is widely applied with kernelling, modifying the initial non-linear separable space into a linear separation through a non-linear kernel that for the present case was Gaussian.</p>
<p>Finally, an MLP was applied as a model to detect the TB cases because the results were known in this specific problem (<xref ref-type="bibr" rid="B57">57</xref>). For this case, an architecture with one hidden layer was trained to detect TB. The number of input nodes was equal to the number of variables, and there was one output node. Resilient backpropagation was applied for training and stop criteria with a maximum of 500 epochs, zero gradients, and early stopping, the first time early stopping was considered.</p>
<p>Cross-validation was conducted to assess the performance and generalization of the models (<xref ref-type="bibr" rid="B58">58</xref>). Based on the special scenario under study, the mode of data acquisition, and the possibility of a system application in the future, the data were divided into three sets. This allowed for establishing the models based on 2 years of data that were validated and tested for generalizability in the third year. Through this process, the tool can be used using previous information with similar properties. <xref ref-type="table" rid="T2">Table 2</xref> shows these sets, the year of acquisition, and the number of instances per set.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Sets used for cross-validation.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Set</bold></th>
<th valign="top" align="center"><bold>Year</bold></th>
<th valign="top" align="center"><bold>TB positive</bold></th>
<th valign="top" align="center"><bold>TB negative</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">43</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">52</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">74</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">55</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">65</td>
</tr>
<tr>
<td valign="top" align="center" colspan="2">Total</td>
<td valign="top" align="center">141</td>
<td valign="top" align="center">41</td>
<td valign="top" align="center">182</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>A process to balance the classes was implemented, searching to adjust the inequality between positive and negative TB for the classes. In this case, a weighted training process of internal parameters for each model was regulated according to the frequency of the instances by class (<xref ref-type="bibr" rid="B59">59</xref>).</p>
</sec>
<sec>
<title>Variable analysis</title>
<p>Study variables were analyzed through the performance computation for each ML model under study. The variables in <xref ref-type="table" rid="T1">Table 1</xref> were converted to zero and then applied to the best trained of the DT, LR, RF, SVM, and MLP models. Subsequently, model performance metrics such as accuracy, sensitivity, and specificity were compared.</p>
</sec>
<sec>
<title>Automated machine learning</title>
<p>Automated ML (aML) was also tested to find the best models (<xref ref-type="bibr" rid="B60">60</xref>), and the Tree-based Pipeline Optimization Tool (TPOT) was applied to obtain the best detectors (<xref ref-type="bibr" rid="B61">61</xref>). This was carried out because of differences in the ML models&#x00027; performance. Here aML and TPOT were used to compare the individual models&#x00027; performance and to determine the influences of the ML model parameters in the search results.</p>
</sec>
</sec>
<sec sec-type="results" id="s4">
<title>Results</title>
<p><xref ref-type="table" rid="T3">Table 3</xref> shows the findings for the training process and the test scores with data from the year left out in the cross-validation described before; accuracy (ACC), sensitivity (SE), and specificity (SP) were collected to determine the differences due to the balance between positive and negative TB for each year (see <xref ref-type="table" rid="T2">Table 2</xref>). Additionally, the area under the receiver operating curve (AUC) allowed for considering SE and SP simultaneously.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Results for the ML models.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Validation year</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;" colspan="4"><bold>Training</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;" colspan="4"><bold>Test</bold></th>
</tr>
<tr>
<td/>
<td/>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">DT</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.75</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.22</td>
<td valign="top" align="center">0.53</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.68</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">0.59</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="center">0.96</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.75</td>
<td valign="top" align="center">0.60</td>
<td valign="top" align="center">0.68</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.60</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.32</td>
<td valign="top" align="center">0.63</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.60</td>
<td valign="top" align="center">0.77</td>
</tr>
<tr>
<td valign="top" align="left">LR</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.59</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.59</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.61</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.68</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.45</td>
<td valign="top" align="center">0.62</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.62</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.60</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="center">0.84</td>
</tr>
<tr>
<td valign="top" align="left">SVM</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.99</td>
<td valign="top" align="center">0.98</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.45</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.75</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">0.56</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.68</td>
<td valign="top" align="center">0.69</td>
<td valign="top" align="center">0.60</td>
<td valign="top" align="center">0.68</td>
</tr>
<tr>
<td valign="top" align="left">MLP</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.95</td>
<td valign="top" align="center">0.38</td>
<td valign="top" align="center">0.77</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">0.22</td>
<td valign="top" align="center">0.65</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">0.93</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.14</td>
<td valign="top" align="center">0.65</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">0.99</td>
<td valign="top" align="center">0.23</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.93</td>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">0.82</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1"><label>&#x0002A;</label><p><italic>AUC, Area Under Receiver Operative Curve</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The LR, RF, and MLP models achieved the best results, obtaining the highest AUC, 0.84, in the test set (see <xref ref-type="table" rid="T3">Table 3</xref>). This value can be compared with the maximum AUC of 0.96 in the DT model for the training set, demonstrating that it was difficult to generalize the findings from the present application.</p>
<p><xref ref-type="table" rid="T4">Table 4</xref> presents the ACC, SE, SP, and AUC means and standard deviations for the three test data subsets. The table shows that MLP obtained the best results for ACC, SE, and AUC and that SP was the best with the LR model. These findings suggest that combining models might give better results for these metrics. Nevertheless, although SP was the best with the LR, that model had the worst results for ACC and SE, which suggests this model&#x00027;s suitability for the objective task of finding negative TB cases. Finally, the SVM model gave the worst results for most metrics.</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>ML model results for the three test subsets.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">DT</td>
<td valign="top" align="center">0.70 &#x000B1; 0.040</td>
<td valign="top" align="center">0.79 &#x000B1; 0.001</td>
<td valign="top" align="center">0.39 &#x000B1; 0.037</td>
<td valign="top" align="center">0.60 &#x000B1; 0.005</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="center">0.74 &#x000B1; 0.069</td>
<td valign="top" align="center">0.83 &#x000B1; 0.001</td>
<td valign="top" align="center">0.42 &#x000B1; 0.025</td>
<td valign="top" align="center">0.67 &#x000B1; 0.008</td>
</tr>
<tr>
<td valign="top" align="left">LR</td>
<td valign="top" align="center">0.64 &#x000B1; 0.011</td>
<td valign="top" align="center">0.64 &#x000B1; 0.006</td>
<td valign="top" align="center">0.71 &#x000B1;0.054</td>
<td valign="top" align="center">0.69 &#x000B1; 0.017</td>
</tr>
<tr>
<td valign="top" align="left">SVM</td>
<td valign="top" align="center">0.64 &#x000B1; 0.001</td>
<td valign="top" align="center">0.72 &#x000B1; 0.001</td>
<td valign="top" align="center">0.40 &#x000B1; 0.030</td>
<td valign="top" align="center">0.56 &#x000B1; 0.013</td>
</tr>
<tr>
<td valign="top" align="left">MLP</td>
<td valign="top" align="center">0.77 &#x000B1;0.004</td>
<td valign="top" align="center">0.93 &#x000B1;0.003</td>
<td valign="top" align="center">0.25 &#x000B1; 0.017</td>
<td valign="top" align="center">0.71 &#x000B1;0.009</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN2"><label>&#x0002A;</label><p><italic>AUC, Area Under Receiver Operative Curve. The bold values are the highest values for each column</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T5">Table 5</xref> presents the best results for each metric for all the studied models and the full data set, showing that the LR model had the best accuracy, SVM had the best sensitivity, and MLP had the best specificity. Additionally, following subsection 3.3, all models were checked for relevance. Specifically, for each model, the input variables (see <xref ref-type="table" rid="T1">Table 1</xref>) were set at 0, and then, ACC, SE, and SP were computed. <xref ref-type="fig" rid="F2">Figure 2</xref> shows the effect of this processing, notably that type of population was not important in the LR, RF, and MLP models; when the zero values were eliminated, the models&#x00027; performance improved. <xref ref-type="fig" rid="F2">Figure 2D</xref> shows that age caused significant differences in the SVM model. Finally, all variables were relevant in the MLP model.</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>Best ML model results for the applied metrics and the full data set.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>DT</bold></th>
<th valign="top" align="center"><bold>RF</bold></th>
<th valign="top" align="center"><bold>LR</bold></th>
<th valign="top" align="center"><bold>SVM</bold></th>
<th valign="top" align="center"><bold>MLP</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Accuracy</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">0.80</td>
</tr>
<tr>
<td valign="top" align="left">Sensitivity</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.95</td>
<td valign="top" align="center">0.82</td>
</tr>
<tr>
<td valign="top" align="left">Specificity</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">0.68</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The bold values are the highest values for each column</italic>.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Sensitivity, accuracy, and specificity for all five ML models: <bold>(A)</bold> Logistic regression; <bold>(B)</bold> Classification tree; <bold>(C)</bold> Random forest; <bold>(D)</bold> Support vector machine; <bold>(E)</bold> Multilayer perceptron neural network. For all ML models is visualized the effect of using or not each one of the considered variables in terms of sensitivity (blue), specificity (green) and accuracy (orange). There it is possible to see how the metrics change, according to the inclusion or exclusion of the seven variables.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpubh-10-876949-g0002.tif"/>
</fig>
<p><xref ref-type="table" rid="T6">Table 6</xref> presents the findings from testing aML and TPOT, which require less intensive user exploration of the hyperparameters. The table shows that the automated ML was more successful than manual exploration (see <xref ref-type="table" rid="T3">Table 3</xref>), although the results were similar. The first model, for the year 2019, applied six ML models: two passive-aggressive, two MLPs, one extra tree, and one gradient boosting. The second model, for 2018, had 28 models that included a number of the different strategies presented here (e.g., MLP, RF, and logistic regressors). For the 2017 case, aML produced a combination of five models (two random forests, one mlp, one passive-aggressive, and one stochastic gradient descent). <xref ref-type="table" rid="T7">Table 7</xref> presents the aML and TPOT results for all 3 years. Specificity is considerably affected in this automatic generation of models, which is ineffective and not appropriate in the context of diagnosis support.</p>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p>Results for the auto ML models by year.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Validation year</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;" colspan="4"><bold>Training</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;" colspan="4"><bold>Test</bold></th>
</tr>
<tr>
<td/>
<td/>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;</sup></xref></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">AutoML</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.50</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.95</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.60</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">0.90</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.46</td>
<td valign="top" align="center">0.70</td>
</tr>
<tr>
<td valign="top" align="left">TPOT</td>
<td valign="top" align="center">2017</td>
<td valign="top" align="center">0.77</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0.50</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2018</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.86</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2019</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.84</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.50</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN3"><label>&#x0002A;</label><p><italic>AUC, Area Under Receiver Operative Curve</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T7">
<label>Table 7</label>
<caption><p>Results for the auto ML models for 3 years.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>AUC</bold><xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">AutoML</td>
<td valign="top" align="center">0.77 &#x000B1; 0.004</td>
<td valign="top" align="center">0.88 &#x000B1; 0.025</td>
<td valign="top" align="center">0.32 &#x000B1; 0.077</td>
<td valign="top" align="center">0.60 &#x000B1; 0.010</td>
</tr>
<tr>
<td valign="top" align="left">TPOT</td>
<td valign="top" align="center">0.78 &#x000B1; 0.003</td>
<td valign="top" align="center">0.90 &#x000B1; 0.026</td>
<td valign="top" align="center">0.33 &#x000B1; 0.333</td>
<td valign="top" align="center">0.62 &#x000B1; 0.043</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN4"><label>&#x0002A;</label><p><italic>AUC, Area Under Receiver Operative Curve</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec sec-type="discussion" id="s5">
<title>Discussion</title>
<p>TB detection in earlier stages is important to prevent transmission of the disease. However, irrespective of when a patient is diagnosed, patients in the populations studied in this work must be kept in isolation because these patients tend not to maintain safe distances as they are being treated.</p>
<p>Because of the lack of specific clinical symptoms, it is difficult for physicians to diagnose tuberculosis, but meanwhile, patients require rapid isolation to prevent spreading the disease to others. Presumptive TB cases require further analysis, and tools for completing specific tasks could reduce the workloads of health professionals. ML and AI could be effective in this context while keeping decisions under the purview of the medical staff. Furthermore, in developing or low-income countries such as Colombia, ML and AI can extend the availability of health care to remote regions with limited infrastructure and few if any health care personnel.</p>
<p>There remain many challenges to applying ML and AI in the health informatics field, but doing so can contribute to easing burdens for clinical personnel; further testing of these applications in real-world settings will be highly beneficial. Furthermore, the coworking between health professionals and health care AI is a challenge. The American Medical Association calls for considering AI an augmentation to human intelligence rather than a replacement (<xref ref-type="bibr" rid="B62">62</xref>). Recent authors have reported on developing this kind of articulation with health professionals as the center of the entire strategy (<xref ref-type="bibr" rid="B12">12</xref>).</p>
<p>In this study, the high incidence rate in the analyzed data set was related to the stage of the diagnosis process, although despite this, it is possible to see that not all presumptive TB cases were ultimately diagnosed as positive TB. This indicates that the ML tool identified variables that were imperceptible to humans, which could help improve therapy management as well as increase the efficient allocation of clinical resources (time, professional staff, medicaments, space, etc.). However, it was determined in this study that the unbalance between positive and negative TB cases could be offer a difficulty of the ML models training (<xref ref-type="bibr" rid="B59">59</xref>). However, the RF, LR, and MLP models achieved similar results for SE and SP, consistent with earlier findings for MLP models (<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B33">33</xref>, <xref ref-type="bibr" rid="B55">55</xref>); these findings support RF, LR, and MLP as appropriate models for diagnosis support. In the present study, MLP had the best AUC metric, which exhibits best balance between SE and SP. Additionally, the proposed models can decrease the number of cases for which treatment begins without a confirmed diagnosis, which should decrease health system costs in time and other resources. Regarding aML and TPOT, finding the hyperparameters was not a dilemma, but the SP results were not as good as they were with other models. Furthermore, it is common for health informatics applications to have access to only small data sets or represent only rare events, and these conditions significantly reduce the accuracy of the results from aML approaches (<xref ref-type="bibr" rid="B60">60</xref>, <xref ref-type="bibr" rid="B61">61</xref>).</p>
<p>Diagnostic algorithms have been incorporated into several national and international recommendations and guidelines for optimizing patient approaches. In the case of Colombia, health entities must notify the alert surveillance system of public health diseases, to epidemiologically monitor and clinically control TB to verify the success of the treatment. National TB registries allow for acquiring adequate global information on all the current clinical and sociodemographic aspects of TB as well as the success of the treatment strategies used.</p>
<p>In terms of limitations of the present study, there was a high incidence of TB in the data set, which could have induced bias in the analyzed data; addressing this will require more specific scenarios that involve clinical observation. Additionally, TB culture is considered the gold standard for diagnosis in some cases, especially when the infrastructure of GenExpert is not available. In this study, although the hospital database can only hold a limited number of patients, the HSC is an important center for TB treatment in Bogot&#x000E1; City; future researchers could incorporate data from more institutions that treat TB. Finally, researchers could incorporate more technical aspects such as including ensemble methods, combining different ML models, and considering more sophisticated models as the next steps.</p>
</sec>
<sec sec-type="conclusions" id="s6">
<title>Conclusions</title>
<p>The findings of this study make it possible to conclude that sensitive ML algorithms can support TB diagnosis by considering the clinical features of the cases as well as medical and sociodemographic risk factors of the patients. TB continues to be a global leading cause of death, and challenges remain in identifying, treating, and containing the disease in several communities. The mycobacteria&#x02013;host relationship can delay diagnosis for a host of reasons, as can limited clinical resources for diagnosis. Computational tools such as those studied here can support timely TB diagnosis and treatment.</p>
</sec>
<sec sec-type="data-availability" id="s7">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>Conceptualization: AO-C and CA. Methodology, supervision, and resources: AO-C and AJ. Software, writing&#x02014;original draft preparation, funding acquisition, and visualization: AO-C. Validation: AO-C, CA, EV, and AP. Formal analysis, investigation, and writing&#x02014;review and editing: AO-C, AJ, CA, EV, and AP. Data curation: AO-C, CA, and AJ. Project administration: AJ. All authors have read and agreed to the published version of the manuscript.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>This research was funded by <italic>Ministerio de Ciencia, Tecnologia e Innovaci&#x000F3;n of Colombia</italic>&#x02014;Minciencias, grant number 123380762899.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>The authors acknowledge the support of the <italic>Ministerio de Ciencia y Tecnolog</italic>&#x000ED;<italic>a&#x02013;Minciencias of Colombia</italic>, funded through project 123380762899. Additionally, Universidad Antonio Nari&#x000F1;o, the <italic>Subred Integrada de Servicios de Salud Centro Oriente</italic>, and Universidad del Rosario were relevant for the development of this work, according to the availability of computational resources and staff time dedicated to the authors team.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Panch</surname> <given-names>T</given-names></name> <name><surname>Szolovits</surname> <given-names>P</given-names></name> <name><surname>Atun</surname> <given-names>R</given-names></name></person-group>. <article-title>Artificial intelligence, machine learning and health systems</article-title>. <source>J Glob Health.</source> (<year>2018</year>) <volume>8</volume>:<fpage>020303</fpage>. <pub-id pub-id-type="doi">10.7189/jogh.08.020303</pub-id><pub-id pub-id-type="pmid">30405904</pub-id></citation></ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Witten</surname> <given-names>IH</given-names></name> <name><surname>Frank</surname> <given-names>E</given-names></name> <name><surname>Hall</surname> <given-names>MA</given-names></name> <name><surname>Pal</surname> <given-names>CJ</given-names></name></person-group>. <source>Data Mining: Practical Machine Learning Tools and Techniques</source>. <publisher-loc>New York, NY, USA</publisher-loc>: <publisher-name>Morgan Kaufmann</publisher-name> (<year>2016</year>).</citation>
</ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Annabel</surname> <given-names>B</given-names></name> <name><surname>Anna</surname> <given-names>D</given-names></name> <name><surname>Hannah</surname> <given-names>M</given-names></name></person-group>. <source>Global Tuberculosis Report 2019</source>. <publisher-loc>Geneva</publisher-loc>: <publisher-name>World Heal Organ</publisher-name> (<year>2019</year>).</citation>
</ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fogel</surname> <given-names>N</given-names></name></person-group>. <article-title>Tuberculosis: a disease without boundaries</article-title>. <source>Tuberculosis</source>. (<year>2015</year>) <volume>95</volume>:<fpage>527</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1016/j.tube.2015.05.017</pub-id><pub-id pub-id-type="pmid">26198113</pub-id></citation></ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wahl</surname> <given-names>B</given-names></name> <name><surname>Cossy-Gantner</surname> <given-names>A</given-names></name> <name><surname>Germann</surname> <given-names>S</given-names></name> <name><surname>Schwalbe</surname> <given-names>NR</given-names></name></person-group>. <article-title>Artificial intelligence (AI) and global health: how can AI contribute to health in resource-poor settings?</article-title> <source>BMJ Glob Heal</source>. (<year>2018</year>) <volume>3</volume>:<fpage>e000798</fpage>. <pub-id pub-id-type="doi">10.1136/bmjgh-2018-000798</pub-id><pub-id pub-id-type="pmid">30233828</pub-id></citation></ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>F</given-names></name> <name><surname>Jiang</surname> <given-names>Y</given-names></name> <name><surname>Zhi</surname> <given-names>H</given-names></name> <name><surname>Dong</surname> <given-names>Y</given-names></name> <name><surname>Li</surname> <given-names>H</given-names></name> <name><surname>Ma</surname> <given-names>S</given-names></name> <etal/></person-group>. <article-title>Artificial intelligence in healthcare: past, present and future</article-title>. <source>Stroke Vasc Neurol</source>. (<year>2017</year>) <volume>2</volume>:<fpage>230</fpage>&#x02013;<lpage>43</lpage>. <pub-id pub-id-type="doi">10.1136/svn-2017-000101</pub-id><pub-id pub-id-type="pmid">31670713</pub-id></citation></ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="web"><person-group person-group-type="author"><collab>For International Development, U.S.A. Artificial Intelligence in Global Health</collab></person-group> (<year>2019</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.usaid.gov/sites/default/files/documents/1864/AI-in-Global-Health_webFinal_508.pdf">https://www.usaid.gov/sites/default/files/documents/1864/AI-in-Global-Health_webFinal_508.pdf</ext-link></citation>
</ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>M</given-names></name> <name><surname>Decary</surname> <given-names>M</given-names></name></person-group>. <article-title>Artificial intelligence in healthcare: an essential guide for health leaders</article-title>. <source>Healthc Manage Forum.</source> (<year>2020</year>) <volume>33</volume>:<fpage>10</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1177/0840470419873123</pub-id><pub-id pub-id-type="pmid">31550922</pub-id></citation></ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>El-Solh</surname> <given-names>AA</given-names></name> <name><surname>Hsiao</surname> <given-names>C-B</given-names></name> <name><surname>Goodnough</surname> <given-names>S</given-names></name> <name><surname>Serghani</surname> <given-names>J</given-names></name> <name><surname>Grant</surname> <given-names>BJB</given-names></name></person-group>. <article-title>Predicting active pulmonary tuberculosis using an artificial neural network</article-title>. <source>Chest J</source>. (<year>1999</year>) <volume>116</volume>:<fpage>968</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1378/chest.116.4.968</pub-id><pub-id pub-id-type="pmid">10531161</pub-id></citation></ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Er</surname> <given-names>O</given-names></name> <name><surname>Yumusak</surname> <given-names>N</given-names></name> <name><surname>Temurtas</surname> <given-names>F</given-names></name></person-group>. <article-title>Chest diseases diagnosis using artificial neural networks</article-title>. <source>Expert Syst Appl</source>. (<year>2010</year>) <volume>37</volume>:<fpage>7648</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1016/j.eswa.2010.04.078</pub-id></citation>
</ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meraj</surname> <given-names>SS</given-names></name> <name><surname>Yaakob</surname> <given-names>R</given-names></name> <name><surname>Azman</surname> <given-names>A</given-names></name> <name><surname>Rum</surname> <given-names>SNM</given-names></name> <name><surname>Nazri</surname> <given-names>ASA</given-names></name></person-group>. <article-title>Artificial intelligence in diagnosing tuberculosis: a review</article-title>. <source>Int J Adv Sci Eng Inf Technol</source>. (<year>2019</year>) <volume>9</volume>:<fpage>81</fpage>&#x02013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.18517/ijaseit.9.1.7567</pub-id></citation>
</ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Awaysheh</surname> <given-names>A</given-names></name> <name><surname>Wilcke</surname> <given-names>J</given-names></name> <name><surname>Elvinger</surname> <given-names>F</given-names></name> <name><surname>Rees</surname> <given-names>L</given-names></name> <name><surname>Fan</surname> <given-names>W</given-names></name> <name><surname>Zimmerman</surname> <given-names>KL</given-names></name></person-group>. <article-title>Review of medical decision support and machine-learning methods</article-title>. <source>Vet Pathol</source>. (<year>2019</year>) <volume>56</volume>:<fpage>512</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1177/0300985819829524</pub-id><pub-id pub-id-type="pmid">30866728</pub-id></citation></ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michael</surname> <given-names>KY</given-names></name> <name><surname>Ma</surname> <given-names>J</given-names></name> <name><surname>Fisher</surname> <given-names>J</given-names></name> <name><surname>Kreisberg</surname> <given-names>JF</given-names></name> <name><surname>Raphael</surname> <given-names>BJ</given-names></name> <name><surname>Ideker</surname> <given-names>T</given-names></name></person-group>. <article-title>Visible machine learning for biomedicine</article-title>. <source>Cell.</source> (<year>2018</year>) <volume>173</volume>:<fpage>1562</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.05.056</pub-id><pub-id pub-id-type="pmid">29906441</pub-id></citation></ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whang</surname> <given-names>J</given-names></name> <name><surname>Wang</surname> <given-names>C</given-names></name> <name><surname>Wenyu</surname> <given-names>Z</given-names></name></person-group>. <article-title>Data analysis and forecasting of tuberculosis prevalence rates for smart healthcare based on a novel combination model</article-title>. <source>Appl Sci</source>. (<year>2018</year>) <volume>8</volume>:<fpage>1</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.3390/app8091693</pub-id></citation>
</ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nagabhushanam</surname> <given-names>D</given-names></name> <name><surname>Naresh</surname> <given-names>N</given-names></name> <name><surname>Raghunath</surname> <given-names>A</given-names></name> <name><surname>Praveen Kumar</surname> <given-names>K</given-names></name></person-group>. <article-title>Prediction of tuberculosis using data mining techniques on indian patients data</article-title>. <source>IJCST.</source> (<year>2013</year>) <volume>4</volume>:<fpage>262</fpage>&#x02013;<lpage>5</lpage>.</citation>
</ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>dos Santos Alves</surname> <given-names>E</given-names></name> <name><surname>Souza Filho</surname> <given-names>JBO</given-names></name> <name><surname>Galliez</surname> <given-names>RM</given-names></name> <name><surname>Kritski</surname> <given-names>A</given-names></name></person-group>. <article-title>Specialized MLP classifiers to support the isolation of patients suspected of pulmonary tuberculosis</article-title>. <source>In Proceedings of the Computational Intelligence and 11th Brazilian Congress on Computational Intelligence (BRICS-CCI &#x00026; CBIC).</source> 2013 BRICS Congress on (<year>2013</year>). p. <fpage>40</fpage>&#x02013;<lpage>5</lpage>.</citation>
</ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deelder</surname> <given-names>W</given-names></name> <name><surname>Christakoudi</surname> <given-names>S</given-names></name> <name><surname>Phelan</surname> <given-names>J</given-names></name> <name><surname>Benavente</surname> <given-names>ED</given-names></name> <name><surname>Campino</surname> <given-names>S</given-names></name> <name><surname>McNerney</surname> <given-names>R</given-names></name> <etal/></person-group>. <article-title>Machine learning predicts accurately Mycobacterium tuberculosis drug resistance from whole genome sequencing data</article-title>. <source>Front Genet</source>. (<year>2019</year>) <volume>10</volume>:<fpage>922</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00922</pub-id><pub-id pub-id-type="pmid">31616478</pub-id></citation></ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bobak</surname> <given-names>CA</given-names></name> <name><surname>Titus</surname> <given-names>AJ</given-names></name> <name><surname>Hill</surname> <given-names>JE</given-names></name></person-group>. <article-title>Comparison of common machine learning models for classification of tuberculosis using transcriptional biomarkers from integrated datasets</article-title>. <source>Appl Soft Comput</source>. (<year>2019</year>) <volume>74</volume>:<fpage>264</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1016/j.asoc.2018.10.005</pub-id></citation>
</ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Orjuela-Ca&#x000F1;&#x000F3;n</surname> <given-names>AD</given-names></name> <name><surname>Mendoza</surname> <given-names>JEC</given-names></name> <name><surname>Garc&#x000ED;a</surname> <given-names>CEA</given-names></name> <name><surname>Vela</surname> <given-names>EPV</given-names></name></person-group>. <article-title>Tuberculosis diagnosis support analysis for precarious health information systems</article-title>. <source>Comput Methods Programs Biomed.</source> (<year>2018</year>) 157:11-7. <pub-id pub-id-type="doi">10.1016/j.cmpb.2018.01.009</pub-id><pub-id pub-id-type="pmid">29477418</pub-id></citation></ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>E Souza</surname> <given-names>JBdO</given-names></name> <name><surname>Sanchez</surname> <given-names>M</given-names></name> <name><surname>de Seixas</surname> <given-names>JM</given-names></name> <name><surname>Maidantchik</surname> <given-names>C</given-names></name> <name><surname>Galliez</surname> <given-names>R</given-names></name> <name><surname>Moreira A da</surname> <given-names>SR</given-names></name> <etal/></person-group>. <article-title>Screening for active pulmonary tuberculosis: development and applicability of artificial neural network models</article-title>. <source>Tuberculosis.</source> (<year>2018</year>) <volume>111</volume>:<fpage>94</fpage>&#x02013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1016/j.tube.2018.05.012</pub-id><pub-id pub-id-type="pmid">30029922</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aguiar</surname> <given-names>FS</given-names></name> <name><surname>Torres</surname> <given-names>RC</given-names></name> <name><surname>Pinto</surname> <given-names>JVF</given-names></name> <name><surname>Kritski</surname> <given-names>AL</given-names></name> <name><surname>Seixas</surname> <given-names>JM</given-names></name> <name><surname>Mello</surname> <given-names>FCQ</given-names></name></person-group>. <article-title>Development of two artificial neural network models to support the diagnosis of pulmonary tuberculosis in hospitalized patients in Rio de Janeiro, Brazil</article-title>. <source>Med Biol Eng Comput</source>. (<year>2016</year>) <volume>54</volume>:<fpage>1751</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/s11517-016-1465-1</pub-id><pub-id pub-id-type="pmid">27016365</pub-id></citation></ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Orjuela-Ca&#x000F1;&#x000F3;n</surname> <given-names>AD</given-names></name> <name><surname>de Seixas</surname> <given-names>JM</given-names></name> <name><surname>Trajman</surname> <given-names>A</given-names></name></person-group>. <article-title>SOM Neural Networks as a Tool in Pleural Tuberculosis Diagnostic</article-title>. In: Braga AdeP, Bastos Filho CJA, Editors. <source>Proceedings of the Annals of the 11th Brazilian Congress on Computational Intelligence</source>. <publisher-loc>Porto de Galinhas, PE</publisher-loc>: <publisher-name>SBIC</publisher-name> (<year>2013</year>). p. <fpage>1</fpage>&#x02013;<lpage>5</lpage>.</citation>
</ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Orjuela-Canon</surname> <given-names>AD</given-names></name> <name><surname>De Seixas</surname> <given-names>J</given-names></name></person-group>. <article-title>Fuzzy-ART neural networks for triage in pleural tuberculosis</article-title>. <source>In Proceedings of the Pan American Health Care Exchanges, PAHCE.</source> (<publisher-loc>Medellin, Colombia</publisher-loc>) (<year>2013</year>). <pub-id pub-id-type="doi">10.1109/PAHCE.2013.6568342</pub-id></citation>
</ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seixas</surname> <given-names>JM</given-names></name> <name><surname>Faria</surname> <given-names>J</given-names></name> <name><surname>Souza</surname> <given-names>F</given-names></name> <name><surname>Vieira</surname> <given-names>AFM</given-names></name> <name><surname>Kritski</surname> <given-names>A</given-names></name> <name><surname>Trajman</surname> <given-names>A</given-names></name></person-group>. <article-title>Artificial neural network models to support the diagnosis of pleural tuberculosis in adult patients</article-title>. <source>Int J Tuberc Lung Dis</source>. (<year>2013</year>) <volume>17</volume>:<fpage>682</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.5588/ijtld.12.0829</pub-id><pub-id pub-id-type="pmid">23575336</pub-id></citation></ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Becker</surname> <given-names>KW</given-names></name> <name><surname>Scheffer</surname> <given-names>C</given-names></name> <name><surname>Blanckenberg</surname> <given-names>MM</given-names></name> <name><surname>Diacon</surname> <given-names>AH</given-names></name></person-group>. <article-title>Analysis of adventitious lung sounds originating from pulmonary tuberculosis</article-title>. In: <source>Proceedings of the Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE</source> (<year>2013</year>). p. <fpage>4334</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1109/EMBC.2013.6610505</pub-id><pub-id pub-id-type="pmid">24110692</pub-id></citation></ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Winarko</surname> <given-names>E</given-names></name></person-group>. <source>Review on Data Mining Methods for Tuberculosis Diagnosis. ISICO 2013</source> (<year>2013</year>).</citation>
</ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rajaraman</surname> <given-names>S</given-names></name> <name><surname>Antani</surname> <given-names>SK</given-names></name></person-group>. <article-title>Modality-specific deep learning model ensembles toward improving TB detection in chest radiographs</article-title>. <source>IEEE Access.</source> (<year>2020</year>) <volume>8</volume>:<fpage>27318</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2020.2971257</pub-id><pub-id pub-id-type="pmid">32257736</pub-id></citation></ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname> <given-names>XW</given-names></name> <name><surname>Qian</surname> <given-names>Y</given-names></name></person-group>. <article-title>Prediction of multidrug-resistant TB from CT pulmonary images based on deep learning techniques</article-title>. <source>Mol Pharm</source>. (<year>2017</year>) <volume>15</volume>:<fpage>4326</fpage>&#x02013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.1021/acs.molpharmaceut.7b00875</pub-id><pub-id pub-id-type="pmid">29257894</pub-id></citation></ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nash</surname> <given-names>M</given-names></name> <name><surname>Kadavigere</surname> <given-names>R</given-names></name> <name><surname>Andrade</surname> <given-names>J</given-names></name> <name><surname>Sukumar</surname> <given-names>CA</given-names></name> <name><surname>Chawla</surname> <given-names>K</given-names></name> <name><surname>Shenoy</surname> <given-names>VP</given-names></name> <etal/></person-group>. <article-title>Deep learning, computer-aided radiography reading for tuberculosis: a diagnostic accuracy study from a tertiary hospital in India</article-title>. <source>Sci Rep</source>. (<year>2020</year>) <volume>10</volume>:<fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-56589-3</pub-id><pub-id pub-id-type="pmid">31937802</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Cid</surname> <given-names>YD</given-names></name> <name><surname>Kalinovsky</surname> <given-names>A</given-names></name> <name><surname>Liauchuk</surname> <given-names>V</given-names></name> <name><surname>Kovalev</surname> <given-names>V</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>H</given-names></name></person-group>. <article-title>Overview of the imageclef 2017 tuberculosis task-predicting tuberculosis type and drug resistances</article-title>. <source>In: Proceedings of the CLEF (Working Notes)</source> (<publisher-loc>Dublin, Ireland</publisher-loc>) (<year>2017</year>).</citation>
</ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaeger</surname> <given-names>S</given-names></name> <name><surname>Karargyris</surname> <given-names>A</given-names></name> <name><surname>Candemir</surname> <given-names>S</given-names></name> <name><surname>Folio</surname> <given-names>L</given-names></name> <name><surname>Siegelman</surname> <given-names>J</given-names></name> <name><surname>Callaghan</surname> <given-names>F</given-names></name> <etal/></person-group>. <article-title>Automatic tuberculosis screening using chest radiographs</article-title>. <source>IEEE Trans Med Imaging.</source> (<year>2014</year>) <volume>33</volume>:<fpage>233</fpage>&#x02013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1109/TMI.2013.2284099</pub-id><pub-id pub-id-type="pmid">29959539</pub-id></citation></ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>M</given-names></name> <name><surname>Antani</surname> <given-names>S</given-names></name> <name><surname>Jaeger</surname> <given-names>S</given-names></name> <name><surname>Xue</surname> <given-names>Z</given-names></name> <name><surname>Candemir</surname> <given-names>S</given-names></name> <name><surname>Kohli</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Local-global classifier fusion for screening chest radiographs. in proceedings of the medical imaging 2017</article-title>. <source>Imag Inform Healthcare Res Appl</source>. (<year>2017</year>) <volume>10138</volume>:<fpage>101380A</fpage>. <pub-id pub-id-type="doi">10.1117/12.2252459</pub-id></citation>
</ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hwang</surname> <given-names>S</given-names></name> <name><surname>Kim</surname> <given-names>H-E</given-names></name> <name><surname>Jeong</surname> <given-names>J</given-names></name> <name><surname>Kim</surname> <given-names>H-J</given-names></name></person-group>. <article-title>A novel approach for tuberculosis screening based on deep convolutional neural networks</article-title>. <source>Proc Med Imag 2016 Comput Aided Diagn.</source> (<year>2016</year>) <volume>9785</volume>:<fpage>97852W</fpage>. <pub-id pub-id-type="doi">10.1117/12.2216198</pub-id></citation>
</ref>
<ref id="B34">
<label>34.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hwang</surname> <given-names>EJ</given-names></name> <name><surname>Park</surname> <given-names>S</given-names></name> <name><surname>Jin</surname> <given-names>K-N</given-names></name> <name><surname>Kim</surname> <given-names>JI</given-names></name> <name><surname>Choi</surname> <given-names>SY</given-names></name> <name><surname>Lee</surname> <given-names>JH</given-names></name> <etal/></person-group>. <article-title>Development and validation of a deep learning&#x02013;based automatic detection algorithm for active pulmonary tuberculosis on chest radiographs</article-title>. <source>Clin Infect Dis</source>. (<year>2019</year>) <volume>69</volume>:<fpage>739</fpage>&#x02013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.1093/cid/ciy967</pub-id><pub-id pub-id-type="pmid">30418527</pub-id></citation></ref>
<ref id="B35">
<label>35.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qin</surname> <given-names>ZZ</given-names></name> <name><surname>Sander</surname> <given-names>MS</given-names></name> <name><surname>Rai</surname> <given-names>B</given-names></name> <name><surname>Titahong</surname> <given-names>CN</given-names></name> <name><surname>Sudrungrot</surname> <given-names>S</given-names></name> <name><surname>Laah</surname> <given-names>SN</given-names></name> <etal/></person-group>. <article-title>Using artificial intelligence to read chest radiographs for tuberculosis detection: a multi-site evaluation of the diagnostic accuracy of three deep learning systems</article-title>. <source>Sci Rep</source>. (<year>2019</year>) <volume>9</volume>:<fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-51503-3</pub-id><pub-id pub-id-type="pmid">31628424</pub-id></citation></ref>
<ref id="B36">
<label>36.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Paul</surname> <given-names>HY</given-names></name> <name><surname>Kim</surname> <given-names>TK</given-names></name> <name><surname>Lin</surname> <given-names>CT</given-names></name></person-group>. <article-title>Generalizability of deep learning tuberculosis classifier to COVID-19 chest radiographs: new tricks for an old algorithm?</article-title> <source>J Thorac Imaging.</source> (<year>2020</year>) 35:W102-4. <pub-id pub-id-type="doi">10.1097/RTI.0000000000000532</pub-id><pub-id pub-id-type="pmid">32427650</pub-id></citation></ref>
<ref id="B37">
<label>37.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Green</surname> <given-names>B</given-names></name> <name><surname>Chen</surname> <given-names>Y</given-names></name></person-group>. <article-title>Disparate interactions: an algorithm-in-the-loop analysis of fairness in risk assessments</article-title>. In: <source>Proceedings of the Proceedings of the Conference on Fairness, Accountability, and Transparency</source> (<publisher-loc>New York, NY, USA</publisher-loc>) (<year>2019</year>). p. <fpage>90</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1145/3287560.3287563</pub-id></citation>
</ref>
<ref id="B38">
<label>38.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Green</surname> <given-names>B</given-names></name> <name><surname>Chen</surname> <given-names>Y</given-names></name></person-group>. <article-title>The principles and limits of algorithm-in-the-loop decision making</article-title>. <source>Proc ACM Human Comput Interact</source>. (<year>2019</year>) <volume>3</volume>:<fpage>1</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1145/3359152</pub-id></citation>
</ref>
<ref id="B39">
<label>39.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holzinger</surname> <given-names>A</given-names></name></person-group>. <article-title>Interactive machine learning for health informatics: when do we need the human-in-the-loop?</article-title> <source>Brain Inform.</source> (<year>2016</year>) <volume>3</volume>:<fpage>119</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1007/s40708-016-0042-6</pub-id><pub-id pub-id-type="pmid">27747607</pub-id></citation></ref>
<ref id="B40">
<label>40.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lewinsohn</surname> <given-names>DM</given-names></name> <name><surname>Leonard</surname> <given-names>MK</given-names></name> <name><surname>LoBue</surname> <given-names>PA</given-names></name> <name><surname>Cohn</surname> <given-names>DL</given-names></name> <name><surname>Daley</surname> <given-names>CL</given-names></name> <name><surname>Desmond</surname> <given-names>E</given-names></name> <etal/></person-group>. <article-title>Official american thoracic society/infectious diseases society of america/centers for disease control and prevention clinical practice guidelines: diagnosis of tuberculosis in adults and children</article-title>. <source>Clin Infect Dis</source>. (<year>2017</year>) <volume>64</volume>:<fpage>e1</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1093/cid/ciw694</pub-id><pub-id pub-id-type="pmid">28052967</pub-id></citation></ref>
<ref id="B41">
<label>41.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghazvini</surname> <given-names>K</given-names></name> <name><surname>Yousefi</surname> <given-names>M</given-names></name> <name><surname>Firoozeh</surname> <given-names>F</given-names></name> <name><surname>Mansouri</surname> <given-names>S</given-names></name></person-group>. <article-title>Predictors of tuberculosis: Application of a logistic regression model</article-title>. <source>Gene Rep.</source> (<year>2019</year>) <volume>17</volume>:<fpage>100527</fpage>. <pub-id pub-id-type="doi">10.1016/j.genrep.2019.100527</pub-id></citation>
</ref>
<ref id="B42">
<label>42.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berra</surname> <given-names>TZ</given-names></name> <name><surname>Gomes</surname> <given-names>D</given-names></name> <name><surname>Ramos</surname> <given-names>ACV</given-names></name> <name><surname>Alves</surname> <given-names>YM</given-names></name> <name><surname>Bruce</surname> <given-names>ATI</given-names></name> <name><surname>Arroyo</surname> <given-names>LH</given-names></name> <etal/></person-group>. <article-title>Effectiveness and trend forecasting of tuberculosis diagnosis after the introduction of GeneXpert in a city in south-eastern Brazil</article-title>. <source>PLoS ONE.</source> (<year>2021</year>) <volume>16</volume>:<fpage>e0252375</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0252375</pub-id><pub-id pub-id-type="pmid">34048490</pub-id></citation></ref>
<ref id="B43">
<label>43.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Holzinger</surname> <given-names>A</given-names></name></person-group>. <source>Biomedical Informatics: Discovering Knowledge in Big Data</source>. <publisher-loc>Graz, Austria</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2014</year>). <pub-id pub-id-type="doi">10.1007/978-3-319-04528-3</pub-id></citation>
</ref>
<ref id="B44">
<label>44.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Xin</surname> <given-names>D</given-names></name> <name><surname>Ma</surname> <given-names>L</given-names></name> <name><surname>Liu</surname> <given-names>J</given-names></name> <name><surname>Macke</surname> <given-names>S</given-names></name> <name><surname>Song</surname> <given-names>S</given-names></name> <name><surname>Parameswaran</surname> <given-names>A</given-names></name></person-group>. <article-title>Accelerating human-in-the-loop machine learning: challenges and opportunities</article-title>. In: <source>Proceedings of the Proceedings of the Second Workshop on Data Management for End-To-End Machine Learning</source> (<publisher-loc>New York, NY, USA</publisher-loc>) (<year>2018</year>). p. <fpage>1</fpage>&#x02013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1145/3209889.3209897</pub-id></citation>
</ref>
<ref id="B45">
<label>45.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Holzinger</surname> <given-names>A</given-names></name></person-group>. <source>Trends in Interactive Knowledge Discovery For Personalized Medicine: Cognitive Science Meets Machine Learning</source> (<year>2014</year>).</citation>
</ref>
<ref id="B46">
<label>46.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Robert</surname> <given-names>S</given-names></name> <name><surname>B&#x000FC;ttner</surname> <given-names>S</given-names></name> <name><surname>R&#x000F6;cker</surname> <given-names>C</given-names></name> <name><surname>Holzinger</surname> <given-names>A</given-names></name></person-group>. <article-title>Reasoning under uncertainty: Towards collaborative interactive machine learning</article-title>. In: <source>Machine Learning for Health Informatics</source>. <publisher-loc>Springer</publisher-loc> (<year>2016</year>). p. <fpage>357</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-50478-0_18</pub-id></citation>
</ref>
<ref id="B47">
<label>47.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nay</surname> <given-names>J</given-names></name> <name><surname>Strandburg</surname> <given-names>KJ</given-names></name></person-group>. <article-title>Generalizability: Machine Learning and Humans-in-the-Loop</article-title>. In: <source>Res. Handb. BIG DATA LAW (rol. Vogl, ed., Edward Elgar, 2020 Forthcoming)</source> (<year>2019</year>). p. <fpage>20</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.2139/ssrn.3417436</pub-id></citation>
</ref>
<ref id="B48">
<label>48.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>de Salud</surname> <given-names>IN</given-names></name></person-group>. <source>Tuberculosis: Protocolo de Vigilancia en Salud P&#x000FA;blica</source>. <publisher-loc>Colombia</publisher-loc>: <publisher-name>Instituto acional de Salud</publisher-name> (<year>2020</year>).</citation>
</ref>
<ref id="B49">
<label>49.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parsons</surname> <given-names>LM</given-names></name> <name><surname>Somosk&#x000F6;vi</surname> <given-names>&#x000C1;</given-names></name> <name><surname>Gutierrez</surname> <given-names>C</given-names></name> <name><surname>Lee</surname> <given-names>E</given-names></name> <name><surname>Paramasivan</surname> <given-names>CN</given-names></name> <name><surname>Abimiku</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Laboratory diagnosis of tuberculosis in resource-poor countries: challenges and opportunities</article-title>. <source>Clin Microbiol Rev</source>. (<year>2011</year>) <volume>24</volume>:<fpage>314</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1128/CMR.00059-10</pub-id><pub-id pub-id-type="pmid">21482728</pub-id></citation></ref>
<ref id="B50">
<label>50.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calamuneri</surname> <given-names>A</given-names></name> <name><surname>Donato</surname> <given-names>L</given-names></name> <name><surname>Scimone</surname> <given-names>C</given-names></name> <name><surname>Costa</surname> <given-names>A</given-names></name> <name><surname>D&#x00027;Angelo</surname> <given-names>R</given-names></name> <name><surname>Sidoti</surname> <given-names>A</given-names></name></person-group>. <article-title>On Machine Learning in Biomedicine</article-title>. <source>Life Saf Secur.</source> (<year>2017</year>) <volume>5</volume>:<fpage>96</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.12882/2283-7604.2017.5.12</pub-id></citation>
</ref>
<ref id="B51">
<label>51.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohene</surname> <given-names>S-A</given-names></name> <name><surname>Fordah</surname> <given-names>S</given-names></name> <name><surname>Boni</surname> <given-names>P</given-names></name></person-group>. <article-title>Dela Childhood tuberculosis and treatment outcomes in Accra: a retrospective analysis</article-title>. <source>BMC Infect Dis</source>. (<year>2019</year>) <volume>19</volume>:<fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1186/s12879-019-4392-6</pub-id><pub-id pub-id-type="pmid">31455234</pub-id></citation></ref>
<ref id="B52">
<label>52.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cruz</surname> <given-names>APD</given-names></name> <name><surname>Tumibay</surname> <given-names>GM</given-names></name></person-group>. <article-title>Predicting tuberculosis treatment relapse: a decision tree analysis of J48 for data mining</article-title>. <source>J Comput Commun</source>. (<year>2019</year>) <volume>7</volume>:<fpage>243</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.4236/jcc.2019.77020</pub-id></citation>
</ref>
<ref id="B53">
<label>53.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>Y</given-names></name> <name><surname>Wang</surname> <given-names>H</given-names></name> <name><surname>Wu</surname> <given-names>F</given-names></name></person-group>. <article-title>Automatic classification of pulmonary tuberculosis and sarcoidosis based on random forest</article-title>. In: <source>Proceedings of the 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics</source> (<publisher-loc>CISP-BMEI</publisher-loc>) (Piscataway, New Jersey) (<year>2017</year>). p. <fpage>1</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1109/CISP-BMEI.2017.8302280</pub-id><pub-id pub-id-type="pmid">27295638</pub-id></citation></ref>
<ref id="B54">
<label>54.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sugirtha</surname> <given-names>GE</given-names></name> <name><surname>Murugesan</surname> <given-names>G</given-names></name></person-group>. <article-title>Detection of tuberculosis bacilli from microscopic sputum smear images</article-title>. In: <source>Proceedings of the 2017 Third International Conference on Biosignals, Images and Instrumentation (ICBSII)</source> (<publisher-loc>Red Hook, NY, USA</publisher-loc>) (<year>2017</year>). p. <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1109/ICBSII.2017.8082271</pub-id></citation>
</ref>
<ref id="B55">
<label>55.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Yahiaoui</surname> <given-names>A</given-names></name> <name><surname>Er</surname> <given-names>O</given-names></name> <name><surname>Yumusak</surname> <given-names>N</given-names></name></person-group>. <article-title>A new method of automatic recognition for tuberculosis disease diagnosis using support vector machines</article-title>. <source>Biomed Res</source>. (<year>2017</year>) 28:4208-12.</citation>
</ref>
<ref id="B56">
<label>56.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zulvia</surname> <given-names>FE</given-names></name> <name><surname>Kuo</surname> <given-names>RJ</given-names></name> <name><surname>Roflin</surname> <given-names>E</given-names></name></person-group>. <article-title>An Initial Screening Method for Tuberculosis Diseases Using a Multi-objective Gradient Evolution-Based Support Vector Machine and C5</article-title>. 0 Decision Tree. In: <italic>Proceedings of the 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC)</italic> (<year>2017</year>). p. <fpage>204</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1109/COMPSAC.2017.57</pub-id></citation>
</ref>
<ref id="B57">
<label>57.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>MT</given-names></name> <name><surname>Kaushik</surname> <given-names>AC</given-names></name> <name><surname>Ji</surname> <given-names>L</given-names></name> <name><surname>Malik</surname> <given-names>SI</given-names></name> <name><surname>Ali</surname> <given-names>S</given-names></name> <name><surname>Wei</surname> <given-names>D-Q</given-names></name></person-group>. <article-title>Artificial neural networks for prediction of tuberculosis disease</article-title>. <source>Front Microbiol</source>. (<year>2019</year>) <volume>10</volume>:<fpage>395</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2019.00395</pub-id><pub-id pub-id-type="pmid">30886608</pub-id></citation></ref>
<ref id="B58">
<label>58.</label>
<citation citation-type="book"><person-group person-group-type="author"><collab>Haykin S</collab></person-group>. <source>Neural Networks and Learning Machines. Neural networks and learning machines.</source> Prentice Hall (<year>2009</year>). ISBN 978-0-13-147139-9.</citation>
</ref>
<ref id="B59">
<label>59.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>W</given-names></name> <name><surname>Huang</surname> <given-names>Z</given-names></name> <name><surname>Li</surname> <given-names>S</given-names></name> <name><surname>Jia</surname> <given-names>Y</given-names></name></person-group>. <article-title>Distribution-sensitive unbalanced data oversampling method for medical diagnosis</article-title>. <source>J Med Syst</source>. (<year>2019</year>) <volume>43</volume>:<fpage>39</fpage>. <pub-id pub-id-type="doi">10.1007/s10916-018-1154-8</pub-id><pub-id pub-id-type="pmid">30631957</pub-id></citation></ref>
<ref id="B60">
<label>60.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Feurer</surname> <given-names>M</given-names></name> <name><surname>Klein</surname> <given-names>A</given-names></name> <name><surname>Eggensperger</surname> <given-names>K</given-names></name> <name><surname>Springenberg</surname> <given-names>J</given-names></name> <name><surname>Blum</surname> <given-names>M</given-names></name> <name><surname>Hutter</surname> <given-names>F</given-names></name></person-group>. <article-title>Efficient and Robust Automated Machine Learning. In: Cortes C, Lawrence ND, Lee DD, Sugiyama M, Garnett R, Editors</article-title>. <source>Advances in Neural Information Processing Systems.</source> Curran Associates Inc. (<year>2015</year>). p. <fpage>2962</fpage>&#x02013;<lpage>70</lpage>.</citation>
</ref>
<ref id="B61">
<label>61.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Olson</surname> <given-names>RS</given-names></name> <name><surname>Bartley</surname> <given-names>N</given-names></name> <name><surname>Urbanowicz</surname> <given-names>RJ</given-names></name> <name><surname>Moore</surname> <given-names>JH</given-names></name></person-group>. <article-title>Evaluation of a tree-based pipeline optimization tool for automating data science</article-title>. In: <source>Proceedings of the Proceedings of the Genetic and Evolutionary Computation Conference 2016</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Association for Computing Machinery</publisher-name> (<year>2016</year>). p. <fpage>485</fpage>&#x02013;<lpage>92</lpage>. <pub-id pub-id-type="doi">10.1145/2908812.2908918</pub-id></citation>
</ref>
<ref id="B62">
<label>62.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Association</surname> <given-names>TAM</given-names></name></person-group>. <source>AMA: Put augmented Intelligence in Practice of Medicine</source> (<year>2020</year>).</citation>
</ref>
</ref-list> 
</back>
</article>