<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Aging Neurosci.</journal-id>
<journal-title>Frontiers in Aging Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Aging Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1663-4365</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnagi.2022.888575</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Novel Key Features Screening Method Based on Extreme Learning Machine for Alzheimer&#x2019;s Disease Study</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Lu</surname> <given-names>Jia</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1330786/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Zeng</surname> <given-names>Weiming</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/576792/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Lu</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1812210/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Shi</surname> <given-names>Yuhu</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/717298/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Laboratory of Digital Image and Intelligent Computation, Shanghai Maritime University</institution>, <addr-line>Shanghai</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Basic Experiment and Training Center, Shanghai Maritime University</institution>, <addr-line>Shanghai</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>College of Information Engineering Shanghai Maritime University</institution>, <addr-line>Shanghai</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: M.Tanveer, Indian Institute of Technology Indore, India</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Hong Wang, Central South University, China; Ponnuthurai Suganthan, Nanyang Technological University, Singapore</p></fn>
<corresp id="c001">&#x002A;Correspondence: Weiming Zeng, <email>zengwm86@163.com</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Alzheimer&#x2019;s Disease and Related Dementias, a section of the journal Frontiers in Aging Neuroscience</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>05</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>14</volume>
<elocation-id>888575</elocation-id>
<history>
<date date-type="received">
<day>03</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>25</day>
<month>04</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Lu, Zeng, Zhang and Shi.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Lu, Zeng, Zhang and Shi</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The Extreme Learning Machine (ELM) is a simple and efficient Single Hidden Layer Feedforward Neural Network(SLFN) algorithm. In recent years, it has been gradually used in the study of Alzheimer&#x2019;s disease (AD). When using ELM to diagnose AD based on high-dimensional features, there are often some features that have no positive impact on the diagnosis, while others have a significant impact on the diagnosis. In this paper, a novel Key Features Screening Method based on Extreme Learning Machine (KFS-ELM) is proposed. It can screen for key features that are relevant to the classification (diagnosis). It can also assign weights to key features based on their importance. We designed an experiment to screen for key features of AD. A total of 920 key functional connections screened from 4005 functional connections. Their weights were also obtained. The results of the experiment showed that: (1) Using all (4,005) features to diagnose AD, the accuracy is 95.33%. Using 920 key features to diagnose AD, the accuracy is 99.20%. The 3,085 (4,005 - 920) features that were screened out had a negative effect on the diagnosis of AD. This indicates the KFS-ELM is effective in screening key features. (2) The higher the weight of the key features and the smaller their number, the greater their impact on AD diagnosis. This indicates that the KFS-ELM is rational in assigning weights to the key features for their importance. Therefore, KFS-ELM can be used as a tool for studying features and also for improving classification accuracy.</p>
</abstract>
<kwd-group>
<kwd>fMRI</kwd>
<kwd>brain functional connectivity</kwd>
<kwd>extreme learning machine</kwd>
<kwd>AD</kwd>
<kwd>KFS-ELM</kwd>
</kwd-group>
<contract-num rid="cn001">31870979</contract-num>
<contract-sponsor id="cn001">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content></contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="7"/>
<equation-count count="13"/>
<ref-count count="40"/>
<page-count count="14"/>
<word-count count="9664"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Alzheimer&#x2019;s disease (AD) is a neurodegenerative disease. Compared to cognitively normal individuals, the AD patient&#x2019;s brain undergoes morphological or functional changes. For example, <xref ref-type="bibr" rid="B15">Kazemifar et al. (2014)</xref> found significant atrophy of the temporal cortex in AD patients. <xref ref-type="bibr" rid="B32">Supekar et al. (2008)</xref> used resting-state fMRI to find reduced local efficiency of functional brain networks in AD patients. <xref ref-type="bibr" rid="B2">Buckner et al. (2009)</xref> used resting-state fMRI data to find that the distribution of core nodes in the functional brain network overlaps highly with the brain regions where A&#x00DF; amyloid is deposited in AD patients. It means that the brain connection hub area is vulnerable to receive attacks. <xref ref-type="bibr" rid="B40">Zhou et al. (2012)</xref> used resting-state functional brain networks to predict the course of developmental changes in AD pathology. These studies above not only provide experimental evidence for the brain region disconnection hypothesis of AD from the perspective of functional integration, but also provide an explanation for the abnormalities in functional integration of the AD brain.</p>
<p>The AD Neuroimaging Initiative (ADNI), led by Dr. Weiner, was launched in 2004. This is a long-term research plan jointly composed of several institutions. It integrates the database of multi-center and cross-disciplinary longitudinal studies carried out by various methods. Such as clinical cognitive function evaluation, neuroimaging examination, and detection of molecular biological markers of cerebrospinal fluid and blood (<xref ref-type="bibr" rid="B33">Susanne et al., 2005</xref>). Its primary goal is to explore the patterns of relationships between clinical, cognitive, imaging, genetic and biomarker features as the disease progresses. It also has the goal of establishing standardized methods for imaging/biomarker collection and analysis, and ultimately for use in clinical research.</p>
<p>In 2004, <xref ref-type="bibr" rid="B12">Huang et al. (2004)</xref> proposed the Extreme Learning Machine (ELM). ELM generates random input weights and uses the Moore-Penrose pseudo-inverse (MPP) to calculate the output weights. It is a feed-forward neural network based on randomization(RFNN). The RFNN was introduced by <xref ref-type="bibr" rid="B29">Schmidt et al. (1992)</xref> and <xref ref-type="bibr" rid="B31">Suganthan and Katuwal (2021)</xref>. Methods using random input weights and MPP also include random vector function linked neural networks (RVFL) (<xref ref-type="bibr" rid="B24">Pao et al., 1994</xref>), and a method proposed by <xref ref-type="bibr" rid="B4">Guo et al. (1995)</xref>.</p>
<p>The ELM be widely used in many fields such as disease diagnosis, traffic sign recognition, image quality assessment and so on (<xref ref-type="bibr" rid="B3">Chyzhyk et al., 2015</xref>; <xref ref-type="bibr" rid="B14">Huang et al., 2016</xref>; <xref ref-type="bibr" rid="B37">Wang et al., 2016</xref>). ELM strives to solve the research problems in machine learning fields such as regression, classification, supervised learning and unsupervised learning under a single framework (<xref ref-type="bibr" rid="B6">Huang et al., 2014</xref>). From the perspective of learning efficiency, ELM is concise to implement, with extremely high learning speeds and less human intervention. From the perspective of theoretical studies, ELM can still maintain SLFN&#x2019;s interpolation ability (<xref ref-type="bibr" rid="B14">Huang et al., 2016</xref>), general approximation ability (<xref ref-type="bibr" rid="B8">Huang and Chen, 2007</xref>) and classification ability (<xref ref-type="bibr" rid="B11">Huang et al., 2012</xref>) even in the case of randomly generating hidden layer neuron parameters. From the perspective of structural risk minimization, the VC dimensionality (Vapnik-Chervonenkis dimensionality) of ELM depends on the number of neurons in the hidden layer (<xref ref-type="bibr" rid="B19">Liu et al., 2012</xref>). The size of the VC dimensionality can be controlled by adjusting the number of neurons in the hidden layer of ELM, to make a compromise between training error and model complexity, and get the optimal generalization performance. ELM has also been extended to a deep learning model (<xref ref-type="bibr" rid="B34">Tang et al., 2015</xref>; <xref ref-type="bibr" rid="B16">Kim et al., 2017</xref>), and made a lot of research results.</p>
<p>In recent years, RFNN has also been gradually used for AD studies based on medical images. <xref ref-type="bibr" rid="B30">Sharma et al. (2021)</xref> proposed that the FAF-DRVFL method achieved 86.67% accuracy in classifying AD with CN. Malik et al. (2022) proposed the IFRVFL method for diagnosing AD. Its performance is better than standard ELM and RVFL. <xref ref-type="bibr" rid="B17">Lama and Kwon (2021)</xref> proposed the ElM+Graph embedding method achieved 90.93% accuracy in classifying AD, MCI, CN. <xref ref-type="bibr" rid="B23">Nguyen et al. (2019)</xref> proposed the MVPA+ELM method achieved 98.86% accuracy in classifying AD, MCI, CN. The RFNN is often used as a classifier in these studies. It is mainly used to process selected low-dimensional features. Few studies have used RFNN as a feature screening method.</p>
<p>The ELM classifier has redundant hidden layer nodes when the number of hidden layer nodes is large enough. <xref ref-type="bibr" rid="B26">Rong et al. (2008)</xref> proposed the method Pruned-Extreme Learning Machine (P-ELM), as well as <xref ref-type="bibr" rid="B21">Miche et al. (2010</xref>, <xref ref-type="bibr" rid="B22">2011)</xref> improved the P-ELM method, which was used to pruning of the hidden layer nodes with the aim of making the ELM classifier more compact, with high speed and more robustness. But it cannot improve the accuracy of the classifier. In ELM classifiers with high feature dimensionality, some input nodes have no positive effect on classification. Useless input nodes are removed using the idea of P-ELM. It can screen features that have an impact on the classification. It may even improve the accuracy of the classifier.</p>
<p>We analyzed the state of machine learning techniques in diagnosing AD (<xref ref-type="bibr" rid="B35">Tanveer et al., 2020</xref>). We found no method to prune the network to extract key features. Some of the work involving AD feature extraction (<xref ref-type="bibr" rid="B1">Bi et al., 2018</xref>; <xref ref-type="bibr" rid="B25">Richhariya et al., 2020</xref>; <xref ref-type="bibr" rid="B5">Hao et al., 2021</xref>; <xref ref-type="bibr" rid="B28">Sadiq et al., 2021</xref>) have achieved good results in terms of classification accuracy, but none have assigned weights to the importance of these features. In this study, we propose a novel key features screening method based on extreme learning machine (KFS-ELM) to screen key features of AD. It is a data-driven approach that is not based on empirical assumptions or prerequisites. It prunes the ELM classifier to determine the relationship between each feature and AD, screens the key features, and identifies their importance. This will make the key features more intuitive, facilitate the study of the patterns behind AD, and benefit the construction of better classifiers to diagnose AD. The KFS-ELM can be used not only for studying AD, but also for other research fields where the studied subjects have a high dimension of features.</p>
<p>The rest of this article is organized as follows. The section &#x201C;Materials and Methods&#x201D; describes the &#x201C;Brain Functional Connectivity Network,&#x201D; &#x201C;ELM,&#x201D; and &#x201C;KFS-ELM method.&#x201D; The section &#x201C;Experiment&#x201D; introduces the experimental procedure, experimental environment, data preparation, screening of key features by the KFS-ELM method, and validation of key features. The section &#x201C;Results&#x201D; presents the streamlining ability of the KFS-ELM method, the distribution of the screened key features, and the effect of the key features on the classification of the ELM classifier. The section &#x201C;Discussion&#x201D; discusses the limitations of the ELM method and the advantages of the KFS-ELM method, and analyzes the key features. The section &#x201C;Conclusion&#x201D; summarizes the work of this study.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Brain Functional Connectivity</title>
<p>Brain functional connectivity network is a mathematical representation defined by a set of nodes and edges (<xref ref-type="bibr" rid="B27">Rubinov and Sporns, 2010</xref>; <xref ref-type="bibr" rid="B18">Liu et al., 2016</xref>). These nodes represent brain regions on different scales. The temporal correlations (functional connectivity) between the fMRI time course of these nodes form the edge of the brain&#x2019;s functional network. The smaller the size of a node, the greater the number of nodes and edges. The more complex the described pattern of neural activity in the brain, the more difficult it is to calculate and analyze. Researchers often divide the brain into regions or nodes by using atlas. automated anatomical labeling (AAL) (<xref ref-type="bibr" rid="B36">Tzourio-Mazoyer et al., 2002</xref>) is one of the most commonly used atlas. Templates is one of the most commonly used templates. AAL divides the brain into 116 regions, including 90 regions of the cerebrum and 26 regions of the cerebellum. The feature measures adopted in this paper is the whole cerebrum functional connectivity network. It includes 4005 functional connectivity features.</p>
</sec>
<sec id="S2.SS2">
<title>Extreme Learning Machine</title>
<p>Extreme learning machines belong to single hidden layer feed forward neural networks (SLFNs) and have the characteristics of single hidden layer neural networks. (1) Implement complex nonlinear mapping directly from the input layer. (2) It can provide appropriate classification model for large category data sets. Compared with other single hidden layer neural network models, the speed of model training and classification is faster. <xref ref-type="bibr" rid="B7">Huang and Babri (1998)</xref> indicated that the input layer weights and hidden layer bias values of other SLFNS networks need to be iteratively adjusted to fit the current training data. When there are a large number of hidden layer nodes, this calculation can lead to significant computational time consumption (<xref ref-type="bibr" rid="B9">Huang and Chen, 2008</xref>; <xref ref-type="bibr" rid="B10">Huang et al., 2011</xref>). At the same time, since gradient descent has become an effective method to solve SLFNs, this method not only limits the solving speed, but also can easily fall into the local minimum from the calculation principle of the calculation method. Aiming at the above problems, <xref ref-type="bibr" rid="B13">Huang et al. (2006)</xref> proposed the algorithm of extreme learning machine. It transformed the iterative solution method into the solution method of linear equations by randomly specifying the weight and bias values of the input layer, and finally obtained the analytical solution of the network. It can be quickly solved on the premise of ensuring the accuracy of calculation.</p>
<p>Extreme learning machine can be described as: given N arbitrary samples {<italic>X</italic><sub><italic>i</italic></sub>,<italic>t</italic><sub><italic>i</italic></sub>}, <italic>X</italic><sub><italic>i</italic></sub>=<inline-formula><mml:math id="INEQ1"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> &#x2208; <italic>R<sup>n</sup></italic>,<italic>t</italic><sub><italic>i</italic></sub>=<inline-formula><mml:math id="INEQ2"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> &#x2208; <italic>R<sup>m</sup></italic>. For a single hidden layer neural network with L hidden layer nodes, it can be expressed as</p>
<disp-formula id="S2.E1">
<label>(1)</label>
<mml:math id="M1">
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo rspace="5.8pt">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo rspace="7.5pt">,</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x22EF;</mml:mi>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mpadded width="+5pt">
<mml:mi>L</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mi>j</mml:mi>
</mml:mpadded>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where <italic>g</italic>(<italic>x</italic>) is the activation function. <italic>W</italic><sub><italic>i</italic></sub>=[<italic>w</italic><sub><italic>i</italic>,1</sub>,<italic>w</italic><sub><italic>i</italic>,2</sub>,&#x2026;,<italic>w</italic><sub><italic>i</italic>,<italic>n</italic></sub>] is input weight. &#x03B2;<sub><italic>i</italic></sub>=<inline-formula><mml:math id="INEQ3"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is output weight, <italic>b<sub>i</sub></italic> is the bias of the <italic>ith</italic> hidden layer node. <italic>O</italic><sub><italic>j</italic></sub>=<inline-formula><mml:math id="INEQ4"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is output of the sample <italic>X<sub>j</sub></italic>. The goal of single-hidden layer neural network learning is to minimize the output error, can be represented as</p>
<disp-formula id="S2.E2">
<label>(2)</label>
<mml:math id="M2">
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:mo fence="true">||</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo fence="true">||</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mo rspace="7.5pt">,</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>There exist &#x03B2;<sub><italic>i</italic></sub>, <italic>W<sub>i</sub></italic> and <italic>b<sub>i</sub></italic>, such that</p>
<disp-formula id="S2.E3">
<label>(3)</label>
<mml:math id="M3">
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo rspace="5.8pt">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo rspace="7.5pt">,</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>j</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mpadded width="+5pt">
<mml:mi>N</mml:mi>
</mml:mpadded>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>It can be expressed in matrix form</p>
<disp-formula id="S2.E4">
<label>(4)</label>
<mml:math id="M4">
<mml:mrow>
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mi>&#x03B2;</mml:mi>
</mml:mpadded>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where <italic>H</italic> is the output of the hidden layer node, &#x03B2; is the output weight, and <italic>T</italic> is the expected output.</p>
<disp-formula id="S2.Ex1">
<label>(5)</label>
<mml:math id="M5">
<mml:mrow>
<mml:mi>H</mml:mi>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo rspace="7.5pt">,</mml:mo>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.E5">
<mml:math id="M6">
<mml:mrow>
<mml:mi/>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mtable columnspacing="5pt" rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EF;</mml:mi>
</mml:mtd>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EE;</mml:mi>
</mml:mtd>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EF;</mml:mi>
</mml:mtd>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EE;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>N</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
<mml:mtd/>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
<mml:mo>&#x22C5;</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi>N</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>b</mml:mi>
<mml:mi>L</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>N</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.E6">
<label>(6)</label>
<mml:math id="M7">
<mml:mrow>
<mml:mrow>
<mml:mi>&#x03B2;</mml:mi>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:msubsup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mn>1</mml:mn>
<mml:mi>T</mml:mi>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EE;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:msubsup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>L</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>L</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo rspace="7.5pt">,</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>T</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mn>1</mml:mn>
<mml:mi>T</mml:mi>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mi mathvariant="normal">&#x22EE;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:msubsup>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
<mml:mi>T</mml:mi>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>N</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">&#x00D7;</mml:mo>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>In the ELM algorithm, <italic>W<sub>i</sub></italic> and <italic>b<sub>i</sub></italic> are randomly determined, and the output matrix <italic>H</italic> of the hidden layer is uniquely determined. The training of the single-hidden layer neural network can be transformed into adding a linear system, Equation (4). And the output weight &#x03B2; can be determined by Equation 7.</p>
<disp-formula id="S2.E7">
<label>(7)</label>
<mml:math id="M8">
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mover accent="true">
<mml:mi>&#x03B2;</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>H</mml:mi>
<mml:mo>&#x2020;</mml:mo>
</mml:msup>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Where <italic>H</italic><sup>&#x2020;</sup> is the Moore-Penrose inverse of <italic>H</italic>. and the solution norm of <inline-formula><mml:math id="INEQ5"><mml:mover accent="true"><mml:mi>&#x03B2;</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:math></inline-formula> is minimal and unique. We solve for <inline-formula><mml:math id="INEQ6"><mml:mover accent="true"><mml:mi>&#x03B2;</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:math></inline-formula> to construct ELM.</p>
<p>In this paper, the ELM classifier is constructed by dividing the data set into three sets: training set, validation set, and test set. The training set is used to build enough classifiers. The validation set is used to verify the accuracy of each classification, and find the best ELM classifier. As the input weights of the ELM classifier are randomly generated, its classification accuracy will also vary randomly. To ensure that the classifier has high accuracy, a sufficient number of ELM classifiers need to be trained, until the average accuracy of all classifiers converges. The number of classifiers constructed is <italic>Loop</italic>, and the average accuracy of ELM classifiers is <inline-formula><mml:math id="INEQ7"><mml:mover accent="true"><mml:mrow><mml:mi>A</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>c</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>c</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>u</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>a</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>c</mml:mi><mml:mo>&#x2062;</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula>. The value of the variable <italic>Loop</italic> should satisfy the constraint of Equation 9, where the parameter <italic>p</italic> is the allowed fluctuation of the average accuracy. The loop calculation produces 2&#x002A;<italic>Loop</italic> accuracies, and the absolute value of the difference between the average accuracies of any consecutive <italic>Loop</italic> times should be less than or equal to parameter <italic>p</italic>. The 2&#x002A;<italic>Loop</italic> ELM classifiers are validated with the validation set, and the classifier with the highest accuracy is the optimal classifier we are looking for. In this experiment <italic>p</italic> = 0.15%.</p>
<disp-formula id="S2.E8">
<label>(8)</label>
<mml:math id="M9">
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mover accent="true">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo>&#x00AF;</mml:mo>
</mml:mover>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.Ex2">
<label>(9)</label>
<mml:math id="M10">
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo largeop="true" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>u</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>c</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo rspace="5.8pt">|</mml:mo>
</mml:mrow>
<mml:mo rspace="5.8pt">&#x2264;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.E9">
<mml:math id="M11">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="normal">&#x2026;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>o</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
</sec>
<sec id="S2.SS3">
<title>KFS-ELM Method</title>
<p>The main idea of the KFS-ELM algorithm lies in pruning the ELM classifier. Removing input nodes and hidden layer nodes (including the weights connected to them) which have no positive impact with the classification, and keeping the accuracy of the ELM classifier from decreasing on the training and validation sets. The features corresponding to the input layer nodes of the pruned classifier are considered as key features. They are strongly related to the classification. <bold>Definition 1:</bold> <italic>W<sup>i</sup></italic> is the diagonal matrix, <italic>i</italic> &#x2208; [1,&#x2026;,<italic>n</italic>],<italic>n</italic> is the dimensionality of the subject&#x2019;s features. <italic>W<sup>i</sup></italic> satisfies the constraints of <italic>rank</italic>(<italic>W<sup>i</sup></italic>) = <italic>i</italic>, <italic>rank</italic><sup>(&#x002A;</sup>) denotes the rank of the matrix. <bold>Definition 2:</bold> &#x03B2;<italic><sup>j</sup></italic> is the diagonal matrix, <italic>j</italic> &#x2208; [1,&#x22EF;,<italic>L</italic>], <italic>L</italic> is the number of hidden layer nodes of the ELM classifier. &#x03B2;<italic><sup>j</sup></italic> satisfies the constraints of <inline-formula><mml:math id="INEQ8"><mml:mrow><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>n</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mpadded width="+3.3pt"><mml:mi>&#x03B2;</mml:mi></mml:mpadded><mml:mmultiscripts><mml:mo rspace="5.8pt">)</mml:mo><mml:mprescripts/><mml:none/><mml:mi>j</mml:mi></mml:mmultiscripts></mml:mrow><mml:mo rspace="5.8pt">=</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:math></inline-formula>.</p>
<p>In the ELM classifier the output weight <inline-formula><mml:math id="INEQ9"><mml:mover accent="true"><mml:mi>&#x03B2;</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:math></inline-formula>, the input weight <italic>W</italic>, and the bias <italic>b</italic> are known quantities. The output weights after pruned can be expressed as <inline-formula><mml:math id="INEQ10"><mml:mrow><mml:msup><mml:mi>&#x03B2;</mml:mi><mml:mi>j</mml:mi></mml:msup><mml:mo>&#x22C5;</mml:mo><mml:mover accent="true"><mml:mi>&#x03B2;</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula>. The input weights can be expressed as <italic>W</italic>&#x22C5;<italic>W<sup>i</sup></italic>. The key feature matrix of the subject can be expressed as <italic>W<sup>i</sup></italic>&#x22C5;<italic>X</italic>. One <italic>W<sup>i</sup></italic> corresponds to a group of key features. The pruned ELM classifier can be described by Equation (10) as</p>
<disp-formula id="S2.E10">
<label>(10)</label>
<mml:math id="M12">
<mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo rspace="5.8pt">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>g</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi>W</mml:mi>
<mml:mo>&#x22C5;</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>&#x22C5;</mml:mo>
<mml:mi>X</mml:mi>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x22C5;</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>&#x22C5;</mml:mo>
<mml:mpadded width="+3.3pt">
<mml:mover accent="true">
<mml:mi>&#x03B2;</mml:mi>
<mml:mo>^</mml:mo>
</mml:mover>
</mml:mpadded>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mi>O</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
<p><italic>g</italic>(&#x002A;) is the activation function. <italic>X</italic> is the input matrix consisting of the features of the input data set (training set, validation set and test set). <italic>X</italic> = [<italic>X</italic><sub>1</sub>,&#x22EF;,<italic>X</italic><sub><italic>N</italic></sub>], where <italic>X<sub>i</sub></italic> denotes the feature vector of the <italic>ith</italic> subject, and there are <italic>N</italic> subjects. <italic>O</italic> is the output matrix.<italic>O</italic> = [<italic>O</italic><sub>1</sub>,&#x22EF;,<italic>O</italic><sub><italic>N</italic></sub>], <italic>O<sub>i</sub></italic> is the label vector output by the ELM classifier, corresponding to a subject. <italic>O</italic><sub><italic>i</italic></sub>=<inline-formula><mml:math id="INEQ11"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, there is only one <italic>o</italic><sub><italic>i</italic>,<italic>j</italic></sub>=1,<italic>o</italic><sub><italic>i</italic>,<italic>j</italic></sub> &#x2208; <italic>O</italic><sub><italic>i</italic></sub>, the others <italic>o</italic><sub><italic>i</italic>,<italic>j</italic></sub> = 0. <italic>T</italic> = [<italic>T</italic><sub>1</sub>,&#x22EF;,<italic>T</italic><sub><italic>N</italic></sub>] is the set of labels corresponding to the data set, where <italic>T<sub>i</sub></italic> denotes the label vector corresponding to the <italic>ith</italic> subject. <italic>T</italic><sub><italic>i</italic></sub>=<inline-formula><mml:math id="INEQ12"><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mtext>&#x2009;</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo stretchy='false'>]</mml:mo></mml:mrow><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, there is only one <italic>t</italic><sub><italic>i</italic>,<italic>j</italic></sub>=1,<italic>t</italic><sub><italic>i</italic>,<italic>j</italic></sub> &#x2208; <italic>T</italic><sub><italic>i</italic></sub>, the others <italic>t</italic><sub><italic>i</italic>,<italic>j</italic></sub>=0. The accuracy of the ELM classifier can be expressed as Equation 11. &#x2225;&#x002A;&#x2225; denotes the Modulus of the vector.</p>
<disp-formula id="S2.E11">
<label>(11)</label>
<mml:math id="M13">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>O</mml:mi>
<mml:mo rspace="5.8pt">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>N</mml:mi>
</mml:mfrac>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:munderover>
<mml:mo largeop="true" movablelimits="false" symmetric="true">&#x2211;</mml:mo>
<mml:mrow>
<mml:mpadded width="+3.3pt">
<mml:mi>i</mml:mi>
</mml:mpadded>
<mml:mo rspace="5.8pt">=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mfrac>
<mml:mrow>
<mml:mo>&#x2225;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mi>O</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>&#x2225;</mml:mo>
</mml:mrow>
<mml:msqrt>
<mml:mn>2</mml:mn>
</mml:msqrt>
</mml:mfrac>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>In Equation 10, <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> are the variables to be solved, and the other parameters are known quantities. <italic>W<sup>i</sup></italic> should satisfy inequality (Equation 12). &#x03B2;<italic><sup>j</sup></italic> should satisfy inequality (Equation 13).</p>
<disp-formula id="S2.E12">
<label>(12)</label>
<mml:math id="M14">
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mi/>
</mml:mrow>
</mml:math>
</disp-formula>
<disp-formula id="S2.E13">
<label>(13)</label>
<mml:math id="M15">
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:mrow>
<mml:mi>h</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mi>&#x03B2;</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>i</mml:mi>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mi/>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Pruning an ELM classifier obtains a <italic>W<sup>i</sup></italic>, which corresponds to a set of key features. Pruning enough ELM classifiers will obtain enough <italic>W<sup>i</sup></italic>. The merge set of <italic>W</italic>&#x002A;=&#x2211;<italic>W<sup>i</sup></italic> correspond to the complete key features. It should be noted that the datasets used for validation W&#x002A; are the training and validation sets and do not include the test set.</p>
</sec>
<sec id="S2.SS4">
<title>KFS-ELM Algorithm Steps</title>
<list list-type="simple">
<list-item>
<label>1.</label>
<p>Construct an ELM classifier using all the features from the training and validation sets.</p>
</list-item>
<list-item>
<label>2.</label>
<p>Prune the input layer nodes and their corresponding output weights in the ELM classifier, which means that keeping &#x03B2;<italic><sup>j</sup></italic> constant to solve <italic>W<sup>i</sup></italic> iteratively. The initial values of <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> are <italic>nth</italic>-order unit matrix and <italic>Lth</italic>-order unit matrix, respectively, where n is the feature dimensionality of the subject and m is the number of ELM hidden layer nodes. When the inequality (12) is satisfied, take <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> into step 3.</p>
</list-item>
<list-item>
<label>3.</label>
<p>Prune the input layer nodes and their corresponding output weights in the ELM classifier, which means that keeping <italic>W<sup>i</sup></italic> constant to solve &#x03B2;<italic><sup>j</sup></italic> iteratively. When &#x03B2;<italic><sup>j</sup></italic> satisfies inequality (13), take <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> into step 2. When <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> satisfy both inequalities (10) and inequality (11), take <italic>W<sup>i</sup></italic> into step 4.</p>
</list-item>
<list-item>
<label>4.</label>
<p>Loop through steps 1-3, find <italic>W</italic>&#x002A;=&#x2211;<italic>W<sup>i</sup></italic>. And Construct ELM classifier based on the key features corresponding to <italic>W</italic>&#x002A;. When the accuracy of the ELM classifier is no longer increasing, the calculation ends. The features corresponding to the final <italic>W</italic>&#x002A; are the key features screened by the KFS-ELM method.</p>
</list-item>
</list>
</sec>
</sec>
<sec id="S3">
<title>Experiment</title>
<sec id="S3.SS1">
<title>Flow of the Experiment</title>
<p>This experiment consists of three parts: data preparation, screen features by using KFS-ELM, test ELM classifier, and validate key features (as shown in <xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Flow of the experiment.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-14-888575-g001.tif"/>
</fig>
<list list-type="simple">
<list-item>
<label>1.</label>
<p>Data preparation: fMRI data for AD and CN are acquired from the ADNI database and pre-processed using DPARSF, SPM to obtain the functional brain connectivity matrix for all subjects.</p>
</list-item>
<list-item>
<label>2.</label>
<p>Screen key features using KFS-ELM: Screening for key features which are relevant to the diagnosis of AD.</p>
</list-item>
<list-item>
<label>3.</label>
<p>Test ELM classifier: Test the ELM classifier before and after pruning, and observe the change in classifier performance.</p>
</list-item>
<list-item>
<label>4.</label>
<p>Validate key features: compare the ability to diagnose AD using the key feature set versus the full feature set. Compare the impact of key features with different weights on diagnosing AD.</p>
</list-item>
</list>
</sec>
<sec id="S3.SS2">
<title>Environment of the Experiment</title>
<p>All experiments run on a PC with intel core i7-8700 @ 3.20GHz, NVIDIA GeForce RTX 2080 8GB, 16GB DDR4 3600MHz, 1TB SSD.</p>
</sec>
<sec id="S3.SS3">
<title>Data of the Experiment</title>
<p>The fMRI data used in this paper are all from LONI&#x2019;s ADNI database ADNI2 project. It included 100 Cognitively Normal (CN) subjects, 49 females and 51 males, with a mean age and standard deviation of 74.09 &#x00B1; 5.45. And it included 100 Alzheimer&#x2019;s disease (AD) subjects, 44 women and 56 men, with a mean age and standard deviation of 75.07 &#x00B1; 7.59. All subjects used the same acquisition parameters. Field Strength = 3.0 tesla; Flip Angle = 80.0 degree; Manufacturer = Philips Medical Systems; Matrix X = 64.0 pixels; Matrix Y = 64.0 pixels; Mfg Model = Intera; Pixel Spacing X = 3.3125 mm; Pixel Spacing Y = 3.3125 mm; Pulse Sequence = GR; Slices = 6720.0; Slice Thickness = 3.312999963760376 mm; TE = 30.000999450683594 ms; TR = 3000.0 ms; The subject data can be downloaded at <ext-link ext-link-type="uri" xlink:href="http://adni.loni.usc.edu">http://adni.loni.usc.edu</ext-link>.</p>
</sec>
<sec id="S3.SS4">
<title>Pre-processing of Data</title>
<p>The data pre-processing tools chosen for this experiment are &#x2018;&#x2018;Data Processing Assistant for Resting-State fMRI Advanced Edition&#x2019;&#x2019; (DPARSF 4.4 Advanced Edition<sup><xref ref-type="fn" rid="footnote1">1</xref></sup>)(<xref ref-type="bibr" rid="B39">Yan and Zang, 2010</xref>) and &#x2018;&#x2018; Statistical Parametric Mapping&#x2019;&#x2019; (SPM12<sup><xref ref-type="fn" rid="footnote2">2</xref></sup>). Remove the first 10 time points for each subject in order to remove the phase where the subject is familiar with the MRI scanner environment at the beginning of the data scan, and where brain activity is not smooth during the noise. Slice Timing and Head Motion correction is performed for each subject, and EPI template is used for standardization. Band-pass filtering is used to obtain signals between 0.01 and 0.1Hz. After processing, the bounding box of all subjects is [-90-126-72; 90 90 108], and the Voxel size is [3 3 3]. And then detrend the signal. The functional connectivity network of the brain is extracted based on the ALL template. Finally, we obtained 100 functional connectivity matrices for each of the two categories of subjects, AD and CN.</p>
</sec>
<sec id="S3.SS5">
<title>Grouping of Data Sets</title>
<p>The 200 functional connection matrices are divided into three groups: training set, validation set, and test set. The ratio of AD subjects to CN subjects within each group is 1:1.</p>
<p>In the KFS-ELM and test ELM classifier sections, the ratio of subjects in the training set, validation set and test set is 90:90:20. The test set is generated randomly. It does not change during the KFS-ELM calculation. The training and validation sets were randomly divided for the construction of each ELM classifier. The training set is used to construct the ELM classifier. The validation set is used to evaluate and find the best ELM classifiers. In the pruning process, both the training and validation sets are used to evaluate the change in the accuracy of the ELM classifiers. Test set for evaluating changes in ELM classifier before and after pruning.</p>
<p>In the validation of key features section, the ratio of subjects in the training set, validation set and test set is 160:20:20.</p>
</sec>
<sec id="S3.SS6">
<title>Construction of ELM Classifier</title>
<p>The training and validation sets are re-divided before each training of the classifier. The number of subjects and the proportion of categories in each set are kept constant. The number of input layer nodes is 4005, corresponding to all functional connections of the cerebrum. In our previous study, we found that the number of hidden layer nodes is positively correlated with the accuracy of the classifier in scenarios with small sample size and high feature dimensionality. The number of hidden nodes is set to 64,000 as the hardware performance allows. Due to the fact that this experiment involves two categories of data, the number of output nodes is set to 2. The convergence precision threshold <italic>p</italic> = 0.0015. The only things that need to be set manually are the number of hidden layer nodes and the convergence precision threshold. After the ELM classifier has been constructed, it will be tested using the test set.</p>
</sec>
<sec id="S3.SS7">
<title>Pruning of the Input Layer</title>
<p>The pruning of the input layer is actually solving Equation 10 for <italic>W<sup>i</sup></italic>, While ensuring that it satisfies the constraints of the set of inequality (12). As there are two variables <italic>W<sup>i</sup></italic> and &#x03B2;<italic><sup>j</sup></italic> in Equation (10), the value of &#x03B2;<italic><sup>j</sup></italic> is fixed first when solving for <italic>W<sup>i</sup></italic>. Due to the fact that the activation functions of the hidden layer nodes are nonlinear and the input layer is fully connected to the hidden layer, pruning the input layer nodes will result in a nonlinear variation in the output of the ELM classifier. Different pruning order or different number of nodes per pruning may screen different input layer nodes. Thus there are various ways of solving <italic>W<sup>i</sup></italic>. In this experiment, <italic>W<sup>i</sup></italic> is solved by zeroing the elements on the diagonal of <italic>W<sup>i</sup></italic> one by one according to their order. that is, a new <italic>W<sup>i</sup></italic> is generated when the accuracy of the ELM classifier is unchanged or improved (using the training and validation sets) after zeroing an element in <italic>W<sup>i</sup></italic>. Also because of the nonlinearity of ELM, if one or more nodes are pruned in the input layer pruning process, the pruning process needs to be executed again until no input node can be pruned. For the same reason, if &#x03B2;<italic><sup>j</sup></italic> changes, it is also necessary to prune the input layer again.</p>
</sec>
<sec id="S3.SS8">
<title>Pruning of the Hidden Layer</title>
<p>In the process of pruning the hidden layer nodes, the hidden layer nodes that have no effect on the accuracy of ELM classifier or improve the accuracy are pruned in turn. And then find &#x03B2;<italic><sup>j</sup></italic>. Because the ELM output is a linear summation of the hidden node outputs, no iterative computation is required. If any hidden layer node is pruned, the new &#x03B2;<italic><sup>j</sup></italic> and <italic>W<sup>i</sup></italic> should be substituted into the process of &#x201C;ELM classifier input layer pruning&#x201D; to update <italic>W<sup>i</sup></italic> again. If no hidden layer nodes are pruned, the pruning process of a single ELM classifier is completed.</p>
</sec>
<sec id="S3.SS9">
<title>Obtain Key Features</title>
<p>A single <italic>W<sup>i</sup></italic> corresponds to only part of the key features. ELM classifiers constructed by them are also less accurate. As more key features corresponding to <italic>W</italic>&#x002A; are available, the higher the accuracy of the ELM classifier constructed using it will be. When the accuracy of the ELM classifier is no longer improved, the corresponding key features become complete.</p>
</sec>
<sec id="S3.SS10">
<title>Validate Key Features</title>
<p>First, set the weights of non-key features equal to 0. A series of feature groups is formed by gradually excluding the features with the lowest weights from all features. We write the feature group as <italic>F</italic><sub><italic>i</italic></sub>,<italic>i</italic> = [0,&#x22EF;,<italic>s</italic>], <italic>s</italic> is the highest weights of key features. <italic>F<sub>i</sub></italic> is the feature group, containing all features whose weight <italic>i</italic>. <italic>s+1</italic> feature groups will be formed.</p>
<p>Then, redivide the training set, validation set and test set randomly. The ratio of training set, validation set and test set is 160:20:20, and the ratio of AD to CN in each set is 1:1. The data input to the ELM classifier is the features which are selected from each data set base on the feature group <italic>F<sub>i</sub></italic>.</p>
<p>Finally, the ELM classifier constructed from each group of features is used to validate the feature group classification (diagnostic) ability. 10 times 10 fold cross-validation is performed for each feature group.</p>
</sec>
</sec>
<sec id="S4" sec-type="results">
<title>Results</title>
<p>According to the KFS-ELM method, the accuracy of ELM classification constructed using the key feature set corresponding to <italic>W</italic>&#x002A; is no longer improved when screening to the 17th group <italic>W<sup>i</sup></italic>. The accuracy of the 17 ELM classifiers is shown in <xref ref-type="table" rid="T1">Table 1</xref>. The average training accuracy is 100%, the average verification accuracy is 97.84%, and the average test accuracy is 92.35%.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Accuracy of ELM trained with full amount of features,</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Classifier no.</td>
<td valign="top" align="center">Training accuracy (%)</td>
<td valign="top" align="center">Verification accuracy (%)</td>
<td valign="top" align="center">Test accuracy (%)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">85.00</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">98.89</td>
<td valign="top" align="center">90.00</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">100.00</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">75.00</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">75.00</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">100.00</td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">100.00</td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">95.00</td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">90.00</td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">100.00</td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.78</td>
<td valign="top" align="center">90.00%</td>
</tr>
<tr>
<td valign="top" align="left">Mean</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.84</td>
<td valign="top" align="center">92.35%</td>
</tr>
</tbody>
</table></table-wrap>
<p>The accuracy of the 17 ELM classifiers after pruning is shown in <xref ref-type="table" rid="T2">Table 2</xref>. After pruning, the average training accuracy of all ELM classifiers remained 100%. the average validation accuracy increased from 97.84% to 99.8%. And the average test accuracy decreased from 92.35 to 84.71%. The number of input nodes and the number of hidden nodes are drastically reduced. The average number of input nodes decreases from 4005 to 106.24, which is 2.65% of the original number of input nodes. The average number of remaining hidden nodes decreased from 64,000 to 1042.53, which is 1.63% of the original number of hidden nodes.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Number of nodes and accuracy of ELM classifier after pruning.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Classifier no.</td>
<td valign="top" align="center">Training accuracy (%)</td>
<td valign="top" align="center">Validation accuracy (%)</td>
<td valign="top" align="center">Test accuracy (%)</td>
<td valign="top" align="center">Number of input nodes screened</td>
<td valign="top" align="center">Number of hidden layer nodes screened</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">71</td>
<td valign="top" align="center">707</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">87</td>
<td valign="top" align="center">963</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">80.00</td>
<td valign="top" align="center">160</td>
<td valign="top" align="center">1,094</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">82</td>
<td valign="top" align="center">680</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">90.00</td>
<td valign="top" align="center">106</td>
<td valign="top" align="center">1,245</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">95.00</td>
<td valign="top" align="center">150</td>
<td valign="top" align="center">1,563</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">89</td>
<td valign="top" align="center">716</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">65.00</td>
<td valign="top" align="center">154</td>
<td valign="top" align="center">1,577</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">86</td>
<td valign="top" align="center">890</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">117</td>
<td valign="top" align="center">738</td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">95.00</td>
<td valign="top" align="center">175</td>
<td valign="top" align="center">1,939</td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">1,402</td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">95.00</td>
<td valign="top" align="center">63</td>
<td valign="top" align="center">557</td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">98.89</td>
<td valign="top" align="center">70.00</td>
<td valign="top" align="center">85</td>
<td valign="top" align="center">641</td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">98.89</td>
<td valign="top" align="center">80.00</td>
<td valign="top" align="center">87</td>
<td valign="top" align="center">825</td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">98.89</td>
<td valign="top" align="center">85.00</td>
<td valign="top" align="center">96</td>
<td valign="top" align="center">934</td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">75.00</td>
<td valign="top" align="center">90</td>
<td valign="top" align="center">1,252</td>
</tr>
<tr>
<td valign="top" align="left">Mean</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">99.80</td>
<td valign="top" align="center">84.71</td>
<td valign="top" align="center">106.24</td>
<td valign="top" align="center">1,042.53</td>
</tr>
</tbody>
</table></table-wrap>
<p>The key features corresponding to the pruned ELM classifier are shown in <xref ref-type="fig" rid="F2">Figure 2</xref> by the functional connectivity matrix. <xref ref-type="fig" rid="F2">Figures 2A&#x2013;C</xref> are the image representations of the key features (functional connectivity matrix) of the 1st, 2nd, and 3rd ELM classifier, respectively. <xref ref-type="fig" rid="F2">Figure 2D</xref> is the image representation of <italic>W</italic>&#x002A;. The horizontal and vertical coordinates of the image correspond to the serial numbers of the 90 brain regions of the AAL template. The colors of the points correspond to the weights of the key features. The total number of key features corresponding to <italic>W</italic>&#x002A; is 920. where the highest weight is 12. That is, the intersection of 17 <italic>W<sup>i</sup></italic> is empty.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>(A)</bold> Key features filtered in the first ELM classifier, <bold>(B)</bold> key features filtered in the second ELM classifier, <bold>(C)</bold> key features filtered in the third ELM classifier, <bold>(D)</bold> the concatenated set of all key features.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-14-888575-g002.tif"/>
</fig>
<p><xref ref-type="table" rid="T3">Table 3</xref> shows the performance of testing the classification (diagnosis) ability of key features with different weights versus full features for AD. Perform ten times 10 fold cross-validation and calculate the average of their accuracy rates. The ELM classifier trained using the full amount of features had a test accuracy of 95.33% with a standard deviation of 0.0035. The ELM classifier trained with all key features (920 features, 22.97% of the total number of features) had the highest test accuracy of 99.20% with a standard deviation of 0.0021. When the number of key features is 199 (4.97% of the total number of features), its test accuracy is 95.24%, which is only 0.09% lower than the accuracy of the full number of features. That is, the number of selected features was reduced from 4005 to 199, and the accuracy of the constructed ELM classifier test did not decrease significantly. When the number of selected features is 45 (1.12% of all features), its verification accuracy can still reach 100%, and the test accuracy is 87.84%. When the selected features are further reduced, the accuracy of the ELM classifier is also further reduced. When the number of selected features is 4 or 1, the ELM test accuracy is close to 50%. It is no longer practical to do classification using only these features.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Accuracy of constructing ELM classifier with full amount of features or key features with different weights.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Criteria for feature selection</td>
<td valign="top" align="center">Amount of features</td>
<td valign="top" align="center">Selected features/Full amount of features (%)</td>
<td valign="top" align="center">Training accuracy (%)</td>
<td valign="top" align="center">Validation accuracy (%)</td>
<td valign="top" align="center">Test accuracy &#x00B1; Standard deviation</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Weight &#x2265; 0</td>
<td valign="top" align="center">4005</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">95.33% &#x00B1; 0.0035</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 1</td>
<td valign="top" align="center">920</td>
<td valign="top" align="center">22.97</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">99.20% &#x00B1; 0.0021</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 2</td>
<td valign="top" align="center">397</td>
<td valign="top" align="center">9.91</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">96.92% &#x00B1; 0.0057</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 3</td>
<td valign="top" align="center">199</td>
<td valign="top" align="center">4.97</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">95.24% &#x00B1; 0.0048</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 4</td>
<td valign="top" align="center">109</td>
<td valign="top" align="center">2.72</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">93.33% &#x00B1; 0.0074</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 5</td>
<td valign="top" align="center">62</td>
<td valign="top" align="center">1.55</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">91.68% &#x00B1; 0.0090</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 6</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">1.12</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">87.84% &#x00B1; 0.0106</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 7</td>
<td valign="top" align="center">28</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">99.75</td>
<td valign="top" align="center">82.19% &#x00B1; 0.0126</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 8</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">97.55</td>
<td valign="top" align="center">75.27% &#x00B1; 0.0195</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 9</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">92.70</td>
<td valign="top" align="center">69.03% &#x00B1; 0.0170</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 10</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">0.20</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">87.75</td>
<td valign="top" align="center">60.14% &#x00B1; 0.0359</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 11</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.10</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">81.65</td>
<td valign="top" align="center">51.69% &#x00B1; 0.0251</td>
</tr>
<tr>
<td valign="top" align="left">Weight &#x2265; 12</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">58.05</td>
<td valign="top" align="center">84.85</td>
<td valign="top" align="center">55.34% &#x00B1; 0.0140</td>
</tr>
</tbody>
</table></table-wrap>
<p>Considering the visualization effect, this paper takes 45 key features with weights greater than or equal to 6 as an example to show their distribution. <xref ref-type="table" rid="T4">Table 4</xref> shows the features (functional connectivity) and their corresponding weights. <xref ref-type="table" rid="T5">Table 5</xref> shows the brain regions corresponding to the features and their weights. Its weight is the sum of the weights of all functional connections involving that brain region. <xref ref-type="fig" rid="F3">Figure 3</xref> is a demonstration of these key features. The visualization tool used is BrainNet Viewer (<xref ref-type="bibr" rid="B38">Xia et al., 2013</xref>).</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>The key features with weight value greater than or equal to 6.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Functional connectivity</td>
<td valign="top" align="center">Weight</td>
<td valign="top" align="left">Functional connectivity</td>
<td valign="top" align="center">Weight</td>
<td valign="top" align="left">Functional connectivity</td>
<td valign="top" align="center">Weight</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">FFG.R-STG.R (56&#x2013;82)</td>
<td valign="top" align="center">12</td>
<td valign="top" align="left">MOG.R-PCUN.R (52&#x2013;68)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">ORBsup.L-MTG.L (5&#x2013;85)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MOG.L-PCUN.R (51&#x2013;68)</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left">MOG.R-PCL.R (52&#x2013;70)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">ORBmid.L-IFGtriang.L (9&#x2013;13)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">TPOsup.R-ITG.R (84&#x2013;90)</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left">IOG.L-IOG.R (53&#x2013;54)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">IFGoperc.L-SFGmed.L (11&#x2013;23)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MTG.L-TPOmid.L (85&#x2013;87)</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left">FFG.R-PCL.L (56&#x2013;69)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">IFGoperc.R-HIP.R (12&#x2013;38)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">ORBsup.L-TPOmid.L (5&#x2013;87)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="left">PCL.L-ITG.R (56&#x2013;90)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">SOG.R-PCL.L (50&#x2013;69)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">ORBmid.L-IFGoperc.L (9&#x2013;11)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="left">STG.R-ITG.R (82&#x2013;90)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">SOG.R-PCL.R (50&#x2013;70)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">FFG.R-PUT.L (56&#x2013;73)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="left">TPOmid.L-ITG.L (87&#x2013;89)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">IOG.L-FFG.R (53&#x2013;56)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">FFG.R-TPOsup.R (56&#x2013;84)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="left">PreCG.L-THA.L (1&#x2013;77)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">FFG.R-PCUN.R (56&#x2013;68)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">PreCG.R-HES.R (2&#x2013;80)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">SFGdor.R-TPOsup.R (4&#x2013;84)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">FFG.R-TPOsup.L (56&#x2013;83)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MFG.L-IFGoperc.L (7&#x2013;11)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">MFG.R-IFGtriang.R (8&#x2013;14)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">PoCG.R-IPL.L 58&#x2013;61)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MOG.L-PCL.L (51&#x2013;69)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">CUN.R-PCUN.R (46&#x2013;68)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">PCUN.R-MTG.L (68&#x2013;85)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MFG.R-IFGoperc.R (8&#x2013;12)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">MOG.L-SPG.L (51&#x2013;59)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">PCL.R-PUT.L (70&#x2013;73)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">SOG.R-IOG.R (50&#x2013;54)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">CAU.R-STG.R (72&#x2013;82)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">PCL.R-PUT.R (70&#x2013;74)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MOG.L-PCUN.L (51&#x2013;67)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">SFGdor.R-THA.R (4&#x2013;78)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">MTG.L-ITG.R (85&#x2013;90)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MOG.L-PCL.R (51&#x2013;70)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">ORBsup.L-PCL.R (5&#x2013;70)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="left">MTG.R-TPOmid.L (86&#x2013;87)</td>
<td valign="top" align="center">6</td>
</tr>
</tbody>
</table></table-wrap>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>The brain regions corresponding to the key features with weight greater than or equal to 6.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Brain region (serial No.)</td>
<td valign="top" align="center">Weight</td>
<td valign="top" align="left">Brain region (serial No.)</td>
<td valign="top" align="center">Weight</td>
<td valign="top" align="left">Brain region (serial No.)</td>
<td valign="top" align="center">Weight</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">FFG.R (56)</td>
<td valign="top" align="center">66</td>
<td valign="top" align="left">MOG.R (52)</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">CUN.R (46)</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">MOG.L (51)</td>
<td valign="top" align="center">43</td>
<td valign="top" align="left">IOG.R (54)</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">SPG.L (59)</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">PCL.R (70)</td>
<td valign="top" align="center">40</td>
<td valign="top" align="left">PUT.L (73)</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">CAU.R (72)</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">PCUN.R (68)</td>
<td valign="top" align="center">38</td>
<td valign="top" align="left">MFG.R (8)</td>
<td valign="top" align="center">15</td>
<td valign="top" align="left">THA.L (77)</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">TPOmid.L (87)</td>
<td valign="top" align="center">35</td>
<td valign="top" align="left">IFGoperc.R (12)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">IFGtriang.L (13)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">ITG.R (90)</td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">IOG.L (53)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">SFGmed.L (23)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">MTG.L (85)</td>
<td valign="top" align="center">29</td>
<td valign="top" align="left">SFGdor.R (4)</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">HIP.R (38)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">TPOsup.R (84)</td>
<td valign="top" align="center">28</td>
<td valign="top" align="left">PreCG.R (2)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">PoCG.R (58)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">STG.R (82)</td>
<td valign="top" align="center">27</td>
<td valign="top" align="left">MFG.L (7)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">IPL.L (61)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">IFGoperc.L (11)</td>
<td valign="top" align="center">25</td>
<td valign="top" align="left">HES.R (80)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">PUT.R (74)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">PCL.L (69)</td>
<td valign="top" align="center">23</td>
<td valign="top" align="left">PCUN.L (67)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">THA.R (78)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">ORBsup.L (5)</td>
<td valign="top" align="center">22</td>
<td valign="top" align="left">ITG.L (89)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">TPOsup.L (83)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">SOG.R (50)</td>
<td valign="top" align="center">20</td>
<td valign="top" align="left">PreCG.L (1)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">MTG.R (86)</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left">ORBmid.L (9)</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">IFGtriang.R (14)</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left"/>
<td/>
</tr>
</tbody>
</table></table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Full-view diagram of key features with weights greater than or equal to 6.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnagi-14-888575-g003.tif"/>
</fig>
</sec>
<sec id="S5" sec-type="discussion">
<title>Discussion</title>
<p>Comparing the ELM classifier before and after pruning (according to <xref ref-type="table" rid="T2">Tables 2</xref>, <xref ref-type="table" rid="T3">3</xref>), a significant decrease in the number of input nodes was observed. The average number of input nodes decreased from 4005 to 106.24. Its corresponding number of key features accounts for 2.65% of the total number of features. According to the constraints of Inequality (12) and Inequality (13), removing any of the key nodes will lead to a decrease in the accuracy of the classifier. It shows that all these features are important. The KFS-ELM feature screening experiments yielded 17 groups of key features whose intersection is empty. This indicates that the key features that contribute differently in different classifiers. Therefore, the union of these key feature groups can describe AD more comprehensively.</p>
<p>The performance of the full amount of features and key features in building ELM classifiers was tested in the experiments. The ELM classifier trained using the full amount of features (4,005 features) has a test accuracy of 95.33%. The ELM classifier trained using all key features (920 features) has a test accuracy is 99.20%. It indicates that, training the ELM classifier with fewer features results in higher test accuracy, the 3,085 (4,005 &#x2212; 920 = 3,085) features that were excluded had a negative effect in classification. Therefore, the KFS-ELM is effective in screening key features.</p>
<p>In this experiment, the weights assigned to the features by KFS-ELM are from 0 to 12, and the bigger the weights, the more important they are. In the testing section, we divided the features into 13 groups according to their weights and gradually excluded the group with the lowest feature weights to test the diagnostic ability of the remaining features for AD. For example, in <xref ref-type="table" rid="T3">Table 3</xref>, the row where &#x201C;Weight &#x2265; 1&#x201D; indicates the diagnostic ability of all key features (excluding features with weights equal to 0) for AD. Comparing the results of &#x201C;Weight &#x2265; 0&#x201D; and &#x201C;Weight &#x2265; 1,&#x201D; the features with weight equal to 0 play a negative role in AD diagnosis. The row where &#x201C;Weight &#x2265; 2&#x201D; indicates that the key features &#x201C;Weight = 0&#x201D; and &#x201C;Weight = 1&#x201D; are excluded. Comparing the results of &#x201C;Weight &#x2265; 2&#x201D; and &#x201C;Weight &#x2265; 1,&#x201D; we can judge the effect of the &#x201C;Weight = 1&#x201D; feature group on the diagnosis of AD. Adding the feature group of &#x201C;Weight = 1&#x201D; to the feature group of &#x201C;Weight &#x2265; 2&#x201D; increased the number of features by 523 (920 - 397), and the training and validation accuracy did not change, while the test accuracy increased by 2.28% (99.20 - 96.92%). That is, the 523 features(&#x201C;Weight = 1&#x201D;) contributed 2.28% to the test accuracy of AD. In this way, the effect of the features with weight = 10 on the diagnosis of AD can be shown by comparing the experimental results of &#x201C;Weight &#x2265; 10&#x201D; and &#x201C;Weight &#x2265; 11.&#x201D; The feature group with Weight = 10 contributed 6.10% (87.75 - 81.65%) to the validation accuracy and 8.45% (60.14 - 51.69%) to the test accuracy in AD diagnosis. According to this rule, the effect of feature groups with different weights on AD diagnosis was calculated based on the results in <xref ref-type="table" rid="T3">Table 3</xref>, as detailed in <xref ref-type="table" rid="T6">Table 6</xref>. In addition, according to <xref ref-type="table" rid="T3">Table 3</xref>, the accuracy of the training set was 58.05% when &#x201C;Weight &#x2265; 12&#x201D;, which means that the group of features with Weight = 12 is not enough to distinguish AD from normal people, so this result is not adopted as the basis for <xref ref-type="table" rid="T6">Table 6</xref>. Therefore, in <xref ref-type="table" rid="T6">Table 6</xref>, the impact of AD diagnosis can only be assessed for the feature groups with weights from 0 to 10.</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Effect of features with different weights on AD diagnosis.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Weight of feature group</td>
<td valign="top" align="center">Features amount</td>
<td valign="top" align="center">Contribution of feature group to validation accuracy (%)</td>
<td valign="top" align="center">Contribution of each feature to validation accuracy (%)</td>
<td valign="top" align="center">Contribution of feature group to test accuracy (%)</td>
<td valign="top" align="center">Contribution of each feature to test accuracy (%)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Weight = 10</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">6.10</td>
<td valign="top" align="center">1.5250</td>
<td valign="top" align="center">8.45</td>
<td valign="top" align="center">2.1125</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 9</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">4.95</td>
<td valign="top" align="center">1.6500</td>
<td valign="top" align="center">8.89</td>
<td valign="top" align="center">2.9633</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 8</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">4.85</td>
<td valign="top" align="center">0.4409</td>
<td valign="top" align="center">6.24</td>
<td valign="top" align="center">0.5673</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 7</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">2.20</td>
<td valign="top" align="center">0.3667</td>
<td valign="top" align="center">6.92</td>
<td valign="top" align="center">1.1533</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 6</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.25</td>
<td valign="top" align="center">0.0147</td>
<td valign="top" align="center">5.65</td>
<td valign="top" align="center">0.3324</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 5</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">3.84</td>
<td valign="top" align="center">0.2259</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 4</td>
<td valign="top" align="center">47</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">1.65</td>
<td valign="top" align="center">0.0351</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 3</td>
<td valign="top" align="center">90</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">1.91</td>
<td valign="top" align="center">0.0212</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 2</td>
<td valign="top" align="center">198</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">1.68</td>
<td valign="top" align="center">0.0085</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 1</td>
<td valign="top" align="center">523</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">2.28</td>
<td valign="top" align="center">0.0044</td>
</tr>
<tr>
<td valign="top" align="left">Weight = 0</td>
<td valign="top" align="center">3085</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.0000</td>
<td valign="top" align="center">-3.87</td>
<td valign="top" align="center">-0.0013</td>
</tr>
</tbody>
</table></table-wrap>
<p>Each row in <xref ref-type="table" rid="T6">Table 6</xref> indicates the effect of a key feature group on the accuracy of AD diagnosis for a given weight. The first column indicates the weights of the feature group. The second column indicates the number of features in the group. The third column indicates the contribution of the feature group to the validation accuracy. The fourth column indicates the average contribution of each feature in the group to the validation accuracy. The fifth column indicates the contribution of the feature group to the test accuracy. The sixth column indicates the average contribution of each feature in the group to the test accuracy.</p>
<p>According to <xref ref-type="table" rid="T6">Table 6</xref>, in terms of the contribution of single feature to the validation accuracy (column 4), basically the higher the weight of the feature, the more it contributes to the AD diagnosis (except weight = 9). In terms of the contribution of single feature to the test accuracy (column 6), basically the higher the weight of the feature, the more it contributes to the AD diagnosis (except weight = 8 and weight = 9). Basically, it can be considered that the higher weight the feature has, the greater contribution it makes to the AD diagnosis.</p>
<p>According to the pattern in column 2 of <xref ref-type="table" rid="T6">Table 6</xref>, the lower the weight of the feature group, the higher amount of features it contains. However, the group with a weight of 9 has fewer features than expected and the group with a weight of 8 has more features than expected. This also caused the contribution of features to diagnose AD deviated when the weight equals 8 and 9. We consider that this is due to the ELM classifier&#x2019;s randomness in the utilization of features (refer to <xref ref-type="fig" rid="F2">Figure 2</xref>). There are some deviations in the results of feature screening from the probability distribution of the features. Features with weights of 8, 9, 10, 11, and 12 account for 0.27, 0.07, 0.10, 0.07, and 0.02% of all features, respectively, and there are only 22 of these features in total, accounting for 0.55% of the total number of features. If there is a deviation of 0.1% in the feature screening, the results may deviate from the expected pattern, as it happens in <xref ref-type="table" rid="T6">Table 6</xref> when the weights are equal to 8 and 9. When the number of features in the feature set is greater than or equal to 17 (0.42% of the total features), no such deviation occurs.</p>
<p>We have taken four references with similar work for comparison, as shown in <xref ref-type="table" rid="T7">Table 7</xref>. They all use functional connectivity as a feature measure, and they all screen for features as well. All the five studies used different feature screening methods. <xref ref-type="bibr" rid="B1">Bi et al. (2018)</xref> obtains feature sets by clustering, without assigning weights. It selects a class with the best classification performance from a large number of classes. <xref ref-type="bibr" rid="B23">Nguyen et al. (2019)</xref> set thresholds on the original values of the features. These values do not have a clear relationship with classification. <xref ref-type="bibr" rid="B28">Sadiq et al. (2021)</xref> and <xref ref-type="bibr" rid="B5">Hao et al. (2021)</xref> have in common that the features are given new weights by the feature screening method. They are both selected the top N features according to the weight. <xref ref-type="bibr" rid="B5">Hao et al. (2021)</xref> did not test the effects of different numbers of features on classification. It cannot be shown that there is a clear relationship between the weights of the features and the classification. <xref ref-type="bibr" rid="B28">Sadiq et al. (2021)</xref> used the ReliefF method to screen for features. This is a method that is widely used for feature screening. They tried 100, 200, 300, 400, 500 features for classification, respectively. The accuracy did not show a positive correlation with the number of features. Our experimental results show a clear positive correlation between the accuracy and the number of features.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Comparison the performances with references.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">References</td>
<td valign="top" align="center">DataSet AD:CN</td>
<td valign="top" align="left">Feature measures</td>
<td valign="top" align="left">Feature screening methods</td>
<td valign="top" align="left">Criteria for feature selection</td>
<td valign="top" align="center">Feature weights represent importance on classification</td>
<td valign="top" align="left">Classifier</td>
<td valign="top" align="center">Selected features</td>
<td valign="top" align="center">Selected features/All features(%)</td>
<td valign="top" align="center">Acc (%)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B1">Bi et al., 2018</xref></td>
<td valign="top" align="center">25:36</td>
<td valign="top" align="left">Funtional Connectivity (FC)</td>
<td valign="top" align="left">random neural network cluster</td>
<td valign="top" align="left">Selection of the best feature set from 11,000 times clustering</td>
<td valign="top" align="center">NO</td>
<td valign="top" align="left">Elam Neural Network</td>
<td valign="top" align="center">120</td>
<td valign="top" align="center">30.00</td>
<td valign="top" align="center">92.31</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B23">Nguyen et al., 2019</xref></td>
<td valign="top" align="center">34:31</td>
<td valign="top" align="left">voxel-wise regional spontaneous, FC</td>
<td valign="top" align="left">t-test, SVM-RFE, LASSO</td>
<td valign="top" align="left">Thresholds for selecting features</td>
<td valign="top" align="center">NO</td>
<td valign="top" align="left">ELM</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">98.86</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B28">Sadiq et al., 2021</xref></td>
<td valign="top" align="center">35:31</td>
<td valign="top" align="left">FC</td>
<td valign="top" align="left">ReliefF</td>
<td valign="top" align="left">Top n Features</td>
<td valign="top" align="center">NO</td>
<td valign="top" align="left">K-Nearest Neighbor</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">1.50</td>
<td valign="top" align="center">87.10</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">200</td>
<td valign="top" align="center">3.00</td>
<td valign="top" align="center">93.50</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">300</td>
<td valign="top" align="center">4.50</td>
<td valign="top" align="center">88.70</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">400</td>
<td valign="top" align="center">6.00</td>
<td valign="top" align="center">91.90</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">7.50</td>
<td valign="top" align="center">91.70</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B5">Hao et al., 2021</xref></td>
<td valign="top" align="center">252:215 (none ADNI)</td>
<td valign="top" align="left">first-order neighborhood aggregation of FC</td>
<td valign="top" align="left">Weight-constrained lowrank Learning</td>
<td valign="top" align="left">Top n Features</td>
<td valign="top" align="center">NO</td>
<td valign="top" align="left">Multi-Kernel SVM</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">25.00</td>
<td valign="top" align="center">88.63</td>
</tr>
<tr>
<td valign="top" align="left">Proposed method</td>
<td valign="top" align="center">100:100</td>
<td valign="top" align="left">FC</td>
<td valign="top" align="left">KFS-ELM</td>
<td valign="top" align="left">Automatic generation</td>
<td valign="top" align="center">YES</td>
<td valign="top" align="left">ELM</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">1.12</td>
<td valign="top" align="center">87.84</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">62</td>
<td valign="top" align="center">1.55</td>
<td valign="top" align="center">91.68</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">109</td>
<td valign="top" align="center">2.72</td>
<td valign="top" align="center">93.33</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">199</td>
<td valign="top" align="center">4.97</td>
<td valign="top" align="center">95.24</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">397</td>
<td valign="top" align="center">9.91</td>
<td valign="top" align="center">96.92</td>
</tr>
<tr>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="left"/>
<td valign="top" align="center"/>
<td valign="top" align="left"/>
<td valign="top" align="center">920</td>
<td valign="top" align="center">22.97</td>
<td valign="top" align="center">99.20</td>
</tr>
</tbody>
</table></table-wrap>
<p>In summary, the KFS-ELM method can rationally identify the weights of key features. The higher the weights of the features, the greater the impact in AD diagnosis.</p>
<p>Extreme learning machine is linearly transformed between the hidden and output layers. The KFS-ELM method has achieved good performance based on ELM. Some other representative RFNN methods may achieve high classification accuracies in scenarios with high-dimensional features, such as RVFL, where it is non-linearly transformed between the hidden and output layers. In the future, we will attempt to use the idea of key feature screening for research on the RVFL. We will also apply the proposed method to explore patterns in AD-like diseases or other brain sciences.</p>
</sec>
<sec id="S6" sec-type="conclusion">
<title>Conclusion</title>
<p>The experimental results and discussion analysis showed that, the KFS-ELM method can effectively screen key features related to the diagnosis of AD, and can assign rational weights to the features to identify their importance for the diagnosis of AD. The KFS-ELM can be used to construct better classifiers for the diagnosis of AD, and can also be used as a feature analysis tool to study the patterns inherent in the brains of AD patients. We consider that the KFS-ELM is also applicable to the classification and the study of feature patterns of the other objects with high feature dimensions, even with small sample sizes.</p>
</sec>
<sec id="S7" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="TS1">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="S8">
<title>Ethics Statement</title>
<p>The studies involving human participants were reviewed and approved by ADNI. All the fMRI data used in this manuscript are from LONI&#x2019;s ADNI database ADNI2 project. It can be downloaded from <ext-link ext-link-type="uri" xlink:href="http://adni.loni.usc.edu">http://adni.loni.usc.edu</ext-link>. All ADNI participants signed an informed consent form at the time of fMRI and related information collection, and the protocol for consent was approved by the institutional review board at each site. The ethical approval was obtained by ADNI and can be found at <ext-link ext-link-type="uri" xlink:href="http://www.loni.usc.edu/ADNI/">http://www.loni.usc.edu/ADNI/</ext-link>. All studies in this manuscript were conducted in accordance with the relevant guidelines.</p>
</sec>
<sec id="S9">
<title>Author Contributions</title>
<p>JL and WZ contributed to the conception of the study. JL performed the experiment and data analyses and wrote the manuscript. JL and LZ contributed significantly to the analysis and manuscript preparation. YS helped to perform the analysis with constructive discussions. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S10" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by the National Natural Science Foundation of China (Grant Nos. 31870979 and 61906117) and the Shanghai Sailing Program (Grant No. 19YF1419000).</p>
</sec>
<ack><p>Thanks to the Alzheimer&#x2019;s Disease Neuroimaging Initiative (ADNI) for fMRI data sharing.</p>
</ack>
<sec id="S12" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fnagi.2022.888575/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fnagi.2022.888575/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.XLSX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bi</surname> <given-names>X. A.</given-names></name> <name><surname>Jiang</surname> <given-names>Q.</given-names></name> <name><surname>Sun</surname> <given-names>Q.</given-names></name> <name><surname>Shu</surname> <given-names>Q.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Analysis of Alzheimer&#x2019;s disease based on the random neural network cluster in fMRI.</article-title> <source><italic>Front. Neuroinform.</italic></source> <volume>12</volume>:<issue>60</issue>. <pub-id pub-id-type="doi">10.3389/fninf.2018.00060</pub-id> <pub-id pub-id-type="pmid">30245623</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buckner</surname> <given-names>R. L.</given-names></name> <name><surname>Sepulcre</surname> <given-names>J.</given-names></name> <name><surname>Talukdar</surname> <given-names>T.</given-names></name> <name><surname>Krienen</surname> <given-names>F. M.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Hedden</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Cortical hubs revealed by intrinsic functional connectivity: mapping, assessment of stability, and relation to Alzheimer&#x2019;s disease.</article-title> <source><italic>J. Neurosci.</italic></source> <volume>29</volume> <fpage>1860</fpage>&#x2013;<lpage>1873</lpage>. <pub-id pub-id-type="doi">10.1523/JNEUROSCI.5062-08.2009</pub-id> <pub-id pub-id-type="pmid">19211893</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chyzhyk</surname> <given-names>D.</given-names></name> <name><surname>Savio</surname> <given-names>A.</given-names></name> <name><surname>Gra&#x00F1;a</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Computer aided diagnosis of schizophrenia on resting state fMRI data by ensembles of ELM.</article-title> <source><italic>Neural Netw.</italic></source> <volume>68</volume> <fpage>23</fpage>&#x2013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2015.04.002</pub-id> <pub-id pub-id-type="pmid">25965771</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>P.</given-names></name> <name><surname>Chen</surname> <given-names>C. P.</given-names></name> <name><surname>Sun</surname> <given-names>Y.</given-names></name></person-group> (<year>1995</year>). &#x201C;<article-title>An exact supervised learning for a three-layer supervised neural network</article-title>,&#x201D; in <source><italic>Proceedings of the1995 International Conference on Neural Information Processing</italic></source>, (<publisher-loc>Beijing</publisher-loc>: <publisher-name>Publishing House of Electronics Industry</publisher-name>), <fpage>1041</fpage>&#x2013;<lpage>1044</lpage>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hao</surname> <given-names>X. K.</given-names></name> <name><surname>An</surname> <given-names>Q. J.</given-names></name> <name><surname>Jiang</surname> <given-names>T.</given-names></name> <name><surname>Guo</surname> <given-names>Y. C.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Shi</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2021</year>). &#x201C;<article-title>Multi-template neuroimaging feature selection using weight-constrained low-rank learning for alzheimer&#x2019;s disease classification</article-title>,&#x201D; in <source><italic>Proceedings of the 2021 36th Youth Academic Annual Conference of Chinese Association of Automation (YAC)</italic></source>, <publisher-loc>Nanchang</publisher-loc>, <fpage>173</fpage>&#x2013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1109/YAC53711.2021.9486487</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.</given-names></name> <name><surname>Song</surname> <given-names>S. J.</given-names></name> <name><surname>Gupta</surname> <given-names>J. N. D.</given-names></name> <name><surname>Wu</surname> <given-names>C.</given-names></name></person-group> (<year>2014</year>). <article-title>Semi-supervised and unsupervised extreme learning machines.</article-title> <source><italic>IEEE Trans. Cybern.</italic></source> <volume>44</volume> <fpage>2405</fpage>&#x2013;<lpage>2417</lpage>. <pub-id pub-id-type="doi">10.1109/tcyb.2014.2307349</pub-id> <pub-id pub-id-type="pmid">25415946</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Babri</surname> <given-names>H. A.</given-names></name></person-group> (<year>1998</year>). <article-title>Upper bounds on the number of hidden neurons in feedforward networks with arbitrary bounded nonlinear activation functions.</article-title> <source><italic>IEEE Trans. Neural Netw.</italic></source> <volume>9</volume> <fpage>224</fpage>&#x2013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1109/72.655045</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2007</year>). <article-title>Convex incremental extreme learning machine.</article-title> <source><italic>Neurocomputing</italic></source> <volume>70</volume> <fpage>3056</fpage>&#x2013;<lpage>3062</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2007.02.009</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2008</year>). <article-title>Enhanced random search based incremental extreme learning machine.</article-title> <source><italic>Neurocomputing</italic></source> <volume>71</volume> <fpage>3460</fpage>&#x2013;<lpage>3468</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2007.10.008</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Wang</surname> <given-names>D. H.</given-names></name> <name><surname>Lan</surname> <given-names>Y.</given-names></name></person-group> (<year>2011</year>). <article-title>Extreme learning machines: a survey.</article-title> <source><italic>Int. J. Mach. Learn. Cybern.</italic></source> <volume>2</volume> <fpage>107</fpage>&#x2013;<lpage>122</lpage>. <pub-id pub-id-type="doi">10.1007/s13042-011-0019-y</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Zhou</surname> <given-names>H. M.</given-names></name> <name><surname>Ding</surname> <given-names>X. J.</given-names></name> <name><surname>Zhang</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Extreme learning machine for regression and multiclass classification.</article-title> <source><italic>IEEE Trans. Syst. Man. Cybern.</italic></source> <volume>42</volume> <fpage>513</fpage>&#x2013;<lpage>529</lpage>. <pub-id pub-id-type="doi">10.1109/tsmcb.2011.2168604</pub-id> <pub-id pub-id-type="pmid">21984515</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Zhu</surname> <given-names>Q. Y.</given-names></name> <name><surname>Siew</surname> <given-names>C. K.</given-names></name></person-group> (<year>2004</year>). &#x201C;<article-title>Extreme learning machine: a new learning scheme of feedforward neural networks,</article-title>&#x201D; in <source><italic>Proceedings of the IEEE International Joint Conference on Neural Networks</italic></source> (<publisher-loc>Budapest</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>985</fpage>&#x2013;<lpage>990</lpage>. <pub-id pub-id-type="doi">10.1109/IJCNN.2004.1380068</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Zhu</surname> <given-names>Q. Y.</given-names></name> <name><surname>Siew</surname> <given-names>C. K.</given-names></name></person-group> (<year>2006</year>). <article-title>Extreme learning machine: theory and applications.</article-title> <source><italic>Neurocomputing</italic></source> <volume>70</volume> <fpage>489</fpage>&#x2013;<lpage>501</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2005.12.126</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>Z.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Gu</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>An efficient method for traffic sign recognition based on extreme learning machine.</article-title> <source><italic>IEEE Trans. Cybern.</italic></source> <volume>47</volume> <fpage>920</fpage>&#x2013;<lpage>933</lpage>. <pub-id pub-id-type="doi">10.1109/TCYB.2016.2533424</pub-id> <pub-id pub-id-type="pmid">26992185</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kazemifar</surname> <given-names>S.</given-names></name> <name><surname>Drozd</surname> <given-names>J. J.</given-names></name> <name><surname>Rajakumar</surname> <given-names>N.</given-names></name> <name><surname>Borrie</surname> <given-names>M. J.</given-names></name> <name><surname>Bartha</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>Automated algorithm to measure changes in medial temporal lobe volume in Alzheimer disease.</article-title> <source><italic>J. Neurosci. Meth.</italic></source> <volume>227</volume> <fpage>35</fpage>&#x2013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1016/j.jneumeth.2014.01.033</pub-id> <pub-id pub-id-type="pmid">24518149</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Jang</surname> <given-names>G. J.</given-names></name> <name><surname>Lee</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Fast learning method for convolutional neural networks using extreme learning machine and its application to lane detection.</article-title> <source><italic>Neural Netw.</italic></source> <volume>87</volume> <fpage>109</fpage>&#x2013;<lpage>121</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2016.12.002</pub-id> <pub-id pub-id-type="pmid">28110106</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lama</surname> <given-names>R. K.</given-names></name> <name><surname>Kwon</surname> <given-names>G. R.</given-names></name></person-group> (<year>2021</year>). <article-title>Diagnosis of Alzheimer&#x2019;s disease using brain network.</article-title> <source><italic>Front. Neurosci.</italic></source> <volume>15</volume>:<issue>605115</issue>. <pub-id pub-id-type="doi">10.3389/fnins.2021.605115</pub-id> <pub-id pub-id-type="pmid">33613178</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Zeng</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>T. L.</given-names></name> <name><surname>Xu</surname> <given-names>B.</given-names></name></person-group> (<year>2016</year>). <article-title>Parallel brain simulator: a multi-scale and parallel brain-inspired neural network modeling and simulation platform.</article-title> <source><italic>Cogn. Comput.</italic></source> <volume>8</volume> <fpage>967</fpage>&#x2013;<lpage>981</lpage>. <pub-id pub-id-type="doi">10.1007/s12559-016-9411-y</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X. Y.</given-names></name> <name><surname>Gao</surname> <given-names>C. H.</given-names></name> <name><surname>Li</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>A comparative analysis of support vector machines and extreme learning machines.</article-title> <source><italic>Neural Netw.</italic></source> <volume>33</volume> <fpage>58</fpage>&#x2013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.1016/j.neunet.2012.04.002</pub-id> <pub-id pub-id-type="pmid">22572469</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Malik</surname> <given-names>A. K.</given-names></name> <name><surname>Ganaie</surname> <given-names>M. A.</given-names></name> <name><surname>Tanveer</surname> <given-names>M.</given-names></name> <name><surname>Suganthan</surname> <given-names>P. N.</given-names></name></person-group> <collab>Alzheimer&#x2019;s Disease, and Neuroimaging Initiative</collab> (<year>2022</year>). <article-title>Alzheimer&#x2019;s disease diagnosis via intuitionistic fuzzy random vector functional link network.</article-title> <source><italic>IEEE Trans. Comput. Soc. Syst.</italic></source> (<comment>in press</comment>). <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1109/TCSS.2022.3146974</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miche</surname> <given-names>Y.</given-names></name> <name><surname>Sorjamaa</surname> <given-names>A.</given-names></name> <name><surname>Bas</surname> <given-names>P.</given-names></name> <name><surname>Simula</surname> <given-names>O.</given-names></name> <name><surname>Jutten</surname> <given-names>C.</given-names></name> <name><surname>Lendasse</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>OP-ELM: optimally pruned extreme learning machine.</article-title> <source><italic>IEEE Trans. Neural Netw.</italic></source> <volume>21</volume> <fpage>158</fpage>&#x2013;<lpage>162</lpage>. <pub-id pub-id-type="doi">10.1109/TNN.2009.2036259</pub-id> <pub-id pub-id-type="pmid">20007026</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miche</surname> <given-names>Y.</given-names></name> <name><surname>van Heeswijk</surname> <given-names>M.</given-names></name> <name><surname>Bas</surname> <given-names>P.</given-names></name> <name><surname>Simula</surname> <given-names>O.</given-names></name> <name><surname>Lendasse</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>TROP-ELM: a double-regularized ELM using LARS and Tikhonov regularization.</article-title> <source><italic>Neurocomputing</italic></source> <volume>74</volume> <fpage>2413</fpage>&#x2013;<lpage>2421</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2010.12.042</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nguyen</surname> <given-names>D. T.</given-names></name> <name><surname>Ryu</surname> <given-names>S.</given-names></name> <name><surname>Qureshi</surname> <given-names>M. N. I.</given-names></name> <name><surname>Choi</surname> <given-names>M.</given-names></name> <name><surname>Lee</surname> <given-names>K. H.</given-names></name> <name><surname>Lee</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Hybrid multivariate pattern analysis combined with extreme learning machine for Alzheimer&#x2019;s dementia diagnosis using multi-measure rs-fMRI spatial patterns.</article-title> <source><italic>PLoS One</italic></source> <volume>14</volume>:<issue>e0212582</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0212582</pub-id> <pub-id pub-id-type="pmid">30794629</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pao</surname> <given-names>Y. H.</given-names></name> <name><surname>Park</surname> <given-names>G. H.</given-names></name> <name><surname>Sobajic</surname> <given-names>D. J.</given-names></name></person-group> (<year>1994</year>). <article-title>Learning and generalization characteristics of the random vector functional-link net.</article-title> <source><italic>Neurocomputing</italic></source> <volume>6</volume> <fpage>163</fpage>&#x2013;<lpage>180</lpage>. <pub-id pub-id-type="doi">10.1016/0925-2312(94)90053-1</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richhariya</surname> <given-names>B.</given-names></name> <name><surname>Tanveer</surname> <given-names>M.</given-names></name> <name><surname>Rashid</surname> <given-names>A. H.</given-names></name></person-group> <collab>Alzheimer&#x2019;s Disease Neuroimaging Initiative</collab> (<year>2020</year>). <article-title>Diagnosis of Alzheimer&#x2019;s disease using universum support vector machine based recursive feature elimination (USVM-RFE).</article-title> <source><italic>Biomed. Signal. Proces.</italic></source> <volume>59</volume>:<issue>101903</issue>. <pub-id pub-id-type="doi">10.1016/j.bspc.2020.101903</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rong</surname> <given-names>H. J.</given-names></name> <name><surname>Ong</surname> <given-names>Y. S.</given-names></name> <name><surname>Tan</surname> <given-names>A. H.</given-names></name> <name><surname>Zhu</surname> <given-names>Z. X.</given-names></name></person-group> (<year>2008</year>). <article-title>A fast pruned-extreme learning machine for classification problem.</article-title> <source><italic>Neurocomputing</italic></source> <volume>72</volume> <fpage>359</fpage>&#x2013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2008.01.005</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rubinov</surname> <given-names>M.</given-names></name> <name><surname>Sporns</surname> <given-names>O.</given-names></name></person-group> (<year>2010</year>). <article-title>Complex network measures of brain connectivity: uses and interpretations.</article-title> <source><italic>Neuroimage</italic></source> <volume>52</volume> <fpage>1059</fpage>&#x2013;<lpage>1069</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2009.10.003</pub-id> <pub-id pub-id-type="pmid">19819337</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sadiq</surname> <given-names>A.</given-names></name> <name><surname>Yahya</surname> <given-names>N.</given-names></name> <name><surname>Tang</surname> <given-names>T. B.</given-names></name></person-group> (<year>2021</year>). &#x201C;<article-title>Diagnosis of Alzheimer&#x2019;s disease using pearson&#x2019;s correlation and relieff feature selection approach</article-title>,&#x201D; in <source><italic>Proceedings of the 2021 International Conference on Decision Aid Sciences and Application (DASA)</italic></source>, <publisher-loc>Bahrain</publisher-loc>, <fpage>578</fpage>&#x2013;<lpage>582</lpage>. <pub-id pub-id-type="doi">10.1109/DASA53625.2021.9682409</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>W. F.</given-names></name> <name><surname>Kraaijveld</surname> <given-names>M. A.</given-names></name> <name><surname>Duin</surname> <given-names>R. P.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>Feed forward neural networks with random weights</article-title>,&#x201D; in <source><italic>Proceedings of the International Conference on Pattern Recognition</italic></source>, <publisher-loc>Netherlands</publisher-loc>, <fpage>1</fpage>&#x2013;<lpage>1</lpage>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>R.</given-names></name> <name><surname>Goel</surname> <given-names>T.</given-names></name> <name><surname>Tanveer</surname> <given-names>M.</given-names></name> <name><surname>Dwivedi</surname> <given-names>S.</given-names></name> <name><surname>Murugan</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <article-title>FAF-DRVFL: fuzzy activation function based deep random vector functional links network for early diagnosis of Alzheimer disease.</article-title> <source><italic>Appl. Soft Comput.</italic></source> <volume>106</volume>:<issue>107371</issue>. <pub-id pub-id-type="doi">10.1016/j.asoc.2021.107371</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Suganthan</surname> <given-names>P. N.</given-names></name> <name><surname>Katuwal</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <article-title>On the origins of randomization-based feedforward neural networks.</article-title> <source><italic>Appl. Soft Comput.</italic></source> <volume>105</volume>:<issue>107239</issue>. <pub-id pub-id-type="doi">10.1016/j.asoc.2021.107239</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Supekar</surname> <given-names>K.</given-names></name> <name><surname>Menon</surname> <given-names>V.</given-names></name> <name><surname>Rubin</surname> <given-names>D.</given-names></name> <name><surname>Musen</surname> <given-names>M.</given-names></name> <name><surname>Greicius</surname> <given-names>M. D.</given-names></name> <name><surname>Sporns</surname> <given-names>O.</given-names></name></person-group> (<year>2008</year>). <article-title>Network analysis of intrinsic functional brain connectivity in Alzheimer&#x2019;s Disease.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>4</volume>:<issue>e1000100</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1000100</pub-id> <pub-id pub-id-type="pmid">18584043</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Susanne</surname> <given-names>G. M.</given-names></name> <name><surname>Michael</surname> <given-names>W. W.</given-names></name> <name><surname>Leon</surname> <given-names>J. T.</given-names></name> <name><surname>Ronald</surname> <given-names>C. P.</given-names></name> <name><surname>Clifford</surname> <given-names>R. J.</given-names></name> <name><surname>William</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>Ways toward an early diagnosis in Alzheimer&#x2019;s disease: the Alzheimer&#x2019;s Disease Neuroimaging Initiative (ADNI).</article-title> <source><italic>Alzheimers Dement.</italic></source> <volume>1</volume> <fpage>55</fpage>&#x2013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.1016/j.jalz.2005.06.003</pub-id> <pub-id pub-id-type="pmid">17476317</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>J. X.</given-names></name> <name><surname>Deng</surname> <given-names>C. W.</given-names></name> <name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Zhao</surname> <given-names>B. J.</given-names></name></person-group> (<year>2015</year>). <article-title>Compressed-domain ship detection on spaceborne optical image using deep neural network and extreme learning machine.</article-title> <source><italic>IEEE Trans. Geosci. Remote</italic></source> <volume>53</volume> <fpage>1174</fpage>&#x2013;<lpage>1185</lpage>. <pub-id pub-id-type="doi">10.1109/tgrs.2014.2335751</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanveer</surname> <given-names>M.</given-names></name> <name><surname>Richhariya</surname> <given-names>B.</given-names></name> <name><surname>Khan</surname> <given-names>R. U.</given-names></name> <name><surname>Rashid</surname> <given-names>A. H.</given-names></name> <name><surname>Khanna</surname> <given-names>P.</given-names></name> <name><surname>Prasad</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Machine learning techniques for the diagnosis of Alzheimer&#x2019;s disease: a review.</article-title> <source><italic>ACM Trans. Multim. Comput.</italic></source> <volume>16</volume> <fpage>1</fpage>&#x2013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.1145/3344998</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tzourio-Mazoyer</surname> <given-names>N.</given-names></name> <name><surname>Landeau</surname> <given-names>B.</given-names></name> <name><surname>Papathanassiou</surname> <given-names>D.</given-names></name> <name><surname>Crivello</surname> <given-names>F.</given-names></name> <name><surname>Etard</surname> <given-names>O.</given-names></name> <name><surname>Delcroix</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain.</article-title> <source><italic>Neuroimage</italic></source> <volume>15</volume> <fpage>273</fpage>&#x2013;<lpage>289</lpage>. <pub-id pub-id-type="doi">10.1006/nimg.2001.0978</pub-id> <pub-id pub-id-type="pmid">11771995</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>S. G.</given-names></name> <name><surname>Deng</surname> <given-names>C. W.</given-names></name> <name><surname>Lin</surname> <given-names>W. S.</given-names></name> <name><surname>Huang</surname> <given-names>G. B.</given-names></name> <name><surname>Zhao</surname> <given-names>B. J.</given-names></name></person-group> (<year>2016</year>). <article-title>NMF-based image quality assessment using extreme learning machine.</article-title> <source><italic>IEEE Trans. Cybern.</italic></source> <volume>47</volume> <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1109/TCYB.2015.2512852</pub-id> <pub-id pub-id-type="pmid">26863686</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xia</surname> <given-names>M. R.</given-names></name> <name><surname>Wang</surname> <given-names>J. H.</given-names></name> <name><surname>He</surname> <given-names>Y.</given-names></name></person-group> (<year>2013</year>). <article-title>BrainNet viewer: a network visualization tool for human brain connectomics.</article-title> <source><italic>PLoS. One</italic></source> <volume>8</volume>:<issue>e68910</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0068910</pub-id> <pub-id pub-id-type="pmid">23861951</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>C. G.</given-names></name> <name><surname>Zang</surname> <given-names>Y. F.</given-names></name></person-group> (<year>2010</year>). <article-title>DPARSF: a MATLAB toolbox for &#x201C;pipeline&#x201D; data analysis of resting-state fMRI.</article-title> <source><italic>Front. Syst. Neurosci.</italic></source> <volume>4</volume>:<issue>13</issue>. <pub-id pub-id-type="doi">10.3389/fnsys.2010.00013</pub-id> <pub-id pub-id-type="pmid">20577591</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Gennatas</surname> <given-names>E. D.</given-names></name> <name><surname>Kramer</surname> <given-names>J. H.</given-names></name> <name><surname>Miller</surname> <given-names>B. L.</given-names></name> <name><surname>Seeley</surname> <given-names>W. W.</given-names></name></person-group> (<year>2012</year>). <article-title>Predicting regional neurodegeneration from the healthy brain functional connectome.</article-title> <source><italic>Neuron</italic></source> <volume>73</volume> <fpage>1216</fpage>&#x2013;<lpage>1227</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuron.2012.03.004</pub-id> <pub-id pub-id-type="pmid">22445348</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://rfmri.org/DPARSF">http://rfmri.org/DPARSF</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.fil.ion.ucl.ac.uk/spm/software/spm12/">https://www.fil.ion.ucl.ac.uk/spm/software/spm12/</ext-link></p></fn>
</fn-group>
</back>
</article>