<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2022.874658</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Identification of foodborne pathogenic bacteria using confocal Raman microspectroscopy and chemometrics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Zhang</surname> <given-names>Jin</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1676130/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Gao</surname> <given-names>Pengya</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wu</surname> <given-names>Yuan</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Yan</surname> <given-names>Xiaomei</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ye</surname> <given-names>Changyun</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Liang</surname> <given-names>Weili</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Yan</surname> <given-names>Meiying</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Xu</surname> <given-names>Xuefang</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/735060/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Jiang</surname> <given-names>Hong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Criminal Investigation School, People&#x2019;s Public Security University of China</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>State Key Laboratory for Infectious Disease Prevention and Control, National Institute for Communicable Diseases Control and Prevention, Chinese Center for Disease Control and Prevention</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Lin Lin, Jiangsu University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Siong Fong Sim, Universiti Malaysia Sarawak, Malaysia; Meikun Fan, Southwest Jiaotong University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Xuefang Xu, <email>xuxuefang@icdc.cn</email></corresp>
<corresp id="c002">Hong Jiang, <email>jiangh2001@163.com</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Food Microbiology, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>11</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>874658</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>10</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Zhang, Gao, Wu, Yan, Ye, Liang, Yan, Xu and Jiang.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Zhang, Gao, Wu, Yan, Ye, Liang, Yan, Xu and Jiang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Rapid and accurate identification of foodborne pathogenic bacteria is of great importance because they are often responsible for the majority of serious foodborne illnesses. The confocal Raman microspectroscopy (CRM) is a fast and easy-to-use method known for its effectiveness in detecting and identifying microorganisms. This study demonstrates that CRM combined with chemometrics can serve as a rapid, reliable, and efficient method for the detection and identification of foodborne pathogenic bacteria without any laborious pre-treatments. Six important foodborne pathogenic bacteria including <italic>S. flexneri</italic>, <italic>L. monocytogenes</italic>, <italic>V. cholerae</italic>, <italic>S. aureus</italic>, <italic>S. typhimurium</italic>, and <italic>C. botulinum</italic> were investigated with CRM. These pathogenic bacteria can be differentiated based on several characteristic peaks and peak intensity ratio. Principal component analysis (PCA) was used for investigating the difference of various samples and reducing the dimensionality of the dataset. Performances of some classical classifiers were compared for bacterial detection and identification including decision tree (DT), artificial neural network (ANN), and Fisher&#x2019;s discriminant analysis (FDA). Correct recognition ratio (CRR), area under the receiver operating characteristic curve (ROC), cumulative gains, and lift charts were used to evaluate the performance of models. The impact of different pretreatment methods on the models was explored, and pretreatment methods include Savitzky&#x2013;Golay algorithm smoothing (SG), standard normal variate (SNV), multivariate scatter correction (MSC), and Savitzky&#x2013;Golay algorithm 1st Derivative (SG 1st Der). In the DT, ANN, and FDA model, FDA is more robust for overfitting problem and offers the highest accuracy. Most pretreatment methods raised the performance of the models except SNV. The results revealed that CRM coupled with chemometrics offers a powerful tool for the discrimination of foodborne pathogenic bacteria.</p>
</abstract>
<kwd-group>
<kwd>foodborne pathogenic bacteria</kwd>
<kwd>confocal Raman microspectroscopy (CRM)</kwd>
<kwd>pretreatment</kwd>
<kwd>chemometrics</kwd>
<kwd>classification</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Key Research and Development Program of China<named-content content-type="fundref-id">10.13039/501100012166</named-content></contract-sponsor>
<counts>
<fig-count count="7"/>
<table-count count="7"/>
<equation-count count="0"/>
<ref-count count="56"/>
<page-count count="13"/>
<word-count count="7974"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>The World Health Organization (WHO) survey results indicated that foodborne diseases are increasingly reported as serious public health problems around the world (<xref ref-type="bibr" rid="B51">Wu et al., 2013</xref>). Billions of people in the world are at risks of unsafe food (<xref ref-type="bibr" rid="B14">Fung et al., 2018</xref>). Millions of people are infected with foodborne diseases every year. It leads to high rates of morbidity and mortality. Meantime, since most of foodborne pathogenic bacteria can survive and even multiply in the harsh environmental conditions, it also presents huge challenges to the production, processing, and storage of food products for the food industry (<xref ref-type="bibr" rid="B15">Giaouris et al., 2015</xref>). Foodborne diseases not only are a serious threat to the health of the people, but also cause inestimable loss of property to consumers and food-related industries. It also poses dramatic negative impact to economic growth, political, and social stability of the country (<xref ref-type="bibr" rid="B6">Chen and Alali, 2018</xref>; <xref ref-type="bibr" rid="B56">Zhou et al., 2019</xref>; <xref ref-type="bibr" rid="B30">Mi et al., 2021</xref>). Thus, it also poses greater importance to solve this problem in view of the seriousness and harmfulness of foodborne diseases (<xref ref-type="bibr" rid="B30">Mi et al., 2021</xref>).</p>
<p>The most commonly well-known bacterial pathogens in connection with foodborne diseases worldwide include <italic>Shigella</italic>, <italic>Listeria monocytogenes</italic>, <italic>Vibrio cholerae</italic>, <italic>Staphylococcus aureus</italic>, <italic>Salmonella</italic>, and <italic>Clostridium botulinum</italic> (<italic>C. botulinum</italic>) (<xref ref-type="bibr" rid="B6">Chen and Alali, 2018</xref>). The constant threats from these bacterial pathogens make rapid and cost-effective detection and discrimination of foodborne pathogenic bacteria a crucial issue for environmental monitoring, food safety, and early diagnosis of diseases (<xref ref-type="bibr" rid="B21">Kant et al., 2018</xref>; <xref ref-type="bibr" rid="B53">Yin et al., 2020</xref>). Traditional culture-based methods are the common and mature techniques for the detection of bacterial pathogens, and simple operation and low cost are the main reason to make these methods popular, whereas it is a slow process and not to achieve the aim for rapid detection in today&#x2019;s food industry (<xref ref-type="bibr" rid="B51">Wu et al., 2013</xref>). Methods based on immunology include enzyme-linked immunosorbent assay (ELISA) (<xref ref-type="bibr" rid="B46">Vaz-Velho et al., 2000</xref>; <xref ref-type="bibr" rid="B12">Ferreira et al., 2001</xref>; <xref ref-type="bibr" rid="B17">Hochel et al., 2004</xref>), immunomagnetic separation technique (<xref ref-type="bibr" rid="B47">Wang and Slavik, 1999</xref>; <xref ref-type="bibr" rid="B40">Taban et al., 2009</xref>), and immunofluorescence labeling technique (<xref ref-type="bibr" rid="B52">Yan et al., 2008</xref>). ELISA is more common among them, and the advantages of ELISA are fast separation speed, high sensitivity, and specificity for bacterial types and strains, whereas rapid detection using ELISA in the field is impractical due to the requirement of multiple steps, various chemical reagents, and time-consuming incubation (<xref ref-type="bibr" rid="B51">Wu et al., 2013</xref>). Molecular biology methods have been extensively adopted for microbial detection and identification in the past few decades (<xref ref-type="bibr" rid="B30">Mi et al., 2021</xref>), such as pulsed-field gel electrophoresis (PFGE) (<xref ref-type="bibr" rid="B43">Umeda et al., 2009</xref>; <xref ref-type="bibr" rid="B38">Skarin et al., 2010</xref>; <xref ref-type="bibr" rid="B2">Anza et al., 2014</xref>), amplified fragment length polymorphism (AFLP) (<xref ref-type="bibr" rid="B22">Keto-Timonen et al., 2005</xref>), DNA microarray (<xref ref-type="bibr" rid="B3">Artin et al., 2010</xref>; <xref ref-type="bibr" rid="B36">Raphael et al., 2010</xref>; <xref ref-type="bibr" rid="B45">Vanhomwegen et al., 2013</xref>; <xref ref-type="bibr" rid="B32">Ng and Lin, 2014</xref>), multilocus sequence typing (MLST) (<xref ref-type="bibr" rid="B28">Macdonald et al., 2011</xref>; <xref ref-type="bibr" rid="B34">Olsen et al., 2014</xref>), and polymerase chain reaction (PCR) (<xref ref-type="bibr" rid="B39">Szabo et al., 1994</xref>; <xref ref-type="bibr" rid="B5">Braconnier et al., 2001</xref>; <xref ref-type="bibr" rid="B26">Lindstr&#x00F6;m et al., 2001</xref>; <xref ref-type="bibr" rid="B9">Dahlsten et al., 2008</xref>; <xref ref-type="bibr" rid="B10">De Medici et al., 2009</xref>). These methods can be carried out without time-consuming incubation compared to culture-based methods, but there are some inevitable restrictions limiting their applications. For example, PCR is based on nucleic acid amplification and consequently cannot discriminate nucleic acid amplified from viable and non-viable bacteria (<xref ref-type="bibr" rid="B51">Wu et al., 2013</xref>). In summary, many existing detection techniques have great limitations (<xref ref-type="bibr" rid="B30">Mi et al., 2021</xref>), and it is necessary to find a rapid and efficient method for rapid multispecies tests (<xref ref-type="bibr" rid="B29">Maquelin et al., 2002</xref>; <xref ref-type="bibr" rid="B16">Ho et al., 2019</xref>).</p>
<p>Raman spectroscopy has already been recognized as a powerful analytical technique for rapid characterization and detection of bacteria without external labels or tedious preparation. The Raman spectrum deriving from molecular vibrations can be considered as a typical whole-organism fingerprint of the biochemical composition of microorganisms. This vibrational spectrum could show the differences of the molecular compositions in various bacterial pathogens at the molecular level (<xref ref-type="bibr" rid="B29">Maquelin et al., 2002</xref>; <xref ref-type="bibr" rid="B25">Lin et al., 2019</xref>). Thus, Raman spectra can be used to infer strain-specific physiological, metabolic, and phenotypic states of bacterial cells (<xref ref-type="bibr" rid="B8">Chisanga et al., 2020</xref>).</p>
<p>Confocal Raman microscopy (CRM) is a powerful optical spectroscopy technique. It combines Raman spectroscopy with a confocal microscope. It is with advantages such as its fingerprint-identification capability and great sensitivity in aqueous medium (<xref ref-type="bibr" rid="B1">Andrei et al., 2020</xref>). It gives the opportunity to identify single bacterial cell in high spectral resolution, combining the power of 3D sample analysis with focused biological component. A laser beam of approximately 1 mm with known wavelength is used to analyze a sample. The scattered radiation and energy shift are measured, and differentiation of species and even strain level is achieved by the acquired chemical characteristic information of the sample (<xref ref-type="bibr" rid="B37">Serrano et al., 2015</xref>; <xref ref-type="bibr" rid="B23">Kriem et al., 2020</xref>). In biomedicine, CRM has been applied in the discrimination, classification, and diagnosis of pathological conditions, such as various malignancies and tumors. However, few reports have addressed the use of this technique in the detection and discrimination of foodborne pathogenic bacteria.</p>
<p>Since the main biological components are similar in different foodborne pathogenic bacteria, such as nucleic acids, proteins, lipids, and carbohydrates, it always leads to high similarity of Raman spectrum. Thus, it is important to apply chemometrics to spectral data for distinguishing different bacteria species. Statistical approaches include unsupervised techniques and supervised techniques. In the unsupervised techniques, unlabeled datasets are analyzed and clustered without the need for human intervention. Principle component analysis (PCA) is one of the common unsupervised technique for spectral analysis (<xref ref-type="bibr" rid="B23">Kriem et al., 2020</xref>). In the supervised techniques, the aim is to classify data or predict outcomes accurately in labeled datasets. Some classical supervised classifiers include decision tree (DT), artificial neural network (ANN), and Fisher&#x2019;s discriminant analysis (FDA).</p>
<p>In this study, we aim to evaluate and examine the potential of the CRM and chemometrics methods for the detection and classification of six foodborne pathogenic bacteria. We also explored the impact of different pretreatment methods on the models including Savitzky&#x2013;Golay algorithm smoothing (SG), standard normal variate (SNV), multivariate scatter correction (MSC), and Savitzky&#x2013;Golay algorithm 1st Derivative (SG 1st Der). PCA was used for investigating the difference of various samples and reducing the dimensionality of the dataset and extracting feature. Performances of classical classifiers were compared for bacterial detection and identification including DT, ANN, and FDA model. Correct recognition ratio (CRR), area under the receiver operating characteristic curve (ROC), cumulative gains, and lift charts were employed to evaluate the performance of models. According to our results, CRM combined with chemometrics offered a powerful tool for the discrimination of foodborne pathogenic bacteria. As far as we know, this is the first study for the identification of six foodborne pathogenic bacteria using CRM coupled with DT, ANN, and FDA classifiers along with four single pretreatment methods.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="S2.SS1">
<title>Preparation of bacterial samples</title>
<p>The following bacteria were used in the study; they are <italic>Salmonella typhimurium</italic> (<italic>S. typhimurium</italic>) (LT2, Sa 11030), <italic>Shigella flexneri</italic> (<italic>S. flexneri</italic>), <italic>Listeria monocytogenes</italic> (<italic>L. monocytogenes</italic>) (Lin), <italic>Vibrio cholerae</italic> (<italic>V. cholerae</italic>) (Non-toxigenic strain, 93097), <italic>Staphylococcus aureus</italic> (<italic>S. aureus</italic>) (ATCC 25923), and <italic>C. botulinum</italic>. The strains were provided by the State Key Laboratory for Infectious Disease Prevention and Control, National Institute for Communicable Diseases Control and Prevention, and Chinese Center for Disease Control and Prevention (ICDC; Beijing, China).</p>
<p><italic>C. botulinum</italic> strains were stored in TPGY broth mixed with glycerol at &#x2013;70<sup>&#x00B0;</sup>C; other strains were housed in Luria-Bertani broth mixed with glycerol at &#x2013;70<sup>&#x00B0;</sup>C. To multiply bacterial cells, <italic>C. botulinum</italic> strains were grown in TPGY broth for 24 h at 37<sup>&#x00B0;</sup>C anaerobically; other strains were grown in Luria-Bertani broth for 24 h at 37<sup>&#x00B0;</sup>C. The culture media were purchased from Beijing Land Bridge Technology Co., Ltd. (Beijing, China). All spectra were collected during the stationary phase to avoid the influence of different growth periods of bacteria. In this study, their initial and final OD<sub>600</sub> values are approximately equal based on the results of ultraviolet spectrophotometer (Varian, USA).</p>
</sec>
<sec id="S2.SS2">
<title>Bacteria sample pretreatment</title>
<p>Each culture was vortexed quickly and centrifuged for 8 min at 4,000 rpm at 4<sup>&#x00B0;</sup>C using Centrifuge 5418 R (Eppendorf, Germany). The supernatant was discarded after centrifugation. Subsequently, 5 mL of 0.9% NaCl solution was added and stirred. This procedure was repeated one time. The sediment was suspended in 1 mL of 0.9% NaCl solution and transferred to a new microcentrifuge tube (1.5 mL). The suspension was centrifuged for 3 min at 13,500 rpm at room temperature using microcentrifuge (Pic017; Heraeus, Germany). This procedure was repeated two times. Finally, the sediment was resuspended in 200 &#x03BC;L of 0.9% NaCl solution. To record the spectra, about 5 &#x03BC;L of solution was dropped onto the aluminum foil and dried at room temperature for 5 min for Raman spectra measuring (<xref ref-type="bibr" rid="B54">Zhang et al., 2021</xref>).</p>
</sec>
<sec id="S2.SS3">
<title>Spectra collection</title>
<p>All Raman experiments were performed using confocal and high performance Raman microscope (XploRA PLUS, HORIBA, Japan). Bacteria samples were detected with a 532-nm laser under a 600 g/mm grating, the laser power is 1 mW, a spectral resolution of 0.6 cm<sup>&#x2013;1</sup>, and 50X objective lens. The integration time was 10 s. Each bacteria sample was collected six times. Each bacterial spectrum was collected and consisted of 603 points in the range of 600&#x2013;1,800 cm<sup>&#x2013;1</sup>. All the spectra were processed <italic>via</italic> Origin software (OriginLab, USA). The background was removed using baseline correction (method: second derivative, baseline mode: user defined, number of baseline points: 16). LabSpec 6.3 (HORIBA, Japan) was utilized to optimize acquisition parameters and collect sufficient Raman spectra for statistical analysis. A single bacteria cell was mapped since the advantage of confocal Raman. Eighty samples of each type of bacteria were cultured, Raman spectra were collected, respectively, and a total of 480 Raman spectra were collected.</p>
<p>By increasing the integration time and the number of scans, the spectra jumper by external interference can be eliminated, and the signal strength of the Raman spectrum can be promoted. However, this method takes longer to collect spectra. Therefore, it is necessary to choose an appropriate integration time and number of scans. Clostridium botulinum was chosen as the test object. Based on the same experimental conditions, the integration time was set to 5, 10, and 15 s, respectively, and the Raman spectra of Clostridium botulinum under different integration times were compared. Based on the same experimental conditions, the number of scans was set to 3, 6, and 9, respectively, and the Raman spectra of Clostridium botulinum under different scan times were compared.</p>
<p>In each bacterium, a sample was randomly selected and tested 10 times in parallel to examine the repeatability of the method.</p>
</sec>
<sec id="S2.SS4">
<title>Data pretreatment and multivariate analysis</title>
<p>The dataset was preprocessed by four single pretreatment methods, respectively, to explore the impact of various methods on the model. The pretreatment methods included Savitzky&#x2013;Golay algorithm smoothing (SG), SNV, MSC, and SG 1st Der. Among them, SG can remove the noise of the spectral. SNV and MSC can reduce the influence of scattering on the original spectrum. SG 1st Der can eliminate the interference of the baseline and background. All the preprocessing procedures were carried out in the Unscrambler X 10.4 (CAMO, Norway).</p>
<p>Principal component analysis (PCA) is an unsupervised and multivariate technique that projects a set of correlated features onto a set of uncorrelated features using an orthogonal transformation. It preserves the most of information. In this study, PCA was used for two purposes. The first one is to plot the distribution of data based on the PCA scores to investigate the difference of various samples. The second one is to reduce the dimensionality of the dataset and extract feature. Computational costs of classifiers were reduced, and overfitting was prevented.</p>
<p>Decision tree (DT) gives the various outcomes from a series of decisions based on a flowchart-like diagram. It is a strong tool for planning strategy, research analysis, and making decision. One of primary advantage is easy to follow and understand. There are four popular algorithms in DT model; they are CHAID, exhaustive CHAID, CART, and QUEST algorithm. All of them were applied to build DT models.</p>
<p>ANN is inspired based on modern neuroscience research. The large amount of processing unit is utilized to build a complex model. Human brain neural network structure and function are imitated in the ANN model. ANN is widely applied for spectral analysis and identification (<xref ref-type="bibr" rid="B35">Park and Lek, 2016</xref>). There are two popular algorithms in ANN model; they are multilayer perceptron (MLP) and radial basis function (RBF). All of them were applied to build models.</p>
<p>FDA is a supervised technique using a discriminant function to assign data to different groups. FDA is often used to build a model combined with PCA. The PCs from PCA are used in FDA to define and predict classes.</p>
<p>The hold-out method is used as a method of cross-validation in DT, ANN model. In the hold-out method, the dataset is divided into two parts, 70% of the samples are allocated to the training set, and there are 320 samples in the training set. About 30% of the samples are assigned to the testing set, and there are 160 samples in the testing set. They were used to test the predictions of the models (<xref ref-type="bibr" rid="B27">Liu et al., 2018</xref>). Leave-one-out cross-validation (LOOCV) technique is used for the cross-validation of FDA model, and it can examine the quality of the classifier and avoid overfitting. In the LOOCV, one spectrum in the dataset is allocated to the validation data and the remaining spectra are allocated to the training data. This process is repeated until each spectrum is used once as the validation data (<xref ref-type="bibr" rid="B42">Tie et al., 2020</xref>). Therefore, there are 479 samples in the training set, and there is a sample in the testing set. The tests of equality of group means are used for evaluating the potential of every independent variable before constructing the model. CRR, area under the ROC, cumulative gains, and lift charts were used for comparing the performance of classifiers under different pretreatment methods. All statistic procedures were operated in the IBM SPSS Statistics 25 and the Unscrambler X 10.4 software.</p>
</sec>
</sec>
<sec id="S3" sec-type="results|discussion">
<title>Results and discussion</title>
<p>The number of scans is closely related to SNR (signal to noise ratio) level of the data, and the scan time is closely associated with Raman intensity of peaks. To acquire the optimal number and time of scans, 3, 6, and 9 scans were tested. About 5, 10, and 15 s for scan time were also tested.</p>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> is the spectrum based on the different integration times and number of scan. As shown in <xref ref-type="fig" rid="F1">Figure 1A</xref>, the signal intensity of the spectra with an integration time of 10 s is significantly higher than that of 5 s. The peak intensity varied little as the integration time increased. Therefore, 10-s scan time of the sample was considered optimal. As shown in <xref ref-type="fig" rid="F1">Figure 1B</xref>, the peak position and intensity varied little as the number of scan increased. Considering external interfering elements, six scans of the sample were considered optimal.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Spectrum based on the different integration times <bold>(A)</bold> and number of scan <bold>(B)</bold>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g001.tif"/>
</fig>
<p><xref ref-type="fig" rid="F2">Figure 2</xref> is the repeat experiment in the six kinds of bacteria.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Repetitive experimental trial.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g002.tif"/>
</fig>
<p>The sample was tested in parallel for 10 times. It can be seen that the spectrum was very similar. The RSD is less than 5%. The result indicated that the repeatability of the experiment is good.</p>
<sec id="S3.SS1">
<title>Visual analysis of spectra</title>
<p>In this study, the CRM technique was used for the detection and identification of six in total foodborne pathogenic bacteria species, namely <italic>S. typhimurium</italic>, <italic>S. flexneri</italic>, <italic>L. monocytogenes</italic>, <italic>V</italic>. <italic>cholerae</italic>, <italic>S. aureus</italic>, and <italic>C</italic>. <italic>botulinum.</italic></p>
<p>The mean normalized CRM spectra of six bacterial species are shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. Major spectral bands and peaks assignment are shown in <xref ref-type="table" rid="T1">Table 1</xref>. Some spectral bands are common in all species, and every spectrum showed bands at <italic>ca.</italic> 746, 1,140, and 1,245 cm<sup>&#x2013;1</sup> assignable to cytosine and uracil, = C-O-C = (unsaturated fatty acids in lipids), amide III (random), and thymine, respectively. There are also many differences in the spectral images. For example, only <italic>L. monocytogenes</italic> showed a band at &#x223C;1,094 and 1,574 cm<sup>&#x2013;1</sup> assignable to CC skeletal and COC stretch (<xref ref-type="bibr" rid="B55">Zheng et al., 2020</xref>) from glycosidic link, CN stretching of amide II (<xref ref-type="bibr" rid="B11">Fan et al., 2011</xref>), respectively; only <italic>S. aureus</italic> showed a band at &#x223C;1,302 cm<sup>&#x2013;1</sup> assignable to amide III (<xref ref-type="bibr" rid="B20">Jarvis et al., 2004</xref>). A band was located at 1,658 cm<sup>&#x2013;1</sup> in all species except <italic>V</italic>. <italic>cholerae</italic>. There are few differences among <italic>C. botulinum</italic> and <italic>S. flexneri</italic>. However, it is possible to differentiate them by using the ratio of the peak intensities. A total of 1,322/1,350 and 1,463/1,587 were used for differences between <italic>C. botulinum</italic> and <italic>S. flexneri</italic>. A significant difference of peak intensity ratios was shown as depicted in <xref ref-type="table" rid="T2">Table 2</xref>. Therefore, the different species can be characterized based on the abundant and unique spectral information of CRM.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Mean normalized spectra of six bacterial species.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g003.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Raman spectral peak assignment observed in the spectra of six bacterial strains.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Range (cm<sup>&#x2013;1</sup>)</td>
<td valign="top" align="left">Assignment (<xref ref-type="bibr" rid="B13">Franco et al., 2017</xref>; <xref ref-type="bibr" rid="B49">Witkowska et al., 2017</xref>, <xref ref-type="bibr" rid="B50">2018</xref>; <xref ref-type="bibr" rid="B24">Lemma et al., 2019</xref>; <xref ref-type="bibr" rid="B44">Uysal et al., 2020</xref>)</td>
<td valign="top" align="left">Range (cm<sup>&#x2013;1</sup>)</td>
<td valign="top" align="left">Assignment (<xref ref-type="bibr" rid="B13">Franco et al., 2017</xref>; <xref ref-type="bibr" rid="B49">Witkowska et al., 2017</xref>, <xref ref-type="bibr" rid="B50">2018</xref>; <xref ref-type="bibr" rid="B24">Lemma et al., 2019</xref>; <xref ref-type="bibr" rid="B44">Uysal et al., 2020</xref>)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">720&#x2013;740</td>
<td valign="top" align="left">Adenine</td>
<td valign="top" align="left">1,213&#x2013;1,295</td>
<td valign="top" align="left">Amide III (random), thymine</td>
</tr>
<tr>
<td valign="top" align="left">745&#x2013;790</td>
<td valign="top" align="left">Cytosine, uracil</td>
<td valign="top" align="left">1,320&#x2013;1,330</td>
<td valign="top" align="left">Amide III (&#x03B1;-helix)</td>
</tr>
<tr>
<td valign="top" align="left">995&#x2013;1,010</td>
<td valign="top" align="left">Phenylalanine, C aromatic ring stretching</td>
<td valign="top" align="left">1,340&#x2013;1,365</td>
<td valign="top" align="left">Amide III (protein)</td>
</tr>
<tr>
<td valign="top" align="left">1,025&#x2013;1,040</td>
<td valign="top" align="left">Phenylalanine</td>
<td valign="top" align="left">1,455&#x2013;1,465</td>
<td valign="top" align="left">&#x03B4;(CH2) wagging</td>
</tr>
<tr>
<td valign="top" align="left">1,120&#x2013;1,145</td>
<td valign="top" align="left">= C-O-C= (Unsaturated fatty acids in lipids), galactomannan</td>
<td valign="top" align="left">1,580&#x2013;1,590</td>
<td valign="top" align="left">Guanine ring stretching</td>
</tr>
<tr>
<td valign="top" align="left">1,150&#x2013;1,195</td>
<td valign="top" align="left">Aromatic amino acids</td>
<td valign="top" align="left">1,655&#x2013;1,670</td>
<td valign="top" align="left">Amide I (&#x03B1;-helix)</td>
</tr>
</tbody>
</table></table-wrap>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Intensity and intensity ratio of some peaks between <italic>C. botulinum</italic> and <italic>S. flexneri.</italic></p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Type</td>
<td valign="top" align="center" colspan="4">Peak intensity (Arbitr. Units)<hr/></td>
<td valign="top" align="center" colspan="2">Peak intensity ratio<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">1,322 cm<sup>&#x2013;1</sup></td>
<td valign="top" align="center">1,350 cm<sup>&#x2013;1</sup></td>
<td valign="top" align="center">1,463 cm<sup>&#x2013;1</sup></td>
<td valign="top" align="center">1,587 cm<sup>&#x2013;1</sup></td>
<td valign="top" align="center">1,322/1,350</td>
<td valign="top" align="center">1,463/1,587</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>C. Botulinum</italic></td>
<td valign="top" align="center">0.29</td>
<td valign="top" align="center">0.22</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.28</td>
<td valign="top" align="center">1.32</td>
<td valign="top" align="center">1.18</td>
</tr>
<tr>
<td valign="top" align="left"><italic>S. flexneri</italic></td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">0.20</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.23</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">2.26</td>
</tr>
</tbody>
</table></table-wrap>
<p>As shown in <xref ref-type="fig" rid="F3">Figure 3</xref>, mean normalized spectra of different bacterial species have a high level of similarity; although they can be differentiated based on several peaks and peak intensity ratio, visual detection of these minor differences is quite time-consuming and may lead to misdiagnosis (<xref ref-type="bibr" rid="B25">Lin et al., 2019</xref>). Thus, it is essential to analyze the data combined with multivariate analysis techniques for revealing minor spectral differences. In this study, PCA, DT, ANN, and FDA were used for further study.</p>
</sec>
<sec id="S3.SS2">
<title>Unsupervised analysis</title>
<p>In this study, the PCA method was used to construct classification models of bacterial species based on the four kinds of pretreatment methods, and it has reduced the dimensionality of the dataset and extracted feature.</p>
<p><xref ref-type="table" rid="T3">Table 3</xref> shows the results of PCA under different pretreatment methods. It can be seen that the number of PCs and corresponding cumulative contribution rate has changed along with difference of pretreatment methods. <xref ref-type="fig" rid="F4">Figures 4A&#x2013;E</xref> represents PCA score plots under various preprocess methods. These results represented the influence of preprocessed spectra on PCA models and the possibility of differentiation for bacterial species with a high accuracy. <xref ref-type="fig" rid="F4">Figure 4A</xref> shows that all bacterial species can be differentiated. In <xref ref-type="fig" rid="F4">Figures 4B,D</xref>, the samples of <italic>S. aureus</italic> and <italic>S. typhimurium</italic> became more scattered, the categories were closer to each other, and it was difficult to distinguish different categories. In <xref ref-type="fig" rid="F4">Figure 4C</xref>, although the spatial distance of each group became larger, the problem of overlap was more serious. In <xref ref-type="fig" rid="F4">Figure 4E</xref>, the plot had placed samples of the same types closer by SG 1st Der, and overlapping samples were reduced between <italic>C. botulinum</italic> and <italic>S. flexneri</italic> compared to <xref ref-type="fig" rid="F4">Figure 4C</xref>.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Summary of data dimension reduction by PCA under different pretreatment methods.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Pretreatment method</td>
<td valign="top" align="center">The number of PCs</td>
<td valign="top" align="center">Cumulative contribution rate (%)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Original data</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">99.97</td>
</tr>
<tr>
<td valign="top" align="left">SG</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">99.99</td>
</tr>
<tr>
<td valign="top" align="left">SNV</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">90.78</td>
</tr>
<tr>
<td valign="top" align="left">MSC</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">99.98</td>
</tr>
<tr>
<td valign="top" align="left">SG 1st Der</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">87.05</td>
</tr>
</tbody>
</table></table-wrap>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>PCA results obtained for six bacterial species. <bold>(A&#x2013;E)</bold> Represent PCA score plots of various preprocess methods, that is, <bold>(A)</bold> original data, <bold>(B)</bold> SG, <bold>(C)</bold> SNV, <bold>(D)</bold> MSC, and <bold>(E)</bold> SG 1st Der. <bold>(F)</bold> A PC-1 loading plot based on the original data.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g004.tif"/>
</fig>
<p><xref ref-type="fig" rid="F4">Figure 4F</xref> represents the most important variables for classification in the original data. The major positive loadings were located at <italic>ca.</italic> 1,322, 1,353, 1,465, and 1,590 cm<sup>&#x2013;1</sup>. The largest loadings in the positive direction were located at <italic>ca.</italic> 1,353 cm<sup>&#x2013;1</sup> (Amide III (protein)) and <italic>ca.</italic> 1,590 cm<sup>&#x2013;1</sup> (Guanine ring stretching). The major negative loadings were located at <italic>ca.</italic> 755, 1,001, 1,123, 1,221, and 1,302 cm<sup>&#x2013;1</sup>. The largest loadings in the negative direction were located at <italic>ca.</italic> 755 cm<sup>&#x2013;1</sup> (Cytosine or uracil) and <italic>ca.</italic> 1,302 cm<sup>&#x2013;1</sup> (Amide III).</p>
<p>As mentioned above, we can conclude that preliminary classification can be achieved by PCA model combined with the preprocess methods. It is worth noting that PCA is an unsupervised technique, not meant for classification purpose. DT, ANN, and FDA, these supervised and multivariate techniques, were used for the construction of classifiers. They have more advantages than PCA because the information among the classes is taken into account (<xref ref-type="bibr" rid="B48">Wichmann et al., 2020</xref>).</p>
<p>To evaluate the potential of every independent variable before constructing the classification model, the tests of equality of group means are used for this study. <xref ref-type="supplementary-material" rid="DS1">Supplementary Tables 1</xref>&#x2013;<xref ref-type="supplementary-material" rid="DS1">5</xref> give the summary of the tests of equality of group means with spectra preprocessed by various pretreatment methods. In each test, the grouping variable was treated as the factor, and the results of a one-way ANOVA are given for the independent variable. Smaller Wilk&#x2019;s lambda values suggest that the variable is better at discrimination of groups. If the significance value (Sig) of the independent variable is less than 0.05, it indicates that the variable is significant for the creation of the models. As shown in <xref ref-type="supplementary-material" rid="DS1">Supplementary Tables 1</xref>&#x2013;<xref ref-type="supplementary-material" rid="DS1">5</xref>, Wilk&#x2019;s lambda value of major PCs is smaller, and every Sig value is far less than 0.5. It indicates that every variable is significant for the construction of the models and there is not any evident anomaly (<xref ref-type="bibr" rid="B7">Chiang et al., 2001</xref>).</p>
</sec>
<sec id="S3.SS3">
<title>Supervised analysis</title>
<sec id="S3.SS3.SSS1">
<title>Decision tree model</title>
<p>As depicted in <xref ref-type="table" rid="T4">Table 4</xref>, comparing with the decision tree based on MSC and SNV, the decision tree based on SG and SG 1st Der had a stronger prediction capacity because of its higher accuracy. SG and SG 1st Der raised accuracy and the ability to extend of DT model compared to original data, and MSC raised accuracy of models in part of algorithms, while SNV resulted in a less model accuracy.</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Summary of DT modeling results for bacteria species identification expressed in CRR.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Method</td>
<td valign="top" align="center" colspan="4">CRR of training set (%)<hr/></td>
<td valign="top" align="center" colspan="4">CRR of testing set (%)<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">CHAID</td>
<td valign="top" align="center">Exhaustive CHAID</td>
<td valign="top" align="center">CART</td>
<td valign="top" align="center">QUEST</td>
<td valign="top" align="center">CHAID</td>
<td valign="top" align="center">Exhaustive CHAID</td>
<td valign="top" align="center">CART</td>
<td valign="top" align="center">QUEST</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Original data</td>
<td valign="top" align="center">91.9</td>
<td valign="top" align="center">91.9</td>
<td valign="top" align="center">90.2</td>
<td valign="top" align="center">75.7</td>
<td valign="top" align="center">87.4</td>
<td valign="top" align="center">87.0</td>
<td valign="top" align="center">86.7</td>
<td valign="top" align="center">78.1</td>
</tr>
<tr>
<td valign="top" align="left">SG</td>
<td valign="top" align="center">91.1</td>
<td valign="top" align="center">93.2</td>
<td valign="top" align="center">91.5</td>
<td valign="top" align="center">84.7</td>
<td valign="top" align="center">85.8</td>
<td valign="top" align="center">87.1</td>
<td valign="top" align="center">89.4</td>
<td valign="top" align="center">80.3</td>
</tr>
<tr>
<td valign="top" align="left">SNV</td>
<td valign="top" align="center">88.7</td>
<td valign="top" align="center">87.4</td>
<td valign="top" align="center">84.6</td>
<td valign="top" align="center">83.6</td>
<td valign="top" align="center">82.2</td>
<td valign="top" align="center">86.1</td>
<td valign="top" align="center">84.4</td>
<td valign="top" align="center">78.7</td>
</tr>
<tr>
<td valign="top" align="left">MSC</td>
<td valign="top" align="center">90.7</td>
<td valign="top" align="center">92.5</td>
<td valign="top" align="center">92.0</td>
<td valign="top" align="center">85.7</td>
<td valign="top" align="center">83.7</td>
<td valign="top" align="center">88.1</td>
<td valign="top" align="center">89.8</td>
<td valign="top" align="center">82.6</td>
</tr>
<tr>
<td valign="top" align="left">SG 1st Der</td>
<td valign="top" align="center">92.2</td>
<td valign="top" align="center">91.1</td>
<td valign="top" align="center">100.0</td>
<td valign="top" align="center">92.3</td>
<td valign="top" align="center">93.2</td>
<td valign="top" align="center">91.2</td>
<td valign="top" align="center">98.1 (157/160)</td>
<td valign="top" align="center">92.9</td>
</tr>
</tbody>
</table></table-wrap>
<p>The individual performance for the DT method varied according to the species. While some species such as species <italic>S. flexneri</italic>, <italic>L. monocytogenes</italic>, <italic>V</italic>. <italic>cholerae</italic>, and <italic>C</italic>. <italic>botulinum</italic> yielded good prediction performances, some others such as the class <italic>S. aureus</italic> and the <italic>S. typhimurium</italic> fell short from the overall average. The optimal DT model on CRM data was obtained by using CART algorithm (the spectra were preprocessed with SG 1st Der) showing a CRR for training set of 100.0%. On the contrary, the testing set given a CRR of cross-validation of 98.1% (157/160) was correctly classified, one sample from the class S. <italic>aureus</italic> was wrongly allocated to the group <italic>L. monocytogenes</italic>, and two samples from the class S. <italic>typhimurium</italic> were wrongly allocated to the group <italic>S. flexneri</italic> and <italic>V. cholerae</italic>.</p>
<p>To evaluate the performance of the optimal DT model further, the true positive rate (sensitivity) and true negative rate (specificity) were calculated, and the relationship between sensitivity and specificity was represented by ROC curve graphically for all possible thresholds. The area under the ROC curve (AUC) is widely recognized as the important index of the performance of a model (<xref ref-type="bibr" rid="B18">IBM USA, 2021b</xref>). Chance diagonal is a line from (0,0) to (1,1). The area under the chance diagonal is 0.5. The model is effective when AUC is greater than 0.5. As shown in <xref ref-type="fig" rid="F5">Figure 5</xref>, all the AUC values of six bacteria species in the DT model are greater than 0.95. Generally, the AUC value for ROC curve is 1 maximum that indicates that the model is the best classifier and the values closer to 0 indicate poor performance. Thus, the performance of the optimal DT model is excellent (<xref ref-type="bibr" rid="B33">Obuchowski and Bullen, 2018</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>ROC curve of the DT model using the SG 1st Der.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g005.tif"/>
</fig>
</sec>
<sec id="S3.SS3.SSS2">
<title>Artificial neural network model</title>
<p>As shown in <xref ref-type="table" rid="T5">Table 5</xref>, red-lettered values represent MLP, and blue-lettered values represent RBF. Red-lettered values are higher than the corresponding blue-lettered values; it shows that a model with MLP algorithm has a better performance compared to RBF algorithm, and the ability to find complex relationships is more important in this study. SG and MSC raised accuracy and the ability to extend of MLP model compared to original data. SG 1st Der resulted in the minor reduction in model accuracy, while SNV resulted in a less model accuracy.</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Summary of ANN modeling results for bacteria species identification expressed in CRR.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Method</td>
<td valign="top" align="center" colspan="2">CRR of training set (%)<hr/></td>
<td valign="top" align="center" colspan="2">CRR of testing set (%)<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">MLP</td>
<td valign="top" align="center">RBF</td>
<td valign="top" align="center">MLP</td>
<td valign="top" align="center">RBF</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Original data</td>
<td valign="top" align="center">97.6</td>
<td valign="top" align="center">87.5</td>
<td valign="top" align="center">95.6</td>
<td valign="top" align="center">87.7</td>
</tr>
<tr>
<td valign="top" align="left">SG</td>
<td valign="top" align="center">99.2</td>
<td valign="top" align="center">82.7</td>
<td valign="top" align="center">95.7</td>
<td valign="top" align="center">75.9</td>
</tr>
<tr>
<td valign="top" align="left">SNV</td>
<td valign="top" align="center">91.6</td>
<td valign="top" align="center">91.8</td>
<td valign="top" align="center">92.1</td>
<td valign="top" align="center">85.0</td>
</tr>
<tr>
<td valign="top" align="left">MSC</td>
<td valign="top" align="center">100.0</td>
<td valign="top" align="center">82.4</td>
<td valign="top" align="center">99.4 (159/160)</td>
<td valign="top" align="center">76.0</td>
</tr>
<tr>
<td valign="top" align="left">SG 1st Der</td>
<td valign="top" align="center">97.2</td>
<td valign="top" align="center">88.3</td>
<td valign="top" align="center">94.6</td>
<td valign="top" align="center">82.2</td>
</tr>
</tbody>
</table></table-wrap>
<p>The individual performance in the ANN model varied according to the species, and some species such as the class <italic>S. flexneri</italic> fell short from the overall average. The optimal ANN model was obtained by using MLP algorithm (the spectra were preprocessed with MSC) showing a CRR for training set of 100.0%. Meanwhile, the testing set provided a CRR of cross-validation of 99.4% (159/160) correctly classified, and one sample from the class <italic>S. flexneri</italic> was wrongly allocated to the group <italic>C. botulinum</italic>. The reason for this can again be attributed to the similarities between the species. However, our collective results do highlight that the DT and ANN models are efficient enough to predict the species identity at the genus level. This is significant since accurate identification of the genus of the foodborne pathogenic bacteria may be sufficient in several applications. For example, in many regulatory practices of food safety, the goal is often to screen a larger sample volume than have detailed characterizations of a very small sample set. In such applications, where fast, efficient, and inter-mediate screening is necessary, perhaps an initial genus-level accuracy is a welcome relief, in which the DT and ANN models are quite capable of providing (<xref ref-type="bibr" rid="B4">Bisgin et al., 2018</xref>).</p>
<p><xref ref-type="fig" rid="F6">Figure 6A</xref> is the ROC curve of the ANN model by using the MSC, and all the AUC values of six bacteria species in the ANN model are greater than 0.98. Thus, the performance of the optimal ANN model is superior. Moreover, there are also other powerful methods to assess the performance of the model such as cumulative gains and lift charts. <xref ref-type="fig" rid="F6">Figure 6B</xref> shows the cumulative gain evaluation curve of the optimal ANN model in this study. A lift chart is shown in <xref ref-type="fig" rid="F6">Figure 6C</xref>. The baseline represents the results of random guessing. The lift chart originates directly from the cumulative gain evaluation curve. The X-axis of the lift chart is identical with the cumulative gain evaluation curve, while Y-axis is equal to the ratio of the cumulative gains of independent curve and the baseline. In other word, it presents how many times the model is better than the random choice of samples (<xref ref-type="bibr" rid="B19">IBM USA, 2021a</xref>; <xref ref-type="bibr" rid="B31">Nargis et al., 2021</xref>). In <xref ref-type="fig" rid="F6">Figure 6B</xref>, every curve rapidly reaches a very high cumulative gain value (100%) and then keeps in level (<xref ref-type="bibr" rid="B41">Tibco USA, 2021</xref>). In <xref ref-type="fig" rid="F6">Figure 6C</xref>, each curve is smooth and drops rapidly to 1. The top 10% would contain approximately 70% of the target samples in the ANN model, but it is only 10% without using the model (see <xref ref-type="fig" rid="F6">Figure 6B</xref>). The target samples increase six times with the model (see <xref ref-type="fig" rid="F6">Figure 6C</xref>). As mentioned above, the performance of the optimal ANN model is excellent.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>ROC curve <bold>(A)</bold>, cumulative gains <bold>(B)</bold>, and lift charts <bold>(C)</bold> of the optimal ANN model in this study.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g006.tif"/>
</fig>
</sec>
<sec id="S3.SS3.SSS3">
<title>Fisher&#x2019;s discriminant analysis model</title>
<p>The LOOCV technique was used as the cross-validation method to develop FDA model. As shown in the <xref ref-type="table" rid="T6">Table 6</xref>, the total model accuracy of training set was 93.2% (Original data), 94.0% (SG), 89.3% (SNV), 93.8% (MSC), and 100.0% (SG 1st Der), respectively. The testing set was 92.8% (Original data), 92.8% (SG), 88.9% (SNV), 92.8% (MSC), and 100.0% (SG 1st Der), respectively. The accuracy of training set is similar with the corresponding testing set. It shows that FDA model is fairly robust against overfitting problem. All pretreatment methods contributed in the increase of model accuracy except SNV. The optimal FDA model was obtained by using SG 1st Der algorithm; it resulted in a CRR of training set of 100.0% and a CRR of cross-validation of 100.0%. The individual performance in the FDA model did not vary according to the species, and the different individual species yielded good prediction performances in the FDA model.</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Summary of FDA modeling results for bacteria species identification expressed in CRR.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Method</td>
<td valign="top" align="center" colspan="2">CRR (%)<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">Training set</td>
<td valign="top" align="center">Testing set</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Original data</td>
<td valign="top" align="center">93.2</td>
<td valign="top" align="center">92.8</td>
</tr>
<tr>
<td valign="top" align="left">SG</td>
<td valign="top" align="center">94.0</td>
<td valign="top" align="center">92.8</td>
</tr>
<tr>
<td valign="top" align="left">SNV</td>
<td valign="top" align="center">89.3</td>
<td valign="top" align="center">88.9</td>
</tr>
<tr>
<td valign="top" align="left">MSC</td>
<td valign="top" align="center">93.8</td>
<td valign="top" align="center">92.8</td>
</tr>
<tr>
<td valign="top" align="left">SG 1st Der</td>
<td valign="top" align="center">100.0</td>
<td valign="top" align="center">100.0</td>
</tr>
</tbody>
</table></table-wrap>
<p><xref ref-type="table" rid="T7">Table 7</xref> presents the summary of Fisher&#x2019;s discriminant functions in the optimal FDA model. In FDA model, Wilk&#x2019;s lambda values are used to test whether each discriminant function is statistically significant. The range of values is from 0 to 1.0 represents total discrimination, and 1 represents no discrimination, since Wilk&#x2019;s lambda values of top five functions are greater than 1, and the corresponding Sig is less than 0.05. These five functions are significant in the model. However, Wilk&#x2019;s lambda value of Function 6 is 1, the corresponding Sig is 0.672 being far greater than 0.05, and variance contribution is 0. Thus, the Function 6 should be discarded. Variance contribution of top three functions is 73.6, 16.7, and 8.5%, respectively. All of these contribute 98.8% of all the variations. All-Groups 3D scatter plot was shown using F<sub>1</sub>, F<sub>2</sub>, and F<sub>3</sub>. As depicted in <xref ref-type="fig" rid="F7">Figure 7A</xref>, six kinds of bacteria species are clearly distinguished between each other with a CRR of 100.0%. <xref ref-type="fig" rid="F7">Figure 7B</xref> represents that all the AUC values of six bacteria species in the DT model are greater than 0.95. In conclusion, our findings demonstrated that CRM and FDA model could be used for the identification of various bacteria species efficiently.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Summary of Fisher&#x2019;s discriminant functions.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Function</td>
<td valign="top" align="center">Variance contribution%</td>
<td valign="top" align="center">Cumulative contribution%</td>
<td valign="top" align="center">Correlation</td>
<td valign="top" align="center">Function test</td>
<td valign="top" align="center">Wilks&#x2019; Lambda</td>
<td valign="top" align="center">Sig</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">73.6</td>
<td valign="top" align="center">73.6</td>
<td valign="top" align="center">0.966</td>
<td valign="top" align="center">1&#x2013;6</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center">0.000</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">16.7</td>
<td valign="top" align="center">90.3</td>
<td valign="top" align="center">0.872</td>
<td valign="top" align="center">2&#x2013;6</td>
<td valign="top" align="center">0.075</td>
<td valign="top" align="center">0.000</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">8.5</td>
<td valign="top" align="center">98.8</td>
<td valign="top" align="center">0.785</td>
<td valign="top" align="center">3&#x2013;6</td>
<td valign="top" align="center">0.313</td>
<td valign="top" align="center">0.000</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">0.9</td>
<td valign="top" align="center">99.7</td>
<td valign="top" align="center">0.380</td>
<td valign="top" align="center">4&#x2013;6</td>
<td valign="top" align="center">0.815</td>
<td valign="top" align="center">0.000</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">0.3</td>
<td valign="top" align="center">100.0</td>
<td valign="top" align="center">0.218</td>
<td valign="top" align="center">5&#x2013;6</td>
<td valign="top" align="center">0.952</td>
<td valign="top" align="center">0.000</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">0.0</td>
<td valign="top" align="center">100.0</td>
<td valign="top" align="center">0.019</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">0.672</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>First five canonical discriminant functions were used in the analysis. F<sub>1</sub> = 1.126<italic>PC1</italic> + 0.708<italic>PC2</italic> + 0.067<italic>PC3</italic>-0.092<italic>PC4</italic> + 0.050<italic>PC5</italic>-0.125<italic>PC6</italic>. F<sub>2</sub> = 0.135<italic>PC1</italic>-0.797<italic>PC2</italic> + 0.293<italic>PC3</italic> + 0.987<italic>PC4</italic>-0.240<italic>PC5</italic>-0.188<italic>PC6</italic>. F<sub>3</sub> = -0.093<italic>PC1</italic> + 0.605<italic>PC2</italic> + 0.789<italic>PC3</italic> + 0.520<italic>PC4</italic> + 0.256<italic>PC5</italic>-0.195<italic>PC6</italic>. F<sub>4</sub> = 0<italic>PC1</italic> -0.236<italic>PC2</italic> + 0.600<italic>PC3</italic>-0.343<italic>PC4</italic> + 0.283<italic>PC5</italic>-0.240<italic>PC6</italic>. F<sub>5</sub> = -0.022<italic>PC1</italic>-0.236<italic>PC2</italic> + 0.600<italic>PC3</italic>-0.343<italic>PC4</italic> + 0.283<italic>PC5</italic>-0.240<italic>PC6</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p><bold>(A)</bold> All-Groups 3D scatter plot and <bold>(B)</bold> the ROC curve of the FDA model using the SG 1st Der.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-13-874658-g007.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="S4" sec-type="conclusion">
<title>Conclusion</title>
<p>The present study evaluates and reveals the potential of the CRM and chemometrics methods for the detection and classification of foodborne pathogenic bacteria. Our findings showed that different bacteria species can be distinguished based on characteristic peaks and peak intensity ratio, but it is time-consuming and not applicable for a large amount of samples. Then, we explored the impact of different pretreatment methods on the models. Most pretreatment methods raised the performance of the models except SNV. Performances of some classical classifiers were compared for bacterial detection and identification. CRR, ROC, cumulative gains, and lift charts were used to evaluate the performance of models. In these studies, preliminary classification can be achieved by PCA model. In the DT and ANN model, there is a difference between the CRR of training set and corresponding testing set. In the FDA model, the CRR of training set is more similar with the corresponding testing set. Therefore, the FDA model is more robust for overfitting problem and offers the highest CRR. In conclusion, CRM and chemometrics offer a powerful tool for the discrimination of foodborne pathogenic bacteria; thus, an application of the technology in the food sector would lead to a huge benefit.</p>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The original contributions presented in this study are included in the article/<xref ref-type="supplementary-material" rid="DS1">Supplementary material</xref>, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="S6">
<title>Author contributions</title>
<p>JZ: methodology, data curation, software, investigation, and writing&#x2014;original draft. PG: conceptualization and supervision. YW: funding acquisition, supervision, and resources. XY, WL, and MY: supervision and resources. CY: investigation, resources, and supervision. XX: resources, visualization, supervision, funding acquisition, investigation, and editing. HJ: reviewing, validation, resources, and supervision. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<ack><p>This work was supported by the National Key Research and Development Program of China (No. 2021YFC2301000), the National Key Research and Development Program of China (No. 2018YFC1603800), and the High-grade, Precision and Advanced Discipline Construction Project of National Security Studies of Beijing, the People&#x2019;s Public Security University of China (No. 2020GDLW039).</p>
</ack>
<sec id="S7" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S8" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S9" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2022.874658/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2022.874658/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.doc" id="DS1" mimetype="application/msword" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andrei</surname> <given-names>C. C.</given-names></name> <name><surname>Moraillon</surname> <given-names>A.</given-names></name> <name><surname>Lau</surname> <given-names>S.</given-names></name> <name><surname>F&#x00E9;lidj</surname> <given-names>N.</given-names></name> <name><surname>Yamakawa</surname> <given-names>N.</given-names></name> <name><surname>Bouckaert</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Rapid and sensitive identification of uropathogenic <italic>Escherichia coli</italic> using a surface-enhanced-Raman-scattering-based biochip.</article-title> <source><italic>Talanta</italic></source> <volume>219</volume>:<issue>121174</issue>. <pub-id pub-id-type="doi">10.1016/j.talanta.2020.121174</pub-id> <pub-id pub-id-type="pmid">32887096</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anza</surname> <given-names>I.</given-names></name> <name><surname>Skarin</surname> <given-names>H.</given-names></name> <name><surname>Vidal</surname> <given-names>D.</given-names></name> <name><surname>Lindberg</surname> <given-names>A.</given-names></name> <name><surname>Baverud</surname> <given-names>V.</given-names></name> <name><surname>Mateo</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>The same clade of <italic>Clostridium botulinum</italic> strains is causing avian botulism in southern and northern Europe.</article-title> <source><italic>Anaerobe</italic></source> <volume>26</volume> <fpage>20</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1016/j.anaerobe.2014.01.002</pub-id> <pub-id pub-id-type="pmid">24418766</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Artin</surname> <given-names>I.</given-names></name> <name><surname>Mason</surname> <given-names>D. R.</given-names></name> <name><surname>Pin</surname> <given-names>C.</given-names></name> <name><surname>Schelin</surname> <given-names>J.</given-names></name> <name><surname>Peck</surname> <given-names>M. W.</given-names></name> <name><surname>Holst</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Carter, effects of carbon dioxide on growth of proteolytic <italic>Clostridium botulinum</italic>, its ability to produce neurotoxin, and its transcriptome.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>76</volume> <fpage>1168</fpage>&#x2013;<lpage>1172</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.02247-09</pub-id> <pub-id pub-id-type="pmid">20038699</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bisgin</surname> <given-names>H.</given-names></name> <name><surname>Bera</surname> <given-names>T.</given-names></name> <name><surname>Ding</surname> <given-names>H.</given-names></name> <name><surname>Semey</surname> <given-names>H. G.</given-names></name> <name><surname>Wu</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Comparing SVM and ANN based machine learning methods for species identification of food contaminating beetles.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>8</volume>:<issue>6532</issue>. <pub-id pub-id-type="doi">10.1038/s41598-018-24926-7</pub-id> <pub-id pub-id-type="pmid">29695741</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Braconnier</surname> <given-names>A.</given-names></name> <name><surname>Broussolle</surname> <given-names>V.</given-names></name> <name><surname>Perelle</surname> <given-names>S.</given-names></name> <name><surname>Fach</surname> <given-names>P.</given-names></name> <name><surname>Nguyen-The</surname> <given-names>C.</given-names></name> <name><surname>Carlin</surname> <given-names>F.</given-names></name></person-group> (<year>2001</year>). <article-title>Screening for 250 <italic>Clostridium botulinum</italic> type A, B, and E in cooked chilled foods containing vegetables and raw material using 251 polymerase chain reaction and molecular probes.</article-title> <source><italic>J. Food Protect.</italic></source> <volume>64</volume> <fpage>201</fpage>&#x2013;<lpage>207</lpage>. <pub-id pub-id-type="doi">10.4315/0362-028x-64.2.201</pub-id> <pub-id pub-id-type="pmid">11271768</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>L.</given-names></name> <name><surname>Alali</surname> <given-names>W.</given-names></name></person-group> (<year>2018</year>). <article-title>Editorial: Recent discoveries in human serious foodborne pathogenic bacteria: Resurgence, pathogenesis, and control strategies.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>9</volume>:<issue>2412</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.02412</pub-id> <pub-id pub-id-type="pmid">30356687</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiang</surname> <given-names>L. H.</given-names></name> <name><surname>Russell</surname> <given-names>E. L.</given-names></name> <name><surname>Braatz</surname> <given-names>R. D.</given-names></name></person-group> (<year>2001</year>). &#x201C;<article-title>Tennessee eastman process</article-title>,&#x201D; in <source><italic>Fault detection and diagnosis in industrial systems. Advanced textbooks in control and signal processing</italic></source> (<publisher-loc>Bristol</publisher-loc>: <publisher-name>IOP Publishing Ltd</publisher-name>).</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chisanga</surname> <given-names>M.</given-names></name> <name><surname>Linton</surname> <given-names>D.</given-names></name> <name><surname>Muhamadali</surname> <given-names>H.</given-names></name> <name><surname>Ellis</surname> <given-names>D. I.</given-names></name> <name><surname>Kimber</surname> <given-names>R. L.</given-names></name> <name><surname>Mironov</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Rapid differentiation of <italic>Campylobacter jejuni</italic> cell wall mutants using Raman spectroscopy, SERS and mass spectrometry combined with chemometrics.</article-title> <source><italic>Analyst</italic></source> <volume>145</volume> <fpage>1236</fpage>&#x2013;<lpage>1249</lpage>. <pub-id pub-id-type="doi">10.1039/c9an02026h</pub-id> <pub-id pub-id-type="pmid">31776524</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dahlsten</surname> <given-names>E.</given-names></name> <name><surname>Korkeala</surname> <given-names>H.</given-names></name> <name><surname>Somervuo</surname> <given-names>P.</given-names></name> <name><surname>Lindstr&#x00F6;m</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>PCR assay for differentiating between Group I (proteolytic) and Group II (nonproteolytic) strains of <italic>Clostridium botulinum</italic>.</article-title> <source><italic>Int. J. Food Microbiol.</italic></source> <volume>124</volume> <fpage>108</fpage>&#x2013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijfoodmicro.2008.02.018</pub-id> <pub-id pub-id-type="pmid">18374440</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Medici</surname> <given-names>D.</given-names></name> <name><surname>Anniballi</surname> <given-names>F.</given-names></name> <name><surname>Wyatt</surname> <given-names>G. M.</given-names></name> <name><surname>Lindstr&#x00F6;m</surname> <given-names>M.</given-names></name> <name><surname>Messelh&#x00E4;usser</surname> <given-names>U.</given-names></name> <name><surname>Aldus</surname> <given-names>C. F.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Multiplex PCR for detection of botulinum neurotoxin-producing clostridia in clinical, food, and environmental samples.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>75</volume> <fpage>6457</fpage>&#x2013;<lpage>6461</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.00805-09</pub-id> <pub-id pub-id-type="pmid">19684163</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Hu</surname> <given-names>Z.</given-names></name> <name><surname>Mustapha</surname> <given-names>A.</given-names></name> <name><surname>Lin</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Rapid detection of food- and waterborne bacteria using surface-enhanced Raman spectroscopy coupled with silver nanosubstrates.</article-title> <source><italic>Appl. Microbiol. Biotechnol.</italic></source> <volume>92</volume> <fpage>1053</fpage>&#x2013;<lpage>1061</lpage>. <pub-id pub-id-type="doi">10.1007/s00253-011-3634-3</pub-id> <pub-id pub-id-type="pmid">22005743</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferreira</surname> <given-names>J. L.</given-names></name> <name><surname>Eliasberg</surname> <given-names>S. J.</given-names></name> <name><surname>Harrison</surname> <given-names>M. A.</given-names></name> <name><surname>Edmonds</surname> <given-names>P.</given-names></name></person-group> (<year>2001</year>). <article-title>Detection of preformed type A botulinal toxin in hash brown potatoes by using the mouse bioassay and a modified ELISA test.</article-title> <source><italic>J. AOAC Int.</italic></source> <volume>84</volume> <fpage>1460</fpage>&#x2013;<lpage>1464</lpage>. <pub-id pub-id-type="doi">10.1093/jaoac/84.5.1460</pub-id> <pub-id pub-id-type="pmid">11601465</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franco</surname> <given-names>D.</given-names></name> <name><surname>Trusso</surname> <given-names>S.</given-names></name> <name><surname>Fazio</surname> <given-names>E.</given-names></name> <name><surname>Allegra</surname> <given-names>A.</given-names></name> <name><surname>Musolino</surname> <given-names>C.</given-names></name> <name><surname>Speciale</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Raman spectroscopy differentiates between sensitive and resistant multiple myeloma cell lines.</article-title> <source><italic>Spectrochim. Acta A Mol. Biomol. Spectrosc.</italic></source> <volume>187</volume> <fpage>15</fpage>&#x2013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1016/j.saa.2017.06.020</pub-id> <pub-id pub-id-type="pmid">28645097</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fung</surname> <given-names>F.</given-names></name> <name><surname>Wang</surname> <given-names>H. S.</given-names></name> <name><surname>Menon</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Food safety in the 21st century.</article-title> <source><italic>Biomed. J.</italic></source> <volume>41</volume> <fpage>88</fpage>&#x2013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1016/j.bj.2018.03.003</pub-id> <pub-id pub-id-type="pmid">29866604</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giaouris</surname> <given-names>E.</given-names></name> <name><surname>Heir</surname> <given-names>E.</given-names></name> <name><surname>Desvaux</surname> <given-names>M.</given-names></name> <name><surname>H&#x00E9;braud</surname> <given-names>M.</given-names></name> <name><surname>M&#x00F8;retr&#x00F8;</surname> <given-names>T.</given-names></name> <name><surname>Langsrud</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Intra- and inter-species interactions within biofilms of important foodborne bacterial pathogens.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>6</volume>:<issue>841</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2015.00841</pub-id> <pub-id pub-id-type="pmid">26347727</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ho</surname> <given-names>C. S.</given-names></name> <name><surname>Jean</surname> <given-names>N.</given-names></name> <name><surname>Hogan</surname> <given-names>C. A.</given-names></name> <name><surname>Blackmon</surname> <given-names>L.</given-names></name> <name><surname>Jeffrey</surname> <given-names>S. S.</given-names></name> <name><surname>Holodniy</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>10</volume>:<issue>4927</issue>. <pub-id pub-id-type="doi">10.1038/s41467-019-12898-9</pub-id> <pub-id pub-id-type="pmid">31666527</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hochel</surname> <given-names>I.</given-names></name> <name><surname>Viochna</surname> <given-names>D.</given-names></name> <name><surname>Skvor</surname> <given-names>J.</given-names></name> <name><surname>Musil</surname> <given-names>M.</given-names></name></person-group> (<year>2004</year>). <article-title>Development of an indirect competitive ELISA for detection of <italic>Campylobacter jejuni</italic> subsp.jejuni O:23 in foods.</article-title> <source><italic>Folia Microbiol. (Praha)</italic></source> <volume>49</volume> <fpage>579</fpage>&#x2013;<lpage>586</lpage>. <pub-id pub-id-type="doi">10.1007/BF02931537</pub-id> <pub-id pub-id-type="pmid">15702549</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><collab>IBM USA</collab> (<year>2021b</year>). <source><italic>Tests of equality of group means.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.ibm.com/support/knowledgecenter/en/SSLVMB_23.0.0/spss/tutorials/discrim_bankloan_groupmean.html">https://www.ibm.com/support/knowledgecenter/en/SSLVMB_23.0.0/spss/tutorials/discrim_bankloan_groupmean.html</ext-link> <comment>(accessed February 22, 2021)</comment>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><collab>IBM USA</collab> (<year>2021a</year>). <source><italic>Cumulative gains and lift charts.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.ibm.com/support/knowledgecenter/en/SSLVMB_23.0.0/spss/tutorials/mlp_bankloan_outputtype_02.html">https://www.ibm.com/support/knowledgecenter/en/SSLVMB_23.0.0/spss/tutorials/mlp_bankloan_outputtype_02.html</ext-link> <comment>(accessed February 22, 2021)</comment>.</citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jarvis</surname> <given-names>R. M.</given-names></name> <name><surname>Brooker</surname> <given-names>A.</given-names></name> <name><surname>Goodacre</surname> <given-names>R.</given-names></name></person-group> (<year>2004</year>). <article-title>Surface-enhanced Raman spectroscopy for bacterial discrimination utilizing a scanning electron microscope with a Raman spectroscopy interface.</article-title> <source><italic>Anal. Chem.</italic></source> <volume>76</volume> <fpage>5198</fpage>&#x2013;<lpage>5202</lpage>. <pub-id pub-id-type="doi">10.1021/ac049663f</pub-id> <pub-id pub-id-type="pmid">15373461</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kant</surname> <given-names>K.</given-names></name> <name><surname>Shahbazi</surname> <given-names>M.-A.</given-names></name> <name><surname>Dave</surname> <given-names>V. P.</given-names></name> <name><surname>Ngo</surname> <given-names>T. A.</given-names></name> <name><surname>Chidambara</surname> <given-names>V. A.</given-names></name> <name><surname>Than</surname> <given-names>L. Q.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Microfluidic devices for sample preparation and rapid detection of foodborne pathogens.</article-title> <source><italic>Biotechnol. Adv.</italic></source> <volume>36</volume> <fpage>1003</fpage>&#x2013;<lpage>1024</lpage>. <pub-id pub-id-type="doi">10.1016/j.biotechadv.2018.03.002</pub-id> <pub-id pub-id-type="pmid">29534915</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keto-Timonen</surname> <given-names>R.</given-names></name> <name><surname>Nevas</surname> <given-names>M.</given-names></name> <name><surname>Korkeala</surname> <given-names>H.</given-names></name></person-group> (<year>2005</year>). <article-title>Efficient DNA fingerprinting of <italic>Clostridium botulinum</italic> types A, B, E, and F by amplified fragment length polymorphism analysis.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>71</volume> <fpage>1148</fpage>&#x2013;<lpage>1154</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.71.3.1148-1154.2005</pub-id> <pub-id pub-id-type="pmid">15746312</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kriem</surname> <given-names>L. S.</given-names></name> <name><surname>Wright</surname> <given-names>K.</given-names></name> <name><surname>Ccahuana-Vasquez</surname> <given-names>R. A.</given-names></name> <name><surname>Rupp</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Confocal Raman microscopy to identify bacteria in oral subgingival biofilm models.</article-title> <source><italic>PLoS One</italic></source> <volume>15</volume>:<issue>e0232912</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0232912</pub-id> <pub-id pub-id-type="pmid">32392236</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lemma</surname> <given-names>T.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Arstila</surname> <given-names>K.</given-names></name> <name><surname>Hyt&#x00F6;nen</surname> <given-names>V. P.</given-names></name> <name><surname>Toppari</surname> <given-names>J. J.</given-names></name></person-group> (<year>2019</year>). <article-title>Identifying yeasts using surface enhanced Raman spectroscopy.</article-title> <source><italic>Spectrochim. Acta Part A Mol. Biomol. Spectrosc.</italic></source> <volume>218</volume> <fpage>299</fpage>&#x2013;<lpage>307</lpage>. <pub-id pub-id-type="doi">10.1016/j.saa.2019.04.010</pub-id> <pub-id pub-id-type="pmid">31005737</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>W.</given-names></name> <name><surname>Zheng</surname> <given-names>Y.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Rapid screening of colistin-resistant <italic>Escherichia coli</italic>, <italic>Acinetobacter baumannii</italic> and <italic>Pseudomonas aeruginosa</italic> by the use of Raman spectroscopy and hierarchical cluster analysis.</article-title> <source><italic>Analyst</italic></source> <volume>144</volume> <fpage>2803</fpage>&#x2013;<lpage>2810</lpage>. <pub-id pub-id-type="doi">10.1039/c8an02220h</pub-id> <pub-id pub-id-type="pmid">30882113</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindstr&#x00F6;m</surname> <given-names>M.</given-names></name> <name><surname>Keto</surname> <given-names>R.</given-names></name> <name><surname>Markkula</surname> <given-names>A.</given-names></name> <name><surname>Nevas</surname> <given-names>M.</given-names></name> <name><surname>Hielm</surname> <given-names>S.</given-names></name> <name><surname>Korkeala</surname> <given-names>H.</given-names></name></person-group> (<year>2001</year>). <article-title>Multiplex PCR assay for detection and identification of <italic>Clostridium botulinum</italic> types A, B, E, and F in food and fecal material.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>67</volume> <fpage>5694</fpage>&#x2013;<lpage>5699</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.67.12.5694-5699.2001</pub-id> <pub-id pub-id-type="pmid">11722924</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Zhao</surname> <given-names>J.</given-names></name> <name><surname>Lu</surname> <given-names>X. Z.</given-names></name> <name><surname>Li</surname> <given-names>G.</given-names></name> <name><surname>Wu</surname> <given-names>T.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name></person-group> (<year>2018</year>). <article-title>Blood hyperviscosity identification with reflective spectroscopy of tongue tip based on principal component analysis combining artificial neural network.</article-title> <source><italic>Biomed. Eng. Online</italic></source> <volume>17</volume>:<issue>60</issue>. <pub-id pub-id-type="doi">10.1186/s12938-018-0495-3</pub-id> <pub-id pub-id-type="pmid">29747693</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Macdonald</surname> <given-names>T. E.</given-names></name> <name><surname>Helma</surname> <given-names>C. H.</given-names></name> <name><surname>Shou</surname> <given-names>Y.</given-names></name> <name><surname>Valdez</surname> <given-names>Y. E.</given-names></name> <name><surname>Ticknor</surname> <given-names>L. O.</given-names></name> <name><surname>Foley</surname> <given-names>B. T.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Analysis of <italic>Clostridium botulinum</italic> serotype E strains by using multilocus sequence typing, amplified fragment length polymorphism, variable-number tandem-repeat analysis, and botulinum neurotoxin gene sequencing.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>77</volume> <fpage>8625</fpage>&#x2013;<lpage>8634</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.05155-11</pub-id> <pub-id pub-id-type="pmid">22003031</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maquelin</surname> <given-names>K.</given-names></name> <name><surname>Choo-Smith</surname> <given-names>L. P.</given-names></name> <name><surname>Endtz</surname> <given-names>H. P.</given-names></name> <name><surname>Bruining</surname> <given-names>H. A.</given-names></name> <name><surname>Puppels</surname> <given-names>G. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Rapid identification of <italic>Candida</italic> species by confocal Raman microspectroscopy.</article-title> <source><italic>J. Clin. Microbiol.</italic></source> <volume>40</volume> <fpage>594</fpage>&#x2013;<lpage>600</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.40.2.594-600.2002</pub-id> <pub-id pub-id-type="pmid">11825976</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mi</surname> <given-names>F.</given-names></name> <name><surname>Guan</surname> <given-names>M.</given-names></name> <name><surname>Hu</surname> <given-names>C.</given-names></name> <name><surname>Peng</surname> <given-names>F.</given-names></name> <name><surname>Sun</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>Application of lectin-based biosensor technology in the detection of foodborne pathogenic bacteria: A review.</article-title> <source><italic>Analyst</italic></source> <volume>146</volume> <fpage>429</fpage>&#x2013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1039/d0an01459a</pub-id> <pub-id pub-id-type="pmid">33231246</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nargis</surname> <given-names>H. F.</given-names></name> <name><surname>Nawaz</surname> <given-names>H.</given-names></name> <name><surname>Bhatti</surname> <given-names>H. N.</given-names></name> <name><surname>Jilani</surname> <given-names>K.</given-names></name> <name><surname>Saleem</surname> <given-names>M.</given-names></name></person-group> (<year>2021</year>). <article-title>Comparison of surface enhanced Raman spectroscopy and Raman spectroscopy for the detection of breast cancer based on serum samples.</article-title> <source><italic>Spectrochim. Acta Part A Mol. Biomol. Spectrosc.</italic></source> <volume>246</volume>:<issue>119034</issue>. <pub-id pub-id-type="doi">10.1016/j.saa.2020.119034</pub-id> <pub-id pub-id-type="pmid">33049470</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>V.</given-names></name> <name><surname>Lin</surname> <given-names>W. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Comparison of assembled <italic>Clostridium botulinum</italic> A1 genomes revealed their evolutionary relationship.</article-title> <source><italic>Genomics</italic></source> <volume>103</volume> <fpage>94</fpage>&#x2013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2013.12.003</pub-id> <pub-id pub-id-type="pmid">24369123</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Obuchowski</surname> <given-names>N. A.</given-names></name> <name><surname>Bullen</surname> <given-names>J. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Receiver operating characteristic (ROC) curves: Review of methods with applications in diagnostic medicine.</article-title> <source><italic>Phys. Med. Biol.</italic></source> <volume>63</volume>:<issue>07TR01</issue>. <pub-id pub-id-type="doi">10.1088/1361-6560/aab4b1</pub-id> <pub-id pub-id-type="pmid">29512515</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olsen</surname> <given-names>J. S.</given-names></name> <name><surname>Scholz</surname> <given-names>H.</given-names></name> <name><surname>Fillo</surname> <given-names>S.</given-names></name> <name><surname>Ramisse</surname> <given-names>V.</given-names></name> <name><surname>Lista</surname> <given-names>F.</given-names></name> <name><surname>Tromborg</surname> <given-names>A. K.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Analysis of the genetic distribution among members of <italic>Clostridium botulinum</italic> group I using a novel multilocus sequence typing (MLST) assay.</article-title> <source><italic>J. Microbiol. Methods</italic></source> <volume>96</volume> <fpage>84</fpage>&#x2013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1016/j.mimet.2013.11.003</pub-id> <pub-id pub-id-type="pmid">24246230</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>Y. S.</given-names></name> <name><surname>Lek</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Artificial neural networks: Multilayer perceptron for ecological modeling</article-title>,&#x201D; in <source><italic>Developments in environmental modelling</italic></source>, <volume>Vol. 28</volume> <role>ed.</role> <person-group person-group-type="editor"><name><surname>Jorgensen</surname> <given-names>S. E.</given-names></name></person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Elsevier</publisher-name>), <fpage>123</fpage>&#x2013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-444-63623-2.00007-4</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raphael</surname> <given-names>B. H.</given-names></name> <name><surname>Joseph</surname> <given-names>L. A.</given-names></name> <name><surname>McCroskey</surname> <given-names>L. M.</given-names></name> <name><surname>L&#x00FA;quez</surname> <given-names>C.</given-names></name> <name><surname>Maslanka</surname> <given-names>S. E.</given-names></name></person-group> (<year>2010</year>). <article-title>Detection and differentiation of <italic>Clostridium botulinum</italic> type A strains using a focused DNA microarray.</article-title> <source><italic>Mol. Cell. Probes</italic></source> <volume>24</volume> <fpage>146</fpage>&#x2013;<lpage>153</lpage>. <pub-id pub-id-type="doi">10.1016/j.mcp.2009.12.003</pub-id> <pub-id pub-id-type="pmid">20056143</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serrano</surname> <given-names>P.</given-names></name> <name><surname>Hermelink</surname> <given-names>A.</given-names></name> <name><surname>Lasch</surname> <given-names>P.</given-names></name> <name><surname>de Vera</surname> <given-names>J. P.</given-names></name> <name><surname>K&#x00F6;nig</surname> <given-names>N.</given-names></name> <name><surname>Burckhardt</surname> <given-names>O.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Confocal raman microspectroscopy reveals a convergence of the chemical composition in methanogenic archaea from a siberian permafrost-affected soil</article-title>. <source><italic>FEMS Microbiol. Ecol.</italic></source> <volume>91</volume>:<issue>fiv126</issue>. <pub-id pub-id-type="doi">10.1093/femsec/fiv126</pub-id> <pub-id pub-id-type="pmid">26499486</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skarin</surname> <given-names>H.</given-names></name> <name><surname>Lindberg</surname> <given-names>A.</given-names></name> <name><surname>Blomqvist</surname> <given-names>G.</given-names></name> <name><surname>Asp&#x00E1;n</surname> <given-names>A.</given-names></name> <name><surname>B&#x00E5;verud</surname> <given-names>V.</given-names></name></person-group> (<year>2010</year>). <article-title>Molecular characterization and comparison of <italic>Clostridium botulinum</italic> type C avian strains.</article-title> <source><italic>Avian Pathology</italic></source> <volume>39</volume> <fpage>511</fpage>&#x2013;<lpage>518</lpage>. <pub-id pub-id-type="doi">10.1080/03079457.2010.526923</pub-id> <pub-id pub-id-type="pmid">21154062</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szabo</surname> <given-names>E. A.</given-names></name> <name><surname>Pemberton</surname> <given-names>J. M.</given-names></name> <name><surname>Gibson</surname> <given-names>A. M.</given-names></name> <name><surname>Eyles</surname> <given-names>M. J.</given-names></name> <name><surname>Desmarchellier</surname> <given-names>P. M.</given-names></name></person-group> (<year>1994</year>). <article-title>Polymerase chain reaction for the detection of <italic>Clostridium botulinum</italic> types A, Band E in food, soil and infant faeces.</article-title> <source><italic>J. Appl. Bacteriol.</italic></source> <volume>76</volume> <fpage>539</fpage>&#x2013;<lpage>545</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2672.1994.tb01650.x</pub-id> <pub-id pub-id-type="pmid">8027003</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taban</surname> <given-names>B. M.</given-names></name> <name><surname>Ben</surname> <given-names>U.</given-names></name> <name><surname>Aytac</surname> <given-names>S. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Rapid detection of <italic>Salmonella</italic> in milk by combined immunomagnetic separation-polymerase chain reaction assay.</article-title> <source><italic>J. Dairy Sci.</italic></source> <volume>92</volume> <fpage>2382</fpage>&#x2013;<lpage>2388</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2008-1537</pub-id> <pub-id pub-id-type="pmid">19447970</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><collab>Tibco USA</collab>, (<year>2021</year>). <source><italic>Gains vs ROC curves. Do you understand the difference?</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://community.tibco.com/wiki/gains-vs-roc-curves-do-you-understand-difference">https://community.tibco.com/wiki/gains-vs-roc-curves-do-you-understand-difference</ext-link> <comment>(accessed February 22, 2021)</comment>.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tie</surname> <given-names>Y.</given-names></name> <name><surname>Duchateau</surname> <given-names>C.</given-names></name> <name><surname>Van de Steene</surname> <given-names>S.</given-names></name> <name><surname>Mees</surname> <given-names>C.</given-names></name> <name><surname>De Braekeleer</surname> <given-names>K.</given-names></name> <name><surname>De Beer</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Spectroscopic techniques combined with chemometrics for fast on-site characterization of suspected illegal antimicrobials.</article-title> <source><italic>Talanta</italic></source> <volume>217</volume>:<issue>121026</issue>. <pub-id pub-id-type="doi">10.1016/j.talanta.2020.121026</pub-id> <pub-id pub-id-type="pmid">32498874</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Umeda</surname> <given-names>K.</given-names></name> <name><surname>Seto</surname> <given-names>Y.</given-names></name> <name><surname>Kohda</surname> <given-names>T.</given-names></name> <name><surname>Mukamoto</surname> <given-names>M.</given-names></name> <name><surname>Kozaki</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Genetic Characterization of <italic>Clostridium botulinum</italic> associated with Type B infant botulism in Japan.</article-title> <source><italic>J. Clin. Microbiol.</italic></source> <volume>47</volume> <fpage>2720</fpage>&#x2013;<lpage>2728</lpage>. <pub-id pub-id-type="doi">10.1128/JCM.00077-09</pub-id> <pub-id pub-id-type="pmid">19571018</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uysal</surname> <given-names>C. F.</given-names></name> <name><surname>Saridag</surname> <given-names>A. M.</given-names></name> <name><surname>Kilic</surname> <given-names>I. H.</given-names></name> <name><surname>Tokmakci</surname> <given-names>M.</given-names></name> <name><surname>Kahraman</surname> <given-names>M.</given-names></name> <name><surname>Aydin</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Identification of methicillin-resistant <italic>Staphylococcus aureus</italic> bacteria using surface-enhanced Raman spectroscopy and machine learning techniques</article-title>. <source><italic>Analyst</italic></source> <volume>145</volume>, <fpage>7559</fpage>&#x2013;<lpage>7570</lpage>. <pub-id pub-id-type="doi">10.1039/d0an00476f</pub-id> <pub-id pub-id-type="pmid">33135033</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vanhomwegen</surname> <given-names>J.</given-names></name> <name><surname>Berthet</surname> <given-names>N.</given-names></name> <name><surname>Mazuet</surname> <given-names>C.</given-names></name> <name><surname>Guigon</surname> <given-names>G.</given-names></name> <name><surname>Vallaeys</surname> <given-names>T.</given-names></name> <name><surname>Stamboliyska</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Application of high-density DNA resequencing microarray for detection and characterization of botulinum neurotoxin-producing clostridia.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<issue>67510</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0067510</pub-id> <pub-id pub-id-type="pmid">23818983</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vaz-Velho</surname> <given-names>M.</given-names></name> <name><surname>Duarte</surname> <given-names>G.</given-names></name> <name><surname>Gibbs</surname> <given-names>P.</given-names></name></person-group> (<year>2000</year>). <article-title>Evaluation of mini-VIDAS rapid test for detection of <italic>Listeria monocytogenes</italic> from production lines of fresh to cold-smoked fish.</article-title> <source><italic>J. Microbiol. Methods</italic></source> <volume>40</volume> <fpage>147</fpage>&#x2013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1016/s0167-7012(00)00118-4</pub-id> <pub-id pub-id-type="pmid">10699670</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Slavik</surname> <given-names>M. F.</given-names></name></person-group> (<year>1999</year>). <article-title>Rapid detection of <italic>Salmonella</italic> in chicken washes by immunomagnetic separation and flow cytometry.</article-title> <source><italic>J. Food Protect.</italic></source> <volume>62</volume>:<issue>717</issue>. <pub-id pub-id-type="doi">10.4315/0362-028x-62.7.717</pub-id> <pub-id pub-id-type="pmid">10419261</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wichmann</surname> <given-names>C.</given-names></name> <name><surname>Bocklitz</surname> <given-names>T.</given-names></name> <name><surname>R&#x00F6;sch</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2020</year>). <source><italic>Spectrochim. Acta A Mol. Biomol. Spectrosc.</italic></source> <volume>248</volume>:<issue>119170</issue>. <pub-id pub-id-type="doi">10.1016/j.saa.2020.119170</pub-id> <pub-id pub-id-type="pmid">33296748</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Witkowska</surname> <given-names>E.</given-names></name> <name><surname>Jagielski</surname> <given-names>T.</given-names></name> <name><surname>Kami&#x0144;ska</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Genus- and species-level identification of dermatophyte fungi by surface-enhanced Raman spectroscopy.</article-title> <source><italic>Spectrochim. Acta A Mol. Biomol. Spectrosc.</italic></source> <volume>192</volume> <fpage>285</fpage>&#x2013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1016/j.saa.2017.11.008</pub-id> <pub-id pub-id-type="pmid">29156315</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Witkowska</surname> <given-names>E.</given-names></name> <name><surname>Jagielski</surname> <given-names>T.</given-names></name> <name><surname>Kami&#x0144;ska</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>Genus- and species-level identification of dermatophyte fungi by surface-enhanced Raman spectroscopy.</article-title> <source><italic>Spectrochim. Acta A Mol. Biomol. Spectrosc.</italic></source> <volume>192</volume> <fpage>285</fpage>&#x2013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1016/j.saa.2017.11.008</pub-id> <pub-id pub-id-type="pmid">29156315</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Xu</surname> <given-names>C.</given-names></name> <name><surname>Tripp</surname> <given-names>R. A.</given-names></name> <name><surname>Huang</surname> <given-names>Y. W.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name></person-group> (<year>2013</year>). <article-title>Detection and differentiation of foodborne pathogenic bacteria in mung bean sprouts using field deployable label-free SERS devices</article-title>. <source><italic>Analyst</italic></source> <volume>138</volume>, <fpage>3005</fpage>&#x2013;<lpage>3012</lpage>. <pub-id pub-id-type="doi">10.1039/c3an00186e</pub-id> <pub-id pub-id-type="pmid">23563168</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>B.</given-names></name> <name><surname>Cheng</surname> <given-names>A. C.</given-names></name> <name><surname>Wang</surname> <given-names>M. S.</given-names></name> <name><surname>Deng</surname> <given-names>S. X.</given-names></name> <name><surname>Zhang</surname> <given-names>Z. H.</given-names></name> <name><surname>Yin</surname> <given-names>N. C.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Application of an indirect immunofluorescent staining method for detection of <italic>Salmonella enteritidis</italic> in paraffin slices and antigen location in infected duck tissues.</article-title> <source><italic>World J. Gastroenterol.</italic></source> <volume>14</volume> <fpage>776</fpage>&#x2013;<lpage>781</lpage>. <pub-id pub-id-type="doi">10.3748/wjg.14.776</pub-id> <pub-id pub-id-type="pmid">18205271</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>M.</given-names></name> <name><surname>Jing</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Deng</surname> <given-names>Q.</given-names></name> <name><surname>Wang</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Surface chemistry modified upconversion nanoparticles as fluorescent sensor array for discrimination of foodborne pathogenic bacteria.</article-title> <source><italic>J. Nanobiotechnol.</italic></source> <volume>18</volume>:<issue>41</issue>. <pub-id pub-id-type="doi">10.1186/s12951-020-00596-4</pub-id> <pub-id pub-id-type="pmid">32111217</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Jiang</surname> <given-names>H.</given-names></name> <name><surname>Gao</surname> <given-names>P.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Sun</surname> <given-names>H.</given-names></name> <name><surname>Huang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Confocal Raman microspectroscopy combined with chemometrics as a discrimination method of clostridia and serotypes of <italic>Clostridium botulinum</italic> strains</article-title>. <source><italic>J. Raman Spectrosc.</italic></source> <volume>52</volume>, <fpage>1820</fpage>&#x2013;<lpage>1829</lpage>. <pub-id pub-id-type="doi">10.1002/jrs.6244</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>G.</given-names></name> <name><surname>Lv</surname> <given-names>G.</given-names></name> <name><surname>Yin</surname> <given-names>L.</given-names></name> <name><surname>Luo</surname> <given-names>B.</given-names></name> <name><surname>Lv</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Combining derivative Raman with autofluorescence to improve the diagnosis performance of echinococcosis</article-title>. <source><italic>Spectrochim. Acta A Mol. Biomol. Spectrosc.</italic></source> <volume>247</volume>:<issue>119083</issue>. <pub-id pub-id-type="doi">10.1016/j.saa.2020.119083</pub-id> <pub-id pub-id-type="pmid">33137629</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>C.</given-names></name> <name><surname>Xu</surname> <given-names>W.</given-names></name> <name><surname>Zhang</surname> <given-names>P.</given-names></name> <name><surname>Jiang</surname> <given-names>M.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Kwok</surname> <given-names>R. T. K.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Engineering sensor arrays using aggregation-induced emission luminogens for pathogen identification.</article-title> <source><italic>Adv. Funct. Mater.</italic></source> <volume>29</volume> <fpage>1805986.1</fpage>&#x2013;<lpage>1805986.10</lpage>. <pub-id pub-id-type="doi">10.1039/d2an00643j</pub-id> <pub-id pub-id-type="pmid">35611940</pub-id></citation></ref>
</ref-list>
</back>
</article>