<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Public Health</journal-id>
<journal-title>Frontiers in Public Health</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Public Health</abbrev-journal-title>
<issn pub-type="epub">2296-2565</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpubh.2018.00006</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Public Health</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The <italic>CARPEDIEM</italic> Algorithm: A Rule-Based System for Identifying Heart Failure Phenotype with a Precision Public Health Approach</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Franchini</surname> <given-names>Michela</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/459809"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pieroni</surname> <given-names>Stefania</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/473979"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Passino</surname> <given-names>Claudio</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/517807"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Emdin</surname> <given-names>Michele</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Molinaro</surname> <given-names>Sabrina</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/156955"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Institute of Clinical Physiology, National Research Council</institution>, <addr-line>Pisa</addr-line>, <country>Italy</country></aff>
<aff id="aff2"><sup>2</sup><institution>Division of Cardiology and Cardiovascular Medicine, Fondazione Toscana Gabriele Monasterio</institution>, <addr-line>Pisa</addr-line>, <country>Italy</country></aff>
<aff id="aff3"><sup>3</sup><institution>Institute of Life Sciences, Scuola Superiore Sant&#x02019;Anna</institution>, <addr-line>Pisa</addr-line>, <country>Italy</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Enrico Capobianco, University of Miami, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Pietro Lio, University of Cambridge, United Kingdom; Ajay Desai, Royal Brompton &#x00026; Harefield NHS Foundation Trust, United Kingdom</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Sabrina Molinaro, <email>sabrina.molinaro&#x00040;ifc.cnr.it</email>, <email>molinaro&#x00040;ifc.cnr.it</email></corresp>
<fn fn-type="other" id="fn001"><p>Specialty section: This article was submitted to Digital Health, a section of the journal Frontiers in Public Health</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>29</day>
<month>01</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>6</volume>
<elocation-id>6</elocation-id>
<history>
<date date-type="received">
<day>23</day>
<month>08</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>01</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2018 Franchini, Pieroni, Passino, Emdin and Molinaro.</copyright-statement>
<copyright-year>2018</copyright-year>
<copyright-holder>Franchini, Pieroni, Passino, Emdin and Molinaro</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Modern medicine remains dependent on the accurate evaluation of a patient&#x02019;s health state, recognizing that disease is a process that evolves over time and interacts with many factors unique to that patient. The <italic>CARPEDIEM</italic> project represents a concrete attempt to address these issues by developing reproducible algorithms to support the accuracy in detection of complex diseases. This study aims to establish and validate the <italic>CARPEDIEM</italic> approach and algorithm for identifying those patients presenting with or at risk of heart failure (HF) by studying 153,393 subjects in Italy, based on patient information flow databases and is not reliant on the electronic health record to accomplish its goals. The resulting algorithm has been validated in a two-stage process, comparing predicted results with (1) HF diagnosis as identified by general practitioners (GPs) among the reference cohort and (2) HF diagnosis as identified by cardiologists within a randomly sampled subpopulation of 389 patients. The sources of data used to detect HF cases are numerous and were standardized for this study. The accuracy and the predictive values of the algorithm with respect to the GPs and the clinical standards are highly consistent with those from previous studies. In particular, the algorithm is more efficient in detecting the more severe cases of HF according to the GPs&#x02019; validation (specificity increases according to the number of comorbidities) and external validation (NYHA: II&#x02013;IV; HF severity index: 2, 3). Positive and negative predictive values reveal that the <italic>CARPEDIEM</italic> algorithm is most consistent with clinical evaluation performed in the specialist setting, while it presents a greater ability to rule out false-negative HF cases within the GP practice, probably as a consequence of the different HF prevalence in the two different care settings. Further development includes analyzing the clinical features of false-positive and -negative predictions, to explore the natural clustering of markers of chronic conditions by adding additional methodologies, e.g., Social Network Analysis. <italic>CARPEDIEM</italic> establishes the potential that an algorithmic approach, based on integrating administrative data with other public data sources, can enable the development of low cost and high value population-based evaluations for improving public health and impacting public health policies.</p>
</abstract>
<kwd-group>
<kwd>algorithm</kwd>
<kwd>phenotype</kwd>
<kwd>precision public health</kwd>
<kwd>heart failure</kwd>
<kwd>accuracy</kwd>
<kwd>predictive measures</kwd>
</kwd-group>
<contract-sponsor id="cn01">Ministero della Salute<named-content content-type="fundref-id">10.13039/501100003196</named-content></contract-sponsor>
<counts>
<fig-count count="4"/>
<table-count count="6"/>
<equation-count count="0"/>
<ref-count count="29"/>
<page-count count="10"/>
<word-count count="6551"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<title>Introduction</title>
<p>In today&#x02019;s healthcare environment, both physicians and patients are the beneficiaries of great scientific and technologic advances. However, modern medicine remains dependent on the accurate evaluation of a patient&#x02019;s health state, recognizing that disease is a process that evolves over time and interacts with many factors unique to that patient. As a result, the most critical factors that must be determined are as follows: (1) what is the disease path/process that a patient is exhibiting, i.e., the diagnosis, and (2) how far along that path has the patient progressed, i.e., disease staging. The entire premise of precision medicine requires the ability to determine both diagnosis and disease stage with a high degree of accuracy so that the appropriate selection of treatment pathway can be applied to optimize outcome (<xref ref-type="bibr" rid="B1">1</xref>).</p>
<p>In spite of the increased access to clinical data resources, there remains a lack of standardization of definitions and formats that negatively affects interoperability of health records. This further limits the development of comprehensive and universally adopted standards for diagnosis and treatment using evidence-based medicine and reduces the accuracy in establishing a patient&#x02019;s true health condition. The US Institute of Medicine estimates that approximately 10% of all diagnoses are incorrect. This estimate includes both type 1 errors (false-positive diagnoses) and type 2 errors (false-negative diagnoses) and leads to unnecessary patient deaths (estimated at 40,000&#x02013;80,000 per year in the United States) with an estimated cost of &#x00024;750 billion per year in the United States. Death due to medical errors has been estimated to be between 200,000 and 400,000 per year (the United States) and places this cause just behind heart disease and cancer with misdiagnosis and missed diagnosis representing between 15 and 25% of all medical errors (<xref ref-type="bibr" rid="B2">2</xref>). In addition, these numbers significantly underestimate healthcare costs associated with non-fatal misdiagnoses and missed diagnoses that result in inappropriate but non-fatal treatment of patients (<xref ref-type="bibr" rid="B3">3</xref>).</p>
<p>Heart failure (HF) is one of the most important public health problems encountered in the developed world. Despite the introduction of new treatments, HF is associated with high costs and poor outcomes. Many of the patients exhibit non-specific symptoms, which makes it difficult to identify HF and distinguish it from other conditions. Thus, many patients may have undiagnosed HF, or even when diagnosed, other undiagnosed concomitant conditions, as diabetes that is common in patients with acute HF, may confound the HF diagnosis. It is important to identify these patients and provide access to appropriate treatment to reduce mortality, improve healthcare, and reduce costs derived from undiagnosed disease (<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B5">5</xref>).</p>
<p>The development of rule-based systems for organizing and presenting information extracted from multiple sources of standardized data related to large patient populations that facilitate problem solving and decision making in the clinical field is the first step toward establishing Precision Public Health (<xref ref-type="bibr" rid="B6">6</xref>).</p>
<p>In Italy, common and interoperable syntaxes have been developed for the National Health System (NHS) subcomponents (hospitalization, outpatient specialized treatment, and others). These enable collection, in specific data flows, of all information (individual characteristics, treatments/drugs, tariffs, co-pay fee exemption) generated when citizens interact with the NHS. The use of these standardized information models facilitates their integration from different data sources. This enables the development of more precise diagnostic criteria; moreover performing the analysis on large databases that include many different healthcare processes and events, enhances patient phenotype identification/assignment.</p>
<p>Although the primary purpose of these sources of data is to collect, store, and exchange patient information to allow administrative management of the NHS (<xref ref-type="bibr" rid="B7">7</xref>), in this manner, they may be also used to identify patient cohorts by first deriving rule-based phenotyping from the combination of standards, archetypes, ontology, and reasoning (<xref ref-type="bibr" rid="B8">8</xref>) and then application of these algorithms to screen all patients in the database for potential candidates with as yet undiagnosed disease.</p>
<p>This effort to develop phenotyping algorithms requires consistency in data (archetypes level) and knowledge level (ontology) simultaneously because diagnostic criteria (reasoning) are calculated from raw data but also need derived variables not directly available in original data (<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B9">9</xref>).</p>
<p>Furthermore, administrative data represent not only a patients&#x02019; clinical characteristics but also the manner in which information is collected and recorded during the healthcare process events, ranging from inpatient admission, inpatient discharge, outpatient visit, emergency department visit, and ambulatory surgery (<xref ref-type="bibr" rid="B10">10</xref>). The propensity of the variables for clustering based on their association with specific healthcare process events results in better derivation of phenotyping algorithms for cohort/classification (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>This study was undertaken to establish and validate the <italic>CARPEDIEM</italic> approach and algorithm for identifying those patients presenting with or at risk for HF to streamline the development of patient registries for supporting the proactive approach inspired by the Chronic Care Model, developed by Wagner in the late 1990s (<xref ref-type="bibr" rid="B11">11</xref>). This model assumes that improvement in care requires an approach that incorporates patient, provider, and system level interventions and that better coordination of care may also reduce medical expenditures, especially for persons with multiple chronic conditions (<xref ref-type="bibr" rid="B12">12</xref>).</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2-1">
<title>Reference Cohort and Data Sources</title>
<p>The reference cohort (153,393 subjects) used in this study includes all patients older than 14&#x02009;years attending those general practitioners (GPs) who participated in the PHP model in 2008&#x02013;2012 in two different geographical areas in the center of Tuscany (area 1: 83 GPs; area 2: 31 GPs).</p>
<p>The administrative data used to detect HF cases include (1) an archive of all residents in the Tuscany areas receiving NHS assistance storing demographic and administrative data, including the GP&#x02019;s identifier of each subject, (2) hospital discharge forms from public and/or private hospitals reporting all diagnoses related to hospitalizations, (3) all outpatient drug prescriptions reimbursable by the NHS, (4) certifications of chronic disease for the exemption from co-payment, and (5) all outpatient prescriptions for visits, laboratory/imaging tests, and medications.</p>
<p>Moreover, the pathology register that contains reporting from GPs about all their patients presenting HF was used as one of the standards for validating the algorithm. Apart from the last source of data referring to the period 2010&#x02013;2012, the others referred to the period 2011&#x02013;2012.</p>
<p>In all the archives, each patient&#x02019;s record has been deidentified using an encrypted unique identifier code, which became the key element to link the sources of data through a deterministic approach.</p>
<p>Each final patient record, created through the linkage procedure, contained information about gender, age class, the presence or absence of comorbidities (the clinical features), and HF diagnosis as identified by GPs.</p>
</sec>
<sec id="S2-2">
<title>The CARPEDIEM Algorithm Definition Process</title>
<p>The <italic>CARPEDIEM</italic> algorithm involves a four-step process (Figure <xref ref-type="fig" rid="F1">1</xref>), which includes (1) development of preliminary inference rules to build up the phenotype definition, (2) evaluation and revision of these rules, (3) selection of the final rules, and (4) subsequent validation by checking the rules against the standards.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Schema of the <italic>CARPEDIEM</italic> algorithm.</p></caption>
<graphic xlink:href="fpubh-06-00006-g001.tif"/>
</fig>
<p>The preliminary set of inference rules were developed by referring to a classification method that defines for each patient the groups of conditions to which he/she belongs. Through this model, each patient is classified based on criteria that consider (1) &#x0201C;exemption from co-payment codes&#x0201D; (codes elaborated by Italian Law M.D. of May 28, 1999 nr. 329 and further modifications), (2) diagnosis codes [International Classification of Diseases, Ninth Revision, Clinical Modification (ICD9-CM)] from hospital discharge, (3) drug classes as administered to the patient, and (4) related daily defined dose. Drug classes were defined using the Anatomical Therapeutic Chemical classification index derived from drug prescription data flow. The inclusion criteria were originally developed by the University of Pavia (<xref ref-type="bibr" rid="B13">13</xref>). The Pavia model defines 17 categories each with a different level of severity: the inference rules that define the 16 classifications other than &#x0201C;cardiovascular&#x0201D; were used to estimate the number and the type of patient&#x02019;s comorbidities (<xref ref-type="bibr" rid="B14">14</xref>).</p>
<p>The preliminary inference rules identify an HF case if (1) he/she has drug prescriptions of &#x003B2;-blocker, spironolactone (SPI), furosemide (FUR), angiotensin-converting enzyme (ACE) inhibitors, angiotensin receptor blocker (ARB), digoxin (DIG), and canrenone as detailed in Table <xref ref-type="table" rid="T1">1</xref>, (2) he/she has a &#x0201C;exemption from co-payment code&#x0201D; for HF (code 021), or (3) he/she has diagnosis of HF at hospital discharge, i.e., diagnosis of hypertensive cardiopathy with heart failure, cardiovascular hypertension with heart failure and chronic renal disease, cardiomyopathy, and heart failure (ICD9-CM codes listed in the Table <xref ref-type="table" rid="T2">2</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Drug codes included in the <italic>CARPEDIEM</italic> algorithm as markers of heart failure.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="center"/>
<th valign="top" align="left">Drug</th>
<th valign="top" align="center">Anatomical therapeutic chemical code</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="4">&#x003B2;-blocker</td>
<td align="left" valign="top">Bisoprolol</td>
<td align="center" valign="top">C07AB07</td>
</tr>
<tr>
<td align="left" valign="top">Nebivolol</td>
<td align="center" valign="top">C07AB12</td>
</tr>
<tr>
<td align="left" valign="top">Carvedilol</td>
<td align="center" valign="top">C07AG02</td>
</tr>
<tr>
<td align="left" valign="top">Metoprolol</td>
<td align="center" valign="top">C07AB02</td>
</tr>
<tr>
<td align="left" valign="top" colspan="3"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Angiotensin-converting enzyme inhibitors</td>
<td align="left" valign="top">Enalapril</td>
<td align="center" valign="top">C09AA02</td>
</tr>
<tr>
<td align="left" valign="top">Captopril</td>
<td align="center" valign="top">C09AA01</td>
</tr>
<tr>
<td align="left" valign="top">Lisinopril</td>
<td align="center" valign="top">C09AA03</td>
</tr>
<tr>
<td align="left" valign="top">Ramipril</td>
<td align="center" valign="top">C09AA05</td>
</tr>
<tr>
<td align="left" valign="top" colspan="3"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Angiotensin receptor blocker</td>
<td align="left" valign="top">Candesartan</td>
<td align="center" valign="top">C09CA06</td>
</tr>
<tr>
<td align="left" valign="top">Losartan</td>
<td align="center" valign="top">C09CA01</td>
</tr>
<tr>
<td align="left" valign="top">Valsartan</td>
<td align="center" valign="top">C09CA03</td>
</tr>
<tr>
<td align="left" valign="top" colspan="3"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Other</td>
<td align="left" valign="top">Digoxin</td>
<td align="center" valign="top">C01AA05</td>
</tr>
<tr>
<td align="left" valign="top">Canrenone</td>
<td align="center" valign="top">C03DA03</td>
</tr>
<tr>
<td align="left" valign="top">Spironolacton</td>
<td align="center" valign="top">C03DA01</td>
</tr>
<tr>
<td align="left" valign="top">Furosemide</td>
<td align="center" valign="top">C03CA01</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>ICDIX codes included in the <italic>CARPEDIEM</italic> algorithm as markers of HF.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Diagnosis</th>
<th valign="top" align="center">ICD9-CM code</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="3">Hypertensive cardiopathy with heart failure</td>
<td align="center" valign="top">40201</td>
</tr>
<tr>
<td align="center" valign="top">40211</td>
</tr>
<tr>
<td align="center" valign="top">40291</td>
</tr>
<tr>
<td align="center" valign="top" colspan="2"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Cardiovascular hypertension with heart failure and chronic renal disease</td>
<td align="center" valign="top">40401</td>
</tr>
<tr>
<td align="center" valign="top">40403</td>
</tr>
<tr>
<td align="center" valign="top">40411</td>
</tr>
<tr>
<td align="center" valign="top">40413</td>
</tr>
<tr>
<td align="center" valign="top" colspan="2"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Cardionephropathy hypertension with heart failure</td>
<td align="center" valign="top">40491</td>
</tr>
<tr>
<td align="center" valign="top">40493</td>
</tr>
<tr>
<td align="center" valign="top" colspan="2"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="5">Cardiomyopathy</td>
<td align="center" valign="top">4254</td>
</tr>
<tr>
<td align="center" valign="top">4255</td>
</tr>
<tr>
<td align="center" valign="top">4257</td>
</tr>
<tr>
<td align="center" valign="top">4258</td>
</tr>
<tr>
<td align="center" valign="top">4259</td>
</tr>
<tr>
<td align="center" valign="top" colspan="2"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Heart failure</td>
<td align="center" valign="top">4280</td>
</tr>
<tr>
<td align="center" valign="top">4281</td>
</tr>
<tr>
<td align="center" valign="top">4289</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In general, the validity of rule-based identification system is assessed by comparing the diagnosis from the administrative database to an accepted &#x0201C;gold standard&#x0201D; reference diagnosis and by estimating the principal measures of validity such as sensitivity, specificity, predictive values, likelihood ratios (LRs), and Kappa scores.</p>
<p>Heart failure patients with clinical evidence documented by GPs represented the initial gold standard against which we tested the preliminary rules.</p>
<p>Heart failure prevalence rate estimated by the preliminary rules was considered as an indication of the false-positive rate and accuracy of the preliminary rules.</p>
<p>To improve the accuracy of the rule-based system, the criteria derived from semistructured individual and independent interviews with three cardiologists and from clinical guidelines were combined into a matrix of possible therapeutic strategies for HF, which might be identified from the administrative data.</p>
<p>In particular, during the interviews, the clinicians were asked to indicate which drug codes they thought should be grouped into combinations that are considered to be clinically appropriate for treating HF in its various levels of severity. The resulting grouping rules consider &#x003B2;-blockers (BBs), ACE inhibitor, ARBs, DIG, Candesartan (CAN), SPI, FUR with different possible combinations: (1) BB and ACE, (2) BB and ARB, (3) DIG and BB, (4) DIG and (ACE or ARB), (5) CAN and BB, (6) CAN and (ACE or ARB), (7) SPI and BB, (8) SPI and (ACE or ARB), (9) FUR and BB, (10) FUR and (ACE or ARB), (11) FUR and DIG, (12) FUR and CAN, and (13) FUR and SPI. Within a single combination, drug codes were taken in logical conjunction (AND connective), and then each drug combination was taken in inclusive disjunction (OR connective) with diagnoses (Table <xref ref-type="table" rid="T2">2</xref>) and exemption from co-payment code for HF.</p>
<p>This leads to identifying a narrow rule-based system, the <italic>CARPEDIEM</italic> algorithm, which was tested both against the GPs and the clinical standard.</p>
<p>The external clinical validation was performed in a subpopulation of 389 patients (area 1: 269; area 2: 120) randomly sampled among those patients of the reference cohort who attended the Gabriele Monasterio Foundation (GM Foundation) for hospitalizations or outpatient activities during the observational period (2011&#x02013;2014).</p>
<p>Those patients were clinically characterized through specific laboratory parameters (Ejection Fraction, NT-proBNP, HF severity index, NYHA index, etc.). The cardiologists performed a blinded HF assessment by evaluating the specific clinical parameters only.</p>
<p>The cardiologists&#x02019; classification represented the external clinical standard against which we checked the accuracy and predictive values of the <italic>CARPEDIEM</italic> algorithm.</p>
<p>In the following sections, we describe the result of each step of this process.</p>
</sec>
</sec>
<sec id="S3">
<title>Results</title>
<sec id="S3-1">
<title>Development and Evaluation of the Preliminary Inference Rules</title>
<p>To build inference rules, it is critical to determine what elements might be informative for identifying health conditions.</p>
<p>There are a variety of approaches for classifying patients into a particular phenotype, and our hypothesis is that the process of phenotype identification is more efficient if considering many different information sources. So, we began with the available data sets, including a set of conditions (hospitalization, drug treatment, and exemptions) taken in inclusive disjunction (OR).</p>
<p>The use of preliminary rules allowed to identify a total of 37,210 HF cases, equal to a prevalence rate of 6.1% (area 1: 5.9%; area 2: 6.5%), as shown in Table <xref ref-type="table" rid="T3">3</xref>.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Annual percentage prevalence rate of HF estimated either by GPs or referring to the preliminary and final rules (2011&#x02013;2014).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Source</th>
<th valign="top" align="center">Area 1</th>
<th valign="top" align="center">Area 2</th>
<th valign="top" align="center">Area 1&#x02009;&#x0002B;&#x02009;area 2</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">GPs assessment (GPstd)</td>
<td align="center" valign="top">1.4%</td>
<td align="center" valign="top">1.2%</td>
<td align="center" valign="top">1.3%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% lower</td>
<td align="center" valign="top">1.3%</td>
<td align="center" valign="top">1.1%</td>
<td align="center" valign="top">1.3%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% upper</td>
<td align="center" valign="top">1.5%</td>
<td align="center" valign="top">1.3%</td>
<td align="center" valign="top">1.4%</td>
</tr>
<tr>
<td align="left" valign="top">Preliminary rules (HFprR)</td>
<td align="center" valign="top">5.9%</td>
<td align="center" valign="top">6.5%</td>
<td align="center" valign="top">6.1%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% lower</td>
<td align="center" valign="top">5.8%</td>
<td align="center" valign="top">6.3%</td>
<td align="center" valign="top">6.0%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% upper</td>
<td align="center" valign="top">5.9%</td>
<td align="center" valign="top">6.6%</td>
<td align="center" valign="top">6.1%</td>
</tr>
<tr>
<td align="left" valign="top">CARPEDIEM algorithm</td>
<td align="center" valign="top">2.2%</td>
<td align="center" valign="top">2.5%</td>
<td align="center" valign="top">2.3%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% lower</td>
<td align="center" valign="top">2.2%</td>
<td align="center" valign="top">2.5%</td>
<td align="center" valign="top">2.3%</td>
</tr>
<tr>
<td align="left" valign="top">IC 95% upper</td>
<td align="center" valign="top">2.3%</td>
<td align="center" valign="top">2.6%</td>
<td align="center" valign="top">2.4%</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>These rates were four times higher than the value estimated by GPs (patients with life course documented clinical evidence), and this suggested a potentially high number of false positives influencing the accuracy of the rule-based system.</p>
</sec>
<sec id="S3-2">
<title>Selection and Evaluation of the <italic>CARPEDIEM</italic> Algorithm</title>
<p>The broader definition of HF, referring to the inclusive disjunction (OR) among codes derived from different data sources, tended to overestimate the number of false-positive cases.</p>
<p>The narrower definition included in the final rules was more accurate in identifying HF cases. The prevalence rate estimated by running the <italic>CARPEDIEM</italic> algorithm was 2.3% (area 1: 2.2%; area 2: 2.5%), a value slightly higher than the GPs standard (Table <xref ref-type="table" rid="T3">3</xref>).</p>
</sec>
<sec id="S3-3">
<title>External Validation of the <italic>CARPEDIEM</italic> Algorithm</title>
<p>The patient group used for the external validation included 312 HF cases identified through the <italic>CARPEDIEM</italic> algorithm in the period 2011&#x02013;2014 and 77 subjects not affected by HF.</p>
<p>Age and gender distribution of the patients in the validation subpopulation was different compared to the general population. In particular, individuals aged 65&#x02009;years and older and males were overrepresented in the subpopulation compared to the reference cohort (76.3 vs 27.5% and 51.9 vs 47.5%, respectively).</p>
<p>This could be expected as the subpopulation used for the validation and composed of individuals who interacted with a public entity of the Regional Health Service (GM Foundation), which is particularly involved in managing patients with more severe cardiovascular diseases.</p>
<p>Figure <xref ref-type="fig" rid="F2">2</xref> shows the accuracy and predictive measures of the final rules estimated by the comparison with the external clinical standard.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Accuracy and predictive measures of the <italic>CARPEDIEM</italic> algorithm.</p></caption>
<graphic xlink:href="fpubh-06-00006-g002.tif"/>
</fig>
<p>The comparison with the clinical standard showed that the <italic>CARPEDIEM</italic> algorithm validity varied by age, with sensitivity that increased from 75 to 92% according to the increase of age, while specificity sharply decreased.</p>
<p>Positive predictive value (PPV) and negative predictive value (NPV) were generally higher than 65% and varied by age.</p>
<p>The <italic>CARPEDIEM</italic> algorithm validity also varied according to the worsening of HF severity estimated by using NYHA and HF severity index (Tables <xref ref-type="table" rid="T4">4</xref> and <xref ref-type="table" rid="T5">5</xref>).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Degree of concordance by NHYA score between <italic>CARPEDIEM</italic> algorithm and external standard.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">NYHA score</th>
<th valign="top" align="center">Cohen&#x02019;s Kappa</th>
<th valign="top" align="left">Strength of agreement</th>
<th valign="top" align="center"><italic>CARPEDIEM</italic> algorithm, number of cases (number of patients)<xref ref-type="table-fn" rid="tfn1"><sup>a</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">I&#x02014;Cardiac disease, but no symptoms and no limitation</td>
<td align="center" valign="top">0.15</td>
<td align="left" valign="top">Poor</td>
<td align="center" valign="top">17 (22)</td>
</tr>
<tr>
<td align="left" valign="top">II&#x02014;Mild symptoms and slight limitation during ordinary activity</td>
<td align="center" valign="top">0.57<xref ref-type="table-fn" rid="tfn2">&#x0002A;</xref></td>
<td align="left" valign="top">Moderate</td>
<td align="center" valign="top">84 (94)</td>
</tr>
<tr>
<td align="left" valign="top">III&#x02014;Marked limitation in activity due to symptoms, even during less than ordinary activity</td>
<td align="center" valign="top">0.57<xref ref-type="table-fn" rid="tfn2">&#x0002A;</xref></td>
<td align="left" valign="top">Moderate</td>
<td align="center" valign="top">85 (98)</td>
</tr>
<tr>
<td align="left" valign="top">IV&#x02014;Severe limitations</td>
<td align="center" valign="top">0.67<xref ref-type="table-fn" rid="tfn2">&#x0002A;</xref></td>
<td align="left" valign="top">Good</td>
<td align="center" valign="top">14 (16)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1"><p><italic><sup>a</sup>Limited to 200 patients with heart failure clinically validated and New York Heart Association score &#x02265;1</italic>.</p></fn>
<fn id="tfn2"><p><italic>&#x0002A;p&#x02009;&#x0003C;&#x02009;0.05</italic>.</p></fn></table-wrap-foot></table-wrap>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>Degree of concordance by heart failure (HF) severity score between the <italic>CARPEDIEM</italic> algorithm and the external standard.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Heart failure severity score</th>
<th valign="top" align="center">Cohen&#x02019;s Kappa</th>
<th valign="top" align="center">Strength of agreement</th>
<th valign="top" align="center"><italic>CARPEDIEM</italic> algorithm, number of cases (number of patients)<xref ref-type="table-fn" rid="tfn3"><sup>a</sup></xref></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">1</td>
<td align="center" valign="top">0.44<xref ref-type="table-fn" rid="tfn4">&#x0002A;</xref></td>
<td align="center" valign="top">Moderate</td>
<td align="center" valign="top">98 (121)</td>
</tr>
<tr>
<td align="left" valign="top">2</td>
<td align="center" valign="top">0.66<xref ref-type="table-fn" rid="tfn4">&#x0002A;</xref></td>
<td align="center" valign="top">Good</td>
<td align="center" valign="top">28 (29)</td>
</tr>
<tr>
<td align="left" valign="top">3</td>
<td align="center" valign="top">0.70<xref ref-type="table-fn" rid="tfn4">&#x0002A;</xref></td>
<td align="center" valign="top">Good</td>
<td align="center" valign="top">70 (76)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3"><p><italic><sup>a</sup>Limited to 226 patients with HF clinically validated and New York Heart Association (NYHA) score &#x02265;1</italic>.</p></fn>
<fn id="tfn4"><p><italic>&#x0002A;p&#x02009;&#x0003C;&#x02009;0.05</italic>.</p></fn></table-wrap-foot></table-wrap>
</sec>
<sec id="S3-4">
<title>Validation of the <italic>CARPEDIEM</italic> Algorithm against GPs Assessment</title>
<p>Table <xref ref-type="table" rid="T6">6</xref> shows the accuracy and predictive measures of the <italic>CARPEDIEM</italic> algorithm estimated by the comparison with true positive and negative HF cases as identified by GPs within the whole reference cohort (153,393 subjects).</p>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p>Accuracy and predictive measures of the <italic>CARPEDIEM</italic> algorithm, estimated by the comparison with the GPs assessment (GPstd) by area.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="center"/>
<th valign="top" align="center">Sensitivity IC 95%</th>
<th valign="top" align="center">Specificity IC 95%</th>
<th valign="top" align="center">PPV IC 95%</th>
<th valign="top" align="center">NPV IC 95%</th>
<th valign="top" align="center">Likelihood ratio (LR) (&#x0002B;) IC 95%</th>
<th valign="top" align="center">LR (&#x02212;) IC 95%</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="3">Area 1</td>
<td align="center" valign="top">72.5</td>
<td align="center" valign="top">93.8</td>
<td align="center" valign="top">14.3</td>
<td align="center" valign="top">99.6</td>
<td align="center" valign="top">11.7</td>
<td align="center" valign="top">0.3</td>
</tr>
<tr>
<td align="center" valign="top">72.2</td>
<td align="center" valign="top">93.6</td>
<td align="center" valign="top">14.1</td>
<td align="center" valign="top">99.5</td>
<td align="center" valign="top">11.3</td>
<td align="center" valign="top">0.3</td>
</tr>
<tr>
<td align="center" valign="top">72.7</td>
<td align="center" valign="top">94.0</td>
<td align="center" valign="top">14.5</td>
<td align="center" valign="top">99.6</td>
<td align="center" valign="top">12.1</td>
<td align="center" valign="top">0.3</td>
</tr>
<tr>
<td align="left" valign="top" colspan="7"><hr/></td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Area 2</td>
<td align="center" valign="top">83.3</td>
<td align="center" valign="top">92.7</td>
<td align="center" valign="top">12.5</td>
<td align="center" valign="top">99.8</td>
<td align="center" valign="top">11.5</td>
<td align="center" valign="top">0.2</td>
</tr>
<tr>
<td align="center" valign="top">83.0</td>
<td align="center" valign="top">92.5</td>
<td align="center" valign="top">12.3</td>
<td align="center" valign="top">99.7</td>
<td align="center" valign="top">11.1</td>
<td align="center" valign="top">0.2</td>
</tr>
<tr>
<td align="center" valign="top">83.6</td>
<td align="center" valign="top">92.9</td>
<td align="center" valign="top">12.8</td>
<td align="center" valign="top">99.8</td>
<td align="center" valign="top">11.8</td>
<td align="center" valign="top">0.2</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><italic>CARPEDIEM</italic> algorithm performance is also expressed in terms of LRs, the likelihood of a given test result in a person with the disease, compared with the likelihood of this result in a person without the disease (<xref ref-type="bibr" rid="B15">15</xref>). In other words, LRs indicate how much a given test result will raise or lower the probability of having the event that the test is designed to predict (<xref ref-type="bibr" rid="B16">16</xref>).</p>
<p>Predictive values give probabilities of abnormality for particular test results, but they depend on the prevalence of abnormality in the study sample and they can be rarely generalized beyond the study, except when the study is based on a suitable random sample, as is sometimes the case for population screening studies (<xref ref-type="bibr" rid="B17">17</xref>).</p>
<p>Otherwise, LRs are alternative statistics for summarizing diagnostic accuracy that have several particularly powerful properties making them more useful clinically than other statistics.</p>
<p>LR is portable, while predictive values of test are driven by the prevalence of the disease. Moreover, while predictive values infer test characteristics to the population, LRs can be applied to a specific patient (<xref ref-type="bibr" rid="B18">18</xref>) and again the LR is thought to be more stable because sensitivity and specificity usually vary in opposite directions (<xref ref-type="bibr" rid="B19">19</xref>).</p>
<p>As with all ratios, LRs range from zero to infinity and hence the further LR is from 1, the greater effect is on the probability of disease. In particular, tests with LR(&#x0002B;) between 2 and 5 are only somewhat useful, between 5 and 10 are moderately useful, tests with LR less than 2 are not useful, and tests with LR(&#x0002B;) greater than 10 are considered very useful.</p>
<p>LR(&#x02212;) is the proportion between false- and true-negative patients. Tests with LR(&#x02212;) between 0.2 and 0.5 are only somewhat useful, between 0.1 and 0.2 are moderately useful, tests with LR greater than 0.5 are of no use, and tests with LR(&#x02212;) less than 0.1 are considered very useful.</p>
<p>In both areas, LR(&#x0002B;) values showed the very useful results of applying the <italic>CARPEDIEM</italic> algorithm in raising the probability of having HF, while its ability in correctly identifying the true negatives is moderate (Table <xref ref-type="table" rid="T6">6</xref>).</p>
<p>In term of accuracy, the <italic>CARPEDIEM</italic> algorithm showed a good sensitivity and a very high specificity (Figure <xref ref-type="fig" rid="F3">3</xref>), but these rates varied according to the number of different pathological categories the patient belongs to, or in other term, according to the complexity of the clinical status (severity). As a general rule, sensitivity increased mainly depending on two types of evidence: (1) among clinically similar or related diseases many medications overlap and (2) patients with a higher number of comorbidities have more probability of being hospitalized and this makes these patients better identifiable.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Accuracy and predictive measures of the <italic>CARPEDIEM</italic> algorithm, estimated by the comparison with the general practitioners assessment (GPstd), by the number of comorbidities.</p></caption>
<graphic xlink:href="fpubh-06-00006-g003.tif"/>
</fig>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>Heart failure happens when the heart cannot pump enough blood and oxygen to support other organs in the body. It is a complex condition that can result from coronary artery disease, heart attack, cardiomyopathy, high blood pressure, diabetes, thyroid disease, kidney disease, valve disease, etc. and requires a specialist for both accurate diagnosis and therapeutic intervention. Globally it remains the single largest cause of death. The prevalence of this condition in Italy is about 1.25% [95% confidence interval (CI), 1.23&#x02013;1.27], and the incidence rate was 1.99 per 1000 person-years (95% CI, 1.81&#x02013;2.08) that is expected to grow (<xref ref-type="bibr" rid="B20">20</xref>).</p>
<p>As with most chronic diseases, in the absence of effective prevention, early detection and appropriate treatment are critical for patient management and quality of life.</p>
<p>The estimated cost for HF in Italy is &#x00024;39.2 billion annually (<xref ref-type="bibr" rid="B21">21</xref>).</p>
<p>Because of the variability in cause, in presentation, in patterns of comorbidities, etc. and especially in the differences in presentation of symptoms between males and females, it is critical that patients be identified who are at risk or in early stages of this disease so that proper diagnosis and treatment can be obtained. The complexity associated with HF makes it significantly susceptible to the errors of misdiagnosis and missed diagnosis, especially in patients who are not engaged with a cardiac specialist.</p>
<p>Understanding the complexity associated with the healthcare process is vital for achieving scalable and high-throughput phenotyping algorithms that might operate on the administrative data contained in the NHS data repositories (Figure <xref ref-type="fig" rid="F4">4</xref>).</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Schema of the <italic>CARPEDIEM</italic> approach.</p></caption>
<graphic xlink:href="fpubh-06-00006-g004.tif"/>
</fig>
<p>A significant aspect of the <italic>CARPEDIEM</italic> approach is that it is based on the data that exist with patient information flow databases and is not reliant on the electronic health record to accomplish its goals.</p>
<p>The nature of HF, not considered to be a discrete condition, but a &#x0201C;complex clinical syndrome&#x0201D; (<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B22">22</xref>) characterized by high comorbidity burdens, makes the integration among different healthcare process events mandatory in defining HF cases.</p>
<p>Moreover, recent literature stated that most if not all age-related diseases share low-grade chronic inflammation termed &#x0201C;Inflammaging,&#x0201D; which is a highly significant risk factor for both morbidity and mortality in elderly people (<xref ref-type="bibr" rid="B23">23</xref>).</p>
<p>Chronic inflammation can be prevented and cured (<xref ref-type="bibr" rid="B24">24</xref>), but there is an urgent need to better understand the links between molecular features and phenotype to develop strategies to deal with their implications for the individual patient (<xref ref-type="bibr" rid="B25">25</xref>).</p>
<p>Currently used organizational structure of the taxonomy of diseases, not taking into account that multiple different diseases share a common molecular cause, is so rigid as to preclude description of the complex interrelationships that link diseases to each other and to the vast array of causative factors.</p>
<p>This confirms the hypothesis that combining different variables from many sources of administrative data (hospitalization, drugs, ambulatory care record) is the best strategy in identifying cases of specific disease basing on their association with specific healthcare process events (<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>For instance, while HF may contribute to the need of hospitalization caused by other health problems, the HF diagnosis may not be entered on the discharge record. Considering diagnosis codes from hospital discharge as a sole criterion to identify cases of HF, leads to underestimate the prevalence of HF (<xref ref-type="bibr" rid="B26">26</xref>).</p>
<p>In integrating different sources of data, it is mandatory to consider that the inclusion of drug prescription codes in the rule-based identification system could increase the HF false-positive rate, because among clinically similar or related diseases, many medications overlap (<xref ref-type="bibr" rid="B27">27</xref>). Otherwise it is also true that the pharmacological treatment of patients with chronic HF seems to be acceptably adherent to the recommendations (<xref ref-type="bibr" rid="B28">28</xref>).</p>
<p>In this study, we specify, apply, and validate the most suitable algorithm to identify those patients with or at risk to develop HF and we set this as the first step for improving the Precision Public Health approach.</p>
<p>In particular, we tested two different combinations of diagnostic and therapeutic codes used for identifying individuals with HF from the administrative data of the general population.</p>
<p>The broader combination was based on the inclusive disjunction (OR connective) among drug, diagnoses, and exemption codes, while in the narrow combination, the <italic>CARPEDIEM</italic> algorithm, drug codes were grouped in some possible therapeutic strategies for HF. These subgroups were taken in inclusive disjunction (OR connective) with diagnoses and exemption codes.</p>
<p>The HF prevalence rate obtained by running the broader combination is four times higher than the value estimated by GPs, while the prevalence from the <italic>CARPEDIEM</italic> algorithm is only slightly higher (2.3%; 2.3&#x02013;2.4%).</p>
<p>The accuracy and the predictive values for the <italic>CARPEDIEM</italic> algorithm with respect to the GPs and the clinical standards are highly consistent with those from other studies in terms of validity of HF diagnoses in administrative databases. In particular, the sensitivity of the <italic>CARPEDIEM</italic> algorithm compared to the clinical standard ranges from 72 to 92% according to the increase of age, with an average value of 88%, which is consistent with those values estimated among the high-quality studies selected by McCormick et al. (<xref ref-type="bibr" rid="B29">29</xref>).</p>
<p>The sensitivity of the <italic>CARPEDIEM</italic> algorithm compared to the GPs standard is a bit lower (76.1%; IC 95%: 75.3&#x02013;76.2), but it varies from 34.1 to 81.3% according to the complexity of the clinical status, expressed as the number of comorbidities belonging to different categories of disease (disability, psychiatric disorders, chronic renal insufficiency, transplantation, neoplasms, cardiovascular diseases, chronic obstructive pulmonary disease, gastroenteropathy, neuropathy, autoimmune, endocrine and metabolic diseases, diabetes, rare diseases, pregnancy, and other health conditions that cannot be classified as chronic disease or pregnancy).</p>
<p>Evidence of the relationship among clinical status severity and the <italic>CARPEDIEM</italic> algorithm validity also comes from the extent of agreement with the clinical scores assessed within the validation cohort (NHYA and HF severity), which increases from less severe conditions to the worst (Cohen&#x02019;s Kappa from 0.44 to 0.70).</p>
<p>These results suggested that severe cases of HF may be recorded more often in administrative databases than mild severity ones.</p>
<p>PPV is moderate (65.1%) when the final rules have been compared to the clinical standard and very low when compared to the GPs standard (13.6%; IC 95%: 13.4&#x02013;13.8). On the contrary, the best NPV refers to the GPs standard (99 vs 65%).</p>
<p>This means that the rule-based system is more consistent with a clinical evaluation performed in a specialist setting while it has a more intense ability in ruling out the false negative HF cases within a more general clinical setting (General Medicine). This could be critical to add in the early detection of patients either at risk for HF or in early stages of HF and enable their referral to specialists who can further optimize their cardiovascular care.</p>
<p>This could reflect that PPV and NPV are both dependent on the prevalence of the condition in the reference population. As a consequence, PPV will be higher in a specialist setting where the prevalence is higher, while the NPV will be better in a general setting were the prevalence is lower.</p>
<p>Further support about the rule-based system validity respect to the general setting was provided by the LR values estimated through the comparison with GPs standard. In particular, LR (&#x0002B;) is greater than 10 in both geographic areas, and this means that an individual classified for HF by the <italic>CARPEDIEM</italic> algorithm is more than 10 times likely to have had a HF diagnosed by GPs with respect to a not classified individual.</p>
<p>In summary, <italic>CARPEDIEM</italic> establishes the potential that an algorithmic approach, based on integrating the administrative data with other public data sources, can present the opportunity to develop low-cost and high-value population-based evaluations for improving public health and impacting public health policies. This suggests that the <italic>CARPEDIEM</italic> approach can become a significant contributor to the ultimate goal of Personalized Medicine by providing a reliable and reproducible component of disease diagnosis. Moreover, the core of the algorithm can be easily customized to process data sources other than those we used in our study, provided that they contain standardized clinical information. This represents a potential for evolution and improvement of predictive levels of the <italic>CARPEDIEM</italic> algorithm not only in the heart diseases.</p>
<p>Further research will be carried out by using alternative methodologies such as the Social Network Analysis, a specific tool for enhancing hypothesis generation and with a particular ability in identifying complex and hidden relationships among the data, exposing them for further appropriate statistical verification and supporting complex investigation about undiagnosed HF and gender differences in the clinical presentation of HF.</p>
<p>Thus, the ability to examine readily accessible data, such as the administrative data contained in the National Health Service database, can be a critical component of a broad population-focused surveillance program in HF and also lead to its potential application in other chronic and complex disorders, and <italic>CARPEDIEM</italic> provides an initial prototype for such development.</p>
</sec>
<sec id="S5">
<title>Ethics Statement</title>
<p>This study was carried out in accordance with Italian Legislative Decree 196/2003 on privacy, as stated by the ethical committee of &#x0201C;Area Vasta Nord Ovest PISA&#x0201D; (approval document no. 539/2015, on February 20, 2015). Written informed consent was given by the patients for their information to be stored in the databases used in this study. Written consent follows the criteria established in the Arts. 13 and 22 of the Italian Legislative Decree 196/2003 on privacy. GPs collect written consent only once, at the time of taking in charge of each patient. GPs usually store written consents in their digital archives. Written consent is mandatory whenever patient and NHS interact. All the written consents also regards statistical processing of the data for research purposes. In this case, data must be anonymous, and the patient must not be identifiable in any way. The linkage procedure among the databases was performed using encrypted unique identifier codes created for each patient by the Local Administrative Authority, which is the health provider legally authorized for the processing of sensitive data.</p>
</sec>
<sec id="S6" sec-type="author-contributor">
<title>Author Contributions</title>
<p>MF provided substantial contributions to the epidemiological conception and design of the work, assisted with acquisition of data and algorithm identification, completed all analysis, led interpretation of data, drafted the work, and revised it critically. SP provided relevant technical contributions to the design of the work, assisted with acquisition of data and algorithm identification, led interpretation of data, and assisted in drafting and revising the work. CP and ME provided substantial contribution to the clinical conception of the work and algorithm identification, acquired and provided clinical data, and participated to the final approval of the version to be published and ensured that questions related to the clinical accuracy or integrity of any part of the work are appropriately investigated and resolved. SM assisted in designing the work, drafting the paper, and revising it critically. SM also provided the final approval of the version to be published and ensured that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.</p>
</sec>
<sec id="S7">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>The authors would like to thank the Management of the Local Administrative Unit of Pisa and Empoli. In particular, we are grateful to Nedo Mennuti (MD) and Antonella Tomei (MD) for having ensured data availability. Special thanks to Prof Michael N Liebman for his support in the paper design.</p>
</ack>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work is one of the results of the project Comorbidity And Risk Profiles Evaluation in Diabetes and hEart Morbidities (<italic>CARPEDIEM</italic>; code WFR RF-2011-02348236) funded by the Health Ministry of Italy within the Call for Proposals 2011&#x02013;2012.</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collins</surname> <given-names>FS</given-names></name> <name><surname>Varmus</surname> <given-names>H</given-names></name></person-group>. <article-title>A new initiative on precision medicine</article-title>. <source>N Engl J Med</source> (<year>2015</year>) <volume>372</volume>:<fpage>793</fpage>&#x02013;<lpage>5</lpage>.<pub-id pub-id-type="doi">10.1056/NEJMp1500523</pub-id><pub-id pub-id-type="pmid">25635347</pub-id></citation></ref>
<ref id="B2"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Makary</surname> <given-names>MA</given-names></name> <name><surname>Daniel</surname> <given-names>M</given-names></name></person-group>. <article-title>Medical error-the third leading cause of death in the US</article-title>. <source>BMJ</source> (<year>2016</year>) <volume>353</volume>:<fpage>i2139</fpage>.<pub-id pub-id-type="doi">10.1136/bmj.i2139</pub-id></citation></ref>
<ref id="B3"><label>3</label><citation citation-type="book"><collab>National Academies of Sciences, Engineering, and Medicine</collab>. <source>Improving Diagnosis in Health Care</source>. <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>The National Academies Press</publisher-name> (<year>2015</year>). <fpage>472</fpage> p.</citation></ref>
<ref id="B4"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrandis</surname> <given-names>M-J</given-names></name> <name><surname>Ryden</surname> <given-names>I</given-names></name> <name><surname>Lindahl</surname> <given-names>TL</given-names></name> <name><surname>Larsson</surname> <given-names>A</given-names></name></person-group>. <article-title>Ruling out cardiac failure: cost-benefit analysis of a sequential testing strategy with NT-proBNP before echocardiography</article-title>. <source>Ups J Med Sci</source> (<year>2013</year>) <volume>118</volume>(<issue>2</issue>):<fpage>75</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.3109/03009734.2012.751471</pub-id><pub-id pub-id-type="pmid">23230860</pub-id></citation></ref>
<ref id="B5"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flores-Le Roux</surname> <given-names>JA</given-names></name> <name><surname>Comin</surname> <given-names>J</given-names></name> <name><surname>Pedro-Botet</surname> <given-names>J</given-names></name> <name><surname>Benaiges</surname> <given-names>D</given-names></name> <name><surname>Puig-de Dou</surname> <given-names>J</given-names></name> <name><surname>Chillar&#x000F3;n</surname> <given-names>JJ</given-names></name> <etal/></person-group> <article-title>Seven-year mortality in heart failure patients with undiagnosed diabetes: an observational study</article-title>. <source>Cardiovasc Diabetol</source> (<year>2011</year>) <volume>10</volume>:<fpage>39</fpage>.<pub-id pub-id-type="doi">10.1186/1475-2840-10-39</pub-id><pub-id pub-id-type="pmid">21569580</pub-id></citation></ref>
<ref id="B6"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khoury</surname> <given-names>MJ</given-names></name> <name><surname>Iademarco</surname> <given-names>MF</given-names></name> <name><surname>Riley</surname> <given-names>WT</given-names></name></person-group>. <article-title>Precision public health for the era of precision medicine</article-title>. <source>Am J Prev Med</source> (<year>2016</year>) <volume>50</volume>(<issue>3</issue>):<fpage>398</fpage>&#x02013;<lpage>401</lpage>.<pub-id pub-id-type="doi">10.1016/j.amepre.2015.08.031</pub-id></citation></ref>
<ref id="B7"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jonnagaddala</surname> <given-names>J</given-names></name> <name><surname>Liaw</surname> <given-names>ST</given-names></name> <name><surname>Ray</surname> <given-names>P</given-names></name> <name><surname>Kumar</surname> <given-names>M</given-names></name> <name><surname>Dai</surname> <given-names>HJ</given-names></name> <name><surname>Hsu</surname> <given-names>CY</given-names></name></person-group>. <article-title>Identification and progression of heart disease risk factors in diabetic patients from longitudinal electronic health records</article-title>. <source>Biomed Res Int</source> (<year>2015</year>).<pub-id pub-id-type="doi">10.1155/2015/636371</pub-id></citation></ref>
<ref id="B8"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fern&#x000E0;ndez-Breis</surname> <given-names>JT</given-names></name> <name><surname>Maldonado</surname> <given-names>JA</given-names></name> <name><surname>Marcos</surname> <given-names>M</given-names></name> <etal/></person-group> <article-title>Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts</article-title>. <source>J Am Med Inform Assoc</source> (<year>2013</year>) <volume>20</volume>:<fpage>e288</fpage>&#x02013;<lpage>96</lpage>.<pub-id pub-id-type="doi">10.1136/amiajnl-2013</pub-id>-001923</citation></ref>
<ref id="B9"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pathak</surname> <given-names>J</given-names></name> <name><surname>Kho</surname> <given-names>AN</given-names></name> <name><surname>Denny</surname> <given-names>JC</given-names></name></person-group>. <article-title>Electronic health records-driven phenotyping: challenges, recent advances, and perspectives</article-title>. <source>J Am Med Inform Assoc</source> (<year>2013</year>) <volume>20</volume>(<issue>e2</issue>):<fpage>e206</fpage>&#x02013;<lpage>11</lpage>.<pub-id pub-id-type="doi">10.1136/amiajnl-2013-002428</pub-id></citation></ref>
<ref id="B10"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hripcsak</surname> <given-names>G</given-names></name> <name><surname>Albers</surname> <given-names>DJ</given-names></name></person-group>. <article-title>Correlating electronic health record concepts with healthcare process events</article-title>. <source>J Am Med Inform Assoc</source> (<year>2013</year>) <volume>20</volume>:<fpage>e311</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1136/amiajnl-2013-001922</pub-id><pub-id pub-id-type="pmid">23975625</pub-id></citation></ref>
<ref id="B11"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>EH</given-names></name></person-group>. <article-title>Chronic disease management: what will it take to improve care for chronic illness?</article-title> <source>Eff Clin Pract</source> (<year>1998</year>) <volume>1</volume>(<issue>1</issue>):<fpage>2</fpage>&#x02013;<lpage>4</lpage>.</citation></ref>
<ref id="B12"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolff</surname> <given-names>JL</given-names></name> <name><surname>Starfield</surname> <given-names>B</given-names></name> <name><surname>Anderson</surname> <given-names>G</given-names></name></person-group>. <article-title>Prevalence, expenditures, and complications of multiple chronic conditions in the elderly</article-title>. <source>Arch Intern Med</source> (<year>2002</year>) <volume>162</volume>:<fpage>2269</fpage>&#x02013;<lpage>76</lpage>.<pub-id pub-id-type="doi">10.1001/archinte.162.20.2269</pub-id><pub-id pub-id-type="pmid">12418941</pub-id></citation></ref>
<ref id="B13"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cerra</surname> <given-names>C</given-names></name> <name><surname>Lottaroli</surname> <given-names>S</given-names></name></person-group>. <article-title>Use of administrative databases for calculating the cost of chronic pathologies</article-title>. <source>Pharmacoecon Italian Res Articles</source> (<year>2004</year>) <volume>6</volume>(<issue>3</issue>):<fpage>141</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1007/BF03320632</pub-id></citation></ref>
<ref id="B14"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franchini</surname> <given-names>M</given-names></name> <name><surname>Pieroni</surname> <given-names>S</given-names></name> <name><surname>Fortunato</surname> <given-names>L</given-names></name> <name><surname>Molinaro</surname> <given-names>S</given-names></name> <name><surname>Liebman</surname> <given-names>M</given-names></name></person-group>. <article-title>Poly-pharmacy among the elderly: analyzing the co-morbidity of hypertension and diabetes</article-title>. <source>Curr Pharm Des</source> (<year>2015</year>) <volume>21</volume>(<issue>6</issue>):<fpage>791</fpage>&#x02013;<lpage>805</lpage>.<pub-id pub-id-type="doi">10.2174/1381612820666141024150901</pub-id><pub-id pub-id-type="pmid">25341855</pub-id></citation></ref>
<ref id="B15"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaeschke</surname> <given-names>R</given-names></name> <name><surname>Guyatt</surname> <given-names>GH</given-names></name> <name><surname>Sackett</surname> <given-names>DL</given-names></name></person-group>. <article-title>Users&#x02019; guides to the medical literature. III. How to use an article about a diagnostic test. B. What are the results and will they help me in caring for my patients? The Evidence-Based Medicine Working Group</article-title>. <source>JAMA</source> (<year>1994</year>) <volume>271</volume>(<issue>9</issue>):<fpage>703</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1001/jama.271.5.389</pub-id></citation></ref>
<ref id="B16"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname> <given-names>KS</given-names></name> <name><surname>Khan</surname> <given-names>SF</given-names></name> <name><surname>Nwosu</surname> <given-names>CR</given-names></name> <name><surname>Arnott</surname> <given-names>N</given-names></name> <name><surname>Chien</surname> <given-names>PF</given-names></name></person-group>. <article-title>Misleading authors&#x02019; inferences in obstetric diagnostic test literature</article-title>. <source>Am J Obstet Gynecol</source> (<year>1999</year>) <volume>181</volume>(<issue>1</issue>):<fpage>112</fpage>&#x02013;<lpage>5</lpage>.<pub-id pub-id-type="doi">10.1016/S0002-9378(99)70445-X</pub-id><pub-id pub-id-type="pmid">10411805</pub-id></citation></ref>
<ref id="B17"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deeks</surname> <given-names>JJ</given-names></name> <name><surname>Altman</surname> <given-names>DG</given-names></name></person-group>. <article-title>Diagnostic tests 4: likelihood ratios</article-title>. <source>BMJ</source> (<year>2004</year>) <volume>329</volume>(<issue>7458</issue>):<fpage>168</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1136/bmj.329.7458.168</pub-id></citation></ref>
<ref id="B18"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grimes</surname> <given-names>DA</given-names></name> <name><surname>Schulz</surname> <given-names>KF</given-names></name></person-group>. <article-title>Refining clinical diagnosis with likelihood ratios</article-title>. <source>Lancet</source> (<year>2005</year>) <volume>365</volume>(<issue>9469</issue>):<fpage>1500</fpage>&#x02013;<lpage>5</lpage>.<pub-id pub-id-type="doi">10.1016/S0140-6736(05)66422-7</pub-id><pub-id pub-id-type="pmid">15850636</pub-id></citation></ref>
<ref id="B19"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moons</surname> <given-names>KG</given-names></name> <name><surname>van Es</surname> <given-names>GA</given-names></name> <name><surname>Deckers</surname> <given-names>JW</given-names></name> <name><surname>Habbema</surname> <given-names>JD</given-names></name> <name><surname>Grobbee</surname> <given-names>DE</given-names></name></person-group>. <article-title>Limitations of sensitivity, specificity, likelihood ratio, and Bayes&#x02019; theorem in assessing diagnostic probabilities: a clinical example</article-title>. <source>Epidemiology</source> (<year>1997</year>) <volume>8</volume>(<issue>1</issue>):<fpage>12</fpage>&#x02013;<lpage>7</lpage>.<pub-id pub-id-type="doi">10.1097/00001648-199701000-00002</pub-id><pub-id pub-id-type="pmid">9116087</pub-id></citation></ref>
<ref id="B20"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Piccinni</surname> <given-names>C</given-names></name> <name><surname>Antonazzo</surname> <given-names>IC</given-names></name> <name><surname>Simonetti</surname> <given-names>M</given-names></name> <name><surname>Mennuni</surname> <given-names>MG</given-names></name> <name><surname>Parretti</surname> <given-names>D</given-names></name> <name><surname>Cricelli</surname> <given-names>C</given-names></name></person-group>. <article-title>The burden of chronic heart failure in primary care in Italy</article-title>. <source>High Blood Press Cardiovasc Prev</source> (<year>2017</year>) <volume>24</volume>(<issue>2</issue>):<fpage>171</fpage>&#x02013;<lpage>8</lpage>.<pub-id pub-id-type="doi">10.1007/s40292-017-0193-4</pub-id></citation></ref>
<ref id="B21"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cook</surname> <given-names>C</given-names></name> <name><surname>Cole</surname> <given-names>G</given-names></name> <name><surname>Asaria</surname> <given-names>P</given-names></name> <name><surname>Jabbour</surname> <given-names>R</given-names></name> <name><surname>Francis</surname> <given-names>DP</given-names></name></person-group>. <article-title>The annual global economic burden of heart failure</article-title>. <source>Int J Cardiol</source> (<year>2014</year>) <volume>171</volume>(<issue>3</issue>):<fpage>368</fpage>&#x02013;<lpage>76</lpage>.<pub-id pub-id-type="doi">10.1016/j.ijcard.2013.12.028</pub-id><pub-id pub-id-type="pmid">24398230</pub-id></citation></ref>
<ref id="B22"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunt</surname> <given-names>SA</given-names></name> <name><surname>Abraham</surname> <given-names>WT</given-names></name> <name><surname>Chin</surname> <given-names>MH</given-names></name> <name><surname>Feldman</surname> <given-names>AM</given-names></name> <name><surname>Francis</surname> <given-names>GS</given-names></name> <name><surname>Ganiats</surname> <given-names>TG</given-names></name> <etal/></person-group> <article-title>Focused update incorporated into the ACC/AHA 2005 Guidelines for the Diagnosis and Management of Heart Failure in Adults a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines developed in collaboration with the International Society for Heart and Lung Transplantation</article-title>. <source>J Am Coll Cardiol</source> (<year>2009</year>) <volume>53</volume>(<issue>15</issue>):<fpage>e1</fpage>&#x02013;<lpage>e90</lpage>.<pub-id pub-id-type="doi">10.1016/j.jacc.2008.11.013</pub-id></citation></ref>
<ref id="B23"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franceschi</surname> <given-names>C</given-names></name> <name><surname>Campisi</surname> <given-names>J</given-names></name></person-group>. <article-title>Chronic inflammation (inflammaging) and its potential contribution to age-associated diseases</article-title>. <source>J Gerontol A Biol Sci Med Sci</source> (<year>2014</year>) <volume>69</volume>(<issue>Suppl 1</issue>):<fpage>S4</fpage>&#x02013;<lpage>9</lpage>.<pub-id pub-id-type="doi">10.1093/gerona/glu057</pub-id><pub-id pub-id-type="pmid">24833586</pub-id></citation></ref>
<ref id="B24"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Franceschi</surname> <given-names>C</given-names></name></person-group>. <article-title>Inflammaging as a major characteristic of old people: can it be prevented or cured?</article-title> <source>Nutr Rev</source> (<year>2007</year>) <volume>65</volume>(<issue>12 Pt 2</issue>):<fpage>S173</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="doi">10.1111/j.1753-4887.2007.tb00358.x</pub-id><pub-id pub-id-type="pmid">18240544</pub-id></citation></ref>
<ref id="B25"><label>25</label><citation citation-type="book"><collab>National Research Council</collab>. <source>Toward Precision Medicine: Building a Knowledge Network for Biomedical Research and a New Taxonomy of Disease</source>. <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>The National Academies Press</publisher-name> (<year>2011</year>).</citation></ref>
<ref id="B26"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schultz</surname> <given-names>SE</given-names></name> <name><surname>Rothwell</surname> <given-names>DM</given-names></name> <name><surname>Chen</surname> <given-names>Z</given-names></name> <name><surname>Tu</surname> <given-names>K</given-names></name></person-group>. <article-title>Identifying cases of congestive heart failure from administrative data: a validation study using primary care patient records</article-title>. <source>Chronic Dis Inj Can</source> (<year>2013</year>) <volume>33</volume>(<issue>3</issue>):<fpage>160</fpage>&#x02013;<lpage>6</lpage>.<pub-id pub-id-type="pmid">23735455</pub-id></citation></ref>
<ref id="B27"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>A</given-names></name> <name><surname>Pang</surname> <given-names>J</given-names></name> <name><surname>Feblowitz</surname> <given-names>JC</given-names></name> <name><surname>Maloney</surname> <given-names>FL</given-names></name> <name><surname>Wilcox</surname> <given-names>AR</given-names></name> <name><surname>Ramelson</surname> <given-names>HZ</given-names></name> <etal/></person-group> <article-title>A method and knowledge base for automated inference of patient problems from structured data in an electronic medical record</article-title>. <source>J Am Med Inform Assoc</source> (<year>2011</year>) <volume>18</volume>(<issue>6</issue>):<fpage>859</fpage>&#x02013;<lpage>67</lpage>.<pub-id pub-id-type="doi">10.1136/amiajnl-2011-000121</pub-id><pub-id pub-id-type="pmid">21613643</pub-id></citation></ref>
<ref id="B28"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maggioni</surname> <given-names>AP</given-names></name> <name><surname>Anker</surname> <given-names>SD</given-names></name> <name><surname>Dahlstr&#x000F6;m</surname> <given-names>U</given-names></name> <name><surname>Filippatos</surname> <given-names>G</given-names></name> <name><surname>Ponikowski</surname> <given-names>P</given-names></name> <name><surname>Zannad</surname> <given-names>F</given-names></name> <etal/></person-group> <article-title>Are hospitalized or ambulatory patients with heart failure treated in accordance with European Society of Cardiology guidelines? Evidence from 12,440 patients of the ESC Heart Failure Long-term Registry</article-title>. <source>Eur J Heart Fail</source> (<year>2013</year>) <volume>15</volume>(<issue>10</issue>):<fpage>1173</fpage>&#x02013;<lpage>84</lpage>.<pub-id pub-id-type="doi">10.1093/eurjhf/hft134</pub-id></citation></ref>
<ref id="B29"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCormick</surname> <given-names>N</given-names></name> <name><surname>Lacaille</surname> <given-names>D</given-names></name> <name><surname>Bhole</surname> <given-names>V</given-names></name> <name><surname>Avina-Zubieta</surname> <given-names>JA</given-names></name></person-group>. <article-title>Validity of heart failure diagnoses in administrative databases: a systematic review and meta-analysis</article-title>. <source>PLoS One</source> (<year>2014</year>) <volume>9</volume>(<issue>8</issue>):<fpage>e104519</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0104519</pub-id><pub-id pub-id-type="pmid">25126761</pub-id></citation></ref>
</ref-list>
</back>
</article>
