<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">892763</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2022.892763</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Comprehensive Mapping and Dynamics of Site-Specific Prolyl-Hydroxylation, Lysyl-Hydroxylation and Lysyl O-Glycosylation of Collagens Deposited in ECM During Zebrafish Heart Regeneration</article-title>
<alt-title alt-title-type="left-running-head">Sarohi et al.</alt-title>
<alt-title alt-title-type="right-running-head">PTM Map of Collagens From Zebrafish Cardiac ECM</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Sarohi</surname>
<given-names>Vivek</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1787183/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Srivastava</surname>
<given-names>Shriya</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Basak</surname>
<given-names>Trayambak</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1087915/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>School of Biosciences and Bioengineering (BSBE)</institution>, <institution>Indian Institute of Technology (IIT)- Mandi</institution>, <addr-line>Mandi</addr-line>, <country>India</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>BioX Center</institution>, <institution>IIT-Mandi</institution>, <addr-line>Mandi</addr-line>, <country>India</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/346369/overview">Qi Wu</ext-link>, Aarhus University, Denmark</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1730848/overview">Luigi Scietti</ext-link>, European Institute of Oncology (IEO), Italy</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/224173/overview">Alexandra Naba</ext-link>, University of Illinois at Chicago, United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Trayambak Basak, <email>trayambak@iitmandi.ac.in</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Molecular Diagnostics and Therapeutics, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>06</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>9</volume>
<elocation-id>892763</elocation-id>
<history>
<date date-type="received">
<day>09</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>05</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Sarohi, Srivastava and Basak.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Sarohi, Srivastava and Basak</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Cardiac fibrosis-mediated heart failure (HF) is one of the major forms of end-stage cardiovascular diseases (CVDs). Cardiac fibrosis is an adaptive response of the myocardium upon any insult/injury. Excessive deposition of collagen molecules in the extracellular matrix (ECM) is the hallmark of fibrosis. This fibrotic response initially protects the myocardium from ventricular rupture. Although in mammals this fibrotic response progresses towards scar-tissue formation leading to HF, some fishes and urodeles have mastered the art of cardiac regeneration following injury-mediated fibrotic response. Zebrafish have a unique capability to regenerate the myocardium after post-amputation injury. Following post-amputation, the ECM of the zebrafish heart undergoes extensive remodeling and deposition of collagen. Being the most abundant protein of ECM, collagen plays important role in the assembly and cell-matrix interactions. However, the mechanism of ECM remodeling is not well understood. Collagen molecules undergo heavy post-translational modifications (PTMs) mainly hydroxylation of proline, lysine, and glycosylation of lysine during biosynthesis. The critical roles of these PTMs are emerging in several diseases, embryonic development, cell behavior regulation, and cell-matrix interactions. The site-specific identification of these collagen PTMs in zebrafish heart ECM is not known. As these highly modified peptides are not amenable to mass spectrometry (MS), the site-specific identification of these collagen PTMs is challenging. Here, we have implemented our in-house proteomics analytical pipeline to analyze two ECM proteomics datasets (PXD011627, PXD010092) of the zebrafish heart during regeneration (post-amputation). We report the first comprehensive site-specific collagen PTM map of zebrafish heart ECM. We have identified a total of 36 collagen chains (19 are reported for the first time here) harboring a total of 95 prolyl-3-hydroxylation, 108 hydroxylysine, 29 galactosyl-hydroxylysine, and 128 glucosylgalactosyl-hydroxylysine sites. Furthermore, we comprehensively map the three chains (COL1A1a, COL1A1b, and COL1A2) of collagen I, the most abundant protein in zebrafish heart ECM. We achieved more than 95% sequence coverage for all the three chains of collagen I. Our analysis also revealed the dynamics of prolyl-3-hydroxylation occupancy oscillations during heart regeneration at these sites. Moreover, quantitative site-specific analysis of lysine-O-glycosylation microheterogeneity during heart regeneration revealed a significant (<italic>p</italic> &#x3c; 0.05) elevation of site-specific (K<sup>1017</sup>) glucosylgalactosyl-hydroxylysine on the col1a1a chain. Taken together, these site-specific PTM maps and the dynamic changes of site-specific collagen PTMs in ECM during heart regeneration will open up new avenues to decode ECM remodeling and may lay the foundation to tinker the cardiac regeneration process with new approaches.</p>
</abstract>
<kwd-group>
<kwd>collagens</kwd>
<kwd>post-translational modifications</kwd>
<kwd>O-glycosylation</kwd>
<kwd>mass-spectrometry</kwd>
<kwd>microheterogeneity</kwd>
<kwd>extra-cellular matrix</kwd>
<kwd>cardiac regeneration</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Cardiac fibrosis leading to heart failure has remained one of the major causes of death worldwide (<xref ref-type="bibr" rid="B15">Gonz&#xe1;lez et al., 2018</xref>). Extracellular matrix (ECM) remodeling in the cardiac tissues is one of the main mechanisms of fibrosis development to an injury/insult in the myocardium (<xref ref-type="bibr" rid="B23">Li et al., 2018</xref>). Excessive deposition of collagen molecules in the ECM during cardiac fibrosis has remained the hallmark of fibrosis (<xref ref-type="bibr" rid="B6">Cowling et al., 2019</xref>). Initially, this fibrotic response helps in preventing the rupture of the ventricular wall of the heart (<xref ref-type="bibr" rid="B6">Cowling et al., 2019</xref>). However, in the mammalian hearts, this persistent fibrotic response progresses towards scar tissue formation, tissue-stiffening and ultimately resulting in heart failure (<xref ref-type="bibr" rid="B15">Gonz&#xe1;lez et al., 2018</xref>). Contrastingly, in nature, some non-mammalian vertebrates such as zebrafish and urodele amphibians (<xref ref-type="bibr" rid="B7">Cutie and Huang, 2021</xref>) have mastered a unique exercise that allows their cardiomyocytes to undergo restricted dedifferentiation achieving cardiac regeneration. This exceptional property of adult zebrafish permits heart regeneration in case of a severe tissue insult or experimental amputation (<xref ref-type="bibr" rid="B7">Cutie and Huang, 2021</xref>). This unique ability makes Zebrafish an appropriate model system to study complex cellular and molecular processes underlying cardiac regeneration (<xref ref-type="bibr" rid="B3">Bootorabi et al., 2017</xref>). Under chronic/acute injury conditions, excess collagens and other proteins get deposited in the myocardium ECM inducing fibrosis (<xref ref-type="bibr" rid="B49">Travers et al., 2016</xref>) and eventually causing systolic and diastolic dysfunction. In the case of zebrafish, the fibrosis regression that has been documented as a transient phase occurs through fibroblast inactivation (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). The crucial understanding of participation and contribution of zebrafish heart ECM remains majorly unexplored. The ECM has recently gained attention to probe a better insight into the mechanistic changes during heart regeneration in zebrafish. In fact, the zebrafish heart ECM has been shown to have the potency to induce mammalian heart regeneration (<xref ref-type="bibr" rid="B5">Chen et al., 2016</xref>). Although the role of a few ECM components such as hyaluronic acid has been reported to play a major role in the epithelial-to-mesenchymal transition during regeneration (<xref ref-type="bibr" rid="B27">Missinato et al., 2015</xref>), the roles of major components of ECM such as collagens, glycoproteins, and proteoglycans are not well understood. Collagens and other cardiac ECM proteins are not just responsible for maintaining the architecture of the tissue but also play a vital role in mechanical and biochemical interactions that determine different cell behavior and functions (<xref ref-type="bibr" rid="B11">Frangogiannis, 2019</xref>).</p>
<p>In the ECM, collagens are the most abundant protein constituent (<xref ref-type="bibr" rid="B8">Di Lullo et al., 2002</xref>). Nauroy <italic>et al</italic>. have predicted 58 collagen genes contributing towards the zebrafish matrisome (<xref ref-type="bibr" rid="B32">Nauroy et al., 2018</xref>). Collagens are responsible for maintaining the structure and functions of cardiac tissues. These are also crucial in wound healing process (<xref ref-type="bibr" rid="B53">Willyard, 2018</xref>). Recently, ablation of COL1A2 in the zebrafish heart has been shown to impair cardiomyocyte regeneration (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). Other collagen chains such as COL7A11 and COL8A2 present in zebrafish heart ECM have been found upregulated during heart regeneration (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). Collagens are heavily post-translationally modified triple-helical molecules that form the interstitial fibers of the myocardium. Collagen post-translational modifications (PTMs) play a significant role in the function and stability of the triple helix. The proline residues are the most abundant amino acids found in collagen chains (<xref ref-type="bibr" rid="B31">Myllyharju, 2005</xref>). Prolines present in collagen chains are 4-hydroxylated (4-HyP) generally in the &#x201c;Y&#x201d; position of the &#x201c;Gly-Xaa-Yaa&#x201d; motif, providing the structural mobility for triple helix formation. In addition to this, there are proline residues also present in the &#x201c;X&#x201d; position of the &#x201c;Gly-Xaa-HyP&#x201d; motif that are commonly 3-hydroxylated (3-HyP) in nature. However, the occurrence of 3-HyP is rarer compared to 4-HyP (<xref ref-type="bibr" rid="B51">Weis et al., 2010</xref>). These prolyl hydroxylations are catalyzed by a specific class of prolyl hydroxylases (<xref ref-type="bibr" rid="B18">Hudson and Eyre, 2013</xref>; <xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>; <xref ref-type="bibr" rid="B39">Salo and Myllyharju, 2021</xref>). Recently, it has been documented that genetic deletion of prolyl-3-hydroxylase 2 (P3H2) in mice evidenced that abolishment of prolyl-3-hydroxylations in basement membrane collagen could be involved in regulating platelet aggregation and development of eye tissues (<xref ref-type="bibr" rid="B36">Pokidysheva et al., 2014</xref>; <xref ref-type="bibr" rid="B19">Hudson et al., 2015</xref>). Osteogenesis Imperfecta is one of the reported diseases resulting from a lack of 3-hydroxyproline in COL1A1 (P<sup>1164</sup>) (<xref ref-type="bibr" rid="B30">Morello et al., 2006</xref>; <xref ref-type="bibr" rid="B4">Cabral et al., 2007</xref>).</p>
<p>Apart from proline hydroxylation, lysine hydroxylation, and glycosylation are also critically important for the proper functioning of collagens. In collagen chains, lysine residues occurring within the Gly-Xaa-Lys motif are likely to be hydroxylated by different family members of the lysyl hydrolase; lysyl hydroxylase 1 (LH1), LH2, and LH3. Once hydroxylated, these hydroxylysines (HyK) can further be O-linked glycosylated by the addition of monosaccharides and/or disaccharides by specific glycosyltransferases (<xref ref-type="bibr" rid="B21">Kivirikko et al., 1973</xref>; <xref ref-type="bibr" rid="B37">Risteli and Kivirikko, 1974</xref>; <xref ref-type="bibr" rid="B41">Schegg et al., 2009</xref>; <xref ref-type="bibr" rid="B42">Scietti et al., 2018</xref>). Lysine (found in the Gly-Xaa-Lys motif) could be present in four different forms in collagen molecules deposited in the ECM. Unmodified lysine (K), hydroxylysine (HyK), galactosyl-hydroxylysine (G-HyK), and glucosyl-galactosyl-hydroxylysine (GG-HyK) occurring on a single site of the collagen chain. This site-specific variation of lysine residues is termed lysine microheterogeneity. Altered levels of lysine microheterogeneity in collagens lead to dysfunction in the tissues. Musculoskeletal defects (<xref ref-type="bibr" rid="B13">Geister et al., 2019</xref>), connective tissue disorder (<xref ref-type="bibr" rid="B38">Salo et al., 2008</xref>), and cerebral small vessel disease (<xref ref-type="bibr" rid="B28">Miyatake et al., 2018</xref>) are reported to have altered hydroxylysine and lysyl O-glycosylation levels in collagens. These studies indicate the significant role of collagen PTMs in maintaining ECM and tissue homeostasis. ECM homeostasis is remodeled upon injury to the myocardium and thereby ECM remodeling holds the key to the regeneration of amputated zebrafish hearts. Collagens are the most abundant components of ECM and play a significant role in the regeneration of zebrafish hearts. Various functions of collagen are dependent on site-specific PTMs (<xref ref-type="bibr" rid="B44">Sipil&#xe4; et al., 2007</xref>; <xref ref-type="bibr" rid="B20">J&#xfc;rgensen et al., 2011</xref>; <xref ref-type="bibr" rid="B45">Stawikowski et al., 2014</xref>). More importantly, these site-specific collagen PTMs could be tissue-specific which demands rigorous exploration in this field.</p>
<p>In the context of zebrafish cardiac tissue regeneration, the dynamics of the site-specific collagen PTM network are still unexplored. Therefore, to comprehensively map the site-specific collagen PTMs constituting the myocardium ECM and to understand the dynamics of collagen PTMs during the regeneration process, we found an appropriate proteomics study performed by <xref ref-type="bibr" rid="B12">Garcia-Puig et al. (2019</xref>) on zebrafish heart regeneration. To bridge this gap we downloaded this publicly available data set (PXD011627 and PXD010092) and generated a comprehensive map of site-specific collagen PTMs in adult zebrafish hearts. Furthermore, we quantitated the occupancy levels of collagen PTMs during the regeneration process. Understanding the dynamics of site-specific collagen PTMs in cardiac extracellular matrix during heart regeneration will open up new avenues to decode ECM remodeling and may lay the foundation to tinker the cardiac regeneration process with new approaches.</p>
</sec>
<sec id="s2">
<title>2 Methods</title>
<sec id="s2-1">
<title>2.1 Mass Spectrometry Data Source</title>
<p>In this study, two publicly available datasets were utilized. The first dataset with identifier &#x201c;PXD011627&#x201d; and the second dataset with identifier &#x201c;PXD010092&#x201d; were both submitted by <xref ref-type="bibr" rid="B12">Garcia-Puig et al. (2019</xref>) in ProteomeXchange. In this study, Garcia-Puig <italic>et al</italic>. established the zebrafish myocardium ECM enrichment protocol by decellularizing zebrafish ventricular tissue samples (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>). Moreover, this study assessed the dynamic changes in the ECM proteome of regenerating zebrafish hearts using an amputation model. The samples of regenerating heart ventricles were taken at 7-, 14- and 30-days post-amputation (DPA). Ventricular decellularization was done using SDS and Triton-X. The sham model was used as a control in their study.</p>
<p>In their study, MS data acquisition was done differently for the two datasets. Easy-nLC 1,000 (Thermo) was used for liquid chromatography separation. Nanoflex (Thermo) was used as an ESI source and LC-MS/MS was done on Q-exactive HF mass spectrometer (Thermo) with HCD fragmentation and the data-dependent acquisition opted for dataset &#x201c;PXD011627&#x201d;. For dataset &#x201c;PXD010092&#x201d; a nano HPLC system (Proxeon) coupled with Maxis Impact (Bruker) Q-TOF mass spectrometer was used and each sample of regenerating and sham model heart ventricle was analyzed using LC-MS in duplicates. Furthermore, healthy human heart sample (ECM) mass-spectrometry data from Barallobre-Barreiro <italic>et al</italic>. (PXD028908) (<xref ref-type="bibr" rid="B1">Barallobre-Barreiro et al., 2021</xref>), and mice heart sample (ECM) mass-spectrometry data from Padmanabhan <italic>et al</italic> (PXD002488) (<xref ref-type="bibr" rid="B33">Padmanabhan Iyer et al., 2016</xref>) was reanalyzed for the site-specific collagen PTM identification.</p>
</sec>
<sec id="s2-2">
<title>2.2 Raw Data Description</title>
<p>The publicly available datasets &#x201c;PXD011627&#x201d; and &#x201c;PXD010092&#x201d; were downloaded from ProteomeXchange. The complete &#x2a;.d folders for &#x201c;PXD010092&#x201d; were shared by Prof. Angel Raya&#x2019;s research group. Dataset &#x201c;PXD011627&#x201d; contains 9 (.raw) files and Dataset &#x201c;PXD010092&#x201d; contains 8 (.d) files, corresponding to two files (biological replicate) each for sham, 7-, 14- and 30-days post-amputation samples. Dataset &#x201c;PXD010092&#x201d; was not compatible with search engines because (.d) files are not readily accepted by many search engines. To overcome this, (.d) files were converted into. mgf and. mzML format using MSConvert with turbocharger filter. These converted files were further used as inputs by search engines MyriMatch (<xref ref-type="bibr" rid="B46">Tabb et al., 2007</xref>) and MSFragger (<xref ref-type="bibr" rid="B22">Kong et al., 2017</xref>), respectively. We also downloaded 60 (.raw) files from Barallobre-Barreiro <italic>et al</italic> (PX028908) and 6 (.raw) files from Padmanabhan <italic>et al</italic>. (PXD002488).</p>
</sec>
<sec id="s2-3">
<title>2.3 Database Search for the Identification of Collagen Chains and PTMs (Hydroxylation of Proline and Lysine, O-Glycosylation of Lysines) Using MyriMatch</title>
<p>Two different search engines were employed to reanalyze these datasets to identify collagen PTMs from the zebrafish myocardium ECM. All the database searches were performed on the high-performance computing (HPC) facility at the Indian Institute of Technology (IIT)- Mandi. First, a general database search was performed on raw MS data (&#x2a;.raw files) of dataset &#x201c;PXD011627&#x201d; with a complete <italic>Danio rerio</italic> uniprot database having 62,593 entries, downloaded on 20 October 2020. For the PXD028908 dataset, a general database search was performed on raw MS data (&#x2a;.raw files) with a complete <italic>Homo sapiens</italic> uniprot database having 20,396 entries, downloaded on 16 February 2021. For the PXD002488 dataset, a general database search was performed on raw MS data (&#x2a;.raw files) with a complete <italic>Mus musculus</italic> uniprot database having 17,090 entries, downloaded on 04 December 2021. Precursor ion tolerance was set at 10&#xa0;ppm and fragment ion m/z tolerance was allowed up to 20&#xa0;ppm for zebrafish and human data. However, for the mice dataset, a fragment ion tolerance of 50&#xa0;ppm was used. Carbamidomethylation (&#x2b;57.0236) on cysteine was used as static modification and methionine oxidation (&#x2b;15.9949) and hydroxyproline (&#x2b;15.9949) were used as dynamic modifications with up to 4 maximum dynamic modifications per peptide. A maximum of 2 missed cleavages were allowed for fully tryptic digestion. MyriMatch searches were performed to generate &#x2a;.pepXML files for respective datasets. These &#x2a;.pepXML files were further grouped for PSM matches, peptide, and protein group identification using IDPicker with &#x3c;1% FDR. After a general database search, the identified list of ECM proteins from zebrafish and humans was exported from IDPicker as a &#x2a;.FASTA database. This &#x2a;.FASTA database (separately for zebrafish, humans, and mice) was used as a subset database to perform an in-depth collagen PTM search. In the subset database search, up to 4 missed cleavages for fully tryptic digestion were allowed with a similar precursor (10&#xa0;ppm) and fragment ion tolerance (20&#xa0;ppm) tolerance. For mice MS data, a fragment ion tolerance of 50&#xa0;ppm was used. As mentioned previously, (<xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>; <xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>), a &#x201c;Gly-Xaa-Yaa&#x201d; based motif-specific PTM search strategy was employed using the MyriMatch motif-specific module for the identification of collagen PTMs. Carbamidomethylation on cysteine was used as static modification and methionine oxidation, hydroxyproline (P! &#x2b;15.9949), hydroxylysine (GXK! &#x2b;15.994,916), galactosyl-hydroxylysine (GXK! 178.047738) and glucosylgalactosyl-hydroxylysine (GXK! 340.100,562) were used as dynamic modifications with up to 10 maximum dynamic modifications per peptide. IDPicker was used for grouping the &#x2a;.pepXML output file by controlling FDR at a 1% level for PSMs, peptides, and proteins. 3-hydroxyproline modifications were only considered in case a proline residue was found to be hydroxylated at the &#x201c;X&#x201d; position of a &#x201c;G-Xaa-HyP&#x201d; motif in the collagen chains. Furthermore, pLabel was used for manual inspection, analysis, and validation of subset database search PSMs for assigning a specific collagen PTM containing peptide.</p>
</sec>
<sec id="s2-4">
<title>2.4 Database Search for the Identification of Collagen Chains and PTMs (Hydroxylation of Proline and Lysine, O-Glycosylation of Hydroxylysine Sites) Using MSFragger&#x2a;</title>
<p>mzML files were used for MSFragger mediated database search. In MSFragger based pipeline, the decoy database was generated by a philosopher from the same uniprot database (<italic>Danio rerio</italic>, 62,593 entries, downloaded on 20 October 2020) as in the case of the MyriMatch general search. Precursor tolerance and fragment ion tolerance were kept at 50 and 25&#xa0;ppm, respectively. PSM, peptide, and protein level FDR were kept at &#x3c; 1%, with a maximum of three missed cleavages of fully tryptic peptides. Carbamidomethylation (&#x2b;57.0236) on cysteine was used as static modification and methionine oxidation (&#x2b;15.9949), hydroxyproline (&#x2b;15.9949), N-terminal acetylation (&#x2b;42.010565), hydroxylysine (&#x2b;15.994,916), galactosyl-hydroxylysine (&#x2b;178.047738) and glucosylgalactosyl-hydroxylysine (&#x2b;340.100,562) were used as dynamic modifications with up to 5 maximum dynamic modification per peptide. IDPicker was used for grouping the &#x2a;.pepXML output file generated from MSFragger searches by controlling FDR at &#x3c;1% level for PSMs, peptides, and proteins. Similar to MyriMatch-based workflow, pLabel was used for manual inspection and validation for assigning a specific PSM to identify site-specific collagen PTM containing peptides.</p>
</sec>
<sec id="s2-5">
<title>2.5 Relative Abundances of Collagen Chains in the Heart ECM During Regeneration</title>
<p>Spectral counts (referred to the total number of peptides detected at &#x223c;1% FDR per protein group) from the results of the MyriMatch database search were exported from IDPicker. These spectral counts were normalized with the total spectral counts of each raw MS file ID. The normalized spectral counts corresponding to unique protein groups were further used to quantitate the relative abundance of collagens in wild-type and regenerating zebrafish hearts. Additionally, the same normalized spectral counts data for collagens from MyriMatch database search results were used to generate a heatmap of relative collagen expression during regeneration. Collagen chains identified with &#x2265;3 normalized spectral counts were only considered for the heatmap representation.</p>
</sec>
<sec id="s2-6">
<title>2.6 Quantitation of Occupancy Level of Collagen PTM Sites Using Skyline</title>
<p>Database search results (&#x2a;.pepXML files) generated by either MyriMatch and/or MSFragger were parsed through PeptideProphet (TPP pipeline module) for importing the probability scores (0-1) (<xref ref-type="bibr" rid="B34">Pedrioli, 2010</xref>). After parsing the &#x2a;.pepXML files through PeptideProphet, these &#x2a;.pepXML files were utilized to build a spectral library (.blib) using open-source Skyline (<xref ref-type="bibr" rid="B25">MacLean et al., 2010</xref>). A spectral library was used in Skyline for extracting the raw abundance intensity of unmodified and different forms of modified collagen peptides. Occupancy calculation of prolyl hydroxylation sites and microheterogeneity (unmodified, hydroxylated, O-glycosylation of lysines) of lysine sites were performed using MS<sup>1</sup> based targeted extraction pipeline as described previously (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>).</p>
</sec>
<sec id="s2-7">
<title>2.7 Statistical Analysis</title>
<p>GraphPad prism was used for statistical analysis for calculating the dynamics of different collagen site-specific PTM level quantitation. ANOVA was performed and a <italic>p</italic> value &#x3c; 0.05 was considered statistically significant.</p>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>3 Results and Discussion</title>
<p>Myocardium extracellular matrix (ECM) consisting of collagens as a major component, is sought to be one of the key molecules involved in the remodeling process during zebrafish heart regeneration (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). Collagen and its types are typically triple-helical, large protein molecules secreted in the extracellular space and get assembled to form fibrils in the interstitium of the tissue responsible for structural support, cell-ECM interactions, and cell behavior. (<xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>).</p>
<sec id="s3-1">
<title>3.1 Identification of Collagen Chains, and Their Abundances in Wild Type Zebrafish Heart ECM-</title>
<p>To identify and quantitate the abundance of collagen chains present in wild-type zebrafish heart ECM, a dual database search engine-based (MyriMatch and MSFragger) comprehensive strategy was implemented. MyriMatch (<xref ref-type="bibr" rid="B46">Tabb et al., 2007</xref>) and MSFragger (<xref ref-type="bibr" rid="B22">Kong et al., 2017</xref>) were utilized in this study for in-depth identification of collagen chains present in the myocardium ECM of zebrafish from the publicly available datasets (PXD011627, PXD010092) as mentioned earlier. This analytical pipeline identified a total of 36 collagen chains present in the ECM of the zebrafish heart (<xref ref-type="fig" rid="F1">Figure 1A</xref>; <xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). Out of these 36 collagen chains, 17 are commonly identified and 19 collagen chains are newly identified in the ECM of zebrafish heart (<xref ref-type="fig" rid="F1">Figure 1A</xref>, <xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). Previously, Garcia-Puig et al. identified a total of 21 collagen chains from the enriched ECM of zebrafish hearts using this dataset (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>). Since collagens are heavily modified with prolyl hydroxylation, a standard proteomic data analysis pipeline will not be able to identify hydroxylated peptides of collagen chains present in the raw data (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>). However, in our analytical pipeline, we included prolyl-hydroxylation as a dynamic modification in the database search yielding a higher number of collagen chain identification. Database searches yielded almost 19,456 and 18,274 total peptide ids from MyriMatch and MSFragger respectively. As expected, almost 61 and 40% of these identified peptides contained prolyl-hydroxylation (<xref ref-type="fig" rid="F1">Figure 1B</xref>, <xref ref-type="sec" rid="s10">Supplementary Table S2</xref>). Thus, it substantiates the use of prolyl-hydroxylation as an important &#x201c;dynamic modification&#x201d; in order to explore ECM mass-spectrometry data to increase the identification of collagen chains along with their increased sequence coverage. Normalized spectral counts were used to calculate the abundance of the top 10 collagen chains present in the ECM of wild-type zebrafish hearts (<xref ref-type="fig" rid="F1">Figure 1C</xref>). Both MyriMatch and MSFragger unambiguously yielded COL1A1a, COL1A1b, and COL1A2 chains of collagen 1 to be the highest abundant collagen chains present in the cardiac ECM of zebrafish. These three chains form the triple-helical collagen 1 protomers (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>), the most abundant protein present in the extracellular matrix of wild-type zebrafish hearts. Apart from Collagen 1, COL5A1, COL5A2a, COL6A3, COL4A1, COL6A1, and COL4A2 chains were found to be in the 10 most abundant collagen chains present in the ECM of WT zebrafish heart. These abundance plots yielded from two different search engines were in agreement highlighting the robustness of the analysis.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Identification of collagen chains and their relative abundances in zebrafish heart ECM: <bold>(A)</bold> Total number of collagen chains (<xref ref-type="bibr" rid="B4">Cabral et al., 2007</xref>) identified previously by Garcia-Puig et al. compared to (<xref ref-type="bibr" rid="B33">Padmanabhan Iyer et al., 2016</xref>) number of collagen chains identified in our analysis. <bold>(B)</bold> depicts inclusion of hydroxyproline (HyP) modification in the database search by MyriMatch and MSFragger resulting in identification of almost 61.12 and 40.46% (summed number from all the raw &#x2a;.pepXML files used for database search) new unique peptides. This strategy yielded more no. of peptide identification resulting in a higher number of collagen chain identification from the same dataset. <bold>(C)</bold> Top 10 abundant collagen chains deposited in the zebrafish heart ECM were identified by two different search engines MyriMatch and MSFragger respectively. <bold>(D)</bold> Relative abundances of different collagen chains during zebrafish heart regeneration are shown by the heatmap. Light yellow represents the lower value (&#x2212;2) and dark red represents the higher value (&#x2b;2) in the row. Normalized spectral count values have been used to generate the heatmap (considering&#xa0;&#x2265;&#xa0;3 spectral counts per chain). collagen chains marked with red (.) dots are quantitated during regeneration in re-analysis for the first time (DPA &#x3d; day post amputation).</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g001.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>3.2 Relative Abundances of Collagen Chains in the Heart ECM During Regeneration&#x2013;</title>
<p>The cardiac regeneration process involves ECM remodeling. Being the most abundant component of ECM (<xref ref-type="bibr" rid="B8">Di Lullo et al., 2002</xref>), levels of collagens are also altered during regeneration (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>) (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). As expected, normalized spectral counts of both the search engines show that Collagen 1 (COL1A1a, COL1A1b, and COL1A2) is most abundant in zebrafish heart ECM during 7, 14, and 30 days post-amputation (<xref ref-type="sec" rid="s10">Supplementary Figure S1</xref>). Here, we specifically focused to showcase the relative changes of different collagen chains present in the ECM during regeneration (<xref ref-type="fig" rid="F1">Figure 1D</xref>, <xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). Out of the total 36 collagen chains identified, relative abundances were calculated for 24 collagen chains summarized in the heatmap presented in <xref ref-type="fig" rid="F1">Figure 1D</xref> (raw data for individual chains are provided in <xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). Interestingly, the level of total collagen 1 (summed abundances of all three chains of the triple-helix) decreased (to 0.68 fold, 32% decrease in the level, not significant) at 7 days post-amputation, was slightly increased at 14 days post-amputation (0.86 fold compared to control, 14% decrease in level compared to sham control model) and it also further increased (0.93 fold compared to sham model (control), 6.2% decrease in level compared to sham model) at 30 days post-amputation. Although these changes were not found to be statistically significant, the expression trend corroborated the previous analysis (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>). A similar expression was found for COL5A1 and COL5A2a during the regeneration process. Levels of these 2 collagen chains were first decreased at 7&#xa0;days post-amputation and then increased at 14 and 30 days post-amputation (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>). The expression of COL4A2, COL5A1, and COL5A2a in our analysis is also similar to the analysis done by <xref ref-type="bibr" rid="B12">Garcia-Puig et al. (2019)</xref> in which they found a decrease in the level of COL4A2, COL5A1, and COL5A2a on 7&#xa0;days post-amputation indicating a feedback post-transcriptional regulation. Furthermore in this analysis, we quantitated the relative abundances of 8 new collagen (COL6A4, COL11A1, COL2A1b, COL28A2a, COL4A5, COL5A3b, COL2A1a, COL1A1) (<xref ref-type="fig" rid="F1">Figure 1D</xref>, marked with red dots) chains present in the myocardium ECM during heart regeneration. The oscillatory dynamics of abundances of these new collagen chains are depicted in the heatmap (<xref ref-type="fig" rid="F1">Figure 1D</xref>, <xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). These expressions of collagen chains highlight the role of collagen in the extracellular remodeling during regeneration.</p>
</sec>
<sec id="s3-3">
<title>3.3 Global Characterization and Identification of Novel Site-specific Collagen Post-translational Modifications (PTMs) in Zebrafish Myocardium ECM</title>
<p>Collagen PTMs such as O-glycosylation (on hydroxylysine residues) and hydroxylation (on lysine and proline residues) have been shown to be crucial for embryonic development, assembly of ECM fibrils, and cell-matrix interactions (<xref ref-type="bibr" rid="B36">Pokidysheva et al., 2014</xref>; <xref ref-type="bibr" rid="B19">Hudson et al., 2015</xref>). Collagen molecules get heavily post-translationally modified during the biosynthesis of new chains in the endoplasmic reticulum (<xref ref-type="bibr" rid="B17">Hennet, 2019</xref>). Recently, the use of a hydroxyproline-based search strategy using mass-spectrometry data yielded increased coverage for collagen chains (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>; <xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>; <xref ref-type="bibr" rid="B43">Shao et al., 2020</xref>). However, it is challenging to identify the site-specific O-glycosylation sites present in the big collagen chains (<xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>). The role of collagen PTMs during tissue regeneration is completely unexplored. Classical amino acid analysis-based approaches could throw insights regarding the composition (including modified amino acids) of these collagen triple helices. However, site-specific identification of these collagen PTMs (hydroxylation, O-glycosylation)&#x2013;can only be achieved by high-resolution mass spectrometry. So, this study primarily focused on the identification of site-specific collagen PTMs present in the ECM of the zebrafish heart. Furthermore, the dynamics of these PTMs in collagen I (most abundant collagen in heart ECM, a heteromeric trimer constituting three different chains COL1A1a, COL1A1b, and COL1A2) is being showcased during the regeneration of zebrafish heart. A total of 36 collagen chains were identified from the zebrafish heart ECM in this analysis as mentioned previously. Identification of collagen PTMs from a global MS analysis, without biochemical purification, has remained challenging. We had previously developed a MyriMatch-based workflow to identify site-specific collagen PTMs from crude ECM proteome preparations using high-resolution mass spectrometry (<xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>). Here, our <italic>inhouse</italic> collagen PTM identification pipeline (<xref ref-type="fig" rid="F2">Figure 2</xref>) with a dual search engine-based strategy maximized the identification of prolyl-hydroxylation, lysyl-hydroxylation, and O-glycosylation sites present in 23 collagen chains from zebrafish heart ECM (<xref ref-type="table" rid="T1">Table 1</xref>). We detected a total of 95 3-hydroxyproline, 108 hydroxylysine, 29 galactosyl-hydroxylysine, and 128 glucosylgalactosyl-hydroxylysine sites in 23 collagen chains (<xref ref-type="table" rid="T1">Table 1</xref>) present in the zebrafish heart ECM. This is the first catalog of site-specific collagen PTMs identified in the ECM of the zebrafish heart. Notably, site-specific collagen PTMs were identified in the highest abundant COL1A1a, along with COL11A1a and COL22A1 spanning across three orders of magnitude (log scale) (see <xref ref-type="table" rid="T1">Table 1</xref>). This highlights the sensitivity and the depth of dynamic range of our analytical pipeline to explore the PTMs of collagens present in the ECM of zebrafish heart. Collagen I seemed to have a higher number of prolyl-3-hydroxylations and lysyl-hydroxylation sites. Collagen 5 is a minor constituent of fibril assembly (<xref ref-type="bibr" rid="B52">Wenstrup et al., 2004</xref>). However, this analysis revealed 12 (P<sup>686</sup>, P<sup>746</sup>, P<sup>824</sup>, P<sup>1112</sup>, P<sup>1115</sup>, P<sup>1163</sup>, P<sup>1166</sup>, P<sup>1190</sup>, P<sup>1193</sup>, P<sup>1253</sup>, P<sup>1421</sup>, P<sup>1424</sup>) and nine sites (P<sup>241</sup>, P<sup>277</sup>, P<sup>532</sup>, P<sup>661</sup>, P<sup>868</sup>, P<sup>901</sup>, P<sup>937</sup>, P<sup>967</sup>, P<sup>1105</sup>) of 3-HyP in COL5A1 and COL5A2a, respectively. COL5A2a was then further mapped with 11 sites (K<sup>329</sup>, K<sup>338</sup>, K<sup>500</sup>, K<sup>557</sup>, K<sup>572</sup>, K<sup>761</sup>, K<sup>761</sup>, K<sup>794</sup>, K<sup>638</sup>, K<sup>803</sup>, K<sup>887</sup>) of glucosyl-galactosyl hydroxylysines. Only one galactosyl-hydroxylysine site (K<sup>1344</sup>) was identified in COL5A1. As previously shown in the zebrafish genome (from fin tissue) analysis (<xref ref-type="bibr" rid="B9">Duran et al., 2015</xref>), no type III collagen chain is detected in our analysis from the zebrafish cardiac ECM. The absence of type III collagen is a unique feature of zebrafish heart ECM and warrants further scope to investigate the role of collagen III during regeneration. Four different chains of collagen 6 (COL6A1, COL6A2, COL6A3, COL6A4a) were found to harbor O-glycosylation but only one prolyl-3-hydroxylation site (3-HyP<sup>470</sup> in COL6A2) (<xref ref-type="table" rid="T1">Table 1</xref>). Although basement membrane collagen IV is not a major constituent of heart ECM; in the case of collagen IV alpha 1 only seven sites (K<sup>463</sup>, K<sup>466</sup>, K<sup>882</sup>, K<sup>909</sup>, K<sup>1149</sup>, K<sup>1179</sup>, K<sup>1182</sup>) of glucosyl-galactosyl-hydroxylation were identified. Surprisingly, 24 glucosyl-galactosyl-hydroxylation sites were identified in COL4A2. A total of 4 (P<sup>201</sup>, P<sup>204</sup>, P<sup>294</sup>, P<sup>297</sup>, p<sup>1338</sup>) and 3 (P<sup>339</sup>, P<sup>555</sup>, P<sup>613</sup>) prolyl-3-hydroxylation sites were identified in COL4A1 and COL4A2, respectively. It appeared that COL4A2 is way more glycosylated than COL4A1 in the zebrafish cardiac ECM compared to mice (<xref ref-type="bibr" rid="B2">Basak et al., 2016</xref>). Interestingly, we have also identified the COL4A5 chain with a common hydroxylysine site harboring microheterogeneity of galactosyl-hydroxylysine and glucosyl-galactosyl-hydroxylysine (K<sup>225</sup>). Additionally, many other sites of prolyl-3-hydroxylations, lysyl-hydroxylation and O-glycosylation sites of lysine have been identified in various other collagen types (COL11A1b, COL7A1, COL22A1, etc.; see <xref ref-type="table" rid="T1">Table 1</xref>) highlighting the advantageous nature of these analyses increasing the depth of site-specific PTM identification coverage in the ECM. Taken together, these results showcased the comprehensive mapping and site-specific identification of many collagens present in the zebrafish heart ECM.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Optimized dual database search engine-based MS analysis pipeline for the global identification and quantitation of site-specific collagen PTMs from zebrafish heart ECM. Thermo. raw or Bruker. d MS/MS files were initially converted to. mgf and. mzML files (by MSConvert) respectively and searched with MyriMatch and MSFragger to identify the collagen present in the zebrafish heart ECM. For MyriMatch, the subset of identified proteins was used as a second database to perform a PTM module enabled search defining specific sequence motifs for the site-specific identification of collagen PTMs in zebrafish heart ECM. For MSFragger, the PTM searches were conducted directly with the entire zebrafish database. From MyriMatch and MSFragger &#x2a;.pep.XML output files containing each peptide spectrum match (PSM) were further parsed by PeptideProphet to compute the probability score (0,1). The &#x2a;.pep.XML output file parsed by PeptideProphet was further imported into Skyline along with all the raw MS/MS files in to generate the spectral library (.blib). This spectral library (.blib) in Skyline was used for the targeted MS<sup>1</sup>-based extraction of all the PTM modified and unmodified collagen peptide species for each specific site. The area of MS<sup>1</sup> area for each peptide for different samples was computed from Skyline.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g002.tif"/>
</fig>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Mass-Spectrometry-based identification of site-specific hydroxylation and glycosylation of different collagens present in the zebrafish cardiac ECM.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Collagen chains</th>
<th align="center">3-Hydroxyproline Sites (95)</th>
<th align="center">Hydroxylysine Sites (108)</th>
<th align="center">Galactosyl-Hydroxylysine Sites (29)</th>
<th align="center">Glucosylgalactosyl-Hydroxylysine Sites (128)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">COL1A1a</td>
<td align="left">P<sup>176</sup>, P<sup>188</sup>, P<sup>317</sup>, P<sup>401</sup>, P<sup>443</sup>, P<sup>446</sup>, P<sup>551</sup>, P<sup>623</sup>, P<sup>707</sup>, P<sup>755</sup>, P<sup>800</sup>, P<sup>854</sup>, P<sup>869</sup>, P<sup>878</sup>, P<sup>881</sup>, P<sup>911</sup>, P<sup>980</sup>, P<sup>992</sup>, P<sup>1034</sup>, P<sup>1103</sup>, P<sup>1106</sup>, P<sup>1148</sup>, P<sup>1166</sup>, P<sup>1169</sup>
</td>
<td align="left">K<sup>261</sup>, K<sup>270</sup>, K<sup>336</sup>, K<sup>381</sup>, K<sup>414</sup>, K<sup>426</sup>, K<sup>432</sup>, K<sup>504</sup>, K<sup>522</sup>, K<sup>570</sup>, K<sup>596</sup>, K<sup>641</sup>, K<sup>693</sup>, K<sup>726</sup>, K<sup>735</sup>, K<sup>765</sup>, K<sup>810</sup>, K<sup>819</sup>, K<sup>846</sup>, K<sup>918</sup>, K<sup>1017</sup>, K<sup>1080</sup>, K<sup>1191</sup>
</td>
<td align="left">K<sup>261</sup>, K<sup>270</sup>, K<sup>432</sup>, K<sup>504</sup>, K<sup>570</sup>, K<sup>693</sup>, K<sup>819</sup>, K<sup>846</sup>, K<sup>1191</sup>
</td>
<td align="left">K<sup>261</sup>, K<sup>270</sup>, K<sup>432</sup>, K<sup>504</sup>, K<sup>522</sup>, K<sup>570</sup>, K<sup>693</sup>, K<sup>726</sup>, K<sup>819</sup>, K<sup>846</sup>, K<sup>1017</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b</td>
<td align="left">P<sup>176</sup>, P<sup>191</sup>, P<sup>302</sup>, P<sup>320,</sup> P<sup>404</sup>, P<sup>470</sup>, P<sup>554</sup>, P<sup>758</sup>, P<sup>872</sup>, P<sup>881</sup>, P<sup>914</sup>, P<sup>938</sup>, P<sup>983</sup>, P<sup>1031</sup>, P<sup>1109</sup>, P<sup>1151</sup>, P<sup>1175</sup>
</td>
<td align="left">K<sup>215</sup>, K<sup>264</sup>, K<sup>273</sup>, K<sup>339</sup>, K<sup>384</sup>, K<sup>417</sup>, K<sup>429</sup>, K<sup>525</sup>, K<sup>573</sup>, K<sup>618</sup>, K<sup>696</sup>, K<sup>738</sup>, K<sup>746</sup>, K<sup>768</sup>, K<sup>813</sup>, K<sup>849</sup>, K<sup>900</sup>, K<sup>1020</sup>, K<sup>1193</sup>
</td>
<td align="left">K<sup>264</sup>, K<sup>273</sup>, K<sup>849</sup>, K<sup>1020</sup>
</td>
<td align="left">K<sup>264</sup>, K<sup>273</sup>, K<sup>339</sup>, K<sup>849</sup>
</td>
</tr>
<tr>
<td align="left">COL1A2</td>
<td align="left">P<sup>56</sup>, P<sup>59</sup>, P<sup>244</sup>, P<sup>286</sup>, P<sup>361</sup>, P<sup>451</sup>, P<sup>469</sup>, P<sup>673</sup>, P<sup>718</sup>, P<sup>787</sup>, P<sup>820</sup>, P<sup>838</sup>, P<sup>925</sup>, P<sup>1066</sup>, P<sup>1081</sup>
</td>
<td align="left">K<sup>74</sup>, K<sup>179</sup>, K<sup>188</sup>, K<sup>254</sup>, K<sup>299</sup>, K<sup>344</sup>, K<sup>350</sup>, K<sup>454</sup>, K<sup>488</sup>, K<sup>500</sup>, K<sup>533</sup>, K<sup>578</sup>, K<sup>644</sup>, K<sup>647</sup>, K<sup>683</sup>, K<sup>731</sup>, K<sup>737</sup>, K<sup>836</sup>, K<sup>935</sup>, K<sup>998</sup>, K<sup>1004</sup>
</td>
<td align="left">K<sup>74</sup>, K<sup>188</sup>, K<sup>254</sup>, K<sup>344</sup>, K<sup>350</sup>, K<sup>644</sup>
</td>
<td align="left">K<sup>74</sup>, K<sup>167</sup>, K<sup>188</sup>, K<sup>254</sup>, K<sup>299</sup>, K<sup>578</sup>, K<sup>644</sup>, K<sup>647</sup>, K<sup>935</sup>
</td>
</tr>
<tr>
<td align="left">COL4A1</td>
<td align="left">P<sup>201</sup>, P<sup>204</sup>, P<sup>294</sup>, P<sup>297</sup>, p<sup>1338</sup>
</td>
<td align="left"/>
<td align="left"/>
<td align="left">K<sup>463</sup>, K<sup>466</sup>, K<sup>882</sup>, K<sup>909</sup>, K<sup>1149</sup>, K<sup>1179</sup>, K<sup>1182</sup>
</td>
</tr>
<tr>
<td align="left">COL5A1</td>
<td align="left">P<sup>686</sup>, P<sup>746</sup>, P<sup>824</sup>, P<sup>1112</sup>, P<sup>1115</sup>, P<sup>1163</sup>, P<sup>1166</sup>, P<sup>1190</sup>, P<sup>1193</sup>, P<sup>1253</sup>, P<sup>1421</sup>, P<sup>1424</sup>
</td>
<td align="left">K<sup>1134</sup>, K<sup>1296</sup>, K<sup>1329</sup>, K<sup>1497</sup>
<sub>,</sub> K<sup>1578</sup>, K<sup>1581</sup>, K<sup>1592</sup>, K<sup>1604</sup>, K<sup>1644</sup>
</td>
<td align="left">K<sup>1344</sup>
</td>
<td align="left">K<sup>623</sup>, K<sup>626</sup>, K<sup>753</sup>, K<sup>813</sup>, K<sup>945</sup>, K<sup>1104</sup>, K<sup>1209</sup>, K<sup>1245</sup>, K<sup>1290</sup>, K<sup>1338</sup>, K<sup>1383</sup>, K<sup>1455</sup>
<sub>,</sub> K<sup>1482</sup>
<sub>,</sub> K<sup>1578</sup>, K<sup>1581</sup>
</td>
</tr>
<tr>
<td align="left">COL5A2a</td>
<td align="center">P<sup>241</sup>, P<sup>277</sup>, P<sup>532</sup>, P<sup>661</sup>, P<sup>868</sup>, P<sup>901</sup>, P<sup>937</sup>, P<sup>967</sup>, P<sup>1105</sup>
</td>
<td align="center">K<sup>280</sup>, K<sup>329</sup>, K<sup>449</sup>, K<sup>482</sup>, K<sup>494</sup>, K<sup>709</sup>, K<sup>749</sup>, K<sup>803</sup>, K<sup>878</sup>, K<sup>986</sup>, K<sup>1036</sup>, K<sup>1078</sup>, K<sup>1124</sup>, K<sup>1135</sup>
</td>
<td align="left"/>
<td align="left">K<sup>329</sup>, K<sup>338</sup>, K<sup>500</sup>, K<sup>557</sup>, K<sup>572</sup>, K<sup>761</sup>, K<sup>761</sup>, K<sup>794</sup>, K<sup>638</sup>, K<sup>803</sup>, K<sup>887</sup>
</td>
</tr>
<tr>
<td align="left">COL6A1</td>
<td align="left">P<sup>574</sup>, P<sup>577</sup>
</td>
<td align="left">K<sup>482</sup>, K<sup>538</sup>
</td>
<td align="left"/>
<td align="left">K<sup>398</sup>, K<sup>488</sup>, K<sup>538</sup>, K<sup>560</sup>, K<sup>563</sup>
</td>
</tr>
<tr>
<td align="left">COL6A3</td>
<td align="left"/>
<td align="left">K<sup>1796</sup>, K<sup>1975</sup>
</td>
<td align="left"/>
<td align="left">K<sup>1660</sup>, K<sup>1778</sup>, K<sup>1796</sup>, K<sup>1711</sup>, K<sup>1963</sup>, K<sup>1880</sup>, K<sup>1889</sup>
</td>
</tr>
<tr>
<td align="left">COL6A2</td>
<td align="left">P<sup>470</sup>
</td>
<td align="left">K<sup>387</sup>
</td>
<td align="left">K<sup>539</sup>
</td>
<td align="left">K<sup>339</sup>, K<sup>387</sup>, K<sup>426</sup>, K<sup>435</sup>, K<sup>438</sup>, K<sup>456</sup>, K<sup>459</sup>, K<sup>539</sup>, K<sup>545</sup>
</td>
</tr>
<tr>
<td align="left">COL11A1a</td>
<td align="left">P<sup>919</sup>
</td>
<td align="left">K<sup>1313</sup>
</td>
<td align="left">K<sup>1217</sup>, K<sup>1304</sup>
</td>
<td align="left">K<sup>911</sup>, K<sup>1148</sup>, K<sup>1151</sup>
</td>
</tr>
<tr>
<td align="left">COL4A2</td>
<td align="left">P<sup>339</sup>, P<sup>555</sup>, P<sup>613</sup>
</td>
<td align="left">K<sup>593</sup>, K<sup>704</sup>, K<sup>707</sup>, K<sup>1620</sup>, K<sup>1651</sup>
</td>
<td align="left">K<sup>689</sup>
</td>
<td align="left">K<sup>54</sup>, K<sup>72</sup>, K<sup>318</sup>, K<sup>567</sup>, K<sup>582</sup>, K<sup>593</sup>, K<sup>662</sup>, K<sup>674</sup>, K<sup>689</sup>, K<sup>704</sup>, K<sup>707</sup>, K<sup>725</sup>, K<sup>948</sup>, K<sup>951</sup>, K<sup>969</sup>, K<sup>1045</sup>, K<sup>1048</sup>, K<sup>1076</sup>, K<sup>1235</sup>, K<sup>1247</sup>, K<sup>1283</sup>, K<sup>1292</sup>, K<sup>1331</sup>, K<sup>1346</sup>
</td>
</tr>
<tr>
<td align="left">COL4A5</td>
<td align="left">P<sup>93</sup>
</td>
<td align="left">K<sup>127</sup>, K<sup>214</sup>
</td>
<td align="left">K<sup>225</sup>
</td>
<td align="left">K<sup>225</sup>
</td>
</tr>
<tr>
<td align="left">COL6A4a</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">K<sup>1501</sup>
</td>
<td align="left">K<sup>1490</sup>, K<sup>1787</sup>, K<sup>1796</sup>, K<sup>2148</sup>
</td>
</tr>
<tr>
<td align="left">COL2A1a</td>
<td align="left">-</td>
<td align="left">K<sup>612</sup>
</td>
<td align="left">K<sup>423</sup>
</td>
<td align="left">K<sup>423</sup>, K<sup>1178</sup>
</td>
</tr>
<tr>
<td align="left">COL2A1b</td>
<td align="left"/>
<td align="left"/>
<td align="left"/>
<td align="left"/>
</tr>
<tr>
<td align="left">COL16A1</td>
<td align="left">-</td>
<td align="left">K<sup>1008</sup>, K<sup>1343</sup>, K<sup>1495</sup>, K<sup>1498</sup>
</td>
<td align="left">-</td>
<td align="left">K<sup>644</sup>, K<sup>654</sup>, K<sup>1008</sup>, K<sup>1233</sup>
</td>
</tr>
<tr>
<td align="left">&#x2003;COL11A1b</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">K<sup>1191</sup>, K<sup>1400</sup>
</td>
</tr>
<tr>
<td align="left">COL5A2b</td>
<td align="left">P<sup>838</sup>
</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">K<sup>731</sup>
</td>
</tr>
<tr>
<td align="left">COL11A2</td>
<td align="left">P<sup>1133</sup>
</td>
<td align="left">K<sup>959</sup>, K<sup>963</sup>
</td>
<td align="left">-</td>
<td align="left">K<sup>1188</sup>, K<sup>1197</sup>, K<sup>1200</sup>, K<sup>1203</sup>
</td>
</tr>
<tr>
<td align="left">COL7A1</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">K<sup>1504</sup>, K<sup>2136</sup>
</td>
</tr>
<tr>
<td align="left">COL5A3b</td>
<td align="left">P<sup>437</sup>, P<sup>467</sup>
</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">-</td>
</tr>
<tr>
<td align="left">COL22A1</td>
<td align="left">-</td>
<td align="left">K<sup>495</sup>, K<sup>1128</sup>
</td>
<td align="left">K<sup>788</sup>, K<sup>1538</sup>
</td>
<td align="left">K<sup>498</sup>, K<sup>513</sup>, K<sup>785</sup>
</td>
</tr>
<tr>
<td align="left">&#x2003;COL17A1a</td>
<td align="left">P<sup>1156</sup>
</td>
<td align="left">-</td>
<td align="left">-</td>
<td align="left">-</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3-4">
<title>3.4 Characterization of PTMs in Collagen I From the Zebrafish Heart ECM</title>
<p>Collagen I is the highest abundant ECM protein present in the zebrafish hearts (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). The composition of the heterotrimer of the collagen I chain forming a protomer (triple helices) is different compared to mammals (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). In mammals, two alpha 1 chain and one alpha 2 chain form the protomer. However, in zebrafish, three different genes (COL1A1a, COL1A1b, and COL1A2) code for three different collagen I chains (collagen alpha 1a, collagen 1 alpha 1b (alpha 3), and collagen 1 alpha 2), respectively. High expression of these three genes has been documented in adult zebrafish tissues as well as during embryonic and larval development indicating equimolar stoichiometry in forming the protomers (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). However, different glycosylated forms of COL1A1a have been found in zebrafish embryos indicating altered PTM levels during development (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). Thus, tissue-specific mapping of collagen I site-specific PTMs are of the highest importance in order to further delineate their function. Here, we have comprehensively mapped the site-specific prolyl-hydroxylation, lysyl-hydroxylation, and lysine O-glycosylation in three different chains (COL1A1a, COL1A1b, and COL1A2) of collagen I in zebrafish heart ECM.</p>
<sec id="s3-4-1">
<title>3.4.1 Comprehensive Site-specific Map of PTMs in COL1A1a From the Zebrafish Heart ECM&#x2013;</title>
<p>COL1A1a chain was found to be the highest abundant out of all the three chains of collagen 1 triple helix in ECM of zebrafish heart. This chain has an important role in the stability and proper functioning of zebrafish cardiac tissues (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). It is a large 1,447&#xa0;amino acid long collagen chain. For determining the N (23&#x2013;146, FSP/QMS), and C terminal propeptides cleavage site (1,202&#x2013;1,447, RA/DDAN), we performed cleavage site sequence alignment matching with human COL1A1, which has also been reported previously (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). However, we also detected a peptide sequence from N terminal propeptide hinting toward the dynamics of collagen assembly in the ECM. PTM module enabled dual search engine-based database search yielded 94.98% sequence coverage (considering the processed chain as the full length) (<xref ref-type="fig" rid="F3">Figure 3</xref>; <xref ref-type="table" rid="T1">Table 1</xref>, <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref>). A total of 92 4-HyP sites on the &#x201c;Yaa&#x201d; position of &#x201c;Gly-Xaa-Yaa&#x201d; motif were detected. This estimate is suggestive of about 95% 4-HyP occupancy in the Yaa positioned (in Gly-Xaa-Yaa motif) prolines corroborating the previous Edman sequencing analysis in COL1A1 from mammals (<xref ref-type="bibr" rid="B10">Fietzek and K&#xfc;hn, 1975</xref>). Furthermore, we have also detected 41 hydroxyproline sites in the &#x201c;Xaa&#x201d; position of the &#x201c;Gly-Xaa-Yaa&#x201d; motif. This finding is similar (n &#x3d; 37) to human COL1A1 (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>). In addition, we analyzed the occurrence of prolyl-3-hydroxylation sites providing crucial structural equilibrium in the collagen molecules deposited in ECM and affecting interactions with other ECM proteins (<xref ref-type="bibr" rid="B36">Pokidysheva et al., 2014</xref>; <xref ref-type="bibr" rid="B16">Grafe et al., 2014</xref>; <xref ref-type="bibr" rid="B29">Montgomery et al., 2018</xref>). We assigned these HyP as 3-hydroxyproline (3-HyP) as determined by previous experimental inferences (<xref ref-type="bibr" rid="B10">Fietzek and K&#xfc;hn, 1975</xref>). A total of twenty-four 3-hydroxyproline sites (see <xref ref-type="table" rid="T1">Table 1</xref>) at the &#x201c;Xaa&#x201d; position of the G-Xaa-HyP motif in COL1A1a were identified. Out of these, 10 sites (P<sup>176</sup>, P<sup>551</sup>, P<sup>755</sup>, P<sup>800</sup>, P<sup>869</sup>, P<sup>881</sup>, P<sup>911</sup>, P<sup>1103</sup>, P<sup>1106</sup>
<sub>,</sub> and P<sup>1148</sup>) are evolutionarily conserved in mice (<xref ref-type="bibr" rid="B33">Padmanabhan Iyer et al., 2016</xref>) and human heart tissue (<xref ref-type="bibr" rid="B1">Barallobre-Barreiro et al., 2021</xref>) (<xref ref-type="fig" rid="F4">Figure 4</xref>; <xref ref-type="table" rid="T1">Table 1</xref>, <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref> and <xref ref-type="sec" rid="s10">Supplementary Figure S3</xref>). This highlights probable conserved functions of these sites in collagen molecule providing structural support and maintaining homeostatic ECM assembly. Interestingly, we report the identification of 14 novels (P<sup>188</sup>, P<sup>317</sup>, P<sup>401</sup>, P<sup>443</sup>, P<sup>446</sup>, P<sup>623</sup>, P<sup>707</sup>, P<sup>854</sup>, P<sup>878</sup>, P<sup>980</sup>, P<sup>992</sup>, P<sup>1034</sup>, P<sup>1166</sup>
<sub>,</sub> and P<sup>1169</sup>) 3-HyP sites in the COL1A1a from zebrafish heart ECM. In zebrafish heart ECM, COL1A1a harbors numerous unique 3-HyP sites close to the N-terminal propeptide cleavage sites (3-HyP<sup>188,317</sup>), spanning across the triple-helical region as well. Two clusters of &#x201c;Gly-Pro-Pro&#x201d; motif present close to C-terminal propeptide were found to be 3-hydroxylated (3-HyP<sup>1103,1106</sup> and 3-HyP<sup>1166,1169</sup>). The presence of many new 3-HyP sites compared to human and mouse hearts is itself a unique finding in the zebrafish COL1A1a (<xref ref-type="fig" rid="F4">Figure 4</xref>) protein structure. These unique prolyl-3-hydroxylation sites may contribute to ECM remodeling during heart regeneration and warrants further investigations.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Comprehensive map of proline/lysine hydroxylation sites and lysine O-glycosylation sites in COL1A1a of WT zebrafish heart ECM. Identified peptide sequence in the proteomic analysis is shown in black color, sequence not identified in this analysis are shown in grey color. A total of 94.98% sequence coverage of COL1A1a is detected (considering the matured form of COL1A1The signalgnal peptide is 22&#xa0;amino acids (1&#x2013;22) long. Sequence alignment matching with human COL1A1 revealed the propeptide cleavage sites. Dark yellow arrows show N terminal (23&#x2013;146) and C-Term (1,202&#x2013;1,447) propeptide cleavage sites. As shown in the top right corner, red bold &#x201c;<bold>P</bold>&#x201d; with a blue star represents 3-hydroxyproline on the Xaa position followed by 4-hydroxyproline the on Yaa position in the Gly-Xaa-Yaa motif. 4-hydroxyproline on Yaa position is represented with red color &#x201c;P&#x201d;. Hydroxyproline on unusual Xaa position with (Ala, Val, Met, Ile, Ser, Glu, Arg, and Asp) on Yaa position are also identified but cannot label either 3-hydroxyproline or 4-hydroxyproline. Hydroxylysine sites are presented by bold &#x201c;<bold>K</bold>&#x201d;. Lysine sites highlighted with a yellow circle represents galactosyl-hydroxylysine sites and yellow plus blue coloued circles represent glucosylgalactosyl-hydroxylysine sites. The presence of glucosylgalactosyl-hydroxylysine, galactosyl-hydroxylysine, and hydroxylysine on the same site shows lysine microheterogeneity. A summary of these site-specific PTMs of COL1A1a is presented in <xref ref-type="table" rid="T1">Table 1</xref>, and all the PSMs for O-glycosylated lysine and 3-hydroxyproline sites are provided in <xref ref-type="sec" rid="s10">Supplementary Figure S2.1-S2.21</xref>.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g003.tif"/>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Comparison of 3-HyP sites identified in COL1A1a from zebrafish heart ECM to COL1A1 of human and mice heart ECM. The horizontal box represents the full-length COL1A1 sequence and the vertical black lines indicate the corresponding 3-HyP sites. The information for 3-HyP sites of human and mice heart ECM were re-analyzed in this manuscript from the available raw MS data from Barallobre-Barreiro et al. and Padmanabhan et al. The 3-HyP sites marked with red represent the conserved sites among human, mouse, and zebrafish.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g004.tif"/>
</fig>
</sec>
<sec id="s3-4-2">
<title>3.4.2 Comprehensive Site-specific Map of PTMs in COL1A1b From the Zebrafish Heart ECM&#x2013;</title>
<p>In zebrafish cardiac ECM, the second chain contributing to the collagen 1 triple helix is COL1A1b. It is a 1,449&#xa0;amino acid long chain. Our analysis revealed COL1A1b to be the next highest abundant collagen chain present in the zebrafish cardiac ECM. We identified and comprehensively mapped site-specific PTMs of COL1A1b. N terminal (150, FLS/QMA) and C terminal propeptide cleavage sites (1,204, YRA/DDA) were determined by sequence alignment, corroborating the previous report (<xref ref-type="bibr" rid="B14">Gistelinck et al., 2016</xref>). We detected 96.48% sequence coverage (considering the matured form) for COL1A1b. A total of 99 4-HyP sites on the &#x201c;Yaa&#x201d; position of &#x201c;Gly-Xaa-Yaa&#x201d; motif were detected out of the possible 103 sites. Similar to COL1A1a, we detected seventeen 3-HyP sites in COL1A1b on the &#x2018;Xaa&#x2019; position of the Gly-Xaa-HyP motif out of the possible 27 sites. It seems that COL1A1b is less 3-hydroxylated at proline residues compared to COL1A1a. Furthermore, we have also detected 51 hydroxyproline sites in the &#x201c;Xaa&#x201d; position of &#x201c;Gly-Xaa-Yaa&#x201d; motif. As described previously, mass-spectrometry-based methods will not be able to resolve whether these prolyl-hydroxylations are occurring at the 3&#x2032; or 4&#x2032; position. However, recent studies have shown that these are most probably 4-prolyl-hydroxylations (<xref ref-type="bibr" rid="B50">Van Huizen et al., 2019</xref>). Surprisingly, we have detected a smaller number of lysine O-glycosylation sites in COL1A1b. A total of 19 hydroxylysines, 4 galactosyl-hydroxylysine, and 4 glucosylgalactosyl-hydroxylysine sites have been identified (<xref ref-type="table" rid="T1">Table 1</xref>). These comprehensive PTM maps of COL1A1b of zebrafish heart ECM are summarized in <xref ref-type="fig" rid="F5">Figure 5</xref> (the corresponding PSMs are summarized for each PTM site in <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Comprehensive PTM map of COL1A1b of WT zebrafish heart ECM, presenting proline/lysine hydroxylation sites and lysine O-glycosylation sites. The representation of PTM sthe ites is similar to the COL1A1a PTM map as shown in top right corner. Peptides identified in proteomics analysis are shown in black color and unidentified peptides are shown in grey color. A total of 96.48% sequence coverage of COL1A1a is detected (considering the matured form of COL1A1b). The signal peptide is 22&#xa0;amino acids (1&#x2013;22) long. Sequence alignment matching and previous analysis by Gistelink et al. with human COL1A1 revealed the propeptide cleavage sites. Dark yellow arrows show N terminal (23&#x2013;150) and C-Term (1,204&#x2013;1,447) propeptide cleavage sites. Red bold &#x201c;<bold>P</bold>&#x201d; with a blue star represents 3-HyP and red &#x201c;P&#x201d; represents 4-HyP. Hydroxylysine is represented with bold &#x201c;<bold>K</bold>&#x201d; and yellow and blue circle represents the lysine O-glycosylation. A summary of these site-specific PTMs of COL1A1b is presented in <xref ref-type="table" rid="T1">Table 1</xref>, and all the PSMs for O-glycosylated lysine and 3-hydroxyproline sites are provided in Supplementary Figures. S2.22&#x2013;S2.52.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g005.tif"/>
</fig>
</sec>
<sec id="s3-4-3">
<title>3.4.3 Comprehensive Site-specific Map of PTMs in COL1A2 From the Zebrafish Heart ECM&#x2013;</title>
<p>COL1A2 is the third contributing collagen chain in the collagen 1 triple helix of zebrafish heart ECM. COL1A2 is a little smaller than the other 2 chains of collagen 1 triple helix. It consists of 1,352&#xa0;amino acids. We performed a similar propeptide cleavage site sequence alignment match of human COL1A2 with zebrafish COL1A2 (<xref ref-type="fig" rid="F6">Figure 6</xref>; <xref ref-type="table" rid="T1">Table 1</xref>, <xref ref-type="sec" rid="s10">Supplementary Figure S2</xref>). We detected 97.21% sequence coverage of COL1A2 (considering the processed mature form). A total of 82 4-HyP sites on the &#x2018;Yaa&#x2019; position of &#x201c;Gly-Xaa-Yaa&#x201d; motif were detected out of the possible 83 sites highlighting almost complete occupancy of 4-HyP possible sites. We identified 15 3-HyP sites in COL1A2 on &#x201c;Xaa&#x201d; position of the G-Xaa-HyP motif out of the possible 21 sites. Furthermore, we have also detected 60 hydroxyproline sites in the &#x201c;Xaa&#x201d; position of &#x201c;Gly-Xaa-Yaa&#x201d; motif. Along with proline modification, lysine modifications are also detected in COL1A2. A total of 21 hydroxylysine sites were detected in COL1A2. Hydroxylysines serve as the substrate for further O-glycosylation. A total of 6 galactosyl-hydroxylysine sites and 9 glucosylgalactosyl-hydroxylysine sites were detected in COL1A2. <xref ref-type="fig" rid="F6">Figure 6</xref>, summarizes the first comprehensive PTM map of COL1A2.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Comprehensive map of COL1A2 of ECM of WT zebrafish heart. It presents proline/lysine hydroxylation sites and lysine glycosylation sites. Representation of PTM sites is similar to COL1A1a and COL1A1b PTM maps. Peptides identified in proteomics analysis are shown in black color and unidentified peptides are shown in grey color. Total 97.21% sequence coverage of COL1A2 is detected (considering the matured form of COL1A1b). The signal peptide is 22&#xa0;amino acids (1&#x2013;22) long. N terminal propeptide (23&#x2013;68) and C terminal propeptide (1,109&#x2013;1,352) cleavage sites are marked with dark yellow arrows. Red bold &#x201c;<bold>P</bold>&#x201d; with blue star represents 3-HyP and red &#x201c;P&#x201d; represents 4-HyP. Hydroxylysine is represented with bold &#x201c;<bold>K</bold>&#x201d; and yellow and blue circle represents the lysine O-glycosylation. A summary of these site-specific PTMs of COL1A2 is presented in <xref ref-type="table" rid="T1">Table 1</xref>, and all the PSMs for O-glycosylated lysine and 3-hydroxyproline sites are provided in <xref ref-type="sec" rid="s10">Supplementary Figure S2.53&#x2013;S2.83</xref>.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g006.tif"/>
</fig>
</sec>
</sec>
<sec id="s3-5">
<title>3.5 Quantitation of Site-specific Prolyl-3-Hydroxylation Occupancy During Regeneration of Zebrafish Heart</title>
<p>Identification of prolyl-3-hydroxylation sites in collagen chains of heart ECM provided new insights into the tissue-specific variation of collagen molecules providing structural support in ECM. Latest studies have revealed that the absence of prolyl-3-hydroxylation in one site (P<sup>1164</sup>) of COL1A1 is associated with osteogenesis imperfecta. Mutation in the prolyl-hydroxylases can cause the formation of dysfunctional collagen (without a 3-HyP site) as found in osteogenesis imperfecta disease conditions (<xref ref-type="bibr" rid="B4">Cabral et al., 2007</xref>). Furthermore, Merl-Pham <italic>et al</italic>. recently showed that the occupancy of 3-HyP<sup>771</sup> of human COL1A1 was increased during TGF-beta treated fibrotic conditions in primary human lung fibroblast (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>). These indicate that changes in the occupancy of the 3-HyP level can have important consequences on the functionality of the collagen molecules present in the ECM. Therefore, we check the occupancy level of 3-HyP in collagens during the regeneration of the zebrafish heart. We utilized our proteomic pipeline and quantitated the occupancy levels of collagen PTMs during zebrafish heart regeneration in the sham model used as control, 7-, 14- and 30-day post-amputation. The critical prolyl-3-hydroxylation site (P<sup>1164</sup>) involved in osteogenesis imperfecta had been previously shown to be fully 3-hydroxylated in mouse skin fibroblast (<xref ref-type="bibr" rid="B30">Morello et al., 2006</xref>). Surprisingly, this conserved site present in zebrafish (3-HyP<sup>1148</sup>) control cardiac ECM is only about half (48%) 3-hydroxylated (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="fig" rid="F7">Figure 7</xref>). Furthermore, the occupancy of this site decreased to &#x223c;7% in the 30 DPA regenerated zebrafish heart. The occupancy level of two other 3-HyP sites (P<sup>869</sup>, P<sup>878</sup>) of COL1A1a decreased at 30 DPA compared to control and 7 DPA occupancy (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="fig" rid="F7">Figure 7</xref>). Only 3-HyP<sup>707</sup> occupancy level was increased (7.5&#x2013;20%) at 30 DPA regenerated zebrafish heart compared to sham. This increased occupancy may favor the assembly of ECM by enhancing collagen fibril formation by forming a water bridge with the available free carbonyl group of the subsequent collagen chain. However, a decrease in occupancy of other 3-HyP sites may favor the reduced interaction of deposited collagen molecules in ECM thereby favoring the transition to regeneration from the fibrotic milieu. Although, the exact functional role of these site-specific 3-HyPs needs to be experimentally assessed. A total of five 3-hydroxylation sites (P<sup>404,554,914,1031,1109</sup>) occupancy levels were quantitated in COL1A1b. Out of these, 4 sites (P<sup>404,914,1031,1109</sup>) showed the trend of increased occupancy level at 30 DPA regenerated zebrafish heart. We quantitated the occupancy level of 4 sites in COL1A2. Most of these sites were having &#x3c;10% prolyl-3-hydroxylation. Interestingly, we identified a cluster of 3-HyP sites (P<sup>1195,1201</sup>) in COL5A2a to be highly 3-hydroxylated (24%) after 7 DPA samples compared to only 1.7% occupancy in control. The occupancy of this 3-HyP cluster showed dynamic changes at 14 DPA (55%) and further reverting to 30% at 30 DPA, in the regenerated zebrafish heart. The site-specific variation in the occupancy level of 3-hydroxyproline could be due to varied gene expression of specific enzymes expressed in the zebrafish heart during post-amputation regeneration. A recent report from Sanchez-Iranzo et al. showed the expression of prolyl-3-hydroxylase-3 (P3H3) and prolyl-3-hydroxylase-4 (P3H4) decreased at 7&#xa0;days post cryoinjury (log fold change -0.4,-0.2, respectively), in the zebrafish heart. However, in the same RNA seq analysis, the expression level of prolyl-3-hydroxylase-1 (P3H1) and prolyl-3-hydroxylase-2 (P3H2) were found to be increased (log fold change1.41,1.75 respectively) in the zebrafish heart at 7&#xa0;days post-cryoinjury model. Following the regeneration in the same model till 60&#xa0;days post-cryoinjury, the expression level of P3H2 and P3H4 increased (log fold change 0.94, 1.73 respectively) and the expression of P3H1 and P3H3 decreased (log fold change &#x2212;0.56, &#x2212;1.13) respectively highlighting the dynamic changes in the expression of these isoforms of prolyl-3-hydroxylases. However, the enzyme-substrate specificity for these different isoforms of prolyl-3-hydroxylases is not known yet. Thus, ascertaining a direct correlation between the expression level of one enzyme isoform with the substrate (occupancy level of site-specific modifications) specificity in this dataset is difficult. But, the varied expression of these enzymes during the post-cryoinjury regeneration model in zebrafish heart further supports the specific regulation of site-specific dynamic prolyl-3-hydroxylation changes in collagens deposited in the zebrafish heart ECM during regeneration. The dynamics of prolyl-3-hydroxylation occupancy in collagen molecules deposited in ECM during zebrafish heart regeneration are reported here for the first time.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Quantitative occupancy (%) of 3-hydroxyproline (3-HyP) sites identified in three different chains of collagen I (COL1A1a, COL1A1b, and COL1A2) from zebrafish cardiac ECM. Results are expressed as mean &#xb1; S.D. Significant statistical differences were estimated by ANOVA (&#x2a;<italic>p</italic> &#x3c; 0.05, <sup>ns</sup>p &#x3e; 0.05).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">3-HyP site</th>
<th colspan="4" align="center">3-Hydroxyproline average occupancy %</th>
</tr>
<tr>
<th align="center">Sham average</th>
<th align="center">7 DPA Average</th>
<th align="center">14 DPA Average</th>
<th align="center">30 DPA Average</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">COL1A1a P<sup>707</sup>
</td>
<td align="char" char="plusmn">7.55 &#xb1; 4.3</td>
<td align="char" char="plusmn">31.49 &#xb1; 38.1</td>
<td align="char" char="plusmn">21.21 &#xb1; 7.8</td>
<td align="char" char="plusmn">20.50 &#xb1; 1.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1a P<sup>869</sup>
</td>
<td align="char" char="plusmn">35.37 &#xb1; 6.9</td>
<td align="char" char="plusmn">22.82 &#xb1; 13.3</td>
<td align="char" char="plusmn">48.10 &#xb1; 63.8</td>
<td align="char" char="plusmn">20.12 &#xb1; 2.9 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1a P<sup>878</sup>
</td>
<td align="char" char="plusmn">28.87 &#xb1; 7.2</td>
<td align="char" char="plusmn">17.57 &#xb1; 18.4</td>
<td align="char" char="plusmn">27.63 &#xb1; 35.8</td>
<td align="char" char="plusmn">12.76 &#xb1; 1.8 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1a P<sup>1148</sup>
</td>
<td align="char" char="plusmn">48.73 &#xb1; 14.7</td>
<td align="char" char="plusmn">44.03 &#xb1; 17.0</td>
<td align="char" char="plusmn">68.96 &#xb1; 41.1</td>
<td align="char" char="plusmn">7.23 &#xb1; 4.6<sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL5A2a P<sup>1195, 1201</sup>
</td>
<td align="char" char="plusmn">1.77 &#xb1; 1.3</td>
<td align="char" char="plusmn">23.97 &#xb1; 4.5</td>
<td align="char" char="plusmn">55.24 &#xb1; 47.4</td>
<td align="char" char="plusmn">30.04 &#xb1; 41.1 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A2 P<sup>718</sup>
</td>
<td align="char" char="plusmn">0.19 &#xb1; 0.1</td>
<td align="char" char="plusmn">0.20 &#xb1; 0.3</td>
<td align="char" char="plusmn">0.10 &#xb1; 0.1</td>
<td align="char" char="plusmn">10.54 &#xb1; 7.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A2 P<sup>925</sup>
</td>
<td align="char" char="plusmn">5.14 &#xb1; 5.7</td>
<td align="char" char="plusmn">10.00 &#xb1; 13.2</td>
<td align="char" char="plusmn">0.53 &#xb1; 0.7</td>
<td align="char" char="plusmn">1.86 &#xb1; 1.8 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A2 P<sup>1066</sup>
</td>
<td align="char" char="plusmn">3.92 &#xb1; 3.4</td>
<td align="char" char="plusmn">3.87 &#xb1; 3.1</td>
<td align="char" char="plusmn">12.95 &#xb1; 15.7</td>
<td align="char" char="plusmn">6.91 &#xb1; 2.0 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A2 P<sup>361</sup>
</td>
<td align="char" char="plusmn">0.96 &#xb1; 0.3</td>
<td align="char" char="plusmn">1.52 &#xb1; 1.6</td>
<td align="char" char="plusmn">4.66 &#xb1; 4.5</td>
<td align="char" char="plusmn">1.18 &#xb1; 0.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b P<sup>404</sup>
</td>
<td align="char" char="plusmn">7.88 &#xb1; 1.9</td>
<td align="char" char="plusmn">18.28 &#xb1; 12.8</td>
<td align="char" char="plusmn">30.08 &#xb1; 6.7</td>
<td align="char" char="plusmn">11.95 &#xb1; 7.1 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b P<sup>554</sup>
</td>
<td align="char" char="plusmn">47.73 &#xb1; 4.0</td>
<td align="char" char="plusmn">46.52 &#xb1; 14.8</td>
<td align="char" char="plusmn">76.14 &#xb1; 29.7</td>
<td align="char" char="plusmn">42.91 &#xb1; 1.3 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b P<sup>914</sup>
</td>
<td align="char" char="plusmn">17.09 &#xb1; 1.4</td>
<td align="char" char="plusmn">16.74 &#xb1; 10.6</td>
<td align="char" char="plusmn">21.16 &#xb1; 14.0</td>
<td align="char" char="plusmn">47.35 &#xb1; 6.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b P<sup>1031</sup>
</td>
<td align="char" char="plusmn">12.44 &#xb1; 16.1</td>
<td align="char" char="plusmn">32.42 &#xb1; 34.0</td>
<td align="char" char="plusmn">18.31 &#xb1; 24.2</td>
<td align="char" char="plusmn">33.45 &#xb1; 3.6 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">COL1A1b P<sup>1109</sup>
</td>
<td align="char" char="plusmn">10.36 &#xb1; 7.6</td>
<td align="char" char="plusmn">3.93 &#xb1; 1.7</td>
<td align="char" char="plusmn">4.76 &#xb1; 6.0</td>
<td align="char" char="plusmn">32.64 &#xb1; 18.6 <sup>ns</sup>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Heatmap depicting the relative occupancy level of 3-hydroxyproline sites in three different chains of collagen 1 deposited in zebrafish heart ECM during regeneration. Further occupancy of one 3-HyP cluster of COL5A2a<sup>1195, 1201</sup> was also quantitated during zebrafish heart regeneration. Normalized occupancy values of prolyl-3-hydroxylations were computed to generate the heat map. Light yellow represents the low value (lowest occupancy) and dark red shows the higher value (highest occupancy) in the row.</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g007.tif"/>
</fig>
</sec>
<sec id="s3-6">
<title>3.6 Comprehensive Mapping and Microheterogeneity of Collagen 1 Lysine O-Glycosylation Sites During Regeneration of Zebrafish Heart</title>
<p>In collagens, lysine hydroxylation and glycosylation play important role in collagen assembly and fibril formation (<xref ref-type="bibr" rid="B47">Terajima et al., 2014</xref>; <xref ref-type="bibr" rid="B48">Terajima et al., 2016</xref>). Musculoskeletal defects and cerebral small vessel disease are caused due to lack of collagen glycosylation (<xref ref-type="bibr" rid="B13">Geister et al., 2019</xref>; <xref ref-type="bibr" rid="B28">Miyatake et al., 2018</xref>; <xref ref-type="bibr" rid="B21">Kivirikko et al., 1973</xref>). A total of 23 HyK, 9&#xa0;G-HyK, and 11&#xa0;GG-HyK sites in zebrafish COL1A1a were identified from wild-type (Sham) zebrafish heart ECM (<xref ref-type="fig" rid="F3">Figure 3</xref>). Classical amino acid analysis has revealed that fibrillar collagens are less glycosylated compared to basement membrane collagen IV (<xref ref-type="bibr" rid="B47">Terajima et al., 2014</xref>; <xref ref-type="bibr" rid="B35">Perdivara et al., 2013</xref>; <xref ref-type="bibr" rid="B55">Yamauchi and Sricholpech, 2012</xref>). However, collagen glycosylation in fibrillar collagen is important for crosslinking mediated fibrillar assembly. Microheterogeneity of lysine sites has been determined in collagens chains (<xref ref-type="bibr" rid="B54">Yamauchi et al., 1982</xref>). A single lysine site of a collagen chain could be present in the form of unmodified, hydroxylysine (HyK), galactosyl-hydroxylysine (G-HyK), and glucosyl-galactosyl-hydroxylysine (GG-HyK). This microheterogeneity was found to be highly dynamic in the COL1A1a chain during zebrafish heart regeneration. We detected microheterogeneity on a total of eight lysine sites in the COL1A1a chain. <xref ref-type="fig" rid="F5">Figure 5</xref> and <xref ref-type="fig" rid="F6">Figure 6</xref> respectively show the site-specific mapping of lysine O-glycosylation sites in COL1A1b and COL1A2 chains from wild-type zebrafish heart ECM. In COL1A1b a total of 24 HyK, 4&#xa0;G-HyK (K<sup>264</sup>, K<sup>273</sup>, K<sup>849</sup>, K<sup>1020</sup>), and 4&#xa0;GG-HyK (K<sup>264</sup>, K<sup>273</sup>, K<sup>399</sup>, K<sup>849</sup>) sites were identified. Three of these sites showed microheterogeneity in glycosylation patterns. Similarly, a total of 21 HyK, 6&#xa0;G-HyK (K<sup>74</sup>, K<sup>188</sup>, K<sup>254</sup>, K<sup>344</sup>, K<sup>350</sup>, K<sup>644</sup>), and 9&#xa0;GG-HyK (K<sup>74</sup>, K<sup>167</sup>, K<sup>188</sup>, K<sup>254</sup>, K<sup>299</sup>, K<sup>578</sup>, K<sup>644</sup>, K<sup>647</sup>, K<sup>935</sup>) sites were identified in COL1A2 chain for the first time from zebrafish heart ECM. Four of these sites showed microheterogeneity in different glycosylation moieties (galactosyl or glucosyl-galactosyl). Overall, our analysis mapped the different lysine O-glycosylation sites in three different chains of collagen 1 chains. Collagen 1A1a was found to be less deposited during the regeneration of the zebrafish heart by our as well as Garcia-Puig et al. analysis (<xref ref-type="bibr" rid="B12">Garcia-Puig et al., 2019</xref>). Furthermore, we wanted to assess the changes in O-glycosylation of collagen I chains during zebrafish heart regeneration.</p>
<p>In order to assess the changes in the microheterogeneity of lysine O-glycosylation sites on collagen I during regeneration of zebrafish heart, the Skyline-based targeted MS<sup>1</sup> quantitative method was integrated into our pipeline as described previously (<xref ref-type="bibr" rid="B26">Merl-Pham et al., 2019</xref>). The relative microheterogeneity analysis of site-specific lysine in three different chains of collagen I was performed. A total of 12 sites of lysine microheterogeneity from three different chains of collagen I was assessed (<xref ref-type="table" rid="T3">Table 3</xref>). <xref ref-type="fig" rid="F8">Figure 8</xref> represents an example of a COL1A1a peptide <sup>1011</sup>DGAAGPKGDRGETGPSGTPGAPGPPGAAGPIGPAGK<sup>1046</sup> eluting at 42.3&#xa0;min during the chromatographic separation as non-glycosylated form with three 4-HyP (P<sup>1029,1032,1035</sup>) sites. Hydroxylation of K<sup>1017</sup> resulted in an early elution of this peptide at &#x223c;42&#xa0;min in the C18 column-based separation. Further, the glucosyl-galactosyl-hydroxylysine form of K<sup>1017</sup> was further eluted earlier (at 41.2&#xa0;min) than the hydroxylation form. We determined the microheterogeneity of this site (K<sup>1017</sup>) across control, 7-day post-amputation (DPA), 14 DPA, and 30 DPA samples across zebrafish heart regeneration. Interestingly, the glucosyl-galactosyl-hydroxylation of K<sup>1017</sup> increased significantly (<italic>p</italic> &#x3c; 0.01) to 15.4% in the 30 DPA regenerated heart as compared to control (2.74%), (3.35%) 7 DPA. and 14 DPA (10.5%) samples (<xref ref-type="fig" rid="F8">Figure 8</xref>; <xref ref-type="table" rid="T3">Table 3</xref>). The increase in the micro heterogenic distribution of O-glycosylation the in K<sup>1017</sup> site was evident by a decrease in the HyK level during regeneration (<xref ref-type="fig" rid="F8">Figure 8</xref>; <xref ref-type="table" rid="T3">Table 3</xref>). Using a similar strategy, we found K<sup>846</sup> (COL1A1a) to be specifically present in a full glycosylated form (either G-HyK or GG-HyK). The GG-HyK<sup>846</sup> was decreased at 30 DPA regenerated zebrafish heart. We also quantitated the microheterogeneity of three lysine sites (K<sup>264,273,849</sup>) in COL1A1b and 2 sites in COL1A2 (K<sup>254,644</sup>) (see <xref ref-type="table" rid="T3">Table 3</xref>). The micro-heterogenic distribution of G-HyK<sup>264</sup> of COL1A1b was found to have an increasing trend at 30 DPA. The changes in the level of hydroxylation and O-glycosylation of lysine sites present in collagen I could be potentially due to the gene expression changes of lysyl-hydroxylases and glycosyltransferases during zebrafish heart regeneration. In a similar model of zebrafish heart regeneration, proteomic analysis performed by Ma <italic>et al</italic>. (<xref ref-type="bibr" rid="B24">2018</xref>) revealed a significant elevation of LH2 levels at 2 DPA (<xref ref-type="sec" rid="s10">Supplementary Figure S4</xref>). However, at 14 DPA the LH2 protein level stabilizes to sham levels. Similar oscillatory gene expressional observations of the genes (PLOD1a, PLOD2, and PLOD3) coding for different isoforms of lysyl-hydroxylases has been documented in the zebrafish heart cryoinjury model (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). Global gene expression analysis revealed increased expression of these three genes in the zebrafish heart at 7&#xa0;days post-injury. However, the level of gene expression of PLOD1a, PLOD2, and PLOD3 decreases at 60&#xa0;days post injury in the regenerated heart (<xref ref-type="bibr" rid="B40">S&#xe1;nchez-Iranzo et al., 2018</xref>). These micro-heterogenic site-specific glycosylation profiles point towards the differences present in three different chains of collagen I that may play an important role in the ECM remodeling mediated zebrafish heart regeneration.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Quantitative micro-heterogenic occupancy (%) of O-glycosylated lysine sites identified three different chains of collagen I (COL1A1a, COL1A1b, and COL1A2) from zebrafish cardiac ECM. Results are expressed as mean &#xb1; SEM. Statistically significant differences were estimated by ANOVA (&#x2a;<italic>p</italic> &#x3c; 0.05, <sup>ns</sup> &#x3e; 0.05). N/D, not detected; HyK, hydroxylysine; G-HyK, galactosyl-hydroxylysine; GG-HyK, glucosylgalactosylhydroxylysine.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center"/>
<th align="left"/>
<th colspan="4" align="center">Microheterogeneity %</th>
</tr>
<tr>
<th align="left">Lysine sites</th>
<th align="center">Modification</th>
<th align="center">Sham</th>
<th align="center">7 DPA</th>
<th align="center">14 DPA</th>
<th align="center">30 DPA</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>261</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">36.62 &#xb1; 9.7</td>
<td align="char" char="plusmn">43.75 &#xb1; 4.38</td>
<td align="char" char="plusmn">46.09 &#xb1; 4.89</td>
<td align="char" char="plusmn">32.30 &#xb1; 5.08 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">38.38 &#xb1; 10.38</td>
<td align="char" char="plusmn">34.81 &#xb1; 12.3</td>
<td align="char" char="plusmn">21.13 &#xb1; 2.85</td>
<td align="char" char="plusmn">45.72 &#xb1; 8.65 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">16.71 &#xb1; 4.9</td>
<td align="char" char="plusmn">17.85 &#xb1; 12.0</td>
<td align="char" char="plusmn">31.26 &#xb1; 3.01</td>
<td align="char" char="plusmn">20.22 &#xb1; 3.01 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">8.28 &#xb1; 6.5</td>
<td align="char" char="plusmn">3.59 &#xb1; 4.3</td>
<td align="char" char="plusmn">1.32 &#xb1; 0.97</td>
<td align="char" char="plusmn">2.35 &#xb1; 0.57 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>270</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">50.46 &#xb1; 5.3</td>
<td align="char" char="plusmn">32.92 &#xb1; 4.4</td>
<td align="char" char="plusmn">26.58 &#xb1; 1.7</td>
<td align="char" char="plusmn">46.48 &#xb1; 4.1 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">34.97 &#xb1; 0.7</td>
<td align="char" char="plusmn">37.13 &#xb1; 1.3</td>
<td align="char" char="plusmn">46.12 &#xb1; 1.1</td>
<td align="char" char="plusmn">40.69 &#xb1; 4.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">12.26 &#xb1; 5.0</td>
<td align="char" char="plusmn">29.03 &#xb1; 4.4</td>
<td align="char" char="plusmn">24.56 &#xb1; 2.7</td>
<td align="char" char="plusmn">11.52 &#xb1; 0.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">2.32 &#xb1; 1.1</td>
<td align="char" char="plusmn">0.93 &#xb1; 1.3</td>
<td align="char" char="plusmn">2.74 &#xb1; 0.2</td>
<td align="char" char="plusmn">1.30 &#xb1; 0.7 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>504</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">35.12 &#xb1; 0.2</td>
<td align="char" char="plusmn">6.54 &#xb1; 0.3</td>
<td align="char" char="plusmn">34.96 &#xb1; 46.9</td>
<td align="char" char="plusmn">51.61 &#xb1; 20.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">7.20 &#xb1; 4.4</td>
<td align="char" char="plusmn">3.49 &#xb1; 0.3</td>
<td align="char" char="plusmn">16.03 &#xb1; 22.4</td>
<td align="char" char="plusmn">16.85 &#xb1; 5.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">57.68 &#xb1; 4.2</td>
<td align="char" char="plusmn">89.97 &#xb1; 0.0</td>
<td align="char" char="plusmn">49.01 &#xb1; 69.3</td>
<td align="char" char="plusmn">31.54 &#xb1; 15.0 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>570</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">59.52 &#xb1; 12.5</td>
<td align="char" char="plusmn">24.79 &#xb1; 15.4</td>
<td align="char" char="plusmn">17.67 &#xb1; 1.6</td>
<td align="char" char="plusmn">42.84 &#xb1; 51.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">14.06 &#xb1; 4.6</td>
<td align="char" char="plusmn">6.83 &#xb1; 7.6</td>
<td align="char" char="plusmn">4.50 &#xb1; 0.1</td>
<td align="char" char="plusmn">0.21 &#xb1; 0.3 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="center">ND</td>
<td align="left">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">26.42 &#xb1; 17.1</td>
<td align="char" char="plusmn">68.39 &#xb1; 7.8</td>
<td align="char" char="plusmn">77.83 &#xb1; 1.6</td>
<td align="char" char="plusmn">56.95 &#xb1; 51.8 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>693</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">16.05 &#xb1; 1.8</td>
<td align="char" char="plusmn">11.35 &#xb1; 1.4</td>
<td align="char" char="plusmn">21.14 &#xb1; 15.4</td>
<td align="char" char="plusmn">17.46 &#xb1; 2.3 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">73.86 &#xb1; 2.2</td>
<td align="char" char="plusmn">83.40 &#xb1; 0.1</td>
<td align="char" char="plusmn">70.60 &#xb1; 23.8</td>
<td align="char" char="plusmn">74.25 &#xb1; 1.7 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GK</td>
<td align="char" char="plusmn">1.34 &#xb1; 0.3</td>
<td align="char" char="plusmn">0.25 &#xb1; 0.4</td>
<td align="char" char="plusmn">0.56 &#xb1; 0.8</td>
<td align="char" char="plusmn">1.47 &#xb1; 0.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GGK</td>
<td align="char" char="plusmn">8.75 &#xb1; 0.4</td>
<td align="char" char="plusmn">5.00 &#xb1; 1.0</td>
<td align="char" char="plusmn">7.69 &#xb1; 9.2</td>
<td align="char" char="plusmn">6.82 &#xb1; 3.8 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>846</sup>
</td>
<td align="left">K</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">48.69 &#xb1; 29.7</td>
<td align="char" char="plusmn">85.04 &#xb1; 19.4</td>
<td align="char" char="plusmn">51.02 &#xb1; 9.0</td>
<td align="char" char="plusmn">68.87 &#xb1; 5.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">51.31 &#xb1; 29.7</td>
<td align="char" char="plusmn">14.96 &#xb1; 19.4</td>
<td align="char" char="plusmn">48.98 &#xb1; 9.0</td>
<td align="char" char="plusmn">31.13 &#xb1; 5.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1a K<sup>1017</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">62.44 &#xb1; 0.3</td>
<td align="char" char="plusmn">33.69 &#xb1; 22.8</td>
<td align="char" char="plusmn">52.55 &#xb1; 21.8</td>
<td align="char" char="plusmn">60.09 &#xb1; 0.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">34.82 &#xb1; 0.4</td>
<td align="char" char="plusmn">62.96 &#xb1; 20.9</td>
<td align="char" char="plusmn">36.95 &#xb1; 22.1</td>
<td align="char" char="plusmn">24.44 &#xb1; 0.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">2.74 &#xb1; 0.7</td>
<td align="char" char="plusmn">3.35 &#xb1; 1.9</td>
<td align="char" char="plusmn">10.50 &#xb1; 0.3</td>
<td align="char" char="plusmn">15.47 &#xb1; 0.3 &#x2a;</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1b K<sup>264</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">50.52 &#xb1; 15.5</td>
<td align="char" char="plusmn">40.63 &#xb1; 50.7</td>
<td align="char" char="plusmn">52.94 &#xb1; 15.8</td>
<td align="char" char="plusmn">20.39 &#xb1; 20.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">41.23 &#xb1; 8.2</td>
<td align="char" char="plusmn">53.16 &#xb1; 56.0</td>
<td align="char" char="plusmn">30.76 &#xb1; 34.6</td>
<td align="char" char="plusmn">46.65 &#xb1; 44.7 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">5.19 &#xb1; 4.4</td>
<td align="char" char="plusmn">4.64 &#xb1; 3.6</td>
<td align="char" char="plusmn">9.75 &#xb1; 10.8</td>
<td align="char" char="plusmn">27.51 &#xb1; 20.7 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">3.05 &#xb1; 2.9</td>
<td align="char" char="plusmn">1.57 &#xb1; 1.6</td>
<td align="char" char="plusmn">6.54 &#xb1; 7.9</td>
<td align="char" char="plusmn">5.45 &#xb1; 3.7 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1b K<sup>273</sup>
</td>
<td align="left">K</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">41.65 &#xb1; 47.5</td>
<td align="char" char="plusmn">50.93 &#xb1; 39.7</td>
<td align="char" char="plusmn">24.43 &#xb1; 25.6</td>
<td align="char" char="plusmn">33.90 &#xb1; 36.0 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">5.58 &#xb1; 4.6</td>
<td align="char" char="plusmn">33.38 &#xb1; 20.8</td>
<td align="char" char="plusmn">27.55 &#xb1; 13.3</td>
<td align="char" char="plusmn">19.99 &#xb1; 10.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">52.77 &#xb1; 42.9</td>
<td align="char" char="plusmn">15.69 &#xb1; 19.0</td>
<td align="char" char="plusmn">48.02 &#xb1; 38.9</td>
<td align="char" char="plusmn">46.11 &#xb1; 25.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A1b K<sup>849</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">27.06 &#xb1; 1.1</td>
<td align="char" char="plusmn">10.35 &#xb1; 13.7</td>
<td align="char" char="plusmn">11.17 &#xb1; 2.2</td>
<td align="char" char="plusmn">22.95 &#xb1; 3.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">67.46 &#xb1; 1.1</td>
<td align="char" char="plusmn">68.53 &#xb1; 16.8</td>
<td align="char" char="plusmn">70.77 &#xb1; 9.7</td>
<td align="char" char="plusmn">62.89 &#xb1; 0.3 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">3.35 &#xb1; 0.4</td>
<td align="char" char="plusmn">6.76 &#xb1; 9.4</td>
<td align="char" char="plusmn">12.97 &#xb1; 4.5</td>
<td align="char" char="plusmn">9.48 &#xb1; 1.5 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">2.13 &#xb1; 0.4</td>
<td align="char" char="plusmn">14.36 &#xb1; 12.4</td>
<td align="char" char="plusmn">5.09 &#xb1; 2.9</td>
<td align="char" char="plusmn">4.68 &#xb1; 2.2 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A2 K<sup>254</sup>
</td>
<td align="left">K</td>
<td align="char" char="plusmn">6.48 &#xb1; 3.2</td>
<td align="char" char="plusmn">20.75 &#xb1; 14.6</td>
<td align="char" char="plusmn">21.61 &#xb1; 14.3</td>
<td align="char" char="plusmn">55.69 &#xb1; 18.4 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">89.24 &#xb1; 1.6</td>
<td align="char" char="plusmn">78.24 &#xb1; 13.4</td>
<td align="char" char="plusmn">69.76 &#xb1; 2.2</td>
<td align="char" char="plusmn">43.90 &#xb1; 18.1 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="char" char="plusmn">0.11 &#xb1; 0.0</td>
<td align="char" char="plusmn">0.10 &#xb1; 0.1</td>
<td align="char" char="plusmn">0.22 &#xb1; 0.3</td>
<td align="char" char="plusmn">0.05 &#xb1; 0.0 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">GG-HyK</td>
<td align="char" char="plusmn">4.17 &#xb1; 4.8</td>
<td align="char" char="plusmn">0.91 &#xb1; 1.1</td>
<td align="char" char="plusmn">8.40 &#xb1; 11.7</td>
<td align="char" char="plusmn">0.36 &#xb1; 0.3 <sup>ns</sup>
</td>
</tr>
<tr>
<td rowspan="4" align="left">COL1A2 K<sup>644</sup>
</td>
<td align="left">K</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">HyK</td>
<td align="char" char="plusmn">84.59 &#xb1; 1.8</td>
<td align="char" char="plusmn">61.15 &#xb1; 40.2</td>
<td align="char" char="plusmn">95.15 &#xb1; 6.5</td>
<td align="char" char="plusmn">90.77 &#xb1; 11.1 <sup>ns</sup>
</td>
</tr>
<tr>
<td align="left">G-HyK</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
<td align="center">ND</td>
</tr>
<tr>
<td align="left">GGK</td>
<td align="char" char="plusmn">15.41 &#xb1; 1.8</td>
<td align="char" char="plusmn">38.85 &#xb1; 40.2</td>
<td align="char" char="plusmn">4.85 &#xb1; 6.5</td>
<td align="char" char="plusmn">9.23 &#xb1; 11.1 <sup>ns</sup>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Quantitation of microheterogeneity of K<sup>1017</sup> site in COL1A1a present in zebrafish heart ECM during regeneration. <bold>(A)</bold> Chromatogram plots represent the elution of unmodified <bold>(K)</bold>, hydroxylysine (HyK), and glucosylgalactosyl-hydroxylysine (GG-HyK) modified K<sup>1017</sup> site containing peptide <sup>1011</sup>DGAAGP<bold>K</bold>GDRGETGPSGTPGAPGPPGAAGPIGPAGK<sup>1046</sup> <bold>(B)</bold> Graphical representation of Skyline-based MS<sup>1</sup> quantitation of micro-heterogenic distribution of unmodified K<sup>1017</sup> (yellow), HyK<sup>1017</sup> (green) and GG-HyK277 (blue) species in COL1A1a from the ECM digest of control, 7 DPA, 14 DPA, and 30 DPA regenerating zebrafish heart. The different colors in the bar represent the occupancy of different forms at the K<sup>1017</sup> site in COL1A1a with mean &#xb1; SEM. An increase in glucosylgalactosyl-hydroxylysine levels during regeneration is significant (ANOVA, <italic>p</italic> &#x3c; 0.05) (See <xref ref-type="table" rid="T3">Table 3</xref>)</p>
</caption>
<graphic xlink:href="fmolb-09-892763-g008.tif"/>
</fig>
</sec>
</sec>
<sec id="s4">
<title>4 Conclusion</title>
<p>In conclusion, our analysis presents the first comprehensive mapping and dynamics of site-specific collagen PTMs from the crude extracellular matrix proteins in wild-type and regenerating zebrafish hearts. We optimized an in-house dual database search engine-based mass-spectrometry data analysis strategy to efficiently identify long post-translationally modified collagen peptides from crude ECM analysis. We identified the highest number of collagen chains (36) present in the zebrafish heart ECM. Our analysis revealed that the expression of collagens gets altered during regeneration. Global characterization of zebrafish heart ECM revealed the identification of site-specific PTMs (proline hydroxylation, lysine hydroxylation, and lysine glycosylation sites) in 23 different collagen chains. These are the first comprehensive site-specific collagen PTM maps of zebrafish cardiac ECM highlighting some novel sites. Further, we present the first comprehensive PTM maps of three chains (COL1A1a, COL1A1b, and COL1A2) of collagen 1. Our analysis also identified the conserved 3-hydroxyproline sites occurring in the COL1A1 chain in humans, mice, and zebrafish heart ECM. This finding highlights the importance of these conserved 3-HyP sites involved in the basic function of collagens in humans, mice, and zebrafish myocardium. Moreover, we established that site-specific PTM (Prolyl-3-hydroxylations and lysyl O-glycosylation) changes in collagen chains are correlated with the regeneration process. Glycosylation on one site (GG-HyK<sup>1017</sup>) of COL1A1a is found to be significantly increased during the regeneration process. Taken together, our analysis will widen new avenues to dig deeper in understanding the role of collagen PTMs during zebrafish heart regeneration.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>Publicly available datasets were analyzed in this study. The datasets PXD011627 and PXD010092 can be accessed at: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://www.proteomexchange.org/">https://www.proteomexchange.org/</ext-link>.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>VS, SS, and TB worked on analyzing, interpreting, and summarizing the study. VS and SS prepared the initial drafts of the manuscript. VS prepared all the figures. TB conceptualize the overall structure of the study and finalized the manuscript.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>The authors acknowledge Prof. Angel Raya for sharing the raw mass-spectrometry data (&#x2a;.d files) for the regenerating samples. Authors also acknowledge the HPC facility of IIT-Mandi for performing intensive database searches. The Seed grant (IITM/SG/TB/75) from IIT-Mandi and the Indian Council of Medical Research (ICMR) funded &#x201c;High-Resolution Plasma Proteomic and Lipidomic Analyses for Fibrosis-Related Metabolic Assessment In Dilated Cardiomyopathy (DCM) Patients in India: A Multi-center Based Study (NCD/Adhoc/40/2020&#x2013;2021)&#x201d; to TB is acknowledged for this work. VS and SS also acknowledge the HTRA fellowship (MHRD, Govt. of India) for the doctoral and post-graduate program.</p>
</ack>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmolb.2022.892763/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmolb.2022.892763/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barallobre-Barreiro</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Radovits</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Fava</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mayr</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>W.-Y.</given-names>
</name>
<name>
<surname>Ermolaeva</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Extracellular Matrix in Heart Failure: Role of ADAMTS5 in Proteoglycan Remodeling</article-title>. <source>Circulation</source> <volume>144</volume> (<issue>25</issue>), <fpage>2021</fpage>&#x2013;<lpage>2034</lpage>. <pub-id pub-id-type="doi">10.1161/circulationaha.121.055732</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Basak</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Vega-Montoto</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zimmerman</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Tabb</surname>
<given-names>D. L.</given-names>
</name>
<name>
<surname>Hudson</surname>
<given-names>B. G.</given-names>
</name>
<name>
<surname>Vanacore</surname>
<given-names>R. M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Comprehensive Characterization of Glycosylation and Hydroxylation of Basement Membrane Collagen IV by High-Resolution Mass Spectrometry</article-title>. <source>J. Proteome Res.</source> <volume>15</volume> (<issue>1</issue>), <fpage>245</fpage>&#x2013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.5b00767</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bootorabi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Manouchehri</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Changizi</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Barker</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Palazzo</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Saltari</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Zebrafish as a Model Organism for the Development of Drugs for Skin Cancer</article-title>. <source>Int. J. Mol. Sci.</source> <volume>18</volume> (<issue>7</issue>), <fpage>1550</fpage>. <pub-id pub-id-type="doi">10.3390/ijms18071550</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cabral</surname>
<given-names>W. A.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Barnes</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Weis</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Leikin</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>Prolyl 3-hydroxylase 1 Deficiency Causes a Recessive Metabolic Bone Disorder Resembling Lethal/severe Osteogenesis Imperfecta</article-title>. <source>Nat. Genet.</source> <volume>39</volume> (<issue>3</issue>), <fpage>359</fpage>&#x2013;<lpage>365</lpage>. <pub-id pub-id-type="doi">10.1038/ng1968</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>W. C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Missinato</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>D. W.</given-names>
</name>
<name>
<surname>Long</surname>
<given-names>D. W.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H. J.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Decellularized Zebrafish Cardiac Extracellular Matrix Induces Mammalian Heart Regeneration</article-title>. <source>Sci. Adv.</source> <volume>2</volume> (<issue>11</issue>), <fpage>e1600844</fpage>. <pub-id pub-id-type="doi">10.1126/sciadv.1600844</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cowling</surname>
<given-names>R. T.</given-names>
</name>
<name>
<surname>Kupsky</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kahn</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Daniels</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Greenberg</surname>
<given-names>B. H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Mechanisms of Cardiac Collagen Deposition in Experimental Models and Human Disease</article-title>. <source>Transl. Res.</source> <volume>209</volume>, <fpage>138</fpage>&#x2013;<lpage>155</lpage>. <pub-id pub-id-type="doi">10.1016/j.trsl.2019.03.004</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cutie</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>G. N.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Vertebrate Cardiac Regeneration: Evolutionary and Developmental Perspectives</article-title>. <source>Cell Regen.</source> <volume>10</volume> (<issue>1</issue>), <fpage>6</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1186/s13619-020-00068-y</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Di Lullo</surname>
<given-names>G. A.</given-names>
</name>
<name>
<surname>Sweeney</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>K&#xf6;rkk&#xf6;</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ala-Kokko</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>San Antonio</surname>
<given-names>J. D.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Mapping the Ligand-Binding Sites and Disease-Associated Mutations on the Most Abundant Protein in the Human, Type I Collagen</article-title>. <source>J. Biol. Chem.</source> <volume>277</volume> (<issue>6</issue>), <fpage>4223</fpage>&#x2013;<lpage>4231</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m110709200</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Duran</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Csukasi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>S. P.</given-names>
</name>
<name>
<surname>Krakow</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Becerra</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bombarely</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Collagen Duplicate Genes of Bone and Cartilage Participate during Regeneration of Zebrafish Fin Skeleton</article-title>. <source>Gene Expr. Patterns</source> <volume>19</volume> (<issue>1&#x2013;2</issue>), <fpage>60</fpage>&#x2013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1016/j.gep.2015.07.004</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fietzek</surname>
<given-names>P. P.</given-names>
</name>
<name>
<surname>K&#xfc;hn</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>1975</year>). <article-title>Information Contained in the Amino Acid Sequence of the alpha1(I)-chain of Collagen and its Consequences upon the Formation of the Triple Helix, of Fibrils and Crosslinks</article-title>. <source>Mol. Cell Biochem.</source> <volume>8</volume> (<issue>3</issue>), <fpage>141</fpage>&#x2013;<lpage>157</lpage>. <pub-id pub-id-type="doi">10.1007/bf01792765</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frangogiannis</surname>
<given-names>N. G.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>The Extracellular Matrix in Ischemic and Nonischemic Heart Failure</article-title>. <source>Circ. Res.</source> <volume>125</volume> (<issue>1</issue>), <fpage>117</fpage>&#x2013;<lpage>146</lpage>. <pub-id pub-id-type="doi">10.1161/circresaha.119.311148</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garcia-Puig</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Mosquera</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Jim&#xe9;nez-Delgado</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Garc&#xed;a-Pastor</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jorba</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Navajas</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Proteomics Analysis of Extracellular Matrix Remodeling during Zebrafish Heart Regeneration</article-title>. <source>Mol. Cell. Proteomics</source> <volume>18</volume> (<issue>9</issue>), <fpage>1745</fpage>&#x2013;<lpage>1755</lpage>. <pub-id pub-id-type="doi">10.1074/mcp.ra118.001193</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Geister</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>Lopez-Jimenez</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Houghtaling</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ho</surname>
<given-names>T. H.</given-names>
</name>
<name>
<surname>Vanacore</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Beier</surname>
<given-names>D. R.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Loss of Function of Colgalt1 Disrupts Collagen Post-translational Modification and Causes Musculoskeletal Defects</article-title>. <source>Dis. Model Mech.</source> <volume>12</volume> (<issue>6</issue>), <fpage>dmm037176</fpage>. <pub-id pub-id-type="doi">10.1242/dmm.037176</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gistelinck</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Gioia</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Gagliardi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tonelli</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Marchese</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Bianchi</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Zebrafish Collagen Type I: Molecular and Biochemical Characterization of the Major Structural Protein in Bone and Skin</article-title>. <source>Sci. Rep.</source> <volume>6</volume>, <fpage>21540</fpage>. <pub-id pub-id-type="doi">10.1038/srep21540</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gonz&#xe1;lez</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Schelbert</surname>
<given-names>E. B.</given-names>
</name>
<name>
<surname>D&#xed;ez</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Butler</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Myocardial Interstitial Fibrosis in Heart Failure: Biological and Translational Perspectives</article-title>. <source>J. Am. Coll. Cardiol.</source> <volume>71</volume> (<issue>15</issue>), <fpage>1696</fpage>&#x2013;<lpage>1706</lpage>. <pub-id pub-id-type="doi">10.1016/j.jacc.2018.02.021</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grafe</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Alexander</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Homan</surname>
<given-names>E. P.</given-names>
</name>
<name>
<surname>Lietman</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>M. M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Excessive Transforming Growth Factor-&#x3b2; Signaling Is a Common Mechanism in Osteogenesis Imperfecta</article-title>. <source>Nat. Med.</source> <volume>20</volume> (<issue>6</issue>), <fpage>670</fpage>&#x2013;<lpage>675</lpage>. <pub-id pub-id-type="doi">10.1038/nm.3544</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hennet</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Collagen Glycosylation</article-title>. <source>Curr. Opin. Struct. Biol.</source> <volume>56</volume>, <fpage>131</fpage>&#x2013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbi.2019.01.015</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hudson</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Eyre</surname>
<given-names>D. R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Collagen Prolyl 3-hydroxylation: a Major Role for a Minor Post-translational Modification?</article-title> <source>Connect. Tissue Res.</source> <volume>54</volume> (<issue>4&#x2013;5</issue>), <fpage>245</fpage>&#x2013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.3109/03008207.2013.800867</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hudson</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Joeng</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Werther</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Rajagopal</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Weis</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>B. H.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Post-translationally Abnormal Collagens of Prolyl 3-hydroxylase-2 Null Mice Offer a Pathobiological Mechanism for the High Myopia Linked to Human LEPREL1 Mutations</article-title>. <source>J. Biol. Chem.</source> <volume>290</volume> (<issue>13</issue>), <fpage>8613</fpage>&#x2013;<lpage>8622</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m114.634915</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>J&#xfc;rgensen</surname>
<given-names>H. J.</given-names>
</name>
<name>
<surname>Madsen</surname>
<given-names>D. H.</given-names>
</name>
<name>
<surname>Ingvarsen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Melander</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>G&#xe5;rdsvoll</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Patthy</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>A Novel Functional Role of Collagen Glycosylation: Interaction with the Endocytic Collagen Receptor uparap/ENDO180</article-title>. <source>J. Biol. Chem.</source> <volume>286</volume> (<issue>37</issue>), <fpage>32736</fpage>&#x2013;<lpage>32748</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M111.266692</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kivirikko</surname>
<given-names>K. I.</given-names>
</name>
<name>
<surname>Ryhanen</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Anttinen</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Bornstein</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Prockop</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>1973</year>). <article-title>Hydroxylation of Lysyl Residues in Collagen by Protocollagen Lysyl Hydroxylase <italic>In Vitro</italic>
</article-title>. <source>Biochemistry</source> <volume>12</volume> (<issue>24</issue>), <fpage>4966</fpage>&#x2013;<lpage>4971</lpage>. <pub-id pub-id-type="doi">10.1021/bi00748a023</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kong</surname>
<given-names>A. T.</given-names>
</name>
<name>
<surname>Leprevost</surname>
<given-names>F. V.</given-names>
</name>
<name>
<surname>Avtonomov</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Mellacheruvu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Nesvizhskii</surname>
<given-names>A. I.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>MSFragger: Ultrafast and Comprehensive Peptide Identification in Mass Spectrometry-Based Proteomics</article-title>. <source>Nat. Methods</source> <volume>14</volume> (<issue>5</issue>), <fpage>513</fpage>&#x2013;<lpage>520</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.4256</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Kong</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Extracellular Matrix Remodeling and Cardiac Fibrosis</article-title>. <source>Matrix Biol.</source> <volume>68-69</volume>, <fpage>490</fpage>&#x2013;<lpage>506</lpage>. <pub-id pub-id-type="doi">10.1016/j.matbio.2018.01.013</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Tu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Sheng</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Kan</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Dynamics of Zebrafish Heart Regeneration Using an HPLC-ESI-MS/MS Approach</article-title>. <source>J. Proteome Res.</source> <volume>17</volume> (<issue>3</issue>), <fpage>1300</fpage>&#x2013;<lpage>1308</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.7b00915</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>MacLean</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Tomazela</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Shulman</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Chambers</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Finney</surname>
<given-names>G. L.</given-names>
</name>
<name>
<surname>Frewen</surname>
<given-names>B.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments</article-title>. <source>Bioinformatics</source> <volume>26</volume> (<issue>7</issue>), <fpage>966</fpage>&#x2013;<lpage>968</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq054</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Merl-Pham</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Basak</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kn&#xfc;ppel</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ramanujam</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Athanason</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Behr</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Quantitative Proteomic Profiling of Extracellular Matrix and Site-specific Collagen Post-translational Modifications in an <italic>In Vitro</italic> Model of Lung Fibrosis</article-title>. <source>Matrix Biol. Plus</source> <volume>1</volume>, <fpage>100005</fpage>. <pub-id pub-id-type="doi">10.1016/j.mbplus.2019.04.002</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Missinato</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Tobita</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Romano</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Carroll</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Tsang</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Extracellular Component Hyaluronic Acid and its Receptor Hmmr Are Required for Epicardial EMT during Heart Regeneration</article-title>. <source>Cardiovasc Res.</source> <volume>107</volume> (<issue>4</issue>), <fpage>487</fpage>&#x2013;<lpage>498</lpage>. <pub-id pub-id-type="doi">10.1093/cvr/cvv190</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Miyatake</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Schneeberger</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Koyama</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Yokochi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ohmura</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Shiina</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Biallelic COLGALT1 Variants Are Associated with Cerebral Small Vessel Disease</article-title>. <source>Ann. Neurol.</source> <volume>84</volume> (<issue>6</issue>), <fpage>843</fpage>&#x2013;<lpage>853</lpage>. <pub-id pub-id-type="doi">10.1002/ana.25367</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Montgomery</surname>
<given-names>N. T.</given-names>
</name>
<name>
<surname>Zientek</surname>
<given-names>K. D.</given-names>
</name>
<name>
<surname>Pokidysheva</surname>
<given-names>E. N.</given-names>
</name>
<name>
<surname>B&#xe4;chinger</surname>
<given-names>H. P.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Post-translational Modification of Type IV Collagen with 3-hydroxyproline Affects its Interactions with Glycoprotein VI and Nidogens 1 and 2</article-title>. <source>J. Biol. Chem.</source> <volume>293</volume> (<issue>16</issue>), <fpage>5987</fpage>&#x2013;<lpage>5999</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.ra117.000406</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morello</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Bertin</surname>
<given-names>T. K.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Hicks</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tonachini</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Monticone</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>CRTAP Is Required for Prolyl 3- Hydroxylation and Mutations Cause Recessive Osteogenesis Imperfecta</article-title>. <source>Cell</source> <volume>127</volume> (<issue>2</issue>), <fpage>291</fpage>&#x2013;<lpage>304</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2006.08.039</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Myllyharju</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Intracellular Post-Translational Modifications of Collagens</article-title>. <source>Top. Curr. Chem.</source> <volume>247</volume>, <fpage>115</fpage>&#x2013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1007/b103821</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nauroy</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Hughes</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Naba</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ruggiero</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>The In-Silico Zebrafish Matrisome: A New Tool to Study Extracellular Matrix Gene and Protein Functions</article-title>. <source>Matrix Biol.</source> <volume>65</volume>, <fpage>5</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1016/j.matbio.2017.07.001</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Padmanabhan Iyer</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Chiao</surname>
<given-names>Y. A.</given-names>
</name>
<name>
<surname>Flynn</surname>
<given-names>E. R.</given-names>
</name>
<name>
<surname>Hakala</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Cates</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Weintraub</surname>
<given-names>S. T.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Matrix Metalloproteinase-9-dependent Mechanisms of Reduced Contractility and Increased Stiffness in the Aging Heart</article-title>. <source>Prot. Clin. Appl.</source> <volume>10</volume> (<issue>1</issue>), <fpage>92</fpage>&#x2013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1002/prca.201500038</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pedrioli</surname>
<given-names>P. G. A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Trans-proteomic Pipeline: a Pipeline for Proteomic Analysis</article-title>. <source>Methods Mol. Biol. Clift. NJ)</source> <volume>604</volume>, <fpage>213</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-60761-444-9_15</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perdivara</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Yamauchi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Tomer</surname>
<given-names>K. B.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Molecular Characterization of Collagen Hydroxylysine O-Glycosylation by Mass Spectrometry: Current Status</article-title>. <source>Aust. J. Chem.</source> <volume>66</volume> (<issue>7</issue>), <fpage>760</fpage>&#x2013;<lpage>769</lpage>. <pub-id pub-id-type="doi">10.1071/ch13174</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pokidysheva</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Boudko</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Vranka</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zientek</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Maddox</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Moser</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Biological Role of Prolyl 3-hydroxylation in Type IV Collagen</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>111</volume> (<issue>1</issue>), <fpage>161</fpage>&#x2013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1307597111</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Risteli</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kivirikko</surname>
<given-names>K. I.</given-names>
</name>
</person-group> (<year>1974</year>). <article-title>Activities of Prolyl Hydroxylase, Lysyl Hydroxylase, Collagen Galactosyltransferase and Collagen Glucosyltransferase in the Liver of Rats with Hepatic Injury</article-title>. <source>Biochem. J.</source> <volume>144</volume> (<issue>1</issue>), <fpage>115</fpage>&#x2013;<lpage>122</lpage>. <pub-id pub-id-type="doi">10.1042/bj1440115</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Salo</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Cox</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Farndon</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Moss</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Grindulis</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Risteli</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>A Connective Tissue Disorder Caused by Mutations of the Lysyl Hydroxylase 3 Gene</article-title>. <source>Am. J. Hum. Genet.</source> <volume>83</volume> (<issue>4</issue>), <fpage>495</fpage>&#x2013;<lpage>503</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2008.09.004</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Salo</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Myllyharju</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Prolyl and Lysyl Hydroxylases in Collagen Synthesis</article-title>. <source>Exp. Dermatol</source> <volume>30</volume> (<issue>1</issue>), <fpage>38</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1111/exd.14197</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>S&#xe1;nchez-Iranzo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Galardi-Castilla</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sanz-Morej&#xf3;n</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gonz&#xe1;lez-Rosa</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Costa</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Ernst</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Transient Fibrosis Resolves via Fibroblast Inactivation in the Regenerating Zebrafish Heart</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>115</volume> (<issue>16</issue>), <fpage>4188</fpage>&#x2013;<lpage>4193</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1716713115</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schegg</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hu&#x308;lsmeier</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Rutschmann</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Maag</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Hennet</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Core Glycosylation of Collagen Is Initiated by Two &#x3b2;(1- O)Galactosyltransferases</article-title>. <source>Mol. Cell Biol.</source> <volume>29</volume> (<issue>4</issue>), <fpage>943</fpage>&#x2013;<lpage>952</lpage>. <pub-id pub-id-type="doi">10.1128/mcb.02085-07</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scietti</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Chiapparino</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>De Giorgi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Fumagalli</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Khoriauli</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Nergadze</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Molecular Architecture of the Multifunctional Collagen Lysyl Hydroxylase and Glycosyltransferase LH3</article-title>. <source>Nat. Commun.</source> <volume>99</volume> (<issue>11</issue>), <fpage>3163</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-018-05631-5</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Taha</surname>
<given-names>I. N.</given-names>
</name>
<name>
<surname>Clauser</surname>
<given-names>K. R.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Naba</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>MatrisomeDB: the ECM-Protein Knowledge Database</article-title>. <source>Nucleic acids Res.</source> <volume>48</volume> (<issue>D1</issue>), <fpage>D1136</fpage>&#x2013;<lpage>D1144</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz849</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sipil&#xe4;</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ruotsalainen</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sormunen</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>N. L.</given-names>
</name>
<name>
<surname>Lamand&#xe9;</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Vapola</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>Secretion and Assembly of Type IV and VI Collagens Depend on Glycosylation of Hydroxylysines</article-title>. <source>J. Biol. Chem.</source> <volume>282</volume> (<issue>46</issue>), <fpage>33381</fpage>&#x2013;<lpage>33388</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M704198200</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stawikowski</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Aukszi</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Stawikowska</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cudic</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Fields</surname>
<given-names>G. B.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Glycosylation Modulates Melanoma Cell &#x3b1;2&#x3b2;1 and &#x3b1;3&#x3b2;1 Integrin Interactions with Type IV Collagen</article-title>. <source>J. Biol. Chem.</source> <volume>289</volume> (<issue>31</issue>), <fpage>21591</fpage>&#x2013;<lpage>21604</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m114.572073</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tabb</surname>
<given-names>D. L.</given-names>
</name>
<name>
<surname>Fernando</surname>
<given-names>C. G.</given-names>
</name>
<name>
<surname>Chambers</surname>
<given-names>M. C.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>MyriMatch: Highly Accurate Tandem Mass Spectral Peptide Identification by Multivariate Hypergeometric Analysis</article-title>. <source>J. Proteome Res.</source> <volume>6</volume> (<issue>2</issue>), <fpage>654</fpage>&#x2013;<lpage>661</lpage>. <pub-id pub-id-type="doi">10.1021/pr0604054</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Terajima</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Perdivara</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Sricholpech</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Deguchi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Pleshko</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Tomer</surname>
<given-names>K. B.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Glycosylation and Cross-Linking in Bone Type I Collagen</article-title>. <source>J. Biol. Chem.</source> <volume>289</volume> (<issue>33</issue>), <fpage>22636</fpage>&#x2013;<lpage>22647</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m113.528513</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Terajima</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Taga</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Cabral</surname>
<given-names>W. A.</given-names>
</name>
<name>
<surname>Hou-Fu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Srisawasdi</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Cyclophilin-B Modulates Collagen Cross-Linking by Differentially Affecting Lysine Hydroxylation in the Helical and Telopeptidyl Domains of Tendon Type I Collagen</article-title>. <source>J. Biol. Chem.</source> <volume>291</volume> (<issue>18</issue>), <fpage>9501</fpage>&#x2013;<lpage>9512</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m115.699470</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Travers</surname>
<given-names>J. G.</given-names>
</name>
<name>
<surname>Kamal</surname>
<given-names>F. A.</given-names>
</name>
<name>
<surname>Robbins</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yutzey</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Blaxall</surname>
<given-names>B. C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Cardiac Fibrosis</article-title>. <source>Circ. Res.</source> <volume>118</volume> (<issue>6</issue>), <fpage>1021</fpage>&#x2013;<lpage>1040</lpage>. <pub-id pub-id-type="doi">10.1161/circresaha.115.306565</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Huizen</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Burgers</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Saintmont</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Brocorens</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Gerbaux</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Stingl</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Identification of 4-Hydroxyproline at the Xaa Position in Collagen by Mass Spectrometry</article-title>. <source>J. Proteome Res.</source> <volume>18</volume> (<issue>5</issue>), <fpage>2045</fpage>&#x2013;<lpage>2051</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.8b00930</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weis</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Hudson</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>J.-J.</given-names>
</name>
<name>
<surname>Eyre</surname>
<given-names>D. R.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Location of 3-Hydroxyproline Residues in Collagen Types I, II, III, and V/XI Implies a Role in Fibril Supramolecular Assembly</article-title>. <source>J. Biol. Chem.</source> <volume>285</volume> (<issue>4</issue>), <fpage>2580</fpage>&#x2013;<lpage>2590</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m109.068726</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wenstrup</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Florer</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Brunskill</surname>
<given-names>E. W.</given-names>
</name>
<name>
<surname>Bell</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Chervoneva</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Birk</surname>
<given-names>D. E.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Type V Collagen Controls the Initiation of Collagen Fibril Assembly</article-title>. <source>J. Biol. Chem.</source> <volume>279</volume> (<issue>51</issue>), <fpage>53331</fpage>&#x2013;<lpage>53337</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.m409622200</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Willyard</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Unlocking the Secrets of Scar-free Skin Healing</article-title>. <source>Nature</source> <volume>563</volume> (<issue>7732</issue>), <fpage>S86</fpage>&#x2013;<lpage>S88</lpage>. <pub-id pub-id-type="doi">10.1038/d41586-018-07430-w</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yamauchi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Noyes</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Kuboki</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Mechanic</surname>
<given-names>G. L.</given-names>
</name>
</person-group> (<year>1982</year>). <article-title>Collagen Structural Microheterogeneity and a Possible Role for Glycosylated Hydroxylysine in Type I Collagen</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>79</volume>, <fpage>7684</fpage>&#x2013;<lpage>7688</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.79.24.7684</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yamauchi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sricholpech</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Lysine Post-translational Modifications of Collagen</article-title>. <source>Essays Biochem.</source> <volume>52</volume> (<issue>1</issue>), <fpage>113</fpage>&#x2013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1042/bse0520113</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>