<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>     caa a22        4500</leader>
  <controlfield tag="001">445879165</controlfield>
  <controlfield tag="003">CHVBK</controlfield>
  <controlfield tag="005">20180317145537.0</controlfield>
  <controlfield tag="007">cr unu---uuuuu</controlfield>
  <controlfield tag="008">170323e20110501xx      s     000 0 eng  </controlfield>
  <datafield tag="024" ind1="7" ind2="0">
   <subfield code="a">10.1007/s11030-010-9240-y</subfield>
   <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="035" ind1=" " ind2=" ">
   <subfield code="a">(NATIONALLICENCE)springer-10.1007/s11030-010-9240-y</subfield>
  </datafield>
  <datafield tag="245" ind1="0" ind2="0">
   <subfield code="a">Prediction of mucin-type O-glycosylation sites by a two-staged strategy</subfield>
   <subfield code="h">[Electronic data]</subfield>
   <subfield code="c">[YuDong Cai, JianFeng He, Lin Lu]</subfield>
  </datafield>
  <datafield tag="520" ind1="3" ind2=" ">
   <subfield code="a">The mucin-type O-glycosylation of a protein is an important type of protein post-translational modification. This process is mediated by a family of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferases, which transfer N-acetylgalactosamine (GalNAc) to serine or threonine residues with unknown specificity. To determine the glycosylation sites of a given protein, we present a two-staged prediction method, which first determines whether a protein is a glycoprotein, and then determines the glycosylation sites of a protein that has been predicted to be glycosylated in the first stage. In the first stage, a protein is encoded by the protein families in PFAM, which is a collective annotated database of classified protein families; it is then classified by a predictor trained on the training set. In the second stage, nonapeptides of the predicted mucin-type glycoproteins, with serine or threonine residues at their fifth sites, are represented by indices in AAIndex. A predictor then determines whether GalNAc is attached to each nonapeptide; this predictor is constructed with features selected by feature selection methods [the Maximum Relevance Minimum Redundancy (mRMR) method and the Incremental Feature Selection method]. The prediction accuracy of the first stage is 94.9%, validated by the Leave-One-Out validation method; the prediction accuracy of the second stage is 99.4%. These results show that this method is valuable for studying mucin-type O-glycosylation. The analysis of the features used to construct the second-stage predictor confirms results previously obtained by other groups. The residues at positions −1 and +3 have a great impact on the prediction. Among other amino acid indices, the indices for alpha and turn propensities and the indices for hydrophobicity of the residues in the nonapeptide also influence the recognition of the GalNAc transferases. A web server is available at http://chemdata.shu.edu.cn/gal/.</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
   <subfield code="a">Springer Science+Business Media B.V., 2010</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">Mucin-type O-glycosylation sites</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">PFAM</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">Minimum Redundancy Maximum Relevance</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">Feature Selection</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">Nearest Neighbor Algorithm</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">K-fold cross-validation</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">mRMR : The Maximum Relevance Minimum Redundancy</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">NNA : Nearest Neighbor Algorithm</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">AAIndex : Amino Acid Index</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="690" ind1=" " ind2="7">
   <subfield code="a">IFS method : Incremental Feature Selection</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Cai</subfield>
   <subfield code="D">YuDong</subfield>
   <subfield code="u">Institute of System Biology, Shanghai University, 99 Shangda Road, 200244, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">He</subfield>
   <subfield code="D">JianFeng</subfield>
   <subfield code="u">Department of Biomedical Engineering, Shanghai Jiao Tong University, 200040, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Lu</subfield>
   <subfield code="D">Lin</subfield>
   <subfield code="u">Department of Biomedical Engineering, Shanghai Jiao Tong University, 200040, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="773" ind1="0" ind2=" ">
   <subfield code="t">Molecular Diversity</subfield>
   <subfield code="d">Springer Netherlands</subfield>
   <subfield code="g">15/2(2011-05-01), 427-433</subfield>
   <subfield code="x">1381-1991</subfield>
   <subfield code="q">15:2&lt;427</subfield>
   <subfield code="1">2011</subfield>
   <subfield code="2">15</subfield>
   <subfield code="o">11030</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2="0">
   <subfield code="u">https://doi.org/10.1007/s11030-010-9240-y</subfield>
   <subfield code="q">text/html</subfield>
   <subfield code="z">Online access via DOI</subfield>
  </datafield>
  <datafield tag="908" ind1=" " ind2=" ">
   <subfield code="D">1</subfield>
   <subfield code="a">research-article</subfield>
   <subfield code="2">jats</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="P">856</subfield>
   <subfield code="E">40</subfield>
   <subfield code="u">https://doi.org/10.1007/s11030-010-9240-y</subfield>
   <subfield code="q">text/html</subfield>
   <subfield code="z">Online access via DOI</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="P">700</subfield>
   <subfield code="E">1-</subfield>
   <subfield code="a">Cai</subfield>
   <subfield code="D">YuDong</subfield>
   <subfield code="u">Institute of System Biology, Shanghai University, 99 Shangda Road, 200244, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="P">700</subfield>
   <subfield code="E">1-</subfield>
   <subfield code="a">He</subfield>
   <subfield code="D">JianFeng</subfield>
   <subfield code="u">Department of Biomedical Engineering, Shanghai Jiao Tong University, 200040, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="P">700</subfield>
   <subfield code="E">1-</subfield>
   <subfield code="a">Lu</subfield>
   <subfield code="D">Lin</subfield>
   <subfield code="u">Department of Biomedical Engineering, Shanghai Jiao Tong University, 200040, Shanghai, People's Republic of China</subfield>
   <subfield code="4">aut</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="P">773</subfield>
   <subfield code="E">0-</subfield>
   <subfield code="t">Molecular Diversity</subfield>
   <subfield code="d">Springer Netherlands</subfield>
   <subfield code="g">15/2(2011-05-01), 427-433</subfield>
   <subfield code="x">1381-1991</subfield>
   <subfield code="q">15:2&lt;427</subfield>
   <subfield code="1">2011</subfield>
   <subfield code="2">15</subfield>
   <subfield code="o">11030</subfield>
  </datafield>
  <datafield tag="900" ind1=" " ind2="7">
   <subfield code="a">Metadata rights reserved</subfield>
   <subfield code="b">Springer special CC-BY-NC licence</subfield>
   <subfield code="2">nationallicence</subfield>
  </datafield>
  <datafield tag="898" ind1=" " ind2=" ">
   <subfield code="a">BK010053</subfield>
   <subfield code="b">XK010053</subfield>
   <subfield code="c">XK010000</subfield>
  </datafield>
  <datafield tag="949" ind1=" " ind2=" ">
   <subfield code="B">NATIONALLICENCE</subfield>
   <subfield code="F">NATIONALLICENCE</subfield>
   <subfield code="b">NL-springer</subfield>
  </datafield>
 </record>
</collection>
