Occurence of potentially GPI modified proteins in plants
Contact:
Birgit Eisenhaber (IMP/Austria)
Michael Wildpaner (IMP/Austria)
Frank Eisenhaber (IMP/Austria)
Carolyn Schultz (University of Adelaide/Australia)
Paul Dupree and Georg Borner (University of Cambridge/UK)
Data sheets: Learning set and prediction results
- Derivation of motif properties
Comparison with amino acid property scales
Analysis of volume compensation in the omega-site region
- Details of prediction method derivation and validation
Prediction function parametrization
The plant learning set - a list of 219 proteins
The self-consistency-test of the plant learning set
Problem with COBRA-like CAA74765.1 protein - alignment of the COBRA-family
The jack-knife-test I over the whole plant learning set
The jack-knife-test II over the largest subset of non-homologous sequences only
- Prediction results: Occurence of potentially GPI modified proteins in
the Arabidopsis thaliana genome (June 2002)
the Oryza sativa genome (June 2002)
the SPTrEMBL database (July 2002, pln.dat)
the SWISSPROT database (rel. 40, Viridiplantae)
- Functional classification of
Arabidopsis thaliana hits
Oryza sativa hits
- Comparison with previous published data
comparison with A.thaliana protein lists created by Borner GHH et al. (2002, 2003)
Acknowledgement:
The protein sequences for the complete genomes
were taken from the following web-pages:
Arabidopsis thaliana:
ftp://ftp.tigr.org/pub/data/a_thaliana/ath1/SEQUENCES/
Oryza sativa:
ftp://ftp.tigr.org/pub/data/o_sativa/irgsp/PUBLICATION_RELEASE/GENOME/OSA1.pep
Last modified: 7th March 2003