GPI Lipid Anchor Project |
Home Page B.E. |
In the column 'number of proteins', the total number of proteins encoded in the genome/chromosome is listed. The column 'complete' shows the number of proteins without 'sequence lacks', it means that the last C-terminal 60 amino acids are represented by correct amino acid letters. The column 'failed' is just the difference between the first two columns. Additional to those having 'sequence lacks', a number of proteins is too short (length < 55 amino acids) for the prediction algorithm. The number of these proteins is presented in column "too short".
In the column 'predicted GPI-modifications', 'C' represent the number of proteins predicted as GPI-anchored with a P-value P<0.75%, 'D' with a P-value 0.75%<=P<1%. Because 'S' is the number of predictions in the twilight zone (1%<=P<1.75%), only the number of predicted proteins having additionally a positive score were counted.The number C+D+S is considered the total number of predicted GPI-anchored proteins(column 'total').The last column '%' shows the fraction of GPI-anchored proteins in the whole genome.
Last modified: 13th June 2002