IMP GPI Biosynthesis Report IMP-Bioinformatics

GPI Anchor Biosynthesis Report: AAF18502.1 (PIG-F family, Arabidopsis thaliana)




BLASTP 2.1.1 [Aug-8-2000]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query=
         (270 letters)

Database: nr
           887,402 sequences; 277,845,442 total letters

Searching..................................................

Converged !!!


Results of PSI-Blast iteration 2

Distribution of 15 Blast Hits on the Query Sequence


Sequences with E-value BETTER than threshold (0.002)

gb|AAF18502.1|AC010924_15 (AC010924) ESTs gb|AI992787, gb|T20398... 426 e-118
ref|NP_173057.1| (NM_101473) hypothetical protein [Arabidopsis t... 249 5e-65
ref|NP_010588.1| (NC_001136) Glycosylphosphatidylinositol (GPI) ... 214 1e-54
pir||T40997 probable short-chain dehydrogenase - fission yeast ... 214 1e-54
ref|NP_032864.1| (NM_008838) phosphatidylinositol glycan, class ... 181 9e-45
ref|NP_002634.1| (NM_002643) phosphatidylinositol glycan, class ... 172 8e-42
gb|AAL48412.1| (AY070790) AT13969p [Drosophila melanogaster] 171 1e-41
gb|AAF49133.1| (AE003516) CG9376 gene product [Drosophila melano... 170 3e-41
ref|NP_173056.1| (NM_101472) unknown protein [Arabidopsis thaliana] 155 1e-36
gb|AAH21725.1|AAH21725 (BC021725) Similar to phosphatidylinosito... 147 2e-34
pir||T29643 hypothetical protein F49E8.1 - Caenorhabditis elegans 130 3e-29
ref|NP_501226.1| (NM_068825) F49E8.1.p [Caenorhabditis elegans] ... 130 4e-29
ref|XP_018442.1| (XM_018442) similar to phosphatidylinositol gly... 92 1e-17

Sequences with E-value WORSE than threshold

ref|NP_215732.1| (NC_000962) hypothetical protein Rv1216c [Mycob... 37 0.44
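
Note: the summary table above is fixed-layout text, so each hit line can be split into identifier, description, bit score, and E-value with a small parser. The Python sketch below is an illustration only. The names SUMMARY_RE, THRESHOLD, and parse_summary_line are hypothetical, and the pattern assumes that every hit line ends in a bit score followed by an E-value, as the lines in this report do. This BLAST version prints very small E-values without a mantissa (e-118), which Python's float() rejects, so the sketch prepends a 1 before converting.

```python
import re

# A hit line ends with "<bit score> <E-value>". The E-value mantissa is
# optional because this BLAST version prints e.g. "e-118" instead of "1e-118".
SUMMARY_RE = re.compile(
    r"^(?P<desc>.+?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e-\d+|\d+(?:\.\d+)?)$"
)

# Inclusion threshold quoted in the report header line.
THRESHOLD = 0.002


def parse_summary_line(line):
    """Return a dict for one hit line, or None if the line is not a hit."""
    match = SUMMARY_RE.match(line.strip())
    if match is None:
        return None
    identifier, _, description = match.group("desc").partition(" ")
    evalue_text = match.group("evalue")
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text  # "e-118" -> "1e-118"
    return {
        "id": identifier,
        "description": description.strip(),
        "bits": float(match.group("bits")),
        "evalue": float(evalue_text),
    }


# Three lines copied from the table above.
example_lines = [
    "gb|AAF18502.1|AC010924_15 (AC010924) ESTs gb|AI992787, gb|T20398... 426 e-118",
    "ref|NP_002634.1| (NM_002643) phosphatidylinositol glycan, class ... 172 8e-42",
    "ref|NP_215732.1| (NC_000962) hypothetical protein Rv1216c [Mycob... 37 0.44",
]

for text in example_lines:
    hit = parse_summary_line(text)
    side = "better" if hit["evalue"] < THRESHOLD else "worse"
    print(hit["id"], hit["bits"], hit["evalue"], side)
```

Section headers such as "Sequences with E-value BETTER than threshold (0.002)" do not match the pattern and return None, so the whole table can be passed through the function line by line.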

Alignments

>gb|AAF18502.1|AC010924_15 (AC010924) ESTs gb|AI992787, gb|T20398 come from this gene.
           [Arabidopsis thaliana]
          Length = 270

 Score =  426 bits (1084), Expect = e-118
 Identities = 270/270 (100%), Positives = 270/270 (100%)

Query: 1   MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
           MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL
Sbjct: 1   MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60

Query: 61  WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK 120
           WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK
Sbjct: 61  WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK 120

Query: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
           TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW
Sbjct: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180

Query: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240
           PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN
Sbjct: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240

Query: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
           YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP
Sbjct: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
>ref|NP_173057.1| (NM_101473) hypothetical protein [Arabidopsis thaliana]
          Length = 160

 Score =  249 bits (630), Expect = 5e-65
 Identities = 141/150 (94%), Positives = 143/150 (95%)

Query: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
           T  W F+  +  VVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW
Sbjct: 11  TSAWRFVHDLKDVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 70

Query: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240
           PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN
Sbjct: 71  PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 130

Query: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
           YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP
Sbjct: 131 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 160
>ref|NP_010588.1| (NC_001136) Glycosylphosphatidylinositol (GPI) assembly; Gpi11p
           [Saccharomyces cerevisiae]
 pir||S61188 probable membrane protein YDR302w - yeast (Saccharomyces
           cerevisiae)
 gb|AAB64738.1| (U28374) Ydr302wp [Saccharomyces cerevisiae]
          Length = 219

 Score =  214 bits (541), Expect = 1e-54
 Identities = 46/197 (23%), Positives = 72/197 (36%), Gaps = 27/197 (13%)

Query: 26  VYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKCSYF 85
           VYV     + F +H+V   Y    +S    T++LL    F I   +  L  +  +   Y 
Sbjct: 41  VYVRKTPLMTFPYHLVALLYYYVFVSSNFNTVKLL---SFLIPTQVAYLVLQFNKCTVYG 97

Query: 86  RAVGRSLVGLIAGALINALGA--------VSLGAPIGMQSLSKTIHWSFLMSVFTVVPAT 137
             + +    L    L              +  GAP+ M  L +T   S     F   PA 
Sbjct: 98  NKIIKINYSLTIICLGVTFLLSFPTMLLTILFGAPL-MDLLWETWLLSL-HFAFLAYPAV 155

Query: 138 AVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICV 197
                       +F     +G+ +   +      ++GGW     +PLDW+R WQ WPI +
Sbjct: 156 Y----------SVFNCDFKVGLWKKYFIFI----VVGGWISCVVIPLDWDRDWQNWPIPI 201

Query: 198 CYGAIGGYIGGQMLGLM 214
             G   G + G  +G  
Sbjct: 202 VVGGYLGALVGYTIGAY 218
>pir||T40997 probable short-chain dehydrogenase - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAB40182.1| (AL049559) putative short-chain dehydrogenase [Schizosaccharomyces
           pombe]
          Length = 503

 Score =  214 bits (541), Expect = 1e-54
 Identities = 61/184 (33%), Positives = 92/184 (49%), Gaps = 9/184 (4%)

Query: 31  GLFLIFGFHVVRNKYSVDLISDPTLTLRLLWIIEFPIVVIIYSL--FRRNPEKCSYFRAV 88
            L L F    +       LI +P   LR      FPI  I+ +L  + ++P      + +
Sbjct: 323 TLLLTFTQLTIFYLSLNCLIENPYRMLR----NTFPIWFIMQTLQIYIQSPRPPLTPKRL 378

Query: 89  GRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWH 148
                 ++ G+L+ +   V+ GAP+ +     T   +  +SVFTV P  + L  +   W 
Sbjct: 379 LAGAASMLIGSLLISFILVAFGAPL-LHDFHLTYFCALTLSVFTVYPLASTLAFNTEQWQ 437

Query: 149 RIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGYIGG 208
           R F +LK   +I  M L  ++G IIG WFGA+P+PLDW+RPWQ WPI +  GA  GY   
Sbjct: 438 R-FLTLKSFNVIGSMQL-RSWGPIIGAWFGAFPIPLDWDRPWQAWPITIVIGAFLGYAFA 495

Query: 209 QMLG 212
            ++G
Sbjct: 496 AIVG 499
>ref|NP_032864.1| (NM_008838) phosphatidylinositol glycan, class F [Mus musculus]
 dbj|BAA08818.1| (D50264) phosphatidylinositol glycan class F [Mus musculus]
          Length = 219

 Score =  181 bits (457), Expect = 9e-45
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 1/126 (0%)

Query: 86  RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
               +  V  +    +  +  V  GAP+    L +T  ++ ++S FT VP   +LG +  
Sbjct: 78  TRALKCCVCFLMSCFLLHIIFVLYGAPLIELVL-ETFLFAVVLSTFTTVPCLCLLGPNLK 136

Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
            W R+F+      I E+ L +    +  G W GA+P+PLDWERPWQ WPI    GA  GY
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFTGAWLGAFPIPLDWERPWQVWPISCTLGATFGY 196

Query: 206 IGGQML 211
           + G ++
Sbjct: 197 VAGLVI 202
>ref|NP_002634.1| (NM_002643) phosphatidylinositol glycan, class F [Homo sapiens]
 sp|Q07326|PIGF_HUMAN Phosphatidylinositol-glycan biosynthesis, class F protein (PIG-F)
 pir||A46097 GPI-anchor biosynthesis protein PIG-F - human
 dbj|BAA02697.1| (D13435) PIG-F [Homo sapiens]
          Length = 219

 Score =  172 bits (432), Expect = 8e-42
 Identities = 41/126 (32%), Positives = 64/126 (50%), Gaps = 1/126 (0%)

Query: 86  RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
               +  +  +       +  V  GAP+   +L +T  ++ ++S FT VP   +LG +  
Sbjct: 78  TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136

Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
            W R+F+      I E+ L +    + +G W GA P+PLDWERPWQ WPI    GA  GY
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGALPIPLDWERPWQVWPISCTLGATFGY 196

Query: 206 IGGQML 211
           + G ++
Sbjct: 197 VAGLVI 202
>gb|AAL48412.1| (AY070790) AT13969p [Drosophila melanogaster]
          Length = 236

 Score =  171 bits (430), Expect = 1e-41
 Identities = 45/161 (27%), Positives = 71/161 (43%), Gaps = 18/161 (11%)

Query: 76  RRNPEKCSYFRAVGRSLVG----LIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVF 131
           ++  +K SYF    R L+G         L+ A   + LGAP+ + +  +T   + LM++ 
Sbjct: 77  KQRQKKNSYFTP--RELLGGFTLQFLCTLLYAFICIILGAPV-LGNYEQTFVLALLMTLL 133

Query: 132 TVVPATAVLGASWIDWHRIFASLKPIGIIE------HMLLVPAYGAIIGGWFGAWPMPLD 185
           TV P   +LG       ++    KP  + +      ++    A G I+G W G+   PLD
Sbjct: 134 TVSPTVFLLGGGGA--LQVCFCEKPDFVTKCEDTALNLFKYNALGGILGAWAGSVVAPLD 191

Query: 186 WERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRV 226
           W R WQ +PI    GA    +G  +  +       Y T RV
Sbjct: 192 WGRDWQAYPIPNVIGA---LLGSALGNIYACTHVLYATARV 229
>gb|AAF49133.1| (AE003516) CG9376 gene product [Drosophila melanogaster]
          Length = 209

 Score =  170 bits (427), Expect = 3e-41
 Identities = 45/161 (27%), Positives = 71/161 (43%), Gaps = 18/161 (11%)

Query: 76  RRNPEKCSYFRAVGRSLVG----LIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVF 131
           ++  +K SYF    R L+G         L+ A   + LGAP+ + +  +T   + LM++ 
Sbjct: 50  KQRQKKNSYFTP--RELLGGFTLQFLCTLLYAFICIILGAPV-LGNYEQTFVLALLMTLL 106

Query: 132 TVVPATAVLGASWIDWHRIFASLKPIGIIE------HMLLVPAYGAIIGGWFGAWPMPLD 185
           TV P   +LG       ++    KP  + +      ++    A G I+G W G+   PLD
Sbjct: 107 TVSPTVFLLGGGGA--LQVCFCEKPDFVTKCEDTALNLFKYNALGGILGAWAGSVVAPLD 164

Query: 186 WERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRV 226
           W R WQ +PI    GA    +G  +  +       Y T RV
Sbjct: 165 WGRDWQAYPIPNVIGA---LLGSALGNIYACTHVLYATARV 202
>ref|NP_173056.1| (NM_101472) unknown protein [Arabidopsis thaliana]
          Length = 98

 Score =  155 bits (388), Expect = 1e-36
 Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1  MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
          MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL
Sbjct: 1  MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60

Query: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG 98
          WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG
Sbjct: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG 98
>gb|AAH21725.1|AAH21725 (BC021725) Similar to phosphatidylinositol glycan, class F [Homo
           sapiens]
          Length = 206

 Score =  147 bits (370), Expect = 2e-34
 Identities = 33/106 (31%), Positives = 53/106 (49%), Gaps = 1/106 (0%)

Query: 86  RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
               +  +  +       +  V  GAP+   +L +T  ++ ++S FT VP   +LG +  
Sbjct: 78  TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136

Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQ 191
            W R+F+      I E+ L +    + +G W GA P+PLDWERPWQ
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGALPIPLDWERPWQ 182

 Score =  120 bits (299), Expect = 3e-26
 Identities = 23/95 (24%), Positives = 42/95 (44%), Gaps = 1/95 (1%)

Query: 86  RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
               +  +  +       +  V  GAP+   +L +T  ++ ++S FT VP   +LG +  
Sbjct: 78  TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136

Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
            W R+F+      I E+ L +    + +G W GA 
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGAL 171
>pir||T29643 hypothetical protein F49E8.1 - Caenorhabditis elegans
          Length = 571

 Score =  130 bits (325), Expect = 3e-29
 Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 4/105 (3%)

Query: 104 LGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGA---SWIDWHRIFASLKPIGII 160
           + AV  GAP     +  T   +  ++  + +PA  +  +   +     ++F+        
Sbjct: 20  ILAVLFGAPFF-SDIIATAVLATALTAVSALPAVFLFDSEERAVEVILQLFSCEGNPTPK 78

Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
           E +LL  +  A +G W  A   PLDW+R WQ +P+    G   G 
Sbjct: 79  ESVLLFNSVFAFLGAWAAAAVHPLDWDRWWQRYPLPSLVGCFIGA 123
>ref|NP_501226.1| (NM_068825) F49E8.1.p [Caenorhabditis elegans]
 gb|AAB03156.2| (U61949) Hypothetical protein F49E8.1 [Caenorhabditis elegans]
          Length = 553

 Score =  130 bits (324), Expect = 4e-29
 Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 4/105 (3%)

Query: 104 LGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGA---SWIDWHRIFASLKPIGII 160
           + AV  GAP     +  T   +  ++  + +PA  +  +   +     ++F+        
Sbjct: 20  ILAVLFGAPFF-SDIIATAVLATALTAVSALPAVFLFDSEERAVEVILQLFSCEGNPTPK 78

Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
           E +LL  +  A +G W  A   PLDW+R WQ +P+    G   G 
Sbjct: 79  ESVLLFNSVFAFLGAWAAAAVHPLDWDRWWQRYPLPSLVGCFIGA 123
>ref|XP_018442.1| (XM_018442) similar to phosphatidylinositol glycan, class F (H.
           sapiens) [Homo sapiens]
          Length = 153

 Score = 91.5 bits (225), Expect = 1e-17
 Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 1/76 (1%)

Query: 86  RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
               +  +  +       +  V  GAP+   +L +T  ++ ++S FT+ P   +LG +  
Sbjct: 78  TRFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTIGPYLCLLGPNLK 136

Query: 146 DWHRIFASLKPIGIIE 161
            W R+F+      I  
Sbjct: 137 AWLRVFSRNGVTSIWG 152
>ref|NP_215732.1| (NC_000962) hypothetical protein Rv1216c [Mycobacterium
           tuberculosis H37Rv]
 ref|NP_335697.1| (NC_002755) hypothetical protein [Mycobacterium tuberculosis
           CDC1551]
 pir||F70610 hypothetical protein Rv1216c - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAB07818.1| (Z93777) hypothetical protein Rv1216c [Mycobacterium tuberculosis
           H37Rv]
 gb|AAK45511.1| (AE007002) hypothetical protein [Mycobacterium tuberculosis
           CDC1551]
          Length = 224

 Score = 37.0 bits (85), Expect = 0.44
 Identities = 27/138 (19%), Positives = 45/138 (32%), Gaps = 5/138 (3%)

Query: 101 INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGII 160
           +   GA+  G P G     +   +       T+ P    L  +     +      P+   
Sbjct: 15  LVVFGALLFG-PAGTFDYWQAWVFLAAFVSTTIGPTIY-LARNDPAALQRRMRSGPLAEG 72

Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQ 220
             +      GA +G +        D    W   P  VC       + G  + ++   + +
Sbjct: 73  RTIQKFIVIGAFLGFFAMMVLSACDHRYGWSSVPAAVCVIGDVLVMTGLGIAMLVVIQNR 132

Query: 221 YL--TVRVIYHGQTLSDD 236
           Y   TVRV   GQ L+ D
Sbjct: 133 YAASTVRV-EAGQILASD 149
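
Note: the "Identities" and "Gaps" figures printed with each alignment can be recounted from the aligned rows themselves. The Python sketch below is a hedged illustration; alignment_counts is a hypothetical helper, not part of BLAST. It counts columns where both rows carry the same residue and columns where either row carries a gap character. "Positives" are not recounted, because that would need the scoring matrix, which in PSI-BLAST is position-specific and is not included in this report.

```python
def alignment_counts(query_row, sbjct_row):
    """Count alignment length, identical columns, and gapped columns."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    identities = sum(
        1 for q, s in zip(query_row, sbjct_row) if q == s and q != "-"
    )
    gaps = sum(1 for q, s in zip(query_row, sbjct_row) if q == "-" or s == "-")
    return {"length": len(query_row), "identities": identities, "gaps": gaps}


# One block of the Gpi11p alignment (query 86-137, subject 98-155),
# copied from the report above.
query_row = "RAVGRSLVGLIAGALINALGA--------VSLGAPIGMQSLSKTIHWSFLMSVFTVVPAT"
sbjct_row = "NKIIKINYSLTIICLGVTFLLSFPTMLLTILFGAPL-MDLLWETWLLSL-HFAFLAYPAV"

print(alignment_counts(query_row, sbjct_row))
```

To recount a whole alignment, concatenate all Query rows and all Sbjct rows of that hit before calling the function.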

CPU time:    71.62 user secs.    4.30 sys. secs   75.92 total secs.

  Database: nr
    Posted date:  Apr 21, 2002  2:19 PM
  Number of letters in database: 277,845,442
  Number of sequences in database:  887,402
  
Lambda     K      H
   0.318    0.162    0.519 

Gapped
Lambda     K      H
   0.270   0.0569    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164362122
Number of Sequences: 887402
Number of extensions: 7081645
Number of successful extensions: 30258
Number of sequences better than 10.0: 78
Number of HSP's better than 10.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 64
Number of HSP's that attempted gapping in prelim test: 30194
Number of HSP's gapped (non-prelim): 91
length of query: 270
length of database: 277,845,442
effective HSP length: 47
effective length of query: 223
effective length of database: 236,137,548
effective search space: 52658673204
effective search space used: 52658673204
T: 11
A: 40
X1: 15 ( 6.9 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (21.0 bits)
S2: 73 (32.6 bits)
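
Note: the figures in the statistics block above are related by the standard Karlin-Altschul formulas: a raw score S becomes a bit score S' = (Lambda*S - ln K) / ln 2, a drop-off X is converted with Lambda*X / ln 2, and the expected number of chance hits is roughly the effective search space times 2^(-S'). The Python sketch below recomputes them from the printed values. The function names are hypothetical. Because Lambda and K are printed in rounded form, and PSI-BLAST may apply further per-alignment adjustments, the results should be expected to agree with the report only approximately (bit scores to within about one bit).

```python
import math

# Values copied from the statistics block of this report.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.162
GAPPED_LAMBDA, GAPPED_K = 0.270, 0.0569
QUERY_LENGTH = 270
DB_LETTERS = 277_845_442
DB_SEQUENCES = 887_402
EFFECTIVE_HSP_LENGTH = 47


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X drop-off value to bits: lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Shorten the query and every database sequence by the HSP length."""
    effective_query = query_len - hsp_len
    effective_db = db_letters - db_seqs * hsp_len
    return effective_query * effective_db


def expect_value(bits, search_space):
    """Approximate E-value: search space * 2^(-bit score)."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective search space:", space)
print("S1 in bits (ungapped):", round(bit_score(40, UNGAPPED_LAMBDA, UNGAPPED_K), 1))
print("S2 in bits (gapped):", round(bit_score(73, GAPPED_LAMBDA, GAPPED_K), 1))
print("X2 in bits (gapped):", round(dropoff_bits(38, GAPPED_LAMBDA), 1))

# Top hit: raw score 1084, reported as 426 bits with Expect = e-118.
top_bits = bit_score(1084, GAPPED_LAMBDA, GAPPED_K)
print("top hit bits:", round(top_bits, 1), "E ~", expect_value(top_bits, space))
```

The effective search space comes out as an exact integer because it uses only the printed lengths and counts; the bit scores and E-values depend on the rounded Lambda and K and are therefore approximate.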