Sequences with E-value BETTER than threshold (0.002)
gb|AAF18502.1|AC010924_15 (AC010924) ESTs gb|AI992787, gb|T20398... 426 e-118
ref|NP_173057.1| (NM_101473) hypothetical protein [Arabidopsis t... 249 5e-65
ref|NP_010588.1| (NC_001136) Glycosylphosphatidylinositol (GPI) ... 214 1e-54
pir||T40997 probable short-chain dehydrogenase - fission yeast ... 214 1e-54
ref|NP_032864.1| (NM_008838) phosphatidylinositol glycan, class ... 181 9e-45
ref|NP_002634.1| (NM_002643) phosphatidylinositol glycan, class ... 172 8e-42
gb|AAL48412.1| (AY070790) AT13969p [Drosophila melanogaster] 171 1e-41
gb|AAF49133.1| (AE003516) CG9376 gene product [Drosophila melano... 170 3e-41
ref|NP_173056.1| (NM_101472) unknown protein [Arabidopsis thaliana] 155 1e-36
gb|AAH21725.1|AAH21725 (BC021725) Similar to phosphatidylinosito... 147 2e-34
pir||T29643 hypothetical protein F49E8.1 - Caenorhabditis elegans 130 3e-29
ref|NP_501226.1| (NM_068825) F49E8.1.p [Caenorhabditis elegans] ... 130 4e-29
ref|XP_018442.1| (XM_018442) similar to phosphatidylinositol gly... 92 1e-17
Sequences with E-value WORSE than threshold
ref|NP_215732.1| (NC_000962) hypothetical protein Rv1216c [Mycob... 37 0.44
Alignments
>gb|AAF18502.1|AC010924_15 (AC010924) ESTs gb|AI992787, gb|T20398 come from this gene.
[Arabidopsis thaliana]
Length = 270
Score = 426 bits (1084), Expect = e-118
Identities = 270/270 (100%), Positives = 270/270 (100%)
Query: 1 MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL
Sbjct: 1 MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
Query: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK 120
WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK
Sbjct: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK 120
Query: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW
Sbjct: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
Query: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240
PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN
Sbjct: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240
Query: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP
Sbjct: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
>ref|NP_173057.1| (NM_101473) hypothetical protein [Arabidopsis thaliana]
Length = 160
Score = 249 bits (630), Expect = 5e-65
Identities = 141/150 (94%), Positives = 143/150 (95%)
Query: 121 TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
T W F+ + VVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW
Sbjct: 11 TSAWRFVHDLKDVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 70
Query: 181 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 240
PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN
Sbjct: 71 PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN 130
Query: 241 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 270
YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP
Sbjct: 131 YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 160
>ref|NP_010588.1| (NC_001136) Glycosylphosphatidylinositol (GPI) assembly; Gpi11p
[Saccharomyces cerevisiae]
pir||S61188 probable membrane protein YDR302w - yeast (Saccharomyces
cerevisiae)
gb|AAB64738.1| (U28374) Ydr302wp [Saccharomyces cerevisiae]
Length = 219
Score = 214 bits (541), Expect = 1e-54
Identities = 46/197 (23%), Positives = 72/197 (36%), Gaps = 27/197 (13%)
Query: 26 VYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKCSYF 85
VYV + F +H+V Y +S T++LL F I + L + + Y
Sbjct: 41 VYVRKTPLMTFPYHLVALLYYYVFVSSNFNTVKLL---SFLIPTQVAYLVLQFNKCTVYG 97
Query: 86 RAVGRSLVGLIAGALINALGA--------VSLGAPIGMQSLSKTIHWSFLMSVFTVVPAT 137
+ + L L + GAP+ M L +T S F PA
Sbjct: 98 NKIIKINYSLTIICLGVTFLLSFPTMLLTILFGAPL-MDLLWETWLLSL-HFAFLAYPAV 155
Query: 138 AVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICV 197
+F +G+ + + ++GGW +PLDW+R WQ WPI +
Sbjct: 156 Y----------SVFNCDFKVGLWKKYFIFI----VVGGWISCVVIPLDWDRDWQNWPIPI 201
Query: 198 CYGAIGGYIGGQMLGLM 214
G G + G +G
Sbjct: 202 VVGGYLGALVGYTIGAY 218
>pir||T40997 probable short-chain dehydrogenase - fission yeast
(Schizosaccharomyces pombe)
emb|CAB40182.1| (AL049559) putative short-chain dehydrogenase [Schizosaccharomyces
pombe]
Length = 503
Score = 214 bits (541), Expect = 1e-54
Identities = 61/184 (33%), Positives = 92/184 (49%), Gaps = 9/184 (4%)
Query: 31 GLFLIFGFHVVRNKYSVDLISDPTLTLRLLWIIEFPIVVIIYSL--FRRNPEKCSYFRAV 88
L L F + LI +P LR FPI I+ +L + ++P + +
Sbjct: 323 TLLLTFTQLTIFYLSLNCLIENPYRMLR----NTFPIWFIMQTLQIYIQSPRPPLTPKRL 378
Query: 89 GRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWH 148
++ G+L+ + V+ GAP+ + T + +SVFTV P + L + W
Sbjct: 379 LAGAASMLIGSLLISFILVAFGAPL-LHDFHLTYFCALTLSVFTVYPLASTLAFNTEQWQ 437
Query: 149 RIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGYIGG 208
R F +LK +I M L ++G IIG WFGA+P+PLDW+RPWQ WPI + GA GY
Sbjct: 438 R-FLTLKSFNVIGSMQL-RSWGPIIGAWFGAFPIPLDWDRPWQAWPITIVIGAFLGYAFA 495
Query: 209 QMLG 212
++G
Sbjct: 496 AIVG 499
>ref|NP_032864.1| (NM_008838) phosphatidylinositol glycan, class F [Mus musculus]
dbj|BAA08818.1| (D50264) phosphatidylinositol glycan class F [Mus musculus]
Length = 219
Score = 181 bits (457), Expect = 9e-45
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 1/126 (0%)
Query: 86 RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
+ V + + + V GAP+ L +T ++ ++S FT VP +LG +
Sbjct: 78 TRALKCCVCFLMSCFLLHIIFVLYGAPLIELVL-ETFLFAVVLSTFTTVPCLCLLGPNLK 136
Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
W R+F+ I E+ L + + G W GA+P+PLDWERPWQ WPI GA GY
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFTGAWLGAFPIPLDWERPWQVWPISCTLGATFGY 196
Query: 206 IGGQML 211
+ G ++
Sbjct: 197 VAGLVI 202
>ref|NP_002634.1| (NM_002643) phosphatidylinositol glycan, class F [Homo sapiens]
sp|Q07326|PIGF_HUMAN Phosphatidylinositol-glycan biosynthesis, class F protein (PIG-F)
pir||A46097 GPI-anchor biosynthesis protein PIG-F - human
dbj|BAA02697.1| (D13435) PIG-F [Homo sapiens]
Length = 219
Score = 172 bits (432), Expect = 8e-42
Identities = 41/126 (32%), Positives = 64/126 (50%), Gaps = 1/126 (0%)
Query: 86 RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
+ + + + V GAP+ +L +T ++ ++S FT VP +LG +
Sbjct: 78 TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136
Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
W R+F+ I E+ L + + +G W GA P+PLDWERPWQ WPI GA GY
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGALPIPLDWERPWQVWPISCTLGATFGY 196
Query: 206 IGGQML 211
+ G ++
Sbjct: 197 VAGLVI 202
>gb|AAL48412.1| (AY070790) AT13969p [Drosophila melanogaster]
Length = 236
Score = 171 bits (430), Expect = 1e-41
Identities = 45/161 (27%), Positives = 71/161 (43%), Gaps = 18/161 (11%)
Query: 76 RRNPEKCSYFRAVGRSLVG----LIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVF 131
++ +K SYF R L+G L+ A + LGAP+ + + +T + LM++
Sbjct: 77 KQRQKKNSYFTP--RELLGGFTLQFLCTLLYAFICIILGAPV-LGNYEQTFVLALLMTLL 133
Query: 132 TVVPATAVLGASWIDWHRIFASLKPIGIIE------HMLLVPAYGAIIGGWFGAWPMPLD 185
TV P +LG ++ KP + + ++ A G I+G W G+ PLD
Sbjct: 134 TVSPTVFLLGGGGA--LQVCFCEKPDFVTKCEDTALNLFKYNALGGILGAWAGSVVAPLD 191
Query: 186 WERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRV 226
W R WQ +PI GA +G + + Y T RV
Sbjct: 192 WGRDWQAYPIPNVIGA---LLGSALGNIYACTHVLYATARV 229
>gb|AAF49133.1| (AE003516) CG9376 gene product [Drosophila melanogaster]
Length = 209
Score = 170 bits (427), Expect = 3e-41
Identities = 45/161 (27%), Positives = 71/161 (43%), Gaps = 18/161 (11%)
Query: 76 RRNPEKCSYFRAVGRSLVG----LIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVF 131
++ +K SYF R L+G L+ A + LGAP+ + + +T + LM++
Sbjct: 50 KQRQKKNSYFTP--RELLGGFTLQFLCTLLYAFICIILGAPV-LGNYEQTFVLALLMTLL 106
Query: 132 TVVPATAVLGASWIDWHRIFASLKPIGIIE------HMLLVPAYGAIIGGWFGAWPMPLD 185
TV P +LG ++ KP + + ++ A G I+G W G+ PLD
Sbjct: 107 TVSPTVFLLGGGGA--LQVCFCEKPDFVTKCEDTALNLFKYNALGGILGAWAGSVVAPLD 164
Query: 186 WERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRV 226
W R WQ +PI GA +G + + Y T RV
Sbjct: 165 WGRDWQAYPIPNVIGA---LLGSALGNIYACTHVLYATARV 202
>ref|NP_173056.1| (NM_101472) unknown protein [Arabidopsis thaliana]
Length = 98
Score = 155 bits (388), Expect = 1e-36
Identities = 98/98 (100%), Positives = 98/98 (100%)
Query: 1 MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL
Sbjct: 1 MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL 60
Query: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG 98
WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG
Sbjct: 61 WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAG 98
>gb|AAH21725.1|AAH21725 (BC021725) Similar to phosphatidylinositol glycan, class F [Homo
sapiens]
Length = 206
Score = 147 bits (370), Expect = 2e-34
Identities = 33/106 (31%), Positives = 53/106 (49%), Gaps = 1/106 (0%)
Query: 86 RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
+ + + + V GAP+ +L +T ++ ++S FT VP +LG +
Sbjct: 78 TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136
Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQ 191
W R+F+ I E+ L + + +G W GA P+PLDWERPWQ
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGALPIPLDWERPWQ 182
Score = 120 bits (299), Expect = 3e-26
Identities = 23/95 (24%), Positives = 42/95 (44%), Gaps = 1/95 (1%)
Query: 86 RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
+ + + + V GAP+ +L +T ++ ++S FT VP +LG +
Sbjct: 78 TGFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTTVPCLCLLGPNLK 136
Query: 146 DWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW 180
W R+F+ I E+ L + + +G W GA
Sbjct: 137 AWLRVFSRNGVTSIWENSLQITTISSFVGAWLGAL 171
>pir||T29643 hypothetical protein F49E8.1 - Caenorhabditis elegans
Length = 571
Score = 130 bits (325), Expect = 3e-29
Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 4/105 (3%)
Query: 104 LGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGA---SWIDWHRIFASLKPIGII 160
+ AV GAP + T + ++ + +PA + + + ++F+
Sbjct: 20 ILAVLFGAPFF-SDIIATAVLATALTAVSALPAVFLFDSEERAVEVILQLFSCEGNPTPK 78
Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
E +LL + A +G W A PLDW+R WQ +P+ G G
Sbjct: 79 ESVLLFNSVFAFLGAWAAAAVHPLDWDRWWQRYPLPSLVGCFIGA 123
>ref|NP_501226.1| (NM_068825) F49E8.1.p [Caenorhabditis elegans]
gb|AAB03156.2| (U61949) Hypothetical protein F49E8.1 [Caenorhabditis elegans]
Length = 553
Score = 130 bits (324), Expect = 4e-29
Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 4/105 (3%)
Query: 104 LGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGA---SWIDWHRIFASLKPIGII 160
+ AV GAP + T + ++ + +PA + + + ++F+
Sbjct: 20 ILAVLFGAPFF-SDIIATAVLATALTAVSALPAVFLFDSEERAVEVILQLFSCEGNPTPK 78
Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGY 205
E +LL + A +G W A PLDW+R WQ +P+ G G
Sbjct: 79 ESVLLFNSVFAFLGAWAAAAVHPLDWDRWWQRYPLPSLVGCFIGA 123
>ref|XP_018442.1| (XM_018442) similar to phosphatidylinositol glycan, class F (H.
sapiens) [Homo sapiens]
Length = 153
Score = 91.5 bits (225), Expect = 1e-17
Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 1/76 (1%)
Query: 86 RAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWI 145
+ + + + V GAP+ +L +T ++ ++S FT+ P +LG +
Sbjct: 78 TRFLKCCIYFLMSCFSFHVIFVLYGAPLIELAL-ETFLFAVILSTFTIGPYLCLLGPNLK 136
Query: 146 DWHRIFASLKPIGIIE 161
W R+F+ I
Sbjct: 137 AWLRVFSRNGVTSIWG 152
>ref|NP_215732.1| (NC_000962) hypothetical protein Rv1216c [Mycobacterium
tuberculosis H37Rv]
ref|NP_335697.1| (NC_002755) hypothetical protein [Mycobacterium tuberculosis
CDC1551]
pir||F70610 hypothetical protein Rv1216c - Mycobacterium tuberculosis (strain
H37RV)
emb|CAB07818.1| (Z93777) hypothetical protein Rv1216c [Mycobacterium tuberculosis
H37Rv]
gb|AAK45511.1| (AE007002) hypothetical protein [Mycobacterium tuberculosis
CDC1551]
Length = 224
Score = 37.0 bits (85), Expect = 0.44
Identities = 27/138 (19%), Positives = 45/138 (32%), Gaps = 5/138 (3%)
Query: 101 INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGII 160
+ GA+ G P G + + T+ P L + + P+
Sbjct: 15 LVVFGALLFG-PAGTFDYWQAWVFLAAFVSTTIGPTIY-LARNDPAALQRRMRSGPLAEG 72
Query: 161 EHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQ 220
+ GA +G + D W P VC + G + ++ + +
Sbjct: 73 RTIQKFIVIGAFLGFFAMMVLSACDHRYGWSSVPAAVCVIGDVLVMTGLGIAMLVVIQNR 132
Query: 221 YL--TVRVIYHGQTLSDD 236
Y TVRV GQ L+ D
Sbjct: 133 YAASTVRV-EAGQILASD 149
CPU time: 71.62 user secs. 4.30 sys. secs 75.92 total secs.
Database: nr
Posted date: Apr 21, 2002 2:19 PM
Number of letters in database: 277,845,442
Number of sequences in database: 887,402
Lambda K H
0.318 0.162 0.519
Gapped
Lambda K H
0.270 0.0569 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164362122
Number of Sequences: 887402
Number of extensions: 7081645
Number of successful extensions: 30258
Number of sequences better than 10.0: 78
Number of HSP's better than 10.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 64
Number of HSP's that attempted gapping in prelim test: 30194
Number of HSP's gapped (non-prelim): 91
length of query: 270
length of database: 277,845,442
effective HSP length: 47
effective length of query: 223
effective length of database: 236,137,548
effective search space: 52658673204
effective search space used: 52658673204
T: 11
A: 40
X1: 15 ( 6.9 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (21.0 bits)
S2: 73 (32.6 bits)
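
Note on the statistics trailer: the effective lengths, the search space, and the bit values shown in parentheses appear to follow the usual Karlin-Altschul relations. The sketch below recomputes them from the numbers printed in this report. It is a minimal illustration, not BLAST's own code: the function names (bit_score, dropoff_bits) are hypothetical, and the pairing of X1/S1 with the ungapped parameters and of X2/X3/S2 with the gapped parameters is an assumption inferred from the printed values. Expect values are not recomputed, since the program may apply corrections that are not visible in this output.

```python
import math

# Values copied from the report trailer above.
QUERY_LEN = 270
DB_LETTERS = 277_845_442
DB_SEQS = 887_402
HSP_LEN = 47                       # "effective HSP length"
UNGAPPED = (0.318, 0.162)          # (Lambda, K), first table
GAPPED = (0.270, 0.0569)           # (Lambda, K), "Gapped" table


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff (a score difference) to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


# Effective lengths: subtract the HSP length once from the query
# and once per database sequence from the database.
eff_query = QUERY_LEN - HSP_LEN
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
search_space = eff_query * eff_db

if __name__ == "__main__":
    print("effective length of query:   ", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:      ", search_space)
    print("X1 bits:", round(dropoff_bits(15, UNGAPPED[0]), 1))
    print("X2 bits:", round(dropoff_bits(38, GAPPED[0]), 1))
    print("S1 bits:", round(bit_score(40, *UNGAPPED), 1))
    print("S2 bits:", round(bit_score(73, *GAPPED), 1))
    print("top hit bits (raw 1084):", round(bit_score(1084, *GAPPED)))
```

Because Lambda and K are printed to only three significant digits, bit scores recomputed this way can differ from the reported ones by a fraction of a bit.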