Sequences with E-value BETTER than threshold
Sequences producing significant alignments: Score (bits) E Value
gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia] 887 0.0
ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidy... 404 e-111
ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, cla... 401 e-111
pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse >gi... 395 e-109
pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like pr... 395 e-109
pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fissi... 371 e-101
ref|NP_495840.1| (NM_063439) phosphatidylinositol biosyntheti... 368 e-101
ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidy... 337 2e-91
pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Sa... 333 3e-90
prf||1804343A SPT14 gene [Saccharomyces cerevisiae] 326 5e-88
gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila mel... 322 8e-87
ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol ... 191 2e-47
emb|CAB57276.1| (X77725) PIG-A [Homo sapiens] 165 1e-39
pir||I52665 class A GlcNAc-inositol phospholipid assembly pro... 131 2e-29
ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus... 115 2e-24
ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIO... 106 5e-22
ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidy... 101 3e-20
ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, cla... 99 2e-19
ref|NP_070556.1| (NC_000917) galactosyltransferase [Archaeogl... 93 1e-17
gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus fu... 92 2e-17
ref|NP_266369.1| (NC_002662) LPS biosynthesis protein [Lactoc... 89 1e-16
ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis s... 86 1e-15
ref|NP_295278.1| (NC_001263) conserved hypothetical protein [... 84 4e-15
ref|NP_489278.1| (NC_003272) glycosyltransferase [Nostoc sp. ... 84 4e-15
gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus fu... 79 2e-13
gb|AAC77851.1| (U38473) putative glycosyl transferase [Escher... 78 2e-13
ref|NP_487738.1| (NC_003272) heterocyst envelope polysacchari... 77 4e-13
emb|CAB57789.1| (AJ250131) HepD protein [Nostoc sp. PCC 7120] 77 5e-13
gb|AAL80915.1| (AE010196) hypothetical protein [Pyrococcus fu... 77 6e-13
ref|NP_416548.1| (NC_000913) putative colanic acid biosynthes... 77 7e-13
gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus fu... 77 7e-13
ref|NP_288550.1| (NC_002655) putative colanic acid biosynthes... 77 7e-13
ref|NP_147773.1| (NC_000854) capM protein [Aeropyrum pernix] ... 75 3e-12
gb|AAL80431.1| (AE010155) glycosyltransferase [Pyrococcus fur... 73 6e-12
ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related pr... 73 9e-12
ref|NP_242281.1| (NC_002570) BH1415~unknown conserved protein... 72 2e-11
ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. ... 72 3e-11
ref|NP_360212.1| (NC_003103) capM protein [Rickettsia conorii... 72 3e-11
ref|NP_248617.1| (NC_000909) LPS biosynthesis protein, putati... 71 5e-11
dbj|BAB18471.1| (AB033991) BtrM [Bacillus circulans] 70 9e-11
ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. ... 70 1e-10
ref|NP_563139.1| (NC_003366) probable mannosyltransferase B [... 68 2e-10
ref|NP_302182.1| (NC_002677) putative transferase [Mycobacter... 68 2e-10
ref|NP_390127.1| (NC_000964) alternate gene name: jojH~simila... 68 2e-10
ref|NP_214125.1| (NC_000918) capsular polysaccharide biosynth... 67 7e-10
ref|NP_248171.1| (NC_000909) conserved hypothetical protein [... 66 1e-09
gb|AAK20702.1|AF316641_8 (AF316641) WciS [Streptococcus pneum... 63 7e-09
emb|CAB43611.1| (AJ239004) galactosyl transferase [Streptococ... 63 7e-09
ref|NP_349652.1| (NC_003030) Glycosyltransferase [Clostridium... 58 4e-07
gb|AAL67552.1|AF461121_3 (AF461121) putative galactosyltransf... 57 7e-07
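The summary lines above can be read programmatically. What follows is a minimal sketch, not part of the BLAST report itself: it assumes each line is whitespace-separated and ends with the bit score followed by the E-value, and that a bare exponent such as e-111 is BLAST shorthand for 1e-111. The function names parse_evalue and parse_hit_line are hypothetical helpers introduced only for illustration.

```python
def parse_evalue(text):
    """Convert a BLAST E-value string to a float.

    Assumption: a value printed as 'e-111' stands for 1e-111.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Split one summary line into (identifier, description, bit score, E-value).

    Assumption: the last two whitespace-separated fields are the score
    and the E-value; the first field is the database identifier.
    """
    rest, score, evalue = line.rsplit(None, 2)
    identifier, description = rest.split(None, 1)
    return identifier, description, float(score), parse_evalue(evalue)


# Three lines copied from the summary table above.
hits = [
    parse_hit_line("gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia] 887 0.0"),
    parse_hit_line("ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidy... 404 e-111"),
    parse_hit_line("prf||1804343A SPT14 gene [Saccharomyces cerevisiae] 326 5e-88"),
]

for identifier, _description, score, evalue in hits:
    print(identifier, score, evalue)
```

Splitting from the right keeps the free-text description intact even when it contains spaces or a truncation marker ("..."), which a left-to-right split would break apart.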
Alignments
>gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia]
Length = 442
Score = 887 bits (2267), Expect = 0.0
Identities = 442/442 (100%), Positives = 442/442 (100%)
Query: 1 MVNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYY 60
MVNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYY
Sbjct: 1 MVNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYY 60
Query: 61 CPFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVF 120
CPFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVF
Sbjct: 61 CPFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVF 120
Query: 121 TDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVD 180
TDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVD
Sbjct: 121 TDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVD 180
Query: 181 CSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKK 240
CSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKK
Sbjct: 181 CSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKK 240
Query: 241 ILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVV 300
ILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVV
Sbjct: 241 ILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVV 300
Query: 301 STNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAE 360
STNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAE
Sbjct: 301 STNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAE 360
Query: 361 RTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGIHK 420
RTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGIHK
Sbjct: 361 RTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGIHK 420
Query: 421 PGIFNQIYKNQKEKVWGSSIQS 442
PGIFNQIYKNQKEKVWGSSIQS
Sbjct: 421 PGIFNQIYKNQKEKVWGSSIQS 442
>ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidylinositol-like protein
[Arabidopsis thaliana]
gb|AAK62657.1| (AY039602) AT3g45100/T14D3_40 [Arabidopsis thaliana]
Length = 447
Score = 404 bits (1027), Expect = e-111
Identities = 203/419 (48%), Positives = 282/419 (66%), Gaps = 1/419 (0%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ + ++ DFF+P GGVE HI+ L CL++ G KV+++TH Y RSGVRYMT GLKVYY
Sbjct: 7 LRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYV 66
Query: 62 PFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121
P+ P + T GTLPI R IL RE+I +VH H A S L E L+HA++MG+K VFT
Sbjct: 67 PWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFT 126
Query: 122 DHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDC 181
DHSL+ F D S H+NK+L++ L +ID +I VSH SKEN +R+ L P + +IPNAVD
Sbjct: 127 DHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDT 186
Query: 182 SRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKI 241
+ F P R + I IVVI R+ +RKG DLLV+V+ +C+ +P + F++GGDGPK
Sbjct: 187 AMFKP-ASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVR 245
Query: 242 LEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVS 301
LEE ++++LQ++ E+LG+VP +V+ VL GHIFLN+SLTEAFCIAI+EAASCGL VS
Sbjct: 246 LEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVS 305
Query: 302 TNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAER 361
T VGG+ EVLP +MV+ A+P P+D+ I +AI I + H +KK+YSW+ VA+R
Sbjct: 306 TRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMHNRMKKLYSWQDVAKR 365
Query: 362 TEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGIHK 420
TE VY + L+ N+++L+R S G G +++I D + +L LQP + I +
Sbjct: 366 TEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCMVMILDYLLWRLLQLLQPDEDIEE 424
>ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, class A isoform 1;
Phosphatidylinositol glycan, class A; GLCNAC-PI
synthesis protein [Homo sapiens]
sp|P37287|PIGA_HUMAN N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein
(GlcNac-PI synthesis protein)
(Phosphatidylinositol-glycan biosynthesis, class A
protein) (PIG-A)
pir||A46217 GPI-anchor biosynthesis protein PIG-A - human
dbj|BAA02019.1| (D11466) PIG-A protein [Homo sapiens]
dbj|BAA05966.1| (D28791) PIG-A protein [Homo sapiens]
Length = 484
Score = 401 bits (1021), Expect = e-111
Identities = 202/421 (47%), Positives = 284/421 (66%), Gaps = 16/421 (3%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NIC++ DFFYP +GGVE HI+QL CLIERG KVII+TH Y R G+RY+T+GLKVYY P
Sbjct: 34 NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122
T +LP+ R I +RE + I+HSH++ S + + L HAK+MG +TVFTD
Sbjct: 94 LKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
Query: 123 HSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCS 182
HSLF F D +S NK+L LC+ +H I VS+ SKEN +RA+L+P +SVIPNAVD +
Sbjct: 154 HSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPT 213
Query: 183 RFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKIL 242
FTP+P +R+ ++I IVV+ R+ +RKG+DLL ++ +C+++P++ FIIGG+GPK+ IL
Sbjct: 214 DFTPDPFRRH--DSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIIL 271
Query: 243 EETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVST 302
EE +RY L ++ LLG++ V++VL +GHIFLNTSLTEAFC+AIVEAASCGL VVST
Sbjct: 272 EEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVST 331
Query: 303 NVGGISEVLPQNMVLYADPTPEDISHKITQAI--------PIAKNFYVYQQHELVKKMYS 354
VGGI EVLP+N+++ +P+ + + + +AI P +N H +VK Y+
Sbjct: 332 RVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENI-----HNIVKTFYT 386
Query: 355 WEQVAERTEKVYYKILQTQNQTILKRFKDCYSN-GQIYGLFLMILLIFDLIFLMILDFLQ 413
W VAERTEKVY ++ + KR S+ G + G +L +F+ +FL+ L ++
Sbjct: 387 WRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMT 446
Query: 414 P 414
P
Sbjct: 447 P 447
>pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse
pir||I52484 gene PIG-A protein - mouse
dbj|BAA05047.1| (D26047) Pig-a precursor [Mus musculus]
dbj|BAA06663.1| (D31863) PIG-A protein [Mus musculus]
Length = 485
Score = 395 bits (1005), Expect = e-109
Identities = 200/421 (47%), Positives = 281/421 (66%), Gaps = 15/421 (3%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NIC++ DFFYP +GGVE HI+QL CLIERG KVI +TH Y R GVRY+TNGLKVYY P
Sbjct: 34 NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVITVTHAYGNRKGVRYLTNGLKVYYLP 93
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122
T +LP+ R I +RE I I+HSH++ S + + L HAK+MG +TVFTD
Sbjct: 94 LRVMYNQSTATTLFHSLPLLRYIFVRERITIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
Query: 123 HSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCS 182
HSLF F D +S NK+L LC+ +H I VS+ SKEN +RA+L+P +SVIPNAVD +
Sbjct: 154 HSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPT 213
Query: 183 RFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKIL 242
FTP+P +R+ + I +VV+ R+ +RKG DLL ++ +C+++ E++F+IGG+GPK+ IL
Sbjct: 214 DFTPDPFRRHD-SVITVVVVSRLVYRKGTDLLSGIIPELCQKYQELHFLIGGEGPKRIIL 272
Query: 243 EETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVST 302
EE +RY L ++ +LLG++ V++VL +GHIFLNTSLTEAFC+AIVEAASCGL VVST
Sbjct: 273 EEVRERYQLHDRVQLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVST 332
Query: 303 NVGGISEVLPQNMVLYADPTPEDISHKITQAI--------PIAKNFYVYQQHELVKKMYS 354
VGGI EVLP+++++ +P+ + + + +AI P +N H +VK Y+
Sbjct: 333 KVGGIPEVLPESLIILCEPSVKSLCDGLEKAIFQVKSGTLPAPENI-----HNVVKTFYT 387
Query: 355 WEQVAERTEKVYYKILQTQNQTILKRFKDCYSN-GQIYGLFLMILLIFDLIFLMILDFLQ 413
W VAERTEKVY ++ + + KR S+ G + G +L + +FL+ L ++
Sbjct: 388 WRNVAERTEKVYERVSKETVLPMHKRLDRLISHCGPVTGYMFALLAVLSYLFLIFLQWMT 447
Query: 414 P 414
P
Sbjct: 448 P 448
>pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like protein -
Arabidopsis thaliana
emb|CAB72148.1| (AL138649) n-acetylglucosaminyl-phosphatidylinositol-like protein
[Arabidopsis thaliana]
Length = 450
Score = 395 bits (1005), Expect = e-109
Identities = 202/422 (47%), Positives = 281/422 (65%), Gaps = 4/422 (0%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ + ++ DFF+P GGVE HI+ L CL++ G KV+++TH Y RSGVRYMT GLKVYY
Sbjct: 7 LRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYV 66
Query: 62 PFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121
P+ P + T GTLPI R IL RE+I +VH H A S L E L+HA++MG+K VFT
Sbjct: 67 PWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFT 126
Query: 122 DHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDC 181
DHSL+ F D S H+NK+L++ L +ID +I VSH SKEN +R+ L P + +IPNAVD
Sbjct: 127 DHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDT 186
Query: 182 SRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKI 241
+ F P R + I IVVI R+ +RKG DLLV+V+ +C+ +P + F++GGDGPK
Sbjct: 187 AMFKP-ASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVR 245
Query: 242 LEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVS 301
LEE ++++LQ++ E+LG+VP +V+ VL GHIFLN+SLTEAFCIAI+EAASCGL VS
Sbjct: 246 LEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVS 305
Query: 302 TNVGGI---SEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQV 358
T VGG +VLP +MV+ A+P P+D+ I +AI I + H +KK+YSW+ V
Sbjct: 306 TRVGGFLHGLQVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMHNRMKKLYSWQDV 365
Query: 359 AERTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGI 418
A+RTE VY + L+ N+++L+R S G G +++I D + +L LQP + I
Sbjct: 366 AKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCMVMILDYLLWRLLQLLQPDEDI 425
Query: 419 HK 420
+
Sbjct: 426 EE 427
>pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fission yeast
(Schizosaccharomyces pombe)
emb|CAB09127.1| (Z95620) n-acetylglucosaminyl-phosphatidylinositol
[Schizosaccharomyces pombe]
Length = 456
Score = 371 bits (944), Expect = e-101
Identities = 193/415 (46%), Positives = 270/415 (64%), Gaps = 3/415 (0%)
Query: 6 LICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP 65
++ DFF+P GG+E HIFQL LI+ G KVI+ITH Y+ R GVRY+TNGL VYY P
Sbjct: 1 MVSDFFFPQPGGIESHIFQLSQRLIDLGHKVIVITHAYKDRVGVRYLTNGLTVYYVPLHT 60
Query: 66 AIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSL 125
+ ++ PIFR I++RE I IVH H + S+L + +LHA++MG KT FTDHSL
Sbjct: 61 VYRETTFPSFFSFFPIFRNIVIRENIEIVHGHGSLSFLCHDAILHARTMGLKTCFTDHSL 120
Query: 126 FAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFT 185
F F DA S NK+LK+ + +++H I VSH +EN +RA L+P+ +SVIPNA+ F
Sbjct: 121 FGFADAGSIVTNKLLKFTMSDVNHVICVSHTCRENTVLRAVLNPKRVSVIPNALVAENFQ 180
Query: 186 PNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEET 245
P+P K + + IVVI R+ + KG+DLL+ V+ IC QHP++ F+I GDGPK LE+
Sbjct: 181 PDPSKASK-DFLTIVVISRLYYNKGIDLLIAVIPRICAQHPKVRFVIAGDGPKSIDLEQM 239
Query: 246 IQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVG 305
++Y LQ++ E+LGSV QV+DV+ RGHI+L+ SLTEAF +VEAASCGL V+ST VG
Sbjct: 240 REKYMLQDRVEMLGSVRHDQVRDVMVRGHIYLHPSLTEAFGTVLVEAASCGLYVISTKVG 299
Query: 306 GISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQ--HELVKKMYSWEQVAERTE 363
G+ EVLP +M +A P +D++ ++ I + + + HE VK+MYSW VAERTE
Sbjct: 300 GVPEVLPSHMTRFARPEEDDLADTLSSVITDYLDHKIKTETFHEEVKQMYSWIDVAERTE 359
Query: 364 KVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILDFLQPHKGI 418
KVY I N ++ R K Y GQ G +L+ D + +++L+++ P I
Sbjct: 360 KVYDSICSENNLRLIDRLKLYYGCGQWAGKLFCLLIAIDYLVMVLLEWIWPASDI 414
>ref|NP_495840.1| (NM_063439) phosphatidylinositol biosynthetic protein
[Caenorhabditis elegans]
pir||T20374 hypothetical protein D2085.6 - Caenorhabditis elegans
emb|CAA91062.1| (Z54284) contains similarity to Pfam domain: PF00534 (Glycosyl
transferases group 1), Score=91.6, E-value=9.5e-25,
N=1~cDNA EST yk349e7.5 comes from this gene
[Caenorhabditis elegans]
Length = 444
Score = 368 bits (936), Expect = e-101
Identities = 190/420 (45%), Positives = 272/420 (64%), Gaps = 7/420 (1%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
+I L+ DFF P GGVE HI+ L CLIE G +V++ITH Y R G+RY++NGLKVYY P
Sbjct: 9 SIALVSDFFCPNAGGVETHIYFLAQCLIELGHRVVVITHGYGNRKGIRYLSNGLKVYYLP 68
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122
FI A L + VG++P R++LLRE + I+H H+ S L E L+ MG +TVFTD
Sbjct: 69 FIVAYNGATLGSIVGSMPWLRKVLLRENVQIIHGHSTFSSLAHETLMIGGLMGLRTVFTD 128
Query: 123 HSLFAFNDAASFHVNK-ILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDC 181
HSLF F DA++ NK +L+Y L +D +I VS+ SKEN +R LDP +S IPNA++
Sbjct: 129 HSLFGFADASAILTNKLVLQYSLINVDQTICVSYTSKENTVLRGKLDPNKVSTIPNAIET 188
Query: 182 SRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKI 241
S FTP+ + ++ N IV + R+ +RKG DLL +++ +C +H + FIIGGDGPK+
Sbjct: 189 SLFTPD-RNQFFNNPTTIVFLGRLVYRKGADLLCEIVPKVCARHKSVRFIIGGDGPKRIE 247
Query: 242 LEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVS 301
LEE ++R+ L + +LG +P +QVK VLN+G IF+NTSLTEAFC++IVEAASCGL VVS
Sbjct: 248 LEEMLERFKLHERVVILGMLPHNQVKRVLNQGQIFINTSLTEAFCMSIVEAASCGLHVVS 307
Query: 302 TNVGGISEVLP-QNMVLYADPTPEDISHKITQAIPIAKNFYVY---QQHELVKKMYSWEQ 357
T VGG+ EVLP + +P P+D+ + +A+ + + ++HE V KMY+W
Sbjct: 308 TRVGGVPEVLPIGEFISLEEPVPDDLVDALLKAVDRREKGLLMDPTEKHEAVSKMYNWPD 367
Query: 358 VAERTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFDLIFLMILD-FLQPHK 416
VA RT+ +Y K ++++ L R K Y G +G+ +++ + +L +LD F P K
Sbjct: 368 VAARTQVIYQKAVESEPTGRLGRLKGYYDQGIGFGIMYIVVSCIIIFWLTVLDLFDSPRK 427
>ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
protein; Spt14p [Saccharomyces cerevisiae]
sp|P32363|GPI3_YEAST N-ACETYLGLUCOSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN
(GLCNAC-PI SYNTHESIS PROTEIN)
emb|CAA44924.1| (X63290) trans-acting transcription factor [Saccharomyces
cerevisiae]
Length = 452
Score = 337 bits (856), Expect = 2e-91
Identities = 189/428 (44%), Positives = 256/428 (59%), Gaps = 12/428 (2%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NI ++CDFFYP LGGVE HI+ L LI+ G V+IITH Y+ R GVR++TNGLKVY+ P
Sbjct: 4 NIAMLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVP 63
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122
F + T T PI R ILLRE+I IVHSH + S E +LHA +MG +TVFTD
Sbjct: 64 FFVIFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTD 123
Query: 123 HSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCS 182
HSL+ FN+ S VNK+L + L ID I VS+ KEN+ +R L P ISVIPNAV
Sbjct: 124 HSLYGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSE 183
Query: 183 RFTP-----NPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGP 237
F P +++ + I IVVI R+ KG DLL ++ +C H ++ FI+ GDGP
Sbjct: 184 DFKPRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGP 243
Query: 238 KKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGL 297
K ++ I+ + LQ + +LLGSVP +V+DVL +G I+L+ SLTEAF +VEAASC L
Sbjct: 244 KFIDFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNL 303
Query: 298 CVVSTNVGGISEVLPQNMVLYADPTP-EDISHKITQAIPI--AKNFYVYQQHELVKKMYS 354
+V+T VGGI EVLP M +YA+ T D+ +AI I +K H+ V KMY
Sbjct: 304 LIVTTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDTSSFHDSVSKMYD 363
Query: 355 WEQVAERTEKVYYKILQTQ---NQTILKRFKDCYSNGQIYGLFLMILL-IFDLIFLMILD 410
W VA+RT ++Y I T ++ +K + Y I+ L +L I + + +L+
Sbjct: 364 WMDVAKRTVEIYTNISSTSSADDKDWMKMVANLYKRDGIWAKHLYLLCGIVEYMLFFLLE 423
Query: 411 FLQPHKGI 418
+L P I
Sbjct: 424 WLYPRDEI 431
>pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Saccharomyces
cerevisiae)
emb|CAA97882.1| (Z73531) ORF YPL175w [Saccharomyces cerevisiae]
Length = 461
Score = 333 bits (846), Expect = 3e-90
Identities = 187/425 (44%), Positives = 254/425 (59%), Gaps = 12/425 (2%)
Query: 6 LICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP 65
++CDFFYP LGGVE HI+ L LI+ G V+IITH Y+ R GVR++TNGLKVY+ PF
Sbjct: 16 MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 75
Query: 66 AIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSL 125
+ T T PI R ILLRE+I IVHSH + S E +LHA +MG +TVFTDHSL
Sbjct: 76 IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 135
Query: 126 FAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFT 185
+ FN+ S VNK+L + L ID I VS+ KEN+ +R L P ISVIPNAV F
Sbjct: 136 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 195
Query: 186 P-----NPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKK 240
P +++ + I IVVI R+ KG DLL ++ +C H ++ FI+ GDGPK
Sbjct: 196 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 255
Query: 241 ILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVV 300
++ I+ + LQ + +LLGSVP +V+DVL +G I+L+ SLTEAF +VEAASC L +V
Sbjct: 256 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 315
Query: 301 STNVGGISEVLPQNMVLYADPTP-EDISHKITQAIPI--AKNFYVYQQHELVKKMYSWEQ 357
+T VGGI EVLP M +YA+ T D+ +AI I +K H+ V KMY W
Sbjct: 316 TTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDTSSFHDSVSKMYDWMD 375
Query: 358 VAERTEKVYYKILQTQ---NQTILKRFKDCYSNGQIYGLFLMILL-IFDLIFLMILDFLQ 413
VA+RT ++Y I T ++ +K + Y I+ L +L I + + +L++L
Sbjct: 376 VAKRTVEIYTNISSTSSADDKDWMKMVANLYKRDGIWAKHLYLLCGIVEYMLFFLLEWLY 435
Query: 414 PHKGI 418
P I
Sbjct: 436 PRDEI 440
>prf||1804343A SPT14 gene [Saccharomyces cerevisiae]
Length = 415
Score = 326 bits (828), Expect = 5e-88
Identities = 182/404 (45%), Positives = 244/404 (60%), Gaps = 11/404 (2%)
Query: 6 LICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP 65
++CDFFYP LGGVE HI+ L LI+ G V+IITH Y+ R GVR++TNGLKVY+ PF
Sbjct: 1 MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 60
Query: 66 AIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSL 125
+ T T PI R ILLRE+I IVHSH + S E +LHA +MG +TVFTDHSL
Sbjct: 61 IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 120
Query: 126 FAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFT 185
+ FN+ S VNK+L + L ID I VS+ KEN+ +R L P ISVIPNAV F
Sbjct: 121 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 180
Query: 186 P-----NPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKK 240
P +++ + I IVVI R+ KG DLL ++ +C H ++ FI+ GDGPK
Sbjct: 181 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 240
Query: 241 ILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVV 300
++ I+ + LQ + +LLGSVP +V+DVL +G I+L+ SLTEAF +VEAASC L +V
Sbjct: 241 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 300
Query: 301 STNVGGISEVLPQNMVLYADPTP-EDISHKITQAIPI--AKNFYVYQQHELVKKMYSWEQ 357
+T VGGI EVLP M +YA+ T D+ +AI I +K H+ V KMY W
Sbjct: 301 TTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDTSSFHDSVSKMYDWMD 360
Query: 358 VAERTEKVYYKILQTQ---NQTILKRFKDCYSNGQIYGLFLMIL 398
VA+RT ++Y I T ++ +K + Y I+ L +L
Sbjct: 361 VAKRTVEIYTNISSTSSADDKDWMKMVANLYKRDGIWAKHLYLL 404
>gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila melanogaster]
Length = 479
Score = 322 bits (817), Expect = 8e-87
Identities = 159/320 (49%), Positives = 222/320 (68%), Gaps = 1/320 (0%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ IC++ DFFYP +GGVE H++ L L+ G K++++TH Y SG+RY+T LKVYY
Sbjct: 1 MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
Query: 62 PFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121
P +L T V +P+ R +LLRE + +VH H+A S L E L+ +G KTVFT
Sbjct: 61 PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120
Query: 122 DHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDC 181
DHSLF F D ++ N +L+ L ++H+I VSH+ KEN +RA + +SVIPNAVD
Sbjct: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180
Query: 182 SRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKI 241
+ FTP+PQ+R + INIVV R+ +RKG+DLL ++ K P I FII GDGPK+ +
Sbjct: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDL 239
Query: 242 LEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVS 301
LEE ++ N+Q + +++G+V ++V+D L RGHIFLNTSLTEA+C+AIVEAASCGL VVS
Sbjct: 240 LEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVS 299
Query: 302 TNVGGISEVLPQNMVLYADP 321
T+VGGI EVLP++++L A+P
Sbjct: 300 TSVGGIPEVLPKSLILLAEP 319
Score = 38.4 bits (88), Expect = 0.22
Identities = 24/77 (31%), Positives = 44/77 (56%), Gaps = 6/77 (7%)
Query: 343 YQQHELVKKMYSWEQVAERTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFD 402
Y+ +ELV+ +Y+WE VA RT KVY ++L ++ T + + +G + +F ++
Sbjct: 397 YRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSWFLVFFVVAH--- 453
Query: 403 LIFLM-ILDFLQPHKGI 418
FLM +L+ +P K +
Sbjct: 454 --FLMRLLELWRPRKHV 468
>ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol glycan, class A isoform
1; Phosphatidylinositol glycan, class A; GLCNAC-PI
synthesis protein [Homo sapiens]
Length = 280
Score = 191 bits (482), Expect = 2e-47
Identities = 99/216 (45%), Positives = 136/216 (62%), Gaps = 3/216 (1%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NIC+ DFFYP +GGVE HI+QL CLI RG KVII+ H Y R G+RY+TN LKVYY P
Sbjct: 34 NICMASDFFYPNMGGVESHIYQLPQCLIGRGDKVIIVIHAYGNRKGIRYLTNDLKVYYLP 93
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122
+ T +LP+ + I ++E + I+HSH++ S + ++L HAK+MG +TV TD
Sbjct: 94 LKVMYNQSMAMTLFHSLPLLKYIFVQERVTIIHSHSSFSAMAHDVLFHAKTMGLQTVLTD 153
Query: 123 HSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCS 182
H L F S NK+L LC+ I VS+ SKEN +RA+L +SVIPNAVD
Sbjct: 154 HPLSGFAKVHSVLTNKLLTVSLCDTSRIICVSYTSKENTVLRAALITEIVSVIPNAVDPI 213
Query: 183 RFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVL 218
FTP+P +R+ TI V+ R+ +RKG +L+ ++
Sbjct: 214 DFTPDPFRRHDSITI---VVSRLVYRKGTNLVSGII 246
>emb|CAB57276.1| (X77725) PIG-A [Homo sapiens]
Length = 248
Score = 165 bits (415), Expect = 1e-39
Identities = 85/172 (49%), Positives = 120/172 (69%), Gaps = 15/172 (8%)
Query: 208 RKG--VDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQ 265
RKG +DLL ++ +C+++P++ FIIGG+GPK+ ILEE +RY L ++ LLG++
Sbjct: 77 RKGIRIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKD 136
Query: 266 VKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPED 325
V++VL +GHIFLNTSLTEAFC+AIVEAASCGL VVST VGGI EVLP+N+++ +P+ +
Sbjct: 137 VRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKS 196
Query: 326 ISHKITQAI--------PIAKNFYVYQQHELVKKMYSWEQVAERTEKVYYKI 369
+ + +AI P +N H +VK Y+W VAERTEKVY ++
Sbjct: 197 LCEGLEKAIFQLKSGTLPAPENI-----HNIVKTFYTWRNVAERTEKVYDRV 243
Score = 75.8 bits (184), Expect = 1e-12
Identities = 35/70 (50%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRY-MTNGLKVYYC 61
NIC++ DFFYP +GGVE HI+QL CLIERG KVII+TH Y R G+R + +G+ C
Sbjct: 34 NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRIDLLSGIIPELC 93
Query: 62 PFIPAIQTVV 71
P + ++
Sbjct: 94 QKYPDLNFII 103
>pir||I52665 class A GlcNAc-inositol phospholipid assembly protein PIG-A - human
gb|AAD14160.1|S74936_1 (S74936) class A GlcNAc-inositol phospholipid assembly protein
[Homo sapiens]
Length = 315
Score = 131 bits (328), Expect = 2e-29
Identities = 73/170 (42%), Positives = 105/170 (60%), Gaps = 14/170 (8%)
Query: 254 QTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQ 313
+ LLG++ V++VL +GHIFLNTSLTEAFC+AIVEAASCGL VVST VGGI EVLP+
Sbjct: 114 RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE 173
Query: 314 NMVLYADPTPEDISHKITQAI--------PIAKNFYVYQQHELVKKMYSWEQVAERTEKV 365
N+++ +P+ + + + +AI P +N H +VK Y+W VAERTEKV
Sbjct: 174 NLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENI-----HNIVKTFYTWRNVAERTEKV 228
Query: 366 YYKILQTQNQTILKRFKDCYSN-GQIYGLFLMILLIFDLIFLMILDFLQP 414
Y ++ + KR S+ G + G +L +F+ +FL+ L ++ P
Sbjct: 229 YDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTP 278
Score = 100 bits (248), Expect = 4e-20
Identities = 47/85 (55%), Positives = 57/85 (66%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NIC++ DFFYP +GGVE HI+QL CLIERG KVII+TH Y R G+RY+T+GLKVYY P
Sbjct: 34 NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
Query: 63 FIPAIQTVVLFTYVGTLPIFRQILL 87
T +LP+ R LL
Sbjct: 94 LKVMYNQSTATTLFHSLPLLRVRLL 118
>ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
pir||F71196 probable hexosyltransferase (EC 2.4.1.-) PH1844 - Pyrococcus
horikoshii
dbj|BAA30965.1| (AP000007) 381aa long hypothetical protein [Pyrococcus horikoshii]
Length = 381
Score = 115 bits (286), Expect = 2e-24
Identities = 104/388 (26%), Positives = 186/388 (47%), Gaps = 30/388 (7%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ I L+ D++YP +GGV H+ L + L ERG +V I+T+ G+++
Sbjct: 4 MKIALVSDWYYPKIGGVATHMHNLAIKLRERGHEVGIVTNNRPTGKEEELKRYGIELIKI 63
Query: 62 PFIPAIQTVVLFTY-VGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVF 120
P I + V TY + + + L ++ I+HSH A + L + L K+M T+
Sbjct: 64 PGIISPFLDVNLTYGLKSSEELNEFL--KDFDIIHSHHAFTPLSLKALKAGKNMEKGTLL 121
Query: 121 TDHSL-FAFN----DAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVI 175
T HS+ FA D F + Y + +S + VSK S + ++
Sbjct: 122 TTHSISFAHESKLWDTLGFTIPLFKSY----LKYSHRIIAVSKAAKSFIEHFTSVPVLIV 177
Query: 176 PNAVDCSRFTPNPQK-----RYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYF 230
PN VD RF P K ++ L ++ + RM++RKG +L++ I +
Sbjct: 178 PNGVDDERFFPARDKEKIKAKFGLEGNVVLYVSRMSYRKGPHVLLNAFSKI----EDATL 233
Query: 231 IIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSL-TEAFCIAI 289
++ G+G L+ + ++N+ +G VP + +V +F+ S+ +EAF I I
Sbjct: 234 VMVGNGEMLPFLKAQTKFLGIENKVVFMGYVPDDILPEVFRMADVFVLPSISSEAFGIVI 293
Query: 290 VEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAI-PIAKN-----FYVY 343
+EA + G+ +++T+VGGI EV+ +N P ++ K+ +AI + KN +Y
Sbjct: 294 LEAMASGVPIIATDVGGIPEVIKENSAGLLVPPGNEL--KLREAIEKLLKNEELRKWYGN 351
Query: 344 QQHELVKKMYSWEQVAERTEKVYYKILQ 371
V++ YSW ++ + E++Y ++LQ
Sbjct: 352 NGRRSVEEKYSWNKIVVKIERIYNEVLQ 379
>ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
PROTEIN [Pyrococcus abyssi]
pir||A75033 probable hexosyltransferase (EC 2.4.1.-) PAB0827 [similarity] -
Pyrococcus abyssi (strain Orsay)
emb|CAB50158.1| (AJ248287) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
PROTEIN [Pyrococcus abyssi]
Length = 371
Score = 106 bits (264), Expect = 5e-22
Identities = 91/373 (24%), Positives = 178/373 (47%), Gaps = 24/373 (6%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ I L+ D+++P +GGV +H+ L + L + G +V I+T+ G+ +
Sbjct: 4 LKIALVSDWYFPKIGGVAIHVHNLAIHLRKMGHEVSIVTNALTNGKEGELQKYGIDLIKV 63
Query: 62 PFI--PAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTV 119
P + I ++ +L + + +VH+ A + L + + +G T+
Sbjct: 64 PGLIKDGINLSMIAKSSNSL-----VEYLKGFDVVHAQHAFTPLSLKSIPAGNKVGALTL 118
Query: 120 FTDHSL----FAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVI 175
T+HS+ F+ + S K L ++ I VSK ++S I I
Sbjct: 119 VTNHSVEFENFSILNGFSKMSYSYFKMYLGQVKVGIG---VSKASVSFLRKFTNAPIVEI 175
Query: 176 PNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGD 235
PN V+ RF ++ T NI+ + R+ RKGV+ L+ ++ + E I GD
Sbjct: 176 PNGVNIERFNGRGRE---WGTRNILYVGRLEPRKGVNYLISAMKFV-----EGKLTIVGD 227
Query: 236 GPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASC 295
G +K+L+ ++ ++++ E LG + ++ + + +F+ SL+EAF I ++EA +
Sbjct: 228 GSMRKVLKMQAKKLGVEDKVEFLGFISQEELILLYKKSEVFVLPSLSEAFGIVLLEAMAS 287
Query: 296 GLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQ--HELVKKMY 353
+ V+ T+VGGI E++ ++ + +++ I + K + + V+++Y
Sbjct: 288 EVPVIGTSVGGIPEIIGDAGIIVPPRDSKALANAINAILSNQKTAKRLGKLGRKRVERLY 347
Query: 354 SWEQVAERTEKVY 366
SW+ VAERTE++Y
Sbjct: 348 SWDVVAERTERLY 360
>ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
protein [Aeropyrum pernix]
pir||D72511 probable hexosyltransferase (EC 2.4.1.-) APE2066 [similarity] -
Aeropyrum pernix (strain K1)
dbj|BAA81076.1| (AP000063) 392aa long hypothetical
N-acetylglucosaminyl-phosphatidylinositol biosynthetic
protein [Aeropyrum pernix]
Length = 392
Score = 101 bits (249), Expect = 3e-20
Identities = 94/375 (25%), Positives = 178/375 (47%), Gaps = 24/375 (6%)
Query: 4 ICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPF 63
I ++ DF +GGV+ H+ L L + G V+I++ + G+ V+ + P
Sbjct: 22 IVMVMDFHPSSVGGVQSHVRDLTRLLQDFGYDVVIVS-RALGKGDVKDLEAEGHYIVKPL 80
Query: 64 IPAIQTVVLFTYVGTLPIFRQI--LLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121
P ++F + R+I L + +H H + TS L L A+ +G + T
Sbjct: 81 FP---LEIIFVPPDPSDLRREIESLKPDVVHSHHIYTLTSLLA---LKAARDLGLPRIAT 134
Query: 122 DHSLF-AFNDAASFHVNKIL---KYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPN 177
+HS+F A++ A + + I+ +Y+L ISVS + + + D + +IPN
Sbjct: 135 NHSIFLAYDKVALWRIASIVLPTRYLLPNAQAVISVSTAADKMVEGIVG-DSVDRYIIPN 193
Query: 178 AVDCSRFTPN-PQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDG 236
VD RF P+ P+ YPL ++ + R+ +RKG +LV + + + + IGG G
Sbjct: 194 GVDVERFKPSTPKADYPL----VLFLGRLVWRKGAHVLVRAFRHVVDEIRDAKLYIGGKG 249
Query: 237 PKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSL-TEAFCIAIVEAASC 295
+ I++ I RY L+N ++LG VP + + + + S+ E+F I +E+ S
Sbjct: 250 EFEPIIKLLIARYGLENNVKMLGVVPESEKPSLYSSAWVTAVPSIVNESFGIVALESLSS 309
Query: 296 GLCVVSTNVGGISEVLPQNM--VLYADPTPEDISHKITQAIPIA--KNFYVYQQHELVKK 351
G VV++ GG+ +V+ +L + ++++ + + + + + ++V +
Sbjct: 310 GTPVVASRQGGLKDVVKHGKTGLLVKPGSSKELAKALITLLQDSGLRKRMSEEARKIVLE 369
Query: 352 MYSWEQVAERTEKVY 366
Y W +V + KVY
Sbjct: 370 RYDWRKVVPQILKVY 384
>ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, class A isoform 2;
Phosphatidylinositol glycan, class A; GLCNAC-PI
synthesis protein [Homo sapiens]
Length = 118
Score = 98.8 bits (243), Expect = 2e-19
Identities = 45/81 (55%), Positives = 55/81 (67%)
Query: 3 NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62
NIC++ DFFYP +GGVE HI+QL CLIERG KVII+TH Y R G+RY+T+GLKVYY P
Sbjct: 34 NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
Query: 63 FIPAIQTVVLFTYVGTLPIFR 83
T +LP+ R
Sbjct: 94 LKVMYNQSTATTLFHSLPLLR 114
>ref|NP_070556.1| (NC_000917) galactosyltransferase [Archaeoglobus fulgidus]
pir||G69465 probable hexosyltransferase (EC 2.4.1.-) AF1728 - Archaeoglobus
fulgidus
gb|AAB89517.1| (AE000983) galactosyltransferase [Archaeoglobus fulgidus]
Length = 356
Score = 92.6 bits (227), Expect = 1e-17
Identities = 89/380 (23%), Positives = 169/380 (44%), Gaps = 35/380 (9%)
Query: 2 VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61
+ + L+ +F P +GGVE+H+ ++ L RG +V+++T GR + +V Y
Sbjct: 1 MKVVLLSSYFPPHIGGVEVHVERIAHHLHRRGFEVVVVTSTASGREKFPF-----RVEYV 55
Query: 62 PFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121
P IP Y P + L + + I HSH + L KS T
Sbjct: 56 PSIP-------IPYSPITPFLGRFLEKIDGDIFHSHTPPPFFSCSL---RKSPHVITYHC 105
Query: 122 D------HSLFAFNDAASFHVNKILKYILCE-IDHSISVSHVSKENLSMRASLDPRNISV 174
D + F A S + + +L E +D + ++ +K L R+ V
Sbjct: 106 DIEIPEKYGRFPIPRALSKLIIRRTDDMLSEALDRADAIVATTKSYAETSRLLAGRDYHV 165
Query: 175 IPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGG 234
IPN ++ S F ++ P ++ + R+ KGVD+L+ ++ + E +I G
Sbjct: 166 IPNGIELSEFEGVEAEKEP----TVLFLGRLAATKGVDVLLKAMKHV---DVEARCVIIG 218
Query: 235 DGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT--EAFCIAIVEA 292
DG ++ LE + L+ E G +P +V + L+R + + SL+ EAF I ++EA
Sbjct: 219 DGEERSSLERLARE--LEVNAEFTGFLPRKKVIEYLSRASLLVLPSLSRLEAFGIVLLEA 276
Query: 293 ASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQ--HELVK 350
+CG V ++++ G+ +V + ++ +S I + + + + +V+
Sbjct: 277 MACGTPVAASDLPGVRDVASEAGFVFPPGDYMRLSEIINEVLSDERKVKAIGESGRRIVR 336
Query: 351 KMYSWEQVAERTEKVYYKIL 370
+ YSW+ V + ++Y ++
Sbjct: 337 EKYSWDVVVKSLIRLYESLI 356
>gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus furiosus DSM 3638]
Length = 358
Score = 92.2 bits (226), Expect = 2e-17
Identities = 86/361 (23%), Positives = 163/361 (44%), Gaps = 17/361 (4%)
Query: 25 LGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIPAIQTVVLFTYVGTLPIFRQ 84
L + L ERG +V I+T+ G+ + P + + V TY +
Sbjct: 4 LAIKLRERGHEVGIVTNNRVTGKEKELEKYGIDLIKIPGVVSPLLEVNITYGLKSSELNE 63
Query: 85 ILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSL-FAFNDAASFHVNKILKYI 143
L ++HSH A L + + ++M T+ T HS+ FA + +
Sbjct: 64 FL--NNFDVIHSHHAFMPLALKAVKAGRTMEKATLLTTHSISFAHESKLWDTLGLTIPLF 121
Query: 144 LCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQK-----RYPLNTIN 198
+ + + VSK S ++S++PN VD +RF P K ++ L
Sbjct: 122 RSYLKYPHRIIAVSKAAKSFIEHFTSVSVSIVPNGVDDTRFFPAKHKDKIKAKFGLEGNI 181
Query: 199 IVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELL 258
++ + RM++RKG +L++ I + ++ G G L+ + ++ + +
Sbjct: 182 VLYVSRMSYRKGPHVLLNAFSKI----EDATLVMVGSGEMLPFLKAQAKFLGIEERVVFM 237
Query: 259 GSVPGHQVKDVLNRGHIFLNTSLT-EAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMV- 316
G VP + +V +F+ S++ EAF I ++EA + G+ VV+T+VGGI E++ +N
Sbjct: 238 GYVPDDALPEVFRMADVFVLPSVSAEAFGIVVLEAMASGVPVVATDVGGIPEIIKENEAG 297
Query: 317 LYADPTPEDISHKITQAI---PIAKNFYVYQQHELVKKMYSWEQVAERTEKVYYKILQTQ 373
L P E + TQ + + +Y + V++ YSW+++ E++Y ++L+ Q
Sbjct: 298 LLVPPGNELKLREATQKLLKNEELRKWYGMNGRKAVEEKYSWDKIVVEIERIYSEVLEEQ 357
Query: 374 N 374
+
Sbjct: 358 S 358
>ref|NP_266369.1| (NC_002662) LPS biosynthesis protein [Lactococcus lactis subsp.
lactis]
gb|AAK04311.1|AE006259_5 (AE006259) LPS biosynthesis protein [Lactococcus lactis subsp.
lactis]
Length = 379
Score = 89.5 bits (219), Expect = 1e-16
Identities = 88/378 (23%), Positives = 171/378 (44%), Gaps = 19/378 (5%)
Query: 4 ICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPF 63
+ + ++ P LGGVE + + + L E+G +VIIIT ++ + G+K+Y P
Sbjct: 6 VAIFNGYYIPHLGGVERYTYNIAKKLTEKGYRVIIITTQHDENLTNEEIQEGIKIYRLPI 65
Query: 64 IPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLL---HAKSMGFKTVF 120
+ + ++ I+ ++ + E + + A + +L AK+ G + +
Sbjct: 66 KNLWKN--RYPFLKKNRIYHSLIEKIEAESIDYYVANTRFHLPAMLGVKMAKAKGKEAIV 123
Query: 121 TDHS---LFAFNDAASFHVNKILKYILCEIDHSISVSH-VSKENLSMRASLDPRNISVIP 176
+H L N F + KI + ++ + S+ + VS E + D + V+P
Sbjct: 124 IEHGSSYLTLNNPVLDFMLRKIEQLLIGRVKKDTSLFYGVSNEASEWLKTFDIKAKGVLP 183
Query: 177 NAVDCSRFTPNPQKRYPLNTINIVVICRMTFR-KGVDLLVDVLQIICKQHPEIYFIIGGD 235
NAV + N + + I R+ + KGV++L+ + K+ + II GD
Sbjct: 184 NAVAVDEYF-NQKIEKDEKKLTISYAGRLIPQMKGVEILLSTFSKLSKERKNLELIIAGD 242
Query: 236 GPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASC 295
GP +L E ++Y+ Q + LG VP +V ++ + +F+ S +E F A++EAA
Sbjct: 243 GP---LLNEVKRKYS-QKNIKFLGYVPYEKVLEIDAKSDVFVLMSRSEGFATAMLEAAML 298
Query: 296 GLCVVST-NVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKM-- 352
+++T VGG +++P Y E + + K Q ++ K +
Sbjct: 299 ENVIITTPTVGGARDIMPDETYGYIIENNETKLFETLTKVLDNKEHMRLMQKKISKNVLE 358
Query: 353 -YSWEQVAERTEKVYYKI 369
++WEQ A++ KV+ ++
Sbjct: 359 NFTWEQSAKQFIKVFNEL 376
>ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis sp. PCC 6803]
pir||S74777 hypothetical protein slr1076 - Synechocystis sp. (strain PCC 6803)
dbj|BAA16928.1| (D90901) ORF_ID:slr1076~unknown protein [Synechocystis sp. PCC
6803]
Length = 381
Score = 85.9 bits (210), Expect = 1e-15
Identities = 75/261 (28%), Positives = 120/261 (45%), Gaps = 23/261 (8%)
Query: 114 MGFKTVFTDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNIS 173
MG H + A+N + H+ + L++ D ++VSH +++ L +LDP +
Sbjct: 112 MGISYWTVAHGVDAWN-LQNPHIIQALRH----ADRILAVSHYTRDRLLQEQALDPEKVV 166
Query: 174 VIPNAVDCSRFTPNPQKRYPLNTIN-------IVVICRMTFR---KGVDLLVDVLQIICK 223
V+PN D SRF P+ + L N I+ I R+ KG D ++ L I K
Sbjct: 167 VLPNTFDTSRFQIAPKPQSLLEKYNLTPDQQVILTIARLAGEERYKGYDQIIRALPEIIK 226
Query: 224 QHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTE 283
P I+++IGG G + +E+ IQ +L++ L G +P ++ D N +F S E
Sbjct: 227 TIPNIHYLIGGKGGDRPRIEKLIQDLDLEDYVTLAGFIPDEELADHYNLCDVFAMPSKGE 286
Query: 284 AFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAKNFY-- 341
F I +EA +CG + N G + L N L P+D+ T I + Y
Sbjct: 287 GFGIVYLEAMACGKPTIGGNQDGAIDALC-NGELGVLVNPDDLDEISTVITQILEKTYPL 345
Query: 342 --VYQQHELVKK---MYSWEQ 357
+YQ L +K +Y +EQ
Sbjct: 346 PILYQPETLRQKVIEIYGFEQ 366
>ref|NP_295278.1| (NC_001263) conserved hypothetical protein [Deinococcus
radiodurans]
pir||E75381 conserved hypothetical protein - Deinococcus radiodurans (strain
R1)
gb|AAF11118.1|AE001999_2 (AE001999) conserved hypothetical protein [Deinococcus radiodurans]
Length = 411
Score = 84.4 bits (206), Expect = 4e-15
Identities = 82/290 (28%), Positives = 133/290 (45%), Gaps = 24/290 (8%)
Query: 84 QILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVF------TDHSLFAFNDAASFHVN 137
+++L + + H+H A + LHA+S+ KT TD +L A
Sbjct: 113 EVILEHGVDLTHAHYAIPHASAA--LHARSITGKTRVLTTLHGTDVTLVGTEPA----FQ 166
Query: 138 KILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRF--TPNPQKRYPLN 195
++ + DH +VSH +D R+I VI N VD RF P+P R
Sbjct: 167 HTTRHAIERSDHVTAVSHSLAAETREVFGVD-RDIEVIHNFVDSDRFRRIPDPGVRARFA 225
Query: 196 TINIVVICRMTFRKGVDLLVDVLQIICKQHPEI--YFIIGGDGPKKKILEETIQRYNLQN 253
+I ++ + + + DV+Q+ + EI ++ GDGP++ E + +
Sbjct: 226 HPEEALIVHVSNFRPIKRVEDVVQVFARIASEIPARLLMIGDGPERARAFELARELGVIG 285
Query: 254 QTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQ 313
+T+ LGS P V+ VL +FL TS E+F +A +EA SC + VV++N GGI EV+
Sbjct: 286 RTQFLGSFP--DVQTVLGISDLFLLTSSHESFGLAALEAMSCEVPVVASNAGGIPEVVQH 343
Query: 314 --NMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAER 361
N L +D++H A+ I ++ YQQ + + EQ R
Sbjct: 344 GVNGFLSDVGDVDDMAH---HALKILRDQETYQQMGQAARRTAVEQFHPR 390
>ref|NP_489278.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
dbj|BAB76937.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
Length = 382
Score = 84.0 bits (205), Expect = 4e-15
Identities = 73/254 (28%), Positives = 117/254 (45%), Gaps = 14/254 (5%)
Query: 137 NKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNT 196
N +K L D ++VSH +++ + + L+P +S++PN SRF P P+ Y L
Sbjct: 129 NAEVKKSLHHADQILAVSHYTRDRIIEKHRLNPDKVSILPNTFASSRFKPAPKPNYLLRK 188
Query: 197 IN-------IVVICRMT---FRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETI 246
I+ + R+ KG D ++ L I + P ++++I G G K +E I
Sbjct: 189 YQLKPEQQIILTVARLAEAQRYKGYDQILQALPHIRQLIPNVHYVIVGKGNDKHRIESMI 248
Query: 247 QRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGG 306
+ LQN L G VP Q+ D N +F S E F I +EA +CG V+ N G
Sbjct: 249 VQQGLQNCVTLAGFVPDEQLCDYYNLCDVFAMPSKREGFGIVYLEALACGKPVLGGNQDG 308
Query: 307 ISEVLPQ-NMVLYADP-TPEDISHKITQAIP-IAKNFYVYQQHELVKKMYSWEQVAERTE 363
++ L + DP E+I+ + Q + I N +YQ L +K+ + ER +
Sbjct: 309 ANDALCHGELGALVDPDNVEEIALTLIQILQGIYPNQLMYQPDALRQKVIDYFGF-ERFQ 367
Query: 364 KVYYKILQTQNQTI 377
K L + Q+I
Sbjct: 368 ATLAKYLDKRLQSI 381
>gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus furiosus DSM 3638]
Length = 383
Score = 78.5 bits (191), Expect = 2e-13
Identities = 76/301 (25%), Positives = 135/301 (44%), Gaps = 25/301 (8%)
Query: 84 QILLREEIHIVHSHAATSYLGG---ELLLHAKSMGFKTVFTDHSLFAFNDAASFHVNKIL 140
+++ RE + +HA ++ G +L + F V T H L +N +L
Sbjct: 91 KVIKRENLKFKIAHAHFTWPSGYATHILKRTHKIPF--VVTTHGLH------DTRMNFLL 142
Query: 141 KYILCEIDHSI-SVSHVSKE--NLSMRASLDPRNISVIPNAVDCSRFTPN------PQKR 191
K E+ S ++ +VS++ L MR + + IPN VD S F P +
Sbjct: 143 KNGAMEVWKSADAIINVSRKCVKLLMRVGIPEDKLYYIPNGVDTSLFYPQETALIRKELN 202
Query: 192 YPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNL 251
P++ ++ + + +KG + L+ ++II ++ I G+GP +K LE + L
Sbjct: 203 IPIDKKILISVGNLVEKKGFEYLIRAMKIILHARDDVLLYIIGEGPLRKRLENITRELKL 262
Query: 252 QNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVL 311
+ L+G P + +N G +F+ SL E F + +EA +CG V+ST GG EV+
Sbjct: 263 EEHVFLVGPKPHRDIPLWINAGDLFVLPSLVENFGVVNIEALACGKPVISTINGGSEEVI 322
Query: 312 PQNM--VLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAERTEKVYYKI 369
+L PE ++ KI A+ + ++ + + W +A + KVY +
Sbjct: 323 TSEEYGLLCPPRDPECLAEKILMAL---NKEWDREKIRKYAEQFDWRNIARQIFKVYEDV 379
Query: 370 L 370
L
Sbjct: 380 L 380
>gb|AAC77851.1| (U38473) putative glycosyl transferase [Escherichia coli]
Length = 406
Score = 78.2 bits (190), Expect = 2e-13
Identities = 48/146 (32%), Positives = 83/146 (55%), Gaps = 7/146 (4%)
Query: 172 ISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFI 231
I+V AVD +RF+P P K P + I+ + R+T +KG+ + ++ + + +Q +
Sbjct: 199 IAVSRMAVDMTRFSPRPVKA-PATPLEIISVARLTEKKGLHVAIEACRQLKEQGVAFRYR 257
Query: 232 IGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT------EAF 285
I G GP ++ L I++Y L++ E+ G P H+VK +L+ +FL S+T E
Sbjct: 258 ILGIGPWERRLRTLIEQYQLEDVVEMPGFKPSHEVKAMLDDADVFLLPSVTGADGDMEGI 317
Query: 286 CIAIVEAASCGLCVVSTNVGGISEVL 311
+A++EA + G+ VVST GI E++
Sbjct: 318 PVALMEAMAVGIPVVSTLHSGIPELV 343
>ref|NP_487738.1| (NC_003272) heterocyst envelope polysaccharide synthesis protein
[Nostoc sp. PCC 7120]
gb|AAB08106.1| (U68035) HepB [Anabaena sp.]
dbj|BAB75397.1| (AP003594) heterocyst envelope polysaccharide synthesis protein
[Nostoc sp. PCC 7120]
Length = 389
Score = 77.4 bits (188), Expect = 4e-13
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 30/283 (10%)
Query: 108 LLHAKSMGFKTVFTDHSLFAFNDAASFHVNKI---LKYILCE------IDHSISVSHVSK 158
+L G F H +A NKI LK L E D I +S
Sbjct: 109 ILDILPQGIPITFNFHGPWASESKQELVKNKISIFLKRRLIEQTTYNHCDRFIVLSKAFG 168
Query: 159 ENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTIN-------IVVICRMTFRKGV 211
L + + I +IP V+ +F PN ++ +N + R+ R GV
Sbjct: 169 NILHQQYQIPWHKIHIIPGGVNIDKFQPNLSRQQARQQLNWPESRPILFTSRRLVHRVGV 228
Query: 212 DLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLN 271
D L+ L II + P+I+ I G G + LE+ Q L+N + LG +P Q+
Sbjct: 229 DKLLQALAIIKPKLPDIWLAIAGRGHLQTTLEKQAQELGLENNVKFLGFLPDEQLPIAYQ 288
Query: 272 RGHIFLNTSLT-EAFCIAIVEAASCGLCVVSTNVGGISEVL----PQNMVLYADPTPEDI 326
++ + S + E F +AI E+ +CG V+ T +GG+ E+L PQ ++ A P I
Sbjct: 289 AANLTVMPSQSFEGFGLAITESLACGTPVLCTPIGGMPEILTPFSPQ--LITASPEATAI 346
Query: 327 SHKITQ----AIPIAKNFYVYQQHELVKKMYSWEQVAERTEKV 365
+ KI Q IP + + + W+++A++ +V
Sbjct: 347 AEKIAQILLEQIPKPSR---EECRQYAVTNFDWQKIAQQVRQV 386
>emb|CAB57789.1| (AJ250131) HepD protein [Nostoc sp. PCC 7120]
Length = 391
Score = 77.4 bits (188), Expect = 5e-13
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 30/283 (10%)
Query: 108 LLHAKSMGFKTVFTDHSLFAFNDAASFHVNKI---LKYILCE------IDHSISVSHVSK 158
+L G F H +A NKI LK L E D I +S
Sbjct: 109 ILDILPQGIPITFNFHGPWASESKQELVKNKISIFLKRRLIEQTTYNHCDRFIVLSKAFG 168
Query: 159 ENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTIN-------IVVICRMTFRKGV 211
L + + I +IP V+ +F PN ++ +N + R+ R GV
Sbjct: 169 NILHQQYQIPWHKIHIIPGGVNIDKFQPNLSRQQARQQLNWPESRPILFTSRRLVHRVGV 228
Query: 212 DLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLN 271
D L+ L II + P+I+ I G G + LE+ Q L+N + LG +P Q+
Sbjct: 229 DKLLQALAIIKPKLPDIWLAIAGRGHLQTTLEKQAQELGLENNVKFLGFLPDEQLPIAYQ 288
Query: 272 RGHIFLNTSLT-EAFCIAIVEAASCGLCVVSTNVGGISEVL----PQNMVLYADPTPEDI 326
++ + S + E F +AI E+ +CG V+ T +GG+ E+L PQ ++ A P I
Sbjct: 289 AANLTVMPSQSFEGFGLAITESLACGTPVLCTPIGGMPEILTPFSPQ--LITASPEATAI 346
Query: 327 SHKITQ----AIPIAKNFYVYQQHELVKKMYSWEQVAERTEKV 365
+ KI Q IP + + + W+++A++ +V
Sbjct: 347 AEKIAQILLEQIPKPSR---EECRQYAVTNFDWQKIAQQVRQV 386
>gb|AAL80915.1| (AE010196) hypothetical protein [Pyrococcus furiosus DSM 3638]
Length = 389
Score = 77.0 bits (187), Expect = 6e-13
Identities = 58/221 (26%), Positives = 111/221 (49%), Gaps = 12/221 (5%)
Query: 160 NLSMRASLDPRNISVIPNAVDCSRFTPNPQK--RYPLNTIN----IVVICRMTFR-KGVD 212
+L R + P I IPN D ++F P PQ+ R LN + I+ + M R KG +
Sbjct: 172 DLFSRVGITPSKIRYIPNGFDGNKFYPIPQEIARRKLNLVEYEKIIINVANMYSRVKGHE 231
Query: 213 LLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNR 272
L+ + + + + I+ G G L++ L ++ GS P ++ +N
Sbjct: 232 YLLRAFSKVAENTSDAFLILVGSGKLLSHLKKLADNLYLGHRVLFAGSKPHDEIPLWMNA 291
Query: 273 GHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISE-VLPQNMVLYADP-TPEDISHKI 330
+F+ SL E+F + +EA +CG+ VV+T GG E ++ ++ L +P P++++ KI
Sbjct: 292 ADLFVLPSLRESFGVVQIEAMACGVPVVATRNGGSEEIIISEDYGLLCEPANPKELAEKI 351
Query: 331 TQAIPIAKNFYVYQQHELVKKMYSWEQVAERTEKVYYKILQ 371
A+ + + ++ + ++WE +A++T +VY +L+
Sbjct: 352 LIAL---EKEWDREKIRKYAEQFTWENIAKKTLEVYRGVLK 389
>ref|NP_416548.1| (NC_000913) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli K12]
sp|P71243|WCAL_ECOLI PUTATIVE COLANIC ACID BIOSYNTHESIS GLYCOSYL TRANSFERASE WCAL
pir||C64970 hypothetical protein b2044 - Escherichia coli (strain K-12)
dbj|BAA15898.1| (D90842) ORF_ID:o352#3; similar to [PIR Accession Number S15296]
[Escherichia coli]
gb|AAC75105.1| (AE000295) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli K12]
Length = 406
Score = 76.6 bits (186), Expect = 7e-13
Identities = 47/146 (32%), Positives = 82/146 (55%), Gaps = 7/146 (4%)
Query: 172 ISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFI 231
I+V VD +RF+P P K P + I+ + R+T +KG+ + ++ + + +Q +
Sbjct: 199 IAVSRMGVDMTRFSPRPVKA-PATPLEIISVARLTEKKGLHVAIEACRQLKEQGVAFRYR 257
Query: 232 IGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT------EAF 285
I G GP ++ L I++Y L++ E+ G P H+VK +L+ +FL S+T E
Sbjct: 258 ILGIGPWERRLRTLIEQYQLEDVVEMPGFKPSHEVKAMLDDADVFLLPSVTGADGDMEGI 317
Query: 286 CIAIVEAASCGLCVVSTNVGGISEVL 311
+A++EA + G+ VVST GI E++
Sbjct: 318 PVALMEAMAVGIPVVSTLHSGIPELV 343
>gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus furiosus DSM 3638]
Length = 373
Score = 76.6 bits (186), Expect = 7e-13
Identities = 61/223 (27%), Positives = 112/223 (49%), Gaps = 11/223 (4%)
Query: 148 DHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTF 207
D+ I+VS +K++L +A L +NI V+PN +D + Y T +I+ + R+
Sbjct: 154 DNHIAVSLKTKKDL-YKAGLR-KNIYVVPNGIDFEKIQEIKPSSY---TSDIIFVGRLIK 208
Query: 208 RKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQ-V 266
K V LL+ L II + P++ ++ GDGP+++ LE+ + NLQ+ + LG + ++ V
Sbjct: 209 EKNVPLLLKALTIIKQDIPDVKAVVVGDGPEREYLEKLSFKLNLQDNVKFLGFLNRYEDV 268
Query: 267 KDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTN---VGGISEVLPQNMVLYADPTP 323
++ +F SL E F I ++EA + GL VV+ +L A
Sbjct: 269 VALMKASKVFAFPSLREGFGIVVIEANASGLPVVTVEHEMNASKDLILEWKNGFIAKVNE 328
Query: 324 EDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQVAERTEKVY 366
+D + KI I + K + + + + Y+W ++ ++ E+ Y
Sbjct: 329 KDFAEKIL--IALEKRKKMKKLSTEIARKYNWNEIVKKLERYY 369
>ref|NP_288550.1| (NC_002655) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli O157:H7 EDL933]
ref|NP_310876.1| (NC_002695) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli O157:H7]
gb|AAG57104.1|AE005430_4 (AE005430) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli O157:H7 EDL933]
dbj|BAB36272.1| (AP002559) putative colanic acid biosynthesis glycosyl transferase
[Escherichia coli O157:H7]
Length = 406
Score = 76.6 bits (186), Expect = 7e-13
Identities = 47/146 (32%), Positives = 82/146 (55%), Gaps = 7/146 (4%)
Query: 172 ISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFI 231
I+V VD +RF+P P K P + I+ + R+T +KG+ + ++ + + +Q +
Sbjct: 199 IAVSRMGVDMTRFSPRPVKA-PATPLEIISVARLTEKKGLHVAIEACRQLKEQGVAFRYR 257
Query: 232 IGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT------EAF 285
I G GP ++ L I++Y L++ E+ G P H+VK +L+ +FL S+T E
Sbjct: 258 ILGIGPWERRLRTLIEQYQLEDVVEMPGFKPSHEVKAMLDDADVFLLPSVTGADGDMEGI 317
Query: 286 CIAIVEAASCGLCVVSTNVGGISEVL 311
+A++EA + G+ VVST GI E++
Sbjct: 318 PVALMEAMAVGIPVVSTLHSGIPELV 343
>ref|NP_147773.1| (NC_000854) capM protein [Aeropyrum pernix]
pir||C72590 probable hexosyltransferase (EC 2.4.1.-) APE1191 [similarity] -
Aeropyrum pernix (strain K1)
dbj|BAA80177.1| (AP000061) 363aa long hypothetical capM protein [Aeropyrum pernix]
Length = 363
Score = 74.7 bits (181), Expect = 3e-12
Identities = 54/218 (24%), Positives = 105/218 (47%), Gaps = 20/218 (9%)
Query: 151 ISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKG 210
I+VS +K+ L+ R +DP I+V+PN VD ++ P + P I+ R+ K
Sbjct: 144 IAVSQSTKKELAKRLGIDPDRIAVVPNGVDLEKYRPGSKDPRP----TILWAGRIKMYKN 199
Query: 211 VDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVL 270
+D L+ +I+ ++ P+ II G G +++ + E ++ ++ LG + + +
Sbjct: 200 LDHLLKAYRIVKQEIPDAQLIIIGTGDQEQKMRELAKKLEPRD-VHFLGKMSEQEKIMWM 258
Query: 271 NRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLP--QNMVLYADPTPEDISH 328
R I ++TS+ E + I I EAA+C + ++ NV G+ + + + +L E ++
Sbjct: 259 QRAWIIVSTSMIEGWGITITEAAACKIPAIAYNVPGLRDSVKHMETGILVEPGNIEQLAK 318
Query: 329 KITQAI-------PIAKNFYVYQQHELVKKMYSWEQVA 359
I + +++N Y Y Q +SW+ A
Sbjct: 319 AIAWLLTDNSLRNKLSENAYNYAQS------FSWDNTA 350
>gb|AAL80431.1| (AE010155) glycosyltransferase [Pyrococcus furiosus DSM 3638]
Length = 336
Score = 73.5 bits (178), Expect = 6e-12
Identities = 100/373 (26%), Positives = 172/373 (45%), Gaps = 49/373 (13%)
Query: 6 LICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP 65
L+ + P GGV H+ QL CL E+ +V ++T+ V + P I
Sbjct: 4 LLVGHYPPHKGGVARHVKQLKECL-EKRHEVYVLTY-----GTVAVEEENVYSVKVPNIF 57
Query: 66 AIQTVVLFTYVGTLPIFRQILLREEIH--IVHSH--AATSYLGGELLLHAKSMGFKTVFT 121
I+ T L + + L E+ + +VH+H TS+ G +L + G V T
Sbjct: 58 GIRG----TSFALLASKKIVKLHEKYNFDLVHAHYVGTTSFAG---VLAKRKTGVPLVIT 110
Query: 122 DH-SLFAFNDAASFHVNKILKYILCEIDHSISVSH-VSKENLSMRASLDPRNISVIPNAV 179
H S F +K + E D+ I+VSH ++K+ L + AS ISVIPN
Sbjct: 111 AHGSDLEFMSRLPLG-GYFVKTSIMEADYVIAVSHYLAKKALELGAS----RISVIPNWT 165
Query: 180 DCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKK 239
+ S +++Y I+ + R+ KG++ ++ + K+ P F++ G+GP
Sbjct: 166 ELS---GESERKY------ILFLGRVASYKGIEDFIE----LAKRFPGEEFVVAGEGPLL 212
Query: 240 KILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCV 299
K L R + LG VP +DVL + + + S E F + ++EA S + V
Sbjct: 213 KKL-----RAKSPPNVKFLGYVPA---EDVLKKAKVLVLPSKREGFGLVVIEANSFKVPV 264
Query: 300 VSTNVGGISEVL--PQNMVLYADPTPEDISHKITQAIPIAKNFYVYQQHELVKKMYSWEQ 357
+ NVGGI E++ +N L+ D + I++ T +P N + + + K +S E+
Sbjct: 265 LGRNVGGIRELIRFSKNGYLFED-IEDAITYLKTLLVP-KTNVKLGSIGKRISKGHSQEK 322
Query: 358 VAERTEKVYYKIL 370
+ ER E++Y +++
Sbjct: 323 MCERVEEIYREVI 335
>ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related protein
[Methanothermobacter thermautotrophicus]
pir||C69098 probable hexosyltransferase (EC 2.4.1.-) MTH173 - Methanobacterium
thermoautotrophicum (strain Delta H)
gb|AAB84679.1| (AE000805) LPS biosynthesis RfbU related protein
[Methanothermobacter thermautotrophicus]
Length = 382
Score = 73.1 bits (177), Expect = 9e-12
Identities = 88/389 (22%), Positives = 163/389 (41%), Gaps = 28/389 (7%)
Query: 2 VNICLICDFFYPCL-GGVEMHIFQLGLCLIERGLKVIIITH------KYQGRSGVRYMTN 54
+ I ++ DFF P GG E F++ L+ERG V +I+ +Y+ SGVR
Sbjct: 4 MRILIVSDFFVPHYNGGGERRYFEIARRLVERGHVVDVISMGIHGVGEYEEVSGVRVHHL 63
Query: 55 GLKVYYCPFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSM 114
G ++ P + + FR + + + I+ + L L ++
Sbjct: 64 GPRIRKPPLRGPLDFIRFMAAA-----FRWV-MTHDYDIIDAQTYAPLLPA--FLASRIH 115
Query: 115 GFKTVFTDHSLFAFNDAASFHVNK---ILKYILCEI--DHSISVSHVSKENLSMRASLDP 169
G V T H + + + +K IL+ +L + D I+VS + L+ +P
Sbjct: 116 GTPMVATIHDVSSAHGDQWLQSSKTATILERVLMRLPYDGVITVSRSTASALTELHGRNP 175
Query: 170 RNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIY 229
I +IPN VD P I+ + R+ K VD L++V + P++
Sbjct: 176 DGIHIIPNGVDPELI----DSVTPATGNYIIFVGRLAPHKHVDHLIEVFSKLVIDFPDLR 231
Query: 230 FIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAI 289
I GDG ++ L+ + +++ ++ +V + + + S E F + +
Sbjct: 232 LEIIGDGVERARLKAMVDECGIRDSVTFHHNLSYPEVISRIRGARVLVLPSTREGFGMVL 291
Query: 290 VEAASCGLCVVSTNVGGISEVLP--QNMVLYADPTPEDISHKITQAIPI--AKNFYVYQQ 345
EA +CG+ V+ GG+ EV+ +N L E + KI I ++ Q
Sbjct: 292 AEAGACGVPAVAYRSGGVVEVIDDGENGFLVEPCDKEALHDKIKLLISDDELRDRMGSQG 351
Query: 346 HELVKKMYSWEQVAERTEKVYYKILQTQN 374
+ V++ + W++V + E+ Y I+ +N
Sbjct: 352 RKKVEEEFIWDRVVDEVERTYSFIIARKN 380
>ref|NP_242281.1| (NC_002570) BH1415~unknown conserved protein in others [Bacillus
halodurans]
dbj|BAB05134.1| (AP001512) BH1415~unknown conserved protein in others [Bacillus
halodurans]
Length = 923
Score = 72.3 bits (175), Expect = 2e-11
Identities = 86/393 (21%), Positives = 170/393 (42%), Gaps = 47/393 (11%)
Query: 6 LICDFFYP--CLGGVEMHIFQLGLCLIERGLKVIIITHKYQG-----RSGVRYM--TNGL 56
L+ + YP +GG+ H+ L L ++G ++ ++T G ++G ++ +GL
Sbjct: 540 LMLSWEYPPHVVGGLSRHVDALSQALAKKGHEIHVVTAAMDGAPEYEKNGEVHIHRVSGL 599
Query: 57 KVYYCPFIPAIQT--VVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSM 114
+ PF+ + + + +F +V L FR ++H+H + L+ ++
Sbjct: 600 QPEREPFLDWVASLNLAMFEHVKKLYRFR------PFDVIHAH--------DWLVSGAAL 645
Query: 115 GFKTVFTDHSLFAFNDAASFHVNKILKY------------ILCEIDHSISVSHVSKENLS 162
K +F SL A A N+ + ++ E D I S KE++
Sbjct: 646 ALKHLFQT-SLMATIHATEHGRNQGIHTELQQAIHEQEMKLVTEADQIIVCSQFMKEHVQ 704
Query: 163 MRASLDPRNISVIPNAVDCSRF-TPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQII 221
+P ++VI N V + Q P N + + R+ KG LL++
Sbjct: 705 SLFVPNPDKVAVIANGVAREQIEAARLQTISPENRFIVFSVGRIVQEKGFSLLIEA-AAK 763
Query: 222 CKQHPE-IYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTS 280
CK+ E I F++ G GP ++ ++ +L+ +G + + + +R + + S
Sbjct: 764 CKELGEPIQFVVAGHGPLLADYQQQVKERHLEAWISFVGYISDSERNEWYHRADVCIFPS 823
Query: 281 LTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAI-----P 335
L E F I +EA + G + ++ GG++E++ PT D+ + Q + P
Sbjct: 824 LYEPFGIVALEAMAAGTPTIVSDTGGLAEIVEHGDNGLKVPT-GDVDAIVAQLLSLYHKP 882
Query: 336 IAKNFYVYQQHELVKKMYSWEQVAERTEKVYYK 368
+ + ++ + V + YSWE +A++TE + K
Sbjct: 883 LLRAQIGFKGSQDVIEQYSWETIADQTEAILVK 915
>ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
dbj|BAB76901.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
Length = 429
Score = 71.5 bits (173), Expect = 3e-11
Identities = 58/194 (29%), Positives = 96/194 (48%), Gaps = 19/194 (9%)
Query: 168 DPRNISVIPNAVDCSRFTPNPQKRYPLN-TINIVVICRMTFRKGVDLLVDVLQIICKQHP 226
D I V + +D + F ++ YP + I I R+ +KG++ ++ + + K +P
Sbjct: 198 DADKIHVHGSGIDSNSFFFQ-ERSYPHDGIIRIATTGRLVEKKGIEYVIKAVAQVIKNYP 256
Query: 227 EIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT---- 282
+I + I GDG K E+ I NL +LLG ++ D+L++ HIF+ S+T
Sbjct: 257 DIEYNIIGDGELKTHFEKLIFELNLSQNVKLLGWKQQKEIVDILDKCHIFVAPSVTGKDG 316
Query: 283 --EAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPT--PEDISHKITQAIPIAK 338
+A + EA + GL V+ST GGI E++ + + P E I+HK+T
Sbjct: 317 NQDAPVNTLKEAMAMGLPVISTRHGGIPELVTDGVSGFLVPERDAEAIAHKLT------- 369
Query: 339 NFYVYQQHELVKKM 352
Y+ + EL KKM
Sbjct: 370 --YLIEHPELWKKM 381
>ref|NP_360212.1| (NC_003103) capM protein [Rickettsia conorii]
gb|AAL03113.1| (AE008618) capM protein [Rickettsia conorii]
Length = 338
Score = 71.5 bits (173), Expect = 3e-11
Identities = 89/358 (24%), Positives = 161/358 (44%), Gaps = 43/358 (12%)
Query: 15 LGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIPAIQTVVLFT 74
LGG++ L + +++I IT Y+ + LK + +V F
Sbjct: 12 LGGIQQAFLDYSTALEMQKIEIINIT-SYKAKINSFLHKQSLK---------LPNIVPFD 61
Query: 75 YVGTLPIFRQIL--LREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSLFAFNDAA 132
+ L IF+ I+ + +I I H + A ++ AKS K + H
Sbjct: 62 LLSVL-IFKYIIHKTKPDIIIAHGNRAINFSK-----FAKSQNIKLIGIAH--------- 106
Query: 133 SFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSR-FTPNPQKR 191
N LK L + D I+++H KE L ++ I ++PN ++ ++ F PN R
Sbjct: 107 ----NYSLKG-LRKCDFVIALTHHMKEFL-LKNHFAESRICILPNMINIAKDFIPNKTYR 160
Query: 192 YPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNL 251
P + I V+ R +KGVD+ + ++I+ ++ +++ +IGG G +K L + NL
Sbjct: 161 KP---VVIGVLARFVAKKGVDVFIKAIKILKEKKYDLHAVIGGSGEEKDNLIALAHKLNL 217
Query: 252 QNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVL 311
Q+Q G V + + IF SL E F I ++EA + +VST+ G + +L
Sbjct: 218 QDQISFTGWV--NDRDKFFKQIDIFCLPSLHEPFGIIVLEAMEASMPIVSTDTEGPTAIL 275
Query: 312 P--QNMVLYADPTPEDISHKITQAI--PIAKNFYVYQQHELVKKMYSWEQVAERTEKV 365
Q+ ++ + ED++ KI I PI + + +K+ Y + V+E+ + +
Sbjct: 276 NDMQDGLICKAGSAEDLAAKIVYLIENPIKAKEFSKNAYLTLKQNYEIKVVSEKLQHI 333
>ref|NP_248617.1| (NC_000909) LPS biosynthesis protein, putative [Methanococcus
jannaschii]
pir||F64500 probable hexosyltransferase (EC 2.4.1.-) MJ1607 - Methanococcus
jannaschii
gb|AAB99629.1| (U67601) LPS biosynthesis protein, putative [Methanococcus
jannaschii]
Length = 390
Score = 70.8 bits (171), Expect = 5e-11
Identities = 90/377 (23%), Positives = 167/377 (43%), Gaps = 35/377 (9%)
Query: 15 LGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFI--PAIQTVVL 72
+GG+ +H L L+ G +V +IT Y NG+ VY I P T +
Sbjct: 15 VGGLAIHCKGLAEGLVRNGHEVDVITVGYDLPE--YENINGVNVYRVRPISHPHFLTWAM 72
Query: 73 FTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHS-------- 124
F + IL ++ ++H H ++ G L H M + V + HS
Sbjct: 73 FM-AEEMEKKLGILGVDKYDVIHCHDWMTHFVGANLKHICRMPY--VQSIHSTEIGRCGG 129
Query: 125 LFAFNDAASFHVNKILK-YILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSR 183
L++ +D+ + H + L Y C++ I+VS KE + + + VI N ++
Sbjct: 130 LYS-DDSKAIHAMEYLSTYESCQV---ITVSKSLKEEVCSIFNTPEDKVKVIYNGINPWE 185
Query: 184 FTPNPQKRYPLN---TIN-------IVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIG 233
F N +N +I I+ + R+T++KG++ L+ + I ++H +I
Sbjct: 186 FDINLSWEEKINFRRSIGVQDDEKMILFVGRLTYQKGIEYLIRAMPKILERH-NAKLVIA 244
Query: 234 GDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAA 293
G G + LE+ + ++++ LG V G +K + + + S+ E F I +EA
Sbjct: 245 GSGDMRDYLEDLCYQLGVRHKVVFLGFVNGDTLKKLYKSADVVVIPSVYEPFGIVALEAM 304
Query: 294 SCGLCVVSTNVGGISEVLPQ--NMVLYADPTPEDISHKITQAIPI--AKNFYVYQQHELV 349
+ G VV ++VGG+ E++ N + P+ I+ + + + + + V + V
Sbjct: 305 AAGTPVVVSSVGGLMEIIKHEVNGIWVYPKNPDSIAWGVDRVLSDWGFREYIVNNAKKDV 364
Query: 350 KKMYSWEQVAERTEKVY 366
+ YSW+ +A+ T VY
Sbjct: 365 YEKYSWDNIAKETVNVY 381
>dbj|BAB18471.1| (AB033991) BtrM [Bacillus circulans]
Length = 389
Score = 69.6 bits (168), Expect = 9e-11
Identities = 55/208 (26%), Positives = 103/208 (49%), Gaps = 13/208 (6%)
Query: 152 SVSHVSKENLSMRASLDPRNISVIPNAVD---CSRFTPNPQKRYPLNTINIVVICRMTFR 208
S + + +++ M A D + + +++D CSR K L ++ + R+
Sbjct: 163 SYNRLLEDSSKMTAISDCIGSNHLSHSIDCPFCSRL-----KTELLGKKTVLFLGRIAHE 217
Query: 209 KGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKD 268
KG V V + + + ++ FI+ GDGP+++ +EE I+ NLQNQ + G + V
Sbjct: 218 KGWSTFVSVAKELADKIGDLQFIVCGDGPQREAMEEQIKAANLQNQFRITGFISHKFVSC 277
Query: 269 VLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQ-NMVLYADPTP---- 323
L+ +FL S E F +++EAA G+ ++STN GG +++ + DP
Sbjct: 278 YLHHAQLFLLPSHHEEFGGSLIEAAIAGVPIISTNNGGPADIFTHGETAILKDPGDVSGI 337
Query: 324 EDISHKITQAIPIAKNFYVYQQHELVKK 351
D ++KI +A++ ++ + E+V K
Sbjct: 338 ADEAYKILTNDSVAESLRLHSRPEVVSK 365
>ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
dbj|BAB76900.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
Length = 430
Score = 69.6 bits (168), Expect = 1e-10
Identities = 47/175 (26%), Positives = 89/175 (50%), Gaps = 8/175 (4%)
Query: 168 DPRNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPE 227
+P + + + +DC++FT P+ + + R+ +KG++ + + + + +P
Sbjct: 199 NPDKLIIHGSGLDCNKFTFKPRYFPADGKVQVATTGRLVEKKGIEYAIRAVAKVAELYPN 258
Query: 228 IYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLT----- 282
I + + GDG K+ LE+ I N+ + +LLG ++ ++L HIF+ S+T
Sbjct: 259 IEYQVIGDGDLKEDLEQLITELNIGHIVKLLGWKQQKEIVEILENTHIFIAPSVTAADGN 318
Query: 283 -EAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPT--PEDISHKITQAI 334
+A + EA + GL V+ST GGI E++ + + P E I+HK+T I
Sbjct: 319 QDAPVNTLKEAMAMGLPVISTRHGGIPELVTDGVSGFLVPERDAEAIAHKLTYLI 373
>ref|NP_563139.1| (NC_003366) probable mannosyltransferase B [Clostridium
perfringens]
dbj|BAB81929.1| (AP003193) probable mannosyltransferase B [Clostridium perfringens]
Length = 381
Score = 68.4 bits (165), Expect = 2e-10
Identities = 75/287 (26%), Positives = 130/287 (45%), Gaps = 29/287 (10%)
Query: 116 FKTVFTDHSLFAF---NDAASFHVNKILKYILCEIDHS---ISVSHVSKEN-LSMRASLD 168
F + T H L + ++ K L+ + ID+S I+VS SK + L
Sbjct: 100 FAKLVTIHDLIPYILPETVGKGYLKKFLQSMPEIIDNSTGIITVSEYSKSDILRFFPHFP 159
Query: 169 PRNISVIPNA-------VDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQII 221
NI V P A +D + + KR+ N I+ I + RK V LVD I
Sbjct: 160 AENIFVTPLAANENYKPLDKEKCLFDVNKRFDFNGPFIMYIGGFSLRKNVKGLVDAFNNI 219
Query: 222 CKQHPEIY--FIIGG---DGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIF 276
K E Y I+GG +G K K E++ ++++ G + + + N +F
Sbjct: 220 HKNIDENYKLLIVGGLRDEGLKLKAYTESLP---IKDKVIFTGFIEDEYLPTLYNATTLF 276
Query: 277 LNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAIPI 336
+ SL E F + +EA SC V+++N+ I EV+P L P+++S K+ +
Sbjct: 277 VYPSLYEGFGLPPLEAMSCKTAVLTSNITSIPEVVPFKESLVDPNNPKELSSKLENLLND 336
Query: 337 AKNFYVYQQHELV----KKMYSWEQVAERTEKVYYKILQTQNQTILK 379
+K + E + K ++WE+ A++T +VY K+++ +++K
Sbjct: 337 SK---LRNNLEDICFERSKEFTWEKTAKKTLEVYKKVVEISKNSLIK 380
>ref|NP_302182.1| (NC_002677) putative transferase [Mycobacterium leprae]
emb|CAC30668.1| (AL583923) putative transferase [Mycobacterium leprae]
Length = 438
Score = 68.4 bits (165), Expect = 2e-10
Identities = 55/250 (22%), Positives = 111/250 (44%), Gaps = 7/250 (2%)
Query: 131 AASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQK 190
A S V+ + +++ E D I+ S + I+VI N +D +R+ P +
Sbjct: 176 ALSRQVHAVESWLVRESDSLITCSASMCNEIIELFGPGLAEITVIRNGIDPARW-PFAAR 234
Query: 191 RYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYN 250
R ++ + R+ + KGV ++ L I + +P I G+G ++ L + ++Y
Sbjct: 235 RARTGPAELLYVGRLEYEKGVHDVIAALPRIRRSYPGTTLTIAGEGTQQDWLVDQARKYK 294
Query: 251 LQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEV 310
+ T +G + +++ L R + S E F + +EAA+ G +V++N+GG+ E
Sbjct: 295 VIKATRFVGHLNHNELLAALQRADAAVLPSHYEPFGLVALEAAAAGTPLVTSNIGGLGEA 354
Query: 311 LPQNMVLYADPTPEDISHKITQAIPIAKN-----FYVYQQHELVKKMYSWEQVAERTEKV 365
+ + P P DI+ + ++ E + + W+ VA++T +V
Sbjct: 355 VINGQTGVSCP-PRDIAELAAMVCTVLEDPDAAQQRALAARERLTSDFDWQTVAQQTAQV 413
Query: 366 YYKILQTQNQ 375
Y + + Q
Sbjct: 414 YLAAKRRERQ 423
>ref|NP_390127.1| (NC_000964) alternate gene name: jojH~similar to lipopolysaccharide
biosynthesis-related protein [Bacillus subtilis]
sp|P42982|YPJH_BACSU Putative glycosyl transferase ypjH
pir||G69937 lipopolysaccharide biosynthesis-related pr homolog ypjH - Bacillus
subtilis
gb|AAB38445.1| (L47709) 21.4% of identity to trans-acting transcription factor of
Sacharomyces cerevisiae; 25% of identity to sucrose
synthase of Zea mays; putative [Bacillus subtilis]
emb|CAB14162.1| (Z99115) alternate gene name: jojH~similar to lipopolysaccharide
biosynthesis-related protein [Bacillus subtilis]
Length = 377
Score = 68.4 bits (165), Expect = 2e-10
Identities = 88/390 (22%), Positives = 183/390 (46%), Gaps = 51/390 (13%)
Query: 12 YPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP----AI 67
YP +GG + +LG L E+G ++ IT S + + N Y P I +
Sbjct: 11 YPSVGGSGIIATELGKQLAEKGHEIHFIT------SSIPFRLNT----YHPNIHFHEVEV 60
Query: 68 QTVVLFTY----VGTLPIFRQILLREEIHIVHS-----HAATSYLGGELLLHAKSMGFKT 118
+F Y + ++ RE + I+H+ HA +YL ++L +++G T
Sbjct: 61 NQYAVFKYPPYDLTLASKIAEVAERENLDIIHAHYALPHAVCAYLAKQML--KRNIGIVT 118
Query: 119 VF--TDHSLFAFNDAASFHVNKILKYILCEIDHSISVSH-VSKENLSMRASLDP-RNISV 174
TD ++ ++ + + ++++ + D +VS ++ E + + P + I
Sbjct: 119 TLHGTDITVLGYDPS----LKDLIRFAIESSDRVTAVSSALAAETYDL---IKPEKKIET 171
Query: 175 IPNAVD----CSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQII--CKQHPEI 228
I N +D + T ++++ + VVI FRK V + DV+++ +
Sbjct: 172 IYNFIDERVYLKKNTAAIKEKHGILPDEKVVIHVSNFRK-VKRVQDVIRVFRNIAGKTKA 230
Query: 229 YFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIA 288
++ GDGP+K E I++Y L++Q +LG+ +V+D+ + + L S E+F +
Sbjct: 231 KLLLVGDGPEKSTACELIRKYGLEDQVLMLGN--QDRVEDLYSISDLKLLLSEKESFGLV 288
Query: 289 IVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTPEDISHKITQAIPIAK-----NFYVY 343
++EA +CG+ + TN+GGI EV+ N+ + D++ +A+ I + N +
Sbjct: 289 LLEAMACGVPCIGTNIGGIPEVIKNNVSGFLVDV-GDVTAATARAMSILEDEQLSNRFTK 347
Query: 344 QQHELVKKMYSWEQVAERTEKVYYKILQTQ 373
E+++ +S +++ + E++Y + + +
Sbjct: 348 AAIEMLENEFSSKKIVSQYEQIYADLAEPE 377
>ref|NP_214125.1| (NC_000918) capsular polysaccharide biosynthsis protein [Aquifex
aeolicus]
pir||F70441 capsular polysaccharide biosynthsis protein - Aquifex aeolicus
gb|AAC07522.1| (AE000749) capsular polysaccharide biosynthsis protein [Aquifex
aeolicus]
Length = 316
Score = 66.9 bits (161), Expect = 7e-10
Identities = 42/175 (24%), Positives = 92/175 (52%), Gaps = 6/175 (3%)
Query: 139 ILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTIN 198
++K +L ++D + VS+ K +L + + V+ N +D + + ++
Sbjct: 85 MIKVLLEKLDGIVCVSNTVKRDLKQTFWIKDDKLKVVYNLIDIDKIRKQADESINVDFDY 144
Query: 199 IVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELL 258
I+ + R+ +KG ++ ++I ++ +++ +I G+G KK +E+ I+ L+N+ LL
Sbjct: 145 IIAVGRLEDQKGYPYMLRAFKLISEKFKDLHLLIIGEGSKKNQVEKLIEELGLKNKVHLL 204
Query: 259 GSVPGHQVK--DVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEVL 311
G+Q+ + R +L TS+ E F + +VEA + G+ V++ ++ + EVL
Sbjct: 205 ----GYQLNPYKYIKRAKAYLMTSIYEGFGLVLVEAMALGIPVIAFDIPAVREVL 255
>ref|NP_248171.1| (NC_000909) conserved hypothetical protein [Methanococcus
jannaschii]
pir||H64446 probable hexosyltransferase (EC 2.4.1.-) MJ1178 [similarity] -
Methanococcus jannaschii
gb|AAB99181.1| (U67559) conserved hypothetical protein [Methanococcus jannaschii]
Length = 351
Score = 66.1 bits (159), Expect = 1e-09
Identities = 94/369 (25%), Positives = 154/369 (41%), Gaps = 31/369 (8%)
Query: 6 LICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCPFIP 65
L+ +YP +GG+ +H+ L L + ++ I+T+ Y N K +P
Sbjct: 7 LMPSIYYPYIGGITLHVENLVKRL--KDIEFHILTYD-------SYEENEYKNVIIHNVP 57
Query: 66 AIQTVVLFTY-VGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHS 124
++ +Y + I + I+ E I ++HSH A LL K + + T H
Sbjct: 58 HLKKFRGISYLINAYKIGKNIIESEGIDLIHSHYAFPQGCVGALLKNK-LSIPHILTLHG 116
Query: 125 LFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRF 184
A S KY D I VS K L +L R I VI N V
Sbjct: 117 SDALILKNSIKGRYFFKYATTNSDKIICVSKYIKNQLD--ENLKNRAI-VIYNGV----- 168
Query: 185 TPNPQKRYPLNTINI-VVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILE 243
N + Y N + + +KGVD+L+D ++ I + F + GDG K +E
Sbjct: 169 --NKEILYNEGDYNFGLFVGAFVPQKGVDILIDAIKDI-----DFNFKLIGDGKLYKKIE 221
Query: 244 ETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTN 303
+ + NL + ELLG +V + + + S +E F + VE +C V++T
Sbjct: 222 NFVVKNNL-SHIELLGRKSFDEVASFMRKCSFLVVPSRSEGFGMVAVEGMACSKPVIATR 280
Query: 304 VGGISEVLPQ--NMVLYADPTPEDISHKITQAIPIAK-NFYVYQQHELVKKMYSWEQVAE 360
VGG+ E++ N +L P D+ KI + I + + + + K +SWE+
Sbjct: 281 VGGLGEIVIDGYNGLLAEKNNPNDLKEKILELINNEELRKTLGENGKEFSKKFSWEKCVM 340
Query: 361 RTEKVYYKI 369
KVY ++
Sbjct: 341 GVRKVYEEL 349
>gb|AAK20702.1|AF316641_8 (AF316641) WciS [Streptococcus pneumoniae]
Length = 354
Score = 63.4 bits (152), Expect = 7e-09
Identities = 77/335 (22%), Positives = 142/335 (41%), Gaps = 43/335 (12%)
Query: 64 IPAIQTVVLFTYVGTLPIFRQILLREEIH-----IVHSHAATS--------------YLG 104
I A Q+ + + V L LL+ +H I H H AT
Sbjct: 37 ITAYQSFIDGSLVTRLTYSSYALLKFVVHSGNYDIYHIHTATRGSCWRKLLYLKLLKSKN 96
Query: 105 GELLLHAKSMGFKTVFTDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKE---NL 161
+ +LH F+ ++ + NK+ + +L D+ I +S + N+
Sbjct: 97 KKAILHIHGAEFQIF--------YDSLPEYKKNKV-REMLELSDYVIVLSQTWYDFFSNI 147
Query: 162 SMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQII 221
++ A I ++ N VD S + +K+ + + N + + RM RKG L+D +
Sbjct: 148 NINA-----KIVIVENGVDTSFYV---EKKKSITSNNFLFLGRMGKRKGAYDLIDAMNQA 199
Query: 222 CKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSL 281
+P ++ + GDG + I + I NL + + V K + + S
Sbjct: 200 VAINPNLHLTMAGDGELEDI-RQKISNLNLTDHITIYDWVNQRDKKILFQANQTLILPSY 258
Query: 282 TEAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTP-EDISHKITQAI--PIAK 338
E +AI+EA + GL ++ST VGGI E++ ++ P +S+ I +A P
Sbjct: 259 NEGLPMAILEAMASGLAIISTPVGGIPEIIHEDNGWLIQPGDISQLSNIILEASYNPDVV 318
Query: 339 NFYVYQQHELVKKMYSWEQVAERTEKVYYKILQTQ 373
+ H+LV++ YS+ + + +K+Y +L+T+
Sbjct: 319 SLMGSNNHKLVEEKYSFHSMHGKIKKIYNTLLETK 353
>emb|CAB43611.1| (AJ239004) galactosyl transferase [Streptococcus pneumoniae]
Length = 354
Score = 63.4 bits (152), Expect = 7e-09
Identities = 77/335 (22%), Positives = 142/335 (41%), Gaps = 43/335 (12%)
Query: 64 IPAIQTVVLFTYVGTLPIFRQILLREEIH-----IVHSHAATS--------------YLG 104
I A Q+ + + V L LL+ +H I H H AT
Sbjct: 37 ITAYQSFIDGSLVTRLTYSSYALLKFVVHSGNYDIYHIHTATRGSCWRKLLYLKLLKSKN 96
Query: 105 GELLLHAKSMGFKTVFTDHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKE---NL 161
+ +LH F+ ++ + NK+ + +L D+ I +S + N+
Sbjct: 97 KKAILHIHGAEFQIF--------YDSLPEYKKNKV-REMLELSDYVIVLSQTWYDFFSNI 147
Query: 162 SMRASLDPRNISVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQII 221
++ A I ++ N VD S + +K+ + + N + + RM RKG L+D +
Sbjct: 148 NINA-----KIVIVENGVDTSFYV---EKKKSITSNNFLFLGRMGKRKGAYDLIDAMNQA 199
Query: 222 CKQHPEIYFIIGGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSL 281
+P ++ + GDG + I + I NL + + V K + + S
Sbjct: 200 VAINPNLHLTMAGDGELEDI-RQKISNLNLTDHITIYDWVNQRDKKILFQANQTLILPSY 258
Query: 282 TEAFCIAIVEAASCGLCVVSTNVGGISEVLPQNMVLYADPTP-EDISHKITQAI--PIAK 338
E +AI+EA + GL ++ST VGGI E++ ++ P +S+ I +A P
Sbjct: 259 NEGLPMAILEAMASGLAIISTPVGGIPEIIHEDNGWLIQPGDISQLSNIILEASYNPDVV 318
Query: 339 NFYVYQQHELVKKMYSWEQVAERTEKVYYKILQTQ 373
+ H+LV++ YS+ + + +K+Y +L+T+
Sbjct: 319 SLMGSNNHKLVEEKYSFHSMHGKIKKIYNTLLETK 353
>ref|NP_349652.1| (NC_003030) Glycosyltransferase [Clostridium acetobutylicum]
gb|AAK80992.1|AE007802_8 (AE007802) Glycosyltransferase [Clostridium acetobutylicum]
Length = 352
Score = 57.5 bits (137), Expect = 4e-07
Identities = 50/248 (20%), Positives = 117/248 (47%), Gaps = 15/248 (6%)
Query: 72 LFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTDHSLFAFNDA 131
LF + ++I++ + I+++H+++ + ++ K V+T H+L
Sbjct: 62 LF---SKIKTIKKIVISKNINVIHANSLRLAIISSIVKKLYKKDLKIVYTKHNLTILE-- 116
Query: 132 ASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCSRFTPNPQK- 190
H ++ +D ++V + ++N+ + + + VIPN++D F N +
Sbjct: 117 -KIHTKLFSAFVNKNVDIVLAVCNKDRDNM-ISIGVSEEKVKVIPNSIDLKHFKFNSKYL 174
Query: 191 RYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKILEETIQRYN 250
R + ++ R++ K + +D+ + + +IGGDGP ++ + I++ N
Sbjct: 175 RDAGKDFKVGMLSRLSKEKNHEFFLDI-----AEKADFRALIGGDGPLREEINNRIEKSN 229
Query: 251 LQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVSTNVGGISEV 310
L+ + ++LG++ + L+ + L S E F + ++EA + G V+S ++GGI +
Sbjct: 230 LKKKVKMLGNI--ENSYEFLSSVDVMLLVSTREIFPMTLLEAMAVGTIVISVDIGGIRDC 287
Query: 311 LPQNMVLY 318
+ + Y
Sbjct: 288 VINDKTGY 295
>gb|AAL67552.1|AF461121_3 (AF461121) putative galactosyltransferase WbgM [Escherichia coli]
Length = 364
Score = 56.7 bits (135), Expect = 7e-07
Identities = 77/317 (24%), Positives = 155/317 (48%), Gaps = 26/317 (8%)
Query: 64 IPAI-QTVVLFTYVGTLPIFRQILLREEIHIVHSHAA-TSYLGGELLLHAKSMGFKTVFT 121
IP + + + LF +L +I+ +E+ IVH+H++ T +LG + AK G K +
Sbjct: 57 IPTLTREISLFKDCASLFQLYKIIKKEKFDIVHTHSSKTGFLG---RVAAKLAGTKKIVH 113
Query: 122 DHSLFAFNDAASFHVNKILKYI--LCEIDHS-----ISVSHVSKENLSMRASLDPR--NI 172
FAF NK++K+I L E+ S I V + S E ++ + + + +
Sbjct: 114 TVHGFAFPSTE----NKLIKFIYFLMELIASYCSNIIIVMNESDERIARKYFVKNKKSKL 169
Query: 173 SVIPNAVDCSRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFII 232
+I NA+D ++ + K + IV++ R+ +K LL++ ++ + I+ I
Sbjct: 170 LLINNAIDVDKYNKDKDKDKDKDIFKIVMVGRLCDQKNPLLLIEAIKDL---ESNIHVDI 226
Query: 233 GGDGPKKKILEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEA 292
GDGP K L E I +YN+ ++ LG + V++ L + +F+ S E +A++EA
Sbjct: 227 IGDGPLKVKLLEKINQYNIADKVSFLGWIDA--VEEHLYKYDLFVLPSRWEGMPLAMLEA 284
Query: 293 ASCGLCVVSTNVGGISEVLPQNM-VLYADPTPEDISHKIT--QAIPIAKNFYVYQQHELV 349
+ + V+S+++ ++ + V++ D +D+ KI A P +N ++ ++ +
Sbjct: 285 MAAKVPVLSSDIEANKYLIEKTAGVVFKDEDSKDLKRKINVLHANPELRNNLAHKAYQAL 344
Query: 350 KKMYSWEQVAERTEKVY 366
+ + + + E +Y
Sbjct: 345 IEDFDLTKRTKILESLY 361
CPU time: 74.83 user secs. 1.50 sys. secs 76.33 total secs.
Database: nr
Posted date: Apr 21, 2002 2:19 PM
Number of letters in database: 277,845,442
Number of sequences in database: 887,402
Lambda K H
0.326 0.142 0.429
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 241480859
Number of Sequences: 887402
Number of extensions: 9987286
Number of successful extensions: 30875
Number of sequences better than 10.0: 525
Number of HSP's better than 10.0 without gapping: 230
Number of HSP's successfully gapped in prelim test: 295
Number of HSP's that attempted gapping in prelim test: 30333
Number of HSP's gapped (non-prelim): 592
length of query: 442
length of database: 277,845,442
effective HSP length: 54
effective length of query: 388
effective length of database: 229,925,734
effective search space: 89211184792
effective search space used: 89211184792
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (21.6 bits)
S2: 74 (33.2 bits)
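
The effective lengths and Expect values reported above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual edge-effect correction (subtract the effective HSP length from the query, and once per sequence from the database) and the standard relation E = m * n * 2^(-S') between bit score S' and Expect. The function names are illustrative only. Because the report prints bit scores rounded to one decimal, recomputed Expect values agree only approximately.

```python
# Values copied from the statistics footer of this report.
QUERY_LEN = 442            # length of query
DB_LETTERS = 277_845_442   # Number of letters in database
DB_SEQS = 887_402          # Number of sequences in database
HSP_LEN = 54               # effective HSP length


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the edge-effect correction subtracts the effective HSP
    length once from the query and once per database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Expect value for a bit score: E = search_space * 2 ** (-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


eff_q, eff_db, space = effective_lengths(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN)
print(eff_q)    # 388, matches "effective length of query"
print(eff_db)   # 229925734, matches "effective length of database"
print(space)    # 89211184792, matches "effective search space"

# Bacillus subtilis ypjH hit: Score = 68.4 bits, Expect = 2e-10
e_ypjh = expect_from_bits(68.4, space)
print(f"{e_ypjh:.0e}")  # 2e-10
```

The same check can be applied to any other hit in the report by passing its bit score to `expect_from_bits`; a small difference in the last printed digit is expected from rounding of the bit score.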