IMP GPI Biosynthesis Report IMP-Bioinformatics

GPI Anchor Biosynthesis Report: PIG-A (NP_002632.1, human)




BLASTP 2.1.1 [Aug-8-2000]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 
         (484 letters)

Database: nr
           887,402 sequences; 277,845,442 total letters

Searching..................................................


Distribution of 54 Blast Hits on the Query Sequence


Sequences with E-value BETTER than threshold

                                                                  Score     E
Sequences producing significant alignments:                       (bits)  Value

ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, cla...   996   0.0
pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse >gi...   865   0.0
ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidy...   466   e-130
pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like pr...   454   e-126
pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fissi...   447   e-124
ref|NP_495840.1| (NM_063439) phosphatidylinositol biosyntheti...   431   e-119
pir||I52665 class A GlcNAc-inositol phospholipid assembly pro...   418   e-115
ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol ...   417   e-115
gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila mel...   415   e-114
gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia]           404   e-111
ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidy...   383   e-105
pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Sa...   379   e-104
prf||1804343A SPT14 gene [Saccharomyces cerevisiae]                369   e-101
emb|CAB57276.1| (X77725) PIG-A [Homo sapiens]                      346   6e-94
ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, cla...   240   4e-62
ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIO...   117   6e-25
ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus...   112   1e-23
ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidy...   102   1e-20
ref|NP_266369.1| (NC_002662) LPS biosynthesis protein [Lactoc...    99   1e-19
ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related pr...    93   8e-18
gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus fu...    89   2e-16
gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus fu...    85   3e-15
ref|NP_228553.1| (NC_000853) conserved hypothetical protein [...    81   4e-14
ref|NP_472029.1| (NC_003212) weakly similar to human N-acetyl...    79   1e-13
ref|NP_213400.1| (NC_000918) hypothetical protein [Aquifex ae...    78   3e-13
gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus fu...    78   4e-13
ref|NP_347689.1| (NC_003030) LPS glycosyltransferase [Clostri...    76   2e-12
ref|NP_466078.1| (NC_003210) weakly similar to human N-acetyl...    75   2e-12
ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis s...    75   3e-12
emb|CAB57789.1| (AJ250131) HepD protein [Nostoc sp. PCC 7120]       75   4e-12
ref|NP_487738.1| (NC_003272) heterocyst envelope polysacchari...    75   4e-12
ref|NP_248617.1| (NC_000909) LPS biosynthesis protein, putati...    73   1e-11
ref|NP_437172.1| (NC_003078) putative membrane-anchored glyco...    73   1e-11
gb|AAL80915.1| (AE010196) hypothetical protein [Pyrococcus fu...    73   1e-11
ref|NP_228431.1| (NC_000853) lipopolysaccharide biosynthesis ...    72   2e-11
ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. ...    72   2e-11
ref|NP_355849.1| (NC_003063) AGR_L_35GMp [Agrobacterium tumef...    72   3e-11
ref|NP_302182.1| (NC_002677) putative transferase [Mycobacter...    72   3e-11
gb|AAL80431.1| (AE010155) glycosyltransferase [Pyrococcus fur...    70   6e-11
ref|NP_275281.1| (NC_000916) GlcNAc-phosphatidylinositol rela...    70   1e-10
ref|NP_070556.1| (NC_000917) galactosyltransferase [Archaeogl...    70   1e-10
ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. ...    69   1e-10
dbj|BAB18471.1| (AB033991) BtrM [Bacillus circulans]                68   3e-10
gb|AAL23756.1| (U52844) putative glycosyltransferase [Serrati...    68   3e-10
ref|NP_147773.1| (NC_000854) capM protein [Aeropyrum pernix] ...    68   4e-10
ref|NP_350177.1| (NC_003030) Glycosyltransferase [Clostridium...    68   4e-10
ref|NP_142415.1| (NC_000961) hypothetical protein [Pyrococcus...    67   8e-10
ref|NP_214125.1| (NC_000918) capsular polysaccharide biosynth...    66   1e-09
ref|NP_295278.1| (NC_001263) conserved hypothetical protein [...    63   1e-08
ref|NP_279218.1| (NC_002607) LPS glycosyltransferase; Lpg [Ha...    59   2e-07

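
Note on reading the hit list programmatically: each row ends with a bit score followed by an E-value, and several E-values are printed in the abbreviated form "e-130", with no leading digit. The sketch below is one possible way to split a single row (passed in as a string) into its three fields. The helper name parse_hit_row and the regular expression are illustrative assumptions and are not part of BLAST.

```python
import re

# Hypothetical helper, not part of BLAST: split one hit-list row into
# (description, bit score, E-value).
ROW = re.compile(
    r"^(?P<desc>.+?)\s+"                      # identifier and truncated title
    r"(?P<bits>\d+(?:\.\d+)?)\s+"             # bit score
    r"(?P<evalue>\d*\.?\d*e-\d+|\d+\.?\d*)\s*$"  # E-value, e.g. 0.0, 6e-94, e-130
)

def parse_hit_row(row):
    match = ROW.match(row.strip())
    if match is None:
        raise ValueError("not a hit-list row: %r" % row)
    evalue = match.group("evalue")
    # The report abbreviates 1e-130 as "e-130"; float() needs the leading digit.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return match.group("desc"), float(match.group("bits")), float(evalue)

desc, bits, evalue = parse_hit_row(
    "ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidy... 466 e-130"
)
print(desc.split()[0], bits, evalue)
```

The abbreviated exponent is the main pitfall: passing "e-130" straight to float() raises ValueError, so the leading "1" is restored first.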
Alignments
>ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, class A isoform 1;
           Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
 sp|P37287|PIGA_HUMAN N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein
           (GlcNac-PI synthesis protein)
           (Phosphatidylinositol-glycan biosynthesis, class A
           protein) (PIG-A)
 pir||A46217 GPI-anchor biosynthesis protein PIG-A - human
 dbj|BAA02019.1| (D11466) PIG-A protein [Homo sapiens]
 dbj|BAA05966.1| (D28791) PIG-A protein [Homo sapiens]
          Length = 484

 Score =  996 bits (2547), Expect = 0.0
 Identities = 484/484 (100%), Positives = 484/484 (100%)

Query: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
           MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL
Sbjct: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60

Query: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE 120
           IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE
Sbjct: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE 120

Query: 121 RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH 180
           RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH
Sbjct: 121 RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH 180

Query: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGI 240
           IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGI
Sbjct: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGI 240

Query: 241 DLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLV 300
           DLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLV
Sbjct: 241 DLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLV 300

Query: 301 QGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLE 360
           QGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLE
Sbjct: 301 QGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLE 360

Query: 361 KAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHC 420
           KAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHC
Sbjct: 361 KAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHC 420

Query: 421 GPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNEI 480
           GPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNEI
Sbjct: 421 GPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNEI 480

Query: 481 SETR 484
           SETR
Sbjct: 481 SETR 484
>pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse
 pir||I52484 gene PIG-A protein - mouse
 dbj|BAA05047.1| (D26047) Pig-a precursor [Mus musculus]
 dbj|BAA06663.1| (D31863) PIG-A protein [Mus musculus]
          Length = 485

 Score =  865 bits (2212), Expect = 0.0
 Identities = 425/485 (87%), Positives = 453/485 (92%), Gaps = 1/485 (0%)

Query: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
           MA R G G G   S + S  S G+L   RT THNICMVSDFFYPNMGGVESHIYQLSQCL
Sbjct: 1   MANRRGGGQGQPPSVSPSPGSSGNLSDDRTCTHNICMVSDFFYPNMGGVESHIYQLSQCL 60

Query: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE 120
           IERGHKVI VTHAYGNRKG+RYLT+GLKVYYLPL+VMYNQSTATTLFHSLPLLRYIFVRE
Sbjct: 61  IERGHKVITVTHAYGNRKGVRYLTNGLKVYYLPLRVMYNQSTATTLFHSLPLLRYIFVRE 120

Query: 121 RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH 180
           R+TIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH
Sbjct: 121 RITIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH 180

Query: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDS-ITIVVVSRLVYRKG 239
           IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDS IT+VVVSRLVYRKG
Sbjct: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSVITVVVVSRLVYRKG 240

Query: 240 IDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVL 299
            DLLSGIIPELCQKY +L+F+IGGEGPKRIILEEVRERYQLHDRV+LLGALEHKDVRNVL
Sbjct: 241 TDLLSGIIPELCQKYQELHFLIGGEGPKRIILEEVRERYQLHDRVQLLGALEHKDVRNVL 300

Query: 300 VQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGL 359
           VQGHIFLNTSLTEAFCMAIVEAASCGLQVVST+VGGIPEVLPE+LIILCEPSVKSLC+GL
Sbjct: 301 VQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTKVGGIPEVLPESLIILCEPSVKSLCDGL 360

Query: 360 EKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISH 419
           EKAIFQ+KSGTLPAPENIHN+VKTFYTWRNVAERTEKVY+RVS E VLPM KRLDRLISH
Sbjct: 361 EKAIFQVKSGTLPAPENIHNVVKTFYTWRNVAERTEKVYERVSKETVLPMHKRLDRLISH 420

Query: 420 CGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRGAWTNNYSHSKRGGENNE 479
           CGPVTGY+FALLAV ++LFLIFL+WMTPDS IDVAIDATGPR AWT+ +   K+  EN++
Sbjct: 421 CGPVTGYMFALLAVLSYLFLIFLQWMTPDSFIDVAIDATGPRRAWTHQWPRDKKRDENDK 480

Query: 480 ISETR 484
           IS++R
Sbjct: 481 ISQSR 485
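
Note on the middle line of each alignment block: a letter marks a column where query and subject are identical, "+" marks a non-identical pair with a positive substitution score, and a blank marks a mismatch with a non-positive score or a gap. The sketch below counts both classes for one midline string; the function name count_midline is an assumption made for illustration. It is applied to the last block of the mouse alignment above (ISETR against ISQSR).

```python
def count_midline(midline):
    """Return (identities, positives) for one BLAST midline string.

    Letters are identical columns; '+' marks conservative substitutions.
    Positives include identities, as in the report's 'Positives' figure.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Midline of the final block: Query ISETR / Sbjct ISQSR
print(count_midline("IS++R"))
```

Summing these counts over all blocks of a hit should reproduce the numerators in its "Identities" and "Positives" fields, provided the midlines are copied with their internal blanks intact.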
>ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidylinositol-like protein
           [Arabidopsis thaliana]
 gb|AAK62657.1| (AY039602) AT3g45100/T14D3_40 [Arabidopsis thaliana]
          Length = 447

 Score =  466 bits (1186), Expect = e-130
 Identities = 229/425 (53%), Positives = 310/425 (72%), Gaps = 9/425 (2%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           + MVSDFF+PN GGVE+HIY LSQCL++ GHKV+++THAYGNR G+RY+T GLKVYY+P 
Sbjct: 9   VLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYVPW 68

Query: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154
           +    Q+T  T++ +LP++R I  RE++T++H H +FS + H+AL HA+TMG + VFTDH
Sbjct: 69  RPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFTDH 128

Query: 155 SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTD 214
           SL+GFADV S+  NK+L  SL D +  ICVS+TSKENTVLR+ L+P  V +IPNAVD   
Sbjct: 129 SLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAM 188

Query: 215 FTPDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEE 273
           F P   R   D ITIVV+SRLVYRKG DLL  +IPE+C+ YP++ F++GG+GPK + LEE
Sbjct: 189 FKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLEE 248

Query: 274 VRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRV 333
           +RE++ L DRV +LGA+ H  VR+VLV GHIFLN+SLTEAFC+AI+EAASCGL  VSTRV
Sbjct: 249 MREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTRV 308

Query: 334 GGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPA--PENIHNIVKTFYTWRNVA 391
           GG+PEVLP+++++L EP    +   +EKAI       LP   PE +HN +K  Y+W++VA
Sbjct: 309 GGVPEVLPDDMVVLAEPDPDDMVRAIEKAI-----SILPTINPEEMHNRMKKLYSWQDVA 363

Query: 392 ERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSII 451
           +RTE VYDR    +   + +RL R +S CG   G +F ++ + ++L    L+ + PD  I
Sbjct: 364 KRTEIVYDRALKCSNRSLLERLMRFLS-CGAWAGKLFCMVMILDYLLWRLLQLLQPDEDI 422

Query: 452 DVAID 456
           + A D
Sbjct: 423 EEAPD 427
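
Note on the per-hit statistics line: the percentages are derived from the fractions printed next to them. The sketch below parses such a line and recomputes each percentage with integer (floor) division, which matches the values displayed for the Arabidopsis hit above (229/425 is shown as 53%, not 54%). The names STAT and parse_stats are illustrative assumptions, and whether every line in the report follows the same truncation is not checked here.

```python
import re

# Hypothetical pattern for "Name = num/den (pct%)" fields on a statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Map each field name to (numerator, denominator, shown %, recomputed %)."""
    stats = {}
    for name, num, den, shown in STAT.findall(line):
        num, den = int(num), int(den)
        stats[name] = (num, den, int(shown), 100 * num // den)
    return stats

line = " Identities = 229/425 (53%), Positives = 310/425 (72%), Gaps = 9/425 (2%)"
for name, values in parse_stats(line).items():
    print(name, values)
```

Note that the denominator is the alignment length including gap columns (425 here), not the length of either sequence.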
>pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like protein -
           Arabidopsis thaliana
 emb|CAB72148.1| (AL138649) n-acetylglucosaminyl-phosphatidylinositol-like protein
           [Arabidopsis thaliana]
          Length = 450

 Score =  454 bits (1156), Expect = e-126
 Identities = 227/428 (53%), Positives = 308/428 (71%), Gaps = 12/428 (2%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           + MVSDFF+PN GGVE+HIY LSQCL++ GHKV+++THAYGNR G+RY+T GLKVYY+P 
Sbjct: 9   VLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYVPW 68

Query: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154
           +    Q+T  T++ +LP++R I  RE++T++H H +FS + H+AL HA+TMG + VFTDH
Sbjct: 69  RPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFTDH 128

Query: 155 SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTD 214
           SL+GFADV S+  NK+L  SL D +  ICVS+TSKENTVLR+ L+P  V +IPNAVD   
Sbjct: 129 SLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAM 188

Query: 215 FTPDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEE 273
           F P   R   D ITIVV+SRLVYRKG DLL  +IPE+C+ YP++ F++GG+GPK + LEE
Sbjct: 189 FKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLEE 248

Query: 274 VRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRV 333
           +RE++ L DRV +LGA+ H  VR+VLV GHIFLN+SLTEAFC+AI+EAASCGL  VSTRV
Sbjct: 249 MREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTRV 308

Query: 334 GGI---PEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPA--PENIHNIVKTFYTWR 388
           GG     +VLP+++++L EP    +   +EKAI       LP   PE +HN +K  Y+W+
Sbjct: 309 GGFLHGLQVLPDDMVVLAEPDPDDMVRAIEKAI-----SILPTINPEEMHNRMKKLYSWQ 363

Query: 389 NVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPD 448
           +VA+RTE VYDR    +   + +RL R +S CG   G +F ++ + ++L    L+ + PD
Sbjct: 364 DVAKRTEIVYDRALKCSNRSLLERLMRFLS-CGAWAGKLFCMVMILDYLLWRLLQLLQPD 422

Query: 449 SIIDVAID 456
             I+ A D
Sbjct: 423 EDIEEAPD 430
>pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAB09127.1| (Z95620) n-acetylglucosaminyl-phosphatidylinositol
           [Schizosaccharomyces pombe]
          Length = 456

 Score =  447 bits (1139), Expect = e-124
 Identities = 221/421 (52%), Positives = 294/421 (69%), Gaps = 3/421 (0%)

Query: 37  MVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKV 96
           MVSDFF+P  GG+ESHI+QLSQ LI+ GHKVI++THAY +R G+RYLT+GL VYY+PL  
Sbjct: 1   MVSDFFFPQPGGIESHIFQLSQRLIDLGHKVIVITHAYKDRVGVRYLTNGLTVYYVPLHT 60

Query: 97  MYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSL 156
           +Y ++T  + F   P+ R I +RE + I+H H S S + HDA+ HA+TMGL+T FTDHSL
Sbjct: 61  VYRETTFPSFFSFFPIFRNIVIRENIEIVHGHGSLSFLCHDAILHARTMGLKTCFTDHSL 120

Query: 157 FGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFT 216
           FGFAD  S++TNKLL  ++ D NH+ICVS+T +ENTVLRA LNP+ VSVIPNA+   +F 
Sbjct: 121 FGFADAGSIVTNKLLKFTMSDVNHVICVSHTCRENTVLRAVLNPKRVSVIPNALVAENFQ 180

Query: 217 PDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVR 275
           PDP +   D +TIVV+SRL Y KGIDLL  +IP +C ++P + F+I G+GPK I LE++R
Sbjct: 181 PDPSKASKDFLTIVVISRLYYNKGIDLLIAVIPRICAQHPKVRFVIAGDGPKSIDLEQMR 240

Query: 276 ERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGG 335
           E+Y L DRV +LG++ H  VR+V+V+GHI+L+ SLTEAF   +VEAASCGL V+ST+VGG
Sbjct: 241 EKYMLQDRVEMLGSVRHDQVRDVMVRGHIYLHPSLTEAFGTVLVEAASCGLYVISTKVGG 300

Query: 336 IPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTE 395
           +PEVLP ++     P    L + L   I       +   E  H  VK  Y+W +VAERTE
Sbjct: 301 VPEVLPSHMTRFARPEEDDLADTLSSVITDYLDHKIKT-ETFHEEVKQMYSWIDVAERTE 359

Query: 396 KVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAI 455
           KVYD +  E  L +  RL +L   CG   G +F LL   ++L ++ L W+ P S ID A+
Sbjct: 360 KVYDSICSENNLRLIDRL-KLYYGCGQWAGKLFCLLIAIDYLVMVLLEWIWPASDIDPAV 418

Query: 456 D 456
           D
Sbjct: 419 D 419
>ref|NP_495840.1| (NM_063439) phosphatidylinositol biosynthetic protein
           [Caenorhabditis elegans]
 pir||T20374 hypothetical protein D2085.6 - Caenorhabditis elegans
 emb|CAA91062.1| (Z54284) contains similarity to Pfam domain: PF00534 (Glycosyl
           transferases group 1), Score=91.6, E-value=9.5e-25,
           N=1~cDNA EST yk349e7.5 comes from this gene
           [Caenorhabditis elegans]
          Length = 444

 Score =  431 bits (1097), Expect = e-119
 Identities = 232/443 (52%), Positives = 301/443 (67%), Gaps = 16/443 (3%)

Query: 33  HNICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYL 92
           ++I +VSDFF PN GGVE+HIY L+QCLIE GH+V+++TH YGNRKGIRYL++GLKVYYL
Sbjct: 8   YSIALVSDFFCPNAGGVETHIYFLAQCLIELGHRVVVITHGYGNRKGIRYLSNGLKVYYL 67

Query: 93  PLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFT 152
           P  V YN +T  ++  S+P LR + +RE V IIH HS+FS++AH+ L     MGL+TVFT
Sbjct: 68  PFIVAYNGATLGSIVGSMPWLRKVLLRENVQIIHGHSTFSSLAHETLMIGGLMGLRTVFT 127

Query: 153 DHSLFGFADVSSVLTNKL-LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVD 211
           DHSLFGFAD S++LTNKL L  SL + +  ICVSYTSKENTVLR  L+P  VS IPNA++
Sbjct: 128 DHSLFGFADASAILTNKLVLQYSLINVDQTICVSYTSKENTVLRGKLDPNKVSTIPNAIE 187

Query: 212 PTDFTPDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRII 270
            + FTPD  +  ++  TIV + RLVYRKG DLL  I+P++C ++  + FIIGG+GPKRI 
Sbjct: 188 TSLFTPDRNQFFNNPTTIVFLGRLVYRKGADLLCEIVPKVCARHKSVRFIIGGDGPKRIE 247

Query: 271 LEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVS 330
           LEE+ ER++LH+RV +LG L H  V+ VL QG IF+NTSLTEAFCM+IVEAASCGL VVS
Sbjct: 248 LEEMLERFKLHERVVILGMLPHNQVKRVLNQGQIFINTSLTEAFCMSIVEAASCGLHVVS 307

Query: 331 TRVGGIPEVLP-ENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRN 389
           TRVGG+PEVLP    I L EP    L + L KA+ + + G L  P   H  V   Y W +
Sbjct: 308 TRVGGVPEVLPIGEFISLEEPVPDDLVDALLKAVDRREKGLLMDPTEKHEAVSKMYNWPD 367

Query: 390 VAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDS 449
           VA RT+ +Y + +VE+      RL RL  +     G  F ++ +     +IF  W+T   
Sbjct: 368 VAARTQVIYQK-AVES--EPTGRLGRLKGYYDQGIG--FGIMYIVVSCIIIF--WLTVLD 420

Query: 450 IIDVAIDATGPRGAWTNNYSHSK 472
           + D       PR   TN+ +  K
Sbjct: 421 LFD------SPRKNGTNDKTSEK 437
>pir||I52665 class A GlcNAc-inositol phospholipid assembly protein PIG-A - human
 gb|AAD14160.1|S74936_1 (S74936) class A GlcNAc-inositol phospholipid assembly protein
           [Homo sapiens]
          Length = 315

 Score =  418 bits (1064), Expect = e-115
 Identities = 202/202 (100%), Positives = 202/202 (100%)

Query: 283 RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE 342
           RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE
Sbjct: 114 RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE 173

Query: 343 NLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVS 402
           NLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVS
Sbjct: 174 NLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVS 233

Query: 403 VEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRG 462
           VEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRG
Sbjct: 234 VEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSIIDVAIDATGPRG 293

Query: 463 AWTNNYSHSKRGGENNEISETR 484
           AWTNNYSHSKRGGENNEISETR
Sbjct: 294 AWTNNYSHSKRGGENNEISETR 315
 Score =  244 bits (618), Expect = 2e-63
 Identities = 124/162 (76%), Positives = 131/162 (80%), Gaps = 10/162 (6%)

Query: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
           MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL
Sbjct: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60

Query: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE 120
           IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLR   +  
Sbjct: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRVRLLGA 120

Query: 121 ------RVTIIHSH----SSFSAMAHDALFHAKTMGLQTVFT 152
                 R  ++  H    +S +     A+  A + GLQ V T
Sbjct: 121 LEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVST 162
>ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol glycan, class A isoform
           1; Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
          Length = 280

 Score =  417 bits (1060), Expect = e-115
 Identities = 212/250 (84%), Positives = 222/250 (88%), Gaps = 1/250 (0%)

Query: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
           MA +G  G+G   SATLS+VSPGSLYT RT THNICM SDFFYPNMGGVESHIYQL QCL
Sbjct: 1   MAYKGEGGHGQPPSATLSQVSPGSLYTRRTHTHNICMASDFFYPNMGGVESHIYQLPQCL 60

Query: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRE 120
           I RG KVIIV HAYGNRKGIRYLT+ LKVYYLPLKVMYNQS A TLFHSLPLL+YIFV+E
Sbjct: 61  IGRGDKVIIVIHAYGNRKGIRYLTNDLKVYYLPLKVMYNQSMAMTLFHSLPLLKYIFVQE 120

Query: 121 RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLLTVSLCDTNH 180
           RVTIIHSHSSFSAMAHD LFHAKTMGLQTV TDH L GFA V SVLTNKLLTVSLCDT+ 
Sbjct: 121 RVTIIHSHSSFSAMAHDVLFHAKTMGLQTVLTDHPLSGFAKVHSVLTNKLLTVSLCDTSR 180

Query: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGI 240
           IICVSYTSKENTVLRAAL  EIVSVIPNAVDP DFTPDPFRRHDSITI VVSRLVYRKG 
Sbjct: 181 IICVSYTSKENTVLRAALITEIVSVIPNAVDPIDFTPDPFRRHDSITI-VVSRLVYRKGT 239

Query: 241 DLLSGIIPEL 250
           +L+SGIIP+L
Sbjct: 240 NLVSGIIPKL 249
>gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila melanogaster]
          Length = 479

 Score =  415 bits (1055), Expect = e-114
 Identities = 204/318 (64%), Positives = 254/318 (79%), Gaps = 3/318 (0%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++++THAYG+  GIRY+T  LKVYYLP+
Sbjct: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62

Query: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154
           KV YNQ    T   ++P+LR + +RERV ++H HS+FSA+AH+AL     +GL+TVFTDH
Sbjct: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122

Query: 155 SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTD 214
           SLFGFAD+S+ LTN LL V+L   NH ICVS+  KENTVLRA +    VSVIPNAVD   
Sbjct: 123 SLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTAL 182

Query: 215 FTPDPFRR--HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILE 272
           FTPDP +R  +D I IVV SRLVYRKGIDLL+GIIP   +  P++NFII G+GPKR +LE
Sbjct: 183 FTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDLLE 241

Query: 273 EVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTR 332
           E+RE+  + +RV+++GA+EH  VR+ LV+GHIFLNTSLTEA+CMAIVEAASCGLQVVST 
Sbjct: 242 EIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTS 301

Query: 333 VGGIPEVLPENLIILCEP 350
           VGGIPEVLP++LI+L EP
Sbjct: 302 VGGIPEVLPKSLILLAEP 319
 Score = 35.7 bits (81), Expect = 1.7
 Identities = 24/83 (28%), Positives = 37/83 (43%), Gaps = 5/83 (6%)

Query: 374 PENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAV 433
           P   + +V+T Y W +VA RT KVYDRV  E      + +  +  H      +      V
Sbjct: 396 PYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQH----GSWFLVFFVV 451

Query: 434 FNFLFLIFLRWMTPDSIIDVAID 456
            +FL  +   W  P   +++A D
Sbjct: 452 AHFLMRLLELW-RPRKHVEIAQD 473
>gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia]
          Length = 442

 Score =  404 bits (1029), Expect = e-111
 Identities = 202/421 (47%), Positives = 284/421 (66%), Gaps = 16/421 (3%)

Query: 34  NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
           NIC++ DFFYP +GGVE HI+QL  CLIERG KVII+TH Y  R G+RY+T+GLKVYY P
Sbjct: 3   NICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYCP 62

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
                      T   +LP+ R I +RE + I+HSH++ S +  + L HAK+MG +TVFTD
Sbjct: 63  FIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFTD 122

Query: 154 HSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPT 213
           HSLF F D +S   NK+L   LC+ +H I VS+ SKEN  +RA+L+P  +SVIPNAVD +
Sbjct: 123 HSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDCS 182

Query: 214 DFTPDPFRRH--DSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIIL 271
            FTP+P +R+  ++I IVV+ R+ +RKG+DLL  ++  +C+++P++ FIIGG+GPK+ IL
Sbjct: 183 RFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKIL 242

Query: 272 EEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVST 331
           EE  +RY L ++  LLG++    V++VL +GHIFLNTSLTEAFC+AIVEAASCGL VVST
Sbjct: 243 EETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVST 302

Query: 332 RVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENI-----HNIVKTFYT 386
            VGGI EVLP+N+++  +P+ + +   + +AI        P  +N      H +VK  Y+
Sbjct: 303 NVGGISEVLPQNMVLYADPTPEDISHKITQAI--------PIAKNFYVYQQHELVKKMYS 354

Query: 387 WRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMT 446
           W  VAERTEKVY ++       + KR     S+ G + G    +L +F+ +FL+ L ++ 
Sbjct: 355 WEQVAERTEKVYYKILQTQNQTILKRFKDCYSN-GQIYGLFLMILLIFDLIFLMILDFLQ 413

Query: 447 P 447
           P
Sbjct: 414 P 414
>ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein; Spt14p [Saccharomyces cerevisiae]
 sp|P32363|GPI3_YEAST N-ACETYLGLUCOSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN
           (GLCNAC-PI SYNTHESIS PROTEIN)
 emb|CAA44924.1| (X63290) trans-acting transcription factor [Saccharomyces
           cerevisiae]
          Length = 452

 Score =  383 bits (975), Expect = e-105
 Identities = 196/433 (45%), Positives = 287/433 (66%), Gaps = 14/433 (3%)

Query: 34  NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
           NI M+ DFFYP +GGVE HIY LSQ LI+ GH V+I+THAY +R G+R+LT+GLKVY++P
Sbjct: 4   NIAMLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVP 63

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
             V++ ++T  T+F + P++R I +RE++ I+HSH S S  AH+ + HA TMGL+TVFTD
Sbjct: 64  FFVIFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTD 123

Query: 154 HSLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPT 213
           HSL+GF +++S+  NKLLT +L + + +ICVS T KEN ++R  L+P+I+SVIPNAV   
Sbjct: 124 HSLYGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSE 183

Query: 214 DFTP-DPF------RRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGP 266
           DF P DP       +  D I IVV+ RL   KG DLL+ IIP++C  + D+ FI+ G+GP
Sbjct: 184 DFKPRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGP 243

Query: 267 KRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGL 326
           K I  +++ E ++L  RV+LLG++ H+ VR+VL QG I+L+ SLTEAF   +VEAASC L
Sbjct: 244 KFIDFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNL 303

Query: 327 QVVSTRVGGIPEVLPENLIILCE-PSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFY 385
            +V+T+VGGIPEVLP  + +  E  SV  L +   KAI  ++S  L    + H+ V   Y
Sbjct: 304 LIVTTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDT-SSFHDSVSKMY 362

Query: 386 TWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHC----GPVTGYIFALLAVFNFLFLIF 441
            W +VA+RT ++Y  +S  +    DK   +++++     G    +++ L  +  ++    
Sbjct: 363 DWMDVAKRTVEIYTNISSTSSAD-DKDWMKMVANLYKRDGIWAKHLYLLCGIVEYMLFFL 421

Query: 442 LRWMTPDSIIDVA 454
           L W+ P   ID+A
Sbjct: 422 LEWLYPRDEIDLA 434
>pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Saccharomyces
           cerevisiae)
 emb|CAA97882.1| (Z73531) ORF YPL175w [Saccharomyces cerevisiae]
          Length = 461

 Score =  379 bits (964), Expect = e-104
 Identities = 194/430 (45%), Positives = 285/430 (66%), Gaps = 14/430 (3%)

Query: 37  MVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKV 96
           M+ DFFYP +GGVE HIY LSQ LI+ GH V+I+THAY +R G+R+LT+GLKVY++P  V
Sbjct: 16  MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 75

Query: 97  MYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSL 156
           ++ ++T  T+F + P++R I +RE++ I+HSH S S  AH+ + HA TMGL+TVFTDHSL
Sbjct: 76  IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 135

Query: 157 FGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFT 216
           +GF +++S+  NKLLT +L + + +ICVS T KEN ++R  L+P+I+SVIPNAV   DF 
Sbjct: 136 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 195

Query: 217 P-DPF------RRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRI 269
           P DP       +  D I IVV+ RL   KG DLL+ IIP++C  + D+ FI+ G+GPK I
Sbjct: 196 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 255

Query: 270 ILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVV 329
             +++ E ++L  RV+LLG++ H+ VR+VL QG I+L+ SLTEAF   +VEAASC L +V
Sbjct: 256 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 315

Query: 330 STRVGGIPEVLPENLIILCE-PSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWR 388
           +T+VGGIPEVLP  + +  E  SV  L +   KAI  ++S  L    + H+ V   Y W 
Sbjct: 316 TTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDT-SSFHDSVSKMYDWM 374

Query: 389 NVAERTEKVYDRVSVEAVLPMDKRLDRLISHC----GPVTGYIFALLAVFNFLFLIFLRW 444
           +VA+RT ++Y  +S  +    DK   +++++     G    +++ L  +  ++    L W
Sbjct: 375 DVAKRTVEIYTNISSTSSAD-DKDWMKMVANLYKRDGIWAKHLYLLCGIVEYMLFFLLEW 433

Query: 445 MTPDSIIDVA 454
           + P   ID+A
Sbjct: 434 LYPRDEIDLA 443
>prf||1804343A SPT14 gene [Saccharomyces cerevisiae]
          Length = 415

 Score =  369 bits (937), Expect = e-101
 Identities = 184/374 (49%), Positives = 261/374 (69%), Gaps = 9/374 (2%)

Query: 37  MVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKV 96
           M+ DFFYP +GGVE HIY LSQ LI+ GH V+I+THAY +R G+R+LT+GLKVY++P  V
Sbjct: 1   MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 60

Query: 97  MYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSL 156
           ++ ++T  T+F + P++R I +RE++ I+HSH S S  AH+ + HA TMGL+TVFTDHSL
Sbjct: 61  IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 120

Query: 157 FGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFT 216
           +GF +++S+  NKLLT +L + + +ICVS T KEN ++R  L+P+I+SVIPNAV   DF 
Sbjct: 121 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 180

Query: 217 P-DPF------RRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRI 269
           P DP       +  D I IVV+ RL   KG DLL+ IIP++C  + D+ FI+ G+GPK I
Sbjct: 181 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 240

Query: 270 ILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVV 329
             +++ E ++L  RV+LLG++ H+ VR+VL QG I+L+ SLTEAF   +VEAASC L +V
Sbjct: 241 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 300

Query: 330 STRVGGIPEVLPENLIILCE-PSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWR 388
           +T+VGGIPEVLP  + +  E  SV  L +   KAI  ++S  L    + H+ V   Y W 
Sbjct: 301 TTQVGGIPEVLPNEMTVYAEQTSVSDLVQATNKAINIIRSKALDT-SSFHDSVSKMYDWM 359

Query: 389 NVAERTEKVYDRVS 402
           +VA+RT ++Y  +S
Sbjct: 360 DVAKRTVEIYTNIS 373
>emb|CAB57276.1| (X77725) PIG-A [Homo sapiens]
          Length = 248

 Score =  346 bits (878), Expect = 6e-94
 Identities = 172/181 (95%), Positives = 173/181 (95%), Gaps = 2/181 (1%)

Query: 228 IVVVSRLVYRKG--IDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVR 285
           I+V      RKG  IDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVR
Sbjct: 68  IIVTHAYGNRKGIRIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVR 127

Query: 286 LLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLI 345
           LLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLI
Sbjct: 128 LLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLI 187

Query: 346 ILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEA 405
           ILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEA
Sbjct: 188 ILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEA 247

Query: 406 V 406
           V
Sbjct: 248 V 248
 Score =  178 bits (447), Expect = 2e-43
 Identities = 84/88 (95%), Positives = 85/88 (96%), Gaps = 1/88 (1%)

Query: 1  MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
          MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL
Sbjct: 1  MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60

Query: 61 IERGHKVIIVTHAYGNRKGIRY-LTSGL 87
          IERGHKVIIVTHAYGNRKGIR  L SG+
Sbjct: 61 IERGHKVIIVTHAYGNRKGIRIDLLSGI 88
>ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, class A isoform 2;
           Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
          Length = 118

 Score =  240 bits (607), Expect = 4e-62
 Identities = 114/114 (100%), Positives = 114/114 (100%)

Query: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60
           MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL
Sbjct: 1   MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCL 60

Query: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLR 114
           IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLR
Sbjct: 61  IERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQSTATTLFHSLPLLR 114
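
Note: the bit scores and Expect values in this report are linked by the
Karlin-Altschul relation E = m * n * 2^(-S'), where S' is the bit score, m the
query length and n the database length. The sketch below is an illustration,
not BLAST's own computation: it plugs in the raw lengths from the report header
(484 query letters, 277,845,442 database letters), whereas BLAST uses shorter
"effective" lengths. The estimates should therefore come out somewhat larger
than the printed values while staying within the same order of magnitude.

```python
QUERY_LETTERS = 484               # "(484 letters)" in the report header
DATABASE_LETTERS = 277_845_442    # "277,845,442 total letters" in the report header

def approx_evalue(bit_score, m=QUERY_LETTERS, n=DATABASE_LETTERS):
    """Raw-length estimate E = m * n * 2**(-S'); an upper-side approximation."""
    return m * n * 2.0 ** (-bit_score)

# (bit score, Expect as printed in this report)
for bits, printed in [(240, "4e-62"), (117, "6e-25"), (102, "1e-20")]:
    print(f"{bits} bits: estimate {approx_evalue(bits):.1e}, printed {printed}")
```

Because the search space enters only as the factor m * n, each extra bit of
score halves the expected number of chance hits.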
>ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
           PROTEIN [Pyrococcus abyssi]
 pir||A75033 probable hexosyltransferase (EC 2.4.1.-) PAB0827 [similarity] -
           Pyrococcus abyssi (strain Orsay)
 emb|CAB50158.1| (AJ248287) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
           PROTEIN [Pyrococcus abyssi]
          Length = 371

 Score =  117 bits (290), Expect = 6e-25
 Identities = 99/368 (26%), Positives = 179/368 (47%), Gaps = 17/368 (4%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           I +VSD+++P +GGV  H++ L+  L + GH+V IVT+A  N K       G+ +  +P 
Sbjct: 6   IALVSDWYFPKIGGVAIHVHNLAIHLRKMGHEVSIVTNALTNGKEGELQKYGIDLIKVPG 65

Query: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154
            +    + +     S  L+ Y+   +   ++H+  +F+ ++  ++     +G  T+ T+H
Sbjct: 66  LIKDGINLSMIAKSSNSLVEYL---KGFDVVHAQHAFTPLSLKSIPAGNKVGALTLVTNH 122

Query: 155 SL----FGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAV 210
           S+    F   +  S ++     + L      I VS  S   + LR   N  IV  IPN V
Sbjct: 123 SVEFENFSILNGFSKMSYSYFKMYLGQVKVGIGVSKASV--SFLRKFTNAPIVE-IPNGV 179

Query: 211 DPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRII 270
           +   F     R   +  I+ V RL  RKG++ L   +     K+ +    I G+G  R +
Sbjct: 180 NIERFNGRG-REWGTRNILYVGRLEPRKGVNYLISAM-----KFVEGKLTIVGDGSMRKV 233

Query: 271 LEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVS 330
           L+   ++  + D+V  LG +  +++  +  +  +F+  SL+EAF + ++EA +  + V+ 
Sbjct: 234 LKMQAKKLGVEDKVEFLGFISQEELILLYKKSEVFVLPSLSEAFGIVLLEAMASEVPVIG 293

Query: 331 TRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNV 390
           T VGGIPE++ +  II+     K+L   +   +   K+            V+  Y+W  V
Sbjct: 294 TSVGGIPEIIGDAGIIVPPRDSKALANAINAILSNQKTAKRLGKLG-RKRVERLYSWDVV 352

Query: 391 AERTEKVY 398
           AERTE++Y
Sbjct: 353 AERTERLY 360
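
Note: in every alignment row the start and end coordinates count residues
only; gap characters ("-") occupy a column but do not advance the coordinate.
The sketch below is a hypothetical consistency check (the name check_row is an
assumption, not a BLAST feature) that confirms a row's residue count equals
end - start + 1.

```python
import re

# "Query: 61 IERGH...-LTSGL 87" -> label, start, aligned string, end
ROW = re.compile(r"^(Query|Sbjct):\s+(\d+)\s+(\S+)\s+(\d+)\s*$")

def check_row(line):
    """Return (label, start, end, ok); ok is True when coordinates match residues."""
    m = ROW.match(line)
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, seq, end = m.group(1), int(m.group(2)), m.group(3), int(m.group(4))
    residues = len(seq) - seq.count("-")   # gap columns carry no residue
    return label, start, end, residues == end - start + 1

# Rows copied from the second HSP of emb|CAB57276.1| in this report.
rows = [
    "Query: 61 IERGHKVIIVTHAYGNRKGIRY-LTSGL 87",
    "Sbjct: 61 IERGHKVIIVTHAYGNRKGIRIDLLSGI 88",
]
for row in rows:
    print(check_row(row))
```

The middle "match" line of each triplet has no coordinates and is skipped by
this check.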
>ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||F71196 probable hexosyltransferase (EC 2.4.1.-) PH1844 - Pyrococcus
           horikoshii
 dbj|BAA30965.1| (AP000007) 381aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 381

 Score =  112 bits (278), Expect = 1e-23
 Identities = 114/397 (28%), Positives = 180/397 (44%), Gaps = 49/397 (12%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGN-------RKGIRYLT-SG 86
           I +VSD++YP +GGV +H++ L+  L ERGH+V IVT+           R GI  +   G
Sbjct: 6   IALVSDWYYPKIGGVATHMHNLAIKLRERGHEVGIVTNNRPTGKEEELKRYGIELIKIPG 65

Query: 87  LKVYYLPLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMG 146
           +   +L + + Y   ++  L   L         +   IIHSH +F+ ++  AL   K M 
Sbjct: 66  IISPFLDVNLTYGLKSSEELNEFL---------KDFDIIHSHHAFTPLSLKALKAGKNME 116

Query: 147 LQTVFTDHSLFGFADVSSV-----LTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPE 201
             T+ T HS+  FA  S +      T  L    L  ++ II VS  +K            
Sbjct: 117 KGTLLTTHSI-SFAHESKLWDTLGFTIPLFKSYLKYSHRIIAVSKAAKS---FIEHFTSV 172

Query: 202 IVSVIPNAVDPTDFTPDPFRRHDSI---------TIVVVSRLVYRKGIDLLSGIIPELCQ 252
            V ++PN VD   F P   R  + I          ++ VSR+ YRKG  +L         
Sbjct: 173 PVLIVPNGVDDERFFPA--RDKEKIKAKFGLEGNVVLYVSRMSYRKGPHVLLNAF----S 226

Query: 253 KYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSL-T 311
           K  D   ++ G G     L+   +   + ++V  +G +    +  V     +F+  S+ +
Sbjct: 227 KIEDATLVMVGNGEMLPFLKAQTKFLGIENKVVFMGYVPDDILPEVFRMADVFVLPSISS 286

Query: 312 EAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLC--EGLEKAIFQLKSG 369
           EAF + I+EA + G+ +++T VGGIPEV+ EN   L  P    L   E +EK    LK+ 
Sbjct: 287 EAFGIVILEAMASGVPIIATDVGGIPEVIKENSAGLLVPPGNELKLREAIEKL---LKNE 343

Query: 370 TLPA--PENIHNIVKTFYTWRNVAERTEKVYDRVSVE 404
            L      N    V+  Y+W  +  + E++Y+ V  E
Sbjct: 344 ELRKWYGNNGRRSVEEKYSWNKIVVKIERIYNEVLQE 380
>ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Aeropyrum pernix]
 pir||D72511 probable hexosyltransferase (EC 2.4.1.-) APE2066 [similarity] -
           Aeropyrum pernix (strain K1)
 dbj|BAA81076.1| (AP000063) 392aa long hypothetical
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Aeropyrum pernix]
          Length = 392

 Score =  102 bits (253), Expect = 1e-20
 Identities = 103/380 (27%), Positives = 173/380 (45%), Gaps = 25/380 (6%)

Query: 31  RTHNICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVY 90
           R   I MV DF   ++GGV+SH+  L++ L + G+ V+IV+ A G          G  + 
Sbjct: 18  RGSRIVMVMDFHPSSVGGVQSHVRDLTRLLQDFGYDVVIVSRALGKGDVKDLEAEGHYIV 77

Query: 91  --YLPLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQ 148
               PL++++     + L   +  L       +  ++HSH  ++  +  AL  A+ +GL 
Sbjct: 78  KPLFPLEIIFVPPDPSDLRREIESL-------KPDVVHSHHIYTLTSLLALKAARDLGLP 130

Query: 149 TVFTDHSLFGFAD-------VSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPE 201
            + T+HS+F   D        S VL  + L   L +   +I VS T+ +  V     +  
Sbjct: 131 RIATNHSIFLAYDKVALWRIASIVLPTRYL---LPNAQAVISVS-TAADKMVEGIVGDSV 186

Query: 202 IVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFII 261
              +IPN VD   F P    + D   ++ + RLV+RKG  +L      +  +  D    I
Sbjct: 187 DRYIIPNGVDVERFKPST-PKADYPLVLFLGRLVWRKGAHVLVRAFRHVVDEIRDAKLYI 245

Query: 262 GGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSL-TEAFCMAIVE 320
           GG+G    I++ +  RY L + V++LG +   +  ++     +    S+  E+F +  +E
Sbjct: 246 GGKGEFEPIIKLLIARYGLENNVKMLGVVPESEKPSLYSSAWVTAVPSIVNESFGIVALE 305

Query: 321 AASCGLQVVSTRVGGIPEVLPENLI-ILCEP-SVKSLCEGLEKAIFQLKSGTLPAPENIH 378
           + S G  VV++R GG+ +V+      +L +P S K L + L   + Q         E   
Sbjct: 306 SLSSGTPVVASRQGGLKDVVKHGKTGLLVKPGSSKELAKAL-ITLLQDSGLRKRMSEEAR 364

Query: 379 NIVKTFYTWRNVAERTEKVY 398
            IV   Y WR V  +  KVY
Sbjct: 365 KIVLERYDWRKVVPQILKVY 384
>ref|NP_266369.1| (NC_002662) LPS biosynthesis protein [Lactococcus lactis subsp.
           lactis]
 gb|AAK04311.1|AE006259_5 (AE006259) LPS biosynthesis protein [Lactococcus lactis subsp.
           lactis]
          Length = 379

 Score = 99.2 bits (244), Expect = 1e-19
 Identities = 92/387 (23%), Positives = 183/387 (46%), Gaps = 36/387 (9%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           + + + ++ P++GGVE + Y +++ L E+G++VII+T  +        +  G+K+Y LP+
Sbjct: 6   VAIFNGYYIPHLGGVERYTYNIAKKLTEKGYRVIIITTQHDENLTNEEIQEGIKIYRLPI 65

Query: 95  KVMYNQS----TATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTV 150
           K ++           ++HS  L+  I   E +    +++ F   A   +  AK  G + +
Sbjct: 66  KNLWKNRYPFLKKNRIYHS--LIEKIEA-ESIDYYVANTRFHLPAMLGVKMAKAKGKEAI 122

Query: 151 FTDHSLFGFADVSSVLT--NKLLTVSLCDTNHIICVSYTSKENTVLRAALNP-------- 200
             +H        SS LT  N +L   L     ++ +    K+ ++     N         
Sbjct: 123 VIEHG-------SSYLTLNNPVLDFMLRKIEQLL-IGRVKKDTSLFYGVSNEASEWLKTF 174

Query: 201 --EIVSVIPNAVDPTDFTPDPFRRHD-SITIVVVSRLVYR-KGIDLLSGIIPELCQKYPD 256
             +   V+PNAV   ++      + +  +TI    RL+ + KG+++L     +L ++  +
Sbjct: 175 DIKAKGVLPNAVAVDEYFNQKIEKDEKKLTISYAGRLIPQMKGVEILLSTFSKLSKERKN 234

Query: 257 LNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCM 316
           L  II G+GP   +L EV+ +Y     ++ LG + ++ V  +  +  +F+  S +E F  
Sbjct: 235 LELIIAGDGP---LLNEVKRKYS-QKNIKFLGYVPYEKVLEIDAKSDVFVLMSRSEGFAT 290

Query: 317 AIVEAASCGLQVVST-RVGGIPEVLP-ENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAP 374
           A++EAA     +++T  VGG  +++P E    + E +   L E L K +   +   L   
Sbjct: 291 AMLEAAMLENVIITTPTVGGARDIMPDETYGYIIENNETKLFETLTKVLDNKEHMRLMQK 350

Query: 375 ENIHNIVKTFYTWRNVAERTEKVYDRV 401
           +   N+++ F TW   A++  KV++ +
Sbjct: 351 KISKNVLENF-TWEQSAKQFIKVFNEL 376
>ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related protein
           [Methanothermobacter thermautotrophicus]
 pir||C69098 probable hexosyltransferase (EC 2.4.1.-) MTH173 - Methanobacterium
           thermoautotrophicum (strain Delta H)
 gb|AAB84679.1| (AE000805) LPS biosynthesis RfbU related protein
           [Methanothermobacter thermautotrophicus]
          Length = 382

 Score = 93.3 bits (229), Expect = 8e-18
 Identities = 89/333 (26%), Positives = 149/333 (44%), Gaps = 48/333 (14%)

Query: 35  ICMVSDFFYPNM-GGVESHIYQLSQCLIERGHKVIIVT---HAYGNRKGIRYLTSGLKVY 90
           I +VSDFF P+  GG E   +++++ L+ERGH V +++   H  G  + +    SG++V+
Sbjct: 6   ILIVSDFFVPHYNGGGERRYFEIARRLVERGHVVDVISMGIHGVGEYEEV----SGVRVH 61

Query: 91  YLPLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHA----KTMG 146
           +L  ++         L   L  +R++    R  + H +    A  +  L  A    +  G
Sbjct: 62  HLGPRI-----RKPPLRGPLDFIRFMAAAFRWVMTHDYDIIDAQTYAPLLPAFLASRIHG 116

Query: 147 LQTVFTDHSLFGFADVSSVLTNKLLTVSLCDT-----------NHIICVSYTSKENTVLR 195
              V T H      DVSS   ++ L  S   T           + +I VS ++       
Sbjct: 117 TPMVATIH------DVSSAHGDQWLQSSKTATILERVLMRLPYDGVITVSRSTASALTEL 170

Query: 196 AALNPEIVSVIPNAVDPTDFTPDPFRRHDSIT------IVVVSRLVYRKGIDLLSGIIPE 249
              NP+ + +IPN VDP           DS+T      I+ V RL   K +D L  +  +
Sbjct: 171 HGRNPDGIHIIPNGVDPELI--------DSVTPATGNYIIFVGRLAPHKHVDHLIEVFSK 222

Query: 250 LCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTS 309
           L   +PDL   I G+G +R  L+ + +   + D V     L + +V + +    + +  S
Sbjct: 223 LVIDFPDLRLEIIGDGVERARLKAMVDECGIRDSVTFHHNLSYPEVISRIRGARVLVLPS 282

Query: 310 LTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE 342
             E F M + EA +CG+  V+ R GG+ EV+ +
Sbjct: 283 TREGFGMVLAEAGACGVPAVAYRSGGVVEVIDD 315
>gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus furiosus DSM 3638]
          Length = 358

 Score = 89.1 bits (218), Expect = 2e-16
 Identities = 104/371 (28%), Positives = 166/371 (44%), Gaps = 34/371 (9%)

Query: 53  IYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVM-YNQSTATTLFHSLP 111
           ++ L+  L ERGH+V IVT+     K       G+ +  +P  V    +   T    S  
Sbjct: 1   MHNLAIKLRERGHEVGIVTNNRVTGKEKELEKYGIDLIKIPGVVSPLLEVNITYGLKSSE 60

Query: 112 LLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSSVLTNKLL 171
           L  ++       +IHSH +F  +A  A+   +TM   T+ T HS+  FA  S +     L
Sbjct: 61  LNEFL---NNFDVIHSHHAFMPLALKAVKAGRTMEKATLLTTHSI-SFAHESKLWDTLGL 116

Query: 172 TVSLCDT-----NHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSI 226
           T+ L  +     + II VS  +K       +++   VS++PN VD T F P   +  D I
Sbjct: 117 TIPLFRSYLKYPHRIIAVSKAAKSFIEHFTSVS---VSIVPNGVDDTRFFPA--KHKDKI 171

Query: 227 T---------IVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRER 277
                     ++ VSR+ YRKG  +L         K  D   ++ G G     L+   + 
Sbjct: 172 KAKFGLEGNIVLYVSRMSYRKGPHVLLNAF----SKIEDATLVMVGSGEMLPFLKAQAKF 227

Query: 278 YQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLT-EAFCMAIVEAASCGLQVVSTRVGGI 336
             + +RV  +G +    +  V     +F+  S++ EAF + ++EA + G+ VV+T VGGI
Sbjct: 228 LGIEERVVFMGYVPDDALPEVFRMADVFVLPSVSAEAFGIVVLEAMASGVPVVATDVGGI 287

Query: 337 PEVLPENLIILCEPSVKSLCEGLEKAIFQ-LKSGTLPA--PENIHNIVKTFYTWRNVAER 393
           PE++ EN   L  P    L   L +A  + LK+  L      N    V+  Y+W  +   
Sbjct: 288 PEIIKENEAGLLVPPGNEL--KLREATQKLLKNEELRKWYGMNGRKAVEEKYSWDKIVVE 345

Query: 394 TEKVYDRVSVE 404
            E++Y  V  E
Sbjct: 346 IERIYSEVLEE 356
>gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus furiosus DSM 3638]
          Length = 373

 Score = 84.8 bits (207), Expect = 3e-15
 Identities = 85/320 (26%), Positives = 153/320 (47%), Gaps = 41/320 (12%)

Query: 32  THNICMVSDFFYPNM-GGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVY 90
           T  I  + D  YP + GGVE  +Y++++ L E+ H+V I  + + + K I+ +     ++
Sbjct: 4   TLRIAFIYDVIYPWVKGGVERRLYEIAKRLAEK-HEVHIYGYKHWDGKKIQEMNG---IF 59

Query: 91  Y----LPLKVMYNQSTAT--TLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKT 144
           Y     P K+ +    A    +FHS+ LL ++   + + II       A  +   + ++ 
Sbjct: 60  YHGTIKPKKIYHGNRRAILPPIFHSINLL-FLLKGQHLDIIDCQ----ATPYFPCYASRV 114

Query: 145 MGLQTVFTDHSLFGFADVSSV----LTNKLLTVSL-CDTNHIICVSYTSKENTVLRAALN 199
                V T H  +G   +  +       K++   L   T++ I VS  +K++ + +A L 
Sbjct: 115 SNSNLVITWHEFWGNYWLKYLGRAGFFGKIIERGLFVLTDNHIAVSLKTKKD-LYKAGLR 173

Query: 200 PEIVSVIPNAVD--------PTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELC 251
             I  V+PN +D        P+ +T D         I+ V RL+  K + LL   +  + 
Sbjct: 174 KNIY-VVPNGIDFEKIQEIKPSSYTSD---------IIFVGRLIKEKNVPLLLKALTIIK 223

Query: 252 QKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGAL-EHKDVRNVLVQGHIFLNTSL 310
           Q  PD+  ++ G+GP+R  LE++  +  L D V+ LG L  ++DV  ++    +F   SL
Sbjct: 224 QDIPDVKAVVVGDGPEREYLEKLSFKLNLQDNVKFLGFLNRYEDVVALMKASKVFAFPSL 283

Query: 311 TEAFCMAIVEAASCGLQVVS 330
            E F + ++EA + GL VV+
Sbjct: 284 REGFGIVVIEANASGLPVVT 303
>ref|NP_228553.1| (NC_000853) conserved hypothetical protein [Thermotoga maritima]
 pir||C72340 probable hexosyltransferase (EC 2.4.1.-) TM0744 - Thermotoga
           maritima (strain MSB8)
 gb|AAD35825.1|AE001744_15 (AE001744) conserved hypothetical protein [Thermotoga maritima]
          Length = 406

 Score = 80.9 bits (197), Expect = 4e-14
 Identities = 87/337 (25%), Positives = 145/337 (42%), Gaps = 41/337 (12%)

Query: 34  NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
           NI M SD + P + GV + I    + L ERGHKV++V  +    +   ++   +   + P
Sbjct: 2   NIAMFSDTYAPQINGVATSIRVYKKKLTERGHKVVVVAPSAPEEEKDVFVVRSIPFPFEP 61

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRE-RVTIIHSHSSFSAMAHDALFHAKTMGLQTVFT 152
              +   ST   L          F+RE  V IIHSHS F  +   AL   + MGL  V T
Sbjct: 62  QHRISIASTKNIL---------EFMRENNVQIIHSHSPF-FIGFKALRVQEEMGLPHVHT 111

Query: 153 DHSLF---------GFADVSSVLTNKLLTVSLCD-TNHIICVSYTSKENTVLRAALNPEI 202
            H+L           F     ++ +   +   C+ TN +I  +   K          P  
Sbjct: 112 YHTLLPEYRHYIPKPFTPPKRLVEH--FSAWFCNMTNVVIAPTEDIKRELESYGVKRP-- 167

Query: 203 VSVIPNAVDPTDF---TPDPFRR----HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYP 255
           + V+P  ++   F    P+  +R         ++   R+   K +D L  +   L    P
Sbjct: 168 IEVLPTGIEVEKFEVEAPEELKRKWNPEGKKVVLYAGRIAKEKNLDFLLRVFESL--NAP 225

Query: 256 DLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFC 315
            + FI+ G+GP+R  +EE  +   L   +++ G + H ++      G +F+  S TE   
Sbjct: 226 GIAFIMVGDGPEREEVEEFAKEKGLD--LKITGFVPHDEIPLYYKLGDVFVFASKTETQG 283

Query: 316 MAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSV 352
           + ++EA + GL VV+ +  G+ +VL       CE +V
Sbjct: 284 LVLLEALASGLPVVALKWKGVKDVLKN-----CEAAV 315
>ref|NP_472029.1| (NC_003212) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria innocua]
 emb|CAC97926.1| (AL596173) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria innocua]
          Length = 427

 Score = 79.3 bits (193), Expect = 1e-13
 Identities = 79/318 (24%), Positives = 142/318 (43%), Gaps = 25/318 (7%)

Query: 34  NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
           NI + +D + P + GV + I  +   L ++GH V I T    N    R    G +V+ LP
Sbjct: 2   NIGIFTDTYSPQISGVATSIMIMENELRKQGHTVYIFTTTDPNAD--RESEEG-RVFRLP 58

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
                           +     +  R  + IIH+H+ FS +       AK   + ++ T 
Sbjct: 59  SIPFVFFPERRVAIAGMNKFIKLVGRLNLDIIHTHTEFS-LGLLGKRIAKKYNIPSIHTY 117

Query: 154 HSLF----GFADVSSVLTNKL---LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVI 206
           H+++     +     +LT  +   +T S CD+   I ++ T+K    L      +++  +
Sbjct: 118 HTMYVDYLHYIAKGKILTPSMVGKMTKSFCDSYDAI-ITPTAKVRHHLEEQGIHKLMYTV 176

Query: 207 PNAVDPTDFTPDPFRR------------HDSITIVVVSRLVYRKGIDLLSGIIPELCQKY 254
           P   D + F P   +R            +DS+ I+ + R+ + K ID +   +PE+ +  
Sbjct: 177 PTGTDISSFAPVEKQRILDLKQSLGIEENDSV-ILSLGRIAHEKNIDAIINAMPEVLETK 235

Query: 255 PDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAF 314
           P+   +I G+GP R  LE++ E  QL + V   GA++ +++      G +F++ S TE  
Sbjct: 236 PNAKLVIVGDGPVRKDLEKLVETKQLENHVIFTGAVDWENISLYYQLGDLFVSASTTETQ 295

Query: 315 CMAIVEAASCGLQVVSTR 332
            +   EA +  L VV+ R
Sbjct: 296 GLTYAEAMAASLPVVAKR 313
>ref|NP_213400.1| (NC_000918) hypothetical protein [Aquifex aeolicus]
 pir||D70351 probable hexosyltransferase (EC 2.4.1.-) aq_572 [similarity] -
           Aquifex aeolicus
 gb|AAC06809.1| (AE000696) hypothetical protein [Aquifex aeolicus]
          Length = 366

 Score = 78.2 bits (190), Expect = 3e-13
 Identities = 91/364 (25%), Positives = 160/364 (43%), Gaps = 45/364 (12%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           I + +D F  ++GG      QL+  L ++G++V+++T +    +              P 
Sbjct: 3   IALFTDSFRKDLGGGTQVARQLAFGLSKKGYEVLVITGSTAEEE-------------TPF 49

Query: 95  KVMYNQSTATTLFH----SLPLLRYIFVRERVT--IIHSHSSFSAMAHDALFHAKTMGLQ 148
           KV+   S     +H    +LP +  +   +     +IH H  F A    AL   K + + 
Sbjct: 50  KVLKLPSIKYPFYHNVEIALPNVELLKELKNFNPDVIHYHDPFLAGTM-ALLMGKILKIP 108

Query: 149 TVFTDH------SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEI 202
           TV T H      +  G    + V+  KL++      N   CV + SK    L   L+   
Sbjct: 109 TVGTIHIHPKQLTYHGIKIDNGVIAKKLVSFF---GNFTDCVVFVSKYQKKLYEELDSFC 165

Query: 203 VSVIPNAVDPTDFTPDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFII 261
           V VI N +    F  +  + R+    I+ VSRL   K  +     + E+ ++ P + + I
Sbjct: 166 VKVIYNGIPDYFFVSEKRKLRNPRNRILTVSRLDKDKNPEFALKCVAEISKEVP-VEYTI 224

Query: 262 GGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEA 321
            GEG ++  LE++  +  L  +   LG +  +++  + +   + LNTS TE F ++  EA
Sbjct: 225 VGEGNEKEKLEKLARK--LGIKANFLGFVPREELPELYLSHDVLLNTSKTETFGLSFAEA 282

Query: 322 ASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSG-------TLPAP 374
            + G+ V++ + G  PE++ +   ILCE  V    E ++KA  +L          +  AP
Sbjct: 283 MATGMPVIALKEGSAPEIVGDGG-ILCEEKV----ECVKKAFLKLYQNPELYFKLSQKAP 337

Query: 375 ENIH 378
           E  H
Sbjct: 338 ERAH 341
>gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus furiosus DSM 3638]
          Length = 383

 Score = 77.8 bits (189), Expect = 4e-13
 Identities = 67/218 (30%), Positives = 101/218 (45%), Gaps = 14/218 (6%)

Query: 193 VLRAALNPEIVSVIPNAVDPTDFTPDP---FRRHDSITI-----VVVSRLVYRKGIDLLS 244
           ++R  +  + +  IPN VD + F P      R+  +I I     + V  LV +KG + L 
Sbjct: 167 LMRVGIPEDKLYYIPNGVDTSLFYPQETALIRKELNIPIDKKILISVGNLVEKKGFEYLI 226

Query: 245 GIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHI 304
             +  +     D+   I GEGP R  LE +    +L + V L+G   H+D+   +  G +
Sbjct: 227 RAMKIILHARDDVLLYIIGEGPLRKRLENITRELKLEEHVFLVGPKPHRDIPLWINAGDL 286

Query: 305 FLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVL-PENLIILCEPSVKSLCEGLEKAI 363
           F+  SL E F +  +EA +CG  V+ST  GG  EV+  E   +LC P         EK +
Sbjct: 287 FVLPSLVENFGVVNIEALACGKPVISTINGGSEEVITSEEYGLLCPPRDPECLA--EKIL 344

Query: 364 FQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRV 401
             L        E I    + F  WRN+A +  KVY+ V
Sbjct: 345 MALNKEW--DREKIRKYAEQF-DWRNIARQIFKVYEDV 379
>ref|NP_347689.1| (NC_003030) LPS glycosyltransferase [Clostridium acetobutylicum]
 gb|AAK79029.1|AE007621_3 (AE007621) LPS glycosyltransferase [Clostridium acetobutylicum]
          Length = 466

 Score = 75.8 bits (184), Expect = 2e-12
 Identities = 60/218 (27%), Positives = 103/218 (46%), Gaps = 17/218 (7%)

Query: 201 EIVSVIPNAVDPTDFTPD----PFRRH----DSITIVVVSRLVYRKGIDLLSGIIPELCQ 252
           E V +IPN +D   F  D     FRR     D   +  + R V+ KGI +L    P +  
Sbjct: 177 EKVWIIPNGIDLNSFDFDFDWLKFRRKYACDDEKIVFFIGRHVFEKGIQILIDAAPGIVS 236

Query: 253 KYPDLNFIIGGEGPKRIILEEVRERYQ---LHDRVRLLGALEHKDVRNVLVQGHIFLNTS 309
           +Y    FII G GP   + EE++++ +   L D+    G +++K  +       + +  S
Sbjct: 237 EYNKTKFIIAGTGP---MTEELKDKVKSIGLQDKFLFTGYMDNKTKKKFYRVASVAVFPS 293

Query: 310 LTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE--NLIILCEPSVKSLCEGLEKAIFQLK 367
           L E F + ++EA + G   V +  GG  E++    N + +   SV+SL + + + I +  
Sbjct: 294 LYEPFGIVLLEAMAAGCPAVVSDTGGFGEIIQHRSNGMKMINSSVESLKDNVLE-ILKND 352

Query: 368 SGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVSVEA 405
           S       N    V+  YTW+ V++ T ++Y+ +  EA
Sbjct: 353 SLAQTVRRNAIKTVEDKYTWQRVSKLTTEMYELIKEEA 390

 Score = 34.5 bits (78), Expect = 3.7
 Identities = 15/31 (48%), Positives = 21/31 (67%), Gaps = 2/31 (6%)

Query: 43 YP--NMGGVESHIYQLSQCLIERGHKVIIVT 71
          YP  N+GG+ +H+Y LS  L   GH+V +VT
Sbjct: 10 YPPKNVGGLSNHVYNLSHALASLGHEVYVVT 40
>ref|NP_466078.1| (NC_003210) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria monocytogenes EGD-e]
 emb|CAD00633.1| (AL591983) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria monocytogenes]
          Length = 427

 Score = 75.4 bits (183), Expect = 2e-12
 Identities = 78/317 (24%), Positives = 137/317 (42%), Gaps = 23/317 (7%)

Query: 34  NICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLP 93
           NI + +D + P + GV + I  +   L ++GH V I T    N    R    G +V+ LP
Sbjct: 2   NIGIFTDTYSPQISGVATSIMIMENELRKQGHTVYIFTTTDPNAD--RESEEG-RVFRLP 58

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTD 153
                           +     +  R  + IIH+H+ FS +       AK   + ++ T 
Sbjct: 59  SIPFVFFPERRVAIAGMNKFIKLVGRLDLDIIHTHTEFS-LGLLGKRIAKKYHIPSIHTY 117

Query: 154 HSLF----GFADVSSVLTNKL---LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVI 206
           H+++     +     +LT  +   +T S CD+   I ++ T+K    L      +++  +
Sbjct: 118 HTMYVDYLHYIAKGKILTPSMVGKMTKSFCDSYDAI-ITPTAKVRHHLEEQGIHKLMYTV 176

Query: 207 PNAVDPTDFTPDPFRR-----------HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYP 255
           P   D + F P   +R            +   I+ + R+ + K ID +   +PE+ Q   
Sbjct: 177 PTGTDISSFAPVEKQRILDLKKLLGIGENDPVILSLGRIAHEKNIDAIINAMPEVLQTKT 236

Query: 256 DLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFC 315
               +I G+GP R  LE++ E  QL D V   GA++ +++      G +F++ S TE   
Sbjct: 237 TAKLVIVGDGPVRKDLEKLVEEKQLADHVIFTGAVDWENISLYYQLGDLFVSASTTETQG 296

Query: 316 MAIVEAASCGLQVVSTR 332
           +   EA +  L VV+ R
Sbjct: 297 LTYAEAMAASLPVVAKR 313
>ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis sp. PCC 6803]
 pir||S74777 hypothetical protein slr1076 - Synechocystis sp. (strain PCC 6803)
 dbj|BAA16928.1| (D90901) ORF_ID:slr1076~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 381

 Score = 75.0 bits (182), Expect = 3e-12
 Identities = 70/283 (24%), Positives = 128/283 (44%), Gaps = 24/283 (8%)

Query: 105 TLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADVSS 164
           T + +L L    F +    II  H++F+ +AH      + MG+      H +  +     
Sbjct: 75  TFYFALLLFISSFQKRPDLIICGHANFTPVAH---LVQRLMGISYWTVAHGVDAWN---- 127

Query: 165 VLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDF--TPDP--- 219
            L N  +  +L   + I+ VS+ +++  +   AL+PE V V+PN  D + F   P P   
Sbjct: 128 -LQNPHIIQALRHADRILAVSHYTRDRLLQEQALDPEKVVVLPNTFDTSRFQIAPKPQSL 186

Query: 220 FRRH----DSITIVVVSRLVYR---KGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILE 272
             ++    D   I+ ++RL      KG D +   +PE+ +  P+++++IGG+G  R  +E
Sbjct: 187 LEKYNLTPDQQVILTIARLAGEERYKGYDQIIRALPEIIKTIPNIHYLIGGKGGDRPRIE 246

Query: 273 EVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVV-ST 331
           ++ +   L D V L G +  +++ +      +F   S  E F +  +EA +CG   +   
Sbjct: 247 KLIQDLDLEDYVTLAGFIPDEELADHYNLCDVFAMPSKGEGFGIVYLEAMACGKPTIGGN 306

Query: 332 RVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAP 374
           + G I  +    L +L  P      + +   I Q+   T P P
Sbjct: 307 QDGAIDALCNGELGVLVNPDD---LDEISTVITQILEKTYPLP 346
>emb|CAB57789.1| (AJ250131) HepD protein [Nostoc sp. PCC 7120]
          Length = 391

 Score = 74.7 bits (181), Expect = 4e-12
 Identities = 84/386 (21%), Positives = 164/386 (41%), Gaps = 46/386 (11%)

Query: 39  SDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGL--KVYYLPLKV 96
           S +F  N GG+E +IY+L+  L               N+  +     GL    ++LP+K+
Sbjct: 20  SGWFPTNPGGLERYIYELTYQL-------------SANQDRVELCGVGLPDNQFHLPIKL 66

Query: 97  MYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSL 156
               S  + ++     +R  F + R+    + +   A+    +      G+   F  H  
Sbjct: 67  TNLASPDSKIWQRFWSIRNNFQKTRIGKPDAINLHFALYSFPILDILPQGIPITFNFHGP 126

Query: 157 FGFADVSSVLTNKL-----------LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSV 205
           +       ++ NK+            T + CD   ++  ++ +  +   +   +   + +
Sbjct: 127 WASESKQELVKNKISIFLKRRLIEQTTYNHCDRFIVLSKAFGNILHQQYQIPWHK--IHI 184

Query: 206 IPNAVDPTDFTPDPFRRH--------DSITIVVVS-RLVYRKGIDLLSGIIPELCQKYPD 256
           IP  V+   F P+  R+         +S  I+  S RLV+R G+D L   +  +  K PD
Sbjct: 185 IPGGVNIDKFQPNLSRQQARQQLNWPESRPILFTSRRLVHRVGVDKLLQALAIIKPKLPD 244

Query: 257 LNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLT-EAFC 315
           +   I G G  +  LE+  +   L + V+ LG L  + +       ++ +  S + E F 
Sbjct: 245 IWLAIAGRGHLQTTLEKQAQELGLENNVKFLGFLPDEQLPIAYQAANLTVMPSQSFEGFG 304

Query: 316 MAIVEAASCGLQVVSTRVGGIPEVL----PENLIILCEPSVKSLCEGLEKAIFQLKSGTL 371
           +AI E+ +CG  V+ T +GG+PE+L    P+  +I   P   ++ E + + +  L+    
Sbjct: 305 LAITESLACGTPVLCTPIGGMPEILTPFSPQ--LITASPEATAIAEKIAQIL--LEQIPK 360

Query: 372 PAPENIHNIVKTFYTWRNVAERTEKV 397
           P+ E       T + W+ +A++  +V
Sbjct: 361 PSREECRQYAVTNFDWQKIAQQVRQV 386
>ref|NP_487738.1| (NC_003272) heterocyst envelope polysaccharide synthesis protein
           [Nostoc sp. PCC 7120]
 gb|AAB08106.1| (U68035) HepB [Anabaena sp.]
 dbj|BAB75397.1| (AP003594) heterocyst envelope polysaccharide synthesis protein
           [Nostoc sp. PCC 7120]
          Length = 389

 Score = 74.7 bits (181), Expect = 4e-12
 Identities = 84/386 (21%), Positives = 164/386 (41%), Gaps = 46/386 (11%)

Query: 39  SDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGL--KVYYLPLKV 96
           S +F  N GG+E +IY+L+  L               N+  +     GL    ++LP+K+
Sbjct: 20  SGWFPTNPGGLERYIYELTYQL-------------SANQDRVELCGVGLPDNQFHLPIKL 66

Query: 97  MYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSL 156
               S  + ++     +R  F + R+    + +   A+    +      G+   F  H  
Sbjct: 67  TNLASPDSKIWQRFWSIRNNFQKTRIGKPDAINLHFALYSFPILDILPQGIPITFNFHGP 126

Query: 157 FGFADVSSVLTNKL-----------LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSV 205
           +       ++ NK+            T + CD   ++  ++ +  +   +   +   + +
Sbjct: 127 WASESKQELVKNKISIFLKRRLIEQTTYNHCDRFIVLSKAFGNILHQQYQIPWHK--IHI 184

Query: 206 IPNAVDPTDFTPDPFRRH--------DSITIVVVS-RLVYRKGIDLLSGIIPELCQKYPD 256
           IP  V+   F P+  R+         +S  I+  S RLV+R G+D L   +  +  K PD
Sbjct: 185 IPGGVNIDKFQPNLSRQQARQQLNWPESRPILFTSRRLVHRVGVDKLLQALAIIKPKLPD 244

Query: 257 LNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLT-EAFC 315
           +   I G G  +  LE+  +   L + V+ LG L  + +       ++ +  S + E F 
Sbjct: 245 IWLAIAGRGHLQTTLEKQAQELGLENNVKFLGFLPDEQLPIAYQAANLTVMPSQSFEGFG 304

Query: 316 MAIVEAASCGLQVVSTRVGGIPEVL----PENLIILCEPSVKSLCEGLEKAIFQLKSGTL 371
           +AI E+ +CG  V+ T +GG+PE+L    P+  +I   P   ++ E + + +  L+    
Sbjct: 305 LAITESLACGTPVLCTPIGGMPEILTPFSPQ--LITASPEATAIAEKIAQIL--LEQIPK 360

Query: 372 PAPENIHNIVKTFYTWRNVAERTEKV 397
           P+ E       T + W+ +A++  +V
Sbjct: 361 PSREECRQYAVTNFDWQKIAQQVRQV 386
>ref|NP_248617.1| (NC_000909) LPS biosynthesis protein, putative [Methanococcus
           jannaschii]
 pir||F64500 probable hexosyltransferase (EC 2.4.1.-) MJ1607 - Methanococcus
           jannaschii
 gb|AAB99629.1| (U67601) LPS biosynthesis protein, putative [Methanococcus
           jannaschii]
          Length = 390

 Score = 73.1 bits (177), Expect = 1e-11
 Identities = 90/385 (23%), Positives = 169/385 (43%), Gaps = 27/385 (7%)

Query: 35  ICMVSDFFYPNM-GGVESHIYQLSQCLIERGHKVIIVTHAYG--NRKGIRYLTSGLKVYY 91
           I MV+  + P + GG+  H   L++ L+  GH+V ++T  Y     + I    +G+ VY 
Sbjct: 3   IAMVTWEYPPRIVGGLAIHCKGLAEGLVRNGHEVDVITVGYDLPEYENI----NGVNVYR 58

Query: 92  L-PLKVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMG-LQT 149
           + P+   +  + A  +   +     I   ++  +IH H   +      L H   M  +Q+
Sbjct: 59  VRPISHPHFLTWAMFMAEEMEKKLGILGVDKYDVIHCHDWMTHFVGANLKHICRMPYVQS 118

Query: 150 VFTDH--SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIP 207
           + +       G     S   + +  +S  ++  +I VS + KE          + V VI 
Sbjct: 119 IHSTEIGRCGGLYSDDSKAIHAMEYLSTYESCQVITVSKSLKEEVCSIFNTPEDKVKVIY 178

Query: 208 NAVDPTDFTPD-------PFRRH-----DSITIVVVSRLVYRKGIDLLSGIIPELCQKYP 255
           N ++P +F  +        FRR      D   I+ V RL Y+KGI+ L   +P++ +++ 
Sbjct: 179 NGINPWEFDINLSWEEKINFRRSIGVQDDEKMILFVGRLTYQKGIEYLIRAMPKILERH- 237

Query: 256 DLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFC 315
           +   +I G G  R  LE++  +  +  +V  LG +    ++ +     + +  S+ E F 
Sbjct: 238 NAKLVIAGSGDMRDYLEDLCYQLGVRHKVVFLGFVNGDTLKKLYKSADVVVIPSVYEPFG 297

Query: 316 MAIVEAASCGLQVVSTRVGGIPEVLPE--NLIILCEPSVKSLCEGLEKAIFQLKSGTLPA 373
           +  +EA + G  VV + VGG+ E++    N I +   +  S+  G+++ +          
Sbjct: 298 IVALEAMAAGTPVVVSSVGGLMEIIKHEVNGIWVYPKNPDSIAWGVDRVLSDWGFREYIV 357

Query: 374 PENIHNIVKTFYTWRNVAERTEKVY 398
             N    V   Y+W N+A+ T  VY
Sbjct: 358 -NNAKKDVYEKYSWDNIAKETVNVY 381
>ref|NP_437172.1| (NC_003078) putative membrane-anchored glycosyltransferase protein
           [Sinorhizobium meliloti]
 emb|CAC49032.1| (AL603644) putative membrane-anchored glycosyltransferase protein
           [Sinorhizobium meliloti]
          Length = 416

 Score = 72.7 bits (176), Expect = 1e-11
 Identities = 66/239 (27%), Positives = 104/239 (42%), Gaps = 37/239 (15%)

Query: 200 PEIVSVIPNAVDPTDFTP-DPFRRHDSIT---IVVVSRLVYRKGIDLLSGIIPELCQKYP 255
           P  V+ + N VD   F P +     D+ T   I+ V R+   KG+  L     E+  ++P
Sbjct: 178 PGAVASVGNGVDVFHFRPSEAGASGDARTGRVILFVGRISPEKGLHTLVEAFSEVALRFP 237

Query: 256 DLNFIIGG-------------EGPKRII-----------------LEEVRERYQLHDRVR 285
           D+   I G                 R++                 L+E+ +R++L  R+R
Sbjct: 238 DVELRIAGPYSPLPVDFLTSLSSDPRVLDLKRFYDQWNRCRYQQHLDELMDRHRLRHRIR 297

Query: 286 LLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPE-VLPENL 344
            LG + HK++        I +N SL+E+F +++VE  +CG+ VV TRVGG+ E +L  + 
Sbjct: 298 FLGNVSHKELVAAYHDADIVVNPSLSESFGISVVEGMACGIPVVGTRVGGMCESILDGHT 357

Query: 345 IILCEPSVK-SLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVYDRVS 402
            +L E      L + L   +           E     V   Y+W   AER   VY+RVS
Sbjct: 358 GMLVEADAPGELSQALITVLDDPARARGMGTEGRERAV-ALYSWEARAERLRSVYERVS 415
>gb|AAL80915.1| (AE010196) hypothetical protein [Pyrococcus furiosus DSM 3638]
          Length = 389

 Score = 72.7 bits (176), Expect = 1e-11
 Identities = 62/214 (28%), Positives = 100/214 (45%), Gaps = 15/214 (7%)

Query: 195 RAALNPEIVSVIPNAVDPTDFTPDP---FRR------HDSITIVVVSRLVYRKGIDLLSG 245
           R  + P  +  IPN  D   F P P    RR      ++ I I V +     KG + L  
Sbjct: 176 RVGITPSKIRYIPNGFDGNKFYPIPQEIARRKLNLVEYEKIIINVANMYSRVKGHEYLLR 235

Query: 246 IIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIF 305
              ++ +   D   I+ G G     L+++ +   L  RV   G+  H ++   +    +F
Sbjct: 236 AFSKVAENTSDAFLILVGSGKLLSHLKKLADNLYLGHRVLFAGSKPHDEIPLWMNAADLF 295

Query: 306 LNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPE-VLPENLIILCEPSVKSLCEGLEKAIF 364
           +  SL E+F +  +EA +CG+ VV+TR GG  E ++ E+  +LCEP+     E  EK + 
Sbjct: 296 VLPSLRESFGVVQIEAMACGVPVVATRNGGSEEIIISEDYGLLCEPANPK--ELAEKILI 353

Query: 365 QLKSGTLPAPENIHNIVKTFYTWRNVAERTEKVY 398
            L+       E I    + F TW N+A++T +VY
Sbjct: 354 ALEKEW--DREKIRKYAEQF-TWENIAKKTLEVY 384
>ref|NP_228431.1| (NC_000853) lipopolysaccharide biosynthesis protein, putative
           [Thermotoga maritima]
 pir||E72354 probable hexosyltransferase (EC 2.4.1.-) TM0622 - Thermotoga
           maritima (strain MSB8)
 gb|AAD35706.1|AE001736_4 (AE001736) lipopolysaccharide biosynthesis protein, putative
           [Thermotoga maritima]
          Length = 388

 Score = 72.3 bits (175), Expect = 2e-11
 Identities = 46/138 (33%), Positives = 73/138 (52%), Gaps = 4/138 (2%)

Query: 205 VIPNAVDPTDFTPDPFRR--HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIG 262
           VI N +D   F+ D  +R   D   ++ V+RL   K   LL     +  Q  P+L   + 
Sbjct: 174 VIYNGIDVQKFSIDQPKRVDRDKTILINVARLSREKNHALLVRAFSKAVQSCPNLELWLV 233

Query: 263 GEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAA 322
           G+G  R  +EE+ ++  L ++V+  G     DV  +L Q  IF+ +S  E F + + EA 
Sbjct: 234 GDGELRRDIEELVKQLGLEEKVKFFGV--RSDVPELLSQADIFVLSSDYEGFGLVVAEAM 291

Query: 323 SCGLQVVSTRVGGIPEVL 340
           + GL V++T +GGIPE+L
Sbjct: 292 AAGLPVIATAIGGIPEIL 309
>ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB76901.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
          Length = 429

 Score = 71.9 bits (174), Expect = 2e-11
 Identities = 43/129 (33%), Positives = 73/129 (56%), Gaps = 7/129 (5%)

Query: 223 HDSIT-IVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLH 281
           HD I  I    RLV +KGI+ +   + ++ + YPD+ + I G+G  +   E++     L 
Sbjct: 223 HDGIIRIATTGRLVEKKGIEYVIKAVAQVIKNYPDIEYNIIGDGELKTHFEKLIFELNLS 282

Query: 282 DRVRLLGALEHKDVRNVLVQGHIFLNTSLT------EAFCMAIVEAASCGLQVVSTRVGG 335
             V+LLG  + K++ ++L + HIF+  S+T      +A    + EA + GL V+STR GG
Sbjct: 283 QNVKLLGWKQQKEIVDILDKCHIFVAPSVTGKDGNQDAPVNTLKEAMAMGLPVISTRHGG 342

Query: 336 IPEVLPENL 344
           IPE++ + +
Sbjct: 343 IPELVTDGV 351
>ref|NP_355849.1| (NC_003063) AGR_L_35GMp [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 ref|NP_535293.1| (NC_003305) glycosyltransferase [Agrobacterium tumefaciens str. C58
           (U. Washington)]
 gb|AAK88634.1| (AE008204) AGR_L_35GMp [Agrobacterium tumefaciens str. C58
           (Cereon)]
 gb|AAL45609.1| (AE009410) glycosyltransferase [Agrobacterium tumefaciens str. C58
           (U. Washington)]
          Length = 391

 Score = 71.5 bits (173), Expect = 3e-11
 Identities = 58/211 (27%), Positives = 95/211 (44%), Gaps = 11/211 (5%)

Query: 205 VIPNAVDPTDFTPDP----FRR-----HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYP 255
           +IPN V   +F P P    FR+     H    I+ +SRL  +KGID+L+     +C+ Y 
Sbjct: 179 IIPNGVFAEEFDPLPARGHFRQKIALAHGRRYILFLSRLHIKKGIDILASAFAAICETYV 238

Query: 256 DLNFIIGG-EGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAF 314
           D++ ++ G  G        + ++  +  RV ++GA+  K     +V    F   S  E F
Sbjct: 239 DVDLVVAGPPGGAEGHFMHLVKKLNIRHRVFMVGAIYGKAKLEAMVDADCFCLPSRQEGF 298

Query: 315 CMAIVEAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAP 374
            MAI EA +CG  VV T     PEV   +  ++       + + L  ++    +      
Sbjct: 299 SMAITEALACGTPVVITDQCHFPEVGSADAGLIVSVDAAEVAKAL-ASMLGNPARARTMG 357

Query: 375 ENIHNIVKTFYTWRNVAERTEKVYDRVSVEA 405
           EN   +V   +TW  +A  T + Y   ++EA
Sbjct: 358 ENGRRLVLEKFTWPAIAHATLEGYRLSALEA 388
>ref|NP_302182.1| (NC_002677) putative transferase [Mycobacterium leprae]
 emb|CAC30668.1| (AL583923) putative transferase [Mycobacterium leprae]
          Length = 438

 Score = 71.5 bits (173), Expect = 3e-11
 Identities = 82/382 (21%), Positives = 160/382 (41%), Gaps = 35/382 (9%)

Query: 46  MGGVESHIYQLSQCLIERGHKVIIV----------THAYGNR--KGIRYLTSGLKVYYLP 93
           +GG+  H++ LS  L   GH V+++          TH   +   +G+R + +    +   
Sbjct: 39  IGGLGRHVHHLSTALAAAGHDVVVLSRRPSGTDPCTHPTSDEISEGVRVIAAAQDPHEFT 98

Query: 94  LKVMYNQSTATTLFHSLPLLRYIFVRERVT--------IIHSHSSFSAMAHDALFHAKTM 145
                N   A TL     ++R      R +        ++H+H     +AH A+  A+  
Sbjct: 99  FS---NDMMAWTLAMGHAMIRTGLSLTRHSSDLPWRPDVVHAHDWL--VAHPAITLAQFY 153

Query: 146 GLQTVFTDHSLFGFAD---VSSVLTNKLLTVS---LCDTNHIICVSYTSKENTVLRAALN 199
            +  V T H+         VS  L+ ++  V    + +++ +I  S +     +      
Sbjct: 154 DVPMVSTIHATEAGRHSGWVSGALSRQVHAVESWLVRESDSLITCSASMCNEIIELFGPG 213

Query: 200 PEIVSVIPNAVDPTDFTPDPFRRHDS--ITIVVVSRLVYRKGIDLLSGIIPELCQKYPDL 257
              ++VI N +DP  + P   RR  +    ++ V RL Y KG+  +   +P + + YP  
Sbjct: 214 LAEITVIRNGIDPARW-PFAARRARTGPAELLYVGRLEYEKGVHDVIAALPRIRRSYPGT 272

Query: 258 NFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMA 317
              I GEG ++  L +   +Y++    R +G L H ++   L +    +  S  E F + 
Sbjct: 273 TLTIAGEGTQQDWLVDQARKYKVIKATRFVGHLNHNELLAALQRADAAVLPSHYEPFGLV 332

Query: 318 IVEAASCGLQVVSTRVGGIPE-VLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPEN 376
            +EAA+ G  +V++ +GG+ E V+     + C P   +    +   + +           
Sbjct: 333 ALEAAAAGTPLVTSNIGGLGEAVINGQTGVSCPPRDIAELAAMVCTVLEDPDAAQQRALA 392

Query: 377 IHNIVKTFYTWRNVAERTEKVY 398
               + + + W+ VA++T +VY
Sbjct: 393 ARERLTSDFDWQTVAQQTAQVY 414
>gb|AAL80431.1| (AE010155) glycosyltransferase [Pyrococcus furiosus DSM 3638]
          Length = 336

 Score = 70.4 bits (170), Expect = 6e-11
 Identities = 93/368 (25%), Positives = 172/368 (46%), Gaps = 54/368 (14%)

Query: 44  PNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYN-QST 102
           P+ GGV  H+ QL +CL E+ H+V ++T  YG             VY + +  ++  + T
Sbjct: 11  PHKGGVARHVKQLKECL-EKRHEVYVLT--YGT-----VAVEEENVYSVKVPNIFGIRGT 62

Query: 103 ATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADV 162
           +  L  S  +++ +  +    ++H+H   +      L   KT G+  V T H     +D+
Sbjct: 63  SFALLASKKIVK-LHEKYNFDLVHAHYVGTTSFAGVLAKRKT-GVPLVITAHG----SDL 116

Query: 163 SSV----LTNKLLTVSLCDTNHIICVS-YTSKENTVLRAALNPEIVSVIPNAVDPTDFTP 217
             +    L    +  S+ + +++I VS Y +K+   L A+     +SVIPN    T+ + 
Sbjct: 117 EFMSRLPLGGYFVKTSIMEADYVIAVSHYLAKKALELGASR----ISVIPNW---TELSG 169

Query: 218 DPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRER 277
           +  R++    I+ + R+   KGI+       EL +++P   F++ GEGP   +L+++R +
Sbjct: 170 ESERKY----ILFLGRVASYKGIEDFI----ELAKRFPGEEFVVAGEGP---LLKKLRAK 218

Query: 278 YQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIP 337
                 V+ LG +  +D   VL +  + +  S  E F + ++EA S  + V+   VGGI 
Sbjct: 219 SP--PNVKFLGYVPAED---VLKKAKVLVLPSKREGFGLVVIEANSFKVPVLGRNVGGIR 273

Query: 338 EVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPE----NIHNIVKTFYTWRNVAER 393
           E++  +           L E +E AI  LK+  +P       +I   +   ++   + ER
Sbjct: 274 ELIRFS-------KNGYLFEDIEDAITYLKTLLVPKTNVKLGSIGKRISKGHSQEKMCER 326

Query: 394 TEKVYDRV 401
            E++Y  V
Sbjct: 327 VEEIYREV 334
>ref|NP_275281.1| (NC_000916) GlcNAc-phosphatidylinositol related biosynthetic
           protein [Methanothermobacter thermautotrophicus]
 pir||E69050 GlcNAc-phosphatidylinositol related biosynthetic protein -
           Methanobacterium thermoautotrophicum (strain Delta H)
 gb|AAB84644.1| (AE000802) GlcNAc-phosphatidylinositol related biosynthetic protein
           [Methanothermobacter thermautotrophicus]
          Length = 384

 Score = 70.0 bits (169), Expect = 1e-10
 Identities = 49/149 (32%), Positives = 72/149 (47%), Gaps = 4/149 (2%)

Query: 203 VSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIG 262
           VSV+ N V   DF   P R+  + +I  VSRLV  K I  L   +  + +K+PD+   I 
Sbjct: 184 VSVVHNMV---DFRTPPVRKTSTPSIACVSRLVEYKRIQDLIRAVSVIREKFPDIRCRII 240

Query: 263 GEGPKRIILEEVRERYQLHDRVRLLGALE-HKDVRNVLVQGHIFLNTSLTEAFCMAIVEA 321
           G GP    L  +     + D V  +G +E H DV  V+ +  +F   S+ E F + +VEA
Sbjct: 241 GTGPLEERLRGLARELAVEDNVEFMGFVEKHADVLEVIAESWVFCLPSVVEGFGIVVVEA 300

Query: 322 ASCGLQVVSTRVGGIPEVLPENLIILCEP 350
             CG   V+ R+  + E   E   +  EP
Sbjct: 301 MGCGTPFVAARIPPVMESSQEKGGLFFEP 329
>ref|NP_070556.1| (NC_000917) galactosyltransferase [Archaeoglobus fulgidus]
 pir||G69465 probable hexosyltransferase (EC 2.4.1.-) AF1728 - Archaeoglobus
           fulgidus
 gb|AAB89517.1| (AE000983) galactosyltransferase [Archaeoglobus fulgidus]
          Length = 356

 Score = 69.6 bits (168), Expect = 1e-10
 Identities = 87/380 (22%), Positives = 159/380 (40%), Gaps = 44/380 (11%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94
           + ++S +F P++GGVE H+ +++  L  RG +V++VT     R+              P 
Sbjct: 3   VVLLSSYFPPHIGGVEVHVERIAHHLHRRGFEVVVVTSTASGREK------------FPF 50

Query: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHS-------SFSAMAHDALFHAKTMGL 147
           +V Y  S         P L     +    I HSH+       S     H   +H      
Sbjct: 51  RVEYVPSIPIPYSPITPFLGRFLEKIDGDIFHSHTPPPFFSCSLRKSPHVITYHCDI--- 107

Query: 148 QTVFTDHSLFGFADVSSVL----TNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIV 203
             +   +  F      S L    T+ +L+ +L   + I+  + +  E + L A  +    
Sbjct: 108 -EIPEKYGRFPIPRALSKLIIRRTDDMLSEALDRADAIVATTKSYAETSRLLAGRD---Y 163

Query: 204 SVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNF--II 261
            VIPN ++ ++F  +        T++ + RL   KG+D+L   +     K+ D+    +I
Sbjct: 164 HVIPNGIELSEF--EGVEAEKEPTVLFLGRLAATKGVDVLLKAM-----KHVDVEARCVI 216

Query: 262 GGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLT--EAFCMAIV 319
            G+G +R  LE  R   +L       G L  K V   L +  + +  SL+  EAF + ++
Sbjct: 217 IGDGEERSSLE--RLARELEVNAEFTGFLPRKKVIEYLSRASLLVLPSLSRLEAFGIVLL 274

Query: 320 EAASCGLQVVSTRVGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHN 379
           EA +CG  V ++ + G+ +V  E   +        L E + + +   +       E+   
Sbjct: 275 EAMACGTPVAASDLPGVRDVASEAGFVFPPGDYMRLSEIINE-VLSDERKVKAIGESGRR 333

Query: 380 IVKTFYTWRNVAERTEKVYD 399
           IV+  Y+W  V +   ++Y+
Sbjct: 334 IVREKYSWDVVVKSLIRLYE 353
>ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB76900.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
          Length = 430

 Score = 69.2 bits (167), Expect = 1e-10
 Identities = 44/154 (28%), Positives = 81/154 (52%), Gaps = 8/154 (5%)

Query: 199 NPEIVSVIPNAVDPTDFTPDP--FRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPD 256
           NP+ + +  + +D   FT  P  F     + +    RLV +KGI+     + ++ + YP+
Sbjct: 199 NPDKLIIHGSGLDCNKFTFKPRYFPADGKVQVATTGRLVEKKGIEYAIRAVAKVAELYPN 258

Query: 257 LNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLT----- 311
           + + + G+G  +  LE++     +   V+LLG  + K++  +L   HIF+  S+T     
Sbjct: 259 IEYQVIGDGDLKEDLEQLITELNIGHIVKLLGWKQQKEIVEILENTHIFIAPSVTAADGN 318

Query: 312 -EAFCMAIVEAASCGLQVVSTRVGGIPEVLPENL 344
            +A    + EA + GL V+STR GGIPE++ + +
Sbjct: 319 QDAPVNTLKEAMAMGLPVISTRHGGIPELVTDGV 352
>dbj|BAB18471.1| (AB033991) BtrM [Bacillus circulans]
          Length = 389

 Score = 68.4 bits (165), Expect = 3e-10
 Identities = 37/113 (32%), Positives = 61/113 (53%)

Query: 227 TIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRL 286
           T++ + R+ + KG      +  EL  K  DL FI+ G+GP+R  +EE  +   L ++ R+
Sbjct: 207 TVLFLGRIAHEKGWSTFVSVAKELADKIGDLQFIVCGDGPQREAMEEQIKAANLQNQFRI 266

Query: 287 LGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEV 339
            G + HK V   L    +FL  S  E F  +++EAA  G+ ++ST  GG  ++
Sbjct: 267 TGFISHKFVSCYLHHAQLFLLPSHHEEFGGSLIEAAIAGVPIISTNNGGPADI 319
>gb|AAL23756.1| (U52844) putative glycosyltransferase [Serratia marcescens]
          Length = 388

 Score = 68.0 bits (164), Expect = 3e-10
 Identities = 50/164 (30%), Positives = 76/164 (45%), Gaps = 2/164 (1%)

Query: 181 IICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGI 240
           +I  S+ SK    LR  L      VI     P      P      I I   SRLV  KGI
Sbjct: 134 VISASHASKRVMELRFNLPCPNHVVINRIKTPAGIDNTPKTLSQPIRIGTASRLVSLKGI 193

Query: 241 DLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLV 300
            +   ++ EL ++  D+   + G+GP R   E +  R QL DRV   G     DV     
Sbjct: 194 SVSLLMMQELLRRGHDVTLEVAGKGPDRAAFEALAARLQLGDRVTFSGY--QDDVAGFFN 251

Query: 301 QGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENL 344
           + HI+++T +TE F ++ +E+   G+ V+  +V G PE + + +
Sbjct: 252 RTHIYMSTPITEPFGLSCMESLYFGVPVIFPQVDGQPEAVKDGV 295
>ref|NP_147773.1| (NC_000854) capM protein [Aeropyrum pernix]
 pir||C72590 probable hexosyltransferase (EC 2.4.1.-) APE1191 [similarity] -
           Aeropyrum pernix (strain K1)
 dbj|BAA80177.1| (AP000061) 363aa long hypothetical capM protein [Aeropyrum pernix]
          Length = 363

 Score = 68.0 bits (164), Expect = 4e-10
 Identities = 59/214 (27%), Positives = 104/214 (48%), Gaps = 11/214 (5%)

Query: 182 ICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPDPFRRHDSITIVVVSRLVYRKGID 241
           I VS ++K+    R  ++P+ ++V+PN VD   + P    +    TI+   R+   K +D
Sbjct: 144 IAVSQSTKKELAKRLGIDPDRIAVVPNGVDLEKYRPG--SKDPRPTILWAGRIKMYKNLD 201

Query: 242 LLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQ 301
            L      + Q+ PD   II G G +   + E+ ++ +  D V  LG +  ++    + +
Sbjct: 202 HLLKAYRIVKQEIPDAQLIIIGTGDQEQKMRELAKKLEPRD-VHFLGKMSEQEKIMWMQR 260

Query: 302 GHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE-NLIILCEPSVKSLCEGLE 360
             I ++TS+ E + + I EAA+C +  ++  V G+ + +      IL EP      E L 
Sbjct: 261 AWIIVSTSMIEGWGITITEAAACKIPAIAYNVPGLRDSVKHMETGILVEPGN---IEQLA 317

Query: 361 KAI-FQLKSGTL--PAPENIHNIVKTFYTWRNVA 391
           KAI + L   +L     EN +N  ++F +W N A
Sbjct: 318 KAIAWLLTDNSLRNKLSENAYNYAQSF-SWDNTA 350
>ref|NP_350177.1| (NC_003030) Glycosyltransferase [Clostridium acetobutylicum]
 gb|AAK81517.1|AE007856_1 (AE007856) Glycosyltransferase [Clostridium acetobutylicum]
          Length = 398

 Score = 67.6 bits (163), Expect = 4e-10
 Identities = 77/322 (23%), Positives = 130/322 (39%), Gaps = 42/322 (13%)

Query: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRK----GIRYLTSGLKVY 90
           I + +D +YP + GV      L + L   GH V I+T +Y  R+     I YL S     
Sbjct: 3   ILITTDAYYPMINGVVVSTNNLYKQLKMAGHDVRILTLSYNGREYIEGDIYYLNSHFVKV 62

Query: 91  YLPLKVM--YNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQ 148
           Y   ++M  +     + +    P            IIHS + FS M   A +  + + + 
Sbjct: 63  YPDARIMKPFGNKVISKIVEWSP-----------EIIHSQTEFSTML-VAKYIKRKLDIP 110

Query: 149 TVFTDHSLF--------GFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNP 200
            V T H+++        G   +      KLL + L   + II  + T K   VLR     
Sbjct: 111 QVHTYHTMYEDYLKYFLGGKVIRKGTMAKLLKILLNTFDEII--APTEKVKNVLREYEVY 168

Query: 201 EIVSVIPNAVDPTDFTPD-------------PFRRHDSITIVVVSRLVYRKGIDLLSGII 247
           + + ++P  +D   F  +              ++  D I +V V R+   K ID +  + 
Sbjct: 169 KDIKIVPTGIDIKSFQKELSSKEREKILNHYGWKTKDKI-LVYVGRVAEEKNIDEIINLF 227

Query: 248 PELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLN 307
            +   +  D+  +I G GP    L+E+  RY + D V+  G ++   V      G  F+ 
Sbjct: 228 KKGLNELKDIKLLIVGGGPYLSQLKELVSRYGIEDIVKFTGMVDSDQVYKYYKMGIAFVT 287

Query: 308 TSLTEAFCMAIVEAASCGLQVV 329
            S +E   +  +EA + G  V+
Sbjct: 288 ASQSETQGLTYIEALASGCPVI 309
>ref|NP_142415.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||C71154 hypothetical protein PH0434 - Pyrococcus horikoshii
 dbj|BAA29520.1| (AP000002) 336aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 336

 Score = 66.9 bits (161), Expect = 8e-10
 Identities = 100/367 (27%), Positives = 156/367 (42%), Gaps = 52/367 (14%)

Query: 44  PNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYN-QST 102
           P+ GGV  H+  L   L  R H+V ++T  YG  KG      G  V Y+ +  ++  + T
Sbjct: 11  PHRGGVARHVKDLVDYL-SREHEVHVIT--YGTVKG-----KGENVSYVKVPNIFGLRGT 62

Query: 103 ATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFADV 162
           + TL  S  L   +  +    +IH+H   +      L   +T GL  V T H     +D+
Sbjct: 63  SFTLLAS-KLGVKLHKKLNFDLIHAHYVGTTSYAGVLIKERT-GLPLVVTAHG----SDL 116

Query: 163 SSVLTNKL------LTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFT 216
               T+KL      +  SL   N +I VS+      V    L  E V VIPN V  T  +
Sbjct: 117 D--FTSKLPLGSYYVKKSLIKANAVIAVSHYLG---VKAKMLGAENVKVIPNWVTKTGKS 171

Query: 217 PDPFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGP--KRIILEEV 274
              +       I  + RL   KG++       EL + +P   F++ GEGP  K+++ E  
Sbjct: 172 RGEY-------IAFIGRLTEYKGVEDFI----ELAKLFPQEKFVVAGEGPLLKKLMKESP 220

Query: 275 RERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVG 334
           +        V+ LG   +K   +VL +  + +  S  E F + I+EA S  +  +  RVG
Sbjct: 221 KN-------VKFLG---YKPSEDVLSKAKVLILPSKREGFGLVILEANSFKVPSLGRRVG 270

Query: 335 GIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAERT 394
           GI E++ +        ++    E L K +   K G       I   +  FY+     +R 
Sbjct: 271 GIREIIRDGKNGYTFSALDEAYEYL-KELLNPKKGRKAGA--ISYRISRFYSMEESCKRI 327

Query: 395 EKVYDRV 401
            KVY+ V
Sbjct: 328 LKVYEEV 334
>ref|NP_214125.1| (NC_000918) capsular polysaccharide biosynthsis protein [Aquifex
           aeolicus]
 pir||F70441 capsular polysaccharide biosynthsis protein - Aquifex aeolicus
 gb|AAC07522.1| (AE000749) capsular polysaccharide biosynthsis protein [Aquifex
           aeolicus]
          Length = 316

 Score = 66.5 bits (160), Expect = 1e-09
 Identities = 71/306 (23%), Positives = 136/306 (44%), Gaps = 31/306 (10%)

Query: 96  VMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHS 155
           + Y +  +  +    P + Y F R   TI+   + F            T+ L +V    +
Sbjct: 18  IFYAKRLSEVIKSEKPDIVYAFFRSMSTILGLSTFFGK-------ETGTIYLGSVHNTDN 70

Query: 156 LFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDF 215
              +  +  +    ++ V L   + I+CVS T K +      +  + + V+ N +D    
Sbjct: 71  YIKYGSLKHIPYRVMIKVLLEKLDGIVCVSNTVKRDLKQTFWIKDDKLKVVYNLID---- 126

Query: 216 TPDPFRRH--DSIT-----IVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKR 268
             D  R+   +SI      I+ V RL  +KG   +      + +K+ DL+ +I GEG K+
Sbjct: 127 -IDKIRKQADESINVDFDYIIAVGRLEDQKGYPYMLRAFKLISEKFKDLHLLIIGEGSKK 185

Query: 269 IILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQV 328
             +E++ E   L ++V LLG     +    + +   +L TS+ E F + +VEA + G+ V
Sbjct: 186 NQVEKLIEELGLKNKVHLLGY--QLNPYKYIKRAKAYLMTSIYEGFGLVLVEAMALGIPV 243

Query: 329 VSTRVGGIPEVLPENLIILCEP--SVKSLCEGLEKAI-------FQLKSGTLPAPE-NIH 378
           ++  +  + EVL +    +  P   + +  +GLEK +       + +K+G + A + +I 
Sbjct: 244 IAFDIPAVREVLNDGKAGVLVPFGDINAFAKGLEKLLTDRNLREYYIKNGLIRAKDFDIS 303

Query: 379 NIVKTF 384
            + K F
Sbjct: 304 KLDKIF 309
>ref|NP_295278.1| (NC_001263) conserved hypothetical protein [Deinococcus
           radiodurans]
 pir||E75381 conserved hypothetical protein - Deinococcus radiodurans  (strain
           R1)
 gb|AAF11118.1|AE001999_2 (AE001999) conserved hypothetical protein [Deinococcus radiodurans]
          Length = 411

 Score = 63.0 bits (151), Expect = 1e-08
 Identities = 91/358 (25%), Positives = 139/358 (38%), Gaps = 58/358 (16%)

Query: 23  GSLYTCRTRTHNICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRY 82
           GS      R H       FF P    V S +          G K+ ++ H      G+  
Sbjct: 2   GSAAVGGPRPHLYFPAMSFFSPPSASVASTL----------GPKIAVLCHTGAGGSGVVA 51

Query: 83  LTSGLKV-----------YYLPLKVMYNQSTATTLFHSLPLLRY---------------- 115
              GLKV             +P ++  +Q      FH +    Y                
Sbjct: 52  TELGLKVADAGHEVHFVGTAMPFRLTGHQGLRGPYFHQVGGFAYALFEQPFPELSAANTL 111

Query: 116 --IFVRERVTIIHSHSSF---SAMAHDALFHAKTMGLQTVF-TDHSLFGFADVSSVLTNK 169
             + +   V + H+H +    SA  H      KT  L T+  TD +L G        T  
Sbjct: 112 SEVILEHGVDLTHAHYAIPHASAALHARSITGKTRVLTTLHGTDVTLVGTEPAFQHTTRH 171

Query: 170 LLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDF--TPDP-----FRR 222
            +  S    +H+  VS++    T     ++ +I  VI N VD   F   PDP     F  
Sbjct: 172 AIERS----DHVTAVSHSLAAETREVFGVDRDI-EVIHNFVDSDRFRRIPDPGVRARFAH 226

Query: 223 HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHD 282
            +   IV VS     K ++ +  +   +  + P    +I G+GP+R    E+     +  
Sbjct: 227 PEEALIVHVSNFRPIKRVEDVVQVFARIASEIPARLLMI-GDGPERARAFELARELGVIG 285

Query: 283 RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVL 340
           R + LG+    DV+ VL    +FL TS  E+F +A +EA SC + VV++  GGIPEV+
Sbjct: 286 RTQFLGSF--PDVQTVLGISDLFLLTSSHESFGLAALEAMSCEVPVVASNAGGIPEVV 341
>ref|NP_279218.1| (NC_002607) LPS glycosyltransferase; Lpg [Halobacterium sp. NRC-1]
 gb|AAG18698.1| (AE004975) LPS glycosyltransferase; Lpg [Halobacterium sp. NRC-1]
          Length = 333

 Score = 59.1 bits (141), Expect = 2e-07
 Identities = 52/151 (34%), Positives = 75/151 (49%), Gaps = 11/151 (7%)

Query: 203 VSVIPNA-VDPTDFTPDPFR-RHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFI 260
           +S +P A +D  ++ P      H++IT+  V RL   KG D L     ++     DL F 
Sbjct: 137 ISTLPIAGIDVKEYQPSKTHPSHENITVSTVGRLANVKGYDDLIRCARDIGD---DLQFQ 193

Query: 261 IGGEGPKRIILEEVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVE 320
           I GEG +R  LE      +  D V   G + ++ +   L    I+   S  E  CMA++E
Sbjct: 194 IAGEGEERERLES-----KTPDNVNFQGMVPNEQIPQFLNNSDIYFQPSKYEGLCMAVIE 248

Query: 321 AASCGLQVVSTRVGGIPE-VLPENLIILCEP 350
           A +CGL VV++ VGGI E V+P     LC P
Sbjct: 249 AMACGLPVVASDVGGITESVVPGETGFLCRP 279
CPU time:    75.83 user secs.	    1.54 sys. secs	   77.37 total secs.

  Database: nr
    Posted date:  Apr 21, 2002  2:19 PM
  Number of letters in database: 277,845,442
  Number of sequences in database:  887,402
  
Lambda     K      H
   0.324    0.139    0.417 

Gapped
Lambda     K      H
   0.270   0.0470    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 266818266
Number of Sequences: 887402
Number of extensions: 10897250
Number of successful extensions: 24835
Number of sequences better than 10.0: 711
Number of HSP's better than 10.0 without gapping: 272
Number of HSP's successfully gapped in prelim test: 439
Number of HSP's that attempted gapping in prelim test: 24203
Number of HSP's gapped (non-prelim): 850
length of query: 484
length of database: 277,845,442
effective HSP length: 55
effective length of query: 429
effective length of database: 229,038,332
effective search space: 98257444428
effective search space used: 98257444428
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (22.0 bits)
S2: 74 (33.2 bits)
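
The footer values above are related by simple arithmetic, which the following minimal sketch re-derives. It assumes the standard Karlin-Altschul relations: effective lengths are the raw lengths minus the effective HSP length (once for the query, once per database sequence), bit score S' = (Lambda*S - ln K) / ln 2, and E = search space * 2^(-S'). The function and constant names are illustrative and are not part of BLAST. The per-hit bit scores are taken as given, because the rounded gapped Lambda and K printed here may not reproduce them exactly.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 484
DB_LETTERS = 277_845_442
DB_SEQS = 887_402
HSP_LEN = 55                # "effective HSP length"
UNGAPPED = (0.324, 0.139)   # (Lambda, K) for ungapped alignments
GAPPED = (0.270, 0.0470)    # (Lambda, K) for gapped alignments


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_q, eff_db, space)                 # footer: 429, 229,038,332, 98257444428
print(round(bit_score(41, *UNGAPPED), 1))   # S1: 41 (22.0 bits)
print(round(bit_score(74, *GAPPED), 1))     # S2: 74 (33.2 bits)
print(f"{expect(72.7, space):.0e}")         # AAL80915.1: Expect = 1e-11
print(f"{expect(59.1, space):.0e}")         # NP_279218.1: Expect = 2e-07
```

Under these assumptions the computed search space, the S1 and S2 cutoffs, and the two sample E-values agree with the figures printed in the report.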