IMP GPI Biosynthesis Report IMP-Bioinformatics

GPI Anchor Biosynthesis Report: AAF57802 (PIG-A family, Drosophila melanogaster)





BLASTP 2.1.1 [Aug-8-2000]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= (479 letters)

Database: nr
           887,402 sequences; 277,845,442 total letters

Searching..................................................



Distribution of 65 Blast Hits on the Query Sequence


Sequences with E-value BETTER than threshold
                                                                  Score   E
Sequences producing significant alignments:                       (bits) Value

gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila mel...   920   0.0
ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, cla...   417   e-115
pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse >gi...   409   e-113
ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidy...   388   e-106
pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like pr...   377   e-103
ref|NP_495840.1| (NM_063439) phosphatidylinositol biosyntheti...   370   e-101
pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fissi...   346   5e-94
gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia]           326   4e-88
ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidy...   298   2e-79
pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Sa...   296   7e-79
prf||1804343A SPT14 gene [Saccharomyces cerevisiae]                295   1e-78
ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol ...   234   2e-60
emb|CAB57276.1| (X77725) PIG-A [Homo sapiens]                      164   3e-39
pir||I52665 class A GlcNAc-inositol phospholipid assembly pro...   115   1e-24
ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, cla...   112   2e-23
ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIO...    84   5e-15
ref|NP_217548.1| (NC_000962) hypothetical protein Rv3032 [Myc...    81   5e-14
ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. ...    79   2e-13
gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus fu...    78   4e-13
gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus fu...    77   5e-13
ref|NP_350177.1| (NC_003030) Glycosyltransferase [Clostridium...    77   6e-13
ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidy...    76   1e-12
ref|NP_228431.1| (NC_000853) lipopolysaccharide biosynthesis ...    74   5e-12
ref|NP_302182.1| (NC_002677) putative transferase [Mycobacter...    72   2e-11
ref|NP_213400.1| (NC_000918) hypothetical protein [Aquifex ae...    72   2e-11
ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. ...    72   2e-11
ref|NP_486077.1| (NC_003272) probable galactosyltransferase [...    70   7e-11
ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus...    70   1e-10
ref|NP_472029.1| (NC_003212) weakly similar to human N-acetyl...    68   2e-10
gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus fu...    68   3e-10
ref|NP_220795.1| (NC_000963) CAPM PROTEIN (capM2) [Rickettsia...    66   1e-09
pir||T34839 probable hexosyltransferase (EC 2.4.1.-) SC2G5.06...    66   1e-09
gb|AAL25631.1| (AY057452) putative glycosyltransferase [Edwar...    65   3e-09
ref|NP_228553.1| (NC_000853) conserved hypothetical protein [...    65   3e-09
dbj|BAB69052.1| (AB050640) putative glycosyltransferase [Spha...    65   4e-09
ref|NP_295278.1| (NC_001263) conserved hypothetical protein [...    64   4e-09
ref|NP_489278.1| (NC_003272) glycosyltransferase [Nostoc sp. ...    64   5e-09
emb|CAB43611.1| (AJ239004) galactosyl transferase [Streptococ...    64   5e-09
gb|AAK20702.1|AF316641_8 (AF316641) WciS [Streptococcus pneum...    64   5e-09
ref|NP_378386.1| (NC_003106) 352aa long conserved hypothetica...    64   6e-09
ref|NP_349148.1| (NC_003030) Glycosyltransferase [Clostridium...    63   8e-09
ref|NP_466078.1| (NC_003210) weakly similar to human N-acetyl...    62   3e-08
ref|NP_484962.1| (NC_003272) probable glycosyltransferase [No...    61   4e-08
ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis s...    60   8e-08
ref|NP_390127.1| (NC_000964) alternate gene name: jojH~simila...    59   1e-07
ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related pr...    57   8e-07
ref|NP_127136.1| (NC_000868) LPS BIOSYNTHESIS RFBU RELATED PR...    56   2e-06
ref|NP_279218.1| (NC_002607) LPS glycosyltransferase; Lpg [Ha...    54   6e-06
gb|AAL67552.1|AF461121_3 (AF461121) putative galactosyltransf...    47   5e-04

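Each row of the hit summary carries a sequence identifier, a truncated description, a bit score and an E-value. The following is a minimal Python sketch for reading one such row. The names `Hit`, `parse_evalue` and `parse_hit` are hypothetical helpers introduced here, not part of BLAST, and the sketch assumes the bit score and E-value are the last two whitespace-separated fields of the row. It also assumes that the mantissa-free shorthand seen above (for example `e-115`) stands for `1e-115`.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


# Identifier, free-text description, bit score, E-value (last two tokens).
ROW = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+"
    r"(\d+(?:\.\d+)?(?:e[-+]?\d+)?|e[-+]?\d+)\s*$"
)


def parse_evalue(text: str) -> float:
    # Assumption: "e-115" is shorthand for 1e-115.
    return float("1" + text if text.startswith("e") else text)


def parse_hit(line: str) -> Hit:
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit row: {line!r}")
    seq_id, description, bits, evalue = match.groups()
    return Hit(seq_id, description, float(bits), parse_evalue(evalue))


if __name__ == "__main__":
    row = "pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fissi... 346 5e-94"
    print(parse_hit(row))
```

Because the description is matched lazily and the two numeric fields are anchored to the end of the line, digits inside the description (accession numbers, EC numbers) are not mistaken for the score.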
Alignments
>gb|AAF57802.1| (AE003802) CG6401 gene product [Drosophila melanogaster]
          Length = 479

 Score =  920 bits (2352), Expect = 0.0
 Identities = 479/479 (100%), Positives = 479/479 (100%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL
Sbjct: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60

Query: 61  PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120
           PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT
Sbjct: 61  PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120

Query: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180
           DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT
Sbjct: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180

Query: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLL 240
           ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLL
Sbjct: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLL 240

Query: 241 EEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVST 300
           EEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVST
Sbjct: 241 EEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVST 300

Query: 301 SVGGIPEVLPKSLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGNGHLASDANGKVK 360
           SVGGIPEVLPKSLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGNGHLASDANGKVK
Sbjct: 301 SVGGIPEVLPKSLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGNGHLASDANGKVK 360

Query: 361 RRRRRKVDTPISPTQLPAVSAPPADQNSSTEPVMCPYRCNELVETLYNWEDVALRTVKVY 420
           RRRRRKVDTPISPTQLPAVSAPPADQNSSTEPVMCPYRCNELVETLYNWEDVALRTVKVY
Sbjct: 361 RRRRRKVDTPISPTQLPAVSAPPADQNSSTEPVMCPYRCNELVETLYNWEDVALRTVKVY 420

Query: 421 DRVLNERSFTTSELVFAVWQHGSWFLVFFVVAHFLMRLLELWRPRKHVEIAQDVQRRAS 479
           DRVLNERSFTTSELVFAVWQHGSWFLVFFVVAHFLMRLLELWRPRKHVEIAQDVQRRAS
Sbjct: 421 DRVLNERSFTTSELVFAVWQHGSWFLVFFVVAHFLMRLLELWRPRKHVEIAQDVQRRAS 479
>ref|NP_002632.1| (NM_002641) phosphatidylinositol glycan, class A isoform 1;
           Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
 sp|P37287|PIGA_HUMAN N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein
           (GlcNac-PI synthesis protein)
           (Phosphatidylinositol-glycan biosynthesis, class A
           protein) (PIG-A)
 pir||A46217 GPI-anchor biosynthesis protein PIG-A - human
 dbj|BAA02019.1| (D11466) PIG-A protein [Homo sapiens]
 dbj|BAA05966.1| (D28791) PIG-A protein [Homo sapiens]
          Length = 484

 Score =  417 bits (1061), Expect = e-115
 Identities = 207/347 (59%), Positives = 266/347 (76%), Gaps = 3/347 (0%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++++THAYG+  GIRY+T  LKVYYLP+
Sbjct: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94

Query: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122
           KV YNQ    T   ++P+LR + +RERV ++H HS+FSA+AH+AL     +GL+TVFTDH
Sbjct: 95  KVMYNQSTATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154

Query: 123 SLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTAL 182
           SLFGFAD+S+ LTN LL V+L   NH ICVS+  KENTVLRA +    VSVIPNAVD   
Sbjct: 155 SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTD 214

Query: 183 FTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDLLE 241
           FTPDP +R  +D I IVV SRLVYRKGIDLL+GIIP   +  P++NFII G+GPKR +LE
Sbjct: 215 FTPDPFRR--HDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILE 272

Query: 242 EIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTS 301
           E+RE+  + +RV+++GA+EH  VR+ LV+GHIFLNTSLTEA+CMAIVEAASCGLQVVST 
Sbjct: 273 EVRERYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTR 332

Query: 302 VGGIPEVLPKSLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGN 348
           VGGIPEVLP++LI+L EP + ++   +  AI + +  +     ++ N
Sbjct: 333 VGGIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHN 379
 Score = 35.7 bits (81), Expect = 1.6
 Identities = 24/83 (28%), Positives = 37/83 (43%), Gaps = 5/83 (6%)

Query: 396 PYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQH----GSWFLVFFVV 451
           P   + +V+T Y W +VA RT KVYDRV  E      + +  +  H      +      V
Sbjct: 374 PENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAV 433

Query: 452 AHFLMRLLELW-RPRKHVEIAQD 473
            +FL  +   W  P   +++A D
Sbjct: 434 FNFLFLIFLRWMTPDSIIDVAID 456
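The two segments reported for this hit (417 bits with Expect = e-115, and 35.7 bits with Expect = 1.6) illustrate how the E-value follows from the bit score and the size of the search space. Under the standard Karlin-Altschul relation, E = m * n * 2^(-S'), where S' is the bit score and m and n are the query and database lengths. The Python sketch below uses the raw lengths from the report header as an assumption; BLAST itself uses shorter "effective" lengths, so the values computed here should be somewhat larger than the reported ones. The function name `expect_from_bits` is a hypothetical helper.

```python
def expect_from_bits(bits: float, query_len: int, db_letters: int) -> float:
    """Approximate E-value from a bit score: E = m * n * 2**(-bits).

    Uses raw lengths (an assumption); BLAST applies an edge-effect
    correction, so reported E-values are typically somewhat smaller.
    """
    return query_len * db_letters * 2.0 ** (-bits)


QUERY_LEN = 479            # "Query= (479 letters)"
DB_LETTERS = 277_845_442   # "277,845,442 total letters"

if __name__ == "__main__":
    for bits in (417, 35.7):
        e = expect_from_bits(bits, QUERY_LEN, DB_LETTERS)
        print(f"{bits} bits -> E ~ {e:.2g}")
```

With these inputs the estimates land within an order of magnitude of the reported values, which is the most this uncorrected approximation can be expected to deliver.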
>pir||A55731 GPI-anchor biosynthesis protein PIG-A - mouse
 pir||I52484 gene PIG-A protein - mouse
 dbj|BAA05047.1| (D26047) Pig-a precursor [Mus musculus]
 dbj|BAA06663.1| (D31863) PIG-A protein [Mus musculus]
          Length = 485

 Score =  409 bits (1040), Expect = e-113
 Identities = 200/347 (57%), Positives = 262/347 (74%), Gaps = 2/347 (0%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++ +THAYG+  G+RY+T  LKVYYLP+
Sbjct: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVITVTHAYGNRKGVRYLTNGLKVYYLPL 94

Query: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122
           +V YNQ    T   ++P+LR + +RER+ ++H HS+FSA+AH+AL     +GL+TVFTDH
Sbjct: 95  RVMYNQSTATTLFHSLPLLRYIFVRERITIIHSHSSFSAMAHDALFHAKTMGLQTVFTDH 154

Query: 123 SLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTAL 182
           SLFGFAD+S+ LTN LL V+L   NH ICVS+  KENTVLRA +    VSVIPNAVD   
Sbjct: 155 SLFGFADVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTD 214

Query: 183 FTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDLLE 241
           FTPDP +R  + +I +VV SRLVYRKG DLL+GIIP   +    ++F+I G+GPKR +LE
Sbjct: 215 FTPDPFRR-HDSVITVVVVSRLVYRKGTDLLSGIIPELCQKYQELHFLIGGEGPKRIILE 273

Query: 242 EIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTS 301
           E+RE+  + +RVQ++GA+EH  VR+ LV+GHIFLNTSLTEA+CMAIVEAASCGLQVVST 
Sbjct: 274 EVRERYQLHDRVQLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTK 333

Query: 302 VGGIPEVLPKSLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGN 348
           VGGIPEVLP+SLI+L EP + ++   +  AI + +  +     ++ N
Sbjct: 334 VGGIPEVLPESLIILCEPSVKSLCDGLEKAIFQVKSGTLPAPENIHN 380
 Score = 39.2 bits (90), Expect = 0.16
 Identities = 23/83 (27%), Positives = 40/83 (47%), Gaps = 5/83 (6%)

Query: 396 PYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQH-----GSWFLVFFV 450
           P   + +V+T Y W +VA RT KVY+RV  E      + +  +  H     G  F +  V
Sbjct: 375 PENIHNVVKTFYTWRNVAERTEKVYERVSKETVLPMHKRLDRLISHCGPVTGYMFALLAV 434

Query: 451 VAHFLMRLLELWRPRKHVEIAQD 473
           +++  +  L+   P   +++A D
Sbjct: 435 LSYLFLIFLQWMTPDSFIDVAID 457
>ref|NP_566874.1| (NM_114379) n-acetylglucosaminyl-phosphatidylinositol-like protein
           [Arabidopsis thaliana]
 gb|AAK62657.1| (AY039602) AT3g45100/T14D3_40 [Arabidopsis thaliana]
          Length = 447

 Score =  388 bits (987), Expect = e-106
 Identities = 194/333 (58%), Positives = 249/333 (74%), Gaps = 2/333 (0%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           +R+ MVSDFF+P+ GGVE H+Y LSQ LL LGHK+VV+THAYG+ SG+RY+TG LKVYY+
Sbjct: 7   LRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYV 66

Query: 61  PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120
           P +    Q   PT    +P++R +L RE++ VVHGH AFS L HEALM    +G K VFT
Sbjct: 67  PWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFT 126

Query: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180
           DHSL+GFAD+ +   N +L+ +L  ++ AICVSH  KENTVLR+ ++  +V +IPNAVDT
Sbjct: 127 DHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDT 186

Query: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDL 239
           A+F P    RPS DII IVV SRLVYRKG DLL  +IP   +  PN+ F++ GDGPK   
Sbjct: 187 AMFKP-ASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVR 245

Query: 240 LEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVS 299
           LEE+REK ++Q+RV+M+GAV H+RVR  LV GHIFLN+SLTEA+C+AI+EAASCGL  VS
Sbjct: 246 LEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVS 305

Query: 300 TSVGGIPEVLPKSLILLAEPEIDAIYAAILIAI 332
           T VGG+PEVLP  +++LAEP+ D +  AI  AI
Sbjct: 306 TRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAI 338
 Score = 38.0 bits (87), Expect = 0.38
 Identities = 28/87 (32%), Positives = 47/87 (53%), Gaps = 4/87 (4%)

Query: 392 PVMCPYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSW----FLV 447
           P + P   +  ++ LY+W+DVA RT  VYDR L   + +  E +      G+W    F +
Sbjct: 342 PTINPEEMHNRMKKLYSWQDVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCM 401

Query: 448 FFVVAHFLMRLLELWRPRKHVEIAQDV 474
             ++ + L RLL+L +P + +E A D+
Sbjct: 402 VMILDYLLWRLLQLLQPDEDIEEAPDI 428
>pir||T47450 n-acetylglucosaminyl-phosphatidylinositol-like protein -
           Arabidopsis thaliana
 emb|CAB72148.1| (AL138649) n-acetylglucosaminyl-phosphatidylinositol-like protein
           [Arabidopsis thaliana]
          Length = 450

 Score =  377 bits (958), Expect = e-103
 Identities = 192/336 (57%), Positives = 247/336 (73%), Gaps = 5/336 (1%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           +R+ MVSDFF+P+ GGVE H+Y LSQ LL LGHK+VV+THAYG+ SG+RY+TG LKVYY+
Sbjct: 7   LRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYV 66

Query: 61  PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120
           P +    Q   PT    +P++R +L RE++ VVHGH AFS L HEALM    +G K VFT
Sbjct: 67  PWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFT 126

Query: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180
           DHSL+GFAD+ +   N +L+ +L  ++ AICVSH  KENTVLR+ ++  +V +IPNAVDT
Sbjct: 127 DHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDT 186

Query: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDL 239
           A+F P    RPS DII IVV SRLVYRKG DLL  +IP   +  PN+ F++ GDGPK   
Sbjct: 187 AMFKP-ASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVR 245

Query: 240 LEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVS 299
           LEE+REK ++Q+RV+M+GAV H+RVR  LV GHIFLN+SLTEA+C+AI+EAASCGL  VS
Sbjct: 246 LEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVS 305

Query: 300 TSVGGI---PEVLPKSLILLAEPEIDAIYAAILIAI 332
           T VGG     +VLP  +++LAEP+ D +  AI  AI
Sbjct: 306 TRVGGFLHGLQVLPDDMVVLAEPDPDDMVRAIEKAI 341
 Score = 38.0 bits (87), Expect = 0.36
 Identities = 28/87 (32%), Positives = 47/87 (53%), Gaps = 4/87 (4%)

Query: 392 PVMCPYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSW----FLV 447
           P + P   +  ++ LY+W+DVA RT  VYDR L   + +  E +      G+W    F +
Sbjct: 345 PTINPEEMHNRMKKLYSWQDVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCM 404

Query: 448 FFVVAHFLMRLLELWRPRKHVEIAQDV 474
             ++ + L RLL+L +P + +E A D+
Sbjct: 405 VMILDYLLWRLLQLLQPDEDIEEAPDI 431
>ref|NP_495840.1| (NM_063439) phosphatidylinositol biosynthetic protein
           [Caenorhabditis elegans]
 pir||T20374 hypothetical protein D2085.6 - Caenorhabditis elegans
 emb|CAA91062.1| (Z54284) contains similarity to Pfam domain: PF00534 (Glycosyl
           transferases group 1), Score=91.6, E-value=9.5e-25,
           N=1~cDNA EST yk349e7.5 comes from this gene
           [Caenorhabditis elegans]
          Length = 444

 Score =  370 bits (941), Expect = e-101
 Identities = 192/338 (56%), Positives = 245/338 (71%), Gaps = 4/338 (1%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           I +VSDFF P+ GGVE H+Y L+Q L+ LGH++VV+TH YG+  GIRY++  LKVYYLP 
Sbjct: 10  IALVSDFFCPNAGGVETHIYFLAQCLIELGHRVVVITHGYGNRKGIRYLSNGLKVYYLPF 69

Query: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122
            V YN   L + V ++P LR VLLRE V+++HGHS FS+LAHE LM+G L+GL+TVFTDH
Sbjct: 70  IVAYNGATLGSIVGSMPWLRKVLLRENVQIIHGHSTFSSLAHETLMIGGLMGLRTVFTDH 129

Query: 123 SLFGFADLSAALTNNL-LEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTA 181
           SLFGFAD SA LTN L L+ +L  V+  ICVS+  KENTVLR ++  ++VS IPNA++T+
Sbjct: 130 SLFGFADASAILTNKLVLQYSLINVDQTICVSYTSKENTVLRGKLDPNKVSTIPNAIETS 189

Query: 182 LFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDLL 240
           LFTPD  Q   N+   IV   RLVYRKG DLL  I+P+      ++ FII GDGPKR  L
Sbjct: 190 LFTPDRNQF-FNNPTTIVFLGRLVYRKGADLLCEIVPKVCARHKSVRFIIGGDGPKRIEL 248

Query: 241 EEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVST 300
           EE+ E+  + ERV ++G + HN+V+  L +G IF+NTSLTEA+CM+IVEAASCGL VVST
Sbjct: 249 EEMLERFKLHERVVILGMLPHNQVKRVLNQGQIFINTSLTEAFCMSIVEAASCGLHVVST 308

Query: 301 SVGGIPEVLP-KSLILLAEPEIDAIYAAILIAIDRHRK 337
            VGG+PEVLP    I L EP  D +  A+L A+DR  K
Sbjct: 309 RVGGVPEVLPIGEFISLEEPVPDDLVDALLKAVDRREK 346
 Score = 34.1 bits (77), Expect = 5.2
 Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 5/80 (6%)

Query: 393 VMCPYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSWFLVFFVVA 452
           +M P   +E V  +YNW DVA RT  +Y + +          +   +  G  F + ++V 
Sbjct: 349 LMDPTEKHEAVSKMYNWPDVAARTQVIYQKAVESEPTGRLGRLKGYYDQGIGFGIMYIVV 408

Query: 453 H----FLMRLLELW-RPRKH 467
                F + +L+L+  PRK+
Sbjct: 409 SCIIIFWLTVLDLFDSPRKN 428
>pir||T40367 n-acetylglucosaminyl-phosphatidylinositol - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAB09127.1| (Z95620) n-acetylglucosaminyl-phosphatidylinositol
           [Schizosaccharomyces pombe]
          Length = 456

 Score =  346 bits (879), Expect = 5e-94
 Identities = 170/319 (53%), Positives = 224/319 (69%), Gaps = 2/319 (0%)

Query: 5   MVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPIKV 64
           MVSDFF+P  GG+E H++ LSQ L+ LGHK++V+THAY D  G+RY+T  L VYY+P+  
Sbjct: 1   MVSDFFFPQPGGIESHIFQLSQRLIDLGHKVIVITHAYKDRVGVRYLTNGLTVYYVPLHT 60

Query: 65  CYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSL 124
            Y +   P+     P+ R +++RE +E+VHGH + S L H+A++    +GLKT FTDHSL
Sbjct: 61  VYRETTFPSFFSFFPIFRNIVIRENIEIVHGHGSLSFLCHDAILHARTMGLKTCFTDHSL 120

Query: 125 FGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFT 184
           FGFAD  + +TN LL+  +  VNH ICVSH  +ENTVLRA +   RVSVIPNA+    F 
Sbjct: 121 FGFADAGSIVTNKLLKFTMSDVNHVICVSHTCRENTVLRAVLNPKRVSVIPNALVAENFQ 180

Query: 185 PDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDLLEEI 243
           PDP  + S D + IVV SRL Y KGIDLL  +IPR     P + F+I GDGPK   LE++
Sbjct: 181 PDP-SKASKDFLTIVVISRLYYNKGIDLLIAVIPRICAQHPKVRFVIAGDGPKSIDLEQM 239

Query: 244 REKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVG 303
           REK  +Q+RV+M+G+V H++VRD +VRGHI+L+ SLTEA+   +VEAASCGL V+ST VG
Sbjct: 240 REKYMLQDRVEMLGSVRHDQVRDVMVRGHIYLHPSLTEAFGTVLVEAASCGLYVISTKVG 299

Query: 304 GIPEVLPKSLILLAEPEID 322
           G+PEVLP  +   A PE D
Sbjct: 300 GVPEVLPSHMTRFARPEED 318
 Score = 40.4 bits (93), Expect = 0.070
 Identities = 24/78 (30%), Positives = 40/78 (50%), Gaps = 4/78 (5%)

Query: 400 NELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSW----FLVFFVVAHFL 455
           +E V+ +Y+W DVA RT KVYD + +E +    + +   +  G W    F +   + + +
Sbjct: 342 HEEVKQMYSWIDVAERTEKVYDSICSENNLRLIDRLKLYYGCGQWAGKLFCLLIAIDYLV 401

Query: 456 MRLLELWRPRKHVEIAQD 473
           M LLE   P   ++ A D
Sbjct: 402 MVLLEWIWPASDIDPAVD 419
>gb|AAF76891.1| (AF267484) PIG-A [Paramecium tetraurelia]
          Length = 442

 Score =  326 bits (828), Expect = 4e-88
 Identities = 163/333 (48%), Positives = 227/333 (67%), Gaps = 1/333 (0%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           + IC++ DFFYP +GGVE H++ L   L+  G K++++TH Y   SG+RY+T  LKVYY 
Sbjct: 2   VNICLICDFFYPCLGGVEMHIFQLGLCLIERGLKVIIITHKYQGRSGVRYMTNGLKVYYC 61

Query: 61  PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFT 120
           P        +L T V  +P+ R +LLRE + +VH H+A S L  E L+    +G KTVFT
Sbjct: 62  PFIPAIQTVVLFTYVGTLPIFRQILLREEIHIVHSHAATSYLGGELLLHAKSMGFKTVFT 121

Query: 121 DHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDT 180
           DHSLF F D ++   N +L+  L  ++H+I VSH+ KEN  +RA +    +SVIPNAVD 
Sbjct: 122 DHSLFAFNDAASFHVNKILKYILCEIDHSISVSHVSKENLSMRASLDPRNISVIPNAVDC 181

Query: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRF-KNTPNINFIIVGDGPKRDL 239
           + FTP+PQ+R   + INIVV  R+ +RKG+DLL  ++    K  P I FII GDGPK+ +
Sbjct: 182 SRFTPNPQKRYPLNTINIVVICRMTFRKGVDLLVDVLQIICKQHPEIYFIIGGDGPKKKI 241

Query: 240 LEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVS 299
           LEE  ++ N+Q + +++G+V  ++V+D L RGHIFLNTSLTEA+C+AIVEAASCGL VVS
Sbjct: 242 LEETIQRYNLQNQTELLGSVPGHQVKDVLNRGHIFLNTSLTEAFCIAIVEAASCGLCVVS 301

Query: 300 TSVGGIPEVLPKSLILLAEPEIDAIYAAILIAI 332
           T+VGGI EVLP++++L A+P  + I   I  AI
Sbjct: 302 TNVGGISEVLPQNMVLYADPTPEDISHKITQAI 334
 Score = 38.8 bits (89), Expect = 0.18
 Identities = 24/77 (31%), Positives = 44/77 (56%), Gaps = 6/77 (7%)

Query: 397 YRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQHGSWFLVFFVVAH--- 453
           Y+ +ELV+ +Y+WE VA RT KVY ++L  ++ T  +     + +G  + +F ++     
Sbjct: 343 YQQHELVKKMYSWEQVAERTEKVYYKILQTQNQTILKRFKDCYSNGQIYGLFLMILLIFD 402

Query: 454 --FLMRLLELWRPRKHV 468
             FLM +L+  +P K +
Sbjct: 403 LIFLM-ILDFLQPHKGI 418
>ref|NP_015150.2| (NC_001148) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein; Spt14p [Saccharomyces cerevisiae]
 sp|P32363|GPI3_YEAST N-ACETYLGLUCOSAMINYL-PHOSPHATIDYLINOSITOL BIOSYNTHETIC PROTEIN
           (GLCNAC-PI SYNTHESIS PROTEIN)
 emb|CAA44924.1| (X63290) trans-acting transcription factor [Saccharomyces
           cerevisiae]
          Length = 452

 Score =  298 bits (755), Expect = 2e-79
 Identities = 149/322 (46%), Positives = 217/322 (67%), Gaps = 6/322 (1%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           I M+ DFFYP +GGVE H+Y+LSQ L+ LGH +V++THAY D  G+R++T  LKVY++P 
Sbjct: 5   IAMLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPF 64

Query: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122
            V + +   PT     P++R +LLRE++++VH H + S  AHE ++  + +GL+TVFTDH
Sbjct: 65  FVIFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDH 124

Query: 123 SLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTAL 182
           SL+GF +L++   N LL   L  ++  ICVS+  KEN ++R  ++   +SVIPNAV +  
Sbjct: 125 SLYGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSED 184

Query: 183 FTP-DP----QQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT-PNINFIIVGDGPK 236
           F P DP    +++ S D I IVV  RL   KG DLL  IIP+  ++  ++ FI+ GDGPK
Sbjct: 185 FKPRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPK 244

Query: 237 RDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQ 296
               +++ E   +Q+RVQ++G+V H +VRD L +G I+L+ SLTEA+   +VEAASC L 
Sbjct: 245 FIDFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLL 304

Query: 297 VVSTSVGGIPEVLPKSLILLAE 318
           +V+T VGGIPEVLP  + + AE
Sbjct: 305 IVTTQVGGIPEVLPNEMTVYAE 326
 Score = 34.9 bits (79), Expect = 3.0
 Identities = 23/80 (28%), Positives = 41/80 (50%), Gaps = 8/80 (10%)

Query: 400 NELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVW----QHGSW----FLVFFVV 451
           ++ V  +Y+W DVA RTV++Y  + +  S    + +  V     + G W    +L+  +V
Sbjct: 355 HDSVSKMYDWMDVAKRTVEIYTNISSTSSADDKDWMKMVANLYKRDGIWAKHLYLLCGIV 414

Query: 452 AHFLMRLLELWRPRKHVEIA 471
            + L  LLE   PR  +++A
Sbjct: 415 EYMLFFLLEWLYPRDEIDLA 434
>pir||S65187 GPI-anchor biosynthesis protein PIG-A - yeast (Saccharomyces
           cerevisiae)
 emb|CAA97882.1| (Z73531) ORF YPL175w [Saccharomyces cerevisiae]
          Length = 461

 Score =  296 bits (750), Expect = 7e-79
 Identities = 148/320 (46%), Positives = 216/320 (67%), Gaps = 6/320 (1%)

Query: 5   MVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPIKV 64
           M+ DFFYP +GGVE H+Y+LSQ L+ LGH +V++THAY D  G+R++T  LKVY++P  V
Sbjct: 16  MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 75

Query: 65  CYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSL 124
            + +   PT     P++R +LLRE++++VH H + S  AHE ++  + +GL+TVFTDHSL
Sbjct: 76  IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 135

Query: 125 FGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFT 184
           +GF +L++   N LL   L  ++  ICVS+  KEN ++R  ++   +SVIPNAV +  F 
Sbjct: 136 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 195

Query: 185 P-DP----QQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT-PNINFIIVGDGPKRD 238
           P DP    +++ S D I IVV  RL   KG DLL  IIP+  ++  ++ FI+ GDGPK  
Sbjct: 196 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 255

Query: 239 LLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVV 298
             +++ E   +Q+RVQ++G+V H +VRD L +G I+L+ SLTEA+   +VEAASC L +V
Sbjct: 256 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 315

Query: 299 STSVGGIPEVLPKSLILLAE 318
           +T VGGIPEVLP  + + AE
Sbjct: 316 TTQVGGIPEVLPNEMTVYAE 335
 Score = 34.9 bits (79), Expect = 3.1
 Identities = 23/80 (28%), Positives = 41/80 (50%), Gaps = 8/80 (10%)

Query: 400 NELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVW----QHGSW----FLVFFVV 451
           ++ V  +Y+W DVA RTV++Y  + +  S    + +  V     + G W    +L+  +V
Sbjct: 364 HDSVSKMYDWMDVAKRTVEIYTNISSTSSADDKDWMKMVANLYKRDGIWAKHLYLLCGIV 423

Query: 452 AHFLMRLLELWRPRKHVEIA 471
            + L  LLE   PR  +++A
Sbjct: 424 EYMLFFLLEWLYPRDEIDLA 443
>prf||1804343A SPT14 gene [Saccharomyces cerevisiae]
          Length = 415

 Score =  295 bits (748), Expect = 1e-78
 Identities = 148/320 (46%), Positives = 216/320 (67%), Gaps = 6/320 (1%)

Query: 5   MVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPIKV 64
           M+ DFFYP +GGVE H+Y+LSQ L+ LGH +V++THAY D  G+R++T  LKVY++P  V
Sbjct: 1   MLCDFFYPQLGGVEFHIYHLSQKLIDLGHSVVIITHAYKDRVGVRHLTNGLKVYHVPFFV 60

Query: 65  CYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSL 124
            + +   PT     P++R +LLRE++++VH H + S  AHE ++  + +GL+TVFTDHSL
Sbjct: 61  IFRETTFPTVFSTFPIIRNILLREQIQIVHSHGSASTFAHEGILHANTMGLRTVFTDHSL 120

Query: 125 FGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFT 184
           +GF +L++   N LL   L  ++  ICVS+  KEN ++R  ++   +SVIPNAV +  F 
Sbjct: 121 YGFNNLTSIWVNKLLTFTLTNIDRVICVSNTCKENMIVRTELSPDIISVIPNAVVSEDFK 180

Query: 185 P-DP----QQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT-PNINFIIVGDGPKRD 238
           P DP    +++ S D I IVV  RL   KG DLL  IIP+  ++  ++ FI+ GDGPK  
Sbjct: 181 PRDPTGGTKRKQSRDKIVIVVIGRLFPNKGSDLLTRIIPKVCSSHEDVEFIVAGDGPKFI 240

Query: 239 LLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVV 298
             +++ E   +Q+RVQ++G+V H +VRD L +G I+L+ SLTEA+   +VEAASC L +V
Sbjct: 241 DFQQMIESHRLQKRVQLLGSVPHEKVRDVLCQGDIYLHASLTEAFGTILVEAASCNLLIV 300

Query: 299 STSVGGIPEVLPKSLILLAE 318
           +T VGGIPEVLP  + + AE
Sbjct: 301 TTQVGGIPEVLPNEMTVYAE 320
>ref|XP_045453.1| (XM_045453) similar to phosphatidylinositol glycan, class A isoform
           1; Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
          Length = 280

 Score =  234 bits (592), Expect = 2e-60
 Identities = 119/217 (54%), Positives = 157/217 (71%), Gaps = 3/217 (1%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           ICM SDFFYP++GGVE H+Y L Q L+  G K++++ HAYG+  GIRY+T  LKVYYLP+
Sbjct: 35  ICMASDFFYPNMGGVESHIYQLPQCLIGRGDKVIIVIHAYGNRKGIRYLTNDLKVYYLPL 94

Query: 63  KVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDH 122
           KV YNQ +  T   ++P+L+ + ++ERV ++H HS+FSA+AH+ L     +GL+TV TDH
Sbjct: 95  KVMYNQSMAMTLFHSLPLLKYIFVQERVTIIHSHSSFSAMAHDVLFHAKTMGLQTVLTDH 154

Query: 123 SLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTAL 182
            L GFA + + LTN LL V+L   +  ICVS+  KENTVLRA +    VSVIPNAVD   
Sbjct: 155 PLSGFAKVHSVLTNKLLTVSLCDTSRIICVSYTSKENTVLRAALITEIVSVIPNAVDPID 214

Query: 183 FTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPR 219
           FTPDP +R  +D I IVV SRLVYRKG +L++GIIP+
Sbjct: 215 FTPDPFRR--HDSITIVV-SRLVYRKGTNLVSGIIPK 248
>emb|CAB57276.1| (X77725) PIG-A [Homo sapiens]
          Length = 248

 Score =  164 bits (411), Expect = 3e-39
 Identities = 84/154 (54%), Positives = 114/154 (73%), Gaps = 3/154 (1%)

Query: 198 IVVASRLVYRKG--IDLLAGIIPRF-KNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQ 254
           I+V      RKG  IDLL+GIIP   +  P++NFII G+GPKR +LEE+RE+  + +RV+
Sbjct: 68  IIVTHAYGNRKGIRIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRERYQLHDRVR 127

Query: 255 MVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVLPKSLI 314
           ++GA+EH  VR+ LV+GHIFLNTSLTEA+CMAIVEAASCGLQVVST VGGIPEVLP++LI
Sbjct: 128 LLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPENLI 187

Query: 315 LLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGN 348
           +L EP + ++   +  AI + +  +     ++ N
Sbjct: 188 ILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHN 221
 Score = 76.2 bits (185), Expect = 1e-12
 Identities = 31/47 (65%), Positives = 40/47 (84%)

Query: 3  ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIR 49
          ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++++THAYG+  GIR
Sbjct: 35 ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIR 81
 Score = 34.9 bits (79), Expect = 2.8
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 396 PYRCNELVETLYNWEDVALRTVKVYDRV 423
           P   + +V+T Y W +VA RT KVYDRV
Sbjct: 216 PENIHNIVKTFYTWRNVAERTEKVYDRV 243
>pir||I52665 class A GlcNAc-inositol phospholipid assembly protein PIG-A - human
 gb|AAD14160.1|S74936_1 (S74936) class A GlcNAc-inositol phospholipid assembly protein
           [Homo sapiens]
          Length = 315

 Score =  115 bits (287), Expect = 1e-24
 Identities = 56/97 (57%), Positives = 75/97 (76%)

Query: 252 RVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVLPK 311
           RV+++GA+EH  VR+ LV+GHIFLNTSLTEA+CMAIVEAASCGLQVVST VGGIPEVLP+
Sbjct: 114 RVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVGGIPEVLPE 173

Query: 312 SLILLAEPEIDAIYAAILIAIDRHRKSSFKVSPSVGN 348
           +LI+L EP + ++   +  AI + +  +     ++ N
Sbjct: 174 NLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHN 210
 Score =  113 bits (281), Expect = 6e-24
 Identities = 51/84 (60%), Positives = 65/84 (76%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++++THAYG+  GIRY+T  LKVYYLP+
Sbjct: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94

Query: 63  KVCYNQCILPTAVCNVPMLRAVLL 86
           KV YNQ    T   ++P+LR  LL
Sbjct: 95  KVMYNQSTATTLFHSLPLLRVRLL 118
 Score = 36.1 bits (82), Expect = 1.3
 Identities = 24/83 (28%), Positives = 37/83 (43%), Gaps = 5/83 (6%)

Query: 396 PYRCNELVETLYNWEDVALRTVKVYDRVLNERSFTTSELVFAVWQH----GSWFLVFFVV 451
           P   + +V+T Y W +VA RT KVYDRV  E      + +  +  H      +      V
Sbjct: 205 PENIHNIVKTFYTWRNVAERTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAV 264

Query: 452 AHFLMRLLELW-RPRKHVEIAQD 473
            +FL  +   W  P   +++A D
Sbjct: 265 FNFLFLIFLRWMTPDSIIDVAID 287
>ref|NP_065205.1| (NM_020472) phosphatidylinositol glycan, class A isoform 2;
           Phosphatidylinositol glycan, class A; GLCNAC-PI
           synthesis protein [Homo sapiens]
          Length = 118

 Score =  112 bits (277), Expect = 2e-23
 Identities = 49/80 (61%), Positives = 63/80 (78%)

Query: 3   ICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPI 62
           ICMVSDFFYP++GGVE H+Y LSQ L+  GHK++++THAYG+  GIRY+T  LKVYYLP+
Sbjct: 35  ICMVSDFFYPNMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPL 94

Query: 63  KVCYNQCILPTAVCNVPMLR 82
           KV YNQ    T   ++P+LR
Sbjct: 95  KVMYNQSTATTLFHSLPLLR 114
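The "Identities" and "Positives" figures in each alignment header can be recounted from the three-row blocks: the middle row repeats the residue where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. The Python sketch below recounts these for a single block, using the last block of the alignment above as input. The function name `alignment_stats` is a hypothetical helper, and the sketch assumes the leading `Query:`/`Sbjct:` labels and coordinates have already been stripped so that the three strings are column-aligned.

```python
def alignment_stats(query: str, midline: str, sbjct: str) -> dict:
    """Count identities, positives and gaps in one aligned block."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("rows must have equal length")
    # Identity: same residue in both rows (gap characters never count).
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Positive: any non-blank midline column (identity or '+').
    positives = sum(ch != " " for ch in midline)
    # Gap column: a '-' in either row.
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
    }


if __name__ == "__main__":
    query   = "KVCYNQCILPTAVCNVPMLR"
    midline = "KV YNQ    T   ++P+LR"
    sbjct   = "KVMYNQSTATTLFHSLPLLR"
    print(alignment_stats(query, midline, sbjct))
```

Summing the per-block counts over all blocks of a segment should reproduce the totals printed in that segment's header line.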
>ref|NP_126928.1| (NC_000868) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
           PROTEIN [Pyrococcus abyssi]
 pir||A75033 probable hexosyltransferase (EC 2.4.1.-) PAB0827 [similarity] -
           Pyrococcus abyssi (strain Orsay)
 emb|CAB50158.1| (AJ248287) GALACTOSYLTRANSFERASE or LPS BIOSYNTHESIS RFBU RELATED
           PROTEIN [Pyrococcus abyssi]
          Length = 371

 Score = 84.0 bits (205), Expect = 5e-15
 Identities = 63/242 (26%), Positives = 122/242 (50%), Gaps = 14/242 (5%)

Query: 91  EVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSL----FGFADLSAALTNNLLEVNLGMV 146
           +VVH   AF+ L+ +++  G+ +G  T+ T+HS+    F   +  + ++ +  ++ LG V
Sbjct: 91  DVVHAQHAFTPLSLKSIPAGNKVGALTLVTNHSVEFENFSILNGFSKMSYSYFKMYLGQV 150

Query: 147 NHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVY 206
              I VS   K +     +     +  IPN V+   F    ++  +    NI+   RL  
Sbjct: 151 KVGIGVS---KASVSFLRKFTNAPIVEIPNGVNIERFNGRGREWGTR---NILYVGRLEP 204

Query: 207 RKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRD 266
           RKG++ L   +   +        IVGDG  R +L+   +K  ++++V+ +G +    +  
Sbjct: 205 RKGVNYLISAMKFVEG----KLTIVGDGSMRKVLKMQAKKLGVEDKVEFLGFISQEELIL 260

Query: 267 FLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEPEIDAIYA 326
              +  +F+  SL+EA+ + ++EA +  + V+ TSVGGIPE++  + I++   +  A+  
Sbjct: 261 LYKKSEVFVLPSLSEAFGIVLLEAMASEVPVIGTSVGGIPEIIGDAGIIVPPRDSKALAN 320

Query: 327 AI 328
           AI
Sbjct: 321 AI 322
 Score = 46.2 bits (108), Expect = 0.001
 Identities = 18/41 (43%), Positives = 32/41 (77%)

Query: 1  MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHA 41
          ++I +VSD+++P IGGV  HV+NL+  L  +GH++ ++T+A
Sbjct: 4  LKIALVSDWYFPKIGGVAIHVHNLAIHLRKMGHEVSIVTNA 44
>ref|NP_217548.1| (NC_000962) hypothetical protein Rv3032 [Mycobacterium tuberculosis
           H37Rv]
 ref|NP_337632.1| (NC_002755) glycosyl transferase [Mycobacterium tuberculosis
           CDC1551]
 pir||C70859 probable hexosyltransferase (EC 2.4.1.-) Rv3032 [similarity] -
           Mycobacterium tuberculosis (strain H37RV)
 emb|CAA16117.1| (AL021287) hypothetical protein Rv3032 [Mycobacterium tuberculosis
           H37Rv]
 gb|AAK47446.1| (AE007130) glycosyl transferase [Mycobacterium tuberculosis
           CDC1551]
          Length = 414

 Score = 80.9 bits (197), Expect = 5e-14
 Identities = 90/339 (26%), Positives = 146/339 (42%), Gaps = 38/339 (11%)

Query: 1   MRICMVSDFFYPS--IGGVEEHVYNLSQMLLSLGHKIVVL----------THAYGD--CS 46
           MRI MVS + YP   IGG+  HV++LS  L + GH +VVL          TH   D    
Sbjct: 1   MRILMVS-WEYPPVVIGGLGRHVHHLSTALAAAGHDVVVLSRCPSGTDPSTHPSSDEVTE 59

Query: 47  GIRYVTGYLKVYYLPIKVCYNQCILPTAVCNVPMLRAVLLRE--------RVEVVHGHSA 98
           G+R +      +        N  +  T      M+RA L  +        R +VVH H  
Sbjct: 60  GVRVIAAAQDPHEFTFG---NDMMAWTLAMGHAMIRAGLRLKKLGTDRSWRPDVVHAHDW 116

Query: 99  FSALAHEALMVGSLLGLKTVFTDHSLFGFAD---LSAALTNNLLEVNLGMVNHA----IC 151
              +AH A+ +     +  V T H+         +S AL+  +  V   +V  +     C
Sbjct: 117 L--VAHPAIALAQFYDVPMVSTIHATEAGRHSGWVSGALSRQVHAVESWLVRESDSLITC 174

Query: 152 VSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGID 211
            + +  E T L        ++VI N +D A + P   +RP      ++   RL Y KG+ 
Sbjct: 175 SASMNDEITELFGP-GLAEITVIRNGIDAARW-PFAARRPRTGPAELLYVGRLEYEKGVH 232

Query: 212 LLAGIIPRFKNT-PNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVR 270
                +PR + T P     I G+G ++D L +   K  +    + VG ++H  +   L R
Sbjct: 233 DAIAALPRLRRTHPGTTLTIAGEGTQQDWLIDQARKHRVLRATRFVGHLDHTELLALLHR 292

Query: 271 GHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVL 309
               +  S  E + +  +EAA+ G  +V++++GG+ E +
Sbjct: 293 ADAAVLPSHYEPFGLVALEAAAAGTPLVTSNIGGLGEAV 331
>ref|NP_489242.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB76901.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
          Length = 429

 Score = 78.5 bits (191), Expect = 2e-13
 Identities = 49/161 (30%), Positives = 85/161 (52%), Gaps = 7/161 (4%)

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGID-LLAGIIPRFKNTPNINF 228
           ++ V  + +D+  F    +  P + II I    RLV +KGI+ ++  +    KN P+I +
Sbjct: 201 KIHVHGSGIDSNSFFFQERSYPHDGIIRIATTGRLVEKKGIEYVIKAVAQVIKNYPDIEY 260

Query: 229 IIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLT------EA 282
            I+GDG  +   E++  + N+ + V+++G  +   + D L + HIF+  S+T      +A
Sbjct: 261 NIIGDGELKTHFEKLIFELNLSQNVKLLGWKQQKEIVDILDKCHIFVAPSVTGKDGNQDA 320

Query: 283 YCMAIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEPEIDA 323
               + EA + GL V+ST  GGIPE++   +     PE DA
Sbjct: 321 PVNTLKEAMAMGLPVISTRHGGIPELVTDGVSGFLVPERDA 361
>gb|AAL81488.1| (AE010240) hypothetical protein [Pyrococcus furiosus DSM 3638]
          Length = 373

 Score = 77.8 bits (189), Expect = 4e-13
 Identities = 81/313 (25%), Positives = 155/313 (48%), Gaps = 28/313 (8%)

Query: 1   MRICMVSDFFYPSI-GGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYY 59
           +RI  + D  YP + GGVE  +Y +++ L    H++ +  + + D   I+ + G      
Sbjct: 5   LRIAFIYDVIYPWVKGGVERRLYEIAKRLAE-KHEVHIYGYKHWDGKKIQEMNGIFYHGT 63

Query: 60  LPIKVCYN---QCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLK 116
           +  K  Y+   + ILP    ++ +L  +L  + ++++   +      + + +  S L   
Sbjct: 64  IKPKKIYHGNRRAILPPIFHSINLL-FLLKGQHLDIIDCQATPYFPCYASRVSNSNL--- 119

Query: 117 TVFTDHSLFGFADLS----AALTNNLLEVNLGMV--NHAICVSHIGKENTVLRARVAKHR 170
            V T H  +G   L     A     ++E  L ++  NH I VS   K++ + +A + K+ 
Sbjct: 120 -VITWHEFWGNYWLKYLGRAGFFGKIIERGLFVLTDNH-IAVSLKTKKD-LYKAGLRKN- 175

Query: 171 VSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFK-NTPNINFI 229
           + V+PN +D   F    + +PS+   +I+   RL+  K + LL   +   K + P++  +
Sbjct: 176 IYVVPNGID---FEKIQEIKPSSYTSDIIFVGRLIKEKNVPLLLKALTIIKQDIPDVKAV 232

Query: 230 IVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRD---FLVRGHIFLNTSLTEAYCMA 286
           +VGDGP+R+ LE++  K N+Q+ V+ +G +  NR  D    +    +F   SL E + + 
Sbjct: 233 VVGDGPEREYLEKLSFKLNLQDNVKFLGFL--NRYEDVVALMKASKVFAFPSLREGFGIV 290

Query: 287 IVEAASCGLQVVS 299
           ++EA + GL VV+
Sbjct: 291 VIEANASGLPVVT 303
>gb|AAL82009.1| (AE010283) glycosyl transferase [Pyrococcus furiosus DSM 3638]
          Length = 358

 Score = 77.4 bits (188), Expect = 5e-13
 Identities = 69/233 (29%), Positives = 111/233 (47%), Gaps = 18/233 (7%)

Query: 91  EVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSLFGFADLSA-----ALTNNLLEVNLGM 145
           +V+H H AF  LA +A+  G  +   T+ T HS+  FA  S       LT  L    L  
Sbjct: 69  DVIHSHHAFMPLALKAVKAGRTMEKATLLTTHSI-SFAHESKLWDTLGLTIPLFRSYLKY 127

Query: 146 VNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDII----NIVV- 200
            +  I VS   K        V+   VS++PN VD   F P   +           NIV+ 
Sbjct: 128 PHRIIAVSKAAKSFIEHFTSVS---VSIVPNGVDDTRFFPAKHKDKIKAKFGLEGNIVLY 184

Query: 201 ASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVE 260
            SR+ YRKG  +L   +  F    +   ++VG G     L+   +   ++ERV  +G V 
Sbjct: 185 VSRMSYRKGPHVL---LNAFSKIEDATLVMVGSGEMLPFLKAQAKFLGIEERVVFMGYVP 241

Query: 261 HNRVRDFLVRGHIFLNTSLT-EAYCMAIVEAASCGLQVVSTSVGGIPEVLPKS 312
            + + +      +F+  S++ EA+ + ++EA + G+ VV+T VGGIPE++ ++
Sbjct: 242 DDALPEVFRMADVFVLPSVSAEAFGIVVLEAMASGVPVVATDVGGIPEIIKEN 294
>ref|NP_350177.1| (NC_003030) Glycosyltransferase [Clostridium acetobutylicum]
 gb|AAK81517.1|AE007856_1 (AE007856) Glycosyltransferase [Clostridium acetobutylicum]
          Length = 398

 Score = 77.4 bits (188), Expect = 6e-13
 Identities = 75/320 (23%), Positives = 134/320 (41%), Gaps = 33/320 (10%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M+I + +D +YP I GV     NL + L   GH + +LT +Y   +G  Y+ G   +YYL
Sbjct: 1   MKILITTDAYYPMINGVVVSTNNLYKQLKMAGHDVRILTLSY---NGREYIEG--DIYYL 55

Query: 61  PIKVCYNQCILPTAVCNVPMLRAV---LLRERVEVVHGHSAFSALAHEALMVGSLLGLKT 117
                    + P A    P    V   ++    E++H  + FS +   A  +   L +  
Sbjct: 56  NSHFVK---VYPDARIMKPFGNKVISKIVEWSPEIIHSQTEFSTML-VAKYIKRKLDIPQ 111

Query: 118 VFTDHSLF--------GFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKH 169
           V T H+++        G   +       LL++ L   +  I  +   K   VLR      
Sbjct: 112 VHTYHTMYEDYLKYFLGGKVIRKGTMAKLLKILLNTFDEIIAPTE--KVKNVLREYEVYK 169

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIIN----------IVVASRLVYRKGIDLLAGIIPR 219
            + ++P  +D   F  +   +    I+N          +V   R+   K ID +  +  +
Sbjct: 170 DIKIVPTGIDIKSFQKELSSKEREKILNHYGWKTKDKILVYVGRVAEEKNIDEIINLFKK 229

Query: 220 FKNT-PNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTS 278
             N   +I  +IVG GP    L+E+  +  +++ V+  G V+ ++V  +   G  F+  S
Sbjct: 230 GLNELKDIKLLIVGGGPYLSQLKELVSRYGIEDIVKFTGMVDSDQVYKYYKMGIAFVTAS 289

Query: 279 LTEAYCMAIVEAASCGLQVV 298
            +E   +  +EA + G  V+
Sbjct: 290 QSETQGLTYIEALASGCPVI 309
>ref|NP_148357.1| (NC_000854) N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Aeropyrum pernix]
 pir||D72511 probable hexosyltransferase (EC 2.4.1.-) APE2066 [similarity] -
           Aeropyrum pernix (strain K1)
 dbj|BAA81076.1| (AP000063) 392aa long hypothetical
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Aeropyrum pernix]
          Length = 392

 Score = 75.8 bits (184), Expect = 1e-12
 Identities = 85/346 (24%), Positives = 152/346 (43%), Gaps = 32/346 (9%)

Query: 2   RICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAY--GDCSGIRYVTGYLKVYY 59
           RI MV DF   S+GGV+ HV +L+++L   G+ +V+++ A   GD   +     Y+    
Sbjct: 21  RIVMVMDFHPSSVGGVQSHVRDLTRLLQDFGYDVVIVSRALGKGDVKDLEAEGHYIVKPL 80

Query: 60  LPIKVCYNQCILPTAVCNVPMLRAVLLRE----RVEVVHGHSAFSALAHEALMVGSLLGL 115
            P+++ +           VP   + L RE    + +VVH H  ++  +  AL     LGL
Sbjct: 81  FPLEIIF-----------VPPDPSDLRREIESLKPDVVHSHHIYTLTSLLALKAARDLGL 129

Query: 116 KTVFTDHSLFGFADLSA--ALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKH-RVS 172
             + T+HS+F   D  A   + + +L     + N    +S     + ++   V       
Sbjct: 130 PRIATNHSIFLAYDKVALWRIASIVLPTRYLLPNAQAVISVSTAADKMVEGIVGDSVDRY 189

Query: 173 VIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT----PNINF 228
           +IPN VD   F P     P  D   ++   RLV+RKG  +L   +  F++      +   
Sbjct: 190 IIPNGVDVERFKP---STPKADYPLVLFLGRLVWRKGAHVL---VRAFRHVVDEIRDAKL 243

Query: 229 IIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSL-TEAYCMAI 287
            I G G    +++ +  +  ++  V+M+G V  +          +    S+  E++ +  
Sbjct: 244 YIGGKGEFEPIIKLLIARYGLENNVKMLGVVPESEKPSLYSSAWVTAVPSIVNESFGIVA 303

Query: 288 VEAASCGLQVVSTSVGGIPEVLPKSLI-LLAEPEIDAIYAAILIAI 332
           +E+ S G  VV++  GG+ +V+      LL +P      A  LI +
Sbjct: 304 LESLSSGTPVVASRQGGLKDVVKHGKTGLLVKPGSSKELAKALITL 349
>ref|NP_228431.1| (NC_000853) lipopolysaccharide biosynthesis protein, putative
           [Thermotoga maritima]
 pir||E72354 probable hexosyltransferase (EC 2.4.1.-) TM0622 - Thermotoga
           maritima (strain MSB8)
 gb|AAD35706.1|AE001736_4 (AE001736) lipopolysaccharide biosynthesis protein, putative
           [Thermotoga maritima]
          Length = 388

 Score = 73.9 bits (179), Expect = 5e-12
 Identities = 89/343 (25%), Positives = 162/343 (46%), Gaps = 30/343 (8%)

Query: 13  SIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYLPIKVCYNQCILP 72
           ++GG E+ V ++ +        + V+     D   +  +T   K Y +   V   + I P
Sbjct: 12  AVGGAEKLVSDMVEFADRSRFDVAVMRITGTDSFLVEKLTS--KGYQVYTIVLDYEAIAP 69

Query: 73  TAVCNVPMLRAV--------LLRE-RVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHS 123
           + V    +LRA+        LLRE R +++H H   SAL     ++ +LL  +     H+
Sbjct: 70  SKVIR-RLLRAIKNMRRTYNLLREIRPDIIHSH--LSAL--RIALIPALL-CRIPVKVHT 123

Query: 124 LFGFADLSAA----LTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVD 179
           +   A+  A       N +     G V  +I    + +    L  R  K    VI N +D
Sbjct: 124 IHTVAEKDAKGITRFFNRIAFKFFGFVPVSIS-QEVAESVKKLYGR--KISTPVIYNGID 180

Query: 180 TALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPR-FKNTPNINFIIVGDGPKRD 238
              F+ D  +R   D   ++  +RL   K   LL     +  ++ PN+   +VGDG  R 
Sbjct: 181 VQKFSIDQPKRVDRDKTILINVARLSREKNHALLVRAFSKAVQSCPNLELWLVGDGELRR 240

Query: 239 LLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVV 298
            +EE+ ++  ++E+V+  G    + V + L +  IF+ +S  E + + + EA + GL V+
Sbjct: 241 DIEELVKQLGLEEKVKFFGV--RSDVPELLSQADIFVLSSDYEGFGLVVAEAMAAGLPVI 298

Query: 299 STSVGGIPEVLP--KSLILLAEPEIDAIYAAIL-IAIDRHRKS 338
           +T++GGIPE+L   ++ IL+   ++DA+  AI+ +A D  +++
Sbjct: 299 ATAIGGIPEILEGGRAGILVPPKDVDALAKAIVELARDEKKRA 341
>ref|NP_302182.1| (NC_002677) putative transferase [Mycobacterium leprae]
 emb|CAC30668.1| (AL583923) putative transferase [Mycobacterium leprae]
          Length = 438

 Score = 72.3 bits (175), Expect = 2e-11
 Identities = 86/339 (25%), Positives = 145/339 (42%), Gaps = 38/339 (11%)

Query: 1   MRICMVSDFFYPS--IGGVEEHVYNLSQMLLSLGHKIVVL----------THAYGD--CS 46
           M+I +VS + YP   IGG+  HV++LS  L + GH +VVL          TH   D    
Sbjct: 25  MKILIVS-WEYPPVVIGGLGRHVHHLSTALAAAGHDVVVLSRRPSGTDPCTHPTSDEISE 83

Query: 47  GIRYVTGYLKVYYLPIKVCYNQCILPTAVCNVPMLRAVL--------LRERVEVVHGHSA 98
           G+R +      +        N  +  T      M+R  L        L  R +VVH H  
Sbjct: 84  GVRVIAAAQDPHEFTFS---NDMMAWTLAMGHAMIRTGLSLTRHSSDLPWRPDVVHAHDW 140

Query: 99  FSALAHEALMVGSLLGLKTVFTDHSLFGFAD---LSAALTNNLLEVNLGMVNHA----IC 151
              +AH A+ +     +  V T H+         +S AL+  +  V   +V  +     C
Sbjct: 141 L--VAHPAITLAQFYDVPMVSTIHATEAGRHSGWVSGALSRQVHAVESWLVRESDSLITC 198

Query: 152 VSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGID 211
            + +  E   L        ++VI N +D A + P   +R       ++   RL Y KG+ 
Sbjct: 199 SASMCNEIIELFGP-GLAEITVIRNGIDPARW-PFAARRARTGPAELLYVGRLEYEKGVH 256

Query: 212 LLAGIIPRFKNT-PNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVR 270
            +   +PR + + P     I G+G ++D L +   K  + +  + VG + HN +   L R
Sbjct: 257 DVIAALPRIRRSYPGTTLTIAGEGTQQDWLVDQARKYKVIKATRFVGHLNHNELLAALQR 316

Query: 271 GHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVL 309
               +  S  E + +  +EAA+ G  +V++++GG+ E +
Sbjct: 317 ADAAVLPSHYEPFGLVALEAAAAGTPLVTSNIGGLGEAV 355
>ref|NP_213400.1| (NC_000918) hypothetical protein [Aquifex aeolicus]
 pir||D70351 probable hexosyltransferase (EC 2.4.1.-) aq_572 [similarity] -
           Aquifex aeolicus
 gb|AAC06809.1| (AE000696) hypothetical protein [Aquifex aeolicus]
          Length = 366

 Score = 71.9 bits (174), Expect = 2e-11
 Identities = 82/341 (24%), Positives = 151/341 (44%), Gaps = 33/341 (9%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M+I + +D F   +GG  +    L+  L   G++++V+T +  +           KV  L
Sbjct: 1   MKIALFTDSFRKDLGGGTQVARQLAFGLSKKGYEVLVITGSTAE------EETPFKVLKL 54

Query: 61  P-IKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVF 119
           P IK  +   +   A+ NV +L+  L     +V+H H  F A    AL++G +L + TV 
Sbjct: 55  PSIKYPFYHNV-EIALPNVELLKE-LKNFNPDVIHYHDPFLA-GTMALLMGKILKIPTVG 111

Query: 120 TDHSLFGFADLSAALTNNLLEVNLGMV---------NHAICVSHIGKENTVLRARVAKHR 170
           T H           LT + ++++ G++         N   CV  + K    L   +    
Sbjct: 112 TIHI------HPKQLTYHGIKIDNGVIAKKLVSFFGNFTDCVVFVSKYQKKLYEELDSFC 165

Query: 171 VSVIPNAVDTALFTPDPQQ--RPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNINF 228
           V VI N +    F  + ++   P N I+ +   SRL   K  +     +        + +
Sbjct: 166 VKVIYNGIPDYFFVSEKRKLRNPRNRILTV---SRLDKDKNPEFALKCVAEISKEVPVEY 222

Query: 229 IIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIV 288
            IVG+G +++ LE++  K  +  +   +G V    + +  +   + LNTS TE + ++  
Sbjct: 223 TIVGEGNEKEKLEKLARKLGI--KANFLGFVPREELPELYLSHDVLLNTSKTETFGLSFA 280

Query: 289 EAASCGLQVVSTSVGGIPEVLPKSLILLAEPEIDAIYAAIL 329
           EA + G+ V++   G  PE++    I L E +++ +  A L
Sbjct: 281 EAMATGMPVIALKEGSAPEIVGDGGI-LCEEKVECVKKAFL 320
>ref|NP_489241.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB76900.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
          Length = 430

 Score = 71.9 bits (174), Expect = 2e-11
 Identities = 45/155 (29%), Positives = 80/155 (51%), Gaps = 7/155 (4%)

Query: 176 NAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT-PNINFIIVGDG 234
           + +D   FT  P+  P++  + +    RLV +KGI+     + +     PNI + ++GDG
Sbjct: 208 SGLDCNKFTFKPRYFPADGKVQVATTGRLVEKKGIEYAIRAVAKVAELYPNIEYQVIGDG 267

Query: 235 PKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLT------EAYCMAIV 288
             ++ LE++  + N+   V+++G  +   + + L   HIF+  S+T      +A    + 
Sbjct: 268 DLKEDLEQLITELNIGHIVKLLGWKQQKEIVEILENTHIFIAPSVTAADGNQDAPVNTLK 327

Query: 289 EAASCGLQVVSTSVGGIPEVLPKSLILLAEPEIDA 323
           EA + GL V+ST  GGIPE++   +     PE DA
Sbjct: 328 EAMAMGLPVISTRHGGIPELVTDGVSGFLVPERDA 362
>ref|NP_486077.1| (NC_003272) probable galactosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB73736.1| (AP003588) ORF_ID:all2037~probable galactosyltransferase [Nostoc
           sp. PCC 7120]
          Length = 366

 Score = 70.4 bits (170), Expect = 7e-11
 Identities = 69/253 (27%), Positives = 113/253 (44%), Gaps = 31/253 (12%)

Query: 67  NQCILPTAVCNVPMLRAVLLRE-RVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSLF 125
           NQ   P  + N       ++RE + ++VH H     +    L   S   L  V T H+ F
Sbjct: 59  NQSRTPLNLINAARRYRAIIREFQPDIVHAHMMTGVVLGRCLKADSEYAL--VATVHNEF 116

Query: 126 GFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAV------- 178
                SA L        +G+ +  I VSH     ++ R  + +H++ V+PN         
Sbjct: 117 ---QRSAVL--------MGLADRVIAVSH-AVAKSMARRGIPEHKLRVVPNGTLGSVRTR 164

Query: 179 DTALFTPDPQQRPSNDIINIVVASRLVYRKGI-DLLAGIIPRFKNTPNINFIIVGDGPKR 237
           +   ++P   QRP+     I   + +  RKGI +L+A      ++ P ++  +VGDGP+R
Sbjct: 165 NIKEYSPVELQRPA-----IATVAGMYQRKGIGELIAAFAQIAQDFPQVHLYLVGDGPER 219

Query: 238 DLLEEIREKTNMQE-RVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQ 296
            + EE  + T +   R+   G       + +L+   IF+  S  +   + I EA   G  
Sbjct: 220 QIFEEKAQATGLSNTRIHFEGF--QPEPQRYLLAADIFVLASHRDPSPLVIPEAREAGCA 277

Query: 297 VVSTSVGGIPEVL 309
           +V+TSV GIPE L
Sbjct: 278 IVATSVDGIPEAL 290
>ref|NP_143674.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||F71196 probable hexosyltransferase (EC 2.4.1.-) PH1844 - Pyrococcus
           horikoshii
 dbj|BAA30965.1| (AP000007) 381aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 381

 Score = 69.6 bits (168), Expect = 1e-10
 Identities = 63/233 (27%), Positives = 110/233 (47%), Gaps = 18/233 (7%)

Query: 91  EVVHGHSAFSALAHEALMVGSLLGLKTVFTDHSLFGFADLSA-----ALTNNLLEVNLGM 145
           +++H H AF+ L+ +AL  G  +   T+ T HS+  FA  S        T  L +  L  
Sbjct: 93  DIIHSHHAFTPLSLKALKAGKNMEKGTLLTTHSI-SFAHESKLWDTLGFTIPLFKSYLKY 151

Query: 146 VNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDII----NIVV- 200
            +  I VS   K        V    V ++PN VD   F P   +           N+V+ 
Sbjct: 152 SHRIIAVSKAAKSFIEHFTSVP---VLIVPNGVDDERFFPARDKEKIKAKFGLEGNVVLY 208

Query: 201 ASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVE 260
            SR+ YRKG  +L   +  F    +   ++VG+G     L+   +   ++ +V  +G V 
Sbjct: 209 VSRMSYRKGPHVL---LNAFSKIEDATLVMVGNGEMLPFLKAQTKFLGIENKVVFMGYVP 265

Query: 261 HNRVRDFLVRGHIFLNTSL-TEAYCMAIVEAASCGLQVVSTSVGGIPEVLPKS 312
            + + +      +F+  S+ +EA+ + I+EA + G+ +++T VGGIPEV+ ++
Sbjct: 266 DDILPEVFRMADVFVLPSISSEAFGIVILEAMASGVPIIATDVGGIPEVIKEN 318
 Score = 44.7 bits (104), Expect = 0.003
 Identities = 18/40 (45%), Positives = 30/40 (75%)

Query: 1  MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTH 40
          M+I +VSD++YP IGGV  H++NL+  L   GH++ ++T+
Sbjct: 4  MKIALVSDWYYPKIGGVATHMHNLAIKLRERGHEVGIVTN 43
>ref|NP_472029.1| (NC_003212) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria innocua]
 emb|CAC97926.1| (AL596173) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria innocua]
          Length = 427

 Score = 68.4 bits (165), Expect = 2e-10
 Identities = 75/320 (23%), Positives = 132/320 (40%), Gaps = 30/320 (9%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M I + +D + P I GV   +  +   L   GH + + T    D +  R  +   +V+ L
Sbjct: 1   MNIGIFTDTYSPQISGVATSIMIMENELRKQGHTVYIFTTT--DPNADRE-SEEGRVFRL 57

Query: 61  P-IKVCYNQCILP---TAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLK 116
           P I   +     P    A+  +     ++ R  ++++H H+ FS L      +     + 
Sbjct: 58  PSIPFVF----FPERRVAIAGMNKFIKLVGRLNLDIIHTHTEFS-LGLLGKRIAKKYNIP 112

Query: 117 TVFTDHSLFGFADLSAALTNNLLEVNLGMVNHAICVSH------IGKENTVLRARVAKHR 170
           ++ T H+++       A    L    +G +  + C S+        K    L  +     
Sbjct: 113 SIHTYHTMYVDYLHYIAKGKILTPSMVGKMTKSFCDSYDAIITPTAKVRHHLEEQGIHKL 172

Query: 171 VSVIPNAVDTALFTPDPQQR----------PSNDIINIVVASRLVYRKGIDLLAGIIPRF 220
           +  +P   D + F P  +QR            ND + I+   R+ + K ID +   +P  
Sbjct: 173 MYTVPTGTDISSFAPVEKQRILDLKQSLGIEENDSV-ILSLGRIAHEKNIDAIINAMPEV 231

Query: 221 KNT-PNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSL 279
             T PN   +IVGDGP R  LE++ E   ++  V   GAV+   +  +   G +F++ S 
Sbjct: 232 LETKPNAKLVIVGDGPVRKDLEKLVETKQLENHVIFTGAVDWENISLYYQLGDLFVSAST 291

Query: 280 TEAYCMAIVEAASCGLQVVS 299
           TE   +   EA +  L VV+
Sbjct: 292 TETQGLTYAEAMAASLPVVA 311
>gb|AAL81485.1| (AE010240) glycosyl transferase [Pyrococcus furiosus DSM 3638]
          Length = 383

 Score = 68.0 bits (164), Expect = 3e-10
 Identities = 52/183 (28%), Positives = 92/183 (49%), Gaps = 9/183 (4%)

Query: 161 VLRARVAKHRVSVIPNAVDTALFTPDPQ---QRPSNDIIN---IVVASRLVYRKGID-LL 213
           ++R  + + ++  IPN VDT+LF P      ++  N  I+   ++    LV +KG + L+
Sbjct: 167 LMRVGIPEDKLYYIPNGVDTSLFYPQETALIRKELNIPIDKKILISVGNLVEKKGFEYLI 226

Query: 214 AGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHI 273
             +        ++   I+G+GP R  LE I  +  ++E V +VG   H  +  ++  G +
Sbjct: 227 RAMKIILHARDDVLLYIIGEGPLRKRLENITRELKLEEHVFLVGPKPHRDIPLWINAGDL 286

Query: 274 FLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEPEID--AIYAAILIA 331
           F+  SL E + +  +EA +CG  V+ST  GG  EV+      L  P  D   +   IL+A
Sbjct: 287 FVLPSLVENFGVVNIEALACGKPVISTINGGSEEVITSEEYGLLCPPRDPECLAEKILMA 346

Query: 332 IDR 334
           +++
Sbjct: 347 LNK 349
>ref|NP_220795.1| (NC_000963) CAPM PROTEIN (capM2) [Rickettsia prowazekii]
 pir||E71699 capm protein (capM2) RP414 - Rickettsia prowazekii
 emb|CAA14871.1| (AJ235271) CAPM PROTEIN (capM2) [Rickettsia prowazekii]
          Length = 338

 Score = 66.5 bits (160), Expect = 1e-09
 Identities = 67/250 (26%), Positives = 119/250 (46%), Gaps = 41/250 (16%)

Query: 70  ILPTAVCNVPMLRAVLLRERVEVV--HGHSA--FSALA--HEALMVGSLLGLKTVFTDHS 123
           +LP    +V +L+ ++ + + +++  HG+ +  FS LA  H   ++G       +  ++S
Sbjct: 57  LLPIDPLSVLILKYIIYKTKPDIIIAHGNRSINFSKLAKPHNTKLIG-------IAHNYS 109

Query: 124 LFGF--ADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVD-T 180
           L G    D + ALT ++ E                    +L+   A+ R+ ++PN ++ T
Sbjct: 110 LKGLRKCDFAIALTYHMKEF-------------------LLKNNFAESRIFILPNMINIT 150

Query: 181 ALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTP-NINFIIVGDGPKRDL 239
             F P+   +    +I I V +R V +KGID+    I   K+   NI  +I G+G ++D 
Sbjct: 151 KNFVPN---KIYKKVIVIGVLARFVAKKGIDVFIKAIKLLKDKQYNIQVVIGGNGNEKDN 207

Query: 240 LEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVS 299
           L  +  K N+Q+++   G V  N    F  +  IF   SL E + + I+EA    + +VS
Sbjct: 208 LIALVHKLNLQDQISFTGWV--NDKDTFFKQIDIFCLPSLHEPFGIIILEAMQASVPIVS 265

Query: 300 TSVGGIPEVL 309
           T   G  E+L
Sbjct: 266 TDTEGPKEIL 275
>pir||T34839 probable hexosyltransferase (EC 2.4.1.-) SC2G5.06 [similarity] -
           Streptomyces coelicolor
 emb|CAB36593.1| (AL035478) putative transferase [Streptomyces coelicolor A3(2)]
          Length = 406

 Score = 66.1 bits (159), Expect = 1e-09
 Identities = 97/371 (26%), Positives = 158/371 (42%), Gaps = 41/371 (11%)

Query: 1   MRICMVSDFFYP--SIGGVEE-----HVYNLSQMLLSLGHKIVVLTHAYG-DCSGIRYVT 52
           MRI MVS+   P  ++GGV+      +V  L++ L   GH + V T     D      + 
Sbjct: 1   MRIAMVSEHASPLAALGGVDAGGQNVYVARLAEELAGRGHDVTVYTRRDATDLPARVPLP 60

Query: 53  GYLKVYYL----PIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALA----- 103
           G   V ++    P+ V  ++           + RA   RER +VVH H   S +A     
Sbjct: 61  GGAVVEHVPAGPPVTVPKDELFPHMPAFGAHLARA-WARERPDVVHAHFWMSGMASQIGA 119

Query: 104 --HEALMVGSLLGLKTVFTDHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTV 161
             H   +V +   L TV   H   G  D S       +E  LG     +  +   +   +
Sbjct: 120 APHGIPLVQTFHALGTVKRRHQ--GMRDTS-PYERIGIERQLGRTCERVLATCTDEVVEL 176

Query: 162 LRARVAKHRVSVIPNAVDTALFTP--DPQQRPSNDIINIVVA-SRLVYRKGIDLLAGIIP 218
               V   +VSV+P  VD   F P  D  + P   + + ++A  RLV RKG D     + 
Sbjct: 177 GDMGVPARQVSVVPCGVDAEHFHPAADTGRTPERRLRHRLLACGRLVPRKGYD---QAVR 233

Query: 219 RFKNTPNINFIIVGDGPKRDLLEE--------IREKTNMQERVQMVGAVEHNRVRDFLVR 270
              + P+   +I G  P   L  E        I  +  + +RV+++GAV+ + +   L  
Sbjct: 234 ALAHIPDAELLIAGGPPAGALETEPEARRLTGIARRAGVADRVRLLGAVDPDDMPALLRS 293

Query: 271 GHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGG----IPEVLPKSLILLAEPEIDAIYA 326
             + L T + E + +  +EA +CG+ V++T VGG    + + +   L+   +P   A  A
Sbjct: 294 SDLVLCTPVYEPFGIVPLEAMACGVPVLATDVGGHRDSVADGVTGRLVAPQDPGAVAAAA 353

Query: 327 AILIAIDRHRK 337
             L+A +R R+
Sbjct: 354 RELLADERLRR 364
>gb|AAL25631.1| (AY057452) putative glycosyltransferase [Edwardsiella ictaluri]
          Length = 366

 Score = 64.9 bits (156), Expect = 3e-09
 Identities = 66/327 (20%), Positives = 136/327 (41%), Gaps = 68/327 (20%)

Query: 14  IGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGY---------LKVYYLPIKV 64
           +GG E+ +  L+    + G ++ ++           Y+TG          +K+Y L I  
Sbjct: 12  LGGAEKQLSLLADNFTARGEQVSIV-----------YLTGEVLVKPKNKNIKIYNLGIDK 60

Query: 65  CYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLKTVFTDHS- 123
            ++  I       +  L++++   R +V+H H   + +        SL   + V + H+ 
Sbjct: 61  SFSSLIK-----GIWKLKSIISDVRPDVIHSHMYHANILARISCCLSLFSSRLVCSAHNK 115

Query: 124 ---------LFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVI 174
                    ++   D   A T N              VS    +  + +    K + S++
Sbjct: 116 NEGGRVRMIIYRMTDFLCAKTTN--------------VSQEALDEFITKKAFRKRKSSLV 161

Query: 175 PNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNI-------- 226
            N +D ++F     ++ S +I NI     + + + +   AG +   K+ PN+        
Sbjct: 162 YNGIDLSIF-----KKKSTNIQNIKNKLGINFDEKVIFCAGRLTEAKDYPNLILAISKMH 216

Query: 227 ----NFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEA 282
                 II GDGP R  +E + ++ ++  R+ ++G +++  + D+     +F+  S  E 
Sbjct: 217 QKKCKIIIAGDGPMRSDIERLIDRCHLSHRILLIGIIDN--ISDYYNLSDLFVLPSRWEG 274

Query: 283 YCMAIVEAASCGLQVVSTSVGGIPEVL 309
           + + + EA +C   V++T  GG+ EVL
Sbjct: 275 FGLVVAEAMACECPVIATDAGGVAEVL 301
>ref|NP_228553.1| (NC_000853) conserved hypothetical protein [Thermotoga maritima]
 pir||C72340 probable hexosyltransferase (EC 2.4.1.-) TM0744 - Thermotoga
           maritima (strain MSB8)
 gb|AAD35825.1|AE001744_15 (AE001744) conserved hypothetical protein [Thermotoga maritima]
          Length = 406

 Score = 64.5 bits (155), Expect = 3e-09
 Identities = 48/165 (29%), Positives = 83/165 (50%), Gaps = 9/165 (5%)

Query: 168 KHRVSVIPNAVDTALF---TPDPQQRPSNDIINIVV--ASRLVYRKGIDLLAGIIPRFKN 222
           K  + V+P  ++   F    P+  +R  N     VV  A R+   K +D L  +     N
Sbjct: 165 KRPIEVLPTGIEVEKFEVEAPEELKRKWNPEGKKVVLYAGRIAKEKNLDFLLRVFESL-N 223

Query: 223 TPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEA 282
            P I FI+VGDGP+R+ +EE  ++  +   +++ G V H+ +  +   G +F+  S TE 
Sbjct: 224 APGIAFIMVGDGPEREEVEEFAKEKGLD--LKITGFVPHDEIPLYYKLGDVFVFASKTET 281

Query: 283 YCMAIVEAASCGLQVVSTSVGGIPEVLPK-SLILLAEPEIDAIYA 326
             + ++EA + GL VV+    G+ +VL      +L E E + ++A
Sbjct: 282 QGLVLLEALASGLPVVALKWKGVKDVLKNCEAAVLIEEENERLFA 326
 Score = 39.2 bits (90), Expect = 0.14
 Identities = 36/128 (28%), Positives = 56/128 (43%), Gaps = 17/128 (13%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M I M SD + P I GV   +    + L   GHK+VV+  +  +     +V   +   + 
Sbjct: 1   MNIAMFSDTYAPQINGVATSIRVYKKKLTERGHKVVVVAPSAPEEEKDVFVVRSIPFPFE 60

Query: 61  P---IKVCYNQCILPTAVCNVPMLRAVLLRE-RVEVVHGHSAFSALAHEALMVGSLLGLK 116
           P   I +   + IL              +RE  V+++H HS F  +  +AL V   +GL 
Sbjct: 61  PQHRISIASTKNILE------------FMRENNVQIIHSHSPF-FIGFKALRVQEEMGLP 107

Query: 117 TVFTDHSL 124
            V T H+L
Sbjct: 108 HVHTYHTL 115
>dbj|BAB69052.1| (AB050640) putative glycosyltransferase [Sphaerotilus natans]
          Length = 366

 Score = 64.5 bits (155), Expect = 4e-09
 Identities = 50/183 (27%), Positives = 81/183 (43%), Gaps = 11/183 (6%)

Query: 146 VNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQ--------RPSNDIIN 197
           V+  + VS   ++  +        RV  I N  +TA+F P  Q         +P   +  
Sbjct: 186 VDALLTVSEAMRQYAIREFGAPADRVHTIINGFNTAVFKPLDQAALRAKWGVKPDEKM-- 243

Query: 198 IVVASRLVYRKGI-DLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMV 256
           IV   R V  KG+ +L+       K+ P +   +VGDG  +  L  +   T + ERV + 
Sbjct: 244 IVYVGRFVEAKGMRELITAFQTLAKDDPKVTLALVGDGVMKTELMALVRSTGLTERVHLP 303

Query: 257 GAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVLPKSLILL 316
           G     +V +++    +    S +E Y   +VE  +CG  VV+T VGG  E+L +   +L
Sbjct: 304 GGQAPEQVAEWINAADVLTLPSWSEGYPNVVVEGVACGRPVVATDVGGTREILHERNGIL 363

Query: 317 AEP 319
             P
Sbjct: 364 IPP 366
>ref|NP_295278.1| (NC_001263) conserved hypothetical protein [Deinococcus
           radiodurans]
 pir||E75381 conserved hypothetical protein - Deinococcus radiodurans (strain
           R1)
 gb|AAF11118.1|AE001999_2 (AE001999) conserved hypothetical protein [Deinococcus radiodurans]
          Length = 411

 Score = 64.1 bits (154), Expect = 4e-09
 Identities = 71/241 (29%), Positives = 110/241 (45%), Gaps = 22/241 (9%)

Query: 81  LRAVLLRERVEVVHGHSAF---SALAHEALMVGSLLGLKTVF-TDHSLFGFADLSAALTN 136
           L  V+L   V++ H H A    SA  H   + G    L T+  TD +L G        T 
Sbjct: 111 LSEVILEHGVDLTHAHYAIPHASAALHARSITGKTRVLTTLHGTDVTLVGTEPAFQHTTR 170

Query: 137 NLLEVNLGMVNHAICVSHIGKENTVLRARVAKHR-VSVIPNAVDTALF--TPDPQQR--- 190
           + +E +    +H   VSH     T  R      R + VI N VD+  F   PDP  R   
Sbjct: 171 HAIERS----DHVTAVSHSLAAET--REVFGVDRDIEVIHNFVDSDRFRRIPDPGVRARF 224

Query: 191 --PSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTN 248
             P   +I  V   R + R  ++ +  +  R  +      +++GDGP+R    E+  +  
Sbjct: 225 AHPEEALIVHVSNFRPIKR--VEDVVQVFARIASEIPARLLMIGDGPERARAFELARELG 282

Query: 249 MQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEV 308
           +  R Q +G+     V+  L    +FL TS  E++ +A +EA SC + VV+++ GGIPEV
Sbjct: 283 VIGRTQFLGSFPD--VQTVLGISDLFLLTSSHESFGLAALEAMSCEVPVVASNAGGIPEV 340

Query: 309 L 309
           +
Sbjct: 341 V 341
>ref|NP_489278.1| (NC_003272) glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB76937.1| (AP003599) glycosyltransferase [Nostoc sp. PCC 7120]
          Length = 382

 Score = 64.1 bits (154), Expect = 5e-09
 Identities = 46/187 (24%), Positives = 86/187 (45%), Gaps = 11/187 (5%)

Query: 134 LTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDP------ 187
           L N  ++ +L   +  + VSH  ++  + + R+   +VS++PN   ++ F P P      
Sbjct: 127 LKNAEVKKSLHHADQILAVSHYTRDRIIEKHRLNPDKVSILPNTFASSRFKPAPKPNYLL 186

Query: 188 ---QQRPSNDIINIVVASRLVYR-KGIDLLAGIIPRFKN-TPNINFIIVGDGPKRDLLEE 242
              Q +P   II  V       R KG D +   +P  +   PN++++IVG G  +  +E 
Sbjct: 187 RKYQLKPEQQIILTVARLAEAQRYKGYDQILQALPHIRQLIPNVHYVIVGKGNDKHRIES 246

Query: 243 IREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSV 302
           +  +  +Q  V + G V   ++ D+     +F   S  E + +  +EA +CG  V+  + 
Sbjct: 247 MIVQQGLQNCVTLAGFVPDEQLCDYYNLCDVFAMPSKREGFGIVYLEALACGKPVLGGNQ 306

Query: 303 GGIPEVL 309
            G  + L
Sbjct: 307 DGANDAL 313
>emb|CAB43611.1| (AJ239004) galactosyl transferase [Streptococcus pneumoniae]
          Length = 354

 Score = 64.1 bits (154), Expect = 5e-09
 Identities = 44/154 (28%), Positives = 74/154 (47%), Gaps = 11/154 (7%)

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGI-DLLAGIIPRFKNTPNINF 228
           ++ ++ N VDT+ +    +   SN   N +   R+  RKG  DL+  +       PN++ 
Sbjct: 152 KIVIVENGVDTSFYVEKKKSITSN---NFLFLGRMGKRKGAYDLIDAMNQAVAINPNLHL 208

Query: 229 IIVGDGPKRDLLEEIREKT---NMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCM 285
            + GDG     LE+IR+K    N+ + + +   V     +         +  S  E   M
Sbjct: 209 TMAGDGE----LEDIRQKISNLNLTDHITIYDWVNQRDKKILFQANQTLILPSYNEGLPM 264

Query: 286 AIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEP 319
           AI+EA + GL ++ST VGGIPE++ +    L +P
Sbjct: 265 AILEAMASGLAIISTPVGGIPEIIHEDNGWLIQP 298
>gb|AAK20702.1|AF316641_8 (AF316641) WciS [Streptococcus pneumoniae]
          Length = 354

 Score = 64.1 bits (154), Expect = 5e-09
 Identities = 44/154 (28%), Positives = 74/154 (47%), Gaps = 11/154 (7%)

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGI-DLLAGIIPRFKNTPNINF 228
           ++ ++ N VDT+ +    +   SN   N +   R+  RKG  DL+  +       PN++ 
Sbjct: 152 KIVIVENGVDTSFYVEKKKSITSN---NFLFLGRMGKRKGAYDLIDAMNQAVAINPNLHL 208

Query: 229 IIVGDGPKRDLLEEIREKT---NMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCM 285
            + GDG     LE+IR+K    N+ + + +   V     +         +  S  E   M
Sbjct: 209 TMAGDGE----LEDIRQKISNLNLTDHITIYDWVNQRDKKILFQANQTLILPSYNEGLPM 264

Query: 286 AIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEP 319
           AI+EA + GL ++ST VGGIPE++ +    L +P
Sbjct: 265 AILEAMASGLAIISTPVGGIPEIIHEDNGWLIQP 298
>ref|NP_378386.1| (NC_003106) 352aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
 dbj|BAB67495.1| (AP000989) 352aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
          Length = 352

 Score = 63.7 bits (153), Expect = 6e-09
 Identities = 50/172 (29%), Positives = 85/172 (49%), Gaps = 10/172 (5%)

Query: 139 LEVNLGMVNHAICVSHIGKENTVLRARVAKHRVSVIPNAVDTALFTPDPQQRPSNDIINI 198
           LE  +    + I VS+  K   + R R+ + +++VI N +D  ++ P  ++ P   I  +
Sbjct: 126 LEKTIRNYPYIISVSNTTKYELIKRFRIDESKITVIYNGIDHEIYKPG-EKSP---IPTV 181

Query: 199 VVASRLV-YRKGIDLLAGIIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVG 257
           +   RL  Y+  +D +  I  + KN   I F I G G   DL E ++   + Q+ +  +G
Sbjct: 182 LWIGRLKNYKNPLDAVK-IFKKVKNNKAI-FYIAGGG---DLEENVKRVISGQKNIIFLG 236

Query: 258 AVEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVL 309
            V  ++      +    ++TS  E + M IVEA SCG   V+ S G IPE++
Sbjct: 237 KVNESQKIKLYQQAWAVISTSFIEGWGMTIVEANSCGTPAVAYSTGSIPEII 288
>ref|NP_349148.1| (NC_003030) Glycosyltransferase [Clostridium acetobutylicum]
 gb|AAK80488.1|AE007752_5 (AE007752) Glycosyltransferase [Clostridium acetobutylicum]
          Length = 393

 Score = 63.4 bits (152), Expect = 8e-09
 Identities = 77/323 (23%), Positives = 139/323 (42%), Gaps = 39/323 (12%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M+I + +D +YP   GV   + NL + L +LGH + +L  +     G   V G   V+YL
Sbjct: 1   MKILITTDTYYPMTNGVVVSINNLYRQLKTLGHDVRILALS---PDGGEKVVG--DVFYL 55

Query: 61  PIKVCYNQCILPTAVCNVPMLRAV---LLRERVEVVHGHSAFSALAHEALMVGSLLGLKT 117
                +   I P A    P+   +   +++ R +++H  + FS +   A  +   L +  
Sbjct: 56  S---SFAIGIYPDARIMKPIKNKIVGEIIKWRPDIIHSQTEFSTML-VAKYIKRKLNIPE 111

Query: 118 VFTDHSLFGFADLSAALTNNLL-EVNLGMVNHAI---CVSHIGKENTVLRARVAKHRVS- 172
           V T H+++    L   L   +L +  +  V   +   C + I     V R ++  + VS 
Sbjct: 112 VHTYHTMYE-DYLHYLLCGRILGKAGISKVTQKLLNSCEAVIAPTEKV-RLKLQSYDVST 169

Query: 173 ---VIPNAVDTALFTPDPQQRPSNDIIN----------IVVASRLVYRKGID----LLAG 215
              V+P  +D   F  +  +    ++++          +V   R+   K I+    L   
Sbjct: 170 NIDVVPTGIDIKKFQKELNKEEKLELLSKYELTEEDTVLVYVGRIAEEKNIEEVIRLYRM 229

Query: 216 IIPRFKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFL 275
            +  FK   NI  +IVG GP    L+ I  K  + E V+  G +  +++  +   G +F+
Sbjct: 230 ALKLFK---NIKLLIVGGGPYLSKLKGIIIKNRLSEYVKFTGMISPDKICKYYKLGDVFV 286

Query: 276 NTSLTEAYCMAIVEAASCGLQVV 298
             S +E   +  VEA S GL ++
Sbjct: 287 TASTSETQGITYVEALSSGLPII 309
>ref|NP_466078.1| (NC_003210) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria monocytogenes EGD-e]
 emb|CAD00633.1| (AL591983) weakly similar to human
           N-acetylglucosaminyl-phosphatidylinositol biosynthetic
           protein [Listeria monocytogenes]
          Length = 427

 Score = 61.8 bits (148), Expect = 3e-08
 Identities = 73/320 (22%), Positives = 131/320 (40%), Gaps = 30/320 (9%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M I + +D + P I GV   +  +   L   GH + + T    D +  R  +   +V+ L
Sbjct: 1   MNIGIFTDTYSPQISGVATSIMIMENELRKQGHTVYIFTTT--DPNADRE-SEEGRVFRL 57

Query: 61  P-IKVCYNQCILP---TAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEALMVGSLLGLK 116
           P I   +     P    A+  +     ++ R  ++++H H+ FS L      +     + 
Sbjct: 58  PSIPFVF----FPERRVAIAGMNKFIKLVGRLDLDIIHTHTEFS-LGLLGKRIAKKYHIP 112

Query: 117 TVFTDHSLFGFADLSAALTNNLLEVNLGMVNHAICVSH------IGKENTVLRARVAKHR 170
           ++ T H+++       A    L    +G +  + C S+        K    L  +     
Sbjct: 113 SIHTYHTMYVDYLHYIAKGKILTPSMVGKMTKSFCDSYDAIITPTAKVRHHLEEQGIHKL 172

Query: 171 VSVIPNAVDTALFTPDPQQR----------PSNDIINIVVASRLVYRKGIDLLAGIIPRF 220
           +  +P   D + F P  +QR            ND + I+   R+ + K ID +   +P  
Sbjct: 173 MYTVPTGTDISSFAPVEKQRILDLKKLLGIGENDPV-ILSLGRIAHEKNIDAIINAMPEV 231

Query: 221 KNTPNI-NFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSL 279
             T      +IVGDGP R  LE++ E+  + + V   GAV+   +  +   G +F++ S 
Sbjct: 232 LQTKTTAKLVIVGDGPVRKDLEKLVEEKQLADHVIFTGAVDWENISLYYQLGDLFVSAST 291

Query: 280 TEAYCMAIVEAASCGLQVVS 299
           TE   +   EA +  L VV+
Sbjct: 292 TETQGLTYAEAMAASLPVVA 311
>ref|NP_484962.1| (NC_003272) probable glycosyltransferase [Nostoc sp. PCC 7120]
 dbj|BAB72876.1| (AP003584) ORF_ID:all0919~probable glycosyltransferase [Nostoc sp.
           PCC 7120]
          Length = 429

 Score = 61.0 bits (146), Expect = 4e-08
 Identities = 42/161 (26%), Positives = 80/161 (49%), Gaps = 7/161 (4%)

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDL-LAGIIPRFKNTPNINF 228
           ++ V  + +D + F    +       I I    RL+ +KGI+  +  +    +  PNI +
Sbjct: 202 KIVVHGSGIDCSRFPFKDRYLHPGQKIRIATTGRLIEKKGIEYGICAVAKVLQFYPNIEY 261

Query: 229 IIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLT------EA 282
            I+GDG  ++ L+++ +  ++ ++V++VG      +   L +  IF+  S+T      +A
Sbjct: 262 QIIGDGELKETLQQLIQSLDITDKVKLVGWKTQPEIIKILDQSDIFIAPSVTAKDGNQDA 321

Query: 283 YCMAIVEAASCGLQVVSTSVGGIPEVLPKSLILLAEPEIDA 323
               + EA   GL V++T+ GGIPE++   +     PE DA
Sbjct: 322 PVNTLKEAMIMGLPVIATTHGGIPELVEDGISGFLVPERDA 362
>ref|NP_440248.1| (NC_000911) unknown protein [Synechocystis sp. PCC 6803]
 pir||S74777 hypothetical protein slr1076 - Synechocystis sp. (strain PCC 6803)
 dbj|BAA16928.1| (D90901) ORF_ID:slr1076~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 381

 Score = 60.2 bits (144), Expect = 8e-08
 Identities = 60/252 (23%), Positives = 110/252 (42%), Gaps = 25/252 (9%)

Query: 92  VVHGHSAFSALAHEALMVGSLLGLKTVFTDHSLFGFADLSAALTNNLLEVNLGMVNHAIC 151
           ++ GH+ F+ +AH   +V  L+G+      H +  +      L N  +   L   +  + 
Sbjct: 94  IICGHANFTPVAH---LVQRLMGISYWTVAHGVDAW-----NLQNPHIIQALRHADRILA 145

Query: 152 VSHIGKENTVLRARVAKHRVSVIPNAVDTALF--TPDPQQ-------RPSNDIINIVVAS 202
           VSH  ++  +    +   +V V+PN  DT+ F   P PQ         P   +  I+  +
Sbjct: 146 VSHYTRDRLLQEQALDPEKVVVLPNTFDTSRFQIAPKPQSLLEKYNLTPDQQV--ILTIA 203

Query: 203 RLVYR---KGIDLLAGIIPR-FKNTPNINFIIVGDGPKRDLLEEIREKTNMQERVQMVGA 258
           RL      KG D +   +P   K  PNI+++I G G  R  +E++ +  ++++ V + G 
Sbjct: 204 RLAGEERYKGYDQIIRALPEIIKTIPNIHYLIGGKGGDRPRIEKLIQDLDLEDYVTLAGF 263

Query: 259 VEHNRVRDFLVRGHIFLNTSLTEAYCMAIVEAASCGLQVVSTSV-GGIPEVLPKSLILLA 317
           +    + D      +F   S  E + +  +EA +CG   +  +  G I  +    L +L 
Sbjct: 264 IPDEELADHYNLCDVFAMPSKGEGFGIVYLEAMACGKPTIGGNQDGAIDALCNGELGVLV 323

Query: 318 EP-EIDAIYAAI 328
            P ++D I   I
Sbjct: 324 NPDDLDEISTVI 335
>ref|NP_390127.1| (NC_000964) alternate gene name: jojH~similar to lipopolysaccharide
           biosynthesis-related protein [Bacillus subtilis]
 sp|P42982|YPJH_BACSU Putative glycosyl transferase ypjH
 pir||G69937 lipopolysaccharide biosynthesis-related pr homolog ypjH - Bacillus
           subtilis
 gb|AAB38445.1| (L47709) 21.4% of identity to trans-acting transcription factor of
           Sacharomyces cerevisiae; 25% of identity to sucrose
           synthase of Zea mays; putative [Bacillus subtilis]
 emb|CAB14162.1| (Z99115) alternate gene name: jojH~similar to lipopolysaccharide
           biosynthesis-related protein [Bacillus subtilis]
          Length = 377

 Score = 59.5 bits (142), Expect = 1e-07
 Identities = 79/333 (23%), Positives = 142/333 (41%), Gaps = 38/333 (11%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           MR   +    YPS+GG       L + L   GH+I  +T +       R  T +  +++ 
Sbjct: 1   MRKLKIGITCYPSVGGSGIIATELGKQLAEKGHEIHFITSSI----PFRLNTYHPNIHFH 56

Query: 61  PIKVCYNQCIL----PTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHEAL---MVGSLL 113
            ++V  NQ  +    P  +     +  V  RE ++++H H A        L   M+   +
Sbjct: 57  EVEV--NQYAVFKYPPYDLTLASKIAEVAERENLDIIHAHYALPHAVCAYLAKQMLKRNI 114

Query: 114 GLKTVF--TDHSLFGFADLSAALTNNLLEVNLGMVNHAICVSHIGKENTVLRARVAKHRV 171
           G+ T    TD ++ G+ D S     +L+   +   +    VS      T    +  K ++
Sbjct: 115 GIVTTLHGTDITVLGY-DPS---LKDLIRFAIESSDRVTAVSSALAAETYDLIKPEK-KI 169

Query: 172 SVIPNAVD--------TALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT 223
             I N +D        TA         P   ++  V   R V R     +  +I  F+N 
Sbjct: 170 ETIYNFIDERVYLKKNTAAIKEKHGILPDEKVVIHVSNFRKVKR-----VQDVIRVFRNI 224

Query: 224 P---NINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLT 280
                   ++VGDGP++    E+  K  ++++V M+G    +RV D      + L  S  
Sbjct: 225 AGKTKAKLLLVGDGPEKSTACELIRKYGLEDQVLMLG--NQDRVEDLYSISDLKLLLSEK 282

Query: 281 EAYCMAIVEAASCGLQVVSTSVGGIPEVLPKSL 313
           E++ + ++EA +CG+  + T++GGIPEV+  ++
Sbjct: 283 ESFGLVLLEAMACGVPCIGTNIGGIPEVIKNNV 315
>ref|NP_275316.1| (NC_000916) LPS biosynthesis RfbU related protein
           [Methanothermobacter thermautotrophicus]
 pir||C69098 probable hexosyltransferase (EC 2.4.1.-) MTH173 - Methanobacterium
           thermoautotrophicum (strain Delta H)
 gb|AAB84679.1| (AE000805) LPS biosynthesis RfbU related protein
           [Methanothermobacter thermautotrophicus]
          Length = 382

 Score = 56.7 bits (135), Expect = 8e-07
 Identities = 77/321 (23%), Positives = 138/321 (42%), Gaps = 23/321 (7%)

Query: 1   MRICMVSDFFYPSI-GGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYY 59
           MRI +VSDFF P   GG E   + +++ L+  GH + V++           V+G ++V++
Sbjct: 4   MRILIVSDFFVPHYNGGGERRYFEIARRLVERGHVVDVISMGIHGVGEYEEVSG-VRVHH 62

Query: 60  LPIKVCYNQCILPTAVCNVPMLRAVLLRERVEVVHGHSAFSALAHE----ALMVGSLLGL 115
           L  ++       P     +  +R +    R  + H +    A  +     A +   + G 
Sbjct: 63  LGPRIRK-----PPLRGPLDFIRFMAAAFRWVMTHDYDIIDAQTYAPLLPAFLASRIHGT 117

Query: 116 KTVFTDH---SLFGFADLSAALTNNLLEVNLGMVNH--AICVSH-IGKENTVLRARVAKH 169
             V T H   S  G   L ++ T  +LE  L  + +   I VS       T L  R    
Sbjct: 118 PMVATIHDVSSAHGDQWLQSSKTATILERVLMRLPYDGVITVSRSTASALTELHGR-NPD 176

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFK-NTPNINF 228
            + +IPN VD  L   D     + +   I+   RL   K +D L  +  +   + P++  
Sbjct: 177 GIHIIPNGVDPELI--DSVTPATGNY--IIFVGRLAPHKHVDHLIEVFSKLVIDFPDLRL 232

Query: 229 IIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYCMAIV 288
            I+GDG +R  L+ + ++  +++ V     + +  V   +    + +  S  E + M + 
Sbjct: 233 EIIGDGVERARLKAMVDECGIRDSVTFHHNLSYPEVISRIRGARVLVLPSTREGFGMVLA 292

Query: 289 EAASCGLQVVSTSVGGIPEVL 309
           EA +CG+  V+   GG+ EV+
Sbjct: 293 EAGACGVPAVAYRSGGVVEVI 313
>ref|NP_127136.1| (NC_000868) LPS BIOSYNTHESIS RFBU RELATED PROTEIN [Pyrococcus
           abyssi]
 pir||A75059 probable hexosyltransferase (EC 2.4.1.-) PAB0973 [similarity] -
           Pyrococcus abyssi (strain Orsay)
 emb|CAB50366.1| (AJ248287) LPS BIOSYNTHESIS RFBU RELATED PROTEIN [Pyrococcus
           abyssi]
          Length = 390

 Score = 55.6 bits (132), Expect = 2e-06
 Identities = 85/339 (25%), Positives = 157/339 (46%), Gaps = 44/339 (12%)

Query: 1   MRICMVSDFFYPSIGGVEEHVYNLSQMLLSLGHKIVVLTHAYGDCSGIRYVTGYLKVYYL 60
           M++ M++ +FYP  GG+E++ Y +++ L+  G ++ V+T A    + +  + G   +   
Sbjct: 1   MKLLMITPYFYPEGGGLEKYAYMIARGLVERGWEVKVIT-ASRKGNSLENLEGIEVIRLA 59

Query: 61  PIKVCYNQCILPTAVCNVPM-LRAVLLRERVEVVHGHSAFSALAHEALMVGSLL--GLKT 117
           P  +  N  I      N+P+ L  V   E+  V++ H+     A  +  V ++L    KT
Sbjct: 60  PHFIVSNTPI----SFNLPLKLIKVFKEEQFSVINAHTPVPYYADVSAWVNNVLKGSNKT 115

Query: 118 --VFTDHSLF---GFA-DLSAALTNNLLEVNLGMVNHAICV--SHIGKENTVLRARVAKH 169
             V T H+     GF  D  A L N  L+  L +++  I     +   E+ +LR    K 
Sbjct: 116 PFVLTYHNDLVKEGFPLDKVAYLYNLSLQRGLLLLSDTIITPSPYCYYESKLLRR--FKK 173

Query: 170 RVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVY----------RKGIDLLAGIIPR 219
           ++  IP  VDT  + P    R  + I N+  ++++V            KG+  L   +  
Sbjct: 174 KLIWIPPGVDTERYFPGKSYR-LHSIYNLPRSAKIVMFIGTMNRGHAHKGVPYL---LKA 229

Query: 220 FK----NTPNINFIIVGDGPKRDLLEEIRE---KTNMQERVQMVGAVEHNRVRDFLVRGH 272
           FK       +   ++VG G   D++ E ++      + +RV   G VE + + +F     
Sbjct: 230 FKYVATQVKDSYLVLVGRG---DMIPEYKKMCMSLGISKRVIFTGYVEEDILPEFYRSSD 286

Query: 273 IFL--NTSLTEAYCMAIVEAASCGLQVVSTSVGGIPEVL 309
           + +  +T++ E + M ++EA + G  V+ T+VGGI  V+
Sbjct: 287 VIVLPSTTVQEGFGMVLIEAGASGKPVIGTNVGGIKHVI 325
>ref|NP_279218.1| (NC_002607) LPS glycosyltransferase; Lpg [Halobacterium sp. NRC-1]
 gb|AAG18698.1| (AE004975) LPS glycosyltransferase; Lpg [Halobacterium sp. NRC-1]
          Length = 333

 Score = 54.0 bits (128), Expect = 6e-06
 Identities = 46/156 (29%), Positives = 72/156 (45%), Gaps = 10/156 (6%)

Query: 166 VAKHRVSVIPNA-VDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNTP 224
           V   ++S +P A +D   + P  +  PS++ I +    RL   KG D L        +  
Sbjct: 132 VPDQKISTLPIAGIDVKEYQPS-KTHPSHENITVSTVGRLANVKGYDDLIRCARDIGD-- 188

Query: 225 NINFIIVGDGPKRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEAYC 284
           ++ F I       +  E  R ++   + V   G V + ++  FL    I+   S  E  C
Sbjct: 189 DLQFQIA-----GEGEERERLESKTPDNVNFQGMVPNEQIPQFLNNSDIYFQPSKYEGLC 243

Query: 285 MAIVEAASCGLQVVSTSVGGIPE-VLPKSLILLAEP 319
           MA++EA +CGL VV++ VGGI E V+P     L  P
Sbjct: 244 MAVIEAMACGLPVVASDVGGITESVVPGETGFLCRP 279
>gb|AAL67552.1|AF461121_3 (AF461121) putative galactosyltransferase WbgM [Escherichia coli]
          Length = 364

 Score = 47.4 bits (111), Expect = 5e-04
 Identities = 67/260 (25%), Positives = 122/260 (46%), Gaps = 34/260 (13%)

Query: 65  CYNQC--ILPTAVCNVPMLR---------AVLLRERVEVVHGHSAFSALAHEALMVGSLL 113
           C+  C  I+PT    + + +          ++ +E+ ++VH HS+ +       +   L 
Sbjct: 49  CFGVCTHIIPTLTREISLFKDCASLFQLYKIIKKEKFDIVHTHSSKTGFL--GRVAAKLA 106

Query: 114 GLKTVFTDHSLFGFADLSAALTNN--------LLEVNLGMVNHAICVSHIGKENTVLRAR 165
           G K +   H++ GFA  S   T N        L+E+     ++ I V +   E    +  
Sbjct: 107 GTKKIV--HTVHGFAFPS---TENKLIKFIYFLMELIASYCSNIIIVMNESDERIARKYF 161

Query: 166 V--AKHRVSVIPNAVDTALFTPDPQQRPSNDIINIVVASRLVYRKGIDLLAGIIPRFKNT 223
           V   K ++ +I NA+D   +  D  +    DI  IV+  RL  +K   LL   I   ++ 
Sbjct: 162 VKNKKSKLLLINNAIDVDKYNKDKDKDKDKDIFKIVMVGRLCDQKNPLLLIEAIKDLES- 220

Query: 224 PNINFIIVGDGP-KRDLLEEIREKTNMQERVQMVGAVEHNRVRDFLVRGHIFLNTSLTEA 282
            NI+  I+GDGP K  LLE+I +  N+ ++V  +G ++   V + L +  +F+  S  E 
Sbjct: 221 -NIHVDIIGDGPLKVKLLEKINQ-YNIADKVSFLGWID--AVEEHLYKYDLFVLPSRWEG 276

Query: 283 YCMAIVEAASCGLQVVSTSV 302
             +A++EA +  + V+S+ +
Sbjct: 277 MPLAMLEAMAAKVPVLSSDI 296
CPU time:    78.61 user secs.	    1.54 sys. secs	   80.15 total secs.

  Database: nr
    Posted date:  Apr 21, 2002  2:19 PM
  Number of letters in database: 277,845,442
  Number of sequences in database:  887,402
  
Lambda     K      H
   0.324    0.138    0.410 

Gapped
Lambda     K      H
   0.270   0.0470    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 252891521
Number of Sequences: 887402
Number of extensions: 10307430
Number of successful extensions: 32893
Number of sequences better than 10.0: 457
Number of HSP's better than 10.0 without gapping: 178
Number of HSP's successfully gapped in prelim test: 279
Number of HSP's that attempted gapping in prelim test: 32380
Number of HSP's gapped (non-prelim): 574
length of query: 479
length of database: 277,845,442
effective HSP length: 56
effective length of query: 423
effective length of database: 228,150,930
effective search space: 96507843390
effective search space used: 96507843390
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (21.5 bits)
S2: 74 (33.2 bits)
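
The footer values above are related by the standard Karlin-Altschul formulas, and the relationship can be checked with a short script. The following is a minimal sketch, not part of the BLAST output: the function and constant names are illustrative, and the numbers are copied from the footer. It assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence, that the bit score is S' = (lambda*S - ln K) / ln 2 with the gapped Lambda and K, and that E = (effective search space) * 2^(-S').

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 479
DB_LETTERS = 277_845_442
DB_SEQUENCES = 887_402
EFFECTIVE_HSP_LENGTH = 56
GAPPED_LAMBDA = 0.270   # as printed; likely rounded
GAPPED_K = 0.0470       # as printed; likely rounded


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print(eff_q, eff_db, space)  # 423 228150930 96507843390

    # Gapped cutoff S2: raw score 74, reported as 33.2 bits.
    print(round(bit_score(74, GAPPED_LAMBDA, GAPPED_K), 1))

    # Streptococcus pneumoniae hits: 64.1 bits, reported Expect = 5e-09.
    print(f"{expect_value(64.1, space):.1e}")
```

With these inputs the effective lengths and search space match the footer exactly, and a bit score of 64.1 gives an E-value of about 4.9e-09, in line with the reported 5e-09. Converting raw score 154 with the printed Lambda and K gives roughly 64.4 bits rather than the reported 64.1, presumably because the printed parameters are rounded or differ slightly from those used internally.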