IMP GPI Biosynthesis Report IMP-Bioinformatics

GPI Anchor Biosynthesis Report: NP_565647.1 (PIG-L family, Arabidopsis thaliana)




BLASTP 2.1.1 [Aug-8-2000]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query=
         (223 letters)

Database: nr
           887,402 sequences; 277,845,442 total letters

Searching..................................................

Converged !!!


Results of PSI-Blast iteration 7

[Figure: distribution of 49 BLAST hits on the query sequence]


Sequences with E-value BETTER than threshold (0.002)

ref|NP_565647.1| (NM_128293) similar to PIG-L [Arabidopsis thali... 281 7e-75
sp|O35790|PIGL_RAT N-acetylglucosaminyl-phosphatidylinositol de-... 193 2e-48
gb|AAD41996.1|AC006233_13 (AC006233) hypothetical protein [Arabi... 182 6e-45
gb|AAF55732.1| (AE003728) CG4433 gene product [Drosophila melano... 180 1e-44
ref|NP_004269.1| (NM_004278) phosphatidylinositol glycan, class ... 176 3e-43
sp|Q9HDW9|PIGL_SCHPO Probable N-acetylglucosaminyl-phosphatidyli... 171 1e-41
ref|NP_127219.1| (NC_000868) hypothetical protein [Pyrococcus ab... 158 9e-38
ref|NP_142471.1| (NC_000961) hypothetical protein [Pyrococcus ho... 156 3e-37
gb|AAL80478.1| (AE010159) hypothetical protein [Pyrococcus furio... 154 1e-36
ref|NP_244186.1| (NC_002570) BH3320~unknown conserved protein [B... 148 8e-35
ref|NP_014008.1| (NC_001145) N-acetylglucosaminylphosphatidylino... 147 1e-34
ref|NP_294173.1| (NC_001263) conserved hypothetical protein [Dei... 146 3e-34
ref|NP_385870.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sin... 138 8e-32
ref|NP_215598.1| (NC_000962) hypothetical protein Rv1082 [Mycoba... 138 1e-31
ref|NP_371091.1| (NC_002758) conserved hypothetical protein [Sta... 137 1e-31
emb|CAC18708.2| (AL451182) conserved hypothetical protein [Strep... 137 2e-31
emb|CAA77139.1| (Y18353) hypothetical protein [Thermus thermophi... 135 6e-31
ref|NP_293807.1| (NC_001263) conserved hypothetical protein [Dei... 135 1e-30
ref|NP_242548.1| (NC_002570) BH1682~unknown conserved protein [B... 131 1e-29
ref|NP_302547.1| (NC_002677) conserved hypothetical protein [Myc... 129 3e-29
ref|NP_437291.1| (NC_003078) conserved hypothetical protein, pos... 129 6e-29
ref|NP_390128.1| (NC_000964) alternate gene name: jojG~similar t... 127 2e-28
ref|NP_403747.1| (NC_003143) hypothetical protein [Yersinia pest... 125 6e-28
ref|NP_344220.1| (NC_002754) Conserved hypothetical protein [Sul... 124 1e-27
emb|CAC16965.1| (AL450350) conserved hypothetical protein [Strep... 124 1e-27
ref|NP_389828.1| (NC_000964) Uncharacterized conserved protein [... 122 4e-27
emb|CAB66204.1| (AL136502) hypothetical protein SCF43.15c. [Stre... 121 9e-27
ref|NP_296086.1| (NC_001263) Uncharacterized conserved protein [... 119 3e-26
ref|NP_302050.1| (NC_002677) conserved hypothetical protein [Myc... 119 5e-26
gb|AAC14880.1| (AF060080) hypothetical protein [Chlorobium tepidum] 119 6e-26
ref|NP_437176.1| (NC_003078) conserved hypothetical protein [Sin... 119 6e-26
gb|AAG12428.1| (AY005138) unknown [Chlorobium tepidum] 118 7e-26
gb|AAC01723.1| (AF040570) negative regulatorly protein [Amycolat... 118 1e-25
emb|CAC05756.1| (AL391751) hypothetical protein [Streptomyces co... 116 3e-25
sp|P71311|YAIS_ECOLI HYPOTHETICAL 20.5 KDA PROTEIN IN ADHC-TAUA ... 113 3e-24
emb|CAB67717.1| (AJ271405) hypothetical protein [Streptomyces ro... 110 2e-23
ref|NP_376770.1| (NC_003106) 221aa long conserved hypothetical p... 110 2e-23
ref|NP_334747.1| (NC_002755) hypothetical protein [Mycobacterium... 108 8e-23
pir||S44952 lmbE protein - Streptomyces lincolnensis >gi|2127551... 108 1e-22
ref|NP_214837.1| (NC_000962) hypothetical protein Rv0323c [Mycob... 106 3e-22
ref|NP_492873.1| (NM_060472) Y52B11C.1.p [Caenorhabditis elegans... 101 1e-20
ref|NP_191372.1| (NM_115675) putative protein [Arabidopsis thali... 97 2e-19
ref|NP_105992.1| (NC_002678) hypothetical protein [Mesorhizobium... 97 2e-19
emb|CAC04222.1| (AL391515) conserved hypothetical protein [Strep... 96 5e-19
ref|NP_215686.1| (NC_000962) hypothetical protein Rv1170 [Mycoba... 96 6e-19
ref|NP_414898.1| (NC_000913) orf, hypothetical protein [Escheric... 86 5e-16
ref|NP_294927.1| (NC_001263) LmbE-related protein [Deinococcus r... 84 3e-15
ref|NP_285456.1| (NC_001264) hypothetical protein [Deinococcus r... 79 7e-14
pir||D69906 hypothetical protein yojG - Bacillus subtilis >gi|26... 74 2e-12
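The hit list above has one entry per line: a database identifier, a truncated description, the bit score, and the E-value. A minimal Python sketch for reading such lines follows. The names `Hit` and `parse_hit_line` are illustrative and not part of BLAST; the sketch assumes the last two whitespace-separated fields are always the score and the E-value, which holds for every line in this table.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One line of the BLAST hit summary (hypothetical helper type)."""
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    # The last two whitespace-separated fields are bit score and E-value.
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    # BLAST may print very small E-values without a mantissa, e.g. "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    # The identifier is the first token; the rest is the description.
    seq_id, _, description = head.partition(" ")
    return Hit(seq_id, description.strip(), float(bits), float(evalue))


example_lines = [
    "sp|O35790|PIGL_RAT N-acetylglucosaminyl-phosphatidylinositol de-... 193 2e-48",
    "pir||S44952 lmbE protein - Streptomyces lincolnensis >gi|2127551... 108 1e-22",
]
hits = [parse_hit_line(line) for line in example_lines]
for hit in hits:
    print(hit.seq_id, hit.bits, hit.evalue)
```

Splitting from the right keeps the parser independent of how the description was truncated, since descriptions may themselves contain spaces and digits.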

Alignments

>ref|NP_565647.1| (NM_128293) similar to PIG-L [Arabidopsis thaliana]
          Length = 223

 Score =  281 bits (713), Expect = 7e-75
 Identities = 223/223 (100%), Positives = 223/223 (100%)

Query: 1   MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFST 60
           MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFST
Sbjct: 1   MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFST 60

Query: 61  GNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITFDNYGVWGHCNHRD 120
           GNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITFDNYGVWGHCNHRD
Sbjct: 61  GNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITFDNYGVWGHCNHRD 120

Query: 121 VHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPW 180
           VHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPW
Sbjct: 121 VHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPW 180

Query: 181 KSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTLDRINPESNELL 223
           KSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTLDRINPESNELL
Sbjct: 181 KSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTLDRINPESNELL 223
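
Each alignment in this report opens with a `Score = ... Expect = ...` line and an `Identities = ...` line. The sketch below shows one way to extract those values in Python with regular expressions. The function name `parse_alignment_stats` and the returned dictionary layout are assumptions made for illustration; the patterns are written against the lines as they appear in this report, not against a formal BLAST output specification.

```python
import re

# Patterns follow the statistics lines as printed in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+) bits \((\d+)\),\s*Expect\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_alignment_stats(text: str) -> dict:
    """Return bit score, raw score, E-value and count fractions (hypothetical helper)."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no 'Score = ...' line found")
    expect = match.group(3)
    # BLAST may omit the mantissa for very small E-values, e.g. "e-100".
    if expect.startswith("e"):
        expect = "1" + expect
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(expect),
    }
    # Each fraction is stored as (count, aligned_length); Gaps may be absent.
    for name, count, length in FRACTION_RE.findall(text):
        stats[name.lower()] = (int(count), int(length))
    return stats


header = """ Score =  193 bits (488), Expect = 2e-48
 Identities = 66/214 (30%), Positives = 103/214 (47%), Gaps = 28/214 (13%)
"""
stats = parse_alignment_stats(header)
print(stats["identities"], stats["gaps"])
```

The counts are kept as integer pairs rather than recomputed percentages, so the values printed by BLAST stay authoritative. An ungapped alignment, such as the self-hit above, simply has no `gaps` key.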
>sp|O35790|PIGL_RAT N-acetylglucosaminyl-phosphatidylinositol de-N-acetylase
           (Phosphatidylinositol-glycan biosynthesis, class L
           protein) (PIG-L)
 dbj|BAA20869.1| (D88364) PIG-L [Rattus norvegicus]
          Length = 252

 Score =  193 bits (488), Expect = 2e-48
 Identities = 66/214 (30%), Positives = 103/214 (47%), Gaps = 28/214 (13%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADGMGSIRDQELHRACA 79
            + SRA ++     D+  FF+PTI         + +LCFS+GN    G IR +EL ++CA
Sbjct: 38  GAGSRALVVIAHPDDEAMFFAPTILGLARLKQQVSLLCFSSGNYYNQGEIRKKELLQSCA 97

Query: 80  VLKVIP-----FDKEGICDNDSCHCNEEH----------------IITFDNYGVWGHCNH 118
           VL + P      DK    D+     + EH                ++TFD  GV GH NH
Sbjct: 98  VLGIPPSRVMIIDKREFPDDPEVQWDTEHVASTILQHIHANATDLVVTFDAEGVSGHSNH 157

Query: 119 RDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQ 178
             ++  +       K   G   +   S+N+ RKY   +D+  ++LS +     +   +K+
Sbjct: 158 IALYKAVRALHSGGKLPEGCSVLTLQSVNVLRKYVFLLDLPWTLLSPQGVLFVL--TSKE 215

Query: 179 PWKSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTL 212
             ++ KAM+ H SQ +WFR L+ +FS Y  VN+L
Sbjct: 216 VAQAKKAMSCHRSQLLWFRHLYTVFSRYMSVNSL 249
>gb|AAD41996.1|AC006233_13 (AC006233) hypothetical protein [Arabidopsis thaliana]
          Length = 185

 Score =  182 bits (458), Expect = 6e-45
 Identities = 151/185 (81%), Positives = 152/185 (81%), Gaps = 29/185 (15%)

Query: 1   MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGK-------------FFSPTINYFT 47
           MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGK             FFSPTINYFT
Sbjct: 1   MVVVFLSLIVVIWVASFFKIFFRATSISRATILDDGKEEKCDVFMDMLDRFFSPTINYFT 60

Query: 48  STACNLHILCFSTGNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITF 107
           STACNLHILCFSTGNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITF
Sbjct: 61  STACNLHILCFSTGNADGMGSIRDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHIITF 120

Query: 108 DNYGVWGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKR 167
           DNYGVWGHCNHRDVH  +                  VSLNIFRKYCGPVDIWLSILSAKR
Sbjct: 121 DNYGVWGHCNHRDVHRGVL----------------YVSLNIFRKYCGPVDIWLSILSAKR 164

Query: 168 HPSKV 172
           HPSKV
Sbjct: 165 HPSKV 169
>gb|AAF55732.1| (AE003728) CG4433 gene product [Drosophila melanogaster]
          Length = 390

 Score =  180 bits (455), Expect = 1e-44
 Identities = 58/248 (23%), Positives = 92/248 (36%), Gaps = 59/248 (23%)

Query: 29  RATIL-----DDGKFFSPTINYFT-STACNLHILCFSTGN-------------------- 62
           R  ++     D+  FF P I   T    C ++ILC S G                     
Sbjct: 141 RVLLITAHPDDECMFFGPLIYSLTQRQGCQVYILCLSNGETTSSDIIPKPPIDLEALNES 200

Query: 63  -ADGMGSIRDQELHRACAVLKVIP-----FDKEGICDNDSCHCNE--------------- 101
             +    +R QEL R+C+ L +        +   + D+                      
Sbjct: 201 NFEHKAKVRRQELWRSCSKLGIPESNIVLMNATNLPDDPYVDWRPDAVASLILHTIESLD 260

Query: 102 -EHIITFDNYGVWGHCNHRDVHPPIDCKI-----DSAKRIHGFLYVHQVSLNIFRKYCGP 155
            + I TFD  GV  H NH  V+               +  +   Y    S+N+ RKY   
Sbjct: 261 IQAIFTFDRDGVSSHPNHCAVYYAAASLCLANLLPKGEEAYCKFYTLD-SINVVRKYLSI 319

Query: 156 VDIWLSILSAKRHPSKVIIIN-KQPWKSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTLDR 214
           +D+  +   +    +   I+N K+      AM +H SQ  WFR L++ FS Y ++N++ +
Sbjct: 320 LDLLCTCFMS----THWCILNWKEAAIVRSAMMEHQSQMRWFRWLYIYFSRYMFINSMRQ 375

Query: 215 INPESNEL 222
           IN    EL
Sbjct: 376 INLSDVEL 383
>ref|NP_004269.1| (NM_004278) phosphatidylinositol glycan, class L [Homo sapiens]
 sp|Q9Y2B2|PIGL_HUMAN N-acetylglucosaminyl-phosphatidylinositol de-N-acetylase
           (Phosphatidylinositol-glycan biosynthesis, class L
           protein) (PIG-L)
 dbj|BAA74775.1| (AB017165) PIG-L [Homo sapiens]
          Length = 252

 Score =  176 bits (444), Expect = 3e-43
 Identities = 61/216 (28%), Positives = 101/216 (46%), Gaps = 28/216 (12%)

Query: 23  RATSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADGMGSIRDQELHRA 77
           R  + SR  ++     D+  FF+PT+         +++LCFS GN    G  R +EL ++
Sbjct: 36  RLGAESRTLLVIAHPDDEAMFFAPTVLGLARLRHWVYLLCFSAGNYYNQGETRKKELLQS 95

Query: 78  CAVLKVI-----PFDKEGICDNDSCHCNEEH----------------IITFDNYGVWGHC 116
           C VL +        D     D+     + EH                ++TFD  GV GH 
Sbjct: 96  CDVLGIPLSSVMIIDNRDFPDDPGMQWDTEHVARVLLQHIEVNGINLVVTFDAGGVSGHS 155

Query: 117 NHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIIN 176
           NH  ++  +       K   G   +   S+N+ RKY   +D+ LS+L  +     +   +
Sbjct: 156 NHIALYAAVRALHSEGKLPKGCSVLTLQSVNVLRKYISLLDLPLSLLHTQDVLFVLN--S 213

Query: 177 KQPWKSFKAMAQHLSQWVWFRKLFVLFSSYTYVNTL 212
           K+  ++ KAM+ H SQ +WFR+L+++FS Y  +N+L
Sbjct: 214 KEVAQAKKAMSCHRSQLLWFRRLYIIFSRYMRINSL 249
>sp|Q9HDW9|PIGL_SCHPO Probable N-acetylglucosaminyl-phosphatidylinositol de-N-acetylase
 emb|CAC21467.1| (AL512549) putative N-acetylglucosaminyl phosphatidylinositol
           deacetylase [Schizosaccharomyces pombe]
          Length = 248

 Score =  171 bits (430), Expect = 1e-41
 Identities = 63/207 (30%), Positives = 97/207 (46%), Gaps = 29/207 (14%)

Query: 34  DDGKFFSPTINYFTST-ACNLHILCFSTGNADGMGSIRDQELHRACAVLKV--------- 83
           D+  FF PTI+Y  +  +  +H+LC S GNADG+GS+R++EL  A +  ++         
Sbjct: 43  DESMFFGPTIDYLGNQHSTRVHVLCLSNGNADGLGSVREKELVVAASKYQIDKTNVHVVS 102

Query: 84  IPFDKEGI---CD-NDSCHCNEEHI--------ITFDNYGVWGHCNHRDVHPPIDCKIDS 131
            P  ++G+    D  D      + I        ITFDN G+ GH NH   +      + +
Sbjct: 103 DPQLQDGMQAKWDPTDVAKHISQIIERYNIKTLITFDNKGISGHPNHIACYEGAMKIVKA 162

Query: 132 AKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIIN---KQPWKSFKAM-A 187
             +          S+NIFRKY   +D   +++ ++   +  III+   K   +   AM  
Sbjct: 163 TPQ---VQVFVLESVNIFRKYISYLDTIPTLVQSQAGRNDTIIIHADRKSTQRIRDAMVR 219

Query: 188 QHLSQWVWFRKLFVLFSSYTYVNTLDR 214
            H SQ VWFR  ++  S Y   N L R
Sbjct: 220 GHKSQMVWFRYGWIYLSKYMSNNVLKR 246
>ref|NP_127219.1| (NC_000868) hypothetical protein [Pyrococcus abyssi]
 pir||C75001 hypothetical protein PAB1341 - Pyrococcus abyssi (strain Orsay)
 emb|CAB50449.1| (AJ248288) hypothetical protein [Pyrococcus abyssi]
          Length = 267

 Score =  158 bits (397), Expect = 9e-38
 Identities = 29/192 (15%), Positives = 50/192 (25%), Gaps = 34/192 (17%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNAD---------GMGSIRDQELHRACAVLKVI 84
           D       TI   T     +   C + G             + +IR +E   +  +L V 
Sbjct: 43  DCVIGMGGTIKKLTERGIEVIYACMTDGYMGTLDSSLTGHELATIRRREEEESSKLLGVK 102

Query: 85  PFDKEGICDNDS---CHCNEEHII---------TFDNY---GVWGHCNHRDVHPPIDCKI 129
                   D +        ++ +           F          H +HR+        +
Sbjct: 103 KIYWLNYRDTELPYSREVRKDLVRIIRKEKPDGVFLPDPWLPYEAHPDHRNTGFLALDAV 162

Query: 130 DSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
             +   +        S        GP  +    L     P+  + I        KA+  H
Sbjct: 163 AFSPLPN-------FSNVDVEIGLGPHQVSFIALYYTNKPNYFVDITDVMELKLKAIRTH 215

Query: 190 LSQW---VWFRK 198
            SQ+   VW   
Sbjct: 216 KSQFPDDVWEVW 227
>ref|NP_142471.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||F71162 hypothetical protein PH0499 - Pyrococcus horikoshii
 dbj|BAA29587.1| (AP000002) 272aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 272

 Score =  156 bits (393), Expect = 3e-37
 Identities = 27/192 (14%), Positives = 53/192 (27%), Gaps = 33/192 (17%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNAD---------GMGSIRDQELHRACAVLKVI 84
           D       TI   +     +  +C + G             + +IR +E   +  +L V 
Sbjct: 47  DCVIGMGGTIKKLSDMGVEVIYVCMTDGYMGTTDESLSGHELAAIRRKEEEESARLLGVK 106

Query: 85  PFDKEGICDNDSCHCN------------EEHIITFDNY---GVWGHCNHRDVHPPIDCKI 129
                   D +  +              E+    F          H +HR         +
Sbjct: 107 KIYWLNYRDTELPYSREVRKDLTKILRKEQPDGVFAPDPWLPYESHPDHRRTGFLAIESV 166

Query: 130 DSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
             ++  +       + LN +             L     P+ ++ I        KA+  H
Sbjct: 167 AFSQLPNFSNTDLDIGLNPYNSGSF------IALYYTHKPNYIVDITDLMELKLKAIRVH 220

Query: 190 LSQW---VWFRK 198
            SQ+   +W + 
Sbjct: 221 RSQFPDDIWEKW 232
>gb|AAL80478.1| (AE010159) hypothetical protein [Pyrococcus furiosus DSM 3638]
          Length = 267

 Score =  154 bits (387), Expect = 1e-36
 Identities = 26/192 (13%), Positives = 50/192 (25%), Gaps = 34/192 (17%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNAD---------GMGSIRDQELHRACAVLKVI 84
           D       TI   +     +  +C + G             +  IR +E   +  +L V 
Sbjct: 43  DCAIGMGGTIKKLSDEGVEVIYICMTDGYMGTTDEKLSGHELALIRRREEEESAKLLGVR 102

Query: 85  PFDKEGICDNDS---CHCNEEHI---------ITFDNY---GVWGHCNHRDVHPPIDCKI 129
                   D +        ++ +           F          H +HR         +
Sbjct: 103 KIYWLNYRDTELPYSREVRKDLVKIIRKEKPDGVFAPDPWLPYESHPDHRRTGFLAIESV 162

Query: 130 DSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
             ++  +        S         P  +    L     P+ ++ I        KA+  H
Sbjct: 163 AFSQLPN-------FSNIDIDIGLKPHSVSFIALYYTHKPNYIVDITDLMELKLKAIRAH 215

Query: 190 LSQW---VWFRK 198
            SQ+   +W   
Sbjct: 216 RSQFTDDIWETW 227
>ref|NP_244186.1| (NC_002570) BH3320~unknown conserved protein [Bacillus halodurans]
 dbj|BAB07039.1| (AP001518) BH3320~unknown conserved protein [Bacillus halodurans]
          Length = 227

 Score =  148 bits (372), Expect = 8e-35
 Identities = 35/203 (17%), Positives = 58/203 (28%), Gaps = 49/203 (24%)

Query: 27  ISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNAD------------GMGSI 69
                ++     D+    S TI  F      +   C + G                +  I
Sbjct: 4   ERHVLVIFPHPDDEAFGVSGTIALFRKQGVPVTYACLTLGEMGRNLGNPPFATRESLPDI 63

Query: 70  RDQELHRACAVLKVIPFDKEGICD----------------NDSCHCNEEHIITFDNYGVW 113
           R +EL ++   + +      G  D                +     N   IITF   G  
Sbjct: 64  RKKELIKSAEAMGIEDLRMLGYRDKTIEFEDETKLTDMVSDLMAELNPSLIITF-YPGYS 122

Query: 114 GHCNHRDVHPPIDCKIDS-AKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKV 172
            H +H      +   +    K +   LY    S N  ++  G  DI             +
Sbjct: 123 VHPDHEATGRAVVRAVRRLEKSMRPKLYGVAFS-NGHQEELGDPDI-------------L 168

Query: 173 IIINKQPWKSFKAMAQHLSQWVW 195
             I+    +   A+  H+SQ  W
Sbjct: 169 FDISPVAEQKKAAIRAHISQTAW 191
>ref|NP_014008.1| (NC_001145) N-acetylglucosaminylphosphatidylinositol
           de-N-acetylase; Gpi12p [Saccharomyces cerevisiae]
 sp|P23797|GP12_YEAST N-acetylglucosaminyl-phosphatidylinositol de-N-acetylase
 pir||S54588 probable membrane protein YMR281w - yeast (Saccharomyces
           cerevisiae)
 emb|CAA89779.1| (Z49704) unknown [Saccharomyces cerevisiae]
 dbj|BAA74776.1| (AB017166) GPI12 [Saccharomyces cerevisiae]
          Length = 304

 Score =  147 bits (370), Expect = 1e-34
 Identities = 65/235 (27%), Positives = 95/235 (39%), Gaps = 55/235 (23%)

Query: 34  DDGKFFSPTINYFTS---TACNLHILCFSTGNADGMGSIRDQELHRACAVL--------- 81
           D+  FFSP I+   S        +I+C S GNA+G+G  R +EL+ + A+L         
Sbjct: 66  DEVMFFSPIISQLNSYFPRTVPFNIICLSKGNAEGLGETRVRELNESAALLLHNERAVSV 125

Query: 82  KVIPFD--KEGICDNDSC------------HCNEEHIITFDNYGVWGHCNHRDVHPPIDC 127
           +V+ F    + I D DS             H   + I+TFD+YGV  H NH+  +  +  
Sbjct: 126 QVMDFQDGMDEIWDIDSITSSLSQKIDIKNHNLNQIIVTFDSYGVSNHINHKSCYAAVKK 185

Query: 128 KIDSAKRIHGF----------LYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVII--- 174
            +D   +              LY+     NI  KY   +   L IL     P + II   
Sbjct: 186 LVDDYAQPKTKRNEQPPHVTALYLRSYKNNIVLKYNSFIWEILKILYDLISPFRRIIQAL 245

Query: 175 ---------------INKQPWKSFKAM-AQHLSQWVWFRKLFVLFSSYTYVNTLD 213
                           + Q   +F  M   H SQ VWFR  + +FS + +VN  D
Sbjct: 246 PPNTAAEKDKLSLMNTHAQYVLAFATMLNAHESQVVWFRYGWWIFSRFVFVNEFD 300
>ref|NP_294173.1| (NC_001263) conserved hypothetical protein [Deinococcus
           radiodurans]
 pir||F75517 conserved hypothetical protein - Deinococcus radiodurans  (strain
           R1)
 gb|AAF10027.1|AE001904_3 (AE001904) conserved hypothetical protein [Deinococcus radiodurans]
          Length = 281

 Score =  146 bits (367), Expect = 3e-34
 Identities = 35/223 (15%), Positives = 67/223 (29%), Gaps = 53/223 (23%)

Query: 24  ATSISRATIL-------DDGKFFSPTINYFTSTACNLHILCFSTGNAD----------GM 66
           +T   RAT+L       D+      T+ ++      + + C + G A            +
Sbjct: 5   STPAPRATLLVIFAHPDDEAFSVGGTLTHYARQGVRVVLACATRGEAGKITVPGMTVDDL 64

Query: 67  GSIRDQELHRACAVLKVIPFDKEGICDND----------------------------SCH 98
           G+ R+QEL  AC  L++ P       D+                                
Sbjct: 65  GAQREQELREACRALEIEPPVFLDYHDSGRYERTRHDDPTALMNVNPLDAEVKLRALIED 124

Query: 99  CNEEHIITFDNYGVWGHCNHRDVHPPIDCK-IDSAKRIHGFLYVHQVSLNIFR-----KY 152
              + I+TFD +G +GH +H  +H         +     G       +    +       
Sbjct: 125 VQPQVIVTFDPHGAYGHVDHLQMHRATVAAFFSTGHLPSGGPQRLYYTAMTHQAAAQISR 184

Query: 153 CGPVDIWLSILS--AKRHPSKVIIINKQPWKSFKAMAQHLSQW 193
            G       ++   +    +  + +         A+A H +Q 
Sbjct: 185 LGHDQSLDPLVYGVSDSTLAVTMDVGAYAENKKAALAAHGTQM 227
>ref|NP_385870.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
 emb|CAC46343.1| (AL591788) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
          Length = 244

 Score =  138 bits (346), Expect = 8e-32
 Identities = 26/219 (11%), Positives = 46/219 (20%), Gaps = 53/219 (24%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADGMGSI---------- 69
                  ++     D        I         + ++C S G       +          
Sbjct: 1   MHQKTGLVVSAHSADFVWRAGGAIAAHARQGYAVTVVCLSFGERGESAKLWKKSGMTLET 60

Query: 70  ----RDQELHRACAVLKVIPFDKEGICD--------------NDSCHCNEEHIITF---D 108
               R +E   A   L V       + D              +       E ++T    D
Sbjct: 61  VKADRRREAENAAKALGVHDILFYDLGDYPIQVTPEAFDRLVDLYREIRPEFMLTHSRQD 120

Query: 109 NYGVWGHC--NHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAK 166
            Y    H        H  +  +    K     L            Y              
Sbjct: 121 PYNFD-HPMATEFAQHARVIAQAHGHKPSTPVL-----GAPPV--YLFEPHQPEQC---N 169

Query: 167 RHPSKVIIINKQPWKSFKAMAQHLSQWVWFRKLFVLFSS 205
             P+ ++ I     K   A+     Q      L+  ++ 
Sbjct: 170 WKPNFLLDITDVWEKKLAAIKCMEGQ----EHLWEYYTR 204
>ref|NP_215598.1| (NC_000962) hypothetical protein Rv1082 [Mycobacterium tuberculosis
           H37Rv]
 ref|NP_335555.1| (NC_002755) lmbE protein [Mycobacterium tuberculosis CDC1551]
 pir||H70894 hypothetical protein Rv1082 - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAA17198.1| (AL021897) hypothetical protein Rv1082 [Mycobacterium tuberculosis
           H37Rv]
 gb|AAK45369.1| (AE006992) lmbE protein [Mycobacterium tuberculosis CDC1551]
          Length = 288

 Score =  138 bits (345), Expect = 1e-31
 Identities = 29/264 (10%), Positives = 61/264 (22%), Gaps = 91/264 (34%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------------- 65
            S  R   +     D+    + T+  +      + ++  + G                  
Sbjct: 1   MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGERGEILNPAMDLPDVHGR 60

Query: 66  MGSIRDQELHRACAVLKVIPFDKEGICDNDS----------------------------- 96
           +  IR  E+ +A  +L V      G  D+                               
Sbjct: 61  IAEIRRDEMTKAAEILGVEHT-WLGFVDSGLPKGDLPPPLPDDCFARVPLEVSTEALVRV 119

Query: 97  -CHCNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDS---------AKRIHGFLYVHQV-- 144
                   + T+D  G + H +H   H       ++         A        ++ V  
Sbjct: 120 VREFRPHVMTTYDENGGYPHPDHIRCHQVSVAAYEAAGDFCRFPDAGEPWTVSKLYYVHG 179

Query: 145 -----------------SLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMA 187
                                F ++    D            +  +  +K   +   A+ 
Sbjct: 180 FLRERMQMLQDEFARHGQRGPFEQWLAYWD--PDHDFLTSRVTTRVECSKYFSQRDDALR 237

Query: 188 QHLSQ-----------WVWFRKLF 200
            H +Q             W  +L+
Sbjct: 238 AHATQIDPNAEFFAAPLAWQERLW 261
>ref|NP_371091.1| (NC_002758) conserved hypothetical protein [Staphylococcus aureus
           subsp. aureus Mu50]
 ref|NP_373778.1| (NC_002745) conserved hypothetical protein [Staphylococcus aureus
           subsp. aureus N315]
 dbj|BAB41756.1| (AP003131) conserved hypothetical protein [Staphylococcus aureus
           subsp. aureus N315]
 dbj|BAB56729.1| (AP003359) conserved hypothetical protein [Staphylococcus aureus
           subsp. aureus Mu50]
          Length = 221

 Score =  137 bits (344), Expect = 1e-31
 Identities = 37/206 (17%), Positives = 59/206 (27%), Gaps = 49/206 (23%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNAD------------GMG 67
           T      ++     D+    + T+  +      +   C + G                + 
Sbjct: 2   TDERHVLVIFPHPDDETFSSAGTLASYIQKGIPVTYACLTLGQMGRNLGNPPFATRESLP 61

Query: 68  SIRDQELHRACAVLKVIPFDKEGICD-----NDSCHCN-----------EEHIITFDNYG 111
           SIR++EL  AC V+ +    K G+ D         H +              II+F   G
Sbjct: 62  SIRERELEEACKVIGITDLRKMGLRDKTVEFEPYEHIDGMIKSLIDDTNPSLIISF-YPG 120

Query: 112 VWGHCNHRDVHPPIDCKIDS-AKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPS 170
              H +H      +   ++   K     L +   S N   +  G  DI            
Sbjct: 121 YAVHPDHEATADAVIRTVERMPKEERPRLTLVAFS-NDATEALGEPDIQN---------- 169

Query: 171 KVIIINKQPWKSFKAMAQHLSQWVWF 196
               I        KA   H SQ   F
Sbjct: 170 ---DITDFKELKIKAFEAHASQTGPF 192
>emb|CAC18708.2| (AL451182) conserved hypothetical protein [Streptomyces coelicolor]
          Length = 293

 Score =  137 bits (343), Expect = 2e-31
 Identities = 32/254 (12%), Positives = 62/254 (23%), Gaps = 82/254 (32%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------------- 65
           T   R   +     D+    + T+  + S   ++ ++  + G                  
Sbjct: 2   TDQLRLMAVHAHPDDESSKGAATMAKYVSEGVDVLVVTCTGGERGSILNPKLQGDAYIEE 61

Query: 66  -MGSIRDQELHRACAVLKVIPFDKEGICDNDSCH-------------------------- 98
            +  +R +E+  A  +L V   +  G  D+                              
Sbjct: 62  NIHEVRRKEMDEAREILGVG-QEWLGFVDSGLPEGDPLPPLPEGCFALEDVDKAAGELVR 120

Query: 99  ----CNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDS--------------AKRIHGFLY 140
                  + I T+D  G + H +H   H       +               A +     Y
Sbjct: 121 KIRSFRPQVITTYDENGGYPHPDHIMTHKITMVAFEGAADTEKYPESEYGTAYQPLKVYY 180

Query: 141 VHQVSLNIFR-------------KYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMA 187
               +                   Y   +  W      +R  +  +          KA+ 
Sbjct: 181 NQGFNRPRTEALHHALLDRGLESPYEDWLKRWSEFERKERTLTTHVPCADFFEIRDKALI 240

Query: 188 QHLSQW----VWFR 197
            H +Q      WFR
Sbjct: 241 AHATQIDPEGGWFR 254
>emb|CAA77139.1| (Y18353) hypothetical protein [Thermus thermophilus]
          Length = 227

 Score =  135 bits (339), Expect = 6e-31
 Identities = 27/196 (13%), Positives = 51/196 (25%), Gaps = 36/196 (18%)

Query: 12  IWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SI 69
           + V +                 D       T+    +   +  IL  + G     G    
Sbjct: 4   LLVVAPHPD-------------DGELGCGGTLARAKAEGLSTGILDLTRGEMGSKGTPEE 50

Query: 70  RDQELHRACAVLKVIPFDKEGICDNDSCHCNEEHI------------ITFDNYGVWGHCN 117
           R++E+  A  +L +      G  D       E+ +            + F       H +
Sbjct: 51  REKEVAEASRILGLDFRGNLGFPDGGLADVPEQRLKLAQALRRLRPRVVFAPLEADRHPD 110

Query: 118 HRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINK 177
           H          +  A      L      +     Y G              PS ++ I+ 
Sbjct: 111 HTAASRLAVAAVHLAGLRKAPLEGEPFRVERLFFYPGNHP---------FAPSFLVKISA 161

Query: 178 QPWKSFKAMAQHLSQW 193
              +   A+  + SQ+
Sbjct: 162 FIDQWEAAVLAYRSQF 177
>ref|NP_293807.1| (NC_001263) conserved hypothetical protein [Deinococcus
           radiodurans]
 pir||G75562 conserved hypothetical protein - Deinococcus radiodurans  (strain
           R1)
 gb|AAF09674.1|AE001871_6 (AE001871) conserved hypothetical protein [Deinococcus radiodurans]
          Length = 237

 Score =  135 bits (337), Expect = 1e-30
 Identities = 25/191 (13%), Positives = 53/191 (27%), Gaps = 41/191 (21%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNAD----------GMGSIRDQELHRACAVLKV 83
           D+    S T+  + +   +  ++  + G A            +  +R  EL     V+ +
Sbjct: 24  DEVYGASGTLMEYLAAGESCGLVTLTRGEAGRTLGLCDGPEELARMRAVELAACLEVIGL 83

Query: 84  -------------IPFDKEGICDNDS--------CHCNEEHIITFDNYGVWGHCNHRDVH 122
                             +     +              E ++TF   G  GH +H   H
Sbjct: 84  TTTPGSLHEQHQFPDKYLKDYPFEELVETAREAMERLRPETVLTFPPNGSNGHPDHMTTH 143

Query: 123 PPIDCKIDSAK-RIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWK 181
             +    D         L+ +        +      +          P+    ++    +
Sbjct: 144 RAVKAAWDRLPAGSRPVLWYYASETPPENEELRAAWLP---------PTVKRDVSALVTR 194

Query: 182 SFKAMAQHLSQ 192
             +A+A H SQ
Sbjct: 195 KLQAIACHRSQ 205
>ref|NP_242548.1| (NC_002570) BH1682~unknown conserved protein [Bacillus halodurans]
 dbj|BAB05401.1| (AP001512) BH1682~unknown conserved protein [Bacillus halodurans]
          Length = 231

 Score =  131 bits (327), Expect = 1e-29
 Identities = 21/183 (11%), Positives = 45/183 (24%), Gaps = 40/183 (21%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SIRDQELHRACAVLKVIPFDKEGI 91
           D       T+ ++      + I   +       G    R +E   A  +L +    +  +
Sbjct: 15  DVEIGMGATLYHYRQKGHRVGICNLTKAELSSNGTVEQRQKEAADASRILGIDERIQLDL 74

Query: 92  CDNDSC-----HCNE--------EHIITFDNYGVWGHCNHRDVHPPIDCK--------ID 130
            D                     +    F  Y V  H +H      +             
Sbjct: 75  PDRGLRNPSEQQVRNIVSVIRHCQPTFVFVPYPVDRHPDHGHCAELVKEAVFNARIRNYK 134

Query: 131 SAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQHL 190
           +    H    +    +N F +                 P  ++ ++        A+  + 
Sbjct: 135 AEGGAHHVQDLFYYMINSFER-----------------PDLLVDVSHCYEVKQAALNAYK 177

Query: 191 SQW 193
           SQ+
Sbjct: 178 SQF 180
>ref|NP_302547.1| (NC_002677) conserved hypothetical protein [Mycobacterium leprae]
 gb|AAA63037.1| (U15183) lmbE gene product [Mycobacterium leprae]
 emb|CAC31907.1| (AL583925) conserved hypothetical protein [Mycobacterium leprae]
          Length = 290

 Score =  129 bits (324), Expect = 3e-29
 Identities = 32/264 (12%), Positives = 68/264 (25%), Gaps = 91/264 (34%)

Query: 25  TSISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------------- 65
            S  R   +     D+    + T+  +      + ++  + G                  
Sbjct: 1   MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGERGEILNPAMDLPDVHGH 60

Query: 66  MGSIRDQELHRACAVLKVIPFDKEGICDND------------------------------ 95
           +  IR  E+ +A  +L V      G  D+                               
Sbjct: 61  IAEIRRDEMAKAAEILGVEHT-WLGFIDSGLPKGDPPPPLPDDCFALVPLEVCTEALVRV 119

Query: 96  SCHCNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDS---------AKRIHGFLYVHQVSL 146
                   + T+D  G + H +H   H       ++         A +      ++    
Sbjct: 120 VRKFRPHVLTTYDENGGYPHPDHIRCHQVSVDAYEAACDYRRFPDAGKPWTVSKLYY--N 177

Query: 147 NIFRKY--------------CGPVDIWLSILSAKRHP-----SKVIIINKQPWKSFKAMA 187
           + F +                GP D WL+  +    P     +  +  +    +   A+ 
Sbjct: 178 HGFLRARMQLLHDEFAKHGQAGPFDKWLAQSNPAHDPFESRVTTRVECSAYFSQRDDALR 237

Query: 188 QHLSQ-----------WVWFRKLF 200
            H +Q             W ++L+
Sbjct: 238 AHATQIDPKAEFFAAPISWQQRLW 261
>ref|NP_437291.1| (NC_003078) conserved hypothetical protein, possibly
           membrane-associated [Sinorhizobium meliloti]
 emb|CAC49151.1| (AL603644) conserved hypothetical protein, possibly
           membrane-associated [Sinorhizobium meliloti]
          Length = 228

 Score =  129 bits (322), Expect = 6e-29
 Identities = 24/199 (12%), Positives = 44/199 (22%), Gaps = 38/199 (19%)

Query: 14  VASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGN---ADGMGSIR 70
           V +                 D+      TI    +    + +   + G     D   + R
Sbjct: 13  VVAPHPD-------------DEVLGAGGTIARLAAEGEEVFVAVVTEGKPPAFDPEATAR 59

Query: 71  DQ-ELHRACAVLKVIPFDKEGICDNDSCH--------------CNEEHIITFDNYGVWGH 115
            Q E  +A   L V       +                                +    H
Sbjct: 60  IQAEARQAHRALGVTETIWLRLPAAQLAETAHATVNAALLELVHRLSPQTVLLPFVGDMH 119

Query: 116 CNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIII 175
            +H+             +     L +   +           +     LS    P+  + I
Sbjct: 120 MDHQLTFTSALVACRPHQAEFPKLVLAYET-------LSETNWNAPYLSPAFVPNVFVDI 172

Query: 176 NKQPWKSFKAMAQHLSQWV 194
           ++      KAM    SQ  
Sbjct: 173 SEHLEAKLKAMELFASQVR 191
>ref|NP_390128.1| (NC_000964) alternate gene name: jojG~similar to hypothetical
           proteins [Bacillus subtilis]
 sp|P42981|YPJG_BACSU Hypothetical 24.8 kDa protein in DAPB-PAPS intergenic region
 pir||F69937 conserved hypothetical protein ypjG - Bacillus subtilis
 gb|AAA92876.1| (L38424) unknown [Bacillus subtilis]
 gb|AAB38444.1| (L47709) putative [Bacillus subtilis]
 emb|CAB14163.1| (Z99115) alternate gene name: jojG~similar to hypothetical proteins
           [Bacillus subtilis]
          Length = 224

 Score =  127 bits (317), Expect = 2e-28
 Identities = 29/186 (15%), Positives = 54/186 (28%), Gaps = 41/186 (22%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SIRDQELHRACAVLKVIPFDKEGI 91
           D       TI  F      + I   +       G  S+R +E   A  +L      +  +
Sbjct: 15  DVEIGMGGTIAKFVKQEKKVMICDLTEAELSSNGTVSLRKEEAAEAARILGADKRIQLTL 74

Query: 92  CDNDSCHCN--EEHIIT----------FDNYGVWGHCNHRDVHPPIDCKI---------- 129
            D      +     I+T          F  Y    H +H +    ++  I          
Sbjct: 75  PDRGLIMSDQAIRSIVTVIRICRPKAVFMPYKKDRHPDHGNAAALVEEAIFSAGIHKYKD 134

Query: 130 DSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
           + +   H    V+   +N F +                 P  VI I+       +++  +
Sbjct: 135 EKSLPAHKVSKVYYYMINGFHQ-----------------PDFVIDISDTIEAKKQSLNAY 177

Query: 190 LSQWVW 195
            SQ++ 
Sbjct: 178 KSQFIP 183
>ref|NP_403747.1| (NC_003143) hypothetical protein [Yersinia pestis]
 emb|CAC88949.1| (AJ414141) hypothetical protein [Yersinia pestis]
          Length = 310

 Score =  125 bits (313), Expect = 6e-28
 Identities = 30/217 (13%), Positives = 52/217 (23%), Gaps = 52/217 (23%)

Query: 26  SISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADGMGSIRDQ-------- 72
               A ++     D        I         +HI+C S G       +  +        
Sbjct: 70  PQKTALVVSAHSADFVWRAGGAIALHVEQGYQVHIVCLSYGERGESAKLWRKGDMTEERV 129

Query: 73  ------ELHRACAVLK--VIPFDKEGIC---DNDS--------CHCNEEHIITF---DNY 110
                 E   A  VL   +  FD        D +S               ++T    D Y
Sbjct: 130 KASRHTEAQAAANVLGASIEFFDMGDYPLRADKESLFRLADVFRRIQPHFVLTHSLADPY 189

Query: 111 GVWGHC--NHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRH 168
               H    +      I  + +   R    +    +       YC               
Sbjct: 190 NYD-HPLAANLAQEARIIAQAE-GYRPGEAI----IGAPPV--YCFEPHQPEQCG---WK 238

Query: 169 PSKVIIINKQPWKSFKAMAQHLSQWVWFRKLFVLFSS 205
           P  ++ I     K + A+     Q      L+  ++ 
Sbjct: 239 PDVLLDITSVWEKKYAAIQCMAGQ----EHLWEYYTR 271
>ref|NP_344220.1| (NC_002754) Conserved hypothetical protein [Sulfolobus
           solfataricus]
 gb|AAK43010.1| (AE006882) Conserved hypothetical protein [Sulfolobus solfataricus]
          Length = 193

 Score =  124 bits (311), Expect = 1e-27
 Identities = 28/190 (14%), Positives = 52/190 (26%), Gaps = 47/190 (24%)

Query: 28  SRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNAD----------GMGSIRDQ 72
            R  I+     D+      TI  F      + ++  + G              +  IR Q
Sbjct: 8   RRVLIVAPHPDDETLCCGGTIQIFKEKGYKISVIIVTDGRYGSPDDKLKGSSELIEIRRQ 67

Query: 73  ELHRACAVLKVIPFDKEGICDN---------DSCHCNEEHIITFDNYGVWGHCNHRDVHP 123
           E  RA  +L +         D+                E+ + F       H +H ++  
Sbjct: 68  EALRATKILGIDEVKFLNFEDSKVSEEDAENALAEFLRENDVVFSPIPFDNHPDHANIGK 127

Query: 124 PIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSF 183
            ++    +A     +L      +N                           I K      
Sbjct: 128 AVEKLYPNAY---FYLIWGNTQVNW--------------------REVKFDIRKYKESKL 164

Query: 184 KAMAQHLSQW 193
           +A+ Q++SQ 
Sbjct: 165 RAINQYISQI 174
>emb|CAC16965.1| (AL450350) conserved hypothetical protein [Streptomyces coelicolor]
          Length = 277

 Score =  124 bits (311), Expect = 1e-27
 Identities = 25/241 (10%), Positives = 57/241 (23%), Gaps = 71/241 (29%)

Query: 10  VVIWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNAD----- 64
           + +                     D+       +  + +      ++  + G        
Sbjct: 6   LTLMAVHAHPD-------------DEATSTGGVLARYAAEGIRTVLVTCTDGGCGDGPGG 52

Query: 65  -----------GMGSIRDQELHRACAVLKVIPFDKEGICDND------------------ 95
                       +  +R +EL  +  +LK+   +     D+                   
Sbjct: 53  VKPGDPGHDPAAVALMRRRELEESRDILKISDLETLDYADSGMMGWPSNDAPGSFWRTPV 112

Query: 96  ----------SCHCNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVS 145
                       H   + ++T+D  G +GH +H   H      ++         +     
Sbjct: 113 EEGAARLAELMRHYRPDVVVTYDENGFYGHPDHIQAHRITMAALEMTTLTPKVYWTTAPR 172

Query: 146 LNIFRKY----CGPVDIWLS----------ILSAKRHPSKVIIINKQPWKSFKAMAQHLS 191
             + R          D+             I       +  +       + F A+A H S
Sbjct: 173 SMMQRFGEIMREFHPDMPEPDPAEAAAMAEIGLPDEEITTWVDTTSFSGQKFDALAAHAS 232

Query: 192 Q 192
           Q
Sbjct: 233 Q 233
>ref|NP_389828.1| (NC_000964) Uncharacterized conserved protein [Bacillus subtilis]
          Length = 221

 Score =  122 bits (306), Expect = 4e-27
 Identities = 30/209 (14%), Positives = 49/209 (23%), Gaps = 57/209 (27%)

Query: 14  VASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNAD--------- 64
           V                   D+    +  I         +   C + G            
Sbjct: 7   VILPHPD-------------DESYGVAGLIALNRKKDIPVTYACATLGEMGRNMGDPFFA 53

Query: 65  ---GMGSIRDQELHRACAVLKVIPFDKEGIC--------DNDSCHCNEEH--------II 105
               +  +R QEL  AC  + +      G+         D       EE         I+
Sbjct: 54  NRETLPLLRKQELINACKEMDINDLRMLGLRDKTLEFEDDEYLADIMEEIIDDVKPSLIV 113

Query: 106 TFDNYGVWGHCNHRDVHPPIDCK-IDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILS 164
           TF   G   H +H      +        K          ++ N   +  G  D       
Sbjct: 114 TF-YPGHGVHPDHDACGEAVIRALYRKKKEDRPRTICMAITRNR-EEVLGEAD------- 164

Query: 165 AKRHPSKVIIINKQPWKSFKAMAQHLSQW 193
                  V+ I +       A+  H +Q 
Sbjct: 165 ------VVLDIKEVADIKMNALRAHRTQT 187
>emb|CAB66204.1| (AL136502) hypothetical protein SCF43.15c. [Streptomyces coelicolor
           A3(2)]
          Length = 247

 Score =  121 bits (303), Expect = 9e-27
 Identities = 26/203 (12%), Positives = 54/203 (25%), Gaps = 39/203 (19%)

Query: 22  FRATSI--SRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNAD-------GMG 67
                    RA  +     D     +  I  +T     +  +  + G A           
Sbjct: 9   LEPMPGDWRRALAVVAHPDDLEYGCAAAIAAWTDEGREVAYVLATRGEAGIDTLAPAECA 68

Query: 68  SIRDQELHRACAVLKVIPFDKEGICDN--------------DSCHCNEEHIITF---DNY 110
            +R++E   + AV+ V   +     D                      E +IT    D +
Sbjct: 69  PLREREQRASAAVVGVSEVEFLDHRDGVVEYGTALRRDIAAAIRRHRPELVITMNHRDTW 128

Query: 111 GV--WGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRH 168
           G   W   +H  V          A     F  +    L           +    ++    
Sbjct: 129 GGVAWNTPDHVAVGRATLDAAADAGNRWIFPELTDRGLEP------WNGVRWVAVAGSSS 182

Query: 169 PSKVIIINKQPWKSFKAMAQHLS 191
           P+  +       ++ +++ +H +
Sbjct: 183 PTHAVDATPGMERAVRSLLEHRT 205
>ref|NP_296086.1| (NC_001263) Uncharacterized conserved protein [Deinococcus
           radiodurans]
          Length = 239

 Score =  119 bits (298), Expect = 3e-26
 Identities = 26/210 (12%), Positives = 49/210 (22%), Gaps = 59/210 (28%)

Query: 16  SFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SIRDQE 73
           +                 D       T+         + IL  + G     G  + R  E
Sbjct: 21  APHPD-------------DAEIGAGGTLIRLAQAGRAVGILELTRGEKGTQGTPAERQAE 67

Query: 74  LHRACAVLKVIPFDKEGICDNDSCH--------------CNEEHIITFDNYGVWGHCNH- 118
              A  ++ +    + G+ D +                      ++    +    H +H 
Sbjct: 68  CVAAARLMDLSWRGQLGLPDGELADTPPFAHALAAALRTVRPRVLVV--PHWHDRHPDHF 125

Query: 119 -------RDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSK 171
                  R +H     K D          V     N                 +    + 
Sbjct: 126 GTYHLTKRAIHLAALKKADLGGDPWRVQRVLLYQGN-----------------SDISANV 168

Query: 172 VIIINKQPWKSFKAMAQHLSQWVWFRKLFV 201
           ++ I     +   A+  H SQ   F   +V
Sbjct: 169 LVDIGSVMTEWEAAIRAHTSQ---FAGGYV 195
>ref|NP_302050.1| (NC_002677) conserved hypothetical protein [Mycobacterium leprae]
 emb|CAC30445.1| (AL583922) conserved hypothetical protein [Mycobacterium leprae]
          Length = 308

 Score =  119 bits (296), Expect = 5e-26
 Identities = 37/248 (14%), Positives = 60/248 (23%), Gaps = 82/248 (33%)

Query: 29  RATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGN----------------ADGMG 67
           R   +     D+      TI ++TS    + ++  + G                 AD +G
Sbjct: 6   RLLFVHAHPDDESLSNGATIAHYTSRGAQVQVVTCTLGEEGEVIGDRWAELTVDHADQLG 65

Query: 68  SIRDQELHRACAVLKVIPFDKEG----ICDNDS--------------------------- 96
             R  EL  A   L V      G      D+                             
Sbjct: 66  GYRIFELTEALRALGVSAPIYLGGAGRWRDSGMRGTAPRRRQRFIDADENEAVGALVAII 125

Query: 97  CHCNEEHIITFDNYGVWGHCNHRDVHPPID----------------CKIDSAKRIHGFLY 140
                  ++T+D +G +GH +H   H                     +           Y
Sbjct: 126 RELRPHVVVTYDPHGGYGHPDHVHTHFITAAAVASSGVAAGLEVGADEYPGKPWKVPKFY 185

Query: 141 VHQVSLNIFR--------KYCGPVDIWLS------ILSAKRHPSKVIIINKQPWKSFKAM 186
               +L+ F         K   P              S K   + V   +        A+
Sbjct: 186 WSVFALSAFEAGMNALQGKDLRPEWTIPPREEFYFGYSDKDIDAVVEATSDVWAAKTAAL 245

Query: 187 AQHLSQWV 194
             H +Q V
Sbjct: 246 TAHATQVV 253
>gb|AAC14880.1| (AF060080) hypothetical protein [Chlorobium tepidum]
          Length = 240

 Score =  119 bits (296), Expect = 6e-26
 Identities = 24/192 (12%), Positives = 49/192 (25%), Gaps = 54/192 (28%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SIRDQELHRACAVLKVIPFDKEGI 91
           D       T+         + +   + G    +G    R QE   A   +  +  ++  +
Sbjct: 21  DVELACGATLLKIMDEGKPVAVCDLTAGEMGTLGTAETRRQEAALATERMGYVAREQLDL 80

Query: 92  CDNDS--------------CHCNEEHIITFDNYGVWGHCNHR--------DVHPPIDCKI 129
            D++                    +    F N     H +H           +     KI
Sbjct: 81  GDSELFYTKESLHKIIRIIRKYRPD--TVFCNPPDERHPDHMKASRLIYEACYYAGLRKI 138

Query: 130 D--------SAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWK 181
           +        +A R    LY  Q                      +  P  V+ ++    +
Sbjct: 139 ETFDGGLPQAAHRPRHLLYYIQFK--------------------QLEPQIVVDVSSTFER 178

Query: 182 SFKAMAQHLSQW 193
           S   +    +Q+
Sbjct: 179 SRAGIEAFGTQF 190
>ref|NP_437176.1| (NC_003078) conserved hypothetical protein [Sinorhizobium meliloti]
 emb|CAC49036.1| (AL603644) conserved hypothetical protein [Sinorhizobium meliloti]
          Length = 292

 Score =  119 bits (296), Expect = 6e-26
 Identities = 22/216 (10%), Positives = 49/216 (22%), Gaps = 53/216 (24%)

Query: 16  SFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNA--------DGMG 67
           +                 D+            ++   +  +  + G A        + + 
Sbjct: 54  APHPD-------------DETLGCGGVSAKKLASGVEVRFVFVTDGAASHRRLISPEELR 100

Query: 68  SIRDQELHRACAVLKV--IPFDKEGICDND---------------SCHCNEEHIITFDNY 110
           S R+ E   A   L             D +                     + +  F  +
Sbjct: 101 SRRESEALEAVHRLGASSESVTFLRFPDAEASHHIHAITKAIVPLLERWRPQSV--FVTH 158

Query: 111 GVWGHCNHRDVHPPIDCKIDSAKRIHGFL---YVHQVSLNIFRKYCGPVDIWLSILSAKR 167
                 +H  V+  +   +    R          +       R       +W + L    
Sbjct: 159 AKDPPSDHIAVNAAVRAALRWHGRPLTVFEYPVWYWYHWPWVRPAGDLPGMWRTTLRQTV 218

Query: 168 H----------PSKVIIINKQPWKSFKAMAQHLSQW 193
                       + ++ I +       A+A H+SQ 
Sbjct: 219 KTVAGLRALSALNTLVPIGEFLDVKRHALAAHVSQT 254
>gb|AAG12428.1| (AY005138) unknown [Chlorobium tepidum]
          Length = 250

 Score =  118 bits (295), Expect = 7e-26
 Identities = 24/192 (12%), Positives = 49/192 (25%), Gaps = 54/192 (28%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGMG--SIRDQELHRACAVLKVIPFDKEGI 91
           D       T+         + +   + G    +G    R QE   A   +  +  ++  +
Sbjct: 21  DVELACGATLLKIMDEGKPVAVCDLTAGEMGTLGTAETRRQEAALATERMGYVAREQLDL 80

Query: 92  CDNDS--------------CHCNEEHIITFDNYGVWGHCNHR--------DVHPPIDCKI 129
            D++                    +    F N     H +H           +     KI
Sbjct: 81  GDSELFYTKESLHKIIRIIRKYRPD--TVFCNPPDERHPDHMKASRLIYEACYYAGLRKI 138

Query: 130 D--------SAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWK 181
           +        +A R    LY  Q                      +  P  V+ ++    +
Sbjct: 139 ETFDGGLPQAAHRPRHLLYYIQFK--------------------QLEPQIVVDVSSTFER 178

Query: 182 SFKAMAQHLSQW 193
           S   +    +Q+
Sbjct: 179 SRAGIEAFGTQF 190
>gb|AAC01723.1| (AF040570) negative regulatorly protein [Amycolatopsis
           mediterranei]
          Length = 255

 Score =  118 bits (294), Expect = 1e-25
 Identities = 29/210 (13%), Positives = 52/210 (23%), Gaps = 43/210 (20%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNAD-----------GMGSIRDQELHRACAVLK 82
           DD       +         + ++  + G               +G  R  E   A  VL 
Sbjct: 13  DDTTTCGGVLRKAHEDGHRVVLVLATRGELGYNPDGLLAEGETLGDRRAVEARAAADVLG 72

Query: 83  VIPFDKEGICDND--------------------------SCHCNEEHIITFDNYGVWGHC 116
           V   +  G  D+                                 + +  +D  G +G  
Sbjct: 73  VDRLEFLGYTDSGMTAAADGAGTFQTADVEEAARKLAAILREERADVLTVYDEKGTYGDP 132

Query: 117 NHRDVHPPIDCKIDSAKRIHGFLYVHQV----SLNIFRKYCGPVDIW--LSILSAKRHPS 170
           +H  VH       + A     F          +          VD+       + +   +
Sbjct: 133 DHIQVHRVGTRAAELAGTAKVFQSTINREHIKANQRVLAEQAGVDLPAGPDFGTPEAELT 192

Query: 171 KVIIINKQPWKSFKAMAQHLSQWVWFRKLF 200
             + ++       KA+  H SQ      LF
Sbjct: 193 CRVDVSAYTEYKRKALLAHASQITPQSTLF 222
>emb|CAC05756.1| (AL391751) hypothetical protein [Streptomyces coelicolor A3(2)]
          Length = 295

 Score =  116 bits (290), Expect = 3e-25
 Identities = 24/176 (13%), Positives = 46/176 (25%), Gaps = 51/176 (28%)

Query: 26  SISRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------------MG 67
              R  ++     D+      T+  + +   ++ ++  + G                 +G
Sbjct: 5   PGRRLLLVHAHPDDESINNGVTMARYAAEGAHVTLVTCTLGERGEVIPPALAHLSGAALG 64

Query: 68  SIRDQELHRACAVLKVIPFDKEG----ICDNDS--------------------------- 96
             R  EL  A   L V  F   G      D+                             
Sbjct: 65  GHRRGELADAMRALGVDDFRLLGGPGRYADSGMLGLSDNDDPGCLWQADVDAAAALLVDV 124

Query: 97  -CHCNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDSAKRIH-GFLYVHQVSLNIFR 150
                 + ++T+D  G +GH +H   H       + A         V+   +   R
Sbjct: 125 IREVRPQVLVTYDPNGGYGHPDHIQAHRIAMRAAELAAEAGCPVAKVYWNRVPRSR 180
>sp|P71311|YAIS_ECOLI HYPOTHETICAL 20.5 KDA PROTEIN IN ADHC-TAUA INTERGENIC REGION
 gb|AAB18087.1| (U73857) hypothetical protein [Escherichia coli]
          Length = 185

 Score =  113 bits (282), Expect = 3e-24
 Identities = 20/180 (11%), Positives = 40/180 (22%), Gaps = 44/180 (24%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGMGSI-RDQELHRACAVLKVIPFDKEGIC 92
           D       ++         +  +  +TGN+   G I R +E   A  +L           
Sbjct: 28  DIELGCGASLARLAQKGIYIAAVVMTTGNSGTDGIIDRHEESRNALKILGCHQTIHLNFA 87

Query: 93  D-----------NDSCHC-------NEEHIITFDNYGVWGHCNHRDVHPPIDCKIDSAKR 134
           D           +            + E +  +  +    H +H  V+        +  +
Sbjct: 88  DTRAHLQLNDMISALEDIIKNQIPSDVEIMRVYTMHDADRHQDHLAVYQASMVACRTIPQ 147

Query: 135 I---HGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQHLS 191
           I               +F                                   A+ +H S
Sbjct: 148 ILGYETPSTWLSFMPQVFESVKEE----------------------YFTVKLAALKKHKS 185
>emb|CAB67717.1| (AJ271405) hypothetical protein [Streptomyces rochei]
          Length = 234

 Score =  110 bits (275), Expect = 2e-23
 Identities = 22/209 (10%), Positives = 43/209 (20%), Gaps = 50/209 (23%)

Query: 14  VASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNADGMGS----- 68
           V +                 DD      ++         + ++         +       
Sbjct: 9   VVAPHPD-------------DDVIGCGGSMAKHVREGARVTVVVVIGRERSALDDAVTEA 55

Query: 69  IRDQELHRACAVLKVI-PFDKEGICDN---------DSCHCNEEH--IITFDNYGVWGHC 116
               E   A  +L V      +    +         D      E    + +  +      
Sbjct: 56  EFAAETENAAKILGVHRCVRFDESSRDFAPSRRIHLDLVRVLREVRPQVVYLPHDNDDDV 115

Query: 117 NHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLS---ILSAKRHP---- 169
            HR VH      +               S   F +  G   +      +      P    
Sbjct: 116 EHRMVHRLTTEAL-----------WMAQS--EFFQEAGECPMPAPRLVLGYEVWSPMARY 162

Query: 170 SKVIIINKQPWKSFKAMAQHLSQWVWFRK 198
                I +      +AM  ++SQ      
Sbjct: 163 QYAEDIGEHIHTKVEAMRAYVSQLRHAAW 191
>ref|NP_376770.1| (NC_003106) 221aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
 dbj|BAB65879.1| (AP000984) 221aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
          Length = 221

 Score =  110 bits (275), Expect = 2e-23
 Identities = 29/214 (13%), Positives = 55/214 (25%), Gaps = 58/214 (27%)

Query: 12  IWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNADG------ 65
           I   S                 D+      T+      +  ++I+  + G+A        
Sbjct: 3   ILFISPHPD-------------DECDNAGGTLAKLA-KSHEIYIVYMTDGSAGSPNPEER 48

Query: 66  ---MGSIRDQELHRACAVLKV--IPFDKEGICDNDSCHCNEEH--------------IIT 106
              +  IR +E      VL +           D        E               II 
Sbjct: 49  GEKLAEIRRKEALEGLKVLGIKKDNAFFLNYPDTKLRFHIREASERVAKILREIKPNIII 108

Query: 107 FDNYGVWGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAK 166
           + +  + GH +H                    +    +++N    Y   + I    +   
Sbjct: 109 YPSL-LDGHNDH----------WSGGYITRIAIRKVGITVN-ELSYLNWLPIPSKSVFDA 156

Query: 167 -------RHPSKVIIINKQPWKSFKAMAQHLSQW 193
                   H    + I +      +AM +H SQ+
Sbjct: 157 IKYLLIPFHRKIKVDIREYKRIKLEAMKKHESQF 190
>ref|NP_334747.1| (NC_002755) hypothetical protein [Mycobacterium tuberculosis
           CDC1551]
 gb|AAK44561.1| (AE006940) hypothetical protein [Mycobacterium tuberculosis
           CDC1551]
          Length = 223

 Score =  108 bits (269), Expect = 8e-23
 Identities = 34/183 (18%), Positives = 53/183 (28%), Gaps = 33/183 (18%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------MGSIRDQELHRACAVLKVIPF 86
           D+       +  FT+    L  LCF+ G A         +G +R +EL  A  VL V   
Sbjct: 22  DESFGLGAVLGDFTAQGTRLRGLCFTHGEASTLGRTDRNLGEVRREELAAAAQVLGVDHV 81

Query: 87  DKEGICDNDSCHC----NEEHII----------TFDNYGVWGHCNHRDVHPPIDCKIDSA 132
                 DN           + ++           FD+ GV GH +HR           + 
Sbjct: 82  QLLAYPDNGLAQIPLNELTQRVVDALAGADLLLVFDDNGVTGHPDHRRATEAALAAASTP 141

Query: 133 KRI---HGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
                           LN                    H   +I ++    +   A+  H
Sbjct: 142 GIPVLAWALPQPIADRLNAEFSASF-------GGRGHGHLDIMIEVD--RSRQLAAIGCH 192

Query: 190 LSQ 192
            +Q
Sbjct: 193 FTQ 195
>pir||S44952 lmbE protein - Streptomyces lincolnensis
 pir||S69814 lmbE protein - Streptomyces lincolnensis
 emb|CAA55751.1| (X79146) lmbE [Streptomyces lincolnensis]
          Length = 270

 Score =  108 bits (268), Expect = 1e-22
 Identities = 26/224 (11%), Positives = 60/224 (26%), Gaps = 62/224 (27%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADGM--------------GSIRDQELHRACA 79
           D+      T+ ++T+      ++  + G A  +               ++R  EL  +  
Sbjct: 14  DEASRGGATVAHYTAQGVRAVLVTCTDGGAGEVLNPAVTDDFTPERFVAVRSAELDASAR 73

Query: 80  VLKVIPFDKEGICDNDS--------------------------CHCNEEHIITF-DNYGV 112
            L      + G  D+                                 + +I +  N+  
Sbjct: 74  NLGYSAVHRLGYRDSGMDGTAGGAEAFVRAPLDEAATRLARVIADERPDVVIGYGTNHTR 133

Query: 113 WGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFR-------------KYCGPVDIW 159
             H +H   +  +  ++D        +Y    S    R              Y G +   
Sbjct: 134 DPHPDHIRANEVLTRRVDLL-DHTPAVYHIAFSRRRHRALHQACVDSGVPSPYEGGLSAP 192

Query: 160 LSILSAKRHPSKVIIIN--KQPWKSFKAMAQHLSQW----VWFR 197
                 +   + ++ +       +   A+  H++Q      WF 
Sbjct: 193 PGAFDDEW-ITTLVDVTKGDAVERRLDALRSHVTQVPPASGWFA 235
>ref|NP_214837.1| (NC_000962) hypothetical protein Rv0323c [Mycobacterium
           tuberculosis H37Rv]
 pir||D70526 hypothetical protein Rv0323c - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAB09612.1| (Z96800) hypothetical protein Rv0323c [Mycobacterium tuberculosis
           H37Rv]
          Length = 223

 Score =  106 bits (264), Expect = 3e-22
 Identities = 34/183 (18%), Positives = 53/183 (28%), Gaps = 33/183 (18%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADG-------MGSIRDQELHRACAVLKVIPF 86
           D+       +  FT+    L  LCF+ G A         +G +R +EL  A  VL V   
Sbjct: 22  DESFGLGAVLGDFTAQGTRLRGLCFTHGEASTLGRTDRNLGEVRREELAAAAQVLGVDHV 81

Query: 87  DKEGICDNDSCHC----NEEHII----------TFDNYGVWGHCNHRDVHPPIDCKIDSA 132
                 DN           + ++           FD+ GV GH +HR           + 
Sbjct: 82  QLLAYPDNGLAQIPLNELTQRVVDALAGADLLLVFDDNGVTGHPDHRRATEAALAAASTP 141

Query: 133 KRI---HGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQH 189
                           LN                    H   +I ++    +   A+  H
Sbjct: 142 SIPVLAWALPQPIADRLNAEFSASF-------GGRGHGHLDIMIEVD--RSRQLAAIGCH 192

Query: 190 LSQ 192
            +Q
Sbjct: 193 FTQ 195
>ref|NP_492873.1| (NM_060472) Y52B11C.1.p [Caenorhabditis elegans]
 pir||T27111 hypothetical protein Y52B11C.1 - Caenorhabditis elegans
 emb|CAA19544.1| (AL023846) Y52B11C.1 [Caenorhabditis elegans]
          Length = 151

 Score =  101 bits (251), Expect = 1e-20
 Identities = 44/144 (30%), Positives = 64/144 (43%), Gaps = 26/144 (18%)

Query: 1   MVVVFLSLIVVIWVASFFKIFFRATSISRATIL-----DDGKFFSPTINYFTSTACNLHI 55
           ++V  L ++++I+      I     S SR  +L     D+  FFSPTI         + +
Sbjct: 7   LIVTLLLVLLLIFAVRSHPIPL--LSQSRILLLIAHPDDETMFFSPTIRALLQAGHRVFV 64

Query: 56  LCFSTGNADGMGSIRDQELHRACAVLKVI-----PFDKEGICDND------SCHCNEEH- 103
           LC S GN DG+G IR +EL RA + L +        D +   D D       C     H 
Sbjct: 65  LCISNGNFDGLGKIRARELSRAASKLGISASDVICLDYDEFADGDTWNRNALCQIVMRHV 124

Query: 104 -------IITFDNYGVWGHCNHRD 120
                  +I+FD++GV GH NH  
Sbjct: 125 EVLAADTVISFDSHGVSGHHNHAR 148
>ref|NP_191372.1| (NM_115675) putative protein [Arabidopsis thaliana]
 pir||T45973 hypothetical protein F9D24.40 - Arabidopsis thaliana
 emb|CAB68151.1| (AL137081) putative protein [Arabidopsis thaliana]
          Length = 124

 Score = 97.2 bits (240), Expect = 2e-19
 Identities = 59/110 (53%), Positives = 67/110 (60%), Gaps = 21/110 (19%)

Query: 38  FFSPTINYFTSTACNLHILCFSTGNADGMGSIRDQELHRACAVLKV---------IPFDK 88
           FFSPTINY  S ACNLH+LC STGNADGMGSIR+ ELHRACAVLKV          P  +
Sbjct: 7   FFSPTINYLASNACNLHMLCLSTGNADGMGSIRNNELHRACAVLKVPLQQLKILNHPNLQ 66

Query: 89  EGIC------------DNDSCHCNEEHIITFDNYGVWGHCNHRDVHPPID 126
           +G              + +    +   IITFDNYGV GHCNHRDVH  + 
Sbjct: 67  DGFGQLWSHDLLTEIIEEEVTKHDIHTIITFDNYGVSGHCNHRDVHRGVL 116
>ref|NP_105992.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
 dbj|BAB51778.1| (AP003006) hypothetical protein [Mesorhizobium loti]
          Length = 229

 Score = 97.2 bits (240), Expect = 2e-19
 Identities = 26/179 (14%), Positives = 47/179 (25%), Gaps = 44/179 (24%)

Query: 34  DDGKFFSPTINYFTSTACNLHILCFSTGNADG------MGSIRDQELHRACAVLKVIPFD 87
           D   F   T+  + +    L     + G   G      +  +R +E   A  +L   P  
Sbjct: 12  DIEIFMFGTLAVYAAQGAELTFAVATDGAKGGKSDATVLARVRREEATAAAGLLGAAP-R 70

Query: 88  KEGICDNDS--------------CHCNEEHIITFDNYGVWGHCNHRDVHPPIDCKIDSAK 133
                D +                    + +IT        H +HR +   +      A 
Sbjct: 71  FLDFPDGELVADAALIGALKTLIAGTGPDLVITHAPNDY--HADHRALSDSVRIAASFA- 127

Query: 134 RIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQHLSQ 192
                  +H                  ++      P+  + I+       KA+  H SQ
Sbjct: 128 ----VPVLHA----------------DTMGGTGFSPTHYVDISAHAEIKAKAIRMHQSQ 166
>emb|CAC04222.1| (AL391515) conserved hypothetical protein [Streptomyces coelicolor]
          Length = 247

 Score = 96.1 bits (237), Expect = 5e-19
 Identities = 28/206 (13%), Positives = 55/206 (26%), Gaps = 39/206 (18%)

Query: 19  KIFFRATSI--SRATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGNAD--GMGSI 69
               R+      RA  +     D     S  +  + +   ++  L  + G A    +   
Sbjct: 6   PAPLRSMPDDWRRALAVVAHPDDLEYGCSAAVASWVADGKDVAYLLATRGEAGIDTLDPG 65

Query: 70  RDQEL-----HRACAVLKVIPFDKEGICDN--------------DSCHCNEEHIITF--- 107
           R   L       A A + V   +     D                      E +IT    
Sbjct: 66  RAGPLREAEQRAAAAAVGVRAVEFLDHRDGVIEYGASLRRDIAAAVRRHRPELVITLNHR 125

Query: 108 DNY--GVWGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSA 165
           D +  G W   +H  V   +      A     F  + +  L           +    ++ 
Sbjct: 126 DTWAAGAWNTPDHVAVGRAVLDAAADAGNRWIFPELAEQGLVP------WNGVRWVAVAN 179

Query: 166 KRHPSKVIIINKQPWKSFKAMAQHLS 191
              PS  +       +  +++ +H +
Sbjct: 180 SPTPSHAVSAEPGFEQGVRSLLRHRT 205
>ref|NP_215686.1| (NC_000962) hypothetical protein Rv1170 [Mycobacterium tuberculosis
           H37Rv]
 ref|NP_335650.1| (NC_002755) lmbE-related protein [Mycobacterium tuberculosis
           CDC1551]
 pir||B70875 hypothetical protein Rv1170 - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAA15847.1| (AL010186) hypothetical protein Rv1170 [Mycobacterium tuberculosis
           H37Rv]
 gb|AAK45464.1| (AE006998) lmbE-related protein [Mycobacterium tuberculosis
           CDC1551]
          Length = 303

 Score = 95.7 bits (236), Expect = 6e-19
 Identities = 30/211 (14%), Positives = 50/211 (23%), Gaps = 66/211 (31%)

Query: 29  RATIL-----DDGKFFSPTINYFTSTACNLHILCFSTGN----------------ADGMG 67
           R   +     D+      TI ++TS    +H++  + G                 AD +G
Sbjct: 6   RLLFVHAHPDDESLSNGATIAHYTSRGAQVHVVTCTLGEEGEVIGDRWAQLTADHADQLG 65

Query: 68  SIRDQELHRACAVLKVIPFDKEG----ICDNDS--------------------------- 96
             R  EL  A   L V      G      D+                             
Sbjct: 66  GYRIGELTAALRALGVSAPIYLGGAGRWRDSGMAGTDQRSQRRFVDADPRQTVGALVAII 125

Query: 97  CHCNEEHIITFDNYGVWGHCNHRDVHP------------PIDCKIDSAKRIHGFLYVHQV 144
                  ++T+D  G +GH +H   H                             Y   +
Sbjct: 126 RELRPHVVVTYDPNGGYGHPDHVHTHTVTTAAVAAAGVGSGTADHPGDPWTVPKFYWTVL 185

Query: 145 SLNIFRKYCGPVDIWLSILSAKRHPSKVIII 175
            L+          +    L  +    +   I
Sbjct: 186 GLSALISGARA--LVPDDLRPEWVLPRADEI 214
>ref|NP_414898.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
 pir||D64764 hypothetical protein b0364 - Escherichia coli
 gb|AAC73467.1| (AE000143) orf, hypothetical protein [Escherichia coli K12]
          Length = 136

 Score = 85.9 bits (211), Expect = 5e-16
 Identities = 19/157 (12%), Positives = 35/157 (22%), Gaps = 44/157 (28%)

Query: 57  CFSTGNADGMGSI-RDQELHRACAVLKVIPFDKEGICD-----------NDSCHC----- 99
             +TGN+   G I R +E   A  +L           D           +          
Sbjct: 2   VMTTGNSGTDGIIDRHEESRNALKILGCHQTIHLNFADTRAHLQLNDMISALEDIIKNQI 61

Query: 100 --NEEHIITFDNYGVWGHCNHRDVHPPIDCKIDSAKRI---HGFLYVHQVSLNIFRKYCG 154
             + E +  +  +    H +H  V+        +  +I               +F     
Sbjct: 62  PSDVEIMRVYTMHDADRHQDHLAVYQASMVACRTIPQILGYETPSTWLSFMPQVFESVKE 121

Query: 155 PVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQHLS 191
                                         A+ +H S
Sbjct: 122 E----------------------YFTVKLAALKKHKS 136
>ref|NP_294927.1| (NC_001263) LmbE-related protein [Deinococcus radiodurans]
 pir||B75424 LmbE-related protein - Deinococcus radiodurans (strain R1)
 gb|AAF10773.1|AE001969_2 (AE001969) LmbE-related protein [Deinococcus radiodurans]
          Length = 252

 Score = 83.6 bits (205), Expect = 3e-15
 Identities = 16/136 (11%), Positives = 33/136 (23%), Gaps = 30/136 (22%)

Query: 29  RATIL----DDGKFFSPTINYFTSTACNLHILCFSTGNA---------DGMGSIRDQELH 75
           R   +    DD      T+         + ++  + G           + +  IR +   
Sbjct: 2   RIMAVFAHPDDEIGCIGTLAKHARRGDEVLLVWTTLGELASQFGDTEHEEVRRIRREHGA 61

Query: 76  RACAVLKVI-------PFDKEGICDNDS------CHCNEEHIITF-DNYGVWGHCNHRDV 121
                +               G  D                +IT+ D++    H +HR  
Sbjct: 62  WVADKIGAKYHFFDMGDSRMTGGRDEALQLARLYATFRPHAVITWSDDHP---HPDHRMT 118

Query: 122 HPPIDCKIDSAKRIHG 137
                  +  A+    
Sbjct: 119 AKIAFDAVTLARIPKI 134
>ref|NP_285456.1| (NC_001264) hypothetical protein [Deinococcus radiodurans]
 pir||G75608 hypothetical protein - Deinococcus radiodurans (strain R1)
 gb|AAF12317.1|AE001862_143 (AE001862) hypothetical protein [Deinococcus radiodurans]
          Length = 232

 Score = 78.9 bits (193), Expect = 7e-14
 Identities = 25/213 (11%), Positives = 46/213 (20%), Gaps = 60/213 (28%)

Query: 12  IWVASFFKIFFRATSISRATILDDGKFFSPTINYFTSTACNLHILCFSTGNADG------ 65
           +WV +                 D+       +         +  L  + G          
Sbjct: 13  LWVVAPHPD-------------DEALGCGALLAALAEAGREVWALLLTDGGFSHPASKAY 59

Query: 66  ----MGSIRDQELHRACAVLKVIPFDK--EGICDNDSCHCNEEHI--------------I 105
               + ++R  E     +VL V P      G+ D                          
Sbjct: 60  PRPRLSAVRLAEWREGLSVLGVPPARTVALGLPDGALGEHLTAAARAQVRQAFAQARPGT 119

Query: 106 TFDNYGVWGHCNHRDVHPPIDCKIDSAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSA 165
               +    H +HR                   L     S  +  +Y   V +      A
Sbjct: 120 VLLPWERDPHPDHRA--------------AWHLLRGVLPSDTLALEYA--VWLPERGADA 163

Query: 166 KRHPSKVII-----INKQPWKSFKAMAQHLSQW 193
                  +      +        +A+A H +Q 
Sbjct: 164 DWPRPDEVEELTFAVGDWRDAKARAIASHRTQL 196
>pir||D69906 hypothetical protein yojG - Bacillus subtilis
 emb|CAB13838.1| (Z99114) yojG [Bacillus subtilis]
 gb|AAC17855.1| (AF026147) YojG [Bacillus subtilis]
          Length = 142

 Score = 74.3 bits (181), Expect = 2e-12
 Identities = 19/123 (15%), Positives = 32/123 (25%), Gaps = 26/123 (21%)

Query: 81  LKVIPFDKEGICDND-SCHCNEEH--------IITFDNYGVWGHCNHRDVHPPIDCK-ID 130
           L +         D++      EE         I+TF   G   H +H      +      
Sbjct: 2   LGLRDK-TLEFEDDEYLADIMEEIIDDVKPSLIVTF-YPGHGVHPDHDACGEAVIRALYR 59

Query: 131 SAKRIHGFLYVHQVSLNIFRKYCGPVDIWLSILSAKRHPSKVIIINKQPWKSFKAMAQHL 190
             K          ++ N   +  G  D              V+ I +       A+  H 
Sbjct: 60  KKKEDRPRTICMAITRNR-EEVLGEAD-------------VVLDIKEVADIKMNALRAHR 105

Query: 191 SQW 193
           +Q 
Sbjct: 106 TQT 108
CPU time:    67.91 user secs.	    3.27 sys. secs	   71.18 total secs.

  Database: nr
    Posted date:  Apr 21, 2002  2:19 PM
  Number of letters in database: 277,845,442
  Number of sequences in database:  887,402
  
Lambda     K      H
   0.315    0.176    0.509 

Gapped
Lambda     K      H
   0.270   0.0617    0.230 
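
Note on the Lambda and K values above: BLAST uses these Karlin-Altschul parameters to convert raw scores to bit scores, S' = (Lambda*S - ln K) / ln 2, and raw X drop-off values to bits, Lambda*X / ln 2. The following is a minimal sketch of that conversion, not the BLAST source code. The function names are hypothetical, and the raw values are taken from the S1, S2 and X1-X3 lines of this report's parameter list.

```python
import math

# Karlin-Altschul parameters as printed in this report (rounded values).
UNGAPPED = (0.315, 0.176)    # (Lambda, K) for ungapped alignments
GAPPED = (0.270, 0.0617)     # (Lambda, K) for gapped alignments


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert a raw X drop-off to bits: Lambda * X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


# Raw cutoffs S1 = 40 (ungapped) and S2 = 72 (gapped) from the parameter list.
s1_bits = round(bit_score(40, *UNGAPPED), 1)
s2_bits = round(bit_score(72, *GAPPED), 1)

# Raw drop-offs X1 = 15 (ungapped), X2 = 38 and X3 = 64 (gapped).
x1_bits = round(dropoff_bits(15, UNGAPPED[0]), 1)
x2_bits = round(dropoff_bits(38, GAPPED[0]), 1)
x3_bits = round(dropoff_bits(64, GAPPED[0]), 1)

print(s1_bits, s2_bits, x1_bits, x2_bits, x3_bits)
```

The printed Lambda and K are rounded, so bit scores recomputed this way for individual alignments may differ from the reported ones in the last digit.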


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166230603
Number of Sequences: 887402
Number of extensions: 8970760
Number of successful extensions: 29002
Number of sequences better than 10.0: 89
Number of HSP's better than 10.0 without gapping: 55
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 28768
Number of HSP's gapped (non-prelim): 99
length of query: 223
length of database: 277,845,442
effective HSP length: 48
effective length of query: 175
effective length of database: 235,250,146
effective search space: 41168775550
effective search space used: 41168775550
T: 11
A: 40
X1: 15 ( 6.8 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 40 (20.7 bits)
S2: 72 (32.1 bits)
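
Note on the effective lengths above: the reported figures are consistent with subtracting the effective HSP length from the query once and from the database once per sequence, then multiplying the two effective lengths to get the search space. The sketch below reproduces that arithmetic and the usual E = (search space) * 2^(-bit score) estimate. The function names are hypothetical, and this illustrates the relationship between the printed numbers rather than BLAST's actual implementation.

```python
def effective_lengths(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len   # HSP length removed once per sequence
    return eff_query, eff_db, eff_query * eff_db


def expect_value(bits, search_space):
    """Approximate E-value for a bit score: search_space * 2 ** (-bits)."""
    return search_space * 2.0 ** (-bits)


# Values printed in this report: query 223 letters, database 277,845,442
# letters in 887,402 sequences, effective HSP length 48.
eff_query, eff_db, space = effective_lengths(223, 277_845_442, 887_402, 48)
print(eff_query, eff_db, space)

# An alignment reported at 127 bits; the result is of the order 1e-28.
print(expect_value(127, space))
```

Reported bit scores are rounded to whole numbers, so an E-value recomputed from them agrees with the Expect column only approximately.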