BLASTP 2.1.1 [Aug-8-2000]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|2496063|sp|Q58012|Y594_METJA HYPOTHETICAL PROTEIN
MJ0594 gi|2128398|pir||B64374 hypothetical protein
MJ0594|gi|1591303|gb|AAB98595.1| (U67508) M. jannaschii predicted
coding region MJ0594 [Methanococcus jannaschii]
         (169 letters)

Database: nr
           618,844 sequences; 195,544,254 total letters

Searching..................................................

E-value threshold for inclusion in PSI-Blast iteration 2: 0.002 
E-value threshold for inclusion in PSI-Blast iteration 3:



Results of PSI-Blast iteration 1

Distribution of 19 Blast Hits on the Query Sequence






Sequences with E-value BETTER than threshold
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q58012|Y594_METJA HYPOTHETICAL PROTEIN MJ0594 >gi|2128398|pir... 283 8e-76
pir||G75218 hypothetical protein PAB2357 - Pyrococcus abyssi (st... 128 5e-29
pir||H71203 hypothetical protein PH1900 - Pyrococcus horikoshii ... 126 2e-28
pir||B72623 hypothetical protein APE1443 - Aeropyrum pernix (str... 118 3e-26
pir||F69190 conserved hypothetical protein MTH680 - Methanobacte... 114 6e-25
ref|NP_069935.1| conserved hypothetical protein [Archaeoglobus f... 105 4e-22
emb|CAB57572.1| (Y18930) hypothetical protein [Sulfolobus solfat... 51 6e-06
gb|AAD14602.1| (AF092910) stage specific peptide 24 [Trypanosoma... 47 1e-04
emb|CAC18315.1| (AL451022) probable IMP4 protein [Neurospora cra... 45 4e-04
sp|O62518|YHPK_CAEEL HYPOTHETICAL 34.0 KDA PROTEIN ZK795.3 IN CH... 44 8e-04

Sequences with E-value WORSE than threshold

ref|NP_014324.1| Imp4p [Saccharomyces cerevisiae] >gi|1730744|sp... 41 0.011
gb|AAF56395.1| (AE003750) CG11920 gene product [Drosophila melan... 40 0.014
gb|AAG52427.1|AC011622_15 (AC011622) putative U3 small nucleolar... 39 0.035
gb|AAF53162.1| (AE003635) CG6712 gene product [Drosophila melano... 39 0.042
sp|O13823|YEE7_SCHPO HYPOTHETICAL 33.4 KDA PROTEIN C19A8.07C IN ... 38 0.071
dbj|BAB14086.1| (AK022537) unnamed protein product [Homo sapiens] 33 1.4
sp|P54223|BETA_RHIME CHOLINE DEHYDROGENASE (CHD) 33 2.5
pir||T50616 hypothetical protein DKFZp761G0415.1 - human (fragment) 33 2.6
Alignments
>sp|Q58012|Y594_METJA HYPOTHETICAL PROTEIN MJ0594
 pir||B64374 hypothetical protein MJ0594 - Methanococcus jannaschii
 gb|AAB98595.1| (U67508) M. jannaschii predicted coding region MJ0594
           [Methanococcus jannaschii]
          Length = 169

 Score =  283 bits (718), Expect = 8e-76
 Identities = 169/169 (100%), Positives = 169/169 (100%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKEIFEIDKHVLLIGEFKANPGTLV 60
           MILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKEIFEIDKHVLLIGEFKANPGTLV
Sbjct: 1   MILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKEIFEIDKHVLLIGEFKANPGTLV 60

Query: 61  VYDVENEKRLSSFISVKLQREICGEKIYNDDGIRIKISRELKDNEEFQKYYEIYDEFLFQ 120
           VYDVENEKRLSSFISVKLQREICGEKIYNDDGIRIKISRELKDNEEFQKYYEIYDEFLFQ
Sbjct: 61  VYDVENEKRLSSFISVKLQREICGEKIYNDDGIRIKISRELKDNEEFQKYYEIYDEFLFQ 120

Query: 121 HLNINEDSDITLRLEKDPKYLFAIQFYKGRVKIGPLIRIKSIKLFDSLL 169
           HLNINEDSDITLRLEKDPKYLFAIQFYKGRVKIGPLIRIKSIKLFDSLL
Sbjct: 121 HLNINEDSDITLRLEKDPKYLFAIQFYKGRVKIGPLIRIKSIKLFDSLL 169
>pir||G75218 hypothetical protein PAB2357 - Pyrococcus abyssi (strain Orsay)
 emb|CAB49198.1| (AJ248283) hypothetical protein [Pyrococcus abyssi]
          Length = 224

 Score =  128 bits (319), Expect = 5e-29
 Identities = 32/92 (34%), Positives = 49/92 (52%), Gaps = 10/92 (10%)

Query: 1  MILTTSRKPSQRTRSFARDLERTL-NIPYVQRGKLSLKEIF-----EIDKHVLLIGEFKA 54
          M++TTS +P++RTRSF  DLER   N  Y+ RGK +++E+         + +L+I  +K 
Sbjct: 2  MLITTSHRPTRRTRSFGHDLERVFPNSLYMTRGKKTIQELLMEAYDRGYERLLIINVWKG 61

Query: 55 NPGTLVVYDVENEK----RLSSFISVKLQREI 82
          NP  +    V  +            VKLQRE+
Sbjct: 62 NPLKMTFIKVHPDDWGYLGYLYLHGVKLQREM 93
 Score = 32.7 bits (74), Expect = 2.6
 Identities = 20/81 (24%), Positives = 37/81 (44%), Gaps = 3/81 (3%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKEIF-EIDKHVLLIGEFKANPGTL 59
           +++TT+++      +FA+         +V RG  SL  I  + +  VL + E       +
Sbjct: 107 LVVTTAKRVGLDHLAFAQVFSELTTGKFVPRGDKSLLSIADKYNTDVLAVIERHPRGIVV 166

Query: 60  VVY--DVENEKRLSSFISVKL 78
             Y  DV  E+ +   I+VK+
Sbjct: 167 NFYRLDVTKERAVGPLINVKI 187
>pir||H71203 hypothetical protein PH1900 - Pyrococcus horikoshii
 dbj|BAA31023.1| (AP000007) 334aa long hypothetical protein [Pyrococcus
          horikoshii]
          Length = 224

 Score =  126 bits (314), Expect = 2e-28
 Identities = 33/92 (35%), Positives = 50/92 (53%), Gaps = 10/92 (10%)

Query: 1  MILTTSRKPSQRTRSFARDLERTL-NIPYVQRGKLSLKEIF-----EIDKHVLLIGEFKA 54
          M++TTS +P++RTRSF  DLER + N  Y+ RGK +++E+         + +L+I  +K 
Sbjct: 2  MLITTSHRPTRRTRSFGHDLERVIPNSLYLTRGKKTIQELLMEAYDRGYERLLIINVWKG 61

Query: 55 NPGTLVVYDVENEK----RLSSFISVKLQREI 82
          NP  +    V  E            VKLQRE+
Sbjct: 62 NPLKMTFIKVHPEDWGYLGYLYLHGVKLQREM 93
>pir||B72623 hypothetical protein APE1443 - Aeropyrum pernix (strain K1)
 dbj|BAA80440.1| (AP000061) 197aa long hypothetical protein [Aeropyrum pernix]
          Length = 197

 Score =  118 bits (295), Expect = 3e-26
 Identities = 32/92 (34%), Positives = 50/92 (53%), Gaps = 11/92 (11%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIPY-VQRGKLSLKE-----IFEIDKHVLLIGEFKA 54
           +++TTSR+PS R RSF +DL  T+   +   RG  S++E     I      ++++GE + 
Sbjct: 16  ILVTTSRRPSPRIRSFVKDLSATIPGAFRFTRGHYSMEELAREAIIRGADRIVVVGERRG 75

Query: 55  NPGTLVVYDVENEKRLSSFI-----SVKLQRE 81
           NPG + VY VE  +R  + +      V L RE
Sbjct: 76  NPGIIRVYAVEGPERPDNIVSFIVKGVSLSRE 107
>pir||F69190 conserved hypothetical protein MTH680 - Methanobacterium
          thermoautotrophicum (strain Delta H)
 gb|AAB85185.1| (AE000848) conserved protein [Methanothermobacter
          thermautotrophicus]
          Length = 155

 Score =  114 bits (284), Expect = 6e-25
 Identities = 30/70 (42%), Positives = 43/70 (60%), Gaps = 1/70 (1%)

Query: 1  MILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKE-IFEIDKHVLLIGEFKANPGTL 59
          M+LTTSRKPSQRTRSF++ L R +   Y+ RGK+SL++ + E    V ++ E   NP  +
Sbjct: 1  MLLTTSRKPSQRTRSFSQRLSRIMGWRYINRGKMSLRDVLIEARGPVAVVSERHGNPARI 60

Query: 60 VVYDVENEKR 69
             D    +R
Sbjct: 61 TFLDERGGER 70
>ref|NP_069935.1| conserved hypothetical protein [Archaeoglobus fulgidus]
 pir||A69388 conserved hypothetical protein AF1106 - Archaeoglobus fulgidus
 gb|AAB90132.1| (AE001027) conserved hypothetical protein [Archaeoglobus
          fulgidus]
          Length = 153

 Score =  105 bits (260), Expect = 4e-22
 Identities = 30/62 (48%), Positives = 41/62 (65%)

Query: 2  ILTTSRKPSQRTRSFARDLERTLNIPYVQRGKLSLKEIFEIDKHVLLIGEFKANPGTLVV 61
          +LTTSRKP ++TR FA+ L R  N  YV RGKLSL+++  I +   +I E K NP  L +
Sbjct: 3  VLTTSRKPGRKTRRFAKVLARFFNWKYVNRGKLSLEDLAGIAERFWIISEVKGNPAILNL 62

Query: 62 YD 63
          Y+
Sbjct: 63 YE 64
>emb|CAB57572.1| (Y18930) hypothetical protein [Sulfolobus solfataricus]
 emb|CAC23146.1| (AL512964) hypothetical [Sulfolobus solfataricus]
          Length = 170

 Score = 51.4 bits (122), Expect = 6e-06
 Identities = 26/71 (36%), Positives = 38/71 (52%), Gaps = 6/71 (8%)

Query: 1  MILTTSRKPSQRTRSFARDLERTLN-IPYVQRGKLSLKEIFE-----IDKHVLLIGEFKA 54
          +++T+SR PS RTR+F   L   L     + RGK S  EIFE        ++L +     
Sbjct: 8  IVITSSRDPSIRTRNFLNVLTFVLPDSVKITRGKKSKIEIFERAINLGALYLLFVLAKNG 67

Query: 55 NPGTLVVYDVE 65
          NP  ++VYD+E
Sbjct: 68 NPLRIIVYDLE 78
>gb|AAD14602.1| (AF092910) stage specific peptide 24 [Trypanosoma cruzi]
          Length = 287

 Score = 47.1 bits (111), Expect = 1e-04
 Identities = 18/66 (27%), Positives = 38/66 (57%), Gaps = 6/66 (9%)

Query: 2   ILTTSRKPSQRTRSFARDLERTLNIPY-VQRGKLSLKEIFEIDKH-----VLLIGEFKAN 55
           ++TTSR+PSQ+   FA+++   +     + RG LS++++ +  +      V+++ E +  
Sbjct: 86  LVTTSREPSQKLLEFAKEIRLVIPSAVRMNRGNLSVRQLMDAARRGQYSDVVVLQESQGV 145

Query: 56  PGTLVV 61
           P +L V
Sbjct: 146 PDSLTV 151
>emb|CAC18315.1| (AL451022) probable IMP4 protein [Neurospora crassa]
          Length = 295

 Score = 45.2 bits (106), Expect = 4e-04
 Identities = 22/89 (24%), Positives = 42/89 (46%), Gaps = 7/89 (7%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLN-IPYVQRGKLSLKEIFEIDKH-----VLLIGEFKA 54
           +++TTSR PS R   F++++   L     + RG L L+++    K      V+L+ E + 
Sbjct: 86  ILVTTSRDPSSRLGQFSKEIRLLLPTSVRLNRGNLVLEDLVGAAKAQNLTDVVLLHEHRG 145

Query: 55  NPGTLVVYDV-ENEKRLSSFISVKLQREI 82
            P  + +         + S  +V L+ +I
Sbjct: 146 VPTAMTISHFPHGPTLMVSLHNVVLRADI 174
>sp|O62518|YHPK_CAEEL HYPOTHETICAL 34.0 KDA PROTEIN ZK795.3 IN CHROMOSOME IV
 pir||T27998 hypothetical protein ZK795.3 - Caenorhabditis elegans
 emb|CAB05841.1| (Z83246) predicted using Genefinder~contains similarity to Pfam
           domain: PF01945 (Domain of unknown function),
           Score=306.9, E-value=7.7e-89, N=1~cDNA EST EMBL:M79771
           comes from this gene [Caenorhabditis elegans]
          Length = 292

 Score = 44.4 bits (104), Expect = 8e-04
 Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 6/62 (9%)

Query: 1   MILTTSRKPSQRTRSFARDLERTL-NIPYVQRGKLSLKEIFEIDKH-----VLLIGEFKA 54
           +++TTSR PS R + FA++++    N   + RG   +K++ +  K      +++  E + 
Sbjct: 80  IVITTSRDPSSRLKMFAKEMKLIFPNAQRINRGHYDVKQVVQASKAQDSTDLIIFTETRG 139

Query: 55  NP 56
           NP
Sbjct: 140 NP 141
>ref|NP_014324.1| Imp4p [Saccharomyces cerevisiae]
 sp|P53941|IMP4_YEAST U3 SMALL NUCLEOLAR RIBONUCLEOPROTEIN PROTEIN IMP4
 pir||S53904 hypothetical protein YNL075w - yeast (Saccharomyces cerevisiae)
 emb|CAA60184.1| (X86470) unknown [Saccharomyces cerevisiae]
 emb|CAA95949.1| (Z71351) ORF YNL075w [Saccharomyces cerevisiae]
          Length = 290

 Score = 40.5 bits (94), Expect = 0.011
 Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 7/89 (7%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIPY-VQRGKLSLKEIFEIDKH-----VLLIGEFKA 54
           +I+TTSR PS R   FA++++        + RG   +  + +  K      ++++ E + 
Sbjct: 88  IIVTTSRDPSTRLSQFAKEIKLLFPNAVRLNRGNYVMPNLVDACKKSGTTDLVVLHEHRG 147

Query: 55  NPGTLVVYDV-ENEKRLSSFISVKLQREI 82
            P +L +           S  +V ++ +I
Sbjct: 148 VPTSLTISHFPHGPTAQFSLHNVVMRHDI 176
>gb|AAF56395.1| (AE003750) CG11920 gene product [Drosophila melanogaster]
          Length = 298

 Score = 40.1 bits (93), Expect = 0.014
 Identities = 20/83 (24%), Positives = 37/83 (44%), Gaps = 8/83 (9%)

Query: 1   MILTTSRKPSQRTRSFARDLERTL-NIPYVQRGKLSLKEIFEIDKH-----VLLIGEFKA 54
           ++LTTS  PS R + F ++L   + N   + RG   L  +    +       L++ E + 
Sbjct: 83  IMLTTSHNPSSRLKMFMKELRLIIPNAQQMNRGNYQLTTLMHACRANNVTDFLIVHEHRG 142

Query: 55  NPGTLVVYDVENEKRLSSFISVK 77
            P +LVV         ++F ++ 
Sbjct: 143 IPDSLVV--CHLPYGPTAFFNIS 163
>gb|AAG52427.1|AC011622_15 (AC011622) putative U3 small nucleolar ribonucleoprotein protein;
           1537-3735 [Arabidopsis thaliana]
 gb|AAG52449.1|AC010852_6 (AC010852) putative U3 small nucleolar ribonucleoprotein protein;
           73469-75667 [Arabidopsis thaliana]
          Length = 294

 Score = 39.0 bits (90), Expect = 0.035
 Identities = 19/62 (30%), Positives = 30/62 (47%), Gaps = 6/62 (9%)

Query: 1   MILTTSRKPSQRTRSFARDLERTL-NIPYVQRGKLSLKEIFEIDKH-----VLLIGEFKA 54
           ++LTTSR PS     F ++L+    N   + RG   + EI E  +      V+L+ E + 
Sbjct: 85  ILLTTSRNPSAPLIRFTKELKFVFPNSQRINRGSQVISEIIETARSHDFTDVILVHEHRG 144

Query: 55  NP 56
            P
Sbjct: 145 VP 146
>gb|AAF53162.1| (AE003635) CG6712 gene product [Drosophila melanogaster]
          Length = 394

 Score = 38.6 bits (89), Expect = 0.042
 Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 2   ILTTSRKPSQRTRSFARDLERTLNIPYVQ-RGKLSLKEIFEIDKH-----VLLIGEFKAN 55
           ++T +  P  +TR F  +L R      V+ R K S+K+I +  +      V+++ E +  
Sbjct: 188 LITFADNPVTKTRKFGLELSRIFPNALVKIRNKSSVKKICKSAEREEFTDVVIVNEDRRK 247

Query: 56  P-GTLVVYDVENEKRLSSFISVKLQREI 82
           P G LV++            +VKL  +I
Sbjct: 248 PNGLLVIHLPNGPTAHFKLSNVKLTSDI 275
>sp|O13823|YEE7_SCHPO HYPOTHETICAL 33.4 KDA PROTEIN C19A8.07C IN CHROMOSOME I
 pir||T37954 hypothetical protein SPAC19A8.07c - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAB11643.1| (Z98974) hypothetical protein [Schizosaccharomyces pombe]
          Length = 289

 Score = 37.8 bits (87), Expect = 0.071
 Identities = 21/88 (23%), Positives = 43/88 (48%), Gaps = 7/88 (7%)

Query: 2   ILTTSRKPSQRTRSFARDLERTLNIPY-VQRGKLSLKEIFEIDKH-----VLLIGEFKAN 55
           ++TTSR+PS R   FA+++   +   Y + RG + +  + E  +      ++++ E +  
Sbjct: 87  LVTTSREPSSRLAQFAKEVRLLIPNSYRLNRGNIVVGSLVEAARANDITDIVILHEHRGI 146

Query: 56  PGTLVVYDV-ENEKRLSSFISVKLQREI 82
           P  LV+  +        S  +V L+ +I
Sbjct: 147 PDGLVISHLPYGPTLSFSLHNVVLRHDI 174
>dbj|BAB14086.1| (AK022537) unnamed protein product [Homo sapiens]
          Length = 349

 Score = 33.5 bits (76), Expect = 1.4
 Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 9/90 (10%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIP--YVQRGKLSLKEIFE--IDKHV---LLIGEFK 53
           +++TTS +P  RT      L   +     Y +RG L+LK+I    I +     ++I E +
Sbjct: 144 ILITTSDRPHGRTVRLCEQLSTVIPNSHVYYRRG-LALKKIIPQCIARDFTDLIVINEDR 202

Query: 54  ANP-GTLVVYDVENEKRLSSFISVKLQREI 82
             P G ++ +            SV+L++EI
Sbjct: 203 KTPNGLILSHLPNGPTAHFKMSSVRLRKEI 232
>sp|P54223|BETA_RHIME CHOLINE DEHYDROGENASE (CHD)
          Length = 549

 Score = 32.7 bits (74), Expect = 2.5
 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 17/92 (18%)

Query: 31  RGKLSLKEIFEIDKHVLLIGEFKANPGTLVVYDVENEKRLSSFISVKLQREICGEKIYND 90
           RG +SL+             + KA+P     Y    E        V+L REI G+K ++ 
Sbjct: 386 RGNVSLRS-----------SDPKADPVIRFNYMSHPEDWEKFRHCVRLTREIFGQKAFD- 433

Query: 91  DGIRIKISRELKDNEEFQKYYEIYDEFLFQHL 122
               +    E++  E+ Q   EI D FL +HL
Sbjct: 434 ----LYRGPEIQPGEKVQTDEEI-DGFLREHL 460
>pir||T50616 hypothetical protein DKFZp761G0415.1 - human (fragment)
          Length = 256

 Score = 32.7 bits (74), Expect = 2.6
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 7/89 (7%)

Query: 1   MILTTSRKPSQRTRSFARDLERTLNIP--YVQRG---KLSLKEIFEID-KHVLLIGEFKA 54
           +++TTS +P  RT      L   +     Y +RG   K  + +    D   +++I E + 
Sbjct: 51  ILITTSDRPHGRTVRLCEQLSTVIPNSHVYYRRGLALKKIIPQCIARDFTDLIVINEDRK 110

Query: 55  NP-GTLVVYDVENEKRLSSFISVKLQREI 82
            P G ++ +            SV+L++EI
Sbjct: 111 TPNGLILSHLPNGPTAHFKMSSVRLRKEI 139
CPU time:    36.66 user secs.	    1.41 sys. secs	   38.07 total secs.

  Database: nr
    Posted date:  Feb 10, 2001  7:10 PM
  Number of letters in database: 195,544,254
  Number of sequences in database:  618,844
  
Lambda     K      H
   0.318    0.160    0.458 

Gapped
Lambda     K      H
   0.270   0.0561    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75698593
Number of Sequences: 618844
Number of extensions: 3508149
Number of successful extensions: 9247
Number of sequences better than 10.0: 25
Number of HSP's better than 10.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 9224
Number of HSP's gapped (non-prelim): 28
length of query: 169
length of database: 195,544,254
effective HSP length: 54
effective length of query: 115
effective length of database: 162,126,678
effective search space: 18644567970
effective search space used: 18644567970
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.5 bits)
S2: 69 (31.0 bits)