BLASTX 2.1.2 [Nov-13-2000]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

RID: 979914585-27675-14189

Query= (10,321 letters)

Database: nr
           605,060 sequences; 191,393,013 total letters

[Graphic omitted: Distribution of 85 Blast Hits on the Query Sequence]

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T47166  hypothetical protein DKFZp762B245.1 - human (fr...   126  5e-27
gb|AAC53190.1|  (U92453) WW domain binding protein 3; WBP3 [...    87  1e-18
gb|AAC34395.1|  (AF056116) unknown [Takifugu rubripes]             87  2e-17
ref|NP_005883.1|  formin-like; chromosome  17 open reading f...    92  1e-16
gb|AAF25953.1|  (AF215666) formin-like protein [Mus musculus]      86  6e-15
ref|NP_062653.1|  formin-related gene in leukocytes; lymphoc...    86  6e-15
gb|AAF49761.1|  (AE003536) CG6807 gene product [Drosophila m...    62  2e-07
gb|AAF60718.1|  (AC024798) contains similarity to TR:Q9Z2V7 ...    50  1e-04
dbj|BAA31641.1|  (AB014566) KIAA0666 protein [Homo sapiens]        42  0.004
pir||T04455  hypothetical protein F4D11.90 - Arabidopsis tha...    42  0.18
sp|P18616|RPB1_ARATH  DNA-DIRECTED RNA POLYMERASE II LARGEST...    40  0.51
emb|CAA21466.2|  (AL031986) DNA-directed RNA polymerase (EC ...    39  0.88
pir||JDMU1  DNA-directed RNA polymerase (EC 2.7.7.6) II larg...    39  0.88
sp|P31635|RPB0_ARATH  DNA-DIRECTED RNA POLYMERASE II LARGEST...    39  0.88
gb|AAF45601.1|  (AE003420) EG:114D9.2 gene product [Drosophi...    34  0.97
emb|CAA21052.1|  (AL031640) /prediction=(method:""genscan"",...    34  1.00
emb|CAB95888.1|  (AL359988) putative serine-threonine protei...    39  1.1
emb|CAA36734.1|  (X52493) DNA-directed RNA polymerase [Glyci...    39  1.1
pir||S14181  DNA-directed RNA polymerase (EC 2.7.7.6) larges...    39  1.1
pir||T07796  DNA-directed RNA polymerase (EC 2.7.7.6) larges...    39  1.1
pir||S14182  DNA-directed RNA polymerase (EC 2.7.7.6) larges...    39  1.1
dbj|BAA20835.1|  (AB002379) KIAA0381 [Homo sapiens]                36  1.3
gb|AAC99858.1|  (U31159) CR16 [Rattus norvegicus] >gi|409637...    39  1.5
gb|AAA87791.1|  (U25281) SH3 domain binding protein [Rattus ...    39  1.5
pir||S14183  DNA-directed RNA polymerase (EC 2.7.7.6) larges...    38  2.0
gb|AAF48438.1|  (AE003498) CG15032 gene product [Drosophila ...    38  2.0
sp|P25439|BRM_DROME  HOMEOTIC GENE REGULATOR (BRAHMA PROTEIN...    29  2.0
gb|AAF49557.1|  (AE003529) brm gene product [alt 1] [Drosoph...    29  2.0
gb|AAF49558.2|  (AE003529) brm gene product [alt 2] [Drosoph...    29  2.1
dbj|BAA07534.1|  (D38529) DRPLA protein [Homo sapiens]             37  3.3
gb|AAG45420.1|AF309494_1  (AF309494) vegetative cell wall pr...    37  3.3
sp|P16253|CAC3_HAECO  CUTICLE COLLAGEN 3A3 >gi|159169|gb|AAA...    37  3.3
ref|XP_006637.1|  similar to dentatorubral-pallidoluysian at...    37  3.3
pir||A44984  collagen - nematode (Haemonchus contortus)            37  3.3
gb|AAB51321.1|  (U47924) DRPLA [Homo sapiens]                      37  3.3
sp|P40602|APG_ARATH  ANTER-SPECIFIC PROLINE-RICH PROTEIN APG...    37  3.3
pir||S50832  atrophin-1 - human                                    37  3.3
ref|NP_001931.1|  atrophin-1 [Homo sapiens] >gi|7512295|pir|...    37  3.3
emb|CAB88971.1|  (AL353864) hypothetical protein SC8F11.20c....    37  3.3
sp|P54259|DRPL_HUMAN  ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSI...    37  3.3
gb|AAF79900.1|AC022472_9  (AC022472) Contains a strong simil...    37  3.3
emb|CAA26904.1|  (X03128) put. RNA polymerase II largest sub...    37  4.4
pir||S50755  hypothetical protein VSP-3 - Chlamydomonas rein...    37  5.7
ref|NP_033115.1|  RNA polymerase II 1 [Mus musculus] >gi|904...    37  5.7
sp|P11414|RPB1_CRIGR  DNA-DIRECTED RNA POLYMERASE II LARGEST...    37  5.7
emb|CAA60502.1|  (X86819) Microtubule-associated protein 4 [...    37  5.7
ref|NP_010141.1|  RNA polymerase II large subunit; Rpo21p [S...    37  5.7
pir||I38186  RNA polymerase II largest subunit - human >gi|8...    37  5.7
sp|P08775|RPB1_MOUSE  DNA-DIRECTED RNA POLYMERASE II LARGEST...    37  5.7
dbj|BAA22376.1|  (D87293) RNA polymerase II largest subunit ...    37  5.7
ref|NP_000928.1|  polymerase (RNA) II (DNA directed) polypep...    37  5.7
pir||I65981  fatty acid omega-hydroxylase (EC 1.14.15.-) cyt...    37  5.7
gb|AAB58418.1|  (U37500) RNA polymerase II largest subunit [...    37  5.7
gb|AAC02612.1|  (AF045646) contains similarity to collagens ...    32  6.2
sp|O43101|CBF5_CANAL  CENTROMERE/MICROTUBULE BINDING PROTEIN...    36  7.4
dbj|BAB15254.1|  (AK025837) unnamed protein product [Homo sa...    36  7.4
pir||S50754  hypothetical protein WP6 - Chlamydomonas eugame...    36  7.4
gb|AAC69221.2|  (AF101312) contains similarity to human diap...    29  9.4
ref|NP_013276.1|  major low affinity 55 kDa Centromere/micro...    36  9.7
ref|XP_009188.1|  death-associated protein kinase 3 [Homo sa...    36  9.7
gb|AAG12789.1|AC023913_8  (AC023913) transcription factor, p...    36  9.7
pir||T34947  hypothetical protein SC4A10.10c - Streptomyces ...    36  9.7

Alignments

>pir||T47166 hypothetical protein DKFZp762B245.1 - human (fragment)
 emb|CAB82400.1| (AL162062) hypothetical protein [Homo sapiens]
          Length = 425

 Score =  126 bits (316), Expect = 5e-27
 Identities = 64/64 (100%), Positives = 64/64 (100%)
 Frame = +1

Query: 8308 DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR 8487
            DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR
Sbjct: 1    DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR 60

Query: 8488 AIHT 8499
            AIHT
Sbjct: 61   AIHT 64
 Score = 53.9 bits (128), Expect = 3e-05
 Identities = 38/103 (36%), Positives = 43/103 (40%), Gaps = 3/103 (2%)
 Frame = +3

Query: 9042 KVRGRQAKEHRPVYE---GRMVPSRTSSQGGPFSLPGAWPLHPAPKAQPNPFGLSFFPTP 9212
            ++R RQAKEHRPVYE   G +    T  +  PF+                          
Sbjct: 370  ELRRRQAKEHRPVYEGKDGTIEDIITVLKSVPFT-------------------------- 403

Query: 9213 HEA*AGWLSPGPQC*RVSLSRARTAKRGSRFFCDAAHHDESNC 9341
                                 ARTAKRGSRFFCDAAHHDESNC
Sbjct: 404  ---------------------ARTAKRGSRFFCDAAHHDESNC 425
>gb|AAC53190.1| (U92453) WW domain binding protein 3; WBP3 [Mus musculus]
          Length = 164

 Score = 86.7 bits (213), Expect(2) = 1e-18
 Identities = 41/55 (74%), Positives = 48/55 (86%)
 Frame = -3

Query: 4121 LSACQETYENTSHQVHTLRRLIKEKEEAFQRRCHLEPNVRGLESVDSEALSQSRP 3957
            L + +ETYENTS+QVHTLRRLIKEKEEAFQRRCHLEP+ RGLES+  EAL++  P
Sbjct: 50   LESIKETYENTSNQVHTLRRLIKEKEEAFQRRCHLEPSARGLESMGGEALARVGP 104
 Score = 32.3 bits (72), Expect(2) = 1e-18
 Identities = 13/17 (76%), Positives = 15/17 (87%)
 Frame = -1

Query: 3970 ARVGPAELSEGMPPKDL 3920
            ARVGP EL+EG+PP DL
Sbjct: 100  ARVGPTELTEGIPPSDL 116
>gb|AAC34395.1| (AF056116) unknown [Takifugu rubripes]
          Length = 1037

 Score = 86.7 bits (213), Expect = 5e-15
 Identities = 39/59 (66%), Positives = 54/59 (91%)
 Frame = +1

Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRS 8469
            ++L+L++FEELFKT+AQGP +DL C+K+K AQKA +KVT+L+ANR+KNLAITLRKA ++
Sbjct: 618  RELELERFEELFKTRAQGPIMDLSCTKSKVAQKAVNKVTILDANRSKNLAITLRKANKT 676
 Score = 81.3 bits (199), Expect(2) = 2e-17
 Identities = 37/44 (84%), Positives = 43/44 (97%)
 Frame = +3

Query: 7878 LAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILEV 8009
            +AIRIKKPIKTKFRLPVFNWTALKPNQI+GTVF+E+DDE+ LE+
Sbjct: 579  VAIRIKKPIKTKFRLPVFNWTALKPNQINGTVFNEIDDERELEL 622
 Score = 46.6 bits (109), Expect = 0.006
 Identities = 22/39 (56%), Positives = 31/39 (79%)
 Frame = -3

Query: 4109 QETYENTSHQVHTLRRLIKEKEEAFQRRCHLEPNVRGLE 3993
            +ETYE+TS QV+TLR++IKEK+ AFQR  ++E  +  LE
Sbjct: 432  RETYESTSSQVNTLRKVIKEKDAAFQRHFNIERRLLELE 470
 Score = 38.1 bits (87), Expect = 2.0
 Identities = 31/103 (30%), Positives = 40/103 (38%), Gaps = 3/103 (2%)
 Frame = +3

Query: 9042 KVRGRQAKEHRPVYE---GRMVPSRTSSQGGPFSLPGAWPLHPAPKAQPNPFGLSFFPTP 9212
            ++R RQAK+HRPVYE   G +    T  +  PF+                          
Sbjct: 982  ELRKRQAKDHRPVYEGKDGTIEDIITVLKSVPFT-------------------------- 1015

Query: 9213 HEA*AGWLSPGPQC*RVSLSRARTAKRGSRFFCDAAHHDESNC 9341
                                 ARTAKRGSRFFC+A   D++NC
Sbjct: 1016 ---------------------ARTAKRGSRFFCEANLCDDANC 1037
 Score = 33.9 bits (76), Expect(2) = 2e-17
 Identities = 15/29 (51%), Positives = 20/29 (68%)
 Frame = +1

Query: 7717 PPDKCPPAPPLPGAAPSVVLTVGLSGEYP 7803
            PP   P APPLP A+PSV+L+V +  + P
Sbjct: 558  PPPPPPLAPPLPDASPSVILSVAIRIKKP 586
>ref|NP_005883.1| formin-like; chromosome  17 open reading frame 1; chromosome 17 open
            reading frame 1B [Homo sapiens]
 ref|XP_008347.1| chromosome 17 open reading frame 1B [Homo sapiens]
 emb|CAA07870.1| (AJ008112) C17orf1 protein [Homo sapiens]
          Length = 463

 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 46/67 (68%), Positives = 53/67 (78%)
 Frame = +1

Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
            Q+LD+  FEE FKTK+QGP+LDL   K+K AQKA SK TL+EANRAKNLAITLRK    A
Sbjct: 29   QELDMSDFEEQFKTKSQGPSLDLSALKSKAAQKAPSKATLIEANRAKNLAITLRKGNLGA 88

Query: 8473 EEICRAI 8493
            E IC+AI
Sbjct: 89   ERICQAI 95
 Score = 48.9 bits (115), Expect = 0.001
 Identities = 19/29 (65%), Positives = 27/29 (92%)
 Frame = +3

Query: 7920 LPVFNWTALKPNQISGTVFSELDDEKILE 8006
            +P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 1    MPLLNWVALKPSQITGTVFTELNDEKVLQ 29
>gb|AAF25953.1| (AF215666) formin-like protein [Mus musculus]
          Length = 1094

 Score = 86.3 bits (212), Expect = 6e-15
 Identities = 42/69 (60%), Positives = 52/69 (74%)
 Frame = +1

Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
            Q+LD++ FEE FKTK+QGP LD+   K K +QKA +K  L+EANRAKNLAITLRK    A
Sbjct: 665  QELDMNDFEEHFKTKSQGPCLDISALKGKASQKAPTKTILIEANRAKNLAITLRKGNLGA 724

Query: 8473 EEICRAIHT 8499
            + IC+AI T
Sbjct: 725  DRICQAIET 733
 Score = 67.0 bits (162), Expect(2) = 1e-10
 Identities = 27/41 (65%), Positives = 38/41 (91%)
 Frame = +3

Query: 7884 IRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
            ++ KKPI+TKFR+P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 625  VKAKKPIQTKFRMPLLNWVALKPSQITGTVFTELNDEKVLQ 665
 Score = 25.0 bits (53), Expect(2) = 1e-10
 Identities = 9/11 (81%), Positives = 9/11 (81%)
 Frame = +1

Query: 7732 PPAPPLPGAAP 7764
            PPAPPLPG  P
Sbjct: 575  PPAPPLPGDLP 585
>ref|NP_062653.1| formin-related gene in leukocytes; lymphocyte specific formin related
            protein [Mus musculus]
 pir||T13963 formin related protein, lymphocyte specific - mouse
 gb|AAD01273.1| (AF006466) lymphocyte specific formin related protein [Mus musculus]
          Length = 1064

 Score = 86.3 bits (212), Expect = 6e-15
 Identities = 42/69 (60%), Positives = 52/69 (74%)
 Frame = +1

Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
            Q+LD++ FEE FKTK+QGP LD+   K K +QKA +K  L+EANRAKNLAITLRK    A
Sbjct: 639  QELDMNDFEEHFKTKSQGPCLDISALKGKASQKAPTKTILIEANRAKNLAITLRKGNLGA 698

Query: 8473 EEICRAIHT 8499
            + IC+AI T
Sbjct: 699  DRICQAIET 707
 Score = 67.0 bits (162), Expect(2) = 6e-10
 Identities = 27/41 (65%), Positives = 38/41 (91%)
 Frame = +3

Query: 7884 IRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
            ++ KKPI+TKFR+P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 599  VKAKKPIQTKFRMPLLNWVALKPSQITGTVFTELNDEKVLQ 639
 Score = 22.7 bits (47), Expect(2) = 6e-10
 Identities = 7/14 (50%), Positives = 9/14 (64%)
 Frame = +1

Query: 7732 PPAPPLPGAAPSVV 7773
            PP PP PG  P ++
Sbjct: 575  PPPPPPPGGPPDIL 588
>gb|AAF49761.1| (AE003536) CG6807 gene product [Drosophila melanogaster]
          Length = 1043

 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 25/42 (59%), Positives = 34/42 (80%)
 Frame = +3

Query: 7881 AIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
            A+ IK+ + TK++LP  NW ALKPNQ+ GT+F+ELDDEKI +
Sbjct: 554  AMTIKRKVPTKYKLPTLNWIALKPNQVRGTIFNELDDEKIFK 595
 Score = 37.4 bits (85), Expect = 3.3
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
 Frame = +1

Query: 8293 QDLDLDKFEELFK--------TKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAIT 8448
            + +D ++FEE FK          + G  +D     +K   K    V+LLE  R +N+AI+
Sbjct: 595  KQIDFNEFEERFKIGIGGALRNGSNGTEVDGSLQSSKRF-KRPDNVSLLEHTRLRNIAIS 653

Query: 8449 LRKAGRSAEEICRAIHT 8499
             RK G   +++  AIH+
Sbjct: 654  RRKLGMPIDDVIAAIHS 670
>gb|AAF60718.1| (AC024798) contains similarity to TR:Q9Z2V7 [Caenorhabditis elegans]
          Length = 1164

 Score = 50.1 bits (118), Expect(2) = 1e-04
 Identities = 25/47 (53%), Positives = 31/47 (65%)
 Frame = +3

Query: 7866 PPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
            P S  A  IKK  +TK +LP  NWTA+KP Q   TVF +L+DE I+E
Sbjct: 575  PASNDAKTIKKIYQTKNKLPQLNWTAMKPMQAKNTVFEKLNDELIIE 621
 Score = 37.7 bits (86), Expect = 2.6
 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 15/83 (18%)
 Frame = +1

Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTA---------------QKAASKVTLLEANR 8427
            + LD  K EE+FK     P L L   K++ +                 +A K TLL+  R
Sbjct: 621  EKLDFSKLEEMFKLAQ--PTLGLAEPKSEQSVIGQVSPGSTTSAAGTSSARKNTLLDTKR 678

Query: 8428 AKNLAITLRKAGRSAEEICRAIH 8496
             +N+AIT RK    A+ I  A+H
Sbjct: 679  LQNVAITRRKVAMDAKSIMAAVH 701
 Score = 21.2 bits (43), Expect(2) = 1e-04
 Identities = 11/26 (42%), Positives = 11/26 (42%)
 Frame = +1

Query: 7717 PPDKCPPAPPLPGAAPSVVLTVGLSG 7794
            PP   PP P L G  P      GL G
Sbjct: 549  PPPPPPPPPMLGGPPPPPPPPGGLMG 574
>dbj|BAA31641.1| (AB014566) KIAA0666 protein [Homo sapiens]
          Length = 1085

 Score = 41.6 bits (96), Expect(2) = 0.004
 Identities = 22/63 (34%), Positives = 36/63 (56%), Gaps = 3/63 (4%)
 Frame = +3

Query: 7830 PGVCVLGPCVTDPPSKLAIRIKK---PIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
            PG   LG  +  P + + + +KK   P  T   L  FNW+ L  N++ GTV++E+DD K+
Sbjct: 585  PGPPPLGAIMPPPGAPMGLALKKKSIPQPTN-ALKSFNWSKLPENKLEGTVWTEIDDTKV 643

Query: 8001 LEV 8009
             ++
Sbjct: 644  FKI 646
 Score = 24.6 bits (52), Expect(2) = 0.004
 Identities = 11/15 (73%), Positives = 11/15 (73%)
 Frame = +1

Query: 7711 LLPPDKCPPAPPLPG 7755
            LLPP   PP PPLPG
Sbjct: 555  LLPP---PPPPPLPG 566
>pir||T04455 hypothetical protein F4D11.90 - Arabidopsis thaliana
 emb|CAA18590.1| (AL022537) putative protein [Arabidopsis thaliana]
 emb|CAB79988.1| (AL161581) putative protein kinase [Arabidopsis thaliana]
          Length = 731

 Score = 41.6 bits (96), Expect = 0.18
 Identities = 45/136 (33%), Positives = 55/136 (40%), Gaps = 7/136 (5%)
 Frame = +1

Query: 9859  PCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGC 10038
             P LSPLP  L SP P P       SP P  +PT  P  L   S  S  + +PP P L   
Sbjct: 35    PPLSPLPPPLSSPPPLP-------SPPPLSAPTASPPPLPVESPPSPPIESPPPPLLE-- 85

Query: 10039 S*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPR---APGPH*DP---- 10197
             S    P     P    +S+P+G    P +    A+ + PPS PP     PG    P    
Sbjct: 86    SPPPPPLESPSPPSPHVSAPSGSPPLPFL---PAKPSPPPSSPPSETVPPGNTISPPPRS 142

Query: 10198 LGGRSTSPLRVLQVQP 10245
             L   ST P+      P
Sbjct: 143   LPSESTPPVNTASPPP 158
>sp|P18616|RPB1_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (VERSION 1)
 emb|CAA37130.1| (X52954) RNA polymerase II [Arabidopsis thaliana]
          Length = 1841

 Score = 40.0 bits (92), Expect = 0.51
 Identities = 42/133 (31%), Positives = 57/133 (42%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1647  TSPAYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPAYSPTSPGYSP-- 1702

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P+YG + PS    +A++SPS +  P   + R   
Sbjct: 1703  ---------TSPSYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1750

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 1751  ASPYSPTSPNYSP 1763
>emb|CAA21466.2| (AL031986) DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain
             [Arabidopsis thaliana]
 emb|CAB81489.1| (AL161588) DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain
             [Arabidopsis thaliana]
          Length = 1840

 Score = 39.3 bits (90), Expect = 0.88
 Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1646  TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1701

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+YG + PS    +A++SPS +  P   + R   
Sbjct: 1702  ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1749

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 1750  ASPYSPTSPNYSP 1762
>pir||JDMU1 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - Arabidopsis
             thaliana
          Length = 1834

 Score = 39.3 bits (90), Expect = 0.88
 Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1640  TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1695

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+YG + PS    +A++SPS +  P   + R   
Sbjct: 1696  ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1743

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 1744  ASPYSPTSPNYSP 1756
>sp|P31635|RPB0_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (VERSION 2)
 pir||JDMU2 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain (version 2) -
             Arabidopsis thaliana
 emb|CAA36735.1| (X52494) DNA-directed RNA polymerase [Arabidopsis thaliana]
          Length = 1860

 Score = 39.3 bits (90), Expect = 0.88
 Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1666  TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1721

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+YG + PS    +A++SPS +  P   + R   
Sbjct: 1722  ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1769

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 1770  ASPYSPTSPNYSP 1782
>gb|AAF45601.1| (AE003420) EG:114D9.2 gene product [Drosophila melanogaster]
          Length = 1429

 Score = 33.9 bits (76), Expect(2) = 0.97
 Identities = 18/50 (36%), Positives = 27/50 (54%)
 Frame = +3

Query: 7851 PCVTDPPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
            P   D P K   +   P+K+      FNW+ L   ++ GTV+SELD+ K+
Sbjct: 889  PPKVDLPKKNVPQPTNPLKS------FNWSKLPDAKLQGTVWSELDESKL 932
 Score = 23.9 bits (50), Expect(2) = 0.97
 Identities = 11/22 (50%), Positives = 13/22 (59%)
 Frame = +1

Query: 7717 PPDKCPPAPPLPGAAPSVVLTV 7782
            PP  CP APP P   PS+  T+
Sbjct: 867  PPPPCPGAPPPP---PSMAQTM 885
>emb|CAA21052.1| (AL031640) /prediction=(method:""genscan"", version:""1.0"",
            score:""400.91"")~/prediction=(method:""genefinder"",
            version:""084"")~/match=(desc:""DIA-12C PROTEIN"",
            species:""HOMO SAPIENS (HUMAN)"",
            ranges:(query:29998..30156, target:SPTREMBL::O60878:>
          Length = 979

 Score = 33.9 bits (76), Expect(2) = 1.00
 Identities = 18/50 (36%), Positives = 27/50 (54%)
 Frame = +3

Query: 7851 PCVTDPPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
            P   D P K   +   P+K+      FNW+ L   ++ GTV+SELD+ K+
Sbjct: 439  PPKVDLPKKNVPQPTNPLKS------FNWSKLPDAKLQGTVWSELDESKL 482
 Score = 23.9 bits (50), Expect(2) = 1.00
 Identities = 11/22 (50%), Positives = 13/22 (59%)
 Frame = +1

Query: 7717 PPDKCPPAPPLPGAAPSVVLTV 7782
            PP  CP APP P   PS+  T+
Sbjct: 417  PPPPCPGAPPPP---PSMAQTM 435
>emb|CAB95888.1| (AL359988) putative serine-threonine protein kinase [Streptomyces
            coelicolor A3(2)]
          Length = 580

 Score = 38.9 bits (89), Expect = 1.1
 Identities = 26/94 (27%), Positives = 37/94 (38%)
 Frame = +3

Query: 7692 LGHCVISSSPRQVSPSPTSPWCCTLCGVDSGPVR*VSPGLWAGLVGPGVCVLGPCVTDPP 7871
            LG C+ ++   + +P+    WC    G D G      P  W  + GP V V  P     P
Sbjct: 249  LGRCLATAPEERATPAEIVEWCRRELGRDGG--EGAGPAGWREIAGPPVTVPPPAAATGP 306

Query: 7872 SKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTV 7973
            +  A  +  P  T   +    WT  + N   GTV
Sbjct: 307  ATAAAPVAAPGPT--AVHTTPWTVPEGNVAPGTV 338
>emb|CAA36734.1| (X52493) DNA-directed RNA polymerase [Glycine max]
          Length = 494

 Score = 38.9 bits (89), Expect = 1.1
 Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 304   TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 359

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+Y  + PS    +A++SPS +  P  P  R   
Sbjct: 360   ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 407

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 408   SSPYSPTSPNYSP 420
>pir||S14181 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain  (isoform B1) -
             soybean (fragment)
          Length = 650

 Score = 38.9 bits (89), Expect = 1.1
 Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 460   TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 515

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+Y  + PS    +A++SPS +  P  P  R   
Sbjct: 516   ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 563

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 564   SSPYSPTSPNYSP 576
>pir||T07796 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain - soybean
             (fragment)
 emb|CAA36733.1| (X52492) DNA-directed RNA polymerase [Glycine max]
          Length = 625

 Score = 38.9 bits (89), Expect = 1.1
 Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 464   TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 519

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+Y  + PS    +A++SPS +  P  P  R   
Sbjct: 520   ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 567

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 568   SSPYSPTSPNYSP 580
>pir||S14182 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain  (isoform B2) -
             soybean (fragment)
          Length = 491

 Score = 38.9 bits (89), Expect = 1.1
 Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 301   TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 356

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T P  S  S   + P  P+Y  + PS    +A++SPS +  P  P  R   
Sbjct: 357   ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 404

Query: 10215 FTPQSPTSPTSGP 10253
              +P SPTSP   P
Sbjct: 405   SSPYSPTSPNYSP 417
>dbj|BAA20835.1| (AB002379) KIAA0381 [Homo sapiens]
          Length = 864

 Score = 36.2 bits (82), Expect(2) = 1.3
 Identities = 15/48 (31%), Positives = 27/48 (56%)
 Frame = +3

Query: 7866 PPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILEV 8009
            P S + +R K+  +    L  FNW  L   ++ GTV++E+DD ++  +
Sbjct: 374  PSSDVPLRKKRVPQPSHPLKSFNWVKLNEERVPGTVWNEIDDMQVFRI 421
 Score = 21.2 bits (43), Expect(2) = 1.3
 Identities = 11/30 (36%), Positives = 15/30 (49%), Gaps = 6/30 (20%)
 Frame = +1

Query: 7681 SLDAWVTVSFLLPPDK------CPPAPPLP 7752
            +L + +T + L PP        CPP PP P
Sbjct: 317  TLSSSMTTNDLPPPPPPLPFACCPPPPPPP 346
>gb|AAC99858.1| (U31159) CR16 [Rattus norvegicus]
 gb|AAC99859.1| (U31169) SH3 domain binding protein [Rattus norvegicus]
          Length = 485

 Score = 38.5 bits (88), Expect = 1.5
 Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 8/118 (6%)
 Frame = +1

Query: 9850  KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
             +  P   P P     P P PL   A  SP+ A++P   P   +  S  S S   PP P  
Sbjct: 291   REPPAPPPPPPPPPPPPPPPLPTYASCSPRAAVAP---PPPPLPGSSNSGSETPPPLP-- 345

Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAA--------PPSGPPRAP 10179
                     P SPS    ++L +P G     ++++ + R           PP+ P R+P
Sbjct: 346   --------PKSPSFQTQKALPTPPGAPGPQIILQKKRRGPGAGGGKLNPPPAPPARSP 395
>gb|AAA87791.1| (U25281) SH3 domain binding protein [Rattus norvegicus]
 prf||2205340A CR16 gene [Rattus norvegicus]
          Length = 451

 Score = 38.5 bits (88), Expect = 1.5
 Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 8/118 (6%)
 Frame = +1

Query: 9850  KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
             +  P   P P     P P PL   A  SP+ A++P   P   +  S  S S   PP P  
Sbjct: 291   REPPAPPPPPPPPPPPPPPPLPTYASCSPRAAVAP---PPPPLPGSSNSGSETPPPLP-- 345

Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAA--------PPSGPPRAP 10179
                     P SPS    ++L +P G     ++++ + R           PP+ P R+P
Sbjct: 346   --------PKSPSFQTQKALPTPPGAPGPQIILQKKRRGPGAGGGKLNPPPAPPARSP 395
>pir||S14183 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain  (isoform C) -
             soybean (fragment)
 emb|CAA36736.1| (X52495) DNA-directed RNA polymerase [Glycine max]
          Length = 977

 Score = 38.1 bits (87), Expect = 2.0
 Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 3/133 (2%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SSEMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVGL 10037
             T P  ++  P  +PT      SP+ SPTS  P + P  P  SL     +  S    P   
Sbjct: 797   TSPAYSSTSPAYSPT------SPSYSPTS--PAYSPTSPSYSLTSPSYSPTSPSYSPTSP 848

Query: 10038 LLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRP--SRRQK 10211
                  S    P+    S     P  P+Y  + PS    +A++SPS +  P  P  S    
Sbjct: 849   SYSPTSPAYSPTSPSYS-----PTSPSYSPTSPSYNPQSAKYSPSLAYSPSSPRLSPTSP 903

Query: 10212 HFTPQSPT-SPTS 10247
             +++P SP+ SPTS
Sbjct: 904   NYSPTSPSYSPTS 916
>gb|AAF48438.1| (AE003498) CG15032 gene product [Drosophila melanogaster]
          Length = 277

 Score = 38.1 bits (87), Expect = 2.0
 Identities = 19/49 (38%), Positives = 27/49 (54%), Gaps = 1/49 (2%)
 Frame = -3

Query: 7856 TGAKDTHPWSHQPCP-EPWGYSPDRPTVNTTEGAAPGRGGAGGHLSGGR 7713
            T ++  HP++H P    P G+ P+    N   G+A   G AGG +SGGR
Sbjct: 150  TNSRGLHPYAHSPAHGNPPGFYPNMWYPNAPYGSAGAAGSAGGAVSGGR 198
>sp|P25439|BRM_DROME HOMEOTIC GENE REGULATOR (BRAHMA PROTEIN)
 pir||A42091 transcription activator SNF2/SWI2 homolog brm - fruit fly  (Drosophila
             melanogaster)
 gb|AAA19661.1| (M85049) brahma protein [Drosophila melanogaster]
          Length = 1638

 Score = 28.9 bits (63), Expect(2) = 2.0
 Identities = 20/60 (33%), Positives = 24/60 (39%)
 Frame = +2

Query: 9923  PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
             P     Q  P  GTP   S PP   P+   +PG       +  V   PP +  G  LPPQ
Sbjct: 241   PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 294
 Score = 27.7 bits (60), Expect(2) = 2.0
 Identities = 19/59 (32%), Positives = 23/59 (38%)
 Frame = +3

Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
             P  P  P  G  PP Q+    Q     S+PP  P    +H  P      + GP  GQ L
Sbjct: 290   PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 346
>gb|AAF49557.1| (AE003529) brm gene product [alt 1] [Drosophila melanogaster]
          Length = 1638

 Score = 28.9 bits (63), Expect(2) = 2.0
 Identities = 20/60 (33%), Positives = 24/60 (39%)
 Frame = +2

Query: 9923  PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
             P     Q  P  GTP   S PP   P+   +PG       +  V   PP +  G  LPPQ
Sbjct: 241   PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 294
 Score = 27.7 bits (60), Expect(2) = 2.0
 Identities = 19/59 (32%), Positives = 23/59 (38%)
 Frame = +3

Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
             P  P  P  G  PP Q+    Q     S+PP  P    +H  P      + GP  GQ L
Sbjct: 290   PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 346
>gb|AAF49558.2| (AE003529) brm gene product [alt 2] [Drosophila melanogaster]
          Length = 1537

 Score = 28.9 bits (63), Expect(2) = 2.1
 Identities = 20/60 (33%), Positives = 24/60 (39%)
 Frame = +2

Query: 9923  PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
             P     Q  P  GTP   S PP   P+   +PG       +  V   PP +  G  LPPQ
Sbjct: 140   PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 193
 Score = 27.7 bits (60), Expect(2) = 2.1
 Identities = 19/59 (32%), Positives = 23/59 (38%)
 Frame = +3

Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
             P  P  P  G  PP Q+    Q     S+PP  P    +H  P      + GP  GQ L
Sbjct: 189   PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 245
>dbj|BAA07534.1| (D38529) DRPLA protein [Homo sapiens]
          Length = 1182

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 160   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 213   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 272   PTKPPTTP 279
>gb|AAG45420.1|AF309494_1 (AF309494) vegetative cell wall protein gp1 [Chlamydomonas reinhardtii]
          Length = 555

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 35/118 (29%), Positives = 43/118 (35%)
 Frame = +1

Query: 9868  SPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*G 10047
             SP P    SP P      A  SP P + P+  P S    +  S    APP P        
Sbjct: 188   SPAPP---SPAPPVPPSPAPPSPAPPVPPSPAPPSPPSPAPPSPPSPAPPSP-------- 236

Query: 10048 HCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGRSTSP 10221
               P +P  PV  S + P+     P         + PP  PPR P P   P+     SP
Sbjct: 237   -SPPAPPSPVPPSPAPPSPAPPSPKPPAPPPPPSPPPPPPPRPPFPANTPMPPSPPSP 293
>sp|P16253|CAC3_HAECO CUTICLE COLLAGEN 3A3
 gb|AAA29173.1| (M32820) 3A3 collagen [Haemonchus contortus]
          Length = 295

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 20/48 (41%), Positives = 24/48 (49%)
 Frame = +1

Query: 10054 PGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
             PG+P  P    L  P G  C+P+ +   A   A P GPP  PGP  DP
Sbjct: 115   PGAPGLPGVPGLPPPDG-SCEPVSIPPCAECPAGPPGPPGKPGPPGDP 161
>ref|XP_006637.1| similar to dentatorubral-pallidoluysian atrophy (atrophin-1) (H.
             sapiens) [Homo sapiens]
 ref|XP_006975.1| atrophin-1 [Homo sapiens]
          Length = 1189

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 159   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 211

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 212   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 270

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 271   PTKPPTTP 278
>pir||A44984 collagen - nematode (Haemonchus contortus)
          Length = 295

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 20/48 (41%), Positives = 24/48 (49%)
 Frame = +1

Query: 10054 PGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
             PG+P  P    L  P G  C+P+ +   A   A P GPP  PGP  DP
Sbjct: 115   PGAPGLPGVPGLPPPDG-SCEPVSIPPCAECPAGPPGPPGKPGPPGDP 161
>gb|AAB51321.1| (U47924) DRPLA [Homo sapiens]
          Length = 1190

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 160   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 213   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 272   PTKPPTTP 279
>sp|P40602|APG_ARATH ANTHER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR
 pir||S21961 proline-rich protein APG - Arabidopsis thaliana
 emb|CAA42925.1| (X60377) APG [Arabidopsis thaliana]
          Length = 534

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 32/126 (25%), Positives = 43/126 (33%)
 Frame = +1

Query: 9871  PLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*GH 10050
             P P  +  P PDP       SP+P   P   P  +           APP P         
Sbjct: 53    PQPWPMNPPTPDP-------SPKPVAPPGPSPKPV-----------APPGP-------SP 87

Query: 10051 CPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGRSTSPLRV 10230
             CP  P  P  +   +P+   C     + Q +   PP+ PP  P P   P    +  P   
Sbjct: 88    CPSPPPKPQPKPPPAPSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKPAPPPAPKPAPP 147

Query: 10231 LQVQPV 10248
                +PV
Sbjct: 148   PAPKPV 153
>pir||S50832 atrophin-1 - human
          Length = 1184

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 160   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 213   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 272   PTKPPTTP 279
>ref|NP_001931.1| atrophin-1 [Homo sapiens]
 pir||G01763 atrophin-1 - human
 gb|AAB50276.1| (U23851) atrophin-1 [Homo sapiens]
          Length = 1184

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 159   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 211

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 212   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 270

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 271   PTKPPTTP 278
>emb|CAB88971.1| (AL353864) hypothetical protein SC8F11.20c. [Streptomyces coelicolor
            A3(2)]
          Length = 1086

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 38/134 (28%), Positives = 47/134 (34%), Gaps = 13/134 (9%)
 Frame = +1

Query: 9028 PRVNTR*GGARPRNTGLCMREGWYHXGHHH--------RGGPFHSLG----PGHCTQLPK 9171
            P ++ R  GARP + G          G H         RGG     G    P    + P 
Sbjct: 716  PELSERYAGARPGSNGAGSPSPGPPAGAHRPPPGGGRERGGHAEPAGTPSVPPRGERRPG 775

Query: 9172 LNLTPLGSPFSPPPMRPERGGSLLGHSAEECP-FHGPVLPSGAHASSVMQPTMMSQTVSP 9348
             N  P G    PPP  P+RGG          P   GP    G H  +   P++ SQ    
Sbjct: 776  NNGEPAGPRQGPPPAAPDRGGRGEPAGPHRVPASDGP--GRGEHPRAAGTPSVPSQAAPD 833

Query: 9349 QGWGPTGTGHRSCP 9390
            +G     TG R  P
Sbjct: 834  RGEHTPSTGTRRVP 847
>sp|P54259|DRPL_HUMAN ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSIAN ATROPHY PROTEIN)
 dbj|BAA06626.1| (D31840) DRPLA [Homo sapiens]
          Length = 1185

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
 Frame = +1

Query: 9853  HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
             H P L P      SPQP    P + +A   P P+++PTGY      P+S +  +   A  
Sbjct: 160   HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212

Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
               PP P L+ G + G   G P  P       S+  P G         P+ V     S AP
Sbjct: 213   -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271

Query: 10156 PSGPPRAP 10179
             P+ PP  P
Sbjct: 272   PTKPPTTP 279
>gb|AAF79900.1|AC022472_9 (AC022472) Contains a strong similarity to Anther-specific proline-rich
             protein APG precursor from Arabidopsis thaliana gi|728867
             and contains a Lipase/Acylhydrolase domain with GDSL-like
             motif PF|00657.  ESTs gb|AV531882, gb|AV533240,
             gb|AV534374, gb|>
          Length = 1137

 Score = 37.4 bits (85), Expect = 3.3
 Identities = 27/101 (26%), Positives = 38/101 (36%)
 Frame = +1

Query: 9895  PQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*GHCPGSPSHP 10074
             PQP P+      +P P+  P   P         S+  +APP P         CP  P  P
Sbjct: 63    PQPWPMN---PPTPDPSPKPVAPPGP-------SSKPVAPPGP-------SPCPSPPPKP 105

Query: 10075 VHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
               +   +P+   C     + Q +   PP+ PP  P P   P
Sbjct: 106   QPKPPPAPSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKP 146
>emb|CAA26904.1| (X03128) put. RNA polymerase II largest subunit [Saccharomyces
             cerevisiae]
          Length = 1726

 Score = 37.0 bits (84), Expect = 4.4
 Identities = 42/141 (29%), Positives = 57/141 (39%), Gaps = 4/141 (2%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1580  TSPSYSPTSPSYSPTSPSYSPMSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPSYSP-- 1635

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSP-SGSRPPLRPSRR-- 10205
                      T PS S  S   + P  PAY  + PS    +  +SP S S  P  PS    
Sbjct: 1636  ---------TSPSYSPTSP-SYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT 1685

Query: 10206 QKHFTPQSPTSPTSGPGIGQG 10268
               +++P SP+   + PG   G
Sbjct: 1686  SPNYSPTSPSYSPTSPGYSPG 1706
>pir||S50755 hypothetical protein VSP-3 - Chlamydomonas reinhardtii
 gb|AAB53953.1| (L29029) amino acid feature: Rod protein domain, aa 266 .. 468; amino
             acid feature: globular protein domain, aa 32 .. 265
             [Chlamydomonas reinhardtii]
          Length = 473

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 36/124 (29%), Positives = 44/124 (35%)
 Frame = +1

Query: 9850  KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
             K  P  SP P    SP P P   KA  SP P+ SP+  P          AS    P P +
Sbjct: 294   KASPSPSPSPKASPSPSPSP---KASPSPSPSPSPSPSP---------KASPSPSPSPSV 341

Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGR 10209
                     P S   P      SP+     P+     + S +P   P  +P P   P    
Sbjct: 342   Q-------PASKPSPSPSPSPSPSPRPSPPLPSPSPSPSPSPSPSPSPSPKPSPSPSPSP 394

Query: 10210 STSP 10221
             S SP
Sbjct: 395   SPSP 398
>ref|NP_033115.1| RNA polymerase II 1 [Mus musculus]
 pir||A28490 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - mouse
 gb|AAA40071.1| (M12130) RNA polymerase II [Mus musculus]
          Length = 1932

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      T  S    P  
Sbjct: 1671  TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1726

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P Y  + PS    +  +SP+   P   PS     
Sbjct: 1727  ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1772

Query: 10215 FTPQSPTSPTSGP 10253
             +TPQSPT   S P
Sbjct: 1773  YTPQSPTYTPSSP 1785
>sp|P11414|RPB1_CRIGR DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
 pir||A27677 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - Chinese
             hamster (fragment)
 gb|AAA37008.1| (M19538) RNA polymerase II largest subunit [Cricetulus griseus]
          Length = 467

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      T  S    P  
Sbjct: 206   TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 261

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P Y  + PS    +  +SP+   P   PS     
Sbjct: 262   ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 307

Query: 10215 FTPQSPTSPTSGP 10253
             +TPQSPT   S P
Sbjct: 308   YTPQSPTYTPSSP 320
>emb|CAA60502.1| (X86819) Microtubule-associated protein 4 [Gallus gallus]
          Length = 928

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 30/97 (30%), Positives = 42/97 (42%)
 Frame = +3

Query: 5712 KKIQAASLENRKQHQIPPYSTPTLACLKAPT*KPRVSQKNSWSATACSLLLSPLPPARIT 5891
            K   A S + R     PP S P  A  ++ T  PR +  +  +ATA +   +  PP R T
Sbjct: 689  KVTDAKSPDKRTSLSKPPSSAPRAAA-RSTTATPRTTATSPVTATAGAKSTTASPPKRPT 747

Query: 5892 SYKIPAQPTDFSPCSPSSIPLTQLLASSASWTFVPSS 6002
            S K  A+P D    +  S         SA+ + V SS
Sbjct: 748  SIKTDAKPADAKKTTAKSPSADLARPKSAAGSTVKSS 784
>ref|NP_010141.1| RNA polymerase II large subunit; Rpo21p [Saccharomyces cerevisiae]
 sp|P04050|RPB1_YEAST DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (B220)
 pir||RNBY2L DNA-directed RNA polymerase (EC 2.7.7.6) II 215K chain - yeast
             (Saccharomyces cerevisiae)
 emb|CAA65619.1| (X96876) RPB1 [Saccharomyces cerevisiae]
 emb|CAA98713.1| (Z74188) ORF YDL140c [Saccharomyces cerevisiae]
          Length = 1733

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/141 (29%), Positives = 57/141 (39%), Gaps = 4/141 (2%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      +  S    P  
Sbjct: 1587  TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPSYSP-- 1642

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSP-SGSRPPLRPSRR-- 10205
                      T PS S  S   + P  PAY  + PS    +  +SP S S  P  PS    
Sbjct: 1643  ---------TSPSYSPTSP-SYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT 1692

Query: 10206 QKHFTPQSPTSPTSGPGIGQG 10268
               +++P SP+   + PG   G
Sbjct: 1693  SPNYSPTSPSYSPTSPGYSPG 1713
>pir||I38186 RNA polymerase II largest subunit - human
 emb|CAA52862.1| (X74874) RNA polymerase II largest subunit [Homo sapiens]
          Length = 1970

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      T  S    P  
Sbjct: 1709  TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P Y  + PS    +  +SP+   P   PS     
Sbjct: 1765  ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810

Query: 10215 FTPQSPTSPTSGP 10253
             +TPQSPT   S P
Sbjct: 1811  YTPQSPTYTPSSP 1823
>sp|P08775|RPB1_MOUSE DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
          Length = 1970

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      T  S    P  
Sbjct: 1709  TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P Y  + PS    +  +SP+   P   PS     
Sbjct: 1765  ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810

Query: 10215 FTPQSPTSPTSGP 10253
             +TPQSPT   S P
Sbjct: 1811  YTPQSPTYTPSSP 1823
>dbj|BAA22376.1| (D87293) RNA polymerase II largest subunit [Cricetulus griseus]
          Length = 1970

 Score = 36.6 bits (83), Expect = 5.7
 Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
 Frame = +3

Query: 9858  TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
             T P  +   P  +PT  S    SP+ SPTS  P++ P  P  S      T  S    P  
Sbjct: 1709  TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764

Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
                      T PS S  S   + P  P Y  + PS    +  +SP+   P   PS     
Sbjct: 1765  ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810

Query: 10215 FTPQSPTSPTSGP 10253
             +TPQSPT   S P
Sbjct: 1811  YTPQSPTYTPSSP 1823
  Database: nr
    Posted date:  Jan 16, 2001  9:58 PM
  Number of letters in database: 191,393,013
  Number of sequences in database:  605,060
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: -180905813
Number of Sequences: 605060
Number of extensions: 108132989
Number of successful extensions: 373588
Number of sequences better than 10.0: 124
Number of HSP's better than 10.0 without gapping: 189936
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 364078
length of query: 3440
length of database: 191,393,013
effective HSP length: 59
effective length of query: 3380
effective length of database: 155,694,473
effective search space: 526247318740
effective search space used: 526247318740
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)
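
The bit values printed in parentheses in the footer above, and the effective lengths, can be reproduced from the Lambda and K values in the same footer. The following is a minimal sketch, not the BLAST implementation: it assumes the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2, and it assumes that X1 and S1 use the ungapped parameters while X2, X3, S2 and the reported alignment scores use the gapped ones (an inference from the numbers, not something the report states). The function names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
UNGAPPED = {"lambda": 0.318, "K": 0.135}
GAPPED = {"lambda": 0.267, "K": 0.0410}


def bit_score(raw, params):
    """Normalised bit score: (lambda * S - ln K) / ln 2."""
    return (params["lambda"] * raw - math.log(params["K"])) / math.log(2)


def dropoff_bits(raw, params):
    """X drop-off values are score differences, so no K term: lambda * X / ln 2."""
    return params["lambda"] * raw / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length minus one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    # Raw score 85 is the value shown as "Score = 37.4 bits (85)".
    print(f"alignment score 85 -> {bit_score(85, GAPPED):.1f} bits")
    print(f"S1 41 -> {bit_score(41, UNGAPPED):.1f} bits")
    print(f"S2 81 -> {bit_score(81, GAPPED):.1f} bits")
    print(f"X1 16 -> {dropoff_bits(16, UNGAPPED):.1f} bits")
    print(f"X2 38 -> {dropoff_bits(38, GAPPED):.1f} bits")
    print(f"X3 64 -> {dropoff_bits(64, GAPPED):.1f} bits")

    eff_db = effective_db_length(191_393_013, 605_060, 59)
    print("effective length of database:", eff_db)
    # 3380 is the effective query length as printed in the footer.
    print("effective search space:", 3380 * eff_db)
```

Because Lambda and K are printed to only three significant figures, the recomputed bit values agree with the report to within rounding rather than exactly.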