Score E
Sequences producing significant alignments: (bits) Value
pir||T47166 hypothetical protein DKFZp762B245.1 - human (fr... 126 5e-27
gb|AAC53190.1| (U92453) WW domain binding protein 3; WBP3 [... 87 1e-18
gb|AAC34395.1| (AF056116) unknown [Takifugu rubripes] 87 2e-17
ref|NP_005883.1| formin-like; chromosome 17 open reading f... 92 1e-16
gb|AAF25953.1| (AF215666) formin-like protein [Mus musculus] 86 6e-15
ref|NP_062653.1| formin-related gene in leukocytes; lymphoc... 86 6e-15
gb|AAF49761.1| (AE003536) CG6807 gene product [Drosophila m... 62 2e-07
gb|AAF60718.1| (AC024798) contains similarity to TR:Q9Z2V7 ... 50 1e-04
dbj|BAA31641.1| (AB014566) KIAA0666 protein [Homo sapiens] 42 0.004
pir||T04455 hypothetical protein F4D11.90 - Arabidopsis tha... 42 0.18
sp|P18616|RPB1_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST... 40 0.51
emb|CAA21466.2| (AL031986) DNA-directed RNA polymerase (EC ... 39 0.88
pir||JDMU1 DNA-directed RNA polymerase (EC 2.7.7.6) II larg... 39 0.88
sp|P31635|RPB0_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST... 39 0.88
gb|AAF45601.1| (AE003420) EG:114D9.2 gene product [Drosophi... 34 0.97
emb|CAA21052.1| (AL031640) /prediction=(method:""genscan"",... 34 1.00
emb|CAB95888.1| (AL359988) putative serine-threonine protei... 39 1.1
emb|CAA36734.1| (X52493) DNA-directed RNA polymerase [Glyci... 39 1.1
pir||S14181 DNA-directed RNA polymerase (EC 2.7.7.6) larges... 39 1.1
pir||T07796 DNA-directed RNA polymerase (EC 2.7.7.6) larges... 39 1.1
pir||S14182 DNA-directed RNA polymerase (EC 2.7.7.6) larges... 39 1.1
dbj|BAA20835.1| (AB002379) KIAA0381 [Homo sapiens] 36 1.3
gb|AAC99858.1| (U31159) CR16 [Rattus norvegicus] >gi|409637... 39 1.5
gb|AAA87791.1| (U25281) SH3 domain binding protein [Rattus ... 39 1.5
pir||S14183 DNA-directed RNA polymerase (EC 2.7.7.6) larges... 38 2.0
gb|AAF48438.1| (AE003498) CG15032 gene product [Drosophila ... 38 2.0
sp|P25439|BRM_DROME HOMEOTIC GENE REGULATOR (BRAHMA PROTEIN... 29 2.0
gb|AAF49557.1| (AE003529) brm gene product [alt 1] [Drosoph... 29 2.0
gb|AAF49558.2| (AE003529) brm gene product [alt 2] [Drosoph... 29 2.1
dbj|BAA07534.1| (D38529) DRPLA protein [Homo sapiens] 37 3.3
gb|AAG45420.1|AF309494_1 (AF309494) vegetative cell wall pr... 37 3.3
sp|P16253|CAC3_HAECO CUTICLE COLLAGEN 3A3 >gi|159169|gb|AAA... 37 3.3
ref|XP_006637.1| similar to dentatorubral-pallidoluysian at... 37 3.3
pir||A44984 collagen - nematode (Haemonchus contortus) 37 3.3
gb|AAB51321.1| (U47924) DRPLA [Homo sapiens] 37 3.3
sp|P40602|APG_ARATH ANTER-SPECIFIC PROLINE-RICH PROTEIN APG... 37 3.3
pir||S50832 atrophin-1 - human 37 3.3
ref|NP_001931.1| atrophin-1 [Homo sapiens] >gi|7512295|pir|... 37 3.3
emb|CAB88971.1| (AL353864) hypothetical protein SC8F11.20c.... 37 3.3
sp|P54259|DRPL_HUMAN ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSI... 37 3.3
gb|AAF79900.1|AC022472_9 (AC022472) Contains a strong simil... 37 3.3
emb|CAA26904.1| (X03128) put. RNA polymerase II largest sub... 37 4.4
pir||S50755 hypothetical protein VSP-3 - Chlamydomonas rein... 37 5.7
ref|NP_033115.1| RNA polymerase II 1 [Mus musculus] >gi|904... 37 5.7
sp|P11414|RPB1_CRIGR DNA-DIRECTED RNA POLYMERASE II LARGEST... 37 5.7
emb|CAA60502.1| (X86819) Microtubule-associated protein 4 [... 37 5.7
ref|NP_010141.1| RNA polymerase II large subunit; Rpo21p [S... 37 5.7
pir||I38186 RNA polymerase II largest subunit - human >gi|8... 37 5.7
sp|P08775|RPB1_MOUSE DNA-DIRECTED RNA POLYMERASE II LARGEST... 37 5.7
dbj|BAA22376.1| (D87293) RNA polymerase II largest subunit ... 37 5.7
ref|NP_000928.1| polymerase (RNA) II (DNA directed) polypep... 37 5.7
pir||I65981 fatty acid omega-hydroxylase (EC 1.14.15.-) cyt... 37 5.7
gb|AAB58418.1| (U37500) RNA polymerase II largest subunit [... 37 5.7
gb|AAC02612.1| (AF045646) contains similarity to collagens ... 32 6.2
sp|O43101|CBF5_CANAL CENTROMERE/MICROTUBULE BINDING PROTEIN... 36 7.4
dbj|BAB15254.1| (AK025837) unnamed protein product [Homo sa... 36 7.4
pir||S50754 hypothetical protein WP6 - Chlamydomonas eugame... 36 7.4
gb|AAC69221.2| (AF101312) contains similarity to human diap... 29 9.4
ref|NP_013276.1| major low affinity 55 kDa Centromere/micro... 36 9.7
ref|XP_009188.1| death-associated protein kinase 3 [Homo sa... 36 9.7
gb|AAG12789.1|AC023913_8 (AC023913) transcription factor, p... 36 9.7
pir||T34947 hypothetical protein SC4A10.10c - Streptomyces ... 36 9.7
Alignments
>pir||T47166 hypothetical protein DKFZp762B245.1 - human (fragment)
emb|CAB82400.1| (AL162062) hypothetical protein [Homo sapiens]
Length = 425
Score = 126 bits (316), Expect = 5e-27
Identities = 64/64 (100%), Positives = 64/64 (100%)
Frame = +1
Query: 8308 DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR 8487
DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR
Sbjct: 1 DKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSAEEICR 60
Query: 8488 AIHT 8499
AIHT
Sbjct: 61 AIHT 64
Score = 53.9 bits (128), Expect = 3e-05
Identities = 38/103 (36%), Positives = 43/103 (40%), Gaps = 3/103 (2%)
Frame = +3
Query: 9042 KVRGRQAKEHRPVYE---GRMVPSRTSSQGGPFSLPGAWPLHPAPKAQPNPFGLSFFPTP 9212
++R RQAKEHRPVYE G + T + PF+
Sbjct: 370 ELRRRQAKEHRPVYEGKDGTIEDIITVLKSVPFT-------------------------- 403
Query: 9213 HEA*AGWLSPGPQC*RVSLSRARTAKRGSRFFCDAAHHDESNC 9341
ARTAKRGSRFFCDAAHHDESNC
Sbjct: 404 ---------------------ARTAKRGSRFFCDAAHHDESNC 425
>gb|AAC53190.1| (U92453) WW domain binding protein 3; WBP3 [Mus musculus]
Length = 164
Score = 86.7 bits (213), Expect(2) = 1e-18
Identities = 41/55 (74%), Positives = 48/55 (86%)
Frame = -3
Query: 4121 LSACQETYENTSHQVHTLRRLIKEKEEAFQRRCHLEPNVRGLESVDSEALSQSRP 3957
L + +ETYENTS+QVHTLRRLIKEKEEAFQRRCHLEP+ RGLES+ EAL++ P
Sbjct: 50 LESIKETYENTSNQVHTLRRLIKEKEEAFQRRCHLEPSARGLESMGGEALARVGP 104
Score = 32.3 bits (72), Expect(2) = 1e-18
Identities = 13/17 (76%), Positives = 15/17 (87%)
Frame = -1
Query: 3970 ARVGPAELSEGMPPKDL 3920
ARVGP EL+EG+PP DL
Sbjct: 100 ARVGPTELTEGIPPSDL 116
>gb|AAC34395.1| (AF056116) unknown [Takifugu rubripes]
Length = 1037
Score = 86.7 bits (213), Expect = 5e-15
Identities = 39/59 (66%), Positives = 54/59 (91%)
Frame = +1
Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRS 8469
++L+L++FEELFKT+AQGP +DL C+K+K AQKA +KVT+L+ANR+KNLAITLRKA ++
Sbjct: 618 RELELERFEELFKTRAQGPIMDLSCTKSKVAQKAVNKVTILDANRSKNLAITLRKANKT 676
Score = 81.3 bits (199), Expect(2) = 2e-17
Identities = 37/44 (84%), Positives = 43/44 (97%)
Frame = +3
Query: 7878 LAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILEV 8009
+AIRIKKPIKTKFRLPVFNWTALKPNQI+GTVF+E+DDE+ LE+
Sbjct: 579 VAIRIKKPIKTKFRLPVFNWTALKPNQINGTVFNEIDDERELEL 622
Score = 46.6 bits (109), Expect = 0.006
Identities = 22/39 (56%), Positives = 31/39 (79%)
Frame = -3
Query: 4109 QETYENTSHQVHTLRRLIKEKEEAFQRRCHLEPNVRGLE 3993
+ETYE+TS QV+TLR++IKEK+ AFQR ++E + LE
Sbjct: 432 RETYESTSSQVNTLRKVIKEKDAAFQRHFNIERRLLELE 470
Score = 38.1 bits (87), Expect = 2.0
Identities = 31/103 (30%), Positives = 40/103 (38%), Gaps = 3/103 (2%)
Frame = +3
Query: 9042 KVRGRQAKEHRPVYE---GRMVPSRTSSQGGPFSLPGAWPLHPAPKAQPNPFGLSFFPTP 9212
++R RQAK+HRPVYE G + T + PF+
Sbjct: 982 ELRKRQAKDHRPVYEGKDGTIEDIITVLKSVPFT-------------------------- 1015
Query: 9213 HEA*AGWLSPGPQC*RVSLSRARTAKRGSRFFCDAAHHDESNC 9341
ARTAKRGSRFFC+A D++NC
Sbjct: 1016 ---------------------ARTAKRGSRFFCEANLCDDANC 1037
Score = 33.9 bits (76), Expect(2) = 2e-17
Identities = 15/29 (51%), Positives = 20/29 (68%)
Frame = +1
Query: 7717 PPDKCPPAPPLPGAAPSVVLTVGLSGEYP 7803
PP P APPLP A+PSV+L+V + + P
Sbjct: 558 PPPPPPLAPPLPDASPSVILSVAIRIKKP 586
>ref|NP_005883.1| formin-like; chromosome 17 open reading frame 1; chromosome 17 open
reading frame 1B [Homo sapiens]
ref|XP_008347.1| chromosome 17 open reading frame 1B [Homo sapiens]
emb|CAA07870.1| (AJ008112) C17orf1 protein [Homo sapiens]
Length = 463
Score = 91.7 bits (226), Expect = 1e-16
Identities = 46/67 (68%), Positives = 53/67 (78%)
Frame = +1
Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
Q+LD+ FEE FKTK+QGP+LDL K+K AQKA SK TL+EANRAKNLAITLRK A
Sbjct: 29 QELDMSDFEEQFKTKSQGPSLDLSALKSKAAQKAPSKATLIEANRAKNLAITLRKGNLGA 88
Query: 8473 EEICRAI 8493
E IC+AI
Sbjct: 89 ERICQAI 95
Score = 48.9 bits (115), Expect = 0.001
Identities = 19/29 (65%), Positives = 27/29 (92%)
Frame = +3
Query: 7920 LPVFNWTALKPNQISGTVFSELDDEKILE 8006
+P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 1 MPLLNWVALKPSQITGTVFTELNDEKVLQ 29
>gb|AAF25953.1| (AF215666) formin-like protein [Mus musculus]
Length = 1094
Score = 86.3 bits (212), Expect = 6e-15
Identities = 42/69 (60%), Positives = 52/69 (74%)
Frame = +1
Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
Q+LD++ FEE FKTK+QGP LD+ K K +QKA +K L+EANRAKNLAITLRK A
Sbjct: 665 QELDMNDFEEHFKTKSQGPCLDISALKGKASQKAPTKTILIEANRAKNLAITLRKGNLGA 724
Query: 8473 EEICRAIHT 8499
+ IC+AI T
Sbjct: 725 DRICQAIET 733
Score = 67.0 bits (162), Expect(2) = 1e-10
Identities = 27/41 (65%), Positives = 38/41 (91%)
Frame = +3
Query: 7884 IRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
++ KKPI+TKFR+P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 625 VKAKKPIQTKFRMPLLNWVALKPSQITGTVFTELNDEKVLQ 665
Score = 25.0 bits (53), Expect(2) = 1e-10
Identities = 9/11 (81%), Positives = 9/11 (81%)
Frame = +1
Query: 7732 PPAPPLPGAAP 7764
PPAPPLPG P
Sbjct: 575 PPAPPLPGDLP 585
>ref|NP_062653.1| formin-related gene in leukocytes; lymphocyte specific formin related
protein [Mus musculus]
pir||T13963 formin related protein, lymphocyte specific - mouse
gb|AAD01273.1| (AF006466) lymphocyte specific formin related protein [Mus musculus]
Length = 1064
Score = 86.3 bits (212), Expect = 6e-15
Identities = 42/69 (60%), Positives = 52/69 (74%)
Frame = +1
Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAITLRKAGRSA 8472
Q+LD++ FEE FKTK+QGP LD+ K K +QKA +K L+EANRAKNLAITLRK A
Sbjct: 639 QELDMNDFEEHFKTKSQGPCLDISALKGKASQKAPTKTILIEANRAKNLAITLRKGNLGA 698
Query: 8473 EEICRAIHT 8499
+ IC+AI T
Sbjct: 699 DRICQAIET 707
Score = 67.0 bits (162), Expect(2) = 6e-10
Identities = 27/41 (65%), Positives = 38/41 (91%)
Frame = +3
Query: 7884 IRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
++ KKPI+TKFR+P+ NW ALKP+QI+GTVF+EL+DEK+L+
Sbjct: 599 VKAKKPIQTKFRMPLLNWVALKPSQITGTVFTELNDEKVLQ 639
Score = 22.7 bits (47), Expect(2) = 6e-10
Identities = 7/14 (50%), Positives = 9/14 (64%)
Frame = +1
Query: 7732 PPAPPLPGAAPSVV 7773
PP PP PG P ++
Sbjct: 575 PPPPPPPGGPPDIL 588
>gb|AAF49761.1| (AE003536) CG6807 gene product [Drosophila melanogaster]
Length = 1043
Score = 61.6 bits (148), Expect = 2e-07
Identities = 25/42 (59%), Positives = 34/42 (80%)
Frame = +3
Query: 7881 AIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
A+ IK+ + TK++LP NW ALKPNQ+ GT+F+ELDDEKI +
Sbjct: 554 AMTIKRKVPTKYKLPTLNWIALKPNQVRGTIFNELDDEKIFK 595
Score = 37.4 bits (85), Expect = 3.3
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Frame = +1
Query: 8293 QDLDLDKFEELFK--------TKAQGPALDLICSKNKTAQKAASKVTLLEANRAKNLAIT 8448
+ +D ++FEE FK + G +D +K K V+LLE R +N+AI+
Sbjct: 595 KQIDFNEFEERFKIGIGGALRNGSNGTEVDGSLQSSKRF-KRPDNVSLLEHTRLRNIAIS 653
Query: 8449 LRKAGRSAEEICRAIHT 8499
RK G +++ AIH+
Sbjct: 654 RRKLGMPIDDVIAAIHS 670
>gb|AAF60718.1| (AC024798) contains similarity to TR:Q9Z2V7 [Caenorhabditis elegans]
Length = 1164
Score = 50.1 bits (118), Expect(2) = 1e-04
Identities = 25/47 (53%), Positives = 31/47 (65%)
Frame = +3
Query: 7866 PPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILE 8006
P S A IKK +TK +LP NWTA+KP Q TVF +L+DE I+E
Sbjct: 575 PASNDAKTIKKIYQTKNKLPQLNWTAMKPMQAKNTVFEKLNDELIIE 621
Score = 37.7 bits (86), Expect = 2.6
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 15/83 (18%)
Frame = +1
Query: 8293 QDLDLDKFEELFKTKAQGPALDLICSKNKTA---------------QKAASKVTLLEANR 8427
+ LD K EE+FK P L L K++ + +A K TLL+ R
Sbjct: 621 EKLDFSKLEEMFKLAQ--PTLGLAEPKSEQSVIGQVSPGSTTSAAGTSSARKNTLLDTKR 678
Query: 8428 AKNLAITLRKAGRSAEEICRAIH 8496
+N+AIT RK A+ I A+H
Sbjct: 679 LQNVAITRRKVAMDAKSIMAAVH 701
Score = 21.2 bits (43), Expect(2) = 1e-04
Identities = 11/26 (42%), Positives = 11/26 (42%)
Frame = +1
Query: 7717 PPDKCPPAPPLPGAAPSVVLTVGLSG 7794
PP PP P L G P GL G
Sbjct: 549 PPPPPPPPPMLGGPPPPPPPPGGLMG 574
>dbj|BAA31641.1| (AB014566) KIAA0666 protein [Homo sapiens]
Length = 1085
Score = 41.6 bits (96), Expect(2) = 0.004
Identities = 22/63 (34%), Positives = 36/63 (56%), Gaps = 3/63 (4%)
Frame = +3
Query: 7830 PGVCVLGPCVTDPPSKLAIRIKK---PIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
PG LG + P + + + +KK P T L FNW+ L N++ GTV++E+DD K+
Sbjct: 585 PGPPPLGAIMPPPGAPMGLALKKKSIPQPTN-ALKSFNWSKLPENKLEGTVWTEIDDTKV 643
Query: 8001 LEV 8009
++
Sbjct: 644 FKI 646
Score = 24.6 bits (52), Expect(2) = 0.004
Identities = 11/15 (73%), Positives = 11/15 (73%)
Frame = +1
Query: 7711 LLPPDKCPPAPPLPG 7755
LLPP PP PPLPG
Sbjct: 555 LLPP---PPPPPLPG 566
>pir||T04455 hypothetical protein F4D11.90 - Arabidopsis thaliana
emb|CAA18590.1| (AL022537) putative protein [Arabidopsis thaliana]
emb|CAB79988.1| (AL161581) putative protein kinase [Arabidopsis thaliana]
Length = 731
Score = 41.6 bits (96), Expect = 0.18
Identities = 45/136 (33%), Positives = 55/136 (40%), Gaps = 7/136 (5%)
Frame = +1
Query: 9859 PCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGC 10038
P LSPLP L SP P P SP P +PT P L S S + +PP P L
Sbjct: 35 PPLSPLPPPLSSPPPLP-------SPPPLSAPTASPPPLPVESPPSPPIESPPPPLLE-- 85
Query: 10039 S*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPR---APGPH*DP---- 10197
S P P +S+P+G P + A+ + PPS PP PG P
Sbjct: 86 SPPPPPLESPSPPSPHVSAPSGSPPLPFL---PAKPSPPPSSPPSETVPPGNTISPPPRS 142
Query: 10198 LGGRSTSPLRVLQVQP 10245
L ST P+ P
Sbjct: 143 LPSESTPPVNTASPPP 158
>sp|P18616|RPB1_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (VERSION 1)
emb|CAA37130.1| (X52954) RNA polymerase II [Arabidopsis thaliana]
Length = 1841
Score = 40.0 bits (92), Expect = 0.51
Identities = 42/133 (31%), Positives = 57/133 (42%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1647 TSPAYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPAYSPTSPGYSP-- 1702
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P+YG + PS +A++SPS + P + R
Sbjct: 1703 ---------TSPSYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1750
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 1751 ASPYSPTSPNYSP 1763
>emb|CAA21466.2| (AL031986) DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain
[Arabidopsis thaliana]
emb|CAB81489.1| (AL161588) DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain
[Arabidopsis thaliana]
Length = 1840
Score = 39.3 bits (90), Expect = 0.88
Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1646 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1701
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+YG + PS +A++SPS + P + R
Sbjct: 1702 ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1749
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 1750 ASPYSPTSPNYSP 1762
>pir||JDMU1 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - Arabidopsis
thaliana
Length = 1834
Score = 39.3 bits (90), Expect = 0.88
Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1640 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1695
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+YG + PS +A++SPS + P + R
Sbjct: 1696 ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1743
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 1744 ASPYSPTSPNYSP 1756
>sp|P31635|RPB0_ARATH DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (VERSION 2)
pir||JDMU2 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain (version 2) -
Arabidopsis thaliana
emb|CAA36735.1| (X52494) DNA-directed RNA polymerase [Arabidopsis thaliana]
Length = 1860
Score = 39.3 bits (90), Expect = 0.88
Identities = 41/133 (30%), Positives = 56/133 (41%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1666 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 1721
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+YG + PS +A++SPS + P + R
Sbjct: 1722 ---------TSPGYSPTSP-SYSPTSPSYGPTSPSYNPQSAKYSPSIAYSP--SNARLSP 1769
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 1770 ASPYSPTSPNYSP 1782
>gb|AAF45601.1| (AE003420) EG:114D9.2 gene product [Drosophila melanogaster]
Length = 1429
Score = 33.9 bits (76), Expect(2) = 0.97
Identities = 18/50 (36%), Positives = 27/50 (54%)
Frame = +3
Query: 7851 PCVTDPPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
P D P K + P+K+ FNW+ L ++ GTV+SELD+ K+
Sbjct: 889 PPKVDLPKKNVPQPTNPLKS------FNWSKLPDAKLQGTVWSELDESKL 932
Score = 23.9 bits (50), Expect(2) = 0.97
Identities = 11/22 (50%), Positives = 13/22 (59%)
Frame = +1
Query: 7717 PPDKCPPAPPLPGAAPSVVLTV 7782
PP CP APP P PS+ T+
Sbjct: 867 PPPPCPGAPPPP---PSMAQTM 885
>emb|CAA21052.1| (AL031640) /prediction=(method:""genscan"", version:""1.0"",
score:""400.91"")~/prediction=(method:""genefinder"",
version:""084"")~/match=(desc:""DIA-12C PROTEIN"",
species:""HOMO SAPIENS (HUMAN)"",
ranges:(query:29998..30156, target:SPTREMBL::O60878:>
Length = 979
Score = 33.9 bits (76), Expect(2) = 1.00
Identities = 18/50 (36%), Positives = 27/50 (54%)
Frame = +3
Query: 7851 PCVTDPPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKI 8000
P D P K + P+K+ FNW+ L ++ GTV+SELD+ K+
Sbjct: 439 PPKVDLPKKNVPQPTNPLKS------FNWSKLPDAKLQGTVWSELDESKL 482
Score = 23.9 bits (50), Expect(2) = 1.00
Identities = 11/22 (50%), Positives = 13/22 (59%)
Frame = +1
Query: 7717 PPDKCPPAPPLPGAAPSVVLTV 7782
PP CP APP P PS+ T+
Sbjct: 417 PPPPCPGAPPPP---PSMAQTM 435
>emb|CAB95888.1| (AL359988) putative serine-threonine protein kinase [Streptomyces
coelicolor A3(2)]
Length = 580
Score = 38.9 bits (89), Expect = 1.1
Identities = 26/94 (27%), Positives = 37/94 (38%)
Frame = +3
Query: 7692 LGHCVISSSPRQVSPSPTSPWCCTLCGVDSGPVR*VSPGLWAGLVGPGVCVLGPCVTDPP 7871
LG C+ ++ + +P+ WC G D G P W + GP V V P P
Sbjct: 249 LGRCLATAPEERATPAEIVEWCRRELGRDGG--EGAGPAGWREIAGPPVTVPPPAAATGP 306
Query: 7872 SKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTV 7973
+ A + P T + WT + N GTV
Sbjct: 307 ATAAAPVAAPGPT--AVHTTPWTVPEGNVAPGTV 338
>emb|CAA36734.1| (X52493) DNA-directed RNA polymerase [Glycine max]
Length = 494
Score = 38.9 bits (89), Expect = 1.1
Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 304 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 359
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+Y + PS +A++SPS + P P R
Sbjct: 360 ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 407
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 408 SSPYSPTSPNYSP 420
>pir||S14181 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain (isoform B1) -
soybean (fragment)
Length = 650
Score = 38.9 bits (89), Expect = 1.1
Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 460 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 515
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+Y + PS +A++SPS + P P R
Sbjct: 516 ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 563
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 564 SSPYSPTSPNYSP 576
>pir||T07796 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain - soybean
(fragment)
emb|CAA36733.1| (X52492) DNA-directed RNA polymerase [Glycine max]
Length = 625
Score = 38.9 bits (89), Expect = 1.1
Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 464 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 519
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+Y + PS +A++SPS + P P R
Sbjct: 520 ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 567
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 568 SSPYSPTSPNYSP 580
>pir||S14182 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain (isoform B2) -
soybean (fragment)
Length = 491
Score = 38.9 bits (89), Expect = 1.1
Identities = 41/133 (30%), Positives = 55/133 (40%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 301 TSPAYSPTSPAYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPAYSP-- 356
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T P S S + P P+Y + PS +A++SPS + P P R
Sbjct: 357 ---------TSPGYSPTSP-SYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSP--RLSP 404
Query: 10215 FTPQSPTSPTSGP 10253
+P SPTSP P
Sbjct: 405 SSPYSPTSPNYSP 417
>dbj|BAA20835.1| (AB002379) KIAA0381 [Homo sapiens]
Length = 864
Score = 36.2 bits (82), Expect(2) = 1.3
Identities = 15/48 (31%), Positives = 27/48 (56%)
Frame = +3
Query: 7866 PPSKLAIRIKKPIKTKFRLPVFNWTALKPNQISGTVFSELDDEKILEV 8009
P S + +R K+ + L FNW L ++ GTV++E+DD ++ +
Sbjct: 374 PSSDVPLRKKRVPQPSHPLKSFNWVKLNEERVPGTVWNEIDDMQVFRI 421
Score = 21.2 bits (43), Expect(2) = 1.3
Identities = 11/30 (36%), Positives = 15/30 (49%), Gaps = 6/30 (20%)
Frame = +1
Query: 7681 SLDAWVTVSFLLPPDK------CPPAPPLP 7752
+L + +T + L PP CPP PP P
Sbjct: 317 TLSSSMTTNDLPPPPPPLPFACCPPPPPPP 346
>gb|AAC99858.1| (U31159) CR16 [Rattus norvegicus]
gb|AAC99859.1| (U31169) SH3 domain binding protein [Rattus norvegicus]
Length = 485
Score = 38.5 bits (88), Expect = 1.5
Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 8/118 (6%)
Frame = +1
Query: 9850 KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
+ P P P P P PL A SP+ A++P P + S S S PP P
Sbjct: 291 REPPAPPPPPPPPPPPPPPPLPTYASCSPRAAVAP---PPPPLPGSSNSGSETPPPLP-- 345
Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAA--------PPSGPPRAP 10179
P SPS ++L +P G ++++ + R PP+ P R+P
Sbjct: 346 --------PKSPSFQTQKALPTPPGAPGPQIILQKKRRGPGAGGGKLNPPPAPPARSP 395
>gb|AAA87791.1| (U25281) SH3 domain binding protein [Rattus norvegicus]
prf||2205340A CR16 gene [Rattus norvegicus]
Length = 451
Score = 38.5 bits (88), Expect = 1.5
Identities = 32/118 (27%), Positives = 47/118 (39%), Gaps = 8/118 (6%)
Frame = +1
Query: 9850 KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
+ P P P P P PL A SP+ A++P P + S S S PP P
Sbjct: 291 REPPAPPPPPPPPPPPPPPPLPTYASCSPRAAVAP---PPPPLPGSSNSGSETPPPLP-- 345
Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAA--------PPSGPPRAP 10179
P SPS ++L +P G ++++ + R PP+ P R+P
Sbjct: 346 --------PKSPSFQTQKALPTPPGAPGPQIILQKKRRGPGAGGGKLNPPPAPPARSP 395
>pir||S14183 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain (isoform C) -
soybean (fragment)
emb|CAA36736.1| (X52495) DNA-directed RNA polymerase [Glycine max]
Length = 977
Score = 38.1 bits (87), Expect = 2.0
Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 3/133 (2%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SSEMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVGL 10037
T P ++ P +PT SP+ SPTS P + P P SL + S P
Sbjct: 797 TSPAYSSTSPAYSPT------SPSYSPTS--PAYSPTSPSYSLTSPSYSPTSPSYSPTSP 848
Query: 10038 LLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRP--SRRQK 10211
S P+ S P P+Y + PS +A++SPS + P P S
Sbjct: 849 SYSPTSPAYSPTSPSYS-----PTSPSYSPTSPSYNPQSAKYSPSLAYSPSSPRLSPTSP 903
Query: 10212 HFTPQSPT-SPTS 10247
+++P SP+ SPTS
Sbjct: 904 NYSPTSPSYSPTS 916
>gb|AAF48438.1| (AE003498) CG15032 gene product [Drosophila melanogaster]
Length = 277
Score = 38.1 bits (87), Expect = 2.0
Identities = 19/49 (38%), Positives = 27/49 (54%), Gaps = 1/49 (2%)
Frame = -3
Query: 7856 TGAKDTHPWSHQPCP-EPWGYSPDRPTVNTTEGAAPGRGGAGGHLSGGR 7713
T ++ HP++H P P G+ P+ N G+A G AGG +SGGR
Sbjct: 150 TNSRGLHPYAHSPAHGNPPGFYPNMWYPNAPYGSAGAAGSAGGAVSGGR 198
>sp|P25439|BRM_DROME HOMEOTIC GENE REGULATOR (BRAHMA PROTEIN)
pir||A42091 transcription activator SNF2/SWI2 homolog brm - fruit fly (Drosophila
melanogaster)
gb|AAA19661.1| (M85049) brahma protein [Drosophila melanogaster]
Length = 1638
Score = 28.9 bits (63), Expect(2) = 2.0
Identities = 20/60 (33%), Positives = 24/60 (39%)
Frame = +2
Query: 9923 PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
P Q P GTP S PP P+ +PG + V PP + G LPPQ
Sbjct: 241 PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 294
Score = 27.7 bits (60), Expect(2) = 2.0
Identities = 19/59 (32%), Positives = 23/59 (38%)
Frame = +3
Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
P P P G PP Q+ Q S+PP P +H P + GP GQ L
Sbjct: 290 PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 346
>gb|AAF49557.1| (AE003529) brm gene product [alt 1] [Drosophila melanogaster]
Length = 1638
Score = 28.9 bits (63), Expect(2) = 2.0
Identities = 20/60 (33%), Positives = 24/60 (39%)
Frame = +2
Query: 9923 PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
P Q P GTP S PP P+ +PG + V PP + G LPPQ
Sbjct: 241 PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 294
Score = 27.7 bits (60), Expect(2) = 2.0
Identities = 19/59 (32%), Positives = 23/59 (38%)
Frame = +3
Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
P P P G PP Q+ Q S+PP P +H P + GP GQ L
Sbjct: 290 PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 346
>gb|AAF49558.2| (AE003529) brm gene product [alt 2] [Drosophila melanogaster]
Length = 1537
Score = 28.9 bits (63), Expect(2) = 2.1
Identities = 20/60 (33%), Positives = 24/60 (39%)
Frame = +2
Query: 9923 PSNLPNQQFPQLGTPQALSKPPVYLPHCWLLPGPTCGAALEGTVLAHPPILFTGLSLPPQ 10102
P Q P GTP S PP P+ +PG + V PP + G LPPQ
Sbjct: 140 PPQQQQQPPPSAGTPPQCSTPPASNPYGPPVPGQ------KMQVAPPPPHMQQGQPLPPQ 193
Score = 27.7 bits (60), Expect(2) = 2.1
Identities = 19/59 (32%), Positives = 23/59 (38%)
Frame = +3
Query: 10095 PHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKHFTPQSPTSPTSGPGIGQGL 10271
P P P G PP Q+ Q S+PP P +H P + GP GQ L
Sbjct: 189 PLPPQPPQVGGPPPIQQQQPPQQQQQQSQPP--PPEPHQHQLPNGGKPLSMGPSGGQPL 245
>dbj|BAA07534.1| (D38529) DRPLA protein [Homo sapiens]
Length = 1182
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 160 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 213 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 272 PTKPPTTP 279
>gb|AAG45420.1|AF309494_1 (AF309494) vegetative cell wall protein gp1 [Chlamydomonas reinhardtii]
Length = 555
Score = 37.4 bits (85), Expect = 3.3
Identities = 35/118 (29%), Positives = 43/118 (35%)
Frame = +1
Query: 9868 SPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*G 10047
SP P SP P A SP P + P+ P S + S APP P
Sbjct: 188 SPAPP---SPAPPVPPSPAPPSPAPPVPPSPAPPSPPSPAPPSPPSPAPPSP-------- 236
Query: 10048 HCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGRSTSP 10221
P +P PV S + P+ P + PP PPR P P P+ SP
Sbjct: 237 -SPPAPPSPVPPSPAPPSPAPPSPKPPAPPPPPSPPPPPPPRPPFPANTPMPPSPPSP 293
>sp|P16253|CAC3_HAECO CUTICLE COLLAGEN 3A3
gb|AAA29173.1| (M32820) 3A3 collagen [Haemonchus contortus]
Length = 295
Score = 37.4 bits (85), Expect = 3.3
Identities = 20/48 (41%), Positives = 24/48 (49%)
Frame = +1
Query: 10054 PGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
PG+P P L P G C+P+ + A A P GPP PGP DP
Sbjct: 115 PGAPGLPGVPGLPPPDG-SCEPVSIPPCAECPAGPPGPPGKPGPPGDP 161
>ref|XP_006637.1| similar to dentatorubral-pallidoluysian atrophy (atrophin-1) (H.
sapiens) [Homo sapiens]
ref|XP_006975.1| atrophin-1 [Homo sapiens]
Length = 1189
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 159 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 211
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 212 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 270
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 271 PTKPPTTP 278
>pir||A44984 collagen - nematode (Haemonchus contortus)
Length = 295
Score = 37.4 bits (85), Expect = 3.3
Identities = 20/48 (41%), Positives = 24/48 (49%)
Frame = +1
Query: 10054 PGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
PG+P P L P G C+P+ + A A P GPP PGP DP
Sbjct: 115 PGAPGLPGVPGLPPPDG-SCEPVSIPPCAECPAGPPGPPGKPGPPGDP 161
>gb|AAB51321.1| (U47924) DRPLA [Homo sapiens]
Length = 1190
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 160 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 213 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 272 PTKPPTTP 279
>sp|P40602|APG_ARATH ANTER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR
pir||S21961 proline-rich protein APG - Arabidopsis thaliana
emb|CAA42925.1| (X60377) APG [Arabidopsis thaliana]
Length = 534
Score = 37.4 bits (85), Expect = 3.3
Identities = 32/126 (25%), Positives = 43/126 (33%)
Frame = +1
Query: 9871 PLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*GH 10050
P P + P PDP SP+P P P + APP P
Sbjct: 53 PQPWPMNPPTPDP-------SPKPVAPPGPSPKPV-----------APPGP-------SP 87
Query: 10051 CPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGRSTSPLRV 10230
CP P P + +P+ C + Q + PP+ PP P P P + P
Sbjct: 88 CPSPPPKPQPKPPPAPSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKPAPPPAPKPAPP 147
Query: 10231 LQVQPV 10248
+PV
Sbjct: 148 PAPKPV 153
>pir||S50832 atrophin-1 - human
Length = 1184
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 160 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 213 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 272 PTKPPTTP 279
>ref|NP_001931.1| atrophin-1 [Homo sapiens]
pir||G01763 atrophin-1 - human
gb|AAB50276.1| (U23851) atrophin-1 [Homo sapiens]
Length = 1184
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 159 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 211
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 212 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 270
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 271 PTKPPTTP 278
>emb|CAB88971.1| (AL353864) hypothetical protein SC8F11.20c. [Streptomyces coelicolor
A3(2)]
Length = 1086
Score = 37.4 bits (85), Expect = 3.3
Identities = 38/134 (28%), Positives = 47/134 (34%), Gaps = 13/134 (9%)
Frame = +1
Query: 9028 PRVNTR*GGARPRNTGLCMREGWYHXGHHH--------RGGPFHSLG----PGHCTQLPK 9171
P ++ R GARP + G G H RGG G P + P
Sbjct: 716 PELSERYAGARPGSNGAGSPSPGPPAGAHRPPPGGGRERGGHAEPAGTPSVPPRGERRPG 775
Query: 9172 LNLTPLGSPFSPPPMRPERGGSLLGHSAEECP-FHGPVLPSGAHASSVMQPTMMSQTVSP 9348
N P G PPP P+RGG P GP G H + P++ SQ
Sbjct: 776 NNGEPAGPRQGPPPAAPDRGGRGEPAGPHRVPASDGP--GRGEHPRAAGTPSVPSQAAPD 833
Query: 9349 QGWGPTGTGHRSCP 9390
+G TG R P
Sbjct: 834 RGEHTPSTGTRRVP 847
>sp|P54259|DRPL_HUMAN ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSIAN ATROPHY PROTEIN)
dbj|BAA06626.1| (D31840) DRPLA [Homo sapiens]
Length = 1185
Score = 37.4 bits (85), Expect = 3.3
Identities = 40/128 (31%), Positives = 53/128 (41%), Gaps = 19/128 (14%)
Frame = +1
Query: 9853 HQPCLSPLPSGL*SPQPD---PLKCKAQQSPQPAISPTGY------PSSLI*ASCLSASL 10005
H P L P SPQP P + +A P P+++PTGY P+S + + A
Sbjct: 160 HPPPLFPP-----SPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGA-- 212
Query: 10006 LAPPWPYLW-GCS*GHCPGSPSHP----VHRSLSSPTG-----LHCQPMVVRHQARSAAP 10155
PP P L+ G + G G P P S+ P G P+ V S AP
Sbjct: 213 -PPPHPQLYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAP 271
Query: 10156 PSGPPRAP 10179
P+ PP P
Sbjct: 272 PTKPPTTP 279
>gb|AAF79900.1|AC022472_9 (AC022472) Contains a strong similarity to Anther-specific proline-rich
protein APG precursor from Arabidopsis thaliana gi|728867
and contains a Lipase/Acylhydrolase domain with GDSL-like
motif PF|00657. ESTs gb|AV531882, gb|AV533240,
gb|AV534374, gb|>
Length = 1137
Score = 37.4 bits (85), Expect = 3.3
Identities = 27/101 (26%), Positives = 38/101 (36%)
Frame = +1
Query: 9895 PQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYLWGCS*GHCPGSPSHP 10074
PQP P+ +P P+ P P S+ +APP P CP P P
Sbjct: 63 PQPWPMN---PPTPDPSPKPVAPPGP-------SSKPVAPPGP-------SPCPSPPPKP 105
Query: 10075 VHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DP 10197
+ +P+ C + Q + PP+ PP P P P
Sbjct: 106 QPKPPPAPSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKP 146
>emb|CAA26904.1| (X03128) put. RNA polymerase II largest subunit [Saccharomyces
cerevisiae]
Length = 1726
Score = 37.0 bits (84), Expect = 4.4
Identities = 42/141 (29%), Positives = 57/141 (39%), Gaps = 4/141 (2%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1580 TSPSYSPTSPSYSPTSPSYSPMSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPSYSP-- 1635
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSP-SGSRPPLRPSRR-- 10205
T PS S S + P PAY + PS + +SP S S P PS
Sbjct: 1636 ---------TSPSYSPTSP-SYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT 1685
Query: 10206 QKHFTPQSPTSPTSGPGIGQG 10268
+++P SP+ + PG G
Sbjct: 1686 SPNYSPTSPSYSPTSPGYSPG 1706
>pir||S50755 hypothetical protein VSP-3 - Chlamydomonas reinhardtii
gb|AAB53953.1| (L29029) amino acid feature: Rod protein domain, aa 266 .. 468; amino
acid feature: globular protein domain, aa 32 .. 265
[Chlamydomonas reinhardtii]
Length = 473
Score = 36.6 bits (83), Expect = 5.7
Identities = 36/124 (29%), Positives = 44/124 (35%)
Frame = +1
Query: 9850 KHQPCLSPLPSGL*SPQPDPLKCKAQQSPQPAISPTGYPSSLI*ASCLSASLLAPPWPYL 10029
K P SP P SP P P KA SP P+ SP+ P AS P P +
Sbjct: 294 KASPSPSPSPKASPSPSPSP---KASPSPSPSPSPSPSP---------KASPSPSPSPSV 341
Query: 10030 WGCS*GHCPGSPSHPVHRSLSSPTGLHCQPMVVRHQARSAAPPSGPPRAPGPH*DPLGGR 10209
P S P SP+ P+ + S +P P +P P P
Sbjct: 342 Q-------PASKPSPSPSPSPSPSPRPSPPLPSPSPSPSPSPSPSPSPSPKPSPSPSPSP 394
Query: 10210 STSP 10221
S SP
Sbjct: 395 SPSP 398
>ref|NP_033115.1| RNA polymerase II 1 [Mus musculus]
pir||A28490 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - mouse
gb|AAA40071.1| (M12130) RNA polymerase II [Mus musculus]
Length = 1932
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S T S P
Sbjct: 1671 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1726
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P Y + PS + +SP+ P PS
Sbjct: 1727 ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1772
Query: 10215 FTPQSPTSPTSGP 10253
+TPQSPT S P
Sbjct: 1773 YTPQSPTYTPSSP 1785
>sp|P11414|RPB1_CRIGR DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
pir||A27677 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - Chinese
hamster (fragment)
gb|AAA37008.1| (M19538) RNA polymerase II largest subunit [Cricetulus griseus]
Length = 467
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S T S P
Sbjct: 206 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 261
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P Y + PS + +SP+ P PS
Sbjct: 262 ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 307
Query: 10215 FTPQSPTSPTSGP 10253
+TPQSPT S P
Sbjct: 308 YTPQSPTYTPSSP 320
>emb|CAA60502.1| (X86819) Microtubule-associated protein 4 [Gallus gallus]
Length = 928
Score = 36.6 bits (83), Expect = 5.7
Identities = 30/97 (30%), Positives = 42/97 (42%)
Frame = +3
Query: 5712 KKIQAASLENRKQHQIPPYSTPTLACLKAPT*KPRVSQKNSWSATACSLLLSPLPPARIT 5891
K A S + R PP S P A ++ T PR + + +ATA + + PP R T
Sbjct: 689 KVTDAKSPDKRTSLSKPPSSAPRAAA-RSTTATPRTTATSPVTATAGAKSTTASPPKRPT 747
Query: 5892 SYKIPAQPTDFSPCSPSSIPLTQLLASSASWTFVPSS 6002
S K A+P D + S SA+ + V SS
Sbjct: 748 SIKTDAKPADAKKTTAKSPSADLARPKSAAGSTVKSS 784
>ref|NP_010141.1| RNA polymerase II large subunit; Rpo21p [Saccharomyces cerevisiae]
sp|P04050|RPB1_YEAST DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (B220)
pir||RNBY2L DNA-directed RNA polymerase (EC 2.7.7.6) II 215K chain - yeast
(Saccharomyces cerevisiae)
emb|CAA65619.1| (X96876) RPB1 [Saccharomyces cerevisiae]
emb|CAA98713.1| (Z74188) ORF YDL140c [Saccharomyces cerevisiae]
Length = 1733
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/141 (29%), Positives = 57/141 (39%), Gaps = 4/141 (2%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S + S P
Sbjct: 1587 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPSYSPTSPSYSPTSPSYSP-- 1642
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSP-SGSRPPLRPSRR-- 10205
T PS S S + P PAY + PS + +SP S S P PS
Sbjct: 1643 ---------TSPSYSPTSP-SYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT 1692
Query: 10206 QKHFTPQSPTSPTSGPGIGQG 10268
+++P SP+ + PG G
Sbjct: 1693 SPNYSPTSPSYSPTSPGYSPG 1713
>pir||I38186 RNA polymerase II largest subunit - human
emb|CAA52862.1| (X74874) RNA polymerase II largest subunit [Homo sapiens]
Length = 1970
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S T S P
Sbjct: 1709 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P Y + PS + +SP+ P PS
Sbjct: 1765 ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810
Query: 10215 FTPQSPTSPTSGP 10253
+TPQSPT S P
Sbjct: 1811 YTPQSPTYTPSSP 1823
>sp|P08775|RPB1_MOUSE DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
Length = 1970
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S T S P
Sbjct: 1709 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P Y + PS + +SP+ P PS
Sbjct: 1765 ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810
Query: 10215 FTPQSPTSPTSGP 10253
+TPQSPT S P
Sbjct: 1811 YTPQSPTYTPSSP 1823
>dbj|BAA22376.1| (D87293) RNA polymerase II largest subunit [Cricetulus griseus]
Length = 1970
Score = 36.6 bits (83), Expect = 5.7
Identities = 42/133 (31%), Positives = 53/133 (39%), Gaps = 1/133 (0%)
Frame = +3
Query: 9858 TLPLSTAQWPXVTPT*SS-EMQSPAISPTSNFPNWVPLKPYLSLLFICLTAGSSLALPVG 10034
T P + P +PT S SP+ SPTS P++ P P S T S P
Sbjct: 1709 TSPSYSPTSPSYSPTSPSYSPTSPSYSPTS--PSYSPTSPNYSPTSPNYTPTSPSYSP-- 1764
Query: 10035 LLLRALSWLTLPSCSQVSLFPHRPPLPAYGCSPPSQECCTAQWSPSGSRPPLRPSRRQKH 10214
T PS S S + P P Y + PS + +SP+ P PS
Sbjct: 1765 ---------TSPSYSPTSP-NYTPTSPNYSPTSPSYSPTSPSYSPTS--PSYSPS--SPR 1810
Query: 10215 FTPQSPTSPTSGP 10253
+TPQSPT S P
Sbjct: 1811 YTPQSPTYTPSSP 1823
Database: nr
Posted date: Jan 16, 2001 9:58 PM
Number of letters in database: 191,393,013
Number of sequences in database: 605,060
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: -180905813
Number of Sequences: 605060
Number of extensions: 108132989
Number of successful extensions: 373588
Number of sequences better than 10.0: 124
Number of HSP's better than 10.0 without gapping: 189936
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 364078
length of query: 3440
length of database: 191,393,013
effective HSP length: 59
effective length of query: 3380
effective length of database: 155,694,473
effective search space: 526247318740
effective search space used: 526247318740
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)