analysis of sequence from tem46 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTLVHSLFLTDLY KDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDE SGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEI DKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGSNTVVEAKGQRVRGRLYPWGIVEVENQAHCDF VKLRNMLIRTHMHDLKDVTCDVHYENYRAHCIQQMTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDE ELRRMQEMLQRMKQQMQDQ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > Tem46t_gi|9945439|ref|NP_002679.2| . . . . . 1 MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVA 50 ___HHHHHH________HHHHHHHHH___________________EEEEE . . . . . 51 GESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGV 100 E______EEEEHHHHHHHH_HHHHHHHHHHHHHHHHHHHHHHHHHHH___ . . . . . 101 KLKLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQD 150 _EEEEEE___________________HHHHHHHHHHHHH___________ . . . . . 151 NRVHCCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIR 200 ___EEEEEEE___________HHHHHHHHHHH__HHHHHHH____HHHHH . . . . . 201 KLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGSN 250 HHHHHHHHHHHH___EEEEE__________HHHHHHHHHH____EEE___ . . . . . 251 TVVEAKGQRVRGRLYPWGIVEVENQAHCDFVKLRNMLIRTHMHDLKDVTC 300 _EEEE__HHHHH______EEEEE_____HHHHHHHHHHHHH____EEEEE . . . . . 301 DVHYENYRAHCIQQMTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDE 350 EE____HHHHHHHHHHHH_____________________HHHHHHHHHHH . 351 ELRRMQEMLQRMKQQMQDQ 369 HHHHHHHHHHHHHHHH___ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 39.0 % beta-contents : 25.2 % coil-contents : 35.8 % class : mixed method : 2 alpha-contents : 44.7 % beta-contents : 5.6 % coil-contents : 49.7 % class : alpha ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -13.83 -0.45 -0.72 -0.09 -4.00 0.00 -24.00 0.00 -2.83 -7.93 -2.29 -12.00 -12.00 0.00 -12.00 0.00 -92.12 -13.91 -2.74 -2.53 -4.08 0.00 0.00 0.00 0.00 -0.07 -6.27 -2.29 -12.00 -12.00 -24.00 -12.00 0.00 -91.89 ID: Tem46t_gi|9945439|ref|NP_002679.2| AC: xxx Len: 350 1:I 322 Sc: -91.89 Pv: 9.525351e-01 NO_GPI_SITE GPI: learning from protozoa -27.71 -0.43 -0.19 -0.56 -4.00 0.00 -24.00 0.00 -1.34 -6.82 -8.49 -12.00 -12.00 0.00 -12.00 0.00 -109.54 -17.90 -0.49 -0.03 0.00 -4.00 0.00 -20.00 0.00 -0.70 -6.82 -8.49 -12.00 -12.00 -12.00 -12.00 0.00 -106.43 ID: Tem46t_gi|9945439|ref|NP_002679.2| AC: xxx Len: 350 1:I 338 Sc: -106.43 Pv: 9.376300e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem46t_gi|9 0.489 37 Y 0.119 116 N 0.300 166 N 0.060 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem46t_gi|9 0.516 167 Y 0.276 167 N 0.687 41 N 0.132 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem46t_gi|9 0.498 192 Y 0.282 340 N 0.914 4 N 0.121 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] 1-197 MSTGLRYKSKLATPEDKQDIDKQYVGFATL PNQVHRKSVKKGFDFTLMVAGESGLGKSTL VHSLFLTDLYKDRKLLSAEERISQTVEILK HTVDIEEKGVKLKLTIVDTPGFGDAVNNTE CWKPITDYVDQQFEQYFRDESGLNRKNIQD NRVHCCLYFISPFGHGLRPVDVGFMKALHE KVNIVPLIAKADCLVPS eirklkerireeidk 198-212 213-369 FGIHVYQFPECDSDEDEDFKQQDRELKESA PFAVIGSNTVVEAKGQRVRGRLYPWGIVEV ENQAHCDFVKLRNMLIRTHMHDLKDVTCDV HYENYRAHCIQQMTSKLTQDSRMESPIPIL PLPTPDAETEKLIRMKDEELRRMQEMLQRM KQQMQDQ low complexity regions: SEG 25 3.0 3.3 >Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] 1-345 MSTGLRYKSKLATPEDKQDIDKQYVGFATL PNQVHRKSVKKGFDFTLMVAGESGLGKSTL VHSLFLTDLYKDRKLLSAEERISQTVEILK HTVDIEEKGVKLKLTIVDTPGFGDAVNNTE CWKPITDYVDQQFEQYFRDESGLNRKNIQD NRVHCCLYFISPFGHGLRPVDVGFMKALHE KVNIVPLIAKADCLVPSEIRKLKERIREEI DKFGIHVYQFPECDSDEDEDFKQQDRELKE SAPFAVIGSNTVVEAKGQRVRGRLYPWGIV EVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESPIP ILPLPTPDAETEKLI rmkdeelrrmqemlqrmkqqmqdq 346-369 low complexity regions: SEG 45 3.4 3.75 >Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] 1-369 MSTGLRYKSKLATPEDKQDIDKQYVGFATL PNQVHRKSVKKGFDFTLMVAGESGLGKSTL VHSLFLTDLYKDRKLLSAEERISQTVEILK HTVDIEEKGVKLKLTIVDTPGFGDAVNNTE CWKPITDYVDQQFEQYFRDESGLNRKNIQD NRVHCCLYFISPFGHGLRPVDVGFMKALHE KVNIVPLIAKADCLVPSEIRKLKERIREEI DKFGIHVYQFPECDSDEDEDFKQQDRELKE SAPFAVIGSNTVVEAKGQRVRGRLYPWGIV EVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESPIP ILPLPTPDAETEKLIRMKDEELRRMQEMLQ RMKQQMQDQ low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTL VHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTIVDTPGFGDAVNNTE CWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVGFMKALHE KVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPecdsdededFKQQDRELKE SAPFAVIGSNTVVEAKGQRVRGRLYPWGIVEVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESpipilplptpdaeteKLIRMKDEELRRmqemlq rmkqqmqdq 1 - 221 MSTGLRYKSK LATPEDKQDI DKQYVGFATL PNQVHRKSVK KGFDFTLMVA GESGLGKSTL VHSLFLTDLY KDRKLLSAEE RISQTVEILK HTVDIEEKGV KLKLTIVDTP GFGDAVNNTE CWKPITDYVD QQFEQYFRDE SGLNRKNIQD NRVHCCLYFI SPFGHGLRPV DVGFMKALHE KVNIVPLIAK ADCLVPSEIR KLKERIREEI DKFGIHVYQF P 222 - 230 ecdsdeded 231 - 327 FKQQDRELKE SAPFAVIGSN TVVEAKGQRV RGRLYPWGIV EVENQAHCDF VKLRNMLIRT HMHDLKDVTC DVHYENYRAH CIQQMTSKLT QDSRMES 328 - 342 pip ilplptpdae te 343 - 354 KLIRMKDE ELRR 355 - 369 mqemlq rmkqqmqdq low complexity regions: DUST >Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTL VHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTIVDTPGFGDAVNNTE CWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVGFMKALHE KVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKE SAPFAVIGSNTVVEAKGQRVRGRLYPWGIVEVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDEELRRMQEMLQ RMKQQMQDQ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for Tem46t_gi|9945439|ref|NP_002679.2| sequence: 350 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MSTGLRYKSK LATPEDKQDI DKQYVGFATL PNQVHRKSVK KGFDFTLMVA GESGLGKSTL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 VHSLFLTDLY KDRKLLSAEE RISQTVEILK HTVDIEEKGV KLKLTIVDTP GFGDAVNNTE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ 1222222222 2222211111 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 CWKPITDYVD QQFEQYFRDE SGLNRKNIQD NRVHCCLYFI SPFGHGLRPV DVGFMKALHE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 KVNIVPLIAK ADCLVPSEIR KLKERIREEI DKFGIHVYQF PECDSDEDED FKQQDRELKE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~8999 9999999999 985~~~~~~~ ~~~~~12222 2222222222 * 14 M'95 -w local . | . | . | . | . | . 300 SAPFAVIGSN TVVEAKGQRV RGRLYPWGIV EVENQAHCDF VKLRNMLIRT HMHDLKDVTC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 11~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | DVHYENYRAH CIQQMTSKLT QDSRMESPIP ILPLPTPDAE TEKLIRMKDE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem46.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem46.___inter___ (1 sequences) MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVA GESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGV KLKLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQD NRVHCCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIR KLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGSN TVVEAKGQRVRGRLYPWGIVEVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDE ELRRMQEMLQRMKQQMQDQ (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment Loop length 369 K+R profile + CYT-EXT prof -0.33 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): 0.0442 NEG: 59.0000 POS: 54.0000 -> Orientation: N-in CYT-EXT difference: -0.33 -> Orientation: N-in ---------------------------------------------------------------------- "tem46" 369 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem46.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem46.___inter___ (1 sequences) MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVA GESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGV KLKLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQD NRVHCCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIR KLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGSN TVVEAKGQRVRGRLYPWGIVEVENQAHCDFVKLRNMLIRTHMHDLKDVTC DVHYENYRAHCIQQMTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDE ELRRMQEMLQRMKQQMQDQ (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment Loop length 369 K+R profile + CYT-EXT prof -0.33 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): 0.0442 NEG: 59.0000 POS: 54.0000 -> Orientation: N-in CYT-EXT difference: -0.33 -> Orientation: N-in ---------------------------------------------------------------------- "tem46" 369 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem46.___saps___ SAPS. Version of April 11, 1996. Date run: Tue Oct 31 19:00:38 2000 File: /people/maria/tem46.___saps___ ID Tem46t_gi|9945439|ref|NP_002679.2| DE peanut (Drosophila)-like 1 [Homo sapiens] number of residues: 369; molecular weight: 42.8 kdal 1 MSTGLRYKSK LATPEDKQDI DKQYVGFATL PNQVHRKSVK KGFDFTLMVA GESGLGKSTL 61 VHSLFLTDLY KDRKLLSAEE RISQTVEILK HTVDIEEKGV KLKLTIVDTP GFGDAVNNTE 121 CWKPITDYVD QQFEQYFRDE SGLNRKNIQD NRVHCCLYFI SPFGHGLRPV DVGFMKALHE 181 KVNIVPLIAK ADCLVPSEIR KLKERIREEI DKFGIHVYQF PECDSDEDED FKQQDRELKE 241 SAPFAVIGSN TVVEAKGQRV RGRLYPWGIV EVENQAHCDF VKLRNMLIRT HMHDLKDVTC 301 DVHYENYRAH CIQQMTSKLT QDSRMESPIP ILPLPTPDAE TEKLIRMKDE ELRRMQEMLQ 361 RMKQQMQDQ -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 14( 3.8%); C : 8( 2.2%); D : 29( 7.9%); E : 30( 8.1%); F : 15( 4.1%) G : 18( 4.9%); H : 12( 3.3%); I : 21( 5.7%); K : 31( 8.4%); L : 32( 8.7%) M : 12( 3.3%); N : 11( 3.0%); P : 16( 4.3%); Q : 22( 6.0%); R : 23( 6.2%) S : 17( 4.6%); T : 19( 5.1%); V : 27( 7.3%); W : 2( 0.5%); Y : 10( 2.7%) KR : 54 ( 14.6%); ED : 59 ( 16.0%); AGP : 48 ( 13.0%); KRED : 113 ( 30.6%); KR-ED : -5 ( -1.4%); FIKMNY : 100 ( 27.1%); LVIFM : 107 ( 29.0%); ST : 36 ( 9.8%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 00000+0+0+ 0000--+0-0 -+00000000 00000++00+ +00-000000 0-0000+000 61 0000000-00 +-++0000-- +00000-00+ 000-0--+00 +0+0000-00 000-00000- 121 00+000-00- 000-000+-- 0000++000- 0+00000000 0000000+00 -0000+000- 181 +00000000+ 0-00000-0+ +0+-+0+--0 -+00000000 0-0-0----- 0+00-+-0+- 241 0000000000 000-0+00+0 +0+0000000 -0-00000-0 0+0+0000+0 000-0+-000 301 -000-00+00 0000000+00 0-0+0-0000 0000000-0- 0-+00+0+-- -0++00-000 361 +0+0000-0 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 12/30 or 16/45 or 19/60): none Negative charge clusters (cmin = 12/30 or 17/45 or 20/60): none Mixed charge clusters (cmin = 19/30 or 26/45 or 32/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 6 | 9 | 26 | 10 | 10 | 14 | 11 | 11 | 15 | 7 | 9 | lmin1 7 | 7 | 11 | 31 | 12 | 12 | 17 | 14 | 14 | 18 | 9 | 11 | lmin2 8 | 8 | 12 | 34 | 13 | 14 | 19 | 15 | 16 | 21 | 10 | 13 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 4: 0 - runs >= 4: 1, at 226; * runs >= 6: 0 0 runs >= 17: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-120-C-33-C-C-36-C-29-C-54-C-21-C-10-C-58-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-34-H-26-H-28-H-29-C-32-H-C-C-8-H-13-H-13-C-22-H-6-C-53-H-C-12-H-1-H-6-C-2-H-6-H-C-58-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 4 Aligned matching blocks: [ 52- 55] ESGL [ 140- 143] ESGL B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 8 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 14 (Expected range: 7-- 34) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 3 (6-10) 1 (11-20) 4 (>=21) 7 3. Clusters of amino acid multiplets (cmin = 8/30 or 10/45 or 12/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 13 (Expected range: 3-- 26) 6 +plets (f+: 14.6%), 7 -plets (f-: 16.0%) Total number of charge altplets: 15 (Critical number: 30) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 4 (6-10) 1 (11-20) 5 (>=21) 4 3. Long charge multiplets (>= 5; Letter/Length/Position): -/5/226 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 86- 113 7 V...... 4 4 0 224- 231 2 D. 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10) Location Period Element Copies Core Errors 73- 114 7 *...00. 6 6 /0/./././2/1/./ 201- 212 2 *. 6 6 0 226- 230 1 - 5 5 0 -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 19- 21 (1.) D( 2)D 30 of 30 0.0058 large minimal spacing 126- 251 (3.) T( 125)T 1 of 20 0.0060 large maximal spacing 224- 226 (3.) D( 2)D 26 of 30 0.0058 matching minimum 226- 228 (3.) D( 2)D 27 of 30 0.0058 matching minimum 228- 230 (3.) D( 2)D 28 of 30 0.0058 matching minimum 368- 370 (1.) D( 2)D 29 of 30 0.0058 matching minimum ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem46 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- GTP_CDC Cell division protein 629.9 1.4e-185 1 kinesin Kinesin motor domain 0.1 42 1 asp Eukaryotic aspartyl protease -0.9 69 1 DUF19 Domain of unknown function DUF19 -1.1 59 1 PI3K_p85B PI3-kinase family, p85-binding domain -21.4 17 1 ras Ras family -137.2 5.6 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- asp 1/1 19 30 .. 345 356 .] -0.9 69 ras 1/1 46 249 .. 1 198 [] -137.2 5.6 PI3K_p85B 1/1 188 259 .. 1 80 [] -21.4 17 DUF19 1/1 277 283 .. 167 173 .] -1.1 59 GTP_CDC 1/1 41 321 .. 1 292 [] 629.9 1.4e-185 kinesin 1/1 345 366 .. 395 417 .] 0.1 42 Alignments of top-scoring domains: asp: domain 1 of 1, from 19 to 30: score -0.9, E = 69 *->DrdNnrvGlApa<-* D+d + vG+A++ Tem46t_gi| 19 DIDKQYVGFATL 30 ras: domain 1 of 1, from 46 to 249: score -137.2, E = 5.6 *->KlVliGDsGVGKSsLliR.FtdnkFveeYipTIGvDFytktvevD.. l + G+sG GKS+L++ F ++ + + ++ ++tve+ ++ Tem46t_gi| 46 TLMVAGESGLGKSTLVHSlFLTDLYKDRKLLSA-EERISQTVEILkh 91 .......GktvkLqIWDTAGQERFralrpaYYRgAqGfLLVYDITsrdSF + + +++G + kL I DT G +a +++ + Tem46t_gi| 92 tvdieekGVKLKLTIVDTPG------FGDA-------------VNNTECW 122 envkkWl.eeilrhadkdenvpivLVGNKcDLeddedleltegq...... + ++++++++++++ + e g ++++ + Tem46t_gi| 123 KPITDYVdQQFEQYFR--D-------------ES--------GLnrkniq 149 .................kRvVsteeGealAkelgAlpFmETSAKtntNVe +++ + ++ +++ R+V+ +al+++ + +p + AK V Tem46t_gi| 150 dnrvhcclyfispfghgLRPVDVGFMKALHEKVNIVPLI---AKADCLVP 196 eaFeelareilkkvse...............vnvnldqpakkkk...skC l ++i++ +++ + + + ++ +++++ ++q+ + k + + Tem46t_gi| 197 SEIRKLKERIREEIDKfgihvyqfpecdsdeDEDFKQQDRELKEsapFAV 246 cil<-* + + Tem46t_gi| 247 IGS 249 PI3K_p85B: domain 1 of 1, from 188 to 259: score -21.4, E = 17 *->lveLrvsrEatlsvIKheLfkqakkmPlgtllkdaqsYvFvqvnqta ++ ++ + K +++ k+ + + +++ Tem46t_gi| 188 IAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPECD--------SD 226 eiEelfDETrRLsdlrlfGTfPiLkLverdgtn<-* e E + +r L + +f + +ve g + Tem46t_gi| 227 EDEDFKQQDRELKESAPFAVIGSNTVVEAKGQR 259 DUF19: domain 1 of 1, from 277 to 283: score -1.1, E = 59 *->qCDFSrL<-* +CDF +L Tem46t_gi| 277 HCDFVKL 283 GTP_CDC: domain 1 of 1, from 41 to 321: score 629.9, E = 1.4e-185 *->kGfdFTLMVVGeSGLGKtTlINTLFltdLidangvanDsreidgase kGfdFTLMV+GeSGLGK+Tl+ +LFltdL+ + r+ ++a+e Tem46t_gi| 41 KGFDFTLMVAGESGLGKSTLVHSLFLTDLYKD-------RKLLSAEE 80 tkikktveIkeitkveiEEdGvkLnLTViDTPGFGDaiDNskcWepIveY +i +tveI +++v iEE+GvkL+LT++DTPGFGDa++N++cW+pI +Y Tem46t_gi| 81 -RISQTVEIL-KHTVDIEEKGVKLKLTIVDTPGFGDAVNNTECWKPITDY 128 IdeQheqYLrqEsrinRtkivDnRVHcCLYFIsPtGhgLkpLDvefMKkL +d+Q+eqY+r+Es++nR++i+DnRVHcCLYFIsP+GhgL+p Dv fMK+L Tem46t_gi| 129 VDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVGFMKAL 178 sekVNlIPVIAKADtLTadElqefKkrIreeierqnIkIYkFPdeeedeG +ekVN++P IAKAD+L + E+ +K+rIreei++++I +Y+FP++++de Tem46t_gi| 179 HEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPECDSDE- 227 DEefkeqtqqLkssiPFAIVGSneeiengdGekVRGRkYPWGvVEVENps DE+fk+q+++Lk+s PFA++GSn+++e+ +G++VRGR+YPWG+VEVEN Tem46t_gi| 228 DEDFKQQDRELKESAPFAVIGSNTVVEA-KGQRVRGRLYPWGIVEVENQA 276 HCDFvkLRnlLirtHLqDLketTeeilYEnYRsekLsalglkaen<-* HCDFvkLRn+LirtH++DLk++T +++YEnYR+ +++ + +k Tem46t_gi| 277 HCDFVKLRNMLIRTHMHDLKDVTCDVHYENYRAHCIQQMTSKLTQ 321 kinesin: domain 1 of 1, from 345 to 366: score 0.1, E = 42 *->edpkasallRrleeeiqkLktel<-* +++k+ ++lRr++e q +k+++ Tem46t_gi| 345 IRMKD-EELRRMQEMLQRMKQQM 366 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem46 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- GTP_CDC Cell division protein 629.9 1.4e-185 1 FtsK_SpoIIIE FtsK/SpoIIIE family 1.8 28 1 VHS VHS domain 1.4 41 1 AAA ATPases associated with various cellul 0.4 41 1 kinesin Kinesin motor domain 0.1 42 1 V_ATPase_sub_a V-type ATPase 116kDa subunit family -0.5 31 1 asp Eukaryotic aspartyl protease -0.9 69 1 DUF19 Domain of unknown function DUF19 -1.1 59 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- asp 1/1 19 30 .. 345 356 .] -0.9 69 AAA 1/1 46 64 .. 1 19 [. 0.4 41 FtsK_SpoIIIE 1/1 47 64 .. 48 65 .. 1.8 28 V_ATPase_sub_a 1/1 200 222 .. 243 265 .. -0.5 31 DUF19 1/1 277 283 .. 167 173 .] -1.1 59 GTP_CDC 1/1 41 321 .. 1 292 [] 629.9 1.4e-185 VHS 1/1 349 365 .. 138 155 .] 1.4 41 kinesin 1/1 345 366 .. 395 417 .] 0.1 42 Alignments of top-scoring domains: asp: domain 1 of 1, from 19 to 30: score -0.9, E = 69 *->DrdNnrvGlApa<-* D+d + vG+A++ Tem46t_gi| 19 DIDKQYVGFATL 30 AAA: domain 1 of 1, from 46 to 64: score 0.4, E = 41 *->gvLLyGPPGtGKTlLAkav<-* + G G GK++L++++ Tem46t_gi| 46 TLMVAGESGLGKSTLVHSL 64 FtsK_SpoIIIE: domain 1 of 1, from 47 to 64: score 1.8, E = 28 *->lLiaGatGSGKSvflntl<-* l++aG++G GKS+ +++l Tem46t_gi| 47 LMVAGESGLGKSTLVHSL 64 V_ATPase_sub_a: domain 1 of 1, from 200 to 222: score -0.5, E = 31 *->keildkvkkIlesfgfelydvpe<-* +++++++++ +++fg ++y++pe Tem46t_gi| 200 RKLKERIREEIDKFGIHVYQFPE 222 DUF19: domain 1 of 1, from 277 to 283: score -1.1, E = 59 *->qCDFSrL<-* +CDF +L Tem46t_gi| 277 HCDFVKL 283 GTP_CDC: domain 1 of 1, from 41 to 321: score 629.9, E = 1.4e-185 *->kGfdFTLMVVGeSGLGKtTlINTLFltdLidangvanDsreidgase kGfdFTLMV+GeSGLGK+Tl+ +LFltdL+ + r+ ++a+e Tem46t_gi| 41 KGFDFTLMVAGESGLGKSTLVHSLFLTDLYKD-------RKLLSAEE 80 tkikktveIkeitkveiEEdGvkLnLTViDTPGFGDaiDNskcWepIveY +i +tveI +++v iEE+GvkL+LT++DTPGFGDa++N++cW+pI +Y Tem46t_gi| 81 -RISQTVEIL-KHTVDIEEKGVKLKLTIVDTPGFGDAVNNTECWKPITDY 128 IdeQheqYLrqEsrinRtkivDnRVHcCLYFIsPtGhgLkpLDvefMKkL +d+Q+eqY+r+Es++nR++i+DnRVHcCLYFIsP+GhgL+p Dv fMK+L Tem46t_gi| 129 VDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVGFMKAL 178 sekVNlIPVIAKADtLTadElqefKkrIreeierqnIkIYkFPdeeedeG +ekVN++P IAKAD+L + E+ +K+rIreei++++I +Y+FP++++de Tem46t_gi| 179 HEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPECDSDE- 227 DEefkeqtqqLkssiPFAIVGSneeiengdGekVRGRkYPWGvVEVENps DE+fk+q+++Lk+s PFA++GSn+++e+ +G++VRGR+YPWG+VEVEN Tem46t_gi| 228 DEDFKQQDRELKESAPFAVIGSNTVVEA-KGQRVRGRLYPWGIVEVENQA 276 HCDFvkLRnlLirtHLqDLketTeeilYEnYRsekLsalglkaen<-* HCDFvkLRn+LirtH++DLk++T +++YEnYR+ +++ + +k Tem46t_gi| 277 HCDFVKLRNMLIRTHMHDLKDVTCDVHYENYRAHCIQQMTSKLTQ 321 VHS: domain 1 of 1, from 349 to 365: score 1.4, E = 41 *->dpdlsyivdlyklLkrkG<-* d++l+ ++++ +++k ++ Tem46t_gi| 349 DEELRRMQEMLQRMK-QQ 365 kinesin: domain 1 of 1, from 345 to 366: score 0.1, E = 42 *->edpkasallRrleeeiqkLktel<-* +++k+ ++lRr++e q +k+++ Tem46t_gi| 345 IRMKD-EELRRMQEMLQRMKQQM 366 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem46 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Tue Oct 31 19:02:46 2000 Sequence file: tem46 ---------------------------------------- Sequence Tem46t_gi|9945439|ref|NP_002679.2| (369 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 117: NNTE Total matches: 1 Matching pattern PS00005 PKC_PHOSPHO_SITE: 38: SVK 316: TSK 341: TEK Total matches: 3 Matching pattern PS00006 CK2_PHOSPHO_SITE: 13: TPED 77: SAEE 105: TIVD 225: SDED 251: TVVE 323: SRME Total matches: 6 Matching pattern PS00008 MYRISTYL: 54: GLGKST Total matches: 1 Matching pattern PS00017 ATP_GTP_A: 51: GESGLGKS Total matches: 1 Total no of hits in this sequence: 12 ======================================== 1314 pattern(s) searched in 1 sequence(s), 369 residues. Total no of hits in all sequences: 12. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem46 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 369 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem46 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem46 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] (369 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value SGTP Small GTPAses 30 0.004 AAA AAA+ ATPase Module 23 0.88 DSP Dual specificity protein phosphatase 22 1.7 AP2 A plant specific DNA binding domain (Apetala 2 like) 21 2.5 DHHC Novel zinc finger domain with DHHC signature 21 2.8 FYVE Zinc Finger domain involved in PtdIns(3)P binding 21 3.3 MATH The Meprin associated TRAF homology domain 19 8.9 >SGTP Small GTPAses Length = 164 Score = 30.2 bits (68), Expect = 0.004 Identities = 20/67 (29%), Positives = 31/67 (45%), Gaps = 9/67 (13%) Query: 45 FTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKL 104 F +V G +G GKS L+H F+ +K + + T+ + + I G +KL Sbjct: 1 FKFLVIGNAGTGKSCLLHQ-FIEKKFK--------DDSNHTIGVEFGSKIINVGGKYVKL 51 Query: 105 TIVDTPG 111 I DT G Sbjct: 52 QIWDTAG 58 >AAA AAA+ ATPase Module Length = 298 Score = 22.5 bits (47), Expect = 0.88 Identities = 6/33 (18%), Positives = 16/33 (48%) Query: 32 NQVHRKSVKKGFDFTLMVAGESGLGKSTLVHSL 64 + H +++ +++ G G GK+ L ++ Sbjct: 68 HPEHFEALGIAQPKGVLLYGPPGTGKTLLARAV 100 >DSP Dual specificity protein phosphatase Length = 185 Score = 21.8 bits (46), Expect = 1.7 Identities = 6/36 (16%), Positives = 11/36 (29%) Query: 315 MTSKLTQDSRMESPIPILPLPTPDAETEKLIRMKDE 350 M + +S I L + D + L + Sbjct: 69 MHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFER 104 >AP2 A plant specific DNA binding domain (Apetala 2 like) Length = 218 Score = 20.9 bits (43), Expect = 2.5 Identities = 7/45 (15%), Positives = 14/45 (30%), Gaps = 4/45 (8%) Query: 225 SDEDEDFKQQDRELKESAPFAVIGSNTVVEAKGQRVRG-RLYPWG 268 S+ + ++ + ++ K RG R WG Sbjct: 4 SESTKSWEASAVRQENEEEKKKPVKDSG---KHPVYRGVRKRNWG 45 >DHHC Novel zinc finger domain with DHHC signature Length = 217 Score = 20.9 bits (43), Expect = 2.8 Identities = 7/21 (33%), Positives = 12/21 (56%), Gaps = 2/21 (9%) Query: 254 EAKGQRVR--GRLYPWGIVEV 272 E +GQR R G +P +++ Sbjct: 33 ELQGQRSRRNGWSWPPHPLQI 53 >FYVE Zinc Finger domain involved in PtdIns(3)P binding Length = 99 Score = 20.8 bits (43), Expect = 3.3 Identities = 6/34 (17%), Positives = 12/34 (34%) Query: 122 WKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHC 155 WK ++ ++ RK + + HC Sbjct: 27 WKTSDCCQKCNQPFFWNLQAMWQRKVVGLRQHHC 60 >MATH The Meprin associated TRAF homology domain Length = 209 Score = 19.4 bits (40), Expect = 8.9 Identities = 3/22 (13%), Positives = 11/22 (49%) Query: 85 TVEILKHTVDIEEKGVKLKLTI 106 V +++ +V+I + + + Sbjct: 160 EVSVVQDSVNISGQNTMNMVKV 181 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 7 Number of calls to ALIGN: 7 Length of query: 369 Total length of test sequences: 20182 Effective length of test sequences: 16536.0 Effective search space size: 5530670.8 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem46t_gi|9945439|ref|NP_002679.2| peanut (Drosophila)-like 1 [Homo sapiens] (369 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|2814284 [1..224] P-loop containing nucleotide triphosphat... 55 3e-09 gi|1301656 [50..208] P-loop containing nucleotide triphospha... 48 3e-07 gi|729400 [38..325] P-loop containing nucleotide triphosphat... 45 1e-06 gi|1695241 [169..336] P-loop containing nucleotide triphosph... 37 4e-04 gi|1572819 [7..164] P-loop containing nucleotide triphosphat... 35 0.003 gi|1666073 [6..176] P-loop containing nucleotide triphosphat... 31 0.035 gi|4006 [276..439] P-loop containing nucleotide triphosphate... 29 0.11 gi|1086887 [26..189] P-loop containing nucleotide triphospha... 28 0.31 gi|2649774 [45..384] Periplasmic binding protein-like I 27 0.46 gi|2132165 [3..190] P-loop containing nucleotide triphosphat... 27 0.47 gi|2127787 [36..400] Periplasmic binding protein-like I 27 0.71 gi|2494396 [4..243] alpha/beta-Hydrolases 26 1.0 gi|137196 [30..266] P-loop containing nucleotide triphosphat... 25 1.8 gi|730933 [142..476] Cysteine proteinases 25 2.3 gi|1001767 [11..214] P-loop containing nucleotide triphospha... 25 2.5 gi|1487871 [16..342] Periplasmic binding protein-like I 25 2.6 gi|1495469 [3..289] P-loop containing nucleotide triphosphat... 25 2.7 gi|2493387 [94..456] Cytochrome P450 25 2.8 gi|2688657 [81..435] Phosphofructokinase 24 2.9 gi|1531539 [70..226] P-loop containing nucleotide triphospha... 24 3.9 gi|2647961 [96..299] P-loop containing nucleotide triphospha... 24 4.7 gi|134579 [59..285] P-loop containing nucleotide triphosphat... 24 4.9 gi|1280085 [6..334] beta/alpha (TIM)-barrel 23 5.8 gi|809280 [1..209] Glycinamide ribonucleotide transformylase 24 5.9 gi|1703166 [11..338] beta/alpha (TIM)-barrel 23 6.3 gi|2624005 [295..479] beta-Prism I 23 6.4 gi|1072178 [17..364] Phosphoglycerate mutase-like 23 6.5 gi|1350557 [4..271] P-loop containing nucleotide triphosphat... 23 6.7 gi|2650282 [2..154] P-loop containing nucleotide triphosphat... 23 7.3 gi|2635745 [372..520] Aldehyde oxidoreductase, molybdemum co... 23 8.6 gi|2652959 [169..353] P-loop containing nucleotide triphosph... 23 8.7 gi|2190590 [34..336] Periplasmic binding protein-like I 23 9.6 >gi|2814284 [1..224] P-loop containing nucleotide triphosphate hydrolases Length = 224 Score = 54.5 bits (129), Expect = 3e-09 Identities = 22/195 (11%), Positives = 51/195 (25%), Gaps = 15/195 (7%) Query: 45 FTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKL 104 +V G+ +GK+ L+ S + L + +I + + K Sbjct: 4 IKCVVVGDGAVGKTCLLLSYTTNAFPGEYILTVSHFLSGHLRKIRRKNL---------KK 54 Query: 105 TIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFG 164 + + + + + ++ + + D R L G Sbjct: 55 NLKKNLSILF---YLKNHEKLNSFFKKRQLLSVNVFDTYSTNVMVDGR-PINLSLWDTAG 110 Query: 165 HGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPECD 224 D + + + A + +R + + D Sbjct: 111 Q--DDYDQFRHLSFPQTDVFLVCFALNNPASFENVRAKWYPEVSHHCPNTPIILVGTKAD 168 Query: 225 SDEDEDFKQQDRELK 239 ED D ++ RE + Sbjct: 169 LREDRDTIERLRERR 183 >gi|1301656 [50..208] P-loop containing nucleotide triphosphate hydrolases Length = 159 Score = 47.9 bits (112), Expect = 3e-07 Identities = 18/157 (11%), Positives = 47/157 (29%), Gaps = 18/157 (11%) Query: 45 FTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKL 104 ++V G +GK+ ++ + + + + T+E + E + L Sbjct: 3 MRVVVVGGKKVGKTAILRQVACVEDVTN-------KPYEPTIEDTYQVLLEEPDKAREIL 55 Query: 105 TIVDTPGFGDAVNNT---------ECWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHC 155 + DT G + + + + D + + ++ Sbjct: 56 ILHDTAGVSNYGPIELKKAYVQAADAFVLVYSSADYESFNRVDLLKKWIDRQFGKDKKEV 115 Query: 156 CLYFISPFGHGLRPVDVGFMKALHEK--VNIVPLIAK 190 + ++ VD F + + V + + AK Sbjct: 116 PIVVLANMRDRPATVDSAFAHSWAAREKVKLFEVTAK 152 >gi|729400 [38..325] P-loop containing nucleotide triphosphate hydrolases Length = 288 Score = 45.3 bits (105), Expect = 1e-06 Identities = 20/199 (10%), Positives = 50/199 (25%), Gaps = 14/199 (7%) Query: 45 FTLMVAGESGLGKSTLVHSLF---------LTDLYKDRKLLSAEERISQTVEILKHTVDI 95 + + GK+T + D E+ S+ + I + Sbjct: 5 RNIGIIAHIDAGKTTTTERMLYYAGISKHIGDVDTGDTITDFLEQERSRGITIQSAAI-S 63 Query: 96 EEKGVKLKLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRD--ESGLNRKNIQDNRV 153 + ++DTPG D + D + ++ K + Sbjct: 64 FPWRNTFAINLIDTPGHIDFTFEVIRALKVIDSCVVILDAVAGVEAQTEKVWKQSKSKPK 123 Query: 154 HCCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAKADC--LVPSEIRKLKERIREEID 211 C + + G + P++ P+ + + + + ++ Sbjct: 124 ICFINKMDRMGASFNHTVNDLINKFMRGTTTKPVLVNIPYYRKQPTSNDYVFQGVIDVVN 183 Query: 212 KFGIHVYQFPECDSDEDED 230 + + DE Sbjct: 184 GKRLTWNPENPDEIIVDEL 202 >gi|1695241 [169..336] P-loop containing nucleotide triphosphate hydrolases Length = 168 Score = 37.4 bits (85), Expect = 4e-04 Identities = 14/168 (8%), Positives = 43/168 (25%), Gaps = 6/168 (3%) Query: 44 DFTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLK 103 L + G GK +L + + + + T + + V L+ Sbjct: 3 HVVLRIYGARNCGKKSLANR--IHHFATSMAPERVDPDENGNDYTKMTTFLLNGREVTLE 60 Query: 104 LTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNR--KNIQDNRVHCCLYFIS 161 + + T + + + + ++Q Y + L Sbjct: 61 ILLESTLENSPFRQSKTLYIVMYNMDNRQSFIYATQILERITLANLNNPVPLQLFLIGNK 120 Query: 162 PFGHGLRPVDVGFMKALHEK--VNIVPLIAKADCLVPSEIRKLKERIR 207 + + K++ + + + A + + ++ Sbjct: 121 CDLKRNQVISTNEGKSVARTFKCDFLEVSALLGMNTEETWTTILKELQ 168 >gi|1572819 [7..164] P-loop containing nucleotide triphosphate hydrolases Length = 158 Score = 34.5 bits (78), Expect = 0.003 Identities = 23/156 (14%), Positives = 40/156 (24%), Gaps = 21/156 (13%) Query: 45 FTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKL 104 +++ G+ G GK++LV SL + ++ V E + Sbjct: 4 VRIVLIGDEGCGKTSLVMSLLEDE-------WVDAVPRRLDRVLIPADVTPE----NVTT 52 Query: 105 TIVDTPGFGDAVNNTECWK----------PITDYVDQQFEQYFRDESGLNRKNIQDNRVH 154 +IVD + N +TD Q Sbjct: 53 SIVDLSIKEEDENWIVSEIRQANVICVVYSVTDESTVDGIQTKWLPLIRQSFGEYHETPV 112 Query: 155 CCLYFISPFGHGLRPVDVGFMKALHEKVNIVPLIAK 190 + S + M+A E V A+ Sbjct: 113 ILVGNKSDGTANNTDKILPIMEANTEVETCVECSAR 148 >gi|1666073 [6..176] P-loop containing nucleotide triphosphate hydrolases Length = 171 Score = 30.8 bits (68), Expect = 0.035 Identities = 11/81 (13%), Positives = 25/81 (30%) Query: 47 LMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTI 106 + V G G+GK+ ++ D + + V + D+ + + Sbjct: 2 VAVLGAPGVGKTAIIRQFLFGDYPERHRPTDGPRLYRPAVLLDGAVYDLSIRDGDVAGPG 61 Query: 107 VDTPGFGDAVNNTECWKPITD 127 G + + + TD Sbjct: 62 SSPGGPEEWPDAKDWSLQDTD 82 >gi|4006 [276..439] P-loop containing nucleotide triphosphate hydrolases Length = 164 Score = 29.2 bits (64), Expect = 0.11 Identities = 25/178 (14%), Positives = 50/178 (28%), Gaps = 31/178 (17%) Query: 47 LMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTI 106 L++ G +GKS+LV+SL D+ + I G K+ I Sbjct: 2 LVLLGAPNVGKSSLVNSLTNDDI--------SIVSDIPGTTRDSIDAMINVNGY--KVII 51 Query: 107 VDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFIS-PFGH 165 DT G + ++ I K + + + + Sbjct: 52 CDTAGIREKSSDKIEMLGIDRAK----------------KKSVQSDLCLFIVDPTDLSKL 95 Query: 166 GLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVYQFPEC 223 + I+ ++ K+D + E+ K+ ++ Y Sbjct: 96 LPEDILAHLSSKTFGNKRIIIVVNKSDLVSDDEMTKVL----NKLQTRLGSKYPILSV 149 >gi|1086887 [26..189] P-loop containing nucleotide triphosphate hydrolases Length = 164 Score = 27.6 bits (60), Expect = 0.31 Identities = 12/69 (17%), Positives = 24/69 (34%), Gaps = 9/69 (13%) Query: 47 LMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTI 106 ++V G GKST + + E++ + ++ V + V +L Sbjct: 2 IIVVGAKNAGKSTFIERVE-------FGKFHEEKQTEKLYRVIAKKVFEQTLTV--ELIE 52 Query: 107 VDTPGFGDA 115 D G + Sbjct: 53 KDLGGLTNE 61 >gi|2649774 [45..384] Periplasmic binding protein-like I Length = 340 Score = 27.3 bits (60), Expect = 0.46 Identities = 6/48 (12%), Positives = 17/48 (34%) Query: 200 RKLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVI 247 E+ +E G + + + DF +++ + P ++ Sbjct: 153 VLWGEKFKEYWKSKGGEIVAEAPVNYYTETDFTTHLQKVLAANPEVIL 200 >gi|2132165 [3..190] P-loop containing nucleotide triphosphate hydrolases Length = 188 Score = 27.3 bits (59), Expect = 0.47 Identities = 14/94 (14%), Positives = 28/94 (28%), Gaps = 7/94 (7%) Query: 36 RKSVKKGFDFTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDI 95 + K +++++ G GK+T + + L K+ L Q V Sbjct: 7 YNNWNKKEQYSILILGLDNAGKTTFLET-----LKKEYSLAFKALEKIQPTVG--QNVAT 59 Query: 96 EEKGVKLKLTIVDTPGFGDAVNNTECWKPITDYV 129 K L D G + + + + Sbjct: 60 IPVDSKQILKFWDVGGQESLRSMWSEYYSLCHGI 93 >gi|2127787 [36..400] Periplasmic binding protein-like I Length = 365 Score = 26.6 bits (58), Expect = 0.71 Identities = 9/79 (11%), Positives = 21/79 (26%), Gaps = 9/79 (11%) Query: 172 VGFMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGI---HVYQFPECDSDED 228 K + V +I + D + L+ E++ GI + D Sbjct: 134 GDVAK--QLGLKNVIVIYRKD----AWGDGLERATVEKLKANGINIIDEIPYDPNIGDWS 187 Query: 229 EDFKQQDRELKESAPFAVI 247 + ++ + Sbjct: 188 PIIQTTTNKIAGKGNDTGV 206 >gi|2494396 [4..243] alpha/beta-Hydrolases Length = 240 Score = 26.1 bits (57), Expect = 1.0 Identities = 4/30 (13%), Positives = 10/30 (33%) Query: 49 VAGESGLGKSTLVHSLFLTDLYKDRKLLSA 78 + GES + + K+ ++ Sbjct: 186 IGGESAGSIGLHALMVHESMKPKEECIIHN 215 >gi|137196 [30..266] P-loop containing nucleotide triphosphate hydrolases Length = 237 Score = 25.1 bits (53), Expect = 1.8 Identities = 15/179 (8%), Positives = 39/179 (21%), Gaps = 17/179 (9%) Query: 46 TLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLT 105 L++AG S KS ++ + S + V + + + Sbjct: 26 LLILAGPSKSFKSNFGLTM----------VSSYMRQYPDAVCLFYDSEFGITPAYLRSMG 75 Query: 106 IVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGL-NRKNIQDNRVHCCLYFISPFG 164 + V + E + E+ + + + N+ + Sbjct: 76 VDPERVIHTPVQSLEQLRIDMVNQLDAIERGEKVVVFIDSLGNLASKKETEDALNEKVVS 135 Query: 165 HGLRPVDVGF----MKALHEKVNIVPLIAKADCLVPSE--IRKLKERIREEIDKFGIHV 217 R + + NI + + + + Sbjct: 136 DMTRAKTMKSLFRIVTPYFSTKNIPCIAINHTYETQEMFSKTVMGGGTGPMYSADTVFI 194 >gi|730933 [142..476] Cysteine proteinases Length = 335 Score = 24.9 bits (54), Expect = 2.3 Identities = 11/43 (25%), Positives = 21/43 (48%), Gaps = 4/43 (9%) Query: 207 REEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGS 249 R ++ Y++PE S+E + ++ +LK +A F S Sbjct: 292 RMDV----TDKYKYPEGSSEERQVHQKALDKLKPNASFGATSS 330 >gi|1001767 [11..214] P-loop containing nucleotide triphosphate hydrolases Length = 204 Score = 24.7 bits (52), Expect = 2.5 Identities = 5/18 (27%), Positives = 11/18 (60%) Query: 47 LMVAGESGLGKSTLVHSL 64 L+ ++G+GK+ +L Sbjct: 4 LVAGCDTGVGKTVTTSAL 21 >gi|1487871 [16..342] Periplasmic binding protein-like I Length = 327 Score = 24.6 bits (53), Expect = 2.6 Identities = 10/48 (20%), Positives = 18/48 (36%), Gaps = 1/48 (2%) Query: 200 RKLKERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVI 247 R R+ + ++G V E S DF +++S V+ Sbjct: 147 RTTHGLARQLLPRYGAGV-VGEEFASLGTRDFAPIIESIQKSGADIVL 193 >gi|1495469 [3..289] P-loop containing nucleotide triphosphate hydrolases Length = 287 Score = 24.7 bits (52), Expect = 2.7 Identities = 6/18 (33%), Positives = 10/18 (55%) Query: 47 LMVAGESGLGKSTLVHSL 64 + G+ G+GKST + Sbjct: 5 IAFYGKGGIGKSTTSQNT 22 >gi|2493387 [94..456] Cytochrome P450 Length = 363 Score = 24.7 bits (53), Expect = 2.8 Identities = 17/192 (8%), Positives = 41/192 (20%), Gaps = 16/192 (8%) Query: 65 FLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTIVDTPGFGDAVNNTECWKP 124 + L+ I+ V + + + + P +T C Sbjct: 11 LFNPGFNPNYLIGLAPLIADEVVVF---CEQLRQKARTGTVFQLEPLTLRLTVDTICSVT 67 Query: 125 I---TDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYF-ISPFGHGLRPVDVGFMKALHE 180 + T F + K C P ++ Sbjct: 68 LYVVTPVGRWPFLTPDLEIHSSTTKLRTTPLPQRCNGRSNGPRLELPSTPLRRYLTVRPL 127 Query: 181 KVNIVPLIAKADCLVPSEIRKLKERIREEIDK---FGIHVYQFPECDSDEDEDFKQQDRE 237 + + L+ I + +R E + + ++D + R Sbjct: 128 V------MWYNNRLMNRFIDQEVDRAYREQSGRQSKSVISLALRDYMKEKDGSLEDFKRR 181 Query: 238 LKESAPFAVIGS 249 + + Sbjct: 182 VAPQLRVFLFAG 193 >gi|2688657 [81..435] Phosphofructokinase Length = 355 Score = 24.4 bits (53), Expect = 2.9 Identities = 9/78 (11%), Positives = 22/78 (27%), Gaps = 9/78 (11%) Query: 32 NQVHRKSVKKGFDFTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKH 91 + ++S + +++ E G G+ H D +K Sbjct: 217 RLLEKES-LEEIPHAVILIAE-GAGQKYFDHFPKKKDDS-------GNLLYEDIGLYIKD 267 Query: 92 TVDIEEKGVKLKLTIVDT 109 + K ++ T+ Sbjct: 268 KITEYFKAKNIQFTLKYI 285 >gi|1531539 [70..226] P-loop containing nucleotide triphosphate hydrolases Length = 157 Score = 24.1 bits (51), Expect = 3.9 Identities = 8/21 (38%), Positives = 13/21 (61%) Query: 44 DFTLMVAGESGLGKSTLVHSL 64 + L V G++ GKS+L+H Sbjct: 1 ELRLGVLGDARSGKSSLIHRF 21 >gi|2647961 [96..299] P-loop containing nucleotide triphosphate hydrolases Length = 204 Score = 23.8 bits (50), Expect = 4.7 Identities = 4/27 (14%), Positives = 9/27 (32%), Gaps = 2/27 (7%) Query: 38 SVKKGFDFTLMVAGESGLGKSTLVHSL 64 ++V G G K+ + + Sbjct: 4 LPNNA--PVILVLGAPGSQKNDISRRI 28 >gi|134579 [59..285] P-loop containing nucleotide triphosphate hydrolases Length = 227 Score = 23.6 bits (49), Expect = 4.9 Identities = 11/51 (21%), Positives = 20/51 (38%), Gaps = 3/51 (5%) Query: 14 PEDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTLVHSL 64 +++ + GF + V G +++ G G GKSTL+ Sbjct: 10 DISLEELPRFSTGFKEFDRVLGG-GVVPG--SAILIGGNPGAGKSTLLLQT 57 >gi|1280085 [6..334] beta/alpha (TIM)-barrel Length = 329 Score = 23.3 bits (50), Expect = 5.8 Identities = 13/104 (12%), Positives = 38/104 (36%), Gaps = 17/104 (16%) Query: 114 DAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPVDVG 173 + + + W+ + + G ++N++ + L + G+ Sbjct: 146 TEIAHIDTWRALEKLYKE----------GKLKQNVEKLVANILLTIHA---LGVSNFSCN 192 Query: 174 FMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHV 217 ++AL++ + P + +C + ++ +R K G+ V Sbjct: 193 QLQALYDAAEVKPANQQVECHI----YWPQQELRALCKKLGVTV 232 >gi|809280 [1..209] Glycinamide ribonucleotide transformylase Length = 209 Score = 23.6 bits (51), Expect = 5.9 Identities = 17/92 (18%), Positives = 29/92 (31%), Gaps = 16/92 (17%) Query: 159 FISPFGHGLRP-VDVGFMKALHEKVNIVPLIA-KADCLVPSEIRKLKERIREEIDKFGIH 216 IS G L+ +D + + + + KAD E + GI Sbjct: 6 LISGNGSNLQAIIDACKTNKIK--GTVRAVFSNKADAFG-----------LERARQAGIA 52 Query: 217 VYQFPECDSDEDEDFKQ-QDRELKESAPFAVI 247 + D E + + E+ AP V+ Sbjct: 53 THTLIASAFDSREAYDRELIHEIDMYAPDVVV 84 >gi|1703166 [11..338] beta/alpha (TIM)-barrel Length = 328 Score = 23.3 bits (50), Expect = 6.3 Identities = 9/31 (29%), Positives = 15/31 (48%) Query: 184 IVPLIAKADCLVPSEIRKLKERIREEIDKFG 214 + PL + D ++P K E + E+ KF Sbjct: 18 LFPLAKRNDIILPEGFPKSVEELNEKYKKFR 48 >gi|2624005 [295..479] beta-Prism I Length = 185 Score = 23.2 bits (49), Expect = 6.4 Identities = 14/55 (25%), Positives = 23/55 (41%) Query: 214 GIHVYQFPECDSDEDEDFKQQDRELKESAPFAVIGSNTVVEAKGQRVRGRLYPWG 268 G + Y + S + F+ QD + ++ I SN+ V +YPWG Sbjct: 69 GQNKYSYTNNSSINEGSFRGQDTDYGGTSSTINIPSNSYVYNLWTENYEYIYPWG 123 >gi|1072178 [17..364] Phosphoglycerate mutase-like Length = 348 Score = 23.4 bits (50), Expect = 6.5 Identities = 8/65 (12%), Positives = 19/65 (28%), Gaps = 6/65 (9%) Query: 84 QTVEILKHTVDIEEKGVKLKLTIVDTPGFGDAVNNTECWKPITD----YVDQQFEQYFRD 139 + + + T E I + P +G+A + + + D Y + + Sbjct: 108 KWQPVAQLTD--ESIDWVSLGAIDNCPVYGEAQRKSSEYAEVMDQMEKYDAELLQLVRNH 165 Query: 140 ESGLN 144 Sbjct: 166 ADEPI 170 >gi|1350557 [4..271] P-loop containing nucleotide triphosphate hydrolases Length = 268 Score = 23.1 bits (48), Expect = 6.7 Identities = 12/77 (15%), Positives = 24/77 (30%), Gaps = 2/77 (2%) Query: 15 EDKQDIDKQYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTLVHSLFLTDLYKDRK 74 +D+ G +L + V KG + + G GK+TL + + Sbjct: 32 PMMRDVQAIPTGSISLDIALGVGGVPKG--RIIEIFGPESSGKTTLCLHIVAEAQKRGGI 89 Query: 75 LLSAEERISQTVEILKH 91 + + V + Sbjct: 90 CGYVDAEHALDVGYARK 106 >gi|2650282 [2..154] P-loop containing nucleotide triphosphate hydrolases Length = 153 Score = 23.2 bits (49), Expect = 7.3 Identities = 7/17 (41%), Positives = 10/17 (58%) Query: 46 TLMVAGESGLGKSTLVH 62 + V G+ G GK+TL Sbjct: 21 IVAVVGQYGTGKTTLGL 37 >gi|2635745 [372..520] Aldehyde oxidoreductase, molybdemum cofactor-binding domain Length = 149 Score = 23.0 bits (49), Expect = 8.6 Identities = 11/89 (12%), Positives = 20/89 (22%), Gaps = 11/89 (12%) Query: 196 PSEIRKLKERIREEIDKFGIHVYQFPEC--------DSDEDEDFKQQDRELKES---APF 244 P E+R+ R ++ + S + + L+ A Sbjct: 2 PLELRRKNIRKPHDLGPLEHRIAPTDGAAQVLNAISKSPILKKTSRNCGYLQRGTGAAIT 61 Query: 245 AVIGSNTVVEAKGQRVRGRLYPWGIVEVE 273 G R L G + Sbjct: 62 MHGGGLGFGRMDAAGGRLSLSSEGKITAS 90 >gi|2652959 [169..353] P-loop containing nucleotide triphosphate hydrolases Length = 185 Score = 23.0 bits (48), Expect = 8.7 Identities = 11/94 (11%), Positives = 25/94 (25%) Query: 47 LMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKLKLTI 106 ++ G G GK+ + + + + I + K ++ + + Sbjct: 4 ILFMGGPGGGKTRHAARVADSLADNGLVHICMPDIIRTALGKYKDKYPEWKEANEHYIRG 63 Query: 107 VDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDE 140 P D + E Y R+ Sbjct: 64 ELIPNQLALTLLKAEMGRHPDAMGFFLEGYPREA 97 >gi|2190590 [34..336] Periplasmic binding protein-like I Length = 303 Score = 22.7 bits (48), Expect = 9.6 Identities = 8/77 (10%), Positives = 18/77 (22%), Gaps = 8/77 (10%) Query: 204 ERIREEIDKFGIHVYQFPECDSDEDEDFKQQDRELKESAPFAVI---GSNTVVEAKGQRV 260 ++ V + D +F+ E+ P V ++ Sbjct: 153 SGFKKSFTAGKGEV-VKDITIAFPDVEFQSALAEIASLKPDCVYAFFSGGGALKFIKDYA 211 Query: 261 R----GRLYPWGIVEVE 273 L+ G + Sbjct: 212 AANLGIPLWGPGFLTDG 228 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 32 Number of calls to ALIGN: 32 Length of query: 369 Total length of test sequences: 256703 Effective length of test sequences: 208388.0 Effective search space size: 68375022.4 Initial X dropoff for ALIGN: 25.0 bits