analysis of sequence from tem21_2homol ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYPNTVPRLIEPRTEAVKEFLND ENRGLDILVEYLSFAQCAVMFGFEGSENGEEDFLDKSKSWSRSIEDLHRPTSQPFYNTLVRSARQSVLRY GSVSSSRTIKNSRLVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNNKNSRTKALVLE LLAAVCLVRGGHEIILSAFDNFKESSKHRNKTSTVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEK HRFERLMDYFRSEEGNIDFMVACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQ IQAYLDNVFDVGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETMMKVADLEKLLLHKDKELQVI RVSLQLSMLLRRETYESTSSQVNTLRKVIKEKDAAFQRHFNIERRLLELEQQGTIRLHKKPDGDIAIEPC VVGGSSNGGVILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSASSPNIPAPPP PPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTALKPNQINGTVFNEIDDERELELERFEELFK TRAQGPIMDLSCTKSKVAQKAVNKVTILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEV KALRQYERERRPLDQLAEEDRFMLLFSKIERLTQRMNIITFIGNFSDNVAMLTPQLNAIIAASASVKSSP KLKRMLEIILALGNYMNSSKRGCVYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELH FVDKAAAVSLENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTAEEAFNNVVNY FGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQAMREKLLAEEAKQQDPKVQAQKKRQQQHELI AELRKRQAKDHRPVYEGKDGTIEDIITVLKSVPFTARTAKRGSRFFCEANLCDDANC ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > gi|3170548|gb|AAC34395.1| . . . . . 1 MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYP 50 ______HHHHHHHH________HHHHHHHHHHHH_________EEEE___ . . . . . 51 NTVPRLIEPRTEAVKEFLNDENRGLDILVEYLSFAQCAVMFGFEGSENGE 100 ____EEEHHHHHHHHHHHH_____HHHHHHHHHH_HHHHHHH________ . . . . . 101 EDFLDKSKSWSRSIEDLHRPTSQPFYNTLVRSARQSVLRYGSVSSSRTIK 150 ____________HHHHH_________HHHHHHHHHHHH____________ . . . . . 151 NSRLVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNNK 200 _EEEEE____EEEEEHHHHHHHHHHHH___HHHHHHHHHHHHHHHH____ . . . . . 201 NSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRNKTSTVHQGKK 250 __HHHHHHHHHHHHHHHH____EEEEEE______________EEEE____ . . . . . 251 RPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFM 300 _____EEEEEEEE__EEE______HHHHHHHHHHHHHHHHH_____HHHH . . . . . 301 VACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQ 350 HHHHHHHHHHEEE___HHHHHHHHHHHHH________________HHHHH . . . . . 351 IQAYLDNVFDVGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETM 400 HHHHHHH________HHHHH_HHHHHHHHHHHHH_HHHHHHHHHHHHHHH . . . . . 401 MKVADLEKLLLHKDKELQVIRVSLQLSMLLRRETYESTSSQVNTLRKVIK 450 HHHHHHHHHHHHHH_HHHHHHHHHHHHHHHHHHH_______HHHHHHHHH . . . . . 451 EKDAAFQRHFNIERRLLELEQQGTIRLHKKPDGDIAIEPCVVGGSSNGGV 500 HHHHHHHHHHHHHHHHHHHHH_____________EEEE_EEE_______E . . . . . 501 ILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSA 550 EEE___________EEEEE_______________________________ . . . . . 551 SSPNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTA 600 ________________________HHHHHHHHH_____________EEE_ . . . . . 601 LKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLSCTKSKVAQK 650 ________EEEEE___HHHHHHHHHHHHHHHHH_____EEE__HHHHHHH . . . . . 651 AVNKVTILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEV 700 HH__EEEE___HHHHHHHHHHHHHH_________HHHHHHHH_____HHH . . . . . 701 KALRQYERERRPLDQLAEEDRFMLLFSKIERLTQRMNIITFIGNFSDNVA 750 HHHHHHHHHHH___HHHHHHHHHHHHHHHHHHHHHHHHEEEEE_______ . . . . . 751 MLTPQLNAIIAASASVKSSPKLKRMLEIILALGNYMNSSKRGCVYGFKLQ 800 _____HHHHHHHHHH____HHHHHHHHHHHHH____________EEEE_H . . . . . 801 SLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELHFVDKAAAVSL 850 HHHHHHH_____HHHHHHHHHHHHHHH_____HHHHHHHH_HHHHHHHHH . . . . . 851 ENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTA 900 HHHHHHHH__________________HHHHHHHHHH___HHHHHHHHHHH . . . . . 901 EEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQA 950 HHHH___EEE_____________HHHHHHHHHHHHHHHHHHHHHHHHHHH . . . . . 951 MREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAKDHRPVYEGKDG 1000 HHHHHHHHHHHH____HHHHHHHHHHHHHHHHHHHHHHHH__________ . . . 1001 TIEDIITVLKSVPFTARTAKRGSRFFCEANLCDDANC 1037 __HHHHHHH_______________EEEE_________ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 61.1 % beta-contents : 11.5 % coil-contents : 27.4 % class : mixed method : 2 alpha-contents : 41.3 % beta-contents : 5.5 % coil-contents : 53.3 % class : alpha ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -16.30 -2.95 -4.02 -2.71 -4.00 0.00 -24.00 0.00 -2.10 -7.86 -2.26 -12.00 0.00 0.00 0.00 0.00 -78.19 -15.66 -2.56 -1.56 -0.45 -4.00 0.00 -16.00 0.00 -2.34 -7.86 -2.26 -12.00 -12.00 0.00 0.00 0.00 -76.69 ID: gi|3170548|gb|AAC34395.1| AC: xxx Len: 980 1:I 967 Sc: -76.69 Pv: 7.104651e-01 NO_GPI_SITE GPI: learning from protozoa -22.56 -2.09 -0.76 -0.08 -4.00 0.00 -20.00 0.00 -1.05 -6.75 -8.41 -12.00 -12.00 0.00 0.00 0.00 -89.71 -23.18 -3.43 -1.04 -2.00 -4.00 0.00 -24.00 0.00 -0.91 -6.75 -8.41 -12.00 0.00 0.00 0.00 0.00 -85.72 ID: gi|3170548|gb|AAC34395.1| AC: xxx Len: 980 1:I 969 Sc: -85.72 Pv: 6.151677e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|3170548| 0.513 190 Y 0.438 221 Y 0.885 213 Y 0.128 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|3170548| 0.556 190 Y 0.209 938 N 0.924 208 Y 0.143 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|3170548| 0.583 190 Y 0.358 190 Y 0.967 211 Y 0.236 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] 1-205 MNLPPDKARLLRQYDNEKKWDLICDQAFLS AATAAYPSQNCGAILRAFYPNTVPRLIEPR TEAVKEFLNDENRGLDILVEYLSFAQCAVM FGFEGSENGEEDFLDKSKSWSRSIEDLHRP TSQPFYNTLVRSARQSVLRYGSVSSSRTIK NSRLVSQKDDVHVCIMCLRAIMNYQVLYGF NMVMSHAHAVNEIALSLNNKNSRTK alvlellaavclv 206-218 219-404 RGGHEIILSAFDNFKESSKHRNKTSTVHQG KKRPSQTELINPVVKTRRYFCSPENEAISK EKHRFERLMDYFRSEEGNIDFMVACMQFIN IVVHSVEDMNFRVHLQFEFTKLGLDDYLEK SKHTESDKLSVQIQAYLDNVFDVGGLLEDA ETKNAALEKVEELEDHLSHVTEKLLEVENE TMMKVA dleklllhkdkel 405-417 418-525 QVIRVSLQLSMLLRRETYESTSSQVNTLRK VIKEKDAAFQRHFNIERRLLELEQQGTIRL HKKPDGDIAIEPCVVGGSSNGGVILQGEIG QLSLGSTVGLTVPGGPDT slppaseappppppppppppplpsasspni 526-574 papppppplapplpdasps 575-616 VILSVAIRIKKPIKTKFRLPVFNWTALKPN QINGTVFNEIDD erelelerfeelf 617-629 630-757 KTRAQGPIMDLSCTKSKVAQKAVNKVTILD ANRSKNLAITLRKANKTFDLKTLPVDFVEC LMRFLPTEMEVKALRQYERERRPLDQLAEE DRFMLLFSKIERLTQRMNIITFIGNFSDNV AMLTPQLN aiiaasasvkss 758-769 770-797 PKLKRMLEIILALGNYMNSSKRGCVYGF klqsldllldtkst 798-811 812-918 DRKMTLLHYIALIVKEKYPELANFYNELHF VDKAAAVSLENVLLDVRELGKGMDLIRREC SLHDHSVLKGFLQASDTQLDKVQRDAKTAE EAFNNVVNYFGESAKTA ppsvffpvfv 919-928 929-1037 RFLKAYKDAVEENELRKKQEQAMREKLLAE EAKQQDPKVQAQKKRQQQHELIAELRKRQA KDHRPVYEGKDGTIEDIITVLKSVPFTART AKRGSRFFCEANLCDDANC low complexity regions: SEG 25 3.0 3.3 >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] 1-185 MNLPPDKARLLRQYDNEKKWDLICDQAFLS AATAAYPSQNCGAILRAFYPNTVPRLIEPR TEAVKEFLNDENRGLDILVEYLSFAQCAVM FGFEGSENGEEDFLDKSKSWSRSIEDLHRP TSQPFYNTLVRSARQSVLRYGSVSSSRTIK NSRLVSQKDDVHVCIMCLRAIMNYQVLYGF NMVMS hahavneialslnnknsrtkalvlellaav 186-219 clvr 220-363 GGHEIILSAFDNFKESSKHRNKTSTVHQGK KRPSQTELINPVVKTRRYFCSPENEAISKE KHRFERLMDYFRSEEGNIDFMVACMQFINI VVHSVEDMNFRVHLQFEFTKLGLDDYLEKS KHTESDKLSVQIQAYLDNVFDVGG lledaetknaalekveeledhlshvtekll 364-431 evenetmmkvadleklllhkdkelqvirvs lqlsmllr 432-490 RETYESTSSQVNTLRKVIKEKDAAFQRHFN IERRLLELEQQGTIRLHKKPDGDIAIEPC vvggssnggvilqgeigqlslgstvgltvp 491-581 ggpdtslppaseappppppppppppplpsa sspnipapppppplapplpdaspsvilsva i 582-938 RIKKPIKTKFRLPVFNWTALKPNQINGTVF NEIDDERELELERFEELFKTRAQGPIMDLS CTKSKVAQKAVNKVTILDANRSKNLAITLR KANKTFDLKTLPVDFVECLMRFLPTEMEVK ALRQYERERRPLDQLAEEDRFMLLFSKIER LTQRMNIITFIGNFSDNVAMLTPQLNAIIA ASASVKSSPKLKRMLEIILALGNYMNSSKR GCVYGFKLQSLDLLLDTKSTDRKMTLLHYI ALIVKEKYPELANFYNELHFVDKAAAVSLE NVLLDVRELGKGMDLIRRECSLHDHSVLKG FLQASDTQLDKVQRDAKTAEEAFNNVVNYF GESAKTAPPSVFFPVFVRFLKAYKDAV eenelrkkqeqamrekllaeeakqqdpkvq 939-989 aqkkrqqqheliaelrkrqak 990-1037 DHRPVYEGKDGTIEDIITVLKSVPFTARTA KRGSRFFCEANLCDDANC low complexity regions: SEG 45 3.4 3.75 >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] 1-322 MNLPPDKARLLRQYDNEKKWDLICDQAFLS AATAAYPSQNCGAILRAFYPNTVPRLIEPR TEAVKEFLNDENRGLDILVEYLSFAQCAVM FGFEGSENGEEDFLDKSKSWSRSIEDLHRP TSQPFYNTLVRSARQSVLRYGSVSSSRTIK NSRLVSQKDDVHVCIMCLRAIMNYQVLYGF NMVMSHAHAVNEIALSLNNKNSRTKALVLE LLAAVCLVRGGHEIILSAFDNFKESSKHRN KTSTVHQGKKRPSQTELINPVVKTRRYFCS PENEAISKEKHRFERLMDYFRSEEGNIDFM VACMQFINIVVHSVEDMNFRVH lqfeftklglddylekskhtesdklsvqiq 323-454 ayldnvfdvgglledaetknaalekveele dhlshvtekllevenetmmkvadleklllh kdkelqvirvslqlsmllrretyestssqv ntlrkvikekda 455-490 AFQRHFNIERRLLELEQQGTIRLHKKPDGD IAIEPC vvggssnggvilqgeigqlslgstvgltvp 491-581 ggpdtslppaseappppppppppppplpsa sspnipapppppplapplpdaspsvilsva i 582-938 RIKKPIKTKFRLPVFNWTALKPNQINGTVF NEIDDERELELERFEELFKTRAQGPIMDLS CTKSKVAQKAVNKVTILDANRSKNLAITLR KANKTFDLKTLPVDFVECLMRFLPTEMEVK ALRQYERERRPLDQLAEEDRFMLLFSKIER LTQRMNIITFIGNFSDNVAMLTPQLNAIIA ASASVKSSPKLKRMLEIILALGNYMNSSKR GCVYGFKLQSLDLLLDTKSTDRKMTLLHYI ALIVKEKYPELANFYNELHFVDKAAAVSLE NVLLDVRELGKGMDLIRRECSLHDHSVLKG FLQASDTQLDKVQRDAKTAEEAFNNVVNYF GESAKTAPPSVFFPVFVRFLKAYKDAV eenelrkkqeqamrekllaeeakqqdpkvq 939-989 aqkkrqqqheliaelrkrqak 990-1037 DHRPVYEGKDGTIEDIITVLKSVPFTARTA KRGSRFFCEANLCDDANC low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYPNTVPRLIEPR TEAVKEFLNDENRGLDILVEYLSFAQCAVMFGFEGSENGEEDFLDKSKSWSRSIEDLHRP TSQPFYNTLVRSARQSVLRYGSVSSSRTIKNSRLVSQKDDVHVCIMCLRAIMNYQVLYGF NMVMSHAHAVNEIALSLNNKNSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRN KTSTVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFM VACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQIQAYLDNVFD VGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETMMKVADLEKLLLHKDKELQVI RVSLQLSMLLRRETYESTSSQVNTLRKVIKEKDAAFQRHFNIERRLLELEQQGTIRLHKK PDGDIAIEPCVVGGSSNGGVILQGEIGQLSLGSTVGLTVPGGPDTSLppaseappppppp pppppplpsasspnipapppppplapplpdaspsvilsvairikkpikTKFRLPVFNWTA LKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLSCTKSKVAQKAVNKVTILDA NRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEVKALRQYERERRPLDQLAEED RFMLLFSKIERLTQRMNIITFIGNFSDNVAMLTPQLNAIIAASASVKSSPKLKRMLEIIL ALGNYMNSSKRGCVYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELH FVDKAAAVSLENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTA EEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQAMREKLLAEEA KQQDPKVQAqkkrqqqhELIAELRKRQAKDHRPVYEGKDGTIEDIITVLKSVPFTARTAK RGSRFFCEANLCDDANC 1 - 527 MNLPPDKARL LRQYDNEKKW DLICDQAFLS AATAAYPSQN CGAILRAFYP NTVPRLIEPR TEAVKEFLND ENRGLDILVE YLSFAQCAVM FGFEGSENGE EDFLDKSKSW SRSIEDLHRP TSQPFYNTLV RSARQSVLRY GSVSSSRTIK NSRLVSQKDD VHVCIMCLRA IMNYQVLYGF NMVMSHAHAV NEIALSLNNK NSRTKALVLE LLAAVCLVRG GHEIILSAFD NFKESSKHRN KTSTVHQGKK RPSQTELINP VVKTRRYFCS PENEAISKEK HRFERLMDYF RSEEGNIDFM VACMQFINIV VHSVEDMNFR VHLQFEFTKL GLDDYLEKSK HTESDKLSVQ IQAYLDNVFD VGGLLEDAET KNAALEKVEE LEDHLSHVTE KLLEVENETM MKVADLEKLL LHKDKELQVI RVSLQLSMLL RRETYESTSS QVNTLRKVIK EKDAAFQRHF NIERRLLELE QQGTIRLHKK PDGDIAIEPC VVGGSSNGGV ILQGEIGQLS LGSTVGLTVP GGPDTSL 528 - 588 ppa seappppppp pppppplpsa sspnipappp ppplapplpd aspsvilsva irikkpi k 589 - 969 TK FRLPVFNWTA LKPNQINGTV FNEIDDEREL ELERFEELFK TRAQGPIMDL SCTKSKVA QK AVNKVTILDA NRSKNLAITL RKANKTFDLK TLPVDFVECL MRFLPTEMEV KALRQYER ER RPLDQLAEED RFMLLFSKIE RLTQRMNIIT FIGNFSDNVA MLTPQLNAII AASASVKS SP KLKRMLEIIL ALGNYMNSSK RGCVYGFKLQ SLDLLLDTKS TDRKMTLLHY IALIVKEK YP ELANFYNELH FVDKAAAVSL ENVLLDVREL GKGMDLIRRE CSLHDHSVLK GFLQASDT QL DKVQRDAKTA EEAFNNVVNY FGESAKTAPP SVFFPVFVRF LKAYKDAVEE NELRKKQE QA MREKLLAEEA KQQDPKVQA 970 - 977 q kkrqqqh 978 - 1037 ELI AELRKRQAKD HRPVYEGKDG TIEDIITVLK SVPFTARTAK RGSRFFCEAN LCDDANC low complexity regions: DUST >gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYPNTVPRLIEPR TEAVKEFLNDENRGLDILVEYLSFAQCAVMFGFEGSENGEEDFLDKSKSWSRSIEDLHRP TSQPFYNTLVRSARQSVLRYGSVSSSRTIKNSRLVSQKDDVHVCIMCLRAIMNYQVLYGF NMVMSHAHAVNEIALSLNNKNSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRN KTSTVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFM VACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQIQAYLDNVFD VGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETMMKVADLEKLLLHKDKELQVI RVSLQLSMLLRRETYESTSSQVNTLRKVIKEKDAAFQRHFNIERRLLELEQQGTIRLHKK PDGDIAIEPCVVGGSSNGGVILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEANNNNNNN NNNNNNLPSASSPNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTA LKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLSCTKSKVAQKAVNKVTILDA NRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEVKALRQYERERRPLDQLAEED RFMLLFSKIERLTQRMNIITFIGNFSDNVAMLTPQLNAIIAASASVKSSPKLKRMLEIIL ALGNYMNSSKRGCVYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELH FVDKAAAVSLENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTA EEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQAMREKLLAEEA KQQDPKVQAQKKRQQQHELIAELRKRQAKDHRPVYEGKDGTIEDIITVLKSVPFTARTAK RGSRFFCEANLCDDANC ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for gi|3170548|gb|AAC34395.1| sequence: 980 amino acids, 94 residue(s) in coiled coil state Coil 1 * 365* 399* EDAETKNAALEKVEELEDHLSHVTEKLLEVENETM Coil 2 * 882* 909* LQASDTQLDKVQRDAKTAEEAFNNVVNY Coil 3 * 930* 963* LKAYKDAVEENELRKKQEQAMREKLLAEEAKQQD . | . | . | . | . | . 60 MNLPPDKARL LRQYDNEKKW DLICDQAFLS AATAAYPSQN CGAILRAFYP NTVPRLIEPR ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~1 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~44444 444444444~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~1 * 14 M'95 -w local . | . | . | . | . | . 120 TEAVKEFLND ENRGLDILVE YLSFAQCAVM FGFEGSENGE EDFLDKSKSW SRSIEDLHRP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register 1333333333 3333333333 33~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 1222222222 22222~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 TSQPFYNTLV RSARQSVLRY GSVSSSRTIK NSRLVSQKDD VHVCIMCLRA IMNYQVLYGF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 NMVMSHAHAV NEIALSLNNK NSRTKALVLE LLAAVCLVRG GHEIILSAFD NFKESSKHRN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 KTSTVHQGKK RPSQTELINP VVKTRRYFCS PENEAISKEK HRFERLMDYF RSEEGNIDFM ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~444444444 4444411~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 VACMQFINIV VHSVEDMNFR VHLQFEFTKL GLDDYLEKSK HTESDKLSVQ IQAYLDNVFD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~2 * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------g * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~11111 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~111111111 1111111111 11~~~22223 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~2 2222222222 * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~111111 11111111~~ * 14 M'95 -w local . | . | . | . | . | . 420 VGGLLEDAET KNAALEKVEE LEDHLSHVTE KLLEVENETM MKVADLEKLL LHKDKELQVI 2222255577 9999999999 9999999999 9999999984 2222222222 11111111~~ * 21 M'95 -w border abcabcdefg abcdefgabc defgabcdef gabcdefgab fgabcdefga bcdefgab-- * 21 M'95 -w register 1111122677 9999999999 9999999999 9999999931 1111222222 2222222222 * 21 M'95 +w polar 3333344455 8999999999 9999999999 9999999975 5555555555 2222222211 * 21 MTK -w class 2225555999 9999999999 9999999999 9999999999 9642222111 1111111~~~ * 28 M'95 -w signif. ~~79999999 9999999999 9999987777 7733333322 2222222222 11111111~~ * 14 M'95 -w local . | . | . | . | . | . 480 RVSLQLSMLL RRETYESTSS QVNTLRKVIK EKDAAFQRHF NIERRLLELE QQGTIRLHKK ~~~~~~~~~~ ~222222222 2222222223 3333333333 3333333333 32~~~~~~~~ * 21 M'95 -w border ---------- -efgabcdef gabcdefgaf gabcdefgab cdefgabcde fg-------- * 21 M'95 -w register 22222~~~~~ ~~~~~~~~~~ ~~~~~~~~~2 2222222222 2222222222 22~~~~~~~~ * 21 M'95 +w polar 1~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~2 2222222222 2222222222 21~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~111111111 1155888888 8888888888 8888888888 88~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~1122 2222222222 2222~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 PDGDIAIEPC VVGGSSNGGV ILQGEIGQLS LGSTVGLTVP GGPDTSLPPA SEAPPPPPPP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 PPPPPPLPSA SSPNIPAPPP PPPLAPPLPD ASPSVILSVA IRIKKPIKTK FRLPVFNWTA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 LKPNQINGTV FNEIDDEREL ELERFEELFK TRAQGPIMDL SCTKSKVAQK AVNKVTILDA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~111111 1111111111 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 NRSKNLAITL RKANKTFDLK TLPVDFVECL MRFLPTEMEV KALRQYERER RPLDQLAEED ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~1111 * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ------cdef * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~2222 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~1111 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~1111 1111111111 ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 RFMLLFSKIE RLTQRMNIIT FIGNFSDNVA MLTPQLNAII AASASVKSSP KLKRMLEIIL 1111111111 1111111~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border gabcdefgab cdefgab--- ---------- ---------- ---------- ---------- * 21 M'95 -w register 2222222222 222222211~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar 1111111111 1111111~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~1112222 2222222222 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 ALGNYMNSSK RGCVYGFKLQ SLDLLLDTKS TDRKMTLLHY IALIVKEKYP ELANFYNELH ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 FVDKAAAVSL ENVLLDVREL GKGMDLIRRE CSLHDHSVLK GFLQASDTQL DKVQRDAKTA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~188888888 8888888888 * 21 M'95 -w border ---------- ---------- ---------- ---------- -gabcdefga bcdefgabcd * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~55555555 5555555555 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~199999999 9999999999 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~22 2222222222 2222222222 * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~11112899 9999999999 * 14 M'95 -w local . | . | . | . | . | . 960 EEAFNNVVNY FGESAKTAPP SVFFPVFVRF LKAYKDAVEE NELRKKQEQA MREKLLAEEA 888888888~ ~~~~~~~~~~ ~~~~~~~~~~ 6666888899 9999999999 9999999999 * 21 M'95 -w border efgabcdef- ---------- ---------- abcdefgade fgabcdefga bcdefgabcd * 21 M'95 -w register 555111111~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~16 6666666666 6666666666 * 21 M'95 +w polar 9998888881 ~~~~~~~~~~ ~~~~~~~~~~ 3333333366 7777777777 7777777777 * 21 MTK -w class 2222221111 ~~~~~~~~~~ ~~~~~~~133 9999999999 9999999999 9999999988 * 28 M'95 -w signif. 9993333~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2222227799 9999999999 9998533322 * 14 M'95 -w local . | . | KQQDPKVQAQ KKRQQQHELI 9994~~~~~~ ~~~~~~~~~~ efgf------ ---------- 554~~~~~~~ ~~~~~~~~~~ 7763~~~~~~ ~~~~~~~~~~ 8888666421 111111~~~~ 211~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem21_2homol.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem21_2homol.___inter___ (1 sequences) MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYP NTVPRLIEPRTEAVKEFLNDENRGLDILVEYLSFAQCAVMFGFEGSENGE EDFLDKSKSWSRSIEDLHRPTSQPFYNTLVRSARQSVLRYGSVSSSRTIK NSRLVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNNK NSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRNKTSTVHQGKK RPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFM VACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQ IQAYLDNVFDVGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETM MKVADLEKLLLHKDKELQVIRVSLQLSMLLRRETYESTSSQVNTLRKVIK EKDAAFQRHFNIERRLLELEQQGTIRLHKKPDGDIAIEPCVVGGSSNGGV ILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSA SSPNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTA LKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLSCTKSKVAQK AVNKVTILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEV KALRQYERERRPLDQLAEEDRFMLLFSKIERLTQRMNIITFIGNFSDNVA MLTPQLNAIIAASASVKSSPKLKRMLEIILALGNYMNSSKRGCVYGFKLQ SLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELHFVDKAAAVSL ENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTA EEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQA MREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAKDHRPVYEGKDG TIEDIITVLKSVPFTARTAKRGSRFFCEANLCDDANC (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 76 96 0.833 Putative 2 295 315 0.852 Putative 3 504 524 0.927 Putative 4 748 768 0.760 Putative ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 Loop length 75 198 188 223 269 K+R profile + + + + + CYT-EXT prof -0.73 -0.31 -0.31 0.64 0.08 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.12 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -2.07 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 4 Loop length 294 188 223 269 K+R profile + + + + CYT-EXT prof 0.41 0.08 -0.31 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.21 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 1.10 -> Orientation: N-out ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 4 Loop length 75 407 223 269 K+R profile + + + + CYT-EXT prof -0.73 0.08 0.31 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.19 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 4 Loop length 503 223 269 K+R profile + + + CYT-EXT prof 0.20 -0.31 0.08 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.33 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): 0.0504 NEG: 73.0000 POS: 66.0000 -> Orientation: undecided CYT-EXT difference: -0.19 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 4 Loop length 75 198 432 269 K+R profile + + + + CYT-EXT prof -0.73 -0.02 0.64 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.15 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.09 -> Orientation: N-in ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 2 4 Loop length 294 432 269 K+R profile + + + CYT-EXT prof 0.41 -0.31 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.25 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.12 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 4 Loop length 75 651 269 K+R profile + + + CYT-EXT prof -0.73 -0.31 0.35 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.23 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 4 Loop length 747 269 K+R profile + + CYT-EXT prof 0.25 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.40 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): -2.00 (NEG-POS)/(NEG+POS): 0.0152 NEG: 100.0000 POS: 97.0000 -> Orientation: N-out CYT-EXT difference: 0.56 -> Orientation: N-out ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 75 198 188 513 K+R profile + + + + CYT-EXT prof -0.73 -0.31 0.64 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.30 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.66 -> Orientation: N-in ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 2 3 Loop length 294 188 513 K+R profile + + + CYT-EXT prof 0.41 -0.02 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.52 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.69 -> Orientation: N-out ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 3 Loop length 75 407 513 K+R profile + + + CYT-EXT prof -0.73 -0.02 0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.48 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.06 -> Orientation: N-in ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 3 Loop length 503 513 K+R profile + + CYT-EXT prof 0.20 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.82 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): 0.0504 NEG: 73.0000 POS: 66.0000 -> Orientation: undecided CYT-EXT difference: 0.22 -> Orientation: N-out ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 Loop length 75 198 722 K+R profile + + + CYT-EXT prof -0.73 -0.07 0.64 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.37 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.44 -> Orientation: N-in ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 2 Loop length 294 722 K+R profile + + CYT-EXT prof 0.41 -0.07 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.63 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.48 -> Orientation: N-out ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 Loop length 75 941 K+R profile + + CYT-EXT prof -0.73 0.20 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.58 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -0.94 -> Orientation: N-in ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment Loop length 1037 K+R profile + CYT-EXT prof 0.14 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0142 NEG: 139.0000 POS: 143.0000 -> Orientation: N-in CYT-EXT difference: 0.14 -> Orientation: N-out ---------------------------------------------------------------------- "tem21_2homol" 1037 76 96 #f 0.833333 295 315 #f 0.852083 504 524 #f 0.927083 748 768 #f 0.760417 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem21_2homol.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem21_2homol.___inter___ (1 sequences) MNLPPDKARLLRQYDNEKKWDLICDQAFLSAATAAYPSQNCGAILRAFYP NTVPRLIEPRTEAVKEFLNDENRGLDILVEYLSFAQCAVMFGFEGSENGE EDFLDKSKSWSRSIEDLHRPTSQPFYNTLVRSARQSVLRYGSVSSSRTIK NSRLVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNNK NSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRNKTSTVHQGKK RPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFM VACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQ IQAYLDNVFDVGGLLEDAETKNAALEKVEELEDHLSHVTEKLLEVENETM MKVADLEKLLLHKDKELQVIRVSLQLSMLLRRETYESTSSQVNTLRKVIK EKDAAFQRHFNIERRLLELEQQGTIRLHKKPDGDIAIEPCVVGGSSNGGV ILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSA SSPNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTA LKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLSCTKSKVAQK AVNKVTILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEV KALRQYERERRPLDQLAEEDRFMLLFSKIERLTQRMNIITFIGNFSDNVA MLTPQLNAIIAASASVKSSPKLKRMLEIILALGNYMNSSKRGCVYGFKLQ SLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELHFVDKAAAVSL ENVLLDVRELGKGMDLIRRECSLHDHSVLKGFLQASDTQLDKVQRDAKTA EEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEENELRKKQEQA MREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAKDHRPVYEGKDG TIEDIITVLKSVPFTARTAKRGSRFFCEANLCDDANC (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 76 96 0.833 Putative 2 295 315 0.852 Putative 3 504 524 0.927 Putative 4 748 768 0.760 Putative ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 Loop length 75 198 188 223 269 K+R profile + + + + + CYT-EXT prof -0.73 -0.31 -0.31 0.64 0.08 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.12 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -2.07 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 4 Loop length 294 188 223 269 K+R profile + + + + CYT-EXT prof 0.41 0.08 -0.31 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.21 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 1.10 -> Orientation: N-out ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 4 Loop length 75 407 223 269 K+R profile + + + + CYT-EXT prof -0.73 0.08 0.31 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.19 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 4 Loop length 503 223 269 K+R profile + + + CYT-EXT prof 0.20 -0.31 0.08 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.33 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): 0.0504 NEG: 73.0000 POS: 66.0000 -> Orientation: undecided CYT-EXT difference: -0.19 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 4 Loop length 75 198 432 269 K+R profile + + + + CYT-EXT prof -0.73 -0.02 0.64 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.15 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.09 -> Orientation: N-in ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 2 4 Loop length 294 432 269 K+R profile + + + CYT-EXT prof 0.41 -0.31 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.25 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.12 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 4 Loop length 75 651 269 K+R profile + + + CYT-EXT prof -0.73 -0.31 0.35 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.23 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 4 Loop length 747 269 K+R profile + + CYT-EXT prof 0.25 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.40 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): -2.00 (NEG-POS)/(NEG+POS): 0.0152 NEG: 100.0000 POS: 97.0000 -> Orientation: N-out CYT-EXT difference: 0.56 -> Orientation: N-out ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 75 198 188 513 K+R profile + + + + CYT-EXT prof -0.73 -0.31 0.64 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.30 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.66 -> Orientation: N-in ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 2 3 Loop length 294 188 513 K+R profile + + + CYT-EXT prof 0.41 -0.02 -0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.52 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.69 -> Orientation: N-out ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 3 Loop length 75 407 513 K+R profile + + + CYT-EXT prof -0.73 -0.02 0.31 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.48 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.06 -> Orientation: N-in ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 3 Loop length 503 513 K+R profile + + CYT-EXT prof 0.20 -0.02 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.82 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): 0.0504 NEG: 73.0000 POS: 66.0000 -> Orientation: undecided CYT-EXT difference: 0.22 -> Orientation: N-out ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 Loop length 75 198 722 K+R profile + + + CYT-EXT prof -0.73 -0.07 0.64 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.37 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -1.44 -> Orientation: N-in ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 2 Loop length 294 722 K+R profile + + CYT-EXT prof 0.41 -0.07 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.63 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0649 NEG: 36.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.48 -> Orientation: N-out ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 Loop length 75 941 K+R profile + + CYT-EXT prof -0.73 0.20 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.58 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): 0.0476 NEG: 11.0000 POS: 10.0000 -> Orientation: N-in CYT-EXT difference: -0.94 -> Orientation: N-in ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment Loop length 1037 K+R profile + CYT-EXT prof 0.14 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 1.00 (NEG-POS)/(NEG+POS): -0.0142 NEG: 139.0000 POS: 143.0000 -> Orientation: N-in CYT-EXT difference: 0.14 -> Orientation: N-out ---------------------------------------------------------------------- "tem21_2homol" 1037 76 96 #f 0.833333 295 315 #f 0.852083 504 524 #f 0.927083 748 768 #f 0.760417 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem21_2homol.___saps___ SAPS. Version of April 11, 1996. Date run: Fri Jan 19 10:39:29 2001 File: /people/maria/tem21_2homol.___saps___ ID gi|3170548|gb|AAC34395.1| DE unknown [Takifugu rubripes] number of residues: 1037; molecular weight: 117.5 kdal 1 MNLPPDKARL LRQYDNEKKW DLICDQAFLS AATAAYPSQN CGAILRAFYP NTVPRLIEPR 61 TEAVKEFLND ENRGLDILVE YLSFAQCAVM FGFEGSENGE EDFLDKSKSW SRSIEDLHRP 121 TSQPFYNTLV RSARQSVLRY GSVSSSRTIK NSRLVSQKDD VHVCIMCLRA IMNYQVLYGF 181 NMVMSHAHAV NEIALSLNNK NSRTKALVLE LLAAVCLVRG GHEIILSAFD NFKESSKHRN 241 KTSTVHQGKK RPSQTELINP VVKTRRYFCS PENEAISKEK HRFERLMDYF RSEEGNIDFM 301 VACMQFINIV VHSVEDMNFR VHLQFEFTKL GLDDYLEKSK HTESDKLSVQ IQAYLDNVFD 361 VGGLLEDAET KNAALEKVEE LEDHLSHVTE KLLEVENETM MKVADLEKLL LHKDKELQVI 421 RVSLQLSMLL RRETYESTSS QVNTLRKVIK EKDAAFQRHF NIERRLLELE QQGTIRLHKK 481 PDGDIAIEPC VVGGSSNGGV ILQGEIGQLS LGSTVGLTVP GGPDTSLPPA SEAPPPPPPP 541 PPPPPPLPSA SSPNIPAPPP PPPLAPPLPD ASPSVILSVA IRIKKPIKTK FRLPVFNWTA 601 LKPNQINGTV FNEIDDEREL ELERFEELFK TRAQGPIMDL SCTKSKVAQK AVNKVTILDA 661 NRSKNLAITL RKANKTFDLK TLPVDFVECL MRFLPTEMEV KALRQYERER RPLDQLAEED 721 RFMLLFSKIE RLTQRMNIIT FIGNFSDNVA MLTPQLNAII AASASVKSSP KLKRMLEIIL 781 ALGNYMNSSK RGCVYGFKLQ SLDLLLDTKS TDRKMTLLHY IALIVKEKYP ELANFYNELH 841 FVDKAAAVSL ENVLLDVREL GKGMDLIRRE CSLHDHSVLK GFLQASDTQL DKVQRDAKTA 901 EEAFNNVVNY FGESAKTAPP SVFFPVFVRF LKAYKDAVEE NELRKKQEQA MREKLLAEEA 961 KQQDPKVQAQ KKRQQQHELI AELRKRQAKD HRPVYEGKDG TIEDIITVLK SVPFTARTAK 1021 RGSRFFCEAN LCDDANC -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 75( 7.2%); C : 16( 1.5%); D : 55( 5.3%); E : 84( 8.1%); F : 47( 4.5%) G : 39( 3.8%); H : 22( 2.1%); I : 50( 4.8%); K : 79( 7.6%); L :116(11.2%) M : 24( 2.3%); N : 52( 5.0%); P : 59( 5.7%); Q : 43( 4.1%); R : 64( 6.2%) S : 71( 6.8%); T : 46( 4.4%); V : 70( 6.8%); W : 3( 0.3%); Y : 22( 2.1%) KR : 143 ( 13.8%); ED : 139 ( 13.4%); AGP : 173 ( 16.7%); KRED : 282 ( 27.2%); KR-ED : 4 ( 0.4%); FIKMNY : 274 ( 26.4%); LVIFM : 307 ( 29.6%); ST : 117 ( 11.3%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 00000-+0+0 0+00-0-++0 -000-00000 0000000000 00000+0000 0000+00-0+ 61 0-00+-000- -0+00-000- 0000000000 000-00-00- --00-+0+00 0+00--00+0 121 0000000000 +00+0000+0 000000+00+ 00+0000+-- 00000000+0 0000000000 181 0000000000 0-0000000+ 00+0+0000- 00000000+0 00-000000- 00+-00+0+0 241 +0000000++ +0000-0000 00+0++0000 0-0-000+-+ 0+0-+00-00 +0--000-00 301 0000000000 0000--000+ 00000-00+0 00--00-+0+ 00-0-+0000 00000-000- 361 00000--0-0 +0000-+0-- 0--000000- +00-0-0-00 0+00-0-+00 00+-+-0000 421 +000000000 ++-00-0000 00000++00+ -+-0000+00 00-++00-0- 00000+00++ 481 0-0-000-00 0000000000 0000-00000 0000000000 000-000000 0-00000000 541 0000000000 0000000000 000000000- 0000000000 0+0++00+0+ 0+00000000 601 0+00000000 00-0---+-0 -0-+0--00+ 0+000000-0 000+0+000+ 000+0000-0 661 0+0+000000 ++00+00-0+ 0000-00-00 0+0000-0-0 +00+00-+-+ +00-000--- 721 +000000+0- +000+00000 000000-000 0000000000 000000+000 +0++00-000 781 000000000+ +000000+00 00-000-0+0 0-++000000 00000+-+00 -000000-00 841 00-+000000 -0000-0+-0 0+00-00++- 0000-0000+ 000000-000 -+00+-0+00 901 --00000000 00-00+0000 00000000+0 0+00+-00-- 0-0+++0-00 0+-+000--0 961 +00-0+0000 +++0000-00 0-0+++00+- 0+000-0+-0 00--00000+ 000000+00+ 1021 +00+000-00 00--000 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 12/30 or 16/45 or 20/60): none Negative charge clusters (cmin = 12/30 or 16/45 or 19/60): none Mixed charge clusters (cmin = 19/30 or 25/45 or 31/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. ________________________________ High scoring uncharged segments: score= 1.00 frequency= 0.728 ( LAGSVTIPNFQYHMCW ) score= 0.00 frequency= 0.000 ( BZX ) score= -8.00 frequency= 0.272 ( KEDR ) Expected score/letter: -1.447 M_0.01= 34.07; M_0.05= 28.45 1) From 489 to 581: length= 93, score=57.00 ** 489 PCVVGGSSNG GVILQGEIGQ LSLGSTVGLT VPGGPDTSLP PASEAPPPPP 539 PPPPPPPPLP SASSPNIPAP PPPPPLAPPL PDASPSVILS VAI G: 10(10.8%); S: 12(12.9%); P: 31(33.3%); C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 6 | 6 | 9 | 32 | 11 | 11 | 15 | 12 | 12 | 17 | 7 | 9 | lmin1 7 | 7 | 11 | 39 | 13 | 13 | 18 | 15 | 15 | 20 | 9 | 11 | lmin2 9 | 8 | 12 | 43 | 15 | 14 | 20 | 17 | 17 | 23 | 10 | 12 | (Significance level: 0.010000; Minimal displayed length: 6) (0) 55(2,0,0); at 525- 581: see sequence above (3. quartile) A: 7 (12.3%); S: 8 (14.0%); P: 28 (49.1%); ST: 9 (15.8%); Run count statistics: + runs >= 4: 0 - runs >= 4: 0 * runs >= 6: 0 0 runs >= 22: 2, at 170; 533; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-23-C-16-C-45-C-76-C-2-C-48-C-52-C-33-C-186-C-151-C-46-C-103-C-77-C-155-C-4-C-4-C-0-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-23-C-16-C-45-C-30-H-43-H-1-C-2-C-18-H-1-H-27-C-5-H-15-H-7-H-22-C-11-H-21-C-8-H-9-H-18-H-42-H-2-H-24-H-46-H-18-H-11-C-151-C-46-C-103-C-25-H-20-H-30-C-2-H-1-H-100-H-13-H-35-C-4-C-4-C-0-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 Aligned matching blocks: [ 539- 551] PPPPPP__PPLPSAS [ 558- 572] PPPPPPLAPPLPDAS with superset: [ 534- 541] PPPPPPPP [ 539- 546] PPPP__PPPP [ 558- 567] PPPPPPLAPP B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 77 (Expected range: 37-- 84) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 27 (6-10) 20 (11-20) 19 (>=21) 12 3. Clusters of amino acid multiplets (cmin = 12/30 or 16/45 or 19/60): none 4. Long amino acid multiplets (>= 5; Letter/Length/Position): P/13/534 P/6/558 B. CHARGE ALPHABET. 1. Total number of charge multiplets: 36 (Expected range: 16-- 51) 18 +plets (f+: 13.8%), 18 -plets (f-: 13.4%) Total number of charge altplets: 33 (Critical number: 56) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 3 (6-10) 2 (11-20) 15 (>=21) 17 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 107- 114 2 S. 4 4 0 122- 161 10 S...S..... 4 4 /0/./././1/./././././ 355- 394 10 L..V...... 4 4 /0/././1/././././././ 523- 567 5 P..P. 8 6 ! /1/././3/./ 534- 546 1 P 13 13 ! 0 536- 563 4 PPP. 6 4 /1/2/2/./ 558- 563 1 P 6 6 ! 0 560- 571 3 P.. 4 4 0 617- 624 2 E. 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10) Location Period Element Copies Core Errors 6- 23 3 *.. 6 6 0 276- 338 7 i...... 8 6 1 279- 296 3 *.. 6 6 0 317- 328 2 i0 6 6 /0/2/ 420- 431 2 i. 6 6 0 446- 493 6 *..... 8 8 0 575- 598 4 i..0 6 6 /0/././2/ 613- 624 2 -. 6 6 0 615- 632 3 *.* 6 6 /0/./2/ 664- 705 7 *..0... 6 6 /0/././1/./././ 932- 955 4 *0.. 6 6 /0/2/././ 936-1001 6 *..... 10 6 1 964-1005 7 *..0*.. 6 6 /0/././2/2/././ 982-1030 7 *....00 7 7 /0/././././2/2/ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 124- 252 (1.) P( 128)P 2 of 60 0.0001 large 2. maximal spacing 169- 192 (1.) *( 23)* 2 of 283 0.0115 large 2. maximal spacing 271- 481 (2.) P( 210)P 1 of 60 0.0001 large 1. maximal spacing 478- 819 (3.) H( 341)H 1 of 23 0.0032 large maximal spacing 480- 582 (3.) +( 102)+ 1 of 144 0.0000 large maximal spacing 532- 570 (3.) *( 38)* 1 of 283 0.0013 large 1. maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem21_2homol - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- FH2 Formin Homology 2 Domain 546.1 2.3e-160 1 Closter_coat Closterovirus coat protein 2.8 13 1 hexokinase Hexokinase 2.6 7 1 Archaeal_ATPase Archaeal ATPase -0.5 36 1 BRCA2_repeat BRCA2 repeat. -2.2 48 1 GED Dynamin GTPase effector domain -38.5 39 1 Acylphosphatase Acylphosphatase -42.7 70 1 Hydrolase haloacid dehalogenase-like hydrolase -50.5 48 1 COX5A Cytochrome c oxidase subunit Va -60.2 41 1 ELH Egg-laying hormone precursor -66.2 62 1 FUR Ferric uptake regulator family -67.1 92 1 DNA_polymeraseX DNA polymerase X family -72.1 47 1 DUF41 Domain of unknown function DUF41 -77.0 84 1 OspEF Borrelia outer surface protein E and -96.8 93 1 Syntaxin Syntaxin -114.5 30 1 DUF137 Protein of unknown function DUF137 -133.8 43 1 SMC_C SMC family, C-terminal domain -144.4 46 1 Pro_dh Proline dehydrogenase -164.9 73 1 Oxysterol_BP Oxysterol-binding protein -183.6 46 1 FemAB FemAB family -233.1 56 1 ERM Ezrin/radixin/moesin family -248.3 74 1 TRM N2,N2-dimethylguanosine tRNA methyltr -285.6 93 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Acylphosphatase 1/1 310 389 .. 1 89 [] -42.7 70 hexokinase 1/1 372 398 .. 1 27 [. 2.6 7 Pro_dh 1/1 201 452 .. 1 370 [] -164.9 73 Hydrolase 1/1 351 516 .. 1 187 [] -50.5 48 ERM 1/1 330 666 .. 1 392 [] -248.3 74 Syntaxin 1/1 332 693 .. 1 293 [] -114.5 30 DUF41 1/1 347 737 .. 1 247 [] -77.0 84 DNA_polymeraseX 1/1 368 738 .. 1 221 [] -72.1 47 DUF137 1/1 594 741 .. 1 193 [] -133.8 43 SMC_C 1/1 670 802 .. 1 210 [] -144.4 46 FUR 1/1 763 851 .. 1 124 [] -67.1 92 TRM 1/1 575 860 .. 1 476 [] -285.6 93 Closter_coat 1/1 842 863 .. 193 214 .] 2.8 13 Archaeal_ATPase 1/1 879 909 .. 247 277 .. -0.5 36 BRCA2_repeat 1/1 875 916 .. 1 35 [] -2.2 48 COX5A 1/1 874 965 .. 1 109 [] -60.2 41 ELH 1/1 725 979 .. 1 262 [] -66.2 62 GED 1/1 895 983 .. 1 97 [] -38.5 39 FH2 1/1 583 1004 .. 1 503 [] 546.1 2.3e-160 OspEF 1/1 771 1010 .. 1 282 [] -96.8 93 FemAB 1/1 724 1021 .. 1 443 [] -233.1 56 Oxysterol_BP 1/1 633 1023 .. 1 440 [] -183.6 46 Alignments of top-scoring domains: Acylphosphatase: domain 1 of 1, from 310 to 389: score -42.7, E = 70 *->nLkSVdyEVfGRVQGVfFRkytqteAkkLGLvGWVkNtsrGT....V ++SV+ +FR + q e kLGL + + T++++ gi|3170548 310 VVHSVE--------DMNFRVHLQFEFTKLGLDDYLEKSKH-TesdkL 347 tGqiqGpeakvrdmkEWLrktGSPkSrIekaeFsnekeiskleYtd<-* +qiq+ v d L+ ek+ e +++ l + gi|3170548 348 SVQIQAYLDNVFDVGGLLEDAETKNAALEKV----EELEDHLSHVT 389 hexokinase: domain 1 of 1, from 372 to 398: score 2.6, E = 7 *->adllqaveellddFtvstEtLrevtkr<-* ++l++veel+d++ tE+L+ev ++ gi|3170548 372 NAALEKVEELEDHLSHVTEKLLEVENE 398 Pro_dh: domain 1 of 1, from 201 to 452: score -164.9, E = 73 *->llgevflfltalGrplillavseamrlmgrqfvaGetiqsaietakr ++++ +l+l+ l + + +++ e++ ++ f++ + ++ t+++ gi|3170548 201 NSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRNKTSTVHQ 247 lnetiseglilsvdmLGvkdrkaikEavltaadAaayieaYkqaihaigk + ++ + ++ + v+t ++y+ +++ ++ai+k gi|3170548 248 GK-----KRPSQTELIN--------PVVKT----RRYF--CSPENEAISK 278 aaklsgifsvpgistklselslkyssleeerfelvmeellprlrelaekA + e++ y++ ee + m + + ++ ++ gi|3170548 279 EKHR-------------FERLMDYFRSEEGN-IDFMVACMQFINIVVHSV 314 qelgvavtiDaEetdrlelildifkelmmdpenkgregvgnviQAYLrra ++++ +v++ ++ ++ l ld ++e ++ + +++ iQAYL + gi|3170548 315 EDMNFRVHL---QFEFTKLGLDDYLE---KSKHTESDKLSVQIQAYL-DN 357 gedidalvalAkregrklgvRLVKGAYwdsErkRAqvkGyedppvftdka d+ l++ A+ ++ A +k +++ gi|3170548 358 VFDVGGLLEDAE------------------TKNAALEK--------VEEL 381 ttdmnyqcltrlllegrnykkiypmvAsHNadtvafvlqlakelg.iska + + + +t++lle++n + ++ ad + l++ kel+ i gi|3170548 382 ED--HLSHVTEKLLEVENETMMKV------ADLEKLLLHKDKELQvIR-- 421 dgqfeFqqLyGMgdqlsfeLvkagaDiGKLaynvrkYaPyGpvAHFeell ++ qls L +++ y + gi|3170548 422 -----------VSLQLSMLLRRET---------------YEST------- 438 aYLvRRLlENganssFvnkerdlk<-* + +++++k + +k gi|3170548 439 ----------SSQVNTLRKVIKEK 452 Hydrolase: domain 1 of 1, from 351 to 516: score -50.5, E = 48 *->ikavi...FDlDGTLvdtsepahaqawnealaelg.pfeeverllgr i+a +++ FD+ G L d +++ +a +++ + + +++ e+ll + gi|3170548 351 IQAYLdnvFDVGGLLED--AETKNAALEKVEELEDhLSHVTEKLLEV 395 gleg..rgilllrlfeda...............eayleaale.lklyPgv + e ++++l++l+ +++++ + + + + + ++e+++ + + + gi|3170548 396 ENETmmKVADLEKLLLHKdkelqvirvslqlsmLLRRETYEStSSQVNTL 445 relLeeLkakGyklaivTnssraaerllkrvvealglddyFdviitsddv r++++e a+ +r ++++++ + ++ gi|3170548 446 RKVIKEKDAA-------------FQR-------HFNIERRLLELEQQGTI 475 gsdevaekvgKPdPdiyllalerlgvepeeagpkvlvvgDsladdiaGaa + + KPd di +ep +v g s+ ++i gi|3170548 476 R------LHKKPDGDIA--------IEPC------VVGGSSNGGVILQGE 505 rAGmrtvlvnrg<-* G + + + g gi|3170548 506 -IGQLSLGSTVG 516 ERM: domain 1 of 1, from 330 to 666: score -248.3, E = 74 *->IkNKKGTdLwLGVDALGLNIYekdDrLTPKIgFPWSEIRNISFnDKK LGL Y gi|3170548 330 ---------------LGLDDY-------------------------- 335 FvIKPiDKKapdFvFyaprlrINKrILaLCmGNHeLYMRRRKPDTIEVQQ ++K+ ++ +D + VQ gi|3170548 336 -----LEKSK----------HTE-------------------SDKLSVQ- 350 MKaQAkEEKq......rKqaERekLarEkkaREiaekekEeaEreqqEle +QA + + ++ + +aE + a+Ek Ee E + gi|3170548 351 --IQAYLDNVfdvgglLEDAETKNAALEKV---------EELEDHLSHVT 389 erleqmeEelrkaqeeLeeaeekadeLeeklkqeeeeaqkLekkqselee e+l++ e+e+++ +Le+ +L k k+++ + +L+ l + gi|3170548 390 EKLLEVENETMMKVADLEK------LLLHKDKELQVIRVSLQ-----LSM 428 enrrLeeeamkseeererLeaevdeaeaevaqleeelekkeeEaeqlqte rr e++ s + + L++ + e++a ++q e++ E eq q gi|3170548 429 LLRRETYESTSSQV--NTLRKVIKEKDA-AFQRHFNIERRLLELEQ-QGT 474 lrsaqekeeeaneellevlaap............................ +r + +++ e p +++++++ +++ ++ + +++ + gi|3170548 475 IRLHKKPDGDIAIE-------Pcvvggssnggvilqgeigqlslgstvgl 517 .......................................tappmhhvsap + +++++++ ++ ++ +++++++++++++ ++ ++++ + +pp++++++p gi|3170548 518 tvpggpdtslppaseappppppppppppplpsasspnipAPPPPPPLAPP 567 s......................aELttdngsglesgdngeeasadlePk ++ +++ + + +++ +++ L+ n l + + ++ + ++ + gi|3170548 568 LpdaspsvilsvairikkpiktkFRLPVFNWTALKPNQINGTVFNEID-- 615 EFiLtdedmkdlsnEiEeervteaeknerlqtqLkaLksELssvrdenke d+ E+E+er +e+ k t+ + + ++Ls ++ + ++ gi|3170548 616 ----------DE-RELELERFEELFK-----TRAQGPIMDLSCTKSKVAQ 649 tanDi..lHeENsvRaGrDKYkTLRqIRkGNTKqRVDqFEsM<-* a + ++ N r+K ++ gi|3170548 650 KAVNKvtILDAN-----RSK--------------------NL 666 Syntaxin: domain 1 of 1, from 332 to 693: score -114.5, E = 30 *->mkdRlqelkaaqeddddeddvavvvdgakafmeeFfeqveeIRgnid d+l++ k+ ++d+ + d + + +e e+ ++ ++ gi|3170548 332 LDDYLEKSKHTESDKLSVQ-IQAYLDNVF-DVGGLLEDAETKNAALE 376 KlaenveevkkkhsaiLsaPnpdektkeeLeelmqeikktAnkvrnkLKa K++e + + ++L+ n+ +Le+l+ k + +r++L gi|3170548 377 KVEELEDHLSHVTEKLLEVENETMMKVADLEKLLLHKDKELQVIRVSLQL 426 iesnqsieqdealnrssadlrirktQhstlsrkFvevmteYnrtqsdYrE ++e ++Q+ tl++ +e ++r + E gi|3170548 427 SM----LLRRE-------TYESTSSQVNTLRKVIKEKDAAFQR--HFNIE 463 rcKgriqRqleItgreqttdEelEdmlesGnpaiFtqgilda........ r + q++I+ + + + + + +G+++ g++ +++ ++ + gi|3170548 464 RRLLELEQQGTIRLHKKPDGDIAIEPCVVGGSS--NGGVILQgeigqlsl 511 .................................................. +++ + + +++++++ ++ ++ +++++++++++++ ++ ++++ + ++++ gi|3170548 512 gstvgltvpggpdtslppaseappppppppppppplpsasspnipapppp 561 ........................................NdsqeaKqal ++ ++ ++ +++ + + +++ +++ + + + + q+ + gi|3170548 562 pplapplpdaspsvilsvairikkpiktkfrlpvfnwtalKPNQINGTVF 611 aeieanvylqeRHneimkLEssIrELhdmFmDmamlVesQGEmidrIekN +ei eR e+++ E + ++ + l + ++ + ++ gi|3170548 612 NEIDD-----ERELELERFEELFKTRAQ--GPIMDLSCTKSKVAQKAVNK 654 VedavdyVeravgdtkkAvkyqskARkkKiiIiiivvvlvvili<-* V + a + + A+ + + + + + + +v + gi|3170548 655 VT-----ILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRF 693 DUF41: domain 1 of 1, from 347 to 737: score -77.0, E = 84 *->lteeQLlstFsNvkhliGslevqnTnfkslsFLanLesIecg..... l+ Q +++ Nv + G le +T k++ L+++e++e + ++ + gi|3170548 347 LSV-QIQAYLDNVFDVGGLLEDAET--KNAA-LEKVEELEDHlshvt 389 .................................................. ++ + ++++ + + ++ +++++ + + + + + ++++ ++++ gi|3170548 390 ekllevenetmmkvadleklllhkdkelqvirvslqlsmllrretyests 439 .................................................. ++ ++ ++ ++++ +++ + +++ + +++++ + +++++++ ++ gi|3170548 440 sqvntlrkvikekdaafqrhfnierrlleleqqgtirlhkkpdgdiaiep 489 .................................................. +++++++ +++ ++ + +++ + + +++++++ ++ ++ ++++++ gi|3170548 490 cvvggssnggvilqgeigqlslgstvgltvpggpdtslppaseapppppp 539 .................................................. +++++++ ++ ++++ + ++++++ ++ ++ +++ + + +++ ++ gi|3170548 540 ppppppplpsasspnipapppppplapplpdaspsvilsvairikkpikt 589 .irk..rnkdrvrkildnihdnpfswidnqnmlelgllnlTnmtrlgLpi ++r + n+ + +++ i++ f+ id ++ L+ gi|3170548 590 kFRLpvFNWTALKPN--QINGTVFNEID---------------DERELE- 621 lsnldlnkLnlpnlknisnpnstgekiivnfenlhpdFClTteEllnffl l+++++++ ++ + + + C ++ + + gi|3170548 622 -------------LERFEELFK-TRAQGPI----MDLSCT-KSKVAQKAV 652 nsnvsienleakyCepksrifflikktdngivyklCnfkslsssvnLdng n+ + + +k+ ++r k + ++l++ L gi|3170548 653 NKVTILDANRSKNLAITLR--------------KANKTFDLKT---L--- 682 CtiIfGdLvIg...pgdEeyVskLknveviFGsLiIqNTnLtnid....F + f + +++ E V+ L ++e + L+ ++++++F gi|3170548 683 -PVDFVECLMRflpTEME--VKALRQYER-------ERRPLDQLAeedrF 722 LenLkyIasLedsvs<-* ++ I++L+ +++ gi|3170548 723 MLLFSKIERLTQRMN 737 DNA_polymeraseX: domain 1 of 1, from 368 to 738: score -72.1, E = 47 *->rvsraEalavaseikeavrelldPdaqvtltGsfRRGketggDvDiL +++ al+ +e++++++ ++ +++ ++ + ++++D+ L gi|3170548 368 AETKNAALEKVEELEDHLSHVT--EKLLEVE---NETMMKVADLEKL 409 itsPhqakdeElLhkvlnllkkkGlLLYydiieST.fekaklpsrlvevv + +kd+E + v+ + ++ +LL eST++ ++ l ++e + gi|3170548 410 LL----HKDKE--LQVIRVSLQLSMLLRRETYESTsSQVNTLRKVIKEKD 453 ehFqkcflilkvlakGvtkssvilkeglrvkfskaeeeenKswka..... Fq f+i++ l + ++++l +++ ++ gi|3170548 454 AAFQRHFNIERRLLELEQQGTIRLHKKP----------------Dgdiai 487 .................................................. ++ +++++++ +++ ++ + +++ + + +++++++ ++ ++ ++++ gi|3170548 488 epcvvggssnggvilqgeigqlslgstvgltvpggpdtslppaseapppp 537 .................................................. +++++++++ ++ ++++ + ++++++ ++ ++ +++ + + +++ gi|3170548 538 ppppppppplpsasspnipapppppplapplpdaspsvilsvairikkpi 587 .................................................. +++ + + + + ++++ +++ ++ +++++ + ++ ++ +++ +++ gi|3170548 588 ktkfrlpvfnwtalkpnqingtvfneidderelelerfeelfktraqgpi 637 ...............................................YCr + + ++++ ++ ++ + + +++++ + ++ +++ + ++ + + gi|3170548 638 mdlsctkskvaqkavnkvtildanrsknlaitlrkanktfdlktlpvDFV 687 RVDLrvvpeeqygaALLgfTGSkeFnrdLR...ryAtskgkklleyglly + r+ p+e AL ++++r+ R+ +++A + +++ll + + gi|3170548 688 ECLMRFLPTEMEVKAL------RQYERERRpldQLAEEDRFMLLFSKIER 731 dktkgkdialdervfiKdkAGepLaesEed<-* + r +i gi|3170548 732 LTQ---------RMNI-------------- 738 DUF137: domain 1 of 1, from 594 to 741: score -133.8, E = 43 *->LLLAknPVISVNGNTAALVPeEvVELAkvvnaKi..EvNLFyRTeER PV N AL P +n+ + +E+ ER gi|3170548 594 ------PVF----NWTALKPN-------QINGTVfnEID-----DER 618 ELAiVrkIeEvLKDDlEtGaiEVLGlddankrIpgLeSeRgkVsedGIyk EL + + +eE +K G i L + + + V I+ gi|3170548 619 ELEL-ERFEELFK-TRAQGPIMDLSCTKS-------KVAQKAVNKVTILD 659 ADVVLVPLEDGDR....tEiLvrmGKkVIaIDLNPLSRTARkAdITIVDN A R+++ + L++ K + DL L + V gi|3170548 660 A----------NRsknlAITLRKANK---TFDLKTL-------PVDFVEC 689 IiRAiPklIElvkEmrdlsREE..LeeIVEnyDNkKnLkEvlelIaeRLk + R P E vk +r ++RE+++L E+ D L +e +R gi|3170548 690 LMRFLPTEME-VKALRQYERERrpLDQLAEE-DRFMLLFSKIERLTQRMN 737 nlel<-* + gi|3170548 738 IITF 741 SMC_C: domain 1 of 1, from 670 to 802: score -144.4, E = 46 *->lekeikeLgpVlnElqvNlkAi.eEYaeveeRy...eelvekledle l k+ k++ +lk ++ ++ e+ R+ ++e +v l ++e gi|3170548 670 LRKANKTF---------DLKTLpVDFVECLMRFlptEMEVKALRQYE 707 eerkkllevieeldkkrleeFmeaFnkInknfkevFqeLtgGGdAeLrLt +er+ l ++ ee ++Fm F kI++ ++++ +t +G + gi|3170548 708 RERRPLDQLAEE------DRFMLLFSKIERLTQRM-NIITFIG----NFS 746 DpdDPFssGieisArPPgKkwknlelLSGGEKtLtALALlFAIhkykPsP D +++ +l+ A + A ++k sP gi|3170548 747 DNVAMLTP--------------QLN------------AIIAASASVKSSP 770 FYvlDEvDAALDeaNVsRvAnyIkrersknaQFIVIsLRnnmmekADaLv ++ ++++ I+ L n m + gi|3170548 771 -----------------KLKRMLE---------IILALGNYMNS------ 788 GVymqddgvskVislkL<-* ++++ +v+ ++L gi|3170548 789 --SKRG-CVYGFKLQSL 802 FUR: domain 1 of 1, from 763 to 851: score -67.1, E = 92 *->eaGlKiTpqRlkILevleksdeeHlsAEevYrellneedpniSlATV +a +K +p+ +++Le++ +++ Y+ ++++ ++ gi|3170548 763 SASVKSSPKLKRMLEIILALGN--------YMNSS-KRGCVYGF--- 797 YRtLklleeaGivkrlefeggesrfElaqeaghHHdHlICekCGkviEFk l + ++ +++ + ++ H I + v+E k gi|3170548 798 -----KLQSLDLLLDTKST--------DR--KMTLLHYIALI---VKE-K 828 ddeIekrqeeiaekhGfklvdHrLely<-* ++e+ + +e++ f + ++ l gi|3170548 829 YPELANFYNELH----FVDKAAAVSLE 851 TRM: domain 1 of 1, from 575 to 860: score -285.6, E = 93 *->ieveiEgeakivVpegsakvsfdsPeeVFYNPrmafNRDlsVlvlqa ++ + i++++ + k+ f+ P VF ++++N ++ +v+++ gi|3170548 575 VILS----VAIRIKK-PIKTKFRLP--VFNWTALKPN-QINGTVFNE 613 lsellgKkivaKrnnkdskkklsaEtrdeedkrlkteeespkkkdklsad ++ ++++ + ++ E + gi|3170548 614 ID-----------DERELELERFEELFK--------------------TR 632 neakievLDALSASGIRaIRfAlEvpgveevvaNDispkAVelIkeNvkl ++i +++ ++ va kAV + gi|3170548 633 AQGPI-----------------MDLSCTKSKVA----QKAVNKV------ 655 NfvgdvsnevvpnAIlkGEKTIVInkdDAnmlmaenkrfaerFdvIDLDP ++ + +++ A +++++ + Fd L gi|3170548 656 ----TILDANRSK-------------NLA-ITLRKANKT---FDLKTL-- 682 FGSPaPFlDsAvqSLvkrgGlLcVTATDtAvLCGNaypkaClrKYgAvpL P+ F++ + p+ gi|3170548 683 ---PVDFVECLMRF---------------------------------LPT 696 rgefCHEvGLRiLlgaiareAAKYekaiePLLSyS.iDHYvRVFvkvkrG +e++ +LR +Ye+ PL +++ D + +F k++r gi|3170548 697 EMEVK---ALR-----------QYERERRPLDQLAeEDRFMLLFSKIERL 732 aakvdkvieklGyvyhCsgCgsfevqklgkpskrksrskkkfakvsvaiP + + + + ++ + ++ ++ p ++ a +sv gi|3170548 733 TQR----MNIITFIG-----NFSDNVAMLTPQL----NAIIAASASV--- 766 PVsakCpHCGgkfhlaGPlWlGpLhDkeFveevleeaesep........e ++++ ++++le+ + +++ +++++ gi|3170548 767 ------------------------KSSPKLKRMLEIILA-LgnymnsskR 791 retyGTkkrilglLklikeElddppVlyYdlhqlasvlklsvPPlqdvvs +yG +++ l l + + d lh +a ++k P l + + gi|3170548 792 GCVYG--FKLQSLDLLLDTKSTDR--KMTLLHYIALIVKEKYPELANFYN 837 aLkeaGFevSrTHanptgIKTnApwdavwevlrelkke<-* +L H++ ++ A ++ vl + + gi|3170548 838 EL----------HFVDKA----AAV-SLENVLLDVREL 860 Closter_coat: domain 1 of 1, from 842 to 863: score 2.8, E = 13 *->ldrsggsevdskvvsLrdLGry<-* +d++++++ + +++ +r LG++ gi|3170548 842 VDKAAAVSLENVLLDVRELGKG 863 Archaeal_ATPase: domain 1 of 1, from 879 to 909: score -0.5, E = 36 *->LkyfLedlkysKpKVevgdEviEelyekvik<-* Lk fL ++ + KV + +Ee +++v++ gi|3170548 879 LKGFLQASDTQLDKVQRDAKTAEEAFNNVVN 909 BRCA2_repeat: domain 1 of 1, from 875 to 916: score -2.2, E = 48 *->nEnsalaFrTASGKkv.......kVSkesLkKakklFsdiee<-* ++ + +F AS + +++ +++ k ++e+++++++ F ++ gi|3170548 875 DHSVLKGFLQASDTQLdkvqrdaKTAEEAFNNVVNYFGESAK 916 COX5A: domain 1 of 1, from 874 to 965: score -60.2, E = 41 *->HGdeEtaEEFdaRYekyFdkeaiDlwEvrkgmNnllgyDLVPePkiI H d+ F + dk D+ Nn + y gi|3170548 874 H-DHSVLKGFLQASDTQLDKVQRDAKTAEEAFNNVVNY-------FG 912 eaALRAaRRvNDlatAiRvLEalKdKvgdekAelYpylLdeLkptlqELG e A A v ++ +R L a Kd v++ l + + ++ l gi|3170548 913 ESAKTAPPSVF-FPVFVRFLKAYKDAVEENE--LRKKQEQAMREKL---- 955 iptkEELgldks<-* + EE + gi|3170548 956 --LAEEAKQQDP 965 ELH: domain 1 of 1, from 725 to 979: score -66.2, E = 62 *->mkrPsyrsantmslilCLiLsaLCvssqsasvHGksFatnRAvKsss + r + m +i + + v+ + a+ vKss+ gi|3170548 725 LFSKIERLTQRMNIITFIGNFSDNVAMLTPQLN-AIIAASASVKSSP 770 pylvlsPadevvsmstengyrsalreafdksssdyddddddvvknekrrl l e++ ++ s+ r + d d + +++ gi|3170548 771 ---KLKRMLEIILALGNY-MNSSKRGCVYGFKLQSLDLLLDTKSTDRKMT 816 rfhkrrlrfdkrdqevsgl..krvmtsrvsasaDenspFDLsneDGAvyq +h l + + e +++ + +++++a + en + D+ +e G gi|3170548 817 LLHYIALIVKEKYPELANFynELHFVDKAAAVSLENVLLDV-RELGKGMD 865 rdlraPRLRFYslrKRAAGdmeqsEgqnPEtesHsrRKRsvLtPslssLg r L s+ K G + s+ q + ++ + + g gi|3170548 866 LIRRECSLHDHSVLK---GFLQASDTQLDKVQRDAKTAEEAFNNVVNYFG 912 esLEsGisKRi..sinqDLKAitDmLltEqiqeReryLadLRqRLLekGK es + LKA D +E R++ +++R +LL gi|3170548 913 ESAKTAPPSVFfpVFVRFLKAYKDA--VEENELRKKQEQAMREKLLAEEA 960 RssevsLlasdkdeeeRel<-* ++ + + a k +++ el gi|3170548 961 KQQDPKVQAQKKRQQQHEL 979 GED: domain 1 of 1, from 895 to 983: score -38.5, E = 39 *->qedseleeIksllkSYfnivrktladqvPkaImhllvneskdslqnF + + + ee+++++ Yf + kt +v + + ++ +kd ++ gi|3170548 895 RDAKTAEEAFNNVVNYFGESAKTAPPSVFFPVFVRFLKAYKDAVEE- 940 QellakLykpraeelldeLLeEdpeiaskRkelkkrLelLkkArqiisev +L+k ++ ++LL E + + + +k+ ++ + i+e gi|3170548 941 ----NELRKKQEQAMREKLLAEEAKQQDPKVQAQKK---RQQQHELIAEL 983 <-* gi|3170548 - - FH2: domain 1 of 1, from 583 to 1004: score 546.1, E = 2.3e-160 *->pkkvpkpkkpLKqLhWdkletlkqpdapPsekskgtvWgttlndeld kk++k+k++L+ ++W++l+ P+++++ tv+ +e+d gi|3170548 583 IKKPIKTKFRLPVFNWTALK--------PNQING-TVF-----NEID 615 etklLenegdldeleelFatkektksakkgdvsetksklmkkkvkknlki ++ +e++l+++eelF k++++++++ d+s+tksk+++k+v+k ++i gi|3170548 616 DE----RELELERFEELF--KTRAQGPIM-DLSCTKSKVAQKAVNK-VTI 657 ldpkrsqnppleIllrslhcdmsieetvikvailevDevkvlsedlleqL ld++rs+n l+I+lr+ a+ ++D k+l++d++e+L gi|3170548 658 LDANRSKN--LAITLRK--------------ANKTFDL-KTLPVDFVECL 690 lkfaPYsktkeelkklrevkedkekdleeLsraeqiFllllsnLipylee ++f+P t++e+k+lr+++++ +++l++L+++++ F+ll+s+ i +l + gi|3170548 691 MRFLP---TEMEVKALRQYERE-RRPLDQLAEEDR-FMLLFSK-IERLTQ 734 RlkallfklnfeeevedLkpklekvtaAseeLreSkkFkklLeliLaiGN R++ ++f++nf+++v++L p+l++++aAs ++++S+k+k++Le+iLa+GN gi|3170548 735 RMNIITFIGNFSDNVAMLTPQLNAIIAASASVKSSPKLKRMLEIILALGN 784 yMNggsrrrgqAkGFkLssLtkLsdvKsadnktTLLHylvkiirekypel yMN +s+r g+++GFkL+sL+ L+d+Ks+d+k+TLLHy++ i++ekypel gi|3170548 785 YMN-SSKR-GCVYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPEL 832 ldFysdlhhvdeaSsrkvnleelekdvkslekqiknlEtsldlgnlsdpe ++Fy++lh+vd+a+ v+le+++ dv++l+k +++ l +++s gi|3170548 833 ANFYNELHFVDKAA--AVSLENVLLDVRELGK---GMD--LIRRECSL-- 873 lhpdDkflevmrpFlkaAkekfdkLqkdlkdmktrleslveyygedpkdd h+ + +l+ Fl+a+ +++dk+q d+k+++++++++v+y+ge+ k+ gi|3170548 874 -HD-HSVLKG---FLQASDTQLDKVQRDAKTAEEAFNNVVNYFGESAKT- 917 lspneFFkdfneFleefkkarkenlAEeqrkekeeelkrrrklalkalsk +p FF f++Fl+++k+a++en ++++k+e+++r + la + gi|3170548 918 APPSVFFPVFVRFLKAYKDAVEEN----ELRKKQEQAMREKLLA-----E 958 eklereqgkskqkentlerrkernplmdgeaekeqdfkgvldsllkalrt e +++ +++++qk+ r+++++++a e+ +++++d+ + ++++ gi|3170548 959 EAKQQDPKVQAQKK--------RQQQHELIA--ELRKRQAKDHRPVYEGK 998 dssfrr<-* d+ +++ gi|3170548 999 DGTIED 1004 OspEF: domain 1 of 1, from 771 to 1010: score -96.8, E = 93 *->MNkkmKnlIICAVFALIsSCKnyAssedleQnvkEKveGFldkaldP k +++l I I+ +ny s + + v + + ld ld gi|3170548 771 --KLKRMLEI------ILALGNYMNS-SKRGCVYGFKLQSLDLLLD- 807 AkDkitsssskvDELARkLqEEDkikGvEEnnkDELvqGddpnfSeftvk t s + L + k k E n + d+ + v gi|3170548 808 -----TKSTDRKMTLLHYIALIVKEKYPELANFYNELHFVDKA---AAVS 849 ikpvl..............pasSPqvqdetlvlkaVrqsDGqqEEkvkkv ++vl + ++ +++ + + + S + vlk+ q+ q +kv++ gi|3170548 850 LENVLldvrelgkgmdlirRECS---LHDHSVLKGFLQASDTQLDKVQRD 896 ee..eegkeeGleekkEntEEgesqeLkdkeekqkkRkAEqEkrkkelek +++ ee+ + E ++ + + k gi|3170548 897 AKtaEEAFNNVVNYFGESAKTAPPSVFFPVFVR--------------FLK 932 ArqKgEEEqeFKaqvesgtkEk..kEkqkken..gIqrKaEkEREEqiit A + EE e ++ e + +Ek +E+ k+++++ + +K+ + E i + gi|3170548 933 AYKDAVEENELRKKQEQAMREKllAEEAKQQDpkVQAQKKRQQQHELIAE 982 akelivdgeystfdGkidkindniDvikEqtsvsakkViddAlenilEeL +++ ++ G +d + i+d++ gi|3170548 983 LRKRQAKDHRPVYEG-----KD--------------GTIEDIIT------ 1007 kKlak<-* +k gi|3170548 1008 --VLK 1010 FemAB: domain 1 of 1, from 724 to 1021: score -233.1, E = 56 *->LtaleFenftnkmpvysHftQsienyelrnsegfevhlVGvkddnne L ++e+ t+ m++ + +n+ + + + ++ + + + + gi|3170548 724 LLFSKIERLTQRMNIITFIGNFSDNVAM-LTPQLNAIIAASASVKSS 769 ViAAclltqvpvmklFkyFYsnrGPVmdfenkeLvnfFfkeLskYlkknk + +l + +++ +++ s rG V f+ l++ gi|3170548 770 PKLKRMLEIILALGNYMN-SSKRGCVYGFK---------------LQSLD 803 cLyPvkidPYlpynlrdldgeiiesynkndalvdklkklgsALLPFfehq L ++++ + + ++ + + +i +l++ +++l f+ + gi|3170548 804 -LL-LDTKSTDRKMTLLHYIALIVKEK-YPELANFYNELH------FVDK 844 GFttgFdpPWGWAAAvCsssQirwhSVLdLenkteetLlkem....dkqr ++ + + V L +k ++++ +e + ++++ gi|3170548 845 AAAVSLEN--------------VLLDVREL-GKGMDLIRRECslhdHSVL 879 kRRAACNikKtkkngVkVRdLsedElsiFrklykdTeerkgFndrdddY. k + +++ + kV + + +F + ++ e ++ ++ ++ gi|3170548 880 KG----FLQASDTQLDKVQRDAKTAEEAFNNVVNYFGESAKTAPPSVFFp 925 .fynlikiykDkslvplAyInfdeYleklnrereqlnkkinkadkalakn f +++k+ykD + + gi|3170548 926 vFVRFLKAYKD-----------------------AVE------------- 939 PnskKnknKlneLnqQleaneakieeak.eLikehGpesEDRELGCADel ++++ k++ + + l a eak +k + +k+ gi|3170548 940 -ENELRKKQEQAMREKLLAEEAKQQDPKvQAQKKRQQ------------- 975 diAAgffiinpfEVvYlyGGSsneYrhfmGpYalQWemInyAkdhgIdrY Q e I ++++++ + gi|3170548 976 ----------------------------------QHELIAELRKRQAK-- 989 NFYGIsGdFsedaedyGVykFKkGFnaeVieyiGdFikPinkliYkiytl + + Vy +G + ++ i ++ k + + + gi|3170548 990 ---------DHRP----VY---EGKDGTIED-----IITVLKSVPFTART 1018 Lkk<-* +k+ gi|3170548 1019 AKR 1021 Oxysterol_BP: domain 1 of 1, from 633 to 1023: score -183.6, E = 46 *->kSqhsSsssltsklkfilsepkdltrRldipeeslspisLwsisqlw + + s t++ +++ +k++ +++ + +L + +++ gi|3170548 633 AQGPIMDLSCTKSKVAQKAVNKVTIL------DANRSKNLAITLRKA 673 vGkdLskitlPvfinEpkSmLqrvtedleYpel..LdkAaaredsltrWf k + tlPv + E + m ++te e++ L + + +l + + gi|3170548 674 N-KTFDLKTLPVDFVECL-MRFLPTE----MEVkaLRQYERERRPLDQ-L 716 lyvaastlssysettrrvkKPlNPlLGEtFelkrendElpnkgqyrllsE ++ l ++ r+ N + tF + +n l+ gi|3170548 717 AEEDRFMLLF--SKIERLTQRMNII---TFIGNFS----DN---VAMLTP 754 QVSHHPPvsAffaespkskwrvqgevqikskFlGkslavmkeGltvhtil Q + A +a + s +++ i lG+++ k G + +l gi|3170548 755 QLN---AIIAASASVKSS-PKLKRMLEIILA-LGNYMNSSKRGCVYGFKL 799 dikdingkpEtYtwtqPn......ttveGILvGkpfvElSGdesvinsst d + + t+ ++++ + ++ + ++v + + El + gi|3170548 800 QSLD------LLLDTKSTdrkmtlLHYIALIVKEKYPEL---------AN 834 GdkcvIeFk.tkGyfSGkkdsfkgrivkdpdgnkdKE...kklytlsGkW ++ ++F++++ +S +r+ + + +E + + +l G gi|3170548 835 -FYNELHFVdKAAAVSLENVLLDVRELGKGMDLIRREcslHDHSVLKGFL 883 ndsmyikKvqsksegelflDahktphealk..vrpLeeqhelesrkaWkk s + ++ ++ Da kt+ ea ++ v e + + a gi|3170548 884 QASDTQLDKVQR-------DA-KTAEEAFNnvVNYFGE----SAKTAPPS 921 vadalkvgnltvftktkneleerqRpllkptesrgvedqralfkrledke v ++ v+ l + + +e e r ++ ++ +e + e++ ++ + gi|3170548 922 VFFPVFVRFLKAYKDAVEENELRKKQEQAMREKLLAEEAKQQDPKVQAQK 971 dldkeekkekpeaddkerkqkgslyepkwfvpsgqpvtdKeklyikarke ++ ++++e ++ k + ye k ++ +t ++ ++++ gi|3170548 972 -KRQQQHELIAELRKRQAKDHRPVYEGKDGTI-EDIIT--VLKSVPFTAR 1017 yWefrKk<-* +r gi|3170548 1018 TA-KRGS 1023 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem21_2homol - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- FH2 Formin Homology 2 Domain 544.3 8.4e-160 1 Closter_coat Closterovirus coat protein 2.8 13 1 hexokinase Hexokinase 2.6 7 1 Poty_P1 Potyvirus P1 protease 2.4 12 1 Ribosomal_S12e Ribosomal protein S12e 2.2 27 1 BRCA2_repeat BRCA2 repeat. 2.2 68 1 Ribosomal_S28e Ribosomal protein S28e 2.1 23 1 HR1 Hr1 repeat motif 2.1 31 1 Mg_chelatase Magnesium chelatase, subunit ChlI 0.1 41 1 Monooxygenase Monooxygenase -0.1 73 1 Archaeal_ATPase Archaeal ATPase -0.5 36 1 RNA_helicase RNA helicase -0.9 97 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Ribosomal_S12e 1/1 71 83 .. 118 130 .] 2.2 27 hexokinase 1/1 372 398 .. 1 27 [. 2.6 7 HR1 1/1 398 421 .. 21 44 .. 2.1 31 Ribosomal_S28e 1/1 703 714 .. 60 71 .] 2.1 23 Monooxygenase 1/1 701 722 .. 195 216 .] -0.1 73 RNA_helicase 1/1 715 734 .. 326 345 .] -0.9 97 Closter_coat 1/1 842 863 .. 193 214 .] 2.8 13 Archaeal_ATPase 1/1 879 909 .. 247 277 .. -0.5 36 BRCA2_repeat 1/1 900 916 .. 19 35 .] 2.2 68 Mg_chelatase 1/1 935 982 .. 291 343 .] 0.1 41 FH2 1/1 583 1004 .. 1 503 [] 544.3 8.4e-160 Poty_P1 1/1 937 1008 .. 1 76 [. 2.4 12 Alignments of top-scoring domains: Ribosomal_S12e: domain 1 of 1, from 71 to 83: score 2.2, E = 27 *->EtqaldvlleyFq<-* E +ld+l ey++ gi|3170548 71 ENRGLDILVEYLS 83 hexokinase: domain 1 of 1, from 372 to 398: score 2.6, E = 7 *->adllqaveellddFtvstEtLrevtkr<-* ++l++veel+d++ tE+L+ev ++ gi|3170548 372 NAALEKVEELEDHLSHVTEKLLEVENE 398 HR1: domain 1 of 1, from 398 to 421: score 2.1, E = 31 *->EkllRltsdekLlllkketkevkr<-* E+++++++ ekLll+k ++ +v+r gi|3170548 398 ETMMKVADLEKLLLHKDKELQVIR 421 Ribosomal_S28e: domain 1 of 1, from 703 to 714: score 2.1, E = 23 *->LlEsEREARrLr<-* L+ ERE R+L+ gi|3170548 703 LRQYERERRPLD 714 Monooxygenase: domain 1 of 1, from 701 to 722: score -0.1, E = 73 *->elLdtYeaeRrpraervqdlar<-* ++L+ Ye+eRrp + + + + gi|3170548 701 KALRQYERERRPLDQLAEEDRF 722 RNA_helicase: domain 1 of 1, from 715 to 734: score -0.9, E = 97 *->elseenrrdqvvsllealfq<-* +l ee+r++ ++s++e+l q gi|3170548 715 QLAEEDRFMLLFSKIERLTQ 734 Closter_coat: domain 1 of 1, from 842 to 863: score 2.8, E = 13 *->ldrsggsevdskvvsLrdLGry<-* +d++++++ + +++ +r LG++ gi|3170548 842 VDKAAAVSLENVLLDVRELGKG 863 Archaeal_ATPase: domain 1 of 1, from 879 to 909: score -0.5, E = 36 *->LkyfLedlkysKpKVevgdEviEelyekvik<-* Lk fL ++ + KV + +Ee +++v++ gi|3170548 879 LKGFLQASDTQLDKVQRDAKTAEEAFNNVVN 909 BRCA2_repeat: domain 1 of 1, from 900 to 916: score 2.2, E = 68 *->SkesLkKakklFsdiee<-* ++e+++++++ F ++ gi|3170548 900 AEEAFNNVVNYFGESAK 916 Mg_chelatase: domain 1 of 1, from 935 to 982: score 0.1, E = 41 *->rlfvdedpDvkraallaLrhRsvLrddplepsrtpppvlskikvqar ++ v+e +++++ + a+r+ L + ++ +++ p v ++ k q++ gi|3170548 935 KDAVEEN-ELRKKQEQAMREK--LLAEEAKQQD--PKVQAQKKRQQQ 976 feevfa<-* e++++ gi|3170548 977 HELIAE 982 FH2: domain 1 of 1, from 583 to 1004: score 544.3, E = 8.4e-160 *->pkkvpkpkkpLKqLhWdkletlkqpdapPsekskgtvWgttlndeld kk++k+k++L+ ++W++l+ P+++++ tv+ +e+d gi|3170548 583 IKKPIKTKFRLPVFNWTALK--------PNQING-TVF-----NEID 615 etklLenegdldeleelFatkektksakkgdvsetksklmkkkvkknlki ++ +e++l+++eelF k++++++++ d+s+tksk+++k+v+k ++i gi|3170548 616 DE----RELELERFEELF--KTRAQGPIM-DLSCTKSKVAQKAVNK-VTI 657 ldpkrsqnppleIllrslhcdmsieetvikvailevDevkvlsedlleqL ld++rs+n l+I+lr+ a+ ++D k+l++d++e+L gi|3170548 658 LDANRSKN--LAITLRK--------------ANKTFDL-KTLPVDFVECL 690 lkfaPYsktkeelkklrevkedkekdleeLsraeqiFllllsnLipylee ++f+P t++e+k+lr+++++ +++l++L+++++ F+ll+s+ i +l + gi|3170548 691 MRFLP---TEMEVKALRQYERE-RRPLDQLAEEDR-FMLLFSK-IERLTQ 734 RlkallfklnfeeevedLkpklekvtaAseeLreSkkFkklLeliLaiGN R++ ++f++nf+++v++L p+l++++aAs ++++S+k+k++Le+iLa+GN gi|3170548 735 RMNIITFIGNFSDNVAMLTPQLNAIIAASASVKSSPKLKRMLEIILALGN 784 yMNggsrrrgqAkGFkLssLtkLsdvKsadnktTLLHylvkiirekypel yMN +s+r g+++GFkL+sL+ L+d+Ks+d+k+TLLHy++ i++ekypel gi|3170548 785 YMN-SSKR-GCVYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPEL 832 ldFysdlhhvdeaSsrkvnleelekdvkslekqiknlEtsldlgnlsdpe ++Fy++lh+vd+a+ v+le+++ dv++l+k +++ l +++s gi|3170548 833 ANFYNELHFVDKAA--AVSLENVLLDVRELGK---GMD--LIRRECSL-- 873 lhpdDkflevmrpFlkaAkekfdkLqkdlkdmktrleslveyygedpkdd h+ + +l+ Fl+a+ +++dk+q d+k+++++++++v+y+ge+ k+ gi|3170548 874 -HD-HSVLKG---FLQASDTQLDKVQRDAKTAEEAFNNVVNYFGESAKT- 917 lspneFFkdfneFleefkkarkenlAEeqrkekeeelkrrrklalkalsk +p FF f++Fl+++k+a++en ++++k+e+++r + la + gi|3170548 918 APPSVFFPVFVRFLKAYKDAVEEN----ELRKKQEQAMREKLLA-----E 958 eklereqgkskqkentlerrkernplmdgeaekeqdfkgvldsllkalrt e +++ +++++qk+ r+++++++a e+ +++++d+ + ++++ gi|3170548 959 EAKQQDPKVQAQKK--------RQQQHELIA--ELRKRQAKDHRPVYEGK 998 dssfrr<-* d+ +++ gi|3170548 999 DGTIED 1004 Poty_P1: domain 1 of 1, from 937 to 1008: score 2.4, E = 12 *->adleakvaerllrkeekldqmskikqekkgrvilrklspkqvakkre a +e+++++++++ +++++++ ++ +++ +v+ k+ ++q + ++e gi|3170548 937 AVEENELRKKQEQAMREKLLAEEA-KQQDPKVQAQKKRQQQHELIAE 982 klkreereerqkvkflqgayasivskitp<-* kr+++ r +++g++ +i+ +it+ gi|3170548 983 LRKRQAKDHRP---VYEGKDGTIEDIITV 1008 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem21_2homol - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Fri Jan 19 10:42:25 2001 Sequence file: tem21_2homol ---------------------------------------- Sequence gi|3170548|gb|AAC34395.1| (1037 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 240: NKTS 397: NETM 597: NWTA 607: NGTV 661: NRSK 674: NKTF 744: NFSD 787: NSSK Total matches: 8 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 250: KRPS 431: RRET 813: RKMT 1020: KRGS Total matches: 4 Matching pattern PS00005 PKC_PHOSPHO_SITE: 132: SAR 145: SSR 148: TIK 156: SQK 235: SSK 264: TRR 344: SDK 389: TEK 444: TLR 474: TIR 669: TLR 733: TQR 765: SVK 769: SPK 788: SSK 789: SKR 811: TDR 914: SAK 1015: TAR 1018: TAK Total matches: 20 Matching pattern PS00006 CK2_PHOSPHO_SITE: 113: SIED 156: SQKD 227: SAFD 253: SQTE 313: SVED 342: TESD 656: TILD 696: TEME 727: SKIE 872: SLHD 888: TQLD 899: TAEE 1001: TIED Total matches: 13 Matching pattern PS00008 MYRISTYL: 95: GSENGE 141: GSVSSS 493: GGSSNG 494: GSSNGG 512: GSTVGL 521: GGPDTS 608: GTVFNE 783: GNYMNS 792: GCVYGF Total matches: 9 Matching pattern PS00009 AMIDATION: 247: QGKK Total matches: 1 Total no of hits in this sequence: 55 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1037 residues. Total no of hits in all sequences: 55. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem21_2homol ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1037 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1037 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem21_2homol - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem21_2homol - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] (1037 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value UBHYD Ubiquitin C-terminal hydrolase domain 27 0.13 SH3 Src Homology domain 3 26 0.19 ARM Armadillo repeat 26 0.23 DNASE1 DNASE-1/Sphingomyelinase like domain 24 0.74 PAP Papain/bleomycin hydrolase like domain 24 0.75 PDE cyclic NMP phosphodiesterase domain 22 2.5 CYCLIN Cyclin/TFIIB domain 23 2.7 BRIGHT BRIGHT domain (Alpha helical DNA binding domain) 23 2.8 ARR Arrestin domain 23 3.0 SEC7 Sec7 like GDP exchange factor for ARF like GTPases 22 3.1 PP2C PP2C type phosphatase domain 22 3.8 CALMO Calmodulin like EF-hand domains 22 4.9 KIN Protein kinase domain 22 5.0 14-3-3 14-3-3 protein alpha Helical domain 21 7.0 PDZ PDZ domain (A protein-protein interaction domain) 21 9.0 CHLMY Adenovirus like protease domain 21 9.0 >UBHYD Ubiquitin C-terminal hydrolase domain Length = 884 Score = 27.0 bits (59), Expect = 0.13 Identities = 28/124 (22%), Positives = 28/124 (22%), Gaps = 18/124 (14%) Query: 905 NNVVNYFGESAKTAPPSVFFPVFVRFLKA--YKDAVEENELRKKQEQAMREKLLAEEAKQ 962 Sbjct: 596 SNRMNYFQVAKTVAQRLNTDPMLLQFFKSQGYRDG-PGNPLRHNYEGTLRD-LLQFFKPR 653 Query: 963 QDPKVQAQKKRQQQHELIAELRKRQAK----------DHRPVYEGKDGTIEDII-TVLKS 1011 Sbjct: 654 QPKKLYYQQLKMKITDFE---NRRSFKCIWLNSQFREEEITLYPDKHGCVRDLLEECKKA 710 Query: 1012 VPFT 1015 Sbjct: 711 VELG 714 Score = 21.1 bits (44), Expect = 6.7 Identities = 30/138 (21%), Positives = 30/138 (21%), Gaps = 16/138 (11%) Query: 221 GHEIILSAFDNFKESSKHRNKTSTVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEK 280 Sbjct: 754 IEEIPLDQVDIDKENE----MLVTVAHFHKEVFGTFGIPFLLRIH------QGEHFREVM 803 Query: 281 HRFERLMDYFRSEEGNIDF---MVACMQFINIVVHSVEDMNFRV---HLQFEFTKLGLDD 334 Sbjct: 804 KRIQSLLDIQEKEFEKFKFAIVMTGRHQYINEDEYEVNLKDFEPQPGNMSHPRPWLGLDH 863 Query: 335 YLEKSKHTESDKLSVQIQ 352 Sbjct: 864 FNKAPKRSRYTYLEKAIK 881 >SH3 Src Homology domain 3 Length = 90 Score = 26.4 bits (58), Expect = 0.19 Identities = 4/14 (28%), Positives = 4/14 (28%) Query: 535 PPPPPPPPPPPPLP 548 Sbjct: 1 PRVQAWSPPVRGIP 14 Score = 24.4 bits (53), Expect = 0.70 Identities = 7/17 (41%), Positives = 7/17 (41%) Query: 553 PNIPAPPPPPPLAPPLP 569 Sbjct: 1 PRVQAWSPPVRGIPQLP 17 >ARM Armadillo repeat Length = 532 Score = 26.1 bits (57), Expect = 0.23 Identities = 17/81 (20%), Positives = 17/81 (20%), Gaps = 10/81 (12%) Query: 931 LKAYKDAVEENELRKKQEQAMREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAKD 990 Sbjct: 13 RNRYKVAVDAEEGRRRREDNMVE---IRKSKREE---SLMKKRREGMQALQGFPSASAAS 66 Query: 991 HRPVYEGKDGTIEDIITVLKS 1011 Sbjct: 67 VDK----KLDSLKDMVAGVWS 83 >DNASE1 DNASE-1/Sphingomyelinase like domain Length = 388 Score = 24.4 bits (52), Expect = 0.74 Identities = 6/48 (12%), Positives = 6/48 (12%), Gaps = 4/48 (8%) Query: 942 ELRKKQEQAMREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAK 989 Sbjct: 48 ELLEASQHDQSEPLDPKELLDECQV----ALQDRPARLHRDFFSLRSE 91 >PAP Papain/bleomycin hydrolase like domain Length = 376 Score = 24.3 bits (52), Expect = 0.75 Identities = 3/16 (18%), Positives = 3/16 (18%) Query: 548 PSASSPNIPAPPPPPP 563 Sbjct: 361 ARVQKPDMKPRVSCPP 376 Score = 23.5 bits (50), Expect = 1.2 Identities = 3/18 (16%), Positives = 3/18 (16%) Query: 526 SLPPASEAPPPPPPPPPP 543 Sbjct: 359 LTARVQKPDMKPRVSCPP 376 >PDE cyclic NMP phosphodiesterase domain Length = 350 Score = 22.4 bits (47), Expect = 2.5 Identities = 32/164 (19%), Positives = 32/164 (19%), Gaps = 9/164 (5%) Query: 705 QYERERRPLDQLAEEDRFMLLFSKIERLTQRMNIITFIGNFSDNVAMLTPQLNAIIAASA 764 Sbjct: 163 QYKELRRSVVQLILATDMQNHFEHTNKFQHHLNNLPFDRNKKEDRQMI---LNFLIKCGD 219 Query: 765 SVKSSPKLKRMLEIILALGN-YMNSSKRGCVYGFKLQSLDLLLDTKSTDRKMTLLHYIAL 823 Sbjct: 220 ISNIARPWHLNFEWSLRVSDEFFQQSHYETICGYPVTP---FMDKTKTTRARIAADFIDF 276 Query: 824 IVKEKYPELANFYNELHFVDKAAAVSLEN--VLLDVRELGKGMD 865 Sbjct: 277 VASPLFQSMAKFLKESQFLLKVISKNRENWQAYMELQKEGKCND 320 >CYCLIN Cyclin/TFIIB domain Length = 317 Score = 22.6 bits (48), Expect = 2.7 Identities = 10/92 (10%), Positives = 10/92 (10%), Gaps = 4/92 (4%) Query: 129 LVRSARQSVLRYGSVSSSRTIKNSRLVSQKD-DVHVCIMCLRAIMNYQVLYGFNMVMS-H 186 Sbjct: 15 RLRADANRKFRCKAVANGKVLPNDPVFLEPHEEMTLCKYYEKRLLEFCSVFKPAMPRSVV 74 Query: 187 AHAVNEIALSLNNKNSRTKALVLELLAAVCLV 218 Sbjct: 75 GTACMYFKRFYLNNSVMEYH--PRIIMLTCAF 104 >BRIGHT BRIGHT domain (Alpha helical DNA binding domain) Length = 172 Score = 22.6 bits (48), Expect = 2.8 Identities = 7/65 (10%), Positives = 7/65 (10%) Query: 932 KAYKDAVEENELRKKQEQAMREKLLAEEAKQQDPKVQAQKKRQQQHELIAELRKRQAKDH 991 Sbjct: 99 TQYMKYLYPYECEKKNLSTPAELQAAIDGNRREGRRSSYGQYEAMHNQMPMTPISRPSLP 158 Query: 992 RPVYE 996 Sbjct: 159 GGMQQ 163 >ARR Arrestin domain Length = 454 Score = 22.5 bits (48), Expect = 3.0 Identities = 7/25 (28%), Positives = 7/25 (28%) Query: 539 PPPPPPPPLPSASSPNIPAPPPPPP 563 Sbjct: 380 SKPPESPERTDRGLPSIEATNGSEP 404 >SEC7 Sec7 like GDP exchange factor for ARF like GTPases Length = 192 Score = 22.1 bits (47), Expect = 3.1 Identities = 15/67 (22%), Positives = 15/67 (22%), Gaps = 3/67 (4%) Query: 662 RSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEMEVKALRQYERERRPLDQLAEEDR 721 Sbjct: 50 SRQMIGEFLGNRQKQFNRDVL-DCVVD-EMDFSTMEL-DEALRKFQAHIRVQGEAQKVER 106 Query: 722 FMLLFSK 728 Sbjct: 107 LIEAFSQ 113 >PP2C PP2C type phosphatase domain Length = 254 Score = 22.2 bits (47), Expect = 3.8 Identities = 7/40 (17%), Positives = 7/40 (17%), Gaps = 3/40 (7%) Query: 200 KNSRTKALVLELLAAVCLVRGGHE---IILSAFDNFKESS 236 Sbjct: 215 TGQGNCDDQAVQLIEEAKNAGGSDNITIVLVDFSEDNQDS 254 >CALMO Calmodulin like EF-hand domains Length = 147 Score = 21.6 bits (45), Expect = 4.9 Identities = 8/28 (28%), Positives = 8/28 (28%) Query: 378 VEELEDHLSHVTEKLLEVENETMMKVAD 405 Sbjct: 103 AAELKHVLTSIGEKLTDAEVDDMLREVS 130 >KIN Protein kinase domain Length = 313 Score = 21.6 bits (45), Expect = 5.0 Identities = 14/113 (12%), Positives = 14/113 (12%), Gaps = 13/113 (11%) Query: 243 STVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFMVA 302 Sbjct: 43 GVVCSAKDNLTGEKVAIKKISKAF-----DNLKDTKRTLREIHLLRHFKHEN-------- 89 Query: 303 CMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESDKLSVQIQAYL 355 Sbjct: 90 LISIKDILKPNSKEQFEDVYIVSELMDTDLHQIITSPQPLSDDHCQYFVYQML 142 >14-3-3 14-3-3 protein alpha Helical domain Length = 270 Score = 21.1 bits (44), Expect = 7.0 Identities = 12/41 (29%), Positives = 12/41 (29%), Gaps = 1/41 (2%) Query: 945 KKQEQAMREKLLAEEAKQQDPKVQAQKK-RQQQHELIAELR 984 Sbjct: 4 TSREDAVYLAKLAEQAERYEGMVENMKSVASTDQELTVEER 44 >PDZ PDZ domain (A protein-protein interaction domain) Length = 90 Score = 20.9 bits (44), Expect = 9.0 Identities = 6/49 (12%), Positives = 6/49 (12%), Gaps = 5/49 (10%) Query: 475 IRLHKKPDGDIAIEPCVVGGSSNGGVILQGEIGQLSLGSTVGLTVPGGP 523 Sbjct: 3 LEFGQVRRGLLGIK--GGELNADLAKAFNVSAQQ---GAFVSEVLPKSA 46 >CHLMY Adenovirus like protease domain Length = 157 Score = 21.0 bits (44), Expect = 9.0 Identities = 10/38 (26%), Positives = 10/38 (26%), Gaps = 7/38 (18%) Query: 303 CMQFINIVV------HSVEDM-NFRVHLQFEFTKLGLD 333 Sbjct: 119 SCMFAEYITRNISLTFSQKDMPRFRKQMKREITNGRLD 156 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 16 Number of calls to ALIGN: 19 Length of query: 1037 Total length of test sequences: 20182 Effective length of test sequences: 16037.0 Effective search space size: 16004148.5 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|3170548|gb|AAC34395.1| unknown [Takifugu rubripes] (1037 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|116871 [267..480] 4-bladed beta-propeller 28 0.84 gi|2499877 [3..454] Cysteine proteinases 27 1.3 gi|1515148 [263..496] Protein kinases (PK), catalytic core 27 1.3 gi|2208905 [146..371] Cupredoxins 27 1.8 gi|631175 [2..294] S-adenosyl-L-methionine-dependent methylt... 26 3.1 gi|2780359 [156..339] Cupredoxins 26 4.1 gi|2648240 [4..218] Aldehyde ferredoxin oxidoreductase, N-te... 26 4.2 gi|625241 [2..283] OB-fold 25 4.3 gi|2144185 [64..321] Flavodoxin-like 26 4.4 gi|2145546 [201..337] Metzincin-like 25 4.5 gi|2649774 [45..384] Periplasmic binding protein-like I 25 4.6 gi|1546735 [56..315] (Phosphotyrosine) protein phosphatases ... 25 6.1 gi|1827916 [1..200] N-terminal domain of DNA photolyase 25 7.2 gi|1666073 [6..176] P-loop containing nucleotide triphosphat... 25 7.6 gi|586529 [222..473] 7-bladed beta-propeller 24 8.0 gi|127428 [1..429] S-adenosyl-L-methionine-dependent methylt... 25 8.2 gi|2815132 [909..1205] 7-bladed beta-propeller 24 8.3 gi|1943833 [45..314] Metallo-dependent phosphatases 24 8.8 >gi|116871 [267..480] 4-bladed beta-propeller Length = 214 Score = 28.0 bits (62), Expect = 0.84 Identities = 7/33 (21%), Positives = 7/33 (21%) Query: 539 PPPPPPPPLPSASSPNIPAPPPPPPLAPPLPDA 571 Sbjct: 2 TPALGPQAGIDTNEIAPLEPDAPPDACEASFDA 34 >gi|2499877 [3..454] Cysteine proteinases Length = 452 Score = 27.4 bits (60), Expect = 1.3 Identities = 3/30 (10%), Positives = 3/30 (10%) Query: 530 ASEAPPPPPPPPPPPPPLPSASSPNIPAPP 559 Sbjct: 418 YEVVVDKKHVPEEVLAVLEQEPIVLPAWDP 447 >gi|1515148 [263..496] Protein kinases (PK), catalytic core Length = 234 Score = 27.0 bits (58), Expect = 1.3 Identities = 10/94 (10%), Positives = 10/94 (10%), Gaps = 5/94 (5%) Query: 261 VVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFMVACMQFINIVVHSVEDMNFR 320 Sbjct: 11 REMFTAHVLPEWKASKVIDVIRRLQIPTDISSM---TADEIRLSELCISKRMEIIKSNDR 67 Query: 321 VHLQFEFTKLGLDDYL--EKSKHTESDKLSVQIQ 352 Sbjct: 68 YILMNPCESATIHSYATERLDEITENDVMSIYQK 101 >gi|2208905 [146..371] Cupredoxins Length = 226 Score = 26.9 bits (59), Expect = 1.8 Identities = 8/20 (40%), Positives = 8/20 (40%) Query: 529 PASEAPPPPPPPPPPPPPLP 548 Sbjct: 190 NGNAAPPPLQLPAGEPPVTP 209 >gi|631175 [2..294] S-adenosyl-L-methionine-dependent methyltransferases Length = 293 Score = 26.0 bits (56), Expect = 3.1 Identities = 15/122 (12%), Positives = 15/122 (12%), Gaps = 8/122 (6%) Query: 582 RIKKPIKTKFRLPVFNWTALKPNQINGTVFNEIDDERELELERFEELFKTRAQGPIMDLS 641 Sbjct: 8 RSLGVAAEGLPDQYAEGEAARVWQLYIGDTRSRTAE---YKAWLLGLLRQHGCQRVLDVA 64 Query: 642 C---TKSKVAQKAVNKVTILDANRSKNLAITLRKANKTFDLKTLPVDFVECLMRFLPTEM 698 Sbjct: 65 CGTGVDSIMLVEEGFSVTSVD--ASDKMLKYALKERWNRRHEPAFDKWVIEEANWMTLDK 122 Query: 699 EV 700 Sbjct: 123 DV 124 >gi|2780359 [156..339] Cupredoxins Length = 184 Score = 25.7 bits (56), Expect = 4.1 Identities = 10/32 (31%), Positives = 10/32 (31%), Gaps = 4/32 (12%) Query: 521 GGPDTSLPPA----SEAPPPPPPPPPPPPPLP 548 Sbjct: 153 DRPEVYRGYAVIRYSKASTTPHVPTIPPLTLP 184 >gi|2648240 [4..218] Aldehyde ferredoxin oxidoreductase, N-terminal domains Length = 215 Score = 25.7 bits (56), Expect = 4.2 Identities = 8/44 (18%), Positives = 8/44 (18%) Query: 40 NCGAILRAFYPNTVPRLIEPRTEAVKEFLNDENRGLDILVEYLS 83 Sbjct: 2 YMGKILKVNLSEGSIKELPIDEEIARKYLGGKGYAVRLLYDYLT 45 >gi|625241 [2..283] OB-fold Length = 282 Score = 25.3 bits (55), Expect = 4.3 Identities = 9/46 (19%), Positives = 9/46 (19%) Query: 501 ILQGEIGQLSLGSTVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPP 546 Sbjct: 229 LIAGKSSDSKGIDLTNVTLPDTPTYSKAASDAIPPASPKADAPIDK 274 >gi|2144185 [64..321] Flavodoxin-like Length = 258 Score = 25.5 bits (55), Expect = 4.4 Identities = 10/17 (58%), Positives = 10/17 (58%) Query: 527 LPPASEAPPPPPPPPPP 543 Sbjct: 50 MPPFSEDPDHPFPPVPP 66 >gi|2145546 [201..337] Metzincin-like Length = 137 Score = 25.4 bits (55), Expect = 4.5 Identities = 27/139 (19%), Positives = 27/139 (19%), Gaps = 19/139 (13%) Query: 678 DLKTLPVDFVECLMRFLPTEMEVKALRQYERERRPLDQLAEEDRFMLLFSKIERLTQRMN 737 Sbjct: 4 DLQTLPAGALRG--KIVPTPMQVKVHAQ----DADLRKGVALDLSTLVKPAADVVSQRFA 57 Query: 738 II---TFIGNFSDNVAMLTPQLNAIIAASASVKSSPKLKRMLEIILALGNYMNSSKRGCV 794 Sbjct: 58 LLGVPVQTNGYPIKTDIQPGKFKGAMAVSGAYELKIG-KKEAQVI-------GFDQAGVF 109 Query: 795 YGFKLQSLDLLLDTKSTDR 813 Sbjct: 110 YG--LQSILSLVPSDGSGK 126 >gi|2649774 [45..384] Periplasmic binding protein-like I Length = 340 Score = 25.4 bits (55), Expect = 4.6 Identities = 4/22 (18%), Positives = 4/22 (18%) Query: 172 MNYQVLYGFNMVMSHAHAVNEI 193 Sbjct: 310 VTWEHALEYQAMMVLAKAMDEA 331 >gi|1546735 [56..315] (Phosphotyrosine) protein phosphatases II Length = 260 Score = 25.0 bits (54), Expect = 6.1 Identities = 3/51 (5%), Positives = 3/51 (5%) Query: 278 KEKHRFERLMDYFRSEEGNIDFMVACMQFINIVVHSVEDMNFRVHLQFEFT 328 Sbjct: 94 REELKFGIYDITCKEFVKRNILEYRLLEVAMNIFSFQKPSETHQVHHYKFH 144 >gi|1827916 [1..200] N-terminal domain of DNA photolyase Length = 200 Score = 24.9 bits (54), Expect = 7.2 Identities = 7/29 (24%), Positives = 7/29 (24%) Query: 532 EAPPPPPPPPPPPPPLPSASSPNIPAPPP 560 Sbjct: 163 EGMPECVAAPKVRSSGSIEPSPSITLNYP 191 >gi|1666073 [6..176] P-loop containing nucleotide triphosphate hydrolases Length = 171 Score = 24.6 bits (52), Expect = 7.6 Identities = 10/142 (7%), Positives = 10/142 (7%), Gaps = 5/142 (3%) Query: 286 LMDYFRSEEGNIDFMVACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSKHTESD 345 Sbjct: 15 IIRQFLFGDYPERHRPTDGPRLYRPAVLLDGAVYDLSIRDGDVAGPGSSPGGPEEWPDAK 74 Query: 346 KLSVQIQAYLDNVFDVGGLLEDAETKNAALEKVEELEDHLSHVTEKLL----EVENETMM 401 Sbjct: 75 DWSLQDTDAFVLVYDICSPDSFDYVKA-LRQRIAETRPAGAPEAPILVVGNKRDRQRLRF 133 Query: 402 KVADLEKLLLHKDKELQVIRVS 423 Sbjct: 134 GPRRALAALVRRGWRCGYLECS 155 >gi|586529 [222..473] 7-bladed beta-propeller Length = 252 Score = 24.4 bits (52), Expect = 8.0 Identities = 7/60 (11%), Positives = 7/60 (11%), Gaps = 6/60 (10%) Query: 141 GSVSSSRTIKNSRLVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNNK 200 Sbjct: 181 QSIVSASWVGDDKVISCSMDGSVRLWSLKQNTLLALSIV------DGVPIFAGRISQDGQ 234 >gi|127428 [1..429] S-adenosyl-L-methionine-dependent methyltransferases Length = 429 Score = 24.5 bits (53), Expect = 8.2 Identities = 13/68 (19%), Positives = 13/68 (19%), Gaps = 5/68 (7%) Query: 794 VYGFKLQSLDLLLDTKSTDRKMTLLHYIALIVKEKYPELANFYNELHFVDKAAAVSLENV 853 Sbjct: 112 VKCPKCEAVSKAKDERG-----TLFFETALLAEEKKPKFVILENVKGLINSGNGQVLRII 166 Query: 854 LLDVRELG 861 Sbjct: 167 SETMNNIG 174 >gi|2815132 [909..1205] 7-bladed beta-propeller Length = 297 Score = 24.5 bits (52), Expect = 8.3 Identities = 12/61 (19%), Positives = 12/61 (19%), Gaps = 7/61 (11%) Query: 141 GSVSSSRTIKNSR-LVSQKDDVHVCIMCLRAIMNYQVLYGFNMVMSHAHAVNEIALSLNN 199 Sbjct: 101 GSVVSLQLTSNNQFLITGSGDFVVQMWDVTNGKCISRMGG------LMAPVSTLAITSND 154 Query: 200 K 200 Sbjct: 155 A 155 >gi|1943833 [45..314] Metallo-dependent phosphatases Length = 270 Score = 24.4 bits (53), Expect = 8.8 Identities = 11/42 (26%), Positives = 11/42 (26%), Gaps = 1/42 (2%) Query: 206 ALVLELLAAVCLVRGGHEI-ILSAFDNFKESSKHRNKTSTVH 246 Sbjct: 86 AYKILHPEKVVLLKGHHEESISIEMLKVKEWLMARGIERDVD 127 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 18 Number of calls to ALIGN: 18 Length of query: 1037 Total length of test sequences: 256703 Effective length of test sequences: 202633.0 Effective search space size: 200862731.8 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Fri Jan 19 10:43:18 2001 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 gi|3170548|gb|AAC34395.1| using self-comparison > 1 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 584 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 588 score 132 eval 4.515320e-10 identity 44.83% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV--GLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSA 550 gi|3170548|gb|AAC34395.1| || | | | :| ::| || | | : || ||||||||||| | |: 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPP----PPPPPPPPPPPLPSASS 552 gi|3170548|gb|AAC34395.1| 551 SSPNIPAPPPP--PPLAPPLPDASPSVILSVAIRIK 584 gi|3170548|gb|AAC34395.1| : | |||| ||| | || : : || 553 PNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIK 588 gi|3170548|gb|AAC34395.1| > 2 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 576 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 606 score 96 eval 2.509867e-06 identity 46.84% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV--GLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSA 550 gi|3170548|gb|AAC34395.1| || | | | :| ::| || | | : || ||||||||||| | |: 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPP----PPPPPPPPPPPLPSASS 552 gi|3170548|gb|AAC34395.1| 551 SSPNIPAPPPP--PPLAPPLPDA--------------------------SPSVI 576 gi|3170548|gb|AAC34395.1| : | |||| ||| | |: | 553 PNIPAPPPPPPLAPPLPDASPSVILSVAIRIKKPIKTKFRLPVFNWTALKPNQI 606 gi|3170548|gb|AAC34395.1| > 3 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 551 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 572 score 91 eval 8.313485e-06 identity 50.00% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV----GLTVPGGPDTSLPPASEAPPPPP---------- 538 gi|3170548|gb|AAC34395.1| || | | | :| ::| || ::| : || ||||| 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSASSPNIP 556 gi|3170548|gb|AAC34395.1| 539 -PPPPPP--PPLPSAS 551 gi|3170548|gb|AAC34395.1| |||||| |||| || 557 APPPPPPLAPPLPDAS 572 gi|3170548|gb|AAC34395.1| > 4 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 547 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 564 score 89 eval 1.342256e-05 identity 50.00% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV--GLTVPGGPDTSLPPASEAPPPPPPP---------- 540 gi|3170548|gb|AAC34395.1| || | | | :| ::| || | | : || ||||||| 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSASSPNIP 556 gi|3170548|gb|AAC34395.1| 541 -PPPPPPL 547 gi|3170548|gb|AAC34395.1| ||||||| 557 APPPPPPL 564 gi|3170548|gb|AAC34395.1| > 5 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 545 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 563 score 77 eval 2.377430e-04 identity 48.08% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV--GLTVPGGPDTSLPPASEAPPPPPP----------- 539 gi|3170548|gb|AAC34395.1| || | | | :| ::| || | | : || |||||| 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSASSPNIP 556 gi|3170548|gb|AAC34395.1| 540 -PPPPPP 545 gi|3170548|gb|AAC34395.1| |||||| 557 APPPPPP 563 gi|3170548|gb|AAC34395.1| > 6 gi|3170548|gb|AAC34395.1| len 1037 from 493 to 544 vs gi|3170548|gb|AAC34395.1| len 1037 from 498 to 563 score 71 eval 1.000236e-03 identity 45.10% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 493 GGSSNGGVILQGEIGQLSLGSTV----GLTVPGGPDTSLPPASEAPPPPP---------- 538 gi|3170548|gb|AAC34395.1| || | | | :| ::| || ::| : || ||||| 498 GGVILQGEIGQLSLGS-TVGLTVPGGPDTSLPPASEAPPPPPPPPPPPPPLPSASSPNIP 556 gi|3170548|gb|AAC34395.1| 539 -PPPPPP 544 gi|3170548|gb|AAC34395.1| |||||| 557 APPPPPP 563 gi|3170548|gb|AAC34395.1| > 7 gi|3170548|gb|AAC34395.1| len 1037 from 520 to 544 vs gi|3170548|gb|AAC34395.1| len 1037 from 540 to 573 score 57 eval 2.821590e-02 identity 64.00% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 520 PGGPDTSLPPASE----APPPPPPPPPP-----P 544 gi|3170548|gb|AAC34395.1| | | || || ||||||| || | 540 PPPPPPPLPSASSPNIPAPPPPPPLAPPLPDASP 573 gi|3170548|gb|AAC34395.1| > 8 gi|3170548|gb|AAC34395.1| len 1037 from 56 to 171 vs gi|3170548|gb|AAC34395.1| len 1037 from 182 to 307 score 57 eval 2.821590e-02 identity 24.14% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 56 LIEPRTEAVKEF---LNDENRGLDILVEYLSFAQCAVMFGFEGSENGEEDFLDKSKSWSR 112 gi|3170548|gb|AAC34395.1| :: || | ||::| || | | | | | | : ::| : || :: 182 MVMSHAHAVNEIALSLNNKNSRTKALVLELLAAVCLVRGGHEIILSAFDNFKESSKHRNK 241 gi|3170548|gb|AAC34395.1| 113 SI---EDLHRPTSQPFYNTLVRSARQSVLRYGSVSSSRTIKNSRLV----SQKDDVHVCI 165 gi|3170548|gb|AAC34395.1| : : ||: | :|:: | | : ||: |:: :: : 242 TSTVHQGKKRPSQTELINPVVKTRRYFCSPENEAISKEKHRFERLMDYFRSEEGNIDFMV 301 gi|3170548|gb|AAC34395.1| 166 MCLRAI 171 gi|3170548|gb|AAC34395.1| |:: | 302 ACMQFI 307 gi|3170548|gb|AAC34395.1| > 9 gi|3170548|gb|AAC34395.1| len 1037 from 60 to 108 vs gi|3170548|gb|AAC34395.1| len 1037 from 282 to 340 score 52 eval 9.044905e-02 identity 30.61% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 60 RTEAVKEFLNDENRGLDILVEYLSFAQCA----------VMFGFEGSENGEEDFLDKSK 108 gi|3170548|gb|AAC34395.1| | | : :: | :| :| : | | || :: | :|:|:||| 282 RFERLMDYFRSEEGNIDFMVACMQFINIVVHSVEDMNFRVHLQFEFTKLGLDDYLEKSK 340 gi|3170548|gb|AAC34395.1| > 10 gi|3170548|gb|AAC34395.1| len 1037 from 528 to 539 vs gi|3170548|gb|AAC34395.1| len 1037 from 535 to 546 score 52 eval 9.044905e-02 identity 66.67% K 2.262877e-02 L 2.395304e-01 H 1.111504e+00 alpha 9.764034e-02 528 PPASEAPPPPPP 539 gi|3170548|gb|AAC34395.1| || |||||| 535 PPPPPPPPPPPP 546 gi|3170548|gb|AAC34395.1|