analysis of sequence from tem22 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHGLQGCLEAQGGQVRVTPACNT SLPAQRWKWVSRNRLFNLGTMQCLGTGWPGTNTTASLGMYECDREALNLRWHCRTLGDQLSLLLGARTSN ISKPGTLERGDQTRSGQWRIYGSEEDLCALPYHEVYTIQGNSHGKPCTIPFKYDNQWFHGCTSTGREDGH LWCATTQDYGKDERWGFCPIKSNDCETFWDKDQLTDSCYQFNFQSTLSWREAWASCEQQGADLLSITEIH EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRD CSIALPYVCKKKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEKRSWQESKKACLRGGGDLVSI HSMAELEFITKQIKQEVEELWIGLNDLKLQMNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGR WNDSPCNQSLPSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQVTYSEARRLCTDHGSQLVTIT NRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDEVMYTHWNRDQPGYSRGGCVALATGSAMGLW EVKNCTSFRARYICRQSLGTPVTPELPGPDPTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQA QGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVGFSYH NFDRSRHDDDDIRGCAVLDLASLQWVAMQCDTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFF EHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRFRWTDGSIINF ISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYICKRSNVTKETQPPDLPTTALGGCPSDWIQFL NKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQAFITASLPNVTFDLWIGLHASQRDFQWVEQ EPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSAHFTGRWDDRSCTEETHGFICQKGTDPSLSPSPA ALPPAPGTELSYLNGTFRLLQKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGE EGSRRYSWVSEEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVSSGPPPPRRISYHGS CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHLQSYEGQSRGAWL GMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNSGLWRPGACTNITMGVVCKLPRAEQ SSFSPSALPENPAALVVVLMAVLLLLALLTAALILYRRRQSIERGAFEGARYSRSSSSPTEATEKNILVS DMEMNEQQE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > Tem22_gi|3327232|dbj|BAA31684.1| . . . . . 1 MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHG 50 ____________HHHHHHHHHHHHH_________________EEEEEE__ . . . . . 51 LQGCLEAQGGQVRVTPACNTSLPAQRWKWVSRNRLFNLGTMQCLGTGWPG 100 ___________EEEE______________HHHHHH____EEEE_______ . . . . . 101 TNTTASLGMYECDREALNLRWHCRTLGDQLSLLLGARTSNISKPGTLERG 150 _______EEEHHHHHHHHHHHHH____HHHHHHHH_______________ . . . . . 151 DQTRSGQWRIYGSEEDLCALPYHEVYTIQGNSHGKPCTIPFKYDNQWFHG 200 _______EEE______EEEEE_EEEEEE_________EEEE_________ . . . . . 201 CTSTGREDGHLWCATTQDYGKDERWGFCPIKSNDCETFWDKDQLTDSCYQ 250 __________EEEEE___________________________________ . . . . . 251 FNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGL 300 ___HHHHHHHHHHHHHH_____________EEE_EEE________EEE__ . . . . . 301 NDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRD 350 ___________________________________EEEEEE_________ . . . . . 351 CSIALPYVCKKKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEK 400 ___EEE_EEE_______________EEEEEE___________HHHHHHHH . . . . . 401 RSWQESKKACLRGGGDLVSIHSMAELEFITKQIKQEVEELWIGLNDLKLQ 450 HHHHHHHHHHH_____EEEEHHHHHHHHHHHHHHHHHHHHHH___HHHHH . . . . . 451 MNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCNQSL 500 HHH_______EEEE_________________EEEE_______________ . . . . . 501 PSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQVTYSEARRLCT 550 __HHHHHH________________________EEE_____HHHHHHHHHH . . . . . 551 DHGSQLVTITNRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDE 600 ___EEEEEEHHHHHHHHHHHHHHH___EEEEEE_________EEE_____ . . . . . 601 VMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSFRARYICRQSLGT 650 EEEEEE___________EEEEE_______EEEEE____EEEEE_______ . . . . . 651 PVTPELPGPDPTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQA 700 ____________________________EEEE______HHHHHHHHHHHH . . . . . 701 QGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNR 750 ___HHHHHHHHHHHHHHHHHHHHHHHHHHHH____HHHHHHHHHHHH___ . . . . . 751 RDPRGGQSWRWSDGVGFSYHNFDRSRHDDDDIRGCAVLDLASLQWVAMQC 800 _______EEEE______________________EEEHHHHHHHHHHHHH_ . . . . . 801 DTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFFEHHSTWAQAQ 850 __HHHHHH_________________HHHHHHHHHHHHHHH_____HHHHH . . . . . 851 RICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRF 900 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEE_______ . . . . . 901 RWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYIC 950 ______EEEEE_____________EEEEEEE_________EEEEEE_EEE . . . . . 951 KRSNVTKETQPPDLPTTALGGCPSDWIQFLNKCFQVQGQEPQSRVKWSEA 1000 _________________________HHHHHHHHHEEE_____HHHHHHHH . . . . . 1001 QFSCEQQEAQLVTITNPLEQAFITASLPNVTFDLWIGLHASQRDFQWVEQ 1050 HHHHHHHHHHHH_____HHHHHHHH____EEEE___________EEEE__ . . . . . 1051 EPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSAHFTGRWDDRSCTE 1100 _EEEEEE___________________EEEEEEE_________________ . . . . . 1101 ETHGFICQKGTDPSLSPSPAALPPAPGTELSYLNGTFRLLQKPLRWHDAL 1150 _____EEE____________________________EEE______HHHHH . . . . . 1151 LLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGSRRYSWVS 1200 HHHH____EEEE______HHHHHHHHHHH____EEEE_______EEEEEE . . . . . 1201 EEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVSSGPP 1250 ____EEEEE__________EEEEE____EEEEE_____EEEEEEE_____ . . . . . 1251 PPRRISYHGSCPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGG 1300 ____EEE_________________HHHHHHHHHHHHHHHHHHHHHHH___ . . . . . 1301 AVLSILDEMENVFVWEHLQSYEGQSRGAWLGMNFNPKGGTLVWQDNTAVN 1350 HHHHHHHHHHHHEEEHHHHHH__________________EEEEE___EEE . . . . . 1351 YSNWGPPGLGPSMLSHNSCYWIQSNSGLWRPGACTNITMGVVCKLPRAEQ 1400 EE_______________EEEEEE___________EEE_____________ . . . . . 1401 SSFSPSALPENPAALVVVLMAVLLLLALLTAALILYRRRQSIERGAFEGA 1450 _____________HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH___ . . 1451 RYSRSSSSPTEATEKNILVSDMEMNEQQE 1479 EEEE______HHHHHH__HHHHHHHH___ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 26.7 % beta-contents : 5.3 % coil-contents : 68.0 % class : alpha method : 2 alpha-contents : 15.9 % beta-contents : 14.9 % coil-contents : 69.2 % class : mixed ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa 4.98 0.00 0.00 0.00 0.00 0.00 -12.00 0.00 0.00 -3.20 -0.74 -12.00 -12.00 0.00 0.00 0.00 -34.96 6.22 0.00 0.00 0.00 0.00 0.00 -16.00 0.00 0.00 -3.20 -0.74 -12.00 0.00 0.00 0.00 0.00 -25.72 ID: Tem22_gi|3327232|dbj|BAA31684.1| AC: xxx Len: 1470 1:I 1457 Sc: -25.72 Pv: 5.889423e-02 NO_GPI_SITE GPI: learning from protozoa 0.97 0.00 0.00 0.00 0.00 0.00 -12.00 0.00 0.00 -2.91 -3.74 -12.00 -12.00 0.00 0.00 0.00 -41.68 2.13 -0.57 -0.83 -1.49 0.00 0.00 -8.00 0.00 0.00 -2.91 -3.74 -12.00 -12.00 0.00 0.00 0.00 -39.41 ID: Tem22_gi|3327232|dbj|BAA31684.1| AC: xxx Len: 1470 1:I 1455 Sc: -39.41 Pv: 8.414013e-02 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem22_gi|33 0.769 1433 Y 0.695 1433 Y 0.989 1423 Y 0.110 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem22_gi|33 0.663 850 Y 0.366 1433 Y 0.997 1425 Y 0.127 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem22_gi|33 0.612 850 Y 0.473 35 Y 0.995 1429 Y 0.589 Y ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] 1-1 M gpgrpapapwprhllrcvlllgclhlg 2-28 29-1058 RPGAPGDAALPEPNVFLIFSHGLQGCLEAQ GGQVRVTPACNTSLPAQRWKWVSRNRLFNL GTMQCLGTGWPGTNTTASLGMYECDREALN LRWHCRTLGDQLSLLLGARTSNISKPGTLE RGDQTRSGQWRIYGSEEDLCALPYHEVYTI QGNSHGKPCTIPFKYDNQWFHGCTSTGRED GHLWCATTQDYGKDERWGFCPIKSNDCETF WDKDQLTDSCYQFNFQSTLSWREAWASCEQ QGADLLSITEIHEQTYINGLLTGYSSTLWI GLNDLDTSGGWQWSDNSPLKYLNWESDQPD NPSEENCGVIRTESSGGWQNRDCSIALPYV CKKKPNATAEPTPPDRWANVKVECEPSWQP FQGHCYRLQAEKRSWQESKKACLRGGGDLV SIHSMAELEFITKQIKQEVEELWIGLNDLK LQMNFEWSDGSLVSFTHWHPFEPNNFRDSL EDCVTIWGPEGRWNDSPCNQSLPSICKKAG QLSQGAAEEDHGCRKGWTWHSPSCYWLGED QVTYSEARRLCTDHGSQLVTITNRFEQAFV SSLIYNWEGEYFWTALQDLNSTGSFFWLSG DEVMYTHWNRDQPGYSRGGCVALATGSAMG LWEVKNCTSFRARYICRQSLGTPVTPELPG PDPTPSLTGSCPQGWASDTKLRYCYKVFSS ERLQDKKSWVQAQGACQELGAQLLSLASYE EEHFVANMLNKIFGESEPEIHEQHWFWIGL NRRDPRGGQSWRWSDGVGFSYHNFDRSRHD DDDIRGCAVLDLASLQWVAMQCDTQLDWIC KIPRGTDVREPDDSPQGRREWLRFQEAEYK FFEHHSTWAQAQRICTWFQAELTSVHSQAE LDFLSHNLQKFSRAQEQHWWIGLHTSESDG RFRWTDGSIINFISWAPGKPRPVGKDKKCV YMTASREDWGDQRCLTALPYICKRSNVTKE TQPPDLPTTALGGCPSDWIQFLNKCFQVQG QEPQSRVKWSEAQFSCEQQEAQLVTITNPL EQAFITASLPNVTFDLWIGLHASQRDFQWV EQEPLMYANW apgepsgpspapsg 1059-1072 1073-1112 NKPTSCAVVLHSPSAHFTGRWDDRSCTEET HGFICQKGTD pslspspaalppap 1113-1126 1127-1412 GTELSYLNGTFRLLQKPLRWHDALLLCESH NASLAYVPDPYTQAFLTQAARGLRTPLWIG LAGEEGSRRYSWVSEEPLNYVGWQDGEPQQ PGGCTYVDVDGAWRTTSCDTKLQGAVCGVS SGPPPPRRISYHGSCPQGLADSAWIPFREH CYSFHMELLLGHKEARQRCQRAGGAVLSIL DEMENVFVWEHLQSYEGQSRGAWLGMNFNP KGGTLVWQDNTAVNYSNWGPPGLGPSMLSH NSCYWIQSNSGLWRPGACTNITMGVVCKLP RAEQSSFSPSALPENP aalvvvlmavllllalltaalil 1413-1435 1436-1479 YRRRQSIERGAFEGARYSRSSSSPTEATEK NILVSDMEMNEQQE low complexity regions: SEG 25 3.0 3.3 >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] 1-1 M gpgrpapapwprhllrcvlllgclhlgrpg 2-39 apgdaalp 40-646 EPNVFLIFSHGLQGCLEAQGGQVRVTPACN TSLPAQRWKWVSRNRLFNLGTMQCLGTGWP GTNTTASLGMYECDREALNLRWHCRTLGDQ LSLLLGARTSNISKPGTLERGDQTRSGQWR IYGSEEDLCALPYHEVYTIQGNSHGKPCTI PFKYDNQWFHGCTSTGREDGHLWCATTQDY GKDERWGFCPIKSNDCETFWDKDQLTDSCY QFNFQSTLSWREAWASCEQQGADLLSITEI HEQTYINGLLTGYSSTLWIGLNDLDTSGGW QWSDNSPLKYLNWESDQPDNPSEENCGVIR TESSGGWQNRDCSIALPYVCKKKPNATAEP TPPDRWANVKVECEPSWQPFQGHCYRLQAE KRSWQESKKACLRGGGDLVSIHSMAELEFI TKQIKQEVEELWIGLNDLKLQMNFEWSDGS LVSFTHWHPFEPNNFRDSLEDCVTIWGPEG RWNDSPCNQSLPSICKKAGQLSQGAAEEDH GCRKGWTWHSPSCYWLGEDQVTYSEARRLC TDHGSQLVTITNRFEQAFVSSLIYNWEGEY FWTALQDLNSTGSFFWLSGDEVMYTHWNRD QPGYSRGGCVALATGSAMGLWEVKNCTSFR ARYICRQ slgtpvtpelpgpdptpsltgscp 647-670 671-1394 QGWASDTKLRYCYKVFSSERLQDKKSWVQA QGACQELGAQLLSLASYEEEHFVANMLNKI FGESEPEIHEQHWFWIGLNRRDPRGGQSWR WSDGVGFSYHNFDRSRHDDDDIRGCAVLDL ASLQWVAMQCDTQLDWICKIPRGTDVREPD DSPQGRREWLRFQEAEYKFFEHHSTWAQAQ RICTWFQAELTSVHSQAELDFLSHNLQKFS RAQEQHWWIGLHTSESDGRFRWTDGSIINF ISWAPGKPRPVGKDKKCVYMTASREDWGDQ RCLTALPYICKRSNVTKETQPPDLPTTALG GCPSDWIQFLNKCFQVQGQEPQSRVKWSEA QFSCEQQEAQLVTITNPLEQAFITASLPNV TFDLWIGLHASQRDFQWVEQEPLMYANWAP GEPSGPSPAPSGNKPTSCAVVLHSPSAHFT GRWDDRSCTEETHGFICQKGTDPSLSPSPA ALPPAPGTELSYLNGTFRLLQKPLRWHDAL LLCESHNASLAYVPDPYTQAFLTQAARGLR TPLWIGLAGEEGSRRYSWVSEEPLNYVGWQ DGEPQQPGGCTYVDVDGAWRTTSCDTKLQG AVCGVSSGPPPPRRISYHGSCPQGLADSAW IPFREHCYSFHMELLLGHKEARQRCQRAGG AVLSILDEMENVFVWEHLQSYEGQSRGAWL GMNFNPKGGTLVWQDNTAVNYSNWGPPGLG PSMLSHNSCYWIQSNSGLWRPGACTNITMG VVCK lpraeqssfspsalpenpaalvvvlmavll 1395-1462 llalltaalilyrrrqsiergafegarysr ssssptea 1463-1479 TEKNILVSDMEMNEQQE low complexity regions: SEG 45 3.4 3.75 >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] 1-1 M gpgrpapapwprhllrcvlllgclhlgrpg 2-67 apgdaalpepnvflifshglqgcleaqggq vrvtpa 68-1394 CNTSLPAQRWKWVSRNRLFNLGTMQCLGTG WPGTNTTASLGMYECDREALNLRWHCRTLG DQLSLLLGARTSNISKPGTLERGDQTRSGQ WRIYGSEEDLCALPYHEVYTIQGNSHGKPC TIPFKYDNQWFHGCTSTGREDGHLWCATTQ DYGKDERWGFCPIKSNDCETFWDKDQLTDS CYQFNFQSTLSWREAWASCEQQGADLLSIT EIHEQTYINGLLTGYSSTLWIGLNDLDTSG GWQWSDNSPLKYLNWESDQPDNPSEENCGV IRTESSGGWQNRDCSIALPYVCKKKPNATA EPTPPDRWANVKVECEPSWQPFQGHCYRLQ AEKRSWQESKKACLRGGGDLVSIHSMAELE FITKQIKQEVEELWIGLNDLKLQMNFEWSD GSLVSFTHWHPFEPNNFRDSLEDCVTIWGP EGRWNDSPCNQSLPSICKKAGQLSQGAAEE DHGCRKGWTWHSPSCYWLGEDQVTYSEARR LCTDHGSQLVTITNRFEQAFVSSLIYNWEG EYFWTALQDLNSTGSFFWLSGDEVMYTHWN RDQPGYSRGGCVALATGSAMGLWEVKNCTS FRARYICRQSLGTPVTPELPGPDPTPSLTG SCPQGWASDTKLRYCYKVFSSERLQDKKSW VQAQGACQELGAQLLSLASYEEEHFVANML NKIFGESEPEIHEQHWFWIGLNRRDPRGGQ SWRWSDGVGFSYHNFDRSRHDDDDIRGCAV LDLASLQWVAMQCDTQLDWICKIPRGTDVR EPDDSPQGRREWLRFQEAEYKFFEHHSTWA QAQRICTWFQAELTSVHSQAELDFLSHNLQ KFSRAQEQHWWIGLHTSESDGRFRWTDGSI INFISWAPGKPRPVGKDKKCVYMTASREDW GDQRCLTALPYICKRSNVTKETQPPDLPTT ALGGCPSDWIQFLNKCFQVQGQEPQSRVKW SEAQFSCEQQEAQLVTITNPLEQAFITASL PNVTFDLWIGLHASQRDFQWVEQEPLMYAN WAPGEPSGPSPAPSGNKPTSCAVVLHSPSA HFTGRWDDRSCTEETHGFICQKGTDPSLSP SPAALPPAPGTELSYLNGTFRLLQKPLRWH DALLLCESHNASLAYVPDPYTQAFLTQAAR GLRTPLWIGLAGEEGSRRYSWVSEEPLNYV GWQDGEPQQPGGCTYVDVDGAWRTTSCDTK LQGAVCGVSSGPPPPRRISYHGSCPQGLAD SAWIPFREHCYSFHMELLLGHKEARQRCQR AGGAVLSILDEMENVFVWEHLQSYEGQSRG AWLGMNFNPKGGTLVWQDNTAVNYSNWGPP GLGPSMLSHNSCYWIQSNSGLWRPGACTNI TMGVVCK lpraeqssfspsalpenpaalvvvlmavll 1395-1462 llalltaalilyrrrqsiergafegarysr ssssptea 1463-1479 TEKNILVSDMEMNEQQE low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHGLQGCLEAQGG QVRVTPACNTSLPAQRWKWVSRNRLFNLGTMQCLGTGWPGTNTTASLGMYECDREALNLR WHCRTLGDQLSLLLGARTSNISKPGTLERGDQTRSGQWRIYGSEEDLCALPYHEVYTIQG NSHGKPCTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFCPIKSNDCETFWD KDQLTDSCYQFNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGL NDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRDCSIALPYVCK KKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEKRSWQESKKACLRGGGDLVSI HSMAELEFITKQIKQEVEELWIGLNDLKLQMNFEWSDGSLVSFTHWHPFEPNNFRDSLED CVTIWGPEGRWNDSPCNQSLPSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQV TYSEARRLCTDHGSQLVTITNRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDE VMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSFRARYICRQSLGTPVTPELPGPD PTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELGAQLLSLASYEEE HFVANMLNKIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVGFSYHNFDRSRHDDD DIRGCAVLDLASLQWVAMQCDTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFF EHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRF RWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYICKRSNVTKETQ PPDLPTTALGGCPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQ AFITASLPNVTFDLWIGLHASQRDFQWVEQEPLMYANWAPGEPSGPSPAPSGNKPTSCAV VLHSPSAHFTGRWDDRSCTEETHGFICQKGTDPSLSPSPAALPPAPGTELSYLNGTFRLL QKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGSRRYSWVS EEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVssgpppprrISYHGS CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHLQS YEGQSRGAWLGMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNSGLWR PGACTNITMGVVCKLPRAEQSSFSPSALPENpaalvvvlmavllllalltaalilyrrrq SIERGAFEGarysrsssspteateKNILVSDMEMNEQQE 1 - 1245 MGPGRPAPAP WPRHLLRCVL LLGCLHLGRP GAPGDAALPE PNVFLIFSHG LQGCLEAQGG QVRVTPACNT SLPAQRWKWV SRNRLFNLGT MQCLGTGWPG TNTTASLGMY ECDREALNLR WHCRTLGDQL SLLLGARTSN ISKPGTLERG DQTRSGQWRI YGSEEDLCAL PYHEVYTIQG NSHGKPCTIP FKYDNQWFHG CTSTGREDGH LWCATTQDYG KDERWGFCPI KSNDCETFWD KDQLTDSCYQ FNFQSTLSWR EAWASCEQQG ADLLSITEIH EQTYINGLLT GYSSTLWIGL NDLDTSGGWQ WSDNSPLKYL NWESDQPDNP SEENCGVIRT ESSGGWQNRD CSIALPYVCK KKPNATAEPT PPDRWANVKV ECEPSWQPFQ GHCYRLQAEK RSWQESKKAC LRGGGDLVSI HSMAELEFIT KQIKQEVEEL WIGLNDLKLQ MNFEWSDGSL VSFTHWHPFE PNNFRDSLED CVTIWGPEGR WNDSPCNQSL PSICKKAGQL SQGAAEEDHG CRKGWTWHSP SCYWLGEDQV TYSEARRLCT DHGSQLVTIT NRFEQAFVSS LIYNWEGEYF WTALQDLNST GSFFWLSGDE VMYTHWNRDQ PGYSRGGCVA LATGSAMGLW EVKNCTSFRA RYICRQSLGT PVTPELPGPD PTPSLTGSCP QGWASDTKLR YCYKVFSSER LQDKKSWVQA QGACQELGAQ LLSLASYEEE HFVANMLNKI FGESEPEIHE QHWFWIGLNR RDPRGGQSWR WSDGVGFSYH NFDRSRHDDD DIRGCAVLDL ASLQWVAMQC DTQLDWICKI PRGTDVREPD DSPQGRREWL RFQEAEYKFF EHHSTWAQAQ RICTWFQAEL TSVHSQAELD FLSHNLQKFS RAQEQHWWIG LHTSESDGRF RWTDGSIINF ISWAPGKPRP VGKDKKCVYM TASREDWGDQ RCLTALPYIC KRSNVTKETQ PPDLPTTALG GCPSDWIQFL NKCFQVQGQE PQSRVKWSEA QFSCEQQEAQ LVTITNPLEQ AFITASLPNV TFDLWIGLHA SQRDFQWVEQ EPLMYANWAP GEPSGPSPAP SGNKPTSCAV VLHSPSAHFT GRWDDRSCTE ETHGFICQKG TDPSLSPSPA ALPPAPGTEL SYLNGTFRLL QKPLRWHDAL LLCESHNASL AYVPDPYTQA FLTQAARGLR TPLWIGLAGE EGSRRYSWVS EEPLNYVGWQ DGEPQQPGGC TYVDVDGAWR TTSCDTKLQG AVCGV 1246 - 1254 ssgpp pprr 1255 - 1411 ISYHGS CPQGLADSAW IPFREHCYSF HMELLLGHKE ARQRCQRAGG AVLSILDEME NVFV WEHLQS YEGQSRGAWL GMNFNPKGGT LVWQDNTAVN YSNWGPPGLG PSMLSHNSCY WIQS NSGLWR PGACTNITMG VVCKLPRAEQ SSFSPSALPE N 1412 - 1440 paalvvvlm avllllallt aalilyrrrq 1441 - 1449 SIERGAFEG 1450 - 1464 a rysrsssspt eate 1465 - 1479 KNILVS DMEMNEQQE low complexity regions: DUST >Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHGLQGCLEAQGG QVRVTPACNTSLPAQRWKWVSRNRLFNLGTMQCLGTGWPGTNTTASLGMYECDREALNLR WHCRTLGDQLSLLLGARTSNISKPGTLERGDQTRSGQWRIYGSEEDLCALPYHEVYTIQG NSHGKPCTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFCPIKSNDCETFWD KDQLTDSCYQFNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGL NDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRDCSIALPYVCK KKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEKRSWQESKKACLRGGGDLVSI HSMAELEFITKQIKQEVEELWIGLNDLKLQMNFEWSDGSLVSFTHWHPFEPNNFRDSLED CVTIWGPEGRWNDSPCNQSLPSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQV TYSEARRLCTDHGSQLVTITNRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDE VMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSFRARYICRQSLGTPVTPELPGPD PTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELGAQLLSLASYEEE HFVANMLNKIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVGFSYHNFDRSRHDDD DIRGCAVLDLASLQWVAMQCDTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFF EHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRF RWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYICKRSNVTKETQ PPDLPTTALGGCPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQ AFITASLPNVTFDLWIGLHASQRDFQWVEQEPLMYANWAPGEPSGPSPAPSGNKPTSCAV VLHSPSAHFTGRWDDRSCTEETHGFICQKGTDPSLSPSPAALPPAPGTELSYLNGTFRLL QKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGSRRYSWVS EEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVSSGPPPPRRISYHGS CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHLQS YEGQSRGAWLGMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNSGLWR PGACTNITMGVVCKLPRAEQSSFSPSALPENPAALVVVLMAVLLLLALLTAALILYRRRQ SIERGAFEGARYSRSSSSPTEATEKNILVSDMEMNEQQE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for Tem22_gi|3327232|dbj|BAA31684.1| sequence: 1470 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MGPGRPAPAP WPRHLLRCVL LLGCLHLGRP GAPGDAALPE PNVFLIFSHG LQGCLEAQGG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 QVRVTPACNT SLPAQRWKWV SRNRLFNLGT MQCLGTGWPG TNTTASLGMY ECDREALNLR ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 WHCRTLGDQL SLLLGARTSN ISKPGTLERG DQTRSGQWRI YGSEEDLCAL PYHEVYTIQG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 NSHGKPCTIP FKYDNQWFHG CTSTGREDGH LWCATTQDYG KDERWGFCPI KSNDCETFWD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 KDQLTDSCYQ FNFQSTLSWR EAWASCEQQG ADLLSITEIH EQTYINGLLT GYSSTLWIGL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 NDLDTSGGWQ WSDNSPLKYL NWESDQPDNP SEENCGVIRT ESSGGWQNRD CSIALPYVCK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 KKPNATAEPT PPDRWANVKV ECEPSWQPFQ GHCYRLQAEK RSWQESKKAC LRGGGDLVSI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~1 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~555555 555555554~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 HSMAELEFIT KQIKQEVEEL WIGLNDLKLQ MNFEWSDGSL VSFTHWHPFE PNNFRDSLED ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ 5555555555 5555555555 5511~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar 1111111111 1111111111 1111111111 11~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~122222222 2222222222 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 CVTIWGPEGR WNDSPCNQSL PSICKKAGQL SQGAAEEDHG CRKGWTWHSP SCYWLGEDQV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 TYSEARRLCT DHGSQLVTIT NRFEQAFVSS LIYNWEGEYF WTALQDLNST GSFFWLSGDE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 VMYTHWNRDQ PGYSRGGCVA LATGSAMGLW EVKNCTSFRA RYICRQSLGT PVTPELPGPD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 PTPSLTGSCP QGWASDTKLR YCYKVFSSER LQDKKSWVQA QGACQELGAQ LLSLASYEEE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 HFVANMLNKI FGESEPEIHE QHWFWIGLNR RDPRGGQSWR WSDGVGFSYH NFDRSRHDDD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 DIRGCAVLDL ASLQWVAMQC DTQLDWICKI PRGTDVREPD DSPQGRREWL RFQEAEYKFF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 EHHSTWAQAQ RICTWFQAEL TSVHSQAELD FLSHNLQKFS RAQEQHWWIG LHTSESDGRF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~244444 4444444444 444444~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ----defgab cdefgabcde fgabcd---- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~11111 1111111111 111111~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~1355555 5555555552 2~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 RWTDGSIINF ISWAPGKPRP VGKDKKCVYM TASREDWGDQ RCLTALPYIC KRSNVTKETQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 PPDLPTTALG GCPSDWIQFL NKCFQVQGQE PQSRVKWSEA QFSCEQQEAQ LVTITNPLEQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 AFITASLPNV TFDLWIGLHA SQRDFQWVEQ EPLMYANWAP GEPSGPSPAP SGNKPTSCAV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1140 VLHSPSAHFT GRWDDRSCTE ETHGFICQKG TDPSLSPSPA ALPPAPGTEL SYLNGTFRLL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1200 QKPLRWHDAL LLCESHNASL AYVPDPYTQA FLTQAARGLR TPLWIGLAGE EGSRRYSWVS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1260 EEPLNYVGWQ DGEPQQPGGC TYVDVDGAWR TTSCDTKLQG AVCGVSSGPP PPRRISYHGS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1320 CPQGLADSAW IPFREHCYSF HMELLLGHKE ARQRCQRAGG AVLSILDEME NVFVWEHLQS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1380 YEGQSRGAWL GMNFNPKGGT LVWQDNTAVN YSNWGPPGLG PSMLSHNSCY WIQSNSGLWR ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1440 PGACTNITMG VVCKLPRAEQ SSFSPSALPE NPAALVVVLM AVLLLLALLT AALILYRRRQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | SIERGAFEGA RYSRSSSSPT EATEKNILVS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem22.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem22.___inter___ (1 sequences) MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHG LQGCLEAQGGQVRVTPACNTSLPAQRWKWVSRNRLFNLGTMQCLGTGWPG TNTTASLGMYECDREALNLRWHCRTLGDQLSLLLGARTSNISKPGTLERG DQTRSGQWRIYGSEEDLCALPYHEVYTIQGNSHGKPCTIPFKYDNQWFHG CTSTGREDGHLWCATTQDYGKDERWGFCPIKSNDCETFWDKDQLTDSCYQ FNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGL NDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRD CSIALPYVCKKKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEK RSWQESKKACLRGGGDLVSIHSMAELEFITKQIKQEVEELWIGLNDLKLQ MNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCNQSL PSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQVTYSEARRLCT DHGSQLVTITNRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDE VMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSFRARYICRQSLGT PVTPELPGPDPTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQA QGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNR RDPRGGQSWRWSDGVGFSYHNFDRSRHDDDDIRGCAVLDLASLQWVAMQC DTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFFEHHSTWAQAQ RICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRF RWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYIC KRSNVTKETQPPDLPTTALGGCPSDWIQFLNKCFQVQGQEPQSRVKWSEA QFSCEQQEAQLVTITNPLEQAFITASLPNVTFDLWIGLHASQRDFQWVEQ EPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSAHFTGRWDDRSCTE ETHGFICQKGTDPSLSPSPAALPPAPGTELSYLNGTFRLLQKPLRWHDAL LLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGSRRYSWVS EEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVSSGPP PPRRISYHGSCPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGG AVLSILDEMENVFVWEHLQSYEGQSRGAWLGMNFNPKGGTLVWQDNTAVN YSNWGPPGLGPSMLSHNSCYWIQSNSGLWRPGACTNITMGVVCKLPRAEQ SSFSPSALPENPAALVVVLMAVLLLLALLTAALILYRRRQSIERGAFEGA RYSRSSSSPTEATEKNILVSDMEMNEQQE (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 90 110 0.828 Putative 2 282 302 0.950 Putative 3 612 632 0.866 Putative 4 1020 1040 0.730 Putative 5 1413 1433 2.477 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 Loop length 89 171 309 387 372 46 K+R profile + + + + + 7.00 CYT-EXT prof 0.17 0.18 0.72 -0.84 -0.20 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.11 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 2.11 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 4 5 Loop length 281 309 387 372 46 K+R profile + + 7.00 + + CYT-EXT prof -0.54 -0.20 - 0.18 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.19 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -1.64 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 4 5 Loop length 89 501 387 372 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 -0.20 - -0.16 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.12 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: -0.59 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 4 5 Loop length 611 387 372 46 K+R profile + + + 7.00 CYT-EXT prof -0.12 0.72 -0.20 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.22 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -5.00 (NEG-POS)/(NEG+POS): 0.1654 NEG: 74.0000 POS: 53.0000 -> Orientation: N-out CYT-EXT difference: 0.81 -> Orientation: N-out ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 4 5 Loop length 89 171 717 372 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 0.01 - -0.84 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.16 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.30 -> Orientation: N-out ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 2 4 5 Loop length 281 717 372 46 K+R profile + + + 7.00 CYT-EXT prof -0.54 0.72 0.01 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.28 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: 0.18 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 4 5 Loop length 89 909 372 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.72 -0.16 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.19 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 1.05 -> Orientation: N-out ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 4 5 Loop length 1019 372 46 K+R profile + 7.00 + CYT-EXT prof -0.13 - 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.33 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.1312 NEG: 125.0000 POS: 96.0000 -> Orientation: N-out CYT-EXT difference: -0.85 -> Orientation: N-in ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 5 Loop length 89 171 309 780 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 0.18 - -0.84 0.27 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.33 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.92 -> Orientation: N-out ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 2 3 5 Loop length 281 309 780 46 K+R profile + + + 7.00 CYT-EXT prof -0.54 0.27 0.18 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.58 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -0.45 -> Orientation: N-in ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 3 5 Loop length 89 501 780 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.27 -0.16 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.38 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.60 -> Orientation: N-out ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 3 5 Loop length 611 780 46 K+R profile + 7.00 + CYT-EXT prof -0.12 - 0.27 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.66 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -5.00 (NEG-POS)/(NEG+POS): 0.1654 NEG: 74.0000 POS: 53.0000 -> Orientation: N-out CYT-EXT difference: -0.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 5 Loop length 89 171 1110 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.26 -0.84 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.50 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 1.27 -> Orientation: N-out ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 2 5 Loop length 281 1110 46 K+R profile + 7.00 + CYT-EXT prof -0.54 - 0.26 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.87 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -0.80 -> Orientation: N-in ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 5 Loop length 89 1302 46 K+R profile + 7.00 + CYT-EXT prof 0.17 - 0.10 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.57 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.07 -> Orientation: N-out ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment 5 Loop length 1412 46 K+R profile + 7.00 CYT-EXT prof 0.11 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -4.00 (NEG-POS)/(NEG+POS): 0.1348 NEG: 160.0000 POS: 122.0000 -> Orientation: N-out CYT-EXT difference: 0.11 -> Orientation: N-out ---------------------------------------------------------------------- "tem22" 1479 90 110 #f 0.828125 282 302 #f 0.95 612 632 #f 0.865625 1020 1040 #f 0.730208 1413 1433 #t 2.47708 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem22.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem22.___inter___ (1 sequences) MGPGRPAPAPWPRHLLRCVLLLGCLHLGRPGAPGDAALPEPNVFLIFSHG LQGCLEAQGGQVRVTPACNTSLPAQRWKWVSRNRLFNLGTMQCLGTGWPG TNTTASLGMYECDREALNLRWHCRTLGDQLSLLLGARTSNISKPGTLERG DQTRSGQWRIYGSEEDLCALPYHEVYTIQGNSHGKPCTIPFKYDNQWFHG CTSTGREDGHLWCATTQDYGKDERWGFCPIKSNDCETFWDKDQLTDSCYQ FNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGL NDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRD CSIALPYVCKKKPNATAEPTPPDRWANVKVECEPSWQPFQGHCYRLQAEK RSWQESKKACLRGGGDLVSIHSMAELEFITKQIKQEVEELWIGLNDLKLQ MNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCNQSL PSICKKAGQLSQGAAEEDHGCRKGWTWHSPSCYWLGEDQVTYSEARRLCT DHGSQLVTITNRFEQAFVSSLIYNWEGEYFWTALQDLNSTGSFFWLSGDE VMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSFRARYICRQSLGT PVTPELPGPDPTPSLTGSCPQGWASDTKLRYCYKVFSSERLQDKKSWVQA QGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNR RDPRGGQSWRWSDGVGFSYHNFDRSRHDDDDIRGCAVLDLASLQWVAMQC DTQLDWICKIPRGTDVREPDDSPQGRREWLRFQEAEYKFFEHHSTWAQAQ RICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSESDGRF RWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTALPYIC KRSNVTKETQPPDLPTTALGGCPSDWIQFLNKCFQVQGQEPQSRVKWSEA QFSCEQQEAQLVTITNPLEQAFITASLPNVTFDLWIGLHASQRDFQWVEQ EPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSAHFTGRWDDRSCTE ETHGFICQKGTDPSLSPSPAALPPAPGTELSYLNGTFRLLQKPLRWHDAL LLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGSRRYSWVS EEPLNYVGWQDGEPQQPGGCTYVDVDGAWRTTSCDTKLQGAVCGVSSGPP PPRRISYHGSCPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGG AVLSILDEMENVFVWEHLQSYEGQSRGAWLGMNFNPKGGTLVWQDNTAVN YSNWGPPGLGPSMLSHNSCYWIQSNSGLWRPGACTNITMGVVCKLPRAEQ SSFSPSALPENPAALVVVLMAVLLLLALLTAALILYRRRQSIERGAFEGA RYSRSSSSPTEATEKNILVSDMEMNEQQE (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 90 110 0.828 Putative 2 282 302 0.950 Putative 3 612 632 0.866 Putative 4 1020 1040 0.730 Putative 5 1413 1433 2.477 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 Loop length 89 171 309 387 372 46 K+R profile + + + + + 7.00 CYT-EXT prof 0.17 0.18 0.72 -0.84 -0.20 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.11 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 2.11 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 4 5 Loop length 281 309 387 372 46 K+R profile + + 7.00 + + CYT-EXT prof -0.54 -0.20 - 0.18 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.19 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -1.64 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 4 5 Loop length 89 501 387 372 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 -0.20 - -0.16 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.12 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: -0.59 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 4 5 Loop length 611 387 372 46 K+R profile + + + 7.00 CYT-EXT prof -0.12 0.72 -0.20 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.22 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -5.00 (NEG-POS)/(NEG+POS): 0.1654 NEG: 74.0000 POS: 53.0000 -> Orientation: N-out CYT-EXT difference: 0.81 -> Orientation: N-out ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 4 5 Loop length 89 171 717 372 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 0.01 - -0.84 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.16 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.30 -> Orientation: N-out ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 2 4 5 Loop length 281 717 372 46 K+R profile + + + 7.00 CYT-EXT prof -0.54 0.72 0.01 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.28 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: 0.18 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 4 5 Loop length 89 909 372 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.72 -0.16 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.19 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 1.05 -> Orientation: N-out ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 4 5 Loop length 1019 372 46 K+R profile + 7.00 + CYT-EXT prof -0.13 - 0.72 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.33 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.1312 NEG: 125.0000 POS: 96.0000 -> Orientation: N-out CYT-EXT difference: -0.85 -> Orientation: N-in ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 5 Loop length 89 171 309 780 46 K+R profile + + 7.00 + + CYT-EXT prof 0.17 0.18 - -0.84 0.27 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.33 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.92 -> Orientation: N-out ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 2 3 5 Loop length 281 309 780 46 K+R profile + + + 7.00 CYT-EXT prof -0.54 0.27 0.18 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.58 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -0.45 -> Orientation: N-in ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 3 5 Loop length 89 501 780 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.27 -0.16 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.38 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.60 -> Orientation: N-out ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 3 5 Loop length 611 780 46 K+R profile + 7.00 + CYT-EXT prof -0.12 - 0.27 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.66 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -5.00 (NEG-POS)/(NEG+POS): 0.1654 NEG: 74.0000 POS: 53.0000 -> Orientation: N-out CYT-EXT difference: -0.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 5 Loop length 89 171 1110 46 K+R profile + + + 7.00 CYT-EXT prof 0.17 0.26 -0.84 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.50 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 1.27 -> Orientation: N-out ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 2 5 Loop length 281 1110 46 K+R profile + 7.00 + CYT-EXT prof -0.54 - 0.26 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.87 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.0741 NEG: 29.0000 POS: 25.0000 -> Orientation: N-out CYT-EXT difference: -0.80 -> Orientation: N-in ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 5 Loop length 89 1302 46 K+R profile + 7.00 + CYT-EXT prof 0.17 - 0.10 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.57 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.5000 NEG: 3.0000 POS: 9.0000 -> Orientation: N-in CYT-EXT difference: 0.07 -> Orientation: N-out ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment 5 Loop length 1412 46 K+R profile + 7.00 CYT-EXT prof 0.11 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -4.00 (NEG-POS)/(NEG+POS): 0.1348 NEG: 160.0000 POS: 122.0000 -> Orientation: N-out CYT-EXT difference: 0.11 -> Orientation: N-out ---------------------------------------------------------------------- "tem22" 1479 90 110 #f 0.828125 282 302 #f 0.95 612 632 #f 0.865625 1020 1040 #f 0.730208 1413 1433 #t 2.47708 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem22.___saps___ SAPS. Version of April 11, 1996. Date run: Tue Oct 31 14:55:04 2000 File: /people/maria/tem22.___saps___ ID Tem22_gi|3327232|dbj|BAA31684.1| DE KIAA0709 protein [Homo sapiens] number of residues: 1479; molecular weight: 166.7 kdal 1 MGPGRPAPAP WPRHLLRCVL LLGCLHLGRP GAPGDAALPE PNVFLIFSHG LQGCLEAQGG 61 QVRVTPACNT SLPAQRWKWV SRNRLFNLGT MQCLGTGWPG TNTTASLGMY ECDREALNLR 121 WHCRTLGDQL SLLLGARTSN ISKPGTLERG DQTRSGQWRI YGSEEDLCAL PYHEVYTIQG 181 NSHGKPCTIP FKYDNQWFHG CTSTGREDGH LWCATTQDYG KDERWGFCPI KSNDCETFWD 241 KDQLTDSCYQ FNFQSTLSWR EAWASCEQQG ADLLSITEIH EQTYINGLLT GYSSTLWIGL 301 NDLDTSGGWQ WSDNSPLKYL NWESDQPDNP SEENCGVIRT ESSGGWQNRD CSIALPYVCK 361 KKPNATAEPT PPDRWANVKV ECEPSWQPFQ GHCYRLQAEK RSWQESKKAC LRGGGDLVSI 421 HSMAELEFIT KQIKQEVEEL WIGLNDLKLQ MNFEWSDGSL VSFTHWHPFE PNNFRDSLED 481 CVTIWGPEGR WNDSPCNQSL PSICKKAGQL SQGAAEEDHG CRKGWTWHSP SCYWLGEDQV 541 TYSEARRLCT DHGSQLVTIT NRFEQAFVSS LIYNWEGEYF WTALQDLNST GSFFWLSGDE 601 VMYTHWNRDQ PGYSRGGCVA LATGSAMGLW EVKNCTSFRA RYICRQSLGT PVTPELPGPD 661 PTPSLTGSCP QGWASDTKLR YCYKVFSSER LQDKKSWVQA QGACQELGAQ LLSLASYEEE 721 HFVANMLNKI FGESEPEIHE QHWFWIGLNR RDPRGGQSWR WSDGVGFSYH NFDRSRHDDD 781 DIRGCAVLDL ASLQWVAMQC DTQLDWICKI PRGTDVREPD DSPQGRREWL RFQEAEYKFF 841 EHHSTWAQAQ RICTWFQAEL TSVHSQAELD FLSHNLQKFS RAQEQHWWIG LHTSESDGRF 901 RWTDGSIINF ISWAPGKPRP VGKDKKCVYM TASREDWGDQ RCLTALPYIC KRSNVTKETQ 961 PPDLPTTALG GCPSDWIQFL NKCFQVQGQE PQSRVKWSEA QFSCEQQEAQ LVTITNPLEQ 1021 AFITASLPNV TFDLWIGLHA SQRDFQWVEQ EPLMYANWAP GEPSGPSPAP SGNKPTSCAV 1081 VLHSPSAHFT GRWDDRSCTE ETHGFICQKG TDPSLSPSPA ALPPAPGTEL SYLNGTFRLL 1141 QKPLRWHDAL LLCESHNASL AYVPDPYTQA FLTQAARGLR TPLWIGLAGE EGSRRYSWVS 1201 EEPLNYVGWQ DGEPQQPGGC TYVDVDGAWR TTSCDTKLQG AVCGVSSGPP PPRRISYHGS 1261 CPQGLADSAW IPFREHCYSF HMELLLGHKE ARQRCQRAGG AVLSILDEME NVFVWEHLQS 1321 YEGQSRGAWL GMNFNPKGGT LVWQDNTAVN YSNWGPPGLG PSMLSHNSCY WIQSNSGLWR 1381 PGACTNITMG VVCKLPRAEQ SSFSPSALPE NPAALVVVLM AVLLLLALLT AALILYRRRQ 1441 SIERGAFEGA RYSRSSSSPT EATEKNILVS DMEMNEQQE -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 93( 6.3%); C : 56( 3.8%); D : 72( 4.9%); E : 96( 6.5%); F : 51( 3.4%) G :122( 8.2%); H : 40( 2.7%); I : 47( 3.2%); K : 46( 3.1%); L :135( 9.1%) M : 19( 1.3%); N : 58( 3.9%); P : 94( 6.4%); Q : 90( 6.1%); R : 83( 5.6%) S :128( 8.7%); T : 84( 5.7%); V : 58( 3.9%); W++: 67( 4.5%); Y : 40( 2.7%) KR : 129 ( 8.7%); ED : 168 ( 11.4%); AGP : 309 ( 20.9%); KRED : 297 ( 20.1%); KR-ED : -39 ( -2.6%); FIKMNY : 261 ( 17.6%); LVIFM : 310 ( 21.0%); ST : 212 ( 14.3%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0000+00000 00+000+000 00000000+0 0000-0000- 0000000000 00000-0000 61 00+0000000 00000+0+00 0+0+000000 0000000000 0000000000 -0-+-0000+ 121 000+000-00 000000+000 00+0000-+0 -00+0000+0 000---0000 000-000000 181 0000+00000 0+0-000000 00000+--00 0000000-00 +--+000000 +00-0-000- 241 +-000-0000 000000000+ -00000-000 0-00000-00 -000000000 0000000000 301 0-0-000000 00-0000+00 00-0-00-00 0--00000+0 -0000000+- 000000000+ 361 ++00000-00 00-+0000+0 -0-0000000 0000+000-+ +000-0++00 0+000-0000 421 0000-0-000 +00+0-0--0 00000-0+00 000-00-000 000000000- 0000+-00-- 481 0000000-0+ 00-0000000 0000++0000 00000---00 0++0000000 000000--00 541 000-0++000 -000000000 0+0-000000 00000-0-00 00000-0000 00000000-- 601 0000000+-0 0000+00000 0000000000 -0+00000+0 +000+00000 0000-0000- 661 0000000000 00000-0+0+ 000+0000-+ 00-++00000 00000-0000 0000000--- 721 00000000+0 00-0-0-00- 000000000+ +-0+00000+ 00-0000000 00-+0+0--- 781 -0+00000-0 0000000000 -000-000+0 0+00-0+-0- -0000++-00 +00-0-0+00 841 -000000000 +0000000-0 0000000-0- 0000000+00 +00-000000 0000-0-0+0 901 +00-000000 000000+0+0 00+-++0000 000+--00-0 +000000000 ++0000+-00 961 00-0000000 0000-00000 0+0000000- 000+0+00-0 0000-00-00 00000000-0 1021 0000000000 00-0000000 00+-0000-0 -000000000 0-00000000 000+000000 1081 0000000000 0+0--+000- -0000000+0 0-00000000 00000000-0 0000000+00 1141 0+00+00-00 000-000000 0000-00000 000000+00+ 000000000- -00++00000 1201 --00000000 -0-0000000 000-0-000+ 0000-0+000 0000000000 00++000000 1261 000000-000 000+-00000 00-00000+- 0+0+00+000 000000--0- 00000-0000 1321 0-000+0000 000000+000 0000-00000 0000000000 0000000000 000000000+ 1381 0000000000 000+00+0-0 000000000- 0000000000 0000000000 000000+++0 1441 00-+000-00 +00+000000 -00-+00000 -0-00-00- A. CHARGE CLUSTERS. Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): none Negative charge clusters (cmin = 11/30 or 14/45 or 17/60): none Mixed charge clusters (cmin = 15/30 or 21/45 or 26/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. ________________________________ High scoring uncharged segments: score= 1.00 frequency= 0.799 ( LAGSVTIPNFQYHMCW ) score= 0.00 frequency= 0.000 ( BZX ) score= -8.00 frequency= 0.201 ( KEDR ) Expected score/letter: -0.807 M_0.01= 54.86; M_0.05= 45.40 1) From 1327 to 1436: length=110, score=47.00 * (pocket at 1394 to 1399: length= 6, score=-21.00) 1327 GAWLGMNFNP KGGTLVWQDN TAVNYSNWGP PGLGPSMLSH NSCYWIQSNS 1377 GLWRPGACTN ITMGVVC |KLP RAE| QSSFSPS ALPENPAALV VVLMAVLLLL 1427 ALLTAALILY L: 17(15.5%); A: 11(10.0%); C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 46 | 10 | 11 | 14 | 12 | 13 | 16 | 8 | 10 | lmin1 6 | 7 | 9 | 55 | 12 | 13 | 16 | 14 | 16 | 20 | 9 | 12 | lmin2 7 | 8 | 10 | 62 | 13 | 14 | 18 | 16 | 18 | 22 | 10 | 13 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 3: 2, at 360; 1437; - runs >= 4: 1, at 778; * runs >= 5: 0 0 runs >= 31: 1, at 1346; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. __________________________________ High scoring hydrophobic segments: 2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ) -8.00 (KEDR) Expected score/letter: -2.087 M_0.01= 23.85; M_0.05= 20.24 1) From 1413 to 1436: length= 24, score=35.00 ** 1413 AALVVVLMAV LLLLALLTAA LILY L: 10(41.7%); A: 6(25.0%); V: 4(16.7%); ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -4.022 M_0.01= 52.69; M_0.05= 44.49; M_0.30= 34.74 1) From 1413 to 1435: length= 23, score=87.00 ** 1413 AALVVVLMAV LLLLALLTAA LIL L: 10(43.5%); A: 6(26.1%); V: 4(17.4%); 2. SPACINGS OF C. H2N-17-C-5-C-29-C-13-C-24-C-18-C-10-C-44-C-18-C-13-C-11-C-14-C-6-C-12-C-17-C-68-C-15-C-7-C-22-C-10-C-16-C-70-C-14-C-7-C-16-C-10-C-16-C-68-C-16-C-8-C-24-C-12-C-21-C-80-C-14-C-7-C-44-C-73-C-14-C-7-C-21-C-10-C-20-C-73-C-19-C-8-C-45-C-66-C-13-C-8-C-17-C-15-C-17-C-73-C-14-C-8-C-86-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-13-H-3-C-5-C-1-H-22-H-4-C-13-C-24-C-18-C-9-H-C-44-C-4-H-9-H-3-C-11-H-1-C-8-H-2-C-14-C-6-C-12-C-17-C-13-H-54-C-15-C-7-C-22-C-9-H-C-16-C-10-H-43-H-1-H-13-C-14-C-7-C-14-H-1-C-6-H-3-C-16-C-2-H-52-H-12-C-16-C-8-C-24-C-12-C-21-C-16-H-17-H-2-H-27-H-6-H-7-C-14-C-7-C-33-H-H-9-C-10-H-9-H-11-H-5-H-34-C-14-C-7-C-21-C-10-C-20-C-34-H-38-C-4-H-4-H- 9-C-4-H-3-C-39-H-5-C-2-H-63-C-13-C-8-C-14-H-2-C-14-H-C-3-H-6-H-6-C-21-H-48-H-2-C-14-C-8-C-86-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 Aligned matching blocks: [ 261- 269] EAWASCEQQ [ 999-1007] EAQFSCEQQ ______________________________ [ 278- 282] EIHEQ [ 737- 741] EIHEQ ______________________________ [ 296- 303]-( -7)-[ 297- 301] [ 440- 447]-( -7)-[ 441- 445] [ ]--------[ 745- 749] [ 296- 303] LWIGLNDL [ 440- 447] LWIGLNDL [ 297- 301] WIGLN [ 441- 445] WIGLN [ 745- 749] WIGLN ______________________________ [ 306- 310] SGGWQ [ 343- 347] SGGWQ ______________________________ [ 555- 567] QLVTITNRFEQAF [1010-1022] QLVTITNPLEQAF ______________________________ [ 667- 672] GSCPQG [1259-1264] GSCPQG ______________________________ [ 888- 892] WIGLH [1035-1039] WIGLH ______________________________ [1034-1038] LWIGL [1183-1187] LWIGL B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 78 (Expected range: 54--108) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 23 (6-10) 8 (11-20) 21 (>=21) 27 3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 15/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 32 (Expected range: 11-- 43) 14 +plets (f+: 8.7%), 18 -plets (f-: 11.4%) Total number of charge altplets: 27 (Critical number: 47) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 4 (6-10) 5 (11-20) 3 (>=21) 21 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 6- 13 2 P. 4 4 0 657- 664 2 P. 4 4 0 778- 781 1 D 4 4 0 1130-1169 10 L......... 4 4 0 1249-1252 1 P 4 4 0 1423-1429 1 L 6 4 1 1455-1458 1 S 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 433- 486 9 i.....*.0 6 6 /0/./././././2/./0/ 1415-1429 1 i 13 6 2 -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 18- 24 (1.) C( 6)C 57 of 57 0.0000 large minimal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem22 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- lectin_c Lectin C-type domain 557.3 9.9e-164 8 fn2 Fibronectin type II domain 75.1 1.8e-27 1 Ricin_B_lectin Similarity to lectin domain of ricin 31.4 2.1e-05 1 Involucrin Involucrin repeat 6.7 60 2 Gram_pos_anchor Gram positive anchor 4.3 12 1 granulin Granulins 2.6 21 1 Xlink Extracellular link domain 1.9 3 1 oxidored_q4 NADH-ubiquinone/plastoquinone oxidore 1.1 46 1 Fork_head Fork head domain 0.9 89 1 THF_DHG_CYH Tetrahydrofolate dehydrogenase/cycloh 0.1 47 1 biopterin_H Biopterin-dependent aromatic amino ac -1.0 41 1 neur Neuraminidases -1.2 39 1 hemocyanin hemocyanin family -1.4 58 1 IBR IBR domain -28.9 87 1 MIase Muconolactone delta-isomerase -44.7 38 1 DB DB module -49.9 78 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Involucrin 1/2 55 64 .. 1 10 [] 2.5 7.2e+02 Ricin_B_lectin 1/1 41 161 .. 1 145 [] 31.4 2.1e-05 fn2 1/1 187 228 .. 1 42 [] 75.1 1.8e-27 lectin_c 1/8 264 359 .. 1 111 [] 107.4 2.8e-28 THF_DHG_CYH 1/1 429 440 .. 1 12 [. 0.1 47 lectin_c 2/8 408 504 .. 1 111 [] 128.0 1.7e-34 IBR 1/1 489 532 .. 1 72 [] -28.9 87 Xlink 1/1 539 556 .. 14 31 .. 1.9 3 lectin_c 3/8 547 644 .. 1 111 [] 85.6 1e-21 granulin 1/1 669 684 .. 28 43 .] 2.6 21 hemocyanin 1/1 760 783 .. 784 807 .] -1.4 58 lectin_c 4/8 702 808 .. 1 111 [] 46.3 6.7e-10 biopterin_H 1/1 866 882 .. 320 336 .] -1.0 41 Fork_head 1/1 863 885 .. 78 101 .] 0.9 89 MIase 1/1 815 896 .. 1 94 [] -44.7 38 neur 1/1 902 912 .. 477 487 .] -1.2 39 lectin_c 5/8 851 950 .. 1 111 [] 78.6 1.2e-19 DB 1/1 917 985 .. 1 124 [] -49.9 78 Involucrin 2/2 1004 1013 .. 1 10 [] 5.1 1.6e+02 lectin_c 6/8 1002 1107 .. 1 111 [] 78.7 1.2e-19 lectin_c 7/8 1151 1243 .. 1 111 [] 43.7 4e-09 lectin_c 8/8 1293 1393 .. 1 111 [] 29.0 6.1e-05 Gram_pos_anchor 1/1 1400 1439 .. 1 41 [] 4.3 12 oxidored_q4 1/1 1415 1444 .. 53 82 .] 1.1 46 Alignments of top-scoring domains: Involucrin: domain 1 of 2, from 55 to 64: score 2.5, E = 7.2e+02 *->leqQegQlkh<-* le Q gQ+ + Tem22_gi|3 55 LEAQGGQVRV 64 Ricin_B_lectin: domain 1 of 1, from 41 to 161: score 31.4, E = 2.1e-05 *->rsvfvitilvngnsgrClDvnsssesdGnqvqlwnChsnpgknQkWs ++vf+i + Cl ++++ + +C+ +Q W+ Tem22_gi|3 41 PNVFLIF---SHGLQGCLEAQGGQV-----RVTPACNT-SLPAQRWK 78 ltYdesdgeirsvvnndkCLtv..nanspgseVklyqCdsatsdnQkWel ++ s ++++ ++ +CL+++ +++++ ++ +y+Cd +n++W++ Tem22_gi|3 79 WV---SRNRLFN-LGTMQCLGTgwPGTNTTASLGMYECDR-EALNLRWHC 123 nndglignkIllnlvntglvLdvkgsdtqngTklilytcsggrnqqWlpt + g+ ++ l+L +++s+++ +l + r+ qW ++ Tem22_gi|3 124 RTLGDQLS----------LLLGARTSNISKP--GTLERGDQTRSGQWRIY 161 <-* Tem22_gi|3 - - fn2: domain 1 of 1, from 187 to 228: score 75.1, E = 1.8e-27 *->CvFPFiynGksYesCTseGRsdgrlWCsTTaNYDrDkkwGFC<-* C PF+y +++++ CTs+GR dg+lWC+TT++Y +D +wGFC Tem22_gi|3 187 CTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFC 228 lectin_c: domain 1 of 8, from 264 to 359: score 107.4, E = 2.8e-28 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep + C+++ g++L+sI Eq +++ l++ + + +WIGL+d + Tem22_gi|3 264 ASCEQQ--GADLLSITEIHEQTYINGLLTGYSsTLWIGLNDLD---- 304 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang t+g+W+Wsd+s p+ y nW +++P+n + e+C++++t + +g Tem22_gi|3 305 TSGGWQWSDNS-PLKYLNWE--SDQPDN----PSEENCGVIRTES---SG 344 kWnDepCdsklpyvC<-* W +++C+ lpyvC Tem22_gi|3 345 GWQNRDCSIALPYVC 359 THF_DHG_CYH: domain 1 of 1, from 429 to 440: score 0.1, E = 47 *->ireelaeevaal<-* i +++++ev++l Tem22_gi|3 429 ITKQIKQEVEEL 440 lectin_c: domain 2 of 8, from 408 to 504: score 128.0, E = 1.7e-34 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep +aC r gg+LvsI+s +E +F+++ +k++ + +WIGL+d + Tem22_gi|3 408 KACLRG--GGDLVSIHSMAELEFITKQIKQEVeELWIGLNDLK---- 448 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang +++++WsdGs v++t+W p ePnn r + edCv +++ p g Tem22_gi|3 449 LQMNFEWSDGS-LVSFTHWH--PFEPNNFR--DSLEDCVTIWG--P--EG 489 kWnDepCdsklpyvC<-* +WnD pC+++lp +C Tem22_gi|3 490 RWNDSPCNQSLPSIC 504 IBR: domain 1 of 1, from 489 to 532: score -28.9, E = 87 *->eKYekfmvrsyveknpdlkwCPgpdCsyavrltevssstelaepprV ++ ++ ++ +++ + C++a +l+ a + Tem22_gi|3 489 GRWNDSPCNQSLPS---------I-CKKAGQLSQ------GAAEEDH 519 eCkkPaCgtsFCfkCgaeWHapvsC<-* C+k g +WH+p sC Tem22_gi|3 520 GCRK-----------GWTWHSP-SC 532 Xlink: domain 1 of 1, from 539 to 556: score 1.9, E = 3 *->kltFeEAqaaClrqgAri<-* ++t+ EA+++C + g+++ Tem22_gi|3 539 QVTYSEARRLCTDHGSQL 556 lectin_c: domain 3 of 8, from 547 to 644: score 85.6, E = 1e-21 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn..yaWIGLtdinkge C+ + g++Lv+I ++ Eq+F++sl+ + ++y+W+ L d n Tem22_gi|3 547 RLCTDH--GSQLVTITNRFEQAFVSSLIYNWEgeYFWTALQDLN--- 588 ptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkan + g+++W G+ +v yt+W+ +++P ++ Cv+l t+++ Tem22_gi|3 589 -STGSFFWLSGD-EVMYTHWN--RDQPGY-----SRGGCVALATGSA--M 627 gkWnDepCdskl.pyvC<-* g+W ++C s + +y+C Tem22_gi|3 628 GLWEVKNCTSFRaRYIC 644 granulin: domain 1 of 1, from 669 to 684: score 2.6, E = 21 *->CPqGfhCdadggrClq<-* CPqG d++ ++C + Tem22_gi|3 669 CPQGWASDTKLRYCYK 684 hemocyanin: domain 1 of 1, from 760 to 783: score -1.4, E = 58 *->kdvdtsdkpykkddltyhdsdhlv<-* ++ d + ++y + d+++hd d+ Tem22_gi|3 760 RWSDGVGFSYHNFDRSRHDDDDIR 783 lectin_c: domain 4 of 8, from 702 to 808: score 46.3, E = 6.7e-10 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn...........yaWI aCq++ g++L+s s+eE F+++++++ ++++++ ++++ +WI Tem22_gi|3 702 GACQEL--GAQLLSLASYEEEHFVANMLNKIFgesepeiheqhWFWI 746 GLtdinkgeptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCve GL++++ + +W WsdG +y n +++ + ++ r C++ Tem22_gi|3 747 GLNRRD--PRGGQSWRWSDGV-GFSYHNFDRSRHDDDDIR------GCAV 787 lytktpkangkWnDepCdsklpyvC<-* l + +W + Cd++l ++C Tem22_gi|3 788 LDLA----SLQWVAMQCDTQLDWIC 808 biopterin_H: domain 1 of 1, from 866 to 882: score -1.0, E = 41 *->nsElqiLsnAlakisrk<-* + El+ Ls+ l+k sr Tem22_gi|3 866 QAELDFLSHNLQKFSRA 882 Fork_head: domain 1 of 1, from 863 to 885: score 0.9, E = 89 *->ldPeaedmFenkkkkgsflkrrrr<-* ++ +ae +F+ + ++f++++ + Tem22_gi|3 863 VHSQAELDFLS-HNLQKFSRAQEQ 885 MIase: domain 1 of 1, from 815 to 896: score -44.7, E = 38 *->MLFhVrMtVniPddmpadraerikarEkAySqeLQrqGkWrhLWRPP +V+ Pdd p r+e ++ +E++y + +W R Tem22_gi|3 815 -------DVREPDDSPQGRREWLRFQEAEY-KFFEHHSTWAQAQR-- 851 LiaGeYaNiSvFDVeSndeLheiLssLPLFPYMkieVtpLcrHPSsv<-* i + + V S +eL+ + +L F+ + + H S Tem22_gi|3 852 -IC-TWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHWWIGLHTSES 896 neur: domain 1 of 1, from 902 to 912: score -1.2, E = 39 *->WpDGAdlpffi<-* W DG +++f + Tem22_gi|3 902 WTDGSIINFIS 912 lectin_c: domain 5 of 8, from 851 to 950: score 78.6, E = 1.2e-19 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.....yaWIGLtdin C+ ++ L+s++sq+E dFl+ + ++ + +++ WIGL+ + Tem22_gi|3 851 RICTWF--QAELTSVHSQAELDFLSHNLQKFSraqeqHWWIGLHTSE 895 kgeptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktp ++g + W+dGs n+ +Wa pg P+ g + +Cv + +++ Tem22_gi|3 896 ----SDGRFRWTDGS-IINFISWA--PGKPRP---VGKDKKCVYMTASRE 935 kangkWnDepCdsklpyvC<-* W D C + lpy+C Tem22_gi|3 936 ----DWGDQRCLTALPYIC 950 DB: domain 1 of 1, from 917 to 985: score -49.9, E = 78 *->CCesdrkvpeeClskkCsfntysksalnmvvqkFkqlmffkqddsCp + r v ++ kkC++ t s + + ++C Tem22_gi|3 917 --K-PRPVGKD---KKCVYMTASREDW-------------GD-QRCL 943 lkgkalpemlfCAAqgrDHtdCClrnGVttPqTlaGkkCLtfCdqspgPR alp + C+r+ Vt+ + +q+p+ Tem22_gi|3 944 T---ALPYI-------------CKRSNVTK-------E-----TQPPD-- 963 RFWfvtqldksYvpCldrfes.IkqCFy<-* +t ++ C +++ + CF+ Tem22_gi|3 964 ---LPTTALG---GCPSDWIQfLNKCFQ 985 Involucrin: domain 2 of 2, from 1004 to 1013: score 5.1, E = 1.6e+02 *->leqQegQlkh<-* +eqQe Ql Tem22_gi|3 1004 CEQQEAQLVT 1013 lectin_c: domain 6 of 8, from 1002 to 1107: score 78.7, E = 1.2e-19 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep C+++ ++Lv+I ++ Eq+F++ +++v+ ++WIGL+ Tem22_gi|3 1002 FSCEQQ--EAQLVTITNPLEQAFITASLPNVTfDLWIGLHA------ 1040 tegtWvWsdGslpvnytnWakipgePnnwr.h.kggk.edC.velytktp ++++++W+ + p+ y nWa pgeP++ + + g+k+ C v+l++ ++ Tem22_gi|3 1041 SQRDFQWVEQE-PLMYANWA--PGEPSGPSpApSGNKpTSCaVVLHSPSA 1087 kangkWnDepCdskl.pyvC<-* +g+W+D++C + ++++C Tem22_gi|3 1088 HFTGRWDDRSCTEEThGFIC 1107 lectin_c: domain 7 of 8, from 1151 to 1243: score 43.7, E = 4e-09 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep l C+++ +++L+ + ++ q+Fl++ + ++ +WIGL + Tem22_gi|3 1151 LLCESH--NASLAYVPDPYTQAFLTQAARGLRtPLWIGLAGEE---- 1191 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang ++ + W+ + p+ny W +geP ++ C + +g Tem22_gi|3 1192 GSRRYSWVSEE-PLNYVGWQ--DGEP------QQPGGCTYVDV-----DG 1227 kWnDepCdskl.pyvC<-* W+ +Cd+kl+++vC Tem22_gi|3 1228 AWRTTSCDTKLqGAVC 1243 lectin_c: domain 8 of 8, from 1293 to 1393: score 29.0, E = 6.1e-05 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn....yaWIGLtdink + Cqr+ gg +sI ++ E+ F+++++ + ++++ aW+G++ ++ Tem22_gi|3 1293 QRCQRA--GGAVLSILDEMENVFVWEHLQSYEgqsrGAWLGMNFNP- 1336 geptegtWvWsdGslpvnytnWak.ipgePnnwrhkggkedCvelytktp ++gt vW d++ +vny+nW + g+++ + C +++ Tem22_gi|3 1337 ---KGGTLVWQDNT-AVNYSNWGPpGLGPSML-----SHNSCYWIQSN-- 1375 kangkWnDepCdskl.pyvC<-* +g+W+ +C ++vC Tem22_gi|3 1376 --SGLWRPGACTNITmGVVC 1393 Gram_pos_anchor: domain 1 of 1, from 1400 to 1439: score 4.3, E = 12 *->apaskkkkLPkTGeenatnlltaaGllalvagagllavkrK<-* + + + +LP+ ++ ++l+a+ ll +++ a+l +++r+ Tem22_gi|3 1400 QSSFSPSALPENP-AALVVVLMAVLLLLALLTAALILYRRR 1439 oxidored_q4: domain 1 of 1, from 1415 to 1444: score 1.1, E = 46 *->smlgflsaliFlfiLtiGLvYEWrqGaLeW<-* ++++++ l++l++Lt+ L+ r+ + e Tem22_gi|3 1415 LVVVLMAVLLLLALLTAALILYRRRQSIER 1444 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem22 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- lectin_c Lectin C-type domain 553.9 2.2e-189 8 fn2 Fibronectin type II domain 75.1 1.8e-27 1 Ricin_B_lectin Similarity to lectin domain of ricin b 29.6 6.6e-07 1 MORN MORN motif 3.1 62 1 OPR Octicosapeptide repeat 3.0 45 1 granulin Granulins 2.6 21 1 Xlink Extracellular link domain 1.9 3 1 oxidored_q4 NADH-ubiquinone/plastoquinone oxidored 1.1 46 1 LexA_DNA_bind LexA DNA binding domain 0.9 51 1 Fork_head Fork head domain 0.9 89 1 5-FTHF_cyc-lig 5-formyltetrahydrofolate cyclo-ligase 0.8 36 1 Pico_P2A Picornavirus core protein 2A 0.4 70 1 Nuclease S1/P1 Nuclease 0.4 26 1 Lipase_GDSL Lipase/Acylhydrolase with GDSL-like mo 0.3 56 1 mce mce related protein 0.2 99 1 THF_DHG_CYH Tetrahydrofolate dehydrogenase/cyclohy 0.1 47 1 biopterin_H Biopterin-dependent aromatic amino aci -1.0 41 1 neur Neuraminidases -1.2 39 1 hemocyanin hemocyanin family -1.4 58 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Ricin_B_lectin 1/1 41 161 .. 1 145 [] 29.6 6.6e-07 fn2 1/1 187 228 .. 1 42 [] 75.1 1.8e-27 Lipase_GDSL 1/1 295 310 .. 132 146 .] 0.3 56 lectin_c 1/8 264 359 .. 1 111 [] 105.5 2.7e-35 OPR 1/1 415 434 .. 11 30 .] 3.0 45 THF_DHG_CYH 1/1 429 440 .. 1 12 [. 0.1 47 lectin_c 2/8 408 504 .. 1 111 [] 126.2 2.1e-42 Xlink 1/1 539 556 .. 14 31 .. 1.9 3 lectin_c 3/8 547 644 .. 1 111 [] 83.7 8.1e-28 granulin 1/1 669 684 .. 28 43 .] 2.6 21 Nuclease 1/1 724 735 .. 304 315 .] 0.4 26 hemocyanin 1/1 760 783 .. 784 807 .] -1.4 58 lectin_c 4/8 702 808 .. 1 111 [] 44.5 2.5e-14 5-FTHF_cyc-lig 1/1 820 854 .. 1 35 [. 0.8 36 biopterin_H 1/1 866 882 .. 320 336 .] -1.0 41 Fork_head 1/1 863 885 .. 78 101 .] 0.9 89 MORN 1/1 898 907 .. 14 23 .] 3.1 62 neur 1/1 902 912 .. 477 487 .] -1.2 39 LexA_DNA_bind 1/1 910 920 .. 55 65 .] 0.9 51 lectin_c 5/8 851 950 .. 1 111 [] 76.8 2e-25 Pico_P2A 1/1 1043 1050 .. 120 127 .] 0.4 70 lectin_c 6/8 1002 1107 .. 1 111 [] 76.8 1.9e-25 lectin_c 7/8 1151 1243 .. 1 111 [] 41.9 1.9e-13 lectin_c 8/8 1293 1356 .. 1 69 [. 36.7 1.2e-11 oxidored_q4 1/1 1415 1444 .. 53 82 .] 1.1 46 mce 1/1 1419 1444 .. 1 28 [. 0.2 99 Alignments of top-scoring domains: Ricin_B_lectin: domain 1 of 1, from 41 to 161: score 29.6, E = 6.6e-07 *->rsvfvitilvngnsgrClDvnsssesdGnqvqlwnChsnpgknQkWs ++vf+i + Cl ++++ + +C+ +Q W+ Tem22_gi|3 41 PNVFLIF---SHGLQGCLEAQGGQV-----RVTPACNT-SLPAQRWK 78 ltYdesdgeirsvvnndkCLtv..nanspgseVklyqCdsatsdnQkWel ++ s ++++ ++ +CL+++ +++++ ++ +y+Cd +n++W++ Tem22_gi|3 79 WV---SRNRLFN-LGTMQCLGTgwPGTNTTASLGMYECDR-EALNLRWHC 123 nndglignkIllnlvntglvLdvkgsdtqngTklilytcsggrnqqWlpt + g+ ++ l+L +++s+++ +l + r+ qW ++ Tem22_gi|3 124 RTLGDQLS----------LLLGARTSNISKP--GTLERGDQTRSGQWRIY 161 <-* Tem22_gi|3 - - fn2: domain 1 of 1, from 187 to 228: score 75.1, E = 1.8e-27 *->CvFPFiynGksYesCTseGRsdgrlWCsTTaNYDrDkkwGFC<-* C PF+y +++++ CTs+GR dg+lWC+TT++Y +D +wGFC Tem22_gi|3 187 CTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFC 228 Lipase_GDSL: domain 1 of 1, from 295 to 310: score 0.3, E = 56 *->tielGgNDlc.dyfrd<-* t+++G NDl+++++++ Tem22_gi|3 295 TLWIGLNDLDtSGGWQ 310 lectin_c: domain 1 of 8, from 264 to 359: score 105.5, E = 2.7e-35 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep + C+++ g++L+sI Eq +++ l++ + + +WIGL+d + Tem22_gi|3 264 ASCEQQ--GADLLSITEIHEQTYINGLLTGYSsTLWIGLNDLD---- 304 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang t+g+W+Wsd+s p+ y nW +++P+n + e+C++++t + +g Tem22_gi|3 305 TSGGWQWSDNS-PLKYLNWE--SDQPDN----PSEENCGVIRTES---SG 344 kWnDepCdsklpyvC<-* W +++C+ lpyvC Tem22_gi|3 345 GWQNRDCSIALPYVC 359 OPR: domain 1 of 1, from 415 to 434: score 3.0, E = 45 *->GDlvtisSdeeleeAlrlar<-* GDlv+i S +ele+ ++ + Tem22_gi|3 415 GDLVSIHSMAELEFITKQIK 434 THF_DHG_CYH: domain 1 of 1, from 429 to 440: score 0.1, E = 47 *->ireelaeevaal<-* i +++++ev++l Tem22_gi|3 429 ITKQIKQEVEEL 440 lectin_c: domain 2 of 8, from 408 to 504: score 126.2, E = 2.1e-42 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep +aC r gg+LvsI+s +E +F+++ +k++ + +WIGL+d + Tem22_gi|3 408 KACLRG--GGDLVSIHSMAELEFITKQIKQEVeELWIGLNDLK---- 448 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang +++++WsdGs v++t+W p ePnn r + edCv +++ p g Tem22_gi|3 449 LQMNFEWSDGS-LVSFTHWH--PFEPNNFR--DSLEDCVTIWG--P--EG 489 kWnDepCdsklpyvC<-* +WnD pC+++lp +C Tem22_gi|3 490 RWNDSPCNQSLPSIC 504 Xlink: domain 1 of 1, from 539 to 556: score 1.9, E = 3 *->kltFeEAqaaClrqgAri<-* ++t+ EA+++C + g+++ Tem22_gi|3 539 QVTYSEARRLCTDHGSQL 556 lectin_c: domain 3 of 8, from 547 to 644: score 83.7, E = 8.1e-28 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn..yaWIGLtdinkge C+ + g++Lv+I ++ Eq+F++sl+ + ++y+W+ L d n Tem22_gi|3 547 RLCTDH--GSQLVTITNRFEQAFVSSLIYNWEgeYFWTALQDLN--- 588 ptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkan + g+++W G+ +v yt+W+ +++P ++ Cv+l t+++ Tem22_gi|3 589 -STGSFFWLSGD-EVMYTHWN--RDQPGY-----SRGGCVALATGSA--M 627 gkWnDepCdskl.pyvC<-* g+W ++C s + +y+C Tem22_gi|3 628 GLWEVKNCTSFRaRYIC 644 granulin: domain 1 of 1, from 669 to 684: score 2.6, E = 21 *->CPqGfhCdadggrClq<-* CPqG d++ ++C + Tem22_gi|3 669 CPQGWASDTKLRYCYK 684 Nuclease: domain 1 of 1, from 724 to 735: score 0.4, E = 26 *->AaiLNrifsses<-* A++LN if+ + Tem22_gi|3 724 ANMLNKIFGESE 735 hemocyanin: domain 1 of 1, from 760 to 783: score -1.4, E = 58 *->kdvdtsdkpykkddltyhdsdhlv<-* ++ d + ++y + d+++hd d+ Tem22_gi|3 760 RWSDGVGFSYHNFDRSRHDDDDIR 783 lectin_c: domain 4 of 8, from 702 to 808: score 44.5, E = 2.5e-14 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn...........yaWI aCq++ g++L+s s+eE F+++++++ ++++++ ++++ +WI Tem22_gi|3 702 GACQEL--GAQLLSLASYEEEHFVANMLNKIFgesepeiheqhWFWI 746 GLtdinkgeptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCve GL++++ + +W WsdG +y n +++ + ++ r C++ Tem22_gi|3 747 GLNRRD--PRGGQSWRWSDGV-GFSYHNFDRSRHDDDDIR------GCAV 787 lytktpkangkWnDepCdsklpyvC<-* l + +W + Cd++l ++C Tem22_gi|3 788 LDLA----SLQWVAMQCDTQLDWIC 808 5-FTHF_cyc-lig: domain 1 of 1, from 820 to 854: score 0.8, E = 36 *->rRralsreerlqeseelsqllkslpeqqkakrvaa<-* ++ + r e+l+ +e +++++ ++++ +a+r+++ Tem22_gi|3 820 DDSPQGRREWLRFQEAEYKFFEHHSTWAQAQRICT 854 biopterin_H: domain 1 of 1, from 866 to 882: score -1.0, E = 41 *->nsElqiLsnAlakisrk<-* + El+ Ls+ l+k sr Tem22_gi|3 866 QAELDFLSHNLQKFSRA 882 Fork_head: domain 1 of 1, from 863 to 885: score 0.9, E = 89 *->ldPeaedmFenkkkkgsflkrrrr<-* ++ +ae +F+ + ++f++++ + Tem22_gi|3 863 VHSQAELDFLS-HNLQKFSRAQEQ 885 MORN: domain 1 of 1, from 898 to 907: score 3.1, E = 62 *->GvytwanGdr<-* G+++w++G+ Tem22_gi|3 898 GRFRWTDGSI 907 neur: domain 1 of 1, from 902 to 912: score -1.2, E = 39 *->WpDGAdlpffi<-* W DG +++f + Tem22_gi|3 902 WTDGSIINFIS 912 LexA_DNA_bind: domain 1 of 1, from 910 to 920: score 0.9, E = 51 *->yIerdpgkpRa<-* +I + pgkpR Tem22_gi|3 910 FISWAPGKPRP 920 lectin_c: domain 5 of 8, from 851 to 950: score 76.8, E = 2e-25 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.....yaWIGLtdin C+ ++ L+s++sq+E dFl+ + ++ + +++ WIGL+ + Tem22_gi|3 851 RICTWF--QAELTSVHSQAELDFLSHNLQKFSraqeqHWWIGLHTSE 895 kgeptegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktp ++g + W+dGs n+ +Wa pg P+ g + +Cv + +++ Tem22_gi|3 896 ----SDGRFRWTDGS-IINFISWA--PGKPRP---VGKDKKCVYMTASRE 935 kangkWnDepCdsklpyvC<-* W D C + lpy+C Tem22_gi|3 936 ----DWGDQRCLTALPYIC 950 Pico_P2A: domain 1 of 1, from 1043 to 1050: score 0.4, E = 70 *->RDLlwlEe<-* RD++w+E Tem22_gi|3 1043 RDFQWVEQ 1050 lectin_c: domain 6 of 8, from 1002 to 1107: score 76.8, E = 1.9e-25 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep C+++ ++Lv+I ++ Eq+F++ +++v+ ++WIGL+ Tem22_gi|3 1002 FSCEQQ--EAQLVTITNPLEQAFITASLPNVTfDLWIGLHA------ 1040 tegtWvWsdGslpvnytnWakipgePnnwr.h.kggk.edC.velytktp ++++++W+ + p+ y nWa pgeP++ + + g+k+ C v+l++ ++ Tem22_gi|3 1041 SQRDFQWVEQE-PLMYANWA--PGEPSGPSpApSGNKpTSCaVVLHSPSA 1087 kangkWnDepCdskl.pyvC<-* +g+W+D++C + ++++C Tem22_gi|3 1088 HFTGRWDDRSCTEEThGFIC 1107 lectin_c: domain 7 of 8, from 1151 to 1243: score 41.9, E = 1.9e-13 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn.yaWIGLtdinkgep l C+++ +++L+ + ++ q+Fl++ + ++ +WIGL + Tem22_gi|3 1151 LLCESH--NASLAYVPDPYTQAFLTQAARGLRtPLWIGLAGEE---- 1191 tegtWvWsdGslpvnytnWakipgePnnwrhkggkedCvelytktpkang ++ + W+ + p+ny W +geP ++ C + +g Tem22_gi|3 1192 GSRRYSWVSEE-PLNYVGWQ--DGEP------QQPGGCTYVDV-----DG 1227 kWnDepCdskl.pyvC<-* W+ +Cd+kl+++vC Tem22_gi|3 1228 AWRTTSCDTKLqGAVC 1243 lectin_c: domain 8 of 8, from 1293 to 1356: score 36.7, E = 1.2e-11 *->laCqrekpgghLvsIqsqeEqdFlqslvkavn....yaWIGLtdink + Cqr+ gg +sI ++ E+ F+++++ + ++++ aW+G++ ++ Tem22_gi|3 1293 QRCQRA--GGAVLSILDEMENVFVWEHLQSYEgqsrGAWLGMNFNP- 1336 geptegtWvWsdGslpvnytnWakip<-* ++gt vW d++ +vny+nW p Tem22_gi|3 1337 ---KGGTLVWQDNT-AVNYSNWG--P 1356 oxidored_q4: domain 1 of 1, from 1415 to 1444: score 1.1, E = 46 *->smlgflsaliFlfiLtiGLvYEWrqGaLeW<-* ++++++ l++l++Lt+ L+ r+ + e Tem22_gi|3 1415 LVVVLMAVLLLLALLTAALILYRRRQSIER 1444 mce: domain 1 of 1, from 1419 to 1444: score 0.2, E = 99 *->lglfllvalvvllvvllwfglrslPlpr<-* l ++ll+++++ ++++l+++++s + r Tem22_gi|3 1419 LMAVLLLLALLTAALILYRRRQS--IER 1444 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem22 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Tue Oct 31 14:58:49 2000 Sequence file: tem22 ---------------------------------------- Sequence Tem22_gi|3327232|dbj|BAA31684.1| (1479 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 69: NTSL 102: NTTA 140: NISK 364: NATA 497: NQSL 588: NSTG 634: NCTS 954: NVTK 1029: NVTF 1134: NGTF 1157: NASL 1350: NYSN 1386: NITM Total matches: 13 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 1194: RRYS 1253: RRIS 1438: RRQS Total matches: 3 Matching pattern PS00005 PKC_PHOSPHO_SITE: 204: TGR 258: SWR 406: SKK 560: TNR 637: SFR 688: SER 758: SWR 1041: SQR 1090: TGR 1136: TFR 1193: SRR 1463: TEK Total matches: 12 Matching pattern PS00006 CK2_PHOSPHO_SITE: 125: TLGD 163: SEED 204: TGRE 215: TTQD 237: TFWD 258: SWRE 275: SITE 370: TPPD 402: SWQE 422: SMAE 477: SLED 541: TYSE 597: SGDE 716: SYEE 734: SEPE 775: SRHD 802: TQLD 865: SQAE 894: SESD 933: SRED 1041: SQRD 1097: SCTE 1221: TYVD 1232: TSCD 1304: SILD 1458: SPTE 1470: SDME Total matches: 27 Matching pattern PS00007 TYR_PHOSPHO_SITE: 1444: RGAFEGARY Total matches: 1 Matching pattern PS00008 MYRISTYL: 50: GLQGCL 53: GCLEAQ 89: GTMQCL 100: GTNTTA 108: GMYECD 135: GARTSN 180: GNSHGK 200: GCTSTG 287: GLLTGY 336: GVIRTE 344: GGWQNR 458: GSLVSF 520: GCRKGW 616: GGCVAL 624: GSAMGL 764: GVGFSY 890: GLHTSE 905: GSIINF 970: GGCPSD 1037: GLHASQ 1072: GNKPTS 1110: GTDPSL 1127: GTELSY 1227: GAWRTT 1240: GAVCGV 1264: GLADSA 1300: GAVLSI 1323: GQSRGA 1327: GAWLGM 1358: GLGPSM 1382: GACTNI 1445: GAFEGA 1449: GARYSR Total matches: 33 Matching pattern PS00009 AMIDATION: 824: QGRR Total matches: 1 Matching pattern PS00016 RGD: 149: RGD Total matches: 1 Matching pattern PS00023 FIBRONECTIN_2: 187: CTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFC Total matches: 1 Matching pattern PS00615 C_TYPE_LECTIN_1: 335: CGVIRTESSGGWQNRDCSIALPYVC 481: CVTIWGPEGRWNDSPCNQSLPSIC 927: CVYMTASREDWGDQRCLTALPYIC Total matches: 3 Matching pattern PS00213 LIPOCALIN: 1219: GCTYVDVDGAWRTT Total matches: 1 Matching pattern PS00241 RECEPTOR_CYTOKINES_1: 521: CRKGWTWHSPSCYW Total matches: 1 Total no of hits in this sequence: 97 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1479 residues. Total no of hits in all sequences: 97. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search L=0 25.1908 1951 pos. 244 - 360 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSte-rktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasnsyVWIGL -59 # S 244 LTDSCYQFNFQsTLSWREAWASCEQQ--GAD-LLSITEIHEQTYINGLLTGYSSTLWIGL -1180 # # P 60 NDIttegkWTWSDGtpvSYTNWapgepnnkgerEDCVELys---gKWNDENCEKKLPFVC -2 # S 301 NDLDTSGGWQWSDNSPLKYLNWESDQPDN-PSEENCGVIRTessGGWQNRDCSIALPYVC -1121 # # P 117 K -1 # S 360 K -1120 # L=0 28.3140 2195 pos. 389 - 505 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSterktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasnsyVWIGLN -58 # S 389 FQGHCYRLQAEKRSWQESKKACLRG--GGD-LVSIHSMAELEFITKQIKQEVEELWIGLN -1035 # # P 61 DIttegkWTWSDGtpvSYTNWapgepnn-kgerEDCVELys--gKWNDENCEKKLPFVCK -1 # S 446 DLKLQMNFEWSDGSLVSFTHWHPFEPNNfRDSLEDCVTIWGpeGRWNDSPCNQSLPSICK -975 # L=0 19.7636 1527 pos. 528 - 635 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSterktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasn-syVWIGL -59 # S 528 HSPSCYWLGEDQVTYSEARRLCTDH---GSQLVTITNRFEQAFVSSLIYNWEgEYFWTAL -896 # # P 60 NDIttegkWTWSDGtpvSYTNWapgepnnkgerEDCVELys----gKWNDENC -10 # S 585 QDLNSTGSFFWLSGDEVMYTHWNRDQPG--YSRGGCVALATgsamGLWEVKNC -845 # L=0 18.4324 1423 pos. 678 - 809 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSte-----rktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasn--- -66 # S 678 KLRYCYKVFSSerlqdKKSWVQAQGACQEL--GAQ-LLSLASYEEEHFVANMLNKIFges -746 # # P 52 -------syVWIGLNDItt--egkWTWSDGtpvSYTNWapgepnnkgerEDCVELys--g -17 # S 735 epeiheqHWFWIGLNRRDPrgGQSWRWSDGVGFSYHNFDRSR-HDDDDIRGCAVLDLasL -687 # # P 102 KWNDENCEKKLPFVCK -1 # S 794 QWVAMQCDTQLDWICK -671 # L=0 23.9364 1853 pos. 832 - 951 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSterktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasn----syVW -62 # S 832 FQEAEYKFFEHHSTWAQAQRICTWFQ--AE-LTSVHSQAELDFLSHNLQKFSraqeQHWW -592 # # P 57 IGLNDIttegkWTWSDGtpvSYTNWapgepnnkgerEDCVELys--gKWNDENCEKKLPF -4 # S 889 IGLHTSESDGRFRWTDGSIINFISWAPGKPRPVGKDKKCVYMTAsrEDWGDQRCLTALPY -532 # # P 115 VCK -1 # S 949 ICK -529 # L=0 19.8020 1530 pos. 979 - 1101 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSte----rktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasnsyVW -62 # S 979 FLNKCFQVQGQepqsRVKWSEAQFSCEQQE--AQ-LVTITNPLEQAFITASLPNVTFDLW -445 # # P 57 IGLNDIttegkWTWSDGtpvSYTNWapgepnn------kgerEDCVELys-------gKW -15 # S 1036 IGLHAS--QRDFQWVEQEPLMYANWAPGEPSGpspapsGNKPTSCAVVLHspsahftGRW -387 # # P 104 NDENCEKK -7 # S 1094 DDRSCTEE -379 # L=0 16.8580 1300 pos. 1132 - 1238 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSterktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasnsyVWIGLN -58 # S 1132 YLNGTFRLLQKPLRWHDALLLCESHNAS---LAYVPDPYTQAFLTQAARGLRTPLWIGLA -292 # # P 61 DIttegkWTWSDGtpvSYTNWapgepnnkgerEDCVELys-gKWNDENCEKKL -6 # S 1189 GEEGSRRYSWVSEEPLNYVGWQDGEPQ---QPGGCTYVDVdGAWRTTSCDTKL -242 # L=0 16.5892 1279 pos. 1273 - 1384 PS50041|C_TYPE_LECTIN_2 C-type lectin domain profile. # # P 1 YGGHCYYFSte-rktwaeAELFCqQQrpggHlLVSIQSqeeqefLTSLVKasn---syVW -62 # S 1273 FREHCYSFHMElLLGHKEARQRCQRA--GGA-VLSILDEMENVFVWEHLQSYEgqsRGAW -151 # # P 57 IGLNDIttegkWTWSDGtpvSYTNWap---gepnnkgerEDCVELys--gKWNDENC -10 # S 1330 LGMNFNPKGGTLVWQDNTAVNYSNWGPpglG-P-SMLSHNSCYWIQSnsGLWRPGAC -96 # ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem22 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1479 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem22 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem22 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] (1479 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value HISDAC Histone deacetylase domain 29 0.044 UBA Ubiquitin pathway associated domain 27 0.25 RHOGEF RHO-type GTPase GDP exchange factor 24 1.2 CALC Calcineurin like Phosphoesterase domain 24 1.3 DHHC Novel zinc finger domain with DHHC signature 24 1.4 SPRY SPRY domain 24 2.0 SNARE Alpha helical domains which are involved in vesicle fu... 23 2.4 PH Pleckstrin homology domain (lipid and protein interactio... 23 3.4 AAA AAA+ ATPase Module 23 4.4 GAF GAF domain (Predicted Small-ligand binding domain) 22 4.6 KIN Protein kinase domain 22 5.6 AP2 A plant specific DNA binding domain (Apetala 2 like) 22 6.3 KR Kringle domain (Adhesion module) 21 7.8 >HISDAC Histone deacetylase domain Length = 433 Score = 29.2 bits (65), Expect = 0.044 Identities = 17/78 (21%), Positives = 29/78 (36%), Gaps = 20/78 (25%) Query: 858 AELTSVHSQAELDFLSH----NLQKFSRAQEQHWWIGLHTSESDGRF----RWTDGSI-- 907 E+ H+ +DFLS NL+ F R + + +G DG + GS+ Sbjct: 72 QEMCQFHTDEYIDFLSRVTPDNLEMFKRESVK-FNVGDDCPVFDGLYEYCSISGGGSMEG 130 Query: 908 ---------INFISWAPG 916 +++A G Sbjct: 131 AARLNRGKCDVAVNYAGG 148 Score = 22.2 bits (47), Expect = 4.6 Identities = 5/18 (27%), Positives = 8/18 (43%) Query: 271 ADLLSITEIHEQTYINGL 288 A + + H YI+ L Sbjct: 69 ATKQEMCQFHTDEYIDFL 86 >UBA Ubiquitin pathway associated domain Length = 255 Score = 26.6 bits (58), Expect = 0.25 Identities = 13/61 (21%), Positives = 22/61 (35%), Gaps = 9/61 (14%) Query: 689 ERLQDKKSWVQA-----QGACQELGA---QLLSLASYEEEHFVANMLNKIFGESEPEIHE 740 E L + V + + A QL E FV +ML + G++ ++ E Sbjct: 133 EDLLSLRQVVSGNPEALAPLLENISARYPQLREHIMANPEVFV-SMLLEAVGDNMQDVME 191 Query: 741 Q 741 Sbjct: 192 G 192 >RHOGEF RHO-type GTPase GDP exchange factor Length = 176 Score = 24.1 bits (52), Expect = 1.2 Identities = 9/34 (26%), Positives = 14/34 (40%), Gaps = 1/34 (2%) Query: 1278 YSFHMELLLGHKEAR-QRCQRAGGAVLSILDEME 1310 Y FH ++ L E +R G L D+ + Sbjct: 55 YEFHNDIFLSSLENCAHAPERVGPCFLERKDDFQ 88 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 24.0 bits (51), Expect = 1.3 Identities = 8/54 (14%), Positives = 14/54 (25%), Gaps = 2/54 (3%) Query: 150 GDQTRSGQWRIYGSEEDLCALPYHEVYTIQGNSHGKPCT--IPFKYDNQWFHGC 201 GD + Y ++ V+ I GN +P + Sbjct: 62 GDLVQDSSDEGYIRFVEMMKPFNKPVFWIPGNHDFQPKMAEFLNQPPMNAAKHL 115 >DHHC Novel zinc finger domain with DHHC signature Length = 217 Score = 24.0 bits (51), Expect = 1.4 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 1059 APGEPSGPSPAPSGNK 1074 AP +PSGPSP G + Sbjct: 23 APAQPSGPSPELQGQR 38 Score = 21.6 bits (45), Expect = 7.0 Identities = 4/48 (8%), Positives = 5/48 (10%), Gaps = 2/48 (4%) Query: 184 GKP-CTIPFKYDNQWFHGCTSTGREDGHLWCATTQDYGKDERWGFCPI 230 + L C R C Sbjct: 106 ADDNVRDKSYAGPLPIFNRSQHAHVIEDLHCNLCNVD-VSARSKHCSA 152 >SPRY SPRY domain Length = 122 Score = 23.7 bits (51), Expect = 2.0 Identities = 16/87 (18%), Positives = 23/87 (26%), Gaps = 11/87 (12%) Query: 518 DHGCRKGW-TWHSPSCYWLGEDQVTYS-EARRLCTDHGSQLVTITNRFEQAFVSSLIYNW 575 G T P + G Q +Y C ++ F V Y+ Sbjct: 15 GDKICFGLTTNDHPIEVYPGNYQGSYGYSGDGKCYFGTNEGRVYGPSFSSGDVVGCGYDS 74 Query: 576 E---------GEYFWTALQDLNSTGSF 593 G Y A Q +N G + Sbjct: 75 SSKTLYFTKNGVYLGVAAQKVNLIGLY 101 >SNARE Alpha helical domains which are involved in vesicle fusion Length = 254 Score = 23.3 bits (50), Expect = 2.4 Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 9/62 (14%) Query: 1391 VVCKLPRAEQSS--FSPSA-------LPENPAALVVVLMAVLLLLALLTAALILYRRRQS 1441 +C+L A + FS + V +L+LL AL L R + Sbjct: 154 PMCELGSANGVTSAFSVDCKGAGKISSAHQRLEPATLSGIVAFILSLLCGALNLIRGFHA 213 Query: 1442 IE 1443 IE Sbjct: 214 IE 215 >PH Pleckstrin homology domain (lipid and protein interaction domain) Length = 138 Score = 22.7 bits (48), Expect = 3.4 Identities = 8/73 (10%), Positives = 24/73 (31%), Gaps = 4/73 (5%) Query: 628 GLWEVKNCTSFRARYICRQSLGTPVTPELPGPDPTPSLTGSCPQGWASDTKLRYCYKVFS 687 G+ ++N + + + P G T + + + + S Sbjct: 56 GIIPLENLSIREVDDPRKPNCFELYIPNNKGQLIKACKTEADGRVVEGNHMVYR----IS 111 Query: 688 SERLQDKKSWVQA 700 + ++K W+++ Sbjct: 112 APTQEEKDEWIKS 124 >AAA AAA+ ATPase Module Length = 298 Score = 22.5 bits (47), Expect = 4.4 Identities = 6/13 (46%), Positives = 10/13 (76%) Query: 1354 WGPPGLGPSMLSH 1366 +GPPG G ++L+ Sbjct: 86 YGPPGTGKTLLAR 98 >GAF GAF domain (Predicted Small-ligand binding domain) Length = 160 Score = 22.2 bits (47), Expect = 4.6 Identities = 9/41 (21%), Positives = 19/41 (45%) Query: 701 QGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQ 741 + C L A +SL Y + +A + ++F + + E+ Sbjct: 11 KKLCFLLQADRMSLFMYRARNGIAELATRLFNVHKDAVLEE 51 >KIN Protein kinase domain Length = 313 Score = 22.0 bits (46), Expect = 5.6 Identities = 12/70 (17%), Positives = 20/70 (28%), Gaps = 6/70 (8%) Query: 927 CVYMTASREDWGDQRCLTALPYICKRSNVTKETQPPDLPTTALGGCPSDWIQFLNKCFQV 986 + S ED + A +I ++ QP D I L + Sbjct: 248 ETIGSPSEEDICNIANEQARQFI---RSLNMGNQPKVNFANMFPKANPDAIDLLERMLYF 304 Query: 987 QGQEPQSRVK 996 +P R+ Sbjct: 305 ---DPSKRLT 311 >AP2 A plant specific DNA binding domain (Apetala 2 like) Length = 218 Score = 22.0 bits (46), Expect = 6.3 Identities = 12/47 (25%), Positives = 19/47 (39%), Gaps = 6/47 (12%) Query: 75 QRW-KWVSRNRLFNLGTMQCLGTGWPGTNTTASLGMYECDREALNLR 120 + W KWVS R + W GT + + D AL+++ Sbjct: 42 RNWGKWVSEIREPRKKSRI-----WLGTFPSPEMAARAHDVAALSIK 83 >KR Kringle domain (Adhesion module) Length = 86 Score = 21.5 bits (45), Expect = 7.8 Identities = 8/30 (26%), Positives = 14/30 (46%), Gaps = 4/30 (13%) Query: 208 DGHLWCATTQDYGKDERWGFCPIKSNDCET 237 D WC +D W +C +++ CE+ Sbjct: 61 DERPWCYVVKDSAL--SWEYCRLEA--CES 86 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 13 Number of calls to ALIGN: 15 Length of query: 1479 Total length of test sequences: 20182 Effective length of test sequences: 16637.0 Effective search space size: 24045297.3 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem22_gi|3327232|dbj|BAA31684.1| KIAA0709 protein [Homo sapiens] (1479 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|1469029 [26..148] C-type lectin-like 111 9e-26 gi|1053218 [103..244] C-type lectin-like 103 2e-23 gi|1078939 [54..180] C-type lectin-like 103 2e-23 gi|1109918 [24..147] C-type lectin-like 102 4e-23 gi|2706599 [57..185] C-type lectin-like 100 2e-22 gi|2502062 [25..167] C-type lectin-like 93 4e-20 gi|82035 [136..263] beta-Trefoil 44 1e-05 gi|1655439 [28..139] C-type lectin-like 37 0.002 gi|1685117 [55..161] C-type lectin-like 37 0.002 gi|438998 [1..297] S-adenosyl-L-methionine-dependent methylt... 29 0.73 gi|2507361 [53..298] Periplasmic binding protein-like I 28 0.87 gi|115715 [1..97] Ferredoxin-like 28 1.2 gi|2500689 [29..252] Periplasmic binding protein-like I 27 1.8 gi|1536958 [63..316] Periplasmic binding protein-like I 27 2.0 gi|2650282 [2..154] P-loop containing nucleotide triphosphat... 27 2.1 gi|1708629 [67..339] Periplasmic binding protein-like I 27 2.5 gi|797337 [17..308] Periplasmic binding protein-like I 26 3.7 gi|729214 [63..305] Periplasmic binding protein-like I 26 5.4 gi|2244909 [587..959] beta/alpha (TIM)-barrel 26 5.6 gi|137196 [30..266] P-loop containing nucleotide triphosphat... 25 7.2 gi|586847 [59..303] Periplasmic binding protein-like I 25 7.5 gi|628933 [59..301] Periplasmic binding protein-like I 25 7.6 gi|2500694 [73..586] Periplasmic binding protein-like II 25 8.3 gi|121769 [3..207] Class I glutamine amidotransferases 25 8.5 gi|1616941 [10..247] Metzincin-like 25 9.0 gi|1762781 [797..951] C2 domain-like 25 9.0 gi|2506563 [66..342] Periplasmic binding protein-like I 25 9.5 >gi|1469029 [26..148] C-type lectin-like Length = 123 Score = 111 bits (275), Expect = 9e-26 Identities = 25/127 (19%), Positives = 44/127 (33%), Gaps = 7/127 (5%) Query: 382 CEPSWQPFQ--GHCYRLQAEKRSWQESKKACLRGGGDLVSIHSMAELEFITKQIKQEVEE 439 C W +CY ++ E+ C GG + + E F+ + + Sbjct: 1 CPDGWLLSNETSYCYLTSDRYMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYTQGMFAQ 60 Query: 440 LWIGLNDLKLQMNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIW-GPEGRWNDSPCNQ 498 W+G+ + +DG+ + W EP+ DCV+ + + C Sbjct: 61 PWLGVTRNLTNNKWYNNDGTTPYSSWWIAGEPSL----NGDCVSFKSTDKQGIRATQCWS 116 Query: 499 SLPSICK 505 P ICK Sbjct: 117 IQPVICK 123 Score = 107 bits (265), Expect = 1e-24 Identities = 25/126 (19%), Positives = 46/126 (35%), Gaps = 3/126 (2%) Query: 235 CETFWDKDQLTDSCYQFNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSS 294 C W T CY +++ EA C+ G + +E T++ G + Sbjct: 1 CPDGWLLSNETSYCYLT-SDRYMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYTQGMFA 59 Query: 295 TLWIGLNDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRDCSIA 354 W+G+ T+ W +D + W + +P +C ++ G + C Sbjct: 60 QPWLGVTRNLTNNKWYNNDGTTPYSSWWIAGEPSL--NGDCVSFKSTDKQGIRATQCWSI 117 Query: 355 LPYVCK 360 P +CK Sbjct: 118 QPVICK 123 Score = 87.5 bits (214), Expect = 1e-18 Identities = 25/126 (19%), Positives = 50/126 (38%), Gaps = 10/126 (7%) Query: 829 WLRFQEAE--YKFFEHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQH 886 WL E Y + + T+A+A C + + E F ++ +++ Sbjct: 5 WLLSNETSYCYLTSDRYMTFAEADGYCQSLGGSQAFMMNSYEFTF----MKSYTQGMFAQ 60 Query: 887 WWIGLHTSESDGRFRWTDGSIINFISWAPGKPRPVGKDKKCVYMTASRED-WGDQRCLTA 945 W+G+ + ++ ++ DG+ W G+P G CV ++ + +C + Sbjct: 61 PWLGVTRNLTNNKWYNNDGTTPYSSWWIAGEPSLNG---DCVSFKSTDKQGIRATQCWSI 117 Query: 946 LPYICK 951 P ICK Sbjct: 118 QPVICK 123 Score = 84.4 bits (206), Expect = 1e-17 Identities = 25/126 (19%), Positives = 43/126 (33%), Gaps = 6/126 (4%) Query: 521 CRKGWTWHSP--SCYWLGEDQVTYSEARRLCTDHGSQLVTITNRFEQAFVSSLIYNWEGE 578 C GW + CY + +T++EA C G + N +E F+ S Sbjct: 1 CPDGWLLSNETSYCYLTSDRYMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYT-QGMFA 59 Query: 579 YFWTALQDLNSTGSFFWLSGDEVMYTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTSF 638 W + + ++ G + W +P G CV+ + + C S Sbjct: 60 QPWLGVTRNLTNNKWYNNDGTTPYSSWWIAGEP-SLNGDCVSFKS-TDKQGIRATQCWSI 117 Query: 639 RARYIC 644 + IC Sbjct: 118 Q-PVIC 122 Score = 84.0 bits (205), Expect = 1e-17 Identities = 20/141 (14%), Positives = 41/141 (28%), Gaps = 22/141 (15%) Query: 972 CPSDWIQF--LNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQAFITASLPN 1029 CP W+ + C+ + + ++EA C+ + N E F+ + Sbjct: 1 CPDGWLLSNETSYCYLTSDR----YMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYTQG 56 Query: 1030 VTFDLWIGLHA--SQRDFQWVEQEPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSA 1087 + W+G+ + + + + W GEP + C + Sbjct: 57 MFAQPWLGVTRNLTNNKWYNNDGTTPYSSWWIAGEP---------SLNGDCVSFKSTD-- 105 Query: 1088 HFTGRWDDRSCTEETHGFICQ 1108 C IC+ Sbjct: 106 --KQGIRATQCWSI-QPVICK 123 Score = 78.2 bits (190), Expect = 8e-16 Identities = 29/142 (20%), Positives = 44/142 (30%), Gaps = 20/142 (14%) Query: 669 CPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELGAQLLSLASYEEEHFVANMLN 728 CP GW + YCY ++ +A G CQ LG + + E F+ + Sbjct: 1 CPDGWLLSNETSYCYLTS-----DRYMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYTQ 55 Query: 729 KIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVGFSYHNFDRSRHDDDDIRGCAVL 788 +F W+G+ R W +DG + + C Sbjct: 56 GMFA----------QPWLGVTRNLTNN--KWYNNDGTTPYSSWWIAG--EPSLNGDCVSF 101 Query: 789 DLA-SLQWVAMQCDTQLDWICK 809 A QC + ICK Sbjct: 102 KSTDKQGIRATQCWSIQPVICK 123 Score = 65.3 bits (157), Expect = 6e-12 Identities = 20/118 (16%), Positives = 41/118 (33%), Gaps = 3/118 (2%) Query: 1128 TELSYLNGTFRLLQKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGLRTPLWIGL 1187 + + + + + + +A C+S S A++ + Y F+ +G+ W+G+ Sbjct: 6 LLSNETSYCYLTSDRYMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYTQGMFAQPWLGV 65 Query: 1188 AGEEGSRRYSWVSEEPLNYVGWQDGEPQQPGGCTYVDVD--GAWRTTSCDTKLQGAVC 1243 + ++ W GEP G C R T C + +C Sbjct: 66 TRNLTNNKWYNNDGTTPYSSWWIAGEPSLNGDCVSFKSTDKQGIRATQCWSIQ-PVIC 122 Score = 64.9 bits (156), Expect = 8e-12 Identities = 32/136 (23%), Positives = 49/136 (35%), Gaps = 15/136 (11%) Query: 1261 CPQGLADSAWIPFRE--HCYSFHMELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHL 1318 CP G W+ E +CY + EA CQ GG+ +++ E F+ + Sbjct: 1 CPDG-----WLLSNETSYCYLTSDR-YMTFAEADGYCQSLGGSQAFMMNSYEFTFMKSYT 54 Query: 1319 QSYEGQSRGAWLGMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNSGL 1378 Q Q WLG+ N D T S W G L+ + + ++ Sbjct: 55 QGMFAQ---PWLGVTRNLTNNKWYNNDGTTPYSSWWIA---GEPSLNGDCVSFKSTDKQG 108 Query: 1379 WRPGACTNITMGVVCK 1394 R C + V+CK Sbjct: 109 IRATQCWS-IQPVICK 123 >gi|1053218 [103..244] C-type lectin-like Length = 142 Score = 103 bits (256), Expect = 2e-23 Identities = 26/142 (18%), Positives = 49/142 (34%), Gaps = 18/142 (12%) Query: 382 CEPSWQPFQGHCYRLQAEKRSWQESKKACLR-GGGDLVSIHSMAELEFITKQIKQEVEEL 440 C + + G CY R+W ++ C L + S + F ++ + Sbjct: 1 CPKNSIYYMGSCYVFDTTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAMIPSN 60 Query: 441 WIGL--------------NDLKLQMNFEWSDGS-LVSFTHWHPFEPNNFRDSLEDCVTIW 485 + + ++ + GS LV+ T W P EPN + + + CV + Sbjct: 61 PPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYNDVCVALQ 120 Query: 486 --GPEGRWNDSPCNQSLPSICK 505 W + C ++CK Sbjct: 121 SDSFYREWTTALCTILKYTVCK 142 Score = 80.9 bits (197), Expect = 1e-16 Identities = 27/145 (18%), Positives = 44/145 (29%), Gaps = 22/145 (15%) Query: 235 CETFWDKDQLTDSCYQFNFQSTLSWREAWASCEQ-QGADLLSITEIHEQTYINGLLTGYS 293 C SCY F + +W +A C A L + +Q + G L Sbjct: 1 CPKNS--IYYMGSCYVF-DTTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAMI 57 Query: 294 STLWIGL--------------NDLDTSGGWQWSDNSPLK-YLNWESDQPDNP--SEENCG 336 + + T W + S L W +P+ + C Sbjct: 58 PSNPPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYNDVCV 117 Query: 337 VIRTESS-GGWQNRDCSIALPYVCK 360 ++++S W C+I VCK Sbjct: 118 ALQSDSFYREWTTALCTILKYTVCK 142 Score = 76.7 bits (186), Expect = 3e-15 Identities = 25/138 (18%), Positives = 42/138 (30%), Gaps = 15/138 (10%) Query: 829 WLRFQEAEYKFFEHHSTWAQAQRICT-WFQAELTSVHSQAELDFL---------SHNLQK 878 + + + Y F W A C A L V S + F S+ Sbjct: 5 SIYYMGSCYVFDTTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAMIPSNPPAD 64 Query: 879 FSRAQEQHW-WIGLHTSESDGRFRWTDGSIIN-FISWAPGKPRP-VGKDKKCVYMTASRE 935 + W + + + + + GS + WAP +P V + CV + + Sbjct: 65 MRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYNDVCVALQSDSF 124 Query: 936 D--WGDQRCLTALPYICK 951 W C +CK Sbjct: 125 YREWTTALCTILKYTVCK 142 Score = 75.5 bits (183), Expect = 5e-15 Identities = 30/142 (21%), Positives = 52/142 (36%), Gaps = 19/142 (13%) Query: 521 CRKGWTWHSPSCYWLGEDQVTYSEARRLCTDH-GSQLVTITNRFEQAFVSSLI-----YN 574 C K ++ SCY +++A C + + L + + +QAF + + N Sbjct: 1 CPKNSIYYMGSCYVFDTTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAMIPSN 60 Query: 575 WEG------EYFWTAL--QDLNSTGSFFWLSGDE-VMYTHWNRDQPGY---SRGGCVALA 622 + WTA+ + + S+ + G V T W +P CVAL Sbjct: 61 PPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYNDVCVALQ 120 Query: 623 TGSAMGLWEVKNCTSFRARYIC 644 + S W CT + +C Sbjct: 121 SDSFYREWTTALCTILK-YTVC 141 Score = 70.0 bits (169), Expect = 3e-13 Identities = 26/149 (17%), Positives = 43/149 (28%), Gaps = 15/149 (10%) Query: 669 CPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQEL-GAQLLSLASYEEEHFVANML 727 CP+ CY ++W A C + A L + S E++ F A L Sbjct: 1 CPKNSIYYM--GSCYVFD-----TTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYL 53 Query: 728 NKIFGESEPEIHE--QHWFWIGLNRRDPRGGQSWRWSDGVGF-SYHNFDRSRHDDDDIRG 784 + + P W + + SW + G + + + Sbjct: 54 QAMIPSNPPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYN 113 Query: 785 --CAVLDLASL--QWVAMQCDTQLDWICK 809 C L S +W C +CK Sbjct: 114 DVCVALQSDSFYREWTTALCTILKYTVCK 142 Score = 68.1 bits (164), Expect = 8e-13 Identities = 26/155 (16%), Positives = 46/155 (28%), Gaps = 31/155 (20%) Query: 972 CPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQE-AQLVTITNPLEQAFITASLPNV 1030 CP + I ++ C+ W++A C A L + + +QAF L + Sbjct: 1 CPKNSIYYMGSCYVFDTT----LRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAM 56 Query: 1031 TF------------DLWIGLH----ASQRDFQWVEQEPL-MYANWAPGEPSGPSPAPSGN 1073 +W + ++ + + L WAP EP+ N Sbjct: 57 IPSNPPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYV-----N 111 Query: 1074 KPTSCAVVLHSPSAHFTGRWDDRSCTEETHGFICQ 1108 C + W CT +C+ Sbjct: 112 YNDVCVALQSDSFY---REWTTALCTIL-KYTVCK 142 Score = 56.8 bits (135), Expect = 3e-09 Identities = 29/149 (19%), Positives = 48/149 (31%), Gaps = 22/149 (14%) Query: 1261 CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGAVL-SILDEMENVFVWEHLQ 1319 CP+ I + CY F L +A C A L + + F +LQ Sbjct: 1 CPKN-----SIYYMGSCYVFDTT-LRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQ 54 Query: 1320 SYEGQSR---------GAWLGMNFNPKG--GTLVWQDNTAVN-YSNWGPPGLGPSMLSHN 1367 + + G W + + V+ + + + W P + ++ Sbjct: 55 AMIPSNPPADMRPPPDGIWTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYND 114 Query: 1368 SCYWIQSNS--GLWRPGACTNITMGVVCK 1394 C +QS+S W CT VCK Sbjct: 115 VCVALQSDSFYREWTTALCTI-LKYTVCK 142 Score = 42.0 bits (97), Expect = 7e-05 Identities = 23/130 (17%), Positives = 32/130 (23%), Gaps = 24/130 (18%) Query: 1137 FRLLQKPLRWHDALLLCESHN-ASLAYVPDPYTQAFLTQAARGLRTPLWIGL-------- 1187 + W+DA L C + N A+L V QAF + + Sbjct: 13 YVFDTTLRNWNDAALYCNNMNSATLPLVESAEDQAFFAGYLQAMIPSNPPADMRPPPDGI 72 Query: 1188 ------AGEEGSRRYSWVSEEPLN-YVGWQDGEPQ----QPGGCTYVDVD---GAWRTTS 1233 + + L W EP C + D W T Sbjct: 73 WTAVRGVNNVTRASWVYYPGSFLVTDTFWAPQEPNIYVNYNDVCVALQSDSFYREWTTAL 132 Query: 1234 CDTKLQGAVC 1243 C VC Sbjct: 133 CTILK-YTVC 141 >gi|1078939 [54..180] C-type lectin-like Length = 127 Score = 103 bits (256), Expect = 2e-23 Identities = 27/129 (20%), Positives = 51/129 (38%), Gaps = 8/129 (6%) Query: 382 CEPSWQPFQGHCYRLQA--EKRSWQESKKACLRGGGDLVSIHSMAELEFITKQIKQ-EVE 438 C W+ CY + A + +S ++ AC + G+L E F+ K +++ + Sbjct: 1 CNEGWRGAFNFCYYISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVKTS 60 Query: 439 ELWIGLNDLKLQMNFEWSDGS-LVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCN 497 ++ N K + W G + W P +P+ +C+ ++ G +D C Sbjct: 61 TNFVWTNGEKHNGQWNWGTGHPAFTAPRWSPGQPDG----QGECLAVYTHSGFLDDKQCE 116 Query: 498 QSLPSICKK 506 ICK Sbjct: 117 TQYNYICKT 125 Score = 89.9 bits (220), Expect = 3e-19 Identities = 34/131 (25%), Positives = 52/131 (38%), Gaps = 8/131 (6%) Query: 235 CETFWDKDQLTDSCYQFN-FQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGY- 292 C W + CY + F S A A+C+QQ +L +E ++ +L Sbjct: 1 CNEGW--RGAFNFCYYISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVK 58 Query: 293 SSTLWIGLNDLDTSGGWQWSDNSPLK-YLNWESDQPDNPSEENCGVIRTESSGGWQNRDC 351 +ST ++ N +G W W P W QPD + C + T SG ++ C Sbjct: 59 TSTNFVWTNGEKHNGQWNWGTGHPAFTAPRWSPGQPDG--QGECLAVYT-HSGFLDDKQC 115 Query: 352 SIALPYVCKKK 362 Y+CK K Sbjct: 116 ETQYNYICKTK 126 Score = 84.0 bits (205), Expect = 2e-17 Identities = 32/142 (22%), Positives = 50/142 (34%), Gaps = 21/142 (14%) Query: 972 CPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQAFITASLPNV- 1030 C W N C+ + S AQ +C+QQ+ +L P E F+ L V Sbjct: 1 CNEGWRGAFNFCYYISAFA--DIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVK 58 Query: 1031 --TFDLWIGLHASQRDFQWVEQEPLM-YANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSA 1087 T +W + W P W+PG+P + C V Sbjct: 59 TSTNFVWTNGEKHNGQWNWGTGHPAFTAPRWSPGQP---------DGQGECLAVYTH--- 106 Query: 1088 HFTGRWDDRSCTEETHGFICQK 1109 +G DD+ C + + +IC+ Sbjct: 107 --SGFLDDKQCETQ-YNYICKT 125 Score = 81.7 bits (199), Expect = 7e-17 Identities = 28/129 (21%), Positives = 48/129 (36%), Gaps = 9/129 (6%) Query: 829 WLRFQEAEYKF--FEHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQH 886 W Y F + + AQ C + EL E FL LQ+ + Sbjct: 5 WRGAFNFCYYISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVKTSTN-- 62 Query: 887 WWIGLHTSESDGRFRWTDGS-IINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTA 945 ++ + + +G++ W G W+PG+P +C+ + D++C T Sbjct: 63 -FVWTNGEKHNGQWNWGTGHPAFTAPRWSPGQPD---GQGECLAVYTHSGFLDDKQCETQ 118 Query: 946 LPYICKRSN 954 YICK + Sbjct: 119 YNYICKTKH 127 Score = 77.4 bits (188), Expect = 1e-15 Identities = 30/142 (21%), Positives = 52/142 (36%), Gaps = 19/142 (13%) Query: 669 CPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELGAQLLSLASYEEEHFVANMLN 728 C +GW +CY + + D +S AQ AC++ +L E F+ +L Sbjct: 1 CNEGWRGAF--NFCYYISAFA---DIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQ 55 Query: 729 KIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVG-FSYHNFDRSRHDDDDIRGCAV 787 ++ ++ W + + W W G F+ + D C Sbjct: 56 RVK-------TSTNFVWTNGEKHN----GQWNWGTGHPAFTAPRWSPG--QPDGQGECLA 102 Query: 788 LDLASLQWVAMQCDTQLDWICK 809 + S QC+TQ ++ICK Sbjct: 103 VYTHSGFLDDKQCETQYNYICK 124 Score = 76.2 bits (185), Expect = 4e-15 Identities = 24/127 (18%), Positives = 41/127 (31%), Gaps = 7/127 (5%) Query: 521 CRKGWTWHSPSCYW--LGEDQVTYSEARRLCTDHGSQLVTITNRFEQAFVSSLIYNWEGE 578 C +GW CY+ D ++S A+ C +L +E F+ ++ + Sbjct: 1 CNEGWRGAFNFCYYISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVKTS 60 Query: 579 YFWTALQDLNSTGSFFWLSGDEVM-YTHWNRDQPGYSRGGCVALATGSAMGLWEVKNCTS 637 + G + W +G W+ QP G + K C + Sbjct: 61 TNFVWTNGEKHNGQWNWGTGHPAFTAPRWSPGQPDGQGECLAVYTH---SGFLDDKQCET 117 Query: 638 FRARYIC 644 YIC Sbjct: 118 QY-NYIC 123 Score = 67.7 bits (163), Expect = 1e-12 Identities = 25/136 (18%), Positives = 42/136 (30%), Gaps = 14/136 (10%) Query: 1261 CPQGLADSAWIPFREHCYSFHM-ELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHLQ 1319 C +G W CY + H A+ C++ G + E+ F+ + LQ Sbjct: 1 CNEG-----WRGAFNFCYYISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQ 55 Query: 1320 SYEGQSRGAWLGMNFNPKGGTLVWQDNTAV-NYSNWGPPGLGPSMLSHNSCYWIQSNSGL 1378 + + ++ N G W W P P C + ++SG Sbjct: 56 RVKTST--NFVWTNGEKHNGQWNWGTGHPAFTAPRWSPG--QPD--GQGECLAVYTHSGF 109 Query: 1379 WRPGACTNITMGVVCK 1394 C +CK Sbjct: 110 LDDKQCET-QYNYICK 124 Score = 52.5 bits (124), Expect = 5e-08 Identities = 21/112 (18%), Positives = 39/112 (34%), Gaps = 4/112 (3%) Query: 1136 TFRLLQKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGL-RTPLWIGLAGEEGSR 1194 A C+ L + PY FL + + + + ++ GE+ + Sbjct: 14 YISAFADIQSHSGAQAACKQQKGELFWPDLPYESFFLKKVLQRVKTSTNFVWTNGEKHNG 73 Query: 1195 RYSWVSEEP-LNYVGWQDGEPQQPGGCTYVDV-DGAWRTTSCDTKLQGAVCG 1244 +++W + P W G+P G C V G C+T+ +C Sbjct: 74 QWNWGTGHPAFTAPRWSPGQPDGQGECLAVYTHSGFLDDKQCETQY-NYICK 124 >gi|1109918 [24..147] C-type lectin-like Length = 124 Score = 102 bits (253), Expect = 4e-23 Identities = 25/127 (19%), Positives = 44/127 (33%), Gaps = 7/127 (5%) Query: 382 CEPSWQPFQGHCYRLQAEKRSWQESKKACLR---GGGDLVSIHSMAELEFITKQIKQEVE 438 C + + CY + + C + L + + AE F+T+Q++ Sbjct: 1 CPTGTKIYNAKCYAALGMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQ-LHA 59 Query: 439 ELWIGLNDLKLQMNFEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCNQ 498 L + DG +T++ EP + C+TI G W PC Sbjct: 60 NWHTWLEYVADGTYIVGDDGKPPIYTNFAVGEPFSVNV--GYCITIG-MNGYWYAQPCPD 116 Query: 499 SLPSICK 505 S +C+ Sbjct: 117 SHSVLCE 123 Score = 91.1 bits (223), Expect = 1e-19 Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 9/129 (6%) Query: 235 CETFWDKDQLTDSCYQFNFQSTLSWREAWASCEQQG---ADLLSITEIHEQTYINGLLTG 291 C T CY ++ A A C + A L + E ++ + Sbjct: 1 CPTGT--KIYNAKCYAA-LGMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQL 57 Query: 292 YSSTLWIGLNDLDTSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESSGGWQNRDC 351 +++ L + D P Y N+ +P + + C I +G W + C Sbjct: 58 HANW-HTWLEYVADGTYIVGDDGKPPIYTNFAVGEPFSVNVGYCITIGM--NGYWYAQPC 114 Query: 352 SIALPYVCK 360 + +C+ Sbjct: 115 PDSHSVLCE 123 Score = 84.1 bits (205), Expect = 2e-17 Identities = 23/126 (18%), Positives = 40/126 (31%), Gaps = 10/126 (7%) Query: 829 WLRFQEAEYKFFEHHSTWAQAQRICTWF---QAELTSVHSQAELDFLSHNLQKFSRAQEQ 885 + Y T A C A LT+ + AE +F++ +Q + Sbjct: 5 TKIYNAKCYAALGMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQLHANW--- 61 Query: 886 HWWIGLHTSESDGRFRWTDGSIINFISWAPGKPRPVGKDKKCVYMTASREDWGDQRCLTA 945 L DG + ++A G+P V C+ + W Q C + Sbjct: 62 --HTWLEYVADGTYIVGDDGKPPIYTNFAVGEPFSVNVG-YCITIG-MNGYWYAQPCPDS 117 Query: 946 LPYICK 951 +C+ Sbjct: 118 HSVLCE 123 Score = 81.7 bits (199), Expect = 8e-17 Identities = 27/141 (19%), Positives = 48/141 (33%), Gaps = 22/141 (15%) Query: 972 CPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQ---EAQLVTITNPLEQAFITASLP 1028 CP+ + KC+ G + A C + A+L T E F+T + Sbjct: 1 CPTGTKIYNAKCYAALGM----DMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQ 56 Query: 1029 N-VTFDLWIGLHASQRDFQWVEQEPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHSPSA 1087 + W+ A + +P +Y N+A GEP + + + Sbjct: 57 LHANWHTWLEYVADGTYIVGDDGKPPIYTNFAVGEPF--------SVNVGYCITIGM--- 105 Query: 1088 HFTGRWDDRSCTEETHGFICQ 1108 G W + C + H +C+ Sbjct: 106 --NGYWYAQPCPDS-HSVLCE 123 Score = 74.3 bits (180), Expect = 1e-14 Identities = 23/128 (17%), Positives = 40/128 (30%), Gaps = 10/128 (7%) Query: 521 CRKGWTWHSPSCYWLGEDQVTYSEARRLCTDHG---SQLVTITNRFEQAFVSSLIYNWEG 577 C G ++ CY +T A C ++L T E FV+ + Sbjct: 1 CPTGTKIYNAKCYAALGMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQLHAN 60 Query: 578 EYFWTALQDLNSTGSFFWLSGDEVMYTHWNRDQP-GYSRGGCVALATGSAMGLWEVKNCT 636 + W G +YT++ +P + G C+ + G W + C Sbjct: 61 WHTWLEYVADG--TYIVGDDGKPPIYTNFAVGEPFSVNVGYCITI---GMNGYWYAQPCP 115 Query: 637 SFRARYIC 644 +C Sbjct: 116 DSH-SVLC 122 Score = 73.1 bits (177), Expect = 3e-14 Identities = 25/144 (17%), Positives = 40/144 (27%), Gaps = 24/144 (16%) Query: 669 CPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELG---AQLLSLASYEEEHFVAN 725 CP G CY + A C++ A+L + +Y E +FV Sbjct: 1 CPTGTKIYN--AKCYAAL-----GMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTE 53 Query: 726 MLNKIFGESEPEIHEQHWFWIGLNRRDPRGGQSWRWSDGVGFSYHNFDRSRHDDDDIRGC 785 + H W+ DG Y NF ++ C Sbjct: 54 QVQL---------HANWHTWLEYVADG----TYIVGDDGKPPIYTNFAVGEPFSVNVGYC 100 Query: 786 AVLDLASLQWVAMQCDTQLDWICK 809 + + W A C +C+ Sbjct: 101 ITIGMNG-YWYAQPCPDSHSVLCE 123 Score = 68.1 bits (164), Expect = 1e-12 Identities = 27/137 (19%), Positives = 45/137 (32%), Gaps = 17/137 (12%) Query: 1261 CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGA---VLSILDEMENVFVWEH 1317 CP G + CY+ + A C++ G + + L EN FV Sbjct: 1 CPTG-----TKIYNAKCYAALGM-DMTRDNALAYCRKTYGPYARLTTPLTYAENNFV--- 51 Query: 1318 LQSYEGQSRGAWLGMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNSG 1377 + + G +V D Y+N+ P ++ C I N G Sbjct: 52 -TEQVQLHANWHTWLEYVADGTYIVGDDGKPPIYTNFAVG--EPFSVNVGYCITIGMN-G 107 Query: 1378 LWRPGACTNITMGVVCK 1394 W C + + V+C+ Sbjct: 108 YWYAQPCPD-SHSVLCE 123 Score = 55.6 bits (132), Expect = 5e-09 Identities = 19/112 (16%), Positives = 31/112 (26%), Gaps = 7/112 (6%) Query: 1137 FRLLQKPLRWHDALLLCES---HNASLAYVPDPYTQAFLTQAARGLRTPLWIGLAGEEGS 1193 + L + +AL C A L F+T+ + L Sbjct: 13 YAALGMDMTRDNALAYCRKTYGPYARLTTPLTYAENNFVTEQVQLH-ANWHTWLEYVADG 71 Query: 1194 RRYSWVSEEPLNYVGWQDGEPQQPGG--CTYVDVDGAWRTTSCDTKLQGAVC 1243 +P Y + GEP C + ++G W C +C Sbjct: 72 TYIVGDDGKPPIYTNFAVGEPFSVNVGYCITIGMNGYWYAQPCPDSH-SVLC 122 >gi|2706599 [57..185] C-type lectin-like Length = 129 Score = 100 bits (246), Expect = 2e-22 Identities = 29/132 (21%), Positives = 55/132 (40%), Gaps = 11/132 (8%) Query: 382 CEPSWQPFQGHCYRLQAEKRSWQESKKACLR-GGGDLVSIHSMAELEFITKQIKQEVEE- 439 C + C++L + ++ +AC+ GG L SI S E + +K + E Sbjct: 1 CAVGFTYVNKKCWKLITGPQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDKRIEN 60 Query: 440 LWIGLNDLKLQMN---FEWSDGSLVSFTHWHPFEPNNFRDSLEDCVTIW---GPEGRWND 493 LW GL + ++ + G+ + ++ PNN DC+ G+W Sbjct: 61 LWTGLICVGKNSFSCTWDVNSGTAAVYNNFAEGYPNN---VYGDCIHYMTTGTQAGQWAS 117 Query: 494 SPCNQSLPSICK 505 CN+++ +C+ Sbjct: 118 GSCNETMSFVCE 129 Score = 83.7 bits (204), Expect = 2e-17 Identities = 26/133 (19%), Positives = 47/133 (34%), Gaps = 11/133 (8%) Query: 235 CETFWDKDQLTDSCYQFNFQSTLSWREAWASC-EQQGADLLSITEIHEQTYINGLLTGYS 293 C + + C++ +A +C G+ L SI E + + Sbjct: 1 CAVGF--TYVNKKCWKL-ITGPQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDKR 57 Query: 294 ST-LWIGLNDLD---TSGGWQWSDNSPLKYLNWESDQPDNPSEENCGVIRTESS--GGWQ 347 LW GL + S W + + Y N+ P+N +C T + G W Sbjct: 58 IENLWTGLICVGKNSFSCTWDVNSGTAAVYNNFAEGYPNNVY-GDCIHYMTTGTQAGQWA 116 Query: 348 NRDCSIALPYVCK 360 + C+ + +VC+ Sbjct: 117 SGSCNETMSFVCE 129 Score = 82.9 bits (202), Expect = 3e-17 Identities = 22/130 (16%), Positives = 40/130 (29%), Gaps = 12/130 (9%) Query: 829 WLRFQEAEYKFFEHHSTWAQAQRICTWFQAE-LTSVHSQAELDFLSHNLQKFSRAQEQHW 887 + + +K A A C L S+ S E L ++ Sbjct: 5 FTYVNKKCWKLITGPQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDKRIEN---L 61 Query: 888 WIGLHTSE---SDGRFRWTDGSIINFISWAPGKPRPVGKDKKCVYMTASRED---WGDQR 941 W GL + G+ + ++A G P V D C++ + W Sbjct: 62 WTGLICVGKNSFSCTWDVNSGTAAVYNNFAEGYPNNVYGD--CIHYMTTGTQAGQWASGS 119 Query: 942 CLTALPYICK 951 C + ++C+ Sbjct: 120 CNETMSFVCE 129 Score = 81.7 bits (199), Expect = 7e-17 Identities = 28/129 (21%), Positives = 52/129 (39%), Gaps = 6/129 (4%) Query: 521 CRKGWTWHSPSCYWLGEDQVTYSEARRLCTD-HGSQLVTITNRFEQAFVSSLIYNWEGEY 579 C G+T+ + C+ L ++A C + GS L +I + E + + + E Sbjct: 1 CAVGFTYVNKKCWKLITGPQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDKRIEN 60 Query: 580 FWTALQDLN---STGSFFWLSGDEVMYTHWNRDQPGYSRGGCVAL-ATGSAMGLWEVKNC 635 WT L + + ++ SG +Y ++ P G C+ TG+ G W +C Sbjct: 61 LWTGLICVGKNSFSCTWDVNSGTAAVYNNFAEGYPNNVYGDCIHYMTTGTQAGQWASGSC 120 Query: 636 TSFRARYIC 644 ++C Sbjct: 121 NETM-SFVC 128 Score = 77.8 bits (189), Expect = 1e-15 Identities = 27/144 (18%), Positives = 52/144 (35%), Gaps = 22/144 (15%) Query: 972 CPSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQ-LVTITNPLEQAFITASLPN- 1029 C + KC+++ + ++A +C L +I + +E + + + Sbjct: 1 CAVGFTYVNKKCWKLITG----PQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDK 56 Query: 1030 VTFDLWIGLHA-----SQRDFQWVEQEPLMYANWAPGEPSGPSPAPSGNKPTSCAVVLHS 1084 +LW GL + +Y N+A G P+ N C + + Sbjct: 57 RIENLWTGLICVGKNSFSCTWDVNSGTAAVYNNFAEGYPN--------NVYGDCIHYMTT 108 Query: 1085 PSAHFTGRWDDRSCTEETHGFICQ 1108 + G+W SC ET F+C+ Sbjct: 109 GTQ--AGQWASGSC-NETMSFVCE 129 Score = 70.0 bits (169), Expect = 2e-13 Identities = 26/146 (17%), Positives = 49/146 (32%), Gaps = 22/146 (15%) Query: 669 CPQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQEL-GAQLLSLASYEEEHFVANML 727 C G+ + C+K+ + + A AC L G+ L S+ S E + + + Sbjct: 1 CAVGFTYVN--KKCWKLIT-----GPQRRADADEACMNLGGSTLFSIRSDIENRALVDFV 53 Query: 728 NKIFGESEPEIHEQHWFWIGLNRRDPRG-GQSWRWSDGVGFSYHNFDRSRHDDDDIRGCA 786 W GL +W + G Y+NF + ++ C Sbjct: 54 KD---------KRIENLWTGLICVGKNSFSCTWDVNSGTAAVYNNFA-EGYPNNVYGDCI 103 Query: 787 VLDLAS---LQWVAMQCDTQLDWICK 809 QW + C+ + ++C+ Sbjct: 104 HYMTTGTQAGQWASGSCNETMSFVCE 129 Score = 61.5 bits (147), Expect = 1e-10 Identities = 28/139 (20%), Positives = 48/139 (34%), Gaps = 15/139 (10%) Query: 1261 CPQGLADSAWIPFREHCYSFHMELLLGHKEARQRCQRAGGAVL-SILDEMENVFVWEHLQ 1319 C G + + C+ +A + C GG+ L SI ++EN + + ++ Sbjct: 1 CAVG-----FTYVNKKCWKLITG-PQRRADADEACMNLGGSTLFSIRSDIENRALVDFVK 54 Query: 1320 -SYEGQSRGAWLGMNFNPKGGTLVWQDNTAVNYSNWGPPGLGPSMLSHNSCYWIQSNS-- 1376 + + N T TA Y+N+ G + C + Sbjct: 55 DKRIENLWTGLICVGKNSFSCTWDVNSGTAAVYNNFAE---GYPNNVYGDCIHYMTTGTQ 111 Query: 1377 -GLWRPGACTNITMGVVCK 1394 G W G+C TM VC+ Sbjct: 112 AGQWASGSCNE-TMSFVCE 129 Score = 49.4 bits (116), Expect = 4e-07 Identities = 27/117 (23%), Positives = 40/117 (34%), Gaps = 11/117 (9%) Query: 1137 FRLLQKPLRWHDALLLCESHNAS-LAYVPDPYTQAFLTQAARGLR-TPLWIGL---AGEE 1191 ++L+ P R DA C + S L + L + R LW GL Sbjct: 13 WKLITGPQRRADADEACMNLGGSTLFSIRSDIENRALVDFVKDKRIENLWTGLICVGKNS 72 Query: 1192 GSRRYSWVSEEPLNYVGWQDGEPQ-QPGGCTYV----DVDGAWRTTSCDTKLQGAVC 1243 S + S Y + +G P G C + G W + SC+ + VC Sbjct: 73 FSCTWDVNSGTAAVYNNFAEGYPNNVYGDCIHYMTTGTQAGQWASGSCNETM-SFVC 128 >gi|2502062 [25..167] C-type lectin-like Length = 143 Score = 92.6 bits (227), Expect = 4e-20 Identities = 28/139 (20%), Positives = 49/139 (35%), Gaps = 19/139 (13%) Query: 386 WQPFQGHCYRLQAEKRSWQESKKACLRGGGDLVSIHSMAELEFITKQIK--QEVEELWIG 443 Q C+ L + ++ ++ +AC R G L+++ S + I+ + WIG Sbjct: 4 SQCVGNECFALFQDPVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSMDSRPWIG 63 Query: 444 L---------NDLKLQMNFEWSDGS-LVSFTHWHPFEPNNFRDSLEDCVTIWGP------ 487 L L F+W G S++ W + CVT+ Sbjct: 64 LQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAAPG 123 Query: 488 EGRWNDSPCNQSL-PSICK 505 E W + PC +C+ Sbjct: 124 EPAWEEKPCENETKGFLCE 142 Score = 82.1 bits (200), Expect = 6e-17 Identities = 33/150 (22%), Positives = 59/150 (39%), Gaps = 22/150 (14%) Query: 973 PSDWIQFLNKCFQVQGQEPQSRVKWSEAQFSCEQQEAQLVTITNPLEQAFITASLPNVT- 1031 P N+CF + V + +A +C++ + L+T+ + + I+ + + + Sbjct: 1 PKGSQCVGNECFALFQD----PVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSM 56 Query: 1032 -FDLWIGLH-----------ASQRDFQWVEQEPLM-YANWAPGEPSGPSPAPSGNKPTSC 1078 WIGL R FQWV + Y+ WA +P P Sbjct: 57 DSRPWIGLQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPND----QSPPLCGPLCV 112 Query: 1079 AVVLHSPSAHFTGRWDDRSCTEETHGFICQ 1108 V + +A W+++ C ET GF+C+ Sbjct: 113 TVSTATEAAPGEPAWEEKPCENETKGFLCE 142 Score = 78.2 bits (190), Expect = 9e-16 Identities = 24/140 (17%), Positives = 43/140 (30%), Gaps = 18/140 (12%) Query: 829 WLRFQEAEYKFFEHHSTWAQAQRICTWFQAELTSVHSQAELDFLSHNLQKFSRAQEQHW- 887 + F+ T+ A + C Q L +V S D + L S + W Sbjct: 4 SQCVGNECFALFQDPVTFLDASQACQRLQGHLMTVRSSVAADVI-SLLVSDSSMDSRPWI 62 Query: 888 -------WIGLHTSESDGRFRWTDGS-IINFISWAPGKPR-PVGKDKKCVYMTASRED-- 936 F+W G ++ WA + P CV ++ + E Sbjct: 63 GLQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAAP 122 Query: 937 ----WGDQRCLTAL-PYICK 951 W ++ C ++C+ Sbjct: 123 GEPAWEEKPCENETKGFLCE 142 Score = 75.8 bits (184), Expect = 4e-15 Identities = 25/138 (18%), Positives = 49/138 (35%), Gaps = 21/138 (15%) Query: 243 QLTDSCYQFNFQSTLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGY--SSTLWIGL 300 + + C+ FQ +++ +A +C++ L+++ I+ L++ S WIGL Sbjct: 6 CVGNECFAL-FQDPVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSMDSRPWIGL 64 Query: 301 ---------NDLDTSGGWQWSDNSPLK-YLNWESDQPDNP--SEENCGVIRT-----ESS 343 L G+QW Y W +P C + T Sbjct: 65 QLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAAPGE 124 Query: 344 GGWQNRDCSIAL-PYVCK 360 W+ + C ++C+ Sbjct: 125 PAWEEKPCENETKGFLCE 142 Score = 72.7 bits (176), Expect = 3e-14 Identities = 32/140 (22%), Positives = 51/140 (35%), Gaps = 18/140 (12%) Query: 523 KGWTWHSPSCYWLGEDQVTYSEARRLCTDHGSQLVTITNRFEQAFVSSLI-YNWEGEYFW 581 KG C+ L +D VT+ +A + C L+T+ + +S L+ + W Sbjct: 2 KGSQCVGNECFALFQDPVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSMDSRPW 61 Query: 582 TALQD---------LNSTGSFFWLSGDEVM-YTHWNRDQP---GYSRGGCVALATG---- 624 LQ L F W++GD Y+ W R CV ++T Sbjct: 62 IGLQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAA 121 Query: 625 SAMGLWEVKNCTSFRARYIC 644 WE K C + ++C Sbjct: 122 PGEPAWEEKPCENETKGFLC 141 Score = 58.7 bits (140), Expect = 7e-10 Identities = 21/140 (15%), Positives = 37/140 (26%), Gaps = 16/140 (11%) Query: 1270 WIPFREHCYSFHMELLLGHKEARQRCQRAGGAVLSILDEMENVFVWEHLQSYEGQS---- 1325 C++ + + +A Q CQR G ++++ + + + S Sbjct: 4 SQCVGNECFALFQD-PVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSMDSRPWI 62 Query: 1326 ----RGAWLGMNFNPKGGTLVWQDNTAV-NYSNWGPPGLGPSMLSHNSCYWI------QS 1374 W +YS W P L C + Sbjct: 63 GLQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAAP 122 Query: 1375 NSGLWRPGACTNITMGVVCK 1394 W C N T G +C+ Sbjct: 123 GEPAWEEKPCENETKGFLCE 142 Score = 58.7 bits (140), Expect = 7e-10 Identities = 29/157 (18%), Positives = 55/157 (34%), Gaps = 32/157 (20%) Query: 670 PQGWASDTKLRYCYKVFSSERLQDKKSWVQAQGACQELGAQLLSLASYEEEHFVANMLNK 729 P+G C+ +F D +++ A ACQ L L+++ S ++ +++ Sbjct: 1 PKGSQCVG--NECFALFQ-----DPVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSD 53 Query: 730 IFGESEPEIHEQHWFWIGLNRRDPRGGQS-------WRWSDGVG-FSYHNFDRSRHDDDD 781 +S WIGL G ++W G SY + R Sbjct: 54 SSMDSR--------PWIGLQLPQGCGDPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPP 105 Query: 782 IRG--CAVLDLASL------QWVAMQCDTQLD-WICK 809 + G C + A+ W C+ + ++C+ Sbjct: 106 LCGPLCVTVSTATEAAPGEPAWEEKPCENETKGFLCE 142 Score = 47.0 bits (110), Expect = 2e-06 Identities = 25/130 (19%), Positives = 39/130 (29%), Gaps = 23/130 (17%) Query: 1137 FRLLQKPLRWHDALLLCESHNASLAYVPDPYTQAFLTQAARGLRTP--LWIGL------- 1187 F L Q P+ + DA C+ L V ++ WIGL Sbjct: 12 FALFQDPVTFLDASQACQRLQGHLMTVRSSVAADVISLLVSDSSMDSRPWIGLQLPQGCG 71 Query: 1188 --AGEEGSRRYSWVSEEPL-NYVGWQDGEP----QQPGGCTYVDV-------DGAWRTTS 1233 R + WV+ + +Y W C V + AW Sbjct: 72 DPVHLGPLRGFQWVTGDNHTSYSRWARPNDQSPPLCGPLCVTVSTATEAAPGEPAWEEKP 131 Query: 1234 CDTKLQGAVC 1243 C+ + +G +C Sbjct: 132 CENETKGFLC 141 >gi|82035 [136..263] beta-Trefoil Length = 128 Score = 44.4 bits (104), Expect = 1e-05 Identities = 23/94 (24%), Positives = 35/94 (36%), Gaps = 5/94 (5%) Query: 44 FLIFSHGLQG-CLEAQGGQVRVTPACNTSLPAQRWKWVSRNRL-FNLGTMQCLGTGWPGT 101 F+ GL G CL+A G+V C + Q+W + + CL T Sbjct: 5 FVTTIVGLYGMCLQANSGKV-WLEDCTSEKAEQQWALYADGSIRPQQNRDNCLTTDANIK 63 Query: 102 NTTASLGMYECDREALNLRWHCRTLGDQLSLLLG 135 T + + C + RW + G L+L G Sbjct: 64 GTV--VKILSCGPVSSGQRWMFKNDGTILNLYNG 95 >gi|1655439 [28..139] C-type lectin-like Length = 112 Score = 37.3 bits (85), Expect = 0.002 Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 6/111 (5%) Query: 399 EKRSWQESKKACLRGGGDLVS--IHSMAELEFITKQIKQEVEELWIGLNDL-KLQMNFEW 455 E ++ ++ C G LVS + ++ I K W+G ++L K NF W Sbjct: 3 ETMNYADAGAYCQSRGMVLVSSAMRDSTMVKAILDFTKDRRYNYWVGADNLEKGAYNFLW 62 Query: 456 SDG-SLVSFTH-WHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCNQSLPSIC 504 +DG SL + + W P EP+N + S E CV IW ++D+ C + IC Sbjct: 63 NDGVSLPTDSDLWSPDEPSNPQSS-ELCVQIWNRYNLFDDTGCGNAKRVIC 112 Score = 30.3 bits (67), Expect = 0.21 Identities = 23/87 (26%), Positives = 43/87 (48%), Gaps = 8/87 (9%) Query: 256 TLSWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSS---TLWIGLNDLDTSG-GWQW 311 T+++ +A A C+ +G L+S + + + T + +L W+G ++L+ + W Sbjct: 4 TMNYADAGAYCQSRGMVLVS-SAMRDSTMVKAILDFTKDRRYNYWVGADNLEKGAYNFLW 62 Query: 312 SD--NSPLKYLNWESDQPDNP-SEENC 335 +D + P W D+P NP S E C Sbjct: 63 NDGVSLPTDSDLWSPDEPSNPQSSELC 89 Score = 28.3 bits (62), Expect = 0.93 Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 3/75 (4%) Query: 879 FSRAQEQHWWIGLHTSESDG-RFRWTDGSIINFIS--WAPGKPRPVGKDKKCVYMTASRE 935 F++ + ++W+G E F W DG + S W+P +P + CV + Sbjct: 38 FTKDRRYNYWVGADNLEKGAYNFLWNDGVSLPTDSDLWSPDEPSNPQSSELCVQIWNRYN 97 Query: 936 DWGDQRCLTALPYIC 950 + D C A IC Sbjct: 98 LFDDTGCGNAKRVIC 112 >gi|1685117 [55..161] C-type lectin-like Length = 107 Score = 37.2 bits (85), Expect = 0.002 Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 7/103 (6%) Query: 402 SWQESKKACLRGGGDLVSIHSMAELEFITKQIKQEVEEL-----WIGLNDLK--LQMNFE 454 S+ +++ C + GG LV A +I ++++ EL WIG ++ Sbjct: 5 SFDKAQAICKQTGGHLVHDFRGATSSYILSELERRKSELKPQLVWIGAQKEPGITSRTWK 64 Query: 455 WSDGSLVSFTHWHPFEPNNFRDSLEDCVTIWGPEGRWNDSPCN 497 W +G +V W +PNN+ V G WND CN Sbjct: 65 WVNGDVVQKPTWGKDQPNNYNGEQNCVVLDGGRNWLWNDVGCN 107 Score = 34.5 bits (78), Expect = 0.013 Identities = 30/103 (29%), Positives = 45/103 (43%), Gaps = 8/103 (7%) Query: 258 SWREAWASCEQQGADLLSITEIHEQTYINGLLTGYSSTL-----WIGLNDLD--TSGGWQ 310 S+ +A A C+Q G L+ +YI L S L WIG TS W+ Sbjct: 5 SFDKAQAICKQTGGHLVHDFRGATSSYILSELERRKSELKPQLVWIGAQKEPGITSRTWK 64 Query: 311 WSDNSPLKYLNWESDQPDN-PSEENCGVIRTESSGGWQNRDCS 352 W + ++ W DQP+N E+NC V+ + W + C+ Sbjct: 65 WVNGDVVQKPTWGKDQPNNYNGEQNCVVLDGGRNWLWNDVGCN 107 Score = 28.7 bits (63), Expect = 0.74 Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 4/99 (4%) Query: 848 QAQRICTWFQAELTSVHSQAELDFLSHNLQ-KFSRAQEQHWWIGLHTSE--SDGRFRWTD 904 +AQ IC L A ++ L+ + S + Q WIG + ++W + Sbjct: 8 KAQAICKQTGGHLVHDFRGATSSYILSELERRKSELKPQLVWIGAQKEPGITSRTWKWVN 67 Query: 905 GSIINFISWAPGKPRPVGKDKKCVYMTASRE-DWGDQRC 942 G ++ +W +P ++ CV + R W D C Sbjct: 68 GDVVQKPTWGKDQPNNYNGEQNCVVLDGGRNWLWNDVGC 106 Score = 27.5 bits (60), Expect = 1.8 Identities = 26/103 (25%), Positives = 40/103 (38%), Gaps = 9/103 (8%) Query: 541 TYSEARRLCTDHGSQLVTITNRFEQAFVSSLIYNWEGE----YFWTALQDLNSTGS--FF 594 ++ +A+ +C G LV +++ S + + E W Q S + Sbjct: 5 SFDKAQAICKQTGGHLVHDFRGATSSYILSELERRKSELKPQLVWIGAQKEPGITSRTWK 64 Query: 595 WLSGDEVMYTHWNRDQPGYSRG--GCVALATGSAMGLWEVKNC 635 W++GD V W +DQP G CV L G LW C Sbjct: 65 WVNGDVVQKPTWGKDQPNNYNGEQNCVVL-DGGRNWLWNDVGC 106 Score = 27.1 bits (59), Expect = 1.9 Identities = 15/70 (21%), Positives = 33/70 (46%), Gaps = 5/70 (7%) Query: 695 KSWVQAQGACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNRRDPR 754 +S+ +AQ C++ G L+ ++ + L + E +P++ WIG + Sbjct: 4 ESFDKAQAICKQTGGHLVHDFRGATSSYILSELERRKSELKPQL-----VWIGAQKEPGI 58 Query: 755 GGQSWRWSDG 764 ++W+W +G Sbjct: 59 TSRTWKWVNG 68 >gi|438998 [1..297] S-adenosyl-L-methionine-dependent methyltransferases Length = 297 Score = 28.7 bits (63), Expect = 0.73 Identities = 13/41 (31%), Positives = 19/41 (45%), Gaps = 2/41 (4%) Query: 1125 APGTELSYLNGTFRLLQKPLRWHDALLLCESHNASLAYVPD 1165 APGT + YL F L ++W L+ H+ L + D Sbjct: 70 APGTHIRYLRDHFYNLGVIIKW--MLIDGRHHDPILNGLRD 108 >gi|2507361 [53..298] Periplasmic binding protein-like I Length = 246 Score = 28.4 bits (63), Expect = 0.87 Identities = 11/72 (15%), Positives = 22/72 (30%), Gaps = 6/72 (8%) Query: 266 CEQQGADLLSITEIH------EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNSPLKY 319 + G DL + +Q I+ + + +G + + + P+ Sbjct: 18 ARRYGVDLKVLEAGGYSQLATQQAQIDQCKQWGAEAILLGSSTTSFPDLQKQVASLPVIE 77 Query: 320 LNWESDQPDNPS 331 L D P S Sbjct: 78 LVNAIDAPQVKS 89 >gi|115715 [1..97] Ferredoxin-like Length = 97 Score = 28.0 bits (62), Expect = 1.2 Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 3/84 (3%) Query: 815 DVREPDDSPQGRREWLRFQEAEY-KFFEHHSTWAQAQRICT--WFQAELTSVHSQAELDF 871 V+ P D + L+ E E + + TW RI + + V S + Sbjct: 8 TVKLPVDMDPAKAARLKADEKELAQRLQREGTWRHLWRIAVGHYANYSVFDVSSVEACND 67 Query: 872 LSHNLQKFSRAQEQHWWIGLHTSE 895 L F + + H S Sbjct: 68 TLMQLPLFPYMDIEVDGLCRHPSS 91 >gi|2500689 [29..252] Periplasmic binding protein-like I Length = 224 Score = 27.3 bits (60), Expect = 1.8 Identities = 9/72 (12%), Positives = 21/72 (28%), Gaps = 7/72 (9%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGG---WQWSDNSPLK 318 ++ G D + T + + +T + IG ++ + S N P+ Sbjct: 27 AQELGMDYVEFTSENSPAKQMDNARTAVTKGVDAIVIGPVSSTSTPPLLAYLKSQNIPIA 86 Query: 319 YLNWESDQPDNP 330 + Sbjct: 87 FAGIGPQPGQTD 98 >gi|1536958 [63..316] Periplasmic binding protein-like I Length = 254 Score = 27.3 bits (60), Expect = 2.0 Identities = 7/71 (9%), Positives = 17/71 (23%), Gaps = 5/71 (7%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNS-PLKYL 320 Q G ++ + E YI +L I + + + Sbjct: 27 ASQNGYQVMIGNSNYSQESEDRYIESMLLLGVDGFIIQPTSNFRKYSRIIDEKKKKMVFF 86 Query: 321 NWESDQPDNPS 331 + + + Sbjct: 87 DSQLYEHRTSW 97 >gi|2650282 [2..154] P-loop containing nucleotide triphosphate hydrolases Length = 153 Score = 27.1 bits (59), Expect = 2.1 Identities = 17/123 (13%), Positives = 30/123 (23%), Gaps = 8/123 (6%) Query: 1355 GPPGLGPSMLSHNSCYWIQSNSGLWR---PGACTNITMGVVCKLPRAEQSSFSPSALPEN 1411 G G G + L + Y N +G + + + Sbjct: 26 GQYGTGKTTLGLHFIYEGLKNGEACMIISFDEDEESIIGDAKSVGMDLTAFGDKVHIVRL 85 Query: 1412 PAALVVVLMAVLLLLALLTAA-----LILYRRRQSIERGAFEGARYSRSSSSPTEATEKN 1466 A+ V + L +L +E + RYS ++ E Sbjct: 86 EASEVKKSLEKLESDLPEIVRSLGVSRMLVDSISVLETLFDDAGRYSMLAAFRKMLKENG 145 Query: 1467 ILV 1469 I Sbjct: 146 ITA 148 >gi|1708629 [67..339] Periplasmic binding protein-like I Length = 273 Score = 26.9 bits (59), Expect = 2.5 Identities = 7/71 (9%), Positives = 22/71 (30%), Gaps = 5/71 (7%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNS-PLKYL 320 C+Q G ++ + E+ + L L + + +++ P + Sbjct: 27 CDQYGYSIMVCNTDNSPEKEREMLLKLEAHSVEGLILNATGENKDVLRAFAEQQIPTILI 86 Query: 321 NWESDQPDNPS 331 + + + Sbjct: 87 DRKLPDLKLDT 97 >gi|797337 [17..308] Periplasmic binding protein-like I Length = 292 Score = 26.1 bits (57), Expect = 3.7 Identities = 10/73 (13%), Positives = 21/73 (28%), Gaps = 8/73 (10%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIG---LNDLDTSGGWQWSDNSPLK 318 G ++ T E+ + LL L I +++D + P+ Sbjct: 27 ASAHGFMVILATSDEDIALERDAVRVLLAKRVDGLIISPTSSSEVDHLISAS-ERDCPVV 85 Query: 319 YLNWESDQPDNPS 331 L+ + Sbjct: 86 LLDRRVPVLRADT 98 >gi|729214 [63..305] Periplasmic binding protein-like I Length = 243 Score = 25.7 bits (56), Expect = 5.4 Identities = 11/69 (15%), Positives = 17/69 (23%), Gaps = 3/69 (4%) Query: 266 CEQQGADLLSITEIHEQT---YINGLLTGYSSTLWIGLNDLDTSGGWQWSDNSPLKYLNW 322 + G + I E LL + L + + N Sbjct: 26 ASEFGWNSFLINIFSEDDAARAARQLLAHRPDGIIYTTMGLRHITLPESLYGENIVLANC 85 Query: 323 ESDQPDNPS 331 +D P PS Sbjct: 86 VADDPALPS 94 >gi|2244909 [587..959] beta/alpha (TIM)-barrel Length = 373 Score = 25.7 bits (56), Expect = 5.6 Identities = 16/62 (25%), Positives = 23/62 (36%), Gaps = 12/62 (19%) Query: 1158 ASLAYVPDPYTQAFL----------TQAARGLRTPLWIGLAGEEGSRRYS--WVSEEPLN 1205 A DP+ T+ R R L +G+ GE G S + +E L+ Sbjct: 293 AKGILQHDPFEVLDQQGVGQLIKMATEKGRAARPSLKVGICGEHGGDPSSVGFFAEAGLD 352 Query: 1206 YV 1207 YV Sbjct: 353 YV 354 >gi|137196 [30..266] P-loop containing nucleotide triphosphate hydrolases Length = 237 Score = 25.5 bits (54), Expect = 7.2 Identities = 3/23 (13%), Positives = 3/23 (13%) Query: 1355 GPPGLGPSMLSHNSCYWIQSNSG 1377 GP S Sbjct: 31 GPSKSFKSNFGLTMVSSYMRQYP 53 >gi|586847 [59..303] Periplasmic binding protein-like I Length = 245 Score = 25.3 bits (55), Expect = 7.5 Identities = 9/70 (12%), Positives = 16/70 (22%), Gaps = 5/70 (7%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNSPLKYLN 321 Q + + E Y+ L T L I + + Sbjct: 27 AFQHEYATTLLPTNYNPDIEIKYLELLRTKKIDGLIITSRANHWDSILAYQEY-GPVIAC 85 Query: 322 WESDQPDNPS 331 ++ D P Sbjct: 86 EDTGDIDVPC 95 >gi|628933 [59..301] Periplasmic binding protein-like I Length = 243 Score = 25.3 bits (55), Expect = 7.6 Identities = 7/71 (9%), Positives = 16/71 (21%), Gaps = 5/71 (7%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNS-PLKYL 320 C+ G +S T+ E + + + ++ P + Sbjct: 32 CKNHGYATISTTKGQSIEIELEVQEMIHYHSVDGFIVLYSKKSDPIIDILKSHAMPYVII 91 Query: 321 NWESDQPDNPS 331 D Sbjct: 92 GKPLTDDDIIH 102 >gi|2500694 [73..586] Periplasmic binding protein-like II Length = 514 Score = 25.3 bits (55), Expect = 8.3 Identities = 11/87 (12%), Positives = 22/87 (24%), Gaps = 21/87 (24%) Query: 264 ASCEQQGADLLSITEIHEQTYINGLLTGYSSTLWIGLNDLDTS-GGWQWSDNSPLKYLN- 321 + ++ G + + + Y N D D + W S + + + Sbjct: 400 QNLKRIGIE-ARVRTVDPSQYTNRKRAF----------DYDVTWEVWGQSLSPGNEQADY 448 Query: 322 WESDQPDNPSEENCGVIRTESSGGWQN 348 W S N G + Sbjct: 449 WGSAAATRQGSRN--------YAGISD 467 >gi|121769 [3..207] Class I glutamine amidotransferases Length = 205 Score = 25.0 bits (54), Expect = 8.5 Identities = 9/49 (18%), Positives = 17/49 (34%), Gaps = 5/49 (10%) Query: 702 GACQELGAQLLSLASYEEEHFVANMLNKIFGESEPEIHEQHWFWIGLNR 750 G C G Q +++ V + FG ++ E+ G+ Sbjct: 82 GVC--YGMQTMAMQLGGH---VEASNEREFGYAQVEVVNDSALVRGIED 125 >gi|1616941 [10..247] Metzincin-like Length = 238 Score = 24.9 bits (54), Expect = 9.0 Identities = 3/19 (15%), Positives = 5/19 (25%) Query: 176 YTIQGNSHGKPCTIPFKYD 194 + G + P D Sbjct: 219 QNTGQDFKGAYSSAPLLDD 237 >gi|1762781 [797..951] C2 domain-like Length = 155 Score = 24.9 bits (54), Expect = 9.0 Identities = 6/27 (22%), Positives = 10/27 (36%), Gaps = 1/27 (3%) Query: 769 YHNFDR-SRHDDDDIRGCAVLDLASLQ 794 + D S +D L+SL+ Sbjct: 102 RCSSDGHSYNDKTLPLATYTAKLSSLK 128 >gi|2506563 [66..342] Periplasmic binding protein-like I Length = 277 Score = 24.9 bits (54), Expect = 9.5 Identities = 8/72 (11%), Positives = 19/72 (26%), Gaps = 6/72 (8%) Query: 266 CEQQGADLLSITEIH----EQTYINGLLTGYSSTLWIGLNDLDTSGGWQWSDNSPLKY-- 319 E QG + + + LL + I + + ++ + Sbjct: 25 LEAQGRMVFLLHGGKDGEQLAQRFSLLLNQGVDGVVIAGAAGSSDDLRRMAEEKAIPVIF 84 Query: 320 LNWESDQPDNPS 331 + S D + Sbjct: 85 ASRASYLDDVDT 96 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 27 Number of calls to ALIGN: 75 Length of query: 1479 Total length of test sequences: 256703 Effective length of test sequences: 209547.0 Effective search space size: 301674868.1 Initial X dropoff for ALIGN: 25.0 bits