analysis of sequence from tem9.fa ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] MRGAPARLLLPLLPWLLLLLAPEARGAPGCPLSIRSCKCSGERPKGLSGGVPGPARRRVVCSGGDLPEPP EPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCL TSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCDCHLRWLLPWAQNRSLQLSEHTL CAYPSALHAQALGSLQEAQLCCEGALELHTHHLIPSLRQVVFQGDRLPFQCSASYLGNDTRIRWYHNRAP VEGDEQAGILLAESLIHDCTFITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAER VANNRGDFRWPRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAGRWEPGDYSHCLYTNDITRVL YTFVLMPINASNALTLAHQLRVYTAEAASFSDMMDVVYVAQMIQKFLGYVDQIKELVEVMVDMASNLMLV DEHLLWLAQREDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYLIKPHSYVGLTCTAFQRREGG VPGTRPGSPGQNPPPEPEPPADQQLRFRCTTGRPNVSLSSFHIKNSVALASIQLPPSLFSSLPAALAPPV PPDCTLQLLVFRNGRLFHSHSNTSRPGAAGPGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAE PVAAWWSQEGPGEAGGWTSEGCQLRSSQPNVSALHCQHLGNVAVLMELSAFPREVGGAGAGLHPVVYPCT ALLLLCLFATIITYILNHSSIRVSRKGWHMLLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSL STLLWMGVKARVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNIHNYRDHSPYC WLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQNPKAGNSRASLEAGEELRGSTRLRGSGPLLS DSGSLLATGSARVGTPGPPEDGDSLYSPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLYGVA ASALGLFVFTHHCARRRDVRASWRACCPPASPAAPHAPPRALPAAAEDGSPVFGEGPPSLKSSPSGSSGH PLALGPCKLTNLQLAQSQVCEAGAAAGGEGEPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACG KNRLKALRGGAAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSDTSAAPLSEAG RAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPKGGKYDDVTLMGAEVASGGCMKTGLWKSETT V ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > gi|17511205|ref|NP_116166.6| . . . . . 1 MRGAPARLLLPLLPWLLLLLAPEARGAPGCPLSIRSCKCSGERPKGLSGG 50 _____EEEEEE__HHHHHH_____________EEEEEE____________ . . . . . 51 VPGPARRRVVCSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGL 100 ______EEEEE__________________EEEEE____EEE_______HH . . . . . 101 SLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCLTSETFQGLPR 150 HHHHHH______EEE___________EEEE______EEEEEEE_EEE___ . . . . . 151 LLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCDCHLRWLLPWAQN 200 EEEEE___EEE____________EEEEEE____EEEHHHHHHH_HHHHHH . . . . . 201 RSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHTHHLIPSLRQV 250 HHHHHHHHEEEE___HHHHHHHHHHHHHHHHH__HHHHHHHH_HHHHHEE . . . . . 251 VFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGILLAESLIHDCT 300 EEE_________EEE____EEEE____________HHHHHHHHHHHH___ . . . . . 301 FITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAER 350 EEE__HHHHH_EEE_____EEEEEE_______EEEEEEEE_______HHH . . . . . 351 VANNRGDFRWPRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAG 400 HHH__________HHHHHHHHHH___________________HHHHHHH_ . . . . . 401 RWEPGDYSHCLYTNDITRVLYTFVLMPINASNALTLAHQLRVYTAEAASF 450 _________EEE____HHHHHHHH________HHHHHHHHHHHHHHHHHH . . . . . 451 SDMMDVVYVAQMIQKFLGYVDQIKELVEVMVDMASNLMLVDEHLLWLAQR 500 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH____HHHHHHHHHHHHHH . . . . . 501 EDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYLIKPHSYVGLT 550 H__HHHHHHHH_EEE__________EEE___HHHHHEEEEE____EEEEE . . . . . 551 CTAFQRREGGVPGTRPGSPGQNPPPEPEPPADQQLRFRCTTGRPNVSLSS 600 EEE________________________________EEEEE__________ . . . . . 601 FHIKNSVALASIQLPPSLFSSLPAALAPPVPPDCTLQLLVFRNGRLFHSH 650 _____HHHHHHH___________EEE__________EEEEE____EEE__ . . . . . 651 SNTSRPGAAGPGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAE 700 ___________________EEEEE_______________EEE_______H . . . . . 701 PVAAWWSQEGPGEAGGWTSEGCQLRSSQPNVSALHCQHLGNVAVLMELSA 750 HHHHHHH_______________EEE______EEE_______HHHHHHHH_ . . . . . 751 FPREVGGAGAGLHPVVYPCTALLLLCLFATIITYILNHSSIRVSRKGWHM 800 __________EEEEEEE___HHHHHHHHHHHHHHHH_____EEEE__HHH . . . . . 801 LLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSLSTLLWMGVKA 850 HHHHHEEEEHHHHHH______HHHHHHHHHHH________HHHHHH__HH . . . . . 851 RVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNI 900 HHHHHHHHH____________________EEE_____EEEEE____EEE_ . . . . . 901 HNYRDHSPYCWLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQN 950 _________EEEEE_____EEEHHHHHHHHHHHHHHHHHHH_________ . . . . . 951 PKAGNSRASLEAGEELRGSTRLRGSGPLLSDSGSLLATGSARVGTPGPPE 1000 _____HHHHHHHHHHH__________________EEEE____________ . . . . . 1001 DGDSLYSPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLYGVA 1050 ____________EEEEEEHHHHHHHHH___________EEEEEEEEHHHH . . . . . 1051 ASALGLFVFTHHCARRRDVRASWRACCPPASPAAPHAPPRALPAAAEDGS 1100 HHHHHHEEEE______HHHHHHHH_________________HHHHH____ . . . . . 1101 PVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVCEAGAAAGGEG 1150 ____________________EEE______HHHHHHHHHHH__________ . . . . . 1151 EPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACGKNRLKALRGG 1200 __________________________________________HHHHHHH_ . . . . . 1201 AAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSD 1250 __HHHHHHH_________________________________________ . . . . . 1251 TSAAPLSEAGRAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPK 1300 ___HHHHHHH____HHHHH_________HHHHHH________________ . . . 1301 GGKYDDVTLMGAEVASGGCMKTGLWKSETT 1330 ______EEEE_EEEE______EEEE_____ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 37.8 % beta-contents : 5.0 % coil-contents : 57.2 % class : alpha method : 2 alpha-contents : 13.8 % beta-contents : 13.7 % coil-contents : 72.5 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -4.44 0.00 0.00 0.00 0.00 0.00 -8.00 0.00 0.00 -6.52 -1.81 -12.00 -12.00 -4.00 -12.00 0.00 -60.76 0.27 0.00 0.00 0.00 0.00 0.00 -12.00 0.00 -0.02 -6.52 -1.81 -12.00 -12.00 -4.00 -12.00 0.00 -60.07 ID: gi|17511205|ref|NP_116166.6| AC: xxx Len: 1330 1:I 1316 Sc: -60.07 Pv: 3.710430e-01 NO_GPI_SITE GPI: learning from protozoa -22.75 -0.37 -0.56 -0.10 -4.00 0.00 -4.00 0.00 0.00 -5.66 -7.09 -12.00 -12.00 -4.00 -12.00 0.00 -84.52 -14.55 0.00 0.00 0.00 -4.00 0.00 -16.00 0.00 -0.36 -5.66 -7.09 -12.00 -12.00 0.00 -12.00 0.00 -83.65 ID: gi|17511205|ref|NP_116166.6| AC: xxx Len: 1330 1:I 1317 Sc: -83.65 Pv: 5.761400e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|17511205 1.000 27 Y 0.910 27 Y 0.986 926 Y 0.938 Y # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|17511205 0.692 27 Y 0.615 27 Y 0.979 775 Y 0.718 Y # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|17511205 0.653 897 Y 0.635 897 Y 0.971 774 Y 0.265 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] 1-1 M rgaparlllpllpwlllllapeargapg 2-29 30-62 CPLSIRSCKCSGERPKGLSGGVPGPARRRV VCS ggdlpeppepgllp 63-76 77-572 NGTVTLLLSNNKITGLRNGSFLGLSLLEKL DLRNNIISTVQPGAFLGLGELKRLDLSNNR IGCLTSETFQGLPRLLRLNISGNIFSSLQP GVFDELPALKVVDLGTEFLTCDCHLRWLLP WAQNRSLQLSEHTLCAYPSALHAQALGSLQ EAQLCCEGALELHTHHLIPSLRQVVFQGDR LPFQCSASYLGNDTRIRWYHNRAPVEGDEQ AGILLAESLIHDCTFITSELTLSHIGVWAS GEWECTVSMAQGNASKKVEIVVLETSASYC PAERVANNRGDFRWPRTLAGITAYQSCLQY PFTSVPLGGGAPGTRASRRCDRAGRWEPGD YSHCLYTNDITRVLYTFVLMPINASNALTL AHQLRVYTAEAASFSDMMDVVYVAQMIQKF LGYVDQIKELVEVMVDMASNLMLVDEHLLW LAQREDKACSRIVGALERIGGAALSPHAQH ISVNARNVALEAYLIKPHSYVGLTCTAFQR REGGVPGTRPGSPGQN pppepepp 573-580 581-613 ADQQLRFRCTTGRPNVSLSSFHIKNSVALA SIQ lppslfsslpaalappvpp 614-632 633-654 DCTLQLLVFRNGRLFHSHSNTS rpgaagpgkrrg 655-666 667-768 VATPVIFAGTSGCGVGNLTEPVAVSLRHWA EGAEPVAAWWSQEGPGEAGGWTSEGCQLRS SQPNVSALHCQHLGNVAVLMELSAFPREVG GAGAGLHPVVYP ctallllclfatii 769-782 783-973 TYILNHSSIRVSRKGWHMLLNLCFHIAMTS AVFAGGITLTNYQMVCQAVGITLHYSSLST LLWMGVKARVLHKELTWRAPPPQEGDPALP TPSPMLRFYLIAGGIPLIICGITAAVNIHN YRDHSPYCWLVWRPSLGAFYIPVALILLIT WIYFLCAGLRLRGPLAQNPKAGNSRASLEA GEELRGSTRLR gsgpllsdsgsll 974-986 987-1070 ATGSARVGTPGPPEDGDSLYSPGVQLGALV TTHFLYLAMWACGALAVSQRWLPRVVCSCL YGVAASALGLFVFTHHCARRRDVR aswraccppaspaaphappralpaaa 1071-1096 1097-1103 EDGSPVF gegppslksspsgssghplalgp 1104-1126 1127-1140 CKLTNLQLAQSQVC eagaaaggegepepag 1141-1156 1157-1192 TRGNLAHRHPNNVHHGRRAHKSRAKGHRAG EACGKN rlkalrggaagalellssesgsl 1193-1215 1216-1256 HNSPTDSYLGSSRNSPGAGLQLEGEPMLTP SEGSDTSAAPL seagragqrrsasrds 1257-1272 1273-1331 LKGGGALEKESHRRSYPLNAASLNGAPKGG KYDDVTLMGAEVASGGCMKTGLWKSETTV low complexity regions: SEG 25 3.0 3.3 >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] 1-1 M rgaparlllpllpwlllllapeargapgcp 2-114 lsirsckcsgerpkglsggvpgparrrvvc sggdlpeppepgllpngtvtlllsnnkitg lrngsflglsllekldlrnniis 115-201 TVQPGAFLGLGELKRLDLSNNRIGCLTSET FQGLPRLLRLNISGNIFSSLQPGVFDELPA LKVVDLGTEFLTCDCHLRWLLPWAQNR slqlsehtlcaypsalhaqalgslqeaqlc 202-243 cegalelhthhl 244-555 IPSLRQVVFQGDRLPFQCSASYLGNDTRIR WYHNRAPVEGDEQAGILLAESLIHDCTFIT SELTLSHIGVWASGEWECTVSMAQGNASKK VEIVVLETSASYCPAERVANNRGDFRWPRT LAGITAYQSCLQYPFTSVPLGGGAPGTRAS RRCDRAGRWEPGDYSHCLYTNDITRVLYTF VLMPINASNALTLAHQLRVYTAEAASFSDM MDVVYVAQMIQKFLGYVDQIKELVEVMVDM ASNLMLVDEHLLWLAQREDKACSRIVGALE RIGGAALSPHAQHISVNARNVALEAYLIKP HSYVGLTCTAFQ rreggvpgtrpgspgqnpppepepp 556-580 581-593 ADQQLRFRCTTGR pnvslssfhiknsvalasiqlppslfsslp 594-640 aalappvppdctlqllv 641-685 FRNGRLFHSHSNTSRPGAAGPGKRRGVATP VIFAGTSGCGVGNLT epvavslrhwaegaepvaawwsqegpgeag 686-721 gwtseg 722-955 CQLRSSQPNVSALHCQHLGNVAVLMELSAF PREVGGAGAGLHPVVYPCTALLLLCLFATI ITYILNHSSIRVSRKGWHMLLNLCFHIAMT SAVFAGGITLTNYQMVCQAVGITLHYSSLS TLLWMGVKARVLHKELTWRAPPPQEGDPAL PTPSPMLRFYLIAGGIPLIICGITAAVNIH NYRDHSPYCWLVWRPSLGAFYIPVALILLI TWIYFLCAGLRLRGPLAQNPKAGN srasleageelrgstrlrgsgpllsdsgsl 956-1009 latgsarvgtpgppedgdslyspg 1010-1063 VQLGALVTTHFLYLAMWACGALAVSQRWLP RVVCSCLYGVAASALGLFVFTHHC arrrdvraswraccppaspaaphappralp 1064-1126 aaaedgspvfgegppslksspsgssghpla lgp 1127-1140 CKLTNLQLAQSQVC eagaaaggegepepagtrgnlahrhpnnvh 1141-1235 hgrrahksrakghrageacgknrlkalrgg aagalellssesgslhnsptdsylgssrns pgagl 1236-1242 QLEGEPM ltpsegsdtsaaplseagragqrrsasrds 1243-1287 lkgggalekeshrrs 1288-1331 YPLNAASLNGAPKGGKYDDVTLMGAEVASG GCMKTGLWKSETTV low complexity regions: SEG 45 3.4 3.75 >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] 1-1 M rgaparlllpllpwlllllapeargapgcp 2-175 lsirsckcsgerpkglsggvpgparrrvvc sggdlpeppepgllpngtvtlllsnnkitg lrngsflglsllekldlrnniistvqpgaf lglgelkrldlsnnrigcltsetfqglprl lrlnisgnifsslqpgvfdelpal 176-189 KVVDLGTEFLTCDC hlrwllpwaqnrslqlsehtlcaypsalha 190-267 qalgslqeaqlccegalelhthhlipslrq vvfqgdrlpfqcsasylg 268-555 NDTRIRWYHNRAPVEGDEQAGILLAESLIH DCTFITSELTLSHIGVWASGEWECTVSMAQ GNASKKVEIVVLETSASYCPAERVANNRGD FRWPRTLAGITAYQSCLQYPFTSVPLGGGA PGTRASRRCDRAGRWEPGDYSHCLYTNDIT RVLYTFVLMPINASNALTLAHQLRVYTAEA ASFSDMMDVVYVAQMIQKFLGYVDQIKELV EVMVDMASNLMLVDEHLLWLAQREDKACSR IVGALERIGGAALSPHAQHISVNARNVALE AYLIKPHSYVGLTCTAFQ rreggvpgtrpgspgqnpppepeppadqql 556-670 rfrcttgrpnvslssfhiknsvalasiqlp pslfsslpaalappvppdctlqllvfrngr lfhshsntsrpgaagpgkrrgvatp 671-939 VIFAGTSGCGVGNLTEPVAVSLRHWAEGAE PVAAWWSQEGPGEAGGWTSEGCQLRSSQPN VSALHCQHLGNVAVLMELSAFPREVGGAGA GLHPVVYPCTALLLLCLFATIITYILNHSS IRVSRKGWHMLLNLCFHIAMTSAVFAGGIT LTNYQMVCQAVGITLHYSSLSTLLWMGVKA RVLHKELTWRAPPPQEGDPALPTPSPMLRF YLIAGGIPLIICGITAAVNIHNYRDHSPYC WLVWRPSLGAFYIPVALILLITWIYFLCA glrlrgplaqnpkagnsrasleageelrgs 940-1318 trlrgsgpllsdsgsllatgsarvgtpgpp edgdslyspgvqlgalvtthflylamwacg alavsqrwlprvvcsclygvaasalglfvf thhcarrrdvraswraccppaspaaphapp ralpaaaedgspvfgegppslksspsgssg hplalgpckltnlqlaqsqvceagaaagge gepepagtrgnlahrhpnnvhhgrrahksr akghrageacgknrlkalrggaagalells sesgslhnsptdsylgssrnspgaglqleg epmltpsegsdtsaaplseagragqrrsas rdslkgggalekeshrrsyplnaaslngap kggkyddvtlmgaevasgg 1319-1331 CMKTGLWKSETTV low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] MRGAPARLllpllpwlllllapeargapgcpLSIRSCKCSGERPKGLSGGVPGPARRRVV CSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGA FLGLGELKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDL GTEFLTCDCHLRWLLPWAQNRSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHT HHLIPSLRQVVFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGILLAESLIHDCT FITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAERVANNRGDFRW PRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAGRWEPGDYSHCLYTNDITRVL YTFVLMPINASNALTLAHQLRVYTAEAASFSDMMDVVYVAQMIQKFLGYVDQIKELVEVM VDMASNLMLVDEHLLWLAQREDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYL IKPHSYVGLTCTAFQRREggvpgtrpgspgqnpppepeppadqqlrfrCTTGRPNVSLSS FHIKNSVALASIQlppslfsslpaalappvppDCTLQLLVFRNGRLFHSHSNTSRPGAAG PGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAEPVAAWWSQEGPGEAGGWTSE GCQLRSSQPNVSALHCQHLGNVAVLMELSAFPREVGGAGAGLHPVVYPCTALLLLCLFAT IITYILNHSSIRVSRKGWHMLLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSL STLLWMGVKARVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNI HNYRDHSPYCWLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQNPKAGNSRASL EAGEELRGSTRLRGSGPLLSDSGSLLATGSARVGTPGPPEDGDSLYSPGVQLGALVTTHF LYLAMWACGALAVSQRWLPRVVCSCLYGVAASALGLFVFTHHCARRRDVRASWRAccppa spaaphappralpaaaedgspVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVC EagaaaggegepepAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACGKNRLKALRGG AAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSDTSAAPLSEAG RAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPKGGKYDDVTLMGAEVASGGCM KTGLWKSETTV 1 - 8 MRGAPARL 9 - 31 ll pllpwlllll apeargapgc p 32 - 558 LSIRSCKCS GERPKGLSGG VPGPARRRVV CSGGDLPEPP EPGLLPNGTV TLLLSNNKIT G LRNGSFLGL SLLEKLDLRN NIISTVQPGA FLGLGELKRL DLSNNRIGCL TSETFQGLPR L LRLNISGNI FSSLQPGVFD ELPALKVVDL GTEFLTCDCH LRWLLPWAQN RSLQLSEHTL C AYPSALHAQ ALGSLQEAQL CCEGALELHT HHLIPSLRQV VFQGDRLPFQ CSASYLGNDT R IRWYHNRAP VEGDEQAGIL LAESLIHDCT FITSELTLSH IGVWASGEWE CTVSMAQGNA S KKVEIVVLE TSASYCPAER VANNRGDFRW PRTLAGITAY QSCLQYPFTS VPLGGGAPGT R ASRRCDRAG RWEPGDYSHC LYTNDITRVL YTFVLMPINA SNALTLAHQL RVYTAEAASF S DMMDVVYVA QMIQKFLGYV DQIKELVEVM VDMASNLMLV DEHLLWLAQR EDKACSRIVG A LERIGGAAL SPHAQHISVN ARNVALEAYL IKPHSYVGLT CTAFQRRE 559 - 588 gg vpgtrpgspg qnpppepepp adqqlrfr 589 - 613 CT TGRPNVSLSS FHIKNSVALA SIQ 614 - 632 lppslfs slpaalappv pp 633 - 1075 DCTLQLLV FRNGRLFHSH SNTSRPGAAG PGKRRGVATP VIFAGTSGCG VGNLTEPVAV SL RHWAEGAE PVAAWWSQEG PGEAGGWTSE GCQLRSSQPN VSALHCQHLG NVAVLMELSA FP REVGGAGA GLHPVVYPCT ALLLLCLFAT IITYILNHSS IRVSRKGWHM LLNLCFHIAM TS AVFAGGIT LTNYQMVCQA VGITLHYSSL STLLWMGVKA RVLHKELTWR APPPQEGDPA LP TPSPMLRF YLIAGGIPLI ICGITAAVNI HNYRDHSPYC WLVWRPSLGA FYIPVALILL IT WIYFLCAG LRLRGPLAQN PKAGNSRASL EAGEELRGST RLRGSGPLLS DSGSLLATGS AR VGTPGPPE DGDSLYSPGV QLGALVTTHF LYLAMWACGA LAVSQRWLPR VVCSCLYGVA AS ALGLFVFT HHCARRRDVR ASWRA 1076 - 1101 ccppa spaaphappr alpaaaedgs p 1102 - 1141 VFGEGPPSL KSSPSGSSGH PLALGPCKLT NLQLAQSQVC E 1142 - 1154 agaaaggeg epep 1155 - 1331 AGTRGN LAHRHPNNVH HGRRAHKSRA KGHRAGEACG KNRLKALRGG AAGALELLSS ESGS LHNSPT DSYLGSSRNS PGAGLQLEGE PMLTPSEGSD TSAAPLSEAG RAGQRRSASR DSLK GGGALE KESHRRSYPL NAASLNGAPK GGKYDDVTLM GAEVASGGCM KTGLWKSETT V low complexity regions: DUST >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] MRGAPARLLLPLLPWLLLLLAPEARGAPGCPLSIRSCKCSGERPKGLSGGVPGPARRRVV CSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGA FLGLGELKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDL GTEFLTCDCHLRWLLPWAQNRSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHT HHLIPSLRQVVFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGILLAESLIHDCT FITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAERVANNRGDFRW PRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAGRWEPGDYSHCLYTNDITRVL YTFVLMPINASNALTLAHQLRVYTAEAASFSDMMDVVYVAQMIQKFLGYVDQIKELVEVM VDMASNLMLVDEHLLWLAQREDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYL IKPHSYVGLTCTAFQRREGGVPGTRPGSPGQNPPPEPEPPADQQLRFRCTTGRPNVSLSS FHIKNSVALASIQLPPSLFSSLPAALAPPVPPDCTLQLLVFRNGRLFHSHSNTSRPGAAG PGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAEPVAAWWSQEGPGEAGGWTSE GCQLRSSQPNVSALHCQHLGNVAVLMELSAFPREVGGAGAGLHPVVYPCTALLLLCLFAT IITYILNHSSIRVSRKGWHMLLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSL STLLWMGVKARVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNI HNYRDHSPYCWLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQNPKAGNSRASL EAGEELRGSTRLRGSGPLLSDSGSLLATGSARVGTPGPPEDGDSLYSPGVQLGALVTTHF LYLAMWACGALAVSQRWLPRVVCSCLYGVAASALGLFVFTHHCARRRDVRASWRACCPPA SPAAPHAPPRALPAAAEDGSPVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVC EAGAAAGGEGEPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACGKNRLKALRGG AAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSDTSAAPLSEAG RAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPKGGKYDDVTLMGAEVASGGCM KTGLWKSETTV ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for gi|17511205|ref|NP_116166.6| sequence: 1330 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MRGAPARLLL PLLPWLLLLL APEARGAPGC PLSIRSCKCS GERPKGLSGG VPGPARRRVV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 CSGGDLPEPP EPGLLPNGTV TLLLSNNKIT GLRNGSFLGL SLLEKLDLRN NIISTVQPGA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 FLGLGELKRL DLSNNRIGCL TSETFQGLPR LLRLNISGNI FSSLQPGVFD ELPALKVVDL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 GTEFLTCDCH LRWLLPWAQN RSLQLSEHTL CAYPSALHAQ ALGSLQEAQL CCEGALELHT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 HHLIPSLRQV VFQGDRLPFQ CSASYLGNDT RIRWYHNRAP VEGDEQAGIL LAESLIHDCT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 FITSELTLSH IGVWASGEWE CTVSMAQGNA SKKVEIVVLE TSASYCPAER VANNRGDFRW ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 PRTLAGITAY QSCLQYPFTS VPLGGGAPGT RASRRCDRAG RWEPGDYSHC LYTNDITRVL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 YTFVLMPINA SNALTLAHQL RVYTAEAASF SDMMDVVYVA QMIQKFLGYV DQIKELVEVM ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~1222 2222222222 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 VDMASNLMLV DEHLLWLAQR EDKACSRIVG ALERIGGAAL SPHAQHISVN ARNVALEAYL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register 2222222211 11~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 IKPHSYVGLT CTAFQRREGG VPGTRPGSPG QNPPPEPEPP ADQQLRFRCT TGRPNVSLSS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 FHIKNSVALA SIQLPPSLFS SLPAALAPPV PPDCTLQLLV FRNGRLFHSH SNTSRPGAAG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 PGKRRGVATP VIFAGTSGCG VGNLTEPVAV SLRHWAEGAE PVAAWWSQEG PGEAGGWTSE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 GCQLRSSQPN VSALHCQHLG NVAVLMELSA FPREVGGAGA GLHPVVYPCT ALLLLCLFAT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 IITYILNHSS IRVSRKGWHM LLNLCFHIAM TSAVFAGGIT LTNYQMVCQA VGITLHYSSL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 STLLWMGVKA RVLHKELTWR APPPQEGDPA LPTPSPMLRF YLIAGGIPLI ICGITAAVNI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 HNYRDHSPYC WLVWRPSLGA FYIPVALILL ITWIYFLCAG LRLRGPLAQN PKAGNSRASL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 EAGEELRGST RLRGSGPLLS DSGSLLATGS ARVGTPGPPE DGDSLYSPGV QLGALVTTHF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 LYLAMWACGA LAVSQRWLPR VVCSCLYGVA ASALGLFVFT HHCARRRDVR ASWRACCPPA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1140 SPAAPHAPPR ALPAAAEDGS PVFGEGPPSL KSSPSGSSGH PLALGPCKLT NLQLAQSQVC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1200 EAGAAAGGEG EPEPAGTRGN LAHRHPNNVH HGRRAHKSRA KGHRAGEACG KNRLKALRGG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1111111111 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1260 AAGALELLSS ESGSLHNSPT DSYLGSSRNS PGAGLQLEGE PMLTPSEGSD TSAAPLSEAG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register 1111111111 1~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1320 RAGQRRSASR DSLKGGGALE KESHRRSYPL NAASLNGAPK GGKYDDVTLM GAEVASGGCM ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | KTGLWKSETT ~~~~~~~~~~ ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem9.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem9.fa.___inter___ (1 sequences) MRGAPARLLLPLLPWLLLLLAPEARGAPGCPLSIRSCKCSGERPKGLSGG VPGPARRRVVCSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGL SLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCLTSETFQGLPR LLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCDCHLRWLLPWAQN RSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHTHHLIPSLRQV VFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGILLAESLIHDCT FITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAER VANNRGDFRWPRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAG RWEPGDYSHCLYTNDITRVLYTFVLMPINASNALTLAHQLRVYTAEAASF SDMMDVVYVAQMIQKFLGYVDQIKELVEVMVDMASNLMLVDEHLLWLAQR EDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYLIKPHSYVGLT CTAFQRREGGVPGTRPGSPGQNPPPEPEPPADQQLRFRCTTGRPNVSLSS FHIKNSVALASIQLPPSLFSSLPAALAPPVPPDCTLQLLVFRNGRLFHSH SNTSRPGAAGPGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAE PVAAWWSQEGPGEAGGWTSEGCQLRSSQPNVSALHCQHLGNVAVLMELSA FPREVGGAGAGLHPVVYPCTALLLLCLFATIITYILNHSSIRVSRKGWHM LLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSLSTLLWMGVKA RVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNI HNYRDHSPYCWLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQN PKAGNSRASLEAGEELRGSTRLRGSGPLLSDSGSLLATGSARVGTPGPPE DGDSLYSPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLYGVA ASALGLFVFTHHCARRRDVRASWRACCPPASPAAPHAPPRALPAAAEDGS PVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVCEAGAAAGGEG EPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACGKNRLKALRGG AAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSD TSAAPLSEAGRAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPK GGKYDDVTLMGAEVASGGCMKTGLWKSETTV (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 8 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 4 24 1.242 Certain 2 298 318 0.713 Putative 3 370 390 0.833 Putative 4 419 439 0.996 Putative 5 612 632 1.228 Certain 6 666 686 1.308 Certain 7 766 786 2.172 Certain 8 803 823 1.680 Certain 9 828 848 1.319 Certain 10 880 900 1.827 Certain 11 921 941 2.223 Certain 12 1012 1032 1.698 Certain 13 1041 1061 1.833 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 3 4 5 6 7 8 9 10 11 12 13 Loop length 3 345 28 172 33 79 16 4 31 20 70 8 270 K+R profile 3.00 6.00 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - - 0.31 0.24 -0.18 1.19 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 19.00 Tm probability: 0.58 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -1.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 3 5 6 7 8 9 10 11 12 13 Loop length 3 273 51 221 33 79 16 4 31 20 70 8 270 K+R profile 3.00 6.00 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - - 0.31 0.24 -0.26 0.86 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 19.00 Tm probability: 0.16 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -0.97 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 2 4 5 6 7 8 9 10 11 12 13 Loop length 3 273 100 172 33 79 16 4 31 20 70 8 270 K+R profile 3.00 + 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - 0.13 - - - 0.31 0.24 -0.26 1.19 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 13.00 Tm probability: 0.28 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -1.16 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 1 5 6 7 8 9 10 11 12 13 Loop length 3 587 33 79 16 4 31 20 70 8 270 K+R profile 3.00 6.00 3.00 5.00 + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - 0.31 0.24 0.23 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 13.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -0.60 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 7 8 9 10 11 12 13 Loop length 3 273 51 28 172 33 79 16 4 31 20 70 8 270 K+R profile 3.00 6.00 + + 0.00 2.00 2.00 + 6.00 6.00 3.00 5.00 + + CYT-EXT prof - - 1.19 0.92 - - - -0.26 - - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.16 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.82 -> Orientation: N-out ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 1 4 5 6 7 8 9 10 11 12 13 Loop length 3 394 172 33 79 16 4 31 20 70 8 270 K+R profile 3.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 1.19 0.92 - - - -0.20 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.99 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.76 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 3 5 6 7 8 9 10 11 12 13 Loop length 3 345 221 33 79 16 4 31 20 70 8 270 K+R profile 3.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 0.86 0.92 - - - -0.18 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.58 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.41 -> Orientation: N-out ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 1 2 5 6 7 8 9 10 11 12 13 Loop length 3 273 293 33 79 16 4 31 20 70 8 270 K+R profile 3.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 1.02 0.92 - - - -0.26 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.28 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.65 -> Orientation: N-out ---------------------------------------------------------------------- "tem9" 1331 4 24 #t 1.24167 298 318 #f 0.7125 370 390 #f 0.833333 419 439 #f 0.995833 612 632 #t 1.22812 666 686 #t 1.30833 766 786 #t 2.17188 803 823 #t 1.68021 828 848 #t 1.31875 880 900 #t 1.82708 921 941 #t 2.22292 1012 1032 #t 1.69792 1041 1061 #t 1.83333 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem9.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem9.fa.___inter___ (1 sequences) MRGAPARLLLPLLPWLLLLLAPEARGAPGCPLSIRSCKCSGERPKGLSGG VPGPARRRVVCSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGL SLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCLTSETFQGLPR LLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCDCHLRWLLPWAQN RSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHTHHLIPSLRQV VFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGILLAESLIHDCT FITSELTLSHIGVWASGEWECTVSMAQGNASKKVEIVVLETSASYCPAER VANNRGDFRWPRTLAGITAYQSCLQYPFTSVPLGGGAPGTRASRRCDRAG RWEPGDYSHCLYTNDITRVLYTFVLMPINASNALTLAHQLRVYTAEAASF SDMMDVVYVAQMIQKFLGYVDQIKELVEVMVDMASNLMLVDEHLLWLAQR EDKACSRIVGALERIGGAALSPHAQHISVNARNVALEAYLIKPHSYVGLT CTAFQRREGGVPGTRPGSPGQNPPPEPEPPADQQLRFRCTTGRPNVSLSS FHIKNSVALASIQLPPSLFSSLPAALAPPVPPDCTLQLLVFRNGRLFHSH SNTSRPGAAGPGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRHWAEGAE PVAAWWSQEGPGEAGGWTSEGCQLRSSQPNVSALHCQHLGNVAVLMELSA FPREVGGAGAGLHPVVYPCTALLLLCLFATIITYILNHSSIRVSRKGWHM LLNLCFHIAMTSAVFAGGITLTNYQMVCQAVGITLHYSSLSTLLWMGVKA RVLHKELTWRAPPPQEGDPALPTPSPMLRFYLIAGGIPLIICGITAAVNI HNYRDHSPYCWLVWRPSLGAFYIPVALILLITWIYFLCAGLRLRGPLAQN PKAGNSRASLEAGEELRGSTRLRGSGPLLSDSGSLLATGSARVGTPGPPE DGDSLYSPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLYGVA ASALGLFVFTHHCARRRDVRASWRACCPPASPAAPHAPPRALPAAAEDGS PVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVCEAGAAAGGEG EPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHRAGEACGKNRLKALRGG AAGALELLSSESGSLHNSPTDSYLGSSRNSPGAGLQLEGEPMLTPSEGSD TSAAPLSEAGRAGQRRSASRDSLKGGGALEKESHRRSYPLNAASLNGAPK GGKYDDVTLMGAEVASGGCMKTGLWKSETTV (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 8 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 4 24 1.242 Certain 2 298 318 0.713 Putative 3 370 390 0.833 Putative 4 419 439 0.996 Putative 5 612 632 1.228 Certain 6 666 686 1.308 Certain 7 766 786 2.172 Certain 8 803 823 1.680 Certain 9 828 848 1.319 Certain 10 880 900 1.827 Certain 11 921 941 2.223 Certain 12 1012 1032 1.698 Certain 13 1041 1061 1.833 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 3 4 5 6 7 8 9 10 11 12 13 Loop length 3 345 28 172 33 79 16 4 31 20 70 8 270 K+R profile 2.00 6.00 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - - 0.31 0.24 -0.18 1.19 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 18.00 Tm probability: 0.58 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -1.38 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 3 5 6 7 8 9 10 11 12 13 Loop length 3 273 51 221 33 79 16 4 31 20 70 8 270 K+R profile 2.00 6.00 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - - 0.31 0.24 -0.26 0.86 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 18.00 Tm probability: 0.16 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -0.97 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 2 4 5 6 7 8 9 10 11 12 13 Loop length 3 273 100 172 33 79 16 4 31 20 70 8 270 K+R profile 2.00 + 6.00 3.00 5.00 + + + + + 0.00 2.00 2.00 CYT-EXT prof - 0.13 - - - 0.31 0.24 -0.26 1.19 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 12.00 Tm probability: 0.28 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -1.16 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 1 5 6 7 8 9 10 11 12 13 Loop length 3 587 33 79 16 4 31 20 70 8 270 K+R profile 2.00 6.00 3.00 5.00 + + + + 0.00 2.00 2.00 CYT-EXT prof - - - - 0.31 0.24 0.23 0.92 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 12.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: -0.60 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 7 8 9 10 11 12 13 Loop length 3 273 51 28 172 33 79 16 4 31 20 70 8 270 K+R profile 2.00 6.00 + + 0.00 2.00 2.00 + 6.00 6.00 3.00 5.00 + + CYT-EXT prof - - 1.19 0.92 - - - -0.26 - - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -8.00 Tm probability: 0.16 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.82 -> Orientation: N-out ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 1 4 5 6 7 8 9 10 11 12 13 Loop length 3 394 172 33 79 16 4 31 20 70 8 270 K+R profile 2.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 1.19 0.92 - - - -0.20 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -8.00 Tm probability: 0.99 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.76 -> Orientation: N-out ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 3 5 6 7 8 9 10 11 12 13 Loop length 3 345 221 33 79 16 4 31 20 70 8 270 K+R profile 2.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 0.86 0.92 - - - -0.18 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -8.00 Tm probability: 0.58 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.41 -> Orientation: N-out ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 1 2 5 6 7 8 9 10 11 12 13 Loop length 3 273 293 33 79 16 4 31 20 70 8 270 K+R profile 2.00 + + 0.00 2.00 2.00 + 6.00 3.00 5.00 + + CYT-EXT prof - 1.02 0.92 - - - -0.26 - - - 0.31 0.24 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -8.00 Tm probability: 0.28 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-out CYT-EXT difference: 1.65 -> Orientation: N-out ---------------------------------------------------------------------- "tem9" 1331 4 24 #t 1.24167 298 318 #f 0.7125 370 390 #f 0.833333 419 439 #f 0.995833 612 632 #t 1.22812 666 686 #t 1.30833 766 786 #t 2.17188 803 823 #t 1.68021 828 848 #t 1.31875 880 900 #t 1.82708 921 941 #t 2.22292 1012 1032 #t 1.69792 1041 1061 #t 1.83333 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ SAPS. Version of April 11, 1996. Date run: Tue Jan 22 00:16:02 2002 File: /people/maria/tem9.fa.___saps___ ID gi|17511205|ref|NP_116166.6| DE tumor endothelial marker 5 precursor [Homo sapiens] number of residues: 1331; molecular weight: 142.0 kdal 1 MRGAPARLLL PLLPWLLLLL APEARGAPGC PLSIRSCKCS GERPKGLSGG VPGPARRRVV 61 CSGGDLPEPP EPGLLPNGTV TLLLSNNKIT GLRNGSFLGL SLLEKLDLRN NIISTVQPGA 121 FLGLGELKRL DLSNNRIGCL TSETFQGLPR LLRLNISGNI FSSLQPGVFD ELPALKVVDL 181 GTEFLTCDCH LRWLLPWAQN RSLQLSEHTL CAYPSALHAQ ALGSLQEAQL CCEGALELHT 241 HHLIPSLRQV VFQGDRLPFQ CSASYLGNDT RIRWYHNRAP VEGDEQAGIL LAESLIHDCT 301 FITSELTLSH IGVWASGEWE CTVSMAQGNA SKKVEIVVLE TSASYCPAER VANNRGDFRW 361 PRTLAGITAY QSCLQYPFTS VPLGGGAPGT RASRRCDRAG RWEPGDYSHC LYTNDITRVL 421 YTFVLMPINA SNALTLAHQL RVYTAEAASF SDMMDVVYVA QMIQKFLGYV DQIKELVEVM 481 VDMASNLMLV DEHLLWLAQR EDKACSRIVG ALERIGGAAL SPHAQHISVN ARNVALEAYL 541 IKPHSYVGLT CTAFQRREGG VPGTRPGSPG QNPPPEPEPP ADQQLRFRCT TGRPNVSLSS 601 FHIKNSVALA SIQLPPSLFS SLPAALAPPV PPDCTLQLLV FRNGRLFHSH SNTSRPGAAG 661 PGKRRGVATP VIFAGTSGCG VGNLTEPVAV SLRHWAEGAE PVAAWWSQEG PGEAGGWTSE 721 GCQLRSSQPN VSALHCQHLG NVAVLMELSA FPREVGGAGA GLHPVVYPCT ALLLLCLFAT 781 IITYILNHSS IRVSRKGWHM LLNLCFHIAM TSAVFAGGIT LTNYQMVCQA VGITLHYSSL 841 STLLWMGVKA RVLHKELTWR APPPQEGDPA LPTPSPMLRF YLIAGGIPLI ICGITAAVNI 901 HNYRDHSPYC WLVWRPSLGA FYIPVALILL ITWIYFLCAG LRLRGPLAQN PKAGNSRASL 961 EAGEELRGST RLRGSGPLLS DSGSLLATGS ARVGTPGPPE DGDSLYSPGV QLGALVTTHF 1021 LYLAMWACGA LAVSQRWLPR VVCSCLYGVA ASALGLFVFT HHCARRRDVR ASWRACCPPA 1081 SPAAPHAPPR ALPAAAEDGS PVFGEGPPSL KSSPSGSSGH PLALGPCKLT NLQLAQSQVC 1141 EAGAAAGGEG EPEPAGTRGN LAHRHPNNVH HGRRAHKSRA KGHRAGEACG KNRLKALRGG 1201 AAGALELLSS ESGSLHNSPT DSYLGSSRNS PGAGLQLEGE PMLTPSEGSD TSAAPLSEAG 1261 RAGQRRSASR DSLKGGGALE KESHRRSYPL NAASLNGAPK GGKYDDVTLM GAEVASGGCM 1321 KTGLWKSETT V -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A :137(10.3%); C : 41( 3.1%); D : 34( 2.6%); E : 64( 4.8%); F : 32( 2.4%) G :135(10.1%); H : 42( 3.2%); I : 43( 3.2%); K : 30( 2.3%); L :173(13.0%) M : 19( 1.4%); N : 46( 3.5%); P :101( 7.6%); Q : 42( 3.2%); R : 87( 6.5%) S :116( 8.7%); T : 63( 4.7%); V : 74( 5.6%); W : 23( 1.7%); Y : 29( 2.2%) KR : 117 ( 8.8%); ED : 98 ( 7.4%); AGP : 373 ( 28.0%); KRED : 215 ( 16.2%); KR-ED : 19 ( 1.4%); FIKMNY- : 199 ( 15.0%); LVIFM : 341 ( 25.6%); ST : 179 ( 13.4%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0+0000+000 0000000000 00-0+00000 0000+00+00 0-+0+00000 00000+++00 61 0000-00-00 -000000000 0000000+00 00+0000000 000-+0-0+0 0000000000 121 00000-0++0 -0000+0000 00-000000+ 00+0000000 000000000- -0000+00-0 181 00-0000-00 0+00000000 +00000-000 0000000000 000000-000 00-000-000 241 0000000+00 0000-+0000 00000000-0 +0+0000+00 0-0--00000 00-0000-00 301 0000-00000 0000000-0- 0000000000 0++0-0000- 00000000-+ 0000+0-0+0 361 0+00000000 0000000000 0000000000 +00++0-+00 +0-00-0000 0000-00+00 421 0000000000 0000000000 +0000-0000 0-00-00000 0000+00000 -00+-00-00 481 0-00000000 --0000000+ --+000+000 00-+000000 0000000000 0+0000-000 541 0+00000000 00000++-00 0000+00000 00000-0-00 0-000+0+00 00+0000000 601 000+000000 0000000000 0000000000 00-0000000 0+00+00000 0000+00000 661 00+++00000 0000000000 00000-0000 00+000-00- 00000000-0 00-000000- 721 0000+00000 0000000000 000000-000 00+-000000 0000000000 0000000000 781 0000000000 0+00++0000 0000000000 0000000000 0000000000 0000000000 841 00000000+0 +000+-000+ 00000-0-00 00000000+0 0000000000 0000000000 901 000+-00000 0000+00000 0000000000 0000000000 0+0+000000 0+0000+000 961 -00--0+000 +0+0000000 -000000000 0+0000000- -0-0000000 0000000000 1021 0000000000 00000+000+ 0000000000 0000000000 0000+++-0+ 000+000000 1081 000000000+ 000000--00 0000-00000 +000000000 0000000+00 0000000000 1141 -0000000-0 -0-0000+00 000+000000 00++00+0+0 +00+00-000 +0+0+00+00 1201 00000-0000 -000000000 -000000+00 0000000-0- 000000-00- 0000000-00 1261 +000++000+ -00+00000- +-00++0000 000000000+ 00+0--0000 00-0000000 1321 +0000+0-00 0 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 9/30 or 12/45 or 15/60): 1) From 1173 to 1198: RRAHKSRAKGHRAGEACGKNRLKALR ++00+0+0+00+00-000+0+0+00+ quartile: 4; size: 26, +count: 10, -count: 1, 0count: 15; t-value: 4.65 * A: 5 (19.2%); G: 3 (11.5%); K: 4 (15.4%); R: 6 (23.1%); Negative charge clusters (cmin = 8/30 or 11/45 or 13/60): none Mixed charge clusters (cmin = 13/30 or 18/45 or 22/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 4 | 6 | 57 | 10 | 9 | 12 | 12 | 12 | 15 | 8 | 10 | lmin1 6 | 6 | 8 | 68 | 12 | 11 | 15 | 15 | 14 | 19 | 9 | 12 | lmin2 7 | 7 | 9 | 76 | 13 | 13 | 17 | 17 | 16 | 21 | 11 | 14 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 3: 3, at 56; 663; 1065; - runs >= 3: 0 * runs >= 4: 2, at 500; 1065; 0 runs >= 38: 1, at 797; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. __________________________________ High scoring hydrophobic segments: 2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ) -8.00 (KEDR) Expected score/letter: -1.523 M_0.01= 30.95; M_0.05= 26.05 1) From 918 to 941: length= 24, score=29.00 * 918 LGAFYIPVAL ILLITWIYFL CAGL L: 6(25.0%); A: 3(12.5%); I: 4(16.7%); ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -2.719 M_0.01= 74.97; M_0.05= 62.68; M_0.30= 48.07 1) From 755 to 786: length= 32, score=56.00 (pocket at 763 to 764: length= 2, score=-14.00) 755 VGGAGAGL |HP| VVYPCTALLL LCLFATIITY IL L: 7(21.9%); A: 4(12.5%); G: 4(12.5%); 2) From 918 to 941: length= 24, score=63.00 * 918 LGAFYIPVAL ILLITWIYFL CAGL L: 6(25.0%); A: 3(12.5%); I: 4(16.7%); 2. SPACINGS OF C. H2N-29-C-6-C-1-C-21-C-77-C-47-C-1-C-21-C-19-C-C-28-C-37-C-21-C-24-C-26-C-22-C-13-C-94-C-45-C-37-C-44-C-44-C-42-C-13-C-32-C-6-C-28-C-22-C-63-C-17-C-27-C-89-C-14-C-1-C-17-C-12-C-C-49-C-12-C-48-C-129-C-12-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-29-C-6-C-1-C-21-C-77-C-47-C-1-C-H-17-H-2-C-6-H-12-C-C-6-H-1-H-H-18-C-14-H-20-H-1-C-10-H-10-C-24-C-26-C-22-C-12-H-C-27-H-54-H-11-C-17-H-2-H-17-H-6-C-37-C-12-H-31-C-13-H-1-H-28-C-14-H-27-C-12-H-C-1-H-24-H-5-C-6-C-11-H-10-H-5-C-1-H-20-C-7-H-17-H-37-C-8-H-4-H-3-C-27-C-80-H-8-C-14-C-1-C-15-H-H-C-12-C-C-8-H-33-H-6-C-12-C-22-H-1-H-4-H-H-4-H-6-H-5-C-26-H-67-H-34-C-12-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 Aligned matching blocks: [ 49- 53] GGVPG [ 559- 563] GGVPG ______________________________ [ 58- 62] RVVCS [1040-1044] RVVCS ______________________________ [ 234- 238] GALEL [1203-1207] GALEL B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 Aligned matching blocks: [ 96- 106]-( 4)-[ 111- 122] [ 144- 154]-( 4)-[ 159- 169] [ 96- 106] oiisioii-+i [ 144- 154] oinsip+ii+i [ 111- 122] niiooinpssii [ 159- 169] niiooinp_sii -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 103 (Expected range: 59--113) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 39 (6-10) 25 (11-20) 21 (>=21) 19 3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 20/60): none 4. Long amino acid multiplets (>= 5; Letter/Length/Position): L/5/16 B. CHARGE ALPHABET. 1. Total number of charge multiplets: 19 (Expected range: 4-- 29) 11 +plets (f+: 8.8%), 8 -plets (f-: 7.4%) Total number of charge altplets: 16 (Critical number: 32) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 0 (6-10) 1 (11-20) 2 (>=21) 17 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 16- 20 1 L 5 5 ! 0 84- 187 8 L....... 11 4 2 175- 214 10 L......... 4 4 0 573- 580 2 P. 4 4 0 614- 629 4 L... 4 4 0 772- 775 1 L 4 4 0 960- 983 6 L..... 4 4 0 1071-1098 4 A... 6 4 1 1079-1090 3 P.. 4 4 0 1109-1120 3 S.. 4 4 0 1180-1211 8 A....... 4 4 0 1210-1233 4 S... 6 6 ! 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 82- 141 10 i..0.....0 6 6 /0/././1/./././././2/ 84- 187 8 i00....0 13 13 ! /0/2/3/././././4/ 148- 187 4 i... 9 7 1 459- 542 7 i.0.... 10 6 /2/./4/././././ 800- 811 2 i0 6 6 /0/1/ 921- 932 2 i0 6 6 /0/2/ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 934-1332 (4.) I( 398)I 1 of 44 0.0000 large maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR Leucine Rich Repeat 66.4 5.9e-16 4 LRRCT Leucine rich repeat C-terminal domain 43.0 6.8e-09 1 GPS Latrophilin/CL-1-like GPS domain 20.8 0.0041 1 HRM Hormone receptor domain 16.3 0.0076 1 ig Immunoglobulin domain 7.8 0.77 1 chitinase_2 Chitinase, family 2 1.2 16 1 LRRNT Leucine rich repeat N-terminal domain 0.3 44 1 late_protein_L1 L1 (late) protein -0.4 32 1 CBD_7 Putative carbohydrate binding domain -10.8 30 1 ZU5 ZU5 domain -45.1 42 1 Ski_Sno SKI/SNO/DAC family -46.1 89 1 CytC_asm Cytochrome C assembly protein -52.9 22 1 7tm_2 7 transmembrane receptor (Secretin fa -56.1 0.00011 1 MgtE Divalent cation transporter -65.2 63 1 Monooxygenase Monooxygenase -68.2 28 1 CPSase_L_D3 Carbamoyl-phosphate synthetase large -93.7 53 1 DsbD Cytochrome C biogenesis protein trans -114.8 73 1 7tm_3 7 transmembrane receptor (metabotropi -172.9 42 1 xan_ur_permease Permease family -214.0 60 1 Presenilin Presenilin -280.9 62 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRRNT 1/1 29 72 .. 1 31 [] 0.3 44 LRR 1/4 78 101 .. 1 25 [] 15.2 1.5 LRR 2/4 102 125 .. 1 25 [] 18.3 0.18 LRR 3/4 126 149 .. 1 25 [] 26.6 0.0006 LRR 4/4 150 173 .. 1 25 [] 18.2 0.2 LRRCT 1/1 183 233 .. 1 54 [] 43.0 6.8e-09 Ski_Sno 1/1 195 302 .. 1 116 [] -46.1 89 ig 1/1 254 323 .. 1 45 [] 7.8 0.77 HRM 1/1 343 412 .. 1 79 [] 16.3 0.0076 chitinase_2 1/1 468 477 .. 310 319 .] 1.2 16 Monooxygenase 1/1 326 485 .. 1 216 [] -68.2 28 CPSase_L_D3 1/1 436 546 .. 1 128 [] -93.7 53 ZU5 1/1 606 710 .. 1 115 [] -45.1 42 CBD_7 1/1 692 717 .. 1 45 [] -10.8 30 GPS 1/1 699 751 .. 1 54 [] 20.8 0.0041 MgtE 1/1 777 901 .. 1 137 [] -65.2 63 late_protein_L1 1/1 913 924 .. 1 14 [. -0.4 32 DsbD 1/1 759 932 .. 1 226 [] -114.8 73 CytC_asm 1/1 801 1035 .. 1 242 [] -52.9 22 7tm_3 1/1 763 1051 .. 1 283 [] -172.9 42 Presenilin 1/1 748 1061 .. 1 475 [] -280.9 62 xan_ur_permease 1/1 729 1064 .. 1 443 [] -214.0 60 7tm_2 1/1 762 1069 .. 1 273 [] -56.1 0.00011 Alignments of top-scoring domains: LRRNT: domain 1 of 1, from 29 to 72: score 0.3, E = 44 *->aCp...reCtCsp.............fglvVdCsgrgLtlevPrdlP Cp + r+C+Cs +++++ +++ ++++ ++V Csg L e P P gi|1751120 29 GCPlsiRSCKCSGerpkglsggvpgpARRRVVCSGGDLP-EPPE--P 72 <-* gi|1751120 - - LRR: domain 1 of 4, from 78 to 101: score 15.2, E = 1.5 *->nLeeLdLsnNnLsGslPpesfgnLp<-* +L LsnN+++ +l ++sf +L+ gi|1751120 78 GTVTLLLSNNKIT-GLRNGSFLGLS 101 LRR: domain 2 of 4, from 102 to 125: score 18.3, E = 0.18 *->nLeeLdLsnNnLsGslPpesfgnLp<-* Le LdL+nN +s ++ p++f +L gi|1751120 102 LLEKLDLRNNIIS-TVQPGAFLGLG 125 LRR: domain 3 of 4, from 126 to 149: score 26.6, E = 0.0006 *->nLeeLdLsnNnLsGslPpesfgnLp<-* L++LdLsnN++ l +e+f++Lp gi|1751120 126 ELKRLDLSNNRIG-CLTSETFQGLP 149 LRR: domain 4 of 4, from 150 to 173: score 18.2, E = 0.2 *->nLeeLdLsnNnLsGslPpesfgnLp<-* +L +L++s+N +s sl p++f Lp gi|1751120 150 RLLRLNISGNIFS-SLQPGVFDELP 173 LRRCT: domain 1 of 1, from 183 to 233: score 43.0, E = 6.8e-09 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell + CDC+LrwLl W+++ r+l+ +e+++Ca+P++l+ q l +l+ gi|1751120 183 EFLTCDCHLRWLLPWAQN---RSLQLSEHTLCAYPSALHAQALGSLQ 226 psdfsCp<-* + + C+ gi|1751120 227 EAQLCCE 233 Ski_Sno: domain 1 of 1, from 195 to 302: score -46.1, E = 89 *->lPprpvlsgPilspsDpstnElketeLeGekiacFiVgGEkrLCLPQ lP + s ls+ + + + L + ++ + E LC gi|1751120 195 LPWAQNRSL-QLSEHTLCA---YPSALHAQALGSLQ---EAQLCCEG 234 alnlvLkdlVGgLhqintvldeLeIycsvC...tvdQlriLkglGilppg al+l +l+ +L q+ d+L cs ++ +r + + gi|1751120 235 ALELHTHHLIPSLRQVVFQGDRLPFQCSASylgNDTRIRWYHNRAPVEGD 284 vnrCgLItrtDaerLcnallta<-* + L+ ae L +++ gi|1751120 285 EQAGILL----AESLIHDCTFI 302 ig: domain 1 of 1, from 254 to 323: score 7.8, E = 0.77 *->GesvtLtCsvsgfgpp.p.vtWlrngk.................... G+++ ++Cs+s +++++W++n + +++++ + ++ ++ gi|1751120 254 GDRLPFQCSAS-YLGNdTrIRWYHNRApvegdeqagillaeslihdc 299 ....lslti.svtpeDsgGtYtCvv<-* + ++ lt++ s G+ C+v gi|1751120 300 tfitSELTLsHIGVWAS-GEWECTV 323 HRM: domain 1 of 1, from 343 to 412: score 16.3, E = 0.0076 *->glyCpatwDgi....lCWPrTpaGtlvvvpCPdyfsGfnydttgedf +yCpa+ ++++ WPrT aG +++++C y f gi|1751120 343 ASYCPAERVANnrgdFRWPRTLAGITAYQSCLQYP-----------F 378 s.........ngnasRnCtenGwwerhpnsnwpwpdytnCtspey<-* ++ + +++ ++ asR+C+ G+we + dy+ C +y gi|1751120 379 TsvplgggapGTRASRRCDRAGRWE--------PGDYSHC---LY 412 chitinase_2: domain 1 of 1, from 468 to 477: score 1.2, E = 16 *->GYssaikgsV<-* GY+++ik +V gi|1751120 468 GYVDQIKELV 477 Monooxygenase: domain 1 of 1, from 326 to 485: score -68.2, E = 28 *->radllVGADGarStVRkllgi..fggsaarfevatlpl.vppdkeed a + k + i ++sa + ++ +++++++ gi|1751120 326 ----------AQGNASKKVEIvvLETSASYCPAERVANnRG------ 356 ehvlpgreylgy.gpglflpvsdgrliavvnrvrvltvtlfqwptdpekw ++++ +++ +++ + + + ++++ +v+l+ +++ + gi|1751120 357 -----DFRW--PrTLAGITAYQS------CLQYPFTSVPLGG-GAPGT-- 390 vglatvdelleefagwlpeaiglitadaeaitwlrvfpvarrladrwrkG + +++++ ++ g++ + ++++ ++ l++ + gi|1751120 391 -------RASRRCDRAGRWEPGDYSHCLY-TNDITRVLYTFVLMPINASN 432 RVfLaGDAAHampPlgGQGlNlg.leDAvnLawkLaavlrgggdkaedea ++L AH +++ + ++++D+v a+ +++ l+ gi|1751120 433 ALTL----AHQLRVYTAEAASFSdMMDVVYVAQMIQKFLG---------- 468 ieelLdtYeaeRrpraervqdlar<-* Y ++ + +e ++d+a gi|1751120 469 -------YVDQIKELVEVMVDMAS 485 CPSase_L_D3: domain 1 of 1, from 436 to 546: score -93.7, E = 53 *->lreeLrtPtddRlfaiaeAlrrGgysVdeiheLTkIDkWFL...hKl l +Lr+ t++ a+ + + + V + ++ I k FL+ +++ gi|1751120 436 LAHQLRVYTAE-----AASFSDM-MDVVYVAQM--IQK-FLgyvDQI 473 knIveleEkelksygldlselpaelLrkAKqlGFSDrqIAkllg...... k++ve+ + ++s+ + +++ lL A + D+ ++++g ++ + gi|1751120 474 KELVEV-MVDMASNL--M-LVDEHLLWLAQRE---DKACSRIVGalerig 516 ............vtEleVRklRkelgItPvvKqvDTcAAEFpAqTnYlYs + +++ ++ +v+ +V I+P+ gi|1751120 517 gaalsphaqhisVNARNVALEAYL--IKPH-------------------- 544 TY<-* +Y gi|1751120 545 SY 546 ZU5: domain 1 of 1, from 606 to 710: score -45.1, E = 42 *->ssflVs.....gtvdarGGrLrgsrHsGvrlIiPpr.Aipqgtrytc s++l+s + ++ ++ ++ ++L + Pp+ + ++ gi|1751120 606 SVALASiqlppSLFSSLPAALAPPV--------PPDcTLQLLVFRNG 644 YlvvhkklstpPPlDkee...getLlSPvvecGPCDVSmSAhGalllrPV +l ++ ++ +P +++++++++++Pv+ G S +G+ l+ PV gi|1751120 645 RLFHSHSNTSRPGA--AGpgkRRGVATPVIFAGT---SGCGVGN-LTEPV 688 iLevpHcAslrprdkwelvllrsengg<-* ++++H A+ + + +v + +g gi|1751120 689 AVSLRHWAE--GAE---PVAAWWSQEG 710 CBD_7: domain 1 of 1, from 692 to 717: score -10.8, E = 30 *->apaWqagtvYtgGdvVsYnGrnYkAkWWTQGeePgsssAGtwgpW<- W++g A WW Q e Pg ++g+W gi|1751120 692 LRHWAEG-------------AEPVAAWWSQ-EGPG-----EAGGW 717 * gi|1751120 - - GPS: domain 1 of 1, from 699 to 751: score 20.8, E = 0.0041 *->snpiCvfWdesel.slgvWstdrGCelvetskpshttCsCnHLTsFA +p ++W+ + ++ g+W++ +GC+l + + C+HL++ A gi|1751120 699 AEPVAAWWSQEGPgEAGGWTS-EGCQLRSSQP-NVSALHCQHLGNVA 743 vLmdvspn<-* vLm +s+ gi|1751120 744 VLMELSAF 751 MgtE: domain 1 of 1, from 777 to 901: score -65.2, E = 63 *->fiPlLiGlgGNvGtqlssrlsrgLalGevsskkrvlrvlakelltsi l +++++ + l ++s +rv+r +ll + gi|1751120 777 ---------------LFATIITYI-LNHSS--IRVSRKGWHMLLNLC 805 llgvvlslallfvagilgggsgvideegeallflivaislliisl.vall + +++ s++ + +++++ +++++v+i+l ++sl+++l+ gi|1751120 806 FHIAMTSAVFAGGITLTNYQ----------MVCQAVGITLHYSSLsTLLW 845 lgvlipilfkklgl........DPdn.............vsgPlITTlaD +gv +l k+l ++ ++++++DP+++++++ + ++PlI gi|1751120 846 MGVKARVLHKELTWrapppqegDPALptpspmlrfyliaGGIPLI----- 890 ittlllIyllia<-* i++++ + ++i+ gi|1751120 891 ICGIT-AAVNIH 901 late_protein_L1: domain 1 of 1, from 913 to 924: score -0.4, E = 32 *->MAlWlpsqnkvYLP<-* +W+ps + +Y P gi|1751120 913 --VWRPSLGAFYIP 924 DsbD: domain 1 of 1, from 759 to 932: score -114.8, E = 73 *->aFLAGLLSflSPCv....LPliPaylSyisgvslgDvkqgKGvvKPa +AGL + PC+ L l ++ yi + ++++ + gi|1751120 759 --GAGLHPVVYPCTalllLCLFATIITYIL--NHSSIRVS------- 794 kkvqwrvllsallFvlGftivFlllGvgaafltallrdyllylrviagvl +++ +l + + +++++a+ ++l + + +++g+ gi|1751120 795 --------RKGWHMLLNLCFHIAMTSAVFAGGITLT--NYQMVCQAVGIT 834 vIlmGLhflgvfrgliafLrklrlvWelktkkkveisvlgafllGllFAl + +L++l + + + ++v +k + ++++ G gi|1751120 835 LHYSSLSTL---LW----MGVKARV-----LHKELTWRAPPPQEG----- 867 gWtPCigPiLasvlalasstgslvllgalllvvYvLGLgiPfvlaalfag a+ t+ +++l +Y + giP+++ ++ a+ gi|1751120 868 --------------DPALPTP------SPMLRFYLIAGGIPLIICGITAA 897 sllsflrwLrkhsraikia.....sGvLLiliGallll<-* r hs + + +++ + + ++++ll++ gi|1751120 898 V---NIHNYRDHSPYCWLVwrpslGAFYIPVALILLIT 932 CytC_asm: domain 1 of 1, from 801 to 1035: score -52.9, E = 22 *->Wgllwfwdlvengasllpwlalgallhslslriiffavpaapillai ++ l f++ ++ a++ ++l+ ++ + i +++ ++++ll + gi|1751120 801 LLNLCFHIAMTS-AVFAGGITLTNYQMVCQAVGITLHYSSLSTLLWM 846 sa.......................................plvpalgsf ++ + +++ + + ++++++++ +++++ + ++ pl+++ + gi|1751120 847 GVkarvlhkeltwrapppqegdpalptpspmlrfyliaggiPLIICGITA 896 lvrmhvlvsvhsyatldivl.glpigalftllfLvlgsgwakplwgtrwv v h + + +y+ l v + +ga+ ++ +L+l++ w l+++ gi|1751120 897 AVNIHNYRDHSPYCWL--VWrP-SLGAFYIPVALILLITWIYFLCAGLRL 943 wdarsvkivkvsfl.N.aslftsvlrevasflylnnlllvsalivvllgt ++ +N++++ + ++ e a+ + +++ gi|1751120 944 RGPL---------AqNpKAGNSRASLE-AGEELRGST------------- 970 llpllekldnlsyrligigFpnltigiisgavWanvlawGsywsn....I + g+++ + + la Gs ++++ gi|1751120 971 ---R-----------------LRGSGPLLSDSGSL-LATGSARVGtpgpP 999 Wdpketwalitwlvfalfiaylhlrivkgwlgrkralvasa<-* d +++++ + l + +h++ ++ w ++ al++s+ gi|1751120 1000 EDGDSLYSPGVQLGAL---VTTHFLYLAMWACG--ALAVSQ 1035 7tm_3: domain 1 of 1, from 763 to 1051: score -172.9, E = 42 *->lgivlvalAvlGLivltlfvlvvFvkhrdTPiVKasnreLSYlLLiG +v+ + A+l +++ +++++++++++ ++ s + lL gi|1751120 763 HPVVYPCTALL--LLCLFATIITYILNHSS--IRVSRKGWHMLL--- 802 lilcylcsfl.FIgkG.tv.ekPstwsCilRrilfGlgFtlcySalL... lc+ + ++ ++ G+ + ++ + +C++ +++ l ++ ++S+lL + gi|1751120 803 -NLCFHIAMTsAVFAGgITlTN-YQMVCQA--VGITLHYS-SLSTLLwmg 847 aKTirVl..riFrgakkpgstqkpkfispwaqvlivliltliQviicviW +K+ ++++++ r a p ++ p + p + + li +i +iic+i gi|1751120 848 VKARVLHkeLTWR-APPPQEG-DPALPTPSPMLRFYLIAGGIPLIICGI- 894 LvvePPrpdidiysekekeedeDvsiapiileCnkgsvvaFlvvLgYdgl ++i+ ++++ + + s aF+++ + + l gi|1751120 895 ------TAAVNIHNYRDHSP--------YCWLVWRPSLGAFYIPVALILL 930 LavlctllAFltRnlPvdnFNEAKfItFSMYn.................. + +l A l P + n++ ++++ + + +++ ++ gi|1751120 931 ITWIYFLCAGLRLRGP-----------LAQ-Npkagnsrasleageelrg 968 ....................................vltfCiVWvAFiPa +++ +++++ +++++ +++ + +++++++++++ + gi|1751120 969 strlrgsgpllsdsgsllatgsarvgtpgppedgdsLY------------ 1006 YlstpgsaedykvqvaveifsilaSstvlLgclFaPKcYIILfkPekN<- +pg+ + v++ ++ +++ a+++ + P++ + ++ gi|1751120 1007 ---SPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLYGVAA 1051 * gi|1751120 - - Presenilin: domain 1 of 1, from 748 to 1061: score -280.9, E = 62 *->eeelklkYg..akhvimlfvPVslCMllVvatiksvsfyeqndglkl + + +g+ + + + +P ++ ll ++ + gi|1751120 748 LSAFPREVGgaGAGLHPVVYPCTALLLLCLFATI------------I 782 lYtpFhsdtptvgqklldSllNtlilIsVlvvmTf......LlvvlYkyr +Y + hs + v+ k+++ llN +++ + mT+ +++ + Y++ gi|1751120 783 TYILNHS-SIRVSRKGWHMLLN----LCFHIAMTSavfaggITLTNYQMV 827 FYKiIhgWLilSSl..llLFlfsviyLqe..................... + ++ L SSl++ll ++ + L+ + + + ++++++++ +++++ gi|1751120 828 C-QAVGITLHYSSLstLLWMGVKARVLHKeltwrapppqegdpalptpsp 876 vfkaynDQDDNNQvpmdyptvllliwnFGvvGmmciHWkgpLrLqQfYLI +++ y ++ +p+++ +i +v + +p L+ gi|1751120 877 MLRFYL-------IAGGIPLIICGITA--AVNIHNYRDHSPY----CWLV 913 tisaLmAlvfikyLPeWTaWfvLaaISvWDLVAVLtPkGPLrmLVETAqe +L A ++i + ++L + ++ L A L +GPL Aq gi|1751120 914 WRPSLGA-FYIPVA------LILLITWIYFLCAGLRLRGPL------AQN 950 RnepiFPAliYSStvvvltvgsnqeetnegtpsterrtskstIEYyTkRe +S l++g ++ r +++ gi|1751120 951 PKAG-------NSR-ASLEAG---------EE--LRGSTR---------- 971 aQDDEFYQKIRQRrsaanPdsaptshstlVEAEPSpielkekssteeesd l ++ + + gi|1751120 972 ----------------------------L---------RGS----GPLLS 980 desdsseteGmPLvsgsSNLsssdssttvsqsdsllaphrteeiareqpe d+++ ++t s +++ t+ + + gi|1751120 981 DSGSLLATG-----S-------------------ARVG--TPGPPEDGDS 1004 vqsnslpsNekreeTAaDalhPdGeeErgvkLGLGDFiFYSvLvGkAsat + s gv LG A t gi|1751120 1005 LY-----S-------------------PGVQLG-------------ALVT 1017 asGdwnTtIACFVaILiGLclTLlllavfkkALPALPisitfGlIFYFst + + AC lav+++ LP s G ++ gi|1751120 1018 THFLYLAMWAC------------GALAVSQRWLPRVVCSCLYGVA---AS 1052 dilvtPFmd<-* + F+ gi|1751120 1053 ALGLFVFTH 1061 xan_ur_permease: domain 1 of 1, from 729 to 1064: score -214.0, E = 60 *->gqtlllGLQhl..laMfaatvlvPllvgdalclglsaeslaylistt + + l +Qhl++ a + + P vg a ++gl ++ y+ gi|1751120 729 PNVSALHCQHLgnVAVLMELSAFPREVGGA-GAGL--HPVVYP---- 768 llvsGigTllQllrYgigrifGirlpivlGsSFafvtPaiglialmialG + ll l+ + +t +++ + gi|1751120 769 ---CTALLLLCLF-------------------ATIIT-------YILNHS 789 sapad.qgplepigialgglfGallvagvlfilisftglrgrLarlfPPv s +++ g ++ l f + ++v+ +i +t + gi|1751120 790 SIRVSrKG----WHMLLNLCFHIAMTSAVFAGGITLTNYQ---------- 825 VtGpvvllIGLsLipiavkgvaggwaaildgllTllvgslellglavvvl ++ ++I L ++g++a+ l +l + + + l gi|1751120 826 -MVCQAVGITLHYSSLSTLLWMGVKARVLHK---ELTWRAPPPQEGDPAL 871 avilllsvftavlkgffkslpiLiGiivGwilalfmGpsivdlspeGseA + ++ +f ++ ++i++ i++++++ + + + d+sp+ gi|1751120 872 PTPSPMLRF------YLIAGGIPLIICGITAAVNIHN--YRDHSPY---- 909 RtDkNSLavvrdapwfqlplplpfglpldalgafnpgliltmlavaivai ++++ p+++ f +p +lil+++ ++ +++ gi|1751120 910 ---------C---WLVWRPSLGAFYIP--------VALILLITWIYFLCA 939 vEsi..........GditatakvsgrdlkpgtykprlrRg..lladGlat ++++ ++++ G +a + + l + ++rlr++++ll d+ + gi|1751120 940 GLRLrgplaqnpkaGNSRASLEAGE-ELRG---STRLRGSgpLLSDSGSL 985 llaGlfGagtPt......TtfaeNiGvvalTrvaSrrvgvtaavilillG l +G +gtP +++++++ +++ +v+l+ + vt+ +++ gi|1751120 986 LATGSARVGTPGppedgdSLYSP---GVQLG------ALVTTHFLYLAMW 1026 lfpKfaallssIPspVlGGvmlvlFgmiagsGvsilqsvdldysaRnllI ++++a +P V +s+ + gi|1751120 1027 ACGALAVSQRWLPRVV--------------------CSCLYG-------- 1048 iavSlvlGlgiptvpei<-* +a+S +lGl+++t++ gi|1751120 1049 VAAS-ALGLFVFTHHCA 1064 7tm_2: domain 1 of 1, from 762 to 1069: score -56.1, E = 0.00011 *->allLkviytVGyslSsLvcLllaiaifllfR.kLrctRnyIHm..NL ++ + + t +l L cL +i+++ l +++r R Hm NL gi|1751120 762 LHPVVYPCT---ALL-LLCLFATIITYILNHsSIRVSRKGWHMllNL 804 flsfiLralsfLigdavllnsg.CkvvavflhYfflaNFfWmLvEGlYLy + + + +++f g++ + +++ C +v + lhY l++ Wm+v + L+ gi|1751120 805 CFHIAMTSAVFAGGITLTNYQMvCQAVGITLHYSSLSTLLWMGVKARVLH 854 tLlvvtv.........evffserkrlwwYlliGWGvPavfvtiwaivrpd l + + ++++++ + + ++ ++ ++li G+P+++ +i a+v gi|1751120 855 KEL---TwrapppqegDPALPTPSPMLRFYLIAGGIPLIICGITAAVNIH 901 kygpilaegpagygnegcCWlsndtnsgfwWiikGPilliilvNfiffin + + ++ +CWl +++ g ++i P++li+l+ i f++ gi|1751120 902 N----------YRDHSPYCWLVWRPSLGAFYI---PVALILLITWIYFLC 938 ilriLvqKlridslspqtgetdqyrk........................ + L l +p +g + + +++ +++++ +++++ +++++ gi|1751120 939 AGLRLRGPLA---QNPKAGNSRASLEageelrgstrlrgsgpllsdsgsl 985 ...........................krlvkstLlLlPLLGvtwilflf +++ + +++++++++++ +++ + +lv + +l l w +++ gi|1751120 986 latgsarvgtpgppedgdslyspgvqlGALVTTHFLYL----AMWACGAL 1031 apedqsqGtlslvflylfliLnSfQGffVavlYCflNgEV<-* a++ + + ++v l + S G+fV+ +C +++V gi|1751120 1032 AVSQRW--LPRVVCSCLYGVAASALGLFVFTHHCARRRDV 1069 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR Leucine Rich Repeat 54.8 6e-13 5 LRRCT Leucine rich repeat C-terminal domain 41.1 1.9e-12 1 GPS Latrophilin/CL-1-like GPS domain 18.9 0.0012 1 HRM Hormone receptor domain 17.1 0.005 1 ig Immunoglobulin domain 7.8 0.77 1 7tm_2 7 transmembrane receptor (Secretin fa 7.5 0.21 2 DUF190 Uncharacterized ACR, COG1993 5.8 3.6 1 CBD_7 Putative carbohydrate binding domain 5.5 9.6 1 HAMP HAMP domain 3.5 22 1 BAG BAG domain 2.9 25 1 chitinase_2 Chitinase, family 2 1.2 16 1 Mtap_PNP Phosphorylase family 2 0.2 41 1 Methyltransf_3 O-methyltransferase -0.2 67 1 UPF0060 Uncharacterized BCR, YnfA/UPF0060 fam -0.4 52 1 late_protein_L1 L1 (late) protein -0.4 32 1 DUF243 Domain of unknown function (DUF243) -1.1 80 1 prion Prion protein -1.1 58 1 TT_ORF1 TT viral orf 1 -1.5 68 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRR 1/5 78 101 .. 1 25 [] 13.3 0.12 LRR 2/5 102 125 .. 1 25 [] 16.4 0.018 LRR 3/5 126 149 .. 1 25 [] 24.6 0.0001 LRR 4/5 150 173 .. 1 25 [] 16.3 0.019 LRRCT 1/1 183 233 .. 1 54 [] 41.1 1.9e-12 ig 1/1 254 323 .. 1 45 [] 7.8 0.77 HRM 1/1 360 412 .. 14 79 .] 17.1 0.005 DUF190 1/1 435 450 .. 1 16 [. 5.8 3.6 BAG 1/1 459 474 .. 76 91 .] 2.9 25 chitinase_2 1/1 468 477 .. 310 319 .] 1.2 16 LRR 5/5 613 623 .. 15 25 .] 0.4 4.1e+02 CBD_7 1/1 703 717 .. 25 45 .] 5.5 9.6 Mtap_PNP 1/1 734 744 .. 271 281 .] 0.2 41 GPS 1/1 699 751 .. 1 54 [] 18.9 0.0012 Methyltransf_3 1/1 746 754 .. 1 9 [. -0.2 67 7tm_2 1/2 818 848 .. 58 87 .. 5.7 0.7 TT_ORF1 1/1 883 907 .. 95 117 .. -1.5 68 late_protein_L1 1/1 913 924 .. 1 14 [. -0.4 32 prion 1/1 928 940 .. 232 244 .] -1.1 58 UPF0060 1/1 929 941 .. 1 13 [. -0.4 52 HAMP 1/1 921 949 .. 1 29 [. 3.5 22 DUF243 1/1 1001 1020 .. 1 20 [. -1.1 80 7tm_2 2/2 1055 1069 .. 259 273 .] 1.9 9.8 Alignments of top-scoring domains: LRR: domain 1 of 5, from 78 to 101: score 13.3, E = 0.12 *->nLeeLdLsnNnLsGslPpesfgnLp<-* +L LsnN+++ +l ++sf +L+ gi|1751120 78 GTVTLLLSNNKIT-GLRNGSFLGLS 101 LRR: domain 2 of 5, from 102 to 125: score 16.4, E = 0.018 *->nLeeLdLsnNnLsGslPpesfgnLp<-* Le LdL+nN +s ++ p++f +L gi|1751120 102 LLEKLDLRNNIIS-TVQPGAFLGLG 125 LRR: domain 3 of 5, from 126 to 149: score 24.6, E = 0.0001 *->nLeeLdLsnNnLsGslPpesfgnLp<-* L++LdLsnN++ l +e+f++Lp gi|1751120 126 ELKRLDLSNNRIG-CLTSETFQGLP 149 LRR: domain 4 of 5, from 150 to 173: score 16.3, E = 0.019 *->nLeeLdLsnNnLsGslPpesfgnLp<-* +L +L++s+N +s sl p++f Lp gi|1751120 150 RLLRLNISGNIFS-SLQPGVFDELP 173 LRRCT: domain 1 of 1, from 183 to 233: score 41.1, E = 1.9e-12 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell + CDC+LrwLl W+++ r+l+ +e+++Ca+P++l+ q l +l+ gi|1751120 183 EFLTCDCHLRWLLPWAQN---RSLQLSEHTLCAYPSALHAQALGSLQ 226 psdfsCp<-* + + C+ gi|1751120 227 EAQLCCE 233 ig: domain 1 of 1, from 254 to 323: score 7.8, E = 0.77 *->GesvtLtCsvsgfgpp.p.vtWlrngk.................... G+++ ++Cs+s +++++W++n + +++++ + ++ ++ gi|1751120 254 GDRLPFQCSAS-YLGNdTrIRWYHNRApvegdeqagillaeslihdc 299 ....lslti.svtpeDsgGtYtCvv<-* + ++ lt++ s G+ C+v gi|1751120 300 tfitSELTLsHIGVWAS-GEWECTV 323 HRM: domain 1 of 1, from 360 to 412: score 17.1, E = 0.005 *->WPrTpaGtlvvvpCPdyfsGfnydttgedfs.........ngnasRn WPrT aG +++++C y f++ + +++ ++ asR+ gi|1751120 360 WPRTLAGITAYQSCLQYP-----------FTsvplgggapGTRASRR 395 CtenGwwerhpnsnwpwpdytnCtspey<-* C+ G+we + dy+ C +y gi|1751120 396 CDRAGRWE--------PGDYSHC---LY 412 DUF190: domain 1 of 1, from 435 to 450: score 5.8, E = 3.6 *->vkkklLrIYtsEddkf<-* +++ Lr+Yt+E+ +f gi|1751120 435 TLAHQLRVYTAEAASF 450 BAG: domain 1 of 1, from 459 to 474: score 2.9, E = 25 *->lvrkiqalLeklDskk<-* +++ iq++L ++D++k gi|1751120 459 VAQMIQKFLGYVDQIK 474 chitinase_2: domain 1 of 1, from 468 to 477: score 1.2, E = 16 *->GYssaikgsV<-* GY+++ik +V gi|1751120 468 GYVDQIKELV 477 LRR: domain 5 of 5, from 613 to 623: score 0.4, E = 4.1e+02 *->slPpesfgnLp<-* +lPp++f++Lp gi|1751120 613 QLPPSLFSSLP 623 CBD_7: domain 1 of 1, from 703 to 717: score 5.5, E = 9.6 *->AkWWTQGeePgsssAGtwgpW<-* A WW Q e Pg ++g+W gi|1751120 703 AAWWSQ-EGPG-----EAGGW 717 Mtap_PNP: domain 1 of 1, from 734 to 744: score 0.2, E = 41 *->licaillriAv<-* l+c++l ++Av gi|1751120 734 LHCQHLGNVAV 744 GPS: domain 1 of 1, from 699 to 751: score 18.9, E = 0.0012 *->snpiCvfWdesel.slgvWstdrGCelvetskpshttCsCnHLTsFA +p ++W+ + ++ g+W++ +GC+l + + C+HL++ A gi|1751120 699 AEPVAAWWSQEGPgEAGGWTS-EGCQLRSSQP-NVSALHCQHLGNVA 743 vLmdvspn<-* vLm +s+ gi|1751120 744 VLMELSAF 751 Methyltransf_3: domain 1 of 1, from 746 to 754: score -0.2, E = 67 *->retSvyPRe<-* +e S++PRe gi|1751120 746 MELSAFPRE 754 7tm_2: domain 1 of 2, from 818 to 848: score 5.7, E = 0.7 *->gdavllnsg.CkvvavflhYfflaNFfWmLv<-* g++ + +++ C +v + lhY l++ Wm+v gi|1751120 818 GITLTNYQMvCQAVGITLHYSSLSTLLWMGV 848 TT_ORF1: domain 1 of 1, from 883 to 907: score -1.5, E = 68 *->IkGyLPiliCGentv..nrNYalHs<-* I G+ P++iCG + + +++NY Hs gi|1751120 883 IAGGIPLIICGITAAvnIHNYRDHS 907 late_protein_L1: domain 1 of 1, from 913 to 924: score -0.4, E = 32 *->MAlWlpsqnkvYLP<-* +W+ps + +Y P gi|1751120 913 --VWRPSLGAFYIP 924 prion: domain 1 of 1, from 928 to 940: score -1.1, E = 58 *->iLLisFLiflivG<-* iLLi ++ fl G gi|1751120 928 ILLITWIYFLCAG 940 UPF0060: domain 1 of 1, from 929 to 941: score -0.4, E = 52 *->mlrtiLLFvaAaL<-* l+t +F+ A+L gi|1751120 929 LLITWIYFLCAGL 941 HAMP: domain 1 of 1, from 921 to 949: score 3.5, E = 22 *->lvlvlllalllllllvlllarrllrPlrk<-* +++ ++l+ll+ + +l++ rl+ Pl + gi|1751120 921 FYIPVALILLITWIYFLCAGLRLRGPLAQ 949 DUF243: domain 1 of 1, from 1001 to 1020: score -1.1, E = 80 *->qqaaqqgqraqqaaevhKeF<-* + +++++++q a+v+++F gi|1751120 1001 DGDSLYSPGVQLGALVTTHF 1020 7tm_2: domain 2 of 2, from 1055 to 1069: score 1.9, E = 9.8 *->GffVavlYCflNgEV<-* G+fV+ +C +++V gi|1751120 1055 GLFVFTHHCARRRDV 1069 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR-ma 23.8 0.0039 3 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRR-ma 1/3 100 113 .. 1 14 [] 8.0 84 LRR-ma 2/3 124 137 .. 1 14 [] 15.0 1.8 LRR-ma 3/3 148 161 .. 1 14 [] 8.5 68 Alignments of top-scoring domains: LRR-ma: domain 1 of 3, from 100 to 113: score 8.0, E = 84 *->lpsLeeLdLsnNrl<-* l+ Le+LdL+nN + gi|1751120 100 LSLLEKLDLRNNII 113 LRR-ma: domain 2 of 3, from 124 to 137: score 15.0, E = 1.8 *->lpsLeeLdLsnNrl<-* l L++LdLsnNr+ gi|1751120 124 LGELKRLDLSNNRI 137 LRR-ma: domain 3 of 3, from 148 to 161: score 8.5, E = 68 *->lpsLeeLdLsnNrl<-* lp+L +L++s+N + gi|1751120 148 LPRLLRLNISGNIF 161 // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Tue Jan 22 00:19:53 2002 Sequence file: tem9.fa ---------------------------------------- Sequence gi|17511205|ref|NP_116166.6| (1331 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 77: NGTV 94: NGSF 155: NISG 200: NRSL 268: NDTR 329: NASK 429: NASN 595: NVSL 652: NTSR 683: NLTE 730: NVSA 787: NHSS Total matches: 12 Matching pattern PS00005 PKC_PHOSPHO_SITE: 33: SIR 36: SCK 246: SLR 331: SKK 393: SRR 591: TGR 653: TSR 691: SLR 790: SIR 794: SRK 858: TWR 969: STR 990: SAR 1034: SQR 1072: SWR 1109: SLK 1226: SSR 1272: SLK 1283: SHR Total matches: 19 Matching pattern PS00006 CK2_PHOSPHO_SITE: 62: SGGD 101: SLLE 224: SLQE 449: SFSD 1218: SPTD 1244: TPSE Total matches: 6 Matching pattern PS00008 MYRISTYL: 29: GCPLSI 46: GLSGGV 73: GLLPNG 91: GLRNGS 95: GSFLGL 119: GAFLGL 138: GCLTSE 158: GNIFSS 288: GILLAE 312: GVWASG 385: GGAPGT 386: GAPGTR 389: GTRASR 548: GLTCTA 559: GGVPGT 560: GVPGTR 563: GTRPGS 675: GTSGCG 678: GCGVGN 715: GGWTSE 756: GGAGAG 757: GAGAGL 818: GITLTN 893: GITAAV 954: GNSRAS 983: GSLLAT 1009: GVQLGA 1013: GALVTT 1048: GVAASA 1143: GAAAGG 1156: GTRGNL 1199: GGAAGA 1200: GAAGAL 1213: GSLHNS 1225: GSSRNS 1248: GSDTSA 1263: GQRRSA 1297: GAPKGG 1311: GAEVAS 1318: GCMKTG 1323: GLWKSE Total matches: 41 Matching pattern PS00009 AMIDATION: 661: PGKR 1171: HGRR Total matches: 2 Matching pattern PS00016 RGD: 355: RGD Total matches: 1 Total no of hits in this sequence: 81 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1331 residues. Total no of hits in all sequences: 81. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M minimal SH3 binding is the MOTIF name >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] ;LENGTH=1331; DIRECT_SEQUENCE n 14 solutions m %_PXXP 11-14 f m %_PXXP 28-31 f m %_PXXP 67-70 f m %_PXXP 69-72 f m %_PXXP 566-569 f m %_PXXP 574-577 f m %_PXXP 577-580 f m %_PXXP 628-631 f m %_PXXP 629-632 f m %_PXXP 869-872 f m %_PXXP 996-999 f m %_PXXP 1079-1082 f m %_PXXP 1082-1085 f m %_PXXP 1085-1088 f >STATISTICS Total : 14 solutions in 1 sequences, 1331 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M deubiquitinating enzyme SH3 domain binding motif (Kato, 2000) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M minimal class I consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M minimal class II consensus-SH3 binding motif is the MOTIF name >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] ;LENGTH=1331; DIRECT_SEQUENCE n 1 solutions m %_XPXXPXR 1084-1090 f >STATISTICS Total : 1 solutions in 1 sequences, 1331 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M exact 14-3-3 binding consensus (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M 14-3-3 binding motif in RAF and others (Muslin 1996 Cell 84 889) is the MOTIF name >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] ;LENGTH=1331; DIRECT_SEQUENCE n 1 solutions m %_RSXXP 725-729 f >STATISTICS Total : 1 solutions in 1 sequences, 1331 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M WW domain binding motif in formins (Bedford 1997) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >INTRA-SIGNAL-M PY motif for WW domain is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >TM-CYTOPLASMIC-M di-hydrophobic endocytosis motifs for internalized transmembrane proteins is the MOTIF name >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] ;LENGTH=1331; DIRECT_SEQUENCE n 1 solutions m %_D 981-981 %_XXXL 982-985 %_L 986-986 f >STATISTICS Total : 1 solutions in 1 sequences, 1331 units; out of 1 sequences, 1331 units >TM-CYTOPLASMIC-M tyrosine-based endocytosis motif for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >TM-EXTRACELL-M Endocytosis signal for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >EXTRACELL-M minimal furin protease cleavage site motif is the MOTIF name >gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] ;LENGTH=1331; DIRECT_SEQUENCE n 8 solutions m %_RXXR 150-153 f m %_RXXR 359-362 f m %_RXXR 391-394 f m %_RXXR 395-398 f m %_RXXR 398-401 f m %_RXXR 642-645 f m %_RXXR 792-795 f m %_RXXR 1067-1070 f >STATISTICS Total : 8 solutions in 1 sequences, 1331 units; out of 1 sequences, 1331 units >EXTRACELL-M extended furin protease cleavage site motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >EXTRACELL-M zinc binding motif in MMPs is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >EXTRACELL-M g alpha binding go loco is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS PDX-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS QKI-5 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS HCDA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SV40 LrgT experimentally determined is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS H2B experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS v-Rel experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS RanBP3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Pho4p experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS DNAhelicaseQ1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS LEF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS TCF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR p53-NLS1 NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hum-Ku70 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS GAL4 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS act/inh betaA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS TR2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS THOV NP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS polyomaVP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS HIV-1 Tat experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS HIV-1 Rev experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Rex experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS NS5A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS adenovE1a experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS ystDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hVDR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS CPV capsid experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hGlu.cort.experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS cFOS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS cJUN experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hBLM experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hARNT experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS p54 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS hProTalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Tst1/Oct6 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS protHsc9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS protHsci experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS protHsc3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Ta alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Pax-QNR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Hunt.Dis.pro experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS opaque2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS CTP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS p110RB1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS VirD2-Nterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS VirD2-Cterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Nucloplasmin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Nucleolin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS ICP-8 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Nab2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS M9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS lscMyc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS humKprotein experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS FluA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Mat-alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SV40 VP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS SV40 VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS polyoma VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS c-myb experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS N-myc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS p53 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS c-erb-A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS yeast SKI3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS Max experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS L3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >NUCLEAR NLS dyskerin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >PDZ domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units >WW domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1331 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB+ PSSM from other authors IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] (1400 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value LRR Leucine rich repeats 79 5e-17 PUM Pumilio repeat RNA binding domain 25 0.58 14-3-3 14-3-3 protein alpha Helical domain 24 1.2 PDE cyclic NMP phosphodiesterase domain 24 1.5 DHHC Novel zinc finger domain with DHHC signature 22 5.5 HOG HOG- intein(HINT) like domain 22 5.7 FYVE Zinc Finger domain involved in PtdIns(3)P binding 22 6.1 ARM Armadillo repeat 22 6.4 SH2 Src Homology domain 2 22 6.6 CALC Calcineurin like Phosphoesterase domain 22 7.1 KIN Protein kinase domain 22 7.3 MIZFIN MIZ type Cysteine zinc DNA binding domain 22 7.5 >LRR Leucine rich repeats Length = 339 Score = 78.7 bits (192), Expect = 5e-17 Identities = 37/149 (24%), Positives = 37/149 (24%), Gaps = 3/149 (2%) Query: 69 PPEPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGELK 128 Sbjct: 26 PTDIFRMKDLTI-IDLSRNQLREVPT-NLEYAKGSIVLNLSYNNIETIPNSVCANLIDLL 83 Query: 129 RLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCD 188 Sbjct: 84 FLDLSNNKLDMLPPQ-IRRLSMLQSLKLSNNPLNHFQLKQLPSMTSLSVLHMSNTNRTLD 142 Query: 189 CHLRWLLPWAQNRSLQLSEHTLCAYPSAL 217 Sbjct: 143 NIPPTLDDMHNLRDVDFSENNLPIVPEAL 171 Score = 72.9 bits (177), Expect = 2e-15 Identities = 37/155 (23%), Positives = 37/155 (23%), Gaps = 10/155 (6%) Query: 57 RRVVCSGGDLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTV 116 Sbjct: 178 RKLNLSGNKIEKLNMTEGEWENLETLNMSHNQLTVLPD-CVVKLTRLTKLYAANNQLTFE 236 Query: 117 Q-PGAFLGLGELKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPAL 175 Sbjct: 237 GIPSGIGKLIQLTVLHLSYNKLELVP-EGISRCVKLQKLKLDHNRLITL-PEGIHLLPDL 294 Query: 176 KVVDLGTEFLTCDCHLRWLLPWAQNRSLQLSEHTL 210 Sbjct: 295 KVLDLH------ENENLVMPPKPNDARKKLAFYNI 323 Score = 25.4 bits (55), Expect = 0.46 Identities = 6/22 (27%), Positives = 6/22 (27%) Query: 94 NGSFLGLSLLEKLDLRNNIIST 115 Sbjct: 1 HGELSDLPRLRSVIVRDNNLKT 22 Score = 23.4 bits (50), Expect = 1.9 Identities = 4/19 (21%), Positives = 4/19 (21%) Query: 119 GAFLGLGELKRLDLSNNRI 137 Sbjct: 2 GELSDLPRLRSVIVRDNNL 20 Score = 22.6 bits (48), Expect = 3.7 Identities = 5/19 (26%), Positives = 5/19 (26%) Query: 145 FQGLPRLLRLNISGNIFSS 163 Sbjct: 4 LSDLPRLRSVIVRDNNLKT 22 >PUM Pumilio repeat RNA binding domain Length = 337 Score = 25.2 bits (55), Expect = 0.58 Identities = 11/73 (15%), Positives = 11/73 (15%), Gaps = 8/73 (10%) Query: 442 VYTAEAASFSDMMDVVYVAQMIQKFLGYV--DQIKELVEVMVDMASNL--MLVDEHLLWL 497 Sbjct: 44 MYEQIKPFFLDLILDPFGNYLVQKLCDYLTAEQKTLLIQT---IYPNVFQISINQYGTRS 100 Query: 498 AQRE-DKACSRIV 509 Sbjct: 101 LQKIIDTVDNEVQ 113 Score = 22.5 bits (48), Expect = 4.3 Identities = 10/51 (19%), Positives = 10/51 (19%), Gaps = 9/51 (17%) Query: 122 LGLGELKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDEL 172 Sbjct: 7 LPLRDLDYIKLATDQFGC------RFLQKKLETPSESNMVRDL---MYEQI 48 >14-3-3 14-3-3 protein alpha Helical domain Length = 270 Score = 24.2 bits (52), Expect = 1.2 Identities = 8/21 (38%), Positives = 8/21 (38%) Query: 226 QEAQLCCEGALELHTHHLIPS 246 Sbjct: 93 QELDTICQDILTVLEKHLIPN 113 >PDE cyclic NMP phosphodiesterase domain Length = 350 Score = 24.0 bits (51), Expect = 1.5 Identities = 11/29 (37%), Positives = 11/29 (37%), Gaps = 1/29 (3%) Query: 407 YSHCLYTNDITRVLYTFVLMPINASNALT 435 Sbjct: 66 YHNFTHAFDVTQTVYTF-LTSFNAAQYLT 93 >DHHC Novel zinc finger domain with DHHC signature Length = 217 Score = 22.0 bits (46), Expect = 5.5 Identities = 5/52 (9%), Positives = 5/52 (9%), Gaps = 5/52 (9%) Query: 761 GLHPVVYPCTALLLLCLFATIITYILNHSSIRVSRKGWHMLLNLCFHIAMTS 812 Sbjct: 50 PLQIVAWLLYLFFAVIGFGILVPLLPHHWVP-----AGYACMGAIFAGHLVV 96 >HOG HOG- intein(HINT) like domain Length = 389 Score = 21.9 bits (46), Expect = 5.7 Identities = 8/46 (17%), Positives = 8/46 (17%) Query: 471 DQIKELVEVMVDMASNLMLVDEHLLWLAQREDKACSRIVGALERIG 516 Sbjct: 225 EQMQNFVQLHTDGGAVLTVTPAHLVSVWQPESQKLTFVFAHRIEEK 270 >FYVE Zinc Finger domain involved in PtdIns(3)P binding Length = 99 Score = 22.0 bits (46), Expect = 6.1 Identities = 12/61 (19%), Positives = 12/61 (19%), Gaps = 14/61 (22%) Query: 696 AEGAEPVAAWWSQEGPGEAGGWTSEGCQLRSSQP---NVSAL-----------HCQHLGN 741 Sbjct: 6 ADEHGKLMCWDMNCKRVETPEWKTSDCCQKCNQPFFWNLQAMWQRKVVGLRQHHCRTCGS 65 Query: 742 V 742 Sbjct: 66 A 66 >ARM Armadillo repeat Length = 532 Score = 21.8 bits (46), Expect = 6.4 Identities = 15/89 (16%), Positives = 15/89 (16%), Gaps = 5/89 (5%) Query: 244 IPSLRQVVFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGDEQAGIL--LAESLIHDCTF 301 Sbjct: 74 LKDMVAGVWSDD-PALQLES--TTQFRKLLSIERSPPIEEVISAGVVPRFVEFLKKEDYP 130 Query: 302 ITSELTLSHIGVWASGEWECTVSMAQGNA 330 Sbjct: 131 AIQFEAAWALTNIASGTSDHTKVVIDHNA 159 >SH2 Src Homology domain 2 Length = 119 Score = 21.7 bits (46), Expect = 6.6 Identities = 5/26 (19%), Positives = 5/26 (19%), Gaps = 3/26 (11%) Query: 73 GLLPNGTVTLLLSNNKITGLRNGSFL 98 Sbjct: 10 KDITRKDAERQLLAP---GNSAGAFL 32 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 21.7 bits (45), Expect = 7.1 Identities = 14/135 (10%), Positives = 14/135 (10%), Gaps = 18/135 (13%) Query: 770 TALLLLCLFATIITYIL--NHSS---IRVSRKGWHMLLNLCFHIAMTSAVFAGGITLTNY 824 Sbjct: 74 IRFVEMMKPFNKPVFWIPGNHDFQPKMAEFLNQPPMNAAKHLLLGEHWQALL----LDSQ 129 Query: 825 QMVCQAVGITLHYSSLSTLLWMGVKAR------VLHKELTWRAPPPQEGDPALPTPSPML 878 Sbjct: 130 VYGV--PHGQLSQHQLDLLKETLGKNPERYTLVVLHHHLLPTNSAWLDQHNLRNSHELAE 187 Query: 879 RFYLIAGGIPLIICG 893 Sbjct: 188 VL-APFTNVKAILYG 201 >KIN Protein kinase domain Length = 313 Score = 21.6 bits (45), Expect = 7.3 Identities = 14/121 (11%), Positives = 14/121 (11%), Gaps = 11/121 (9%) Query: 463 IQKFLGYV-----DQIKELVEVMVDMASNLMLVDEHLLWLAQREDKACSR-IVGALERIG 516 Sbjct: 90 LISIKDILKPNSKEQFEDVYIVSELMDTDLHQIITSPQPLSDDHCQYFVYQMLRGLKHIH 149 Query: 517 GAALSPHAQHISVNARNVALEAYLIKPHSYVGLTCTAFQRREGGVPGTRPGSPGQNPPPE 576 Sbjct: 150 ----SANVLHRDLKPSNLLINEDCLLKICDLGL-ARVEDATHQGFMTEYVATRWYRAPEV 204 Query: 577 P 577 Sbjct: 205 I 205 >MIZFIN MIZ type Cysteine zinc DNA binding domain Length = 172 Score = 21.5 bits (45), Expect = 7.5 Identities = 7/29 (24%), Positives = 7/29 (24%) Query: 316 SGEWECTVSMAQGNASKKVEIVVLETSAS 344 Sbjct: 114 VFSKPCSVTVASDASKKKIDVIDLTIESS 142 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 12 Number of calls to ALIGN: 17 Length of query: 1400 Total length of test sequences: 20182 Effective length of test sequences: 16435.0 Effective search space size: 22418435.7 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] (1400 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|1881738 [246..599] Leucine-rich repeats 74 1e-14 gi|132575 [1..456] Leucine-rich repeats 71 1e-13 gi|1082610 [223..548] Leucine-rich repeats 66 4e-12 gi|1418519 [191..423] Leucine-rich repeats 65 6e-12 gi|2623618 [35..332] Leucine-rich repeats 65 1e-11 gi|1652821 [74..598] Periplasmic binding protein-like II 31 0.10 gi|485157 [209..476] Protein kinases (PK), catalytic core 28 0.79 gi|2499672 [624..853] Protein kinases (PK), catalytic core 27 1.8 gi|1536958 [63..316] Periplasmic binding protein-like I 27 3.4 gi|1673470 [12..233] Protein kinases (PK), catalytic core 26 3.6 gi|232142 [190..392] P-loop containing nucleotide triphospha... 26 4.2 gi|2370487 [513..702] Gln-tRNA synthetase (GlnRS), C-termina... 26 5.3 gi|1477763 [233..705] Lipoxigenase, C-terminal domain 25 8.2 gi|1652862 [45..566] Periplasmic binding protein-like II 25 8.5 >gi|1881738 [246..599] Leucine-rich repeats Length = 354 Score = 74.4 bits (180), Expect = 1e-14 Identities = 28/193 (14%), Positives = 28/193 (14%), Gaps = 24/193 (12%) Query: 57 RRVVCSGGDLPEPPEPGL--LPNGTVTLLLSNNKITGLRN-----GSFLGLSLLEKLDLR 109 Sbjct: 16 SRVEGTASAFEELGQAIAKNRNSALTSIDWSNNLIKDAGVAALAAAVASMGHGLTSISLK 75 Query: 110 NNIISTVQPGAFLG--------LGELKRLDLSNNRIGCLTSETFQG----LPRLLRLNIS 157 Sbjct: 76 GGDATKKGTVALCTAFKKNVEMSRTLTVLNLAGNRLDSDGTSALAAFVSGPNALQTLNIS 135 Query: 158 GNIFSSLQ--PGVFDELPALKVVDLGTEFLT---CDCHLRWLLPWAQNRSLQLSEHTLCA 212 Sbjct: 136 GTAANLEMLLPAVMRGCTELEKFNISHNKVTAKTGPELKKFLQSCGRLSELHMRDTAVPV 195 Query: 213 YPSALHAQALGSL 225 Sbjct: 196 QVVRDVIKAIIGN 208 Score = 60.8 bits (145), Expect = 2e-10 Identities = 13/91 (14%), Positives = 13/91 (14%), Gaps = 7/91 (7%) Query: 81 TLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCL 140 Sbjct: 271 ELHLGDNWTRNKTKARSQAVDNLIELISSECPLHKLDL-------SCKVADNQIKTDILP 323 Query: 141 TSETFQGLPRLLRLNISGNIFSSLQPGVFDE 171 Sbjct: 324 FIYSLATNDTLKELDISGNAMGDKVGIALGK 354 Score = 26.5 bits (57), Expect = 3.4 Identities = 10/95 (10%), Positives = 10/95 (10%), Gaps = 5/95 (5%) Query: 127 LKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSL----QPGVFDELPALKVVDLGT 182 Sbjct: 217 DLAANKLGVLGANMLAGLAAEITTIKSLDLTDNDFGDEGMSIIADGLCHNSSLRELHLGD 276 Query: 183 EFLTCDCHLRWLLPWAQNRSLQLSEHTLCAYPSAL 217 Sbjct: 277 NWTRNKTK-ARSQAVDNLIELISSECPLHKLDLSC 310 >gi|132575 [1..456] Leucine-rich repeats Length = 456 Score = 71.0 bits (171), Expect = 1e-13 Identities = 26/165 (15%), Positives = 26/165 (15%), Gaps = 20/165 (12%) Query: 80 VTLLLSNNKITGLR-NGSFLGLSLLEKLDLRNNIISTVQPG----AFLGLGELKRLDLSN 134 Sbjct: 1 MSLDIQCEQLSDARWTELLPLIQQYQVVRLDDCGLTEVRCKDIRSAIQANPALTELSLRT 60 Query: 135 NRIGCLTSETFQGL-----PRLLRLNISGN----IFSSLQPGVFDELPALKVVDLGTEFL 185 Sbjct: 61 NELGDAGVGLVLQGLQNPTCKIQKLSLQNCSLTEAGCGVLPDVLRSLSTLRELHLNDNPL 120 Query: 186 TCDCHLRWLLPWAQN-----RSLQLSEHTLCAYPSALHAQALGSL 225 Sbjct: 121 GDE-GLKLLCEGLRDPQCRLEKLQLEYCNLTATSCEPLASVLRVK 164 Score = 62.4 bits (149), Expect = 5e-11 Identities = 18/108 (16%), Positives = 18/108 (16%), Gaps = 14/108 (12%) Query: 84 LSNNKITGLRNGSFL-----GLSLLEKLDLRNNIISTVQ----PGAFLGLGELKRLDLSN 134 Sbjct: 343 MSSNPLGDSGVVELCKALGYPDTVLRVLWLGDCDVTDSGCSSLATVLLANRSLRELDLSN 402 Query: 135 NRI-----GCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKV 177 Sbjct: 403 NCMGDNGVLQLLESLKQPSCILQQLVLYDIYWTDEVEDQLRALEEERP 450 Score = 58.5 bits (139), Expect = 6e-10 Identities = 41/251 (16%), Positives = 41/251 (16%), Gaps = 38/251 (15%) Query: 81 TLLLSNNKITGLRNGSFL----GLSLLEKLDLRNNIIS-----TVQPGAFLGLGELKRLD 131 Sbjct: 198 SLKLENCGITSANCKDLCDVVASKASLQELDLGSNKLGNTGIAALCSGLLLPSCRLRTLW 257 Query: 132 LSNNRIGCLTSET----FQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTC 187 Sbjct: 258 LWDCDVTAEGCKDLCRVLRAKQSLKELSLAGNELKDEGAQLLCESLLEPGCQLESLWVKT 317 Query: 188 DCHLRWLLPWAQNRSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELH--THHLIP 245 Sbjct: 318 CSLTAA------------SCPHFCSVLTKNSSLFELQMSSNPLGDSGVVELCKALGYPDT 365 Query: 246 SLRQVVFQGDRL---PFQCSASYLGNDTRIRW---YHNRAPVEGDEQAGILLAESLIHDC 299 Sbjct: 366 VLRVLWLGDCDVTDSGCSSLATVLLANRSLRELDLSNNCMGDNGVLQ----LLESLKQPS 421 Query: 300 TFITSELTLSH 310 Sbjct: 422 CILQ-QLVLYD 431 >gi|1082610 [223..548] Leucine-rich repeats Length = 326 Score = 66.1 bits (159), Expect = 4e-12 Identities = 27/198 (13%), Positives = 27/198 (13%), Gaps = 24/198 (12%) Query: 82 LLLSNNKITGLRNGSFLGLSLLEKLDLRNNII------STVQPGAFLGLGELKRLDLSNN 135 Sbjct: 125 GFPRGAQPAPLLCSVLKASGSLQQLSLDSATFASPQDFGLVLQTLKEYNLALKRLSFHDM 184 Query: 136 RIGCLTS--ETFQGLPRLLRLNISGNIFSSLQP--------GVFDELPALKVVDLG---- 181 Sbjct: 185 NLADCQSEVLFLLQNLTLQEITFSFCRLFEKRPAQFLPEMVAAMKGNSTLKGLRLPGNRL 244 Query: 182 -TEFLTCDCHLRWLLPWAQNRSLQLSEHTLCAYPSALHAQALGSLQEAQLCCEGALELHT 240 Sbjct: 245 GNAGLLALADVFSEDSSSSLCQLDISSNCIKPDGLLEFAKRLERWGRGAF---GHLRLFQ 301 Query: 241 HHLIPSLRQVVFQGDRLP 258 Sbjct: 302 NWLDQDAVTAREAIRRLR 319 Score = 59.4 bits (142), Expect = 4e-10 Identities = 21/124 (16%), Positives = 21/124 (16%), Gaps = 16/124 (12%) Query: 84 LSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGE-------------LKRL 130 Sbjct: 2 LSTEAALTLCH-LLSSWVSLESLTLSYNGLGSNIFRLLDSLRALSGQAGCRLRALHLSDL 60 Query: 131 DLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTCDCH 190 Sbjct: 61 FSPLPILELT-RAIVRALPLLRVLSIRVDHPSQRDNPGVPGNAGPPSHIIGDEEIPE-NC 118 Query: 191 LRWL 194 Sbjct: 119 LEQL 122 >gi|1418519 [191..423] Leucine-rich repeats Length = 233 Score = 65.2 bits (157), Expect = 6e-12 Identities = 22/118 (18%), Positives = 22/118 (18%), Gaps = 12/118 (10%) Query: 75 LPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQ----PGAFLGLGELKRL 130 Sbjct: 99 LRNNSIGDSGVRHICDGLRHREAVEKSSLSAMVLWNNNVTGASMDSLAEALIENTKIETL 158 Query: 131 DLSNNRIGCLTSETFQG----LPRLLRLNISGNIFSSLQ----PGVFDELPALKVVDL 180 Sbjct: 159 NIGNNNLGVEGIARLKPALASNSHLHRLGLQNTGINCEGAIILAECIADNIALLRVDI 216 Score = 53.5 bits (127), Expect = 2e-08 Identities = 14/98 (14%), Positives = 14/98 (14%), Gaps = 14/98 (14%) Query: 96 SFLGLSLLEKLDLRNNIISTVQP------GAFLGLGELKRLDLSNNRIGCLT----SETF 145 Sbjct: 1 SIRHAVSLQMLNLRYTNLNDRSIPALCKMARAQPSASLTCLHLENTQMSGKNLLVLICAL 60 Query: 146 QGLPRLLRLNISGNIFSSLQ----PGVFDELPALKVVD 179 Sbjct: 61 KNNTGLRELYLGDNGLQPTDGSHIYQLITSNSSLQLLD 98 >gi|2623618 [35..332] Leucine-rich repeats Length = 298 Score = 64.6 bits (155), Expect = 1e-11 Identities = 24/177 (13%), Positives = 24/177 (13%), Gaps = 16/177 (9%) Query: 65 DLPEPPEPGLLPNGTVTLLLSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQ----PGA 120 Sbjct: 108 SLREMRFNNNGLGIGGKLMAEALITCHEKSTKAGKPLALKVFIAGRNRLENPGATVLAKA 167 Query: 121 FLGLGELKRLDLSNNRIGCLTS----ETFQGLPRLLRLNISGNIFSSLQ----PGVFDEL 172 Sbjct: 168 FKIIGTLEEIALPQNGINYEGITALAEAVEYSHNLKILNLNDNTFTARGAKPMAKAIKNL 227 Query: 173 PALKVVDL-GTEFLTCDCHLRWLLPWAQN---RSLQLSEHTLCAYPSALHAQALGSL 225 Sbjct: 228 SKLEVINFGDCLVRSEGADAIANSLREGVPSLKELNLAFGEIKKEAAVRVAESMDTK 284 Score = 50.6 bits (119), Expect = 2e-07 Identities = 13/80 (16%), Positives = 13/80 (16%), Gaps = 4/80 (5%) Query: 118 PGAFLGLGELKRLDLSNNRIGCLTSETF----QGLPRLLRLNISGNIFSSLQPGVFDELP 173 Sbjct: 7 VAAVEACEGIQSLKLNGNTIGVEAAQALAKALESKPQFQRARWSDMFTGRLRSEIPPALM 66 Query: 174 ALKVVDLGTEFLTCDCHLRW 193 Sbjct: 67 SLGAGIMTAGAHLVEIDLSD 86 Score = 33.5 bits (75), Expect = 0.021 Identities = 4/52 (7%), Positives = 4/52 (7%), Gaps = 5/52 (9%) Query: 69 PPEPGLLPNGTVTLLLSNNKITGLRNGSFL----GLSLLEKLDLRNNIISTV 116 Sbjct: 7 VAAVEACEGIQS-LKLNGNTIGVEAAQALAKALESKPQFQRARWSDMFTGRL 57 >gi|1652821 [74..598] Periplasmic binding protein-like II Length = 525 Score = 31.2 bits (70), Expect = 0.10 Identities = 14/78 (17%), Positives = 14/78 (17%), Gaps = 10/78 (12%) Query: 1081 SPAAPHAPPRALPAAAEDGSPVFGEGPPSLKSSPSGSSGHPLALGPCKLTNLQLAQSQVC 1140 Sbjct: 130 PILPAHILRETVEKTDENGNPLFLSTW-----GTNTPPDQIVVNGPYKLTRYVTSQRVIF 184 Query: 1141 EA-----GAAAGGEGEPE 1153 Sbjct: 185 SKNPYYWEKGPNGESLPY 202 >gi|485157 [209..476] Protein kinases (PK), catalytic core Length = 268 Score = 28.2 bits (61), Expect = 0.79 Identities = 12/147 (8%), Positives = 12/147 (8%), Gaps = 18/147 (12%) Query: 452 DMMDVVYVAQMIQKFLGYVDQIKELVEVMVDMASNLMLVDEHLLWLAQREDKACSRIVGA 511 Sbjct: 97 SENERLDILAQICRGMSFLHTRTPSI-VHGDLAARNVLLKKHPVY--IKKYICKITDLGL 153 Query: 512 LERIGGAALSPHAQHISVNARNVALEAYLIKPHSY----VGLTCTAFQRREGGVPGTRPG 567 Sbjct: 154 AKTCLDELYTSYDDPTKIPFKWLPPEVLGSRTLSLKTDIWAFGIVCFEVCDKV------- 206 Query: 568 SPGQNPPPEPEPPADQQLRFRCTTGRP 594 Sbjct: 207 ----GEPYGACVPTSNLCQYLTDGYVH 229 >gi|2499672 [624..853] Protein kinases (PK), catalytic core Length = 230 Score = 27.3 bits (59), Expect = 1.8 Identities = 7/109 (6%), Positives = 7/109 (6%), Gaps = 13/109 (11%) Query: 770 TALLLLCLFATIITYILNHSSIRVSRKGWHMLLNLCFHIAMT----SAVFAGGITLTNYQ 825 Sbjct: 56 CLLDLMHGLVRGMHYLEDNKIIHNYIRCSNLYVTKYDPNSYVLDAKISDPGYPRPYRESD 115 Query: 826 MVCQA-----VGITLHYSSLSTLLWMGVKARVLHKELTWRAPPPQEGDP 869 Sbjct: 116 SPWIPVKYYRNLQAAKTDQFAQLWAFA----TTIYEIFSRCKEDLSTLR 160 >gi|1536958 [63..316] Periplasmic binding protein-like I Length = 254 Score = 26.5 bits (58), Expect = 3.4 Identities = 5/79 (6%), Positives = 5/79 (6%), Gaps = 6/79 (7%) Query: 411 LYTNDITRVLYTFVLMPINASNALTLAHQLRVYTAEAASFSDMMDVVYVAQMIQKFLGYV 470 Sbjct: 175 IDPDEKTLVFIPNCWALPLVFTVI-KELNYNLPQVGLIGFDN----TEWTCFSSPSVSTL 229 Query: 471 DQ-IKELVEVMVDMASNLM 488 Sbjct: 230 VQPSFEEGQQATKILIDQI 248 >gi|1673470 [12..233] Protein kinases (PK), catalytic core Length = 222 Score = 26.3 bits (56), Expect = 3.6 Identities = 10/101 (9%), Positives = 10/101 (9%), Gaps = 4/101 (3%) Query: 226 QEAQLCCE-GALELHTHHLIPSLRQVVFQGDRLPFQCSASYLGNDTRIRWYHNRAPVEGD 284 Sbjct: 47 TELSVFRKLDSLGNSKRNGFPQFCENFQVDGYYAIVLSDEGDNLNDIRKRNKNLVFNRKN 106 Query: 285 ---EQAGILLAESLIHDCTFITSELTLSHIGVWASGEWECT 322 Sbjct: 107 VATVGCAVSRSLSTLHSIGFFHADLHEQNLMVPYPCRKINI 147 >gi|232142 [190..392] P-loop containing nucleotide triphosphate hydrolases Length = 203 Score = 26.0 bits (56), Expect = 4.2 Identities = 6/46 (13%), Positives = 6/46 (13%), Gaps = 2/46 (4%) Query: 174 ALKVVDL--GTEFLTCDCHLRWLLPWAQNRSLQLSEHTLCAYPSAL 217 Sbjct: 40 IIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVIL 85 >gi|2370487 [513..702] Gln-tRNA synthetase (GlnRS), C-terminal (anticodon-binding) domain Length = 190 Score = 25.6 bits (55), Expect = 5.3 Identities = 19/74 (25%), Positives = 19/74 (25%), Gaps = 11/74 (14%) Query: 437 AHQLRVYTAEAASFSDMMDV-------VYVAQMIQKFLGYVDQIKELVEVMVDMASNLML 489 Sbjct: 46 ANEILIEQADAQSFKQDEEVTLMDWGNAYVREINRDASGKVTSLK--LELHLD--GDFKK 101 Query: 490 VDEHLLWLAQREDK 503 Sbjct: 102 TEKKVTWLADTEDK 115 >gi|1477763 [233..705] Lipoxigenase, C-terminal domain Length = 473 Score = 25.2 bits (55), Expect = 8.2 Identities = 25/147 (17%), Positives = 25/147 (17%), Gaps = 27/147 (18%) Query: 614 LPPSLFSSLPAALAPPVPPDCTLQLLVFRNGRLF----------HSHSNTSRPGAAGPGK 663 Sbjct: 53 FINKTY-DWDAAMQRASGEDSTFSEMI-DAGRVYIVEFSYLYDLSNVEDLTERNTTDRRE 110 Query: 664 RRGVATP-VIFAGTSGCGVGNLTEPVAV--SLRHWAEGAEPVAAWWSQEGPGEAGGWTSE 720 Sbjct: 111 MRGSRAPICTFVSANVGGVRELK-PVAIQTDQSSDAPVFSPVDGSK----------WLMA 159 Query: 721 GCQLRSSQPNVSALHCQHLGNVAVLME 747 Sbjct: 160 KQEFQRADITYAEII-EHLLKTHLLME 185 >gi|1652862 [45..566] Periplasmic binding protein-like II Length = 522 Score = 24.9 bits (54), Expect = 8.5 Identities = 8/58 (13%), Positives = 8/58 (13%), Gaps = 9/58 (15%) Query: 155 NISGNIFSSL---------QPGVFDELPALKVVDLGTEFLTCDCHLRWLLPWAQNRSL 203 Sbjct: 26 EASRIALEPLASFDQEGNLVPFLAAEIPSVENGGVAADGLSVTWKLKPDVLWSDGQPF 83 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 14 Number of calls to ALIGN: 22 Length of query: 1400 Total length of test sequences: 256703 Effective length of test sequences: 207231.0 Effective search space size: 281427601.4 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Tue Jan 22 00:20:53 2002 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 gi|17511205|ref|NP_116166.6| using self-comparison > 1 gi|17511205|ref|NP_116166.6| len 1331 from 84 to 151 vs gi|17511205|ref|NP_116166.6| len 1331 from 108 to 175 score 118 eval 6.420982e-08 identity 42.65% K 2.085770e-02 L 2.294784e-01 H 1.299330e+00 alpha 1.011140e-01 84 LSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGCLTSE 143 gi|17511205|ref|NP_116166.6| | || |: :: |:|||| |::||| || | : | || | ||::| | | 108 LRNNIISTVQPGAFLGLGELKRLDLSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPG 167 gi|17511205|ref|NP_116166.6| 144 TFQGLPRL 151 gi|17511205|ref|NP_116166.6| | || | 168 VFDELPAL 175 gi|17511205|ref|NP_116166.6| > 2 gi|17511205|ref|NP_116166.6| len 1331 from 84 to 139 vs gi|17511205|ref|NP_116166.6| len 1331 from 132 to 187 score 102 eval 2.524527e-06 identity 42.86% K 2.085770e-02 L 2.294784e-01 H 1.299330e+00 alpha 1.011140e-01 84 LSNNKITGLRNGSFLGLSLLEKLDLRNNIISTVQPGAFLGLGELKRLDLSNNRIGC 139 gi|17511205|ref|NP_116166.6| ||||:| | : :| || | :|:: || |::||| | | || :|| : | 132 LSNNRIGCLTSETFQGLPRLLRLNISGNIFSSLQPGVFDELPALKVVDLGTEFLTC 187 gi|17511205|ref|NP_116166.6| > 3 gi|17511205|ref|NP_116166.6| len 1331 from 1066 to 1219 vs gi|17511205|ref|NP_116166.6| len 1331 from 1070 to 1231 score 57 eval 7.418301e-02 identity 31.03% K 2.085770e-02 L 2.294784e-01 H 1.299330e+00 alpha 1.011140e-01 1066 RRDVRASWRACCPPASPAAPHAPPRALPAAAE---DGSPVFGEGPPSLKSSPSGSSGHPL 1122 gi|17511205|ref|NP_116166.6| | || | | | | | | | : :| | | | | 1070 RASWRACCPPASPAAPHAPPRALPAAAEDGSPVFGEGPPSLKSSPSGSSGHPLALG--PC 1127 gi|17511205|ref|NP_116166.6| 1123 ALGPCKLTNLQLAQSQVCEAGAAAGGEGEP----------EPAGT-RGNLAHRHPNNVHH 1171 gi|17511205|ref|NP_116166.6| | :| |: :: | | || || | | ||: | 1128 KLTNLQLAQSQVCEA---GAAAGGEGEPEPAGTRGNLAHRHPNNVHHGRRAHKSRAKGHR 1184 gi|17511205|ref|NP_116166.6| 1172 -GRRAHKSRAKGHRAGEACGKNRLKALRGGAAGALELLSSES--GSLHNSP 1219 gi|17511205|ref|NP_116166.6| | |:| | | | | | : :|:| ::| || ||| 1185 AGEACGKNRLKALRGGAAGALELLSS----ESGSLHNSPTDSYLGSSRNSP 1231 gi|17511205|ref|NP_116166.6| ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ TIGRFAM hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00099 TIGR00099: conserved hypothetical protein 1.5 14 1 TIGR00097 HMP-P_kinase: phosphomethylpyrimidine kinas -0.1 76 1 TIGR00797 matE: MATE efflux family protein -217.3 12 1 TIGR00813 sss: SSS sodium solute transporter superfam -230.6 41 1 TIGR00933 2a38: K+ Transporter (Trk) Family protein -277.8 99 1 TIGR00801 ncs2: uracil-xanthine permease -319.6 62 1 TIGR00819 ydaH: AbgT putative transporter family -354.3 93 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00099 1/1 459 467 .. 302 310 .] 1.5 14 TIGR00097 1/1 671 682 .. 258 269 .] -0.1 76 TIGR00797 1/1 457 940 .. 1 412 [] -217.3 12 TIGR00819 1/1 495 954 .. 1 527 [] -354.3 93 TIGR00933 1/1 748 1029 .. 1 409 [] -277.8 99 TIGR00801 1/1 725 1057 .. 1 480 [] -319.6 62 TIGR00813 1/1 691 1076 .. 1 448 [] -230.6 41 Alignments of top-scoring domains: TIGR00099: domain 1 of 1, from 459 to 467: score 1.5, E = 14 *->vAqflkefL<-* vAq++++fL gi|1751120 459 VAQMIQKFL 467 TIGR00097: domain 1 of 1, from 671 to 682: score -0.1, E = 76 *->iaqalnigvGhG<-* ++ a ++g+G+G gi|1751120 671 VIFAGTSGCGVG 682 TIGR00797: domain 1 of 1, from 457 to 940: score -217.3, E = 12 *->PailanilqpllglvDtafvGhlg.pvdLaAvglgissvfmllisil ++++q++ f+G++++ +L+ v + +s +ml+ l gi|1751120 457 ----VYVAQMIQK-----FLGYVDqIKELVEVMVDMASNLMLVDEHL 494 lGlgtattalvaqavGagnrq............................. l l+ + +++ vGa r ++ +++ ++ + + ++ + +++ gi|1751120 495 LWLAQREDKACSRIVGALERIggaalsphaqhisvnarnvaleaylikph 544 .................................................. + + + + ++++++ +++++++++++++++++++ +++ + + +++++ gi|1751120 545 syvgltctafqrreggvpgtrpgspgqnpppepeppadqqlrfrcttgrp 594 .................................................. + + ++ + +++ + + +++ ++ + ++ +++ + + +++ gi|1751120 595 nvslssfhiknsvalasiqlppslfsslpaalappvppdctlqllvfrng 644 .................rlarqaqqslllalllGlpvllvglffidplls + ++++++++++ ++ +r + +++++a++ G +v ++ + + l + gi|1751120 645 rlfhshsntsrpgaagpGKRRGVATPVIFAGTSGCGVGNLTEPVAVSLRH 694 lmgadgevaelAqeYLrililg..iPayllsfvlr.gfLrGqgdTktpmy + ++ ++va +++++ P+ + + ++ Lr+ + + ++ gi|1751120 695 WAEGAEPVAA---------WWSqeGPGEAGGWTSEgCQLRSSQPNVSALH 735 itlignllNivLdyiLIfGkFGfP.elGivGAAlATvisywlmfllllyy + +gn++ +++ + fP+e G++GA+l + ++y +++llll++ gi|1751120 736 CQHLGNVA-----VLMEL--SAFPrEVGGAGAGL-HPVVYPCTALLLLCL 777 ikksaqakkiglkwegllkpdwevlkrllklGlPialrvileslsfalva + + ++ + ++++++ + G+ +l++++ ++ +v gi|1751120 778 FA-------TIIT----YILNHSSIRVS-RKGWHMLLNLCFHIAMTSAVF 815 llvarlGg.sialAahqvalnvesllfmpafGlgiAvsilVGqaLGagdp + + l + + a +++l +sl++++ +G+ + ++ gi|1751120 816 AGGITLTNyQMVCQAVGITLHYSSLSTLLWMGV-------------KARV 852 krAkevarvalklslllglvlaiililfrevglPiarlFTndpeVlelaa + ++++r + +++ dp+ + + gi|1751120 853 LHKELTWR------------------APPPQ--------EGDPALPTPSP 876 syLifvalfqfsdgiqfvlsGvlrGaGd..tkviliitfisywvigLPvg + +++ + ++++i+ ++ +v++ ++++ ++ + s+ +++Pv+ gi|1751120 877 MLRFYLIAGGIPLIICGITAAVNIHNYRdhSPYCWLVWRPSLGAFYIPVA 926 yllgrTDlfytglGlkGlW<-* ++l ++t++ ++ + gi|1751120 927 LIL-----LITWIYFLCAG 940 TIGR00819: domain 1 of 1, from 495 to 954: score -354.3, E = 93 *->mPMSqssiPdarrSGkllrtVErlGNkl..PHP.........vlLFI + + q + + ++ + Er+G +PH ++ + + ++v L gi|1751120 495 LWLAQR---EDKACSRIVGALERIGGAAlsPHAqhisvnarnVALEA 538 ilI....vlLlvasAv........laaFGlSvkdPRPvGAKGRaDdtlvv +lI++++ + l +A ++++++ ++ G +P P aD l gi|1751120 539 YLIkphsYVGLTCTAFqrreggvpGTRPGSPGQNP-PPEPEPPADQQL-R 586 VksLLdvdGLkiltnvvKNFsGFAPLGavLvllLGvGlAEkvGLlsALMv + +KN ++ + + L s L gi|1751120 587 FRCTTGRPNVSLSSFHIKN------------SVALASIQLPPSLFSSLPA 624 kllskvnaklasyMVlFi.ailSniaSdlalVvliPlsAlIFlalG.RHP l+ v+ + + +F ++ l + S+ P A G+R gi|1751120 625 ALAPPVPPDCTLQLLVFRnGRLFHSHSNTS----RPGAA----GPGkRRG 666 lAGLlAAiAGVsgGysANLllvTiDvLLaGIsteAAaiinPdyvVsvidN +A + +AG sg NL + + A + +++ gi|1751120 667 VATPV-IFAGTSGCGVGNL----TEPV-------AVSLRHWAEGAEPVAA 704 WyFMAaSvvVlalvGylvTdKIvEPrLGqyQsDLssdEKDiRHlnelTel W+ S G+ + q s s+ H + l gi|1751120 705 WW----SQEGPGEAGGWTSEGC------QLRS--SQPNVSALHCQHL--- 739 erkGLiiAGVVfllliAllAlsvvPadGILRdPitglVag.SPFlKsIVv G V +l+ ls P + + g ag P + gi|1751120 740 --------GNVAVLME----LSAFPRE------VGGAGAGlHPVVYPCTA 771 lIiLlFalvslvYGiiTRslRreadlvnlMiEsMkelalylVi...vFfl l +L + + Y s+R + +++ l + +++ vF gi|1751120 772 LLLLCLFATIITYILNHSSIRVSRKGWHMLL----NLCFHIAMtsaVF-- 815 AQFVAfFNWsNiGkyIAVklaviLkevGLaGivlFvGliLlsaFlnlfIa A + + N + + AV+ + L + L+ l G l gi|1751120 816 AGGITLTNYQMVCQ--AVGIT--LHYSSLS-TLLWMGVKARVLHKELTWR 860 SaSAiWavlA.PIFVPMlMLlGyaPeviQilyRIaDSv.vlilaPvssyv + A P PMl a + i+ I v++ s+y gi|1751120 861 APPPQEGDPAlPTPSPMLRFYLIAGGIPLIICGITAAVnIHNYRDHSPYC 910 gLilaflqkYKkDAklGTliSlvLPYsliFLvvWllllliWvlVLGLPvG L+ +++ lG ++ P li L+ W+ l++ G gi|1751120 911 WLV------WRP--SLGAFY---IPVALILLITWIYFLCAGLRL----RG 945 PGAPtlYPals<-* P P+ gi|1751120 946 PL--AQNPKAG 954 TIGR00933: domain 1 of 1, from 748 to 1029: score -277.8, E = 99 *->LlaLPfvisssplsfiDALFtAtSAvTvTGLTVldtleslpkaglfl L+a+P ++ v + G+ + + +p l+l gi|1751120 748 LSAFP----RE--------------VGGAGAGLHP-VV-YPCTALLL 774 rslLqqIGGLGfmvlavailillGkKislgarlylaEalgplsmdkllpl + l +a+ i ++l+ + r ++ g + m gi|1751120 775 LCL-----------FATIITYILNH---SSIRVSRK---G-WHM------ 800 ikkiakfifvIyllgaillavrfvleGGwemsLFdAifHsiSAfnNGGFS ll ++f++++ +s +A + +++++ + gi|1751120 801 -----------------LLNLCFHIAM---TSAVFAGGITLTNYQ---MV 827 thsdssigYfndplvnlViafLiIlGGlgFtVhydvylllrkkvkkkffk ++ +++++ ++L +G ++h + l++r + gi|1751120 828 CQAVGITLHYSSL------STLLWMGVKARVLHKE--LTWR----A---P 862 g....kvlsldtklvlfvtllllaigfililllergNpktlyslslgall ++++++ + + ++l+++l++ i+ i+ ++ + +++ gi|1751120 863 PpqegDPALPTPSPMLRFYLIAGGIPLIICGITAA------V-NIH---- 901 lsaFfqsstlrTAGFstlD....nlaalseatlvlllllMfIGGasGSTa ++ D+++ + ++++ + ++ ++I gi|1751120 902 ---------------NYRDhspyCWLVWRPSLGAFYIPVALI-------- 928 GGIKvttfaillksvyseirrgkhpriVkiifsrkiggktiekailisvv ++i+++++ r+++p + + + +++++a ++ + gi|1751120 929 -------LLITWIYFLCAGLRLRGPLA---QNPKAGNSRASLEAGEELRG 968 sfflvfalifllsllelissgydfltllFEvvSAfatVGLSvGnvgtasl s+ l +l d +ll++ ++ ++t G+ g gi|1751120 969 STRLRGSGPLL----------SDSGSLLATGSARVGTPGPPED--GDSLY 1006 taeLpdlgKli...livlMfiGR<-* ++ ++++++ l +M+ gi|1751120 1007 SPGVQLGALVTthfLYLAMWACG 1029 TIGR00801: domain 1 of 1, from 725 to 1057: score -319.6, E = 62 *->RppflrrqtlvlgLQH............................... R + + + l +QH ++ + + +++ ++ + + ++ + gi|1751120 725 RSSQP--NVSALHCQHlgnvavlmelsafprevggagaglhpvvypc 769 ...LlAMFgatVlVPlLvGlapGLqgLsaeqlqYLvsiaLltsGigTLLQ + Ll + at+ +L ++++ +s +L++ gi|1751120 770 talLLLCLFATIITYILNHSSI---RVSRKGWHMLLN------------- 803 lfrtggqgtrYYVGiGlPsyLGsSFafvaVaiglpmiaigsgyCqlDgqG +++++ + +S++f + ++ + +++ + gi|1751120 804 -LCFHIAM--------------TSAVFAGGITLTNYQMVCQA-------- 830 nrlGipaimggliatGlvytllallikklgtrwlrrlfPPvVtGpVVmlI Gi +++l t l ++ a+++ k +t gi|1751120 831 --VGITLHYSSL-STLLWMGVKARVLHKELT------------------- 858 GLsLipvAvkmaaGGeaakssnnGaLCPSAdAPrPltygslenllvafvV + + ++ ++ l+++ gi|1751120 859 -----------WRAPPPQEGDP------------ALPTP----------- 874 LaliillnrFakGflksisILiGivvGYllAlamGlVdfsp.vkeApwfs +++l+ I Gi +++ + +V+ + ++ p gi|1751120 875 -----------SPMLRFYLIAGGIPL--IICGITAAVNIHNyRDHSPYCW 911 lPkpftfGvdYapsFelpailtmlpVaivtlvEsiGdltA.........t l++ + G a+ + ++il+++ +++ ++ +A++++ ++++ gi|1751120 912 LVWRPSLG---AFYIPVALILLITWIYFLCAGLRLRGPLAqnpkagnsrA 958 advtgrdlsgetydprLhrG..vlaDGlatll..aglvGgfPnTtFaQNi + g +l+g + rL +++++l+D + l ++++ vG+++ gi|1751120 959 SLEAGEELRG---STRLRGSgpLLSDSGSLLAtgSARVGTPG-------- 997 GVialTrVaSryVivgaAviLialGfvPKiaAlitsIPspVLGGasivmF + + ++v ++Al+t+ +++ gi|1751120 998 ------PPEDGDSLYSPGV---------QLGALVTT----------HFLY 1022 GmIaasGiRiLirnkldfrrnlliiaaSvgLGlGvtgapdifgnfpPesd + ++ l v+++ ++ p gi|1751120 1023 -------LAMWACGALA-----------------VSQR--WL---P---- 1039 NrdLEGleAiellllsGialagivAilLNlI<-* +++ + +l g++A L l+ gi|1751120 1040 ----------RVVCS---CLYGVAASALGLF 1057 TIGR00813: domain 1 of 1, from 691 to 1076: score -230.6, E = 41 *->YflAGRslgglvvaaSLfAsniSasqflGlpGaiYayGLgevll.ai sl ++ +a +A++ S + +pG++ G ++++++ gi|1751120 691 ------SLRHWAEGAEPVAAWWSQE----GPGEA---G---GWTsEG 721 gfyelgaLvllii....lgwlfvpvyikngalTmpeyLekRFgkghRilr +++ + ++ ++ ++lg ++v+ ++ + p+ + + ++ gi|1751120 722 CQLRSSQPNVSALhcqhLGNVAVLMELSA----FPREVGGAGAG-LHPVV 766 glsvLslilyiflymsvdlfsGAlllelvlGldlylsllllgaitilYTv + ++ l+l++f ++ ++ + + + G+ +l +l + ++++ ++ gi|1751120 767 YPCTALLLLCLFATIITYILNHSSIRVSRKGWH-MLLNLCFHIAMTSAVF 815 fGGlkAvvwTDtiQCavimilGtlilpvfafsevGGslgsvvekasqaap GG+ + + + Q av ++ l + +s +g +++ + + gi|1751120 816 AGGITLTNYQMVCQ-AV-----GITLHYSSLSTLLW-MGVKARVLHKELT 858 slvvpsggNltiksCyt.PvddlldffrdpltsdlpwgagvfGlphvalw + P+ + d + +++ +l+ + G+p++ gi|1751120 859 ---------------WRaPPPQEGDPALPTPSPMLRFYLIAGGIPLI--- 890 YwctnQvivqRclAaksakhak.kgcliagvl...kllpmflavlpGlia + ++Aa +++ ++++ + +v+++++ +++++ l+ li gi|1751120 891 --------ICGITAAVNIHNYRdHSPYCWLVWrpsLGAFYIPVALILLIT 932 ralytdpiacidPalagavcgtevqasdqayPlLvvkllpdlpglaGlfl +++ +c+ l g+ +q++ ++ + + l g l gi|1751120 933 WIYF----LCAGLRLR----GPLAQNPKAGNSRASLEAGEELRGSTRLRG 974 AvllAAvmStlssiLnsaStvft......mD...lYkkiirpaaetekli +++l +S s+ L+++ ++ ++++++++D+++lY++ a gi|1751120 975 SGPL---LSD-SGSLLATGSARVgtpgppEDgdsLYSPGVQLGAL----- 1015 vgrgrvavlvaiviailpavaaaqggqvlqlvqeafgilgapflpvfvLG v+ ++ + ++ a++ v + +v+ ++ +++a++l++fv++ gi|1751120 1016 VTTHFLYLAMWACGALA--VSQRWLPRVVCSCLY---GVAASALGLFVFT 1060 lFwkRmNakGAlaGPimiiG<-* + R + A + + gi|1751120 1061 HHCARRRDVRASWR----AC 1076 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm-f Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00099 TIGR00099: conserved hypothetical protein 1.5 13 1 TIGR00097 HMP-P_kinase: phosphomethylpyrimidine kinas -0.1 76 1 TIGR00936 ahcY: adenosylhomocysteinase -0.7 32 1 TIGR00297 TIGR00297: conserved hypothetical protein T -0.7 92 1 TIGR00886 2A0108: nitrate transporter -1.3 70 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00099 1/1 459 467 .. 302 310 .] 1.5 13 TIGR00097 1/1 671 682 .. 258 269 .] -0.1 76 TIGR00297 1/1 766 785 .. 1 20 [. -0.7 92 TIGR00886 1/1 927 937 .. 1 11 [. -1.3 70 TIGR00936 1/1 1009 1030 .. 401 422 .] -0.7 32 Alignments of top-scoring domains: TIGR00099: domain 1 of 1, from 459 to 467: score 1.5, E = 13 *->vAqflkefL<-* vAq++++fL gi|1751120 459 VAQMIQKFL 467 TIGR00097: domain 1 of 1, from 671 to 682: score -0.1, E = 76 *->iaqalnigvGhG<-* ++ a ++g+G+G gi|1751120 671 VIFAGTSGCGVG 682 TIGR00297: domain 1 of 1, from 766 to 785: score -0.7, E = 92 *->iflwlsavllcvfllllaly<-* +++ +llc f+ +++++ gi|1751120 766 VYPCTALLLLCLFATIITYI 785 TIGR00886: domain 1 of 1, from 927 to 937: score -1.3, E = 70 *->RnLffSwfgFf<-* +L ++w++F+ gi|1751120 927 LILLITWIYFL 937 TIGR00936: domain 1 of 1, from 1009 to 1030: score -0.7, E = 32 *->GieideLTeeQkkYLesweeGt<-* G+++ L +YL +w++G gi|1751120 1009 GVQLGALVTTHFLYLAMWACGA 1030 // SMART hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/iprscan/data/smart.HMMs Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR_TYP Leucine-rich repeats, typical (most populate 57.4 3.2e-13 4 LRRCT Leucine rich repeat C-terminal domain 35.2 1.5e-06 1 IG Immunoglobulin 26.4 0.00066 1 GPS G-protein-coupled receptor proteolytic site 21.0 0.0035 1 HormR Domain present in hormone receptors 11.0 0.014 1 LRR_SD22 Leucine-rich repeat, SDS22-like subfamily 10.6 6.8 2 LRR_PS Leucine-rich repeat, plant-specific subfamil 7.2 23 3 LRR_RI Leucine rich repeat, ribonuclease inhibitor 6.0 6.3 1 LRR_CC Leucine-rich repeat - CC (cysteine-containin 1.8 72 1 LRR_BAC Leucine-rich repeats, bacterial type 0.6 88 2 IGc2 Immunoglobulin C-2 Type -1.6 1.4 1 IGv Immunoglobulin V-Type -20.2 50 1 ZU5 Domain present in ZO-1 and Unc5-like netrin -52.5 98 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRR_TYP 1/4 75 99 .. 1 24 [] 5.9 13 LRR_SD22 1/2 100 121 .. 1 22 [] 2.3 72 LRR_PS 1/3 100 122 .. 1 24 [] 1.5 1.3e+02 LRR_TYP 2/4 100 123 .. 1 24 [] 21.2 0.024 LRR_BAC 1/2 124 143 .. 1 20 [] 1.1 75 LRR_PS 2/3 124 146 .. 1 24 [] 6.0 34 LRR_TYP 3/4 124 147 .. 1 24 [] 27.8 0.00025 LRR_RI 1/1 124 150 .. 1 28 [] 6.0 6.3 LRR_SD22 2/2 124 150 .. 1 22 [] 8.4 13 LRR_CC 1/1 124 151 .. 1 26 [] 1.8 72 LRR_PS 3/3 148 170 .. 1 24 [] 5.8 36 LRR_TYP 4/4 148 171 .. 1 24 [] 14.0 1.4 LRR_BAC 2/2 148 175 .. 1 20 [] 0.5 90 LRRCT 1/1 183 233 .. 1 55 [] 35.2 1.5e-06 IGv 1/1 256 323 .. 1 76 [] -20.2 50 IGc2 1/1 252 328 .. 1 56 [] -1.6 1.4 IG 1/1 246 339 .. 1 58 [] 26.4 0.00066 HormR 1/1 342 419 .. 1 89 [] 11.0 0.014 ZU5 1/1 606 717 .. 1 115 [] -52.5 98 GPS 1/1 699 751 .. 1 51 [] 21.0 0.0035 Alignments of top-scoring domains: LRR_TYP: domain 1 of 4, from 75 to 99: score 5.9, E = 13 *->LpnL.reLdLsnNqLtsLPpgaFqg<-* Lpn + +L LsnN++t L +g F g gi|1751120 75 LPNGtVTLLLSNNKITGLRNGSFLG 99 LRR_SD22: domain 1 of 2, from 100 to 121: score 2.3, E = 72 *->LtnLeeLdLsqNkIkkiENLde<-* L+ Le LdL++N I+ ++ gi|1751120 100 LSLLEKLDLRNNIISTVQPGAF 121 LRR_PS: domain 1 of 3, from 100 to 122: score 1.5, E = 1.3e+02 *->LtsLqvLdLsnNnLsGeIPsslgn<-* L+ L+ LdL nN +s ++ + + gi|1751120 100 LSLLEKLDLRNNIIS-TVQPGAFL 122 LRR_TYP: domain 2 of 4, from 100 to 123: score 21.2, E = 0.024 *->LpnLreLdLsnNqLtsLPpgaFqg<-* L+ L++LdL nN ++++ pgaF g gi|1751120 100 LSLLEKLDLRNNIISTVQPGAFLG 123 LRR_BAC: domain 1 of 2, from 124 to 143: score 1.1, E = 75 *->PpsLkeLnvsnNrLteLPeL<-* +Lk+L+ snNr+ +L gi|1751120 124 LGELKRLDLSNNRIGCLTSE 143 LRR_PS: domain 2 of 3, from 124 to 146: score 6.0, E = 34 *->LtsLqvLdLsnNnLsGeIPsslgn<-* L L++LdLsnN++ + s+ ++ gi|1751120 124 LGELKRLDLSNNRIG-CLTSETFQ 146 LRR_TYP: domain 3 of 4, from 124 to 147: score 27.8, E = 0.00025 *->LpnLreLdLsnNqLtsLPpgaFqg<-* L++L++LdLsnN++ L +++Fqg gi|1751120 124 LGELKRLDLSNNRIGCLTSETFQG 147 LRR_RI: domain 1 of 1, from 124 to 150: score 6.0, E = 6.3 *->npsLreLdLsnNklgdeGaraLaeaLks<-* + L++LdLsnN +g+ + ++L gi|1751120 124 LGELKRLDLSNNRIGCLTSETF-QGLPR 150 LRR_SD22: domain 2 of 2, from 124 to 150: score 8.4, E = 13 *->LtnLeeLdLsqNkI.....kkiENLde<-* L +L+ LdLs+N+I+ +++ +++L gi|1751120 124 LGELKRLDLSNNRIgcltsETFQGLPR 150 LRR_CC: domain 1 of 1, from 124 to 151: score 1.8, E = 72 *->CpnLkeLdLsgCs..niTDeglqaLakg<-* Lk+LdLs+ + +++T e++q L ++ gi|1751120 124 LGELKRLDLSNNRigCLTSETFQGLPRL 151 LRR_PS: domain 3 of 3, from 148 to 170: score 5.8, E = 36 *->LtsLqvLdLsnNnLsGeIPsslgn<-* L++L +L++s+N +s ++ + ++ gi|1751120 148 LPRLLRLNISGNIFS-SLQPGVFD 170 LRR_TYP: domain 4 of 4, from 148 to 171: score 14.0, E = 1.4 *->LpnLreLdLsnNqLtsLPpgaFqg<-* Lp L +L+ s+N ++sL pg+F+ gi|1751120 148 LPRLLRLNISGNIFSSLQPGVFDE 171 LRR_BAC: domain 2 of 2, from 148 to 175: score 0.5, E = 90 *->PpsLkeLnvsnNrLt........eLPeL<-* p+L +Ln+s+N ++ +++ +eLP+L gi|1751120 148 LPRLLRLNISGNIFSslqpgvfdELPAL 175 LRRCT: domain 1 of 1, from 183 to 233: score 35.2, E = 1.5e-06 *->NPfnCDCeLrwLlrwleaqnnealqdpvsslrCasPeslrGqpllll + CDC+LrwLl+w ++ + lq ++ +++Ca+P++l+ + l + gi|1751120 183 EFLTCDCHLRWLLPWAQN---RSLQLSE-HTLCAYPSALHA-QALGS 224 lp.sefsCp<-* l ++ C+ gi|1751120 225 LQeAQLCCE 233 IGv: domain 1 of 1, from 256 to 323: score -20.2, E = 50 *->svtLsCkaSgftf..yvsWvrqpPsstgkgkglewlgy.sykgRvti + +C+aS +++ + W++ + +g+ ++ +l+++ + + i gi|1751120 256 RLPFQCSASYLGNdtRIRWYHNRAPVEGDEQAGILLAEsLIHDCTFI 302 skdtltraskndvsLtisnLrpeDtgtYyCar<-* + Lt+s + +g C + gi|1751120 303 TSE-----------LTLSHIGVWASGEWECTV 323 IGc2: domain 1 of 1, from 252 to 328: score -1.6, E = 1.4 *->leGqsVtLtCpasgdpvpn.ItWlk.gkplp................ +G+ + ++C as + ++I+W+++ p++++++ + ++ ++ gi|1751120 252 FQGDRLPFQCSASYLGNDTrIRWYHnRAPVEgdeqagillaeslihd 298 ...sgstLtiknvsleDsGlYtCvArNsvg<-* + s+Lt++++ + sG+ +C+++ + g gi|1751120 299 ctfITSELTLSHIGVWASGEWECTVSMAQG 328 IG: domain 1 of 1, from 246 to 339: score 26.4, E = 0.00066 *->pp.vtvkeGesvtLtC.as......titWy................. + +++v G+++ ++C+as +++++i+Wy+++ + ++ + + gi|1751120 246 SLrQVVFQGDRLPFQCsASylgndtRIRWYhnrapvegdeqagilla 292 .............LtI.nvtpeDsesgGtYtCaat..sgsasseqgttLt + ++ + ++ Lt++++ + s G+ C+++ g+as+ +++ gi|1751120 293 eslihdctfitseLTLsHIGVWAS---GEWECTVSmaQGNASK--KVEIV 337 Vl<-* Vl gi|1751120 338 VL 339 HormR: domain 1 of 1, from 342 to 419: score 11.0, E = 0.014 *->tdlgCpat....wDgiiCWpsTpaGqlvavpC......Peyfsgfsn + ++Cpa++ +++g Wp+T aG +++++C + + + + g + gi|1751120 342 SASYCPAErvanNRGDFRWPRTLAGITAYQSClqypftSVPLGGGAP 388 ktggNrtqGnasRnCtedtGvGgWsepfpgaeSnrtWpnysnCtsndyee t+ asR+C+ G+W++ +ys+C + + + gi|1751120 389 GTR-------ASRRCDRA---GRWEP-----------GDYSHCLYTNDIT 417 lk<-* + gi|1751120 418 RV 419 ZU5: domain 1 of 1, from 606 to 717: score -52.5, E = 98 *->psflvs.....gtvdsrGGrLrgprHsGvrliiPpg.AipqGtryec +++l+s + ++ ++ s+ ++L p Pp+ + ++ gi|1751120 606 SVALASiqlppSLFSSLPAALAPPV--------PPDcTLQLLVFRNG 644 YlvvhkklstpPPLDkee...gEtLLSPvvecGPCDVSmSAhGalllrPV +l ++ ++ +P +++++ ++++Pv+ G S +G+ l+ PV gi|1751120 645 RLFHSHSNTSRPGA--AGpgkRRGVATPVIFAGT---SGCGVGN-LTEPV 688 iLevpHC...Aslrprdkwelvllrsengg<-* ++++H ++A++ + w+ + ++g gi|1751120 689 AVSLRHWaegAEPVAAW-WSQEGPGEAGGW 717 GPS: domain 1 of 1, from 699 to 751: score 21.0, E = 0.0035 *->snpiCvfWdese...tgeWstrGCelvetnkpthttCsCnHLTsFAv +p ++W++ +++ g+W+++GC+l + + + C+HL++ Av gi|1751120 699 AEPVAAWWSQEGpgeAGGWTSEGCQLRSSQP-NVSALHCQHLGNVAV 744 Lmdvspn<-* Lm +s+ gi|1751120 745 LMELSAF 751 // COG hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG1380 -67.5 38 1 COG2832 -93.0 65 1 COG2245 -95.0 20 1 COG1280 -112.4 73 1 COG0730 -128.5 98 1 COG2860 -159.7 43 1 COG0472 -174.3 22 1 COG0531 -178.0 82 1 COG0628 -182.1 49 1 COG3302 -197.0 65 1 COG0038 -210.4 73 1 COG1575 -210.9 46 1 COG0609 -227.0 64 1 COG0591 -251.8 96 1 COG2899 -267.6 84 1 COG0534 -270.5 86 1 COG0651 -296.4 82 1 COG2814 -307.1 84 1 COG1178 -336.4 78 1 COG2995 -336.7 12 1 COG2233 -358.4 72 1 COG3135 -380.4 8.9 1 COG3333 -415.0 61 1 COG1009 -435.7 96 1 COG1138 -442.4 92 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG2814 1/1 589 860 .. 1 407 [] -307.1 84 COG2860 1/1 720 868 .. 1 225 [] -159.7 43 COG1380 1/1 758 870 .. 1 132 [] -67.5 38 COG3135 1/1 549 945 .. 1 560 [] -380.4 8.9 COG0609 1/1 768 947 .. 1 346 [] -227.0 64 COG1138 1/1 462 950 .. 1 706 [] -442.4 92 COG2832 1/1 799 954 .. 1 151 [] -93.0 65 COG0534 1/1 594 954 .. 1 482 [] -270.5 86 COG2245 1/1 743 957 .. 1 197 [] -95.0 20 COG3302 1/1 758 959 .. 1 329 [] -197.0 65 COG0591 1/1 371 966 .. 1 529 [] -251.8 96 COG0730 1/1 733 967 .. 1 287 [] -128.5 98 COG1575 1/1 726 991 .. 1 334 [] -210.9 46 COG1280 1/1 770 991 .. 1 214 [] -112.4 73 COG0651 1/1 657 1063 .. 1 534 [] -296.4 82 COG1178 1/1 620 1070 .. 1 615 [] -336.4 78 COG2899 1/1 820 1071 .. 1 371 [] -267.6 84 COG0472 1/1 744 1071 .. 1 432 [] -174.3 22 COG0038 1/1 738 1079 .. 1 486 [] -210.4 73 COG2233 1/1 732 1080 .. 1 509 [] -358.4 72 COG2995 1/1 646 1087 .. 1 525 [] -336.7 12 COG0628 1/1 738 1098 .. 1 381 [] -182.1 49 COG3333 1/1 732 1163 .. 1 519 [] -415.0 61 COG0531 1/1 660 1175 .. 1 537 [] -178.0 82 COG1009 1/1 738 1331 .] 1 701 [] -435.7 96 Alignments of top-scoring domains: COG2814: domain 1 of 1, from 589 to 860: score -307.1, E = 84 *->masstkaqkskklallallaLAlaaFaiGttEFvpvGLLpdiAadlg ++ +++ +++ +++ gi|1751120 589 CTTGRPNVSLS---------------------------------SFH 602 v..siaaAGqLi..slYAlgvalgaPlLtlltarleRKrlLLgllalFiv +++s+a+A +++sl++ a+ aP+ + + l++F gi|1751120 603 IknSVALASIQLppSLFSSLPAALAPPVPPDCT---------LQLLVFRN 643 sNllsAlApnfalLllaRilsAlahGvFwviaaalAariVppekrgrAla + l + +R ++A p +rg A gi|1751120 644 GRLFHSHSNT------SRPGAAG-----------------PGKRRGVATP 670 liftGltlAlvlGv..PLGtflGqlfGWRaaFlaiallallvllllarlL +if+G++ v +++P + l ++ +a v+++ + gi|1751120 671 VIFAGTSGCGVGNLtePVAVSLRHWA----------EGAEPVAAWWSQEG 710 PsipgsaraekpgslrsqlrllrrprvllillatilwigGhFtlYTYiaP P +g + +s + qlr ++p+v ++ + G + gi|1751120 711 PGEAG-----GWTSEGCQLRS-SQPNVSALH----CQHLGNVA------- 743 fLqevaGfsgatvtllLLvfGlagivGnllgGrladrlGprrtllgvlll +L+e++ pr + ++ +l gi|1751120 744 VLMELS------------------------------AF-PREVGGAGAGL 762 lalvlllLplaagslilalaalflWGlaafsagpalQtrvarlapRdaad + +v+ + +l+ +l+a+++ l ++r++ ++ gi|1751120 763 HPVVYPCTALLLL------------CLFATIITYILNHSSIRVSR-KGWH 799 vaaslnvaAfNlGialGAalGGlvldrglGlaalp.wvGAaLvllAllla + l + +f+++++ ++GG+ l++ ++++ ++vG L l ++ gi|1751120 800 M---LLNLCFHIAMTSAVFAGGITLTN---YQMVCqAVGITLHYSSLSTL 843 llaarskkktaaepsll<-* l+ +++ +e + gi|1751120 844 LWMGVKARVLHKELTWR 860 COG2860: domain 1 of 1, from 720 to 868: score -159.7, E = 43 *->rndtllttetlamlLsv..LyiIGitAFAmsGALlAgrekMDlFGVi + l + + ++L+ ++L ++++ + L+A ++ gi|1751120 720 EGCQLRSSQPNVSALHCqhLGNVAVLME-----LSAFPRE------- 754 vLasVTAiGG.GaiRDvLLGhyPlgALWvkhpeYliialvaavltvvvar +GG Ga G P+ ++ l++ l+a +t ++ gi|1751120 755 -------VGGaGA------GLHPVV---YPCTALLLLCLFATIITYILNH 788 r.vrlmrl.lrklflvlDAiGLavFaiiGAqvAldmghsplIvvvaGviT + r+ r++++ l++++ +A+ + +v+aG iT gi|1751120 789 SsIRVSRKgWHMLLNLC------------FHIAM------TSAVFAGGIT 820 GVGGGIlRDlLarriPLVLrkRDELYAtaAiagglvylllaeslglsatv L+ Y ++ a g+++ gi|1751120 821 ------------------LTN----YQMVCQAVGITLHYS---------- 838 AllvtlfvglflRLlAiRfhWkLPtFrleeets<-* +l +l g+ +R+l W+ P + e++ gi|1751120 839 SLSTLLWMGVKARVLHKELTWRAPP---PQEGD 868 COG1380: domain 1 of 1, from 758 to 870: score -67.5, E = 38 *->ltlmlkkvlrivrqlaiLfgflllGewiasllglpvPGSIvGmvlLf + l v+ +++l++L +f+ ++ i + + v+ + m+l gi|1751120 758 AGAGLHPVVYPCTALLLLCLFATIITYILNHSSIRVSRKGWHMLL-- 802 lLLalgvvkleWveqgAslLlseMlLLFVPagVGVinYldllraeglpIl L++ + ++ v +g+ L+ nY+ + +a g+ + gi|1751120 803 -NLCFHIAMTSAVFAGGITLT---------------NYQMVCQAVGITLH 836 vvaivSTlivllvtGwivqllrklkkkkhrresel<-* + ++STl+ + v + + ++ + +++ ++++ + gi|1751120 837 YS-SLSTLLWMGVKARVLHKELTWRAPPPQEGDPA 870 COG3135: domain 1 of 1, from 549 to 945: score -380.4, E = 8.9 *->mmvDGAFvRAEEAvSLNRLPSPiSK...NPAPAAHSDLRL..StCRD + AF R E v R SP +++ P P A LR + t R gi|1751120 549 -LTCTAFQRREGGVPGTRPGSPGQNpppEPEPPADQQLRFrcTTGRP 594 ALElAifynvasvplfslliirPsialrtgraSasrevLlParerifla. + l f+ sv l s + Ps + a a+ P + l gi|1751120 595 NVSLSSFHIKNSVALASIQLP-PSLFSSLPAALAPP--VPPDCTLQLLVf 641 rsdkvRvsFSlkRScsRqagklslVRlGCPGmLRdGeartlleittrepr r ++ s S + sR + + gi|1751120 642 RNGRLFHSHSNT---SRPGAAGP-------------------------GK 663 mRgFaddtpPsaLlAGfvAvlvGGGRvPAPYassavliGLSDwqAaGFqa +Rg a + + AG + vG + +++ L w + gi|1751120 664 RRGVA----TPVIFAGTSGCGVG------NLTEPVAVS-LRHWAEG---- 698 AGaspAQisSWLwalgvvmGvvgitLsLrYRvPivtAWSTPGaALLvaSl a p ++ + g + G ++ + Lr +S gi|1751120 699 --AEPVAAWWSQEGPGEAGGWTSEGCQLR------------------SSQ 728 ggltlaEaiGAFlVSaALitLcGvtapftRlmrkiPlsLAaALLAGiLLP + VSa+ +G a + l ++ P + A ++ L P gi|1751120 729 PN-----------VSALHCQHLGNVAVLMEL-SAFPREVGGA--GAGLHP 764 FglkafaAleqqpaLvllmllsYLLakqfaPRYAVllaLlVGVvvaallG + Y ++Ll + + +++ gi|1751120 765 VV------------------------------YPCTALLLLCLFATIITY 784 qlhladlalkSLlavPewvtPeFSlaAlLslALPLfLvAmtSQnlPGiAv l ++ + + lL+l +f +AmtS +G gi|1751120 785 ILNHSSIRVS-----------RKGWHMLLNL---CFHIAMTSAVFAGGIT 820 LkAaGYRqaPpsPLlvaTGlASllaAPfGshsvnLAAIsAAICmgPdaHp L ++ +++ l l +ll ++ + H gi|1751120 821 LTNYQMVCQAVGITLHYSSLSTLLW---------MG------VKARVLH- 854 DkarRYtA........alwAGi....FYlLaGlFAaallgLfaaLPapll k ++A++++++++al + +FYl+aG gi|1751120 855 -KELTWRApppqegdpALPTPSpmlrFYLIAG------------------ 885 amLAGLALLgsimgsLqnAlkdereReGRGAAqPGSLAAlvaFaVTaSGf G+ L i + +A + +R+ +V + gi|1751120 886 ----GIPL---IICGITAAVNIHNYRD----------HSPYCWLVWRPSL 918 tLlgIGaAfWGLvaGlLaylvLnqiarRerg<-* + I A++ L++ + y+ ++ R rg gi|1751120 919 GAFYIPVALI-LLIT-WIYFLCAG--LRLRG 945 COG0609: domain 1 of 1, from 768 to 947: score -227.0, E = 64 *->llllilllllllllsilslslGaisistfLfpsdvlqalfg..eqii + ++lll l++ ++ ++l+ +i++s + + +l++ + i gi|1751120 768 PCTALLLLCLFATIITYILNHSSIRVS-----RKGWHMLLNlcFHIA 809 ldlRLPRilaallvGAsLAvaGailQgltRNPLAdPsiLGissGAalgav ++ gi|1751120 810 MT------------------------------------------------ 811 laillfpgasssslkrSvilnpyllplaAfiGaliaallvyllarrsggg + ++a++++l +y ++ + gi|1751120 812 ----------------------------SAVFAGGITLTNYQMVC-QA-- 830 nslspirLiLaGvalsalfsAltslllylad.lqqvlfWll.....GSls +G+ l + + + ++ ++a l++ l+W ++++++G+ + gi|1751120 831 ----------VGITLHYSSLSTLLWMGVKARvLHKELTWRApppqeGDPA 870 ganWsdvllllpivllglilllllarkLnlLsLGddlAksLGvnverlrl + s l +i+ ++ ++++ G+ gi|1751120 871 LPTPSPMLRFYLIAGGIPLIIC-------------------GI------- 894 lllllvvlLtgaaVavaGpIgFvGLiaPHiaRllvGpdhrylLPlSaLlG + + ++r P++ L gi|1751120 895 ------------------------------TAAVNIHNYRDHSPYCWL-- 912 alLLllADilaRtilaPaelPvGivTalilliGavaPyFlYLLrrrrkis + R l + +Pv ++ + yFl +r gi|1751120 913 ---------VWRPSLGAFYIPVALILLIT------WIYFLCAGLRL---- 943 letrgrg<-* rg + gi|1751120 944 ---RGPL 947 COG1138: domain 1 of 1, from 462 to 950: score -442.4, E = 92 *->Mlp.elGn.....flLllAlllalllsll...plvGasrrdaklman M+ ++lG+ ++ + l ++ a l l+++++l+ a+r d++ + gi|1751120 462 MIQkFLGYvdqikELVEVMVDMASNLMLVdehLLWLAQREDKACSR- 507 niCLKFFPWiPCtsqvmlArpaayaqflllalAfllLvyaFltdDFSVqY ++++ + +l+ A v+a ++ gi|1751120 508 ----------------IVGALERIGGAALSPHAQHISVNA--------RN 533 VAaNSnseLPlyYKisAvWggHEGSLLLWvllLslwtflvalfsrklp.. VA ++ +P Y l ++ + + p++ gi|1751120 534 VALEAYLIKPHSY-------------------VGLTCTAFQRREGGVPgt 564 ..................dtlkarvLAVmgLivlgFllfilftSNPFtrl ++++++++++++++++ ++ l++r +++ ++ + S F+ gi|1751120 565 rpgspgqnpppepeppadQQLRFR--------CTTGRPNVSLSS--FHIK 604 lPavPieGrdLNPlLQdvgLIfHPPLLYlGYVGfaVaFafa......vAs v+ L + L PP L+ + ++++a a++ ++++ + gi|1751120 605 N-SVA---------LASIQL---PPSLF---S--SLPAALAppvppdCTL 636 lLgGrldsawaRwsRpWtlvaWlFLTlGIvLGSWWAYyELGWGGwWFWDP L R +R gi|1751120 637 QLL------VFRNGR----------------------------------- 645 VENASLmPWLagtALlHSlavTeKRgsFkhWTvLLAilAFsLvLLGTFLV L+HS + T + g + gi|1751120 646 --------------LFHSHSNTSRPGAAGPG------------------K 663 RSGvlvSVHAFAsdpvRGvflLaflvlliggSLlLfalRapqvrEktisr R Gv + V FA+ + Gv + l +++ SL a++a v gi|1751120 664 RRGVATPV-IFAGTSGCGV---GNLTEPVAVSLRHWAEGAEPVA------ 703 lfdawsrvn...fellSrEgllLlnNillvvallvVllGTLyPlvvkllg +ws+ ++++ ++ +Eg+ L + v+al lG + ++ +l++ gi|1751120 704 --AWWSQEGpgeAGGWTSEGCQLRSSQPNVSALHCQHLGNVA-VLMELSA 750 lgsQPPSisvGePFFnklFipLmlpllallGvGplvRWkraqpstlatLi + +vG + a +G+ p+v + +l L+ gi|1751120 751 FPR-----EVG----G-----------AGAGLHPVVY----PCTAL--LL 774 mLlviliaslVlaallifllgdtvnlasaLglaaaawvlLlllaellwrt ++l + i++++l + s + w +Ll+l + ++t gi|1751120 775 LCLFATIITYIL-------------NHSSIRVSRKGWHMLLNLCFHIAMT 811 rgrvflvGkrkltrSqwgmiLaHlGvaliaiGiam..SglyssNQGYSEE + v++ g +L ++ a+Gi+++ S+l+ + gi|1751120 812 SA----VFA-------GGITLTNYQMVCQAVGITLhySSLSTL------- 843 rdVrLaGEVSVpGqtvtlggytftfqqlkeeqgpNytsevAivdltregk L +G+ + + +++t ++ ++g + gi|1751120 844 ----LW-----MGVKARVLHKELTWRAPPPQEG-------------DP-- 869 NmmDEpiatLiPEkRfYvvegqsDrmteaaIdggllrDlYvalgeslrgn + t P RfY ++g + I+ g++ +a+ + gi|1751120 870 -----ALPTPSPMLRFYLIAGG-----IPLIICGIT----AAVNIHNYRD 905 gswAvrlyykgPlvrWIWaGGEGlLMalGglLavtddrryrveveakkr< s l+ + P ++ + ++L +l + + ++ +r+++ + gi|1751120 906 HSPYCWLVWR-PSLGAFYIP--VALILLITWIYFLC-AGLRLRGPLAQN 950 -* gi|1751120 - - COG2832: domain 1 of 1, from 799 to 954: score -93.0, E = 65 *->mWTLDLCARRGlAyagirtpatqmvRylli..avGf......lsvvL L LC + a +++y + +avG++ + ++ls++L gi|1751120 799 HMLLNLCFHIAMTSAVFA-GGITLTNYQMVcqAVGItlhyssLSTLL 844 GiLGifLPvLPTTPFiLLAAACFarsSPRFyaWLlsh...........pv +G+ vL + L + ++++++++ p+ gi|1751120 845 -WMGVKARVL--------------------HKELTWRapppqegdpalPT 873 fGpmvrdwqqggAvPrkaKvrAillivlsfai........SlwlvpvlWl pm+r+++ g++P+ i i++++ i++ +++++ +wlv ++ l gi|1751120 874 PSPMLRFYLIAGGIPLI-----ICGITAAVNIhnyrdhspYCWLVWRPSL 918 qvalllValvlvliym........wRvPtkeevAekqek<-* + Val l+++++ + ++R P +++ gi|1751120 919 GAFYIPVALILLITWIyflcaglrLRGPLAQ---NPKAG 954 COG0534: domain 1 of 1, from 594 to 954: score -270.5, E = 86 *->gsirkaLlkralrLAiPiilanllqtlyglvDtfmvGhlgGadaLA. + ++ i +++++++l +++ + + ++aLA+ gi|1751120 594 PNVSLSSFH---------IKNSVALASIQLPPSLFSS--L-PAALAp 628 ..AVglaspilflliaig.mglgtGtsvlvaQaiGAgdregarraarqgl + + ++l + + + + + +a G g+r+g++ + + gi|1751120 629 pvPPDCTLQLLVFRNGRLfHSHSNTSRPGAA---GPGKRRGVA----TPV 671 vllalllslllgllglfllellLrllg............aseevlalAae ++ a ++ +g l+ + + l ++ + ++ +++ ++e ++++e gi|1751120 672 IF-AGTSGCGVGNLTEPVAVSLRHWAEgaepvaawwsqeGPGEAGGWTSE 720 YLrililglpfallsfvlrgalrgaGdtrtpmvvsiignllNivLdyiLI + lr+ + ++ +gn++ +L gi|1751120 721 ------------------GCQLRSSQPNVSALHCQHLGNVA------VLM 746 fGkg.vlFGfpglGivGAAiATviarwigflvllfyllrgkkgllislfd + + G++GA++ +++ +++l+ll ++ + + l +++ gi|1751120 747 ELSAfP----REVGGAGAGLHP-VVYPCTALLLLCLFATIITYILNHSS- 790 rvlkllkpdrkvlkkllrlGflPialeeilfsvgsfllntlfvarfGtda ++ +rk + ll+l + ia + +++ g+++l+ + + gi|1751120 791 -----IRVSRKGWHMLLNLC-FHIAMTS-AVFAGGITLTNYQMVCQ---- 829 vAAyqialriaslifmppfGisiAvttlvGqnlGAgnyerarraarlglk A +i+l +sl+ + ++G+ ar ++ gi|1751120 830 --AVGITLHYSSLSTLLWMGV------------------KARVLHKE--- 856 lslliglilalllllfpepiasllftsdpevialaaqlllilaisqpfdl l++ +p+p + dp+ + + l+++ ++ gi|1751120 857 -----------LTWRAPPPQEG-----DPA----LPTPSPMLRFYLIAG- 885 FvgiqvvlsGvlrGaGdtkvpliislisywgirlPlayllafftgfgknf gi ++ G+ + +++ +++ + P ++l ++ gi|1751120 886 --GIPLIICGITAA---VNIHNYRDHS-------PYCWLVWRP------- 916 wllltglglaGvWlgliignlvaaiigvvlallwrllrrwrkgteaaaek + G + + ++ +++ ++ + + + + + +g a+ +k gi|1751120 917 --------SLGAFYIPVAL-ILLITW-----IYFLCAGLRLRGPLAQNPK 952 aa<-* a gi|1751120 953 AG 954 COG2245: domain 1 of 1, from 743 to 957: score -95.0, E = 20 *->vikMeLsnAKl.lgGiGaiLqLvGaavgllGiLiwiLSivGlVLvLi + MeLs + + gG Ga G ++++ + ++ l L L gi|1751120 743 AVLMELSAFPReVGGAGA-----GLHPVVY----PCTAL--LLLCLF 778 alymISkqvgddrIFnyyLigfismvaglilfvivifAtvGvsivallkk a +I + + ++I + +g m++ l ++ + A v++ + l+ gi|1751120 779 A-TIITYILNHSSI-RVSRKG-WHMLLNLCFHIAMTSA-VFAGGITLTNY 824 smmslahglaalgsiLaGfViLyIayiigtyFqk....Ks.......... m+ a g+++ s L +L ++ + + k+ + ++++++++++ gi|1751120 825 QMVCQAVGITLHYSSLS---TLLWMGVKARVLHKeltwRApppqegdpal 871 ........yEliAqyTgVdmFrtAGLl..........Yfi.......Gai +++++ ++ liA+ + +++ + ++ +++++Y + +++ Ga gi|1751120 872 ptpspmlrFYLIAGGIPLIICGITAAVnihnyrdhspYCWlvwrpslGAF 921 LLIVlGvGflilliAaILeiVaFFSLPdEikaseqaaVsp<-* + V+ lilli +I + a L+ +++ +a s gi|1751120 922 YIPVA----LILLITWIYFLCAGLRLRGPLAQNPKAGNSR 957 COG3302: domain 1 of 1, from 758 to 959: score -197.0, E = 65 *->MgpGwhEwPLvfFTVlaQcvvGafIVtyfillltkngkgqGLPrqrv g G+h+ V ++ ++ ++ i +n + rv gi|1751120 758 AGAGLHPV------VYPCTALLLLCLFATIITYILNHSS-----IRV 793 lksMfilliLmGiGFIASimHLGsPlRAFnSLnRvGaSaLSRNEIAsGaI + + +l+ + +H a+ ++++G I gi|1751120 794 SRKGWHMLLNL-------CFHI----------------AMT-SAVFAGGI 819 FFtlgGliWLvAilgkmspaLgklWlIvTmiLGilFvwmmamVYasIdtV t + + +a g + l + + l + + a+V + gi|1751120 820 TLTN---------YQMVCQAVGIT-LHYSSLSTLLWMGVKARVL-HKE-- 856 PtWnngiTvlsFfLtvllgGSSAFpiLsYalLlpakvteyfltfLpvlsv +tW G A p+ s + l+f + + gi|1751120 857 LTWRAPPPQ---------EGDPALPTPSPM-----------LRFYLIAGG 886 LglviaaiVRQFYIKRPtvlqGlnLahI..hSsRVqqasaLVPnYatlev + l+i++i + n ++ ++hS Y+ l v gi|1751120 887 IPLIICGIT-----------AAVNIHNYrdHSP-----------YCWL-V 913 wRvvLLqvlPTFIPLAYLftwiiPllllrepYVTDVVnvpLLvlsylLiL wR L + ++ p V++ LL + + L gi|1751120 914 WRPSLGA--------------------FYIP-----VALILL-ITWIYFL 937 aGEaIGRWCFFVEGNHVQvLFYGLHMtvGmaiag<-* + + R G Q+ + G a+ gi|1751120 938 CAGLRLR------GPLAQN------PKAGNSRAS 959 COG0591: domain 1 of 1, from 371 to 966: score -251.8, E = 96 *->nliillifliYlvlvllIGwyayrrtksgseleDYylaGRslgpfvv + f + ++ + G+ a+rr + aGR gi|1751120 371 QSCLQYPFTSVPLGGGAPGTRASRRCDR---------AGR------- 401 alslgatymSaatflGlpGaaYasGlgglwyaigyligalilallvaprl ++S Y++ + ++y + + + +al +a l gi|1751120 402 ---WEPGDYS--------HCLYTNDITRVLYTFVLMPINASNALTLAHQL 440 rklsekknnalTipDflaaRfgsk..skllRilsalviivfflpYlaaQl r+++ + a ++ D++ + + k+l ++ + +v ++ +a++l gi|1751120 441 RVYT-AE--AASFSDMMDVVYVAQmiQKFLGYVDQIKELVEVMVDMASNL 487 vGgglllevlf..idyetgivigalivilYtflG................ + + l l+++ d + ++gal i +l+++ ++ + + ++ + gi|1751120 488 MLVDEHLLWLAqrEDKACSRIVGALERIGGAALSphaqhisvnarnvale 537 .................................................. ++++ + + + ++++++ +++++++++++++++++++ +++ + gi|1751120 538 aylikphsyvgltctafqrreggvpgtrpgspgqnpppepeppadqqlrf 587 ..............HGmkavawTDviQgilmlvalivlpvvvllkgGfge + ++++++ + ++ H i+ + l+ + ++p gi|1751120 588 rcttgrpnvslssfH----------IKNSVALASIQLPPSLFSSL----- 622 afealaaalpl...............................alggsgmv + ++ + p+ + + ++++ ++++++++++ +++++ + + gi|1751120 623 PAALAPPVPPDctlqllvfrngrlfhshsntsrpgaagpgkrRGVAT--- 669 iaiisiilllawgfGvlglPhilvRffaaksakelrlkailiggfwgily ++i+ + + g+G l+ P + + a a+ ++ ++++ + + gi|1751120 670 -PVIF-AGTSGCGVGNLTEPVAVSLRHWAEGAEPVA---AWWSQEGPGEA 714 llaaaliGlaaialvgdgpaadmavglvpadnaqnnWsemvfpalivall + + + G ++ ++++a l + v ++ ++++ gi|1751120 715 GGW-TSEGCQLRSSQPNVSA------LHCQHL------GNVAVLMELSAF 751 ppeALvwlaGfllaailAAimSTadsqLlvassaftrDiYkrvirkgkna p + G + a+l + ++ ++Ll + a i++++++ gi|1751120 752 PR----EVGG--AGAGLHPVVYPCTALLLLCLFAT---IITYILN----- 787 sekklvrvsRiavlllgvialllAllppgnpvvldlvglafgglgaaflp + ++rvsR l + + +A++ + +++ + + + +a + gi|1751120 788 --HSSIRVSRKGWHMLLNLCFHIAMTSAV--FAGGITLTNYQMVCQAVGI 833 vl....llgLyWkRatk................................w l+ ++l +L W + ++ +++ + + ++++++++ +++++ + + gi|1751120 834 TLhyssLSTLLWMGVKArvlhkeltwrapppqegdpalptpspmlrfylI 883 gAiaGmiaGlltvil................yeivPglfvslvafllgsw + ++ i ++t++++ ++ +++++ +++ ++++ ++++ll+ gi|1751120 884 AGGIPLIICGITAAVnihnyrdhspycwlvwRPSLGAFYIPVALILLI-- 931 kak.A.afivvslltkvvasefelkmpaaapvgtrvqdrfeaaakavpaa + + ++ + l ++ +a+ p ++ +a ++ ++ gi|1751120 932 -TWiYfLCAGLRLRGP----------LAQNP-----KAGNSRASLEAGEE 965 k<-* + gi|1751120 966 L 966 COG0730: domain 1 of 1, from 733 to 967: score -128.5, E = 98 *->mmnlalllllflvGfiaGflaglfGiGGGl..llvPaLlflgglpph +++ l ++++ + +f + G G+Gl++++ P +l+ l gi|1751120 733 ALHCQHLGNVAVLMELSAFPREVGGAGAGLhpVVYPCTALLL-L--- 775 vAvGTsllqvlftalssayahlkkgnvdlklglilglgsvvGaalGalla + ++ ++ l+ + + ++ + l l + +++ ++ G ++ gi|1751120 776 ----CLFATIITYILNHSSIRVSRKGWHMLLNLCFHIAMTSAVFAGGITL 821 klipaeikyeaLklifgllllllalymllskkieikpkreekpnkkslpl ++g+ l +++l +ll + k +++ ++l gi|1751120 822 TNYQMVC------QAVGITLHYSSLSTLLWMGV--------KARVLHKEL 857 pglggfyepleieyavrawlliliGfliGflsGllGiGGGsllvPaLill ++ + + G +l +P +l gi|1751120 858 TWRAPPPQE----------------------------GDPALPTPSPMLR 879 lgyslplkkavgtsllaialsslisfllyl....lggpvdwllalglvlG + + +a g+ l++ ++++++ +y +++++ v w+ lg+ + gi|1751120 880 F-----YLIAGGIPLIICGITAAVNIHNYRdhspYCWLV-WRPSLGAFYI 923 siiGaylGarlasrkligprflrllfalilllvalklllkglgmll<-* ++ ++l +++ + +lr ++a + + l++ +l+ gi|1751120 924 PVALILLITWIYFL--CAGLRLRGPLAQNPKAGNSRASLEAGEELR 967 COG1575: domain 1 of 1, from 726 to 991: score -210.9, E = 46 *->mNQttksemtaasmasarlqiWleaaRPkTLpaAlaPvvvGtaLAyw Q + s + + ++ + e ++++P vG +++++ gi|1751120 726 SSQPNVSALHCQHLGN--VAVLME--------LSAFPREVG-GAGAG 761 lggkfdllva..vLaLltAilLqIlsNlANDYgDavkGtDtedRvGplRl l++++ + a +L+L+++i Il ++ +++ gi|1751120 762 LHPVVYPCTAllLLCLFATIITYIL---------------NHSSIR---- 792 ivsglitpkevllasifsvalavLsGlvLvlvagdnLadlwtllgLGlla +++k ++++++ +a+ s v+ +g +L + ++ gi|1751120 793 -----VSRKGWHMLLNLCFHIAMTSA---VFAGGITLTNYQ------MVC 828 IaagytYTGGpkPlGYlGLGdlsVfifFGpLa.Vl...GsyYlQtAAHLA a g+t l Y +L s ++ +G+ a+Vl+++ ++ + gi|1751120 829 QAVGIT-------LHYSSL---STLLWMGVKArVLhkeLTW--RA----- 861 GAFPvAiPPgriswalllpS..ipvGilataiLlvNNlRDiEeDakaGKk PP+ ++al +pS+ + ++a+ i l+ +G+ gi|1751120 862 ------PPPQEGDPALPTPSpmLRFYLIAGGIPLI----------ICGI- 894 TLaVRLGdkrarqlyaaLlv..vAyaavlgfvlataapwwvfLvfLlslP T aV + +r y+ L+ ++++ a ++ +l + + w fL ++ gi|1751120 895 TAAVNIHNYRDHSPYCWLVWrpSLGAFYIPVALILLITWIYFL----CAG 940 lavrvakqVfqk..........eDqPrelapaladtvklslltnlLfavG l +r +++ q+++ ++++ + e +++el+ + + + ll+ +++ gi|1751120 941 LRLR--GPLAQNpkagnsraslE-AGEELRGSTRLRGSGPLLSD---SGS 984 lllgqLr<-* ll+ + + gi|1751120 985 LLATGSA 991 COG1280: domain 1 of 1, from 770 to 991: score -112.4, E = 73 *->mmdmsfllafligslalslalsPGPdnllvlsn.slkyGfraGllaa a+l+ +l++++ +++l ++s++ ++++ +++ gi|1751120 770 -------TALLLL--CLFATI-----ITYILNHsSIRVSRKG--WHM 800 LGlalGdavhvlLaalGlaaL....lktspllFtlLkllGAaYLlYLGvq L + + ++ ++++G +L++ +++ + ++ +L++ + Ll++Gv+ gi|1751120 801 LLNLCFHIAMTSAVFAGGITLtnyqMV-CQAVGITLHYSSLSTLLWMGVK 849 m........lrskgkkleases..aasplsrswklflrG........... ++ +++ ++r + +++e++ + +++sp r ++l +++ + + + gi|1751120 850 ArvlhkeltWR-APPPQEGDPAlpTPSPMLR---FYLIAggipliicgit 895 lltnllN.....PKaiLFfl.sllpqFidpqaslaslaqllvlgaiivlv +++n+ N ++++P+ +L + +sl++ +i+++ l + +++ + +++ gi|1751120 896 AAVNIHNyrdhsPYCWLVWRpSLGAFYIPVALILL----ITWIYFLCAGL 941 dllwfsllAllgsrlarllrsnprfqrvlnrlaGl..lLigfGvklllsr l + ++ +++ l + r+ rl G+++lL +G +l++ + gi|1751120 942 RLRGPLAQNPKAGNSRASLEA-GEELRGSTRLRGSgpLLSDSGSLLATGS 990 l<-* gi|1751120 991 A 991 COG0651: domain 1 of 1, from 657 to 1063: score -296.4, E = 82 *->Msalvll....lillPLvaafllpillllgllgpfagltliallali + + +++++ +++P+++a + +g ++ ++ +l + gi|1751120 657 --GAAGPgkrrGVATPVIFAGTS----GCGVGNLT-EPVAVSLRHWA 696 vllvllgyvlltglgalvggtlGgwlap...................... ++ +++++++ ++ +g gw + + + ++++++ + + ++ ++ gi|1751120 697 EGAEPVAAWWSQEGPGEAG----GWTSEgcqlrssqpnvsalhcqhlgnv 742 .................fgivlelDplsalfllilalvgllvslYsigym + + +++ ++ + g++ ++ p al ll l + + + + + gi|1751120 743 avlmelsafprevggagAGLHPVVYPCTALLLLCL-FATIITYILNHSSI 791 rrekgrgedsaavyyallllllagmlGafltgDlfnlyVFfEimalaSya r+ + + + l l ++ +m+ a+++g + +l+ y gi|1751120 792 RVSRKG------WHMLLNLCFHIAMTSAVFAGGI----------TLTNYQ 825 LvalggtkealeaglkYlvlggvgsalfLlGigllYaatGTLnfadlaav +v +Gi l Y+ TL ++++a+ gi|1751120 826 MVCQ------------------------AVGITLHYSSLSTLLWMGVKAR 851 lls............glalaaflLlliGfgvKaglFPLHfWLPdAyseAp l ++ + + +++++g ++ + f + ag PL gi|1751120 852 VLHkeltwrapppqeGDPALPTPSPMLRFYLIAGGIPL------------ 889 spiSAllSgvvvKvGvYaliRvllllfgglgideq.elipaslgalllil + +G ++ ++ ++ d+ ++ + +++ l ++ gi|1751120 890 -----------IICG------ITAAVNIHNYRDHSpYC-WLVWRPSLGAF 921 aaltaifGvlmAllqsDiKRLLAYSsvsQmGyIllglgiGtpgidlgplt yI ++l++ + gi|1751120 922 -------------------------------YIPVALILLIT-------- 932 alalaaalfHllnHalfKglLFLgAGaviyrtGtrdldklgGLakrmPll ++ FL+AG +l+G + ++P + gi|1751120 933 ------WIY------------FLCAGL-----------RLRGPLAQNP-K 952 talafliaalslaGiPplnGFvSKwliiqallaaggsapvg.lpllavll + + a+l+ag +l + l gi|1751120 953 AGNS------------------------RASLEAG-----EeLRGSTRLR 973 lvgsaitfayfvrllykffgggflgaPrsetepgakeapllmllpmlil. g +++ + ++ ++ g+++ +++ + p+++l + ++++ gi|1751120 974 GSGPLLSDSGSLLATGSARVGTPG----PPEDGDSLYSPGVQLGALVTTh 1019 allclllGvapepvlkalgapaaallliytvkvsavlaglpdiai<-* +l+ +++ + +v+++ + +++ +y+v sa++ +++ + gi|1751120 1020 FLYLAMWACGALAVSQRWLP-RVVCSCLYGVAASALGLFVFTHHC 1063 COG1178: domain 1 of 1, from 620 to 1070: score -336.4, E = 78 *->vvasly.v.m.ysaillvlavgalatla.....tvklvvmggdplri ++ + ++ l+l v++ ++l++++++t++ + g+ + r gi|1751120 620 SSLPAAlApPvPPDCTLQLLVFRNGRLFhshsnTSRPGAAGPGKRR- 665 glwlliglllaalllllppl.v.lavlalsafsgglaeflavlsdayllr g++ +++++ ++ + l+ ++av+++ +++g + a++s+ ++ gi|1751120 666 GVATPVIFAGTS-GCGVGNLtEpVAVSLRHWAEG-AEPVAAWWSQEGPGE 713 llgnTLllallvTvlslilGlplAyllsrydfPGrrwlrwllaLPiLLvi g+T + r + + aL gi|1751120 714 AGGWT-----------------------SEGCQLRSSQPNVSALH----C 736 PalvvAfgfislfGksGwLarllgelfGlssreywlpdiygPlgGiilal +l + ++ l + + gG+ +l gi|1751120 737 QHLGNVAVLMELS-----------------------AFPRE-VGGAGAGL 762 vlfnyPlvyllaaaalesidpsleEaArsLGasrwqvFrrVtLPllrPai + yP ll++ + +i + L++s ++v r+ gi|1751120 763 HPVVYPCTALLLLCLFATIIT------YILNHSSIRVSRK---------- 796 aagalLvFlyclsdFgavliLGGspqytTlttaIyqeilgs.qldlatAa g + +c+ ++++++ G+ +t+ yq + ++ + l +++ gi|1751120 797 --GWHMLLNLCFHIAMTSAVFAGG-----ITLTNYQMVCQAvGITLHYSS 839 lLallLLllsllllwvvvkllerfsrgrqkvyssgqswarpiprilllgl l ll +++ + +l e+ +r+ + +++ ++ gi|1751120 840 LSTLLWMGVKARVLH-----KELTWRA----PPPQ---EGDPALPTPSPM 877 aalvallfclllfsvlllgvilplsflllWtvlltsdewsdamlgplfst + ++++++ +l+ +g++ +++ +++ s gi|1751120 878 LRFYLIAGGIPLI---ICGITAAVN-------IHNYRDHS---------- 907 sfwhalinsLtlallAalialllaLllaylvrrsrsrlsrfidrLsmLpl ++ + + +l+A++i ++l Ll+ + + gi|1751120 908 ---PYCWLVWRPSLGAFYIPVALILLITWIYFLC---------------- 938 AvPGvVlalGllllfnnldnhwvdlaameGvkpllvlygtllllVlAyal Gl l + + n + ++ + l + l gi|1751120 939 --------AGLRLRGPLAQN--------------PKAGNSRASLEAGEEL 966 ralPfalrsleaalrqidprLeeaArsLGasrwqifrrvtLP........ r + +s p L++++ L ++ rv +P++++++++ gi|1751120 967 RGSTRLRGSG--------PLLSDSGSLLATGSA----RVGTPgppedgds 1004 LllpgllaAaalvFalsmkElsATllLgppdftTLttaiynylsggdgry L pg+ +a++ +T +L+ + ++a+ + r+ gi|1751120 1005 LYSPGVQLGALV----------TTHFLYLAMWACGALAV-------SQRW 1037 aaAavllvasaAlvLvlislllfvllikrygersqgt<-* ++ v++ +l v++s+l ++++ + ++ r++ + gi|1751120 1038 ----LPRVVCSCLYGVAASALGLFVFTHHCARRRDVR 1070 COG2899: domain 1 of 1, from 820 to 1071: score -267.6, E = 84 *->MktfReFrGfSlLvTvvcLalAawdGYaRGGHTllalanALfLlvvL t+ + vc a++ Y+ l L++ v gi|1751120 820 --TLTNYQ-------MVCQAVGITLHYS-------SLSTLLWMGVKA 850 avLEVSLSF........DNAViNAaiLkrMSefWqklFLtiGILIAVFGM vL L + ++++++D A L + S+ + GI gi|1751120 851 RVLHKELTWrapppqegDPA------LPTPSPMLRFYLIAGGI------- 887 RLVFPivIVavtaGLglvevmklALenPAHGALEFADGSesYeklLtkah P++I +ta v + P Y l+ gi|1751120 888 ----PLIICGITA---AVNIHNYRDHSP-------------YCWLVWR-- 915 paIAAFGGmFLLmvFLdFlvdndrdlhWLkPiErpLAriGklDaVqaViv p+ G+F + v L l+ W + L G l + + gi|1751120 916 PSL----GAFYIPVALILLIT------WIYFLCAGLRLRGPLAQNPKAGN 955 avaLllatkLfvapaeqfqdvLiAGLvGllv....YLvvdaidrlFktdg + a l a+ + + ++ G l+++++ L+ + r + gi|1751120 956 SRASLEAGEELRGSTRLRG-------SGPLLsdsgSLLATGSARVGTPGP 998 lGEAtPevqvaqkAgkAGvAaFLYLEVLDAsFSfDGViGAFAiTkDvViI Pe++ D +S gi|1751120 999 -----PEDG-------------------DSLYS----------------- 1007 alGLGviGAmFVRSiTvyLVrkgTLdeYvFLEHGAHYAIGaLAVImLlSi G GA+ +T + +L A +A GaLAV S gi|1751120 1008 -PGV-QLGAL----VTT---------HFLYL---AMWACGALAV----SQ 1035 drrfhVPEvVT.glvGva..fIaaAlweSlrrnrrevls<-* r+ P vV + l Gva + ++ + ++rr v+ gi|1751120 1036 --RW-LPRVVCsCLYGVAasALGLFVFTHHCARRRDVRA 1071 COG0472: domain 1 of 1, from 744 to 1071: score -174.3, E = 22 *->MLlmLa.llpalsslnlfsYltafrallalliafllsllltpifipf L +L + + + ++ + ++ +a+ll l+++i++++ gi|1751120 744 VLMELSaFPR--EVGGAGA--G-LHPVVYPCTALLLLCLFATIITYI 785 lrklaikigqdirkdgpksHkshKagTPtmGGlaIllsflivlslllwag l+++ i+++ ++ ++++ h a T a++++ + + + gi|1751120 786 LNHSSIRVSRKGWHMLLNLCF-HIAMT-----SAVFAG------GITLTN 823 lnsganpyevevwlvLlvllgfgliGflDDlfklsrKnnkGLsakiKlll + + + ++G+ + +ls + ll+ gi|1751120 824 YQ-------M----------VCQAVGITLHYSSLS-------T----LLW 845 qfiaAvlllilllkfdgslltqlyiPFfkspsfdlgtllylvlavfalVg + A +l l +++ +P ++p+++ + ++ f+l++ gi|1751120 846 MGVKARVL-HKELTWRAP------PPQEGDPALPTP----SPMLRFYLIA 884 ssNAvNltDGLDGLAaGlsviaalalaliaylsgnvnfAqYLlipyipda +G+ +i+++ a + + d+ gi|1751120 885 ---------------GGIPLIICGITAAVNIHNYR-------------DH 906 gelailclalaGAcLGFLwfNfyPGkAkvFMGDtGSlaLGavlgalavll ++ ++l + P +LGa+ +a++l gi|1751120 907 SPYCWLVW--------------RP-------------SLGAFYIPVALIL 929 klklqei...................................lllimggv ++ i + + +++ ++++ ++++ + + +++ ++++ l g+ gi|1751120 930 L--ITWIyflcaglrlrgplaqnpkagnsrasleageelrgsTRLRGSGP 977 fvietlsvilqvlsrklr......kd.pt.igkrifkmaplHhHfelkgw ++ + s +++ r+ ++++++++d++ ++g + H+ ++ +++ gi|1751120 978 LLSDSGSLLATGSARVGTpgppedGDsLYsPGVQLGALVTTHFLYLAMWA 1027 glkftlrqflifiilcaigiLislslrllreakvvvrfWiislilaligl ++++ +q r+ ++vv++++ + + al gi|1751120 1028 CGALAVSQ------------------RW--LPRVVCSCLYGVAASALGLF 1057 atlllaavgvllavifaflrfviwlklrl<-* ++++ + ++ +r+ gi|1751120 1058 VFTH---------------HCARRRDVRA 1071 COG0038: domain 1 of 1, from 738 to 1079: score -210.4, E = 73 *->frfetsstpllrkvflrllsgrailigviagllavalllalilvggv + + + l + + r + g agl++v+++ ++ l++ + gi|1751120 738 HLGNVAVLMELSAFP-RE------VGGAGAGLHPVVYPCTALLLLCL 777 vgalaagvlalgvavgeglrlvlaagplkvlllplvvalggSallalisg + ++ + +++ + + + ++ + +l +++ ++++g g gi|1751120 778 FATIITY-ILNHSSIRVSRKGWHMLLNLCFHIAMTSAVFAG--------G 818 llvykfapeaaGsGipqviealhgnegGqipwrvlvvKilaslltigsGg + + +++ + +Gi + +l+ + ++ +++ v++ gi|1751120 819 ITLTNYQMVCQAVGITLHYSSLSTLL--WMGVKARVLH------------ 854 SlGREGPsvqiGaalgsalgkllklkgadedrrtrlliaaGaAAGlAAaF l+ + + +e gi|1751120 855 ----------------KELTWRAPPP--QEGDPA---------------- 870 naPLAGalFiaEvlyrrfllvtallpalvasavaavivlvfgfnlfgfdp + +l + l++ + +++ +++++v ++n ++d+ gi|1751120 871 ------------LPTPSPMLRFYLIAGGIPLIICGITAAVNIHN--YRDH 906 lfslpiaasltvplldlplyillGifaGlaG.vlfvrllyklhalfrrlk + l + lG+f ++ +l++ ++y l a++r + gi|1751120 907 SPYCW-----------LVWRPSLGAFYIPVAlILLITWIYFLCAGLRLRG 945 lppvlklvigGlliGgsvvllalllPevlggGyglvqlvfagnlsltspa p + g + ++ l ++ e g + ++ ++ gi|1751120 946 -PLAQNPKAGNSRAS----LE--AGEELRGSTRLRGSGPLLSDSGSL--- 985 gllleilllLfvaKilaTslsygsGapGG......lfaPslaiGatlGra + + s+ +G pG ++++++l+ P++++Ga++ + gi|1751120 986 ----------------LATGSARVGTPGPpedgdsLYSPGVQLGALVTTH 1019 fgllvavllfpaisvspgifalaGMaaflaattraPLTaillvlEmTGny f ++M a+ a + ++ gi|1751120 1020 FL-------------------YLAMWACGA---------------LAVSQ 1035 tllpplLiacliasliaqllggkpiytiyaertlgrqnaeaeager<-* + lp ++cl ++ ++l + +t++ +r+++++ + ++ ++ gi|1751120 1036 RWLPRVVCSCLYGVAASALGLF--VFTHHCARRRDVRASWRACCPP 1079 COG2233: domain 1 of 1, from 732 to 1080: score -358.4, E = 72 *->m.llatalrsvllsmsdsampstmtksdlvygvddrp.pllkllllG + l + l+ v +m sa p+ + g+ ++ p llll gi|1751120 732 SaLHCQHLGNVAVLMELSAFPREVGG--AGAGLHPVVyPCTALLLLC 776 LQHllAMFgatVlVPLivGlalgLaaedlayLIsmsLfasGiaTLlQtli L F+ +++ ++ +++ ++ ++ gi|1751120 777 L------FATIITY-ILNHSSIRVS----------------------RKG 797 tgRpfGiglPivLGsSFAFvgPmIaaGglgkegGadieaamgGifgaglv ++ ++ L A +++ a+ g + +++ + ++ ++ gi|1751120 798 WH------MLLNLCFHIAMTSAVFAG---GITL-TNYQMVCQAVGITLHY 837 ygviglLisrfgtgdrLkrlfPPvVtGpVImvIGLsLapVAikmagGgea +++ +lL +++ + L++ ++ + + + gi|1751120 838 SSLSTLLWMGVKAR-VLHKELT---------------------WRAPPP- 864 amasgnpdfgslenLlLalvVLliilllnrfgkGflrlipILiGlvvGYv +++ al + +++lr + I G+ + gi|1751120 865 QEGDP------------ALPT----------PSPMLRFYLIAGGIPL--I 890 lAlfmGlvdfdsalveaap...wfalPtpfyfGaPYtpaFnwgaIltmlp ++ + v+ + +++p +++ P++ +f + ++Il + gi|1751120 891 ICGITAAVNI-HNYRDHSPycwLVWRPSLGAF------YIPVALILLITW 933 valv.........tivEhvGditAtgkvtgkpllgPeYkpgLhrG..lla ++ + + + +++++ + ++ + +g l g ++L+++++ll gi|1751120 934 IYFLcaglrlrgpLAQNPKAGNSRASLEAGEELRG---STRLRGSgpLLS 980 DGlatllAGl..fGgfPnTTFaqNiGvvalTgVaSryVivwaAvililLG D+ + l++G ++G+++ + +S+ V gi|1751120 981 DSGSLLATGSarVGTPGPPEDGDS--------LYSPGV------------ 1010 lfPKfaallqsIPsPVLGGamivlFGmIAasGiriLirakvdlsknRNLl ++al+++ ++ + +++ + s +R L gi|1751120 1011 ---QLGALVTT-----------------HFLYLAMWACGALAVS-QRWLP 1039 IvAvsLglGlGgaavPpeflaglPavlrplllsGialgaitAIvLNllLp v +s G+ ++a+ ++ + + + gi|1751120 1040 RVVCSCLYGVAASAL-G-LFVFTH-------------------------H 1062 grrrnpiveerreeksvtaeaaeealkkea<-* rr+ ++a +a ++ a gi|1751120 1063 -------CARRRD-----VRASWRACCPPA 1080 COG2995: domain 1 of 1, from 646 to 1087: score -336.7, E = 12 *->mFLSRYFTiLYRiGiLSNGFGSRRTTAqKiFLPDTKGLFiLPNLPVL +F S T R G + G G RR A + T G + NL gi|1751120 646 LFHSHSNT--SRPG--AAGPGKRRGVATPVIFAGTSGCGV-GNLT-- 685 HPDWV.YRLFSFimPPlilatppmtP.......kaklthhAlrGNCCdEl P V+ R ++ P + + P++ ++ ++ + +++ gi|1751120 686 EPVAVsLRHWAEGAEPVAAWWSQEGPgeaggwtSEGCQLRSSQ------- 728 lpgklilCpeCdllidvPrldqsaqsAeCPRCGhkLtrggrwplsrpaAl + + C + + + +l+ A+ PR +gg p+ + gi|1751120 729 PNVSALHCQHLGNVAVLMELS-----AF-PR-----EVGGAGAGLHPVVY 767 AlacLiLm.pfSlgfPyl....sisl..lGvtieatL...legiwllvee ++L+L+ +f+ +y+ +++si++++ G ++L + + + + + gi|1751120 768 PCTALLLLcLFATIITYIlnhsSIRVsrKGWHMLLNLcfhIAMTSAVFAG 817 gYalLAllvFfcvilaPllflllvlylwlaarlkqaRqLsssEvqklRll g l+ + +++ l + l +lw + + v ++l gi|1751120 818 GITLTNYQMVCQAVGITLHYSSLSTLLWMGVKAR---------VLHKELT 858 ll.........vLvrLKeWvMvDVFLVgvlVsliKlkdyAeiei...... + ++++++++ L + + M+ +L++ + li A + i++ +++ gi|1751120 859 WRapppqegdpALPT--PSPMLRFYLIAGGIPLIICGITAAVNIhnyrdh 906 Gpsllpfv....lfalLtvlllskvdvrwLWeeFqPAetelgqnakqvhv p+ + +v +++l a ++l+++ + W +F A+ l+ ++ q gi|1751120 907 SPYCW-LVwrpsLGAFYIPVALILLIT---WIYFLCAGLRLRGPLAQNPK 952 eiksGLvSCsaChyirtqKsllyPdqqkgvCpRCgskLyvRrpnLsLqwt ++s +++ + g +R +++L+ + p+ L+ + gi|1751120 953 AGNS--------RASL----------EAGEELRGSTRLRGSGPL--LSDS 982 wAlLlAaIilyfPANLLPImiiyllGareedTIlsGVisLwkeGsypVAa lL ++ + + + + ed sL + G + A+ gi|1751120 983 GSLLATG------------SARVGTPGPPEDG-----DSLYSPGVQLGAL 1015 VIFlASIlVPvlKiiamayLlasaGkfklqtgqknLkrmllYRvvefvGR V +yL + a ++l ++q+ ++ ++ gi|1751120 1016 V------------TTHFLYLAMWA-CGALAVSQR--WLPRV--------- 1041 WSMiDlFVIailmsLVqrgqilaftpGpaAlyFgaVViLTMLSAevFDpR +++ G aA + g+ V+ T A D R gi|1751120 1042 ------------VCSCL--------YGVAASALGLFVF-THHCARRRDVR 1070 LLWDe..pdsseaefne<-* +W + p+ s+a+ gi|1751120 1071 ASWRAccPPASPAAPHA 1087 COG0628: domain 1 of 1, from 738 to 1098: score -182.1, E = 49 *->mlmspairrllkpvvlrllvlllillllll..flylfqplllplllA +l a++++l+ +++ + +++ l++++y + lll+ l+A gi|1751120 738 HLGNVAVLMELS-----AFPREVGGAGAGLhpVVYPCTALLLLCLFA 779 lvlayllnPvvrwLerklgipRplavllvllllllllallllllvptlv. +++y+ln ++ r +g + l ++ +++ +++a +l +++v gi|1751120 780 TIITYILNHSSIRVSR-KGWHMLLNLCFHIAMTSAVFAGGITLTNYQMVc 828 gidqlgqlirnnlPqlnnllqqllawlpnig................... + + ++ ++l++l++++ ++ ++ + ++++++++ +++++ gi|1751120 829 QAVGITLHY-SSLSTLLWMGVKARVLHKELTwrapppqegdpalptpspm 877 ...llqlslyasldeliqqlisnrlaailgsilssllnllgrqvknllgl + +l+++ ++ + i+++++ i ++ s + +l+ + + + + gi|1751120 878 lrfYLIAGGIPLIICGITAAVN-----IHNYRDHSPYCWLVWRPSLGAFY 922 ivslllvllllfffLldgeklkegilsllPsrlyrprvkrilselnaslg i +l++l++ ++fL g +l+ +++ + + + ++ +el+ s + gi|1751120 923 IPVALILLITWIYFLCAGLRLRGPLAQNPK-AGNSRASLEAGEELRGSTR 971 nyirGqvlvaliiGvlsgigllllgvpy............alllallagl +rG+ + G l + g + +g p++++++++ +++ l+al++ gi|1751120 972 --LRGSGPLLSDSGSLLATGSARVGTPGppedgdslyspgVQLGALVTTH 1019 lnlIPyiGaviiwiPaliyalltggglatwggllvlivflviqqlednvL + y+ ++ ++ + l + + l+++++ + ++ gi|1751120 1020 F---LYLAMWAC---GALAVSQRW--L-PRVVCSCLYGVAAS--ALGLFV 1058 rPklmgkrlgLhPllillsllgGgslfGfvGlilgpPllavlkalldayl +r + + a ++a gi|1751120 1059 FTHHCARRRD-------------------------------VRASWRACC 1077 rgdia.eli.fkklaeqlgeees<-* + a+++ ++ ++++ + e+ gi|1751120 1078 PP--AsPAApHAPPRALPAAAED 1098 COG3333: domain 1 of 1, from 732 to 1163: score -415.0, E = 61 *->MemldsllQGlslvleptNLllmlIGvlvGil...VGlLPGLGpina +l ++L+ + + + ++++VG+ a gi|1751120 732 ------------SALHCQHLGNVAVLMELSAFpreVGG--------A 758 vALllPlTFALGlpPstAlILlagVYlGalyGGSIsSILlNiPGepSsVa A l P+ + P tAl+Ll +l+ I+ IL Ss+ gi|1751120 759 GAGLHPVVY-----PCTALLLL------CLFATIITYIL-----NHSSIR 792 TafDGYPMArkGqAGkALalsayaSfi..GalvsallLlFlsPqlAkfAL G+ M + +++++++ +G ++ ++ + + L gi|1751120 793 VSRKGWHMLLN----LCFHIAMTSAVFagGITLTNYQMVCQAVGI---TL 835 aFgspEYFaLmLfaltivaGmsGqGsplK...tLlaavlGlllatVGiDk +s l+++ m + ++l ++ t+ a + gi|1751120 836 HYSS----------LSTLLWMGVKARVLHkelTWRAPP-----------P 864 vsGvlRFTFdqtfLlsGisFillviGLFaisEILlqveKrrekrsAnEqE G++ + ++ L + +i i L I ++ + gi|1751120 865 QEGDPALPTPSPMLR--FYLIAGGIPL-----IICG------------IT 895 rakntgrmrlRtwkelaevllviLRSsllGfimGVlPGaGaTiASFlsYs +a+n+ r + + +lv+ R sl f + V T Fl + gi|1751120 896 AAVNIHNYRD----HSPYCWLVW-RPSLGAFYIPVALILLITWIYFLCAG 940 larr..lskkAkkeEFGnGsleGvaAPEtAnnaa...tggalvPMLTLGv l r++l+ +++ Gn s ++ A E ++++ +++g P+L gi|1751120 941 LRLRgpLAQ--NPKA-GN-SRASLEAGEELRGSTrlrGSG---PLLS--- 980 PGSgtTAVMLGALliyGl....qPGPqLFveqpDiVWglIaSmyvgNvvl + G Ll+ G + ++PGP e +D ++ + +g v gi|1751120 981 --DS------GSLLATGSarvgTPGPP---EDGDSLYS--PGVQLGALVT 1017 l.IvnlplvrlFakiLavPssvLlPvIlmfclIGvYaVhystlDVylMi. + + l++ + Lav s lP ++ cl Y+V +s l +++ + gi|1751120 1018 ThFLYLAM--WACGALAV-SQRWLPRVVCSCL---YGVAASALGLFVFTh 1061 .......lfGliGYlLrKlgFPlaPLLLGfILGglaEenLRrALviSdGe + +++++ + + + P aP L aE+ + v+ +G gi|1751120 1062 hcarrrdVRASWRACCPPAS-PAAPHAPPRALPAAAEDG---SPVFGEGP 1107 LsilWe......rPlaLtftiLaivmLLlPLlrRCAkAldrRvkr...Ra s +++++++ PlaL+ L + L + A+A +++++ gi|1751120 1108 PSLKSSpsgssgHPLALGPCKLTNLQLAQSQVC-EAGAAAGGEGEpepAG 1156 TGtkaek<-* T+ +++ gi|1751120 1157 TRGNLAH 1163 COG0531: domain 1 of 1, from 660 to 1175: score -178.0, E = 82 *->ssalrkkLGlwdlvalgigamiG..aGifalpgtaaalag..paail + +r+ ++ +a + g +G+ + +a+ + +a ++++ aa++ gi|1751120 660 GPGKRRGVATPVIFAGTSGCGVGnlTEPVAVSLRHWAEGAepVAAWW 706 a.....wllaglivlliAlsyaelssaiapelslaGGvYtYarralG... ++++++ + ++ + s +sa+ + + G + + ++ gi|1751120 707 SqegpgEAGGWTSEGCQLRSSQPNVSAL-HC-Q-HLGNVAVLMELSAfpr 753 fagfltgwlyllsyiigpaiialaavsyl.slfpggfgasfgialptwlg +g + l+ + y ++ + + ++++ +++++ + s+ + +w+ gi|1751120 754 EVGGAGAGLHPVVYPCTALLLLCLFATIItYILNH---SSIRVSRKGWHM 800 illggaqvlivivllllftllnllGvklsgrlntigteSkrvalivipll +l + ++ +++t +++G ++++++ ++++ i l gi|1751120 801 LLNL--------CFHIAMTSAVFAGGITLTNYQMV------CQAVGITLH 836 ifillgiilfasglgfvtanflgpsgngifvdVlpg...glagpgfggvl l +++ + ++ + ++ p++++++a p +++l gi|1751120 837 YSSLSTLLWMGVKARVLHKELTWRA--------PPPqegDPALPTPSPML 878 agailvg.iffaFvGfeavatlaeEvknP.............erniPrAi +++g+i + G+ a+++ ++ ++ +++ ++ iP A+ gi|1751120 879 RFYLIAGgIPLIICGITAAVN-IHNYRDHspycwlvwrpslgAFYIPVAL 927 ilsllivgliYilvsvalvgggdwsevvpsselas.saapangrkvanvf i lli+ + ++++ + l g + ++ a++s a gi|1751120 928 I--LLITWIYFLCAGLRLRG------PLAQNPKAGnSRAS---------- 959 alavaatallGwgaiiisvgallsllgsllawilsasRvlyamArdGilP ++a l G ++ g lls +gslla+ + Ar G gi|1751120 960 --LEAGEELRG-STRLRGSGPLLSDSGSLLAT---------GSARVGTPG 997 k.....ffskvnprfrisltPvialiltgilsallallfplvslafsvlt ++++++++ ++++t + +++++++al l ++ s l gi|1751120 998 PpedgdSLYSPGVQLGALVTTHFLYLAMWACGALAVSQRWLPRVVCSCLY 1047 llaflavllaylvlrialkklkpdlrrpgkkekkafrrpgvg..vlpilg +a a+ l ++ + +a+ ++d+r+++++ + ++ p++++ +l gi|1751120 1048 GVAASALGLFVFTHHCAR---RRDVRASWRACC-PPASPAAPhaPPRALP 1093 ialclya.llyssglp............................lilgvi +a+ ++++ +++ +++++++++++ ++ + ++ + ++ ++ ++ gi|1751120 1094 AAAEDGSpVFGEGPPSlksspsgssghplalgpckltnlqlaqsQVCEAG 1143 lvalGllilyllyarklpsdqsrlilssflhkkevedeeveevlepelas +a G + + + + + ++ gi|1751120 1144 AAAGG-EGEPEPAGT--------------RGNL----------AHRHPNN 1168 aeakaav<-* +++ +++ gi|1751120 1169 VHHGRRA 1175 COG1009: domain 1 of 1, from 738 to 1331: score -435.7, E = 96 *->mmmnlllliillPllgalvlllfggrsrlirktggllagvfslllvl + n + l+ l+ a+ +g + +g +++ + ++l gi|1751120 738 HLGNVAVLMELS----AFPREVGG------AGAGLHPVVYPC--TAL 772 sallsallillflgqggseaaihqslftWlpagnlginfgfyiDgLsllM ++l+ + +i+ +++++ s + + + ++ l +n+ f+i s+++ gi|1751120 773 LLLCLFATIITYILNHSS----IRVSRKGWHM--L-LNLCFHIAMTSAVF 815 llvVtgvglLihiYSigYMadedegkdygRFfayLlLFtasMLgLVlSDN + +t+ +++ gi|1751120 816 AGGITL---------------TNYQ------------------------- 825 LLllflgWEgvGLcSYLLIGFWysrpkamFfpskaAkkAFlvtRvGDlgl + ++ gi|1751120 826 ---------------------------M-------------------VCQ 829 LlGiilLyltfGtlsiselfeaassldaagiqvl.g.............. + Gi l y+ tl + + + + + +++++++++++ ++++ gi|1751120 830 AVGITLHYSSLSTLLWMGVKARVLHKELT----WrApppqegdpalptps 875 .....hvGveslltlilLllllGAfgKSAQfPlHtWLpdAMEGPTPVSAL + + + ++++i L + + + gi|1751120 876 pmlrfY----LIAGGIPLIICG------------------------IT-- 895 lHAATMVtAGVYLvaRlsplfep..........yL.lspvalyl.valvG A ++ +++ +++++ + + s a+y++val+ gi|1751120 896 ---AAVN-------------IHNyrdhspycwlVWrPSLGAFYIpVALIL 929 avTallgaliAlvQnDiKriLAYSTiSQLGyMflalGlg..........g ++T ++ fl +Gl +++ ++++ gi|1751120 930 LITWIY--------------------------FLCAGLRlrgplaqnpkA 953 AysaAifHLitHAfFKAlLFLsaGsvihamhtrqnDirkmGG........ + s A + +++++++ ++++G+++ +++ gi|1751120 954 GNSRASL---------------------EAGEELRGSTRLRGsgpllsds 982 ......................LrkyMPvtavafliGsLaLaGvPPftsG ++ +++ + +++++++++++L++ P + +++l+ t + gi|1751120 983 gsllatgsarvgtpgppedgdsLYS--PGVQLGALV-----------TTH 1019 FfSKdeIleaalasghlllsnvptwgflaplllligallTafYsfRllfl F+ ++ +ala+++ +l ++ + l+ ++ s+++lf+ gi|1751120 1020 FLYLAMWACGALAVSQRWL-----------PRVVCSCLYGVAASALGLFV 1058 vFhG......eperheaerhylHaldddvliaHpHeapgvmlnfylPliv + h+ ++++ +++ a p+ +p + + P+ gi|1751120 1059 FTHHcarrrdVRASWRAC--------------CPPASPAAPH---APPRA 1091 LavlslllGvffpillstwlpapllavaelgeelpggLahitllvlalls L +++ + p++ ++ + l +++ g ++ La + l+ l gi|1751120 1092 LPAAAEDGS---PVFGEGPPS---LKSSPSGSSGHP-LALGPC-KLTNLQ 1133 lalaivGillall.mylgakllptkiyl.............lepfrflyk la + v+ + a+ + + + + +t +l ++++++ ++++++ + r+ + gi|1751120 1134 LAQSQVCEAGAAAgGEGEPEPAGTRGNLahrhpnnvhhgrrAHKSRAKGH 1183 llynkyyldklYdglfvkvglliarlldavldkyliDygivnavgr.... + ++ + ++l + + ++ ++++ ll + l ++ + +++g ++++ gi|1751120 1184 RAGEACGKNRL--KALRGGAAGALELLSSE-SGSLHNSPTDSYLGSsrns 1230 .................................................. ++ + + ++++ +++++++++ + ++ ++ +++++ ++++ ++++ + gi|1751120 1231 pgaglqlegepmltpsegsdtsaaplseagragqrrsasrdslkgggale 1280 .........alvtlfsrGavlrrtqsGylrahYalslvlGavlllailvs +++++++ + ++ ++ + ++ ++ ++ + ++ G + ++ gi|1751120 1281 keshrrsypLNAASLN--GAPKGGKYDDVT-LMGAEVASGGCMKTGL--- 1324 galflvrrl<-* + + + gi|1751120 1325 --WKSETTV 1331 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm-f Sequence file: tem9.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|17511205|ref|NP_116166.6| tumor endothelial marker 5 precursor [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG0842 4.3 3.9 1 COG1532 2.2 51 1 COG1659 1.4 20 1 COG3216 0.3 25 1 COG1176 0.3 55 1 COG0052 0.1 52 1 COG1901 -0.4 34 1 COG0750 -0.5 55 1 COG2969 -0.8 86 1 COG1132 -1.5 89 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG1532 1/1 125 137 .. 46 58 .. 2.2 51 COG0750 1/1 137 153 .. 487 503 .] -0.5 55 COG2969 1/1 173 179 .. 163 169 .] -0.8 86 COG1659 1/1 587 600 .. 273 286 .] 1.4 20 COG0842 1/1 753 795 .. 324 365 .] 4.3 3.9 COG1132 1/1 768 796 .. 1 32 [. -1.5 89 COG3216 1/1 843 859 .. 141 157 .. 0.3 25 COG1901 1/1 900 911 .. 295 306 .] -0.4 34 COG1176 1/1 910 924 .. 237 251 .. 0.3 55 COG0052 1/1 1028 1039 .. 95 106 .. 0.1 52 Alignments of top-scoring domains: COG1532: domain 1 of 1, from 125 to 137: score 2.2, E = 51 *->GkvvkiDLdehkI<-* G+++++DL ++I gi|1751120 125 GELKRLDLSNNRI 137 COG0750: domain 1 of 1, from 137 to 153: score -0.5, E = 55 *->lllmllltfndiprlla<-* ++++ + tf+++prll+ gi|1751120 137 IGCLTSETFQGLPRLLR 153 COG2969: domain 1 of 1, from 173 to 179: score -0.8, E = 86 *->psLRVVK<-* p+L+VV+ gi|1751120 173 PALKVVD 179 COG1659: domain 1 of 1, from 587 to 600: score 1.4, E = 20 *->sriktaeAililSs<-* +r+ t ++ + lSs gi|1751120 587 FRCTTGRPNVSLSS 600 COG0842: domain 1 of 1, from 753 to 795: score 4.3, E = 3.9 *->Rylllg.aglsdvwfsllvLallgllllllgllllrrrekkar<-* R++ + +agl +v++ + +L+ll+l++ ++ ++l + +++ r gi|1751120 753 REVGGAgAGLHPVVYPCTALLLLCLFATIITYILNHSSIRVSR 795 COG1132: domain 1 of 1, from 768 to 796: score -1.5, E = 89 *->lillllglfllgvaravlsylqryllarvgqr<-* + ll+l+l++ +++y+ ++ +rv+ + gi|1751120 768 PCTALLLLCLFA---TIITYILNHSSIRVSRK 796 COG3216: domain 1 of 1, from 843 to 859: score 0.3, E = 25 *->awLmqrpgrpfsdeLtW<-* ++ m++ +r ++ eLtW gi|1751120 843 LLWMGVKARVLHKELTW 859 COG1901: domain 1 of 1, from 900 to 911: score -0.4, E = 34 *->vhnhLDrlgaps<-* +hn++D+ ++ gi|1751120 900 IHNYRDHSPYCW 911 COG1176: domain 1 of 1, from 910 to 924: score 0.3, E = 55 *->clLVFiPAlGsfviP<-* c+LV P+lG+f+iP gi|1751120 910 CWLVWRPSLGAFYIP 924 COG0052: domain 1 of 1, from 1028 to 1039: score 0.1, E = 52 *->cGmfYVngRWLG<-* cG+ +V++RWL+ gi|1751120 1028 CGALAVSQRWLP 1039 //