analysis of sequence from tem11 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGDDFVSPALELSGALRFYDRSD IDAVYVTTNGIIATSEPPAKESHPGLFPPTFGAVAPFLADLDTTDGLGKVYYREDLSPSITQRAAECVHR GFPEISFQPSSAVVVTWESVAPYQGPSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDGLQFHTTFSKK ENNQVPAVVAFSQGSVGFLWKSNGAYNIFANDRESIENLAKSSNSGQQGVWVFEIGSPATTNGVVPADVI LGTEDGAEYDDEDEDYDLATTRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERT RSFQLAVETFHQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVHAECRDYATGFCCSCVAGYTG NGRQCVAEGSPQRVNGKVKGRIFVGSSQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPV GGIIGWMFAVEQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGIDEHGHLTIDTELEGRVPQIP FGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGASPSRIYTYQWRQTITFQECVHDDSRPALPST QQLSVDSVFVLYNQEEKILRYAFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIG FRGDGRTCYDIDECSEQPSVCGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCD IPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCV PGEVEKTRCQHEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVDRDGREVEGTR TRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLEGNTMRKTEAKAFLHVPAKVI IGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDLGSPEGIAVDHLGRNIFWTDSNLDRIEVAKL DGTQRRVLFETDLVNPRGIVTDSVRGNLYWTDWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLHFDAFS SQLCWVDAGTNRAECLNPSQPSRRKALEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPH KQTRLYGITTALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > Tem11gi|128199|sp|P14543|NIDO_HUMAN . . . . . 1 MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGD 50 ___HHHHHHHHHHHHHHHHEEE_____________________EEE____ . . . . . 51 DFVSPALELSGALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPT 100 ____HHHHHHHHEEEE______EEEEE_______________________ . . . . . 101 FGAVAPFLADLDTTDGLGKVYYREDLSPSITQRAAECVHRGFPEISFQPS 150 _EEEEEE___________EEEEEE______HHHHHHHHHH___EEE____ . . . . . 151 SAVVVTWESVAPYQGPSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDG 200 _EEEEEE__________________HHHHHHHHH______EEEEEE____ . . . . . 201 LQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFANDRESIENLA 250 EEEE____________EEEEE_____EEEEE____EEEE_____HHHHHH . . . . . 251 KSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT 300 ________EEEEEE_________EEE_EEEEE__________________ . . . . . 301 TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERT 350 ______________HHHHH______EEE_____HHHHHH_________HH . . . . . 351 RSFQLAVETFHQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVH 400 HHHHHHHHHHHH____EEEEE_______EEE______________EEEE_ . . . . . 401 AECRDYATGFCCSCVAGYTGNGRQCVAEGSPQRVNGKVKGRIFVGSSQVP 450 __________EEEEE_______EEEEEE__________EEEEEE_____E . . . . . 451 IVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPVGGIIGWMFAV 500 EEE______EEEEE_____EEEE_____EEEEE__________HHHHHEE . . . . . 501 EQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGIDEHGHLTIDT 550 EE__________________EEEEE_____EEEEE____________HHH . . . . . 551 ELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS 600 HH___________EEEE___EEEEEEE_________EEEEEE________ . . . . . 601 PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILR 650 __EEEEEHHHHHHHH_____________________EEEEEE_HHHHHHH . . . . . 651 YAFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIG 700 HHH_______________________________________________ . . . . . 701 FRGDGRTCYDIDECSEQPSVCGSHTICNNHPGTFRCECVEGYQFSDEGTC 750 __________________________________EEEEE_________EE . . . . . 751 VAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQAC 800 EEEEE_____EEE____________EEEE______EEE____________ . . . . . 801 QDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCVPGEVEKTRCQ 850 ________________________EEE________________HHHHHHH . . . . . 851 HEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD 900 HHHHHHH______________________________________EEEE_ . . . . . 901 RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQ 950 ___________________EEE____________EEEE_____EEEEEE_ . . . . . 951 TGKIERLPLEGNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEP 1000 ____EEE______HHHHHHHHHH___EEEEEEEEE____EEEEE______ . . . . . 1001 SIGRASLHGGEPTTIIRQDLGSPEGIAVDHLGRNIFWTDSNLDRIEVAKL 1050 _____________EEEEE________EEEE___EEEE______EEEEEE_ . . . . . 1051 DGTQRRVLFETDLVNPRGIVTDSVRGNLYWTDWNRDNPKIETSYMDGTNR 1100 _____EEEEE_____________________________EEEEE_____E . . . . . 1101 RILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQPSRRKALEGL 1150 EEEEE_________________EEEE________________HHHHHHH_ . . . . . 1151 QYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT 1200 ____EEE______________HHHHHHHHHHHH________EEEEEEEEE . . . . 1201 ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK 1247 EE_________________EEEEE______________EEEEEE___ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 0.0 % beta-contents : 28.8 % coil-contents : 71.2 % class : beta method : 2 alpha-contents : 0.0 % beta-contents : 23.9 % coil-contents : 76.1 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -18.80 -0.56 -0.78 0.00 -4.00 0.00 -4.00 0.00 -2.15 -5.26 -3.96 0.00 -12.00 -16.00 -12.00 0.00 -79.53 -22.69 -1.76 -0.04 -0.55 -4.00 0.00 0.00 0.00 -1.14 -4.93 -3.96 0.00 -12.00 -12.00 -12.00 0.00 -75.08 ID: Tem11gi|128199|sp|P14543|NIDO_HUMAN AC: xxx Len: 1190 1:I 1159 Sc: -75.08 Pv: 6.759856e-01 NO_GPI_SITE GPI: learning from protozoa -17.25 -10.05 -4.27 -3.98 -4.00 0.00 0.00 0.00 -1.33 -3.18 -13.20 0.00 -12.00 -8.00 -12.00 0.00 -89.27 -30.32 -2.01 -0.39 0.00 -4.00 0.00 0.00 0.00 -1.83 -3.59 -13.20 0.00 -12.00 -8.00 -12.00 0.00 -87.34 ID: Tem11gi|128199|sp|P14543|NIDO_HUMAN AC: xxx Len: 1190 1:I 1161 Sc: -87.34 Pv: 6.457286e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem11gi|128 0.586 29 Y 0.694 29 Y 0.975 17 Y 0.885 Y # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem11gi|128 0.509 800 Y 0.333 29 N 0.996 9 Y 0.774 Y # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? Tem11gi|128 0.649 338 Y 0.411 32 Y 0.990 15 Y 0.865 Y ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) 1-15 MLASSSRIRAAWTRA lllplll 16-22 23-41 AGPVGCLSRQELFPFGPGQ gdleledgdd 42-51 52-281 FVSPALELSGALRFYDRSDIDAVYVTTNGI IATSEPPAKESHPGLFPPTFGAVAPFLADL DTTDGLGKVYYREDLSPSITQRAAECVHRG FPEISFQPSSAVVVTWESVAPYQGPSRDPD QKGKRNTFQAVLASSDSSSYAIFLYPEDGL QFHTTFSKKENNQVPAVVAFSQGSVGFLWK SNGAYNIFANDRESIENLAKSSNSGQQGVW VFEIGSPATTNGVVPADVIL gtedgaeyddededydlat 282-300 301-1247 TRLGLEDVGTTPFSYKALRRGGADTYSVPS VLSPRRAATERPLGPPTERTRSFQLAVETF HQQHPQVIDVDEVEETGVVFSYNTDSRQTC ANNRHQCSVHAECRDYATGFCCSCVAGYTG NGRQCVAEGSPQRVNGKVKGRIFVGSSQVP IVFENTDLHSYVVMNHGRSYTAISTIPETV GYSLLPLAPVGGIIGWMFAVEQDGFKNGFS ITGGEFTRQAEVTFVGHPGNLVIKQRFSGI DEHGHLTIDTELEGRVPQIPFGSSVHIEPY TELYHYSTSVITSSSTREYTVTEPERDGAS PSRIYTYQWRQTITFQECVHDDSRPALPST QQLSVDSVFVLYNQEEKILRYAFSNSIGPV REGSPDALQNPCYIGTHGCDTNAACRPGPR TQFTCECSIGFRGDGRTCYDIDECSEQPSV CGSHTICNNHPGTFRCECVEGYQFSDEGTC VAVVDQRPINYCETGLHNCDIPQRAQCIYT GGSSYTCSCLPGFSGDGQACQDVDECQPSR CHPDAFCYNTPGSFTCQCKPGYQGDGFRCV PGEVEKTRCQHEREHILGAAGATDPQRPIP PGLFVPECDAHGHYAPTQCHGSTGYCWCVD RDGREVEGTRTRPGMTPPCLSTVAPPIHQG PAVPTAVIPLPPGTHLLFAQTGKIERLPLE GNTMRKTEAKAFLHVPAKVIIGLAFDCVDK MVYWTDITEPSIGRASLHGGEPTTIIRQDL GSPEGIAVDHLGRNIFWTDSNLDRIEVAKL DGTQRRVLFETDLVNPRGIVTDSVRGNLYW TDWNRDNPKIETSYMDGTNRRILVQDDLGL PNGLHFDAFSSQLCWVDAGTNRAECLNPSQ PSRRKALEGLQYPFAVTSYGKNLYFTDWKM NSVVALDLAISKETDAFQPHKQTRLYGITT ALSQCPQGHNYCSVNNGGCTHLCLATPGSR TCRCPDNTLGVDCIERK low complexity regions: SEG 25 3.0 3.3 >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) 1-1 M lasssriraawtralllplllagpvgclsr 2-46 qelfpfgpgqgdlel 47-93 EDGDDFVSPALELSGALRFYDRSDIDAVYV TTNGIIATSEPPAKESH pglfpptfgavapfladldttdglg 94-118 119-268 KVYYREDLSPSITQRAAECVHRGFPEISFQ PSSAVVVTWESVAPYQGPSRDPDQKGKRNT FQAVLASSDSSSYAIFLYPEDGLQFHTTFS KKENNQVPAVVAFSQGSVGFLWKSNGAYNI FANDRESIENLAKSSNSGQQGVWVFEIGSP attngvvpadvilgtedgaeyddededydl 269-311 attrlgledvgtt 312-562 PFSYKALRRGGADTYSVPSVLSPRRAATER PLGPPTERTRSFQLAVETFHQQHPQVIDVD EVEETGVVFSYNTDSRQTCANNRHQCSVHA ECRDYATGFCCSCVAGYTGNGRQCVAEGSP QRVNGKVKGRIFVGSSQVPIVFENTDLHSY VVMNHGRSYTAISTIPETVGYSLLPLAPVG GIIGWMFAVEQDGFKNGFSITGGEFTRQAE VTFVGHPGNLVIKQRFSGIDEHGHLTIDTE LEGRVPQIPFG ssvhiepytelyhystsvitssstreytvt 563-607 eperdgaspsriyty 608-912 QWRQTITFQECVHDDSRPALPSTQQLSVDS VFVLYNQEEKILRYAFSNSIGPVREGSPDA LQNPCYIGTHGCDTNAACRPGPRTQFTCEC SIGFRGDGRTCYDIDECSEQPSVCGSHTIC NNHPGTFRCECVEGYQFSDEGTCVAVVDQR PINYCETGLHNCDIPQRAQCIYTGGSSYTC SCLPGFSGDGQACQDVDECQPSRCHPDAFC YNTPGSFTCQCKPGYQGDGFRCVPGEVEKT RCQHEREHILGAAGATDPQRPIPPGLFVPE CDAHGHYAPTQCHGSTGYCWCVDRDGREVE GTRTR pgmtppclstvappihqgpavptaviplpp 913-952 gthllfaqtg 953-1247 KIERLPLEGNTMRKTEAKAFLHVPAKVIIG LAFDCVDKMVYWTDITEPSIGRASLHGGEP TTIIRQDLGSPEGIAVDHLGRNIFWTDSNL DRIEVAKLDGTQRRVLFETDLVNPRGIVTD SVRGNLYWTDWNRDNPKIETSYMDGTNRRI LVQDDLGLPNGLHFDAFSSQLCWVDAGTNR AECLNPSQPSRRKALEGLQYPFAVTSYGKN LYFTDWKMNSVVALDLAISKETDAFQPHKQ TRLYGITTALSQCPQGHNYCSVNNGGCTHL CLATPGSRTCRCPDNTLGVDCIERK low complexity regions: SEG 45 3.4 3.75 >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) 1-1 M lasssriraawtralllplllagpvgclsr 2-72 qelfpfgpgqgdleledgddfvspalelsg alrfydrsdid 73-274 AVYVTTNGIIATSEPPAKESHPGLFPPTFG AVAPFLADLDTTDGLGKVYYREDLSPSITQ RAAECVHRGFPEISFQPSSAVVVTWESVAP YQGPSRDPDQKGKRNTFQAVLASSDSSSYA IFLYPEDGLQFHTTFSKKENNQVPAVVAFS QGSVGFLWKSNGAYNIFANDRESIENLAKS SNSGQQGVWVFEIGSPATTNGV vpadvilgtedgaeyddededydlattrlg 275-381 ledvgttpfsykalrrggadtysvpsvlsp rraaterplgpptertrsfqlavetfhqqh pqvidvdeveetgvvfs 382-534 YNTDSRQTCANNRHQCSVHAECRDYATGFC CSCVAGYTGNGRQCVAEGSPQRVNGKVKGR IFVGSSQVPIVFENTDLHSYVVMNHGRSYT AISTIPETVGYSLLPLAPVGGIIGWMFAVE QDGFKNGFSITGGEFTRQAEVTFVGHPGNL VIK qrfsgidehghltidtelegrvpqipfgss 535-624 vhiepytelyhystsvitssstreytvtep erdgaspsriytyqwrqtitfqecvhddsr 625-1247 PALPSTQQLSVDSVFVLYNQEEKILRYAFS NSIGPVREGSPDALQNPCYIGTHGCDTNAA CRPGPRTQFTCECSIGFRGDGRTCYDIDEC SEQPSVCGSHTICNNHPGTFRCECVEGYQF SDEGTCVAVVDQRPINYCETGLHNCDIPQR AQCIYTGGSSYTCSCLPGFSGDGQACQDVD ECQPSRCHPDAFCYNTPGSFTCQCKPGYQG DGFRCVPGEVEKTRCQHEREHILGAAGATD PQRPIPPGLFVPECDAHGHYAPTQCHGSTG YCWCVDRDGREVEGTRTRPGMTPPCLSTVA PPIHQGPAVPTAVIPLPPGTHLLFAQTGKI ERLPLEGNTMRKTEAKAFLHVPAKVIIGLA FDCVDKMVYWTDITEPSIGRASLHGGEPTT IIRQDLGSPEGIAVDHLGRNIFWTDSNLDR IEVAKLDGTQRRVLFETDLVNPRGIVTDSV RGNLYWTDWNRDNPKIETSYMDGTNRRILV QDDLGLPNGLHFDAFSSQLCWVDAGTNRAE CLNPSQPSRRKALEGLQYPFAVTSYGKNLY FTDWKMNSVVALDLAISKETDAFQPHKQTR LYGITTALSQCPQGHNYCSVNNGGCTHLCL ATPGSRTCRCPDNTLGVDCIERK low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGDDFVSPALELS GALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPTFGAVAPFLADLDTTDGLGKV YYREDLSPSITQRAAECVHRGFPEISFQPSSAVVVTWESVAPYQGPSRDPDQKGKRNTFQ AVLASSDSSSYAIFLYPEDGLQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFA NDRESIENLAKSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERTRSFQLAVETF hqqhpqvidvdeveeTGVVFSYNTDSRQTCANNRHQCSVHAECRDYATGFCCSCVAGYTG NGRQCVAEGSPQRVNGKVKGRIFVGSSQVPIVFENTDLHSYVVMNHGRSYTAISTIPETV GYSLLPLAPVGGIIGWMFAVEQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGI DEHGHLTIDTELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILRYAFSNSIGPV REGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSV CGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYT GGSSYTCSCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCV PGEVEKTRCQHEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLE GNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDL GSPEGIAVDHLGRNIFWTDSNLDRIEVAKLDGTQRRVLFETDLVNPRGIVTDSVRGNLYW TDWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQ PSRRKALEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK 1 - 360 MLASSSRIRA AWTRALLLPL LLAGPVGCLS RQELFPFGPG QGDLELEDGD DFVSPALELS GALRFYDRSD IDAVYVTTNG IIATSEPPAK ESHPGLFPPT FGAVAPFLAD LDTTDGLGKV YYREDLSPSI TQRAAECVHR GFPEISFQPS SAVVVTWESV APYQGPSRDP DQKGKRNTFQ AVLASSDSSS YAIFLYPEDG LQFHTTFSKK ENNQVPAVVA FSQGSVGFLW KSNGAYNIFA NDRESIENLA KSSNSGQQGV WVFEIGSPAT TNGVVPADVI LGTEDGAEYD DEDEDYDLAT TRLGLEDVGT TPFSYKALRR GGADTYSVPS VLSPRRAATE RPLGPPTERT RSFQLAVETF 361 - 375 hqqhpqvidv devee 376 - 1247 TGVVF SYNTDSRQTC ANNRHQCSVH AECRDYATGF CCSCVAGYTG NGRQCVAEGS PQRVN GKVKG RIFVGSSQVP IVFENTDLHS YVVMNHGRSY TAISTIPETV GYSLLPLAPV GGIIG WMFAV EQDGFKNGFS ITGGEFTRQA EVTFVGHPGN LVIKQRFSGI DEHGHLTIDT ELEGR VPQIP FGSSVHIEPY TELYHYSTSV ITSSSTREYT VTEPERDGAS PSRIYTYQWR QTITF QECVH DDSRPALPST QQLSVDSVFV LYNQEEKILR YAFSNSIGPV REGSPDALQN PCYIG THGCD TNAACRPGPR TQFTCECSIG FRGDGRTCYD IDECSEQPSV CGSHTICNNH PGTFR CECVE GYQFSDEGTC VAVVDQRPIN YCETGLHNCD IPQRAQCIYT GGSSYTCSCL PGFSG DGQAC QDVDECQPSR CHPDAFCYNT PGSFTCQCKP GYQGDGFRCV PGEVEKTRCQ HEREH ILGAA GATDPQRPIP PGLFVPECDA HGHYAPTQCH GSTGYCWCVD RDGREVEGTR TRPGM TPPCL STVAPPIHQG PAVPTAVIPL PPGTHLLFAQ TGKIERLPLE GNTMRKTEAK AFLHV PAKVI IGLAFDCVDK MVYWTDITEP SIGRASLHGG EPTTIIRQDL GSPEGIAVDH LGRNI FWTDS NLDRIEVAKL DGTQRRVLFE TDLVNPRGIV TDSVRGNLYW TDWNRDNPKI ETSYM DGTNR RILVQDDLGL PNGLHFDAFS SQLCWVDAGT NRAECLNPSQ PSRRKALEGL QYPFA VTSYG KNLYFTDWKM NSVVALDLAI SKETDAFQPH KQTRLYGITT ALSQCPQGHN YCSVN NGGCT HLCLATPGSR TCRCPDNTLG VDCIERK low complexity regions: DUST >Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGDDFVSPALELS GALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPTFGAVAPFLADLDTTDGLGKV YYREDLSPSITQRAAECVHRGFPEISFQPSSAVVVTWESVAPYQGPSRDPDQKGKRNTFQ AVLASSDSSSYAIFLYPEDGLQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFA NDRESIENLAKSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERTRSFQLAVETF HQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVHAECRDYATGFCCSCVAGYTG NGRQCVAEGSPQRVNGKVKGRIFVGSSQVPIVFENTDLHSYVVMNHGRSYTAISTIPETV GYSLLPLAPVGGIIGWMFAVEQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGI DEHGHLTIDTELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILRYAFSNSIGPV REGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSV CGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYT GGSSYTCSCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCV PGEVEKTRCQHEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLE GNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDL GSPEGIAVDHLGRNIFWTDSNLDRIEVAKLDGTQRRVLFETDLVNPRGIVTDSVRGNLYW TDWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQ PSRRKALEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for Tem11gi|128199|sp|P14543|NIDO_HUMAN sequence: 1190 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MLASSSRIRA AWTRALLLPL LLAGPVGCLS RQELFPFGPG QGDLELEDGD DFVSPALELS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 GALRFYDRSD IDAVYVTTNG IIATSEPPAK ESHPGLFPPT FGAVAPFLAD LDTTDGLGKV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 YYREDLSPSI TQRAAECVHR GFPEISFQPS SAVVVTWESV APYQGPSRDP DQKGKRNTFQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 AVLASSDSSS YAIFLYPEDG LQFHTTFSKK ENNQVPAVVA FSQGSVGFLW KSNGAYNIFA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 NDRESIENLA KSSNSGQQGV WVFEIGSPAT TNGVVPADVI LGTEDGAEYD DEDEDYDLAT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 TRLGLEDVGT TPFSYKALRR GGADTYSVPS VLSPRRAATE RPLGPPTERT RSFQLAVETF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 HQQHPQVIDV DEVEETGVVF SYNTDSRQTC ANNRHQCSVH AECRDYATGF CCSCVAGYTG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 NGRQCVAEGS PQRVNGKVKG RIFVGSSQVP IVFENTDLHS YVVMNHGRSY TAISTIPETV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 GYSLLPLAPV GGIIGWMFAV EQDGFKNGFS ITGGEFTRQA EVTFVGHPGN LVIKQRFSGI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 DEHGHLTIDT ELEGRVPQIP FGSSVHIEPY TELYHYSTSV ITSSSTREYT VTEPERDGAS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 PSRIYTYQWR QTITFQECVH DDSRPALPST QQLSVDSVFV LYNQEEKILR YAFSNSIGPV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 REGSPDALQN PCYIGTHGCD TNAACRPGPR TQFTCECSIG FRGDGRTCYD IDECSEQPSV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 CGSHTICNNH PGTFRCECVE GYQFSDEGTC VAVVDQRPIN YCETGLHNCD IPQRAQCIYT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 GGSSYTCSCL PGFSGDGQAC QDVDECQPSR CHPDAFCYNT PGSFTCQCKP GYQGDGFRCV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 PGEVEKTRCQ HEREHILGAA GATDPQRPIP PGLFVPECDA HGHYAPTQCH GSTGYCWCVD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 RDGREVEGTR TRPGMTPPCL STVAPPIHQG PAVPTAVIPL PPGTHLLFAQ TGKIERLPLE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 GNTMRKTEAK AFLHVPAKVI IGLAFDCVDK MVYWTDITEP SIGRASLHGG EPTTIIRQDL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 GSPEGIAVDH LGRNIFWTDS NLDRIEVAKL DGTQRRVLFE TDLVNPRGIV TDSVRGNLYW ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1140 TDWNRDNPKI ETSYMDGTNR RILVQDDLGL PNGLHFDAFS SQLCWVDAGT NRAECLNPSQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | PSRRKALEGL QYPFAVTSYG KNLYFTDWKM NSVVALDLAI SKETDAFQPH ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem11.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem11.___inter___ (1 sequences) MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGD DFVSPALELSGALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPT FGAVAPFLADLDTTDGLGKVYYREDLSPSITQRAAECVHRGFPEISFQPS SAVVVTWESVAPYQGPSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDG LQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFANDRESIENLA KSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERT RSFQLAVETFHQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVH AECRDYATGFCCSCVAGYTGNGRQCVAEGSPQRVNGKVKGRIFVGSSQVP IVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPVGGIIGWMFAV EQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGIDEHGHLTIDT ELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILR YAFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIG FRGDGRTCYDIDECSEQPSVCGSHTICNNHPGTFRCECVEGYQFSDEGTC VAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQAC QDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCVPGEVEKTRCQ HEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQ TGKIERLPLEGNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEP SIGRASLHGGEPTTIIRQDLGSPEGIAVDHLGRNIFWTDSNLDRIEVAKL DGTQRRVLFETDLVNPRGIVTDSVRGNLYWTDWNRDNPKIETSYMDGTNR RILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQPSRRKALEGL QYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 11 31 1.122 Certain 2 91 111 1.174 Certain 3 213 233 0.927 Putative 4 480 500 1.798 Certain 5 775 795 1.105 Certain 6 931 951 1.163 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 10 59 101 246 274 135 296 K+R profile 4.00 + + + 4.00 + + CYT-EXT prof - 1.09 0.06 -0.29 - 0.87 -0.50 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.82 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 3.0000 -> Orientation: N-in CYT-EXT difference: 0.49 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 5 6 Loop length 10 59 368 274 135 296 K+R profile 4.00 + + 4.00 + + CYT-EXT prof - 1.22 -0.50 - 0.06 -0.29 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 3.0000 -> Orientation: N-in CYT-EXT difference: 0.95 -> Orientation: N-out ---------------------------------------------------------------------- "tem11" 1247 11 31 #t 1.12187 91 111 #t 1.17396 213 233 #f 0.927083 480 500 #t 1.79792 775 795 #t 1.10521 931 951 #t 1.1625 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem11.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem11.___inter___ (1 sequences) MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGD DFVSPALELSGALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPT FGAVAPFLADLDTTDGLGKVYYREDLSPSITQRAAECVHRGFPEISFQPS SAVVVTWESVAPYQGPSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDG LQFHTTFSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFANDRESIENLA KSSNSGQQGVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLAT TRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSPRRAATERPLGPPTERT RSFQLAVETFHQQHPQVIDVDEVEETGVVFSYNTDSRQTCANNRHQCSVH AECRDYATGFCCSCVAGYTGNGRQCVAEGSPQRVNGKVKGRIFVGSSQVP IVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPVGGIIGWMFAV EQDGFKNGFSITGGEFTRQAEVTFVGHPGNLVIKQRFSGIDEHGHLTIDT ELEGRVPQIPFGSSVHIEPYTELYHYSTSVITSSSTREYTVTEPERDGAS PSRIYTYQWRQTITFQECVHDDSRPALPSTQQLSVDSVFVLYNQEEKILR YAFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIG FRGDGRTCYDIDECSEQPSVCGSHTICNNHPGTFRCECVEGYQFSDEGTC VAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQAC QDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCVPGEVEKTRCQ HEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVD RDGREVEGTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQ TGKIERLPLEGNTMRKTEAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEP SIGRASLHGGEPTTIIRQDLGSPEGIAVDHLGRNIFWTDSNLDRIEVAKL DGTQRRVLFETDLVNPRGIVTDSVRGNLYWTDWNRDNPKIETSYMDGTNR RILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQPSRRKALEGL QYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITT ALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 11 31 1.122 Certain 2 91 111 1.174 Certain 3 213 233 0.927 Putative 4 480 500 1.798 Certain 5 775 795 1.105 Certain 6 931 951 1.163 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 10 59 101 246 274 135 296 K+R profile 3.00 + + + 4.00 + + CYT-EXT prof - 1.09 0.06 -0.29 - 0.87 -0.50 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 0.82 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 3.0000 -> Orientation: N-in CYT-EXT difference: 0.49 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 5 6 Loop length 10 59 368 274 135 296 K+R profile 3.00 + + 4.00 + + CYT-EXT prof - 1.22 -0.50 - 0.06 -0.29 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 3.0000 -> Orientation: N-in CYT-EXT difference: 0.95 -> Orientation: N-out ---------------------------------------------------------------------- "tem11" 1247 11 31 #t 1.12187 91 111 #t 1.17396 213 233 #f 0.927083 480 500 #t 1.79792 775 795 #t 1.10521 931 951 #t 1.1625 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem11.___saps___ SAPS. Version of April 11, 1996. Date run: Thu Jan 18 20:21:05 2001 File: /people/maria/tem11.___saps___ ID Tem11gi|128199|sp|P14543|NIDO_HUMAN DE NIDOGEN PRECURSOR (ENTACTIN) number of residues: 1247; molecular weight: 136.5 kdal 1 MLASSSRIRA AWTRALLLPL LLAGPVGCLS RQELFPFGPG QGDLELEDGD DFVSPALELS 61 GALRFYDRSD IDAVYVTTNG IIATSEPPAK ESHPGLFPPT FGAVAPFLAD LDTTDGLGKV 121 YYREDLSPSI TQRAAECVHR GFPEISFQPS SAVVVTWESV APYQGPSRDP DQKGKRNTFQ 181 AVLASSDSSS YAIFLYPEDG LQFHTTFSKK ENNQVPAVVA FSQGSVGFLW KSNGAYNIFA 241 NDRESIENLA KSSNSGQQGV WVFEIGSPAT TNGVVPADVI LGTEDGAEYD DEDEDYDLAT 301 TRLGLEDVGT TPFSYKALRR GGADTYSVPS VLSPRRAATE RPLGPPTERT RSFQLAVETF 361 HQQHPQVIDV DEVEETGVVF SYNTDSRQTC ANNRHQCSVH AECRDYATGF CCSCVAGYTG 421 NGRQCVAEGS PQRVNGKVKG RIFVGSSQVP IVFENTDLHS YVVMNHGRSY TAISTIPETV 481 GYSLLPLAPV GGIIGWMFAV EQDGFKNGFS ITGGEFTRQA EVTFVGHPGN LVIKQRFSGI 541 DEHGHLTIDT ELEGRVPQIP FGSSVHIEPY TELYHYSTSV ITSSSTREYT VTEPERDGAS 601 PSRIYTYQWR QTITFQECVH DDSRPALPST QQLSVDSVFV LYNQEEKILR YAFSNSIGPV 661 REGSPDALQN PCYIGTHGCD TNAACRPGPR TQFTCECSIG FRGDGRTCYD IDECSEQPSV 721 CGSHTICNNH PGTFRCECVE GYQFSDEGTC VAVVDQRPIN YCETGLHNCD IPQRAQCIYT 781 GGSSYTCSCL PGFSGDGQAC QDVDECQPSR CHPDAFCYNT PGSFTCQCKP GYQGDGFRCV 841 PGEVEKTRCQ HEREHILGAA GATDPQRPIP PGLFVPECDA HGHYAPTQCH GSTGYCWCVD 901 RDGREVEGTR TRPGMTPPCL STVAPPIHQG PAVPTAVIPL PPGTHLLFAQ TGKIERLPLE 961 GNTMRKTEAK AFLHVPAKVI IGLAFDCVDK MVYWTDITEP SIGRASLHGG EPTTIIRQDL 1021 GSPEGIAVDH LGRNIFWTDS NLDRIEVAKL DGTQRRVLFE TDLVNPRGIV TDSVRGNLYW 1081 TDWNRDNPKI ETSYMDGTNR RILVQDDLGL PNGLHFDAFS SQLCWVDAGT NRAECLNPSQ 1141 PSRRKALEGL QYPFAVTSYG KNLYFTDWKM NSVVALDLAI SKETDAFQPH KQTRLYGITT 1201 ALSQCPQGHN YCSVNNGGCT HLCLATPGSR TCRCPDNTLG VDCIERK -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 79( 6.3%); C : 50( 4.0%); D : 76( 6.1%); E : 73( 5.9%); F : 53( 4.3%) G :113( 9.1%); H : 34( 2.7%); I : 56( 4.5%); K : 29( 2.3%); L : 81( 6.5%) M- : 8( 0.6%); N : 45( 3.6%); P : 87( 7.0%); Q : 58( 4.7%); R : 73( 5.9%) S : 93( 7.5%); T : 96( 7.7%); V : 86( 6.9%); W : 13( 1.0%); Y : 44( 3.5%) KR : 102 ( 8.2%); ED : 149 ( 11.9%); AGP : 279 ( 22.4%); KRED : 251 ( 20.1%); KR-ED : -47 ( -3.8%); FIKMNY : 235 ( 18.8%); LVIFM : 284 ( 22.8%); ST : 189 ( 15.2%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 000000+0+0 000+000000 0000000000 +0-0000000 00-0-0--0- -000000-00 61 000+00-+0- 0-00000000 00000-000+ -000000000 000000000- 0-00-000+0 121 00+--00000 00+00-000+ 000-000000 0000000-00 0000000+-0 -0+0++0000 181 000000-000 0000000--0 00000000++ -000000000 0000000000 +000000000 241 0-+-00-000 +000000000 000-000000 0000000-00 000--00-0- -----0-000 301 0+000--000 00000+00++ 000-000000 0000++000- +000000-+0 +000000-00 361 00000000-0 --0--00000 0000-0+000 000+000000 0-0+-00000 0000000000 421 00+0000-00 00+000+0+0 +000000000 000-00-000 0000000+00 0000000-00 481 0000000000 0000000000 -0-00+0000 0000-00+00 -000000000 000+0+0000 541 --000000-0 -0-0+00000 0000000-00 0-00000000 000000+-00 00-0-+-000 601 00+000000+ 000000-000 --0+000000 00000-0000 0000--+00+ 0000000000 661 +-000-0000 000000000- 00000+000+ 00000-0000 0+0-0+000- 0--00-0000 721 0000000000 0000+0-00- 00000--000 0000-0+000 00-000000- 000+000000 781 0000000000 00000-0000 0-0--0000+ 000-000000 00000000+0 0000-00+00 841 00-0-+0+00 0-+-000000 000-00+000 000000-0-0 0000000000 000000000- 901 +-0+-0-00+ 0+00000000 0000000000 0000000000 0000000000 00+0-+000- 961 0000++0-0+ 0000000+00 00000-00-+ 00000-00-0 000+000000 -00000+0-0 1021 000-0000-0 00+00000-0 00-+0-00+0 -000++000- 0-0000+000 0-00+00000 1081 0-00+-00+0 -0000-000+ +0000--000 000000-000 000000-000 0+0-000000 1141 00+++00-00 0000000000 +00000-0+0 000000-000 0+-0-00000 +00+000000 1201 0000000000 0000000000 000000000+ 00+00-0000 0-00-++ A. CHARGE CLUSTERS. Positive charge clusters (cmin = 9/30 or 11/45 or 14/60): none Negative charge clusters (cmin = 11/30 or 15/45 or 18/60): 1) From 278 to 307: DVILGTEDGAEYDDEDEDYDLATTRLGLED -00000--00-0------0-0000+000-- quartile: 1; size: 30, +count: 1, -count: 13, 0count: 16; t-value: 4.74 * L: 4 (13.3%); G: 3 (10.0%); E: 5 (16.7%); T: 3 (10.0%); D: 8 (26.7%); Mixed charge clusters (cmin = 15/30 or 21/45 or 26/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. ______________________________________ High scoring negative charge segments: score= 2.00 frequency= 0.119 ( ED ) score= 0.00 frequency= 0.000 ( BZX ) score= -1.00 frequency= 0.799 ( LAGSVTIPNFQYHMCW ) score= -2.00 frequency= 0.082 ( KR ) Expected score/letter: -0.723 M_0.01= 12.54; M_0.05= 10.60 1) From 278 to 297: length= 20, score=13.00 ** 278 DVILGTEDGA EYDDEDEDYD G: 2(10.0%); E: 4(20.0%); D: 7(35.0%); Y: 2(10.0%); There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 45 | 9 | 11 | 13 | 11 | 13 | 16 | 7 | 10 | lmin1 6 | 7 | 9 | 54 | 11 | 13 | 16 | 14 | 16 | 20 | 9 | 12 | lmin2 7 | 8 | 10 | 60 | 13 | 15 | 18 | 16 | 18 | 22 | 10 | 13 | (Significance level: 0.010000; Minimal displayed length: 6) (-) 6(0,0,0); at 290- 295: DDEDED (1. quartile) ------ Run count statistics: + runs >= 3: 1, at 1143; - runs >= 4: 1, at 290; * runs >= 5: 1, at 290; 0 runs >= 30: 2, at 913; 1195; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -3.764 M_0.01= 57.59; M_0.05= 48.42; M_0.30= 37.50 1) From 480 to 500: length= 21, score=43.00 480 VGYSLLPLAP VGGIIGWMFA V L: 3(14.3%); G: 4(19.0%); V: 3(14.3%); 2. SPACINGS OF C. H2N-27-C-108-C-252-C-6-C-5-C-7-C-C-1-C-10-C-192-C-53-C-6-C-5-C-9-C-1-C-10-C-5-C-6-C-5-C-8-C-1-C-11-C-11-C-6-C-7-C-9-C-1-C-10-C-5-C-4-C-5-C-8-C-1-C-10-C-9-C-28-C-10-C-6-C-1-C-20-C-67-C-136-C-10-C-69-C-6-C-6-C-3-C-8-C-1-C-8-C-4-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-27-C-64-H-43-C-1-H-64-H-156-H-2-H-25-C-4-H-1-C-2-H-2-C-7-C-C-1-C-10-C-33-H-6-H-60-H-15-H-1-H-20-H-8-H-42-C-1-H-51-C-4-H-1-C-5-C-9-C-1-C-10-C-5-C-6-C-2-H-2-C-2-H-5-C-1-C-11-C-11-C-4-H-1-C-7-C-9-C-1-C-10-C-5-C-4-C-H-4-C-8-C-1-C-10-C-9-C-1-H-3-H-22-C-2-H-1-H-5-C-H-5-C-1-C-20-C-8-H-16-H-28-H-12-C-20-H-21-H-84-H-8-C-10-C-54-H-14-C-3-H-2-C-6-C-1-H-1-C-8-C-1-C-8-C-4-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 Aligned matching blocks: [1045-1046]-( 4)-[1051-1063]-( 4)-[1068-1069] [1090-1091]-( 4)-[1096-1108]-( 4)-[1113-1114] [1051-1063] -son++iii-o-i [1096-1108] -son++iiin--i -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 72 (Expected range: 45-- 94) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 23 (6-10) 10 (11-20) 18 (>=21) 22 3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 16/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 25 (Expected range: 8-- 38) 9 +plets (f+: 8.2%), 16 -plets (f-: 11.9%) Total number of charge altplets: 23 (Critical number: 41) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 5 (6-10) 1 (11-20) 3 (>=21) 17 3. Long charge multiplets (>= 5; Letter/Length/Position): -/6/290 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 16- 23 2 L. 4 4 0 291- 298 2 D. 4 4 0 578- 593 4 T... 4 4 0 918- 949 8 P....... 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 8- 61 9 i.0....0. 6 6 /0/./2/././././2/./ 290- 295 1 - 6 6 0 516- 563 8 i..00.00 6 6 /0/././2/2/./2/2/ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 35- 37 (1.) F( 2)F 54 of 54 0.0082 large minimal spacing 137- 390 (1.) C( 253)C 1 of 51 0.0005 large 1. maximal spacing 425- 618 (2.) C( 193)C 2 of 51 0.0000 large 2. maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem11 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- ldl_recept_b Low-density lipoprotein receptor repe 134.7 1.7e-36 3 thyroglobulin_1 Thyroglobulin type-1 repeat 121.6 1.4e-32 1 EGF EGF-like domain 110.1 4.2e-29 6 FGF Fibroblast growth factor 1.4 28 1 NHL NHL repeat 1.1 38 1 IGFBP Insulin-like growth factor binding pr 1.0 58 1 Asn_synthase Asparagine synthase 0.8 27 1 Transpo_mutator Transposase, Mutator family -1.0 87 1 UL42 DNA polymerase processivity factor (U -1.0 71 1 TNFR_c6 TNFR/NGFR cysteine-rich region -1.6 12 1 Androgen_recep Androgen receptor -1.7 36 1 flu_virus_nuc Influenza virus nucleoprotein -2.1 64 1 Arthro_defensin Arthropod defensin -6.6 87 1 metalthio Metallothionein -8.9 5 1 EB EB module -9.9 1.9 1 ldl_recept_a Low-density lipoprotein receptor doma -10.7 10 1 sushi Sushi domain (SCR repeat) -11.6 71 1 zf-CXXC CXXC zinc finger -16.6 69 1 laminin_EGF Laminin EGF-like (Domains III and V) -21.5 38 1 TIL Trypsin Inhibitor like cysteine rich -27.0 91 1 Transposase_1 Transposase -44.7 76 1 Keratin_B2 Keratin, high sulfur B2 protein -93.5 41 1 DUF39 Domain of unknown function DUF39 -189.3 91 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Transpo_mutator 1/1 28 53 .. 1 28 [. -1.0 87 Androgen_recep 1/1 116 135 .. 1 17 [. -1.7 36 Asn_synthase 1/1 113 143 .. 386 417 .. 0.8 27 flu_virus_nuc 1/1 250 298 .. 451 498 .] -2.1 64 DUF39 1/1 48 319 .. 1 336 [] -189.3 91 Arthro_defensin 1/1 387 415 .. 1 36 [] -6.6 87 EGF 1/6 390 425 .. 1 45 [] 23.7 0.0044 UL42 1/1 598 633 .. 122 161 .. -1.0 71 IGFBP 1/1 672 705 .. 1 37 [. 1.0 58 EGF 2/6 672 708 .. 1 45 [] 28.9 0.00011 EGF 3/6 714 750 .. 1 45 [] 26.3 0.0007 ldl_recept_a 1/1 719 752 .. 1 43 [] -10.7 10 TIL 1/1 685 762 .. 1 67 [] -27.0 91 EGF 4/6 769 800 .. 1 45 [] 23.7 0.0043 zf-CXXC 1/1 763 811 .. 1 48 [] -16.6 69 Keratin_B2 1/1 654 814 .. 1 177 [] -93.5 41 sushi 1/1 762 817 .. 1 62 [] -11.6 71 EB 1/1 780 839 .. 1 56 [] -9.9 1.9 EGF 5/6 806 839 .. 1 45 [] 31.1 2.5e-05 metalthio 1/1 772 841 .. 1 67 [] -8.9 5 laminin_EGF 1/1 806 849 .. 1 59 [] -21.5 38 TNFR_c6 1/1 828 853 .. 1 42 [] -1.6 12 FGF 1/1 901 916 .. 121 138 .. 1.4 28 thyroglobulin_1 1/1 849 919 .. 1 81 [] 121.6 1.4e-32 ldl_recept_b 1/3 990 1031 .. 1 47 [] 48.7 1.3e-10 NHL 1/1 1020 1048 .. 1 29 [] 1.1 38 ldl_recept_b 2/3 1033 1074 .. 1 47 [] 50.0 5.1e-11 Transposase_1 1/1 1029 1101 .. 1 84 [] -44.7 76 ldl_recept_b 3/3 1076 1119 .. 1 47 [] 37.6 2.9e-07 EGF 6/6 1212 1243 .. 1 45 [] 16.6 0.6 Alignments of top-scoring domains: Transpo_mutator: domain 1 of 1, from 28 to 53: score -1.0, E = 87 *->mtedellallealdalldneldgiddll<-* ++++l+++++++++l el+++dd++ Tem11gi|12 28 CLSRQELFPFGPGQGDL--ELEDGDDFV 53 Androgen_recep: domain 1 of 1, from 116 to 135: score -1.7, E = 36 *->GLGRvYPR...PPsKtYRGA<-* GLG+vY R++ Ps t R A Tem11gi|12 116 GLGKVYYRedlSPSITQRAA 135 Asn_synthase: domain 1 of 1, from 113 to 143: score 0.8, E = 27 *->ttytKeayyyRkifeelfPsekvallvkkgfP<-* tt + +++yyR+ + + + + ++a+ v++gfP Tem11gi|12 113 TTDGLGKVYYREDLSPSITQ-RAAECVHRGFP 143 flu_virus_nuc: domain 1 of 1, from 250 to 298: score -2.1, E = 64 *->AkpedvsFqGRGvFELsDEKAtnPivPs.FDmsnEGsYFFGDnAEEy Ak+ + qG vFE+ tn +vP++ + E + D E+y Tem11gi|12 250 AKSSNSGQQGVWVFEIGSPATTNGVVPAdVILGTEDGAEYDDEDEDY 296 Dn<-* D Tem11gi|12 297 DL 298 DUF39: domain 1 of 1, from 48 to 319: score -189.3, E = 91 *->eGeavVvTAdEmkkiVrelGeekaAdeVDVVTTgTfGaMsSsgAvin +G+ V A E+ r + +d V V T g Tem11gi|12 48 DGDDFVSPALELSGALRF-YDRSDIDAVYVTTNGI------------ 81 lghsdPpsfiKmervyLNgieAYaGiCPNEfLgaVDayiGaTqpsedpdv + s+Pp K e +G P f gaV ++ + +d Tem11gi|12 82 IATSEPP--AK---------ESHPGLFPPTF-GAVAPFLAD---LDTTD- 115 gdnPneGdGvleyGGghvieDL...vkGkeVElrAtsyGTDCYprkeve. G G + y eDL++++ + E + + +p ++ Tem11gi|12 116 ------GLGKVYYR-----EDLspsITQRAAECVHRG-----FPEISFQp 149 ...teItlDdinqAvmvnp.........RnaYqnYtAavNsseellyTym ++ +++t + + A +p+++++++++Rn +q A+ ss+ y ++ Tem11gi|12 150 ssaVVVTWESV--APYQGPsrdpdqkgkRNTFQ---AVLASSDSSSYAIF 194 gtLlpPenyGnvsfSGcGelnPLeNDviPetksfetigiGTrifL...nG L p ++ f + + eN +P + +G fL ++nG Tem11gi|12 195 --LYP---EDGLQFHTT--FSKKENNQVP---AVVAFSQGSVGFLwksNG 234 AegyilGEGTqhsplqkRlphvPippkgtLMlkGDlKdMdpeYirGAflp A + + + s ++ +L+ + + + G + Tem11gi|12 235 AYNIFAND--RES-----------IE--NLAKSSNSGQ------QGVWVF 263 ryG.peLyvgigvPIPV...lnekvaealavlDeDIelpvvDfgvprRdR G+p +g+ vP V +++ ++ DeD l g Tem11gi|12 264 EIGsPATTNGV-VPADVilgTEDG--AEYDDEDEDYDLATTRLGLEDV-- 308 PvikevtYeelw<-* + +Y+ l+ Tem11gi|12 309 -GTTPFSYKALR 319 Arthro_defensin: domain 1 of 1, from 387 to 415: score -6.6, E = 87 *->gkgcpvNhsaCaaHClakGGrrGGyCnglkavCvCR<-* +++c+ N+ +C H + ++ G C + C+ Tem11gi|12 387 RQTCANNRHQCSVHAECRD-YATGFCCS------CV 415 EGF: domain 1 of 6, from 390 to 425: score 23.7, E = 0.0044 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< Ca n + Cs +++C+++ g+ C C G + ++G++C Tem11gi|12 390 CANNRhQCSVHAECRDYAT-------GFCCSCVAG---YTGNGRQC 425 -* Tem11gi|12 - - UL42: domain 1 of 1, from 598 to 633: score -1.0, E = 71 *->GepPsRlLt....QtvyfardqtEEDGdaGslasGatvqvvaal<-* G++PsR+ t + +Qt++f+ +++ ++s + ++ +l Tem11gi|12 598 GASPSRIYTyqwrQTITFQ--------ECVHDDSRPALPSTQQL 633 IGFBP: domain 1 of 1, from 672 to 705: score 1.0, E = 58 *->CprPcGGpCpaerlarCpPgPpvaPpaecaelvredG<-* C +C+ a+C+PgP ec + r dG Tem11gi|12 672 C-YIGTHGCDTN--AACRPGPRTQFTCECSIGFRGDG 705 EGF: domain 2 of 6, from 672 to 708: score 28.9, E = 0.00011 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C ++++C+ ++ C+++p+ + +tCeC G + ++G++C Tem11gi|12 672 CYIGThGCDTNAACRPGPR------TQFTCECSIG---FRGDGRTC 708 -* Tem11gi|12 - - EGF: domain 3 of 6, from 714 to 750: score 26.3, E = 0.0007 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C++++++C + +C+n pg +++CeC +G y + +C Tem11gi|12 714 CSEQPsVCGSHTICNNHPG-------TFRCECVEG--YQFSDEGTC 750 -* Tem11gi|12 - - ldl_recept_a: domain 1 of 1, from 719 to 752: score -10.7, E = 10 *->stCggpdeFqCgsgrrCIprswvCDGdpDCe...DGSDEslenCaa< s Cg ++ C+++ ++C +C ++ SDE +C+a Tem11gi|12 719 SVCG-SHTI-CNNHP----GTFRC----ECVegyQFSDE--GTCVA 752 -* Tem11gi|12 - - TIL: domain 1 of 1, from 685 to 762: score -27.0, E = 91 *->Cpa.neqyteCgpsCe.......psCsnpdgplettppCegtSpkvP C ++ + +C C + ++++++C ++d+ e + +C++ + Tem11gi|12 685 CRPgPRTQFTCE--CSigfrgdgRTCYDIDECSEQPSVCGS-----H 724 stCk..eg...CvCqpGyVrnndgdkCV......prseC<-* +C +++g+ +C+C +Gy +++ g +CV +++p++ C Tem11gi|12 725 TICNnhPGtfrCECVEGYQFSDEG-TCVavvdqrPINYC 762 EGF: domain 4 of 6, from 769 to 800: score 23.7, E = 0.0043 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+ + ++++C++t+g ++ytC C pG +s++G C Tem11gi|12 769 CDIP----QRAQCIYTGG------SSYTCSCLPG---FSGDGQAC 800 * Tem11gi|12 - - zf-CXXC: domain 1 of 1, from 763 to 811: score -16.6, E = 69 *->kngrkrRrCGvCevCqrpeDC....GkCkaCkDkpKFGGsnrkKQ.. + g + C + qr C ++G+ C p F G +++ Q+ Tem11gi|12 763 ETGLHN-----CDIPQRA-QCiytgGSSYTCSCLPGFSGDGQACQdv 803 .aCkkRrC<-* + C+ rC Tem11gi|12 804 dECQPSRC 811 Keratin_B2: domain 1 of 1, from 654 to 814: score -93.5, E = 41 *->qtScCGfptCStlgtrPsCGsscCQPsCCeSCCQpsCcqp.SC.... ++S G+ ++++ + +Q p++++++ C+++ Tem11gi|12 654 SNSI-GP--VREGS------PDALQN--------PCYIGThGCdtna 683 .CqPtcsqtscCqPtcfqs............sCC.rPsCc..qTSCCqPt C+P ++ C+ c+ ++++++ + + C ++Ps c++ T C Tem11gi|12 684 aCRPGPRTQFTCE--CSIGfrgdgrtcydidECSeQPSVCgsHTICNNHP 731 CcqsssCqtgCgigGsiGyGQeGsSGAvScrirWCRPdCrvegtClPpCC + C +g + + eG Av + RP+ ++e t l C Tem11gi|12 732 GTFRCECVEGYQFS------DEGTCVAVVDQ----RPINYCE-TGLHNCD 770 vv...sCtaPTCCqpvsaQasCCRPsCqPyCgq.sCCRPaCccsvtCtrT ++++ C + +s +s C P + g +++C+ + c++ Tem11gi|12 771 IPqraQCIYT---GGSSYTCS-CLPG---FSGDgQACQDVDECQP----S 809 ccePc<-* +c P Tem11gi|12 810 RCHPD 814 sushi: domain 1 of 1, from 762 to 817: score -11.6, E = 71 *->CppPdieNGrvsss..gtyeypvGdtvtytCneGYrlvG...sssit C + N ++ + + y+ G++ t +C++G++ G+ ++ Tem11gi|12 762 CE-TGLHNCDIPQRaqCIYTG--GSSYTCSCLPGFSGDGqacQDVDE 805 CtedggGgWsppllGelPkC<-* C++ + +p C Tem11gi|12 806 CQPS---RCHPD-----AFC 817 EB: domain 1 of 1, from 780 to 839: score -9.9, E = 1.9 *->CpsgqVevnGeCvk.kvaiGetGClaseQCpgr..wpGSqCidg... +g +++ C+++ + G++ C+ + C+ ++ +p + C++ +++ Tem11gi|12 780 --TGGSSYTCSCLPgFSGDGQA-CQDVDECQPSrcHPDAFCYNTpgs 823 .mCqCpeGftavnGvC<-* +CqC++G+ ++ +C Tem11gi|12 824 fTCQCKPGYQGDGFRC 839 EGF: domain 5 of 6, from 806 to 839: score 31.1, E = 2.5e-05 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+p C+ +++C ntpg ++tC+C+pG ++++G rC Tem11gi|12 806 CQPS-RCHPDAFCYNTPG-------SFTCQCKPG---YQGDGFRC 839 * Tem11gi|12 - - metalthio: domain 1 of 1, from 772 to 841: score -8.9, E = 5 *->mDPqnCsCatgg....sCtCgtsCkCknCkCtsCkKs.ccsCcP.ag Pq +C+ +g+++ +C+C + ++ + C++ ++ + C P+a Tem11gi|12 772 --PQRAQCIYTGgssyTCSCLPGFSGDGQACQDVDECqPSRCHPdAF 816 C..skCaqgCvCkgggaasetskCsCCa<-* C ++ + +C Ck g + + +C + Tem11gi|12 817 CynTPGSFTCQCKP-GYQGD--GFRCVP 841 laminin_EGF: domain 1 of 1, from 806 to 849: score -21.5, E = 38 *->CdCnphGslsddtCdsddelf.geetGqClkCkpnvtGrrCdrCkpG C+ ++C++d+ ++++ + C+ Ckp+++G+ rC pG Tem11gi|12 806 CQ----P----SRCHPDAFCYnTPGSFTCQ-CKPGYQGDG-FRCVPG 842 yyglpsgdpgqgC<-* C Tem11gi|12 843 EV-----EKT-RC 849 TNFR_c6: domain 1 of 1, from 828 to 853: score -1.6, E = 12 *->CeegvtYtdenhleqClsCsrCepemGqvlvspCtatqnTvC<-* C++g Y+ + ++C p G+v++++C++ + Tem11gi|12 828 CKPG--YQGDG--FRC------VP--GEVEKTRCQHER---- 853 FGF: domain 1 of 1, from 901 to 916: score 1.4, E = 28 *->kkGrpkrGsisKTrphqK<-* ++Gr ++G+ +Trp+++ Tem11gi|12 901 RDGREVEGT--RTRPGMT 916 thyroglobulin_1: domain 1 of 1, from 849 to 919: score 121.6, E = 1.4e-32 *->CqrelervleslaaeqlkteehlesprglYvPnCdenGfYkpvQCdp Cq+e+e++l++++a++++ ++ p+gl+vP+Cd++G+Y+p QC++ Tem11gi|12 849 CQHEREHILGAAGATDPQ----RPIPPGLFVPECDAHGHYAPTQCHG 891 SlggqrGeCWCVDaetGkelpGtrdvgGetPprC<-* ++G+CWCVD++ G+e++Gtr++ G+t p C Tem11gi|12 892 ----STGYCWCVDRD-GREVEGTRTRPGMT-PPC 919 ldl_recept_b: domain 1 of 3, from 990 to 1031: score 48.7, E = 1.3e-10 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi + +YWtD ++ si++a l+G++ t++++d l +P+gIavD+ Tem11gi|12 990 KMVYWTDITEP---SIGRASLHGGEPTTIIRQD--LGSPEGIAVDHL 1031 <-* Tem11gi|12 - - NHL: domain 1 of 1, from 1020 to 1048: score 1.1, E = 38 *->fdrPrGvavdpsdGq.ivVaDsenhriqvF<-* + +P+G+avd ++G++i +Ds+ ri v Tem11gi|12 1020 LGSPEGIAVD-HLGRnIFWTDSNLDRIEVA 1048 ldl_recept_b: domain 2 of 3, from 1033 to 1074: score 50.0, E = 5.1e-11 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi ++++WtDs+l+ +i+va+l+G+ rr+lf d l +P+gI+ D + Tem11gi|12 1033 RNIFWTDSNLD---RIEVAKLDGTQRRVLFETD--LVNPRGIVTDSV 1074 <-* Tem11gi|12 - - Transposase_1: domain 1 of 1, from 1029 to 1101: score -44.7, E = 76 *->yyYdpENpkRkksWvdpGepapstpKpnlhgkKvMLcVWWDqkGvih d R+ W d + + ++K + +++ v+ Tem11gi|12 1029 ---D--HLGRNIFWTDSNLDRIEVAKLDGTQRRVL----------FE 1060 yELLppGeTIn....aelYcqQLdrLkqalaeKRPelanRk<-* +L++p+ ++++ +++lY + +r + +++ + nR+ Tem11gi|12 1061 TDLVNPRGIVTdsvrGNLYWTDWNRDNPKIETSYMDGTNRR 1101 ldl_recept_b: domain 3 of 3, from 1076 to 1119: score 37.6, E = 2.9e-07 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi g+lYWtD++ + ++i+ ++G++rr+l+++d l Png+ D + Tem11gi|12 1076 GNLYWTDWNRDN-PKIETSYMDGTNRRILVQDD--LGLPNGLHFDAF 1119 <-* Tem11gi|12 - - EGF: domain 6 of 6, from 1212 to 1243: score 16.6, E = 0.6 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C+ nn++C + C+ tpg + tC+Cp G +C Tem11gi|12 1212 CSVNNgGCTH--LCLATPG-------SRTCRCPDN-----TLGVDC 1243 -* Tem11gi|12 - - // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem11 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- ldl_recept_b Low-density lipoprotein receptor repe 130.4 3.3e-35 4 thyroglobulin_1 Thyroglobulin type-1 repeat 119.8 5.2e-32 1 EGF EGF-like domain 99.7 1.9e-26 6 NHL NHL repeat 6.0 7.6 2 TNFR_c6 TNFR/NGFR cysteine-rich region 2.1 43 1 FGF Fibroblast growth factor 1.4 28 1 G_Adapt_CT Gamma-adaptin, C-terminus 1.3 50 1 dsrm Double-stranded RNA binding motif 1.2 84 1 IGFBP Insulin-like growth factor binding pr 1.0 58 1 Ribosomal_S7 Ribosomal protein S7p/S5e 0.9 21 1 DUF15 Domain of unknown function DUF15 0.9 36 1 Asn_synthase Asparagine synthase 0.8 27 1 ANATO Anaphylotoxin-like domain 0.5 14 1 RNA_helicase RNA helicase 0.4 42 1 Arthro_defensin Arthropod defensin 0.3 66 1 Transpo_mutator Transposase, Mutator family -1.0 87 1 UL42 DNA polymerase processivity factor (U -1.0 71 1 Androgen_recep Androgen receptor -1.7 36 1 flu_virus_nuc Influenza virus nucleoprotein -2.1 64 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Transpo_mutator 1/1 28 53 .. 1 28 [. -1.0 87 Androgen_recep 1/1 116 135 .. 1 17 [. -1.7 36 Asn_synthase 1/1 113 143 .. 386 417 .. 0.8 27 G_Adapt_CT 1/1 184 221 .. 1 47 [. 1.3 50 DUF15 1/1 211 229 .. 83 103 .. 0.9 36 RNA_helicase 1/1 226 251 .. 1 26 [. 0.4 42 flu_virus_nuc 1/1 250 298 .. 451 498 .] -2.1 64 Ribosomal_S7 1/1 316 336 .. 79 99 .. 0.9 21 Arthro_defensin 1/1 387 400 .. 1 14 [. 0.3 66 EGF 1/6 390 425 .. 1 45 [] 21.9 0.00017 UL42 1/1 598 633 .. 122 161 .. -1.0 71 IGFBP 1/1 672 705 .. 1 37 [. 1.0 58 dsrm 1/1 686 707 .. 28 58 .. 1.2 84 EGF 2/6 672 708 .. 1 45 [] 27.2 5.7e-06 EGF 3/6 714 750 .. 1 45 [] 24.6 3.2e-05 EGF 4/6 769 800 .. 1 45 [] 22.0 0.00017 ANATO 1/1 784 803 .. 13 32 .. 0.5 14 EGF 5/6 806 839 .. 1 45 [] 29.4 1.3e-06 TNFR_c6 1/1 838 853 .. 21 38 .. 2.1 43 FGF 1/1 901 916 .. 121 138 .. 1.4 28 thyroglobulin_1 1/1 849 919 .. 1 81 [] 119.8 5.2e-32 ldl_recept_b 1/4 990 1031 .. 1 47 [] 46.8 1.6e-11 NHL 1/2 1020 1033 .. 1 15 [. 4.5 20 ldl_recept_b 2/4 1033 1074 .. 1 47 [] 48.2 6.6e-12 NHL 2/2 1063 1087 .. 1 24 [. 1.6 1.4e+02 ldl_recept_b 3/4 1076 1119 .. 1 47 [] 35.6 2.2e-08 ldl_recept_b 4/4 1161 1169 .. 1 9 [. 2.5 45 EGF 6/6 1212 1243 .. 1 45 [] 14.9 0.016 Alignments of top-scoring domains: Transpo_mutator: domain 1 of 1, from 28 to 53: score -1.0, E = 87 *->mtedellallealdalldneldgiddll<-* ++++l+++++++++l el+++dd++ Tem11gi|12 28 CLSRQELFPFGPGQGDL--ELEDGDDFV 53 Androgen_recep: domain 1 of 1, from 116 to 135: score -1.7, E = 36 *->GLGRvYPR...PPsKtYRGA<-* GLG+vY R++ Ps t R A Tem11gi|12 116 GLGKVYYRedlSPSITQRAA 135 Asn_synthase: domain 1 of 1, from 113 to 143: score 0.8, E = 27 *->ttytKeayyyRkifeelfPsekvallvkkgfP<-* tt + +++yyR+ + + + + ++a+ v++gfP Tem11gi|12 113 TTDGLGKVYYREDLSPSITQ-RAAECVHRGFP 143 G_Adapt_CT: domain 1 of 1, from 184 to 221: score 1.3, E = 50 *->PsvspaaeknaaipsttvyesngLkIeftftregppespevlvitat ++s+++ + + +y +gL++ +tf++ e+++v ++a+ Tem11gi|12 184 -ASSDSS-----SYAIFLYPEDGLQFHTTFSK---KENNQVPAVVAF 221 <-* Tem11gi|12 - - DUF15: domain 1 of 1, from 211 to 229: score 0.9, E = 36 *->drerddiPilgiNtGGhLGFL<-* ++ ++++ +++++ G ++GFL Tem11gi|12 211 EN-NQVPAVVAFSQG-SVGFL 229 RNA_helicase: domain 1 of 1, from 226 to 251: score 0.4, E = 42 *->lgwlkkfnegceaakglewsaltifk<-* g+l+k n++++ + ++ s+ +++k Tem11gi|12 226 VGFLWKSNGAYNIFANDRESIENLAK 251 flu_virus_nuc: domain 1 of 1, from 250 to 298: score -2.1, E = 64 *->AkpedvsFqGRGvFELsDEKAtnPivPs.FDmsnEGsYFFGDnAEEy Ak+ + qG vFE+ tn +vP++ + E + D E+y Tem11gi|12 250 AKSSNSGQQGVWVFEIGSPATTNGVVPAdVILGTEDGAEYDDEDEDY 296 Dn<-* D Tem11gi|12 297 DL 298 Ribosomal_S7: domain 1 of 1, from 316 to 336: score 0.9, E = 21 *->rsrRvGGGttyqVPveVrpdR<-* ++ R+GG+ ty VP+ +p+R Tem11gi|12 316 KALRRGGADTYSVPSVLSPRR 336 Arthro_defensin: domain 1 of 1, from 387 to 400: score 0.3, E = 66 *->gkgcpvNhsaCaaH<-* +++c+ N+ +C H Tem11gi|12 387 RQTCANNRHQCSVH 400 EGF: domain 1 of 6, from 390 to 425: score 21.9, E = 0.00017 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< Ca n + Cs +++C+++ g+ C C G + ++G++C Tem11gi|12 390 CANNRhQCSVHAECRDYAT-------GFCCSCVAG---YTGNGRQC 425 -* Tem11gi|12 - - UL42: domain 1 of 1, from 598 to 633: score -1.0, E = 71 *->GepPsRlLt....QtvyfardqtEEDGdaGslasGatvqvvaal<-* G++PsR+ t + +Qt++f+ +++ ++s + ++ +l Tem11gi|12 598 GASPSRIYTyqwrQTITFQ--------ECVHDDSRPALPSTQQL 633 IGFBP: domain 1 of 1, from 672 to 705: score 1.0, E = 58 *->CprPcGGpCpaerlarCpPgPpvaPpaecaelvredG<-* C +C+ a+C+PgP ec + r dG Tem11gi|12 672 C-YIGTHGCDTN--AACRPGPRTQFTCECSIGFRGDG 705 dsrm: domain 1 of 1, from 686 to 707: score 1.2, E = 84 *->tPaHaprFtvevkvggktyvrktfgeGsGsS<-* +P+ + +Ft+e+++g ++G G++ Tem11gi|12 686 RPGPRTQFTCECSIG---------FRGDGRT 707 EGF: domain 2 of 6, from 672 to 708: score 27.2, E = 5.7e-06 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C ++++C+ ++ C+++p+ + +tCeC G + ++G++C Tem11gi|12 672 CYIGThGCDTNAACRPGPR------TQFTCECSIG---FRGDGRTC 708 -* Tem11gi|12 - - EGF: domain 3 of 6, from 714 to 750: score 24.6, E = 3.2e-05 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C++++++C + +C+n pg +++CeC +G y + +C Tem11gi|12 714 CSEQPsVCGSHTICNNHPG-------TFRCECVEG--YQFSDEGTC 750 -* Tem11gi|12 - - EGF: domain 4 of 6, from 769 to 800: score 22.0, E = 0.00017 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+ + ++++C++t+g ++ytC C pG +s++G C Tem11gi|12 769 CDIP----QRAQCIYTGG------SSYTCSCLPG---FSGDGQAC 800 * Tem11gi|12 - - ANATO: domain 1 of 1, from 784 to 803: score 0.5, E = 14 *->gqsCeeRaaritdGkeCrka<-* +++C+ ++ + dG++C+++ Tem11gi|12 784 SYTCSCLPGFSGDGQACQDV 803 EGF: domain 5 of 6, from 806 to 839: score 29.4, E = 1.3e-06 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+p C+ +++C ntpg ++tC+C+pG ++++G rC Tem11gi|12 806 CQPS-RCHPDAFCYNTPG-------SFTCQCKPG---YQGDGFRC 839 * Tem11gi|12 - - TNFR_c6: domain 1 of 1, from 838 to 853: score 2.1, E = 43 *->rCepemGqvlvspCtatq<-* rC p G+v++++C++ + Tem11gi|12 838 RCVP--GEVEKTRCQHER 853 FGF: domain 1 of 1, from 901 to 916: score 1.4, E = 28 *->kkGrpkrGsisKTrphqK<-* ++Gr ++G+ +Trp+++ Tem11gi|12 901 RDGREVEGT--RTRPGMT 916 thyroglobulin_1: domain 1 of 1, from 849 to 919: score 119.8, E = 5.2e-32 *->CqrelervleslaaeqlkteehlesprglYvPnCdenGfYkpvQCdp Cq+e+e++l++++a++++ ++ p+gl+vP+Cd++G+Y+p QC++ Tem11gi|12 849 CQHEREHILGAAGATDPQ----RPIPPGLFVPECDAHGHYAPTQCHG 891 SlggqrGeCWCVDaetGkelpGtrdvgGetPprC<-* ++G+CWCVD++ G+e++Gtr++ G+t p C Tem11gi|12 892 ----STGYCWCVDRD-GREVEGTRTRPGMT-PPC 919 ldl_recept_b: domain 1 of 4, from 990 to 1031: score 46.8, E = 1.6e-11 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi + +YWtD ++ si++a l+G++ t++++d l +P+gIavD+ Tem11gi|12 990 KMVYWTDITEP---SIGRASLHGGEPTTIIRQD--LGSPEGIAVDHL 1031 <-* Tem11gi|12 - - NHL: domain 1 of 2, from 1020 to 1033: score 4.5, E = 20 *->fdrPrGvavdpsdGq<-* + +P+G+avd ++G+ Tem11gi|12 1020 LGSPEGIAVD-HLGR 1033 ldl_recept_b: domain 2 of 4, from 1033 to 1074: score 48.2, E = 6.6e-12 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi ++++WtDs+l+ +i+va+l+G+ rr+lf d l +P+gI+ D + Tem11gi|12 1033 RNIFWTDSNLD---RIEVAKLDGTQRRVLFETD--LVNPRGIVTDSV 1074 <-* Tem11gi|12 - - NHL: domain 2 of 2, from 1063 to 1087: score 1.6, E = 1.4e+02 *->fdrPrGvavdpsdGqivVaD.senh<-* + PrG+++d +G+++ +D ++++ Tem11gi|12 1063 LVNPRGIVTDSVRGNLYWTDwNRDN 1087 ldl_recept_b: domain 3 of 4, from 1076 to 1119: score 35.6, E = 2.2e-08 *->grlYWtDsslrkIasisvadlnGsdrrtlfsededlqhPngIavDpi g+lYWtD++ + ++i+ ++G++rr+l+++d l Png+ D + Tem11gi|12 1076 GNLYWTDWNRDN-PKIETSYMDGTNRRILVQDD--LGLPNGLHFDAF 1119 <-* Tem11gi|12 - - ldl_recept_b: domain 4 of 4, from 1161 to 1169: score 2.5, E = 45 *->grlYWtDss<-* ++lY+tD++ Tem11gi|12 1161 KNLYFTDWK 1169 EGF: domain 6 of 6, from 1212 to 1243: score 14.9, E = 0.016 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C+ nn++C + C+ tpg + tC+Cp G +C Tem11gi|12 1212 CSVNNgGCTH--LCLATPG-------SRTCRCPDN-----TLGVDC 1243 -* Tem11gi|12 - - // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem11 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Thu Jan 18 20:33:01 2001 Sequence file: tem11 ---------------------------------------- Sequence Tem11gi|128199|sp|P14543|NIDO_HUMAN (1247 residues): Matching pattern PS00004 CAMP_PHOSPHO_SITE: 175: KRNT Total matches: 1 Matching pattern PS00005 PKC_PHOSPHO_SITE: 5: SSR 131: TQR 208: SKK 300: TTR 314: SYK 333: SPR 339: TER 347: TER 585: STR 733: TFR 951: TGK 963: TMR 1053: TQR 1073: SVR 1098: TNR 1130: TNR 1142: SRR 1231: TCR Total matches: 18 Matching pattern PS00006 CK2_PHOSPHO_SITE: 30: SRQE 69: SDID 208: SKKE 475: TIPE 512: TGGE 538: SGID 550: TELE 585: STRE 590: TVTE 592: TEPE 614: TFQE 707: TCYD 1040: SNLD 1093: SYMD Total matches: 14 Matching pattern PS00007 TYR_PHOSPHO_SITE: 810: RCHPDAFCY 1144: RKALEGLQY Total matches: 2 Matching pattern PS00008 MYRISTYL: 80: GIIATS 266: GSPATT 273: GVVPAD 282: GTEDGA 321: GGADTY 377: GVVFSY 491: GGIIGW 513: GGEFTR 598: GASPSR 663: GSPDAL 675: GTHGCD 678: GCDTNA 732: GTFRCE 748: GTCVAV 765: GLHNCD 782: GSSYTC 822: GSFTCQ 858: GAAGAT 1009: GGEPTT 1021: GSPEGI 1109: GLPNGL 1129: GTNRAE 1197: GITTAL 1228: GSRTCR Total matches: 24 Matching pattern PS00009 AMIDATION: 173: KGKR Total matches: 1 Matching pattern PS00010 ASX_HYDROXYL: 403: CRDYATGFCCSC 727: CNNHPGTFRCEC 817: CYNTPGSFTCQC Total matches: 3 Matching pattern PS00016 RGD: 702: RGD Total matches: 1 Matching pattern PS00022 EGF_1: 1232: CRCPDNTLGVDC Total matches: 1 Matching pattern PS01186 EGF_2: 412: CSCVAGYTGNGRQC 695: CECSIGFRGDGRTC 736: CECVEGYQFSDEGTC 787: CSCLPGFSGDGQAC 826: CQCKPGYQGDGFRC Total matches: 5 Matching pattern PS01187 EGF_CA: 710: DIDECSEQPSVCGSHTICNNHPGTFRC 802: DVDECQPSRCHPDAFCYNTPGSFTC Total matches: 2 Matching pattern PS00484 THYROGLOBULIN_1: 874: FVPECDAHGHYAPTQCHGSTGYCWCVDRDG Total matches: 1 Total no of hits in this sequence: 73 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1247 residues. Total no of hits in all sequences: 73. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem11 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1247 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1247 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem11 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem11 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) (1247 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value AP2 A plant specific DNA binding domain (Apetala 2 like) 25 0.64 PDZ PDZ domain (A protein-protein interaction domain) 25 0.67 S1 S1 RNA binding domain 23 3.5 SEC7 Sec7 like GDP exchange factor for ARF like GTPases 22 5.3 FYVE Zinc Finger domain involved in PtdIns(3)P binding 22 5.6 VWA Von Willebrand factor A domain 22 6.2 MYND MYND domain (A zinc coordinating Cysteine rich domain) 22 6.6 SET Su(var)3-9, Enhancer of Zeste, trithorax domain (A chrom... 21 8.9 >AP2 A plant specific DNA binding domain (Apetala 2 like) Length = 218 Score = 24.8 bits (53), Expect = 0.64 Identities = 12/45 (26%), Positives = 12/45 (26%), Gaps = 7/45 (15%) Query: 259 GVWVFEIGSPATTNGVVPADVILGTEDGAEYDDEDEDYDLATTRL 303 Sbjct: 45 GKWVSEIREPRKK-----SRIWLGTFPSPE--MAARAHDVAALSI 82 >PDZ PDZ domain (A protein-protein interaction domain) Length = 90 Score = 24.8 bits (54), Expect = 0.67 Identities = 11/29 (37%), Positives = 11/29 (37%), Gaps = 2/29 (6%) Query: 255 SGQQGVWVFEI--GSPATTNGVVPADVIL 281 Sbjct: 31 SAQQGAFVSEVLPKSAAEKAGLKAGDIIT 59 >S1 S1 RNA binding domain Length = 305 Score = 22.6 bits (48), Expect = 3.5 Identities = 13/33 (39%), Positives = 13/33 (39%), Gaps = 1/33 (3%) Query: 546 LTIDTELEGRVPQI-PFGSSVHIEPYTELYHYS 577 Sbjct: 191 IAAGNIYEGKVAKIQPYGVFVEIEGVTGLLHVS 223 Score = 21.4 bits (45), Expect = 7.6 Identities = 15/92 (16%), Positives = 15/92 (16%), Gaps = 16/92 (17%) Query: 166 PSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDGLQFHTTFSKKENNQVPAVVAFSQGS 225 Sbjct: 2 PSSSNSAAFSLDEFAKALDKHDY-------HAEKGQTVHGKICQHANEG--VYVDFGGKS 52 Query: 226 VGFLWKSNGAYNIFANDRESIENLAKSSNSGQ 257 Sbjct: 53 PGFVPVQE-----LGLRP--HAEIEDSFPLDS 77 >SEC7 Sec7 like GDP exchange factor for ARF like GTPases Length = 192 Score = 21.7 bits (46), Expect = 5.3 Identities = 6/18 (33%), Positives = 6/18 (33%) Query: 451 IVFENTDLHSYVVMNHGR 468 Sbjct: 139 IILLNTDMYSPNVKPERK 156 >FYVE Zinc Finger domain involved in PtdIns(3)P binding Length = 99 Score = 21.6 bits (45), Expect = 5.6 Identities = 15/72 (20%), Positives = 15/72 (20%), Gaps = 17/72 (23%) Query: 365 PQVIDVDEVEETGVVFSYNTDS---RQTCANNRHQCSVHAECRDYATGFCCSC------- 414 Sbjct: 25 PEWKTSDCCQKCNQPFFWNLQAMWQRKVVGLRQHH------CRTCGSAVCGSCCDNWTTY 78 Query: 415 -VAGYTGNGRQC 425 Sbjct: 79 PPMGYETKIRIC 90 >VWA Von Willebrand factor A domain Length = 255 Score = 21.8 bits (46), Expect = 6.2 Identities = 11/91 (12%), Positives = 11/91 (12%), Gaps = 9/91 (9%) Query: 207 FSKKENNQVPAVVAFSQGSVGFLWKSNGAYNIFANDRESIENLAKSSNS----GQQGVWV 262 Sbjct: 91 FDQNPISQM-GIIIMRNGLAQLVSQVSGNPQDHIDALKSIRKQEPKGNPSLQNALEMARG 149 Query: 263 FEIGSPATTNGVVPADVILGTEDGAEYDDED 293 Sbjct: 150 LLLPVP---AHCTREVLIVFG-SLSTTDPGD 176 >MYND MYND domain (A zinc coordinating Cysteine rich domain) Length = 107 Score = 21.7 bits (45), Expect = 6.6 Identities = 17/76 (22%), Positives = 17/76 (22%), Gaps = 12/76 (15%) Query: 348 ERTRSFQLAVETFHQQHPQVI---------DVDEVEETGVVFSYNTDSRQTCAN-NRHQC 397 Sbjct: 1 DMKQSFERRIEVLQKERDAAVSAMRVQVHADIDDPNISGSLHGNEIISAKKCANCNREAL 60 Query: 398 SVHAECRD--YATGFC 411 Sbjct: 61 AECSLCRKTPYCSEFC 76 >SET Su(var)3-9, Enhancer of Zeste, trithorax domain (A chromatin associated domain) Length = 219 Score = 21.1 bits (44), Expect = 8.9 Identities = 13/119 (10%), Positives = 13/119 (10%), Gaps = 16/119 (13%) Query: 646 EKILRYAFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIGFRG-D 704 Sbjct: 30 EQLLQFSFLPELQFQNAAVKQRIQRLCY--------REEKRLAVSSLAKWLGQLHKQRLR 81 Query: 705 GRTCYDIDECSEQPSV-CGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYC 762 Sbjct: 82 APKNPPVAICWINSYVGYGVFARESIPAWSYIGEYTGILRRRQA------LWLDENDYC 134 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 8 Number of calls to ALIGN: 9 Length of query: 1247 Total length of test sequences: 20182 Effective length of test sequences: 16435.0 Effective search space size: 19918433.3 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= Tem11gi|128199|sp|P14543|NIDO_HUMAN NIDOGEN PRECURSOR (ENTACTIN) (1247 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|128081 [13..98] Neurophysin II 36 0.005 gi|2622601 [25..254] 7-bladed beta-propeller 27 2.2 gi|1016001 [3..270] Metallo-dependent phosphatases 25 6.1 gi|2039378 [21..454] PLP-dependent transferases 25 7.2 gi|2599298 [127..373] beta/alpha (TIM)-barrel 24 10.0 >gi|128081 [13..98] Neurophysin II Length = 86 Score = 35.5 bits (80), Expect = 0.005 Identities = 9/72 (12%), Positives = 9/72 (12%), Gaps = 3/72 (4%) Query: 670 NPCYIGTHGCDTNAACRPGPR---TQFTCECSIGFRGDGRTCYDIDECSEQPSVCGSHTI 726 Sbjct: 14 GPGGQGRCFGPSICCADALGCFVGTAEALRCQEENYLPSPCQSGQKPCGSGGRCAANGVC 73 Query: 727 CNNHPGTFRCEC 738 Sbjct: 74 CNDESCVIEPEC 85 Score = 33.2 bits (74), Expect = 0.028 Identities = 6/86 (6%), Positives = 6/86 (6%), Gaps = 7/86 (8%) Query: 746 DEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDE 805 Sbjct: 6 ELRQCLPCGPGGQGRCFGP---SICCADALGC-FVGTAEALRCQEENYLPS-PCQSGQKP 60 Query: 806 CQPSR--CHPDAFCYNTPGSFTCQCK 829 Sbjct: 61 CGSGGRCAANGVCCNDESCVIEPECR 86 >gi|2622601 [25..254] 7-bladed beta-propeller Length = 230 Score = 27.0 bits (59), Expect = 2.2 Identities = 14/165 (8%), Positives = 14/165 (8%), Gaps = 15/165 (9%) Query: 1082 DWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECL--NPS 1139 Sbjct: 9 HGDEQHTGYAKEPSDFS---AKTWY---LSIGGIKSSPAIFNKVAYIGSLDGRLYAVNLE 62 Query: 1140 QPSRRKALEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGIT 1199 Sbjct: 63 TGSVVWSYKTEGAIVSSPVVVNGTVFVGSWDGYLYAIDTDTGDLEWKFKTGNRIE----- 117 Query: 1200 TALSQCPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCI 1244 Sbjct: 118 --SSPAVSGDTVYIGSDDCRVYAVDRDDGSKKWEFYTGDAVKSSP 160 >gi|1016001 [3..270] Metallo-dependent phosphatases Length = 268 Score = 25.2 bits (55), Expect = 6.1 Identities = 7/37 (18%), Positives = 7/37 (18%), Gaps = 8/37 (21%) Query: 726 ICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYC 762 Sbjct: 231 IIRGH------EAVDGFRTNMNGKVITVFS--SVYHG 259 >gi|2039378 [21..454] PLP-dependent transferases Length = 434 Score = 25.2 bits (54), Expect = 7.2 Identities = 9/107 (8%), Positives = 9/107 (8%), Gaps = 14/107 (13%) Query: 410 FCCSCVAGYTGNGRQCVAEGSPQRVNGKVKGRIFVGSSQVPIV----------FENTDLH 459 Sbjct: 288 DMVTMAKGIGNGFPLGAVVTSKEIADSFNKSLYFNTYGGNPLASVVGKAVLEVIEEEKLQ 347 Query: 460 SYVVMNHGRSYTAISTIPETVGYSLLPLAPVGGIIGWMFAVEQDGFK 506 Sbjct: 348 ENSAVVGDYFLKQL----AAIDDATIGDVRGKGLMIGVELIDEQGKP 390 >gi|2599298 [127..373] beta/alpha (TIM)-barrel Length = 247 Score = 24.4 bits (53), Expect = 10.0 Identities = 13/59 (22%), Positives = 13/59 (22%), Gaps = 4/59 (6%) Query: 459 HSYVVMNHGRSYTAISTIPETVGYSLLPLAPVG-GIIGWMFAVEQDGFKNGFSITGGEF 516 Sbjct: 177 RRVFRPMGGRCWIRRGYSGGLHVYSTIPSMPYGCELLGW---VLADRLTQQLQIKDFEI 232 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 5 Number of calls to ALIGN: 6 Length of query: 1247 Total length of test sequences: 256703 Effective length of test sequences: 208388.0 Effective search space size: 251322163.8 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Thu Jan 18 20:35:24 2001 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 Tem11gi|128199|sp|P14543|NIDO_HUMAN using self-comparison > 1 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 670 to 1082 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 760 to 1127 score 303 eval 2.127230e-28 identity 30.49% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 670 NPCYIGTHGCD--TNAACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSVCGSHTIC 727 Tem11gi|128199|sp|P14543|NIDO_HUMAN | | | | || | | : :|| | || |||: | |:||| ||| | | 760 NYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDEC--QPSRCHPDAFC 817 Tem11gi|128199|sp|P14543|NIDO_HUMAN 728 NNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTC 787 Tem11gi|128199|sp|P14543|NIDO_HUMAN | ||:| |:| ||| | || | | : | : 818 YNTPGSFTCQCKPGYQ-GDGFRCVP---------GEVEKTRCQHEREHILGAAGATDPQR 867 Tem11gi|128199|sp|P14543|NIDO_HUMAN 788 SCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCVPGEVEKT 847 Tem11gi|128199|sp|P14543|NIDO_HUMAN || | || : |: : | | : | : :| | || 868 PIPPGLF--------VPECDAHGHYAPTQCHGSTGYCWCVDRDGREVEGTRTRPGMTPPC 919 Tem11gi|128199|sp|P14543|NIDO_HUMAN 848 RCQHEREHILGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVDRDGREVE 907 Tem11gi|128199|sp|P14543|NIDO_HUMAN | | | |:||| : || : :| : 920 LSTVAPPIHQGPAVPT-AVIPLPPGTHL-------------LFAQTGKIERLPLEGNTMR 965 Tem11gi|128199|sp|P14543|NIDO_HUMAN 908 GTRTRPGMTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLEGNTMRKT 967 Tem11gi|128199|sp|P14543|NIDO_HUMAN | : : | : | : | : |: | |: : 966 KTEAKAFLHVPAKVIIGLAF-------DCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQ 1018 Tem11gi|128199|sp|P14543|NIDO_HUMAN 968 EAKAFLHVPAKVIIGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDLGSPEGIA 1027 Tem11gi|128199|sp|P14543|NIDO_HUMAN : | | |:| | : : ::||| | | | | : : || :| || 1019 D----LGSPE----GIAVDHLGRNIFWTDSNLDRIEVAKLDGTQRRVLFETDLVNPRGIV 1070 Tem11gi|128199|sp|P14543|NIDO_HUMAN 1028 VDHLGRNIFWTDSNLD--RIEVAKLDGTQRRVLFETDLVNPRGIVTDSVRGNLYWTD 1082 Tem11gi|128199|sp|P14543|NIDO_HUMAN | : |::||| | | :|| : :||| ||:| : || | |: |: | | | 1071 TDSVRGNLYWTDWNRDNPKIETSYMDGTNRRILVQDDLGLPNGLHFDAFSSQLCWVD 1127 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 2 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 341 to 425 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 624 to 708 score 84 eval 4.702316e-05 identity 33.75% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 341 RPLGPPTERTRSFQLAVET---FHQQHPQVIDVDEVEETGVVFSYNTDSRQT-CANNRHQ 396 Tem11gi|128199|sp|P14543|NIDO_HUMAN || | |: ||:|:: : | ::: | | : |: | | | 624 RPALPSTQ-----QLSVDSVFVLYNQEEKILRYAFSNSIGPVREGSPDALQNPCYIGTHG 678 Tem11gi|128199|sp|P14543|NIDO_HUMAN 397 CSVHAECR-DYATGFCCSCVAGYTGNGRQC 425 Tem11gi|128199|sp|P14543|NIDO_HUMAN | :| || | | | | |: |:|| | 679 CDTNAACRPGPRTQFTCECSIGFRGDGRTC 708 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 3 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 390 to 433 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 762 to 810 score 74 eval 5.472206e-04 identity 36.36% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 390 CANNRHQCSVHAECRDYATG---FCCSCVAGYTGNGRQC--VAEGSPQR 433 Tem11gi|128199|sp|P14543|NIDO_HUMAN | | | : : || : |||: |::|:|: | | | | | 762 CETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDECQPSR 810 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 4 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 652 to 708 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 784 to 839 score 69 eval 1.865759e-03 identity 32.14% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 652 AFSNSIGPVREGSPDALQNPCYIGTHGCDTNAACRPGPRTQFTCECSIGFRGDGRTC 708 Tem11gi|128199|sp|P14543|NIDO_HUMAN ::: | | | | |: | :| | | : |||:| |::||| | 784 SYTCSCLPGFSGDGQACQDVDECQPSRCHPDAFCYNTPGS-FTCQCKPGYQGDGFRC 839 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 5 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 396 to 426 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 810 to 840 score 66 eval 3.892262e-03 identity 38.71% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 396 QCSVHAECRDYATGFCCSCVAGYTGNGRQCV 426 Tem11gi|128199|sp|P14543|NIDO_HUMAN :| | | : | | | || |:| :|| 810 RCHPDAFCYNTPGSFTCQCKPGYQGDGFRCV 840 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 6 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 390 to 435 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 714 to 760 score 63 eval 8.110903e-03 identity 30.43% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 390 CANNRHQCSVHAECRDYATGFCCSCVAGYT-GNGRQCVAEGSPQRVN 435 Tem11gi|128199|sp|P14543|NIDO_HUMAN |: | | | :: | | || || : ||| : :| 714 CSEQPSVCGSHTICNNHPGTFRCECVEGYQFSDEGTCVAVVDQRPIN 760 Tem11gi|128199|sp|P14543|NIDO_HUMAN > 7 Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 670 to 777 vs Tem11gi|128199|sp|P14543|NIDO_HUMAN len 1247 from 760 to 811 score 58 eval 2.740290e-02 identity 50.00% K 2.720488e-02 L 2.454462e-01 H 1.376885e+00 alpha 8.743731e-02 670 NPCYIGTHGCD--TNAACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSVCGSHTIC 727 Tem11gi|128199|sp|P14543|NIDO_HUMAN | | | | || | | : :|| | || |||: | |:||| ||| 760 NYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQACQDVDEC--QPS-------- 809 Tem11gi|128199|sp|P14543|NIDO_HUMAN 728 NNHPGTFRCECVEGYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQC 777 Tem11gi|128199|sp|P14543|NIDO_HUMAN :| 810 ------------------------------------------------RC 811 Tem11gi|128199|sp|P14543|NIDO_HUMAN