analysis of sequence from tem27 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDLDR NNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDL SENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTL RLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNANSISC PSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDA FQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISK GLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSED YRSRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGIFKKLPNLR KINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSS VRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIP IQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPRE LSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPE GSFNDLTSLSHLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVD INIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHGGTCHLSDSHKDGF SCSCPLGFEGQRCEINPDDCEDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAK CIPLDKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMVLLQT SPCDQYECQNGAQCIVVQQEPTCRCPPGFAGPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDK DNGILLYKGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPK SLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLGGFHGCIHEVRINNELQDFKALPPQSLGVSP GCKSCTVCKHGLCRSVEKDSVVCECRPGWTGPLCDQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGD LCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQENPCLGQVVREVIRRQKGYASCATA SKVPIMECRGGCGPQCCQPTRSKRRKYVFQCTDGSSFVEEVERHLECGCLACS ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > gi|4049589|dbj|BAA35186.1| . . . . . 1 MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGL 50 ______HHHHHHHHHHHHHHHHHHHH________________________ . . . . . 51 GLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVI 100 EEEEE_________________EEEEE_________EEEEE____EEEEE . . . . . 101 ERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPR 150 E__HHHHHHHHHHHHHH______HHHHHHH_____EEE____________ . . . . . 151 KAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTS 200 ___EEEEEEEE______EEEE____HHHHHHHHEEEE______EEEEEEE . . . . . 201 FNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRG 250 ______HHHHHHH_____HHHHHHHHHHHHHHHHHHHHHHHH____EEE_ . . . . . 251 FNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLME 300 ________EEEE___________________________EEE_____EEE . . . . . 301 IPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDA 350 _________EEEEHHHHHHH_____HHHHHHHHHEEE_____________ . . . . . 351 FQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQ 400 _HHHHHHHHEEE____HHHHHHHH_____HHHHHHHH____EEEEEE___ . . . . . 401 DLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLAD 450 __EEEE_EEE____EEEEE____HHHHHHHHHHHH_____HHHHHHHHHH . . . . . 451 YLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECF 500 HH______________HHHHHHHHHHHHH__EEE_____________HHH . . . . . 501 MDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEAT 550 HHH__________EEEE____EEEEE_____EEE________EEEEEEE_ . . . . . 551 GIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRV 600 __EEE____EEEE____HHHHHHH_____HHHHHHHH_____EEE_____ . . . . . 601 FRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFT 650 EEEE__HHHHHHHH_EEEEE_________EEEEEEE____EEE_____EE . . . . . 651 TLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIP 700 EEEEEE____________HHHHHHHHHHHHEEEE_________EEEEE__ . . . . . 701 IQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGM 750 _EEEEEEE________________________EEEEEE____________ . . . . . 751 PKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSH 800 ______EEE_______HHHHHHHHHHHHHH______EEEEE_______HH . . . . . 801 LSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLS 850 HHHHHHH________EEEE_____EEEEE________________EEE_E . . . . . 851 HLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT 900 EEE_________HHHHHHHHHHHH______EEE______HHHHHHH____ . . . . . 901 HRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGK 950 ___________EEEE____________________EEEEEE_________ . . . . . 951 DCTVPINTCIQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDC 1000 ______EEEE_________EEEE__________________EEE______ . . . . . 1001 EDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAK 1050 ________________EEEEE_________HHHHHHH___________EE . . . . . 1051 CIPLDKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTC 1100 E_________________EEEE______EEEE_____EEEEE________ . . . . . 1101 PQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGFA 1150 _______________EEEE_____________EEEEE_____________ . . . . . 1151 GPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDKDNGILLYKGD 1200 ___EEEEEEEEE_____HHHHHHHH________EEEE______EEEEE__ . . . . . 1201 NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTL 1250 __HHHHHHHH___EEEEE_______EEEEEEE______EEEEEEEE_HHH . . . . . 1251 NLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLG 1300 HH___________________________________HHHHHHH______ . . . . . 1301 GFHGCIHEVRINNELQDFKALPPQSLGVSPGCKSCTVCKHGLCRSVEKDS 1350 ____EEEEEE___HHHHH________________EEEE____________ . . . . . 1351 VVCECRPGWTGPLCDQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGD 1400 EEEE__________________________EEEEE___EEEE________ . . . . . 1401 LCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQENPCL 1450 _____________EEEE____EEE_______EEE________________ . . . . . 1451 GQVVREVIRRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQ 1500 _HHHHHHHHHHH___EEEE____EEEE_________________EEEEEE . . 1501 CTDGSSFVEEVERHLECGCLACS 1523 ______EEEHHHHHHHHEEE___ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 20.6 % beta-contents : 23.4 % coil-contents : 56.1 % class : mixed method : 2 alpha-contents : 7.6 % beta-contents : 20.6 % coil-contents : 71.8 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -20.30 -0.88 -0.15 0.00 -4.00 -4.00 0.00 0.00 -3.43 -6.29 -1.30 -12.00 -12.00 0.00 -12.00 0.00 -76.35 -15.29 -5.43 -1.88 -0.94 -4.00 -4.00 -8.00 0.00 -1.37 -4.99 -1.30 -12.00 0.00 0.00 -12.00 0.00 -71.21 ID: gi|4049589|dbj|BAA35186.1| AC: xxx Len: 1470 1:I 1455 Sc: -71.21 Pv: 5.919797e-01 NO_GPI_SITE GPI: learning from protozoa -31.62 -0.93 -0.25 0.00 -4.00 -4.00 0.00 0.00 -1.72 -5.47 -5.56 -12.00 -12.00 0.00 -12.00 0.00 -89.54 -31.05 -1.69 -0.74 -2.76 -4.00 0.00 0.00 0.00 -0.90 -6.28 -5.56 -12.00 -12.00 0.00 -12.00 0.00 -88.99 ID: gi|4049589|dbj|BAA35186.1| AC: xxx Len: 1470 1:I 1451 Sc: -88.99 Pv: 6.769322e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|4049589| 0.652 34 Y 0.680 34 Y 0.970 13 Y 0.831 Y # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|4049589| 0.772 34 Y 0.670 34 Y 0.967 19 Y 0.677 Y # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|4049589| 0.757 34 Y 0.534 34 Y 0.995 17 Y 0.822 Y ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] 1-5 MAPGW agvgaavrarlalalalasvlsgppa 6-31 32-373 VACPTKCTCSAASVDCHGLGLRAVPRGIPR NAERLDLDRNNITRITKMDFAGLKNLRVLH LEDNQVSVIERGAFQDLKQLERLRLNKNKL QVLPELLFQSTPKLTRLDLSENQIQGIPRK AFRGITDVKNLQLDNNHISCIEDGAFRALR DLEILTLNNNNISRILVTSFNHMPKIRTLR LHSNHLYCDCHLAWLSDWLRQRRTVGQFTL CMAPVHLRGFNVADVQKKEYVCPAPHSEPP SCNANSISCPSPCTCSNNIVDCRGKGLMEI PANLPEGIVEIRLEQNSIKAIPAGAFTQYK KLKRIDISKNQISDIAPDAFQGLKSLTSLV LYGNKITEIAKG lfdglvslqllll 374-386 387-992 NANKINCLRVNTFQDLQNLNLLSLYDNKLQ TISKGLFAPLQSIQTLHLAQNPFVCDCHLK WLADYLQDNPIETSGARCSSPRRLANKRIS QIKSKKFRCSGSEDYRSRFSSECFMDLVCP EKCRCEGTIVDCSNQKLVRIPSHLPEYVTD LRLNDNEVSVLEATGIFKKLPNLRKINLSN NKIKEVREGAFDGAASVQELMLTGNQLETV HGRVFRGLSGLKTLMLRSNLIGCVSNDTFA GLSSVRLLSLYDNRITTITPGAFTTLVSLS TINLLSNPFNCNCHLAWLGKWLRKRRIVSG NPRCQKPFFLKEIPIQDVAIQDFTCDGNEE SSCQLSPRCPEQCTCMETVVRCSNKGLRAL PRGMPKDVTELYLEGNHLTAVPRELSALRH LTLIDLSNNSISMLTNYTFSNMSHLSTLIL SYNRLRCIPVHAFNGLRSLRVLTLHGNDIS SVPEGSFNDLTSLSHLALGTNPLHCDCSLR WLSEWVKAGYKEPGIARCSSPEPMADRLLL TTPTHRFQCKGPVDINIVAKCNACLSSPCK NNGTCTQDPVELYRCACPYSYKGKDCTVPI NTCIQNPCQHGGTCHLSDSHKDGFSCSCPL GFEGQR ceinpddcedndcenn 993-1008 1009-1240 ATCVDGINNYVCICPPNYTGELCDEVIDHC VPELNLCQHEAKCIPLDKGFSCECVPGYSG KLCETDNDDCVAHKCRHGAQCVDTINGYTC TCPQGFSGPFCEHPPPMVLLQTSPCDQYEC QNGAQCIVVQQEPTCRCPPGFAGPRCEKLI TVNFVGKDSYVELASAKVRPQANISLQVAT DKDNGILLYKGDNDPLALELYQGHVRLVYD SLSSPPTTVYSVETVNDGQFHS velvtlnqtlnlvv 1241-1254 1255-1479 DKGTPKSLGKLQKQPAVGINSPLYLGGIPT STGLSALRQGTDRPLGGFHGCIHEVRINNE LQDFKALPPQSLGVSPGCKSCTVCKHGLCR SVEKDSVVCECRPGWTGPLCDQEARDPCLG HRCHHGKCVATGTSYMCKCAEGYGGDLCDN KNDSANACSAFKCHHGQCHISDQGEPYCLC QPGFSGEHCQQENPCLGQVVREVIRRQKGY ASCATASKVPIMECR ggcgpqccqp 1480-1489 1490-1523 TRSKRRKYVFQCTDGSSFVEEVERHLECGC LACS low complexity regions: SEG 25 3.0 3.3 >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] 1-1 M apgwagvgaavrarlalalalasvlsgppa 2-47 vacptkctcsaasvdc 48-104 HGLGLRAVPRGIPRNAERLDLDRNNITRIT KMDFAGLKNLRVLHLEDNQVSVIERGA fqdlkqlerlrlnknklqvlpellfqstpk 105-141 ltrldls 142-262 ENQIQGIPRKAFRGITDVKNLQLDNNHISC IEDGAFRALRDLEILTLNNNNISRILVTSF NHMPKIRTLRLHSNHLYCDCHLAWLSDWLR QRRTVGQFTLCMAPVHLRGFNVADVQKKEY V cpaphseppscnansiscpspctcsnn 263-289 290-380 IVDCRGKGLMEIPANLPEGIVEIRLEQNSI KAIPAGAFTQYKKLKRIDISKNQISDIAPD AFQGLKSLTSLVLYGNKITEIAKGLFDGLV S lqllllnankinclrvntfqdlqnlnllsl 381-410 411-770 YDNKLQTISKGLFAPLQSIQTLHLAQNPFV CDCHLKWLADYLQDNPIETSGARCSSPRRL ANKRISQIKSKKFRCSGSEDYRSRFSSECF MDLVCPEKCRCEGTIVDCSNQKLVRIPSHL PEYVTDLRLNDNEVSVLEATGIFKKLPNLR KINLSNNKIKEVREGAFDGAASVQELMLTG NQLETVHGRVFRGLSGLKTLMLRSNLIGCV SNDTFAGLSSVRLLSLYDNRITTITPGAFT TLVSLSTINLLSNPFNCNCHLAWLGKWLRK RRIVSGNPRCQKPFFLKEIPIQDVAIQDFT CDGNEESSCQLSPRCPEQCTCMETVVRCSN KGLRALPRGMPKDVTELYLEGNHLTAVPRE lsalrhltlidlsnnsismltnytfsnmsh 771-811 lstlilsynrl 812-992 RCIPVHAFNGLRSLRVLTLHGNDISSVPEG SFNDLTSLSHLALGTNPLHCDCSLRWLSEW VKAGYKEPGIARCSSPEPMADRLLLTTPTH RFQCKGPVDINIVAKCNACLSSPCKNNGTC TQDPVELYRCACPYSYKGKDCTVPINTCIQ NPCQHGGTCHLSDSHKDGFSCSCPLGFEGQ R ceinpddcedndcennatcvdginnyvcic 993-1028 ppnytg 1029-1523 ELCDEVIDHCVPELNLCQHEAKCIPLDKGF SCECVPGYSGKLCETDNDDCVAHKCRHGAQ CVDTINGYTCTCPQGFSGPFCEHPPPMVLL QTSPCDQYECQNGAQCIVVQQEPTCRCPPG FAGPRCEKLITVNFVGKDSYVELASAKVRP QANISLQVATDKDNGILLYKGDNDPLALEL YQGHVRLVYDSLSSPPTTVYSVETVNDGQF HSVELVTLNQTLNLVVDKGTPKSLGKLQKQ PAVGINSPLYLGGIPTSTGLSALRQGTDRP LGGFHGCIHEVRINNELQDFKALPPQSLGV SPGCKSCTVCKHGLCRSVEKDSVVCECRPG WTGPLCDQEARDPCLGHRCHHGKCVATGTS YMCKCAEGYGGDLCDNKNDSANACSAFKCH HGQCHISDQGEPYCLCQPGFSGEHCQQENP CLGQVVREVIRRQKGYASCATASKVPIMEC RGGCGPQCCQPTRSKRRKYVFQCTDGSSFV EEVERHLECGCLACS low complexity regions: SEG 45 3.4 3.75 >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] 1-1 M apgwagvgaavrarlalalalasvlsgppa 2-66 vacptkctcsaasvdchglglravprgipr naerl 67-339 DLDRNNITRITKMDFAGLKNLRVLHLEDNQ VSVIERGAFQDLKQLERLRLNKNKLQVLPE LLFQSTPKLTRLDLSENQIQGIPRKAFRGI TDVKNLQLDNNHISCIEDGAFRALRDLEIL TLNNNNISRILVTSFNHMPKIRTLRLHSNH LYCDCHLAWLSDWLRQRRTVGQFTLCMAPV HLRGFNVADVQKKEYVCPAPHSEPPSCNAN SISCPSPCTCSNNIVDCRGKGLMEIPANLP EGIVEIRLEQNSIKAIPAGAFTQYKKLKRI DIS knqisdiapdafqglksltslvlygnkite 340-439 iakglfdglvslqllllnankinclrvntf qdlqnlnllslydnklqtiskglfaplqsi qtlhlaqnpf 440-754 VCDCHLKWLADYLQDNPIETSGARCSSPRR LANKRISQIKSKKFRCSGSEDYRSRFSSEC FMDLVCPEKCRCEGTIVDCSNQKLVRIPSH LPEYVTDLRLNDNEVSVLEATGIFKKLPNL RKINLSNNKIKEVREGAFDGAASVQELMLT GNQLETVHGRVFRGLSGLKTLMLRSNLIGC VSNDTFAGLSSVRLLSLYDNRITTITPGAF TTLVSLSTINLLSNPFNCNCHLAWLGKWLR KRRIVSGNPRCQKPFFLKEIPIQDVAIQDF TCDGNEESSCQLSPRCPEQCTCMETVVRCS NKGLRALPRGMPKDV telylegnhltavprelsalrhltlidlsn 755-866 nsismltnytfsnmshlstlilsynrlrci pvhafnglrslrvltlhgndissvpegsfn dltslshlalgtnplhcdcslr 867-1523 WLSEWVKAGYKEPGIARCSSPEPMADRLLL TTPTHRFQCKGPVDINIVAKCNACLSSPCK NNGTCTQDPVELYRCACPYSYKGKDCTVPI NTCIQNPCQHGGTCHLSDSHKDGFSCSCPL GFEGQRCEINPDDCEDNDCENNATCVDGIN NYVCICPPNYTGELCDEVIDHCVPELNLCQ HEAKCIPLDKGFSCECVPGYSGKLCETDND DCVAHKCRHGAQCVDTINGYTCTCPQGFSG PFCEHPPPMVLLQTSPCDQYECQNGAQCIV VQQEPTCRCPPGFAGPRCEKLITVNFVGKD SYVELASAKVRPQANISLQVATDKDNGILL YKGDNDPLALELYQGHVRLVYDSLSSPPTT VYSVETVNDGQFHSVELVTLNQTLNLVVDK GTPKSLGKLQKQPAVGINSPLYLGGIPTST GLSALRQGTDRPLGGFHGCIHEVRINNELQ DFKALPPQSLGVSPGCKSCTVCKHGLCRSV EKDSVVCECRPGWTGPLCDQEARDPCLGHR CHHGKCVATGTSYMCKCAEGYGGDLCDNKN DSANACSAFKCHHGQCHISDQGEPYCLCQP GFSGEHCQQENPCLGQVVREVIRRQKGYAS CATASKVPIMECRGGCGPQCCQPTRSKRRK YVFQCTDGSSFVEEVERHLECGCLACS low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] MAPGWagvgaavrarlalalalaSVLSGPPAVACPTKCTCSAASVDCHGLGLravprgip rnaerLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNK LQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRAL RDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFT LCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLME IPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSL VLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISK GLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKS KKFRCSGSEDYRSRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLN DNEVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRV FRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINL LSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQ LSPRCPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPRELSALRHLTLI DLSNNSISMLTNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPE GSFNDLTSLSHLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT HRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGKDCTVPINTCI QNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDCEDNDCENNATCVDGINNYVC ICPPNYTGELCDEVIDHCVPELNLCQHEAKCIPLDKGFSCECVPGYSGKLCETDNDDCVA HKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQE PTCRCPPGFAGPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDKDNGILLYKGD NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPK SLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLGGFHGCIHEVRINNELQDFKA LPPQSLGVSPGCKSCTVCKHGLCRSVEKDSVVCECRPGWTGPLCDQEARDPCLGHRCHHG KCVATGTSYMCKCAEGYGGDLCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSG EHCQQENPCLGQVVREVIRRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQ CTDGSSFVEEVERHLECGCLACS 1 - 5 MAPGW 6 - 23 agvga avrarlalal ala 24 - 52 SVLSGPP AVACPTKCTC SAASVDCHGL GL 53 - 65 ravprgip rnaer 66 - 1523 LDLDR NNITRITKMD FAGLKNLRVL HLEDNQVSVI ERGAFQDLKQ LERLRLNKNK LQVLP ELLFQ STPKLTRLDL SENQIQGIPR KAFRGITDVK NLQLDNNHIS CIEDGAFRAL RDLEI LTLNN NNISRILVTS FNHMPKIRTL RLHSNHLYCD CHLAWLSDWL RQRRTVGQFT LCMAP VHLRG FNVADVQKKE YVCPAPHSEP PSCNANSISC PSPCTCSNNI VDCRGKGLME IPANL PEGIV EIRLEQNSIK AIPAGAFTQY KKLKRIDISK NQISDIAPDA FQGLKSLTSL VLYGN KITEI AKGLFDGLVS LQLLLLNANK INCLRVNTFQ DLQNLNLLSL YDNKLQTISK GLFAP LQSIQ TLHLAQNPFV CDCHLKWLAD YLQDNPIETS GARCSSPRRL ANKRISQIKS KKFRC SGSED YRSRFSSECF MDLVCPEKCR CEGTIVDCSN QKLVRIPSHL PEYVTDLRLN DNEVS VLEAT GIFKKLPNLR KINLSNNKIK EVREGAFDGA ASVQELMLTG NQLETVHGRV FRGLS GLKTL MLRSNLIGCV SNDTFAGLSS VRLLSLYDNR ITTITPGAFT TLVSLSTINL LSNPF NCNCH LAWLGKWLRK RRIVSGNPRC QKPFFLKEIP IQDVAIQDFT CDGNEESSCQ LSPRC PEQCT CMETVVRCSN KGLRALPRGM PKDVTELYLE GNHLTAVPRE LSALRHLTLI DLSNN SISML TNYTFSNMSH LSTLILSYNR LRCIPVHAFN GLRSLRVLTL HGNDISSVPE GSFND LTSLS HLALGTNPLH CDCSLRWLSE WVKAGYKEPG IARCSSPEPM ADRLLLTTPT HRFQC KGPVD INIVAKCNAC LSSPCKNNGT CTQDPVELYR CACPYSYKGK DCTVPINTCI QNPCQ HGGTC HLSDSHKDGF SCSCPLGFEG QRCEINPDDC EDNDCENNAT CVDGINNYVC ICPPN YTGEL CDEVIDHCVP ELNLCQHEAK CIPLDKGFSC ECVPGYSGKL CETDNDDCVA HKCRH GAQCV DTINGYTCTC PQGFSGPFCE HPPPMVLLQT SPCDQYECQN GAQCIVVQQE PTCRC PPGFA GPRCEKLITV NFVGKDSYVE LASAKVRPQA NISLQVATDK DNGILLYKGD NDPLA LELYQ GHVRLVYDSL SSPPTTVYSV ETVNDGQFHS VELVTLNQTL NLVVDKGTPK SLGKL QKQPA VGINSPLYLG GIPTSTGLSA LRQGTDRPLG GFHGCIHEVR INNELQDFKA LPPQS LGVSP GCKSCTVCKH GLCRSVEKDS VVCECRPGWT GPLCDQEARD PCLGHRCHHG KCVAT GTSYM CKCAEGYGGD LCDNKNDSAN ACSAFKCHHG QCHISDQGEP YCLCQPGFSG EHCQQ ENPCL GQVVREVIRR QKGYASCATA SKVPIMECRG GCGPQCCQPT RSKRRKYVFQ CTDGS SFVEE VERHLECGCL ACS low complexity regions: DUST >gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGLGLRAVPRGIP RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNK LQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRAL RDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFT LCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLME IPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSL VLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISK GLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKS KKFRCSGSEDYRSRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLN DNEVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRV FRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINL LSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQ LSPRCPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPRELSALRHLTLI DLSNNSISMLTNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPE GSFNDLTSLSHLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT HRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGKDCTVPINTCI QNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDCEDNDCENNATCVDGINNYVC ICPPNYTGELCDEVIDHCVPELNLCQHEAKCIPLDKGFSCECVPGYSGKLCETDNDDCVA HKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQE PTCRCPPGFAGPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDKDNGILLYKGD NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPK SLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLGGFHGCIHEVRINNELQDFKA LPPQSLGVSPGCKSCTVCKHGLCRSVEKDSVVCECRPGWTGPLCDQEARDPCLGHRCHHG KCVATGTSYMCKCAEGYGGDLCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSG EHCQQENPCLGQVVREVIRRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQ CTDGSSFVEEVERHLECGCLACS ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for gi|4049589|dbj|BAA35186.1| sequence: 1470 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MAPGWAGVGA AVRARLALAL ALASVLSGPP AVACPTKCTC SAASVDCHGL GLRAVPRGIP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 RNAERLDLDR NNITRITKMD FAGLKNLRVL HLEDNQVSVI ERGAFQDLKQ LERLRLNKNK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 1111111111 11111~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~111133999 9999999999 * 14 M'95 -w local . | . | . | . | . | . 180 LQVLPELLFQ STPKLTRLDL SENQIQGIPR KAFRGITDVK NLQLDNNHIS CIEDGAFRAL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~333 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 9995~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~11 * 14 M'95 -w local . | . | . | . | . | . 240 RDLEILTLNN NNISRILVTS FNHMPKIRTL RLHSNHLYCD CHLAWLSDWL RQRRTVGQFT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register 3333333333 333333331~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 1111111111 11~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 LCMAPVHLRG FNVADVQKKE YVCPAPHSEP PSCNANSISC PSPCTCSNNI VDCRGKGLME ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 IPANLPEGIV EIRLEQNSIK AIPAGAFTQY KKLKRIDISK NQISDIAPDA FQGLKSLTSL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 VLYGNKITEI AKGLFDGLVS LQLLLLNANK INCLRVNTFQ DLQNLNLLSL YDNKLQTISK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 GLFAPLQSIQ TLHLAQNPFV CDCHLKWLAD YLQDNPIETS GARCSSPRRL ANKRISQIKS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 KKFRCSGSED YRSRFSSECF MDLVCPEKCR CEGTIVDCSN QKLVRIPSHL PEYVTDLRLN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 DNEVSVLEAT GIFKKLPNLR KINLSNNKIK EVREGAFDGA ASVQELMLTG NQLETVHGRV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~3 4444444444 4444~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 FRGLSGLKTL MLRSNLIGCV SNDTFAGLSS VRLLSLYDNR ITTITPGAFT TLVSLSTINL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 LSNPFNCNCH LAWLGKWLRK RRIVSGNPRC QKPFFLKEIP IQDVAIQDFT CDGNEESSCQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 LSPRCPEQCT CMETVVRCSN KGLRALPRGM PKDVTELYLE GNHLTAVPRE LSALRHLTLI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 DLSNNSISML TNYTFSNMSH LSTLILSYNR LRCIPVHAFN GLRSLRVLTL HGNDISSVPE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 GSFNDLTSLS HLALGTNPLH CDCSLRWLSE WVKAGYKEPG IARCSSPEPM ADRLLLTTPT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 HRFQCKGPVD INIVAKCNAC LSSPCKNNGT CTQDPVELYR CACPYSYKGK DCTVPINTCI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 QNPCQHGGTC HLSDSHKDGF SCSCPLGFEG QRCEINPDDC EDNDCENNAT CVDGINNYVC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 ICPPNYTGEL CDEVIDHCVP ELNLCQHEAK CIPLDKGFSC ECVPGYSGKL CETDNDDCVA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1140 HKCRHGAQCV DTINGYTCTC PQGFSGPFCE HPPPMVLLQT SPCDQYECQN GAQCIVVQQE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1200 PTCRCPPGFA GPRCEKLITV NFVGKDSYVE LASAKVRPQA NISLQVATDK DNGILLYKGD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1260 NDPLALELYQ GHVRLVYDSL SSPPTTVYSV ETVNDGQFHS VELVTLNQTL NLVVDKGTPK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1320 SLGKLQKQPA VGINSPLYLG GIPTSTGLSA LRQGTDRPLG GFHGCIHEVR INNELQDFKA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1380 LPPQSLGVSP GCKSCTVCKH GLCRSVEKDS VVCECRPGWT GPLCDQEARD PCLGHRCHHG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1440 KCVATGTSYM CKCAEGYGGD LCDNKNDSAN ACSAFKCHHG QCHISDQGEP YCLCQPGFSG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | EHCQQENPCL GQVVREVIRR QKGYASCATA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem27.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem27.___inter___ (1 sequences) MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGL GLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVI ERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPR KAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTS FNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRG FNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLME IPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDA FQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQ DLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLAD YLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECF MDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEAT GIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRV FRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFT TLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIP IQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGM PKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSH LSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLS HLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT HRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGK DCTVPINTCIQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDC EDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAK CIPLDKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTC PQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGFA GPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDKDNGILLYKGD NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTL NLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLG GFHGCIHEVRINNELQDFKALPPQSLGVSPGCKSCTVCKHGLCRSVEKDS VVCECRPGWTGPLCDQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGD LCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQENPCL GQVVREVIRRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQ CTDGSSFVEEVERHLECGCLACS (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 16 36 1.560 Certain 2 641 661 1.534 Certain 3 789 809 0.747 Putative 4 1271 1291 1.057 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 Loop length 15 604 127 461 232 K+R profile 3.00 + + + + CYT-EXT prof - -0.71 -0.68 -0.42 -0.26 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.37 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.71 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 Loop length 15 604 609 232 K+R profile 3.00 + + + CYT-EXT prof - -0.30 -0.42 -0.68 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.80 -> Orientation: N-out ---------------------------------------------------------------------- "tem27" 1523 16 36 #t 1.56042 641 661 #t 1.53438 789 809 #f 0.746875 1271 1291 #t 1.05729 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem27.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem27.___inter___ (1 sequences) MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGL GLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVI ERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPR KAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTS FNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRG FNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLME IPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDA FQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQ DLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLAD YLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECF MDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEAT GIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRV FRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFT TLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIP IQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGM PKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSH LSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLS HLALGTNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT HRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQDPVELYRCACPYSYKGK DCTVPINTCIQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDC EDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAK CIPLDKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTC PQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGFA GPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDKDNGILLYKGD NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTL NLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLG GFHGCIHEVRINNELQDFKALPPQSLGVSPGCKSCTVCKHGLCRSVEKDS VVCECRPGWTGPLCDQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGD LCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQENPCL GQVVREVIRRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQ CTDGSSFVEEVERHLECGCLACS (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 16 36 1.560 Certain 2 641 661 1.534 Certain 3 789 809 0.747 Putative 4 1271 1291 1.057 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 Loop length 15 604 127 461 232 K+R profile 3.00 + + + + CYT-EXT prof - -0.71 -0.68 -0.42 -0.26 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.37 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.71 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 Loop length 15 604 609 232 K+R profile 3.00 + + + CYT-EXT prof - -0.30 -0.42 -0.68 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 2.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.80 -> Orientation: N-out ---------------------------------------------------------------------- "tem27" 1523 16 36 #t 1.56042 641 661 #t 1.53438 789 809 #f 0.746875 1271 1291 #t 1.05729 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem27.___saps___ SAPS. Version of April 11, 1996. Date run: Thu Jan 18 20:24:42 2001 File: /people/maria/tem27.___saps___ ID gi|4049589|dbj|BAA35186.1| DE Slit-3 protein [Homo sapiens] number of residues: 1523; molecular weight: 167.7 kdal 1 MAPGWAGVGA AVRARLALAL ALASVLSGPP AVACPTKCTC SAASVDCHGL GLRAVPRGIP 61 RNAERLDLDR NNITRITKMD FAGLKNLRVL HLEDNQVSVI ERGAFQDLKQ LERLRLNKNK 121 LQVLPELLFQ STPKLTRLDL SENQIQGIPR KAFRGITDVK NLQLDNNHIS CIEDGAFRAL 181 RDLEILTLNN NNISRILVTS FNHMPKIRTL RLHSNHLYCD CHLAWLSDWL RQRRTVGQFT 241 LCMAPVHLRG FNVADVQKKE YVCPAPHSEP PSCNANSISC PSPCTCSNNI VDCRGKGLME 301 IPANLPEGIV EIRLEQNSIK AIPAGAFTQY KKLKRIDISK NQISDIAPDA FQGLKSLTSL 361 VLYGNKITEI AKGLFDGLVS LQLLLLNANK INCLRVNTFQ DLQNLNLLSL YDNKLQTISK 421 GLFAPLQSIQ TLHLAQNPFV CDCHLKWLAD YLQDNPIETS GARCSSPRRL ANKRISQIKS 481 KKFRCSGSED YRSRFSSECF MDLVCPEKCR CEGTIVDCSN QKLVRIPSHL PEYVTDLRLN 541 DNEVSVLEAT GIFKKLPNLR KINLSNNKIK EVREGAFDGA ASVQELMLTG NQLETVHGRV 601 FRGLSGLKTL MLRSNLIGCV SNDTFAGLSS VRLLSLYDNR ITTITPGAFT TLVSLSTINL 661 LSNPFNCNCH LAWLGKWLRK RRIVSGNPRC QKPFFLKEIP IQDVAIQDFT CDGNEESSCQ 721 LSPRCPEQCT CMETVVRCSN KGLRALPRGM PKDVTELYLE GNHLTAVPRE LSALRHLTLI 781 DLSNNSISML TNYTFSNMSH LSTLILSYNR LRCIPVHAFN GLRSLRVLTL HGNDISSVPE 841 GSFNDLTSLS HLALGTNPLH CDCSLRWLSE WVKAGYKEPG IARCSSPEPM ADRLLLTTPT 901 HRFQCKGPVD INIVAKCNAC LSSPCKNNGT CTQDPVELYR CACPYSYKGK DCTVPINTCI 961 QNPCQHGGTC HLSDSHKDGF SCSCPLGFEG QRCEINPDDC EDNDCENNAT CVDGINNYVC 1021 ICPPNYTGEL CDEVIDHCVP ELNLCQHEAK CIPLDKGFSC ECVPGYSGKL CETDNDDCVA 1081 HKCRHGAQCV DTINGYTCTC PQGFSGPFCE HPPPMVLLQT SPCDQYECQN GAQCIVVQQE 1141 PTCRCPPGFA GPRCEKLITV NFVGKDSYVE LASAKVRPQA NISLQVATDK DNGILLYKGD 1201 NDPLALELYQ GHVRLVYDSL SSPPTTVYSV ETVNDGQFHS VELVTLNQTL NLVVDKGTPK 1261 SLGKLQKQPA VGINSPLYLG GIPTSTGLSA LRQGTDRPLG GFHGCIHEVR INNELQDFKA 1321 LPPQSLGVSP GCKSCTVCKH GLCRSVEKDS VVCECRPGWT GPLCDQEARD PCLGHRCHHG 1381 KCVATGTSYM CKCAEGYGGD LCDNKNDSAN ACSAFKCHHG QCHISDQGEP YCLCQPGFSG 1441 EHCQQENPCL GQVVREVIRR QKGYASCATA SKVPIMECRG GCGPQCCQPT RSKRRKYVFQ 1501 CTDGSSFVEE VERHLECGCL ACS -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 81( 5.3%); C+ :106( 7.0%); D : 77( 5.1%); E : 73( 4.8%); F : 44( 2.9%) G :105( 6.9%); H : 43( 2.8%); I : 72( 4.7%); K : 76( 5.0%); L :171(11.2%) M : 16( 1.1%); N : 95( 6.2%); P : 90( 5.9%); Q : 67( 4.4%); R : 85( 5.6%) S :113( 7.4%); T : 79( 5.2%); V : 89( 5.8%); W : 9( 0.6%); Y : 32( 2.1%) KR : 161 ( 10.6%); ED : 150 ( 9.8%); AGP : 276 ( 18.1%); KRED : 311 ( 20.4%); KR-ED : 11 ( 0.7%); FIKMNY : 335 ( 22.0%); LVIFM : 392 ( 25.7%); ST : 192 ( 12.6%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0000000000 00+0+00000 0000000000 000000+000 00000-0000 00+000+000 61 +00-+0-0-+ 0000+00+0- 0000+00+00 00--000000 -+0000-0+0 0-+0+00+0+ 121 00000-0000 000+00+0-0 0-0000000+ +00+000-0+ 0000-00000 00--000+00 181 +-0-000000 0000+00000 00000+0+00 +00000000- 0000000-00 +0++000000 241 00000000+0 0000-00++- 00000000-0 0000000000 0000000000 0-0+0+000- 301 000000-000 -0+0-0000+ 0000000000 ++0++0-00+ 0000-000-0 0000+00000 361 00000+00-0 0+000-0000 000000000+ 0000+00000 -000000000 0-0+00000+ 421 0000000000 0000000000 0-000+000- 000-000-00 00+0000++0 00++0000+0 481 ++0+0000-- 0+0+000-00 0-0000-+0+ 0-0000-000 0+00+00000 0-000-0+00 541 -0-0000-00 000++0000+ +000000+0+ -0+-000-00 0000-00000 000-0000+0 601 0+00000+00 00+0000000 00-0000000 0+00000-0+ 0000000000 0000000000 661 0000000000 00000+00++ ++000000+0 0+0000+-00 00-0000-00 0-00--0000 721 000+00-000 00-000+000 +00+000+00 0+-00-000- 00000000+- 0000+00000 781 -000000000 0000000000 000000000+ 0+00000000 00+00+0000 000-00000- 841 0000-00000 0000000000 0-000+000- 00+000+-00 00+0000-00 0-+0000000 901 0+000+000- 00000+0000 00000+0000 000-00-00+ 0000000+0+ -000000000 961 0000000000 000-00+-00 00000000-0 0+0-000--0 --0-0-0000 00-0000000 1021 00000000-0 0--00-0000 -000000-0+ 0000-+0000 -0000000+0 0-0-0--000 1081 0+0+000000 -000000000 000000000- 0000000000 000-00-000 000000000- 1141 000+000000 00+0-+0000 0000+-000- 0000+0+000 00000000-+ -000000+0- 1201 0-0000-000 000+000-00 0000000000 -000-00000 0-00000000 0000-+000+ 1261 000+00+000 0000000000 0000000000 0+000-+000 0000000-0+ 000-00-0+0 1321 0000000000 00+00000+0 000+00-+-0 000-0+0000 0000-0-0+- 00000+0000 1381 +000000000 0+00-0000- 00-0+0-000 00000+0000 00000-00-0 0000000000 1441 -0000-0000 0000+-00++ 0+00000000 0+0000-0+0 0000000000 +0++++0000 1501 00-00000-- 0-+00-0000 000 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 11/30 or 15/45 or 18/60): none Negative charge clusters (cmin = 11/30 or 14/45 or 17/60): none Mixed charge clusters (cmin = 17/30 or 22/45 or 27/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 45 | 10 | 10 | 14 | 13 | 12 | 17 | 8 | 10 | lmin1 7 | 7 | 9 | 54 | 13 | 12 | 16 | 15 | 15 | 20 | 9 | 12 | lmin2 8 | 8 | 11 | 61 | 14 | 14 | 18 | 17 | 17 | 22 | 11 | 14 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 4: 2, at 679; 1493; - runs >= 3: 0 * runs >= 5: 0 0 runs >= 30: 1, at 641; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-33-C-3-C-1-C-6-C-123-C-47-C-1-C-20-C-20-C-9-C-6-C-3-C-1-C-6-C-99-C-47-C-1-C-20-C-20-C-13-C-5-C-3-C-1-C-6-C-100-C-47-C-1-C-20-C-20-C-7-C-5-C-3-C-1-C-6-C-74-C-47-C-1-C-20-C-20-C-11-C-2-C-4-C-5-C-9-C-1-C-8-C-6-C-4-C-5-C-11-C-1-C-8-C-6-C-4-C-5-C-8-C-1-C-8-C-6-C-6-C-5-C-8-C-1-C-8-C-6-C-4-C-5-C-8-C-1-C-8-C-13-C-4-C-5-C-8-C-1-C-8-C-150-C-26-C-2-C-2-C-4-C-9-C-1-C-8-C-7-C-4-C-4-C-8-C-1-C-8-C-9-C-4-C-4-C-9-C-1-C-8-C-5-C-17-C-10-C-3-C-3-C-C-13-C-15-C-1-C-2-C-1-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-33-C-3-C-1-C-6-C-H-42-H-76-H-2-C-31-H-9-H-2-H-2-C-1-C-H-19-C-4-H-15-C-3-H-5-C-6-C-3-C-1-C-6-C-99-C-39-H-7-C-1-C-H-19-C-20-C-13-C-5-C-3-C-1-C-6-C-10-H-67-H-21-C-47-C-1-C-H-19-C-20-C-7-C-5-C-3-C-1-C-6-C-24-H-12-H-23-H-12-C-3-H-13-H-19-H-8-H-C-1-C-20-C-16-H-3-C-11-C-2-C-4-C-5-C-9-C-1-C-8-C-6-C-4-C-1-H-3-C-H-4-H-5-C-1-C-8-C-6-C-4-C-5-C-8-C-1-C-8-C-5-H-C-6-C-1-H-3-C-8-C-1-C-8-C-6-C-2-H-1-C-1-H-3-C-8-C-1-C-8-C-1-H-11-C-4-C-5-C-8-C-1-C-8-C-57-H-26-H-63-H-1-C-1-H-24-C-2-C-2-C-1-H-2-C-9-C-1-C-8-C-7-C-2-H-1-C-H-H-2-C-8-C-1-C-8-C-9-C-4-C-H-H-2-C-H-8-C-1-C-7-H-C-5-C-17-C-10-C-3-C-3-C-C-13-C-12-H-2-C-1-C-2-C-1-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 Aligned matching blocks: [ ]--------[ 219- 226]-( -6)-[ 221- 234] [ 407- 413]-( 27)-[ 441- 448]--------[ ] [ 633- 639]-( 29)-[ 669- 674]-( -6)-[ 669- 682] [ 407- 413] LLSLYDN [ 633- 639] LLSLYDN [ 219- 226] CDCHLAWL [ 441- 448] CDCHLKWL [ 669- 674] C__HLAWL [ 221- 234] CHLAWLSDWLRQRR [ 669- 682] CHLAWLGKWLRKRR ______________________________ [ 462- 467] ARCSSP [ 882- 887] ARCSSP _________________________ _____ [ 771- 775] LSALR [1288-1292] LSALR B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 70 (Expected range: 59--114) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 12 (6-10) 13 (11-20) 19 (>=21) 27 3. Clusters of amino acid multiplets (cmin = 9/30 or 12/45 or 14/60): none 4. Long amino acid altplets (>= 8; Letters/Length/Position): LA/8/16 B. CHARGE ALPHABET. 1. Total number of charge multiplets: 22 (Expected range: 12-- 45) 13 +plets (f+: 10.6%), 9 -plets (f-: 9.8%) Total number of charge altplets: 26 (Critical number: 50) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 4 (6-10) 2 (11-20) 2 (>=21) 15 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 2- 17 4 A... 4 4 0 16- 23 2 LA 4 4 0 53- 68 4 R... 4 4 0 114- 141 7 L...... 4 4 0 189- 192 1 N 4 4 0 272- 291 5 S.... 4 4 0 354- 441 8 L....... 9 5 ! 2 383- 386 1 L 4 4 0 466- 505 10 S.......R. 4 4 /0/./././././././1/./ 1280-1307 7 G...... 4 4 0 1372-1441 10 C......... 6 4 1 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 52- 219 8 i....... 19 10 ! 2 290- 441 8 i....... 17 13 ! 2 469- 498 5 *.00. 6 6 /0/./1/2/./ 482- 541 10 *......0.0 6 6 /0/././././././2/./2/ 556- 667 8 i00....0 13 10 ! /1/4/2/././././3/ 736- 777 7 i00.0.. 6 6 /0/2/2/./2/././ 774- 861 8 i00.0... 11 11 ! /0/2/2/./3/./././ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 47- 171 (1.) C( 124)C 2 of 107 0.0000 large 2. maximal spacing 336- 338 (1.) I( 2)I 73 of 73 0.0007 large minimal spacing 699- 701 (2.) I( 2)I 71 of 73 0.0007 matching minimum 911- 913 (3.) I( 2)I 72 of 73 0.0007 matching minimum 1154-1305 (4.) C( 151)C 1 of 107 0.0011 large 1. maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem27 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR Leucine Rich Repeat 310.7 1.7e-89 20 LRRCT Leucine rich repeat C-terminal domain 236.2 4.6e-67 4 EGF EGF-like domain 234.6 1.4e-66 9 LRRNT Leucine rich repeat N-terminal domain 124.5 2e-33 4 laminin_G Laminin G domain 88.6 4.2e-24 1 Cys_knot Cystine-knot domain 16.2 0.00099 1 BPL Biotin protein ligase catalytic domai 2.0 29 1 PDGF Platelet-derived growth factor (PDGF) 1.5 30 1 Arthro_defensin Arthropod defensin 1.2 10 1 SRP54 SRP54-type protein 0.9 31 1 gag_MA Matrix protein (MA), p15 -0.2 87 1 Agglutinin Lectin (probable mannose binding) -0.3 35 1 DUF139 Domain of unknown function DUF139 -3.3 88 1 UBA UBA domain -4.4 89 1 metalthio Metallothionein -7.9 4.1 1 TIG IPT/TIG domain -12.4 84 1 laminin_EGF Laminin EGF-like (Domains III and V) -12.8 6.3 1 Plexin_repeat Plexin repeat -15.9 1e+02 1 TIL Trypsin Inhibitor like cysteine rich -16.0 11 1 ldl_recept_a Low-density lipoprotein receptor doma -18.6 70 1 vwc von Willebrand factor type C domain -22.8 51 1 Flu_NS2 Influenza non-structural protein (NS2 -31.8 57 1 Metallothio_PEC Plant PEC family metallothionein -35.1 2.4 1 MannoseP_isomer Mannose-6-phosphate isomerase -132.0 75 1 HrcA HrcA protein -186.1 99 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRRNT 1/4 33 60 .. 1 31 [] 43.4 5e-09 LRR 1/20 62 85 .. 1 23 [] 15.5 1.3 UBA 1/1 68 103 .. 1 41 [] -4.4 89 LRR 2/20 86 109 .. 1 23 [] 23.5 0.005 LRR 3/20 110 133 .. 1 23 [] 18.1 0.21 LRR 4/20 134 157 .. 1 23 [] 23.8 0.0041 LRR 5/20 158 181 .. 1 23 [] 16.8 0.5 MannoseP_isomer 1/1 81 198 .. 1 169 [] -132.0 75 LRR 6/20 182 205 .. 1 23 [] 16.3 0.75 LRR 7/20 206 227 .. 1 23 [] 0.3 9.3e+02 LRRCT 1/4 215 264 .. 1 54 [] 70.0 5e-17 LRRNT 2/4 279 306 .. 1 31 [] 35.9 8.9e-07 LRR 8/20 308 331 .. 1 23 [] 9.1 46 LRR 9/20 332 355 .. 1 23 [] 25.5 0.0012 LRR 10/20 356 379 .. 1 23 [] 20.3 0.046 Flu_NS2 1/1 310 402 .. 1 115 [] -31.8 57 LRR 11/20 380 403 .. 1 23 [] 9.6 39 gag_MA 1/1 416 426 .. 1 12 [. -0.2 87 LRR 12/20 404 427 .. 1 23 [] 18.6 0.15 HrcA 1/1 75 436 .. 1 343 [] -186.1 99 LRRCT 2/4 437 486 .. 1 54 [] 57.3 3.3e-13 LRRNT 3/4 504 531 .. 1 31 [] 32.5 9.7e-06 LRR 13/20 558 581 .. 1 23 [] 19.8 0.064 BPL 1/1 562 604 .. 130 174 .] 2.0 29 LRR 14/20 582 605 .. 1 23 [] 16.5 0.62 LRR 15/20 606 629 .. 1 23 [] 14.5 2.5 LRR 16/20 630 653 .. 1 23 [] 16.4 0.67 LRRCT 3/4 663 712 .. 1 54 [] 61.0 2.5e-14 LRRNT 4/4 724 751 .. 1 31 [] 35.7 1.1e-06 LRR 17/20 753 775 .. 1 23 [] 14.1 3.4 LRR 18/20 776 799 .. 1 23 [] 19.6 0.074 LRR 19/20 800 823 .. 1 23 [] 22.0 0.014 TIG 1/1 740 846 .. 1 103 [] -12.4 84 LRR 20/20 824 847 .. 1 23 [] 26.1 0.00081 Agglutinin 1/1 853 860 .. 85 94 .] -0.3 35 SRP54 1/1 886 895 .. 287 296 .] 0.9 31 LRRCT 4/4 857 906 .. 1 54 [] 67.7 2.5e-16 EGF 1/9 920 952 .. 1 45 [] 28.2 0.00019 Plexin_repeat 1/1 915 953 .. 1 67 [] -15.9 1e+02 EGF 2/9 959 993 .. 1 45 [] 33.6 4.5e-06 laminin_EGF 1/1 964 1000 .. 1 59 [] -12.8 6.3 metalthio 1/1 961 1024 .. 1 67 [] -7.9 4.1 EGF 3/9 1000 1031 .. 1 45 [] 41.2 2.4e-08 TIL 1/1 984 1045 .. 1 67 [] -16.0 11 EGF 4/9 1038 1071 .. 1 45 [] 36.8 4.8e-07 Arthro_defensin 1/1 1068 1101 .. 1 36 [] 1.2 10 EGF 5/9 1078 1109 .. 1 45 [] 39.1 1e-07 EGF 6/9 1123 1154 .. 1 45 [] 37.5 3.1e-07 laminin_G 1/1 1186 1317 .. 1 161 [] 88.6 4.2e-24 EGF 7/9 1332 1364 .. 1 45 [] 23.2 0.0062 vwc 1/1 1305 1377 .. 1 84 [] -22.8 51 EGF 8/9 1372 1402 .. 1 45 [] 31.8 1.5e-05 ldl_recept_a 1/1 1370 1414 .. 1 43 [] -18.6 70 EGF 9/9 1412 1443 .. 1 45 [] 27.7 0.00027 Metallothio_PEC 1/1 1401 1469 .. 1 80 [] -35.1 2.4 DUF139 1/1 1472 1488 .. 1 17 [] -3.3 88 PDGF 1/1 1512 1519 .. 77 84 .] 1.5 30 Cys_knot 1/1 1464 1522 .. 14 88 .] 16.2 0.00099 Alignments of top-scoring domains: LRRNT: domain 1 of 4, from 33 to 60: score 43.4, E = 5e-09 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* aCp CtCs + +VdC+g+gL+ +vPr++P gi|4049589 33 ACPTKCTCS--AASVDCHGLGLR-AVPRGIP 60 LRR: domain 1 of 20, from 62 to 85: score 15.5, E = 1.3 *->nLeeLdLsnN.LtslppglfsnLp<-* n e+LdL++N++t + f++L+ gi|4049589 62 NAERLDLDRNnITRITKMDFAGLK 85 UBA: domain 1 of 1, from 68 to 103: score -4.4, E = 89 *->edeekieqLveMGFdreevvkALratngngverAaewLlsh<-* d +i ++ M F + + L+ +++ + ++ + gi|4049589 68 LDRNNITRITKMDFAGLKNLRVLHLEDNQ-----VSVIERG 103 LRR: domain 2 of 20, from 86 to 109: score 23.5, E = 0.005 *->nLeeLdLsnN.LtslppglfsnLp<-* nL++L+L++N+ + ++ g+f++L+ gi|4049589 86 NLRVLHLEDNqVSVIERGAFQDLK 109 LRR: domain 3 of 20, from 110 to 133: score 18.1, E = 0.21 *->nLeeLdLsnN.LtslppglfsnLp<-* +Le+L+L+ N+L+ lp+ lf++ p gi|4049589 110 QLERLRLNKNkLQVLPELLFQSTP 133 LRR: domain 4 of 20, from 134 to 157: score 23.8, E = 0.0041 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++LdLs N+++++p ++f++ + gi|4049589 134 KLTRLDLSENqIQGIPRKAFRGIT 157 LRR: domain 5 of 20, from 158 to 181: score 16.8, E = 0.5 *->nLeeLdLsnN.LtslppglfsnLp<-* + L L+nN+++ +++g+f+ L+ gi|4049589 158 DVKNLQLDNNhISCIEDGAFRALR 181 MannoseP_isomer: domain 1 of 1, from 81 to 198: score -132.0, E = 75 *->dseNsYvySesrLvAtvGvedLVVVeTkDA.vLVadrdkvQDVKkiV ++ G+++L V + +D +v V+ r+ QD K++ gi|4049589 81 -------FA--------GLKNLRVLHLEDNqVSVIERGAFQDLKQL- 111 eqLkangRtEhelHreVyRPWGsYdvideGeryhYqVKRItVKPGerLSl e+L++n+++ l ++ + + + +d +e+ q++ I P + ++ gi|4049589 112 ERLRLNKNKLQVLPELLFQSTPKLTRLDLSEN---QIQGI---PRKAFR- 154 QmHhHRAEHWVVVsGTAkVtvdeevflLtENEStYIPlGaiHrLENPGkI G+ V + + L N I Ga +L gi|4049589 155 --------------GITDV----KNLQLDNNHISCIEDGAFRALR----- 181 PLeLIEVQsGsYLGEDDIVRfeD<-* +Le++ + L I R+ gi|4049589 182 DLEIL---T---LNNNNISRILV 198 LRR: domain 6 of 20, from 182 to 205: score 16.3, E = 0.75 *->nLeeLdLsnN.LtslppglfsnLp<-* Le L L+nN+++ + +f+++p gi|4049589 182 DLEILTLNNNnISRILVTSFNHMP 205 LRR: domain 7 of 20, from 206 to 227: score 0.3, E = 9.3e+02 *->nLeeLdLsnN.LtslppglfsnLp<-* ++++L+L++N+L ++ L+ gi|4049589 206 KIRTLRLHSNhLYCDCH--LAWLS 227 LRRCT: domain 1 of 4, from 215 to 264: score 70.0, E = 5e-17 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell N+++CDC+L+wL++Wlr+ rr+ +q +++C+ P +lrG+ +++++ gi|4049589 215 NHLYCDCHLAWLSDWLRQ---RRTVGQ-FTLCMAPVHLRGFNVADVQ 257 psdfsCp<-* ++ +Cp gi|4049589 258 KKEYVCP 264 LRRNT: domain 2 of 4, from 279 to 306: score 35.9, E = 8.9e-07 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp +CtCs VdC+g+gL e+P lP gi|4049589 279 SCPSPCTCS--NNIVDCRGKGLM-EIPANLP 306 LRR: domain 8 of 20, from 308 to 331: score 9.1, E = 46 *->nLeeLdLsnN.LtslppglfsnLp<-* + e++L+ N+++ +p g+f + + gi|4049589 308 GIVEIRLEQNsIKAIPAGAFTQYK 331 LRR: domain 9 of 20, from 332 to 355: score 25.5, E = 0.0012 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+++d+s N+++++ p++f++L+ gi|4049589 332 KLKRIDISKNqISDIAPDAFQGLK 355 LRR: domain 10 of 20, from 356 to 379: score 20.3, E = 0.046 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++L L +N++t++ glf++L gi|4049589 356 SLTSLVLYGNkITEIAKGLFDGLV 379 Flu_NS2: domain 1 of 1, from 310 to 402: score -31.8, E = 57 *->dIlwRMsKMqlgSsthsSSvdlngiitQFEqLKLyrdSlgelVkrtG ++ R+ + g tQ+ +LK r+ gi|4049589 310 -VEIRLEQNSIKA-------IPAGAFTQYKKLK-----------RI- 336 DlHslqernrlwrEeLsqkfeeIrwlIlevrtvlantensfeqItfLqaL D ++ + + q + l+l ++ + f+ ++Lq+L gi|4049589 337 DI--SKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLL 384 qlLyEvedeirtfSfQli<-* l + r+ fQ + gi|4049589 385 LLNANKINCLRVNTFQDL 402 LRR: domain 11 of 20, from 380 to 403: score 9.6, E = 39 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+ L L+ N++ l ++f++L+ gi|4049589 380 SLQLLLLNANkINCLRVNTFQDLQ 403 gag_MA: domain 1 of 1, from 416 to 426: score -0.2, E = 87 *->GQtvtkGLLTPL<-* Qt +kGL PL gi|4049589 416 -QTISKGLFAPL 426 LRR: domain 12 of 20, from 404 to 427: score 18.6, E = 0.15 *->nLeeLdLsnN.LtslppglfsnLp<-* nL L+L +N+L++++ glf+ L+ gi|4049589 404 NLNLLSLYDNkLQTISKGLFAPLQ 427 HrcA: domain 1 of 1, from 75 to 436: score -186.1, E = 99 *->mlteRqlqILraIVedYiktgePVGSKtLsekhdialSsATIRNeMa ++t+ + L+ + +++++ + +e+ ++ +++ gi|4049589 75 RITKMDFAGLKNLRVLHLEDNQ----VSVIERGAFQ--------DLK 109 dLEklGyLekpHtSsGRvCPsekGYRyYVDhllevksltetekrqiksql +LE+l R+ ++ + l e+ ++ ++ + + + gi|4049589 110 QLERL-----------RL------NKNKLQVLPELLFQSTPKLTRLDLSE 142 ldVkgrqydldkllqkaakiLSeLtgl......talVlgPslekdilkhi + + + +ka + + ++ +l+ ++++++++ + +++ l+ + gi|4049589 143 N-------QIQGIPRKAFRGITDVKNLqldnnhISCIEDGAFR--ALRDL 183 elipldphraLaVlvtdsGh.Venriinlpan.................. e+ l+++ + +lvt+++h ++ r+++l n+ + + ++ +++ gi|4049589 184 EILTLNNNNISRILVTSFNHmPKIRTLRLHSNhlycdchlawlsdwlrqr 233 ............................................isesdL ++ ++ + + + ++ + + ++++ + +++++++ + ++is + gi|4049589 234 rtvgqftlcmapvhlrgfnvadvqkkeyvcpaphseppscnansISCPSP 283 aiisnilndrLvGvklsEinqqledeleqilqkyg......l.Aadfvdv ++sn + d +G+ l Ei +l + +++i+ ++++ + + +A+ +++ gi|4049589 284 CTCSNNIVD-CRGKGLMEIPANLPEGIVEIRLEQNsikaipAgAFTQYKK 332 vlqillstldesseerlivmGgknlllnqpeFeDieklrqiLsllEderv +++i +s +++s++ + + G ++ l + +k+++i + l d v gi|4049589 333 LKRIDISKNQISDIAPDAFQG-LKSLTSLVLYG--NKITEIAKGLFDGLV 379 lleLlniyednkgdegvtirIGnEndlksmsgcSvItapYsigekkplGt l+Ll + ++ + +++ + dl +++ +S+ ++ + k G gi|4049589 380 SLQLLLLNANKI--NCLRVNTF--QDLQNLNLLSLYDNKLQTISK---GL 422 lgviGPtRmdYqrviplldlva<-* ++ + + i++l l gi|4049589 423 FAPL--------QSIQTLHLAQ 436 LRRCT: domain 2 of 4, from 437 to 486: score 57.3, E = 3.3e-13 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NPf CDC+L+wL++ l++ + +e + +rC+sP +l+ +++ ++ gi|4049589 437 NPFVCDCHLKWLADYLQD---NPIETS-GARCSSPRRLANKRISQIK 479 psdfsCp<-* ++f C+ gi|4049589 480 SKKFRCS 486 LRRNT: domain 3 of 4, from 504 to 531: score 32.5, E = 9.7e-06 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp C+C+ g+ VdCs+++L ++P+ lP gi|4049589 504 VCPEKCRCE--GTIVDCSNQKLV-RIPSHLP 531 LRR: domain 13 of 20, from 558 to 581: score 19.8, E = 0.064 *->nLeeLdLsnN.LtslppglfsnLp<-* nL++++LsnN+++++ +g+f++ gi|4049589 558 NLRKINLSNNkIKEVREGAFDGAA 581 BPL: domain 1 of 1, from 562 to 604: score 2.0, E = 29 *->iNvnn....dkPeellnpgitpatslkeilgkdldidreellaellr iN++n++ +++ e++ ++ a+s++e+ + +e++ ++++r gi|4049589 562 INLSNnkikEVREGAFDG----AASVQELMLT--GNQLETVHGRVFR 602 nf<-* + gi|4049589 603 GL 604 LRR: domain 14 of 20, from 582 to 605: score 16.5, E = 0.62 *->nLeeLdLsnN.LtslppglfsnLp<-* + +eL L +N+L++++ +f++L+ gi|4049589 582 SVQELMLTGNqLETVHGRVFRGLS 605 LRR: domain 15 of 20, from 606 to 629: score 14.5, E = 2.5 *->nLeeLdLsnN.LtslppglfsnLp<-* L++L L++N + +++++f++L+ gi|4049589 606 GLKTLMLRSNlIGCVSNDTFAGLS 629 LRR: domain 16 of 20, from 630 to 653: score 16.4, E = 0.67 *->nLeeLdLsnN.LtslppglfsnLp<-* + + L+L +N++t++ pg+f L gi|4049589 630 SVRLLSLYDNrITTITPGAFTTLV 653 LRRCT: domain 3 of 4, from 663 to 712: score 61.0, E = 2.5e-14 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NPfnC+C+L+wL Wlr+ rr+ + ++rC+ P l+ p+ ++ gi|4049589 663 NPFNCNCHLAWLGKWLRK---RRIVSG-NPRCQKPFFLKEIPIQDVA 705 psdfsCp<-* +df+C+ gi|4049589 706 IQDFTCD 712 LRRNT: domain 4 of 4, from 724 to 751: score 35.7, E = 1.1e-06 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp +CtC ++vV Cs++gL+ ++Pr+ P gi|4049589 724 RCPEQCTCM--ETVVRCSNKGLR-ALPRGMP 751 LRR: domain 17 of 20, from 753 to 775: score 14.1, E = 3.4 *->nLeeLdLsnN.LtslppglfsnLp<-* +eL+L++N+Lt +p + +s L+ gi|4049589 753 DVTELYLEGNhLTAVPRE-LSALR 775 LRR: domain 18 of 20, from 776 to 799: score 19.6, E = 0.074 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+ +dLsnN+++ l + +fsn++ gi|4049589 776 HLTLIDLSNNsISMLTNYTFSNMS 799 LRR: domain 19 of 20, from 800 to 823: score 22.0, E = 0.014 *->nLeeLdLsnN.LtslppglfsnLp<-* +L +L Ls+N+L+ +p ++f++L+ gi|4049589 800 HLSTLILSYNrLRCIPVHAFNGLR 823 TIG: domain 1 of 1, from 740 to 846: score -12.4, E = 84 *->PvItsisPssgplsGGteiTitGsnLgsgsrPrirkntfededikVt + ++ P +g+ + te+++ G +L++++ + ++ + gi|4049589 740 NKGLRALP-RGMPKDVTELYLEGNHLTAVP-----------RELSAL 774 fggvp........Cdvvssess............tqivCktPpva.pgps + + ++++ +++++ + s+ ++ ++ + + ++C +P a g + gi|4049589 775 RHLTLidlsnnsiSMLTNYTFSnmshlstlilsyNRLRC-IPVHAfNGLR 823 ppvvtvsvdgggsgdgrssspvtFtYv<-* ++v +g+ d+ s +++F+ + gi|4049589 824 S--LRVLTLHGN--DISSVPEGSFNDL 846 LRR: domain 20 of 20, from 824 to 847: score 26.1, E = 0.00081 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++L L++N+++s+p+g+f++L+ gi|4049589 824 SLRVLTLHGNdISSVPEGSFNDLT 847 Agglutinin: domain 1 of 1, from 853 to 860: score -0.3, E = 35 *->ATGtnglPLr<-* A+Gtn PL+ gi|4049589 853 ALGTN--PLH 860 SRP54: domain 1 of 1, from 886 to 895: score 0.9, E = 31 *->dperfvsrLL<-* +pe +++rLL gi|4049589 886 SPEPMADRLL 895 LRRCT: domain 4 of 4, from 857 to 906: score 67.7, E = 2.5e-16 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NP++CDC+LrwL++W+++ ++e++ +rC+sPe++++ +ll+ + gi|4049589 857 NPLHCDCSLRWLSEWVKA---GYKEPG-IARCSSPEPMAD-RLLLTT 898 p.sdfsCp<-* p+ +f+C+ gi|4049589 899 PtHRFQCK 906 EGF: domain 1 of 9, from 920 to 952: score 28.2, E = 0.00019 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + pC+n+GtC+ p + y+C Cp y Gk+C gi|4049589 920 CLSS-PCKNNGTCTQDP------VELYRCACPYS-----YKGKDC 952 * gi|4049589 - - Plexin_repeat: domain 1 of 1, from 915 to 953: score -15.9, E = 1e+02 *->nCsqhRqsCssClsapdpfgCgWCvsrkrCtrreaygpsfpyg.eCs C +Cls+p C++++ Ct+ + ++y +C+ gi|4049589 915 ------AKCNACLSSP-------CKNNGTCTQDP----VELYRcACP 944 rgekpedNswlgwsqsseqCp<-* ++ + ++C gi|4049589 945 YSY------------KGKDCT 953 EGF: domain 2 of 9, from 959 to 993: score 33.6, E = 4.5e-06 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C +n pC++gGtC+ + + ++g++C Cp G + G rC gi|4049589 959 CIQN-PCQHGGTCHLSDS----HKDGFSCSCPLG-----FEGQRC 993 * gi|4049589 - - laminin_EGF: domain 1 of 1, from 964 to 1000: score -12.8, E = 6.3 *->CdCnphGslsddtCdsddelf..geetGqClkCkpnvtGrrCdrCkp C+ hG +tC++ +++ + + C+ C +++G+rC+ gi|4049589 964 CQ---HG----GTCHL----SdsHKDGFSCS-CPLGFEGQRCEI--- 995 GyyglpsgdpgqgC<-* +p +C gi|4049589 996 --------NPD-DC 1000 metalthio: domain 1 of 1, from 961 to 1024: score -7.9, E = 4.1 *->mDPqnCsCatggsCtCgtsCk.CknCkCtsCkKs.ccsCcPagC..s P C+ gg+C ++s k++ +C C+ ++ c P +C++ gi|4049589 961 QNP----CQHGGTCHLSDSHKdGFSCSCPLGFEGqRCEINPDDCedN 1003 kCaqgCvCkgggaasetskCsCCa<-* +C C + ++ C C + gi|4049589 1004 DCENNATCVD---GINNYVCICPP 1024 EGF: domain 3 of 9, from 1000 to 1031: score 41.2, E = 2.4e-08 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C +n C n++tCv++ + y C+Cpp ytG+ C gi|4049589 1000 CEDN-DCENNATCVDGIN-------NYVCICPPN-----YTGELC 1031 * gi|4049589 - - TIL: domain 1 of 1, from 984 to 1045: score -16.0, E = 11 *->CpaneqyteCgpsCepsCsnpdgplettppCegtSpkvPstCkeg.. Cp + ++C + C++ Ce +tC+ g + gi|4049589 984 CPLGFEGQRCE-INPDDCED--------NDCE-----NNATCVDGin 1016 ...CvCqpGy...VrnndgdkCVp.rseC<-* + C+C+p y++++ ++ +d+CVp+ + C gi|4049589 1017 nyvCICPPNYtgeLCDEVIDHCVPeLNLC 1045 EGF: domain 4 of 9, from 1038 to 1071: score 36.8, E = 4.8e-07 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C p+ + C++ ++C+++ g++CeC pG y+Gk C gi|4049589 1038 CVPELnLCQHEAKCIPLDK-------GFSCECVPG-----YSGKLC 1071 -* gi|4049589 - - Arthro_defensin: domain 1 of 1, from 1068 to 1101: score 1.2, E = 10 *->gkgcpvNhsaCaaHClakGGrrGGyCng.lka.vCvCR<-* gk c+ ++ C aH k r G++C ++ ++ +C+C gi|4049589 1068 GKLCETDNDDCVAH---KC-RHGAQCVDtINGyTCTCP 1101 EGF: domain 5 of 9, from 1078 to 1109: score 39.1, E = 1e-07 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + C +g++Cv+t + gytC+Cp+G ++G++C gi|4049589 1078 CVAH-KCRHGAQCVDTIN-------GYTCTCPQG-----FSGPFC 1109 * gi|4049589 - - EGF: domain 6 of 9, from 1123 to 1154: score 37.5, E = 3.1e-07 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C++ C+ng++C+ + + ++tC+CppG ++G+rC gi|4049589 1123 CDQY-ECQNGAQCIVVQQ-------EPTCRCPPG-----FAGPRC 1154 * gi|4049589 - - laminin_G: domain 1 of 1, from 1186 to 1317: score 88.6, E = 4.2e-24 *->FRTtepsGlLllgYggtntdrggkkeigdDFlaleLvdGrlevsydl ++T + +G+Ll Y g+n+ aleL G++++ yd gi|4049589 1186 VATDKDNGILL--YKGDNDPL-----------ALELYQGHVRLVYDS 1219 GsghrlrpavvrsgdrvlnDGkWHrvelerngrkgtLsVdgeepskktls s p ++++ ++++nDG+ H+vel+ ++ ++L Vd++ p gi|4049589 1220 LSS----PPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTP------ 1259 etvvdgespsgpdvtsenLdldtppiLyvGGlPeqksvkrrlaaist... + +g+++++ + + p Ly+GG+P +s+ ++++++t+++ gi|4049589 1260 -------KSLGKLQKQPAVGINSP--LYLGGIP--TSTGLSALRQGTdrp 1298 ..sFkGCirdvsingkpld<-* ++F+GCi++v+in++++d gi|4049589 1299 lgGFHGCIHEVRINNELQD 1317 EGF: domain 7 of 9, from 1332 to 1364: score 23.2, E = 0.0062 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + ++C++ G C++++ ++ CeC pG +tG+ C gi|4049589 1332 CKSCTVCKH-GLCRSVEK------DSVVCECRPG-----WTGPLC 1364 * gi|4049589 - - vwc: domain 1 of 1, from 1305 to 1377: score -22.8, E = 51 *->CvqnGvvYengetWkpdsq.....PnGvdkCtyiCtCddiedavrlg C++ ++ ++ + +k ++++ + ++G +Ct +C+ l gi|4049589 1305 CIHEVRINNELQDFKALPPqslgvSPGCKSCT---VCKH-----GLC 1343 gkvlCdkitCppel.LpsldCpnprrvdalvippGECCpewvC<-* + v+ d ++C+ + +++ C++ + + C + +C gi|4049589 1344 RSVEKDSVVCECRPgWTGPLCDQE--------ARDPCLG-HRC 1377 EGF: domain 8 of 9, from 1372 to 1402: score 31.8, E = 1.5e-05 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + C++ G+Cv t+ +y C+C++G y G+ C gi|4049589 1372 CLGH-RCHH-GKCVATGT-------SYMCKCAEG-----YGGDLC 1402 * gi|4049589 - - ldl_recept_a: domain 1 of 1, from 1370 to 1414: score -18.6, E = 70 *->stCggpdeFqCgsgrrCIpr..swvC.....DGdpDCeDGSDEslen ++C ++C++g+ C++ ++s+ C+ ++ G + C++ D gi|4049589 1370 DPCL---GHRCHHGK-CVATgtSYMCkcaegYGGDLCDNKNDS-ANA 1411 Caa<-* C+a gi|4049589 1412 CSA 1414 EGF: domain 9 of 9, from 1412 to 1443: score 27.7, E = 0.00027 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+ C++ G+C+ + + g+++C C+pG ++G++C gi|4049589 1412 CSAF-KCHH-GQCHISDQ------GEPYCLCQPG-----FSGEHC 1443 * gi|4049589 - - Metallothio_PEC: domain 1 of 1, from 1401 to 1469: score -35.1, E = 2.4 *->gCDDkCGCpsPCPGGkaCRCtsggaaeAsaGdqEHttCpC.....GE CD k + C a C g + dq+ C C+++ +GE gi|4049589 1401 LCDNKNDSANAC---SAFKCHHG---QCHISDQGEPYCLCqpgfsGE 1441 HCGC.NPCtCpksetptgrkgrRAnCsCGagCtCasCAS<-* HC +NPC + r + asCA+ gi|4049589 1442 HCQQeNPCL-GQVVREVIRRQK----------GYASCAT 1469 DUF139: domain 1 of 1, from 1472 to 1488: score -3.3, E = 88 *->aqCapaCqpsCqssCvq<-* + + C C ++C q gi|4049589 1472 KVPIMECRGGCGPQCCQ 1488 PDGF: domain 1 of 1, from 1512 to 1519: score 1.5, E = 30 *->eqHlkCeC<-* e Hl+C C gi|4049589 1512 ERHLECGC 1519 Cys_knot: domain 1 of 1, from 1464 to 1522: score 16.2, E = 0.00099 *->kegCtscktvnttiCaGyCytkdpvykdprsllkQcvCcypevtyet ++ C+ +v + C G C+ p+ Cc+p++++++ gi|4049589 1464 YASCATASKVPIMECRGGCG--------PQ-------CCQPTRSKRR 1495 kvlpgCppGvdptvtYpvAlsChCgdkC<-* k+++ C++G+ ++ ++ l+C C +C gi|4049589 1496 KYVFQCTDGSSFVEEVERHLECGCL-AC 1522 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem27 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR Leucine Rich Repeat 272.5 5.7e-78 20 LRRCT Leucine rich repeat C-terminal domain 221.6 8e-73 5 EGF EGF-like domain 219.5 5.1e-62 9 LRRNT Leucine rich repeat N-terminal domain 118.2 1.5e-31 5 laminin_G Laminin G domain 88.6 4.2e-24 1 Cys_knot Cystine-knot domain 16.2 0.00099 1 DSL Delta serrate ligand 9.5 0.15 4 S_mold_repeat Dictyostelium (slime mold) repeats 3.3 53 1 crystall Beta/Gamma crystallin 2.8 53 1 TNFR_c6 TNFR/NGFR cysteine-rich region 2.1 45 1 BPL Biotin protein ligase catalytic domain 2.0 29 1 UVR UvrB/uvrC motif 1.6 92 1 PDGF Platelet-derived growth factor (PDGF) 1.5 30 1 dCMP_cyt_deam Cytidine and deoxycytidylate deaminase 1.2 43 1 MarR MarR family 1.1 64 1 SRP54 SRP54-type protein 0.9 31 1 rrm RNA recognition motif. (a.k.a. RRM, RBD 0.9 86 1 CX CX module 0.5 68 1 DUF41 Domain of unknown function DUF41 0.4 42 1 gag_MA Matrix protein (MA), p15 -0.2 87 1 Agglutinin Lectin (probable mannose binding) -0.3 35 1 arf ADP-ribosylation factor family -0.4 87 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRRNT 1/5 33 60 .. 1 31 [] 41.5 8.9e-10 MarR 1/1 63 78 .. 26 41 .. 1.1 64 LRR 1/20 62 85 .. 1 23 [] 13.5 0.065 LRR 2/20 86 109 .. 1 23 [] 21.5 0.00036 LRR 3/20 110 133 .. 1 23 [] 16.1 0.012 LRR 4/20 134 157 .. 1 23 [] 21.8 0.0003 LRR 5/20 158 181 .. 1 23 [] 14.9 0.028 LRR 6/20 182 205 .. 1 23 [] 14.3 0.04 LRR 7/20 206 217 .. 1 11 [. 0.8 2.6e+02 LRRCT 1/5 215 264 .. 1 54 [] 68.1 1.7e-21 dCMP_cyt_deam 1/1 282 298 .. 77 100 .] 1.2 43 LRRNT 2/5 279 306 .. 1 31 [] 34.1 1.1e-07 LRR 8/20 308 331 .. 1 23 [] 7.2 4.2 LRR 9/20 332 355 .. 1 23 [] 23.5 9.8e-05 LRR 10/20 356 379 .. 1 23 [] 18.3 0.0029 LRR 11/20 380 403 .. 1 23 [] 7.7 3 gag_MA 1/1 416 426 .. 1 12 [. -0.2 87 LRR 12/20 404 427 .. 1 23 [] 16.6 0.0088 arf 1/1 445 455 .. 181 191 .] -0.4 87 LRRCT 2/5 437 486 .. 1 54 [] 55.4 3.1e-17 UVR 1/1 472 491 .. 1 20 [. 1.6 92 LRRNT 3/5 504 531 .. 1 31 [] 30.6 1e-06 LRR 13/20 558 581 .. 1 23 [] 17.8 0.004 BPL 1/1 562 604 .. 130 174 .] 2.0 29 LRR 14/20 582 605 .. 1 23 [] 14.6 0.034 LRR 15/20 606 629 .. 1 23 [] 12.5 0.12 LRR 16/20 630 653 .. 1 23 [] 14.4 0.036 LRRCT 3/5 663 712 .. 1 54 [] 59.1 1.7e-18 LRRNT 4/5 724 751 .. 1 31 [] 33.8 1.3e-07 LRRNT 5/5 764 772 .. 22 31 .] 0.7 2.5e+02 LRR 17/20 753 775 .. 1 23 [] 12.1 0.16 LRR 18/20 776 799 .. 1 23 [] 17.6 0.0045 DUF41 1/1 790 801 .. 1 16 [. 0.4 42 LRR 19/20 800 823 .. 1 23 [] 20.0 0.00095 LRR 20/20 824 847 .. 1 23 [] 24.1 6.6e-05 Agglutinin 1/1 853 860 .. 85 94 .] -0.3 35 SRP54 1/1 886 895 .. 287 296 .] 0.9 31 LRRCT 4/5 857 906 .. 1 54 [] 65.8 1e-20 S_mold_repeat 1/1 918 941 .. 1 24 [] 3.3 53 EGF 1/9 920 952 .. 1 45 [] 26.5 8.7e-06 EGF 2/9 959 993 .. 1 45 [] 31.9 2.7e-07 crystall 1/1 987 1000 .. 9 23 .. 2.8 53 EGF 3/9 1000 1031 .. 1 45 [] 39.5 1.9e-09 EGF 4/9 1038 1071 .. 1 45 [] 35.2 3.2e-08 DSL 1/4 1099 1109 .. 57 67 .] 0.6 81 EGF 5/9 1078 1109 .. 1 45 [] 37.4 7.5e-09 LRRCT 5/5 1107 1129 .. 30 54 .] -2.1 5.1e+02 DSL 2/4 1144 1154 .. 57 67 .] 2.8 17 EGF 6/9 1123 1154 .. 1 45 [] 35.8 2.1e-08 rrm 1/1 1277 1293 .. 1 17 [. 0.9 86 laminin_G 1/1 1186 1317 .. 1 161 [] 88.6 4.2e-24 TNFR_c6 1/1 1332 1342 .. 16 28 .. 2.1 45 DSL 3/4 1355 1364 .. 58 67 .] 6.3 1.5 EGF 7/9 1332 1364 .. 1 45 [] 21.5 0.00023 EGF 8/9 1372 1402 .. 1 45 [] 30.2 8.1e-07 EGF 9/9 1412 1443 .. 1 45 [] 26.0 1.2e-05 DSL 4/4 1433 1443 .. 57 67 .] 5.1 3.3 CX 1/1 1482 1490 .. 61 69 .] 0.5 68 PDGF 1/1 1512 1519 .. 77 84 .] 1.5 30 Cys_knot 1/1 1464 1522 .. 14 88 .] 16.2 0.00099 Alignments of top-scoring domains: LRRNT: domain 1 of 5, from 33 to 60: score 41.5, E = 8.9e-10 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* aCp CtCs + +VdC+g+gL+ +vPr++P gi|4049589 33 ACPTKCTCS--AASVDCHGLGLR-AVPRGIP 60 MarR: domain 1 of 1, from 63 to 78: score 1.1, E = 64 *->AeklgvdrstvTrvld<-* Ae+l++dr ++Tr+++ gi|4049589 63 AERLDLDRNNITRITK 78 LRR: domain 1 of 20, from 62 to 85: score 13.5, E = 0.065 *->nLeeLdLsnN.LtslppglfsnLp<-* n e+LdL++N++t + f++L+ gi|4049589 62 NAERLDLDRNnITRITKMDFAGLK 85 LRR: domain 2 of 20, from 86 to 109: score 21.5, E = 0.00036 *->nLeeLdLsnN.LtslppglfsnLp<-* nL++L+L++N+ + ++ g+f++L+ gi|4049589 86 NLRVLHLEDNqVSVIERGAFQDLK 109 LRR: domain 3 of 20, from 110 to 133: score 16.1, E = 0.012 *->nLeeLdLsnN.LtslppglfsnLp<-* +Le+L+L+ N+L+ lp+ lf++ p gi|4049589 110 QLERLRLNKNkLQVLPELLFQSTP 133 LRR: domain 4 of 20, from 134 to 157: score 21.8, E = 0.0003 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++LdLs N+++++p ++f++ + gi|4049589 134 KLTRLDLSENqIQGIPRKAFRGIT 157 LRR: domain 5 of 20, from 158 to 181: score 14.9, E = 0.028 *->nLeeLdLsnN.LtslppglfsnLp<-* + L L+nN+++ +++g+f+ L+ gi|4049589 158 DVKNLQLDNNhISCIEDGAFRALR 181 LRR: domain 6 of 20, from 182 to 205: score 14.3, E = 0.04 *->nLeeLdLsnN.LtslppglfsnLp<-* Le L L+nN+++ + +f+++p gi|4049589 182 DLEILTLNNNnISRILVTSFNHMP 205 LRR: domain 7 of 20, from 206 to 217: score 0.8, E = 2.6e+02 *->nLeeLdLsnN.L<-* ++++L+L++N+L gi|4049589 206 KIRTLRLHSNhL 217 LRRCT: domain 1 of 5, from 215 to 264: score 68.1, E = 1.7e-21 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell N+++CDC+L+wL++Wlr+ rr+ +q +++C+ P +lrG+ +++++ gi|4049589 215 NHLYCDCHLAWLSDWLRQ---RRTVGQ-FTLCMAPVHLRGFNVADVQ 257 psdfsCp<-* ++ +Cp gi|4049589 258 KKEYVCP 264 dCMP_cyt_deam: domain 1 of 1, from 282 to 298: score 1.2, E = 43 *->ePCgHyGRTpmCaqaiiesgikkV<-* +PC +C+ i+ ++ k + gi|4049589 282 SPC-------TCSNNIVDCRGKGL 298 LRRNT: domain 2 of 5, from 279 to 306: score 34.1, E = 1.1e-07 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp +CtCs VdC+g+gL e+P lP gi|4049589 279 SCPSPCTCS--NNIVDCRGKGLM-EIPANLP 306 LRR: domain 8 of 20, from 308 to 331: score 7.2, E = 4.2 *->nLeeLdLsnN.LtslppglfsnLp<-* + e++L+ N+++ +p g+f + + gi|4049589 308 GIVEIRLEQNsIKAIPAGAFTQYK 331 LRR: domain 9 of 20, from 332 to 355: score 23.5, E = 9.8e-05 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+++d+s N+++++ p++f++L+ gi|4049589 332 KLKRIDISKNqISDIAPDAFQGLK 355 LRR: domain 10 of 20, from 356 to 379: score 18.3, E = 0.0029 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++L L +N++t++ glf++L gi|4049589 356 SLTSLVLYGNkITEIAKGLFDGLV 379 LRR: domain 11 of 20, from 380 to 403: score 7.7, E = 3 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+ L L+ N++ l ++f++L+ gi|4049589 380 SLQLLLLNANkINCLRVNTFQDLQ 403 gag_MA: domain 1 of 1, from 416 to 426: score -0.2, E = 87 *->GQtvtkGLLTPL<-* Qt +kGL PL gi|4049589 416 -QTISKGLFAPL 426 LRR: domain 12 of 20, from 404 to 427: score 16.6, E = 0.0088 *->nLeeLdLsnN.LtslppglfsnLp<-* nL L+L +N+L++++ glf+ L+ gi|4049589 404 NLNLLSLYDNkLQTISKGLFAPLQ 427 arf: domain 1 of 1, from 445 to 455: score -0.4, E = 87 *->ldWLsnnikkr<-* l+WL++++++ gi|4049589 445 LKWLADYLQDN 455 LRRCT: domain 2 of 5, from 437 to 486: score 55.4, E = 3.1e-17 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NPf CDC+L+wL++ l++ + +e + +rC+sP +l+ +++ ++ gi|4049589 437 NPFVCDCHLKWLADYLQD---NPIETS-GARCSSPRRLANKRISQIK 479 psdfsCp<-* ++f C+ gi|4049589 480 SKKFRCS 486 UVR: domain 1 of 1, from 472 to 491: score 1.6, E = 92 *->ekeikeLekeMeeAaenldF<-* +k+i +++ ++ +++ ++d+ gi|4049589 472 NKRISQIKSKKFRCSGSEDY 491 LRRNT: domain 3 of 5, from 504 to 531: score 30.6, E = 1e-06 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp C+C+ g+ VdCs+++L ++P+ lP gi|4049589 504 VCPEKCRCE--GTIVDCSNQKLV-RIPSHLP 531 LRR: domain 13 of 20, from 558 to 581: score 17.8, E = 0.004 *->nLeeLdLsnN.LtslppglfsnLp<-* nL++++LsnN+++++ +g+f++ gi|4049589 558 NLRKINLSNNkIKEVREGAFDGAA 581 BPL: domain 1 of 1, from 562 to 604: score 2.0, E = 29 *->iNvnn....dkPeellnpgitpatslkeilgkdldidreellaellr iN++n++ +++ e++ ++ a+s++e+ + +e++ ++++r gi|4049589 562 INLSNnkikEVREGAFDG----AASVQELMLT--GNQLETVHGRVFR 602 nf<-* + gi|4049589 603 GL 604 LRR: domain 14 of 20, from 582 to 605: score 14.6, E = 0.034 *->nLeeLdLsnN.LtslppglfsnLp<-* + +eL L +N+L++++ +f++L+ gi|4049589 582 SVQELMLTGNqLETVHGRVFRGLS 605 LRR: domain 15 of 20, from 606 to 629: score 12.5, E = 0.12 *->nLeeLdLsnN.LtslppglfsnLp<-* L++L L++N + +++++f++L+ gi|4049589 606 GLKTLMLRSNlIGCVSNDTFAGLS 629 LRR: domain 16 of 20, from 630 to 653: score 14.4, E = 0.036 *->nLeeLdLsnN.LtslppglfsnLp<-* + + L+L +N++t++ pg+f L gi|4049589 630 SVRLLSLYDNrITTITPGAFTTLV 653 LRRCT: domain 3 of 5, from 663 to 712: score 59.1, E = 1.7e-18 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NPfnC+C+L+wL Wlr+ rr+ + ++rC+ P l+ p+ ++ gi|4049589 663 NPFNCNCHLAWLGKWLRK---RRIVSG-NPRCQKPFFLKEIPIQDVA 705 psdfsCp<-* +df+C+ gi|4049589 706 IQDFTCD 712 LRRNT: domain 4 of 5, from 724 to 751: score 33.8, E = 1.3e-07 *->aCpreCtCspfglvVdCsgrgLtlevPrdlP<-* Cp +CtC ++vV Cs++gL+ ++Pr+ P gi|4049589 724 RCPEQCTCM--ETVVRCSNKGLR-ALPRGMP 751 LRRNT: domain 5 of 5, from 764 to 772: score 0.7, E = 2.5e+02 *->LtlevPrdlP<-* Lt +vPr+l+ gi|4049589 764 LT-AVPRELS 772 LRR: domain 17 of 20, from 753 to 775: score 12.1, E = 0.16 *->nLeeLdLsnN.LtslppglfsnLp<-* +eL+L++N+Lt +p + +s L+ gi|4049589 753 DVTELYLEGNhLTAVPRE-LSALR 775 LRR: domain 18 of 20, from 776 to 799: score 17.6, E = 0.0045 *->nLeeLdLsnN.LtslppglfsnLp<-* +L+ +dLsnN+++ l + +fsn++ gi|4049589 776 HLTLIDLSNNsISMLTNYTFSNMS 799 DUF41: domain 1 of 1, from 790 to 801: score 0.4, E = 42 *->lteeQLlstFsNvkhl<-* lt+ +tFsN+ hl gi|4049589 790 LTN----YTFSNMSHL 801 LRR: domain 19 of 20, from 800 to 823: score 20.0, E = 0.00095 *->nLeeLdLsnN.LtslppglfsnLp<-* +L +L Ls+N+L+ +p ++f++L+ gi|4049589 800 HLSTLILSYNrLRCIPVHAFNGLR 823 LRR: domain 20 of 20, from 824 to 847: score 24.1, E = 6.6e-05 *->nLeeLdLsnN.LtslppglfsnLp<-* +L++L L++N+++s+p+g+f++L+ gi|4049589 824 SLRVLTLHGNdISSVPEGSFNDLT 847 Agglutinin: domain 1 of 1, from 853 to 860: score -0.3, E = 35 *->ATGtnglPLr<-* A+Gtn PL+ gi|4049589 853 ALGTN--PLH 860 SRP54: domain 1 of 1, from 886 to 895: score 0.9, E = 31 *->dperfvsrLL<-* +pe +++rLL gi|4049589 886 SPEPMADRLL 895 LRRCT: domain 4 of 5, from 857 to 906: score 65.8, E = 1e-20 *->NPfnCDCeLrwLlrWlretnprrledqedlrCasPeslrGqpllell NP++CDC+LrwL++W+++ ++e++ +rC+sPe++++ +ll+ + gi|4049589 857 NPLHCDCSLRWLSEWVKA---GYKEPG-IARCSSPEPMAD-RLLLTT 898 p.sdfsCp<-* p+ +f+C+ gi|4049589 899 PtHRFQCK 906 S_mold_repeat: domain 1 of 1, from 918 to 941: score 3.3, E = 53 *->nactvDsCsnstgvthtpiacddn<-* nac C+n++++t+ p+++ gi|4049589 918 NACLSSPCKNNGTCTQDPVELYRC 941 EGF: domain 1 of 9, from 920 to 952: score 26.5, E = 8.7e-06 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + pC+n+GtC+ p + y+C Cp y Gk+C gi|4049589 920 CLSS-PCKNNGTCTQDP------VELYRCACPYS-----YKGKDC 952 * gi|4049589 - - EGF: domain 2 of 9, from 959 to 993: score 31.9, E = 2.7e-07 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C +n pC++gGtC+ + + ++g++C Cp G + G rC gi|4049589 959 CIQN-PCQHGGTCHLSDS----HKDGFSCSCPLG-----FEGQRC 993 * gi|4049589 - - crystall: domain 1 of 1, from 987 to 1000: score 2.8, E = 53 *->nFqGrmyEltpGdDC<-* +F G+++E++p dDC gi|4049589 987 GFEGQRCEINP-DDC 1000 EGF: domain 3 of 9, from 1000 to 1031: score 39.5, E = 1.9e-09 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C +n C n++tCv++ + y C+Cpp ytG+ C gi|4049589 1000 CEDN-DCENNATCVDGIN-------NYVCICPPN-----YTGELC 1031 * gi|4049589 - - EGF: domain 4 of 9, from 1038 to 1071: score 35.2, E = 3.2e-08 *->Capnn.pCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC< C p+ + C++ ++C+++ g++CeC pG y+Gk C gi|4049589 1038 CVPELnLCQHEAKCIPLDK-------GFSCECVPG-----YSGKLC 1071 -* gi|4049589 - - DSL: domain 1 of 4, from 1099 to 1109: score 0.6, E = 81 *->lCleGWkGeyC<-* +C+ G +G+ C gi|4049589 1099 TCPQGFSGPFC 1109 EGF: domain 5 of 9, from 1078 to 1109: score 37.4, E = 7.5e-09 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + C +g++Cv+t + gytC+Cp+G ++G++C gi|4049589 1078 CVAH-KCRHGAQCVDTIN-------GYTCTCPQG-----FSGPFC 1109 * gi|4049589 - - LRRCT: domain 5 of 5, from 1107 to 1129: score -2.1, E = 5.1e+02 *->lrCasPeslrGqpllellp.sdfsCp<-* + C++P+++ ll+ +p + C+ gi|4049589 1107 PFCEHPPPM---VLLQTSPcDQYECQ 1129 DSL: domain 2 of 4, from 1144 to 1154: score 2.8, E = 17 *->lCleGWkGeyC<-* +C++G G+ C gi|4049589 1144 RCPPGFAGPRC 1154 EGF: domain 6 of 9, from 1123 to 1154: score 35.8, E = 2.1e-08 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C++ C+ng++C+ + + ++tC+CppG ++G+rC gi|4049589 1123 CDQY-ECQNGAQCIVVQQ-------EPTCRCPPG-----FAGPRC 1154 * gi|4049589 - - rrm: domain 1 of 1, from 1277 to 1293: score 0.9, E = 86 *->lfVgNLppdvteedLkd<-* l+ g++p+++ + L++ gi|4049589 1277 LYLGGIPTSTGLSALRQ 1293 laminin_G: domain 1 of 1, from 1186 to 1317: score 88.6, E = 4.2e-24 *->FRTtepsGlLllgYggtntdrggkkeigdDFlaleLvdGrlevsydl ++T + +G+Ll Y g+n+ aleL G++++ yd gi|4049589 1186 VATDKDNGILL--YKGDNDPL-----------ALELYQGHVRLVYDS 1219 GsghrlrpavvrsgdrvlnDGkWHrvelerngrkgtLsVdgeepskktls s p ++++ ++++nDG+ H+vel+ ++ ++L Vd++ p gi|4049589 1220 LSS----PPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTP------ 1259 etvvdgespsgpdvtsenLdldtppiLyvGGlPeqksvkrrlaaist... + +g+++++ + + p Ly+GG+P +s+ ++++++t+++ gi|4049589 1260 -------KSLGKLQKQPAVGINSP--LYLGGIP--TSTGLSALRQGTdrp 1298 ..sFkGCirdvsingkpld<-* ++F+GCi++v+in++++d gi|4049589 1299 lgGFHGCIHEVRINNELQD 1317 TNFR_c6: domain 1 of 1, from 1332 to 1342: score 2.1, E = 45 *->ClsCsrCepemGq<-* C+sC++C + G+ gi|4049589 1332 CKSCTVCKH--GL 1342 DSL: domain 3 of 4, from 1355 to 1364: score 6.3, E = 1.5 *->CleGWkGeyC<-* C +GW+G+ C gi|4049589 1355 CRPGWTGPLC 1364 EGF: domain 7 of 9, from 1332 to 1364: score 21.5, E = 0.00023 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + ++C++ G C++++ ++ CeC pG +tG+ C gi|4049589 1332 CKSCTVCKH-GLCRSVEK------DSVVCECRPG-----WTGPLC 1364 * gi|4049589 - - EGF: domain 8 of 9, from 1372 to 1402: score 30.2, E = 8.1e-07 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C + C++ G+Cv t+ +y C+C++G y G+ C gi|4049589 1372 CLGH-RCHH-GKCVATGT-------SYMCKCAEG-----YGGDLC 1402 * gi|4049589 - - EGF: domain 9 of 9, from 1412 to 1443: score 26.0, E = 1.2e-05 *->CapnnpCsngGtCvntpggssdnfggytCeCppGdyylsytGkrC<- C+ C++ G+C+ + + g+++C C+pG ++G++C gi|4049589 1412 CSAF-KCHH-GQCHISDQ------GEPYCLCQPG-----FSGEHC 1443 * gi|4049589 - - DSL: domain 4 of 4, from 1433 to 1443: score 5.1, E = 3.3 *->lCleGWkGeyC<-* lC +G +Ge+C gi|4049589 1433 LCQPGFSGEHC 1443 CX: domain 1 of 1, from 1482 to 1490: score 0.5, E = 68 *->CGmeCCedd<-* CG CC ++ gi|4049589 1482 CGPQCCQPT 1490 PDGF: domain 1 of 1, from 1512 to 1519: score 1.5, E = 30 *->eqHlkCeC<-* e Hl+C C gi|4049589 1512 ERHLECGC 1519 Cys_knot: domain 1 of 1, from 1464 to 1522: score 16.2, E = 0.00099 *->kegCtscktvnttiCaGyCytkdpvykdprsllkQcvCcypevtyet ++ C+ +v + C G C+ p+ Cc+p++++++ gi|4049589 1464 YASCATASKVPIMECRGGCG--------PQ-------CCQPTRSKRR 1495 kvlpgCppGvdptvtYpvAlsChCgdkC<-* k+++ C++G+ ++ ++ l+C C +C gi|4049589 1496 KYVFQCTDGSSFVEEVERHLECGCL-AC 1522 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem27 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRR-ma 115.5 1e-30 23 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRR-ma 1/23 60 73 .. 1 14 [] 4.6 3.4e+02 LRR-ma 2/23 84 97 .. 1 14 [] 10.3 33 LRR-ma 3/23 108 121 .. 1 14 [] 11.8 17 LRR-ma 4/23 132 145 .. 1 14 [] 11.6 19 LRR-ma 5/23 156 169 .. 1 14 [] 4.9 3e+02 LRR-ma 6/23 180 193 .. 1 14 [] 9.3 49 LRR-ma 7/23 204 217 .. 1 14 [] 11.0 24 LRR-ma 8/23 306 319 .. 1 14 [] 1.8 1.1e+03 LRR-ma 9/23 330 343 .. 1 14 [] 7.2 1.2e+02 LRR-ma 10/23 354 367 .. 1 14 [] 11.7 18 LRR-ma 11/23 378 391 .. 1 14 [] 5.3 2.6e+02 LRR-ma 12/23 402 415 .. 1 14 [] 7.8 93 LRR-ma 13/23 426 439 .. 1 14 [] 9.3 49 LRR-ma 14/23 556 569 .. 1 14 [] 16.1 0.86 LRR-ma 15/23 580 593 .. 1 14 [] 0.7 1.6e+03 LRR-ma 16/23 604 617 .. 1 14 [] 5.8 2.1e+02 LRR-ma 17/23 628 641 .. 1 14 [] 8.6 66 LRR-ma 18/23 652 665 .. 1 14 [] 4.6 3.5e+02 LRR-ma 19/23 751 764 .. 1 14 [] 6.6 1.5e+02 LRR-ma 20/23 774 787 .. 1 14 [] 12.5 10 LRR-ma 21/23 798 811 .. 1 14 [] 12.8 8.1 LRR-ma 22/23 822 835 .. 1 14 [] 9.7 42 LRR-ma 23/23 846 859 .. 1 14 [] 7.8 93 Alignments of top-scoring domains: LRR-ma: domain 1 of 23, from 60 to 73: score 4.6, E = 3.4e+02 *->lpsLeeLdLsnNrl<-* + + e+LdL +N++ gi|4049589 60 PRNAERLDLDRNNI 73 LRR-ma: domain 2 of 23, from 84 to 97: score 10.3, E = 33 *->lpsLeeLdLsnNrl<-* l++L++L+L +N+ gi|4049589 84 LKNLRVLHLEDNQV 97 LRR-ma: domain 3 of 23, from 108 to 121: score 11.8, E = 17 *->lpsLeeLdLsnNrl<-* l++Le+L+L+ N+l gi|4049589 108 LKQLERLRLNKNKL 121 LRR-ma: domain 4 of 23, from 132 to 145: score 11.6, E = 19 *->lpsLeeLdLsnNrl<-* p+L++LdLs N++ gi|4049589 132 TPKLTRLDLSENQI 145 LRR-ma: domain 5 of 23, from 156 to 169: score 4.9, E = 3e+02 *->lpsLeeLdLsnNrl<-* + ++L L nN++ gi|4049589 156 ITDVKNLQLDNNHI 169 LRR-ma: domain 6 of 23, from 180 to 193: score 9.3, E = 49 *->lpsLeeLdLsnNrl<-* l Le+L L+nN++ gi|4049589 180 LRDLEILTLNNNNI 193 LRR-ma: domain 7 of 23, from 204 to 217: score 11.0, E = 24 *->lpsLeeLdLsnNrl<-* +p++++L+L++N+l gi|4049589 204 MPKIRTLRLHSNHL 217 LRR-ma: domain 8 of 23, from 306 to 319: score 1.8, E = 1.1e+03 *->lpsLeeLdLsnNrl<-* + + e++L +N++ gi|4049589 306 PEGIVEIRLEQNSI 319 LRR-ma: domain 9 of 23, from 330 to 343: score 7.2, E = 1.2e+02 *->lpsLeeLdLsnNrl<-* ++L+++d+s N++ gi|4049589 330 YKKLKRIDISKNQI 343 LRR-ma: domain 10 of 23, from 354 to 367: score 11.7, E = 18 *->lpsLeeLdLsnNrl<-* l+sL++L L++N++ gi|4049589 354 LKSLTSLVLYGNKI 367 LRR-ma: domain 11 of 23, from 378 to 391: score 5.3, E = 2.6e+02 *->lpsLeeLdLsnNrl<-* l sL+ L L+ N++ gi|4049589 378 LVSLQLLLLNANKI 391 LRR-ma: domain 12 of 23, from 402 to 415: score 7.8, E = 93 *->lpsLeeLdLsnNrl<-* l++L L+L++N+l gi|4049589 402 LQNLNLLSLYDNKL 415 LRR-ma: domain 13 of 23, from 426 to 439: score 9.3, E = 49 *->lpsLeeLdLsnNrl<-* l+s+++L+L++N + gi|4049589 426 LQSIQTLHLAQNPF 439 LRR-ma: domain 14 of 23, from 556 to 569: score 16.1, E = 0.86 *->lpsLeeLdLsnNrl<-* lp+L++++LsnN++ gi|4049589 556 LPNLRKINLSNNKI 569 LRR-ma: domain 15 of 23, from 580 to 593: score 0.7, E = 1.6e+03 *->lpsLeeLdLsnNrl<-* s +eL L +N+l gi|4049589 580 AASVQELMLTGNQL 593 LRR-ma: domain 16 of 23, from 604 to 617: score 5.8, E = 2.1e+02 *->lpsLeeLdLsnNrl<-* l+ L++L L++N + gi|4049589 604 LSGLKTLMLRSNLI 617 LRR-ma: domain 17 of 23, from 628 to 641: score 8.6, E = 66 *->lpsLeeLdLsnNrl<-* l+s + L+L++Nr+ gi|4049589 628 LSSVRLLSLYDNRI 641 LRR-ma: domain 18 of 23, from 652 to 665: score 4.6, E = 3.5e+02 *->lpsLeeLdLsnNrl<-* l sL +++L +N + gi|4049589 652 LVSLSTINLLSNPF 665 LRR-ma: domain 19 of 23, from 751 to 764: score 6.6, E = 1.5e+02 *->lpsLeeLdLsnNrl<-* ++ +eL+L +N+l gi|4049589 751 PKDVTELYLEGNHL 764 LRR-ma: domain 20 of 23, from 774 to 787: score 12.5, E = 10 *->lpsLeeLdLsnNrl<-* l +L+ +dLsnN++ gi|4049589 774 LRHLTLIDLSNNSI 787 LRR-ma: domain 21 of 23, from 798 to 811: score 12.8, E = 8.1 *->lpsLeeLdLsnNrl<-* +++L +L Ls+Nrl gi|4049589 798 MSHLSTLILSYNRL 811 LRR-ma: domain 22 of 23, from 822 to 835: score 9.7, E = 42 *->lpsLeeLdLsnNrl<-* l sL++L L++N + gi|4049589 822 LRSLRVLTLHGNDI 835 LRR-ma: domain 23 of 23, from 846 to 859: score 7.8, E = 93 *->lpsLeeLdLsnNrl<-* l+sL +L L+ N l gi|4049589 846 LTSLSHLALGTNPL 859 // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Thu Jan 18 20:37:26 2001 Sequence file: tem27 ---------------------------------------- Sequence gi|4049589|dbj|BAA35186.1| (1523 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 72: NITR 192: NISR 563: NLSN 622: NDTF 784: NNSI 792: NYTF 797: NMSH 928: NGTC 1008: NATC 1025: NYTG 1181: NISL 1247: NQTL 1406: NDSA Total matches: 13 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 473: KRIS Total matches: 1 Matching pattern PS00005 PKC_PHOSPHO_SITE: 132: TPK 209: TLR 318: SIK 466: SPR 480: SKK 630: SVR 722: SPR 739: SNK 824: SLR 864: SLR 900: THR 946: SYK 975: SHK 1067: SGK 1142: TCR 1173: SAK 1188: TDK 1258: TPK 1295: TDR 1492: SKR Total matches: 20 Matching pattern PS00006 CK2_PHOSPHO_SITE: 77: TKMD 98: SVIE 136: TRLD 170: SCIE 398: TFQD 409: SLYD 486: SGSE 514: TIVD 545: SVLE 582: SVQE 635: SLYD 730: TCME 778: TLID 837: SVPE 842: SFND 885: SSPE 975: SHKD 1010: TCVD 1073: TDND 1121: SPCD 1167: SYVE 1188: TDKD 1232: TVND 1506: SFVE Total matches: 24 Matching pattern PS00008 MYRISTYL: 7: GVGAAV 58: GIPRNA 175: GAFRAL 364: GNKITE 373: GLFDGL 461: GARCSS 575: GAFDGA 618: GCVSND 647: GAFTTL 686: GNPRCQ 713: GNEESS 761: GNHLTA 832: GNDISS 880: GIARCS 1236: GQFHSV 1257: GTPKSL 1280: GGIPTS 1281: GIPTST 1300: GGFHGC 1327: GVSPGC 1331: GCKSCT 1341: GLCRSV 1398: GGDLCD 1518: GCLACS Total matches: 24 Matching pattern PS00010 ASX_HYDROXYL: 1011: CVDGINNYVCIC 1089: CVDTINGYTCTC Total matches: 2 Matching pattern PS01185 CTCK_1: 1486: CCQPTRSKRRKYVFQCTDGSSFVEEVERHLECGCLAC Total matches: 1 Matching pattern PS00022 EGF_1: 941: CACPYSYKGKDC 982: CSCPLGFEGQRC 1020: CICPPNYTGELC 1060: CECVPGYSGKLC 1098: CTCPQGFSGPFC 1143: CRCPPGFAGPRC 1353: CECRPGWTGPLC 1391: CKCAEGYGGDLC 1432: CLCQPGFSGEHC Total matches: 9 Matching pattern PS01186 EGF_2: 982: CSCPLGFEGQRC 1060: CECVPGYSGKLC 1098: CTCPQGFSGPFC 1143: CRCPPGFAGPRC 1353: CECRPGWTGPLC 1391: CKCAEGYGGDLC 1432: CLCQPGFSGEHC Total matches: 7 Matching pattern PS01187 EGF_CA: 996: NPDDCEDNDCENNATCVDGINNYVC 1074: DNDDCVAHKCRHGAQCVDTINGYTC Total matches: 2 Matching pattern PS00029 LEUCINE_ZIPPER: 114: LRLNKNKLQVLPELLFQSTPKL Total matches: 1 Matching pattern PS00290 IG_MHC: 261: YVCPAPH Total matches: 1 Total no of hits in this sequence: 105 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1523 residues. Total no of hits in all sequences: 105. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search L=0 11.9851 1316 pos. 1449 - 1523 PS01225|CTCK_2 C-terminal cystine knot profile. # # P 1 CRRTPVS---PIYFEYNGCTSSKPVRMAYCSGSCGDGAMYDVDANDLKHSCRCCQPERTE -32 # S 1449 CLGQVVReviRRQKGYASCATASKVPIMECRGGCGP---------------QCCQPTRSK -31 # # P 58 TRQVRLHCPDGSTLTHSYRHIESCGCHRNCP -1 # S 1494 RRKYVFQCTDGSSFVEEVERHLECGCL-ACS -1 # ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem27 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1523 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1523 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem27 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem27 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] (1523 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value LRR Leucine rich repeats 147 1e-37 INSL Insulinase like Metallo protease domain 25 0.76 HOMEO Homeodomain (HTH DNa binding domain) 24 1.5 GAF GAF domain (Predicted Small-ligand binding domain) 23 2.7 CALMO Calmodulin like EF-hand domains 23 2.9 AAA AAA+ ATPase Module 23 4.5 KELCH Kelch repeat- beta propeller like domain 22 4.6 DSP Dual specificity protein phosphatase 22 4.8 HECT A ubiquitin conjugating enzyme domain 22 5.2 CBS cystathionine beta -synthase domain (A predicted ligand... 22 5.3 CALC Calcineurin like Phosphoesterase domain 22 5.6 ACTIN Actin ATPase/ Cytoskeletal ATPase domain 22 6.0 PDE cyclic NMP phosphodiesterase domain 22 7.5 UBA Ubiquitin pathway associated domain 22 8.6 MATH The Meprin associated TRAF homology domain 21 9.5 SAM Sterile alpha Motif domain (A protein-protein interacti... 21 9.5 >LRR Leucine rich repeats Length = 339 Score = 147 bits (368), Expect = 1e-37 Identities = 72/358 (20%), Positives = 72/358 (20%), Gaps = 40/358 (11%) Query: 80 DFAGLKNLRVLHLEDNQVSV--IERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTR 137 Sbjct: 3 ELSDLPRLRSVIVRDNNLKTAGIPTDIFR-MKDLTIIDLSRNQLREVPTNLEYAK-GSIV 60 Query: 138 LDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRIL 197 Sbjct: 61 LNLSYNNIETIPNSVCANLIDLLFLDLSNNKLDMLPPQ-IRRLSMLQSLKLSNNPLNHFQ 119 Query: 198 VTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQ 257 Sbjct: 120 LKQLPSMTSLSVLHMSNTNRTLDNIPPTLDD-MHNLRD----------VDFSENNLPIVP 168 Query: 258 KKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQN 317 Sbjct: 169 --EALFKLRNLRKLNLSGNKIEKLNMTEGE------------------WENLETLNMSHN 208 Query: 318 SIKAIPAGAFTQYKKLKRIDISKNQISDI-APDAFQGLKSLTSLVLYGNKITEIAKGLFD 376 Sbjct: 209 QLTVLP-DCVVKLTRLTKLYAANNQLTFEGIPSGIGKLIQLTVLHLSYNKLELVPEGIS- 266 Query: 377 GLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHL 434 Sbjct: 267 RCVKLQKLKLDHNRLITLP-EGIHLLPDLKVLDLHENENLVMPPKPNDARKKLAFYNI 323 Score = 141 bits (353), Expect = 6e-36 Identities = 76/353 (21%), Positives = 76/353 (21%), Gaps = 44/353 (12%) Query: 515 IVDCSNQKLVRIPSHLPEY--VTDLRLNDNEVSVLEATGIFKKLPNLRKINLSNNKIKEV 572 Sbjct: 14 IVRDNNLKTAGIPTDIFRMKDLTIIDLSRNQLREVPTN--LEYAKGSIVLNLSYNNIETI 71 Query: 573 REGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVR 632 Sbjct: 72 PNSVCANLIDLLFLDLSNNKLDMLPPQ-IRRLSMLQSLKLSNNPLNHFQLKQLPSMTSLS 130 Query: 633 LLSLY-DNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQ 691 Sbjct: 131 VLHMSNTNRTLDNIPPTLDDMHNLRDVDFSEN------NLPIVPEALFKLR--------- 175 Query: 692 KPFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGMP 751 Sbjct: 176 -------------NLRKLNLSGNKIEKLNMTEG---EWENLETLNMSHNQ-LTVLPDCVV 218 Query: 752 K--DVTELYLEGNHLT--AVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILS 807 Sbjct: 219 KLTRLTKLYAANNQLTFEGIPSGIGKLIQLTVLHLSYNKLELVPE-GISRCVKLQKLKLD 277 Query: 808 YNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPLH 860 Sbjct: 278 HNRLITLP-EGIHLLPDLKVLDLHENENLVMPPKPNDARKKLAFYNIDFSLEH 329 Score = 25.0 bits (54), Expect = 0.65 Identities = 11/34 (32%), Positives = 11/34 (32%), Gaps = 2/34 (5%) Query: 45 VDCHGLGLRAVPRGIP--RNAERLDLDRNNITRI 76 Sbjct: 38 IDLSRNQLREVPTNLEYAKGSIVLNLSYNNIETI 71 Score = 24.6 bits (53), Expect = 1.1 Identities = 10/27 (37%), Positives = 10/27 (37%), Gaps = 2/27 (7%) Query: 52 LRAVPRGIPR--NAERLDLDRNNITRI 76 Sbjct: 258 LELVPEGISRCVKLQKLKLDHNRLITL 284 >INSL Insulinase like Metallo protease domain Length = 433 Score = 24.9 bits (54), Expect = 0.76 Identities = 20/112 (17%), Positives = 20/112 (17%), Gaps = 14/112 (12%) Query: 1117 LLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGF---AGPRCEK---------LITVNFVG 1164 Sbjct: 15 VLTAQELYIRDLPNGAKLIVKPRDDTEAVALHVWFRVGSVYEKYDEKGMAHFLEHMLFNG 74 Query: 1165 KDSYVELASAKVRPQANISLQVATDKDNGILLYKGDND--PLALELYQGHVR 1214 Sbjct: 75 TEKYKYGEIDRIIESLGGNINAGTSKDYTYYHVEIAHPYWKQALEVLYQLTM 126 >HOMEO Homeodomain (HTH DNa binding domain) Length = 65 Score = 24.0 bits (52), Expect = 1.5 Identities = 7/47 (14%), Positives = 7/47 (14%), Gaps = 10/47 (21%) Query: 109 KQLERLRLNKNKLQVLPELLFQSTPKLTR---------LDLSENQIQ 146 Sbjct: 5 RRKRRVLFSQAQVYEL-ERRFKQQKYLSAPEREHLASMIHLTPTQVK 50 >GAF GAF domain (Predicted Small-ligand binding domain) Length = 160 Score = 23.0 bits (49), Expect = 2.7 Identities = 14/49 (28%), Positives = 14/49 (28%), Gaps = 12/49 (24%) Query: 1052 IPLDKGFSCECVPGY---SGKLCETDNDDCVAHKCRHGAQCVDTINGYT 1097 Sbjct: 62 FPLDMG-----VVGHVALSKKIVNVPN----TEEDEHFCDFVDTLTEYQ 101 >CALMO Calmodulin like EF-hand domains Length = 147 Score = 23.2 bits (49), Expect = 2.9 Identities = 14/96 (14%), Positives = 14/96 (14%), Gaps = 18/96 (18%) Query: 139 DLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRIL- 197 Sbjct: 4 NLTEEQIAEF-KEAFA--------LFDKDNNGSISSSELATVMRSLGLSPSEAEVNDLMN 54 Query: 198 --------VTSFNHMPKIRTLRLHSNHLYCDCHLAW 225 Sbjct: 55 EIDVDGNHQIEFSEFLALMSRQLKSNDSEQELLEAF 90 >AAA AAA+ ATPase Module Length = 298 Score = 22.5 bits (47), Expect = 4.5 Identities = 13/107 (12%), Positives = 13/107 (12%), Gaps = 3/107 (2%) Query: 480 SKKFRCSGSEDYRSRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRL 539 Sbjct: 152 GSRLEGGSGGDSEVQRTMLELLNQLDGFEATKNIKVIMATNRIDILDSALLRPGRIDRKI 211 Query: 540 NDNEVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQEL 586 Sbjct: 212 EFPPPNEEARLDILKIH--SRKMNLTRGINLR-KIAELMPGASGAEV 255 >KELCH Kelch repeat- beta propeller like domain Length = 319 Score = 22.2 bits (47), Expect = 4.6 Identities = 5/67 (7%), Positives = 5/67 (7%), Gaps = 2/67 (2%) Query: 61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNK 120 Sbjct: 190 PKVQCYDQCENRWTVPATCPQPWRYTAAAVL--GNQIFIMGGDTEFSACSAYKFNSETYQ 247 Query: 121 LQVLPEL 127 Sbjct: 248 WTKVGDV 254 >DSP Dual specificity protein phosphatase Length = 185 Score = 22.2 bits (47), Expect = 4.8 Identities = 10/46 (21%), Positives = 10/46 (21%), Gaps = 2/46 (4%) Query: 1248 QTLNLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQ 1293 Sbjct: 10 QDLNDLLSDGSG--CYSLPSQPCNEVTPRIYVGNASVAQDIPKLQK 53 >HECT A ubiquitin conjugating enzyme domain Length = 255 Score = 22.1 bits (47), Expect = 5.2 Identities = 16/78 (20%), Positives = 16/78 (20%), Gaps = 11/78 (14%) Query: 370 IAKGLFDG-LVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQS 428 Sbjct: 88 MGVAIRTKKPLDLHLAPLVWKQLCCVPL-TLEDLEEVDL--LYVQTLNSI---LHIEDSG 141 Query: 429 IQTL----HLAQNPFVCD 442 Sbjct: 142 ITEESFHEMIPLDSFVGQ 159 >CBS cystathionine beta -synthase domain (A predicted ligand binding domain) Length = 214 Score = 22.3 bits (47), Expect = 5.3 Identities = 4/72 (5%), Positives = 4/72 (5%), Gaps = 5/72 (6%) Query: 135 LTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTL----NN 190 Sbjct: 112 VRNCDVVKT-LAKLYEIPVYKIFKELHNHIGDISWEELMEAAAVVTKRMTGEDITPQEYE 170 Query: 191 NNISRILVTSFN 202 Sbjct: 171 ERIKKTTFGKAI 182 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 22.1 bits (46), Expect = 5.6 Identities = 17/146 (11%), Positives = 17/146 (11%), Gaps = 10/146 (6%) Query: 164 LDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHL 223 Sbjct: 126 LDSQVYGVPHGQLSQHQLDLLKETLGKNPERYTLVVLHHH------LLPTNSAWLDQHNL 179 Query: 224 AWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNANSISCPSP 283 Sbjct: 180 RNSHELAEVLAPFTNVKAILYGHIHQEVNSEWNGYQVMATPATCIQF--KPDCQYFSLDT 237 Query: 284 CTCSNNIVDC--RGKGLMEIPANLPE 307 Sbjct: 238 LQPGWREIELHSDGSIRTEVKRIQQA 263 >ACTIN Actin ATPase/ Cytoskeletal ATPase domain Length = 376 Score = 22.1 bits (47), Expect = 6.0 Identities = 13/38 (34%), Positives = 13/38 (34%), Gaps = 1/38 (2%) Query: 1176 VRPQANISLQVATDKDNGILLYKGDNDPLALELYQGHV 1213 Sbjct: 135 VAIQAVLSL-YASGRTTGIVLDSGDGVSHTVPIYEGDA 171 >PDE cyclic NMP phosphodiesterase domain Length = 350 Score = 21.6 bits (45), Expect = 7.5 Identities = 23/74 (31%), Positives = 23/74 (31%), Gaps = 12/74 (16%) Query: 535 TDLRLNDNEVSVLE---ATGIFKKLPNLR---KINLSNNKIKEVREGAFDGAASVQELML 588 Sbjct: 124 TELSLEYNDISVLENHHAMLTFKILRNSECNILEGLNEDQYKELRRSV------VQLILA 177 Query: 589 TGNQLETVHGRVFR 602 Sbjct: 178 TDMQNHFEHTNKFQ 191 >UBA Ubiquitin pathway associated domain Length = 255 Score = 21.6 bits (45), Expect = 8.6 Identities = 9/42 (21%), Positives = 9/42 (21%), Gaps = 1/42 (2%) Query: 92 LEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTP 133 Sbjct: 21 RNETIERIMEMG-YQREEVERALRAAFNNPDRAVEYLLMGIP 61 >MATH The Meprin associated TRAF homology domain Length = 209 Score = 21.4 bits (45), Expect = 9.5 Identities = 5/36 (13%), Positives = 5/36 (13%) Query: 543 EVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFD 578 Sbjct: 160 EVSVVQDSVNISGQNTMNMVKVPECRLADELGGLWE 195 >SAM Sterile alpha Motif domain (A protein-protein interaction domain) Length = 65 Score = 21.3 bits (45), Expect = 9.5 Identities = 8/24 (33%), Positives = 8/24 (33%) Query: 184 EILTLNNNNISRILVTSFNHMPKI 207 Sbjct: 33 ELLHLERRDLKDLGVTKVGHMKRI 56 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 16 Number of calls to ALIGN: 19 Length of query: 1523 Total length of test sequences: 20182 Effective length of test sequences: 16435.0 Effective search space size: 24454913.2 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|4049589|dbj|BAA35186.1| Slit-3 protein [Homo sapiens] (1523 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|1881738 [246..599] Leucine-rich repeats 116 3e-27 gi|132575 [1..456] Leucine-rich repeats 113 3e-26 gi|1082610 [223..548] Leucine-rich repeats 92 7e-20 gi|2623618 [35..332] Leucine-rich repeats 92 7e-20 gi|1418519 [191..423] Leucine-rich repeats 77 2e-15 gi|128081 [13..98] Neurophysin II 33 0.038 gi|135227 [17..277] Restriction endonucleases 30 0.27 gi|2039378 [21..454] PLP-dependent transferases 29 0.38 gi|2501021 [11..154] OB-fold 28 1.5 gi|417837 [59..431] Acid proteases 27 2.5 gi|1788898 [36..308] Periplasmic binding protein-like I 27 2.5 gi|1175369 [257..504] Nucleotidyltransferases 26 3.5 gi|1518938 [387..612] Heat shock protein 70kD (HSP70), C-ter... 26 3.6 gi|2493473 [2..191] EF Hand-like 26 3.9 gi|1778568 [354..541] Heat shock protein 70kD (HSP70), C-ter... 26 4.8 gi|1790450 [741..896] Flavodoxin-like 26 5.2 gi|2130182 [8..348] Protein kinases (PK), catalytic core 26 5.7 gi|2496715 [53..359] NAD(P)-binding Rossmann-fold domains 26 6.0 gi|1942958 [8..338] ATP pyrophoshatases 25 6.0 gi|2665705 [26..489] Periplasmic binding protein-like II 26 6.3 gi|2127787 [36..400] Periplasmic binding protein-like I 25 6.4 gi|1943840 [7..213] Protein kinases (PK), catalytic core 25 6.4 gi|462182 [68..563] alpha/beta-Hydrolases 25 6.5 gi|90456 [248..528] Nucleotidyltransferases 25 7.0 gi|1572759 [30..280] Trypsin-like serine proteases 25 8.4 gi|1169087 [44..365] Cysteine proteinases 25 8.6 gi|1750128 [252..391] A domain of alpha and beta subunits of... 25 9.6 gi|730289 [27..419] Single-stranded right-handed beta-helix 25 9.8 >gi|1881738 [246..599] Leucine-rich repeats Length = 354 Score = 116 bits (288), Expect = 3e-27 Identities = 27/363 (7%), Positives = 27/363 (7%), Gaps = 47/363 (12%) Query: 61 RNAERLDLDRNNITRITKMDFAGL-KNLRVLHLEDNQVS-----VIERGAFQDLKQLERL 114 Sbjct: 13 LVLSRVEGTASAFEELGQAIAKNRNSALTSIDWSNNLIKDAGVAALAAAVASMGHGLTSI 72 Query: 115 RLNKNKLQVLPELL--------FQSTPKLTRLDLSENQIQGIPRKAFRGIT----DVKNL 162 Sbjct: 73 SLKGGDATKKGTVALCTAFKKNVEMSRTLTVLNLAGNRLDSDGTSALAAFVSGPNALQTL 132 Query: 163 QLDNNHISCIED--GAFRALRDLEILTLNNNNISRILVTSFNH----MPKIRTLRLHSNH 216 Sbjct: 133 NISGTAANLEMLLPAVMRGCTELEKFNISHNKVTAKTGPELKKFLQSCGRLSELHMRDTA 192 Query: 217 LYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNAN 276 Sbjct: 193 VPVQVVRDVIKAIIGNNFITDFQLDL---AANKLGVLGANMLAGLAAEITTIKSLDLTDN 249 Query: 277 SISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRI 336 Sbjct: 250 DFGDEGMSIIADGL-------------CHNSSLRELHLGDNWTRNKTKARSQAVDNLIEL 296 Query: 337 DISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRV 396 Sbjct: 297 ISSECPLHKLDL-------SCKVADNQIKTDILPFIYSLATNDTLKELDISGNAMGDKVG 349 Query: 397 NTF 399 Sbjct: 350 IAL 352 Score = 108 bits (267), Expect = 7e-25 Identities = 42/352 (11%), Positives = 42/352 (11%), Gaps = 46/352 (13%) Query: 535 TDLRLNDNEVSVLEATGIFKKLPNLRKINLSNNKIKEVR-----EGAFDGAASVQELMLT 589 Sbjct: 16 SRVEGTASAFEELGQAIAKNRNSALTSIDWSNNLIKDAGVAALAAAVASMGHGLTSISLK 75 Query: 590 GNQLETVHGRVFRG--------LSGLKTLMLRSNLIGCVSNDTFAG----LSSVRLLSLY 637 Sbjct: 76 GGDATKKGTVALCTAFKKNVEMSRTLTVLNLAGNRLDSDGTSALAAFVSGPNALQTLNIS 135 Query: 638 DNRITT--ITPGAFTTLVSLSTINLLSNPFN---CNCHLAWLGKWLRKRRIVSGNPRCQK 692 Sbjct: 136 GTAANLEMLLPAVMRGCTELEKFNISHNKVTAKTGPELKKFLQSCGRLSELHMRDTAVPV 195 Query: 693 ---PFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRG 749 Sbjct: 196 QVVRDVIKAIIGNNFITDFQLDLAANKLGVLGANMLAGLAAEITTIKSLDL--------- 246 Query: 750 MPKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYN 809 Sbjct: 247 -----TDNDFGDEGMSIIADGLCHNSSLRELHLGDNWTRNKTKARSQAVDNLIELISSEC 301 Query: 810 RLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPLHC 861 Sbjct: 302 PLHKLDL-------SCKVADNQIKTDILPFIYSLATNDTLKELDISGNAMGD 346 Score = 31.2 bits (69), Expect = 0.12 Identities = 5/61 (8%), Positives = 5/61 (8%), Gaps = 1/61 (1%) Query: 397 NTFQDLQNLNLLSLYDNKLQTI-SKGLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDN 455 Sbjct: 7 NTKMEELVLSRVEGTASAFEELGQAIAKNRNSALTSIDWSNNLIKDAGVAALAAAVASMG 66 Query: 456 P 456 Sbjct: 67 H 67 Score = 28.8 bits (63), Expect = 0.68 Identities = 13/116 (11%), Positives = 13/116 (11%), Gaps = 14/116 (12%) Query: 400 QDLQNLNLLSLYDNKLQTISKGLFAP----LQSIQTLHLAQNPFVCDCHLKWLAD----- 450 Sbjct: 96 EMSRTLTVLNLAGNRLDSDGTSALAAFVSGPNALQTLNISGTAANLEMLLPAVMRGCTEL 155 Query: 451 ---YLQDNPIETSGAR--CSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECFM 501 Sbjct: 156 EKFNISHNKVTAKTGPELKKFLQSCGRLSELHMRDTAVPVQVVRDVIKAIIGNNFI 211 Score = 28.4 bits (62), Expect = 0.96 Identities = 11/60 (18%), Positives = 11/60 (18%), Gaps = 3/60 (5%) Query: 400 QDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPF-VCDCHLKWLADYLQDNPIE 458 Sbjct: 264 CHNSSLRELHLGDNWTRNKTKARSQAVDNLIELISSECPLHKLDLSCKVA--DNQIKTDI 321 >gi|132575 [1..456] Leucine-rich repeats Length = 456 Score = 113 bits (279), Expect = 3e-26 Identities = 68/526 (12%), Positives = 68/526 (12%), Gaps = 95/526 (18%) Query: 335 RIDISKNQISD-IAPDAFQGLKSLTSLVLYGNKITEIAKGL----FDGLVSLQLLLLNAN 389 Sbjct: 2 SLDIQCEQLSDARWTELLPLIQQYQVVRLDDCGLTEVRCKDIRSAIQANPALTELSLRTN 61 Query: 390 KINCLRVNTFQDL-----QNLNLLSLYDNKLQTISKG----LFAPLQSIQTLHLAQNPFV 440 Sbjct: 62 ELGDAGVGLVLQGLQNPTCKIQKLSLQNCSLTEAGCGVLPDVLRSLSTLRELHLNDNPLG 121 Query: 441 CDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECF 500 Sbjct: 122 DEGLKLLCEGLRDPQCR-----------------LEKLQLEYCNLTATSCEPLASVLRVK 164 Query: 501 MDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLND-NEVSVLEATGIFKKLPNL 559 Sbjct: 165 PDFKELVLSNNDFHEAGIHTLCQGLKDSACQLESLKLENCGITSANCKDLCDVVASKASL 224 Query: 560 RKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGC 619 Sbjct: 225 QELDLGSN-------------------KLGNTGIAALCSGLLLPSCRLRTLWLWDCDVTA 265 Query: 620 VSNDTFAG----LSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLG 675 Sbjct: 266 EGCKDLCRVLRAKQSLKELSLAGNELKDEGAQLLCESLLEPGCQLESLWVKTCSLTAASC 325 Query: 676 KWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETV 735 Sbjct: 326 PHF-----------CSVLTKNSSLFELQMSSNPLGDSGVVELCKALGYPDTV-------- 366 Query: 736 VRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSIS-----ML 790 Sbjct: 367 -------LRVL------WLGDCDVTDSGCSSLATVLLANRSLRELDLSNNCMGDNGVLQL 413 Query: 791 TNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDIS 836 Sbjct: 414 LESLKQPSCILQQLVLYDIYWTDEVED---QLRALEEERPSLRIIS 456 Score = 68.3 bits (164), Expect = 8e-13 Identities = 32/200 (16%), Positives = 32/200 (16%), Gaps = 23/200 (11%) Query: 65 RLDLDRNNIT-RITKMDFAGLKNLRVLHLEDNQVSVIERG----AFQDLKQLERLRLNKN 119 Sbjct: 2 SLDIQCEQLSDARWTELLPLIQQYQVVRLDDCGLTEVRCKDIRSAIQANPALTELSLRTN 61 Query: 120 KLQVLP-----ELLFQSTPKLTRLDLSENQIQGIP----RKAFRGITDVKNLQLDNNHIS 170 Sbjct: 62 ELGDAGVGLVLQGLQNPTCKIQKLSLQNCSLTEAGCGVLPDVLRSLSTLRELHLNDNPLG 121 Query: 171 C-----IEDGAFRALRDLEILTLNNNNISRILV----TSFNHMPKIRTLRLHSNHLYCDC 221 Sbjct: 122 DEGLKLLCEGLRDPQCRLEKLQLEYCNLTATSCEPLASVLRVKPDFKELVLSNNDFHEAG 181 Query: 222 HLAWLSDWLRQRRTVGQFTL 241 Sbjct: 182 IHTLCQGLKDSACQLESLKL 201 Score = 60.5 bits (144), Expect = 2e-10 Identities = 24/199 (12%), Positives = 24/199 (12%), Gaps = 28/199 (14%) Query: 44 SVDCHGLGLRAVPRGIPR--NAERLDLDRNNITRITKMDFAGLK-----NLRVLHLEDNQ 96 Sbjct: 260 DCDVTAEGCKDLCRVLRAKQSLKELSLAGNELKDEGAQLLCESLLEPGCQLESLWVKTCS 319 Query: 97 VSVIE----RGAFQDLKQLERLRLNKNKLQVLP-----ELLFQSTPKLTRLDLSENQIQG 147 Sbjct: 320 LTAASCPHFCSVLTKNSSLFELQMSSNPLGDSGVVELCKALGYPDTVLRVLWLGDCDVTD 379 Query: 148 IPRK----AFRGITDVKNLQLDNNHISC-----IEDGAFRALRDLEILTLNNNNISRILV 198 Sbjct: 380 SGCSSLATVLLANRSLRELDLSNNCMGDNGVLQLLESLKQPSCILQQLVLYDIYWTDEVE 439 Query: 199 TSFNHMPKIRTLRLHSNHL 217 Sbjct: 440 DQLRALEE---ERPSLRII 455 >gi|1082610 [223..548] Leucine-rich repeats Length = 326 Score = 91.8 bits (225), Expect = 7e-20 Identities = 42/350 (12%), Positives = 42/350 (12%), Gaps = 52/350 (14%) Query: 68 LDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLK-------------QLERL 114 Sbjct: 2 LSTEAALTLCH-LLSSWVSLESLTLSYNGLGSNIFRLLDSLRALSGQAGCRLRALHLSDL 60 Query: 115 RLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIED 174 Sbjct: 61 FSPLPILELTRA-IVRALPLLRVLSIRVDHPSQRDNPGVPGNAGPPSHIIGDEEIP---- 115 Query: 175 GAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRR 234 Sbjct: 116 --ENCLEQLEMGFPRGAQPAPLLCSVLKASGSLQQLSLDSATFASPQDFGLVLQTLKEYN 173 Query: 235 TVGQFTLCMAPVHLRGFNVADVQKK-EYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDC 293 Sbjct: 174 LALKR------LSFHDMNLADCQSEVLFLLQNLTLQEITFSFCRLFEKRPAQFLP----- 222 Query: 294 RGKGLMEIPANLPEGIVEIRLEQNSIKAIPA------GAFTQYKKLKRIDISKNQISDIA 347 Sbjct: 223 ---EMVAAMKGNS-TLKGLRLPGNRLGNAGLLALADVFSEDSSSSLCQLDISSNCIKPDG 278 Query: 348 PDAFQ------GLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKI 391 Sbjct: 279 LLEFAKRLERWGRGAFGHLRLFQNWLDQD---AVTAREAIRRLRATCHVV 325 Score = 80.5 bits (196), Expect = 2e-16 Identities = 46/344 (13%), Positives = 46/344 (13%), Gaps = 64/344 (18%) Query: 539 LNDNEVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHG 598 Sbjct: 2 LSTEAALTLCH--LLSSWVSLESLTLSYNGLGSNIFRLLDSLRALSGQAGC--RLRALH- 56 Query: 599 RVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTI 658 Sbjct: 57 --------LSDLFSPLPIL-ELTRAIVRALPLLRVLSIRVDHPSQRDNPGVPGNAGPPSH 107 Query: 659 NLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESS 718 Sbjct: 108 IIGDEEIPE-NCLEQL-EMGFPRGAQPAPLLCSVLKASGSLQQLSLDSATFASPQDFGLV 165 Query: 719 CQLSPRCPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLT---AVPRELSALR 775 Sbjct: 166 LQTLKEYNLA-------------------------LKRLSFHDMNLADCQSEVLFLLQNL 200 Query: 776 HLTLIDLSNNSISMLTN--------YTFSNMSHLSTLILSYNRLRCIP------VHAFNG 821 Sbjct: 201 TLQEITFSFCRLFEKRPAQFLPEMVAAMKGNSTLKGLRLPGNRLGNAGLLALADVFSEDS 260 Query: 822 LRSLRVLTLHGNDISSVPEGSFND------LTSLSHLALGTNPL 859 Sbjct: 261 SSSLCQLDISSNCIKPDGLLEFAKRLERWGRGAFGHLRLFQNWL 304 Score = 33.0 bits (74), Expect = 0.037 Identities = 22/169 (13%), Positives = 22/169 (13%), Gaps = 30/169 (17%) Query: 397 NTFQDLQNLNLLSLYDNKLQTISK--GLFAPLQSIQTLHLAQNPFVCDCHLKWLAD---- 450 Sbjct: 168 TLKEYNLALKRLSFHDMNLADCQSEVLFLLQNLTLQEITFSFCRLFEKRPAQFLPEMVAA 227 Query: 451 ----------YLQDNPIETSG----ARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFS 496 Sbjct: 228 MKGNSTLKGLRLPGNRLGNAGLLALADVFSEDSSSSLCQLDISSNCIKPDGLLEFAKRLE 287 Query: 497 SECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVS 545 Sbjct: 288 RWGRGAF----------GHLRLFQNWLDQDAVTAREAIRRLRATCHVVS 326 Score = 29.1 bits (64), Expect = 0.53 Identities = 7/51 (13%), Positives = 7/51 (13%) Query: 392 NCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCD 442 Sbjct: 109 IGDEEIPENCLEQLEMGFPRGAQPAPLLCSVLKASGSLQQLSLDSATFASP 159 >gi|2623618 [35..332] Leucine-rich repeats Length = 298 Score = 91.9 bits (225), Expect = 7e-20 Identities = 33/314 (10%), Positives = 33/314 (10%), Gaps = 49/314 (15%) Query: 151 KAFRGITDVKNLQLDNNHISCIEDGAFRA----LRDLEILTLNNNNISRILVTSFNHMPK 206 Sbjct: 8 AAVEACEGIQSLKLNGNTIGVEAAQALAKALESKPQFQRARWSDMFTGRLRSEIPPALMS 67 Query: 207 IRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAP 266 Sbjct: 68 LGAGIMTAGAHLVEIDLSD------------NAFGPDGVKAVRELLESSSCYSLREMRFN 115 Query: 267 HSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIP--- 323 Sbjct: 116 N------------NGLGIGGKLMAEALITCHEKSTKAGKPLALKVFIAGRNRLENPGATV 163 Query: 324 -AGAFTQYKKLKRIDISKNQISDIA----PDAFQGLKSLTSLVLYGNKITEIAK----GL 374 Sbjct: 164 LAKAFKIIGTLEEIALPQNGINYEGITALAEAVEYSHNLKILNLNDNTFTARGAKPMAKA 223 Query: 375 FDGLVSLQLLLLNANKI-----NCLRVNTFQDLQNLNLLSLYDNKLQTISK----GLFAP 425 Sbjct: 224 IKNLSKLEVINFGDCLVRSEGADAIANSLREGVPSLKELNLAFGEIKKEAAVRVAESMDT 283 Query: 426 LQSIQTLHLAQNPF 439 Sbjct: 284 KPHLTLLDLNGNNI 297 Score = 87.2 bits (213), Expect = 2e-18 Identities = 39/290 (13%), Positives = 39/290 (13%), Gaps = 29/290 (10%) Query: 599 RVFRGLSGLKTLMLRSNLIGCVSNDTF----AGLSSVRLLSLYDNRITTITPGAFTTLVS 654 Sbjct: 8 AAVEACEGIQSLKLNGNTIGVEAAQALAKALESKPQFQRARWSDMFTGRLRSEIPPALMS 67 Query: 655 LSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTC-DG 713 Sbjct: 68 LGAGIMTAGAHLVEIDLSDNAFGPDGVKAVRELLESSSCYSLREMRFNNNGLGIGGKLMA 127 Query: 714 NEESSCQLSPRCPEQCTCMETVVRCSNKGLRALPRGMP------KDVTELYLEGNHLT-- 765 Sbjct: 128 EALITCHEKSTKAGKPLALKVFIAGRNRLENPGATVLAKAFKIIGTLEEIALPQNGINYE 187 Query: 766 ---AVPRELSALRHLTLIDLSNNSISMLTNYT----FSNMSHLSTLILSYNRLR-----C 813 Sbjct: 188 GITALAEAVEYSHNLKILNLNDNTFTARGAKPMAKAIKNLSKLEVINFGDCLVRSEGADA 247 Query: 814 IPVHAFNGLRSLRVLTLHGNDISSVP----EGSFNDLTSLSHLALGTNPL 859 Sbjct: 248 IANSLREGVPSLKELNLAFGEIKKEAAVRVAESMDTKPHLTLLDLNGNNI 297 Score = 53.3 bits (126), Expect = 2e-08 Identities = 17/124 (13%), Positives = 17/124 (13%), Gaps = 8/124 (6%) Query: 34 CPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLE 93 Sbjct: 93 GVKAVRELLESSSCYSLREMRFNNNGLGIGGKLMAEALITCHEKSTKAGKPLALKVFIAG 152 Query: 94 DNQVSVIER----GAFQDLKQLERLRLNKNKLQVLPELL----FQSTPKLTRLDLSENQI 145 Sbjct: 153 RNRLENPGATVLAKAFKIIGTLEEIALPQNGINYEGITALAEAVEYSHNLKILNLNDNTF 212 Query: 146 QGIP 149 Sbjct: 213 TARG 216 Score = 45.9 bits (107), Expect = 5e-06 Identities = 10/81 (12%), Positives = 10/81 (12%), Gaps = 5/81 (6%) Query: 72 NITRITKMDFAGLKNLRVLHLEDNQVSVIERGAF----QDLKQLERLRLNKNKLQVLPEL 127 Sbjct: 2 DARDVVA-AVEACEGIQSLKLNGNTIGVEAAQALAKALESKPQFQRARWSDMFTGRLRSE 60 Query: 128 LFQSTPKLTRLDLSENQIQGI 148 Sbjct: 61 IPPALMSLGAGIMTAGAHLVE 81 Score = 43.2 bits (100), Expect = 3e-05 Identities = 16/74 (21%), Positives = 16/74 (21%), Gaps = 9/74 (12%) Query: 81 FAGLKNLRVLHLEDNQVSVIERG-----AFQDLKQLERLRLNKNKLQVLPELLFQST--- 132 Sbjct: 224 IKNLSKLEVINFGDCLVRSEGADAIANSLREGVPSLKELNLAFGEIKKEAAVRVAESMDT 283 Query: 133 -PKLTRLDLSENQI 145 Sbjct: 284 KPHLTLLDLNGNNI 297 Score = 36.2 bits (82), Expect = 0.004 Identities = 16/141 (11%), Positives = 16/141 (11%), Gaps = 7/141 (4%) Query: 460 SGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECFMDLVCPEKCRCEGTIVDCS 519 Sbjct: 50 SDMFTGRLRSEIPPALMSLGAGIM--TAGAHLVEIDLSDNAFGPDGVKAVRELLESSSCY 107 Query: 520 NQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGIFKKLPNLRKINLSNNKIKEVRE----G 575 Sbjct: 108 SLREMRFNNNGLGIGGKLMAEALITCHEKST-KAGKPLALKVFIAGRNRLENPGATVLAK 166 Query: 576 AFDGAASVQELMLTGNQLETV 596 Sbjct: 167 AFKIIGTLEEIALPQNGINYE 187 Score = 33.8 bits (76), Expect = 0.022 Identities = 16/95 (16%), Positives = 16/95 (16%), Gaps = 15/95 (15%) Query: 514 TIVDCSNQKLVRIPS-HLPEYVTD------LRLNDNEVSVLEATGI----FKKLPNLRKI 562 Sbjct: 203 KILNLNDNTFTARGAKPMAKAIKNLSKLEVINFGDCLVRSEGADAIANSLREGVPSLKEL 262 Query: 563 NLSNNKIKEVR----EGAFDGAASVQELMLTGNQL 593 Sbjct: 263 NLAFGEIKKEAAVRVAESMDTKPHLTLLDLNGNNI 297 Score = 30.3 bits (67), Expect = 0.22 Identities = 4/25 (16%), Positives = 4/25 (16%) Query: 553 FKKLPNLRKINLSNNKIKEVREGAF 577 Sbjct: 10 VEACEGIQSLKLNGNTIGVEAAQAL 34 >gi|1418519 [191..423] Leucine-rich repeats Length = 233 Score = 77.3 bits (188), Expect = 2e-15 Identities = 32/199 (16%), Positives = 32/199 (16%), Gaps = 30/199 (15%) Query: 61 RNAERLDLDRNNIT----RITKMDFAGLKNLRVLHLEDNQVSVIE----RGAFQDLKQLE 112 Sbjct: 36 ASLTCLHLENTQMSGKNLLVLICALKNNTGLRELYLGDNGLQPTDGSHIYQLITSNSSLQ 95 Query: 113 RLRLNKNK---------LQVLPELLFQSTPKLTRLDLSENQIQGIPRK----AFRGITDV 159 Sbjct: 96 LLDLRNNSIGDSGVRHICDGLRHREAVEKSSLSAMVLWNNNVTGASMDSLAEALIENTKI 155 Query: 160 KNLQLDNNHISCIED----GAFRALRDLEILTLNNNNISRI----LVTSFNHMPKIRTLR 211 Sbjct: 156 ETLNIGNNNLGVEGIARLKPALASNSHLHRLGLQNTGINCEGAIILAECIADNIALLRVD 215 Query: 212 LHSNHLYCDCHLAWLSDWL 230 Sbjct: 216 IRDNPI-ALAGLLALHSAM 233 Score = 76.1 bits (185), Expect = 4e-15 Identities = 26/238 (10%), Positives = 26/238 (10%), Gaps = 29/238 (12%) Query: 625 FAGLSSVRLLSLYDNRITTITP------GAFTTLVSLSTINLLSNPFNCNCHLAWLGKWL 678 Sbjct: 2 IRHAVSLQMLNLRYTNLNDRSIPALCKMARAQPSASLTCLHLENTQMSGKNLLVLICALK 61 Query: 679 RKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETVVRC 738 Sbjct: 62 NNTGLRELYLGDNGLQPTDGSHIYQLITSNSSLQLLDLRNNSIGDSGVRHIC-------- 113 Query: 739 SNKGLRALPRGMPKDVTELYLEGNHLT-----AVPRELSALRHLTLIDLSNNSISMLTNY 793 Sbjct: 114 --DGLRHREAVEKSSLSAMVLWNNNVTGASMDSLAEALIENTKIETLNIGNNNLGVEGIA 171 Query: 794 TF----SNMSHLSTLILSYNRLRCIPVHA----FNGLRSLRVLTLHGNDISSVPEGSF 843 Sbjct: 172 RLKPALASNSHLHRLGLQNTGINCEGAIILAECIADNIALLRVDIRDNPIALAGLLAL 229 Score = 57.4 bits (137), Expect = 2e-09 Identities = 31/237 (13%), Positives = 31/237 (13%), Gaps = 20/237 (8%) Query: 200 SFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKK 259 Sbjct: 1 SIRHAVSLQMLNLRYTNL-NDRSIPALCKMARAQPSASLTCLHLENTQMSGKNLLVLI-- 57 Query: 260 EYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSI 319 Sbjct: 58 CALKNNTGLRELYLGDNGLQPTDGSHIYQLITSNSSLQLLDLRNNSIGDSG----VRHIC 113 Query: 320 KAIPAGAFTQYKKLKRIDISKNQISDIA----PDAFQGLKSLTSLVLYGNKITEIAK--- 372 Sbjct: 114 DGLRHREAVEKSSLSAMVLWNNNVTGASMDSLAEALIENTKIETLNIGNNNLGVEGIARL 173 Query: 373 -GLFDGLVSLQLLLLNANKINCLRV----NTFQDLQNLNLLSLYDNKLQ-TISKGLF 423 Sbjct: 174 KPALASNSHLHRLGLQNTGINCEGAIILAECIADNIALLRVDIRDNPIALAGLLALH 230 Score = 42.2 bits (98), Expect = 6e-05 Identities = 26/215 (12%), Positives = 26/215 (12%), Gaps = 28/215 (13%) Query: 426 LQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRC 485 Sbjct: 35 SASLTCLHLENTQMSGKNLLVLICA-------------LKNNTGLRELYLGDNGLQPTDG 81 Query: 486 SGSEDYRSRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVS 545 Sbjct: 82 SHIYQLITSNSSLQLLDL---RNNSIGDSGVRHICDGLRHREAVEKSSLSAMVLWNNNVT 138 Query: 546 VLEATGI---FKKLPNLRKINLSNNKIKEVRE----GAFDGAASVQELMLTGNQLETVHG 598 Sbjct: 139 GASMDSLAEALIENTKIETLNIGNNNLGVEGIARLKPALASNSHLHRLGLQNTGINCEGA 198 Query: 599 R----VFRGLSGLKTLMLRSNLI-GCVSNDTFAGL 628 Sbjct: 199 IILAECIADNIALLRVDIRDNPIALAGLLALHSAM 233 Score = 31.7 bits (71), Expect = 0.084 Identities = 6/33 (18%), Positives = 6/33 (18%) Query: 553 FKKLPNLRKINLSNNKIKEVREGAFDGAASVQE 585 Sbjct: 2 IRHAVSLQMLNLRYTNLNDRSIPALCKMARAQP 34 Score = 29.4 bits (65), Expect = 0.39 Identities = 4/34 (11%), Positives = 4/34 (11%) Query: 81 FAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERL 114 Sbjct: 2 IRHAVSLQMLNLRYTNLNDRSIPALCKMARAQPS 35 Score = 29.0 bits (64), Expect = 0.53 Identities = 4/34 (11%), Positives = 4/34 (11%) Query: 129 FQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNL 162 Sbjct: 2 IRHAVSLQMLNLRYTNLNDRSIPALCKMARAQPS 35 Score = 25.1 bits (54), Expect = 8.7 Identities = 5/21 (23%), Positives = 5/21 (23%) Query: 176 AFRALRDLEILTLNNNNISRI 196 Sbjct: 1 SIRHAVSLQMLNLRYTNLNDR 21 >gi|128081 [13..98] Neurophysin II Length = 86 Score = 32.8 bits (73), Expect = 0.038 Identities = 11/85 (12%), Positives = 11/85 (12%), Gaps = 6/85 (7%) Query: 978 DGFSCSCPLGFEGQRCEINPDDCEDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDH 1037 Sbjct: 7 LRQCLPCGPGGQGRCFGPSICCADALGCFVGTAEAL------RCQEENYLPSPCQSGQKP 60 Query: 1038 CVPELNLCQHEAKCIPLDKGFSCEC 1062 Sbjct: 61 CGSGGRCAANGVCCNDESCVIEPEC 85 Score = 26.6 bits (57), Expect = 3.3 Identities = 11/83 (13%), Positives = 11/83 (13%), Gaps = 7/83 (8%) Query: 1353 CECRPGWTGPLCDQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGDLCDNKNDSANAC 1412 Sbjct: 11 LPCGPGGQGRCFGPSI------CCADALGCFVGTAEALRCQEENYLPSPCQSGQKPCG-S 63 Query: 1413 SAFKCHHGQCHISDQGEPYCLCQ 1435 Sbjct: 64 GGRCAANGVCCNDESCVIEPECR 86 >gi|135227 [17..277] Restriction endonucleases Length = 261 Score = 30.2 bits (67), Expect = 0.27 Identities = 14/46 (30%), Positives = 14/46 (30%), Gaps = 4/46 (8%) Query: 567 NKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLML 612 Sbjct: 101 KDIINIRNGLLVGKRGDQDLMAAGNAIERSH----KNISEIANFML 142 >gi|2039378 [21..454] PLP-dependent transferases Length = 434 Score = 29.5 bits (65), Expect = 0.38 Identities = 14/127 (11%), Positives = 14/127 (11%), Gaps = 12/127 (9%) Query: 1 MAPGWAGVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGLGLRAVPRGIP 60 Sbjct: 296 IGNGFPLG--AVVTSKEIADSFNKSLYFNTYGGNPLASVVGKAVLE--VIEEEKLQENSA 351 Query: 61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERL--RLNK 118 Sbjct: 352 VVGDYF------LKQLAAIDDATIGDVRGKGLMIGVELIDEQGKPLAAAKTAAIFEDTKN 405 Query: 119 NKLQVLP 125 Sbjct: 406 QGLLIGK 412 >gi|2501021 [11..154] OB-fold Length = 144 Score = 27.6 bits (61), Expect = 1.5 Identities = 15/107 (14%), Positives = 15/107 (14%), Gaps = 17/107 (15%) Query: 553 FKKLPNLRK--INLSNNKIK------EVREGAFDGAASVQELMLTGNQLETVHGRVF--R 602 Sbjct: 8 LEKAEQLRQLGLNPYAYTWEITHQAQDLQE-TYRDLSNGEEV----DLKVAIAGRILARR 62 Query: 603 GLSGLKTLMLR--SNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPG 647 Sbjct: 63 VMGKLAFFTLQDESGTIQLYLEKQRLTEHMPELENAFNLLKKITDVG 109 >gi|417837 [59..431] Acid proteases Length = 373 Score = 26.9 bits (59), Expect = 2.5 Identities = 5/65 (7%), Positives = 5/65 (7%), Gaps = 13/65 (20%) Query: 922 SSPCKNNGTCTQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHGGTCHLSDS---HKD 978 Sbjct: 46 AKNITALSASEIWS----------DTDGVDAGRSTSDIRTNACTNYTCFDYSSTTARRTN 95 Query: 979 GFSCS 983 Sbjct: 96 SSTIG 100 >gi|1788898 [36..308] Periplasmic binding protein-like I Length = 273 Score = 26.9 bits (59), Expect = 2.5 Identities = 11/93 (11%), Positives = 11/93 (11%), Gaps = 7/93 (7%) Query: 288 NNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIA 347 Sbjct: 46 DTLVARNVDAIILSAVSENGSSRTVRRASE--AGIPVICYNTC--INQKGVDKYVSAYLV 101 Query: 348 PDAFQGLKSLTSLV---LYGNKITEIAKGLFDG 377 Sbjct: 102 GDPLEFGKKLGNAAADYFIANKIDQPKIAVINC 134 >gi|1175369 [257..504] Nucleotidyltransferases Length = 248 Score = 26.4 bits (58), Expect = 3.5 Identities = 7/39 (17%), Positives = 7/39 (17%) Query: 47 CHGLGLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLK 85 Sbjct: 13 IFGVGASHAAEWYQKGWRTIEQVRKHKDSFTKQIKVGLE 51 >gi|1518938 [387..612] Heat shock protein 70kD (HSP70), C-terminal, substrate-binding fragment Length = 226 Score = 26.4 bits (58), Expect = 3.6 Identities = 10/30 (33%), Positives = 10/30 (33%) Query: 51 GLRAVPRGIPRNAERLDLDRNNITRITKMD 80 Sbjct: 73 GLPPAPRGVPQIEVAFNVDQNGIMNVSASD 102 >gi|2493473 [2..191] EF Hand-like Length = 190 Score = 26.3 bits (57), Expect = 3.9 Identities = 12/82 (14%), Positives = 12/82 (14%), Gaps = 1/82 (1%) Query: 135 LTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNIS 194 Sbjct: 17 KKETGFSHSQITRL-YSRFTSLDKGENGTLSREDFQRIPELAINPLGDRIINAFFSEGED 75 Query: 195 RILVTSFNHMPKIRTLRLHSNH 216 Sbjct: 76 QVNFRGFMRTLAHFRPIEDNEK 97 >gi|1778568 [354..541] Heat shock protein 70kD (HSP70), C-terminal, substrate-binding fragment Length = 188 Score = 26.0 bits (57), Expect = 4.8 Identities = 12/96 (12%), Positives = 12/96 (12%), Gaps = 14/96 (14%) Query: 38 CTCSAASV-DCHGLG---LRAVPRGIPRNAE-RLDLDRNNITRITKMDFAGLKNLRV--- 89 Sbjct: 57 YQGENHKVKNNILVESFDVPLKKTGAYQSIDIRFSYDINGLLEVDVLLEDGSVKSRVINH 116 Query: 90 ----LHLEDNQVSVIERGAF--QDLKQLERLRLNKN 119 Sbjct: 117 SPVTLSAQQIEESRTRLSALKIYPRDMLINRTFKAK 152 >gi|1790450 [741..896] Flavodoxin-like Length = 156 Score = 25.7 bits (56), Expect = 5.2 Identities = 15/72 (20%), Positives = 15/72 (20%), Gaps = 9/72 (12%) Query: 1214 RLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPKSLGKLQKQPAVGI 1273 Sbjct: 36 EIVDLGVMVPAEKI--LRTAKEVNADLIGLSGL-ITPSLDEMVNVAKEMERQ------GF 86 Query: 1274 NSPLYLGGIPTS 1285 Sbjct: 87 TIPLLIGGATTS 98 >gi|2130182 [8..348] Protein kinases (PK), catalytic core Length = 341 Score = 25.5 bits (54), Expect = 5.7 Identities = 18/168 (10%), Positives = 18/168 (10%), Gaps = 7/168 (4%) Query: 993 CEINPDDCEDNDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEAKCI 1052 Sbjct: 74 HEIDILKSVRHPNIITFHDAWYNETEFVFITELMTSGTLREYIRKLTPLPNIKIVKRWCR 133 Query: 1053 PLDKGFSCECVPGYSGKLCETDNDDCVAHKC--RHGAQCVDTINGYTCTCPQGFSGPFCE 1110 Sbjct: 134 QILKGLAYL----HGHEPPII-HRDIKCDNIFINGAHGEIKIGDMGTAEMKNGKKYTVIG 188 Query: 1111 HPPPMVLLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGFAGPRCEKLI 1158 Sbjct: 189 TPEFMAPEMYEEQGYNEKVDIYAFGMCLLEMATGEYPYGECTNAVQVF 236 >gi|2496715 [53..359] NAD(P)-binding Rossmann-fold domains Length = 307 Score = 25.8 bits (55), Expect = 6.0 Identities = 11/147 (7%), Positives = 11/147 (7%), Gaps = 1/147 (0%) Query: 197 LVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADV 256 Sbjct: 140 ICKEAMQRDGLDVAIVRPRTVLGYGRQGVVQILFDWVERGLDIPVLGGGNNKYQFVHSDD 199 Query: 257 QKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPA-NLPEGIVEIRLE 315 Sbjct: 200 LASACIAASNVKGFATYNIGAAEFGTMRELLQVVIKHAETGSRIKSIPMGPTALAANLAS 259 Query: 316 QNSIKAIPAGAFTQYKKLKRIDISKNQ 342 Sbjct: 260 ALGLSPLGPYHSLMYGRAMYFDISKAQ 286 >gi|1942958 [8..338] ATP pyrophoshatases Length = 331 Score = 25.5 bits (55), Expect = 6.0 Identities = 8/102 (7%), Positives = 8/102 (7%), Gaps = 10/102 (9%) Query: 991 QRCEINPDDCEDNDCENNATCVDGI-------NNYVCICPPNYTGELCDEVIDHCVPEL- 1042 Sbjct: 79 EWLGFHWSGNVRYSSDYFDQLHAYAIELINKGLAYVDELTPEQTREYRGTLTQPGKNSPY 138 Query: 1043 -NLCQHEAKCI-PLDKGFSCECVPGYSGKLCETDNDDCVAHK 1082 Sbjct: 139 RDRSVEENLALFEKMRAGGFEEGKACLRAKIDMASPFIVMRD 180 >gi|2665705 [26..489] Periplasmic binding protein-like II Length = 464 Score = 25.7 bits (56), Expect = 6.3 Identities = 4/33 (12%), Positives = 4/33 (12%) Query: 394 LRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPL 426 Sbjct: 2 LYFGLSGEPSTLDTVVQPGTSGRTVKLAIHRGL 34 >gi|2127787 [36..400] Periplasmic binding protein-like I Length = 365 Score = 25.4 bits (55), Expect = 6.4 Identities = 9/96 (9%), Positives = 9/96 (9%), Gaps = 10/96 (10%) Query: 1022 CPPNYTGELCDEVIDHCVPELNLCQHEAKCIPLDKGFSCECVPGYSGKLCETDNDDCVAH 1081 Sbjct: 11 GGLATYGNNEKHICEIAEEKINKYFEE-----KGMPYKVKLYVED----TRADPNICLQK 61 Query: 1082 KCRHGAQCVDTINGYTCTCPQGFSGPFCEHPP-PMV 1116 Sbjct: 62 VQALHAQGITFFLGPMASGEVKNIKGFINSNKIVII 97 >gi|1943840 [7..213] Protein kinases (PK), catalytic core Length = 207 Score = 25.4 bits (54), Expect = 6.4 Identities = 15/180 (8%), Positives = 15/180 (8%), Gaps = 14/180 (7%) Query: 1340 HGLCRSVEKDSVVCECRPGWTGPLCDQEARDPC-LGHRCHHGKCVATGTSYMCKCAEGYG 1398 Sbjct: 18 FEFFHFNGSVAIVLEHFPH--CTATELLFHSKRDLSFALSYFRNLLYAVAYLHHNGYVHR 75 Query: 1399 GDLCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQENPCLGQVVREVI 1458 Sbjct: 76 DIKLSN--------FLYSPQTQRFRLVDFGLATVDRSKNE---CSRSHAVAAMERDPECC 124 Query: 1459 RRQKGYASCATASKVPIMECRGGCGPQCCQPTRSKRRKYVFQCTDGSSFVEEVERHLECG 1518 Sbjct: 125 SCKGTSSPCMFCRGKPKRENYNVVGTPGVRAPELLFGIGLCHPSIDIFSCGIVLLSLVSV 184 >gi|462182 [68..563] alpha/beta-Hydrolases Length = 496 Score = 25.4 bits (55), Expect = 6.5 Identities = 7/39 (17%), Positives = 7/39 (17%), Gaps = 3/39 (7%) Query: 935 PVELYRCACPYSYKG---KDCTVPINTCIQNPCQHGGTC 970 Sbjct: 29 GGGLARFQAAQPIGYQGRVNATVQSPNCAQFPELDRLRL 67 >gi|90456 [248..528] Nucleotidyltransferases Length = 281 Score = 25.3 bits (55), Expect = 7.0 Identities = 10/41 (24%), Positives = 10/41 (24%), Gaps = 1/41 (2%) Query: 47 CHGLGLRAVPRGIPRNAERL-DLDRNNITRITKMDFAGLKN 86 Sbjct: 8 VFGVGLKTAEKWFRMGFRTLSKIQSDKSLRFTKMQKAGFLY 48 >gi|1572759 [30..280] Trypsin-like serine proteases Length = 251 Score = 25.1 bits (54), Expect = 8.4 Identities = 7/48 (14%), Positives = 7/48 (14%), Gaps = 1/48 (2%) Query: 1393 CAEGYGGDLCDNKNDSANACSAFKCHHGQCHISDQGEPYCLCQPGFSG 1440 Sbjct: 203 CAGAYLHGTAPGDSGGPLLIHKSNGEYVQIGITSYGADGL-DGVIDQG 249 >gi|1169087 [44..365] Cysteine proteinases Length = 322 Score = 25.1 bits (54), Expect = 8.6 Identities = 8/58 (13%), Positives = 8/58 (13%), Gaps = 10/58 (17%) Query: 926 KNNGTCTQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHG--------GTCHLSDS 975 Sbjct: 144 VKDGIVTGSNYTANNGCKPYPFPPCEHHS--KKTHFDPCPHDLYPTPKCEKKCVSDYT 199 Score = 25.1 bits (54), Expect = 9.1 Identities = 17/90 (18%), Positives = 17/90 (18%), Gaps = 13/90 (14%) Query: 650 TTLVSLSTINLLSNPFNCN-----CHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDV 704 Sbjct: 110 ELQVTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTAN-NGCKPYPFPP- 167 Query: 705 AIQDFTCDGNEESSCQLS----PRCPEQCT 730 Sbjct: 168 --CEHHSKKTHFDPCPHDLYPTPKCEKKCV 195 >gi|1750128 [252..391] A domain of alpha and beta subunits of F1 ATP synthase-like Length = 140 Score = 24.9 bits (54), Expect = 9.6 Identities = 10/30 (33%), Positives = 10/30 (33%), Gaps = 2/30 (6%) Query: 697 KEIPIQDVAIQDFTCDGNEESSCQLSPRCP 726 Sbjct: 97 AEISVDEVA--EMLNTINYEVVSTLSRRIP 124 >gi|730289 [27..419] Single-stranded right-handed beta-helix Length = 393 Score = 24.9 bits (54), Expect = 9.8 Identities = 11/72 (15%), Positives = 11/72 (15%), Gaps = 3/72 (4%) Query: 562 INLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVS 621 Sbjct: 230 ITMSYNYYHDHDKSSIFGSSDSKTSD-DGKLKITLHHNRYKNIVQ-RAPRVRFGQVHVYN 287 Query: 622 NDTFAGLSSVRL 633 Sbjct: 288 NY-YEGSTSSSS 298 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 28 Number of calls to ALIGN: 53 Length of query: 1523 Total length of test sequences: 256703 Effective length of test sequences: 208388.0 Effective search space size: 308860107.0 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Thu Jan 18 20:38:03 2001 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 gi|4049589|dbj|BAA35186.1| using self-comparison > 1 gi|4049589|dbj|BAA35186.1| len 1523 from 31 to 1406 vs gi|4049589|dbj|BAA35186.1| len 1523 from 277 to 1447 score 1238 eval 2.848253e-116 identity 32.57% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 31 AVACPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVL 90 gi|4049589|dbj|BAA35186.1| :::||: |||| ||| | || :| :| |: :| 277 SISCPSPCTCSNNIVDCRGKGLMEIPANLPE---------------------GIVEIR-- 313 gi|4049589|dbj|BAA35186.1| 91 HLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPR 150 gi|4049589|dbj|BAA35186.1| || | : | ||| |:|:|: ::||:: : || || | | |:| | : 314 -LEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAK 372 gi|4049589|dbj|BAA35186.1| 151 KAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTL 210 gi|4049589|dbj|BAA35186.1| | |: :: | |: | |:|: |: |::| :|:| :| : | | : |:|| 373 GLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTL 432 gi|4049589|dbj|BAA35186.1| 211 RLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEP 270 gi|4049589|dbj|BAA35186.1| | | ||||| ||:|:|: | :| | :: :: |:: | 433 HLAQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYR 492 gi|4049589|dbj|BAA35186.1| 271 PSCNAN---SISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIPA-GA 326 gi|4049589|dbj|BAA35186.1| :: : || | | |||| : |: ||::||| : ::|| | : : | | 493 SRFSSECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGI 552 gi|4049589|dbj|BAA35186.1| 327 FTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLL 386 gi|4049589|dbj|BAA35186.1| | : |::|::| |:| :: || | |: |:| ||:: : :| || |: |:| 553 FKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLML 612 gi|4049589|dbj|BAA35186.1| 387 NANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLK 446 gi|4049589|dbj|BAA35186.1| :| | |: :|| | :: |||||||:: ||: | | | |: |::| ||| |:||| 613 RSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLA 672 gi|4049589|dbj|BAA35186.1| 447 WLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECFMDLVCP 506 gi|4049589|dbj|BAA35186.1| || :|: | : || | | | : : | | |:|: | | : || 673 WLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEE------SSCQLSPRCP 726 gi|4049589|dbj|BAA35186.1| 507 EKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGIFKKLPNLRKINLSN 566 gi|4049589|dbj|BAA35186.1| |:| | |:| |||: | :| :|: ||:| | | :: : | :| |:||| 727 EQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPRE--LSALRHLTLIDLSN 784 gi|4049589|dbj|BAA35186.1| 567 NKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFA 626 gi|4049589|dbj|BAA35186.1| | | ||| | :| | ||:| | : |: | 785 NSIS----------------MLT--------NYTFSNMSHLSTLILSYNRLRCIPVHAFN 820 gi|4049589|dbj|BAA35186.1| 627 GLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSG 686 gi|4049589|dbj|BAA35186.1| || |:|:|:|: | |::: |:| | ||| : | :|| :|:| | || :|:: | 821 GLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPLHCDCSLRWLSEWVKAGYKEPG 880 gi|4049589|dbj|BAA35186.1| 687 NPRCQKPFFLKEIPIQDVAIQDFTCDG----NEESSCQ--LSPRCPEQCTCMETVVRCSN 740 gi|4049589|dbj|BAA35186.1| || | : : : | | | | : | || | || : 881 IARCSSPEPMADRLLLTTPTHRFQCKGPVDINIVAKCNACLSSPCKNNGTCTQ------- 933 gi|4049589|dbj|BAA35186.1| 741 KGLRALPRGMPKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSH 800 gi|4049589|dbj|BAA35186.1| | ||| 934 ------------DPVELY------------------------------------------ 939 gi|4049589|dbj|BAA35186.1| 801 LSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPLH 860 gi|4049589|dbj|BAA35186.1| || ::: | 940 -----------RCACPYSYKGK-------------------------------------- 950 gi|4049589|dbj|BAA35186.1| 861 CDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVDINIVAKCNAC 920 gi|4049589|dbj|BAA35186.1| ||:: |: | | 951 -DCTV------------------------------------------PI--------NTC 959 gi|4049589|dbj|BAA35186.1| 921 LSSPCKNNGTC--TQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHGGTCHLSDSHKD 978 gi|4049589|dbj|BAA35186.1| : :||:: ||| : : : |:|| ::|: | : : | | |:: || : 960 IQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQRCEINPDDCEDNDCENNATC---VDGIN 1016 gi|4049589|dbj|BAA35186.1| 979 GFSCSCPLGFEGQRCEINPDDC--EDNDCENNATCVDGINNYVCICPPNYTGELCDEVID 1036 gi|4049589|dbj|BAA35186.1| : | || : |: |: | | | | |:: | |: : | | | |:|:||: | 1017 NYVCICPPNYTGELCDEVIDHCVPELNLCQHEAKCIPLDKGFSCECVPGYSGKLCETDND 1076 gi|4049589|dbj|BAA35186.1| 1037 HCVPELNLCQHEAKCIPLDKGFSCECVPGYSGKLCE-------TDNDDCVAHKCRHGAQC 1089 gi|4049589|dbj|BAA35186.1| || : |:| |:|: |::| | |:|| || | ::|::|||| 1077 DCV--AHKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQC 1134 gi|4049589|dbj|BAA35186.1| 1090 VDTINGYTCTCPQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIVVQQEPTCRCPPGF 1149 gi|4049589|dbj|BAA35186.1| : || || ||:|| || :: : | | : : | 1135 IVVQQEPTCRCPPGFAGPRCE---KLITVNFVGKDSY-----VELASAKVRPQANISLQV 1186 gi|4049589|dbj|BAA35186.1| 1150 AGPRCEKLITVNFVGKDSYVELASAKVR---------PQANISLQVATDKDNGILLYKGD 1200 gi|4049589|dbj|BAA35186.1| | : :: :|| || | |:: | : 1187 ATDKDNGILLYKGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTL 1246 gi|4049589|dbj|BAA35186.1| 1201 NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPK 1260 gi|4049589|dbj|BAA35186.1| | | | : :| : : | | : | | | | | || : 1247 NQTLNLVVDKGTPKSL-GKLQKQPAVGINSPLYLGGIPTSTGLSALRQ--------GTDR 1297 gi|4049589|dbj|BAA35186.1| 1261 SLGKLQK-QPAVGINSPL--YLGGIPTSTGLSALRQGTDRPLGGFHGCIHEVRINNELQD 1317 gi|4049589|dbj|BAA35186.1| || | ||: | : | | |:| | || | :: : : 1298 PLGGFHGCIHEVRINNELQDFKALPPQSLGVS---PGCKSCTVCKHGLCRSVEKDSVVCE 1354 gi|4049589|dbj|BAA35186.1| 1318 FKALPPQSLGVSPGCKSCT--VCKHGLCRSVEKDSVVCECRPGWTGPLCD--QEARDPCL 1373 gi|4049589|dbj|BAA35186.1| : | | | || | : | :|:| |: | ||| :: : | 1355 CRPGWTGPLCDQEARDPCLGHRCHHGKCVAT-GTSYMCKCAEGYGGDLCDNKNDSANACS 1413 gi|4049589|dbj|BAA35186.1| 1374 GHRCHHGKC-VATGTSYMCKCAEGYGGDLCDNKN 1406 gi|4049589|dbj|BAA35186.1| :||||:| :: | | |: |: | :| 1414 AFKCHHGQCHISDQGEPYCLCQPGFSGEHCQQEN 1447 gi|4049589|dbj|BAA35186.1| > 2 gi|4049589|dbj|BAA35186.1| len 1523 from 32 to 514 vs gi|4049589|dbj|BAA35186.1| len 1523 from 503 to 930 score 629 eval 3.834500e-57 identity 33.96% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 32 VACPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLH 91 gi|4049589|dbj|BAA35186.1| : || || | ||| | :| :| : : | 503 LVCPEKCRCEGTIVDCSNQKLVRIPSHLP------------------------EYVTDLR 538 gi|4049589|dbj|BAA35186.1| 92 LEDNQVSVIE-RGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPR 150 gi|4049589|dbj|BAA35186.1| | ||:|||:| | |: | | :: |: ||:: : | | : | |: ||:: : 539 LNDNEVSVLEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHG 598 gi|4049589|dbj|BAA35186.1| 151 KAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTL 210 gi|4049589|dbj|BAA35186.1| : |||:: :| | | :| | |: : | | : :|:| :| |: | :| : : |: 599 RVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTI 658 gi|4049589|dbj|BAA35186.1| 211 RLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEP 270 gi|4049589|dbj|BAA35186.1| | || |:|||||| |||:|| | | | |: : || ::: | :| 659 NLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQKPFFLKEIPIQDVAIQDFTCDG--NEE 716 gi|4049589|dbj|BAA35186.1| 271 PSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLPEGIVEIRLEQNSIKAIPAGAFTQY 330 gi|4049589|dbj|BAA35186.1| || | || ||| :| | ||| :| :|: : |: || | : |:| 717 SSCQL-SPRCPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPR------ 769 gi|4049589|dbj|BAA35186.1| 331 KKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANK 390 gi|4049589|dbj|BAA35186.1| |: || : | | |: : | : | |:|: |: 770 -------------------ELSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNR 810 gi|4049589|dbj|BAA35186.1| 391 INCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLAD 450 gi|4049589|dbj|BAA35186.1| : |: |: | |::| :|:|: | : :: :| | | |: | | || ||| |:||:: 811 LRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPLHCDCSLRWLSE 870 gi|4049589|dbj|BAA35186.1| 451 YLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFSSECFMDLVCPEKCR 510 gi|4049589|dbj|BAA35186.1| ::: | |||||| :|:: : : :|:| | | ::| | | |: 871 WVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVDI--NIVAKCNACLSSP--CK 926 gi|4049589|dbj|BAA35186.1| 511 CEGT 514 gi|4049589|dbj|BAA35186.1| || 927 NNGT 930 gi|4049589|dbj|BAA35186.1| > 3 gi|4049589|dbj|BAA35186.1| len 1523 from 7 to 413 vs gi|4049589|dbj|BAA35186.1| len 1523 from 9 to 437 score 266 eval 6.732334e-22 identity 29.70% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 7 GVGAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGLGL--RAVPRGIPR--- 61 gi|4049589|dbj|BAA35186.1| | | |||||| | |: :: || | :|| | 9 GAAVRARLALALALASVLSGPPAVACPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDL 68 gi|4049589|dbj|BAA35186.1| 62 -----------------NAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGA 104 gi|4049589|dbj|BAA35186.1| | | |: | :: | : | || | | | |:: |: 69 DRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELL 128 gi|4049589|dbj|BAA35186.1| 105 FQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQL 164 gi|4049589|dbj|BAA35186.1| || :| || |::|::| :| |: : | | | | | ||| : |:: | | 129 FQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTL 188 gi|4049589|dbj|BAA35186.1| 165 DNNHISCIEDGAFRALRDLEILTLNNNNI--SRILVTSFNHMPKIRTLRLHSNHLYCDCH 222 gi|4049589|dbj|BAA35186.1| :||:|| | :| : : | |::|:: | : : : ||: : : | 189 NNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCM-APVH 247 gi|4049589|dbj|BAA35186.1| 223 LAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVCPAPHSEPPSCNANSISCPS 282 gi|4049589|dbj|BAA35186.1| | : |:: :| || ||: | :|: | : | 248 LRGFNVADVQKKEY----VCPAP----HSEPPSCNANSISCPS----PCTCSNNIVDCRG 295 gi|4049589|dbj|BAA35186.1| 283 ------PCTCSNNIVDCR--GKGLMEIPANL---PEGIVEIRLEQNSIKAIPAGAFTQYK 331 gi|4049589|dbj|BAA35186.1| | ||: | : ||| : : | : :| | | || | 296 KGLMEIPANLPEGIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLK 355 gi|4049589|dbj|BAA35186.1| 332 KLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKI 391 gi|4049589|dbj|BAA35186.1| | : : |:|::|| | || || |:| ||| : | | :| || | ||: 356 SLTSLVLYGNKITEIAKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKL 415 gi|4049589|dbj|BAA35186.1| 392 NCLRVNTFQDLQNLNLLSLYDN 413 gi|4049589|dbj|BAA35186.1| : | ||:: | | | 416 QTISKGLFAPLQSIQTLHLAQN 437 gi|4049589|dbj|BAA35186.1| > 4 gi|4049589|dbj|BAA35186.1| len 1523 from 34 to 292 vs gi|4049589|dbj|BAA35186.1| len 1523 from 725 to 937 score 263 eval 1.316565e-21 identity 33.33% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 34 CPTKCTCSAASVDCHGLGLRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLE 93 gi|4049589|dbj|BAA35186.1| || :||| | | ||||:|||:|:: | |: |::| : : : : |::| :: | 725 CPEQCTCMETVVRCSNKGLRALPRGMPKDVTELYLEGNHLTAVPR-ELSALRHLTLIDLS 783 gi|4049589|dbj|BAA35186.1| 94 DNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPRKAF 153 gi|4049589|dbj|BAA35186.1| :| :|:: | :: | | |: |:|: 784 NNSISMLTNYTFSNMSHLSTLILSYNRLR------------------------------- 812 gi|4049589|dbj|BAA35186.1| 154 RGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLH 213 gi|4049589|dbj|BAA35186.1| || || || | :|||: |:|| : ||| : : | | 813 -----------------CIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALG 855 gi|4049589|dbj|BAA35186.1| 214 SNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHLRGFNVADVQKKEYVC--PAPHSEPP 271 gi|4049589|dbj|BAA35186.1| :| |:||| | |||:|:: | :| : : : | | : 856 TNPLHCDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVDINIVA 915 gi|4049589|dbj|BAA35186.1| 272 SCNA-NSISCPSPCTCSNNIVD 292 gi|4049589|dbj|BAA35186.1| ||| | | : ||: : |: 916 KCNACLSSPCKNNGTCTQDPVE 937 gi|4049589|dbj|BAA35186.1| > 5 gi|4049589|dbj|BAA35186.1| len 1523 from 65 to 145 vs gi|4049589|dbj|BAA35186.1| len 1523 from 137 to 217 score 139 eval 1.441687e-09 identity 38.27% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 65 RLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVL 124 gi|4049589|dbj|BAA35186.1| |||| | | | : | |: ::: | |::| :| || |||: |: || | || | : : 137 RLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRI 196 gi|4049589|dbj|BAA35186.1| 125 PELLFQSTPKLTRLDLSENQI 145 gi|4049589|dbj|BAA35186.1| | ||: | | | : 197 LVTSFNHMPKIRTLRLHSNHL 217 gi|4049589|dbj|BAA35186.1| > 6 gi|4049589|dbj|BAA35186.1| len 1523 from 61 to 171 vs gi|4049589|dbj|BAA35186.1| len 1523 from 109 to 219 score 138 eval 1.802865e-09 identity 27.93% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNK 120 gi|4049589|dbj|BAA35186.1| : ||| |::| : : :: | | | | :||: | | ||: : :: |:|: | 109 KQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNH 168 gi|4049589|dbj|BAA35186.1| 121 LQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISC 171 gi|4049589|dbj|BAA35186.1| : : : |:: | | |: | | | :| : :: |:| :||: | 169 ISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLYC 219 gi|4049589|dbj|BAA35186.1| > 7 gi|4049589|dbj|BAA35186.1| len 1523 from 57 to 121 vs gi|4049589|dbj|BAA35186.1| len 1523 from 146 to 217 score 95 eval 2.697246e-05 identity 40.00% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 57 RGIPRNAER-------LDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLK 109 gi|4049589|dbj|BAA35186.1| :|||| | | | || |:|: | | |::| :| | :| :| | :| : 146 QGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMP 205 gi|4049589|dbj|BAA35186.1| 110 QLERLRLNKNKL 121 gi|4049589|dbj|BAA35186.1| :: |||: | | 206 KIRTLRLHSNHL 217 gi|4049589|dbj|BAA35186.1| > 8 gi|4049589|dbj|BAA35186.1| len 1523 from 66 to 119 vs gi|4049589|dbj|BAA35186.1| len 1523 from 610 to 663 score 83 eval 3.944089e-04 identity 38.89% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 66 LDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKN 119 gi|4049589|dbj|BAA35186.1| | | | | :: |||| ::|:| | ||::: | ||| | | : | | 610 LMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSN 663 gi|4049589|dbj|BAA35186.1| > 9 gi|4049589|dbj|BAA35186.1| len 1523 from 61 to 97 vs gi|4049589|dbj|BAA35186.1| len 1523 from 181 to 217 score 62 eval 4.223106e-02 identity 37.84% K 1.946626e-02 L 2.235630e-01 H 1.402204e+00 alpha 1.036353e-01 61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQV 97 gi|4049589|dbj|BAA35186.1| |: | | |: |||:|| | : :| | | | : 181 RDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHL 217 gi|4049589|dbj|BAA35186.1|