analysis of sequence from tem36 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVD PIVGYRITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV EESVPVIGQLTIQTGSSTKPVEKKPGKTEIQKCSVSAWTDLVFLVDGSWS VGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR DELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITD GKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYV KLNWNPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEY QISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYS IGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKV EDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITD GKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKT NWSPAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVN VTAEYEDGFSIPLAGEETTEEVKGAPRNLKVTDETTDSFKITWTQAPGRV LRCRIIYRPVAGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFSGP GTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTP VAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGE KNLPEDAIHTMIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSK TLKVDEETENTMRVTWKPAPGKVVNYRVVYRPHGRGKQMVAKVPPTVTST VLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLKTSDPTM SSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAG TTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTE WQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRA RKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDP DDTHDYNVADFESLSRIVDDLTINLCNSVKGPGDLEAPSNLVISERTHRS FRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTVLKDLKPETEY VVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLH DLTSEPVTVREVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRY KTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQE TTRPVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMET ILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGSERTLPILTTQ APKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNL RVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAIL RNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLARNVQVYNPTPNRL GVRWDPAPGPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLY SVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNSLSVAWDHADG PVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVY KPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQG TTLYLNVTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITV RGSETSHCFTGLSPDTDYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTE PPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFNKVVKFIFNTV GGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQ SGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFV CETATSSCPLIYLDGYTSPGFKMLEAYNLTEKNFASVQGVSLESGSFPSY SAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLLPETPSDPFAIWQIT DRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEVKTLFYGSFHK VHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPG PPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLS IPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRGFTGKDSAM GPRGPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRGDIAS QNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQPGPPGPPGPP GSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPRGLP GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIQGPPGPPGYCDSSQCA SIPYNGQSYPGSG ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > sp|Q99715|CA1C_HUMAN . . . . . 1 MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVD 50 _____HHHHHHHHHHHHHHHHHHH_______EEEEEE_____________ . . . . . 51 PIVGYRITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV 100 __EEEEEEE________EEEEEE___HHHHHHHEEE_EEEEEEEE_____ . . . . . 101 EESVPVIGQLTIQTGSSTKPVEKKPGKTEIQKCSVSAWTDLVFLVDGSWS 150 _______EEEEEEE______________EEEE_______EEEEEE_____ . . . . . 151 VGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR 200 _____HHHHHHHHHHHHHH______EEEEEEEEE_____EEEHHHHHHHH . . . . . 201 DELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITD 250 HHHHHHHHH___________HHHHHHHHH_______EEE___EEEEEE__ . . . . . 251 GKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA 300 ____EEEEHHHHHHHH___EEEE__HHHHHHHHHHHHH______EEEEE_ . . . . . 301 NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYV 350 _HHHHHHHHHHHHHHH___HHHHHHH_____EEE_____HHHHHHH____ . . . . . 351 KLNWNPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEY 400 ____________EEEEEEE________EEE_______EEEEE________ . . . . . 401 QISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYS 450 EEE_____________EEE______EEEEE_____EEE__EEEEE____E . . . . . 451 IGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKV 500 EE__HHHHHHHHHHHHHHHH______EEEEE_________HHHHHHHH_H . . . . . 501 EDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITD 550 HHHHHHHH______________HHHHHHEEE___________EEEEEE__ . . . . . 551 GKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE 600 __________HHHHH___EEEEEEEEHHHHHHHHHHH______EEEEEEE . . . . . 601 DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKT 650 _HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH___________________ . . . . . 651 NWSPAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVN 700 ________EEEEEEEEE______EEEEEE_____EEEEE______EEEEE . . . . . 701 VTAEYEDGFSIPLAGEETTEEVKGAPRNLKVTDETTDSFKITWTQAPGRV 750 EEEE________________________EEEE______EEEE______EE . . . . . 751 LRCRIIYRPVAGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFSGP 800 EEEEEEE________EEEE____HHHHHHHH_____EEEEEE________ . . . . . 801 GTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTP 850 ___________EEE_______________EEEE______EEEEEEEEE__ . . . . . 851 VAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT 900 _____EEEEEEE_______________EEEEHHHHHHHH___________ . . . . . 901 TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGE 950 __________EEE_________EEEE______EEEEEEEEEE________ . . . . . 951 KNLPEDAIHTMIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSK 1000 ______HHHHHHHH______EEEEEEEEE_____________________ . . . . . 1001 TLKVDEETENTMRVTWKPAPGKVVNYRVVYRPHGRGKQMVAKVPPTVTST 1050 EEEE_______EEEEE______EEEEEEEE______EEEEEE_____HHH . . . . . 1051 VLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLKTSDPTM 1100 HHHH_______EEEE__________________EEEE_____________ . . . . . 1101 SSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAG 1150 __EEEE_______EEEE_EEEE_______EEEEEE_____HHHHHHHH__ . . . . . 1151 TTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI 1200 ____EEE__________EEEEE_EEE_____EEEE_____HHHHHHHHHE . . . . . 1201 VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTE 1250 EEEE____________HHHHHHHHHEEEE_______EEE_________HH . . . . . 1251 WQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRA 1300 HHHHHH___HHHHHHHHH_________HHHHHHHHHHHHHHH______HH . . . . . 1301 RKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDP 1350 HHHEEEEE_________HHHHHHHH____HHHHH____HHHHHHEEE___ . . . . . 1351 DDTHDYNVADFESLSRIVDDLTINLCNSVKGPGDLEAPSNLVISERTHRS 1400 _____EEE___HHHHHHHHHHHHHEEE_____________EEEEE____E . . . . . 1401 FRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTVLKDLKPETEY 1450 EEEEE________EEEEEEE_______EEEEEE____EEEEEE____EEE . . . . . 1451 VVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA 1500 EEEEEEEEE_________________EEEEEE______EEEEEEE_____ . . . . . 1501 TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLH 1550 _EEEEEE____________EEEE_________________EEEEEEEHHH . . . . . 1551 DLTSEPVTVREVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRY 1600 HH_____EEEEEE________EEEEE_____EEEEE______EEEEEEEE . . . . . 1601 KTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQE 1650 ______EEEEEEE________HHHHHHHHHHEEEEEE_____________ . . . . . 1651 TTRPVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMET 1700 __________EEEEEE________________EEEEEE_________EEE . . . . . 1701 ILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGSERTLPILTTQ 1750 EE_____EEEEE______EEEEEE____________EEE____EEEEE__ . . . . . 1751 APKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT 1800 _______EEEEEE_____EEEE______EEEEEEEE__________EEE_ . . . . . 1801 IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNL 1850 _______EEEEE_______EEEEE_____________________EEEEE . . . . . 1851 RVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAIL 1900 EE_______EEEE_________EEEEEE__________________EEEE . . . . . 1901 RNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLARNVQVYNPTPNRL 1950 ________EEEEEE________________HHHHHH__EEEEE_______ . . . . . 1951 GVRWDPAPGPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLY 2000 EEEE______EEEEEEE__________EEEEE____HHHHHHH______E . . . . . 2001 SVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNSLSVAWDHADG 2050 EEEEEEEE______________________EEEEE_____EEEEE_____ . . . . . 2051 PVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE 2100 _EEEEEEEEE________EEEE______EEEE_________EEEEEEEE_ . . . . . 2101 DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVY 2150 ____________EEE_________HHHHHEEEEE_________EEEEEEE . . . . . 2151 KPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQG 2200 E_______HHHHHHHEEEEEE_______EEEEEEEEE_____EEE_____ . . . . . 2201 TTLYLNVTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITV 2250 EEEEEE____EEEEE_____EEE_____HHHHHHHH_________EEEEE . . . . . 2251 RGSETSHCFTGLSPDTDYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTE 2300 __________________EEEEEEE_________EEEEE___________ . . . . . 2301 PPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFNKVVKFIFNTV 2350 _____________________EEEEEEEE_________HHHHHEEEEEE_ . . . . . 2351 GGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN 2400 ___________EEEEE________________HHHHHHH___EEEE____ . . . . . 2401 TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQ 2450 ____EEEHHHHHHH____________EEEEEEE_____HHHHHHHHHHHH . . . . . 2451 SGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFV 2500 ____EEEEE__________________EEEEEEE___HHHHH__HHHHHH . . . . . 2501 CETATSSCPLIYLDGYTSPGFKMLEAYNLTEKNFASVQGVSLESGSFPSY 2550 _________EEEE________EEEEEEEE____EEEE_____________ . . . . . 2551 SAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLLPETPSDPFAIWQIT 2600 HHHHHHHHHH________________HHHHHHHH__________EEEEE_ . . . . . 2601 DRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEVKTLFYGSFHK 2650 ______EEEEEE_____EEEE_________EEEEE____EEEEE____EE . . . . . 2651 VHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA 2700 EEEEEE___EEEEEEEHHHHHHHHHHH_______HHHHHHHHHHHHHHHH . . . . . 2701 FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPG 2750 HEEEEEEEEE________________________________________ . . . . . 2751 PPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLS 2800 __________________________________________________ . . . . . 2801 IPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRGFTGKDSAM 2850 __________________________________________________ . . . . . 2851 GPRGPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRGDIAS 2900 _______________________________________________HHH . . . . . 2901 QNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQPGPPGPPGPP 2950 HHHHHHHHHHHHHHHHH___HHHHHHHH______________________ . . . . . 2951 GSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPRGLP 3000 __________________________________________________ . . . . . 3001 GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIQGPPGPPGYCDSSQCA 3050 __________________________________________________ . 3051 SIPYNGQSYPGSG 3063 _____________ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 0.0 % beta-contents : 30.6 % coil-contents : 69.4 % class : beta method : 2 alpha-contents : 0.0 % beta-contents : 21.0 % coil-contents : 79.0 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -5.68 0.00 -0.01 0.00 0.00 0.00 0.00 0.00 0.00 -8.83 -3.56 -12.00 -12.00 0.00 -12.00 0.00 -54.08 0.48 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 -10.69 -3.56 -12.00 -12.00 0.00 -12.00 0.00 -49.78 ID: sp|Q99715|CA1C_HUMAN AC: xxx Len: 3050 1:I 3031 Sc: -49.78 Pv: 2.228083e-01 NO_GPI_SITE GPI: learning from protozoa -16.90 0.00 0.00 0.00 -4.00 0.00 0.00 0.00 0.00 -9.07 -12.09 -12.00 -12.00 0.00 -12.00 0.00 -78.06 -12.04 0.00 0.00 -0.40 0.00 0.00 0.00 0.00 0.00 -9.35 -12.09 -12.00 -12.00 0.00 -12.00 0.00 -69.86 ID: sp|Q99715|CA1C_HUMAN AC: xxx Len: 3050 1:I 3030 Sc: -69.86 Pv: 3.441914e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? sp|Q99715|C 0.595 282 Y 0.666 24 Y 0.954 5 Y 0.824 Y # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? sp|Q99715|C 0.700 1649 Y 0.529 26 Y 0.987 9 Y 0.750 Y # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? sp|Q99715|C 0.567 1199 Y 0.448 24 Y 0.993 9 Y 0.904 Y ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). 1-4 MRSR lppalaalgaalll 5-18 19-68 SSIEAEVDPPSDLNFKIIDENTVHMSWAEP VDPIVGYRITVDPTTDGPTK eftlsasttetllselvpete 69-89 90-323 YVVTITSYDEVEESVPVIGQLTIQTGSSTK PVEKKPGKTEIQKCSVSAWTDLVFLVDGSW SVGRNNFKYILDFIAALVSAFDIGEEKTRV GVVQYSSDTRTEFNLNQYYQRDELLAAIKK IPYKGGNTMTGDAIDYLVKNTFTESAGARV GFPKVAIIITDGKSQDEVEIPARELRNVGV EVFSLGIKAADAKELKQIASTPSLNHVFNV ANFDAIVDIQNEIISQVCSGVDEQ lgelvsgeevvepps 324-338 339-1906 NLIAMEVSSKYVKLNWNPSPSPVTGYKVIL TPMTAGSRQHALSVGPQTTTLSVRDLSADT EYQISVSAMKGMTSSEPISIMEKTQPMKVQ VECSRGVDIKADIVFLVDGSYSIGIANFVK VRAFLEVLVKSFEISPNRVQISLVQYSRDP HTEFTLKKFTKVEDIIEAINTFPYRGGSTN TGKAMTYVREKIFVPSKGSRSNVPKVMILI TDGKSSDAFRDPAIKLRNSDVEIFAVGVKD AVRSELEAIASPPAETHVFTVEDFDAFQRI SFELTQSICLRIEQELAAIKKKAYVPPKDL SFSEVTSYGFKTNWSPAGENVFSYHITYKE AAGDDEVTVVEPASSTSVVLSSLKPETLYL VNVTAEYEDGFSIPLAGEETTEEVKGAPRN LKVTDETTDSFKITWTQAPGRVLRCRIIYR PVAGGESREVTTPPNQRRRTLENLIPDTKY EVSVIPEYFSGPGTPLTGNAATEEVRGNPR DLRVSDPTTSTMKLSWSGAPGKVKQYLVTY TPVAGGETQEVTVRGDTTNTVLQGLKEGTQ YALSVTALYASGAGDALFGEGTTLEERGSP QDLVTKDITDTSIGAYWTSAPGMVRGYRVS WKSLYDDVDTGEKNLPEDAIHTMIENLQPE TKYRISVFATYSSGEGEPLTGDATTELSQD SKTLKVDEETENTMRVTWKPAPGKVVNYRV VYRPHGRGKQMVAKVPPTVTSTVLKRLQPQ TTYDITVLPIYKMGEGKLRQGSGTTASRFK SPRNLKTSDPTMSSFRVTWEPAPGEVKGYK VTFHPTGDDRRLGELVVGPYDNTVVLEELR AGTTYKVNVFGMFDGGESSPLVGQEMTTLS DTTVMPILSSGMECLTRAEADIVLLVDGSW SIGRANFRTVRSFISRIVEVFDIGPKRVQI ALAQYSGDPRTEWQLNAHRDKKSLLQAVAN LPYKGGNTLTGMALNFIRQQNFRTQAGMRP RARKIGVLITDGKSQDDVEAPSKKLKDEGV ELFAIGIKNADEVELKMIATDPDDTHDYNV ADFESLSRIVDDLTINLCNSVKGPGDLEAP SNLVISERTHRSFRVSWTPPSDSVDRYKVE YYPVSGGKRQEFYVSRMETSTVLKDLKPET EYVVNVYSVVEDEYSEPLKGTEKTLPVPVV SLNIYDVGPTTMHVQWQPVGGATGYILSYK PVKDTEPTRPKEVRLGPTVNDMQLTDLVPN TEYAVTVQAVLHDLTSEPVTVREVTLPLPR PQDLKLRDVTHSTMNVFWEPVPGKVRKYIV RYKTPEEDVKEVEVDRSETSTSLKDLFSQT LYTVSVSAVHDEGESPPVTAQETTRPVPAP TNLKITEVTSEGFRGTWDHGASDVSLYRIT WGPFGSSDKMETILNGDENTLVFENLNPNT IYEVSITAIYADESESDDLIGSERTLPILT TQAPKSGPRNLQVYNATSNSLTVKWDPASG RVQKYRITYQPSTGEGNEQTTTIGGRQNSV VLQKLKPDTPYTITVSSLYPDGEGGRMTGR GKTKPLNTVRNLRVYDPSTSTLNVRWDHAE GNPRQYKLFYAPAAGGPEELVPIPGNTNYA ILRNLQPD tsytvtvvpvytegdggrtsdtgrt 1907-1931 1932-2293 LMRGLARNVQVYNPTPNRLGVRWDPAPGPV LQYRVVYSPVDGTRPSESIVVPGNTRMVHL ERLIPDTLYSVNLVALYSDGEGNPSPAQGR TLPRSGPRNLRVFGETTNSLSVAWDHADGP VQQYRIIYSPTVGDPIDEYTTVPGRRNNVI LQPLQPDTPYKITVIAVYEDGDGGHLTGNG RTVGLLPPQNIHISDEWYTRFRVSWDPSPS PVLGYKIVYKPVGSNEPMEAFVGEMTSYTL HNLNPSTTYDVNVYAQYDSGLSVPLTDQGT TLYLNVTDLKTYQIGWDTFCVKWSPHRAAT SYRLKLSPADGTRGQEITVRGSETSHCFTG LSPDTDYGVTVFVQTPNLEGPGVSVKEHTT VK pteaptepptppppptippa 2294-2313 2314-2746 RDVCKGAKADIVFLTDASWSIGDDNFNKVV KFIFNTVGGFDEISPAGIQVSFVQYSDEVK SEFKLNTYNDKALALGALQNIRYRGGNTRT GKALTFIKEKVLTWESGMRKNVPKVLVVVT DGRSQDEVKKAALVIQQSGFSVFVVGVADV DYNELANIASKPSERHVFIVDDFESFEKIE DNLITFVCETATSSCPLIYLDGYTSPGFKM LEAYNLTEKNFASVQGVSLESGSFPSYSAY RIQKNAFVNQPTADLHPNGLPPSYTIILLF RLLPETPSDPFAIWQITDRDYKPQVGVIAD PSSKTLSFFNKDTRGEVQTVTFDTEEVKTL FYGSFHKVHIVVTSKSVKIYIDCYEIIEKD IKEAGNITTDGYEILGKLLKGERKSAAFQI QSFDIVCSPVWTSRDRCCDIPSRRDEGKCP AFPNSCTCTQDSV gppgppgpaggpgakgprgerg 2747-2768 2769-2770 IS gaigppgprgdigppgpqgppgpqgpng 2771-2798 2799-2816 LSIPGEQGRQGMKGDAGE pglpgrtgtpglpgppgpmgppgdrg 2817-2842 2843-2850 FTGKDSAM gprgppgrpgspgspgvtgpsgkpgkpg 2851-2878 2879-2940 DHGRPGPSGLKGEKGDRGDIASQNMMRAVA RQVCEQLISGQMNRFNQMLNQIPNDYQSSR NQ pgppgppgppgsagargepgpggrpgfpgt 2941-2972 pg 2973-2989 MQGPPGERGLPGEKGER gtgssgprglpgppgpqgesrtgppgstgs 2990-3030 rgppgppgrpg 3031-3063 NSGIQGPPGPPGYCDSSQCASIPYNGQSYP GSG low complexity regions: SEG 25 3.0 3.3 >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). 1-4 MRSR lppalaalgaalll 5-18 19-57 SSIEAEVDPPSDLNFKIIDENTVHMSWAEP VDPIVGYRI tvdpttdgptkeftlsasttetllselvpe 58-118 teyvvtitsydeveesvpvigqltiqtgss t 119-305 KPVEKKPGKTEIQKCSVSAWTDLVFLVDGS WSVGRNNFKYILDFIAALVSAFDIGEEKTR VGVVQYSSDTRTEFNLNQYYQRDELLAAIK KIPYKGGNTMTGDAIDYLVKNTFTESAGAR VGFPKVAIIITDGKSQDEVEIPARELRNVG VEVFSLGIKAADAKELKQIASTPSLNHVFN VANFDAI vdiqneiisqvcsgvdeqlgelvsgeevve 306-335 336-673 PPSNLIAMEVSSKYVKLNWNPSPSPVTGYK VILTPMTAGSRQHALSVGPQTTTLSVRDLS ADTEYQISVSAMKGMTSSEPISIMEKTQPM KVQVECSRGVDIKADIVFLVDGSYSIGIAN FVKVRAFLEVLVKSFEISPNRVQISLVQYS RDPHTEFTLKKFTKVEDIIEAINTFPYRGG STNTGKAMTYVREKIFVPSKGSRSNVPKVM ILITDGKSSDAFRDPAIKLRNSDVEIFAVG VKDAVRSELEAIASPPAETHVFTVEDFDAF QRISFELTQSICLRIEQELAAIKKKAYVPP KDLSFSEVTSYGFKTNWSPAGENVFSYHIT YKEAAGDD evtvvepasstsvvlsslkpetlylvnvta 674-706 eye 707-1904 DGFSIPLAGEETTEEVKGAPRNLKVTDETT DSFKITWTQAPGRVLRCRIIYRPVAGGESR EVTTPPNQRRRTLENLIPDTKYEVSVIPEY FSGPGTPLTGNAATEEVRGNPRDLRVSDPT TSTMKLSWSGAPGKVKQYLVTYTPVAGGET QEVTVRGDTTNTVLQGLKEGTQYALSVTAL YASGAGDALFGEGTTLEERGSPQDLVTKDI TDTSIGAYWTSAPGMVRGYRVSWKSLYDDV DTGEKNLPEDAIHTMIENLQPETKYRISVF ATYSSGEGEPLTGDATTELSQDSKTLKVDE ETENTMRVTWKPAPGKVVNYRVVYRPHGRG KQMVAKVPPTVTSTVLKRLQPQTTYDITVL PIYKMGEGKLRQGSGTTASRFKSPRNLKTS DPTMSSFRVTWEPAPGEVKGYKVTFHPTGD DRRLGELVVGPYDNTVVLEELRAGTTYKVN VFGMFDGGESSPLVGQEMTTLSDTTVMPIL SSGMECLTRAEADIVLLVDGSWSIGRANFR TVRSFISRIVEVFDIGPKRVQIALAQYSGD PRTEWQLNAHRDKKSLLQAVANLPYKGGNT LTGMALNFIRQQNFRTQAGMRPRARKIGVL ITDGKSQDDVEAPSKKLKDEGVELFAIGIK NADEVELKMIATDPDDTHDYNVADFESLSR IVDDLTINLCNSVKGPGDLEAPSNLVISER THRSFRVSWTPPSDSVDRYKVEYYPVSGGK RQEFYVSRMETSTVLKDLKPETEYVVNVYS VVEDEYSEPLKGTEKTLPVPVVSLNIYDVG PTTMHVQWQPVGGATGYILSYKPVKDTEPT RPKEVRLGPTVNDMQLTDLVPNTEYAVTVQ AVLHDLTSEPVTVREVTLPLPRPQDLKLRD VTHSTMNVFWEPVPGKVRKYIVRYKTPEED VKEVEVDRSETSTSLKDLFSQTLYTVSVSA VHDEGESPPVTAQETTRPVPAPTNLKITEV TSEGFRGTWDHGASDVSLYRITWGPFGSSD KMETILNGDENTLVFENLNPNTIYEVSITA IYADESESDDLIGSERTLPILTTQAPKSGP RNLQVYNATSNSLTVKWDPASGRVQKYRIT YQPSTGEGNEQTTTIGGRQNSVVLQKLKPD TPYTITVSSLYPDGEGGRMTGRGKTKPLNT VRNLRVYDPSTSTLNVRWDHAEGNPRQYKL FYAPAAGGPEELVPIPGNTNYAILRNLQ pdtsytvtvvpvytegdggrtsdtgrt 1905-1931 1932-2293 LMRGLARNVQVYNPTPNRLGVRWDPAPGPV LQYRVVYSPVDGTRPSESIVVPGNTRMVHL ERLIPDTLYSVNLVALYSDGEGNPSPAQGR TLPRSGPRNLRVFGETTNSLSVAWDHADGP VQQYRIIYSPTVGDPIDEYTTVPGRRNNVI LQPLQPDTPYKITVIAVYEDGDGGHLTGNG RTVGLLPPQNIHISDEWYTRFRVSWDPSPS PVLGYKIVYKPVGSNEPMEAFVGEMTSYTL HNLNPSTTYDVNVYAQYDSGLSVPLTDQGT TLYLNVTDLKTYQIGWDTFCVKWSPHRAAT SYRLKLSPADGTRGQEITVRGSETSHCFTG LSPDTDYGVTVFVQTPNLEGPGVSVKEHTT VK pteaptepptppppptippa 2294-2313 2314-2746 RDVCKGAKADIVFLTDASWSIGDDNFNKVV KFIFNTVGGFDEISPAGIQVSFVQYSDEVK SEFKLNTYNDKALALGALQNIRYRGGNTRT GKALTFIKEKVLTWESGMRKNVPKVLVVVT DGRSQDEVKKAALVIQQSGFSVFVVGVADV DYNELANIASKPSERHVFIVDDFESFEKIE DNLITFVCETATSSCPLIYLDGYTSPGFKM LEAYNLTEKNFASVQGVSLESGSFPSYSAY RIQKNAFVNQPTADLHPNGLPPSYTIILLF RLLPETPSDPFAIWQITDRDYKPQVGVIAD PSSKTLSFFNKDTRGEVQTVTFDTEEVKTL FYGSFHKVHIVVTSKSVKIYIDCYEIIEKD IKEAGNITTDGYEILGKLLKGERKSAAFQI QSFDIVCSPVWTSRDRCCDIPSRRDEGKCP AFPNSCTCTQDSV gppgppgpaggpgakgprgergisgaigpp 2747-2901 gprgdigppgpqgppgpqgpnglsipgeqg rqgmkgdagepglpgrtgtpglpgppgpmg ppgdrgftgkdsamgprgppgrpgspgspg vtgpsgkpgkpgdhgrpgpsglkgekgdrg diasq 2902-2935 NMMRAVARQVCEQLISGQMNRFNQMLNQIP NDYQ ssrnqpgppgppgppgsagargepgpggrp 2936-3042 gfpgtpgmqgppgerglpgekgergtgssg prglpgppgpqgesrtgppgstgsrgppgp pgrpgnsgiqgppgppg 3043-3063 YCDSSQCASIPYNGQSYPGSG low complexity regions: SEG 45 3.4 3.75 >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). 1-46 MRSRLPPALAALGAALLLSSIEAEVDPPSD LNFKIIDENTVHMSWA epvdpivgyritvdpttdgptkeftlsast 47-151 tetllselvpeteyvvtitsydeveesvpv igqltiqtgsstkpvekkpgkteiqkcsvs awtdlvflvdgswsv 152-1391 GRNNFKYILDFIAALVSAFDIGEEKTRVGV VQYSSDTRTEFNLNQYYQRDELLAAIKKIP YKGGNTMTGDAIDYLVKNTFTESAGARVGF PKVAIIITDGKSQDEVEIPARELRNVGVEV FSLGIKAADAKELKQIASTPSLNHVFNVAN FDAIVDIQNEIISQVCSGVDEQLGELVSGE EVVEPPSNLIAMEVSSKYVKLNWNPSPSPV TGYKVILTPMTAGSRQHALSVGPQTTTLSV RDLSADTEYQISVSAMKGMTSSEPISIMEK TQPMKVQVECSRGVDIKADIVFLVDGSYSI GIANFVKVRAFLEVLVKSFEISPNRVQISL VQYSRDPHTEFTLKKFTKVEDIIEAINTFP YRGGSTNTGKAMTYVREKIFVPSKGSRSNV PKVMILITDGKSSDAFRDPAIKLRNSDVEI FAVGVKDAVRSELEAIASPPAETHVFTVED FDAFQRISFELTQSICLRIEQELAAIKKKA YVPPKDLSFSEVTSYGFKTNWSPAGENVFS YHITYKEAAGDDEVTVVEPASSTSVVLSSL KPETLYLVNVTAEYEDGFSIPLAGEETTEE VKGAPRNLKVTDETTDSFKITWTQAPGRVL RCRIIYRPVAGGESREVTTPPNQRRRTLEN LIPDTKYEVSVIPEYFSGPGTPLTGNAATE EVRGNPRDLRVSDPTTSTMKLSWSGAPGKV KQYLVTYTPVAGGETQEVTVRGDTTNTVLQ GLKEGTQYALSVTALYASGAGDALFGEGTT LEERGSPQDLVTKDITDTSIGAYWTSAPGM VRGYRVSWKSLYDDVDTGEKNLPEDAIHTM IENLQPETKYRISVFATYSSGEGEPLTGDA TTELSQDSKTLKVDEETENTMRVTWKPAPG KVVNYRVVYRPHGRGKQMVAKVPPTVTSTV LKRLQPQTTYDITVLPIYKMGEGKLRQGSG TTASRFKSPRNLKTSDPTMSSFRVTWEPAP GEVKGYKVTFHPTGDDRRLGELVVGPYDNT VVLEELRAGTTYKVNVFGMFDGGESSPLVG QEMTTLSDTTVMPILSSGMECLTRAEADIV LLVDGSWSIGRANFRTVRSFISRIVEVFDI GPKRVQIALAQYSGDPRTEWQLNAHRDKKS LLQAVANLPYKGGNTLTGMALNFIRQQNFR TQAGMRPRARKIGVLITDGKSQDDVEAPSK KLKDEGVELFAIGIKNADEVELKMIATDPD DTHDYNVADFESLSRIVDDLTINLCNSVKG PGDLEAPSNL viserthrsfrvswtppsdsvdrykveyyp 1392-1499 vsggkrqefyvsrmetstvlkdlkpeteyv vnvysvvedeyseplkgtektlpvpvvsln iydvgpttmhvqwqpvgg 1500-2263 ATGYILSYKPVKDTEPTRPKEVRLGPTVND MQLTDLVPNTEYAVTVQAVLHDLTSEPVTV REVTLPLPRPQDLKLRDVTHSTMNVFWEPV PGKVRKYIVRYKTPEEDVKEVEVDRSETST SLKDLFSQTLYTVSVSAVHDEGESPPVTAQ ETTRPVPAPTNLKITEVTSEGFRGTWDHGA SDVSLYRITWGPFGSSDKMETILNGDENTL VFENLNPNTIYEVSITAIYADESESDDLIG SERTLPILTTQAPKSGPRNLQVYNATSNSL TVKWDPASGRVQKYRITYQPSTGEGNEQTT TIGGRQNSVVLQKLKPDTPYTITVSSLYPD GEGGRMTGRGKTKPLNTVRNLRVYDPSTST LNVRWDHAEGNPRQYKLFYAPAAGGPEELV PIPGNTNYAILRNLQPDTSYTVTVVPVYTE GDGGRTSDTGRTLMRGLARNVQVYNPTPNR LGVRWDPAPGPVLQYRVVYSPVDGTRPSES IVVPGNTRMVHLERLIPDTLYSVNLVALYS DGEGNPSPAQGRTLPRSGPRNLRVFGETTN SLSVAWDHADGPVQQYRIIYSPTVGDPIDE YTTVPGRRNNVILQPLQPDTPYKITVIAVY EDGDGGHLTGNGRTVGLLPPQNIHISDEWY TRFRVSWDPSPSPVLGYKIVYKPVGSNEPM EAFVGEMTSYTLHNLNPSTTYDVNVYAQYD SGLSVPLTDQGTTLYLNVTDLKTYQIGWDT FCVKWSPHRAATSYRLKLSPADGTRGQEIT VRGSETSHCFTGLS pdtdygvtvfvqtpnlegpgvsvkehttvk 2264-2323 pteaptepptppppptippardvckgakad 2324-2746 IVFLTDASWSIGDDNFNKVVKFIFNTVGGF DEISPAGIQVSFVQYSDEVKSEFKLNTYND KALALGALQNIRYRGGNTRTGKALTFIKEK VLTWESGMRKNVPKVLVVVTDGRSQDEVKK AALVIQQSGFSVFVVGVADVDYNELANIAS KPSERHVFIVDDFESFEKIEDNLITFVCET ATSSCPLIYLDGYTSPGFKMLEAYNLTEKN FASVQGVSLESGSFPSYSAYRIQKNAFVNQ PTADLHPNGLPPSYTIILLFRLLPETPSDP FAIWQITDRDYKPQVGVIADPSSKTLSFFN KDTRGEVQTVTFDTEEVKTLFYGSFHKVHI VVTSKSVKIYIDCYEIIEKDIKEAGNITTD GYEILGKLLKGERKSAAFQIQSFDIVCSPV WTSRDRCCDIPSRRDEGKCPAFPNSCTCTQ DSV gppgppgpaggpgakgprgergisgaigpp 2747-2897 gprgdigppgpqgppgpqgpnglsipgeqg rqgmkgdagepglpgrtgtpglpgppgpmg ppgdrgftgkdsamgprgppgrpgspgspg vtgpsgkpgkpgdhgrpgpsglkgekgdrg d 2898-2935 IASQNMMRAVARQVCEQLISGQMNRFNQML NQIPNDYQ ssrnqpgppgppgppgsagargepgpggrp 2936-3042 gfpgtpgmqgppgerglpgekgergtgssg prglpgppgpqgesrtgppgstgsrgppgp pgrpgnsgiqgppgppg 3043-3063 YCDSSQCASIPYNGQSYPGSG low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVDPIVGYRITVD PTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKP VEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVG VVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVG FPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYVKLNWNPSPSP VTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIME KTQPMKVQVECSRGVDIKADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQIS LVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSN VPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKTNWSPAGENVF SYHITYKeaagddevtvvepasstsvvlSSLKPETLYLVNVTAEYEDGFSIPLAGEETTE EVKGAPRNLKVTDETTDSFKITWTQAPGRVLRCRIIYRPVAGGESREVTTPPNQRRRTLE NLIPDTKYEVSVIPEYFSGPGTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGK VKQYLVTYTPVAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHT MIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSKTLKVDEETENTMRVTWKPAP GKVVNYRVVYRPHGRGKQMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGKLRQGS GTTASRFKSPRNLKTSDPTMSSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDN TVVLEELRAGTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKK SLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPS KKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVADFESLSRIVDDLTINLCNSVK GPGDLEAPSNLVISERTHRSFRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTV LKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHDLTSEPVTVR EVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTS LKDLFSQTLYTVSVSAVHDEGESPPVTAQETTRPVPAPTNLKITEVTSEGFRGTWDHGAS DVSLYRITWGPFGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGS ERTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTL NVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEG DGGRTSDTGRTLMRGLARNVQVYNPTPNRLGVRWDPAPGPVLQYRVVYSPVDGTRPSESI VVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNS LSVAWDHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVYKPVGSNEPME AFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQGTTLYLNVTDLKTYQIGWDTF CVKWSPHRAATSYRLKLSPADGTRGQEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLE GPGVSVKEHTTVKpteaptepptppppptippaRDVCKGAKADIVFLTDASWSIGDDNFN KVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGV ADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFVCETATSSCPLIYLDGYTSPG FKMLEAYNLTEKNFASVQGVSLESGSFPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTII LLFRLLPETPSDPFAIWQITDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEV KTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVgppgppgpaggpga kgprgergisgaigppgprgdigppgpqgppgpqgpnglsipgeqgrqgmkgdagepglp grtgtpglpgppgpmgppgdrgFTGKDsamgprgppgrpgspgspgvtgpsgkpgkpgdh grpgpsglkgekgdrgdIASQNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRnq pgppgppgppgsagargepgpggrpgfpgtpgmqgppgerglpgekgergtgssgprglp gppgpqgesrtgppgstgsrgppgppgrpgnsgiqgppgppgYCDSSQCASIPYNGQSYP GSG 1 - 667 MRSRLPPALA ALGAALLLSS IEAEVDPPSD LNFKIIDENT VHMSWAEPVD PIVGYRITVD PTTDGPTKEF TLSASTTETL LSELVPETEY VVTITSYDEV EESVPVIGQL TIQTGSSTKP VEKKPGKTEI QKCSVSAWTD LVFLVDGSWS VGRNNFKYIL DFIAALVSAF DIGEEKTRVG VVQYSSDTRT EFNLNQYYQR DELLAAIKKI PYKGGNTMTG DAIDYLVKNT FTESAGARVG FPKVAIIITD GKSQDEVEIP ARELRNVGVE VFSLGIKAAD AKELKQIAST PSLNHVFNVA NFDAIVDIQN EIISQVCSGV DEQLGELVSG EEVVEPPSNL IAMEVSSKYV KLNWNPSPSP VTGYKVILTP MTAGSRQHAL SVGPQTTTLS VRDLSADTEY QISVSAMKGM TSSEPISIME KTQPMKVQVE CSRGVDIKAD IVFLVDGSYS IGIANFVKVR AFLEVLVKSF EISPNRVQIS LVQYSRDPHT EFTLKKFTKV EDIIEAINTF PYRGGSTNTG KAMTYVREKI FVPSKGSRSN VPKVMILITD GKSSDAFRDP AIKLRNSDVE IFAVGVKDAV RSELEAIASP PAETHVFTVE DFDAFQRISF ELTQSICLRI EQELAAIKKK AYVPPKDLSF SEVTSYGFKT NWSPAGENVF SYHITYK 668 - 688 eaa gddevtvvep asstsvvl 689 - 2293 SS LKPETLYLVN VTAEYEDGFS IPLAGEETTE EVKGAPRNLK VTDETTDSFK ITWTQAPG RV LRCRIIYRPV AGGESREVTT PPNQRRRTLE NLIPDTKYEV SVIPEYFSGP GTPLTGNA AT EEVRGNPRDL RVSDPTTSTM KLSWSGAPGK VKQYLVTYTP VAGGETQEVT VRGDTTNT VL QGLKEGTQYA LSVTALYASG AGDALFGEGT TLEERGSPQD LVTKDITDTS IGAYWTSA PG MVRGYRVSWK SLYDDVDTGE KNLPEDAIHT MIENLQPETK YRISVFATYS SGEGEPLT GD ATTELSQDSK TLKVDEETEN TMRVTWKPAP GKVVNYRVVY RPHGRGKQMV AKVPPTVT ST VLKRLQPQTT YDITVLPIYK MGEGKLRQGS GTTASRFKSP RNLKTSDPTM SSFRVTWE PA PGEVKGYKVT FHPTGDDRRL GELVVGPYDN TVVLEELRAG TTYKVNVFGM FDGGESSP LV GQEMTTLSDT TVMPILSSGM ECLTRAEADI VLLVDGSWSI GRANFRTVRS FISRIVEV FD IGPKRVQIAL AQYSGDPRTE WQLNAHRDKK SLLQAVANLP YKGGNTLTGM ALNFIRQQ NF RTQAGMRPRA RKIGVLITDG KSQDDVEAPS KKLKDEGVEL FAIGIKNADE VELKMIAT DP DDTHDYNVAD FESLSRIVDD LTINLCNSVK GPGDLEAPSN LVISERTHRS FRVSWTPP SD SVDRYKVEYY PVSGGKRQEF YVSRMETSTV LKDLKPETEY VVNVYSVVED EYSEPLKG TE KTLPVPVVSL NIYDVGPTTM HVQWQPVGGA TGYILSYKPV KDTEPTRPKE VRLGPTVN DM QLTDLVPNTE YAVTVQAVLH DLTSEPVTVR EVTLPLPRPQ DLKLRDVTHS TMNVFWEP VP GKVRKYIVRY KTPEEDVKEV EVDRSETSTS LKDLFSQTLY TVSVSAVHDE GESPPVTA QE TTRPVPAPTN LKITEVTSEG FRGTWDHGAS DVSLYRITWG PFGSSDKMET ILNGDENT LV FENLNPNTIY EVSITAIYAD ESESDDLIGS ERTLPILTTQ APKSGPRNLQ VYNATSNS LT VKWDPASGRV QKYRITYQPS TGEGNEQTTT IGGRQNSVVL QKLKPDTPYT ITVSSLYP DG EGGRMTGRGK TKPLNTVRNL RVYDPSTSTL NVRWDHAEGN PRQYKLFYAP AAGGPEEL VP IPGNTNYAIL RNLQPDTSYT VTVVPVYTEG DGGRTSDTGR TLMRGLARNV QVYNPTPN RL GVRWDPAPGP VLQYRVVYSP VDGTRPSESI VVPGNTRMVH LERLIPDTLY SVNLVALY SD GEGNPSPAQG RTLPRSGPRN LRVFGETTNS LSVAWDHADG PVQQYRIIYS PTVGDPID EY TTVPGRRNNV ILQPLQPDTP YKITVIAVYE DGDGGHLTGN GRTVGLLPPQ NIHISDEW YT RFRVSWDPSP SPVLGYKIVY KPVGSNEPME AFVGEMTSYT LHNLNPSTTY DVNVYAQY DS GLSVPLTDQG TTLYLNVTDL KTYQIGWDTF CVKWSPHRAA TSYRLKLSPA DGTRGQEI TV RGSETSHCFT GLSPDTDYGV TVFVQTPNLE GPGVSVKEHT TVK 2294 - 2313 pteapte pptpppppti ppa 2314 - 2746 RDVCKGA KADIVFLTDA SWSIGDDNFN KVVKFIFNTV GGFDEISPAG IQVSFVQYSD EVK SEFKLNT YNDKALALGA LQNIRYRGGN TRTGKALTFI KEKVLTWESG MRKNVPKVLV VVT DGRSQDE VKKAALVIQQ SGFSVFVVGV ADVDYNELAN IASKPSERHV FIVDDFESFE KIE DNLITFV CETATSSCPL IYLDGYTSPG FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAY RIQKNAF VNQPTADLHP NGLPPSYTII LLFRLLPETP SDPFAIWQIT DRDYKPQVGV IAD PSSKTLS FFNKDTRGEV QTVTFDTEEV KTLFYGSFHK VHIVVTSKSV KIYIDCYEII EKD IKEAGNI TTDGYEILGK LLKGERKSAA FQIQSFDIVC SPVWTSRDRC CDIPSRRDEG KCP AFPNSCT CTQDSV 2747 - 2842 gppg ppgpaggpga kgprgergis gaigppgprg digppgpqgp pgpqgpngls ipgeqg rqgm kgdagepglp grtgtpglpg ppgpmgppgd rg 2843 - 2847 FTGKD 2848 - 2897 sam gprgppgrpg spgspgvtgp sgkpgkpgdh grpgpsglkg ekgdrgd 2898 - 2938 IAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI PNDYQSSR 2939 - 3042 nq pgppgppgpp gsagargepg pggrpgfpgt pgmqgppger glpgekgerg tgssgprg lp gppgpqgesr tgppgstgsr gppgppgrpg nsgiqgppgp pg 3043 - 3063 YCDSSQCA SIPYNGQSYP GSG low complexity regions: DUST >sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVDPIVGYRITVD PTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKP VEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVG VVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVG FPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYVKLNWNPSPSP VTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIME KTQPMKVQVECSRGVDIKADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQIS LVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSN VPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKTNWSPAGENVF SYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGFSIPLAGEETTE EVKGAPRNLKVTDETTDSFKITWTQAPGRVLRCRIIYRPVAGGESREVTTPPNQRRRTLE NLIPDTKYEVSVIPEYFSGPGTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGK VKQYLVTYTPVAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHT MIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSKTLKVDEETENTMRVTWKPAP GKVVNYRVVYRPHGRGKQMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGKLRQGS GTTASRFKSPRNLKTSDPTMSSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDN TVVLEELRAGTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKK SLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPS KKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVADFESLSRIVDDLTINLCNSVK GPGDLEAPSNLVISERTHRSFRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTV LKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHDLTSEPVTVR EVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTS LKDLFSQTLYTVSVSAVHDEGESPPVTAQETTRPVPAPTNLKITEVTSEGFRGTWDHGAS DVSLYRITWGPFGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGS ERTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTL NVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEG DGGRTSDTGRTLMRGLARNVQVYNPTPNRLGVRWDPAPGPVLQYRVVYSPVDGTRPSESI VVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNS LSVAWDHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVYKPVGSNEPME AFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQGTTLYLNVTDLKTYQIGWDTF CVKWSPHRAATSYRLKLSPADGTRGQEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLE GPGVSVKEHTTVKPTEAPTEPPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFN KVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGV ADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFVCETATSSCPLIYLDGYTSPG FKMLEAYNLTEKNFASVQGVSLESGSFPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTII LLFRLLPETPSDPFAIWQITDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEV KTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPGPPGPAGGPGA KGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLSIPGEQGRQGMKGDAGEPGLP GRTGTPGLPGPPGPMGPPGDRGFTGKDSAMGPRGPPGRPGSPGSPGVTGPSGKPGKPGDH GRPGPSGLKGEKGDRGDIASQNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQ PGPPGPPGPPGSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPRGLP GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIQGPPGPPGYCDSSQCASIPYNGQSYP GSG ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for sp|Q99715|CA1C_HUMAN sequence: 3050 amino acids, 21 residue(s) in coiled coil state Coil 1 * 1603* 1624* EEDVKEVEVDRSETSTSLKDLF . | . | . | . | . | . 60 MRSRLPPALA ALGAALLLSS IEAEVDPPSD LNFKIIDENT VHMSWAEPVD PIVGYRITVD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 PTTDGPTKEF TLSASTTETL LSELVPETEY VVTITSYDEV EESVPVIGQL TIQTGSSTKP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 VEKKPGKTEI QKCSVSAWTD LVFLVDGSWS VGRNNFKYIL DFIAALVSAF DIGEEKTRVG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 VVQYSSDTRT EFNLNQYYQR DELLAAIKKI PYKGGNTMTG DAIDYLVKNT FTESAGARVG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 FPKVAIIITD GKSQDEVEIP ARELRNVGVE VFSLGIKAAD AKELKQIAST PSLNHVFNVA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 NFDAIVDIQN EIISQVCSGV DEQLGELVSG EEVVEPPSNL IAMEVSSKYV KLNWNPSPSP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 VTGYKVILTP MTAGSRQHAL SVGPQTTTLS VRDLSADTEY QISVSAMKGM TSSEPISIME ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 KTQPMKVQVE CSRGVDIKAD IVFLVDGSYS IGIANFVKVR AFLEVLVKSF EISPNRVQIS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 LVQYSRDPHT EFTLKKFTKV EDIIEAINTF PYRGGSTNTG KAMTYVREKI FVPSKGSRSN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 VPKVMILITD GKSSDAFRDP AIKLRNSDVE IFAVGVKDAV RSELEAIASP PAETHVFTVE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 DFDAFQRISF ELTQSICLRI EQELAAIKKK AYVPPKDLSF SEVTSYGFKT NWSPAGENVF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ 1111111111 1111111111 1~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~3444 4444444444 4~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 SYHITYKEAA GDDEVTVVEP ASSTSVVLSS LKPETLYLVN VTAEYEDGFS IPLAGEETTE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 EVKGAPRNLK VTDETTDSFK ITWTQAPGRV LRCRIIYRPV AGGESREVTT PPNQRRRTLE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 NLIPDTKYEV SVIPEYFSGP GTPLTGNAAT EEVRGNPRDL RVSDPTTSTM KLSWSGAPGK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 VKQYLVTYTP VAGGETQEVT VRGDTTNTVL QGLKEGTQYA LSVTALYASG AGDALFGEGT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 TLEERGSPQD LVTKDITDTS IGAYWTSAPG MVRGYRVSWK SLYDDVDTGE KNLPEDAIHT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 MIENLQPETK YRISVFATYS SGEGEPLTGD ATTELSQDSK TLKVDEETEN TMRVTWKPAP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 GKVVNYRVVY RPHGRGKQMV AKVPPTVTST VLKRLQPQTT YDITVLPIYK MGEGKLRQGS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1140 GTTASRFKSP RNLKTSDPTM SSFRVTWEPA PGEVKGYKVT FHPTGDDRRL GELVVGPYDN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1200 TVVLEELRAG TTYKVNVFGM FDGGESSPLV GQEMTTLSDT TVMPILSSGM ECLTRAEADI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1260 VLLVDGSWSI GRANFRTVRS FISRIVEVFD IGPKRVQIAL AQYSGDPRTE WQLNAHRDKK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~111111111 * 14 M'95 -w local . | . | . | . | . | . 1320 SLLQAVANLP YKGGNTLTGM ALNFIRQQNF RTQAGMRPRA RKIGVLITDG KSQDDVEAPS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 11111~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1380 KKLKDEGVEL FAIGIKNADE VELKMIATDP DDTHDYNVAD FESLSRIVDD LTINLCNSVK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~111111111 11111~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1440 GPGDLEAPSN LVISERTHRS FRVSWTPPSD SVDRYKVEYY PVSGGKRQEF YVSRMETSTV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1500 LKDLKPETEY VVNVYSVVED EYSEPLKGTE KTLPVPVVSL NIYDVGPTTM HVQWQPVGGA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1560 TGYILSYKPV KDTEPTRPKE VRLGPTVNDM QLTDLVPNTE YAVTVQAVLH DLTSEPVTVR ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1620 EVTLPLPRPQ DLKLRDVTHS TMNVFWEPVP GKVRKYIVRY KTPEEDVKEV EVDRSETSTS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~7777777 7777777777 * 21 M'95 -w border ---------- ---------- ---------- ---------- ---efgabcd efgabcdefg * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~3333333 3333333333 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~7777777 7777777777 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~3333333 3333333~~~ * 14 M'95 -w local . | . | . | . | . | . 1680 LKDLFSQTLY TVSVSAVHDE GESPPVTAQE TTRPVPAPTN LKITEVTSEG FRGTWDHGAS 7777~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border abcd------ ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register 3333~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar 77771~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1740 DVSLYRITWG PFGSSDKMET ILNGDENTLV FENLNPNTIY EVSITAIYAD ESESDDLIGS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1800 ERTLPILTTQ APKSGPRNLQ VYNATSNSLT VKWDPASGRV QKYRITYQPS TGEGNEQTTT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1860 IGGRQNSVVL QKLKPDTPYT ITVSSLYPDG EGGRMTGRGK TKPLNTVRNL RVYDPSTSTL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1920 NVRWDHAEGN PRQYKLFYAP AAGGPEELVP IPGNTNYAIL RNLQPDTSYT VTVVPVYTEG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1980 DGGRTSDTGR TLMRGLARNV QVYNPTPNRL GVRWDPAPGP VLQYRVVYSP VDGTRPSESI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2040 VVPGNTRMVH LERLIPDTLY SVNLVALYSD GEGNPSPAQG RTLPRSGPRN LRVFGETTNS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2100 LSVAWDHADG PVQQYRIIYS PTVGDPIDEY TTVPGRRNNV ILQPLQPDTP YKITVIAVYE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2160 DGDGGHLTGN GRTVGLLPPQ NIHISDEWYT RFRVSWDPSP SPVLGYKIVY KPVGSNEPME ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2220 AFVGEMTSYT LHNLNPSTTY DVNVYAQYDS GLSVPLTDQG TTLYLNVTDL KTYQIGWDTF ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2280 CVKWSPHRAA TSYRLKLSPA DGTRGQEITV RGSETSHCFT GLSPDTDYGV TVFVQTPNLE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2340 GPGVSVKEHT TVKPTEAPTE PPTPPPPPTI PPARDVCKGA KADIVFLTDA SWSIGDDNFN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2400 KVVKFIFNTV GGFDEISPAG IQVSFVQYSD EVKSEFKLNT YNDKALALGA LQNIRYRGGN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2460 TRTGKALTFI KEKVLTWESG MRKNVPKVLV VVTDGRSQDE VKKAALVIQQ SGFSVFVVGV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2520 ADVDYNELAN IASKPSERHV FIVDDFESFE KIEDNLITFV CETATSSCPL IYLDGYTSPG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2580 FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAYRIQKNAF VNQPTADLHP NGLPPSYTII ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2640 LLFRLLPETP SDPFAIWQIT DRDYKPQVGV IADPSSKTLS FFNKDTRGEV QTVTFDTEEV ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2700 KTLFYGSFHK VHIVVTSKSV KIYIDCYEII EKDIKEAGNI TTDGYEILGK LLKGERKSAA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2760 FQIQSFDIVC SPVWTSRDRC CDIPSRRDEG KCPAFPNSCT CTQDSVGPPG PPGPAGGPGA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2820 KGPRGERGIS GAIGPPGPRG DIGPPGPQGP PGPQGPNGLS IPGEQGRQGM KGDAGEPGLP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2880 GRTGTPGLPG PPGPMGPPGD RGFTGKDSAM GPRGPPGRPG SPGSPGVTGP SGKPGKPGDH ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 2940 GRPGPSGLKG EKGDRGDIAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI PNDYQSSRNQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 3000 PGPPGPPGPP GSAGARGEPG PGGRPGFPGT PGMQGPPGER GLPGEKGERG TGSSGPRGLP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | GPPGPQGESR TGPPGSTGSR GPPGPPGRPG NSGIQGPPGP PGYCDSSQCA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem36.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem36.___inter___ (1 sequences) MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVD PIVGYRITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV EESVPVIGQLTIQTGSSTKPVEKKPGKTEIQKCSVSAWTDLVFLVDGSWS VGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR DELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITD GKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYV KLNWNPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEY QISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYS IGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKV EDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITD GKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKT NWSPAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVN VTAEYEDGFSIPLAGEETTEEVKGAPRNLKVTDETTDSFKITWTQAPGRV LRCRIIYRPVAGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFSGP GTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTP VAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGE KNLPEDAIHTMIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSK TLKVDEETENTMRVTWKPAPGKVVNYRVVYRPHGRGKQMVAKVPPTVTST VLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLKTSDPTM SSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAG TTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTE WQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRA RKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDP DDTHDYNVADFESLSRIVDDLTINLCNSVKGPGDLEAPSNLVISERTHRS FRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTVLKDLKPETEY VVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLH DLTSEPVTVREVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRY KTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQE TTRPVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMET ILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGSERTLPILTTQ APKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNL RVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAIL RNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLARNVQVYNPTPNRL GVRWDPAPGPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLY SVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNSLSVAWDHADG PVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVY KPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQG TTLYLNVTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITV RGSETSHCFTGLSPDTDYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTE PPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFNKVVKFIFNTV GGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQ SGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFV CETATSSCPLIYLDGYTSPGFKMLEAYNLTEKNFASVQGVSLESGSFPSY SAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLLPETPSDPFAIWQIT DRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEVKTLFYGSFHK VHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPG PPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLS IPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRGFTGKDSAM GPRGPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRGDIAS QNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQPGPPGPPGPP GSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPRGLP GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIQGPPGPPGYCDSSQCA SIPYNGQSYPGSG (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 3 23 1.255 Certain 2 355 375 0.623 Putative 3 876 896 0.831 Putative 4 1174 1194 0.658 Putative 5 2443 2463 1.058 Certain 6 2495 2515 0.803 Putative ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 3 4 5 6 Loop length 2 852 277 1248 31 548 K+R profile 3.00 + 3.00 + + + CYT-EXT prof - 1.06 - 1.27 1.49 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 0.04 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.52 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 5 6 Loop length 2 331 798 1248 31 548 K+R profile 3.00 + 3.00 + + + CYT-EXT prof - 1.21 - 1.21 1.49 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 0.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.31 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 2 3 5 6 Loop length 2 331 500 1546 31 548 K+R profile 3.00 + 3.00 + + + CYT-EXT prof - 1.13 - 1.21 1.42 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 0.02 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.31 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 1 5 6 Loop length 2 2419 31 548 K+R profile 3.00 3.00 + + CYT-EXT prof - - 1.45 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 0.51 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.27 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 3 4 5 Loop length 2 331 500 277 1248 600 K+R profile 3.00 + + + + + CYT-EXT prof - 1.13 1.49 1.21 1.06 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.44 -> Orientation: N-in ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 1 3 4 5 Loop length 2 852 277 1248 600 K+R profile 3.00 + + + + CYT-EXT prof - 1.06 0.80 1.27 1.49 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.08 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.90 -> Orientation: N-in ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 2 4 5 Loop length 2 331 798 1248 600 K+R profile 3.00 + + + + CYT-EXT prof - 1.21 0.80 1.21 1.49 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.01 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.69 -> Orientation: N-in ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 1 4 5 Loop length 2 1150 1248 600 K+R profile 3.00 + + + CYT-EXT prof - 1.49 1.32 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.15 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.63 -> Orientation: N-in ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 5 Loop length 2 331 500 1546 600 K+R profile 3.00 + + + + CYT-EXT prof - 1.13 0.80 1.21 1.42 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.03 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.69 -> Orientation: N-in ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 1 3 5 Loop length 2 852 1546 600 K+R profile 3.00 + + + CYT-EXT prof - 1.42 1.27 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.58 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 2 5 Loop length 2 331 2067 600 K+R profile 3.00 + + + CYT-EXT prof - 1.42 1.21 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 0.06 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.59 -> Orientation: N-in ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 1 5 Loop length 2 2419 600 K+R profile 3.00 + + CYT-EXT prof - 0.80 1.45 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 3.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 2 331 500 277 1248 31 548 K+R profile 3.00 + + + + + 3.00 CYT-EXT prof - 1.13 1.49 0.82 1.21 1.06 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 1.18 -> Orientation: N-out ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 1 4 5 6 Loop length 2 1150 1248 31 548 K+R profile 3.00 + + + 3.00 CYT-EXT prof - 1.49 0.82 1.32 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.07 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.99 -> Orientation: N-out ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 3 5 6 Loop length 2 852 1546 31 548 K+R profile 3.00 + + + 3.00 CYT-EXT prof - 1.42 0.82 1.27 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.29 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.97 -> Orientation: N-out ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment 1 2 5 6 Loop length 2 331 2067 31 548 K+R profile 3.00 + + + 3.00 CYT-EXT prof - 1.42 0.82 1.21 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 0.03 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 1.03 -> Orientation: N-out ---------------------------------------------------------------------- "tem36" 3063 3 23 #t 1.25521 355 375 #f 0.622917 876 896 #f 0.83125 1174 1194 #f 0.658333 2443 2463 #t 1.05833 2495 2515 #f 0.803125 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem36.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem36.___inter___ (1 sequences) MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAEPVD PIVGYRITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV EESVPVIGQLTIQTGSSTKPVEKKPGKTEIQKCSVSAWTDLVFLVDGSWS VGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR DELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITD GKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA NFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYV KLNWNPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEY QISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYS IGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKV EDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITD GKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE DFDAFQRISFELTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKT NWSPAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVN VTAEYEDGFSIPLAGEETTEEVKGAPRNLKVTDETTDSFKITWTQAPGRV LRCRIIYRPVAGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFSGP GTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTP VAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGT TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGE KNLPEDAIHTMIENLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSK TLKVDEETENTMRVTWKPAPGKVVNYRVVYRPHGRGKQMVAKVPPTVTST VLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLKTSDPTM SSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAG TTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADI VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTE WQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRA RKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDP DDTHDYNVADFESLSRIVDDLTINLCNSVKGPGDLEAPSNLVISERTHRS FRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTVLKDLKPETEY VVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLH DLTSEPVTVREVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRY KTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQE TTRPVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMET ILNGDENTLVFENLNPNTIYEVSITAIYADESESDDLIGSERTLPILTTQ APKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNL RVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAIL RNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLARNVQVYNPTPNRL GVRWDPAPGPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLY SVNLVALYSDGEGNPSPAQGRTLPRSGPRNLRVFGETTNSLSVAWDHADG PVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE DGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVY KPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVPLTDQG TTLYLNVTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITV RGSETSHCFTGLSPDTDYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTE PPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFNKVVKFIFNTV GGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQ SGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFESFEKIEDNLITFV CETATSSCPLIYLDGYTSPGFKMLEAYNLTEKNFASVQGVSLESGSFPSY SAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLLPETPSDPFAIWQIT DRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEVKTLFYGSFHK VHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPG PPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLS IPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRGFTGKDSAM GPRGPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRGDIAS QNMMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQPGPPGPPGPP GSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPRGLP GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIQGPPGPPGYCDSSQCA SIPYNGQSYPGSG (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 16 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 3 23 1.255 Certain 2 355 375 0.623 Putative 3 876 896 0.831 Putative 4 1174 1194 0.658 Putative 5 2443 2463 1.058 Certain 6 2495 2515 0.803 Putative ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 3 4 5 6 Loop length 2 852 277 1248 31 548 K+R profile 2.00 + 3.00 + + + CYT-EXT prof - 1.06 - 1.27 1.49 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 5.00 Tm probability: 0.04 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.52 -> Orientation: N-in ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 1 2 4 5 6 Loop length 2 331 798 1248 31 548 K+R profile 2.00 + 3.00 + + + CYT-EXT prof - 1.21 - 1.21 1.49 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 5.00 Tm probability: 0.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.31 -> Orientation: N-in ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 2 3 5 6 Loop length 2 331 500 1546 31 548 K+R profile 2.00 + 3.00 + + + CYT-EXT prof - 1.13 - 1.21 1.42 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 5.00 Tm probability: 0.02 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.31 -> Orientation: N-in ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 1 5 6 Loop length 2 2419 31 548 K+R profile 2.00 3.00 + + CYT-EXT prof - - 1.45 0.82 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 5.00 Tm probability: 0.51 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -2.27 -> Orientation: N-in ---------------------------------------------------------------------- Structure 5 Transmembrane segments included in this structure: Segment 1 2 3 4 5 Loop length 2 331 500 277 1248 600 K+R profile 2.00 + + + + + CYT-EXT prof - 1.13 1.49 1.21 1.06 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.44 -> Orientation: N-in ---------------------------------------------------------------------- Structure 6 Transmembrane segments included in this structure: Segment 1 3 4 5 Loop length 2 852 277 1248 600 K+R profile 2.00 + + + + CYT-EXT prof - 1.06 0.80 1.27 1.49 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.08 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.90 -> Orientation: N-in ---------------------------------------------------------------------- Structure 7 Transmembrane segments included in this structure: Segment 1 2 4 5 Loop length 2 331 798 1248 600 K+R profile 2.00 + + + + CYT-EXT prof - 1.21 0.80 1.21 1.49 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.01 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.69 -> Orientation: N-in ---------------------------------------------------------------------- Structure 8 Transmembrane segments included in this structure: Segment 1 4 5 Loop length 2 1150 1248 600 K+R profile 2.00 + + + CYT-EXT prof - 1.49 1.32 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.15 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.63 -> Orientation: N-in ---------------------------------------------------------------------- Structure 9 Transmembrane segments included in this structure: Segment 1 2 3 5 Loop length 2 331 500 1546 600 K+R profile 2.00 + + + + CYT-EXT prof - 1.13 0.80 1.21 1.42 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.03 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.69 -> Orientation: N-in ---------------------------------------------------------------------- Structure 10 Transmembrane segments included in this structure: Segment 1 3 5 Loop length 2 852 1546 600 K+R profile 2.00 + + + CYT-EXT prof - 1.42 1.27 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.58 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 11 Transmembrane segments included in this structure: Segment 1 2 5 Loop length 2 331 2067 600 K+R profile 2.00 + + + CYT-EXT prof - 1.42 1.21 0.80 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 0.06 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.59 -> Orientation: N-in ---------------------------------------------------------------------- Structure 12 Transmembrane segments included in this structure: Segment 1 5 Loop length 2 2419 600 K+R profile 2.00 + + CYT-EXT prof - 0.80 1.45 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 2.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: -0.65 -> Orientation: N-in ---------------------------------------------------------------------- Structure 13 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 2 331 500 277 1248 31 548 K+R profile 2.00 + + + + + 3.00 CYT-EXT prof - 1.13 1.49 0.82 1.21 1.06 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 0.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 1.18 -> Orientation: N-out ---------------------------------------------------------------------- Structure 14 Transmembrane segments included in this structure: Segment 1 4 5 6 Loop length 2 1150 1248 31 548 K+R profile 2.00 + + + 3.00 CYT-EXT prof - 1.49 0.82 1.32 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 0.07 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.99 -> Orientation: N-out ---------------------------------------------------------------------- Structure 15 Transmembrane segments included in this structure: Segment 1 3 5 6 Loop length 2 852 1546 31 548 K+R profile 2.00 + + + 3.00 CYT-EXT prof - 1.42 0.82 1.27 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 0.29 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 0.97 -> Orientation: N-out ---------------------------------------------------------------------- Structure 16 Transmembrane segments included in this structure: Segment 1 2 5 6 Loop length 2 331 2067 31 548 K+R profile 2.00 + + + 3.00 CYT-EXT prof - 1.42 0.82 1.21 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -1.00 Tm probability: 0.03 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 6.00 (NEG-POS)/(NEG+POS): -1.0000 NEG: 0.0000 POS: 2.0000 -> Orientation: N-in CYT-EXT difference: 1.03 -> Orientation: N-out ---------------------------------------------------------------------- "tem36" 3063 3 23 #t 1.25521 355 375 #f 0.622917 876 896 #f 0.83125 1174 1194 #f 0.658333 2443 2463 #t 1.05833 2495 2515 #f 0.803125 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem36.___saps___ SAPS. Version of April 11, 1996. Date run: Sun Jan 7 20:25:26 2001 File: /people/maria/tem36.___saps___ ID sp|Q99715|CA1C_HUMAN DE COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). number of residues: 3063; molecular weight: 333.2 kdal 1 MRSRLPPALA ALGAALLLSS IEAEVDPPSD LNFKIIDENT VHMSWAEPVD PIVGYRITVD 61 PTTDGPTKEF TLSASTTETL LSELVPETEY VVTITSYDEV EESVPVIGQL TIQTGSSTKP 121 VEKKPGKTEI QKCSVSAWTD LVFLVDGSWS VGRNNFKYIL DFIAALVSAF DIGEEKTRVG 181 VVQYSSDTRT EFNLNQYYQR DELLAAIKKI PYKGGNTMTG DAIDYLVKNT FTESAGARVG 241 FPKVAIIITD GKSQDEVEIP ARELRNVGVE VFSLGIKAAD AKELKQIAST PSLNHVFNVA 301 NFDAIVDIQN EIISQVCSGV DEQLGELVSG EEVVEPPSNL IAMEVSSKYV KLNWNPSPSP 361 VTGYKVILTP MTAGSRQHAL SVGPQTTTLS VRDLSADTEY QISVSAMKGM TSSEPISIME 421 KTQPMKVQVE CSRGVDIKAD IVFLVDGSYS IGIANFVKVR AFLEVLVKSF EISPNRVQIS 481 LVQYSRDPHT EFTLKKFTKV EDIIEAINTF PYRGGSTNTG KAMTYVREKI FVPSKGSRSN 541 VPKVMILITD GKSSDAFRDP AIKLRNSDVE IFAVGVKDAV RSELEAIASP PAETHVFTVE 601 DFDAFQRISF ELTQSICLRI EQELAAIKKK AYVPPKDLSF SEVTSYGFKT NWSPAGENVF 661 SYHITYKEAA GDDEVTVVEP ASSTSVVLSS LKPETLYLVN VTAEYEDGFS IPLAGEETTE 721 EVKGAPRNLK VTDETTDSFK ITWTQAPGRV LRCRIIYRPV AGGESREVTT PPNQRRRTLE 781 NLIPDTKYEV SVIPEYFSGP GTPLTGNAAT EEVRGNPRDL RVSDPTTSTM KLSWSGAPGK 841 VKQYLVTYTP VAGGETQEVT VRGDTTNTVL QGLKEGTQYA LSVTALYASG AGDALFGEGT 901 TLEERGSPQD LVTKDITDTS IGAYWTSAPG MVRGYRVSWK SLYDDVDTGE KNLPEDAIHT 961 MIENLQPETK YRISVFATYS SGEGEPLTGD ATTELSQDSK TLKVDEETEN TMRVTWKPAP 1021 GKVVNYRVVY RPHGRGKQMV AKVPPTVTST VLKRLQPQTT YDITVLPIYK MGEGKLRQGS 1081 GTTASRFKSP RNLKTSDPTM SSFRVTWEPA PGEVKGYKVT FHPTGDDRRL GELVVGPYDN 1141 TVVLEELRAG TTYKVNVFGM FDGGESSPLV GQEMTTLSDT TVMPILSSGM ECLTRAEADI 1201 VLLVDGSWSI GRANFRTVRS FISRIVEVFD IGPKRVQIAL AQYSGDPRTE WQLNAHRDKK 1261 SLLQAVANLP YKGGNTLTGM ALNFIRQQNF RTQAGMRPRA RKIGVLITDG KSQDDVEAPS 1321 KKLKDEGVEL FAIGIKNADE VELKMIATDP DDTHDYNVAD FESLSRIVDD LTINLCNSVK 1381 GPGDLEAPSN LVISERTHRS FRVSWTPPSD SVDRYKVEYY PVSGGKRQEF YVSRMETSTV 1441 LKDLKPETEY VVNVYSVVED EYSEPLKGTE KTLPVPVVSL NIYDVGPTTM HVQWQPVGGA 1501 TGYILSYKPV KDTEPTRPKE VRLGPTVNDM QLTDLVPNTE YAVTVQAVLH DLTSEPVTVR 1561 EVTLPLPRPQ DLKLRDVTHS TMNVFWEPVP GKVRKYIVRY KTPEEDVKEV EVDRSETSTS 1621 LKDLFSQTLY TVSVSAVHDE GESPPVTAQE TTRPVPAPTN LKITEVTSEG FRGTWDHGAS 1681 DVSLYRITWG PFGSSDKMET ILNGDENTLV FENLNPNTIY EVSITAIYAD ESESDDLIGS 1741 ERTLPILTTQ APKSGPRNLQ VYNATSNSLT VKWDPASGRV QKYRITYQPS TGEGNEQTTT 1801 IGGRQNSVVL QKLKPDTPYT ITVSSLYPDG EGGRMTGRGK TKPLNTVRNL RVYDPSTSTL 1861 NVRWDHAEGN PRQYKLFYAP AAGGPEELVP IPGNTNYAIL RNLQPDTSYT VTVVPVYTEG 1921 DGGRTSDTGR TLMRGLARNV QVYNPTPNRL GVRWDPAPGP VLQYRVVYSP VDGTRPSESI 1981 VVPGNTRMVH LERLIPDTLY SVNLVALYSD GEGNPSPAQG RTLPRSGPRN LRVFGETTNS 2041 LSVAWDHADG PVQQYRIIYS PTVGDPIDEY TTVPGRRNNV ILQPLQPDTP YKITVIAVYE 2101 DGDGGHLTGN GRTVGLLPPQ NIHISDEWYT RFRVSWDPSP SPVLGYKIVY KPVGSNEPME 2161 AFVGEMTSYT LHNLNPSTTY DVNVYAQYDS GLSVPLTDQG TTLYLNVTDL KTYQIGWDTF 2221 CVKWSPHRAA TSYRLKLSPA DGTRGQEITV RGSETSHCFT GLSPDTDYGV TVFVQTPNLE 2281 GPGVSVKEHT TVKPTEAPTE PPTPPPPPTI PPARDVCKGA KADIVFLTDA SWSIGDDNFN 2341 KVVKFIFNTV GGFDEISPAG IQVSFVQYSD EVKSEFKLNT YNDKALALGA LQNIRYRGGN 2401 TRTGKALTFI KEKVLTWESG MRKNVPKVLV VVTDGRSQDE VKKAALVIQQ SGFSVFVVGV 2461 ADVDYNELAN IASKPSERHV FIVDDFESFE KIEDNLITFV CETATSSCPL IYLDGYTSPG 2521 FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAYRIQKNAF VNQPTADLHP NGLPPSYTII 2581 LLFRLLPETP SDPFAIWQIT DRDYKPQVGV IADPSSKTLS FFNKDTRGEV QTVTFDTEEV 2641 KTLFYGSFHK VHIVVTSKSV KIYIDCYEII EKDIKEAGNI TTDGYEILGK LLKGERKSAA 2701 FQIQSFDIVC SPVWTSRDRC CDIPSRRDEG KCPAFPNSCT CTQDSVGPPG PPGPAGGPGA 2761 KGPRGERGIS GAIGPPGPRG DIGPPGPQGP PGPQGPNGLS IPGEQGRQGM KGDAGEPGLP 2821 GRTGTPGLPG PPGPMGPPGD RGFTGKDSAM GPRGPPGRPG SPGSPGVTGP SGKPGKPGDH 2881 GRPGPSGLKG EKGDRGDIAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI PNDYQSSRNQ 2941 PGPPGPPGPP GSAGARGEPG PGGRPGFPGT PGMQGPPGER GLPGEKGERG TGSSGPRGLP 3001 GPPGPQGESR TGPPGSTGSR GPPGPPGRPG NSGIQGPPGP PGYCDSSQCA SIPYNGQSYP 3061 GSG -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A :158( 5.2%); C : 22( 0.7%); D :172( 5.6%); E :195( 6.4%); F : 91( 3.0%) G :281( 9.2%); H : 31( 1.0%); I :148( 4.8%); K :155( 5.1%); L :203( 6.6%) M : 45( 1.5%); N :118( 3.9%); P :261( 8.5%); Q :105( 3.4%); R :158( 5.2%) S :242( 7.9%); T :266( 8.7%); V :270( 8.8%); W : 30( 1.0%); Y :112( 3.7%) KR : 313 ( 10.2%); ED : 367 ( 12.0%); AGP : 700 ( 22.9%); KRED : 680 ( 22.2%); KR-ED : -54 ( -1.8%); FIKMNY : 669 ( 21.8%); LVIFM : 757 ( 24.7%); ST : 508 ( 16.6%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0+0+000000 0000000000 0-0-0-000- 000+00--00 000000-00- 00000+000- 61 000-000+-0 0000000-00 00-000-0-0 0000000--0 --00000000 00000000+0 121 0-++00+0-0 0+0000000- 00000-0000 00+000+000 -000000000 -00--+0+00 181 000000-0+0 -00000000+ --00000++0 00+0000000 -00-000+00 00-0000+00 241 00+000000- 0+00--0-00 0+-0+0000- 000000+00- 0+-0+00000 0000000000 301 00-000-000 -000000000 --000-0000 --00-00000 000-000+00 +000000000 361 0000+00000 00000+0000 0000000000 0+-000-0-0 0000000+00 000-00000- 421 +0000+000- 00+00-0+0- 00000-0000 0000000+0+ 000-000+00 -0000+0000 481 00000+-000 -000++00+0 --00-00000 00+0000000 +00000+-+0 0000+00+00 541 00+000000- 0+00-00+-0 00+0+00-0- 000000+-00 +0-0-00000 00-000000- 601 -0-000+000 -0000000+0 -0-0000+++ 00000+-000 0-000000+0 000000-000 661 000000+-00 0---0000-0 0000000000 0+0-000000 000-0--000 00000--00- 721 -0+000+00+ 00--00-00+ 00000000+0 0+0+000+00 000-0+-000 0000+++00- 781 0000-0+0-0 0000-00000 0000000000 --0+000+-0 +00-000000 +00000000+ 841 0+00000000 0000-00-00 0+0-000000 000+-00000 0000000000 00-0000-00 901 00--+0000- 000+-00-00 0000000000 00+00+000+ 000--0-00- +000--0000 961 00-0000-0+ 0+00000000 00-0-0000- 000-000-0+ 00+0---0-0 00+000+000 1021 0+0000+000 +000+0+000 0+00000000 00++000000 0-0000000+ 00-0+0+000 1081 00000+0+00 +00+00-000 000+000-00 00-0+00+00 00000--++0 0-000000-0 1141 0000--0+00 000+000000 0-00-00000 00-00000-0 0000000000 -000+0-0-0 1201 0000-00000 0+000+00+0 000+00-00- 000++00000 00000-0+0- 000000+-++ 1261 0000000000 0+00000000 00000+0000 +00000+0+0 ++000000-0 +00--0-000 1321 ++0+--00-0 00000+00-- 0-0+0000-0 --00-0000- 0-000+00-- 000000000+ 1381 000-0-0000 0000-+00+0 0+0000000- 00-+0+0-00 00000++0-0 000+0-0000 1441 0+-0+0-0-0 00000000-- -00-00+00- +000000000 000-000000 0000000000 1501 0000000+00 +-0-00+0+- 0+000000-0 000-00000- 0000000000 -000-0000+ 1561 -000000+00 -0+0+-0000 000000-000 0+0++000+0 +00---0+-0 -0-+0-0000 1621 0+-0000000 00000000-- 0-0000000- 00+0000000 0+00-000-0 0+000-0000 1681 -0000+0000 00000-+0-0 0000--0000 0-00000000 -00000000- -0-0--0000 1741 -+00000000 00+000+000 0000000000 0+0-0000+0 0+0+000000 00-00-0000 1801 000+000000 0+0+0-0000 00000000-0 -00+000+0+ 0+00000+00 +00-000000 1861 00+0-00-00 0+00+00000 00000--000 0000000000 +0000-0000 00000000-0 1921 -00+00-00+ 000+000+00 00000000+0 00+0-00000 0000+00000 0-00+00-00 1981 000000+000 0-+000-000 000000000- 0-00000000 +000+000+0 0+000-0000 2041 00000-00-0 00000+0000 0000-00--0 00000++000 0000000-00 0+0000000- 2101 -0-0000000 0+00000000 00000--000 +0+000-000 000000+000 +00000-00- 2161 0000-00000 0000000000 -0000000-0 0000000-00 00000000-0 +000000-00 2221 00+0000+00 000+0+0000 -00+00-000 +00-000000 0000-0-000 000000000- 2281 000000+-00 00+00-000- 0000000000 000+-00+00 +0-00000-0 00000--000 2341 +00+000000 000--00000 000000000- -0+0-0+000 00-+000000 0000+0+000 2401 0+00+00000 +-+0000-00 0++000+000 000-0+00-- 0++0000000 0000000000 2461 0-0-00-000 000+00-+00 000--0-00- +0--000000 0-00000000 000-000000 2521 0+00-00000 -+00000000 00-0000000 000+00+000 000000-000 0000000000 2581 000+000-00 0-00000000 -+-0+00000 00-000+000 000+-0+0-0 00000-0--0 2641 +00000000+ 0000000+00 +000-00-00 -+-0+-0000 00-00-000+ 00+0-++000 2701 000000-000 000000+-+0 0-000++--0 +000000000 000-000000 0000000000 2761 +00+0-+000 00000000+0 -000000000 0000000000 000-00+000 +0-00-0000 2821 0+00000000 000000000- +0000+-000 00+0000+00 0000000000 00+00+00-0 2881 0+000000+0 -+0-+0-000 0000+000+0 00-0000000 0+00000000 00-0000+00 2941 0000000000 00000+0-00 000+000000 00000000-+ 0000-+0-+0 000000+000 3001 0000000-0+ 000000000+ 0000000+00 0000000000 0000-00000 0000000000 3061 000 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 11/30 or 14/45 or 17/60): none Negative charge clusters (cmin = 12/30 or 16/45 or 19/60): none Mixed charge clusters (cmin = 18/30 or 23/45 or 29/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 6 | 8 | 44 | 11 | 11 | 15 | 13 | 14 | 18 | 6 | 8 | lmin1 7 | 7 | 10 | 53 | 13 | 14 | 18 | 16 | 17 | 21 | 8 | 10 | lmin2 8 | 9 | 12 | 59 | 15 | 15 | 20 | 18 | 19 | 24 | 9 | 12 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 4: 0 - runs >= 4: 0 * runs >= 5: 0 0 runs >= 30: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-132-C-183-C-113-C-185-C-135-C-438-C-183-C-844-C-36-C-58-C-183-C-6-C-157-C-43-C-9-C-C-10-C-6-C-1-C-170-C-131-C-4-C-14-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-41-H-90-C-161-H-21-C-60-H-52-C-57-H-105-H-21-C-45-H-89-C-205-H-73-H-88-H-69-C-63-H-97-H-21-C-21-H-92-H-58-H-28-H-58-H-38-H-188-H-123-H-56-H-58-H-16-H-48-H-48-C-5-H-29-H-C-30-H-27-C-161-H-21-C-6-C-60-H-79-H-2-H-13-C-43-C-9-C-C-10-C-6-C-1-C-138-H-31-C-131-C-4-C-14-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 6 Aligned matching blocks: [ 86- 92] PETEYVV [1446-1452] PETEYVV ______________________________ [ 142- 148]-( -7)-[ 142- 156]--------[ ]--------[ 144- 150] [ 442- 448]--------[ ]--------[ 439- 456]-( -13)-[ 444- 450] [1201-1207]-( -7)-[1201-1215]-( -18)-[1198-1215]-( -13)-[1203-1209] [ 142- 148] VFLVDGS [ 442- 448] VFLVDGS [1201-1207] VLLVDGS [ 142- 156] VFLVDGSWSVGRNNF [1201-1215] VLLVDGSWSIGRANF [ 439- 456] ADIVFLVDGSYSIGIANF [1198-1215] ADIVLLVDGSWSIGRANF [ 144- 150] LVDGSWS [ 444- 450] LVDGSYS [1203-1209] LVDGSWS ______________________________ [ 204- 209] LAAIKK [ 624- 629] LAAIKK ______________________________ [ 211- 220]-( 27)-[ 248- 253]-( -6)-[ 248- 258] [ ]--------[ 547- 553]--------[ ] [1270-1279]-( 26)-[1306-1312]-( -6)-[1307-1317] [ 211- 220] PYKGGNTMTG [1270-1279] PYKGGNTLTG [ 248- 253] _ITDGKS [ 547- 553] LITDGKS [1306-1312] LITDGKS [ 248- 258] ITDGKSQDEVE [1307-1317] ITDGKSQDDVE ______________________________ [ 356- 365] PSPSPVTGYK [2138-2147] PSPSPVLGYK ______________________________ [ 438- 444] KADIVFL [2321-2327] KADIVFL ______________________________ [ 759- 769] PVAGGESREVT [ 850- 860] PVAGGETQEVT ______________________________ [1436-1444] ETSTVLKDL [1616-1624] ETSTSLKDL with superset: [1048-1053] TSTVLK [1437-1442] TSTVLK [1617-1622] TSTSLK ______________________________ [1754-1759] SGPRNL [2026-2031] SGPRNL [2994-2999] SGPRGL ______________________________ [2749-2758] PGPPGPAGGP [2785-2793] PGPQGPPG_P [2829-2837] PGPPG_PMGP ______________________________ [2773-2778] IGPPGP [2782-2787] IGPPGP ______________________________ [2788-2793] QGPPGP [3035-3040] QGPPGP ______________________________ [2829-2839] PGPPGPMGPPG [2941-2951] PGPPGPPGPPG with superset: [2749-2754] PGPPGP [2785-2790] PGPQGP [2791-2796] PGPQGP [2829-2834] PGPPGP [2941-2946] PGPPGP [2944-2949] PGPPGP [3000-3005] PGPPGP and: [2749-2758] PGPPGPAGGP [2785-2793] PGPQGP_PGP [2829-2837] PGPPGP_MGP [2941-2949] PGPPGP_PGP ______________________________ [2854-2870] GPPGRPGSPGSPGVTGP [2960-2976] GPGGRPGFPGTPGMQGP with superset: [2747-2753] GPPGPPG [2854-2860] GPPGRPG [2942-2948] GPPGPPG [2960-2966] GPGGRPG [3024-3030] GPPGRPG [3036-3042] GPPGPPG and: [2854-2863] GPPGRPGSPG [2942-2951] GPPGPPGPPG [2960-2969] GPGGRPGFPG Highly repetitive regions: From 2747 to 3042 with major motif GPPGPPGP. B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 10 Aligned matching blocks: [ 243- 258]-( -16)-[ 243- 257]-( -17)-[ 241- 265]-( -6)-[ 260- 281] [ ]--------[ 543- 557]-( -17)-[ 541- 563]-( -4)-[ 560- 579] [1302-1317]-( -16)-[1302-1316]--------[ ]--------[ ] [ ]--------[2427-2441]--------[ ]--------[ ] [ 243- 258] +isiiio-s+on--i- [1302-1317] +isiiio-s+on--i- [ 243- 257] +isiiio-s+on--i [ 543- 557] +imiiio-s+oo-si [1302-1316] +isiiio-s+on--i [2427-2441] +iiiiio-s+on--i [ 241- 265] ip+isiiio-s+on--i-ips+-i+ [ 541- 563] ip+imiiio-s+oo-si+-ps__i+ [ 260- 281] ps+-i+nisi-iioisi+ss-s [ 560- 579] psi+i+no-i-iisisi+__-s ______________________________ [ 438- 463] +s-iiiii-soaoisisnii+i+sii [2321-2346] +s-iiiio-soaois--nin+ii+ii with superset: [ 439- 456] s-iiiii-soaoisisni [1198-1215] s-iiiii-soaois+sni [2322-2339] s-iiiio-soaois--ni -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 184 (Expected range: 143--223) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 62 (6-10) 35 (11-20) 38 (>=21) 50 3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 17/60): none 4. Significant specific amino acid altplet counts: Letters Observed (Critical number) LN 33 (31) at 31 (l= 2) 193 (l= 3) 293 (l= 2) 339 (l= 2) 352 (l= 2) 728 (l= 2) 781 (l= 2) 952 (l= 2) 964 (l= 2) 1092 (l= 2) 1253 (l= 2) 1268 (l= 2) 1282 (l= 2) 1374 (l= 2) 1390 (l= 2) 1480 (l= 2) 1660 (l= 2) 1702 (l= 2) 1713 (l= 3) 1758 (l= 2) 1844 (l= 2) 1849 (l= 2) 1860 (l= 2) 1902 (l= 2) 2003 (l= 2) 2030 (l= 2) 2173 (l= 3) 2205 (l= 2) 2278 (l= 2) 2378 (l= 2) 2495 (l= 2) 2528 (l= 2) 2927 (l= 2) GP 80 (70) at 65 (l= 2) 125 (l= 2) 383 (l= 2) 747 (l= 2) 799 (l= 3) 838 (l= 2) 929 (l= 2) 1020 (l= 2) 1111 (l= 2) 1136 (l= 2) 1232 (l= 2) 1381 (l= 3) 1486 (l= 2) 1524 (l= 2) 1590 (l= 2) 1690 (l= 2) 1755 (l= 2) 1884 (l= 2) 1892 (l= 2) 1958 (l= 3) 1983 (l= 2) 2027 (l= 2) 2050 (l= 2) 2074 (l= 2) 2281 (l= 3) 2519 (l= 2) 2747 (l= 2) 2749 (l= 3) 2752 (l= 3) 2757 (l= 3) 2762 (l= 2) 2774 (l= 2) 2776 (l= 3) 2783 (l= 2) 2785 (l= 3) 2789 (l= 2) 2791 (l= 3) 2795 (l= 2) 2802 (l= 2) 2817 (l= 2) 2820 (l= 2) 2826 (l= 2) 2829 (l= 3) 2832 (l= 3) 2836 (l= 2) 2838 (l= 2) 2851 (l= 2) 2854 (l= 2) 2856 (l= 2) 2859 (l= 2) 2862 (l= 2) 2865 (l= 2) 2869 (l= 2) 2874 (l= 2) 2877 (l= 2) 2883 (l= 3) 2941 (l= 3) 2944 (l= 3) 2947 (l= 3) 2950 (l= 2) 2959 (l= 4) 2965 (l= 2) 2968 (l= 2) 2971 (l= 2) 2975 (l= 2) 2977 (l= 2) 2983 (l= 2) 2995 (l= 2) 3000 (l= 3) 3003 (l= 3) 3012 (l= 2) 3014 (l= 2) 3021 (l= 2) 3023 (l= 3) 3026 (l= 2) 3029 (l= 2) 3036 (l= 2) 3038 (l= 3) 3041 (l= 2) 3060 (l= 2) 5. Long amino acid multiplets (>= 5; Letter/Length/Position): P/5/2304 B. CHARGE ALPHABET. 1. Total number of charge multiplets: 63 (Expected range: 43-- 92) 18 +plets (f+: 10.2%), 45 -plets (f-: 12.0%) Total number of charge altplets: 59 (Critical number: 99) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 8 (6-10) 8 (11-20) 9 (>=21) 39 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 6) Location Period Element Copies Core Errors 885- 896 3 A.. 4 4 0 2291-2306 4 T..P 4 4 0 2301-2308 1 P 7 5 1 2536-2555 5 S.... 4 4 0 2618-2649 8 T....... 4 4 0 2747-2800 3 G.. 18 18 ! 0 2803-2898 3 G.. 31 16 ! 1 2941-2952 3 PGP 4 4 /0/0/1/ 2942-2992 3 G.. 17 17 ! 0 2992-3009 3 G.. 6 6 ! 0 2996-3031 9 P......G. 4 4 /0/././././././1/./ 3012-3044 3 G.. 11 11 ! 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10) Location Period Element Copies Core Errors 1- 64 8 i.....0. 8 8 /0/./././././2/./ 284- 310 3 i.0 8 6 /1/./1/ 299- 347 7 i....0. 7 7 /0/././././2/./ 308- 331 4 i00. 6 6 /0/2/2/./ 419- 474 8 i...0... 7 7 /0/./././2/./././ 457- 501 5 i0..* 8 6 /1/3/././3/ 740- 793 9 *..000... 6 6 /0/././2/2/2/./././ 1161-1208 8 i..0*..0 6 6 /0/././1/2/././1/ 1361-1396 6 i..00. 6 6 /0/././2/2/./ 1402-1449 8 *....0.. 6 6 /0/././././1/././ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 1875-2092 (3.) K( 217)K 1 of 156 0.0013 large maximal spacing 2685-2934 (4.) Y( 249)Y 1 of 113 0.0071 large maximal spacing 2746-2867 (4.) V( 121)V 2 of 271 0.0000 large 2. maximal spacing 2911-3064 (4.) V( 153)V 1 of 271 0.0001 large 1. maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem36 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- fn3 Fibronectin type III domain 984.5 2.6e-292 18 vwa von Willebrand factor type A domain 865.8 1.4e-256 4 TSPN Thrombospondin N-terminal -like domain 280.1 2.8e-80 1 Collagen Collagen triple helix repeat (20 copie 75.8 9e-19 4 MAP1B_neuraxin Neuraxin and MAP1B proteins 4.8 9.9 1 lectin_legB Legume lectins beta domain 4.8 5.5 1 Apidaecin Apidaecin 3.1 35 1 fibrinogen_C Fibrinogen beta and gamma chains, C-te 1.4 15 1 E1-E2_ATPase E1-E2 ATPase 0.9 30 1 Lipoate_B Lipoate-protein ligase B 0.7 7.8 1 Occludin Occludin/ELL family 0.4 25 1 Alpha_core Alphavirus core protein 0.3 43 1 Transp_cyt_pur Permeases for cytosine/purines, uracil -1.2 85 1 FHIPEP FHIPEP family -1.3 47 1 Flg_hook Flagellar hook-length control protein -1.5 92 1 thaumatin Thaumatin family -2.3 86 1 Peptidase_M29 Thermophilic metalloprotease (M29) -2.4 58 1 cadherin Cadherin domain -8.1 3.3 1 Big_1 Bacterial Ig-like domain (group 1) -31.6 42 1 Barwin Barwin family -42.7 54 1 XRCC1_N XRCC1 N terminal domain -54.6 99 1 Coagulin Coagulin -58.3 60 1 DnaJ_C DnaJ C terminal region -84.1 85 1 DUF88 Protein of unknown function DUF88 -116.0 20 1 BcrAD_BadFG BadF/BadG/BcrA/BcrD ATPase family -154.5 65 1 TGT Queuine tRNA-ribosyltransferase -172.3 14 1 DUF39 Domain of unknown function DUF39 -189.9 97 1 Lamp Lysosome-associated membrane glycoprot -207.4 44 1 DUF111 Protein of unknown function DUF111 -247.7 95 1 Biotin_synth Biotin synthase -249.1 49 1 LamB LamB porin -258.8 98 1 UDPG_MGDP_dh UDP-glucose/GDP-mannose dehydrogenase -284.7 63 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- fn3 1/18 25 103 .. 1 84 [] 67.5 2.7e-16 FHIPEP 1/1 96 106 .. 678 688 .] -1.3 47 DUF88 1/1 121 234 .. 1 156 [] -116.0 20 E1-E2_ATPase 1/1 242 265 .. 32 52 .. 0.9 30 vwa 1/4 140 312 .. 1 200 [] 246.0 5.3e-70 BcrAD_BadFG 1/1 166 364 .. 1 276 [] -154.5 65 Biotin_synth 1/1 123 403 .. 1 338 [] -249.1 49 fn3 2/18 334 416 .. 1 84 [] 66.2 6.8e-16 XRCC1_N 1/1 371 499 .. 1 155 [] -54.6 99 TGT 1/1 303 499 .. 1 246 [] -172.3 14 DUF111 1/1 211 510 .. 1 418 [] -247.7 95 Flg_hook 1/1 566 573 .. 403 410 .] -1.5 92 vwa 2/4 440 612 .. 1 200 [] 213.3 3.6e-60 fn3 3/18 632 710 .. 1 84 [] 59.1 9.7e-14 Coagulin 1/1 612 779 .. 1 174 [] -58.3 60 fn3 4/18 723 802 .. 1 84 [] 54.7 2.1e-12 Alpha_core 1/1 869 886 .. 124 141 .] 0.3 43 fn3 5/18 813 895 .. 1 84 [] 67.4 3.1e-16 DnaJ_C 1/1 828 907 .. 1 123 [] -84.1 85 Occludin 1/1 925 934 .. 1 13 [. 0.4 25 Transp_cyt_pur 1/1 917 942 .. 446 472 .] -1.2 85 fn3 6/18 904 987 .. 1 84 [] 42.6 8.9e-09 LamB 1/1 681 1016 .. 1 480 [] -258.8 98 thaumatin 1/1 1025 1032 .. 200 207 .] -2.3 86 fn3 7/18 995 1077 .. 1 84 [] 52.5 9.5e-12 fn3 8/18 1087 1166 .. 1 84 [] 44.4 2.6e-09 UDPG_MGDP_dh 1/1 1041 1370 .. 1 451 [] -284.7 63 vwa 3/4 1199 1371 .. 1 200 [] 225.3 8.6e-64 fn3 9/18 1385 1463 .. 1 84 [] 78.8 1.1e-19 cadherin 1/1 1389 1478 .. 1 107 [] -8.1 3.3 fn3 10/18 1474 1554 .. 1 84 [] 59.9 5.6e-14 Apidaecin 1/1 1548 1575 .. 1 28 [] 3.1 35 Big_1 1/1 1527 1612 .. 1 108 [] -31.6 42 fn3 11/18 1566 1643 .. 1 84 [] 60.4 3.9e-14 fn3 12/18 1654 1732 .. 1 84 [] 62.1 1.2e-14 lectin_legB 1/1 1770 1791 .. 173 196 .. 4.8 5.5 fn3 13/18 1752 1836 .. 1 84 [] 81.0 2.5e-20 Lipoate_B 1/1 1898 1920 .. 1 27 [. 0.7 7.8 fn3 14/18 1844 1926 .. 1 84 [] 54.5 2.3e-12 DUF39 1/1 1640 1933 .. 1 336 [] -189.9 97 fn3 15/18 1934 2016 .. 1 84 [] 56.9 4.3e-13 fn3 16/18 2024 2108 .. 1 84 [] 70.0 5e-17 fn3 17/18 2116 2193 .. 1 84 [] 72.3 1e-17 fn3 18/18 2204 2281 .. 1 84 [] 39.7 6.6e-08 Barwin 1/1 2172 2299 .. 1 128 [] -42.7 54 Lamp 1/1 2170 2456 .. 1 390 [] -207.4 44 vwa 4/4 2323 2496 .. 1 200 [] 202.4 6.8e-57 fibrinogen_C 1/1 2662 2672 .. 1 11 [. 1.4 15 MAP1B_neuraxin 1/1 2666 2672 .. 11 17 .] 4.8 9.9 TSPN 1/1 2520 2712 .. 1 218 [] 280.1 2.8e-80 Collagen 1/4 2747 2806 .. 1 60 [] 28.1 0.00011 Collagen 2/4 2824 2883 .. 1 60 [] 46.8 5e-10 Peptidase_M29 1/1 2920 2928 .. 1 9 [. -2.4 58 Collagen 3/4 2931 2989 .. 1 60 [] 22.7 0.00021 Collagen 4/4 2990 3047 .. 1 60 [] 2.1 0.0024 Alignments of top-scoring domains: fn3: domain 1 of 18, from 25 to 103: score 67.5, E = 2.7e-16 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn ++P++l ++ + +++++ sW p pi gY++t+ p+ +g+++ sp|Q99715| 25 VDPPSDLNFKIIDENTVHMSWAEP--VDPIVGYRITV-DPTTDGPTK 68 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+t++ +tt+ l+ L+P teY+v++++++++ +eS sp|Q99715| 69 EFTLSASTTETLLSELVPETEYVVTITSYDEV--EES 103 FHIPEP: domain 1 of 1, from 96 to 106: score -1.3, E = 47 *->SyqElpeeikI<-* Sy E++e +++ sp|Q99715| 96 SYDEVEESVPV 106 DUF88: domain 1 of 1, from 121 to 234: score -116.0, E = 20 *->lesikeyiliKkkkrkgeKrIGLLVDG.PNmLRKEFgidLedIreiL +e+ + i k +++ + LVDG+ R F L++I + sp|Q99715| 121 VEKKPGKTEIQKCSVSAWTDLVFLVDGsWSVGRNNFKYILDFIAALV 167 s..eyG..nIrVaKVlLNQYAsdkLIEAVvNQGFePiIVAGDvDVrmAVE s ++G+++ rV+ V QY sd E NQ sp|Q99715| 168 SafDIGeeKTRVGVV---QYSSDTRTEFNLNQ------------------ 196 AmELIYNpnIDvIALvTRDADFlPiInkAKEkGKETiVIGAEPGFStALq + RD + l++I k kG T sp|Q99715| 197 --------------YYQRD-ELLAAIKKIPYKGGNTMT-----------G 220 NsADyvIvLeeege<-* + Dy + ++ sp|Q99715| 221 DAIDYLVKNTFTES 234 E1-E2_ATPase: domain 1 of 1, from 242 to 265: score 0.9, E = 30 *->pptAtVlRdG...keeeipaeeLv<-* p+ A ++dG+++ e+eipa eL+ sp|Q99715| 242 PKVAIIITDGksqDEVEIPARELR 265 vwa: domain 1 of 4, from 140 to 312: score 246.0, E = 5.3e-70 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg D+vFL+DGS+S+g+ nF ++dF+ +v+++d+g+ k +rVg sp|Q99715| 140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGE-EK-----TRVG 180 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq +vQYS++ rtE f+ln+y +de+l+a++ki+ y gg t+Tg+A++ sp|Q99715| 181 VVQYSSDTRTE--FNLNQYYQRDELLAAIKKIP----YKGGNTMTGDAID 224 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln y+v+n fte++G+R g pkv++++TDG+sqd ++ +++ sp|Q99715| 225 YLVKNTFTESAGAR------VGFPKVAIIITDGKSQDE------VEIPAR 262 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl el+ gvevf++G++ ad ++eL++IAs p+ +hvf+v +f+a+ ++ sp|Q99715| 263 ELRN-VGVEVFSLGIKAAD---AKELKQIASTPSlNHVFNVANFDAIVDI 308 qelL<-* q+++ sp|Q99715| 309 QNEI 312 BcrAD_BadFG: domain 1 of 1, from 166 to 364: score -154.5, E = 65 *->itlGIDvGStatKcvlleDgNRPPGDkeeviatkverieatdpvksd ++ ++D+G t ++g ++ ++ +++ + + sp|Q99715| 166 LVSAFDIGEEKT-----RVG----------VVQY--SSDTRTEFNL- 194 aeealnealeeaglkleevdyvvaTGYGRntlglvffa.DkivsEischa +++ +e+++++ ++y ++ nt +++ a D v + + sp|Q99715| 195 -NQYYQR--DELLAAIKKIPYKGG-----NT--MTGDAiDYLVKNTFTES 234 lGAsylaPkQkgartViDIGGQDtKVIqiddrGkVvdFqMNDkCAAGtGR GA+ +P ++ +I+i d Gk d sp|Q99715| 235 AGARVGFP---KV------------AIIITD-GKSQDE------------ 256 FLEviArrL...gieveElgslaskGatkpakInStCa...VFAEsEvIs +E+ Ar+L++ g+ev lg a++ a+ I St ++VF v + sp|Q99715| 257 -VEIPARELrnvGVEVFSLGIKAAD-AKELKQIASTPSlnhVF---NVAN 301 llsrGikaEdIlaGiheSvAeRvaelikrePnLvsiekpvvltGGvAkne + + + +I+ ++ v e++ el++ e +e+p n sp|Q99715| 302 FDAIVDIQNEIISQVCSGVDEQLGELVSGE---EVVEPP--------SNL 340 glvkaLedllgekkgEvkinvpPd.sqyaGAiGAALl<-* + + +++++ +n +P +s+++G + sp|Q99715| 341 IAMEVSSKYVK-------LNWNPSpSPVTG------Y 364 Biotin_synth: domain 1 of 1, from 123 to 403: score -249.1, E = 49 *->erwsrdealelyetPndedlldLlhlAqqirrkffgpkeVql.ctIi ++ + e+++ + + dl +L+++ ++ r +f+ + ++ ++ sp|Q99715| 123 KKPGKTEIQKCSVSA-WTDLVFLVDGSWSVGRNNFK-YILDFiAALV 167 si......Kt..GgCsEDCkYCsQSsryktpveAPiypllkvdevlekAK s+ + +++Kt+ G + Ss + t+ + + de+l++ K sp|Q99715| 168 SAfdigeeKTrvGVVQ-------YSSDTRTEFN--LNQYYQRDELLAAIK 208 i........................irakeaGatrrFCivAagRgpYHsp + + +++++ +++ + +++ ++ ++++ G i + + sp|Q99715| 209 KipykggntmtgdaidylvkntfteSAGARVGFP-KVAI------I--IT 249 kdrefeqileaVkeIkneemgLEVCctLGAmLteEQAkqLKe.AGvdrYN +++ ++++ +e +n +g EV +LG ++ Ak+LK A N sp|Q99715| 250 DGKSQDEVEIPARELRN--VGVEV-FSLG--IKAADAKELKQiASTPSLN 294 H.....NLeTSrEeyykniiTThTYeDRlnTlknVrkaGievCsGGIiGM H + N +++ +ii+ ++++ V ++ + sp|Q99715| 295 HvfnvaNFDAIV-DIQNEIIS--------QVCSGVDEQ-----------L 324 GE..seeDRiglaltLaeLdthPeSVPiNfLvpieGTPladkleneqekl GE s e++++ L ++ + S + kl sp|Q99715| 325 GElvSGEEVVEPPSNLIAME--VSS--------------------KYVKL 352 dvleaLktIAtaRiimPkaeiRLagGRetnleelQalcflAGgaNsifVG ++ + + +++i+ +++ + ++ +G sp|Q99715| 353 NWNPSPSPVTGYKVILTP-----------MTAGSRQHALSVG-------- 383 dKLtTtpVKGVGNPKgreleeDlkmladLGLe<-* + tT+ r+l +D + ++ sp|Q99715| 384 PQTTTLS--------VRDLSADTE----YQIS 403 fn3: domain 2 of 18, from 334 to 416: score 66.2, E = 6.8e-16 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w +P+nl +++v+s ++ l+W+p+ +p+tgY+v + p g++ sp|Q99715| 334 VEPPSNLIAMEVSSKYVKLNWNPS--PSPVTGYKVIL-TPMTAGSrQ 377 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* + l+v ++tt+ +++L ++teY+++V A +g +++++ sp|Q99715| 378 HALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTsSEPI 416 XRCC1_N: domain 1 of 1, from 371 to 499: score -54.6, E = 99 *->mPEisLRHvvscssqDstHCAEnLLKADt.YRKWRAAkAGEKtisvv m s H s q t L ADt+Y A G + sp|Q99715| 371 MTAGSRQHALSVGPQTTTLSVRD-LSADTeYQISVSAMKGMTSSEPI 416 LqLEKEEqiHsvDiGnDGsAFvEvLvgssAGGagatAgEqDYEvLLvtss +EK + v v s G + D v Lv s sp|Q99715| 417 SIMEKTQPM-------------KVQVECSRGV----DIKADI-VFLVDGS 448 FmsPsEsRsGsnPnRvRmFGPDKLvRAAAEKRWDRvKivCsqPYsKDsPy + s +n +vR F + Lv+ E Rv+i q Ys+D sp|Q99715| 449 Y-----SIGIANFVKVRAF-LEVLVK-SFEISPNRVQISLVQ-YSRDPHT 490 GLsFvrFHs<-* +F sp|Q99715| 491 EFTLKKFTK 499 TGT: domain 1 of 1, from 303 to 499: score -172.3, E = 14 *->ekiiEIQeaLGSDIiMaLDectpypadrdraek......alerTlrW i++IQ+++ S + De + +++++++ +++++ a+e++ + sp|Q99715| 303 DAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEppsnliAMEVSSKY 349 AersieahkrrgegedqaLFgIvQGstylDL......RkksAkelaeldF + + + ++ g++ L + + Gs L+ ++++ + s+++l + sp|Q99715| 350 VKLNWNPSPSPVTGYKVILTPMTAGSRQHALsvgpqtTTLSVRDLSADT- 398 fdGYAIGGlVPLsvGEpkddmlrvlehscplLPkdKPRYLMGVGhPedla +++ + + G + + e + p+ + + sp|Q99715| 399 EYQISVSA----MKGMTSSEPISIMEKTQPM---------------KVQV 429 lAvelGvDmFDCVaPtRyARnGtllTsreGtiklknakykeDfrPldeeC ++ ++GvD+ + + ++ +G ++++ a++ + sp|Q99715| 430 EC-SRGVDI----------KADIVFLV-DGSYSIGIANFVK--------- 458 dCyvCknYSRaYLrHLiraeEllGarLateHNLhfyleLmekiRqAIleg Ra L L ++ E++ r +++ L++ R+ e sp|Q99715| 459 --------VRAFLEVLVKSFEISPNR--------VQISLVQYSRDPHTE- 491 sllefveeflr<-* +++++ f + sp|Q99715| 492 --FTLKK-FTK 499 DUF111: domain 1 of 1, from 211 to 510: score -247.7, E = 95 *->vaifDafaGiSGDMiLgALidlGvkeEeiaEvlkalgsltdevdvev + G G+ + g id vk ++ + +g + v + + sp|Q99715| 211 -----PYKG--GNTMTGDAIDYLVKNTFTESAGARVG--FPKVAIII 248 skVkkaGlrAtkVevkekedHHSHGPlrvGarefkeilevIkssnLeeev k + Ve+ + e +vG f +Ik ++ e sp|Q99715| 249 TDGKSQ----DEVEIPAREL------RNVGVEVF---SLGIKAADAKE-- 283 kktAlAvFhllAeAEgkVHGvdleeVHFHEVGAaDAIaDvVGavlalekL ++++A + H +F V + DAI+D+ sp|Q99715| 284 -------LKQIASTPSLNH-------VF-NVANFDAIVDI---------- 308 nlkddelyslPvavGsGfVRaaHGllPVPaPAvlEILkgrqlpvvvsdle +e s v sG + E+ g vv +++ sp|Q99715| 309 ---QNEIIS---QVCSGVDEQ-----------LGELVSG--EEVVEPPSN 339 gELvTPTGaAIlvtlakklkpfFPPl...eikeVsYGAGskDlelpNiLR L+ + v l+ p P ++ ++ + AGs+ l sp|Q99715| 340 --LIAMEVSSKYVKLNWNPSPS--PVtgyKVILTPMTAGSRQHALS---- 381 lvrGeeivPHNHPLSSEAPFGqhdtIvlLETnVDDlSg....EvlGYllE G + +l +V DlS++++ ++ +++ sp|Q99715| 382 --VGPQTT------------------TL---SVRDLSAdteyQISVSAMK 408 kLLerDgvlDvvliPv.yMKKnRPaillrVIcaeeeaEev.lellfRETG P+++M K P ++V ++ + +++ ++f G sp|Q99715| 409 GMTSS--------EPIsIMEKTQP---MKVQVECSRGVDIkADIVFLVDG 447 SLGVRIfpqer.raverrreevvkveikgrikvrlkVgrvkkeivnakPE S + I ++ ra ++ v+ ei+ r+ +v+ + + E sp|Q99715| 448 SYSIGIANFVKvRAFLE--VLVKSFEISPN---RVQISLVQYS-RDPHTE 491 FeDiaaIAketGIPlkdvykeivkkisdlkn<-* F ++ k d+++ i+ + sp|Q99715| 492 FT-LKKFTK-----VEDIIEAIN------TF 510 Flg_hook: domain 1 of 1, from 566 to 573: score -1.5, E = 92 *->NsaVDIFA<-* Ns V IFA sp|Q99715| 566 NSDVEIFA 573 vwa: domain 2 of 4, from 440 to 612: score 213.3, E = 3.6e-60 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg DivFL+DGS Sig +nF +v++F+e +v +++++p + v+++ sp|Q99715| 440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISP-NR-----VQIS 480 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq lvQYS +++tE f+l+ +++ +++++a++ ++ y gg tnTg+A+ sp|Q99715| 481 LVQYSRDPHTE--FTLKKFTKVEDIIEAINTFP----YRGGSTNTGKAMT 524 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln yv++++f ++ GsR +++pkv++++TDG+s+d +rd++ sp|Q99715| 525 YVREKIFVPSKGSR------SNVPKVMILITDGKSSDA------FRDPAI 562 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl +l+ +ve+fa+Gv++a++ +eL+ IAs+p + hvf v+df+a + + sp|Q99715| 563 KLRN-SDVEIFAVGVKDAVR---SELEAIASPPAeTHVFTVEDFDAFQRI 608 qelL<-* +L sp|Q99715| 609 SFEL 612 fn3: domain 3 of 18, from 632 to 710: score 59.1, E = 9.7e-14 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w +P++l++++vts+ + +Wsp+ + + +Y +ty +++ g+++ sp|Q99715| 632 YVPPKDLSFSEVTSYGFKTNWSPA--GENVFSYHITY-KEA-AGDdE 674 neltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* +++ p ++ts++l+ LkP t Y v V+A ++ +S sp|Q99715| 675 VTVVEPASSTSVVLSSLKPETLYLVNVTAEYED--GFS 710 Coagulin: domain 1 of 1, from 612 to 779: score -58.3, E = 60 *->tnvPiCLCeEPtlLGRkvivstEiKDKiEkAveAvadkdevsGRGFs iCL E l k+ KD v k + s G sp|Q99715| 612 LTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKTNWSPAGEN 658 iFsgHPvFkE.CGKYECRtvrsEdsRCYnFfPFtHFklECPvsvrdCEPv +Fs H +kE G E v + s + l + + E sp|Q99715| 659 VFSYHITYKEaAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDG 708 FGYtvanElRv..ivqAPkAGFRqCvWqHKCRayGsnsCqrtGRCtqqRs F a+E +++ AP+ v ++ q GR R sp|Q99715| 709 FSIPLAGEETTeeVKGAPRN---LKVTDETTDSFKITWTQAPGRVLRCRI 755 vvRLvtYdLEKdvFlCEsvRtCCGCPCRsf<-* + R v ++v + R R + sp|Q99715| 756 IYRPVAGGESREVTTPPNQRR------RTL 779 fn3: domain 4 of 18, from 723 to 802: score 54.7, E = 2.1e-12 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn +aP+nl+vtd t++s+ ++W+ + g + + ++ yR p+ gge + sp|Q99715| 723 KGAPRNLKVTDETTDSFKITWTQA--PGRVLRCRIIYR-PVAGGESR 766 eltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* e+t p+++++ tl++L P+t+Yev+V+ + +Gp + sp|Q99715| 767 EVTTPPNQRRRTLENLIPDTKYEVSVI--PEYFsGPGT 802 Alpha_core: domain 1 of 1, from 869 to 886: score 0.3, E = 43 *->vlGGanEgartALsvvtW<-* vl G Eg++ ALsv sp|Q99715| 869 VLQGLKEGTQYALSVTAL 886 fn3: domain 5 of 18, from 813 to 895: score 67.4, E = 3.1e-16 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew ++ P++l+v+d t++++ lsWs + g++ Y vty p+ gge+ sp|Q99715| 813 VrGNPRDLRVSDPTTSTMKLSWSGA--PGKVKQYLVTY-TPVAGGET 856 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* +e+tv g+tt+ +l+gLk+gt+Y ++V+A+ + G G + sp|Q99715| 857 QEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGaGDAL 895 DnaJ_C: domain 1 of 1, from 828 to 907: score -84.1, E = 85 *->qrlRlaGeGeaGenGGppGDLYVvirVkeHkiFeRdGdDLycevpis +++ l+ G +G+ Vk + ++ sp|Q99715| 828 STMKLSWSGAPGK-------------VKQ--------------YLVT 847 ftqAaLGgeveVPTLdGkvkLkIPaGTQsGtvfrLkGKGvpklnrgsgrG +t a G + eV T+ G+++ ++ +G + Gt+ L v l sg G sp|Q99715| 848 YTPVAGGETQEV-TVRGDTTNTVLQGLKEGTQYALS---VTAL-YASGAG 892 DLyVhVkVetPkkLseeQkeLLeela<-* D L e L e sp|Q99715| 893 D-----------ALFGEGTTLEERGS 907 Occludin: domain 1 of 1, from 925 to 934: score 0.4, E = 25 *->FYkWtSPPGvvRi<-* WtS PG vR+ sp|Q99715| 925 ---WTSAPGMVRG 934 Transp_cyt_pur: domain 1 of 1, from 917 to 942: score -1.2, E = 85 *->iagsvgepdikvvpgakplydlsyfVG<-* +++s g+ ++ +pg++ +y++s+ sp|Q99715| 917 TDTSIGA-YWTSAPGMVRGYRVSWKSL 942 fn3: domain 6 of 18, from 904 to 987: score 42.6, E = 8.9e-09 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge. ++ P++l+ +d+t+ts+ W+ + g + gY+v++ ++ + + sp|Q99715| 904 ErGSPQDLVTKDITDTSIGAYWTSA--PGMVRGYRVSW-KSLYDDVd 947 wneltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* + e+++p ++ + ++++L+P t+Y ++V A G+G++ sp|Q99715| 948 TGEKNLPEDAIHTMIENLQPETKYRISVFATYSSGeGEPL 987 LamB: domain 1 of 1, from 681 to 1016: score -258.8, E = 98 *->FhGYaRSGvGvsgsGseqqsfaatgkqGETGGsiGRLGNEadtYaEl a S v +s + + + + E++ + + sp|Q99715| 681 ----ASSTSVVLSSLKPETLYLVNVTA------------EYEDGFSI 711 qLgqevwnengktfyfdvMVAdgvGqngsnDWeasesk.fqCtsanGtaL L+ e ++e k V d + W+ + + +C sp|Q99715| 712 PLAGEETTEEVKGAPRNLKVTDETTDSFKITWTQAPGRvLRCRIIYRPVA 761 DGCendedgdfnvrQfnvelVqaKgLlgfapeatlWAGKRfyqrNhDiHm G + ++ +n r + +L++ + + ++ sp|Q99715| 762 GGESREVTTPPNQRRRT-----LENLIP-------------DTK-YEVSV 792 iDfyywnisGtAGaGIEnvkvGpGKlSlAlvrndedd..gsytnsddsik i y+ sG+ G + +A++ + + +++++ sd + sp|Q99715| 793 IPEYF---SGP----------GTPLTGNAATEE-VRGnpRDLRVSDPT-- 826 dYndeinnnvfDlRLagiplwddgsLevsvdYarANeSadekDdsnlnGy ++++ l g p + L v Y + + sp|Q99715| 827 -------TSTMKLSWSGAPGKVKQYL---VTYTPVA------GGETQE-V 859 gaakDgwmftaeltQgllnGGFNkfvlqygtDagtasnfGhfwGaSagtk + +D+ ++ +Qgl G qy+ + +++++ ++G sp|Q99715| 860 TVRGDTTNT---VLQGLKEG------TQYAL--SVTALYASGAG---DAL 895 yngsgaeakngadgfRil..shGviplGdnWevahalv..YqrG.nDiys + g g+ ++ ++ +++++ +++G W a+ +v++Y ++ + +y sp|Q99715| 896 F-GEGTTLEERGSPQDLVtkDITDTSIGAYWTSAPGMVrgYRVSwKSLY- 943 qfGydtadkdtYtGksfdiewytvvvRPmYkWdqimsTafEgGYfnvknk dt+ k+ + T+ E ++ sp|Q99715| 944 -DDVDTGEKN--------------------LPEDAIHTMIE------NLQ 966 stggnkredvgqSFYKlTLAqtwkvGs....SfwafSRPEIRvYATYlkw ++ Y + ++ t++ G +++ ++a +T l + sp|Q99715| 967 PET--------K--YRISVFATYSSGEgeplTGDA---------TTELSQ 997 dekegkgnafdalgrtkdasnSGistdsgndDdewtfGvQmEiWW<-* d k +k ++ + + m + W sp|Q99715| 998 DSKTLK--VDE-----------------ETENT-------MRVTW 1016 thaumatin: domain 1 of 1, from 1025 to 1032: score -2.3, E = 86 *->NYrVvFCP<-* NYrVv P sp|Q99715| 1025 NYRVVYRP 1032 fn3: domain 7 of 18, from 995 to 1077: score 52.5, E = 9.5e-12 *->PsaP.tnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew +s+ ++ l+v + t+++++++W+p+ g++ +Y+v yR p + g+ sp|Q99715| 995 LSQDsKTLKVDEETENTMRVTWKPA--PGKVVNYRVVYR-PHGRGKQ 1038 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* vp+t ts +l+ L+P t+Y ++V + G+G+ sp|Q99715| 1039 MVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGeGKLR 1077 fn3: domain 8 of 18, from 1087 to 1166: score 44.4, E = 2.6e-09 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w P+nl+ +d t +s++++W+p+ g++ gY+vt + p+++ + sp|Q99715| 1087 FKSPRNLKTSDPTMSSFRVTWEPA--PGEVKGYKVTFH-PTGDDRrL 1130 neltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* el v + ++++l+ L++gt+Y+v V +g eS sp|Q99715| 1131 GELVVGPYDNTVVLEELRAGTTYKVNVFGMFDG--GES 1166 UDPG_MGDP_dh: domain 1 of 1, from 1041 to 1370: score -284.7, E = 63 *->mkIaViGlGYVGGPLvtavclaeiGhdVigvDInesKidklnnGkiP +k+ P+vt+++l+ ++ + + +DI ++P sp|Q99715| 1041 AKV---------PPTVTSTVLKRLQPQTT-YDI----------TVLP 1067 IyEPGLeeLvkknvesgrLrfttdykeaikeaDvifIaVgTPskkdGnGk Iy G+ +L ++ + +++k+ + + k+ sp|Q99715| 1068 IYKMGEGKLRQGSG-----TTASRFKS------------PRNLKTS---- 1096 eksaDLtyvesAaetIgpvlkkGdrdkViVekStvPggTeeevrpileka ++ ++ + + +p kG +kV + P g ++ + ++ sp|Q99715| 1097 --DPTMSS-FRVTWEPAPGEVKG--YKV----TFHPTGDDRRLGELVV-- 1135 sgltFpqqvlkegvdFdVasNPEFLrEGnAvyDllnPdRvviGveetpes + ++d V+ E Lr G+ + +++ + +G+ +p + sp|Q99715| 1136 G-----------PYDNTVVL--EELRAGTTYKVNVFG-MFDGGE-SSPLV 1170 aealreLYkpfiekgWlprgpivvtdlk.....tAEliKya........a + + +L + + pi+ ++ + ++ +A + +++++ + ++a sp|Q99715| 1171 GQEMTTLSDTTV-------MPILSSGMEcltraEADIVLLVdgswsigrA 1213 NaFLAtkIsFiNElanICeavGaDvseVaeaaGl...DpRIglrkflnPG N F+ ++ sFi + I e + + V a+ + ++DpR ++ + + sp|Q99715| 1214 N-FRTVR-SFI---SRIVEVFDIGPKRVQIALAQysgDPRTE-WQLN-AH 1256 pGfGGsCfpkDprALfiiaknlgvgehaeLieavrevNdsqpsyvvekli +D + L++ +nl++ +L +++ +N +++ + + + sp|Q99715| 1257 ---------RDKKSLLQAVANLPYKGGNTL--TGMALNFIRQQNF--RTQ 1293 kelgkagktlkGkkiavLGLaFKpntDDlRESpaieiiemLleeglaeVk + + + +ki+vL i ++++ sp|Q99715| 1294 AGM-----RPRARKIGVL-------------------ITDGKSQD----- 1314 ayDPkvepnrehlaqkveyesdavlaeavkraDavviltewdefkeldks D ++ + ++++++e a+ ++ + + +i+t+ d+ +++ + sp|Q99715| 1315 --DVEAPS---KKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVA 1359 kianrmkkpvIiDgRnilDl<-* ++ + +i+D sp|Q99715| 1360 DFES---------LSRIVDD 1370 vwa: domain 3 of 4, from 1199 to 1371: score 225.3, E = 8.6e-64 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg Div+L+DGS+Sig++nF +v+ F++r+ve +d+gp + v+++ sp|Q99715| 1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGP-KR-----VQIA 1239 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq l+QYS+++rtE + ln++++k+++lqa+ +++ y gg t Tg AL+ sp|Q99715| 1240 LAQYSGDPRTE--WQLNAHRDKKSLLQAVANLP----YKGGNTLTGMALN 1283 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln + + + f+ +G+R ++a+k+ V++TDG+sqdd ++++++ sp|Q99715| 1284 FIRQQNFRTQAGMR------PRARKIGVLITDGKSQDD------VEAPSK 1321 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl +lk+ +gve faiG++nad + eL+ IA++pd+ h ++v dfe+L+ + sp|Q99715| 1322 KLKD-EGVELFAIGIKNAD---EVELKMIATDPDdTHDYNVADFESLSRI 1367 qelL<-* ++ L sp|Q99715| 1368 VDDL 1371 fn3: domain 9 of 18, from 1385 to 1463: score 78.8, E = 1.1e-19 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + aP+nl+++++t s+++sW+pp + +++Y+v+y +p++gg+ + sp|Q99715| 1385 LEAPSNLVISERTHRSFRVSWTPP--SDSVDRYKVEY-YPVSGGKRQ 1428 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+ v++ +ts +l++LkP teY+v V++v ++ ++S sp|Q99715| 1429 EFYVSRMETSTVLKDLKPETEYVVNVYSVVED--EYS 1463 cadherin: domain 1 of 1, from 1389 to 1478: score -8.1, E = 3.3 *->ysasvpEnapvGtevltvtAtDaDdplgpNgrirYsilggnpggwFr + ++E + +v t + d + + ++ Y ++g+ ++ F sp|Q99715| 1389 SNLVISER---THRSFRVSWTPPSD-SVDRYKVEYYPVSGGKRQEFY 1431 IdpdtGdnegiisttkpLDREeifngeYeLtveAtDadplsaaggsppls + + + ++ k L E+ ++++ ++ D+ + +p + sp|Q99715| 1432 VSRMETS-----TVLKDLKPETE--YVVNVYSVVEDE------YSEPLKG 1468 gtatvtitVl<-* ++ t+ + V sp|Q99715| 1469 TEKTLPVPVV 1478 fn3: domain 10 of 18, from 1474 to 1554: score 59.9, E = 5.6e-14 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.. P ++ l + dv++t+++++W+p g tgY ++y +p+ + e++ sp|Q99715| 1474 PVPVVSLNIYDVGPTTMHVQWQPV---GGATGYILSY-KPVKDTEpt 1516 .wneltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* +++e+++ +t + ++lt+L+P+teY v+VqAv + S sp|Q99715| 1517 rPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHD--LTS 1554 Apidaecin: domain 1 of 1, from 1548 to 1575: score 3.1, E = 35 *->lrakPEAEPGnnRPvYiPqPRPPHPRlr<-* + EP R v +P PRP +lr sp|Q99715| 1548 VLHDLTSEPVTVREVTLPLPRPQDLKLR 1575 Big_1: domain 1 of 1, from 1527 to 1612: score -31.6, E = 42 *->vivdqltAsitsivaddttavAngndaiTlTAtVkDanGnPvagqeV v+ ql t v +n a+T++A + D Pv eV sp|Q99715| 1527 VNDMQL----TDLV-------PNTEYAVTVQAVLHDLTSEPVTVREV 1562 tFtvpsdvsssgtLsnkntakatTDanGvAtvtLTst..kaGtytVtAsl t p ++ +L ++ T++ v + ++k+++y+V ++ sp|Q99715| 1563 TL--PLPRPQDLKLRDV------THS--TMNVFWEPVpgKVRKYIVRYKT 1602 enggatsvdaktV<-* + + v++++V sp|Q99715| 1603 P-EED--VKEVEV 1612 fn3: domain 11 of 18, from 1566 to 1643: score 60.4, E = 3.9e-14 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn ++ P++l+ +dvt +++ + W+p g++ +Y v+y ++ +++ + sp|Q99715| 1566 LPRPQDLKLRDVTHSTMNVFWEPV--PGKVRKYIVRY-KTP-EEDVK 1608 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+ v++++ts l++L t Y+v+V Av ++ eS sp|Q99715| 1609 EVEVDRSETSTSLKDLFSQTLYTVSVSAVHDE--GES 1643 fn3: domain 12 of 18, from 1654 to 1732: score 62.1, E = 1.2e-14 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew P +aPtnl++t+vts ++ +W++ ++++ Y++t+ p ++ ++ sp|Q99715| 1654 PvPAPTNLKITEVTSEGFRGTWDHG--ASDVSLYRITW-GPFGSSDK 1697 neltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e ++ g++++ ++++L+P+t Yev+++A+ ++ eS sp|Q99715| 1698 METILNGDENTLVFENLNPNTIYEVSITAIYAD---ES 1732 lectin_legB: domain 1 of 1, from 1770 to 1791: score 4.8, E = 5.5 *->TvsWdlvngkGevAtVlItYdast<-* Tv+Wd ++ G+v + +ItY+ st sp|Q99715| 1770 TVKWDPAS--GRVQKYRITYQPST 1791 fn3: domain 13 of 18, from 1752 to 1836: score 81.0, E = 2.5e-20 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge. P+s P+nl+v + ts+slt+ W+p+ +g +++Y++ty qp+ g+++ sp|Q99715| 1752 PkSGPRNLQVYNATSNSLTVKWDPA--SGRVQKYRITY-QPSTGEGn 1795 wneltvpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* +++ t+ g ++s++l+ LkP+t+Y+++V ++ +G++G+ + sp|Q99715| 1796 EQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGegGRMT 1836 Lipoate_B: domain 1 of 1, from 1898 to 1920: score 0.7, E = 7.8 *->dkvnkirarqpkdtillleHpPVYTlG<-* + i +dt++++ + PVYT G sp|Q99715| 1898 A----ILRNLQPDTSYTVTVVPVYTEG 1920 fn3: domain 14 of 18, from 1844 to 1926: score 54.5, E = 2.3e-12 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + ++nl+v d ++++l ++W+++ g Y++ y p+ g++ sp|Q99715| 1844 LNTVRNLRVYDPSTSTLNVRWDHA--EGNPRQYKLFY-APA-AGGPE 1886 elt.vpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* el ++pg+t+ ++l++L+P+t Y+v+V+ v ++G++G+ S sp|Q99715| 1887 ELVpIPGNTNYAILRNLQPDTSYTVTVVPVYTEGdgGRTS 1926 DUF39: domain 1 of 1, from 1640 to 1933: score -189.9, E = 97 *->eGeavVvTAdEmkkiVrelGeekaAdeVDVVTT....gTfGaMsSsg eGe vTA E ++ V ++ e VT+++ +gT S sp|Q99715| 1640 EGESPPVTAQETTRPVPA-PTNLKITE---VTSegfrGTWDHGASDV 1682 AvinlghsdPpsfiKmervyLNgieAYaGiCPNEfLgaVDay......iG + +s Kme + LNg e E+L y+ + + i+ sp|Q99715| 1683 SLYRITWGPFGSSDKMETI-LNGDENTLV---FENLNPNTIYevsitaIY 1728 aTqpsedpdvgdnPneGdGvleyGGghvieDLvkGkeVElrAtsyGTDCY a se++d + G+e l sp|Q99715| 1729 A-DESESDD-----------------------LIGSERTLPILT------ 1748 prkeveteItlDdinqAvmvnpRnaYqnYtAavNsseel.......lyTy qA +pRn+ q Y+A Ns + ++ ++++ y sp|Q99715| 1749 --------------TQAPKSGPRNL-QVYNATSNSLTVKwdpasgrVQKY 1783 mgtLlpPenyGnvsfSGcG......elnPLeNDviPetksfetigiGTri t +p Gn G+++++ l L+ D P ti + sp|Q99715| 1784 RITYQPSTGEGNEQTTTIGgrqnsvVLQKLKPD-TP-----YTITV-SSL 1826 fLnGAegyilGEGTqhsplqkRlp.hvPippkgtLMlkGDlKdMdpeYir G g G G + pl ++ v p tL + D + +p+ + sp|Q99715| 1827 YPDGEGGRMTGRG-KTKPLNTVRNlRVYDPSTSTLNVRWDHAEGNPRQYK 1875 GAflpryG..peLyvgigvPIP.Vlnekvaeala..vlDeDIelpvvDfg + p++G++ eL vPIP+ +n ++ +l ++ + ++pv+ g sp|Q99715| 1876 LFYAPAAGgpEEL-----VPIPgNTNYAILRNLQpdTSYTVTVVPVYTEG 1920 vprRdRPvikevtYeelw<-* + +R + t l sp|Q99715| 1921 DGGR-----TSDTGRTLM 1933 fn3: domain 15 of 18, from 1934 to 2016: score 56.9, E = 4.3e-13 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w + +n++v + t++ l ++W+p+ gp+ Y+v y +p++g + sp|Q99715| 1934 RGLARNVQVYNPTPNRLGVRWDPA--PGPVLQYRVVY-SPVDGTRpS 1977 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* ++ vpg+t+ ++l+ L P+t Y+v ++A+ +G+G +S sp|Q99715| 1978 ESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGeGNPS 2016 fn3: domain 16 of 18, from 2024 to 2108: score 70.0, E = 5e-17 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew P+s P+nl+v + t++sl++ W+++ +gp++ Y++ y +p+ g + sp|Q99715| 2024 PrSGPRNLRVFGETTNSLSVAWDHA--DGPVQQYRIIY-SPTVGDPI 2067 nelt.vpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* e+t+vpg ++++l+ L+P+t+Y+++V+Av ++G++G+ + sp|Q99715| 2068 DEYTtVPGRRNNVILQPLQPDTPYKITVIAVYEDGdgGHLT 2108 fn3: domain 17 of 18, from 2116 to 2193: score 72.3, E = 1e-17 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + +P+n++++d t +++sW+p+ +p+ gY++ y +p++++e+ sp|Q99715| 2116 LLPPQNIHISDEWYTRFRVSWDPS--PSPVLGYKIVY-KPVGSNEPM 2159 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e v t sytl++L+P t+Y v V+A+ + S sp|Q99715| 2160 EAFVGEMT-SYTLHNLNPSTTYDVNVYAQYDS--GLS 2193 fn3: domain 18 of 18, from 2204 to 2281: score 39.7, E = 6.6e-08 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn +t+l+ ++++++ + Wsp + t+Y++++ +p++g + sp|Q99715| 2204 YLNVTDLKTYQIGWDTFCVKWSPH---RAATSYRLKL-SPADGTRGQ 2246 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+tv g++ts+ +tgL P+t+Y v+V ++ + e sp|Q99715| 2247 EITVRGSETSHCFTGLSPDTDYGVTVFVQTPN--LEG 2281 Barwin: domain 1 of 1, from 2172 to 2299: score -42.7, E = 54 *->sasnVraTYHlYnPqqnnwdlnaaSayCatrfWdadkplswrsKY.. n TY + +q l+ + + t + ++ Y+ sp|Q99715| 2172 HNLNPSTTYDVNVYAQYDSGLSVPLTDQGT---TLYLNVTDLKTYqi 2215 GWtAFCgpagprgqssCGkC.lrvtNtatgasvtvRiVDqggCsnGkdqt GW FC p+ ++ + +l+ g ++tvR s t sp|Q99715| 2216 GWDTFCVKWSPHRAATSYRLkLSPADGTRGQEITVRGSET---SHC--FT 2260 GL..DlDy.vnvFnqlDt.dGaGyq.qGhLnVnyqfVdC<-* GL++D Dy+v+vF q +G G + h +V + sp|Q99715| 2261 GLspDTDYgVTVFVQTPNlEGPGVSvKEHTTVKPTEAPT 2299 Lamp: domain 1 of 1, from 2170 to 2456: score -207.4, E = 44 *->AnlsvnFsvkYeTtlnkqn.ktfdlpskatVthngSsCGsdnvsdPk n s++Y+ + +q++++ + p +++ + + nv+ + sp|Q99715| 2170 TLHNLNPSTTYDVNVYAQYdSGLSVPLTDQG----TTL-YLNVT-DL 2210 laiqFG.sGhswtlnFTKnatrYsvesitfsYNlsDtthFPnAtskgivT + q G +++ +++at Y+ + ls + + t++ T sp|Q99715| 2211 KTYQIGwDTFCVKWSPHRAATSYRLK-------LSPA----DGTRGQEIT 2249 VdsksdikadlnkpfrCtssttvemknVtqtlwdatvq.AfvqNgtfSkn V ++++ C ++ + ++ + vq++ + + +S + sp|Q99715| 2250 VR--------GSETSHCFTGLSPDTDYGV----TVFVQtPNLEGPGVSVK 2287 etrCeeDkpspTvaPtihttvptastTitPsPsPtPvpkenPtvGnYsvt e +++++ pT P++++ P+P+ +P+ + sp|Q99715| 2288 EHTTVKPTEAPTE-PPTPP----------PPPTIPPARD--------VCK 2318 ngNgTCLLAsmgLQLNItyekkdnktvtrilNinPntTkasGsCestsat ++ + + ++ + I + nk v +i+N T + G e+ a sp|Q99715| 2319 GAKADIVFLTDA-SWSIGDDN-FNKVVKFIFN-----T-VGGFDEISPAG 2360 LkLnfsqityLdFgFavnassspvvkFyLqeVnvnmtlpdAngstFsa.s +++f q++ v+ s e +n+++ ++ ++ a + sp|Q99715| 2361 IQVSFVQYSD-----EVK---S--------EFKLNTYND--KALALGAlQ 2392 NnsLselqApLGnSYkCnkEesisVskafqlnvFnLrVQAFkVtdkgkFG N + G kE ++ + nv ++ V Vt g+ sp|Q99715| 2393 NIRYRGGNTRTGKALTFIKEKVLTWESGMRKNVPKVLV---VVT-DGR-- 2436 saeeCsaDedNfLiPIaVGaALaGLllLVLIAYlIGRKRshaGYQti<-* s+De a L++ ++G + sp|Q99715| 2437 -----SQDE----------VKKAALVI------------QQSGFSVF 2456 vwa: domain 4 of 4, from 2323 to 2496: score 202.4, E = 6.8e-57 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg DivFL D S+Sig +nF++v F+ + v+ +d++ + ++V+ sp|Q99715| 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDEISPAG-----IQVS 2364 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq vQYSd+v+ E fkln+y +k+ l alq+ir y gg t Tg+AL sp|Q99715| 2365 FVQYSDEVKSE--FKLNTYNDKALALGALQNIR----YRGGNTRTGKALT 2408 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln + +++ t +sG+R +++pkvlVv+TDGrsqd +++++ sp|Q99715| 2409 FIKEKVLTWESGMR------KNVPKVLVVVTDGRSQDE------VKKAAL 2446 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl ++++ g+ vf++Gv ++d + eL IAskp+++hvf+v+dfe+ +++ sp|Q99715| 2447 VIQQ-SGFSVFVVGVADVD---YNELANIASKPSeRHVFIVDDFESFEKI 2492 qelL<-* ++ L sp|Q99715| 2493 EDNL 2496 fibrinogen_C: domain 1 of 1, from 2662 to 2672: score 1.4, E = 15 *->tgKDCqdilqk<-* +++DC++i+ k sp|Q99715| 2662 IYIDCYEIIEK 2672 MAP1B_neuraxin: domain 1 of 1, from 2666 to 2672: score 4.8, E = 9.9 *->sYEttEK<-* +YE++EK sp|Q99715| 2666 CYEIIEK 2672 TSPN: domain 1 of 1, from 2520 to 2712: score 280.1, E = 2.8e-80 *->gqdllqvfdLpessfsvrkavglkatirvrGsdpsspAYrlgktdri g+++l++++L+e++f+ ++v+l++ Gs+ps++AYr++k sp|Q99715| 2520 GFKMLEAYNLTEKNFASVQGVSLES-----GSFPSYSAYRIQK---- 2557 pavvsqptrrlfPAsglpedFsllttfrqapeTpksrgvLfaiydaqqnv +a+v+qpt++l+P +glp+ ++++++fr++peTp +++++++i+d+ +++ sp|Q99715| 2558 NAFVNQPTADLHP-NGLPPSYTIILLFRLLPETPSDPFAIWQITDR-DYK 2605 rqlglevneVrDGrantlllryqkgsdGkvhtvvFrg.EllpLaDgqWHk +q+g++ + + ++tl++++ k+++G+v+tv+F+++E+ +L++g++Hk sp|Q99715| 2606 PQVGVIAD----PSSKTLSFFN-KDTRGEVQTVTFDTeEVKTLFYGSFHK 2650 laLsVsgssRGapavtLYVDCnkidsrpdldrpfqglppidtdGievlga ++++V+++s v++Y+DC++i +++ +++++ +i+tdG+e+lg+ sp|Q99715| 2651 VHIVVTSKS-----VKIYIDCYEIIEKD-IKEAG----NITTDGYEILGK 2690 qaandekkfqgdLQqlkivcdp<-* + ++++k++ +++Q+++ivc+p sp|Q99715| 2691 LLKGERKSAAFQIQSFDIVCSP 2712 Collagen: domain 1 of 4, from 2747 to 2806: score 28.1, E = 0.00011 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppGppGp GppGppGp+G pG++Gp+G+ G +Ga GppGp+G+ GpPGp+GppGp sp|Q99715| 2747 GPPGPPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGP 2793 pGppGapGapGpp<-* +Gp G + + + sp|Q99715| 2794 QGPNGLSIPGEQG 2806 Collagen: domain 2 of 4, from 2824 to 2883: score 46.8, E = 5e-10 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppGppGp G pG+pGppGp GppG++G++G + a Gp+GppG pG+PG+pG+ Gp sp|Q99715| 2824 GTPGLPGPPGPMGPPGDRGFTGKDSAMGPRGPPGRPGSPGSPGVTGP 2870 pGppGapGapGpp<-* G pG+pG+ G p sp|Q99715| 2871 SGKPGKPGDHGRP 2883 Peptidase_M29: domain 1 of 1, from 2920 to 2928: score -2.4, E = 58 *->MdaFkkeLe<-* M++F + L+ sp|Q99715| 2920 MNRFNQMLN 2928 Collagen: domain 3 of 4, from 2931 to 2989: score 22.7, E = 0.00021 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppGppGp p + ++ pGppGppGppG aGa+G+pGp G pG+PG pG Gp sp|Q99715| 2931 -PNDYQSSRNQPGPPGPPGPPGSAGARGEPGPGGRPGFPGTPGMQGP 2976 pGppGapGapGpp<-* pG++G pG++G++ sp|Q99715| 2977 PGERGLPGEKGER 2989 Collagen: domain 4 of 4, from 2990 to 3047: score 2.1, E = 0.0024 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppGppGp G+ Gp+G+pGppGp G+ G+pG+ G +G+pGpPG pG G sp|Q99715| 2990 -GTGSSGPRGLPGPPGPQGES-RTGPPGSTGSRGPPGPPGRPGNSGI 3034 pGppGapGapGpp<-* +GppG+pG sp|Q99715| 3035 QGPPGPPGYCDSS 3047 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem36 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- fn3 Fibronectin type III domain 957.1 4.4e-284 18 vwa von Willebrand factor type A domain 858.4 2.3e-254 4 TSPN Thrombospondin N-terminal -like domai 278.3 4.3e-80 1 Collagen Collagen triple helix repeat (20 copi 125.5 9.7e-34 6 WH2 Wiskott Aldrich syndrome homology reg 7.7 2.5 1 MAP1B_neuraxin Neuraxin and MAP1B proteins 4.8 9.9 1 lectin_legB Legume lectins beta domain 4.8 5.5 1 Ribosomal_L25p Ribosomal L25p family 4.5 3.1 1 MGS MGS-like domain 4.0 13 1 DUF31 Domain of unknown function DUF31 3.9 7.6 1 Defensin_propep Defensin propeptide 2.5 34 1 Apidaecin Apidaecin 2.0 32 1 Adenylate_cycla Adenylate cyclase, class-I 1.6 7.8 1 Ran_BP1 RanBP1 domain. 1.6 35 1 ThiJ ThiJ/PfpI family 1.5 47 1 fibrinogen_C Fibrinogen beta and gamma chains, C-t 1.4 15 1 Pribosyltran Phosphoribosyl transferase domain 1.3 32 1 Ribosomal_L37ae Ribosomal L37ae protein family 1.0 81 1 E1-E2_ATPase E1-E2 ATPase 0.9 30 1 Filamin Filamin/ABP280 repeat. 0.8 65 1 Lipoate_B Lipoate-protein ligase B 0.7 7.8 1 Occludin Occludin/ELL family 0.4 25 1 Bac_chlorC Bacteriochlorophyll C binding protein 0.3 91 1 Alpha_core Alphavirus core protein 0.3 43 1 DUF73 Protein of unknown function DUF73 -0.1 41 1 Fz Fz domain -0.1 50 1 RPH3A_effector Rabphilin-3A effector domain -0.3 34 1 Myelin_MBP Myelin basic protein -0.5 99 1 Morbilli_P Morbillivirus RNA polymerase alpha su -1.2 88 1 Transp_cyt_pur Permeases for cytosine/purines, uraci -1.2 85 1 FHIPEP FHIPEP family -1.3 47 1 Flg_hook Flagellar hook-length control protein -1.5 92 1 AOX Alternative oxidase -2.0 97 1 thaumatin Thaumatin family -2.3 86 1 Peptidase_M29 Thermophilic metalloprotease (M29) -2.4 58 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- fn3 1/18 25 103 .. 1 84 [] 65.7 1.7e-17 FHIPEP 1/1 96 106 .. 678 688 .] -1.3 47 WH2 1/1 199 211 .. 1 16 [. 7.7 2.5 DUF73 1/1 206 214 .. 321 329 .] -0.1 41 Adenylate_cycla 1/1 192 219 .. 828 855 .] 1.6 7.8 E1-E2_ATPase 1/1 242 265 .. 32 52 .. 0.9 30 ThiJ 1/1 243 271 .. 1 29 [. 1.5 47 vwa 1/4 140 312 .. 1 200 [] 244.1 1.2e-70 fn3 2/18 334 416 .. 1 84 [] 64.3 4.2e-17 Filamin 1/1 422 451 .. 43 73 .. 0.8 65 DUF31 1/1 490 511 .. 107 129 .] 3.9 7.6 Flg_hook 1/1 566 573 .. 403 410 .] -1.5 92 vwa 2/4 440 612 .. 1 200 [] 211.5 4.7e-61 Bac_chlorC 1/1 639 650 .. 59 70 .. 0.3 91 fn3 3/18 632 710 .. 1 84 [] 57.2 4.8e-15 Defensin_propep 1/1 709 719 .. 43 53 .] 2.5 34 Ribosomal_L25p 1/1 713 730 .. 1 18 [. 4.5 3.1 fn3 4/18 723 793 .. 1 74 [. 55.4 1.6e-14 Alpha_core 1/1 869 886 .. 124 141 .] 0.3 43 fn3 5/18 813 895 .. 1 84 [] 65.4 2e-17 Occludin 1/1 925 934 .. 1 13 [. 0.4 25 Transp_cyt_pur 1/1 917 942 .. 446 472 .] -1.2 85 fn3 6/18 904 987 .. 1 84 [] 40.7 2.8e-10 thaumatin 1/1 1025 1032 .. 200 207 .] -2.3 86 fn3 7/18 995 1077 .. 1 84 [] 50.5 4e-13 Myelin_MBP 1/1 1068 1086 .. 151 171 .] -0.5 99 Morbilli_P 1/1 1075 1095 .. 284 303 .] -1.2 88 fn3 8/18 1090 1166 .. 4 84 .] 43.6 4e-11 Ribosomal_L37ae 1/1 1217 1231 .. 77 91 .] 1.0 81 MGS 1/1 1305 1332 .. 1 31 [. 4.0 13 Pribosyltran 1/1 1298 1334 .. 131 160 .] 1.3 32 AOX 1/1 1301 1352 .. 1 57 [. -2.0 97 vwa 3/4 1199 1371 .. 1 200 [] 223.5 1.4e-64 fn3 9/18 1385 1463 .. 1 84 [] 76.9 9.5e-21 fn3 10/18 1474 1554 .. 1 84 [] 58.0 2.8e-15 Apidaecin 1/1 1555 1575 .. 8 28 .] 2.0 32 Ran_BP1 1/1 1591 1605 .. 99 113 .. 1.6 35 fn3 11/18 1566 1643 .. 1 84 [] 58.5 2e-15 fn3 12/18 1654 1732 .. 1 84 [] 60.2 6.3e-16 lectin_legB 1/1 1770 1791 .. 173 196 .. 4.8 5.5 fn3 13/18 1752 1836 .. 1 84 [] 79.0 2.4e-21 Lipoate_B 1/1 1898 1920 .. 1 27 [. 0.7 7.8 fn3 14/18 1844 1926 .. 1 84 [] 52.6 1e-13 fn3 15/18 1938 2016 .. 5 84 .] 59.3 1.2e-15 RPH3A_effector 1/1 2011 2030 .. 289 309 .] -0.3 34 fn3 16/18 2024 2108 .. 1 84 [] 68.1 3.4e-18 fn3 17/18 2116 2193 .. 1 84 [] 70.4 7.3e-19 fn3 18/18 2204 2281 .. 1 84 [] 37.8 1.9e-09 Fz 1/1 2302 2323 .. 85 108 .. -0.1 50 vwa 4/4 2323 2496 .. 1 200 [] 200.6 7.4e-58 fibrinogen_C 1/1 2662 2672 .. 1 11 [. 1.4 15 MAP1B_neuraxin 1/1 2666 2672 .. 11 17 .] 4.8 9.9 TSPN 1/1 2520 2712 .. 1 218 [] 278.3 4.3e-80 Collagen 1/6 2747 2798 .. 1 52 [. 39.0 3e-09 Collagen 2/6 2802 2853 .. 9 60 .] 36.6 1.5e-08 Collagen 3/6 2854 2896 .. 1 43 [. 27.2 5.5e-06 Peptidase_M29 1/1 2920 2928 .. 1 9 [. -2.4 58 Collagen 4/6 2941 2989 .. 12 60 .] 34.1 7.1e-08 Collagen 5/6 2995 3008 .. 1 14 [. 7.6 1.3 Collagen 6/6 3012 3042 .. 1 31 [. 21.3 0.00022 Alignments of top-scoring domains: fn3: domain 1 of 18, from 25 to 103: score 65.7, E = 1.7e-17 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn ++P++l ++ + +++++ sW p pi gY++t+ p+ +g+++ sp|Q99715| 25 VDPPSDLNFKIIDENTVHMSWAEP--VDPIVGYRITV-DPTTDGPTK 68 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+t++ +tt+ l+ L+P teY+v++++++++ +eS sp|Q99715| 69 EFTLSASTTETLLSELVPETEYVVTITSYDEV--EES 103 FHIPEP: domain 1 of 1, from 96 to 106: score -1.3, E = 47 *->SyqElpeeikI<-* Sy E++e +++ sp|Q99715| 96 SYDEVEESVPV 106 WH2: domain 1 of 1, from 199 to 211: score 7.7, E = 2.5 *->aRsaLLaaIrsagqGi<-* +R++LLaaI+ +++ sp|Q99715| 199 QRDELLAAIK---KIP 211 DUF73: domain 1 of 1, from 206 to 214: score -0.1, E = 41 *->lIkkikEKl<-* +Ikki++K+ sp|Q99715| 206 AIKKIPYKG 214 Adenylate_cycla: domain 1 of 1, from 192 to 219: score 1.6, E = 7.8 *->FNlPQFYQivkvdggvqvvPFrsrsagk<-* FNl Q+YQ +++ +P++++ + sp|Q99715| 192 FNLNQYYQRDELLAAIKKIPYKGGNTMT 219 E1-E2_ATPase: domain 1 of 1, from 242 to 265: score 0.9, E = 30 *->pptAtVlRdG...keeeipaeeLv<-* p+ A ++dG+++ e+eipa eL+ sp|Q99715| 242 PKVAIIITDGksqDEVEIPARELR 265 ThiJ: domain 1 of 1, from 243 to 271: score 1.5, E = 47 *->kvlilaangfEDiEliiPidvlrrgGleV<-* kv+i++++g E+ iP+ lr+ G+eV sp|Q99715| 243 KVAIIITDGKSQDEVEIPARELRNVGVEV 271 vwa: domain 1 of 4, from 140 to 312: score 244.1, E = 1.2e-70 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg D+vFL+DGS+S+g+ nF ++dF+ +v+++d+g+ k +rVg sp|Q99715| 140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGE-EK-----TRVG 180 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq +vQYS++ rtE f+ln+y +de+l+a++ki+ y gg t+Tg+A++ sp|Q99715| 181 VVQYSSDTRTE--FNLNQYYQRDELLAAIKKIP----YKGGNTMTGDAID 224 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln y+v+n fte++G+R g pkv++++TDG+sqd ++ +++ sp|Q99715| 225 YLVKNTFTESAGAR------VGFPKVAIIITDGKSQDE------VEIPAR 262 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl el+ gvevf++G++ ad ++eL++IAs p+ +hvf+v +f+a+ ++ sp|Q99715| 263 ELRN-VGVEVFSLGIKAAD---AKELKQIASTPSlNHVFNVANFDAIVDI 308 qelL<-* q+++ sp|Q99715| 309 QNEI 312 fn3: domain 2 of 18, from 334 to 416: score 64.3, E = 4.2e-17 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w +P+nl +++v+s ++ l+W+p+ +p+tgY+v + p g++ sp|Q99715| 334 VEPPSNLIAMEVSSKYVKLNWNPS--PSPVTGYKVIL-TPMTAGSrQ 377 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* + l+v ++tt+ +++L ++teY+++V A +g +++++ sp|Q99715| 378 HALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTsSEPI 416 Filamin: domain 1 of 1, from 422 to 451: score 0.8, E = 65 *->gGgltvtVegPsGaAvvevkvtDngDGTytV<-* +++++v+Ve+ G+ + +++++ + DG+y++ sp|Q99715| 422 TQPMKVQVECSRGV-DIKADIVFLVDGSYSI 451 DUF31: domain 1 of 1, from 490 to 511: score 3.9, E = 7.6 *->tsFsLGKYtdpsdfvslinletp<-* t+F+L+K+t+++d+ in p sp|Q99715| 490 TEFTLKKFTKVEDIIEAINT-FP 511 Flg_hook: domain 1 of 1, from 566 to 573: score -1.5, E = 92 *->NsaVDIFA<-* Ns V IFA sp|Q99715| 566 NSDVEIFA 573 vwa: domain 2 of 4, from 440 to 612: score 211.5, E = 4.7e-61 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg DivFL+DGS Sig +nF +v++F+e +v +++++p + v+++ sp|Q99715| 440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISP-NR-----VQIS 480 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq lvQYS +++tE f+l+ +++ +++++a++ ++ y gg tnTg+A+ sp|Q99715| 481 LVQYSRDPHTE--FTLKKFTKVEDIIEAINTFP----YRGGSTNTGKAMT 524 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln yv++++f ++ GsR +++pkv++++TDG+s+d +rd++ sp|Q99715| 525 YVREKIFVPSKGSR------SNVPKVMILITDGKSSDA------FRDPAI 562 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl +l+ +ve+fa+Gv++a++ +eL+ IAs+p + hvf v+df+a + + sp|Q99715| 563 KLRN-SDVEIFAVGVKDAVR---SELEAIASPPAeTHVFTVEDFDAFQRI 608 qelL<-* +L sp|Q99715| 609 SFEL 612 Bac_chlorC: domain 1 of 1, from 639 to 650: score 0.3, E = 91 *->afsevssyavrt<-* +fsev sy+++t sp|Q99715| 639 SFSEVTSYGFKT 650 fn3: domain 3 of 18, from 632 to 710: score 57.2, E = 4.8e-15 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.w +P++l++++vts+ + +Wsp+ + + +Y +ty +++ g+++ sp|Q99715| 632 YVPPKDLSFSEVTSYGFKTNWSPA--GENVFSYHITY-KEA-AGDdE 674 neltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* +++ p ++ts++l+ LkP t Y v V+A ++ +S sp|Q99715| 675 VTVVEPASSTSVVLSSLKPETLYLVNVTAEYED--GFS 710 Defensin_propep: domain 1 of 1, from 709 to 719: score 2.5, E = 34 *->vaiSfagdess<-* ++i++ag+e++ sp|Q99715| 709 FSIPLAGEETT 719 Ribosomal_L25p: domain 1 of 1, from 713 to 730: score 4.5, E = 3.1 *->LnaevRteqgKgasRrLR<-* L +e+ te Kga+R+L+ sp|Q99715| 713 LAGEETTEEVKGAPRNLK 730 fn3: domain 4 of 18, from 723 to 793: score 55.4, E = 1.6e-14 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn +aP+nl+vtd t++s+ ++W+ + g + + ++ yR p+ gge + sp|Q99715| 723 KGAPRNLKVTDETTDSFKITWTQA--PGRVLRCRIIYR-PVAGGESR 766 eltvpgtttsytltgLkPgteYevrVq<-* e+t p+++++ tl++L P+t+Yev+V+ sp|Q99715| 767 EVTTPPNQRRRTLENLIPDTKYEVSVI 793 Alpha_core: domain 1 of 1, from 869 to 886: score 0.3, E = 43 *->vlGGanEgartALsvvtW<-* vl G Eg++ ALsv sp|Q99715| 869 VLQGLKEGTQYALSVTAL 886 fn3: domain 5 of 18, from 813 to 895: score 65.4, E = 2e-17 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew ++ P++l+v+d t++++ lsWs + g++ Y vty p+ gge+ sp|Q99715| 813 VrGNPRDLRVSDPTTSTMKLSWSGA--PGKVKQYLVTY-TPVAGGET 856 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* +e+tv g+tt+ +l+gLk+gt+Y ++V+A+ + G G + sp|Q99715| 857 QEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGaGDAL 895 Occludin: domain 1 of 1, from 925 to 934: score 0.4, E = 25 *->FYkWtSPPGvvRi<-* WtS PG vR+ sp|Q99715| 925 ---WTSAPGMVRG 934 Transp_cyt_pur: domain 1 of 1, from 917 to 942: score -1.2, E = 85 *->iagsvgepdikvvpgakplydlsyfVG<-* +++s g+ ++ +pg++ +y++s+ sp|Q99715| 917 TDTSIGA-YWTSAPGMVRGYRVSWKSL 942 fn3: domain 6 of 18, from 904 to 987: score 40.7, E = 2.8e-10 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge. ++ P++l+ +d+t+ts+ W+ + g + gY+v++ ++ + + sp|Q99715| 904 ErGSPQDLVTKDITDTSIGAYWTSA--PGMVRGYRVSW-KSLYDDVd 947 wneltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* + e+++p ++ + ++++L+P t+Y ++V A G+G++ sp|Q99715| 948 TGEKNLPEDAIHTMIENLQPETKYRISVFATYSSGeGEPL 987 thaumatin: domain 1 of 1, from 1025 to 1032: score -2.3, E = 86 *->NYrVvFCP<-* NYrVv P sp|Q99715| 1025 NYRVVYRP 1032 fn3: domain 7 of 18, from 995 to 1077: score 50.5, E = 4e-13 *->PsaP.tnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew +s+ ++ l+v + t+++++++W+p+ g++ +Y+v yR p + g+ sp|Q99715| 995 LSQDsKTLKVDEETENTMRVTWKPA--PGKVVNYRVVYR-PHGRGKQ 1038 neltvpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* vp+t ts +l+ L+P t+Y ++V + G+G+ sp|Q99715| 1039 MVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGeGKLR 1077 Myelin_MBP: domain 1 of 1, from 1068 to 1086: score -0.5, E = 99 *->iFKLGgsgsRdrsrsgSPmaR<-* i+K+G + r +sg + R sp|Q99715| 1068 IYKMGE--GKLRQGSGTTASR 1086 Morbilli_P: domain 1 of 1, from 1075 to 1095: score -1.2, E = 88 *->kcktESGTtasp.rsqNniEs<-* k ++SGTtas+ +s+ n sp|Q99715| 1075 KLRQGSGTTASRfKSPRNLKT 1095 fn3: domain 8 of 18, from 1090 to 1166: score 43.6, E = 4e-11 *->PtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.wnel P+nl+ +d t +s++++W+p+ g++ gY+vt + p+++ + el sp|Q99715| 1090 PRNLKTSDPTMSSFRVTWEPA--PGEVKGYKVTFH-PTGDDRrLGEL 1133 tvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* v + ++++l+ L++gt+Y+v V +g eS sp|Q99715| 1134 VVGPYDNTVVLEELRAGTTYKVNVFGMFDG--GES 1166 Ribosomal_L37ae: domain 1 of 1, from 1217 to 1231: score 1.0, E = 81 *->tVrsairRLrelkEs<-* tVrs i R++e + sp|Q99715| 1217 TVRSFISRIVEVFDI 1231 MGS: domain 1 of 1, from 1305 to 1332: score 4.0, E = 13 *->aLiSvsDkdKkaellplAkkLvelGrykLyA<-* +Li+ k+++ + ++ kkL+++G ++L+A sp|Q99715| 1305 VLIT-DGKSQD-DVEAPSKKLKDEG-VELFA 1332 Pribosyltran: domain 1 of 1, from 1298 to 1334: score 1.3, E = 32 *->pGakvvgvavlvdrpeggarer........veslivvd<-* p+a+++gv+++ ++ +++++e+++++ +++ +l+ ++ sp|Q99715| 1298 PRARKIGVLITDGK-SQDDVEApskklkdeGVELFAIG 1334 AOX: domain 1 of 1, from 1301 to 1352: score -2.0, E = 97 *->rstssvrtlshkktsskapgssltvkpgaggnkskkavvsywgvevp r ++ t ++ ++ +ap+++ ++++ + +a++++ +ev+ sp|Q99715| 1301 RKIGVLITDGKSQDDVEAPSKK-----LKDEGVELFAIGIKNADEVE 1342 pkvtkedgse<-* k+++ d+ + sp|Q99715| 1343 LKMIATDPDD 1352 vwa: domain 3 of 4, from 1199 to 1371: score 223.5, E = 1.4e-64 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg Div+L+DGS+Sig++nF +v+ F++r+ve +d+gp + v+++ sp|Q99715| 1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGP-KR-----VQIA 1239 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq l+QYS+++rtE + ln++++k+++lqa+ +++ y gg t Tg AL+ sp|Q99715| 1240 LAQYSGDPRTE--WQLNAHRDKKSLLQAVANLP----YKGGNTLTGMALN 1283 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln + + + f+ +G+R ++a+k+ V++TDG+sqdd ++++++ sp|Q99715| 1284 FIRQQNFRTQAGMR------PRARKIGVLITDGKSQDD------VEAPSK 1321 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl +lk+ +gve faiG++nad + eL+ IA++pd+ h ++v dfe+L+ + sp|Q99715| 1322 KLKD-EGVELFAIGIKNAD---EVELKMIATDPDdTHDYNVADFESLSRI 1367 qelL<-* ++ L sp|Q99715| 1368 VDDL 1371 fn3: domain 9 of 18, from 1385 to 1463: score 76.9, E = 9.5e-21 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + aP+nl+++++t s+++sW+pp + +++Y+v+y +p++gg+ + sp|Q99715| 1385 LEAPSNLVISERTHRSFRVSWTPP--SDSVDRYKVEY-YPVSGGKRQ 1428 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+ v++ +ts +l++LkP teY+v V++v ++ ++S sp|Q99715| 1429 EFYVSRMETSTVLKDLKPETEYVVNVYSVVED--EYS 1463 fn3: domain 10 of 18, from 1474 to 1554: score 58.0, E = 2.8e-15 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.. P ++ l + dv++t+++++W+p g tgY ++y +p+ + e++ sp|Q99715| 1474 PVPVVSLNIYDVGPTTMHVQWQPV---GGATGYILSY-KPVKDTEpt 1516 .wneltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* +++e+++ +t + ++lt+L+P+teY v+VqAv + S sp|Q99715| 1517 rPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHD--LTS 1554 Apidaecin: domain 1 of 1, from 1555 to 1575: score 2.0, E = 32 *->EPGnnRPvYiPqPRPPHPRlr<-* EP R v +P PRP +lr sp|Q99715| 1555 EPVTVREVTLPLPRPQDLKLR 1575 Ran_BP1: domain 1 of 1, from 1591 to 1605: score 1.6, E = 35 *->aKpeqlAirFKtkEe<-* +K+ +++r Kt+Ee sp|Q99715| 1591 GKVRKYIVRYKTPEE 1605 fn3: domain 11 of 18, from 1566 to 1643: score 58.5, E = 2e-15 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn ++ P++l+ +dvt +++ + W+p g++ +Y v+y ++ +++ + sp|Q99715| 1566 LPRPQDLKLRDVTHSTMNVFWEPV--PGKVRKYIVRY-KTP-EEDVK 1608 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+ v++++ts l++L t Y+v+V Av ++ eS sp|Q99715| 1609 EVEVDRSETSTSLKDLFSQTLYTVSVSAVHDE--GES 1643 fn3: domain 12 of 18, from 1654 to 1732: score 60.2, E = 6.3e-16 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew P +aPtnl++t+vts ++ +W++ ++++ Y++t+ p ++ ++ sp|Q99715| 1654 PvPAPTNLKITEVTSEGFRGTWDHG--ASDVSLYRITW-GPFGSSDK 1697 neltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e ++ g++++ ++++L+P+t Yev+++A+ ++ eS sp|Q99715| 1698 METILNGDENTLVFENLNPNTIYEVSITAIYAD---ES 1732 lectin_legB: domain 1 of 1, from 1770 to 1791: score 4.8, E = 5.5 *->TvsWdlvngkGevAtVlItYdast<-* Tv+Wd ++ G+v + +ItY+ st sp|Q99715| 1770 TVKWDPAS--GRVQKYRITYQPST 1791 fn3: domain 13 of 18, from 1752 to 1836: score 79.0, E = 2.4e-21 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge. P+s P+nl+v + ts+slt+ W+p+ +g +++Y++ty qp+ g+++ sp|Q99715| 1752 PkSGPRNLQVYNATSNSLTVKWDPA--SGRVQKYRITY-QPSTGEGn 1795 wneltvpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* +++ t+ g ++s++l+ LkP+t+Y+++V ++ +G++G+ + sp|Q99715| 1796 EQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGegGRMT 1836 Lipoate_B: domain 1 of 1, from 1898 to 1920: score 0.7, E = 7.8 *->dkvnkirarqpkdtillleHpPVYTlG<-* + i +dt++++ + PVYT G sp|Q99715| 1898 A----ILRNLQPDTSYTVTVVPVYTEG 1920 fn3: domain 14 of 18, from 1844 to 1926: score 52.6, E = 1e-13 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + ++nl+v d ++++l ++W+++ g Y++ y p+ g++ sp|Q99715| 1844 LNTVRNLRVYDPSTSTLNVRWDHA--EGNPRQYKLFY-APA-AGGPE 1886 elt.vpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* el ++pg+t+ ++l++L+P+t Y+v+V+ v ++G++G+ S sp|Q99715| 1887 ELVpIPGNTNYAILRNLQPDTSYTVTVVPVYTEGdgGRTS 1926 fn3: domain 15 of 18, from 1938 to 2016: score 59.3, E = 1.2e-15 *->tnltvtdvtstsltlsWspptgngpitgYevtyRqpkngge.wnelt +n++v + t++ l ++W+p+ gp+ Y+v y +p++g + ++ sp|Q99715| 1938 RNVQVYNPTPNRLGVRWDPA--PGPVLQYRVVY-SPVDGTRpSESIV 1981 vpgtttsytltgLkPgteYevrVqAvnggG.GpeS<-* vpg+t+ ++l+ L P+t Y+v ++A+ +G+G +S sp|Q99715| 1982 VPGNTRMVHLERLIPDTLYSVNLVALYSDGeGNPS 2016 RPH3A_effector: domain 1 of 1, from 2011 to 2030: score -0.3, E = 34 *->gagdtsRSpdGeslvrsenrV<-* g+g s +G +l+rs++r sp|Q99715| 2011 GEGNPS-PAQGRTLPRSGPRN 2030 fn3: domain 16 of 18, from 2024 to 2108: score 68.1, E = 3.4e-18 *->P.saPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggew P+s P+nl+v + t++sl++ W+++ +gp++ Y++ y +p+ g + sp|Q99715| 2024 PrSGPRNLRVFGETTNSLSVAWDHA--DGPVQQYRIIY-SPTVGDPI 2067 nelt.vpgtttsytltgLkPgteYevrVqAvnggG..GpeS<-* e+t+vpg ++++l+ L+P+t+Y+++V+Av ++G++G+ + sp|Q99715| 2068 DEYTtVPGRRNNVILQPLQPDTPYKITVIAVYEDGdgGHLT 2108 fn3: domain 17 of 18, from 2116 to 2193: score 70.4, E = 7.3e-19 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn + +P+n++++d t +++sW+p+ +p+ gY++ y +p++++e+ sp|Q99715| 2116 LLPPQNIHISDEWYTRFRVSWDPS--PSPVLGYKIVY-KPVGSNEPM 2159 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e v t sytl++L+P t+Y v V+A+ + S sp|Q99715| 2160 EAFVGEMT-SYTLHNLNPSTTYDVNVYAQYDS--GLS 2193 fn3: domain 18 of 18, from 2204 to 2281: score 37.8, E = 1.9e-09 *->PsaPtnltvtdvtstsltlsWspptgngpitgYevtyRqpknggewn +t+l+ ++++++ + Wsp + t+Y++++ +p++g + sp|Q99715| 2204 YLNVTDLKTYQIGWDTFCVKWSPH---RAATSYRLKL-SPADGTRGQ 2246 eltvpgtttsytltgLkPgteYevrVqAvnggGGpeS<-* e+tv g++ts+ +tgL P+t+Y v+V ++ + e sp|Q99715| 2247 EITVRGSETSHCFTGLSPDTDYGVTVFVQTPN--LEG 2281 Fz: domain 1 of 1, from 2302 to 2323: score -0.1, E = 50 *->eglgvpptpipPCRslCEavrelg<-* +++++ppt ipP R++C +++ + sp|Q99715| 2302 PTPPPPPT-IPPARDVCKGAKA-D 2323 vwa: domain 4 of 4, from 2323 to 2496: score 200.6, E = 7.4e-58 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg DivFL D S+Sig +nF++v F+ + v+ +d++ + ++V+ sp|Q99715| 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDEISPAG-----IQVS 2364 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq vQYSd+v+ E fkln+y +k+ l alq+ir y gg t Tg+AL sp|Q99715| 2365 FVQYSDEVKSE--FKLNTYNDKALALGALQNIR----YRGGNTRTGKALT 2408 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln + +++ t +sG+R +++pkvlVv+TDGrsqd +++++ sp|Q99715| 2409 FIKEKVLTWESGMR------KNVPKVLVVVTDGRSQDE------VKKAAL 2446 elkkeagvevfaiGvGnadnnnleeLreIAskpd.dhvfkvsdfeaLdtl ++++ g+ vf++Gv ++d + eL IAskp+++hvf+v+dfe+ +++ sp|Q99715| 2447 VIQQ-SGFSVFVVGVADVD---YNELANIASKPSeRHVFIVDDFESFEKI 2492 qelL<-* ++ L sp|Q99715| 2493 EDNL 2496 fibrinogen_C: domain 1 of 1, from 2662 to 2672: score 1.4, E = 15 *->tgKDCqdilqk<-* +++DC++i+ k sp|Q99715| 2662 IYIDCYEIIEK 2672 MAP1B_neuraxin: domain 1 of 1, from 2666 to 2672: score 4.8, E = 9.9 *->sYEttEK<-* +YE++EK sp|Q99715| 2666 CYEIIEK 2672 TSPN: domain 1 of 1, from 2520 to 2712: score 278.3, E = 4.3e-80 *->gqdllqvfdLpessfsvrkavglkatirvrGsdpsspAYrlgktdri g+++l++++L+e++f+ ++v+l++ Gs+ps++AYr++k sp|Q99715| 2520 GFKMLEAYNLTEKNFASVQGVSLES-----GSFPSYSAYRIQK---- 2557 pavvsqptrrlfPAsglpedFsllttfrqapeTpksrgvLfaiydaqqnv +a+v+qpt++l+P +glp+ ++++++fr++peTp +++++++i+d+ +++ sp|Q99715| 2558 NAFVNQPTADLHP-NGLPPSYTIILLFRLLPETPSDPFAIWQITDR-DYK 2605 rqlglevneVrDGrantlllryqkgsdGkvhtvvFrg.EllpLaDgqWHk +q+g++ + + ++tl++++ k+++G+v+tv+F+++E+ +L++g++Hk sp|Q99715| 2606 PQVGVIAD----PSSKTLSFFN-KDTRGEVQTVTFDTeEVKTLFYGSFHK 2650 laLsVsgssRGapavtLYVDCnkidsrpdldrpfqglppidtdGievlga ++++V+++s v++Y+DC++i +++ +++++ +i+tdG+e+lg+ sp|Q99715| 2651 VHIVVTSKS-----VKIYIDCYEIIEKD-IKEAG----NITTDGYEILGK 2690 qaandekkfqgdLQqlkivcdp<-* + ++++k++ +++Q+++ivc+p sp|Q99715| 2691 LLKGERKSAAFQIQSFDIVCSP 2712 Collagen: domain 1 of 6, from 2747 to 2798: score 39.0, E = 3e-09 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppGppGp GppGppGp+G pG++Gp+G+ G +Ga GppGp+G+ GpPGp+GppGp sp|Q99715| 2747 GPPGPPGPAGGPGAKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGP 2793 pGppG<-* +Gp G sp|Q99715| 2794 QGPNG 2798 Collagen: domain 2 of 6, from 2802 to 2853: score 36.6, E = 1.5e-08 *->pGppGppGppGppGpaGapGppGppGepGpPGppGppGppGppGapG pG++G G +G +G++G pG G pG pGpPGp GppG +G +G++ sp|Q99715| 2802 PGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRGFTGKDS 2848 apGpp<-* a+Gp+ sp|Q99715| 2849 AMGPR 2853 Collagen: domain 3 of 6, from 2854 to 2896: score 27.2, E = 5.5e-06 *->GppGppGppGppGppGppGppGpaGapGppGppGepGpPGppG<-* GppG+pG+pG pG +Gp+G pG +G G pGp+G +G+ G +G sp|Q99715| 2854 GPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896 Peptidase_M29: domain 1 of 1, from 2920 to 2928: score -2.4, E = 58 *->MdaFkkeLe<-* M++F + L+ sp|Q99715| 2920 MNRFNQMLN 2928 Collagen: domain 4 of 6, from 2941 to 2989: score 34.1, E = 7.1e-08 *->pGppGppGppGpaGapGppGppGepGpPGppGppGppGppGapGapG pGppGppGppG aGa+G+pGp G pG+PG pG GppG++G pG++G sp|Q99715| 2941 PGPPGPPGPPGSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKG 2987 pp<-* ++ sp|Q99715| 2988 ER 2989 Collagen: domain 5 of 6, from 2995 to 3008: score 7.6, E = 1.3 *->GppGppGppGppGp<-* Gp+G+pGppGp+G+ sp|Q99715| 2995 GPRGLPGPPGPQGE 3008 Collagen: domain 6 of 6, from 3012 to 3042: score 21.3, E = 0.00022 *->GppGppGppGppGppGppGppGpaGapGppG<-* GppG+ G++GppGppG+pG G +G+pGppG sp|Q99715| 3012 GPPGSTGSRGPPGPPGRPGNSGIQGPPGPPG 3042 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem36 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Sun Jan 7 20:43:35 2001 Sequence file: tem36 ---------------------------------------- Sequence sp|Q99715|CA1C_HUMAN (3063 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 700: NVTA 1763: NATS 2206: NVTD 2528: NLTE 2679: NITT Total matches: 5 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 495: KKFT 775: RRRT Total matches: 2 Matching pattern PS00005 PKC_PHOSPHO_SITE: 117: STK 346: SSK 390: SVR 493: TLK 519: TGK 665: TYK 690: SLK 738: SFK 829: TMK 860: TVR 938: SWK 1001: TLK 1011: TMR 1015: TWK 1089: SPR 1102: SFR 1152: TYK 1217: TVR 1320: SKK 1378: SVK 1394: SER 1397: THR 1400: SFR 1469: TEK 1506: SYK 1558: TVR 1620: SLK 1651: TTR 1695: SDK 1740: SER 1770: TVK 1777: SGR 1836: TGR 1846: TVR 1928: TGR 2232: SYR 2249: TVR 2285: SVK 2291: TVK 2403: TGK 2476: SER 2530: TEK 2600: TDR 2615: SSK 2656: TSK 2659: SVK 2715: TSR 2725: SRR 2844: TGK 2871: SGK 2936: SSR Total matches: 51 Matching pattern PS00006 CK2_PHOSPHO_SITE: 19: SSIE 44: SWAE 75: STTE 95: TSYD 96: SYDE 168: SAFD 188: TRTE 230: TFTE 253: SQDE 318: SGVD 329: SGEE 390: SVRD 411: TSSE 417: SIME 498: TKVE 567: SDVE 582: SELE 598: TVED 639: SFSE 665: TYKE 676: TVVE 718: TTEE 786: TKYE 900: TTLE 901: TLEE 907: SPQD 941: SLYD 960: TMIE 980: SSGE 1059: TTYD 1124: TGDD 1176: TLSD 1188: SGME 1194: TRAE 1312: SQDD 1348: TDPD 1433: SRME 1456: SVVE 1526: TVND 1558: TVRE 1602: TPEE 1620: SLKD 1647: TAQE 1718: TIYE 1732: SESD 1790: STGE 1918: TEGD 1969: SPVD 2009: SDGE 2062: TVGD 2178: TTYD 2238: SPAD 2285: SVKE 2333: SIGD 2380: TYND 2437: SQDE 2589: TPSD 2600: TDRD 2626: TRGE 2715: TSRD 2725: SRRD 2844: TGKD Total matches: 62 Matching pattern PS00008 MYRISTYL: 215: GNTMTG 236: GARVGF 275: GIKAAD 409: GMTSSE 514: GGSTNT 515: GSTNTG 536: GSRSNV 575: GVKDAV 724: GAPRNL 801: GTPLTG 806: GNAATE 872: GLKEGT 876: GTQYAL 890: GAGDAL 922: GAYWTS 930: GMVRGY 1079: GSGTTA 1081: GTTASR 1159: GMFDGG 1171: GQEMTT 1274: GNTLTG 1279: GMALNF 1304: GVLITD 1334: GIKNAD 1468: GTEKTL 1498: GGATGY 1739: GSERTL 1794: GNEQTT 1802: GGRQNS 1832: GGRMTG 1922: GGRTSD 1935: GLARNV 1973: GTRPSE 2104: GGHLTG 2109: GNGRTV 2245: GQEITV 2252: GSETSH 2360: GIQVSF 2389: GALQNI 2399: GNTRTG 2420: GMRKNV 2545: GSFPSY 2572: GLPPSY 2678: GNITTD 2756: GGPGAK 2768: GISGAI 2951: GSAGAR 2962: GGRPGF 2990: GTGSSG 3015: GSTGSR Total matches: 50 Matching pattern PS00009 AMIDATION: 1424: GGKR 2074: PGRR Total matches: 2 Matching pattern PS00016 RGD: 862: RGD 2779: RGD 2895: RGD Total matches: 3 Total no of hits in this sequence: 175 ======================================== 1314 pattern(s) searched in 1 sequence(s), 3063 residues. Total no of hits in all sequences: 175. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem36 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 3063 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem36 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem36 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- DOClong 1.2 44 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- DOClong 1/1 1847 1855 .. 116 124 .] 1.2 44 Alignments of top-scoring domains: DOClong: domain 1 of 1, from 1847 to 1855: score 1.2, E = 44 *->iRqlkiygP<-* +R l++y+P sp|Q99715| 1847 VRNLRVYDP 1855 // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). (3063 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value VWA Von Willebrand factor A domain 170 2e-44 FN3 Fibronectin type III domain (Protein-protein interation ... 107 2e-25 UBA Ubiquitin pathway associated domain 30 0.039 RHOGEF RHO-type GTPase GDP exchange factor 29 0.074 RASGAP RAS-type GTPase GTP hydrolysis activating protein 27 0.32 UBHYD Ubiquitin C-terminal hydrolase domain 26 0.65 CATH Cathepsin like protease domain 26 0.65 S1 S1 RNA binding domain 25 1.2 PCNA Proliferating Cell nuclear antigen like domain 25 1.4 SGTP Small GTPAses 25 1.7 KIN Protein kinase domain 25 1.9 PUM Pumilio repeat RNA binding domain 24 2.1 INSL Insulinase like Metallo protease domain 24 2.3 AAA AAA+ ATPase Module 23 4.6 SCP SCP/PR1 domain (Domain found in plant pathogenesis relat... 23 5.1 SNARE Alpha helical domains which are involved in vesicle fu... 23 5.3 PTB Phospho-Tyrosine Binding domain 23 5.8 MIZFIN MIZ type Cysteine zinc DNA binding domain 23 6.2 DSP Dual specificity protein phosphatase 23 6.8 ARR Arrestin domain 23 8.0 AP2 A plant specific DNA binding domain (Apetala 2 like) 22 8.7 DNASE1 DNASE-1/Sphingomyelinase like domain 22 9.5 CALC Calcineurin like Phosphoesterase domain 22 9.6 ARM Armadillo repeat 22 9.8 >VWA Von Willebrand factor A domain Length = 255 Score = 170 bits (428), Expect = 2e-44 Identities = 34/229 (14%), Positives = 34/229 (14%), Gaps = 15/229 (6%) Query: 95 TSYDEVEESVPVIGQLTIQTGSSTKPVEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN 154 Sbjct: 12 RSWDLVKVD--DEGDMASLVASIVEARKKRTAKKNITPYQRGIIRSLILTLDCSEAMLEK 69 Query: 155 NFK-----YILDFIAALVSAFDIGEEKTRVGVVQYSS-DTRTEFNLNQYYQRDELLAAIK 208 Sbjct: 70 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 129 Query: 209 KIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVEI--PARELRN 266 Sbjct: 130 RKQEPKGNPSLQNALEMARGLL---LPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVS 186 Query: 267 VGVEVFSLGIKA--ADAKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Sbjct: 187 EKIRVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAV 235 Score = 160 bits (402), Expect = 2e-41 Identities = 28/214 (13%), Positives = 28/214 (13%), Gaps = 14/214 (6%) Query: 1165 ESSPLVGQEMTTLSDTTVMPILSSGMECLTRAEADIVLLVDGSWSIGRANFRTVR----- 1219 Sbjct: 21 DEGDMASLVASIVEARKKRTAKKNITPYQRGIIRSLILTLDCSEAMLEKDLRPNRHAMII 80 Query: 1220 SFISRIVEVFDIGPKRVQIALAQYS-GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLT 1278 Sbjct: 81 QYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSIRKQEPKGNPSL 140 Query: 1279 GMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD--DVEAPSKKLKDEGVELFAIGIK 1336 Sbjct: 141 QNALEMARGL---LLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKIRVKVLGL- 196 Query: 1337 NADEVELKMI--ATDPDDTHDYNVADFESLSRIV 1368 Sbjct: 197 SAQVAICKELCKATNYGDESFYKILLDETHLKEL 230 Score = 144 bits (361), Expect = 2e-36 Identities = 27/237 (11%), Positives = 27/237 (11%), Gaps = 23/237 (9%) Query: 393 DLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYSIG 452 Sbjct: 11 KRSWDLVKVDDEGDMASLVAS--IVEARKKRTAKKNITPYQRGIIR-SLILTLDCSEAML 67 Query: 453 IANFVKVRA-----FLEVLVKSFEISPNRVQISLVQYS-RDPHTEFTL-KKFTKVEDIIE 505 Sbjct: 68 EKDLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALK 127 Query: 506 AINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFR--DPAIK 563 Sbjct: 128 SIRKQEP-KGNPSLQNALEMARGLL---LPVPAHCTREVLIVFGSLSTTDPGDIHQTIDS 183 Query: 564 LRNSDVEIFAVGVKDAVRSELEAIAS---PPAETHVFTVEDFDAFQRISFELTQSIC 617 Sbjct: 184 LVSEKIRVKVLGL-SAQVAICKELCKATNYGDESFYKILLDETHLKEL---FNEAVT 236 Score = 118 bits (295), Expect = 9e-29 Identities = 27/186 (14%), Positives = 27/186 (14%), Gaps = 16/186 (8%) Query: 2318 KGAKADIVFLTDASWSIGDDNFNK-----VVKFIFNTVGGFDEISPAGIQVSFVQYSDEV 2372 Sbjct: 50 RGIIRSLILTLDCSEAMLEKDLRPNRHAMIIQYAIDFVHEFF-DQNPISQMGIIIMRNGL 108 Query: 2373 KS-EFKLNT-YNDKALALGALQNIRYRGGNTRTGKALTFIKEKVLTWESGMRKNVPKVLV 2430 Sbjct: 109 AQLVSQVSGNPQDHIDALKSIRKQEP-KGNPSLQNALEMARGLL---LPVPAHCTREVLI 164 Query: 2431 VVTDGRSQD--EVKKAALVIQQSGFSVFVVGV-ADVD-YNELANIASKPSERHVFIVDDF 2486 Sbjct: 165 VFGSLSTTDPGDIHQTIDSLVSEKIRVKVLGLSAQVAICKELCKATNYGDESFYKILLDE 224 Query: 2487 ESFEKI 2492 Sbjct: 225 THLKEL 230 >FN3 Fibronectin type III domain (Protein-protein interation domain) Length = 154 Score = 107 bits (265), Expect = 2e-25 Identities = 32/170 (18%), Positives = 32/170 (18%), Gaps = 18/170 (10%) Query: 1754 SGPRNLQVYNATSNSLTVKWDPASGRVQ-KYRITYQPSTGEGNEQTTTIGGRQNSVVLQK 1812 Sbjct: 2 SAVFDIHVVYVTTTEMWLDWKSPDGASEYVYHLVI-----ESKHGSNHTSTYDKAITLQG 56 Query: 1813 LKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLNVRWDHAEGNPR 1872 Sbjct: 57 LIPGTLYNITISPEVDHVWG-----------AIQVFDVTAVNISATSLTLIWKVSDNESS 105 Query: 1873 QYKLFYAPAAGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGDG 1922 Sbjct: 106 SNYTYKIHVAGETDS-SNLNVSEPRAVIPGLRSSTFYNITVCPVLGDIEG 154 Score = 103 bits (255), Expect = 4e-24 Identities = 33/169 (19%), Positives = 33/169 (19%), Gaps = 16/169 (9%) Query: 724 GAPRNLKVTDETTDSFKITWTQAPGRVLRCRIIYRPVAGGESREVTTPPNQRRRTLENLI 783 Sbjct: 2 SAVFDIHVVYVTTTEMWLDWKSPDGAS---EYVYHLVIESKHGSNHTSTYDKAITLQGLI 58 Query: 784 PDTKYEVSVIPEYFSGPGTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQ 843 Sbjct: 59 PGTLYNITISPEVDHVWGAIQ------------VFDVTAVNISATSLTLIWKVSDNESSS 106 Query: 844 YLVTYTPVAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAG 892 Sbjct: 107 NYTYKIHVAGE-TDSSNLNVSEPRAVIPGLRSSTFYNITVCPVLGDIEG 154 Score = 90.8 bits (223), Expect = 2e-20 Identities = 29/168 (17%), Positives = 29/168 (17%), Gaps = 19/168 (11%) Query: 2026 SGPRNLRVFGETTNSLSVAWDHADGPVQQ-YRIIYSPTVGDPIDEYTTVPGRRNNVILQP 2084 Sbjct: 2 SAVFDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIESKHGS-----NHTSTYDKAITLQG 56 Query: 2085 LQPDTPYKITVIAVYEDGDGGHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVL 2144 Sbjct: 57 LIPGTLYNITISPEVDHVW-----------GAIQVFDVTAVNISATSLTLIWKVSDNESS 105 Query: 2145 -GYKIVYKPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSG 2191 Sbjct: 106 SNYTYKIHVAGETDSSNLNVS-EPRAVIPGLRSSTFYNITVCPVLGDI 152 Score = 90.1 bits (221), Expect = 4e-20 Identities = 28/167 (16%), Positives = 28/167 (16%), Gaps = 20/167 (11%) Query: 1387 APSNLVISERTHRSFRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTVLKDLKP 1446 Sbjct: 3 AVFDIHVVYVTTTEMWLDWKSPDGASEYV---YHLVIESKHGSNHTSTYDKAITLQGLIP 59 Query: 1447 ETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPV--GGATGYI 1504 Sbjct: 60 GTLYNITISPEVDHVWGAI----------QVFDVTAVNISATSLTLIWKVSDNESSSNYT 109 Query: 1505 LSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHD 1551 Sbjct: 110 YKIHVAGETD-----SSNLNVSEPRAVIPGLRSSTFYNITVCPVLGD 151 Score = 81.1 bits (198), Expect = 2e-17 Identities = 30/169 (17%), Positives = 30/169 (17%), Gaps = 23/169 (13%) Query: 1000 KTLKVDEETENTMRVTWKP--APGKVVNYRVVYRPHGRGKQMVAKVPPTVTSTVLKRLQP 1057 Sbjct: 5 FDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIESKHGSN-----HTSTYDKAITLQGLIP 59 Query: 1058 QTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLKTSDPTMSSFRVTWEPAP-GEVKG 1116 Sbjct: 60 GTLYNITISPEVDHVWGAI------------QVFDVTAVNISATSLTLIWKVSDNESSSN 107 Query: 1117 YKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGE 1165 Sbjct: 108 YTYKIHVAGETD---SSNLNVSEPRAVIPGLRSSTFYNITVCPVLGDIE 153 Score = 73.7 bits (179), Expect = 3e-15 Identities = 25/162 (15%), Positives = 25/162 (15%), Gaps = 14/162 (8%) Query: 1569 PQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTSLKDLFSQT 1628 Sbjct: 4 VFDIHVVYVTTTEMWLDWKSPDG--ASEYVYHLVIESKHGSNHTSTYDKAITLQGLIPGT 61 Query: 1629 LYTVSVSAVHDEGESPPVTAQETTRPVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRIT 1688 Sbjct: 62 LYNITISPEVDHVWGA-----------IQVFDVTAVNISATSLTLIWKVSDNESSSNYTY 110 Query: 1689 WGPFGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYAD 1730 Sbjct: 111 KIHVAGETD-SSNLNVSEPRAVIPGLRSSTFYNITVCPVLGD 151 Score = 53.4 bits (127), Expect = 4e-09 Identities = 13/77 (16%), Positives = 13/77 (16%), Gaps = 2/77 (2%) Query: 1937 ARNVQVYNPTPNRLGVRWDPAPGPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIP 1996 Sbjct: 80 VFDVTAVNISATSLTLIWKVSDNESSSNYTYKIHVAGE--TDSSNLNVSEPRAVIPGLRS 137 Query: 1997 DTLYSVNLVALYSDGEG 2013 Sbjct: 138 STFYNITVCPVLGDIEG 154 Score = 51.9 bits (123), Expect = 1e-08 Identities = 17/90 (18%), Positives = 17/90 (18%), Gaps = 5/90 (5%) Query: 634 PPKDLSFSEVTSYGFKTNWS-PAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLK 692 Sbjct: 3 AVFDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIESKHGS----NHTSTYDKAITLQGLI 58 Query: 693 PETLYLVNVTAEYEDGFSIPLAGEETTEEV 722 Sbjct: 59 PGTLYNITISPEVDHVWGAIQVFDVTAVNI 88 Score = 51.9 bits (123), Expect = 1e-08 Identities = 15/89 (16%), Positives = 15/89 (16%), Gaps = 6/89 (6%) Query: 1938 RNVQVYNPTPNRLGVRWDPAPGPV-LQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIP 1996 Sbjct: 5 FDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIESKHGSN-----HTSTYDKAITLQGLIP 59 Query: 1997 DTLYSVNLVALYSDGEGNPSPAQGRTLPR 2025 Sbjct: 60 GTLYNITISPEVDHVWGAIQVFDVTAVNI 88 Score = 47.2 bits (111), Expect = 3e-07 Identities = 15/75 (20%), Positives = 15/75 (20%) Query: 2207 VTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITVRGSETSHCFTGLSPDT 2266 Sbjct: 80 VFDVTAVNISATSLTLIWKVSDNESSSNYTYKIHVAGETDSSNLNVSEPRAVIPGLRSST 139 Query: 2267 DYGVTVFVQTPNLEG 2281 Sbjct: 140 FYNITVCPVLGDIEG 154 Score = 45.3 bits (106), Expect = 1e-06 Identities = 16/92 (17%), Positives = 16/92 (17%), Gaps = 3/92 (3%) Query: 27 PPSDLNFKIIDENTVHMSWAEPVDPIVGYRITVDPTTDGPTKEFTLSASTTETLLSELVP 86 Sbjct: 3 AVFDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIESKHG---SNHTSTYDKAITLQGLIP 59 Query: 87 ETEYVVTITSYDEVEESVPVIGQLTIQTGSST 118 Sbjct: 60 GTLYNITISPEVDHVWGAIQVFDVTAVNISAT 91 Score = 44.9 bits (105), Expect = 1e-06 Identities = 16/92 (17%), Positives = 16/92 (17%), Gaps = 4/92 (4%) Query: 908 PQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQP 967 Sbjct: 4 VFDIHVVYVTTTEMWLDWKSPDG--ASEYVYHLVIES--KHGSNHTSTYDKAITLQGLIP 59 Query: 968 ETKYRISVFATYSSGEGEPLTGDATTELSQDS 999 Sbjct: 60 GTLYNITISPEVDHVWGAIQVFDVTAVNISAT 91 Score = 44.5 bits (104), Expect = 2e-06 Identities = 9/80 (11%), Positives = 9/80 (11%), Gaps = 4/80 (5%) Query: 336 PPSNLIAMEVSSKYVKLNWNPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLS 395 Sbjct: 3 AVFDIHVVYVTTTEMWLDWKSPDGASE----YVYHLVIESKHGSNHTSTYDKAITLQGLI 58 Query: 396 ADTEYQISVSAMKGMTSSEP 415 Sbjct: 59 PGTLYNITISPEVDHVWGAI 78 Score = 41.4 bits (96), Expect = 2e-05 Identities = 11/56 (19%), Positives = 11/56 (19%), Gaps = 2/56 (3%) Query: 653 SPAGENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDG 708 Sbjct: 99 VSDNESSSNYTYKIHVAGETDSSNL--NVSEPRAVIPGLRSSTFYNITVCPVLGDI 152 Score = 41.4 bits (96), Expect = 2e-05 Identities = 12/77 (15%), Positives = 12/77 (15%), Gaps = 4/77 (5%) Query: 336 PPSNLIAMEVSSKYVKLNWNPSPSP-VTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDL 394 Sbjct: 79 QVFDVTAVNISATSLTLIWKVSDNESSSNYTYKIHVAGETDSSN---LNVSEPRAVIPGL 135 Query: 395 SADTEYQISVSAMKGMT 411 Sbjct: 136 RSSTFYNITVCPVLGDI 152 Score = 39.4 bits (91), Expect = 6e-05 Identities = 14/104 (13%), Positives = 14/104 (13%), Gaps = 2/104 (1%) Query: 2207 VTDLKTYQIGWDTFCVKWSPHRAATSYRLKLSPADGTRGQEITVRGSETSHCFTGLSPDT 2266 Sbjct: 4 VFDIHVVYVTTTEMWLDWKSPDGASEYVYHLVIE--SKHGSNHTSTYDKAITLQGLIPGT 61 Query: 2267 DYGVTVFVQTPNLEGPGVSVKEHTTVKPTEAPTEPPTPPPPPTI 2310 Sbjct: 62 LYNITISPEVDHVWGAIQVFDVTAVNISATSLTLIWKVSDNESS 105 Score = 34.7 bits (79), Expect = 0.002 Identities = 10/59 (16%), Positives = 10/59 (16%), Gaps = 3/59 (5%) Query: 40 TVHMSWAEPVDP-IVGYRITVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSY 97 Sbjct: 92 SLTLIWKVSDNESSSNYTYKIHV--AGETDSSNLNVSEPRAVIPGLRSSTFYNITVCPV 148 Score = 32.8 bits (74), Expect = 0.006 Identities = 13/65 (20%), Positives = 13/65 (20%), Gaps = 2/65 (3%) Query: 920 SIGAYWTSAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATY 979 Sbjct: 92 SLTLIWKVSDNESSSNYTYKIHVAGETD--SSNLNVSEPRAVIPGLRSSTFYNITVCPVL 149 Query: 980 SSGEG 984 Sbjct: 150 GDIEG 154 >UBA Ubiquitin pathway associated domain Length = 255 Score = 30.1 bits (67), Expect = 0.039 Identities = 24/85 (28%), Positives = 24/85 (28%), Gaps = 9/85 (10%) Query: 2938 RNQPGPPGPPGPPGSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSSGPR 2997 Sbjct: 58 MGIPENLRQPEPQQQTAAAAE-----QPSTAATTAEQ--PAEDDLFAQAAQGGNASSGAL 110 Query: 2998 GLPGPPGPQGESRTGPPGSTGSRGP 3022 Sbjct: 111 GTTG--GATDAAQGGPPGSIGLTVE 133 Score = 23.1 bits (49), Expect = 5.1 Identities = 24/148 (16%), Positives = 24/148 (16%), Gaps = 33/148 (22%) Query: 2849 AMGPRGPPGRPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRGDIASQN--MMRA 2906 Sbjct: 97 AQAAQGGNASSGALGTTGGATDAAQGGPPGSIGLTVEDLLSLRQ-----VVSGNPEALAP 151 Query: 2907 VARQVCE------QLISGQMNRFNQMLNQIPNDYQSSRNQPGPPGPPGPPGSAGARGEPG 2960 Sbjct: 152 LLENISARYPQLREHIMANPEVFVSMLLE----------------AVGDNMQDV--MEGA 193 Query: 2961 PGGRPGFPGTPGMQGPPGERGLPGEKGE 2988 Sbjct: 194 DDMVEG--EDIEVTGEAAAAGLGQGEGE 219 Score = 23.1 bits (49), Expect = 5.9 Identities = 14/42 (33%), Positives = 14/42 (33%) Query: 2760 AKGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLSI 2801 Sbjct: 97 AQAAQGGNASSGALGTTGGATDAAQGGPPGSIGLTVEDLLSL 138 >RHOGEF RHO-type GTPase GDP exchange factor Length = 176 Score = 29.2 bits (65), Expect = 0.074 Identities = 13/69 (18%), Positives = 13/69 (18%), Gaps = 6/69 (8%) Query: 262 RELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQV---CS 318 Sbjct: 14 RELYTV-LLGYRAEMDNPEMFDLMPPLLRNKKDILF--GNMAEIYEFHNDIFLSSLENCA 70 Query: 319 GVDEQLGEL 327 Sbjct: 71 HAPERVGPC 79 Score = 22.2 bits (47), Expect = 9.7 Identities = 9/35 (25%), Positives = 9/35 (25%), Gaps = 1/35 (2%) Query: 1447 ETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLN 1481 Sbjct: 9 ERVYVRELYTVLLG-YRAEMDNPEMFDLMPPLLRN 42 >RASGAP RAS-type GTPase GTP hydrolysis activating protein Length = 292 Score = 27.1 bits (59), Expect = 0.32 Identities = 17/48 (35%), Positives = 17/48 (35%), Gaps = 3/48 (6%) Query: 292 SLNHVFNVANFDAIVDIQNEIISQ--VCSGVDEQLGELVSGE-EVVEP 336 Sbjct: 14 TADHVFPLATYDDLMNLLLESVDQRPITVSAVSILGELVSGKTEVAQP 61 >UBHYD Ubiquitin C-terminal hydrolase domain Length = 884 Score = 26.2 bits (57), Expect = 0.65 Identities = 14/56 (25%), Positives = 14/56 (25%) Query: 1806 NSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLN 1861 Sbjct: 427 NKTMIELSDNENPWTIFLETVDPELAASGATLPKFDKDHDVMLFLKMYDPKTRSLN 482 >CATH Cathepsin like protease domain Length = 371 Score = 26.1 bits (57), Expect = 0.65 Identities = 14/70 (20%), Positives = 14/70 (20%), Gaps = 8/70 (11%) Query: 1176 TLSDTTVMPILSSGMECLTRAEA-DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPK 1234 Sbjct: 308 TPNQYVLKVTVQGETLCLSGFSGLDVPPPAGPLWILGDV-------FIGSYYTVFDRDND 360 Query: 1235 RVQIALAQYS 1244 Sbjct: 361 AVGFAKSSGS 370 >S1 S1 RNA binding domain Length = 305 Score = 25.3 bits (55), Expect = 1.2 Identities = 4/44 (9%), Positives = 4/44 (9%), Gaps = 6/44 (13%) Query: 296 VFNVANFDAIVDIQNEI-----ISQVCSGVDEQLGELVS-GEEV 333 Sbjct: 36 ICQHANEGVYVDFGGKSPGFVPVQELGLRPHAEIEDSFPLDSAW 79 >PCNA Proliferating Cell nuclear antigen like domain Length = 280 Score = 24.8 bits (54), Expect = 1.4 Identities = 11/136 (8%), Positives = 11/136 (8%), Gaps = 21/136 (15%) Query: 1694 SSDKMETI---LNGDENTLVFENLNPNT----IYEVSITAIYADESESDDLIGSERTLPI 1746 Sbjct: 101 TPGTLTALRMCYQGYGHPLMLFLEEGGVVTVCKITTQEPEETLDFDFCSTNVMNKIILQ- 159 Query: 1747 LTTQAPKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTTIGGRQN 1806 Sbjct: 160 --SEGLREAFSEL---DMTGDVLQITVSPDKPYFR--------LSTFGNAGNSHLDYPKD 206 Query: 1807 SVVLQKLKPDTPYTIT 1822 Sbjct: 207 SDLVEAFHCNKTQVNR 222 >SGTP Small GTPAses Length = 164 Score = 24.7 bits (54), Expect = 1.7 Identities = 16/72 (22%), Positives = 16/72 (22%), Gaps = 8/72 (11%) Query: 2334 IGDDNFNK---VVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSE-FKLNTYNDKALA-L 2388 Sbjct: 6 IGNAGTGKSCLLHQFIEKK---FKDDSNHTIGVEFGSKIINVGGKYVKLQIWDTAGQERF 62 Query: 2389 GALQNIRYRGGN 2400 Sbjct: 63 RSVTRSYYRGAA 74 >KIN Protein kinase domain Length = 313 Score = 24.7 bits (53), Expect = 1.9 Identities = 11/53 (20%), Positives = 11/53 (20%), Gaps = 3/53 (5%) Query: 2614 PSSKTLSFFNKD---TRGEVQTVTFDTEEVKTLFYGSFHKVHIVVTSKSVKIY 2663 Sbjct: 5 PTSDTSNFNDNISYFVYGSQFTVPRRYSIVKCIGHGAYGVVCSAKDNLTGEKV 57 >PUM Pumilio repeat RNA binding domain Length = 337 Score = 24.4 bits (53), Expect = 2.1 Identities = 13/77 (16%), Positives = 13/77 (16%), Gaps = 18/77 (23%) Query: 155 NFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKG 214 Sbjct: 155 KFGFIIDAIVEQNNIITISTHKHGCCVLQKLLSVCTL------QQIFKISVKIVQF---- 204 Query: 215 GNTMTGDAIDYLVKNTF 231 Sbjct: 205 --------LPGLINDQF 213 >INSL Insulinase like Metallo protease domain Length = 433 Score = 24.1 bits (52), Expect = 2.3 Identities = 12/81 (14%), Positives = 12/81 (14%), Gaps = 15/81 (18%) Query: 514 GGSTNTGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFA 573 Sbjct: 91 GGNIN---AGTS-KDYTYYHVEIAHPYWKQALEVL-----YQLTMKA--TLDEEMIE--- 136 Query: 574 VGVKDAVRSELEAIASPPAET 594 Sbjct: 137 -KEKPIVIEELRRGKDNPTTV 156 >AAA AAA+ ATPase Module Length = 298 Score = 23.3 bits (49), Expect = 4.6 Identities = 17/139 (12%), Positives = 17/139 (12%), Gaps = 1/139 (0%) Query: 173 GEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLA-AIKKIPYKGGNTMTGDAIDYLVKNTF 231 Sbjct: 113 GSELVQKFIGEGARMVRELFVMAREHAPSIIFMDEIDSIGSRLEGGSGGDSEVQRTMLEL 172 Query: 232 TESAGARVGFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTP 291 Sbjct: 173 LNQLDGFEATKNIKVIMATNRIDILDSALLRPGRIDRKIEFPPPNEEARLDILKIHSRKM 232 Query: 292 SLNHVFNVANFDAIVDIQN 310 Sbjct: 233 NLTRGINLRKIAELMPGAS 251 >SCP SCP/PR1 domain (Domain found in plant pathogenesis related proteins and animal trypsin inhibitors) Length = 159 Score = 23.3 bits (50), Expect = 5.1 Identities = 7/47 (14%), Positives = 7/47 (14%), Gaps = 7/47 (14%) Query: 303 DAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPP-SNLIAMEVSSK 348 Sbjct: 4 QDILKEHNDFRQKIA------RGLETRGNPGPQPPAKNMKNLVWNDE 44 >SNARE Alpha helical domains which are involved in vesicle fusion Length = 254 Score = 23.3 bits (50), Expect = 5.3 Identities = 12/73 (16%), Positives = 12/73 (16%), Gaps = 7/73 (9%) Query: 2359 AGIQVSFVQYSDEVKSEFKLNTYN--DKALALGALQNIRYRGGNTRTGKALTFIKEKVLT 2416 Sbjct: 78 YPNVLAF-SFLDELQKEF-ITTYNMMKTNTAVRPYCFIEFDNFIQRTKQRYNNPRSLST- 134 Query: 2417 WESGMRKNVPKVL 2429 Sbjct: 135 --KINLSDMQMEI 145 >PTB Phospho-Tyrosine Binding domain Length = 138 Score = 22.9 bits (49), Expect = 5.8 Identities = 13/86 (15%), Positives = 13/86 (15%), Gaps = 3/86 (3%) Query: 427 VQVECSRGVDIKADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSR 486 Sbjct: 15 VETPKANGSDVAREAIHAIRF--QRDLKRSEQTRETAKLQKVEIRISIDNVIIADIK-TK 71 Query: 487 DPHTEFTLKKFTKVEDIIEAINTFPY 512 Sbjct: 72 APMYTFPLGRISFCADDKDDKRMFSF 97 >MIZFIN MIZ type Cysteine zinc DNA binding domain Length = 172 Score = 23.1 bits (49), Expect = 6.2 Identities = 12/62 (19%), Positives = 12/62 (19%), Gaps = 7/62 (11%) Query: 130 IQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTR 189 Sbjct: 72 LNDCSDV--DEIKFQEDGSWCPMRPK-----KEAMKVTSQPCTKVESSSVFSKPCSVTVA 124 Query: 190 TE 191 Sbjct: 125 SD 126 >DSP Dual specificity protein phosphatase Length = 185 Score = 22.6 bits (48), Expect = 6.8 Identities = 17/87 (19%), Positives = 17/87 (19%), Gaps = 12/87 (13%) Query: 1654 PVPAPTNLKITEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMETILN-GDENTLVFE 1712 Sbjct: 21 GCYSLPSQPCNEVTPRIYVGNASV-AQDI--------PKLQKLGITHVLNAAEGRSFMHV 71 Query: 1713 NLNPNTIYEVSIT--AIYADESESDDL 1737 Sbjct: 72 NTNANFYKDSGITYLGIKANDTQEFNL 98 >ARR Arrestin domain Length = 454 Score = 22.5 bits (48), Expect = 8.0 Identities = 7/47 (14%), Positives = 7/47 (14%) Query: 684 TSVVLSSLKPETLYLVNVTAEYEDGFSIPLAGEETTEEVKGAPRNLK 730 Sbjct: 255 SNKTVKKLKIYIIQVADICLFTTASYSCEVARIESNEGFPVGPGGTL 301 >AP2 A plant specific DNA binding domain (Apetala 2 like) Length = 218 Score = 22.4 bits (47), Expect = 8.7 Identities = 11/36 (30%), Positives = 11/36 (30%), Gaps = 5/36 (13%) Query: 278 AADAKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Sbjct: 150 ATGSEELGEIVELPSLG-----SSYDGLTQLGNEFI 180 >DNASE1 DNASE-1/Sphingomyelinase like domain Length = 388 Score = 22.4 bits (47), Expect = 9.5 Identities = 18/113 (15%), Positives = 18/113 (15%), Gaps = 6/113 (5%) Query: 831 KLSWSGAPGKVKQYLVTYTPVAGGETQEVTVRGDTTNTVLQ--GLKEGTQYALSVTALYA 888 Sbjct: 126 ALKWEERKYLILEEILMYQPDVLC-LQEVDHYFDTFQPILSRLGYQCTFLAKPWSPCLDV 184 Query: 889 SGAGDALFGEGTTLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKS 941 Sbjct: 185 EHNNG---PDGCALFFLQDRFQLVNSAKIRLSARTLKTNQVAIAETLQCCETG 234 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 22.1 bits (46), Expect = 9.6 Identities = 13/87 (14%), Positives = 13/87 (14%), Gaps = 13/87 (14%) Query: 2590 PSDPFAIWQITD-----RDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTF--DTEEVKT 2642 Sbjct: 10 EKPVIKLLQITDPHLFKDESAELLGVNTQASFAQVLKEIQQENNEFDVILATGD------ 63 Query: 2643 LFYGSFHKVHIVVTSKSVKIYIDCYEI 2669 Sbjct: 64 LVQDSSDEGYIRFVEMMKPFNKPVFWI 90 >ARM Armadillo repeat Length = 532 Score = 22.2 bits (47), Expect = 9.8 Identities = 11/59 (18%), Positives = 11/59 (18%), Gaps = 3/59 (5%) Query: 274 LGIKAADAKELKQIAST---PSLNHVFNVANFDAIVDIQNEIISQVCSGVDEQLGELVS 329 Sbjct: 352 SNITAGNKDQIQTVVEANLISPLVSLLQNAEFDIKKEAAWAISNATSGGSHDQIKYLVE 410 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 24 Number of calls to ALIGN: 47 Length of query: 3063 Total length of test sequences: 20182 Effective length of test sequences: 15939.0 Effective search space size: 48176715.6 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= sp|Q99715|CA1C_HUMAN COLLAGEN ALPHA 1(XII) CHAIN PRECURSOR - Homo sapiens (Human). (3063 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|2582830 [161..353] Integrin A (or I) domain 177 3e-45 gi|420193 [185..357] Integrin A (or I) domain 161 1e-40 gi|2645090 [1..155] Integrin A (or I) domain 149 7e-37 gi|322088 [20..469] Periplasmic binding protein-like II 31 0.27 gi|1072178 [17..364] Phosphoglycerate mutase-like 30 0.39 gi|128081 [13..98] Neurophysin II 30 0.61 gi|120277 [2..148] Flavodoxin-like 30 0.72 gi|951110 [5..279] ConA-like lectins/glucanases 28 2.0 gi|1055187 [23..334] Phosphoglycerate mutase-like 28 2.1 gi|1001472 [191..704] N-terminal nucleophile aminohydrolases... 28 2.4 gi|549660 [99..278] 7-bladed beta-propeller 28 2.5 gi|1723074 [152..573] Periplasmic binding protein-like II 28 3.1 gi|442637 [1..263] Trypsin-like serine proteases 28 3.5 gi|2305210 [39..399] Serpins 27 3.5 gi|1181604 [175..454] 7-bladed beta-propeller 27 4.0 gi|2497794 [4..293] Creatinase/methionine aminopeptidase 27 4.1 gi|227854 [61..330] Periplasmic binding protein-like I 27 4.2 gi|1652862 [45..566] Periplasmic binding protein-like II 27 5.1 gi|1044824 [27..332] Phosphoglycerate mutase-like 27 5.8 gi|2506629 [5..112] Flavodoxin-like 26 6.1 gi|1170097 [1..85] Thioredoxin-like 26 6.2 gi|2494172 [22..295] DNase I-like 26 6.2 gi|452523 [41..225] 7-bladed beta-propeller 27 6.5 gi|2337942 [43..470] Periplasmic binding protein-like II 26 7.4 gi|482247 [22..143] TBP-like 26 7.5 gi|1420859 [36..611] Periplasmic binding protein-like II 26 9.0 gi|2494764 [404..525] GMP synthetase, the C-terminal, dimeri... 26 9.5 gi|1765998 [434..643] ADP-ribosylation 26 9.8 >gi|2582830 [161..353] Integrin A (or I) domain Length = 193 Score = 177 bits (445), Expect = 3e-45 Identities = 62/192 (32%), Positives = 62/192 (32%), Gaps = 15/192 (7%) Query: 140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQ 199 Sbjct: 2 DIVIVLDGSNSIY--PWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYLNTYST 59 Query: 200 RDELLAAIKKIPYKGGN-TMTGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVE 258 Sbjct: 60 TEEVMDAALRIRQRGGTQTMTALGIDTAREEAFTEAHGARRGVQKVMVIVTDGESHDNYR 119 Query: 259 I--PARELRNVGVEVFSLGI----------KAADAKELKQIASTPSLNHVFNVANFDAIV 306 Sbjct: 120 LQEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALV 179 Query: 307 DIQNEIISQVCS 318 Sbjct: 180 TIVEALGERIFA 191 Score = 169 bits (426), Expect = 5e-43 Identities = 60/191 (31%), Positives = 60/191 (31%), Gaps = 15/191 (7%) Query: 440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTK 499 Sbjct: 2 DIVIVLDGSNSIY--PWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYLNTYST 59 Query: 500 VEDIIEAINTFPYRGGS-TNTGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDA-- 556 Sbjct: 60 TEEVMDAALRIRQRGGTQTMTALGIDTAREEAFTEAHGARRGVQKVMVIVTDGESHDNYR 119 Query: 557 FRDPAIKLRNSDVEIFAVGVKD----------AVRSELEAIASPPAETHVFTVEDFDAFQ 606 Sbjct: 120 LQEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALV 179 Query: 607 RISFELTQSIC 617 Sbjct: 180 TIVEALGERIF 190 Score = 166 bits (418), Expect = 4e-42 Identities = 56/191 (29%), Positives = 56/191 (29%), Gaps = 15/191 (7%) Query: 1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRD 1258 Sbjct: 2 DIVIVLDGSNSIY--PWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYLNTYST 59 Query: 1259 KKSLLQAVANLPYKGGN-TLTGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD--D 1315 Sbjct: 60 TEEVMDAALRIRQRGGTQTMTALGIDTAREEAFTEAHGARRGVQKVMVIVTDGESHDNYR 119 Query: 1316 VEAPSKKLKDEGVELFAIGIKN----------ADEVELKMIATDPDDTHDYNVADFESLS 1365 Sbjct: 120 LQEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALV 179 Query: 1366 RIVDDLTINLC 1376 Sbjct: 180 TIVEALGERIF 190 Score = 150 bits (376), Expect = 3e-37 Identities = 60/192 (31%), Positives = 60/192 (31%), Gaps = 16/192 (8%) Query: 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYN 2382 Sbjct: 2 DIVIVLDGSNSIY--PWESVTAFLNSLLRNMD-IGPQQTQVGIVQYGQTVVHEFYLNTYS 58 Query: 2383 DKALALGALQNIRYRGGN-TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDE- 2440 Sbjct: 59 TTEEVMDAALRIRQRGGTQTMTALGIDTAREEAFTEAHGARRGVQKVMVIVTDGESHDNY 118 Query: 2441 -VKKAALVIQQSGFSVFVVGVAD----------VDYNELANIASKPSERHVFIVDDFESF 2489 Sbjct: 119 RLQEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELAL 178 Query: 2490 EKIEDNLITFVC 2501 Sbjct: 179 VTIVEALGERIF 190 >gi|420193 [185..357] Integrin A (or I) domain Length = 173 Score = 161 bits (405), Expect = 1e-40 Identities = 42/172 (24%), Positives = 42/172 (24%), Gaps = 7/172 (4%) Query: 440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEIS-----PNRVQISLVQYSRDPHTEFTL 494 Sbjct: 1 DIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF 60 Query: 495 KKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSK-GSRSNVPKVMILITDGKS 553 Sbjct: 61 PLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGVT 120 Query: 554 -SDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVEDFDA 604 Sbjct: 121 GNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDS 172 Score = 155 bits (390), Expect = 8e-39 Identities = 36/173 (20%), Positives = 36/173 (20%), Gaps = 7/173 (4%) Query: 1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIG-----PKRVQIALAQYSGDPRTEWQL 1253 Sbjct: 1 DIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF 60 Query: 1254 NAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNF-RTQAGMRPRARKIGVLITDGKS 1312 Sbjct: 61 PLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGVT 120 Query: 1313 -QDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVADFESL 1364 Sbjct: 121 GNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSL 173 Score = 144 bits (360), Expect = 2e-35 Identities = 35/172 (20%), Positives = 35/172 (20%), Gaps = 7/172 (4%) Query: 140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIG-----EEKTRVGVVQYSSDTRTEFNL 194 Sbjct: 1 DIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF 60 Query: 195 NQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES-AGARVGFPKVAIIITDGKS 253 Sbjct: 61 PLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGVT 120 Query: 254 -QDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVANFDA 304 Sbjct: 121 GNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDS 172 Score = 138 bits (345), Expect = 2e-33 Identities = 36/172 (20%), Positives = 36/172 (20%), Gaps = 6/172 (3%) Query: 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDE----ISPAGIQVSFVQYSDEVKSEFKL 2378 Sbjct: 1 DIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF 60 Query: 2379 NTYNDKALALGALQNIRYRGGNTRTGKALTFIKEKVLTW-ESGMRKNVPKVLVVVTDGRS 2437 Sbjct: 61 PLTYNVTTIHEALERATYLNSFSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGVT 120 Query: 2438 -QDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHVFIVDDFES 2488 Sbjct: 121 GNDSLEESVHSMRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDS 172 >gi|2645090 [1..155] Integrin A (or I) domain Length = 155 Score = 149 bits (373), Expect = 7e-37 Identities = 48/156 (30%), Positives = 48/156 (30%), Gaps = 8/156 (5%) Query: 1200 IVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDK 1259 Sbjct: 1 MLFLLDSSGSVGDENFDKMKEFVKSIVLNFDVDNQLTRIGIIRFDSDAEIIIQLSDHKTL 60 Query: 1260 KSLLQAVANLPYKGGN-TLTGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD---- 1314 Sbjct: 61 KDLLNDIDSIRYNEGIQTRIDKALERAMEA-FSEKNGGRADATKALVLLADGQNSFIEGS 119 Query: 1315 -DVEAPSKKLKDEGVELFAIGI-KNADEVELKMIAT 1348 Sbjct: 120 QDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIAT 155 Score = 143 bits (358), Expect = 4e-35 Identities = 44/156 (28%), Positives = 44/156 (28%), Gaps = 8/156 (5%) Query: 141 LVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR 200 Sbjct: 1 MLFLLDSSGSVGDENFDKMKEFVKSIVLNFDVDNQLTRIGIIRFDSDAEIIIQLSDHKTL 60 Query: 201 DELLAAIKKIPYKGGN-TMTGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD---- 255 Sbjct: 61 KDLLNDIDSIRYNEGIQTRIDKALER-AMEAFSEKNGGRADATKALVLLADGQNSFIEGS 119 Query: 256 -EVEIPARELRNVGVEVFSLGI-KAADAKELKQIAS 289 Sbjct: 120 QDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIAT 155 Score = 141 bits (353), Expect = 1e-34 Identities = 39/156 (25%), Positives = 39/156 (25%), Gaps = 8/156 (5%) Query: 441 IVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKV 500 Sbjct: 1 MLFLLDSSGSVGDENFDKMKEFVKSIVLNFDVDNQLTRIGIIRFDSDAEIIIQLSDHKTL 60 Query: 501 EDIIEAINTFPYRGGS-TNTGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSD---- 555 Sbjct: 61 KDLLNDIDSIRYNEGIQTRIDKALERAME-AFSEKNGGRADATKALVLLADGQNSFIEGS 119 Query: 556 -AFRDPAIKLRNSDVEIFAVGV-KDAVRSELEAIAS 589 Sbjct: 120 QDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIAT 155 Score = 128 bits (319), Expect = 2e-30 Identities = 40/157 (25%), Positives = 40/157 (25%), Gaps = 9/157 (5%) Query: 2324 IVFLTDASWSIGDDNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYND 2383 Sbjct: 1 MLFLLDSSGSVGDENFDKMKEFVKSIVLNFD-VDNQLTRIGIIRFDSDAEIIIQLSDHKT 59 Query: 2384 KALALGALQNIRYRGGN-TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQD--- 2439 Sbjct: 60 LKDLLNDIDSIRYNEGIQTRIDKALERAME-AFSEKNGGRADATKALVLLADGQNSFIEG 118 Query: 2440 --EVKKAALVIQQSGFSVFVVGVA-DVDYNELANIAS 2473 Sbjct: 119 SQDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIAT 155 >gi|322088 [20..469] Periplasmic binding protein-like II Length = 450 Score = 31.1 bits (70), Expect = 0.27 Identities = 29/212 (13%), Positives = 29/212 (13%), Gaps = 34/212 (16%) Query: 1664 TEVTSEGFRGTWDHGASDVSLYRITWGP-FGSSDKMETILNG------------DENTLV 1710 Sbjct: 95 SDLTAEDFEYSWQRMVDPKTASEYAYLATESHVKNAEDINSGKNPDLDSLGVKADGNKVI 154 Query: 1711 FENLNPN-------TIYEVSITAIYADESESDDLIGSERTLPILTTQAPKSGPRNLQVYN 1763 Sbjct: 155 FTLTVPAPQFKSLLSFSNFVPQKESFVKDAGKD-YGTTSEKQIYS------GPYIVKDWN 207 Query: 1764 ATSNSLTVK-----WDPASGRVQK--YRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPD 1816 Sbjct: 208 GTSGTFKLVKNKNYWDAKNVKTETVNVQTVKKPDTAVQMYKQGKLDFANISGTSAIYNAN 267 Query: 1817 TPYTITVSSLYPDGEGGRMTGRGKTKPLNTVR 1848 Sbjct: 268 KKHKDVVPVLEATTAYIVYNQTGAIEGLNSLK 299 >gi|1072178 [17..364] Phosphoglycerate mutase-like Length = 348 Score = 30.4 bits (68), Expect = 0.39 Identities = 18/97 (18%), Positives = 18/97 (18%), Gaps = 14/97 (14%) Query: 1870 NPRQYKLFYAPAAGGPEELVPIPGNTNYAI---LRNLQPD---TSYTVTVVPVYTEGDGG 1923 Sbjct: 23 PGDDLNNVDYQQIAWPGELTKRGILEEFQLGQRLRKIYGEHFGDTYQPRDFHVYT-GKDN 81 Query: 1924 RTSDTGRTLMRGL--ARNVQVYNPTPNRLGVRWDPAP 1958 Sbjct: 82 RTSASAQAMFAGFLPPNEDQTWNYELK-----WQPVA 113 >gi|128081 [13..98] Neurophysin II Length = 86 Score = 30.1 bits (66), Expect = 0.61 Identities = 5/40 (12%), Positives = 5/40 (12%) Query: 2710 CSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPP 2749 Sbjct: 45 QEENYLPSPCQSGQKPCGSGGRCAANGVCCNDESCVIEPE 84 Score = 29.7 bits (65), Expect = 0.64 Identities = 9/55 (16%), Positives = 9/55 (16%), Gaps = 8/55 (14%) Query: 2710 CSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPGPPGPAGGPGAKGPR 2764 Sbjct: 13 CGPGGQG--RCF------GPSICCADALGCFVGTAEALRCQEENYLPSPCQSGQK 59 >gi|120277 [2..148] Flavodoxin-like Length = 147 Score = 29.5 bits (66), Expect = 0.72 Identities = 12/47 (25%), Positives = 12/47 (25%), Gaps = 2/47 (4%) Query: 2426 PKVLVVV--TDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELAN 2470 Sbjct: 1 PKALIVYGSTTGNTEYTAETIARELADAGYEVDSRDAASVEAGGLFE 47 >gi|951110 [5..279] ConA-like lectins/glucanases Length = 275 Score = 28.3 bits (63), Expect = 2.0 Identities = 6/29 (20%), Positives = 6/29 (20%) Query: 1763 NATSNSLTVKWDPASGRVQKYRITYQPST 1791 Sbjct: 176 NSIKSAATTKWERRNGQTLNVLVSYDANS 204 >gi|1055187 [23..334] Phosphoglycerate mutase-like Length = 312 Score = 28.0 bits (62), Expect = 2.1 Identities = 6/44 (13%), Positives = 6/44 (13%), Gaps = 1/44 (2%) Query: 1893 GNTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGL 1936 Sbjct: 9 GKVLRDRYVNSFVDARMLSSQLLFRS-SPVERCLMTLQTVGNTM 51 >gi|1001472 [191..704] N-terminal nucleophile aminohydrolases (Ntn hydrolases) Length = 514 Score = 28.1 bits (62), Expect = 2.4 Identities = 12/65 (18%), Positives = 12/65 (18%), Gaps = 4/65 (6%) Query: 834 WSGAPGKVKQYLVTYTPVAGGETQEVTVRGDTTNTVLQGL---KEGTQYALSVTALYASG 890 Sbjct: 89 WDGEVKSFDRRNNQVK-ILQGNGSYQTLQWEALESVTGIVVAQRGGNAYTLKIPGLDRSG 147 Query: 891 AGDAL 895 Sbjct: 148 AISQF 152 >gi|549660 [99..278] 7-bladed beta-propeller Length = 180 Score = 27.8 bits (61), Expect = 2.5 Identities = 7/49 (14%), Positives = 7/49 (14%), Gaps = 4/49 (8%) Query: 1119 VTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGESS 1167 Sbjct: 61 IAYDPSGLVF----ALGNPENFEIGLYNLKKIQEGPFLIIKINDATFSQ 105 >gi|1723074 [152..573] Periplasmic binding protein-like II Length = 422 Score = 27.6 bits (61), Expect = 3.1 Identities = 15/138 (10%), Positives = 15/138 (10%), Gaps = 21/138 (15%) Query: 1664 TEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMETILNGDENTLVFE------NLNPN 1717 Sbjct: 17 TPITWRDIASQIHAISGADKAFEI-ASSSGAERVASVTRGVDDRQAVVTFAKPYAEWRGM 75 Query: 1718 TIYEVSITAIYADESESDDLIGSERTLPILTTQAPKSGPRNLQVYNATSNSLTVK----- 1772 Sbjct: 76 FAGNGMLLPASMTATPEAFNKGQLDGPGPSA------GPFVVSALDRTAQRIVLTRNPRW 129 Query: 1773 WDPASGRVQKYRITYQPS 1790 Sbjct: 130 WG---ARPRLDSITYLVL 144 >gi|442637 [1..263] Trypsin-like serine proteases Length = 263 Score = 27.6 bits (60), Expect = 3.5 Identities = 15/41 (36%), Positives = 15/41 (36%) Query: 1486 GPTTMHVQWQPVGGATGYILSYKPVKDTEPTRPKEVRLGPT 1526 Sbjct: 174 GTTHLNVQWQPSGGVTEPGSSGSPIYSPEKRVLGQLHGGPS 214 >gi|2305210 [39..399] Serpins Length = 361 Score = 27.2 bits (60), Expect = 3.5 Identities = 14/139 (10%), Positives = 14/139 (10%), Gaps = 19/139 (13%) Query: 296 VFNVANFDAIVDIQNEIISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYVKLNW- 354 Sbjct: 113 LTQI-NFASPLQVKSIREIFIKWIKKSMNYGARNIASI---TSTQSLNLFNGIHFTDDWM 168 Query: 355 ---NPSPSPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMT 411 Sbjct: 169 YEFLPLNHILPFYS---------PRLRVTDM-PMMERTAKFPYYENQHMQAVSLPFKDSE 218 Query: 412 SSEPISIMEKTQPMKVQVE 430 Sbjct: 219 MQMLIILPKKTFDLA-KFE 236 >gi|1181604 [175..454] 7-bladed beta-propeller Length = 280 Score = 27.1 bits (59), Expect = 4.0 Identities = 14/154 (9%), Positives = 14/154 (9%), Gaps = 19/154 (12%) Query: 1117 YKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGES------SPLV 1170 Sbjct: 86 NSVCFADESGHL----IYSGSDDNLCKVWDRRCFNAKGKPAGILMGHLEGITFIDSRGDG 141 Query: 1171 GQEMTTLSDTTVMPI-LSSGMECLTRAEADIVLLVDGSWSIGRANFRTVRSFISRIVEVF 1229 Sbjct: 142 RYFISNGKDQTIKLWDIRKMSSNAGGTIQSRNSEWDYRWMEYPQEARDLK--------HP 193 Query: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLL 1263 Sbjct: 194 SDLSGATYKGHSVLCTLIRCYFSPDYSTGQKYIY 227 >gi|2497794 [4..293] Creatinase/methionine aminopeptidase Length = 290 Score = 27.2 bits (60), Expect = 4.1 Identities = 8/50 (16%), Positives = 8/50 (16%), Gaps = 2/50 (4%) Query: 1045 PTVTSTVLKRLQPQTTYDITVLPIYKMGEGKLRQGSGTTASRFKSPRNLK 1094 Sbjct: 167 PNVYERTNQYIDVGDLV--AIEPFATDGFGMVKDGNLGNIYKFLAKRPIR 214 >gi|227854 [61..330] Periplasmic binding protein-like I Length = 270 Score = 27.3 bits (60), Expect = 4.2 Identities = 14/144 (9%), Positives = 14/144 (9%), Gaps = 28/144 (19%) Query: 1301 RKIGVLITDGKSQ---DDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYN 1357 Sbjct: 2 RSIGLVIPDLENTSYTRIANYLERQARQRGYQLLIACSEDQPDNEMRCI----EHLLQRQ 57 Query: 1358 VADF---------ESLSRIVDDLTINLCNSVKGPGDLEAPSNLV------------ISER 1396 Sbjct: 58 VDAIIVSTSLPPEHPFYQRWANDPFPIVALDRALDREHFTSVVGADQDDAEMLAEELRKF 117 Query: 1397 THRSFRVSWTPPSDSVDRYKVEYY 1420 Sbjct: 118 PAETVLYLGALPELSVSFLREQGF 141 >gi|1652862 [45..566] Periplasmic binding protein-like II Length = 522 Score = 26.9 bits (59), Expect = 5.1 Identities = 22/140 (15%), Positives = 22/140 (15%), Gaps = 27/140 (19%) Query: 1664 TEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMETILNGDENTLVFENLNPNTIYEVS 1723 Sbjct: 81 QPFSAEDVAFTYKF-LSDPKTGATSTGTYEAIAKVEAL---DKNTVKIT-FKEPNPAWFL 135 Query: 1724 ITA----------IYADESESDDLIGSERTLPILTTQAPKSGPRNLQVYNATSNSLTVK- 1772 Sbjct: 136 PFVGSEGMILPQHIYKDFVGEKARQAPANLLPIGT------GPYRVTSFKPGDVVLYEVN 189 Query: 1773 ---WDPASGRVQKYRITYQP 1789 Sbjct: 190 PHYRDRKNIGFQQ--VEIKG 207 >gi|1044824 [27..332] Phosphoglycerate mutase-like Length = 306 Score = 26.8 bits (59), Expect = 5.8 Identities = 4/42 (9%), Positives = 4/42 (9%), Gaps = 1/42 (2%) Query: 1906 DTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLARNVQVYNPTP 1947 Sbjct: 59 DGNLKPRQMYWRS-VNKNRCLSSASTVGAAMFEDPSRHLYVP 99 >gi|2506629 [5..112] Flavodoxin-like Length = 108 Score = 26.4 bits (58), Expect = 6.1 Identities = 10/107 (9%), Positives = 10/107 (9%), Gaps = 11/107 (10%) Query: 2427 KVLVVV----TDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNEL--ANIASKPSERHV 2480 Sbjct: 1 RIYLVWAHPRHDSLTAHIADAIHQRAMERKIQVTELDLYRRNFNPVMTPEDEPDWKNMDK 60 Query: 2481 FIVDDFES-FEKIED-NLITFVCETATSSCPLI---YLDGYTSPGFK 2522 Sbjct: 61 RYSPEVHQLYSELLEHDTLVVVFPLWWYSFPAMLKGYIDRVWNNGLA 107 >gi|1170097 [1..85] Thioredoxin-like Length = 85 Score = 26.4 bits (58), Expect = 6.2 Identities = 12/61 (19%), Positives = 12/61 (19%) Query: 192 FNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVGFPKVAIIITDG 251 Sbjct: 8 WDIRGLAHAIRLLLEYTDSSYVEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNLPYLIDGA 67 Query: 252 K 252 Sbjct: 68 H 68 >gi|2494172 [22..295] DNase I-like Length = 274 Score = 26.5 bits (58), Expect = 6.2 Identities = 13/55 (23%), Positives = 13/55 (23%) Query: 1608 KEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPPVTAQETTRPVPAPTNLK 1662 Sbjct: 87 DAVSVVDTYLYPDPEDVFSREPFVVKFSAPGTGERAPPLPSRRALTPPPLPAAAQ 141 >gi|452523 [41..225] 7-bladed beta-propeller Length = 185 Score = 26.6 bits (58), Expect = 6.5 Identities = 10/59 (16%), Positives = 10/59 (16%), Gaps = 4/59 (6%) Query: 2217 WDTFCVKWSPHR---AATSYRLKLSPADGTRGQEI-TVRGSETSHCFTGLSPDTDYGVT 2271 Sbjct: 10 WETANNRLHQEYVPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAVGMSNQTDLLAL 68 >gi|2337942 [43..470] Periplasmic binding protein-like II Length = 428 Score = 26.5 bits (58), Expect = 7.4 Identities = 26/131 (19%), Positives = 26/131 (19%), Gaps = 14/131 (10%) Query: 432 SRGVDIKA-DIVF----LVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQIS-LVQYS 485 Sbjct: 58 SDGSPLTAEDVVFTYNKAAKSGGKIDMGNFSHARALDARRIEMTLSHPQSTFVNVLGSLG 117 Query: 486 RDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSNVPKVM 545 Sbjct: 118 IVPASRYDEKTFAR-----EPIGAGPYRLVSFQPGQQLIVEANPWY---AGKKNDFNRLV 169 Query: 546 ILITDGKSSDA 556 Sbjct: 170 FVFLDEDNAYA 180 >gi|482247 [22..143] TBP-like Length = 122 Score = 26.1 bits (57), Expect = 7.5 Identities = 13/47 (27%), Positives = 13/47 (27%), Gaps = 12/47 (25%) Query: 2421 MRKNVPKVL--------VVVTDGRSQDEVKKA----ALVIQQSGFSV 2455 Sbjct: 69 MRLRDPKTTALIFASGKMVCTGAKTEEDSNRAARKYAKIIQKIGFPV 115 >gi|1420859 [36..611] Periplasmic binding protein-like II Length = 576 Score = 26.1 bits (57), Expect = 9.0 Identities = 21/148 (14%), Positives = 21/148 (14%), Gaps = 29/148 (19%) Query: 1664 TEVTSEGFRGTWDHGASDVSLYRITWGPFGSSDKMETILNG--DENTLVFENLNPNTIYE 1721 Sbjct: 80 APVTAEDFVTGLKHAVDDKSDA--LYVVEDSIKNLKAYQNGEVDFKEVGVKALDDKTV-Q 136 Query: 1722 VSITAIYADESESDDLIGSERTLPILTTQAPK---------------SGPRNLQVYNATS 1766 Sbjct: 137 YTLNKPESYW---NSKTTYSVLFPVNAKFLKSKGKDFGTTDPSSILVNGAYFLSAFT-SK 192 Query: 1767 NSLTVK-----WDPASGRVQKYRITYQP 1789 Sbjct: 193 SSMEFHKNENYWDAKNVGIESVKLTYSD 220 >gi|2494764 [404..525] GMP synthetase, the C-terminal, dimerisation domain Length = 122 Score = 26.0 bits (57), Expect = 9.5 Identities = 14/63 (22%), Positives = 14/63 (22%), Gaps = 3/63 (4%) Query: 570 EIFAVGVKDAVRSELEAIASPPAETHVFTVEDFDAFQRISFELTQSICLRIEQELAAIKK 629 Sbjct: 50 DVRSVGVQGDGRTYGHPIVLRPVSSEDAMTAD---WTRVPYEVLERISTRITNEVAEVNR 106 Query: 630 KAY 632 Sbjct: 107 VVL 109 >gi|1765998 [434..643] ADP-ribosylation Length = 210 Score = 25.9 bits (56), Expect = 9.8 Identities = 10/21 (47%), Positives = 10/21 (47%) Query: 783 IPDTKYEVSVIPEYFSGPGTP 803 Sbjct: 188 IPDKEQAISALPDYASQPGKP 208 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 28 Number of calls to ALIGN: 38 Length of query: 3063 Total length of test sequences: 256703 Effective length of test sequences: 201489.0 Effective search space size: 607713701.3 Initial X dropoff for ALIGN: 25.0 bits