analysis of sequence from tem30 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGDVYKCPVGRGESLPCVKLDLP VNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVVNSIAPVQECS TQLDIVIVLDGSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAK KIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCEDENIQRFSIA ILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMS QTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMGT EKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCGARFGTAIAAVKDLNLDGFND IVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGKTLKFFGQSIHGEMDLNGDGLTDVTIGGLGG AALFWSRDVAVVKVTMNFEPNKVNIQKKNCHMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDS LRQISRSFFSGTQERKVQRNITVRKSECTKHSFYMLDKHDFQDSVRITLDFNLTDPENGPVLDDSLPNSV HEYIPFAKDCGNKEKCISDLSLHVATTEKDLLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEEPPETLSDNVV NISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTEDIGNEINIFYLIRKSGSFPMPELKLSISFPN MTSNGYPVLYPTGLSSSENANCRPHIFEDPFSINSGKKMTTSTDHLKRGTILDCNTCKFATITCNLTSSD ISQVNVSLILWKPTFIKSYFSSLNLTIRGELRSENASLVLSSSNQKRELAIQISKDGLPGRVPLWVILLS AFAGLLLLMLLILALWKIGFFKRPLKKKMEK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > tem30_gi|2829468|sp|P56199|ITA1_HUMAN . . . . . 1 FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGD 50 _________________EEEHHHHHHH_____EEEE______________ . . . . . 51 VYKCPVGRGESLPCVKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFL 100 ____________EEEEEE_________EEEEE__________________ . . . . . 101 ACGPLYAYRCGHLHYTTGICSDVSPTFQVVNSIAPVQECSTQLDIVIVLD 150 _____EEE__________________EEEEEEEEE_______EEEEEEE_ . . . . . 151 GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYS 200 _____EEE___HHHHHHHHHHH______EEEEEEE_______________ . . . . . 201 STEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVI 250 _HHHHHHHHHHHHH______________HHHHHHHHHHHHH____EEEEE . . . . . 251 VTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIK 300 E________HHHHHHHHH_________EEEEE________HHHHHHHHHH . . . . . 301 SIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMS 350 HH_______EEEEE__HHHHHHHHHHHHHHHHHHHHHH___HHHHHHHHH . . . . . 351 QTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKK 400 __________EEEEEEE__EEE__EEEEEE____EEE_____EEE_____ . . . . . 401 NEPLASYLGYTVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKIL 450 ____________________EEEEEE_______EEEEEEEEE____EEEE . . . . . 451 QTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMGTEKEEQGKVYV 500 EEE____HHHHHHHEEE___________EEEE____EEE_______EEEE . . . . . 501 YALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCGARFGTAIAAV 550 EEE__HHHHHHHH____EEE_______________________HHHHHHH . . . . . 551 KDLNLDGFNDIVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGK 600 HH_______EEEE__________EEEEE_____HHHHHHHH________E . . . . . 601 TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVAVVKVTMNFEP 650 EEEEEE______________EEEE________EEE___EEEEEEEE____ . . . . . 651 NKVNIQKKNCHMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDS 700 __EEEEE________EEEEEE_______HHHHHH__HHHHHHHHHHHHHH . . . . . 701 LRQISRSFFSGTQERKVQRNITVRKSECTKHSFYMLDKHDFQDSVRITLD 750 HHHHH________HHHHHHHHEEE________EEEE________EEEE__ . . . . . 751 FNLTDPENGPVLDDSLPNSVHEYIPFAKDCGNKEKCISDLSLHVATTEKD 800 ____________________EEEEEE__________HHHHHHHEEE___E . . . . . 801 LLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFSGIEAIQKDSC 850 EEEEE_____EEEEEEE__________EEEE_____EEE__EEE______ . . . . . 851 ESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEE 900 _____EEEEE_________EEEEEEEE____EEEE__EEEEE________ . . . . . 901 PPETLSDNVVNISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTED 950 ________EEEEEEEEEEEE_EEEE________________EEEEE____ . . . . . 951 IGNEINIFYLIRKSGSFPMPELKLSISFPNMTSNGYPVLYPTGLSSSENA 1000 ______EEEEEE__________EEE___________EEEE__________ . . . . . 1001 NCRPHIFEDPFSINSGKKMTTSTDHLKRGTILDCNTCKFATITCNLTSSD 1050 _____EEE_________EEEE________________EEEEEEE______ . . . . . 1051 ISQVNVSLILWKPTFIKSYFSSLNLTIRGELRSENASLVLSSSNQKRELA 1100 _______EEEE____EEEEEEEHHHHHHHHHHH____EEEE___HHHHHH . . . . . 1101 IQISKDGLPGRVPLWVILLSAFAGLLLLMLLILALWKIGFFKRPLKKKME 1150 HHHH_________HHHHHHHHHHHHHHHHHHHHHHHHHH___________ 1151 K 1151 _ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 0.3 % beta-contents : 46.1 % coil-contents : 53.6 % class : beta method : 2 alpha-contents : 2.3 % beta-contents : 37.5 % coil-contents : 60.2 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -17.78 -3.78 -1.69 -2.71 -4.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 -29.96 3.73 -1.48 -1.23 -3.59 0.00 0.00 -12.00 -0.09 -0.04 0.00 0.00 -12.00 0.00 0.00 0.00 0.00 -26.70 ID: tem30_gi|2829468|sp|P56199|ITA1_HUMAN AC: xxx Len: 1120 1:I 1106 Sc: -26.70 Pv: 6.229164e-02 NO_GPI_SITE GPI: learning from protozoa -15.96 -2.84 -2.55 -4.00 -4.00 0.00 -4.00 -1.15 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 -34.49 -17.60 -4.45 -1.69 -0.27 -4.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 -28.00 ID: tem30_gi|2829468|sp|P56199|ITA1_HUMAN AC: xxx Len: 1120 1:N 1103 Sc: -28.00 Pv: 4.767807e-02 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? tem30_gi|28 0.552 1137 Y 0.677 1142 Y 0.997 1123 Y 0.082 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? tem30_gi|28 0.567 108 Y 0.384 1135 Y 0.974 1125 Y 0.128 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? tem30_gi|28 0.622 339 Y 0.403 641 Y 0.983 1127 Y 0.142 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) 1-1117 FNVDVKNSMTFSGPVEDMFGYTVQQYENEE GKWVLIGSPLVGQPKNRTGDVYKCPVGRGE SLPCVKLDLPVNTSIPNVTEVKENMTFGST LVTNPNGGFLACGPLYAYRCGHLHYTTGIC SDVSPTFQVVNSIAPVQECSTQLDIVIVLD GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQ VGIVQYGENVTHEFNLNKYSSTEEVLVAAK KIVQRGGRQTMTALGTDTARKEAFTEARGA RRGVKKVMVIVTDGESHDNHRLKKVIQDCE DENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGER IFALEATADQSAASFEMEMSQTGFSAHYSQ DWVMLGAVGAYDWNGTVVMQKASQIIIPRN TTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKIL QTLSGEQIGSYFGSILTTTDIDKDSNTDIL LVGAPMYMGTEKEEQGKVYVYALNQTRFEY QMSLEPIKQTCCSSRQHNSCTTENKNEPCG ARFGTAIAAVKDLNLDGFNDIVIGAPLEDD HGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGG AALFWSRDVAVVKVTMNFEPNKVNIQKKNC HMEGKETVCINATVCFEVKLKSKEDTIYEA DLQYRVTLDSLRQISRSFFSGTQERKVQRN ITVRKSECTKHSFYMLDKHDFQDSVRITLD FNLTDPENGPVLDDSLPNSVHEYIPFAKDC GNKEKCISDLSLHVATTEKDLLIVRSQNDK FNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEM VTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASE YHISIAANETVPEVINSTEDIGNEINIFYL IRKSGSFPMPELKLSISFPNMTSNGYPVLY PTGLSSSENANCRPHIFEDPFSINSGKKMT TSTDHLKRGTILDCNTCKFATITCNLTSSD ISQVNVSLILWKPTFIKSYFSSLNLTIRGE LRSENASLVLSSSNQKRELAIQISKDGLPG RVPLWVI llsafagllllmllilal 1118-1135 1136-1151 WKIGFFKRPLKKKMEK low complexity regions: SEG 25 3.0 3.3 >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) 1-1107 FNVDVKNSMTFSGPVEDMFGYTVQQYENEE GKWVLIGSPLVGQPKNRTGDVYKCPVGRGE SLPCVKLDLPVNTSIPNVTEVKENMTFGST LVTNPNGGFLACGPLYAYRCGHLHYTTGIC SDVSPTFQVVNSIAPVQECSTQLDIVIVLD GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQ VGIVQYGENVTHEFNLNKYSSTEEVLVAAK KIVQRGGRQTMTALGTDTARKEAFTEARGA RRGVKKVMVIVTDGESHDNHRLKKVIQDCE DENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGER IFALEATADQSAASFEMEMSQTGFSAHYSQ DWVMLGAVGAYDWNGTVVMQKASQIIIPRN TTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKIL QTLSGEQIGSYFGSILTTTDIDKDSNTDIL LVGAPMYMGTEKEEQGKVYVYALNQTRFEY QMSLEPIKQTCCSSRQHNSCTTENKNEPCG ARFGTAIAAVKDLNLDGFNDIVIGAPLEDD HGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGG AALFWSRDVAVVKVTMNFEPNKVNIQKKNC HMEGKETVCINATVCFEVKLKSKEDTIYEA DLQYRVTLDSLRQISRSFFSGTQERKVQRN ITVRKSECTKHSFYMLDKHDFQDSVRITLD FNLTDPENGPVLDDSLPNSVHEYIPFAKDC GNKEKCISDLSLHVATTEKDLLIVRSQNDK FNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEM VTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASE YHISIAANETVPEVINSTEDIGNEINIFYL IRKSGSFPMPELKLSISFPNMTSNGYPVLY PTGLSSSENANCRPHIFEDPFSINSGKKMT TSTDHLKRGTILDCNTCKFATITCNLTSSD ISQVNVSLILWKPTFIKSYFSSLNLTIRGE LRSENASLVLSSSNQKRELAIQISKDG lpgrvplwvillsafagllllmllilalwk 1108-1149 igffkrplkkkm 1150-1151 EK low complexity regions: SEG 45 3.4 3.75 >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) 1-1044 FNVDVKNSMTFSGPVEDMFGYTVQQYENEE GKWVLIGSPLVGQPKNRTGDVYKCPVGRGE SLPCVKLDLPVNTSIPNVTEVKENMTFGST LVTNPNGGFLACGPLYAYRCGHLHYTTGIC SDVSPTFQVVNSIAPVQECSTQLDIVIVLD GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQ VGIVQYGENVTHEFNLNKYSSTEEVLVAAK KIVQRGGRQTMTALGTDTARKEAFTEARGA RRGVKKVMVIVTDGESHDNHRLKKVIQDCE DENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGER IFALEATADQSAASFEMEMSQTGFSAHYSQ DWVMLGAVGAYDWNGTVVMQKASQIIIPRN TTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKIL QTLSGEQIGSYFGSILTTTDIDKDSNTDIL LVGAPMYMGTEKEEQGKVYVYALNQTRFEY QMSLEPIKQTCCSSRQHNSCTTENKNEPCG ARFGTAIAAVKDLNLDGFNDIVIGAPLEDD HGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGG AALFWSRDVAVVKVTMNFEPNKVNIQKKNC HMEGKETVCINATVCFEVKLKSKEDTIYEA DLQYRVTLDSLRQISRSFFSGTQERKVQRN ITVRKSECTKHSFYMLDKHDFQDSVRITLD FNLTDPENGPVLDDSLPNSVHEYIPFAKDC GNKEKCISDLSLHVATTEKDLLIVRSQNDK FNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEM VTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASE YHISIAANETVPEVINSTEDIGNEINIFYL IRKSGSFPMPELKLSISFPNMTSNGYPVLY PTGLSSSENANCRPHIFEDPFSINSGKKMT TSTDHLKRGTILDCNTCKFATITC nltssdisqvnvslilwkptfiksyfssln 1045-1148 ltirgelrsenaslvlsssnqkrelaiqis kdglpgrvplwvillsafagllllmllila lwkigffkrplkkk 1149-1151 MEK low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGDVYKCPVGRGE SLPCVKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGIC SDVSPTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPWDSVTAFLNDLLKRMDIGPKQTQ VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGA RRGVKKVMVIVTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMSQTGFSAHYSQ DWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDIL LVGAPMYMGTEKEEQGKVYVYALNQTRFEYQMSLEPIKQTccssrqhnscttenknePCG ARFGTAIAAVKDLNLDGFNDIVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVAVVKVTMNFEPNKVNIQKKNC HMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDSLRQISRSFFSGTQERKVQRN ITVRKSECTKHSFYMLDKHDFQDSVRITLDFNLTDPENGPVLDDSLPNSVHEYIPFAKDC GNKEKCISDLSLHVATTEKDLLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTEDIGNEINIFYL IRKSGSFPMPELKLSISFPNMTSNGYPVLYPTGLSSSENANCRPHIFEDPFSINSGKKMT TSTDHLKRGTILDCNTCKFATITCNLTSSDISQVNVSLILWKPTFIKSYFSSLNLTIRGE LRSENASLVLSSSNQKRELAIQISKDGLPGRVPLWVILLSAFAGllllmllilalWKIGF FKRPLKKKMEK 1 - 520 FNVDVKNSMT FSGPVEDMFG YTVQQYENEE GKWVLIGSPL VGQPKNRTGD VYKCPVGRGE SLPCVKLDLP VNTSIPNVTE VKENMTFGST LVTNPNGGFL ACGPLYAYRC GHLHYTTGIC SDVSPTFQVV NSIAPVQECS TQLDIVIVLD GSNSIYPWDS VTAFLNDLLK RMDIGPKQTQ VGIVQYGENV THEFNLNKYS STEEVLVAAK KIVQRGGRQT MTALGTDTAR KEAFTEARGA RRGVKKVMVI VTDGESHDNH RLKKVIQDCE DENIQRFSIA ILGSYNRGNL STEKFVEEIK SIASEPTEKH FFNVSDELAL VTIVKTLGER IFALEATADQ SAASFEMEMS QTGFSAHYSQ DWVMLGAVGA YDWNGTVVMQ KASQIIIPRN TTFNVESTKK NEPLASYLGY TVNSATASSG DVLYIAGQPR YNHTGQVIIY RMEDGNIKIL QTLSGEQIGS YFGSILTTTD IDKDSNTDIL LVGAPMYMGT EKEEQGKVYV YALNQTRFEY QMSLEPIKQT 521 - 537 ccssrqhnsc ttenkne 538 - 1124 PCG ARFGTAIAAV KDLNLDGFND IVIGAPLEDD HGGAVYIYHG SGKTIRKEYA QRIPSGG DGK TLKFFGQSIH GEMDLNGDGL TDVTIGGLGG AALFWSRDVA VVKVTMNFEP NKVNIQK KNC HMEGKETVCI NATVCFEVKL KSKEDTIYEA DLQYRVTLDS LRQISRSFFS GTQERKV QRN ITVRKSECTK HSFYMLDKHD FQDSVRITLD FNLTDPENGP VLDDSLPNSV HEYIPFA KDC GNKEKCISDL SLHVATTEKD LLIVRSQNDK FNVSLTVKNT KDSAYNTRTI VHYSPNL VFS GIEAIQKDSC ESNHNITCKV GYPFLRRGEM VTFKILFQFN TSYLMENVTI YLSATSD SEE PPETLSDNVV NISIPVKYEV GLQFYSSASE YHISIAANET VPEVINSTED IGNEINI FYL IRKSGSFPMP ELKLSISFPN MTSNGYPVLY PTGLSSSENA NCRPHIFEDP FSINSGK KMT TSTDHLKRGT ILDCNTCKFA TITCNLTSSD ISQVNVSLIL WKPTFIKSYF SSLNLTI RGE LRSENASLVL SSSNQKRELA IQISKDGLPG RVPLWVILLS AFAG 1125 - 1135 llllml lilal 1136 - 1151 WKIGF FKRPLKKKME K low complexity regions: DUST >tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGDVYKCPVGRGE SLPCVKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGIC SDVSPTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPWDSVTAFLNDLLKRMDIGPKQTQ VGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGA RRGVKKVMVIVTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMSQTGFSAHYSQ DWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKKNEPLASYLGYTVNSATASSG DVLYIAGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDIL LVGAPMYMGTEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCG ARFGTAIAAVKDLNLDGFNDIVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVAVVKVTMNFEPNKVNIQKKNC HMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDSLRQISRSFFSGTQERKVQRN ITVRKSECTKHSFYMLDKHDFQDSVRITLDFNLTDPENGPVLDDSLPNSVHEYIPFAKDC GNKEKCISDLSLHVATTEKDLLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFS GIEAIQKDSCESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTEDIGNEINIFYL IRKSGSFPMPELKLSISFPNMTSNGYPVLYPTGLSSSENANCRPHIFEDPFSINSGKKMT TSTDHLKRGTILDCNTCKFATITCNLTSSDISQVNVSLILWKPTFIKSYFSSLNLTIRGE LRSENASLVLSSSNQKRELAIQISKDGLPGRVPLWVILLSAFAGLLLLMLLILALWKIGF FKRPLKKKMEK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for tem30_gi|2829468|sp|P56199|ITA1_HUMAN sequence: 1120 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 FNVDVKNSMT FSGPVEDMFG YTVQQYENEE GKWVLIGSPL VGQPKNRTGD VYKCPVGRGE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 SLPCVKLDLP VNTSIPNVTE VKENMTFGST LVTNPNGGFL ACGPLYAYRC GHLHYTTGIC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 SDVSPTFQVV NSIAPVQECS TQLDIVIVLD GSNSIYPWDS VTAFLNDLLK RMDIGPKQTQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 VGIVQYGENV THEFNLNKYS STEEVLVAAK KIVQRGGRQT MTALGTDTAR KEAFTEARGA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 RRGVKKVMVI VTDGESHDNH RLKKVIQDCE DENIQRFSIA ILGSYNRGNL STEKFVEEIK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~144 4444444444 44422~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 SIASEPTEKH FFNVSDELAL VTIVKTLGER IFALEATADQ SAASFEMEMS QTGFSAHYSQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~12 2222222222 2222222222 ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 DWVMLGAVGA YDWNGTVVMQ KASQIIIPRN TTFNVESTKK NEPLASYLGY TVNSATASSG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 DVLYIAGQPR YNHTGQVIIY RMEDGNIKIL QTLSGEQIGS YFGSILTTTD IDKDSNTDIL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 LVGAPMYMGT EKEEQGKVYV YALNQTRFEY QMSLEPIKQT CCSSRQHNSC TTENKNEPCG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 ARFGTAIAAV KDLNLDGFND IVIGAPLEDD HGGAVYIYHG SGKTIRKEYA QRIPSGGDGK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 TLKFFGQSIH GEMDLNGDGL TDVTIGGLGG AALFWSRDVA VVKVTMNFEP NKVNIQKKNC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 720 HMEGKETVCI NATVCFEVKL KSKEDTIYEA DLQYRVTLDS LRQISRSFFS GTQERKVQRN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 780 ITVRKSECTK HSFYMLDKHD FQDSVRITLD FNLTDPENGP VLDDSLPNSV HEYIPFAKDC ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 840 GNKEKCISDL SLHVATTEKD LLIVRSQNDK FNVSLTVKNT KDSAYNTRTI VHYSPNLVFS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 900 GIEAIQKDSC ESNHNITCKV GYPFLRRGEM VTFKILFQFN TSYLMENVTI YLSATSDSEE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 960 PPETLSDNVV NISIPVKYEV GLQFYSSASE YHISIAANET VPEVINSTED IGNEINIFYL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1020 IRKSGSFPMP ELKLSISFPN MTSNGYPVLY PTGLSSSENA NCRPHIFEDP FSINSGKKMT ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 1080 TSTDHLKRGT ILDCNTCKFA TITCNLTSSD ISQVNVSLIL WKPTFIKSYF SSLNLTIRGE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~1 * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------c * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~1 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | LRSENASLVL SSSNQKRELA IQISKDGLPG RVPLWVILLS 1111111111 1111111111 ~~~~~~~~~~ ~~~~~~~~~~ defgabcdef gabcdefgab ---------- ---------- 1111111111 1111111111 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem30.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem30.___inter___ (1 sequences) FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGD VYKCPVGRGESLPCVKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFL ACGPLYAYRCGHLHYTTGICSDVSPTFQVVNSIAPVQECSTQLDIVIVLD GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYS STEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVI VTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMS QTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKK NEPLASYLGYTVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKIL QTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMGTEKEEQGKVYV YALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCGARFGTAIAAV KDLNLDGFNDIVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVAVVKVTMNFEP NKVNIQKKNCHMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDS LRQISRSFFSGTQERKVQRNITVRKSECTKHSFYMLDKHDFQDSVRITLD FNLTDPENGPVLDDSLPNSVHEYIPFAKDCGNKEKCISDLSLHVATTEKD LLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFSGIEAIQKDSC ESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTED IGNEINIFYLIRKSGSFPMPELKLSISFPNMTSNGYPVLYPTGLSSSENA NCRPHIFEDPFSINSGKKMTTSTDHLKRGTILDCNTCKFATITCNLTSSD ISQVNVSLILWKPTFIKSYFSSLNLTIRGELRSENASLVLSSSNQKRELA IQISKDGLPGRVPLWVILLSAFAGLLLLMLLILALWKIGFFKRPLKKKME K (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 4 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 85 105 0.878 Putative 2 617 637 0.754 Putative 3 1116 1136 2.490 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 84 511 478 15 K+R profile + + + 7.00 CYT-EXT prof 1.60 0.74 0.81 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.27 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -3.00 (NEG-POS)/(NEG+POS): 0.1579 NEG: 11.0000 POS: 8.0000 -> Orientation: N-out CYT-EXT difference: 1.54 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 Loop length 616 478 15 K+R profile + 7.00 + CYT-EXT prof 1.14 - 0.74 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.39 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.1385 NEG: 74.0000 POS: 56.0000 -> Orientation: N-out CYT-EXT difference: 0.40 -> Orientation: N-out ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 Loop length 84 1010 15 K+R profile + 7.00 + CYT-EXT prof 1.60 - 0.84 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.70 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -3.00 (NEG-POS)/(NEG+POS): 0.1579 NEG: 11.0000 POS: 8.0000 -> Orientation: N-out CYT-EXT difference: 0.76 -> Orientation: N-out ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 Loop length 1115 15 K+R profile + 7.00 CYT-EXT prof 1.00 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -4.00 (NEG-POS)/(NEG+POS): 0.0962 NEG: 131.0000 POS: 108.0000 -> Orientation: N-out CYT-EXT difference: 1.00 -> Orientation: N-out ---------------------------------------------------------------------- "tem30" 1151 85 105 #f 0.878125 617 637 #f 0.754167 1116 1136 #t 2.48958 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem30.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem30.___inter___ (1 sequences) FNVDVKNSMTFSGPVEDMFGYTVQQYENEEGKWVLIGSPLVGQPKNRTGD VYKCPVGRGESLPCVKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFL ACGPLYAYRCGHLHYTTGICSDVSPTFQVVNSIAPVQECSTQLDIVIVLD GSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYS STEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVI VTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIK SIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQSAASFEMEMS QTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIIIPRNTTFNVESTKK NEPLASYLGYTVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKIL QTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMGTEKEEQGKVYV YALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCGARFGTAIAAV KDLNLDGFNDIVIGAPLEDDHGGAVYIYHGSGKTIRKEYAQRIPSGGDGK TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVAVVKVTMNFEP NKVNIQKKNCHMEGKETVCINATVCFEVKLKSKEDTIYEADLQYRVTLDS LRQISRSFFSGTQERKVQRNITVRKSECTKHSFYMLDKHDFQDSVRITLD FNLTDPENGPVLDDSLPNSVHEYIPFAKDCGNKEKCISDLSLHVATTEKD LLIVRSQNDKFNVSLTVKNTKDSAYNTRTIVHYSPNLVFSGIEAIQKDSC ESNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYLSATSDSEE PPETLSDNVVNISIPVKYEVGLQFYSSASEYHISIAANETVPEVINSTED IGNEINIFYLIRKSGSFPMPELKLSISFPNMTSNGYPVLYPTGLSSSENA NCRPHIFEDPFSINSGKKMTTSTDHLKRGTILDCNTCKFATITCNLTSSD ISQVNVSLILWKPTFIKSYFSSLNLTIRGELRSENASLVLSSSNQKRELA IQISKDGLPGRVPLWVILLSAFAGLLLLMLLILALWKIGFFKRPLKKKME K (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 4 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 85 105 0.878 Putative 2 617 637 0.754 Putative 3 1116 1136 2.490 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 84 511 478 15 K+R profile + + + 7.00 CYT-EXT prof 1.60 0.74 0.81 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 0.27 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -3.00 (NEG-POS)/(NEG+POS): 0.1579 NEG: 11.0000 POS: 8.0000 -> Orientation: N-out CYT-EXT difference: 1.54 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 Loop length 616 478 15 K+R profile + 7.00 + CYT-EXT prof 1.14 - 0.74 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.39 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -1.00 (NEG-POS)/(NEG+POS): 0.1385 NEG: 74.0000 POS: 56.0000 -> Orientation: N-out CYT-EXT difference: 0.40 -> Orientation: N-out ---------------------------------------------------------------------- Structure 3 Transmembrane segments included in this structure: Segment 1 3 Loop length 84 1010 15 K+R profile + 7.00 + CYT-EXT prof 1.60 - 0.84 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 7.00 Tm probability: 0.70 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): -3.00 (NEG-POS)/(NEG+POS): 0.1579 NEG: 11.0000 POS: 8.0000 -> Orientation: N-out CYT-EXT difference: 0.76 -> Orientation: N-out ---------------------------------------------------------------------- Structure 4 Transmembrane segments included in this structure: Segment 3 Loop length 1115 15 K+R profile + 7.00 CYT-EXT prof 1.00 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -7.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): -4.00 (NEG-POS)/(NEG+POS): 0.0962 NEG: 131.0000 POS: 108.0000 -> Orientation: N-out CYT-EXT difference: 1.00 -> Orientation: N-out ---------------------------------------------------------------------- "tem30" 1151 85 105 #f 0.878125 617 637 #f 0.754167 1116 1136 #t 2.48958 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem30.___saps___ SAPS. Version of April 11, 1996. Date run: Tue Oct 31 15:44:49 2000 File: /people/maria/tem30.___saps___ ID tem30_gi|2829468|sp|P56199|ITA1_HUMAN DE INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) number of residues: 1151; molecular weight: 127.8 kdal 1 FNVDVKNSMT FSGPVEDMFG YTVQQYENEE GKWVLIGSPL VGQPKNRTGD VYKCPVGRGE 61 SLPCVKLDLP VNTSIPNVTE VKENMTFGST LVTNPNGGFL ACGPLYAYRC GHLHYTTGIC 121 SDVSPTFQVV NSIAPVQECS TQLDIVIVLD GSNSIYPWDS VTAFLNDLLK RMDIGPKQTQ 181 VGIVQYGENV THEFNLNKYS STEEVLVAAK KIVQRGGRQT MTALGTDTAR KEAFTEARGA 241 RRGVKKVMVI VTDGESHDNH RLKKVIQDCE DENIQRFSIA ILGSYNRGNL STEKFVEEIK 301 SIASEPTEKH FFNVSDELAL VTIVKTLGER IFALEATADQ SAASFEMEMS QTGFSAHYSQ 361 DWVMLGAVGA YDWNGTVVMQ KASQIIIPRN TTFNVESTKK NEPLASYLGY TVNSATASSG 421 DVLYIAGQPR YNHTGQVIIY RMEDGNIKIL QTLSGEQIGS YFGSILTTTD IDKDSNTDIL 481 LVGAPMYMGT EKEEQGKVYV YALNQTRFEY QMSLEPIKQT CCSSRQHNSC TTENKNEPCG 541 ARFGTAIAAV KDLNLDGFND IVIGAPLEDD HGGAVYIYHG SGKTIRKEYA QRIPSGGDGK 601 TLKFFGQSIH GEMDLNGDGL TDVTIGGLGG AALFWSRDVA VVKVTMNFEP NKVNIQKKNC 661 HMEGKETVCI NATVCFEVKL KSKEDTIYEA DLQYRVTLDS LRQISRSFFS GTQERKVQRN 721 ITVRKSECTK HSFYMLDKHD FQDSVRITLD FNLTDPENGP VLDDSLPNSV HEYIPFAKDC 781 GNKEKCISDL SLHVATTEKD LLIVRSQNDK FNVSLTVKNT KDSAYNTRTI VHYSPNLVFS 841 GIEAIQKDSC ESNHNITCKV GYPFLRRGEM VTFKILFQFN TSYLMENVTI YLSATSDSEE 901 PPETLSDNVV NISIPVKYEV GLQFYSSASE YHISIAANET VPEVINSTED IGNEINIFYL 961 IRKSGSFPMP ELKLSISFPN MTSNGYPVLY PTGLSSSENA NCRPHIFEDP FSINSGKKMT 1021 TSTDHLKRGT ILDCNTCKFA TITCNLTSSD ISQVNVSLIL WKPTFIKSYF SSLNLTIRGE 1081 LRSENASLVL SSSNQKRELA IQISKDGLPG RVPLWVILLS AFAGLLLLML LILALWKIGF 1141 FKRPLKKKME K -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 56( 4.9%); C : 23( 2.0%); D : 59( 5.1%); E : 73( 6.3%); F : 50( 4.3%) G : 80( 7.0%); H : 22( 1.9%); I : 78( 6.8%); K : 73( 6.3%); L : 93( 8.1%) M : 25( 2.2%); N : 71( 6.2%); P : 44( 3.8%); Q : 42( 3.6%); R : 42( 3.6%) S :100( 8.7%); T : 86( 7.5%); V : 85( 7.4%); W : 8( 0.7%); Y : 41( 3.6%) KR : 115 ( 10.0%); ED : 132 ( 11.5%); AGP : 180 ( 15.6%); KRED : 247 ( 21.5%); KR-ED : -17 ( -1.5%); FIKMNY : 338 ( 29.4%); LVIFM : 331 ( 28.8%); ST : 186 ( 16.2%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 000-0+0000 00000--000 000000-0-- 0+00000000 0000+0+00- 00+0000+0- 61 00000+0-00 000000000- 0+-0000000 0000000000 00000000+0 0000000000 121 0-00000000 0000000-00 000-00000- 00000000-0 000000-00+ +0-000+000 181 0000000-00 00-0000+00 00--00000+ +000+00+00 000000-00+ +-000-0+00 241 ++00++0000 00-0-00-00 +0++000-0- --000+0000 000000+000 00-+00--0+ 301 0000-00-+0 00000--000 0000+000-+ 0000-000-0 00000-0-00 0000000000 361 -000000000 0-00000000 +0000000+0 00000-00++ 0-00000000 0000000000 421 -00000000+ 0000000000 +0--000+00 00000-0000 000000000- 0-+-000-00 481 0000000000 -+--00+000 000000+0-0 0000-00+00 0000+00000 00-0+0-000 541 0+00000000 +-000-000- 0000000--- 0000000000 00+00++-00 0+00000-0+ 601 00+0000000 0-0-000-00 0-00000000 000000+-00 00+00000-0 0+0000++00 661 00-0+-0000 000000-0+0 +0+--000-0 -000+000-0 0+000+0000 000-++00+0 721 000++0-00+ 000000-+0- 00-00+000- 0000-0-000 00--000000 0-00000+-0 781 00+-+000-0 0000000-+- 0000+000-+ 0000000+00 +-00000+00 0000000000 841 00-000+-00 -0000000+0 00000++0-0 000+000000 00000-0000 000000-0-- 901 00-000-000 000000+0-0 000000000- 00000000-0 00-00000-- 000-000000 961 0++0000000 -0+0000000 0000000000 0000000-00 00+0000--0 000000++00 1021 000-00++00 00-0000+00 000000000- 0000000000 0+0000+000 0000000+0- 1081 0+0-000000 00000++-00 0000+-0000 +000000000 0000000000 000000+000 1141 0++00+++0- + A. CHARGE CLUSTERS. Positive charge clusters (cmin = 10/30 or 13/45 or 16/60): none Negative charge clusters (cmin = 11/30 or 14/45 or 17/60): none Mixed charge clusters (cmin = 16/30 or 22/45 or 27/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 42 | 10 | 10 | 14 | 12 | 13 | 16 | 7 | 9 | lmin1 6 | 7 | 9 | 50 | 12 | 13 | 16 | 15 | 15 | 20 | 8 | 11 | lmin2 8 | 8 | 11 | 56 | 14 | 14 | 18 | 17 | 17 | 22 | 10 | 12 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 3: 1, at 1146; - runs >= 4: 0 * runs >= 5: 0 0 runs >= 28: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. __________________________________ High scoring hydrophobic segments: 2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ) -8.00 (KEDR) Expected score/letter: -2.115 M_0.01= 25.97; M_0.05= 21.93 1) From 1114 to 1136: length= 23, score=34.00 ** 1114 LWVILLSAFA GLLLLMLLIL ALW L: 11(47.8%); A: 3(13.0%); ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -3.689 M_0.01= 64.72; M_0.05= 54.27; M_0.30= 41.83 1) From 1114 to 1135: length= 22, score=82.00 ** 1114 LWVILLSAFA GLLLLMLLIL AL L: 11(50.0%); A: 3(13.6%); 2. SPACINGS OF C. H2N-53-C-9-C-37-C-7-C-9-C-18-C-129-C-251-C-C-7-C-8-C-120-C-8-C-5-C-52-C-51-C-5-C-63-C-7-C-143-C-31-C-2-C-6-C-107-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-53-C-9-C-37-C-7-C-1-H-1-H-5-C-18-C-52-H-64-H-2-H-8-C-40-H-46-H-75-H-87-C-C-4-H-2-C-8-C-31-H-7-H-30-H-49-C-H-7-C-5-C-52-C-2-H-7-H-31-H-8-C-5-C-6-H-38-H-17-C-3-H-3-C-73-H-69-C-2-H-19-H-8-C-2-C-6-C-107-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 60 (Expected range: 41-- 88) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 17 (6-10) 9 (11-20) 14 (>=21) 21 3. Clusters of amino acid multiplets (cmin = 10/30 or 13/45 or 15/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 32 (Expected range: 9-- 39) 18 +plets (f+: 10.0%), 14 -plets (f-: 11.5%) Total number of charge altplets: 23 (Critical number: 43) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 3 (6-10) 6 (11-20) 7 (>=21) 17 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 1081-1143 9 L........ 6 4 1 1125-1131 1 L 6 4 1 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 7) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10) Location Period Element Copies Core Errors 1081-1143 9 i........ 7 7 0 1116-1133 3 i.. 6 6 0 1125-1133 1 i 9 9 0 1126-1143 3 i.. 6 6 0 ----------------------------- --------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 0- 2 (1.) N( 2)N 72 of 72 0.0001 large minimal spacing 94- 96 (1.) N( 2)N 67 of 72 0.0001 matching minimum 195- 197 (1.) N( 2)N 68 of 72 0.0001 matching minimum 534- 536 (2.) N( 2)N 69 of 72 0.0001 matching minimum 853- 855 (3.) N( 2)N 70 of 72 0.0001 matching minimum 999-1001 (4.) N( 2)N 71 of 72 0.0001 matching minimum ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- FG-GAP FG-GAP repeat 236.7 3.2e-67 5 vwa von Willebrand factor type A domain 229.7 4.3e-65 1 integrin_A Integrin alpha cytoplasmic region 18.0 0.091 1 Kelch Kelch motif 12.3 5.9 1 FKBP FKBP-type peptidyl-prolyl cis-trans i 5.7 1.5 1 SQS_PSY Squalene and phytoene synthases 2.4 12 1 jmjN jmjN domain 1.6 74 1 E1-E2_ATPase E1-E2 ATPase 0.1 52 1 Leptin Leptin -0.6 72 1 isodh Isocitrate and isopropylmalate dehydr -1.4 83 1 B1 Protein L b1 domain -7.5 48 1 Transglutamin_N Transglutaminase family -62.1 31 1 CDP-OH_P_transf CDP-alcohol phosphatidyltransferase -63.6 97 1 Aldose_epim Aldose 1-epimerase -209.5 79 1 Orbi_VP5 Orbivirus outer capsid protein VP5 -276.5 51 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- FG-GAP 1/5 16 75 .. 1 63 [] 33.9 3.8e-06 E1-E2_ATPase 1/1 59 94 .. 87 119 .. 0.1 52 Leptin 1/1 165 178 .. 133 146 .] -0.6 72 B1 1/1 196 265 .. 1 71 [] -7.5 48 isodh 1/1 282 299 .. 379 398 .] -1.4 83 vwa 1/1 144 327 .. 1 200 [] 229.7 4.3e-65 Kelch 1/1 410 454 .. 1 35 [] 12.3 5.9 FG-GAP 2/5 405 457 .. 1 63 [] 37.5 3.1e-07 FG-GAP 3/5 459 520 .. 1 63 [] 60.2 4.5e-14 FG-GAP 4/5 540 599 .. 1 63 [] 68.4 1.5e-16 FG-GAP 5/5 602 654 .. 1 63 [] 38.2 1.9e-07 FKBP 1/1 671 681 .. 101 111 .] 5.7 1.5 SQS_PSY 1/1 701 714 .. 1 14 [. 2.4 12 Aldose_epim 1/1 633 880 .. 1 367 [] -209.5 79 Transglutamin_N 1/1 834 927 .. 1 131 [] -62.1 31 jmjN 1/1 985 992 .. 1 8 [. 1.6 74 Orbi_VP5 1/1 554 1032 .. 1 510 [] -276.5 51 CDP-OH_P_transf 1/1 1022 1138 .. 1 175 [] -63.6 97 integrin_A 1/1 1137 1151 .] 1 15 [] 18.0 0.091 Alignments of top-scoring domains: FG-GAP: domain 1 of 5, from 16 to 75: score 33.9, E = 3.8e-06 *->gsyFGySvalgDlngDgypDllVGAPragdqtqkgknetGaVyiyrg +FGy+v ++ ++l+G+P+ g q+ kn+tG Vy +++ tem30_gi|2 16 EDMFGYTVQQYENEEGK--WVLIGSPLVG-QP---KNRTGDVYKCPV 56 rpg..rvcpsqvldg.gia<-* +g++ +c+++ l ++ + tem30_gi|2 57 GRGesLPCVKLDLPVnTSI 75 E1-E2_ATPase: domain 1 of 1, from 59 to 94: score 0.1, E = 52 *->GESlPVeKhsdvvtp..dnp.lergnmVfaGTlvvs<-* GESlP K + +v + n++ + nm f++Tlv++ tem30_gi|2 59 GESLPCVKLDLPVNTsiPNVtEVKENMTFGSTLVTN 94 Leptin: domain 1 of 1, from 165 to 178: score -0.6, E = 72 *->LqDmLrqLDlsPgC<-* L D+L+++D+ P tem30_gi|2 165 LNDLLKRMDIGPKQ 178 B1: domain 1 of 1, from 196 to 265: score -7.5, E = 48 *->tPEedskEevtiKanlifadGstqtAeFKGtFaeAtaEAYrYADlLk s Eev + a i + G qt + Gt + A EA+ A + tem30_gi|2 196 LNKYSSTEEVLVAAKKIVQRGGRQTMTALGT-DTARKEAFTEARGAR 241 KenGkYtaDleDgGYtinirFAGK<-* + k + + Dg n r tem30_gi|2 242 RGVKKVMVIVTDGESHDNHRLKKV 265 isodh: domain 1 of 1, from 282 to 299: score -1.4, E = 83 *->LggeadGakyvsTsemgdav<-* Lg++++ +sT+++++++ tem30_gi|2 282 LGSYNR--GNLSTEKFVEEI 299 vwa: domain 1 of 1, from 144 to 327: score 229.7, E = 4.3e-65 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg Div++lDGS+Si++ ++ v +F++++ r+d+gp ++ ++Vg tem30_gi|2 144 DIVIVLDGSNSIYP--WDSVTAFLNDLLKRMDIGP-KQ-----TQVG 182 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq +vQY++nv++E f+ln y++ +evl a++ki+ ++gg +t+T+ + + tem30_gi|2 183 IVQYGENVTHE--FNLNKYSSTEEVLVAAKKIV---QRGGRQTMTALGTD 227 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln ++++++ftea+G+R +g++kv+V++TDG+s+d+ + +++v++ tem30_gi|2 228 TARKEAFTEARGAR------RGVKKVMVIVTDGESHDNHR----LKKVIQ 267 elkkeagvevfaiGvGnadn.......nnleeLreIAskpd.dhvfkvsd + + +++++f+i++ + +n+++ ++++++ee+++IAs+p+++h+f+vsd tem30_gi|2 268 DCED-ENIQRFSIAILGSYNrgnlsteKFVEEIKSIASEPTeKHFFNVSD 316 feaLdtlqelL<-* aL t+++ L tem30_gi|2 317 ELALVTIVKTL 327 Kelch: domain 1 of 1, from 410 to 454: score 12.3, E = 5.9 *->pRsgagvvvlggkiYviG.........vevYDpe.ntWeklpsmp<- ++ ++++ g+ +Y++G ++ ++++ v +Y e++ ++l +++ tem30_gi|2 410 YTVNSATASSGDVLYIAGqprynhtgqVIIYRMEdGNIKILQTLS 454 * tem30_gi|2 - - FG-GAP: domain 2 of 5, from 405 to 457: score 37.5, E = 3.1e-07 *->gsyFGySva.lgDlngDgypDllVGAPragdqtqkgknetGaVyiyr sy+Gy+v +++ gD ++++G Pr++ + tG+V iyr tem30_gi|2 405 ASYLGYTVNsATASSGDV--LYIAGQPRYN--H------TGQVIIYR 441 grpgrvcpsqvldg.gia<-* + + ++ ++ ++ +g++ tem30_gi|2 442 MED--GNIKILQTLsGEQ 457 FG-GAP: domain 3 of 5, from 459 to 520: score 60.2, E = 4.5e-14 *->gsyFGySvalgDlngDgy.pDllVGAPragdqtqkgknetGaVyiyr gsyFG+ ++ +D+++D+ ++ llVGAP+++ +t k+e+G+Vy+y tem30_gi|2 459 GSYFGSILTTTDIDKDSNtDILLVGAPMYM-GT--EKEEQGKVYVYA 502 grpgrvcpsqvldg.gia<-* ++ r++++++l++ +++ tem30_gi|2 503 LNQTRFEYQMSLEPiKQT 520 FG-GAP: domain 4 of 5, from 540 to 599: score 68.4, E = 1.5e-16 *->gsyFGySvalg.DlngDgypDllVGAPragdqtqkgknetGaVyiyr g++FG ++a+++Dln Dg++D+++GAP+++ +++GaVyiy+ tem30_gi|2 540 GARFGTAIAAVkDLNLDGFNDIVIGAPLED-------DHGGAVYIYH 579 grpg..rvcpsqvldg.gia<-* g++ + r +++q++ ++g + tem30_gi|2 580 GSGKtiRKEYAQRIPSgGDG 599 FG-GAP: domain 5 of 5, from 602 to 654: score 38.2, E = 1.9e-07 *->gsyFGySva.lgDlngDgypDllVGAPragdqtqkgknetGaVyiyr +FG+S+++ +DlngDg++D+ +G G + +++ tem30_gi|2 602 LKFFGQSIHgEMDLNGDGLTDVTIGG-------------LGGAALFW 635 grpg.rvcpsqvldg.gia<-* +r++ +v++++++++++++ tem30_gi|2 636 SRDVaVVKVTMNFEPnKVN 654 FKBP: domain 1 of 1, from 671 to 681: score 5.7, E = 1.5 *->naTLvFeVELl<-* naT++FeV L+ tem30_gi|2 671 NATVCFEVKLK 681 SQS_PSY: domain 1 of 1, from 701 to 714: score 2.4, E = 12 *->laqgSkSFalairL<-* l+q S+SF+ ++++ tem30_gi|2 701 LRQISRSFFSGTQE 714 Aldose_epim: domain 1 of 1, from 633 to 880: score -209.5, E = 79 *->ffnakdgqdyrliTlgnnngGKfqVsfanlGATlvdlkVngesGlkn f + d + + ++T+ + + kVn ++ tem30_gi|2 633 LFWSRDVA-VVKVTMNFEPN-----------------KVNIQK---- 657 iVLGfdnlkgYlkddnKEGaffGAtIGRYANRIakgkFsln......... k++ ++ KE ++ At+ F+ ++++++ tem30_gi|2 658 --------KNCHMEG-KETVCINATV----------CFEVKlkskedtiy 688 ..gktyqLtvNnlggNsLHsGieGfdkrrwlgpivktNpkddvvtaeFkL + + y +t+ + s +G + r+ + i + k + ++ +F tem30_gi|2 689 eaDLQYRVTLDS-LRQISRSFFSGTQERKVQRNITV--RKSECTKHSFYM 735 idddgdqGPEFPGdLeVtVqYtLndankGGtLkIeYeAklveEsggesTp +d + q + t+ L+d + + +++ +s p tem30_gi|2 736 LDKHDFQD-----SVRITLDFNLTDPEN---------GPVLD----DSLP 767 vNlTNHsYFNLnkaenksdirgtevqllsdksLeV..deneliPTGkivs + H Y + a++ g + + +sd sL V ++e++l+ iv tem30_gi|2 768 NSV--HEY--IPFAKD----CGNKEKCISDLSLHVatTEKDLL----IVR 805 vTdittFDfrkPtvLgpdipafDyqfvvkenyDqafldtrssndklkpva + +++ F+ + L+ ++ + y+ +++ s n tem30_gi|2 806 S-QNDKFNVS----LTVKNTKDS-------AYNTRTIVHYSPN------- 836 klthpdskltleVlTTePtlqfYTGDyLdgnivraGelYedrsGfAlEag +++G +++ +E+ tem30_gi|2 837 --------------------LVFSGIEAIQKDS-------------CESN 853 rypDAiNhpeWkdycviLkkGetYgskteYkFe<-* + + +++ +L++Ge ++ k+ ++F tem30_gi|2 854 HNIT--CKVGYP----FLRRGEMVTFKILFQFN 880 Transglutamin_N: domain 1 of 1, from 834 to 927: score -62.1, E = 31 *->seelvvesvdlekdeNrreHHTlMYDdeyssqrLiVRRGQ..pFsit s +lv+ ++ ++ ++ H + +++RRG +F i+ tem30_gi|2 834 SPNLVFSGIEAIQKDSCESNHN----ITCKVGYPFLRRGEmvTFKIL 876 LvfsgPRpyePskDkLkLefttGpqkPsetkgTkvvfpvsdsdldsggWs +f+ y+ + +++ ++ ++ se+ p+ + tem30_gi|2 877 FQFN--TSYL--MENVTIYLSATSD--SEE-------PPETL-------- 905 AvlqkqdgnVeltLsitSPAnAiiGkYrLtvevstp<-* ++n +++i++P+ kY ++ + tem30_gi|2 906 ------SDN---VVNISIPV-----KYEVGLQFYSS 927 jmjN: domain 1 of 1, from 985 to 992: score 1.6, E = 74 *->eiPvfyPt<-* ++Pv+yPt tem30_gi|2 985 GYPVLYPT 992 Orbi_VP5: domain 1 of 1, from 554 to 1032: score -276.5, E = 51 *->sLsrfGkkvgnA.LTSn..tAKkiYktiGktaerfaESeiGSAAIDG L f v +A+L +++ A iY Gkt+ + i S DG tem30_gi|2 554 NLDGFNDIVIGApLEDDhgGAVYIYHGSGKTIRKEYAQRIPSGG-DG 599 lvQGsvqSiitGEsyGesvKqAVlLNVlGtgeeiPDPLspGEqgiqakvk t +G+s+ LN G + L + v tem30_gi|2 600 K---------TLKFFGQSIHGEMDLNGDGLTDVTIGGLGGAALFWSRDVA 640 ElEdeekeelVrekhNkkIiekFGkdLeevykfmkgeakaevlekkqfe. + + e + kk Gk+ + + e+k e+ +e tem30_gi|2 641 VVKVTMNFEPNKVNIQKKNCHMEGKETVCINATVCFEVKLKSKEDTIYEa 690 ..ileKAltsylkilkdeserltrLarALQkEedeRtkdEtrMvkEYReK + + + l s+ +i + ++ + +r Q+ + R ++ t+ +K tem30_gi|2 691 dlQYRVTLDSLRQISRSFFSGTQ--ERKVQRNITVRKSECTKHSFYMLDK 738 idALKnAIEiErdgiqeEAiQEiaglsadVlEaAsEEVPlfGAGvAtavA d Q + ++ d + E P++ ++v tem30_gi|2 739 HD-----------------FQDSVRITLDFNLTDPENGPVLDDSLPNSVH 771 ..TtRAIeGgyKLKkvinaLsGIDLSHLrvpkIePkiiekileadtkeeI + A + g K K + DLS L v+ e + + d+ tem30_gi|2 772 eyIPFAKDCGNKEKCI------SDLS-LHVATTEKDLLIVRSQNDKF--- 811 pDksLavaiksKveaidennnEvlHikdeiLpkvkkameeehKEIegieD + +k+ + + ++ + H + +a +++ + tem30_gi|2 812 ---NVSLTVKNTKDSAYNTRT-IVHYSPNLVFSGIEAIQKD----SCESN 853 kkihpkv.........pmrlKi.....pseqqPkihIY.saPWDSDdVFm i kv+ + +++++ +Ki + + s + IY sa DS + tem30_gi|2 854 HNITCKVgypflrrgeMVTFKIlfqfnTSYLMENVTIYlSATSDSEE--P 901 fhcIshHHqneSfflGFDLeieyVhyEDltvhwHalhGaAqeveGRtlre s n S + + ++ + ++ H+ + + e tem30_gi|2 902 PETLSDNVVNISIPVKYEVGLQ---FYSSASEYHIS----IAAN-----E 939 AYsEFlnlAismpytpelHkrrliRSkgvHPIYlGsmhYdisYeqLksNa E +n + + e+ liR g+ P + + is+ ++ sN tem30_gi|2 940 TVPEVIN--STEDIGNEINIFYLIRKSGSFP--MPELKLSISFPNMTSNG 985 lriVyDseL........QmHiLRGPlhfqrRailnALkfGvK...vl<-* + y + L+++++ + Hi P + + + + K++++l tem30_gi|2 986 YPVLYPTGLsssenancRPHIFEDPFSINSGKKMTTSTDHLKrgtIL 1032 CDP-OH_P_transf: domain 1 of 1, from 1022 to 1138: score -63.6, E = 97 *->allfllasitDwlDGyiARklnqvtrFGafLDPvaDkllvvialiiL tD l + G++LD + + ++++++ L tem30_gi|2 1022 --------STDHL------------KRGTILDCNT--CKFATITCNL 1046 aylglvpwwltipaalil....gvaSHYmhrelllsglrevlaylglkep + + ++ + ++ lil ++++ + + s+l + ++ + ++e+ tem30_gi|2 1047 TSSD-ISQVNVS---LILwkptF-------IKSYFSSLNLTIRGELRSEN 1085 aiavlyygkrktiftCAsaqivaiallLlgfslqsvlvgseiysklagvi a +vl ++k+ +ai++++ g++ +++++++ tem30_gi|2 1086 ASLVLSSSNQKRE--------LAIQISKDGLP-----------GRVPLWV 1116 fwfilaaslcgyiaavltlitglqyvvaalay<-* + a+ +l + ++ a+++ tem30_gi|2 1117 IL----------LSAFAGLLLLMLLILALWKI 1138 integrin_A: domain 1 of 1, from 1137 to 1151: score 18.0, E = 0.091 *->KcGFFKRkrkpqeee<-* K GFFKR k+++e tem30_gi|2 1137 KIGFFKRPLKKKMEK 1151 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- vwa von Willebrand factor type A domain 227.8 7.4e-66 1 FG-GAP FG-GAP repeat 227.5 2e-64 5 integrin_A Integrin alpha cytoplasmic region 16.1 0.0012 1 Kelch Kelch motif 10.3 0.25 1 FKBP FKBP-type peptidyl-prolyl cis-trans i 5.7 1.5 1 Aldose_epim Aldose 1-epimerase 5.1 2.2 1 Smr Smr domain 3.2 21 1 SQS_PSY Squalene and phytoene synthases 2.4 12 1 PsbJ PsbJ 1.6 45 1 jmjN jmjN domain 1.6 74 1 Opiods_neuropep Vertebrate endogenous opioids neurope 1.5 18 1 SRCR Scavenger receptor cysteine-rich doma 1.3 53 1 DUF5 HAMP domain 1.3 87 1 Chromate_transp Chromate transporter 0.7 53 1 UPF0054 Uncharacterized protein family UPF005 0.6 81 1 E1-E2_ATPase E1-E2 ATPase 0.1 52 1 P5CR Delta 1-pyrroline-5-carboxylate reduc 0.1 57 1 Leptin Leptin -0.6 72 1 isodh Isocitrate and isopropylmalate dehydr -1.4 83 1 Orbi_VP5 Orbivirus outer capsid protein VP5 -2.0 84 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- FG-GAP 1/5 16 75 .. 1 63 [] 32.0 9.1e-08 E1-E2_ATPase 1/1 59 94 .. 87 119 .. 0.1 52 Leptin 1/1 165 178 .. 133 146 .] -0.6 72 Orbi_VP5 1/1 212 231 .. 1 20 [. -2.0 84 Smr 1/1 261 279 .. 14 32 .. 3.2 21 isodh 1/1 282 299 .. 379 398 .] -1.4 83 vwa 1/1 144 327 .. 1 200 [] 227.8 7.4e-66 Kelch 1/1 410 454 .. 1 35 [] 10.3 0.25 FG-GAP 2/5 405 457 .. 1 63 [] 35.7 7.7e-09 FG-GAP 3/5 459 520 .. 1 63 [] 58.2 2.2e-15 P5CR 1/1 544 561 .. 256 273 .] 0.1 57 FG-GAP 4/5 540 599 .. 1 63 [] 66.5 9.2e-18 FG-GAP 5/5 602 654 .. 1 63 [] 36.4 4.8e-09 FKBP 1/1 671 681 .. 101 111 .] 5.7 1.5 UPF0054 1/1 682 695 .. 1 14 [. 0.6 81 SQS_PSY 1/1 701 714 .. 1 14 [. 2.4 12 Aldose_epim 1/1 864 880 .. 351 367 .] 5.1 2.2 Opiods_neuropep 1/1 898 912 .. 253 267 .] 1.5 18 jmjN 1/1 985 992 .. 1 8 [. 1.6 74 SRCR 1/1 1034 1045 .. 101 113 .] 1.3 53 PsbJ 1/1 1103 1131 .. 1 24 [. 1.6 45 Chromate_transp 1/1 1110 1134 .. 153 177 .] 0.7 53 DUF5 1/1 1119 1147 .. 1 29 [. 1.3 87 integrin_A 1/1 1137 1151 .] 1 15 [] 16.1 0.0012 Alignments of top-scoring domains: FG-GAP: domain 1 of 5, from 16 to 75: score 32.0, E = 9.1e-08 *->gsyFGySvalgDlngDgypDllVGAPragdqtqkgknetGaVyiyrg +FGy+v ++ ++l+G+P+ g q+ kn+tG Vy +++ tem30_gi|2 16 EDMFGYTVQQYENEEGK--WVLIGSPLVG-QP---KNRTGDVYKCPV 56 rpg..rvcpsqvldg.gia<-* +g++ +c+++ l ++ + tem30_gi|2 57 GRGesLPCVKLDLPVnTSI 75 E1-E2_ATPase: domain 1 of 1, from 59 to 94: score 0.1, E = 52 *->GESlPVeKhsdvvtp..dnp.lergnmVfaGTlvvs<-* GESlP K + +v + n++ + nm f++Tlv++ tem30_gi|2 59 GESLPCVKLDLPVNTsiPNVtEVKENMTFGSTLVTN 94 Leptin: domain 1 of 1, from 165 to 178: score -0.6, E = 72 *->LqDmLrqLDlsPgC<-* L D+L+++D+ P tem30_gi|2 165 LNDLLKRMDIGPKQ 178 Orbi_VP5: domain 1 of 1, from 212 to 231: score -2.0, E = 84 *->sLsrfGkkvgnALTSntAKk<-* +r G+ + AL +tA+k tem30_gi|2 212 IVQRGGRQTMTALGTDTARK 231 Smr: domain 1 of 1, from 261 to 279: score 3.2, E = 21 *->eLekfiaqAllaglpkveI<-* +L k i++++++++++++I tem30_gi|2 261 RLKKVIQDCEDENIQRFSI 279 isodh: domain 1 of 1, from 282 to 299: score -1.4, E = 83 *->LggeadGakyvsTsemgdav<-* Lg++++ +sT+++++++ tem30_gi|2 282 LGSYNR--GNLSTEKFVEEI 299 vwa: domain 1 of 1, from 144 to 327: score 227.8, E = 7.4e-66 *->DivFLlDGSgSigsqnFervKdFvervverLdvgprdkkeedavrVg Div++lDGS+Si++ ++ v +F++++ r+d+gp ++ ++Vg tem30_gi|2 144 DIVIVLDGSNSIYP--WDSVTAFLNDLLKRMDIGP-KQ-----TQVG 182 lvQYSdnvrtEikfklndyqnkdevlqalqkiryedyyggggtnTgaALq +vQY++nv++E f+ln y++ +evl a++ki+ ++gg +t+T+ + + tem30_gi|2 183 IVQYGENVTHE--FNLNKYSSTEEVLVAAKKIV---QRGGRQTMTALGTD 227 yvvrnlfteasGsRiepvaeegapkvlVvlTDGrsqddpspTidirdvln ++++++ftea+G+R +g++kv+V++TDG+s+d+ + +++v++ tem30_gi|2 228 TARKEAFTEARGAR------RGVKKVMVIVTDGESHDNHR----LKKVIQ 267 elkkeagvevfaiGvGnadn.......nnleeLreIAskpd.dhvfkvsd + + +++++f+i++ + +n+++ ++++++ee+++IAs+p+++h+f+vsd tem30_gi|2 268 DCED-ENIQRFSIAILGSYNrgnlsteKFVEEIKSIASEPTeKHFFNVSD 316 feaLdtlqelL<-* aL t+++ L tem30_gi|2 317 ELALVTIVKTL 327 Kelch: domain 1 of 1, from 410 to 454: score 10.3, E = 0.25 *->pRsgagvvvlggkiYviG.........vevYDpe.ntWeklpsmp<- ++ ++++ g+ +Y++G ++ ++++ v +Y e++ ++l +++ tem30_gi|2 410 YTVNSATASSGDVLYIAGqprynhtgqVIIYRMEdGNIKILQTLS 454 * tem30_gi|2 - - FG-GAP: domain 2 of 5, from 405 to 457: score 35.7, E = 7.7e-09 *->gsyFGySva.lgDlngDgypDllVGAPragdqtqkgknetGaVyiyr sy+Gy+v +++ gD ++++G Pr++ + tG+V iyr tem30_gi|2 405 ASYLGYTVNsATASSGDV--LYIAGQPRYN--H------TGQVIIYR 441 grpgrvcpsqvldg.gia<-* + + ++ ++ ++ +g++ tem30_gi|2 442 MED--GNIKILQTLsGEQ 457 FG-GAP: domain 3 of 5, from 459 to 520: score 58.2, E = 2.2e-15 *->gsyFGySvalgDlngDgy.pDllVGAPragd.qtqkgknetGaVyiy gsyFG+ ++ +D+++D+ ++ llVGAP+++++ +e+G+Vy+y tem30_gi|2 459 GSYFGSILTTTDIDKDSNtDILLVGAPMYMGtEK----EEQGKVYVY 501 rgrpgrvcpsqvldg.gia<-* ++ r++++++l++ +++ tem30_gi|2 502 ALNQTRFEYQMSLEPiKQT 520 P5CR: domain 1 of 1, from 544 to 561: score 0.1, E = 57 *->GTTiaGLrvLEkgGfRga<-* GT ia+++ L Gf ++ tem30_gi|2 544 GTAIAAVKDLNLDGFNDI 561 FG-GAP: domain 4 of 5, from 540 to 599: score 66.5, E = 9.2e-18 *->gsyFGySvalg.DlngDgypDllVGAPragdqtqkgknetGaVyiyr g++FG ++a+++Dln Dg++D+++GAP+++ +++GaVyiy+ tem30_gi|2 540 GARFGTAIAAVkDLNLDGFNDIVIGAPLED-------DHGGAVYIYH 579 grpg..rvcpsqvldg.gia<-* g++ + r +++q++ ++g + tem30_gi|2 580 GSGKtiRKEYAQRIPSgGDG 599 FG-GAP: domain 5 of 5, from 602 to 654: score 36.4, E = 4.8e-09 *->gsyFGySva.lgDlngDgypDllVGAPragdqtqkgknetGaVyiyr +FG+S+++ +DlngDg++D+ +G G + +++ tem30_gi|2 602 LKFFGQSIHgEMDLNGDGLTDVTIGG-------------LGGAALFW 635 grpg.rvcpsqvldg.gia<-* +r++ +v++++++++++++ tem30_gi|2 636 SRDVaVVKVTMNFEPnKVN 654 FKBP: domain 1 of 1, from 671 to 681: score 5.7, E = 1.5 *->naTLvFeVELl<-* naT++FeV L+ tem30_gi|2 671 NATVCFEVKLK 681 UPF0054: domain 1 of 1, from 682 to 695: score 0.6, E = 81 *->VsddeIhelNkkfR<-* +d+I e+++++R tem30_gi|2 682 SKEDTIYEADLQYR 695 SQS_PSY: domain 1 of 1, from 701 to 714: score 2.4, E = 12 *->laqgSkSFalairL<-* l+q S+SF+ ++++ tem30_gi|2 701 LRQISRSFFSGTQE 714 Aldose_epim: domain 1 of 1, from 864 to 880: score 5.1, E = 2.2 *->iLkkGetYgskteYkFe<-* +L++Ge ++ k+ ++F tem30_gi|2 864 FLRRGEMVTFKILFQFN 880 Opiods_neuropep: domain 1 of 1, from 898 to 912: score 1.5, E = 18 *->SqEdpesYskelpdv<-* S+E pe++s ++ ++ tem30_gi|2 898 SEEPPETLSDNVVNI 912 jmjN: domain 1 of 1, from 985 to 992: score 1.6, E = 74 *->eiPvfyPt<-* ++Pv+yPt tem30_gi|2 985 GYPVLYPT 992 SRCR: domain 1 of 1, from 1034 to 1045: score 1.3, E = 53 *->cshdgedagVvCs<-* c++ +++a ++C+ tem30_gi|2 1034 CNT-CKFATITCN 1045 PsbJ: domain 1 of 1, from 1103 to 1131: score 1.6, E = 45 *->mantg...RiPLWlvgt..vaGlaaigvl<-* +++ g ++R+PLW++ ++ +aGl+ + +l tem30_gi|2 1103 ISKDGlpgRVPLWVILLsaFAGLLLLMLL 1131 Chromate_transp: domain 1 of 1, from 1110 to 1134: score 0.7, E = 53 *->ktkffivpllliilalllgvlvtpk<-* ++++ +v+ll+++++lll+ l +++ tem30_gi|2 1110 GRVPLWVILLSAFAGLLLLMLLILA 1134 DUF5: domain 1 of 1, from 1119 to 1147: score 1.3, E = 87 *->llivllvallllallalllarrilrPlrr<-* l + ++++ll+l++lal+ ++ rPl++ tem30_gi|2 1119 LSAFAGLLLLMLLILALWKIGFFKRPLKK 1147 integrin_A: domain 1 of 1, from 1137 to 1151: score 16.1, E = 0.0012 *->KcGFFKRkrkpqeee<-* K GFFKR k+++e tem30_gi|2 1137 KIGFFKRPLKKKMEK 1151 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Tue Oct 31 15:47:19 2000 Sequence file: tem30 ---------------------------------------- Sequence tem30_gi|2829468|sp|P56199|ITA1_HUMAN (1151 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 46: NRTG 72: NTSI 77: NVTE 84: NMTF 189: NVTH 289: NLST 313: NVSD 374: NGTV 390: NTTF 432: NHTG 504: NQTR 671: NATV 720: NITV 752: NLTD 812: NVSL 855: NITC 880: NTSY 887: NVTI 911: NISI 938: NETV 946: NSTE 980: NMTS 1045: NLTS 1055: NVSL 1074: NLTI 1085: NASL Total matches: 26 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 1017: KKMT 1027: KRGT Total matches: 2 Matching pattern PS00005 PKC_PHOSPHO_SITE: 228: TAR 292: TEK 307: TEK 397: STK 398: TKK 490: TEK 523: SSR 581: SGK 584: TIR 601: TLK 700: SLR 722: TVR 744: SVR 797: TEK 816: TVK 857: TCK 872: TFK 1015: SGK 1036: TCK 1076: TIR Total matches: 20 Matching pattern PS00006 CK2_PHOSPHO_SITE: 141: TQLD 200: SSTE 201: STEE 252: TDGE 326: TLGE 418: SSGD 467: TTTD 469: TDID 475: SNTD 490: TEKE 506: TRFE 595: SGGD 682: SKED 686: TIYE 754: TDPE 769: SVHE 797: TEKD 806: SQND 840: SGIE 896: SDSE 904: TLSD 927: SASE 940: TVPE 947: STED 995: SSSE 1021: TSTD 1030: TILD 1047: TSSD Total matches: 28 Matching pattern PS00008 MYRISTYL: 42: GQPKNR 97: GGFLAC 216: GGRQTM 225: GTDTAR 239: GARRGV 288: GNLSTE 366: GAVGAY 459: GSYFGS 463: GSILTT 540: GARFGT 544: GTAIAA 580: GSGKTI 626: GGLGGA 627: GLGGAA 952: GNEINI 993: GLSSSE Total matches: 16 Matching pattern PS00009 AMIDATION: 1015: SGKK Total matches: 1 Matching pattern PS00146 BETA_LACTAMASE_A: 393: FNVESTKKNEPLASYL Total matches: 1 Matching pattern PS00242 INTEGRIN_ALPHA: 1136: WKIGFFKR Total matches: 1 Total no of hits in this sequence: 95 ======================================== 1314 pattern(s) searched in 1 sequence(s), 1151 residues. Total no of hits in all sequences: 95. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem30 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 1151 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) (1151 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value VWA Von Willebrand factor A domain 146 1e-37 VHS A conserved domain found in VPS-27, Hrs and STAM 25 0.41 CNMP Cyclic nucleotide binding domain 24 0.86 LAMNT Lamin N-terminal domain 24 0.92 SGTP Small GTPAses 23 2.2 HISDAC Histone deacetylase domain 23 3.1 PCNA Proliferating Cell nuclear antigen like domain 22 5.3 CALMO Calmodulin like EF-hand domains 22 5.4 KELCH Kelch repeat- beta propeller like domain 21 7.2 >VWA Von Willebrand factor A domain Length = 255 Score = 146 bits (366), Expect = 1e-37 Identities = 23/211 (10%), Positives = 62/211 (28%), Gaps = 23/211 (10%) Query: 142 QLDIVIVLDGSNSIYPWD-------SVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTH-E 193 +++ LD S ++ D + + D + +Q+GI+ + Sbjct: 53 IRSLILTLDCSEAMLEKDLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLV 112 Query: 194 FNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVTD 253 ++ + + + + G + AR ++V+++ Sbjct: 113 SQVSGNPQDHIDALKSIRKQEPKGN-PSLQNALEMARGLL---LPVPAHCTREVLIVFGS 168 Query: 254 GESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFN 313 + D + + I E I+ + + + +E+ + E + Sbjct: 169 LSTTDPGDIHQTIDSLVSEKIRVKVLGL--------SAQVAICKELCKATNYGDESFYKI 220 Query: 314 VSDELALVTIVKTLGERIFALEATADQSAAS 344 + DE L + E + L + Sbjct: 221 LLDETHLKEL---FNEAVTPLPVNKINKGFT 248 >VHS A conserved domain found in VPS-27, Hrs and STAM Length = 144 Score = 25.2 bits (55), Expect = 0.41 Identities = 11/64 (17%), Positives = 25/64 (38%), Gaps = 7/64 (10%) Query: 262 LKKVIQDCEDENIQRFSIAIL-------GSYNRGNLSTEKFVEEIKSIASEPTEKHFFNV 314 +KK I ++ + ++ +L G ++++ FVE + P V Sbjct: 50 IKKRIVGNKNFHEVMLALTVLETCVKNCGHRFHILVASQDFVESVLVRTILPKNNPPAIV 109 Query: 315 SDEL 318 D++ Sbjct: 110 HDKV 113 >CNMP Cyclic nucleotide binding domain Length = 118 Score = 24.4 bits (53), Expect = 0.86 Identities = 9/37 (24%), Positives = 15/37 (40%), Gaps = 3/37 (8%) Query: 434 TGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTD 470 +G V I + + ++L L GS G I + Sbjct: 43 SGGVDIIASKGVSEQVLAKLG---PGSMAGEIGVVFN 76 >LAMNT Lamin N-terminal domain Length = 256 Score = 24.1 bits (52), Expect = 0.92 Identities = 7/27 (25%), Positives = 14/27 (50%) Query: 945 INSTEDIGNEINIFYLIRKSGSFPMPE 971 +N T D+ E ++ Y+ + + P P Sbjct: 116 VNITIDLEQEFHVAYVWIQMANSPRPG 142 Score = 21.0 bits (44), Expect = 8.4 Identities = 6/34 (17%), Positives = 10/34 (28%) Query: 519 QTCCSSRQHNSCTTENKNEPCGARFGTAIAAVKD 552 +Q + EN G G A ++ Sbjct: 54 SYQDDEQQKSWSQYENPMVRGGHGCGHCNAGNEN 87 >SGTP Small GTPAses Length = 164 Score = 22.8 bits (49), Expect = 2.2 Identities = 27/121 (22%), Positives = 49/121 (40%), Gaps = 12/121 (9%) Query: 190 VTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGAR-RGVKKVM 248 + +F + + T V KI+ GG+ + DTA +E F + RG + Sbjct: 21 IEKKFK-DDSNHTIGVEF-GSKIINVGGKYVKLQI-WDTAGQERFRSVTRSYYRGAAGAL 77 Query: 249 VI--VTDGESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFV--EEIKSIAS 304 ++ +T E+++ L + D Q I + G N+ +L ++ V E A Sbjct: 78 LVYDITSRETYNA--LTNWLTDARMLASQNIVIILCG--NKKDLDADREVTFLEASRFAQ 133 Query: 305 E 305 E Sbjct: 134 E 134 >HISDAC Histone deacetylase domain Length = 433 Score = 22.6 bits (48), Expect = 3.1 Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 5/30 (16%) Query: 558 FNDIVIGAP-LEDDHGGAVYI----YHGSG 582 NDIV+G L H +YI +HG G Sbjct: 163 LNDIVLGIIELLRYHPRVLYIDIDVHHGDG 192 >PCNA Proliferating Cell nuclear antigen like domain Length = 280 Score = 21.7 bits (46), Expect = 5.3 Identities = 12/131 (9%), Positives = 31/131 (23%), Gaps = 26/131 (19%) Query: 852 SNHNITCKVGYPFLRRGEMVTFKILFQFNTSYLMENVTIYL-SATSDSEEPPETLSDNVV 910 ++ Y G + + + +T T D + + + ++ Sbjct: 102 PGTLTALRMCYQ----GYGHPLMLFLEEGGVVTVCKITTQEPEETLDFDFCSTNVMNKII 157 Query: 911 --------------NISIPVKYEV---GLQFYSSAS----EYHISIAANETVPEVINSTE 949 ++ V F S H+ + + E + + Sbjct: 158 LQSEGLREAFSELDMTGDVLQITVSPDKPYFRLSTFGNAGNSHLDYPKDSDLVEAFHCNK 217 Query: 950 DIGNEINIFYL 960 N + L Sbjct: 218 TQVNRYKLSLL 228 >CALMO Calmodulin like EF-hand domains Length = 147 Score = 21.6 bits (45), Expect = 5.4 Identities = 10/86 (11%), Positives = 30/86 (34%), Gaps = 18/86 (20%) Query: 219 QTMTALGTDTARKEAFTEARGARRGVKKVM----VIVTDGESHDNHRLKKVIQDCEDENI 274 + + + ++ EAF G+ V+ + GE + + ++++ D + Sbjct: 75 RQLKSNDSEQELLEAFKVFDKNGDGLISAAELKHVLTSIGEKLTDAEVDDMLREVSDGS- 133 Query: 275 QRFSIAILGSYNRGNLSTEKFVEEIK 300 G ++ ++F + Sbjct: 134 -------------GEINIQQFAALLS 146 >KELCH Kelch repeat- beta propeller like domain Length = 319 Score = 21.1 bits (44), Expect = 7.2 Identities = 6/30 (20%), Positives = 13/30 (43%) Query: 398 TKKNEPLASYLGYTVNSATASSGDVLYIAG 427 T + + ++ +SG+ LY+ G Sbjct: 245 TYQWTKVGDVTAKRMSCHAVASGNKLYVVG 274 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 9 Number of calls to ALIGN: 10 Length of query: 1151 Total length of test sequences: 20182 Effective length of test sequences: 16136.0 Effective search space size: 17949017.7 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= tem30_gi|2829468|sp|P56199|ITA1_HUMAN INTEGRIN ALPHA-1 (LAMININ AND COLLAGEN RECEPTOR) (VLA-1) (CD49A) (1151 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|2582830 [161..353] Integrin A (or I) domain 178 3e-46 gi|2645090 [1..155] Integrin A (or I) domain 125 6e-30 gi|420193 [185..357] Integrin A (or I) domain 119 3e-28 gi|2137729 [37..266] 7-bladed beta-propeller 30 0.23 gi|452523 [41..225] 7-bladed beta-propeller 29 0.29 gi|1086900 [26..235] 7-bladed beta-propeller 29 0.44 gi|1363708 [373..683] 7-bladed beta-propeller 29 0.59 gi|2832331 [157..459] 7-bladed beta-propeller 28 0.64 gi|2102675 [28..258] 7-bladed beta-propeller 28 1.0 gi|2815132 [909..1205] 7-bladed beta-propeller 28 1.1 gi|1556399 [647..916] 7-bladed beta-propeller 27 1.5 gi|1181604 [175..454] 7-bladed beta-propeller 27 2.2 gi|586529 [222..473] 7-bladed beta-propeller 26 2.4 gi|2104446 [1048..1249] 7-bladed beta-propeller 26 2.8 gi|121286 [210..431] Double-stranded beta-helix 26 3.5 gi|1001440 [1028..1666] 7-bladed beta-propeller 26 3.7 gi|2736365 [143..399] 7-bladed beta-propeller 26 3.7 gi|1001939 [228..429] 7-bladed beta-propeller 25 5.2 gi|1827706 [1..365] Cellulases 25 5.4 gi|398985 [18..447] PLP-dependent transferases 25 5.4 gi|2506563 [66..342] Periplasmic binding protein-like I 25 7.0 gi|2494897 [186..459] 7-bladed beta-propeller 25 7.2 gi|2496115 [252..391] Flavodoxin-like 25 7.4 gi|126343 [24..369] Periplasmic binding protein-like I 25 7.5 gi|2132510 [25..215] Thiamin-binding 25 7.6 gi|1176225 [21..201] Periplasmic binding protein-like II 25 9.5 >gi|2582830 [161..353] Integrin A (or I) domain Length = 193 Score = 178 bits (449), Expect = 3e-46 Identities = 162/193 (83%), Positives = 178/193 (91%) Query: 143 LDIVIVLDGSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSST 202 LDIVIVLDGSNSIYPW+SVTAFLN LL+ MDIGP+QTQVGIVQYG+ V HEF LN YS+T Sbjct: 1 LDIVIVLDGSNSIYPWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYLNTYSTT 60 Query: 203 EEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVTDGESHDNHRL 262 EEV+ AA +I QRGG QTMTALG DTAR+EAFTEA GARRGV+KVMVIVTDGESHDN+RL Sbjct: 61 EEVMDAALRIRQRGGTQTMTALGIDTAREEAFTEAHGARRGVQKVMVIVTDGESHDNYRL 120 Query: 263 KKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVT 322 ++VI CEDENIQRF+IAILGSY+RGNLSTEKFVEEIKSIAS+PTEKHFFNVSDELALVT Sbjct: 121 QEVIDKCEDENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVT 180 Query: 323 IVKTLGERIFALE 335 IV+ LGERIFALE Sbjct: 181 IVEALGERIFALE 193 >gi|2645090 [1..155] Integrin A (or I) domain Length = 155 Score = 125 bits (311), Expect = 6e-30 Identities = 35/165 (21%), Positives = 77/165 (46%), Gaps = 15/165 (9%) Query: 145 IVIVLDGSNSIYP--WDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSST 202 ++ +LD S S+ +D + F+ ++ D+ + T++GI+++ + L+ + + Sbjct: 1 MLFLLDSSGSVGDENFDKMKEFVKSIVLNFDVDNQLTRIGIIRFDSDAEIIIQLSDHKTL 60 Query: 203 EEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVTDGES---HDN 259 +++L I G QT + A EAF+E G R K +V++ DG++ + Sbjct: 61 KDLLNDIDSIRYNEGIQTRIDKALERAM-EAFSEKNGGRADATKALVLLADGQNSFIEGS 119 Query: 260 HRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIAS 304 L + ++ D + R+ I I + ++E++ IA+ Sbjct: 120 QDLNEELKPLIDAKVFRYVIGIGRELD---------LKELEDIAT 155 >gi|420193 [185..357] Integrin A (or I) domain Length = 173 Score = 119 bits (296), Expect = 3e-28 Identities = 36/185 (19%), Positives = 71/185 (37%), Gaps = 20/185 (10%) Query: 144 DIVIVLDGSNSI--YPWDSVTAFLNDLLKRMDIG-----PKQTQVGIVQYGENVTHEFNL 196 DIV +LDGS + + V F+ D+ +R+ + P ++ ++QYG + Sbjct: 1 DIVFLLDGSERLGEQNFHKVRRFVEDVSRRLTLARRDDDPLNARMALLQYGSQNQQQVAF 60 Query: 197 NKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEAR-GARRGVKKVMVIVTDGE 255 + + A ++ + G A AR GARR + V +TDG Sbjct: 61 PLTYNVTTIHEALERATYLNS-FSHVGTGIVHAINNVVRGARGGARRHAELSFVFLTDGV 119 Query: 256 SHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVS 315 + ++ L++ + +N+ +A+ G + ++ + S F Sbjct: 120 TGND-SLEESVHSMRKQNVVPTVVAVGGDVD----------MDVLTKISLGDRAAIFREK 168 Query: 316 DELAL 320 D +L Sbjct: 169 DFDSL 173 >gi|2137729 [37..266] 7-bladed beta-propeller Length = 230 Score = 29.8 bits (66), Expect = 0.23 Identities = 15/178 (8%), Positives = 43/178 (23%), Gaps = 29/178 (16%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 N T + + + I S S T + + N + G+ Sbjct: 48 DLNSTSMYLASGGLNNTVNIWDLKSKRLHRSLKDHKCEVTCVAYNWNDCYIASGS----- 102 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPCGARFGTAIAA 549 G++ ++++ + S+Q + + Sbjct: 103 ----LSGEIILHSVT--------TNTSSTPFGHGSKQPIRHIKYSLFRKSLLGSVSDNGV 150 Query: 550 VKDLNLDGFNDIVIGAPLEDDHGGAVY--IYHGSGKTIRKEYAQRIPSGGDGKTLKFF 605 V +++ + + H + + + + G K + + Sbjct: 151 VTLWDVNSQSSYHT---FDSTHKAPASGICFSPVNELL-------FVTIGLDKRIILY 198 >gi|452523 [41..225] 7-bladed beta-propeller Length = 185 Score = 29.3 bits (65), Expect = 0.29 Identities = 8/94 (8%), Positives = 22/94 (22%), Gaps = 9/94 (9%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 R + + + S + + + TD+L +G Sbjct: 16 RLHQEYVPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAVGMSNQTDLLALGT----- 70 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCS 523 G + +Y+ + ++ Sbjct: 71 ----AVGSILLYSTVKGELHSKLISGGHDNRVNC 100 >gi|1086900 [26..235] 7-bladed beta-propeller Length = 210 Score = 28.9 bits (64), Expect = 0.44 Identities = 11/107 (10%), Positives = 27/107 (24%), Gaps = 9/107 (8%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 +N V+ D ++ S + + T + D N ++ G+ Sbjct: 63 AFNEESSVVFSGSMDCTMQAFDCRSRSEKPIQIFNESTDGILSIDVNGHEIVAGS----- 117 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKN 536 G VY++ + + + + Sbjct: 118 ----ADGNYRVYSIRDGNMTVDYMGDSVNSVSFTPDSNCLLAGVMGG 160 Score = 27.4 bits (60), Expect = 1.2 Identities = 9/50 (18%), Positives = 17/50 (34%) Query: 435 GQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGA 484 G + D +K+ L + +Y G+ D S+ + G Sbjct: 3 GNYCVTCGSDKTVKLWNPLKSSLLKTYSGTGNEVLDAASSSDNSQIAAGG 52 >gi|1363708 [373..683] 7-bladed beta-propeller Length = 311 Score = 28.7 bits (63), Expect = 0.59 Identities = 11/78 (14%), Positives = 23/78 (29%), Gaps = 9/78 (11%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 + ++II D NIKI G+ S F + ++ + + Sbjct: 208 DISFDSKMIITSSADKNIKIWGLDFGDCHKSLFAHQDSIMNVKFLPQSHNFFSCS----- 262 Query: 490 TEKEEQGKVYVYALNQTR 507 + V + + Sbjct: 263 ----KDAVVKYWDGEKFE 276 >gi|2832331 [157..459] 7-bladed beta-propeller Length = 303 Score = 28.3 bits (62), Expect = 0.64 Identities = 15/91 (16%), Positives = 30/91 (32%), Gaps = 4/91 (4%) Query: 393 FNVESTKKNEPLASYLGYTVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKILQT 452 ++VES +K + S VL +A + G+ + I I + Sbjct: 17 WSVESGRKLHVIPRAKKGAEGKPPGHSSHVLCMA----ISSDGKYLASGDRSKLILIWEA 72 Query: 453 LSGEQIGSYFGSILTTTDIDKDSNTDILLVG 483 S + + ++ G + + T L Sbjct: 73 QSCQHLYTFTGHRDAVSGLAFRRGTHQLYST 103 >gi|2102675 [28..258] 7-bladed beta-propeller Length = 231 Score = 27.9 bits (61), Expect = 1.0 Identities = 6/73 (8%), Positives = 20/73 (27%), Gaps = 4/73 (5%) Query: 411 TVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTD 470 + + S + + ++ + I++ GE G Sbjct: 17 DDDDPKSISIGEKAYS----FALKNGKVVTAASNNAIQLHTFPDGEPDGILTRFTTNANH 72 Query: 471 IDKDSNTDILLVG 483 + +++ + G Sbjct: 73 VVFNTDGTRIAAG 85 >gi|2815132 [909..1205] 7-bladed beta-propeller Length = 297 Score = 27.6 bits (60), Expect = 1.1 Identities = 13/103 (12%), Positives = 33/103 (31%), Gaps = 9/103 (8%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 + Q +I D +++ +G+ I G + + + SN ++V Sbjct: 107 QLTSNNQFLITGSGDFVVQMWDVTNGKCISRMGGLMAPVSTLAITSNDAFVVVAC----- 161 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTT 532 E + V++ + +++ K Q + Sbjct: 162 ----EDETLKVFSTVGGQELHELMGHEGKVNSLVCAQDDCQLF 200 Score = 25.6 bits (55), Expect = 4.7 Identities = 5/54 (9%), Positives = 13/54 (23%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVG 483 ++ + + D + + S T S+ L+ Sbjct: 23 CSSNDSSLFVSTSFDKTVNVWVFSQSTPTMSLTHHTAKVTCAILTSDDQYLITA 76 >gi|1556399 [647..916] 7-bladed beta-propeller Length = 270 Score = 27.1 bits (59), Expect = 1.5 Identities = 12/94 (12%), Positives = 32/94 (33%), Gaps = 12/94 (12%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 ++ + ++ DG + ++ +G I S + + +L G+ Sbjct: 166 AFSPDSRHVLSTGTDGCLNVIDVQTGMLISSMTSDE---PQTCFVWDGNSVLSGS----- 217 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCS 523 + G++ V+ L + ++ TC Sbjct: 218 ----QSGELLVWDLLGAKISERIQGHTGAVTCIW 247 Score = 25.6 bits (55), Expect = 5.1 Identities = 10/110 (9%), Positives = 21/110 (19%), Gaps = 11/110 (10%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 +I D N+ G + + G + I + + L + Sbjct: 33 LLLPGDATVITSSWDNNVYFYSIAFGRRQDTLMGHDDAVSKICW--HDNRLYSAS----- 85 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKNEPC 539 V V++ + N Sbjct: 86 ----WDSTVKVWSGVPAEMPGTKRHHFDLLAELEHDVSVDTISLNAASTL 131 >gi|1181604 [175..454] 7-bladed beta-propeller Length = 280 Score = 26.8 bits (58), Expect = 2.2 Identities = 11/54 (20%), Positives = 22/54 (40%) Query: 431 YNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGA 484 Y+ + I D N+ I ++G+Q+ + T D N +L+ + Sbjct: 219 YSTGQKYIYTGSHDANVYIYDLVTGDQVSTLQYHKATVRDCSWHPNYPMLVSSS 272 >gi|586529 [222..473] 7-bladed beta-propeller Length = 252 Score = 26.4 bits (57), Expect = 2.4 Identities = 8/102 (7%), Positives = 32/102 (30%), Gaps = 10/102 (9%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 ++H G I+ +E+G +++ +G + + + + ++ Sbjct: 6 AWSHDGNSIVTGVENGELRLWN-KTGALLNVLNFHRAPIVSVKWNKDGTHIISMD----- 59 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCT 531 + ++ + L+ + ++ H+ Sbjct: 60 ----VENVTILWNVISGTVMQHFELKETGGSSINAENHSGDG 97 Score = 24.4 bits (52), Expect = 9.3 Identities = 8/54 (14%), Positives = 23/54 (41%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVG 483 +N T ++++ +DG ++I +G S++G + + ++ Sbjct: 145 EFNDTNKLLLSASDDGTLRIWHGGNGNSQNSFYGHSQSIVSASWVGDDKVISCS 198 >gi|2104446 [1048..1249] 7-bladed beta-propeller Length = 202 Score = 26.3 bits (57), Expect = 2.8 Identities = 11/49 (22%), Positives = 16/49 (32%) Query: 435 GQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVG 483 G + +YR + L T FG + + N LLV Sbjct: 12 GTIKLYRDYENEKVELVTSWNSLSDLVFGDRNASLLMSWQQNCGHLLVA 60 >gi|121286 [210..431] Double-stranded beta-helix Length = 222 Score = 26.0 bits (57), Expect = 3.5 Identities = 22/160 (13%), Positives = 48/160 (29%), Gaps = 16/160 (10%) Query: 313 NVSDELALVTIVKTLGERIFALEATADQSAASFEMEMSQTGFSAHYSQDWVML-----GA 367 ++ V +++ +R L+ D F + + H D++++ Sbjct: 13 LFKNQYGHVRVLQRFNKRSQQLQNLRDYRILEFNSKPNTLLLPHHADADYLIVILNGTAI 72 Query: 368 VGAYDWNGTVV--MQKASQIIIPRNTTFNVESTKKNEPLASYLGYTVNSATASSGDVLYI 425 + + + +Q + +P TTF V + +E L G T + + L + Sbjct: 73 LTLVNNDDRDSYNLQSGDALRVPAGTTFYVVNPDNDENLRMIAGTTFYVVNPDNDENLRM 132 Query: 426 AGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSI 465 G + S + SY Sbjct: 133 I---TLAIPVNK------PGRFESFFLSSTQAQQSYLQGF 163 >gi|1001440 [1028..1666] 7-bladed beta-propeller Length = 639 Score = 25.8 bits (55), Expect = 3.7 Identities = 12/107 (11%), Positives = 28/107 (25%), Gaps = 10/107 (9%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGAPMYMG 489 ++ GQ II DG ++ G+ +G+ G + ++ + Sbjct: 199 LWSPDGQHIITSSSDGTARVWS-RHGKCLGTLRGHDHNIHGARFSLDGQKIVTYS----- 252 Query: 490 TEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCSSRQHNSCTTENKN 536 ++ T + S T + + Sbjct: 253 ----TDNTARLWTKEGTLLTILRGHQKEVYDADFSADGRFVFTVSAD 295 Score = 24.6 bits (52), Expect = 9.8 Identities = 9/54 (16%), Positives = 19/54 (34%), Gaps = 1/54 (1%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVG 483 ++ GQ D +K+ Q GE++ + G + + L+ Sbjct: 35 DWSADGQYFATASADHTVKLWQ-RHGEEVATLRGHEDWVRSVHFSPHHQFLVTS 87 >gi|2736365 [143..399] 7-bladed beta-propeller Length = 257 Score = 25.9 bits (56), Expect = 3.7 Identities = 11/77 (14%), Positives = 24/77 (30%), Gaps = 10/77 (12%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYF-GSILTTTDIDKDSNTDILLVGAPMYM 488 ++ G + +D I+ + G+ T + D N +L+ G Sbjct: 153 TWSQAGNQFVSCSQDKTIRFWDLRQQTAVNVISPGNASPVTSVCVDPNGKLLVSGH---- 208 Query: 489 GTEKEEQGKVYVYALNQ 505 E V ++ + Sbjct: 209 -----EDASVALFDIGG 220 Score = 24.8 bits (53), Expect = 7.8 Identities = 11/111 (9%), Positives = 28/111 (24%), Gaps = 10/111 (9%) Query: 412 VNSATASSGDVLYIAGQPR-YNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTD 470 T G V + N + ++ + ++ + +G+ I S G + Sbjct: 37 EMELTCHDGTVRDLIFIESSINRSTILVSGGAGNCHLNVTDCNTGQLIQSMKGHTGSCMI 96 Query: 471 IDKDSNTDILLVGAPMYMGTEKEEQGKVYVYALNQTRFEYQMSLEPIKQTC 521 + + V ++T S ++ Sbjct: 97 VHLMLKISVTKFHI---------LNCTVKTMIYHKTFSTLFHSFREDEKVQ 138 >gi|1001939 [228..429] 7-bladed beta-propeller Length = 202 Score = 25.5 bits (55), Expect = 5.2 Identities = 5/55 (9%), Positives = 16/55 (29%) Query: 430 RYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTTTDIDKDSNTDILLVGA 484 ++ + + + + G + G T + + ++ GA Sbjct: 93 AFSPSQPLYACGSYGRTLGLYAWDDGSPLALLGGHQGGITHLCFHPDGNLFFSGA 147 >gi|1827706 [1..365] Cellulases Length = 365 Score = 25.4 bits (55), Expect = 5.4 Identities = 14/84 (16%), Positives = 24/84 (27%), Gaps = 11/84 (13%) Query: 368 VGAYDWNGTVVMQKASQIIIPRNTTFNVESTKK--NEPLASYLGYTVNSATASSGDVLYI 425 VG W + S + IP T + P +L + + Sbjct: 12 VGVTPWANAYYASEVSSLAIPSLTGAMATAAAAVAKVPSFMWLDTLDKTPLMEQ----TL 67 Query: 426 AGQPRYNH-----TGQVIIYRMED 444 A N GQ +++ + D Sbjct: 68 ADIRTANKNGGNYAGQFVVFDLPD 91 >gi|398985 [18..447] PLP-dependent transferases Length = 430 Score = 25.2 bits (54), Expect = 5.4 Identities = 20/166 (12%), Positives = 50/166 (30%), Gaps = 14/166 (8%) Query: 231 KEAFTEARGARRGVKKVMVIVT-----DGESHDNHRLKKVIQDCEDENIQRFSIAILGSY 285 K + +++ G + +K+I + + Sbjct: 174 KTFAYNNQQENNKNPPCIILHACCHNPTGLDPTKEQWEKIIDTIYELKMVPIVDMAYQGL 233 Query: 286 NRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV-TIVKTL---GERIFALEATADQS 341 GNL +++ + + N S+ + L + K + GER+ +L + Sbjct: 234 ESGNL-----LKDAYLLRLCLNVNKYPNWSNGIFLCQSFAKNMGLYGERVGSLSVITPAT 288 Query: 342 AASFEMEMSQTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIII 387 A + + Q S + D + V + + +++ Sbjct: 289 ANNGKFNPLQQKNSLQQNIDSQLKKIVRGMYSSPPGYGSRVVNVVL 334 >gi|2506563 [66..342] Periplasmic binding protein-like I Length = 277 Score = 24.9 bits (54), Expect = 7.0 Identities = 11/69 (15%), Positives = 22/69 (30%) Query: 1063 PTFIKSYFSSLNLTIRGELRSENASLVLSSSNQKRELAIQISKDGLPGRVPLWVILLSAF 1122 +++ L + L ++ + L + E Q L V VI +A Sbjct: 7 RDLSAPFYAELTAGLTEALEAQGRMVFLLHGGKDGEQLAQRFSLLLNQGVDGVVIAGAAG 66 Query: 1123 AGLLLLMLL 1131 + L + Sbjct: 67 SSDDLRRMA 75 >gi|2494897 [186..459] 7-bladed beta-propeller Length = 274 Score = 24.7 bits (53), Expect = 7.2 Identities = 15/115 (13%), Positives = 30/115 (26%), Gaps = 11/115 (9%) Query: 409 GYTVNSATASSGDVLYIAGQPRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGSILTT 468 + SG + + + + +DG + L S + I + Sbjct: 147 PDESHRMWRFSGQIERVTWNH---FSPCHFLASTDDGFVYNLDARSDKPIFTLNAHNDEI 203 Query: 469 TDIDKDSNTDILLVGAPMYMGTEKEEQGKVYVYALNQTRFEYQMSLEPIKQTCCS 523 + +D S LV A V ++ + R S + Sbjct: 204 SGLDLSSQIKGCLVTA--------SADKYVKIWDILGDRPSLVHSRDMKMGVLFC 250 >gi|2496115 [252..391] Flavodoxin-like Length = 140 Score = 24.8 bits (54), Expect = 7.4 Identities = 24/142 (16%), Positives = 46/142 (31%), Gaps = 20/142 (14%) Query: 199 YSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVT----DG 254 Y+ST+++ A + + G + +T+ TE A K +++ + Sbjct: 7 YNSTKKMAHAIAEGLMEKGVEVKIYRVCETSLSRIMTEILDA-----KYVLVGSPTVNRN 61 Query: 255 ESHDNHRLKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKS-----IASEPTEK 309 + K + + GSY +TEK E K+ + E Sbjct: 62 LYPEV---GKFLAYMDCIRPLDKIGVAFGSYGWMECATEKIKEIFKNLGFKIVDDECLTV 118 Query: 310 HFFNVSDELALVTIVKTLGERI 331 F + L G+R+ Sbjct: 119 RFAPKEEHL---KKCYEFGKRL 137 >gi|126343 [24..369] Periplasmic binding protein-like I Length = 346 Score = 25.0 bits (54), Expect = 7.5 Identities = 8/81 (9%), Positives = 23/81 (27%), Gaps = 4/81 (4%) Query: 220 TMTALGTDTARKEAFTEARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCEDENIQRFSI 279 M G D+++ + +++ +I + + + V + N I Sbjct: 114 IMRTAGLDSSQGPTAAKYILETVKPQRIAIIHDKQQ-YGEGLARSVQDGLKQGNAN---I 169 Query: 280 AILGSYNRGNLSTEKFVEEIK 300 G + ++ Sbjct: 170 VFFDGITAGEKDFSALIARLQ 190 >gi|2132510 [25..215] Thiamin-binding Length = 191 Score = 24.9 bits (54), Expect = 7.6 Identities = 6/40 (15%), Positives = 10/40 (25%), Gaps = 3/40 (7%) Query: 728 CTKHSFYMLDKHDFQD--SVRITLDFNL-TDPENGPVLDD 764 +F + + D R+ D E D Sbjct: 126 LHDSNFKGPNHKVYHDMVKDRVACSVAYLEDIETACDQVD 165 >gi|1176225 [21..201] Periplasmic binding protein-like II Length = 181 Score = 24.6 bits (53), Expect = 9.5 Identities = 4/36 (11%), Positives = 11/36 (30%) Query: 429 PRYNHTGQVIIYRMEDGNIKILQTLSGEQIGSYFGS 464 + + + +K + L G + G+ Sbjct: 64 TGVTYYDGIGFLTHDKAGLKSAKELDGATVCIQAGT 99 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 26 Number of calls to ALIGN: 32 Length of query: 1151 Total length of test sequences: 256703 Effective length of test sequences: 202633.0 Effective search space size: 224068545.9 Initial X dropoff for ALIGN: 25.0 bits