analysis of sequence from AAK62657.1.fa ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. MAEPPKLRVL MVSDFFFPNF GGVENHIYYL SQCLLKLGHK VVVMTHAYGN RSGVRYMTGG LKVYYVPWRP FVMQTTFPTV YGTLPIVRTI LRREKITVVH GHQAFSTLCH EALMHARTMG YKVVFTDHSL YGFADVGSIH MNKVLQFSLA DIDQAICVSH TSKENTVLRS GLSPAKVFMI PNAVDTAMFK PASVRPSTDI ITIVVISRLV YRKGADLLVE VIPEVCRLYP NVRFVVGGDG PKHVRLEEMR EKHSLQDRVE MLGAVPHSRV RSVLVTGHIF LNSSLTEAFC IAILEAASCG LLTVSTRVGG VPEVLPDDMV VLAEPDPDDM VRAIEKAISI LPTINPEEMH NRMKKLYSWQ DVAKRTEIVY DRALKCSNRS LLERLMRFLS CGAWAGKLFC MVMILDYLLW RLLQLLQPDE DIEEAPDICL CHHRGVEVSE GLRKKIK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > AAK62657.1 . . . . . 1 MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGN 50 ________EEEEE_______________HHHHHHHH___EEEEEE_____ . . . . . 51 RSGVRYMTGGLKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVH 100 ___EEEE____EEEEE___EEEEE_____EEEEEHHHHHHHHHHHEEEE_ . . . . . 101 GHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFSLA 150 ___HHHHHHHHHHHHHHH___EEEEEE_____________HHHHHHHHHH . . . . . 151 DIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAMFKPASVRPSTDI 200 HHHHEEEEE________________EEEE______EEE__________EE . . . . . 201 ITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLEEMR 250 EEEEEHHHHHH_____EEEEEEEEEE_____EEEEE_______HHHHHHH . . . . . 251 EKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCG 300 HHHHHHHHHHHH________EEEEEE___________HHHHHHHHH____ . . . . . 301 LLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMH 350 _EEEE_____________EEEEE______HHHHHHHHHH________HHH . . . . . 351 NRMKKLYSWQDVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFC 400 HHHHHH___HHHHHHHHHHHHH_______HHHHHHHHHHH_________E . . . . 401 MVMILDYLLWRLLQLLQPDEDIEEAPDICLCHHRGVEVSEGLRKKIK 447 EEHHHHHHHHHHHHHH___________EEEEE_______________ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 15.7 % beta-contents : 60.4 % coil-contents : 23.9 % class : mixed method : 2 alpha-contents : 2.8 % beta-contents : 58.8 % coil-contents : 38.4 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -17.11 -2.13 -4.22 -3.53 -4.00 0.00 -20.00 -0.07 -0.02 -6.37 -1.76 -12.00 -12.00 0.00 0.00 0.00 -83.19 -26.27 -0.49 -0.79 -0.41 -4.00 0.00 0.00 0.00 -0.52 -4.68 -1.76 -12.00 -12.00 -12.00 0.00 0.00 -74.92 ID: AAK62657.1 AC: xxx Len: 447 1:I 425 Sc: -74.92 Pv: 6.725736e-01 NO_GPI_SITE GPI: learning from protozoa -39.93 -0.49 -0.27 -0.72 -4.00 0.00 0.00 0.00 -0.09 -4.14 -6.94 -12.00 -12.00 -12.00 0.00 0.00 -92.57 -35.82 -3.47 -1.24 -0.09 -4.00 0.00 0.00 0.00 0.00 -4.48 -6.94 -12.00 -12.00 -12.00 0.00 0.00 -92.03 ID: AAK62657.1 AC: xxx Len: 447 1:I 428 Sc: -92.03 Pv: 7.331901e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAK62657.1 0.380 289 Y 0.367 419 Y 0.964 405 Y 0.182 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAK62657.1 0.553 396 Y 0.331 396 N 0.849 384 N 0.172 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAK62657.1 0.485 289 Y 0.336 425 N 0.968 393 Y 0.271 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. 1-404 MAEPPKLRVLMVSDFFFPNFGGVENHIYYL SQCLLKLGHKVVVMTHAYGNRSGVRYMTGG LKVYYVPWRPFVMQTTFPTVYGTLPIVRTI LRREKITVVHGHQAFSTLCHEALMHARTMG YKVVFTDHSLYGFADVGSIHMNKVLQFSLA DIDQAICVSHTSKENTVLRSGLSPAKVFMI PNAVDTAMFKPASVRPSTDIITIVVISRLV YRKGADLLVEVIPEVCRLYPNVRFVVGGDG PKHVRLEEMREKHSLQDRVEMLGAVPHSRV RSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDM VRAIEKAISILPTINPEEMHNRMKKLYSWQ DVAKRTEIVYDRALKCSNRSLLERLMRFLS CGAWAGKLFCMVMI ldyllwrllqll 405-416 417-447 QPDEDIEEAPDICLCHHRGVEVSEGLRKKI K low complexity regions: SEG 25 3.0 3.3 >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. 1-306 MAEPPKLRVLMVSDFFFPNFGGVENHIYYL SQCLLKLGHKVVVMTHAYGNRSGVRYMTGG LKVYYVPWRPFVMQTTFPTVYGTLPIVRTI LRREKITVVHGHQAFSTLCHEALMHARTMG YKVVFTDHSLYGFADVGSIHMNKVLQFSLA DIDQAICVSHTSKENTVLRSGLSPAKVFMI PNAVDTAMFKPASVRPSTDIITIVVISRLV YRKGADLLVEVIPEVCRLYPNVRFVVGGDG PKHVRLEEMREKHSLQDRVEMLGAVPHSRV RSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVST rvggvpevlpddmvvlaepdpddmvra 307-333 334-404 IEKAISILPTINPEEMHNRMKKLYSWQDVA KRTEIVYDRALKCSNRSLLERLMRFLSCGA WAGKLFCMVMI ldyllwrllqll 405-416 417-447 QPDEDIEEAPDICLCHHRGVEVSEGLRKKI K low complexity regions: SEG 45 3.4 3.75 >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. 1-447 MAEPPKLRVLMVSDFFFPNFGGVENHIYYL SQCLLKLGHKVVVMTHAYGNRSGVRYMTGG LKVYYVPWRPFVMQTTFPTVYGTLPIVRTI LRREKITVVHGHQAFSTLCHEALMHARTMG YKVVFTDHSLYGFADVGSIHMNKVLQFSLA DIDQAICVSHTSKENTVLRSGLSPAKVFMI PNAVDTAMFKPASVRPSTDIITIVVISRLV YRKGADLLVEVIPEVCRLYPNVRFVVGGDG PKHVRLEEMREKHSLQDRVEMLGAVPHSRV RSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDM VRAIEKAISILPTINPEEMHNRMKKLYSWQ DVAKRTEIVYDRALKCSNRSLLERLMRFLS CGAWAGKLFCMVMILDYLLWRLLQLLQPDE DIEEAPDICLCHHRGVEVSEGLRKKIK low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGG LKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMG YKVVFTDHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMI PNAVDTAMFKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDG PKHVRLEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMHNRMKKLYSWQ DVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCMVMILDYLLWRLLQLLQPDE DIEEAPDICLCHHRGVEVSEGLRKKIK 1 - 447 MAEPPKLRVL MVSDFFFPNF GGVENHIYYL SQCLLKLGHK VVVMTHAYGN RSGVRYMTGG LKVYYVPWRP FVMQTTFPTV YGTLPIVRTI LRREKITVVH GHQAFSTLCH EALMHARTMG YKVVFTDHSL YGFADVGSIH MNKVLQFSLA DIDQAICVSH TSKENTVLRS GLSPAKVFMI PNAVDTAMFK PASVRPSTDI ITIVVISRLV YRKGADLLVE VIPEVCRLYP NVRFVVGGDG PKHVRLEEMR EKHSLQDRVE MLGAVPHSRV RSVLVTGHIF LNSSLTEAFC IAILEAASCG LLTVSTRVGG VPEVLPDDMV VLAEPDPDDM VRAIEKAISI LPTINPEEMH NRMKKLYSWQ DVAKRTEIVY DRALKCSNRS LLERLMRFLS CGAWAGKLFC MVMILDYLLW RLLQLLQPDE DIEEAPDICL CHHRGVEVSE GLRKKIK low complexity regions: DUST >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGG LKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMG YKVVFTDHSLYGFADVGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMI PNAVDTAMFKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDG PKHVRLEEMREKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMHNRMKKLYSWQ DVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCMVMILDYLLWRLLQLLQPDE DIEEAPDICLCHHRGVEVSEGLRKKIK ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for AAK62657.1 sequence: 447 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MAEPPKLRVL MVSDFFFPNF GGVENHIYYL SQCLLKLGHK VVVMTHAYGN RSGVRYMTGG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 LKVYYVPWRP FVMQTTFPTV YGTLPIVRTI LRREKITVVH GHQAFSTLCH EALMHARTMG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 YKVVFTDHSL YGFADVGSIH MNKVLQFSLA DIDQAICVSH TSKENTVLRS GLSPAKVFMI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 PNAVDTAMFK PASVRPSTDI ITIVVISRLV YRKGADLLVE VIPEVCRLYP NVRFVVGGDG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 PKHVRLEEMR EKHSLQDRVE MLGAVPHSRV RSVLVTGHIF LNSSLTEAFC IAILEAASCG ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~111111111 1111111111 11~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~6666 6666666666 551~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 LLTVSTRVGG VPEVLPDDMV VLAEPDPDDM VRAIEKAISI LPTINPEEMH NRMKKLYSWQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 DVAKRTEIVY DRALKCSNRS LLERLMRFLS CGAWAGKLFC MVMILDYLLW RLLQLLQPDE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . DIEEAPDICL CHHRGVEVSE GLRKKIK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~ ---------- ---------- ------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** AAK62657.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: AAK62657.1.fa.___inter___ (1 sequences) MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGN RSGVRYMTGGLKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVH GHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFSLA DIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAMFKPASVRPSTDI ITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLEEMR EKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMH NRMKKLYSWQDVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFC MVMILDYLLWRLLQLLQPDEDIEEAPDICLCHHRGVEVSEGLRKKIK (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 70 90 0.933 Putative 2 286 306 1.202 Certain 3 388 408 1.221 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 69 195 81 39 K+R profile + + + 6.00 CYT-EXT prof 1.65 -0.12 1.14 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -6.00 Tm probability: 0.83 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): -0.4545 NEG: 3.0000 POS: 8.0000 -> Orientation: undecided CYT-EXT difference: 0.40 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 Loop length 285 81 39 K+R profile + 6.00 + CYT-EXT prof 1.48 - -0.12 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 3.00 (NEG-POS)/(NEG+POS): -0.1852 NEG: 22.0000 POS: 32.0000 -> Orientation: N-in CYT-EXT difference: 1.60 -> Orientation: N-out ---------------------------------------------------------------------- "AAK62657" 447 70 90 #f 0.933333 286 306 #t 1.20208 388 408 #t 1.22083 ************************************ *TOPPREDM with prokaryotic function* ************************************ AAK62657.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: AAK62657.1.fa.___inter___ (1 sequences) MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGN RSGVRYMTGGLKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVH GHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFSLA DIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAMFKPASVRPSTDI ITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLEEMR EKHSLQDRVEMLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCG LLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISILPTINPEEMH NRMKKLYSWQDVAKRTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFC MVMILDYLLWRLLQLLQPDEDIEEAPDICLCHHRGVEVSEGLRKKIK (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 2 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 70 90 0.933 Putative 2 286 306 1.202 Certain 3 388 408 1.221 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 Loop length 69 195 81 39 K+R profile + + + 6.00 CYT-EXT prof 1.65 -0.12 1.14 - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -6.00 Tm probability: 0.83 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): -0.4545 NEG: 3.0000 POS: 8.0000 -> Orientation: undecided CYT-EXT difference: 0.40 -> Orientation: N-out ---------------------------------------------------------------------- Structure 2 Transmembrane segments included in this structure: Segment 2 3 Loop length 285 81 39 K+R profile + 6.00 + CYT-EXT prof 1.48 - -0.12 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 6.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 3.00 (NEG-POS)/(NEG+POS): -0.1852 NEG: 22.0000 POS: 32.0000 -> Orientation: N-in CYT-EXT difference: 1.60 -> Orientation: N-out ---------------------------------------------------------------------- "AAK62657" 447 70 90 #f 0.933333 286 306 #t 1.20208 388 408 #t 1.22083 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ SAPS. Version of April 11, 1996. Date run: Thu Nov 22 13:12:27 2001 File: /people/b_eisen/AAK62657.1.fa.___saps___ ID AAK62657.1 DE AT3g45100/T14D3_40 [Arabidopsis thaliana]. number of residues: 447; molecular weight: 50.4 kdal 1 MAEPPKLRVL MVSDFFFPNF GGVENHIYYL SQCLLKLGHK VVVMTHAYGN RSGVRYMTGG 61 LKVYYVPWRP FVMQTTFPTV YGTLPIVRTI LRREKITVVH GHQAFSTLCH EALMHARTMG 121 YKVVFTDHSL YGFADVGSIH MNKVLQFSLA DIDQAICVSH TSKENTVLRS GLSPAKVFMI 181 PNAVDTAMFK PASVRPSTDI ITIVVISRLV YRKGADLLVE VIPEVCRLYP NVRFVVGGDG 241 PKHVRLEEMR EKHSLQDRVE MLGAVPHSRV RSVLVTGHIF LNSSLTEAFC IAILEAASCG 301 LLTVSTRVGG VPEVLPDDMV VLAEPDPDDM VRAIEKAISI LPTINPEEMH NRMKKLYSWQ 361 DVAKRTEIVY DRALKCSNRS LLERLMRFLS CGAWAGKLFC MVMILDYLLW RLLQLLQPDE 421 DIEEAPDICL CHHRGVEVSE GLRKKIK -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 26( 5.8%); C : 11( 2.5%); D : 21( 4.7%); E : 25( 5.6%); F : 17( 3.8%) G : 26( 5.8%); H : 17( 3.8%); I : 25( 5.6%); K : 22( 4.9%); L : 48(10.7%) M : 19( 4.3%); N : 11( 2.5%); P : 23( 5.1%); Q : 9( 2.0%); R : 30( 6.7%) S : 27( 6.0%); T : 22( 4.9%); V+ : 50(11.2%); W : 4( 0.9%); Y : 14( 3.1%) KR : 52 ( 11.6%); ED : 46 ( 10.3%); AGP : 75 ( 16.8%); KRED : 98 ( 21.9%); KR-ED : 6 ( 1.3%); FIKMNY : 108 ( 24.2%); LVIFM : 159 ( 35.6%); ST : 49 ( 11.0%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 00-00+0+00 000-000000 000-000000 00000+000+ 0000000000 +000+00000 61 0+000000+0 0000000000 0000000+00 0++-+00000 0000000000 -00000+000 121 0+0000-000 0000-00000 00+0000000 -0-0000000 00+-0000+0 00000+0000 181 0000-0000+ 0000+000-0 0000000+00 0++00-000- 000-00+000 00+00000-0 241 0+00+0--0+ -+0000-+0- 00000000+0 +000000000 000000-000 0000-00000 301 000000+000 00-000--00 000-0-0--0 0+00-+0000 000000--00 0+0++00000 361 -00++0-000 -+00+000+0 00-+00+000 000000+000 00000-0000 +0000000-- 421 -0--00-000 000+00-00- 00+++0+ A. CHARGE CLUSTERS. Positive charge clusters (cmin = 10/30 or 13/45 or 16/60): none Negative charge clusters (cmin = 9/30 or 12/45 or 15/60): none Mixed charge clusters (cmin = 15/30 or 20/45 or 25/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 37 | 10 | 9 | 13 | 12 | 11 | 15 | 7 | 9 | lmin1 6 | 6 | 9 | 45 | 12 | 11 | 15 | 14 | 14 | 18 | 9 | 12 | lmin2 7 | 7 | 10 | 50 | 13 | 13 | 17 | 16 | 15 | 20 | 10 | 13 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 3: 1, at 443; - runs >= 3: 1, at 419; * runs >= 5: 0 0 runs >= 25: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-32-C-75-C-47-C-68-C-63-C-8-C-76-C-14-C-8-C-28-C-1-C-16-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-25-H-6-C-5-H-6-H-53-H-1-H-6-C-H-4-H-12-H-11-H-16-C-2-H-65-C-16-H-9-H-13-H-10-H-11-C-8-C-50-H-25-C-14-C-8-C-28-C-1-C-H-H-14-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 4 Aligned matching blocks: [ 316- 320] PDDMV [ 327- 331] PDDMV B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 8 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 34 (Expected range: 10-- 41) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 12 (6-10) 11 (11-20) 7 (>=21) 5 3. Clusters of amino acid multiplets (cmin = 13/30 or 16/45 or 19/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 11 (Expected range: 0-- 19) 5 +plets (f+: 11.6%), 6 -plets (f-: 10.3%) Total number of charge altplets: 7 (Critical number: 22) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 2 (6-10) 3 (11-20) 2 (>=21) 5 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors There are no periodicities of the prescribed length. B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core:10) Location Period Element Copies Core Errors 9- 50 7 i.00... 6 6 /0/./1/2/./././ 227- 262 6 *..... 6 6 0 281- 350 10 i.0..0.... 7 7 /0/./2/././2/././././ 367- 390 4 *... 6 6 0 -------------------------------------------------------------------------------- SPACING ANALYSIS. There are no unusual spacings. ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- Glycos_transf_1 Glycosyl transferases group 1 107.0 1.5e-28 1 DEAD DEAD/DEAH box helicase -0.8 75 1 DnaB DnaB-like helicase -1.9 70 1 oxidored_q6 NADH ubiquinone oxidoreductase, 20 Kd -55.3 46 1 HrcA HrcA protein C terminal domain -118.3 86 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- DnaB 1/1 7 21 .. 1 15 [. -1.9 70 DEAD 1/1 286 307 .. 37 56 .. -0.8 75 HrcA 1/1 141 315 .. 1 229 [] -118.3 86 oxidored_q6 1/1 259 338 .. 1 145 [] -55.3 46 Glycos_transf_1 1/1 186 339 .. 1 170 [. 107.0 1.5e-28 Alignments of top-scoring domains: DnaB: domain 1 of 1, from 7 to 21: score -1.9, E = 70 *->lekLepddFYfpaHr<-* l++L +dF+fp+++ AAK62657.1 7 LRVLMVSDFFFPNFG 21 DEAD: domain 1 of 1, from 286 to 307: score -0.8, E = 75 *->TlAfllPilerll..lvlaPTR<-* T+Af+++ile+ +l+ + TR AAK62657.1 286 TEAFCIAILEAAScgLLTVSTR 307 HrcA: domain 1 of 1, from 141 to 315: score -118.3, E = 86 *->lekllqkaasiLSeLTnytavvlgPekskdrLkhielipLdphqaLa + k+lq + L+++ ++++v + k++ L++ L+p+++ + AAK62657.1 141 MNKVLQFS---LADIDQAICVSHTS-KENTVLRS----GLSPAKVFM 179 vLVtdsGhVenrtitlPanisesdLekasniLndrLrGltlseihqklet + V++ ++ + +++ d+ +++ i +rL+ ++ +++ ++++ AAK62657.1 180 I----PNAVDTAMFKPASVRPSTDIITIVVI--SRLVYRKGADLLVEVIP 223 elpqllseygelakvydeildlfesileedsseeklyvgGksNLlnqped e+ l ++ +++ +d +++ e+++e + ++ + L + p+ AAK62657.1 224 EVCRLYPNV-----RFVVGGDGPKHVRLEEMREKHSLQDRVEMLGAVPH- 267 FkDleklrslLsllEqervlirlLladednknasgvtvrIGkEneslsdc +++rs+L +++ i+l+ + + ++I + ++c AAK62657.1 268 ----SRVRSVL----VTG-HIFLNSSLTE-----AFCIAI----LEAASC 299 SvItapYkigeekplGalgiiGPTRMdYqrvi<-* + t+ ++g+ + v+ AAK62657.1 300 GLLTVSTRVGGV----------------PEVL 315 oxidored_q6: domain 1 of 1, from 259 to 338: score -55.3, E = 46 *->vEvvyaplapryDderfGvilreiPrqaDvalVeGgVtlknaenlel vE++ a + +r +lV+G + +l AAK62657.1 259 VEMLGAVPHS---RVR-------------SVLVTGHIF--LNSSLTE 287 iyeqrpdpKiVIAmGaCAvsGGvfrmrnlggeskeevLqeaYietattkn + ++IA A++G + AAK62657.1 288 A--------FCIAILEAASCGLLTV------------------------- 304 kgqvPsepiVPplysvvgPvdkvIPVDvyiPGCPPrPEailyaitall<- ++vg+v +v P D + P+P+++++ai +++ AAK62657.1 305 -------------STRVGGVPEVLPDDMVVL-AEPDPDDMVRAIEKAI 338 * AAK62657.1 - - Glycos_transf_1: domain 1 of 1, from 186 to 339: score 107.0, E = 1.5e-28 *->dreeirkklgikedkkiilfvGRlvpeKGidllieAfkkLkkkpkll +++++ + +d ++i+ ++Rlv++KG dll+e+++++++ AAK62657.1 186 TAMFKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCR----- 227 klnpnlkLvivGgpYdsedgeeedelkklaeklglednviflGfvpdedl l+pn+++v+ G dg++ +l+++ ek+ l+d+v++lG vp++ + AAK62657.1 228 -LYPNVRFVVGG------DGPKHVRLEEMREKHSLQDRVEMLGAVPHSRV 270 pelyksadvfvlPSryEgFGivllEAmAcGlPVIatncvgGipEvvkdge ++++ ++f+ S +E+F+i++lEA +cGl ++t vgG+pEv+ d+ AAK62657.1 271 RSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTR-VGGVPEVLPDDM 319 tGllvepgqdpealaeaiekllk<-* ++ +++ dp+++++aiek+++ AAK62657.1 320 --VVLAEP-DPDDMVRAIEKAIS 339 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- Glycos_transf_1 Glycosyl transferases group 1 107.0 1.5e-28 1 oxidored_q6 NADH ubiquinone oxidoreductase, 20 Kd 2.0 26 1 RuvA RuvA N terminal domain 1.9 12 1 LYTB LytB protein 0.4 31 1 DUF166 Uncharacterized ArCR, COG1810 -0.2 69 1 DEAD DEAD/DEAH box helicase -0.8 75 1 DnaB DnaB-like helicase -1.9 70 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- DnaB 1/1 7 21 .. 1 15 [. -1.9 70 RuvA 1/1 128 133 .. 63 68 .] 1.9 12 DEAD 1/1 286 307 .. 37 56 .. -0.8 75 oxidored_q6 1/1 325 338 .. 132 145 .] 2.0 26 Glycos_transf_1 1/1 186 339 .. 1 170 [. 107.0 1.5e-28 DUF166 1/1 330 340 .. 275 285 .] -0.2 69 LYTB 1/1 327 341 .. 329 343 .] 0.4 31 Alignments of top-scoring domains: DnaB: domain 1 of 1, from 7 to 21: score -1.9, E = 70 *->lekLepddFYfpaHr<-* l++L +dF+fp+++ AAK62657.1 7 LRVLMVSDFFFPNFG 21 RuvA: domain 1 of 1, from 128 to 133: score 1.9, E = 12 *->hlLYGF<-* h+LYGF AAK62657.1 128 HSLYGF 133 DEAD: domain 1 of 1, from 286 to 307: score -0.8, E = 75 *->TlAfllPilerll..lvlaPTR<-* T+Af+++ile+ +l+ + TR AAK62657.1 286 TEAFCIAILEAAScgLLTVSTR 307 oxidored_q6: domain 1 of 1, from 325 to 338: score 2.0, E = 26 *->PrPEailyaitall<-* P+P+++++ai +++ AAK62657.1 325 PDPDDMVRAIEKAI 338 Glycos_transf_1: domain 1 of 1, from 186 to 339: score 107.0, E = 1.5e-28 *->dreeirkklgikedkkiilfvGRlvpeKGidllieAfkkLkkkpkll +++++ + +d ++i+ ++Rlv++KG dll+e+++++++ AAK62657.1 186 TAMFKPASVRPSTDIITIVVISRLVYRKGADLLVEVIPEVCR----- 227 klnpnlkLvivGgpYdsedgeeedelkklaeklglednviflGfvpdedl l+pn+++v+ G dg++ +l+++ ek+ l+d+v++lG vp++ + AAK62657.1 228 -LYPNVRFVVGG------DGPKHVRLEEMREKHSLQDRVEMLGAVPHSRV 270 pelyksadvfvlPSryEgFGivllEAmAcGlPVIatncvgGipEvvkdge ++++ ++f+ S +E+F+i++lEA +cGl ++t vgG+pEv+ d+ AAK62657.1 271 RSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTR-VGGVPEVLPDDM 319 tGllvepgqdpealaeaiekllk<-* ++ +++ dp+++++aiek+++ AAK62657.1 320 --VVLAEP-DPDDMVRAIEKAIS 339 DUF166: domain 1 of 1, from 330 to 340: score -0.2, E = 69 *->hkeAverAigv<-* +++A+e+Ai + AAK62657.1 330 MVRAIEKAISI 340 LYTB: domain 1 of 1, from 327 to 341: score 0.4, E = 31 *->PdwiveaViakLrel<-* Pd++v+a++++++ l AAK62657.1 327 PDDMVRAIEKAISIL 341 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Thu Nov 22 13:14:28 2001 Sequence file: AAK62657.1.fa ---------------------------------------- Sequence AAK62657.1 (447 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 50: NRSG 282: NSSL 378: NRSL Total matches: 3 Matching pattern PS00005 PKC_PHOSPHO_SITE: 161: TSK 193: SVR 305: STR 377: SNR Total matches: 4 Matching pattern PS00006 CK2_PHOSPHO_SITE: 148: SLAD 161: TSKE 254: SLQD 284: SLTE 358: SWQD 380: SLLE Total matches: 6 Matching pattern PS00008 MYRISTYL: 21: GGVENH 49: GNRSGV 171: GLSPAK 392: GAWAGK 435: GVEVSE Total matches: 5 Total no of hits in this sequence: 18 ======================================== 1314 pattern(s) searched in 1 sequence(s), 447 residues. Total no of hits in all sequences: 18. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M minimal SH3 binding is the MOTIF name >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. ;LENGTH=447; DIRECT_SEQUENCE n 1 solutions m %_PXXP 67-70 f >STATISTICS Total : 1 solutions in 1 sequences, 447 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M deubiquitinating enzyme SH3 domain binding motif (Kato, 2000) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M minimal class I consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M minimal class II consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M exact 14-3-3 binding consensus (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M 14-3-3 binding motif in RAF and others (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M WW domain binding motif in formins (Bedford 1997) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >INTRA-SIGNAL-M PY motif for WW domain is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >TM-CYTOPLASMIC-M di-hydrophobic endocytosis motifs for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >TM-CYTOPLASMIC-M tyrosine-based endocytosis motif for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >TM-EXTRACELL-M Endocytosis signal for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >EXTRACELL-M minimal furin protease cleavage site motif is the MOTIF name >AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. ;LENGTH=447; DIRECT_SEQUENCE n 1 solutions m %_RXXR 384-387 f >STATISTICS Total : 1 solutions in 1 sequences, 447 units; out of 1 sequences, 447 units >EXTRACELL-M extended furin protease cleavage site motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >EXTRACELL-M zinc binding motif in MMPs is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >EXTRACELL-M g alpha binding go loco is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS PDX-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS QKI-5 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS HCDA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SV40 LrgT experimentally determined is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS H2B experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS v-Rel experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS RanBP3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Pho4p experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS DNAhelicaseQ1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS LEF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS TCF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR p53-NLS1 NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hum-Ku70 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS GAL4 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS act/inh betaA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS TR2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS THOV NP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS polyomaVP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS HIV-1 Tat experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS HIV-1 Rev experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Rex experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS NS5A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS adenovE1a experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS ystDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hVDR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS CPV capsid experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hGlu.cort.experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS cFOS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS cJUN experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hBLM experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hARNT experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS p54 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS hProTalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Tst1/Oct6 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS protHsc9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS protHsci experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS protHsc3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Ta alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Pax-QNR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Hunt.Dis.pro experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS opaque2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS CTP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS p110RB1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS VirD2-Nterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS VirD2-Cterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Nucloplasmin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Nucleolin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS ICP-8 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Nab2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS M9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS lscMyc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS humKprotein experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS FluA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Mat-alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SV40 VP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS SV40 VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS polyoma VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS c-myb experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS N-myc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS p53 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS c-erb-A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS yeast SKI3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS Max experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS L3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >NUCLEAR NLS dyskerin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >PDZ domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units >WW domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 447 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB+ PSSM from other authors IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. (447 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value ARM Armadillo repeat 23 0.71 CALC Calcineurin like Phosphoesterase domain 22 1.1 BRCT BRCA C-terminal domain 22 1.4 INSL Insulinase like Metallo protease domain 21 3.1 RASGEF RAS-type GTPase GDP exchange factor 21 4.2 MATH The Meprin associated TRAF homology domain 21 4.3 KH HnRNP K homology domain (A RNA binding domain) 20 5.1 DSP Dual specificity protein phosphatase 20 5.7 S1 S1 RNA binding domain 20 7.8 CYCLIN Cyclin/TFIIB domain 20 8.4 RHOGEF RHO-type GTPase GDP exchange factor 19 8.5 >ARM Armadillo repeat Length = 532 Score = 23.4 bits (50), Expect = 0.71 Identities = 6/50 (12%), Positives = 6/50 (12%), Gaps = 4/50 (8%) Query: 292 AILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVRAIEKAISIL 341 Sbjct: 55 ALQGFPSASAASVDKKLDSLKDMVA----GVWSDDPALQLESTTQFRKLL 100 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 22.5 bits (47), Expect = 1.1 Identities = 11/108 (10%), Positives = 11/108 (10%), Gaps = 14/108 (12%) Query: 1 MAEPPKLRVLMVSDFFFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGG 60 Sbjct: 50 QENNEFDVILATGDLVQDSSDEGYIRFVEM---MKPFNKPVFW---IPGNHDFQPKMAEF 103 Query: 61 LKVYYVPWRPFVMQTTFPTVYGTLPIVRTILRREKITVVHGHQAFSTL 108 Sbjct: 104 LNQPPMNAAKHLL------LGEHWQALLLD--SQVYGVPHGQLSQHQL 143 Score = 21.7 bits (45), Expect = 1.8 Identities = 8/63 (12%), Positives = 8/63 (12%), Gaps = 11/63 (17%) Query: 188 MFKPASVRPSTDIITIVVIS----------RLVYRKGADLLVEVIPEVCRLYPNVRFVV- 236 Sbjct: 1 MKNTFVYQAEKPVIKLLQITDPHLFKDESAELLGVNTQASFAQVLKEIQQENNEFDVILA 60 Query: 237 GGD 239 Sbjct: 61 TGD 63 >BRCT BRCA C-terminal domain Length = 138 Score = 22.2 bits (47), Expect = 1.4 Identities = 8/41 (19%), Positives = 8/41 (19%), Gaps = 4/41 (9%) Query: 224 EVCRLYPNVRFVVGGDGPKHVRLEEMREKHSLQDRVEMLGA 264 Sbjct: 2 ERHKVLDGCVIVFSGIVPM----GEKLERTDIYRLCTQFGA 38 >INSL Insulinase like Metallo protease domain Length = 433 Score = 21.0 bits (44), Expect = 3.1 Identities = 9/43 (20%), Positives = 9/43 (20%), Gaps = 4/43 (9%) Query: 405 LDYLLWRLLQLLQP----DEDIEEAPDICLCHHRGVEVSEGLR 443 Sbjct: 114 WKQALEVLYQLTMKATLDEEMIEKEKPIVIEELRRGKDNPTTV 156 >RASGEF RAS-type GTPase GDP exchange factor Length = 196 Score = 20.5 bits (43), Expect = 4.2 Identities = 3/20 (15%), Positives = 3/20 (15%) Query: 321 VLAEPDPDDMVRAIEKAISI 340 Sbjct: 75 ILLTKSSKMKRNVIQRFIHV 94 >MATH The Meprin associated TRAF homology domain Length = 209 Score = 20.6 bits (43), Expect = 4.3 Identities = 2/38 (5%), Positives = 2/38 (5%), Gaps = 4/38 (10%) Query: 36 KLGHKVVVMTHAYGNRS----GVRYMTGGLKVYYVPWR 69 Sbjct: 33 SYMWTINNFSFCREEMGEVIKSSTFSSGANDKLKWCLR 70 >KH HnRNP K homology domain (A RNA binding domain) Length = 92 Score = 20.2 bits (42), Expect = 5.1 Identities = 5/38 (13%), Positives = 5/38 (13%) Query: 240 GPKHVRLEEMREKHSLQDRVEMLGAVPHSRVRSVLVTG 277 Sbjct: 43 GKSGDTIRQLQEKSGCKMILVQDNQSVSDQSKPLRITG 80 >DSP Dual specificity protein phosphatase Length = 185 Score = 20.3 bits (42), Expect = 5.7 Identities = 6/27 (22%), Positives = 6/27 (22%) Query: 27 IYYLSQCLLKLGHKVVVMTHAYGNRSG 53 Sbjct: 106 ADFIDQALAQKNGRVLVHCREGYSRSP 132 >S1 S1 RNA binding domain Length = 305 Score = 19.8 bits (41), Expect = 7.8 Identities = 12/82 (14%), Positives = 12/82 (14%), Gaps = 15/82 (18%) Query: 136 VGSIHMNKVLQFSLADIDQAICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAMFKPASVR 195 Sbjct: 193 AGNIYEGKVAKIQ----PYGVFVEIEGVTGLLHVSQVSGTRVDSLNTLFAFG-------- 240 Query: 196 PSTDIITIVVISRLVYRKGADL 217 Sbjct: 241 ---QAISVYVQEIDEYKNRISL 259 >CYCLIN Cyclin/TFIIB domain Length = 317 Score = 19.5 bits (40), Expect = 8.4 Identities = 7/28 (25%), Positives = 7/28 (25%) Query: 388 FLSCGAWAGKLFCMVMILDYLLWRLLQL 415 Sbjct: 121 GNLRESPLGQEKALEQILEYELLLIQQL 148 >RHOGEF RHO-type GTPase GDP exchange factor Length = 176 Score = 19.4 bits (40), Expect = 8.5 Identities = 9/41 (21%), Positives = 9/41 (21%) Query: 365 RTEIVYDRALKCSNRSLLERLMRFLSCGAWAGKLFCMVMIL 405 Sbjct: 7 QTERVYVRELYTVLLGYRAEMDNPEMFDLMPPLLRNKKDIL 47 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 11 Number of calls to ALIGN: 12 Length of query: 447 Total length of test sequences: 20182 Effective length of test sequences: 16637.0 Effective search space size: 6884215.9 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. (447 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|2039378 [21..454] PLP-dependent transferases 28 0.41 gi|232060 [143..460] Subtilases 26 1.1 gi|576299 [1..317] alpha/beta-Hydrolases 25 2.6 gi|1175888 [4..184] Flavodoxin-like 24 3.3 gi|2500689 [29..252] Periplasmic binding protein-like I 25 3.5 gi|2414449 [42..312] alpha/beta-Hydrolases 24 4.4 gi|126343 [24..369] Periplasmic binding protein-like I 24 4.5 gi|2246648 [5..196] NAD(P)-binding Rossmann-fold domains 24 5.2 gi|1518938 [387..612] Heat shock protein 70kD (HSP70), C-ter... 24 5.2 gi|1945717 [15..256] alpha/beta-Hydrolases 24 5.4 gi|1652197 [32..277] alpha/beta-Hydrolases 24 5.4 gi|1772352 [27..300] alpha/beta-Hydrolases 24 5.4 gi|2209100 [9..467] PLP-dependent transferases 24 5.6 gi|1352909 [83..296] alpha/beta-Hydrolases 24 6.4 gi|293973 [1..262] NAD(P)-binding Rossmann-fold domains 24 6.9 gi|417431 [2..400] beta/alpha (TIM)-barrel 23 7.7 gi|882670 [155..332] Ribonuclease H-like motif 23 8.5 gi|1536958 [63..316] Periplasmic binding protein-like I 23 8.9 >gi|2039378 [21..454] PLP-dependent transferases Length = 434 Score = 27.5 bits (60), Expect = 0.41 Identities = 15/148 (10%), Positives = 15/148 (10%), Gaps = 6/148 (4%) Query: 34 LLKLGHKVVVMTHAYGNRSGVRYM-TGGLKVYYVPWRPFVMQTTF-PTVYGTLPIVRTIL 91 Sbjct: 280 FESQDALPDMVTMAKGIGNGFPLGAVVTSKEIADSFNKSLYFNTYGGNPLASVVGKAVLE 339 Query: 92 RREKITVVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMN----KVLQF 147 Sbjct: 340 VIEEEKLQENSAVVGDYFLKQLAAIDDATIGDVRGKGLMIGVELIDEQGKPLAAAKTAAI 399 Query: 148 SLADIDQAICVSHTSKENTVLRSGLSPA 175 Sbjct: 400 FEDTKNQGLLIGKGGIHGNVLRIKPPMC 427 >gi|232060 [143..460] Subtilases Length = 318 Score = 26.1 bits (57), Expect = 1.1 Identities = 13/78 (16%), Positives = 13/78 (16%), Gaps = 4/78 (5%) Query: 261 MLGAVPHSRVRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMV 320 Sbjct: 66 LKGVNPGVEMNVYRVFGSKKSEMLWVSK---GIIDAANDDNDVINVSLGN-YLIKDNQNK 121 Query: 321 VLAEPDPDDMVRAIEKAI 338 Sbjct: 122 KKLRDDEKVDYDALQKAI 139 >gi|576299 [1..317] alpha/beta-Hydrolases Length = 317 Score = 24.8 bits (53), Expect = 2.6 Identities = 9/98 (9%), Positives = 9/98 (9%), Gaps = 10/98 (10%) Query: 38 GHKVVVMTHAYGNRSGVRYMTG--------GLKVYYVPWRPFVMQTTFPTVYGTLPIVRT 89 Sbjct: 30 VSKPILLVPGTGTTGPQSFDSNWIPLSTQLGYTPCWISPPPFMLNDTQVNTEYMVNAITA 89 Query: 90 ILRREKITVVH--GHQAFSTLCHEALMHARTMGYKVVF 125 Sbjct: 90 LYAGSGNNKLPVLTWSQGGLVAQWGLTFFPSIRSKVDR 127 >gi|1175888 [4..184] Flavodoxin-like Length = 181 Score = 24.5 bits (53), Expect = 3.3 Identities = 10/68 (14%), Positives = 10/68 (14%), Gaps = 5/68 (7%) Query: 274 LVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGG---VPEVLPDDMVVLAE--PDPD 328 Sbjct: 2 IIFAHPNSVRSFGRAIANRIEQISQENGVNVFFRDLYEMNFNPILSHEELQNANNGIIPE 61 Query: 329 DMVRAIEK 336 Sbjct: 62 DIQQEHDF 69 >gi|2500689 [29..252] Periplasmic binding protein-like I Length = 224 Score = 24.5 bits (53), Expect = 3.5 Identities = 4/28 (14%), Positives = 4/28 (14%) Query: 17 FPNFGGVENHIYYLSQCLLKLGHKVVVM 44 Sbjct: 105 ANNYETGKAQGSFVCALAKERGGNKVGM 132 >gi|2414449 [42..312] alpha/beta-Hydrolases Length = 271 Score = 24.1 bits (52), Expect = 4.4 Identities = 15/86 (17%), Positives = 15/86 (17%), Gaps = 12/86 (13%) Query: 120 GYKVVFTDHSLYG------FADV---GSIHMNKVLQFSLAD---IDQAICVSHTSKENTV 167 Sbjct: 113 NYDVYVTGHSLGGALAGLCAPRIVHDGLRQSQKIKVVTFGEPRVGNIEFSRAYDQLVPYS 172 Query: 168 LRSGLSPAKVFMIPNAVDTAMFKPAS 193 Sbjct: 173 FRVVHSGDVVPHLPGCVKDLSYTPPA 198 >gi|126343 [24..369] Periplasmic binding protein-like I Length = 346 Score = 24.2 bits (52), Expect = 4.5 Identities = 19/195 (9%), Positives = 19/195 (9%), Gaps = 31/195 (15%) Query: 34 LLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYVPWR---PFVMQTTFPTVYGTLP----- 85 Sbjct: 65 IVNDGIQYVIGHLCSSSTQPASDIYEDEGILMISPGATNPELTQRGYQYIMRTAGLDSSQ 124 Query: 86 ---IVRTILRR---EKITVVH--GHQAFSTLCHEALMH-ARTMGYKVVFTDHSLYGFADV 136 Sbjct: 125 GPTAAKYILETVKPQRIAIIHDKQQYGEGLA--RSVQDGLKQGNANIVF-----FDGITA 177 Query: 137 GSIHMNKVLQF---SLADIDQAICVSHTSKENTVLRSGLSPAK--VFMIPNAVDTAMFKP 191 Sbjct: 178 GEKDFSALIARLQKENIDF-VYYGGYYPEMGQ-IVRQARANGLKTQFMGPEGVGNASLSN 235 Query: 192 ASVRPSTDIITIVVI 206 Sbjct: 236 IAGGAAEGMLVTMPK 250 >gi|2246648 [5..196] NAD(P)-binding Rossmann-fold domains Length = 192 Score = 23.8 bits (50), Expect = 5.2 Identities = 1/40 (2%), Positives = 1/40 (2%) Query: 29 YLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYVPW 68 Sbjct: 12 CWTESVIHEYGDRVIGITRSVEAAQEMTALYPEHFIPCIA 51 >gi|1518938 [387..612] Heat shock protein 70kD (HSP70), C-terminal, substrate-binding fragment Length = 226 Score = 24.1 bits (52), Expect = 5.2 Identities = 9/38 (23%), Positives = 9/38 (23%), Gaps = 3/38 (7%) Query: 322 LAEPDPDDMVRAIEKAISIL---PTINPEEMHNRMKKL 356 Sbjct: 173 LSAEDKKTILDKCSESLSWLDNNHTAENEEYEYQQKEL 210 >gi|1945717 [15..256] alpha/beta-Hydrolases Length = 242 Score = 24.0 bits (50), Expect = 5.4 Identities = 13/119 (10%), Positives = 13/119 (10%), Gaps = 16/119 (13%) Query: 38 GHKVVVMTHAYGNRSGV-----RYMTGGLKVYYVPWRPFVMQTTFPTVYGT--------- 83 Sbjct: 3 GKASIMFAPGFGCDQSVWNAVAPAFEEDHRVILFDYVGSGHSDLRAYDLNRYQTLDGYAQ 62 Query: 84 -LPIVRTILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHM 141 Sbjct: 63 DVLDVCEALDLKETVFV-GHSVGALIGMLASIRRPELFSHLVMVGPSPCYLNDPPEYYG 120 >gi|1652197 [32..277] alpha/beta-Hydrolases Length = 246 Score = 23.9 bits (50), Expect = 5.4 Identities = 3/45 (6%), Positives = 3/45 (6%) Query: 16 FFPNFGGVENHIYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGG 60 Sbjct: 2 LLHGLPSQSLCWTGVMPLLAEKGLTAIAPDWLGFGFSDILDKRDF 46 >gi|1772352 [27..300] alpha/beta-Hydrolases Length = 274 Score = 23.7 bits (51), Expect = 5.4 Identities = 6/13 (46%), Positives = 6/13 (46%) Query: 120 GYKVVFTDHSLYG 132 Sbjct: 138 DYKIVVVGHSLGA 150 >gi|2209100 [9..467] PLP-dependent transferases Length = 459 Score = 24.0 bits (51), Expect = 5.6 Identities = 9/58 (15%), Positives = 9/58 (15%), Gaps = 1/58 (1%) Query: 27 IYYLSQCLLKLGHKVVVMTHAYGNRSGVRYMTGGLKVYYVPWRPFVMQTTFPTVYGTL 84 Sbjct: 107 AKREREKGLDRSKMVVFSNYFFDTTQGHSQINGA-TVRNVYIKEAFDTTAKHPFKGNF 163 >gi|1352909 [83..296] alpha/beta-Hydrolases Length = 214 Score = 23.7 bits (51), Expect = 6.4 Identities = 6/13 (46%), Positives = 6/13 (46%) Query: 120 GYKVVFTDHSLYG 132 Sbjct: 90 EYRIVVTGHSLGA 102 >gi|293973 [1..262] NAD(P)-binding Rossmann-fold domains Length = 262 Score = 23.5 bits (49), Expect = 6.9 Identities = 23/272 (8%), Positives = 23/272 (8%), Gaps = 19/272 (6%) Query: 95 KITVVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFSLADIDQ 154 Sbjct: 2 TIIVTGAA---GFIGSNIVKALNQRGITDIVAVDNLTKGEKFKNLAECEIAHYLDKHEFI 58 Query: 155 AICVSHTSKENTVLRSGLSPAKVFMIPNAVDTAMFKPASVRPSTDIITIVVISRLVYRKG 214 Sbjct: 59 RQVREHILPYQNI-------EAVFHQGACSDTMNHDGLYMMENNYQYTLDLL-DWCQDER 110 Query: 215 ADLLVEVIPEVCRLYPNVRFVVGGDGPKHV--RLEEMREKHSLQDRVEMLGAVPHSRVRS 272 Sbjct: 111 IPFLYASSAAVYGKGEIFREERELEKPLNVYGYSKFLFDQVLRRRMKEGLT-AQVVGFRY 169 Query: 273 VLVTGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPDDMVR 332 Sbjct: 170 FNVYGQHEQHKGRMASVAFHHFHQYREH---GYVNLFGSNDGYGNGEQTRDFVSVEDVAK 226 Query: 333 AIEKAISILPTINPEEMHNRMKKLYSWQDVAK 364 Sbjct: 227 INLYFFDHP--ELSGIYNLGTGRSQQFNELAA 256 >gi|417431 [2..400] beta/alpha (TIM)-barrel Length = 399 Score = 23.4 bits (50), Expect = 7.7 Identities = 10/72 (13%), Positives = 10/72 (13%), Gaps = 5/72 (6%) Query: 196 PSTDIITIVVISRLVYRKGADLLVEVIPEVCRLYPNVRFVVGGDGPKHVRLE--EMREKH 253 Sbjct: 204 PHSNTRTDEYGGSIENR--ARFTLEVVDALVEAIGHEKVGLRLS-PYGVFNSMSGGAETG 260 Query: 254 SLQDRVEMLGAV 265 Sbjct: 261 IVAQYAYVAGEL 272 >gi|882670 [155..332] Ribonuclease H-like motif Length = 178 Score = 23.4 bits (50), Expect = 8.5 Identities = 8/53 (15%), Positives = 8/53 (15%), Gaps = 5/53 (9%) Query: 230 PNVRFVVGGDGPKHVRLEEMREK-----HSLQDRVEMLGAVPHSRVRSVLVTG 277 Sbjct: 55 PYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITG 107 >gi|1536958 [63..316] Periplasmic binding protein-like I Length = 254 Score = 23.0 bits (49), Expect = 8.9 Identities = 13/128 (10%), Positives = 13/128 (10%), Gaps = 12/128 (9%) Query: 34 LLKLGH-KVVVMTHAYGNRSGVRYMTGG----LKVYYVPWRPFVMQTTFPTVYGTLPIVR 88 Sbjct: 113 CIEKGYEYFLLITADTSRLSTRIERASGFVDALTDANMRHASLTIEDKHTNLEQIKEFLQ 172 Query: 89 TILRREKITVVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFS 148 Sbjct: 173 KEIDPDEKTLVFI--PNCWALPLVFTVIKELNYNLP--QVGLIGFDNTEWTCF---SSPS 225 Query: 149 LADIDQAI 156 Sbjct: 226 VSTLVQPS 233 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 18 Number of calls to ALIGN: 18 Length of query: 447 Total length of test sequences: 256703 Effective length of test sequences: 209547.0 Effective search space size: 85448542.6 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Thu Nov 22 13:15:02 2001 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 AAK62657.1 using self-comparison ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ TIGRFAM hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR01016 sucCoAbeta: succinyl-CoA synthetase, beta s -297.2 8.2 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR01016 1/1 90 342 .. 1 389 [] -297.2 8.2 Alignments of top-scoring domains: TIGR01016: domain 1 of 1, from 90 to 342: score -297.2, E = 8.2 *->MkLHEYQAKeiFakYGiPvPeGeVatsveeveeiaeelgegkvVvKa i++++ i v G+ a+s+ e + + g Vv AAK62657.1 90 ----------ILRREKITVVHGHQAFSTLCHEALMHARTMGYKVV-- 124 QVHaGGRGKAGGVkvAkskEealkaAekLLGkeLkTfQtdplGqpVnkiL + ++L G + + + nk+L AAK62657.1 125 --------------FTD---------HSLYGFADV---G---SIHMNKVL 145 iEekidIdkEyYlsiviDRsakkpvimaSteGGvDIEEVAeksPEkIikv ++ dId +s + S+e ++ AAK62657.1 146 QFSLADIDQAICVSHT------------SKE-------------NTVLRS 170 aidPltgllpYqaRelakklgLegelikqvasIikkL..yqiFlEkDAs. + P++ ++ + ++ + ++ + ++Ii + +++ + k A AAK62657.1 171 GLSPAKVFM---IPNAVDTAMFKPASVRPSTDIITIVviSRLVYRKGADl 217 LvEINPLViTkdGnLvaLDAKLtiDDNALFRHpdLeemeDytqedqlEve LvE+ P V + n+ + + +++ +H Leem++ AAK62657.1 218 LVEVIPEVCRLYPNV-----RFVVGGDGP-KHVRLEEMREKHSLQ----- 256 AkqlGLnYVkLDGnIGCmVNGAGLAMaTMDIIKLyGGkPANFLDVGGGAs D + +G+ P AAK62657.1 257 -----------------------------DRVEMLGAVP----------- 266 eErVkeAlklvLSDksVKvvFi..NIFGGItRCDeVAkGlveAlkdvgvn + V+ v+++++IF + ++ ++ l+ ++ AAK62657.1 267 -------------HSRVRSVLVtgHIF---LNSSLTEAFCIAILEAASCG 300 VPvV.VRLeGTNvEeGkkILqesGkDRlniifatsmeeAAeKaVelak<- +V+ R G + + +++ + + +m +A eKa+ ++ AAK62657.1 301 LLTVsTRVGGVPEVLPDDMVVLAE------PDPDDMVRAIEKAISILP 342 * AAK62657.1 - - // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm-f Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00418 thrS: threonyl-tRNA synthetase 2.4 5.9 1 TIGR00473 pssA: CDP-diacylglycerol--serine O-phosphat 0.9 28 1 TIGR00544 lgt: prolipoprotein diacylglyceryl transfer -1.6 86 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00473 1/1 7 18 .. 163 174 .] 0.9 28 TIGR00418 1/1 218 238 .. 1 25 [. 2.4 5.9 TIGR00544 1/1 396 426 .. 328 358 .] -1.6 86 Alignments of top-scoring domains: TIGR00473: domain 1 of 1, from 7 to 18: score 0.9, E = 28 *->lsvLMvSnIkYP<-* l vLMvS+ +P AAK62657.1 7 LRVLMVSDFFFP 18 TIGR00418: domain 1 of 1, from 218 to 238: score 2.4, E = 5.9 *->rHsiAHlLaeavkqlypdvklavGP<-* ++++++e v++lyp+v ++vG+ AAK62657.1 218 ---LVEVIPE-VCRLYPNVRFVVGG 238 TIGR00544: domain 1 of 1, from 396 to 426: score -1.6, E = 86 *->gqilsllgilgglfllllayklsdykwkenk<-* g ++++++il l ++ll+ +d ++ e AAK62657.1 396 GKLFCMVMILDYLLWRLLQLLQPDEDIEEAP 426 // SMART hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/iprscan/data/smart.HMMs Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- LRRCT Leucine rich repeat C-terminal domain -7.6 43 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- LRRCT 1/1 352 392 .. 1 55 [] -7.6 43 Alignments of top-scoring domains: LRRCT: domain 1 of 1, from 352 to 392: score -7.6, E = 43 *->NPfnCDCeLrwLlrwlea.qnnealqdpvsslrCasPeslrGqplll +++ L w++ ++e + d+ +l+C + + l+ +l + AAK62657.1 352 -------RMKKLYSWQDVaKRTEIVYDR--ALKCSNRSLLE--RLMR 387 llpsefsCp<-* +l sC+ AAK62657.1 388 FL----SCG 392 // COG hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG0438 147.9 1.8e-40 1 COG1656 -131.2 97 1 COG2362 -178.5 77 1 COG1242 -213.3 27 1 COG0297 -220.7 0.0021 1 COG2022 -237.6 25 1 COG0380 -324.6 59 1 COG0045 -326.4 95 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG2022 1/1 121 305 .. 1 284 [] -237.6 25 COG0045 1/1 90 343 .. 1 402 [] -326.4 95 COG1242 1/1 144 363 .. 1 333 [] -213.3 27 COG0438 1/1 199 378 .. 1 255 [] 147.9 1.8e-40 COG0297 1/1 1 379 [. 1 556 [] -220.7 0.0021 COG0380 1/1 1 380 [. 1 586 [] -324.6 59 COG2362 1/1 163 411 .. 1 303 [] -178.5 77 COG1656 1/1 219 413 .. 1 255 [] -131.2 97 Alignments of top-scoring domains: COG2022: domain 1 of 1, from 121 to 305: score -237.6, E = 25 *->meAvqhssaqmsltlqmsqmslkddmLtIaGrtFdSRLllGTGKYpd v+ + + +s ++k+ +++a+ AAK62657.1 121 YKVVFTDHSLYGFADVGSIHMNKVLQFSLAD---------------- 151 levmkeAieaSGaqIvTVAiRRvnlgqapgednLLdlLdpdritlLPNTA + +Ai +S + ++ + + L + L+p ++ + PN AAK62657.1 152 ---IDQAICVSHT-----------SK---ENTVLRSGLSPAKVFMIPN-- 182 GcrtAeEAVrtArLAREald..................gtnwiKLEVigD AV tA+ ++ ++++ + ++ ++g +++ +EVi+ AAK62657.1 183 -------AVDTAMFKPASVRpstdiitivvisrlvyrkGADLL-VEVIPE 224 ertLLPDpvETLkAAEqLVkEGFtVLPYtsdDPVLArRLEevGCaAVMPL L+P++ F+V D+ RLEe AAK62657.1 225 VCRLYPNVR-------------FVV----GGDGPKHVRLEE--------- 248 AaPIGSGlGianpynLeiIieeAkDVPVIVDAGIGsPSDAAqAMELGaDA + ++ L+ +e + VP S + ++ G AAK62657.1 249 ---------MREKHSLQDRVEMLGAVP---------HSRVRSVLVTGH-- 278 VLvNTAIAeAkDPVaMArAmklAV.eAGRLAYlAGRiprkdyAtASSPlt +N+ ++e A+ A+ eA AS AAK62657.1 279 IFLNSSLTE---------AFCIAIlEA-----------------ASC--- 299 Glvrfv<-* Gl++ + AAK62657.1 300 GLLTVS 305 COG0045: domain 1 of 1, from 90 to 343: score -326.4, E = 95 *->sasdlMnLHEYQAKeiFakYGIPvPeGkvatsv..eEaveiakelGg i++ I v G+ a+s+ Ea a+ G+ AAK62657.1 90 ---------------ILRREKITVVHGHQAFSTlcHEALMHARTMGY 121 pvVVKAQVhaGGRGKAGGVKfakspeEakeaAeelLGknLvTyQtgikGe +vV f ++l G ++ AAK62657.1 122 KVV-----------------FTD---------HSLYG------FADVGSI 139 pVnkvLvEeavdiidkEyYlsivlDRsarrpvimaSseGGmDIEEVAeet nkvL + +d id +i + +++ ++ S AAK62657.1 140 HMNKVLQFSLAD-ID----QAICVSHTSKENTVLRSG------------- 171 PEkIikvtiDPlkGlqpYqaReiAfklGLpgelvkqvaki..ikkLYklF l + +++ +++ + +++ v+ ++i +i+ ++l AAK62657.1 172 -----------LSPAKVFMIPNAVDTAMFKPASVRPSTDIitIVVISRLV 210 veyDAs.LvEINPLvitkdGknllALDAKitiDDNAlfRHYpelaelRDp + + A+ LvE+ P v + n+ ++ + ++ +H +l+e+R++ AAK62657.1 211 YRKGADlLVEVIPEVCRLYP-NV-----RFVVGGDGP-KH-VRLEEMREK 252 sqEdplEvkAkeygLnYVkLDGnIGcivNGAGLAMaTMDiiklygGGkPA D + + G+ P AAK62657.1 253 HSLQ----------------------------------DRVEML-GAVP- 266 NFLDvGGGAtaErVkeAfkliLsdpnVKaiFiNIFGGa......ItRCDe + V+++++ +G +++ t + AAK62657.1 267 -----------------------HSRVRSVLV---TGHiflnssLTEAFC 290 VAeGiveAlkevgrgvnvPvVVRLeGTNvEeGrkILeEsGlnnhdlItaa +A i eA+ ++v + V G + + ++ + + AAK62657.1 291 IA--ILEAASCG--LLTVSTRV--GGVPEVLPDDMVVLA-------EPDP 327 tsmeeAAekaVelaeg<-* ++m A+eka+++ AAK62657.1 328 DDMVRAIEKAISILPT 343 COG1242: domain 1 of 1, from 144 to 363: score -213.3, E = 27 *->vmenNyddikiiveimktqklynslgdyLkerYGeKVfKitldgGFs v++ di + ++t k l L KVf i+ + AAK62657.1 144 VLQFSLADIDQAICVSHTSKENTVLRSGLSP---AKVFMIPNAV--- 184 CPNRDGTvGrGGCtFCsdasspdfagdesdqlpsqikeqierakkkykkk D ++ +p ++ +s++++ +++ +y+k+ AAK62657.1 185 ----D-----------TAMFKPASV-RPSTDII----TIVVISRLVYRKG 214 GikKyiaYFQayTNTYAp..VEvLremYeqaLsqedVVGLsVgTRPDCvP A+ VEv e+ VVG + P AAK62657.1 215 ----------------ADllVEVIPEVCRLYPNVRFVVGGD-------GP 241 deVLDlLaEYkerGyevWvELGLQTaHerTLkkINRgHDfacyvd...av + V L+E e+ H +++ v+ +av AAK62657.1 242 KHV--RLEEMREK------------------------HSLQDRVEmlgAV 265 nklrKrGIkVCsHvIlGLPgEdreemmETAKivakLdvdGIKiHpLhvvK r r V +H+ l+ ++ A +a L AAK62657.1 266 PHSRVRSVLVTGHIFLN-------SSLTEAFCIAIL-------------- 294 GTkmaKqYekGelktlsleeYvslvcdfLEhLPPevviHRlsgdkaprel ++ G l++ + +v+ + E+LP ++v+ ++ ++ + AAK62657.1 295 -----EAASCGLLTVST------RVGGVPEVLPDDMVV---LAE-PDPDD 329 liaPeWctfknkwevmnkie.eeLerRGTyQGkrfgrlkees<-* ++ + ++ + ++ i++ee +R k++ + AAK62657.1 330 MVR----AIEKAISILPTINpEEMHNRM----KKLYSWQDVA 363 COG0438: domain 1 of 1, from 199 to 378: score 147.9, E = 1.8e-40 *->dkpvilfvGRlvpeKgldllieafaklkeeipellpdlklvivGgts d ++i++++Rlv++Kg dll+e+++++++ + p++++v+ G AAK62657.1 199 DIITIVVISRLVYRKGADLLVEVIPEVCRLY----PNVRFVVGG--- 238 yiaaeacdGpeeerlrlleklakklglednVeflGfvpdprvldeelpel dGp+ r le++ +k+ l+d+Ve+lG vp ++ ++++ AAK62657.1 239 -------DGPKHVR---LEEMREKHSLQDRVEMLGAVP-----HSRVRSV 273 lkaadvfvlPSrysekrgedrEgfglvllEAmAaGtPViatdvgslelga l ++f+ S++ E+f++++lEA ++G+ ++t+vg AAK62657.1 274 LVTGHIFLNSSLT--------EAFCIAILEAASCGLLTVSTRVG------ 309 neereladkGipEvvedgarylfgenGregkrrlnlllvdpgdeddidsi G+pEv+ d + ++ + d+d AAK62657.1 310 ---------GVPEVLPD--------DM---------VVLAEPDPD----- 328 ealaeaierlledpelreregvsllgrearrrvaerfswekiakrllkly ++++aie++++ + + +e +r ++ +sw+ +akr++ +y AAK62657.1 329 -DMVRAIEKAISILPTINP-------EEMHNRMKKLYSWQDVAKRTEIVY 370 eellekre<-* +++l+ ++ AAK62657.1 371 DRALKCSN 378 COG0297: domain 1 of 1, from 1 to 379: score -220.7, E = 0.0021 *->lnsqdryserMkILfvasEvtPfvKvGGLADVlgaLPkaLkklGhdV + +++ + ++L v+ ++P GG ++ L++ L klGh+V AAK62657.1 1 --MAEPP--KLRVLMVSDFFFP--NFGGVENHIYYLSQCLLKLGHKV 41 rVlLPkYgriqgepieqlykvsegetvavvgreqqfdvlesyldGtvgly +V++ +Yg +g + + + ++++v++r v +++ ++ AAK62657.1 42 VVMTHAYGNRSGVRYMTG----GLKVYYVPWRP---FVMQTTFPT----- 79 lidKndyyfnregnPYhDanlygypDnaeRFafFsaAalelldgldpfwq + + ++ + ++l +++ AAK62657.1 80 VYG------TLPI------------------------VRTILRREK---- 95 PDiVHaHDWhTGLvp.alLKteyrklPFfervKtVFTIHNLaYQGEmIEY + +VH+H + L+ +al++ r K+VFT H L G AAK62657.1 96 ITVVHGHQAFSTLCHeALMH--ARTMG----YKVVFTDHSLY--G----- 132 GEVmTFLifpahylhellglplylfhyeglefpGqinflKaGivfaDhV. ++ h+ +l + l +i+ +i ++ + AAK62657.1 133 -------FADVGSIHMNKVLQFSLA---------DIDQ---AICVSHTSk 163 .TTVSPTYAqEIqTpeygygLeglLkarssegklsGILNGIDyeiWnPet ++TV l + +s + k+ I N +D+ + P+ AAK62657.1 164 eNTV-------------------LRSGLS-PAKVFMIPNAVDTAMFKPA- 192 DpylaanYdagsledpvlFkkKaeNKtaLqeelGLpedddaPligiVsRL s p ++d+ i ++sRL AAK62657.1 193 -----------SVR---------------------P-STDIITIVVISRL 209 teQKGvdLlleiideLlekEFqdaqlViLGtGdPeLEnafrnlaerhpds + KG dLl+e+i+e + + ++++V++G G + ++ ++ e+h AAK62657.1 210 VYRKGADLLVEVIPEVCR-LYPNVRFVVGGDG--PKHVRLEEMREKHS-- 254 gnvavligfdepLA..rriYAGaDfilMPSrFEPCGLtQLiaMrYGTvPI + +v+ p ++ r + +++l S E++ L+a G + + AAK62657.1 255 LQDRVEMLGAVPHSrvRSVLVTGHIFLNSSLTEAFCIAILEAASCGLLTV 304 VReTGGLaDTVvdldydeenleekgtGflFkepdaeallnalsRAlalYr +++GG + +d d ++ epd+ + +a+++A+++ AAK62657.1 305 STRVGGVPEVLPD-DM-----------VVLAEPDPDDMVRAIEKAISILP 342 qelNEICmFmQYIRYCPHpdewqnlvtraMaNCYYHVFadfSWdkSPAke + p+e+ n++++ ++ W++ Ak+ AAK62657.1 343 TIN---------------PEEMHNRMKKLYS-----------WQDV-AKR 365 YvelYegllaktrd<-* +Y ++l + AAK62657.1 366 TEIVYDRALKCSNR 379 COG0380: domain 1 of 1, from 1 to 380: score -324.6, E = 59 *->mtdtaqdntpkkrqris.dletdeakk.qlve...sfgdYSNkaKLk m ++ ++ + ++ ++ e + +l+ ++g AAK62657.1 1 MAEPPKLRVLMVSDFFFpNFGGVENHIyYLSQcllKLGH-------- 39 nsklvvvSneLFsRlPvsiekendgtgeykknavGstGlvtaLerllkrr k vv ++ + n +++ + + G lk + AAK62657.1 40 --KVVVMTH----------AYGNRSGV--R-YMTG----------GLKVY 64 eakEkPqDLDDDPLYgtWvgwpgvttdelpsdkeekdkdgkilkdrfnvh + W ++ t p + ++++ + r + AAK62657.1 65 Y--------------VPWRPFVMQT--TFPTVY--GTLPIVRTILRREK- 95 PViLsdedfkgyYnnySnaiLWPlfHYfintnpdnvnskafernwWdgYv ++ +++ + + ++L H + t +v f++++ +g+ AAK62657.1 96 ---ITVVHGHQAFSTLCHEAL---MHAR--TMGYKVV---FTDHSLYGFA 134 kvNqkFAdkivk.vlkkgDKDSlIWiHDYhLlLvPqmLRvkimkkrlpna +v +k+++ +l++ D q + v k+ + AAK62657.1 135 DVGSIHMNKVLQfSLADID----------------QAICVSHTSKENTVL 168 kIGFFlHiPFPSsEiFriLPeRmReeIleGLlgaDlvGFHterYarnFls + G + +F+i P+ + D t ++ + + AAK62657.1 169 RSG------LSPAKVFMI-PN-----------AVD-----TAMFKPASVR 195 scrrLldvdtrrrkvssrsdtyaGtALAEPELttivryggrlvrVdafPI + + ++ +r v g + +V+++P AAK62657.1 196 PSTDIITIVVISRL---------------------VYRKGADLLVEVIP- 223 GiDpeeltkqaakgsvqekvqeIKsealknkKlIvsvDRlDyiKGivekL e+ ++ n++ v+ D ++ + AAK62657.1 224 ---------------------EVC-RLYPNVRFVVGGDG-------PKHV 244 lAyEelLeehPElr...gkvvlvqiatpsredveYqnlrqeveelVgrIN + Lee +E+++ ++ v + ++ sr v+ +++ + g+I AAK62657.1 245 R-----LEEMREKHslqDRVEMLGAVPHSR--VR-----SVL--VTGHI- 279 geyGnlswsQPvhylhrpipfeeLialfkaaDvaLVtPLRDGMNLVakEY +l++++ + ia++ aa ++L t AAK62657.1 280 -------------FLNSSLTEAFCIAILEAASCGLLT------------- 303 VacsseknnFLCYgGpLILSEFaGaaneLkegAiiVNPwDlkevadAIde ++ + + G +L+++ ++ D +++ AI++ AAK62657.1 304 -VSTRV-G---------------GVPEVLPDDMVVLAEPDPDDMVRAIEK 336 AL....kMskeekrkrwekLkkeVlkrDidhWankllrEefldslkgekp A + ++ +ee +r++kL++ D++ ++ d ++ + AAK62657.1 337 AIsilpTINPEEMHNRMKKLYSWQ---DVAKRTEI-----VYDRALKCSN 378 ss<-* +s AAK62657.1 379 RS 380 COG2362: domain 1 of 1, from 163 to 411: score -178.5, E = 77 *->mqLRvGGeGvLGMklfiSVDmEGlpyipSrvhvfpkEFGGLvngalY v G +k+f+ +p ++ + p AAK62657.1 163 KENTVLRSGLSPAKVFM------IPNAVDTAMFKPA----------- 192 eeaRrImTreaniaaEE.aFasGaeeVlVaDSHgkMvNLlPE..elperV R ++ +++ +++ Ga + v ++PE l + V AAK62657.1 193 -SVRPSTDIITIVVISRlVYRKGA---------DLLVEVIPEvcRLYPNV 232 sLvrGfprPLSMVaGakdEgva.......aALFlGY..HArAgtgrGvld ++v G + P + ++ E ++++++ ++ lG +H r + v + AAK62657.1 233 RFVVGGDGP----KHVRLEEMRekhslqdRVEMLGAvpHSRVRSV-LVTG 277 HTy.sGrtirkVrIngvevgtEylLNAyllGeygVPVllvaGDdvlreEv H + + + + I e++ +L + gVP +vl +++ AAK62657.1 278 HIFlNSSLTEAFCIAILEAA-SCGLLTVSTRVGGVP-------EVLPDDM 319 reELtPwvefVavKralSryAaislspakiereLReaverAVqkkargel L + + p +r a+e+A+ + + AAK62657.1 320 VV-LAE------------------PDPDDMVR----AIEKAISILPT--I 344 kPLtprePavveveFknsarAevAELlPgVERvdGtvtVqyeaksmee.. P + + + + s++ +vA + +E+v + + s++e+ AAK62657.1 345 NPEEMHNR----MKKLYSWQ-DVAKR---TEIVYD-RALKCSNRSLLErl 385 .........AYklfevlvllAagvSyilsR<-* + + + A klf +l y l R AAK62657.1 386 mrflscgawAGKLFCMVMIL----DYLLWR 411 COG1656: domain 1 of 1, from 219 to 413: score -131.2, E = 97 *->yvkivdv.RayarLNdFvrlseRikTvR...FrsfQTVKDvlEAlG. + i v R y Fv + k vR ++ r D++E lG AAK62657.1 219 VEVIPEVcRLYPNV-RFVVGGDGPKHVRleeMREKHSLQDRVEMLGa 264 vPHvEVdLIllNGdPvdFdyrvvdGdliavYPeFealdigevlrltpkri vPH V +l+ G + l ++ l+ + ++lt + AAK62657.1 265 VPHSRVRSVLVTG------HIFLNSSLTEAF-CIAILEAASCGLLTVST- 306 gEmkFvvDemLGkLARwLRllGYDtvygstedDdeiieiakkegRIlLTR R G++ v + Dd + ++ AAK62657.1 307 ------------------RVGGVPEVLP----DDMVVLAEPDP------- 327 DrgLykrarKlgikaiyiesddiEeqLaevvmrlglklsrLkperaRCpy +++ +K++ i + ++++ + ++ + r++ ++R ++ AAK62657.1 328 -DDMVRAIEKAISILPTINPEEMHNRMKKLYSWQDVAK-RTEIVYDRALK 375 C.NgeLekvsrEEVkekVPREkvyreadeFELsrCtkCGriYWkGsHwer C+N++L++ + F +CG+ W G + AAK62657.1 376 CsNRSLLERL-----------------MRF-----LSCGA--WAGKLFC- 400 Mvkkleeklrkget<-* Mv + + l ++ AAK62657.1 401 MVM-ILDYLLWRLL 413 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm-f Sequence file: AAK62657.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAK62657.1 AT3g45100/T14D3_40 [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG0438 146.2 1.4e-44 1 COG0297 10.5 0.026 1 COG2839 8.4 0.17 1 COG0022 4.3 2.5 1 COG2236 2.0 20 1 COG0193 1.5 24 1 COG0117 1.2 31 1 COG1923 1.1 48 1 COG0037 0.2 41 1 COG1518 0.0 37 1 COG1373 -0.3 62 1 COG3374 -0.5 88 1 COG0620 -1.2 44 1 COG0145 -1.8 70 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG3374 1/1 65 74 .. 202 211 .] -0.5 88 COG0297 1/1 203 240 .. 338 376 .. 10.5 0.026 COG0145 1/1 230 249 .. 731 754 .] -1.8 70 COG1923 1/1 249 258 .. 1 10 [. 1.1 48 COG0117 1/1 319 337 .. 1 19 [. 1.2 31 COG0022 1/1 325 340 .. 321 336 .] 4.3 2.5 COG0620 1/1 325 346 .. 713 738 .. -1.2 44 COG0193 1/1 333 361 .. 168 196 .] 1.5 24 COG2236 1/1 349 362 .. 1 14 [. 2.0 20 COG0438 1/1 199 378 .. 1 255 [] 146.2 1.4e-44 COG0037 1/1 353 389 .. 1 33 [. 0.2 41 COG2839 1/1 395 417 .. 212 234 .] 8.4 0.17 COG1373 1/1 401 424 .. 1 28 [. -0.3 62 COG1518 1/1 432 446 .. 340 354 .] 0.0 37 Alignments of top-scoring domains: COG3374: domain 1 of 1, from 65 to 74: score -0.5, E = 88 *->ylpwtPivtv<-* y+pw P v+ AAK62657.1 65 YVPWRPFVMQ 74 COG0297: domain 1 of 1, from 203 to 240: score 10.5, E = 0.026 *->igiVsRLteQKGvdLlleiideLlekEFqdaqlViLGtG<-* i ++sRL+ KG dLl+e+i+e + + ++++V++G G AAK62657.1 203 IVVISRLVYRKGADLLVEVIPEVCR-LYPNVRFVVGGDG 240 COG0145: domain 1 of 1, from 230 to 249: score -1.8, E = 70 *->pgwqAtvdkygPWPqPihLvieri<-* p+ +++v+++g P h+++e+ AAK62657.1 230 PNVRFVVGGDG----PKHVRLEEM 249 COG1923: domain 1 of 1, from 249 to 258: score 1.1, E = 48 *->makgvkLQDp<-* m ++++LQD+ AAK62657.1 249 MREKHSLQDR 258 COG0117: domain 1 of 1, from 319 to 337: score 1.2, E = 31 *->meefseqdefyMerALelA<-* m+++ e+d M rA+e+A AAK62657.1 319 MVVLAEPDPDDMVRAIEKA 337 COG0022: domain 1 of 1, from 325 to 340: score 4.3, E = 2.5 *->PnparIlaAvkevlef<-* P+p+++++A+++++++ AAK62657.1 325 PDPDDMVRAIEKAISI 340 COG0620: domain 1 of 1, from 325 to 346: score -1.2, E = 44 *->EsvEeikalirkalevvpkErlwvnP<-* +++++++++i+ka+ ++p +nP AAK62657.1 325 PDPDDMVRAIEKAISILPT----INP 346 COG0193: domain 1 of 1, from 333 to 361: score 1.5, E = 24 *->aidkaaealeliieggfaeamnklnslka<-* ai+ka+ l+ i + + + m+kl+s+ AAK62657.1 333 AIEKAISILPTINPEEMHNRMKKLYSWQD 361 COG2236: domain 1 of 1, from 349 to 362: score 2.0, E = 20 *->mkkfkkilySwedI<-* m +k+lySw d+ AAK62657.1 349 MHNRMKKLYSWQDV 362 COG0438: domain 1 of 1, from 199 to 378: score 146.2, E = 1.4e-44 *->dkpvilfvGRlvpeKgldllieafaklkeeipellpdlklvivGgts d ++i++++Rlv++Kg dll+e+++++++ + p++++v+ G AAK62657.1 199 DIITIVVISRLVYRKGADLLVEVIPEVCRLY----PNVRFVVGG--- 238 yiaaeacdGpeeerlrlleklakklglednVeflGfvpdprvldeelpel dGp+ r le++ +k+ l+d+Ve+lG vp ++ ++++ AAK62657.1 239 -------DGPKHVR---LEEMREKHSLQDRVEMLGAVP-----HSRVRSV 273 lkaadvfvlPSrysekrgedrEgfglvllEAmAaGtPViatdvgslelga l ++f+ S++ E+f++++lEA ++G+ ++t+vg AAK62657.1 274 LVTGHIFLNSSLT--------EAFCIAILEAASCGLLTVSTRVG------ 309 neereladkGipEvvedgarylfgenGregkrrlnlllvdpgdeddidsi G+pEv+ d + ++ + d+d AAK62657.1 310 ---------GVPEVLPD--------DM---------VVLAEPDPD----- 328 ealaeaierlledpelreregvsllgrearrrvaerfswekiakrllkly ++++aie++++ + + +e +r ++ +sw+ +akr++ +y AAK62657.1 329 -DMVRAIEKAISILPTINP-------EEMHNRMKKLYSWQDVAKRTEIVY 370 eellekre<-* +++l+ ++ AAK62657.1 371 DRALKCSN 378 COG0037: domain 1 of 1, from 353 to 389: score 0.2, E = 41 *->MmkCskCgrda.viklry.geglck.ic.eehfkerl<-* M+k ++ a++++++y+++ +c +++ e+++ +l AAK62657.1 353 MKKLYSWQDVAkRTEIVYdRALKCSnRSlLERLMRFL 389 COG2839: domain 1 of 1, from 395 to 417: score 8.4, E = 0.17 *->aaKvvlalliliylLwkvldylf<-* a K + ++il ylLw+ l l+ AAK62657.1 395 AGKLFCMVMILDYLLWRLLQLLQ 417 COG1373: domain 1 of 1, from 401 to 424: score -0.3, E = 62 *->mdmledlnpwwfskeiipkiLvdrvike<-* m+m+ d+ +w++ + p d++i+e AAK62657.1 401 MVMILDYLLWRLLQLLQP----DEDIEE 424 COG1518: domain 1 of 1, from 432 to 446: score 0.0, E = 37 *->hLrgekeYeplvlww<-* h+rg++++e+l+ ++ AAK62657.1 432 HHRGVEVSEGLRKKI 446 //