analysis of sequence from AAF18502.1.fa ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. MKEAKEKKNP EISVSITIST WGAFAVYVIT GLFLIFGFHV VRNKYSVDLI SDPTLTLRLL WIIEFPIVVI IYSLFRRNPE KCSYFRAVGR SLVGLIAGAL INALGAVSLG APIGMQSLSK TIHWSFLMSV FTVVPATAVL GASWIDWHRI FASLKPIGII EHMLLVPAYG AIIGGWFGAW PMPLDWERPW QEWPICVCYG AIGGYIGGQM LGLMRTCEAQ YLTVRVIYHG QTLSDDVACN YWLTGPQLCH RFTYMVKCCQ VQPLKHGPTP ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > AAF18502.1 . . . . . 1 MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLI 50 ___________EEEEEEE_____EEEEEE___EEEEEEEEEE____EEE_ . . . . . 51 SDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGAL 100 ____HHHHHHHHH__EEEEEEE___________________HHHHHHHHH . . . . . 101 INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWHRI 150 HHH___________________HHHHHHHEEEEE_________HHHHHHH . . . . . 151 FASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYG 200 HHH_____EEE_EEEE___________________________EEEEEE_ . . . . . 201 AIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACNYWLTGPQLCH 250 ___________HHHHHHHHHHHHEEEEE____________________HH . . 251 RFTYMVKCCQVQPLKHGPTP 270 HHHHEEEE____________ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 0.0 % beta-contents : 67.5 % coil-contents : 32.5 % class : beta method : 2 alpha-contents : 0.0 % beta-contents : 72.6 % coil-contents : 27.4 % class : beta ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -27.44 -0.28 -1.07 -0.68 -4.00 -4.00 0.00 0.00 -1.55 -8.02 -2.10 -12.00 -12.00 0.00 -12.00 0.00 -85.15 -26.28 -0.50 -0.26 -1.27 -4.00 -4.00 0.00 0.00 -2.16 -8.23 -2.10 -12.00 -12.00 0.00 -12.00 0.00 -84.80 ID: AAF18502.1 AC: xxx Len: 270 1:I 244 Sc: -84.80 Pv: 8.651195e-01 NO_GPI_SITE GPI: learning from protozoa -38.25 -0.46 -0.13 -0.43 -4.00 0.00 -4.00 0.00 -0.23 -6.55 -7.96 -12.00 -12.00 -4.00 -12.00 0.00 -102.01 -24.64 -3.06 -1.45 -0.43 -4.00 0.00 -28.00 0.00 0.00 -6.38 -7.96 -12.00 0.00 0.00 -12.00 0.00 -99.91 ID: AAF18502.1 AC: xxx Len: 270 1:I 260 Sc: -99.91 Pv: 8.623251e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAF18502.1 0.712 44 Y 0.641 44 Y 0.961 132 Y 0.424 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAF18502.1 0.420 139 N 0.490 139 Y 0.966 134 Y 0.384 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? AAF18502.1 0.552 112 Y 0.439 112 Y 0.990 135 Y 0.512 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. 1-91 MKEAKEKKNPEISVSITISTWGAFAVYVIT GLFLIFGFHVVRNKYSVDLISDPTLTLRLL WIIEFPIVVIIYSLFRRNPEKCSYFRAVGR S lvgliagalinalgav 92-107 108-270 SLGAPIGMQSLSKTIHWSFLMSVFTVVPAT AVLGASWIDWHRIFASLKPIGIIEHMLLVP AYGAIIGGWFGAWPMPLDWERPWQEWPICV CYGAIGGYIGGQMLGLMRTCEAQYLTVRVI YHGQTLSDDVACNYWLTGPQLCHRFTYMVK CCQVQPLKHGPTP low complexity regions: SEG 25 3.0 3.3 >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. 1-86 MKEAKEKKNPEISVSITISTWGAFAVYVIT GLFLIFGFHVVRNKYSVDLISDPTLTLRLL WIIEFPIVVIIYSLFRRNPEKCSYFR avgrslvgliagalinalgavslgapigmq 87-118 sl 119-270 SKTIHWSFLMSVFTVVPATAVLGASWIDWH RIFASLKPIGIIEHMLLVPAYGAIIGGWFG AWPMPLDWERPWQEWPICVCYGAIGGYIGG QMLGLMRTCEAQYLTVRVIYHGQTLSDDVA CNYWLTGPQLCHRFTYMVKCCQVQPLKHGP TP low complexity regions: SEG 45 3.4 3.75 >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. 1-270 MKEAKEKKNPEISVSITISTWGAFAVYVIT GLFLIFGFHVVRNKYSVDLISDPTLTLRLL WIIEFPIVVIIYSLFRRNPEKCSYFRAVGR SLVGLIAGALINALGAVSLGAPIGMQSLSK TIHWSFLMSVFTVVPATAVLGASWIDWHRI FASLKPIGIIEHMLLVPAYGAIIGGWFGAW PMPLDWERPWQEWPICVCYGAIGGYIGGQM LGLMRTCEAQYLTVRVIYHGQTLSDDVACN YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP 1 - 270 MKEAKEKKNP EISVSITIST WGAFAVYVIT GLFLIFGFHV VRNKYSVDLI SDPTLTLRLL WIIEFPIVVI IYSLFRRNPE KCSYFRAVGR SLVGLIAGAL INALGAVSLG APIGMQSLSK TIHWSFLMSV FTVVPATAVL GASWIDWHRI FASLKPIGII EHMLLVPAYG AIIGGWFGAW PMPLDWERPW QEWPICVCYG AIGGYIGGQM LGLMRTCEAQ YLTVRVIYHG QTLSDDVACN YWLTGPQLCH RFTYMVKCCQ VQPLKHGPTP low complexity regions: DUST >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLL WIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQSLSK TIHWSFLMSVFTVVPATAVLGASWIDWHRIFASLKPIGIIEHMLLVPAYGAIIGGWFGAW PMPLDWERPWQEWPICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACN YWLTGPQLCHRFTYMVKCCQVQPLKHGPTP ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for AAF18502.1 sequence: 270 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MKEAKEKKNP EISVSITIST WGAFAVYVIT GLFLIFGFHV VRNKYSVDLI SDPTLTLRLL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 WIIEFPIVVI IYSLFRRNPE KCSYFRAVGR SLVGLIAGAL INALGAVSLG APIGMQSLSK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 TIHWSFLMSV FTVVPATAVL GASWIDWHRI FASLKPIGII EHMLLVPAYG AIIGGWFGAW ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 PMPLDWERPW QEWPICVCYG AIGGYIGGQM LGLMRTCEAQ YLTVRVIYHG QTLSDDVACN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | YWLTGPQLCH RFTYMVKCCQ VQPLKHGPTP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** AAF18502.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: AAF18502.1.fa.___inter___ (1 sequences) MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLI SDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGAL INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWHRI FASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYG AIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACNYWLTGPQLCH RFTYMVKCCQVQPLKHGPTP (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 20 40 2.189 Certain 2 55 75 1.152 Certain 3 91 111 1.581 Certain 4 124 144 1.968 Certain 5 162 182 1.701 Certain 6 193 213 1.322 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 19 14 15 12 17 10 57 K+R profile 5.00 5.00 2.00 5.00 3.00 1.00 1.00 CYT-EXT prof - - - - - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 12.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): -0.1429 NEG: 3.0000 POS: 4.0000 -> Orientation: undecided CYT-EXT difference: 0.00 -> Orientation: undecided ---------------------------------------------------------------------- "AAF18502" 270 20 40 #t 2.18854 55 75 #t 1.15208 91 111 #t 1.58125 124 144 #t 1.96771 162 182 #t 1.70104 193 213 #t 1.32187 ************************************ *TOPPREDM with prokaryotic function* ************************************ AAF18502.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: AAF18502.1.fa.___inter___ (1 sequences) MKEAKEKKNPEISVSITISTWGAFAVYVITGLFLIFGFHVVRNKYSVDLI SDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKCSYFRAVGRSLVGLIAGAL INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVPATAVLGASWIDWHRI FASLKPIGIIEHMLLVPAYGAIIGGWFGAWPMPLDWERPWQEWPICVCYG AIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVACNYWLTGPQLCH RFTYMVKCCQVQPLKHGPTP (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 20 40 2.189 Certain 2 55 75 1.152 Certain 3 91 111 1.581 Certain 4 124 144 1.968 Certain 5 162 182 1.701 Certain 6 193 213 1.322 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 Loop length 19 14 15 12 17 10 57 K+R profile 4.00 5.00 2.00 5.00 3.00 1.00 1.00 CYT-EXT prof - - - - - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 11.00 Tm probability: 1.00 -> Orientation: N-in Charge-difference over N-terminal Tm (+-15 residues): 0.00 (NEG-POS)/(NEG+POS): -0.1429 NEG: 3.0000 POS: 4.0000 -> Orientation: undecided CYT-EXT difference: 0.00 -> Orientation: undecided ---------------------------------------------------------------------- "AAF18502" 270 20 40 #t 2.18854 55 75 #t 1.15208 91 111 #t 1.58125 124 144 #t 1.96771 162 182 #t 1.70104 193 213 #t 1.32187 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ SAPS. Version of April 11, 1996. Date run: Thu May 23 15:19:03 2002 File: /people/b_eisen/AAF18502.1.fa.___saps___ ID AAF18502.1 DE ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. number of residues: 270; molecular weight: 30.2 kdal 1 MKEAKEKKNP EISVSITIST WGAFAVYVIT GLFLIFGFHV VRNKYSVDLI SDPTLTLRLL 61 WIIEFPIVVI IYSLFRRNPE KCSYFRAVGR SLVGLIAGAL INALGAVSLG APIGMQSLSK 121 TIHWSFLMSV FTVVPATAVL GASWIDWHRI FASLKPIGII EHMLLVPAYG AIIGGWFGAW 181 PMPLDWERPW QEWPICVCYG AIGGYIGGQM LGLMRTCEAQ YLTVRVIYHG QTLSDDVACN 241 YWLTGPQLCH RFTYMVKCCQ VQPLKHGPTP -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 19( 7.0%); C : 8( 3.0%); D- : 6( 2.2%); E : 9( 3.3%); F : 12( 4.4%) G : 24( 8.9%); H : 7( 2.6%); I+ : 26( 9.6%); K : 10( 3.7%); L : 27(10.0%) M : 8( 3.0%); N : 5( 1.9%); P : 16( 5.9%); Q : 8( 3.0%); R : 11( 4.1%) S : 16( 5.9%); T : 14( 5.2%); V : 22( 8.1%); W+ : 11( 4.1%); Y : 11( 4.1%) KR : 21 ( 7.8%); ED : 15 ( 5.6%); AGP : 59 ( 21.9%); KRED : 36 ( 13.3%); KR-ED : 6 ( 2.2%); FIKMNY : 72 ( 26.7%); LVIFM : 95 ( 35.2%); ST : 30 ( 11.1%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0+-0+-++00 -000000000 0000000000 0000000000 0+0+000-00 0-00000+00 61 000-000000 00000++00- +0000+000+ 0000000000 0000000000 000000000+ 121 0000000000 0000000000 00000-00+0 0000+00000 -000000000 0000000000 181 0000-0-+00 0-00000000 0000000000 0000+00-00 0000+00000 0000--0000 241 0000000000 +00000+000 0000+00000 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 8/30 or 10/45 or 12/60): none Negative charge clusters (cmin = 6/30 or 8/45 or 10/60): none Mixed charge clusters (cmin = 11/30 or 15/45 or 18/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 4 | 4 | 5 | 57 | 8 | 8 | 10 | 10 | 9 | 13 | 7 | 8 | lmin1 5 | 5 | 6 | 70 | 10 | 9 | 12 | 13 | 11 | 15 | 8 | 10 | lmin2 6 | 6 | 8 | 77 | 12 | 11 | 14 | 14 | 13 | 17 | 9 | 12 | (Significance level: 0.010000; Minimal displayed length: 6) (*) 6(1,0,0); at 2- 8: KEAKEKK (1. quartile) +-0+-++ Run count statistics: + runs >= 3: 0 - runs >= 3: 0 * runs >= 3: 1, at 5; 0 runs >= 38: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -1.578 M_0.01= 101.6; M_0.05= 79.78; M_0.30= 53.73 1) From 12 to 41: length= 30, score=68.00 12 ISVSITISTW GAFAVYVITG LFLIFGFHVV G: 3(10.0%); S: 3(10.0%); V: 5(16.7%); T: 3(10.0%); I: 5(16.7%); F: 4(13.3%); 2) From 92 to 145: length= 54, score=61.00 (pocket at 116 to 125: length= 10, score=-33.00) 92 LVGLIAGALI NALGAVSLGA PIGM |QSLSKT IHWS| FLMSVF TVVPATAVLG 142 ASWI L: 8(14.8%); A: 8(14.8%); G: 6(11.1%); S: 6(11.1%); V: 6(11.1%); 2. SPACINGS OF C. H2N-81-C-113-C-1-C-18-C-21-C-9-C-8-C-C-11-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-38-H-42-C-40-H-24-H-13-H-33-C-1-C-18-C-11-H-9-C-9-C-H-7-C-C-6-H-4-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 4 Aligned matching blocks: [ 169- 175] YGAIIGG [ 199- 204] YGA_IGG B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 8 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 16 (Expected range: 4-- 28) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 7 (6-10) 2 (11-20) 2 (>=21) 6 3. Clusters of amino acid multiplets (cmin = 11/30 or 14/45 or 16/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 3 (Expected range: 0-- 7) 2 +plets (f+: 7.8%), 1 -plets (f-: 5.6%) Total number of charge altplets: 4 (Critical number: 9) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 0 (6-10) 1 (11-20) 0 (>=21) 3 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 100- 135 9 L........ 4 4 0 200- 215 4 G... 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 5) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 8) Location Period Element Copies Core Errors 26- 52 3 i.. 8 6 1 35- 79 5 i.... 8 6 1 59- 76 3 i.. 6 6 0 95- 148 9 i0000..00 6 6 /0/1/2/2/1/././1/2/ 109- 168 6 i.0..0 9 7 /1/./2/././3/ 151- 168 3 i.0 6 6 /0/./2/ -------------------------------------------------------------------------------- SPACING ANALYSIS. There are no unusual spacings. ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- DUF6 Integral membrane protein DUF6 -36.1 48 1 CDP-OH_P_transf CDP-alcohol phosphatidyltransferase -59.8 49 1 Cyto_ox_2 Cytochrome oxidase subunit II -260.1 49 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- CDP-OH_P_transf 1/1 33 146 .. 1 175 [] -59.8 49 DUF6 1/1 34 162 .. 1 126 [] -36.1 48 Cyto_ox_2 1/1 55 265 .. 1 363 [] -260.1 49 Alignments of top-scoring domains: CDP-OH_P_transf: domain 1 of 1, from 33 to 146: score -59.8, E = 49 *->allfllasitDwlDGyiARklnqvtrFGafLDPvaDkllvvialiiL +l+f ++++ +n+++ +D +++ ++++ L AAF18502.1 33 FLIFGFHVV-----------RNKYS---------VDLISDPTLTLRL 59 aylglvpwwltipaalilgvaSHYmhrelllsglrevlaylglkepaiav ++ +p++++i +l+ r ++ + ++++ AAF18502.1 60 LWIIEFPIVVII---YSLF---------------R-RNPEKCS---YFRA 87 lyygkrktiftCAsaqivaiallLlgfslqsvlvgseiysklagvifwfi + i A+a+i+a++++ lg + ++ +l++ i w AAF18502.1 88 VGRSLVGLI---AGALINALGAVSLGAP--------IGMQSLSKTIHW-- 124 laaslcgyiaavltlitglqyvvaalay<-* + v+t++ ++ ++ a + + AAF18502.1 125 ------SFLMSVFTVVPATAVLGASWID 146 DUF6: domain 1 of 1, from 34 to 162: score -36.1, E = 48 *->fiWalytvfskklle...spltftawrfliagilllilllflkkgpp +i+++ v k+ + ++p + + ++i ++ + i+ ++++ p AAF18502.1 34 LIFGFHVVRNKYSVDlisDPTLTLRLLWIIEFPIVVIIYSLFRRNPE 80 l..lallslk.ilallylgilgtalgyllyfyalkyvsaskasvlsslsP + + + +++ l++ ++++ alg +++ + s sk s+l+ AAF18502.1 81 KcsYFRAVGRsLVGLIAGALIN-ALGAVSLGAPIGMQSLSKTIHWSFLMS 129 vftlilsvllLgEkltlkqllGivlillGvllisl<-* vft++ + ++Lg + ++++ i+ +l + +i AAF18502.1 130 VFTVVPATAVLGAS--WIDWHRIFASLKPIGIIEH 162 Cyto_ox_2: domain 1 of 1, from 55 to 265: score -260.1, E = 49 *->elLplvWfvligvllfgYvvlDGFDlGVGmllpflakdeeERRillN +L+l+W +i + +++ +++++++++ E+ AAF18502.1 55 LTLRLLW--IIEFPIVV------------IIYSLFRRNP-EK----- 81 sIGPVWDGNEVWLvlaGGALFAAFPlaYAtllSglYlPlilvLvgLiFRG +++ FR+ AAF18502.1 82 -------------------------------------------CSY-FRA 87 VaFEyRgKiedakWkkvWDwaffiGSlvpalllGvafGnllqGlPFlvda V+ R + ++++G l+ a+G++ +G P + AAF18502.1 88 VG---R------SLV-----GLIAGALI------NALGAVSLGAP--IGM 115 dlrtsYaGsswdlLnRPfaLLcGlvlvslyalhGatflalKTeGeLqera + s+ + ++ + L+ +++v + a++Ga+++ ++ AAF18502.1 116 Q-------SLSKTIH-WSFLMSVFTVVPATAVLGASWI---------DWH 148 rklArylafvtLvavllvglwllyGiDGyvlvsiDtpatsaplakrVave r +A +++ l+ +pa+ a ++ AAF18502.1 149 RIFASLKP----IGIIEHMLL--------------VPAYGAIIGGW---- 176 igaWwfnfprmp..illalpvLgvvafllllvalrrgrygwaFiltllli +gaW +p++++++++ p+ ++ ++++ + ++ + ++ t+ + AAF18502.1 177 FGAW--PMPLDWerPWQEWPICVCYGAIGGYIGGQM----LGLMRTCEAQ 220 alailgagislfPnvmPSsldpaysLTiwnAassplTLkiMLvialiflP l++ +v+ ++ T+ + ++ + +++ + l AAF18502.1 221 YLTV---------RVIY------HGQTLSDDVACNY------WLTGPQLC 249 ivLgYtiwsYwVFRGKis<-* +Y + + V ++++ AAF18502.1 250 HRFTYMVKCCQV--QPLK 265 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- UPF0016 Uncharacterized protein family UPF0016 1.1 56 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- UPF0016 1/1 32 37 .. 71 76 .] 1.1 56 Alignments of top-scoring domains: UPF0016: domain 1 of 1, from 32 to 37: score 1.1, E = 56 *->LFliFG<-* LFliFG AAF18502.1 32 LFLIFG 37 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Thu May 23 15:20:57 2002 Sequence file: AAF18502.1.fa ---------------------------------------- Sequence AAF18502.1 (270 residues): Matching pattern PS00005 PKC_PHOSPHO_SITE: 56: TLR 153: SLK 223: TVR Total matches: 3 Matching pattern PS00006 CK2_PHOSPHO_SITE: 143: SWID 232: TLSD Total matches: 2 Matching pattern PS00007 TYR_PHOSPHO_SITE: 76: RRNPEKCSY Total matches: 1 Matching pattern PS00008 MYRISTYL: 94: GLIAGA 98: GALINA 110: GAPIGM 170: GAIIGG 174: GGWFGA 200: GAIGGY 203: GGYIGG 208: GQMLGL 212: GLMRTC 230: GQTLSD Total matches: 10 Total no of hits in this sequence: 16 ======================================== 1314 pattern(s) searched in 1 sequence(s), 270 residues. Total no of hits in all sequences: 16. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M minimal SH3 binding is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M deubiquitinating enzyme SH3 domain binding motif (Kato, 2000) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M minimal class I consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M minimal class II consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M exact 14-3-3 binding consensus (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M 14-3-3 binding motif in RAF and others (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M WW domain binding motif in formins (Bedford 1997) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >INTRA-SIGNAL-M PY motif for WW domain is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >TM-CYTOPLASMIC-M di-hydrophobic endocytosis motifs for internalized transmembrane proteins is the MOTIF name >AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. ;LENGTH=270; DIRECT_SEQUENCE n 1 solutions m %_E 161-161 %_XXXL 162-165 %_V 166-166 f >STATISTICS Total : 1 solutions in 1 sequences, 270 units; out of 1 sequences, 270 units >TM-CYTOPLASMIC-M tyrosine-based endocytosis motif for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >TM-EXTRACELL-M Endocytosis signal for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >EXTRACELL-M minimal furin protease cleavage site motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >EXTRACELL-M extended furin protease cleavage site motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >EXTRACELL-M zinc binding motif in MMPs is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >EXTRACELL-M g alpha binding go loco is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS PDX-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS QKI-5 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS HCDA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SV40 LrgT experimentally determined is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS H2B experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS v-Rel experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS RanBP3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Pho4p experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS DNAhelicaseQ1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS LEF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS TCF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR p53-NLS1 NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hum-Ku70 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS GAL4 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS act/inh betaA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS TR2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS THOV NP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS polyomaVP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS HIV-1 Tat experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS HIV-1 Rev experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Rex experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS NS5A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS adenovE1a experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS ystDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hVDR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS CPV capsid experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hGlu.cort.experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS cFOS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS cJUN experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hBLM experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hARNT experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS p54 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS hProTalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Tst1/Oct6 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS protHsc9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS protHsci experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS protHsc3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Ta alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Pax-QNR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Hunt.Dis.pro experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS opaque2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS CTP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS p110RB1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS VirD2-Nterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS VirD2-Cterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Nucloplasmin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Nucleolin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS ICP-8 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Nab2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS M9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS lscMyc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS humKprotein experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS FluA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Mat-alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SV40 VP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS SV40 VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS polyoma VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS c-myb experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS N-myc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS p53 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS c-erb-A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS yeast SKI3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS Max experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS L3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >NUCLEAR NLS dyskerin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >PDZ domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units >WW domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 270 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB+ PSSM from other authors IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. (270 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value DHHC Novel zinc finger domain with DHHC signature 23 0.47 HECT A ubiquitin conjugating enzyme domain 22 1.2 UB Ubiquitin domain 21 2.7 SNARE Alpha helical domains which are involved in vesicle fu... 19 6.0 DNAJ DNAJ co-chaperone domain (Hsp70 cofactors) 19 6.0 S1 S1 RNA binding domain 19 7.9 >DHHC Novel zinc finger domain with DHHC signature Length = 217 Score = 23.2 bits (49), Expect = 0.47 Identities = 5/65 (7%), Positives = 5/65 (7%), Gaps = 3/65 (4%) Query: 23 AFAVYVITGLFLIFGFHVVRNKYSVDLISDPTLTLRLLWIIEFPIVVIIYSLFRRNPEKC 82 Sbjct: 52 QIVAWLLYLFFAVIGFGILVPLLPHHWVPAGYACMGAIFAG---HLVVHLTAVSIDPADD 108 Query: 83 SYFRA 87 Sbjct: 109 NVRDK 113 >HECT A ubiquitin conjugating enzyme domain Length = 255 Score = 21.7 bits (46), Expect = 1.2 Identities = 5/18 (27%), Positives = 5/18 (27%), Gaps = 1/18 (5%) Query: 78 NPEKCSYFRAVGRSLVGL 95 Sbjct: 74 LDEHLMQFKFLGI-LMGV 90 >UB Ubiquitin domain Length = 128 Score = 20.5 bits (43), Expect = 2.7 Identities = 5/13 (38%), Positives = 5/13 (38%) Query: 224 VRVIYHGQTLSDD 236 Sbjct: 41 QRLIFAGKQLEDG 53 >SNARE Alpha helical domains which are involved in vesicle fusion Length = 254 Score = 19.4 bits (40), Expect = 6.0 Identities = 9/30 (30%), Positives = 9/30 (30%), Gaps = 3/30 (10%) Query: 22 GAFAVYVI---TGLFLIFGFHVVRNKYSVD 48 Sbjct: 192 GIVAFILSLLCGALNLIRGFHAIESLLQSD 221 >DNAJ DNAJ co-chaperone domain (Hsp70 cofactors) Length = 75 Score = 19.4 bits (40), Expect = 6.0 Identities = 5/22 (22%), Positives = 5/22 (22%), Gaps = 2/22 (9%) Query: 215 RTCEAQYLTVRVIYHGQTLSDD 236 Sbjct: 40 KEAEAKFKEIKEAY--EVLTDS 59 >S1 S1 RNA binding domain Length = 305 Score = 19.1 bits (39), Expect = 7.9 Identities = 9/40 (22%), Positives = 9/40 (22%), Gaps = 3/40 (7%) Query: 75 FRRNPEKCSYFRAVGRSLVGLIAGALINALGA-VSLGAPI 113 Sbjct: 15 FAKALDKHDYHAEKGQTVHGKICQ--HANEGVYVDFGGKS 52 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 6 Number of calls to ALIGN: 6 Length of query: 270 Total length of test sequences: 20182 Effective length of test sequences: 17146.0 Effective search space size: 4145178.1 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. (270 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|124505 [1..400] Sugar phosphatases 24 3.2 gi|1465871 [5..212] NAD(P)-binding Rossmann-fold domains 24 3.5 gi|1173357 [36..558] Periplasmic binding protein-like II 23 6.7 gi|1469051 [197..455] (Phosphotyrosine) protein phosphatases... 23 7.2 gi|1546735 [56..315] (Phosphotyrosine) protein phosphatases ... 23 7.7 >gi|124505 [1..400] Sugar phosphatases Length = 400 Score = 23.8 bits (51), Expect = 3.2 Identities = 12/88 (13%), Positives = 12/88 (13%), Gaps = 8/88 (9%) Query: 65 FPIVVIIYSLFRRNPE-----KCSYFRAVGRSLVGLIAGALINAL--GAVSLGAPI-GMQ 116 Sbjct: 165 SADITPNQGIFPSGLQCVTVLIGVYDIQTGVPLMGVINQPFVSQDLHTRRWKGQCYWGLS 224 Query: 117 SLSKTIHWSFLMSVFTVVPATAVLGASW 144 Sbjct: 225 YLGTNIHSLLPPVSTRSNSEAQSQGTQN 252 >gi|1465871 [5..212] NAD(P)-binding Rossmann-fold domains Length = 208 Score = 23.9 bits (50), Expect = 3.5 Identities = 5/44 (11%), Positives = 5/44 (11%), Gaps = 2/44 (4%) Query: 195 ICVCYGAIGGYIGGQMLGLMRTCEAQYLTVRVIYHGQTLSDDVA 238 Sbjct: 167 VCLRFRGIYGPEDPNVTLKVANLIKNGLFIGMVSAHGR--ESVS 208 >gi|1173357 [36..558] Periplasmic binding protein-like II Length = 523 Score = 23.0 bits (49), Expect = 6.7 Identities = 7/23 (30%), Positives = 7/23 (30%) Query: 169 YGAIIGGWFGAWPMPLDWERPWQ 191 Sbjct: 417 YDLILSGWLAGNLDPDGFMRPIL 439 >gi|1469051 [197..455] (Phosphotyrosine) protein phosphatases II Length = 259 Score = 22.7 bits (48), Expect = 7.2 Identities = 1/28 (3%), Positives = 1/28 (3%) Query: 58 RLLWIIEFPIVVIIYSLFRRNPEKCSYF 85 Sbjct: 83 QMIIEQRVKCVLLLLTDSEYDSLGGDFV 110 >gi|1546735 [56..315] (Phosphotyrosine) protein phosphatases II Length = 260 Score = 22.7 bits (48), Expect = 7.7 Identities = 4/28 (14%), Positives = 4/28 (14%) Query: 58 RLLWIIEFPIVVIIYSLFRRNPEKCSYF 85 Sbjct: 61 QMVFEQKSPAIVMLLEDVELGIEKSDKY 88 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 5 Number of calls to ALIGN: 5 Length of query: 270 Total length of test sequences: 256703 Effective length of test sequences: 216517.0 Effective search space size: 51198248.0 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Thu May 23 15:21:22 2002 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 AAF18502.1 using self-comparison ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ TIGRFAM hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00916 2A0604s01: protein-export membrane protein -125.6 11 1 TIGR00861 MIP: MIP family channel proteins -143.7 78 1 TIGR00790 fnt: formate/nitrite transporter -169.7 43 1 TIGR00445 mraY: phospho-N-acetylmuramoyl-pentapeptide -242.0 14 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00916 1/1 1 134 [. 1 247 [] -125.6 11 TIGR00861 1/1 25 182 .. 1 258 [] -143.7 78 TIGR00790 1/1 5 185 .. 1 279 [] -169.7 43 TIGR00445 1/1 1 227 [. 1 336 [] -242.0 14 Alignments of top-scoring domains: TIGR00916: domain 1 of 1, from 1 to 134: score -125.6, E = 11 *->LdeeGakiFadfTakniGtqkreslaivldnakvisapvvgeaiqpi +e a +n is+ + AAF18502.1 1 MKE----------------------AKEKKN-PEISVSITI------ 18 tGGsgqItGnFtieeAqdLAllLRsGaLPapikileertiGPslGaelir +++Ga + AAF18502.1 19 -----------------------------------------STWGAFAVY 27 aGilAlliaLvlVllYmllrYewrgaiAaiallvhDLvililavlslfgG ++ +L l++ + ++r + + + l++D +l+l +l ++ AAF18502.1 28 -----VITGLFLIFGFHVVRNKYS------VDLISD-PTLTLRLLWIIE- 64 atltLpgIAGllliIGySVddtVVIFdRiREelrdiKykgrtlreainlg + +++I yS+ +r r++++ + AAF18502.1 65 FP---------IVVIIYSL-------------FR------RNPEKC---S 83 fnqtLsriidTnvTTLlAalaLfvfGgGaik.....kGFAltlliGviag + ++ r +++ L+A + + ++G +++ + + + t + ++ + AAF18502.1 84 YFRAVGR----SLVGLIAGALINALGAVSLGapigmQSLSKTIHWSFLMS 129 tySsI<-* +++++ AAF18502.1 130 VFTVV 134 TIGR00861: domain 1 of 1, from 25 to 182: score -143.7, E = 78 *->AEflGTfllVffGvGsvlgvnvaskiiDkmksLssaygavGggasvp A + T l+++fG +v n+ ++v ++ s p AAF18502.1 25 AVYVITGLFLIFGFHVVR--NK---------------YSV-DLISDP 53 gqflgiaLAfGLavatlvyavggkISGAHlNPAVTiAlllgrrfplirvp l ++ + +++++y+ + + NP + + +r++ AAF18502.1 54 TLTLRLLWIIEFPIVVIIYSLFR-R-----NP---------EKCSYFRAV 88 vYiVAQliGAilGaallygltsgaitefegnlavagsvlsaGifatfPks l+G i+Ga++ ++ ++ g + s ls AAF18502.1 89 GRS---LVGLIAGALINALGAVSL-----GAPIGMQS-LSKT-------- 121 yvsagqaffvEfIgTaiLvlvilattdpdrnrvprggilaPLaIGllval +++ + + +++++ + ++ + AAF18502.1 122 -IHWS------------FLMSVFTVV--PATA-----------------V 139 ihlslgpyTGtgmNPARslGPalvagl..igwgnkaFtvghwkYhWvywv ++s ++ +++a+l++ig+ ++ v+ AAF18502.1 140 LGASWIDW------------HRIFASLkpIGIIEHM--------LLVPAY 169 GPivGailgalvY<-* G i+G+ +ga+ AAF18502.1 170 GAIIGGWFGAWPM 182 TIGR00790: domain 1 of 1, from 5 to 185: score -169.7, E = 43 *->KaalsllsllvlSilAGafialGaHASgLlvavvtggtaiegappgl K+++++ ++ i+ + Ga A + ++++++++ + ++ AAF18502.1 5 KEKKNPEISVSITIS-----TWGAFAVYVITGLFLIFGFHVVRNKYS 46 vkLVgGavFpiGLilVViaGsdLFTGNtlfvtmAllrgkISwldllinWv v L+ + ++ L+ ++ + + + l+r+ +++ + AAF18502.1 47 VDLISDPTLTLRLLWIIEF------PIVVII-YSLFRRNPEKCSYFRAVG 89 vswlGNLVGalfvAylisylggiltangAVEKNlwgvgslaiAehKvhgG s++G ++Ga li +lg ++ + g sl +++h AAF18502.1 90 RSLVGLIAGA-----LINALGAVSLGA------PIGMQSLS---KTIH-- 123 AFIAAGKvvShTFvelfllGilCNwlVCLAvwlslsakdlagKffamwfP ++ l + v+ ++ a+ + g ++ w AAF18502.1 124 ------------WSFLMS------------VFTVVPATAVLGASWIDWHR 149 ifaFVAsGFEHsVANMflipiaifighfasPeFWqlvgsapenFadltvg ifa +l pi+i+ + AAF18502.1 150 IFA-------------SLKPIGIIEHM----------------------- 163 qfivkNLIPvtLGNivGGavfvGli.Ywylylk<-* +L P+ Ga+ +G+++ w l+ AAF18502.1 164 -----LLVPAY------GAIIGGWFgAWPMPLD 185 TIGR00445: domain 1 of 1, from 1 to 227: score -242.0, E = 14 *->SlllgPkvIlkLkkLKagQviredGPkSHlkKkAGTPTMGGImivfa + +k +K P i + AAF18502.1 1 --------MKEAKEKK--------NP------E----------ISVS 15 iivstvlwaqllseiNpyvlvvlfVllGYgfiGFvDDyrKikrKsnkGLt i +st w ++ +yv+ +lf +G+ + + K + L+ AAF18502.1 16 ITIST--WGAFA----VYVITGLFLIFGFHV---------VRNKYSVDLI 50 ..AkqKlfgqiiiAl.ifvvwLyyy..GldtdatdiyiPfikdflfdLGt +++ l ++ ii ++i+v ++++++++++ AAF18502.1 51 sdPTLTLRLLWIIEFpIVVIIYSLFrrNPEK------------------- 81 llYillalFvLVgtSNAvNLTDGLDGLAigpsaiafgaLailawatGnan ++F Vg S L GL +g + a+ga ++ a AAF18502.1 82 -----CSYFRAVGRS--------LVGLIAGALINALGAVSLGAPIG-MQS 117 FakYLhiPkikdsgELviFctAlvGaslGFLwFNtyPAkVFMGDtGSLAL +k h + s+ v+ tA++Gas + ++F AAF18502.1 118 LSKTIHWSFL-MSVFTVVPATAVLGASW------IDWHRIF--------- 151 GGaLgavAIltknEiLLv.....imGGVFViETLSVIlQVgsYKttKGPD ++L ++I+ + LLv+ + i+GG F + + + AAF18502.1 152 -ASLKPIGIIEH--MLLVpaygaIIGGWFGAWPMP---------LDW--- 186 GvGKRiFkMAPIHHHfElkGWsEprVvvrF.....WiIsLLlalv....a + W E+ + v + +++i + +l l+++ +a AAF18502.1 187 -----------------ERPWQEWPICVCYgaiggYIGGQMLGLMrtceA 219 iLatlKvR<-* ++t+ v AAF18502.1 220 QYLTVRVI 227 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm-f Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00710 efflux_Bcr_CflA: drug resistance transporte 5.0 2.9 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00710 1/1 125 160 .. 133 174 .. 5.0 2.9 Alignments of top-scoring domains: TIGR00710: domain 1 of 1, from 125 to 160: score 5.0, E = 2.9 *->silmlvlalaPavAPliGGAiyvlvflsWhaIFavLsLigil<-* s+lm+v++++Pa A +G+ + Wh IFa L igi+ AAF18502.1 125 SFLMSVFTVVPATA-VLGA-----SWIDWHRIFASLKPIGII 160 // SMART hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/iprscan/data/smart.HMMs Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // COG hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG0597 -91.1 36 1 COG0839 -95.9 45 1 COG0697 -102.0 34 1 COG0730 -127.0 82 1 COG2011 -138.4 53 1 COG0580 -152.1 19 1 COG1108 -172.4 20 1 COG0004 -367.5 72 1 COG1113 -424.7 48 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG0839 1/1 19 158 .. 1 192 [] -95.9 45 COG0597 1/1 3 159 .. 1 213 [] -91.1 36 COG1108 1/1 1 172 [. 1 292 [] -172.4 20 COG2011 1/1 1 180 [. 1 289 [] -138.4 53 COG1113 1/1 1 194 [. 1 481 [] -424.7 48 COG0580 1/1 5 199 .. 1 312 [] -152.1 19 COG0730 1/1 11 215 .. 1 287 [] -127.0 82 COG0004 1/1 1 240 [. 1 466 [] -367.5 72 COG0697 1/1 23 269 .. 1 321 [] -102.0 34 Alignments of top-scoring domains: COG0839: domain 1 of 1, from 19 to 158: score -95.9, E = 45 *->mmietlaFylfaalaiasalgVVlakNpvYSalyLilsllsiAglff +++ ++a y++++l +++++ VV++k YS+ ++ l + l++ AAF18502.1 19 STWGAFAVYVITGLFLIFGFHVVRNK---YSVDLISDPTLTLRLLWI 62 llgAeFlgvlqviVYvGAVmVLflFvvMmlnigveeikeeerrylssivg + eF + vi+Y ++++ + +e+ +r + +s vg AAF18502.1 63 I---EF--PIVVIIY------------SLFRRN-PEKCSYFRAVGRSLVG 94 llvapvaLvlililsivyilialpsplgiaaadinktgNlkaiGavLFtd l++++ + +l ++ ++i +++ s + ++ + AAF18502.1 95 LIAGALINALGAVSLGAPIGMQSLS--KTIH----------------WSF 126 YLlpFElasvLLLvAmVgAIaLarrkrlgrkktdrkkqdekqktk<-* + +F ++ + +A++gA + +++ + ++ AAF18502.1 127 LMSVFTVVPA---TAVLGASWIDWHRIFA----------SLKPIG 158 COG0597: domain 1 of 1, from 3 to 159: score -91.1, E = 36 *->kikeptffsdiMkqtlsanlksglwyllvailvvvilDqltKllvla +ke ++ + + t ++++++++ v+ AAF18502.1 3 EAKEKKNPEISVSIT--------ISTWGAFAVYVI------------ 29 nfklgsgflvaetvpvipstwghfffnitlvrNtGAAFS..lLsdqgGwq +g +i + +++vrN+ S +l+sd + AAF18502.1 30 ---TG--------LFLIFG--------FHVVRNK---YSvdLISD-PTLT 56 lwlllllalaviillvffllrlkkgqrllai...alglIlGGAlGNliDR l ll ++ +++++++++++ r ++++ + +++l +++GAl N + AAF18502.1 57 LRLLWIIEFPIVVIIYSLFRRNPEKCSYFRAvgrSLVGLIAGALINALGA 106 lfrGsghVVDFidfhwggiygdywkerfPiFNlADvsIviGailLvldsf G + + +++ i+ w + ++F + + v+Ga + ++ + AAF18502.1 107 VSLG--APIGMQSLSKT-IH---WSFLMSVFTVVPATAVLGASWIDWHRI 150 fpkdikekekkkneddkkkqs<-* f + k+ + AAF18502.1 151 FA-------SL-----KPIGI 159 COG1108: domain 1 of 1, from 1 to 172: score -172.4, E = 20 *->mmslLlellqyeflqnAllagllvslacgllGsflVlRRmaligDAl m ++ AAF18502.1 1 MKEAKEKKNPE------------------------------------ 11 SHaaLpGvalgyLltgqlglfllginpllgaiafgllgAlligflrrksk ++++ ++ + ++ + ++++gl++ ++ + +r+k+ AAF18502.1 12 -------ISVS--------ITISTWGAFAVYVITGLFLIFGFHVVRNKY- 45 vkeDtaiGivfssglalGvvllsllpgfnnsavdlmnyLFGniLaVsptD + l++ + +++ + AAF18502.1 46 --------------------SVDLISDP----------------TLTLRL 59 LwliaivslGPsVlclvllllllfyrelllisFDpefAkvlGipvrllhy Lw+i + ++ v++ l+++++ +++ ++ AAF18502.1 60 LWIIEFPIV-------VIIYSLFRRNPEKCS---------------YFRA 87 llllLialtiVaavkaVGviLvsaLLiiPaatArllsrslrsmliiAill + L++l a++ a G++ ++ A + sl++ + +++l+ AAF18502.1 88 VGRSLVGLIAGALINALGAV---------SLGAPIGMQSLSKTIHWSFLM 128 gllsgvlG..vlllSyyldlppGpsivliatllfllslllrkkygvl<-* +++++v ++ vl S+++ + +s i ++ +ll ++yg++ AAF18502.1 129 SVFTVVPAtaVLGASWIDWHRIFASLKPIGIIEH---MLLVPAYGAI 172 COG2011: domain 1 of 1, from 1 to 180: score -138.4, E = 53 *->lskk.edrvsmslsnlwgtlaldallratfeTlyMvgaslffavllG + ++e++ + +++ + ++ s++ a+++ AAF18502.1 1 MKEAkEKKNP----EIS------VSITI----------STWGAFAVY 27 iplgllisgappvqsvfvqatllfdvLfltspggletll.alantlvmal ++ gl l+f++ + ++ + + ++ ++ + AAF18502.1 28 VITGLF---------------LIFGFHVVRNKYSVDLISdPTLTLRLLWI 62 lga.lvnilrsiPigltFlaAanlrvnplssarlivlsriLlvvLrpvtr + +++v i+ s++ ++ + + r+v r AAF18502.1 63 IEFpIVVIIYSLF--RRN--------------------PEKCSYFRAVGR 90 livGliFvaatglGptAailaLaIgaiPflArLveeaLPeevdkGviEAa +vG ++A ++ +aL a AAF18502.1 91 SLVG------------------------LIAGALINAL----------GA 106 rAmGAsklqiiakvlLPealPglisgvtytliilirySAmaGaiGAGGLG +++GA+ i++ ++l i+++ ++ ++++ + A AAF18502.1 107 VSLGAP----IGM----QSLSKTIHWSFLMSVFTV--------VPA---- 136 dlairygayqrfnwdvmaviv..lvilvlVlliqsigdrirrrllkr<-* +a + + +w+ +++ ++++ i+ ll+ g+ i + AAF18502.1 137 -TAVLGA--SWIDWHRIFASLkpIGIIEHMLLVPAYGAIIGGWFGAW 180 COG1113: domain 1 of 1, from 1 to 194: score -424.7, E = 48 *->MdpslvmaddnqdaeqeLkRgLknRHIqlIAlGGAIGTGLFlGSgsa M+ +a+ ++ AAF18502.1 1 MK----EAKEKK----------------------------------- 8 IqmAGPsvlLaYaIaGlivflIMRaLGEMavanPvaGSFsdYArkylGpw nP+ AAF18502.1 9 --------------------------------NPE--------------- 11 AGFltGWlYWflWvlvgmaElTAvgiYmqyWfpaFPdvPqWiwALialvl + v+i + W ++ a+ + AAF18502.1 12 ---------------------ISVSITISTW------------GAFAVYV 28 llavNLisVKlFGElEFWFAlIKVaAIvafIvlGlvllfggfgggg.tGf ++ Gl l+fg++ ++ ++ AAF18502.1 29 IT-------------------------------GLFLIFGFHVVRNkYSV 47 sNLwahtGGFFPnGllGlllalqvvvFAFgGiElvGitAgEakdPeksip ++++ + AAF18502.1 48 DLISDP----------------------------------------T--- 54 kAinsViwRIliFYvGslfvilslyPWnqvgsggsgSPFVtvFskiGipa + + + ++ + +++++y + + ++++ + F+++G AAF18502.1 55 -------LTLRLLWIIEFPIVVIIYSLFRRNPEK-----CSYFRAVGRSL 92 AasImNfVVLTAAlSslNSGlYstsRMLysL.AeqGdAPkffaklsrrGV ++ I ++ L+A+ AP ++ ls+ AAF18502.1 93 VGLI-------------------AGALINALgAVSLGAPIGMQSLSK--- 120 PvnAillsavvlllgVlLNYvaPsvekvFelltsssglgalfv.WlmIll +i s++ +vF+ + + ++lga +++W I++ AAF18502.1 121 ---TIHWSFL---------------MSVFTVVPATAVLGASWIdWHRIFA 152 sqLkfRkarkpaqgaalkFkmpLyPfttyltlaFllfvLvlMafdpdtRi s + p + i AAF18502.1 153 S------------------------------------------LKPIGII 160 slfvtpiwlvlLvigYlvfgkrrkaeahgarpvkaee<-* + ++++ + ++++g+ fg +++ +++ p++ + AAF18502.1 161 EHMLLVPAYGAIIGGW--FGAWPMPLDWER-PWQEWP 194 COG0580: domain 1 of 1, from 5 to 199: score -152.1, E = 19 *->mkktlmkefiAEflGTflLIifGcGavaqvklikssaaiPnfeLGig + k+ ++ + T G+ av+ + AAF18502.1 5 KEKKNPEISVSITISTW-----GAFAVYVIT---------------- 30 skGgwieialaWGlaVamgiYaaGgiSGgaAHLNPAVTIaLalvgkFPwk G + l++G+ V+ Y + iS P T+ L + FP AAF18502.1 31 --G----LFLIFGFHVVRNKYSVDLISD------PTLTLRLLWIIEFPIV 68 kaLlWvVlpYIvAQiiGAfiaAalvyllysgkhlietdngnhIvRGtqes ++y l+ + + AAF18502.1 69 V----------------------IIYSLFRR------NP----------- 79 lavlGvfcTsPkavisflraflsEfvgTaiLvlgIlaigddannkrapng + ++f + ++++++ l+ +g L + I + + + + + AAF18502.1 80 -EKCSYFRAVGRSLVGLIAGALINALGAVSLGAPIGMQS--LSK---TIH 123 plapLiiGllVagIgislGgPlTGtAiNPARDLGPRlaaslaGiglaftk + L+ ++V+ + ++lG+ + R++asl +ig AAF18502.1 124 -WSFLMSVFTVVPATAVLGASWIDW---------HRIFASLKPIG----- 158 gdsnwhYfWiPviGPIiGAilGAliYqvliygGhESPVrygdkriearll h +++P G IiG+ +GA+ AAF18502.1 159 -II-EHMLLVPAYGAIIGGWFGAWPMP----------------------- 183 fllsrflpvn.aegae<-* +r+ +++ ++ AAF18502.1 184 LDWERPWQEWpICVCY 199 COG0730: domain 1 of 1, from 11 to 215: score -127.0, E = 82 *->mmnlalllllflvGfiaGflaglfGiGGGlllvPaLlflgg.lpphv +++ +++ + +G +++ ++ L++++g ++ AAF18502.1 11 EIS-------------VSITISTWG-AFAVYVITGLFLIFGfHVVRN 43 AvGTsllqvlftalssayahlkkgnvdlklglilglgsvvGaalGallak +l++ + +l+++ AAF18502.1 44 KYSVDLISDPTL-----------------TLRLLWII------------- 63 lipaeikyeaLklifgllllllalymllskkieikpkreekpnkkslplp f++++++++l ++ p++++++++ ++l AAF18502.1 64 -----E--------FPIVVIIYSLF--RR-----NPEKCSYFRAVGRSL- 92 glggfyepleieyavrawlliliGfliGflsGllGiGGGsllvPaLilll +G+++G l lG+ + + +l AAF18502.1 93 ----------------------VGLIAGALINALGA-VSLGAPIGMQSL- 118 gyslplkkavgtsllaialsslisfllyl.lggp........vdwllalg + + ++ + s++ + ++++ +++++++ + ++ +++ ++l+ AAF18502.1 119 S--KTIHWSFLMSVFTVVPATAVLGASWIdWHRIfaslkpigIIEHMLLV 166 lvlGsiiGaylGarlasrkligprflrllfalilllvalklllkglgmll ++G+iiG+++Ga+ + + + +++++ +++ ++ + lg+ + AAF18502.1 167 PAYGAIIGGWFGAWPMPL-DWERPWQEWPICVCYGAIGGYIGGQMLGLMR 215 <-* AAF18502.1 - - COG0004: domain 1 of 1, from 1 to 240: score -367.5, E = 72 *->mkratfsWlerlwallvlslvatalaqdepatNvsavttvldsgDta mk a + + + v+ ++ AAF18502.1 1 MKEAKEK-------KNPEISVS------IT----------------- 17 WmLlsaaLVllMtPGlAllYgGlvRaKNvlntimkslvdfaivsllWffv +++++ ++ +++ AAF18502.1 18 ------------------------------------ISTWG-AFAVYVIT 30 GYSLAFGddagvvngFiGtlgffllgnvl.........lpdlvffaFQlm G L FG+ + +n ++ + ++++l + + + AAF18502.1 31 GLFLIFGFH--------------VVRNKYsvdlisdptLTLRLLW----- 61 FAaitvaIIsGAvAERmKFsAyLlFsvLwttlvYpPvAHWVWGggGWLak + + ++ ++Y+++ +++ + AAF18502.1 62 -----------------------IIEFPIVVIIYSLFRR---NPEKCSY- 84 lGfmaLDFAGGtVVHingGfaALaaallLGkRkGkydddGkpnaipPHNi f ++ l+G G + + AAF18502.1 85 --FR------------------AVGRSLVGLIAGALIN----------AL 104 pmvvLGaalLWFGWFGFNaGSAlaangea.nggvaalafvnTnlAAAaGm + v LGa + G +++ + ++++ +++f ++ A++ AAF18502.1 105 GAVSLGAPI----------GMQSLSK--TiHWSFLMSVF---TVVPATAV 139 lgWmliewlrnGrKPtvlglasGavAGLVAITPaaGfVspwgAl.iIGlv lg+ i+w r +++l++IG++ AAF18502.1 140 LGASWIDWHRI-----------------------------FASLkPIGII 160 AGvicylavkllkkklkiDDaLDvfgVHGvgGvwGtiltGiFAapsvgam +l + + G+i G F+a + AAF18502.1 161 --------EHMLLVPA-----------------YGAIIGGWFGAWPM--- 182 agisslgaGgllgGnqmlkQLgvQiigvvvvaayafvvtfillkllkltl l + Q ++v+ y++++++i l+l AAF18502.1 183 ---------PLDWERP------WQEWPICVC--YGAIGGYIGGQMLGLMR 215 GgLRVSeeeElvGLDlseHgetAYseeevi<-* e++++ + Hg+t +++ + AAF18502.1 216 T-----CEAQYLTVRVIYHGQTLSDDVACN 240 COG0697: domain 1 of 1, from 23 to 269: score -102.0, E = 34 *->llgllllllaaliWGssfvliKlaledgnyrifvvppltivflRfli ++ + li+G++ v K+ ++ + +p + + l ++i AAF18502.1 23 AFAVYVITGLFLIFGFHVVRNKYSVDLI------SDPTLTLRLLWII 63 aallLlpllllrrrlprr.pgkllllglgsnvetlllllglllglflyfl ++++++++ l rr p + + +++++ l+gl+ + l AAF18502.1 64 EFPIVVIIYSLFRRNPEKcSYFRAVGRS-------------LVGLIAGAL 100 lffyalqytsasvaalltslaPlftallaflllgerltrlqllglllafl +++ ++a ++ +s + ++ l++++ + a+l AAF18502.1 101 INALGAVSLGAPIGMQSLSKTIHWSFLMSVFTVVP----------ATAVL 140 Gvvllll.lggllllGillaLlaalswAlytvlgkrlsarvprptplplt G+ ++ ++ ++ l k ++ ++ +l AAF18502.1 141 GASWIDWhRIFASL--------------------KPIGIIEH-----MLL 165 ivflqlllgallllplalflggpplssptlpspfielsfwlallylgvla + ++ +++g+ + ++ +++ + p ++ w++++ +g ++ AAF18502.1 166 VPAYGAIIGGWFGAWPMPLDWERP------------WQEWPICVCYGAIG 203 t.algyllwnyalrrlgaskaallsylePvf.alllgwllLgErpltlaq + ++g++l + + ++ ++ + + + ++ ++l g l ++ AAF18502.1 204 GyIGGQMLGLMRTCEAQYLTVRVIYHGQTLSdDVACNYWLTGPQ-LCHRF 252 llGaalilaGillvslrarkrarkaase<-* + v + + +k++++ AAF18502.1 253 ----------TYMVKCCQV-QPLKHGPT 269 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm-f Sequence file: AAF18502.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: AAF18502.1 ESTs gb|AI992787, gb|T20398 come from this gene. [Arabidopsis thaliana]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG0839 3.9 4.9 1 COG1375 1.6 18 1 COG0038 1.0 18 1 COG1792 -0.3 51 1 COG1966 -2.0 82 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG0839 1/1 19 44 .. 1 26 [. 3.9 4.9 COG1792 1/1 26 47 .. 20 41 .. -0.3 51 COG1375 1/1 35 48 .. 188 203 .] 1.6 18 COG0038 1/1 75 108 .. 1 41 [. 1.0 18 COG1966 1/1 115 127 .. 1 13 [. -2.0 82 Alignments of top-scoring domains: COG0839: domain 1 of 1, from 19 to 44: score 3.9, E = 4.9 *->mmietlaFylfaalaiasalgVVlak<-* +++ ++a y++++l +++++ VV++k AAF18502.1 19 STWGAFAVYVITGLFLIFGFHVVRNK 44 COG1792: domain 1 of 1, from 26 to 47: score -0.3, E = 51 *->lliiillslvlllsdsrggils<-* +++i+ l+l+++++++r+++++ AAF18502.1 26 VYVITGLFLIFGFHVVRNKYSV 47 COG1375: domain 1 of 1, from 35 to 48: score 1.6, E = 18 *->iFpvRGevFRdayhee<-* iF + v+R+ y+ + AAF18502.1 35 IFGF--HVVRNKYSVD 48 COG0038: domain 1 of 1, from 75 to 108: score 1.0, E = 18 *->frfetsstpllrkvflrllsgrailigviagllavalllal<-* fr++ +++++r+v r l+g+iag l++al ++ AAF18502.1 75 FRRNPEKCSYFRAVG-RS------LVGLIAGALINALGAVS 108 COG1966: domain 1 of 1, from 115 to 127: score -2.0, E = 82 *->mkkllkklaWvvV<-* m++l+k + W ++ AAF18502.1 115 MQSLSKTIHWSFL 127 //