analysis of sequence from tem2 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRK VYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTK EAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFSMAKLPHEMSP ALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSVNSDLKYIKAKVLREGQARERDKCTIQ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > gi|9857402|gb|AAG00868.1|AF279143_1 . . . . . 1 MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSR 50 ___EEEE_HHHHHH_________EEE_____EEEEEE_________EEEE . . . . . 51 FLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSI 100 E_____________HHHHHHHHH______EEEEEE_______HHHHHHHH . . . . . 101 LTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVIC 150 ____EEEEEEE_____HHHHHHHHHHHHHHHHH_HHHHHHHHHH_EEEEE . . . . . 151 GNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFS 200 ________________HHHHHH______EEEEEEE_______EEEEHHHH . . . . . 201 MAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRP 250 HH________HHHHHHHHH_________HHHHHHHHHH__EEE_______ . . 251 SVNSDLKYIKAKVLREGQARERDKCTIQ 278 ___HHHHHHHHHHHHHHHHHHHH_____ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 40.5 % beta-contents : 21.9 % coil-contents : 37.6 % class : mixed method : 2 alpha-contents : 35.8 % beta-contents : 15.8 % coil-contents : 48.4 % class : mixed ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -13.30 -3.19 -3.01 -2.07 0.00 0.00 0.00 -0.72 -2.37 -8.39 -2.95 -12.00 -12.00 -8.00 -12.00 0.00 -79.99 2.31 -2.44 -3.17 -1.12 0.00 0.00 -28.00 0.00 -1.23 -9.86 -2.95 -12.00 0.00 0.00 -12.00 0.00 -70.47 ID: gi|9857402|gb|AAG00868.1|AF279143_1 AC: xxx Len: 210 1:I 200 Sc: -70.47 Pv: 5.760329e-01 NO_GPI_SITE GPI: learning from protozoa -25.51 -2.78 -2.04 -1.61 -4.00 0.00 0.00 0.00 -0.73 -5.18 -10.39 -12.00 -12.00 -8.00 -12.00 0.00 -96.25 -26.47 -0.98 -0.01 -0.08 -4.00 0.00 0.00 0.00 0.00 -5.00 -10.39 -12.00 -12.00 -8.00 -12.00 0.00 -90.93 ID: gi|9857402|gb|AAG00868.1|AF279143_1 AC: xxx Len: 210 1:I 184 Sc: -90.93 Pv: 7.130055e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|9857402| 0.304 118 N 0.259 118 N 0.659 6 N 0.176 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|9857402| 0.457 15 N 0.243 15 N 0.890 9 Y 0.437 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? gi|9857402| 0.377 143 N 0.217 116 N 0.917 42 N 0.382 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] 1-278 MPASLALLQPRAMMKTLSSGNCTLSVPAKN SYRMVVLGASRVGKSSIVSRFLNGRFEDQY TPTIEDFHRKVYNIRGDMYQLDILDTSGNH PFPAMRRLSILTGDVFILVFSLDNRESFDE VKRLQKQILEVKSCLKNKTKEAAELPMVIC GNKNDHGELCRQVPTTEAELLVSGDENCAY FEVSAKKNTNVDEMFYVLFSMAKLPHEMSP ALHRKISVQYGDAFHPRPFCMRRVKEMDAY GMVSPFARRPSVNSDLKYIKAKVLREGQAR ERDKCTIQ low complexity regions: SEG 25 3.0 3.3 >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] 1-278 MPASLALLQPRAMMKTLSSGNCTLSVPAKN SYRMVVLGASRVGKSSIVSRFLNGRFEDQY TPTIEDFHRKVYNIRGDMYQLDILDTSGNH PFPAMRRLSILTGDVFILVFSLDNRESFDE VKRLQKQILEVKSCLKNKTKEAAELPMVIC GNKNDHGELCRQVPTTEAELLVSGDENCAY FEVSAKKNTNVDEMFYVLFSMAKLPHEMSP ALHRKISVQYGDAFHPRPFCMRRVKEMDAY GMVSPFARRPSVNSDLKYIKAKVLREGQAR ERDKCTIQ low complexity regions: SEG 45 3.4 3.75 >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] 1-278 MPASLALLQPRAMMKTLSSGNCTLSVPAKN SYRMVVLGASRVGKSSIVSRFLNGRFEDQY TPTIEDFHRKVYNIRGDMYQLDILDTSGNH PFPAMRRLSILTGDVFILVFSLDNRESFDE VKRLQKQILEVKSCLKNKTKEAAELPMVIC GNKNDHGELCRQVPTTEAELLVSGDENCAY FEVSAKKNTNVDEMFYVLFSMAKLPHEMSP ALHRKISVQYGDAFHPRPFCMRRVKEMDAY GMVSPFARRPSVNSDLKYIKAKVLREGQAR ERDKCTIQ low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQY TPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDE VKRLQKQILEVKSCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAY FEVSAKKNTNVDEMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAY GMVSPFARRPSVNSDLKYIKAKVLREGQARERDKCTIQ 1 - 278 MPASLALLQP RAMMKTLSSG NCTLSVPAKN SYRMVVLGAS RVGKSSIVSR FLNGRFEDQY TPTIEDFHRK VYNIRGDMYQ LDILDTSGNH PFPAMRRLSI LTGDVFILVF SLDNRESFDE VKRLQKQILE VKSCLKNKTK EAAELPMVIC GNKNDHGELC RQVPTTEAEL LVSGDENCAY FEVSAKKNTN VDEMFYVLFS MAKLPHEMSP ALHRKISVQY GDAFHPRPFC MRRVKEMDAY GMVSPFARRP SVNSDLKYIK AKVLREGQAR ERDKCTIQ low complexity regions: DUST >gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQY TPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDE VKRLQKQILEVKSCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAY FEVSAKKNTNVDEMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAY GMVSPFARRPSVNSDLKYIKAKVLREGQARERDKCTIQ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for gi|9857402|gb|AAG00868.1|AF279143_1 sequence: 210 amino acids, 23 residue(s) in coiled coil state Coil 1 * 118* 141* DEVKRLQKQILEVKSCLKNKTKEA . | . | . | . | . | . 60 MPASLALLQP RAMMKTLSSG NCTLSVPAKN SYRMVVLGAS RVGKSSIVSR FLNGRFEDQY ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 TPTIEDFHRK VYNIRGDMYQ LDILDTSGNH PFPAMRRLSI LTGDVFILVF SLDNRESFDE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~233333355 * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- -fgabcdefg * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~11111144 * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~133333333 * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~14444999 * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~12555599 * 14 M'95 -w local . | . | . | . | . | . 180 VKRLQKQILE VKSCLKNKTK EAAELPMVIC GNKNDHGELC RQVPTTEAEL LVSGDENCAY 5555555555 5555555555 54444~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border abcdefgabc defgabcdef gabcd----- ---------- ---------- ---------- * 21 M'95 -w register 4444444444 4444444444 43333~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar 3333333333 3331111111 11111~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class 9999999999 9999999999 999993~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. 9999999999 998~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | FEVSAKKNTN VDEMFYVLFS MAKLPHEMSP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ---------- ---------- ---------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** tem2.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem2.___inter___ (1 sequences) MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSR FLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSI LTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVIC GNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFS MAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRP SVNSDLKYIKAKVLREGQARERDKCTIQ (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment Loop length 278 K+R profile + CYT-EXT prof 0.48 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.1233 NEG: 32.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.48 -> Orientation: N-out ---------------------------------------------------------------------- "tem2" 278 ************************************ *TOPPREDM with prokaryotic function* ************************************ tem2.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: tem2.___inter___ (1 sequences) MPASLALLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSR FLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSI LTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVIC GNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFS MAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRP SVNSDLKYIKAKVLREGQARERDKCTIQ (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment Loop length 278 K+R profile + CYT-EXT prof 0.48 For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: 0.00 Tm probability: 1.00 -> Orientation: undecided Charge-difference over N-terminal Tm (+-15 residues): 4.00 (NEG-POS)/(NEG+POS): -0.1233 NEG: 32.0000 POS: 41.0000 -> Orientation: N-in CYT-EXT difference: 0.48 -> Orientation: N-out ---------------------------------------------------------------------- "tem2" 278 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ NOW EXECUTING: /bio_software/1D/stat/saps/saps-stroh/SAPS.SSPA/saps /people/maria/tem2.___saps___ SAPS. Version of April 11, 1996. Date run: Fri Oct 13 19:37:29 2000 File: /people/maria/tem2.___saps___ ID gi|9857402|gb|AAG00868.1|AF279143_1 DE tumor endothelial marker 2 [Homo sapiens] number of residues: 278; molecular weight: 31.6 kdal 1 MPASLALLQP RAMMKTLSSG NCTLSVPAKN SYRMVVLGAS RVGKSSIVSR FLNGRFEDQY 61 TPTIEDFHRK VYNIRGDMYQ LDILDTSGNH PFPAMRRLSI LTGDVFILVF SLDNRESFDE 121 VKRLQKQILE VKSCLKNKTK EAAELPMVIC GNKNDHGELC RQVPTTEAEL LVSGDENCAY 181 FEVSAKKNTN VDEMFYVLFS MAKLPHEMSP ALHRKISVQY GDAFHPRPFC MRRVKEMDAY 241 GMVSPFARRP SVNSDLKYIK AKVLREGQAR ERDKCTIQ -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 18( 6.5%); C : 7( 2.5%); D : 15( 5.4%); E : 17( 6.1%); F : 13( 4.7%) G : 13( 4.7%); H : 6( 2.2%); I : 11( 4.0%); K : 20( 7.2%); L : 25( 9.0%) M+ : 13( 4.7%); N : 13( 4.7%); P : 14( 5.0%); Q : 9( 3.2%); R : 21( 7.6%) S : 22( 7.9%); T : 11( 4.0%); V : 21( 7.6%); W : 0( 0.0%); Y : 9( 3.2%) KR : 41 ( 14.7%); ED : 32 ( 11.5%); AGP : 45 ( 16.2%); KRED : 73 ( 26.3%); KR-ED : 9 ( 3.2%); FIKMNY : 79 ( 28.4%); LVIFM : 83 ( 29.9%); ST : 33 ( 11.9%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 0000000000 +000+00000 00000000+0 00+0000000 +00+00000+ 0000+0--00 61 0000--00++ 0000+0-000 0-00-00000 00000++000 000-000000 00-0+-00-- 121 0++00+000- 0+000+0+0+ -00-000000 00+0-00-00 +00000-0-0 0000--0000 181 0-000++000 0--0000000 00+000-000 000++00000 0-0000+000 0++0+-0-00 241 0000000++0 0000-0+00+ 0+00+-000+ -+-+0000 A. CHARGE CLUSTERS. Positive charge clusters (cmin = 12/30 or 16/45 or 19/60): none Negative charge clusters (cmin = 10/30 or 13/45 or 16/60): none Mixed charge clusters (cmin = 17/30 or 23/45 or 29/60): none B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 5 | 5 | 7 | 29 | 10 | 9 | 13 | 11 | 10 | 15 | 6 | 8 | lmin1 7 | 6 | 9 | 35 | 12 | 11 | 16 | 14 | 13 | 18 | 8 | 10 | lmin2 8 | 7 | 11 | 39 | 13 | 12 | 17 | 16 | 14 | 20 | 9 | 11 | (Significance level: 0.010000; Minimal displayed length: 6) There are no charge runs or patterns exceeding the given minimal lengths. Run count statistics: + runs >= 4: 0 - runs >= 3: 0 * runs >= 5: 1, at 270; 0 runs >= 19: 0 -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. There are no high scoring hydrophobic segments. There are no high scoring transmembrane segments. 2. SPACINGS OF C. H2N-21-C-111-C-15-C-9-C-17-C-51-C-44-C-3-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-21-C-45-H-21-H-43-C-15-C-5-H-3-C-17-C-27-H-6-H-11-H-4-C-44-C-3-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 4 B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 8 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 12 (Expected range: 3-- 27) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 3 (6-10) 2 (11-20) 3 (>=21) 5 3. Clusters of amino acid multiplets (cmin = 9/30 or 11/45 or 13/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 12 (Expected range: 0-- 17) 7 +plets (f+: 14.7%), 5 -plets (f-: 11.5%) Total number of charge altplets: 5 (Critical number: 20) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 3 (6-10) 2 (11-20) 2 (>=21) 6 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors There are no periodicities of the prescribed length. B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 105- 110 1 i 6 6 0 -------------------------------------------------------------------------------- SPACING ANALYSIS. There are no unusual spacings. ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/Pfam Sequence file: tem2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- ras Ras family 119.7 5.6e-32 1 Flu_M1 Influenza Matrix protein (M1) -1.2 99 1 arf ADP-ribosylation factor family -106.3 0.76 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Flu_M1 1/1 34 52 .. 226 244 .] -1.2 99 arf 1/1 14 205 .. 1 191 [] -106.3 0.76 ras 1/1 33 278 .] 1 198 [] 119.7 5.6e-32 Alignments of top-scoring domains: Flu_M1: domain 1 of 1, from 34 to 52: score -1.2, E = 99 *->lEvLkayskrnsvlvkkyk<-* + vL a+ + s v +++ gi|9857402 34 MVVLGASRVGKSSIVSRFL 52 arf: domain 1 of 1, from 14 to 205: score -106.3, E = 0.76 *->GlgflkiflkllglwnKEmRILiLGLDnAGKTTILykLklgei.vtt + ++++ l + + ++R+++LG GK+ I+ ++ +g+++++ gi|9857402 14 MKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFeDQY 60 iPTi...GFNvEtveykNikFtvWDvGGQe...slRPlWrnYfpnTdavI +PTi++ + v + + + + D G ++ + R+l + ++ +++ gi|9857402 61 TPTIedfHRKVYNIRGDMYQLDILDTSGNHpfpAMRRL-SILTGDVFILV 109 fVVDSaDreRieeakeELhaLLneeeL........adAplLifANKQDlp f D re ++e k+ + +L+ + ++++++ a+ p +i +NK D gi|9857402 110 FSLD--NRESFDEVKRLQKQILEVKSClknktkeaAELPMVICGNKNDHG 157 gAmseaEireaLgLhelkgskgkVtlegdRpWeiqgcsAvkGeGLyEGld + +ea el s gd + + sA k ++++E + gi|9857402 158 ELCRQVPTTEA----ELLVS-------GDENCAYFEVSAKKNTNVDEMFY 196 WLsnnikkr<-* L + k + gi|9857402 197 VLFSMAKLP 205 ras: domain 1 of 1, from 33 to 278: score 119.7, E = 5.6e-32 *->KlVliGDsGVGKSsLliRFtdnkFveeYipTIGvDFytktvevDGkt V++G s VGKSs++ RF +++F ++Y pTI +DF++k ++ G gi|9857402 33 RMVVLGASRVGKSSIVSRFLNGRFEDQYTPTI-EDFHRKVYNIRGDM 78 vkLqIWDTAGQERFralrpaYYRgAqGfLLVYDITsrdSFenvkkWleei + L+I+DT G F a+r+++ ++++f+LV++ ++r+SF++vk+ ++i gi|9857402 79 YQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQI 128 lrhad........kdenvpivLVGNKcDLeddedleltegqkRvVsteeG l++++ ++++++ e p+v++GNK D + R+V+t e+ gi|9857402 129 LEVKSclknktkeAAE-LPMVICGNKNDHGE--------LC-RQVPTTEA 168 eal.AkelgAlpFmETSAKtntNVeeaFeelareilkkvse......... e l + + + ++E SAK ntNV+e F+ l ++ + + ++ +++ + gi|9857402 169 ELLvSGDEN-CAYFEVSAKKNTNVDEMFYVLFSMAKLPHEMspalhrkis 217 ............................................vnvnld + ++ ++++ ++ ++ + + ++ ++++ +++ + + ++++ gi|9857402 218 vqygdafhprpfcmrrvkemdaygmvspfarrpsvnsdlkyikaKVLREG 267 qpakkkkskCcil<-* q + + kC i gi|9857402 268 QARERD--KCTIQ 278 // Start with PfamFrag (from /data/patterns/pfam) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/pfam/PfamFrag Sequence file: tem2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- ras Ras family 130.0 2e-37 1 ELH Egg-laying hormone precursor 1.7 5.1 1 Flu_M1 Influenza Matrix protein (M1) -1.2 99 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Flu_M1 1/1 34 52 .. 226 244 .] -1.2 99 ras 1/1 33 198 .. 1 169 [. 130.0 2e-37 ELH 1/1 244 275 .. 200 230 .. 1.7 5.1 Alignments of top-scoring domains: Flu_M1: domain 1 of 1, from 34 to 52: score -1.2, E = 99 *->lEvLkayskrnsvlvkkyk<-* + vL a+ + s v +++ gi|9857402 34 MVVLGASRVGKSSIVSRFL 52 ras: domain 1 of 1, from 33 to 198: score 130.0, E = 2e-37 *->KlVliGDsGVGKSsLliRFtdnkFveeYipTIGvDFytktvevDGkt V++G s VGKSs++ RF +++F ++Y pTI +DF++k ++ G gi|9857402 33 RMVVLGASRVGKSSIVSRFLNGRFEDQYTPTI-EDFHRKVYNIRGDM 78 vkLqIWDTAGQERFralrpaYYRgAqGfLLVYDITsrdSFenvkkWleei + L+I+DT G F a+r+++ ++++f+LV++ ++r+SF++vk+ ++i gi|9857402 79 YQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQI 128 lrhad........kdenvpivLVGNKcDLeddedleltegqkRvVsteeG l++++ ++++++ e p+v++GNK D + R+V+t e+ gi|9857402 129 LEVKSclknktkeAAE-LPMVICGNKNDHGE--------LC-RQVPTTEA 168 eal.AkelgAlpFmETSAKtntNVeeaFeel<-* e l + + + ++E SAK ntNV+e F+ l gi|9857402 169 ELLvSGDEN-CAYFEVSAKKNTNVDEMFYVL 198 ELH: domain 1 of 1, from 244 to 275: score 1.7, E = 5.1 *->sGisKRisinqDLKAitDmLltE.qiqeRery<-* s ++R s+n DLK i +l E+q +eR ++ gi|9857402 244 SPFARRPSVNSDLKYIKAKVLREgQARERDKC 275 // Start with Repeat Library (from /data/patterns/repeats-Miguel-Andrade/hmm) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/repeats-Miguel-Andrade/hmm/repeats.hmm-lib Sequence file: tem2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Fri Oct 13 19:39:03 2000 Sequence file: tem2 ---------------------------------------- Sequence gi|9857402|gb|AAG00868.1|AF279143_1 (278 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 21: NCTL 137: NKTK Total matches: 2 Matching pattern PS00004 CAMP_PHOSPHO_SITE: 96: RRLS 186: KKNT 214: RKIS 248: RRPS Total matches: 4 Matching pattern PS00005 PKC_PHOSPHO_SITE: 31: SYR 184: SAK Total matches: 2 Matching pattern PS00006 CK2_PHOSPHO_SITE: 63: TIED 117: SFDE 166: TEAE 173: SGDE 189: TNVD Total matches: 5 Matching pattern PS00007 TYR_PHOSPHO_SITE: 232: RRVKEMDAY Total matches: 1 Matching pattern PS00294 PRENYLATION: 275: CTIQ Total matches: 1 Matching pattern PS00016 RGD: 75: RGD Total matches: 1 Matching pattern PS00017 ATP_GTP_A: 38: GASRVGKS Total matches: 1 Total no of hits in this sequence: 17 ======================================== 1314 pattern(s) searched in 1 sequence(s), 278 residues. Total no of hits in all sequences: 17. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** argv[1]=P argv[2]=-m /data/patterns/own/motif.fa argv[4]=-seq tem2 ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 278 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm.lib Sequence file: tem2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/own/own-hmm-f.lib Sequence file: tem2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] (278 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value SGTP Small GTPAses 151 8e-40 CATH Cathepsin like protease domain 26 0.083 LAMNT Lamin N-terminal domain 25 0.10 14-3-3 14-3-3 protein alpha Helical domain 21 1.6 RASGAP RAS-type GTPase GTP hydrolysis activating protein 21 1.7 CBS cystathionine beta -synthase domain (A predicted ligand... 21 2.3 CALC Calcineurin like Phosphoesterase domain 20 5.2 PDE cyclic NMP phosphodiesterase domain 19 7.5 LRR Leucine rich repeats 19 9.2 RHOGEF RHO-type GTPase GDP exchange factor 19 9.7 >SGTP Small GTPAses Length = 164 Score = 151 bits (380), Expect = 8e-40 Identities = 40/168 (23%), Positives = 77/168 (45%), Gaps = 12/168 (7%) Query: 32 YRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIE-DFHRKVYNIRGDMYQLDILDTSGNH 90 ++ +V+G + GKS ++ +F+ +F+D TI +F K+ N+ G +L I DT+G Sbjct: 1 FKFLVIGNAGTGKSCLLHQFIEKKFKDDSNHTIGVEFGSKIINVGGKYVKLQIWDTAGQE 60 Query: 91 PFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVIC 150 F ++ R +LV+ + +RE+++ + + + + +++C Sbjct: 61 RFRSVTRSYYRGAAGALLVYDITSRETYNALTNWLTDA---------RMLASQNIVIILC 111 Query: 151 GNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 GNK D R+V EA + + E SA +V+E F Sbjct: 112 GNKKDLDA-DREVTFLEASRFAQ-ENELMFLETSALTGEDVEEAFVQC 157 >CATH Cathepsin like protease domain Length = 371 Score = 25.7 bits (56), Expect = 0.083 Identities = 18/69 (26%), Positives = 23/69 (33%), Gaps = 15/69 (21%) Query: 62 PTIE-DFHRKVYNIRGDMYQLDILDTSGNH---PFPAMR------RLSILTGDVFI---- 107 P + K Y + + Y L + F + L IL GDVFI Sbjct: 294 PNVNLVLGGKSYALTPNQYVLKVTVQGETLCLSGFSGLDVPPPAGPLWIL-GDVFIGSYY 352 Query: 108 LVFSLDNRE 116 VF DN Sbjct: 353 TVFDRDNDA 361 >LAMNT Lamin N-terminal domain Length = 256 Score = 25.3 bits (55), Expect = 0.10 Identities = 20/79 (25%), Positives = 28/79 (35%), Gaps = 5/79 (6%) Query: 193 EMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHP-RPFCMRRVKEMDAYGMVS--PFARR 249 + YV MA P S L R S +G + P F + M +GM S P + Sbjct: 127 HVAYVWIQMANSPRPGSWVLER--STDHGKTYQPWFNFAENAAECMRRFGMESLSPISED 184 Query: 250 PSVNSDLKYIKAKVLREGQ 268 SV + L + Sbjct: 185 DSVTCRTDMASLQPLENAE 203 Score = 19.5 bits (40), Expect = 6.1 Identities = 7/32 (21%), Positives = 15/32 (46%), Gaps = 4/32 (12%) Query: 47 IVSRFLNGRF---EDQYTPTIEDFHRKVYNIR 75 +V R L R + + +++F + N+R Sbjct: 204 MVIRILEHRPSSRQFATSEALQNF-TRATNVR 234 >14-3-3 14-3-3 protein alpha Helical domain Length = 270 Score = 21.5 bits (45), Expect = 1.6 Identities = 10/45 (22%), Positives = 19/45 (42%) Query: 176 ENCAYFEVSAKKNTNVDEMFYVLFSMAKLPHEMSPALHRKISVQY 220 E+ Y A++ + M + S+A E++ +SV Y Sbjct: 7 EDAVYLAKLAEQAERYEGMVENMKSVASTDQELTVEERNLLSVAY 51 >RASGAP RAS-type GTPase GTP hydrolysis activating protein Length = 292 Score = 21.3 bits (44), Expect = 1.7 Identities = 11/32 (34%), Positives = 21/32 (65%), Gaps = 3/32 (9%) Query: 82 DILDTSGNHPFPAMR--RLSILTGDVFILVFS 111 D+ + +G H FP+ R R S+++G +F+ F+ Sbjct: 182 DLRECAGEH-FPSNREVRYSVVSGFIFLRFFA 212 >CBS cystathionine beta -synthase domain (A predicted ligand binding domain) Length = 214 Score = 20.7 bits (43), Expect = 2.3 Identities = 5/43 (11%), Positives = 13/43 (29%) Query: 48 VSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNH 90 V + L +E ++ H + +I + + Sbjct: 117 VVKTLAKLYEIPVYKIFKELHNHIGDISWEELMEAAAVVTKRM 159 >CALC Calcineurin like Phosphoesterase domain Length = 274 Score = 19.8 bits (40), Expect = 5.2 Identities = 11/65 (16%), Positives = 21/65 (31%) Query: 28 AKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTS 87 Y +VVL + +S N R + + F + G ++Q + + Sbjct: 153 NPERYTLVVLHHHLLPTNSAWLDQHNLRNSHELAEVLAPFTNVKAILYGHIHQEVNSEWN 212 Query: 88 GNHPF 92 G Sbjct: 213 GYQVM 217 >PDE cyclic NMP phosphodiesterase domain Length = 350 Score = 18.9 bits (38), Expect = 7.5 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 82 DILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQ 127 D +D + F +M + L F+L NRE++ LQK+ Sbjct: 272 DFIDFVASPLFQSMAK--FLKESQFLLKVISKNRENWQAYMELQKE 315 >LRR Leucine rich repeats Length = 339 Score = 18.8 bits (38), Expect = 9.2 Identities = 10/50 (20%), Positives = 19/50 (38%), Gaps = 12/50 (24%) Query: 76 GDMYQLDILDTSGNH-------PFPAMRRLSILTGDVFILVFSLDNRESF 118 + L +LD N P A ++L+ + FSL+++ Sbjct: 289 HLLPDLKVLDLHENENLVMPPKPNDARKKLAFYN-----IDFSLEHQRKI 333 >RHOGEF RHO-type GTPase GDP exchange factor Length = 176 Score = 18.7 bits (38), Expect = 9.7 Identities = 7/24 (29%), Positives = 9/24 (37%) Query: 112 LDNRESFDEVKRLQKQILEVKSCL 135 F E +R K L + S L Sbjct: 107 YSECAFFQECQRKLKHRLRLDSYL 130 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 10 Number of calls to ALIGN: 11 Length of query: 278 Total length of test sequences: 20182 Effective length of test sequences: 16536.0 Effective search space size: 4025149.7 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= gi|9857402|gb|AAG00868.1|AF279143_1 tumor endothelial marker 2 [Homo sapiens] (278 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|1572819 [7..164] P-loop containing nucleotide triphosphat... 104 2e-24 gi|1531539 [70..226] P-loop containing nucleotide triphospha... 98 1e-22 gi|1086887 [26..189] P-loop containing nucleotide triphospha... 98 2e-22 gi|1666073 [6..176] P-loop containing nucleotide triphosphat... 95 8e-22 gi|4006 [276..439] P-loop containing nucleotide triphosphate... 90 3e-20 gi|2814284 [1..224] P-loop containing nucleotide triphosphat... 90 6e-20 gi|1301656 [50..208] P-loop containing nucleotide triphospha... 88 2e-19 gi|2132165 [3..190] P-loop containing nucleotide triphosphat... 86 9e-19 gi|1695241 [169..336] P-loop containing nucleotide triphosph... 82 1e-17 gi|232142 [190..392] P-loop containing nucleotide triphospha... 79 6e-17 gi|729400 [38..325] P-loop containing nucleotide triphosphat... 73 7e-15 gi|2136468 [19..195] Lipocalins 31 0.025 gi|1515341 [77..406] Acid proteases 28 0.13 gi|2496715 [53..359] NAD(P)-binding Rossmann-fold domains 25 1.2 gi|2144317 [224..416] Lactate & malate dehydrogenases, C-ter... 25 1.7 gi|417211 [2..187] P-loop containing nucleotide triphosphate... 25 1.8 gi|2652959 [169..353] P-loop containing nucleotide triphosph... 23 4.2 gi|1175188 [41..508] Periplasmic binding protein-like II 23 4.4 gi|1723793 [7..233] NAD(P)-binding Rossmann-fold domains 23 5.7 gi|345362 [61..347] Protein kinases (PK), catalytic core 23 5.7 gi|1174715 [19..356] Thiamin-binding 23 5.8 gi|134579 [59..285] P-loop containing nucleotide triphosphat... 23 5.9 gi|120586 [1..210] Class II aldolase 23 6.3 gi|2392193 [1..232] Zn metallo-beta-lactamase 23 6.7 gi|2493523 [117..286] Interleukin-1beta converting enzyme (a... 23 7.0 gi|127802 [1..266] Glucosamine 6-phosphate deaminase 23 7.1 gi|417569 [3..336] Aspartate/ornithine carbamoyltransferase,... 23 7.4 gi|1346784 [8..238] N-terminal nucleophile aminohydrolases (... 22 8.2 gi|280504 [161..330] Glyceraldehyde-3-phosphate dehydrogenas... 22 9.2 gi|1515148 [263..496] Protein kinases (PK), catalytic core 22 9.2 gi|2130121 [317..446] Cyclin-like 23 9.3 gi|1673470 [12..233] Protein kinases (PK), catalytic core 22 9.6 gi|1322391 [16..428] Acid proteases 22 9.9 >gi|1572819 [7..164] P-loop containing nucleotide triphosphate hydrolases Length = 158 Score = 104 bits (258), Expect = 2e-24 Identities = 33/165 (20%), Positives = 64/165 (38%), Gaps = 10/165 (6%) Query: 32 YRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHP 91 R+V++G GK+S+V L + D + D ++ + I+D S Sbjct: 4 VRIVLIGDEGCGKTSLVMSLLEDEWVDAVPRRL-DRVLIPADVTPENVTTSIVDLSIKEE 62 Query: 92 FPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVICG 151 I +V +V+S+ + + D ++ ++ ++T P+++ G Sbjct: 63 DENWIVSEIRQANVICVVYSVTDESTVDGIQTKWLPLIRQSFGEYHET------PVILVG 116 Query: 152 NKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFY 196 NK+D + + E E SA+ NV E+FY Sbjct: 117 NKSDGTA--NNTDKILPIMEAN-TEVETCVECSARTMKNVSEIFY 158 >gi|1531539 [70..226] P-loop containing nucleotide triphosphate hydrolases Length = 157 Score = 98.1 bits (241), Expect = 1e-22 Identities = 44/168 (26%), Positives = 73/168 (43%), Gaps = 13/168 (7%) Query: 32 YRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHP 91 R+ VLG +R GKSS++ RFL G ++ T + ++K + G + + I + Sbjct: 2 LRLGVLGDARSGKSSLIHRFLTGSYQVLE-KTESEQYKKEMLVDGQTHLVLIREE----- 55 Query: 92 FPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVICG 151 A D I VFSL++ SF V RL Q+ ++ + L + + G Sbjct: 56 AGAPDAKFSGWADAVIFVFSLEDENSFQAVSRLHGQLSSLRG------EGRGGLALALVG 109 Query: 152 NKND-HGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 ++ R V A L + + C+Y+E A NVD +F + Sbjct: 110 TQDRISASSPRVVGDARARALCADMKRCSYYETCATYGLNVDRVFQEV 157 >gi|1086887 [26..189] P-loop containing nucleotide triphosphate hydrolases Length = 164 Score = 98.1 bits (241), Expect = 2e-22 Identities = 33/175 (18%), Positives = 66/175 (36%), Gaps = 14/175 (8%) Query: 33 RMVVLGASRVGKSSIVSRFLNGRFEDQYTPTI-EDFHRKVYNIRGDMYQLDILDTSGNHP 91 +++V+GA GKS+ + R G+F ++ K + +L D G Sbjct: 1 KIIVVGAKNAGKSTFIERVEFGKFHEEKQTEKLYRVIAKKVFEQTLTVELIEKDLGGLTN 60 Query: 92 -FPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMVIC 150 ++ + D +L ++ D+ ESF ++K + + + Sbjct: 61 EDNGFKKTEMRDVDAVLLFYAADDLESFKQLKENLVHVQRKIP---------PNANITVV 111 Query: 151 GNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFSMAKLP 205 G K D E QV E + + ++ + FE S+K NV+ + + + Sbjct: 112 GTKADVKE--MQVQWQEVDSF-AENQGFSCFETSSKTGVNVEIILHDILETIFER 163 >gi|1666073 [6..176] P-loop containing nucleotide triphosphate hydrolases Length = 171 Score = 95.5 bits (234), Expect = 8e-22 Identities = 53/176 (30%), Positives = 81/176 (45%), Gaps = 17/176 (9%) Query: 33 RMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRK----------VYNIRGDMYQLD 82 R+ VLGA VGK++I+ +FL G + +++ PT + +IR Sbjct: 1 RVAVLGAPGVGKTAIIRQFLFGDYPERHRPTDGPRLYRPAVLLDGAVYDLSIRDGDVAGP 60 Query: 83 ILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEA 142 G +P + S+ D F+LV+ + + +SFD VK L+++I E + A Sbjct: 61 GSSPGGPEEWPDAKDWSLQDTDAFVLVYDICSPDSFDYVKALRQRIAETRP------AGA 114 Query: 143 AELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 E P+++ GNK D L R P LV C Y E SAK N +V +F L Sbjct: 115 PEAPILVVGNKRDRQRL-RFGPRRALAALVRRGWRCGYLECSAKYNWHVLRLFREL 169 >gi|4006 [276..439] P-loop containing nucleotide triphosphate hydrolases Length = 164 Score = 90.4 bits (221), Expect = 3e-20 Identities = 31/167 (18%), Positives = 57/167 (33%), Gaps = 4/167 (2%) Query: 33 RMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPF 92 ++V+LGA VGKSS+V+ N I + Y++ I DT+G Sbjct: 1 KLVLLGAPNVGKSSLVNSLTNDDISIVSDIPGTTRDSIDAMINVNGYKVIICDTAGIREK 60 Query: 93 PAMRRLSILTGDVFILVFSLDNRE-SFDEVKRLQKQILEVKSCLKNKTKEAAELPMVICG 151 + + + D D + ++ + L +KT ++I Sbjct: 61 SSDKIEMLGIDRAKKKSVQSDLCLFIVDPTDLSKLLPEDILAHLSSKTFG--NKRIIIVV 118 Query: 152 NKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 NK+D ++ +L VS K ++ + L Sbjct: 119 NKSDLVS-DDEMTKVLNKLQTRLGSKYPILSVSCKTKEGIESLISTL 164 >gi|2814284 [1..224] P-loop containing nucleotide triphosphate hydrolases Length = 224 Score = 89.6 bits (219), Expect = 6e-20 Identities = 32/183 (17%), Positives = 64/183 (34%), Gaps = 22/183 (12%) Query: 29 KNSYRMVVLGASRVGKSSIVSRFLNGRFEDQY--TPTIEDFHRKVYNIRGDMYQLDILDT 86 + + + + + LN F+ + + + D + + G L + DT Sbjct: 49 RKNLKKNLKKNLSILFYLKNHEKLNSFFKKRQLLSVNVFDTYSTNVMVDGRPINLSLWDT 108 Query: 87 SGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELP 146 +G + R LS DVF++ F+L+N SF+ V+ + P Sbjct: 109 AGQDDYDQFRHLSFPQTDVFLVCFALNNPASFENVRAKWYPEVSHHC---------PNTP 159 Query: 147 MVICGNKND-----------HGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMF 195 +++ G K D + V T+ ++ + Y E SA + ++F Sbjct: 160 IILVGTKADLREDRDTIERLRERRLQPVSHTQGYVMAKEIKAVKYLECSALTQIGLKQVF 219 Query: 196 YVL 198 Sbjct: 220 DEA 222 >gi|1301656 [50..208] P-loop containing nucleotide triphosphate hydrolases Length = 159 Score = 87.6 bits (214), Expect = 2e-19 Identities = 37/167 (22%), Positives = 69/167 (41%), Gaps = 15/167 (8%) Query: 32 YRMVVLGASRVGKSSIVSRFLNGRF---EDQYTPTIEDFHRKVYNIRGDMYQLDILDTSG 88 R+VV+G +VGK++I+ + + + + + L + DT+G Sbjct: 3 MRVVVVGGKKVGKTAILRQVACVEDVTNKPYEPTIEDTYQVLLEEPDKAREILILHDTAG 62 Query: 89 NHPFPA--MRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELP 146 + +++ + D F+LV+S + ESF+ V L+K I K + P Sbjct: 63 VSNYGPIELKKAYVQAADAFVLVYSSADYESFNRVDLLKKWIDRQFGKDKKEV------P 116 Query: 147 MVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDE 193 +V+ N D V + A + E FEV+AK ++ + Sbjct: 117 IVVLANMRD---RPATVDSAFAHSWAA-REKVKLFEVTAKDRQSLVD 159 >gi|2132165 [3..190] P-loop containing nucleotide triphosphate hydrolases Length = 188 Score = 85.7 bits (209), Expect = 9e-19 Identities = 22/176 (12%), Positives = 44/176 (24%), Gaps = 14/176 (7%) Query: 29 KNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIE---DFHRKVYNIRGDMYQLDILD 85 K Y +++LG GK++ + L D Sbjct: 13 KEQYSILILGLDNAGKTTFLETLKKEYSLAFKALEKIQPTVGQNVATIPVDSKQILKFWD 72 Query: 86 TSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAEL 145 G +M I + +RE DE + ++ + + Sbjct: 73 VGGQESLRSMWSEYYSLCHGIIFIVDSSDRERLDECSTTLQSVVMDEEI--------EGV 124 Query: 146 PMVICGNKNDHGELCR---QVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 P+++ NK D + + +SA V + + Sbjct: 125 PILMLANKQDRQDRMEVQDIKEVFNKIAEHISARDSRVLPISALTGEGVKDAIEWM 180 >gi|1695241 [169..336] P-loop containing nucleotide triphosphate hydrolases Length = 168 Score = 81.8 bits (199), Expect = 1e-17 Identities = 27/170 (15%), Positives = 59/170 (33%), Gaps = 13/170 (7%) Query: 32 YRMVVLGASRVGKSSIVSRFLNGRF---EDQYTPTIEDFHRKVYNIRGDMYQLDILDTSG 88 + + GA GK S+ +R + ++ P + + Sbjct: 4 VVLRIYGARNCGKKSLANRIHHFATSMAPERVDPDENGNDYTKMTTF--LLNGREVTLEI 61 Query: 89 NHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAELPMV 148 + ++I+++++DNR+SF ++ ++I + Sbjct: 62 LLESTLENSPFRQSKTLYIVMYNMDNRQSFIYATQILERITLANLNNPVPL------QLF 115 Query: 149 ICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVL 198 + GNK D + + T E + + C + EVSA N +E + + Sbjct: 116 LIGNKCDLKRN-QVISTNEGKSVART-FKCDFLEVSALLGMNTEETWTTI 163 >gi|232142 [190..392] P-loop containing nucleotide triphosphate hydrolases Length = 203 Score = 79.3 bits (193), Expect = 6e-17 Identities = 14/144 (9%), Positives = 34/144 (22%), Gaps = 4/144 (2%) Query: 70 KVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQIL 129 + D + D G I V + + ++ Sbjct: 4 FETKFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQ 63 Query: 130 EVKSCLKN--KTKEAAELPMVICGNK-NDHGELCRQVPTTEAELLVSGDENCAYFEVSAK 186 E + K+ + + +++ NK + E + ++ S Sbjct: 64 EALNLFKSIWNNRWLRTISVILFLNKQDLLAEKVLAGKSKIELFVLDDR-LFQERPFSFI 122 Query: 187 KNTNVDEMFYVLFSMAKLPHEMSP 210 ++ + Y A P Sbjct: 123 EDYFPEFARYTTPEDATPEPGEDP 146 >gi|729400 [38..325] P-loop containing nucleotide triphosphate hydrolases Length = 288 Score = 72.6 bits (175), Expect = 7e-15 Identities = 21/219 (9%), Positives = 56/219 (24%), Gaps = 35/219 (15%) Query: 29 KNSYRMVVLGASRVGKSSIVSRFLNGR------------------FEDQYTPTIEDFHRK 70 + ++ GK++ R L E + + I Sbjct: 2 SKVRNIGIIAHIDAGKTTTTERMLYYAGISKHIGDVDTGDTITDFLEQERSRGITIQSAA 61 Query: 71 VYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILE 130 + + + ++++DT G+ F ++ D +++ K ++ + Sbjct: 62 ISFPWRNTFAINLIDTPGHIDFTFEVIRALKVIDSCVVILDAVAGVEAQTEKVWKQSKSK 121 Query: 131 VKSCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTN 190 K C NK + + + K+ T+ Sbjct: 122 PKICFINK----------MDRMGASFNHTVNDLINKFMRGTTTK-PVLVNIPYYRKQPTS 170 Query: 191 VDEMFYVL------FSMAKLPHEMSPALHRKISVQYGDA 223 D +F + + P + ++ + Sbjct: 171 NDYVFQGVIDVVNGKRLTWNPENPDEIIVDELDGTSLEQ 209 >gi|2136468 [19..195] Lipocalins Length = 177 Score = 30.8 bits (69), Expect = 0.025 Identities = 19/124 (15%), Positives = 31/124 (24%), Gaps = 15/124 (12%) Query: 153 KNDHGELCRQVP---TTEAELLVS---GDENCAYFEVSAKKNTNVDEMFYVLFSMAKLPH 206 N E + A+ V + CA + T F + + Sbjct: 39 DNIVAEFSVDEYGQMSATAKGRVRLLNNWDVCADMVGTFTD-TEDPAKFKMKYWGVASFL 97 Query: 207 EMSPALHRKISVQYGD-AFHPRPFCMRRVKEMDAYG---MVSPFARRPSVNSDLKYIKAK 262 + H I Y A R+ +D FAR P+ + Sbjct: 98 QKGNDDHWIIDTDYDTYAV----QYSCRLLNLDGTCADSYSFVFARDPNGFPPEVQRIVR 153 Query: 263 VLRE 266 +E Sbjct: 154 RRQE 157 >gi|1515341 [77..406] Acid proteases Length = 330 Score = 28.4 bits (63), Expect = 0.13 Identities = 7/61 (11%), Positives = 11/61 (17%), Gaps = 8/61 (13%) Query: 62 PTIEDFH--RKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFI----LVFSLDNR 115 T+ +I + T IL G F + Sbjct: 266 ITLTFGFVSGTTVSITERDLVISFFGTCRL-QIIPTTDRIIL-GLPLYRGRCTYFDPIMQ 323 Query: 116 E 116 Sbjct: 324 R 324 >gi|2496715 [53..359] NAD(P)-binding Rossmann-fold domains Length = 307 Score = 25.4 bits (54), Expect = 1.2 Identities = 12/88 (13%), Positives = 27/88 (30%), Gaps = 2/88 (2%) Query: 144 ELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVSAKKNTNVDEMFYVLFSM-- 201 L + + G N+ + A + S + A + + A + + E+ V+ Sbjct: 179 GLDIPVLGGGNNKYQFVHSDDLASACIAASNVKGFATYNIGAAEFGTMRELLQVVIKHAE 238 Query: 202 AKLPHEMSPALHRKISVQYGDAFHPRPF 229 + P ++ A P Sbjct: 239 TGSRIKSIPMGPTALAANLASALGLSPL 266 >gi|2144317 [224..416] Lactate & malate dehydrogenases, C-terminal domain Length = 193 Score = 24.7 bits (54), Expect = 1.7 Identities = 14/98 (14%), Positives = 29/98 (29%), Gaps = 9/98 (9%) Query: 46 SIVSRFLN--GRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTG 103 VS L + Y ++D N +L + A+ G Sbjct: 99 QFVSLLLGNIEQIHGTYYVPLKDA----NNFPIAPGADQLLPLVDGADYFAIPLTITTKG 154 Query: 104 DVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKE 141 ++ ++ + + L + ++K KN K Sbjct: 155 VSYVDYDIVNRMNDMERNQMLPICVSQLK---KNIDKG 189 >gi|417211 [2..187] P-loop containing nucleotide triphosphate hydrolases Length = 186 Score = 24.8 bits (54), Expect = 1.8 Identities = 12/48 (25%), Positives = 21/48 (43%), Gaps = 10/48 (20%) Query: 35 VVLGASRVGKSSIVSRFLNGRFEDQY---------TPTIEDFHRKVYN 73 V+ G S GKS+++ + + D + TP + + K YN Sbjct: 5 VISGPSGTGKSTLLKKLFA-EYPDSFGFSVSSTTRTPRAGEVNGKDYN 51 >gi|2652959 [169..353] P-loop containing nucleotide triphosphate hydrolases Length = 185 Score = 23.4 bits (49), Expect = 4.2 Identities = 10/109 (9%), Positives = 27/109 (24%), Gaps = 6/109 (5%) Query: 34 MVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFP 93 ++ +G GK+ + D I + G N + Sbjct: 4 ILFMGGPGGGKTRHA-ARVADSLADNGLVHICM-PDIIRTALGKYKDKYPEWKEANEHYI 61 Query: 94 AMRRLSILTGDVFILVFSLDNR----ESFDEVKRLQKQILEVKSCLKNK 138 + + + + R +Q+ + + +K+ Sbjct: 62 RGELIPNQLALTLLKAEMGRHPDAMGFFLEGYPREARQVEDFERQVKSV 110 >gi|1175188 [41..508] Periplasmic binding protein-like II Length = 468 Score = 23.4 bits (50), Expect = 4.4 Identities = 14/120 (11%), Positives = 33/120 (26%), Gaps = 10/120 (8%) Query: 7 LLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGA----SRVGKSSIVSRFLNGRFEDQYTP 62 L + + ++ + L AK++ + GA + I++ N + + Sbjct: 142 LDKASPYLPSMLAHVSLLPHYAKSTEIFISNGAYQLQRQAENQHILTT--NPYYWAKEKV 199 Query: 63 TIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVK 122 + + Y L D N + F+L + + Sbjct: 200 IFQ---QVKYQKISVDADLSDFDVVMNPKKVNQNIQDYPQLCTYFYEFNL-SDPVLQKSA 255 >gi|1723793 [7..233] NAD(P)-binding Rossmann-fold domains Length = 227 Score = 23.1 bits (48), Expect = 5.7 Identities = 10/137 (7%), Positives = 30/137 (21%), Gaps = 10/137 (7%) Query: 121 VKRLQKQILEVKSCLKNKTKEAAELPM----------VICGNKNDHGELCRQVPTTEAEL 170 +++ + + + +L G+ ++ + ++A + Sbjct: 15 IQQFFDINPKPDIHIFDVRDLPEKLSKQFTFNVDDIKFHKGDLTSPDDMENAINESKANV 74 Query: 171 LVSGDENCAYFEVSAKKNTNVDEMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFC 230 +V NV V+ K + + G H Sbjct: 75 VVHCASPMHGQNPDIYDIVNVKGTRNVIDMCKKCGVNILVYTSSAGVIFNGQDVHNADET 134 Query: 231 MRRVKEMDAYGMVSPFA 247 + + Sbjct: 135 WPIPEVPMDAYNETKAI 151 >gi|345362 [61..347] Protein kinases (PK), catalytic core Length = 287 Score = 23.2 bits (48), Expect = 5.7 Identities = 6/80 (7%), Positives = 19/80 (23%), Gaps = 4/80 (5%) Query: 60 YTPTIEDFHRKVYNIRGDMYQLDIL---DTSGNHPFPAMRRLSILTGDVFILVFSLDNRE 116 + + D++ L +G PF L ++ + + Sbjct: 192 QNSHNWKTKSAYTSEQCDLWALGCTLYFCATGKFPFEHESNNKSLY-HKAVVALTQNPDA 250 Query: 117 SFDEVKRLQKQILEVKSCLK 136 + + + + Sbjct: 251 IAMVLVQKGRDPGSSTDMFE 270 >gi|1174715 [19..356] Thiamin-binding Length = 338 Score = 23.0 bits (49), Expect = 5.8 Identities = 9/31 (29%), Positives = 13/31 (41%) Query: 47 IVSRFLNGRFEDQYTPTIEDFHRKVYNIRGD 77 + SR+ G F+ P F +Y I D Sbjct: 129 MASRYERGLFDPDAEPGASPFDHYIYVIASD 159 >gi|134579 [59..285] P-loop containing nucleotide triphosphate hydrolases Length = 227 Score = 22.8 bits (47), Expect = 5.9 Identities = 13/124 (10%), Positives = 31/124 (24%), Gaps = 19/124 (15%) Query: 24 LSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDI 83 L +++ G GKS+++ + L + M L + Sbjct: 30 LGGGVVPGSAILIGGNPGAGKSTLLLQTLC-------------------KLAQQMKTLYV 70 Query: 84 LDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAA 143 L D ++ + + Q +++ + S + Sbjct: 71 TGEESLQQVAMRAHRLGLPTDNLNMLSETSIEQICLIAEEEQPKLMVIDSIQVMHMADVQ 130 Query: 144 ELPM 147 P Sbjct: 131 SSPG 134 >gi|120586 [1..210] Class II aldolase Length = 210 Score = 22.9 bits (49), Expect = 6.3 Identities = 16/67 (23%), Positives = 28/67 (40%), Gaps = 9/67 (13%) Query: 16 TLSSGNCTLSVPAKNSYRMVVLGA----SRVGKSSIVSRFLNGRFEDQYTPTIE-DFHRK 70 ++GN + + M++ ++ +S IV NG+ E+ P+ E FH Sbjct: 24 QGTAGNVS----VRYQDGMLITPTGIPYEKLTESHIVFIDGNGKHEEGKLPSSEWRFHMA 79 Query: 71 VYNIRGD 77 Y R D Sbjct: 80 AYQSRPD 86 >gi|2392193 [1..232] Zn metallo-beta-lactamase Length = 232 Score = 22.7 bits (48), Expect = 6.7 Identities = 5/62 (8%), Positives = 17/62 (27%), Gaps = 3/62 (4%) Query: 70 KVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQIL 129 + + +Y L + ++ L+ D + + + L + Sbjct: 12 SITQLSDKVYTYVSLAEIEGWGMVPSNGMIVINNHQAALL---DTPINDAQTEMLVNWVT 68 Query: 130 EV 131 + Sbjct: 69 DS 70 >gi|2493523 [117..286] Interleukin-1beta converting enzyme (a cysteine protease) Length = 170 Score = 22.9 bits (49), Expect = 7.0 Identities = 16/89 (17%), Positives = 31/89 (33%), Gaps = 3/89 (3%) Query: 117 SFDEVKRLQKQILEVKSCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDE 176 S +E K++Q + ++ + L ++IC N+ R + + + Sbjct: 2 SAEEFKKIQDTQGDKIYDVRKREGRKG-LALIIC-NEKFENLNERHGAKVDLDGMTKLLN 59 Query: 177 NCAYFEVSAKKNTNVDEMFYVLFSMAKLP 205 Y +V N EM V+ A Sbjct: 60 ELGY-QVHPHTNLTKTEMVKVMKEFAAQE 87 >gi|127802 [1..266] Glucosamine 6-phosphate deaminase Length = 266 Score = 22.6 bits (48), Expect = 7.1 Identities = 8/49 (16%), Positives = 16/49 (32%) Query: 23 TLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKV 71 T+ V V++ K+ + + G +T + H K Sbjct: 187 TVGVGTLLDAEEVMILVLGSQKALALQAAVEGCVNHMWTISCLQLHPKA 235 >gi|417569 [3..336] Aspartate/ornithine carbamoyltransferase, Catalytic-chain Length = 334 Score = 22.5 bits (48), Expect = 7.4 Identities = 9/52 (17%), Positives = 17/52 (32%), Gaps = 1/52 (1%) Query: 55 RFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVF 106 R +Q ++ R R + + +T HP P R + + Sbjct: 256 RMGEQVLKKQDELRRS-ITFRKEFIEKLPENTRFYHPLPRHRVHPTIPTFLD 306 >gi|1346784 [8..238] N-terminal nucleophile aminohydrolases (Ntn hydrolases) Length = 231 Score = 22.5 bits (48), Expect = 8.2 Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 4/45 (8%) Query: 7 LLQPRAMMKTLSSGNCTLSVPAKNSYRMVVLGA-SRVGKSSIVSR 50 L Q + + G+ + V ++ VVLG + R Sbjct: 9 LFQVEYAQEAVKKGSTAVGVRGRDI---VVLGVEKKSVAKLQDER 50 >gi|280504 [161..330] Glyceraldehyde-3-phosphate dehydrogenase-like, C-terminal domain Length = 170 Score = 22.5 bits (48), Expect = 9.2 Identities = 7/31 (22%), Positives = 17/31 (54%) Query: 97 RLSILTGDVFILVFSLDNRESFDEVKRLQKQ 127 R+ +TG + L +++ + + +EV K+ Sbjct: 83 RVPTITGSLVDLTVNVNAKVTKEEVNAALKK 113 >gi|1515148 [263..496] Protein kinases (PK), catalytic core Length = 234 Score = 22.3 bits (46), Expect = 9.2 Identities = 6/56 (10%), Positives = 17/56 (29%), Gaps = 3/56 (5%) Query: 49 SRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDIL---DTSGNHPFPAMRRLSIL 101 + + R ++ +++ L +L +G +PF + Sbjct: 140 FTRRSVCVPFMAPEMLTAESRTHHSYSTELWGLGVLLYILLTGKYPFHENSMPLLF 195 >gi|2130121 [317..446] Cyclin-like Length = 130 Score = 22.5 bits (48), Expect = 9.3 Identities = 12/69 (17%), Positives = 26/69 (37%), Gaps = 3/69 (4%) Query: 32 YRMVVLGASRVGKSSIV-SRFLNGRFEDQYTPTIEDF-HRKVYNIRGDMYQL-DILDTSG 88 Y ++ S + S+I +RF+ + + T+ + K + + L + Sbjct: 40 YSLLSYPPSLIAASAIFLARFILQPTKYPWNSTLAHYTQYKPSKLSECVKALHRLCSVGS 99 Query: 89 NHPFPAMRR 97 PA+R Sbjct: 100 GSNLPAIRE 108 >gi|1673470 [12..233] Protein kinases (PK), catalytic core Length = 222 Score = 22.4 bits (46), Expect = 9.6 Identities = 9/115 (7%), Positives = 23/115 (19%), Gaps = 1/115 (0%) Query: 47 IVSRFLNGRFEDQYTPTIEDFHRKVYNIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVF 106 + + R + T + + A+ Sbjct: 33 VKFVHGSQREKSFKTELSVFRKLDSLGNSKRNGFPQFCENFQVDGYYAIVLSDEGDNLND 92 Query: 107 ILVFSLDNRESFDEVKRLQKQILEVKSCLKNKTKEAAEL-PMVICGNKNDHGELC 160 I + + + V + + S L + A+L + Sbjct: 93 IRKRNKNLVFNRKNVATVGCAVSRSLSTLHSIGFFHADLHEQNLMVPYPCRKINI 147 >gi|1322391 [16..428] Acid proteases Length = 413 Score = 22.2 bits (47), Expect = 9.9 Identities = 15/73 (20%), Positives = 18/73 (24%), Gaps = 14/73 (19%) Query: 62 PTIE-DFHRKVYNIRGDMYQLDILDTS------GNHPFPAMRRLSILTGDVFI----LVF 110 T K Y I L+ S IL G FI + Sbjct: 343 VTFGMTIGGKQYKIERKNLVLEEDKDSCMIAMTPLSSVG-FGPQWIL-GAPFIRQYCNIH 400 Query: 111 SLDN-RESFDEVK 122 + N F E K Sbjct: 401 DMRNNTIGFAEPK 413 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 33 Number of calls to ALIGN: 33 Length of query: 278 Total length of test sequences: 256703 Effective length of test sequences: 207231.0 Effective search space size: 49097454.9 Initial X dropoff for ALIGN: 25.0 bits