The PTS1 predictor
The learning set
Metaozan LH set
Identifier
|
Sequence
|
Ce025#1 | CVWRLDWALGKL |
Ce033#1 | ILWRMATMSASM |
Ce036#1 | VAGQHRSLVARL |
Ce039#1 | WHWKCFEHVARL |
Ce051#1 | GEGAGFNQLAKL |
Ce096#1* | DPNEAMQWLARL |
Ce123#1 | LGGPAPEFWSSL |
Ce145#5 | QPGSTAMRMSKL |
Ce146#3 | AAPVRLVLGHKL |
Ce147#1 | GKATGRRLKAKL |
Ce148#2 | VVWPLHRLRASL |
Ce149#2 | DRKGWTHQPSKL |
Ce150#1 | AVSFLSMRRARL |
Ce152#3 | GLSDSFYRRSML |
Ce153#1 | LWTALLGRTCSL |
Ce154#1 | AGSAFSSCRSKL |
Ce155#2 | KQWTGKGGRSKL |
Ce157#1 | RERHSYMQRAKL |
Ce159#1 | NLEFWGMMRSKM |
Ce161#2 | YGVGRELRQAKL |
Ce167#1 | EGKIRTLLWCKL |
Ce168#1 | DGAHREAPASKL |
Ce169#1 | GFRGYMSGLCKL |
Ce171 | RKLMSEECRGKL |
Ce172 | MRREVVLLRAKI |
Ce173#1 | TCARFGRLRAKV |
Ce175#1 | RWRLGWSWRAML |
Ce176 | WCWRAPNLGSKL |
Ce179#2 | RTEPKLMLRASL |
Ce182#1 | RGGGPCQWKAKL |
Ce185#1 | GHAPSRPAKARL |
Ce186#1 | VSCRCRRLQSRL |
Ce187#1* | PDVRRCAGAAKL |
Ce188#1 | GSTGWWLLQAKL |
Ce190 | EHRRSSTSTAKL |
Ce191 | VYTVPGTPRCSL |
Ce192 | VYGGGVTRGHKL |
Ce193 | CHVGSTPWRAAL |
Ce194 | CALVGSHIRSRL |
Ce195 | QCVHAEVHLARL |
Ce196* | LVDPLITSRSML |
Ce199#1 | RSEEMARSPAKL |
Ce200#1 | RLEMARWPRFKM |
Ce201 | CQVRGACGMAKL |
Ce202 | RSGSMVELRARL |
Ce204#1 | NYPQTARVCAKL |
Ce205 | CVSDLGHIVHKL |
Ce208 | GEKFRDAWRARL |
CeD1#1 | IGRRMAPLRSSL |
CeF#1* | LELVDPGMRAKL |
Hs01 | IGLDWLVVISKL |
Hs02 | GEYGVGLLWSKL |
Hs03 | WGDPLIMPGSKL |
Hs04 | SLLALAVQCSKL |
Hs05 | RGMSWFSSNSKL |
Hs06 | ALYGWWALGSKL |
Hs07 | GNVGSAVTAAKL |
Hs08 | KLTVRWAWRAKL |
Hs09 | ERVSGRHPQAKL |
Hs10 | GVEVLVPGTAKL |
Hs11 | GPDPLMSILAKL |
Hs12* | PVKCVRKRLSRL |
Hs13 | ICDAIFQGGSRL |
Hs14* | PVSWNWMVRSRL |
Hs15 | IQTGGGRAISRL |
Hs16 | RWDIPLRWWSRL |
Hs17 | GAMGMLGGVSRL |
Hs18 | GEALLGGFLSRL |
Hs19 | PPNNIGCTASRL |
Hs20* | VDPIVKVPESRL |
Hs21 | LAGHAWKALSRL |
Hs22* | ELVDPEGLRSRL |
Hs23 | NNGRGKVVLSRL |
Hs24 | LRHGVNGLWSRL |
Hs25 | VRGGYLVRTARL |
Hs26 | VAGQHRSLVARL |
Hs27 | NGIEQGKKWARL |
Hs28 | CWRDLPQMIARL |
Hs29 | AILGGEYTMARL |
Hs30* | KEILELVDPARL |
Hs31 | SPYTGGSLSCKL |
Hs32 | MQIEHSLPFCRL |
Hs33 | LVPLYHLIPCRL |
Hs34 | FWSYWMVESCRL |
Hs35 | VSGWGAGGMSRM |
Hs36 | VWREGWGVRARM |
Hs37 | PLTPRCILICRM |
Hs38 | QDWAGLPLRSHL |
Hs39 | KEEWPWGYRAHM |
Hs40 | EGLIVMLERGKL |
Hs41* | LVDPLYILWGRL |
Hs42 | YCYVLRVGGGRL |
Hs43* | LVDPFQVMFGRL |
Hs44 | MGMCTGMLWPRL |
Hs45* | KEILELVDPPRL |
Hs46 | EQSGDSGLKPKM |
Hs47* | PAYRLVAVLANL |
Hs48 | WSMELTPIWCNL |
Hs49 | SNMSLTFMSPNL |
Hs50 | IAMNCVQVKSQL |
Hs51 | WARYENIMSSQL |
Hs52* | VDPSERRLRSML |
Hs53* | LVDPLGPLFSML |
Hs54 | TTMRGDTLTSLL |
Hs55 | RGMGFPAVRSLL |
Hs56 | CRSGLPCLQSLL |
Hs57 | RAGQGTMWRSLL |
Hs58 | VEYMMKWPRALL |
Hs59 | SWSRSRLVSALL |
Hs60 | GRDRTSVVRCLL |
Hs61 | VVGVGGCVKSYL |
Hs62 | LGNFGVSLCSSL |
Hs63 | NGQVRDWCRPSL |
Hs64 | RIASNCNLVSAL |
Hs65 | MVVRMNPLKCVL |
* random C-terminus shorter than 12 residues
Metazoan SW set
SPROT-ID(AC)
|
Sequence
|
ADAS_CAEEL (O45218) | LIDIIGSPHCKL |
ADAS_DROME (Q9V778) | PPTSSTPPKAKL |
AK11_RAT (Q62924) | REDGKWAMSCRL |
AMAC_HUMAN (Q9UHK6) | KIIESNKVKASL |
AMAC_MOUSE (O09174) | RIVESDKLKANL |
AMAC_RAT (P70473) | RIIESNKLKANL |
AOPP_HUMAN (P30044) | TCSLAPNIISQL |
AOPP_MOUSE (P99029) | TCSLAPNILSQL |
CACP_COLLI (P52826) | RSLLQSAPKSKL |
CACP_HUMAN (P43155) | RALLQSHPRAKL |
CACP_MOUSE (P47934) | RTLLQNHPRAKL |
CAOP_CAEEL (P34355) | VEKYLKPMTSKL |
CAOP_HUMAN (Q15067) | SYKHLKSLQSKL |
CAOP_RAT (P07872) | YHKHLKPLQSKL |
CAOQ_RAT (Q63448) | NKSVANRLKSQL |
CATA_ASCSU (P90682) | KNISNLAKYCKY |
CATA_BRARE (Q9PT92) | GGASAVAAASKM |
CATA_CANFA (O97492) | GSHLAAREKANL |
CATA_CAVPO (Q64405) | GSHLSAKEKANL |
CATA_DROME (P17336) | TEELNLAKSSKF |
CATA_HUMAN (P04040) | GSHLAAREKANL |
CATA_MOUSE (P24270) | GSHMAAKGKANL |
CATA_RANRU (Q9PWF7) | SAHVTANDKANL |
CATA_RAT (P04762) | GSHIAAKGKANL |
DAPT_HUMAN (O15228) | KTPIGKPATAKL |
DAPT_MOUSE (P98192) | KKPIGKPATAKL |
DAPT_RAT (Q9ES71) | KKPIGKPATAKL |
ECH1_HUMAN (Q13011) | NKELKTVTFSKL |
ECH1_MOUSE (O35459) | KRDTKSITFSKL |
ECH1_RAT (Q62651) | KKDSKSITFSKL |
ECHP_CAVPO (P55100) | WQSLAGLPSSKL |
ECHP_HUMAN (Q08426) | WQSLAGSPSSKL |
ECHP_RAT (P07896) | WQSLAGPHGSKL |
HAO1_HUMAN (Q9UJM8) | LVRKNPLAVSKI |
HAO1_MOUSE (Q9WU19) | LVRKNPLAVSKI |
HAO2_HUMAN (Q9NYQ3) | EINRNLVQFSRL |
HAO3_HUMAN (Q9NYQ2) | EISPDLIQFSRL |
HAO3_MOUSE (Q9JI00) | EISPDLIQFSRL |
HAO3_RAT (Q07523) | EISPDLIQFSRL |
HMGL_BOVIN (Q29448) | TNSKVAQATCKL |
HMGL_CHICK (P35915) | TNSKVSQAACRL |
HMGL_HUMAN (P35914) | TSSKVAQATCKL |
HMGL_MOUSE (P38060) | TSSKVAQATCKL |
HMGL_RAT (P97519) | TSSKVAQATCKL |
HYES_HUMAN (P34913) | SDARNPPVVSKM |
HYES_MOUSE (P34914) | TEVQNPSVTSKI |
HYES_RAT (P80299) | TEIQNPSVTSKI |
IDHC_HUMAN (O75874) | ENLKIKLAQAKL |
IDHC_MICME (Q9Z2K9) | ENLKAKLAQAKL |
IDHC_MICOH (Q9Z2K8) | ENLKAKLAQAKL |
IDHC_MOUSE (O88844) | ENLKAKLAQAKL |
IDHC_RAT (P41562) | ENLKAKLAQAKL |
IDI1_HUMAN (Q13907) | NQFVDHEKIYRM |
IDI1_MESAU (O35586) | SQFVDHEKIHRM |
IDI1_MOUSE (P58044) | SPFVDHEKIHRL |
IDI1_RAT (O35760) | SPFVDHEKIHRM |
LUCI_LUCCR (P13129) | IREILKKPVAKM |
LUCI_LUCLA (Q01158) | IREILKKPVAKM |
LUCI_PHOPY (P08659) | LIKAKKGGKSKL |
NLTP_CHICK (Q07598) | QNLQLQPGKAKL |
NLTP_HUMAN (P22307) | QNLQLQPGNAKL |
NLTP_MOUSE (P32020) | QNLQLQPGKAKL |
NLTP_RAT (P11915) | QSLQLQPDKAKL |
O55223 (O55223) | MSRFSTLSKAHL |
OXDA_HUMAN (P14920) | EKKLSRMPPSHL |
OXDA_MOUSE (P18894) | EKKLSRLPPSHL |
OXDA_PIG (P00371) | ERNLLTMPPSHL |
OXDA_RABIT (P22942) | EKKSSRMPPSHL |
OXDA_RAT (O35078) | EKNLSRMPPSHL |
OXDD_BOVIN (P31228) | QVLRTPAPKSKL |
OXDD_HUMAN (Q99489) | HALRTPIPKSNL |
P79371 (P79371) | ISRFPSLGKAHL |
PECI_HUMAN (O75521) | AVVNFLSRKSKL |
PECI_MOUSE (Q9WUR2) | AIMSFVSRKPKL |
PMVK_HUMAN (Q15126) | LENLIEFIRSRL |
PTE1_HUMAN (O14734) | IRVKPQVSESKL |
PTE1_MOUSE (P58137) | IRLKPQVSESKL |
PTE2_HUMAN (P49753) | LGGREGTIPSKV |
PTE2_MOUSE (Q9QYR7) | LDGKKKTIPAKL |
Q27757 (Q27757) | LRQMFEKHKSKL |
Q99424 (Q99424) | IRPLLQSWRSKL |
SPYA_CALJA (P31029) | REALQHCPKKKL |
SPYA_FELCA (P41689) | QEALQRCSRNKL |
SPYA_HUMAN (P21549) | RAALQHCPKKKL |
SPYA_MOUSE (O35423) | REALQHCPKNKL |
SPYA_RABIT (P31030) | REALQHCAQSQL |
SPYA_RAT (P09139) | REALQHCPKNKL |
URIC_DROME (P16163) | AQLARKNINSHL |
URIC_DROPS (P22673) | AQLARKNISSHL |
URIC_DROSU (O44111) | AQLARKNLNSHL |
URIC_DROVI (P23194) | AQLSRKSLKSHL |
URIC_MOUSE (P25688) | TGTVKRKLPSRL |
URIC_PAPHA (P25689) | TGTVKRKLSSRL |
URIC_PIG (P16164) | TGTVKRKLTSRL |
URIC_RABIT (P11645) | TGTVKRKLSSRL |
URIC_RAT (P09118) | TGTVRRKLPSRL |
Fungal LH set
Identifier
|
Sequence
|
Sc01* | LELVDPCERSKL |
Sc02 | RMDATKRRESKL |
Sc03* | VDPRCLARISKL |
Sc04 | LSRGRSVSRSRL |
Sc05 | AVHGTFSWRSRL |
Sc06 | NGWGFMTRLSRL |
Sc07 | NGRDRGGWWAKL |
Sc08 | LSANALGGLAKL |
Sc09 | RSGRQGGGFAKL |
Sc10 | GWDWAVSPRAKL |
Sc11 | RDRGTGQGLARL |
Sc12 | WTRDGSHRMARL |
Sc13 | SLLGGAAGWARL |
Sc14 | SGSAVCSRVCRL |
Sc15 | EWEEKSFIKCRL |
Sc16 | STGKRSRSGAHL |
Sc17 | VAWVPRKRVCHL |
Sc18* | PSGGVVARAAKM |
Sc19 | WRATGVSRQAKF |
Sc20 | SSCCVQTPKAKF |
Sc21 | RAPGGVGHKCNL |
Sc22 | ETKGLNAVYGKL |
Sc23 | EWFPVYNRSTKL |
Sc24 | GSESHGSARQKL |
Sc25 | KAGEIPGRMHRL |
Sc26 | RRQWSTGRKLKL |
Sc27 | GPGCCRRRDLKL |
Sc28 | TWGPCDGRRVKL |
Sc29 | ERSVRHRREFRL |
Sc30 | ELGISGARWYKL |
Sc31 | IWDGSRTWAPKL |
Sc32* | PVWVSLGRRWKL |
Sc33 | PLVGRKGGPWKL |
Sc34 | GGIGRKSCGWKL |
Sc35 | SMNGYQRRQWRL |
* random C-terminus shorter than 12 residues
Fungal SW set
Identifier
|
Sequence
|
ACEA_ASHGO (O94198) | EEQFGSSNGAKL |
ACEA_CANTR (P20014) | TEDQFKETKAKV |
ACEA_COPCI (O13439) | AGVTESQFTSKL |
ACEA_YARLI (P41555) | AGVTEDQFKSKL |
AHP1_YEAST (P38013) | TVSSVESVLAHL |
ALOX_CANBO (Q00922) | LKTYEQTGAARY |
ALOX_PICAN (P04841) | LGTYEETGLARF |
AOFN_ASPNG (P46882) | ELGTKREVKARL |
CACP_CANTR (Q00614) | TKGLLTDAKPKL |
CACP_YEAST (P32796) | ALENENKRKAKL |
CATA_PICAN (P30263) | ELKRKASSPSKI |
CATA_YEAST (P15202) | KHASELSSNSKF |
CISZ_YEAST (P08679) | YKELVKNIESKL |
DAS_PICAN (P06834) | KEKPNHDKVNKL |
FAT2_YEAST (P38137) | TFAKSSRNKSKL |
FOX2_CANTR (P22414) | AAIKLVGDKAKI |
FOX2_YEAST (Q02207) | AAVKLSQAKSKL |
MASY_EMENI (P28344) | NEISSPGTASKL |
MASY_NEUCR (P28345) | TSAGNSLPASKL |
MASY_YEAST (P30952) | STKATPTDLSKL |
MASZ_YEAST (P21826) | KPSAKPVDLSKL |
MDHP_YEAST (P32419) | KGKSFILDSSKL |
O93884 (O93884) | LTEKPKHDQNHL |
OXDA_FUSSO (P24552) | VDKVGKAAKSKL |
OXDA_RHOTO (P80324) | QRYHGAARESKL |
PEX8_PICAN (Q00925) | EHVNESQEKAKL |
PEX8_PICPA (Q01962) | YENVNAQSTAKL |
PEX8_YEAST (P53248) | YTTVLSSQSSKL |
PTE1_YEAST (P41903) | VYGSERDIRAKF |
PX18_CANMA (Q00680) | SVFKKLDPRPKL |
PX18_CANTR (P22009) | AVFKKLDPRPKL |
Q12598 (Q12598) | VVIEKIDADAKL |
Q96VB8 (Q96VB8) | KKSPRGASKNKF |
URIC_ASPFL (Q00511) | CTVGRSSLKSKL |
URIC_EMENI (P33282) | KCTVGRKSKAKL |
URIC_PICJA (P78609) | KCTVVRKEKTKL |
VAOX_PENSI (P56216) | WPSQYSHVTWKL |
Remaining learning set sequences
Identifier
|
Sequence
|
ACE1_SOYBN (P45456) | DRGSIVVAKARM |
ACE2_SOYBN (P45457) | DRGSIVVAKARM |
ACEA_ARATH (P28297) | EGTSLVVAKSRM |
ACEA_BRANA (P25248) | EGTSLVVAKSRM |
ACEA_CUCMA (P93110) | EEGSVVVAKSRM |
ACEA_CUCSA (P49296) | EEGNVVVAKSRM |
ACEA_DENCR (Q9SE26) | RGGITVNAKSRL |
ACEA_GOSHI (P17069) | SEGNLVVAKARM |
ACEA_LYCES (P49297) | GDGSVVIAKARM |
ACEA_PINTA (Q43097) | IGAGTVLAKSRM |
ACEA_RICCO (P15479) | SAGSEVVAKARM |
ADAS_DICDI (O96759) | LFDVVNVKYPKL |
ADAS_TRYBB (O97157) | KMGIPGALQAHL |
CAT1_CUCPE (P48350) | KLASHLNVRPSI |
CAT1_GOSHI (P17598) | KLASLLNVRPSI |
CAT1_HORVU (P55307) | KLASRLKIKPNM |
CAT1_LYCES (P30264) | KVASRLTVKPTM |
CAT1_MAIZE (P18122) | KLPSRLNLKPSM |
CAT1_NICPL (P49315) | KLASRLNVRPSI |
CAT1_RICCO (Q01297) | KLATRLNVKPSI |
CAT1_SOLTU (P49284) | KVASRLTVKPTM |
CAT1_TOBAC (P49319) | KVASRLTLKPTM |
CAT1_WHEAT (Q43206) | KLASRLSSKPSM |
CAT2_ARATH (P25819) | KLASRLNVRPSI |
CAT2_CUCPE (P48351) | KIASRMNARPNM |
CAT2_GOSHI (P30567) | KIASRLNVRPSI |
CAT2_HORVU (P55308) | KVANRLNVKPSM |
CAT2_MAIZE (P12365) | KLASRLSAKPSM |
CAT2_NICPL (P49316) | KVASRLTLKPTM |
CAT2_RICCO (P49318) | KLASRLNVRPNI |
CAT2_SOLTU (P55312) | KVASRLTVKPTM |
CAT2_WHEAT (P55313) | KLASRLKIKPNM |
CAT3_ARATH (Q42547) | KLASRLNVRPSI |
CAT3_CUCPE (P48352) | KIASRLNVRPNI |
CAT3_NICPL (P49317) | KIASRLNVRPTM |
CATA_DICDI (O77229) | NDVIKFAARSNL |
CATA_HELAN (P45739) | KIASRLNVKPNY |
CATA_IPOBA (P07145) | KVASRLNIRPTM |
CATA_ORYSA (P29611) | KIANRLNVKPSM |
CATA_PEA (P25890) | KLASHLNMRPSI |
CATA_PHAAU (P32290) | KIASHLNMRPNI |
CATA_SECCE (P55310) | KVANRLNVKPSM |
CATA_SOLME (P55311) | KVASRLLVKPTM |
CATA_SOYBN (P29756) | KIASHLNLKPSI |
CATA_TOXGO (Q9XZD5) | GLPTAACYPAKM |
CATB_ORYSA (P55309) | KLASRLNLKPNM |
DHAB_HORVU (Q40024) | ELYGWYQRPSKL |
DHAB_ORYSA (O24174) | EPYGWYRPPSKL |
G3PG_LEIME (Q27890) | YMAAKDAASSKM |
G3PG_TRYBB (P22512) | RHMAARDRAAKL |
G3PG_TRYCR (P22513) | RHMASKDRSARL |
G6PI_TRYBB (P13377) | GLINMFNELSHL |
GOX1_ARATH (Q9LRS0) | TEWDTPRHLPRL |
GOX2_ARATH (Q9LRR9) | TEWDTPRPSARL |
GOX_SPIOL (P05414) | WDGPSSRAVARL |
GPDA_TRYBB (P90593) | EGLPALPRTSKM |
GPDA_TRYBR (Q26756) | EGLPALPRTSKM |
MASY_BRANA (P13244) | IVAHYPINASRL |
MASY_CUCMA (P24571) | IVIHHPRELSRL |
MASY_CUCSA (P08216) | IVIHHPRELSKL |
MASY_GOSHI (P17432) | VIHHPKDVSSKL |
MASY_MAIZE (P49081) | VAHHPGASPCKL |
MASY_RAPSA (Q43827) | IVAHYPINVSRL |
MASY_RICCO (P17815) | IVIHYPKGSSRL |
MASY_SOYBN (P45458) | IVVHHPRETSKL |
PGKC_TRYBB (P07378) | GTGTLSNRWSSL |
URIC_ARATH (O04420) | IEATLSRITSKL |
URIC_CANLI (P34798) | IQASLRRLWSKL |
URIC_PHAVU (P53763) | IEASLSRVWSKL |
URIC_SOYBN (P04670) | IQASLSRLWSKL |
URID_CANLI (P34799) | IQASLSRLWSKL |
URID_SOYBN (O04104) | IQASLSRLWSKL |
Gapless alignments of the 12 C-terminal resudues in clustalx colors:
Secondary structure prediction
using the PREDATOR program
Search for low complexity regions
[12-2.2-2.5]
[25-3.0-3.3]
[45-3.4-3.75]
using the SEG program