ID POLG_EC12T STANDARD; PRT; 2193 AA. AC Q66575; Q66576; DT 15-JUL-1999 (Rel. 38, Created) DT 15-JUL-1999 (Rel. 38, Last sequence update) DT 15-JUL-1999 (Rel. 38, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEIN VP4 (P1A); COAT PROTEIN VP2 DE (P1B); COAT PROTEIN VP3 (P1C); COAT PROTEIN VP1 (P1D); PICORNAIN 2A DE (EC 3.4.22.29) (P2A); CORE PROTEIN P2B; CORE PROTEIN P2C; CORE PROTEIN DE P3A; GENOME-LINKED PROTEIN VPG (P3B); PICORNAIN 3C (EC 3.4.22.28) DE (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.48) (P3D)]. OS Echovirus 12 (strain Travis). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Enterovirus. OX NCBI_TaxID=103909; RN [1] RP SEQUENCE FROM N.A. RC STRAIN=WILDTYPE; RX MEDLINE=95364006; PubMed=7637032; RA Kraus W., Zimmermann H., Zimmermann A., Eggers H.J., Nelsen-Salz B.; RT "Infectious cDNA clones of echovirus 12 and a variant resistant RT against the uncoating inhibitor rhodanine differ in seven amino RT acids."; RL J. Virol. 69:5853-5858(1995). CC -!- FUNCTION: P2A AND THE P3C POLYPEPTIDES ARE PROTEASES THAT CLEAVE CC AT CERTAIN Q/G SITES IN THE POLYPROTEIN. THEY ARE CYSTEINE CC PROTEASES. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC CLEAVAGE BETWEEN VP4 AND VP2 IS AUTOCATALYTIC; VP1/P2A IS CC CATALYZED BY P2A; ALL OTHER CLEAVAGES ARE CATALYZED BY P3C. CC -!- SIMILARITY: P2A PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -!- SIMILARITY: P3C PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; X79047; CAA55650.1; -. DR EMBL; X77708; CAA54783.1; -. DR MEROPS; C03.001; -. DR MEROPS; C03.020; -. DR INTERPRO; IPR000081; -. DR INTERPRO; IPR000199; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001643; -. DR INTERPRO; IPR001676; -. DR INTERPRO; IPR002527; -. DR PFAM; PF00548; Cys-protease-3C; 1. DR PFAM; PF00947; Pico_P2A; 1. DR PFAM; PF01552; Pico_P2B; 1. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. DR PRINTS; PR00918; CALICVIRUSNS. KW Polyprotein; Coat protein; Core protein; Transferase; Myristate; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease. FT CHAIN 2 69 COAT PROTEIN VP4. FT CHAIN 70 330 COAT PROTEIN VP3. FT CHAIN 331 568 COAT PROTEIN VP2. FT CHAIN 569 859 COAT PROTEIN VP1. FT CHAIN 860 1009 PICORNAIN 2A. FT CHAIN 1010 1108 CORE PROTEIN P2B. FT CHAIN 1109 1437 CORE PROTEIN P2C. FT CHAIN 1438 1526 CORE PROTEIN P3A. FT CHAIN 1527 1548 GENOME-LINKED PROTEIN VPG. FT CHAIN 1549 1731 PICORNAIN 3C. FT CHAIN 1732 2193 RNA-DIRECTED RNA POLYMERASE. FT LIPID 2 2 MYRISTATE (BY SIMILARITY). FT ACT_SITE 1695 1695 PROTEASE 3C (POTENTIAL). FT ACT_SITE 1709 1709 PROTEASE 3C (POTENTIAL). FT VARIANT 223 223 H -> Y (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 228 228 G -> S (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 376 376 I -> M (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 643 643 Y -> C (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 669 669 V -> A (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 725 725 V -> A (IN RHODAMINE-RESISTANT VARIANT). FT VARIANT 2094 2094 C -> R (IN RHODAMINE-RESISTANT VARIANT). SQ SEQUENCE 2193 AA; 244803 MW; 1EA34E298F55130C CRC64; MGAQVSTQKT GAHETGLSAS GNSIIHYTNI NYYKDAASNS ANRQDFTQDP GKFTEPVKDI MIKSMPALNS PTAEECGYSD RVRSITLGNS TITTQECANV VVGYGTWPDY LHDDEATAED QPTQPDVATC RFYTLESIQW QKTSDGWWWK FPEALKDMGL FGQNMHYHYL GRSGYTIHVQ CNASKFHQGC LLVVCVPEAE MGCATVANEV NAAALSSGET AKHFAKTGAT GTHTVQSIVT NAGMGVGVGN LTIFPHQWIN LRTNNSATIV MPYINSVPMD NMFRHYNFTL MIIPFVPLDF TAEASTYVPI TVTVAPMCAE YNGLRLASHQ GLPTMNTPGS NQFLTSDDFQ SPSAMPQFDV TPELRIPGEV KNLMEIAEVD SVVPVNNTQD SVYNMDVYKI PVSGGNQLST QVFGFQMQPG LNSVFKRTLL GEILNYYAHW SGSVKLTFVF CGSAMALAKF LLAYSPPGAD PPKSRKEAML GTHVIWDIGL QSSCVLCVPW ISQTHYRLVQ QDEYTSAGYV TCWYQTSLVV PPGAPATCGV LCLASACNDF SVRMLRDTPF IEQKQLLQGD VEEAVNRAVA RVADTLPTGP RNSESIPALT AAETGHTSQV VPGDTMQTRH VKNYHSRTES SVENFLCRAA CVYITKYKTK DSDPVQRYAN WRINTRQMVQ LRRKFELFTY LRFDMEVTFV ITSSQDDGTQ LAQDMPVLTH QVMYIPPGGP VPNSVTDFAW QSSTNPSIFW TEGNAPARMS IPFISIGNAY SNFYDGWSHF TQDGVYGFNS LNNMGSIYIR HVNEQSPYAI TSTVRVYFKP KHVRAWVPRP PRLCAYEKSS NVNFKPTDVT TSRTSITEVP SLRPSVVNTG AFGQQSGAAY VGNYRVVNRH LATHVDWQNC VWEDYNRDLL VSTTTAHGCD TIARCQCTTG VYFCASRNKH YPVSFEGPGL VEVQESEYYP RRYQSHVLLA AGFSEPGDCG GILRCEHGVI GLVTMGGEGV VGFADVRDLL WLEDDAMEQG VKDYVEQLGN AFGSGFTNQI CEQVNLLKES LVGHDSILEK SLKALVKIIS ALVIVVRNHD DLITVTATLA LIGCTSSPWR WLKHKVSQYY GIPMAERQSN GWLKKFTEMT NACKGMEWIA IKIQKFIEWL KLKILPEVKE KHEFLNRLKQ LPLLESQIAT IEQSAPSQSD QEQLFSNVQY FAHYCRKYAP LYAAEAKRVF SLEKKMSNYI QFKSKCRIEP VCLLLHGSPG AGKSVATSLI GRSLAEKLNS SVYSLPPDPD HFDGYKQQAV VIMDDLCQNP DGKDVSLFCQ MVSSVDFVPP MAALEEKGIL FTSPFVLAST NAGSINAPTV SDSRALARRF HFDMNIEVIS MYSQNGKINM PMSVKTCDEE CCPVNFKRCC PLVCGKAIQF IDRRTQVRYS LDMLVTEMFR EYNHRHSVGA TLEALFQGPP VIREIKISVA PETPPPPAIA DLLKSVDSEA VREYCKEKGW LVPEVNSTLQ IEKHVSRAFI CLQALTTFVS VAGIIYIIYK LFAGFQGAYT GMPNQKPKVP TLRQAKVQGP AFEFAVAMMK RNASTVKTEY GEFTMLGIYD RWAVLPHHAK PGPTILMNDQ EIGVLDAKEL VDKDGTNLEL TLLKLNRNEK FRDIRGFLAR EEAEVNEAVL AINTSKFPNM YIPVGQVTDY GFLNLGGTPT KRMLMYNFPT RAGQCGGVLM STGKVLGIHV GGNGHQGFSA ALLRHYFNEE QGEIEFIESS KDAGFPVINT PSKTKLEPSV FHQVFEGNKE PAVLRNGDPR LKVNFEEAIF SKYIGNINTH VDEYMLEAVD HYAGQLATLD ISTEPMKLED AVYGTEGLEA LDLTTSAGYP YVAIGIKKRD ILSKKTKDLT KLKECMDKYG LNLPMVTYVK DELRSSEKVA KGKSRLIEAS SLNDSVAMRQ TFGNLYKTFH LNPGIVTGSA VGCDPDLFWS KIPVMLDGHL IAFDYSGYDA SLSPVWFACL KLLLEKLGYT HRETNYIDYL CNSHHLYRDK HYFVRGGMPS GCSGTSIFNS MINNIIIRTL MLKVYKGIDL DQFRMIAYGD DVIASYPWPI DASLLAEAGK GYGLIMTPAD KGECFNEVTW TNVTFLKRYF RADEQYPFLV HPVMPMKDIH ESIRWTKDPK NTQDHVRSLC LLAWHNGEQE YEEFIRKIRS VPVGRCLTLP AFSTLRRKWL DSF //