ID POLG_EMCVD STANDARD; PRT; 2292 AA. AC P17594; DT 01-AUG-1990 (Rel. 15, Created) DT 01-FEB-1996 (Rel. 33, Last sequence update) DT 15-DEC-1998 (Rel. 37, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Encephalomyocarditis virus (strain emc-d diabetogenic). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Cardiovirus. OX NCBI_TaxID=12106; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=89243189; PubMed=2541543; RA Bae Y.S., Eun H.M., Yoon J.W.; RT "Genomic differences between the diabetogenic and nondiabetogenic RT variants of encephalomyocarditis virus."; RL Virology 170:282-287(1989). CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; M22458; AAA43034.1; -. DR PIR; A31473; GNNYED. DR HSSP; P12296; 1MEC. DR MEROPS; C03.009; -. DR MEROPS; U29.001; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate. FT PROPEP 1 67 LEADER PEPTIDE. FT CHAIN 68 137 COAT PROTEIN VP4 (RHO). FT CHAIN 138 393 COAT PROTEIN VP2 (BETA). FT CHAIN 394 624 COAT PROTEIN VP3 (GAMMA). FT CHAIN 625 901 COAT PROTEIN VP1 (ALPHA). FT CHAIN 902 1058 CORE PROTEIN P2A (G). FT CHAIN 1059 1194 CORE PROTEIN P2B (I). FT CHAIN 1195 1519 CORE PROTEIN P2C (F). FT CHAIN 1520 1607 CORE PROTEIN P3A. FT CHAIN 1608 1627 GENOME-LINKED PROTEIN VPG (H). FT CHAIN 1628 1832 PICORNAIN 3C (P22). FT CHAIN 1833 2292 RNA-DIRECTED RNA POLYMERASE P3D (E). FT LIPID 68 68 MYRISTATE (BY SIMILARITY). FT ACT_SITE 1786 1786 PROTEASE (POTENTIAL). FT ACT_SITE 1804 1804 PROTEASE (POTENTIAL). SQ SEQUENCE 2292 AA; 255426 MW; F2B0627B0F444107 CRC64; MATTMEQEIC AHSLTFKGCP KCSALQYRNG FYLLKYDEEW YPEELLTDGE DDVFDPELDM EVVFELQGNS TSSDKNNSSS DGNEGVIINN FYSNQYQNSI DLSANATGSD PPRTYGQFSN LLSGAVNAFS NMIPLLADQN TEEMENLSDR VLQDTAGNTV TNTQSTVGRL VGYGAVHDGE HPASCADTAS EKILAVERYY TFKVNDWTST QKPFEYIRIP LPHVLSGEDG GVFGAALRRH YLVKTGWRVQ VQCNASQFHA GSLLVFMAPE YPTLDAFAMD NRWSKDNLPN GTKTQTNRKG PFAMDHQNFW QWTLYPHQFL NLRTNTTVDL EVPYVNIAPT SSWTQHASWT LVIAVVAPLT YSTGASTSLD ITASIQPVRP VFNGLRHETL SRQSPIPVTI REHAGTWYST LPDSTVPIYG KTPVAPANYM VGEYKDFLEI AQIPTFIGNK IPNAVPYIEA SNTAVKTQPL ATYQVTLSCS CLANTFLAAL SRNFAQYRGS LVYTFVFTGT AMMKGKFLIA YTPPGAGKPT SRDQAMQATY AIWDLGLNSS YSFTVPFISP THFRMVGTDQ VNITNVDGWV TVWQLTPLTY PPGCPTSAKI LTMVSAGKDF SLKMPISPAP WSPQGVENAE RGVTEDTDAT ADFVAQPVYL PENQTKVAFF YDRSSPIGAF AVKSGSLESG FAPFSNETCP NSVILTPGPQ FDPAYDQLRP QRLTEIWGNG NEETSKVFPL KSKQDYSFCL FSPFVYYKCD LEVTLSPHTS GNHGLLVRWC PTGTPAKPTT QVLHEVSSLS EGRTPQVYSA GPGISNQISF VVPYNSPLSV LPAVWYNGHK RFDNTGSLGI APNSDFGTLF FAGTKPDIKF TVYLRYKNMR VFCPRPTVFF PWPSSGDKID MTPRAGVLML ESPNALDISR TYPTLHILIQ FNHGGLEIRL FRHGMFWAEA HADVILRSRT KQISFLNNGS FPSMDARAPW NPWKNTYHAV LRAEPYRVTM DVYHKRIRPF RLPLVQKEWN VREENVFGLY GIFNAHYAGY FADLLIHDIE TNPGPFMAKP KKQVFQTQGA AVSSMAQTLL PNDLASKVMG SAFTALLDAN EDAQKAMRII KTLSSLSDAW ENVKETLNNP EFWKQLLSRC VQLIAGMTIA VMHPDPLTLL CLGTLTAAEI TSQTSLCEEI VAKFKKIFTT PPPRFPTISL FQQQSPLKQV NDVFSLAKNL DWAVKTVEKV VDWFGTWVVQ EEKEQTLDQL LQRFPEHAKR ISDLRNGMSA YVECKESFDF FEKLYNQAVK EKRTGIAAVC EKFRQKHDHA TARCEPVVIV LRGDAGQGKS LSSQVIAQAV SKTIFGRQSV YSLPPDSDFF DGYENQFAAI MDDLGQNPDG SDFTTFCQMV STTNFLPNMA SLERNGTPFT SQIVVATTNL PEFRPVTIAH YPAVERRITF DYSVSAGPVC SKTEAGYKVL DVERAFRPTG DAPLPCFQNN CLFLEKAGLQ FRDNRTKEIL SLVDVIERAV ARIERKKKVL TTVQTLVAQA PVDEVSFHSV VQQLKARQEA TDEQLEELQE AFAKTQERSS VFSDWMKISA MLCAATLALS QVVKMAKTVK QMVRPDLVRV QLDEQEQGPY NEAVRAKPKT LQLLDIQGPN PVMDFEKYVA KFVTAPIDFV YPTGVSTQTC LLVKGRTLAV NRHMAESDWS SIVVRGVTHA RSTVRILAIA KAGKETDVSF IRLSSGPLFR DNTSKFVKAD DVLPATSAPV IGIMNTDIPM MFTGTFLKAG VSVPVETGQT FNHCIHYKAN TRKGWCGSAL LADLGGKKKI LGMHSAGSMG RTAASIVSQE MICAVVSAFE PQGALERLPD GPRIHVPRKT ALRPTVARRV FQPAYAPAVL SKFDPRTEAD VDEVAFSKHT SNQESLPPVF RMVAKEYANR VFTLLGRDNG RLTVKQALEG LEGMDPMDKN TSPGLPYTAL GMRRTDVVDW ESATLIPYAA DRLKKMNEGD FSDIVYQTFL KDELRPVEKV QAAKTRIVDV PPFEHCILGR QLLGRFASKF QTQPGLELGS AIGCDPDVHW TAFGVAMQGF ERVYDVDYSN FDSTHSVAMF RLLAEEFFTP ENGFDPLVKE YLESLAISTH AFEEKRYLIT GGLPSGCAAT SMLNTIMNNI IIRAGLYLTY KNFEFDDVKV LSYGDDLLVA TNYQLNFDKV RASLAKTGYK ITPANKTSTF PLDSTLEDVV FLKRKFKKEG PLYRPVMNRE ALEAMLSYYR PGTLSEKLTS ITMLAVHSGK PEYDRLFAPF REVGVVVPSF ESVEYRWRSL FW //