ID POLG_TMEVB STANDARD; PRT; 2303 AA. AC P08544; Q88583; Q88584; Q88585; Q88586; Q88587; Q88588; Q88589; AC Q88590; Q88591; Q88592; DT 01-AUG-1988 (Rel. 08, Created) DT 01-AUG-1988 (Rel. 08, Last sequence update) DT 30-MAY-2000 (Rel. 39, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Theiler's murine encephalomyelitis virus (strain BeAn 8386) (TMEV). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Cardiovirus. OX NCBI_TaxID=12125; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=87198877; PubMed=3033278; RA Pevear D.C., Calenoff M., Rozhon E., Lipton H.L.; RT "Analysis of the complete nucleotide sequence of the picornavirus RT Theiler's murine encephalomyelitis virus indicates that it is closely RT related to cardioviruses."; RL J. Virol. 61:1507-1516(1987). RN [2] RP X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS). RX MEDLINE=92196127; PubMed=1312722; RA Luo M., He C., Toth K.S., Zhang C.X., Lipton H.L.; RT "Three-dimensional structure of Theiler murine encephalomyelitis virus RT (BeAn strain)."; RL Proc. Natl. Acad. Sci. U.S.A. 89:2409-2413(1992). CC -!- FUNCTION: IT IS THOUGHT THAT THE P2C PROTEIN ATTACHES TO VESICULAR CC MEMBRANES AND IS ASSOCIATED WITH VIRAL RNA SYNTHESIS. CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: CLOSELY RELATED TO ENCEPHALOMYOCARDITIS VIRUS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; M16020; AAA47930.1; -. DR PIR; A29535; GNNYTM. DR PDB; 1TMF; 31-OCT-93. DR MEROPS; C03.009; -. DR MEROPS; U29.001; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate; KW 3D-structure. FT PROPEP 1 76 LEADER PEPTIDE. FT CHAIN 77 147 COAT PROTEIN VP4 (P1A). FT CHAIN 148 414 COAT PROTEIN VP2 (P1B). FT CHAIN 415 646 COAT PROTEIN VP3 (P1C). FT CHAIN 647 922 COAT PROTEIN VP1 (P1D). FT CHAIN 923 1064 CORE PROTEIN P2A. FT CHAIN 1065 1191 CORE PROTEIN P2B. FT CHAIN 1192 1517 CORE PROTEIN P2C. FT CHAIN 1518 1605 CORE PROTEIN P3A. FT CHAIN 1606 1625 GENOME-LINKED PROTEIN VPG (P3B). FT CHAIN 1626 1842 PICORNAIN 3C. FT CHAIN 1843 2303 RNA-DIRECTED RNA POLYMERASE P3D. FT LIPID 77 77 MYRISTATE (BY SIMILARITY). FT ACT_SITE 1793 1793 PROTEASE (POTENTIAL). FT ACT_SITE 1811 1811 PROTEASE (POTENTIAL). FT HELIX 103 106 FT HELIX 158 160 FT STRAND 162 166 FT TURN 167 168 FT STRAND 169 173 FT STRAND 180 181 FT STRAND 200 200 FT HELIX 204 206 FT STRAND 210 217 FT TURN 219 220 FT TURN 223 224 FT STRAND 225 226 FT STRAND 228 231 FT TURN 233 235 FT TURN 239 240 FT HELIX 241 247 FT STRAND 250 253 FT STRAND 256 262 FT TURN 267 268 FT STRAND 272 278 FT TURN 297 298 FT STRAND 305 305 FT TURN 313 314 FT TURN 325 326 FT TURN 330 334 FT STRAND 338 341 FT TURN 343 345 FT STRAND 348 353 FT STRAND 362 363 FT TURN 364 366 FT STRAND 370 378 FT TURN 384 385 FT STRAND 390 399 FT STRAND 403 406 FT STRAND 428 429 FT TURN 430 431 FT TURN 459 463 FT STRAND 470 470 FT STRAND 478 478 FT STRAND 483 483 FT STRAND 493 495 FT TURN 499 500 FT TURN 505 506 FT HELIX 508 511 FT TURN 513 514 FT STRAND 518 520 FT STRAND 524 529 FT TURN 533 534 FT STRAND 536 543 FT TURN 546 547 FT HELIX 554 557 FT TURN 558 559 FT STRAND 561 562 FT STRAND 565 566 FT STRAND 572 575 FT STRAND 586 587 FT STRAND 602 611 FT TURN 614 615 FT STRAND 620 620 FT STRAND 622 627 FT STRAND 633 635 FT TURN 652 653 FT TURN 680 684 FT STRAND 691 692 FT TURN 707 708 FT STRAND 713 714 FT TURN 760 764 FT STRAND 770 780 FT STRAND 791 794 FT TURN 796 797 FT STRAND 820 822 FT STRAND 831 835 FT STRAND 844 845 FT TURN 865 866 FT STRAND 871 874 FT STRAND 880 892 SQ SEQUENCE 2303 AA; 256280 MW; E2C7737DFDBEB786 CRC64; MACKHGYPDV CPICTAVDAT PGFEYLLMAD GEWYPTDLLC VDLDDDVFWP SDTSNQSQTM DWTDVPLIRD IVMEPQGNSS SSDKSNSQSS GNEGVIINNF YSNQYQNSID LSASGGNAGD APQTNGQLSN ILGGAANAFA TMAPLLLDQN TEEMENLSDR VASDKAGNSA TNTQSTVGRL CGYGKSHHGE HPASCADTAT DKVLAAERYY TIDLASWTTS QEAFSHIRIP LPHVLAGEDG GVFGATLRRH YLCKTGWRVQ VQCNASQFHA GSLLVFMAPE FYTGKGTKTG TMEPSDPFTM DTEWRSPQGA PTGYRYDSRT GFFATNHQNQ WQWTVYPHQI LNLRTNTTVD LEVPYVNVAP SSSWTQHANW TLVVAVLSPL QYATGSSPDV QITASLQPVN PVFNGLRHET VIAQSPIPVT VREHKGCFYS TNPDTTVPIY GKTISTPSDY MCGEFSDLLE LCKLPTFLGN PNTNNKRYPY FSATNSVPAT SMVDYQVALS CSCMANSMLA AVARNFNQYR GSLNFLFVFT GAAMVKGKFL IAYTPPGAGK PTTRDQAMQS TYAIWDLGLN SSFNFTAPFI SPTHYRQTSY TSPTITSVDG WVTVWKLTPL TYPSGTPTNS DILTLVSAGD DFTLRMPISP TKWVPQGVDN AEKGKVSNDD ASVDFVAEPV KLPENQTRVA FFYDRAVPIG MLRPGQNMET TFNYQENDYR LNCLLLTPLP SFCPDSSSGP QKTKAPVQWR WVRSGGVNGA NFPLMTKQDY AFLCFSPFTF YKCDLEVTVS ALGMTRVASV LRWAPTGAPA DVTDQLIGYT PSLGETRNPH MWLVGAGNSQ VSFVVPYNSP LSVLPAAWFN GWSDFGNTKD FGVAPNADFG RLWIQGNTSA SVRIRYKKMK VFCPRPTLFF PWPTPTTTKI NADNPVPILE LENPAALYRI DLFITFTDEF ITFDYKVHGR PVLTFRIPGF GLTPAGRMLV CMGEQPAHGP FTSSRSLYHV IFTATCSSFS FSIYKGRYRS WKKPIHDELV DRGYTTFGEF FKAVRGYHAD YYRQRLIHDV ETNPGPVQSV FQPQGAVLTK SLAPQAGIQN LLLRLLGIDG DCSEVSKAIT VVTDLVAAWE KAKTTLVSPE FWSKLILKTT KFIAASVLYL HNPDFTTTVC LSLMTGVDLL TNDSVFDWLK QKLSSFFRTP PPACPNVMQP QGPLREANEG FTFAKNIEWA MKTIQSVVNW LTSWFKQEED HPQSKLDKLL MEFPDHCRNI MDMRNGRKAY CECTASFKYF DELYNLAVTC KRIPLASLCE KFKNRHDHSV TRPEPVVVVL RGAAGQGKSV TSQIIAQSVS KMAFGRQSVY SMPPDSEYFD GYENQFSVIM DDLGQNPDGE DFTVFCQMVS STNFLPNMAH LERKGTPFTS SFIVATTNLP KFRPVTVAHY PAVDRRITFD FTVTAGPHCK TPAGMLDVEK AFDEIPGSKP QLACFSADCP LLHKRGVMFT CNRTQTVYNL QQVVKMVNDT ITRKTENVKK MNSLVAQSPP DWEHFENILT CLRQNNAALQ DQLDELQEAF AQARERSDFL SDWLKVSAII FAGIASLSAV IKLASKFKES IWPTPVRVEL SEGEQAAYAG RARAQKQALQ VLDIQGGGKV LAQAGNPVMD FELFCAKNIV APITFYYPDK AEVTQSCLLL RAHLFVVNRH VAETDWTAFK LKDVRHERHT VALRSVNRSG AKTDLTFIKV TKGPLFKDNV NKFCSNKDDF PARNDTVTGI MNTGLAFVYS GNFLIGNQPV NTTTGACFNH CLHYRAQTRR GWCGSAIICN VNGKKAVYGM HSAGGGGLAA ATIITKELIE AAEKSMLALE PQGAIVDIAT GSVVHVPRKT KLRRTVAHDV FQPKFEPAVL SRYDPRTDKD VDVVAFSKHT TNMESLPPIF DVVCGEYANR VFTILGKENG LLTVEQAVLG LPGMDPMEKD TSPGLPYTQQ GLRRTDLLNF ITAKMTPQLD YAHSKLVIGV YDDVVYQSFL KDEIRPIEKI HEAKTRIVDV PPFAHCIWGR QLLGRFASKF QTKPGLELGS AIGTDPDVDW TRYAVELSGF NYVYDVDYSN FDASHSTAMF ECLINNFFTE QNGFDRRIAE YLRSLAVSRH AYEDRRVLIR GGLPSGCAAT SMLNTIMNNV IIRAALYLTY SNFDFDDIKV LSYGDDLLIG TNYQIDFNLV KERLAPFGYK ITPANKTTTF PLTSHLQDVT FLKRRFVRFN SYLFRPQMDA VNLKAMVSYC KPGTLKEKLM SIALLAVHSG PDIYDEIFLP FRNVGIVVPT YSSMLYRWLS LFR //