ID POLG_TMEVD STANDARD; PRT; 2301 AA. AC P13899; Q88564; Q88565; Q88566; Q88567; Q88568; Q88569; Q88570; AC Q88571; Q88572; Q88573; Q88574; Q89580; DT 01-JAN-1990 (Rel. 13, Created) DT 01-JAN-1990 (Rel. 13, Last sequence update) DT 30-MAY-2000 (Rel. 39, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Theiler's murine encephalomyelitis virus (strain DA) (TMEV). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Cardiovirus. OX NCBI_TaxID=12126; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=88206072; PubMed=2834872; RA Ohara Y., Stein S., Fu J., Stillman L., Klaman L., Roos R.P.; RT "Molecular cloning and sequence determination of DA strain of RT Theiler's murine encephalomyelitis viruses."; RL Virology 164:245-255(1988). RN [2] RP X-RAY CRYSTALLOGRAPHY (2.8 ANGSTROMS). RX MEDLINE=92196057; PubMed=1549565; RA Grant R.A., Filman D.J., Fujinami R.S., Icenogle J.P., Hogle J.M.; RT "Three-dimensional structure of Theiler virus."; RL Proc. Natl. Acad. Sci. U.S.A. 89:2061-2065(1992). CC -!- FUNCTION: IT IS THOUGHT THAT THE P2C PROTEIN ATTACHES TO VESICULAR CC MEMBRANES AND IS ASSOCIATED WITH VIRAL RNA SYNTHESIS. CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: CLOSELY RELATED TO ENCEPHALOMYOCARDITIS VIRUS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; M20301; AAA47928.1; -. DR PIR; A31228; GNNYTN. DR PDB; 1TME; 31-JAN-94. DR MEROPS; C03.009; -. DR MEROPS; U29.001; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate; KW 3D-structure. FT PROPEP 1 76 LEADER PEPTIDE. FT CHAIN 77 147 COAT PROTEIN VP4 (P1A). FT CHAIN 148 414 COAT PROTEIN VP2 (P1B). FT CHAIN 415 646 COAT PROTEIN VP3 (P1C). FT CHAIN 647 920 COAT PROTEIN VP1 (P1D). FT CHAIN 921 1062 CORE PROTEIN P2A. FT CHAIN 1063 1189 CORE PROTEIN P2B. FT CHAIN 1190 1515 CORE PROTEIN P2C. FT CHAIN 1516 1603 CORE PROTEIN P3A. FT CHAIN 1604 1623 GENOME-LINKED PROTEIN VPG (P3B). FT CHAIN 1624 1840 PICORNAIN 3C. FT CHAIN 1841 2301 RNA-DIRECTED RNA POLYMERASE P3D. FT LIPID 77 77 MYRISTATE (BY SIMILARITY). FT ACT_SITE 1791 1791 PROTEASE (POTENTIAL). FT ACT_SITE 1809 1809 PROTEASE (POTENTIAL). FT TURN 96 97 FT HELIX 103 106 FT STRAND 109 110 FT STRAND 162 166 FT TURN 167 168 FT STRAND 169 173 FT STRAND 179 181 FT HELIX 182 184 FT TURN 193 194 FT STRAND 200 201 FT HELIX 204 206 FT STRAND 210 217 FT TURN 219 220 FT TURN 223 224 FT STRAND 226 231 FT TURN 232 232 FT HELIX 233 235 FT HELIX 238 247 FT TURN 248 249 FT STRAND 250 262 FT TURN 267 268 FT STRAND 270 279 FT TURN 280 280 FT STRAND 286 287 FT TURN 289 291 FT STRAND 294 295 FT TURN 297 298 FT STRAND 305 305 FT STRAND 324 324 FT TURN 325 326 FT HELIX 330 335 FT STRAND 338 342 FT TURN 343 345 FT STRAND 348 353 FT STRAND 362 362 FT HELIX 364 366 FT STRAND 367 367 FT STRAND 370 381 FT TURN 384 385 FT STRAND 390 406 FT TURN 423 426 FT TURN 430 431 FT STRAND 438 438 FT STRAND 444 445 FT TURN 449 450 FT STRAND 454 455 FT STRAND 457 457 FT HELIX 458 463 FT STRAND 466 467 FT STRAND 470 471 FT TURN 473 474 FT STRAND 477 478 FT STRAND 480 484 FT STRAND 493 496 FT TURN 499 500 FT TURN 503 506 FT HELIX 508 513 FT TURN 514 515 FT STRAND 516 520 FT STRAND 523 529 FT TURN 533 534 FT STRAND 536 544 FT HELIX 554 557 FT TURN 558 559 FT STRAND 561 566 FT STRAND 572 577 FT STRAND 586 587 FT STRAND 601 611 FT TURN 614 615 FT STRAND 619 628 FT TURN 630 631 FT STRAND 633 637 FT STRAND 648 649 FT HELIX 651 653 FT TURN 661 664 FT TURN 676 677 FT STRAND 678 678 FT HELIX 679 683 FT STRAND 687 692 FT TURN 698 699 FT STRAND 706 707 FT TURN 708 709 FT STRAND 710 711 FT STRAND 713 715 FT STRAND 721 722 FT TURN 727 728 FT TURN 732 733 FT STRAND 737 738 FT STRAND 741 741 FT STRAND 744 744 FT STRAND 752 752 FT STRAND 755 755 FT HELIX 758 762 FT TURN 763 763 FT STRAND 766 779 FT STRAND 786 792 FT TURN 794 795 FT TURN 806 807 FT HELIX 810 812 FT STRAND 818 822 FT TURN 824 825 FT STRAND 828 833 FT STRAND 842 843 FT STRAND 847 847 FT STRAND 849 849 FT TURN 853 854 FT TURN 859 859 FT STRAND 860 860 FT TURN 863 864 FT STRAND 869 874 FT STRAND 878 892 SQ SEQUENCE 2301 AA; 256159 MW; 0B6095DF153DBFDF CRC64; MACKHGYPDV CPICTAVDVT PGFEYLLLAD GEWFPTDLLC VDLDDDVFWP SNSSNQSETM EWTDLPLVRD IVMEPQGNAS SSDKSNSQSS GNEGVIINNF YSNQYQNSID LSASGGNAGD APQNNGQLSN ILGGAANAFA TMAPLLLDQN TEEMENLSDR VASDKAGNSA TNTQSTVGRL CGYGEAHHGE HPASCADTAT DKVLAAERYY TIDLASWTTT QEAFSHIRIP LPHVLAGEDG GVFGATLRRH YLCKTGWRVQ VQCNASQFHA GSLLVFMAPE FYTGKGTKTG DMEPTDPFTM DTTWRAPQGA PTGYRYDSRT GFFAMNHQNQ WQWTVYPHQI LNLRTNTTVD LEVPYVNIAP TSSWTQHANW TLVVAVFSPL QYASGSSSDV QITASIQPVN PVFNGLRHET VIAQSPIAVT VREHKGCFYS TNPDTTVPIY GKTISTPNDY MCGEFSDLLE LCKLPTFLGN PNSNNKRYPY FSATNSVPTT SLVDYQVALS CSCMCNSMLA AVARNFNQYR GSLNFLFVFT GAAMVKGKFL IAYTPPGAGK PTTRDQAMQA TYAIWDLGLN SSFVFTAPFI SPTHYRQTSY TSATIASVDG WVTVWQLTPL TYPSGAPVNS DILTLVSAGD DFTLRMPISP TKWAPQGSDN AEKGKVSNDD ASVDFVAEPV KLPENQTRVA FFYDRAVPIG MLRPGQNIES TFVYQENDLR LNCLLLTPLP SFCPDSTSGP VKTKAPVQWR WVRSGGTTNF PLMTKQDYAF LCFSPFTYYK CDLEVTVSAL GTDTVASVLR WAPTGAPADV TDQLIGYTPS LGETRNPHMW LVGAGNTQIS FVVPYNSPLS VLPAAWFNGW SDFGNTKDFG VAPNADFGRL WIQGNTSASV RIRYKKMKVF CPRPTLFFPW PVSTRSKINA DNPVPILELE NPAAFYRIDL FITFIDEFIT FDYKVHGRPV LTFRIPGFGL TPAGRMLVCM GEKPAHGPFT SSRSLYHVIF TATCSSFSFS IYKGRYRSWK KPIHDELVDR GYTTFGEFFR AVRAYHADYY KQRLIHDVEM NPGPVQSVFQ PQGAVLTKSL APQAGIQNLL LRLLGIDGDC SEVSKAITVV TDLFAAWERA KTTLVSPEFW SKLILKTTKF IAASVLYLHN PDFTTTVCLS LMTGVDLLTN DSVFDWLKNK LSSFFRTPPP VCPNVLQPQG PLREANEGFT FAKNIEWAMK TIQSIVNWLT SWFKQEEDHP QSKLDKFLME FPDHCRNIMD MRNGRKAYCE CTASFKYFDE LYNLAVTCKR IPLASLCEKF KNRHDHSVTR PEPVVVVLRG AAGQGKSVTS QIIAQSVSKM AFGRQSVYSM PPDSEYFDGY ENQFSVIMDD LGQNPDGEDF TVFCQMVSST NFLPNMAHLE RKGTPFTSSF IVATTNLPKF RPVTVAHYPA VDRRITFDFT VTAGPHCTTS NGMLDIEKAF DEIPGSKPQL ACFSADCPLL HKRGVMFTCN RTKAVYNLQQ VVKMVNDTIT RKTENVKKMN SLVAQSPPDW EHFENILTCL RQNNAALQDQ LDELQEAFAQ ARERSDFLSD WLKVSAIIFA GIASLSAVIK LASKFKESIW PSPVRVELSE GEQAAYAGRA RAQKQALQVL DIQGGGKVLA QAGNPVMDFE LFCAKNMVAP ITFYYPDKAE VTQSCLLLRA HLFVVNRHVA ETEWTAFKLK DVRHERDTVV TRSVNRSGAE TDLTFIKVTK GPLFKDNVNK FCSNKDDFPA RNDAVTGIMN TGLAFVYSGN FLIGNQPVNT TTGACFNHCL HYRAQTRRGW CGSAVICNVN GKKAVYGMHS AGGGGLAAAT IITRELIEAA EKSMLALEPQ GAIVDISTGS VVHVPRKTKL RRTVAHDVFQ PKFEPAVLSR YDPRTDKDVD VVAFSKHTTN MESLPPVFDI VCDEYANRVF TILGKDNGLL TVEQAVLGLP GMDPMEKDTS PGLPYTQQGL RRTDLLNFNT AKMTPQLDYA HSKLVLGVYD DVVYQSFLKD EIRPLEKIHE AKTRIVDVPP FAHCIWGRQL LGRFASKFQT KPGLELGSAI GTDPDVDWTP YAAELSGFNY VYDVDYSNFD ASHSTAMFEC LIKNFFTEQN GFDRRIAEYL RSLAVSRHAY EDRRVLIRGG LLSGCAATSM LNTIMNNVII RAALYLTYSN FEFDDIKVLS YGDDLLIGTN YQIDFNLVKE RLAPFGYKIT PANKTTTFPL TSHLQDVTFL KRRFVRFNSY LFRPQMDAVN LKAMVSYCKP GTLKEKLMSI ALLAVHSGPD IYDEIFLPFR NVGIVVPTYS SMLYRWLSLF R //