ID POLG_BOVEV STANDARD; PRT; 2175 AA. AC P12915; DT 01-OCT-1989 (Rel. 12, Created) DT 01-OCT-1989 (Rel. 12, Last sequence update) DT 15-JUL-1999 (Rel. 38, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Bovine enterovirus (strain VG-5-27) (BEV). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Enterovirus. OX NCBI_TaxID=12065; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=88117392; PubMed=2828511; RA Earle J.A.P., Skuce R.A., Fleming C.S., Hoey E.M., Martin S.J.; RT "The complete nucleotide sequence of a bovine enterovirus."; RL J. Gen. Virol. 69:253-263(1988). RN [2] RP X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS) OF 1-840. RX MEDLINE=95292108; PubMed=7773791; RA Smyth M., Tate J., Hoey E.M., Lyons C., Martin S.J., Stuart D.; RT "Implications for viral uncoating from the structure of bovine RT enterovirus."; RL Nat. Struct. Biol. 2:224-231(1995). CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; D00214; BAA24003.1; ALT_SEQ. DR PIR; A29824; GNNYBE. DR PDB; 1BEV; 16-SEP-98. DR MEROPS; C03.001; -. DR MEROPS; C03.020; -. DR INTERPRO; IPR000081; -. DR INTERPRO; IPR000199; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR INTERPRO; IPR002527; -. DR PFAM; PF00548; Cys-protease-3C; 1. DR PFAM; PF00947; Pico_P2A; 1. DR PFAM; PF01552; Pico_P2B; 1. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate; KW 3D-structure. FT CHAIN 2 69 COAT PROTEIN VP4 (P1A). FT CHAIN 70 317 COAT PROTEIN VP2 (P1B). FT CHAIN 318 559 COAT PROTEIN VP3 (P1C). FT CHAIN 560 840 COAT PROTEIN VP1 (P1D). FT CHAIN 841 990 CORE PROTEIN P2A. FT CHAIN 991 1089 CORE PROTEIN P2B. FT CHAIN 1090 1419 CORE PROTEIN P2C. FT CHAIN 1420 1508 CORE PROTEIN P3A. FT CHAIN 1509 1531 GENOME-LINKED PROTEIN VPG (P3B). FT CHAIN 1532 1714 PICORNAIN 3C. FT CHAIN 1715 2175 RNA-DEPENDENT RNA POLYMERASE P3D. FT LIPID 2 2 MYRISTATE. FT ACT_SITE 1678 1678 PROTEASE (POTENTIAL). FT ACT_SITE 1692 1692 PROTEASE (POTENTIAL). SQ SEQUENCE 2175 AA; 242502 MW; 44FCADE8704E48FD CRC64; MGAQLSRNTA GSHTTGTYAT GGSTINYNNI NYYSHAASAA QNKQDFTQDP SKFTQPIADV IKETAVPLKS PSAEACGYSD RVAQLTLGNS TITTQEAANI CVAYGCWPAK LSDTDATSVD KPTEPGVSAD AFYTLRSKPW QADSKGWYWK LPDALNNTGM FGQNAQFHYI YRGGWAVHVQ CNATKFHQGT LLVLAIPEHQ IATQEQPAFD RTMPGSEGGT FQEPFWLEDG TSLGNSLIYP HQWINLRTNN SATLILPYVN AIPMDSAIRH SNWTLAIIPV APLKYAAETT PLVPITVTIA PMETEYNGLR RAIASNQGLP TKPGPGSYQF MTTDEDCSPC ILPDFQPTLE IFIPGKVNNL LEIAQVESIL EANNREGVEG VERYVIPVSV QDALDAQIYA LRLELGGSGP LSSSLLGTLA KHYTQWSGSV EITCMFTGTF MTTGKVLLAY TPPGGDMPRN REEAMLGTHV VWDFGLQSSI TLVIPWISAS HFRGVSNDDV LNYQYYAAGH VTIWYQTNMV IPPGFPNTAG IIMMIAAQPN FSFRIQKDRE DMTQTAILQN DPGKMLKDAI DKQVAGALVA GTTTSTHSVA TDSTPALQAA ETGATSTARD ESMIETRTIV PTHGIHETSV ESFFGRSSLV GMPLLATGTS ITNWRIDFRE FVQLRAKMSW FTYMRFDVEF TIIATSSTGQ NVTTEQHTTY QVMYVPPGAP VPSNQDSFQW QSGCNPSVFA DTDGPPAQFS VPFMSSANAY STVYDGYARF MDTDPDRYGI LPSNFLGFMY FRTLEDAAHQ VRFRICAKIK HTSCWIPRAP RQAPYKKRYN LVFSGDSDRI CSNRASLTSY GPFGQQQGAA YVGSYKILNR HLATYADWEN EVWQSYQRDL LVTRVDAHGC DTIARCNCRS GIYYCKSTAK HYPIVVTPPS IYKIEANDYY PERMQTHILL GIGFAEPGDC GGLLRCEHGV MGILTVGGGD HVGFADVRDL LWIEDDAMEQ GITDYVQQLG NAFGAGFTAE IANYTNQLRD MLMGSDSVVE KIIRSLVRLV SALVIVVRNH QDLITVGATL ALLGCEGSPW KWLKRKVCQI LGINMAERQS DNWMKKFTEM CNAFRGLDWI AAKISKFIDW LKQKILPELK ERAEFVKKLK QLPLLEAQVN TLEHSSASQE RQEQLFGNVQ YLAHHCRKNA PLYAAEAKRV YHLEKRVLGA MQFKTKNRIE PVCALIHGSP GTGQSLATMI VGRKLAEYEG SDVYSLPPDP DHFDGYQQQA VVVMDDLLQN PDGKDMTLFC QMVSTAPFTV PMAALEDKGK LFTSKFVLAS TNAGQVTPPT VADYKALQRR FFFDCDIEVQ KEYKRDGVTL DVAKATETCE DCSPANFKKC MPLICGKALQ LKSRKGDGMR YSLDTLISEL RRESNRRYNI GNVLEALFQG PVCYKPLRIE VHEEEPAPSA ISDLLQAVDS EEVREYCRSK GWIVEERVTE LKLERNVNRA LAVIQSVSLI AAVAGTIYIV YRLFSGMQGP YSGIGTNYAT KKPVVRQVQT QGPLFDFGVS LLKKNIRTVK TGAGEFTALG VYDTVVVLPR HAMPGKTIEM NGKDIEVLDA YDLNDKTDTS LELTIVKLKM NEKFRDIRAM VPDQITDYNE AVVVVNTSYY PQLFTCVGRV KDYGFLNLAG RPTHRVLMYE FPTKAGQCGG VVISMGKIVG VHVGGNGAQG FAASLLRRYF TAEQGQIEYI EKSKDAGYPV INAPTQTKLE PSVFFDVFPG VKEPAVLHKK DKRLETNFEE ALFSKYIGNV QRDMPEELLI AIDHYSEQLK MLNIDPRPIS MEDAIYGTEG LEALDLGTSA SYPYVAMGIK KRDILNKETR DVTKMQECID KYGLNLPMVT YVKDELRAPD KIRKGKSRLI EASSLNDSVA MRCYFGNLYK VFHTNPGTIS GCAVGCDPET FWSKIPVMMD GELFGFDYTA YDASLSPMWF HALAEVLRRI GFVECKHFID QLCCSHHLYM DKHYYVVGGM PSGCSGTSIF NSMINNLIIR TLVLTVYKNI DLDDLKIIAY GDDVLASYPY EIDASLLAEA GKSFGLIMTP PDKSAEFVKL TWDNVTFLKR KFVRDARYPF LVHPVMDMSN IHESIRWTKD PRHTEDHVRS LCLLAWHCGE EEYNEFVTKI RSVPVGRALH LPSFKALERK WYDSF //