ID   POLG_CXA16 STANDARD; PRT; 2193 AA.
AC   Q65900;
DT   01-NOV-1997 (Rel. 35, Created)
DT   01-NOV-1997 (Rel. 35, Last sequence update)
DT   30-MAY-2000 (Rel. 39, Last annotation update)
DE   GENOME POLYPROTEIN [CONTAINS: COAT PROTEIN VP4 (P1A); COAT PROTEIN VP2
DE   (P1B); COAT PROTEIN VP3 (P1C); COAT PROTEIN VP1 (P1D); CORE PROTEIN
DE   P2A; CORE PROTEIN P2B; CORE PROTEIN P2C; CORE PROTEIN P3A; GENOME-
DE   LINKED PROTEIN VPG (P3B); PICORNAIN 3C (EC 3.4.22.28) (PROTEASE 3C)
DE   (P3C); RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.48) (P3D)].
OS   Coxsackievirus A16 (strain G-10).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Enterovirus.
OX   NCBI_TaxID=69159;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=94303216; PubMed=8030260;
RA   Poyry T., Hyypiae T., Horsnell C., Kinnunen L., Hovi T., Stanway G.;
RT   "Molecular analysis of coxsackievirus A16 reveals a new genetic group
RT   of enteroviruses.";
RL   Virology 202:982-987(1994).
CC   -!- FUNCTION: IT IS THOUGHT THAT THE P2C PROTEIN ATTACHES TO VESICULAR
CC       MEMBRANES AND IS ASSOCIATED WITH VIRAL RNA SYNTHESIS.
CC   -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN
CC       Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE.
CC   -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS,
CC       EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2,
CC       VP3, AND VP4.
CC   -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS.
CC   -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; U05876; AAA50478.1; -.
DR   HSSP; P03299; 1POV.
DR   MEROPS; C03.011; -.
DR   MEROPS; C03.022; -.
DR   INTERPRO; IPR000081; -.
DR   INTERPRO; IPR000199; -.
DR   INTERPRO; IPR000605; -.
DR   INTERPRO; IPR001205; -.
DR   INTERPRO; IPR001676; -.
DR   INTERPRO; IPR002527; -.
DR   PFAM; PF00548; Cys-protease-3C; 1.
DR   PFAM; PF00947; Pico_P2A; 1.
DR   PFAM; PF01552; Pico_P2B; 1.
DR   PFAM; PF00680; RNA_dep_RNA_pol; 1.
DR   PFAM; PF00910; RNA_helicase; 1.
DR   PFAM; PF00073; rhv; 3.
KW   Polyprotein; Coat protein; Core protein; Transferase;
KW   RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate.
FT   CHAIN         2     69       COAT PROTEIN VP4.
FT   CHAIN        70    323       COAT PROTEIN VP2.
FT   CHAIN       324    565       COAT PROTEIN VP3.
FT   CHAIN       566    862       COAT PROTEIN VP1.
FT   CHAIN       863   1012       CORE PROTEIN P2A.
FT   CHAIN      1013   1111       CORE PROTEIN P2B.
FT   CHAIN      1112   1440       CORE PROTEIN P2C.
FT   CHAIN      1441   1526       CORE PROTEIN P3A.
FT   CHAIN      1527   1548       GENOME-LINKED PROTEIN VPG.
FT   CHAIN      1549   1731       PICORNAIN 3C.
FT   CHAIN      1732   2193       RNA-DIRECTED RNA POLYMERASE.
FT   LIPID         2      2       MYRISTATE (BY SIMILARITY).
FT   ACT_SITE   1695   1695       PROTEASE (POTENTIAL).
FT   ACT_SITE   1709   1709       PROTEASE (POTENTIAL).
SQ SEQUENCE 2193 AA; 243209 MW; 04B3BCE572A76E38 CRC64; MGSQVSTQRS GSHENSNSAS EGSTINYTTI NYYKDAYAAS AGRQDMSQDP KKFTDPVMDV IHEMAPPLKS PSAEACGYSD RVAQLTIGNS TITTQEAANI IIAYGEWPEY CKDADATAVD KPTRPDVSVN RFFTLDTKSW AKDSKGWYWK FPDVLTEVGV FGQNAQFHYL YRSGFCVHVQ CNASKFHQGA LLVAILPEYV LGTIAGGDGN ENSHPPYVTT QPGQVGAVLT NPYVLDAGVP LSQLTVCPHQ WINLRTNNCA TIIVPYMNTV PFDSALNHCN FGLIVVPVVP LDFNAGATSE IPITVTIAPM CAEFAGLRQA IKQGIPTELK PGTNQFLTTD DGVSAPILPG FHPTPAIHIP GEVRNLLEIC RVETILEVNN LQSNETTPMQ RLCFPVSVQS KTGELCAVFR ADPGRNGPWQ STILGQLCRY YTQWSGSLEV TFMFAGSFMA TGKMLIAYTP PGGGVPADRL TAMLGTHVIW DFGLQSSVTL VIPWISNTHY RAHAKDGYFD YYTTGTITIW YQTNYVVPIG APTTAYIVAL AAAQDNFTMK LCKDTEDIEQ SANIQGDGIA DMIDQAVTSR VGRALTSLQV EPTAANTNAS EHRLGTGLVP ALQAAETGAS SNAQDENLIE TRCVLNHHST QETTIGNFFS RAGLVSIITM PTTGTQNTDG YVNWDIDLMG YAQMRRKCEL FTYMRFDAEF TFVAAKPNGE LVPQLLQYMY VPPGAPKPTS RDSFAWQTAT NPSIFVKLTD PPAQVSVPFM SPASAYQWFY DGYPTFGAHP QSNDADYGQC PNNMMGTFSI RTVGTEKSPH SITLRVYMRI KHVRAWIPRP LRNQPYLFKT NPNYKGNDIK CTSTSRDKIT TLGKFGQQSG AIYVGNYRVV NRHLATHNDW ANLVWEDSSR DLLVSSTTAQ GCDTIARCDC QTGVYYCSSR RKHYPVSFSK PSLIFVEASE YYPARYQSHL MLAVGHSEPG DCGGILRCQH GVVGIVSTGG NGLVGFADVR DLLWLDEEAM EQGVSDYIKG LGDAFGTGFT DAVSREVEAL KNHLIGSEGA VEKILKNLIK LISALVIVIR SDYDMVTLTA TLALIGCHGS PWAWIKAKTA SILGIPIAQK QSASWLKKFN DMANAAKGLE WISNKISKFI DWLKEKIIPA AKEKVEFLNN LKQLPLLENQ ISNLEQSAAS QEDLEAMFGN VSYLAHFCRK FQPLYATEAK RVYALEKRMN NYMQFKSKHR IEPVCLIIRG SPGTGKSLAT GIIARAIADK YHSSVYSLPP DPDHFDGYKQ QVVTVMDDLC QNPDGKDMSL FCQMVSTVDF IPPMASLEEK GVSFTSKFVI ASTNASNIIV PTVSDSDAIR RRFYMDCDIE VTDSYKTDLG RLDAGRAARL CSENNTANFK RCSPLVCGKA IQLRDRKSKV RYSVDTVVSE LIREYNNRYA IGNTIEALFQ GPPKFRPIRI SLEEKPAPDA ISDLLASVDS EEVRQYCRDQ GWIIPETPTN VERHLNRAVL IMQSIATVVA VVSLVYVIYK LFAGFQGAYS GAPKQTLKKP ILRTATVQGP SLDFALSLLR RNIRQVQTDQ GHFTMLGVRD RLAVLPRHSQ PGKTIWVEHK LINILDAVEL VDEQGVNLEL TLVTLDTNEK FRDITKFIPE NISAASDATL VINTEHMPSM FVPVGDVVQY GFLNLSGKPT HRTMMYNFPT KAGQCGGVVT SVGKVIGIHI GGNGRQGFCA GLKRSYFASE QGEIQWVKPN KETGRLNING PTRTKLEPSV 
FHDVFEGNKE PAVLHSRDPR LEVDFEQALF SKYVGNTLHE PDEYIKEAAL HYANQLKQLD INTSQMSMEE ACYGTENLEA IDLHTSAGYP YSALGIKKRD ILDPTTRDVS KMKFYMDKYG LDLPYSTYVK DELRSIDKIK KGKSRLIEAS SLNDSVYLRM AFGHLYETFH ANPGTITGSA VGCNPDTFWS KLPILLPGSL FAFDYSGYDA SLSPVWFRAL ELVLREVGYS EEAVSLIEGI NHTHHVYRNK TYCVLGGMPS GCSGTSIFNS MINNIIIRTL LIKTFKGIDL DELNMVAYGD DVLASYPFPI DCLELARTGK EYGLTMTPAD KSPCFNEVNW GNATFLKRGF LPDEQFPFLI HPTMPMKEIH ESIRWTKDAR NTQDHVRSLC LLAWHNGKQE YEKFVSTIRS VPVGKALAIP NYENLRRNWL ELF //