ID POLG_CXA9 STANDARD; PRT; 2201 AA. AC P21404; DT 01-MAY-1991 (Rel. 18, Created) DT 01-AUG-1991 (Rel. 19, Last sequence update) DT 30-MAY-2000 (Rel. 39, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEIN VP4 (P1A); COAT PROTEIN VP2 DE (P1B); COAT PROTEIN VP3 (P1C); COAT PROTEIN VP1 (P1D); CORE PROTEIN DE P2A; CORE PROTEIN P2B; CORE PROTEIN P2C; CORE PROTEIN P3A; GENOME- DE LINKED PROTEIN VPG (P3B); PICORNAIN 3C (EC 3.4.22.28) (PROTEASE 3C) DE (P3C); RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.48) (P3D)]. OS Coxsackievirus A9 (strain Griggs). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Enterovirus. OX NCBI_TaxID=12068; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=90111704; PubMed=2558158; RA Chang K.H., Auvinen P., Hyypiae T., Stanway G.; RT "The nucleotide sequence of coxsackievirus A9; implications for RT receptor binding and enterovirus classification."; RL J. Gen. Virol. 70:3269-3280(1989). RN [2] RP X-RAY CRYSTALLOGRAPHY (2.9 ANGSTROMS) OF 1-870. RX MEDLINE=20113480; PubMed=10647183; RA Hendry E., Hatanaka H., Fry E., Smyth M., Tate J., Stanway G., RA Santti J., Maaronen M., Hyypia T., Stuart D.; RT "The crystal structure of coxsackievirus A9: new insights into the RT uncoating mechanisms of enteroviruses."; RL Structure 7:1527-1538(1999). CC -!- FUNCTION: IT IS THOUGHT THAT THE P2C PROTEIN ATTACHES TO VESICULAR CC MEMBRANES AND IS ASSOCIATED WITH VIRAL RNA SYNTHESIS. CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; D00627; BAA00518.1; -. DR PIR; JQ0523; GNNYA9. DR PDB; 1D4M; 23-DEC-99. DR MEROPS; C03.011; -. DR MEROPS; C03.022; -. DR INTERPRO; IPR000081; -. DR INTERPRO; IPR000199; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR INTERPRO; IPR002527; -. DR PFAM; PF00548; Cys-protease-3C; 1. DR PFAM; PF00947; Pico_P2A; 1. DR PFAM; PF01552; Pico_P2B; 1. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; 3D-structure; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate. FT CHAIN 2 69 COAT PROTEIN VP4. FT CHAIN 70 330 COAT PROTEIN VP2. FT CHAIN 331 568 COAT PROTEIN VP3. FT CHAIN 569 870 COAT PROTEIN VP1. FT CHAIN 871 1017 CORE PROTEIN P2A. FT CHAIN 1018 1116 CORE PROTEIN P2B. FT CHAIN 1117 1445 CORE PROTEIN P2C. FT CHAIN 1446 1534 CORE PROTEIN P3A. FT CHAIN 1535 1556 GENOME-LINKED PROTEIN VPG. FT CHAIN 1557 1739 PICORNAIN 3C. FT CHAIN 1740 2201 RNA-DIRECTED RNA POLYMERASE. FT LIPID 2 2 MYRISTATE. FT ACT_SITE 1703 1703 PROTEASE (POTENTIAL). FT ACT_SITE 1717 1717 PROTEASE (POTENTIAL). FT SITE 858 860 CELL ATTACHMENT SITE. SQ SEQUENCE 2201 AA; 246533 MW; CCEA86F9E80F385F CRC64; MGAQVSTQKT GAHETSLSAA GNSIIHYTNI NYYKDAASNS ANRQDFTQDP SKFTEPVKDV MIKSLPALNS PTVEECGYSD RVRSITLGNS TITTQECANV VVGYGRWPTY LRDDEATAED QPTQPDVATC RFYTLDSIKW EKGSVGWWWK FPEALSDMGL FGQNMQYHYL GRAGYTIHLQ CNASKFHQGC LLVVCVPEAE MGGAVVGQAF SATAMANGDK AYEFTSATQS DQTKVQTAIH NAGMGVGVGN LTIYPHQWIN LRTNNSATIV MPYINSVPMD NMFRHYNFTL MVIPFVKLDY ADTASTYVPI TVTVAPMCAE YNGLRLAQAQ GLPTMNTPGS TQFLTSDDFQ SPCALPQFDV TPSMNIPGEV KNLMEIAEVD SVVPVNNVQD TTDQMEMFRI PVTINAPLQQ QVFGLRLQPG LDSVFKHTLL GEILNYYAHW SGSMKLTFVF CGSAMATGKF LIAYSPPGAN PPKTRKDAML GTHIIWDIGL QSSCVLCVPW ISQTHYRLVQ QDEYTSAGYV TCWYQTGMIV PPGTPNSSSI MCFASACNDF SVRMLRDTPF ISQDNKLQGD VEEAIERARC TVADTMRTGP SNSASVPALT AVETGHTSQV TPSDTMQTRH VKNYHSRSES TVENFLGRSA CVYMEEYKTT DKHVNKKFVA WPINTKQMVQ MRRKLEMFTY LRFDMEVTFV ITSRQDPGTT LAQDMPVLTR QIMYVPPGGP IPAKVDDYAW QTSTNPSIFW TEGNAPARMS IPFISIGNAY SNFYDGWSNF DQRGSYGYNT LNNLGHIYVR HVSGSSPHPI TSTIRVYFKP KHTRAWVPRP PRLCQYKKAF SVDFTPTPIT DTRKDINTVT TVAQSRRRGD MSTLNTHGAF GQQSGAVYVG NYRVINRHLA THTDWQNCVW EDYNRDLLVS TTTAHGCDVI ARCQCTTGVY FCASKNKHYP VSFEGPGLVE VQESEYYPKR YQSHVLLAAG FSEPGDCGGI LRCEHGVIGI VTMGGEGVVG FADVRDLLWL EDDAMEQGVK DYVEQLGNAF GSGFTNQICE QVNLLKESLV GQDSILEKSL KALVKIISAL VIVVRNHDDL ITVTAILALI GCTSSPWRWL KQKVSQYYGI PMAERQNDSW LKKFTEMTNA CKRMEWIAIK IQKFIEWLKV KILPEVREKH EFLNRLKQLP LLESQIATIE QSAPSQSDQE QLFSNVQYFA HYCRKYAPLY AAEAKRVFSL EKKMSNYIQF KSKCRIEPVC LLLHGSPGAG KSVATNLIGR SLAEKLNSSV YSLPPDPDHF DGYKQQAVVI MDDLCQNPDG KDVSLFCQMV SSVDFVPPMA ALEEKGILFT SPFVLASTNA GSINAPTVSD SRALARRFHF DMNIEVISMY SQNGKINMPM SVKTCDEECC PVNFKKCCPL VCGKAIQFID RRTQVRYSLD MLVTEMFREY NHRHSVGATL EALFQGPPIY REIKISVAPE TPPPPVIADL LKSVDSEDVR EYCKEKGWLI PEVNSTLQIE KYVSRAFICL QAITTFVSVA GIIYIIYKLF AGFQGAYTGI PNQKPKVPTL RQAKVQGPAF EFAVAMMKRN SSTVKTEYGE FTMLGIYDRW AVLPRHAKPG PTILMNDQEV GVMDAKELVD KDGTNLELTL LKLNRNEKFR DIRGFLAKEE MEVNEAVLAI NTSKFPNMYI PVGQVTDYGF LNLGGTPTKR MLMYNFPTRA GQCGGVLMST GKVLGIHVGG NGHQGFSAAL LKHYFNDEQG EIEFIESSKD AGFPIINTPS KTKLEPSVFH QVFEGVKEPA VLRNGDPRLK ANFEEAIFSK YIGNVNTHVD EYMLEAVDHY AGQLATLDIS TEPMKLEDAV YGTEGLEALD LTTSAGYPYV ALGIKKRDIL SKKTRDLTKL KECMDKYGLN LPMITYVKDQ LRSAEKVAKG KSRLIEASSL NDSVAMRQTF GNLYKTFHLN PGIVTGSAVG CDPDLFWSKI PVMLNGHLIA FDYSGYDASL SPVWFACLKL LLEKLGYSHK ETNYIDYLCN SHHLYRDKHY FVRGGMPSGC SGTSIFNSMI NNIIIRTLML KVYKGIDLDQ FRMIAYGDDV IASYPWPIDA SLLAEAGKDY GLIMTPADKG ECFNEVTWTN VTFLKRYFRA DEQYPFLVHP VMPMKDIHES IRWTKDPKNT QDHVRSLCLL AWHNGEHEYE EFIRKIRSVP VGRCLTLPAF STLRRKWLDS F //