ID POLG_HRV16 STANDARD; PRT; 2153 AA. AC Q82122; DT 15-JUL-1998 (Rel. 36, Created) DT 15-DEC-1998 (Rel. 37, Last sequence update) DT 15-DEC-1998 (Rel. 37, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Human rhinovirus 16 (HRV-16). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Rhinovirus. OX NCBI_TaxID=31708; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=95250310; PubMed=7732663; RA Lee W.M., Wang W., Rueckert R.R.; RT "Complete sequence of the RNA genome of human rhinovirus 16, a RT clinically useful common cold virus belonging to the ICAM-1 receptor RT group."; RL Virus Genes 9:177-181(1995). RN [2] RP X-RAY CRYSTALLOGRAPHY (3.5 ANGSTROMS) OF 2-853. RX MEDLINE=94348864; PubMed=7915182; RA Oliveira M.A., Zhao R., Lee W.M., Kremer M.J., Minor I., RA Rueckert R.R., Diana G.D., Pevear D.C., Dutko F.J., McKinlay M.A., RA Rossmann M.G.; RT "The structure of human rhinovirus 16."; RL Structure 1:51-68(1993). RN [3] RP X-RAY CRYSTALLOGRAPHY (2.15 ANGSTROMS) OF 2-853, AND REV. TO 547-548. RX MEDLINE=97238938; PubMed=9083115; RA Hadfield A.T., Lee W.M., Zhao R., Oliveira M.A., Minor I., RA Rueckert R.R., Rossmann M.G.; RT "The refined structure of human rhinovirus 16 at 2.15-A resolution: RT implications for the viral life cycle."; RL Structure 5:427-441(1997). CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; L24917; AAA69862.1; -. DR PDB; 1AYN; 21-JAN-98. DR PDB; 1AYM; 21-JAN-98. DR MEROPS; C03.007; -. DR INTERPRO; IPR000081; -. DR INTERPRO; IPR000199; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR INTERPRO; IPR002527; -. DR PFAM; PF00548; Cys-protease-3C; 1. DR PFAM; PF00947; Pico_P2A; 1. DR PFAM; PF01552; Pico_P2B; 1. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate; KW 3D-structure. FT CHAIN 2 69 COAT PROTEIN VP4 (P1A). FT CHAIN 70 330 COAT PROTEIN VP2 (P1B). FT CHAIN 331 568 COAT PROTEIN VP3 (P1C). FT CHAIN 569 853 COAT PROTEIN VP1 (P1D). FT CHAIN 854 995 CORE PROTEIN P2A. FT CHAIN 996 1090 CORE PROTEIN P2B. FT CHAIN 1091 1412 CORE PROTEIN P2C. FT CHAIN 1413 1489 CORE PROTEIN P3A. FT CHAIN 1490 1510 GENOME-LINKED PROTEIN VPG (P3B). FT CHAIN 1511 1693 PICORNAIN 3C. FT CHAIN 1694 2153 RNA-DIRECTED RNA POLYMERASE P3D. FT LIPID 2 2 MYRISTATE. FT ACT_SITE 1657 1657 PROTEASE (POTENTIAL). FT ACT_SITE 1671 1671 PROTEASE (POTENTIAL). FT CONFLICT 547 548 KD -> NH (IN REF. 1). SQ SEQUENCE 2153 AA; 242242 MW; 6B11D0D93DF11C04 CRC64; MGAQVSRQNV GTHSTQNMVS NGSSLNYFNI NYFKDAASSG ASRLDFSQDP SKFTDPVKDV LEKGIPTLQS PSVEACGYSD RIIQITRGDS TITSQDVANA VVGYGVWPHY LTPQDATAID KPTQPDTSSN RFYTLDSKMW NSTSKGWWWK LPDALKDMGI FGENMFYHFL GRSGYTVHVQ CNASKFHQGT LLVVMIPEHQ LATVNKGNVN AGYKYTHPGE AGREVGTQVE NEKQPSDDNW LNFDGTLLGN LLIFPHQFIN LRSNNSATLI VPYVNAVPMD SMVRHNNWSL VIIPVCQLQS NNISNIVPIT VSISPMCAEF SGARAKTVVQ GLPVYVTPGS GQFMTTDDMQ SPCALPWYHP TKEIFIPGEV KNLIEMCQVD TLIPINSTQS NIGNVSMYTV TLSPQTKLAE EIFAIKVDIA SHPLATTLIG EIASYFTHWT GSLRFSFMFC GTANTTLKVL LAYTPPGIGK PRSRKEAMLG THVVWDVGLQ STVSLVVPWI SASQYRFTTP DTYSSAGYIT CWYQTNFVVP PNTPNTAEML CFVSGCKDFC LRMARDTDLH KQTGPITQNP VERYVDEVLN EVLVVPNINQ SHPTTSNAAP VLDAAETGHT NKIQPEDTIE TRYVQSSQTL DEMSVESFLG RSGCIHESVL DIVDNYNDQS FTKWNINLQE MAQIRRKFEM FTYARFDSEI TMVPSVAAKD GHIGHIVMQY MYVPPGAPIP TTRDDYAWQS GTNASVFWQH GQPFPRFSLP FLSIASAYYM FYDGYDGDTY KSRYGTVVTN DMGTLCSRIV TSEQLHKVKV VTRIYHKAKH TKAWCPRPPR AVQYSHTHTT NYKLSSEVHN DVAIRPRTNL TTVGPSDMYV HVGNLIYRNL HLFNSDIHDS ILVSYSSDLI IYRTSTQGDG YIPTCNCTEA TYYCKHKNRY YPINVTPHDW YEIQESEYYP KHIQYNLLIG EGPCEPGDCG GKLLCKHGVI GIITAGGEGH VAFIDLRHFH CAEEQGITDY IHMLGEAFGS GFVDSVKDQI NSINPINNIS SKMVKWMLRI ISAMVIIIRN SSDPQTIIAT LTLIGCNGSP WRFLKEKFCK WTQLTYIHKE SDSWLKKFTE MCNAARGLEW IGNKISKFID WMKSMLPQAQ LKVKYLSELK KLNFLEKQVE NLRAADTNTQ EKIKCEIDTL HDLSCKFLPL YASEAKRIKV LYHKCTNIIK QKKRSEPVAV MIHGPPGTGK SITTSFLARM ITNESDIYSL PPDPKYFDGY DNQSVVIMDD IMQNPGGEDM TLFCQMVSSV TFIPPMADLP DKGKPFDSRF VLCSTNHSLL APPTISSLPA MNRRFYLDLD ILVHDNYKDN QGKLDVSRAF RLCDVDSKIG NAKCCPFVCG KAVTFKDRNT CRTYSLSQIY NQILEEDKRR RQVVDVMSAI FQGPISMDKP PPPAITDLLR SVRTPEVIKY CQDNKWIVPA DCQIERDLNI ANSIITIIAN IISIAGIIYI IYKLFCSLQG PYSGEPKPKT KVPERRVVAQ GPEEEFGMSI IKNNTCVVTT TNGKFTGLGI YDRILILPTH ADPGSEIQVN GIHTKVLDSY DLFNKEGVKL EITVLKLDRN EKFRDIRKYI PESEDDYPEC NLALVANQTE PTIIKVGDVV SYGNILLSGT QTARMLKYNY PTKSGYCGGV LYKIGQILGI HVGGNGRDGF SSMLLRSYFT EQQGQIQISK HVKDVGLPSI HTPTKTKLQP SVFYDIFPGS KEPAVLTEKD PRLKVDFDSA LFSKYKGNTE CSLNEHIQVA VAHYSAQLAT LDIDPQPIAM EDSVFGMDGL EALDLNTSAG YPYVTLGIKK KDLINNKTKD ISKLKLALDK YDVDLPMITF LKDELRKKDK IAAGKTRVIE ASSINDTILF RTVYGNLFSK FHLNPGVVTG CAVGCDPETF WSKIPLMLDG DCIMAFDYTN YDGSIHPIWF KALGMVLDNL SFNPTLINRL CNSKHIFKST YYEVEGGVPS GCSGTSIFNS MINNIIIRTL VLDAYKHIDL DKLKIIAYGD DVIFSYKYKL DMEAIAKEGQ KYGLTITPAD KSSEFKELDY GNVTFLKRGF RQDDKYKFLI HPTFPVEEIY ESIRWTKKPS QMQEHVLSLC HLMWHNGPEI YKDFETKIRS VSAGRALYIP PYELLRHEWY EKF //