ID   POLG_HRV14     STANDARD;      PRT;  2179 AA.
AC   P03303; Q82083; Q82123; Q84736; Q84737; Q84738; Q84739; Q84740;
AC   Q84741; Q89441; Q89763; Q89883; Q84774; Q84775; Q84776; Q84777;
AC   Q89649; Q84778; Q84779;
DT   21-JUL-1986 (Rel. 01, Created)
DT   21-JUL-1986 (Rel. 01, Last sequence update)
DT   30-MAY-2000 (Rel. 39, Last annotation update)
DE   GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS
DE   P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C
DE   (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D
DE   (EC 2.7.7.48)].
OS   Human rhinovirus 14 (HRV-14).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Rhinovirus.
OX   NCBI_TaxID=12131;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=85037949; PubMed=6093056;
RA   Stanway G., Hughes P.J., Mountford R.C., Minor P.D., Almond J.W.;
RT   "The complete nucleotide sequence of a common cold virus: human
RT   rhinovirus 14.";
RL   Nucleic Acids Res. 12:7859-7875(1984).
RN   [2]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=93188162; PubMed=8383233;
RA   Lee W.M., Monroe S., Rueckert R.R.;
RT   "Role of maturation cleavage in infectivity of picornaviruses:
RT   activation of an infectosome.";
RL   J. Virol. 67:2110-2122(1993).
RN   [3]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=85140171; PubMed=2983312;
RA   Callahan P.L., Mizutani S., Colonno R.J.;
RT   "Molecular cloning and complete sequence determination of RNA genome
RT   of human rhinovirus type 14.";
RL   Proc. Natl. Acad. Sci. U.S.A. 82:732-736(1985).
RN   [4]
RP   X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS).
RX   MEDLINE=85296372; PubMed=2993920;
RA   Rossman M.G., Arnold E., Erickson J.W., Frankenberger E.A.,
RA   Griffith J.P., Hecht H.-J., Johnson J.E., Kamer G., Luo M.,
RA   Mosser A.G., Rueckert R.R., Sherry B., Vriend G.;
RT   "Structure of a human common cold virus and functional relationship to
RT   other picornaviruses.";
RL   Nature 317:145-153(1985).
RN   [5]
RP   X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS).
RA   Arnold E., Rossman M.G.;
RT   "The use of molecular-replacement phases for the refinement of the
RT   human rhinovirus 14 structure.";
RL   Acta Crystallogr. A 44:270-282(1988).
RN   [6]
RP   X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS).
RX   MEDLINE=90189144; PubMed=2156077;
RA   Arnold E., Rossman M.G.;
RT   "Analysis of the structure of a common cold virus, human rhinovirus
RT   14, refined at a resolution of 3.0 A.";
RL   J. Mol. Biol. 211:763-801(1990).
CC   -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN
CC       Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE.
CC   -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS,
CC       EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2,
CC       VP3, AND VP4.
CC   -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS.
CC   -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3.
CC   -!- CAUTION: THE PDB DATA BANK CONTAINS THE 3D-STRUCTURE COORDINATE
CC       OF PROTEINS VP1, VP2, VP3 AND VP4.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; K02121; AAA45756.1; -.
DR   EMBL; X01087; CAA25565.1; -.
DR   EMBL; L05355; AAA45758.1; -.
DR   PIR; A03901; GNNYH4.
DR   PDB; 4RHV; 15-OCT-94.
DR   PDB; 1RMU; 15-OCT-94.
DR   PDB; 2RMU; 15-OCT-94.
DR   PDB; 2RM2; 15-OCT-94.
DR   PDB; 2R04; 15-OCT-94.
DR   PDB; 2R06; 15-OCT-94.
DR   PDB; 2R07; 15-OCT-94.
DR   PDB; 1R08; 15-OCT-94.
DR   PDB; 1R09; 15-OCT-94.
DR   PDB; 2RR1; 15-OCT-94.
DR   PDB; 2RS1; 15-OCT-94.
DR   PDB; 2RS3; 15-OCT-94.
DR   PDB; 2RS5; 15-OCT-94.
DR   PDB; 1HRI; 15-OCT-94.
DR   PDB; 2HWB; 01-NOV-94.
DR   PDB; 2HWC; 01-NOV-94.
DR   PDB; 1HRV; 03-JUN-95.
DR   PDB; 1RUC; 14-NOV-95.
DR   PDB; 1RUD; 14-NOV-95.
DR   PDB; 1RUE; 14-NOV-95.
DR   PDB; 1RUF; 14-NOV-95.
DR   PDB; 1RUG; 14-NOV-95.
DR   PDB; 1RUH; 14-NOV-95.
DR   PDB; 1RUI; 14-NOV-95.
DR   PDB; 1RUJ; 14-NOV-95.
DR   PDB; 1RVF; 25-FEB-98.
DR   PDB; 1VRH; 12-FEB-97.
DR   MEROPS; C03.007; -.
DR   MEROPS; C03.021; -.
DR   INTERPRO; IPR000081; -.
DR   INTERPRO; IPR000199; -.
DR   INTERPRO; IPR000605; -.
DR   INTERPRO; IPR001205; -.
DR   INTERPRO; IPR001676; -.
DR   INTERPRO; IPR002527; -.
DR   PFAM; PF00548; Cys-protease-3C; 1.
DR   PFAM; PF00947; Pico_P2A; 1.
DR   PFAM; PF01552; Pico_P2B; 1.
DR   PFAM; PF00680; RNA_dep_RNA_pol; 1.
DR   PFAM; PF00910; RNA_helicase; 1.
DR   PFAM; PF00073; rhv; 3.
KW   Polyprotein; Coat protein; Core protein; Transferase;
KW   RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate;
KW   3D-structure.
FT   CHAIN         2     69       COAT PROTEIN VP4 (P1A).
FT   CHAIN        70    331       COAT PROTEIN VP2 (P1B).
FT   CHAIN       332    567       COAT PROTEIN VP3 (P1C).
FT   CHAIN       568    856       COAT PROTEIN VP1 (P1D).
FT   CHAIN       857   1002       CORE PROTEIN P2A.
FT   CHAIN      1003   1099       CORE PROTEIN P2B.
FT   CHAIN      1100   1429       CORE PROTEIN P2C.
FT   CHAIN      1430   1514       CORE PROTEIN P3A.
FT   CHAIN      1515   1537       GENOME-LINKED PROTEIN VPG (P3B).
FT   CHAIN      1538   1719       PICORNAIN 3C.
FT   CHAIN      1720   2179       RNA-DIRECTED RNA POLYMERASE P3D.
FT   LIPID         2      2       MYRISTATE (BY SIMILARITY).
FT   ACT_SITE   1683   1683       PROTEASE (POTENTIAL).
FT   ACT_SITE   1697   1697       PROTEASE (POTENTIAL).
FT   CONFLICT    368    368       P -> L (IN REF. 3).
FT   CONFLICT    459    459       I -> T (IN REF. 3).
FT   CONFLICT    722    722       P -> H (IN REF. 3).
FT   CONFLICT    726    727       NP -> KS (IN REF. 3).
FT   CONFLICT    729    731       EWD -> RVG (IN REF. 3).
FT   CONFLICT    913    913       C -> R (IN REF. 3).
FT   CONFLICT    942    942       N -> S (IN REF. 3).
FT   CONFLICT    962    962       P -> L (IN REF. 3).
FT   CONFLICT    982    982       G -> E (IN REF. 3).
FT   CONFLICT   1193   1193       L -> F (IN REF. 3).
FT   CONFLICT   1193   1193       L -> H (IN REF. 2).
FT   CONFLICT   1220   1220       I -> T (IN REF. 2 AND 3).
FT   CONFLICT   1399   1399       I -> V (IN REF. 2 AND 3).
FT   CONFLICT   1446   1446       P -> S (IN REF. 3).
FT   CONFLICT   1739   1739       P -> A (IN REF. 3).
FT   HELIX        36     38
FT   TURN         50     50
FT   HELIX        51     54
FT   STRAND       57     57
FT   TURN         63     64
FT   STRAND       83     87
FT   TURN         88     89
FT   STRAND       90     94
FT   STRAND      101    102
FT   HELIX       103    105
FT   HELIX       113    115
FT   STRAND      123    123
FT   HELIX       126    128
FT   TURN        129    129
FT   STRAND      133    134
FT   STRAND      138    141
FT   TURN        142    143
FT   STRAND      147    151
FT   TURN        152    152
FT   HELIX       153    155
FT   TURN        156    157
FT   HELIX       159    167
FT   STRAND      168    180
FT   TURN        185    186
FT   STRAND      188    197
FT   TURN        198    198
FT   STRAND      203    203
FT   TURN        204    205
FT   TURN        207    208
FT   HELIX       213    216
FT   HELIX       219    221
FT   STRAND      223    224
FT   TURN        225    226
FT   TURN        231    232
FT   STRAND      234    234
FT   HELIX       238    240
FT   TURN        241    244
FT   STRAND      245    245
FT   HELIX       247    252
FT   STRAND      255    259
FT   TURN        260    262
FT   STRAND      265    270
FT   STRAND      279    279
FT   TURN        281    283
FT   STRAND      284    284
FT   STRAND      287    298
FT   TURN        301    302
FT   STRAND      306    323
FT   TURN        339    342
FT   TURN        346    347
FT   STRAND      354    354
FT   TURN        357    358
FT   STRAND      370    371
FT   STRAND      373    373
FT   HELIX       374    377
FT   TURN        378    379
FT   STRAND      382    383
FT   TURN        386    389
FT   HELIX       395    397
FT   STRAND      399    404
FT   TURN        405    405
FT   STRAND      410    415
FT   TURN        418    419
FT   TURN        421    422
FT   HELIX       423    425
FT   HELIX       427    432
FT   TURN        433    434
FT   STRAND      435    439
FT   STRAND      442    448
FT   TURN        452    453
FT   STRAND      455    455
FT   STRAND      457    463
FT   TURN        465    466
FT   HELIX       473    477
FT   TURN        478    478
FT   STRAND      480    485
FT   STRAND      491    496
FT   STRAND      505    506
FT   TURN        512    513
FT   STRAND      517    522
FT   STRAND      526    527
FT   TURN        530    531
FT   STRAND      535    544
FT   TURN        546    547
FT   STRAND      549    553
FT   TURN        557    558
FT   STRAND      585    587
FT   STRAND      590    590
FT   TURN        599    600
FT   STRAND      601    602
FT   HELIX       604    606
FT   HELIX       614    616
FT   TURN        617    617
FT   STRAND      620    620
FT   STRAND      623    624
FT   HELIX       630    632
FT   STRAND      633    633
FT   HELIX       634    638
FT   STRAND      642    651
FT   TURN        655    656
FT   TURN        660    660
FT   HELIX       661    663
FT   TURN        664    664
FT   STRAND      666    670
FT   HELIX       677    683
FT   TURN        684    685
FT   STRAND      686    702
FT   STRAND      714    719
FT   TURN        722    723
FT   TURN        730    731
FT   HELIX       733    736
FT   STRAND      742    746
FT   TURN        747    748
FT   STRAND      750    755
FT   STRAND      764    765
FT   STRAND      770    770
FT   STRAND      790    795
FT   STRAND      804    822
FT   STRAND      826    826
FT   TURN        833    834
FT   TURN        853    854
SQ   SEQUENCE   2179 AA;  242989 MW;  827201A3032F0285 CRC64;
     MGAQVSTQKS GSHENQNILT NGSNQTFTVI NYYKDAASTS SAGQSLSMDP SKFTEPVKDL
     MLKGAPALNS PNVEACGYSD RVQQITLGNS TITTQEAANA VVCYAEWPEY LPDVDASDVN
     KTSKPDTSVC RFYTLDSKTW TTGSKGWCWK LPDALKDMGV FGQNMFFHSL GRSGYTVHVQ
     CNATKFHSGC LLVVVIPEHQ LASHEGGNVS VKYTFTHPGE RGIDLSSANE VGGPVKDVIY
     NMNGTLLGNL LIFPHQFINL RTNNTATIVI PYINSVPIDS MTRHNNVSLM VIPIAPLTVP
     TGATPSLPIT VTIAPMCTEF SGIRSKSIVP QGLPTTTLPG SGQFLTTDDR QSPSALPNYE
     PTPRIHIPGK VHNLLEIIQV DTLIPMNNTH TKDEVNSYLI PLNANRQNEQ VFGTNLFIGD
     GVFKTTLLGE IVQYYTHWSG SLRFSLMYTG PALSSAKLIL AYTPPGARGP QDRREAMLGT
     HVVWDIGLQS TIVMTIPWTS GVQFRYTDPD TYTSAGFLSC WYQTSLILPP ETTGQVYLLS
     FISACPDFKL RLMKDTQTIS QTVALTEGLG DELEEVIVEK TKQTVASISS GPKHTQKVPI
     LTANETGATM PVLPSDSIET RTTYMHFNGS ETDVECFLGR AACVHVTEIQ NKDATGIDNH
     REAKLFNDWK INLSSLVQLR KKLELFTYVR FDSEYTILAT ASQPDSANYS SNLVVQAMYV
     PPGAPNPKEW DDYTWQSASN PSVFFKVGDT SRFSVPYVGL ASAYNCFYDG YSHDDAETQY
     GITVLNHMGS MAFRIVNEHD EHKTLVKIRV YHRAKHVEAW IPRAPRALPY TSIGRTNYPK
     NTEPVIKKRK GDIKSYGLGP RYGGIYTSNV KIMNYHLMTP EDHHNLIAPY PNRDLAIVST
     GGHGAETIPH CNCTSGVYYS TYYRKYYPII CEKPTNIWIE GNPYYPSRFQ AGVMKGVGPA
     EPGDCGGILR CIHGPIGLLT AGGSGYVCFA DIRQLECIAE EQGLSDYITG LGRAFGVGFT
     DQISTKVTEL QEVAKDFLTT KVLSKVVKMV SALVIICRNH DDLVTVTATL ALLGCDGSPW
     RFLKMYISKH FQVPYIERQA NDGWFRKFND ACNAAKGLEW IANKISKLIE WIKNKVLPQA
     KEKLEFCSKL KQLDILERQI TTMHISNPTQ EKREQLFNNV LWLEQMSQKF APLYAVESKR
     IRELKNKMVN YMQFKSKQRI EPVCVLIHGT PGSGKSLTTS IVGRAIAEHF NSAVYSLPPD
     PKHFDGYQQQ EVVIMDDLNQ NPDGQDISMF CQMVSSVDFL PPMASLDNKG MLFTSNFVLA
     STNSNTLSPP TILNPEALVR RFGFDLDICL HTTYTKNGKL NAGMSTKTCK DCHQPSNFKK
     CCPLVCGKAI SLVDRTTNIR YSVDQLVTAI ISDFKSKMQI TDSLETLFQG PVYKDLEIDV
     CNTPPPECIN DLLKSVDSEE IREYCKKKKW IIPEIPTNIE RAMNQASMII NTILMFVSTL
     GIVYVIYKLF AQTQGPYSGN PPHNKLKAPT LRPVVVQGPN TEFALSLLRK NIMTITTSKG
     EFTGLGIHDR VCVIPTHAQP GDDVLVNGQK IRVKDKYKLV DPENINLELT VLTLDRNEKF
     RDIRGFISED LEGVDATLVV HSNNFTNTIL EVGPVTMAGL INLSSTPTNR MIRYDYATKT
     GQCGGVLCAT GKIFGIHVGG NGRQGFSAQL KKQYFVEKQG QVIARHKVRE FNINPVNTPT
     KSKLHPSVFY DVFPGDKEPA VLSDNDPRLE VKLTESLFSK YKGNVNTEPT ENMLVAVDHY
     AGQLLSLDIP TSELTLKEAL YGVDGLEPID ITTSAGFPYV SLGIKKRDIL NKETQDTEKM
     KFYLDKYGID LPLVTYIKDE LRSVDKVRLG KSRLIEASSL NDSVNMRMKL GNLYKAFHQN
     PGVLTGSAVG CDPDVFWSVI PCLMDGHLMA FDYSNFDASL SPVWFVCLEK VLTKLGFAGS
     SLIQSICNTH HIFRDEIYVV EGGMPSGCSG TSIFNSMINN IIIRTLILDA YKGIDLDKLK
     ILAYGDDLIV SYPYELDPQV LATLGKNYGL TITPPDKSET FTKMTWENLT FLKRYFKPDQ
     QFPFLVHPVM PMKDIHESIR WTKDPKNTQD HVRSLCMLAW HSGEKEYNEF IQKIRTTDIG
     KCLILPEYSV LRRRWLDLF
//