ID   POLG_HRV1B     STANDARD;      PRT;  2157 AA.
AC   P12916; Q89704; Q82106; Q82107; Q82108; Q82109; Q82110; Q82111;
AC   Q82112; Q82113; Q82114; Q82115;
DT   01-OCT-1989 (Rel. 12, Created)
DT   01-OCT-1989 (Rel. 12, Last sequence update)
DT   30-MAY-2000 (Rel. 39, Last annotation update)
DE   GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS
DE   P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C
DE   (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D
DE   (EC 2.7.7.48)].
OS   Human rhinovirus 1B (HRV-1B).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Rhinovirus.
OX   NCBI_TaxID=12129;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=88089537; PubMed=2826669;
RA   Hughes P.J., North C., Jellis C.H., Minor P.D., Stanway G.;
RT   "The nucleotide sequence of human rhinovirus 1B: molecular
RT   relationships within the rhinovirus genus.";
RL   J. Gen. Virol. 69:49-58(1988).
CC   -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN
CC       Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE.
CC   -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS,
CC       EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2,
CC       VP3, AND VP4.
CC   -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS.
CC   -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; D00239; BAA00168.1; -.
DR   PIR; A28699; GNNY1B.
DR   HSSP; Q82122; 1AYN.
DR   MEROPS; C03.007; -.
DR   MEROPS; C03.021; -.
DR   INTERPRO; IPR000081; -.
DR   INTERPRO; IPR000199; -.
DR   INTERPRO; IPR000605; -.
DR   INTERPRO; IPR001205; -.
DR   INTERPRO; IPR001676; -.
DR   INTERPRO; IPR002527; -.
DR   PFAM; PF00548; Cys-protease-3C; 1.
DR   PFAM; PF00947; Pico_P2A; 1.
DR   PFAM; PF01552; Pico_P2B; 1.
DR   PFAM; PF00680; RNA_dep_RNA_pol; 1.
DR   PFAM; PF00910; RNA_helicase; 1.
DR   PFAM; PF00073; rhv; 3.
KW   Polyprotein; Coat protein; Core protein; Transferase;
KW   RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate.
FT   CHAIN         2     69       COAT PROTEIN VP4 (P1A).
FT   CHAIN        70    332       COAT PROTEIN VP2 (P1B).
FT   CHAIN       333    570       COAT PROTEIN VP3 (P1C).
FT   CHAIN       571    857       COAT PROTEIN VP1 (P1D).
FT   CHAIN       858    999       CORE PROTEIN P2A.
FT   CHAIN      1000   1094       CORE PROTEIN P2B.
FT   CHAIN      1095   1416       CORE PROTEIN P2C.
FT   CHAIN      1417   1493       CORE PROTEIN P3A.
FT   CHAIN      1494   1514       GENOME-LINKED PROTEIN VPG (P3B).
FT   CHAIN      1515   1697       PICORNAIN 3C.
FT   CHAIN      1698   2157       RNA-DIRECTED RNA POLYMERASE P3D.
FT   LIPID         2      2       MYRISTATE (BY SIMILARITY).
FT   ACT_SITE   1661   1661       PROTEASE (POTENTIAL).
FT   ACT_SITE   1675   1675       PROTEASE (POTENTIAL).
SQ   SEQUENCE   2157 AA;  242313 MW;  42DB649063B677B9 CRC64;
     MGAQVSRQNV GTHSTQNSVS NGSSLNYFNI NYFKDAASSG ASRLDFSQDP SKFTDPVKDV
     LEKGIPTLQS PSVEACGYSD RIIQITRGDS TITSQDVANA VVGYGVWPHY LTPQDATAID
     KPTQPDTSSN RFYTLESKHW NGDSKGWWWK LPDALKEMGI FGENMYYHFL GRSGYTVHVQ
     CNASKFHQGT LLVAMIPEHQ LASAKNGSVT AGYNLTHPGE AGRVVGQQRD ANLRQPSDDS
     WLNFDGTLLG NLLIFPHQFI NLRSNNSATL IVPYVNAVPM DSMLRHNNWS LVIIPISPLR
     SETTSSNIRP ITVSISPMCA EFSGARAKNV RQGLPVYITP GSGQFMTTDD MQSPCALPWY
     HPTKEISIPG EVKNLIEMCQ VDTLIPVNNV GTNVGNISMY TVQLGNQMDM AQEVFAIKVD
     ITSQPLATTL IGEIASYYTH WTGSLRFSFM FCGTANTTLK LLLAYTPPGI DKPATRKDAM
     LGTHVVWDVG LQSTISLVVP WVSASHFRLT ANDKYSMAGY ITCWYQTNLV VPPNTPQTAD
     MLCFVSACKD FCLRMARDTD LHIQSGPIEQ NPVENYIDEV LNEVLVVPNI KESHHTTSNS
     APLLDAAETG HTSNVQPEDA IETRYVMTSQ TRDEMSIESF LGRSGCVHIS RIKVDYNDYN
     GVNKNFTTWK ITLQEMAQIR RKFELFTYVR FDSEVTLVPC IAGRGDDIGH VVMQYMYVPP
     GAPIPKTRND FSWQSGTNMS IFWQHGQPFP RFSLPFLSIA SAYYMFYDGY DGDNSSSKYG
     SIVTNDMGTI CSRIVTEKQE HPVVITTHIY HKAKHTKAWC PRPPRAVPYT HSRVTNYVPK
     TGDVTTAIVP RASMKTVGPS DLYVHVGNLI YRNLHLFNSE MHDSILVSYS SDLIIYRTNT
     TGDDYIPSCN CTEATYYCKH KNRYYPIKVT PHDWYEIQES EYYPKHIQYN LLIGEGPCEP
     GDCGGKLLCR HGVIGIITAG GEGHVAFTDL RQFQCAEEQG ITDYIHMLGE AFGNGFVDSV
     KEQINAINPI NSISKKVIKW LLRIISAMVI IIRNSSDPQT IIATLTLIGC NGSPWRFLKE
     KFCKWTQLTY IHKESDSWLK KFTEMCNAAR GLEWIGNKIS KFIDWMKSML PQAQLKVKYL
     NEIKKLSLLE KQIENLRAAD NATQEKIKCE IDTLHDLSCK FLPLYAHEAK RIKVLYNKCS
     NIIKQRKRSE PVAVMIHGPP GTGKSITTNF LARMITNESD VYSLPPDPKY FDGYDNQSVV
     IMDDIMQNPD GEDMTLFCQM VSSVTFIPPM ADLPDKGKPF DSRFVLCSTN HSLLAPPTIS
     SLPAMNRRFF FDLDIVVHDN YKDAQGKLNV SKAFQPCNVN TKIGNAKCCP FVCGKAVSFK
     DRSTCSTYTL AQVYNHILEE DKRRRQVVDV MSAIFQGPIS LDAPPPPAIA DLLQSVRTPE
     VIKYCQDNKW IVPAECQIER DLSIANSIIT IIANIISIAG IIFVIYKLFC TLQGPYSGEP
     KPKTKMPERR VVAQGPEEEF GRSILKNNTC VITTDNGKFT GLGIYDRTLI IPTHADPGRE
     VQVNGIHTKV LDSYDLYNRD GVKLEITVIQ LDRNEKFRDI RKYIPETEDD YPECNLALSA
     NQVEPTIIKV GDVVSYGNIL LSGNQTARML KYNYPTKSGY CGGVLYKIGQ ILGIHVGGNG
     RDGFSAMLLR SYFTDTQGQI KISKHANECG LPTIHTPSKT KLQPSVFYDV FPGSKEPAVS
     RDNDPRLKVN FKEALFSKYK GNTECSLNQH MEIAIAHYSA QLITLDIDSK PIALEDSVFG
     IEGLEALDLN TSAGFPYVTM GIKKRDLINN KTKDISRLKE ALDKYGVDLP MITFLKDELR
     KKEKISAGKT RVIEASSIND TILFRTTFGN LFSKFHLNPG VVTGSAVGCD PETFWSKIPV
     MLDGDCIMAF DYTNYDGSIH PVWFQALKKV LENLSFQSNL IDRLCYSKHL FKSTYYEVAG
     GVPSGCSGTS IFNTMINNII IRTLVLDAYK NIDLDKLKII AYGDDVIFSY KYTLDMEAIA
     NEGKKYGLTI TPADKSTEFK KLDYNNVTFL KRGFKQDEKH TFLIHPTFPV EEIYESIRWT
     KKPSQMQEHV LSLCHLMWHN GRKVYEDFSS KIRSVSAGRA LYIPPYDLLK HEWYEKF
//