ID POLG_POL2W STANDARD; PRT; 2205 AA. AC P23069; DT 01-NOV-1991 (Rel. 20, Created) DT 01-NOV-1991 (Rel. 20, Last sequence update) DT 15-DEC-1998 (Rel. 37, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS DE P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C DE (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D DE (EC 2.7.7.48)]. OS Poliovirus type 2 (strain W-2). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Enterovirus. OX NCBI_TaxID=12085; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=90155230; PubMed=2154539; RA Pevear D.C., Oh C.K., Cunningham L.L., Calenoff M., Jubelt B.; RT "Localization of genomic regions specific for the attenuated, mouse- RT adapted poliovirus type 2 strain W-2."; RL J. Gen. Virol. 71:43-52(1990). CC -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN CC Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE. CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; D00625; BAA00516.1; ALT_SEQ. DR PIR; A34032; GNNY2W. DR HSSP; P03299; 1POV. DR MEROPS; C03.001; -. DR MEROPS; C03.020; -. DR INTERPRO; IPR000081; -. DR INTERPRO; IPR000199; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR INTERPRO; IPR002527; -. DR PFAM; PF00548; Cys-protease-3C; 1. DR PFAM; PF00947; Pico_P2A; 1. DR PFAM; PF01552; Pico_P2B; 1. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; Transferase; KW RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate. FT CHAIN 2 69 COAT PROTEIN VP4 (P1A). FT CHAIN 70 340 COAT PROTEIN VP2 (P1B). FT CHAIN 341 578 COAT PROTEIN VP3 (P1C). FT CHAIN 579 879 COAT PROTEIN VP1 (P1D). FT CHAIN 880 1028 PROTEASE 2A. FT CHAIN 1029 1125 CORE PROTEIN 2B. FT CHAIN 1126 1454 CORE PROTEIN 2C. FT CHAIN 1455 1541 CORE PROTEIN 3A. FT CHAIN 1542 1563 GENOME-LINKED PROTEIN VPG. FT CHAIN 1564 1746 PICORNAIN 3C. FT CHAIN 1747 2205 RNA-DIRECTED RNA POLYMERASE 3D. FT LIPID 2 2 MYRISTATE (BY SIMILARITY). FT ACT_SITE 1710 1710 PROTEASE (POTENTIAL). FT ACT_SITE 1724 1724 PROTEASE (POTENTIAL). SQ SEQUENCE 2205 AA; 245701 MW; 2A42AB039E0254AD CRC64; MGAQVSSQKV GAHENSNRAY GGSTINYTTI NYYRDSASNA ASKQDFAQDP SKFTEPIKDV LIKTAPTLNS PNIEACGYSD RVMQLTLGNS TITTQEAANS VVAYGRWPEY IKDSEANPVD QPTEPDVAAC RFYTLDTVTW RKESRGWWWK LPDALKDMGL FGQNMFYHYL GRASYTVHVQ CNASKFHQGA LGVFAVPEMC LAGDSATHML TKYENANPGE KGGEFKGSFT LDTNATNPAR NFCPVDYLFG SGVLAGNAFV YPHQIINLRT NNCATLVLPY VNSLSIDSMT KHNNWGIAIL PLAPLDFATE SSTEIPITLT IAPMCCEFNG LRNITVPRTQ GLPVLNTPGS NQYLTADNYQ SPCAIPEFDV TPPIDIPGEV RNMMELAEID TMIPLNLTSQ RKNTMDMYRV ELNDAAHSDT PILCLSLSPA SDPRLAHTML GEILNYYTHW AGSLKFTFLF CGSMMATGKL LVSYAPPGAK APESRKEAML GTHVIWDIGL QSSCTMVVPW ISNTTYRQTI NDSFTEGGYI SMFYQTRVVV PLSTPRKMDI LGFVSACNDF SVRLLRDTTH ISQEVMPQGL GDLIEGVVEG VTRNALTPLT PVNNLPDTRS SGPAHSKETP ALTAVETGAT NPLVPSDTVQ TRHVIQKRTR SESTVESFFA RGACVAIIEV DNDAPTRRAS KLFSVWKITY KDTVQLRRKL EFFTYSRFDM EFTFVVTSNY TDANNGHALN QVYQIMYIPP GAPIPGKRND YTWQTSSNPS VFYTYGAPPA RISVPYVGIA NAYSHFYDGF AKVPLAGQAS TEGDSLYGAA SLNDFGSLAV RVVNDHNPTK LTSKIRVYMK PKHVRVWCPR PPRAVPYYGP GVDYKDGLTP LPEKGLITYG FGHQNKAVYT AGYKICNYHL ATQEDLQNAI NIMWIRDLLV VESKAQGIDS IARCNCHTGV YYCESRRKYY PVSFVGPTFQ YMEANEYYPA RYQSHMLIGH GFASPGDCGG ILRCQHGVIG IITAGGEGLV AFSDIRDLYA YEVEAMEQGV SNYIESLGAA FGSGFTQQIG NKISELTSMV TSTITEKLLK NLIKIISSLV IITRNYEDTT TVLATLALLG CDASPWQWLK KKACDILEIP YIMRQGDSWL KKFTEACNAA KGLEWVSNKI SKFIDWLKEK IIPQARDKLE FVTKLKQLEM LENQIATIHQ SCPSQEHQEI LFNNVRWLSI QSRRFAPLYA VEAKRIQKLE HTINNYVQFK SKHRIEPVCL LVHGSPGTGK SVATNLIARA IAEKENTSTY SLPPDPSHFD GYKQQGVVIM DDLNQNPDGA DMKLFCQMVS TVEFIPPMAS LEEKGILFTS NYVLASTNSS RITPPTVAHS DALARRFAFD MDIQIMSEYS RDGKLNMAMA TEMCKNCHQP ANFKRCCPLV CGKAIQLMDK SSRVRYSIDQ ITTMIINERN RRSSIGNCME ALFQSPLQYK DLKIDIKTTP PPECINDLLH AVDSQEVRDY CEKKGWIADI TSQVQTERNI NRAMTILQAV TTFAAVAGVV YVMYKLFAGH QGAYTGLPNK RPNVPTIRTA KVQGPGFDYA VAMAKRNILT ATTIKGEFTM LGVHDNVAIL PTHASPGETI VIDGKEVEVL DAKALEDQAG TNLEITIVTL KRNEKFRDIR PHIPTQITET NDGVLIVNTS KYPNMYVPVG AVTEQGYLNL GGRQTARTLM YNFPTRAGQC GGVITCTGKV IGMHVGGNGS HGFAAALKRS YFTQSQGEIQ WMRPSKEVGY PVINAPSKTK LEPSAFHYVF EGVKEPAVLT KSDPRLKTDF EEAIFSKYVG NKITEVDEYM KEAVDHYAGQ LMSLDINTEQ MCLEDAMYGT DGLEALDLST SAGYPYVAMG KKKRDILNKQ TRDTKEMQRL LDTYGINLPL VTYVKDELRS KTKVEQGKSR LIEASSLNDS VAMRMAFGNL YAAFHKNPGV VTGSAVGCDP DLFWSKIPVL MEEKLFDYTG YDASLSPAWF EALKMVLEKI GFGDRVDYID YLNHSHHLYK NKTYCVKGGM PSGCSGTSIF NSMINNLIIR TLLLKTYKGI DLDHLKMIAY GDDVIASYPH EVDASLLAQS GKDYGLTMTP ADKSATFETV TWENVTFLKR FFRADEKYPF LVHPVMPMKE IHESIRWTKD PRNTQDHVRS LCLLAWHSGE EEYNKFLAKI RSVPIGRALL LPEYSTLYRR WLDSF //