ID   POLG_POL1S     STANDARD;      PRT;  2209 AA.
AC   P03301; Q84881; Q84882; Q84883; Q84884; Q84885; Q84886; Q84887;
AC   Q84888; Q84889; Q84890;
DT   21-JUL-1986 (Rel. 01, Created)
DT   21-JUL-1986 (Rel. 01, Last sequence update)
DT   30-MAY-2000 (Rel. 39, Last annotation update)
DE   GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS
DE   P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C
DE   (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D
DE   (EC 2.7.7.48)].
OS   Poliovirus type 1 (strain Sabin).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Enterovirus.
OX   NCBI_TaxID=12082;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=83299876; PubMed=6310545;
RA   Nomoto A., Omata T., Toyoda H., Kuge S., Horie H., Kataoka Y.,
RA   Genba Y., Nakano Y., Imura N.;
RT   "Complete nucleotide sequence of the attenuated poliovirus Sabin 1
RT   strain genome.";
RL   Proc. Natl. Acad. Sci. U.S.A. 79:5793-5797(1982).
CC   -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN
CC       Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE.
CC   -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS,
CC       EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2,
CC       VP3, AND VP4.
CC   -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS.
CC   -!- MISCELLANEOUS: THIS VIRUS IS A LIVE VACCINE STRAIN DERIVED FROM
CC       THE MAHONEY STRAIN BY SPONTANEOUS MUTATIONS DURING THE ATTENUATION
CC       PROCESS.
CC   -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; V01150; CAA24465.1; -.
DR   PIR; A03899; GNNY3P.
DR   HSSP; P03299; 1POV.
DR   MEROPS; C03.001; -.
DR   MEROPS; C03.020; -.
DR   INTERPRO; IPR000081; -.
DR   INTERPRO; IPR000199; -.
DR   INTERPRO; IPR000605; -.
DR   INTERPRO; IPR001205; -.
DR   INTERPRO; IPR001676; -.
DR   INTERPRO; IPR002527; -.
DR   PFAM; PF00548; Cys-protease-3C; 1.
DR   PFAM; PF00947; Pico_P2A; 1.
DR   PFAM; PF01552; Pico_P2B; 1.
DR   PFAM; PF00680; RNA_dep_RNA_pol; 1.
DR   PFAM; PF00910; RNA_helicase; 1.
DR   PFAM; PF00073; rhv; 3.
KW   Polyprotein; Coat protein; Core protein; Transferase;
KW   RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate.
FT   CHAIN         2     69       COAT PROTEIN VP4.
FT   CHAIN        70    341       COAT PROTEIN VP2.
FT   CHAIN       342    579       COAT PROTEIN VP3.
FT   CHAIN       580    881       COAT PROTEIN VP1.
FT   CHAIN       882   1030       CORE PROTEIN P2A.
FT   CHAIN      1031   1127       CORE PROTEIN P2B.
FT   CHAIN      1128   1456       CORE PROTEIN P2C.
FT   CHAIN      1457   1543       CORE PROTEIN P3A.
FT   CHAIN      1544   1565       GENOME-LINKED PROTEIN VPG.
FT   CHAIN      1566   1747       PICORNAIN 3C.
FT   CHAIN      1748   2209       RNA-DIRECTED RNA POLYMERASE P3D.
FT   LIPID         2      2       MYRISTATE.
FT   ACT_SITE   1712   1712       PROTEASE (POTENTIAL).
FT   ACT_SITE   1726   1726       PROTEASE (POTENTIAL).
SQ   SEQUENCE   2209 AA;  246576 MW;  9EC1EF4D174A28A4 CRC64;
     MGAQVSSQKV GAHENSNRAY GGSTINYTTI NYYRDSASNA ASKQDFSQDP SKFTEPIKDV
     LIKTSPMLNS PNIEACGYSD RVLQLTLGNS TITTQEAANS VVAYGRWPEY LRDSEANPVD
     QPTEPDVAAC RFYTLDTVSW TKESRGWWWK LPDALRDMGL FGQNMYYHYL GRSGYTVHVQ
     CNASKFHQGA LGVFAVPEMC LAGDSNTTTM HTSYQNANPG EKGGTFTGTF TPDDNQTSPA
     RRFCPVDYLF GNGTLLGNAF VFPHQIINLR TNNCATLVLP YVNSLSIDSM VKHNNWGIAI
     LPLAPLNFAS ESSPEIPITL TIAPMCCEFN GLRNITLPRL QGLPVMNTPG SNQYLTADNF
     QSPCALPEFD VTPPIDIPGE VKNMMELAEI DTMIPFDLSA KKKNTMEMYR VRLSDKPHTD
     DPILCLSLSP ASDPRLSHTM LGEILNYYTH WAGSLKFTFL FCGSMMATGK LLVSYAPPGA
     DPPKKRKEAM LGTHVIWDIG LQSSCTMVVP WISNTTYRQT IDDSFTEGGY ISVFYQTRIV
     VPLSTPREMD ILGFVSACND FSVRLMRDTT HIEQKALAQG LGQMLESMID NTVRETVGAA
     TSRDALPNTE ASGPAHSKEI PALTAVETGA TNPLVPSDTV QTRHVVQHRS RSESSIESFF
     ARGACVAIIT VDNSASTKNK DKLFTVWKIT YKDTVQLRRK LEFFTYSRFD MEFTFVVTAN
     FTETNNGHAL NQVYQIMYVP PGAPVPEKWD DYTWQTSSNP SIFYTYGTAP ARISVPYVGI
     SNAYSHFYDG FSKVPLKDQS AALGDSLYGA ASLNDFGILA VRVVNDHNPT KVTSKIRVYL
     KPKHIRVWCP RPPRAVAYYG PGVDYKDGTL TPLSTKDLTT YGFGHQNKAV YTAGYKICNY
     HLATQEDLQN AVNVMWNRDL LVTESRAQGT DSIARCNCNA GVYYCESRRK YYPVSFVGPT
     FQYMEANNYY PARYQSHMLI GHGFASPGDC GGILRCHHGV IGIITAGGEG LVAFTDIRDL
     YAYEEEAMEQ GITNYIESLG AAFGSGFTQQ IGDKITELTN MVTSTITEKL LKNLIKIISS
     LVIITRNYED TTTVLATLAL LGCDASPWQW LRKKACDVLE IPYVTKQGDS WLKKFTEACN
     AAKGLEWVSN KISKFIDWLK EKIIPQARDK LEFVTKLRQL EMLENQISTI HQSCPSQEHQ
     EILFNNVRWL SIQSKRFAPL YAVEAKRIQK LEHTINNYIQ FKSKHRIEPV CLLVHGSPGT
     GKSVATNLIA RAIAERENTS TYSLPPDPSH FDGYKQQGVV IMDDLNQNPD GADMKLFCQM
     VSTVEFIPPM ASLEEKGILF TSNYVLASTN SSRISPPTVA HSDALARRFA FDMDIQVMNE
     YSRDGKLNMA MATEMCKNCH QPANFKRCCP LVCGKAIQLM DKSSRVRYSI DQITTMIINE
     RNRRSNIGNC MEALFQGPLQ YKDLKIDIKT SPPPECINDL LQAVDSQEVR DYCEKKGWIV
     NITSQVQTER NINRAMTILQ AVTTFAAVAG VVYVMYKLFA GHQGAYTGLP NKKPNVPTIR
     TAKVQGPGFD YAVAMAKRNI VTATTSKGEF TMLGVHDNVA ILPTHASPGE SIVIDGKEVE
     ILDAKALEDQ AGTNLEITII TLKRNEKFRD IRPHIPTQIT ETNDGVLIVN TSKYPNMYVP
     VGAVTEQGYL NLGGRQTART LMYNFPTRAG QCGGVITCTG KVIGMHVGGN GSHGFAAALK
     RSYFTQSQGE IQWMRPSKEV GYPIINAPSK TKLEPSAFHY VFEGVKEPAV LTKNDPRLKT
     NFEEAIFSKY VGNKITEVDE HMKEAVDHYA GQLMSLDINT EQMCLEDAMY GTDGLEALDL
     STSAGYPYVA MGKKKRDILN KQTRDTKEMQ KLLDTYGINL PLVTYVKDEL RSKTKVEQGK
     SRLIEASSLN DSVAMRMAFG NLYAAFHKNP GVITGSAVGC DPDLFWSKIP VLMEEKLFAF
     DYTGYDASLS PAWFEALEMV LEKIGFGDRV DYIDYLNHSH HLYKNKTYCV KGGMPSGCSG
     TSIFNSMINN LIIRTLLLKT YKGIDLDHLK MIAYGDDVIA SYPHEVDASL LAQSGKDYGL
     TMTPADKSAI FETVTWENVT FLKRFFRADE KYPFLIHPVM PMKEIHESIR WTKDPRNTQD
     HVRSLCLLAW HNGEEEYNKF LAKIRSVPIG RALLLPEYST LYRRWLDSF
//