ID   POLG_HE701     STANDARD;      PRT;  2194 AA.
AC   P32537;
DT   01-OCT-1993 (Rel. 27, Created)
DT   01-NOV-1995 (Rel. 32, Last sequence update)
DT   15-DEC-1998 (Rel. 37, Last annotation update)
DE   GENOME POLYPROTEIN [CONTAINS: COAT PROTEINS VP1 TO VP4; CORE PROTEINS
DE   P2A TO P2C, P3A; GENOME-LINKED PROTEIN VPG; PICORNAIN 3C
DE   (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED RNA POLYMERASE P3D
DE   (EC 2.7.7.48)].
OS   Human enterovirus 70 (strain J670/71) (EV 70).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae;
OC   Enterovirus.
OX   NCBI_TaxID=31915;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=91037960; PubMed=2172447;
RA   Ryan M.D., Jenkins O., Hughes P.J., Brown A., Knowles N.J., Booth D.,
RA   Minor P.D., Almond J.W.;
RT   "The complete nucleotide sequence of enterovirus type 70:
RT   relationships with other members of the picornaviridae.";
RL   J. Gen. Virol. 71:2291-2299(1990).
CC   -!- FUNCTION: P3C POLYPEPTIDE IS A PROTEASE THAT CLEAVES AT CERTAIN
CC       Q/G SITES IN THE POLYPROTEIN. IT MAY BE A CYSTEINE PROTEASE.
CC   -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS,
CC       EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2,
CC       VP3, AND VP4.
CC   -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS.
CC   -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; D00820; BAA18891.1; -.
DR   PIR; A36253; GNNYE7.
DR   HSSP; P03299; 1POV.
DR   MEROPS; C03.001; -.
DR   MEROPS; C03.020; -.
DR   INTERPRO; IPR000081; -.
DR   INTERPRO; IPR000199; -.
DR   INTERPRO; IPR000605; -.
DR   INTERPRO; IPR001205; -.
DR   INTERPRO; IPR001676; -.
DR   INTERPRO; IPR002527; -.
DR   PFAM; PF00548; Cys-protease-3C; 1.
DR   PFAM; PF00947; Pico_P2A; 1.
DR   PFAM; PF01552; Pico_P2B; 1.
DR   PFAM; PF00680; RNA_dep_RNA_pol; 1.
DR   PFAM; PF00910; RNA_helicase; 1.
DR   PFAM; PF00073; rhv; 3.
KW   Polyprotein; Coat protein; Core protein; Transferase;
KW   RNA-directed RNA polymerase; Hydrolase; Thiol protease; Myristate.
FT   CHAIN         2     69       COAT PROTEIN VP4 (P1A).
FT   CHAIN        70    319       COAT PROTEIN VP2 (P1B).
FT   CHAIN       320    561       COAT PROTEIN VP3 (P1C).
FT   CHAIN       562    871       COAT PROTEIN VP1 (P1D).
FT   CHAIN       872   1014       CORE PROTEIN P2A.
FT   CHAIN      1015   1113       CORE PROTEIN P2B.
FT   CHAIN      1114   1443       CORE PROTEIN P2C.
FT   CHAIN      1444   1532       CORE PROTEIN P3A.
FT   CHAIN      1533   1554       GENOME-LINKED PROTEIN VPG (P3B).
FT   CHAIN      1555   1737       PICORNAIN 3C.
FT   CHAIN      1738   2194       RNA-DIRECTED RNA POLYMERASE P3D.
FT   LIPID         2      2       MYRISTATE (BY SIMILARITY).
FT   ACT_SITE   1701   1701       PROTEASE (POTENTIAL).
FT   ACT_SITE   1715   1715       PROTEASE (POTENTIAL).
SQ   SEQUENCE   2194 AA;  244590 MW;  15DBAE96EE06673C CRC64;
     MGAQVSRQQT GTHENANVAT GGSSITYNQI NFYKDSYAAS ASKQDFSQDP SKFTEPVAEA
     LKAGAPVLKS PSAEACGYSD RVLQLKLGNS SIVTQEAANI CCAYGEWPTY LPDNEAVAID
     KPTQPETSTD RFYTLKSKKW ESNSTGWWWK LPDALNQIGM FGQNVQYHYL YRSGFLCHVQ
     CNATKFHQGT LLIVAIPEHQ IGKKGTGTSA SFAEVMKGAE GGVFEQPYLL DDGTSLACAL
     VYPHQWINLR TNNSATIVLP WMNSAPMDFA LRHNNWTLAV IPVCPLAGGT GNTNTYVPIT
     ISIAPMCAEY NGLRNAITQG VPTCLLPGSN QFLTTDDHSS APAFPDFSPT PEMHIPGQVH
     SMLEIVQIES MMEINNVNDA SGVERLRVQI SAQSDMDQLL FNIPLDIQLE GPLRNTLLGN
     ISRYYTHWSG SLEMTFMFCG SFMTTGKLII CYTPPGGSSP TDRMQAMLAT HVVWDFGLQS
     SITIIIPWIS GSHYRMFNTD AKAINANVGY VTCFMQTNLV APVGAADQCY IVGMVAAKKD
     FNLRLMRDSP DIGQSAILPE QAATTQIGEI VKTVANTVES EIKAELGVIP SLNAVETGAT
     SNTEPEEAIQ TRTVINMHGT AECLVENFLG RSALVCMRSF EYKNHSTSTS SIQKNFFIWT
     LNTRELVQIR RKMELFTYLR FDTEITIVPT LRLFSSSNVS FSGLPNLTLQ AMYVPTGARK
     PSSQDSFEWQ SACNPSVFFK INDPPARLTI PFMSINSAYA NFYDGFAGFE KKATVLYGIN
     PANTMGNLCL RVVNSYQPVQ YTLTVRVYMK PKHIKAWAPR APRTMPYTNI LNNNYAGRSA
     APNAPTAIVS HRSTIKTMPN DINLTTAGPG YGGAFVGSYK IINYHLATDE EKERSVYVDW
     QSDVLVTTVA AHGKHQIARC RCNTGVYYCK HKNRSYPVCF EGPGIQWINE SDYYPARYQT
     NTLLAMGPCQ PGDCGGLLVC SHGVIGLVTA GGEGIVAFTD IRNLLWLEDD AMEQGITDYI
     QNLGSAFGTG FTETISEKAK EIQNMLVGED SLLEKLLKAL IKIVSAMVIV IRNSEDLVTV
     TATLALLGCN DSPWAFLKQK VCSYLGIPYT IRQSDSWLKK FTEACNALRG LDWLAQKIDK
     FINWLKTKIL PEAREKHEFV QKLKQLPVIE SQINTIEHSC PNSEXQQALF NNVQYYSHYC
     KKYAPLYALE AKRVSALERK INNYIQFKSK SRIEPVCLII HGSPGTGKSV ASNLIARAIT
     EKLGGDSYSL PPDPKYFDGY KQQTVVLMDD LMQNPDGNDI AMFCQMVSTV DFIPPMASLE
     EKGTLYTSPF LIATTNAGSI HAPTVSDSKA LARRFKFDME IESMESYKDG VRLDMFKAVE
     LCNPEKCRPT NYKKCCPLIC GKAIQFRDKR TNVRYSVDML VTEMIKEYRI RNSTQDKLEA
     LFQGPPTFKE IKISVTPETP APDAINDLLR SIDSQEVRDY CQKKGWIVMH PPTELVVDKH
     ISRAFIALQA ITTFVSIAGV VYVIYKLFAG IQGPYTGLPN QKPKVPTLRT AKVQGPSLDF
     AQAIMRKNTV IARTSKGEFT MLGIYDRIAV VPTHASVEEE IYINDVPVKV KDAYALRDIN
     DVNLEITVVE LDRNEKFRDI RGFLPKYEDD YNDAILSVNT SKFPNMYIPV GQTLNYGFLN
     LGGTPTHRIL MYNFPTRAGQ CGGVVTTTGK VIGIHVGGNG AQGFAAMLLQ NYFTEKQGEI
     VSIEKTGVFI NAPAKTKLEP SVFHEVFEGV KEPAVLHSKD KRLKVDFEEA IFSKYVGNKT
     MLMDEYMEEA VDHYVGCLEP LDISTEPIKL EEAMYGMDGL EALDLTTSAG YPYLLQGKKK
     RDIFNRQTRD TTEMTKMLDK YGVDLPFVTF VKDELRSREK VEKGKSRLIE ASSLNDSVAM
     RVAFGNLYAT FHKNPGVATG SAVGCDPDLF WSKIPVXLDG KIFAFDYTGY DASLSPVWFA
     CLKKTLVKLG YTHQTAFVDY LCHSVHLYKD RKYIVNGGMP SGSSGTSIFN TMINNIIIRT
     LLLKVYKGID LDQFKMIAYG DDVIASYPHE IDPGLLAKAG KEYGLIMTPA DKSSGFTETT
     WENVTFLKRY FRADEQYPFL IHPVMPMKEI HESIRWTKDP RNTQDHVRSL CLLAWHNGEE
     TYNEFCRKIR TVPVGRALAL PVYSSLRRKW LDSF
//