ID POLG_FMDV1 STANDARD; PRT; 2333 AA. AC P03306; Q84750; Q84751; Q84752; Q84753; Q84754; Q89824; Q84760; AC Q84761; Q84762; Q84763; Q84764; Q84765; Q84766; Q84767; Q64768; AC Q84768; Q84769; DT 21-JUL-1986 (Rel. 01, Created) DT 21-JUL-1986 (Rel. 01, Last sequence update) DT 30-MAY-2000 (Rel. 39, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: NONSTRUCTURAL PROTEIN P20A; COAT DE PROTEINS VP1 TO VP4; CORE PROTEIN P52; GENOME-LINKED PROTEINS VPG1 TO DE VPG3; PICORNAIN 3C (EC 3.4.22.28) (PROTEASE 3C) (P3C); RNA-DIRECTED DE RNA POLYMERASE P56A (EC 2.7.7.48)]. OS Foot-and-mouth disease virus (strain A10-61) (Aphthovirus A). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Aphthovirus. OX NCBI_TaxID=12112; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=84169547; PubMed=6324120; RA Carroll A.R., Rowlands D.J., Clarke B.E.; RT "The complete nucleotide sequence of the RNA coding for the primary RT translation product of foot and mouth disease virus."; RL Nucleic Acids Res. 12:2461-2472(1984). RN [2] RP SEQUENCE OF 115-1048 FROM N.A. RX MEDLINE=82211814; PubMed=6282711; RA Boothroyd J.C., Harris T.J.R., Rowlands D.J., Lowe P.A.; RT "The nucleotide sequence of cDNA coding for the structural proteins of RT foot-and-mouth disease virus."; RL Gene 17:153-161(1982). CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; V01130; CAA24361.1; -. DR EMBL; X00429; CAA25127.1; -. DR PIR; A03908; GNNY2F. DR HSSP; P13899; 1TME. DR MEROPS; C03.008; -. DR MEROPS; C28.001; -. DR MEROPS; U29.002; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; RNA-directed RNA polymerase; KW Transferase; Hydrolase; Thiol protease; Nonstructural protein; KW Myristate. FT CHAIN 1 201 NONSTRUCTURAL PROTEIN P20A. FT CHAIN 202 286 COAT PROTEIN VP4. FT CHAIN 287 504 COAT PROTEIN VP2. FT CHAIN 505 725 COAT PROTEIN VP3. FT CHAIN 726 937 COAT PROTEIN VP1. FT CHAIN 938 1578 CORE PROTEIN P52. FT CHAIN 1579 1601 GENOME-LINKED PROTEIN VPG1. FT CHAIN 1602 1625 GENOME-LINKED PROTEIN VPG2. FT CHAIN 1626 1649 GENOME-LINKED PROTEIN VPG3. FT CHAIN 1650 1863 PROTEASE P20B. FT CHAIN 1864 2333 RNA-DIRECTED RNA POLYMERASE P56A. FT LIPID 202 202 MYRISTATE. FT CONFLICT 396 396 S -> C (IN REF. 2). FT CONFLICT 632 632 P -> L (IN REF. 2). SQ SEQUENCE 2333 AA; 259645 MW; 4FC667DCC521BC60 CRC64; MNTTNCFIAL VYLIREIKTL FRSRTKGKME FTLHNGEKKT FYSRPNNHDN CWLNTILQLF RYVDEPFFDW VYNSPENLTL DAIKQLENFT GLELHEGGPP ALVIWNIKHL LQTGIGTASR PSEVCMVDGT DMCLADFHAG IFMKGQEHAV FACVTSDGWY AIDDEDFYPW TPDPSDVLVF VPYDQEPLNG DWKTQVQKKL KGAGQSSPAT GSQNQSGNTG SIINNYYMQQ YQNSMSTQLG DNTISGGSNE GSTDTTSTHT TNTQNNDWFS KLASSAFTGL FGALLADKKT EETTLLEDRI LTTRNGHTTS TTQSSVGVTY GYSTEEDHVA GPNTSGLETR VVQAERFFKK FLFDWTTDKP FGYLTKLELP TDHHGVFGHL VDSYAYMRNG WDVEVSAVGN QFNGGCLLVA MVPEWKAFDT REKYQLTLFP HQFISPRTNM TAHITVPYLG VNRYDQYKKH KPWTLVVMVL SPLTVSNTAA PQIKVYANIA PTYVHVAGEL PSKEGIFPVA CADGYGGLVT TDPKTADPVY GKVYNPPKTN YPGRFTNLLD VAEACPTFLR FDDGKPYVVT RADDTRLLAK FDVSLAAKHM SNTYLSGIAQ YYTQYSGTIN LHFMFTGSTD SKARYMVAYI PPGVETPPDT PEEAAHCIHA EWDTGLNSKF TFSIPYVSAA DYAYTASDTA ETTNVQGWVC VYQITHGKAE NDTLLVSASA GKDFELRLPI DPRTQTTTTG ESADPVTTTV ENYGGDTQVQ RRHHTDVGFI MDRFVKINSL SPTHVIDLMQ THKHGIVGAL LRAATYYFSD LEIVVRHDGN LTWVPNGAPE AALSNTSNPT AYNKAPFTRL ALPYTAPHRV LATVYDGTNK YSASDSRSGD LGSIAARVAT QLPASFNYGA IQAQAIHELL VRMKRAELYC PRPLLAIKVT SQDRYKQKII APAKQLLNFD LLKLAGDVES NLGPFFFADV RSNFSKLVDT INQMQEDMST KHGPDFNRLV SAFEELATGV KAIRTGLDEA KPWYKLIKLL SRLSCMAAVA ARSKDPVLVA IMLADTGLEI LDSTFVVKKS SDSLSSLFHV PAPAFSFGAP VLLAGLVKVA SSFFRSTPED LERAEKQLKA RDINDIFAIL KNGEWLVKLI LAIRDWIKAW IASEEKFVTM TDLVPGILEK QRDLNDPGKY KEAKEWLDNA RQACLKSGNV HIANLCKVVA PAPSKSRPEP VVVCLRGKSG QGKSFLANVL AQAISTHFTG RIDSVWYCPP DPDHFDGYNQ QTVVVMDDLG QNPDGKDFKY FAQMVSTTGF IPPMASLEDK GKPFNSKVII ATTNLYSGFT PRTMVCPDAL NRRFHFDIDV SAKDGYKINN KLDIIKALED THTNPVAMFQ YDCALLNGMA VEMKRLQQDM FKPQPPLQNV YQLVQEVIER VELHEKVSSH PIFKQISIPS QKSVLYFLIE KGQHEAAIEF FEGMVHDSVK EELRPLIQQT SFVKRAFKRL KENFEIVALC LTLLANIVIM IRETRKRQKM VDDAVNEYIE RANITTDDKT LDEAEKNPLE TSGASTVGFR ERSLTGQKVR DDVSSEPAQP AEDQPQAEGP YSGPLERQKP LKVRAKLPQQ EGPYAGPMER QKPLKVKVKA PVVKEGPYEG PVKKPVALKV KARNLIVTES GAPPTDLQKM VMGNTKPVEL NLDGKTVAIC CATGVFGTAY LVPRHLFAEK YDKIMLDGRA MTDSDYRVFE FEIKVKRTGH ALRRGTHWLL HRGNCVRDIT KHFRDTARMK KGTPVVGVVN NADVGRLIFS GEALTYKDIV VCMDGDTMPG LFAYKAATRA GYCGGAVLAK DGADTFIVGT HSAGGNGVGY CSCVSRSMLQ KMKAHVDPEP HHEGLIVDTR DVEERVHVMR KTKLAPTVAY GVFNPEFGPA ALSNKDPRLN EGVVLDDVIF SKHKGDAKMT EEDKALFRRC AADYASRLHS VLGTANAPLS IYEAIKGVDG LDAMEPDTAP GLPWALQGKR RGALIDFENG TVGPEVEAAL KLMEKREYKF ACQTFLKDEI RPMEKVRAGK TRIVDVLPVE HILYTKMMIG RFCAQMHSNN GPQIGSAVGC NPDVDWQRFG THFAQYRNVW DVDYSAFDAN HCSDAMNIMF EEVFRTDFGF HPNAEWILKT LVNTEHAYEN KRITVEGGMP SGCSATSIIN TILNNIYVLY ALRRHYEGVE LDTYTMISYG DDIVVASDYD LDFEALKPHF KSLGQTITPA DKSDKGFVLG QSITDVTFLK RHFHMDYGTG FYKPVMASKT LEAILSFARR GTIQEKLISV AGLAVHSGPD EYRRLFEPFQ GLFEIPSYRS LYLRWVNAVC GDA //