ID POLG_FMDVA STANDARD; PRT; 2332 AA. AC P03308; P03312; Q65038; Q65039; Q65040; Q65041; Q65042; Q65043; AC Q65044; Q65045; Q65046; Q65047; DT 21-JUL-1986 (Rel. 01, Created) DT 01-JAN-1988 (Rel. 06, Last sequence update) DT 30-MAY-2000 (Rel. 39, Last annotation update) DE GENOME POLYPROTEIN [CONTAINS: NONSTRUCTURAL PROTEIN P20A; COAT DE PROTEINS VP1 TO VP4; CORE PROTEINS X, P14, P41, P19; GENOME-LINKED DE PROTEINS VPG1 TO VPG3; PICORNAIN 3C (EC 3.4.22.28) (PROTEASE 3C) DE (P3C); RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.48)]. OS Foot-and-mouth disease virus (strain A12) (Aphthovirus A). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Picornaviridae; OC Aphthovirus. OX NCBI_TaxID=12114; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=85211015; PubMed=2987518; RA Robertson B.H., Grubman M.J., Weddell G.N., Moore D.M., Welsh J.D., RA Fischer T., Dowbenko D.J., Yansura D.G., Small B., Kleid D.G.; RT "Nucleotide and amino acid sequence coding for polypeptides of foot- RT and-mouth disease virus type A12."; RL J. Virol. 54:651-660(1985). RN [2] RP SEQUENCE OF 1863-2332 FROM N.A. RX MEDLINE=83225613; PubMed=6305004; RA Robertson B.H., Morgan D.O., Moore D.M., Grubman M.J., Card J., RA Fischer T., Weddell G.N., Dowbenko D.J., Yansura D.G.; RT "Identification of amino acid and nucleotide sequence of the foot-and- RT mouth disease virus RNA polymerase."; RL Virology 126:614-623(1983). RN [3] RP SEQUENCE OF 715-955 FROM N.A. RX MEDLINE=82061853; PubMed=6272395; RA Kleid D.G., Yansura D.G., Small B., Dowbenko D.J., Moore D.M., RA Grubman M.J., McKercher P.D., Morgan D.O., Robertson B.H., RA Bachrach H.L.; RT "Cloned viral protein vaccine for foot-and-mouth disease: responses in RT cattle and swine."; RL Science 214:1125-1129(1981). CC -!- SUBUNIT: THE VIRUS CAPSID IS COMPOSED OF 60 ICOSAHEDRAL UNITS, CC EACH OF WHICH IS COMPOSED OF ONE COPY EACH OF PROTEINS VP1, VP2, CC VP3, AND VP4. CC -!- PTM: SPECIFIC ENZYMATIC CLEAVAGES IN VIVO YIELD MATURE PROTEINS. CC -!- SIMILARITY: THE PROTEASE BELONGS TO PEPTIDASE FAMILY C3. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; M10975; AAA42593.1; -. DR EMBL; J02187; AAA42670.1; -. DR PIR; A25794; GNNY4F. DR HSSP; P13899; 1TME. DR MEROPS; C03.008; -. DR MEROPS; U29.002; -. DR INTERPRO; IPR000605; -. DR INTERPRO; IPR001205; -. DR INTERPRO; IPR001676; -. DR PFAM; PF00680; RNA_dep_RNA_pol; 1. DR PFAM; PF00910; RNA_helicase; 1. DR PFAM; PF00073; rhv; 3. KW Polyprotein; Coat protein; Core protein; RNA-directed RNA polymerase; KW Transferase; Hydrolase; Thiol protease; Nonstructural protein; KW Myristate. FT CHAIN 1 200 NONSTRUCTURAL PROTEIN P20A. FT CHAIN 201 285 COAT PROTEIN VP4. FT CHAIN 286 503 COAT PROTEIN VP2. FT CHAIN 504 723 COAT PROTEIN VP3. FT CHAIN 724 937 COAT PROTEIN VP1. FT CHAIN 938 953 CORE PROTEIN X. FT CHAIN 954 1107 CORE PROTEIN P14. FT CHAIN 1108 1425 CORE PROTEIN P41. FT CHAIN 1426 1578 CORE PROTEIN P19. FT CHAIN 1579 1601 GENOME-LINKED PROTEIN VPG1. FT CHAIN 1602 1625 GENOME-LINKED PROTEIN VPG2. FT CHAIN 1626 1649 GENOME-LINKED PROTEIN VPG3. FT CHAIN 1650 1862 PROTEASE. FT CHAIN 1863 2332 RNA-DIRECTED RNA POLYMERASE. FT LIPID 201 201 MYRISTATE. SQ SEQUENCE 2332 AA; 259408 MW; EE77DA739CBEDC6A CRC64; MNTTNCFIAL VHAIREIRAF FLSRATGKME FTLYNGERKT FYSRPNNHDN CWLNTILQLF RYVDEPFFDW VYNSPENLTL AAIKQLEELT GLELHEGGPP ALVIWNIKHL LQTGIGTASR PARCMVDGTN MCLADFHAGI FLKEQEHAVF ACVTSNGWYA IDDEDFYPWT PDPSDVLVFV PYDQEPLNGG WKANVQRKLK GAGQSSPATG SQNQSGNTGS IINNYYMQQY QNSMDTQLGD NAISGGSNEG STDTTSTHTT NTQNNDWFSK LASSAFTGLF GALLADKKTE ETTLLEDRIL TTRNGHTTST TQSSVGVTYG YSTEEDHVAG PNTSGLETRV VQAERFFKKF LFDWTPDKPF GHRTKLELPT DHHGVFGHLV DSYAYMRNGW DVEVSAVGNQ FNGGCLLVAM VPEWKTFDTR EEYQLTLFPH QFISPRTNMT AHITVPYLGV NRYDQYKKHK PWTLVIMVLS PLTVSNTAAT QIKVYANIAP TYVHVAGELP SKVGIFPVAC SDGYGGLVTT DPKTADPVYG KEYNPPKTNY PRRFTNLLDV AEACPTFLCF DDGKPYVVTR TDDTRLLAKF DVSLAAKHMS NTYLSGIAQY YTQYSGTINL HFMFTGSTDS KARYMVAYIP PGVETPPETP EGAAHCIHAE WDTGLNSKFT FSIPYVSAAD YAYTASDTAE TTNVQGWVCI YQITHGKAED DTLVVSASAG KDFELRLPID PRSQTTATGE SADPVTTTVE NYGGETQVQR RHHTDVSFIM DRFVKIKSLN PTHVIDLMQT HQHGLVGALL RAATYYFSDL EIVVRHDGNL TWVPNGAPEA ALSNTGNPTA YNKAPFTRLA LPYTAPHRVL ATVYNGTNKY SASGSGVRGD FGSLAPRVAR QLPASFNYGA IKAETIHELL VRMKRAELYC PRPLLAIEVS SQDRHKQKII APGKQLLNFD LLKLAGDVES NPRPFFFADV RSNFSKLVDT INQMQEDMST KHGPDFNRLV SAFEELATGV KAIRTGLDEA KPWYKLIKLL SRLSCMAAVA ARTKDPVLVA IMLADTGLEI LDSTFVVKKI SDSLSSLFHV PAPVFSFGAP VLLAGLVKVA SSFLRSTPED LERAEKQLKA RDINDIFAIL KNGEWLVKLI LAIRDWIKAW IASEEKFVTM TDLVLGILEK QRDLNDPSKY KEAKEWLDNA RQACLKSGNV HIANLCKVVA PAPSKSRPEP VVVCLRGKSG QGKSFLANVL AQAISTHFTG RTDSVWYCPP DPDHFDGYNQ QTVVVMDDLG QNPDGKDFKY FAQMVSTTGF IPPMASLEDK GKPFNSKVII ATTNLYSGFT PRTMVCPDAL NRRFHFDIDV SAKDGYKINN KLDIVKALED THTNPVAMFQ YDCALLNGMA VEMKRMQQDM FKPQPPLQNV YQLVQEVIER VELHEKVSSH PIFKQISIPS QKSVLYFLIE KGQHEAAIEF FEGMVHDSIK EELRPLIQQT SFVKRAFKRL KENFEIVALC LTLLANIVIM IRETRKRQKM VDDAVNEYIE KANITTDDTT LDEAEKNPLE TSGASTVGFR ERTLTGQRAC NDVNSEPARP AEEQPQAEGP YTGPLERQRP LKVRAKLPQQ EGPYAGPLER QKPLKVKAKA PVVKEGPYEG PVKKPVALKV KAKNLIVTES GAPPTDLQKM VMGNTKPVEL ILDGKTVAIC CATGVFGTAY LVPRHLFAEK YDKIMLDGRA MTDSDYRVFE FEIKVKGQDM LSDAALMVLH RGNRVRDITK HFRDTARMKK GTPVVGVVNN ADVGRLIFSG EALTYKDIVV CMDGDTMPSL FAYKAATKAG YCGGAVLAKD GADTFIVGTH SAGGNGVGYC SCVSKSMLLR MKAHVDPEPQ HEGLIVDTRD VEERVHVMRK TKLAPTVAHG VFNPEFGPAA LSNKDPRLNE GVVLDEVIFS KHKGDTKMSA EDKALFRACA ADYASRLHSV LGTANAPLSI YEAIKGVDGL DAMESDTAPG LPWAFQGKRR GALIDFENGT VGPEVEAALK LMEKREYKFV CQTFLKDEIR PMEKVRAGKT RIVDVLPVEH ILYTRMMIGR FCAQMHSNNG PQIGSAVGCN PDVDWQRFGT HFAQYRNVWD VDYSAFDTNH CSDAMNIMFE EVFRTDFGFH PNAEWILKTL VNTEHAYENK RITVEGGMPS DCSATGIINT ILNNIYVLYA LRRHYEGVEL DTYTMISYGD DIVVASDYDL DFEALKPHFK SLGQTITPAD KSDKGFVLGH SITDVTFLKR HFHIDYGTGF YKPVMASKTL EAILSFARRG TIQEKLTSVA GLAVHSGPDE YRRLFEPFQG LFEIPSYRSL YLRWVNAVCG DA //