ID GAG_HV1MN STANDARD; PRT; 506 AA. AC P05888; DT 01-NOV-1988 (Rel. 09, Created) DT 01-FEB-1994 (Rel. 28, Last sequence update) DT 15-JUL-1998 (Rel. 36, Last annotation update) DE GAG POLYPROTEIN [CONTAINS: CORE PROTEINS P17, P24, P2, P7, P1, P6]. GN GAG. OS Human immunodeficiency virus type 1 (MN isolate) (HIV-1). OC Viruses; Retroid viruses; Retroviridae; Lentivirus. OX NCBI_TaxID=11696; RN [1] RP SEQUENCE, AND POST-TRANSLATIONAL MODIFICATIONS. RX MEDLINE=92194415; PubMed=1548743; RA Henderson L.E., Bowers M.A., Sowder R.C. II, Serabyn S.A., RA Johnson D.G., Bess J.W. Jr., Arthur L.O., Bryant D.K., Fenselau C.; RT "Gag proteins of the highly replicative MN strain of human RT immunodeficiency virus type 1: posttranslational modifications, RT proteolytic processings, and complete amino acid sequences."; RL J. Virol. 66:1856-1865(1992). RN [2] RP SEQUENCE FROM N.A. RX MEDLINE=88219542; PubMed=3369091; RA Gurgo C., Guo H.-G., Franchini G., Aldovini A., Collalti E., RA Farrell K., Wong-Staal F., Gallo R.C., Reitz M.S. Jr.; RT "Envelope sequences of two new United States HIV-1 isolates."; RL Virology 164:531-536(1988). RN [3] RP STRUCTURE BY NMR OF 380-434. RX MEDLINE=93278285; PubMed=1304355; RA Summers M.F., Henderson L.E., Chance M.R., Bess J.W. Jr., South T.L., RA Blake P.R., Sagi I., Perez-Alvarado G., Sowder R.C. III, Hare D.R., RA Arthur L.O.; RT "Nucleocapsid zinc fingers detected in retroviruses: EXAFS studies of RT intact viruses and the solution-state structure of the nucleocapsid RT protein from HIV-1."; RL Protein Sci. 1:563-574(1992). CC -!- FUNCTION: PERFORMS HIGHLY COMPLEX ORCHESTRATED TASKS DURING THE CC ASSEMBLY, BUDDING, MATURATION, AND INFECTION STAGES OF THE VIRAL CC REPLICATION CYCLE. DURING VIRAL ASSEMBLY, THE PROTEINS FORM CC MEMBRANE ASSOCIATIONS AND SELF-ASSOCIATIONS THAT ULTIMATELY CC RESULT IN BUDDING OF AN IMMATURE VIRION FROM THE INFECTED CELL. CC GAG PRECURSORS ALSO FUNCTION DURING VIRAL ASSEMBLY TO SELECTIVELY CC BIND AND PACKAGE TWO PLUS STRANDS OF GENOMIC RNA. CC -!- PTM: THE P24 PROTEIN IS PHOSPHORYLATED. CC -!- MISCELLANEOUS: THE MN ISOLATE WAS TAKEN FROM A PEDIATRIC AIDS CC PATIENT IN 1984. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- DR EMBL; M17449; AAA44853.1; -. DR PIR; A38068; A38068. DR PDB; 1AAF; 31-JAN-94. DR HIV; M17449; GAG$MN. DR INTERPRO; IPR000071; -. DR INTERPRO; IPR000721; -. DR INTERPRO; IPR001878; -. DR PFAM; PF00540; gag_p17; 1. DR PFAM; PF00607; gag_p24; 1. DR PFAM; PF00098; zf-CCHC; 2. DR PRINTS; PR00234; HIV1MATRIX. DR PRINTS; PR00939; C2HCZNFINGER. KW AIDS; Core protein; Polyprotein; Myristate; Phosphorylation; KW Zinc-finger; 3D-structure. FT INIT_MET 0 0 FT CHAIN 1 134 CORE PROTEIN P17 (MATRIX ANTIGEN). FT CHAIN 135 365 CORE PROTEIN P24 (CORE ANTIGEN). FT CHAIN 366 379 CORE PROTEIN P2. FT CHAIN 380 434 CORE PROTEIN P7 (NUCLEOCAPSID PROTEIN). FT CHAIN 435 450 CORE PROTEIN P1. FT CHAIN 451 506 CORE PROTEIN P6. FT LIPID 1 1 MYRISTATE. FT ZN_FING 394 407 C2HC-TYPE. FT VARIANT 34 34 V -> I. FT VARIANT 45 45 I -> V. FT VARIANT 74 74 R -> L, S OR N. FT VARIANT 92 92 K -> E. FT CONFLICT 17 17 K -> N (IN REF. 2). FT CONFLICT 141 141 Q -> E (IN REF. 2). FT CONFLICT 220 220 A -> V (IN REF. 2). FT CONFLICT 226 226 A -> T (IN REF. 2). FT CONFLICT 318 319 WM -> RT (IN REF. 2). FT CONFLICT 447 448 PG -> R (IN REF. 2). SQ SEQUENCE 506 AA; 56629 MW; AC6F3CEB691C4726 CRC64; GARASVLSGG ELDRWEKIRL RPGGKKKYKL KHVVWASREL ERFAINPGLL ETSEGCRQIL GQLQPSLQTG SEERKSLYNT VATLYCVHQK IKIKDTKEAL EKIEEEQNKS KKKAQQAAAD TGNRGNSSQV SQNYPIVQNI QGQMVHQAIS PRTLNAWVKV VEEKAFSPEV IPMFSALSEG ATPQDLNTML NTVGGHQAAM QMLKETINEE AAEWDRLHPA HAGPIAPGQM REPRGSDIAG TTSTLQEQIG WMTNNPPIPV GEIYKRWIIL GLNKIVRMYS PSSILDIRQG PKEPFRDYVD RFYKTLRAEQ ASQEVKNWMT ETLLVQNANP DCKTILKALG PAATLEEMMT ACQGVGGPGH KARVLAEAMS QVTNSATIMM QRGNFRNQRK IIKCFNCGKE GHIAKNCRAP RKRGCWKCGK EGHQMKDCTE RQANFLGKIW PSCKGRPGNF PQSRTEPTAP PEESFRFGEE TTTPYQKQEK KQETIDKDLY PLASLKSLFG NDPLSQ //