N-end rule

The N-end rule is a rule that governs the rate of protein degradation through recognition of the N-terminal residue of proteins. The rule states that the N-terminal amino acid of a protein determines its half-life (time after which half of the total amount of a given polypeptide is degraded). The rule applies to both eukaryotic and prokaryotic organisms, but with different strength, rules, and outcome.[1] In eukaryotic cells, these N-terminal residues are recognized and targeted by ubiquitin ligases, mediating ubiquitination thereby marking the protein for degradation.[2] The rule was initially discovered by Alexander Varshavsky and co-workers in 1986.[3] However, only rough estimations of protein half-life can be deduced from this 'rule', as N-terminal amino acid modification can lead to variability and anomalies, whilst amino acid impact can also change from organism to organism. Other degradation signals, known as degrons, can also be found in sequence.

Rules in different organisms

The rule may operate differently in different organisms.

Yeast

N-terminal residues - approximate half-life of proteins for S. cerevisiae[3]

  • Met, Gly, Ala, Ser, Thr, Val, Pro - > 20 hrs (stabilizing)
  • Ile, Glu - approx. 30 min (stabilizing)
  • Tyr, Gln - approx. 10 min (destabilizing)
  • Leu, Phe, Asp, Lys - approx. 3 min (destabilizing)
  • Arg - approx. 2 min (destabilizing)

Mammals

N-terminal residues - approximate half-life of proteins in mammalian systems [4]

  • Val (V)→ 100h
  • Met (M), Gly (G) → 30h
  • Pro (P) → 20h
  • Ile (I)→ 20h
  • Thr (T) → 7.2h
  • Leu (L) → 5.5h
  • Ala (A) → 4.4h
  • His (H) → 3.5h
  • Trp (W) → 2.8h
  • Tyr (Y) → 2.8h
  • Ser (S) → 1.9h
  • Asn (N) → 1.4h
  • Lys (K) → 1.3h
  • Cys (C) → 1.2h
  • Asp (D) → 1.1h
  • Phe (F) → 1.1h
  • Glu (E) → 1.0h
  • Arg (R) → 1.0h
  • Gln (Q) → 0.8h

Bacteria

In Escherichia coli, positively-charged and some aliphatic and aromatic residues on the N-terminus, such as arginine, lysine, leucine, phenylalanine, tyrosine, and tryptophan, have short half-lives of around 2-minutes and are rapidly degraded.[5] These residues (when located at the N-terminus of a protein) are referred to as destabilising residues. In bacteria, destabilising residues can be further defined as Primary destabilising residues (leucine, phenylalanine, tyrosine, and tryptophan) or secondary destabilising residues (arginine, lysine and in a special case methionine [6] ). Secondary destabilising residues are modified by the attachment of a Primary destabilising residue by the enzyme leucyl/phenylalanyl-tRNA-protein transferase.[5][6] All other amino acids when located at the N-terminus of a protein are referred to as stabilising residues and have half-lives of more than 10 hours .[5] Proteins bearing an N-terminal Primary destabilising residue are specifically recognised by the bacterial N-recognin (recognition component) ClpS.[7][8] ClpS is as a specific adaptor protein for the ATP-dependent AAA+ protease ClpAP, and hence ClpS delivers N-degron substrates to ClpAP for degradation.

A complicating issue is that the first residue of bacterial proteins is normally expressed with an N-terminal formylmethionine (f-Met). The formyl group of this methionine is quickly removed, and the methionine itself is then removed by methionyl aminopeptidase. The removal of the methionine is more efficient when the second residue is small and uncharged (for example alanine), but inefficient when it is bulky and charged such as arginine. Once the f-Met is removed, the second residue becomes the N-terminal residue and are subject to the N-end rule. Residues with middle sized side-chains such as leucine as the second residue therefore may have a short half-life.[9]

Chloroplasts

There are several reasons why it is possible that the N-end rule functions in the chloroplast organelle of plant cells as well.[10] The first piece of evidence comes from the endosymbiotic theory which encompasses the idea that chloroplasts are derived from cyanobacteria, photosynthetic organisms that can convert light into energy.[11][12] It is thought that the chloroplast developed from an endosymbiosis between a eukaryotic cell and a cyanobacterium, because chloroplasts share several features with the bacterium, including photosynthetic capabilities.[11][12] The bacterial N-end rule is already well documented; it involves the Clp protease system which consists of the adaptor protein ClpS and the ClpA/P chaperone and protease core.[5][7][13] A similar Clp system is present in the chloroplast stroma, suggesting that the N-end rule might function similarly in chloroplasts and bacteria.[10][14]

Additionally, a 2013 study in Arabidopsis thaliana revealed the protein ClpS1, a possible plastid homolog of the bacterial ClpS recognin.[15] ClpS is a bacterial adaptor protein that is responsible for recognizing protein substrates via their N-terminal residues and delivering them to a protease core for degradation.[7] This study suggests that ClpS1 is functionally similar to ClpS, also playing a role in substrate recognition via specific N-terminal residues (degrons) like its bacterial counterpart.[15] It is posited that upon recognition, ClpS1 binds to these substrate proteins and brings them to the ClpC chaperone of the protease core machinery to initiate degradation.[15]

In another study, Arabidopsis thaliana stromal proteins were analyzed to determine the relative abundance of specific N-terminal residues.[16] This study revealed that Alanine, Serine, Threonine, and Valine were the most abundant N-terminal residues, while Leucine, Phenylalanine, Tryptophan, and Tyrosine (all triggers for degradation in bacteria) were among the residues that were rarely detected.[16]

Furthermore, an affinity assay using ClpS1 and N-terminal residues was performed to determine whether ClpS1 did indeed have specific binding partners.[17] This study revealed that Phenylalanine and Tryptophan bind specifically to ClpS1, making them prime candidates for N-degrons in chloroplasts.[17]

Further research is currently being conducted to confirm whether the N-end rule operates in chloroplasts.[10][17]

Apicoplast

An apicoplast is a derived non-photosynthetic plastid found in most Apicomplexa, including Toxoplasma gondii, Plasmodium falciparum and other Plasmodium spp. (parasites causing malaria). Similar to plants, several Apicomplexan species, including Plasmodium falciparum contain all of the necessary components [18][19] required for a Apicoplast-localized Clp-protease, including a potential homolog of the bacterial ClpS N-recognin.[20][21] In vitro data demonstrate that Plasmodium falciparum ClpS is able to recognize a variety of N-terminal primary destabilizing residues, not only the classic bacterial Primary destabilizing residues (leucine, phenylalanine, tyrosine and tryptophan) but also N-terminal Isoleucine and hence exhibits broad specificity (in comparison to its bacterial counterpart).[21]

References

  1. ^ Varshavsky A (January 1997). "The N-end rule pathway of protein degradation". Genes to Cells. 2 (1): 13–28. doi:10.1046/j.1365-2443.1997.1020301.x. PMID 9112437. S2CID 27736735.
  2. ^ Tasaki T, Sriram SM, Park KS, Kwon YT (2012). "The N-end rule pathway". Annual Review of Biochemistry. 81: 261–89. doi:10.1146/annurev-biochem-051710-093308. PMC 3610525. PMID 22524314.
  3. ^ a b Bachmair A, Finley D, Varshavsky A (October 1986). "In vivo half-life of a protein is a function of its amino-terminal residue". Science. 234 (4773): 179–86. Bibcode:1986Sci...234..179B. doi:10.1126/science.3018930. PMID 3018930.
  4. ^ Gonda DK, Bachmair A, Wünning I, Tobias JW, Lane WS, Varshavsky A (October 1989). "Universality and structure of the N-end rule". The Journal of Biological Chemistry. 264 (28): 16700–12. doi:10.1016/S0021-9258(19)84762-2. PMID 2506181.
  5. ^ a b c d Tobias JW, Shrader TE, Rocap G, Varshavsky A (November 1991). "The N-end rule in bacteria". Science. 254 (5036): 1374–7. Bibcode:1991Sci...254.1374T. doi:10.1126/science.1962196. PMID 1962196.
  6. ^ a b Ninnis RL, Spall SK, Talbo GH, Truscott KN, Dougan DA (June 2009). "Modification of PATase by L/F-transferase generates a ClpS-dependent N-end rule substrate in Escherichia coli". The EMBO Journal. 28 (12): 1732–44. doi:10.1038/emboj.2009.134. PMC 2699360. PMID 19440203.
  7. ^ a b c Erbse A, Schmidt R, Bornemann T, Schneider-Mergener J, Mogk A, Zahn R, et al. (February 2006). "ClpS is an essential component of the N-end rule pathway in Escherichia coli". Nature. 439 (7077): 753–6. Bibcode:2006Natur.439..753E. doi:10.1038/nature04412. PMID 16467841. S2CID 4406838.
  8. ^ Schuenemann VJ, Kralik SM, Albrecht R, Spall SK, Truscott KN, Dougan DA, Zeth K (May 2009). "Structural basis of N-end rule substrate recognition in Escherichia coli by the ClpAP adaptor protein ClpS". EMBO Reports. 10 (5): 508–14. doi:10.1038/embor.2009.62. PMC 2680879. PMID 19373253.
  9. ^ Hirel PH, Schmitter MJ, Dessen P, Fayat G, Blanquet S (November 1989). "Extent of N-terminal methionine excision from Escherichia coli proteins is governed by the side-chain length of the penultimate amino acid". Proceedings of the National Academy of Sciences of the United States of America. 86 (21): 8247–51. Bibcode:1989PNAS...86.8247H. doi:10.1073/pnas.86.21.8247. PMC 298257. PMID 2682640.
  10. ^ a b c Bouchnak I, van Wijk KJ (October 2019). "N-Degron Pathways in Plastids". Trends in Plant Science. 24 (10): 917–926. doi:10.1016/j.tplants.2019.06.013. PMID 31300194. S2CID 196351051.
  11. ^ a b Archibald JM (October 2015). "Endosymbiosis and Eukaryotic Cell Evolution". Current Biology. 25 (19): R911-21. doi:10.1016/j.cub.2015.07.055. PMID 26439354.
  12. ^ a b McFadden GI (January 2001). "Chloroplast origin and integration". Plant Physiology. 125 (1): 50–3. doi:10.1104/pp.125.1.50. PMC 1539323. PMID 11154294.
  13. ^ Dougan DA, Micevski D, Truscott KN (January 2012). "The N-end rule pathway: from recognition by N-recognins, to destruction by AAA+proteases". Biochimica et Biophysica Acta (BBA) - Molecular Cell Research. 1823 (1): 83–91. doi:10.1016/j.bbamcr.2011.07.002. PMID 21781991.
  14. ^ Nishimura K, van Wijk KJ (September 2015). "Organization, function and substrates of the essential Clp protease system in plastids". Biochimica et Biophysica Acta (BBA) - Bioenergetics. 1847 (9): 915–30. doi:10.1016/j.bbabio.2014.11.012. PMID 25482260.
  15. ^ a b c Nishimura K, Asakura Y, Friso G, Kim J, Oh SH, Rutschow H, et al. (June 2013). "ClpS1 is a conserved substrate selector for the chloroplast Clp protease system in Arabidopsis". The Plant Cell. 25 (6): 2276–301. doi:10.1105/tpc.113.112557. PMC 3723626. PMID 23898032.
  16. ^ a b Rowland E, Kim J, Bhuiyan NH, van Wijk KJ (November 2015). "The Arabidopsis Chloroplast Stromal N-Terminome: Complexities of Amino-Terminal Protein Maturation and Stability". Plant Physiology. 169 (3): 1881–96. doi:10.1104/pp.15.01214. PMC 4634096. PMID 26371235.
  17. ^ a b c Montandon C, Dougan DA, van Wijk KJ (May 2019). "N-degron specificity of chloroplast ClpS1 in plants". FEBS Letters. 593 (9): 962–970. doi:10.1002/1873-3468.13378. PMID 30953344.
  18. ^ Florentin A, Cobb DW, Fishburn JD, Cipriano MJ, Kim PS, Fierro MA, et al. (November 2017). "PfClpC Is an Essential Clp Chaperone Required for Plastid Integrity and Clp Protease Stability in Plasmodium falciparum". Cell Reports. 21 (7): 1746–1756. doi:10.1016/j.celrep.2017.10.081. PMC 5726808. PMID 29141210.
  19. ^ El Bakkouri M, Rathore S, Calmettes C, Wernimont AK, Liu K, Sinha D, et al. (January 2013). "Structural insights into the inactive subunit of the apicoplast-localized caseinolytic protease complex of Plasmodium falciparum". The Journal of Biological Chemistry. 288 (2): 1022–31. doi:10.1074/jbc.M112.416560. PMC 3542988. PMID 23192353.
  20. ^ LaCount DJ, Vignali M, Chettier R, Phansalkar A, Bell R, Hesselberth JR, et al. (November 2005). "A protein interaction network of the malaria parasite Plasmodium falciparum". Nature. 438 (7064): 103–7. Bibcode:2005Natur.438..103L. doi:10.1038/nature04104. PMID 16267556. S2CID 4401702.
  21. ^ a b Tan JL, Ward L, Truscott KN, Dougan DA (October 2016). "The N-end rule adaptor protein ClpS from Plasmodium falciparum exhibits broad substrate specificity". FEBS Letters. 590 (19): 3397–3406. doi:10.1002/1873-3468.12382. PMID 27588721.