In molecular biology, protein aggregation is a phenomenon in which intrinsically disordered or misfolded proteins aggregate (i.e., accumulate and clump together) either intra- or extracellularly. Protein aggregates have been implicated in a wide variety of diseases known as amyloidoses, including ALS, Alzheimer's, Parkinson's and prion disease.
After synthesis, proteins typically fold into a particular three-dimensional conformation that is the most thermodynamically favorable: their native state. This folding process is driven by the hydrophobic effect: a tendency for hydrophobic (water-fearing) portions of the protein to shield themselves from the hydrophilic (water-loving) environment of the cell by burying into the interior of the protein. Thus, the exterior of a protein is typically hydrophilic, whereas the interior is typically hydrophobic.
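The split between a hydrophobic interior and a hydrophilic exterior can be illustrated with a hydropathy profile. The following is a minimal Python sketch, assuming the Kyte-Doolittle hydropathy scale (a standard scale, but not one named in this article); the helper name `hydropathy_profile` and the example sequence are hypothetical and chosen only for illustration. Positive window averages mark stretches that would tend to be buried, and negative ones mark stretches that would tend to face the solvent.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
# Using this scale is an assumption made for illustration.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=5):
    """Return the mean hydropathy of each sliding window along the sequence."""
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence.upper()]
    if window < 1 or window > len(scores):
        raise ValueError("window must be between 1 and the sequence length")
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# Hypothetical sequence: a hydrophobic stretch flanked by charged residues.
example = "KKDEELLIVVALLKKDEE"
profile = hydropathy_profile(example, window=5)

# Index of the window with the highest mean hydropathy.
most_hydrophobic = max(range(len(profile)), key=profile.__getitem__)
print(f"Most hydrophobic window starts at residue {most_hydrophobic + 1}")
```

A profile like this is only a rough, sequence-level indicator. It does not predict the folded structure, but it shows which segments would be costly to leave exposed to water if the protein misfolds.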
Protein structures are stabilized by non-covalent interactions and by disulfide bonds between pairs of cysteine residues. The non-covalent interactions include ionic interactions and weak van der Waals interactions. Ionic interactions form between an anion and a cation, creating salt bridges that help stabilize the protein. Van der Waals interactions include nonpolar interactions (e.g., London dispersion forces) and polar interactions (e.g., hydrogen bonds and dipole-dipole interactions). These play an important role in a protein's secondary structure, such as an alpha helix or a beta sheet, and in its tertiary structure. The interactions between amino acid residues in a given protein therefore largely determine that protein's final structure.
When the non-covalent interactions change, as may happen with a change in the amino acid sequence, the protein becomes susceptible to misfolding or unfolding. If the cell then neither assists the protein in refolding nor degrades it, the unfolded or misfolded protein may aggregate: its exposed hydrophobic portions interact with the exposed hydrophobic patches of other proteins. Three main types of protein aggregates may form: amorphous aggregates, oligomers, and amyloid fibrils.
Causes
Protein aggregation can occur due to a variety of causes, which fall into the four classes detailed below.
Mutations
Mutations in the DNA sequence may or may not affect the amino acid sequence of the protein. When the sequence is affected, the substituted amino acid may change the side-chain interactions that govern the folding of the protein. This can expose hydrophobic regions of the protein, which then aggregate with other copies of the same misfolded or unfolded protein or with a different protein.
In addition to mutations in the affected proteins themselves, protein aggregation can also be caused indirectly by mutations in proteins of regulatory pathways such as the refolding pathway (molecular chaperones) or the ubiquitin-proteasome pathway (ubiquitin ligases). Chaperones help with protein refolding by providing a safe environment in which the protein can fold. Ubiquitin ligases target proteins for degradation through ubiquitin modification.
Problems with protein synthesis
Protein aggregation can be caused by problems that occur during transcription or translation. During transcription, DNA is copied into a strand of pre-mRNA, which undergoes RNA processing to form mRNA. During translation, ribosomes and tRNA translate the mRNA sequence into an amino acid sequence. If a problem in either step produces an incorrect mRNA strand or an incorrect amino acid sequence, the protein can misfold, leading to protein aggregation.
Environmental stresses
Environmental stresses such as extreme temperatures, extreme pH, or oxidative stress can also lead to protein aggregation. One disease linked to such stress-induced aggregation is cryoglobulinemia.
Extreme temperatures can weaken and destabilize the non-covalent interactions between the amino acid residues. A pH outside the range in which the protein is stable can change the protonation state of the amino acids, which can strengthen or weaken the non-covalent interactions. Either change can make these interactions less stable and result in protein unfolding.
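The effect of pH on protonation state can be made concrete with the Henderson-Hasselbalch relation. The sketch below assumes a single ionizable group with a fixed pKa. The function name `fraction_protonated` is hypothetical, and the value of about 6.0 for a histidine side chain is an approximate textbook figure that shifts with the local environment inside a real protein.

```python
def fraction_protonated(ph, pka):
    """Fraction of a single ionizable group that is in its protonated form.

    Derived from Henderson-Hasselbalch: pH = pKa + log10([A-] / [HA]).
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


# Approximate side-chain pKa of histidine (assumption; varies in real proteins).
HIS_PKA = 6.0

for ph in (5.0, 6.0, 7.4):
    frac = fraction_protonated(ph, HIS_PKA)
    print(f"pH {ph}: {frac:.2f} of histidine side chains protonated")
```

Under these assumptions, a shift of about one pH unit around the pKa moves the group from mostly protonated to mostly deprotonated, which is enough to gain or lose a charge that a salt bridge may depend on.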
Oxidative stress can be caused by radicals such as reactive oxygen species (ROS). These unstable radicals can attack the amino acid residues, leading to oxidation of side chains (e.g., aromatic or methionine side chains) and/or cleavage of peptide bonds. This can disrupt the non-covalent interactions that hold the protein in its correct conformation, destabilizing the protein and possibly causing it to unfold.
Aging
Cells have mechanisms that can refold or degrade protein aggregates. However, as cells age, these control mechanisms are weakened and the cell is less able to resolve the aggregates.
The hypothesis that protein aggregation is a causative process in aging can now be tested, since some models of delayed aging are available. If the development of protein aggregates were an aging-independent process, slowing down aging would have no effect on the rate of proteotoxicity over time. However, if aging is associated with a decline in the activity of protective mechanisms against proteotoxicity, slow-aging models would show reduced aggregation and proteotoxicity. To address this question, several toxicity assays have been performed in C. elegans. These studies indicated that reducing the activity of insulin/IGF signaling (IIS), a prominent aging regulatory pathway, protects against neurodegeneration-linked toxic protein aggregation. The validity of this approach has been tested and confirmed in mammals: reducing the activity of the IGF-1 signaling pathway protected Alzheimer's model mice from the behavioral and biochemical impairments associated with the disease.
Aggregate localization
Several studies have shown that cellular responses to protein aggregation are well-regulated and organized. Protein aggregates localize to specific areas in the cell, and these localizations have been studied in prokaryotes (E. coli) and eukaryotes (yeast, mammalian cells). At the macroscopic level, positron emission tomography tracers are used to image certain misfolded proteins. A team of researchers led by Dr. Alessandro Crimi has proposed a machine learning method to predict future deposition in the brain.
Bacteria
The aggregates in bacteria end up asymmetrically at one of the poles of the cell, the "older pole." After the cell divides, the daughter cell with the older pole inherits the protein aggregate and grows more slowly than the daughter cell without the aggregate. This provides a natural selection mechanism for reducing protein aggregates in the bacterial population.
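The dilution effect described above can be sketched with a toy lineage model. This is an illustrative simplification, not a model taken from the studies themselves: it assumes a single starting cell carrying one aggregate, no formation of new aggregates, and a hypothetical `slow_factor` under which the aggregate-bearing cell divides only every second generation while aggregate-free cells divide every generation.

```python
def simulate(generations, slow_factor=2):
    """Toy model of asymmetric aggregate inheritance in a bacterial lineage.

    Starts from one aggregate-bearing cell. Aggregate-free cells divide every
    generation; the aggregate-bearing cell divides only every `slow_factor`
    generations. No new aggregates form (a deliberate simplification).
    Returns the fraction of cells carrying the aggregate after each generation.
    """
    with_aggregate, without_aggregate = 1, 0
    fractions = []
    for gen in range(1, generations + 1):
        # Each aggregate-free cell yields two aggregate-free daughters.
        without_aggregate *= 2
        if gen % slow_factor == 0:
            # The aggregate stays with the old-pole daughter; the other
            # daughter is born aggregate-free.
            without_aggregate += with_aggregate
        fractions.append(with_aggregate / (with_aggregate + without_aggregate))
    return fractions


for gen, frac in enumerate(simulate(6), start=1):
    print(f"generation {gen}: {frac:.3f} of cells carry the aggregate")
```

In this sketch the aggregate-bearing fraction can only fall, because the aggregate is never split between daughters and its carrier is outgrown by aggregate-free descendants. Real populations continuously form new aggregates, so the fraction would settle at some balance rather than vanish.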
Yeast
Most of the protein aggregates in yeast cells are refolded by molecular chaperones. However, some aggregates, such as oxidatively damaged proteins or proteins marked for degradation, cannot be refolded. Instead, they end up in one of two compartments: the juxtanuclear quality-control compartment (JUNQ), which is near the nuclear membrane, or the insoluble protein deposit (IPOD), which is near the vacuole in yeast cells. Protein aggregates localize to the JUNQ when they are ubiquitinated and targeted for degradation. Aggregated and insoluble proteins localize to the IPOD as a more permanent deposition, although there is evidence that the proteins there may be removed by autophagy. The two pathways work together in that proteins tend to be directed to the IPOD when the proteasome pathway is overloaded.
Mammalian cells
In mammalian cells, these protein aggregates are termed "aggresomes" and are formed when the cell is diseased. This is because aggregates tend to form when heterologous proteins are present in the cell, which can arise when the cell is mutated. Different mutants of the same protein may form aggresomes of different morphologies, ranging from a diffuse dispersion of soluble species to large puncta, which in turn differ in pathogenicity. E3 ubiquitin ligases are able to recognize misfolded proteins and ubiquitinate them. HDAC6 can then bind to the ubiquitin and to the motor protein dynein to bring the marked aggregates to the microtubule organizing center (MTOC). There, the aggregates pack together into a sphere that surrounds the MTOC, recruit chaperones and proteasomes, and activate autophagy.
Elimination
The cell has two main protein quality control systems for eliminating protein aggregates: refolding and degradation. Misfolded proteins can be refolded by the bi-chaperone system, or degraded by the ubiquitin-proteasome system or by autophagy.
Refolding
The bi-chaperone system uses the Hsp70 (DnaK-DnaJ-GrpE in E. coli and Ssa1-Ydj1/Sis1-Sse1/Fes1 in yeast) and Hsp100 (ClpB in E. coli and Hsp104 in yeast) chaperones for protein disaggregation and refolding.
Hsp70 interacts with the protein aggregates, recruits Hsp100, and stabilizes Hsp100 in its activated state. Hsp100 proteins have aromatic pore loops whose threading activity disentangles single polypeptides. Threading can be initiated at the N-terminus, at the C-terminus, or in the middle of the polypeptide. The polypeptide is translocated through Hsp100 in a series of steps, each of which consumes ATP. The polypeptide unfolds and is then allowed to refold, either by itself or with the help of heat shock proteins.
Degradation
Misfolded proteins can be eliminated through the ubiquitin-proteasome system (UPS). This consists of an E1-E2-E3 pathway that ubiquitinates proteins to mark them for degradation. In eukaryotes, the marked proteins are degraded by the 26S proteasome. In mammalian cells, the E3 ligase carboxy-terminal Hsp70 interacting protein (CHIP) targets Hsp70-bound proteins. In yeast, the E3 ligases Doa10 and Hrd1 have similar functions on endoplasmic reticulum proteins. On the molecular level, the degradation rate of aggregates varies from protein to protein because their internal environments differ, and with them the accessibility to protease molecules.
Misfolded proteins can also be eliminated through autophagy, in which the protein aggregates are delivered to the lysosome.
Toxicity
Although mature protein aggregates were long thought to be the toxic species, evidence suggests that immature protein aggregates are in fact the most toxic. The hydrophobic patches of these aggregates can interact with other components of the cell and damage them. Proposed mechanisms of toxicity include the sequestration of cellular components, the generation of reactive oxygen species, binding to specific receptors in the membrane, and the disruption of membranes. A quantitative assay has been used to determine that higher molecular weight species are responsible for membrane permeation. Protein aggregates in vitro are known to destabilize artificial phospholipid bilayers, leading to permeabilization of the membrane.