1. Introduction
The propagation of misfolded proteins is a hallmark of a wide variety of diseases, including neurodegenerative disorders, such as prion diseases (or transmissible spongiform encephalopathies, TSE), Alzheimer’s disease (AD), Parkinson’s syndrome and amyotrophic lateral sclerosis [
1]. Despite the proteins involved in these diseases being different, the mechanisms of protein aggregation have common features: protein monomers assemble into oligomers, which in turn form protofibrils and finally large amyloid fibers [
2]. Protein oligomers have acquired increasing importance as they are supposed to be the cell’s main toxic species [
3,
4]. However, the structural characterization of oligomeric states is extremely difficult as multiple states can coexist. Single-molecule (SM) techniques are well suited to exploring such complex and heterogeneous folding landscapes as they can characterize rare and transient states. Single-molecule force spectroscopy (SMFS) has been used to reconstruct folding pathways and to observe transient structures, enabling a quantitative description of these processes [
5]. Conformational equilibria of intrinsically disordered proteins have been extensively characterized using SMFS performed through coupled atomic force microscopy (AFM-SMFS) [
6], optical tweezers (OT) force spectroscopy [
7] and SM fluorescence [
8].
Specifically, in AFM-SMFS, the AFM probe is used to stretch a macromolecule (e.g., a protein) which is tethered between the probe and the substrate surface. The deflection of the calibrated AFM cantilever measures the applied force (through Hooke’s law), while the extension is measured. An AFM instrument for performing SMFS consists of a piezoelectric positioner used to stretch the macromolecule. The AFM probe, the macromolecule and the substrate are all immersed in a buffer solution. Commonly, surface adsorbed proteins are attached to the AFM probe by pushing the probe into the protein layer with a controlled force: such attachment can withstand the pulling forces necessary to unfold commonly folded proteins [
9,
10].
AFM-SMFS approaches are uniquely suitable for tackling the challenge with transient features of amyloid and intrinsically disordered proteins causing neurodegenerative diseases, such as, for instance, the amyloid-β peptide (the peptide involved in AD) or α-synuclein, a protein implicated in Parkinson’s syndrome. The mechanical unfolding of the monomeric prion protein, i.e., the causal agent of TSE, has been previously explored by means of OT-SMFS experiments. Few studies employing experimental approaches similar to AFM-SMFS are available, and they mainly use monomeric or fibrillar forms of the recombinant prion protein.
TSE are the prototypical misfolding diseases and include Creutzfeldt–Jakob disease in humans, scrapie in sheep and goats, and chronic wasting disease in cervids. The structural conversion of the α-helical folded cellular prion protein (PrP
C) into its pathological form, PrP
Sc, causes TSE [
11]. The very recently solved cryo-electron microscopy (EM) near-atomic structures of infectious, brain-derived PrP
Sc fibrils unveiled a continuum of β-strand serpentine threading of the protein C-terminal domain and following studies have confirmed that the PrP
Sc amyloids feature parallel in-register β-strand stack folding [
12,
13,
14,
15,
16,
17]. However, the conformational conversion of PrP
C into PrP
Sc appears as a multi-step process whose molecular mechanisms still remain unclear and are extremely difficult to probe at the single-molecule level.
In an effort to gain new insights into the initial phases of protein aggregation processes, we applied SMFS to the characterization of the unfolding events occurring in both monomeric and dimeric forms of the recombinant α-helical folded truncated mouse prion protein (hereafter denoted as PrP, from residue 89 to 230) when mechanically pulled.
To stretch PrP molecules by AFM-SMFS approaches, a series of PrP constructs were designed that included monomeric and dimeric PrP forms, each flanked by 4 GB1 protein modules on both N- and C-termini (
Figure 1).
GB1 was chosen as the reference protein module as it has the shortest unfolding length with respect to other available and mechanically characterized proteins that have been used for such a purpose: it has an unfolding or contour length, ΔLc, of about 18 nm. The lengths of the protein chain segments that unfold in each consecutive mechanically-induced unfolding event are reflected by ΔLc, and GB1 polyprotein constructs have characteristic unfolding patterns, well-defined ΔLc and unfolding forces that can serve as internal control modules to validate single-molecule interactions [
18,
19]. It is customary to flank the protein of interest by two series of GB1 domains, such as 4 on each side, in order to obtain easily recognizable saw-tooth pattern force-extension curves. It is also a way to be sure that the protein of interest has been stretched whenever it is possible to recognize and count a number of GB1 unfolding events that would require the stretching of the macromolecule section, including the protein of interest (such as from 5 to 8 GB1 events, when 4 GB1′s flank on both sides).
The dimeric PrP form has been previously reported as an important intermediate in oligomerization [
20]. Therefore, the investigation of mechanical unfolding of dimeric PrP constructs may shed light on the very early events of prion conversion and assembly, which are still unknown.
Our AFM-SMFS measurements confirmed a significant conformational diversity in PrP, involving contacts along the entire polypeptide chain. Interestingly, dimeric PrP unfolding forces appear higher when two C-terminal domains are placed in close proximity.
3. Results
The monomeric and dimeric PrP constructs were flanked by 4 GB1 protein modules on both N- and C-termini. To mimic the tandem modular design of natural elastomeric proteins such as titin, we used a polyprotein GB1 construct consisting of eight identical tandem repeats of GB1 domains. Previous studies demonstrated that GB1 polyprotein exhibits a combination of mechanical features, including fast, high-fidelity folding kinetics, low mechanical fatigue and ability to fold against residual force that make GB1 particularly ideal for AFM-SMFS approaches [
18,
23]. Furthermore, with its 18 nm average unfolding length, GB1 is the smallest protein module in use as a SMFS internal standard, so it was chosen in order to allow a facile determination of the predicted long unfolding events due to protein aggregation. Dimeric PrP were obtained either by an engineered construct encoding for two tandem-arranged PrP segments oriented from the N- to C-termini (hereafter N-C) or by a disulfide linkage between two PrP with an added C-terminal Cys, achieving a C- to C-terminal orientation (C-C) (
Figure 1A). The addition of GB1 modules linked to PrP allowed the expression of soluble polyproteins that were then purified by SEC prior to each SMFS experiment to remove aggregated forms (
Figure S1). The force-unfolding events of the monomeric PrP construct were compared with the control made of the 8 GB1 modules (
Figure 1B,C). As we and others already reported [
18,
23], stretching 8 GB1 results in force–extension curves (FEC) of characteristic saw-tooth pattern appearance (
Figure 1B), with an average unfolding force of about 200 pN for GB1 domains, which is similar to the mechanical stability of the I27 domain from the natural elastomeric protein titin [
24].
In the FEC of
Figure 1B,C, each peak, except the detachment peak, has an unfolding force (denoted as F) and a contour-length increment (ΔLc) which can be extracted for the analysis. Calculating the difference between the L values of two separated rupture events, the change of delta contour length (ΔLc) can be then obtained. This parameter gives the exact length of an unfolded protein module [
25]. A FEC for the mechanical unfolding of (GB1)
4PrP(GB1)
4 was considered valid if it had at least five unfolding peaks with ΔLc of 18.5 nm ± 3 nm (i.e., characteristic of GB1) so that the included PrP module had been certainly stretched. The unfolding peaks in the curve of
Figure 1B,C can be interpreted as GB1 modules (up to eight in total) or as the unfolding of PrP.
While the GB1 octamer control displays only very few unfolding events occurring at contour lengths (ΔLc) longer than 18 nm, the inclusion of a PrP monomer in the construct brings a significant occurrence of events at unfolding lengths in the ΔLc 23–40 nm range (i.e., longer than the GB1 distribution). These unfolding events are not tightly clustered around any specific contour length or unfolding force, but rather they are scattered (
Figure 2 and
Figure S5). The maximum theoretical unfolding length of a complete PrP fragment in AFM-SMFS is about 21 nm. It thus resulted that the monomeric truncated PrP showed a number of rare unfolding events spanning the unfolding length range between 23 and 40 nm, with an apparently continuous distribution in unfolding forces. This behavior witnesses a conformational heterogeneity of PrP in a significant fraction of the molecules. When molecules are not folded natively in the construct, they can create mechanically stable non-native contacts that we record as long-unfolding events.
A quantitative description of the distribution of the unfolding events with a ΔLc in the range of GB1 unfolding (around ΔLc = 18 nm in the
inset of
Figure 2) led us to a further characterization of the molecular system.
Differential analysis of the mechanical unfolding of constructs containing GB1 with or without monomeric PrP exhibits a marked and significant difference. As evident from the kernel density estimation (KDE) “heat maps” in
Figure 3A (see also
Figures S3 and S4 for the individual maps), a clear increase in the number of events at about 19–24 nm unfolding length and at ~200 pN unfolding force is recorded. Even though a partial overlap with the GB1 unfolding distribution is present, these events are clearly distinguishable and they are numerically equivalent to about 0.8 ± 0.2 events per poly-protein unfolding event. Due to the unfolding length values and the numerical consistency, we can interpret that this unfolding region mainly pertains to the native monomeric PrP structure. This unfolding contour length is in very good agreement with the known C-terminal structured portion of PrP that is expected to measure about 21 nm when fully stretched [
26]. We can conclude that PrP has a native fold in most of the protein constructs we adsorbed on the surface and that AFM SMFS unfolding experiments can characterize the conformational heterogeneity of PrP from single molecule data.
As a control, we verified our interpretation of this region in the distribution of unfolding events by preparing a poly-protein construct including only the C-terminal portion of the truncated PrP protein (residues 125–230), flanked by the four GB1 modules at both N- and C-termini. With this, we recorded a force-distance distribution of events that was practically indistinguishable from that of the full PrP construct in the region at less than 23 nm unfolding length, with the same numeric consistency of native PrP events in the range of 19–24 nm unfolding length (
Figure 3B). Such a construct lacking the N-terminal portion of the truncated PrP showed no events in the 24–40 nm region, as expected by the lack of the N-terminal section and confirming that these are due to interactions requiring the full length of the truncated PrP chain.
As additional proof of the unfolding of the PrP native structures, we performed force spectroscopy after chemical reduction of the intramolecular disulfide bridge of PrP. Such a reduction has been reported to dramatically alter the stability of native PrP folding [
27]. Our measurements of the mechanical unfolding of constructs containing the reduced PrP module show no additional stable structures in the 19–23 nm region (
Figure 3C). A slightly more narrowly peaked distribution for GB1 has been observed, possibly due to a lower experimental error in determining the initial point of chain stretching, together with an increase in the unfolded chain length in agreement with a fully unfolded PrP polypeptide chain (
Figure S8). The constructs with the chemically reduced PrP monomer display a slightly lower frequency of long unfolding events at 24–40 nm.
To gain insight into the early aggregation processes, two different dimeric PrP constructs were prepared with flanking GB1: (GB1)
4PrP
2(GB1)
4, where the two PrP moieties display different known reciprocal orientations. In the first one, the C-terminal of the first PrP moiety was linked to the N-terminal of the second PrP moiety, resulting in an N- to C-terminal orientation, i.e., (GB1)
4PrP
2(GB1)
4 N-C construct. In the second, the two C-termini of (GB1)
4PrP were linked together in a C-C orientation via a disulfide bridge between two identical protein molecules, using a C-terminal cysteine appended at position 231, i.e., the (GB1)
4PrP
2(GB1)
4 C-C construct. This preparation strategy leads to unequivocal orientation of PrP in the constructs and enables the study of the effect of the vicinity and of the orientation on the emergence of new associative structures and on the stability of the C-terminal folded portion. Constructs containing PrP dimers in both N-C or C-C orientations display a comparable number of events not only in the 24–40 nm range of non-native unfolding contour-lengths but also in the even longer 40–80 nm range of unfolding lengths (
Figure S6). As these long unfolding lengths are never present in the monomeric PrP construct, we interpreted these unfolding events as characteristic unfoldings of the PrP dimeric constructs. Unfolding lengths of up to 80 nm would comprise the full lengths of the two PrP chains and they could occur only if the native folding of both PrP was missing. The number and the distribution of unfolding lengths and unfolding forces for the 24–40 nm and the 40–80 nm unfoldings do not appear statistically different in the C-C and N-C dimeric constructs (
Figure S7). Unfoldings at lengths larger than 80 nm are virtually absent, confirming the lack of signals due to nonspecific interactions between PrP and GB1.
The analysis of the results of force spectroscopy on the constructs containing dimeric PrP showed a markedly different behavior in the region of the natively folded PrP structure as a function of the reciprocal dimer orientation. While the N-C dimers showed unfolding signals analogous (see
Figure 4A versus
Figure 3A) to the natively folded structure found in monomeric PrP, the C-C dimers lacked such signals completely (
Figure 4A,B).
By comparing the unfolding signals of the two constructs comprising dimeric PrP, we noticed that the C-C dimers contained a new highly populated region of unfolding events at higher force and at a shorter unfolding length than the native PrP folding: an unfolding force of 250–330 pN and 17–21 nm, i.e., even higher force than the unfolding of GB1 (
Figure 4C), which is found at the usual lower force range (e.g., centered at about 210 pN). It turned out that about 0.5 events per poly-protein unfolding were found in this new region, as compared to 1.1 events found in the native PrP region in the N-C dimer, i.e., still lower than twice the 0.8 events per unfolding in the monomer (
Figure S7). The estimates of the size of these populations could be affected by some experimental error, as these regions overlap with the more numerous GB1 module unfolding events. Nonetheless, the almost complete disappearance of the unfoldings of the native PrP structures and the emergence of this new structure with half the occurrence strongly suggested that this might represent the emergence of a new and more stable associative structure that could involve parts of the two neighboring C-terminal structures. In the dimers, the structured C-terminal portions are covalently bound, thus their effective relative concentration is significantly higher than in a solution of monomers, possibly destabilizing their native structure to some extent and facilitating their association.
The linking of the two folded structures through the unfolded N-terminal section leads to them “seeing” each other with an augmented concentration with respect to being independent in solution [
28]. Using the WLC model [
22], we can estimate a reciprocal concentration of about 10–20 mM for the C-terminal portions linked in an N-C dimer. Such a high concentration might be the reason for a lower occurrence of folded native structures than was expected (1.1 versus 1.6 per polyprotein pulled). When the C-terminal sections are instead directly linked through their C termini in the C-C dimers, their relative concentration is much higher, possibly reaching the molar range. It is thus not surprising that their interaction could take place and the formation of associative structures could be very fast. Quite likely, the vicinity of the two natively folded domains in the C-C dimers could pose some hindrance to their correct folding or, alternatively, lead to the stabilization of folding due to interactions between folded structures. The different geometry of the constructs could imply different degrees of molecular crowding, an effect that is expected to lead to the stabilization of folding [
28].
It can be concluded that the orientation of the PrP domains in the dimers plays a strong role in the determination of their mechanical unfolding behavior, leading to strong changes in the mechanical stability of the folded structures, probably due to the possibility of neighboring C-terminal domains to interact with each other.
4. Discussion
In summary, we used AFM SMFS approaches to record a relatively large number of heterogeneous events involving the non-native folding of extended portions of the PrP chain, both when probed in its monomeric and dimeric forms. Our experiments confirmed that the PrP C-terminal domain unfolds with a two-state mechanism without any intermediate, as it was not possible to identify any different pattern of unfolding. The peculiar orientation of the PrP dimers used in this study let us evidence a significant change in the nanomechanical unfolding signals, possibly due to the emergence of stable associative structures involving the otherwise natively folded C-termini.
The relevance of PrP dimers has been presented and argued many times in the literature, often with conflicting results. In early studies, it was suggested that the dimeric form of PrP
Sc was the smallest possible size for infectivity [
29,
30]; conversely, PrP
C dimers may exhibit a protecting activity against prion replication [
31]. Later, X-ray crystallography studies on the C-terminal PrP domain showed the possibility of dimerization via an interchain disulfide bridge that forms due to domain-swapping, although the physiological relevance of this dimer was not clear [
32]. Cell biology studies using engineered constructs expressing covalently linked PrP
C propose that PrP
C homodimerization might represent a protective dominant-negative mechanism that sequesters PrP
C from prion conversion [
33,
34]. However, the structural events leading to dimer formation, i.e., if the dimer is in an N-C or C-C conformation, are unclear so far. Only recently, two dimeric native PrP forms have been described at the atomic level, called α1 and α3 dimers as a reference to the α-helices involved in the dimerization interface. Notably, dimer formation requires a C-C orientation of each monomer, and this building block might potentially lead to an infinite polymer [
35]. The cryo-EM structure of brain derived fibrils supports the model of an amyloid composed of monomeric PrP
Sc units disposed in C-C orientation [
12].
Recently, Woodside and coworkers studied hamster PrP dimer constructs with DNA handles with OT SMFS [
36]. Their findings seem to differ from ours, as they report that dimers lack their native folding. The OT studies are done in different conditions than the AFM and probe proteins in different ranges of force-loading rates. Furthermore, while in our case, the AFM probes each solution-folded molecule only once, the OT experiments commonly perform many unfolding-refolding cycles (on an often more limited number of different molecules) in order to study folding kinetics and build molecule statistics. Woodside and coworkers could describe a number of intermediate misfolded structures in their study thanks to the high force sensitivity of their OT experiments. As in our system it is only possible to probe more mechanically stable structures, pulled at a higher loading rate, our study might be reporting the behavior of the molecular system probed farther away from equilibrium. Our finding of a strongly heterogeneous set of structures involving a large portion of the PrP chain might be the result of this.
Globally, it can be asserted that AFM SMFS data can record a relatively large number of heterogeneous events involving the non-native folding of extended portions of the PrP chain, both when probed in its monomer and in its dimeric forms.
Our work provides the first biophysical description of the unfolding events linked to a dimer form oriented in a C-C conformation. We propose that this early dimeric form could represent a building block of an amyloid fibril. Additional single-molecule experiments and novel constructs harboring more PrP fragments or different internal polyprotein fingerprinting constructs [
19] might shed more light on the still obscure initial states of PrP aggregation.