1. Introduction
In the eukaryotic nucleus, genomic DNA is packed into chromatin, which regulates all nuclear processes involving DNA, including DNA replication, DNA repair and transcription. Histones represent the major protein component of chromatin and comprise core and linker histones. Core histones (H2A, H2B, H3 and H4) assemble in a histone octamer, around which approximately 146 bp of DNA is wrapped, forming a nucleosome. Linker histone H1 binds the free DNA (~20 bp-long) present between individual nucleosomes, contributing to the formation of higher-order chromatin structures. Together, the nucleosome, linker DNA and linker histone H1 form the chromatosome. Core histones exert their function mainly through a number of reversible post-translational modifications (PTMs) that can be deposed at their
N-terminal tails, whose role in regulating gene expression has been extensively studied and elucidated in recent years [
1]. In contrast, much less is known regarding the role of histone H1 besides its general chromatin condensation function, although increasing evidence indicates histone H1 potential to regulate transcription in a locus-specific manner [
2,
3,
4].
Linker histones comprise 11 variants (or subtypes) in human and mouse, including seven somatic variants (H1.0, H1x and H1.1 to H1.5), three testis-specific variants (H1t, H1T2 and HILS1) and one oocyte-specific variant (H1oo). Histones H1.1 to H1.5 are expressed in a replication-dependent manner, while histones H1.0 and H1x are replication-independent variants transcribed throughout the cell cycle [
5]. The definition of the structures of the chromatosome core particle and of the 30 nm nucleosome fiber revealed that histone H1 variants can bind differently to the nucleosome, determining distinct higher-order chromatin structures and contributing to the regulation of nuclear functions, including transcription, DNA replication and repair, and genome stability (reviewed in [
3]). In addition to existing as different variants, histone H1 contains different PTMs that may modulate its functions, which include methylation, phosphorylation, acetylation, ubiquitination, citrullination, formylation and ADP ribosylation [
6].
Alterations in the levels of bulk histone H1 and of specific variants have been observed in cancer (reviewed in [
7]). For instance, a 40% reduction of bulk H1 mRNA levels was found in ovarian adenocarcinoma [
8], and a number of variant-specific changes have been observed in different types of tumors [
7]. Generally, replication-dependent variants are upregulated in tumors compared with normal tissues, as the result of increased cell proliferation. Histone H1.0, which is usually present at high levels in normal tissues, shows heterogeneous patterns in cancers and is overall reduced, especially in aggressive and undifferentiated tumors [
7]. These results suggest that H1 levels could be useful biomarkers to discriminate benign and malignant lesions and to possibly provide information about patient prognosis.
Finally, recurrent mutations of histone H1 subtypes have been found in different types of cancers [
7]. Although their relevance in tumorigenesis is currently poorly understood, this finding could support a driver role played by linker histones in cancer, whose investigation could not only lead to the identification of biomarkers for patient stratification, but possibly also to the discovery of novel epigenetic mechanisms and therapeutic strategies.
Although histone H1 investigation in clinical samples relied so far mostly on the analysis of RNA expression levels or antibody–based detection of protein levels, none of these approaches is ideal. First, alterations in mRNA levels may not correspond to changes in protein abundance, since H1 genes are regulated at the post-transcriptional level and subjected to translational control [
9]; second, antibody-based methods, such as immunohistochemistry, require variant-specific antibodies, which represents a major challenge given the high similarity of the H1.1–H1.5 subtypes (
Figure 1). In this context, mass spectrometry (MS)–based proteomics offers the ideal tool to analyze in a quantitative manner the protein levels of different histone H1 variants, also in clinical samples.
In this study, we applied a label-free MS-based proteomics approach to the analysis of histone H1 variants in clinical samples, including very low abundance tissues. After setting up the method in cell lines and laser micro-dissected mouse samples, we applied it to human breast cancer samples, detecting differences in several variants between triple-negative breast cancers with different outcomes.
3. Discussion
In this study, we show the feasibility of applying a simple label-free MS-based approach for the quantitative and simultaneous analysis of histone H1 variant levels from clinical samples. This method takes advantage of the MaxLFQ algorithm, which has been extensively used in quantitative global proteomics studies, but whose application to protein isoforms, such as histone H1 variants, has not been investigated so far. Our experiments show that the MaxLFQ algorithm allows a precise quantification of histone H1 isoforms even in the presence of different amounts of starting material and from very low-abundance samples. Although the MaxLFQ quantification of histone H1 isoforms can be used from unfractionated samples (data not shown), we chose to perform a SDS–PAGE separation combined with an in-gel digestion prior to MS analysis, in order to eliminate MS contaminants from the sample preparation and to select a specific MW range for digestion, which enriches histone H1 relative to background proteins. In addition, this in-gel strategy allows performing histone H1 and core histone PTM analyses, which we typically carry out through in-gel digestion followed by protein derivatization ([
17] and unpublished data), from the same samples. This allows a comprehensive epigenetic profiling of samples that may be available in very limited amounts, such as clinical specimens. In this regard, we also verified that our method allows quantifying potentially all histone H1 variants from as low as 1000 laser micro-dissected cells, although some variants may be missed if expressed at particularly low levels in specific tissues.
To aid the identification of histone H1 peptides from samples available in low amounts, we took advantage of the “match between runs” feature available in MaxQuant, which was used to transfer identifications, derived from a standard sequence database searching approach, from an abundant reference sample to low-abundance samples. An alternative approach is represented by spectral library searching, where observed MS/MS spectra are searched against a library of experimental MS/MS spectra to assign identifications [
18]. Because the library spectra contain peaks from non–canonical fragment ions that may not be present in a standard database containing in silico digested proteins, as well as intensity information, spectral library searching is particularly useful to increase identification rates for lower-quality spectra, such as those resulting from low-abundance samples. Histone H1 variants could also be analyzed though targeted MS approaches [
19], where proteotypic peptides, namely peptides that uniquely represent a target proteins/protein variant and that can be detected in a reproducible manner, can be followed with increased throughput and sensitivity. Our results could help the design of targeted strategies by providing information on histone H1 peptides that can be reliably detected across the samples analyzed in this study.
We verified that histone H1 variants can be analyzed from frozen (including OCT frozen) as well as FFPE tissues, thus including essentially all types of patient samples available from hospital biobanks. For the proof-of-concept laser micro-dissection experiments performed on patient tissues, we isolated different tumor/normal regions to be analyzed based on morphological evaluation of the tissue. As an alternative, the choice of the tissue areas could be based on the spatial molecular profiles obtained by matrix-assisted laser desorption/ionization (MALDI) imaging. This technique combines MS ability to analyze in a comprehensive and unbiased manner a high number of analytes, such as protein or peptides, with the capability of obtaining a spatial distribution of such analytes within tissue sections [
20]. MALDI imaging has emerged as a useful tool for cancer diagnosis and prognosis, determination of tumor margins and investigation of tumor heterogeneity [
21]. In particular, it has been employed to guide the definition of different areas within heterogeneous breast cancers to be analyzed by microproteomics [
22]. Similarly, one can imagine that MALDI imaging profiling of proteins, or ideally histone H1 variants, could be used to select areas to be laser micro-dissected and processed for MS analysis. A MALDI imaging workflow was developed and used to characterize the in situ distribution of all somatic H1 subtypes (except H1.1) in the mammalian brain [
23]. This method has also been applied in preliminary studies to cancer patient samples [
24].
The analysis of histone H1 variants in breast cancer clinical samples revealed a significant decrease of various variants in triple-negative breast tumors with worse outcome. Usually, H1.0 is highly expressed in normal tissues and is downregulated in various types of cancers, as well as in higher-grade and more aggressive tumors [
25,
26]. Furthermore,
H1F0 is an independent predictor of patient survival in breast cancer, especially in triple-negative breast cancer [
16]. Consistent with these observations, we found a decrease of H1.0 in tumor compared with normal cells, and in tumors showing relapse. Among the breast cancer molecular subtypes, the triple-negative group includes tumors without any known molecular targets, whose treatment is mostly limited to chemotherapy and is often not successful. The recent finding that high H1.0 levels can be restored by Quisinostat suggests that our method could be useful to stratify patients that would most benefit from Quisinostat treatment to prevent disease relapse [
14]. Interestingly, we found that the other histone H1 variants were also decreased in triple-negative tumors showing relapse in both the datasets analyzed. Although cell-cycle dependent isoforms are often found at increased levels in tumors with higher proliferation rates and more aggressive features, alterations of histone H1 patters can be context-specific. For instance, it has been shown that H1.1, H1.4 and H1x, in addition to H1.0, were significantly reduced in malignant ovarian adenocarcinomas compared with benign adenomas [
8]. Our results suggest that epigenetic mechanisms may be involved in the process of tumor relapse, and that the change in histone H1 variants will be worth further investigations both as potential biomarkers predictive of patient outcome and as an epigenetic mechanism underlying tumor recurrence. It will also be interesting to investigate histone H1 at the level of PTMs associated with different variants, which can be performed from the same data acquired to quantify histone H1 amounts, as our protocols preserve modifications. This information, together with the analysis of core histone PTMs, will contribute to gaining a more complete picture of epigenetic mechanisms linked with cancer.
4. Materials and Methods
4.1. Tissue Culture
MEFs expressing a doxycycline-inducible H1.0-targeting shRNA (5′UAGCAAAUUCGAAUCAACUGGA-3′, Mirimus, Brooklyn, NY, USA) were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM), supplemented with 10% FBS, 2 mM L–glutamine, 100 U/mL penicillin and 100 µg/mL streptomycin at 37 °C in 5% CO2. To induce H1.0 knockdown, MEFs were treated with 1 μg/mL of doxycycline (Merck KGaA, Darmstadt, Germany, D9891) for 10 days (H1.0 KD). To increase H1.0 levels, MEFs were treated with 100 nM Quisinostat (Insight Biotechnology, Wembley, UK, HY–15433-1mL) for 24 h. Untreated MEFs were used as control.
4.2. Patient Samples
The use of patient samples was conducted in accordance with the Declaration of Helsinki. Patient samples stored in OCT were obtained from the Biobank for Translational Medicine Unit (B4MED) of the European Institute of Oncology in Milan. Sample collection by the Biobank, in the presence of patient consent, was approved by the Ethical Committee of the European Institute of Oncology on 6 June 2011, and the samples can be used for retrospective studies without any further approval by the Ethical Committee [
27]. The levels of hormone receptors, Her-2 and Ki-67, were ascertained by immunohistochemistry. Breast cancer subtypes were defined as follows: Luminal A-like, ER (estrogen receptor) and/or PgR (progesterone receptor)(+), HER2 (Human epidermal growth factor receptor 2)(−), Ki67 < 20%; luminal B–like: ER and/or PgR(+), HER2(−), Ki67 ≥ 20; triple-negative: ER, PgR, and HER2(−), irrespective of Ki67 score; HER2–positive: HER2(+), irrespective of ER, PgR, or Ki67. ER/PgR positivity was defined as ≥ 1% of immunoreactive neoplastic cells, and HER2 positivity was defined as >10% of neoplastic cells with strong and continuous staining of the cell membrane (3 + by immunohistochemistry) and/or amplified by in situ hybridization techniques, in accordance to the ASCO (American Society of Clinical Oncology)/CAP (College of American Pathologists) guidelines. The samples were selected and evaluated by a trained pathologist. Samples with infiltrating carcinoma were selected to have a tumor cellularity of at least 40%, as assessed by hematoxylin and eosin (H&E) staining. Specimens with in situ carcinoma areas, large necrosis areas and massive flogistic infiltration were discarded. For storage, samples were collected and snap-frozen in liquid nitrogen, frozen in optimal cutting temperature compound, or fixed overnight in 4% formalin and embedded in paraffin. The list of patient samples analyzed in this study is summarized in
Table S1.
4.3. Laser Micro-Dissection
For experiments with mouse pancreas, 4 µm thick FFPE sections or 10 µm thick snap-frozen OCT–embedded cryosections of mouse adult pancreas were either processed for protein extraction as whole sections or mounted on polyethylene naphthalate membrane (PEN) slides (Leica No. 11600289) previously UV–photoactivated in a UV crosslinker for 30 min (BLX–254, Bio–Link). FFPE sections were de-paraffinized with two changes of xylene, while OCT–embedded sections were fixed in cold anhydrous ethanol for 3 min before proceeding to partial rehydration in graded alcohols up to 50%. Sections were then counterstained for 30 s with alcoholic-based buffered cresyl-violet freshly prepared (0.8% cresyl violet in 60% EtOH and 4 mM Tris–HCl, pH 8.0), washed twice in 75% EtOH and air dried completely before proceeding to the micro-dissection. Areas of pancreatic tissues corresponding to ~20,000, 5000, 2500, and 1000 acinar cells were micro-dissected using a UV–based LMD7 laser micro-dissection system (Leica Microsystems, Wetzlar, Germany), collected into the caps of 0.5 mL tubes and stored at 4 °C until further processing. Experimental procedures involving animals were performed in accordance with the Italian Laws (D.lgs. 26/2014), which enforces Dir. 2010/63/EU (“Directive 2010/63/EU of the European Parliament and of the Council of 22 September 2010 on the protection of animals used for scientific purposes”). All animal procedures were approved by the OPBA (Organismo per il Benessere e Protezione Animale) of the Cogentech animal facility at the IFOM-IEO Campus, Milan, and authorized by the Italian Ministry of Health. For the experiment shown in
Figure 5A,B, 10 µm thick sections from a fresh-frozen breast cancer sample were mounted on PEN slides and stained with hematoxylin. Areas corresponding to normal epithelial cells (four areas, 1800–2100 cells each) or infiltrating carcinoma were collected (two tumor regions, fours areas/tumor region, 4200–4500 cells each) by laser micro-dissection, as described above.
4.4. Sample Preparation for MS Analysis
Culture cells (0.5–2*106) were resuspended in 1 mL of phosphate buffered saline (PBS) buffer containing 0.1% Triton X–100 and protease inhibitors. One-tenth of the preparation was lysed by adding 0.1% SDS (whole lysates). Nuclei were isolated from the remaining solution through 10 min centrifugation at 2300× g, resuspended in 100 µL of the same buffer containing 0.1% SDS and incubated for a few minutes at 37 °C in the presence of 250 U of benzonase to digest nucleic acids. Protein concentration was evaluated with the bicinchoninic acid assay (BCA, Thermo Fisher Scientific, Waltham, MA, USA), and 5–10 µg of proteins were loaded on a 4–12% precast gel (Thermo Fisher Scientific, Waltham, MA, USA).
Laser micro–dissected tissue pieces were transferred at the bottom of the tubes through 3 min centrifugation at maximum speed and processed through methods previously developed for histone PTM analysis [
17], which were adapted to low-abundance samples. FFPE micro-dissected tissue pieces were deparaffinized once in 200 µL hystolemon (Dasit Group Carlo Erba, Cornaredo, Italy) and rehydrated in the same volume of solutions containing decreasing concentrations of ethanol (95%, 50%, 20% ethanol and water). The same rehydration steps were also performed for the frozen micro-dissected samples. Then, all the samples were resuspended in 35–40 µL 20 mM Tris pH 7.4 containing 2% SDS and homogenized by sonication in a Bioruptor device, through 10 cycles (30 s on/30 s off) at high potency. For FFPE samples, proteins were extracted and de-crosslinked at 95 °C for 45 min and 65 °C for 4 h. The whole volume was then loaded on a SDS–PAGE gel. OCT and FFPE whole sections (10 µm thick, which contain approximately 1.5 × 10
6 cells) were collected in 1.7 mL tubes and processed as described in [
17]. Protein content was evaluated with the BCA Protein Assay kit (Thermo Fisher Scientific, Waltham, MA, USA), and 10 µg of protein extract (corresponding to approximately 1/3 of the total preparation) were loaded on a gel. A large band (20–45 kDa) around the size of histone H1 variants was excised for in–gel digestion with trypsin [
28].
4.5. LC-MS/MS
The MS analysis was performed on a 50 cm EASY-Spray column connected online to a Q Exactive HF instrument through an EASY-Spray™ Ion Source (Thermo Fisher Scientific, Waltham, MA, USA). Solvent A was 0.1% formic acid (FA) in ddH2O and solvent B was 80% ACN plus 0.1% FA. Peptides were injected in an aqueous 1% trifluoroacetic acid (TFA) solution at a flow rate of 500 nL/min and were separated with a 95 min 3%–60% gradient of solvent B (80 min 3–30%, 10 min 30–40%, 5 min 40–60%), at a flow rate of 250 nL/min. The Q Exactive HF instrument was operated in the data-dependent acquisition (DDA) mode to automatically switch between full scan MS and MS/MS acquisition. Survey full scan MS spectra (m/z 375–1650) were analyzed in the Orbitrap detector with a resolution of 60,000 at m/z 200. The 10 most intense peptide ions with charge states comprised between 2 and 4 were sequentially isolated to a target value for MS1 of 3 × 106 and fragmented by higher-energy collisional dissociation (HCD) with a normalized collision energy setting of 28%. The maximum allowed ion accumulation times were 20 ms for full scans and 80 ms for MS/MS, and the target value for MS/MS was set to 1 × 105. The dynamic exclusion time was set to 20 s, and the standard mass spectrometric conditions for all experiments were as follows: spray voltage of 1.8 kV, no sheath and auxiliary gas flow.
4.6. MS Data Analysis
Acquired raw data were analyzed using the integrated MaxQuant software v.1.6.2.3 (Max Planck Institute of Biochemistry, Planegg, Germany [
12]), which performed peak list generation and protein identification using the Andromeda search engine. The Uniprot HUMAN_1802 and MOUSE 2019 databases were used for peptide identification. Enzyme specificity was set to trypsin, and two missed cleavages were allowed. Methionine oxidation and
N-terminal acetylation were included as variable modifications, and the FDR was set to 1%, both at the protein and peptide level. The label-free software MaxLFQ [
11] was activated as well as the “match between runs” feature (match from and to, matching time window = 2 min). The LFQ values for histone H1 variants were extracted from the “protein groups” MaxQuant output file and analyzed using Perseus [
29] and GraphPad Prism (GraphPad). Changes in histone H1 variant levels between two different conditions were evaluated by two-stage linear step-up procedure of Benjamini, Krieger and Yekutieli, with FDR = 5%, while changes among three conditions were evaluated by one-way ANOVA. iBAQ scores (calculated by dividing a protein’s total intensity by the number of tryptic peptides between 6 and 30 amino acids in length) were also generated by the MaxQuant algorithm and used to evaluate the abundance of a protein relative to the others present in the same sample [
30]. The LFQ values for histone H1 variants in the samples described in the manuscript are reported in
Tables S2–S6. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium [
31] via the PRIDE partner repository with the dataset identifiers PXD020537 and PXD020524.
4.7. Immunofluorescence
Quantitative immunofluorescence microscopy was performed to measure H1.0 levels. Ten thousand cells for each condition (control, H1.0 KD and Quisinostat-treated) were plated onto poly-l–lysine–coated coverslips (Thermo Fisher Scientific, Waltham, MA, USA, 354085) for 30 min prior to fixation with 4% paraformaldehyde (Thermo Fisher Scientific, Waltham, MA, USA, 43368) for 15 min. Cells were then washed three times with PBS and permeabilized with 0.5% Triton–X100 in PBS for 5 min. Immunofluorescence staining was performed by first incubating cells for 1 h in blocking buffer (PBS containing 3% BSA and 0.05% Triton X–100), incubating for 1 h with a primary mouse monoclonal H1.0 antibody (Upstate, 05-629, 1:100), washing three times with PBS, and finally incubating for 1 h incubation with a donkey anti-mouse AlexaFluor® 568 (Thermo Fisher Scientific, Waltham, MA, USA, A10037) diluted 1:400 in blocking buffer. Cells were finally washed three times with PBS and mounted onto glass slides with Vectashield containing DAPI (Vector H–1200). Quantification of the fluorescent signal was performed using Metamorph software.
4.8. Quantitative RT-PCR
RNA extraction was carried out using the RNeasy Plus Mini Kit (Qiagen, Hilden, Germany, 74134) following the manufacturer’s instructions, and cDNA was generated using the High Capacity cDNA Reverse Transcription Kit (Merck KGaA, Darmstadt, Germany, 4374966). Gene expression levels were analyzed on a CFX96 real-time PCR detection system using SsoAdvanced™ Universal SYBR® Green Supermix (Bio-rad, Hercules, CA, 1725274), CFX manager 3.0 software and the following primers: H1.0 FW: 5′-CTGGCTGCCACGCCCAAGAA-3′, H1.0 RV: 5′-CGGCCCTCTTGGCACTGGCA-3′; PPIA FW: 5′-GTCAACCCCACCGTGTTCTT-3′, PPIA RV: 5′-CTGCTGTCTTTGGGACCTTGT-3′. PPIA encodes for Cyclophilin A and was used as reference housekeeping gene.