1. Introduction
The extracellular matrix (ECM) is a complex macromolecular network that surrounds the cells of all tissues and organs [
1]. The ECM is a product of the cells; thus, it is organ- and tissue-specific. By providing adhesion and anchorage to the cells [
2], the ECM ensures tissue integrity. The composition and architecture of the ECM change with modifications of cellular phenotypes. In turn, the differentiation and biological activity of the cells are reciprocally controlled via signaling from the ECM [
1,
3,
4]. Proteins of the ECM (collectively termed the “matrisome” [
5]) play an essential role in tissue and organ development [
6] and cellular metabolism [
7]. According to the classification proposed by Naba et al. [
5], the matrisome consists of the “core matrisome” and “matrisome-associated proteins”. The core matrisome is formed by structural ECM components (collagens, ECM glycoproteins and proteoglycans) [
8,
9]. The term “matrisome-associated proteins” covers the proteins which are found in the ECM but can not be classified as core matrisome components. These proteins are further categorized as (i) ECM-affiliated proteins, which have a similar architecture to ECM proteins and/or are known to be associated with ECM proteins, (ii) ECM regulators, and (iii) secreted factors, which are shown to interact with the ECM [
5,
10]. Identified matrisome proteins of different human and mouse tissues and tumors are deposited in the “Matrisome Database” (MD) (
http://matrisomeproject.mit.edu/, accessed on 23 January 2023).
Pathological changes of the matrisome (e.g., excessive accumulation, destruction or dysfunctionality of certain proteins or protein groups) orchestrate many diseases of diverse etiologies, with particular involvement in inflammation, wound healing, cancers and fibrosis [
11,
12,
13]. A deep analysis of the matrisome is an invaluable instrument for understanding tissues’ and organs’ functions, as well as the mechanisms and biomarkers of various diseases.
In the last decade, mass spectrometry (MS) has become the leading analytical approach in matrisome studies. It is intensively used for the discovery of biomarkers and assessment of the ECM [
14]. Rapid technological developments in proteomics, such as enhanced sample preparation protocols, database searching, and bioinformatics analysis now allow unbiased protein quantification, identification and characterization, including the discovery of post-translational modifications (PTMs) [
15]. However, despite these impressive advancements, standard protein extraction techniques, which use whole tissue lysates, need to be better adapted to matrisome studies. The continuing challenges in ECM proteomics are the enrichment of matrisome proteins in the samples by separating low-abundance ECM proteins from intracellular proteins, and solubilizing heavily cross-linked [
9] extracted proteins. Additionally, identification and quantification of the matrisome proteins in the presence of abundant PTMs is a complex task [
16].
To tackle the difficulty of matrisome enrichment, several variations of decellularization methodologies that gradually reduce the amount of cellular components in the sample while preserving the ECM proteins have been proposed. The most widely used extraction method applicable for proteomic studies of the matrisome [
5,
17,
18,
19,
20] employs a commercially available kit from Millipore for tissue protein fractionation by sequential incubations of the sample in buffers of different pH, salt and detergent concentrations. This procedure results in the biochemical separation and removal of the proteins of the cells’ cytosolic, nuclear, membrane and cytoskeletal compartments. It then allows the enrichment of ECM proteins in the residual insoluble product. This method is further referred to as “Compartmental Matrix Enrichment” (CME). Another method is termed “Sequential Matrix Enrichment” (SME). It uses guanidine hydrochloride (Gu-HCl) to further solubilize insoluble matrisome proteins after a decellularization step [
20,
21,
22,
23]. This extraction method is performed with an ionic (NaCl) buffer to additionally extract loosely bound ECM proteins (enzymes, secreted factors, ECM-associated and newly deposited proteins), as well as a low-concentration detergent (sodium dodecyl sulphate, SDS) with a shorter incubation time to eliminate intracellular proteins, and then uses a Gu-HCl buffer to enhance the obtaining of heavily cross-linked ECM proteins.
In the current study, we applied these two ECM extraction methods (
Figure 1a) to explore the matrisome of kidneys in healthy adult mice with an overall goal of creating a reference database for future renal research. This work was particularly motivated by the fact that, now, information on the matrisome composition of mouse kidneys is very limited and incomplete. While mouse models are crucial for studies of renal morphogenesis and diseases [
24], chronic kidney disease (CKD) [
25], as well as for drug and biomarker discovery [
26,
27], only two recent studies are currently available on the identification of ECM proteins in the insoluble tissue fractions of mouse kidney extracts [
20,
28]. Importantly, the methodology applied in these studies did not allow the identification of loosely bound ECM proteins, which suggests incomplete characterization of the matrisome. Moreover, quantification of ECM proteins in mouse kidneys has yet to be achieved.
The matrisome proteins extracted using CME and SME methods were further studied with liquid chromatography using a tandem mass spectrometry (LC-MS/MS) system (
Figure 1b). Firstly, the identified proteins were explored with the Matrisome Database (MD) and UniProt databases. To compare the abundance of identified matrisome proteins from the two extraction methods, MaxQuant label-free quantification (LFQ) was employed, and the data was then analyzed using LFQ-Analyst [
29].
3. Discussion
The current understanding of matrisome composition and how it is regulated during pathophysiological processes remains very limited. One of the major obstacles is the need for optimization of ECM protein extraction and characterization methods. Many previous studies which examined kidney tissues of different species mainly used the decellularization approach that was developed for tissue engineering applications and involved removal of cellular proteins followed by matrisome protein examination in the decellularized scaffolds [
33,
34,
35]. However, long detergent incubations in tissue engineering-focused decellularization methods may cause degradation and elimination of some matrisome proteins [
36].
In the current study, we aimed to comprehensively characterize the mouse renal matrisome using proteomic technologies. In order to overcome the limitations imposed by the tissue engineering decellularization technologies, we first examined two ECM extraction and enrichment methods (CME and SME) in healthy mouse kidney tissues (see
Figure 1).
The first method, CME, relied on using a commercially available Millipore Compartment Protein fractionation kit. This method biochemically separates subcellular proteins and enriches matrisome proteins in the insoluble pellet at the final step of the extraction series. The method is well documented in previous studies, and many ECM proteins have been identified by using this extraction method. For example, Naba et al. [
5] identified 100 matrisome proteins in mouse lungs and colon, Schiller et al. [
18] identified 435 matrisome proteins in healthy mouse lungs and Gocheva et al. [
19] identified 113 matrisome proteins in human lungs. Just recently, this method has been applied for the first time by Lipp et al. [
28] to identify 79 matrisome proteins of mouse kidneys. In our study using Millipore Compartment Fractionation (here termed the CME method), we detected 83 matrisome proteins of which 16 proteins were not previously listed in the MD. Although our study and Lipp et al. [
28] used the same matrisome enrichment method, the protein precipitation, deglycosylation and LC-MS/MS set-up were different. This could explain the higher total number of matrisome proteins identified in our work, compared to the reference [
28]. It is important to note that the Millipore Compartment Fractionation only analyses one fraction, while the loosely bound and soluble matrisome proteins potentially remain in the “cellular” fractions that are not used in matrisome studies.
In an attempt to improve the ECM protein isolation on the second half of the same kidney sample, we applied another matrisome enrichment method, a sequential matrix extraction (SME). This method allowed the extraction of loosely bound or soluble matrisome proteins using a high-salt buffer before removal of cellular components by SDS and enriched the matrisome fractions by solubilizing the insoluble pellet with Gu-HCl [
22]. Using a method similar to our SME approach, Massey et al. [
23] identified 79 matrisome proteins from all three fractions of mouse liver. This extraction technique is potentially more comprehensive for identifying matrisome proteins, but it has not yet been optimized for kidney tissue. Our study, for the first time, used this sequential approach (SME) to identify matrisome proteins in mouse kidneys. We have identified 105 matrisome proteins, of which 22 proteins were yet to be listed in the currently available MD (
http://matrisomeproject.mit.edu/, accessed on 23 January 2023) (see
Figure 2). Compared to the results obtained by Lipp et al. [
28], our study could distinctly detect 57 matrisome proteins. These 57 proteins included mostly the matrisome division of ECM-associated proteins. We can infer from these findings that SME fractions allow for more efficient detection of ECM-associated proteins compared to CME.
It should be noted that the location of most of the 22 proteins have not previously been exclusively attributed to ECM compartments, but to different “Gene Ontology-Cellular Components” or/and “Subcellular Locations”, including ECM space/regions in UniProt. For example, Nucleobindin-1 (Nucb1) was previously shown to be localized in the Golgi apparatus, cytoplasm, or extracellular regions which were inferred from genome-based computational annotations (
https://www.uniprot.org/help/gene_ontology). However, it was additionally manually asserted in UniProt as “Secreted” and recently it has been shown that Nucb1 is secreted with metalloproteinase-2 (MMP-2) and regulates ECM remodeling [
37]. Hence, we described this protein as an ECM-regulator. Another set of examples includes enzymes with transmembrane domains such as Ace, Ace2, Dpp-4 and Mme. These enzymes are localized at the cell surface and have extracellular, cytoplasmic and transmembrane domains. However, it has been shown that they may also be present in soluble forms after cleavage and play a role in ECM remodeling ([
37,
38,
39,
40,
41,
42]). Similar to the examples above, we referred to the literature in order to confirm the identified proteins as components of the matrisome (see
Table A7 in
Appendix A).
We then compared mouse matrisome proteins identified by studies of McCabe et al. [
20], Lipp et al. [
28] and Liu et al. [
30], and human matrisome proteins identified by the studies of Louzao-Martinez et al. [
31] and Randles et al. [
32]. These studies also differed in matrisome protein enrichment, protein precipitation, deglycosylation, digestion and LC-MS/MS methods (see
Figure A1,
Table A9 in
Appendix A). The comparison was undertaken to identify mouse and human species-specificity of kidney matrisome proteins. The comparison could identify 95 matrisome proteins only identified in mice and 44 only identified in human kidneys (see
Table A10,
Table A11,
Table A12,
Table A13,
Table A14 and
Table A15 in
Appendix A). Interestingly, five of the matrisome proteins shown in this study—namely Lgals3bp (glycoprotein), Mfge8 (glycoprotein) Emcn (ECM-affiliated), Apoe (ECM regulator) and Sod3 (ECM regulator)—were previously identified only in human kidneys but not in mouse kidneys [
32]. However, among mouse kidney studies, only our work identified those proteins in mouse kidneys. Hence, our study shows the necessity of using different approaches to identify matrisome proteins and thus develop a robust matrisome protein list.
In addition to matrisome protein identification, our study for the first time allowed quantification of the ECM proteins extracted from heathy mouse kidneys. We performed a discovery proteomics study known as shotgun proteomics, which uses bottom-up approaches based on unbiased analysis. To compare the quantities of proteins obtained via each extraction method, we applied label-free-based quantification (LFQ) data generated with MaxQuant analysis [
29]. This is a widely used method in which the quantitation can be performed either based on chromatographic ion intensities or based on spectral counting of identified proteins [
43]. It provides high throughput and does not have the limitations of label-based quantifications such as high complexity of sample preparation, requirement of high concentrations of samples and incomplete labelling [
15,
43]. In our study using LFQ, we successfully quantified 57 matrisome proteins via CME and 81 proteins via SME (see
Figure 5). Some of the quantifiable proteins were more abundant either in CME or SME. This study demonstrated that CME could detect greater intensities of core matrisome proteins, but fewer ECM-associated proteins, compared to SME (see
Figure 4). To show the benefit of each extraction method, which can be used as a guideline to study different matrisome proteins in different kidney diseases, we compared SME fractions with CME, separately (see
Table S2 in Supplementary Information). For example, IgA nephropathy (IgAN), which is one of the most prevalent chronic glomerular diseases, had significantly more matrisome proteins (Col4a1, Lamb1, Hspg2, Emilin, Fgg and Fbln) in diseased patients compared to healthy patients [
34]. These proteins could be better detected using CME, while Col4a1 and Col15a1, which can be detected by both methods and were also elevated in IgAN, could be better detected by CME or in the SME-F3 fraction. In renal fibrosis, the hallmark of CKD, it is known that Col 1, 2, 3, 5, 6, 7 and 15, Fn1, Dcn and Bgn accumulate in the ECM [
44]. We detected those matrisome proteins using both extraction methods, although some of them had higher abundances in CME product. Collectively, CME was powerful at extracting several matrisome proteins in higher abundances, while SME allowed for the quantification of more matrisome proteins.
Subsequently, we quantitatively analyzed matrisome proteins in SME fractions. It has previously been shown in mouse lung [
21] and liver [
23] that sequential extraction methods, similar to that of the SME used in our study, can extract loosely bound matrisome proteins by using a NaCl buffer which displaces polyanionic interactions and further solubilizes the insoluble proteins using Gu-HCl [
21,
23]. The SME-F1 fraction allowed detection of mostly secreted, basement membrane proteins and some interstitial matrix proteins (Col1 and Col6). SME-F3 distinctly revealed proteoglycans which could not be found in other fractions, but also detected interstitial matrix and basement membrane proteins. Conversely, the SME-F2 fraction allowed for detection of only a few core matrisome proteins which are located in the interstitial matrix, and also secreted and basement membrane proteins. Although the fractions of the extracted samples revealed matrisome proteins from different ECM locations, each fraction had advantages in extracting different matrisome proteins.
Although many matrisome proteins identified in this study have also been previously identified in other studies, our study was able to detect additional matrisome proteins. The most plausible explanation is the use of different extraction methods. For example, LOX enzymes, which are involved in collagen cross-linking, could not be detected in our extraction methods, but were detected in another mouse kidney proteomic study where different extraction methods were used with a combination of a modified deglycosylation/protein digestion process [
20,
28]. In addition, the removal of glycosaminoglycans (GAGs) by digestion enzymes such as chondroitinase and heparinase has been suggested to improve peptide identification [
22,
45,
46]. However, McCabe et al. [
20] reported that using GAG-digesting enzymes only improved proteoglycan identification, but not for other matrisome proteins. It is important to note that this study was performed on healthy kidney tissue, while it is known that GAG deposition in the ECM is increased during disease, e.g., fibrosis [
47,
48]. Hence, the opportunities for the improvement of analysis by GAG-digesting enzymes are still not completely clear, and our study is limited as we used only PNGaseF to remove N-glycans. We would suggest performing a pilot study using GAG-digesting enzymes to analyze whether it improves matrisome protein identification before applying it to all samples. Finally, the native biological variation in the proteome needs to be considered. In particular, the role of the age of the animals has been demonstrated in several tissue-specific proteomic studies in mice [
49,
50].