Next Article in Journal
Significance of the Overexpression of Substance P and Its Receptor NK-1R in Head and Neck Carcinogenesis: A Systematic Review and Meta-Analysis
Next Article in Special Issue
Integrative Transcriptomic Analysis Reveals Distinctive Molecular Traits and Novel Subtypes of Collecting Duct Carcinoma
Previous Article in Journal
A Novel Role of Bergamottin in Attenuating Cancer Associated Cachexia by Diverse Molecular Mechanisms
Previous Article in Special Issue
Renal Cell Tumors: Uncovering the Biomarker Potential of ncRNAs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Radiomics in Renal Cell Carcinoma—A Systematic Review and Meta-Analysis

by
Julia Mühlbauer
1,†,
Luisa Egen
1,†,
Karl-Friedrich Kowalewski
1,
Maurizio Grilli
2,
Margarete T. Walach
1,
Niklas Westhoff
1,
Philipp Nuhn
1,
Fabian C. Laqua
3,
Bettina Baessler
3,† and
Maximilian C. Kriegmair
1,*,†
1
Department of Urology and Urological Surgery, University Medical Center Mannheim, University of Heidelberg, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany
2
Library of the Medical Faculty Mannheim of the University of Heidelberg, University Medical Center Mannheim, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany
3
Institute of Diagnostic and Interventional Radiology, University Hospital Zurich, University of Zurich, Raemistrasse 100, 8091 Zurich, Switzerland
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Cancers 2021, 13(6), 1348; https://doi.org/10.3390/cancers13061348
Submission received: 15 February 2021 / Revised: 7 March 2021 / Accepted: 10 March 2021 / Published: 17 March 2021
(This article belongs to the Special Issue Kidney Cancers)

Abstract

:

Simple Summary

Radiomics may answer questions where the conventional interpretation of medical imaging has limitations. The aim of our systematic review and meta-analysis was to assess the (current) status of evidence in the application of radiomics in the field of renal masses. We focused on its role in diagnosis, sub-entity discrimination and treatment response assessment in renal cell carcinoma (RCC) and benign renal masses. Our quantitative synthesis showed promising results in discrimination of tumor dignity, nevertheless, the value added to human assessment remains unclear and should be the focus of future research. Furthermore, the benefit regarding treatment response assessment remains unclear as well, since the existing studies are investigating already abandoned systemic therapies (ST), which no longer represent the current “reference” standard. Open science could enable to establish technical and clinical validity of radiomic signatures prior to the incorporation of radiomics into everyday clinical practice.

Abstract

Radiomics may increase the diagnostic accuracy of medical imaging for localized and metastatic RCC (mRCC). A systematic review and meta-analysis was performed. Doing so, we comprehensively searched literature databases until May 2020. Studies investigating the diagnostic value of radiomics in differentiation of localized renal tumors and assessment of treatment response to ST in mRCC were included and assessed with respect to their quality using the radiomics quality score (RQS). A total of 113 out of 1098 identified studies met the criteria and were included in qualitative synthesis. Median RQS of all studies was 13.9% (5.0 points, IQR 0.25–7.0 points), and RQS increased over time. Thirty studies were included into the quantitative synthesis: For distinguishing angiomyolipoma, oncocytoma or unspecified benign tumors from RCC, the random effects model showed a log odds ratio (OR) of 2.89 (95%-CI 2.40–3.39, p < 0.001), 3.08 (95%-CI 2.09–4.06, p < 0.001) and 3.57 (95%-CI 2.69–4.45, p < 0.001), respectively. For the general discrimination of benign tumors from RCC log OR was 3.17 (95%-CI 2.73–3.62, p < 0.001). Inhomogeneity of the available studies assessing treatment response in mRCC prevented any meaningful meta-analysis. The application of radiomics seems promising for discrimination of renal tumor dignity. Shared data and open science may assist in improving reproducibility of future studies.

1. Introduction

The concept of radiomics is based on the assumption that data from biomedical image methods such as computed tomography (CT), positron emission tomography (PET), or magnetic resonance imaging (MRI) contain information of disease- and patient-specific processes that are imperceptible to the human eye [1,2]. Through mathematical extraction of the spatial distribution of signal-intensities and pixel-interrelationships, radiomics may quantify this hidden textural information in a tissue [3,4]. Especially in the field of oncology, the number of studies reporting the application of radiomics to medical imaging data in order to classify a disease type or stage or to predict patient outcome has risen exponentially over the past few years.
This rise includes imaging of renal cell carcinoma (RCC), which represents a common urological malignancy. According to the US National Cancer Institute, 74,000 new cases are diagnosed annually, and the incidence is currently increasing mainly due to the wide use of advanced imaging modalities which incidentally detect renal tumors [5]. Currently, there are two important challenges of traditional radiological imaging in RCC: (i) The limited accuracy in predicting dignity of small renal tumors. Remarkably, up to 30% of resected tumors reveal a benign histology in pathological evaluation after partial or radical nephrectomy [6]. And (ii), the complex assessment of treatment response to systemic therapy (ST) in patients with metastatic RCC (mRCC), where various ST options are available. Especially for immunomodulatory ST, which may result in imaging pseudo progression, current standards of therapy response assessment based on the size of measurable lesions are reaching their limits [7].
Consequently, the recent application of radiomics in RCC has predominately focused on two main goals: (i) to improve the accuracy of preoperative, non-invasive histologic subtyping of small renal tumors in order to improve treatment decision between active treatment and surveillance, and (ii) to optimize and objectify the assessment of therapy response in patients with mRCC under ST in order to prevent a possible wrong therapy change.
A former systematic review and meta-analysis by Ursprung et al. found, that radiomics might be a promising tool to discriminate angiomyolipoma (AML) from RCC [8]. However, the work limited its focus on discriminating AML from RCC and the value of radiomics in the differentiation of other benign tumors-in particular oncocytoma-remained unclear. At this point, an improved diagnostic accuracy would be highly anticipated in clinical practice. A prospective proof-of-concept study showed promising results for characterization of treatment-related changes in multiple radiomics features after ST in integrated PET/MRI [9].
Hence, the aim of our systematic review and meta-analysis was to assess the current evidence for the application of radiomics to renal masses, with a special focus on non-invasive classification of dignity and assessment of treatment response.

2. Materials and Methods

This review follows the Cochrane Handbook of Systematic Reviews and Interventions and is in concordance of the AMSTAR-2 criteria and the PRISMA guidelines [10,11,12]. It was prospectively registered on PROSPERO (ID: CRD42020221345).

2.1. Sources

A comprehensive database search including MEDLINE (via PubMed), Web of Science, CENTRAL, Cinahl, Current Contests Medicine (CCMed, via LIVIVO) and ClinicalTrials.gov (accessed on 15 February 2021) was conducted by a specialized librarian of the University of Heidelberg (M.G.) in May 2020. No time frame or language restrictions were applied. In addition, the reference lists of the included studies were searched for studies that could potentially also be included and the authors were contacted if necessary.

2.2. Inclusion and Exclusion Criteria

All single studies, comparative studies and original studies on humans, which met the following PICO criteria, were included:
  • P (patients): Patients with benign or malign renal tumors;
  • I (interventions): Radiomics or texture analysis;
  • C (comparison): CT or MRI;
  • O (outcome): Histologic subtyping (including differentiation of different RCC subtypes, differentiation and/or analysis of any benign and/or malign renal tumors, tumor grading, and any mutation analyses) and treatment response assessment.
We defined treatment response assessment as the prediction of a therapy response including prediction of survival or as the assessment of an existing response under ST including the correlation with survival. Reviews and case studies were excluded

2.3. Search Terms

The full search terms can be found in Table S1.

2.4. Study Selection

Two reviewers (J.M. and L.E.) independently screened titles and abstracts. Consecutively, the same two reviewers evaluated the full texts of eligible studies, verifying whether the inclusion criteria were met. Disagreements were solved by consensus or a third reviewer (B.B.).

2.5. Quality Assessment

Quality assessment of all included studies was recorded using the radiomics quality score (RQS), which is a radiomics-specific quality assessment tool [1]. It is made of a total of 16 criteria, for each of which a certain number of points can be achieved. The respective number of points corresponds to the importance regarding the methodological quality of a study [1]. The score ranges from -8 to +36 points, with a score of −8 to 0 points corresponding to 0% and 36 points corresponding 100% [13]. For details, see Table 1. The data of included studies were extracted by two reviewers (J.M. and L.E.) independently. Consecutively, the mean of the two evaluations was then calculated for each study to provide the final RQS. Furthermore, the average value for the respective criteria was given for the entirety of all studies, again using the mean value of both reviewers for each respective criterion.

2.6. Meta-Analysis

Two meta-analyses were planned within the included studies:
  • a meta-analysis of all studies investigating the use of radiomics to compare benign versus malign renal tumors;
  • a meta-analysis of all studies investigating the use of radiomics for treatment response assessment of metastatic RCC with any ST.
Data of all included studies were extracted by two reviewers (J.M. and L.E.) independently. Disagreement was solved by consensus or a third reviewer, if this was necessary (B.B.). Only studies, from which a two-by-two contingency table could be extracted or reconstructed were included. If multiple models were reported in a study, only the one with the highest area under the curve (AUC) was extracted. If no AUC was provided, the model with the highest Youden’s Index was chosen. Models deriving from augmented data were excluded. In case of multiple publications deriving from one study, only the manuscript with better methodological quality according to the RQS was included.

2.7. Statistical Analysis

Random effects meta-analyses were performed using the Mantel-Haenszel model and reported as log odds ratio (OR). Forest plots were used for visualization of the results. Subgroup analyses stratified by different histologic subtypes (AML versus RCC, oncocytoma versus RCC, benign tumors not further specified versus RCC) were performed to evaluate whether findings can be applied across the different subtypes and the entirety of all benign vs. malignant tumors. Due to the small number of included studies, no differentiation between the administered therapies could be made for the subgroup analysis of treatment response assessment. Forest plots were designed to provide subgroup and overall analysis at the same time for each outcome.
The heterogeneity of studies was assessed using the I2 index. An I2 value of 0–25% represents insignificant heterogeneity, >25–50% low heterogeneity, >50–75% moderate heterogeneity, and >75% high heterogeneity [14]. A p value of ≤0.05 was considered as statistically significant. We did not adjust for multiple testing in concordance with earlier arguments for this approach [15]. All analyses were performed using R (R Core Team (2019). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria) following an a priori specified statistical analysis plan.

3. Results

3.1. Included Studies

Figure 1 shows the PRISMA flow-chart of the included studies of this systematic review and meta-analysis. Table S2 summarizes the characteristics and research questions of the included studies. In the 109 studies for which the number of patients was available, a total of 12,054 patients were included. In four studies, only the number of investigated tumors was given. Most studies investigated the use of radiomics using CT imaging (n = 84), less than a quarter based on magnetic resonance (MR) imaging (n = 23), and two based on a combination of CT and MR imaging. Another three studies investigated the use of radiomics on PET MR images and one pilot study investigated its use on 18F-fluorodeoxyglucose positron emission tomography (FDG PET) images.
Over the past few years, the number of studies investigating the use of radiomics in RCC has increased continuously. In particular, there has been an increase in the number of studies carried out in the last two years: 47% (n = 53) of all included studies were published between 2019 and 2020. Another quarter of all included studies were published in the two previous years (2017 and 2018, n = 32), which roughly corresponds to the number of all studies from previous years (Figure 2a).

3.2. Quality Assessment

Table 1 shows the average score for each criterion of the RQS of all included studies. The respective ratings for each study are shown in Table S2. The mean RQS score of all included studies was 13.6% (4.9 points, standard deviation (SD) ± 4.60 points), the median score was 13.9% (5.0 points, inter-quartile range (IQR) 0.25–7.0 points) and the range was 0–41.7% (−3.0 to 15.0 points).
Over time, an increase in the quality according to the RQS of the included studies could be observed (Figure 2b). In 2020 in particular, the RQS was better than in previous years with a mean RQS of 21.6% (7.76 points, SD ± 4.97 points) and a median score of 19.4% (7.0 points, IQR 5.0–13.0 points). The respective ratings over the years are shown in Table S3.

3.3. Differentiation of Benign and Malign Renal Tumors

Of the 66 studies which dealt with the prediction of histological subtypes of renal tumors, 52 investigated the prediction of the dignity of renal masses, of which 30 could be included in the quantitative analysis. The most common exclusion criterion was the insufficient disclosure of the results (n = 18), which prevented the possibility to extract a two-by-two contingency table. Two of the studies used data augmentation with generation of new data through transformation of existing cases and therefore had to be excluded. Another two studies had to be excluded because multiple publications derived from one study. Table S4 lists further details on the exclusion criteria. Furthermore, Table S4 provides an overview of all 52 studies investigating the prediction of the dignity of renal masses including the respective research question, a brief summary of the used methods, the main results and the used reference for comparison with the use of radiomics for each study. All studies used the histologically confirmed diagnosis for the respective tumor subtype as the gold reference standard. In most cases (n = 31), the histological diagnosis was based on the surgically removed tumor by partial or radical nephrectomy. However, some studies (n = 21) did not clarify whether the histology came from a biopsy or from the surgically removed tumor. None of the studies indicated that the histological result was only verified by biopsy.
Within the 30 included studies, the discrimination of AML and RCC was investigated in 14, the discrimination of oncocytoma and RCC in seven, and the discrimination of unspecified benign renal tumors and RCC in 12 of the studies. Two of the studies (Deng et al. [16] and Raman et al. [17]) investigated the discrimination of RCC from various sub-entities of benign tumors, which explains the multiple listing of these studies in the respective analyses. Figure 3 shows the forest plot of the effect size calculated as log OR for studies investigating AML versus RCC (a), oncocytoma versus RCC (b), benign tumors not further specified versus RCC (c), and the sum of all studies (d).
For differentiating AML versus RCC, the random effects model showed a log OR of 2.89 (95%-CI 2.40–3.39, p < 0.001) with low heterogeneity of the studies of I2 = 40.5% (95%-CI 0.0–68.4, p = 0.058). For the differentiation of oncocytoma versus RCC, log OR was 3.08 (95%-CI 2.09–4.06, p < 0.001) with moderate heterogeneity of I2 = 67.7% (95%-CI 28.2–85.4, p = 0.005), and for the differentiation of benign tumors not further specified versus RCC, log OR was 3.57 (95%-CI 2.69–4.45, p < 0.001) with high heterogeneity of 86.3% (95%-CI 77.9–91.6, p < 0.001). For the overall analysis of all the above-mentioned studies investigating the differentiation of benign versus malign renal tumors, the random effects model showed a log OR 3.17 (95%-CI 2.73–3.62, p < 0.0001) with moderate heterogeneity of 74.6% (95%-CI 63.7–82.2, p < 0.001).

3.4. Treatment Response Assessment

Overall, six studies were identified which assessed response to ST and prediction of treatment response using radiomics. Table S5 summarizes the characteristics and investigated features of the respective studies.
Due to the large inhomogeneity especially concerning regions of interest (ROI) and endpoints of the studies, no sufficient data could be extracted that would have enabled a meaningful meta-analysis.

4. Discussion

The use of radiomics in oncological imaging for different tumor entities has become increasingly popular in recent years with the overall goal to improve clinical decision-making [1]. The high rate of benign renal tumors diagnosed at pathological evaluation after surgery as well as the increasing complexity of treatment response assessment has also led to a growing interest in using radiomics in the field of renal tumor imaging. A former review including studies until Oct. 2018 found that it might be a promising tool to discriminate AML from RCC, but it also showed that the studies that existed up to this point were of poor quality and therefore must be interpreted with caution [8]. Furthermore, the differentiation of other benign tumors than AML and RCC as well as the possibility of therapy response assessment was not investigated [8].

4.1. Quality Assessment

In our systematic review, we observed only a slight increase of the quality of radiomics studies in recent years with best quality scores in 2020 (median RQS 19.4% (7.0 points, IQR 5.0–13.0 points)). However, the overall quality was still low (median RQS 13.9% (5.0 points, IQR 0.25–7.0 points)). The main reasons for the low quality of the studies were the deficiency in using feature reduction, the lack of internal and external validation data as well as the lack of use of open science and data. These results are in line with the findings by Ursprung et al. who found a ratio between analysed features and patients ranging from 25 to 240 times more features than patients with only 51% of the included studies in their analysis using feature reduction methods [8]. Furthermore, they also reported a low rate of only 5% of studies who performed validation and found that methodological aspects of the radiomics studies in this field might further increase the risk of bias [8]. These methodological aspects contain the patient selection in particular and the timing of index and reference tests [8]. The poor quality of reporting prediction model studies is a known problem, which amongst others led to the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnoses) initiative [13,18]. A set of recommendations was established for the reporting of the studies assessing prediction models to minimize bias and to increase their clinical usefulness [18]. A relatively large number of radiomics studies are pilot or feasibility studies, but they do not provide open data or the codes [17,19,20]. This prevents validation analyses to confirm the respective results. Code and data sharing (“Open Science”) is likely to improve reproducibility in subsequent studies. Especially, the application of deep learning methods for image segmentation will not only highly reduce the time needed for manual segmentation but minimizes potential bias and variance induced by human readers [21].
Proceedings in standardization and homogenization are likely to reduce bias even further following the overall goal to evaluate the real added value of radiomics. In addition, cost-effectiveness analyses are necessary regarding future investigations. These play a crucial role in understanding the clinical and economic consequences of this technique, as well as its possible future integration into everyday clinical practice [22].

4.2. Classification of Dignity of Renal Tumors

Our meta-analysis of all studies investigating the differentiation of benign versus malign renal tumors showed promising results in the overall (log OR 3.17, 95%-CI 2.73–3.62, p < 0.0001) and the subgroup analyses. Nevertheless, these results must be considered with care.
First, the inconsistency of the use of radiomics in clinical practice should be considered. This inconsistency is represented by methodological differences between the included studies in our meta-analysis as well as the inconsistent and partly insufficient disclosure of the results in the respective studies. As a result, a large part of the studies investigating the dignity of renal masses (22 out of 52 studies) could unfortunately not be included in the meta-analysis, which could possibly have biased our results.
Second, given the overall low quality of the analysed studies when it comes to the RQS as well as the well-known vulnerability of radiomics to a large number of technical and non-technical factors, machine learning models are highly prone to overfitting due to the usually high number of radiomics features, considerably exceeding the number of patients included in the study [23]. Algorithms performing complex calculations with radiomics data might easily be confounded by the tiniest amount of noise, and without adhering to strong quality control of study design and especially statistical analysis, it is very likely that the results of our meta-analysis are over-optimistic.
Third, it should be noted that all studies included in the meta-analysis used histopathological findings as the reference standard, which was based on the surgically removed tumor in most cases. However, since the use of radiomics is crucial for clinical decision-making in the preoperative setting, the comparison with alternative diagnostic options, which are available as reference standards in the preoperative context, would be a more meaningful comparison for assessing the real added value of radiomics. Such diagnostic options are, on the one hand, a histologically confirmed diagnosis based on renal biopsies and, on the other hand, human imaging assessment. In comparison to human assessment, the results of Uhlig et al. and Sun et al. suggest, that the use of radiomics and machine learning could provide an added value to differentiate benign and malign renal lesions [24,25]. Picard et al. and Said et al. furthermore found that the combination of qualitative and quantitative image analyses seems to be a promising approach [26,27]. In contrast, Takahashi et al. did not find a significant benefit of radiomics compared to human assessment [28].
In summary, although the majority of the results indicate that radiomics could be a valuable tool to differentiate benign and malignant renal tumors and that it might bring an added value compared to and in combination with human assessment, the following should be considered for future studies in this context: To confirm the added value or even a superior diagnostic or prognostic performance of radiomics as compared to other available diagnostic options in the preoperative context, human assessment and the comparison to histologically confirmed diagnoses based on renal biopsies should be considered as a reference standard for future studies in addition to histopathological findings based on surgically removed tumors. Furthermore, adherence to rigorous quality standards and complete as well as consistent disclosure of the results should be ensured.
Ultimately, prospective preferably (cluster) randomized controlled trials would need to be conducted for evaluation of promising models comparing the outcomes following treatment decisions based on those predictions to the current human benchmark or other preoperative available diagnostic options. However, the results of our meta-analysis suggest, that especially model stability is not yet suitable to allow for such a step in this research area.

4.3. Treatment Response Assessment

All available studies investigating the use of radiomics for therapy response assessment for which the information regarding the respective treatment was available included patients receiving different tyrosine kinase inhibitors (TKI) [9,29,30,31,32]. For patients receiving sunitinib, Antunes et al. found in their proof-of-concept study including two patients, that radiomics might be a powerful tool to characterize treatment related changes in integrated PET/MRI [9], highlighting the potential of additional functional/hybrid imaging in radiomics as well as in response assessment. Furthermore, the results of Bharwani et al. and Haider et al. suggest that changes in different histogram parameters in MRI and CT might be associated with the overall and progression-free survival of patients receiving sunitinib [29,32]. Although these results seem promising, the inhomogeneity of all the studies investigating the therapy response assessment using radiomics unfortunately did not enable a meaningful meta-analysis for these studies. The main problem was the diversity of investigated endpoints. Of the three studies for which the data for the therapy response assessment could be extracted, the change in different radiomics features was investigated. This made a direct comparison not useful. Furthermore, the diversity of the analysed ROIs and of the included ST, as well as the small number of included patients also counteracted a meaningful meta-analysis for these studies. Moreover, it should be noted that none of the studies investigating the use of radiomics to assess therapy response included patients with immunomodulatory therapies, which represent the current reference standard in ST for mRCC [33]. With regard to future investigations for the assessment of therapy response using radiomics, the steadily increasing number of available therapeutic options will be a hurdle. However, it does not seem useful to integrate the selection of relevant radiomics features into the study-protocols of prospective trials investigating the effectiveness of new ST [1]. Standardised and optimized study protocols carry the risk that the feature selection depends more on the imaging process rather than the pathology [34]. Rather, the goal should be to share the data from such studies so that the biological correlation of already established radiomic signatures can be performed [34].

4.4. Limitations

Several limitations of this review merit consideration. First, the heterogeneity of the studies included in the quantitative synthesis must be mentioned. This relates not only to the control groups used (different subtypes of RCC or no classification of RCC subtype), but also to the assessed imaging (MRI and CT), the imaging protocols and the deriving phases of the respective imaging from which the features were extracted. In addition, newly investigated image modalities like the use of single-photon emission computed tomography (SPECT)/CT and the use of quantitative SPECT/CT reconstruction methods in the field of renal tumor diagnostic management, which showed promising results in recently published studies, were not considered in this review [35,36,37,38]. Furthermore, the included studies differed in terms of the methodology of the used image reconstruction, feature extraction, and the algorithms used. Due to the inhomogeneity of the reported results, only 30 of the 52 available studies assessing the differentiation of benign and malign renal tumors could be included in the quantitative analysis. Beyond that, no meta-analysis regarding the assessment of treatment response using radiomics could be carried out due to the heterogeneity of the methods and reported results of the available studies. Nevertheless, the results of the quantitative synthesis provide an overview of the benefit that the use of radiomics might have.

5. Conclusions

Our review shows that there has been an increasing number of studies investigating the use of radiomics for RCC, especially in recent years. Even if the quality of the studies improved over time, the RQS in recent years was still quite low. Our quantitative synthesis showed promising results regarding the differentiation between benign and malign renal tumors using radiomics. However, the results are likely over-optimistic and must be considered with care and hence the added value compared to human assessment accuracy remains unclear. Human assessment should be considered as a reference standard for future studies in addition to histopathological findings and adherence to rigorous quality standards should be ensured. Furthermore, the benefit regarding therapy response assessment remains unclear as the existing studies are investigating ST which no longer correspond to the current reference standard. Data sharing of prospective trials investigating new ST might enable to perform biological correlation of already established radiomic signatures.
This review reveals that there are still hurdles to overcome before the added value of radiomics in the diagnostic of RCC can be ascertained. Above all, the weak points are the heterogeneity and missing standardization of the available studies and the lack of evaluation of the benefit against current standards, which represent decisive milestones before the use of radiomics will find its way into everyday clinical practice. By highlighting these challenges, this review can improve the planning of subsequent studies in this context.

Supplementary Materials

The following are available online at https://www.mdpi.com/2072-6694/13/6/1348/s1, Table S1: Search terms, Table S2: Summary of studies included in the qualitative synthesis (n = 113), Table S3: RQS over time, Table S4: Summary of studies investigating the dignity of renal masses (n = 52) and reasons for exclusion from the meta-analysis, Table S5: Studies investigating treatment response assessment using radiomics (n = 6).

Author Contributions

Conceptualization, M.C.K. and J.M.; data base search, M.G.; screening and data extraction, J.M., L.E. and B.B.; data analysis, J.M. and K.-F.K.; writing—original draft preparation, J.M.; writing—review and editing, L.E., K.-F.K., M.T.W., N.W., P.N., F.C.L., B.B. and M.C.K.; supervision, M.C.K. and B.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

Not applicable.

Conflicts of Interest

F.C.L. currently holds a seed grant from the European Society of Radiology in cooperation with the European Institute for Biomedical Imaging Research. F.C.L. holds shares from companies, that develop or sell products in the field of machine learning, artificial intelligence and cancer diagnostics and treatment. This had no influence on the present work. All other authors declare no conflict of interest.

References

  1. Lambin, P.; Leijenaar, R.T.H.; Deist, T.M.; Peerlings, J.; de Jong, E.E.C.; van Timmeren, J.; Sanduleanu, S.; Larue, R.T.H.M.; Even, A.J.G.; Jochems, A.; et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017, 14, 729–762. [Google Scholar] [CrossRef] [PubMed]
  2. Mannil, M.; von Spiczak, J.; Manka, R.; Alkadhi, H. Texture Analysis and Machine Learning for Detecting Myocardial Infarction in Noncontrast Low-Dose Computed Tomography: Unveiling the Invisible. Investig. Radiol. 2018, 53, 338–343. [Google Scholar] [CrossRef] [PubMed]
  3. Castellano, G.; Bonilha, L.; Li, L.M.; Cendes, F. Texture analysis of medical images. Clin. Radiol. 2004, 59, 1061–1069. [Google Scholar] [CrossRef] [PubMed]
  4. Tourassi, G.D. Journey toward computer-aided diagnosis: Role of image texture analysis. Radiology 1999, 213, 317–320. [Google Scholar] [CrossRef] [PubMed]
  5. Siegel, R.L.; Miller, K.D.; Jemal, A. Cancer statistics, 2020. CA Cancer J. Clin. 2020, 70, 7–30. [Google Scholar] [CrossRef]
  6. Bauman, T.M.; Potretzke, A.M.; Wright, A.J.; Knight, B.A.; Vetter, J.M.; Figenshau, R.S. Partial Nephrectomy for Presumed Renal-Cell Carcinoma: Incidence, Predictors, and Perioperative Outcomes of Benign Lesions. J. Endourol. 2017, 31, 412–417. [Google Scholar] [CrossRef]
  7. Seymour, L.; Bogaerts, J.; Perrone, A.; Ford, R.; Schwartz, L.H.; Mandrekar, S.; Lin, N.U.; Litière, S.; Dancey, J.; Chen, A. iRECIST: Guidelines for response criteria for use in trials testing immunotherapeutics. Lancet Oncol. 2017, 18, e143–e152. [Google Scholar] [CrossRef] [Green Version]
  8. Ursprung, S.; Beer, L.; Bruining, A.; Woitek, R.; Stewart, G.D.; Gallagher, F.A.; Sala, E. Radiomics of computed tomography and magnetic resonance imaging in renal cell carcinoma-a systematic review and meta-analysis. Eur. Radiol. 2020, 30, 3558–3566. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Antunes, J.; Viswanath, S.; Rusu, M.; Valls, L.; Hoimes, C.; Avril, N.; Madabhushi, A. Radiomics Analysis on FLT-PET/MRI for Characterization of Early Treatment Response in Renal Cell Carcinoma: A Proof-of-Concept Study. Transl. Oncol. 2016, 9, 155–162. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Cumpston, M.; Li, T.; Page, M.J.; Chandler, J.; Welch, V.A.; Higgins, J.P.T.; Thomas, J. Updated guidance for trusted systematic reviews: A new edition of the Cochrane Handbook for Systematic Reviews of Interventions. Cochrane. Database. Syst. Rev. 2019, 10. [Google Scholar] [CrossRef] [Green Version]
  11. Shea, B.J.; Reeves, B.C.; Wells, G.; Thuku, M.; Hamel, C.; Moran, J.; Moher, D.; Tugwell, P.; Welch, V.; Kristjansson, E.; et al. AMSTAR 2: A critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ 2017, 358. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. J. Clin. Epidemiol. 2009, 1006–1012. [Google Scholar] [CrossRef] [PubMed]
  13. Park, J.E.; Kim, D.H.; Kim, H.S.; Park, S.Y.; Kim, J.Y.; Cho, S.J.; Shin, J.H.; Kim, J.H. Quality of science and reporting of radiomics in oncologic studies: Room for improvement according to radiomics quality score and TRIPOD statement. Eur. Radiol. 2020, 30, 523–536. [Google Scholar] [CrossRef] [PubMed]
  14. Higgins, J.P.T.; Thompson, S.G.; Deeks, J.J.; Altman, D.G. Measuring inconsistency in meta-analyses. BMJ 2003, 327, 557–560. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Rothman, K.J. No adjustments are needed for multiple comparisons. Epidemiology 1990, 1, 43–46. [Google Scholar] [CrossRef] [Green Version]
  16. Deng, Y.; Soule, E.; Cui, E.; Samuel, A.; Shah, S.; Lall, C.; Sundaram, C.; Sandrasegaran, K. Usefulness of CT texture analysis in differentiating benign and malignant renal tumours. Clin. Radiol. 2020, 75, 108–115. [Google Scholar] [CrossRef] [PubMed]
  17. Raman, S.P.; Chen, Y.; Schroeder, J.L.; Huang, P.; Fishman, E.K. CT texture analysis of renal masses: Pilot study using random forest classification for prediction of pathology. Acad. Radiol. 2014, 21, 1587–1596. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Collins, G.S.; Reitsma, J.B.; Altman, D.G.; Moons, K.G.M. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann. Intern. Med. 2015, 162, W1–W73. [Google Scholar] [CrossRef] [Green Version]
  19. Lee, H.W.; Cho, H.W.; Joung, J.G.; Jeon, H.G.; Jeong, B.C.; Jeon, S.S.; Lee, H.M.; Nam, D.H.; Park, W.Y.; Kim, C.K. Integrative Radiogenomics Approach for Risk Assessment of Post-Operative Metastasis in Pathological T1 Renal Cell Carcinoma: A Pilot Retrospective Cohort Study. Cancers 2020, 12, 866. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Tang, Z.; Yu, D.; Ni, T.; Zhao, T.; Jin, Y.; Dong, E. Quantitative Analysis of Multiphase Contrast-Enhanced CT Images: A Pilot Study of Preoperative Prediction of Fat-Poor Angiomyolipoma and Renal Cell Carcinoma. AJR Am. J. Roentgenol. 2020, 214, 370–382. [Google Scholar] [CrossRef]
  21. Hosny, A.; Aerts, H.j.; Mak, R.H. Handcrafted versus deep learning radiomics for prediction of cancer therapy response. Lancet Digit. Health 2019, 1, e106–e107. [Google Scholar] [CrossRef] [Green Version]
  22. Sanders, G.D.; Neumann, P.J.; Basu, A. Recommendations for Conduct, Methodological Practices, and Reporting of Cost-effectiveness Analyses: Second Panel on Cost-Effectiveness in Health and Medicine. JAMA 2016, 316, 1093–1103. [Google Scholar] [CrossRef] [PubMed]
  23. van Timmeren, J.E.; Cester, D.; Tanadini-Lang, S.; Alkadhi, H.; Baessler, B. Radiomics in medical imaging—“How-to” guide and critical reflection. Insights Imaging 2020, 11, 91. [Google Scholar] [CrossRef]
  24. Sun, X.Y.; Feng, Q.-X.; Xu, X.; Zhang, J.; Zhu, F.-P.; Yang, Y.-H.; Zhang, Y.-D. Radiologic-Radiomic Machine Learning Models for Differentiation of Benign and Malignant Solid Renal Masses: Comparison With Expert-Level Radiologists. AJR Am. J. Roentgenol. 2020, 214, W44–W54. [Google Scholar] [CrossRef] [PubMed]
  25. Uhlig, J.; Biggemann, L.; Nietert, M.M.; Beißbarth, T.; Lotz, J.; Kim, H.S.; Trojan, L.; Uhlig, A. Discriminating malignant and benign clinical T1 renal masses on computed tomography: A pragmatic radiomics and machine learning approach. Medicine (Baltimore) 2020, 99, e19725. [Google Scholar] [CrossRef] [PubMed]
  26. Picard, M.; Shah, N.; Flemming, B.; Kovacs, M.; Freeman, K.; Burchett, P.F.; Giovagnoli, D.A.; Hardie, A.D. Combined Qualitative and Quantitative Assessment of Low-Attenuation Renal Lesions Improves Identification of Renal Malignancy on Noncontrast Computed Tomography. J. Comput. Assist. Tomogr. 2019, 43, 852–856. [Google Scholar] [CrossRef] [PubMed]
  27. Said, D.; Hectors, S.J.; Wilck, E.; Rosen, A.; Stocker, D.; Bane, O.; Beksaç, A.T.; Lewis, S.; Badani, K.; Taouli, B. Characterization of solid renal neoplasms using MRI-based quantitative radiomics features. Abdom. Radiol. 2020, 45, 2840–2850. [Google Scholar] [CrossRef] [PubMed]
  28. Takahashi, N.; Takeuchi, M.; Sasaguri, K.; Leng, S.; Froemming, A.; Kawashima, A. CT negative attenuation pixel distribution and texture analysis for detection of fat in small angiomyolipoma on unenhanced CT. Abdom. Radiol. 2016, 41, 1142–1151. [Google Scholar] [CrossRef]
  29. Bharwani, N.; Miquel, M.E.; Powles, T.; Dilks, P.; Shawyer, A.; Sahdev, A.; Wilson, P.D.; Chowdhury, S.; Berney, D.M.; Rockall, A.G. Diffusion-weighted and multiphase contrast-enhanced MRI as surrogate markers of response to neoadjuvant sunitinib in metastatic renal cell carcinoma. Br. J. Cancer 2014, 110, 616–624. [Google Scholar] [CrossRef] [Green Version]
  30. Boos, J.; Revah, G.; Brook, O.R.l.; Rangaswamy, B.; Bhatt, R.S.; Brook, A.; Raptopoulos, V. Intensity Distribution Curve (Histogram) Analysis of Patients Undergoing Antiangiogenic Therapy for Metastatic Renal Cell Carcinoma. AJR Am. J. Roentgenol. 2017, 209, W85–W92. [Google Scholar] [CrossRef]
  31. Goh, V.; Ganeshan, B.; Nathan, P.; Juttla, J.K.; Vinayan, A.; Miles, K.A. Assessment of response to tyrosine kinase inhibitors in metastatic renal cell cancer: CT texture as a predictive biomarker. Radiology 2011, 261, 165–171. [Google Scholar] [CrossRef]
  32. Haider, M.A.; Vosough, A.; Khalvati, F.; Kiss, A.; Ganeshan, B.; Bjarnason, G.A. CT texture analysis: A potential tool for prediction of survival in patients with metastatic clear cell carcinoma treated with sunitinib. Cancer Imaging 2017, 17. [Google Scholar] [CrossRef]
  33. Osawa, T.; Takeuchi, A.; Kojima, T.; Shinohara, N.; Eto, M.; Nishiyama, H. Overview of current and future systemic therapy for metastatic renal cell carcinoma. Jpn. J. Clin. Oncol. 2019, 49, 395–403. [Google Scholar] [CrossRef]
  34. Fournier, L.; Costaridou, L.; Bidaut, L.; Michoux, N.; Lecouvet, F.E.; de Geus-Oei, L.-F.; Boellaard, R.; Oprea-Lager, D.E.; Obuchowski, N.A.; Caroli, A.; et al. Incorporating radiomics into clinical trials: Expert consensus on considerations for data-driven compared to biologically driven quantitative biomarkers. Eur. Radiol. 2021, 2021. [Google Scholar] [CrossRef]
  35. Gorin, M.A.; Rowe, S.P.; Baras, A.S.; Solnes, L.B.; Ball, M.W.; Pierorazio, P.M.; Pavlovich, C.P.; Epstein, J.I.; Javadi, M.S.; Allaf, M.E. Prospective Evaluation of 99mTc-sestamibi SPECT/CT for the Diagnosis of Renal Oncocytomas and Hybrid Oncocytic/Chromophobe Tumors. Eur Urol. 2016, 69, 413–416. [Google Scholar] [CrossRef] [PubMed]
  36. Rossi, S.H.; Prezzi, D.; Kelly-Morland, C.; Goh, V. Imaging for the diagnosis and response assessment of renal tumours. World. J. Urol. 2018, 36, 1927–1942. [Google Scholar] [CrossRef] [Green Version]
  37. Sheikhbahaei, S.; Jones, C.S.; Porter, K.K.; Rowe, S.P.; Gorin, M.A.; Baras, A.S.; Pierorazio, P.M.; Ball, M.W.; Higuchi, T.; Johnson, P.T. Defining the Added Value of 99mTc-MIBI SPECT/CT to Conventional Cross-Sectional Imaging in the Characterization of Enhancing Solid Renal Masses. Clin. Nucl. Med. 2017, 42, e188–e193. [Google Scholar] [CrossRef] [PubMed]
  38. Jones, K.M.; Solnes, L.B.; Rowe, S.P.; Gorin, M.A.; Sheikhbahaei, S.; Fung, G.; Frey, E.C.; Allaf, M.E.; Dy, Y.; Javadi, M.S. Use of quantitative SPECT/CT reconstruction in 99m Tc-sestamibi imaging of patients with renal masses. Ann. Nucl. Med. 2018, 32, 87–93. [Google Scholar] [CrossRef] [PubMed]
Figure 1. PRISMA Flow-chart of included studies.
Figure 1. PRISMA Flow-chart of included studies.
Cancers 13 01348 g001
Figure 2. Number (a) and box-and-whisker plots (b) of the studies over time. In (b), the outliers are represented by circles and the extremes by asterisks.
Figure 2. Number (a) and box-and-whisker plots (b) of the studies over time. In (b), the outliers are represented by circles and the extremes by asterisks.
Cancers 13 01348 g002
Figure 3. Forest plot of the effect size calculated as log odds ratio for studies investigating (a) AML versus RCC, (b) oncocytomas versus RCC, (c) benign tumors not further specified versus RCC, and (d) the sum of all studies. TP: number of AML (a), oncocytomas (b) or benign renal tumors (c) correctly diagnosed; FN: number of AML (a), oncocytomas (b) or benign renal tumors (c) incorrectly diagnosed as RCC; FP: number of RCC incorrectly diagnosed as AML (a), oncocytomas (b) or benign renal tumors (c); TN: number of RCC correctly diagnosed.
Figure 3. Forest plot of the effect size calculated as log odds ratio for studies investigating (a) AML versus RCC, (b) oncocytomas versus RCC, (c) benign tumors not further specified versus RCC, and (d) the sum of all studies. TP: number of AML (a), oncocytomas (b) or benign renal tumors (c) correctly diagnosed; FN: number of AML (a), oncocytomas (b) or benign renal tumors (c) incorrectly diagnosed as RCC; FP: number of RCC incorrectly diagnosed as AML (a), oncocytomas (b) or benign renal tumors (c); TN: number of RCC correctly diagnosed.
Cancers 13 01348 g003aCancers 13 01348 g003b
Table 1. RQS and average score for the included studies.
Table 1. RQS and average score for the included studies.
CriteriaPointsAverage Score
1Image protocol quality+1 if protocols are well-documented
+1 if public protocol is used
0.65
2Multiple segmentations+1 if multiple segmentations are carried out (i.e., different physicians/algorithms/software)0.53
3Phantom study+1 if phantom study is used on all scanners0.00
4Multiple time points+1 if images are collected at additional time points0.07
5Feature reduction or adjustment for multiple testing−3 if neither measure is implemented
+3 if either measure is implemented
0.69
6Multivariable analysis with non-radiomics features+1 if multivariable analysis with non-radiomics features is carried out0.12
7Biological correlates+1 if phenotypic differences are demonstrated0.96
8Cut-of-analyses+1 if risk groups are determined by either the median, a previously published cut-off or if a continuous risk variable is reported0.09
9Discrimination statistics+1 if a discrimination statistic and its statistical significance is reported (i.e., ROC curve, AUC)
+1 if a resampling method technique is also applied (i.e., bootstrapping, cross-validation)
1.21
10Calibration statistics+1 if a calibration statistic and its statistical significance is reported (i.e. Calibration-in-the-large/slope)
+1 if a resampling method technique is also applied (i.e., bootstrapping, cross-validation)
0.02
11Prospective+7 for prospective validation of a radiomics signature in an appropriate trial0.46
12Validation−5 if validation is missing
+2 if validation is based on a dataset from the same institute
+3 if validation is based on a dataset from another institute
+4 if validation is based on two datasets from two institutes
+4 if study validates a previously published signature
+5 of validation is based in three or more datasets from distinct institutes
−3.88
13Gold standard+2 if comparison to the current gold standard is carried out1.91
14Potential clinical utility+2 if a potential application in a clinical setting is reported2.00
15Cost-effectiveness analysis+1 if the cost-effectiveness of the clinical application is reported0.00
16Open science and data+1 if scans are open source
+1 if region of interest (ROI) segmentations are open source
+1 if code is open source
+1 if representative segmentations and features are open source
0.16
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mühlbauer, J.; Egen, L.; Kowalewski, K.-F.; Grilli, M.; Walach, M.T.; Westhoff, N.; Nuhn, P.; Laqua, F.C.; Baessler, B.; Kriegmair, M.C. Radiomics in Renal Cell Carcinoma—A Systematic Review and Meta-Analysis. Cancers 2021, 13, 1348. https://doi.org/10.3390/cancers13061348

AMA Style

Mühlbauer J, Egen L, Kowalewski K-F, Grilli M, Walach MT, Westhoff N, Nuhn P, Laqua FC, Baessler B, Kriegmair MC. Radiomics in Renal Cell Carcinoma—A Systematic Review and Meta-Analysis. Cancers. 2021; 13(6):1348. https://doi.org/10.3390/cancers13061348

Chicago/Turabian Style

Mühlbauer, Julia, Luisa Egen, Karl-Friedrich Kowalewski, Maurizio Grilli, Margarete T. Walach, Niklas Westhoff, Philipp Nuhn, Fabian C. Laqua, Bettina Baessler, and Maximilian C. Kriegmair. 2021. "Radiomics in Renal Cell Carcinoma—A Systematic Review and Meta-Analysis" Cancers 13, no. 6: 1348. https://doi.org/10.3390/cancers13061348

APA Style

Mühlbauer, J., Egen, L., Kowalewski, K. -F., Grilli, M., Walach, M. T., Westhoff, N., Nuhn, P., Laqua, F. C., Baessler, B., & Kriegmair, M. C. (2021). Radiomics in Renal Cell Carcinoma—A Systematic Review and Meta-Analysis. Cancers, 13(6), 1348. https://doi.org/10.3390/cancers13061348

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop