Next Article in Journal
Bisphenols A and S Alter the Bioenergetics and Behaviours of Normal Urothelial and Bladder Cancer Cells
Next Article in Special Issue
Use of Personalized Biomarkers in Metastatic Colorectal Cancer and the Impact of AI
Previous Article in Journal
Identification of Leukemia-Associated Immunophenotypes by Databaseguided Flow Cytometry Provides a Highly Sensitive and Reproducible Strategy for the Study of Measurable Residual Disease in Acute Myeloblastic Leukemia
Previous Article in Special Issue
Identification and Assessment of Risks in Biobanking: The Case of the Cancer Institute of Bari
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis

by
Valentina Russo
1,
Eleonora Lallo
1,
Armelle Munnia
1,
Miriana Spedicato
1,
Luca Messerini
2,
Romina D’Aurizio
3,
Elia Giuseppe Ceroni
3,
Giulia Brunelli
3,
Antonio Galvano
4,
Antonio Russo
4,
Ida Landini
5,
Stefania Nobili
6,
Marcello Ceppi
7,
Marco Bruzzone
7,
Fabio Cianchi
2,
Fabio Staderini
2,
Mario Roselli
8,
Silvia Riondino
8,
Patrizia Ferroni
9,10,
Fiorella Guadagni
9,10,
Enrico Mini
5,† and
Marco Peluso
1,*,†
add Show full author list remove Hide full author list
1
Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
2
Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
3
Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
4
Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
5
Department of Health Sciences, University of Florence, 50139 Florence, Italy
6
Department of Neurosciences, Imaging and Clinical Sciences, “G. D’Annunzio” Chieti-Pescara, 66100 Chieti, Italy
7
Clinical Epidemiology Unit, IRCCS-Ospedale Policlinico San Martino, 16131 Genova, Italy
8
Medical Oncology Unit, Department of Systems Medicine, Tor Vergata University, 00133 Rome, Italy
9
BioBIM (InterInstitutional Multidisciplinary Biobank), IRCCS San Raffaele Roma, 00166 Rome, Italy
10
Department of Human Sciences & Quality of Life Promotion, San Raffaele Roma Open University, 00166 Rome, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Cancers 2022, 14(16), 4012; https://doi.org/10.3390/cancers14164012
Submission received: 28 June 2022 / Revised: 26 July 2022 / Accepted: 12 August 2022 / Published: 19 August 2022

Abstract

:

Simple Summary

Metastatic colorectal cancer (mCRC) has high incidence and mortality. Nevertheless, innovative biomarkers have been developed for predicting the response to therapy. We have examined the ability of learning methods to build prognostic and predictive models to predict response to chemotherapy, alone or combined with targeted therapy in mCRC patients, by targeting specific narrative publications. After a literature search, 26 original articles met inclusion and exclusion criteria and were included in the study. We showed that all investigations conducted in this field provided generally promising results in predicting the response to therapy or toxic side-effects, using a meta-analytic approach. We found that radiomics and molecular biomarker signatures were able to discriminate response vs. non-response by correctly identifying up to 99% of mCRC patients who were responders and up to 100% of patients who were non-responders. Our study supports the use of computer science for developing personalized treatment decision processes for mCRC patients.

Abstract

Tailored treatments for metastatic colorectal cancer (mCRC) have not yet completely evolved due to the variety in response to drugs. Therefore, artificial intelligence has been recently used to develop prognostic and predictive models of treatment response (either activity/efficacy or toxicity) to aid in clinical decision making. In this systematic review, we have examined the ability of learning methods to predict response to chemotherapy alone or combined with targeted therapy in mCRC patients by targeting specific narrative publications in Medline up to April 2022 to identify appropriate original scientific articles. After the literature search, 26 original articles met inclusion and exclusion criteria and were included in the study. Our results show that all investigations conducted on this field have provided generally promising results in predicting the response to therapy or toxic side-effects. By a meta-analytic approach we found that the overall weighted means of the area under the receiver operating characteristic (ROC) curve (AUC) were 0.90, 95% C.I. 0.80–0.95 and 0.83, 95% C.I. 0.74–0.89 in training and validation sets, respectively, indicating a good classification performance in discriminating response vs. non-response. The calculation of overall HR indicates that learning models have strong ability to predict improved survival. Lastly, the delta-radiomics and the 74 gene signatures were able to discriminate response vs. non-response by correctly identifying up to 99% of mCRC patients who were responders and up to 100% of patients who were non-responders. Specifically, when we evaluated the predictive models with tests reaching 80% sensitivity (SE) and 90% specificity (SP), the delta radiomics showed an SE of 99% and an SP of 94% in the training set and an SE of 85% and SP of 92 in the test set, whereas for the 74 gene signatures the SE was 97.6% and the SP 100% in the training set.

Graphical Abstract

1. Introduction

Colorectal cancer (CRC) is one of the most common cancers and a main leading cause of cancer death in the world [1,2]. The most recent data from GLOBOCAN recorded more than 1.88 million cases and 916,000 deaths per year [3]. Despite the effort of screening programs for early detection of CRC, about 20% of CRC patients present with metastatic disease at diagnosis (synchronous metastases), with low curative surgical control and high rate of tumor-related deaths [4]. In addition, about 25% of patients who present with localized disease will develop metastases at a later time (metachronous metastases) even after radical loco-regional treatment (surgery, radiotherapy) of the primary tumor and adjuvant chemotherapy [4]. When localized, the vast majority of CRC patients are curable by surgery (stage I and low-risk stage II), but the prognosis for metastatic CRC (mCRC) patients remains poor. Indeed, while the the 5-year overall survival (OS) rate for localized CRC is approximately 90%, this estimate drastically decreases for patients with metastatic CRC (mCRC) to a mere 15.5% (https://seer.cancer.gov/statfacts/html/colorect.html (accessed on 30 April 2022)). Treatment strategies are commonly based on clinical staging, with surgery, neoadjuvant (either with chemotherapy or radiation therapy), and postoperative adjuvant chemotherapy for high-risk locally advanced stages (high-risk stage II and stage III) and with systemic anticancer drugs as primary treatment for metastatic disease.
A combination of cytotoxic agents including fluoropyrimidine, oxaliplatin and/or irinotecan, with a targeted agent, either an anti-epidermal growth factor receptor (EGFR) or an anti-vascular endothelial growth factor (VEGF) monoclonal antibody (MoAb), is nowadays considered the primary standard treatment for mCRC patients to achieve improved disease control and prolong survival [5,6]. Results from the first phase III clinical trial evaluating the addition of the anti-VEGF MoAb bevacizumab to 5-fluorouracil and irinotecan represented the beginning of a new era for mCRC treatment by showing that such combined treatment could result into a significant improvement in tumor objective response (OR), progression-free survival (PFS), and OS [7]. While the median OS for mCRC patients following single agent 5-fluorouracil or biochemical modulation of 5-fluorouracil by folinic acid ranged from 10 to 12 months in the 1980s, it has now been significantly increased, to approximately 26–33 months, by the introduction of targeted therapy [8]. Despite these treatment advances, therapeutic outcomes are still unsatisfactory and variable among patients receiving the same drugs [9], due to the occurrence of tumor drug resistance either at the time of initial treatment (intrinsic resistance) or after the first-line treatment with cytotoxic and targeted agents (acquired resistance) in nearly all patients [10]. Moreover, a large part of mCRC patients can experience several adverse effects, as anemia, neutropenia, nausea, vomiting, diarrhea and cardiotoxicity [11] that might cause treatment delay or even discontinuation.
Evolution of genetically and/or epigenetically diverse tumor-cell populations during tumor growth and progression is considered the main factor causing tumor heterogeneity, displaying inherent functional variability in tumor propagation potential and tolerance to pharmacological treatment with either cytotoxic or targeted cancer therapeutics [5]. Personalized treatments for mCRC disease are today limited to a small number of drugs towards molecular targets such as anti-VEGF, anti-EGFR for RAS wild-type tumors, encorafenib for BRAF (V600E) mutated tumors, programmed death-ligand 1/programmed cell death-1(PDL-1/PD-1) inhibitors for mismatch repair deficiency (dMMR)/microsatellite instability-high (MSI-H) tumors, and possibly KRAS tyrosine kinase inhibitors for G12C KRAS mutated tumors [12]. Recent additions to the mCRC armamentarium are the inhibitors of HER2 (rarely overexpressed in CRC, with a higher prevalence in RAS/BRAF–wild type tumors) and NTRK gene fusions (limited to cancers that are wild-type for KRAS, NRAS, and BRAF) [13,14]. As above reported, although VEGF and its receptors (VEGFR) inhibitors have a therapeutic role in the treatment of mCRC [6] resulting in angiogenic blockade, their use still lacks stratification based on predictive biomarkers of drug response of clinical relevance, especially in terms of resistance [15]. The use of intensive chemotherapy combination regimens requires also a fine balance between clinical outcome and treatment related toxicity. Therefore, there is an unmet need for early identification of mCRC cases who will be responsive to specific regimens and clinical indicators of toxicity burden. Indeed, at the present, no imaging criteria or molecular biomarkers are available in clinical practice today for early identification of treatment response before the start of the therapy and even the radiologists cannot have sufficient imaging information on the baseline examination to predict which tumor lesions will respond to the treatments.
Artificial intelligence (AI), a field of computer science that mimics human intelligent behavior and performs tasks that commonly require human intervention, has gained interest in the field of cancer research for its problem solving, decision making and pattern recognition abilities [16]. Machine learning (ML) and deep learning (DL), two subsets of AI, have emerged as important tools that could effectively improve the field of molecular cancer characterization, from epidemiology [17], to diagnosis, prognosis and patient classification [18], with higher performance than traditional approaches [19,20,21,22,23,24]. Accumulating evidence suggests that radiomics can be used to study tumor heterogeneity and to predict therapy in CRC [25]. Biological insight on data-driven radiomics features has been recently provided by different biological metrics such as gene expression, immunohistochemistry staining intensity and microscopic histologic textures [26]. In particular, radiomics in combination with molecular biomarkers has even been shown to be capable of predicting genetic mutations including both KRAS mutations and KRAS/BRAF status in CRC patients [25]. Nevertheless, there are only scattered reports on applying AI techniques to predict treatment response in oncology, especially for metastatic colorectal disease; therefore, a systematic review and meta-analysis was designed to assess the effectiveness of learning algorithms to predict the response to chemotherapy alone or combined to targeted therapy, as well as the individual toxicity of standard regimens in this patient setting. To the best of our knowledge, this represents the largest meta-analysis of predictive classifiers for mCRC so far, aimed at validating or refuting previous findings, with the final goal of evaluating whether learning models could be used in clinical trials to assess prognosis or toxicity in standard of care settings and as predictors of therapy response.

1.1. Machine and Deep Learning Techniques

In recent years, the application of computer aided intelligence has expanded to healthcare for finding novel clinical solutions drawing on the subfields of the traditional ML and the cutting-edge DL architectures and seeking improved performance in respect to traditional statistical methods [18]. They are both capable of learning from data and identifying the distribution and patterns of key features, but have different learning structures, with DL mimicking the property of the human brain for data processing using multi-layered structured models [18]. ML has emerged as a powerful computational approach that uses algorithms to iteratively and automatically learn from data to perform a task without being specifically programmed [27]. Main ML analytical tasks consist in classification, pattern recognition, clustering, key feature identification and construction of predictive models using three different approaches: (i) supervised, (ii) unsupervised, and (iii) reinforcement learning [16]. In supervised learning, labeled data are used to train algorithms to classify data or predict outcomes; while in unsupervised learning, unlabeled or uncategorized data are used in learning processes and relationships are discovered on the basis of hidden similarities; lastly, reinforcement learning is an environment-driven approach, i.e., it enables software agents to automatically receive rewards or penalties to improve their performance [27]. Supervised algorithms can be grouped in classification and regression applications, where the former refers to the prediction of categorical outcomes, and the latter to the prediction of continuous outcomes. Common supervised ML models are Support Vector Machines (SVM), Decision Trees (DT), and Artificial Neural Networks (ANN). SVM is one of the most used algorithms which map the input variables into a feature space of higher dimensionality using the kernel trick [28] and identify the best decision surface, the hyperplane, which separates the dataset into two different classes. The DTs are tree-based classification models where the dataset is split at each node of the tree using input feature thresholds and leaves represent the final classification decision. A DT performs features selection at each node using metrics as information gain to select the most informative features during its training. DTs can be used for solving regression and classification problems also by learning simple decision rules inferred from prior data in the training process [18]. ANNs are biologically inspired computational networks based on studies of the brain and nervous system, that can compute values from inputs and examine interactions between groups of representative features to predict outcomes [29]. Unsupervised algorithms analyze unlabeled datasets for clustering, density estimation, feature learning, dimensionality reduction, finding association rules and anomaly detection. Clustering is a common task in biomedical research employing molecular profiling to find similarities among samples and allow stratification of patients into clinically-meaningful subgroups. Among the available techniques, the most frequently used are hierarchical and k-means clustering. Besides clustering, Principal Component Analysis (PCA), factor analysis, or more complex techniques such as matrix factorization are frequently used techniques to address dimensionality reduction especially for datasets with large number of variables and relative small numbers of samples [30]. Semi-supervised learning methods also exist which exploit labeled and unlabeled data to train a model [27]. On the other hand, DL is a subfield of AI technique that uses multi-layered neural network algorithms similar to the neurological architecture to infer predictions [31]. DL is essentially a computational architecture composed by different processing layers, namely an input, (multiple) hidden, and an output layer [27]. In respect of ML, the neural network architecture is used for deep processing, automated feature extraction and pattern recognition along complex data, as large-scale image classification. In DL architecture, the representation of output data produced by each layer is used as input to the subsequent layer and so forth, retaining the data of interest and discarding other information. By this approach, DL can identify complex combinations of original features within combinations of deeper layers and extrapolate new features that will increase the predictive value in respect to original ones [31]. Common DL methods are the Recurrent Neural Networks (RNN) and the Convolutional Neural Networks (CNN). RNNs are designed for making predictive models based on temporal sequences using current and past features to automatically construct algorithms. CNN is a type of network that is specified for image analysis and computer vision. CNN is able to perform specific tasks such as image classification and automated features extractions [32]. In ML/DP, the quality of the datasets is fundamental and a preprocessing strategy, encompassing feature standardization and extraction is a necessary step before applying any learning algorithm.

1.2. Standard Chemotherapy, Targeted Therapy and Mutational Profiles

Since the Sixties, 5-fluorouracil represents the basis of mCRC treatment. In the last 25 years, capecitabine, a pro-drug of 5-fluorouracil, oxaliplatin, a platinum compound, and irinotecan, a topoisomerase I inhibitor, have also been introduced in the clinical practice leading to improved patient outcomes [33,34]. Currently, the main doublet chemotherapies used in first-line treatment are FOLFIRI (5-fluorouracil/folinic acid plus irinotecan) and FOLFOX (5-fluorouracil/folinic acid plus oxaliplatin) regimens, while CapeOX (capecitabine plus oxaliplatin) and XELIRI (capecitabine plus irinotecan) represent less commonly used chemotherapy regimens. A triple chemotherapy, the FOLFOXIRI or FOLFIRINOX (5-fluorouracil/folinic acid plus oxaliplatin and irinotecan) regimen, is also a first-line treatment option in selected cases [5,35]. The availability of anti-EGFR MoAbs, such as cetuximab or panitumumab, and of anti-VEGF MoAbs, such as bevacizumab, from the early years after 2000 have further increased activity and efficacy of first-line drug treatment in mCRC when combined with the above chemotherapy regimens [5]. These biological therapeutics can act directly on tumor cells through different mechanisms including targeting cell surface receptors for specific relevant growth factors (e.g., EGF) and consequently by blocking oncogenic signaling leading to cell proliferation arrest, inhibition of cell differentiation and cell migration, interference with cell cycle progression and enhancement of apoptotic death. They can also act indirectly on tumor cells through inhibition of blood vessels in the tumor microenvironment by targeting specific endothelial cell membrane receptors or their growth factors (e.g., VEGF) [5]. More recently, immune checkpoint inhibitors (ICIs), working by blocking immunoinhibitory signals and enabling patients to induce an anti-tumor action, have also proven active in the treatment of mCRC [36]. ICIs targeting PD-1 are in fact highly effective in a selected mCRC patient population whose tumors display dMMR or MSI-H [37].
In the case of anti-EGFR MoAbs, the study of mutational profiling in KRAS and NRAS sequences is highly relevant for the choice of clinical treatments [38]. Patients with colorectal metastasis bearing mutations in KRAS or NRAS are considered non-responders or poor responders to anti-EGFR therapy, whereas the presence of BRAF V600E mutation identifies a very poor-prognosis subgroup of patients [39], who also display resistance to anti-EGFR MoAbs [40,41]. In BRAF mutated tumors the use of a potent BRAF tyrosine kinase inhibitor, such as encorafenib, can overcome resistance to anti-EGFR MoAbs [5]. In RAS wild-type tumors, cetuximab and panitumumab in combination with chemotherapy can instead significantly improve the median OS of mCRC patients [6]. Difference in response to chemotherapy can exist among patients carrying the same mutation profiles, since patients with KRAS or NRAS wild-type tumors can be partially not responsive to anti-EGFR agents [42]. Besides the above mentioned V600E BRAF mutations, also PIK3CA mutations, PTEN loss, HER2 overexpression/amplification or MET amplification have been linked to resistance to anti-EGFR targeted therapies [43]. Interestingly, excision repair cross complementation group 1 (ERCC1) expression is inhibited by KRAS mutation, making tumors more sensitive to cytotoxic treatments [44]. This has led to hypotheses that several other genetic and epigenetic mechanisms can be involved in primary or secondary drug resistance and tumor progression [45].
As above reported, dMMR/MSI-H are predictive biomarkers of tumor response to ICIs pembrolizumab and nivolumab MoAbs against PD-1 [2,6]. Indeed, PD-1 blockade with pembrolizumab or nivolumab in this setting of mCRC patients has provided an efficacious treatment option for chemotherapy resistant tumors [46,47,48,49]. In this regard, later phase III clinical studies of pembrolizumab versus standard chemotherapy in patients with previously untreated MSI-H or dMMR mCRC have shown that pembrolizumab significantly improves PFS [50] and OS [51] as first-line therapy as well as health-related quality of life [37,52]. These data have supported FDA and EMA approvals of pembrolizumab as a new standard of care in this mCRC subpopulation. Indeed, a treatment based on “double immunity” with nivolumab in combination with ipilimumab has also been evaluated in a non-randomized phase II trial in first-line setting with an OR rate of 69%, and a median PFS and OS not yet reached at a 24-month follow-up [53]. The v1.2021 NCCN guidelines now recommend nivolumab ± ipilimumab or pembrolizumab as a first-line treatment for patients with unresectable advanced or mCRC with MSI-H/dMMR status (NCCN Clinical Practice Guideline in Oncology. Version 1.2021. Available at: Rectal Cancer.NCCN.org (accessed on 30 April 2022)). ICIs have not yet shown definitive evidence of clinical efficacy in patients with proficient MMR or MS-stable mCRC. However, a recent study has shown that nivolumab added to encorafenib and cetuximab in V600E BRAF mutated, microsatellite-stable mCRC induces higher response rates [54] when compared with encorafenib and cetuximab alone from the previously reported study [55]. Moreover, retrospective studies have shown that patients with distal primary tumors have improved outcomes as compared to those with proximal tumors [56]. Proximal tumors are more aggressive and differ from distal tumors in showing diverse molecular and pathological characters [57]. Tumor location can be also associated with the response to targeted therapy. mCRC patients with distal tumors respond better to anti-EGFR agents than those with proximal tumors [39], possibly due to the different embryologic origin of distal tumors, whether embryonic hindgut or embryonic midgut. Interestingly, differences in embryologic origin are associated to different phenotypes, with proximal tumors carrying RAS and BRAF mutations, whereas distal tumors carry KRAS, APC, PIK3CA, and TP53 mutations [58]. Similarly, therapy responses can be different among these two kinds of colorectal tumors: cancer patients with left-sided tumors may be responsive to anti-EGFR targeted therapy, while those with right-sided tumors do not have the same benefit from conventional treatments [59].
In addition to somatic mutations, gene expression may contribute to tumor drug response. Through the years, different approaches have been used to identify genes whose expression levels could discriminate between responders and non-responders to pharmacological treatment in CRC. The “candidate gene” approach, widely used in the past years, has led to the identification of a number of potential biomarkers of drug response [60,61,62,63,64,65]. More recently, genes potentially predictive of drug response in CRC have also been identified and validated starting from an RNA seq transcriptomic approach [66,67]. In addition, by using unsupervised clustering methods, CRC transcriptomic data provided new molecular classifications, including the “consensus molecular subtypes” (CMS) [68] and the “cancer cell intrinsic transcriptional traits” (CRIS) [69]. Both of them, although not currently recommended in clinical practice, are good predictors of prognosis whereas their ability to predict drug response has not been proven.

2. Methods

This review and meta-analysis was conducted according to the Preferred Reporting Items for Reviews and Meta-Analyses (PRISMA) guidelines [70] and not registered in PROSPERO.

2.1. Search Strategy and Data Extraction

A systematic literature search was performed using National Library of Medicine medical literature database (PubMed, https://pubmed.ncbi.nlm.nih.gov/ (accessed on 30 April 2022)) from January 2005 up April 2022, to identify studies performed with AI techniques for the development and validation of accurate prognostic tools to be employed for predicting treatment response and/or individual toxicity in mCRC patients. This objective was obtained by targeting specific narrative publications in medical literature and identifying appropriated original scientific articles. The publications were searched with main domains connected by the Boolean operators “AND” and “OR”: “artificial intelligence”, “machine learning”, “deep learning”, “neural networks”, “predictor”, “classifier”, “signature” OR “radiomics” AND “metastatic colorectal cancer” OR “colorectal cancer” AND “chemotherapy”, “targeted therapy”, “cytotoxic regimens”, “immunotherapy”, “response to therapy” OR “responders”. To supplement PubMed search, additional articles were collected by checking reference lists of identified studies, reviews and systematic reviews. Two investigators (EL and VR) have independently screened the PUBMED publications against the criteria of inclusion and exclusion. In case of disagreement, the opinion of a third investigator (MP) was requested.
When available, the following variables were extracted for each eligible study: first line and previous chemotherapy and/or targeted therapy, AI networks (ML or DL), total sample size, training and validation cohort sample sizes, input features used to feed algorithms, signature used for learning models (radiomics or molecular biomarkers), learning model performances at training and/or at validation sets. We distinguished performance metrics based on the task, e.g., classification or regression/survival analysis, for each study. A detailed description of each metric is reported in Table 1.

2.2. Study Selection

Studies had to meet the following prespecified inclusion criteria: (1) patients with mCRC; (2) use of AI algorithms to assess response to therapy or side effects; and (3) published in English language. Exclusion criteria were: (i) case reports, review articles, editorials, letters, comments, and conference abstracts; (ii) studies lacking therapy response analysis; and (iii) studies lacking learning model performance evaluation.

2.3. Objectives

The primary objective of the study was to evaluate the effectiveness of learning algorithms in the prediction of the response to chemotherapy alone or combined to targeted therapy and/or side effects for mCRC patients. The secondary objective was to assess the efficacy of the aforementioned algorithms in discriminating responder mCRC patients/treated metastatic lesions vs. non-responders mCRC patients/untreated metastatic lesions.

2.4. Publication Quality

The study of publication quality was assessed by two investigators (MP and EL) using study quality items that were criteria of quality publication in the Luo scale [71], in keeping with a recent systematic review on ML studies [72].

2.5. Rational Framework

A rational framework for interpreting model performances was developed using the area under the receiver operating characteristic (ROC) curve (AUC), and the sensitivity (SE) and the specificity (SP) estimates reported in the AI models. AUC values were interpreted as follows: 0.6–0.7 (worthless), 0.7–0.8 (poor), 0.8–0.9 (good), and >0.9 (excellent) [73]. In addition, the SE was referred to the ability of a test to correctly identify patients who are responders, whereas the SP to the ability to correctly classify those who are non-responders [74,75].

2.6. Statistical Analysis

A meta-analysis was carried out including only the studies reporting the AUC or HR estimates with 95% C.I. in training and/or validation sets. The random effects model [76,77,78] was used to calculate global response to therapy, reported as overall AUC calculated from the receiver operating characteristic (ROC) analyses or overall HR with 95% CIs from proportional hazards models (Cox regression). The pooled HR was considered statistically significant if the 95% C.I. did not include 1.0 while the confidence intervals for global AUC not including 0.50 suggest that the test is able to discriminate between the comparison groups. To stabilize the within-study variance, the logit of the AUC and the natural logarithm of HR were used for the statistical analysis. To assess whether the pooled performances were stable or significantly dependent on one or more studies, sensitivity analyses were conducted by interactively recalculating the overall estimates after exclusion of one study at a time. Egger’s test was also applied to assess the occurrence of publication bias. All statistical analyses were conducted using Stata Software version 14.2 (StataCorp LLC, College Station, TX, USA).

3. Results

A total of 1947 records were identified from databases using the predefined search criteria (Figure 1). After a review of titles and abstracts, duplicates and papers not related to prediction models were rejected. 221 remaining manuscripts were analyzed in-depth to evaluate their concordance with inclusion and exclusion criteria. Other articles were collected by checking reference lists of identified studies, reviews and systematic reviews. The final database consisted of 26 studies with 50,257 patients eligible for inclusion in the database [74,75,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102]. Table 2 and Table 3 show the 26 original AI studies that were included in the systematic review. There were 13 studies with 4193 patients, 5 of radiomics and 8 of molecular biomarkers learning models, that evaluated the response to chemotherapy alone, and 2 other studies with 36,050 patients that assessed the individual toxicity of standard regimens.
Likewise, there were 14 studies, with 3 investigations in common with Table 3, with 10,014 patients, 7 of radiomics and 7 of molecular biomarkers based classifiers that analyzed the response to targeted therapy. The AI studies were listed and divided in predictive learning models of response or side effects to chemotherapy alone and predictive learning models of response to targeted therapy. Investigations collected under each category were stratified in imaging, molecular and clinical biomarkers based models, summarized in detail and discussed. Table 2 and Table 3 report the variables extracted for each eligible study.

3.1. Outcomes and Performance Estimates of Predictive Models

Different clinical endpoints, including the median values of the OS, the PFS, the TTNT or the Response Evaluation Criteria in Solid Tumors (RECIST) criteria, were used to analyze the response to therapy in periods of time consistent between different studies, commonly after 6 or more months from the last cycle of chemotherapy, but shorter time pharmacoparameters were employed to study toxic side-effects. Several performance metrics, such as the AUC, the Harrell’s concordance C (C-index), the SE, the SP, the accuracy (ACC), the positive predictive value (PPV) and the negative predictive value (NPV), the classification precision and F1 scores (the harmonic means of the precision and recall) and patient survival estimation by hazard ratio (HR) were used to evaluate the predictive power of various algorithms for correctly classifying response vs. non-response.

3.2. Publication Validation

When data quality was assessed in the 26 AI study included in the systematic review and the meta-analysis [74,75,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102], we identified a common gap that consisted in the lack of an external validation cohort, that was missing in more than the half of the investigations (Table 4 and Supplementary Table S1). Outliers, missing values and C.I. were also not handled by most of the studies.

3.3. Artificial Intelligence Predictive Models

In the following paragraphs, we provide a perspective of the progress in computer science by reviewing the roles that learning techniques have played in cancer research with predictive algorithms of response to chemotherapy alone or combined with targeted therapy as well as for predicting individual toxicity for mCRC patients. As shown in Table 2 and Table 3, 26 studies were conducted to develop and validate different predictive algorithms for mCRC patients [74,75,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102]. In the AI investigations, cohorts were generally divided into a training and a validation set to train and validate the developed models to avoid overfitting, with the exception of three studies [79,92,93], that did not split the data. Data on training and validation set were also not always reported. In radiomics studies, different timepoints in the same person were often used in both the training and testing sets, e.g., at baseline and after chemotherapy. The distribution of the outcome cases, e.g., numbers of responders vs. non-responders, was commonly different in the testing and training data sets of the 26 AI investigations. Information on the distribution of the outcome cases, e.g., numbers of responders vs. non-responders, was generally missing, with the exception of three AI studies, where the following frequency were reported: 131 responder vs. 61 non-responders in the study of Wei et al. [80], 42 responder vs. 41 non responders in the study of Tian et al. [95], and 184 responders vs. 210 non responders in the study of Liu et al. [94].

3.4. Cytotoxic Chemotherapy and Radiomics Learning Models

As mentioned above, oxaliplatin is widely used as part of first-line therapy in mCRC patients. However, only some patients achieve OR, and most of them will progress in the subsequent months [5]. To evaluate the treatment response of mCRC to first-line FOLFOX chemotherapy, Giannini et al. [75] have employed several ML networks to build a radiomic signature using texture features from contrast-enhanced computed tomography (CT) scans of liver metastasis performed at baseline and at subsequent time points. In that investigation, the authors published a delta-radiomic signature capable of correctly identifying lesions as responder vs. non-responder to oxaliplatin-based regimen. The best performance was shown by a tree-structured algorithm in respect to other networks, a SVM and a random forest, with AUCs of 0.99 (95% C.I. 0.97–1.0, 99% SE, 94% SP, ACC of 97%) and 0.93, (95% C.I. 0.87–0.96, 85% SE, 92% SP, ACC of 86%) in training and validation sets, respectively. In another study, Nakanishi et al. [79] developed computer-aided prognostic models aiming to distinguish mCRC patients suitable for standard chemotherapy. Specifically, they used a learning regression model to construct a radiomic signature for assessing the response to oxaliplatin-based first-line chemotherapy using the texture features from pretreatment contrast-enhanced CT scans of liver metastases. The radiomic signature built was able to discriminate responders vs. non-responders and the performance of the learning model reached AUCs of 0.851, 95% C.I. 0.771–0.930 and 0.779, 95% C.I. 0.617–0.940, in training and validation arms, respectively.
Using a computational approach, Wei et al. [80] have evaluated the power of a DL model to predict the response to first-line regimens including oxaliplatin or irinotecan for mCRC or recurrent CRC patients using data from pretreatment contrast-enhanced multidetector tomography (MDCT) scans of liver metastases. The radiomic signature constructed by a residual network architecture algorithm reached a high predictive performance for chemotherapy response with an AUC of 0.903, 95% C.I. 0.851–0.955. The classifier was found to be able to identify responders vs. non-responders with 84.7% SE, 84.8% SP, ACC of 84.7% on the training set, but lower estimates were observed in the validation set (AUC of 0.745, 95% C.I. 0.659–0.831, 90% SE, 73% SP and ACC of 85.4%). A better performance was instead obtained by the final fusion radiomic signature that integrated the response of carcinoembryonic antigen serum level, with AUCs of 0.935, 95% C.I. 0.897–0.973 and 0.830, 95% C.I. 0.688–0.973 on training and validation sets, respectively.
To identify radiomics signatures predictive of response to standard chemotherapy (i.e., first-line FOLFOX or FOLFIRI) for mCRC patients, Defeudis et al. [81], have developed and validated several ML technologies for predicting the response to first-line FOLFOX or FOLFIRI using data derived from pretreatment CT scans of metastatic lesions. In that study, different radiomic features were fed as inputs to the learning model, including those extracted from the whole metastasis by a 3D approach and others from the traditional region of interest. The best performance in identifying responder vs. non-responder lesions was given by a SVM classifier with 3D data approach rather than by a Gaussian naive Bayes or a multilayer perceptron algorithms in per-lesion analysis. The performance of SVM showed 76% SE, 67% SP, ACC of 72%, 69% PPV, 75% NPV and 61% SE, 60% SP, ACC of 61%, 64% NPV, 57% PPV in training and validation sets, respectively. Conversely, when SVM algorithms were carried out in per-patient analysis, the model yielded only an ACC of 32%, 41% SE and 21% SP, with 23% NPV and 38% PPV on validation set.

3.5. Cytotoxic Chemotherapy Alone and Biomarkers and Clinical Learning Models

Tian et al. [95] have investigated the response to FOLFOX chemotherapy by ML technique using publicly available genomic data of mCRC patients from GSE28702 and GSE72970 cohorts. In this study, a 74-gene signature, identified by different learning algorithms in the top rank 250 genes, was used by an iterative supervised learning (IML) method to classify mCRC cases into responder or non-responder. The learning prediction model showed a SE of 97.6% and a SP of 100% in the training set, whereas the survival analysis resulted in a HR of 2.6 in the validation set with a median OS time of 13.4 months in non-responders vs. 36.6 months in responders. The pattern of gene signature of non-responders showed increased expression of ERCC excision repair 1, endonuclease non-catalytic subunit (ERCC1) and dihydropyrimidine dehydrogenase (DPYD) genes, suggesting high catabolism rate and efficient nucleotide excision repair.
Long noncoding RNAs (lncRNAs) have been involved in most of fundamental tumor processes including epithelial-to mesenchymal transition (EMT), immunity and angiogenesis [103,104], through a lncRNA transcriptome analysis that identified lncRNAs associated with these and other features in CRC. However, the clinical significance of lncRNAs is still unclear. To explore this issue, Liu et al. [84] have employed different ML models to develop and validate an immune related signature able to predict the response to 5-fluorouracil-based regimens with or without anti-VEGF therapy. In particular, immune infiltration data of CRC patients, including cohorts of patients with CRC metastasis, from the TCGA Research Network portal (https://portal.gdc.cancer.gov/ (accessed on 30 April 2022)) and Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/ (accessed on 30 April 2022)) were used. In this system, 235 lncRNA modulators of immune-related pathways, including the T cell receptor signaling and antigen processing and presentation pathways, were identified by integrated algorithms using molecular data derived from immune infiltration patterns of the TCGA-CRC database (https://portal.gdc.cancer.gov/ (accessed on 30 April 2022)). Forty-three potential predictive lncRNAs were then used to build a consensus immune-related lncRNA signature (IRLS) by an integrative combination of Lasso and stepwise Cox regression, which identified a final signature formed by 16 predictive lncRNAs. The classifier was formed by immune-related lncRNAs encoded from the following RNA genes, the imprinted maternally expressed transcript (H19), the long intergenic non-protein coding RNA 308 (LINC00308), the epithelial cellular adhesion molecule (EpCAM), the SEPTIN7 divergent transcript (SEPTIN7-DT), the ASH1 like histone lysine methyltransferase (ASH1L), the MRGPRG antisense RNA 1 (MRGPRG-AS1), the MIR210 host gene (MIR210HG), the uncharacterized LOC100507250 (LOC100507250), the fibrous sheath interacting protein 1 (FSIP1), the leucine-rich repeat-containing protein 61 (LRRC61-1) and the long intergenic non-protein coding RNA 2560 (LINC02560). In this study, responders to chemotherapy were characterized as having higher levels of IRLS signature than non-responders, whereas the levels of IRLS signature were decreased in metastatic or recurrent CRC patients who were sensitive to anti-VEGF therapy. In particular, the IRLS signature was found to predict the response to fluorouracil-based regimens for metastatic or recurrent CRC patients on the GSE19860 (AUC of 0.843), GSE69657 (AUC of 0.765), and GSE72970 (AUC of 0.709) cohorts, as well as the benefits of adding bevacizumab on the GSE19860 (AUC of 0.771), GSE72970 (AUC of 0.781) and GSE19862 (AUC of 0.694) cohorts. Of note, performance estimates might reflect small sample size of different GSE cohorts (Table 2 and Table 3).
Prognosis for mCRC can be influenced by different gene expression profiles [105]. Hence Lu et al. [101] have investigated the performance of several ML techniques to predict the response to first-line FOLFOX chemotherapy using gene expression data. In this investigation, the authors identified 18 differentially expressed genes (DEGs), that were enriched in autophagy, ErbB signaling pathway, mitophagy, endocytosis, FoxO signaling, apoptosis, and antifolate resistance pathways, between FOLFOX responders and non-responders. Based on the 18-gene classifier, a random forest was able to predict the response to FOLFOX chemotherapy with a performance (AUC) of 0.877, 95% C.I. 0.747–1.00, with 85% in SE and 69.2% in SP in validation set. In addition, the authors demonstrated that the over expression of the mixed lineage kinase domain like pseudokinase (MLKL) and the coiled-coil domain containing 124 (CCDC124) genes were associated with increased survival (HR of 0.358, 95% C.I. 0.178–0.717 and 0.563, 95% C.I. 0.336–0.943, respectively). However, when the learning models were applied to predict the response to FOLFIRI chemotherapy, the best model was the neural network algorithm with an AUC of 0.778, 95% C.I. 0.576–0.979.
Chen et al. [83] had the aim of constructing a learning model predictor for the response to oxaliplatin- or irinotecan-based regimens using circulating cytokines in mCRC patients. In that study, the authors identified a predictive signature of 17-cytokines using univariate SVMs, including the fibroblast growth factor-2 (FGF-2), a growth factor and signaling protein, the transforming growth factor α (TGFα), an angiogenic cytokine and an endothelial growth factor, the cytokine Fms-like tyrosine kinase 3 ligand (Flt3), an important regulator of hematopoiesis, the human interferon α-2 (INFα2), the inflammatory interleukins 2, 7, 8 and 10 (IL-2, IL-7, IL-8, and IL-10), the soluble form of IL-2 receptor (sIL-2Ra), the tumor necrosis factor α (TNFα), the inflammatory cytokines monocyte chemotactic proteins 1 and 3 (MCP-1 and MCP-3), the VEGF, the macrophage-derived chemokine (MDC), and the granulocyte-macrophage colony-stimulating factor (GM-CSF) [106]. Based on the 17-cytokine signature, a COX proportional hazard model was able to predict a worst survival in the group of high-risk patients characterized by high-cytokine expression, with an overall SE of 83.5%, 80% in SP, and an ACC of 81% on the training set. Comparable estimates were computed on the validation set (83.1% in SE, 66.7% in SP, and 74.9% ACC). Moreover, high-risk patients showed a worst OS as compared to low-risk patients in both training and validation arms.
Tsuji et al. [82] utilized a random forest to build a learning model that could predict the response to first-line FOLFOX chemotherapy using gene-expression data from mCRC patients. In this study, the authors determined a gene signature for FOLFOX based on the gene expression of 14 classifier genes. In more detail, the 14-gene signature was constituted by different genes, including the smad ubiquitin regulatory factor 2 (SMURF2), a negative regulator of TGF-β signaling with a tumor-suppressive role [107], the Mbt domain containing 1 (MBTD1), encoding a potent transcriptional coactivator [108], the adaptor related protein complex 3 subunit Mu 2 (AP3M2), the RING finger protein 141 (RNF141), a gene reported to interact with KRAS promoting CRC progression [109], the aminopeptidase puromycin sensitive (NPEPPS), encoding key zinc metallopeptidases [110], the bromodomain PHD finger transcription factor (BPTF), a chromatin remodeling-related gene [111], the family with sequence similarity 73, member A (FAM73A), the amyloid beta precursor protein binding protein 2 (APPBP2), the archaelysin family metallopeptidase 2 pseudogene 1 (AMZ2P1), the SLIT-ROBO rho GTPase activating protein 1 (SRGAP1), that encodes a protein mediating cell migration associated with CRC tumor progression and poor prognosis [112], the N-myristoyltransferase-1 (NMT1), necessary for lysosomal degradation and mammalian target of rapamycin (mTOR) signaling pathway [113], the centrosome and spindle pole associated protein 1 (CSPP1), whose circ-CSPP1 was found to be significantly overexpressed in CRC tissues [114], the eukaryotic translation initiation factor 1 (EIF1), and the centrosomal protein 290 (CEP290) genes. Using the 14-gene classifier, the learning model was capable of classifying responders with an overall SE of 91.3% compared with non-responders with 95.6% in SP, achieving an ACC of 80.2%. In addition, responders showed an extent of 22.7 more months of median OS compared with non-responders in the training set.
Yuan et al. [84] have constructed a protein-based prediction model for analyzing the response to first-line with FOLFOX or FOLFIRI regimens for mCRC patients. Using this protein-based algorithm, responders to both treatments were identified with an overall SE of 92.9% and an 81.3% SP for FOLFOX and a SE of 92.3% and 92.3% SP for FOLFIRI regimens. Similarly, Del Rio et al. [92] have used ML approaches to build a predictor of chemotherapy response using gene expression data for mCRC receiving FOLFIRI regimens. By a learning approach, a pattern of 14 highly discriminatory genes involved in several pathways was investigated for assessing the potential predictivity. The main pathways were including RNA splicing (U2AF1L2), regulation of transcription (ZNF32 and ZNF582), cell adhesion (F8, galectin-8, PSG9), cell differentiation (SERPINE2, BOLL), ion transport (ATP5O), signal transduction (DRD5), development (ANGPTL2) and visual perception (EML2). In this investigation, a SVM model reached 92% SE, 100% SP and 95% ACC in classifying mCRC patients and identifying the subset of them who could benefit from FOLFIRI chemotherapy.
Irinotecan-based regimens can cause a heavy collateral effect leading to the delay or even the suspension of chemotherapy with a subsequent impact on the survival of mCRC patients. The study of pharmacokinetic parameters is a valuable option to improve both efficacy and safety of cancer drugs administration. In mCRC treatment, pharmacokinetic studies have revealed that systemic exposure to irinotecan is highly variable among patients. Therapeutic drug monitoring and genotype-driven studies might thus be helpful in applying dosage adjustments for this drug. Oyaga-Iriarte et al. [93] developed various ML algorithms for predicting side effects of irinotecan regimens using pharmacokinetic parameters in mCRC patients. In this network, the best mathematical model for predicting leukopenia in patients with liver metastasis was built by random forests (AUC of 0.74, 89% in SE, 60% in SP, and 76% ACC). The more performant model for predicting neutropenia was the supervised SVM algorithm (AUC of 0.88, 70% in SE, 70% in SP, and 75% ACC), while the best learning model for predicting diarrhea was the backward stepwise logistic regression (AUC of 0.95, 81% in SE, 100% in SP, and 91% ACC).
To develop a predictive model for the risk of cardiotoxicity, a severe side effect of standard chemotherapy, Li et al. [96] employed different ML methods to analyze the outcome of 30-day cardiotoxicity—using any cardiotoxicity event—on a large cohort of 36,030 CRC patients undergoing fluoropyrimidine-based chemotherapy. The cardiotoxicity events were correlated to six broad classes of variables available in the SEER-Medicare database, including anticancer treatment information, cancer information, demographic variables, socioeconomic status, prior cardiovascular disease histories and other comorbidities, prior medications. The algorithm reaching the highest performance was an extreme gradient boosting (XGBoost) algorithm, based on cardiovascular features such as ischemic heart diseases, cardiomyopathy, arrhythmia and stroke, that showed a precision of 0.621, F1 score of 0.396 and an AUC of 0.801, 95% C.I. 0.781–0.821 in predicting cardiotoxicity in mCRC patients.

3.6. Targeted Therapy and Radiomics Learning Models

Tumor response assessment is commonly based on tumor size change on instrumental images, but morphological changes in tumor can occur earlier in response to antiangiogenic agents. To face this limit, Lu et al. [85] utilized data derived from pretreatment and follow-up CT scans of metastases and lymph nodes to construct DL networks aimed to predict the response to anti-VGEF agents in mCRC patients from the VELOUR trial [115]. In this investigation, the DL network architecture constructed with CNN and RNN networks and based on the quantitative characterization of tumor morphological changes, was able to give an early prediction of treatment response. The performance of the DL model showed C-indexes of 0.678, 95% 0.650–0.706 and 0.649, 95% C.I. 0.619–0.679 on training and validation arms. The integration of the DL network with traditional size-based methodology still improved the performance with a C-Index of 0.694, 95% C.I. 0.661–0.720. The DL network also reached higher power in classifying responders and non-responders to antiangiogenic drugs with responders having a better OS than the non-responders, with median OS 18 vs. 10.4 months, respectively (HR of 0.49, 95% C.I. 0.40–0.61).
Learning models combined with the analysis of RAS mutations can offer important advantages for predicting the response to treatments of CRC metastases. Patients with tumors carrying RAS mutations are in fact considered non-responders to anti-EGFR therapy whereas those with wild-type RAS status are not always sensitive to anti-EGFR antibodies due to the less frequent mutations in the EGFR signaling pathway [39]. To correlate radiomic parameters to RAS mutational status, Granata et al. [91] have examined the association of RAS mutational status with anti-EGFR therapy by ML using data derived from pretreatment contrast enhanced magnetic resonance imaging (MRI) of liver metastases in mCRC patients from the OBELICS cohort [116]. Using a radiomic signature based on both textures and morphological features, RAS mutational status was better detected by a supervised SVM algorithm as compared to other learning algorithms, as a k-nearest neighbors (KNN), an ANN, and a DT network (AUCs of 0.79, 95% C.I. 0.70–0.85, 78% SE, 74.2% SP, ACC of 76.1% and 83.3% SE, 75% SP, ACC of 79.2% in training and validation sets). However, the best result was obtained by a supervised KNN algorithm exclusively fed with robust textures as predictors (AUC of 0.84, 95% C.I. 0.780–0.91, 90% SE, 67.8% SP, ACC of 76.9% and 91.7% SE, 83.3% SP, ACC of 87.5% on training and validation arms).
Zhu et al. [98] analyzed the response to anti-VEGF therapy by DL, using features extracted from preoperative and post-operative MRI of no more than five liver metastatic lesions in mCRC patients. Using a densely connected center cropping CNN (DC3CNN) architecture, features from input data, pre-treatment T2-weighted image, post-treatment data, were extracted to construct three predictive models. The best performance for differentiating non-responder and responder patients was obtained by the DL model containing all the features with an AUC of 0.849, 95% C.I. 0.737–0.926, 91.7% SE, 75% SP, ACC of 87.5%, 75% NPV, 91.7% PPV in the validation set. In addition, the performance of the radiomic signature was further validated in an external independent cohort, with AUC of 0.833, 95% C.I. 0.695–1.000, 91.9% SE, 75% SP, ACC of 88.5%, 69.2% NPV, 93.8% PPV. DL also demonstrated better PFS and OS in responder vs. non-responder groups.
Maaref et al. [86] utilized a DL computational tool to predict the response to anti-VEGF therapy using the textures derived from pretreatment contrast enhanced CT scans of liver metastases of mCRC patients. In this work, a fully automated network (DCNN) achieved higher performance to identify new lesions appearing on CT images by classifying tumors in “treated” vs. “untreated lesions” as compared to other algorithms, as DT, SVM, and ANN models, with AUCs of 0.97 vs. 0.66, 0.60 and 0.62, respectively. The unsupervised DCNN algorithm showed early detection of non-responsive patients with respect to all untreated lesions with AUCs of 0.83 (95% C.I. 0.78–0.87, 97% SE, 59% SP, ACC of 78%) and 0.88 (95% C.I. 0.85–0.94, 98% SE‚ 54% SP, ACC of 76%) in the training and validation arms, respectively.
Dercle et al. [87] conducted a study aiming to develop an algorithm predictive of tumor sensitiveness to standard chemotherapy with or without anti-EGFR agents by ML, using data derived from pretreatment and follow-up CT scans for mCRC patients from the CRYSTAL cohort [9]. The authors observed that the radiomic signature based on spatial and temporal heterogeneity of tumors was highly performant to predict treatment-sensitive tumors to anti-EGFR targeted therapy. The best predictive model was constructed by a random forest algorithm using high quality medical images with AUCs of 0.83 (95% C.I. 0.75–0.92, 77% SE, 85% SP) and 0.80 (95% C.I. 0.69–0.94, 80% SE, 78% SP) in the training and validation sets, respectively. Comparable results were obtained using standard of care scans with AUCs of 0.84 (95% C.I. 0.76–0.89) and 0.72 (95% C.I. 0.59–0.83) in the training and validation arms, respectively. The high-risk radiomic signature was associated to shorter OS as compared to the patients with low-risk signature in both training and validation sets. However, such radiomic signatures failed to predict sensitivity when the response to chemotherapy alone was investigated.
Vera-Yunca et al. [88] constructed a predictive model of response to anti-EGFR therapy by learning algorithms using features derived from pretreatment CT scans and MRI for mCRC patients from the CRYSTAL [9], APEC [117], Study 045 [118] and OPUS [119] cohorts. In this investigation, the k-means clustering constructed with data of tumor heterogeneity in lesion dynamics was predictive of OS in mCRC patients. The model showed that high tumor heterogeneity was associated with a worst OS with an HR of 1.44, 95% C.I. 1.08–1.92, mostly in patients with KRAS mutated tumors.
Recently, anti-HER2 therapy has proven to be a beneficial treatment option for mCRC patients with HER2 amplification or overexpression [43]. Therefore, to predict the response to dual anti-HER2 therapy, Giannini et al. [74] developed and validated ML networks predictive of the response to trastuzumab and lapatinib using data from baseline CT scans of liver metastases of mCRC patients with RAS wild-type and HER2 amplified tumors from the HERACLES cohort [120]. In this network, a Gaussian naïve Bayesian classifier reached high performance in identifying responder vs. non-responder lesions in patients with heterogeneous response, with 89% SE, 85% SP, 93% NPV, 78% PPV, and 90% in SE, 42% in SP, 73% NPV, 71% PPV, in training and validation lesions sets, respectively. Comparable high predictive values were also obtained by the radiomic score in predicting overall patient response with 92% SE, 86% SP, 75% NPV, 96% PPV in patient arm.

3.7. Targeted Therapy and Biomarkers Learning Models

The reported limited clinical efficacy of angiogenesis inhibitors [5] has prompted the need for better understanding of resistance mechanisms and the elucidation of novel predictive models. To direct attention to this item, Abraham et al. [99] studied the response to anti-VEGF therapy, including bevacizumab plus FOLFOX or FOLFOXIRI regimens, by a learning approach using data from next generation sequencing (NGS) in mCRC patients from the TRIBE2 cohort [121]. In that investigation, the genomic features of 592-gene data fed as inputs into more than one hundred learning models, produced a final model containing a 64-gene classifier. This 64-gene predictive classifier, named FOLFOXai, consisted of several genes involved in relevant pathways. The following genes were included in the signatures: the mediating WNT signaling (BCL9 and CDX2), epithelial-to-mesenchymal transition (EMT, INHBA, PRRX1, PBX1, and YWHAE), chromatin remodeling (EP300, ARID1A, SMARC4, and NSD3), DNA repair (WRN and BRIP1), NOTCH signaling (MAML2), and cell-cycle regulation (CNTRL and CCNE1). By FOLFOXai, mCRC patients were distinguished in “increased benefit” (IB) vs. “decreased benefit” (DB), where the category of IB showed improved clinical outputs in both oxaliplatin-containing arms. Moreover, the signatures showed a median TTNT of 11.5 months for IB and 8.2 months for DB, HR of 0.537, 95% C.I. 0.428–0.674, as well as a median OS of 42 months for IB and 24.5 months for DB, HR of 0.466, 95% C.I. 0.325–0.670 in the FOLFOX/bevacizumab arm. Conversely, the FOLFOXai prediction resulted inverted in FOLFIRI cohort with a median OS of 18.7 months for IB and 34.4 months for DB, HR of 2.631, 95% C.I. 1.041–6.649. A median PFS of 9.6 months for IB vs. 8.7 months for DB, HR of 0.757, 95% C.I. 0.505–1.135 and a median OS of 24.8 months for IB vs. 18.7 months, HR of 0.629; 95% C.I. 0.404–0.981 were detected in the FOLFOX/bevacizumab arm. Improved median PFS of 13.8 vs. 7.6 months, HR of 0.683, C.I. 0.396–1.181 and higher median OS of 30 vs. 15.9 months, HR of 0.483, C.I. 0.270–0.864 were even reported in the FOLFOXIRI/bevacizumab arm.
KRAS mutation profiles status can predict the response to treatment in mCRC [39], therefore Naseem et al. [102] have investigate by random survival forests method selected single nucleotide polymorphisms (SNPs) potentially predictive of the response to anti-VEGF therapy, including bevacizumab plus FOLFIRI, in both KRAS mutant and wild-type mCRC patients from the TRIBE and FIRE-3 cohorts [121]. Among 27 SNPs, the learning algorithms identified that SNPs in Wnt/β-catenin, tumor associated macrophage and sex-differentiation pathways were predictive of OS and/or PFS in mCRC patients treated with FOLFIRI/bevacizumab. In detail, the CBP rs129963 T/T variant and the β-catenin rs3864004 A/A genotype were associated with worst OS and PFS in KRAS wild-type (OS of 22.8 vs. 26 months and PFS of 9.5 vs. 10.5 months) and mutant (OS of 16.3 vs. 26.3 months and PFS of 7.8 vs. 9.6 months) patients, respectively. Conversely, improved OS and PFS were observed in both KRAS mutant (PFS of 10.3 vs. 8.6 months) and wild-type (OS of 31.3 vs. 24.8 months and PFS of 11.3 vs. 10.3 months) patients with the TBK1 rs7486100 A/A variant. Higher estimates were also obtained for the CCL2 rs4586 T/T carriers (OS of 30.9 vs. 22.8 months), the VEGFR2 rs2305948 any C carriers (OS of 26.2 vs. 17.0 months) and in the MMP2 rs243865 any T carriers (OS of 28.5 vs. 20.3 months).
In order to identify single nucleotide polymorphisms (SNPs) potentially predictive of the mechanisms of response and resistance associated to anti-VEGF therapy, Barat et al. [89] analyzed by learning methods the association between SNPs and the response to bevacizumab plus FOLFIRI chemotherapy using data from exome-sequencing in mCRC patients from the ANGIOPREDICT, MAVERICC and TRIBE cohorts [121]. In this study, 47 novel predictive SNPs were selected by learning algorithms across all exomes. Along the novel SNPs, two SNPs alone, the NLRP1 rs12150220 and the SRL rs13334970, one of which encodes a pro-inflammatory gene (NLRP1), were found to be associated with response to anti-VEGF therapy. For the rs12150220, missense at amino acid 155 of NLRP1 (NLR Family Pyrin Domain Containing 1), TT carriers had better median PFS than AA carriers in the ANGIOPREDICT and MAVERICC cohorts, HRs of 0.52, 95% C.I. 0.33–0.83 and 0.42, 95% C.I. 0.21–0.85, respectively. Conversely, for the rs13334970, mapping to SRL (Sarcalumenin), AA carriers showed shorter median PFS than carriers of at least one G allele in ANGIOPREDICT and MAVERICC cohorts, HRs of 2.3, 95% C.I. 1.19–4.57 and 2.5, 95% C.I. 1.12–5.5, respectively. The NLRP1 any A and the SRL AA carriers had worse PFS in ANGIOPREDICT and MAVERICC datasets, HR of 8.3, 95% C.I. 3.3–21 and 2.2, 95% C.I. 1–5, respectively. Association with worse outcomes was found with KRAS wild-type and concomitant carriers in combination with the NLRP1 any A and the SRL AA, HRs of 7.8, 95% C.I. 2.5–24.4 and 3, 95% C.I. 1.2–8 in MAVERICC and TRIBE. In KRAS wild-type patients, the NLRP1 any A and the SRL AA carriers showed a worse PFS than in the other two genotype combinations with shorter median PFS (4 vs. 11 and 15 months), and also shorter median OS (12 vs. 27 months).
Williams et al. [97] have examined by a ML approach using immunohistochemical data the response to anti-EGFR therapy with or without irinotecan regimens in mCRC patients with KRAS wild-type from the PICCOLO dataset [122]. By whole slide learning and computer vision techniques, the percentage of positively stained tumor cells within the tumor areas for EGFR ligands amphiregulin (AREG) and epiregulin (EREG) were found to predict benefit from the anti-EGFR agent panitumumab. Higher PFS was associated to the response to targeted therapy in patients with KRAS wild-type and high ligand expression as compared to those receiving standard regimens (8.0 vs. 3.2 months, HR of 0.54, 95% C.I. 0.37–0.79), and in those with both KRAS and BRAF wild-type, HR of 0.53 (0.36–0.78).
Ubels et al. [90] have recently evaluated whether genome-wide genotyping could be used to predict the efficacy of cetuximab in mCRC patients from the CAIRO2 dataset [123]. In this study, a random forest approach based on 781-SNP signature was able to classify mCRC patients in two classes, “benefit” vs. “no benefit”; a subset of patients showed benefit from receiving anti-EGFR therapy demonstrated by a higher median PFS (HR of 0.69, 95% C.I. 0.49–0.98), as compared to the category ‘no benefit’ (HR of 1.32, 95% C.I. 1.07–1.62). A greater performance was even found when chromosomal sex was incorporated in learning models (HR of 0.52, 95% C.I. 0.35–0.76).
Johnson et al. [100] have investigated by a ML approach with gene-mutation data the response to standard chemotherapy with or without target therapy in mCRC patients from the MSK cohort [124]. In this study, mutation profiles fed as inputs into a RF model, produced a final model containing a classifier based on mutation profiles of KRAS, BRAF, ERBB2, MAP2K1, TSC2, TP53, and APC. The 7-gene algorithm showed high predictive power for PFS to identify responder (non-progressed) vs. non-responder (progressed) mCRC patients with HR of 16.9, 95% C.I. 4.2–68.0.

3.8. Meta-Analysis

We performed a meta-analysis of the 9 AI studies reporting performance estimates of the response to therapy and 95% C.I. with 2.441 patients [75,79,80,85,86,87,98,99,101]. Table 5 reports the estimates of AUCs and HRs and their C.I. used for discriminating response to therapy that were extracted from the predictive models from the aforementioned studies. Results of meta-analysis are shown in Figure 2 reporting the single study and the overall weighted mean of AUC and HR with their 95% C.I. The sensitivity analysis showed that no study heavily influenced the result of this meta-analysis. In addition, it was found that the occurrence of publication bias was unlikely. Using the random-effect model, we calculated summary risk estimates in training and validation sets. Overall weighted means of AUCs are 0.90, 95% C.I. 0.80–0.95 and 0.83, 95% C.I. 0.74–0.89 in training and validation sets, respectively, showing good classification performance in discriminating response vs. non-response. On the other hand, an overall HR of OS of 0.51, 95% C.I. 0.43–0.60 was found for the learning models that significantly predict improved survival (Figure 1).
Using the random-effect model, we calculated summary risk estimates in training and validation sets. Overall weighted means of AUCs were 0.87, 95% C.I. 0.80–0.92 and 0.79, 95% C.I. 0.69–0.87 in training and validation cohorts, respectively, showing high classification performance in discriminating responder mCRC patients/treated lesions vs. non-responder mCRC patients/untreated lesions to standard chemotherapy alone or combined with targeted therapy. On the other hand, overall HR of OS of 0.51, 95% C.I. 0.43–0.60 was found for learning models that significantly predict improved survival (Figure 1).

3.9. Predictive Model Performances

Model performances were then analyzed in 13 AI studies that reported AUCs, SE and SP estimates [74,75,79,80,81,86,87,94,95,98,99,101]. Considering the AI studies that have analyzed the model performance using the AUCs, Table 6 shows that both radiomics and molecular signatures were effective in predicting the response to therapy with good or excellent estimates in both the training and the validation sets. Comparable estimates of the global AUCs were also reported by our meta-analysis. Moreover, when we evaluated the AI investigation with tests that reached the 80% SE and the 90% SP, there were the delta radiomics that showed the SE of 99% and the SP of 94% in the training set and the SE of 85% and the SP of 92 in the test set, and the 74 gene signatures that obtained the SE of 97.6% and the SP of 100% in the training set.

4. Discussion

Selection of the optimal first-line treatment represents a crucial step in the therapeutic pathway of mCRC patients, in order to obtain a significant improvement of PFS and, possibly, OS due to the development and combination of cytotoxic and biologic drugs chosen on the basis of the tumor mutational status. However, biological drugs, being directed towards specific “actionable” targets, might cause a heterogeneous tumor response, depending on clinical characteristics and/or disease biology of each patient. Therefore, despite selection of optimal therapy based on the patient’s molecular phenotype, a percentage of patients is not responsive to targeted therapy, leaving us to hypothesize that additional mediators could be involved in the dysregulation of molecular mechanisms, that expression of further potentially actionable genes might intervene and/or that mechanisms of innate/acquired resistance to target inhibitors occur. Even the presence of genetic mutations in tumor RAS or BRAF sequences and MSI status cannot always predict the therapeutic response in mCRC patients [5]. No imaging criteria are also available that could predict the response to therapy before the start of the therapy and or even predict the kind of metastatic lesions that will respond to the treatment [125]. To face these limits, the design of optimal strategy for mCRC on a case by case basis has been proposed, where therapeutic interventions should be modulated depending on patient/tumor’s characteristics. Therefore, it is relevant to characterize CRC complexity at the individual level to understand if a patient will respond to a specific therapy or will show resistance, considering the opportunity to leverage new emerging computer science solutions from research to real-world conditions. In this regard, growing emphasis has been put on clinical decision support systems based on AI, in general, and ML techniques, in particular, to develop predictive models of cancer progression or response to treatment using ML techniques, especially in mCRC. This approach has already proven capable of exploiting significant patterns in routinely collected demographic, clinical and biochemical data and allowed the design of clinical decision support systems (DSS) that can be easily adapted to different tumors [126,127].
Our review summarizes the current literature exploring the use of machine and deep learning methods in predicting the response to cytotoxic and/or targeted treatments in mCRC patients. Table 1 and Table 2 show that most of the studies have benefited from the availability of high-throughput information from radiomics data and different genomic characterization levels, as genome, transcriptome, proteome, and metabolome. All investigations conducted in this field have shown promising outcomes. As reported in Table 2, ML networks were mostly utilized to analyze standard therapy response in mCRC patients, with the exception of the study of Wei et al. [80] that employed a residual network architecture DL model. In five studies [75,79,80,81,87], imaging biomarkers, including delta and fusion radiomic signatures, were used to evaluate the response to therapy. Molecular biomarkers classifiers, ranging from lncRNAs to gene-expression signatures were chosen by the authors of the other ten investigations for developing predictive models that could assist clinicians in decision making processes [82,83,84,92,93,94,95,96,99,101]. Results in Table 2 show that the algorithm most predictive of response to standard chemotherapy in mCRC patients was built using a delta-radiomic signature [75], where a trees structured algorithm reached the highest performance by classifying patients into responders vs. non-responders (AUC = 0.93). Importantly, this ML signature evaluating temporal changes in tumor phenotype was created as intermediate surrogate biomarker to support treatment continuation decisions in clinical yardsticks. Inconclusive findings were instead reported by the investigation of Dercle et al. [87], where a temporal radiomic signature failed to predict the treatment response. Great predictive performances in both training and validation sets were obtained by a DL model based on a fusion radiomic signature [80].
On the other hand, when we have examined the findings of the eleven studies based on molecular biomarkers [82,83,84,92,93,94,95,96,99,101,102], we found that microarray gene expression data were more often chosen by the different authors for building molecular biomarkers based classifiers [82,92,95,101]. Noteworthy, molecular information from immune infiltration, circulating cytokine expression, and pharmacokinetics and cardiovascular parameters were also often employed to fed as inputs learning models aimed to classify mCRC patients in responders vs. non-responders, as well as for evaluating side effects of standard regimens [82,83,84,93,94,96]. Along the molecular biomarker-based learning models, the greatest predictive performances were obtained by both 16-long noncoding RNA (AUC = 0.843) and 18-gene signatures (AUC = 0.877) [94,101]. Even a multiple linear regression model was able to correctly evaluate the response to chemotherapy [83]. However, the prediction of FOLFOXai classifier resulted inconclusively in the FOLFIRI cohort [99]. In addition, results indicate that pharmacokinetics and cardiovascular features could be used to predict different side effects, including cardiotoxicity and leukopenia, caused by standard cytotoxic regimens [93,96].
An improved knowledge of the mechanisms involved in colorectal carcinogenesis and tumor progression has allowed the development of biological agents with targeted actions, as inhibitors of angiogenesis, EGFR-targeted therapy, and immunotherapy [5]. As shown in Table 3, ML models [74,87,88,89,90,91,94,97,99,100,102] were mainly utilized to analyze the response to targeted therapy rather than DL networks [85,86,98]. Findings show that the prognostic and predictive values of radiomics and molecular biomarkers in predicting the benefit of targeted therapy in CRC metastases have been extensively analyzed, with seven radiomic studies demonstrating the utility of diagnostic imaging in predicting clinical outcomes [74,85,86,87,88,91,98] and with seven molecular biomarkers based investigations that developed successful predictive molecular classifiers [89,90,94,97,99,100,102]. Other inputs can come from a recent study of Lindner et al. [128], where several signaling proteins and phosphoproteins associated with EGFR and other relevant cancer signaling pathways when examined across mouse models. In that investigation an ML approach allowed identification of a 14-phospoprotein signature associated with EGFR and other relevant cancer signaling pathways including PDK1, caspase-8, Shc, Stat3, p27, GSK-3β, HER3, PKC-α, EGFR, Akt, S6 ribosomal protein, HER3, NF-κB-p65 and Gab-1, potentially able to discriminate tumors sensitive and insensitive to targeted therapy.
In this context, our meta-analysis demonstrated a good performance power of the learning models used to predict response to chemotherapy alone or combined with targeted therapy by discriminating response vs. non-response. The calculation of overall HR also shows that learning models have strong ability to predict improved survival. Next, when we analyzed performance, both radiomics and molecular signatures showed the capacity to predict the response to therapy with good or excellent values. In particular, we found that both the delta radiomics and the 74 gene signatures were able to discriminate response vs. non-response by correctly classifying up to 99% of mCRC patients as responders and up to 100% of patients as non-responders.
There are several limitations to this study. First, in some cases, data of the same cohort were shared between training and test sets, whereas the use of an external validation cohort, that is critical to ensure model accuracy, was noted only in the half of the publications. In addition, three studies did not split the data [79,92,93]. This can be due to the lack of appropriate cohorts or the lack of awareness of the importance of an external validation cohort before its application in clinical settings. Missing data and outliers were also not appropriately managed by most of the AI studies. Some of them also have small sample sizes that could impact on study outcomes and decrease the statistical power. Then, the size of the validation sets was generally lower than that observed in the training sets, possibly reflecting the low number of mCRC patients that were available overall. Nevertheless, the use of k-fold validation that allows proper stratification of predictive outcomes [94,101] has been performed by the majority of the studies (77%), even at greater rate than in a previous review [72].
AI techniques can provide novel methods for clinicians focusing on predictive modeling for conventional chemotherapy alone or combined with target therapy as compared to previous statistical methods, that were more based on the study of data associations [24,129,130]. AI is indeed becoming popular for developing predictive modeling and defining treatment effects in epidemiology. Improving these approaches in molecular epidemiology will be useful for explaining cancer risk factors, including nature and nurture factors [24,131,132,133,134], in public health. Until now, researchers have benefited from a variety of AI methods for supervised and unsupervised pattern analysis in large big data originating from multiple sources, including serial imaging and molecular biomarkers. The major challenges are data variability, inter- and intra-study, and the lack of large amounts of high-quality labeled data, which both limit the potentiality of ML and DL models [135]. Privacy related issues, challenges in the patient enrollment process and cumbersome bureaucratic procedures do not allow researchers to easily obtain high quantities and/or high quality data.
When resistance to treatments occurs, few treatment options are available for resistant mCRC that may be difficult to tackle through non-cross-resistant anticancer drugs for second and subsequent lines of therapy despite new diagnostic and therapeutic methods [10]. Currently, about 20–25% of mCRC patients with hepatic metastasis are resectable, 60–70% of distal mCRC patients will develop local or distal recurrence, while only 20% will achieve long-term remission [136]. Up to 41% of mCRC patients receiving routine care have a poor performance status, and have more toxic side-effects if they are treated as well as inferior survival outcomes [137]. Furthermore, we cannot stratify mCRC patients or those who are at risk of developing colorectal metastasis at diagnosis using biomarkers or other currently available techniques. Even the presence of genetic mutations in tumor RAS or BRAF sequences and MSI status cannot always predict the therapeutic response in mCRC patients [5]. On the other hand, AI signatures can reach better performance in distinguishing responders from non-responders in respect to routine clinical indicators, such as cancer stage, adjuvant therapies, surgery on primary tumor and RECIST criteria [5,80,83,85,98,100,138]. New classifications of mCRC based on AI parameters will soon emerge and we will be able to identify new subtypes of mCRC patients, for whom the definition will be based on combinations of radiomics and/or molecular signatures contributing to improve the clinical management of mCRC and clinical decision making. AI approaches might provide innovative tools to distinguish between responders and non-responders before treatment starts, to know if a mCRC patient will respond to a specific therapy or will show resistance.
In the future, especially with regard to clinical trials, our findings suggest that mCRC patients would be randomized to treatment based on a predictive model vs. usual treatment. However, the possibility of employing predictive models in choosing the most effective and cost-effective therapeutic intervention for cancer patients relies on the development of improved and innovative AI based computational frameworks. These models require the evaluation of a huge amount of clinical data, with emphasis on factors influencing positive and negative responses, prognostic biomarkers, and molecular predictors of therapeutic response or disease to be appropriately advanced to make such trials ethically correct. A rigorous process must always underlie any AI model to ensure reliable prediction of mCRC patient response to treatments. In this field, we are awaiting the first results of the European Union funded study titled “Targeted therapy for advanced colorectal cancer patients, REVERT” (https://www.revert-project.eu/ (accessed on 30 April 2022)) aimed at strengthening this ambitious goal by addressing the specific challenge of understanding at system level the pathophysiology of mCRC cancer in patients responding well or poorly to therapies, to design optimal strategies for mCRC on a case by case basis, with therapeutic interventions modulated depending on the patient’s features.

5. Conclusions

Our systematic review shows that a good performance in predicting the response to therapy was obtained by the different algorithms that were analyzed by a meta-analytic approach. The calculation of overall HR indicates that learning models have a strong ability to predict improved survival. In addition, the delta-radiomics and the 74 gene signatures were found to able to discriminate between response vs. non-response by correctly identifying up to 99% of mCRC patients who are responders and up to 100% of patients who are non-responders. Our findings support the use of computer science for developing personalized treatment decision processes for mCRC patients. The identification of clinical, pathological and molecular comprehensive markers/signatures predictive of efficacy and toxicity could be useful for validation in prospective clinical trials. Further understanding of CRC biology linked to AI approaches will be able to improve matching mCRC patient individual traits with appropriate therapies to increase their survival and quality of life.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers14164012/s1. Table S1. Study quality of eligible publications, modified Luo scale [71].

Author Contributions

M.P., F.G., L.M. and E.M. conceived and designed the study; V.R., E.L., M.S., A.M., I.L. and S.N. assisted to data management; M.P. and E.M. wrote the manuscript; F.G., E.M. and M.P. provided funding for the study; M.C., M.B. and M.P. performed the analysis; R.D., E.G.C., G.B., M.R., S.R., P.F., F.G., F.C., F.S., M.C., A.G., A.R., A.M., L.M., F.G., E.M., S.N. and I.L. reviewed the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been funded by a grant from the European Project Horizon 2020 SC1-BHC-02-2019 [REVERT, GA n. 848098], a grant of the Tuscany Region (CRCSCREEN) and a grant by the Tuscany region (KNOWPAIN).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Global Burden of Disease Cancer Collaboration. Global, Regional, and National Cancer Incidence, Mortality, Years of Life Lost, Years Lived with Disability, and Disability-Adjusted Life-Years for 29 Cancer Groups, 1990 to 2017: A Systematic Analysis for the Global Burden of Disease Study. JAMA Oncol. 2019, 5, 1749–1768. [Google Scholar] [CrossRef] [PubMed]
  2. Dekker, E.; Tanis, P.J.; Vleugels, J.L.A.; Kasi, P.M.; Wallace, M.B. Colorectal cancer. Lancet 2019, 394, 1467–1480. [Google Scholar] [CrossRef]
  3. Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef] [PubMed]
  4. Biller, L.H.; Schrag, D. Diagnosis and Treatment of Metastatic Colorectal Cancer. JAMA 2021, 325, 669–685. [Google Scholar] [CrossRef]
  5. Xie, Y.-H.; Chen, Y.-X.; Fang, J.-Y. Comprehensive review of targeted therapy for colorectal cancer. Signal Transduct. Target. Ther. 2020, 5, 22. [Google Scholar] [CrossRef]
  6. Benson, A.B.; Venook, A.P.; Al-Hawary, M.M.; Arain, M.A.; Chen, Y.-J.; Ciombor, K.K.; Cohen, S.; Cooper, H.S.; Deming, D.; Farkas, L.; et al. Colon Cancer, Version 2.2021, NCCN Clinical Practice Guidelines in Oncology. J. Natl. Compr. Cancer Netw. 2021, 19, 329–359. [Google Scholar] [CrossRef]
  7. Hurwitz, H.; Fehrenbacher, L.; Novotny, W.; Cartwright, T.; Hainsworth, J.; Heim, W.; Berlin, J.; Baron, A.; Griffing, S.; Holmgren, E.; et al. Bevacizumab plus Irinotecan, Fluorouracil, and Leucovorin for Metastatic Colorectal Cancer. N. Engl. J. Med. 2004, 350, 2335–2342. [Google Scholar] [CrossRef]
  8. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424. [Google Scholar] [CrossRef]
  9. Van Cutsem, E.; Köhne, C.-H.; Hitre, E.; Zaluski, J.; Chien, C.-R.C.; Makhson, A.; D’Haens, G.; Pintér, T.; Lim, R.; Bodoky, G.; et al. Cetuximab and Chemotherapy as Initial Treatment for Metastatic Colorectal Cancer. N. Engl. J. Med. 2009, 360, 1408–1417. [Google Scholar] [CrossRef]
  10. Mullard, A. Stemming the tide of drug resistance in cancer. Nat. Rev. Drug Discov. 2020, 19, 221–223. [Google Scholar] [CrossRef]
  11. Bruera, G.; Ricevuto, E. Toxicity Syndromes, Patient-Related Clinical Indicator of Toxicity Burden Induced by Intensive Triplet Chemotherapy-Based Regimens in Gastrointestinal Cancers with Metastatic Disease. Front. Oncol. 2020, 10, 172. [Google Scholar] [CrossRef] [PubMed]
  12. Fakih, M.G.; Kopetz, S.; Kuboki, Y.; Kim, T.W.; Munster, P.N.; Krauss, J.C.; Falchook, G.S.; Han, S.-W.; Heinemann, V.; Muro, K.; et al. Sotorasib for previously treated colorectal cancers with KRASG12C mutation (CodeBreaK100): A prespecified analysis of a single-arm, phase 2 trial. Lancet Oncol. 2021, 23, 115–124. [Google Scholar] [CrossRef]
  13. Raghav, K.; Loree, J.M.; Morris, J.S.; Overman, M.J.; Yu, R.; Meric-Bernstam, F.; Menter, D.; Korphaisarn, K.; Kee, B.; Muranyi, A.; et al. Validation of HER2 Amplification as a Predictive Biomarker for Anti–Epidermal Growth Factor Receptor Antibody Therapy in Metastatic Colorectal Cancer. JCO Precis. Oncol. 2019, 3, 1–13. [Google Scholar] [CrossRef] [PubMed]
  14. Okamura, R.; Boichard, A.; Kato, S.; Sicklick, J.K.; Bazhenova, L.; Kurzrock, R. Analysis of NTRK Alterations in Pan-Cancer Adult and Pediatric Malignancies: Implications for NTRK-Targeted Therapeutics. JCO Precis. Oncol. 2018, 2, 1–20. [Google Scholar] [CrossRef] [PubMed]
  15. Haibe, Y.; Kreidieh, M.; El Hajj, H.; Khalifeh, I.; Mukherji, D.; Temraz, S.; Shamseddine, A. Resistance Mechanisms to Anti-angiogenic Therapies in Cancer. Front. Oncol. 2020, 10, 221. [Google Scholar] [CrossRef]
  16. Iqbal, M.J.; Javed, Z.; Sadia, H.; Qureshi, I.A.; Irshad, A.; Ahmed, R.; Malik, K.; Raza, S.; Abbas, A.; Pezzani, R.; et al. Clinical applications of artificial intelligence and machine learning in cancer diagnosis: Looking into the future. Cancer Cell Int. 2021, 21, 270. [Google Scholar] [CrossRef]
  17. Wiemken, T.L.; Kelley, R.R. Machine Learning in Epidemiology and Health Outcomes Research. Annu. Rev. Public Health 2020, 41, 21–36. [Google Scholar] [CrossRef]
  18. Kourou, K.; Exarchos, K.P.; Papaloukas, C.; Sakaloglou, P.; Exarchos, T.; Fotiadis, D.I. Applied machine learning in cancer research: A systematic review for patient diagnosis, classification and prognosis. Comput. Struct. Biotechnol. J. 2021, 19, 5546–5555. [Google Scholar] [CrossRef]
  19. Munnia, A.; Giese, R.W.; Polvani, S.; Galli, A.; Cellai, F.; Peluso, M.E. Bulky DNA Adducts, Tobacco Smoking, Genetic Susceptibility, and Lung Cancer Risk. Adv. Clin. Chem. 2017, 81, 231–277. [Google Scholar]
  20. Ceppi, M.; Munnia, A.; Cellai, F.; Bruzzone, M.; Peluso, M.E. Linking the generation of DNA adducts to lung cancer. Toxicology 2017, 390, 160–166. [Google Scholar] [CrossRef]
  21. Brancato, B.; Munnia, A.; Cellai, F.; Ceni, E.; Mello, T.; Bianchi, S.; Catarzi, S.; Risso, G.G.; Galli, A.; Peluso, M.E.M. 8-Oxo-7,8-dihydro-2′-deoxyguanosine and other lesions along the coding strand of the exon 5 of the tumour suppressor gene P53 in a breast cancer case-control study. DNA Res. 2016, 23, 395–402. [Google Scholar] [CrossRef] [PubMed]
  22. Galli, A.; Munnia, A.; Cellai, F.; Tarocchi, M.; Ceni, E.; van Schooten, F.J.; Godschalk, R.; Giese, R.W.; Peluso, M. Ligation-Mediated Polymerase Chain Reaction Detection of 8-Oxo-7,8-Dihydro-2′-Deoxyguanosine and 5-Hydroxycytosine at the Codon 176 of the p53 Gene of Hepatitis C-Associated Hepatocellular Carcinoma Patients. Int. J. Mol. Sci. 2020, 21, 6753. [Google Scholar] [CrossRef] [PubMed]
  23. Carozzi, F.M.; Bisanzi, S.; Carrozzi, L.; Falaschi, F.; Pegna, A.L.; Mascalchi, M.; Picozzi, G.; Peluso, M.; Sani, C.; Greco, L.; et al. Multimodal lung cancer screening using the ITALUNG biomarker panel and low dose computed tomography. Results of the ITALUNG biomarker study. Int. J. Cancer 2017, 141, 94–101. [Google Scholar] [CrossRef]
  24. Agudo, A.; Peluso, M.; Munnia, A.; Luján-Barroso, L.; Sánchez, M.-J.; Molina-Montes, E.; Sánchez-Cantalejo, E.; Navarro, C.; Tormo, M.-J.; Chirlaque, M.-D.; et al. Aromatic DNA Adducts and Risk of Gastrointestinal Cancers: A Case–Cohort Study within the EPIC–Spain. Cancer Epidemiol. Biomark. Prev. 2012, 21, 685–692. [Google Scholar] [CrossRef] [PubMed]
  25. Badic, B.; Tixier, F.; le Rest, C.C.; Hatt, M.; Visvikis, D. Radiogenomics in Colorectal Cancer. Cancers 2021, 13, 973. [Google Scholar] [CrossRef] [PubMed]
  26. Tomaszewski, M.R.; Gillies, R.J. The Biological Meaning of Radiomic Features. Radiology 2021, 298, 505–516. [Google Scholar] [CrossRef]
  27. Sarker, I.H. Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Comput. Sci. 2021, 2, 160. [Google Scholar] [CrossRef]
  28. Hastie, T.; Tibshirani, R.; Friedman, J. Model inference and averaging. In The Elements of Statistical Learning: Data Mining, Inference, Prediction Springer Series in Statistics; Springer: New York, NY, USA, 2009; pp. 261–294. [Google Scholar] [CrossRef]
  29. Lynch, C.; Liston, C. New machine-learning technologies for computer-aided diagnosis. Nat. Med. 2018, 24, 1304–1305. [Google Scholar] [CrossRef]
  30. Parimbelli, E.; Marini, S.; Sacchi, L.; Bellazzi, R. Patient similarity for precision medicine: A systematic review. J. Biomed. Informat. 2018, 83, 87–96. [Google Scholar] [CrossRef]
  31. Tran, K.A.; Kondrashova, O.; Bradley, A.; Williams, E.D.; Pearson, J.V.; Waddell, N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med. 2021, 13, 152. [Google Scholar] [CrossRef]
  32. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
  33. Vodenkova, S.; Buchler, T.; Cervena, K.; Veskrnova, V.; Vodicka, P.; Vymetalkova, V. 5-fluorouracil and other fluoropyrimidines in colorectal cancer: Past, present and future. Pharmacol. Ther. 2019, 206, 107447. [Google Scholar] [CrossRef] [PubMed]
  34. Glimelius, B.; Stintzing, S.; Marshall, J.; Yoshino, T.; de Gramont, A. Metastatic colorectal cancer: Advances in the folate-fluoropyrimidine chemotherapy backbone. Cancer Treat. Rev. 2021, 98, 102218. [Google Scholar] [CrossRef] [PubMed]
  35. André, F.; Ciruelos, E.M.; Juric, D.; Loibl, S.; Campone, M.; Mayer, I.A.; Rubovszky, G.; Yamashita, T.; Kaufman, B.; Lu, Y.-S.; et al. Alpelisib plus fulvestrant for PIK3CA-mutated, hormone receptor-positive, human epidermal growth factor receptor-2–negative advanced breast cancer: Final overall survival results from SOLAR-1. Ann. Oncol. 2021, 32, 208–217. [Google Scholar] [CrossRef]
  36. Mahoney, K.M.; Rennert, P.D.; Freeman, G.J. Combination cancer immunotherapy and new immunomodulatory targets. Nat. Rev. Drug Discov. 2015, 14, 561–584. [Google Scholar] [CrossRef]
  37. Dai, Y.; Zhao, W.; Yue, L.; Dai, X.; Rong, D.; Wu, F.; Gu, J.; Qian, X. Perspectives on Immunotherapy of Metastatic Colorectal Cancer. Front. Oncol. 2021, 11, 659964. [Google Scholar] [CrossRef]
  38. Amado, R.G.; Wolf, M.; Peeters, M.; van Cutsem, E.; Siena, S.; Freeman, D.J.; Juan, T.; Sikorski, R.; Suggs, S.; Radinsky, R.; et al. Wild-Type KRAS Is Required for Panitumumab Efficacy in Patients with Metastatic Colorectal Cancer. J. Clin. Oncol. 2008, 26, 1626–1634. [Google Scholar] [CrossRef]
  39. Bylsma, L.C.; Gillezeau, C.; Garawin, T.A.; Kelsh, M.A.; Fryzek, J.P.; Sangaré, L.; Lowe, K.A. Prevalence of RAS and BRAF mutations in metastatic colorectal cancer patients by tumor sidedness: A systematic review and meta-analysis. Cancer Med. 2019, 9, 1044–1057. [Google Scholar] [CrossRef]
  40. De Roock, W.; Claes, B.; Bernasconi, D.; de Schutter, J.; Biesmans, B.; Fountzilas, G.; Kalogeras, K.T.; Kotoula, V.; Papamichael, D.; Laurent-Puig, P.; et al. Effects of KRAS, BRAF, NRAS, and PIK3CA mutations on the efficacy of cetuximab plus chemotherapy in chemotherapy-refractory metastatic colorectal cancer: A retrospective consortium analysis. Lancet Oncol. 2010, 11, 753–762. [Google Scholar] [CrossRef]
  41. De Roock, W.; de Vriendt, V.; Normanno, N.; Ciardiello, F.; Tejpar, S. KRAS, BRAF, PIK3CA, and PTEN mutations: Implications for targeted therapies in metastatic colorectal cancer. Lancet Oncol. 2010, 12, 594–603. [Google Scholar] [CrossRef]
  42. Tran, N.H.; Cavalcante, L.L.; Lubner, S.J.; Mulkerin, D.L.; LoConte, N.K.; Clipson, L.; Matkowskyj, K.A.; Deming, D.A. Precision medicine in colorectal cancer: The molecular profile alters treatment strategies. Ther. Adv. Med. Oncol. 2015, 7, 252–262. [Google Scholar] [CrossRef] [PubMed]
  43. Meric-Bernstam, F.; Hurwitz, H.; Raghav, K.P.S.; McWilliams, R.R.; Fakih, M.; VanderWalde, A.; Swanton, C.; Kurzrock, R.; Burris, H.; Sweeney, C.; et al. Pertuzumab plus trastuzumab for HER2-amplified metastatic colorectal cancer (MyPathway): An updated report from a multicentre, open-label, phase 2a, multiple basket study. Lancet Oncol. 2019, 20, 518–530. [Google Scholar] [CrossRef]
  44. Morelli, C.; Formica, V.; Riondino, S.; Russo, A.; Ferroni, P.; Guadagni, F.; Roselli, M. Irinotecan or Oxaliplatin: Which is the First Move for the Mate? Curr. Med. Chem. 2021, 28, 3158–3172. [Google Scholar] [CrossRef]
  45. Dienstmann, R.; Vermeulen, L.; Guinney, J.; Kopetz, S.; Tejpar, S.; Tabernero, J. Consensus molecular subtypes and the evolution of precision medicine in colorectal cancer. Nat. Rev. Cancer 2017, 17, 79–92, Erratum in Nat. Rev. Cancer 2017, 17, 268. [Google Scholar] [CrossRef] [PubMed]
  46. Le, D.T.; Durham, J.N.; Smith, K.N.; Wang, H.; Bartlett, B.R.; Aulakh, L.K.; Lu, S.; Kemberling, H.; Wilt, C.; Luber, B.S.; et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science 2017, 357, 409–413. [Google Scholar] [CrossRef]
  47. Le, D.T.; Kim, T.W.; van Cutsem, E.; Geva, R.; Jäger, D.; Hara, H.; Burge, M.; O’Neil, B.; Kavan, P.; Yoshino, T.; et al. Phase II Open-Label Study of Pembrolizumab in Treatment-Refractory, Microsatellite Instability–High/Mismatch Repair–Deficient Metastatic Colorectal Cancer: KEYNOTE-164. J. Clin. Oncol. 2020, 38, 11–19. [Google Scholar] [CrossRef]
  48. Overman, M.J.; McDermott, R.; Leach, J.L.; Lonardi, S.; Lenz, H.-J.; Morse, M.A.; Desai, J.; Hill, A.; Axelson, M.; Moss, R.A.; et al. Nivolumab in patients with metastatic DNA mismatch repair-deficient or microsatellite instability-high colorectal cancer (CheckMate 142): An open-label, multicentre, phase 2 study. Lancet Oncol. 2017, 18, 1182–1191. [Google Scholar] [CrossRef]
  49. Overman, M.J.; Lonardi, S.; Wong, K.Y.M.; Lenz, H.-J.; Gelsomino, F.; Aglietta, M.; Morse, M.A.; van Cutsem, E.; McDermott, R.; Hill, A.; et al. Durable Clinical Benefit with Nivolumab Plus Ipilimumab in DNA Mismatch Repair–Deficient/Microsatellite Instability–High Metastatic Colorectal Cancer. J. Clin. Oncol. 2018, 36, 773–779. [Google Scholar] [CrossRef]
  50. André, T.; Shiu, K.-K.; Kim, T.W.; Jensen, B.V.; Jensen, L.H.; Punt, C.; Smith, D.; Garcia-Carbonero, R.; Benavides, M.; Gibbs, P.; et al. Pembrolizumab in Microsatellite-Instability–High Advanced Colorectal Cancer. N. Engl. J. Med. 2020, 383, 2207–2218. [Google Scholar] [CrossRef]
  51. A Diaz, L.; Shiu, K.-K.; Kim, T.-W.; Jensen, B.V.; Jensen, L.H.; Punt, C.; Smith, D.; Garcia-Carbonero, R.; Benavides, M.; Gibbs, P.; et al. Pembrolizumab versus chemotherapy for microsatellite instability-high or mismatch repair-deficient metastatic colorectal cancer (KEYNOTE-177): Final analysis of a randomised, open-label, phase 3 study. Lancet Oncol. 2022, 23, 659–670. [Google Scholar] [CrossRef]
  52. Andre, T.; Amonkar, M.; Norquist, J.M.; Shiu, K.-K.; Kim, T.W.; Jensen, B.V.; Jensen, L.H.; A Punt, C.J.; Smith, D.; Garcia-Carbonero, R.; et al. Health-related quality of life in patients with microsatellite instability-high or mismatch repair deficient metastatic colorectal cancer treated with first-line pembrolizumab versus chemotherapy (KEYNOTE-177): An open-label, randomised, phase 3 trial. Lancet Oncol. 2021, 22, 665–677. [Google Scholar] [CrossRef]
  53. Lenz, H.-J.; van Cutsem, E.; Limon, M.L.; Wong, K.Y.M.; Hendlisz, A.; Aglietta, M.; García-Alfonso, P.; Neyns, B.; Luppi, G.; Cardin, D.B.; et al. First-Line Nivolumab Plus Low-Dose Ipilimumab for Microsatellite Instability-High/Mismatch Repair-Deficient Metastatic Colorectal Cancer: The Phase II CheckMate 142 Study. J. Clin. Oncol. 2022, 40, 161–170. [Google Scholar] [CrossRef] [PubMed]
  54. Morris, V.K.; Parseghian, C.M.; Escano, M.; Johnson, B.; Raghav, K.P.S.; Dasari, A.; Huey, R.; Overman, M.J.; Willis, J.; Lee, M.S.; et al. Phase I/II trial of encorafenib, cetuximab, and nivolumab in patients with microsatellite stable, BRAFV600E metastatic colorectal cancer. J. Clin. Oncol. 2022, 40, 12. [Google Scholar] [CrossRef]
  55. Tabernero, J.; Grothey, A.; van Cutsem, E.; Yaeger, R.; Wasan, H.; Yoshino, T.; Desai, J.; Ciardiello, F.; Loupakis, F.; Hong, Y.S.; et al. Encorafenib Plus Cetuximab as a New Standard of Care for Previously Treated BRAF V600E–Mutant Metastatic Colorectal Cancer: Updated Survival Results and Subgroup Analyses from the BEACON Study. J. Clin. Oncol. 2021, 39, 273–284. [Google Scholar] [CrossRef]
  56. Grassadonia, A.; di Marino, P.; Ficorella, C.; Cortellini, A.; Cannita, K.; Parisi, A.; Gamucci, T.; Zoratto, F.; Vici, P.; Barba, M.; et al. Impact of primary tumor location in patients with RAS wild-type metastatic colon cancer treated with first-line chemotherapy plus anti-EGFR or anti-VEGF monoclonal antibodies: A retrospective multicenter study. J. Cancer 2019, 10, 5926–5934. [Google Scholar] [CrossRef]
  57. Tejpar, S.; Stintzing, S.; Ciardiello, F.; Tabernero, J.; van Cutsem, E.; Beier, F.; Esser, R.; Lenz, H.-J.; Heinemann, V. Prognostic and Predictive Relevance of Primary Tumor Location in Patients with RAS Wild-Type Metastatic Colorectal Cancer: Retrospective Analyses of the CRYSTAL and FIRE-3 Trials. JAMA Oncol. 2017, 3, 194–201. [Google Scholar] [CrossRef]
  58. Bahl, A.; Talwar, V.; Sirohi, B.; Mehta, P.; Arya, D.; Shrivastava, G.; Dahiya, A.; Pavithran, K. Primary Tumor Location as a Prognostic and Predictive Marker in Metastatic Colorectal Cancer (mCRC). Front. Oncol. 2020, 10, 964. [Google Scholar] [CrossRef]
  59. Baran, B.; Ozupek, N.M.; Tetik, N.Y.; Acar, E.; Bekcioglu, O.; Baskin, Y. Difference Between Left-Sided and Right-Sided Colorectal Cancer: A Focused Review of Literature. Gastroenterol. Res. 2018, 11, 264–273. [Google Scholar] [CrossRef]
  60. Johnston, P.G.; Fisher, E.R.; Rockette, H.E.; Fisher, B.; Wolmark, N.; Drake, J.C.; Chabner, B.A.; Allegra, C.J. The role of thymidylate synthase expression in prognosis and outcome of adjuvant chemotherapy in patients with rectal cancer. J. Clin. Oncol. 1994, 12, 2640–2647. [Google Scholar] [CrossRef]
  61. Salonga, D.; Danenberg, K.D.; Johnson, M.; Metzger, R.; Groshen, S.; Tsao-Wei, D.D.; Lenz, H.-J.; Leichman, C.G.; Leichman, L.; Diasio, R.B.; et al. Colorectal Tumors Responding to 5-Fluorouracil Have Low Gene Expression Levels of Dihydropyrimidine Dehydrogenase, Thymidylate Synthase, and Thymidine Phosphorylase1. Clin. Cancer Res. 2000, 6, 1322–1327. [Google Scholar]
  62. Popat, S.; Matakidou, A.; Houlston, R.S. Thymidylate Synthase Expression and Prognosis in Colorectal Cancer: A Systematic Review and Meta-Analysis. J. Clin. Oncol. 2004, 22, 529–536. [Google Scholar] [CrossRef] [PubMed]
  63. Morganti, M.; Ciantelli, M.; Giglioni, B.; Putignano, A.L.; Nobili, S.; Papi, L.; Landini, I.; Napoli, C.; Valanzano, R.; Cianchi, F.; et al. Relationships between promoter polymorphisms in the thymidylate synthase gene and mRNA levels in colorectal cancers. Eur. J. Cancer 2005, 41, 2176–2183. [Google Scholar] [CrossRef] [PubMed]
  64. Vignoli, M.; Nobili, S.; Napoli, C.; Putignano, A.L.; Morganti, M.; Papi, L.; Valanzano, R.; Cianchi, F.; Tonelli, F.; Mazzei, T.; et al. Thymidylate synthase expression and genotype have no major impact on the clinical outcome of colorectal cancer patients treated with 5-fluorouracil. Pharmacol. Res. 2011, 64, 242–248. [Google Scholar] [CrossRef] [PubMed]
  65. Nobili, S.; Napoli, C.; Landini, I.; Morganti, M.; Cianchi, F.; Valanzano, R.; Tonelli, F.; Cortesini, C.; Mazzei, T.; Mini, E. Identification of potential pharmacogenomic markers of clinical efficacy of 5-fluorouracil in colorectal cancer. Int. J. Cancer 2010, 128, 1935–1945. [Google Scholar] [CrossRef]
  66. Mini, E.; Lapucci, A.; Perrone, G.; D’Aurizio, R.; Napoli, C.; Brugia, M.; Landini, I.; Tassi, R.; Picariello, L.; Simi, L.; et al. RNA sequencing reveals PNN and KCNQ1OT1 as predictive biomarkers of clinical outcome in stage III colorectal cancer patients treated with adjuvant chemotherapy. Int. J. Cancer 2019, 145, 2580–2593. [Google Scholar] [CrossRef]
  67. Lapucci, A.; Perrone, G.; di Paolo, A.; Napoli, C.; Landini, I.; Roviello, G.; Calosi, L.; Naccarato, A.G.; Falcone, A.; Bani, D.; et al. PNN and KCNQ1OT1 Can Predict the Efficacy of Adjuvant Fluoropyrimidine-Based Chemotherapy in Colorectal Cancer Patients. Oncol. Res. Featur. Preclin. Clin. Cancer Ther. 2020, 28, 631–644. [Google Scholar] [CrossRef]
  68. Guinney, J.; Dienstmann, R.; Wang, X.; de Reyniès, A.; Schlicker, A.; Soneson, C.; Marisa, L.; Roepman, P.; Nyamundanda, G.; Angelino, P.; et al. The consensus molecular subtypes of colorectal cancer. Nat. Med. 2015, 21, 1350–1356. [Google Scholar] [CrossRef]
  69. Isella, C.; Brundu, F.G.; Bellomo, S.E.; Galimi, F.; Zanella, E.R.; Porporato, R.; Petti, C.; Fiori, A.; Orzan, F.N.; Senetta, R.; et al. Selective analysis of cancer-cell intrinsic transcriptional traits defines novel clinically relevant subtypes of colorectal cancer. Nat. Commun. 2017, 8, 15107. [Google Scholar] [CrossRef]
  70. Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. Int. J. Surg. 2010, 8, 336–341. [Google Scholar] [CrossRef]
  71. Luo, W.; Phung, Q.-D.; Tran, T.; Gupta, S.; Rana, S.; Karmakar, C.; Shilton, A.; Yearwood, J.L.; Dimitrova, N.; Ho, T.B.; et al. Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View. J. Med. Internet Res. 2016, 18, e323. [Google Scholar] [CrossRef]
  72. Brnabic, A.; Hess, L.M. Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making. BMC Med. Informat. Decis. Mak. 2021, 21, 54. [Google Scholar] [CrossRef] [PubMed]
  73. Mandrekar, J.N. Receiver Operating Characteristic Curve in Diagnostic Test Assessment. J. Thorac. Oncol. 2010, 5, 1315–1316. [Google Scholar] [CrossRef] [PubMed]
  74. Giannini, V.; Rosati, S.; DeFeudis, A.; Balestra, G.; Vassallo, L.; Cappello, G.; Mazzetti, S.; de Mattia, C.; Rizzetto, F.; Torresin, A.; et al. Radiomics predicts response of individual HER2 -amplified colorectal cancer liver metastases in patients treated with HER2 -targeted therapy. Int. J. Cancer 2020, 147, 3215–3223. [Google Scholar] [CrossRef] [PubMed]
  75. Giannini, V.; Pusceddu, L.; Defeudis, A.; Nicoletti, G.; Cappello, G.; Mazzetti, S.; Sartore-Bianchi, A.; Siena, S.; Vanzulli, A.; Rizzetto, F.; et al. Delta-Radiomics Predicts Response to First-Line Oxaliplatin-Based Chemotherapy in Colorectal Cancer Patients with Liver Metastases. Cancers 2022, 14, 241. [Google Scholar] [CrossRef] [PubMed]
  76. Peluso, M.E.; Munnia, A.; Ceppi, M. Bisphenol-A exposures and behavioural aberrations: Median and linear spline and meta-regression analyses of 12 toxicity studies in rodents. Toxicology 2014, 325, 200–208. [Google Scholar] [CrossRef] [PubMed]
  77. DerSimonian, R.; Laird, N. Meta-analysis in clinical trials. Control Clin. Trials 1986, 7, 177–188. [Google Scholar] [CrossRef]
  78. Peluso, M.; Ceppi, M.; Munnia, A.; Puntoni, R.; Parodi, S. Analysis of 13 (32)P-DNA postlabeling studies on occupational cohorts exposed to air pollution. Am. J. Epidemiol. 2001, 153, 546–558. [Google Scholar] [CrossRef]
  79. Nakanishi, R.; Oki, E.; Hasuda, H.; Sano, E.; Miyashita, Y.; Sakai, A.; Koga, N.; Kuriyama, N.; Nonaka, K.; Fujimoto, Y.; et al. ASO Author Reflection: Radiomics-Based Prediction for the Responder to First-Line Oxaliplatin-Based Chemotherapy in Patients with Colorectal Liver Metastasis. Ann. Surg. Oncol. 2021, 28, 2986–2987. [Google Scholar] [CrossRef]
  80. Wei, J.; Cheng, J.; Gu, D.; Chai, F.; Hong, N.; Wang, Y.; Tian, J. Deep learning-based radiomics predicts response to chemotherapy in colorectal liver metastases. Med. Phys. 2020, 48, 513–522. [Google Scholar] [CrossRef]
  81. Defeudis, A.; Cefaloni, L.; Giannetto, G.; Cappello, G.; Rizzetto, F.; Panic, J.; Barra, D.; Nicoletti, G.; Mazzetti, S.; Vanzulli, A.; et al. Comparison of radiomics approaches to predict resistance to 1st line chemotherapy in liver metastatic colorectal cancer. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Virtual, 1–5 November 2021; pp. 3305–3308. [Google Scholar]
  82. Tsuji, S.; Midorikawa, Y.; Takahashi, T.; Yagi, K.; Takayama, T.; Yoshida, K.; Sugiyama, Y.; Aburatani, H. Potential responders to FOLFOX therapy for colorectal cancer by Random Forests analysis. Br. J. Cancer 2011, 106, 126–132. [Google Scholar] [CrossRef]
  83. Chen, Z.-Y.; He, W.-Z.; Peng, L.-X.; Jia, W.-H.; Guo, R.-P.; Xia, L.-P.; Qian, C.-N. A prognostic classifier consisting of 17 circulating cytokines is a novel predictor of overall survival for metastatic colorectal cancer patients. Int. J. Cancer 2014, 136, 584–592. [Google Scholar] [CrossRef] [PubMed]
  84. Yuan, Y.; Shen, H.; Li, M.-D.; Tan, C.-W.; Yu, J.-K.; Fang, X.-F.; Zheng, S. Identification of the biomarkers for the prediction of efficacy in the first-line chemotherapy of metastatic colorectal cancer patients using SELDI-TOF-MS and artificial neural networks. J. Clin. Oncol. 2012, 30, e14026. [Google Scholar] [CrossRef]
  85. Lu, L.; Dercle, L.; Zhao, B.; Schwartz, L.H. Deep learning for the prediction of early on-treatment response in metastatic colorectal cancer from serial medical imaging. Nat. Commun. 2021, 12, 6654. [Google Scholar] [CrossRef]
  86. Maaref, A.; Romero, F.P.; Montagnon, E.; Cerny, M.; Nguyen, B.; Vandenbroucke, F.; Soucy, G.; Turcotte, S.; Tang, A.; Kadoury, S. Predicting the Response to FOLFOX-Based Chemotherapy Regimen from Untreated Liver Metastases on Baseline CT: A Deep Neural Network Approach. J. Digit. Imaging 2020, 33, 937–945. [Google Scholar] [CrossRef]
  87. Dercle, L.; Lu, L.; Schwartz, L.H.; Qian, M.; Tejpar, S.; Eggleton, P.; Zhao, B.; Piessevaux, H. Radiomics Response Signature for Identification of Metastatic Colorectal Cancer Sensitive to Therapies Targeting EGFR Pathway. J. Natl. Cancer Inst. 2020, 112, 902–912. [Google Scholar] [CrossRef]
  88. Vera-Yunca, D.; Girard, P.; Guillen, Z.P.P.; Munafo, A.; Trocóniz, I.F.; Terranova, N. Machine Learning Analysis of Individual Tumor Lesions in Four Metastatic Colorectal Cancer Clinical Studies: Linking Tumor Heterogeneity to Overall Survival. AAPS J. 2020, 22, 58. [Google Scholar] [CrossRef]
  89. Barat, A.; Smeets, D.; Moran, B.; Zhang, W.; Cao, S.; Das, S.; Klinger, R.; Betge, J.; Murphy, V.; Bacon, O.; et al. Combination of variations in inflammation- and endoplasmic reticulum-associated genes as putative biomarker for bevacizumab response in KRAS wild-type colorectal cancer. Sci. Rep. 2020, 10, 9778. [Google Scholar] [CrossRef]
  90. Ubels, J.; Schaefers, T.; Punt, C.; Guchelaar, H.-J.; de Ridder, J. RAINFOREST: A random forest approach to predict treatment benefit in data from (failed) clinical drug trials. Bioinformatics 2020, 36, i601–i609. [Google Scholar] [CrossRef]
  91. Granata, V.; Fusco, R.; Avallone, A.; de Stefano, A.; Ottaiano, A.; Sbordone, C.; Brunese, L.; Izzo, F.; Petrillo, A. Radiomics-Derived Data by Contrast Enhanced Magnetic Resonance in RAS Mutations Detection in Colorectal Liver Metastases. Cancers 2021, 13, 453. [Google Scholar] [CrossRef]
  92. Del Rio, M.; Molina, F.; Bascoul-Mollevi, C.; Copois, V.; Bibeau, F.; Chalbos, P.; Bareil, C.; Kramar, A.; Salvetat, N.; Fraslon, C.; et al. Gene Expression Signature in Advanced Colorectal Cancer Patients Select Drugs and Response for the Use of Leucovorin, Fluorouracil, and Irinotecan. J. Clin. Oncol. 2007, 25, 773–780. [Google Scholar] [CrossRef]
  93. Oyaga-Iriarte, E.; Insausti, A.; Sayar, O.; Aldaz, A. Prediction of irinotecan toxicity in metastatic colorectal cancer patients based on machine learning models with pharmacokinetic parameters. J. Pharmacol. Sci. 2019, 140, 20–25. [Google Scholar] [CrossRef] [PubMed]
  94. Liu, Z.; Liu, L.; Weng, S.; Guo, C.; Dang, Q.; Xu, H.; Wang, L.; Lu, T.; Zhang, Y.; Sun, Z.; et al. Machine learning-based integration develops an immune-derived lncRNA signature for improving outcomes in colorectal cancer. Nat. Commun. 2022, 13, 816. [Google Scholar] [CrossRef] [PubMed]
  95. Tian, S.; Wang, F.; Lu, S.; Chen, G. Identification of Two Subgroups of FOLFOX Resistance Patterns and Prediction of FOLFOX Response in Colorectal Cancer Patients. Cancer Investig. 2020, 39, 62–72. [Google Scholar] [CrossRef]
  96. Li, C.; Chen, L.; Chou, C.; Ngorsuraches, S.; Qian, J. Using Machine Learning Approaches to Predict Short-Term Risk of Cardiotoxicity Among Patients with Colorectal Cancer After Starting Fluoropyrimidine-Based Chemotherapy. Cardiovasc. Toxicol. 2021, 22, 130–140. [Google Scholar] [CrossRef]
  97. Williams, C.; Seligmann, J.F.; Guetter, C.; Zhang, L.; Yan, D.; Muranyi, A.; Bai, I.; Singh, S.; Elliott, F.; Shires, M.; et al. Artificial intelligence-assisted immunohistochemical (IHC) evaluation of tumor amphiregulin (AREG) and epiregulin (EREG) expression as a combined predictive biomarker for panitumumab (Pan) therapy benefit in RAS wild-type (wt) metastatic colorectal cancer (mCRC): Analysis within the phase III PICCOLO trial. J. Clin. Oncol. 2021, 39, 111. [Google Scholar] [CrossRef]
  98. Zhu, H.; Xu, D.; Ye, M.; Sun, L.; Zhang, X.; Mm, X.L.; Nie, P.; Xing, B.; Sun, Y. Deep learning-assisted magnetic resonance imaging prediction of tumor response to chemotherapy in patients with colorectal liver metastases. Int. J. Cancer 2020, 148, 1717–1730. [Google Scholar] [CrossRef]
  99. Abraham, J.P.; Magee, D.; Cremolini, C.; Antoniotti, C.; Halbert, D.D.; Xiu, J.; Stafford, P.; Berry, D.A.; Oberley, M.J.; Shields, A.F.; et al. Clinical Validation of a Machine-learning–derived Signature Predictive of Outcomes from First-line Oxaliplatin-based Chemotherapy in Advanced Colorectal Cancer. Clin. Cancer Res. 2021, 27, 1174–1183. [Google Scholar] [CrossRef]
  100. Johnson, H.; El-Schich, Z.; Ali, A.; Zhang, X.; Simoulis, A.; Wingren, A.G.; Persson, J.L. Gene-Mutation-Based Algorithm for Prediction of Treatment Response in Colorectal Cancer Patients. Cancers 2022, 14, 2045. [Google Scholar] [CrossRef]
  101. Lu, W.; Fu, D.; Kong, X.; Huang, Z.; Hwang, M.; Zhu, Y.; Chen, L.; Jiang, K.; Li, X.; Wu, Y.; et al. FOLFOX treatment response prediction in metastatic or recurrent colorectal cancer patients via machine learning algorithms. Cancer Med. 2020, 9, 1419–1429. [Google Scholar] [CrossRef]
  102. Naseem, M.; Cao, S.; Yang, D.; Millstein, J.; Puccini, A.; Loupakis, F.; Stintzing, S.; Cremolini, C.; Tokunaga, R.; Battaglin, F.; et al. Random survival forests identify pathways with polymorphisms predictive of survival in KRAS mutant and KRAS wild-type metastatic colorectal cancer patients. Sci. Rep. 2021, 11, 12191. [Google Scholar] [CrossRef]
  103. He, Q.; Long, J.; Yin, Y.; Li, Y.; Lei, X.; Li, Z.; Zhu, W. Emerging Roles of lncRNAs in the Formation and Progression of Colorectal Cancer. Front. Oncol. 2020, 9, 1542. [Google Scholar] [CrossRef] [PubMed]
  104. De Bony, E.J.; Bizet, M.; van Grembergen, O.; Hassabi, B.; Calonne, E.; Putmans, P.; Bontempi, G.; Fuks, F. Comprehensive identification of long noncoding RNAs in colorectal cancer. Oncotarget 2018, 9, 27605–27629. [Google Scholar] [CrossRef] [PubMed]
  105. Jensen, N.F.; Stenvang, J.; Beck, M.K.; Hanáková, B.; Belling, K.C.; Do, K.N.; Viuff, B.; Nygård, S.B.; Gupta, R.; Rasmussen, M.H.; et al. Establishment and characterization of models of chemotherapy resistance in colorectal cancer: Towards a predictive signature of chemoresistance. Mol. Oncol. 2015, 9, 1169–1185. [Google Scholar] [CrossRef]
  106. Liu, C.; Chu, D.; Kalantar-Zadeh, K.; George, J.; Young, H.A.; Liu, G. Cytokines: From Clinical Significance to Quantification. Adv. Sci. 2021, 8, 2004433. [Google Scholar] [CrossRef] [PubMed]
  107. Sato, N.; Sakai, N.; Furukawa, K.; Takayashiki, T.; Kuboki, S.; Takano, S.; Ohira, G.; Miyauchi, H.; Matsubara, H.; Ohtsuka, M. Tumor-suppressive role of Smad ubiquitination regulatory factor 2 in patients with colorectal cancer. Sci. Rep. 2022, 12, 5495. [Google Scholar] [CrossRef]
  108. Li, J.; Galbo, P.M., Jr.; Gong, W.; Storey, A.J.; Tsai, Y.-H.; Yu, X.; Ahn, J.H.; Guo, Y.; Mackintosh, S.G.; Edmondson, R.D.; et al. ZMYND11-MBTD1 induces leukemogenesis through hijacking NuA4/TIP60 acetyltransferase complex and a PWWP-mediated chromatin association mechanism. Nat. Commun. 2021, 12, 1045. [Google Scholar] [CrossRef]
  109. Zhang, J.; Jiang, X.; Yin, J.; Dou, S.; Xie, X.; Liu, T.; Wang, Y.; Wang, S.; Zhou, X.; Zhang, D.; et al. RNF141 interacts with KRAS to promote colorectal cancer progression. Oncogene 2021, 40, 5829–5842. [Google Scholar] [CrossRef]
  110. Agrawal, N.; A Brown, M. Genetic associations and functional characterization of M1 aminopeptidases and immune-mediated diseases. Genes Immun. 2014, 15, 521–527. [Google Scholar] [CrossRef]
  111. Lee, J.H.; Kim, M.S.; Yoo, N.J.; Lee, S.H. BPTF, a chromatin remodeling-related gene, exhibits frameshift mutations in gastric and colorectal cancers. APMIS 2016, 124, 425–427. [Google Scholar] [CrossRef]
  112. Feng, Y.; Feng, L.; Yu, D.; Zou, J.; Huang, Z. srGAP1 mediates the migration inhibition effect of Slit2-Robo1 in colorectal cancer. J. Exp. Clin. Cancer Res. 2016, 35, 191. [Google Scholar] [CrossRef]
  113. Chen, Y.-C.; Navarrete, M.S.; Wang, Y.; McClintock, N.C.; Sakurai, R.; Wang, F.; Chen, K.T.; Chou, T.-F.; Rehan, V.K.; Lee, D.J.; et al. N-myristoyltransferase-1 is necessary for lysosomal degradation and mTORC1 activation in cancer cells. Sci. Rep. 2020, 10, 11952. [Google Scholar] [CrossRef] [PubMed]
  114. Li, M.; Zhuang, J.; Kang, D.; Chen, Y.; Song, W. Identification of circRNA circ-CSPP1 as a potent driver of colorectal cancer by directly targeting the miR-431/LASP1 axis. Open Life Sci. 2021, 16, 523–536. [Google Scholar] [CrossRef] [PubMed]
  115. Tabernero, J.; van Cutsem, E.; Lakomý, R.; Prausová, J.; Ruff, P.; van Hazel, G.A.; Moiseyenko, V.M.; Ferry, D.R.; McKendrick, J.J.; Soussan-Lazard, K.; et al. Aflibercept versus placebo in combination with fluorouracil, leucovorin and irinotecan in the treatment of previously treated metastatic colorectal cancer: Prespecified subgroup analyses from the VELOUR trial. Eur. J. Cancer 2013, 50, 320–331. [Google Scholar] [CrossRef] [PubMed]
  116. Avallone, A.; Nasti, G.; Rosati, G.; Carlomagno, C.; Romano, C.; Bilancia, D.; de Stefano, A.; Ottaiano, A.; Cassata, A.; Bianco, F.; et al. Optimization of the combination of bevacizumab with FOLFOX/OXXEL in patients with metastatic colorectal cancer (mCRC): The multicentre, randomized phase 3 study OBELICS. Ann. Oncol. 2017, 28, vi5. [Google Scholar] [CrossRef]
  117. Cheng, A.-L.; Cornelio, G.; Shen, L.; Price, T.; Yang, T.-S.; Chung, I.J.; Dai, G.-H.; Lin, J.-K.; Sharma, A.; Yeh, K.-H.; et al. Efficacy, Tolerability, and Biomarker Analyses of Once-Every-2-Weeks Cetuximab Plus First-Line FOLFOX or FOLFIRI in Patients with KRAS or All RAS Wild-Type Metastatic Colorectal Cancer: The Phase 2 APEC Study. Clin. Color. Cancer 2016, 16, e73–e88. [Google Scholar] [CrossRef]
  118. Tabernero, J.; Ciardiello, F.; Rivera, F.; Rodriguez-Braun, E.; Ramos, F.J.; Martinelli, E.; Vega-Villegas, M.E.; Roselló, S.; Liebscher, S.; Kisker, O.; et al. Cetuximab administered once every second week to patients with metastatic colorectal cancer: A two-part pharmacokinetic/pharmacodynamic phase I dose-escalation study. Ann. Oncol. 2009, 21, 1537–1545. [Google Scholar] [CrossRef]
  119. Bokemeyer, C.; Bondarenko, I.; Makhson, A.; Hartmann, J.T.; Aparicio, J.; de Braud, F.; Donea, S.; Ludwig, H.; Schuch, G.; Stroh, C.; et al. Fluorouracil, Leucovorin, and Oxaliplatin with and Without Cetuximab in the First-Line Treatment of Metastatic Colorectal Cancer. J. Clin. Oncol. 2009, 27, 663–671. [Google Scholar] [CrossRef]
  120. Siravegna, G.; Lazzari, L.; Crisafulli, G.; Sartore-Bianchi, A.; Mussolin, B.; Cassingena, A.; Martino, C.; Lanman, R.B.; Nagy, R.J.; Fairclough, S.; et al. Radiologic and Genomic Evolution of Individual Metastases during HER2 Blockade in Colorectal Cancer. Cancer Cell 2018, 34, 148–162.e7. [Google Scholar] [CrossRef]
  121. Smeets, D.; Miller, I.S.; O’Connor, D.P.; Das, S.; Moran, B.; Boeckx, B.; Gaiser, T.; Betge, J.; Barat, A.; Klinger, R.; et al. Copy number load predicts outcome of metastatic colorectal cancer patients receiving bevacizumab combination therapy. Nat. Commun. 2018, 9, 4112. [Google Scholar] [CrossRef]
  122. Seymour, M.T.; Brown, S.R.; Middleton, G.; Maughan, T.; Richman, S.; Gwyther, S.; Lowe, C.; Seligmann, J.F.; Wadsley, J.; Maisey, N.; et al. Panitumumab and irinotecan versus irinotecan alone for patients with KRAS wild-type, fluorouracil-resistant advanced colorectal cancer (PICCOLO): A prospectively stratified randomised trial. Lancet Oncol. 2013, 14, 749–759. [Google Scholar] [CrossRef]
  123. Tol, J.; Koopman, M.; Cats, A.; Rodenburg, C.J.; Creemers, G.J.; Schrama, J.G.; Erdkamp, F.L.; Vos, A.H.; van Groeningen, C.J.; Sinnige, H.A.; et al. Chemotherapy, Bevacizumab, and Cetuximab in Metastatic Colorectal Cancer. N. Engl. J. Med. 2009, 360, 563–572. [Google Scholar] [CrossRef]
  124. Mondaca, S.; Walch, H.; Nandakumar, S.; Chatila, W.K.; Schultz, N.; Yaeger, R. Specific Mutations in APC, but Not Alterations in DNA Damage Response, Associate with Outcomes of Patients with Metastatic Colorectal Cancer. Gastroenterology 2020, 159, 1975–1978.e4. [Google Scholar] [CrossRef]
  125. Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef] [PubMed]
  126. Ferroni, P.; Zanzotto, F.M.; Riondino, S.; Scarpato, N.; Guadagni, F.; Roselli, M. Breast Cancer Prognosis Using a Machine Learning Approach. Cancers 2019, 11, 328. [Google Scholar] [CrossRef] [PubMed]
  127. Ferroni, P.; Zanzotto, F.M.; Scarpato, N.; Riondino, S.; Guadagni, F.; Roselli, M. Validation of a Machine Learning Approach for Venous Thromboembolism Risk Prediction in Oncology. Dis. Markers 2017, 2017, 8781379. [Google Scholar] [CrossRef] [PubMed]
  128. Lindner, A.U.; Carberry, S.; Monsefi, N.; Barat, A.; Salvucci, M.; O’Byrne, R.; Zanella, E.R.; Cremona, M.; Hennessy, B.T.; Bertotti, A.; et al. Systems analysis of protein signatures predicting cetuximab responses in KRAS, NRAS, BRAF and PIK3CA wild-type patient-derived xenograft models of metastatic colorectal cancer. Int. J. Cancer 2020, 147, 2891–2901. [Google Scholar] [CrossRef]
  129. Agudo, A.; Peluso, M.; Munnia, A.; Luján-Barroso, L.; Barricarte, A.; Amiano, P.; Navarro, C.; Sánchez, M.-J.; Ardanaz, E.; Larrañaga, N.; et al. Aromatic DNA adducts and breast cancer risk: A case-cohort study within the EPIC-Spain. Carcinogenesis 2017, 38, 691–698. [Google Scholar] [CrossRef]
  130. Gilbertson, T.; Peluso, M.E.; Munia, A.; Luján-Barroso, L.; Sánchez, M.-J.; Navarro, C.; Amiano, P.; Barricarte, A.; Quirós, J.; Molina-Montes, E.; et al. Aromatic adducts and lung cancer risk in the European Prospective Investigation into Cancer and Nutrition (EPIC) Spanish cohort. Carcinogenesis 2014, 35, 2047–2054. [Google Scholar] [CrossRef]
  131. Peluso, M.E.M.; Munnia, A.; Srivatanakul, P.; Jedpiyawongse, A.; Sangrajrang, S.; Ceppi, M.; Godschalk, R.W.; van Schooten, F.J.; Boffetta, P. DNA adducts and combinations of multiple lung cancer at-risk alleles in environmentally exposed and smoking subjects. Environ. Mol. Mutagen. 2013, 54, 375–383. [Google Scholar] [CrossRef]
  132. Peluso, M.E.M.; Munnia, A.; Bollati, V.; Srivatanakul, P.; Jedpiyawongse, A.; Sangrajrang, S.; Ceppi, M.; Giese, R.W.; Boffetta, P.; Baccarelli, A. Aberrant Methylation of Hypermethylated-in-Cancer-1 and Exocyclic DNA Adducts in Tobacco Smokers. Toxicol. Sci. 2013, 137, 47–54. [Google Scholar] [CrossRef]
  133. Peluso, M.; Bollati, V.; Munnia, A.; Srivatanakul, P.; Jedpiyawongse, A.; Sangrajrang, S.; Piro, S.; Ceppi, M.; Bertazzi, P.A.; Boffetta, P.; et al. DNA methylation differences in exposed workers and nearby residents of the Ma Ta Phut industrial estate, Rayong, Thailand. Int. J. Epidemiol. 2012, 41, 1753–1760. [Google Scholar] [CrossRef] [PubMed]
  134. Vineis, P.; Chuang, S.-C.; Vaissiare, T.; Cuenin, C.; Ricceri, F.; Genair-EPIC Collaborators; Johansson, M.; Ueland, P.; Brennan, P.; Herceg, Z. DNA methylation changes associated with cancer risk factors and blood levels of vitamin metabolites in a prospective study. Epigenetics 2011, 6, 195–201. [Google Scholar] [CrossRef] [PubMed]
  135. Ching, T.; Himmelstein, D.S.; Beaulieu-Jones, B.K.; Kalinin, A.A.; Do, B.T.; Way, G.P.; Ferrero, E.; Agapow, P.-M.; Zietz, M.; Hoffman, M.M.; et al. Opportunities and obstacles for deep learning in biology and medicine. J. R. Soc. Interface 2018, 15, 20170387. [Google Scholar] [CrossRef] [PubMed]
  136. Jones, R.P.; Jackson, R.; Dunne, D.F.J.; Malik, H.Z.; Fenwick, S.W.; Poston, G.J.; Ghaneh, P. Systematic review and meta-analysis of follow-up after hepatectomy for colorectal liver metastases. Br. J. Surg. 2012, 99, 477–486. [Google Scholar] [CrossRef] [PubMed]
  137. Travers, A.; Jalali, A.; Begbie, S.; Semira, C.; Kosmider, S.; Ananda, S.; Wong, R.; Lee, M.; Shapiro, J.; Burge, M.; et al. Real-World Treatment and Outcomes of Metastatic Colorectal Cancer Patients with a Poor or Very Poor Performance Status. Clin. Color. Cancer 2020, 20, e21–e34. [Google Scholar] [CrossRef]
  138. Tsilimigras, D.I.; Ntanasis-Stathopoulos, I.; Bagante, F.; Moris, D.; Cloyd, J.; Spartalis, E.; Pawlik, T.M. Clinical significance and prognostic relevance of KRAS, BRAF, PI3K and TP53 genetic mutation analysis for resectable and unresectable colorectal liver metastases: A systematic review of the current evidence. Surg. Oncol. 2018, 27, 280–288. [Google Scholar] [CrossRef]
Figure 1. Systematic review flow diagram. The PRISMA flow diagram for the systematic review detailing the database searches, the number of abstracts screened and the full texts retrieved.
Figure 1. Systematic review flow diagram. The PRISMA flow diagram for the systematic review detailing the database searches, the number of abstracts screened and the full texts retrieved.
Cancers 14 04012 g001
Figure 2. Meta-analysis of 21 artificial intelligence predictive models of response to therapy for metastatic colorectal cancer (mCRC) patients. The efficacy of learning algorithms in discriminating responders mCRC patients/treated metastatic lesions vs. non-responders mCRC patients/untreated metastatic lesions was analyzed by estimating the area under the receiver operating characteristic (ROC) curve (AUC) and 95% Confidence Intervals (C.I.) in training (A) and validation sets (A,B). (C) shows efficacy of the Hazard Ratios (HRs) and 95% C.I. from proportional hazards models (Cox regression) to predict improved survival.
Figure 2. Meta-analysis of 21 artificial intelligence predictive models of response to therapy for metastatic colorectal cancer (mCRC) patients. The efficacy of learning algorithms in discriminating responders mCRC patients/treated metastatic lesions vs. non-responders mCRC patients/untreated metastatic lesions was analyzed by estimating the area under the receiver operating characteristic (ROC) curve (AUC) and 95% Confidence Intervals (C.I.) in training (A) and validation sets (A,B). (C) shows efficacy of the Hazard Ratios (HRs) and 95% C.I. from proportional hazards models (Cox regression) to predict improved survival.
Cancers 14 04012 g002aCancers 14 04012 g002b
Table 1. Evaluation measures.
Table 1. Evaluation measures.
Problem TypeMeasureExplanation
ClassificationAUCArea under the receiver operating characteristic (ROC) curve. The ROC curve is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various discrimination threshold settings for binary classifiers.
SESensitivity or true positive rate (TPR), is the fraction of actual positive cases that has been predicted as positive in the population. It is an evaluation measure for binary classifiers.
SPSpecificity or true negative rate (TNR) refers to the fraction of true negative that has been predicted as negative in the population. It is an evaluation measure for binary classifier.
ACCAccuracy is the ratio of the number of correct predictions to the total number of input samples.
PPVPositive predictive value is the ratio of patients truly diagnosed as positive to all those who had positive test results. It is an evaluation measure for binary classifiers.
NPVNegative predictive value refers to the fraction of subjects truly diagnosed as negative among those who had negative test results. It is an evaluation measure for binary classifiers.
Classification PrecisionSame as PPV
F1 scoreIt is defined as the harmonic mean of the precision and recall
PLRThe Positive Likelihood Ratio is defined as the ratio between the true positivity rate and the false positivity rate.
NLRThe Negative Likelihood Ratio is the opposite of the PLR.
Regression\Survival AnalysisDFSDisease-Free Survival is defined as the time from randomization to recurrence of tumor or death.
TTNTTime-to-next treatment is defined as the time between baseline (randomization, inclusion or treatment initiation) and the date of subsequent systemic treatment initiation.
C-indexHarrell’s concordance C-index: used to evaluate risk models in survival analysis. It is computed as the fraction of concordant patient pairs on the sum of concordant and discordant patient pairs, where two patients are considered concordant if the patient with the higher risk score is the one having a shorter time-to-disease.
HRIn survival analysis, the hazard ratio is defined as the ratio of the hazard rates corresponding to the conditions described by two levels of an explanatory variable (e.g., cases vs. controls)
Table 2. Artificial intelligence analysis of the response to standard chemotherapy alone for patients with colorectal metastasis. Data are grouped for imaging or molecular or clinical biomarkers based learning models.
Table 2. Artificial intelligence analysis of the response to standard chemotherapy alone for patients with colorectal metastasis. Data are grouped for imaging or molecular or clinical biomarkers based learning models.
TherapyAITotal, Training, TestInput FeaturesSignatureEvaluated Models and Selected Model Training Set *Test Set *Reference
Imaging biomarkers learning models
First-line FOLFOX chemotherapyML 57
(242,172,70 features)
Contrast-enhanced computed tomography (CT) scans at differential time points
14 lesion shape-based
18 images intensity-based statistics
75 images gray level-based statistics
Delta-radiomicsLinear SVM
Logistic regression
Generalized linear Model with Poisson distribution
Random Forest
Decision Tree
Response vs. non-response: AUC = 0.99, 95% C.I. 0.97–1.0, 99% SE, 94% SP, 97% ACC, 95% PPV, 99% NPVResponse vs. non-response: AUC = 0.93, 95% C.I. 0.87–0.96, 85% SE, 92% SP, 86% ACC, 90% PPV, 97% NPV[75]
First-line oxaliplatin-based regimensML42
(126,94,32
features)
Contrast-enhanced CT scans at baselineRadiomics Response vs. non-response: AUC = 0.851, 95% C.I. 0.771–0.930Response vs. non-response: AUC = 0.779, 95% C.I. 0.617–0.940[79]
First-line CAPEOX or FOLFOX and FOLFIRI or XELIRI chemotherapiesDL192,144,48
(3490 features)
Contrast-enhanced multidetector tomography (MDCT) scans at baseline;
583 radiomics features (traditional radiomics)
Fusion radiomicsRESNETResponse vs. non-response: AUC = 0.903, 95% C.I. 0.851–0.955, 84.7% SE, 84.8% SP, 84.7% ACC (radiomics model)
Response vs. non-response: AUC = 0.935, 95% C.I. 0.897–0.973, 89.8% SE, 84.8% SP, 88.2% ACC (fusion radiomics model)
Response vs. non-response: AUC = 0.745, 95% C.I. 0.659–0.831, 90% SE, 73% SP, 85.4% ACC (radiomics model)
Response vs. non-response: AUC = 0.830, 95% C.I. 0.688–0.973, 90.9% SE, 73.3% SP, 85.4% ACC
(fusion radiomics model)
[80]
First-line FOLFOX or FOLFIRI chemotherapiesML92,54,38CT scans at baseline
75 radiomics features
RadiomicsGaussian Naive Bayes
Multilayer Perceptron
Gaussian SVM
Response vs. non-response: 76% SE, 67% SP, 72% ACC, 69%, PPV, 75% NPV (per-lesion 3D set)Response vs. non-response: 61% SE, 60% SP, 61% ACC, 57% PPV, 64% NPV
(per-lesion 3D set)
Response vs. non-response: 41% SE, 21% SP, 32% ACC, 38% PPV, 23% NPV
(per-patient 3D set)
[81]
First-line FOLFIRI chemotherapyML 129,78,51 (HQ set)
236,158,78 (SQ set)
High and standard quality (HQ and SQ sets) CT scans at differential time points
1749 Radiomics Features
1742 VGG16 extracted image features
8 rim specific features
RadiomicsFor feature selection:
RELIEF
Fisher Score
Chi Squared
MRMR
t-Test
Wilcoxon
Univariate models
For classification:
SVM
KNN
Naive Bayes
Bagging
Lasso
Random Forest
Response vs. non-response: AUC = 0.75, 95% C.I. 0.63–0.85 (HQCT cohort)
Response vs. non-response: AUC = 0.75, 95% C.I. 0.67–0.82 (SQCT cohort)
Response vs. non-response: AUC = 0.59, 95% C.I. 0.44–0.72 (HQCT cohort)
Response vs. non-response: AUC = 0.55, 95% C.I. 0.43–0.66 (SQCT cohort)
[87]
Molecular biomarkers learning models
First-line FOLFOX chemotherapyML115,83,32Microarray gene-expression data74 genesIterative supervised learning (IML)Response vs. non-response: 97.6% SE, 100% SPResponse vs. non-response: OS = 13.4 vs. 36.6 mo
HROS = 2.6
[95]
First-line 5-fluorouracil-based regimensML2317,2277,40Immune infiltration data16 long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.843[94]
First-line 5-fluorouracil-based regimensML2307,2277,30Immune infiltration data16 long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.765[94]
First-line 5-fluorouracil-based regimensML2201,2277,124Immune infiltration data16 long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.709[94]
First-line FOLFIRI chemotherapyML82,27,55Next generation sequencing (NGS) data67 genesSVM
Logistic regression
KNN
Multilayer Perceptron
Naive Bayes
Quadratic Discriminant analysis
Gaussian Process
Random Forest Ensemble
Decreased survival: HROS = 2.631, 95% C.I. 1.041–6.649, OS = 18.7 vs. 34.4 mo [99]
First-line FOLFOX chemotherapyML83,54,29Microarray differentially expressed (DE) gene profiles18 genes SVM
KNN
Gradient Boosting Machine
Random Forest
Decision Tree
Neural Networks
Response vs. non-response: AUC = 0.877, 95% C.I. 0.747–1.00, 85% SE, 69.2% SP
Improved survival: HROS = 0.358, 95% C.I. 0.178–0.717 in up-regulation of MLK1
Improved survival: HROS = 0.563, 95% C.I. 0.336–0.943 in up-regulation of CCDC124
[101]
First-line FOLFIRI chemotherapyML75,54,21Microarray DE gene profiles18 genes SVM
Neural Networks
Random Forest
Response vs. non-response: AUC = 0.778, 95% C.I. 0.575–979[101]
First-line oxaliplatin or irinotecan-based regimensML176,99,77Cytokine expression data17 cytokinesFeature selection:
SVM
Classification:
Multiple linear regression
Response vs. non-response: 83.5% SE, 80% SP, 81% ACC, OS = 18.4 vs. 51.5 mo Response vs. non-response: 83.1% SE, 66.7% SP, 74.9% ACC), OS = 16.8 vs. 55.9 mo [83]
First-line FOLFOX chemotherapyML83,54,29Microarray DE gene profiles14 genes Random Forest for feature selection, outlier detection and classificationResponse vs. non-response: 91.3% SE, 95.6% SP, 80.2% ACC Response vs. non-response: 80% SE, 92.8% SP, 69.2% ACC[82]
First-line FOLFOX chemotherapyML44Protein pattern data6-proteins Response vs. non-response: 92.9% SE, 81.3% SP [84]
First-line FOLFIRI chemotherapyML26Protein pattern data7-proteins Response vs. non-response: 92.3% SE, 92.3% SP [84]
First-line FOLFIRI chemotherapyML21Microarray DE gene profiles14-genes Response vs. non-response: 92% SE, 100% SP, 95% ACC [92]
Clinical biomarkers learning models
Irinotecan plus FOLFIRI chemotherapyML20Demographic data
Liver function tests
Tumor markers
Pharmacokinetic parameters
Anticancer treatment information; i.e., prior nonfuoropyrimidine regimen, prior surgery, prior radiation, fuoropyrimidine-based chemotherapy, concurrent surgery, concurrent radiation (categorical variables)
3pharmacoparameters Classification:
BSLR: Backward Stepwise Logistic
Regression (diarrhea)
C4.5 algorithm
Random Forest (leukopenia)
SVM (neutropenia)
Toxic side-effect: AUC = 0.74, 89% SE, 60% SP, 76% ACC (leukopenia)
Toxic side-effect: AUC = 0.88, 70% SE, 70% SP, 75% ACC (neutropenia)
for toxic side-effect: AUC = 0.95, 81% SE, 100% SP = 100%, 91% ACC (diarrhea)
[93]
First-line fluoropyrimidine-based regimensML 36,030 patients
(training set size not reported, used data from 2006-11 + 10-fold cross validation)
(test set size not reported, used data from 2012-14)
Anticancer treatment information; i.e., prior nonfuoropyrimidine regimen, prior surgery, prior radiation, fuoropyrimidine-based chemotherapy, concurrent surgery, concurrent radiation
-cancer information; i.e., cancer stage, tumor sizes
demographic variables; i.e., age, sex, race, and geographic areas
socioeconomic status; i.e., county-level median household income, education
prior cardiovascular disease histories and other comorbidities; i.e., hypertension, hyperlipidemia, diabetes
prior medications; i.e., beta-blockers, angiotensin-converting enzyme (ACE) inhibitors
Logistic Regression
Random Forest
XGBoost
Toxic side-effect: precision = 0.621, F1 = 0.396 AUC = 0.801, 95% C.I. 0.781–0.821 (cardiotoxicity)[96]
AI, artificial intelligence; ML, machine learning; DL, deep learning. AUC, receiver operating characteristic curve; SE, sensitivity; SP, specificity; ACC, accuracy; PPV, positive predictive value; NPV, negative predictive value; HR, hazard ratio.
Table 3. Artificial Intelligence analysis of the response to targeted therapy for patients with colorectal metastasis. Data are grouped for imaging or biomarkers based learning models.
Table 3. Artificial Intelligence analysis of the response to targeted therapy for patients with colorectal metastasis. Data are grouped for imaging or biomarkers based learning models.
TherapyAITotal, Training, TestInput FeaturesSignatureEvaluated Models and Selected Model Training Set Test Set Reference
Imaging biomarkers learning models
Second-line anti-VEGF therapy (aflibercept plus FOLFIRI, prior bevacizumab plus oxaliplatin-based regimens)DL 1028,502,526
(3757,1864,1893 lesions)
Automatically extracted features from CT scans at differential time pointsRadiomicsGoogleNet + LSTM with 4 times stepsResponse vs. non-response: C-Index = 0.678, 95% 0.650–0.706 (DL model) Response vs. non-response: C-IndexOS = 0.649, 95% C.I. 0.619–0.679 (DL model)
C-IndexOS = 0.694, 95% C.I. 0.661–0.720 (DL model)
OS = 18 vs. 10.4 mo (HR = 0.49, 95% C.I. 0.40–0.61)
[85]
First-line anti-VEGF therapy (bevacizumab plus oxaliplatin/fluoropyrimidine-based regimens)ML76,52,24Magnetic resonance imaging (MRI) at baseline
48 texture features
15 morphological features
RadiomicsFeatures Selection:
Lasso
Classification:LDA
SVM
KNNArtificial neural network
Decision tree
RAS mutation status and non-responder to anti-EGFR therapy: AUC = 0.79, 95% C.I. 0.70–0.85, 78% SE, 74.2% SP, 76.1% ACC (texture and morphological features model)
Response vs. non-response according to RAS mutation status: AUC = 0.84, 95% C.I. 0.780–0.91, 90% SE, 67.8% SP, 76.9% ACC (texture features model)
RAS mutation status and non-responder to anti-EGFR therapy: 83.3% SE, 75% SP, 79.2% ACC (texture and morphological features model)
91.7% SE, 83.3% SP, 87.5% ACC (texture features model)
[91]
First-line anti-VEGF therapy (bevacizumab with or without OLFOX/FOLFIRI/XELOX, combined with anti-EGFR in KRAS, NRAS and BRAF wild-types)DL180,101,79
(433,264,169 lesions)
MRI at differential time pointsRadiomicsDC3CNN Response vs. non-response: AUC = 0.849, 95% C.I. 0.737–0.926, 91.7% SE, 75% SP, ACC of 87.5%, 75% NPV, 91.7% PPV (four features model)
Response vs. non-response: AUC = 0.833, 95% C.I. 0.695–1.000, 91.9% SE, 75% SP, ACC of 88.5%, 69.2% NPV, 93.8% PPV (four features model)
[98]
\First-line anti-VEGF therapy (bevacizumab plus FOLFOX chemotherapy)DL 202,162,40 (treated/untreated lesion classification)
120,84,12,24 (treatment response classification)
Contrast-enhanced (CT) scans at baseline
Texture analysis (TA) features
RadiomicsTreated/untreated classification:
DT, SVM-RBF, ANN, Inception-inspired CNN
Treatment response prediction:
CNN
Response vs. non-response: AUC = 0.83, 95% C.I. 0.78–0.87, 97% SE, 59% SP, 78% ACC)Response vs. non-response: AUC = 0.88, 95% C.I. 0.85–0.94, 98% SE, 54% SP, 76% ACC [86]
First-line anti-EGFR therapy (cetuximab plus FOLFIRI chemotherapy) ML116,78,38 (HQ set)
186,124,62 (SQ set)
High and standard quality (HQ and SQ) CT scans at differential time points
1749 Radiomics Features
1742 VGG16 extracted image features
8 rim specific features
RadiomicsFor feature selection:
RELIEF
Fisher ScoreChi Squared
MRMR
t-Test
Wilcoxon
Univariate models
For classification:
SVM
KNN
Naive Bayes
Bagging
Lasso
Random Forest
Response vs. non-response: AUC = 0.83, 95% C.I. 0.75–0.92, 77% SE, 85% SP (HQ cohort) AUC = 0.84, 95% C.I. 0.76–0.89 (SQ cohort)Response vs. non-response: AUC = 0.80, 95% C.I. 0.69–0.94, 80% SE, 78% SP (HQ cohort)
AUC = 0.72, 95% C.I. 0.59–0.83, 82% SE, 61% SP (SQ cohort)
[87]
First-line anti-EGFR therapy
(cetuximab plus FOLFIRI or FOLFOX chemotherapies)
ML 1886CT scans and MRI at baselineRadiomicsClustering:
K-Means
Response vs. non-response in KRAS mutated tumors: HROS = 1.44, 95% C.I. 1.08–1.92 [88]
Last-line dual anti-HER2 therapy (lapatinib plus trastuzumab or pertuzumab plus trastuzumabemantansine)ML38,28,10 patients
141,108,33 lesions
CT scans at baseline
24 radiomics features
RadiomicsFeatures Selection
Genetic algorithms (GAs)
ClassificationGaussian naïve Bayesian classifier
Response vs. non-response in RAS wild-type and HER2 amplified tumors: 89% SE, 85% SP, 93% PPV, 78% NPV (lesion model)Response vs. non-response in RAS wild-type and HER2 amplified tumors: 90% SE, 42% SP, 73% PPV, 71% NPV (lesion model)
92% SE, 86% SP, 96% PPV, 75% NPV (patient model)
[74]
Molecular biomarkers learning models
First-line anti-VEGF therapy (bevacizumab plus 5-fluorouracil-based regimens)ML2289,2277,12Immune infiltration data16-long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.771 [94]
First-line anti-VEGF therapy (bevacizumab plus 5-fluorouracil-based regimens)ML2291,2277,14Immune infiltration data16-long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.696 [94]
First-line anti-VEGF therapy (bevacizumab plus 5-fluorouracil-based regimens)ML2305,2277,28Immune infiltration data16-long noncoding RNA Feature selection:
Ensemble of statistical and machine learning models
Survival analysis:
Cox proportional hazard model
Response vs. non-response: AUC = 0.781 [94]
First-line anti-VEGF therapy (bevacizumab plus FOLFOX chemotherapy) ML650,105,545Next generation sequencing (NGS) data67-gene SVM
Logistic regression
KNN
Multilayer Perceptron
Naive Bayes
Quadratic Discriminant analysis
Gaussian Process
Random Forest Ensemble
Improved survival: HRTTNT = 0.537, 95% C.I. 0.428–0.674
TTNT = 11.5 vs. 8.2 mo
Improved survival: HROS = 0.466, 95% C.I. 0.325–0.670
OS = 42 vs. 24.5 mo
Improved survival: HROS = 0.629, 95% C.I. 0.404–0.981
OS = 30 vs. 15.9 mo
[99]
First-line anti-VEGF therapy (bevacizumab plus FOLFOXIRI chemotherapy) ML208,105,103NGS data67-gene SVMLogistic regression
KNN
Multilayer Perceptron
Naive Bayes
Quadratic Discriminant analysis
Gaussian Process
Random Forest Ensemble
Improved survival: HROS = 0.483, 95% C.I. 0.270–0.864
OS = 30 vs. 15.9 mo
[99]
First-line anti-VEGF therapy
(bevacizumab plus FOLFIRI chemotherapy
ML488,345,143Genotyping data27 SNPsFeature selection:
Random Survival Forests
Variable importance Minimal depth
Survival analysis:
Kaplan–Meier curves Log-rank test
Survival in KRAS wild-type with CBP rs129963 T/T variant: OS = 22.8 vs. 26 mo and PFS = 9.5 vs. 10.5 mo
Survival in KRAS wild-type with TBK1 rs7486100 A/A variant: OS = 31.3 vs. 24.8 mo and PFS = 11.3 vs. 10.3 mo
Survival in CCL2 rs4586 T/T carriers: OS = 30.9 vs. 22.8 mo
Survival in VEGFR2 rs2305948 any C carriers: OS = 26.2 vs. 17.0 mo
Survival in DMRT1 rs755383 any T carriers: PFS = 9.4 vs. 9.0 mo
Survival in MMP2 rs243865 any T carriers: OS = 28.5 vs. 20.3 mo
Survival in KRAS mutant with β-catenin rs3864004 A/A genotype: OS = 16.3 vs. 26.3 mo and PFS = 7.8 vs. 9.6 mo
Survival in KRAS mutant with TBK1 rs7486100 A/A variant: PFS = 10.3 vs. 8.6 mo
[102]
First-line anti-VEGF therapy
(bevacizumab plus FOLFIRI chemotherapy)
ML558,180,378Exome-sequencing data1 or 2-SNPs Combination of two Cox penalized regression models:
Lasso
Elastic Net
Improved survival: HRPFS = 0.52, 95% C.I. 0.33–0.83
Decreased survival: HRPFS = 2.3, 95% C.I. 1.19–4.57
Decreased survival in NLRP1 any A and SRL AA carriers: HRPFS = 8.3, 95% C.I. 3.3–21 and 2.2, 95% C.I. 1–5
Decreased survival in KRAS wild-type and concomitant carriers in combination with NLRP1 any A and SRL AA: HRPFS = 8.3, 95% C.I. 3.3–21 and 2.2, 95% C.I. 1–5
OS = 12 vs. 27 mo
Improved survival: HRPFS = 0.42, 95% C.I. 0.21–0.85 in the NLRP1 TT carriers
Decreased survival: HRPFS = 2.5, 95% C.I. 1.12–5.5 in the SRL AA carrier
[89]
Second-line anti-EGFR therapy (panitumumab with or without irinotecan, prior fluoropyrimidine-based regimens, with or without oxaliplatin)ML 499,274,225Immunochemistry (IHC) dataAmphiregulin
/epiregulin
Improved survival in KRAS wild-type: HRPFS = 0.54, 95% C.I. 0.37–0.79, OS = 8.0 vs. 3.2 mo
Improved survival in KRAS and BRAF wild-types: HRPFS = 0.53, 95% C.I. 0.36–0.78
[97]
First-line anti-EGFR therapy (CAPEOX-B chemotherapy plus or without cetuximab) ML553,368,185Genome wide
genotyping and
survival data
781-SNPsModified Random Forest using SurvDiff in place of Gini index to split the data at each node
Classic RF with survival derived data labels
Response vs. non-response: HRbenefit = 0.69, 95% C.I. 0.49–0.98
Response vs. non-response: HRno benefit = 1.32, 95% C.I. 1.07–1.62 (SNP model)
Response vs. non-response: HRbenefit = 0.52, 95% C.I. 0.35–0.76
(sex chromosome model)
[90]
First-line anti-EGFR therapy (FOLFIRI) or anti-VEGF therapy (fluoropyrimidine-based regimens, with or without oxaliplatin or irinotecan or FOLFOXIRI or other regimens plus or without bevacizumab) ML859,471,388NGS data7-gene classifier Decreased survival: HRPFS = 16.9, 95% C.I. 4.2–68.0 [100]
AI, artificial intelligence; ML, machine learning; DL, deep learning. AUC, receiver operating characteristic curve; SE, sensitivity; SP, specificity; ACC, accuracy; PPV, positive predictive value; NPV, negative predictive value; HR, hazard ratio; TTNT, time-to-next treatment.
Table 4. Least frequently met study quality items modified by Brnabic and Hess [72].
Table 4. Least frequently met study quality items modified by Brnabic and Hess [72].
Study
State if Outliers with Impossible or Extreme Responses Are Removed; State Any Criteria Used for Outlier Removal.State How Missing Values Were Handled.External Validation Should also Be Performed Whenever PossibleIf Possible, Report the Parameter Estimates in the Model and Their Confidence Intervals or Report Non-Parametric Estimates from Bootstrap Samples.Meta-Analysis
Inclusion
References
Johnson (2022)YesYesYesNoNo[100]
Li (2022)YesYesYesYesNo[96]
Liu (2022)NoNoYesNoNo[94]
Giannini (2022) YesYesYesYesYes[75]
Granata (2021) YesYesYesYesNo[91]
Abraham (2021) NoNoYesYesYes[99]
Naseem (2021)NoNoYesNoNo[102]
Defeudis (2021)YesNoNoNoNo[81]
Nakanishi (2021) YesYesNoYesYes[79]
Wei (2021) NoNoNoNoYes[80]
Williams (2021)NoNoNoYesNo[97]
Tian (2021)YesYesYesYesNo[95]
Lu (2021)NoNoNoYesNo[85]
Giannini (2020)YesYesNoNoNo[74]
Ubels (2020)NoNoNoYesYes[90]
Dercle (2020) NoYesNoYesYes[87]
Barat (2020)NoNoYesYesNo[89]
Zhu (2020) NoYesYesYesYes[98]
Maaref (2020) NoNoNoYesYes[86]
Lu (2020)NoNoNoYesYes[101]
Vera-Yunca (2020)YesNoNoYesNo[88]
Oyaga-Iriarte (2019)NoNoNoNoNo[93]
Chen (2014)NoNoNoYesNo[83]
Tsuji (2012)YesYesNoNoNo[82]
Yuan (2012)NoNoNoNoNo[84]
Del Rio (2007)NoNoNoYesNo[92]
Table 5. The 21 artificial intelligence models reporting the estimates of receiver operating characteristic curve (AUCs) and/or hazard ratios (HRs) and 95% Confidence Intervals (C.I.) for evaluating predictive response or overall survival included in the meta-analysis. Classifiers, cytotoxic chemotherapy, targeted therapy, population size, AUCs, HRs and 95% C.I. in training and/or validations sets are reported.
Table 5. The 21 artificial intelligence models reporting the estimates of receiver operating characteristic curve (AUCs) and/or hazard ratios (HRs) and 95% Confidence Intervals (C.I.) for evaluating predictive response or overall survival included in the meta-analysis. Classifiers, cytotoxic chemotherapy, targeted therapy, population size, AUCs, HRs and 95% C.I. in training and/or validations sets are reported.
Study Included, Signature and Regimens Performance EstimatesReferences
Training setsnAUC, 95% C.I.
Giannini (2022), radiomics signature and chemotherapy1720.99, 95% C.I. 0.97–1.00[75]
Nakanishi (2021), radiomics signature and chemotherapy940.851, 95% C.I. 0.771–0.93[79]
Wei (2021), radiomics signature and chemotherapy1440.935, 95% C.I. 0.897–0.973[80]
Dercle (2020), radiomics signature and chemotherapy780.75, 95% C.I. 0.63–0.85[87]
Dercle (2020), radiomics signature and targeted therapy780.83, 95% C.I. 0.75–0.92[87]
Maaref (2020), radiomics signature and targeted therapy1620.83, 95% C.I. 0.78–0.87[86]
Validation sets AUC, 95% C.I.
Giannini (2022), radiomics signature and chemotherapy700.93, 95% C.I. 0.87–0.96[75]
Nakanishi (2022), radiomics signature and chemotherapy320.779, 95% C.I. 0.617–0.94[79]
Wei (2021), radiomics signature and chemotherapy480.830, 95% C.I. 0.688–0.973[80]
Lu (2020), 18-gene signature and chemotherapy (FOLFOX)290.877, 95% C.I. 0.747–1.00[101]
Lu (2020) 18-gene signature and chemotherapy (FOLFIRI)210.778, 95% C.I. 0.575–0.979[101]
Dercle (2020), radiomics signature and chemotherapy510.59, 95% C.I. 0.44–0.72[87]
Dercle (2020), radiomics signature and targeted therapy380.80, 95% C.I. 0.69–0.94[87]
Zhu (2020), radiomics signature and targeted therapy790.849, 95% C.I. 0.737–0.926[98]
Zhu (2020), radiomics signature and targeted therapy730.833, 95% C.I. 0.695–1.00[98]
Maaref (2020), radiomics signature and targeted therapy400.88, 95% C.I. 0.85–0.94[86]
Validation sets HR, 95% C.I.
Lu (2020), MLK1-gene signature and chemotherapy (FOLFOX)290.358, 95% C.I. 0.178–0.717[101]
Lu (2020), CCDC124-gene signature and chemotherapy (FOLFOX)290.563, 95% C.I. 336–0.943[101]
Lu (2021), radiomics learning models and targeted therapy5260.49, 95% C.I. 0.4–0.61[85]
Abraham (2021), 67-gene signature and targeted therapy1030.483, 95% C.I. 0.270–0.864[99]
Abraham (2021), 67-gene signature and targeted therapy5450.629, 95% C.I. 0.404–0.981[99]
Table 6. Performances of the artificial intelligence studies.
Table 6. Performances of the artificial intelligence studies.
TherapyAISignatureTraining Set Test Set Reference
ChemotherapyML Delta-radiomicsExcellent, 99% correctly classified as responders, 94% correctly classified as non-respondersExcellent, 85% correctly classified as responders, 92% correctly classified as non-responders[75]
MLRadiomicsGoodPoor[79]
DLFusion radiomicsExcellent, 84.7% correctly classified as responders, 84.8% correctly classified as non-responders (radiomics model)
Excellent, 89.8% correctly classified as responders, 84.8% correctly classified as non-responders (fusion radiomics model)
Poor, 90% correctly classified as responders, 73% correctly classified as non-responders (radiomics model)
Good, 90.9% correctly classified as responders, 73.3% correctly classified as non-responders (fusion radiomics model)
[80]
MLRadiomics76% correctly classified as responders, 67% correctly classified as non-responders (per-lesion 3D set)61% correctly classified as responders, 60% correctly classified as non-responders (per-lesion 3D set)
41% correctly classified as responders, 21% correctly classified as non-responders (per-patient 3D set)
[81]
ML RadiomicsPoor (high standard quality computed tomography scan, HQCT, set)
Poor (high standard quality computed tomography scan, SQCT, set)
Failed (HQCT set)
Failed (SQCT set)
[87]
ML74 genes 97.6% correctly classified as responders, 100% correctly classified as non-responders [95]
ML16 long noncoding RNA Good[94]
ML16 long noncoding RNA Poor[94]
ML16 long noncoding RNA Poor[94]
Targeted therapyDLRadiomics Good, 91.7% correctly classified as responders, 75% correctly classified as non-responders (four features model)
Good, 91.9% correctly classified as responders, 75% correctly classified as non-responders (four features model)
[98]
DL RadiomicsGood, 97% correctly classified as responders, 59% correctly classified as non-respondersGood, 98% correctly classified as responders, 54% correctly classified as non-responders[86]
MLRadiomicsGood, 77% correctly classified as responders, 85% correctly classified as non-responders (high quality, HQ, cohort)
Good (standard quality, SQ, cohort)
Good, 80% correctly classified as responders, 78% correctly classified as non-responders (HQ cohort)
Poor, 82% correctly classified as responders, 61% correctly classified as non-responders (SQ cohort)
[87]
MLRadiomics89% correctly classified as responders, 85% correctly classified as non-responders (lesion model)90% correctly classified as responders, 42% correctly classified as non-responders (lesion model)
92% correctly classified as responders, 86% correctly classified as non-responders (patient model)
[74]
ML16-long noncoding RNA Poor[94]
ML16-long noncoding RNA Worthless[94]
ML16-long noncoding RNA Poor[94]
ML67-gene GoodPoor[99]
Chemotherapy and targeted therapy ML/DLImaging/molecular signaturesGoodGoodPresent meta-analysis
ML, machine learning; and DL, deep learning.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Russo, V.; Lallo, E.; Munnia, A.; Spedicato, M.; Messerini, L.; D’Aurizio, R.; Ceroni, E.G.; Brunelli, G.; Galvano, A.; Russo, A.; et al. Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis. Cancers 2022, 14, 4012. https://doi.org/10.3390/cancers14164012

AMA Style

Russo V, Lallo E, Munnia A, Spedicato M, Messerini L, D’Aurizio R, Ceroni EG, Brunelli G, Galvano A, Russo A, et al. Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis. Cancers. 2022; 14(16):4012. https://doi.org/10.3390/cancers14164012

Chicago/Turabian Style

Russo, Valentina, Eleonora Lallo, Armelle Munnia, Miriana Spedicato, Luca Messerini, Romina D’Aurizio, Elia Giuseppe Ceroni, Giulia Brunelli, Antonio Galvano, Antonio Russo, and et al. 2022. "Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis" Cancers 14, no. 16: 4012. https://doi.org/10.3390/cancers14164012

APA Style

Russo, V., Lallo, E., Munnia, A., Spedicato, M., Messerini, L., D’Aurizio, R., Ceroni, E. G., Brunelli, G., Galvano, A., Russo, A., Landini, I., Nobili, S., Ceppi, M., Bruzzone, M., Cianchi, F., Staderini, F., Roselli, M., Riondino, S., Ferroni, P., ... Peluso, M. (2022). Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis. Cancers, 14(16), 4012. https://doi.org/10.3390/cancers14164012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop