Next Article in Journal
Exploring the Link Between Obligate Anaerobe-Related Dysbiosis and Prostate Cancer Development: A Pilot Study
Next Article in Special Issue
A Machine Learning-Based Radiomics Model for the Differential Diagnosis of Benign and Malignant Thyroid Nodules in F-18 FDG PET/CT: External Validation in the Different Scanner
Previous Article in Journal
Therapeutic Approaches for Advanced Basal Cell Carcinoma: A Comprehensive Review
Previous Article in Special Issue
The Challenge of External Generalisability: Insights from the Bicentric Validation of a [68Ga]Ga-PSMA-11 PET Based Radiomics Signature for Primary Prostate Cancer Characterisation Using Histopathology as Reference
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma

by
Theofilos Kanavos
,
Effrosyni Birbas
and
Theodoros P. Zanos
*
Institute of Health System Science, Feinstein Institutes for Medical Research, Manhasset, NY 11030, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Cancers 2025, 17(1), 69; https://doi.org/10.3390/cancers17010069
Submission received: 15 November 2024 / Revised: 20 December 2024 / Accepted: 24 December 2024 / Published: 29 December 2024

Simple Summary

Positron emission tomography (PET) is useful for assessing lymphomas, while artificial intelligence (AI) shows potential as a reliable resource for analyzing medical images. In this context, several studies have developed AI models for the automated interpretation of lymphoma PET images. The proposed models achieved promising performance in various medical tasks, including detection, histological subtyping, differential diagnosis, and prognostication. AI techniques for lymphoma PET evaluation are not designed to replace physicians but to assist them in handling large volumes of scans through rapid and accurate calculations, reduce their workload, and provide them with decision support tools that can improve their ability to deliver precise care for better outcomes.

Abstract

Background: Positron emission tomography (PET) is a valuable tool for the assessment of lymphoma, while artificial intelligence (AI) holds promise as a reliable resource for the analysis of medical images. In this context, we systematically reviewed the applications of deep learning (DL) for the interpretation of lymphoma PET images. Methods: We searched PubMed until 11 September 2024 for studies developing DL models for the evaluation of PET images of patients with lymphoma. The risk of bias and applicability concerns were assessed using the prediction model risk of bias assessment tool (PROBAST). The articles included were categorized and presented based on the task performed by the proposed models. Our study was registered with the international prospective register of systematic reviews, PROSPERO, as CRD42024600026. Results: From 71 papers initially retrieved, 21 studies with a total of 9402 participants were ultimately included in our review. The proposed models achieved a promising performance in diverse medical tasks, namely, the detection and histological classification of lesions, the differential diagnosis of lymphoma from other conditions, the quantification of metabolic tumor volume, and the prediction of treatment response and survival with areas under the curve, F1-scores, and R2 values of up to 0.963, 87.49%, and 0.94, respectively. Discussion: The primary limitations of several studies were the small number of participants and the absence of external validation. In conclusion, the interpretation of lymphoma PET images can reliably be aided by DL models, which are not designed to replace physicians but to assist them in managing large volumes of scans through rapid and accurate calculations, alleviate their workload, and provide them with decision support tools for precise care and improved outcomes.

1. Introduction

Lymphomas represent a heterogeneous group of hematological malignancies derived from lymphocytes and are estimated to account for approximately 5% of all cancers. They are broadly classified into Hodgkin lymphoma (HL) and non-Hodgkin lymphoma (NHL), which constitute about 10% and 90% of all cases, respectively. Each category is further divided into several subtypes (Table 1) [1]. HL is rare and considered one of the most highly curable malignancies in both pediatric and adult patients with a survival rate that exceeds 90% [2,3]. NHLs are categorized into aggressive and indolent forms based on their rate of progression and this distinction has substantial therapeutic and prognostic implications. Aggressive types grow and spread rapidly, while indolent ones generally progress slowly [4]. Diffuse large B cell lymphoma (DLBCL) is the most common aggressive NHL, accounting for 80% of all cases [5], whereas follicular lymphoma (FL) is the most frequent indolent NHL, representing 70% of all cases [6].
Positron emission tomography (PET) is a sophisticated physiology-based imaging modality that provides information regarding the function of tissues. The technique relies on detecting radioactivity emitted after the intravenous injection of a radiotracer. The most widely used radioactive tracer is 18F-fluorodeoxyglucose (18F-FDG), which is a radiolabeled glucose analog and therefore an imaging biomarker for glucose metabolism. Consequently, 18F-FDG PET enables the visualization of glucose consumption by tissues and can be used to detect metabolically active lesions [7,8]. In addition, PET can be combined with other cross-sectional imaging modalities, namely, computed tomography (CT) or magnetic resonance imaging (MRI), to provide a comprehensive view that integrates both functional and anatomical information. In other words, PET/CT and PET/MRI provide physicians with physiological details linked to a specific anatomical site, which enhances the accuracy of disease characterization [9]. Although PET comes with its own set of limitations, including radiation exposure, false positive and negative results, cost and availability issues, and interpretation variability, it constitutes a powerful imaging tool for the staging, treatment response evaluation, and post-treatment surveillance in patients with either HL or NHL [10].
Artificial intelligence (AI) refers to the broad concept of computers performing tasks that would typically require human intelligence, such as decision-making, pattern recognition, and problem-solving. Machine learning (ML) is a subfield of AI that involves the development of models that can learn from data without being explicitly programmed to make predictions or identify patterns. Deep learning (DL) is a specific type of ML that uses artificial neural networks to analyze large volumes of complex data [11]. Convolutional neural networks (CNNs) constitute a DL architecture that is particularly effective in processing and analyzing image data [12]. In recent years, AI has emerged as a revolutionary technology in the medical field due to the need to enhance diagnostic accuracy and streamline clinical workflows. A large number of DL models have been developed by multiple scientific groups for the analysis of various kinds of histopathological [13], cytopathological [14], radiological [15], and nuclear medicine [16] images.
As far as lymphomas are concerned, AI-powered models are continuously being developed with the objective of assisting physicians in the management of these hematological malignancies and optimizing clinical workflows. Many of these models are based on PET, which serves as a powerful tool for various types of lymphoma (Figure 1).
In this context, the present systematic review aims to outline the applications of DL for the interpretation of PET images of patients with lymphoma and highlight the potential of AI models to support physicians without replacing their clinical expertise. To provide a comprehensive overview of this topic, we structured our manuscript by categorizing relevant studies based on the distinct medical tasks that AI can facilitate (Figure 2).

2. Materials and Methods

Our systematic review was performed in accordance with the recommendations of the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidance [17]. In addition, the present study was registered with the international prospective register of systematic reviews, PROSPERO, as CRD42024600026.

2.1. Search Strategy

We performed a systematic search of PubMed from its inception to 11 September 2024 to identify primary studies that involved the development of a DL model for the interpretation of PET images of patients with lymphoma. For this purpose, the following search algorithm was applied: (deep learning OR convolutional neural network OR CNN) AND (PET OR positron) AND lymphoma.

2.2. Eligibility Criteria

Eligible studies were selected based on the following inclusion and exclusion criteria. To be eligible for inclusion, studies needed to focus on the development of at least one DL model for the interpretation of PET images of patients diagnosed with lymphoma. The exclusion criteria encompassed reviews, case reports, datasets, editorials, comments, letters, interviews, preprints, articles published in languages other than English, papers involving non-human subjects, and studies that solely addressed lesion segmentation.

2.3. Selection Process

The titles and abstracts of all the citations collected using the previously described methodology were independently screened by two authors using the Rayyan web application [18]. During this screening period, the researchers convened regularly to discuss any disagreements and resolve conflicts by consensus. The full texts of potentially eligible articles identified through this process were then retrieved for further examination to determine the articles that were ultimately included in our systematic review.

2.4. Data Extraction Process

To streamline the data extraction process, we generated a spreadsheet form that all authors could access to import data from the included articles. From each study, we manually extracted information regarding the authors, the year of publication, the number of participants and their mean or median age, the imaging modality, the medical objective, the DL architecture, the performance metrics, and whether external validation was conducted.

2.5. Study Quality Assessment

The risk of bias and applicability concerns of the studies included in our systematic review were assessed using the prediction model risk of bias assessment tool (PROBAST), which is designed for studies aiming to develop, validate, or update prediction models that can be either diagnostic or prognostic. To accommodate the specific needs of AI-based models, the signaling questions within the tool were adjusted accordingly. PROBAST is organized into four domains, namely, participants, predictors, outcomes, and analysis. The risk of bias was evaluated across all four domains and was rated as low, unclear, or high for each of them. Concerns regarding applicability were assessed for the first three of the aforementioned domains and were judged similarly to the risk of bias. This evaluation was performed by two independent authors and disagreements were resolved through discussion and consensus. An overall judgment about the risk of bias and applicability concerns was also established based on the classifications for each domain. A model with a low risk of bias in all domains was considered to present a low overall risk of bias. The presence of an unclear risk of bias in at least one domain in the absence of a high risk of bias in any domain rendered the overall risk of bias unclear. A high overall risk of bias was assigned if a high risk of bias was noted in any domain. The judgment on the overall applicability concerns was governed by the same rules [19,20].

3. Results

This section provides a comprehensive overview of the papers that are relevant to the topic of our systematic review, detailing the selection, characteristics, risk of bias, and key findings of the included studies.

3.1. Study Selection

Our systematic search yielded 71 articles, 38 of which were excluded during the title and abstract screening phase. As a result, 33 papers were selected for full-text assessment. In the end, 21 articles were deemed to fulfill our criteria of eligibility and were included in our review (Figure 3).

3.2. Study Characteristics

The articles included in our systematic review were diverse in terms of their medical objectives and methodologies (Figure 4). The number of participants in the studies varied from 39 to 2030 with a median of 167. Sixteen papers (76%) included at least 100 individuals, while three (14%) considered 1000 or more participants. With respect to imaging modalities, 18 articles (86%) were based on PET/CT, two (9%) on PET alone, and only one (5%) on PET/MRI. In terms of medical tasks, three papers (14%) focused on detection, two (10%) on classification, three (14%) on differential diagnosis, six (29%) on metabolic tumor volume (MTV) estimation, four (19%) on treatment response prediction, and three (14%) on survival prediction. Regarding lymphoma types, nine studies (43%) investigated DLBCL, one (5%) focused on mantle cell lymphoma (MCL), one (5%) utilized participants diagnosed with extranodal NK/T cell lymphoma, nasal type (ENKTL-NT), one (5%) involved patients with HL, four (19%) used different types of NHL, three (14%) included both HL and NHL cases, while two papers (9%) did not specify the histological subtypes addressed. All 21 studies (100%) utilized retrospective data, three (14%) used data from open-access sources, two (10%) examined exclusively pediatric cases, three (14%) reported excluding images of poor quality, and 10 (48%) performed external validation.

3.3. Risk of Bias in Studies

The risk of bias and concerns regarding the applicability of the included studies were evaluated using PROBAST. Although none of the studies were classified as having a high overall risk of bias, approximately 62% of them were found to have an unclear one. In particular, the small sample size and the lack of external validation were the primary limitations of multiple studies. With respect to applicability, the majority of studies presented low overall concerns. However, one study [21] raised unclear concerns in the domain of participants due to the additional use of lung cancer and melanoma scans for model training, while two studies [22,23] exhibited high concerns in the domain of predictors because they considered clinical parameters alongside PET/CT scans (Figure 5).

3.4. Results of Included Studies

Below, we present the findings of the included studies, organized by their primary objective, namely, detection, classification, differential diagnosis, MTV estimation, and treatment response and survival prediction (Table 2).

3.4.1. Detection

Some research groups have proposed DL models for the automatic detection of lymphoma lesions in PET images, which can rapidly provide clinicians with useful information about the location and burden of a neoplasm. The manual identification of multiple lymphoma lesions in medical images can be a tedious and lengthy process; therefore, models that automatically perform this task could significantly ease the workload for physicians and potentially limit the time interval between the acquisition and interpretation of PET images.
The quantitative analysis of PET/CT images of patients with lymphoma offers significant prognostic insights but requires the identification of all lesions, which can be time-consuming, labor-intensive, and susceptible to inter-reader variability. To address this issue, Zhou et al. [24] constructed an Xception-based U-Net for the computer-aided detection of MCL in PET/CT images. When tested on external institution scans, the model achieved a median sensitivity of 84% with 14 false positives (FPs) per patient. Interestingly, the performance of the model did not exhibit a statistically significant difference between the images from within and outside the institution, which indicates its robustness and applicability to independent datasets. It is worth mentioning that most FPs were limited to sites where normal physiological processes or benign inflammatory conditions could explain the findings. In addition, the model demonstrated relatively low sensitivity for lesions with a maximum standardized uptake value (SUVmax) beneath the liver reference, which may correspond to indolent MCL cases. Similarly, Weisman et al. [25] developed an ensemble of three-dimensional CNNs for the automatic detection of lymph nodes involved in lymphoma in PET/CT images, which achieved a sensitivity of 85% with an average of four FPs per patient and performed comparably to the differences observed between the annotations of two physicians. Furthermore, Wang et al. [26] generated a multimodal DL algorithm for the computer-aided identification of lymphoma lesions in PET/MRI scans of pediatric patients, which performed adequately with a sensitivity of 76% and a false discovery rate of 10%.

3.4.2. Classification

As already mentioned, lymphomas are classified into HL and NHL and each category is further divided into multiple subtypes, the determination of which has both therapeutic and prognostic implications. A biopsy followed by a histopathological examination is necessary for the definite diagnosis and subtype identification of lymphomas; however, such an approach comes with certain drawbacks. Needle core biopsy may not always provide sufficient information for an accurate analysis due to the spatial and temporal heterogeneity of lymphomas, while excision biopsy carries the risk of anesthetic and surgical complications. On top of that, the histopathological examination of lymphoma samples under the microscope is a time-consuming and challenging task [4,27].
A study involving both aggressive and indolent NHL cases indicated that 18F-FDG uptake is higher in aggressive than in indolent forms and that an SUVmax greater than 10 may predict aggressive behavior [41]. However, in another study of 38 indolent subtypes of NHL, the SUVmax of lymphomas widely ranged from 1.7 to 41.2, suggesting that the SUVmax can vary significantly across different histological subtypes of indolent NHL [42]. Consequently, adopting a single SUVmax cut-off for distinguishing between aggressive and indolent NHLs on PET/CT images would be unreliable. Therefore, although nuclear medicine experts can identify lymphomas on PET/CT images based on abnormal 18F-FDG avidity and anatomical features, such as lymph node enlargement, they cannot reliably determine the specific lymphoma subtype solely based on imaging findings [4].
Given the limitations of visual inspection of medical images for lymphoma classification and the challenges associated with histopathological examination, the implementation of AI models that predict lymphoma subtypes based on PET/CT scans in clinical practice would be highly beneficial for physicians involved in the management of patients with lymphoma. Such models have been recently proposed by two research groups.
Xu et al. [4] developed a hybrid few-shot multiple-instance learning model to distinguish between FL and DLBCL using PET/CT images from 61 NHL patients. The model achieved an area under the curve (AUC) of 0.795 and outperformed its traditional counterparts, which used radiomic features, random forests, and support vector machines (SVMs). Their approach utilized hybrid features generated by fusing DL-based and radiomic features, which enhanced classification precision. Combining hybrid features with self-supervised learning, few-shot learning, and multiple-instance learning efficiently addresses challenges related to data scarcity and incomplete annotations.
A more thorough NHL classification was sought by Wang and Jiang [27], who built a light-weighted pseudo-spatial-temporal radiomic network for the classification of NHL into eight different subtypes, namely, DLBCL, anaplastic large cell lymphoma, Burkitt lymphoma (BL), NK/T cell lymphoma, FL, MCL, peripheral T cell lymphoma, and lymphoblastic lymphoma, using PET/CT images from 80 patients. The model employed a novel radiomics method that extracted both global and local radiomic features from lymphoma PET/CT images to enhance intra-tumor heterogeneity classification and thus enable accurate NHL subtyping. This method outperformed its competing ones, achieving an AUC of 0.902.

3.4.3. Differential Diagnosis

The differential diagnosis of lymphomas is extensive as numerous diseases can mimic the clinical and radiological appearance of these malignancies. Given that lymphomas require very specific therapeutic approaches, rapid and accurate differentiation from other diseases resembling them is paramount. This ensures the timely initiation of the indicated therapeutic plan, prevents unnecessary and inappropriate treatments, and ultimately improves the patient’s outcome.
Differentiation between sarcoidosis and DLBCL is crucial as sarcoidosis is a benign systematic disease managed by a watch-and-wait approach or immunosuppressive therapy, whereas DLBCL exhibits an aggressive behavior and generally requires prompt treatment with immunochemotherapy [43,44]. While histopathological examination is the gold standard method for differentiating between the two entities, obtaining biopsies from lesions located in deep anatomical structures or near blood vessels can be especially difficult [28]. To address the above challenges, Aoki et al. [28] proposed a CNN model to distinguish sarcoidosis from DLBCL using PET/CT images. The model achieved a high level of performance and differentiated sarcoidosis from DLBCL based on differences in the distribution of 18F-FDG accumulation with an AUC of 0.963.
Primary breast lymphoma is a rare malignancy, accounting for 0.4% of all breast cancers, and DLBCL is recognized as its most common histological subtype [45]. Distinguishing breast lymphoma from the far more prevalent breast carcinoma is challenging as physical examination and standard imaging modalities, namely, mammography, ultrasonography (US), CT, and MRI, reveal overlapping characteristics [46,47]. Therefore, the diagnosis of breast lymphoma is usually feasible only after a breast biopsy. However, needle biopsy can lead to misdiagnosis, especially if necrotic tissue is sampled when puncturing large breast nodules containing necrotic regions. Furthermore, breast lymphoma is primarily managed with chemotherapy and radiation in contrast to breast carcinoma, which is mainly treated with surgery [48]. Nonetheless, due to the rarity of breast lymphoma and its clinical and radiological similarities with carcinoma, many lymphomatous nodules are initially misdiagnosed as carcinomas and surgically removed [29,48]. Considering the above, exploring non-invasive techniques to precisely distinguish breast lymphoma from carcinoma is clinically important. In this context, Chen et al. [29] designed a novel attention-based aggregate CNN model for the non-invasive and accurate differentiation of breast DLBCL from invasive ductal carcinoma using PET/CT images from 236 patients. The AUC of this model showed an improvement of nearly 5% compared to previous models, reaching 0.886 in an internal and 0.788 in an external dataset.
A significant differential diagnostic challenge frequently encountered in clinical practice is the distinction between metastatic and lymphomatous cervical lymphadenopathy. This difficulty arises from the overlapping clinical and radiological features of the two entities. While US-guided fine needle aspiration cytology (FNAC) is a valuable tool for lymphadenopathy investigation, it presents certain limitations that inhibit timely and precise lymphoma diagnosis. More specifically, the adequacy of the sample can vary significantly depending on the physician’s expertise and the specific location of the suspicious lymph node. Additionally, FNAC may lead to misdiagnosis as it may fail to capture the entire tumor architecture, while there is a risk of misinterpreting lymphomas, especially low-grade NHLs with minor cytomorphological atypia, as reactive lymphoid hyperplasia and vice versa [30,49]. In response to these difficulties, Yang et al. [30] developed a ResNet50-based model to distinguish between lymph node metastases from solid tumors and lymphoma involvement in enlarged cervical lymph nodes using PET/CT scans. Their model integrated both handcrafted and DL-based radiomic features and achieved a remarkable performance, reaching an AUC of 0.948. This is the first study to use DL to distinguish metastatic from lymphomatous enlarged cervical lymph nodes, paving the way for further research in this direction.

3.4.4. Metabolic Tumor Volume Estimation

MTV is a measure of the metabolically active volume of tumors derived from 18F-FDG PET. It is calculated as the sum of the volume of voxels with a standardized uptake value (SUV) that exceeds a particular threshold within the tumor [50]. MTV has demonstrated substantial prognostic value in both HL [51] and various types of NHL, including DLBCL [52], FL [53], MCL [54], BL [55], primary mediastinal large B cell lymphoma [56], peripheral T cell lymphoma [57], and T cell lymphoblastic lymphoma [58]. The quantification of MTV through the manual delineation of multiple lesions throughout the body constitutes a time-consuming and operator-dependent process. To address this issue, several studies have focused on developing automated methods for the rapid, reliable, and consistent computation of MTV of lymphomas as part of daily workflow.
A couple of research groups have studied various lymphoma types. To facilitate automated MTV calculation from lymphoma PET/CT images, Yousefirizi et al. [21] constructed a reliable two-step cascaded segmentation approach, the predictions of which demonstrated a strong correlation with ground truth values with an R2 of 0.89. Similarly, another research group developed an automated method for the quantification of MTV from PET/CT images of patients with 18F-FDG-avid lymphomas, namely, DLBCL, FL, and HL. The model performed impressively with a Spearman’s correlation coefficient (SCC) between the manually extracted and predicted MTV reaching 0.98 [31].
Other studies have focused on specific types of lymphoma. One of them proposed a three-dimensional patch-based, multi-resolution pathway CNN architecture for the extraction of MTV from baseline PET/CT images of pediatric HL cases. The study demonstrated strong agreement between the automated and physician-determined MTV with a Pearson’s correlation coefficient (PCC) of 0.88 although the former was slightly underestimated [32]. Furthermore, Kuker et al. [33] developed a DL model to automatize the estimation of MTV from PET/CT images of DLBCL and reported high concordance in calculations between the automated method and two nuclear medicine physicians with PCCs and intraclass correlation coefficients (ICCs) around 0.98. The same task was also addressed by another research group, which reported that, although their model tended to slightly underestimate MTV, the predicted values showed a strong correlation with the ground truth measurements, achieving an R2 of 0.82 in an external validation cohort [34]. Finally, Jiang et al. [35] constructed a three-dimensional U-Net architecture to quantify MTV in DLBCL using only PET images. They reported a notable positive correlation between the ground truth and computer-derived MTV with an R2 of 0.94 in an external validation cohort. In addition, the predicted MTV was proven to be an independent prognostic factor of progression-free survival (PFS) and overall survival (OS) in DLBCL patients.

3.4.5. Treatment Response Prediction

Treatment response prediction can be very helpful to healthcare providers involved in the management of patients with lymphoma as it can guide therapeutic decisions and personalize treatment plans. Recent advancements in AI have led to a growing interest in utilizing DL models to evaluate treatment outcomes based on PET images.
Ferrández et al. [36] investigated the capability of a CNN to predict the treatment outcomes of patients with DLBCL using maximum intensity projection (MIP) images from baseline PET/CT scans. Their model was designed to predict whether a patient would experience a short time to progression (TTP) of up to two years or not and achieved an AUC of 0.74 in an external dataset, demonstrating superior performance compared to an international prognostic index (IPI)-based model. In a subsequent study, the authors aimed to validate their method in five external datasets from different clinical trials and showed that their model remained predictive of treatment outcome and outperformed the IPI score in all five independent clinical trials [59]. Similarly, another scientific group used a DL-based method on pre-treatment PET images to predict the response of DLBCL and FL to chimeric antigen receptor (CAR) T cell therapy. The average AUC of the lesion-level response prediction using single whole slice-based input was 0.93. Additionally, the model could adequately predict patient-level treatment response using a rule-based reasoning approach applied to the lesion-level prediction results. This methodology holds promise for providing valuable prognostic insights for clinical decision-making prior to the initiation of CAR T cell therapy [37].
Yuan et al. [38] developed a multimodal DL model to predict primary treatment failure in DLBCL using interim PET/CT images. Their hybrid learning model was optimized with a contrastive training objective and achieved an AUC of 0.925 in an external dataset. The ability of the model to accurately predict primary treatment failure in patients with DLBCL receiving the front-line standard of care suggests that it has the potential to guide personalized treatment decisions. Furthermore, another study proposed a DL-based algorithm for assessing treatment response in NHL according to the Lugano 2014 classification using PET/CT scans. The Lugano 2014 classification is currently the standard method for the assessment of treatment response in NHL. It considers four categories of treatment response based on PET/CT images, namely, complete metabolic response (CMR), partial metabolic response, no metabolic response, and progressive metabolic disease. Obtaining a CMR by the end of treatment offers prognostic value. Hence, the authors evaluated the performance of the model in distinguishing between CMR and non-CMR cases and reported sensitivities and specificities ranging from 0.85 to 0.97 and from 0.50 to 0.78, respectively, across different test datasets [39].

3.4.6. Survival Prediction

The survival of patients with lymphoma varies significantly among individuals [60]. The integration of radiomics and AI into prognosis prediction tools aims to achieve more accurate prognostic evaluation and improve clinical decision-making for patients with lymphoma [61].
Identifying DLBCL patients at increased risk of relapse who would benefit from intensified chemotherapy is critical, given that 30–40% of DLBCL patients experience disease recurrence and die [22,62]. Although widely used in clinical practice, the IPI has been found to be insufficient to precisely identify DLBCL patients with an unfavorable prognosis [22,63]. In a retrospective study of 684 DLBCL patients conducted by Jiang et al. [22], DL scores, namely, DL-based imaging biomarkers, were extracted from PET/CT images and integrated along with clinical risk factors and metabolic metrics into a multiparametric model for prognosis prediction. This model outperformed the IPI and a conventional model that only comprised metabolic metrics and clinical variables, achieving C-indices for PFS and OS of 0.760 and 0.770, respectively, in an external validation cohort. Similarly, Qian et al. [23] developed another multiparametric model that combined DL-based radiomic features with clinical and PET parameters for prognosis prediction in DLBCL patients. The model achieved C-indices for PFS and OS of 0.770 and 0.771, respectively, in an external validation dataset. The above studies indicate that DL-based radiomic features could operate as reliable imaging biomarkers for precise survival prediction in DLBCL patients and that multiparametric models incorporating these features could enable personalized prognosis assessment and customized treatment strategies for DLBCL patients [22,23].
ENKTL-NT is an uncommon type of lymphoma with a poor prognosis [64,65]. This aggressive neoplasm is strongly associated with Epstein-Barr virus (EBV) and is more frequent in Asia and Central and South America, where it accounts for 5–15% of all lymphomas, compared to the United States and Europe, where it represents only 0.2–0.4% of NHLs [66]. Due to its rarity and the resulting limited data availability, predicting the prognosis of ENKTL-NT is challenging. Although studies have shown that some PET metrics, such as SUVmax, mean SUV (SUVmean), MTV, and total lesion glycolysis (TLG), are correlated with survival, these findings are considered controversial [40]. To overcome the issue of data scarcity, Guo et al. [40] developed a weakly supervised DL model for the prognosis prediction of ENKTL-NT using PET/CT scans from 167 patients. The proposed method successfully included 83 patients with missing or incomplete follow-up data together with 64 patients with complete follow-up data in the training process. The model achieved an AUC of 0.875 for the prediction of PFS and outperformed a conventional model trained solely on patients with complete follow-up data.
To further enhance survival prediction and risk stratification in patients with lymphoma, recent studies have explored novel metrics that reflect tumor dissemination and are derived from baseline PET/CT scans. These include the maximum distance between two lesions (Dmax) as well as the largest distance between the spleen and a lesion (Dspleen) [67,68]. It has been shown that high Dmax normalized with the body surface area and MTV constitute independent prognostic factors in DLBCL, which complement each other in characterizing the tumor by capturing two different aspects of the disease, namely, its spread and burden, respectively [67]. Moreover, a study has demonstrated that Dspleen normalized with the body surface area strongly predicts survival in DLBCL and its integration with MTV and IPI can further improve prognostication [68]. The quantification of the above PET-based biomarkers can be assisted by DL tools, bridging the gap between AI and clinical workflows. Indeed, Girum et al. [69] developed a DL model for the automated estimation of surrogate Dmax from only two MIP images of whole-body PET scans in patients with DLBCL. The AI-derived surrogate Dmax was highly correlated with its original counterpart and was effective in predicting PFS and OS, offering a streamlined approach to prognostication in DLBCL patients [69].

4. Discussion

Over the past decade, the implementation of AI in healthcare and particularly in medical imaging has increased. As a result, a growing number of DL models are continuously being developed for diverse diagnostic and prognostic tasks, many of which are based on imaging data and concern malignant diseases. A substantial effort has been made by several scientific groups to summarize the findings of studies that propose such methodologies in order to inform future research and guide clinical applications. For instance, a recent systematic review focused on DL models for the diagnosis of lung cancer from CT images [70], while another one investigated AI models for the identification of ovarian cancer in US, MRI, and CT scans [71]. To the best of our knowledge, this is the first systematic review to specifically address the performance and feasibility of DL-based methods for the computer-aided analysis of lymphoma PET images.

4.1. General Interpretation of the Results

The constant effort of numerous research groups to design DL methods for the computer-aided interpretation of PET scans of individuals with lymphoma represents a major advancement in medical imaging with significant potential to benefit the management of patients. These models showcase promising results across several medical tasks and can therefore assist medical professionals through diverse applications. DL methods for the automatic detection of lymphoma in PET images can ease clinicians’ workload and expedite the identification of multiple lesions with sensitivities ranging from 76% to 85% and FPs per patient between four and 14 [24,25,26]. DL models can also aid in the histological classification of lymphoma using PET scans with AUCs of 0.795 and 0.902 for binary and 8-class subtyping tasks, respectively [4,27]. In addition, PET-based DL algorithms can be helpful in distinguishing lymphomas from other pathological conditions, such as sarcoidosis, breast carcinoma, and lymph node metastases from solid tumors, with AUCs ranging from 0.788 to 0.963 depending on the specific differential diagnosis task [28,29,30]. Furthermore, DL models utilizing PET images can reliably quantify lymphoma MTV, a process that is time-consuming and operator-dependent when performed manually, achieving strong correlations with ground truth measurements with R2 values of 0.82–0.94 [21,31,32,33,34,35]. Predictive modeling of treatment outcomes and survival constitutes another area where AI shows considerable promise. DL algorithms based on PET scans can effectively anticipate treatment responses in patients with lymphoma and potentially guide personalized therapeutic approaches, demonstrating AUCs between 0.74 and 0.93 [36,37,38,39]. Additionally, DL methods using PET images can provide valuable prognostic insights regarding the survival of lymphoma patients with C-indices of 0.760–0.770 and 0.770–0.771 for PFS and OS, respectively [22,23,40].
The studies included in our systematic review varied widely in regard to their medical objectives. Almost 43% of them centered on the quantification of disease burden, either by simply detecting lymphomatous lesions [24,25,26] or by calculating MTV [21,31,32,33,34,35]. The constant effort by multiple research groups to develop models for this specific task is not surprising, given that the manual evaluation of tumor burden is a challenging and time-consuming procedure and that tumor burden is considered a substantial prognostic factor not only in lymphomas [72] but also in numerous kinds of solid tumors [73,74]. More specifically, one study presented tumor burden as the most significant prognostic marker in early-stage HL [75]. In addition, the prognostic value of disease burden has been demonstrated in several types of NHL, including DLBCL [76], FL [77], and BL [78]. Based on the above, the development of automated methods for the precise and reliable quantification of tumor burden in lymphomas is clinically relevant and could facilitate treatment planning. Interestingly, a recent study proposed a standardized MTV measurement workflow for lymphoma that allowed for highly reproducible MTV quantifications and could act as a reference to test improvements in the calculation of MTV using AI tools [79].
Two of the articles described in our paper considered only pediatric participants [26,32]. Notably, Wang et al. [26] developed a DL algorithm for the automatic detection of lymphoma lesions in PET/MRI scans of pediatric patients. This kind of model is particularly relevant and impactful for two key reasons. Firstly, lymphoma represents the most common type of cancer in adolescents aged 15–19 years and the third most frequent in children up to 14 years old [80]. Secondly, approximately 54–81% of the radiation dose from a whole-body PET/CT scan is attributed to the CT examination, while the remaining amount is the result of the injection of the PET radioactive tracer [81]. The replacement of the CT component with MRI has been reported to reduce the radiation dose by 45–48% [82,83], which is especially important for young patients, who are more sensitive to the potential harmful effects of ionizing radiation [84].

4.2. Limitations of the Included Studies

Despite their promising results, the included studies exhibited some limitations. The number of participants across the studies ranged from 39 to 2030 with a median of 167, while roughly one-quarter of the studies included PET images from fewer than 100 patients. The limited dataset size for model development may have affected the performance and predictive ability of the DL models used in these studies [40]. However, there is no consensus regarding the number of patient cases necessary for training and developing a specific DL model, while the dataset size can vary significantly depending on the complexity of the task and the heterogeneity of the training sample [25]. Furthermore, over half of the studies did not conduct external validation, which is essential for assessing a model’s reproducibility and generalizability to diverse and unseen patient populations [85,86]. External validation corroborates a model’s reputation and trustworthiness; therefore, its absence in these studies reduces the potential of their models for clinical implementation [85,86,87]. Another limitation observed in some studies was the need for the manual delineation of lymphoma lesions in medical images during the preprocessing stage, which can be time-consuming and prone to high inter-reader variability [88].

4.3. Limitations of the Review

Our systematic review has certain limitations that should be acknowledged. Firstly, our search was confined to a single database, namely, MEDLINE, which was chosen for its extensive coverage of biomedical literature. Additionally, given the rapid pace of advancements in the emerging field that our review focuses on, some relevant articles may have been released after our search was conducted and before our manuscript was published. Furthermore, we focused exclusively on DL models that used images and excluded studies proposing other ML approaches based on different kinds of data that could still provide insights about patients with lymphoma. In addition, we only considered algorithms for the interpretation of lymphoma PET images and did not include studies that presented denoising techniques [89,90] and others that aimed at reducing radiation exposure by generating high-quality diagnostic PET images from ultra-low-dose PET scans [91]. Another limitation of our review is that we excluded studies that solely addressed tumor segmentation in order to concentrate on those dedicated to broader interpretative tasks. Lastly, publication bias could potentially exist since our systematic review was restricted to published literature, which may not represent the entire spectrum of available research. Studies that propose highly accurate models are more likely to be published than others with less attractive results [92].

4.4. Implications of the Results for Practice and Future Research

The shortage of clinicians coupled with an increasing patient population has led to an overwhelming workload for healthcare professionals, which can negatively impact their decision-making. Conversely, there is evidence to suggest that a reduction in the clinical workload can improve clinical judgments and therefore positively influence patient outcomes [93]. Excessive workload has also been associated with burnout [94], which in turn negatively affects not only the health of the physicians but also the delivery and cost of care [95]. In this context, the careful integration of AI into healthcare is considered a promising strategy to potentially alleviate the workload of physicians across various clinical activities and thereby mitigate burnout and augment the quality of care [96].
The goal of producing computer-aided methods for medical tasks is not to replace physicians but rather to support them by leveraging the computational power and data processing capabilities of AI [97]. With their ability to perform rapid and precise calculations, DL-based models could assist nuclear medicine physicians, radiologists, and pathologists in managing high volumes of medical images and tissue specimens. In addition, AI algorithms can detect subtle patterns, anomalies, and indicators of disease that may be overlooked by the human eye [98]. Another potential use of AI models is smart worklist prioritization, which could enable the identification of scans that physicians should examine promptly [99]. Ultimately, AI holds the potential to provide physicians with decision support tools to enhance their ability to deliver precise care and reduce their, often overwhelming, workload.
Two principal methodologies are currently used to develop AI-driven imaging biomarkers: radiomics and DL [100]. Radiomics refers to the extraction of quantitative features from medical scans to describe tissue and lesion characteristics, such as shape and intensity [101,102,103]. Radiomics has demonstrated significant potential in cancer screening, diagnosis, prognosis, and treatment response assessment [104]. The predictive value of radiomics can be enhanced through its integration with other types of data. Notably, recent findings indicate that PET/CT radiomics combined with liquid biopsy provides improved outcome prediction in patients with lymphoma compared to imaging alone [105]. Radiomic features can function as inputs for ML models, which in turn can predict various outcomes [100]. In addition to standard radiomics, which relies on static snapshots, delta radiomics emerges as an innovative approach that captures changes in tissues and lesions over time, which has potential in regard to outcome prediction [106,107]. While traditional radiomics methods have proven utility, they are often criticized for introducing human bias and for being prone to reproducibility issues due to the variability in imaging techniques and manual delineation, which is highly observer-dependent [108,109]. In contrast, DL algorithms can automatically extract features directly from raw images without the need for prior manual segmentation [109,110]. Both radiomics and DL are rapidly evolving technologies, which can be combined to revolutionize the field of medical imaging and drive the advancement of precision medicine [111].
The extensive use of PET for lymphoma evaluation has generated an unprecedented volume of imaging data. As a result, there is an urgent need for advanced analytical methods to effectively manage and interpret this data deluge. In this context, DL has emerged as a crucial tool, offering sophisticated methods for analyzing lymphoma PET images that enable timely and precise detection, histological subtyping, differential diagnosis, and prognostication.
While DL presents immense opportunities in regard to lymphoma research, several challenges must be addressed before these models can be deployed in clinical settings. The high heterogeneity of multiple lymphoma subtypes characterized by distinct biological profiles, diverse imaging appearances, and variable clinical outcomes complicates the development of DL models that can effectively generalize across all subtypes [112,113]. Additionally, variations in imaging equipment, scanning protocols, and population characteristics across institutions may restrict the usefulness of AI models trained on data from specific technical, clinical, and socioeconomic settings [114]. Another concern is that AI algorithms often operate as “black boxes” since their decision-making processes might remain obscure and not easily understandable [114,115,116]. This lack of transparency challenges the reliability of DL models as it may be difficult for clinicians to comprehend and trust the rationale behind their predictions, which may hamper the implementation of DL tools in clinical practice [115]. In addition, the opacity of AI methods may hinder the patients’ understanding and thereby affect their ability to provide informed consent and exercise their autonomy [117]. However, when the effectiveness of such models has been sufficiently proven through rigorous validation and robust performance metrics, the cost of providing explanation may exceed the potential benefit [117,118].
To address these limitations, it is vital to utilize large, precisely labeled, and diverse datasets for training and validation, establish standardized imaging protocols, and encourage collaboration among AI developers, clinicians, ethicists, and regulatory bodies. Such efforts are essential to ensure that DL models effectively meet high standards of performance, safety, and fairness across diverse subpopulations while proactively addressing potential biases and ethical concerns [114,119,120]. This holistic approach will facilitate the smooth integration of DL into clinical practice, enhancing physicians’ ability to deliver precise care to lymphoma patients and thereby improve clinical outcomes.
Through our systematic review, we identified several areas where future research regarding the DL-based analysis of lymphoma PET images could focus on. Subsequent studies could further explore models that utilize PET/MRI scans for the evaluation of lymphomas, which are underrepresented in the current literature compared to PET/CT images. Specifically, only one of the 21 articles included in our review was based on PET/MRI. The field would also benefit from the development of innovative methods tailored specifically to pediatric lymphomas, which exhibit distinct epidemiological characteristics, histological distribution, and biological behavior compared to adult cases. Furthermore, DL algorithms for the computer-aided staging of lymphomas using PET images are currently lacking, highlighting a potential direction for future investigation. Finally, the availability of large, high-quality, and accessible datasets of diverse PET images of patients with lymphoma would facilitate the utilization of a wide range of patient populations and lymphoma types, thereby enabling the production of robust models with enhanced generalizability and promoting fairness by ensuring equitable representation across different demographic and clinical subgroups.

5. Conclusions

DL models for the analysis of lymphoma PET images have achieved promising performance across different medical tasks, namely, detection, classification, differential diagnosis, MTV calculation, and treatment response and survival prediction. They are not designed to replace physicians but to aid them in handling large volumes of scans, alleviate their workload, and offer them decision support tools that can improve their ability to provide precise care to their patients. Despite the encouraging results of automated methods for analyzing lymphoma PET images, it appears that there is still a gap to be bridged before these models can reach the necessary maturity for widespread use in clinical settings.

Author Contributions

Conceptualization, T.K., E.B. and T.P.Z.; methodology, T.K. and E.B.; writing—original draft preparation, T.K. and E.B.; writing—review and editing, T.P.Z.; visualization, T.K. and E.B.; supervision, T.P.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

References

  1. Mugnaini, E.N.; Ghosh, N. Lymphoma. Prim. Care 2016, 43, 661–675. [Google Scholar] [CrossRef] [PubMed]
  2. Munir, F.; Hardit, V.; Sheikh, I.N.; AlQahtani, S.; He, J.; Cuglievan, B.; Hosing, C.; Tewari, P.; Khazal, S. Classical Hodgkin Lymphoma: From Past to Future-A Comprehensive Review of Pathophysiology and Therapeutic Advances. Int. J. Mol. Sci. 2023, 24, 10095. [Google Scholar] [CrossRef] [PubMed]
  3. Heneghan, M.B.; Belsky, J.A.; Milgrom, S.A.; Forlenza, C.J. The Pediatric Approach to Hodgkin Lymphoma. Semin. Hematol. 2024, 61, 245–252. [Google Scholar] [CrossRef] [PubMed]
  4. Xu, C.; Feng, J.; Yue, Y.; Cheng, W.; He, D.; Qi, S.; Zhang, G. A Hybrid Few-Shot Multiple-Instance Learning Model Predicting the Aggressiveness of Lymphoma in PET/CT Images. Comput. Methods Programs Biomed. 2024, 243, 107872. [Google Scholar] [CrossRef]
  5. Xu-Monette, Z.Y.; Young, K.H. Therapeutic Vaccines for Aggressive B-Cell Lymphoma. Leuk. Lymphoma 2020, 61, 3038–3051. [Google Scholar] [CrossRef]
  6. Pavanello, F.; Steffanoni, S.; Ghielmini, M.; Zucca, E. Systemic Front Line Therapy of Follicular Lymphoma: When, to Whom and How. Mediterr. J. Hematol. Infect. Dis. 2016, 8, e2016062. [Google Scholar] [CrossRef]
  7. Rong, J.; Haider, A.; Jeppesen, T.E.; Josephson, L.; Liang, S.H. Radiochemistry for Positron Emission Tomography. Nat. Commun. 2023, 14, 3257. [Google Scholar] [CrossRef] [PubMed]
  8. Berger, A. How Does It Work? Positron Emission Tomography. BMJ 2003, 326, 1449. [Google Scholar] [CrossRef]
  9. Shagam, J.Y. Positron Emission Tomography. Radiol. Technol. 2001, 72, 551–569; quiz 570–572. [Google Scholar] [PubMed]
  10. Ansell, S.M.; Armitage, J.O. Positron Emission Tomographic Scans in Lymphoma: Convention and Controversy. Mayo Clin. Proc. 2012, 87, 571–580. [Google Scholar] [CrossRef] [PubMed]
  11. Pettit, R.W.; Fullem, R.; Cheng, C.; Amos, C.I. Artificial Intelligence, Machine Learning, and Deep Learning for Clinical Outcome Prediction. Emerg. Top. Life Sci. 2021, 5, 729–745. [Google Scholar] [CrossRef] [PubMed]
  12. Yamashita, R.; Nishio, M.; Do, R.K.G.; Togashi, K. Convolutional Neural Networks: An Overview and Application in Radiology. Insights Imaging 2018, 9, 611–629. [Google Scholar] [CrossRef] [PubMed]
  13. van der Laak, J.; Litjens, G.; Ciompi, F. Deep Learning in Histopathology: The Path to the Clinic. Nat. Med. 2021, 27, 775–784. [Google Scholar] [CrossRef] [PubMed]
  14. Dey, P. The Emerging Role of Deep Learning in Cytology. Cytopathology 2021, 32, 154–160. [Google Scholar] [CrossRef] [PubMed]
  15. Mohseni, A.; Ghotbi, E.; Kazemi, F.; Shababi, A.; Jahan, S.C.; Mohseni, A.; Shababi, N. Artificial Intelligence in Radiology: What Is Its True Role at Present, and Where Is the Evidence? Radiol. Clin. N. Am. 2024, 62, 935–947. [Google Scholar] [CrossRef]
  16. Artesani, A.; Bruno, A.; Gelardi, F.; Chiti, A. Empowering PET: Harnessing Deep Learning for Improved Clinical Insight. Eur. Radiol. Exp. 2024, 8, 17. [Google Scholar] [CrossRef] [PubMed]
  17. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. Int. J. Surg. 2021, 88, 105906. [Google Scholar] [CrossRef]
  18. Ouzzani, M.; Hammady, H.; Fedorowicz, Z.; Elmagarmid, A. Rayyan-a Web and Mobile App for Systematic Reviews. Syst. Rev. 2016, 5, 210. [Google Scholar] [CrossRef]
  19. Wolff, R.F.; Moons, K.G.M.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S.; PROBAST Group. PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies. Ann. Intern. Med. 2019, 170, 51–58. [Google Scholar] [CrossRef]
  20. Moons, K.G.M.; Wolff, R.F.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S. PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration. Ann. Intern. Med. 2019, 170, W1–W33. [Google Scholar] [CrossRef] [PubMed]
  21. Yousefirizi, F.; Klyuzhin, I.S.; Joo Hyun, O.; Harsini, S.; Tie, X.; Shiri, I.; Shin, M.; Lee, C.; Cho, S.Y.; Bradshaw, T.J.; et al. TMTV-Net: Fully Automated Total Metabolic Tumor Volume Segmentation in Lymphoma PET/CT Images—A Multi-Center Generalizability Analysis. Eur. J. Nucl. Med. Mol. Imaging 2024, 51, 1937–1954. [Google Scholar] [CrossRef]
  22. Jiang, C.; Qian, C.; Jiang, Z.; Teng, Y.; Lai, R.; Sun, Y.; Ni, X.; Ding, C.; Xu, Y.; Tian, R. Robust Deep Learning-Based PET Prognostic Imaging Biomarker for DLBCL Patients: A Multicenter Study. Eur. J. Nucl. Med. Mol. Imaging 2023, 50, 3949–3960. [Google Scholar] [CrossRef] [PubMed]
  23. Qian, C.; Jiang, C.; Xie, K.; Ding, C.; Teng, Y.; Sun, J.; Gao, L.; Zhou, Z.; Ni, X. Prognosis Prediction of Diffuse Large B-Cell Lymphoma in 18F-FDG PET Images Based on Multi-Deep-Learning Models. IEEE J. Biomed. Health Inform. 2024, 28, 4010–4023. [Google Scholar] [CrossRef] [PubMed]
  24. Zhou, Z.; Jain, P.; Lu, Y.; Macapinlac, H.; Wang, M.L.; Son, J.B.; Pagel, M.D.; Xu, G.; Ma, J. Computer-Aided Detection of Mantle Cell Lymphoma on 18F-FDG PET/CT Using a Deep Learning Convolutional Neural Network. Am. J. Nucl. Med. Mol. Imaging 2021, 11, 260–270. [Google Scholar] [PubMed]
  25. Weisman, A.J.; Kieler, M.W.; Perlman, S.B.; Hutchings, M.; Jeraj, R.; Kostakoglu, L.; Bradshaw, T.J. Convolutional Neural Networks for Automated PET/CT Detection of Diseased Lymph Node Burden in Patients with Lymphoma. Radiol. Artif. Intell. 2020, 2, e200016. [Google Scholar] [CrossRef] [PubMed]
  26. Wang, H.; Sarrami, A.; Wu, J.T.-Y.; Baratto, L.; Sharma, A.; Wong, K.C.L.; Singh, S.B.; Daldrup-Link, H.E.; Syeda-Mahmood, T. Multimodal Pediatric Lymphoma Detection Using PET and MRI. AMIA Annu. Symp. Proc. 2023, 2023, 736–743. [Google Scholar] [PubMed]
  27. Wang, M.; Jiang, H. PST-Radiomics: A PET/CT Lymphoma Classification Method Based on Pseudo Spatial-Temporal Radiomic Features and Structured Atrous Recurrent Convolutional Neural Network. Phys. Med. Biol. 2023, 68, 235014. [Google Scholar] [CrossRef]
  28. Aoki, H.; Miyazaki, Y.; Anzai, T.; Yokoyama, K.; Tsuchiya, J.; Shirai, T.; Shibata, S.; Sakakibara, R.; Mitsumura, T.; Honda, T.; et al. Deep Convolutional Neural Network for Differentiating between Sarcoidosis and Lymphoma Based on [18F]FDG Maximum-Intensity Projection Images. Eur. Radiol. 2024, 34, 374–383. [Google Scholar] [CrossRef] [PubMed]
  29. Chen, W.; Liu, F.; Wang, R.; Qi, M.; Zhang, J.; Liu, X.; Song, S. End-to-End Deep Learning Radiomics: Development and Validation of a Novel Attention-Based Aggregate Convolutional Neural Network to Distinguish Breast Diffuse Large B-Cell Lymphoma from Breast Invasive Ductal Carcinoma. Quant. Imaging Med. Surg. 2023, 13, 6598–6614. [Google Scholar] [CrossRef] [PubMed]
  30. Yang, Y.; Zheng, B.; Li, Y.; Li, Y.; Ma, X. Computer-Aided Diagnostic Models to Classify Lymph Node Metastasis and Lymphoma Involvement in Enlarged Cervical Lymph Nodes Using PET/CT. Med. Phys. 2023, 50, 152–162. [Google Scholar] [CrossRef]
  31. Revailler, W.; Cottereau, A.S.; Rossi, C.; Noyelle, R.; Trouillard, T.; Morschhauser, F.; Casasnovas, O.; Thieblemont, C.; Gouill, S.L.; André, M.; et al. Deep Learning Approach to Automatize TMTV Calculations Regardless of Segmentation Methodology for Major FDG-Avid Lymphomas. Diagnostics 2022, 12, 417. [Google Scholar] [CrossRef] [PubMed]
  32. Weisman, A.J.; Kim, J.; Lee, I.; McCarten, K.M.; Kessel, S.; Schwartz, C.L.; Kelly, K.M.; Jeraj, R.; Cho, S.Y.; Bradshaw, T.J. Automated Quantification of Baseline Imaging PET Metrics on FDG PET/CT Images of Pediatric Hodgkin Lymphoma Patients. EJNMMI Phys. 2020, 7, 76. [Google Scholar] [CrossRef] [PubMed]
  33. Kuker, R.A.; Lehmkuhl, D.; Kwon, D.; Zhao, W.; Lossos, I.S.; Moskowitz, C.H.; Alderuccio, J.P.; Yang, F. A Deep Learning-Aided Automated Method for Calculating Metabolic Tumor Volume in Diffuse Large B-Cell Lymphoma. Cancers 2022, 14, 5221. [Google Scholar] [CrossRef]
  34. Blanc-Durand, P.; Jégou, S.; Kanoun, S.; Berriolo-Riedinger, A.; Bodet-Milin, C.; Kraeber-Bodéré, F.; Carlier, T.; Le Gouill, S.; Casasnovas, R.-O.; Meignan, M.; et al. Fully Automatic Segmentation of Diffuse Large B Cell Lymphoma Lesions on 3D FDG-PET/CT for Total Metabolic Tumour Volume Prediction Using a Convolutional Neural Network. Eur. J. Nucl. Med. Mol. Imaging 2021, 48, 1362–1370. [Google Scholar] [CrossRef] [PubMed]
  35. Jiang, C.; Chen, K.; Teng, Y.; Ding, C.; Zhou, Z.; Gao, Y.; Wu, J.; He, J.; He, K.; Zhang, J. Deep Learning-Based Tumour Segmentation and Total Metabolic Tumour Volume Prediction in the Prognosis of Diffuse Large B-Cell Lymphoma Patients in 3D FDG-PET Images. Eur. Radiol. 2022, 32, 4801–4812. [Google Scholar] [CrossRef] [PubMed]
  36. Ferrández, M.C.; Golla, S.S.V.; Eertink, J.J.; de Vries, B.M.; Lugtenburg, P.J.; Wiegers, S.E.; Zwezerijnen, G.J.C.; Pieplenbosch, S.; Kurch, L.; Hüttmann, A.; et al. An Artificial Intelligence Method Using FDG PET to Predict Treatment Outcome in Diffuse Large B Cell Lymphoma Patients. Sci. Rep. 2023, 13, 13111. [Google Scholar] [CrossRef] [PubMed]
  37. Tong, Y.; Udupa, J.K.; Chong, E.; Winchell, N.; Sun, C.; Zou, Y.; Schuster, S.J.; Torigian, D.A. Prediction of Lymphoma Response to CAR T Cells by Deep Learning-Based Image Analysis. PLoS ONE 2023, 18, e0282573. [Google Scholar] [CrossRef]
  38. Yuan, C.; Shi, Q.; Huang, X.; Wang, L.; He, Y.; Li, B.; Zhao, W.; Qian, D. Multimodal Deep Learning Model on Interim [18F]FDG PET/CT for Predicting Primary Treatment Failure in Diffuse Large B-Cell Lymphoma. Eur. Radiol. 2023, 33, 77–88. [Google Scholar] [CrossRef] [PubMed]
  39. Jemaa, S.; Ounadjela, S.; Wang, X.; El-Galaly, T.C.; Kostakoglu, L.; Knapp, A.; Ku, G.; Musick, L.; Sahin, D.; Wei, M.C.; et al. Automated Lugano Metabolic Response Assessment in 18F-Fluorodeoxyglucose-Avid Non-Hodgkin Lymphoma with Deep Learning on 18F-Fluorodeoxyglucose-Positron Emission Tomography. J. Clin. Oncol. 2024, 42, 2966–2977. [Google Scholar] [CrossRef]
  40. Guo, R.; Hu, X.; Song, H.; Xu, P.; Xu, H.; Rominger, A.; Lin, X.; Menze, B.; Li, B.; Shi, K. Weakly Supervised Deep Learning for Determining the Prognostic Value of 18F-FDG PET/CT in Extranodal Natural Killer/T Cell Lymphoma, Nasal Type. Eur. J. Nucl. Med. Mol. Imaging 2021, 48, 3151–3161. [Google Scholar] [CrossRef] [PubMed]
  41. Ngeow, J.Y.Y.; Quek, R.H.H.; Ng, D.C.E.; Hee, S.W.; Tao, M.; Lim, L.C.; Tan, Y.H.; Lim, S.T. High SUV Uptake on FDG-PET/CT Predicts for an Aggressive B-Cell Lymphoma in a Prospective Study of Primary FDG-PET/CT Staging in Lymphoma. Ann. Oncol. 2009, 20, 1543–1547. [Google Scholar] [CrossRef] [PubMed]
  42. Bodet-Milin, C.; Kraeber-Bodéré, F.; Moreau, P.; Campion, L.; Dupas, B.; Le Gouill, S. Investigation of FDG-PET/CT Imaging to Guide Biopsies in the Detection of Histological Transformation of Indolent Lymphoma. Haematologica 2008, 93, 471–472. [Google Scholar] [CrossRef] [PubMed]
  43. Matasar, M.J.; Zelenetz, A.D. Overview of Lymphoma Diagnosis and Management. Radiol. Clin. N. Am. 2008, 46, 175–198. [Google Scholar] [CrossRef]
  44. Gümüştaş, S.; Inan, N.; Akansel, G.; Başyïğït, I.; Cïftçi, E. Differentiation of Lymphoma versus Sarcoidosis in the Setting of Mediastinal-Hilar Lymphadenopathy: Assessment with Diffusion-Weighted MR Imaging. Sarcoidosis Vasc. Diffuse Lung Dis. 2013, 30, 52–59. [Google Scholar]
  45. Jennings, W.C.; Baker, R.S.; Murray, S.S.; Howard, C.A.; Parker, D.E.; Peabody, L.F.; Vice, H.M.; Sheehan, W.W.; Broughan, T.A. Primary Breast Lymphoma: The Role of Mastectomy and the Importance of Lymph Node Status. Ann. Surg. 2007, 245, 784–789. [Google Scholar] [CrossRef] [PubMed]
  46. Surov, A.; Holzhausen, H.-J.; Wienke, A.; Schmidt, J.; Thomssen, C.; Arnold, D.; Ruschke, K.; Spielmann, R.-P. Primary and Secondary Breast Lymphoma: Prevalence, Clinical Signs and Radiological Features. Br. J. Radiol. 2012, 85, e195–e205. [Google Scholar] [CrossRef] [PubMed]
  47. Picasso, R.; Tagliafico, A.; Calabrese, M.; Martinoli, C.; Pistoia, F.; Rossi, A.; Zaottini, F.; Derchi, L. Primary and Secondary Breast Lymphoma: Focus on Epidemiology and Imaging Features. Pathol. Oncol. Res. 2020, 26, 1483–1488. [Google Scholar] [CrossRef] [PubMed]
  48. Nicholson, B.T.; Bhatti, R.M.; Glassman, L. Extranodal Lymphoma of the Breast. Radiol. Clin. N. Am. 2016, 54, 711–726. [Google Scholar] [CrossRef] [PubMed]
  49. Ha, H.J.; Lee, J.; Kim, D.Y.; Kim, J.-S.; Shin, M.-S.; Noh, I.; Koh, J.S.; Kim, E.J.; Lee, S.-S. Utility and Limitations of Fine-Needle Aspiration Cytology in the Diagnosis of Lymphadenopathy. Diagnostics 2023, 13, 728. [Google Scholar] [CrossRef]
  50. Matsumoto, Y.; Baba, S.; Endo, M.; Setsu, N.; Iida, K.; Fukushi, J.-I.; Kawaguchi, K.; Okada, S.; Bekki, H.; Isoda, T.; et al. Metabolic Tumor Volume by 18F-FDG PET/CT Can Predict the Clinical Outcome of Primary Malignant Spine/Spinal Tumors. Biomed. Res. Int. 2017, 2017, 8132676. [Google Scholar] [CrossRef] [PubMed]
  51. Cottereau, A.-S.; Versari, A.; Loft, A.; Casasnovas, O.; Bellei, M.; Ricci, R.; Bardet, S.; Castagnoli, A.; Brice, P.; Raemaekers, J.; et al. Prognostic Value of Baseline Metabolic Tumor Volume in Early-Stage Hodgkin Lymphoma in the Standard Arm of the H10 Trial. Blood 2018, 131, 1456–1463. [Google Scholar] [CrossRef] [PubMed]
  52. Zhao, P.; Yu, T.; Pan, Z. Prognostic Value of the Baseline 18F-FDG PET/CT Metabolic Tumour Volume (MTV) and Further Stratification in Low-Intermediate (L-I) and High-Intermediate (H-I) Risk NCCNIPI Subgroup by MTV in DLBCL MTV Predict Prognosis in DLBCL. Ann. Nucl. Med. 2021, 35, 24–30. [Google Scholar] [CrossRef] [PubMed]
  53. Meignan, M.; Cottereau, A.S.; Versari, A.; Chartier, L.; Dupuis, J.; Boussetta, S.; Grassi, I.; Casasnovas, R.-O.; Haioun, C.; Tilly, H.; et al. Baseline Metabolic Tumor Volume Predicts Outcome in High–Tumor-Burden Follicular Lymphoma: A Pooled Analysis of Three Multicenter Studies. JCO 2016, 34, 3618–3626. [Google Scholar] [CrossRef] [PubMed]
  54. Vergote, V.K.J.; Verhoef, G.; Janssens, A.; Woei-A-Jin, F.J.S.H.; Laenen, A.; Tousseyn, T.; Dierickx, D.; Deroose, C.M. [18F]FDG-PET/CT Volumetric Parameters Can Predict Outcome in Untreated Mantle Cell Lymphoma. Leuk. Lymphoma 2023, 64, 161–170. [Google Scholar] [CrossRef]
  55. Albano, D.; Re, A.; Tucci, A.; Giubbini, R.; Bertagna, F. Prognostic Role of ΔMTV and ΔTLG in Burkitt Lymphoma. Ann. Nucl. Med. 2019, 33, 280–287. [Google Scholar] [CrossRef]
  56. Liu, Y.; Jiang, J.; Liu, L.; Wang, Z.; Yu, B.; Xia, Z.; Zhang, Q.; Ji, D.; Liu, X.; Lv, F.; et al. Prognostic Significance of Clinical Characteristics and 18Fluorodeoxyglucose-Positron Emission Tomography/Computed Tomography Quantitative Parameters in Patients with Primary Mediastinal B-Cell Lymphoma. J. Int. Med. Res. 2022, 50, 3000605211063027. [Google Scholar] [CrossRef] [PubMed]
  57. Zhou, Y.; Zhang, X.; Qin, H.; Zhao, Z.; Li, J.; Zhang, B.; Sang, S.; Wu, Y.; Deng, S. Prognostic Values of Baseline 18F-FDG PET/CT in Patients with Peripheral T-Cell Lymphoma. Biomed. Res. Int. 2020, 2020, 9746716. [Google Scholar] [CrossRef] [PubMed]
  58. Feng, X.; Wen, X.; Li, L.; Li, X.; Zhenchang, S.; Zhang, L.; Qiu, L.; Zhang, M. Prognostic Value of Total Metabolic Tumor Volume and Total Lesion Glycolysis Measured By 18F-FDG PET-CT in Patients with T-Cell Lymphoblastic Lymphoma. Blood 2019, 134, 2851. [Google Scholar] [CrossRef]
  59. Ferrández, M.C.; Golla, S.S.V.; Eertink, J.J.; Wiegers, S.E.; Zwezerijnen, G.J.C.; Heymans, M.W.; Lugtenburg, P.J.; Kurch, L.; Hüttmann, A.; Hanoun, C.; et al. Validation of an Artificial Intelligence-Based Prediction Model Using 5 External PET/CT Datasets of Diffuse Large B-Cell Lymphoma. J. Nucl. Med. 2024, 65, 1802–1807. [Google Scholar] [CrossRef]
  60. Li, X.; Chen, Y.; Sun, A.; Wang, Y.; Liu, Y.; Lei, H. Development and Validation of Prediction Model for Overall Survival in Patients with Lymphoma: A Prospective Cohort Study in China. BMC Med. Inform. Decis. Mak. 2023, 23, 125. [Google Scholar] [CrossRef]
  61. Hasanabadi, S.; Aghamiri, S.M.R.; Abin, A.A.; Abdollahi, H.; Arabi, H.; Zaidi, H. Enhancing Lymphoma Diagnosis, Treatment, and Follow-Up Using 18F-FDG PET/CT Imaging: Contribution of Artificial Intelligence and Radiomics Analysis. Cancers 2024, 16, 3511. [Google Scholar] [CrossRef]
  62. Poletto, S.; Novo, M.; Paruzzo, L.; Frascione, P.M.M.; Vitolo, U. Treatment Strategies for Patients with Diffuse Large B-Cell Lymphoma. Cancer Treat. Rev. 2022, 110, 102443. [Google Scholar] [CrossRef] [PubMed]
  63. Sehn, L.H.; Berry, B.; Chhanabhai, M.; Fitzgerald, C.; Gill, K.; Hoskins, P.; Klasa, R.; Savage, K.J.; Shenkier, T.; Sutherland, J.; et al. The Revised International Prognostic Index (R-IPI) Is a Better Predictor of Outcome than the Standard IPI for Patients with Diffuse Large B-Cell Lymphoma Treated with R-CHOP. Blood 2007, 109, 1857–1861. [Google Scholar] [CrossRef] [PubMed]
  64. Lee, J.; Suh, C.; Park, Y.H.; Ko, Y.H.; Bang, S.M.; Lee, J.H.; Lee, D.H.; Huh, J.; Oh, S.Y.; Kwon, H.-C.; et al. Extranodal Natural Killer T-Cell Lymphoma, Nasal-Type: A Prognostic Model from a Retrospective Multicenter Study. J. Clin. Oncol. 2006, 24, 612–618. [Google Scholar] [CrossRef] [PubMed]
  65. Teras, L.R.; DeSantis, C.E.; Cerhan, J.R.; Morton, L.M.; Jemal, A.; Flowers, C.R. 2016 US Lymphoid Malignancy Statistics by World Health Organization Subtypes. CA Cancer J. Clin. 2016, 66, 443–459. [Google Scholar] [CrossRef] [PubMed]
  66. van Doesum, J.A.; Niezink, A.G.H.; Huls, G.A.; Beijert, M.; Diepstra, A.; van Meerten, T. Extranodal Natural Killer/T-Cell Lymphoma, Nasal Type: Diagnosis and Treatment. Hemasphere 2021, 5, e523. [Google Scholar] [CrossRef] [PubMed]
  67. Cottereau, A.-S.; Meignan, M.; Nioche, C.; Capobianco, N.; Clerc, J.; Chartier, L.; Vercellino, L.; Casasnovas, O.; Thieblemont, C.; Buvat, I. Risk Stratification in Diffuse Large B-Cell Lymphoma Using Lesion Dissemination and Metabolic Tumor Burden Calculated from Baseline PET/CT†. Ann. Oncol. 2021, 32, 404–411. [Google Scholar] [CrossRef]
  68. Girum, K.B.; Cottereau, A.-S.; Vercellino, L.; Rebaud, L.; Clerc, J.; Casasnovas, O.; Morschhauser, F.; Thieblemont, C.; Buvat, I. Tumor Location Relative to the Spleen Is a Prognostic Factor in Lymphoma Patients: A Demonstration from the REMARC Trial. J. Nucl. Med. 2024, 65, 313–319. [Google Scholar] [CrossRef]
  69. Girum, K.B.; Rebaud, L.; Cottereau, A.-S.; Meignan, M.; Clerc, J.; Vercellino, L.; Casasnovas, O.; Morschhauser, F.; Thieblemont, C.; Buvat, I. 18F-FDG PET Maximum-Intensity Projections and Artificial Intelligence: A Win-Win Combination to Easily Measure Prognostic Biomarkers in DLBCL Patients. J. Nucl. Med. 2022, 63, 1925–1932. [Google Scholar] [CrossRef] [PubMed]
  70. Wang, T.-W.; Hong, J.-S.; Chiu, H.-Y.; Chao, H.-S.; Chen, Y.-M.; Wu, Y.-T. Standalone Deep Learning versus Experts for Diagnosis Lung Cancer on Chest Computed Tomography: A Systematic Review. Eur. Radiol. 2024, 34, 7397–7407. [Google Scholar] [CrossRef] [PubMed]
  71. Xu, H.-L.; Gong, T.-T.; Liu, F.-H.; Chen, H.-Y.; Xiao, Q.; Hou, Y.; Huang, Y.; Sun, H.-Z.; Shi, Y.; Gao, S.; et al. Artificial Intelligence Performance in Image-Based Ovarian Cancer Identification: A Systematic Review and Meta-Analysis. eClinicalMedicine 2022, 53, 101662. [Google Scholar] [CrossRef] [PubMed]
  72. Meignan, M.; Cottereau, A.-S.; Specht, L.; Mikhaeel, N.G. Total Tumor Burden in Lymphoma—An Evolving Strong Prognostic Parameter. Br. J. Radiol. 2021, 94, 20210448. [Google Scholar] [CrossRef] [PubMed]
  73. Tsilimigras, D.I.; Hyer, J.M.; Bagante, F.; Guglielmi, A.; Ruzzenente, A.; Alexandrescu, S.; Poultsides, G.; Sasaki, K.; Aucejo, F.; Pawlik, T.M. Resection of Colorectal Liver Metastasis: Prognostic Impact of Tumor Burden vs KRAS Mutational Status. J. Am. Coll. Surg. 2021, 232, 590–598. [Google Scholar] [CrossRef] [PubMed]
  74. Iacovelli, R.; Lanoy, E.; Albiges, L.; Escudier, B. Tumour Burden Is an Independent Prognostic Factor in Metastatic Renal Cell Carcinoma. BJU Int. 2012, 110, 1747–1753. [Google Scholar] [CrossRef]
  75. Specht, L.; Nordentoft, A.M.; Cold, S.; Clausen, N.T.; Nissen, N.I. Tumor Burden as the Most Important Prognostic Factor in Early Stage Hodgkin’s Disease. Relations to Other Prognostic Factors and Implications for Choice of Treatment. Cancer 1988, 61, 1719–1727. [Google Scholar] [CrossRef] [PubMed]
  76. Jagannath, S.; Velasquez, W.S.; Tucker, S.L.; Fuller, L.M.; McLaughlin, P.W.; Manning, J.T.; North, L.B.; Cabanillas, F.C. Tumor Burden Assessment and Its Implication for a Prognostic Model in Advanced Diffuse Large-Cell Lymphoma. J. Clin. Oncol. 1986, 4, 859–865. [Google Scholar] [CrossRef] [PubMed]
  77. Mozas, P.; Rivas-Delgado, A.; Rivero, A.; Dlouhy, I.; Nadeu, F.; Balagué, O.; González-Farré, B.; Baumann, T.; Giné, E.; Delgado, J.; et al. High Serum Levels of IL-2R, IL-6, and TNF-α Are Associated with Higher Tumor Burden and Poorer Outcome of Follicular Lymphoma Patients in the Rituximab Era. Leuk. Res. 2020, 94, 106371. [Google Scholar] [CrossRef] [PubMed]
  78. Magrath, I.; Lee, Y.J.; Anderson, T.; Henle, W.; Ziegler, J.; Simon, R.; Schein, P. Prognostic Factors in Burkitt’s Lymphoma: Importance of Total Tumor Burden. Cancer 1980, 45, 1507–1515. [Google Scholar] [CrossRef] [PubMed]
  79. Boellaard, R.; Buvat, I.; Nioche, C.; Ceriani, L.; Cottereau, A.-S.; Guerra, L.; Hicks, R.J.; Kanoun, S.; Kobe, C.; Loft, A.; et al. International Benchmark for Total Metabolic Tumor Volume Measurement in Baseline 18F-FDG PET/CT of Lymphoma Patients: A Milestone Toward Clinical Implementation. J. Nucl. Med. 2024, 65, 1343–1348. [Google Scholar] [CrossRef]
  80. Steliarova-Foucher, E.; Colombet, M.; Ries, L.A.G.; Moreno, F.; Dolya, A.; Bray, F.; Hesseling, P.; Shin, H.Y.; Stiller, C.A.; Bouzbid, S.; et al. International Incidence of Childhood Cancer, 2001–2010: A Population-Based Registry Study. Lancet Oncol. 2017, 18, 719–731. [Google Scholar] [CrossRef] [PubMed]
  81. Huang, B.; Law, M.W.-M.; Khong, P.-L. Whole-Body PET/CT Scanning: Estimation of Radiation Dose and Cancer Risk. Radiology 2009, 251, 166–174. [Google Scholar] [CrossRef] [PubMed]
  82. Sher, A.C.; Seghers, V.; Paldino, M.J.; Dodge, C.; Krishnamurthy, R.; Krishnamurthy, R.; Rohren, E.M. Assessment of Sequential PET/MRI in Comparison with PET/CT of Pediatric Lymphoma: A Prospective Study. Am. J. Roentgenol. 2016, 206, 623–631. [Google Scholar] [CrossRef]
  83. Gatidis, S.; Schmidt, H.; Gücke, B.; Bezrukov, I.; Seitz, G.; Ebinger, M.; Reimold, M.; Pfannenberg, C.A.; Nikolaou, K.; Schwenzer, N.F.; et al. Comprehensive Oncologic Imaging in Infants and Preschool Children with Substantially Reduced Radiation Exposure Using Combined Simultaneous 18F-Fluorodeoxyglucose Positron Emission Tomography/Magnetic Resonance Imaging: A Direct Comparison to 18F-Fluorodeoxyglucose Positron Emission Tomography/Computed Tomography. Investig. Radiol. 2016, 51, 7–14. [Google Scholar] [CrossRef]
  84. Ng, C.K.C. Artificial Intelligence for Radiation Dose Optimization in Pediatric Radiology: A Systematic Review. Children 2022, 9, 1044. [Google Scholar] [CrossRef] [PubMed]
  85. Cabitza, F.; Campagner, A.; Soares, F.; García de Guadiana-Romualdo, L.; Challa, F.; Sulejmani, A.; Seghezzi, M.; Carobene, A. The Importance of Being External. Methodological Insights for the External Validation of Machine Learning Models in Medicine. Comput. Methods Programs Biomed. 2021, 208, 106288. [Google Scholar] [CrossRef]
  86. Ramspek, C.L.; Jager, K.J.; Dekker, F.W.; Zoccali, C.; van Diepen, M. External Validation of Prognostic Models: What, Why, How, When and Where? Clin. Kidney J. 2021, 14, 49–58. [Google Scholar] [CrossRef] [PubMed]
  87. Chalkidis, G.; McPherson, J.P.; Beck, A.; Newman, M.G.; Guo, J.-W.; Sloss, E.A.; Staes, C.J. External Validation of a Machine Learning Model to Predict 6-Month Mortality for Patients with Advanced Solid Tumors. JAMA Netw. Open 2023, 6, e2327193. [Google Scholar] [CrossRef] [PubMed]
  88. Koh, D.-M.; Papanikolaou, N.; Bick, U.; Illing, R.; Kahn, C.E.; Kalpathi-Cramer, J.; Matos, C.; Martí-Bonmatí, L.; Miles, A.; Mun, S.K.; et al. Artificial Intelligence and Machine Learning in Cancer Imaging. Commun. Med. 2022, 2, 133. [Google Scholar] [CrossRef] [PubMed]
  89. Yan, L.; Wang, Z.; Li, D.; Wang, Y.; Yang, G.; Zhao, Y.; Kong, Y.; Wang, R.; Wu, R.; Wang, Z. Low 18F-Fluorodeoxyglucose Dose Positron Emission Tomography Assisted by a Deep-Learning Image-Denoising Technique in Patients with Lymphoma. Quant. Imaging Med. Surg. 2024, 14, 111–122. [Google Scholar] [CrossRef]
  90. de Vries, B.M.; Golla, S.S.V.; Zwezerijnen, G.J.C.; Hoekstra, O.S.; Jauw, Y.W.S.; Huisman, M.C.; van Dongen, G.A.M.S.; Menke-van der Houven van Oordt, W.C.; Zijlstra-Baalbergen, J.J.M.; Mesotten, L.; et al. 3D Convolutional Neural Network-Based Denoising of Low-Count Whole-Body 18F-Fluorodeoxyglucose and 89Zr-Rituximab PET Scans. Diagnostics 2022, 12, 596. [Google Scholar] [CrossRef]
  91. Wang, Y.-R.J.; Baratto, L.; Hawk, K.E.; Theruvath, A.J.; Pribnow, A.; Thakor, A.S.; Gatidis, S.; Lu, R.; Gummidipundi, S.E.; Garcia-Diaz, J.; et al. Artificial Intelligence Enables Whole-Body Positron Emission Tomography Scans with Minimal Radiation Exposure. Eur. J. Nucl. Med. Mol. Imaging 2021, 48, 2771–2781. [Google Scholar] [CrossRef] [PubMed]
  92. Thornton, A.; Lee, P. Publication Bias in Meta-Analysis: Its Causes and Consequences. J. Clin. Epidemiol. 2000, 53, 207–216. [Google Scholar] [CrossRef] [PubMed]
  93. Shamszare, H.; Choudhury, A. Clinicians’ Perceptions of Artificial Intelligence: Focus on Workload, Risk, Trust, Clinical Decision Making, and Clinical Integration. Healthcare 2023, 11, 2308. [Google Scholar] [CrossRef]
  94. De Hert, S. Burnout in Healthcare Workers: Prevalence, Impact and Preventative Strategies. Local. Reg. Anesth. 2020, 13, 171–183. [Google Scholar] [CrossRef] [PubMed]
  95. Dominic, C.; Gopal, D.P.; Sidhu, A. “It’s like Juggling Fire Daily”: Well-Being, Workload and Burnout in the British NHS—A Survey of 721 Physicians. Work 2021, 70, 395–403. [Google Scholar] [CrossRef]
  96. Gandhi, T.K.; Classen, D.; Sinsky, C.A.; Rhew, D.C.; Vande Garde, N.; Roberts, A.; Federico, F. How Can Artificial Intelligence Decrease Cognitive and Work Burden for Front Line Practitioners? JAMIA Open 2023, 6, ooad079. [Google Scholar] [CrossRef] [PubMed]
  97. Krishnan, G.; Singh, S.; Pathania, M.; Gosavi, S.; Abhishek, S.; Parchani, A.; Dhar, M. Artificial Intelligence in Clinical Medicine: Catalyzing a Sustainable Global Healthcare Paradigm. Front. Artif. Intell. 2023, 6, 1227091. [Google Scholar] [CrossRef] [PubMed]
  98. Fatima, M.; Pachauri, P.; Akram, W.; Parvez, M.; Ahmad, S.; Yahya, Z. Enhancing Retinal Disease Diagnosis through AI: Evaluating Performance, Ethical Considerations, and Clinical Implementation. Inform. Health 2024, 1, 57–69. [Google Scholar] [CrossRef]
  99. Baltruschat, I.; Steinmeister, L.; Nickisch, H.; Saalbach, A.; Grass, M.; Adam, G.; Knopp, T.; Ittrich, H. Smart Chest X-Ray Worklist Prioritization Using Artificial Intelligence: A Clinical Workflow Simulation. Eur. Radiol. 2021, 31, 3837–3845. [Google Scholar] [CrossRef] [PubMed]
  100. Bera, K.; Braman, N.; Gupta, A.; Velcheti, V.; Madabhushi, A. Predicting Cancer Outcomes with Radiomics and Artificial Intelligence in Radiology. Nat. Rev. Clin. Oncol. 2022, 19, 132–146. [Google Scholar] [CrossRef] [PubMed]
  101. Jeong, S.; Yu, H.; Park, S.-H.; Woo, D.; Lee, S.-J.; Chong, G.O.; Han, H.S.; Kim, J.-C. Comparing Deep Learning and Handcrafted Radiomics to Predict Chemoradiotherapy Response for Locally Advanced Cervical Cancer Using Pretreatment MRI. Sci. Rep. 2024, 14, 1180. [Google Scholar] [CrossRef] [PubMed]
  102. Li, X.; Yang, L.; Jiao, X. Comparison of Traditional Radiomics, Deep Learning Radiomics and Fusion Methods for Axillary Lymph Node Metastasis Prediction in Breast Cancer. Acad. Radiol. 2023, 30, 1281–1287. [Google Scholar] [CrossRef]
  103. Mayerhoefer, M.E.; Materka, A.; Langs, G.; Häggström, I.; Szczypiński, P.; Gibbs, P.; Cook, G. Introduction to Radiomics. J. Nucl. Med. 2020, 61, 488–495. [Google Scholar] [CrossRef] [PubMed]
  104. Majumder, S.; Katz, S.; Kontos, D.; Roshkovan, L. State of the Art: Radiomics and Radiomics-Related Artificial Intelligence on the Road to Clinical Translation. BJR Open 2024, 6, tzad004. [Google Scholar] [CrossRef] [PubMed]
  105. Dondolin, R.; Garrou, F.; Almasri, M.; Cosentino, C.; Talotta, D.; Bruna, R.; Maria Rivolta, G.; Nabki, J.; Al Deeban, B.; Bruscaggin, A.; et al. Integration of [18f]FDG-PET Radiomics with Liquid Biopsy Improves Outcome Prediction in Newly Diagnosed DLBCL Patients. Blood 2024, 144, 2980. [Google Scholar] [CrossRef]
  106. Yousefirizi, F.; Gowdy, C.; Klyuzhin, I.S.; Sabouri, M.; Tonseth, P.; Hayden, A.R.; Wilson, D.; Sehn, L.H.; Scott, D.W.; Steidl, C.; et al. Evaluating Outcome Prediction via Baseline, End-of-Treatment, and Delta Radiomics on PET-CT Images of Primary Mediastinal Large B-Cell Lymphoma. Cancers 2024, 16, 1090. [Google Scholar] [CrossRef] [PubMed]
  107. Cousin, F.; Louis, T.; Dheur, S.; Aboubakar, F.; Ghaye, B.; Occhipinti, M.; Vos, W.; Bottari, F.; Paulus, A.; Sibille, A.; et al. Radiomics and Delta-Radiomics Signatures to Predict Response and Survival in Patients with Non-Small-Cell Lung Cancer Treated with Immune Checkpoint Inhibitors. Cancers 2023, 15, 1968. [Google Scholar] [CrossRef] [PubMed]
  108. Hosny, A.; Aerts, H.J.; Mak, R.H. Handcrafted versus Deep Learning Radiomics for Prediction of Cancer Therapy Response. Lancet Digit. Health 2019, 1, e106–e107. [Google Scholar] [CrossRef] [PubMed]
  109. Afshar, P.; Mohammadi, A.; Plataniotis, K.N.; Oikonomou, A.; Benali, H. From Handcrafted to Deep-Learning-Based Cancer Radiomics: Challenges and Opportunities. IEEE Signal Process. Mag. 2019, 36, 132–160. [Google Scholar] [CrossRef]
  110. Li, M.; Jiang, Y.; Zhang, Y.; Zhu, H. Medical Image Analysis Using Deep Learning Algorithms. Front. Public. Health 2023, 11, 1273253. [Google Scholar] [CrossRef]
  111. Parekh, V.S.; Jacobs, M.A. Deep Learning and Radiomics in Precision Medicine. Expert. Rev. Precis. Med. Drug Dev. 2019, 4, 59–72. [Google Scholar] [CrossRef] [PubMed]
  112. Shen, M.; Jiang, Z. Artificial Intelligence Applications in Lymphoma Diagnosis and Management: Opportunities, Challenges, and Future Directions. J. Multidiscip. Healthc. 2024, 17, 5329–5339. [Google Scholar] [CrossRef] [PubMed]
  113. Johnson, S.A.; Kumar, A.; Matasar, M.J.; Schöder, H.; Rademaker, J. Imaging for Staging and Response Assessment in Lymphoma. Radiology 2015, 276, 323–338. [Google Scholar] [CrossRef] [PubMed]
  114. Viswanathan, V.S.; Parmar, V.; Madabhushi, A. Towards Equitable AI in Oncology. Nat. Rev. Clin. Oncol. 2024, 21, 628–637. [Google Scholar] [CrossRef]
  115. Marey, A.; Arjmand, P.; Alerab, A.D.S.; Eslami, M.J.; Saad, A.M.; Sanchez, N.; Umair, M. Explainability, Transparency and Black Box Challenges of AI in Radiology: Impact on Patient Care in Cardiovascular Radiology. Egypt. J. Radiol. Nucl. Med. 2024, 55, 183. [Google Scholar] [CrossRef]
  116. Pahud de Mortanges, A.; Luo, H.; Shu, S.Z.; Kamath, A.; Suter, Y.; Shelan, M.; Pöllinger, A.; Reyes, M. Orchestrating Explainable Artificial Intelligence for Multimodal and Longitudinal Data in Medical Imaging. NPJ Digit. Med. 2024, 7, 195. [Google Scholar] [CrossRef]
  117. Abgrall, G.; Holder, A.L.; Chelly Dagdia, Z.; Zeitouni, K.; Monnet, X. Should AI Models Be Explainable to Clinicians? Crit. Care 2024, 28, 301. [Google Scholar] [CrossRef] [PubMed]
  118. Markus, A.F.; Kors, J.A.; Rijnbeek, P.R. The Role of Explainability in Creating Trustworthy Artificial Intelligence for Health Care: A Comprehensive Survey of the Terminology, Design Choices, and Evaluation Strategies. J. Biomed. Inform. 2021, 113, 103655. [Google Scholar] [CrossRef]
  119. Aggarwal, R.; Sounderajah, V.; Martin, G.; Ting, D.S.W.; Karthikesalingam, A.; King, D.; Ashrafian, H.; Darzi, A. Diagnostic Accuracy of Deep Learning in Medical Imaging: A Systematic Review and Meta-Analysis. npj Digit. Med. 2021, 4, 65. [Google Scholar] [CrossRef] [PubMed]
  120. Xu, Z.; Li, J.; Yao, Q.; Li, H.; Zhao, M.; Zhou, S.K. Addressing Fairness Issues in Deep Learning-Based Medical Image Analysis: A Systematic Review. npj Digit. Med. 2024, 7, 286. [Google Scholar] [CrossRef]
Figure 1. The role of artificial intelligence in optimizing the management of patients with lymphoma using positron emission tomography images without replacing the human touch. Abbreviations: AI, artificial intelligence; DL, deep learning; PET, positron emission tomography. Created using BioRender. Kanavos, T. (2024) https://BioRender.com/w36u293, accessed on 23 December 2024.
Figure 1. The role of artificial intelligence in optimizing the management of patients with lymphoma using positron emission tomography images without replacing the human touch. Abbreviations: AI, artificial intelligence; DL, deep learning; PET, positron emission tomography. Created using BioRender. Kanavos, T. (2024) https://BioRender.com/w36u293, accessed on 23 December 2024.
Cancers 17 00069 g001
Figure 2. Artificial intelligence-assisted workflow for the analysis of lymphoma medical images. Deep learning models can use positron emission tomography images with or without computed tomography or magnetic resonance imaging scans as inputs to deliver valuable clinical outputs. Abbreviations: CT, computed tomography; MRI, magnetic resonance imaging; PET, positron emission tomography.
Figure 2. Artificial intelligence-assisted workflow for the analysis of lymphoma medical images. Deep learning models can use positron emission tomography images with or without computed tomography or magnetic resonance imaging scans as inputs to deliver valuable clinical outputs. Abbreviations: CT, computed tomography; MRI, magnetic resonance imaging; PET, positron emission tomography.
Cancers 17 00069 g002
Figure 3. Flow diagram presenting the study selection process along with the number of included and excluded papers.
Figure 3. Flow diagram presenting the study selection process along with the number of included and excluded papers.
Cancers 17 00069 g003
Figure 4. Graphs illustrating the proportion of studies focusing on each different medical task (a), the proportion of studies that used each imaging modality (b), the proportion of studies that conducted external validation versus those that did not (c), and the distribution of studies by sample size (d). Abbreviations: CT, computed tomography; MRI, magnetic resonance imaging; MTV, metabolic tumor volume; PET, positron emission tomography.
Figure 4. Graphs illustrating the proportion of studies focusing on each different medical task (a), the proportion of studies that used each imaging modality (b), the proportion of studies that conducted external validation versus those that did not (c), and the distribution of studies by sample size (d). Abbreviations: CT, computed tomography; MRI, magnetic resonance imaging; MTV, metabolic tumor volume; PET, positron emission tomography.
Cancers 17 00069 g004
Figure 5. Results of the evaluation of the risk of bias and concerns regarding applicability using the prediction model risk of bias assessment tool (PROBAST).
Figure 5. Results of the evaluation of the risk of bias and concerns regarding applicability using the prediction model risk of bias assessment tool (PROBAST).
Cancers 17 00069 g005
Table 1. Classification of lymphomas.
Table 1. Classification of lymphomas.
Non-Hodgkin LymphomaHodgkin Lymphoma
Aggressive types
  Diffuse large B cell lymphoma
  Primary mediastinal large B cell lymphoma
  Burkitt lymphoma
  Peripheral T cell lymphoma
  Anaplastic large cell lymphoma
  Others
Classical
  Nodular sclerosis
  Mixed cellularity
  Lymphocyte-rich
  Lymphocyte-depleted
Indolent types
  Follicular lymphoma
  Small lymphocytic lymphoma
  Marginal zone lymphoma
  Lymphoplasmacytic lymphoma
  Cutaneous T cell lymphoma
  Others
Nodular lymphocyte-predominant
Table 2. Main characteristics of the studies included in the systematic review.
Table 2. Main characteristics of the studies included in the systematic review.
StudyNumber of Participants (Mean/Median Age in Years if Reported)Imaging ModalityMedical ObjectiveDL ArchitectureExternal ValidationPerformance Metrics
Zhou et al., 2021 [24]142 (58)PET/CTDetectionAn Xception-based U-NetYesSe: 84.00%
FPs/patient: 14
Weisman, Kieler et al., 2020 [25]90 (44.47)PET/CTDetectionAn ensemble of 3D patch-based, multi-resolution pathway CNNsNoSe: 85.00%
FPs/patient: 4
Wang et al., 2023 [26]73 (16)PET/MRIDetectionA multimodal fusion algorithmNoSe: 76.00%
FDR: 10.00%
Xu et al., 2024 [4]61 (56.49)PET/CTClassificationA hybrid few-shot multiple-instance learning modelNoAUC: 0.795
F1-score 1: 75.30%
Wang and Jiang, 2023 [27]80PET/CTClassificationA novel model based on SA-R2-Net and ResNetNoAUC: 0.902
F1-score: 85.20%
Aoki et al., 2024 [28]118 (60.95)PET/CTDifferential diagnosisA novel CNN modelNoAUC: 0.963
F1-score: 87.49%
Chen et al., 2023 [29]236 (51.31)PET/CTDifferential diagnosisA novel attention-based aggregate CNN modelYesAUC: 0.788
F1-score: 70.90%
Yang et al., 2023 [30]165PET/CTDifferential diagnosisA DL-SVM model based on ResNet50 architectureNoAUC: 0.948
F1-score: 81.80%
Yousefirizi et al., 2024 [21]1418PET/CTMTV estimationA cascaded U-Net-based approachYesR2: 0.89
Revailler et al., 2022 [31]2030PET/CTMTV estimationA 3D V-Net modelNoSCC: 0.92–0.98
Weisman, Kim et al., 2020 [32]100 (15.80)PET/CTMTV estimationAn ensemble of 3D patch-based, multi-resolution pathway CNNsNoPCC: 0.88
Kuker et al., 2022 [33]100PET/CTMTV estimationA model built off of a pre-trained 2D dilated residual U-Net architectureNoPCC: 0.82
ICC: 0.82
Blanc-Durand et al., 2021 [34]733PET/CTMTV estimationA 3D U-net architecture with two input channelsYesR2: 0.82
Jiang et al., 2022 [35]414PETMTV estimationA fully CNN with an nnU-Net architectureYesR2: 0.94
Ferrández et al., 2023 [36]636PET/CTTreatment response predictionA dual-branch four-layer CNNYesAUC: 0.74
Tong et al., 2023 [37]39PETTreatment response predictionTransfer learning using a pre-trained CNN modelNoAUC: 0.93
Yuan et al., 2023 [38]249PET/CTTreatment response predictionConv-LSTM-based hybrid learning model optimized with a contrastive training objectiveYesAUC: 0.925
F1-score: 54.55%
Jemaa et al., 2024 [39]1418PET/CTTreatment response predictionA novel DL-based algorithmYesSe: 85–97%
Sp: 50–78%
Jiang et al., 2023 [22]684PET/CTSurvival predictionA multiparametric model with a DL component based on VGG19 and DenseNet121YesC-index for PFS: 0.760
C-index for OS: 0.770
Qian et al., 2024 [23]449PET/CTSurvival predictionA multiparametric model with a DL componentYesC-index for PFS: 0.770
C-index for OS: 0.771
Guo et al., 2021 [40]167PET/CTSurvival predictionA weekly supervised DL model based on ResNet18NoAUC: 0.875
1 F1-score is a machine learning evaluation metric that considers both precision and recall and is calculated as the harmonic mean of these two measures. Abbreviations: 2D, two-dimensional; 3D, three-dimensional; AUC, area under the curve; CNN, convolutional neural network; CT, computed tomography; DL, deep learning; FDR, false discovery rate; FP, false positive; ICC, interclass correlation coefficient; MRI, magnetic resonance imaging; MTV, metabolic tumor volume; OS, overall survival; PCC, Pearson’s correlation coefficient; PET, positron emission tomography; PFS, progression-free survival; SCC, Spearman’s correlation coefficient; Se, sensitivity; Sp, specificity; SVM, support vector machine.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kanavos, T.; Birbas, E.; Zanos, T.P. A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma. Cancers 2025, 17, 69. https://doi.org/10.3390/cancers17010069

AMA Style

Kanavos T, Birbas E, Zanos TP. A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma. Cancers. 2025; 17(1):69. https://doi.org/10.3390/cancers17010069

Chicago/Turabian Style

Kanavos, Theofilos, Effrosyni Birbas, and Theodoros P. Zanos. 2025. "A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma" Cancers 17, no. 1: 69. https://doi.org/10.3390/cancers17010069

APA Style

Kanavos, T., Birbas, E., & Zanos, T. P. (2025). A Systematic Review of the Applications of Deep Learning for the Interpretation of Positron Emission Tomography Images of Patients with Lymphoma. Cancers, 17(1), 69. https://doi.org/10.3390/cancers17010069

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop