Next Article in Journal
Anatomical Characteristics of Cervicomedullary Compression on MRI Scans in Children with Achondroplasia
Previous Article in Journal
The Methodology of Adaptive Levels of Interval for Laser Speckle Imaging
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Preclinical Implementation of matRadiomics: A Case Study for Early Malformation Prediction in Zebrafish Model

1
Department of Mechanical and Aerospace Engineering, Sapienza University of Rome, Eudossiana 18, 00184 Rome, Italy
2
Institute of Bioimaging and Complex Biological Systems—National Research Council (IBSBC—CNR), Contrada Pietrapollastra-Pisciotto, 90015 Cefalù, Italy
*
Author to whom correspondence should be addressed.
J. Imaging 2024, 10(11), 290; https://doi.org/10.3390/jimaging10110290
Submission received: 11 October 2024 / Revised: 8 November 2024 / Accepted: 12 November 2024 / Published: 14 November 2024

Abstract

:
Radiomics provides a structured approach to support clinical decision-making through key steps; however, users often face difficulties when switching between various software platforms to complete the workflow. To streamline this process, matRadiomics integrates the entire radiomics workflow within a single platform. This study extends matRadiomics to preclinical settings and validates it through a case study focused on early malformation differentiation in a zebrafish model. The proposed plugin incorporates Pyradiomics and streamlines feature extraction, selection, and classification using machine learning models (linear discriminant analysis—LDA; k-nearest neighbors—KNNs; and support vector machines—SVMs) with k-fold cross-validation for model validation. Classifier performances are evaluated using area under the ROC curve (AUC) and accuracy. The case study indicated the criticality of the long time required to extract features from preclinical images, generally of higher resolution than clinical images. To address this, a feature analysis was conducted to optimize settings, reducing extraction time while maintaining similarity to the original features. As a result, SVM exhibited the best performance for early malformation differentiation in zebrafish (AUC = 0.723; accuracy of 0.72). This case study underscores the plugin’s versatility and effectiveness in early biological outcome prediction, emphasizing its applicability across biomedical research fields.

1. Introduction

Radiomics has become a valuable tool in medical imaging, enabling the extraction of quantitative features that support clinical decision-making, especially in oncology [1,2,3,4]. It provides insights into tumor heterogeneity, early treatment response [5], and patient prognosis [6,7,8,9]. The typical radiomics workflow includes key steps: target identification and segmentation, feature extraction and selection, and predictive model development using machine learning (ML) [10,11,12,13]. Despite the availability of various software for these tasks, transitioning between tools remains challenging [14,15,16,17]. To streamline this, matRadiomics was developed to integrate all workflow steps into a single environment [18]. Initially developed for clinical applications, matRadiomics has proven effective in simplifying radiomics analyses, particularly in positron emission tomography (PET) imaging for prostate cancer, by reducing dependency on multiple software tools [19]. Enhanced by Artificial Intelligence, it further improves decision-making and diagnostic accuracy in clinical settings [20,21].
As preclinical research gains prominence, there is an increasing need to adapt radiomics tools for these studies [22]. Preclinical imaging plays a critical role in understanding disease mechanisms and evaluating treatment efficacy [23]. However, these images often have a higher resolution than clinical images. Unlike tomographic techniques, such as computed tomography (CT), magnetic resonance imaging (MRI), and PET, which are used in both clinical and preclinical settings for large animals, such as mice, optical imaging is applied in zebrafish or cellular models to capture fine details [24,25]. These types of images, not being obtained by medical instrumentation, have a higher resolution than clinical images. For this reason, they present unique challenges in terms of data processing and feature extraction [26]. Feature extraction on large masks significantly increases computational time due to the high data volume. This study analyzed optimal radiomics settings to address the extended processing times while ensuring reliable outputs. To the best of our knowledge, no guidelines exist in this area. For this reason, our goal was to propose preliminary recommendations to optimize feature extraction from high-resolution preclinical images, balancing accuracy and computational efficiency. These preliminary guidelines will help researchers produce reliable data without prohibitive processing times, making radiomics more accessible for exploratory preclinical studies.
With this goal, we have extended matRadiomics to distinguish it from other radiomics tools by integrating several specialized features, designed specifically for preclinical radiomics workflows. First, it supports a wide range of image formats (JPG, TIFF, and PNG) not common in other platforms, allowing users to work with high-resolution images essential in preclinical studies, such as those of zebrafish models, which we used in our case study described below. In this way, unlike existing tools, like LIFEx and FeAture Explorer (FAE) [16,27], which focus on clinical data, matRadiomics extends beyond to handle cell-based and small-animal models, accommodating the specific imaging needs of these contexts. Additionally, matRadiomics incorporates Pyradiomics for standardized and reproducible feature extraction, adhering to Image Biomarker Standardization Initiative (IBSI) guidelines, thus reducing variability and enhancing comparability across studies [28,29]. Thanks to the integration of Pyradiomics and incorporating ML algorithms for the implementation of predictive models, it allows you to complete a radiomics study with the same platform without switching between software, unlike all other State-of-the-Art radiomics software. Furthermore, matRadiomics offers a streamlined workflow with advanced segmentation and feature extraction options specifically optimized for high-resolution preclinical images, which tend to require lengthy processing. By incorporating intensity normalization and voxel resampling, it addresses the computational challenges of large preclinical datasets. These improvements ensure that researchers can generate meaningful insights more rapidly, enabling faster turnaround times for exploratory studies and supporting translational research efforts. In particular, the tool’s support for simultaneous multi-region analysis further helps researchers conduct more complex analyses, which is crucial for biological studies where there are multiple targets to analyze (e.g., toxicity or radiopharmaceutical accumulated in various organs [22]). Through these advancements, matRadiomics significantly contributes to the field, enhancing usability, flexibility, and reliability in preclinical radiomics workflows. This makes it a valuable resource not only for image analysis but also for fostering collaborative research efforts: to expand the sample set, the software includes an “Import Study” function, which allows users to import the various parameter settings used to complete the radiomics study, enabling more immediate collaboration between different research institutions.
As anticipated, the feasibility evaluation of this extended version of matRadiomics is based on a case study involving zebrafish models. The zebrafish is a validated vertebrate model for disease, drug screening, target identification, and pharmacology [30]. It is suitable for studying a variety of different situations, such as genetics, cell biology, toxicology, and embryology, filling a scientific niche between in vitro models and higher organisms [31]. Concerning toxicity assays, Sipes et al. [32] reviewed the concordance of 55–100% between zebrafish and mammalian models in evaluating chemicals with toxic effects. So, the percent concordance across the mammalian species suggests that the response of zebrafish is on par with mammalian toxicity models and supports the utility of this model in toxicology research, as to identify drug targets or optimize clinically relevant treatments, such as radiotherapy [33]. Their widespread and increasing use in toxicological studies is mostly due to their genetic similarity to humans, with 70% of homology and 82% of orthologous human disease-related genes [34], their transparency, rapid reproductive cycle and embryonic development, high permeability to substances, low maintenance costs, and exemption from ethical protocols for zebrafish up to 120 hpf. These factors enable precise and real-time observations of biological malformations and organ toxicity, providing useful information in toxicity assessments [30,31,32,33]. Furthermore, zebrafish show high fertility: each pair can produce from around 200 to 300 eggs per week, thus permitting large-scale experimental analysis and, consequently, a higher statistical validity of the result [35].
Considering that the onset of malformations represents a visible response to a treatment, in this study, we assess the platform’s ability to discriminate “early” between different levels of biological malformations following radiotherapy treatment at 30 Gy. The analysis was performed at only 72 h post-fertilization (hpf), unlike other State-of-the-Art works analyzing images at 96 and 120 hpf [36,37,38,39], which are also the time points at which larvae show greater mobility and activity [40]. Therefore, our idea is to enable more timely predictions of treatment toxicity, without necessarily reaching later time points. In our analyses, high and low levels of malformation were detected at only 72 hpf. This early detection would also improve the efficiency of toxicity screening in drug discovery and development.
In conclusion, this study aims to establish matRadiomics as a reference tool not only for clinical applications but also for the broader field of biomedical research, facilitating the integration of radiomics into preclinical studies and expanding its potential to address key questions in disease modeling and drug development.

2. Material and Methods

The extended version of matRadiomics was enhanced to include the following functionalities: (i) create or import a preclinical radiomics study; (ii) import and visualize images in JPG, TIFF, and PNG formats; (iii) perform target segmentation using multiple methods, including importing pre-existing masks; (iv) extract features from multiple masks using Pyradiomics; (v) conduct feature selection; and (vi) Apply ML algorithms for predictive modeling.

2.1. Improvements in matRadiomics

The architecture of the tool remains the same as in the previous version. However, to make the radiomics analysis applicable in the preclinical context, new functions were introduced to compensate for the absence of certain image information typically available in clinical images (e.g., Digital Imaging and Communications in Medicine, DICOM, format). In particular, when information regarding voxel spacing and slice thickness is unavailable, unit values of [1,1,1] were assigned. This parameter is crucial for the correct execution of the feature extraction process using the Pyradiomics module, and for an accurate interpretation of the extracted features [41].
Upon initiating a radiomics analysis—either as a new study or by importing an existing one—it is now possible to import images in JPG, TIFF, and PNG formats.
In addition to the previously available manual and semi-automatic segmentation tools, users can now import externally created masks in JPG format. In cases of multiple segmentation, such as in fish where various body parts need to be segmented, each mask can be loaded sequentially with a unique name, facilitating easy identification in subsequent radiomics stages. When analyzing the first image study, the user must manually enter the name for each mask. After the initial feature extraction, the program stores these mask names, allowing them to be easily associated with masks from subsequent image studies by selecting from a list. It is also possible to derive new masks through subtraction operation, for example, by removing a specific organ, such as the heart, from the whole animal. Feature extraction is performed using the Pyradiomics module, which resulted in the extraction of 107 features, including shape features that describe the geometric properties of the regions of interest, which are critical to understanding anatomical variations; first-order statistics that provide information about the distribution of pixel intensities, reflecting the overall brightness and contrast of the image; and texture features that capture patterns and relationships in pixel intensities, providing valuable insights into tissue heterogeneity. Unlike the previous version, the new version allows for the extraction of features from multiple masks in series. The extracted features are displayed in the “Features Extraction” table, with each mask recognizable thanks to the associated name. Both the extracted features (in “.xlsx” format) and the masks (in “.mat” format) are saved within the study folder, organized by sample.
For the feature selection and prediction model selection, users can choose from several methods, including ReliefF [42], Point Biserial Correlation (PBC) [43], t-test, and LASSO [44,45]. In terms of prediction modeling, the software offers three classifiers: linear discriminant analysis (LDA) [46], k-nearest neighbors (KNNs) [47], and support vector machines (SVMs) [48]. Two methods are available for the model validation: k-fold cross-validation and stratified k-fold cross-validation.
Finally, to expand the number of samples, the software provides an “Import Study” section. This feature allows users to collaborate by importing a radiomics study that has already completed the feature extraction phase. The program automatically recognizes and stores the names of the analyzed masks upon import, making it easy to use when creating masks for new images to analyze.
To further clarify the improvements made in this release of matRadiomics, we highlight that the newly introduced features directly address the challenges faced in preclinical imaging. By allowing the import of various image formats and enabling the integration of external masks, the tool improves usability and flexibility. Furthermore, the ability to manage multiple masks during analysis simplifies the workflow, especially in complex preclinical studies where the analysis of multiple regions of interest is crucial. These advances not only improve the efficiency of radiomics analysis but also aim to facilitate collaboration between researchers through the functionality of importing external studies.
Figure 1 shows the workflow of matRadiomics, showing its basic characteristics.

2.2. Resolution Analysis for Radiomics Feature Extraction

During this study, it was observed that feature extraction for large masks resulted in a significant increase in computational time, likely due to the high resolution of the preclinical images. The literature provides limited guidance on this problem, other than recommendations to preprocess images to reduce dimensionality or use GPU to complete the task faster [26,49]. To address this, we conducted an analysis to identify the optimal settings for the radiomics study, ensuring faster feature extraction without compromising the integrity of the features.
The first step was to evaluate the time required for feature extraction. Specifically, we applied the radiomics workflow to images at their original size (in our case, 2560 x 1920) and a bin count of 64. Bin count refers to the number of bins used to group pixel intensity values. A value of 64 is commonly used for gray-level discretization to balance sensitivity and computational feasibility. Conversely, setting a bin count of 1 means using a single bin, making it easier to analyze gray-level related features. This value preserves the original gray-level values, allowing for a comparison that reveals how much influence binning has on feature stability and error rate. This approach allowed us to specifically examine the influence of gray-level features on feature variability, allowing for a more targeted analysis. We determined that, for a single mask, comparable in size to the image, approximately 15 min was required for feature extraction (specifications of the computer used for the extraction: processor, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80 GHz; RAM, 32.0 GB; 64-bit operating system, x64-based processor; graphics card, Intel(R) Iris(R) Xe Graphics, 4 GB RAM; graphics max dynamic clock, 1500 MHz; SSD, 1 TB). Given the huge number of masks and images in our case study, this time request was unfeasible, necessitating interventions to improve extraction speed.
To address the issue, small-sized masks of six zebrafish images were initially considered. A total of 107 features were analyzed. For all cases, we evaluated the magnitude of the relative percentage error for each feature and mask as follows:
δ i , j = v A i , j v E i , j v E i , j   · 100 ;  
v A i , j : o b s e r v e d   v a l u e ;   v E i , j : o r i g i n a l   v a l u e ;   i : f e a t u r e s ,   j :   m a s k s
The mean of the relative percentage error among the masks for each type of feature was determined as follows:
δ i = j = 1 m δ j m ;   m = m a s k s   n u m b e r   f o r   e a c h   f e a t u r e s
And, finally, we calculated the average relative percentage error across all features:
M = i = 1 n δ i n ;     n = e x t r a c t e d   f e a t u r e s   n u m b e r
After this, we evaluated two different preprocessing methods: loading resized images and masks or applying the matRadiomics resampling tool after loading the images and the masks in their original size. The resizing function was implemented directly in the program’s code, adding a function to apply resizing to the images and masks upon import. Regarding resampling, no modifications were made to the existing code; instead, the preprocessing tool already provided by matRadiomics was utilized. The resampling function is provided by Pyradiomics, which applies the type of interpolation specified by the user for the image—in this case, stick-nearest neighbor—while always using nearest-neighbor interpolation for the mask. A reduction factor of 2.5 was applied in both cases. The choice of this factor represents a compromise between reducing the dimensionality of the images and minimizing information loss. A reduction by 1/2 did not significantly decrease the dimensionality, while a 1/3 reduction resulted in excessive loss of detail. Therefore, a reduction factor of 1/2.5 was selected to achieve a meaningful decrease in dimensionality without sacrificing too much information. During resizing, the voxel spacing was adjusted accordingly, directly in the program’s code, while for resampling, the aforementioned techniques were used, with no modifications made to the existing code. A bin count of 64 was applied for gray-level analysis. Successively, we assessed the influence of gray-level features on the average relative percentage error by setting the bin count to 1. Finally, the analyses were repeated on a further dataset of six zebrafish to confirm the results obtained.

2.3. The Case Study

The case study validated the extended version of matRadiomics by assessing its ability to distinguish between two classes, high and low malformation, in images of zebrafish larvae at an early time point of 72 h post-fertilization (hpf), following radiotherapy treatment at 30 Gy [50]. Classification into high vs. low malformation was based on current scoring systems in the literature, which rely on a qualitative, operator-dependent assessment of malformations in treated larvae at 96–120 hpf [36,37,38,39]. So, the decision to use a radiomics tool in this study stems from the need to extract quantitative imaging features from optical images, which can capture complex patterns indistinguishable by the human eye alone, and which can give a more objective assessment of treatment-induced tissue changes, thus improving the accuracy and reliability of the toxicity assessment.
Preclinical studies on animal models allow us to evaluate the level of toxicity of a treatment on a complex living organism. In this context, zebrafish larvae have become a valuable model organism in toxicology studies due to their genetic similarity to humans, allowing for the extrapolation of findings to potential human outcomes. Zebrafish share approximately 70% of their genes with humans [51], making them particularly suitable for investigating the effects of drugs and therapeutic treatments. Their transparent embryos and rapid development also facilitate real-time observation of toxic effects, which manifest through visible changes in their morphological characteristics. In biomedical engineering, zebrafish are increasingly used to study drug toxicity, disease mechanisms, and the efficacy of novel treatments. The small size, ease of genetic manipulation, and high reproductive rate of zebrafish provide a practical and cost-effective system for large-scale screening of compounds [52].
Several studies have identified key morphological changes in zebrafish larvae as reliable indicators of toxicity. These visible abnormalities are critical markers for evaluating the toxic impact of compounds. The embryo’s development is extremely rapid, with all major organs and tissues, most of which have comparable position and function to human ones, fully formed by 48 hpf [53]. This consideration leads us to perform a very detailed and significant post-treatment phenotypic analysis in a very short time window. To effectively assess these changes, specific organs and structures in the zebrafish larvae have been identified as the primary focus for toxicity analyses. The organs of interest include the (i) eye, (ii) head, (iii) jaw, (iv) heart, (v) yolk, (vi) swim bladder, (vii) body length, (viii) and curvature of the spine [54]. Although each malformation is indicative of induced toxicity, as slowed or impaired development, they do not all have the same weight. Alterations such as the curvature in the spine (spinal curve, SC) or a reduction in the diameter of the head, as well as an increase in the diameter of the heart sac (Pericardial Edema, PE), are the most interesting ones, strictly correlated with the future survival of the larva, and they have a pathological counterpart in humans [52]. The spine is the most typical characteristic of vertebrates, and it is essential for survival and reproduction. The most common spinal disorder in humans is spinal curvature, and zebrafish have several advantages for modeling human scoliosis. Firstly, there is a high similarity in the morphology and structure of the spine, with a high degree of genetic conservation. Secondly, it is hypothesized that the mechanical force generated during swimming is loaded on the spine in a similar way to that in humans for posture and movement [55]. It follows that the presence of the SC in zebrafish leads to an impairment in mobility and swimming ability. Microcephalia in zebrafish is recognizable when the head is disproportionately small compared with other body structures, and it represents, as well as in humans, a disorder characterized by significantly reduced brain size and neurological development [56]. Zebrafish have proven useful for modeling human heart diseases due to the similarity of zebrafish and mammalian hearts. In addition, embryonic cardiac development is rapid, and its function is easy to observe and quantify. As with heart failure in humans, PE may lead to death in zebrafish larvae [57].
Radiomics tools, such as matRadiomics, can further enhance the precision of these toxicology studies by quantifying morphological changes and correlating them with treatment outcomes. Indeed, by adapting radiomics methodologies to preclinical models, this preliminary application opens new avenues for translational research, providing a robust platform for evaluating drug safety and understanding the mechanisms underlying toxicity, with potential implications for human health.

2.3.1. The Dataset

The dataset consists of two datasets in JPEG format that are organized according to the degree of malformation of the zebrafish. According to scoring systems described in the literature and based on the analysis of the whole larva [36,37], the larvae presenting scores 1–2 were included in the low malformation group, while those presenting scores 3–4 were included in the high malformation group. Specifically, the belonging of a larva to one score rather than another, from the value of 1 to the value of 4, derives from the presence of single malformations classified as mild (1) vs. multiple mild malformations/a single moderate malformation (2) vs. multiple moderate malformations/a single severe malformation (3) vs. multiple severe malformations (4). The high-malformation dataset consists of 82 images, while the low-malformation one consists of 61 images. Following the literature on the use of zebrafish in toxicological analysis [39], six masks were analyzed for each zebrafish: whole fish, eye, heart, yolk, head, and length. The experimental protocol described in this study was carried out exclusively with zebrafish embryos and larvae up to 120 hpf. At this stage of their life cycle, zebrafish are not capable of independent feeding and, therefore, are not subject to the Italian (D.lgs 26/2014) and European (2010/63/EU) rules on the protection of animals used for scientific purposes.

2.3.2. Performance Evaluation and Statistical Analysis

Classification performance was evaluated using the area under the ROC curve (AUC) and accuracy for each mask type and ML classifier. The AUC value, ranging from 1 to 0.5, reflects the quality of class discrimination, with values close to 1 indicating strong discrimination, and those near 0.5 suggesting random performance. Accuracy captures classification correctness by accounting for true positives and true negatives. To compare resampling and resizing methods in image preprocessing, a two-tailed paired-sample t-test was performed based on δ i , as defined in Section 2.2, with a p-value < 0.05 indicating statistical significance.

3. Results

3.1. Resolution Analysis for Radiomics Feature Extraction

To identify the optimal settings for the radiomics study, ensuring faster feature extraction without compromising the integrity of the features, two different preprocessing methods were used with a reduction factor of 2.5: (i) loading resized images and (ii) applying the matRadiomics resampling tool after loading the images in their original size.
Figure 2 shows that the resampling method gives better results than resizing. In addition, the error with the resampling method (M = 38.22%) is lower than the resizing one (M = 79.96%) for 81 out of 107 values. The two-tailed t-test for paired samples yielded a p-value of << 0.05, suggesting that the observed differences were statistically significant and not due to random variation. Consequently, resampling proved to be the most effective preprocessing method. However, the total average error remained high. The largest deviations were observed in gray-level features, prompting us to alter the bin count to one, thereby eliminating the extraction of certain gray-level-related features to see if this reduced the error. This adjustment resulted in smaller mean relative errors compared to using a bin count of 64. Figure 3 shows the mean relative error graphically, while the Table 1 shows the total average error for bin count equal to 1 and 64. The mean relative errors of the features with the bin count equal to 1 are smaller than those with bin count equal to 64 for 90 out of 107 values. The two-tailed t-test for paired samples yielded a p-value of 0.0016. Also in this case, the observed differences were statistically significant and not due to random variation. Consequently, the bin count equal to one proved to be the best parameter to reduce the variations compared to the original images. The total average error is smaller than the previous case of analysis, leading to the conclusion that the gray-level features have a great impact on features variation.
To further refine the analysis, we evaluated the variation in feature estimation on an additional six-zebrafish dataset, again using small-sized masks. The previous results were confirmed, with an average error of 16.96% in this case. Figure 4 shows that the results for the two groups are very similar, allowing us to conclude that the setting previously identified is valid for our case study. Given the lack of established guidelines on this issue, our analysis suggests that, for an initial preclinical radiomics study based on zebrafish, the optimal settings are those reported in Table 2.
In this way, we reduced computation time while keeping the extracted features as close to their original form as possible, thus enhancing workflow efficiency without sacrificing feature validity. Reducing the error ensures a high similarity to the original features, strengthening the consistency and reliability of our analysis. Table 3 shows the features where the average relative error was greater than 2%.

3.2. Case Study

Six regions of interest (ROIs) were analyzed for each zebrafish: the whole fish, eye, heart, yolk, head, and length, as depicted in Figure 5. The user manually segmented each area, and for the first time, a unique name was assigned to each area (see Figure 6a), simplifying identification and tracking in subsequent stages of the radiomics workflow. After the analysis of the first image, the mask names were stored, allowing for easier selection from a list in subsequent analyses, as illustrated in Figure 6b.
For the feature extraction part, the settings given in the feature variation analysis part were used (see Table 2). In this way, 107 features were obtained. At the end of the extraction of all the features, different types of analyses were carried out: a first analysis based on all the masks, and a second analysis based on each type of mask.

3.2.1. Feature Selection

Feature selection allows us to reduce the number of features by identifying only the most important ones for the study, avoiding overfitting problems (see an example in Figure 7). Model interpretability is relevant to feature selection and reduction, ensuring that users understand how features contribute to outcomes [58]. By repeating the PBC method for each study, we obtained the selected features. Table 4 shows the features selected for all analyses. Features related to morphology were commonly selected, consistent with the previous literature. However, some masks revealed unexpected features as being highly relevant, such as texture features that capture patterns and relationships in pixel intensities, providing valuable insights into tissue heterogeneity. These findings suggest that a radiomics analysis can uncover novel biomarkers or patterns that go beyond traditional features, offering new insights into the underlying biological processes. This reinforces the versatility and depth of radiomics in providing data that may not be captured by conventional analysis methods. The selection process also highlighted the importance of model interpretability, ensuring that users can trace how specific features contribute to the final predictions [36,37,38,39].

3.2.2. Machine Learning

All the classifiers proposed by the program were used to train the predictive model, and k-fold stratified cross-validation with k=10 was used for validation. The ML algorithms integrated into matRadiomics are the default implementations from sklearn, the open-source ML and data modeling library for Python. For the KNN classifier, a k = 10 was chosen. This choice was made to balance model accuracy with stability. The k value represents the number of nearest neighbors used to classify each data point. Setting k = 10 provides enough neighborhood information to reduce classification variance, smoothing outliers and making predictions more reliable than lower values of k that can lead to overfitting and high sensitivity to noise. Additionally, k = 10 is often recommended based on empirical tests on datasets, where moderate values generally yield a good performance without causing excessive computational complexity. An example of the performance result (receiver operating characteristic (ROC) curve, accuracy, and confusion matrix) is shown in Figure 8.
The classification results for each type of mask and classifier are shown in Table 5, including the area under the AUC and accuracy.
The classifiers, namely LDA, KNN, and SVM, exhibit varying degrees of success depending on the type of mask analyzed. For example, the whole-fish mask generally produced higher AUC scores across all classifiers, whereas individual masks (such as eye or heart) yielded more variable results. This observation suggests that comprehensive masks encompassing larger regions may be more reliable in predicting outcomes, likely due to the broader range of information they capture. Among the classifiers, SVM exhibited a slightly better performance compared to LDA and KNN, particularly for the whole-fish mask and length mask, with an AUC of up to 0.723. The performance of each classifier remains highly dependent on the specific features selected, as evidenced by the variation across masks. These results reinforce the importance of a careful feature selection to avoid overfitting. Figure 9 summarizes, for greater clarity, the best results achieved for each mask, along with the corresponding selected features and the predictive model that demonstrated the highest performance.

4. Discussion

In this study, we extended matRadiomics to the preclinical domain, specifically for the analysis of high-resolution images, such as microscopy images [59], 2D spheroid images [60], and zebrafish images. The case study presented here, based on zebrafish model, demonstrates the adaptability of the platform to non-clinical image formats and its capability to early discriminate biological outcomes in a preclinical context. matRadiomics retains its core functionality while introducing new functions that make it a viable tool for preclinical imaging studies. This is important especially because there are currently no guidelines or tools in the literature that allow for the application of radiomics to preclinical studies. About our case study, existing scoring systems for the zebrafish model are based on two analysis modes. The first is the one we used for the image’s classification, and it is based on the observation of the entire larva following the parameters already described in Section 2.3.1. The second performed a qualitative–quantitative evaluation of the individual malformations (and not their coexistence), classifying them according to a dimensional parameter in relation to other body districts or, for some of them, performing appropriate measurements using the ImageJ Software [36,37,38,39]. It results in the greater probability of a wrong evaluation between different operators, and it is also highly time-consuming, resulting from the measurements of the individual images. Furthermore, the literature showed that this type of evaluation is performed during the last time points of analysis, i.e., 96 and 120 hpf [36,37,38,39], which are also those in which the larvae are more mobile and active [40]. The idea, therefore, of carrying out an automated analysis on images acquired at 72 hpf could lead both to concrete, reliable, and replicable data and to the possibility of predicting, at an early and easier time point for image acquisitions, how toxic a treatment may be, without necessarily reaching the final time points. For this purpose, we distinguish between high and low malformations in zebrafish, as reported in Section 2.3: high malformations refer to cases with more severe or numerous developmental abnormalities in zebrafish, while low malformations indicate fewer or less observable defects. This distinction helps to categorize the severity of developmental problems, allowing us to analyze the predictive power of the proposed radiomics workflow.
Our analyses started by assigning unit values for voxel spacing and slice thickness, since the actual information was not available but needed to ensure the feature extraction process. These parameters influence the value of several features related to the shape, volume, and texture. In the case where this information is lacking, we believe that a unitary setting is the best choice in terms of features interpretation. However, we acknowledge that this assumption may introduce bias, as it may not accurately represent the actual geometry of the imaging data. This simplification may affect the quantitative analysis and subsequent predictive modeling. Future studies may benefit from acquiring accurate pixel information to improve the validity of the results.
Successively, we identified the optimal settings for the radiomics study, as reported in Table 2, ensuring a faster extraction of the features without compromising their integrity. Figure 2 suggests that the resampling method significantly improves computational efficiency during feature extraction compared to resizing (average relative percentage error of 38.22% and 79.96% for the resampling and resizing methods, respectively). This highlights the need for efficient preprocessing in high-resolution preclinical images, where computational demands can pose challenges when dealing with large amounts of data. Figure 3 shows how using different bin counts for gray-level analysis also had a considerable impact on the extracted features, with smaller bin counts (i.e., one) producing more reliable and consistent results. This finding indicates that adjusting bin count could be a potential strategy for reducing error when analyzing preclinical images. However, despite improvements in error reduction, further studies are required to assess how these methods impact other feature types beyond gray-level features, such as texture-based or high-order radiomics features. The high average errors observed, particularly with the resizing method (79.96%), raise concerns about the reliability of predictions when using these preprocessing techniques. These errors may affect the consistency of feature extraction, potentially undermining the accuracy of the final model. While the resampling method shows a better performance, the substantial error differences highlight the need for further optimization. To mitigate these issues, future studies should focus on refining preprocessing techniques, such as experimenting with different resampling parameters or exploring alternative methods that may offer better precision without sacrificing efficiency. To the best of our knowledge, there are currently no reference data in the literature regarding the impact of resampling and bin count variation on the computational efficiency during feature extraction. This work is the first to investigate these aspects and to propose an effective strategy.
The final goal of our case study analysis was to evaluate the ability of matRadiomics to distinguish between high and low malformations early in zebrafish models. Morphological features were predominantly selected, aligning with prior studies [36,37,38,39]. However, some masks highlighted unexpected features, such as texture-based features, which capture patterns in pixel intensities and provide valuable insights into tissue heterogeneity, as shown in Table 4. This indicates that radiomics can reveal novel biomarkers or patterns beyond traditional features, offering a deeper understanding of biological processes. Table 5 shows the potential of matRadiomics as an early decision support tool in preclinical research (AUC of 0.723, whole-fish mask-based SVM model). The best result based on the SVM model analyzing the whole-fish mask could be explained by the scoring method used to divide the images into two classes (high malformation vs. low malformation), as it evaluates the entire larva [36]. This comprehensive evaluation considers the potential presence of multiple malformations across the fish. As such, it makes sense that the best-performing model is the one that examines the entire fish image, as it aligns more closely with the scoring criteria that reflect the overall condition of the larva rather than the entity of the individual alterations. Nevertheless, considering how each fish district could have a different weight for the future survival of the individual post-treatment, it could be useful to carry out a second analysis, in which the high vs. low classification is carried out for each malformation separately [37].
Our study has some limitations. While the plugin integrates three ML models, it currently lacks more advanced models, such as random forests or deep learning approaches, that could improve its suitability for complex analyses. Expanding the model options would greatly improve the versatility of the tool and the accuracy assessment [61]. Additionally, the feature extraction process for high-resolution images remains to be explored; further optimization is needed to increase efficiency in handling high-resolution preclinical data. Although the zebrafish case study demonstrates the functionality of the plugin, its generalizability to other preclinical models remains untested, such as microscopy [59] and 2D spheroid images [60], to further validate its versatility. However, caution is needed when translating our findings. Variations in imaging modalities, biological markers, or species-specific differences could affect the generalizability of the methodology. For instance, different models may require adjustments in preprocessing steps or feature extraction methods. Future research should aim to validate these techniques across a wider range of species and experimental settings to establish their robustness and adaptability. Finally, the case study does not include external validation on independent datasets, an essential step to ensure the tool’s robustness and applicability across different scenarios.

5. Conclusions

In this study, we extended matRadiomics to the preclinical domain. Our case study on zebrafish models demonstrated the platform’s ability to discriminate biological outcomes early, a critical step in preclinical research. Our results suggest that an early automated analysis at 72 hpf can predict treatment toxicity, potentially reducing the need for more laborious subsequent assessments. Although our study highlights the potential of matRadiomics, some limitations remain. The impact of parameter settings for the feature extraction process for high-resolution images requires further investigation. Furthermore, expanding the range of ML models and validating the platform on different preclinical models are essential for its wider adoption. Despite these limitations, our work represents a significant step towards integrating radiomics into preclinical research, accelerating drug discovery and development.

Author Contributions

Conceptualization, A.S.; data curation, G.P. (Gaia Pucci); formal analysis, E.M.; funding acquisition, G.R.; investigation, G.P. (Gaia Pucci); methodology, A.S.; project administration, A.S.; resources, G.P. (Giovanni Pasini), G.I.F. and F.M.; software, E.M. and G.P. (Giovanni Pasini); supervision, F.B.; validation, G.P. (Gaia Pucci); visualization, E.M.; writing—original draft, E.M. and A.S.; writing—review and editing, E.M., G.P. (Gaia Pucci), G.P. (Giovanni Pasini), and A.S. All authors have read and agreed to the published version of the manuscript.

Funding

GIOCONDA (Understanding the radiobiological effects of BNCT and FLASH therapy) project, from an Anthem (AdvaNced Technologies for Human-centrEd Medicine) project’s BAC—PNC0000003.

Institutional Review Board Statement

Ethical review and approval for the experiments performed in zebrafish were waived for this study due to the following reason: All the experimental protocols described in this study were carried out exclusively with zebrafish embryos and larvae up to 120 hpf. At this stage of their life cycle, zebrafish are not capable of independent feeding and, therefore, are not subject to the Italian (D.lgs 26/2014) and European (2010/63/EU) rules on the protection of animals used for scientific purposes.

Informed Consent Statement

Not applicable, as the data used in this study were downloaded from a public database.

Data Availability Statement

Source code, documentation, and examples are available upon free request to the authors. After the publication of the article, the software will be available on the institutional websites of the authors (IBSBC-CNR and Sapienza University).

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Gillies, R.J.; Kinahan, P.E.; Hricak, H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef]
  2. Rizzo, S.; Botta, F.; Raimondi, S.; Origgi, D.; Fanciullo, C.; Morganti, A.G.; Bellomi, M. Radiomics: The Facts and the Challenges of Image Analysis. Eur. Radiol. Exp. 2018, 2, 36. [Google Scholar] [CrossRef]
  3. Mayerhoefer, M.E.; Materka, A.; Langs, G.; Häggström, I.; Szczypiński, P.; Gibbs, P.; Cook, G. Introduction to Radiomics. J. Nucl. Med. 2020, 61, 488–495. [Google Scholar] [CrossRef] [PubMed]
  4. Castiglioni, I.; Gilardi, M.C. Radiomics: Is It Time to Compose the Puzzle? Clin. Transl. Imaging 2018, 6, 411–413. [Google Scholar] [CrossRef] [PubMed]
  5. Banna, G.L.; Anile, G.; Russo, G.; Vigneri, P.; Castaing, M.; Nicolosi, M.; Strano, S.; Gieri, S.; Spina, R.; Patanè, D.; et al. Predictive and Prognostic Value of Early Disease Progression by PET Evaluation in Advanced Non-Small Cell Lung Cancer. Oncology 2016, 92, 39–47. [Google Scholar] [CrossRef]
  6. Sala, E.; Mema, E.; Himoto, Y.; Veeraraghavan, H.; Brenton, J.D.; Snyder, A.; Weigelt, B.; Vargas, H.A. Unravelling Tumour Heterogeneity Using Next-Generation Imaging: Radiomics, Radiogenomics, and Habitat Imaging. Clin. Radiol. 2017, 72, 3–10. [Google Scholar] [CrossRef] [PubMed]
  7. Wang, K.; Qiao, Z.; Zhao, X.; Li, X.; Wang, X.; Wu, T.; Chen, Z.; Fan, D.; Chen, Q.; Ai, L. Individualized Discrimination of Tumor Recurrence from Radiation Necrosis in Glioma Patients Using an Integrated Radiomics-Based Model. Eur. J. Nucl. Med. Mol. Imaging 2019, 18, 1400–1411. [Google Scholar] [CrossRef]
  8. Ugga, L.; Cuocolo, R.; Solari, D.; Guadagno, E.; D’Amico, A.; Somma, T.; Cappabianca, P.; del Basso de Caro, M.L.; Cavallo, L.M.; Brunetti, A. Prediction of High Proliferative Index in Pituitary Macroadenomas Using MRI-Based Radiomics and Machine Learning. Neuroradiology 2019, 61, 1365–1373. [Google Scholar] [CrossRef]
  9. Liberini, V.; Laudicella, R.; Balma, M.; Nicolotti, D.G.; Buschiazzo, A.; Grimaldi, S.; Lorenzon, L.; Bianchi, A.; Peano, S.; Bartolotta, T.V.; et al. Radiomics and Artificial Intelligence in Prostate Cancer: New Tools for Molecular Hybrid Imaging and Theragnostics. Eur. Radiol. Exp. 2022, 6, 27. [Google Scholar] [CrossRef]
  10. Leithner, D.; Schöder, H.; Haug, A.; Vargas, H.A.; Gibbs, P.; Häggström, I.; Rausch, I.; Weber, M.; Becker, A.S.; Schwartz, J.; et al. Impact of ComBat Harmonization on PET Radiomics-Based Tissue Classification: A Dual-Center PET/MRI and PET/CT Study. J. Nucl. Med. 2022, 63, 1611–1616. [Google Scholar] [CrossRef]
  11. Zhou, M.; Scott, J.; Chaudhury, B.; Hall, L.; Goldgof, D.; Yeom, K.W.; Iv, M.; Ou, Y.; Kalpathy-Cramer, J.; Napel, S.; et al. Radiomics in Brain Tumor: Image Assessment, Quantitative Feature Descriptors, and Machine-Learning Approaches. Am. J. Neuroradiol. 2018, 39, 208–216. [Google Scholar] [CrossRef] [PubMed]
  12. Loddo, A.; Usai, M.; Di Ruberto, C. Gastric Cancer Image Classification: A Comparative Analysis and Feature Fusion Strategies. J. Imaging 2024, 10, 195. [Google Scholar] [CrossRef] [PubMed]
  13. Corso, R.; Stefano, A.; Salvaggio, G.; Comelli, A. Shearlet Transform Applied to a Prostate Cancer Radiomics Analysis on MR Images. Mathematics 2024, 12, 1296. [Google Scholar] [CrossRef]
  14. Stefano, A. Challenges and Limitations in Applying Radiomics to PET Imaging: Possible Opportunities and Avenues for Research. Comput. Biol. Med. 2024, 179, 108827. [Google Scholar] [CrossRef]
  15. Russo, G.; Stefano, A.; Alongi, P.; Comelli, A.; Catalfamo, B.; Mantarro, C.; Longo, C.; Altieri, R.; Certo, F.; Cosentino, S.; et al. Feasibility on the Use of Radiomics Features of 11[C]-MET PET/CT in Central Nervous System Tumours: Preliminary Results on Potential Grading Discrimination Using a Machine Learning Model. Curr. Oncol. 2021, 28, 5318–5331. [Google Scholar] [CrossRef]
  16. Nioche, C.; Orlhac, F.; Boughdad, S.; Reuze, S.; Goya-Outi, J.; Robert, C.; Pellot-Barakat, C.; Soussan, M.; Frouin, F.; Buvat, I. Lifex: A Freeware for Radiomic Feature Calculation in Multimodality Imaging to Accelerate Advances in the Characterization of Tumor Heterogeneity. Cancer Res. 2018, 78, 4786–4789. [Google Scholar] [CrossRef]
  17. Szczypiński, P.M.; Strzelecki, M.; Materka, A.; Klepaczko, A. MaZda-A Software Package for Image Texture Analysis. Comput. Methods Programs Biomed. 2009, 94, 66–76. [Google Scholar] [CrossRef]
  18. Pasini, G.; Bini, F.; Russo, G.; Comelli, A.; Marinozzi, F.; Stefano, A. MatRadiomics: A Novel and Complete Radiomics Framework, from Image Visualization to Predictive Model. J. Imaging 2022, 8, 221. [Google Scholar] [CrossRef]
  19. Pasini, G.; Stefano, A.; Mantarro, C.; Richiusa, S.; Comelli, A.; Russo, G.I.; Sabini, M.G.; Cosentino, S.; Ippolito, M.; Russo, G. A Robust [18F]-PSMA-1007 Radiomics Ensemble Model for Prostate Cancer Risk Stratification. J. Imaging Inform. Med. 2024. [Google Scholar] [CrossRef]
  20. Hu, Q.; Li, K.; Yang, C.; Wang, Y.; Huang, R.; Gu, M.; Xiao, Y.; Huang, Y.; Chen, L. The Role of Artificial Intelligence Based on PET/CT Radiomics in NSCLC: Disease Management, Opportunities, and Challenges. Front. Oncol. 2023, 13, 1133164. [Google Scholar] [CrossRef]
  21. Khanam, J.J.; Foo, S.Y. A Comparison of Machine Learning Algorithms for Diabetes Prediction. ICT Express 2021, 7, 432–439. [Google Scholar] [CrossRef]
  22. Benfante, V.; Stefano, A.; Comelli, A.; Giaccone, P.; Cammarata, F.P.; Richiusa, S.; Scopelliti, F.; Pometti, M.; Ficarra, M.; Cosentino, S.; et al. A New Preclinical Decision Support System Based on PET Radiomics: A Preliminary Study on the Evaluation of an Innovative 64Cu-Labeled Chelator in Mouse Models. J. Imaging 2022, 8, 92. [Google Scholar] [CrossRef] [PubMed]
  23. Anderson, C.J.; Ferdani, R. Copper-64 Radiopharmaceuticals for PET Imaging of Cancer: Advances in Preclinical and Clinical Research. Cancer Biother. Radiopharm. 2009, 24, 379–393. [Google Scholar] [CrossRef] [PubMed]
  24. Castranova, D.; Samasa, B.; Venero Galanternik, M.; Gore, A.V.; Goldstein, A.E.; Park, J.S.; Weinstein, B.M. Long-Term Imaging of Living Adult Zebrafish. Development 2022, 149, dev199667. [Google Scholar] [CrossRef]
  25. Betzig, E.; Patterson, G.H.; Sougrat, R.; Lindwasser, O.W.; Olenych, S.; Bonifacino, J.S.; Davidson, M.W.; Lippincott-Schwartz, J.; Hess, H.F. Imaging Intracellular Fluorescent Proteins at Nanometer Resolution. Science 2006, 313, 1642–1645. [Google Scholar] [CrossRef] [PubMed]
  26. Jiao, Y.; Ijurra, O.M.; Zhang, L.; Shen, D.; Wang, Q. CuRadiomics: A GPU-Based Radiomics Feature Extraction Toolkit. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Cham, Switzerland, 2020; Volume 11991, pp. 44–52. [Google Scholar]
  27. Song, Y.; Zhang, J.; Zhang, Y.D.; Hou, Y.; Yan, X.; Wang, Y.; Zhou, M.; Yao, Y.F.; Yang, G. FeAture Explorer (FAE): A Tool for Developing and Comparing Radiomics Models. PLoS ONE 2020, 15, e0237587. [Google Scholar] [CrossRef] [PubMed]
  28. Van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.C.; Pieper, S.; Aerts, H.J.W.L. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  29. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-Based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef]
  30. MacRae, C.A.; Peterson, R.T. Zebrafish as Tools for Drug Discovery. Nat. Rev. Drug Discov. 2015, 14, 721–731. [Google Scholar] [CrossRef]
  31. Horzmann, K.A.; Freeman, J.L. Making Waves: New Developments in Toxicology with the Zebrafish. Toxicol. Sci. 2018, 163, 5–12. [Google Scholar] [CrossRef]
  32. Sipes, N.S.; Padilla, S.; Knudsen, T.B. Zebrafish—As an Integrative Model for Twenty-first Century Toxicity Testing. Birth Defects Res. C Embryo Today 2011, 93, 256–267. [Google Scholar] [CrossRef] [PubMed]
  33. Zhao, W.; Chen, Y.; Hu, N.; Long, D.; Cao, Y. The Uses of Zebrafish (Danio rerio) as an in Vivo Model for Toxicological Studies: A Review Based on Bibliometrics. Ecotoxicol. Environ. Saf. 2024, 272, 116023. [Google Scholar] [CrossRef]
  34. Howe, K.; Clark, M.D.; Torroja, C.F.; Torrance, J.; Berthelot, C.; Muffato, M.; Collins, J.E.; Humphray, S.; McLaren, K.; Matthews, L.; et al. The Zebrafish Reference Genome Sequence and Its Relationship to the Human Genome. Nature 2013, 496, 498–503. [Google Scholar] [CrossRef] [PubMed]
  35. Hoo, J.Y.; Kumari, Y.; Shaikh, M.F.; Hue, S.M.; Goh, B.H. Zebrafish: A Versatile Animal Model for Fertility Research. Biomed. Res. Int. 2016, 2016, 9732780. [Google Scholar] [CrossRef] [PubMed]
  36. Bar-Ilan, O.; Albrecht, R.M.; Fako, V.E.; Furgeson, D.Y. Toxicity Assessments of Multisized Gold and Silver Nanoparticles in Zebrafish Embryos. Small 2009, 5, 1897–1910. [Google Scholar] [CrossRef]
  37. Szabó, E.R.; Brand, M.; Hans, S.; Hideghéty, K.; Karsch, L.; Lessmann, E.; Pawelke, J.; Schürer, M.; Beyreuther, E. Radiobiological Effects and Proton RBE Determined by Wildtype Zebrafish Embryos. PLoS ONE 2018, 13, e0206879. [Google Scholar] [CrossRef]
  38. Panzica-Kelly, J.M.; Zhang, C.X.; Danberry, T.L.; Flood, A.; DeLan, J.W.; Brannen, K.C.; Augustine-Rauch, K.A. Morphological Score Assignment Guidelines for the Dechorionated Zebrafish Teratogenicity Assay. Birth Defects Res. B Dev. Reprod. Toxicol. 2010, 89, 382–395. [Google Scholar] [CrossRef]
  39. Brunner, S.; Tőkés, T.; Szabó, E.R.; Polanek, R.; Szabó, I.Z.; Reisz, Z.; Gubán, B.K.; Szijártó, A.L.; Brand, M.; Hans, S.; et al. Dose-Dependent Changes After Proton and Photon Irradiation in a Zebrafish Model. Anticancer Res. 2020, 40, 6123–6135. [Google Scholar] [CrossRef]
  40. Rosa, J.G.S.; Lima, C.; Lopes-Ferreira, M. Zebrafish Larvae Behavior Models as a Tool for Drug Screenings and Pre-Clinical Trials: A Review. Int. J. Mol. Sci. 2022, 23, 6647. [Google Scholar] [CrossRef]
  41. Peng, J.; Lu, F.; Huang, J.; Zhang, J.; Gong, W.; Hu, Y.; Wang, J. Development and Validation of a Pyradiomics Signature to Predict Initial Treatment Response and Prognosis during Transarterial Chemoembolization in Hepatocellular Carcinoma. Front Oncol 2022, 12, 853254. [Google Scholar] [CrossRef]
  42. Robnik-Šikonja, M.; Kononenko, I. Theoretical and Empirical Analysis of ReliefF and RReliefF. Mach. Learn. 2003, 53, 23–69. [Google Scholar] [CrossRef]
  43. Barone, S.; Cannella, R.; Comelli, A.; Pellegrino, A.; Salvaggio, G.; Stefano, A.; Vernuccio, F. Hybrid Descriptive-Inferential Method for Key Feature Selection in Prostate Cancer Radiomics. Appl. Stoch. Models Bus. Ind. 2021, 37, 961–972. [Google Scholar] [CrossRef]
  44. Kammer, M.; Dunkler, D.; Michiels, S.; Heinze, G. Evaluating Methods for Lasso Selective Inference in Biomedical Research: A Comparative Simulation Study. BMC Med. Res. Methodol. 2022, 22, 206. [Google Scholar] [CrossRef] [PubMed]
  45. Pasini, G.; Russo, G.; Mantarro, C.; Bini, F.; Richiusa, S.; Morgante, L.; Comelli, A.; Russo, G.I.; Sabini, M.G.; Cosentino, S.; et al. A Critical Analysis of the Robustness of Radiomics to Variations in Segmentation Methods in 18F-PSMA-1007 PET Images of Patients Affected by Prostate Cancer. Diagnostics 2023, 13, 3640. [Google Scholar] [CrossRef]
  46. Xue, J.H.; Titterington, D.M. Do Unbalanced Data Have a Negative Effect on LDA? Pattern Recognit. 2008, 41, 1558–1571. [Google Scholar] [CrossRef]
  47. Soltaninejad, S.; Keshani, M.; Tajeripour, F. Lung Nodule Detection by KNN Classifier and Active Contour Modelling and 3D Visualization. In Proceedings of the 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP 2012), Shiraz, Iran, 2–3 May 2012; pp. 440–445. [Google Scholar] [CrossRef]
  48. Keshani, M.; Azimifar, Z.; Tajeripour, F.; Boostani, R. Lung Nodule Segmentation and Recognition Using SVM Classifier and Active Contour Modeling: A Complete Intelligent System. Comput. Biol. Med. 2013, 43, 287–300. [Google Scholar] [CrossRef]
  49. Kumar, V.; Gu, Y.; Basu, S.; Berglund, A.; Eschrich, S.A.; Schabath, M.B.; Forster, K.; Aerts, H.J.W.L.; Dekker, A.; Fenstermacher, D.; et al. Radiomics: The Process and the Challenges. Magn. Reson. Imaging 2012, 30, 1234–1248. [Google Scholar] [CrossRef] [PubMed]
  50. Pucci, G.; Forte, G.I.; Cavalieri, V. Evaluation of Epigenetic and Radiomodifying Effects during Radiotherapy Treatments in Zebrafish. Int. J. Mol. Sci. 2021, 22, 9053. [Google Scholar] [CrossRef]
  51. Kalueff, A.V.; Stewart, A.M.; Gerlai, R. Zebrafish as an Emerging Model for Studying Complex Brain Disorders. Trends Pharmacol. Sci. 2014, 35, 63–75. [Google Scholar] [CrossRef]
  52. Pucci, G.; Savoca, G.; Iacoviello, G.; Russo, G.; Forte, G.I.; Cavalieri, V. Curcumin’s Radioprotective Effects on Zebrafish Embryos. Antioxidants 2024, 13, 1281. [Google Scholar] [CrossRef]
  53. Kimmel, C.B.; Ballard, W.W.; Kimmel, S.R.; Ullmann, B.; Schilling, T.F. Stages of Embryonic Development of the Zebrafish. Dev. Dyn. 1995, 203, 253–310. [Google Scholar] [CrossRef] [PubMed]
  54. Carreira, M.J.; Vila-Blanco, N.; Cabezas-Sainz, P.; Sánchez, L. Zftool: A Software for Automatic Quantification of Cancer Cell Mass Evolution in Zebrafish. Appl. Sci. 2021, 11, 7721. [Google Scholar] [CrossRef]
  55. Xie, H.; Li, M.; Kang, Y.; Zhang, J.; Zhao, C. Zebrafish: An Important Model for Understanding Scoliosis. Cell. Mol. Life Sci. 2022, 79, 506. [Google Scholar] [CrossRef]
  56. Novorol, C.; Burkhardt, J.; Wood, K.J.; Iqbal, A.; Roque, C.; Coutts, N.; Almeida, A.D.; He, J.; Wilkinson, C.J.; Harris, W.A. Microcephaly Models in the Developing Zebrafish Retinal Neuroepithelium Point to an Underlying Defect in Metaphase Progression. Open Biol. 2013, 3, 130065. [Google Scholar] [CrossRef]
  57. Narumanchi, S.; Wang, H.; Perttunen, S.; Tikkanen, I.; Lakkisto, P.; Paavola, J. Zebrafish Heart Failure Models. Front. Cell Dev. Biol. 2021, 9, 662583. [Google Scholar] [CrossRef] [PubMed]
  58. Huang, A.A.; Huang, S.Y. Increasing Transparency in Machine Learning through Bootstrap Simulation and Shapely Additive Explanations. PLoS ONE 2023, 18, e0281922. [Google Scholar] [CrossRef]
  59. Piccinini, F.; Tazzari, M.; Tumedei, M.M.; Stellato, M.; Remondini, D.; Giampieri, E.; Martinelli, G.; Castellani, G.; Carbonaro, A. Data Science for Health Image Alignment: A User-Friendly Open-Source ImageJ/Fiji Plugin for Aligning Multimodality/Immunohistochemistry/Immunofluorescence 2D Microscopy Images. Sensors 2024, 24, 451. [Google Scholar] [CrossRef] [PubMed]
  60. Peirsman, A.; Blondeel, E.; Ahmed, T.; Anckaert, J.; Audenaert, D.; Boterberg, T.; Buzas, K.; Carragher, N.; Castellani, G.; Castro, F.; et al. MISpheroID: A Knowledgebase and Transparency Tool for Minimum Information in Spheroid Identity. Nat. Methods 2021, 18, 1294–1303. [Google Scholar] [CrossRef]
  61. Huang, A.A.; Huang, S.Y. Computation of the Distribution of Model Accuracy Statistics in Machine Learning: Comparison between Analytically Derived Distributions and Simulation-based Methods. Health Sci. Rep. 2023, 6, e1214. [Google Scholar] [CrossRef]
Figure 1. The workflow of the extended version of matRadiomics for preclinical studies.
Figure 1. The workflow of the extended version of matRadiomics for preclinical studies.
Jimaging 10 00290 g001
Figure 2. Mean relative error for each feature for the two different preprocessing methods.
Figure 2. Mean relative error for each feature for the two different preprocessing methods.
Jimaging 10 00290 g002
Figure 3. Mean relative error for bin count equal to one and equal to sixty-four.
Figure 3. Mean relative error for bin count equal to one and equal to sixty-four.
Jimaging 10 00290 g003
Figure 4. Mean relative error on features for the first and the second zebrafish dataset.
Figure 4. Mean relative error on features for the first and the second zebrafish dataset.
Jimaging 10 00290 g004
Figure 5. Example of all_fish (a), heart (b), head (c), eye (d), yolk (e), and length (f) masks used in the study.
Figure 5. Example of all_fish (a), heart (b), head (c), eye (d), yolk (e), and length (f) masks used in the study.
Jimaging 10 00290 g005
Figure 6. (a) The window that appears to manually assign the name of the mask for the analysis of the first image. (b) The window with the list of masks used after the extraction of the features of the first image.
Figure 6. (a) The window that appears to manually assign the name of the mask for the analysis of the first image. (b) The window with the list of masks used after the extraction of the features of the first image.
Jimaging 10 00290 g006
Figure 7. Example of bar plot for selected features for all masks without all_fish mask. In red, the feature selected using the PBC method.
Figure 7. Example of bar plot for selected features for all masks without all_fish mask. In red, the feature selected using the PBC method.
Jimaging 10 00290 g007
Figure 8. Example of performance of the predictive model for the all_fish mask based on the ROC curve, precision and confusion matrix.
Figure 8. Example of performance of the predictive model for the all_fish mask based on the ROC curve, precision and confusion matrix.
Jimaging 10 00290 g008
Figure 9. The best results obtained for each mask, together with the corresponding image, the selected features, and the predictive ML model that achieves the highest performance.
Figure 9. The best results obtained for each mask, together with the corresponding image, the selected features, and the predictive ML model that achieves the highest performance.
Jimaging 10 00290 g009
Table 1. Parameters for feature variation in the case of gray-level analysis.
Table 1. Parameters for feature variation in the case of gray-level analysis.
ResamplingBin Count = 1Bin Count = 64
Total average error16.04%38.22%
Table 2. Parameter setting used in our preclinical study.
Table 2. Parameter setting used in our preclinical study.
PreprocessingResampling TechniqueBin Count
Resampling, 2.5 factorStick-nearest neighbor1
Table 3. Features with an average relative error major than 2%.
Table 3. Features with an average relative error major than 2%.
Features with Error > 2%
“original_firstorder_Energy”
“original_firstorder_Kurtosis”
“original_firstorder_Skewness”
“original_firstorder_TotalEnergy”
“original_gldm_DependenceNonUniformity”
“original_gldm_DependenceVariance”
“original_gldm_GrayLevelNonUniformity”
“original_glrlm_GrayLevelNonUniformity”
“original_glrlm_LongRunEmphasis”
“original_glrlm_LongRunHighGrayLevelEmphasis”
“original_glrlm_LongRunLowGrayLevelEmphasis”
“original_glrlm_RunEntropy”
“original_glrlm_RunLengthNonUniformity”
“original_glrlm_RunVariance”
“original_glszm_GrayLevelNonUniformity”
“original_glszm_LargeAreaEmphasis”
“original_glszm_LargeAreaHighGrayLevelEmphasis”
“original_glszm_LargeAreaLowGrayLevelEmphasis”
“original_glszm_SizeZoneNonUniformity”
“original_glszm_ZoneEntropy”
“original_glszm_ZoneVariance”
Table 4. Selected features for each type of mask.
Table 4. Selected features for each type of mask.
Mask TypeSelected FEATURES
All masksoriginal_glszm_ZoneVariance
All_fishoriginal_shape_SurfaceArea.
Eyeoriginal_gldm_DependenceNonUniformityNormalized
Headoriginal_shape_MinorAxisLength
Heartoriginal_glrlm_GrayLevelNonUniformity
Lengthoriginal_glszm_ZoneVariance,
original_shape_MeshVolume
Yolkoriginal_shape_MinorAxisLength
Table 5. Performance of predictive model for all the analyses. The best result obtained is in bold.
Table 5. Performance of predictive model for all the analyses. The best result obtained is in bold.
Mask TypeLDAKNNSVM
All masks except all_fishAUC: 0.568
Acc: 0.582
AUC: 0.576
Acc: 0.558
AUC: 0.464
Acc: 0.591
All_fishAUC: 0.720
Acc: 0.679
AUC: 0.684
Acc: 0.628
AUC: 0.723
Acc: 0.72
EyeAUC: 0.702
Acc: 0.658
AUC: 0.686
Acc: 0.608
AUC: 0.706
Acc: 0.629
HeadAUC: 0.692
Acc: 0.663
AUC: 0.708
Acc: 0.651
AUC: 0.683
Acc: 0.650
HeartAUC: 0.704
Acc: 0.670
AUC: 0.696
Acc: 0.671
AUC: 0.677
Acc: 0.670
LengthAUC: 0.712
Acc: 0.692
AUC: 0.670
Acc: 0.638
AUC: 0.718
Acc: 0.693
YolkAUC: 0.693
Acc: 0.665
AUC: 0.613
Acc: 0.622
AUC: 0.623
Acc: 0.629
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bini, F.; Missori, E.; Pucci, G.; Pasini, G.; Marinozzi, F.; Forte, G.I.; Russo, G.; Stefano, A. Preclinical Implementation of matRadiomics: A Case Study for Early Malformation Prediction in Zebrafish Model. J. Imaging 2024, 10, 290. https://doi.org/10.3390/jimaging10110290

AMA Style

Bini F, Missori E, Pucci G, Pasini G, Marinozzi F, Forte GI, Russo G, Stefano A. Preclinical Implementation of matRadiomics: A Case Study for Early Malformation Prediction in Zebrafish Model. Journal of Imaging. 2024; 10(11):290. https://doi.org/10.3390/jimaging10110290

Chicago/Turabian Style

Bini, Fabiano, Elisa Missori, Gaia Pucci, Giovanni Pasini, Franco Marinozzi, Giusi Irma Forte, Giorgio Russo, and Alessandro Stefano. 2024. "Preclinical Implementation of matRadiomics: A Case Study for Early Malformation Prediction in Zebrafish Model" Journal of Imaging 10, no. 11: 290. https://doi.org/10.3390/jimaging10110290

APA Style

Bini, F., Missori, E., Pucci, G., Pasini, G., Marinozzi, F., Forte, G. I., Russo, G., & Stefano, A. (2024). Preclinical Implementation of matRadiomics: A Case Study for Early Malformation Prediction in Zebrafish Model. Journal of Imaging, 10(11), 290. https://doi.org/10.3390/jimaging10110290

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop