Next Article in Journal
Genotype, but Not Climate, Affects the Resistance of Honey Bees (Apis mellifera) to Viral Infections and to the Mite Varroa destructor
Previous Article in Journal
Human-Borne Pathogens: Are They Threatening Wild Great Ape Populations?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reproducibility and Feasibility of Classification and National Guidelines for Histological Diagnosis of Canine Mammary Gland Tumours: A Multi-Institutional Ring Study

by
Serenella Papparella
1,†,
Maria Ines Crescio
2,†,
Valeria Baldassarre
1,
Barbara Brunetti
3,
Giovanni P. Burrai
4,5,
Cristiano Cocumelli
6,
Valeria Grieco
7,
Selina Iussich
8,
Lorella Maniscalco
8,
Francesca Mariotti
9,
Francesca Millanta
10,
Orlando Paciello
1,
Roberta Rasotto
11,
Mariarita Romanucci
12,
Alessandra Sfacteria
13 and
Valentina Zappulli
14,*
1
Department of Veterinary Medicine and Animal Production, Unit of Pathology, University of Naples Federico II, 80138 Naples, Italy
2
National Reference Center for the Veterinary and Comparative Oncology (CEROVEC), Experimental Zooprophylactic Institute of Piedmont, Liguria and Valle d’Aosta, 10154 Turin, Italy
3
Department of Veterinary Medical Sciences, University of Bologna, 40126 Bologna, Italy
4
Department of Veterinary Medicine, University of Sassari, 07100 Sassari, Italy
5
Mediterranean Center for Disease Control (MCDC), University of Sassari, 07100 Sassari, Italy
6
Experimental Zooprophylactic Institute of Lazio and Toscana M. Aleandri, 00178 Rome, Italy
7
Department of Veterinary Medicine, University of Milan, 26900 Lodi, Italy
8
Department of Veterinary Science, University of Turin, 10095 Turin, Italy
9
School of Bioscience and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy
10
Department of Veterinary Sciences, University of Pisa, 56124 Pisa, Italy
11
Independent Researcher, Via Messer Ottonello 1, 37127 Verona, Italy
12
Faculty of Veterinary Medicine, University of Teramo, 64100 Teramo, Italy
13
Department of Veterinary Science, University of Messina, 98122 Messina, Italy
14
Department of Comparative Biomedicine and Food Science, University of Padua, 35020 Padua, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Vet. Sci. 2022, 9(7), 357; https://doi.org/10.3390/vetsci9070357
Submission received: 3 May 2022 / Revised: 4 July 2022 / Accepted: 6 July 2022 / Published: 13 July 2022
(This article belongs to the Section Veterinary Biomedical Sciences)

Abstract

:

Simple Summary

Tumours of the mammary gland are common in humans, as in canine species. They are very heterogenous with numerous morphological variants and different biologic behaviours. In the last few decades, several efforts have been made to classify these tumours histologically and establish the level of malignancy by using histologic grading systems. However, reproducibility and diagnostic agreement of such classification and grading have been only rarely assessed. In this study, we tested the variability in diagnoses performed by 15 pathologists using the same classification and grading system. Prior to the study, pathologists agreed on guidelines regarding how to apply these systems. Pathologists worked blindly on 36 digital histologic slides of canine mammary tumours. The agreement was statistically analysed using Cohen’s kappa coefficient that, when equal to 1, indicates perfect agreement. The overall agreement in the identification of hyperplastic-dysplastic/benign/malignant lesions was substantial (kappa 0.76), while outcomes on morphological classification had only a moderate agreement (k = 0.54). Tumour grade assigned by pathologists was the least concordant and kappa could not be calculated. Although promising, the results underline that each diagnostic/grading system should be assessed and optimized for standardization and high diagnostic agreement.

Abstract

Histological diagnosis of Canine Mammary Tumours (CMTs) provides the basis for proper treatment and follow-up. Nowadays, its accuracy is poorly understood and variable interpretation of histological criteria leads to a lack of standardisation and impossibility to compare studies. This study aimed to quantify the reproducibility of histological diagnosis and grading in CMTs. A blinded ring test on 36 CMTs was performed by 15 veterinary pathologists with different levels of education, after discussion of critical points on the Davis-Thompson Foundation Classification and providing consensus guidelines. Kappa statistics were used to compare the interobserver variability. The overall concordance rate of diagnostic interpretations of WP on identification of hyperplasia-dysplasia/benign/malignant lesions showed a substantial agreement (average k ranging from 0.66 to 0.82, with a k-combined of 0.76). Instead, outcomes on ICD-O-3.2 morphological code /diagnosis of histotype had only a moderate agreement (average k ranging from 0.44 and 0.64, with a k-combined of 0.54). The results demonstrated that standardised classification and consensus guidelines can produce moderate to substantial agreement; however, further efforts are needed to increase this agreement in distinguishing benign versus malignant lesions and in histological grading.

1. Introduction

The histopathological diagnosis and grade of Canine Mammary Tumours (CMTs) are considered the gold standard for patient management and research outcomes [1,2,3]. In this regard, misdiagnosis and/or interinstitutional diagnostic variability between pathologists can seriously affect the interpretation of clinical data that use the histological output as the reference standard. In particular, the evaluation of therapeutic protocols as well as the interpretation of predictive/prognostic molecular markers can be adversely affected [4]. Histopathological diagnostic criteria for CMTs were largely established during years with updated internationally recognized classifications from the World Health Organization (WHO) [5,6] and from the Davis-Thompson DVM Foundation (DTF) [7]. However, whether these classification systems are uniformly applied, as well as the challenges encountered by pathologists in agreeing on the diagnosis and grade of CMTs, is unclear. Two studies were performed on CMT diagnostic agreement already underlying some diagnostic disagreement [8,9]. In addition, significant problems can be related to the application of different classification and grading systems (human versus veterinary; national versus international) and, even when applying common systems, to the subjective interpretation of histological criteria or misleading concepts and terms used to classify and grade CMTs.
Many factors, therefore, could promote interobserver variability (IOV) and decrease the diagnostic reproducibility among pathologists, resulting in classification and/or grading errors that lead to failure to predict tumour behaviour. Standardisation of diagnosis is also a prerequisite to allow a comparison between research studies worldwide. Efforts for standardisation are common both in human and veterinary medicine and assessment of agreement has been performed on some tumoral and non-tumoral diseases, as non-exhaustively summarised in Table 1 and references therein. Studies evaluating the agreement based on common classification/grading systems or histological criteria underlined that the concordance, despite never being excellent/perfect, is higher when the applied system/criteria are shared and discussed in consensus meetings, thus, establishing the sources of IOV [10,11,12,13,14].
To address the issue of IOV for the diagnosis of CMTs, as part of a national initiative of the Italian Association of Veterinary Pathologists (AIPVET), a group of 15 Italian veterinary pathologists developed national guidelines for CMT assessment using, as a starting point, the last DTF classification of CMTs [7].
Therefore, the aim of this study was to assess IOV in the classification and grading of CMTs when applying the same system and guidelines. The effect of this standardisation on diagnostic concordance and agreement rates was evaluated and critical aspects were pointed out and discussed.

2. Materials and Methods

Fifteen veterinary pathologists (see authors list) from academic Schools of Veterinary Medicine, from Veterinary State laboratories (Experimental Zooprophylactic Institutes), and from private veterinary diagnostic laboratories constituted a working group (Working panel, WP) to discuss critical aspects of the recently published DTF classification of CMTs. Two (RR and VZ) of the fifteen pathologists were among the authors in the DTF classification. The WP produced national guidelines [63] and established consensus criteria for histological diagnosis and grading of malignancies for the entities of the DTF classification. For this purpose and due to COVID-19 pandemic, twenty telematic meetings lasting an average of 90 min were held by all components to address common challenges and misconceptions in histopathological assessment of CMTs. Moreover, unified national guidelines were created to reduce the discrepancies between pathologists operating in different institutions and private diagnostic laboratories. The WP identified the principal causes of diagnostic disagreement by interjecting their own direct and indirect (e.g., held seminars and discussions with other colleagues) experiences into the discussion.
More specifically, the consensus regarded the following critical aspects: a. histological subtypes; b. grading; c. criteria for malignancy; d. approach for lymph node metastases and micrometastases; e. pathological prognostic factors; f. markers for phenotype and prognosis; g. content of the histopathological report; and h: application and revision of ICD-O-3.2 codes [64]. ICD-O-3.2 codes have been used now for more than 35 years, principally in human tumour or cancer registries, for coding the site (topography) and the histology (morphology) of the neoplasm, in this way helping standardization. For the purpose of this study, only aspects a) to c) and h) will be presented.
A consensus on the aforementioned critical aspects was reached and applied during the ring study. The ring study was performed on selected histological samples to evaluate the effect of the classification and the national guidelines in the reproducibility of morphological diagnosis. One experienced pathologist (VZ, ECVP diplomat), internationally recognized for research and continuing veterinary education on mammary gland pathology, selected 34 slides best representing the hyperplastic/dysplastic/neoplastic lesions of the canine mammary gland described in the DTF classification (Table 2). Slides were chosen from the available archive of a university diagnostic veterinary pathology service (BCA Dept., University of Padua, Italy). As per Directive 2010/63/EU of the European Parliament and of the Council of 22 September 2010, regarding the protection of animals used for scientific purposes, the Italian legislature (D. Lgs. n. 26/2014) does not require approval from ethical committees for the use of stored samples in retrospective studies. Additionally, submitting vets sign an informed consent for privacy and to allow the use of protected data regarding samples in research studies. Two more slides were provided by a second participant (RR, ECVP diplomate), also with broad experience on mammary gland pathology. The 36 slides, with minimal repetition of the same histological diagnosis, were progressively numbered (from 1 to 36), digitally scanned (D Sight Menarini) and distributed to the WP for digital examination. Participants were provided the same single hematoxylin-and-eosin-stained slide per case. WP diagnoses were anonymous and blinded to previous and to each other’s interpretations and the diagnosis was recorded providing multiple pre-filled choices of answer to minimize errors (i.e., one drop-down menu for H/B/M and one drop-down menu for the three possible features associated with the scoring for each criterion of the grading). WP participants were asked to interpret the cases following the DTF classification and the newly established national guidelines. They had to classify the lesion as hyperplasia-dysplasia (H) for non-neoplastic lesions, or as benign (B) or malignant (M) for tumours and to identify the specific histological diagnosis also including the corresponding ICD-O-3.2 [64] code as reported in Table 2. Regarding ICD-O codes, the WP analysed the available codes at the time of the study taking as a reference the International Classification for Disease in Oncology ICD-O-3.2. [64] The goal of this action was to update and standardise the cancer codes for the current veterinary cancer registries active in Italy. WP members were also asked to report the histological CMT grading features (i.e., mitotic count, percentage of tubules formation, and degree of pleomorphism) according to Peña and co-authors [2] to calculate the grade. Participants were given a time frame of 4 weeks to complete the evaluations. No clinical details or immunohistochemistry (IHC) results were provided.

Statistical Analysis

Statistical analysis was carried out by calculating Cohen’s kappa (k) [65,66,67]. The k evaluates the agreement between panellists taking into account agreements due solely to chance. The lacking gold standard was replaced by the mode of the results given by panellists for each sample (majority opinion, GM) [68]. The k is scaled to be 0 when the amount of agreement is what would be expected by chance and 1 when there is perfect agreement between the observers. Kappa values between 0.21 and 0.40 were considered to represent fair agreement; 0.41–0.60 indicated moderate agreement; 0.61–0.80 substantial agreement; and 0.81-1.00 excellent agreement [67]. K was calculated for each panellist versus all (k_ava) and for each panellist versus the GM (k_vGM). The performances of the single panellist were obtained by calculating the mean of the k_ava and of the k_vGM. To synthesise the overall results, a k-combined was calculated for each statistic separately, according to Fleiss and co-authors, indicating the mean of all the k_ava means and k-vGM means, respectively [68]. Statistics were computed for the following parameters: (1) samples identified as H/B/M and (2) specific histological diagnosis, reported as ICD-O-3.2 code (Table 2).
To detect and comment on the differences among the specific histological diagnosis under study, the proportion of cases correctly identified (i.e., cases corresponding to GM) over the total of cases was calculated. This measure does not take into account the effect of chance and, for the purpose of this paper, was referred to as concordance.
The panellists’ experience was then evaluated by performing a hierarchical cluster analysis with the Ward’s method. Cluster analysis was firstly applied to the self-reported variables denoting experience (years of experience, caseload per week, number of published papers), then a second analysis was performed on the classification of lesions as H/B/M. Ward’s method of clustering joins the two groups that result in the minimum increase in the error sum of squares [69].

3. Results

3.1. Guidelines and WP Composition

For the purpose of this study, the WP discussed some critical points and established and reported into the guidelines a consensus regarding the following aspects.
  • Histological subtypes—To precisely apply the histological diagnosis reported in the DTF classification, as proposed by the authors. For example, the term “carcinoma in situ” was not applied and instead used atypical hyperplasia or atypical epitheliosis, depending on specific morphological aspects. As another example, it was agreed that the tumour histotype was defined based on the prevalent morphological pattern where more than one pattern was observed (e.g., tubular and solid).
  • Grading—To use the canine grading system proposed by Peña and colleagues [2], as summarised in Table 3. The histological grading was reported, regardless of the presence or absence of vascular invasion.
  • Criteria for malignancy—To employ the following parameter as criteria for malignancy: (I) tumour architecture with reduced tubular organisation (with no objective measurement and no specific cut off); (II) marked cellular and nuclear pleomorphism (with no objective measurement and no specific cut off); and (III) high mitotic count. A cut off ≥6 mitoses per 2.37 mm2 was proposed and applied exclusively when other criteria for malignancy were borderline/unclear. This was to indicate the possibility of a lesion with clear evidence of malignancy (e.g., anaplastic carcinoma) and a mitotic count below 6, or of a clearly benign lesion (e.g., ductal adenoma) with a number of mitoses higher or equal to 6. Regarding the mitotic count, it was performed digitally by the WP following these criteria: total area of observation of 2.37 mm2 [70] taking into consideration that the digital fields to obtain this total area had to be highly cellular and avoid cystic/necrotic fields. If the expected area (2.37 mm2) could not be obtained, the mitotic count was proportionally determined; most mitotically active areas (usually at the periphery of the tumour) were chosen to start, moving to consecutive fields. After two fields with no mitoses, the third new field was chosen as the next new mitotically active field to then proceed again consecutively, and so on until ten counted fields in total. In order to do so, each participant calculated the number of fields to be examined on their screen to cover the standardised 2.37 mm2 area. This was done by dividing 2.37 mm2 by the total area of a 40“×” U+00D7 image field, which was measured with a ruler tool on the screen [70]. Additional criteria for malignancy were (IV) presence of small areas of random necrosis (groups of neoplastic cells with karyolysis and karyorrhexis), keeping in mind that central wide necrosis can be present both in benign and malignant lesions; (V) peripheral infiltration, determined as an irregular contour of the tumour showing a desmoplastic reaction, often associated with a mixed inflammatory infiltrate; (VI) pluristratification of neoplastic cells with loss of polarity, atypia, and dysplasia; and (VII) lymphatic vessel invasion by neoplastic cells.
WP participants’ characteristics and relative data and experience are shown in Table 4.

3.2. Outcomes Expressed in Terms of Hyperplasia-Dysplasia/Benign/Malignant (H, B, M) Showed a Substantial Agreement

The results, in terms of hyperplasia-dysplasia (H), benign (B), or malignant (M) communicated by individual readers, are reported in Figure 1.
Only for one case (n.10) it was not possible to establish a GM being equally diagnosed as B/M. Out of 36 cases, 23 cases were 100% concordant (15/15 panellists), among which 17 had a GM = M. Among the 13 discordant cases, only 4 cases were M/B/H discordant. Furthermore, 3 out of 36 cases had 93% concordance (14/15 panellists), 2/3 with a GM = B (14 B and 1M) and 1 with a GM = M (14M and 1 H). The remaining discordant cases were two cases of GM = M versus B discordant, two of GM = B versus M discordant, and two cases GM = H versus B discordant, (Figure 1). The highest discordance was seen for four cases (case 7, 10, 26, 29) with less than 10 panellists agreeing on a diagnosis and having a GM = B or no GM.
Figure 2a shows that the agreement between the participating laboratories was not uniform: the participants, in fact, had an average k ranging from 0.66 to 0.82 (with the 95% CI limits varying between 0.43 and 0.98) and the k-combined is equal to 0.76 (0.74–0.79). The k-vGM, shown in Figure 2b, presents relatively better results than those relating to the k-ava: panellists had average k-vGM ranging between 0.71 and 0.95 (with the 95% CI limits varying between 0.47 and 1.00). The k-combined for the panellists vs. GM was 0.86 (range of means 0.62–1.00).

3.3. Outcomes Expressed in Terms of ICD-O Morphological Code/Diagnosis Had a Moderate Agreement

The results expressed by the participants as morphological diagnosis together with the GM are reported in Table 5. As such, 14/15 participants repeated a diagnosis at least once. The estimate of the k for each participant reported in Figure 3a shows that agreement among participants was not uniform: participants had an average kappa ranging between 0.44 and 0.64 (with 95% CI limits ranging between 0.39 and 0.70). The k-combined is equal to 0.54 (95% CI 0.54–0.55). The analysis with respect to the k-vGM, shown in Figure 3b, presents relatively better results than those relating to the k-ava. Panellists had an average k-vGM ranging between 0.52 and 0.94 (with 95% CI limits ranging between 0.47 and 1.00). The k-combined for the panellists vs GM was 0.70 (range of means 0.64–0.76).
A 100% (15/15) or 93% (14/15) concordance among panellists was seen in 6 and 5 cases, respectively, with one additional case (case n. 25) having 2/15 only slightly differing diagnoses (i.e., 13/15 osteosarcoma, 1/15 sarcoma, 1/15 chondrosarcoma). The 5 cases with very high (15/15) concordance included only simple tumours (i.e., composed of one single cell type). They were three cases with GM = M (comedocarcinoma, mucinous carcinoma, anaplastic carcinoma), one with a GM = B (myoepithelioma), and two with GM = H (melanosis of the teat and ductal ectasia). The 5 cases with a high (14/15) concordance had a GM = M in 3 cases ( lipid-rich carcinoma, squamous cell carcinoma, carcinosarcoma), a GM = B in 1 case (ductal adenoma) and a GM = H in 1 case (papillomatosis).
The most discordant (≥5/15 and <10/15 GM concordant panellists) cases were 11/36 (30.5%). They included six cases with GM = M, among which three had a complex/mixed/solid nature (case no. 24, 30, 31), indicating some difficulties in classifying tumours with a B/M myoepithelial component. Further, one of the six was a ductal-associated neoplasm (GM = ductal carcinoma, case no. 28), one of the six had a spindle appearance (GM = other sarcomas, case no. 20), and one of the six had a combined tubular and papillary pattern (GM = simple tubulopapillary carcinoma, case no. 5). An additional 3/11 discordant cases had a GM = B and included 1/3 cases with a GM = fibroadenoma (case no. 26) mainly differentially diagnosed as hyperplasia with fibrosis, 1/3 case with a GM of simple adenoma (case no. 10), which had one of the lowest concordances (5/15 panellists) and included several differential diagnoses (i.e., ductal adenoma/carcinoma; simple tubular carcinoma; lobular hyperplasia with atypia), indicating a difficulty in identifying also the M/B/H nature. A further 2/11 discordant cases had a GM = H, 1 with a GM = lobular hyperplasia with fibrosis (case no. 18) also diagnosed as fibroadenoma (3/15) or hyperplasia with atypia (4/15) and 1 with a GM = lobular hyperplasia with atypia (case no. 29) associated with several differential diagnoses, including lobular hyperplasia with fibrosis (1/15), simple adenoma (3/15), complex adenoma (4/15), and complex carcinoma (1/15).

3.4. Outcomes Expressed in Terms of Grading

Since grading was assessed only for samples diagnosed as malignant, a certain amount of heterogeneity was seen. Therefore, we decided not to calculate the k, but to give instead a description of the most discordant elements of grading.
Grading (Supplementary Table S1) was never 100% concordant. With regard to those lesions with a GM of a malignant tumour and a GM of histological subtype for which the grading was applicable (15 cases), in 11/15 cases, all the three grades were used by panellists, and in the remaining 4 cases, either grade III or grade I was not applied, two tumours did not reach a GM for grade (n. 30 and n. 36), and the most common GM was grade 2 (9/13). The highest concordance was for one grade II tumour (case n. 5, 73% with 11/15 panellists and with a 100% concordant GM of simple tubulopapillary carcinoma) and for one grade III tumour (case n. 11, 66% with 10/15 panellists and with a 100% concordant GM of comedocarcinoma). Cases with GM = B when diagnosed as malignant (six cases) were predominantly scored grade I, two cases with GM = H diagnosed as malignant were scored as grade I (n.13 by two panellists) and grade II (n. 12 by one panellist).

3.5. Outcomes Expressed Considering Panellist Features

The cluster analysis performed on the variables synthesising the panellists’ experience pointed out the existence, at the first level of partition, of two groups: one with 11 members and one with 4 members (Figure 4). The four members (3,8,11,12) in the smaller group were identified among the five “experts”. This definition coincides with being considered an expert on CMTs by colleagues, as reported in Table 4. The cluster analysis performed on the classification of lesions, as H/B/M shows, at the first level of partition, found the existence of two groups of eight and seven members. All the experts belong to the first group and cluster among them on the second and the third level of partition (Figure 4).
The five panellists considered experts by reputation had 100% concordance between them in 28/36 (77.8%) in terms of H/B/M classification, and these cases were always concordant with the GM. With regard to the morphological diagnosis, their concordance was 100% only in 13/36 cases (36.1%) and these were always concordant with the GM. Only in 6/36 (16.6%) cases, their discordance was regarding non-tumoural/benign versus malignant histotypes.

4. Discussion

In this study, we evaluated concordance and agreement in the diagnosis of CMTs applying the same histological classification system (DTF classification [7]), and consensus guidelines [63]. However, as already demonstrated in the literature (see Table 1 and references therein), the overall concordance was below 100% and the overall agreement was below excellent values.
Difficulties in reaching perfect diagnostic consensus are both reader related and lesion related and multiple variables are involved ([71] and references therein).
Considering the reader-related elements, the application of the same classification and grading systems is fundamental for standardization as well as the establishment of international consensus working groups and guidelines [11,12,13,14,71,72]. Nevertheless, even when applying approved systems, high accordance is not easily achieved [71]. In our study, we tested the reproducibility and the feasibility of the DTF classification [7] implemented with national consensus guidelines [63]. No previous studies have been performed on the application of a detailed histological classification and/or guidelines for the diagnosis of CMTs. For human breast cancer (HBC), several attempts have been made (see references in Table 1) obtaining 75% of concordance or a very variable agreement depending on specific subtypes [27,31]. The need for consensus discussions and shared guidelines have, therefore, already been pointed out in human medicine, both for tumoural and non-tumoural lesions, as the classification systems are still too prone to variability of application.
This variability is related to many additional factors.
Among reader-related factors, the highest is the expertise and the longest is the experience of the pathologists in a specific field, then the highest can be the consensus in that specific area, as demonstrated by our cluster analysis, in which the experienced CMT pathologists are grouped together. In some human studies, a similar higher diagnostic agreement was observed in multivariate analyses as associated with higher diagnostic confidence, similar years of experience, and expertise in a specific area [35,71]. In addition, variable diagnostic approaches can also be pathologist-related aspects, impacting diagnostic variability during the routine, such as the number of sections evaluated per lesion, application of ancillary analyses, such as histochemical and immunohistochemical tests, and a combination of both pathological and clinical aspects to produce a diagnosis [73,74,75,76,77]. All these aspects are very hard to standardise and complex dedicated protocol guidelines should be considered in the attempt of reducing this variability [78,79,80]. They were not targeted in our study but should certainly receive further additional attention.
With regard to lesion-related aspects, some histological features (e.g., cellular/nuclear pleomorphism for establishing malignancy and grading) convey intrinsic qualitative subjective evaluation so that IOV is very hard to minimise [81,82]. Beyond this, biological processes are often a continuum of progressive steps identifying those that necessitate detailed morphological thresholds, which are not always available [83,84]. In CMTs and tumours in general, a major point of discussion is the identification of the transition of a lesion from non-neoplastic to neoplastic and, even more importantly, from benign to malignant [85,86]. In consideration of this point, we investigated the concordance/agreement in the identification of hyperplasia-dysplasia and benign/malignant lesions. Our study showed a relatively good result with the k-combined considered in the literature from “moderate” to “substantial” (means of k_ava ranging from 0.66 to 0.82 with the 95% CI limits varying between 0.43 and 0.98 and the k-combined equal to 0.76) and 23/36 cases with 100% concordance. It can, therefore, be said that the level of histological diagnosis in discriminating between benign, malignant, or hyperplastic-dysplastic lesions was quite satisfactory. However, since k-ava is strongly affected by the diagnosis of each single panellist (i.e., a strong disagreement of only one participant can severely decrease the k), in this study, we also calculated the k-vGM representing the distance of the single panellist from the majority opinion (GM) (ISO13528:2015). With k-vGM, we observed, indeed, an even better agreement (k-combined=0.86, CI 95% 0.62–1.00).
In a similar study conducted in Taiwan, 10 experienced pathologists classified 15 CMTs as either benign or malignant with no further histological classification and, likewise, our study obtained a moderate average level of agreement (0.43k) [9]. Prior to and during the study, these authors did not agree on any specific classification criteria or guidelines; however, they did not include hyperplasia/dysplasia as possible diagnosis, decreasing possibilities of discordance. In our study, a strong discordance was observed in 4/36 cases, in which morphological aspects were overlapping between hyperplastic/benign/malignant lesions. Distinction could be made more on a subjective evaluation than on (missing) objective criteria (e.g., a lesion with a simple tubular organisation with mild atypia can receive a diagnosis of lobular hyperplasia or simple adenoma or simple carcinoma grade I). In the attempt to implement agreement, particularly in these more subjective/borderline lesions, application of specific parameters/thresholds (e.g., mitotic count threshold) were agreed by the WP and probably helped consensus. Application of specific thresholds/methodologies has already been demonstrated to improve concordance in specific areas [49,79]. The parameters applied in our study were taken from the DTF classification and were based on authors’ experience and not on published data. Before establishing precise morphological features/thresholds allowing the identification of tumour progression, the parameters should be carefully evaluated in follow-up large-scale studies, which, however, are very lacking in veterinary medicine [87]. For this reason, the authors still believe that multi-institutional and international application of similar default thresholds would help standardisation, comparison of studies, and collection of large-scale data to assess and possibly redefine the thresholds themselves.
When it comes to the identification of specific histological tumour subtypes, the complexity of the lesions can increase difficulties in reaching blinded consensus diagnoses [7,31,35].
In our study, the agreement on the diagnostic code (identification of a specific histotype) was more unsatisfactory; the average k-combined for k_ava showed values considered in the literature as moderate (0.54k; 95% CI 0.54–0.55). In this case as well, the k_vGM gave a better agreement (k-combined = 0.70, CI 95% 0.64–0.76) suggesting that this type of statistic should always be calculated versus either a standard diagnosis or a majority opinion that is usually lacking within the studies (Table 1). No similar studies have been performed in CMTs. However, similarly to us, two distinct works analysed agreement in classifying canine soft tissue sarcomas and canine and feline nervous system tumours [59] applying specific histological systems and obtained, respectively, moderate (0.60k) and substantial agreement (0.66k) for IOV. In this regard, CMTs are well known for their heterogeneity and complexity of classification [7]. In our study, the tumours characterised by proliferation of myoepithelial cells were included in those lesions receiving less concordance/agreement. The presence of more than one cell type (including myoepithelial cells) in CMTs often requires IHC for definitive characterization; therefore, ancillary tests could be necessary for a definitive diagnosis and should be suggested and accounted for within the report [7]. Additional histostaining and IHC were demonstrated as also improving agreement in other types of tumours [21,59].
In our study, tumour grading also showed some discordance, and the agreement could not be calculated because all three grades were frequently applied by WP for the same malignant lesions. Grading has been often found as one of the most reliable prognostic parameters in multivariate analysis [1,2,88]. However, our and other studies, both on humans and on dogs [25] and references therein, indicate that the grading system contains weakly standardizable parameters that can be more easily affected by subjective evaluation [8]. As already reported, a two-tiered system might eventually increase the concordance [10]. In one study evaluating IOV of histological grading of 46 malignant CMTs performed by three mammary pathologists from the same institution, a moderate to substantial agreement (range of kappa means 0.51–0.71) was obtained [8]. This was in accordance with other similar human studies [10,25]. The lowest values were those conferred to nuclear pleomorphism (0.51k) and mitotic count (0.69k) [8]. Evaluation of pleomorphism has already been considered as one of the least concordant features in tumours, due to its heterogeneity within the same tumour and the qualitative subjective nature of the evaluation [10,25]. The mitotic count is instead strongly affected by the selection of areas for the evaluation [8,25,27]. In our study it was performed on digital slides, precisely defining the methodology; however, fields of evaluation varied depending on the starting field that was subjectively established as it was the chosen direction of consecutive fields. Within this framework, digital and computer-aided pathology (CAD), referred to a computational diagnosis system or a set of methodologies that utilises computers or software to interpret pathologic images, are considered emergent fields that will deeply change the temporal and spatial domains of pathologic diagnosis. Thus, CAD systems using machine learning algorithms have been demonstrated to improve classification accuracy and improve reproducibility, reducing the IOV [89,90,91,92].
Taking into account that in veterinary medicine, ring studies to assess IOV are few [8,53,57,59,61] and that a multitude of methodologies are utilised, our study should be interpreted considering some limitations.
First, the pathologists were aware that they were evaluating slides covering nearly all entities present in the DTF classification and this could have influenced interpretive performance, although this bias is likely to have been, at least partially, overcome during the observation of the slides by all the participants who, in the end, repeated the same diagnosis once or twice.
Second, we used only a single section per case. However, in clinical practice, pathologists typically review multiple slides per case and can request additional levels or ancillary immunohistochemical stains to reach a final diagnosis, particularly when more than one cell type is suspected, for example, involving pleomorphic myoepithelial cells [93,94].
Third, being aware of the complexity of CMT diagnoses, the WP carefully defined consensus guidelines based on the DTF classification that could have raised the level of concordance. In order to precisely assess the role of guidelines versus just the DTF classification, a new study should be performed comparing two groups of pathologists applying the same DTF classification and then either using or not the discussed guidelines. The application of guidelines has been already demonstrated as useful in increasing consensus and, therefore, should be considered in addition to or within internationally recognized classification systems [39,71,95].
Further ring studies should be performed, correcting some biases. Surely, the inclusion of more pathologists with even more variable professional expertise form worldwide countries should be considered, in which impact and, therefore, experience in CMTs can be diverse (e.g., Mediterranean countries have more CMTs compared to the US due to cultural attitudes in spaying female dogs) [96,97] and the distribution of cases with variable more realistic frequencies, as it would be in standard routine diagnosis.

5. Conclusions

There is no doubt that pathological examination has led to many of the currently used classifications and that morphological observation and its correlation with clinical parameters has provided a sound basis for clinical medicine as it is today. It is also true, however, that subjective histopathological approaches invalidate the overall concepts. Therefore, it is of critical importance to have a diagnosis that is reproducible. The reduction in methodological variables between veterinary pathologists would also improve comparison of studies regarding CMTs. To achieve this goal, we set to revisit the histopathological criteria for diagnosis of CMTs, considering the main findings of all entities described in the last classification of CMTs and to assign a weighting to criteria that drive the diagnosis and grade of these tumours. In this study of pathologists, the overall agreement between the individual pathologists’ interpretation and reference diagnosis (majority opinion) was relatively high when classifying the nature of the lesion (H/B/M), but a bit lower when categorising the specific histotype.
Therefore, several efforts still need to be made to further standardise the application of international classification systems, particularly when approaching heterogeneous diseases, as mammary tumours are in dogs.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/vetsci9070357/s1, Table S1: histological grading given by panellists to the 36 studied cases.

Author Contributions

Conceptualization, S.P., M.I.C., V.B., B.B., G.P.B., C.C., V.G., S.I., L.M., F.M. (Francesca Mariotti), F.M. (Francesca Millanta), O.P., R.R., M.R., A.S., V.Z.; methodology, M.I.C.; software, M.I.C.; formal analysis, S.P., M.I.C., V.B., B.B., G.P.B., C.C., V.G., S.I., L.M., F.M. (Francesca Mariotti), F.M. (Francesca Millanta), O.P., R.R., M.R., A.S., V.Z.; investigation, S.P., M.I.C., V.B., B.B., G.P.B., C.C., V.G., S.I., L.M., F.M. (Francesca Mariotti), F.M. (Francesca Millanta), O.P., R.R., M.R., A.S., V.Z.; data curation, M.I.C. and V.Z.; writing—original draft preparation, S.P., M.I.C., V.Z.; writing—review and editing, S.P., M.I.C., V.B., B.B., G.P.B., C.C., V.G., S.I., L.M., F.M. (Francesca Mariotti), F.M. (Francesca Millanta), O.P., R.R., M.R., A.S., V.Z.; supervision, V.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from V.Z. and was the one signed by owners or clinicians submitting the samples.

Data Availability Statement

Original data of samples are within institutional archives and have mainly been submitted through the platform www.simbavet.org, however personal data on clinicians, owners and animals are not available to the public.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nguyen, F.; Peña, L.; Ibisch, C.; Loussouarn, D.; Gama, A.; Rieder, N.; Belousov, A.; Campone, M.; Abadie, J. Canine invasive mammary carcinomas as models of human breast cancer. Part 1: Natural history and prognostic factors. Breast Cancer Res. Treat. 2018, 167, 635–648. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Peña, L.; De Andrés, P.J.; Clemente, M.; Cuesta, P.; Pérez-Alenza, M.D. Prognostic value of histological grading in noninflammatory canine mammary carcinomas in a prospective study with two-year follow-up: Relationship with clinical and histological characteristics. Vet. Pathol. 2013, 50, 94–105. [Google Scholar] [CrossRef] [PubMed]
  3. Rasotto, R.; Berlato, D.; Goldschmidt, M.H.; Zappulli, V. Prognostic Significance of Canine Mammary Tumor Histologic Subtypes: An Observational Cohort Study of 229 Cases. Vet. Pathol. 2017, 54, 571–578. [Google Scholar] [CrossRef] [PubMed]
  4. Valdivia, G.; Alonso-Diez, A.; Perez-Alenza, D.; Pena, L. From Conventional to Precision Therapy in Canine Mammary Cancer: A Comprehensive Review. Front. Vet. Sci. 2021, 8. [Google Scholar] [CrossRef] [PubMed]
  5. Hampe, J.F.; Misdorp, W. Tumours and dysplasias of the mammary gland. Bull. World Health Organ. 1974, 50, 111–133. [Google Scholar] [PubMed]
  6. Misdorp, W.; Else, R.W.; Hellmen, E.; Lipscomb, T.P. Histological Classification of Mammary Tumors of the Dog and the Cat; Armed Forces Institute of Pathology: Washington, DC, USA, 1999. [Google Scholar]
  7. Zapulli, V.; Peña, L.; Rasotto, R.; Goldschmidt, M.H.; Gama, A.; Scruggs, J.L. Surgical Pathology of Tumors of Domestic Animals Volume 2: Volume 2: Mammary Tumors: Mammary Tumors; Kiupel, M., Ed.; Davis-Thompson DVM Foundation: Washington, DC, USA, 2019; Volume 2. [Google Scholar]
  8. Santos, M.; Correia-Gomes, C.; Santos, A.; de Matos, A.; Dias-Pereira, P.; Lopes, C. Interobserver Reproducibility of Histological Grading of Canine Simple Mammary Carcinomas. J. Comp. Pathol. 2015, 153, 22–27. [Google Scholar] [CrossRef] [PubMed]
  9. Chu, P.Y.; Liao, A.T.; Liu, C.H. Interobserver Variation in the Morphopathological Diagnosis of Canine Mammary Gland Tumor Among Veterinary Pathologists. Int. J. Appl. Res. Vet. Med. 2011, 9, 388–391. [Google Scholar]
  10. Adams, A.L.; Chhieng, D.C.; Bell, W.C.; Winokur, T.; Hameed, O. Histologic grading of invasive lobular carcinoma: Does use of a 2-tiered nuclear grading system improve interobserver variability? Ann. Diagn. Pathol. 2009, 13, 223–225. [Google Scholar] [CrossRef]
  11. Denkert, C.; Wienert, S.; Poterie, A.; Loibl, S.; Budczies, J.; Badve, S.; Bago-Horvath, Z.; Bane, A.; Bedri, S.; Brock, J.; et al. Standardized evaluation of tumor-infiltrating lymphocytes in breast cancer: Results of the ring studies of the international immuno-oncology biomarker working group. Mod. Pathol. 2016, 29, 1155–1164. [Google Scholar] [CrossRef]
  12. van Dam, P.J.; van der Stok, E.P.; Teuwen, L.A.; Van den Eynden, G.G.; Illemann, M.; Frentzas, S.; Majeed, A.W.; Eefsen, R.L.; Coebergh van den Braak, R.R.J.; Lazaris, A.; et al. International consensus guidelines for scoring the histopathological growth patterns of liver metastasis. Br. J. Cancer 2017, 117, 1427–1441. [Google Scholar] [CrossRef] [Green Version]
  13. Lohse, A.W.; Sebode, M.; Bhathal, P.S.; Clouston, A.D.; Dienes, H.P.; Jain, D.; Gouw, A.S.H.; Guindi, M.; Kakar, S.; Kleiner, D.E.; et al. Consensus recommendations for histological criteria of autoimmune hepatitis from the International AIH Pathology Group: Results of a workshop on AIH histology hosted by the European Reference Network on Hepatological Diseases and the European Society of Pathology: Results of a workshop on AIH histology hosted by the European Reference Network on Hepatological Diseases and the European Society of Pathology. Liver. Int. 2022, 42, 1058–1069. [Google Scholar] [CrossRef]
  14. Willmann, M.; Yuzbasiyan-Gurkan, V.; Marconato, L.; Dacasto, M.; Hadzijusufovic, E.; Hermine, O.; Sadovnik, I.; Gamperl, S.; Schneeweiss-Gleixner, M.; Gleixner, K.V.; et al. Proposed Diagnostic Criteria and Classification of Canine Mast Cell Neoplasms: A Consensus Proposal. Front. Vet. Sci. 2021, 8, 10. [Google Scholar] [CrossRef]
  15. Gilles, F.H.; Tavare, C.J.; Becker, L.E.; Burger, P.C.; Yates, A.J.; Pollack, I.F.; Finlay, J.L. Pathologist interobserver variability of histologic features in childhood brain tumors: Results from the CCG-945 study. Pediatr. Dev. Pathol. 2008, 11, 108–117. [Google Scholar] [CrossRef]
  16. Eefting, D.; Schrage, Y.M.; Geirnaerdt, M.J.; Le Cessie, S.; Taminiau, A.H.; Bovée, J.V.; Hogendoorn, P.C.; Consortium, E. Assessment of interobserver variability and histologic parameters to improve reliability in classification and grading of central cartilaginous tumors. Am. J. Surg. Pathol. 2009, 33, 50–57. [Google Scholar] [CrossRef]
  17. Evans, H.L.; Ayala, A.G.; Romsdahl, M.M. Prognostic factors in chondrosarcoma of bone: A clinicopathologic analysis with emphasis on histologic grading. Cancer 1977, 40, 818–831. [Google Scholar] [CrossRef]
  18. Corazza, G.R.; Villanacci, V.; Zambelli, C.; Milione, M.; Luinetti, O.; Vindigni, C.; Chioda, C.; Albarello, L.; Bartolini, D.; Donato, F. Comparison of the interobserver reproducibility with different histologic criteria used in celiac disease. Clin. Gastroenterol. Hepatol. 2007, 5, 838–843. [Google Scholar] [CrossRef]
  19. Oberhuber, G.; Granditsch, G.; Vogelsang, H. The histopathology of coeliac disease: Time for a standardized report scheme for pathologists. Eur. J. Gastroenterol. Hepatol. 1999, 11, 1185–1194. [Google Scholar] [CrossRef]
  20. Corazza, G.R.; Villanacci, V. Coeliac disease. J. Clin. Pathol. 2005, 58, 573–574. [Google Scholar] [CrossRef] [Green Version]
  21. Rugge, M.; Sacchi, D.; Genta, R.M.; Zanco, F.; Guzzinati, S.; Pizzi, M.; Fassan, M.; Di Sabatino, A.; El-Serag, H. Histological assessment of gastric pseudopyloric metaplasia: Intra- and inter-observer consistency. Dig. Liver Dis. 2021, 53, 61–65. [Google Scholar] [CrossRef]
  22. Nagtegaal, I.D.; Odze, R.D.; Klimstra, D.; Paradis, V.; Rugge, M.; Schirmacher, P.; Washington, K.M.; Carneiro, F.; Cree, I.A.; WHO Classification of Tumours Editorial Board. The 2019 WHO classification of tumours of the digestive system. Histopathology 2020, 76, 182–188. [Google Scholar] [CrossRef] [Green Version]
  23. Barbosa, S.H.B.; Lázaro, G.C.; Franco, L.M.; Valença, J.T.; Nobre E Souza, M.; Souza, M.H.L.P. Agreement between different pathologists in histopathologic diagnosis od 128 gastric polyps. Arq. Gastroenterol. 2017, 54, 263–266. [Google Scholar] [CrossRef] [Green Version]
  24. Park, H.R.; Min, K.; Kim, H.S.; Jung, W.W.; Park, Y.K. Expression of vascular endothelial growth factor-C and its receptor in osteosarcomas. Pathol. Res. Pract. 2008, 204, 575–582. [Google Scholar] [CrossRef]
  25. Meyer, J.S.; Alvarez, C.; Milikowski, C.; Olson, N.; Russo, I.; Russo, J.; Glass, A.; Zehnbauer, B.A.; Lister, K.; Parwaresch, R.; et al. Breast carcinoma malignancy grading by Bloom-Richardson system vs proliferation index: Reproducibility of grade and advantages of proliferation index. Mod. Pathol. 2005, 18, 1067–1078. [Google Scholar] [CrossRef]
  26. Elston, C.; Ellis, I. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: Experience from a large study with long-term follow-up. Histopathology 1991, 19, 403–410. [Google Scholar] [CrossRef]
  27. Longacre, T.A.; Ennis, M.; Quenneville, L.A.; Bane, A.L.; Bleiweiss, I.J.; Carter, B.A.; Catelano, E.; Hendrickson, M.R.; Hibshoosh, H.; Layfield, L.J.; et al. Interobserver agreement and reproducibility in classification of invasive breast carcinoma: An NCI breast cancer family registry study. Mod. Pathol. 2006, 19, 195–207. [Google Scholar] [CrossRef]
  28. Allison, K.H.; Hammond, M.E.H.; Dowsett, M.; McKernin, S.E.; Carey, L.A.; Fitzgibbons, P.L.; Hayes, D.F.; Lakhani, S.R.; Chavez-MacGregor, M.; Perlmutter, J.; et al. Estrogen and Progesterone Receptor Testing in Breast Cancer: American Society of Clinical Oncology/College of American Pathologists Guideline Update. Arch. Pathol Lab. Med. 2020, 144, 545–563. [Google Scholar] [CrossRef] [Green Version]
  29. Gomes, D.S.; Porto, S.S.; Balabram, D.; Gobbi, H. Inter-observer variability between general pathologists and a specialist in breast pathology in the diagnosis of lobular neoplasia, columnar cell lesions, atypical ductal hyperplasia and ductal carcinoma in situ of the breast. Diagn. Pathol. 2014, 9, 121. [Google Scholar] [CrossRef] [Green Version]
  30. Lakhani, S.R.; Ellis, I.O.; Schnitt, S.; Tan, P.H.; van de Vijver, M. WHO Classification of Tumours of the Breast, IV ed.; World Health Organization, International Agency for Research on Cancer: Lyon, France, 2012. [Google Scholar]
  31. Elmore, J.G.; Longton, G.M.; Carney, P.A.; Geller, B.M.; Onega, T.; Tosteson, A.N.; Nelson, H.D.; Pepe, M.S.; Allison, K.H.; Schnitt, S.J.; et al. Diagnostic concordance among pathologists interpreting breast biopsy specimens. JAMA 2015, 313, 1122–1132. [Google Scholar] [CrossRef]
  32. Makela, K.; Hodgson, U.; Piilonen, A.; Kelloniemi, K.; Bloigu, R.; Sutinen, E.; Salmenkivi, K.; Ronty, M.; Lappi-Blanco, E.; Myllarniemi, M.; et al. Analysis of the Histologic Features Associated With Interobserver Variation in Idiopathic Pulmonary Fibrosis. Am. J. Surg. Pathol. 2018, 42, 672–678. [Google Scholar] [CrossRef]
  33. Raghu, G.; Collard, H.R.; Egan, J.J.; Martinez, F.J.; Behr, J.; Brown, K.K.; Colby, T.V.; Cordier, J.F.; Flaherty, K.R.; Lasky, J.A.; et al. An Official ATS/ERSARS/ALAT Statement: Idiopathic Pulmonary Fibrosis: Evidence-based Guidelines for Diagnosis and Management. Am. J. Respir. Crit. Care Med. 2011, 183, 788–824. [Google Scholar] [CrossRef] [Green Version]
  34. Hashisako, M.; Tanaka, T.; Terasaki, Y.; Uekusa, T.; Achcar, R.D.; Aswad, B.I.; Bamefleh, H.S.; Capelozzi, V.L.; English, J.C.; Fabro, A.T.; et al. Interobserver Agreement of Usual Interstitial Pneumonia Diagnosis Correlated With Patient Outcome. Arch. Pathol. Lab. Med. 2016, 140, 1375–1382. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Grilley-Olson, J.E.; Hayes, D.N.; Moore, D.T.; Leslie, K.O.; Wilkerson, M.D.; Qaqish, B.F.; Hayward, M.C.; Cabanski, C.R.; Yin, X.Y.; Socinski, M.A.; et al. Validation of Interobserver Agreement in Lung Cancer Assessment: Hematoxylin-Eosin Diagnostic Reproducibility for Non-Small Cell Lung Cancer The 2004 World Health Organization Classification and Therapeutically Relevant Subsets. Arch. Pathol. Lab. Med. 2013, 137, 32–40. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Travis, W.D.; Brambilla, E.; Muller-Hermelink, H.K.; Harris, C.C. World Health Organization classification of tumours. Am. J. Surg. Pathol. 2004, 10, 179–184. [Google Scholar]
  37. Thunnissen, E.; Beasley, M.B.; Borczuk, A.C.; Brambilla, E.; Chirieac, L.R.; Dacic, S.; Flieder, D.; Gazdar, A.; Geisinger, K.; Hasleton, P.; et al. Reproducibility of histopathological subtypes and invasion in pulmonary adenocarcinoma. An international interobserver study. Mod. Pathol. 2012, 25, 1574–1583. [Google Scholar] [CrossRef]
  38. Yoshizawa, A.; Motoi, N.; Riely, G.J.; Sima, C.S.; Gerald, W.L.; Kris, M.G.; Park, B.J.; Rusch, V.W.; Travis, W.D. Impact of proposed IASLC/ATS/ERS classification of lung adenocarcinoma: Prognostic subgroups and implications for further revision of staging based on analysis of 514 stage I cases. Mod. Pathol. 2011, 24, 653–664. [Google Scholar] [CrossRef]
  39. Nicholson, A.G.; Torkko, K.; Viola, P.; Duhig, E.; Geisinger, K.; Borczuk, A.C.; Hiroshima, K.; Tsao, M.S.; Warth, A.; Lantuejoul, S.; et al. Interobserver Variation among Pathologists and Refinement of Criteria in Distinguishing Separate Primary Tumors from Intrapulmonary Metastases in Lung. J. Thorac. Oncol. 2018, 13, 205–217. [Google Scholar] [CrossRef]
  40. Girard, N.; Deshpande, C.; Lau, C.; Finley, D.; Rusch, V.; Pao, W.; Travis, W.D. Comprehensive Histologic Assessment Helps to Differentiate Multiple Lung Primary Nonsmall Cell Carcinomas From Metastases. Am. J. Surg. Pathol. 2009, 33, 1752–1764. [Google Scholar] [CrossRef] [Green Version]
  41. Shi, K.H.; Compres, E.; Walton, K.E.; Mohan, L.S.; Zhang, B.; Panah, E.; Quan, V.T.L.; Garfield, E.M.; Khan, A.U.; Kim, D.; et al. Incorporation of dermoscopy improves inter-observer agreement among dermatopathologists in histologic assessment of melanocytic neoplasms. Arch. Dermatol. Res. 2021, 313, 101–108. [Google Scholar] [CrossRef]
  42. Furness, P.N.; Taub, N.; Assmann, K.J.M.; Banfi, G.; Cosyns, J.P.; Dorman, A.M.; Hill, C.M.; Kapper, S.K.; Waldherr, R.; Laurinavicius, A.; et al. International variation in histologic grading is large, and persistent feedback does not improve reproducibility. Am. J. Surg. Pathol. 2003, 27, 805–810. [Google Scholar] [CrossRef] [Green Version]
  43. Racusen, L.C.; Solez, K.; Colvin, R.B.; Bonsib, S.M.; Castro, M.C.; Cavallo, T.; Croker, B.P.; Demetris, A.J.; Drachenberg, C.B.; Fogo, A.B.; et al. The Banff 97 working classification of renal allograft pathology. Kidney Int. 1999, 55, 713–723. [Google Scholar] [CrossRef] [Green Version]
  44. Ganti, A.; Brown, H.J.; Gattuso, P.; Ghai, R.; Papagiannopoulos, P.; Batra, P.S.; Tajudeen, B.A. Inter-pathologist Agreement on Structured Histopathology Reporting in Chronic Rhinosinusitis. Ann. Otol. Rhinol. Laryngol. 2021, 130, 899–903. [Google Scholar] [CrossRef]
  45. Hasegawa, T.; Yamamoto, S.; Nojima, T.; Hirose, T.; Nikaido, T.; Yamashiro, K.; Matsuno, Y. Validity and reproducibility of histologic diagnosis and grading for adult soft-tissue sarcomas. Hum. Pathol. 2002, 33, 111–115. [Google Scholar] [CrossRef]
  46. Hasegawa, T.; Yokoyama, R.; Lee, Y.H.; Shimoda, T.; Beppu, Y.; Hirohashi, S. Prognostic relevance of a histological grading system using MIB-1 for adult soft-tissue sarcoma. Oncology 2000, 58, 66–74. [Google Scholar] [CrossRef]
  47. Tramm, T.; Di Caterino, T.; Jylling, A.M.B.; Lelkaitis, G.; Laenkholm, A.V.; Rago, P.; Tabor, T.P.; Talman, M.L.M.; Vouza, E.; Sci Comm, P.; et al. Standardized assessment of tumor-infiltrating lymphocytes in breast cancer: An evaluation of inter-observer agreement between pathologists. Acta Oncol. 2018, 57, 90–94. [Google Scholar] [CrossRef] [Green Version]
  48. Salgado, R.; Denkert, C.; Campbell, C.; Savas, P.; Nuciforo, P.; Aura, C.; de Azambuja, E.; Eidtmann, H.; Ellis, C.E.; Baselga, J.; et al. Tumor-Infiltrating Lymphocytes and Associations With Pathological Complete Response and Event-Free Survival in HER2-Positive Early-Stage Breast Cancer Treated With Lapatinib and Trastuzumab A Secondary Analysis of the NeoALTTO Trial. JAMA Oncol. 2015, 1, 448–455. [Google Scholar] [CrossRef]
  49. Kilmartin, D.; O’Loughlin, M.; Andreu, X.; Bago-Horvath, Z.; Bianchi, S.; Chmielik, E.; Cserni, G.; Figueiredo, P.; Floris, G.; Foschini, M.P.; et al. Intra-Tumour Heterogeneity Is One of the Main Sources of Inter-Observer Variation in Scoring Stromal Tumour Infiltrating Lymphocytes in Triple Negative Breast Cancer. Cancers 2021, 13, 4410. [Google Scholar] [CrossRef]
  50. Phythian, C.J.; Cripps, P.J.; Grove-White, D.; Michalopoulou, E.; Duncan, J.S. Inter-observer agreement for clinical examinations of foot lesions of sheep. Vet. J. 2016, 216, 189–195. [Google Scholar] [CrossRef] [Green Version]
  51. Hodgkinson, O. The importance of feet examination in sheep health management. Small Rumin. Res. 2010, 92, 67–71. [Google Scholar] [CrossRef]
  52. Winter, A. Lameness in sheep 1. Diagnosis. In Practice 2004, 26, 58–63. [Google Scholar] [CrossRef]
  53. Lidbury, J.A.; Hoffmann, A.R.; Ivanek, R.; Cullen, J.M.; Porter, B.F.; Oliveira, F.; Van Winkle, T.J.; Grinwis, G.C.; Sucholdolski, J.S.; Steiner, J.M. Interobserver Agreement Using Histological Scoring of the Canine. J. Vet. Intern. Med. 2017, 31, 778–783. [Google Scholar] [CrossRef]
  54. Ingh, T.S.G.A.M.v.d.; Winkle, T.J.V.; Cullen, J.M.; Charles, J.A.; Desmet, V.J. Chapter 7 Morphological classification of parenchymal disorders of the canine and feline liver: 2. Hepatocellular death, hepatitis and cirrhosis. In WSAVA Standards for Clinical and Histological Diagnosis of Canine and Feline Liver Disease; Saunders: Philadelphia, PA, USA, 2006; pp. 85–101. [Google Scholar]
  55. Goldschmidt, M.; Pena, L.; Rasotto, R.; Zappulli, V. Classification and Grading of Canine Mammary Tumors. Vet. Path. 2011, 48, 117–131. [Google Scholar] [CrossRef]
  56. Karayannopoulou, M.; Kaldrymidou, E.; Constantinidis, T.C.; Dessiris, A. Histological grading and prognosis in dogs with mammary carcinomas: Application of a human grading method. J. Comp. Pathol. 2005, 133, 246–252. [Google Scholar] [CrossRef]
  57. Northrup, N.C.; Howerth, E.W.; Harmon, B.G.; Brown, C.A.; Carmicheal, K.P.; Garcia, A.P.; Latimer, K.S.; Munday, J.S.; Rakich, P.M.; Richey, L.J.; et al. Variation among pathologists in the histologic grading of canine cutaneous mast cell tumors with uniform use of a single grading reference. J. Vet. Diagn. Investig. 2005, 17, 561–564. [Google Scholar] [CrossRef] [Green Version]
  58. Patnaik, A.K.; Ehler, W.J.; MacEwen, E.G. Canine cutaneous mast cell tumor: Morphologic grading and survival time in 83 dogs. Vet. Pathol. 1984, 21, 469–474. [Google Scholar] [CrossRef]
  59. Belluco, S.; Avallone, G.; Di Palma, S.; Rasotto, R.; Oevermann, A. Inter- and Intraobserver Agreement of Canine and Feline Nervous System Tumors. Vet. Pathol. 2019, 56, 342–349. [Google Scholar] [CrossRef]
  60. Higgins, R.J.; Bollen, A.W.; Dickinson, P.G.; Sisò-Llonch, S. Tumours of the nervous system. In Tumors in Domestic Animals, 5th ed.; Wiley, J., Ed.; 2017; Volume 98, pp. 891–934. [Google Scholar]
  61. Yap, F.W.; Rasotto, R.; Priestnall, S.L.; Parsons, K.J.; Stewart, J. Intra- and inter-observer agreement in histological assessment of canine soft tissue sarcoma. Vet. Comp. Oncol. 2017, 15, 1553–1557. [Google Scholar] [CrossRef] [Green Version]
  62. Dennis, M.M.; Reddacliff, L.A.; Whittington, R.J. Longitudinal study of clinicopathological features of Johne’s disease in sheep naturally exposed to Mycobacterium avium subspecies paratuberculosis. Vet. Pathol. 2011, 48, 565–575. [Google Scholar] [CrossRef]
  63. Zappulli, V.; Baldassarre, V.; Brunetti, B.; Burrai, G.P.; Cocumelli, C.; Grieco, V.; Iussich, S.; Maniscalco, V.; Mariotti, F.; Millanta, F.; et al. Linee Guida per la Diagnosi dei Processi Iperplastici/Displasici e Neoplastici della Mammella del Cane e del Gatto; AIPVet: Dolenjske Toplice, Slovenia, 2021. [Google Scholar]
  64. Fritz, A.G. International Classification of Diseases for Oncology: ICD-O, 3rd ed.; first revision; World Health Organization: Geneva, Switzerland, 2013. [Google Scholar]
  65. Altman, D.G. Practical Statistics for Medical Research, 2nd ed.; Chapman & Hall/CRC: London, UK, 1991. [Google Scholar]
  66. Shoukri, M.M. Measures of Interobserver Agreement and Reliability; Chapman & Hall/CRC: London, UK, 2004. [Google Scholar]
  67. Szklo, M.a.N.F.J. Quality assurance and control. In Epidemiology: Beyond the Basis, 2nd ed.; Jones and Barlett: Sudbury, ON, Canada, 2006; pp. 297–350. [Google Scholar]
  68. Fleiss, J.L.; Levin, B.; Paik, M.C. Statistical Methods for Rates and Proportions; John Wiley and Sons: New York, USA, 2003. [Google Scholar]
  69. Joe, H.W. Hierarchical Grouping to Optimize an Objective Function, urldate = 2022-04-21. J. Am. Stat. Assoc. 1963, 58, 236–244. [Google Scholar]
  70. Meuten, D.J.; Moore, F.M.; George, J.W. Mitotic Count and the Field of View Area: Time to Standardize. Vet. Pathol. 2016, 53, 7–9. [Google Scholar] [CrossRef] [Green Version]
  71. Allison, K.H.; Reisch, L.M.; Carney, P.A.; Weaver, D.L.; Schnitt, S.J.; O’Malley, F.P.; Geller, B.M.; Elmore, J.G. Understanding diagnostic variability in breast pathology: Lessons learned from an expert consensus review panel. Histopathology 2014, 65, 240–251. [Google Scholar] [CrossRef] [Green Version]
  72. Cardoso, F.; Paluch-Shimon, S.; Senkus, E.; Curigliano, G.; Aapro, M.S.; André, F.; Barrios, C.H.; Bergh, J.; Bhattacharyya, G.S.; Biganzoli, L.; et al. 5th ESO-ESMO international consensus guidelines for advanced breast cancer (ABC 5). Ann. Oncol. 2020, 31, 1623–1649. [Google Scholar] [CrossRef] [PubMed]
  73. Lester, S.C.; Bose, S.; Chen, Y.Y.; Connolly, J.L.; de Baca, M.E.; Fitzgibbons, P.L.; Hayes, D.F.; Kleer, C.; O’Malley, F.P.; Page, D.L.; et al. Protocol for the examination of specimens from patients with invasive carcinoma of the breast. Arch. Pathol. Lab. Med. 2009, 133, 1515–1538. [Google Scholar] [CrossRef] [PubMed]
  74. Murali, R.; Hughes, M.T.; Fitzgerald, P.; Thompson, J.F.; Scolyer, R.A. Interobserver variation in the histopathologic reporting of key prognostic parameters, particularly clark level, affects pathologic staging of primary cutaneous melanoma. Ann. Surg. 2009, 249, 641–647. [Google Scholar] [CrossRef] [PubMed]
  75. Scolyer, R.A.; Judge, M.J.; Evans, A.; Frishberg, D.P.P.; Prieto, V.G.; Thompson, J.F.; Trotter, M.J.; Walsh, M.Y.; Walsh, N.M.; Ellis, D.W. Data set for pathology reporting of cutaneous invasive melanoma: Recommendations from the international collaboration on cancer reporting (ICCR). Am. J. Surg. Pathol. 2013, 37, 1797–1814. [Google Scholar] [CrossRef]
  76. Reagan, J.K.; Selmic, L.E.; Fallon, C.; Driskell, E.A.; Garrett, L.D. Evaluation of information presented within mast cell tumour histopathology reports in the United States: 2012–2015. Vet. Med. Sci. 2018, 4, 252–262. [Google Scholar] [CrossRef]
  77. Taylor, L.A.; Eguchi, M.M.; Reisch, L.M.; Radick, A.C.; Shucard, H.; Kerr, K.F.; Piepkorn, M.W.; Knezevich, S.R.; Elder, D.E.; Barnhill, R.L.; et al. Histopathologic synoptic reporting of invasive melanoma: How reliable are the data? Cancer 2021, 127, 3125–3136. [Google Scholar] [CrossRef]
  78. Nurdjaja, V.; Yozu, M.; Mathy, J.A. Essential Components of Melanoma Histopathological Reporting: The Surgical Oncologist’s Perspective. J. Skin Cancer 2018, 2018, 9838410. [Google Scholar] [CrossRef] [Green Version]
  79. Kim, B.H.; Kim, J.M.; Kang, G.H.; Chang, H.J.; Kang, D.W.; Kim, J.H.; Bae, J.M.; Seo, A.N.; Park, H.S.; Kang, Y.K.; et al. Standardized Pathology Report for Colorectal Cancer, 2nd Edition. J. Pathol. Transl. Med. 2020, 54, 1–19. [Google Scholar] [CrossRef] [Green Version]
  80. Cho, S.Y.; Park, S.Y.; Bae, Y.K.; Kim, J.Y.; Kim, E.K.; Kim, W.G.; Kwon, Y.; Lee, A.; Lee, H.J.; Lee, J.S.; et al. Standardized Pathology Report for Breast Cancer. J. Breast Cancer 2021, 24, 1–21. [Google Scholar] [CrossRef]
  81. Santos, M.; Correia-Gomes, C.; Santos, A.; de Matos, A.; Rocha, E.; Lopes, C.; Pereira, P.D. Nuclear pleomorphism: Role in grading and prognosis of canine mammary carcinomas. Vet. J. 2014, 200, 426–433. [Google Scholar] [CrossRef]
  82. Gandomkar, Z.; Brennan, P.C.; Mello-Thoms, C. Computer-Assisted Nuclear Atypia Scoring of Breast Cancer: A Preliminary Study. J. Digit. Imaging 2019, 32, 702–712. [Google Scholar] [CrossRef]
  83. Mousavikhamene, Z.; Sykora, D.J.; Mrksich, M.; Bagheri, N. Morphological features of single cells enable accurate automated classification of cancer from non-cancer cell lines. Sci. Rep. 2021, 11, 24375. [Google Scholar] [CrossRef]
  84. Dimitriou, N.M.; Flores-Torres, S.; Kinsella, J.M.; Mitsis, G.D. Quantifying the Morphology and Mechanisms of Cancer Progression in 3D in-vitro environments: Integrating Experiments and Multiscale Models. bioRxiv 2022. [Google Scholar] [CrossRef]
  85. Sorenmo, K.U.; Kristiansen, V.M.; Cofone, M.A.; Shofer, F.S.; Breen, A.M.; Langeland, M.; Mongil, C.M.; Grondahl, A.M.; Teige, J.; Goldschmidt, M.H. Canine mammary gland tumours; a histological continuum from benign to malignant; clinical and histopathological evidence. Vet. Comp. Oncol. 2009, 7, 162–172. [Google Scholar] [CrossRef]
  86. Tan, P.H.; Ellis, I.; Allison, K.; Brogi, E.; Fox, S.B.; Lakhani, S.; Lazar, A.J.; Morris, E.A.; Sahin, A.; Salgado, R.; et al. The 2019 World Health Organization classification of tumours of the breast. Histopathology 2020, 77, 181–185. [Google Scholar] [CrossRef]
  87. Webster, J.D.; Dennis, M.M.; Dervisis, N.; Heller, J.; Bacon, N.J.; Bergman, P.J.; Bienzle, D.; Cassali, G.; Castagnaro, M.; Cullen, J.; et al. Recommended guidelines for the conduct and evaluation of prognostic studies in veterinary oncology. Vet. Pathol. 2011, 48, 7–18. [Google Scholar] [CrossRef]
  88. Tanaka, Y.; Koyama, K.; Horiuchi, N.; Watanabe, K.; Kobayashi, Y. Relationship between Histological Grade and Histopathological Appearance in Canine Mammary Carcinomas. J. Comp. Pathol. 2020, 179, 59–64. [Google Scholar] [CrossRef]
  89. Nawaz, M.A.; Sewissy, A.A.; Soliman, T.H.A. Automated Classification of Breast Cancer Histology Images Using Deep Learning Based Convolutional Neural Networks. Int. J. Comput. Sci. Netw. Secur. 2018, 18, 152–160. [Google Scholar]
  90. Pantanowitz, L.; Hartman, D.; Qi, Y.; Cho, E.Y.; Suh, B.; Paeng, K.; Dhir, R.; Michelow, P.; Hazelhurst, S.; Song, S.Y.; et al. Accuracy and efficiency of an artificial intelligence tool when counting breast mitoses. Diagn. Pathol. 2020, 15, 80. [Google Scholar] [CrossRef]
  91. Lashen, A.; Ibrahim, A.; Katayama, A.; Ball, G.; Mihai, R.; Toss, M.; Rakha, E. Visual assessment of mitotic figures in breast cancer: A comparative study between light microscopy and whole slide images. Histopathology 2021, 79, 913–925. [Google Scholar] [CrossRef]
  92. Ginter, P.S.; Idress, R.; D’Alfonso, T.M.; Fineberg, S.; Jaffer, S.; Sattar, A.K.; Chagpar, A.; Wilson, P.; Harigopal, M. Histologic grading of breast carcinoma: A multi-institution study of interobserver variation using virtual microscopy. Mod. Pathol. 2021, 34, 701–709. [Google Scholar] [CrossRef]
  93. Rasotto, R.; Goldschmidt, M.H.; Castagnaro, M.; Carnier, P.; Caliari, D.; Zappulli, V. The dog as a natural animal model for study of the mammary myoepithelial basal cell lineage and its role in mammary carcinogenesis. J. Comp. Pathol. 2014, 151, 166–180. [Google Scholar] [CrossRef]
  94. Sánchez-Céspedes, R.; Millán, Y.; Guil-Luna, S.; Reymundo, C.; Espinosa de Los Monteros, A.; Martín de Las Mulas, J. Myoepithelial cells in canine mammary tumours. Vet. J. 2016, 207, 45–52. [Google Scholar] [CrossRef]
  95. Klaver, E.; van der Wel, M.; Duits, L.; Pouw, R.; Seldenrijk, K.; Offerhaus, J.; Visser, M.; Ten Kate, F.; Biermann, K.; Brosens, L.; et al. Performance of gastrointestinal pathologists within a national digital review panel for Barrett’s oesophagus in the Netherlands: Results of 80 prospective biopsy reviews. J. Clin. Pathol. 2021, 74, 48–52. [Google Scholar] [CrossRef]
  96. Oberbauer, A.M.; Belanger, J.M.; Famula, T.R. A Review of the Impact of Neuter Status on Expression of Inherited Conditions in Dogs. Front. Vet. Sci. 2019, 6, 397. [Google Scholar] [CrossRef]
  97. Fausak, E.D. A Mapping Study of Veterinary Literature on Perceptions and Attitudes of Female Canine Spaying. Front. Vet. Sci. 2020, 7, 559659. [Google Scholar] [CrossRef]
Figure 1. Frequency of results in terms of hyperplasia-dysplasia (H), benign (B), or malignant (M) communicated by individual readers, by sample identification number.
Figure 1. Frequency of results in terms of hyperplasia-dysplasia (H), benign (B), or malignant (M) communicated by individual readers, by sample identification number.
Vetsci 09 00357 g001
Figure 2. Performances of the panellists for outcomes expressed in terms of Hyperplasia-Dysplasia/Benign/Malignant (H, B, M). (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Figure 2. Performances of the panellists for outcomes expressed in terms of Hyperplasia-Dysplasia/Benign/Malignant (H, B, M). (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Vetsci 09 00357 g002
Figure 3. Performances of the panellists for outcomes expressed in terms of ICD-O morphological code/diagnosis. (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Figure 3. Performances of the panellists for outcomes expressed in terms of ICD-O morphological code/diagnosis. (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Vetsci 09 00357 g003
Figure 4. Left, dendrogram resulting after a cluster analysis on the variables synthesising the panellists’ experience; right, dendrogram resulting after a cluster analysis on the classification of lesions as hyperplasia, benign, malignant.
Figure 4. Left, dendrogram resulting after a cluster analysis on the variables synthesising the panellists’ experience; right, dendrogram resulting after a cluster analysis on the classification of lesions as hyperplasia, benign, malignant.
Vetsci 09 00357 g004
Table 1. Scientific studies dealing with inter- and intraobserver variability in the pathological diagnosis (references are listed in alphabetical order for the topic, separately for humans and animals).
Table 1. Scientific studies dealing with inter- and intraobserver variability in the pathological diagnosis (references are listed in alphabetical order for the topic, separately for humans and animals).
StudyTopicSpeciesNo. PatNo. Cases RD Lesion ClassificationClassification SystemGrading SystemOutcomeInterO AgreementIntraO Agreement
Gilles et al., 2008 [15]brain tumours H5229NAhistological criteriaNANAweighted k0.65NA
Eefting et al., 2009 [16]cartilagino-us tumoursH1816NAhistological gradeNAEvans (1977) [17]weighted k0.58–0.78NA
Corazza et al., 2007 [18]celiac diseaseH660NAhistological criteriaOberhuber (1999) [19] & Corazza (2005) [20]NAweighted k0.35–0.55NA
Rugge et al., 2021 [21]gastric metaplasiaH374NAhistological criteria & IHCNagtegaal (2020) [22]NAweighted k0.7–0.90.7 k
Barbosa et al., 2017 [23]gastric polypsH3128majority diagnosisstandard diagnosisPark (2008) [24]NAunweighted k0.40–0.79NA
Meyer et al., 2005 [25]HBCH79000NAhistological grade & IHCNAElston and Ellis (1991) [26]unweighted k0.5–0.59NA
Longacre et al., 2006 [27]HBCH1335NAstandard diagnosis & gradestructured reportElston and Ellis (1991) [26]unweighted k0.3–1 NA
Adams et al., 2009 [10]HBCH538NAhistological gradeNAElston and Ellis (1991) [26] vs 2-tiered systemunweighted k0.32 vs. 0.47NA
Allison et al., 2014 [28]HBCH3201NAstandard diagnosisNRNA% agreement62.70%NA
Gomes et al., 2014 [29]HBCH1610original reporthistological subtypesLakhani (2012) [30]NAunweighted k0.22–0.68NA
Elmore et al., 2015 [31]HBCH1152403 panel members 4 categoriesNRNA% agreement75.30%NA
Mäkelä et al., 2018 [32].lung fibrosisH460NA4 categoriesRaghu (2011) [33]NAunweighted k0.4–0.77NA
Hashisako et al., 2016 [34]lung (interstitial pneumonia)H1120NAhistological criteriaRaghu (2011) [33]NAunweighted k0.23NA
Grilley-Olson et al., 2012 [35]lung tumoursH2496majority diagnosisstandard diagnosis & categoriesTravis (2004) [36]NAweighted k, bootstrap for IC0.25–0.48NA
Thunnissen et al., 2012 [37]lung tumoursH26 (28)115 (64)NAstandard diagnosis (invasion)Yoshizawa (2011) [38]NAunweighted k0.38–0.77 (0.08–0.55)NA
Nicholson et al., 2018 [39]lung tumoursH16126NAhistological criteriaGirard (2009) [40]NAunweighted k0.6NA
Shi et al., 2021 [41]melanocytic neoplasmsH3136NA3 categoriesNRNRunweighted k0.496NA
Furness et al., 2003 [42].renal allograftsH2185NAhistological criteriaRacusen (1999) [43]NAunweighted k0.2–0.4NA
Ganti et al., 2021 [44]rhinosinusitis (chronic)H292NAhistological criteriaStructured reportNAunweighted k0.22–0.64NA
Hasegawa et al., 2002 [45]soft tissue sarcomasH4130expert panel standard diagnosis, grade & IHCNRHasegawa (2000) [46]% agreement, unweighted k75-100%, 0.34–0.86NA
Denkert et al., 2016 [11]TILs in HBCH32 & 28120NAsemiquantitative percentageweb-based & software systemNAICC, unweighted k0.7 & 0.89, 0.45 & 0.63NA
Tramm et al., 2018 [47]TILs in HBCH9124NAcutoff categoriesSalgado (2015) [48]NAICC, unweighted k0.71, 0.38–0.46NA
Kilmartin et al., 2021 [49]TILs in HBCH2349NAabsolute n. & cutoff categoriesscoring digital tool (https://www.tilsinbreastcancer.org)NAICC0.63 & 0.57NA
Phytian et al., 2016 [50]foot lesionsO81158test standard observer macroscopic criteriaHodginkson (2010) [51] & Winter (2004) [52]NAunweighted k0.47–0.72NA
Lidbury et al., 2017 [53]liver lesionsC650NAscoring systemvan den Ingh (2016) [54]NAunweighted k0.16–0.35NA
Chu et al., 2011 [9]mammary tumoursC1015NAbenign vs. malignantNRNAunweighted k0.43NA
Santos et al., 2015 [8]mammary tumoursC3462 panel vet membershistological gradeGoldschmidt (2011) [55]Karayannopoulou (2005) [56]weighted & unweighted k0.5–0.7NA
Northrup et al., 2005 [57]mast cell tumoursC1060previous reporthistological gradeNAPatnaik (1984) [58]% agreement, weighted k62.1%, 0.62NA
Belluco et al., 2019 [59]nervous system tumoursC & F446neuropathologiststandard diagnosis & IHCHiggins (2017) [60]NAunweighted k0.66–0.76NA
Yap et al., 2016 [61]soft tissue sarcomasC370NAhistological criteria & gradeDennis (2011) [62]Dennis (2011) [62]ICC, unweighted k0.6 & 0.43k0.78–1 ICC
RD, reference diagnosis; Pat, pathologists; H, human; O, ovine; C, canine; F, feline; HBC, human breast cancer, TILs, tumor infiltrating lymphocytes; NA, not applicable; NR, not reported; k, Fleiss’s/Cohen’s kappa; ICC, intraclass correlation coefficient; IntraO/interO, intraobserver/interobserver.
Table 2. Davis Thomson Foundation classification of canine mammary tumours [7], associated ICD-O-3.2 codes [64], and applied category of lesion. Main categories are indicated in bold and subcategories in italics.
Table 2. Davis Thomson Foundation classification of canine mammary tumours [7], associated ICD-O-3.2 codes [64], and applied category of lesion. Main categories are indicated in bold and subcategories in italics.
LesionsICD-O-3.2 CodesCategory
1. Hyperplasia/Dysplasia
1.1 Duct ectasia (DE)NAH
1.2 Lobular hyperplasia (LH) (adenosis)
1.2.1 regular (LH-R)NAH
1.2.2 with secretory activity (LH-S)NAH
1.2.3 with fibrosis (LH-F)NAH
1.2.4 with atypia (LH-A)NAH
1.3 Epitheliosis (EP)NAH
1.4 Papillomatosis (PAP)8060/0H
2. Benign epithelial neoplasms
2.1 Simple benign tumours
2.1.1 Adenoma—simple (SAD)8211/0 B
2.1.2 Myoepithelioma (MEP)8982/0B
2.2 Non-simple benign tumours
2.2.1 Complex adenoma (CAD)8983/0B
2.2.2 Benign mixed tumour (BMT)8940/0 B
2.2.3 Fibroadenoma (FAD)9010/0B
2.3 Ductal-associated benign tumours
2.3.1 Ductal adenoma (DAD)8147/0 *B
2.3.2 Intraductal papillary adenoma (IDPA)8503/0B
3. Malignant neoplasms
3.1 Carcinoma–in situnot applied
3.2 Simple carcinomas
3.2.1 Tubular (including cribriform) carcinoma (STC)8211/3M
3.2.2 Tubulopapillary carcinoma (STPC)8263/3M
3.2.3 Solid carcinoma (SoC)8230/3M
3.2.4 Invasive micropapillary carcinoma (IMPC)8507/3M
3.2.5 Comedocarcinoma (CoC)8501/3M
3.2.6 Anaplastic carcinoma (AC)8021/3M
3.3 Non-simple carcinoma
3.3.1 Carcinoma arising in complex adenoma/benign mixed tumour (C in B)8941/3 *M
3.3.2 Complex carcinoma (CC)8983/3M
3.3.3 Carcinoma and malignant myoepithelioma (C&MM)8562/3 M
3.3.4 Mixed carcinoma (MC)8940/3M
3.4 Ductal-associated carcinoma
3.4.1 Ductal carcinoma (DC)8147/3 *M
3.4.2 Intraductal papillary carcinoma (including papillary-cystic) (IDPC)8503/3 M
4. Malignant epithelial neoplasms-special types
4.1 Squamous cell carcinoma (SCC)8070/3 *M
4.2 Adenosquamous carcinoma (ASC)8560/3 *M
4.3 Mucinous carcinoma (MuC)8480/3M
4.4 Lipid-rich carcinoma (LRC)8314/3M
4.5 Spindle cell carcinoma (SPC)8572/3 *M
4.6 Malignant myoepithelioma (MM)8982/3 *M
5. Malignant mesenchymal neoplasms
5.1 Osteosarcoma (OC)9180/3 *M
5.2 Chondrosarcoma (CS)9220/3 *M
5.3 Fibrosarcoma (FS)8810/3 *M
5.4 Hemangiosarcoma (HS)9120/3 *M
5.5 Other sarcomas (other S)8800/3 *M
6. Carcinosarcoma (CS)8980/3 *M
7. Hyperplasia/dysplasia of the Teat
7.1 Melanosis of the skin of the teat (Skin M)NDH
7.2 Hyperplasia of the teat (TH)NDH
8. Neoplasms of the teat
8.1 Benign ductal-associated neoplasms
8.1.1 Ductal adenoma8147/0 *B
8.1.2 Intraductal papillary adenoma8503/0B
8.2 Malignant ductal-associated neoplasms
8.2.1 Ductal carcinoma8147/3 *M
8.2.2 Intraductal papillary carcinoma8503/3 M
8.3 Carcinoma with epidermal infiltration (Paget-like disease) (C-EI)8540/3M
NA, not available; H, hyperplasia/dysplasia; B, Benign tumour; M, malignant tumour. * Code assigned also when name of histotype was different but histological description identical between human and canine lesions.
Table 3. Histological grading for canine mammary tumours [2]. Main categories used for grading are indicated in bold.
Table 3. Histological grading for canine mammary tumours [2]. Main categories used for grading are indicated in bold.
FeaturePoints
A. Tubules formation (a)
Tubules comprise >75% of the tumour1
Tubules comprise 10–75% of the tumour (moderate formation of tubules admixed with non-tubular areas)2
Tubules comprise <10% (minimal or no tubule formation)3
B. Nuclear pleomorphism (b)
Uniform, regular, small nuclei with occasional small nucleoli1
Moderate degree of variation in nuclear size and shape, hyperchromatic nucleus, presence of nucleoli (some of which can be prominent)2
Marked variation in nuclear size, hyperchromatic nucleus, often with more than 1 prominent nucleoli3
C. Mitoses per 10 hpf (c)
0–9/10 hpf1
10–19/10 hpf2
20 or more/10 hpf3
Histological malignant gradingTotale score (A + B + C)
I (low, well differentiated)3–5
II (intermediate, moderately differentiated)6–7
III (high, poorly differentiated)8–9
a In complex and mixed tumours, the percentage of tubular formation is scored considering only epithelial areas. In malignant myoepithelioma, tubular formation is 2. In heterogeneous canine mammary carcinomas, tubular scoring should be assessed in the most representative malignant area. b In complex and mixed tumours, nuclear pleomorphism is evaluated in all the malignant components. c HPF, high-power field. The fields are selected at the periphery or the most mitotically active parts of the sample (not only epithelial cells). Diameter of the field of view = 0.55 mm.
Table 4. Characteristics of working panellists at the time of the study.
Table 4. Characteristics of working panellists at the time of the study.
AgeSexAffiliationYears of Experience *Position, Titles (In Addition to DVM)CMTs Biopsies (per Week)Self-Assessment of Level of Confidence in CMTs (High/Medium/Low)Considered an Expert by Colleagues on CMTsPublished Papers on MTs°
34FPrivate and University6Histopathology Consultant, PhD, ECVP15mediumNO0
46FUniversity15AP, PhD, ECVP5highYES14
40MUniversity7AsP, PhD, ECVP2highYES8
40MIZS°10Senior Scientist10mediumNO0
57FUniversity25PhD3mediumNO2
48FUniversity20AP, MSc^4mediumNO6
38FIZS°10Senior Scientist, PhD7mediumNO6
54FUniversity10AsP4mediumNO4
49FUniversity20AP20highYES16
45MUniversity14FP, PhD5medium NO0
62FUniversity25FP, PhD2mediumNO0
38FPrivate14Senior Consultant, PhD ECVP8highYES12
42FUniversity15AsP2mediumNO1
48FUniversity17AP, PhD1mediumNO2
46FUniversity15FP, MSc, PhD, ECVP6highYES21
* Years interpreting mammary gland pathology cases (not including residency/fellowship training) at 1st June 2021. ^: Master of Science.
Table 5. Classification of histological subtypes by the 15 panellists (P) for the 36 canine mammary tumour samples included in the study. In bold red the diagnoses that differed from the majority opinion (GM), in grey boxes diagnoses repeated by the same panellist.
Table 5. Classification of histological subtypes by the 15 panellists (P) for the 36 canine mammary tumour samples included in the study. In bold red the diagnoses that differed from the majority opinion (GM), in grey boxes diagnoses repeated by the same panellist.
S-IDP01P02P03P04P05P06P07P08P09P10P11P12P13P14P15GM
1IDPAIDPADCDCDCIDPC DCIDPC IDPCDCIDPC IDPAIDPC DCIDPC
2IMPCIMPCSTCIMPCIMPCIMPCIMPCIMPCIMPCIMPCIMPCSTCIMPCIMPCIMPCIMPC
3CADCADCCCCCCCADCADCADCADCADCADCADCCCADCADCAD
4MCC in BMCMCMCMCMCC in BMCMCC in BC in BMCC in BMCMC
5DCSTPCSTPCSTPCSTPCDCIDPCSTPCIDPCIDPCSTPCSTPCSTPCIDPCIDPCSTPC
6C-EISTCSTCSTCSTCICSTPCIMPCSTCSTPCSTCICSTCCCSTCSTC
7STPCIDPAIDPCIDPAIDPASTPCIDPCDCIDPAIDPAIDPAIDPCIDPADADIDPAIDPA
8Skin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin MSkin M
9DEDEDEDEDEDEDEDEDEDEDEDEDEDEDEDE
10STCLH-ASADLH-ALH-ASTCSTCSADDCSADSADSADDADSTCDCSAD
11CoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoCCoC
12EPEPLH-FEPEPEPEPEPLH-REPEPEPEPEPLH-REP
13IDPATHC-EITHTHTHTHC-EITHTHTHTHPAPTHTHTH
14LRCMMLRCLRCLRCLRCLRCLRCLRCLRCLRCLRCLRCLRCLRCLRC
15DADDADDADDADDADDADDADSTCDADDADDADDADSADSADDADDAD
16THASCSCCSCCSCCSCCSCCSCCSCCSCCSCCSCCSCCSCCSCCSCC
17MuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuCMuC
18FADLH-FLH-ALH-FLH-FFADVLH-ALH-ALH-FLH-FLH-FLH-FFADLH-ALH-F
19MEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEPMEP
20SPCSPCOther SSPCSPCOther SOther SOther SMMOther SOther SOther SFSSoCMMOther S
21ACACICACACACACACACICACACACICACAC
22LH-RLH-RLH-RLH-RLH-RLH-RLH-SLH-FLH-SLH-RLH-RLH-RLH-RLH-RLH-SLH-R
23PAPPAPPAPPAPPAPPAPPAPPAPPAPPAPPAPPAPLH-SPAPPAPPAP
24C in BMCMCC in BC in BC in BC in BMCMCC in BMCMCC in BMCMCMC
25ChSOSOSOSOSChSOSOSOSOSOSOSOther SOSOSOS
26LH-FFADFADFADFADLH-FFADFADLH-FLH-AFADFADC&MMLH-ALH-FFAD
27SoCLRCLRCLRCLRCLRCLRCSoCLRCSoCLRCLRCOther SLRCLRCLRC
28SoCDCMMSoCSoCSoCDCMMDCSTCDCDCDCDCDCDC
29SADSADCADCADCADSADLH-ALH-ALH-ACCLH-ALH-ACADLH-FLH-ALH-A
30IDPCIDPCSoCIDPCIDPCC&MMC&MMC&MMSoCASCC&MMC&MMSoCSTPCSoCC&MM
31MMC&MMC&MMC&MMC&MMMMCCSPCCCC&MMCCCCMMC&MMCCC&MM
32BMTBMTC in BBMTBMTBMTBMTBMTCADBMTBMTBMTBMTBMTCADBMT
33ASCIDPCASCASCASCASCIDPAASCASCC-EIASCASCASCIDPAASCASC
34CSCSCSCSCSCSCSCSCSCSCSCSOSCSCSCS
35LH-SLH-SLH-SLH-SLH-SLH-SSADLH-SSADLH-SLH-SLH-SLH-ALH-SSADLH-S
36SCCSoCACHSHSHSSoCSoCACHSSoCSoCHSACAC
S-ID, sample identification; see Table 2 for other abbreviations.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Papparella, S.; Crescio, M.I.; Baldassarre, V.; Brunetti, B.; Burrai, G.P.; Cocumelli, C.; Grieco, V.; Iussich, S.; Maniscalco, L.; Mariotti, F.; et al. Reproducibility and Feasibility of Classification and National Guidelines for Histological Diagnosis of Canine Mammary Gland Tumours: A Multi-Institutional Ring Study. Vet. Sci. 2022, 9, 357. https://doi.org/10.3390/vetsci9070357

AMA Style

Papparella S, Crescio MI, Baldassarre V, Brunetti B, Burrai GP, Cocumelli C, Grieco V, Iussich S, Maniscalco L, Mariotti F, et al. Reproducibility and Feasibility of Classification and National Guidelines for Histological Diagnosis of Canine Mammary Gland Tumours: A Multi-Institutional Ring Study. Veterinary Sciences. 2022; 9(7):357. https://doi.org/10.3390/vetsci9070357

Chicago/Turabian Style

Papparella, Serenella, Maria Ines Crescio, Valeria Baldassarre, Barbara Brunetti, Giovanni P. Burrai, Cristiano Cocumelli, Valeria Grieco, Selina Iussich, Lorella Maniscalco, Francesca Mariotti, and et al. 2022. "Reproducibility and Feasibility of Classification and National Guidelines for Histological Diagnosis of Canine Mammary Gland Tumours: A Multi-Institutional Ring Study" Veterinary Sciences 9, no. 7: 357. https://doi.org/10.3390/vetsci9070357

APA Style

Papparella, S., Crescio, M. I., Baldassarre, V., Brunetti, B., Burrai, G. P., Cocumelli, C., Grieco, V., Iussich, S., Maniscalco, L., Mariotti, F., Millanta, F., Paciello, O., Rasotto, R., Romanucci, M., Sfacteria, A., & Zappulli, V. (2022). Reproducibility and Feasibility of Classification and National Guidelines for Histological Diagnosis of Canine Mammary Gland Tumours: A Multi-Institutional Ring Study. Veterinary Sciences, 9(7), 357. https://doi.org/10.3390/vetsci9070357

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop