Next Article in Journal
The Effect of Herbal Feed Additives in the Diet of Dairy Goats on Intestinal Lactic Acid Bacteria (LAB) Count
Next Article in Special Issue
Biomolecules, Fatty Acids, Meat Quality, and Growth Performance of Slow-Growing Chickens in an Organic Raising System
Previous Article in Journal
Sunflower Meal Inclusion Rate and the Effect of Exogenous Enzymes on Growth Performance of Broiler Chickens
Previous Article in Special Issue
Effects of Cyclic Thermal Stress at Later Age on Production Performance and Meat Quality of Fast-Growing, Medium-Growing and Thai Native Chickens
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reliability of a White Striping Scoring System and Description of White Striping Prevalence in Purebred Turkey Lines

by
Ryley J. Vanderhout
1,
Emily M. Leishman
1,
Heather Hiscock
2,
Emhimad A. Abdalla
1,
Bayode O. Makanjuola
1,
Jeff Mohr
3,
Alexandra Harlander-Matauschek
4,
Shai Barbut
2,
Benjamin J. Wood
1,3,5,
Christine F. Baes
1,6,* and
Nienke van Staaveren
1,4
1
Centre for the Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1, Canada
2
Department of Food Science, University of Guelph, Guelph, ON N1G 2W1, Canada
3
Hybrid Turkeys, 650 Riverbend Drive Suite C, Kitchener, ON N2K 3S2, Canada
4
The Campbell Centre for the Study of Animal Welfare, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1, Canada
5
School of Veterinary Science, University of Queensland, Gatton, QLD 4343, Australia
6
Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001 Bern, Switzerland
*
Author to whom correspondence should be addressed.
Animals 2022, 12(3), 254; https://doi.org/10.3390/ani12030254
Submission received: 13 October 2021 / Revised: 22 December 2021 / Accepted: 18 January 2022 / Published: 21 January 2022
(This article belongs to the Special Issue Poultry Production: Factors Affecting Meat Quality)

Abstract

:

Simple Summary

A relatively recent issue in the turkey industry is white striping (presence of white striations on the surface of the breast fillets). This defect influences consumer acceptance and the nutritional value of the meat and, therefore, is of economic importance to the industry. This study is aimed to test the reliability of a white striping scoring system used by several observers and estimate the prevalence of this defect in modern turkeys. After a few training sessions, the scoring system was found to be moderately reliable within and between the six participating observers. We found that 88% of turkeys in the studied population had some degree of white striping, with most scores being of moderate-severe severity (Score 1 or 2). Furthermore, white striping severity was found to be associated with higher slaughter weight, breast weight, and breast meat yield. Future research is needed to evaluate the use of white striping information in turkey genetic selection programs, as a balanced approach is needed to avoid slowing gains in economically favorable traits, such as growth.

Abstract

To efficiently meet consumer demands for high-quality lean meat, turkeys are selected for increased meat yield, mainly by increasing breast muscle size and growth efficiency. Over time, this has altered muscle morphology and development rates, which are believed to contribute to the prevalence of myopathies. White striping is a myopathy of economic importance which presents as varying degrees of white striations on the surface of skinless breast muscle and can negatively affect consumer acceptance at the point of sale. Breeding for improved meat quality may be a novel strategy for mitigating the development of white striping in turkey meat; however, it is crucial to have a reliable assessment tool before it can be considered as a phenotype. Six observers used a four-category scoring system (0–3) to score severity in several controlled rounds and evaluate intra- and inter-observer reliability of the scoring system. After sufficient inter-observer reliability (Kendall’s W > 0.6) was achieved, 12,321 turkey breasts, from four different purebred lines, were scored to assess prevalence of the condition and analyze its relationship with important growth traits. Overall, the prevalence of white striping (Score > 0) was approximately 88% across all genetic lines studied, with most scores being of moderate-severe severity (Score 1 or 2). As was expected, increased white striping severity was associated with higher slaughter weight, breast weight, and breast meat yield (BMY) within each genetic line. This study highlights the importance of training to improve the reliability of a scoring system for white striping in turkeys and was required to provide an updated account on white striping prevalence in modern turkeys. Furthermore, we showed that white striping is an important breast muscle myopathy in turkeys linked to heavily selected traits such as body weight and BMY. White striping should be investigated further as a novel phenotype in future domestic turkey selection through use of a balanced selection index.

1. Introduction

The consumption of turkey meat is increasing worldwide. It is the second most-consumed poultry meat globally, particularly in Europe and North and South America [1]. With consumer preference for smaller, deboned, or further processed products typically sold in trays covered with clear plastic film, the visual appearance of skinless portions is the most important determinant of acceptance at the point of sale [2,3,4].
Genetic selection in poultry has been geared toward increasing body weight, average daily gain, and breast muscle yield to meet consumer meat demand more efficiently [5]. This selection pressure has increased myopathies of the Pectoralis major, such as white striping (WS) [6,7]. From a microscopic point of view, breasts affected by WS show an increased sign of muscle cell injury thought to be related to rapid growth of the breast muscle tissue leading to ischemia and myofibril necrosis with minimal time for cell repair [8,9,10,11,12]. The necrotic muscle tissue is then infiltrated by fat and connective tissue leading to the macroscopic presence of white striations on the surface of the fillet and resulting in fillets with a higher lipid and lower protein content [8,9,13]. WS has been shown to negatively affect consumer acceptance of fresh, skinless broiler chicken meat [14,15], and one would assume the same finding is likely in turkeys. This is because consumers associate these white striations with meat from older poultry (e.g., older chickens tend to show more striations than young broiler chickens). However, today we see these white striations also in young birds [16]. WS has also been shown to affect the quality of the meat by decreasing marinade uptake, increasing cooking loss, and increasing hardness of the meat as a result of decreased functional proteins [12,17,18,19,20,21]. Consequently, larger breast fillets that are commonly used for further processing will also be negatively affected by WS.
The incidence of WS has been researched in broiler chickens, and flock prevalence between 56–87% has been reported in heavy strains [10,22,23,24]. Research on WS in turkeys is limited compared to broiler chickens, with a recent study showing a flock-level prevalence of over 60% [25]. Genetically, WS has been estimated to be moderate to highly heritable in broiler chickens [26,27,28,29], but to our knowledge, no estimates have been published for turkeys.
Although the development of WS is associated with microscopic changes to the muscle tissue [17], this myopathy is clearly observable at the macro level and can be scored without microscopes or other technologies. A four-category scoring scheme, primarily accounting for thickness of the striations, has been used to evaluate the severity of WS in broiler meat; this scheme classifies the appearance of breast meat samples as normal (no distinct white lines), moderate (small lines, <1 mm), severe (large white lines, 1–2 mm), or extreme (thick bands, >2 mm) [9,14]. This scheme has also been used in several turkey studies [25,30,31]. Zampiga et al. [32] used a similar four-category scoring scheme based on the proportion of the breast fillets covered by stripes as opposed to the thickness of the bands.
As muscle myopathies are a rising problem in the North American meat industry [26,33,34] and consumers’ awareness of these pathologies and their concerns about animal well-being rise [35,36,37], efforts have been made by breeding companies to improve the health of the animal and the appearance and quality of meat products by reducing the occurrence of emerging muscle myopathies. To do so, it is important to find a reliable method of identifying and classifying defects such as WS. There is a lack of evaluation of the reliability of visual scoring schemes for WS with multiple observers. Accurate recording of WS is important to ensure consistency over time and between observers, especially when considering the inclusion of a trait in breeding programs [38]. The objectives of this study were to: (1) evaluate the intra- and inter-observer reliability when using a WS scoring system adapted for turkeys, (2) evaluate the prevalence and severity of WS in turkey breast muscles, and (3) evaluate associations between WS scores and production traits (i.e., breast weight, slaughter weight, and breast meat yield).

2. Materials and Methods

2.1. Animals

Adult male turkeys, 20–24 weeks old, from four purebred lines (A, B, C, and D) were processed over 44 weeks between 2018 and 2019 at a local commercial poultry processing plant as part of a larger project focusing on genomic selection in turkeys [34]. The number of birds included in this study was 12,321 with 2839, 3728, 2034, and 3720 from lines A, B, C, and D, respectively. Birds were reared under identical conditions according to Hybrid Turkeys [39] and were weighed two days prior to processing (i.e., slaughter weight) (OHAUS scale, NJ, USA, accuracy to 0.01 kg). During processing at a commercial processing plant, the birds were electrically stunned and exsanguinated. Birds were scalded, defeathered, and eviscerated before moving to the chiller for 24 h prior to deboning and collecting meat quality and breast muscle weights (OHAUS scale, NJ, USA, accuracy to 0.01 kg). Breast meat yield (BMY, %) was then calculated as a proportion of slaughter weight.

2.2. Scoring System

Both Pectoralis major muscles were photographed (Hero 6, GoPro, San Mateo, CA, USA) approximately 24 h post mortem. Photographs were taken approximately 40 cm from the surface of the fillets using the “normal” focal length setting of the camera to minimize distortion of the image. These photographs were used to evaluate a 0–3 scoring scale for WS which was adapted from a system originally used in broiler chickens [9]. As shown in Figure 1, a score of 0 represented no to minimal white striations, Score 1 represented thin white striations visible on the breast, most of which tended to occur at the caudal end of the fillet, Score 2 represented white striations visible on the breast spread between the caudal end and the main body of the fillet, and Score 3 represented thick white striations visible on the breast covering a majority of the outer surface. If the two breasts in each photograph differed in severity, observers were instructed to record the more severe score. All observers were blind to all additionally recorded data of each image including genetic line, age at slaughter, slaughter weight, breast weight, and BMY.

2.3. Scoring Evaluation

The intra- and inter-observer reliability of the scoring system was assessed among six observers due to the high number of samples to be scored. One observer had previously tested the scoring system on a subset of photos to determine its feasibility, while others had no previous experience scoring WS severity. The scoring system was discussed among the observers before the first scoring session. A session consisted of all observers scoring 50 photographs in duplicate (two rounds per session). The initial subset of photographs was semi-randomly selected in that they had to give a clear visual of the breast muscle (e.g., without any damage due to trimming). Observers scored the same 50 photographs in an initial round and repeated the exercise several days later with the same set of photographs in a different randomized order. Intra-observer reliability was determined based on the results of the two rounds within one session for each observer. In contrast, inter-observer reliability was determined based on the specific round across observers. Following the first session (session 1), all observers discussed the results to align their scoring before repeating another session with the same 50 photos (session 2). A final session (session 3) was conducted after further discussion with a new subset of 50 photos that were randomly selected to reflect photographs collected under more typical processing plant conditions (e.g., excessive fat/skin obscuring the muscle, damaged breast muscle).

2.4. Reliability Analysis

All agreement calculations were performed using SAS® statistical software, version 9.4 (SAS Institute Inc., Cary, NC, USA [40]).
Multiple measures of agreement were calculated for intra- and inter-observer reliability to comprehensively understand the reliability of the scoring system. Intra-observer reliability was assessed by calculating exact agreement between two rounds, Spearman’s rank correlations, Fleiss–Cohen’s kappa coefficient, linear weighted kappa, and quadratic weighted kappa. Kappa statistic estimates were calculated to determine the level of agreement corrected for chance. Weighted kappa coefficients (linear and quadratic) were calculated to attribute partial credit depending on the relative extent of the disagreement between scores using Fleiss–Cohen weights [41]. Kappa values were interpreted following the classifications suggested by Landis and Koch [42] and presented in Petrie and Watson [43], where κ ≤ 0.20 is ‘poor’, 0.21 ≤ κ ≤ 0.40 is ‘fair’, 0.41 ≤ κ ≤ 0.60 is ‘moderate’, 0.61 ≤ κ ≤ 0.80 is ‘substantial’, and κ > 0.80 is ‘good’. It should be noted that these classifications are arbitrary [42], and the interpretation of kappa is dependent on the research field; however, a value closer to 1 indicates better agreement.
Inter-observer reliability was assessed by calculating estimates and tests of agreement among multiple observers using the %MAGREE macro (v3.8, SAS Institute Inc., Cary, NC, USA). The number of observers agreeing on each photograph was determined to calculate the percentage of cases in which 2, 3, 4, 5, or all 6 observers agreed on a score. Exact and weighted (linear and quadratic) kappa statistics were estimated [41]. Kendall’s W coefficient of concordance was estimated to test agreement of the observers’ ranking of the photographs. Finally, Gwet’s weighted agreement coefficient (AC2) for ordinal data was calculated to assess agreement on each score category while treating both observers and photographs as sampled rather than fixed to assess agreement of the entire populations of observers and photographs. For both Kendall’s W and Gwet’s agreement coefficient, the maximum value of 1 indicates perfect agreement.

2.5. White Striping Prevalence and Association with Production Traits

Upon completion of the three scoring sessions (six rounds), all 12,321 photos were divided among the six observers. Using R version 3.5.2 [44] and ggplot2 [45], the score frequency within each line and within the entire population was plotted. To test the influence of genetic line on WS score, a Tukey’s HSD test implemented in the stats package within R [44] was conducted on a linear model with genetic line as the only fixed effect. In addition to the WS scores, 12,282 birds had measurements for body weight two days before processing (slaughter weight), 12,190 had measures for breast weight (combined weight of both Pectoralis major and minor muscles), and 12,168 had measures for breast meat yield (BMY; breast weight as a percentage of slaughter weight). For each trait, outliers were excluded based on 3 standard deviations from the mean, and therefore, sample numbers differed slightly between the traits. Least square (LS) means of the three weight measures, for each score category, were calculated using a linear model with WS score and genetic line as fixed effects. LS means were then plotted for each genetic line. Statistical significance between the means was determined using Tukey’s HSD test implemented in the stats package within R.

3. Results

3.1. Intra-Observer Reliability

The results of the intra-observer reliability assessment in the three sessions for each observer are shown in Table 1. The percentage of exact agreement ranged between 62–78% in the first session for all observers, except Observer 1, who acknowledged having been influenced by discussion in between rounds in the first session. As a result, kappa values ranged from poor to substantial in the first session, showing a substantial difference between observers in intra-observer reliability. These values improved in the second session, with all observers showing a moderate to good agreement. From Sessions 2 to 3, kappa values remained similar or showed a slight decrease, which can likely be attributed to artifacts (e.g., excessive fat/skin obscuring the muscle, damaged breast muscle) in the new subset of photographs, which made scoring more difficult but represented more realistic examples. Considering the weighted kappa values, all observers had moderate to good agreement within themselves when considering the linear weighting. In contrast, all observers had good agreement when considering the quadratic weighting (Session 2 and 3). In the second session, the increase in kappa when using the quadratic weighing was larger for Observers 5 and 6, suggesting a larger difference in their scores (e.g., a score of 1 vs. 3 rather than 1 vs. 2). However, these were more similar across observers by the third session, suggesting only small differences in scores.

3.2. Inter-Observer Reliability

The results of the inter-observer reliability assessment in the six rounds between all observers are shown in Table 2. In the initial round, there were no instances in which all observers agreed on the same score. Instead, in nearly half of the images, exactly three observers gave the same score. The agreement increased in subsequent rounds, with the best results observed in the third and fourth rounds, where ≥4 observers agreed on most scores. In these rounds, we also observed all six observers agreeing; however, this percentage was not higher than 32%, showing the difficulty in obtaining exact agreement on the scoring scale among six observers. This was also highlighted by the poor to fair kappa values for exact agreement and fair to moderate kappa values when using linear and quadratic weighing. In contrast, Kendall’s W was relatively high (>0.6), indicating good agreement for the ranking of the photographs. Similarly, Gwet’s agreement coefficient for ordinal data showed substantial to good agreement overall, especially when considering the quadratic weighing.

3.3. Prevalence of White Striping

The WS prevalence in the population studied is shown in Figure 2. Descriptive statistics for the recorded weight measures for each genetic line are shown in Table 3. The frequency of affected breasts (i.e., breasts scored as 1, 2, or 3) within the entire population was 88.1%. The four genetic lines showed similar prevalence, with Line A having the highest frequency at 90.5% and Line D the lowest frequency at 84.1%. Line D was the only line to show a higher proportion of lower scores (0 and 1) than higher scores (2 and 3), with the lowest frequency of breasts scored as extreme (3.3% as Score 3). In contrast, Line B had 11.3% of breasts scored as extreme (score of 3). Consequently, genetic line was found to be a significant factor when analyzing WS score (p < 0.05). Although it has the largest average slaughter weight and breast meat weight of the four genetic lines, Line D showed the lowest average WS score (2.30 ± 0.013, p < 0.05) compared to Line A (2.57 ± 0.015), Line B (2.61 ± 0.013), and Line C (2.47 ± 0.017). Average WS score of all lines significantly differed (p < 0.05) with the exception of lines A and B, which tended to have a more similar average WS score (p = 0.07).
Figure 3, Figure 4 and Figure 5 show the average slaughter weight, breast weight, and BMY for each score within each line, respectively. Compared across lines, a similar trend was observed in that higher WS scores tended to be associated with higher average breast weight, slaughter weight, and BMY. For all lines, a WS score of 3 was associated with a significantly larger average weight measurement than score 0 (p > 0.05). The only exception was BMY of line A, which was numerically larger but not significantly different. The average percent increase between Score 0 and Score 3 was 4.6% for slaughter weight, 7.9% for breast weight, and 3.0% for BMY.

4. Discussion

The first objective of this study was to evaluate the reliability of a WS scoring scale for turkey breast muscles among multiple observers, followed by describing the prevalence and severity of WS in a turkey population consisting of birds from four genetic lines. Overall, we found that the scoring scale was reliable for multiple observers to score breasts for WS based on the cumulative information provided by the multiple coefficients estimated. Inter-observer reliability showed moderate to good agreement after completing three training sessions containing two rounds each. The inclusion of training is highly recommended for future studies attempting to score meat quality defects, such as WS, as the intra- and inter-observer reliability of the scoring system was improved after multiple sessions. Santos et al. [33] reported a weighted kappa of 0.85 for intra-reliability of WS in broiler chickens, which is similar to the results of the observers with the highest intra-observer reliability in the current study. It was not reported how many training sessions were required to reach this value in their study. To the best of the authors’ knowledge, no other studies have reported reliability measurements for WS in poultry. Reporting reliability measures is needed to evaluate and interpret data from such visual scoring systems. Observers in the current study appeared to stabilize their scoring and show the highest intra-observer reliability measures in Session 3 (Round 5 and 6), while the inter-observer reliability was strongest in Session 2 (Round 3 and 4). This is in line with research in other fields using discrete scoring systems to assess animal health. For example, D’Eath [46] reported that scoring reliability improved until the fifth scoring event. However, regular evaluation of the reliability of scoring systems and retraining is typically recommended.
This study suggests that the most difficulty in reaching agreement between observers was found between Score 1 and 2, which showed lower agreement coefficients. In contrast, the extremes of Score 0 and 3 were more easily agreed upon among all observers. Intermediate scores in discrete scales have previously been reported to have poorer reliability [45,46,47]. This difficulty in agreeing on Scores 1 and 2 could show a need for clearer definitions and examples, a broader scoring scale (fewer categories), or a reduction in the overall number of observers. More detailed scoring systems with more categories have more room for disagreement and require more training [46,47,48,49]. Depending on the purpose of the WS scoring, it could be recommended to reduce the scoring scale to combine Scores 1 and 2. However, merging scores is also considered to lead to a loss of granularity and should be suitable for the intended purpose of the assessment [46,47,48,49]. This seems relevant here, as Scores 1 and 2 were associated with differences in breast muscle weight, slaughter weight, and breast muscle yield in some of the genetic lines in the current study (Figure 3, Figure 4 and Figure 5). Ultimately, the development of a machine vision system capable of scoring breasts quantitatively, based on the number and thickness of the white striations, would be optimal; however, the development and use of such a program was not considered here.
It should be noted that this study used photographs to assess WS rather than live scoring due to practical constraints within the processing plant. Photographs or videos can be appropriate alternatives to live scoring [48,50]. Technology is increasingly implemented in processing plants to take advantage of automated methods (e.g., scoring of footpad lesions, grading broiler carcasses), which allow collection of larger datasets and benefits in terms of feasibility at high line-speeds [4]. However, disadvantages exist, and troubleshooting was required when initially setting up the camera system to ensure good quality photographs (e.g., adjustments for height, blur, and positioning). Feedback from observers showed that glare, positioning of breast outside of the area of focus, surface colour (striations were less noticeable with lighter coloured breast) or damage of the breast muscle (e.g., by trimming) could influence the scoring. These scenarios were more present in Session 3, which used less ideal photographs to mimic more realistic processing plant conditions. This could explain the slight dip in inter-observer reliability from Session 2 to 3, as observers adjusted to these conditions. These issues may be less of a problem when assessing breast muscle in-person. In-person training might be useful on a small subset of breasts for aligning scores between observers and an initial discussion of these scenarios, while time-consuming scoring of large samples sizes may be more appropriate via photographs.
Compared to the results of this study, a lower prevalence of WS (61.3%) has previously been reported in turkeys using a 0–2 scoring system [25]. A potential factor worth considering in future studies is the type of birds used, since pedigree stock, which was used in the present study, tends to be larger and may show differing levels of WS compared to commercial birds that are typically a hybrid cross. The WS prevalence of the studied population was similar, if not slightly higher than that previously published in broiler chickens, ranging between 56–87% in heavy strains [10,22,23,24]. A case has been made for WS being an emerging problem in broiler chickens over the past years and that this issue may also be present in turkeys.
Fast-growing broiler chickens and turkeys have been shown to have increased severity of WS [10,12,13,31]. The same finding was also observed within each line in the present study. However, this was not the case when comparing WS and average body weight between the four studied lines (i.e., the largest line, Line D, showed the lowest WS severity). Increases in breast weight between normal breasts and breasts affected by WS in broilers range from 16% to 25% [12,20], and an increase of 20% was observed in turkeys [13]. These reported differences in breast muscle weight between normal and affected breasts were higher than those observed in the present study between normal and extreme breasts (average of 7.9% increase). However, it should be noted that the weight used in the present study was the total weight of both Pectoralis major and minor muscles, which may influence the correlation between breast weight and WS. Significant increases in both slaughter weight and BMY between normal and extreme breast fillets were also observed in this study (average of 4.6% and 3.0%, respectively). This negative association between larger birds (i.e., one of the major economic goals of poultry breeding), and increases in muscle myopathies, such as WS, may pose an issue for breeders that could be managed through a selection index with appropriate economic weights.
It is noteworthy that the four lines studied had differing average slaughter ages depending on the birds’ size. The largest two lines, A and D, had an average slaughter age of 148.1 days and 143.8 days, respectively. The smaller two lines, B and C, had an average slaughter age of 153.7 days and 156.2 days. We do not believe this is a significant factor in the prevalence results as the range between the highest and lowest slaughter ages is minimal; however, it should be considered as age has been shown to affect the severity of WS [51].

5. Conclusions

The reliability of a WS scoring system for turkey breast muscles with multiple observers was developed and assessed. Later, the prevalence of WS in a large population of purebred turkey toms was determined. Overall, we report that the scoring system showed moderate to good reliability within and between the observers and that reliability improved after multiple training sessions. The prevalence of WS varied across the studied turkey lines and was 88.1% on average, with the majority of scores being moderate-severe (Scores 1 and 2). This highlights that this breast muscle myopathy is quite prevalent in turkeys and should continue to be investigated due to its negative effect on consumer preference and composition of the meat. In general, WS severity was found to be associated with higher slaughter weight, breast weight, and BMY, suggesting negative correlations that will have to be dealt with using a balanced selection index to reduce the presence of WS while continuing to maintain an increase in these economically significant traits.

Author Contributions

Conceptualization, R.J.V., N.v.S. and C.F.B.; methodology, R.J.V. and N.v.S.; validation, R.J.V. and N.v.S.; formal analysis, R.J.V. and N.v.S.; investigation, R.J.V., N.v.S., E.M.L., H.H., E.A.A. and B.O.M.; resources, J.M.; data curation, R.J.V.; writing—original draft preparation, R.J.V.; writing—review and editing, R.J.V., N.v.S., E.M.L., E.A.A., S.B., A.H.-M., B.J.W. and C.F.B.; visualization, R.J.V.; supervision, C.F.B., B.J.W., S.B. and A.H.-M.; project administration, N.v.S. and J.M.; funding acquisition, C.F.B. and B.J.W. All authors have read and agreed to the published version of the manuscript.

Funding

This project was funded by the Government of Canada through Genome Canada and the Ontario Genomics Institute (OGI-133). This study was part of the project entitled “Application of genomic selection in turkeys for health, welfare, efficiency and production traits” funded by the Government of Canada through the Genome Canada Genomic Application Partnership Program and administered by Ontario Genomics (recipients: B.J. Wood (Industry) and C.F. Baes (Academic)). The authors would also like to acknowledge NSERC and Hybrid Turkeys for financial support.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Canadian Council on Animal Care and approved by the Animal Care Committee of the University of Guelph (Animal Use Protocol 3785, approved 8 May 2017).

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to gratefully acknowledge Michelle Yahiro, Elizah McFarland, and Jadelyn Appleby for assisting with data collection. The authors also extend their gratitude to the managers and personnel of Hayter’s Farm (Dashwood, ON, Canada) and Hybrid Turkeys pedigree farm (Kitchener, ON, Canada) for collaborating on this study.

Conflicts of Interest

J.M. and B.W. were employees of Hybrid Turkeys at the time of the study. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. FAOSTAT. Crops and Livestock Products. Available online: https://www.fao.org/faostat/en/#data/QCL/visualize (accessed on 17 November 2021).
  2. Min, B.; Ahn, D.U. Sensory properties of packaged fresh and processed poultry meat. In Advances in Meat, Poultry and Seafood Packaging; Woodhead Publishing: Sawston, UK, 2012; pp. 112–153. ISBN 9781845697518. [Google Scholar]
  3. Font-i-Furnols, M.; Guerrero, L. Consumer preference, behavior and perception about meat and meat products: An overview. Meat Sci. 2014, 98, 361–371. [Google Scholar] [CrossRef]
  4. Barbut, S. Meat industry 4.0: A distant future? Anim. Front. 2020, 10, 38–47. [Google Scholar] [CrossRef]
  5. Havenstein, G.B.; Ferket, P.R.; Grimes, J.L.; Qureshi, M.A.; Nestor, K.E. Comparison of the performance of 1966- versus 2003-type Turkeys when fed representative 1966 and 2003 Turkey diets: Growth rate, livability, and feed conversion. Poult. Sci. 2007, 86, 232–240. [Google Scholar] [CrossRef] [PubMed]
  6. Velleman, S.G.; Nestor, K.E. Effect of selection for growth rate on myosin heavy chain temporal and spatial localization during Turkey breast muscle development. Poult. Sci. 2003, 82, 1373–1377. [Google Scholar] [CrossRef]
  7. Hocking, P.M. Unexpected consequences of genetic selection in broilers and turkeys: Problems and solutions. Br. Poult. Sci. 2014, 55, 1–12. [Google Scholar] [CrossRef] [PubMed]
  8. Barbut, S. Recent myopathies in broiler’s breast meat fillets. Worlds Poult. Sci. J. 2019, 75, 559–582. [Google Scholar] [CrossRef]
  9. Kuttappan, V.A.; Hargis, B.M.; Owens, C.M. White striping and woody breast myopathies in the modern poultry industry: A review. Poult. Sci. 2016, 95, 2724–2733. [Google Scholar] [CrossRef]
  10. Russo, E.; Drigo, M.; Longoni, C.; Pezzotti, R.; Fasoli, P.; Recordati, C. Evaluation of White Striping prevalence and predisposing factors in broilers at slaughter. Poult. Sci. 2015, 94, 1843–1848. [Google Scholar] [CrossRef]
  11. Livingston, M.L.; Landon, C.; Barnes, H.J.; Brake, J. White striping and wooden breast myopathies of broiler breast muscle is affected by time-limited feeding, genetic background, and egg storage. Poult. Sci. 2019, 98, 217–226. [Google Scholar] [CrossRef]
  12. Baldi, G.; Soglia, F.; Mazzoni, M.; Sirri, F.; Canonico, L.; Babini, E.; Laghi, L.; Cavani, C.; Petracci, M. Implications of white striping and spaghetti meat abnormalities on meat quality and histological features in broilers. Animal 2018, 12, 164–173. [Google Scholar] [CrossRef] [Green Version]
  13. Soglia, F.; Baldi, G.; Laghi, L.; Mudalal, S.; Cavani, C.; Petracci, M. Effect of white striping on Turkey breast meat quality. Animal 2018, 12, 2198–2204. [Google Scholar] [CrossRef]
  14. Kuttappan, V.A.; Lee, Y.S.; Erf, G.F.; Meullenet, J.F.C.; Mckee, S.R.; Owens, C.M. Consumer acceptance of visual appearance of broiler breast meat with varying degrees of white striping. Poult. Sci. 2012, 91, 1240–1247. [Google Scholar] [CrossRef]
  15. de Carvalho, L.M.; Ventanas, S.; Olegario, L.S.; Madruga, M.S.; Estévez, M. Consumers awareness of white-striping as a chicken breast myopathy affects their purchasing decision and emotional responses. LWT 2020, 131, 109809. [Google Scholar] [CrossRef]
  16. Griffin, J.R.; Moraes, L.; Wick, M.; Lilburn, M.S. Onset of white striping and progression into wooden breast as defined by myopathic changes underlying Pectoralis major growth. Estimation of growth parameters as predictors for stage of myopathy progression. Avian Pathol. 2018, 47, 2–13. [Google Scholar] [CrossRef] [PubMed]
  17. Kuttappan, V.A.; Shivaprasad, H.L.; Shaw, D.P.; Valentine, B.A.; Hargis, B.M.; Clark, F.D.; McKee, S.R.; Owens, C.M. Pathological changes associated with white striping in broiler breast muscles. Poult. Sci. 2013, 92, 331–338. [Google Scholar] [CrossRef]
  18. Petracci, M.; Mudalal, S.; Babini, E.; Cavani, C. Effect of White Striping on Chemical Composition and Nutritional Value of Chicken Breast Meat. Ital. J. Anim. Sci. 2014, 13, 179–183. [Google Scholar] [CrossRef]
  19. Petracci, M.; Mudalal, S.; Bonfiglio, A.; Cavani, C. Occurrence of white striping under commercial conditions and its impact on breast meat quality in broiler chickens. Poult. Sci. 2013, 92, 1670–1675. [Google Scholar] [CrossRef]
  20. Mudalal, S.; Lorenzi, M.; Soglia, F.; Cavani, C.; Petracci, M. Implications of white striping and wooden breast abnormalities on quality traits of raw and marinated chicken meat. Animal 2015, 9, 728–734. [Google Scholar] [CrossRef] [Green Version]
  21. Sanchez Brambila, G.; Bowker, B.C.; Zhuang, H. Comparison of sensory texture attributes of broiler breast fillets with different degrees of white striping. Poult. Sci. 2016, 95, 2472–2476. [Google Scholar] [CrossRef] [PubMed]
  22. Lorenzi, M.; Mudalal, S.; Cavani, C.; Petracci, M. Incidence of white striping under commercial conditions in medium and heavy broiler chickens in Italy. J. Appl. Poult. Res. 2014, 23, 754–758. [Google Scholar] [CrossRef]
  23. Cruz, R.F.A.; Vieira, S.L.; Kindlein, L.; Kipper, M.; Cemin, H.S.; Rauber, S.M. Occurrence of white striping and wooden breast in broilers fed grower and finisher diets with increasing Lysine levels. Poult. Sci. 2017, 96, 501–510. [Google Scholar] [CrossRef]
  24. Golzar Adabi, S.; Demirok Soncu, E. White striping prevalence and its effect on meat quality of broiler breast fillets under commercial conditions. J. Anim. Physiol. Anim. Nutr. 2019, 103, 1060–1069. [Google Scholar] [CrossRef]
  25. Mudalal, S. Incidence of white striping and its effect on the quality traits of raw and processed Turkey breast meat. Food Sci. Anim. Resour. 2019, 39, 410–417. [Google Scholar] [CrossRef]
  26. Bailey, R.A.; Watson, K.A.; Bilgili, S.F.; Avendano, S. The genetic basis of pectoralis major myopathies in modern broiler chicken lines. Poult. Sci. 2015, 94, 2870–2879. [Google Scholar] [CrossRef] [PubMed]
  27. Lake, J.A.; Dekkers, J.C.M.; Abasht, B. Genetic basis and identification of candidate genes for wooden breast and white striping in commercial broiler chickens. Sci. Rep. 2021, 11, 6785. [Google Scholar] [CrossRef] [PubMed]
  28. Pampouille, E.; Berri, C.; Boitard, S.; Hennequet-Antier, C.; Beauclercq, S.A.; Godet, E.; Praud, C.; Jégo, Y.; Le Bihan-Duval, E. Mapping QTL for white striping in relation to breast muscle yield and meat quality traits in broiler chickens. BMC Genom. 2018, 19, 202. [Google Scholar] [CrossRef]
  29. Alnahhas, N.; Berri, C.; Chabault, M.; Chartrin, P.; Boulay, M.; Bourin, M.C.; Bihan-Duval, E. Le Genetic parameters of white striping in relation to body weight, carcass composition, and meat quality traits in two broiler lines divergently selected for the ultimate pH of the pectoralis major muscle. BMC Genet. 2016, 17, 61. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Mudalal, S.; Zaid, A.; Abu-Khalaf, N.; Petracci, M. Predicting the quality traits of white striped turkey breast by visible/near infra-red spectroscopy and multivariate data analysis. Ital. J. Anim. Sci. 2020, 19, 676–686. [Google Scholar] [CrossRef]
  31. Carvalho, L.T.; Owens, C.M.; Giampietro-Ganeco, A.; Malagoli de Mello, J.L.; Ferrari, F.B.; de Carvalho, F.A.L.; Alves de Souza, R.; Amoroso, L.; Alves de Souza, P.; Borba, H.; et al. Quality of turkeys breast meat affected by white striping myopathy. Poult. Sci. 2021, 100, 101022. [Google Scholar] [CrossRef]
  32. Zampiga, M.; Tavaniello, S.; Soglia, F.; Petracci, M.; Mazzoni, M.; Maiorano, G.; Meluzzi, A.; Clavenzani, P.; Sirri, F. Comparison of 2 commercial Turkey hybrids: Productivity, occurrence of breast myopathies, and meat quality properties. Poult. Sci. 2019, 98, 2305–2315. [Google Scholar] [CrossRef]
  33. Santos, M.N.; Rothschild, D.; Widowski, T.M.; Barbut, S.; Kiarie, E.G.; Mandell, I.; Guerin, M.T.; Edwards, A.M.; Torrey, S. In pursuit of a better broiler: Carcass traits and muscle myopathies in conventional and slower-growing strains of broiler chickens. Poult. Sci. 2021, 100, 101309. [Google Scholar] [CrossRef]
  34. Malchiodi, F.; Wood, B.; Barbut, S.; Harlander, A.; Baes, C. 306 Application of genomic selection for enhancing health, welfare, efficiency and production traits in turkeys. J. Anim. Sci. 2018, 96, 116. [Google Scholar] [CrossRef]
  35. Prisco, F.; De Biase, D.; Piegari, G.; D’Aquino, I.; Lama, A.; Comella, F.; Mercogliano, R.; Dipineto, L.; Papparella, S.; Paciello, O. Pathologic characterization of white striping myopathy in broiler chickens. Poult. Sci. 2021, 100, 101150. [Google Scholar] [CrossRef] [PubMed]
  36. World Organization for Animal Health. Introduction to the recommendations for animal welfare. In Terrestrial Animal Health Code; World Organization for Animal Health: Paris, France, 2008; pp. 235–236. [Google Scholar]
  37. Boerboom, G.; Van Kempen, T.; Navarro-Villa, A.; Pérez-Bonilla, A. Unraveling the cause of white striping in broilers using metabolomics. Poult. Sci. 2018, 97, 3977–3986. [Google Scholar] [CrossRef] [PubMed]
  38. Kapell, D.N.R.G.; Hill, W.G.; Neeteson, A.-M.; McAdam, J.; Koerhuis, A.N.M.; Avendaño, S. Genetic parameters of foot-pad dermatitis and body weight in purebred broiler lines in 2 contrasting environments. Poult. Sci. 2012, 91, 565–574. [Google Scholar] [CrossRef] [PubMed]
  39. Hybrid Turkeys. Technical Guide for Hybrid Turkeys Commercial Products; Hybrid Turkeys: Boxmeer, The Netherlands, 2020; pp. 1–75. [Google Scholar]
  40. SAS Institute Inc. SAS® 9.4 Statements: Reference 2016; SAS Institute Inc.: Cary, NC, USA, 2016. [Google Scholar]
  41. Watson, P.F.; Petrie, A. Method agreement analysis: A review of correct methodology. Theriogenology 2010, 73, 1167–1179. [Google Scholar] [CrossRef] [Green Version]
  42. Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [Green Version]
  43. Handel, I.G. Statistics for Veterinary and Animal Science, 3rd ed.; British Veterinary Association: London, UK, 2013. [Google Scholar] [CrossRef]
  44. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  45. Wickman, H. ggplot2: Elegant Graphics for Data Analysis; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
  46. D’Eath, R.B. Repeated locomotion scoring of a sow herd to measure lameness: Consistency over time, the effect of sow characteristics and inter-observer reliability. Anim. Welf. 2012, 21, 219–231. [Google Scholar] [CrossRef]
  47. Garner, J.P.; Falcone, C.; Wakenell, P.; Martin, M.; Mench, J.A. Reliability and validity of a modified gait scoring system and its use in assessing tibial dyschondroplasia in broilers. Br. Poult. Sci. 2002, 43, 355–363. [Google Scholar] [CrossRef]
  48. Schlageter-Tello, A.; Bokkers, E.A.M.; Groot Koerkamp, P.W.G.; Van Hertem, T.; Viazzi, S.; Romanini, C.E.B.; Halachmi, I.; Bahr, C.; Berckmans, D.; Lokhorst, K. Effect of merging levels of locomotion scores for dairy cows on intra- and interrater reliability and agreement. J. Dairy Sci. 2014, 97, 5533–5542. [Google Scholar] [CrossRef]
  49. Decina, C.; Berke, O.; van Staaveren, N.; Baes, C.F.; Harlander-Matauscheck, A. Development of a scoring system to assess feather damage in canadian laying hen flocks. Animals 2019, 9, 436. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Palczynski, L.J.; Buller, H.; Lambton, S.L.; Weeks, C.A. Farmer attitudes to injurious pecking in laying hens and to potential control strategies. Anim. Welf. 2016, 25, 29–38. [Google Scholar] [CrossRef] [Green Version]
  51. Radaelli, G.; Piccirillo, A.; Birolo, M.; Bertotto, D.; Gratta, F.; Ballarin, C.; Vascellari, M.; Xiccato, G.; Trocino, A. Effect of age on the occurrence of muscle fiber degeneration associated with myopathies in broiler chickens submitted to feed restriction. Poult. Sci. 2017, 96, 309–319. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Visual scoring system used for scoring white striping severity of the Pectoralis major (fillet). Where 0 (normal) = no to minimal white striations; 1 (moderate) = thin white striations visible on the breast, most of which tended to occur at the caudal end of the fillet (bottom of pictured fillets); 2 (severe) = large white striations visible on the breast spread between the caudal end and the main body of the fillet (top of pictures fillets); 3 (extreme) = thick white striations visible on the breast covering majority of the outer surface. Pictures by Ryley J. Vanderhout.
Figure 1. Visual scoring system used for scoring white striping severity of the Pectoralis major (fillet). Where 0 (normal) = no to minimal white striations; 1 (moderate) = thin white striations visible on the breast, most of which tended to occur at the caudal end of the fillet (bottom of pictured fillets); 2 (severe) = large white striations visible on the breast spread between the caudal end and the main body of the fillet (top of pictures fillets); 3 (extreme) = thick white striations visible on the breast covering majority of the outer surface. Pictures by Ryley J. Vanderhout.
Animals 12 00254 g001
Figure 2. White striping severity score (Score 0–3 with 0 representing normal, 1 moderate, 2 severe, and 3 extreme) percentage within purebred turkey toms of each genetic line, A (n = 2839), B (n = 3728), C (n = 2034) or D (n = 3720) and within the entire studied population (n = 12,321).
Figure 2. White striping severity score (Score 0–3 with 0 representing normal, 1 moderate, 2 severe, and 3 extreme) percentage within purebred turkey toms of each genetic line, A (n = 2839), B (n = 3728), C (n = 2034) or D (n = 3720) and within the entire studied population (n = 12,321).
Animals 12 00254 g002
Figure 3. Least square means for slaughter weight (live weight of the bird 2 days prior to slaughter in kg) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2834, B: n = 3715, C: n = 2032, D: n = 3701). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Figure 3. Least square means for slaughter weight (live weight of the bird 2 days prior to slaughter in kg) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2834, B: n = 3715, C: n = 2032, D: n = 3701). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Animals 12 00254 g003
Figure 4. Least square means for breast weight (combined weight of Pectoralis major and minor muscles in kg) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2830, B: n = 3677, C: n = 2031, D: n = 3652). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Figure 4. Least square means for breast weight (combined weight of Pectoralis major and minor muscles in kg) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2830, B: n = 3677, C: n = 2031, D: n = 3652). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Animals 12 00254 g004
Figure 5. Least square means for breast meat yield (breast meat weight as a percentage of slaughter weight) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2827, B: n = 3666, C: n = 2028, D: n = 3647). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Figure 5. Least square means for breast meat yield (breast meat weight as a percentage of slaughter weight) of purebred turkey toms for each white striping score (0–3) within each genetic line (A: n = 2827, B: n = 3666, C: n = 2028, D: n = 3647). Error bars show standard error. Means with different letters (a–d) within genetic line represent statistical significance (p < 0.05) as determined by Tukey’s HSD test.
Animals 12 00254 g005
Table 1. Intra-observer reliability coefficients of white striping scoring of turkey breast muscle (0–3 scale) to assess agreement within 6 observers over multiple training sessions. The 95% confidence interval for kappa, linear weighted kappa, and quadratic weighted kappa are given below the coefficients in brackets.
Table 1. Intra-observer reliability coefficients of white striping scoring of turkey breast muscle (0–3 scale) to assess agreement within 6 observers over multiple training sessions. The 95% confidence interval for kappa, linear weighted kappa, and quadratic weighted kappa are given below the coefficients in brackets.
Observer 1Observer 2Observer 3Observer 4Observer 5Observer 6Average
Session 1 (round 1 and 2) 1
Exact agreement (%)38746878627466
Spearman correlation0.48 **0.68 ***0.69 ***0.79 ***0.63 ***0.81 ***0.68
Kappa0.12
(–0.05–0.29)
0.51
(0.28–0.73)
0.43
(0.21–0.65)
0.64
(0.45–0.82)
0.41
(0.21–0.61)
0.62
(0.44–0.80)
0.45
Linear weighted
kappa
0.20
(0.06–0.34)
0.58
(0.37–0.78)
0.54
(0.35–0.73)
0.70
(0.54–0.86)
0.51
(0.33–0.69)
0.72
(0.58–0.86)
0.54
Quadratic weighted
kappa
0.31
(0.17–0.46)
0.68
(0.50–0.86)
0.68
(0.53–0.83)
0.78
(0.66–0.91)
0.62
(0.45–0.80)
0.82
(0.72–0.92)
0.65
Session 2 (round 3 and 4) 1
Exact agreement (%)80868874687278
Spearman correlation0.73 ***0.87 ***0.86 ***0.67 ***0.70 ***0.74 ***0.76
Kappa0.62
(0.41–0.83)
0.77
(0.61–0.93)
0.80
(0.64–0.95)
0.53
(0.31–0.75)
0.47
(0.27–0.68)
0.56
(0.36–0.76)
0.63
Linear weighted
kappa
0.66
(0.46–0.85)
0.82
(0.68–0.95)
0.83
(0.70–0.96)
0.59
(0.39–0.79)
0.57
(0.40–0.75)
0.65
(0.49–0.82)
0.69
Quadratic weighted
kappa
0.71
(0.50–0.91)
0.87
(0.77–0.97)
0.88
(0.78–0.98)
0.67
(0.49–0.84)
0.70
(0.56–0.83)
0.76
(0.63–0.89)
0.76
Session 3 (round 5 and 6) 2
Exact agreement (%)80868876807681
Spearman correlation0.65 ***0.88 ***0.86 ***0.65 ***0.81 ***0.71 ***0.76
Kappa0.59
(0.37–0.81)
0.80
(0.66–0.94)
0.80
(0.65–0.95)
0.52
(0.29–0.76)
0.69
(0.51–0.86)
0.58
(0.38–0.78)
0.66
Linear weighted
kappa
0.62
(0.41–0.83)
0.84
(0.72–0.96)
0.83
(0.69–0.96)
0.57
(0.36–0.79)
0.75
(0.61–0.90)
0.63
(0.46–0.81)
0.71
Quadratic weighted
kappa
0.67
(0.47–0.88)
0.88
(0.77–0.99)
0.87
(0.76–0.97)
0.64
(0.45–0.83)
0.83
(0.72–0.94)
0.71
(0.55–0.86)
0.77
1 Sessions 1 and 2 were conducted with the same set of 50 photographs; 2 Session 3 was conducted with a new set of 50 photographs, ** p < 0.01, *** p < 0.001.
Table 2. Inter-observer reliability coefficients of white striping scoring of turkey breast muscle (0–3 scale) to assess agreement between 6 observers over multiple training sessions. The 95% confidence interval for kappa, linear weighted kappa, quadratic weighted kappa, and Gwet’s AC2 are given below the coefficients in brackets.
Table 2. Inter-observer reliability coefficients of white striping scoring of turkey breast muscle (0–3 scale) to assess agreement between 6 observers over multiple training sessions. The 95% confidence interval for kappa, linear weighted kappa, quadratic weighted kappa, and Gwet’s AC2 are given below the coefficients in brackets.
Session 1 1Session 2 1Session 3 2
Round 1Round 2Round 3Round 4Round 5Round 6
Percentage agreement (%) 3
All 6 observers agree0632221210
5 observers agree242428363228
4 observers agree244636262640
3 observers agree48244162622
2 observers agree400040
Exact agreement
Kappa0.17
(0.10−0.25)
0.27
(0.20–0.34)
0.33
(0.26–0.39)
0.29
(0.22–0.36)
0.21
(0.15–0.28)
0.24
(0.16–0.31)
Kendall’s W 0.61 ***0.68 ***0.74 ***0.70 ***0.61 ***0.66 ***
Linear weighting
Linear weighted kappa0.32
(0.20–0.44)
0.45
(0.36–0.54)
0.52
(0.45–0.59)
0.48
(0.39–0.56)
0.38
(0.28–0.47)
0.41
(0.31–0.51)
Gwet’s AC2
Overall0.54
(0.28–0.81)
0.65
(0.56–0.74)
0.79
(0.69–0.89)
0.76
(0.66–0.87)
0.67
(0.58–0.77)
0.68
(0.58–0.79)
Score 00.88
(0.80–0.96)
0.86
(0.77–0.94)
0.93
(0.88–0.99)
0.96
(0.91–1.00)
0.88
(0.78–0.97)
0.89
(0.80–0.97)
Score 10.47
(0.28–0.66)
0.51
(0.32–0.70)
0.72
(0.59–0.84)
0.70
(0.51–0.89)
0.57
(0.41–0.72)
0.60
(0.43–0.78)
Score 20.49
(0.26–0.72)
0.58
(0.45–0.72)
0.68
(0.53–0.84)
0.61
(0.46–0.77)
0.58
(0.44–0.73)
0.56
(0.38–0.73)
Score 30.81
(0.58–1.00)
0.92
(0.86–0.98)
0.93
(0.87–0.99)
0.89
(0.81–0.96)
0.91
(0.84–0.98)
0.89
(0.81–0.97)
Quadratic weighting
Quadratic weighted kappa0.32
(0.20–0.44)
0.45
(0.36–0.54)
0.52
(0.45−0.59)
0.48
(0.39–0.56)
0.38
(0.28–0.47)
0.41
(0.31–0.51)
Gwet’s AC2
Overall0.74
(0.53–0.95)
0.83
(0.77–0.88)
0.91
(0.85–0.96)
0.89
(0.83–0.95)
0.83
(0.76–0.90)
0.85
(0.78–0.92)
Score 00.88
(0.80–0.96)
0.86
(0.77–0.95)
0.93
(0.88–0.99)
0.96
(0.91–1.00)
0.88
(0.78–0.97)
0.89
(0.80–0.97)
Score 10.47
(0.28–0.66)
0.51
(0.32–0.70)
0.72
(0.59–0.84)
0.70
(0.51–0.89)
0.57
(0.42–0.72)
0.60
(0.43–0.78)
Score 20.49
(0.26–0.72)
0.58
(0.45–0.72)
0.68
(0.53–0.84)
0.61
(0.46–0.77)
0.58
(0.44–0.73)
0.56
(0.38–0.73)
Score 30.81
(0.58–1.00)
0.92
(0.86–0.98)
0.93
(0.87–0.99)
0.89
(0.81–0.96)
0.91
(0.84–0.98)
0.89
(0.81–0.97)
1 Sessions 1 and 2 were conducted with the same set of 50 photographs; 2 Session 3 was conducted with a new set of 50 photographs; 3 Percentage agreement by each exact number of observers; *** p < 0.001.
Table 3. Descriptive statistics (count, mean, and standard deviation) for white striping score, slaughter weight, breast weight, and breast meat yield (BMY) for the four genetic lines of turkeys studied.
Table 3. Descriptive statistics (count, mean, and standard deviation) for white striping score, slaughter weight, breast weight, and breast meat yield (BMY) for the four genetic lines of turkeys studied.
Genetic LineWhite Striping Score (0–3)Slaughter Weight (kg)Breast Weight (kg)BMY (% BW)
NMeanSDNMeanSDNMeanSDNMeanSD
A28392.570.776283421.981.54828305.230.583282723.811.966
B37282.610.81371519.111.28636774.740.499366624.811.789
C20342.470.782203214.771.00520313.770.394202825.511.704
D37202.300.771370124.871.80136526.100.757364724.472.108
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Vanderhout, R.J.; Leishman, E.M.; Hiscock, H.; Abdalla, E.A.; Makanjuola, B.O.; Mohr, J.; Harlander-Matauschek, A.; Barbut, S.; Wood, B.J.; Baes, C.F.; et al. Reliability of a White Striping Scoring System and Description of White Striping Prevalence in Purebred Turkey Lines. Animals 2022, 12, 254. https://doi.org/10.3390/ani12030254

AMA Style

Vanderhout RJ, Leishman EM, Hiscock H, Abdalla EA, Makanjuola BO, Mohr J, Harlander-Matauschek A, Barbut S, Wood BJ, Baes CF, et al. Reliability of a White Striping Scoring System and Description of White Striping Prevalence in Purebred Turkey Lines. Animals. 2022; 12(3):254. https://doi.org/10.3390/ani12030254

Chicago/Turabian Style

Vanderhout, Ryley J., Emily M. Leishman, Heather Hiscock, Emhimad A. Abdalla, Bayode O. Makanjuola, Jeff Mohr, Alexandra Harlander-Matauschek, Shai Barbut, Benjamin J. Wood, Christine F. Baes, and et al. 2022. "Reliability of a White Striping Scoring System and Description of White Striping Prevalence in Purebred Turkey Lines" Animals 12, no. 3: 254. https://doi.org/10.3390/ani12030254

APA Style

Vanderhout, R. J., Leishman, E. M., Hiscock, H., Abdalla, E. A., Makanjuola, B. O., Mohr, J., Harlander-Matauschek, A., Barbut, S., Wood, B. J., Baes, C. F., & van Staaveren, N. (2022). Reliability of a White Striping Scoring System and Description of White Striping Prevalence in Purebred Turkey Lines. Animals, 12(3), 254. https://doi.org/10.3390/ani12030254

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop