Next Article in Journal
Increasing Hepatitis E Virus Seroprevalence in Domestic Pigs and Wild Boar in Bulgaria
Previous Article in Journal
Content of Selected Macro- and Microelements in the Liver of Free-Living Wild Boars (Sus scrofa L.) from Agricultural Areas and Health Risks Associated with Consumption of Liver
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Variability in Polish Lowland Sheepdogs Assessed by Pedigree and Genomic Data

Institute of Animal Breeding and Genetics, University of Veterinary Medicine Hannover (Foundation), 30559 Hannover, Germany
*
Author to whom correspondence should be addressed.
Animals 2020, 10(9), 1520; https://doi.org/10.3390/ani10091520
Submission received: 20 July 2020 / Revised: 23 August 2020 / Accepted: 24 August 2020 / Published: 27 August 2020
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:

Simple Summary

Dogs are an important part of society. The Polish Lowland Sheepdog (PON) is one of 353 of the world’s largest cynological organization listed dog breeds. Breeds with small population sizes, like the PONs, are often characterized by high inbreeding rates and thus an increased risk of congenital diseases. To examine the endangerment of the PONs, measures for genetic diversity and inbreeding were calculated for the German PON population. The study showed that the PONs had to be classified as a minimally endangered population according to threshold values specified by the European Association for Animal Production. However, the very recent trend showed a slight improvement.

Abstract

Genetic variability of Polish Lowland Sheepdog (PON) population was evaluated using both pedigree and genomic data. The analyzed pedigree encompassed 8628 PONs, including 153 individuals genotyped on the Illumina CanineHD BeadChip. Runs of homozygosity (ROH) were defined for homozygous stretches extending over 60 to 4300 kb. The inbreeding coefficients FPed based on pedigree data and FROH50 based on ROHs were at 0.18 and 0.31. The correlation between both was 0.41 but 0.52 when excluding animals with less than seven complete generations. The realized effective population size ( N e ¯ ) was 22.2 with an increasing trend over years. Five PONs explained 79% of the genetic diversity of the reference population. The effective population size derived from linkage disequilibrium measured by r² was 36. PANTHER analysis of genes in ROHs shared by ≥50% of the PONs revealed four highly over- or underrepresented biological processes. One among those is the 7.35 fold enriched “forelimb morphogenesis”. Candidate loci for hip dysplasia and patent ductus arteriosus were discovered in frequently shared ROHs. In conclusion, the inbreeding measures of the PONs were high and the genetic variability small compared to various dog breeds. Regarding N e ¯ , PON population was minimally endangered according to the European Association for Animal Production.

1. Introduction

The calculation of demographic measures and thereby the evaluating of the genomic architecture of a breed are an important issue in dog breeding. Pedigree-based inbreeding coefficients in several dog breeds ranged from 0.03 to 0.1 [1,2,3]. The estimation of genetic variability has been both performed based on pedigree and single nucleotide polymorphism (SNP) array data. However, genome-based measures are more accurate than pedigree-based [4] and inbreeding coefficients based on genomic data were higher compared to those based on pedigree data [3].
The genetic variability largely depends on breeding decisions and practices. Selection for special characteristics in dog breeds can result in bottlenecks in the populations and thus high inbreeding rates. Thereby, the probability of homozygosity for recessive genes increases, which leads to a higher incidence of hereditary diseases [5]. Especially in small populations, where the possible choice of breeding mates is small, the risk of highly inbred offspring increases.
Polish Lowland Sheepdogs (Polski Owczarek Nizinny, PON) are an example for a small population. In 2009 to 2018 only 57 to 116 PONs per year were born in the German Kennel Club (VDH) [6]. In contrast, over all breeds the average of dogs born per year in the VDH was 255 to 299, with a range from 0 to 15,870, in this period [6]. At the beginning of pure breeding after World War I PONs were highly inbred on one dog to introduce a breeding standard [7]. The Fédération Cynologique Internationale (FCI) officially recognized the medium-sized dogs in 1963 under the standard number 251 [8]. The breed originally is a sheepdog, but nowadays most often a pet. An inbreeding coefficient (FPed) of 0.04 was calculated for an Australian PON population [9]. However, the studied population contained only 12 animals. As summarized by Bell, et al. [10] health problems in PONs are: Canine hip dysplasia (CHD), hypothyroidism, several ocular diseases like progressive retinal atrophy (PRA) and cataract, neuronal ceroid lipofuscinosis, and patent ductus arteriosus (PDA). According to the disease statistics of the Orthopedic Foundation for Animals the most frequent diseases are CHD and PRA (rcd4-mutation) with 16.7% (of 634 dogs examined dogs) and 17.5% (of 331 dogs) abnormal results [11]. The rcd4-mutation explained most of the PRA cases in PONs [12]. A few cases (1.6%/127) of canine elbow dysplasia (CED) were also reported in the OFA statistics [11].
The objective of this study was to analyze the genetic variability of the PONs registered in the Allgemeiner Klub für polnische Hunderassen e.V. (APH), Germany. The measures were estimated using pedigree and SNP array data. Therefore, 153 PONs were genotyped on the Illumina CanineHD BeadChip (Illumina, San Diego, CA, USA). The genetic variability was estimated based on linkage disequilibria (LDs) and runs of homozygosity (ROHs). Estimated fixation index FIS and FPed based on pedigree data were compared to FIS and FROH based on genotype information. The genes located in the ROHs were considered regarding their potential relevance for potential health problems of PONs.

2. Materials and Methods

2.1. Data, Samples, and Genotyping

Pedigree data were provided by the APH. Missing information in pedigrees was supplemented with data from herd books. The pedigrees encompassed 8628 PONs born since 1953 and with parental information. Blood samples of 153 PONs were obtained from the bio-bank of the Institute for Animal Breeding and Genetics at the University of Veterinary Medicine Hannover, which contained blood samples of about 800 APH-registered PONs. Sampling of the genotyped PONs regarded population structure and was done in such a way that sires and dams with larger progeny numbers were represented in the data set. These animals were born from 1995 to 2017. Genomic DNA was extracted using a standard ethanol fractionation with concentrated sodiumchloride (6 M NaCl) and sodium dodecyl sulphate (10% SDS). The concentration of each DNA sample was adjusted to 50 ng/µL and then genotyped using the Illumina CanineHD BeadChip (Illumina, San Diego, CA, USA) containing 220,853 SNPs.

2.2. Analysis of Pedigree Data

The demographic measures based on pedigree data were estimated using the software ENDOG, version 4.8, Departamento de Producción Animal, Facultad de Vaterinaria, Universidad Complutense de Madrid, Madrid, Spain [13]. All parameters were calculated for a reference population and the BeadChip sample, which contained the 153 genotyped PONs with available pedigree information. Both were considered as subpopulations of the entire APH-registered PON population. The reference population was chosen on basis of the pedigree completeness. For each birth cohort the mean equivalent complete generations (EqG) were calculated (Figure 1). To reach a number of at least 30 animals per cohort those from 1953 to 1968 were put into bins of eight years and those from 1969 to 1974 into bins of two years. EqG are the sum over all known ancestors [14]. The parameter was calculated as the sum of (1/2)n where n was defined as the number of generations separating the individual to each known ancestor. The threshold for birth cohorts to be included in the reference population was set at EqG larger than eight. Dogs born in 2019 were excluded because of the incomplete information of the cohort. Thus, the reference population included all PONs born from 1990 to 2018 dogs. These were 4920 PONs. Per year, 91 to 397 PONs were born (Figure S1). The BeadChip sample was a part of the reference sample and for this subpopulation genealogical and genomic measures were compared.
Further measures of pedigree completeness were the completeness of each ancestor in the pedigree to the 5th parental generation and the index of completeness by MacCluer, et al. [15].
The generation interval was computed as the average age of the parents at the birth of their offspring that was used for reproduction [16].
As one measure of genetic variability the effective number of founders (fe) was calculated as
f e = 1 k = 1 f q k 2
where f was the number of founders and qk the probability of gene origin of the individual ancestor (k) [17]. The parameter is defined as the number of equally contributing founders that are expected to produce the same genetic diversity as in the population under study [18]. Founders are those animals without known ancestors and to which all other animals of the population can be traced back.
To evaluate the balanced use of reproductive individuals in the PON population we computed the effective number of ancestors (fa). The parameter was calculated as the marginal genetic contribution (q) of an individual ancestor (qj), which is the contribution by an ancestor that could not be explained by other ancestors before:
f a = 1 j = 1 a q j 2
with a = number of ancestors [17].
Founder genome equivalents (Ng) are the number of equally contributed founders that would be expected to produce the same genetic diversity as the population under study without any random loss of founder alleles in descendants [18]. The program segreg.f of the PEDIG software, Institut National de la Recherche Agronomique, Jouy en Josas Cedex, France [19] was used for the calculations.
To evaluate the unbalanced use of reproducers, we calculated the effective number of sires (NeffS) and dams (NeffD) as:
N e f f S =   1 i s i 2
N e f f D =   1 i d i 2
where si (di) was the relative frequency of use of the sire or dam i among all sires (dams) of the reference population [20].
Furthermore, the inbreeding coefficient (FPed) according to Meuwissen and Luo [21] was calculated. The parameter describes the probability that an individual is homozygous by descent at a random chosen locus in the genome.
We estimated FIS as a part of the F-Statistics [22]. It was calculated with
F I S _ P e d = F P e d   ˜   f ¯ 1     f ¯
where F P e d ˜ was the mean inbreeding coefficient for the PON population and f ¯ the mean coancestry for the subpopulation [23,24]. We defined every birth cohort as one subpopulation.
The individual increase in inbreeding (ΔFPed_i) was calculated based on FPed_i dependent on the equivalent complete generations (t) [25]:
Δ F P e d _ i = 1   1 F P e d _ i   t     1 .
Thereof the increase in inbreeding of the reference population (ΔFPed) was calculated as mean of the individual values.
The effective population size was estimated as realized effective size ( N e ¯ ) from the individual increase in inbreeding with [26]:
N e ¯ =   1 2 Δ F P e d ¯
To get an even more precise Ne in the case of substructures in the studied population [27], the Ne additionally was estimated via the paired increase in coancestry according to [26]:
Δ c j k = 1     1     c j k ( g j   +   g k 2 )
where cjk was the coancestry coefficient between the two paired individuals j and k and gj and gk the discrete equivalent generations of j and k.
The average coancestries of the reference population and the BeadChip sample were calculated using the program par3.f of the PEDIG software [19].

2.3. Statistical Analysis of Genomic Data

For the estimation of the genetic variability measures, all SNPs from sex chromosomes were excluded from the dataset. SNPs and individuals with a genotyping rate < 0.95 were removed. We did not remove SNPs through MAF to avoid a removing of homozygous regions. Previously, in a similar study a filtering for MAF did mostly result in indistinguishable estimates [28]. Thus, the final dataset contained 167,183 autosomal SNPs. These SNPs covered 2,201,666,442 bp of the autosomal dog genome.
PLINK (www.cog-genomics.org/plink/1.9/), version 1.9, Complete Genomics, Mountain View, CA, USA [29] was used for the ROH detection. The minimum run length of SNPs in a ROH (l) resulting in a type I error rate α of 0.05 was calculated as:
l   = log ( α n S N P   ×   n i ) log ( h o m )
where nSNP was the mean number of SNPs per individual, ni the number of genotyped individuals and hom the mean homozygosity across all SNPs [30,31]. This corresponded to a length of 50 SNPs. We defined a ROH as a homozygous stretch that extended over 60 to 4300 kb and according to the average SNP distance on the array contained at least 5, 7, 10, 20, 30, 40, 50, 65, or 358 SNPs, respectively (Table 1) [30,31,32]. It has been previously shown that selection signatures going back many generations could only be captured in short ROHs with a type I error rate α > 0.05 [33]. Especially in regions with low marker density or high recombination rates use of smaller ROHs had benefits [34]. Therefore, the short ROHs containing less than 50 SNPs were included in the study. In order to capture recent inbreeding, ROHs containing more SNPs were included in the study. We did not allow for heterozygous SNPs and only two to five missing SNP genotypes in the homozygous region were admitted [32]. The matching proportions of overlapping ROHs were pooled. A consensus ROH was a matching proportion that occurred in all PONs. We watched out for known causal or associated loci and genes for the most breed-relevant diseases CHD and PRA (rcd4-mutation) located in the regions of the ROH5 shared by at least 50, 75, or 90% of the genotyped PONs. Furthermore, we screened for genes fibroblast growth factor 5 (FGF5) and R-spondin 2 (RSPO2) responsible for the coat structure of the PONs as all PONs are long coated and have furnishings according to the breeding standard [35]. Screening was done for ROH5 to cover a maximum number of ROHs including the very ancient ones. Significant differences of the amount of ROHs per chromosome were tested with a paired samples Wilcoxon test and R, version 3.6.3, R Foundation for Statistical Computing, Vienna, Austria [36].
The inbreeding coefficient FROH and fixation index FIS_SNP were estimated using the software SAS, version 9.4 (Statistical Analysis System, Cary, NC, USA, 2019). FROH was calculated as a proportion of the length of all ROHs to the total length of all autosomes covered by SNPs [37]. FIS-values for each individual (i) were calculated as:
F I S _ S N P g , i =   O i   E i n S N P , i   E i ,
with Ei = number of expected homozygous SNPs by chance, Oi = number of observed homozygous SNPs and nSNP,i = number of all observed autosomal SNPs in the considered individual [38]. Pearson correlation coefficients for the inbreeding coefficients were calculated. To prove the impact of the complete generations on the correlation between FPed and FROH, dogs with less than four, five, six, and seven, respectively, complete generations in the pedigree dataset were excluded from the calculations.
The effective population size and the increase in inbreeding per generation based on the LD were detected using PLINK (www.cog-genomics.org/plink/1.9/), version 1.9 [29]. Therefore, the LD was measured as the squared correlation between SNP pairs (r2). Distance bins of 0.1 Mb were created containing the r²-values for SNP pairs with distances of 1 kb to 33.3 Mb. For each distance bin the mean r²-value was estimated. The Ne_SNP was computed as N e _ S N P =   1   r 2 4 c r 2 , with c = recombination rate in Morgan units [39]. Regarding the distance c we assumed that 100 Mb = 1 Morgan. ΔFSNP was estimated with Δ F S N P =   1 2 N e _ S N P [27]. The number of generations in the past (t) was computed as t =   1 2 c and rounded to the nearest integer. If there was more than one value of Ne_SNP or ΔFSNP per integer, the values were averaged.
Using Net View package, version 3.4.1 [40,41] implemented in R, version 3.6.3 [36], a NetView analysis was applied to detect fine-scale population structures. It is based on a relationship matrix which was derived from distances between SNP markers calculated with PLINK (www.cog-genomics.org/plink/1.9/), version 1.9 [29,41]. NetView clusters based on mutual k-nearest neighbors were created to visualize the relatedness between the individuals [40,41]. The maximum number of nearest neighbors (K-NN) was set to K-NN = 2–10 as recommended for small animal sets [41]. Analyses at higher K-NN to investigate large-scale genetic structures were not performed, because the analysis just included one breed. The visualization of the population network for K-NN = 7 and 10 was done using Cytoscape, version 3.7.0 [42]. To enable a visualization of potential connections between genetic distances and inbreeding coefficients, the PONs were grouped into classes of different inbreeding coefficients (FROH5 < 0.16, FROH5 = 0.16–0.19, FROH5 = 0.19–0.22, FROH5 = 0.22–0.25, FROH5 = 0.25–0.28, FROH5 = 0.28–0.31, FROH5 = 0.31–0.34, FROH5 = 0.34–0.37, FROH5 = 0.37–0.40, FROH5 = 0.40–0.43, FROH5 ≥ 0.43).
PANTHER 15.0 (Protein Analysis Through Evolutionary Relationships), Division of Bioinformatics, Department of Preventive Medicine, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA [43] was used to investigate the functional relevance of the ROHs. All genes located in ROH5 shared by at least 50, 75, or 90% of the PONs were analyzed by the “functional classification” analysis and the “statistical overrepresentation” test. The overrepresentation test was done for the annotation sets “GO biological process complete” and “Panther Protein Class”. To cover maximal ROHs including the very ancient ones the analysis was started for ROH5.
In addition, the ROHs shared by 75 or 90% of the PONs were cross-referenced with the databases OMIM [44] and OMIA [45].
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

3. Results

3.1. Demographic Measures Using Pedigree Analyses

The demographic measures were calculated for a reference population born from 1990 to 2018 and the BeadChip sample, which contained the 153 genotyped PONs (Table 2; Table S1). The mean of the EqG in the reference population was 10.1 and 11.2 in the BeadChip sample. Complete generations varied from 4.5 to 6.0 in the reference population (Figure 2). In the fifth parental generation, 93.1 to 99.1% of the ancestors were known in the reference population (BeadChip: 86.3–100.0%). The index of completeness by MacCluer, Boyce, Dyke, Weitkamp, Pfenning, and Parsons [15] was above 0.9 up to the seventh and ninth generation in the reference population and the BeadChip sample.
The mean generation interval was 4.7 years in the reference population (BeadChip: 4.9 years). Ten effective founders and six effective ancestors were identified. Five of the ancestors explained 79% respectively 78% of the genetic diversity of the population in the reference population and the Beadchip sample. NeffS was 150 at a total number of 433 sires and NeffD was 335 at a total number of 700 dams.
In the reference population, FPed ranged from 0.00 to 0.45 with a mean of 0.18 (BeadChip: 0.05–0.40, mean 0.18). It was consistent over the last 30 years (Figure 3). FIS_Ped decreased from 0.017 in 1990 to −0.022 in 2018 (Figure 3). The realized Ne increased from 16.3 (1990) to 29.1 (2018) (Figure 4).

3.2. Measurement of Genetic Variability Using Genomic Data

We identified 6764 (ROH358) to 27,276 (ROH5) ROHs (Table 3). None of the ROHs was a consensus ROH. Eight of the ROH5 were shared by at least 75% of the PONs (Table 4). The length varied from 61 kb to 83 Mb with an average length of 3.8 ± 4.9 Mb (ROH5) to 10.4 ± 6.5 Mb (ROH358). Most ROHs (77%) had a length less than 5 Mb (Figure 5). On average, CFA 29 had the most homozygous regions (Figure S2, Table S2). The SNPs that were homozygous most frequent (99%) were located on CFA 13 (Figure S3). The mean FROH for all genotyped PONs was 0.30 to 0.31, respectively, and ranged from 0.12 to 0.47 except FROH358, which was 0.21 (0.00–0.41) (Table 3, Figure 6). For FIS_SNP a mean of −0.01 (−0.19 to 0.21) was calculated (Figure 6). The parameter was below zero in 62% of the PONs. There was a statistically significant (p < 0.05) correlation of 0.92 between FROH and FIS_SNP. Between FROH and FPed, the Pearson correlation coefficient ranged from 0.25 (FROH358) to 0.42 (FROH40) with a p-value > 0.0001 (Table 5). An exclusion of dogs with low complete generations lead to an increase of the correlation (Table 6).
The gene for furnishing and quantitative trait loci previously reported to be associated with CHD, were located within the ROH5 shared by at least 50, 75, and 90% of the PONs (Table S3) [46,47,48,49,50,51,52]. According to the database OMIM, the gene angiopoietin 1 (ANGPT1) is involved in the development of vascular structures [53].
The mean r2 as a measure of LD ranged between 0.70 for SNPs 269 bp apart and 0.015 for SNPs 47.5 Mb apart. Ne_SNP was calculated for the last 200 generations, which corresponded to a marker distance of 250 kb to 50 Mb. Ne_SNP decreased from 309 to 101 (50 generations ago) and 38 (five generations ago) (Figure 7). In the very recent generation, the Ne_SNP was 36. From four to two generations ago the Ne_SNP increased to 42. In other words, the increase in inbreeding per generation reached its maximum in the recent generation (ΔFSNP = 0.014) (Figure 8).
The NetView analysis displayed the PONs as one community (Figure S4). There were no clear clusters within the community. The connections were independent of the inbreeding coefficient.

3.3. PANTHER Statistical Overrepresentation Analysis and Functional Classification Test

PANTHER analysis was started with 812 (≥50%), 53 (≥75%), and 12 (≥90%) genes, respectively. Out of these genes, PANTHER could map 743, 48, and 10 genes, respectively. The genes located in 5-SNP ROHs shared by at least 75% of the genotyped PONs (Table S3) were mostly assigned to the biological classes “cellular” and “metabolic processes” (Figure S5). Those shared by at least 50% of the dogs additionally often were assigned to “biological regulation”. The genes in the ROHs shared by at least 90% were assigned to “cellular” and “metabolic processes” and “cellular component organization or biogenesis”. Genes were involved in pathways of angiogenesis, among others (Figure S6). There were no significantly overrepresented or underrepresented gene classes in ROH5 shared by at least 75%. In those ROH5 shared by at least 50% of the dogs, genes involved in forelimb morphogenesis were overrepresented among other biological processes (Table S4).

4. Discussion

PONs in Germany represent a dog population with a small population size. Anecdotally, all PONs can be traced back to one founder animal in the beginning of pure breeding [7]. Therefore, a low genetic variability and high inbreeding coefficients were expected in this population. Thus, we estimated demographic measures and genetic variability using pedigree and genomic data.
Dogs genotyped were representative for the reference population and thus for the studied PON pedigree population. The mean inbreeding coefficient was slightly lower and Ne slightly higher in the genotyped dogs. This was plausible because we avoided sampling of closely related PONs for genotyping. In addition, the average coancestries of both populations were very similar. Netview analysis showed that the genotyped dogs did not belong to just one line, but reflected many lines.
The FPed in the PON population was higher than in other dog breeds [1,2,54]. It was far higher than in the earlier studied Australian PON population [9]. However, that study only contained 12 animals with a mean of 1.7 EqG. Thus, FPed was most likely underestimated in that previous study. Important for the high inbreeding coefficient in the PONs studied herein may be the small number of effective founders (Table 2). Especially the proportion of 0.1 effective founders to observed founders was small compared to other breeds. Of 36 Australian populations only the smooth coated Fox terrier and the German shepherd dog had a proportion below the one of the PONs [9].
There were no big differences between the different FROH capturing more or less recent inbreeding except for FROH358. Thus, α = 0.05 did not seem to be a too strict threshold for the definition of ROHs in the PON population. Despite the high inbreeding coefficient, we did not identify any consensus ROH. It is conceivable that the few effective founders were very distantly related and not inbred that much yet. In addition, breeders avoided line breeding and thus, highly inbred dogs were not mated with closely related dogs. This presumption was supported by NetView analysis that did not display any subpopulations differing in the value of inbreeding coefficients but the amount of inbreeding was equally distributed over the genotyped PONs. The FIS-values around zero indicated that matings among closely related animals leading to a loss of genetic diversity were not that often in the PON population. The ratios fa/fe and Ng/fe underlined that there was not that much loss of genetic diversity over the time as in other breeds. Of those studied dog breeds with high known pedigree depth (EqG ≥ 7), only few breeds like the Australian cattle dog or the Lancashire Heeler dog had a higher fa/fe ratio than the PONs [9,54,55]. This indicated there were less bottlenecks than in various other breeds. The proportion Ng/fe was higher than in PONs only in few breeds (EqG ≥ 7), too [9,55]. Thus, the genetic drift was comparatively small.
Even if an unbalanced use of sires and dams was proved, it did not seem to have much impact on the amount of inbreeding coefficients. Leroy and Baumung [20] reported a proportion of 0.77 between the effective number of reproducers and the total number of reproducers at a random use of reproducers. For the PONs this proportion was 0.35 (sires) and 0.48 (dams), respectively. In the males, those with more than 48 descendants, which corresponded to 13 sires, caused this deviation of 0.77. However, those sires may have been mated to distantly related dams and thus did not influence the inbreeding coefficient significantly.
We expected FROH to be higher than FPed because of the more precise prediction of the actual proportion of identical by descent (IBD) genome, which related individuals share, even at a large pedigree depth [4,56]. Additionally, the pedigree depth limits the results of pedigree analyses [57]. ROHs cover more generations than those recorded in pedigrees. Because short ROHs capture more ancient inbreeding [57,58], especially the FROH based on short ROHs were expected to be higher than FPed. However, FROH and FPed were more similar in the PONs than in various other breeds. The ratio of FPed to FROH was about 60% in the PONs. In UK Labrador Retrievers it was one-third (100-SNP-FROH), in the Lundehund and German Dalmatians it was one-ninth and one-sixth of the 65-SNP-FROH [3,28,33]. The reason for these similar values in the PONs was a quite high pedigree completeness. In the Beadchip sample, a pedigree completeness above 90% was found up to the ninth parental generation and mean EqG was 11.2. In the Labrador Retrievers and Dalmatians the mean EqG were 5.6 and 7.5 [3,33]. The completeness and quality of pedigree data is of major importance for the estimation of inbreeding measures using pedigree analysis. An incomplete pedigree can result in underestimated inbreeding measures [17,59,60].
The correlation between FROH and FPed was in the lower range compared to other studies [3,33,56]. The reason may be the variable depth of branches in the pedigrees. Most PONs had very deep pedigrees. However, several PONs had imbalanced pedigrees. Although the mean completeness of their pedigrees was high with maximal generations of 23, there often were branches in their pedigrees with only very few individuals. Thus, some individuals included in the analysis showed only a few complete generations despite the EqG was in the mean of the PONs. The FPed of these individuals was likely to be underestimated. Consequently, for most individuals FPed was predicted very well, but there also were several PONs for which FPed was underestimated. This may had led to the lower correlation. To prove this hypothesis, the calculation of the correlation between FPed and FROH was done again without the individuals with less than four, five, six, or seven complete generations. The Pearson correlation coefficient increased to 0.52 and thus was in a similar range compared to previous studies [3,33,56].
As homozygosity increases the incidence of inherited disorders [5], the genes located in most common ROHs were proved to have relevance for health problems in PONs. With ANGPT1 on CFA 13, a gene located in the ROHs shared by above 90% of the PONs was found, which possibly may be involved in the pathogenesis of PDA, because it is known to be involved in angiogenesis [61]. Furthermore, some previously reported loci for CHD were located in homozygous regions in the PONs [46,47,48,49,50,51,52]. It has to be investigated in further studies, if these loci are also relevant in the progression of CHD in PONs. Genes involved in the forelimb morphogenesis were overrepresented in the ROHs. Thus, those genes were more often observed in homozygous regions in PONs compared to the canine reference genome. This may indicate a predisposition for forelimb diseases like CED. However, there were only reported a few cases of CED as it is like other forelimb diseases are not a very common in the PONs [11]. The gene responsible for furnishing was located in a ROH that was shared by more than 90% of the PONs. This was plausible because all PONs have furnishings by breeding standard.
One factor for high inbreeding coefficients is a small population size. Because of the small choice of possible breeding mates, the probability of matings with related animals is increasing. The N e ¯ of the PONs ( N e ¯ = 22) was lower than in many other breeds. In two French studies with 50 estimated breeds only five breeds had a N e ¯ below 50 and of these only the Barbets had a lower N e ¯ (20) than the PONs. Most breeds had N e ¯ above 100 with a maximum of 1216 and 2136 in Bulldogs and West Highland White Terriers [1,54]. The Ne_SNP (36) also was low in comparison to various other dog breeds [3,33,62]. Nevertheless the Ne_SNP of the Lundehund (Ne_SNP = 19) was even lower [28]. In various studies critical values of 50 to 100 for the Ne based on pedigree data were given. The threshold value to minimize inbreeding and random genetic impoverishment of Ne = 50 [63] was not exceeded by the PONs. Bijma [64] suggested a ΔF of 0.5 to 1.0% per generation. This corresponds to an Ne of 50 to 100. With an expected increase of inbreeding of 22.04% within 50 years the PON population was minimally endangered according to the European Association for Animal Production (EAAP) [65]. Recently the N e ¯ increased slightly to 29 in 2018. The threshold of 5% increase in inbreeding in 50 years for a not endangered population according to the EAAP would correspond to an Ne of 104 in PONs.

5. Conclusions

In conclusion, both the pedigree-based as well as the SNP-based measures of genetic variability showed the same trend. The correlation between FPed and FROH was in similar range compared to previous studies. The genetic variability in the PON population was small and may be caused by the small number of founder animals at the beginning of pure breeding. Mean inbreeding coefficients of birth year cohorts were on the same level since 1990. However, according to the recommendation of the EAAP the breed had to be classified as minimally endangered. In order to maintain genetic diversity, number of matings for sires should be limited. In case, sires have not more than 48 progeny a popular sire effect can be avoided.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2615/10/9/1520/s1: Figure S1: Development of the population size of the Polish Lowland Sheepdogs from 1990 to 2018. Given are the number of offsprings and breeding sires and dams per year. Figure S2: Amount of runs of homozygosity (ROH) per chromosome (CFA). The chromosomes are listed from left to right. Figure S3: Incidence-plot of the single nucleotide polymorphisms (SNP) per chromosome. Given is the incidence based on the position of each SNP. Figure S4: Visualization of fine-scale population network for Polish Lowland Sheepdogs. K-NN = 7 (a) and 10 (b). Grey: FROH5 < 0.16 (3.3%), purple: FROH5 = 0.16–0.19 (1.3%), turquoise: FROH5 = 0.19–0.22 (2.0%), dark blue: FROH5 = 0.22–0.25 (5.3%), dark green: FROH5 = 0.25–0.28 (17.1%), light green: FROH5 = 0.28–0.31 (25.0%), orange: FROH5 = 0.31–0.34 (25.7%), yellow: FROH5 = 0.34–0.37 (9.9%), brown: FROH5 = 0.37–0.40 (5.3%), light blue: FROH5 = 0.40–0.43 (3.9%), pink: FROH5 ≥ 0.43 (1.3%). Figure S5: PANTHER functional classification analysis. Gene lists of the genes located in ROH5 shared by at least 75% or 90% of the genotyped Polish Lowland Sheepdogs were analyzed with the functional classification analysis tool of PANTHER. Listed are the biological processes. Figure S6: Results of the functional classification analysis with PANTHER 15.0 for ROH5 shared by at least 75% or 90% of the genotyped Polish Lowland Sheepdogs. Listed are the pathways. Table S1: Overview of individual results of the analysis of genetic variability in Polish Lowland Sheepdogs. Given are the sex, the year of birth, the number of complete and equivalent (EqG) generations, the inbreeding coefficient (FPed) and the increase in inbreeding (ΔFPed) based on pedigree data, the total length (in bp) of the runs of homozygosity (ROH) found in the individual and the inbreeding coefficient (FROH20) and fixation index (FIS_SNP) based on genomic data. Table S2: Significances of the differences of the amount of runs of homozygosity per chromosome (CFA). Listed are the p-values corrected after Bonferroni and calculated by paired samples Wilcoxon test. Significant (p < 0.05) differences are marked in yellow. Table S3: Genes located in ROH5 in Polish Lowland Sheepdogs (PON). Given are the considered dog chromosome (CFA), the start and the end of the region with runs of homozygosity (ROH) on that CFA, the length of the ROH-region in SNPs and all genes located in the ROH-regions. Genes having just an ENSCAFG-number are not listed. For ROH5 shared by at least 50, 75, or 90% of the PONs quantitative trait loci (QTL), detected linkage analyses (LA) or association studies (AS), SNPs and genes already known to be causal or associated with the most breed-relevant diseases canine hip dysplasia (CHD) and progressive retinal atrophy (PRA) or the coat structure of the breed and located in the ROH5 are listed. (a) Includes all ROH5. (b) Includes ROH5 shared by at least 50% of the genotyped PONs. (c) Includes ROH5 shared by at least 75% of the genotyped PONs. (d) Includes ROH5 shared by at least 90% of the genotyped PONs. Table S4: PANTHER statistical overrepresentation test. Gene lists of the genes located in ROH5 shared by at least 50% or 90% of the genotyped Polish Lowland Sheepdogs were analyzed with the overrepresentation analysis tool of PANTHER. Given are the biological processes the enriched genes are involved in, their fold enrichment and the false discovery rate (p-value).

Author Contributions

Conceptualization, O.D. and P.W.M.; methodology, O.D. and P.W.M.; formal analysis, O.D. and P.W.M.; investigation, P.W.M. and O.D.; data curation: O.D. and P.W.M.; writing—original draft preparation, P.W.M.; writing—review and editing, O.D.; visualization, P.W.M.; supervision, O.D.; project administration, O.D.; funding acquisition, O.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Cosmo Art & Science Foundation, Winsen/Luhe, Germany.

Acknowledgments

We are grateful to the Allgemeiner Klub für Polnische Hunderassen e.V. (APH), Germany for providing pedigree data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Leroy, G.; Rognon, X.; Varlet, A.; Joffrin, C.; Verrier, E. Genetic variability in French dog breeds assessed by pedigree data. J. Anim. Breed. Genet. 2006, 123, 1–9. [Google Scholar] [CrossRef]
  2. Calboli, F.C.; Sampson, J.; Fretwell, N.; Balding, D.J. Population structure and inbreeding from pedigree analysis of purebred dogs. Genet. Soc. Am. 2008, 179, 593–601. [Google Scholar] [CrossRef] [Green Version]
  3. Wiener, P.; Sánchez-Molano, E.; Clements, D.N.; Woolliams, J.A.; Haskell, M.J.; Blott, S.C. Genomic data illuminates demography, genetic structure and selection of a popular dog breed. BMC Genom. 2017, 18, 609. [Google Scholar] [CrossRef]
  4. Kardos, M.; Luikart, G.; Allendorf, F.W. Measuring individual inbreeding in the age of genomics: Marker-based measures are better than pedigrees. Heredity 2015, 115, 63. [Google Scholar] [CrossRef]
  5. Ubbink, G.; Knol, B.; Bouw, J. The relationship between homozygosity and the occurrence of specific diseases in Bouvier Belge des Flandres dogs in the Netherlands: Inbreeding and disease in the bouvier dog. Vet. Q. 1992, 14, 137–140. [Google Scholar] [CrossRef]
  6. Statistic of Born Dogs in VDH. Available online: https://www.vdh.de/ueber-den-vdh/welpenstatistik/ (accessed on 23 April 2020).
  7. Räber, H. Enzyklopädie der Rassehunde; Franckh-Kosmos Verlag: Stuttgart, Germany, 1993. [Google Scholar]
  8. FCI Breeds Nomenclature. Available online: http://www.fci.be/en/nomenclature/POLISH-LOWLAND-SHEEPDOG-251.html (accessed on 23 April 2020).
  9. Shariflou, M.R.; James, J.W.; Nicholas, F.W.; Wade, C.M. A genealogical survey of Australian registered dog breeds. Vet. J. 2011, 189, 203–210. [Google Scholar] [CrossRef]
  10. Bell, J.; Cavanagh, K.; Tilley, L.; Smith, F.W. Veterinary Medical Guide to Dog and Cat Breeds; Teton NewMedia: Jackson, WY, USA, 2012. [Google Scholar]
  11. Polish Lowland Sheepdog Statistics. Available online: https://www.ofa.org/diseases/breed-statistics#detail (accessed on 24 July 2020).
  12. Svensson, M.; Olsén, L.; Winkler, P.A.; Petersen--Jones, S.M.; Bergström, T.; Garncarz, Y.; Narfström, K. Progressive retinal atrophy in the Polski Owczarek Nizinny dog: A clinical and genetic study. Vet. Ophthalmol. 2016, 19, 195–205. [Google Scholar] [CrossRef]
  13. Gutiérrez, J.; Goyache, F. A note on ENDOG: A computer program for analysing pedigree information. J. Anim. Breed. Genet. 2005, 122, 172–176. [Google Scholar] [CrossRef] [Green Version]
  14. Maignel, L.; Boichard, D.; Verrier, E. Genetic variability of French dairy breeds estimated from pedigree information. Interbull Bull. 1996, 14, 49. [Google Scholar]
  15. MacCluer, J.W.; Boyce, A.J.; Dyke, B.; Weitkamp, L.R.; Pfenning, D.W.; Parsons, C.J. Inbreeding and pedigree structure in Standardbred horses. J. Hered. 1983, 74, 394–399. [Google Scholar] [CrossRef]
  16. James, J. A note on selection differential and generation length when generations overlap. Anim. Sci. 1977, 24, 109–112. [Google Scholar] [CrossRef]
  17. Boichard, D.; Maignel, L.; Verrier, E. The value of using probabilities of gene origin to measure genetic variability in a population. Genet. Sel. Evol. 1997, 29, 5. [Google Scholar] [CrossRef]
  18. Lacy, R.C. Analysis of founder representation in pedigrees: Founder equivalents and founder genome equivalents. Zoo Biol. 1989, 8, 111–123. [Google Scholar] [CrossRef]
  19. Boichard, D. PEDIG: A fortran package for pedigree analysis suited for large populations. In Proceedings of the 7th World Congress on Genetics Applied to Livestock Production, Montpellier, France, 19–23 August 2020; pp. 525–528. [Google Scholar]
  20. Leroy, G.; Baumung, R. Mating practices and the dissemination of genetic disorders in domestic animals, based on the example of dog breeding. Anim. Genet. 2011, 42, 66–74. [Google Scholar] [CrossRef]
  21. Meuwissen, T.; Luo, Z. Computing inbreeding coefficients in large populations. Genet. Sel. Evol. 1992, 24, 305–313. [Google Scholar] [CrossRef]
  22. Wright, S. The relation of livestock breeding to theories of evolution. J. Anim. Sci. 1978, 46, 1192–1200. [Google Scholar] [CrossRef]
  23. Caballero, A.; Toro, M.A. Interrelations between effective population size and other pedigree tools for the management of conserved populations. Genet. Res. 2000, 75, 331–343. [Google Scholar] [CrossRef]
  24. Caballero, A.; Toro, M.A. Analysis of genetic diversity for the management of conserved subdivided populations. Conserv. Genet. 2002, 3, 289–299. [Google Scholar] [CrossRef]
  25. Gutiérrez, J.; Cervantes, I.; Goyache, F. Improving the estimation of realized effective population sizes in farm animals. J. Anim. Breed. Genet. 2009, 126, 327–332. [Google Scholar] [CrossRef]
  26. Cervantes, I.; Goyache, F.; Molina, A.; Valera, M.; Gutiérrez, J. Estimation of effective population size from the rate of coancestry in pedigreed populations. J. Anim. Breed. Genet. 2011, 128, 56–63. [Google Scholar] [CrossRef]
  27. Gutiérrez, J.P.; Cervantes, I.; Molina, A.; Valera, M.; Goyache, F. Individual increase in inbreeding allows estimating effective sizes from pedigrees. Genet. Sel. Evol. 2008, 40, 359. [Google Scholar] [CrossRef]
  28. Pfahler, S.; Distl, O. Effective Population Size, Extended Linkage Disequilibrium and Signatures of Selection in the Rare Dog Breed Lundehund. PLoS ONE 2015, 10, e0122680. [Google Scholar] [CrossRef]
  29. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience 2015, 4, s13742-015. [Google Scholar] [CrossRef]
  30. Lencz, T.; Lambert, C.; DeRosse, P.; Burdick, K.E.; Morgan, T.V.; Kane, J.M.; Kucherlapati, R.; Malhotra, A.K. Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia. Proc. Natl. Acad. Sci. USA 2007, 104, 19942–19947. [Google Scholar] [CrossRef] [Green Version]
  31. Purfield, D.C.; Berry, D.P.; McParland, S.; Bradley, D.G. Runs of homozygosity and population history in cattle. BMC Genet. 2012, 13, 70. [Google Scholar] [CrossRef] [Green Version]
  32. Howrigan, D.P.; Simonson, M.A.; Keller, M.C. Detecting autozygosity through runs of homozygosity: A comparison of three autozygosity detection algorithms. BMC Genom. 2011, 12, 460. [Google Scholar] [CrossRef] [Green Version]
  33. Vasiliadis, D.; Metzger, J.; Distl, O. Demographic assessment of the Dalmatian dog–effective population size, linkage disequilibrium and inbreeding coefficients. Canine Med. Genet. 2020, 7, 1–11. [Google Scholar] [CrossRef]
  34. Sams, A.J.; Boyko, A.R. Fine-scale resolution of runs of homozygosity reveal patterns of inbreeding and substantial overlap with recessive disease genotypes in domestic dogs. G3 Genes Genomes Genet. 2019, 9, 117–123. [Google Scholar] [CrossRef] [Green Version]
  35. FCI-Rassestandard Polski Owczarek Nizinny (Polski Owczarek Nizinny (PON)). Available online: https://www.vdh.de/welpen/mein-welpe/polski-owczarek-nizinny-pon (accessed on 15 May 2019).
  36. R Core Team. R: A Language and Environment for Statistical Computing; Version 3.6.3; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  37. McQuillan, R.; Leutenegger, A.-L.; Abdel-Rahman, R.; Franklin, C.S.; Pericic, M.; Barac-Lauc, L.; Smolej-Narancic, N.; Janicijevic, B.; Polasek, O.; Tenesa, A. Runs of homozygosity in European populations. Am. J. Hum. Genet. 2008, 83, 359–372. [Google Scholar] [CrossRef] [Green Version]
  38. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.; Daly, M.J. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [Green Version]
  39. Sved, J. Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theor. Popul. Biol. 1971, 2, 125–141. [Google Scholar] [CrossRef]
  40. Steinig, E.J.; Neuditschko, M.; Khatkar, M.S.; Raadsma, H.W.; Zenger, K.R. NetView P: A network visualization tool to unravel complex population structure using genome--wide SNPs. Mol. Ecol. Resour. 2016, 16, 216–227. [Google Scholar] [CrossRef] [PubMed]
  41. Neuditschko, M.; Khatkar, M.S.; Raadsma, H.W. NetView: A high-definition network-visualization approach to detect fine-scale population structures from genome-wide patterns of variation. PLoS ONE 2012, 7, e48375. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
  43. Mi, H.; Muruganujan, A.; Ebert, D.; Huang, X.; Thomas, P.D. PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 2018, 47, D419–D426. [Google Scholar] [CrossRef]
  44. McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University (Baltimore, MD). Online Mendelian Inheritance in Man, OMIM®. Available online: https://omim.org/ (accessed on 12 August 2020).
  45. Sydney School of Veterinary Science. Online Mendelian Inheritance in Animals, OMIA. Available online: https://omia.org/browse/ (accessed on 12 August 2020).
  46. Todhunter, R.J.; Mateescu, R.; Lust, G.; Burton-Wurster, N.I.; Dykes, N.L.; Bliss, S.P.; Williams, A.J.; Vernier-Singer, M.; Corey, E.; Harjes, C.; et al. Quantitative trait loci for hip dysplasia in a crossbreed canine pedigree. Mamm. Genome 2005, 16, 720–730. [Google Scholar] [CrossRef]
  47. Todhunter, R.J.; Garrison, S.J.; Jordan, J.; Hunter, L.; Castelhano, M.G.; Ash, K.; Meyers-Wallen, V.; Krotscheck, U.; Hayward, J.J.; Grenier, J. Gene expression in hip soft tissues in incipient canine hip dysplasia and osteoarthritis. J. Orthop. Res. 2019, 37, 313–324. [Google Scholar] [CrossRef] [Green Version]
  48. Liu, T.; Todhunter, R.J.; Wu, S.; Hou, W.; Mateescu, R.; Zhang, Z.; Burton-Wurster, N.I.; Acland, G.M.; Lust, G.; Wu, R. A random model for mapping imprinted quantitative trait loci in a structured pedigree: An implication for mapping canine hip dysplasia. Genomics 2007, 90, 276–284. [Google Scholar] [CrossRef] [Green Version]
  49. Marschall, Y.; Distl, O. Mapping quantitative trait loci for canine hip dysplasia in German Shepherd dogs. Mamm. Genome 2007, 18, 861–870. [Google Scholar] [CrossRef]
  50. Phavaphutanon, J.; Mateescu, R.G.; Tsai, K.L.; Schweitzer, P.A.; Corey, E.E.; Vernier-Singer, M.A.; Williams, A.J.; Dykes, N.L.; Murphy, K.E.; Lust, G.; et al. Evaluation of quantitative trait loci for hip dysplasia in Labrador Retrievers. Am. J. Vet. Res. 2009, 70, 1094–1101. [Google Scholar] [CrossRef]
  51. Fels, L.; Marschall, Y.; Philipp, U.; Distl, O. Multiple loci associated with canine hip dysplasia (CHD) in German shepherd dogs. Mamm. Genome 2014, 25, 262–269. [Google Scholar] [CrossRef] [PubMed]
  52. Fels, L.; Distl, O. Identification and Validation of Quantitative Trait Loci (QTL) for Canine Hip Dysplasia (CHD) in German Shepherd Dogs. PLoS ONE 2014, 9, e96618. [Google Scholar] [CrossRef] [PubMed]
  53. Johns Hopkins University, Baltimore, MD. Online Mendelian Inheritance in Man, OMIM®. MIM Number: 601667. Available online: https://omim.org/entry/601667?search=601667&highlight=601667 (accessed on 12 August 2020).
  54. Leroy, G.; Verrier, E.; Meriaux, J.-C.; Rognon, X. Genetic diversity of dog breeds: Within--breed diversity comparing genealogical and molecular data. Anim. Genet. 2009, 40, 323–332. [Google Scholar] [CrossRef]
  55. Mäki, K. Population structure and genetic diversity of worldwide Nova Scotia Duck Tolling Retriever and Lancashire Heeler dog populations. J. Anim. Breed. Genet. 2010, 127, 318–326. [Google Scholar] [CrossRef] [PubMed]
  56. Silió, L.; Rodríguez, M.; Fernández, A.; Barragán, C.; Benítez, R.; Óvilo, C.; Fernández, A. Measuring inbreeding and inbreeding depression on pig growth from pedigree or SNP-Derived metrics. J. Anim. Breed. Genet. 2013, 130, 349–360. [Google Scholar] [CrossRef]
  57. Zhang, Q.; Calus, M.P.; Guldbrandtsen, B.; Lund, M.S.; Sahana, G. Estimation of inbreeding using pedigree, 50k SNP chip genotypes and full sequence data in three cattle breeds. BMC Genet. 2015, 16, 88. [Google Scholar] [CrossRef] [Green Version]
  58. Bjelland, D.; Weigel, K.; Vukasinovic, N.; Nkrumah, J. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding. J. Dairy Sci. 2013, 96, 4697–4706. [Google Scholar] [CrossRef]
  59. Lutaaya, B.E.; Misztal, I.; Bertrand, J.; Mabry, J. Inbreeding in populations with incomplete pedigrees. J. Anim. Breed. Genet. 1999, 116, 475–480. [Google Scholar] [CrossRef]
  60. Cassell, B.; Adamec, V.; Pearson, R. Effect of incomplete pedigrees on estimates of inbreeding and inbreeding depression for days to first service and summit milk yield in Holsteins and Jerseys. J. Dairy Sci. 2003, 86, 2967–2976. [Google Scholar] [CrossRef]
  61. Nourhaghighi, N.; Teichert-Kuliszewska, K.; Davis, J.; Stewart, D.J.; Nag, S. Altered expression of angiopoietins during blood-brain barrier breakdown and angiogenesis. Lab. Investig. 2003, 83, 1211–1222. [Google Scholar] [CrossRef] [Green Version]
  62. Alam, M.; Han, K.; Lee, D.; Ha, J.; Kim, J. Estimation of effective population size in the Sapsaree: A Korean native dog (Canis familiaris). Asian Australas. J. Anim. Sci. 2012, 25, 1063. [Google Scholar] [CrossRef] [Green Version]
  63. DGfZ. Empfehlungen zur Erhaltung genetischer Vielfalt bei einheimischen Nutztieren. Züchtungskunde 1991, 63, 426–430. [Google Scholar]
  64. Bijma, P. Long-Term Genetic Contributions: Prediction of Rates of Inbreeding and Genetic Gain in Selected Populations; Department of Animal Sciences, Wageningen University: Wageningen, The Netherlands, 2000. [Google Scholar]
  65. Simon, D.L.; Buchenauer, D. Genetic Diversity of European Livestock Breeds; Wageningen Pers: Wageningen, The Netherlands, 1993. [Google Scholar]
Figure 1. Mean equivalent complete generations by birth years of Polish Lowland Sheepdogs (PONs). The mean equivalent complete generations across all 8628 PONs was 8.73 generations.
Figure 1. Mean equivalent complete generations by birth years of Polish Lowland Sheepdogs (PONs). The mean equivalent complete generations across all 8628 PONs was 8.73 generations.
Animals 10 01520 g001
Figure 2. Mean complete generations by the year of birth of Polish Lowland Sheepdogs (PONs).
Figure 2. Mean complete generations by the year of birth of Polish Lowland Sheepdogs (PONs).
Animals 10 01520 g002
Figure 3. Inbreeding coefficient (FPed) and fixation index FIS_Ped by the year of birth of the Polish Lowland Sheepdog reference population.
Figure 3. Inbreeding coefficient (FPed) and fixation index FIS_Ped by the year of birth of the Polish Lowland Sheepdog reference population.
Animals 10 01520 g003
Figure 4. Realized effective population size ( N e ¯ ) by the year of birth of the Polish Lowland Sheepdog reference population.
Figure 4. Realized effective population size ( N e ¯ ) by the year of birth of the Polish Lowland Sheepdog reference population.
Animals 10 01520 g004
Figure 5. Number of detected runs of homozygosity (ROHs) per length category for ROH20 and ROH50.
Figure 5. Number of detected runs of homozygosity (ROHs) per length category for ROH20 and ROH50.
Animals 10 01520 g005
Figure 6. Distribution of the genomic inbreeding coefficients FROH20 and FROH50 and the fixation index FIS_SNP in the genotyped Polish Lowland Sheepdogs. Outliers are given with dots.
Figure 6. Distribution of the genomic inbreeding coefficients FROH20 and FROH50 and the fixation index FIS_SNP in the genotyped Polish Lowland Sheepdogs. Outliers are given with dots.
Animals 10 01520 g006
Figure 7. Effective population size (Ne_SNP) of the Polish Lowland Sheepdogs in the last 20 generations based on the linkage disequilibrium among single nucleotide polymorphism (SNP) alleles per chromosome.
Figure 7. Effective population size (Ne_SNP) of the Polish Lowland Sheepdogs in the last 20 generations based on the linkage disequilibrium among single nucleotide polymorphism (SNP) alleles per chromosome.
Animals 10 01520 g007
Figure 8. Increase in inbreeding (ΔFSNP) in the Polish Lowland Sheepdogs in the last 20 generations based on the linkage disequilibrium among single nucleotide polymorphism (SNP) alleles per chromosome.
Figure 8. Increase in inbreeding (ΔFSNP) in the Polish Lowland Sheepdogs in the last 20 generations based on the linkage disequilibrium among single nucleotide polymorphism (SNP) alleles per chromosome.
Animals 10 01520 g008
Table 1. Overview of the different definitions of runs of homozygosity (ROHs) in this study.
Table 1. Overview of the different definitions of runs of homozygosity (ROHs) in this study.
Term for DefinitionThresholds
Length (kb)Minimum SNPsMaximum Missing SNPs
ROH56052
ROH78472
ROH10120102
ROH20240202
ROH30360303
ROH40480404
ROH50600504
ROH65780655
ROH35843003585
Given are the minimum length, the numbers of minimum single nucleotide polymorphisms (SNPs) and the maximum number of missing SNPs in the region for each ROH-definition. Heterozygous SNPs were not allowed in any definition.
Table 2. Results of the pedigree-based inbreeding measures calculation for the reference population born from 1990 to 2018 and the genotyped Polish Lowland Sheepdogs. All available pedigree data included 8628 animals.
Table 2. Results of the pedigree-based inbreeding measures calculation for the reference population born from 1990 to 2018 and the genotyped Polish Lowland Sheepdogs. All available pedigree data included 8628 animals.
Inbreeding Measures Population
1990–2018
Genotyped PONs
Number of animals4920153
Mean FPed0.180.18
Average coancestry0.170.19
Mean FIS_Ped 0.011−0.014
Mean equivalent generations (EqG)10.0911.23
Completeness of ancestors in the 5th parental generation0.93–0.990.86–1.00
Mean individual increase in inbreeding (ΔFPed_i)0.0230.019
Realized effective population size ( N e ¯ ) (±SE)22.16 ± 1.4626.17 ± 0.00
Effective population size by paired increase in coancestry ( Δ c j k ) (±SE)24.40 ± 0.44
Number of founders (f)10348
Number of effective founders (fe)1010
Number of ancestors (a)10229
Number of effective ancestors (fa)66
Number of ancestors explaining 55% of genetic diversity22
Number of ancestors explaining 75% of genetic diversity55
Founder genome equivalents (Ng)3.01 ± 0.652.81 ± 0.67
Mean generation interval in years (±SE)4.67 ± 2.054.95 ± 1.82
Table 3. Overview of the amount and length of runs of homozygosity (ROHs). A ROH was defined in nine different ways. For all definitions the number and length of detected ROHs per individual and detected consensus ROHs are given, respectively. The inbreeding coefficient FROH was estimated for all definitions. In total, the genotyped Single Nucleotide Polymorphisms covered 2,201,666,442 bp of the autosomal dog genome.
Table 3. Overview of the amount and length of runs of homozygosity (ROHs). A ROH was defined in nine different ways. For all definitions the number and length of detected ROHs per individual and detected consensus ROHs are given, respectively. The inbreeding coefficient FROH was estimated for all definitions. In total, the genotyped Single Nucleotide Polymorphisms covered 2,201,666,442 bp of the autosomal dog genome.
Definition ROHOverall Number of ROHsLength of Shortest ROH (Kb)Mean Length of ROH (Mb)Length of Longest ROH (Mb)Average Cumulative Length of All ROHs (Mb)Mean FROH (and Its Range)
ROH527,27661.443.7682.92669.680.30 (0.12–0.46)
ROH727,27494.393.7682.92669.670.30 (0.12–0.46)
ROH1027,267126.143.7682.92669.670.30 (0.12–0.46)
ROH2027,249241.703.7682.92669.650.30 (0.12–0.46)
ROH3027,042375.583.8582.92680.150.31 (0.16–0.47)
ROH4027,114480.663.8682.92684.870.31 (0.17–0.47)
ROH5025,888600.014.0282.92679.940.31 (0.16–0.47)
ROH6521,391780.294.7282.92659.320.30 (0.15–0.46)
ROH35867644316.6410.4482.92461.340.21 (0.00–0.41)
Table 4. Overview of the amount and length of runs of homozygosity (ROHs) shared by at least 75% of the genotyped dogs. A ROH was defined in nine different ways. For all definitions the number (n) of ROHs, the mean length of detected ROHs and the cumulative length of all ROHs are given.
Table 4. Overview of the amount and length of runs of homozygosity (ROHs) shared by at least 75% of the genotyped dogs. A ROH was defined in nine different ways. For all definitions the number (n) of ROHs, the mean length of detected ROHs and the cumulative length of all ROHs are given.
Definition ROHnMean Length (kb)Cumulative Length of All ROHs (Mb)
ROH58731.685.85
ROH78731.685.85
ROH108731.685.85
ROH208731.685.85
ROH308778.556.23
ROH408778.506.23
ROH507813.265.69
ROH654881.863.53
ROH3580--
Table 5. Pearson correlation coefficients among inbreeding coefficients. The correlation coefficients with a p-value < 0.0001 between the pedigree inbreeding coefficient FPED, the genomic inbreeding coefficients FROH for the different lengths of runs of homozygosity (ROHs), and the fixation index FIS_SNP are given.
Table 5. Pearson correlation coefficients among inbreeding coefficients. The correlation coefficients with a p-value < 0.0001 between the pedigree inbreeding coefficient FPED, the genomic inbreeding coefficients FROH for the different lengths of runs of homozygosity (ROHs), and the fixation index FIS_SNP are given.
Inbreeding MeasuresFPEDFIS_SNPFROH5/7/10/20FROH30FROH40FROH50FROH65FROH358
FPED10.500.350.390.420.410.390.25
FIS_SNP0.5010.820.890.920.910.890.57
FROH5/7/10/200.350.8210.990.970.980.990.88
FROH300.390.890.9911110.82
FROH400.420.920.9711110.79
FROH500.410.910.9811110.80
FROH650.390.890.9911110.83
FROH3580.250.570.880.820.790.800.831
Table 6. Pearson correlation coefficients between the genomic inbreeding coefficient FROH20 or FROH50 and the pedigree inbreeding coefficient FPed by number of complete generations. The correlation coefficient with a p-value < 0.0001 between FROH20 or FROH50 and FPed is given. Either all dogs or those with more than three, four, five, or six complete generations in the dataset were regarded in the calculations.
Table 6. Pearson correlation coefficients between the genomic inbreeding coefficient FROH20 or FROH50 and the pedigree inbreeding coefficient FPed by number of complete generations. The correlation coefficient with a p-value < 0.0001 between FROH20 or FROH50 and FPed is given. Either all dogs or those with more than three, four, five, or six complete generations in the dataset were regarded in the calculations.
Complete GenerationsAll>3>4>5>6
Correlation FROH200.350.370.400.400.46
Correlation FROH500.410.420.460.460.52
N15315114211389

Share and Cite

MDPI and ACS Style

Michels, P.W.; Distl, O. Genetic Variability in Polish Lowland Sheepdogs Assessed by Pedigree and Genomic Data. Animals 2020, 10, 1520. https://doi.org/10.3390/ani10091520

AMA Style

Michels PW, Distl O. Genetic Variability in Polish Lowland Sheepdogs Assessed by Pedigree and Genomic Data. Animals. 2020; 10(9):1520. https://doi.org/10.3390/ani10091520

Chicago/Turabian Style

Michels, Paula Wiebke, and Ottmar Distl. 2020. "Genetic Variability in Polish Lowland Sheepdogs Assessed by Pedigree and Genomic Data" Animals 10, no. 9: 1520. https://doi.org/10.3390/ani10091520

APA Style

Michels, P. W., & Distl, O. (2020). Genetic Variability in Polish Lowland Sheepdogs Assessed by Pedigree and Genomic Data. Animals, 10(9), 1520. https://doi.org/10.3390/ani10091520

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop