Article

Defining a Haplotype Encompassing the LCORL-NCAPG Locus Associated with Increased Lean Growth in Beef Cattle

1
UTIA Genomics Center for the Advancement of Agriculture, Institute of Agriculture, University of Tennessee, Knoxville, TN 37996, USA
2
Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
*
Author to whom correspondence should be addressed.
Genes 2024, 15(5), 576; https://doi.org/10.3390/genes15050576
Submission received: 29 March 2024 / Revised: 23 April 2024 / Accepted: 28 April 2024 / Published: 30 April 2024
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

Numerous studies have shown that genetic variation at the LCORL-NCAPG locus is strongly associated with growth traits in beef cattle. However, a causative molecular variant has yet to be identified. To define all possible candidate variants, 34 Charolais-sired calves were whole-genome sequenced, including 17 homozygous for a long-range haplotype associated with increased growth (QQ) and 17 homozygous for potential ancestral haplotypes for this region (qq). The Q haplotype was refined to an 814 kb region between chr6:37,199,897–38,014,080 and contained 218 variants not found in qq individuals. These variants include an insertion in an intron of NCAPG, a previously documented mutation in NCAPG (rs109570900), two coding sequence mutations in LCORL (rs109696064 and rs384548488), and 15 variants located within ATAC peaks that were predicted to affect transcription factor binding. Notably, rs384548488 is a frameshift variant likely resulting in loss of function for long isoforms of LCORL. To test the association of the coding sequence variants of LCORL with phenotype, 405 cattle from five populations were genotyped. The two variants were in complete linkage disequilibrium. Statistical analysis of the three populations that contained QQ animals revealed significant (p < 0.05) associations between genotype and birth weight, live weight, carcass weight, hip height, and average daily gain. These findings affirm the link between this locus and growth in beef cattle and describe DNA variants that define the haplotype. However, further studies will be required to define the true causative mutation.

1. Introduction

Body size is a trait frequently subject to selection in domesticated animals. This trait is of particular importance in species raised for meat as body size directly correlates to the size of the carcass and thus quantity of meat produced per animal. In beef cattle, body size can generally be described as a highly complex polygenic trait, with even the most impactful DNA variants explaining only a modest fraction of the variance observed [1]. However, there are a select few loci that can be considered highly impactful for body size, one of them being the LCORL-NCAPG locus on bovine chromosome 6. Over the years, many studies have indicated this locus as being associated with stature [1,2,3,4], birth weight [5,6,7,8], and carcass weight [9,10,11,12], as well as other carcass characteristics such as increased ribeye area and reduced adiposity [13], suggesting a potential role in increasing lean growth. These findings are supported by research in other species, including humans [14,15,16], dogs [17,18], horses [19,20,21], and sheep [22,23], which has also found variants surrounding this locus to be associated with increased body size.
However, while there is no doubt that this locus has a significant impact on body size, the biological mechanism mediating this effect remains unclear. The large effect size of this locus resulted in rapid selective sweeps in cattle [1,10,24,25] and other domesticated species [26,27,28]; the ensuing linkage disequilibrium (LD) has long confounded attempts to discern a causative mutation within the selected haplotype. Indeed, it has been shown the same haplotype at this locus is almost fixed in some breeds of cattle such as Brown Swiss and Montbeliard [1]. That same study found that in other breeds such as Charolais, this haplotype is present and very abundant, but not to the exclusion of other possible haplotypes for the region. This locus has been narrowed to a range as small as 591 kb [9], but further refinement remains complicated by the aforementioned LD.
Roughly half of this range encompasses a gene-dense region containing four genes: LCORL, NCAPG, DCAF16, and FAM184B. The other portion of this range is intergenic space upstream of LCORL. Many genome-wide association studies have found the most strongly associated markers with growth to be associated with the intergenic region upstream of LCORL [8,12,29,30], suggesting a mutation affecting the regulation of LCORL transcription may be responsible for the change in phenotype. Earlier studies pointed to a missense mutation in NCAPG [2,31], NCAPGc.1326T>G, but subsequent studies have found this mutation present in q haplotypes as well, implying this SNP is simply in high LD with the causative mutation in certain populations [29,32].
Recent evidence in other species places more weight in favor of LCORL as the gene responsible for this locus’ influence on body size and development. In dogs, a mutation that results in the loss of function of the long isoform of LCORL has been found to be exclusive to medium- and large-sized breeds [18]. A similar mutation exists in large-sized Pakistani goat breeds [26]; however, functional validation of these loss-of-function mutations has yet to be performed in either species. The long isoform of LCORL encodes a protein dubbed PALI2 by Conway et al. [33]. This protein has not been directly characterized as of the time of writing, but it is postulated to have similar activity to PALI1, a protein encoded by LCOR, the paralog of LCORL.
PALI1 acts as an accessory protein to polycomb repressive complex 2 (PRC2), an essential protein complex that is responsible for the creation and maintenance of cellular identity by mono-, di-, and trimethylation of H3K27. This methylation represses transcription and is maintained from early development onward [34]. Disturbances of PRC2 function have been shown to cause abnormalities in body plan formation in animals, demonstrating that this complex plays a key role in this process [35]. The repressive activity of PRC2 is increased when the complex is accessorized with PALI1 [33]. It is thought that due to a shared domain between PALI1 and PALI2, PALI2 can likely interact with PRC2 in the same way; though as of the time of writing, no functional data have been published for PALI2.
It is presently unclear what molecular variation is responsible for the influence this region has on animal growth. Thus, the objective of this study was to comprehensively define the extended haplotype associated with increased growth by determining all potential variants that can be considered in-phase or not-in-phase with the haplotype using 34 Charolais-sired calves, 17 of which are QQ (homozygous for the mutant haplotype associated with increased growth) and 17 of which are qq (homozygous for ancestral haplotypes, that is, not bearing the Q haplotype). These variants were then used to explore the associated effects on phenotype by genotyping a larger, multi-breed population for which several growth and carcass traits had been collected.

2. Materials and Methods

2.1. Sample Collection and Sequencing

Whole-blood samples from 439 cattle across six contemporary groups at the University of Illinois were used, in accordance with the animal care and use committee protocol associated with this project (IACUC Protocol #19118). DNA was extracted from these samples using a classic salting-out procedure [36] for populations 0, 1, 2, and 3. For populations 4 and 5, DNA was extracted using a Quick-DNA kit (Zymo Research, Irvine, CA, USA) following the manufacturer’s protocol.
For the initial objective, 34 Charolais-sired calves were selected for whole-genome sequencing. These animals were previously shown to be homozygous for either the Q haplotype (QQ, n = 17) or q haplotypes (qq, n = 17) within the region of interest [30]. DNA libraries were prepared using an Illumina DNA Prep library kit (Illumina, San Diego, CA, USA), following the manufacturer’s instructions. Libraries were paired-end sequenced using both lanes of a NovaSeq 6000 S2 flowcell at 2 × 150 cycles.
Read quality was assessed using FastQC version 0.11.9 [37], and trimming was carried out using Trimmomatic version 0.39 [38] in paired-end mode using the following parameters: HEADCROP:1 ILLUMINACLIP:2:30:10 LEADING:28 SLIDINGWINDOW:15:28 MINLEN:75. Reads were aligned to the ARS-UCD 2.0 assembly of the bovine genome using BWA-MEM [39] with default settings. Alignments with a MAPQ below 30 were filtered out to remove poor-quality mappings and multi-mapped reads, before being passed to GATK 4.2.2.0 for final processing and variant calling [40]. Duplicates were flagged using MarkDuplicatesSpark and variants were called using HaplotypeCaller, both using default parameters. The variant call files were merged with GenomicsDBImport and joint genotyped with GenotypeGVCFs to create a combined VCF with all animal genotypes. Lastly, variants were hard-filtered with VariantFiltration using the following filters: “QD < 5.0”, “SOR > 2.5”, “FS > 20.0”, and “MQ < 50.0”. Mapping Quality Rank Sum (MQRS) and Read Position Rank Sum (ReadPosRankSum) were not used because they cannot be calculated when no samples are heterozygous for a variant. The choice of QQ and qq animals for sequencing meant that the variants of interest would not be heterozygous, so filtering by those parameters would have removed them. For similar reasons, as well as for quality control, multiallelic variants were also removed.
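The hard-filter thresholds listed above can be expressed as a simple predicate. The Python sketch below is illustrative only and is not the GATK workflow used in the study: the function names and the dictionary representation of a variant's annotations are assumptions made for this example, as is the choice to let a missing annotation pass rather than fail.

```python
# Thresholds quoted in the text; a variant FAILS a filter when its expression is true.
HARD_FILTERS = {
    "QD": lambda value: value < 5.0,
    "SOR": lambda value: value > 2.5,
    "FS": lambda value: value > 20.0,
    "MQ": lambda value: value < 50.0,
}


def failed_filters(info):
    """Return the names of the hard filters that a variant fails.

    `info` is a hypothetical dict of annotation name -> numeric value.
    """
    failed = []
    for name, fails in HARD_FILTERS.items():
        value = info.get(name)
        # Assumption of this sketch: a missing annotation does not fail the filter.
        if value is not None and fails(value):
            failed.append(name)
    return failed


def passes_hard_filters(info, alt_alleles):
    """Keep only biallelic variants (one ALT allele) that fail no hard filter."""
    return len(alt_alleles) == 1 and not failed_filters(info)
```

For example, a record with QD = 4.0 and FS = 25.0 would be reported as failing both of those filters, and a record with two alternate alleles would be dropped regardless of its annotation values.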

2.2. Identification of Haplotype-Defining Mutations

To identify the variants that could be causing the QQ growth phenotype, a subtractive approach was taken. Using 17 QQ and 17 qq animals known from previous haplotype and phenotype information, the haplotype in this population was refined to a span between chr6:37,199,897–38,014,080, where almost all variants are entirely fixed in QQ individuals. To assess the validity of variants that did not present as fixed, they were visualized in Integrative Genomics Viewer (IGV) [41]. If called variants could be attributed to a single read or PCR duplicate or were found in an area where alignment might be impeded such as a structural variant or repetitive element, they were removed from the final list of variants for consideration. Filtering based on these criteria removed all but 5 variants that were not in phase with the Q haplotype. To determine variants in the haplotype that defined Q, the list was further narrowed to variants fixed in QQ animals, but where all qq individuals were homozygous for the opposite allele. Under the assumption that Dominette was qq, any variants where the Q-exclusive variant matched the reference were also removed. By filtering based on these criteria, the potential variant set was refined to variants exclusive to and completely in phase with the Q haplotype in this population.
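The subtractive logic described above can be sketched as a small classifier. This is a minimal illustration under stated assumptions, not the authors' code: genotypes are assumed to be encoded as alternate-allele counts relative to the reference (0, 1, or 2), the function and label names are hypothetical, and any missing or heterozygous call is treated as breaking fixation.

```python
HOM_REF, HET, HOM_ALT = 0, 1, 2  # alternate-allele counts relative to the reference


def classify_variant(big_q_genotypes, small_q_genotypes):
    """Classify one biallelic variant from QQ and qq genotype calls.

    big_q_genotypes: calls for the QQ animals; small_q_genotypes: calls for the qq animals.
    """
    big_q = set(big_q_genotypes)
    small_q = set(small_q_genotypes)
    if big_q == {HOM_ALT} and small_q == {HOM_REF}:
        # Fixed in QQ, opposite allele fixed in qq, Q allele differs from the reference.
        return "defining"
    if big_q == {HOM_REF} and small_q == {HOM_ALT}:
        # Removed under the assumption that the reference animal is qq.
        return "Q_allele_matches_reference"
    if big_q in ({HOM_REF}, {HOM_ALT}):
        # Fixed in QQ, but the same allele also occurs in at least one qq animal.
        return "fixed_in_QQ_not_exclusive"
    return "not_fixed_in_QQ"
```

Only variants labeled `defining` would remain in the final candidate set under this scheme; the other labels correspond to the successive exclusion steps in the text.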
Using Ensembl’s Variant Effect Predictor (VEP) [42], this set of variants was annotated based on location in the annotated genome (intronic, coding sequence, untranslated regions, intergenic, etc.), as well as impacts on coding sequence and reading frame. Effects on splicing were predicted with Pangolin [43] using the default settings. To investigate potential changes in transcription factor binding, variants were loaded into the UCSC genome browser as a custom track [44], alongside the bovine Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) peak catalog published by Yuan et al. [45]. Variants located within ATAC-seq peaks and thus potentially affecting transcription factor (TF) binding were identified by visualizing the data in the browser. For further analysis, only variants within peaks reaching a signal score of at least 0.5 were considered. The effects of those variants were further interrogated using the Transcription Factor Binding Site Prediction tool provided by AnimalTFDB4.0 [46] to identify specific TFs that could bind to the affected regions. Find Individual Motif Occurrences (FIMO) [47] was then used to calculate scores for the original and mutated versions of the sequence, using all motifs associated with that transcription factor from CIS-BP [48]. For the final comparison, the motif with the highest FIMO score possible was selected for each TF.
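The final comparison step, in which the best-scoring motif is kept for each TF and the two alleles are compared, might look like the following sketch. It assumes that motif hits for each allele have already been obtained and reduced to (TF name, score) pairs; the function names, the input format, and the labels are hypothetical and do not reflect FIMO's actual output format.

```python
def best_score_per_tf(hits):
    """Keep the highest score observed for each TF.

    hits: iterable of (tf_name, score) pairs, one per motif hit.
    """
    best = {}
    for tf, score in hits:
        if tf not in best or score > best[tf]:
            best[tf] = score
    return best


def compare_alleles(q_hits, big_q_hits):
    """Label each TF as lost, gained, or shared when going from the q to the Q allele."""
    q_best = best_score_per_tf(q_hits)
    big_q_best = best_score_per_tf(big_q_hits)
    changes = {}
    for tf in sorted(set(q_best) | set(big_q_best)):
        if tf not in big_q_best:
            changes[tf] = "lost in Q"
        elif tf not in q_best:
            changes[tf] = "gained in Q"
        else:
            changes[tf] = "shared"
    return changes
```

With placeholder inputs such as `[("TF_A", 8.0), ("TF_A", 11.5), ("TF_B", 9.0)]` for q and `[("TF_B", 7.0), ("TF_C", 10.0)]` for Q, TF_A would be labeled as lost, TF_B as shared, and TF_C as gained. For shared TFs, the two best scores could then be compared directly.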

2.3. Individual Variant Genotyping

To validate the presence of these variants in a broader population and to assess their potential as surrogates for the haplotype and predictors of phenotype, the remaining 405 animals were genotyped for two coding sequence variants found in LCORL exclusive to the Q haplotype, rs109696064 (chr6:g.37403795T) and rs384548488 (chr6:g.37401771_37401772del). For populations 1, 2, and 3, genotypes were called using a PCR-RFLP assay. Primers and enzymes used are described in Table S1. PCR was conducted in 20 µL reactions containing 100 ng genomic DNA, 0.5 U HotStarTaq DNA polymerase (QIAGEN, Valencia, CA, USA), 1× PCR buffer, 200 µM of each dNTP, and 0.5 µM each of forward and reverse primers. Amplification was performed with an initial incubation at 95 °C for 3 min, followed by 34 cycles of 94 °C for 20 s, 45 s, and 72 °C for 45 s, with a final incubation at 72 °C for 5 min. Following PCR, 10 µL of master mix (1.5 µL of 10× enzyme buffer, 5 units of restriction enzyme, and 8 µL nuclease-free water) was added to each PCR reaction and incubated for one hour at 37 °C. The digested fragments were subjected to electrophoresis in 1.5% agarose, 0.5× tris-borate-EDTA gels with 0.1 µg/mL ethidium bromide. Genotypes were visualized by UV illumination. For populations 4 and 5, a fluorescent 5′–3′ exonuclease assay was performed using primers and probes as described in Table S2. Quantitative PCR was performed in 10 µL reactions containing 1× PrimeTime™ Gene Expression Master Mix (Integrated DNA Technologies, Coralville, IA, USA), 0.5 µM each of forward and reverse primers, and 0.2 µM of each allele-specific probe. Amplification was performed with an initial incubation at 95 °C for 3 min followed by 40 cycles of 95 °C for 15 s and 60 °C for 45 s. Genotypes were called using CFX Maestro software 2.3, version 5.3.022 (Bio-Rad Laboratories, Hercules, CA, USA).

2.4. Statistical Analysis

Statistical analyses were performed in R version 4.3.2. lmer() [49] was used to construct linear mixed-effect models with genotype and sex as fixed effects and farm as a random effect. Due to a lack of individuals bearing the alleles used as surrogates for the QQ genotype, populations 1 and 3 were not used for analysis. The final models for the first set were constructed using populations 2, 4, and 5 combined. Due to the difference in background genetics in these populations, population was used as a random effect to account for potential epistatic effects arising from those differences, as well as other differences that could not be accounted for between population groups, such as environment. Genotype and sex were used as fixed effects. Association was tested between genotype and 13 phenotypes: birth weight (BW), adjusted weaning weight (WW), three weight points through life (W1, W2, and W3), average daily gain (ADG), hip height (HH), dry matter intake (DMI), hot carcass weight (HCW), backfat thickness (BF), ribeye area (REA), kidney pelvic heart fat (KPH), and marbling (MB). As of the writing of this manuscript, DMI and carcass phenotypes were not available for populations 4 and 5, so only population 2 was considered for DMI, HCW, BF, REA, KPH, and MB. Additionally, only steers had HH and W3 measured in populations 4 and 5, although there were measurements from steers and heifers for these traits in population 2. Due to being calved considerably later than their contemporaries, five animals from population 4 and 34 from population 5 were not used for analyses. To adjust for multiple testing, the Benjamini–Hochberg correction was used [50]. All phenotypes within classes broken down by population and sex passed the Shapiro–Wilk normality test (p > 0.05), with the exception of W2, ADG, and MB for the steers of population 2, and birth weight for the steers of populations 4 and 5.
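For readers unfamiliar with the Benjamini–Hochberg procedure, the adjustment applied across the tested phenotypes can be sketched in a few lines. The study itself was run in R; the Python function below is a standalone illustration of the standard step-up adjustment, and the function name is an assumption of this example.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Each raw p-value is scaled by the number of tests divided by its rank, and the running minimum ensures that a smaller raw p-value never receives a larger adjusted value than a larger one. An association is then declared significant when its adjusted value falls below the chosen threshold (0.05 in this study).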

3. Results

3.1. Defining Mutations Exclusive to the QQ Haplotype

Thirty-four Charolais calves were whole-genome sequenced (17 QQ and 17 qq) with the goal of building a complete list of the potential causative mutations to better understand the functional impact of this haplotype. An average of 119.1 million read pairs were generated per sample, with 103.3 million read pairs remaining after trimming. After the removal of reads with a mapping quality below 30, each sample had 170.7 million reads mapped to the genome on average, resulting in a final coverage of roughly 9.2× for each sample, with the range of coverage for individual samples being between 7.6× and 11.5×.
As the ultimate objective was to investigate the Q haplotype at the LCORL-NCAPG locus, the first step was to identify the span of LD within this population. Based on the evidence from previous studies [9,30], the initial area of exploration spanned positions 37,000,000 to 38,200,000 on BTA6, which contained LAP3, MED28, FAM184B, DCAF16, NCAPG, LCORL, and the 600 kb intergenic region upstream of LCORL. Within this area, an 814 kb region between 37,199,897–38,014,080 was shown to be in complete LD among the QQ animals, removing LAP3, MED28, and a portion of the 3′ end of FAM184B from consideration. In total, 7278 variants were called in this 814 kb region, 147 of which did not present as fixed in the Q haplotype. To validate these mutations, they were visualized in IGV. Of these variant calls, 81 could be attributed to a single read or PCR duplicate. Another 35 were in repetitive regions and could be construed as issues with alignment. There were three regions containing 21 total variants that were the result of failed alignment of larger repetitive regions. Five variants were clustered around apparent structural variants that are in phase with the haplotype, and the remaining five (rs109576691, rs379524098, chr6:g.37726299T>A, rs109270787, and rs383633472) did not fit any of these criteria and could be true germline variants existing within the Q haplotype or somatic mutations within individual animals. None of these five variants presented as being due to recombination events; the Q haplotype continued uninterrupted on either side of the variant in the individuals where they did not fit the defined Q genotype. Because of this, these variants were not useful in refining the Q haplotype further, nor are they likely to be causative.
After quality control pruning, a total of 7131 SNPs and indels were identified as being fixed among QQ animals in this region. To be considered a variant ‘defining’ the haplotype, a variant had to be completely absent in qq individuals. In other words, if QQ animals were homozygous for one allele, the qq cattle had to be homozygous for the opposite allele. There were 217 variants that met these criteria. These variants were subject to further investigation to identify potential causative mechanisms using three prediction methods: VEP, for coding sequence and effects on reading frame; Pangolin, to detect changes in splicing sites; and combining published ATAC-seq data for cattle with FIMO predictions of binding affinity for transcription factors obtained from AnimalTFDB.
Of the 217 variants, only 22 were within the coding sequence or an ATAC peak (Table 1). No variants had a notable impact on splicing; the highest increase in splice probability score calculated by Pangolin was 0.02, and no decreases in splice score were observed. Of these 22 variants, three were detected in coding sequence: rs109570900 (the NCAPGc.1326T>G seen in previous studies), rs384548488, and rs109696064. SIFT scores calculated for the SNPs indicate that rs109696064 is tolerated (0.38), but rs109570900 is deleterious (0.01). The last coding sequence variant, rs384548488, is a frameshift that results in a truncation of the long isoform of the LCORL protein, so it is likely to be impactful. The other 19 variants were found in 18 ATAC peaks. Notably, rs109114124 and rs109092727 were found in the promoter region directly upstream of LCORL. However, neither of the variants in this region was in the major promoter ATAC peak; both were in smaller adjacent peaks. The variant rs109145748 was located in the promoter for FAM184B, but the other variants were distributed among intronic and intergenic regions. A summary of the ATAC peaks these variants are in can be found in Table 2, and the differential transcription factor binding between the Q and q alleles for these variants is shown in Table 3.
While four of the variants in ATAC peaks had no predicted change in transcription factor affinity or any transcription factor affinity in their site, the remaining 15 had some degree of change. Most of the ATAC peaks harboring variants were relatively small, with only six having a signal score greater than 0.5. Notably for the variants near the LCORL promoter, although rs109092727 did not have any motif hits, the Q allele for rs109114124 results in a loss of affinity for EGR1 and EGR2 among others, but simultaneously a gain in affinity for FOXA1. A variant located in an embryonic ATAC peak, rs110458346, was also remarkable for its mutation causing a loss of affinity for all TFs predicted for that region in qq, including the thyroid hormone receptors THRA and THRB, as well as SRF.

3.2. Discovery of Structural Variants

As part of quality control, regions with ambiguous calls were visualized in IGV. This led to the discovery of three apparent structural variants: (1) a 157 bp deletion within the first intron of NCAPG (chr6:37,326,536–37,326,693) (Figure S1), (2) an insertion within the fifth intron of NCAPG (chr6:37,336,715–37,336,716) (Figure S2), and (3) a small insertion in the intergenic space upstream of LCORL (chr6:37,619,145–37,619,149). As all three of these were within the fixed haplotype region for Q, they were homozygous in all QQ samples. Two of these structural variants were found to be present in qq samples. The smaller insertion upstream of LCORL was very common; 10 of the qq individuals were homozygous for this insertion and three were heterozygous. While rarer, the deletion in intron 1 of NCAPG was also present in qq animals, with four qq animals being heterozygous. However, the insertion within NCAPG seems to be completely absent among qq animals, and thus would qualify as being a defining mutation. This structural variant is displayed alongside all genotypes for the 1.2 Mb region in Figure 1. Mate pairs of reads entering the insertion point at this region appear on multiple different chromosomes, suggesting the insertion may be a repetitive element. The insertion is large enough that no mate-pair reads appear on the opposite side of the insertion point. This makes the insertion challenging to reconstruct using short reads alone, and it is unclear at this time if this insertion would have any impact on splicing or transcription of NCAPG.

3.3. Genotype–Phenotype Relationship

To confirm the existence of some of these mutations in a broader population, and to assess the association between selected SNPs and phenotype, an additional 405 cattle from five populations were genotyped for the rs384548488 and rs109696064 variants. These variants were selected due to their location in the coding sequence, the potentially significant impact of rs384548488, and to assess whether either of these two very closely neighboring variants was found independent of the other. The variants were found to be in complete LD, with all animals being either homozygous ACT-C/ACT-C (as in ancestral haplotypes, or qq), A-T/A-T (QQ), or heterozygous for both (ACT-C/A-T, or Qq). Genotype distribution by population is listed in Table 4. Because of the proximity of these variants, and because these were the only variants genotyped, it cannot be confirmed whether these cattle truly have the extended Q haplotype. However, given the evidence presented in the whole-genome sequenced animals, it may be possible to use these variants as a surrogate for the haplotype.
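The statement that the two variants are in complete LD can be made concrete with the usual pairwise statistic r². The sketch below is illustrative and assumes phased two-locus haplotypes, coded 1 for the Q-associated allele (A at rs384548488, T at rs109696064) and 0 otherwise; the function name and the encoding are assumptions of this example, and double heterozygotes are assumed to carry one ACT-C and one A-T haplotype.

```python
def r_squared(haplotypes):
    """Pairwise LD (r^2) between two biallelic loci from phased haplotypes.

    haplotypes: list of (allele_at_locus_1, allele_at_locus_2), each coded 0 or 1.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n  # frequency of allele 1 at locus 1
    p_b = sum(b for _, b in haplotypes) / n  # frequency of allele 1 at locus 2
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium
    denominator = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denominator == 0:
        raise ValueError("r^2 is undefined when either locus is monomorphic")
    return d * d / denominator
```

When only the two haplotypes (0, 0) and (1, 1) are present, as observed here, the statistic equals 1; any recombinant haplotype such as (1, 0) would pull it below 1.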
To assess the association between genotype and phenotype, linear mixed-effects models were constructed. As there were no QQ individuals in populations 1 and 3, only populations 2, 4, and 5 were used. Due to unavailability of DMI and carcass phenotypes for populations 4 and 5, only population 2 was used for the DMI, HCW, REA, BF, KPH, and MB phenotypes. The model constructed for each phenotype is presented in Table 5. BW, W2, W3, HH, ADG, and HCW all passed the significance threshold, with Qq and QQ cattle trending toward higher body weight, increased stature, and greater average daily gain than qq animals. This is consistent with the known effect of this locus on phenotype and demonstrates that these variants can serve as effective markers for the haplotype.

4. Discussion

The results of this investigation support the findings of numerous prior studies demonstrating the link between the NCAPG-LCORL locus and increased lean growth, and several variants have been identified that could underlie the changes in phenotype. In agreement with the results presented by Bouwman et al. [1] and others [9,13,25,51], there is a large region of LD in sequenced QQ animals, which unfortunately was not narrowed further compared to previous studies and continues to confound attempts to find a causative variant. Nevertheless, the list has been refined further by identifying variants exclusive to the haplotype and several potential molecular explanations for how this locus exerts its influence.
The most straightforward and tempting answers lie within the coding sequence mutations. The NCAPGc.1326T>G substitution, rs109570900, has been previously identified as a putative quantitative trait nucleotide (QTN) because the resulting missense substitution is predicted to be functionally damaging. Indeed, the NCAPGc.1326G allele was present in the 34 cattle sequenced here and was exclusive to the Q haplotype. Many studies investigating this region have found this SNP to be significantly associated with various phenotypes [2,9,31], and without doubt, it is in LD with this haplotype. However, other studies have questioned whether this mutation is causative. At least two studies have documented animals that would be heterozygous for the growth haplotype at this locus (Qq) being homozygous for the G allele, or animals that should not have the haplotype (qq) carrying a copy of this allele [29,32]. While none of the qq animals used in this study carried the G allele, it seems likely that this variant may exist outside of this haplotype in the broader population and may simply have been present on the chromosome on which the causative mutation first arose.
While evidence from previous studies suggests that rs109570900 is most likely not the causative variant, NCAPG itself cannot be fully ruled out, as indeed, expression of NCAPG seems to be important for muscle development. A recent study has found myogenic differentiation to be impaired in fetal myoblasts where NCAPG has been knocked down [52]. None of the 217 variants exclusive to Q were located within ATAC peaks in or directly upstream of NCAPG. However, the existence of an insertion within an intron of NCAPG, 84 bp downstream from an exon–intron junction, could potentially impact the transcription, splicing, or function of this gene, although more direct evidence will be necessary to confirm this impact.
Alternatively, the frameshift mutation of the long isoform of LCORL is quite compelling. The fact that other studies have documented a loss of function for this long isoform to be linked to increased stature in dogs and goats implies a similar mechanism could be at work here [18,26]. The frameshift variant in cattle, rs384548488, has been noted within the last year by Sanchez et al., Gualdrón Duarte et al., and Cai et al. to be associated with several beef production traits in mostly Charolais, Brown Swiss, and original Braunvieh cattle, height and length in Belgian Blues, and reduced young stock survival in Nordic Red cattle, respectively [53,54,55].
While the association with growth and carcass traits is unsurprising, given the body of literature surrounding this locus at this point, the influence on young stock survival observed by Cai et al. [54] is perhaps less expected. It may be easy to rationalize this effect as due to dystocia caused by the increase in birth weight associated with this genotype. Indeed, when considering stillbirths, almost the exact same region found in the current study’s results (chr6:37,236,226–38,027,078) was the most highly associated with the trait. The immediately adjacent proximal region (chr6:36,679,547–37,179,665) was the most significant region for calf survival within their first year, but the previously mentioned haplotype region (chr6: 37,236,226–38,027,078) still exceeded the significance threshold as well. It is curious that a mutation mostly known for its effect on growth is linked to early death, even after the first month of life. This locus was also linked to a decrease in longevity in the same study that identified the frameshift mutation in dogs [18]. That connection could simply be accounted for by correlation over causation with large breeds within a species tending towards a shorter lifespan, but it may also be that epigenetic changes spurred by the loss of function of PALI2, while also resulting in increased body frame, can impact longevity and survival. In their research on the epigenetic clock in dogs and humans, Horvath et al. [56] found regions that gained methylation with age were enriched for PRC2 targets and genes involved in development. Under the hypothesis that the long isoform of LCORL modulates PRC2 activity, there is an inviting connection to be made. However, much more evidence, particularly regarding epigenetic changes associated with this frameshift mutation, would be needed to draw this conclusion.
In a similar vein to the young stock survival locus being adjacent to, but not directly within the LCORL-NCAPG haplotype, Sanchez et al. [53] acknowledge that the frameshift mutation is not in very high LD with the lead SNP in Belgian Blues. Given the extensive LD in this region and the nature of GWAS for quantitative traits, caution should be exercised when looking at a lead SNP, as it may exist in a small number of higher-performing Qq/qq animals in addition to being attached to the Q haplotype. The fact that this region is in such LD is strong evidence for a selective sweep, and that the causative mutation is likely exclusive to the haplotype. Confirmation of further meiotic events to narrow down the haplotype further would be ideal. It is not lost on the author that over a decade ago, Setoguchi et al. suggested a considerably smaller, 591 kb haplotype region in Japanese Black cattle [9]. Translated locations of their markers to ARS-UCD2.0 would place their range at chr6:37,278,524–37,869,348, pruning roughly 144 kb from the region furthest upstream of LCORL. This recombination can be traced to sire C in their study. While the methods and reference genome then were different from today, it would be interesting to see how the haplotype presents in Japanese Black cattle, or if other recombination events can be found.
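The interval sizes quoted above follow directly from the coordinates. The short sketch below reproduces the arithmetic using only positions given in the text; the variable and function names are illustrative.

```python
# Coordinates on chr6 (ARS-UCD 2.0), as quoted in the text.
Q_HAPLOTYPE = (37_199_897, 38_014_080)      # region defined in this study
SETOGUCHI_RANGE = (37_278_524, 37_869_348)  # translated range from Setoguchi et al.


def span_bp(interval):
    """Length of an interval in base pairs (end minus start)."""
    start, end = interval
    return end - start


def trimmed_bp(outer, inner):
    """Base pairs removed from each end of `outer` when narrowing it to `inner`."""
    return inner[0] - outer[0], outer[1] - inner[1]


q_span = span_bp(Q_HAPLOTYPE)              # 814,183 bp, i.e. about 814 kb
setoguchi_span = span_bp(SETOGUCHI_RANGE)  # 590,824 bp, i.e. about 591 kb
low_end, high_end = trimmed_bp(Q_HAPLOTYPE, SETOGUCHI_RANGE)
# high_end is 144,732 bp, the portion furthest upstream of LCORL;
# low_end is 78,627 bp at the lower-coordinate end of the region.
```

The higher-coordinate end corresponds to the intergenic space upstream of LCORL, which is where the roughly 144 kb difference arises; the arithmetic also shows a smaller difference of about 79 kb at the opposite end.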
Setting aside the coding sequence mutations, the variants located in ATAC peaks, while perhaps not as straightforward an answer, are worthy of consideration as well. Changes in LCORL expression have previously been linked both to this haplotype [30] and to feed intake [57]. Thus, changes in the regulation of LCORL may also be contributing to this phenotype. Overall, changes to transcription factor binding trended toward loss of TF affinity in the Q haplotype; there are 37 TFs exclusive to qq, 22 exclusive to QQ, and 46 shared between the two haplotypes.
Though the distal region around chr6:37,900,000 would be excluded by the findings of Setoguchi et al. [9], this region is fixed in the Q haplotype in the population genotyped in the present study. As this Charolais population is the same as that used in a study by Martins Rodrigues [30] to demonstrate increased LCORL expression in QQ individuals, it may still be worth considering the effects of this region on transcription. The site around rs110458346 is of particular interest, as it showed some of the strongest changes in TF affinity, all of which resulted in a predicted loss of TF binding in QQ (Table S6). Additionally, this site is among the ATAC peaks with a stronger signal and is an embryo-exclusive peak, suggesting a potential role in early developmental regulation. The most impacted transcription factors at this site were the thyroid hormone receptors THRA and THRB. Thyroid hormone is important for normal embryonic development [58], and the presence of thyroid hormone receptor binding sites implies that the regulation of the genes at this locus may be thyroid-hormone-sensitive. However, the directionality of expression in response to T3 cannot be inferred from the sequence alone, as these receptors can promote or repress transcription in the presence of T3, depending on the other proteins involved in the complex at the locus [59]. The TF with the highest affinity for this site, SRF, is known for its important role in both development and skeletal muscle accretion [60,61], but its activity is similarly nuanced. SRF is best known for promoting the expression of its target genes in response to growth factor stimulation [62], but it can also act repressively by competing with other transcription factors for binding sites [63].
In the case of the site around rs109114124, the most impactful changes to TF binding are the loss of affinity for the early growth response (EGR) transcription factors and the gain of FOXA1 binding. EGR1 and EGR2 are crucial regulators of cellular proliferation and apoptosis [64]. EGR1 has been demonstrated to activate both pro-apoptotic and pro-survival pathways, again depending on the overall state of the cell [65]. FOXA1 can act as a ‘pioneer factor’, binding to condensed regions of chromatin and promoting the opening and transcription of previously inaccessible regions of the chromosome, although this depends on the epigenetic marks of the histones as well as the sequence [66]. Thus, this gain of FOXA1 affinity suggests that QQ individuals may be able to promote LCORL transcription when the region would otherwise be inaccessible for expression. Transcription factor activity ultimately relies on the coordinated activity of likely many cofactors interacting with each transcription factor at these binding sites, making it difficult to predict the direct impact of any individual variant from sequence alone. However, these predictions do indicate that the variants likely alter the TF binding environment in some way, and they warrant further investigation into their impact on transcription in the region and on phenotype as a whole.
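To make the idea of a predicted gain or loss of TF affinity concrete, the sketch below scores two alleles against a position weight matrix using log-odds, which is the general kind of score that motif scanners such as FIMO [47] report. Everything in it is hypothetical: the four-position motif, the uniform background, and the two short sequences are invented for illustration and do not correspond to any TF or variant in Table S6.

```python
import math

# Hypothetical 4-position motif: per-position base probabilities (not a real TF motif).
TOY_PWM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # assumed uniform background base frequency


def log_odds_score(sequence, pwm=TOY_PWM, background=BACKGROUND):
    """Sum of per-position log2(motif probability / background probability)."""
    if len(sequence) != len(pwm):
        raise ValueError("sequence length must match motif width")
    return sum(math.log2(column[base] / background)
               for column, base in zip(pwm, sequence))


# Hypothetical reference and alternate alleles differing at the third position.
ref_score = log_odds_score("ACGT")
alt_score = log_odds_score("ACTT")
delta = alt_score - ref_score

print(f"ref score: {ref_score:.2f}, alt score: {alt_score:.2f}, change: {delta:.2f}")
```

A negative change corresponds to a predicted loss of affinity for the alternate allele and a positive change to a predicted gain. As the paragraph above notes, such a score describes sequence preference only and says nothing about cofactors or chromatin state.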

5. Conclusions

The LCORL-NCAPG locus is a critical region for growth in beef cattle, as evidenced by the findings presented here and the overwhelming body of evidence in the literature. This study has identified 218 variants exclusive to the haplotype associated with increased growth, including a structural variant in NCAPG, a frameshift variant likely causing a loss of function of the long isoform of LCORL, and several variants predicted to affect transcription factor binding and thus the potential regulation of genes in this locus. Genotyping for some of these variants showed statistically significant associations with birth weight, carcass weight, and average daily gain, making them useful markers for selection and prediction of performance. Though the true causative variant has yet to be determined due to the extensive LD in this region, these findings clarify the variants that define the haplotype and can hopefully contribute to future studies investigating how this locus mediates its effects.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15050576/s1. Figure S1: A 157-bp deletion within an intron of NCAPG; Figure S2: An insertion within the fifth intron of NCAPG; Table S1: Primers, enzymes, and expected fragment sizes for the PCR-RFLP genotyping assay; Table S2: Primers and probes for the 5′-3′ exonuclease genotyping assay; Table S3: Full genotypes for all 34 animals between 37M and 38.2M on BTA6; Table S4: Short list summarizing variants in coding sequence or ATAC peaks; Table S5: Full list summarizing all Q-exclusive variants; Table S6: Summary of FIMO score predictions.

Author Contributions

Conceptualization, J.E.B. and L.E.M.; methodology, L.E.M.; validation, J.E.B. and L.E.M.; formal analysis, L.E.M.; investigation, L.E.M.; resources, A.C.D., D.W.S. and J.C.M.; writing—original draft preparation, L.E.M.; writing—review and editing, J.E.B.; visualization, L.E.M.; supervision, J.E.B.; project administration, J.E.B.; funding acquisition, A.C.D., D.W.S., J.C.M. and J.E.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by USDA NIFA, grant numbers 2014-67015-21819 and 2020-67015-31342. The APC was funded by USDA NIFA grant number 2020-67015-31342.

Institutional Review Board Statement

The animal study protocol was approved by the University of Illinois Institutional Animal Care and Use Committee (IACUC). Approval Code: Protocol #19118, 07/15/2019.

Data Availability Statement

Raw FASTQ files from all animals will be available on NCBI SRA.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Bouwman, A.C.; Daetwyler, H.D.; Chamberlain, A.J.; Ponce, C.H.; Sargolzaei, M.; Schenkel, F.S.; Sahana, G.; Govignon-Gion, A.; Boitard, S.; Dolezal, M.; et al. Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals. Nat. Genet. 2018, 50, 362–367.
  2. Setoguchi, K.; Watanabe, T.; Weikard, R.; Albrecht, E.; Kühn, C.; Kinoshita, A.; Sugimoto, Y.; Takasuga, A. The SNP c.1326T>G in the non-SMC condensin I complex, subunit G (NCAPG) gene encoding a p.Ile442Met variant is associated with an increase in body frame size at puberty in cattle. Anim. Genet. 2011, 42, 650–655.
  3. Bolormaa, S.; Pryce, J.E.; Reverter, A.; Zhang, Y.; Barendse, W.; Kemper, K.; Tier, B.; Savin, K.; Hayes, B.J.; Goddard, M.E. A Multi-Trait, Meta-analysis for Detecting Pleiotropic Polymorphisms for Stature, Fatness and Reproduction in Beef Cattle. PLoS Genet. 2014, 10, e1004198.
  4. Doyle, J.L.; Berry, D.P.; Veerkamp, R.F.; Carthy, T.R.; Walsh, S.W.; Evans, R.D.; Purfield, D.C. Genomic Regions Associated with Skeletal Type Traits in Beef and Dairy Cattle Are Common to Regions Associated with Carcass Traits, Feed Intake and Calving Difficulty. Front. Genet. 2020, 11, 20.
  5. Gutiérrez-Gil, B.; Williams, J.L.; Homer, D.; Burton, D.; Haley, C.S.; Wiener, P. Search for quantitative trait loci affecting growth and carcass traits in a cross population of beef and dairy cattle. J. Anim. Sci. 2009, 87, 24–36.
  6. Snelling, W.M.; Allan, M.F.; Keele, J.W.; Kuehn, L.A.; McDaneld, T.; Smith, T.P.L.; Sonstegard, T.S.; Thallman, R.M.; Bennett, G.L. Genome-wide association study of growth in crossbred beef cattle. J. Anim. Sci. 2010, 88, 837–848.
  7. Smith, J.L.; Wilson, M.L.; Nilson, S.M.; Rowan, T.N.; Oldeschulte, D.L.; Schnabel, R.D.; Decker, J.E.; Seabury, C.M. Genome-wide association and genotype by environment interactions for growth traits in U.S. Gelbvieh cattle. BMC Genom. 2019, 20, 926.
  8. Smith, J.L.; Wilson, M.L.; Nilson, S.M.; Rowan, T.N.; Schnabel, R.D.; Decker, J.E.; Seabury, C.M. Genome-wide association and genotype by environment interactions for growth traits in U.S. Red Angus cattle. BMC Genom. 2022, 23, 517.
  9. Setoguchi, K.; Furuta, M.; Hirano, T.; Nagao, T.; Watanabe, T.; Sugimoto, Y.; Takasuga, A. Cross-breed comparisons identified a critical 591-kb region for bovine carcass weight QTL (CW-2) on chromosome 6 and the Ile-442-Met substitution in NCAPG as a positional candidate. BMC Genet. 2009, 10, 43.
  10. Nishimura, S.; Watanabe, T.; Mizoshita, K.; Tatsuda, K.; Fujita, T.; Watanabe, N.; Sugimoto, Y.; Takasuga, A. Genome-wide association study identified three major QTL for carcass weight including the PLAG1-CHCHD7 QTN for stature in Japanese Black cattle. BMC Genet. 2012, 13, 40.
  11. Purfield, D.C.; Evans, R.D.; Berry, D.P. Reaffirmation of known major genes and the identification of novel candidate genes associated with carcass-related metrics based on whole genome sequence within a large multi-breed cattle population. BMC Genom. 2019, 20, 720.
  12. Keogh, K.; Carthy, T.R.; McClure, M.C.; Waters, S.M.; Kenny, D.A. Genome-wide association study of economically important traits in Charolais and Limousin beef cows. Animal 2021, 15, 100011.
  13. Lindholm-Perry, A.K.; Sexten, A.K.; Kuehn, L.A.; Smith, T.P.L.; King, D.A.; Shackelford, S.D.; Wheeler, T.L.; Ferrell, C.L.; Jenkins, T.G.; Snelling, W.M.; et al. Association, effects and validation of polymorphisms within the NCAPG–LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle. BMC Genet. 2011, 12, 103.
  14. Horikoshi, M.; Yaghootkar, H.; Mook-Kanamori, D.O.; Sovio, U.; Taal, H.R.; Hennig, B.J.; Bradfield, J.P.; St Pourcain, B.; Evans, D.M.; Charoen, P.; et al. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nat. Genet. 2013, 45, 76–82.
  15. Wood, A.R.; Esko, T.; Yang, J.; Vedantam, S.; Pers, T.H.; Gustafsson, S.; Chu, A.Y.; Estrada, K.; Luan, J.a.; Kutalik, Z.; et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 2014, 46, 1173–1186.
  16. Helgeland, Ø.; Vaudel, M.; Juliusson, P.B.; Lingaas Holmen, O.; Juodakis, J.; Bacelis, J.; Jacobsson, B.; Lindekleiv, H.; Hveem, K.; Lie, R.T.; et al. Genome-wide association study reveals dynamic role of genetic variation in infant and early childhood growth. Nat. Commun. 2019, 10, 4448.
  17. Vaysse, A.; Ratnakumar, A.; Derrien, T.; Axelsson, E.; Rosengren Pielberg, G.; Sigurdsson, S.; Fall, T.; Seppälä, E.H.; Hansen, M.S.T.; Lawley, C.T.; et al. Identification of Genomic Regions Associated with Phenotypic Variation between Dog Breeds Using Selection Mapping. PLoS Genet. 2011, 7, e1002316.
  18. Plassais, J.; Kim, J.; Davis, B.W.; Karyadi, D.M.; Hogan, A.N.; Harris, A.C.; Decker, B.; Parker, H.G.; Ostrander, E.A. Whole genome sequencing of canids reveals genomic regions under selection and variants influencing morphology. Nat. Commun. 2019, 10, 1489.
  19. Signer-Hasler, H.; Flury, C.; Haase, B.; Burger, D.; Simianer, H.; Leeb, T.; Rieder, S. A Genome-Wide Association Study Reveals Loci Influencing Height and Other Conformation Traits in Horses. PLoS ONE 2012, 7, e37282.
  20. Tetens, J.; Widmann, P.; Kühn, C.; Thaller, G. A genome-wide association study indicates LCORL/NCAPG as a candidate locus for withers height in German Warmblood horses. Anim. Genet. 2013, 44, 467–471.
  21. Staiger, E.A.; Abri, M.A.A.; Pflug, K.M.; Kalla, S.E.; Ainsworth, D.M.; Miller, D.; Raudsepp, T.; Sutter, N.B.; Brooks, S.A. Skeletal variation in Tennessee Walking Horses maps to the LCORL/NCAPG gene region. Physiol. Genom. 2016, 48, 325–335.
  22. Al-Mamun, H.A.; Kwan, P.; Clark, S.A.; Ferdosi, M.H.; Tellam, R.; Gondro, C. Genome-wide association study of body weight in Australian Merino sheep reveals an orthologous region on OAR6 to human and bovine genomic regions affecting height and weight. Genet. Sel. Evol. 2015, 47, 66.
  23. Posbergh, C.J.; Huson, H.J. All sheeps and sizes: A genetic investigation of mature body size across sheep breeds reveals a polygenic nature. Anim. Genet. 2021, 52, 99–107.
  24. Bongiorni, S.; Mancini, G.; Chillemi, G.; Pariset, L.; Valentini, A. Identification of a Short Region on Chromosome 6 Affecting Direct Calving Ease in Piedmontese Cattle Breed. PLoS ONE 2012, 7, e50137.
  25. Zhao, G.; Liu, Y.; Niu, Q.; Zheng, X.; Zhang, T.; Wang, Z.; Xu, L.; Zhu, B.; Gao, X.; Zhang, L.; et al. Runs of homozygosity analysis reveals consensus homozygous regions affecting production traits in Chinese Simmental beef cattle. BMC Genom. 2021, 22, 678.
  26. Saif, R.; Henkel, J.; Jagannathan, V.; Drögemüller, C.; Flury, C.; Leeb, T. The LCORL Locus Is under Selection in Large-Sized Pakistani Goat Breeds. Genes 2020, 11, 168.
  27. Carneiro, M.; Hu, D.; Archer, J.; Feng, C.; Afonso, S.; Chen, C.; Blanco-Aguiar, J.A.; Garreau, H.; Boucher, S.; Ferreira, P.G.; et al. Dwarfism and Altered Craniofacial Development in Rabbits Is Caused by a 12.1 kb Deletion at the HMGA2 Locus. Genetics 2017, 205, 955–965.
  28. Rubin, C.-J.; Megens, H.-J.; Barrio, A.M.; Maqbool, K.; Sayyab, S.; Schwochow, D.; Wang, C.; Carlborg, Ö.; Jern, P.; Jørgensen, C.B.; et al. Strong signatures of selection in the domestic pig genome. Proc. Natl. Acad. Sci. USA 2012, 109, 19529–19536.
  29. Markey, A. Mapping of Monogenic and Quantitative Trait Loci Using a Whole Genome Scan Approach and Single Nucleotide Polymorphism Platforms. Ph.D. Dissertation, University of Illinois at Urbana-Champaign, Champaign, IL, USA, 2013.
  30. Martins Rodrigues, F. Transcriptional Variation in Muscle from Cattle with Alternative NCAPG/LCORL QTL Genotypes. Master’s Thesis, University of Illinois, Champaign, IL, USA, 2017.
  31. Eberlein, A.; Takasuga, A.; Setoguchi, K.; Pfuhl, R.; Flisikowski, K.; Fries, R.; Klopp, N.; Fürbass, R.; Weikard, R.; Kühn, C. Dissection of Genetic Factors Modulating Fetal Growth in Cattle Indicates a Substantial Role of the Non-SMC Condensin I Complex, Subunit G (NCAPG) Gene. Genetics 2009, 183, 951–964.
  32. Gutiérrez-Gil, B.; Wiener, P.; Williams, J.L.; Haley, C.S. Investigation of the genetic architecture of a bone carcass weight QTL on BTA6. Anim. Genet. 2012, 43, 654–661.
  33. Conway, E.; Jerman, E.; Healy, E.; Ito, S.; Holoch, D.; Oliviero, G.; Deevy, O.; Glancy, E.; Fitzpatrick, D.J.; Mucha, M.; et al. A Family of Vertebrate-Specific Polycombs Encoded by the LCOR/LCORL Genes Balance PRC2 Subtype Activities. Mol. Cell 2018, 70, 408–421.e408.
  34. Wiles, E.T.; Selker, E.U. H3K27 methylation: A promiscuous repressive chromatin mark. Curr. Opin. Genet. Dev. 2017, 43, 31–37.
  35. Margueron, R.; Reinberg, D. The Polycomb complex PRC2 and its mark in life. Nature 2011, 469, 343–349.
  36. Miller, S.A.; Dykes, D.D.; Polesky, H.F. A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 1988, 16, 1215.
  37. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. BMC Bioinform. 2010, 14, 1–4.
  38. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120.
  39. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997.
  40. Van der Auwera, G.A.; O’Connor, B.D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra, 1st ed.; O’Reilly Media: Sebastopol, CA, USA, 2020.
  41. Thorvaldsdóttir, H.; Robinson, J.T.; Mesirov, J.P. Integrative Genomics Viewer (IGV): High-performance genomics data visualization and exploration. Brief. Bioinform. 2012, 14, 178–192.
  42. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.S.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122.
  43. Zeng, T.; Li, Y.I. Predicting RNA splicing from DNA sequence using Pangolin. Genome Biol. 2022, 23, 103.
  44. Raney, B.J.; Dreszer, T.R.; Barber, G.P.; Clawson, H.; Fujita, P.A.; Wang, T.; Nguyen, N.; Paten, B.; Zweig, A.S.; Karolchik, D.; et al. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics 2013, 30, 1003–1005.
  45. Yuan, C.; Tang, L.; Lopdell, T.; Petrov, V.A.; Oget-Ebrad, C.; Moreira, G.C.M.; Gualdrón Duarte, J.L.; Sartelet, A.; Cheng, Z.; Salavati, M.; et al. An organism-wide ATAC-seq peak catalog for the bovine and its use to identify regulatory variants. Genome Res. 2023, 33, 1848–1864.
  46. Shen, W.K.; Chen, S.Y.; Gan, Z.Q.; Zhang, Y.Z.; Yue, T.; Chen, M.M.; Xue, Y.; Hu, H.; Guo, A.Y. AnimalTFDB 4.0: A comprehensive animal transcription factor database updated with variation and expression annotations. Nucleic Acids Res. 2023, 51, D39–D45.
  47. Grant, C.E.; Bailey, T.L.; Noble, W.S. FIMO: Scanning for occurrences of a given motif. Bioinformatics 2011, 27, 1017–1018.
  48. Weirauch, M.T.; Yang, A.; Albu, M.; Cote, A.G.; Montenegro-Montero, A.; Drewe, P.; Najafabadi, H.S.; Lambert, S.A.; Mann, I.; Cook, K.; et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell 2014, 158, 1431–1443.
  49. Bates, D.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Softw. 2015, 67, 1–48.
  50. Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995, 57, 289–300.
  51. Niu, Q.; Zhang, T.; Xu, L.; Wang, T.; Wang, Z.; Zhu, B.; Gao, X.; Chen, Y.; Zhang, L.; Gao, H.; et al. Identification of Candidate Variants Associated with Bone Weight Using Whole Genome Sequence in Beef Cattle. Front. Genet. 2021, 12, 750746.
  52. Hu, X.; Xing, Y.; Fu, X.; Yang, Q.; Ren, L.; Wang, Y.; Li, Q.; Li, J.; Zhang, L. NCAPG Dynamically Coordinates the Myogenesis of Fetal Bovine Tissue by Adjusting Chromatin Accessibility. Int. J. Mol. Sci. 2020, 21, 1248.
  53. Sanchez, M.-P.; Tribout, T.; Kadri, N.K.; Chitneedi, P.K.; Maak, S.; Hozé, C.; Boussaha, M.; Croiseau, P.; Philippe, R.; Spengeler, M.; et al. Sequence-based GWAS meta-analyses for beef production traits. Genet. Sel. Evol. 2023, 55, 70.
  54. Cai, Z.; Wu, X.; Thomsen, B.; Lund, M.S.; Sahana, G. Genome-wide association study identifies functional genomic variants associated with young stock survival in Nordic Red Dairy Cattle. J. Dairy Sci. 2023, 106, 7832–7845.
  55. Gualdrón Duarte, J.L.; Yuan, C.; Gori, A.-S.; Moreira, G.C.M.; Takeda, H.; Coppieters, W.; Charlier, C.; Georges, M.; Druet, T. Sequenced-based GWAS for linear classification traits in Belgian Blue beef cattle reveals new coding variants in genes regulating body size in mammals. Genet. Sel. Evol. 2023, 55, 83.
  56. Horvath, S.; Lu, A.T.; Haghani, A.; Zoller, J.A.; Li, C.Z.; Lim, A.R.; Brooke, R.T.; Raj, K.; Serres-Armero, A.; Dreger, D.L.; et al. DNA methylation clocks for dogs and humans. Proc. Natl. Acad. Sci. USA 2022, 119, e2120887119.
  57. Lindholm-Perry, A.K.; Kuehn, L.A.; Oliver, W.T.; Sexten, A.K.; Miles, J.R.; Rempel, L.A.; Cushman, R.A.; Freetly, H.C. Adipose and Muscle Tissue Gene Expression of Two Genes (NCAPG and LCORL) Located in a Chromosomal Region Associated with Cattle Feed Intake and Gain. PLoS ONE 2013, 8, e80882.
  58. Ashkar, F.A.; Semple, E.; Schmidt, C.H.; St. John, E.; Bartlewski, P.M.; King, W.A. Thyroid hormone supplementation improves bovine embryo development in vitro. Hum. Reprod. 2009, 25, 334–344.
  59. Wondisford, F.E. Chapter 77—Thyroid Hormone Action. In Endocrinology: Adult and Pediatric, 7th ed.; Jameson, J.L., De Groot, L.J., de Kretser, D.M., Giudice, L.C., Grossman, A.B., Melmed, S., Potts, J.T., Weir, G.C., Eds.; W.B. Saunders: Philadelphia, PA, USA, 2016; pp. 1336–1349.e1333.
  60. Vlahopoulos, S.; Zimmer, W.E.; Jenster, G.; Belaguli, N.S.; Balk, S.P.; Brinkmann, A.O.; Lanz, R.B.; Zoumpourlis, V.C.; Schwartz, R.J. Recruitment of the Androgen Receptor via Serum Response Factor Facilitates Expression of a Myogenic Gene. J. Biol. Chem. 2005, 280, 7786–7792.
  61. Schratt, G.; Philippar, U.; Hockemeyer, D.; Schwarz, H.; Alberti, S.; Nordheim, A. SRF regulates Bcl-2 expression and promotes cell survival during murine embryonic development. EMBO J. 2004, 23, 1834–1844.
  62. Miano, J.M. Role of serum response factor in the pathogenesis of disease. Lab. Investig. 2010, 90, 1274–1284.
  63. Lee, H.J.; Yun, C.H.; Lim, S.H.; Kim, B.C.; Baik, K.G.; Kim, J.M.; Kim, W.H.; Kim, S.J. SRF is a nuclear repressor of Smad3-mediated TGF-β signaling. Oncogene 2007, 26, 173–185.
  64. Kumbrink, J.; Kirsch, K.H.; Johnson, J.P. EGR1, EGR2, and EGR3 activate the expression of their coregulator NAB2 establishing a negative feedback loop in cells of neuroectodermal and epithelial origin. J. Cell. Biochem. 2010, 111, 207–217.
  65. Baron, V.; Adamson, E.D.; Calogero, A.; Ragona, G.; Mercola, D. The transcription factor Egr1 is a direct regulator of multiple tumor suppressors including TGFβ1, PTEN, p53, and fibronectin. Cancer Gene Ther. 2006, 13, 115–124.
  66. Lupien, M.; Eeckhoute, J.; Meyer, C.A.; Wang, Q.; Zhang, Y.; Li, W.; Carroll, J.S.; Liu, X.S.; Brown, M. FoxA1 Translates Epigenetic Signatures into Enhancer-Driven Lineage-Specific Transcription. Cell 2008, 132, 958–970.
Figure 1. A display of all variants observed between chr6:37,000,000–38,200,000, with each horizontal bar being one of the genotyped cattle, and each vertical line representing the genotype for each variant. Gray represents homozygous for matching the reference (0/0), light blue is heterozygous (0/1), and dark blue is homozygous for the mutant allele (1/1). White indicates that a genotype was not able to be confidently called for that individual at that variant. qq animals are on the bottom, and the QQ haplotype is displayed on the top. Each star marks one of the 22 variants in phase with the haplotype that were found in the coding sequence or an ATAC peak, and the inverse-colored star marks where the structural variant in phase with the haplotype roughly is, though the SV itself, as well as the five variants not in phase with the haplotype, are not displayed on this map.
Table 1. The 22 variants detected in the coding sequence or ATAC peaks.

| Variant | Location ¹ | q Allele | Q Allele | Nearby Gene | Type, Consequence |
|---|---|---|---|---|---|
| rs109438687 | 37,214,389 | T | C | FAM184B | ATAC peak, intron |
| rs109467519 | 37,214,736 | C | T | FAM184B | ATAC peak, intron |
| rs109145748 | 37,301,160 | G | C | FAM184B | ATAC peak, 5′ UTR |
| rs109570900 | 37,343,379 | T | G | NCAPG | Coding sequence, missense |
| rs210386983 | 37,379,506 | A | G | LCORL | ATAC peak, 3′ UTR |
| rs207496787 | 37,379,507 | A | T | LCORL | ATAC peak, 3′ UTR |
| rs379449143 | 37,381,106 | A | G | LCORL | ATAC peak, intron |
| rs384548488 | 37,401,770 | ACT | A | LCORL | Coding sequence, frameshift |
| rs109696064 | 37,403,795 | C | T | LCORL | Coding sequence, missense |
| rs517494305 | 37,452,882 | C | CA | LCORL | ATAC peak, intron |
| rs110293947 | 37,479,269 | G | C | LCORL | ATAC peak, intron |
| rs379787611 | 37,487,010 | T | C | LCORL | ATAC peak, intron |
| rs109114124 | 37,555,677 | C | A | LCORL | ATAC peak, intron |
| rs109092727 | 37,559,117 | A | G | LCORL | ATAC peak, upstream |
| rs110470694 | 37,608,504 | C | T | | ATAC peak, intergenic |
| rs109060347 | 37,627,776 | G | C | | ATAC peak, intergenic |
| rs207689046 | 37,669,453 | A | G | | ATAC peak, intergenic |
| rs109331793 | 37,681,968 | C | T | | ATAC peak, intergenic |
| rs110458346 | 37,934,068 | C | T | | ATAC peak, intergenic |
| rs110888204 | 37,946,012 | C | T | | ATAC peak, intergenic |
| rs110930653 | 37,962,887 | G | T | | ATAC peak, intergenic |
| rs110658468 | 37,997,160 | C | T | | ATAC peak, intergenic |

¹ All variants are located on chromosome 6, ARS-UCD2.0 (NC_037333.1).
Table 2. Consensus ATAC peaks with variants, their associated tissues, and signal scores.

| Variant | Consensus Peak | Tissue | Signal Score ¹ |
|---|---|---|---|
| rs109438687 | chr6_37214241_37214456_NMF12_0.33 | Liver & Testicle | 0.353 |
| rs109467519 | chr6_37214731_37214965_NMF10_0.92 | Muscle | 0.209 |
| rs109145748 | chr6_37300595_37301446_NMF10_0.14 | Ubiquitous | 0.909 |
| rs210386983 & rs207496787 | chr6_37379341_37379516_NMF13_1.00 | 8-cell Embryo | 0.260 |
| rs379449143 | chr6_37381082_37381259_NMF13_1.00 | 8-cell Embryo | 0.302 |
| rs517494305 | chr6_37452831_37452975_NMF16_0.67 | Adipose | 0.222 |
| rs110293947 | chr6_37479127_37479316_NMF7_0.58 | Cerebellum | 0.277 |
| rs379787611 | chr6_37487007_37487238_NMF13_1.00 | 8-cell Embryo | 0.337 |
| rs109114124 | chr6_37555562_37555843_NMF9_0.15 | Ubiquitous | 0.522 |
| rs109092727 | chr6_37558579_37559199_NMF14_0.35 | Embryo | 0.866 |
| rs110470694 | chr6_37608344_37608531_NMF5_1.00 | Colon & Embryo | 0.422 |
| rs109060347 | chr6_37627598_37627866_NMF5_0.33 | Colon, Rumen, Epithelial, & Embryo | 0.947 |
| rs207689046 | chr6_37669337_37669559_NMF5_1.00 | Colon | 0.715 |
| rs109331793 | chr6_37681884_37682015_NMF13_1.00 | 8-cell Embryo | 0.294 |
| rs110458346 | chr6_37933851_37934401_NMF13_1.00 | 8-cell Embryo | 0.758 |
| rs110888204 | chr6_37945955_37946153_NMF10.88 | Cerebrum | 0.324 |
| rs110930653 | chr6_37962696_37962988_NMF16_0.49 | Epididymis | 0.198 |
| rs110658468 | chr6_37996798_37997347_NMF5_0.69 | Colon | 0.401 |

¹ Signal score was determined by the highest signal in the entire consensus peak where the variant was found.
Table 3. Transcription factors predicted to bind to the ATAC regions containing variants.

| Variant | Shared TFs | qq TFs | QQ TFs |
|---|---|---|---|
| rs109438687 | ZNF621 | - | NR2C2, PAX6 |
| rs109467519 | - | - | - |
| rs109145748 | GCM1, MAZ, SP2, ZNF180, ZNF212, ZNF341, ZNF467, ZNF527, ZNF548, ZNF596, ZNF792 | PAX6, ZBTB14, ZFP64, ZNF264 | KLF11, SP1, ZBTB17, ZNF329 |
| rs210386983 & rs207496787 | - | - | - |
| rs379449143 | - | BATF3 | POU6F1 |
| rs517494305 | NFATC1, RELA, ZNF484 | NFATC3, REST | NFYA, NFYB, NFYC, ZNF280A, ZNF619 |
| rs110293947 | NFE2L2, NHLH1, NHLH2, OLIG2, TCF12, ZBTB18, ZNF273, ZNF331 | ASCL2, MYOG, ZNF549, ZNF69, ZSCAN31 | - |
| rs379787611 | IRF1, STAT2, ZIM3, ZNF225, ZNF487, ZNF502 | MEF2A, ZNF394 | IRF2, IRF3, IRF4, IRF5, IRF8, IRF9, ZNF573 |
| rs109114124 | RREB1, ZNF263, ZNF283, ZNF785, ZNF805 | EGR1, EGR2, MAZ, ZNF460, ZNF580 | FOXA1 |
| rs109092727 | - | - | - |
| rs110470694 | KLF15, ZNF383, ZNF432, ZNF880 | - | ZNF449 |
| rs109060347 | ZNF335 | NKX2-5 | SOX18, ZNF200, ZNF808 |
| rs207689046 | ESR1, NR1H3, NR2C2, YY1 | CREB3L1, CREB3L2, RORA | RXRG |
| rs109331793 | CUX1, CUX2 | ZNF667 | ZNF605 |
| rs110458346 | - | MEF2B, POU6F1, SRF, THRA, THRB, ZNF774, ZNF823 | - |
| rs110888204 | ZNF768 | PPARG, ZBTB12, ZNF543, ZNF621, ZNF768 | ZNF440 |
| rs110930653 | - | NR1I2 | - |
| rs110658468 | - | - | - |
Table 4. Genotype frequencies by population.

| Population | Breed Composition | n | qq | Qq | QQ |
|---|---|---|---|---|---|
| 1 | Simmental-Angus & Angus | 30 | 18 | 12 | 0 |
| 2 | Shorthorn | 87 | 49 | 31 | 7 |
| 3 | Angus | 83 | 63 | 20 | 0 |
| 4 | Simmental & Simmental-Angus | 78 | 14 | 47 | 17 |
| 5 | Simmental-Angus | 127 | 62 | 59 | 6 |
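The genotype counts in Table 4 can be converted into Q allele frequencies with a few lines of arithmetic. The sketch below is illustrative only: the counts are copied from the table, while the function and variable names are assumptions introduced here and the study does not report these frequencies in this section.

```python
# Genotype counts (qq, Qq, QQ) per population, copied from Table 4.
GENOTYPE_COUNTS = {
    1: (18, 12, 0),
    2: (49, 31, 7),
    3: (63, 20, 0),
    4: (14, 47, 17),
    5: (62, 59, 6),
}


def q_allele_frequency(n_qq, n_Qq, n_QQ):
    """Frequency of the Q allele: each QQ animal carries two copies, each Qq animal one."""
    n_animals = n_qq + n_Qq + n_QQ
    return (2 * n_QQ + n_Qq) / (2 * n_animals)


q_freqs = {pop: q_allele_frequency(*counts) for pop, counts in GENOTYPE_COUNTS.items()}
total_animals = sum(sum(counts) for counts in GENOTYPE_COUNTS.values())
populations_with_QQ = [pop for pop, counts in GENOTYPE_COUNTS.items() if counts[2] > 0]

for pop, freq in q_freqs.items():
    print(f"Population {pop}: Q allele frequency = {freq:.3f}")
print(f"Total animals genotyped: {total_animals}")
print(f"Populations containing QQ animals: {populations_with_QQ}")
```

The total recovers the 405 genotyped cattle mentioned in the abstract, and the populations containing QQ animals (2, 4, and 5) are the three used for the association analysis.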
Table 5. Parameter estimates and statistical significance of genotype for each phenotype.

| Phenotype | n | Intercept ¹ | β_Qq | β_QQ | β_Steer | Pop2 | Pop4 | Pop5 | ASE ² | p-Value ³ |
|---|---|---|---|---|---|---|---|---|---|---|
| Birth Weight (BW), kg | 253 | 31.7 ± 2.3 | 0.4 ± 0.7 | 4.2 ± 1.1 | 3.1 ± 0.6 | +3.5 | −3.9 | +0.5 | 1.6 ± 0.5 | 3.67 × 10⁻⁴ (*) |
| Adjusted Weaning Weight (WW), kg | 241 | 206.8 ± 17.4 | 5.3 ± 4.6 | 9.5 ± 7.2 | 13.9 ± 4.1 | +5.6 | −31.6 | +26.0 | 4.9 ± 3.3 | 0.327 |
| Weight 1 (W1), kg | 247 | 161.3 ± 54.5 | 7.0 ± 2.9 | 3.7 ± 4.7 | 12.5 ± 2.7 | +108.9 | −51.1 | −57.8 | 3.5 ± 2.2 | 0.0603 |
| Weight 2 (W2), kg | 242 | 476.4 ± 12.9 | 22.6 ± 5.9 | 35.6 ± 9.7 | 56.0 ± 5.5 | +20.7 | −19.3 | −1.4 | 19.4 ± 4.4 | 4.10 × 10⁻⁵ (*) |
| Weight 3 (W3), kg | 157 | 517.8 ± 21.4 | 31.0 ± 7.7 | 32.2 ± 13.0 | 55.3 ± 9.6 | +37.3 | −18.1 | −19.2 | 21.7 ± 5.7 | 1.40 × 10⁻⁴ (*) |
| Average Daily Gain (ADG), kg | 244 | 1.46 ± 0.12 | 0.10 ± 0.03 | 0.18 ± 0.04 | 0.37 ± 0.02 | −0.21 | +0.01 | +0.20 | 0.09 ± 0.02 | 2.27 × 10⁻⁵ (*) |
| Hip Height (HH), cm | 157 | 120.9 ± 1.3 | 1.8 ± 0.7 | 2.6 ± 1.2 | 3.4 ± 0.9 | +1.6 | −1.1 | −0.6 | 1.5 ± 0.5 | 0.0185 (*) |
| Dry Matter Intake (DMI), kg | 87 | 9.02 ± 0.15 | 0.31 ± 0.21 | 0.44 ± 0.36 | 0.47 ± 0.20 | - | - | - | 0.26 ± 0.15 | 0.221 |
| Hot Carcass Weight (HCW), kg | 83 | 350.2 ± 5.4 | 21.4 ± 7.4 | 16.7 ± 13.6 | 32.5 ± 7.0 | - | - | - | 14.2 ± 5.5 | 0.0152 (*) |
| Backfat (BF), cm | 83 | 1.59 ± 0.06 | 0.04 ± 0.09 | −0.30 ± 0.16 | −0.20 ± 0.08 | - | - | - | −0.07 ± 0.07 | 0.130 |
| Ribeye Area (REA), cm² | 83 | 82.4 ± 1.5 | 5.2 ± 2.0 | 2.8 ± 3.8 | 7.0 ± 1.9 | - | - | - | 3.1 ± 1.5 | 0.0423 |
| Kidney Pelvic Heart Fat (KPH), % | 83 | 2.11 ± 0.03 | −0.06 ± 0.04 | −0.15 ± 0.08 | −0.20 ± 0.04 | - | - | - | −0.07 ± 0.03 | 0.102 |
| Marbling (MB) | 83 | 538.5 ± 15.5 | 32.1 ± 21.2 | −12.5 ± 39.0 | −73.3 ± 20.2 | - | - | - | 10.9 ± 15.8 | 0.267 |

¹ Intercept for these models is qq heifer, with β from Qq, QQ, and/or sex effect for steers being added as applicable. Population is included as a random effect added to the intercept. For DMI, HCW, BF, REA, KPH, and MB, there is no farm effect, due to only population 2 being phenotyped for these traits.
² Allele substitution effect (ASE) was calculated using haplotype as an additive instead of a categorical variable. There was no change in which traits passed the significance threshold between using a categorical or additive model.
³ Calculated using haplotype as a categorical variable. Models where genotype exceeded the Benjamini–Hochberg adjusted significance threshold are denoted by (*).
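The Benjamini–Hochberg threshold referred to in the table footnote can be illustrated with a minimal sketch of the step-up procedure [50]. Two assumptions are made here that this section does not state: a false discovery rate of 0.05, and that all 13 genotype p-values in Table 5 were corrected together as one family. The short trait codes and the function name are labels chosen for this example.

```python
# Genotype p-values copied from Table 5, keyed by the trait abbreviations used there.
P_VALUES = {
    "BW": 3.67e-4, "WW": 0.327, "W1": 0.0603, "W2": 4.10e-5, "W3": 1.40e-4,
    "ADG": 2.27e-5, "HH": 0.0185, "DMI": 0.221, "HCW": 0.0152, "BF": 0.130,
    "REA": 0.0423, "KPH": 0.102, "MB": 0.267,
}


def benjamini_hochberg(p_values, alpha=0.05):
    """Return the names rejected by the Benjamini-Hochberg step-up procedure.

    Sort the p-values, find the largest rank k with p_(k) <= alpha * k / m,
    and reject every hypothesis ranked at or below k.
    """
    ranked = sorted(p_values.items(), key=lambda item: item[1])
    m = len(ranked)
    cutoff_rank = 0
    for rank, (_, p) in enumerate(ranked, start=1):
        if p <= alpha * rank / m:
            cutoff_rank = rank
    return {name for name, _ in ranked[:cutoff_rank]}


significant = benjamini_hochberg(P_VALUES)
print(sorted(significant))
```

Under these assumptions the sketch flags BW, W2, W3, ADG, HH, and HCW, the same six traits marked with (*) in Table 5, while REA (p = 0.0423) falls below the nominal 0.05 level but does not pass the adjusted threshold.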
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Majeres, L.E.; Dilger, A.C.; Shike, D.W.; McCann, J.C.; Beever, J.E. Defining a Haplotype Encompassing the LCORL-NCAPG Locus Associated with Increased Lean Growth in Beef Cattle. Genes 2024, 15, 576. https://doi.org/10.3390/genes15050576
