1. Introduction
The Sarcidano Horse is a very small population of semi-feral horses, autochthonous of Sardinia, the second largest island in the Mediterranean Sea. The origins of these animals are uncertain; it is assumed their descendance from ancient Iberian horses introduced in Sardinia during the Aragonese domination [
1]. Sarcidano Horse frequently has supernumerary bilateral upper premolars, which have disappeared in modern horse breeds, placing this breed on an archaic model (similarly to the Przewalski’s Horse). Previous research confirmed the marginal impact of crossbreeding on the Sarcidano Horse mitochondrial gene pools and the lack of recent gene flow from outside [
2], suggesting that this breed represents an important resource to be preserved, but also an important model to be studied.
Due to the ancient origin and to the geographic isolation this breed should have a conservation program focused on maintaining the typical phenotype features and avoiding crossbreeding. Unfortunately, there are no real conservation plans for this breed at the moment. Among the major criticalities, obviously the first aspect to consider in native breeds is the low number within the population, which imposes the need for adequate mating strategies, the application of the most modern genetic improvement techniques to limit inbreeding and, above all, technical and economic assistance to the few breeders who take care of these animals so that they can adhere to the Stud Book, mating plans and functional checks.
Since it is well known that the great variety of coat colors present in many current breeds derives from domestication and human selection [
3], a first step towards a deeper knowledge of this breed could be to study the genetic basis of the coat colors. According to the breed Standard, the Sarcidano Horse may be chestnut, black, bay, and, more rarely, grey. Although the coat color of an individual is visible, its real detection based exclusively on phenotype is often difficult. Two genes are primarily responsible for determination of base coat color in the horse: melanocortin-1-receptor (
MC1R), encoded by the Extension (
E) locus, and its peptide antagonist agouti-signaling-protein (
ASIP), encoded by the Agouti (
A) locus. These two loci are linked by a close epistasis relationship, where
MC1R is epistatic to
ASIP, so controlling the relative amounts of melanin pigments in mammals [
4]. More precisely, the
MC1R dominant allele E
E (hereafter simply referred as E) determines the production of black pigment (eumelanin), while the recessive E
e (=e) allele determines the production of a red-yellow pigment (pheomelanin) [
5,
6]. The “e” allele is produced by a C > T mutation [
6] within the
MC1R gene Exon 1 (chr:3: 36,979,560). Conversely, in the
ASIP locus, the dominant A
A (A) allele encodes production of the agouti signaling protein, which has the ability to block the melanocortin receptor 1 function in the melanocytes. When this receptor is blocked, there is no stimulus for the production of eumelanin, and only pheomelanin synthesis occurs [
7]. A recessive 11-bp deletion in
ASIP Exon 3, causing the “a” allele, produces a frameshift that causes a loss of function in the agouti signaling protein, resulting in the uniform production of eumelanin [
5]. Combination of specific genotypes at
MC1R and
ASIP loci result in the three basic phenotypes of horse coat colors: black, bay and chestnut [
8].
This study aimed to contribute to increase the knowledge about the Sarcidano Horse, by identifying the genetic basis of the coat color, also establishing how the colors are genetically distributed within the existing population, and by updating the exact locations of the genotyping sites, based on the current EquCab3.0 genome assembly version.
3. Results
Captured horses resulted in 63 females and 37 males, aged from 15 days to 17 years old. Age determination, by teeth observation, exhibited the following distribution: 51 were young (aged 0–4 years); 21 were adults (5–10 years) and 20 were over 10 years old (even up to 17 years). The 70 individuals here genotyped were 27 males and 43 females, distributed in 37 young, 16 adults and 17 elderly, following the same age classification explained above.
All the 70 samples processed have been successfully amplified and genotyped both at the
MC1R and
ASIP locus. Individual information about
ASIP and
MC1R genes genotype, age, sex and phenotypic/genetic coat colors in the studied population are available in
supplementary file Table S1.
The sequencing result did not produce a useful reading for the ASIP gene fragment, due to its shortness, so that all the genotypes identified here were obtained as follows.
PCR products containing the mutation positions in
MC1R (317-bp)—later digested by
TaqI endonuclease—and in
ASIP (102-bp) genes, allowed identification of the genotypes set (E/E, E/e, e/e and A/A, A/a, aa/) for
MC1R and
ASIP locus, respectively (
Figure 1). A summary of the
MC1R and
ASIP genes genotypes, and allele and genotype frequencies for the horse coat colors are shown in
Table 1,
Table 2 and
Table 3.
The comparison between previous polymorphic sites positions and the latest EquCab3.0 horse genome assembly version resulted an updated set of information about these loci, described below in detail.
3.1. MC1R Locus Genotypes
The MC1R gene is on chromosome 3 in horses, and it extends from the 36,979,313 to 36,980,266 position of the latest horse genome version EquCab3.0 (GCF_002863925.1). The entire MC1R gene consists of only one exon of 954 base pair (bp) in length. A single fragment of 317-bp in length was obtained by PCR analysis of the MC1R locus, corresponding to part of the unique Exon 1.
After restriction fragment length polymorphism (RFLP) analysis, carried out through
TaqI restriction enzyme digestion, one polymorphic site was found at position 180 of the amplified fragment, corresponding to chr3: 36,979,560 of the EquCab3.0 genome assembly, involving a C with a T substitution. This polymorphic site corresponded to the
MC1R C901T mutation recognized in previous studies [
6,
14]. This SNP produces three different genotypes, here named E/E, E/e and e/e, and recognized, after electrophoretic run, on the basis of the fragments number and size. More precisely, a unique 317-bp fragment corresponded to C/C (or E/E) dominant homozygous genotype; a 180 + 137-bp electrophoretic pattern matched with T/T (e/e) recessive homozygous genotype, and finally a 317 + 180 + 137 electrophoretic pattern corresponded to C/T (E/e) heterozygous genotype (
Figure 1). The
MC1R genotype distribution exhibited 47 horses carrying recessive homozygous e/e (T/T) genotype, corresponding to 67%, 20 horses carrying heterozygous E/e (C/T) genotype, corresponding to 29%, and only three horses carrying dominant E/E (C/C) genotype, corresponding to 4% of the analyzed population (
Table 2). Allele frequency was 0.19 for the “E” allele and 0.81 for the “e” allele (
Table 2).
3.2. ASIP Locus Genotypes
The ASIP gene is on chromosome 22 in the horse and ranges from the nucleotide 26,009,341 to 26,072,655 of the EquCab3.0 horse genome assembly, spanning 63,315 nucleotides. It consists of four exons in the principal isoform and of five putative exons in the X1 isoform. Before the EquCab3.0 sequence was available, the mutation responsible for the phenotypic coat color change was believed to occur in Exon 2 [
15], while it is now known that this mutation falls in Exon 3.
The Exon 1 counts 499-bp in length, ranging from 26,009,341 to 26,009,839 position; it is the largest of the 4 exons composing the entire ASIP gene. The Exon 2 covers 170-bp, from 26,065,977 to 26,066,136 position; Exons 1 and 2 are separated by an intron of approximately 56.1-kbp. The Exon 3 is the shortest, ranging from 26,067,437 to 26,067,501, but within its 65-bp sequence falls the causative mutation. It is separated from the Exon 2 by the Intron 2, which covers 1.3-kbp, and from the Exon 4 by the Intron 3, spanning approximately 2.3-kbp. Finally, the Exon 4 covers 177-bp, from position 26,069,795 to 26,069,971.
Here we amplified a 102-bp fragment, corresponding to the entire Exon 3—consisting of only 65 nucleotides—and small parts of the adjacent Introns 1 and 2. This entire 102-bp fragment identified the dominant, wild type “A” allele, that inhibit the eumelanin production, while an 11-bp deletion mutation in the above fragment leads to the mutant, recessive “a” allele (consisting in the resulting 91-bp fragment after amplification) (
Figure 1). This 11-bp deletion mutation consists in the elimination of the nucleotide sequence CAGAAAAGAAG from chr22: 26,067,476 to 26,067,486 position (rs396813234) of the EquCab3.0 genome assembly (GCF_002863925.1).
The
ASIP genotype distribution was similar to those observed in
MC1R locus. Indeed, 44 horses carried the recessive a/a homozygous genotype, corresponding to 63%, 24 horses carried heterozygous A/a genotype, corresponding to 34%, and only two horses carried dominant homozygous A/A genotype, corresponding to 3% of the studied population. Consequently, allele frequency was 0.20 for the “A” allele and 0.80 for the “a” allele (
Table 3). In both the loci, mutant recessive allele resulted more frequently than the wild, dominant type.
3.3. Colour Coat Assessment
The most representative color coat in Sarcidano Horse was the chestnut, both from a genetic and phenotypic point of view: a number of 47 horses of the analyzed 70 were Chestnut, corresponding to 67% of the studied population. The 27% were Black (19 horses) and only 6% (4 horses) were Bay. Some differences were observed in the color shade of the chestnut and bay coat, from light to dark, depending on the allele combination. The most frequent allele combinations resulted in e/e-a/a that correspond to cherry-liver chestnut phenotype, which is the most visible coat color in the Sarcidano population. Another lighter chestnut phenotype was present, although in only two subjects, obtained by the e/e-A/A allele combination. Bay coat color resulted both phenotypically and genetically poorly represented in the Sarcidano Horse, with 4 subjects carrying only E/e-A/a allele combination, and none carrying one of the other possible three genotypes for bay color (E/e-A/A; E/E-A/A and E/E-A/a). The χ
2 test results showed that the phenotypes of horse coat colors were significantly related with the genotypes and alleles of
MC1R and
ASIP (
p < 0.001,
Table 1 and
Table 2).
Finally, Black color coat resulted moderately represented in the studied population with 16 subjects carrying E/e-a/a genotype and 3 carrying E/E-a/a genotype. In total 8 horses (2 males and 6 mares) were phenotypically Grey, but their genetic color resulted in all cases Chestnut, i.e., 4 carrying A/a-e/e and 4 carrying a/a-e/e genotype.
4. Discussion
In genetics, the knowledge of the link between genotype and phenotype is crucial to understand how phenotypic variation influences strength and timing of selection through evolutionary change within a population.
Coat, hair and skin colors depend on the pigment produced in the melanocytes. In horses, the base coat color is mainly influenced by two candidate genes: melanocortin -1 receptor (
MC1R) and agouti signaling protein (
ASIP), that work together to control the coat color phenotypic trait [
7,
16,
17].
The
MC1R gene produces a protein called melanocortin 1 receptor (MC1R), which plays an important role in normal pigmentation, and it is primarily located on the surface of melanocytes (specialized cells that produce a pigment called melanin). Melanin is the substance that gives color to skin, hair, and eyes, and it is also found in the retina, where it plays a role in normal vision [
18].
When the MC1R is activated, by the melanocyte stimulating hormone (MSH) produced by the intermediate pituitary, stimulates melanocytes to make eumelanin; while if the receptor is not activated or if it is blocked, melanocytes produce pheomelanin instead of eumelanin [
19].
Common variations (polymorphisms) in the MC1R gene result in differences in skin and coat color, although also other genes contribute to the normal mammalian pigmentation. These MC1R polymorphisms prevent eumelanin production by melanocortin 1 receptor, inducing output of pheomelanin.
The melanocortin 1 receptor plays also an important role in the body’s immune and inflammatory responses. The receptor’s function in these cells is still poorly known.
MC1R gene is also known as
E locus (where E means
Extension), and two allele are recognized within this locus: the dominant “E” allele, determining the production of eumelanin; and the recessive “e” allele determining the production of pheomelanin [
20]. A SNP located at position chr:3: 36,979,560 of the EquCab3.0 genome assembly, within
MC1R gene nucleotide sequence causes a T to a C transition, that leads an amino acid substitution from Serine (TCC) to Phenylalanine (TTC) at position 83 of the protein chain (NCBI Reference Sequence: NP_001108006.1). This mutation is involved in a transmembrane domain of the transport protein and its substitution is likely to disrupt the secondary local structure. The normal allele, with C was named “E” and the recessive one, with C replaced by T, “e” [
21]. This recessive allele generated by the above mutation leads to the production of only pheomelanin within melanocytes [
6]. Thus, pheomelanin production depends on melanocytes that are homozygous for the recessive e/e genotype at
MC1R locus which therefore cannot be activated by the MSH [
6,
22]. When the melanocortin -1 receptor is activated by the MSH, it triggers a series of chemical reactions within the melanocytes, leading to production of eumelanin. In fact, melanocytes are capable of producing both eumelanin and pheomelanin, but when the animal carry the e/e genotype, the receptor is defective, making it unable to properly transmit the information passed by MSH, thus leading to production of pheomelanin, only [
6,
22]. The net result of the forms of this gene is therefore an animal with a black coat when there is at least one dominant allele (E/E or E/e), or a Chestnut horse (reddish color) when it is homozygous to the recessive allele (e/e). The recessive allele (e) is a mutation of the wild form (E), which occurred about 7000 years BP [
23].
The other gene involved in the skin and coat color in mammals is the
ASIP gene: the dominant “A” allele encodes production of a protein called agouti signaling protein, which has the ability to block the melanocortin -1 receptor in existing melanocytes in the body of the horse, but not in the extreme points (mane, tail, ear edges, and lower legs). Therefore, an eumelanic horse (E/-) with a dominant allele at the Agouti locus (A/-) yields a Bay horse—a horse with a reddish to brownish body color and black points. When the receptor is blocked, there is no stimulus for the production of eumelanin (as explained above), and only pheomelanin synthesis occurs [
6].
About 8000 years BP, a recessive mutation occurred in the ASIP locus, yielding black color throughout the entire body of the horse [
23]. For this to happen, however, two recessive a/a allele are required. Interestingly, the agouti signaling protein is effective only in eumelanin-producer horses (i.e., carrying E/E or E/e genotype), whereas, in chestnuts (whose genotype is e/e) melanocortin-1 receptor is defective, consequently the protein has no effect. Similar to in other domesticated species, also in horses, artificial selection started from domestication process, caused mutation in color associated genes, followed by their fixation and increase in frequency [
3]. In the history of domestic horses, crosses among breeds/lineages were common, resulting in a widespread distribution of coat-color-associated alleles, starting with their initial introduction in the gene pool of domestic horses. Therefore, artificial selection is the main factor responsible for the large phenotypic variation observed in domesticated animals today [
3]. Through time, humans have actively encouraged coat color changes and consequently the proliferation of new coat color alleles, often fascinated by the new combinations thus produced [
24]. The alleles for the basic colors (bay, black, chestnut) in the
MC1R gene [
14] and in the ASIP gene [
5] are at least 6300 years old and were already found in pre-domestic times [
25,
26,
27]. Consequently, the color alleles for bay, black, and chestnut occur in nearly all breeds, but their proportion can differ on the base of the breed specific history and geographical location; for example, in Misaki horses, a small local breed in southwestern Japan the most representative coat color is Black, with very small number of Chestnut horses [
28], in contrast with what was found here. The absence of coat color dilutions or spotting in Sarcidano Horse, allows to confirm the state of genetic isolation in which this population has evolved and has reached the present day.
One of the most interesting aspects of the present study is that the attribution of the coat color in horses, based on visual observation alone, is not always reliable. Additional difficulties on identification arise when considering several situations, some of which are explained below.
Different nomenclatures for the same coat color are not rare, depending on different countries, or regions within the same country. Moreover, different classifications in horse coat color derive from specific breed standard: although genetically the black color is possible in the horse as in all other mammals, some breed standards do not admit it among the coat colors and identify all dark coats as Dark Bay.
A subjective perception of a particular shade can create a difficult identification of horse’s color; thus, only genetic characterization is correct and reliable. The influences of external and individual factors, such as age, time of year, living condition, and nutritional status can make it difficult to identify a horse’s color. Thus, a Black horse kept always outdoors, could exhibit a “fading” effect on coat color, making it easily confounded with a Dark Brown horse. Conversely, the Dark Bay horse coat color is a black mane, tail and legs, and very dark brown or reddish hair at the head, neck, back and hip, which makes it confusing with a Black coat faded by sunlight.
Horses with brown coat color have black pigment in the mane, tail and legs, and have a nearly black phenotype of their whole body, but reddish or tan pigments around their eyes, muzzle, and abdomen between the elbow and stifle. This makes them easily confused with Liver-Chestnut coat color. At the same time, Chestnut horses are reddish with no black color on their entire body including the mane, but the shade of red can be so intense, that it can be confused with a Bay-Brown coat color.
Grey horses are born with an original coat color (such as black, bay, or chestnut) but gradually lose hair pigmentation at ages (6 to 8 years) but maintain the dark skin [
29]. The Grey coat color is due to the presence of dominant allele—G—at the grey locus. The genotype of grey horses will be either G/G or G/g and the horse without grey gene is symbolized as g/g. Grey is epistatic to all coat color genes except white and grey horses must have at least one grey parent [
30]. Thus, defining a horse as Grey tells us nothing about its real color and, very importantly, what coat color its offspring will inherit.