1. Introduction
The autosomal dominant spinocerebellar ataxias (ADCAs) are a complex group of neurodegenerative diseases characterized by progressive dysfunction of the cerebellum and its afferent and efferent connections, which involve the brainstem and spinal cord [
1,
2]. ADCAs can be divided into seven different diseases of episodic ataxias (EA) with recurrent episodes of poor coordination and balance (ataxia) and spinocerebellar ataxias (SCAs), of which 43 different causative loci are known at the present time [
3,
4]. Seven of these different diseases (SCA1, SCA2, SCA3, SCA6, SCA7, SCA17, and DRPLA) are caused by CAG repeats in the coding exons of these unrelated genes, leading to polyglutamine expansions (poly-Q) in the translated gene products. The modified proteins are responsible for a predominant toxic gain-of-function, which depends on the length of the glutamine residues. There is an inverse correlation between the length of polyglutamine tracts and age and the appearance of neuropathological signs [
5,
6]. Although SCAs are genetically and clinically heterogeneous, the cerebellar pathologies are commonly associated with motor coordination failures of limbs and ataxic gait. Other clinical symptoms include variably pyramidal and extrapyramidal signs, peripheral neuropathy, pigmentary retinopathy, ophthalmoplegia, and dementia [
7]. In contrast to the late-onset clinical features, the molecular and cellular events that underlie the neurological symptoms are still poorly understood.
To gain insights into the mechanisms that contribute to ataxic phenotypes, the zebrafish
Danio rerio has emerged as a useful model system to study brain development, neuronal function, network activity, and neuropathy in vertebrates. Zebrafish models are gaining popularity in neurobiology due to their optic transparency, the similar general organization of their CNS to other vertebrates, and the neuronal genes, which show reminiscent functions in humans [
8]. These conserved genetics allow both the elucidation of normal gene function in vertebrates and the analysis of genetic dysfunction and the mechanisms involved in inherited neurodegenerative diseases. Neuropathological processes with phenotypic overlaps give insights into common molecular pathways that are responsible for similar behavioral patterns between species. A wide spectrum of behavioral protocols have been utilized to support behavioral phenotype analysis in regards to compound screens for the therapeutic treatment of the CNS in zebrafish embryos, larvae, and adults [
9,
10,
11,
12,
13,
14]. Therefore, zebrafish models have many advantages and have become increasingly important to the study of neurodegenerative diseases such as SCAs, which are incurable, debilitating disorders.
There is still a lack of adult zebrafish SCA models and a huge gap in the knowledge of
ataxin genes and their functions in zebrafish. Therefore, it is important to expand the range of zebrafish models for the study of the different types of dominant ataxias [
4]. One of the frequent dominant ataxias in which a similar zebrafish protein function is expected is spinocerebellar ataxia type 1 (SCA1). SCA1 is characterized by cerebellar and brainstem degeneration [
5]. The fatal late-onset disorder is caused by an expansion of CAG repeats in the coding region of the
ATXN1 gene. The accumulation of polyglutamine-expanded ATXN1-poly-Q results in selective neuronal degeneration in the cerebellum and brainstem, causing the loss of motor coordination, which induces swallowing and breathing difficulties of patients and leads to premature death [
15]. Although ATXN1 is broadly expressed in the human brain, the neuronal degeneration caused by
poly-Q-ATXN1 selectively affects the neurons of the cerebellum and the brainstem [
5,
6,
15].
In a previous work, three
atxn1 gene family members were identified in zebrafish: two
ATXN1 orthologs,
atxn1a and
atxn1b, and the
ATXN1L ortholog
atxn1l [
16]. The authors demonstrated that all are expressed in the developing cerebellum and in adult Purkinje cells by pan-ATXN1 immunohistochemistry. This suggests that the zebrafish is a suitable model system with which to study the SCA1 pathogenesis. However, for further analysis of the distinct
atxn1 genes in zebrafish, a broader knowledge of their individual spatiotemporal expression is missing and may be beneficial for future research. Here, we compare the genetic structures of the
atxn1 genes in the zebrafish genome and analyze the phylogenetic relationship of their deduced proteins in vertebrates. We prove the bioinformatically predicted transcript variants of
atxn1a,
atxn1b, and
atxn1l genes and provide new insights into shared and different expression domains during zebrafish development.
3. Discussion
In neurodegenerative diseases such as hereditary dominant ataxias, neuronal cells in the brain lose their function over time and finally die. To date, there is no cure for these movement disorders. There is a critical need to identify the pathological processes and to develop new approaches for treatment or prevention. Therefore, the zebrafish
Danio rerio has already become an increasingly important model organism for human neurodegenerative diseases such as the autosomal dominant spinocerebellar ataxias SCA2, SCA3, SCA6, SCA8, and SCA18 (for review see [
4]). It could be important to expand the range of zebrafish models also to SCA1, which is largely caused by a proteotoxic gain-of-function mechanism due to an expansion of a polyglutamine stretch in the human ATXN1 protein [
5,
17]. ATXN1 shows a nuclear–cytoplasmic localization, interacts with large protein complexes, binds RNA, and is thought to be involved in transcriptional repression [
18]. Interestingly, although SCA1 is caused by an expansion of a CAG repeat in the coding region of the
ATXN1 gene, no repeats were found in the corresponding paralog gene region of rodents, prosimian, and New World monkeys. Perfect (uninterrupted) CAG repeats were found in Old World monkeys and interrupted CAG repeats in hominoids [
19]. This suggests that the disrupted CAG structure of the
ATXN1 gene was acquired gradually during human evolution, and the interruption of the CAG sequence is associated with a slower progression of neurodegeneration.
Likewise, little is known about
atxn1 genes in zebrafish. Three
atxn1 genes are indicated in the Zebrafish Information Network Database (ZFIN,
https://zfin.org/ (accessed on 3 March 2021)), two
ataxin-1 genes, annotated as
atxn1a and
atxn1b as well as one
ataxin-1-like gene,
atxn1l. In a recent work, three Atxn1 proteins, Atxn1a, Atxn1b, and Atxn1l have been described in zebrafish. Using a cross-specific anti-ATXN1 antibody that recognizes all three zebrafish proteins, it was shown that all are expressed in the cerebellar anlage of embryos and in the adult zebrafish cerebellum [
16]. Three conserved regions of high homology were identified between the human ATXN1 and the three zebrafish proteins, including the N-terminus, the AXH domain as an important mediator of protein-protein interactions, and the C-terminus. The long polyglutamine repeat is not conserved in zebrafish [
16]. Our comparative sequence alignment revealed that this is also the case in all other vertebrate species examined. The polyglutamine repeat is only found in humans and chimpanzees. We compared the protein sequence of human ATXN1 and ATXN1L and other vertebrate models to the zebrafish ortholog proteins in relation to the size, the similarity and the phylogenetic distance. Besides the species-specific polyglutamine stretch, the length of the human ATXN1 (815 amino acid residues, aa) is in the range of the other somewhat smaller ortholog proteins in chimpanzees, rats, mice, chickens, frogs and zebrafish (including the zebrafish Atxn1a (881 aa) and the longer zebrafish Atxn1b (828 aa)). A sequence comparison revealed 46% (Atxn1a) and 33% (Atxn1b) amino acid identities to ATXN1. The smaller ATXN1L protein (689 aa) and the vertebrate orthologs have similar sizes. The zebrafish Atxn1l protein (865 aa) has a comparably longer amino acid chain than the other selected vertebrate orthologs and is 42% identical to the human protein. These data were used to elucidate the phylogenetic relationship between the zebrafish proteins Atxn1a and Atxn1b as well as their human and other vertebrate orthologs. Our phylogenetic analysis indicated two clusters for human ATXN1 proteins and ATXN1L proteins in vertebrates. The ATXN1 cladogram revealed a close relationship in hominids and rodents but much less so in chickens, mice, and zebrafish. The same phylogenetic relationships were found for the ATXN1L cluster between species. The phylogenetic analysis of all three zebrafish proteins revealed Atxn1a as the evolutionarily closer protein variant to the human ATXN1 protein. In contrast to Atxn1a, we showed that the zebrafish variant Atxn1b is phylogenetically closer to its own Atxn1l paralog and that of other vertebrates.
The two Atxn1a and Atxn1b protein variants in zebrafish result from a duplication of the
atxn1 gene and have a common phylogenetic origin. The zebrafish belongs to the very diverse infraclass of
Teleostei that accounts for about half of the living vertebrates and over 96% of all fish species [
20,
21]. The teleost lineage underwent a whole-genome duplication event (TGD) about 320 million years ago, resulting in two copies of each gene in the genome [
22]. Such paralogous genes were presumably redundant, and one copy was lost randomly during further evolution [
23]. This was likely the case for the
atxn1l gene, with only one copy of the gene present on the zebrafish chromosome seven minus strand. The
atxn1a gene maps on chromosome 19 plus strand and the
atxn1b gene on chromosome 16 minus strand. These chromosomes were identified as paralogous chromosomes [
24]. We performed a comparative synteny analysis of the human
ATXN1 gene locus to the zebrafish
atxn1a and
atxn1b genomic loci. We identified the
atxn1b gene with conserved synteny to the human ortholog. No synteny was found for the zebrafish
atxn1a locus. However, when both paralogs are maintained, accumulations of mutations can affect one gene copy differently to the other and lead to new functional properties or asymmetric acceleration of the evolutionary rate [
25].
We analyzed and compared the genetic structures and expression patterns of the three
atxn1 genes and could identify similarities and differences in the zebrafish. The
atxn1a gene spans 198 kb of genomic DNA and is organized in eight exons that are each employed to express four different transcript variants, which differ from one to three exon sequences in the 5′ untranslated region (5 ′ UTR), but all of them use exon E4 and E5, which contain the coding sequences. This is similar for the
atxn1b gene, which spans 35 kb with five exons that transcribe two different variants, X1 and X2. Both differ in three different exons in the 5′ UTR (X1: E1′ and E2′; X2: E1) and use the larger exons E2 and E3 that harbor the coding sequences. This variability of the 5′ UTR is similar to the human ATXN1 gene, which is much larger (450 kb) and organized in nine exons. The first seven exons are located in the 5′ UTR, while exons E8 and E9 contain the coding region and the 3′ UTR [
17]. We confirmed all in silico suggested
atxn1a (X1–X4) 5′ UTR transcript variants and the
atxn1b X1variant via RT-PCR and sequencing. However, in the case of
atxn1b X2 variant, the predicted exon E2′ could not be confirmed experimentally. Sequencing revealed that this exon is either weakly expressed or not expressed and not amplifiable by RT- PCR or spliced out in the
atxn1b X2 variant. Together,
atxn1a and
atxn1b variants contain variable noncoding exons in the 5′ UTR. It is likely that these noncoding exons undergo alternative splicing in different tissues of the zebrafish. This leads to the assumption that the transcriptional and translational regulation of
atxn1a and
atxn1b may be complex and tissue dependent in zebrafish. In contrast, the
atxn1l gene in zebrafish contains only three exons that are distributed in a region over 18 kb of the genomic DNA and encode only one transcript. The predicted 5′ UTR exon sequence of
atxn1l was also experimentally confirmed in our study.
Interestingly, the protein size of human ATXN1 (815 aa) with a molecular mass of 88 kD is almost the same to the gene products of Atxn1a (881 aa) and Atxn1b (828 aa). However, the human ATXN1 mRNA (NM_000332.3) is much longer (10,636 nt) than all the predicted zebrafish transcript (5–8 kb) variants. In humans, the last coding exon contains a 8288 bp long 3′ untranslated region (3′ UTR). In contrast, the predicted 3′ UTRs of atxn1 transcripts in zebrafish are much shorter. In addition to the confirmation of the 5′ends of the atxn1 variants in zebrafish, we could also experimentally confirm the coding regions and most of the predicted 3′ UTR by RT-PCR using primer pair combinations covering the complete predicted sequences. However, we failed to amplify the exact 3′ prime end in all three atxn1 transcript variants. We performed several 3′ RACE (rapid amplification of cDNA ends) experiments using different sets of forward primers at the 3′ end of the cDNA and reverse primers containing oligo-(dT) universal amplification sequence and also prolonged extension times in PCR. We could not amplify any reliable cDNA amplicons in several attempts. We assume that the 3′ UTRs of atxn1 transcripts are likely much longer or different than predicted and will need to be studied in the future.
In addition to the conserved organization of the
ATXN1 and
ATXN1L genes between humans and zebrafish, we were interested in whether this was also true for their expression patterns. In human, ATXN1 expression domains were identified by immunoblot analysis in different brain regions, but also in nonneuronal tissues such as the liver, heart and skeletal muscle. In nonneuronal tissues, ATXN1 is predominantly located in the cytoplasm, in contrast to the brain where the neurons of the cortex, the basal ganglia, and the pons show a nuclear localization of ATXN1. A nuclear–cytoplasmic localization was shown for Purkinje cells of the cerebellum [
18]. Purkinje cells are the key neurons in spinocerebellar ataxia type 1 because an expansion of the trinucleotide CAG repeat in the
ATXN1 gene leads to nuclear inclusions of the altered mutated protein in these neurons involved in the pathogenesis of SCA1 [
5,
17,
26,
27]. In zebrafish, Atxn1a and Atxn1b contain a nuclear localization signal (NLS) and both proteins localize to the nucleus in tissue culture experiments. Like in human, the zebrafish Atxn1l protein is mostly cytoplasmic, but when co-expressed in COS-1 cells, both are predominantly found in the nuclei [
16]. This strengthens the previous assumption that ATXN1 may support the shuttling of ATXN1L to the nucleus and interact in large protein complexes to cause SCA1 neuropathology [
28]. RT-PCR was used to demonstrate that each member of the
atxn1 gene family is expressed in the adult zebrafish cerebellum, and
atxn1a and
atxn1b are expressed in the cerebellar anlage of 28 hpf embryos, supporting the role of both genes in cerebellar development [
16]. Our detailed expression analysis of all
atxn1 members in zebrafish comprehensively expands the expression data of
atxn1 in the embryos and the larvae of this model system. Our data show a spatiotemporal expression of all three
atxn1 family members with overlapping expression domains during early embryogenesis followed by different expression levels and domains during further development.
Atxn1a is uniformly expressed in the embryo during the segmentation phase with increasing levels in the diencephalon at 1 dpf. The expression domains extend to areas such as the epiphysis, the telencephalon, the metencephalon, the rhombencephalon, the optic tectum, and the cerebellum during further developmental stages until 4 dpf. Atxn1a expression persists in the gills but decreases or disappears almost completely in the rhombencephalon at the larval stages of 7 dpf and 10 dpf. At this stage, a rather low and diffuse expression is found in the cerebellum. Interestingly, the expression domain of atxn1a more closely resembles the pattern of atxn1l than that of atxn1b, with the exception that atxn1l expression is enhanced in the rostral and caudal region during the segmentation phase, as compared to a more uniform expression of atxn1a. Then the expression domain of atxn1l shifts to the anterior part at 1 dpf and includes the developing CNS with the tegmentum, the diencephalon and the mesencephalon, the cerebellum, the metencephalon, and the myelencephalon at 2 dpf. Atxn1l expression also becomes downregulated in almost all brain regions when transcripts are still detectable in the tectum and the cerebellum, but primarily in the regions of gill arches from stages 4 dpf to 10 dpf. The spatiotemporal expression domain of atxn1b differs to some extent from those seen for the other two gene family members. A strong expression is already obvious in the telencephalic region at 1 dpf and extends to the brain regions of the tectum opticum, the cerebellum, and the rhombencephalon at 4 dpf. In contrast to atxn1a and atxn1l, atxn1b expression decreases only in the telencephalon, but persists in the midbrain and hindbrain region from 7 dpf to 17 dpf. Cryosectioning of the stained embryos confirmed the expression domains in the thalamus, the tectum opticum, the retina, the pineal gland, the cerebellum, and the rhombencephalon.
The RT-PCR analysis confirmed the expression of all atxn1 genes during embryonic and larval stages in zebrafish. While atxn1l is continuously and uniformly expressed from fertilization until adulthood, atxn1a expression decreases during stage of early brain development but subsequently recovers to previous levels at 2 dpf. Instead, maternal atxn1b expression rapidly declines during the first hours of development and reappears as zygotic expression at 12 hpf when early brain development occurs and lasts until the adult stages. Together with the spatiotemporal expression profile observed with the WISH analysis, these findings argue that atxn1b is more strictly reflecting neuronal development, differentiation, and homeostasis.
Taken together, the results of all whole-mount in situ hybridization experiments on atxn1 gene expression indicate that atxn1a, atxn1b, and atxn1l are upregulated primarily during embryonic development of the zebrafish brain, but only atxn1b is continuously expressed at higher levels in the midbrain and hindbrain during further larval development and is strictly confined to the cerebellum. This highlights the need to perform further experiments with the goal of identifying the function of these proteins in the zebrafish model in the future.