1. Introduction
The establishment of precise, high-resolution temporal sequences for morphogenetic events in laboratory mice, currently the most extensively used experimental mammal for studying human development and disease, remains a vexing issue in developmental biology. One of the main obstacles to establishing a precise developmental timeline for any developmental event is the high level of variability in developmental progress exhibited by embryos collected at similar days of gestation. Mouse embryos collected at the same embryonic age, even littermates, show disparities in morphological progress indicating variation in the rate of development [
1,
2,
3]. As a result, for any given developmental process (e.g., palate elevation, heart septation, limb bud development, organogenesis, etc.), each mouse within a litter will exhibit a different amount of progress along the trajectory towards completion of that process. Similar developmental differences appear across litters collected at the same time, with some litters appearing older or younger in aggregate than other litters. Consequently, selecting specimens to analyze molecular or cellular details of rapidly occurring developmental phenomena based solely on age information determined from timed matings is problematic. Inter-individual differences in rates of development make it difficult to select individuals to represent precise stages of development, even when matings are conducted on short-duration timelines and when time-consuming and often subjective examination of developmental morphology is used to stage individuals.
One example of the difficulty in establishing precise temporal sequence is the initial establishment of the mineralized skeleton. Dermal bone is not preformed in cartilage, but forms directly through intramembranous ossification [
4,
5] that involves the differentiation and proliferation of osteoprogenitor cells directly from mesenchyme. The initiation of skeletogenesis of dermal bones of the cranial vault and facial skeleton (the dermatocranium) is marked by ossification center differentiation, while skeletogenesis of endochondrally forming bone (the chondrocranium and the majority of post cranial skeletal elements) is marked by the initiation of mineralization of cartilage models. Despite nearly a century of research into bone development using laboratory mice as a model, few researchers have precisely characterized the developmental timing and sequence of ossification center appearance in mice. Even the meticulous atlases of mouse development compiled by Theiler [
6] and Kaufman [
7] do not attempt to establish an absolute sequence of ossification center development in the mouse.
The production of an embryo represents an assembly of diverse developmental events within any specific developmental process whose sequence is obscured due to the rapidity and variability of mouse development. Skeletogenesis is a developmental process marked by a series of developmental events: the proliferation, condensation, and differentiation of mesenchymal cells to a chondrogenic or osteogenic fate [
8], continued proliferation of those cells, and the production and mineralization of osteoid by osteoblasts. The appearance of ossification centers for cranial bones can occur rapidly, being separated by less than two hours of embryonic development, which confounds even short duration (commonly two hour) mating schemes (e.g., [
9]). Moreover, mice within a litter can display a range of developmental morphologies that exceed what can be achieved during two hours of gestational development. Consequently, since each embryo grows at its own rate and represents a slightly different stage of development, we cannot determine the precise sequence and timing of many developmental events strictly by using timed conceptions [
10]. Establishing a reliable read-out of the sequence of ossification center appearance requires a method for sorting embryos according to the relative level of development [
2,
3].
One potential method for sorting embryos is the use of crown–rump length (CRL), a simple and easy-to-collect metric of the maximum total length of an embryo, to order specimens according to size (
Figure 1). Ferguson [
11] found that CRL predicted the level of palatal development in rats better than embryo weight and other linear measurements. The precise nature of the relationship between size measured by CRL during embryogenesis and developmental progress is not clear, however. It is possible, despite the high correlation between size and developmental progress, that these two variables are autonomous for some developmental processes, and that CRL measurements cannot be used to generate relative sequences for some aspects of development. Despite the fact that embryos increase in size as their development progresses, the advancement of any individual developmental process (e.g., palate development) may not inherently depend on or be correlated with overall size increase.
Another solution to determining the precise sequence and timing of specific developmental events is the implementation of staging systems that use developmental morphology to establish a metric of the relative level of development of a mouse embryo (developmental age). Developmental age estimates provide information about the stage of development of an individual that cannot be determined from the amount of time elapsed between conception and embryo collection (age at harvest). Staging systems enable researchers to estimate a developmental age for each individual, sort individuals on the basis of developmental ages, and determine the precise sequence of a set of developmental events. There is no single method for determining developmental age for a mouse embryo, as any aspect of embryonic morphology that changes in synchrony with the level of overall development provides a potential metric for measuring developmental age. Thus, if methods are precise, staging systems based on the developmental morphology of the head [
1] and those based on the limb [
2,
12,
13] are equally valid for assigning developmental ages to embryos. However, some may be more or less appropriate for particular uses (e.g., head morphology-based staging systems may prove to be better predictors of palatal development relative to limb-based systems).
Though many systems for staging mouse embryos exist [
1,
6,
12,
13], they are not routinely used because they are not quantitative and thus contain implicit bias, are low resolution (grouping embryos by embryonic days or half days) providing no better precision than time elapsed since fertilization, or are too time-consuming to be of practical use for most research programs. The embryonic Mouse Ontogenetic Staging System (eMOSS) [
3] is a high-resolution staging method that uses the two-dimensional outline of an embryo’s hindlimb to provide an estimate of the relative level of development for embryos collected on embryonic days 10 through 15 (E10–E15). eMOSS age estimates (hereafter called limbstaging ages) are scaled to developmental timing, so that limbstaging ages are provided as point estimates in embryonic days and hours, with an associated confidence interval given as a range of hours of development. eMOSS estimates can be generated rapidly, since the method is scale-invariant and only a picture of the hindlimb is required to establish a limbstaging age estimate. eMOSS is particularly well-suited to the task of establishing a sequence for ossification center appearance, as the process of ossification begins and a large number of ossification centers appear during the embryonic period covered by eMOSS.
The distinction between these two classes of systems for ordering embryos is that one measures the extent of development on the basis of a size metric (CRL) while the other uses a metric designed to represent shape (limb outline) to track development. In evolutionary developmental biology, changes in the timing or rate of developmental events, known as heterochrony [
14,
15,
16,
17], lead to changes in size and shape that affect the overall morphology of the organism. Previous investigations into evolution by heterochrony assume that growth and development, represented mathematically by change in size and change in shape, respectively, are evolutionarily dissociable processes [
15,
16,
17]. A large literature rests on this assumption, though little evidence has been gathered on the dissociability of size and shape (as proxies for growth and development) among individuals of the same species.
Here, we use precise data collected from laboratory mice to determine the magnitude of variation exhibited by mouse embryos of the same age at harvest to determine variability in growth and development across our sample. We stage our samples of embryos using information on change in size (using crown–rump length) and change in shape (using eMOSS limbstaging ages) to establish a detailed sequence of cranial ossification center appearance in C57BL/6 and in Osx-GFP mice in which the expression machinery of the Osx (Osterix) gene drives expression of a green fluorescent protein (GFP) reporter gene. We examine the difference in sequences of ossification center appearance in mouse embryos using these two systems. Our use of these two systems enables the establishment of concrete timelines for the appearance of ossification centers using two different aspects of the initiation of ossification centers (osteoblast differentiation and bone mineralization), and provides an indication of the delay between these two events in the development of an ossification center. Our data also provide the first experimental evidence of the dissociability of size and shape in developing mice.
3. Discussion
Establishing temporal priority for a series of developmental events using laboratory mice presents a significant obstacle in developmental research due to variation in the rate of development among individuals. A synchrony of developmental events in mice contributes to the complexity. Unravelling the complex nature of sequential and causally-linked events such as gene expression and cellular differentiation is crucial to understanding the developmental processes that comprise morphogenesis. To understand the initiation of a developmental event, it is often necessary to detect changes in gene expression patterns that may be limited in time and therefore difficult to capture using standard mating schemes. Here, we have shown that measures of growth (CRL) and development (limbstaging age) provide significantly higher resolution timelines for a relatively obvious and binary developmental event (ossification center appearance) than the age at harvest for mouse embryos. Moreover, our data establish a framework for breeding for selected timepoints for research into osteoblast differentiation and mineralization by providing more precise timelines of when specific ossification centers emerge. The methods established here are easily transferable for characterizing the temporal sequence of other events of interest. Limbstaging methodology is readily applicable to establishing temporal priority of gene expression sequences, patterns that shift rapidly during developmental events [
19], yet are critical to understanding the nature of the relationship between genotype and phenotype.
Previous staging methodologies have not generally been used to provide the type of variation and developmental sequence data presented here for two main reasons. First, many of the staging systems lack the temporal precision needed to properly sort individual specimens. The Theiler staging system, for instance, describes the major developmental events on each embryonic day, but does not provide sufficient information to sort embryos according to specific levels of development within embryonic days [
6]. Second, while other staging systems possess the temporal resolution to correctly sort specimens according to size or developmental morphology, they lack the ease of collecting CRL or eMOSS data. Most prominent among these is the craniofacial development staging system outlined by Miyake et al. [
1] that established substages within the Theiler system for E11–E14. Over this range, Miyake et al. found similarly high levels of variation for embryos collected at the same harvesting age. However, scoring the level of development for individual embryos using this system requires high-resolution images of the craniofacial region of embryos as well as the knowledge that enables a researcher to discriminate among subtle differences in craniofacial developmental anatomy. While CRL data has always been available to researchers using inbred mice, it has not been used to establish temporal sequences and variation data outside of a few notable exceptions [
11,
18].
In a broader context, we have demonstrated that, although size and shape are coupled during development, this relationship is not fixed. Size and shape are highly correlated over this sample whose chronological ages span E12–E15 (R
2 = 0.92), but the correlation varies across harvesting age groups (ranging from R
2 = 0.53–0.8). Additionally, we have shown that size and shape show contrasting patterns of variation during this age period for our metrics. Variation in CRL increases with age, while variation in shape (limbstaging age) decreases with age, supporting previous findings of other researchers who studied variation in CRL [
18] and in craniofacial development [
1].
Finally, the sequence of appearance of ossification centers summarizing data collected from Osx-GFP and C57BL/calcein mice establish slightly different ontogenetic trajectories based on whether the specimens are ordered according to size or shape, though both measures present a more reasonable sequence than that obtained by using harvesting age. This indicates that for research projects seeking to establish temporal priority in developmental events, the metric used to sort individual embryos can seriously impact the results and their interpretation. It is evident that no single “best metric” exists for sorting individuals according to their developmental morphology. Sorting our data according to both limbstaging age (shape) and CRL (size) resulted in highly detailed and reasonable timelines of ossification center appearance, but the timelines are not the same. There may be no unique way to precisely measure the level of development of an individual embryo, and developmental processes may each bear a certain level of dissociation. The detected pattern of dissociability may reflect underlying modular patterns of development that facilitate flexibility that contributes to morphological evolution (see an example in opercle development [
20]). In short, each developmental event has its own timeline which may or may not correspond with the pace of other developmental events. Consequently, developmental events used to mark developmental progress of specific processes or of varying anatomical systems may not be appropriate across developmental time or across anatomical systems. Whether an internal (pertaining to the development of a particular system) or external (pertaining to general developmental principles) system is appropriate depends largely upon the research question being posed.
4. Methods
Litters of C57BL/6J embryos were generated by timed, overnight matings. The mice were bred twice a week, with males being introduced into the cages housing females at 3:00 p.m. and removed the following morning at 9:00 a.m. Pregnancy was verified by visually confirming the presence of a copulatory plug and weighing breeder females daily. Pregnant females were sacrificed at 9:00 a.m. via CO2 exposure followed by cervical dislocation. Embryos were removed from their placental sac, washed with saline, and fixed in 4% PFA for 24 h. The embryos were then transferred to 0.01% Sodium azide solution in PBS for preservation. All mouse procedures employed in this research were approved by the Institutional Animal Care and Use Committee of the Pennsylvania State University (IACUC46558, IBC46590).
To test the dissociability of size and shape, a total of 356 C57BL/6J embryos were collected at 24-h intervals for embryonic days 12–16. Ten litters were bred for each embryonic day, with sample sizes for each day ranging between 65 and 86 individual embryos (
Table 5).
To establish temporal sequence for ossification center development, two lines were used. Litters of Osx-GFP mice [
21] were collected at six-hour intervals from E12.0 through E15.0 to study the appearance of osteoblasts aggregated in developing ossification centers. Osx marks the differentiation of cells into osteoblasts [
22]. To study the appearance of mineralized ossification centers, we used C57BL6/J embryos whose mothers were injected with (20 mg/kg of body weight) calcein one hour prior to sacrifice. Litters of C57BL/calcein mice were collected at 6-hour intervals from E14.0 through E16.0 from the injected C57BL6/J dams. Mice from these two lines will be referred to as Osx-GFP mice and C57BL/calcein mice, respectively. Embryos were imaged using a light microscope with an attached Retiga 2000R monochrome camera (QImaging: Surry, BC, Canada) with an attached X-Cite 120Q to detect fluorescence.
Crown–rump length (CRL) was recorded for each specimen using ImageJ [
23]. CRL is defined in this study as the maximum cranial-caudal dimension of an embryo, which typically runs from the most superior/posterior portion of the head to the base of the tail in the sagittal plane (
Figure 1). CRL measurements were acquired for each embryo twice on different days and had to meet an intra-observer measurement error standard of no greater than 0.1 mm to be included in the study. The two CRL measurements were then averaged to establish a single CRL score for each embryo.
2D outlines of the hindlimb of each embryo were used to estimate limb staging ages. Briefly, a photograph of the hindlimb was uploaded to the embryonic Mouse Ontogenetic Staging System (eMOSS) [
3]. The 2D outline of the limb of each embryo is traced and then compared with a library of mean hindlimb shapes for each hour of development between E10–E15 to provide an estimate of an embryo’s limbstaging age [
3]. Whenever possible, both the left and right hindlimbs were staged and their average was used to estimate limbstaging age. All statistical analyses for this work were performed using R [
24]. Determination of the average time of appearance for each ossification center in the Osx-GFP and C57BL/calcein samples were estimated using the R package OptimalCutpoints [
25]. OptimalCutpoints uses receiver operating characteristic (ROC) curves to establish boundaries between binary variables (presence/absence of ossification centers) based on their relationship to another variable (limbstaging age or CRL) and accuracy measures selected by the researcher (sensitivity/specificity ratio).
The use of two different samples of mice to compare the level of bone development with crown–rump length and limbstaging age provides two advantages. First, osteoblast differentiation and bone mineralization are distinct developmental processes with osteoblast differentiation preceding mineralization, and it is possible that each process has a unique individual relationship with size and shape. Second, the two samples cover two embryonic time periods, which allows examination of the relationship between size and shape and of the progress of bone development over a larger proportion of developmental time.