To the best of our knowledge, this is the first study that developed accelerometer cut-points for the hip, non-dominant wrist, and dominant wrist with a consistent protocol in older adults (≥70 years old), cross-validating these cut-points in an independent sample following a different protocol of activities. Overall, the cut-points showed fair-to-good accuracy to classify ST and MVPA. This study contributes to the field by providing ST and MVPA cut-points from different wear locations in older adults (≥70 years old), based on a number of activities that characterize the older adults’ common activities. Furthermore, we provide cut-points based on ENMO, which are comparable across different accelerometers [
21], as well as ActiGraph’s activity counts, which have been traditionally used in the field [
8]. Additionally, this is the first study providing cut-points based on counts for wrist-worn accelerometers in older adults. The wide range of activities, the different wear locations, and the acceleration metrics used also allow for a deep understanding about what choices are better adapted to measure ST and PA intensities in older adults.
4.1. Wear Location: Hip, Non-Dominant Wrist, or Dominant Wrist
We placed accelerometers at the right hip, the non-dominant wrist, and the dominant wrist in this study. These body attachment sites are commonly used in population-based studies to monitor ST and PA in free-living conditions. As examples, the National Health And Nutrition Examination Survey (NHANES) collected accelerometer data from the hip and non-dominant wrist in a population-based sample of US citizens [
22], and the UK biobank placed accelerometers on the dominant wrist of British adults [
23]. As such, we provide relevant accelerometer cut-points to enhance the comparability across studies and to develop new studies with hip- and wrist-worn accelerometers in older adults.
Regarding ST, caution is advised as we did not consider posture (i.e., sitting, reclining, or lying) but only energy expenditure as a criterion. In this regard, a previous study on the AREA study participants developed ST-to-light PA cut-points for counts using activPAL-defined postures as a criterion [
24]. If scaled up from counts/5 s to counts/min, the cut-points in this study are lower than the previously developed ones (i.e., 12 vs. 174 counts/min for the hip, 1224 vs. 1853 counts/min for the non-dominant wrist, and 2100 vs. 2303 counts/min for the dominant wrist) [
24]. Cut-point differences can be explained by the different criterion used to identify ST and MVPA (i.e., postures vs. energy expenditure). ST based on energy expenditure could include some standing activities with low energy requirement, while the posture-based estimations might include sitting activities with high energy requirement. This may partially explain the lower thresholds observed in this study. Furthermore, we observed that ST classification accuracy was higher for wrists than the hip using ENMO (i.e., +7% in wrists) and vice versa for counts (i.e., +4% in hip). The 4% higher performance in the hip versus wrist counts was also observed in a previous study on the AREA study participants [
24]. In addition, previous studies using ENMO have found a similar accuracy of the hip and wrist for the ST classification (i.e., 7 m
g in this study, 12 m
g in Duncan et al. [
7], and 6 m
g in Sanders et al. [
6]) and the non-dominant wrist (i.e., 18 m
g in this study, 18 m
g in Duncan et al. [
7], and 20 m
g in Sanders et al. [
6]). The dominant wrist threshold was higher in our study (i.e., 22 m
g) compared with Duncan et al. (i.e., 10 m
g) [
7]. The inclusion of several sedentary activities with arm movements, (e.g., writing, dealing cards) should be considered as they can lower the performance of some ROC curves, yet they are more representative of daily life activities of older adults.
We found better accuracy to classify MVPA by hip data than wrist data in the acceleration metrics investigated (i.e., +13% to +26% for the hip compared to wrists). Two previous studies also found better accuracy in hip monitors compared to wrist monitors to classify MVPA based on ENMO [
6,
7]. Between-location differences were smaller in Sanders et al.’s cut-points (i.e., +6% accuracy in the hip) [
6] than in Duncan et al.’s cut-points (i.e., +16% to +19% accuracy in the hip) [
7]. Different protocols of activities are likely responsible of the different estimations, as well as differences in the study samples investigated (e.g., we focused on an older sample (≥70 years old) compared to previous studies). Likewise, the definition of the 1 MET value may explain how we found lower thresholds than previous studies, since we used VO
2 = 2.8 mL/kg/min instead of the higher values (i.e., 3 to 3.5 mL/kg/min) used in some of the previous studies [
6,
7,
16]. To note, the hip versus non-dominant wrist differences were roughly similar after removing cycling from the activities analyzed (i.e., +16% with cycling included) [
7]. An overall outperformance of hip versus wrist in the classification of MVPA can be concluded from the different studies using ENMO [
6,
7]. This can be partially explained by the higher variability observed in wrist accelerations, which may complicate their classification into specific categories. No previous studies have provided MVPA cut-points for counts using wrist-worn accelerometers in older adults.
Overall, dominant and non-dominant wrists provided a similar classification accuracy both for ST and MVPA in this study. Similar conclusions were obtained for the ST classification with counts in a previous study on the AREA study participants [
24]. Duncan et al. found a better classification accuracy of dominant wrist vs. non-dominant wrist for ST and vice versa for MVPA [
7], although this depended on the activities included in the analysis (i.e., considering or not cycling) [
7]. Fraysse et al. observed a similar accuracy of dominant and non-dominant wrists for the classification of ST yet a slightly higher accuracy of the non-dominant compared to the dominant wrist to classify MVPA (i.e., +5%) [
25]. Of note, cut-points developed by Fraysse et al. were based on a different acceleration metric than ours; thus, the absolute thresholds cannot be directly compared.
4.2. Acceleration Metrics: ENMO or Counts
The inclusion of both ENMO and counts is of relevance in this study. Although ActiGraph activity counts have been traditionally used in the field, concerns have been raised about their replicability with other monitors and the lack of transparency in the processing methods to obtain them [
26]. In this regard, modern accelerometers provide raw accelerations that can be consistently processed to obtain comparable outputs from different monitors [
27]. In this line, open-source metrics, such as ENMO, are of great value for the field as they ease replicability and comparability across different cohorts using different devices [
11,
28]. We provided cut-points using both ENMO and counts in this study in order to fit different needs.
Regarding the hip cut-points, counts outperformed ENMO in the classification of ST and MVPA by 10% and 3%, respectively. Otherwise, ENMO outperformed counts in the classification of MVPA in both wrists by 9–11%. The fact that the ENMO metric was originally developed from wrist data [
29], while counts were developed from hip data, can partly explain this finding. It may be that the calibration and filtering procedures of ENMO and counts are better adapted to wrist and hip motion, respectively. To our knowledge, no previous studies have compared ENMO and counts in the classification of ST and MVPA, which is a novel finding in this study.
4.3. Choosing Cut-Points
We provided cut-points to fit different needs in this study. A common scenario is that the data were previously collected; thus, decisions on wear location were already decided. In this case, we provided cut-points for the hip, non-dominant wrist, and dominant wrist to adapt to the different data collection protocols. Likewise, raw data require high storage capacity, hence are often removed after their initial processing. In this case, we provided cut-points based on two different acceleration metrics that are most frequently used in the field at the moment (i.e., ENMO and counts).
In a different scenario, data collection was not performed yet and advanced decisions can be made. In this regard, a smart decision would be to use cut-points with higher accuracy for the main variable of interest (e.g., ST or MVPA). Yet, other considerations apart from accuracy should be balanced. For example, acceptability of accelerometers could be higher for certain body attachment sites in a given population, resulting in higher compliance with the accelerometers [
30,
31]. Likewise, other accelerometer-assessed variables, such as sleep, might be of interest, and sleep algorithms are mainly based on wrist-worn accelerometers, which may point out the body attachment site to use in a hypothetical study. While not always possible, it would be of value to cross-validate the cut-points with the sample of interest (or a subsample of it). Although we cross-validated our cut-points with an independent sample, they might not be extrapolated to every older adult’s sample. Regarding our cross-validation, we roughly observed high sensitivity and specificity values for ST and MVPA in our hip and non-dominant wrist cut-points (sensitivity > 0.83 and specificity > 0.80). The ENMO cut-point for the hip requires further attention as its sensitivity was lower than that of the rest (i.e., 0.70). A similar scenario was observed in a previous study (i.e., 19 m
g), and the authors suggested the relevance of increasing the specificity as much as possible, while keeping the sensitivity at 0.6 to ensure that light PA was not misclassified as MVPA [
6]. Following this approach to define the threshold in our calibration sample, we obtained a cut-point of 23 m
g, which is still lower than previous thresholds proposed for older adults (e.g., ~55 m
g [
7]
, 69 m
g [
6]). The activities selected in the calibration protocol may be partially responsible of this, with some activities requiring low hip movement but light-to-moderate-intensity MET values (e.g., standing still, washing dishes, kneading dough, dressing, shopping, dealing cards). Indeed, we observed a lower MVPA cut-point for counts in the hip than previously proposed cut-points. If scaled up to 15 s epochs, we provided a cut-point of 162 counts/15 s, while Evenson et al. proposed cut-points from 296 to 620 counts/15 s in 200 older women [
32]. Unfortunately, no cross-validation of the dominant wrist cut-points could be performed. Future studies should investigate the agreement between the ST and MVPA metrics obtained in free-living conditions with the different sets of cut-points presented in this paper.
4.4. Limitations and Strengths
This study is not without limitations. First, although we had a relatively large sample size compared to previous studies, we lost data of 30 participants (38%) because of calibration issues with the accelerometer data or missing data. Age, sex, and anthropometrics were similar between included and excluded participants. Second, cut-points are specific of the data collection protocol and participants taking part in the calibration study. To improve generalization, we cross-validated the cut-points in an independent sample performing different activities. However, this may not be enough, and more cross-validation studies should be performed in older adults. Population-standardized values could also be used to harmonize and improve representativity of accelerometer data. Likewise, our definition of 1 MET (i.e., 2.8 mL/kg/min) could not be determined specifically for each individual since our protocol did not include a sitting activity of enough duration. Instead, we calculated the oxygen consumption while lying for 10 min to support the decision previously proposed in the literature of using lower oxygen consumption values than the standard 3.5 mL/kg/min [
14]. Current guidelines suggest measuring the resting metabolic rate (i.e., 1 MET) while participants are lying down, not sitting [
15]. Additionally, lower values of oxygen consumption have also been used in previous calibration studies on older adults [
6,
16,
17]. Furthermore, the accelerometers did collect raw accelerations at different frequencies in the calibration (i.e., 80 Hz) and cross-validation (i.e., 60 Hz) studies. An effect of the sampling frequency in the count generation has been previously observed [
33], and this may affect the results of the cross-validation of our count-based cut-points. However, such effect was observed at relatively high intensities, which are unlikely to occur in our ≥70-year-old participants. Another limitation is that we could not cross-validate the dominant wrist cut-points. Finally, we acknowledge that the laboratory setting limited the ecological validity of the data, even if the participants performed activities that replicated their daily life activities.
The strengths of this study included the focus on ≥70-year-olds, a population with limited evidence thus far. We used 2.8 mL/min as a reference for the MET calculation, being better adapted to older populations than the standard 3.5 mL/min [
14]. Cut-points were developed based on tri-axial accelerations with two different acceleration metrics and placing the accelerometers in three body attachment sites (i.e., hip, non-dominant wrist, and dominant wrist). Open-source algorithms were used for ENMO-based cut-points, which enhanced the replicability and comparability across studies. Likewise, we used a wide array of tasks with a variety of activity intensities, including activities with substantial arm movements, which increased the extrapolation to free living studies.