Performance evaluation of algorithms to estimate daily sedentary time using wrist-worn sensors in free-living adults.
Matthews CE., Saint-Maurice P., Freeman JR., Hayes HA., Shreves AH., Doherty A., Hyde ET., Ylarregui K., Jones RR., Keadle SK.
PURPOSE: Given the limited real-world testing of algorithms for wrist-worn sensors to estimate sedentary time, we examined the performance of 21 algorithms in free-living adults. METHODS: Seventy-one adults (35-65 years) wore a GENEActiv (wrist) and an activPAL (thigh) sensor for up to 10 days. activPAL was our reference measure. We estimated sedentary time (hours/day) using 21 classification algorithms, including cut point and machine-learning methods. Valid days from each monitor were matched by date and mean values were calculated. Equivalence testing (±10%) and linear regression were used to compare each algorithm's estimate to the reference, over all participants and by sex and age. RESULTS: activPAL recorded a mean of 9.4 hours/d sedentary. Five of 21 algorithms (24%) estimated sedentary time within 10% (±0.94 hours) of the reference. Two of these methods employed machine-learning algorithms (Trost Extended, OxWearables) and three employed cut points (GGIR ENMO 40mg; Bakrania ENMO 32.6mg; Fraysse ENMOa 62.5mg). Variance explained in linear regression was relatively high for the machine-learning (R2=0.44-0.63) and cut point algorithms developed for younger (R2=0.30-0.64) and older (R2=0.45-0.66) adults. More accurate performance was noted for algorithms developed in studies using posture-based ground truth measures and conducted in free-living settings. CONCLUSION: Fifteen of 21 (71%) algorithms produced estimates of sedentary time that were moderate-strongly correlated with the reference measure, but only five (24%) were within 10% of the reference. Free-living benchmarking studies like this can identify more accurate and precise algorithms to estimate sedentary time and identify characteristics of algorithm development studies that yield better results.