Comparison of consumer-grade wearable devices with a research-grade instrument for measuring physical activity in a free-living setting
Takuya Miwa,
Kazuma Mii,
Ryouichi Chatani and
Yasuo Sugitani
PLOS ONE, 2026, vol. 21, issue 2, 1-15
Abstract:
Introduction: Wearable accelerometer devices are now widely used in both research and daily life settings. This study aimed to compare the accuracy of three commercially available consumer-grade activity monitors with the medical-grade ActiGraph device in a free-living setting in Japan. Methods: Thirty-six office workers were enrolled and provided with an ActiGraph. Data were analyzed from participants who also wore Apple Watch (n = 21), Fitbit (n = 22), and Oura Ring (n = 5) over a 3-week period. Step count, physical activity energy expenditure (PAEE), and moderate-to-vigorous physical activity (MVPA) data were collected from all devices. Data were analyzed using correlation coefficients, mean differences, and Bland–Altman plots. Results: ActiGraph data confirmed comparable physical activity levels across the participant subgroups, ensuring a valid basis for the subsequent inter-device comparisons. Step counts were largely consistent across devices, with Apple Watch and Oura Ring measurements within 10% of ActiGraph measurements (mean percentage differences 2.12% and −6.24%, respectively), while the Fitbit overestimated step count by 18.00%. MVPA showed greater variability, with Apple Watch and Oura Ring underestimating by 46.22% and 11.64% respectively, whereas the Fitbit showed minimal mean difference (0.62%). PAEE showed the largest discrepancies, with Apple Watch and Fitbit overestimating by 25.91% and 139.19% respectively, and Oura Ring underestimating by 16.87%. Correlation coefficients were strong for step counts (r = 0.84–0.92) but lower for MVPA and PAEE across all devices. Bland–Altman analysis revealed proportional bias in the Fitbit’s PAEE and the Apple Watch’s MVPA, with errors increasing at higher activity levels. Conclusion: Step counts were largely consistent with the ActiGraph for most devices; however, the Fitbit showed a notable overestimation. However, the ability of those devices to accurately measure MVPA and PAEE appeared to be more limited, particularly at higher activity levels. These findings underscore that the selection of a consumer-grade wearable for research or clinical use must be carefully guided by the specific metric of interest. However, the findings for the Oura Ring should be interpreted with caution due to the small sample size.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0342543 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 42543&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0342543
DOI: 10.1371/journal.pone.0342543
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().