A new Apple-supported study argues that your behavior data (movement, sleep, exercise, etc.) can often be a stronger health signal than traditional biometric measurements like heart rate or blood oxygen.To prove it, the researchers developed a foundation model trained on behavioral data collected from wearables, and it performed surprisingly well.Here are the details.
This preprint paper, Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions, comes as a result of the Apple Heart and Movement Study (AHMS).They trained a new foundation model on more than 2.5 billion hours of wearable data, showing it can match (and even outperform) existing models built on low-level sensor data.They call the new model WBM, which stands for Wearable Behavior Model.
And while previous health-related foundation models mostly relied on raw sensor streams like the Apple Watch’s heart rate sensor (PPG, or photoplethysmograph) or its electrocardiograph (ECG), WBM learns directly from higher-level behavioral metrics: step count, gait stability, mobility, VO₂ max, and so on.All of which the Apple Watch produces in abundance.But if the Apple Watch has these sensors, what’s the point of the new model? Great question.
And the answer is in the study: In other words, while the Apple Watch collects raw sensor data, that data can be noisy, overwhelming, and not always aligned with meaningful health events.While the metrics used by WBM are based on that sensor data, the data is refined to highlight real-world behaviors and health-relevant trends.They’re more stable, easier to interpret, and better structured for modeling long-term health trends.
In practice, WBM learns from the patterns found in processed behavioral data, rather than relying directly on raw sensor signals.The nerdy bits WBM was trained on Apple Watch and iPhone data from 161,855 participants in AHMS.Instead of raw streams, the model was fed 27 human-interpretable behavioral metrics, such as active energy, walking pace, heart rate variability, respiratory rate, and sleep duration.
The data was broken down into weekly blocks and passed through a new architecture built on Mamba-2, which performs better than traditional Transformers (the base for GPT) for this use case.When evaluated on 57 health-related tasks, WBM outperformed a strong PPG-based model in 18 of the 47 static health prediction tasks (like whether someone takes beta blockers), and in but one of the dynamic tasks (like detecting pregnancy, sleep quality, or respiratory infection).The exception was diabetes, for which PPG alone won out.
Even better: combining both WBM and PPG data representations produced the most accurate results overall.The hybrid model achieved a whopping 92% accuracy for pregnancy detection, and consistent gains in sleep quality, infection, injury, and cardiovascular-related tasks like Afib detection.In the end, the study doesn’t try to replace sensor data with WBM, but rather complement it.
Models like WBM capture long-range behavioral signals, while PPG catches short-term physiological changes.But together, they’re better at flagging meaningful health shifts early.If you’d like to know more about the Apple Heart and Movement Study and other studies, we’ve got you covered.
AirPods deals on Amazon
AirPods Pro 2, USB-C Charging: $149 (down from $249)
AirPods 4 USB-C Charging: $89 (down from $129)
AirPods 4, USB-C and Wireless Charging: $119 (down from $179)
AirPods Max, USB-C Charging, Midnight: $429.99 (down from $549)
You’re reading 9to5Mac — experts who break news about Apple and its surrounding ecosystem, day after day.Be sure to check out our homepage for all the latest news, and follow 9to5Mac on Twitter, Facebook, and LinkedIn to stay in the loop.Don’t know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel