Beiwe Research Platform

Dr. Onnela received a 2013 NIH Director’s New Innovator Award, which enabled the crystallization of the concept of digital phenotyping, the construction of the Beiwe research platform, and the development of statistical methods and tools to help turn the collected data into biomedical and clinical insights. We have received a large number of requests about our approach and the research platform, and we attempt to address some of the most frequently asked questions here. This is an evolving document, which will likely change substantially in the fall of 2017. Regarding the most frequently asked question, Beiwe is a transliteration of a Nordic goddess of sunlight and mental health. We pronounce it bee-we.

What is digital phenotyping? We have defined digital phenotyping as the “moment-by-moment quantification of the individual-level human phenotype in situ using data from personal digital devices,” in particular smartphones. This is our definition of the concept and it highlights some of the important aspects of digital phenotyping, such as using existing personal devices rather than introducing additional instrumentation. To truly leverage moment-by-moment data collected in situ, in the wild, one must rely on the use of passive data, i.e., smartphone sensor and usage data (see below).

Why is digital phenotyping better than existing survey-based approaches to social and behavioral phenotyping? Of the many different phenotype classes, behavior has presented special challenges for phenomics, the systematic study of phenotypes on a genome-wide scale, because of its temporal nature and context dependence. The traditional approach to behavioral phenotyping has relied on pen-and-paper surveys, but these self-reported accounts tend to be highly unreliable and subject to different kinds of recall biases. Ecological momentary assessment (EMA) is an approach that attempts to survey subjects’ behaviors and experiences in real time in their natural environments. EMA used to be carried out using specialized hardware, such personal digital assistants, limiting its scalability, but can now be implemented on smartphones. While EMA can certainly provide interesting insights into behavior, as a survey methodology it suffers from the same problems that all surveys do, mainly reliance on subjective accounts of behavior rather than objective measurement of behavior. Other limitations are that frequent surveys require active user engagement, which may be difficult with certain clinical populations, and long-term adherence is also typically low. Frequent surveying also causes respondent fatigue, and may inadvertently constitute an intervention. Finally, our preliminary findings suggest that subjects are less likely to take surveys under two diagonally opposite situations, either when they are doing very well or when they are doing very poorly. This means that survey data tend to be unavailable at times when it may be most insightful.

What types of phenotypes may be acquired using smartphone-based digital phenotyping? In short, behavioral patterns, sleep, social interactions, physical mobility, gross motor activity, cognitive functioning, and speech production, among others. Digital phenotyping is also compatible with the RDoC research framework for studying mental disorders. As defined by the NIMH, the framework consists of a matrix, where the rows represent specific dimension of function (Domains and Constructs) and the columns represent areas for study (Units of Analysis). The five domains of the RDoC matrix are negative valence systems (responsible for responses to averse situations), positive valence systems (responsible to positive situations), cognitive systems (responsible for cognitive processes), systems for social processes (mediating responses to interpersonal settings), and arousal / regulatory systems (responsible for generating activation of neural systems). The daily use of smartphones generates a byproduct of rich social and behavioral data, and when complemented with surveys and audio diary entries, these data can address several of the RDoC domains and several units of analysis (self-report, behavior, and physiology).

What is the difference between mobile health (mHealth) and digital phenotyping? Mobile health (also, mHealth or m-health) is a broad category and can be defined in different ways, but it usually refers to the “delivery of healthcare services via mobile communication devices.” Digital phenotyping, by definition, refers to the collection and analysis of moment-by-moment individual-level human phenotype data in situ, in the wild, using data from personal digital devices, in particular smartphones. The main goal of digital phenotyping is to advance evidence-based research in the biomedical sciences, and as such it can be seen as part of deep phenotyping, which supports other approaches to phenotyping and natural complements genotyping and genome sequencing.

How much does digital phenotyping cost? The digital phenotyping approach is incredibly cost effective and scalable. In the near future, we anticipate that one will be able to carry out digital phenotyping for as little as $1 per subject-year, which translates to about $75 per lifetime cost. It is important to point out that these are very early days for digital phenotyping in terms of technology, data, data analysis, and science. Phenotyping is often contrasted with genotyping. The first sequencing of the whole human genome cost roughly $2.7 billion in 2003, whereas in 2017, research-grade whole genome sequencing costs around $750. These numbers are certainly approximate, but they suggest that the life-time cost of smartphone-based digital phenotyping would be about 10% of the cost of sequencing.

What is the goal of the lab’s research in this area? To standardize data collection and to develop statistical methods for smartphone-based digital phenotyping in research and clinical use. We have developed the Beiwe platform for investigators in the biomedical sciences who are interested in using research-grade passive phone sensor data (e.g., GPS and accelerometer) and phone usage data (e.g., communication logs and screen activity) in smartphone-based digital phenotyping. Our overall philosophy is to do as much as possible using passively collected data, because this is the only way to run long-term studies without significant problems with subject adherence. A large majority of studies interested in using smartphones as a data collection or intervention tool do not fit this definition. This is loosely analogous with the contrast between genotyping and full genome sequencing. In a typical study, the Beiwe front-end generates about 1 billion data points–a volume of data that most studies do not need. We believe that the main intellectual challenge in digital phenotyping is now beginning to move from data collection to data analysis, and development of statistical methods for making sense of the collected data is currently our top research priority.

Why does Beiwe collect raw data? In short, research requires research-grade raw data. Software development kits for Android (ResearchStack, etc.) and Apple iOS (ResearchKit, HealthKit, CareKit, etc.) collect processed data summaries rather than raw sensor and phone usage data. This introduces an opaque layer between the data generating process and data analysis, making it difficult to compare data across devices or pool data across studies as the data summaries are likely different. The use of pre-defined data summaries results in a loss of information, narrowing down potential use cases of data to those conceived at the time of data collection (e.g., number of steps taken), and as such diminishes the value of data biobanking. Collection and storage of raw data makes it possible to compute any summaries of interest at a future data, thus enhancing study replicability and facilitating re-analyses of data. Downsides to collecting raw data are the large volume of data and the difficulty of keeping phone sensors awake, but both of these challenges are manageable. Raw data lets investigators ask and answer questions they care about and makes data collection and data analysis transparent.

What is the distinction between smartphone application (app) and a research platform? A smartphone app simply a software application that runs on a smartphone. It is important to note that the Beiwe app is just one of the three components of the Beiwe platform. The other components are the Beiwe back-end and the Beiwe data analysis pipeline. The Beiwe back-end makes use of Amazon Web Services (AWS) cloud computing infrastructure and is used to manage studies (e.g., study creation, addition of users, regeneration of passwords) and collect data. For the latter, it uses AWS Elastic Beanstalk, which automatically handles the details of capacity provisioning and load balancing, making it essentially infinitely scalable. The data analysis pipeline performs data pre-processing, checks data quality, transforms data, carries out imputation, and computes summary statistics of interest. The input to the pipeline is raw data collected by Beiwe, and the output is a p x T matrix, one per subject, where the p rows correspond to different daily summary statistics (e.g., total distance traveled obtained from GPS data and total call duration obtained from communication logs) and the T columns correspond to days. In supervised learning, the goal is to finding associations between passively collected data and any other type of data (e.g., surveys or clinical data), and in this setting the obtained matrices can be fed into different longitudinal statistical models, such as generalized estimating equations (GEE) or generalized linear mixed models (GLMM), depending on the goals of the analysis. In unsupervised learning, the goal might be to find anomalies in behavioral data or to perform clustering using a range of possible methods.

Is the Beiwe app available on Android and iOS? Our lab has developed both Android and iOS versions of the Beiwe app that connect seamlessly to the same research platform, enabling researchers to recruit individuals with phones using either operating system.

Is the Beiwe app a native app or a web app? What is the difference? Because Beiwe relies on sensor data, both Android and iOS versions of the Beiwe app are native applications rather than web applications. A web app requires only a browser and an internet connection (either a cell signal or Wi-Fi), is easy and inexpensive to develop and maintain, but cannot typically access phone sensor data. A native app works independently of the web and can access phone sensor data, but is generally much more difficult and expensive to develop and maintain. Further, because they collect raw phone sensor and usage data, they cannot rely on software development kits but instead rely on a codebase that has been developed from scratch for this specific purpose.

Who has developed Beiwe? The Beiwe research platform has been developed by the Onnela Lab at the Harvard T.H. Chan School of Public Health by funding from the National Institutes of Health (NIH). Specifically, the large majority of the development work has been enabled by a 2013 NIH Director’s New Innovator Award to Dr. Onnela. The lab has worked with two different software development groups to create the front-end smartphone applications for Android and iOS devices, as well as the development of the back-end data collection system. The lab has developed the data analysis methods and data analysis pipeline internally.

Why can various different studies use the same platform? This is because each study within Beiwe is independent of any other study, and each study has its own subjects, its own study coordinators and investigators, and each contains its own active and passive data collection features and sampling schedules. Within each study, the subjects are generated their Beiwe user IDs (e.g., yixg8437) and temporary passwords. Once subjects have downloaded the app and entered their user ID and password, the system automatically connects them with the right study, which among other things means that the subjects receive the surveys configured for that study and passive data is collected according to the specifications of the study. This includes what data streams are collected, how they are sampled, how frequently uploaded, and whether Wi-Fi or cellular network is used for upload. The flow of information in the studies making use of Beiwe is from the user to the system, which is what is to be expected from a phenotyping platform. It is possible to make use of the Beiwe back-end to design a sister component of the Beiwe app for delivering interventions. This is however likely to be study specific, and falls outside of our main research area. In contrast, phone sensors are what they are, and therefore the most one can do is to collect as much of the available data as possible and try to make the most sense of the collected data.

How about reproducibility and replicability of Beiwe studies? Only 6% of biomedical studies have been found to be completely reproducible (Prinz et al, 2011). From this point of view, we do not need more studies but rather we need more studies that are reproducible. To achieve reproducibility, it is key to focus on both data collection and data analysis. With the Beiwe platform, we attempt to address both of these stages. We started by building a platform that collects research-grade data. The old adage about data analysis captures the sentiment perfectly: garbage in, garbage out. Therefore, our first step was to improve the quality of measurements. Many researchers have advocated the role of better measurement in studies that involve any type of quantification of human behavior, a point that has been made repeatedly and vigorously by Andrew Gellman among others. Beiwe captures all study settings in human readable JSON formatted configuration files, and the platform enables an investigator to export and import these files with a single click. Therefore, an investigator wishing to replicate a previous Beiwe study only needs this one file to collect identical data in an identical manner. Data analysis can be replicated by studying the scripts that are used to analyze the output matrices of the Beiwe platform.

Can I use Beiwe in my own studies? This is a very frequently asked question. In short, we very much hope so. Beiwe is currently being used in numerous studies including at all Harvard Medical School teaching hospitals. Investigators worldwide have two different ways of using Beiwe in their studies.

  1. Beiwe Service Center. For more information, see Beiwe Service Center
  2. Beiwe Open Source. The Beiwe research platform source code is available to investigators worldwide for free on GitHub under the permissive 3-clause BSD open source license. Under this model, individuals or institutions interested in using Beiwe will set up their own AWS account and then deploy Beiwe using one of two different ways (single serve deployment vs. server cluster deployment). The Beiwe apps, named Beiwe2 for open source users, are available for free on Apple’s App Store and Google’s Play Store. In this model, the investigators using the open source version would naturally be responsible for all expenses.