Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys
Asthma is one of the most common chronic diseases that has a profound impact on people’s well-being and our society. In this study, we link multiple large-scale data sources to construct an epidemiological model to predict asthma prevalence across geographic regions. We use: (1) the Social Media Monitoring (SMM) data from Twitter (N = 500 million tweets/day), (2) the 2014 Behavioral Risk Factor Surveillance System (BRFSS) (N = 464,664), and (3) the 2014 American Community Survey (ACS) conducted by the U.S. Census Bureau (N = 3.5 million per year). We predict asthma prevalence in the traditional survey (BRFSS) using social media information collected from Twitter and socioeconomic factors collected from ACS. The evidence suggests that monitoring asthma-related tweets may provide real-time information that can be used to predict outcomes from traditional surveys
Annals of the AAPSS
asthma, social media monitoring, SMM, ACS, BRFSS, data linkage
Dai, H., Lee, B. R., Hao, J. Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys Annals of the AAPSS 669, 75-92 (2017).