Visualizing Health Tweets over Regions and Timestamps

  • Bonpagna Kann Université Grenoble Alpes
  • Sihem Amer-Yahia Université Grenoble Alpes
  • Michael Ortega Université Grenoble Alpes
  • Jean-Louis Pépin Université Grenoble Alpes
  • Sébastien Bailly Université Grenoble Alpes
Keywords: Social Media, Topic Models, Visualization, Health Data, Data Analysis


Social media has become one of the major data sources for social studies through users’ expressions, such as significant moments in their daily life or their feelings and perceptions toward specific discussion topics. In health care, social media is thoroughly used to study people’s discourse on ailments and derive insights into the impact of ailments on patients’ quality of life. Recently, there has been an increasing interest in applying machine learning algorithms to enhance the prediction of ailments through users’ social media data. In this study, nearly 800 million posts were retrieved from Twitter through preprocessing and running the time-aware ailment topic aspect model (T-ATAM) to predict diseases, symptoms, and remedies for two chronic conditions, namely sleep apnea and chronic liver diseases. The study was conducted on English tweets emitted during 2018, most of which were from European countries and the United States. The data were processed using T-ATAM by regions, timestamps, and treatment, namely continuous positive airway pressure (CPAP), to see the differences in the distributions of top diseases along with the top symptoms and remedies in different regions; timestamps; as well as before, during, and after CPAP was introduced. Based on approximately 331,000 tweets related to liver diseases and 1 million tweets on sleep apnea, various visualizations of statistics are displayed, including world maps, word clouds, and histograms. Results of this study indicate that depression and drinking are the leading symptoms of liver diseases; meanwhile, lack of nighttime sleep and overworking are considered the main factors of sleep apnea.


S. Dorle and N. Pise, “Political Sentiment Analysis through Social Media,” 2018 Second Int. Confer. Comput. Methodol., Commun. (ICCMC), 2018, pp. 869–873.

C. Udanor, S. Aneke, and B.O. Ogbuokiri, “Determining Social Media Impact on The Politics of Developing Countries Using Social Network Analytics,” Program, Vol. 50, No. 4, pp. 481–507, Sep. 2016.

G. Wolfsfeld, E. Segev, and T. Sheafer, “Social Media and the Arab Spring,” Int. J. Press/Politics, Vol. 18, No. 2, pp. 115–137, Jan. 2013.

N. Hoog and P. Verboon, “Is the News Making Us Unhappy? The Influence of Daily News Exposure on Emotional States,” Brit. J. Psychol., Vol. 111, No. 2, pp. 157–173, May 2020.

S. Petrović, M. Osborne, and V. Lavrenko, “Streaming First Story Detection with Application to Twitter: Human Language Technologies,” 2010 Annu. Confer. North Amer. Chapter, Assoc. Comput. Linguist., 2020, pp. 181–189.

T. Sakaki, M. Okazaki, and Y. Matsuo, “Earthquake Shakes Twitter Users,” Proc. 19th Int. Confer. World Wide Web (WWW ’10), 2010, pp. 851–860.

S.K.T. Bhavaraju, C. Beyney, and C. Nicholson, “Quantitative Analysis of Social Media Sensitivity to Natural Disasters,” Int. J. Disaster Risk Reduct., Vol. 39, pp. 1–10, Oct. 2019.

S. Sidana, et al., “Health Monitoring on Social Media over Time,” Proc. 39th Int. ACM SIGIR Confer. Res., Develop. Inf. Retr., 2016, pp. 1–6.

S. Shrivastava and P.S. Nair, “Mood Prediction on Tweets Using Classification Algorithm,” Int. J. Sci., Res. (IJSR), Vol. 4, No. 11, pp. 295–299, Nov. 2015.

N.S. Cook, et al., “Patients’ Perspectives on COPD: Findings from a Social Media Listening Study,” ERJ Open Res., Vol. 5, No. 1, pp. 1–10, Feb. 2019.

S. Gohil, S. Vuik, and A. Darzi, “Sentiment Analysis of Health Care Tweets: Review of the Methods Used,” JMIR Public Health, Surveill., Vol. 4, No. 2, pp. 1–10, Apr. 2018.

M.P. Hamm, et al., “Social Media Use Among Patients and Caregivers: A Scoping Review,” BMJ Open, Vol. 3, No. 5, pp. 1–9, May 2013.

D. Murthy and M. Eldredge, “Who Tweets About Cancer? An Analysis of Cancer-Related Tweets in the USA,” Digit. Health, Vol. 2, pp. 1–16, Jul. 2016.

Y.T. Yang, M. Horneffer, and N. DiLisio, “Mining Social Media and Web Searches for Disease Detection,” J. Pub. Health Res., Vol. 2, No. 1, pp. 17–21, May 2013.

V. Lampos and N. Cristianini, “Tracking the Flu Pandemic by Monitoring the Social Web,” 2010 2nd Int. Workshop Cogn. Inf. Process., 2010, pp. 411–416.

E.M. Glowacki, A.J. Lazard, and G.B. Wilcox, “E-Cigarette Topics Shared by Medical Professionals: A Comparison of Tweets from the United States and United Kingdom,” Cyberpsychol., Behav., Soc. Netw., Vol. 20, No. 2, pp. 133–137, Feb. 2017.

A. Culotta, “Towards Detecting Influenza Epidemics by Analyzing Twitter Messages,” Proc. First Workshop Soc. Media Anal. (SOMA ’10), 2010, pp. 115–122.

P.M. Polgreen, Y. Chen, D.M. Pennock, and F.D. Nelson, “Using Internet Searches for Influenza Surveillance,” Clin. Infect. Dis., Vol. 47, No. 11, pp. 1443–1448, Dec. 2008.

E.A. Tsochatzis, J. Bosch, and A.K. Burroughs, “Liver Cirrhosis,” Lancet, Vol. 383, No. 9930, pp. 1749–1761, May 2014.

A. Schlarb, A. Friedrich, and M. Claßen, “Sleep Problems in University Students – An Intervention,” Neuropsychiatr. Dis. Treat., Vol. 13, pp. 1989–2001, Jul. 2017.

How to Cite
Bonpagna Kann, Sihem Amer-Yahia, Michael Ortega, Jean-Louis Pépin, & Sébastien Bailly. (2022). Visualizing Health Tweets over Regions and Timestamps. Jurnal Nasional Teknik Elektro Dan Teknologi Informasi, 11(4), 237-243.