May 17, 2022

Research Preprint: “Ginger Cannot Cure Cancer: Battling Fake Health News with a Comprehensive Data Repository”

The following preprint was recently shared on arXiv.


“Ginger Cannot Cure Cancer: Battling Fake Health News with a Comprehensive Data Repository”


Enyan Dai
Pennsylvania State University

Yiwei Sun
Pennsylvania State University

Suhang Wang
Pennsylvania State University


via arXiv


Nowadays, Internet is a primary source of attaining health information. Massive fake health news which is spreading over the Internet, has become a severe threat to public health. Numerous studies and research works have been done in fake news detection domain, however, few of them are designed to cope with the challenges in health news. For instance, the development of explainable is required for fake health news detection. To mitigate these problems, we construct a comprehensive repository, FakeHealth, which includes news contents with rich features, news reviews with detailed explanations, social engagements and a user-user social network. Moreover, exploratory analyses are conducted to understand the characteristics of the datasets, analyze useful patterns and validate the quality of the datasets for health fake news detection. We also discuss the novel and potential future research directions for the health fake news detection.

Direct to Full Text Paper (Preprint)
11 pages; PDF.

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.