Exploring Digital Health Trends in the Headlines via Knowledge Graph Analysis
We introduce a method for analyzing digital health transformation in our society by constructing a Knowledge Graph from a large corpus of 7.8 million English news articles from the Dow Jones Data, News, and Analytics platform, dating from 1987 through 2023. We first sampled around 97k articles relevant to the Digital Health topic by training and deploying a Deep Learning binary classifier by fine-tuning BERT. Successively, by deploying NLP techniques, we extracted triples from the identified articles to form a Digital Health News Knowledge Graph, which consists of 431k distinct triples connecting 186k entities through 1866 relations. This graph provides insights into the evolution of Digital Health in news media and serves as a resource for further research in the field. Our analysis reveals significant trends in Digital Health as reflected in the news, with notable peaks coinciding with key events like the COVID-19 pandemic. We split the analysis geographically for the United States (US) and European countries and tracked over time for each macro-region the predominant entities and relations. The classifier, the knowledge graph, and data analytics visualizations are made publicly available for future work.
ZAVARELLA Vanni;
CONSOLI Sergio;
REFORGIATO RECUPERO Diego;
FENU Gianni;
2025-03-05
SPRINGER VERLAG
JRC138049
1611-3349 (online),
https://doi.org/10.1007/978-3-031-82481-4_9,
https://publications.jrc.ec.europa.eu/repository/handle/JRC138049,
10.1007/978-3-031-82481-4_9 (online),
Additional supporting files
File name | Description | File type | |