May 17, 2022

Report: “NLM Leverages Data, Text Mining to Sharpen COVID-19 Research Databases”

From GovernmentCIO:

“As of May 1, about 46,000 articles had been deposited by publishers to PMC [PubMed Central] or updated in PMC to have a license that allows for text and data-mining, of which more than 5,600 articles specifically focus on the current novel coronavirus,” said NLM National Center for Biotechnology Information Acting Director Stephen Sherry. “Some 49 publishers are now included in the PMC COVID-19 initiative.”

Within the first few weeks since launching the project, PMC saw significant COVID-19 download and data-sharing rates, said PMC Program Manager Kathryn Funk in an NIH webinar. As part of the project, Funk’s team worked to standardize submission data in a machine-readable format.

Read the Complete Article

See Also: Preprints in PubMed Central
Video recording forthcoming. Recorded on May 8, 2020.

See Also: Video Recording and Materials: “Webinar on Sharing, Discovering, and Citing COVID-19 Data and Code in Generalist Repositories”
Recorded on April 24, 2020.

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.