May 20, 2022

New Article: “Mining Scientific Papers for Bibliometrics: a (very) Brief Survey of Methods and Tools”


Mining Scientific Papers for Bibliometrics: a (very) Brief Survey of Methods and Tools


Iana Atanassova
Centre Tesniere, University of Franche-Comte, France

Marc Bertin
Centre Interuniversitaire de Rercherche sur la Science et la Technologie (CIRST), Université du Québec à Montréal (UQAM), Canada

Philipp Mayr
GESIS, Leibniz Institute for the Social Sciences, Germany


via arXiv

From the Introduction:

The Open Access movement in scientific publishing and search engines like Google Scholar have made scientific articles more broadly accessible. During the last decade, the availability of scientific papers in full text has become more and more widespread thanks to the growing number of publications on online platforms such as ArXiv and CiteSeer. The efforts to provide articles in machine-readable formats and the rise of Open Access publishing have resulted in a number of standardized formats for scientific papers (such as NLM-JATS, TEI, DocBook).

Direct to Full Text Article (2 pages; PDF)

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.