May 22, 2022

Infotrieve Successfully Completes Cloud-Based Text Mining Pilot Using Journal Article Tag Suite (JATS)

From Infotrieve:

[Infotrieve] today announced that it has completed a pilot project that included the successful implementation of a beta text mining solution leveraging the capabilities of the Mobile Library platform.

Infotrieve partnered with long-standing client UCB, a global biopharmaceutical leader, to develop a cloud-based text mining platform that would allow UCB staff to mine full-text articles from multiple publishers using a variety of popular text mining tools in a standardized, copyright-compliant and end-user-friendly manner.


In completing the pilot, Infotrieve collaborated with six major life science publishers, collecting hundreds of full-text articles from each in Extensible Markup Language (XML) format.

Infotrieve tagged each article using Journal Article Tag Suite (JATS), thereby creating a standardized and homogenized corpus for text mining, and loaded the resultant articles into its cloud-based Mobile Library platform. The Mobile Library’s flexible search capabilities, content management tools and powerful rights management engine gave pilot participants the ability to search and create a research set, and then further refine the results to clearly identify articles currently under subscription versus those requiring purchase, as well as those available for text mining.
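To illustrate why a common tag set helps text mining, here is a minimal sketch of pulling the title, abstract and body paragraphs out of a JATS-style article with Python's standard library. It is not Infotrieve's implementation: the sample article and the extract_text helper are hypothetical, and only a handful of common JATS elements (article-title, abstract, body, sec, p) are assumed.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily trimmed JATS-style article for illustration only;
# real publisher files carry far more metadata and markup.
SAMPLE = """<article>
  <front>
    <article-meta>
      <title-group>
        <article-title>Example Title</article-title>
      </title-group>
      <abstract><p>Short abstract.</p></abstract>
    </article-meta>
  </front>
  <body>
    <sec><title>Methods</title><p>First paragraph.</p></sec>
    <sec><title>Results</title><p>Second <italic>paragraph</italic>.</p></sec>
  </body>
</article>"""


def extract_text(jats_xml: str) -> dict:
    """Return title, abstract and body paragraphs as plain text (hypothetical helper)."""
    root = ET.fromstring(jats_xml)

    # itertext() flattens inline markup such as <italic> into plain text.
    title_el = root.find("./front/article-meta/title-group/article-title")
    title = "".join(title_el.itertext()).strip() if title_el is not None else ""

    abstract = " ".join(
        "".join(p.itertext()).strip()
        for p in root.findall("./front/article-meta/abstract//p")
    )
    body = ["".join(p.itertext()).strip() for p in root.findall("./body//p")]
    return {"title": title, "abstract": abstract, "body": body}


record = extract_text(SAMPLE)
print(record["title"])
print(len(record["body"]), "body paragraphs")
```

Because every article in the corpus shares the same element names, one extraction routine of this kind can serve content from all participating publishers before it is handed to a text mining tool.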

Read the Complete Announcement

About Gary Price

Gary Price is a librarian, writer, consultant, and frequent conference speaker based in the Washington, D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006 to 2009 he was Director of Online Information Services, and he is currently a contributing editor at Search Engine Land.