May 27, 2022

A New Experimental Prototype Tool From the Allen Institute For AI: Paper to HTML Converter

From the AI2 Newsletter:

This week, a team of researchers and engineers led by Lucy Lu Wang released a prototype of their tool that converts scientific PDFs into HTML, making them readable by screen readers and much more easily visible on mobile devices. After learning that fewer than 3% of scientific papers meet minimum criteria for accessibility, AI2 is pursuing new and better ways to make scientific publishing accessible to the broadest possible audience.

From the Paper to HTML Converter Website:

This is an experimental prototype that aims to render scientific papers in HTML so they can be more easily read by screen readers or on mobile devices. Because of our reliance on statistical machine learning techniques, some errors are inevitable. We are continuing to improve upon our models.

Paper to HTML Converter works with the following formats:

  • PDF files (*.pdf)
  • LaTeX source (*.gz, *.tar)
  • JATS XML (*.nxml)

Direct to Paper to HTML Converter

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.