New From Cross Ref Labs: First Public Release of “pdf-extract”

April 18, 2012 by Gary Price

CrossRef Labs is happy to announce the first public release of “pdf-extract” an open source set of tools and libraries for extracting citation references (and, eventually, other semantic metadata) from PDFs. We first demonstrated this tool to CrossRef members at our annual meeting last year. See the pdf-extract labs page for a detailed introduction to this new set of tools.

The blog post adds that if you’re unable to download the software, Extracto, a web-based resource from CrossRef Labs is available to extract citations from PDF files. However, the blog post also says that Extracto is, “running on very feeble server using an erratic and slow internet connection.”
In their words:

The only guarantee that we can make about using it is that it will repeatedly fall over and annoy you. The weasel has spoken.

Filed under: Libraries, Publishing, Resources

CrossRef CrossRef Labs Info Organization and Cataloging Information Technology Open Source Software Web Tools

About Gary Price

Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.

New From Cross Ref Labs: First Public Release of “pdf-extract”

About Gary Price

Archives

FOLLOW US ON X

New From Cross Ref Labs: First Public Release of “pdf-extract”

About Gary Price

Archives

Related Infodocket Posts

FOLLOW US ON X