January 17, 2022

File Characterization Tools Tested: Project Report From National Library of Australia (Digital Preservation)

From an Open Planets Foundation Blog Post by Matthew Hutchins

The National Library of Australia has just completed a small project to investigate and test a number of software tools of interest to digital preservation activities. The result of this project was an internal report describing the tests and the results, and giving some recommendations about the potential for using these tools in a planned replacement of the Library’s infrastructure for managing digital content.


The tools tested for file format identification were:

File Investigator Engine
Outside-In File ID
Unix file / libmagic

The tools tested for metadata extraction were:

File Investigator Engine
pdfinfo from the Xpdf toolkit
Apache Tika

Read the Complete Blog Post

Access the Full Text Report (57 pages; PDF)

About Gary Price

Gary Price (gprice@mediasourceinc.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at Ask.com, and is currently a contributing editor at Search Engine Land.