January 21, 2022

Journal Article: “From Digital Library to Open Datasets: Embracing a ‘Collections as Data’ Framework”

The following article was recently published by Information Technology and Libraries (ITAL).


From Digital Library to Open Datasets: Embracing a ‘Collections as Data’ Framework


Rachel Wittmann
Marriott Library, University of Utah 

Anna Neatrour
Marriott Library, University of Utah 

Rebekah Cummings
Marriott Library, University of Utah 

Jeremy Myntti
Marriott Library, University of Utah


Information Technology and Libraries, 38(4)
DOI: 10.6017/ital.v38i4.11101


This article discusses the burgeoning “collections as data” movement within the fields of digital libraries and digital humanities. Faculty at the University of Utah’s Marriott Library are developing a collections as data strategy by leveraging existing Digital Library and Digital Matters programs. By selecting various digital collections, small- and large-scale approaches to developing open datasets are explored. Five case studies chronicling this strategy are reviewed, along with testing the datasets using various digital humanities methods, such as text mining, topic modeling, and GIS (geographic information system).

Direct to Full Text Article
13 pages; PDF.

About Gary Price

Gary Price (gprice@mediasourceinc.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington, D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006 to 2009 he was Director of Online Information Services at Ask.com, and he is currently a contributing editor at Search Engine Land.