May 17, 2022

Statistics: Data From a Massive Media Digitization Project at Indiana University

From a Media Digitization and Preservation Initiative at Indiana University Bloomington Blog Post:

Over the past four years, Indiana University’s Media Digitization and Preservation Initiative has digitized more than 315,000 audio and video recordings. In the meantime, we’ve heard from many other folks who are planning digitization projects that predicting file size and recording length is difficult.  Of course, every collection is different, and nobody else will have exactly the same mix of holdings as we do at IU. Still, the general statistics we’ve gathered from the 315,000+ items we’ve reformatted to date may help provide others with at least some rough guidance for project planning.

The winner in the duration contest for video formats is VHS with an average running time of just about 100 minutes. This is not a surprise given that VHS is a fully consumer format. Consumers place higher value on the length of time available for recording than other characteristics, and so the format was developed with an emphasis on maximizing this variable. That’s why the EP (extended play) version of the format—that triples the possible recording time while reducing quality—was introduced. Over 3,000 of our VHS tapes are EP. One practical implication of this fact is that we must acquire and integrate consumer-level playback machines into our digitization signal chain, since professional decks do not support EP.

Learn More, Read the Complete Blog Post (approx. 1160 words)

Direct to MDPI Website

About Gary Price

Gary Price ( is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.