The Effectiveness and Durability of Digital Preservation and Curation Systems (New Research Report From Ithaka S+R)
The report linked below was published today by Ithaka S+R.
Oya Y. Rieger Senior Strategist
Roger C. Schonfeld Vice President, Organizational Strategy and Libraries, Scholarly Communication, and Museums
Liam Sweeney Senior Analyst
19 July 2022
From the Executive Summary
Our cultural, historic, and scientific heritage is increasingly being produced and shared in digital forms, whether born-digital or reformatted from physical materials. There are fundamentally two different types of approaches being taken to preservation: One is programmatic preservation, a series of cross-institutional efforts to curate and preserve specific content types or collections usually based on the establishment of trusted repositories. Examples of providers in this category that provide programmatic preservation include CLOCKSS, Internet Archive, HathiTrust, and Portico. In addition, there are third-party preservation platforms, which are utilized by individual heritage organizations that undertake their own discrete efforts to provide curation, discovery, and long-term management of their institutional digital content and collections.
In August 2020, with funding from the Institute of Library and Museum Services (IMLS), Ithaka S+R launched an 18-month research project to examine and assess the sustainability of these third-party digital preservation systems. In addition to a broad examination of the landscape, we more closely studied eight systems: APTrust, Archivematica, Arkivum, Islandora, LIBNOVA, MetaArchive, Samvera and Preservica. Specifically, we assessed what works well and the challenges and risk factors these systems face in their ability to continue to successfully serve their mission and the needs of the market. In scoping this project and selecting these organizations, we intentionally included a combination of profit-seeking and not-for-profit initiatives, focusing on third-party preservation platforms rather than programmatic preservation.
Because so many heritage organizations pursue th
e preservation imperative for their collections with increasingly limited resources, we examine not only the sustainability of the providers but also the decision-making processes of heritage organiz
ations and the challenges they face in working with the providers.
Our key findings include:
- The term “preservation” has become devalued nearly to the point of having lost its meaning. Providers are marketing their offerings as “preservation systems” regardless of actual functionality or storage configurations. Many systems marketed as preservation systems usually address only some aspects of preservation work, such as providing workflow systems (and user interfaces) to streamline the process of moving content into and out of a storage layer.
- Because no digital preservation system is truly turnkey, digital preservation cannot be fully outsourced. Digital preservation is a distributed and iterative activity that requires in-house expertise, adequate staffing, and access to different technologies and systems. While it is possible to outsource key components of the digital preservation process to a system provider, no digital preservation system is truly turnkey. Today, it is neither feasible nor desirable for a heritage organization to outsource responsibility for its digital preservation program.
- Heritage organizations select preservation systems within the context of marketplace competition. Many observers believe that heritage organizations should support not-for-profit solutions based on shared values and other common principles. But this has not always been the principal driver of organizational behavior. Providers compete within a marketplace that recognizes organizational values as one characteristic among many, such as the total cost of implementation and the feasibility of local implementation.
- The not-for-profit preservation platforms are at risk. They tend to have limited capital and have comparatively ponderous governance structures. As a result, many have not been able to innovate quickly enough to keep up with the needs of heritage organizations. Their business and governance models are often ill-suited to the demands of a competitive marketplace, even if growth is not their primary objective. It seems reasonable to forecast additional mergers or buy outs (if not outright failures) among this category of providers.
- The growing reliance on profit-seeking providers carries risks. The profit-seekers tend to pursue a growth strategy, and by this measure they are succeeding. Private capital and a decision to scale across multiple sectors has enabled this category of providers to grow substantially in the heritage sector. Because of a lack of financial transparency, the sustainability of this sector is largely unknowable, and because of a lack of technical transparency, the robustness of the solutions themselves are not widely understood. Competitive pricing and strong service seem likely nevertheless to continue driving growth.
- The diversity of approaches to ensure long-term access to digital content—while a strength—can challenge the imperative to maintain high standards. Given the accelerating rates and increasing complexity of digital information, the heritage community needs a rich array of services. But this array of services must not result in a view that “there are no right ways of doing preservation.” It does a disservice to the preservation imperative if heritage communities are unwilling to critique flawed offerings. Systems designed to play a role in preservation need to strike a balance between agility and inclusivity, taking into consideration the diverse needs of users and organizational resources.
- Very little digital preservation activity is actually taking place. While we did not embark on this project to quantify the level of preservation, there appear to be thousands of heritage organizations undertaking little to no digital preservation activity. While cross-institutional programmatic preservation activities were also out of scope for this project, we note with continuing concern that vast categories of important content types, such as journalism and social media, remain largely outside the scope of any heritage community preservation initiative. Ultimately, heritage organizations are severely underinvesting in digital preservation.
The study aims not only to further increase our understanding of sustainability principles but also to help the sector refine and consider how to best implement them. To this end, we are convening a series of forums to share the findings with members of the relevant digital preservation and curation systems alongside higher education community, funders, and policy makers to facilitate discussions. A series of blog posts, to be published in summer 2022, will incorporate the feedback gathered through the stakeholder convenings and share strategies for moving forward.
We are grateful for the participation of the leaders and clients of the eight digital preservation and curation systems examined in this study. In addition, we greatly benefited from the experiences of several preservation specialists and service providers and appreciate the community’s deep expertise, generosity in sharing insights, and commitment to advancing the field.
Direct to Full Text Report
Direct to PDF Version
50 pages; PDF.
About Gary Price
Gary Price (firstname.lastname@example.org) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com. Gary is also the co-founder of infoDJ an innovation research consultancy supporting corporate product and business model teams with just-in-time fact and insight finding.