January 17, 2022

The April 2011 HathiTrust Update is Now Available Online

Direct to HathiTrust Update (April 2011)

  • New PageTurner
    Online E-Reading Tool Enhanced
  • Local Digitization Ingest
    “Michigan staff worked with Universidad Complutense de Madrid, Yale University, and the University of Illinois in April on ingest of locally-digitized volumes.  We expect to begin ingest of volumes from Madrid in May, as well as the full set of volumes from Yale (a sample was ingested in December).”
  • Harvard University
    “Ingest of an initial set of more than 50,000 volumes from Harvard University was completed in April.”
  • Collections
    “The Collections Committee continues to work on a series of recommendations regarding duplicate volumes in HathiTrust, coordinated print management, and responding to users requests to contribute volumes to the repository.  A draft discussion paper on duplicates will be shared with the Strategic Advisory Board in June for initial feedback.”
  • Metadata
    “The HathiTrust Metadata Management System team completed development of the core database system in April, as well as an API to export bibliographic data in XML format. Approximately 200,000 records have been loaded into the system for initial testing. The team is analyzing MARC records from current content-contributing partner institutions, received from the University of Michigan, looking for irregularities and performing a general survey of the record set. CDL staff continue to interview for a Principal Metadata Analyst. Details on the project are available at http://www.hathitrust.org/htmms”
  • Full-text Search
    “Michigan staff began development work on priority features for full-text search as identified in the Full-Text Search Working Group’s report. The implementation team is focusing initially on relevance ranking of search results based on a combination of full-text OCR and bibliographic metadata, and on faceting of results using bibliographic metadata. The goal is to release significant new features that use the bibliographic data to enhance full-text search results by July 1, 2011.”

Number of Volumes Added

April Total
Columbia University 3 58,483
Cornell University 40,729 311,110
Harvard University 52,709 52,709
Indiana University 893 183,881
Library of Congress 0 71,418
New York Public Library 0 258,691
Penn State University 18 39,016
Princeton University 8,810 237,034
University of California 41,512 2,408,727
The University of Chicago 0 5,172
University of Illinois 0 14,501
University of Madrid 15,486 103,797
University of Michigan 19,974 4,338,368
University of Minnesota 1,419 84,985
University of Wisconsin 10,602 454,332
Yale University Library 0 161
Total 192,155 8,662,385

Public Domain (~27%)

Total* 181,909 2,386,430

* This count includes volumes already in the repository to which rights holders have newly opened access

Direct to Full Text of HathiTrust Update (April 2011)

About Gary Price

Gary Price (gprice@mediasourceinc.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at Ask.com, and is currently a contributing editor at Search Engine Land.