"The First Decade of Web Archiving at the Library of Congress"
Eleven years ago, the Library of Congress established a pilot web archiving project to study methods to evaluate, select, collect, catalog, provide access to and preserve at-risk born digital content for future generations. We could write a book (or at least a few blog posts!) about lessons learned since then, yet we continue to face a variety of challenges.
[Clip]
We’ve collected over 240 terabytes of content, in almost 40 event and thematic collections. Our strengths are in government, public policy and law: we archive U.S. national elections, house and senate and committee sites, changes in the Supreme Court and legal blawgs.
e also build web archives with our special collection divisions – the Manuscript, Prints and Photographs and Music divisions are archiving sites related to their physical holdings. In recent years Library staff in overseas offices in Egypt, Brazil, Indonesia, India and Pakistan captured born digital content documenting elections and other events.
Read the Full Text of Abbie Grotke’s Post
More posts in this series will be available soon.
Direct to Library of Congress Web Archives
Filed under: Archives and Special Collections, Libraries, Resources

About Gary Price
Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.