Eleven years ago, the Library of Congress established a pilot web archiving project to study methods to evaluate, select, collect, catalog, provide access to and preserve at-risk born digital content for future generations. We could write a book (or at least a few blog posts!) about lessons learned since then, yet we continue to face a variety of challenges.
We’ve collected over 240 terabytes of content, in almost 40 event and thematic collections. Our strengths are in government, public policy and law: we archive U.S. national elections, house and senate and committee sites, changes in the Supreme Court and legal blawgs.
e also build web archives with our special collection divisions – the Manuscript, Prints and Photographs and Music divisions are archiving sites related to their physical holdings. In recent years Library staff in overseas offices in Egypt, Brazil, Indonesia, India and Pakistan captured born digital content documenting elections and other events.
Read the Full Text of Abbie Grotke’s Post
More posts in this series will be available soon.
Direct to Library of Congress Web Archives