New Digital Collection: Library of Congress Introduces “General News on the Internet” Web Archive
A new digital collection, The General News on the Internet, is a free archive of online-only news sites collected from the web. The Library of Congress began preserving these sites in June 2014.
How are these news-based sites captured? The Library uses a hybrid approach of weekly captures of the websites, augmented with twice-daily capture of known RSS feeds (Real Simple Syndication). This produces a more complete news archive. Given the dynamic nature of the 24-hour news cycle of today, these archives are meant to capture as much of the news distribution as possible given current limitations in technology and resources.
For instance, many news sites are included in the Library’s September 11, 2001 Web Archive, the Public Policy Topics Web Archive, and the United States Elections Web Archive. The web archiving team also picks up content on pages through links from other sites.
You will see that we are not including major news sites and are only focusing on born-digital sites. Copyright restrictions play a major role and we also wanted to capture sites that could be at-risk of disappearing.
Direct to General News on the Internet Collection
About Gary Price
Gary Price (firstname.lastname@example.org) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com. Gary is also the co-founder of infoDJ an innovation research consultancy supporting corporate product and business model teams with just-in-time fact and insight finding.