October 24, 2014

National Library of New Zealand Announces Details About 2013 .NZ Web Harvest

share save 171 16 National Library of New Zealand Announces Details About 2013 .NZ Web Harvest

From the National Library:

The National Library is planning its third New Zealand Web Harvest in February 2013.

[Clip]

The planned New Zealand Web Harvest 2013 recognises the importance of the internet in all areas of New Zealand society and culture by taking a ‘snapshot’ of the whole .nz domain as it exists on the web in February 2013. The Library has conducted two previous harvests, in October 2008 and April 2010.

The National Librarian is authorised to harvest websites by the National Library of New Zealand (Te Puna Mātauranga o Aotearoa) Act 2003 and the Minister’s National Library Requirement (Electronic Documents) Notice 2006.

The Library undertakes two streams of web archiving: selective harvesting and domain harvesting.

Selective archiving is where Library staff select high-value websites for inclusion in our collections. The Library has been selectively harvesting for several years.

Domain harvesting is an attempt to harvest as much material as is technically possible with a minimum of human intervention. It is called “domain harvesting” because the simplest approach is to try to harvest an internet domain, such as the NZ (or “.nz”) domain for New Zealand.

[Clip]

In 2008 we harvested 105 million URLs, comprising about 4.1 terabytes of uncompressed data.

In 2010 we harvested 130 million URLs, comprising about 8 terabytes of uncompressed data.

The data is stored at the National Library in Wellington.

The Internet Archive will conduct the harvest on behalf of the National Library.

Learn More About the 2013 Web Harvest

Learn More About Web Archiving at the National Library of New Zealand/Access the Archive

See Also: NZ Web Harvest Twitter Feed

share save 171 16 National Library of New Zealand Announces Details About 2013 .NZ Web Harvest
Gary Price About Gary Price

Gary Price (gprice@mediasourceinc.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at Ask.com, and is currently a contributing editor at Search Engine Land.