Note: The final LiWA project review took place in Luxembourg on March 11th, 2011.
Web content plays an increasingly important role in the knowledge-based society, and preservation and longterm accessibility of Web history have high value. The European funded project LiWAs looked beyond the pure “freezing” of Web content snapshots for a long time, by transforming pure snapshot storage into a “Living” Web Archive. In order to create Living Web Archives, the LiWA project addressed R&D challenges in the following three areas: Archive Fidelity, Archive coherence and Archive Interpretability.
From the “To Continue” Portion of the Article:
During the LiWA project many new approaches have been developed to address major issues in Web archiving and archive accessibility. LiWA can be seen as a starting point for a number of new activities in the field of Web archiving and Web preservation. The sheer size, complexity and dynamics of the Web make high quality Web archiving still an expensive and time-consuming challenge. Therefore new crawling strategies are necessary that focus on content completeness in term of opinions, topics or entities etc.
The new Integrated Project ARCOMEM (From Collect-All Archives to Community Memories) leverages the Social Web for content appraisal and selection. Beside preservation, a deeper understanding of the Internet content characteristics (size, distribution, form, structure, evolution, dynamic) is also necessary in many areas of today’s science.
The European funded project LAWA (Longitudinal Analytics of Web Archive data) builds an experimental testbed for large-scale data analytics. Its focus is on developing a sustainable infrastructure, scalable methods, and easily usable software tools for aggregating, querying, and analysing heterogeneous data at Internet scale. Particular emphasis is given to longitudinal data analysis along the time dimension for Web data that has been crawled over extended time periods.