January 20, 2022

Web Preservation: UK Web Archive Improves Full Text Search Filters

From a UK Web Archive Blog Post:

Search results are now grouped by domain, making it easier to immediately see which websites contain references to the search term(s) and easily identifying the context in which the search term appears. For domain results we group URLs by date. This eliminates duplicate entries in results but still provides temporal access when there is more than one instance captured.

We have improved our content type filter, making it quicker and easier to filter by content type(s). Search results are now grouped by content type, separating ‘documents’ from ‘images’ and ‘multimedia’, in recognition of the fact that people will often be searching for a specific type of content. This is still in development and we know that it doesn’t always work perfectly – images can appear in the documents tab when they are served from a single html page, for example. We’re keen to hear from people about this feature, and whether they think it’s useful.

We’ve also started to roll out some social media integration. It’s now easy to share any of the resources in the search results, using the links provided under each one.

And finally, you can now use the Advanced Search tab to filter by archiving organisation. For example, if you’re only interested in sites archived by the Wellcome Library, you can specify this prior to running the search. Only sites selected by these institutions will then be included in your search results.
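
The domain grouping described in the post can be pictured with a small sketch. This is only an illustration under stated assumptions, not the UK Web Archive's actual implementation: the function name `group_captures`, the sample URLs, and the ISO date strings are all hypothetical, and the URL's hostname stands in for "domain".

```python
from collections import defaultdict
from urllib.parse import urlsplit


def group_captures(captures):
    """Group (url, capture_date) pairs by hostname, then by URL.

    Each URL is listed once per hostname, with its capture dates
    de-duplicated and sorted, so repeat captures do not produce
    duplicate result entries.
    """
    grouped = defaultdict(lambda: defaultdict(set))
    for url, capture_date in captures:
        # Assumption: the hostname is used as a stand-in for the domain.
        domain = urlsplit(url).hostname or ""
        grouped[domain][url].add(capture_date)
    return {
        domain: {url: sorted(dates) for url, dates in urls.items()}
        for domain, urls in grouped.items()
    }


# Hypothetical sample data: the same page captured more than once.
captures = [
    ("http://example.co.uk/about", "2013-01-05"),
    ("http://example.co.uk/about", "2014-03-11"),
    ("http://example.co.uk/about", "2013-01-05"),
    ("http://example.org.uk/news", "2012-07-30"),
]
results = group_captures(captures)
```

Here the page captured three times collapses to a single entry that still carries its two distinct capture dates, which mirrors the "temporal access" the post mentions.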

Direct to UK Web Archive Search Interface

Direct to UK Web Archive (Visualization Search Tools)
Generate N-grams of phrases and terms.

Learn More About the UK Web Archive

Read the Complete Blog Post With Screen Caps

About Gary Price

Gary Price (gprice@mediasourceinc.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington, D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006 to 2009 he was Director of Online Information Services at Ask.com, and he is currently a contributing editor at Search Engine Land.