SUBSCRIBE
SUBSCRIBE
EXPLORE +
  • About infoDOCKET
  • Academic Libraries on LJ
  • Research on LJ
  • News on LJ
  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Libraries
    • Academic Libraries
    • Government Libraries
    • National Libraries
    • Public Libraries
  • Companies (Publishers/Vendors)
    • EBSCO
    • Elsevier
    • Ex Libris
    • Frontiers
    • Gale
    • PLOS
    • Scholastic
  • New Resources
    • Dashboards
    • Data Files
    • Digital Collections
    • Digital Preservation
    • Interactive Tools
    • Maps
    • Other
    • Podcasts
    • Productivity
  • New Research
    • Conference Presentations
    • Journal Articles
    • Lecture
    • New Issue
    • Reports
  • Topics
    • Archives & Special Collections
    • Associations & Organizations
    • Awards
    • Funding
    • Interviews
    • Jobs
    • Management & Leadership
    • News
    • Patrons & Users
    • Preservation
    • Profiles
    • Publishing
    • Roundup
    • Scholarly Communications
      • Open Access

October 24, 2016 by Gary Price

New Beta Release Allows Users to Keyword Search Some Material Found in The Wayback Machine

October 24, 2016 by Gary Price

UPDATE October 26, 2016 It has been quite a week (so far) from the Internet Archive for new and improved search options.
On Monday, The Wayback Machine launched a keyword search beta. Our post about this new option is below this updated. We also shared news of new and enhanced search options from The Open Book Project, an Internet Archive initiative.
Today, news of MORE new search capabilities now available when searching the Internet Archive including using facets to focus search results and full text search (beta) of books. Details here.
—
Some VERY exciting news! Something we’ve all wanted for a long time.
The Internet Archive has just launched (beta release) the ability to keyword search a limited amount of material found in The Wayback Machine.
At this point you CANNOT keyword search specific words/phrases on specific pages.
This topic is discussed in a set of FAQs available here. We hope complete keyword search comes soon. Today’s launch is a terrific start to making Wayback even more useful.
So, what is available today? What can you search?
1. Keyword search a limited amount of Wayback Machine content, the homepages of more than 350 million sites. Fyi, the complete Wayback Machine contains over 510 billion pages.
2. Keyword search using word(s) that describe a site. For example, “Toronto Government” or websites related to “air traffic control”
3. You can limit your search to a specific domain or site by utilizing, site:. This filter/syntax can be combined with keywords. e.g. pages from MIT pages related to economics.
4. Results appear as you type. Impressive and no hiccups/stutter when I ran several searches. Impressive!
2016-10-24_16-33-01
4. Clicking on any result will take you to a traditional Wayback Machine results page with links to archived copies of the page, PDF. etc.
4. The beta index is multilingual.
5. Cool! You can search the index using unicode characters!
Much more in the complete blog post, FAQs, and this discussion about how the Internet Archive defines web pages, web sites, and web captures.
Resources
Direct to Wayback Machine’s NEW Keyword Search Beta: web-beta.archive.org
Direct to Wayback Machine (Complete Index, Not Keyword Searchable) web.archive.org
Note: Archived Pages Made Available in Archive-It Public Collections Have Always Been Keyword Searchable
Note: How to Instantly Archive Web Pages and PDFs Using Wayback

Filed under: News, Patrons and Users

SHARE:

About Gary Price

Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com. Gary is also the co-founder of infoDJ an innovation research consultancy supporting corporate product and business model teams with just-in-time fact and insight finding.

ADVERTISEMENT

Archives

Job Zone

ADVERTISEMENT

Related Infodocket Posts

Smithsonian’s Tsione Wolde-Michael Tapped to Lead President’s Committee on the Arts and the Humanities

From the Institute of Museum and Library Services (IMLS): Tsione Wolde-Michael, most recently the founding Director of the Smithsonian’s Center for Restorative History, has been named Executive Director of the ...

New Video: How Oxford English Dictionary (OED) Editors Find New Words

The video embedded below was recently shared by Oxford Languages/Oxford University Press. Description Eleanor Maier, OED Executive Editor, provided an overview of how new words are added to the Oxford ...

Senate Votes to Ban TikTok Use on Government Devices; Census Bureau Tables a Controversial Privacy Tool for Survey;...

Amazon to End Print Textbook Rentals, Overhaul Magazine and Newspaper Subscriptions (via PW) Census Bureau Tables a Controversial Privacy Tool for Survey (via Associated Press) FOLIO Launches Nolana Release Library ...

Library of Congress: 25 Eclectic Films Chosen for National Film Registry; 2022 Entries Include Iron Man, Hairspray, When...

From LC: Librarian of Congress Carla Hayden announced today the annual selection of 25 influential motion pictures to be inducted into the National Film Registry of the Library of Congress. ...

DPLA’s Digital Equity Project Update; First-of-its-Kind Report on the U.S. Literary Arts Field; and More News Headlines

DPLA’s Digital Equity Project: An Update from Charlotte Mecklenburg Library’s Living Archives project IIIF Online Conference Recordings Now Available First-of-its-Kind Report on the U.S. Literary Arts Field (via Academy of ...

Preprint: "Phase 1 of the NIH Preprint Pilot: Testing the Viability of Making Preprints Discoverable in PubMed Central...

The research article (preprint) linked below was recently shared on bioRxiv. Title Phase 1 of the NIH Preprint Pilot: Testing the Viability of Making Preprints Discoverable in PubMed Central and ...

Research Tools: OpenCorporates Unifies Official Company Data From All 50 US States

Ed. Note: Wonderful news from a favorite and frequently used open web resource. From an OpenCorporates Blog Post: OpenCorporates, the world’s definitive source for company data, has made transparent company data from ...

Public Library of Science (PLOS) Releases First Open Science Indicators Dataset

DoFrom the PLOS Blog: Open Science is on the rise. We can infer as much from the proliferation of Open Access publishing options; the steady upward trend in bioRxiv postings; ...

Gabriel Morley Declines Job Offer to Lead Indianapolis Public Library as CEO; “Out Of Control”: Dozens of Telehealth...

Bookforum is Closing, Leaving Ever Fewer Publications Devoted to Books (via NY Times) Cambridge University Press Makes Raft of Physics Titles Available as Open Access Books (via The Bookseller)  Digital ...

Journal Article: "Designing the Diversity of Canadian Libraries: Excerpts from the CARL Inclusion Perspectives Webinar by Racialized Library...

The article linked below was recently published by Partnership: The Canadian Journal of Library and Information Practice and Research. Title Designing the Diversity of Canadian Libraries: Excerpts from the CARL ...

Milestones: Medline Reaches 30 Million Citations

From the NLM (National Library of Medicine) Technical Bulletin: On December 7, 2022, MEDLINE attained a major milestone when the 30 millionth journal citation was added to the database. This count includes ...

COAR and ASAPbio Announce the Publication of "Ten Recommended Practices for Managing Preprints in Generalist and Institutional Repositories"

From a COAR: Confederation of Open Access Repositories Announcement: Today, COAR and ASAPbio are pleased to announce the publication of “Ten Recommended Practices for Managing Preprints in Generalist and Institutional ...

ADVERTISEMENT

FOLLOW US ON TWITTER

Tweets by infoDOCKET

ADVERTISEMENT

This coverage is free for all visitors. Your support makes this possible.

This coverage is free for all visitors. Your support makes this possible.

Primary Sidebar

  • News
  • Reviews+
  • Technology
  • Programs+
  • Design
  • Leadership
  • People
  • COVID-19
  • Advocacy
  • Opinion
  • INFOdocket
  • Job Zone

Reviews+

  • Booklists
  • Prepub Alert
  • Book Pulse
  • Media
  • Readers' Advisory
  • Self-Published Books
  • Review Submissions
  • Review for LJ

Awards

  • Library of the Year
  • Librarian of the Year
  • Movers & Shakers 2022
  • Paralibrarian of the Year
  • Best Small Library
  • Marketer of the Year
  • All Awards Guidelines
  • Community Impact Prize

Resources

  • LJ Index/Star Libraries
  • Research
  • White Papers / Case Studies

Events & PD

  • Online Courses
  • In-Person Events
  • Virtual Events
  • Webcasts
  • About Us
  • Contact Us
  • Advertise
  • Subscribe
  • Media Inquiries
  • Newsletter Sign Up
  • Submit Features/News
  • Data Privacy
  • Terms of Use
  • Terms of Sale
  • FAQs
  • Careers at MSI


© 2022 Library Journal. All rights reserved.


© 2022 Library Journal. All rights reserved.