SUBSCRIBE
SUBSCRIBE
EXPLORE +
  • About infoDOCKET
  • Academic Libraries on LJ
  • Research on LJ
  • News on LJ
  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Libraries
    • Academic Libraries
    • Government Libraries
    • National Libraries
    • Public Libraries
  • Companies (Publishers/Vendors)
    • EBSCO
    • Elsevier
    • Ex Libris
    • Frontiers
    • Gale
    • PLOS
    • Scholastic
  • New Resources
    • Dashboards
    • Data Files
    • Digital Collections
    • Digital Preservation
    • Interactive Tools
    • Maps
    • Other
    • Podcasts
    • Productivity
  • New Research
    • Conference Presentations
    • Journal Articles
    • Lecture
    • New Issue
    • Reports
  • Topics
    • Archives & Special Collections
    • Associations & Organizations
    • Awards
    • Funding
    • Interviews
    • Jobs
    • Management & Leadership
    • News
    • Patrons & Users
    • Preservation
    • Profiles
    • Publishing
    • Roundup
    • Scholarly Communications
      • Open Access

May 20, 2011 by Gary Price

So Long and Goodbye! Google Is Ending Newspaper Digitization Project

May 20, 2011 by Gary Price

Update (5/22): We’ve Added A Few Pieces of Info From the Letter Google Sent to Google Newspaper Digitization Partners. You’ll Find Them At the Bottom of this Entry.

Google’s goal to digitize all of the world’s newspapers is ending.

Matt McGee from SEL reports that after approximately 32 months after a formal launch announcement, Google is ending the newspaper digitization program.

Search Engine Land also received this statement from Google:

We work closely with newspaper partners on a number of initiatives, and as part of the Google News Archives digitization program we collaborated to make older newspapers accessible and searchable online. These have included publications like the London Advertiser in 1895, L’Ami du Lecteur at the turn of the century, and the Milwaukee Sentinel from 1910 to 1995.

Users can continue to search digitized newspapers at http://news.google.com/archivesearch, but we don’t plan to introduce any further features or functionality to the Google News Archives and we are no longer accepting new microfilm or digital files for processing.

According to McGee, about 2000 newspapers are currently listed in the Google newspaper directory.

The SEL article also points to this Boston Phoenix story that begins:

Google told partners in its News Archive project that it would cease accepting, scanning, and indexing microfilm and other archival material from newspapers, and was instead focusing its energies on “newer projects that help the industry, such as Google One Pass, a platform that enables publishers to sell content and subscriptions directly from their own sites.”

[Clip]

Some newspapers complained that Google, after quickly scanning their archives, was slow to process the scans. The Phoenix sent Google a stash of archives covering several decades; some fraction of those have made their way online.

[Clip]

It remains to be seen whether Google will complete the process of indexing the newspapers it has scanned*. We’d guess not. Are we mad at that? Ehhh, not really. The deal Google struck with partner newspapers stipulated that, somewhere down the line, a paper could purchase Google’s digital scans of its content for a fee. That fee is now being waived, and Google is not only giving publishers free access to the scanned files, but also the rights to publish them with other partners.

* For the searcher this means that not every issue of every paper Google lists in its directory is available.

Complete Search Engine Land Story

Complete Boston Phoenix Story

Comment: New leadership is in place at Google and new leadership can often bring changes. This is likely one of them. Although the newspaper digitization service is going to shutdown today’s news is an excellent reminder that Google is a money making venture and like any other business they make business decisions that we often love but at other might we might wish things were different. The company has to work with a large number of groups that can have varying interests. They include shareholders, business partners, searchers/users, and others.

1.  Google is a company like any other.  With all of the useful/good things they do they’re still a for-profit business. It can sometimes be easy to forget this important fact.

2.  Google is often and correctly referred to as an advertising or marketing company.  This doesn’t mean that they don’t and can’t do good and useful things, that wouldn’t work for anyone. However, as we said a moment ago all companies have to make decisions based on a variety of factors that often, in one way or another, involve money. Around 95% of Google’s revenue comes from advertising/marketing.

3. Specifically, why they ended the digitization program “newer projects that help the industry, such as ‘Google One Pass‘” is worth mentioning (of course there is likely more to it) but why the did it is really not the issue. What is the issue? That Google can end a program, service, or feature if for WHATEVER reason they want to unless it’s stipulated in the contract.

4.. Because Google makes business decisions things can change quickly because business can change quickly. So, it might be useful to be aware of a variety of possible providers of a service or resource or to put it another way be very careful not to put all of your eggs in one basket. Keeping this in mind is what’s important. Of course, if one company was the only possible choice as an info provider that brings up a lot of issues for another time.

See Also: The Chronicling America Newspaper Digitization Project from the Library of Congress and NEH Continues to Digitize U.S. Newspapers from
1860-1922.

See Also: A Week Ago We Pointed Out That the Australian Newspaper Digitization Program Had Recently Passed the Five Million Digitized Pages Mark

UPDATE: We We’ve Been Able to Read the Full Text of the Letter Google Sent to Google News Archive Digitization Partners. A Few Pieces of Info from the Letter Follow:

  • The Letter Says That Google Digitized 60 Million Newspaper Pages. Caveat (at least for now): As Mentioned Earlier in this Post, According to the Boston Phoenix Not All Digitized Material Has Been Indexed by Google.  So, we’re not sure exactly what the 60 Million Number Means. We have contacted Google hoping to learn more.
  • The Letter Says: Publishers Want to Sell Content So Google’s Program Not As Appealing as What Others Offer, So “To Keep Up With the Shift…” Google is Focusing Resources on Newer Projects To Help the Industry. Ex. Google OnePass
  • Google Suggests ProQuest (a Google Partner) as a Company Publishers Might Want to Consider “To Explore New Opportunities.”
  • Publishers Can Request Copies of Files Google Digitized and Can Use Them With No Fees (Google Waived a Per Page Fee as a Way to Express Their Gratitude to Participants) or Limitations
  • The Letter Links to a Request Form With Info About What is Sent to Publishers If They Ask for Copies of the the Digital Files. For Each Digitized Page Three Files Will Be Sent:
  • Two High-Resolution Image Files (the Original “Raw” image and the “Cleaned” Image With a White Background and Sharper Text)
  • One HTML File Containing the OCRd text (File is Suitable for Search But not for Display)
  • Google Points Out that One Reel of Microfilm Can Return a File With Up to 20GB of Digitized Material

Filed under: Archives and Special Collections, Companies (Publishers/Vendors), Digital Preservation, Journal Articles, Libraries, Management and Leadership, News, Patrons and Users, Reports, Resources

SHARE:

ArchivesChronicling AmericaDigitizationDigitized Archives & LibrariesDigitized NewspapersGoogle

About Gary Price

Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.

ADVERTISEMENT

Archives

Job Zone

ADVERTISEMENT

Related Infodocket Posts

Andrea Jackson Gavin Appointed Inaugural Program Director of the HBCU Digital Library Trust

Below is the Full Text of the Announcement Letter (via the Harvard Library): We are delighted to announce the appointment of Andrea Jackson Gavin as the inaugural Program Director of the ...

U.S. Census Releases 2020 Data for Nearly 1,500 Detailed Race and Ethnicity Groups, Tribes and Villages

From the U.S. Census: The U.S. Census Bureau today released 2020 Census population counts and sex-by-age statistics for 300 detailed race and ethnic groups, as well as 1,187 detailed American ...

Book Bans Spike by 33% During the Last School Year, According to New Research by PEN America

From PEN America:  The number of public school book bans across the country increased by 33 percent in the 2022-23 school year compared to the 2021-22 school year, according to ...

Penn State Leads Big Ten Academic Alliance Project on Open Homework Systems; ChatGPT Usage is Rising Again as...

AI ChatGPT Usage is Rising Again as Students Return to School (via Bloomberg) Universities Rethink Using AI Writing Detectors to Vet Students’ Work (via Bloomberg) Amazon AI-Generated Books Force Amazon ...

$800,000 Budget Cut Proposed: West Virginia University Library System Plans to Reduce Staff, Modify Space Amid University Cuts;...

From WCHS: Following the vote to cut 28 majors and more than 100 faculty positions at West Virginia University, the university’s library system could be the next to take the ...

American Library Association (ALA) Releases Preliminary Data on 2023 Book Challenges; Highest Number of Book Challenges Since ALA...

UPDATE LeVar Burton to Lead 2023 Banned Books Week as Honorary Chair (via ALA) —End Update— Below is the full text of a statement released today by the American Library ...

Harris County Libraries Declared a 'Book Sanctuary' Amid State Crackdown; UCLA Library Receives $4.2 Million Political Cartoon Collection...

Acquisitions UCLA Library Receives $4.2 Million Political Cartoon Collection Spanning Centuries (via UCLA  California At 20, San Jose’s MLK Library Remains a Partnership For the Books (via The Mercury News) ...

The Lens Loads Now Open Dataset From Crossref of Retraction Watch Papers; Digital Science Announces Brand Redesign for...

Clarivate Clarivate Unveils Citation Laureates 2023 – Annual List of Researchers of Nobel Class Digital Science Digital Science Announces Brand Redesign for ReadCube and Papers Internet Archive IMLS National Leadership Grant ...

New From AUPresses & Ithaka S+R: "Print Revenue and Open Access Monographs: A University Press Study"

From a Joint News Release: The Association of University Presses (AUPresses) and Ithaka S+R today publish “Print Revenue and Open Access Monographs: A University Press Study.” This report is the ...

Making IIIF Official at the Internet Archive; Exploring Equity on Wikipedia; & More News Headlines

American Library Association (ALA) ALA Introduces New LibGuide on How to Explore and Use Library of Congress Digital Collections In Library Programming ALA ‘s Committee on Library Advocacy Releases Update ...

Journal Article: "Redesigning Research Guides: Lessons Learned from Usability Testing at the University of Memphis"

The article linked below was published today by Information Technology and Libraries (ITAL). Title Redesigning Research Guides: Lessons Learned from Usability Testing at the University of Memphis Authors Jessica McClure ...

University of Illinois: Information Sciences Professor Developing Tool to Make Data Visualizations Accessible to Blind Researchers, Students

From the University of Illinois:  JooYoung Seo, a professor of information sciences at the University of Illinois Urbana-Champaign, is developing a data visualization tool that will help make visual representations of statistical ...

ADVERTISEMENT

FOLLOW US ON TWITTER

Tweets by infoDOCKET

ADVERTISEMENT

This coverage is free for all visitors. Your support makes this possible.

This coverage is free for all visitors. Your support makes this possible.

Primary Sidebar

  • News
  • Reviews+
  • Technology
  • Programs+
  • Design
  • Leadership
  • People
  • COVID-19
  • Advocacy
  • Opinion
  • INFOdocket
  • Job Zone

Reviews+

  • Booklists
  • Prepub Alert
  • Book Pulse
  • Media
  • Readers' Advisory
  • Self-Published Books
  • Review Submissions
  • Review for LJ

Awards

  • Library of the Year
  • Librarian of the Year
  • Movers & Shakers 2022
  • Paralibrarian of the Year
  • Best Small Library
  • Marketer of the Year
  • All Awards Guidelines
  • Community Impact Prize

Resources

  • LJ Index/Star Libraries
  • Research
  • White Papers / Case Studies

Events & PD

  • Online Courses
  • In-Person Events
  • Virtual Events
  • Webcasts
  • About Us
  • Contact Us
  • Advertise
  • Subscribe
  • Media Inquiries
  • Newsletter Sign Up
  • Submit Features/News
  • Data Privacy
  • Terms of Use
  • Terms of Sale
  • FAQs
  • Careers at MSI


© 2023 Library Journal. All rights reserved.


© 2022 Library Journal. All rights reserved.