SUBSCRIBE
SUBSCRIBE
EXPLORE +
  • About infoDOCKET
  • Academic Libraries on LJ
  • Research on LJ
  • News on LJ
  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Libraries
    • Academic Libraries
    • Government Libraries
    • National Libraries
    • Public Libraries
  • Companies (Publishers/Vendors)
    • EBSCO
    • Elsevier
    • Ex Libris
    • Frontiers
    • Gale
    • PLOS
    • Scholastic
  • New Resources
    • Dashboards
    • Data Files
    • Digital Collections
    • Digital Preservation
    • Interactive Tools
    • Maps
    • Other
    • Podcasts
    • Productivity
  • New Research
    • Conference Presentations
    • Journal Articles
    • Lecture
    • New Issue
    • Reports
  • Topics
    • Archives & Special Collections
    • Associations & Organizations
    • Awards
    • Funding
    • Interviews
    • Jobs
    • Management & Leadership
    • News
    • Patrons & Users
    • Preservation
    • Profiles
    • Publishing
    • Roundup
    • Scholarly Communications
      • Open Access

April 16, 2026 by Gary Price

Free Law Project is Scanning America’s Case Law; Goal is to Scan 2.5 Million Pages by the Fall

April 16, 2026 by Gary Price

From a FLP Announcement:

Since our founding, Free Law Project has worked tirelessly to create a complete database of every American legal decision ever written. In 2024, we announced that we added historical case law digitized by Harvard Law Library to CourtListener. This made our platform one of the most comprehensive and transparent sources of case law.

Today we’re announcing the next milestone. We’re picking up where Harvard left off by scanning and digitizing thousands of books that have since been published.

This is a step that we’ve been planning for years and that is essential to make CourtListener complete. These scans build on our system of court scrapers, and fill the gap between when the Harvard data ends, in 2018, and today.

This is a large-scale effort. So far we have scanned over 200,000 pages of case law. Our immediate goal is to scan 2.5 million pages by the fall, and we will continue scanning books as they are published so that the CourtListener collection is, and remains, comprehensive.

[Clip]

A key part of what makes this possible is a new system we developed called Blackletter. This tool uses machine learning to intelligently identify and remove editorial material from the scans of the books. Legal opinions themselves are unquestionably public domain, but the editorial additions layered on top of them are sometimes challenged as copyrightable. Separating the two has historically been a labor-intensive, manual process, but Blackletter automates this, allowing us to do millions of redactions nearly in real time.

Learn More, Read the Complete Post (about 970 words)

Filed under: Data Files, Digital Preservation, Libraries, News

SHARE:

About Gary Price

Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.

ADVERTISEMENT

Archives

Job Zone

ADVERTISEMENT

Related Infodocket Posts

ADVERTISEMENT

FOLLOW US ON X

Tweets by infoDOCKET

ADVERTISEMENT

This coverage is free for all visitors. Your support makes this possible.

This coverage is free for all visitors. Your support makes this possible.

Primary Sidebar

  • News
  • Reviews+
  • Technology
  • Programs+
  • Design
  • Leadership
  • People
  • COVID-19
  • Advocacy
  • Opinion
  • INFOdocket
  • Job Zone

Reviews+

  • Booklists
  • Prepub Alert
  • Book Pulse
  • Media
  • Readers' Advisory
  • Self-Published Books
  • Review Submissions
  • Review for LJ

Awards

  • Library of the Year
  • Librarian of the Year
  • Movers & Shakers 2022
  • Paralibrarian of the Year
  • Best Small Library
  • Marketer of the Year
  • All Awards Guidelines
  • Community Impact Prize

Resources

  • LJ Index/Star Libraries
  • Research
  • White Papers / Case Studies

Events & PD

  • Online Courses
  • In-Person Events
  • Virtual Events
  • Webcasts
  • About Us
  • Contact Us
  • Advertise
  • Subscribe
  • Media Inquiries
  • Newsletter Sign Up
  • Submit Features/News
  • Data Privacy
  • Terms of Use
  • Terms of Sale
  • FAQs
  • Careers at MSI


© 2026 Library Journal. All rights reserved.


© 2022 Library Journal. All rights reserved.