SUBSCRIBE
SUBSCRIBE
EXPLORE +
  • About infoDOCKET
  • Academic Libraries on LJ
  • Research on LJ
  • News on LJ
  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Libraries
    • Academic Libraries
    • Government Libraries
    • National Libraries
    • Public Libraries
  • Companies (Publishers/Vendors)
    • EBSCO
    • Elsevier
    • Ex Libris
    • Frontiers
    • Gale
    • PLOS
    • Scholastic
  • New Resources
    • Dashboards
    • Data Files
    • Digital Collections
    • Digital Preservation
    • Interactive Tools
    • Maps
    • Other
    • Podcasts
    • Productivity
  • New Research
    • Conference Presentations
    • Journal Articles
    • Lecture
    • New Issue
    • Reports
  • Topics
    • Archives & Special Collections
    • Associations & Organizations
    • Awards
    • Funding
    • Interviews
    • Jobs
    • Management & Leadership
    • News
    • Patrons & Users
    • Preservation
    • Profiles
    • Publishing
    • Roundup
    • Scholarly Communications
      • Open Access

June 3, 2021 by Gary Price

University of Illinois Urbana-Champaign: New iSchool Project Aims to Digitize Predigital Scientific Literature

June 3, 2021 by Gary Price

From the UIUC ISchool:

Jill Naiman (Photo Source: UIUC)

Teaching Assistant Professor Jill Naiman has received a $506,912 grant from the National Aeronautics and Space Administration (NASA) to digitize predigital scientific literature. Her project, “The Reading Time Machine: Transforming Astrophysical Literature into Actionable Data,” is a collaboration with Harvard University and the Astrophysics Data System (ADS), a digital library portal operated by the Smithsonian Astrophysical Observatory (SAO) under a NASA grant. With over 15 million records, ADS is one of the most important archives in the scientific field of astronomy.

“Newer documents are ‘born digital,’ making them machine-readable and parseable,” said Naiman. “This has not only helped domain scientists find relevant research more efficiently, but through methods like natural language processing, it also has facilitated new discoveries in these fields.”

Naiman’s project aims to extend these capabilities to predigital documents by extracting their text, figures, and tables, allowing researchers to apply the same information mining methods that are available to “born digital” documents. This will result in more easily searchable documents and new discoveries. The work will also enhance the screen-reading capabilities of these documents to make them more accessible.

For the project, researchers will use optical character recognition and object detection methods to find and “extract” any tables and figure captions in the text. According to Naiman, this is something that has been done in biomedical literature but not in astronomy. After the images are extracted, they will be classified (i.e., graph, photo, picture of sky), and the figure labels will be parsed to extract science-relevant information.

“In each step, we plan on publishing a database—to be hosted by ADS—and the code so that other folks can do the same to their ‘old’ scientific literature,” she said. “The wealth of science generated by such ‘indexing’ efforts in other STEM fields has demonstrated that we have only scratched the surface of the discoveries possible when the community has access to science-ready data collected from the literature.”

Naiman earned her PhD in astronomy and astrophysics from the University of California, Santa Cruz, and completed National Science Foundation and Institute of Theory and Computation postdoctoral fellowships at the Harvard-Smithsonian Center for Astrophysics before coming to the University of Illinois. She is a Fiddler Faculty Fellow at the National Center for Supercomputing Applications (NCSA) at Illinois.

Filed under: Archives and Special Collections, Data Files, Digital Collections, Funding, Interactive Tools, Libraries, News, Publishing

SHARE:

About Gary Price

Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.

ADVERTISEMENT

Archives

Job Zone

ADVERTISEMENT

Related Infodocket Posts

ADVERTISEMENT

FOLLOW US ON X

Tweets by infoDOCKET

ADVERTISEMENT

This coverage is free for all visitors. Your support makes this possible.

This coverage is free for all visitors. Your support makes this possible.

Primary Sidebar

  • News
  • Reviews+
  • Technology
  • Programs+
  • Design
  • Leadership
  • People
  • COVID-19
  • Advocacy
  • Opinion
  • INFOdocket
  • Job Zone

Reviews+

  • Booklists
  • Prepub Alert
  • Book Pulse
  • Media
  • Readers' Advisory
  • Self-Published Books
  • Review Submissions
  • Review for LJ

Awards

  • Library of the Year
  • Librarian of the Year
  • Movers & Shakers 2022
  • Paralibrarian of the Year
  • Best Small Library
  • Marketer of the Year
  • All Awards Guidelines
  • Community Impact Prize

Resources

  • LJ Index/Star Libraries
  • Research
  • White Papers / Case Studies

Events & PD

  • Online Courses
  • In-Person Events
  • Virtual Events
  • Webcasts
  • About Us
  • Contact Us
  • Advertise
  • Subscribe
  • Media Inquiries
  • Newsletter Sign Up
  • Submit Features/News
  • Data Privacy
  • Terms of Use
  • Terms of Sale
  • FAQs
  • Careers at MSI


© 2026 Library Journal. All rights reserved.


© 2022 Library Journal. All rights reserved.