Research Article: Seek and You May (Not) Find: A Multi-Institutional Analysis of Where Research Data are Shared”
The article linked below was published today by PLOS One.
Title
Seek and You May (Not) Find: A Multi-Institutional Analysis of Where Research Data are Shared
Authors
Lisa R. Johnston
University of Wisconsin-Madison
Alicia Hofelich Mohr
University of Minnesota
Joel Herndon
Duke University
Shawna Taylor
Association of Research Libraries
Jake R. Carlson
University of Buffalo
Lizhao Ge
George Washington University
Jennifer Moore
Washington University in St. Louis
Jonathan Petters
Virginia Tech
Wendy Kozlowski
Cornell University
Cynthia Hudson Vitale
Association of Research Libraries
Source
PLoS ONE 19(4): e0302426
April 25, 2024
DOI: 10.1371/journal.pone.0302426
Abstract
Research data sharing has become an expected component of scientific research and scholarly publishing practice over the last few decades, due in part to requirements for federally funded research. As part of a larger effort to better understand the workflows and costs of public access to research data, this project conducted a high-level analysis of where academic research data is most frequently shared. To do this, we leveraged the DataCite and Crossref application programming interfaces (APIs) in search of Publisher field elements demonstrating which data repositories were utilized by researchers from six academic research institutions between 2012–2022. In addition, we also ran a preliminary analysis of the quality of the metadata associated with these published datasets, comparing the extent to which information was missing from metadata fields deemed important for public access to research data. Results show that the top 10 publishers accounted for 89.0% to 99.8% of the datasets connected with the institutions in our study.
Known data repositories, including institutional data repositories hosted by those institutions, were initially lacking from our sample due to varying metadata standards and practices. We conclude that the metadata quality landscape for published research datasets is uneven; key information, such as author affiliation, is often incomplete or missing from source data repositories and aggregators. To enhance the findability, interoperability, accessibility, and reusability (FAIRness) of research data, we provide a set of concrete recommendations that repositories and data authors can take to improve scholarly metadata associated with shared datasets.
Direct to Full Text Article
Related Resources
- New Report From the Association of Research Libraries (ARL): “Publicly Shared Data: A Gap Analysis of Researcher Actions and Institutional Support throughout the Data Life Cycle” (March 24, 2024)
- ARL Releases Two New Reports: “Making Research Data Publicly Accessible: Estimates of Institutional & Researcher Expense” & “Realities of Academic Data Sharing (RADS) Initiative: Research Methodology 2022–2023 Surveys and Interviews” (February 7, 2024)
- New From ARL: “Realities of Academic Data Sharing (RADS) Initiative: Research Update #1 (April 12, 2022)
Filed under: Academic Libraries, Associations and Organizations, Companies (Publishers/Vendors), Data Files, Interviews, Libraries, News, Open Access, PLOS, Publishing, Reports
About Gary Price
Gary Price (gprice@gmail.com) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. He earned his MLIS degree from Wayne State University in Detroit. Price has won several awards including the SLA Innovations in Technology Award and Alumnus of the Year from the Wayne St. University Library and Information Science Program. From 2006-2009 he was Director of Online Information Services at Ask.com.