May 19, 2022

New Journal Article: “Frictionless Data: Making Research Data Quality Visible”

The following article was recently published by the International Journal of Digital Curation.


Frictionless Data: Making Research Data Quality Visible


Dan Fowler
Open Knowledge International

Jo Barratt
Open Knowledge International

Paul Walsh
Open Knowledge International


International Journal of Digital Curation
Vol. 12, Iss. 2 (2017)
Online: May 13, 2018
DOI: 10.2218/ijdc.v12i2.577
Note: An earlier version of this paper was presented at the 12th International Digital Curation Conference.


There is significant friction in the acquisition, sharing, and reuse of research data. It is estimated that eighty percent of data analysis is invested in the cleaning and mapping of data (Dasu and Johnson, 2003). This friction hampers researchers not well versed in data preparation techniques from reusing an ever-increasing amount of data available within research data repositories. Frictionless Data is an ongoing project at Open Knowledge International focused on removing this friction. We are doing this by developing a set of tools, specifications, and best practices for describing, publishing, and validating data. The heart of this project is the “Data Package”, a containerization format for data based on existing practices for publishing open source software. This paper will report on current progress toward that goal.

Direct to Full Text Article
12 pages; PDF.

About Gary Price

Gary Price is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006 to 2009 he was Director of Online Information Services, and he is currently a contributing editor at Search Engine Land.