May 20, 2022

Europeana Launches an Image Classification Pilot Project

From a Europeana Pro Blog Post:

Enrichment plays a fundamental role in Europeana’s activities. In our context, enrichment can be defined as generating metadata from the data provided by our partners, adding extra value to the data we receive. We use the combination of original and enriched metadata for indexing our records, and this lets us build functionalities that allow people to search and browse our collections, and receive recommendations. Achieving automatic enrichment using machine learning algorithms is one of the objectives of the Europeana Strategy 2020-2025, triggering projects such as Saint George on a Bike.

Europeana’s R&D team is exploring how computer vision techniques (systems which can make sense of visual data) can improve the enrichment Europeana conducts. We decided to start a pilot on image classification, where we build a model that is able to classify images from digitised cultural heritage objects into a set of predefined categories. We believe that a system trained with the selected categories would prove useful in enriching our collections.

Learn More, Read the Complete Blog Post (1000 words)

Direct to Classification Pilot Project on GitHub

About Gary Price

Gary Price is a librarian, writer, consultant, and frequent conference speaker based in the Washington, D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at, and is currently a contributing editor at Search Engine Land.