The Echo Nest, a music intelligence platform powering smarter music apps across the web and various devices, announced on Tuesday that it has provided music analysis and metadata to the Million Song Dataset, a collaboration between The Echo Nest and Columbia University’s LabROSA (Laboratory for the Recognition and Organization of Speech and Audio) department, with hosting by Infochimps and funding from the National Science Foundation.
The core of the Million Song Dataset consists of detailed data about one million songs, but no audio files. However, it includes mapping to 7digital’s library of 30-second samples, allowing researchers to test their technologies in the real world. This large dataset (approximately 200GB, depending on which files the developer chooses) is hosted by Infochimps.
“There are a lot of compelling music applications that haven’t been built because of the heavy lifting involved with the infrastructure,” said Infochimps CEO Nick Ducoff. “Between The Echo Nest’s platform and the Million Song Dataset available on Infochimps, the only thing keeping a developer from building a compelling music-focused app is his or her imagination.”
Interested parties can visit http://labrosa.ee.columbia.edu/millionsong/ for the code, instructions on how to use it, benchmark results for example tasks (such as automatic song tagging and artist recognition), artist mapping to Yahoo’s user ratings, and demonstrations of how to fetch audio snippets from 7digital and represent artists on a world map using the data, as well as a forum and FAQ.
Music Info: "The Echo Nest And Columbia University Announce 'Million Song Dataset'
Gary Price (firstname.lastname@example.org) is a librarian, writer, consultant, and frequent conference speaker based in the Washington D.C. metro area. Before launching INFOdocket, Price and Shirl Kennedy were the founders and senior editors at ResourceShelf and DocuTicker for 10 years. From 2006-2009 he was Director of Online Information Services at Ask.com, and is currently a contributing editor at Search Engine Land.