First, a Couple of Comments from infoDOCKET Founder/Editor Gary Price:
At this point, six years after it was first announced, LC has nothing new to report about if or when the Twitter archive will ever become available. As we’ve said before, don’t hold your breath.
The Atlantic article provides both an update and useful review. For some additional background, we’ve shared three infoDOCKET posts on the project at the bottom of this post.
In terms of researchers being able to accessing large datasets of tweets the article does not mention that Twitter launched it’s own program to provide selected academics access to data.
The Twitter Data Grants program launched in February 2014 and the first grants were announced two months later.
Now to The Atlantic article, “Can Twitter Fit Inside the Library of Congress?”
From the Article:
Six years after the announcement, the Library of Congress still hasn’t launched the heralded tweet archive, and it doesn’t know when it will. No engineers are permanently assigned to the project. So, for now, staff regularly dump unprocessed tweets into a server—the digital equivalent of throwing a bunch of paper clipped manuscripts into a chest and giving it a good shake. There’s certainly no way to search through all that they’ve collected. And, in the meantime, the value of a vast tweet cache has soared. This frustrates researchers, who had hoped to mine the archive for insights about language and society—and who currently have to pay heavy licensing fees to Twitter for its data.