Tag Archives for library of congress
The Library of Congress just keeps on keeping on, don’t they? First they agree to archive Twitter. Then, they say it’s OK to jailbreak your iPhone and copy your DVDs for educational use. Now, they’ve released an iPhone app that will take you on a virtual tour of some of the beauty of the building and its collections. From the announcement:
The app includes highlights of exhibitions and architectural features, with photos, audio by curators and other experts, links to more detailed online exhibitions, and even a video about the history of Thomas Jefferson’s Library, which in 1815 reconstituted the Library of Congress after the British burned the Capitol in the War of 1812. The architectural photos come courtesy of Carol M. Highsmith, who has been donating magnificent collections of images to the Library copyright-free, for the American people.
The app itself is easy and fun to use, each page of the exhibits includes a description, photographs and audio or video to accompany if available. Some screenshots are below.
Most of this content is already on the myloc.gov website; the virtual tours there take advantage of Silverlight and are quite something that you just can’t replicate on a 5-inch screen. However, that doesn’t stop this app from providing a great tour experience, one that would be a fantastic companion, especially in the halls of the library itself.
By now everyone is up to their ears with tweets about the Library of Congress’s annoucement that they will archive every Tweet. Here are my initial concerns and lauds.
- Cost. Library Journal has already questioned this. How much storage space is this going to require? How will it be sustainable? And how often are they planning on doing updates to the data stream? Will they begin collecting Tweets in real time? Monthly? Yearly?
- Content and archival quality. What about all those shortened bit.ly links? Or the old ones from services that have shut down, like Twurl? Or the really old ones that might be full URLs but that have rotted away? We can’t expect this to be perfect, but is LOC planning on trying to capture anything external to what the tweets may refer to? I got this idea from @dancohen. He suggests that LOC may need to take snapshots of the linked websites, and I think that sounds almost essential in a way albeit messy and difficult.
- Searchability. This could either be the greatest thing to happen to Twitter search, or a huge disappointment. Will LOC make their database of Tweets searchable? Right now, Twitter search is good for about two weeks. Library of Congress has a huge opportunity to blast that wide open, and we can only hope that they are able (infrastructure and $$$-wise) to do so.
- Privacy. A commenter was posted on the LJ blog about this issue. Is there a privacy problem here? Yes, our tweets are public, but is it somehow unethical even if it may not be a violation of copyright to republish Tweets in what could become public archive? Don’t ask me for an answer. Because I’ll say “no, it isn’t.”
- Metadata. How will the data about the tweets and their authors be captured and stored? Furthermore, Twitter is about to let us start adding annotations and other metadata to tweets in our stream. Will this sort of marginalia be lost?
All in all I have a feeling that this project is going to set a tone for social media archiving practice. One of the most talked about services being archived by one of the world’s largest libraries. If they truly think this is important (and I am tempted to agree), I think there is an excellent opportunity here to demonstrate that importance publicly. Essentially, I think the LOC is about the create the standard and best practices for social media archiving with this project, for better or for worse. If it is not implemented well in the beginning, it has the potential to set the bar too low (in both the technical and the public eye) for future endeavours seeking to capture online content.
In any case, this is a very exciting development to round off my library education. Two more days!
UPDATED Apr 15:
- ReadWriteWeb has some more good questions. Among them: “Will the archive include friend/follower connection data? Will it be usable for commercial purposes? Will there be a Web interface for searching it, and will that change the face of Twitter search for good? Is there any way that the much larger archive of Facebook data could be submitted to the same body for analysis of the same kind?” The answer to some of these is already known: no commercial use, there will [sounds like] be little web interface for searching–instead they will present a curated set for public use, while the entire archive will remain for serious research only.
- To address the problem of search, Google Replay was announced yesterday as well. This is Google’s attempt to capture what SearchEngineBLog calls a “vox populi” view of historical events. You can essentially search Google’s index of tweets easily for a specific date or range and keywords to get a sense of what was said about topics such as health care reform. With Twitter handling a reported 19-billion searches a month on their junky index, it’s about time we got another option. Google Replay, just like in their real-tme results display, resolves those shortened links, but I don’t know whether or not the full URL is saved within the index or if it is resolved on the fly. My guess is the latter.
What I want and have always wanted was a way to search for specific tweets by specific users. Sometimes I can recall a fuzzy thing like, “I know @somebody tweeted something about “Topic A” like a month ago.” With Google Replay, we’re getting closer, but it’s not perfect, yet. It does effectively use Twitter handles as a search term, for example: “iphone @danhooker” brings up some tweets (but not all) that I have sent or that were RTd by me. I hope it will get better. Google has that habit, so I fully expect–and pray–this will be a workable option for meaningful Twitter search in the future.
Side show at the Vermont state fair,
Originally uploaded by
The Library of Congress
Now we will be turning to the eminent, the evanescent, the entropic Library of Congress, to examine their Flickr know-how. Outlook good.
From the home page, hiding away towards the bottom-right is a link to some info about their Flickr projects. Turns out that they have quite the collection, and they are actually co-founders of something called “The Commons,” which is a project on Flickr to generate user tags and classification for some of their public domain photographs. There are now around 15 museums and libraries contributing to and participating in this project. A similar project is underway at Steve.museum.