Things I Did Not Know About the Internet Archive

I started my new role at the Internet Archive earlier this week, joining the Web Group, an awesome team of passionate and talented new colleagues.

I knew many things about the Internet Archive. This is where the idea of archiving the global web was born and the world’s largest web archive resides. I have visited the Archive before and worked with various people on collaborative projects over the years…

In week one I discovered three things I did not know about the Internet Archive:

1. I was told about the Internet Archive’s scanning centre in London at the Wellcome Library, where I now have a desk and the privilege of working inside this absolute architectural highlight.  What I did not quite realise was the ambition and scale of what was going on in the darkness of blackout blinds. Ten UK libraries’ historical books on medial history are being digitised here, at the rate of 800 pages per hour and with the goal to digitise 16-17 million pages by 2016. Saving Human Knowledge at 800 Pages an Hour has the details including some stunning photographs.

2. The Internet Archive Companion, an iPhone and iPad app for accessing Internet Archive’s collections. This is not produced by or officially associated with the Internet Archive and the web archive, or the Wayback Machine, does not seem to be included but it really does not matter. The app brought close to me so many other things: text, video, audio, image, concert… While making dinner last night, I thoroughly enjoyed watching Jane Eyre, the 2011 film directed by Cary Fukunaga, starring Mia Wasikowska and Michael Fassbender.

When will I be able to use other libraries’ collections like this?

嵌入图像的永久链接
3. Internet Archive has also made a range of web crawl data available for research. This can be  downloaded from https://archive.org/details/web&tab=collection. I was slightly surprised but very pleased to see domain crawls of some national libraries among the list – many national web archives are not publicly accessible due to legal requirements – This is great news for researchers.

I look forward to discovering more, which should not be hard as I am setting off on Monday for a 5-week orientation at Internet Archive’ headquarter in San Francisco.

Advertisements

One thought on “Things I Did Not Know About the Internet Archive

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s