FYI
-------- Original Message -------- Subject: Hadoop workshop at SARA, December 7 Date: Wed, 3 Nov 2010 17:13:08 +0100 From: Evert Lammerts Evert.Lammerts@sara.nl To: Evert Lammerts Evert.Lammerts@sara.nl
What: SARA Apache Hadoop computing workshop When: Tuesday, December 7th Where: SARA, Science Park 121 Amsterdam, room TBA
Dear readers,
You are all invited to participate in the first Apache Hadoop "hackathon" in The Netherlands on Tuesday, December 7th. This event is also the kick-off of the SARA Proof-of-Concept Hadoop service. More information on the event and the technology can be found below.
If you are interested in participating, please register as soon as possible, and not later than December 1st by sending an email to evert.lammerts@sara.nl.
Best regards, Evert Lammerts
===================================================
SARA Apache Hadoop "Hackathon"
Back in 2004, driven by the need to process extreme amounts of data, Google developed a system that exists of the Google File System and MapReduce. Based on two papers that Google published on the topic[1,2], Doug Cutting[3] created what is now the Apache Hadoop software stack. Apache Hadoop is used in several capacities by a number of internet giants (e.g. Yahoo!, IBM, eBay, Facebook, Last.fm, LinkedIn, Twitter and many more[4]). It's ability to store and process extremely large datasets makes it particularly beneficial for sciences as Natural Language Processing, BioInformatics, Social Sciences and Humanities. SARA provides a Proof-of-Concept Apache Hadoop storage and computing service for scientific use[5].
The "hackathon" will be a hands-on introduction to Hadoop. You are invited to come spend a day at SARA to play with the system, with the support of two experienced users, Edgar Meij (UvA) and Djoerd Hiemstra (UT). You can work on your own or in groups and analyze your own dataset or play with for example the Wikipedia, ENRON or White House access records datasets. Programming skills will be assumed, and you need to take your own laptops.
Coffee and soft drinks will be provided throughout the day, and lunch will be served at 12.00 AM.
Program:
09:30 - 09:35 Introduction to the day 09:35 - 10:15 Examples of data analysis with Hadoop 10:30 - 17:00 Hackathon 17:00 - end Presentation of results
[1] http://labs.google.com/papers/gfs.html [2] http://labs.google.com/papers/mapreduce.html [3] http://en.wikipedia.org/wiki/Doug_Cutting [4] http://wiki.apache.org/hadoop/PoweredBy [5] http://www.sara.nl/news/recent/20101103/Hadoop_proof-of-concept.html
Evert Lammerts Consultant eScience Support SARA Computing & Network Services High Performance Computing & Visualization
Phone: +31 20 888 4101 Email: evert.lammerts@sara.nl http://www.sara.nl