F.Y.I.

keywords: Hadoop, HEP, data analysis, Cloud, ...

Begin forwarded message:

From: Alexei Klimentov <alexei@mail.cern.ch>
Date: August 19, 2009 6:56:57 PM GMT+02:00
To: ADC Operations <atlas-project-adc-operations@cern.ch>
Subject: CERN Computing Colloquium announcement: August 21st, 2009 (fwd)


FYI


From: David Myers <David.Myers@cern.ch>
Date: 12 August 2009 09:06:03 GMT+02:00
To: "it-dep-full (All the members in IT Department in its extended  
form)" <it-dep-full@cern.ch>
Subject: CERN Computing Colloquium announcement: August 21st, 2009

_____________________________________________
Dear All,

The next Computing Colloquium will take place:

On:             Friday , August 21st
At:             14:00
In:             Main Auditorium

Title:          Analyzing petabytes of data with Hadoop

Speaker:        Mr. Jeff Hammerbacher, Chief Scientist, Cloudera

Abstract: The open source Apache Hadoop project provides a powerful  
suite of tools for storing and analyzing petabytes of data using  
commodity hardware. After several years of production use inside of  
web companies like Yahoo! and Facebook and nearly a year of  
commercial support and development by Cloudera, the technology is  
spreading rapidly through other disciplines, from financial  
services and government to life sciences and high energy physics.

The talk will motivate the design of Hadoop and discuss some key  
implementation details in depth. It will also cover the major  
subprojects in the Hadoop ecosystem, go over some example  
applications, highlight best practices for deploying Hadoop in your  
environment, discuss plans for the future of the technology, and  
provide pointers to the many resources available for learning more.

In addition to providing more information about the Hadoop  
platform, a major goal of this talk is to begin a dialogue with the  
ATLAS research team on how the tools commonly used in their  
environment compare to Hadoop, and how Hadoop could improve better  
to serve the high energy physics community.

Short Biography: Jeff Hammerbacher is Vice President of Products  
and Chief Scientist at Cloudera. Jeff was an Entrepreneur in  
Residence at Accel Partners immediately prior to founding Cloudera.  
Before Accel, he conceived, built, and led the Data team at  
Facebook. The Data team was responsible for driving many of the  
applications of statistics and machine learning at Facebook, as  
well as building out the infrastructure to support these tasks for  
massive data sets. The team produced two open source projects:  
Hive, a system for offline analysis built above Hadoop, and  
Cassandra, a structured storage system on a P2P network. Before  
joining Facebook, Jeff was a quantitative analyst on Wall Street.  
Jeff earned his Bachelor's Degree in Mathematics from Harvard  
University and recently served as contributing editor to the book  
"Beautiful Data", published by O'Reilly in July 2009.

The InDiCo link is here:  http://indico.cern.ch/conferenceDisplay.py?confId=59791

With regards,

  David Myers.

-------------- European Organization for Nuclear Research  
--------------
Dr. D.R. Myers,                              Tel:    +41 22 767 4646
Head, Computing Security,                    Mobile: +41 76 487 3994
CERN, IT Division,
CH-1211 Geneva 23, Switzerland.              EMail:  David.Myers@cern.ch
------------------------------------------------------------------------