Hi
What is Hadoop exactly? Reading through the specs it doesn't look like a
competitor for dCache. So, can anybody comment on the remark below: "HDFS is
much better than dCache"?
Thanks,
Sander
> Plan:
>
> 1) Gain experience w/ the Hadoop File System via a large
> installation on the cluster in B240.
>
> 2) Assuming it really works according to our expectations,
> investigate how to best "pseudo-daemoncore-ify" the HDFS service so
> that runs under the condor_master, responds to Condor administrative
> commands such as on, off, restart, reconfig, etc, and hopefully has
> similar behavior for a debug log and config settings. Note: some
> challenges here include HDFS service is implemented in Java, perhaps
> relies on ssh for network authentication, etc.
>
> 3) Enable the Condor file transfer service to stage files in/out of
> HDFS, in addition to just the shadow.
That's really interesting.
As one datapoint: I was talking to Brian Bockelman about HDFS. He has
been deployed HDFS at his OSG site in Nebraska, and thinks that it is
great, much better than dCache.
An interesting question, once we get far enough along with this work:
can the interface to HDFS be made pluggable? That is, can Condor call
out to a file transfer service? I don't know if it makes sense, but we
might be able to plug in other services one day.