[FRIAM] Google & IBM giving students a distributed systems labusing Hadoop
Phil Henshaw
sy at synapse9.com
Wed Oct 10 21:45:48 EDT 2007
But doesn't most evidence point to the likelihood that not having enough
computing power isn't our problem with natural systems?
Phil Henshaw ¸¸¸¸.·´ ¯ `·.¸¸¸¸
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
680 Ft. Washington Ave
NY NY 10040
tel: 212-795-4844
e-mail: pfh at synapse9.com
explorations: www.synapse9.com <http://www.synapse9.com/>
-----Original Message-----
From: friam-bounces at redfish.com [mailto:friam-bounces at redfish.com] On
Behalf Of Tom Johnson
Sent: Wednesday, October 10, 2007 11:49 AM
To: Friam at redfish. com
Cc: David Collins
Subject: [FRIAM] Google & IBM giving students a distributed systems
labusing Hadoop
FYI. Following on a brief discussion Tuesday at the data mining
session....
Google
<http://feeds.feedburner.com/%7Er/oreilly/radar/atom/%7E3/167584952/goog
le_ibm_give.html> & IBM giving students a distributed systems lab using
Hadoop
Posted: 09 Oct 2007 04:07 PM CDT
By Jesse Robbins
<http://lucene.apache.org/hadoop/> hadoop-logo.jpg Google
<http://www.google.com/intl/en/press/pressrel/20071008_ibm_univ.html> &
IBM have partnered
<http://www-03.ibm.com/press/us/en/pressrelease/22414.wss> to give
university students hands-on experience developing software for
large-scale distributed systems. This initiative focuses on parallel
processing for large data sets using Hadoop
<http://lucene.apache.org/hadoop/> , an open source implementation of
Google's <http://labs.google.com/papers/mapreduce.html> MapReduce. (See
Tim's earlier post about Yahoo
<http://radar.oreilly.com/archives/2007/08/yahoos_bet_on_h.html> &
Hadoop )
"The goal of this initiative is to improve computer science students'
knowledge of highly parallel computing practices to better address the
emerging paradigm of large-scale distributed computing. IBM and Google
are teaming up to provide hardware, software and services to augment
university curricula and expand research horizons. With their combined
resources, the companies hope to lower the financial and logistical
barriers for the academic community to explore this emerging model of
computing."
The project currently includes the University of Washington,
Carnegie-Mellon University, MIT, Stanford, UC Berkeley and the
University of Maryland. Students in participating classes will have
access to a dedicated cluster of "several hundred computers" running
Linux under XEN <http://www.xensource.com/Pages/default.aspx>
virtualization. The project is expected to expand to thousands of
processors and eventually be open to researchers and students at other
institutions.
As part of this effort, Google and the University of Washington have
released a Creative Commons licensed curriculum to help teach
distributed systems concepts and
<http://code.google.com/edu/content/parallel.html> techniques. IBM is
also providing Hadoop
<http://www.alphaworks.ibm.com/tech/mapreducetools> plug-ins for
Eclipse.
Note: You can also build similar systems using Hadoop
<http://wiki.apache.org/lucene-hadoop/AmazonEC2> with Amazon EC2 . Tom
White recently posted an excellent guide
<http://developer.amazonwebservices.com/connect/entry.jspa?externalID=87
3&categoryID=112> and Powerset has been using this in production
<http://www.royans.net/arch/2007/09/13/scaling-powerset-using-amazons-ec
2-and-s3/> for quite some time.
--tj
--
==========================================
J. T. Johnson
Institute for Analytic Journalism -- Santa Fe, NM USA
www.analyticjournalism.com
505.577.6482(c) 505.473.9646(h)
http://www.jtjohnson.com tom at jtjohnson.us
"You never change things by fighting the existing reality.
To change something, build a new model that makes the
existing model obsolete."
-- Buckminster Fuller
==========================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://redfish.com/pipermail/friam_redfish.com/attachments/20071010/c93da283/attachment.html
More information about the Friam
mailing list