Disco is a framework for cluster computing on large datasets. It supports MapReduce-style computations.
http://github.com/discoproject/disco.git
develop