This tutorial explains the creation of a map/reduce application based on Lokad.Cloud; the application computes the histogram of an image using several Azure workers and handles the data in blob storage and queues.
This tutorial explains the creation of a map/reduce application based on Lokad.Cloud; the application computes the histogram of an image using several Azure workers and handles the data in blob storage and queues.
Hadoop is an Apache project for distributed parallel computing using the MapReduce computational framework.
While Hadoop is written in Java, hadoop-sharp enables developers to use any CLR (.NET/Mono) supported programming language (C#, VB.NET, F#, IronPython, IronRuby) for their MapReduce computations.
The current version uses the Hadoop Pipes interface and is modeled after the Hadoop Pipes C++ API, passing around byte arrays. The idea is to evolve the API to use all the fancy features of the CLR such as generics, lambda functions etc. and eventually LINQ.
(The current version is NOT ready for end-users. This Google Code project is a temporary home in order to facilitate rapid collaborative development before being submitted to the upstream Hadoop repository)