The Disco MapReduce Framework

Summary

Chris Mueller from Life Technologies introduces us to Disco, a MapReduce framework built in Python and Erlang.

Showing that Hadoop is not alone in the MapReduce world, Chris reviews the basic MapReduce paradigm, dataflow, file and job distribution, and goes on to explain the Disco Distributed Filesystem (DDFS) before going into some use- case scenarios in next generation genomic sequencing.