MIREX (MapReduce Information Retrieval Experiments) provides solutions to easily and quickly run large-scale information retrieval experiments on a cluster of machines using Hadoop. Version 0.3 includes tools for the TREC ClueWeb09 and ClueWeb12 collections.
Djoerd Hiemstra and Claudia Hauff. MapReduce for information retrieval evaluation: "Let's quickly test this on 12 TB of data". In: Multilingual and Multimodal Information Access Evaluation. Lecture Notes in Computer Science 6360. Springer Verlag. pages 64-69, September 2010.