6.5830/6.5831: Database Systems
Fall 2022
Paper:

  • Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, 2004. [PDF]

This paper describes a specific implementation of a simple query system (called Map-Reduce) that runs on Google's clusters.
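
To make the programming model concrete before reading, here is a minimal single-process sketch of a word count, the paper's running example. The names map_fn, reduce_fn, and run_mapreduce are hypothetical and chosen for illustration only. The sketch assumes all data fits in memory and ignores everything the real system does across machines (partitioning, scheduling, re-execution after failures).

```python
from collections import defaultdict


def map_fn(doc_name, contents):
    """Map step: emit an intermediate (word, 1) pair for every word."""
    for word in contents.split():
        yield word, 1


def reduce_fn(word, counts):
    """Reduce step: combine all values emitted for one key."""
    return word, sum(counts)


def run_mapreduce(inputs, map_fn, reduce_fn):
    """Hypothetical in-memory driver: map, group by key, then reduce."""
    groups = defaultdict(list)
    for key, value in inputs.items():
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    # Process keys in sorted order, one reduce call per distinct key.
    return dict(reduce_fn(k, vs) for k, vs in sorted(groups.items()))


docs = {"a.txt": "the quick fox", "b.txt": "the lazy dog the end"}
word_counts = run_mapreduce(docs, map_fn, reduce_fn)
print(word_counts)
```

Because each map call and each reduce call depends only on its own inputs, a task can be run again on another machine if it is lost. Keep that property in mind when considering the question below.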

As you read the paper, consider the following question:

  1. What kinds of failures can a search engine (or the Map-Reduce system) tolerate? What consistency guarantees are provided in the face of failures?