VII · Modern builders

Doug Cutting

1963–
Cloudera (chief architect) · ex-Yahoo · co-creator of Hadoop · Lucene · Nutch
Gave us

Lucene (1999, search). Nutch (2002, web crawler). Hadoop (2006, with Mike Cafarella) — an open-source clone of Google's GFS + MapReduce papers that powered a decade of "big data" infrastructure.

The story

Read Google's 2003 GFS paper and the 2004 MapReduce paper, then proceeded to spend the next two years implementing them in his garage office, named the project after his son's stuffed elephant. The name has outlasted the technology.

One more thing

The elephant is real. Was named Hadoop. Is presumably still around somewhere.