To an outsider driving past, an oil refinery looks like a tangled mess of pipes flowing to and from various undifferentiated structures. To the uninitiated, the Apache Hadoop platform appears equally opaque. Just as a process engineer might lead a refinery tour by discussing the refinery’s inputs, outputs, processes and