My plan is to integrate GE in a Cloudera Hadoop System. Right now I’m figuring out what the best way would be to integrate things with Oozie (I’d prefer Airflow, but customer works with Oozie).
In Germany, bigger mid-size companies are still a lot on premises.
In the end I’d like to use DataDocs, too so have to figure out where to deploy all that results and how to do alerting (don’t know yet if letting do GE or Oozie the E-Mail after failure is better).
any success implementing it so far? I have the same issue: a Cloudera cluster (yes, old and dreadful) and am looking for to adding great expectations to it.