You are here
Home > Spotlight > Google’s Dataflow pipeline tool can now run on Spark, thanks to Cloudera

Google’s Dataflow pipeline tool can now run on Spark, thanks to Cloudera

If you wish to process huge piles of data very, very quickly, you’re in luck. From the comfort of your own data center, you can now use Google’s recently announced Dataflow programming model for processing data in batches or as it comes in, on top of the fast Spark open-source

Top