We are pleased to announce the 0.1.0 release of Cascading.groovy, our Groovy language interpreter extension. With Cascading.groovy, Hadoop applications can be scripted by both advanced and casual Hadoop users without thinking in MapReduce. Read our Groovy Scripting Overview for more details.
Welcome
Cascading is a large dataset build tool and a processing API for Hadoop.
The "scale free" processing API lets the developer quickly assemble complex distributed processes without having to "think" in MapReduce. The build functionality allows common processes to be reused against different datasets, while efficiently scheduling these processes based on their dependencies.
Cascading is generally applicable to any industry that generates and manages large amounts of data, from megabytes to petabytes.
If new to Cascading, see our Cascading presentation (PDF) and About page. For a simple primer on using Cascading, please read our Gentle Introduction. And for those already familiar with Hadoop and MapReduce, read our detailed Overview on how Cascading works.
Now with our Groovy extension, Cascading.groovy, it is even simpler for both advanced and casual Hadoop users to create complex Hadoop workflows. Read more on our Groovy Scripting page.
Recent Events
Version 0.5.0 of Cascading is now available for download. For details on new features and bug fixes, see the CHANGES.txt file. For a quick summary, read on.
Version 0.4.0 of Cascading is now available for download. See below for a review of the major changes. For more details, see the changes.txt file.
Cascading 0.3.0 has just been packaged and is available for download from our downloads page. It incorporates many great changes, read on for more.
A little note to let everyone know Cascading is now available for download and includes the full source. Please visit our project site for more information.