Learn about the future of Functional Programming and Big Data by reading this evaluation of three relevant open source technologies; PigPen, Cascalog, and Apache Spark. A small report is written, that reports on the per minute count of post actions from a two hour test run of a web service that generated 4077809 events, in all three languages and environments. Based on the findings of that development work, these three technologies are evaluated and compared.
|lines of code||27||28||5|
|run time in seconds||123||85||13|
|the good||easy to understand||implicit grouping||small and fast|
|the bad||broken and cumbersome||funky punctuation syntax||limited reduce side functionality|
|the ugly||evil pig UDF hack||not truly functional||not always using hadoop|
If you want more detail but are too impatient to read the 5 page document below, then check out this slide deck summary.