In a previous blog on some big data open source projects covered at OSCON 2014, I reviewed three distributed computing functional programming technologies by using them to write a simple aggregation report. In that blog, I mentioned reproducing the news feed performance map reduce job from a Clojure news feed service that I test ran…
Category: big data
architecture, big data, Technology
Functional Programming and Big Data
by glenn •
Functional Programming and Big Data for the Impatient Learn about the future of Functional Programming and Big Data by reading this evaluation of three relevant open source technologies; PigPen, Cascalog, and Apache Spark. A small report is written, that reports on the per minute count of post actions from a two hour test run of…
big data, Technology
Discovering Big Data with Open Source
by glenn •
With all the media attention on big data and what part Hadoop plays in it, I thought that I would return to basics and blog about real-world Hadoop from a developer’s perspective. This is not about flume, falcon, impala, ambari, oozie, or sqoop. This is a story about a developer, a weekend, some data, and…
big data, Technology
Using Embedded Domain Specific Languages in NoSQL Data Stores
by glenn •
This story starts long ago with a Model Driven Software Development system that I wrote. The generated server side Java code used JDBC to access data in a database. Later on, I provided support for Cassandra, Amazon SimpleDB, and Solr by writing JDBC connectors for those NoSQL data stores. It was nothing fancy. I subclassed…
big data, Technology
Analytics Part 3 – Graphing
by glenn •
Only through the process of constant and unrelenting improvement can you hope to remain competitive. Analytics provides the reality check for that process.
big data, Technology
Analytics Part 2 – Data Selection
by glenn •
Learn how to massage your application’s event logging into data that is compatible with analytics.
big data, Technology
Analytics Part 1 – Logging
by glenn •
Analytics has grown increasingly popular over the years and for good reason. There is currently no better way to arrive at a solid understanding of what your complex application is doing right and wrong than to numerically analyze its activity. This blog is a three parter. In today’s blog, I will cover the basics of…