-
- Technical
Pick‘n’Mix: Apache Cassandra®, Apache Spark™, Zeppelin, Elassandra, Kibana™, and Apache Kafka®
Kafkaesque: \ käf-kə-ˈesk \ Marked by a senseless, disorienting, menacing, nightmarishly complexity. One morning when I woke from troubled dreams, I decided to blog about something potentially Kafkaesque: Which Instaclustr managed open-source-as-a-service(s) can be used together (current and future)? Which combinations are actually possible? Which ones are realistically sensible? And which are nightmarishly Kafkaesque!? In previous blogs,...
Learn MorePaul BrebnerDecember 05, 2017 -
- Technical
Apache Spark™ Structured Streaming With DataFrames
This blog provides an exploration of Spark Structured Streaming with DataFrames The blog extends the previous Spark MLLib Instametrics data prediction blog example to make predictions from streaming data. We demonstrate a 2-phase approach to debugging, starting with static DataFrames first, and then turning on streaming. Finally, we explain Spark structured streaming in more detail...
Learn MorePaul BrebnerNovember 28, 2017 -
- Technical
A Luxury Voyage of (Data) Exploration by Apache Zeppelin
Data Exploration into the cutting-edge technology of Apache Zeppelin (Source: Shutterstock) The catastrophic crash of the Hindenburg in 1937 ended the era of luxury travel in the colossal fast ships of the air that were pushing the boundaries of air travel technology. Zeppelins had many experimental innovations like an auto-pilot, were made from Duralumin girders...
Learn MorePaul BrebnerNovember 09, 2017 -
- Technical
Behind The Scenes
Spoiler alert! Kubrick’s scientific consultant Frederick Ordway once revealed that Kubrick had the props for the film destroyed because he didn’t want to ruin the illusion of 2001 for people. If you prefer to believe that 2001 was real, stop reading now, as behind-the-scenes photos did survive. 2001 pioneered lots of special effects! It was...
Learn MorePaul BrebnerOctober 25, 2017 -
- Technical
Fourth Contact With a Monolith
“The thing’s hollow — it goes on forever — and — oh my God! — it’s full of stars!” It’s full of Spreadsheets! (DataFrames) (Source: Wikimedia Commons) Given that a dog, Laika, was the 1st astronaut to orbit the earth, it’s appropriate for a dog to travel through the wormhole. After travelling through the wormhole,...
Learn MorePaul BrebnerOctober 20, 2017 -
- Technical
Third Contact with a Monolith: Part C—In the Pod
A Simple Classification Problem: Will the Monolith React? Is It Safe?! Maybe a cautious approach to a bigger version of the Monolith (2km long) in a POD that is only 2m in diameter is advisable. What do we know about how Monoliths react to stimuli? A simple classification problem consists of the category (label) “no...
Learn MorePaul BrebnerSeptember 29, 2017 -
- Technical
Third Contact With a Monolith—Beam Me Down Scotty
Regression Analysis is (relatively) easy Hypothesis: Using only a subset of GC metrics we can compute linear regression functions using only heap space used to predict when the next GC occurs. To do this we don’t need access to all the metrics per host, just a subset. And we can extend it in the future to...
Learn MorePaul BrebnerSeptember 20, 2017 -
- Technical
Hello Cassandra! A Java Client Example
This is the third (and final) part of my blog-series on creating a demonstration Cassandra cluster, connecting, and communicating. We landed on the moon and made Second Contact with the Monolith (CQL shell) in our last blog, but what can we do to understand the Monolith better? Let’s explore Cassandra Java client program. Java Client...
Learn MorePaul BrebnerSeptember 07, 2017 -
- Technical
Consulting Cassandra: Second Contact with the Monolith
(Source: Shutterstock) In the first part of this blog (Cluster Creation in Under Ten Minutes), I created a Cassandra cluster. In this part, we blast off to the Moon for 2nd contact. Consulting the Oracles Croesus: Hi Oracle. How will my war with Cyrus the Persian go? Oracle: If you proceed, a great empire will be...
Learn MorePaul BrebnerSeptember 06, 2017