Hey again! I got a few other things done early this week that kind of fit into the category of the last post's topic. Two fun developments were 1) a bit of work in Apache Zeppelin (for prototyping/viewing 'big data' analysis in a notebook environment), and 2) a crash course in the Cassandra distributed and non-relational datastore technology.
The Cassandra tutorial was especially fun because within a few hours of starting the course, I was (admittedly, with my hand held through the process!) creating and updating a mock vehicle tracking web application facilitated through Java/CassandraQL created/hosted in a virtual Ubuntu environment. It looks like soon I'll be spinning up several virtual machines to test out scalability on a small Cassandra cluster. It's fascinating stuff, and I'm looking forward to coding up my own projects in similar spaces.
A couple of screenshots (from the tutorials above) follow.
Figure 1. This is actually just output from a ready-made tutorial script for running and analyzing a Spark job in Zeppelin, but still, the results were cool to see! I like that this kind of notebook environment is available. While it's not something you'd run in production/deployment, I enjoy the option to fiddle around graphically with code and objects during early prototyping of a project (sort like the Python/Jupyter relationship).
Figure 2. Here I'm completing a code-along creation of a Java/Cassandra-based web application that updates and displays (via interaction with Google Maps) vehicle tracking data. Fun stuff! I'm looking forward to deploying database tasks to a cluster of several virtual nodes in the next few days. Thanks to Ruth Stryker (of Infinite Skills) for the great udemy.com course I'm taking to learn about Cassandra!