After finishing up the udemy.com Cassandra intro course described in the last blog post, I've been eager to learn more about distributed data administration. A natural next step was to proceed over to datastax.com and to commence wolfing down several of their own free (and very useful!) Cassandra tutorials.
The DataStax tutorials are building quite effectively on the udemy.com introduction material. Where before I was setting up communication between one or several virtual Ubuntu instances on my local Windows machine, the DataStax sequence has me working among AWS EC2 instances as well, which is pretty cool! Their 201- and 220-level courses had me coding through exercises on just one EC2 node, and today I started linking up 3 t2.medium nodes to tackle some of their 210 course (enterprise operations) problems. This presented me with a whole bunch of rewarding challenges back-to-back:
1) linking up S3 buckets with EC2 nodes;
2) using the S3 bucket to transfer and install a tarball distribution of DataStax Enterprise (when a direct connection between the EC2 instances and DataStax was giving me some trouble);
3) negotiating healthy topology of a small Cassandra cluster through conditions that differ slightly from DataStax's current documentation for their 210 course; and
4) troubleshooting the bootstrapping of an individual node in the cluster (I think my running a stress test on that node in isolation before adding it to the cluster gave me some problems!).
At the end of the day, I was able to get all three nodes up and running normally. As I may have mentioned before, I've got a bunch of ideas for web application projects that I'd like to start building, but I'm trying to take manageable 'baby steps' here first. By the end of the week, I'd like to have at least an absolute bare-bones web server up and communicating with a Cassandra cluster.
Figure 1. I finally cleaned up my final AWS EC2 node enough to keep it from hanging during the Cassandra bootstrapping process. Yay! As you can see, I tend to run nodetool status/dsetool status a lot.