used to fetch cluster table, as well as to connect to the endpoints to stream data to
schema of the table to write to, the table must exist in Cassandra
extracts individual column values from each RDD item
temporary, writeable directory to dump SSTables into
if true, the sstables will be deleted after successful streaming
the larger the buffer, the more memory it takes on the client, but produces bigger sstables, so less compaction on the C* side is needed
BulkTableWriter writes data directly to sstables in a local directory and then streams sstables to the Cassandra cluster.