We have data that is loaded into Hive/Presto every few hours.
We want to transfer that data into Cassandra tables.
What are some high-performance ETL options for transferring data from Hive or Presto into Cassandra?
Also, does anybody have performance numbers comparing:
- loading data from S3 into Cassandra using sstableloader
- loading data from S3 into Cassandra by other means (such as the Spark API)?