ETL options from Hive/Presto/S3 to Cassandra


Hi all,
We have data that is loaded into Hive/Presto every few hours.
We want to transfer that data to Cassandra tables.
What are some high-performance ETL options for transferring data from Hive or Presto into Cassandra?

Also, does anybody have performance numbers comparing
- loading data from S3 into Cassandra using sstableloader
- loading data from S3 into Cassandra by other means (such as the Spark API)?

Thanks,
mugunthan