Speaker: Venkat Sowrirajan, Software Engineer, Qubole
Presentation: Spark applications are difficult to tune for optimal performance and with the use of cloud stores like S3 as truth-store makes things even more complex. This talk will cover briefly about SparkLens (Spark tuning tool), Spark with Rubix (distributed cache), direct-write for Hive tables and its performance numbers.
Learn more about.. Data Platforms Conference: https://www.dataplatforms.com Qubole: https://www.qubole.com
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.