Qubole On-Demand Webinar - Speed to Value How to Justify Data Analytics Investments

January 11, 2019
Apache Spark is a powerful open-source engine used for processing complex, memory-intensive workloads to create data pipelines or to build and train machine learning models. Running Spark on a cloud data activation platform enables rapid processing of petabyte size datasets. Qubole runs the biggest Spark clusters in the cloud and supports a broad variety of use cases from ETL and machine learning to analytics. Qubole supports a performance-enhanced and cloud-optimized version of the open source framework Apache Spark. Qubole brings all of the cost and performance optimization features of Qubole’s cloud native data platform to Spark workloads. Qubole improves the performance of Spark workloads with enhancements such as fast storage, distributed caching, advanced indexing, metadata caching, job isolation on multi-tenant clusters. Qubole has open sourced SparkLens, a Spark profiler that provides insights into Spark application that help users optimize their Spark workloads. In this webinar, you’ll learn: - Why Spark is essential for big data, machine learning, and artificial intelligence - How a cloud-native platform allows you to scale Spark across your organization, enable all data users, and successfully deploy AI and ML at scale - How Spark runs on Qubole in a live demo - Real-world examples of companies using Spark on Qubole
Previous Video
Sentiment Analysis with H2O, PySpark and Word2Vec on Qubole
Sentiment Analysis with H2O, PySpark and Word2Vec on Qubole

Using Qubole Notebooks to analyze Amazon product reviews using word2vec, pyspark, and H2O Sparkling water ...

Next Video
Qubole Security Update: Role-Based Access for Presto, Spark, and Hive Commands
Qubole Security Update: Role-Based Access for Presto, Spark, and Hive Commands

Restrict the visibility of commands to other users in the Qubole account by setting command access to private