New Enhancements for Qubole Notebooks
In an earlier blog post, we discussed the availability of Jupyter-based Notebooks for Machine Learning (ML) and analytics with a host of features that make…
“If you are a cloud adopter rapidly adopting cloud services, but not developing the finance governance muscle, you will certainly be visiting the cloud optimization…
Enterprises leverage cloud providers’ compute and storage services for their ad-hoc data analytics, streaming analytics, and ML use cases as cloud data lakes provide significant…
All data-driven organizations use data in three ways: to report on the past, to understand the present, and to predict the future. Data warehouses and Business…
How to Optimize Spark Clusters on Qubole for Cost Reliability and Performance
This second blog from the three-part series explains how a Spark cluster on…
Spot nodes on AWS (and preemptible VMs on Google Cloud Platform, GCP) are a great way to reduce your Total Cost of Ownership (TCO) for…
As a best practice, we recommend users create a few large Presto clusters that are shared between different teams, instead of creating multiple small clusters…
Qubole has provided Datadog as an integrated monitoring service for its clusters, including Presto clusters. This brings many improvements compared to the “old approach” for…
Data Lake Essentials, Part 3 – Data Lake Data Catalog, Metadata, and Search
In this multi-part series, we will take you through the architecture of…
Data Lake Essentials, Part 2 – File Formats, Compression, and Security
In this multi-part series, we will take you through the architecture of a Data…