Most Recent Articles

As organizations grapple with the sudden economic turmoil created by the pandemic, there is a critical need to balance cost savings with the need to drive innovation.

Join our on-demand demo to learn how the most data-driven organizations are able to significantly enhance TCO, performance and optimization of their cloud data lakes with Qubole.

Welcome to the first episode of Inside the Brain of Cloud Data Platform Leaders (ITB). A webinar series where we bring to you, an interactive discussion between Data Lake Platform Industry Leaders.

Learn best practices for building a cloud data lake operation, from people and tools to processes, in this webinar.

Qubole data privacy and integrity experts cover how to maintain data integrity and privacy of data residing in data lakes using various open-source engines.

Best practices for working with different datasets, and when to use Apache Spark, Presto and other engines

Comcast, Fanatics and MediaMath discuss their successes and challenges creating a data-driven enterprises

Qubole SVP of Product Mohit Bhatnagar shares how Qubole’s cloud-native platform helps companies scale their operations, activate petabytes of data, and reach admin-to-user ratios as high as 1:200

Learn how to use Qubole to acquire and transform data sets for data science and analytics, make data sets available to different users, and fully leverage your data lake.

Learn the key differences between on-premise and cloud solutions, benefits of cloud data lakes and data warehouses, and how to build the right architecture for your analytics and ML needs.

Common challenges faced by data engineers when building pipelines for ML and how to address them

Simple, practical solutions for common challenges faced by data engineering teams

How best-in-class companies are generating rapid value from big data while also managing costs

451 Research covers best practices such as using automation, enabling collaboration, and financial governance

How to modernize your architecture with data lakes and data warehouses on the cloud

How to iIdentify areas of cost optimization to drive maximum performance for the lowest TCO

What Cloudera, Hortonworks or MapR customers should consider when moving to a cloud-native platform

Brief introduction to Apache Airflow, its optimal use cases, and real-world examples

Tips for when to use Presto versus Apache Spark, and how to enable self-service access to your data lake

Deep dive into the use cases for Apache Spark on Qubole, including ETL and machine learning