Most Recent Articles
Succeeding with a Cloud Data Lake - from Architecture to Operations
Learn best practices for building a cloud data lake operation, from people and tools to processes, in this webinar.
Mastering Data Governance on Cloud Data Lakes with Multiple Engines
Qubole data privacy and integrity experts cover how to maintain data integrity and privacy of data residing in data lakes using various open-source engines.
Mastering Data Discovery on Cloud Data Lakes
Best practices for working with different datasets, and when to use Apache Spark, Presto and other engines
Comcast, Fanatics and MediaMath at Data Platforms 2018
Comcast, Fanatics and MediaMath discuss their successes and challenges creating a data-driven enterprises
Introduction to Qubole: A Data Platform Built To Scale
Qubole SVP of Product Mohit Bhatnagar shares how Qubole’s cloud-native platform helps companies scale their operations, activate petabytes of data, and reach admin-to-user ratios as high as 1:200
Leveraging Streaming and Batch Data Sets for ML Applications
Learn how to use Qubole to acquire and transform data sets for data science and analytics, make data sets available to different users, and fully leverage your data lake.
Key Differences Between On-Prem and Cloud Data Platforms
Learn the key differences between on-premise and cloud solutions, benefits of cloud data lakes and data warehouses, and how to build the right architecture for your analytics and ML needs.
How To Build Scalable Data Pipelines for Machine Learning
Common challenges faced by data engineers when building pipelines for ML and how to address them
Data Engineering Pitfalls and How to Avoid Them
Simple, practical solutions for common challenges faced by data engineering teams
Speed to Value: How To Justify Your Big Data Investments
How best-in-class companies are generating rapid value from big data while also managing costs
Succeeding with Big Data Analytics and Machine Learning in The Cloud (451 Research)
451 Research covers best practices such as using automation, enabling collaboration, and financial governance
How To Increase Value from Machine Learning and Advanced Analytics on Azure
How to modernize your architecture with data lakes and data warehouses on the cloud
Keeping Costs Under Control When Processing Big Data in the Cloud
How to iIdentify areas of cost optimization to drive maximum performance for the lowest TCO
Best Practices for Moving Big Data from On-Prem To The Cloud
What Cloudera, Hortonworks or MapR customers should consider when moving to a cloud-native platform
Modern Data Engineering and The Rise of Apache Airflow
Brief introduction to Apache Airflow, its optimal use cases, and real-world examples
Using Qubole Presto for Interactive and Ad-Hoc Queries
Tips for when to use Presto versus Apache Spark, and how to enable self-service access to your data lake
Running Apache Spark at Scale in the Cloud
Deep dive into the use cases for Apache Spark on Qubole, including ETL and machine learning
Migrating to a Modern Cloud-Native Data Lake with Microsoft Azure and Qubole
Benefits of migrating to a cloud-native data lake and how to choose the right data architecture
Enterprise-Scale Big Data Analytics on Google Cloud Platform
Why a unified experience with native notebooks, a command workbench, and integrated Apache Airflow are a must.
Why You Need a Cloud Platform to Succeed with Big Data