Latest Content
Welcome to Qubole's Recourse Hub. Deep dive into our latest Blogs, Case Studies, Videos and more.
-
Big Data Engineering for Machine Learning
How big data engines are used for exploring and preparing data, building pipelines, and delivering data sets to ML applications
-
Learn how we helped MiQ scale their data lake ecosystem to be on par with their company's growth
-
What is an Open Data Lake?
A data lake, where data is stored in an open format and accessed through open standards-based interfaces, is defined as an Open Data Lake.
-
The Open Data Lake Platform Brief
-
Learn how we helped Gaia implement a data lake platform and migrate from a legacy data warehouse environment
-
Qubole on Amazon AWS: Security and Compliance
A Whitepaper of Qubole that how it passionate about making data easily accessible for open data lake platforms while using Amazon AWS for our customer's data with proper security measures & compliance
-
3 Steps to Justify & Reduce The Cost of Your Data Lake
How to position the data lake expenditure to finance.
-
How to Scale New Products with a Data Lake using Qubole
TiVo shares best practices for ingesting, processing, and making available for analysis terabytes of streaming and batch viewership data from millions of households
-
O'Reilly eBook: Creating a Data-Driven Enterprise in Media
-
Using Qubole Presto for Interactive and Ad-Hoc Queries
Tips for when to use Presto versus Apache Spark, and how to enable self-service access to your data lake
-
Modern Data Engineering and The Rise of Apache Airflow
Brief introduction to Apache Airflow, its optimal use cases, and real-world examples
-
O'Reilly ebook: Machine Learning at Enterprise Scale
Real-world data science practitioners offer perspectives and advice on six common Machine Learning problems
-
Running Apache Spark at Scale in the Cloud
Deep dive into the use cases for Apache Spark on Qubole, including ETL and machine learning
-
Migrating to a Modern Cloud-Native Data Lake with Microsoft Azure and Qubole
Benefits of migrating to a cloud-native data lake and how to choose the right data architecture
-
Enterprise-Scale Big Data Analytics on Google Cloud Platform
Why a unified experience with native notebooks, a command workbench, and integrated Apache Airflow are a must.
-
O'Reilly ebook: Financial Governance for Data Processing in the Cloud
A comprehensive guide to understand effective financial governance
-
Why You Need a Cloud Platform to Succeed with Big Data
The benefits of a single cloud platform and centralized access to data
-
Delivering Self-Service Analytics and Discovery from your Data Lake
Best practices for data collaboration and data lake access using SQL
-
O'Reilly ebook: Operationalizing the Data Lake
Best practices for building a cloud data lake operation—from people and tools to processes
-
How to Increase the Scalability of HiveServer2 with Qubole
Technical overview of Qubole's HiveServer2 solution that distributes memory-intensive processes and enables scalability
-
Loading More...