Qubole Team, Qubole, Author at Qubole

Apache Spark On Qubole: Sky Is the Limit

By Shefali Aggarwal and Qubole | September 13, 2016

At Qubole, we’ve made significant progress on our adoption of Spark on QDS with new features and scalability. Here are some recent stats pertaining to…

SparkSQL in the Cloud: Optimized Split Computation

By Shefali Aggarwal and Qubole | August 30, 2016

One of the fundamental differences between Big Data processing in the cloud and on-premises is how the data…

Apache Spark AWS Cloud Data Engineer Performance SparkSQL

The Value of Auto-scaling

By Shefali Aggarwal and Qubole | August 16, 2016

In a recent blog post, we benchmarked auto-scaling and demonstrated that an auto-scaling cluster was a lot less expensive and only a little bit…

Apache Spark Autoscaling Cost Management Data Admin

The Cloud Advantage: Decoupling Storage and Compute

By Qubole Team and Qubole | August 11, 2016

When Hadoop is deployed with an on-premises architecture, compute and storage are combined. As a result, compute and storage must be scaled together and the…

Cloud Computing Cost Management Data Admin Data Engineer Scalability

Qubole’s Notebook Integration with Github is Generally Available

By Shefali Aggarwal and Qubole | August 10, 2016

We are excited to announce the general availability of GitHub integration for QDS Notebooks. GitHub is an effective way to collaborate on development projects. GitHub…

Github Integration Qubole Notebook

Benchmarking Auto-scaling Spark Clusters

By Shefali Aggarwal and Qubole | August 8, 2016

Have you ever had trouble deciding how large to make a cluster? Do you sometimes feel like you’re wasting money when a cluster isn’t…

Apache Spark Autoscaling Data Admin Performance

Up to 80% savings with AWS Spot Instances

By Shefali Aggarwal and Qubole | July 21, 2016

In a previous post, we outlined the case for selecting cloud infrastructure over an on-premises deployment for managing big data workloads. Taking advantage of Spot…

Autoscaling AWS Cloud Cost Management Data Admin Spot Instances

Optimize Queries with Materialized Views and Quark

By Shefali Aggarwal and Qubole | July 14, 2016

This blog post explores how queries can be sped up by keeping optimized copies of the data. First, we will explore the techniques and benchmark…

Big Data Data Admin Data Analyst Performance Presto Quark

Build or Buy: The Case for Cloud Infrastructure

By Shefali Aggarwal and Qubole | July 7, 2016

Managing big data creates several challenges for data infrastructure teams: managing “bursty” and unpredictable workloads; coordinating ad hoc and batch workloads; storing rapidly growing data…

Cloud Infrastructure Data Infrastructure

Quark: Control and Optimize SQL Across Hadoop and RDBMS

By Shefali Aggarwal and Qubole | June 27, 2016

One of the important functions of a database administrator is to manage storage structures to optimize performance in a relational database. Admins use tables, views,…

Apache Hadoop Apache Hive Data Admin Databases ODBC/JDBC Performance Quark

Qubole Makes Key Hires to Leadership Team to Support Accelerating Market Demand

By Shefali Aggarwal and Qubole | June 21, 2016

The company appoints David Hsieh as Senior Vice President of Marketing and Ken Tamura as Vice President of Finance. MOUNTAIN VIEW, CA (Marketwired, Jun 21,…

Cloud data warehouse

RubiX: Fast Cache Access for Big Data Analytics on Cloud Storage

By Shefali Aggarwal and Qubole

Qubole introduced first-generation Caching for S3 files in Presto in 2014 and documented the observed performance gains. In a nutshell: for CPU-efficient engines like Spark…

Apache Hadoop Apache Hive Apache Spark AWS Cloud Caching Performance Presto RubiX
