Snowflake Data Engineering with Apache Spark
NEW! Spark 3.3 is now available on Qubole. Qubole’s multi-engine data lake fuses ease of use with cost savings. Now powered by Spark 3.3, it’s faster…
This is Part 3 of 3. Snowflake Spark Connector: Snowflake and Qubole have partnered to…
SQL Joins are a common and critical component of interactive SQL workloads. The Qubole Presto team has worked on two important JOIN optimizations to dramatically…
Open source projects are forked for many reasons, such as: communities that hope to take the project in a different direction (for example, MariaDB). Large…
In response to significant demand from our customers, we are happy to announce a new extension of the Notebooks feature on the Qubole Data Service:…
With the recent announcement of Dashboards, we at Qubole want to make sure that our customers have everything that they need to succeed with using…
UPDATE: Qubole’s Spark tuning tool is now open source and named Sparklens. To contribute, check out the source code from https://github.com/qubole/sparklens. To analyze your Spark applications…
Last week, during the Deep Learning Summit at AWS re:Invent 2017, Terrence Sejnowski (a pioneer of deep learning) succinctly said “Whoever has more data wins”.…
Drug discovery is the process of identifying molecular compounds which are likely to become the active ingredient in prescription medicine. At a high level, it…
In 2017, Qubole saved customers $140M in cloud costs by smartly, and automatically, leveraging all the cloud resources available to business. With a new tool from…
Today, Qubole is announcing the availability of a working implementation of Apache Spark on AWS Lambda. This prototype has been able to show a successful…