The Importance of Data Due Diligence
Earlier this month, a study was published indicating that the widely used “Reddit dataset” (released in 2015 by Jason Baumgartner) had significant, previously unidentified gaps.…
Part 1 of 3: Snowflake Big Data. Snowflake and Qubole have partnered to…
NEW! Spark 3.3 is now available on Qubole. Qubole’s multi-engine data lake fuses ease of use with cost-savings. Now powered by Spark 3.3, it’s faster…
Part 3 of 3: Snowflake Spark Connector. Snowflake and Qubole have partnered to…
SQL Joins are a common and critical component of interactive SQL workloads. The Qubole Presto team has worked on two important JOIN optimizations to dramatically…
Open source projects are forked for many reasons such as: Communities that hope to take the project in a different direction. For example MariaDB Large…
In response to significant demand from our customers, we are happy to announce a new extension of the Notebooks feature on the Qubole Data Service:…
With the recent announcement of Dashboards, we at Qubole want to make sure that our customers have everything that they need to succeed with using…
UPDATE: Qubole’s Spark tuning tool is now open source and named Sparklens. To contribute, check out the source code from https://github.com/qubole/sparklens. To analyze your Spark applications…
Last week, during the Deep Learning Summit at AWS re:Invent 2017, Terrence Sejnowski (a pioneer of deep learning) succinctly said “Whoever has more data wins”.…
Drug discovery is the process of identifying molecular compounds which are likely to become the active ingredient in prescription medicine. At a high level, it…
In 2017, Qubole saved customers $140M in cloud costs by smartly and automatically leveraging all the cloud resources available to business. With a new tool from…