Under the Hood: Building AIR at Qubole
In one of our previous blog posts on AIR Infrastructure, we discussed the various data sources for AIR and the architecture for collecting these…
SQL Joins are a common and critical component of interactive SQL workloads. The Qubole Presto team has worked on two important JOIN optimizations to dramatically…
Last week, during the Deep Learning Summit at AWS re:Invent 2017, Terrence Sejnowski (a pioneer of deep learning) succinctly said “Whoever has more data wins”.…
Qubole is proud to announce that our flagship Qubole Data Service (QDS) will switch to billing all customer usage in “per-second” increments for all AWS…
Nexla is a data operations platform that focuses on enabling data movement between companies with security and scale. The platform is simple enough for the…
Jupyter™ notebooks are among the most popular IDEs for Python users. Traditionally, most Jupyter users work with small or sampled datasets that…
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. In our previous post, we wrote about an on-demand ETL pipeline with AWS Lambda and Qubole to facilitate event-based processing of long-running ETL…
The new survey reveals big data initiatives at risk due to high demand, false confidence and immature processes. SANTA CLARA, Calif. – Mar. 9, 2017…
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. Serverless architecture allows you to execute code without requiring the traditional cost of compute resources. Each component of your…
This post is authored by Ashish Thusoo, Co-Founder and Chief Executive Officer, Qubole. We are honored to be named one of Entrepreneur’s Top 50 Company…