Enabling Spark SQL MERGE via optimized ACID Data Source v0.6.0
We are pleased to announce the 0.6.0 release of ACID Data source for Apache Spark. This release should further empower Data lake users in enterprises…
We are pleased to announce the 0.6.0 release of ACID Data source for Apache Spark. This release should further empower Data lake users in enterprises…
We are pleased to announce the availability of Apache Spark 3.0 in the Qubole environment. Spark 3.0 release comes with a lot of exciting new…
Data Engineers and Enterprises continue to struggle with Stream Processing at scale. In our extensive discussions with customers and partners, we repeatedly found the following…
Structured Streaming API, introduced in Apache Spark version 2.0, enables developers to create stream processing applications. These APIs are different from DStream-based legacy Spark Streaming…
Structured Streaming (SS) is one of the core components of Apache Spark. As part of the Spark on Qubole offering, our customers can build and…
AWS recently announced Managed Streaming for Kafka (MSK) at AWS re:Invent 2018. Apache Kafka is one of the most popular open source streaming message queues.…
Introduction In the first blog post of the series, We gave an overview of the data pipeline required to find the trending topics in Wikipedia. In…
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.