Presto Summit India 2019 - "Towards GDPR CCPA compliance with Hive ACID"

September 30, 2019

Qubole now supports efficient updates and deletes for data stored in cloud data lakes. Users can insert, update, and delete rows in transactional Hive tables (defined over files in a data lake) through Apache Hive, and query the same tables from Apache Spark or Presto. Our changes to support reads on such tables from Apache Spark and Presto have been open sourced, and the ongoing work on multi-engine updates and deletes will be open sourced as well. In this video, Shubham Tagra, Sr. Staff Engineer at Qubole, describes the capabilities, design choices, implementation details, and future roadmap.
