Presto Summit India 2019 – “Towards GDPR CCPA compliance with Hive ACID”

Qubole now supports efficient updates and deletes for data stored in cloud data lakes. Users can run inserts, updates, and deletes on transactional Hive tables (tables defined via Apache Hive over files in a data lake) and query those tables from Apache Spark or Presto. Our changes to support reads on such tables from Apache Spark and Presto have been open sourced, and the ongoing work on multi-engine updates and deletes will be open sourced as well. In this video, Shubham Tagra, Sr. Staff Engineer at Qubole, describes the capabilities, design choices, implementation details, and roadmap.