Mastering Data Governance on Cloud Data Lakes with Multiple Engines
Qubole data privacy and integrity experts cover how to maintain data integrity and privacy of data residing in data lakes using various open-source engines.
Big Data Activation Report
The data on big data -- what engines are used most, for what, and which are the rising stars.
Apache Sqoop 1.4.7 – 9 reasons why you need it
The sixth release of Apache Sqoop i.e. 1.4.7 is out! This is one of the most significant updates to the Sqoop platform. We give you… The post Apache Sqoop 1.4.7 – 9 reasons why you need it...
Hive on Qubole runs 4x faster than Hive on Alternative Platforms
Introduction ETL workloads form a major component of big data processing at any data-driven organization – from SMBs to enterprises, and ETL data pipelines at… The post Hive on Qubole runs 4x...
Ensighten: Building a world-class digital advertising analytics platform using Qubole
Ensighten was able to decouple their compute from storage and handle user-level management and permissions across a variety of Spark, Hadoop and Presto with Qubole
DataXu Uses Qubole to Make Big Data Cloud Querying, Highly Available, and Efficient
By using Qubole Data Platform, DataXu can put its big data processing tasks on auto-pilot
Komli Media Improves Utilization with Premium Big Data Platform Qubole
Komli saw improvements in big data processing, lower total cost of ownership, faster performance and unlimited scale at a lower cost with Qubole
TubeMogul (Adobe) Delivers Big Data Insights at Enterprise Scale
TubeMogul was able to scale up to meet the demands of queries against large data sets with as many as 30 users running queries simultaneously.
Presto Summit India 2019 - "Towards GDPR CCPA compliance with Hive ACID"
Qubole now supports efficient updates and deletes for data stored in Cloud data lakes. Users can make inserts, updates and deletes on transactional Hive Tables—defined over files in a data lake via Ap
Qubole Open-Sources Multi-Engine Support for Updates and Deletes in Data Lakes
Qubole now supports efficient updates and deletes for data stored in Cloud data lakes. Users can make inserts, updates and deletes on transactional Hive Tables—defined… The post Qubole...
Introducing Hive 3.1.1 in Qubole
Qubole is the first and only vendor to deliver Hive 3.1.1 in the cloud
Building a Data Lake the Right Way
Key considerations for building a scalable transactional data lake Data-driven companies are driving rapid business transformation with cloud data lakes. Cloud data lakes are enabling… The post...
How to Increase the Scalability of HiveServer2 with Qubole
Technical overview of Qubole's HiveServer2 solution that distributes memory-intensive processes and enables scalability
Qubole Security Update: Role-Based Access for Presto, Spark, and Hive Commands
Restrict the visibility of commands to other users in the Qubole account by setting command access to private
Auto Tuning Twitter Hadoop Jobs (Or: Don’t Touch That Analytics Dial!) Data Platforms 2018
Speakers: - Ben Pence, Software Engineer, Twitter - Anton Panasenko, Software Engineer, Twitter Presentation: Every day at Twitter, hundreds of thousands of Hadoop jobs transform and aggregate petaby
Embrace Big Data Choice: Curate and Analyze Data with Hive, Spark, and Presto
The big data ecosystem is insanely complex — just making sense of the right tools and technologies can be more difficult than data mining itself.… The post Embrace Big Data Choice: Curate and...
Evolution of Hadoop
Over the course of the next month, we will be going deeper into some of the trends uncovered in our 2018 Big Data Activation Report.… The post Evolution of Hadoop appeared first on Qubole.
Hive Performance – 10 Best Practices for Apache Hive
How to scale Apache Hive and make the most of Hive performance