Data Lake Business Intelligence

Tableau Visualization at Cloud Scale

Qubole is an open, simple, and secure data lake platform for machine learning, streaming analytics, data exploration, and ad-hoc analytics. No other platform radically simplifies data management, data engineering, and run-time services like Qubole. We enable reliable, secure data access and collaboration among users while reducing time to value, improving productivity, and lowering cloud data lake costs from day one.

Business Intelligence

Qubole includes a native connector for the Tableau server and desktop. Together, Qubole and Tableau simplify Business Intelligence (BI) on any data lake with any file format, for any type of query, enabling easier, more collaborative visual analytics on the cloud.

The Qubole native connector for Tableau delivers unparalleled response time—powered by Presto on Qubole—with the industry’s best cloud infrastructure cost controls. Qubole eliminates the painful compute capacity planning required to meet SLAs, automatically starting, stopping, and rebalancing clusters based on user demand. This prevents resource competition or performance impact for Tableau users that need to analyze the same data simultaneously.

Increased openness and flexibility

Tableau customers have choice and flexibility, as Qubole’s native connector allows querying of unstructured or semi-structured data on any data lake regardless of the storage file format – CSV, JSON, AVRO, or Parquet.

Performance boost for any type of query

Leveraging the power of optimized Presto on Qubole – a high-performance, distributed SQL query engine – Tableau users can query multiple big data sources with industry-leading response times without changing their normal workflow.

Abstraction from administrative complexity

Qubole manages cloud infrastructure automatically based on workloads, SLA, or job priority, eliminating the need for manual administration or rebalancing of compute clusters with changing BI needs.

Financial governance

With Qubole’s native workload-aware autoscaling and intelligent cluster management capabilities, Tableau customers avoid data processing cost overruns with guaranteed compute resources for their queries at all times.

For Data Analysts

  • Visualize big data on any cloud data lake
  • Petabyte-level big data queries from multiple sources
  • No resource competition or performance impact for simultaneous user queries
  • Refresh, publish, and schedule any worksheet with unstructured or semi-structured data
  • No administrative burden with best financial governance
  • Role-based controls for users and groups

For Data Engineers

  • Native Connector for Tableau Server and Desktop leveraging Presto on Qubole
  • Performance boost via automatic join reordering and dynamic filtering
  • Automatic provisioning, management, and scalability of infrastructure on any cloud
  • Native API / SDK for all commands
  • Predefined or granular custom IAM roles
  • Separate access roles for compute and storage

Automated Cluster Lifecycle Management

Qubole allows you to efficiently and autonomously manage all major functions of cluster lifecycle — configure, provision, monitor, scale, optimize, and recover — through automation. Qubole’s built-in financial governance capabilities provide immediate visibility into platform usage costs with advanced tools for budget allocation, chargeback, monitoring, and control of your cloud spend.

Workload-Aware Autoscaling

Qubole’s workload-aware autoscaling upscales, downscales, and rebalances clusters with a complete context of the workload, SLA, and priority of each job. Aggressive Cluster Downscaling uses intelligent self-learning algorithms such as Smart Victim Selection, Graceful Downscaling, and Container Packing to balance workloads across active nodes and decommission idle ones without the risk of data loss.

Intelligent Low-cost Compute Management

Qubole’s intelligent management of low-cost compute nodes* allows organizations to optimize their use, resulting in drastic cost savings. Qubole provides policy-based automation of low-cost compute usage to balance performance, cost, and SLA compliance.

*AWS Spot instances; Google’s Preemptible VMs; or Azure Low-priority VMs

Heterogeneous Cluster Configuration

Qubole’s Heterogeneous Cluster Configuration for on-demand and Preemptible VMs allows you to pick the most cost-effective combination for your job through automation. Qubole enables you to configure heterogeneous clusters by mixing nodes of multiple instance types, delivering much greater data processing efficiency.

GUIDE
Qubole - Tableau Integration Guide