Qubole and Google Join Forces to Deliver Unified User Experience for Apache Spark and Hadoop

Start Free Trial
April 10, 2019 by Updated April 3rd, 2024

I’m very excited to announce our expanded partnership with Google Cloud Platform (GCP). We have joined forces to offer an enterprise self-service data platform powered by optimized versions of Apache Spark and Hadoop, with unified tools for data science and data engineering running on GCP.

Why Now?

My co-founder Joydeep and I have always felt strongly that the future of big data is on the cloud. As a result, we created a platform with the flexibility to use the technologies and frameworks that best fit your environment today as well as how that environment will look tomorrow.

In recent years we’ve seen not only the expansion of cloud usage but also a discernible shift toward a multi-cloud world where customers demand choice. We recognize that Google Cloud offers a compelling choice for customers with large-scale data processing needs.

Qubole remains focused on making big data in the cloud accessible to every user who needs it. In the spirit of continuing that mission, we have expanded our technology integration and partnership with GCP to offer our comprehensive cloud-native data platform on Google Cloud.

Why Qubole + GCP?

As data processing needs on Google Cloud increase, we recognize the demand for an easy-to-use, self-service platform with a rich user interface for data scientists and data engineers. The new capabilities of Qubole on GCP will significantly improve how users collaborate, analyze, and interact with complex forms of data. The unique benefits of Qubole and GCP include:

Unified Experience for Data Science and Data Engineering

Access a native workbench that includes notebooks, dashboards, and a common user interface to execute any command. This common user interface enables data engineers and data scientists to work and collaborate using familiar tools, languages, and performance-optimized data processing engines such as Apache Spark and Hadoop.

Low Cost and High Reliability

Automatic upscaling, rebalancing, and aggressive downscaling of clusters with the complete context of workload, SLA, and priority of each job. Includes intelligent autonomous and policy-based management of regular compute instances or Preemptible VMs.

Enterprise-Grade Security

Fine-grained predefined or custom identity and access management roles to separate compute and data access. Qubole also offers role-based access controls for secure collaboration in notebooks and commands.

Easy Access to Many Data Sources

Connectors for Google Cloud Storage, Google BigQuery, MySQL, Postgres, MongoDB, and more.

Day-1 Self-Service Access Through the GCP Marketplace

Fast access to Qubole through the GCP Marketplace, with automatic account setup, Google Cloud authentication, simplified user onboarding, and day-one access to the service.

24×7 Support for Open Source Engines

Leverage highly optimized versions of Apache Spark and Hadoop with advanced caching and performance optimizations and frameworks such as Airflow. Users will also have access to dedicated support and engineering teams specialized by engine.

Learn more about the new service in our technical blog on Qubole on GCP. You can also read our Press Release, visit us at booth #S1261 at the Google Cloud Next ’19 conference exhibit, or Test Drive Qubole on GCP for free.

Start Free Trial
Read Managing Costs with Financial Governance While Democratizing Data at Scale