Tech Blog

Cloud-native Big Data Activation Platform

  • Spark Cluster Optimization for Cost, Reliability and Performance

    How to Optimize Spark Clusters on Qubole for Cost, Reliability and Performance. This second blog from the three-part series explains how a Spark cluster…

  • Maximizing Spot Utilization by Leveraging Qubole Heterogeneous Clusters

    How Qubole Maximizes Spot Utilization and Reduces Costs. One of our customers, a large enterprise cloud content management company, runs several sophisticated machine learning (ML) predictive...

  • How to Install Apache Airflow to Run Different Executors

    Now that we know about Airflow’s different components and how they interact, let’s start with setting up Airflow on our workstation so that we can…

  • Spot Interruption Handling with Presto on Qubole, Save 60% on Cost

    Spot nodes on AWS (and preemptible VMs on Google Cloud Platform, GCP) are a great way to reduce your total cost of ownership (TCO) for…

  • Understand Apache Airflow’s Modular Architecture

    Figure: Airflow architecture diagram for a Celery Executor-based configuration. Before we start using Apache Airflow to build and manage pipelines, it is important to understand…

  • Using Resource Groups to Dynamically Size Presto Clusters on Qubole

    As a best practice, we recommend that users create a few large Presto clusters shared between different teams, instead of creating multiple small clusters…

  • Qubole enhances Presto Cluster Monitoring with Datadog

    Qubole has provided Datadog as an integrated monitoring service for its clusters, including Presto clusters. This brings many improvements compared to the “old approach” for…
