Shefali Aggarwal

  • Part 3: Transactions on the Data Lake

    Data Lakes are becoming increasingly central to the analytical operations of organizations. This brings in many more ‘transactional’ requirements on the pipeline architecture and the…

  • Architecting Data Lakes for Scale and Speed – The Data Lake Summit Speaker Lineup

    Cloud data lakes are enabling new business models and near real-time analytics to support better decision making. However, as the number of workloads migrating to…

  • Data Lake TCO Optimization – The Data Lake Summit Speaker Lineup

    Running ad hoc analytics, streaming analytics, and machine learning workloads in the cloud offers unique cost, performance, and time-to-value advantages. But the unpredictability…

  • Part 2: Tuning the Data Ingestion process

    In Part 1 of this series, we briefly touched upon the various design considerations to be made when architecting the Data Lake. We saw how…

  • Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup

    Today’s applications for machine learning and real-time predictive analytics require a robust set of capabilities from the underlying data platform. These must meet the growing…

  • Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup

    Artificial Intelligence and machine learning workloads leverage multiple data formats that are a combination of batch and real-time and require scalable computing resources. Leveraging data…

  • Enhanced Network Security with AWS PrivateLink on Qubole

    Increase data security and simplify the infrastructure with Qubole. About Qubole Open Data Lake Platform: Qubole is an open and secure data lake platform for…

  • Part 1: Ingestion into the Data Lake

    Data Lakes are a core pillar in an organization’s data strategy. Data lakes make organizational data from different sources accessible to various end-users like business…

  • Qubole University Launches Badge Program

    For decades our desks were covered in trophies, certificates, and medals demonstrating our accomplishments, achievements, and competencies. Over time, these methods of recognition have… The...

  • Introducing Capacity Reservation for Application Master to increase Workload Reliability despite Spot Interruptions

    AWS Spot instances reduce cloud costs by up to 90% but can be interrupted by AWS at any given time, causing running workloads to fail.…

  • Terraforming the Open Data Lake

    (Image credits: https://science.howstuffworks.com/terraforming.htm) The Qubole Open Data Lake Platform: Qubole is the open data lake company that provides a simple and secure data lake platform… The...

  • Columnar Format in Data Lakes For Dummies

    Columnar data formats have become the standard in data lake storage for fast analytics workloads as opposed to row formats. Columnar formats significantly reduce the…

  • Introducing Qubole Release 59

    Qubole regularly releases its software for processing petabytes of data on the cloud through major releases once a quarter. This is in addition to several…

  • How to Optimize Costs in a Changing World

    Last week, we welcomed our customers Justin Wainwright, Systems Analyst at Oracle Data Cloud, and Rajit Saha, Director of Data Platform at LendingClub, to discuss…

  • Rails: Why Upgrading Matters – Part 2

    This is Part 2 of a two-part blog series on this topic. You can read Part 1 here. Rollout Strategy: We have different tiers in… The post Rails: Why Upgrading Matters – Part 2 appeared first on Qubole.

  • Ruby on Rails: Why Upgrading Matters – Part 1

    Ruby on Rails (or Rails) is a web development framework that gives Rails developers an optimized experience to write their (Ruby) code. Rails is one…

  • How to Optimize Spark Applications for Performance using Qubole Sparklens

    This final part of the three-part Spark optimization series explains how a Spark application can be optimized for performance by using Qubole Sparklens. The…

  • Spark Cluster Optimization for Cost, Reliability and Performance

    How to Optimize Spark Clusters on Qubole for Cost, Reliability, and Performance: This second blog from the three-part series explains how a Spark cluster…

  • Maximizing Spot Utilization by Leveraging Qubole Heterogeneous Clusters

    How Qubole Maximizes Spot Utilization and Reduces Costs: One of our customers, a large enterprise cloud content management company, runs several sophisticated machine learning (ML) predictive...

  • Using Resource Groups to Dynamically Size Presto Clusters on Qubole

    As a best practice, we recommend that users create a few large Presto clusters that are shared between different teams, instead of creating multiple small clusters…
