Shefali Aggarwal

  • Part 3: Transactions on the Data Lake

    Part 3: Transactions on the Data Lake

    Data Lakes are becoming increasingly central to the analytical operations of organizations.  This brings in many more ‘transactional’ requirements on the pipeline architecture and the… The post...

    Read Article
  • Architecting Data Lakes for Scale and Speed – The Data Lake Summit Speaker Lineup

    Architecting Data Lakes for Scale and Speed – The Data Lake Summit Speaker Lineup

    Cloud data lakes are enabling new business models and near real-time analytics to support better decision making. However, as the number of workloads migrating to… The post Architecting Data Lakes...

    Read Blog
  • Data Lake TCO Optimization – The Data Lake Summit Speaker Lineup

    Data Lake TCO Optimization – The Data Lake Summit Speaker Lineup

    Running ad hoc analytics, streaming analytics, and machine learning workloads in the cloud offer unique cost, performance, and time to value advantages. But the unpredictability… The post Data...

    Read Blog
  • Part 2: Tuning the Data Ingestion process

    Part 2: Tuning the Data Ingestion process

    In Part 1 of this series, we briefly touched upon the various design considerations to be made when architecting the Data Lake. We saw how… The post Part 2: Tuning the Data Ingestion process...

    Read Article
  • Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup

    Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup

    Today’s applications for machine learning and real-time predictive analytics require a robust set of capabilities from the underlying data platform. These must meet the growing… The post Data...

    Read Blog
  • Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup

    Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup

    Artificial Intelligence and machine learning workloads leverage multiple data formats that are a combination of batch and real-time and require scalable computing resources. Leveraging data… The...

    Read Blog
  • 10 Reasons to Attend The Data Lake Virtual Summit

    10 Reasons to Attend The Data Lake Virtual Summit

    The Data Lake Summit, brought to you by Qubole, in collaboration with AWS and Google Cloud, is scheduled to be held during October 13-14, 2020.… The post 10 Reasons to Attend The Data Lake Virtual...

    Read Blog
  • Enhanced Network Security with AWS PrivateLink on Qubole

    Enhanced Network Security with AWS PrivateLink on Qubole

    Increase data security and simplify the infrastructure with Qubole About Qubole Open Data Lake Platform Qubole is an open and secure data lake platform for… The post Enhanced Network Security with...

    Read Article
  • Part 1: Ingestion into the Data Lake

    Part 1: Ingestion into the Data Lake

    Data Lakes are a core pillar in an organization’s data strategy. Data lakes make organizational data from different sources, accessible to various end-users like business… The post Part 1:...

    Read Article
  • Qubole University Launches Badge Program

    Qubole University Launches Badge Program

    For decades our desks were covered in trophies, certificates, and medals demonstrating our accomplishments, achievements, and competencies. Over the time, these methods of recognition have… The...

    Read Blog
  • Announcing The Data Lake Virtual Summit 2020!

    Announcing The Data Lake Virtual Summit 2020!

    We’re pleased to launch The Data Lake Summit–the definitive virtual conference for all things Data Lake! The two-day conference by Qubole will be held on… The post Announcing The Data Lake Virtual...

    Read Blog
  • Data Lake and Data Warehouse- Collision or Synergies

    Data Lake and Data Warehouse- Collision or Synergies

    As the volume, velocity, and variety of data increases, the choice of the right data platform to manage data has never felt more important. Should… The post Data Lake and Data Warehouse- Collision...

    Read Blog
  • Introducing Capacity Reservation for Application Master to increase Workload Reliability despite Spot Interruptions

    Introducing Capacity Reservation for Application Master to increase Workload Reliability despite Spot Interruptions

    AWS Spot instances reduce cloud costs by up to 90% but can be interrupted by AWS at any given time causing running workloads to fail.… The post Introducing Capacity Reservation for Application...

    Read Article
  • Terraforming the Open Data Lake

    Terraforming the Open Data Lake

    Image credits: https://science.howstuffworks.com/terraforming.htm The Qubole Open Data Lake Platform Qubole is the open data lake company that provides a simple and secure data lake platform… The...

    Read Article
  • Columnar Format in Data Lakes  For Dummies

    Columnar Format in Data Lakes For Dummies

    Columnar data formats have become the standard in data lake storage for fast analytics workloads as opposed to row formats. Columnar formats significantly reduce the… The post Columnar Format in...

    Read Article
  • Introducing Qubole Release 59

    Qubole regularly releases its software for processing petabytes of data on the cloud through major releases once a quarter. This is in addition to several… The post Introducing Qubole Release 59...

    Read Article
  • How to Optimize Costs in a Changing World

    How to Optimize Costs in a Changing World

    Last week, we welcomed our customers Justin Wainwright, Systems Analyst at Oracle Data Cloud and Rajit Saha, Director of Data Platform at LendingClub, to discuss… The post How to Optimize Costs in...

    Read Blog
  • Rails: Why Upgrading Matters – Part 2

    Rails: Why Upgrading Matters – Part 2

    This is Part 2 of a 2 blog series on this topic.  You can read Part 1 here. Rollout Strategy:  We have different tiers in… The post Rails: Why Upgrading Matters – Part 2 appeared first on Qubole.

    Read Article
  • What is an Open Data Lake?

    A data lake is a system or repository that stores data in its raw format as well as transformed trusted datasets and provides both programmatic… The post What is an Open Data Lake? appeared first...

    Read Blog
  • Ruby on Rails: Why Upgrading Matters – Part 1

    Ruby on Rails: Why Upgrading Matters – Part 1

    Ruby on Rails (or Rails) is a web development  framework that gives Rails developers an optimized experience to write their (Ruby) code. Rails is one… The post Ruby on Rails: Why Upgrading Matters...

    Read Article
  • loading
    Loading More...