-
Part 3: Transactions on the Data Lake
Data Lakes are becoming increasingly central to the analytical operations of organizations. This brings in many more ‘transactional’ requirements on the pipeline architecture and the… The post...
-
Architecting Data Lakes for Scale and Speed – The Data Lake Summit Speaker Lineup
Cloud data lakes are enabling new business models and near real-time analytics to support better decision making. However, as the number of workloads migrating to… The post Architecting Data Lakes...
-
Data Lake TCO Optimization – The Data Lake Summit Speaker Lineup
Running ad hoc analytics, streaming analytics, and machine learning workloads in the cloud offer unique cost, performance, and time to value advantages. But the unpredictability… The post Data...
-
Part 2: Tuning the Data Ingestion process
In Part 1 of this series, we briefly touched upon the various design considerations to be made when architecting the Data Lake. We saw how… The post Part 2: Tuning the Data Ingestion process...
-
Data Lakes and Data Warehouses – The Data Lake Summit Speaker Lineup
Today’s applications for machine learning and real-time predictive analytics require a robust set of capabilities from the underlying data platform. These must meet the growing… The post Data...
-
Data Lakes for Artificial Intelligence and Machine Learning – The Data Lake Summit Speaker Lineup
Artificial Intelligence and machine learning workloads leverage multiple data formats that are a combination of batch and real-time and require scalable computing resources. Leveraging data… The...
-
10 Reasons to Attend The Data Lake Virtual Summit
The Data Lake Summit, brought to you by Qubole, in collaboration with AWS and Google Cloud, is scheduled to be held during October 13-14, 2020.… The post 10 Reasons to Attend The Data Lake Virtual...
-
Enhanced Network Security with AWS PrivateLink on Qubole
Increase data security and simplify the infrastructure with Qubole About Qubole Open Data Lake Platform Qubole is an open and secure data lake platform for… The post Enhanced Network Security with...
-
Part 1: Ingestion into the Data Lake
Data Lakes are a core pillar in an organization’s data strategy. Data lakes make organizational data from different sources, accessible to various end-users like business… The post Part 1:...
-
Qubole University Launches Badge Program
For decades our desks were covered in trophies, certificates, and medals demonstrating our accomplishments, achievements, and competencies. Over the time, these methods of recognition have… The...
-
Announcing The Data Lake Virtual Summit 2020!
We’re pleased to launch The Data Lake Summit–the definitive virtual conference for all things Data Lake! The two-day conference by Qubole will be held on… The post Announcing The Data Lake Virtual...
-
Data Lake and Data Warehouse- Collision or Synergies
As the volume, velocity, and variety of data increases, the choice of the right data platform to manage data has never felt more important. Should… The post Data Lake and Data Warehouse- Collision...
-
Introducing Capacity Reservation for Application Master to increase Workload Reliability despite Spot Interruptions
AWS Spot instances reduce cloud costs by up to 90% but can be interrupted by AWS at any given time causing running workloads to fail.… The post Introducing Capacity Reservation for Application...
-
Terraforming the Open Data Lake
Image credits: https://science.howstuffworks.com/terraforming.htm The Qubole Open Data Lake Platform Qubole is the open data lake company that provides a simple and secure data lake platform… The...
-
Columnar Format in Data Lakes For Dummies
Columnar data formats have become the standard in data lake storage for fast analytics workloads as opposed to row formats. Columnar formats significantly reduce the… The post Columnar Format in...
-
Introducing Qubole Release 59
Qubole regularly releases its software for processing petabytes of data on the cloud through major releases once a quarter. This is in addition to several… The post Introducing Qubole Release 59...
-
How to Optimize Costs in a Changing World
Last week, we welcomed our customers Justin Wainwright, Systems Analyst at Oracle Data Cloud and Rajit Saha, Director of Data Platform at LendingClub, to discuss… The post How to Optimize Costs in...
-
Rails: Why Upgrading Matters – Part 2
This is Part 2 of a 2 blog series on this topic. You can read Part 1 here. Rollout Strategy: We have different tiers in… The post Rails: Why Upgrading Matters – Part 2 appeared first on Qubole.
-
What is an Open Data Lake?
A data lake is a system or repository that stores data in its raw format as well as transformed trusted datasets and provides both programmatic… The post What is an Open Data Lake? appeared first...
-
Ruby on Rails: Why Upgrading Matters – Part 1
Ruby on Rails (or Rails) is a web development framework that gives Rails developers an optimized experience to write their (Ruby) code. Rails is one… The post Ruby on Rails: Why Upgrading Matters...
-
Loading More...