Qubole’s HBase-as-a-Service is Generally Available on AWS
- By Rajat Venkatesh
- June 9, 2016
The HBase team at Qubole is happy to announce the general availability of QDS HBase-as-a-Service on AWS. Through the Beta program, QDS has helped administrators run HBase at scale in production with higher uptime and reliability while exploiting cloud elasticity for more agile deployments.
In building our HBase offering, we worked closely with early customers and prospects to solve common pain points associated with running HBase in the cloud. One of those customers was Ola, an online transportation network company based in India.
“Qubole’s HBase-as-a-service helped us get out of the box, production-grade HBase clusters up and running quickly. Using QDS for provisioning and management of the HBase clusters freed up our time to focus on building HBase applications. The scheduled backups for disaster recovery ensures higher availability through quicker recovery, while the ability to add nodes easily to the cluster allowed us to scale our deployment based on workload.” – Vaibhav Shah, Director of Engineering at Ola
Key features of our HBase-as-a-Service offering include:
- Scheduled Backups and Restore to any cluster: HBase data is backed up to S3 regularly, and you can configure backup schedules in Qubole’s cluster management UI. All of the data, or only specific tables, can be restored to any other HBase cluster. This feature is useful not only for disaster recovery but also for bringing up test or dev clusters and clusters for one-off analysis.
- Cluster Management: HBase clusters in QDS are elastic! Our users regularly increase the capacity of their clusters. This is a boon for capacity planners, who can react to workload patterns as they emerge, saving time and money.
- Zero-impact node rotations: By carefully orchestrating data compaction and HDFS block transfers prior to reassigning region servers, QDS minimizes the impact of adding, removing, or replacing nodes in an HBase cluster.
- High-Availability Configurations: Putting together a high availability HBase cluster in the cloud can be daunting. QDS provides prebuilt templates for deploying HA HBase clusters through a single click.
- Monitoring: QDS provides multiple integrations to use HBase and Hadoop2 metrics for monitoring and alerting. We have customers pushing metrics to an in-house Graphite cluster as well as third-party providers such as Datadog.
- Hannibal for HBase: Hannibal is a monitoring tool that focuses on HBase regions and compactions. It is useful for understanding the impact of data skew and data redistribution.
- Apache Zeppelin: HBase clusters run Apache Zeppelin. Users have access to the HBase shell as well as Bash through Zeppelin notebooks, all through a browser-based interface. This feature helps triage an HBase cluster quickly and makes it easy to create and manage runbooks for administration.
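To make the monitoring item above more concrete, here is a minimal sketch of what pushing a metric to a Graphite cluster can look like at the wire level. It does not show how the QDS integration itself is implemented; it only uses Graphite's plaintext protocol, in which each metric is a line of the form `path value timestamp` sent to a Carbon listener (TCP port 2003 by default). The host name, the metric path, and the helper names `graphite_line` and `send_metrics` are assumptions made for illustration.

```python
import socket
import time


def graphite_line(path, value, timestamp=None):
    # Graphite's plaintext protocol: "<metric path> <value> <unix timestamp>",
    # terminated by a newline.
    if timestamp is None:
        timestamp = int(time.time())
    return f"{path} {value} {int(timestamp)}\n"


def send_metrics(lines, host="graphite.example.internal", port=2003):
    # Hypothetical host; 2003 is Carbon's default plaintext port.
    # Sends already-formatted metric lines over a single TCP connection.
    payload = "".join(lines).encode("ascii")
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(payload)


if __name__ == "__main__":
    # Hypothetical metric path for a region server's request count.
    line = graphite_line("hbase.regionserver.rs1.totalRequestCount", 1250)
    print(line, end="")
    # send_metrics([line])  # uncomment once a reachable Carbon listener exists
```

Keeping formatting separate from sending means the metric lines can be inspected or logged before any network connection is attempted.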
Over and above these features, HBase clusters come with the typical set of QDS goodies such as:
- Optimal out-of-the-box configuration.
- Ability to customize the cluster through node bootstrap. This is useful for installing or running OpenTSDB or Apache Phoenix.
- Interoperability with Hive/Hadoop/Spark clusters on QDS.
- Excellent support.
In the past few months, we’ve seen QDS-managed HBase clusters enter production, and we have launched and now actively run hundreds of HBase nodes. We have an exciting roadmap ahead of us and look forward to working with more customers and the open-source community to bring the benefits of HBase to more users in the cloud.