Additional Managed Services

Qubole offers a growing selection of managed open-source big-data technologies to complement its core managed services (Hadoop, Hive, Spark, and Presto). These services help data teams simplify, test, and automate their most challenging use cases, particularly those that involve complex ETL jobs.

 

Apache Airflow

Author workflows as directed acyclic graphs (DAGs) of tasks and easily integrate them in your data management processes.

Available on AWS

Learn more
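
As a rough illustration of what "a workflow as a DAG of tasks" means, the sketch below uses only Python's standard library, not Airflow's own API. The task names (`extract`, `transform`, `load`, `report`) and the `run_order` helper are hypothetical and chosen purely for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical ETL tasks: each key maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "report": {"load"},
}


def run_order(deps):
    """Return one valid execution order for a DAG of task dependencies.

    Raises graphlib.CycleError if the dependencies contain a cycle,
    i.e. if the graph is not actually acyclic.
    """
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
print(order)
```

A scheduler built on this idea runs each task only after everything it depends on has finished, which is why the graph must be acyclic.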

Apache Pig

Use Pig Latin to test and develop ETL workflow scripts, then move them into production on large-scale clusters.

Available on AWS

Learn more

Cascading

Future-proof the big-data querying capabilities of your Java applications.

Available on AWS

Learn more

Best-in-class big data technologies made autonomous

Our additional managed services are self-managing and self-optimizing implementations of the original open-source projects, designed to take full advantage of the underlying Autonomous Data Platform capabilities.

Leverage the platform’s AIR (Alerts, Insights, Recommendations) capabilities so that data teams can focus on outcomes instead of the platform.

Agent technology augments the original open-source engines with a self-managing and self-optimizing platform:

Cloud-optimized for faster workload performance

Smarter object storage access through split computation, write batching, pre-fetching, multiple caching layers, and SSD caching

Easier to integrate with existing data sources and tools

  • ODBC/JDBC drivers
  • Database connectors (MySQL, SQL Server, Oracle DB, RDS, Redshift, Kinesis, and many others)
  • Exhaustive dictionary of REST APIs for application integration

Best-in-class security

  • HDFS and SSL encryption
  • SAML Authentication
  • VPC support
  • Dual IAM roles

Our Analytics Team uses “Pig As a Service” from Qubole very extensively. The Qubole QDS UI is very intuitive and makes us very productive, letting us write Pig queries faster. The ability to test a Pig script on a smaller data set without changing the input path is really innovative and very helpful.

Shailesh Garg, Sr. Engineering Manager, Komli Media