Presto is a high-performance, distributed SQL query engine for big data. Presto was originally designed and developed at Facebook for their data analysts to run interactive queries on its large data warehouse in Apache Hadoop.
Presto’s architecture allows users to query a variety of data sources such as Hadoop, AWS S3, Alluxio, MySQL, Cassandra, Kafka, and MongoDB. One can even query data from multiple data sources within a single query.
Qubole has been offering a managed Presto service since 2014. We offer our customers multiple Presto versions and maintain a regular upgrade process. Qubole’s managed Presto offering has been tailored to the needs of our customers. Qubole blends the latest features form the open source community with Qubole’s proprietary solutions that boost performance, lower cost, improve user experience, and provide smooth administration of Presto clusters.
Ease of Use
|Graceful Low-cost Compute Shutdown *|
|Spot (AWS) Rebalancing|
|Spot Block (AWS) Support|
|Aggressive Downscaling with graceful decommissioning|
|Smart Query Retry|
|Cost Explorer & Analysis|
(prevent runaway queries)
* AWS Spot, Azure Lo-cost VMs, Google Pre-emptible VMs
|Compute Optimization for joins and filters|
|Required Worker Node|
|S3 Direct writes optimization|
|S3 listing optimization|
|Rubix (distributed caching)|
|Dashboarding (Presto Notebook)|
|Collaboration and sharing|
|Monitoring (Ganglia, DataDog, etc)|
|Intelligent Log Access|
|Access control for notebooks, clusters, jobs, structured data|
|Audit end-user activity logs|
|Apache Ranger Integration|
|SSO with SAML 2.0 support|
|HIPAA, SOC2 Type2, ISO-27001 compliant environments|
|Custom Connector with BI tools (Tableau, Looker, etc.)|
|AWS Glue Support|
|Data Source Connectors (Redshift, Postgres, Kinesis*, etc)|
* Kinesis is being contributed back to OSS
|24/7 support from our Presto experts|
|Support multiple versions of Presto|
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.