IoT with Amazon Kinesis and Spark Streaming on Qubole
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. The Internet of Things (IoT) is increasingly becoming an important topic in the world of application development. This…
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. The Internet of Things (IoT) is increasingly becoming an important topic in the world of application development. This…
Execution engines like M/R, Tez, Presto, and Spark provide a set of knobs or configuration parameters that control the behavior of the execution engine. In…
Jupyter™ Notebooks is one of the most popular IDE of choice among Python users. Traditionally, most Jupyter users work with small or sampled datasets that…
This is a guest post by Evan Harris, Data Scientist, Return Path At Return Path, I work on a data science team that uses machine…
Today distributed compute engines are the backbone of many analytic, batch & streaming applications. Spark provides many advanced features (pivot, analytic window functions, etc.) out…
In Part 1 you learned how to get started with installing distributed deep learning library BigDL on Qubole. In this Part 2 of a two-part…
BACKGROUND Qubole Notebooks give data scientists and data analysts an easy way to interact with data stored in Cloud data stores such as Amazon S3,…
BigDL runs natively on Apache Spark, which makes for a perfect deployment platform because Qubole offers a greatly enhanced and optimized Spark as a service.…
Today, we are excited to announce the private Beta availability of StreamX as a managed service within the Qubole Data Service (QDS) platform. With this…
A while back we shared the post about Qubole choosing Apache Airflow as its workflow manager. Then last year there was a post about GAing…
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.