Spark Streaming: IoT with Amazon Kinesis and Visualizing with Qubole Notebooks
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. In our last post, we discussed setting up Amazon IoT, Kinesis and Qubole to build a streaming pipeline.…
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. In our last post, we discussed setting up Amazon IoT, Kinesis and Qubole to build a streaming pipeline.…
Nexla is a data operations platform that focuses on enabling data movement between companies with security and scale. The platform is simple enough for the…
Co-authored by Jeffrey Ellin, Solutions Architect, Qubole. The Internet of Things (IoT) is increasingly becoming an important topic in the world of application development. This…
Execution engines like M/R, Tez, Presto, and Spark provide a set of knobs or configuration parameters that control the behavior of the execution engine. In…
Jupyter™ Notebooks is one of the most popular IDE of choice among Python users. Traditionally, most Jupyter users work with small or sampled datasets that…
This is a guest post by Evan Harris, Data Scientist, Return Path At Return Path, I work on a data science team that uses machine…
Today distributed compute engines are the backbone of many analytic, batch & streaming applications. Spark provides many advanced features (pivot, analytic window functions, etc.) out…
In Part 1 you learned how to get started with installing distributed deep learning library BigDL on Qubole. In this Part 2 of a two-part…
BACKGROUND Qubole Notebooks give data scientists and data analysts an easy way to interact with data stored in Cloud data stores such as Amazon S3,…
BigDL runs natively on Apache Spark, which makes for a perfect deployment platform because Qubole offers a greatly enhanced and optimized Spark as a service.…
Free access to Qubole for 30 days to build data pipelines, bring machine learning to production, and analyze any data type from any data source.
See what our Open Data Lake Platform can do for you in 35 minutes.