Sentiment Analysis Using Word2Vec and Deep Learning with Apache Spark on Qubole
This post covers the use of Qubole, Zeppelin, PySpark, and H2O PySparkling to develop a sentiment analysis model capable of providing real-time alerts on customer…
For many data scientists and statisticians, R is their tool of choice. It provides many useful abstractions, is easy to script in, and has tons…
Earlier this month, a study was published indicating that the widely used “Reddit dataset” (released in 2015 by Jason Baumgartner) had significant, previously unidentified gaps. This study…
In response to significant demand from our customers, we are happy to announce a new extension of the Notebooks feature on the Qubole Data Service:…
With the recent announcement of Dashboards, we at Qubole want to make sure that our customers have everything that they need to succeed with using…
Last week, during the Deep Learning Summit at AWS re:Invent 2017, Terrence Sejnowski (a pioneer of deep learning) succinctly said “Whoever has more data wins”.…
When I walked into Qubole’s office on June 10th, the first day of my Product Analyst internship there, I had nothing with me except for a notebook and…
In a recent blog post, we benchmarked auto-scaling and demonstrated that an auto-scaling cluster was a lot less expensive and only a little bit…
Have you ever had trouble deciding how large to make a cluster? Do you sometimes feel like you’re wasting money when a cluster isn’t…