Blog

 
 

5 Reasons Savvy New Gen Companies Turn to the Cloud for Big Data

  • By Jonathan Buckley
  • May 21, 2015
 

Of all the current trends in technology, few have created as much buzz as cloud computing and big data. As both grew in popularity, it stood to reason that they would eventually cross paths. This is exactly what has happened in recent years as the number of cloud services based around big data analytics […]

 

Hadoop Happenings: Hadoop Growth Slowing?

  • By Jonathan Buckley
  • May 19, 2015
 

Grab all the latest news and commentary about Hadoop in this week’s Hadoop Happenings. Gartner’s latest survey indicating slow growth in demand for Hadoop garnered extensive media attention this week. Commentators pointed out that the high opportunity cost for deploying Hadoop can be overcome by Hadoop-as-a-Service solutions, and others dismissed the concerns altogether. See the […]

 

Hadoop Happenings: ORC, Spark and Flink

  • By Jonathan Buckley
  • May 12, 2015
 

Grab all the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week Apache ORC became a top-level project. Commentary continued on Apache Spark and Apache Flink, and Forbes discussed whether big data will have an impact on next year’s presidential election. See all the stories below. 1. Apache ORC Launches as […]

 

7 Big Data Security Concerns

  • By Jonathan Buckley
  • May 7, 2015
 

Big data is more than just some trending business phrase that’s big on style and low on substance; it brings with it tangible benefits for any company willing to use it. The advantages of leveraging big data are real and oftentimes far-reaching, which is why so many organizations have adopted big data for their own […]

 

Announcing Qubole’s HBase-as-a-Service for AWS

  • By Jonathan Buckley
  • May 6, 2015
 

Today we are pleased to announce the Beta offering of Qubole’s HBase-as-a-Service. QDS can now provide fully managed HBase 1.0.0 running on Hadoop 2.6.0 as a managed service on the AWS Cloud. Introduction to HBase Apache HBase is an integral part of the Apache Hadoop ecosystem. When fast reads and writes with high concurrency and […]

 

Hadoop Happenings: Actionable Big Data

  • By Jonathan Buckley
  • May 5, 2015
 

Grab the latest news and commentary about Hadoop and big data in this week’s Hadoop Happenings. This week focus turned to big data use cases and asking the right questions when performing data analytics. Apache Parquet was also announced as a top-level project. 1. Apache Parquet paves the way for better Hadoop data storage Infoworld.com- […]

 

Opportunities and Challenges of Big Data in Ad Tech

  • By Jonathan Buckley
  • April 30, 2015
 

Big data is transforming numerous industries around the world, so perhaps it shouldn’t come as a surprise that advertising technology is one of those recipients. While ad tech has been around for a while now, only in the past few years have companies latched onto the idea that big data can make online advertising that […]

 

Hadoop Happenings: ODP Fireworks

  • By Jonathan Buckley
  • April 28, 2015
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. This week the debate continued over the Open Data Platform. SAP highlighted its support for Hadoop in the enterprise, and a data engineer from Shazam discussed why he chose Presto over Apache Hive. See the full stories […]

 

In Case You Missed It: Webinar – Getting to 1.5M Ads/sec

  • By Jonathan Buckley
  • April 23, 2015
 

At the end of March, Qubole, along with DataXu and Amazon Web Services, hosted a special webinar detailing DataXu’s work with big data and the special platforms provided by both Qubole and AWS. Speakers at this highly informative webinar included Scott Ward, a Solutions Architect at Amazon Web Services, Ashish Dubey, a Solutions Architect at […]

 

Hadoop Happenings: Positioning Battles

  • By Jonathan Buckley
  • April 21, 2015
 

  Grab all of the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week vendors continued their positioning battle as Pepperdata got more funding, Hortonworks acquired a new partner, and commentators dismissed much of the messaging as a distraction. See all of the stories below. 1. Pepperdata Scores $15M for Hadoop […]

 

Connecting Offline and Online Data: A Powerful Tool for Marketers/Advertisers

  • By Jonathan Buckley
  • April 16, 2015
 

The rise of big data has ushered in a new era for marketers and advertisers: the era of data-driven marketing. With massive amounts of rich online data constantly flowing in from multiple sources, marketers can use analytics to gain insights about customer habits, behaviors and preferences that the analysis of offline data could never […]

 

Hadoop Happenings: Cloud Rises

  • By Jonathan Buckley
  • April 14, 2015
 

Grab the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week focus turned to the cloud as more vendors are seeking to offer cloud deployments. Apple and IBM also teamed up to offer more advanced digital health data, and Think Big released its Dashboard Engine for Hadoop. 1. Why Cybersecurity Needs […]

 

6 Big Data Mistakes Businesses Make

  • By Jonathan Buckley
  • April 9, 2015
 

Big Data is reaping big benefits for businesses. But the key to success with big data lies in doing it right. All too often businesses jump on the big data bandwagon with unclear strategies and unreasonable expectations for what big data can do. As a result, the full potential and value of big data are […]

 

Hadoop Happenings: Spark, Hadoop, Infrastructure

  • By Jonathan Buckley
  • April 7, 2015
 

Grab all of the latest news about Hadoop in this week’s Hadoop Happenings. This week discussion continued on Apache Spark with some arguing its hype could be a self-fulfilling prophecy and others claiming the debate shouldn’t be about the technology at all but rather the infrastructure. See the full debate and other commentary below. 1. […]

 

Big Data, Ad Tech, and Privacy Concerns: How The Digital Advertising Industry Can Boost Transparency

  • By Jonathan Buckley
  • April 2, 2015
 

Big data is a game changer for the digital advertising industry. Thanks to powerful big data tools, digital advertisers can analyze mountains of insight-rich data from multiple sources, enabling them to deliver online and mobile ads to consumers that are more personalized and targeted than ever before. This new era of data-driven marketing is […]

 

Hadoop Happenings: Velocity and Quality

  • By Jonathan Buckley
  • March 31, 2015
 

Grab the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week conversation centered on businesses ensuring the quality of their data. Hotels.com provided an interesting use case, and Forbes published an article focused on the velocity of data. 1. Apple acquires big data analytics firm Acunu AppleInsider.com- Apple […]

 

Qubole Presents New Webinar: Getting to 1.5M Ads/sec

  • By Jonathan Buckley
  • March 27, 2015
 

Qubole, DataXu, and Amazon Web Services are set to present a new webinar on March 30, 2015 at 10 a.m. PDT / 1 p.m. EDT. The webinar is titled “Getting to 1.5M Ads/sec.” Much of the focus of the upcoming webinar involves DataXu and how the company manages its big data. DataXu works […]

 

5 Tips to Attract and Retain Talent From the STEM Fields

  • By Jonathan Buckley
  • March 26, 2015
 

Every day more and more companies are turning to big data and analytics to become more competitive and profitable. The natural result of this growing trend is an increase in demand for talented individuals in the Science, Technology, Engineering, and Mathematics (STEM) fields. This presents a problem for companies in the Ad Tech, Internet, and […]

 

Hadoop Happenings: Growth, Jabs and Questions

  • By Jonathan Buckley
  • March 24, 2015
 

Grab the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week discussion continued on the Hadoop vs. Spark debate. Security continues to be a top concern with enterprises adopting Hadoop, and big data adoption continues to grow. 1. Is Apache Spark going to replace Hadoop? Aptuz.com- This post […]

 

5 Tips To Communicate Your Tech Company’s Benefits To Potential Customers

  • By Jonathan Buckley
  • March 19, 2015
 

In 1999, NASA’s $125 million Mars Climate Orbiter was forever lost in space because one team of scientists measured in metric units and the other team used English (imperial) units. While this fundamental lack of communication on NASA’s part is laughable, the loss of potential customers due to an inability to effectively communicate the benefits of their […]

 

Qubole connects to Amazon Redshift

  • By Jonathan Buckley
  • March 18, 2015
 

A few weeks ago, we announced the addition of Apache Spark to the Qubole Data Service. This new capability has been received with tremendous excitement among customers who can now take advantage of Spark’s blazing fast speed. Today, we’ve extended the Qubole platform even further with a connector to Amazon Redshift. This makes data scientists’ […]

 

Hadoop Happenings: Apache Tajo Released

 

Grab the latest news and commentary about Hadoop all in one place in this week’s Hadoop Happenings. This week Apache Tajo was officially released for commercial use, commentary focused on the ever-changing Hadoop ecosystem, and discussion continued on Hadoop security. 1. Navigating the Hadoop Ecosystem Oreilly.com- An introductory overview of Field Guide to Hadoop, this […]

 

Qubole on Google Compute Engine

  • By Joydeep Sen Sarma
  • December 7, 2013
 

Qubole is a leading provider of Hadoop as a service with the mission of providing a simple, integrated, high-performance big data stack that businesses can use to derive actionable insights from their data sources quickly. Qubole Data Service offers self-service and auto-scaled Hadoop in the cloud along with an integrated library of data connectors and an easy-to-use GUI […]

 

Qubole Hive Server

  • By Sivaramakrishnan Narayanan
  • March 27, 2013
 

Qubole offers Hive as a service. When a user logs in to Qubole, he/she sees the tables and functions associated with their account and can submit a HiveQL command via the composer pane. Qubole takes care of executing the HiveQL command, spawning a Hadoop cluster if necessary and saving results and logs. Now, multiple users […]

 

Easy reusable commands with templates

  • By Hariharan Iyer
  • January 7, 2014
 

A common characteristic of many analytics queries is that they are mostly invariant in form and function. Over multiple invocations of the query or command, one would find that only the range of inputs varies in the form of a couple of inputs, while the major part of the query remains the same. Command templates […]

 

Waiting for Mr. Ntpd

  • By Sivaramakrishnan Narayanan
 

In one of our earlier blog posts, we announced the availability of the Qubole Hadoop Platform on Google Compute Engine. This was also featured on the Google Cloud Platform Blog. In this post we talk about a critical issue that we faced (and eventually managed to circumvent) a few days before the Qubole-on-GCE beta release. […]

 

Canonicalizing hive queries to find top workloads

  • By Rohit Agarwal
  • April 23, 2014
 

Motivation: One of Qubole Data Service’s most popular offerings is Hive-as-a-Service in the cloud. Users run a large number of ad-hoc, analytical Hive queries against their data in S3 or HDFS. It wasn’t apparent to us how many of these queries were truly unique and how many were simple variants. The hypothesis was that if […]

 

Presto Performance

  • By Sivaramakrishnan Narayanan
  • April 14, 2014
 

Presto is an open source distributed SQL query engine, developed by Facebook. Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses. Qubole started its Presto-as-a-Service program a few weeks ago to make it easily accessible with a single click for its users. A good […]

 

Re-using JVMs across Hadoop jobs

  • By Sivaramakrishnan Narayanan
  • December 22, 2014
 

One of the oft-discussed problems with Hadoop is that it launches new JVMs for each map or reduce task. Launching a new JVM and loading all the classes is pretty expensive and can take anywhere from 4-8 seconds. If the job is a small one, this startup overhead can be a substantial part of overall […]

 

Hadoop with Enhanced Networking on AWS

  • By Hariharan Iyer
  • March 13, 2015
 

Introduction At Qubole, many of our customers run their Hadoop clusters on AWS EC2 instances. Each of these instances is a Linux guest on a Xen hypervisor. Traditionally each guest’s network traffic goes through the hypervisor, which adds a little bit of overhead to the bandwidth. EC2 now supports Single Root I/O Virtualization (called Enhanced […]

 

Information and Insights: Big Data vs. Actionable Data

 

The era of data-driven business has arrived. Big data analytics tools are enabling organizations to capture, manage and mine mountains of raw chaotic data from multiple sources to gain insights that inform products, services and marketing strategies. The challenge is that not all big data insights are relevant and meaningful enough to spark real change […]

 

Hadoop Happenings: War of Words

 

Grab the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week distributions waged a war of words as Cloudera dismissed its competitors, and Hortonworks defended the Open Data Platform. Apache Tajo was also declared ready for commercial use by the Apache Software Foundation. 1. The End Game for […]

 

Bridging HDFS2 with HDFS1

  • By Rajat Jain
  • March 14, 2015
 

Industry is rapidly moving to adopt Hadoop 2.x. With every upgrade process — especially one that is so big in nature — there is a level of complexity involved. Qubole has already started offering a beta service to our customers. Our customers have started to try out Hadoop 2 as well, and as with any […]

 

Improving the Consumer Experience: How Media Companies are Using Big Data

 

Big data is transforming entire industries, and the media industry is no exception.  Once media companies relied on traditional data to make educated guesses at what content consumers were looking for. Today they have mountains of rich data that reveals what consumers are doing, searching, consuming, tweeting, liking and sharing. Armed with this information, media […]

 

Plugging in Presto UDFs

  • By Sivaramakrishnan Narayanan
  • March 4, 2015
 

Presto is a great query engine for a variety of SQL workloads. We’ve been offering  Presto-as-a-Service for many months now and a frequent question that comes up is: “How can I plug-in custom user-defined functions in Presto?” In this blogpost, we will answer this very question. We’ve created a Presto UDF Project in github that […]

 

Hadoop Happenings: Market Rumblings

 

1. Making Sense of the ODP: Where Does Hadoop Go From Here? Datanami.com- Debate over the purpose of the Open Data Platform got heated, with some dismissing it as a distraction or a grasp at relevance while others claim it’s necessary to spur Hadoop innovation. Read More 2. Big Data Bits: Strata + Hadoop World Rewind CMSWire.com- […]

 

Clickstream Data Analysis: A Powerful Tool for Your Business

 

  Big data analytics platforms such as cloud-based Hadoop have become powerful tools for businesses looking to leverage vast sets of customer data for competitive advantage. But with so much rich data streaming in from multiple sources, the analytics challenge for many businesses is determining what types of data will yield the highest amounts of […]

 

Hadoop Happenings: Strata + Hadoop 2015

 

Grab the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings.  From the controversial Open Data Platform to increasing support from Spark, coverage of the recent O’Reilly Strata + Hadoop world conference along with accompanying vendor announcements dominated the news this week. Hadoop vendor Cloudera also pressed pause on movement […]

 

Qubole Adds Apache Spark to Hadoop-based Cloud Offering

 

One of the things customers love about Qubole is that they’re able to use the latest and greatest technologies—without having to fiddle with deploying it on their own. Continuing this tradition, I’m pleased to announce that we’ve expanded our portfolio of services on the Qubole Data Services (QDS) platform to include Apache Spark. Data scientists […]

 

Hadoop Happenings: Open Data Platform

 

Grab the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week several large companies formed a new alliance. Mortar Data was purchased by one of its customers, and the first hints at Cloudera’s IPO were released. 1. Cloudera appears to be preparing for an IPO in the next […]

 

Using Big Data for Digital Decision Making

 

  Technology is a non-stop rollercoaster ride of innovation, with devices becoming increasingly faster and smarter. Computers infused with artificial intelligence systems are now able to analyze more data, recognize patterns and make decisions in real-time like never before. This presents a number of new opportunities for many industries. Improved analytics and decision-making abilities allow […]

 

2014 Was a Great Year for Qubole

 

Today we reported some impressive stats on our growth in 2014. In short, last year was a phenomenal year for the company. The amount of data our clients processed on Qubole in 2014 soared to 519 petabytes of data, compared to 34 petabytes in 2013. In fact, we’re now processing more than 100 petabytes per […]

 

Hadoop Happenings: Vendor Shifts

 

Grab the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week Cloudera announced its acquisition of startup Xplain.io. Discussion continued on the future of Apache Spark, and rumors flew about Pivotal’s future in the big data arena. 1. Cloudera acquires self-service data-modeling startup Xplain.io Gigaom.com- Cloudera acquired Xplain.io, […]

 

Qubole on Azure

  • By Nate Philip
  • February 9, 2015
 

Qubole is the leading provider of Hadoop as a service. Our mission is to provide a simple, integrated, high-performance big data stack that businesses can use to derive actionable insights from their data sources quickly. Qubole Data Service (QDS) offers self-service and auto-scaling Hadoop in the cloud (patent pending) along with an integrated suite of data […]

 

Reaping the Benefits of Real-Time Analytics

 

  Discussions surrounding big data often mention its three Vs: volume, variety and velocity. The most commonly discussed of those three is obviously volume, which isn’t surprising given the name big data. However, variety and velocity are just as important in the equation. In fact, velocity is too often overlooked. Companies are so focused on […]

 

Hadoop Happenings: Spark Escalates

 

Grab the latest news and commentary about Hadoop all in one place in this week’s Hadoop Happenings. This week focus turned to Apache Spark as it continues to gain interest as a faster, more flexible alternative to MapReduce. There was also discussion on the growing role of data scientists and whether the role should be […]

 

360-Degree Customer View: Seeing the Big Picture Through the Big Data Lens

 

  Poaching continues to be a significant problem throughout the world, specifically in Africa. Every year thousands of different animals are illegally hunted, many of which are endangered and on the brink of extinction. In an effort to fight this problem, scientists have gone to great lengths to better understand these creatures. They study them […]

 

Qubole Uses Drone To Announce Expansion

 

Big Data as a Service leader Qubole recently moved to larger quarters in Mountain View, Calif. To mark the occasion, the company used a DJI Phantom 2 Vision+ to film the move. Qubole has joined a number of high technology firms using drones to get the word out. Qubole co-founder and CEO Ashish […]

 

Hadoop Happenings: Apache Falcon Graduates

 

Grab the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week the Apache Software Foundation announced Apache Falcon has graduated to a top-level project. Hortonworks’ distribution is now available on the Google Cloud Platform, and Netflix is open sourcing some of its analytics tools. 1. Netflix is open sourcing tools for […]

 

Streamline Multi-Channel Marketing with Big Data

 

The growing impact of mobile devices is no secret to marketers. A 2013 report from the Winterberry Group found that from 2012 to 2013 spending on mobile search marketing doubled, and the cost-per-click is now higher on tablets than desktops. Of course, the true challenge isn’t mobile marketing but the consumer’s propensity to move from […]

 

Qubole partner ecosystem continues to grow: Xcentium uses Qubole Data Service to enhance analytics for e-commerce companies

 

2015 is off to a great start for Qubole, with the addition of a new member to our partner ecosystem—Xcentium. Xcentium is a world-class digital services and technology provider to Fortune 1000 organizations. Its E-commerce & Digital Services Practice is using Qubole’s self-service big data platform to quickly build enterprise scale solutions for its e-commerce […]

 

Hadoop Happenings: Security Concerns

 

Grab all of the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week Gigaom continued a series on the rival Hadoop security projects. Cloudian became a Hortonworks technology partner, and Information Week covered Hortonworks’ dilemma of becoming a profitable company while offering solely open source software. 1. Big data upstart MapR […]

 

Big Data and Customer Micro-Segmentation: Applications in Media and E-Commerce

 

Customer segmentation is nothing new to an experienced marketer. Traditional B2C segments based on demographic, psychographic and behavioral data are taught in introductory college courses, and B2B marketers are well familiar with segments based on company size or purchase criteria. While these segments have served as a useful guide for decades, the era of big […]

 

Hadoop Happenings: A Re-framing of Hadoop

 

Grab all the latest news and commentary on Hadoop in this week’s Hadoop Happenings. This week a guest post on the Hortonworks blog discussed Apache Ranger, Autodesk discussed its plans for Hadoop and the cloud, and several sites weighed in on how Hadoop will continue to transform as an enterprise solution. 1. Hadoop Security: […]

 

Big Data: The Solution to Ad Fraud?

 

Those in the online advertising industry have a grim reality to face: fraud is rampant and increasingly costly. Considering how important advertising revenue is for online companies, media sites, and publishers, the problem hasn’t received much attention until recently. Having advertisers paying for fake ad impressions represents a potential breach in trust as long as […]

 

Hadoop Happenings: Revitalize a Brand

 

Grab the latest news and commentary about Hadoop in this week’s Hadoop Happenings. This week Datameer published statistics on big data and Hadoop in the news. Timberland discussed how it used data science to revitalize its brand, and a post discussed Facebook’s development of HydraBase. 1. In 2015, enterprises can better utilize the paradigm-shifting Hadoop […]

 

Finding Business Value in Sentiment Analysis Data

 

The explosion of social media and the proliferation of mobile devices have created a “perfect storm” of opportunities for customers to express their feelings and attitudes about anything and everything at any time. This opinion or “sentiment” data, generated through social channels in the form of reviews, chats, shares, likes, tweets, etc., often includes comments that […]

 

Hadoop Happenings: Hadoop Lives Up to Hype

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week focused on analysts’ projections for the Hadoop industry’s growth as well as the growing need for data scientists. Use cases for Hadoop in banking and telecommunications were also discussed. 1. Is Hadoop over-hyped? Market-watchers say […]

 

Looking Forward: Hadoop Industry Trends

 

  From its primitive beginnings as a modest open source search engine called “Nutch”, Hadoop has evolved into a powerful big data analytics platform. As big data technologies and policies rapidly advance, Hadoop is just getting started. In a recent article on Computerworld.com, writer Robert L. Mitchell interviews IT leaders, consultants and industry analysts to […]

 

Hadoop Happenings: Predictions for Hadoop

 

  Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. Predictions for 2015 continued this week with a focus on big data adoption and the simplification of Hadoop. Forrester released a report on the big data in the cloud industry, and MapR’s CEO discussed the possibility […]

 

Not All Hadoop Distributions are Created Equal

 

The debate is over. Big data analytics has proven benefits. And organizations looking to implement a big data solution now have a number of options to choose from. The challenge is selecting the right Hadoop vendor, as not all Hadoop distributions are created equal. As a help to finding the best fit, here are a […]

 

Hadoop Happenings: New Round of Funding

 

1. The 6 Things Everyone Needs to Know about the Big Data Economy SmartDataCollective.com- In this post Bernard Marr argues that big data is moving mainstream and discusses several elements of the big data economy. Read More 2. Altiscale Lands $30M To Continue Building Hadoop Cloud Service Techcrunch.com- Altiscale announced Series B funding led by […]

 

Hadoop Happenings: Looking to 2015

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week there are many predictions for what Hadoop’s future holds. 1. The End of the Hadoop Bubble? Forbes.com- Hortonworks’ rush into a reduced IPO may be to grasp the interest of a potential buyer. There are […]

 

Upcoming Webinar: Forrester Analyst Discusses Big Data in the Cloud

 

Join us for a live webinar Dec. 10, 2014 at 10 a.m. PST / 1 p.m. EST hosted by Noel Yuhanna, principal analyst of enterprise architecture at Forrester Research, and Ashish Thusoo, co-founder and CEO of Qubole. The webinar will discuss how the cloud has helped companies keep up with the fast-changing technology landscape and discuss why big […]

 

Hadoop Happenings: Apache Pig 0.14.0

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week a new version of Apache Pig was released. Forrester has a new report reviewing the Hadoop ecosystem, and LinkedIn provided details about its Gobblin big data framework. 1. Storage Hangout: Hadoop Plug-in Refresh Release and […]

 

Hadoop Happenings: New Releases; Partnerships

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week Cloudera and MapR formed new partnerships. Splice Machine’s SQL on Hadoop database went on general release, and eHarmony discussed its future plans with Hadoop. 1. Why eHarmony is rebuilding itself atop Hadoop and (probably) OpenStack […]

 

Qubole Partners with Microsoft Azure

 

By Marcy Campbell, Qubole SVP of Sales & Business Development Today I am excited to announce a new strategic relationship with Microsoft Azure, an important step towards fulfilling our mission to make big data solutions more accessible to more people on more platforms. Microsoft partners and Azure customers can now bring their Azure subscription to […]

 

Hadoop Happenings: Big Data’s Turning Point

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week HP released HP Vertica for SQL on Hadoop not far from the announcement that Hortonworks is filing for an IPO. Articles also discussed applications of Hadoop in video and the airline industry. 1. Data Science […]

 

Cluster Computing Comparisons: MapReduce vs. Apache Spark

 

Since its beginnings some 10 years ago, the MapReduce Hadoop implementation has become the go-to, enterprise-grade solution for storing, managing and processing massively large data volumes. Today, as organizations face the growing need for real-time data analysis to achieve competitive advantage, a new open source Hadoop data processing engine, Apache Spark, has […]

 

Hadoop Happenings: Hortonworks IPO; Experts Weigh In

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week Hortonworks filed paperwork to take the first step toward an Initial Public Offering. A post from Micron evaluated the value of SSDs for Hadoop, and CenturyLink sought to simplify the process for configuring a Cloudera […]

 

White Paper: Big Data Belongs in the Cloud

 

Big data is growing faster than ever before, and businesses are looking to take full advantage of it. In fact, the big data market is growing about six times faster than the IT market in general, making it an essential ingredient to success for many companies and industries in the world. It’s through big data […]

 

Hadoop Happenings: Hadoop Lingo; Focus on Growth

 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week discussion focused on Hadoop’s place in the enterprise, data governance and the growing need for data migration tools. 1. Four tips for putting business users in touch with Hadoop SaS.com- There are four main drivers […]

 
Read More..

The Key to Success with Big Data Projects (Updated)

The-Key-to-Success_small
 

This article was originally published June 5, 2014 and has since been updated. Big data has been generating a lot of buzz recently because of its numerous capabilities and the relative ease with which the road to Hadoop can now be accessed and used. It’s an impressive combination that is allowing businesses to innovate and […]

 
Read More..

Hadoop Happenings: Product Announcements; Data Logistics

hadoop-happenings
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. This week several companies made product announcements, an article from Forbes discussed data logistics and minimizing technical debt, and an interview with the author of Big Data @ Work was released. 1. Databricks demolishes big data benchmark […]

 
Read More..

MediaMath Analysts Use Presto for Interactive Queries

Performance-Tests-Prove-Value-of-Caching-in-Presto_small3
 

As the demand for leveraging big data to gain insightful analysis has grown, so too have the facilitating technologies. End-users have to adapt to keep pace with the volume of big data as it expands. Today, this requires utilizing the evolution of big data technology. Facebook’s low latency query engine, Presto, presented itself as a […]

 
Read More..

High Performance Hadoop with New Generation AWS Instances

 

Welcome New Generation Instance Types Amazon Web Services (AWS) offers a range of instance types for supporting compute-intensive workloads. The compute optimized instance family has a higher ratio of compute power to memory. The older generation C1 and CC2 instance types have been very useful in batch data processing frameworks such as Hadoop. Late last […]

 
Read More..

Hadoop Happenings: Breaking the Silicon Valley Bubble

Hadoop-Happenings
 

Grab all of the latest news and commentary in one place with this week’s Hadoop Happenings. This week Teradata formed a new partnership with Cloudera and discussion centered on Hadoop’s breaking out of the Silicon Valley bubble. Arguments for big data in the cloud were also highlighted. 1. Hadoop: Breaking out of the Silicon Valley […]

 
Read More..

The Internet of Things and Big Data: A Marriage Made in the Cloud

The-Internet-of-Things-and-Big-Data_small
 

Thanks to smartphones, tablets, “wearables” and apps, people are connected to the Internet like never before. And if emerging technologies have their way, the very objects that surround us (our cars, our refrigerators, even our toothbrushes) will soon be connected to the web as well, 24/7. Welcome to the era of the Internet of Things. As the Internet […]

 
Read More..

Hadoop Happenings: Retail Use Cases; Apache Tez

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. Use cases for Hadoop in the retail and entertainment industries were a popular topic of discussion this week. Apache Tez received accolades from the Bossie awards, and real-time analytics continued to be a hot topic. 1. Does […]

 
Read More..

Qubole Turns 3 Years Old: A Letter from the CEO

Qubole-Turns-3-Years-Old_small
 

Qubole turned 3 years old on 9/9/2014. Wow, it has been that long!! Time flies when you are having fun and creating impactful things. We knew when we started that the cloud was a big disruptive force, and we set out to build a company around using its disruptive nature to build a game-changing big […]

 
Read More..

Komli Media Discusses Path to Big Data Provider

komli_big_data_small
 

In a recent interview with Tech Target, the engineering manager of Komli Media shared his decision-making process to select a big data service. Komli Media is an ad placement agency that uses data based on a potential customer’s past purchases and search history to select which advertisements will be most effective. The company relied on […]

 
Read More..

Hadoop Happenings: Apache Storm Graduates

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. Several product announcements were made this week, including Cask, formerly Continuuity, going open source and Apache Storm graduating to top-level status. Cloudera and Pivotal formed new partnerships. 1. Real World Examples: Real-time Data From Internet of […]

 
Read More..

Hadoop Happenings: Product Announcements; State of Hadoop

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. Several distributions made product announcements this week, including the first Hadoop offering for China. SiliconAngle reviewed the state of Hadoop, including who is using it and why. 1. The New Normal: Business Understanding Recode.net- Bob Muglia, […]

 
Read More..

Qubole Selected for Big Data Innovation Award

Ventana-Award
 

Ventana Research has announced that Qubole will receive the big data innovation award at the Technology Innovation Awards Ceremony October 21, 2014. The Technology Innovation Awards are among the most prestigious awards offered annually and are meant to recognize technology innovations that improve productivity and outcomes for business and IT. Qubole was recognized alongside other […]

 
Read More..

Hadoop Happenings: Focus on the Internet of Things

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week, commentators explored the intersection of big data, Hadoop and the Internet of Things. Cisco and MapR released product announcements and several big data case studies were released. 1. Hadoop Basics Course SapHanaTutorial.com- This Hadoop Basics […]

 
Read More..

Big Data Dilemma: Quantity Vs. Quality

Big-Data-Dilemma-Quantity-Vs-Quality-small
 

Advances in big data technologies and capabilities have increased adoption in the enterprise dramatically. But as organizations implement big data platforms to collect, manage, and analyze large data sets for competitive advantage, they are faced with a dilemma. Should they direct most of their resources toward collecting massive volumes of data? Or should they focus […]

 
Read More..

Securely sharing data across Organizations with Qubole

 

Customers love that Qubole enables collaboration via a shared workbench across multiple analysts in an organization. Increasingly though, we have started finding use cases where organizations want to share data across Qubole accounts. Departments in different geographies want to share selected data sets with each other. Also, organizations want to share data with their partner […]

 
Read More..

Hadoop Happenings: Stinger Initiative; Hadoop Challenges

Hadoop-Happenings
 

Grab all of the latest news and commentary on Hadoop in one place in this week’s Hadoop Happenings. The Stinger Initiative was officially completed in April, but the Hadoop community is still learning all that it accomplished. Additional commentary was given on the challenges of using Hadoop. 1. Seven signs your hair is on fire: The […]

 
Read More..

Top 5 Big Data Myths Debunked

Top-5-Big-Data-Myths-Debunked_small_v2
 

The era of big data has arrived. Today, companies both large and small are discovering the benefits of analyzing vast pools of unstructured data for new insights and competitive advantage. That being said, there are a number of lingering misconceptions about big data that companies looking to successfully implement a big data analytics platform need […]

 
Read More..

Hadoop Happenings: Spark, Storm and Kafka

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. The Hadoop landscape is rapidly changing as new products from Spark to Storm gain traction. The Hadoop community also continues to educate businesses on the big data technology available. 1. The Future of Apache Storm: Secure, Highly […]

 
Read More..

Caching in Presto

 

Qubole’s Presto-as-a-Service is primarily targeted at Data Analysts who are tasked with translating ad-hoc business questions into SQL queries and getting results. Since the questions are often ad-hoc, there is some trial and error involved. Therefore, arriving at the final results may involve a series of SQL queries. By reducing the response time of these […]

 
Read More..

Qubole Releases Industry’s First Auto-Scaling Presto Clusters

Auto-Scaling-Presto-Clusters_small
 

Qubole was the first big data platform to offer a true auto-scaling Hadoop-as-a-Service solution. Now, Qubole is pleased to announce the industry’s first auto-scaling Presto-as-a-Service solution. Why Auto-Scaling Presto-as-a-Service Explorative analytics is one area that can get quite bursty. A single business question can easily require multiple short queries. For example, let’s say a data […]

 
Read More..

Hadoop Happenings: Strata + Hadoop Conference

hadoop-happenings
 

Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. This week, coverage of the announcements and keynotes at the 2014 Strata + Hadoop conference was prevalent, as was commentary on the Hadoop data lake and other trends in big data analytics. 1. Swimming in a lake […]

 
Read More..

Hadoop Happenings: Getting Started with Data Analytics

Hadoop-Happenings
 

Grab all of the latest news and commentary about Hadoop in one place. This week’s focus went back to the basics with popular content covering what Hadoop is, what it can do and how to get started. IBM also presented use case videos for big data in the enterprise. 1. What’s Hadoop, can I […]

 
Read More..

Hadoop and the Data Warehouse: A Winning Combination for Your Business

Hadoop-and-the-Data-Warehouse_small
 

Once the subject of speculation, big data analytics has emerged as a powerful tool businesses can use to manage, mine, and monetize vast stores of unstructured data for competitive advantage. As a result, the rate of adoption of Hadoop big data analytics platforms by companies has increased dramatically. In this rush to leverage big data, […]

 
Read More..

Hadoop Happenings: Curing Parkinson’s; Vendor Predictions

Hadoop-Happenings
 

Grab all of the latest Hadoop news and commentary in one place with this week’s Hadoop Happenings. An article from VentureBeat looked at how Intel plans to use big data for Parkinson’s research. A new research report suggested the future of Hadoop may lie in the cloud, and an analyst predicted MapR would be […]

 
Read More..

Hadoop vs. Traditional Database: Which Better Serves Your Big Data Business Needs?

Hadoop-vs.-Traditional-Database_small_v2
 

Today’s ultra-connected world is generating massive volumes of data at ever-accelerating rates. As a result, big data analytics has become a powerful tool for businesses looking to leverage mountains of valuable data for profit and competitive advantage. In the midst of this big data rush, Hadoop, as an on-premises or cloud-based platform, has been heavily […]

 
Read More..

Hadoop Happenings: Small Data and Data Governance

Hadoop-Happenings
 

Grab the latest news and commentary on Hadoop all in one place. This week, Google announced a new data-warehousing system called Mesa. Cloudera made the argument that Hadoop is moving toward small data, and the importance of data governance was a hot topic. 1. Hadoop for Small Data Cloudera.com- This blog post discusses the growing […]

 
Read More..

Case Study: Big Data’s Impact on B2B Marketing

Big-Data-B2B-Marketing
 

Insightera offers a learning B2B targeting and personalization platform that relies on a combination of big data, machine learning and predictive analytics. The platform displays content to targeted prospects in real-time in order to continuously boost ROI. The company first turned to Hadoop to manage its big data but found Hadoop to be much too […]

 
Read More..

Hadoop Happenings: More Partnerships; Hadoop Matures

Hadoop-Happenings
 

Check out the latest news and insights about Hadoop. This past week Hortonworks announced a partnership with Pivotal. Gigaom released a research report on Hadoop 2.0, and a Forbes column discussed the impact of Hadoop on Supply Chain Management. 1. Hortonworks gains ground as its Hadoop management tool gets adoption at Pivotal Venturebeat.com- Hortonworks and […]

 
Read More..
 
 
 


 
 
 
 

Featured Blogs