Blog

 
 

Hadoop Happenings: New Round of Funding


1. The 6 Things Everyone Needs to Know about the Big Data Economy SmartDataCollective.com- In this post Bernard Marr argues that big data is moving mainstream and discusses several elements of the big data economy. Read More 2. Altiscale Lands $30M To Continue Building Hadoop Cloud Service Techcrunch.com- Altiscale announced Series B funding led by […]

 
Read More..

Hadoop Happenings: Looking to 2015


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week there are many predictions for what Hadoop’s future holds. 1. The End of the Hadoop Bubble? Forbes.com- Hortonworks’ rush into a reduced IPO may be to grasp the interest of a potential buyer. There are […]

 
Read More..

Upcoming Webinar: Forrester Analyst Discusses Big Data in the Cloud


Join us for a live webinar Dec. 10, 2014 at 10am PST/1 pm EST hosted by Noel Yuhanna, principal analyst of enterprise architecture at Forrester Research, and Ashish Thusoo, co-founder and CEO of Qubole. The webinar will discuss how the cloud has helped companies keep up with the fast-changing technology landscape and discuss why big […]

 
Read More..

Hadoop Happenings: Apache Pig 0.14.0


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week a new version of Apache Pig was released. Forrester has a new report reviewing the Hadoop ecosystem, and LinkedIn provided details about its Gobblin big data framework. 1. Storage Hangout: Hadoop Plug-in Refresh Release and […]

 
Read More..

Hadoop Happenings: New Releases; Partnerships


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week Cloudera and MapR formed new partnerships. Splice Machine’s SQL on Hadoop database went on general release, and eHarmony discussed its future plans with Hadoop. 1. Why eHarmony is rebuilding itself atop Hadoop and (probably) OpenStack […]

 
Read More..

Qubole Partners with Microsoft Azure


By Marcy Campbell, Qubole SVP of Sales & Business Development Today I am excited to announce a new strategic relationship with Microsoft Azure, an important step towards fulfilling our mission to make big data solutions more accessible to more people on more platforms. Microsoft partners and Azure customers can now bring their Azure subscription to […]

 
Read More..

Hadoop Happenings: Big Data’s Turning Point


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week HP released HP Vertica for SQL on Hadoop not far from the announcement that Hortonworks is filing for an IPO. Articles also discussed applications of Hadoop in video and the airline industry. 1. Data Science […]

 
Read More..

Cluster Computing Comparisons: MapReduce vs. Apache Spark


Since its early beginnings some 10 years ago, the MapReduce Hadoop implementation has become the go-to enterprise-grade solution for storing, managing and processing massively large data volumes. Today, as organizations face the growing need for real-time data analysis to achieve competitive advantage, a new open source Hadoop data processing engine, Apache Spark, has […]

 
Read More..

Hadoop Happenings: Hortonworks IPO; Experts Weigh In


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week Hortonworks filed paperwork to take the first step toward an Initial Public Offering. A post from Micron evaluated the value of SSDs for Hadoop, and CenturyLink sought to simplify the process for configuring a Cloudera […]

 
Read More..

White Paper: Big Data Belongs in the Cloud


Big data is growing faster than ever before, and businesses are looking to take full advantage of it. In fact, the big data market is growing about six times faster than the IT market in general, making it an essential ingredient to success for many companies and industries in the world. It’s through big data […]

 
Read More..

Hadoop Happenings: Hadoop Lingo; Focus on Growth


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week discussion focused on Hadoop’s place in the enterprise, data governance and the growing need for data migration tools. 1. Four tips for putting business users in touch with Hadoop SaS.com- There are four main drivers […]

 
Read More..

The Key to Success with Big Data Projects (Updated)


This article was originally published June 5, 2014 and has since been updated. Big data has been generating a lot of buzz recently because of its numerous capabilities and the relative ease with which the road to Hadoop can now be accessed and used. It’s an impressive combination that is allowing businesses to innovate and […]

 
Read More..

Hadoop Happenings: Product Announcements; Data Logistics


Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. This week several companies made product announcements, an article from Forbes discussed data logistics and minimizing technical debt, and an interview with the author of Big Data @ Work was released. 1. Databricks demolishes big data benchmark […]

 
Read More..

MediaMath Analysts Use Presto for Interactive Queries


As the demand for leveraging big data to gain insightful analysis has grown, so too have the facilitating technologies. End-users have to adapt to keep pace with the volume of big data as it expands. Today, this requires utilizing the evolution of big data technology. Facebook’s low latency query engine, Presto, presented itself as a […]

 
Read More..

High Performance Hadoop with New Generation AWS Instances

 

Welcome New Generation Instance Types Amazon Web Services (AWS) offers a range of instance types for supporting compute-intensive workloads. The compute optimized instance family has a higher ratio of compute power to memory. The older generation C1 and CC2 instance types have been very useful in batch data processing frameworks such as Hadoop. Late last […]

 
Read More..

Hadoop Happenings: Breaking the Silicon Valley Bubble


Grab all of the latest news and commentary in one place with this week’s Hadoop Happenings. This week Teradata formed a new partnership with Cloudera and discussion centered on Hadoop’s breaking out of the Silicon Valley bubble. Arguments for big data in the cloud were also highlighted. 1. Hadoop: Breaking out of the Silicon Valley […]

 
Read More..

The Internet of Things and Big Data: A Marriage Made in the Cloud


Thanks to smartphones, tablets, “wearables” and apps, people are connected to the Internet like never before. And if emerging technologies have their way, the very objects that surround us, including our cars, our refrigerators and even our toothbrushes, will soon be connected to the web as well, 24/7. Welcome to the era of the Internet of Things. As the Internet […]

 
Read More..

Hadoop Happenings: Retail Use Cases; Apache Tez


Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. Use cases for Hadoop in the retail and entertainment industries were a popular topic of discussion this week. Apache Tez received accolades from the Bossie awards, and real-time analytics continued to be a hot topic. 1. Does […]

 
Read More..

Qubole Turns 3 Years Old: A Letter from the CEO


Qubole turned 3 years old on 9/9/2014. Wow, it has been that long!! Time flies when you are having fun and creating impactful things. We knew when we started that cloud was a big disruptive force, and we set out to build a company around using its disruptive nature to build a game changing big […]

 
Read More..

Komli Media Discusses Path to Big Data Provider


In a recent interview with Tech Target, the engineering manager of Komli Media shared his decision-making process to select a big data service. Komli Media is an ad placement agency that uses data based on a potential customer’s past purchases and search history to select which advertisements will be most effective. The company relied on […]

 
Read More..

Hadoop Happenings: Apache Storm Graduates


Grab all of the latest news and commentary about Hadoop all in one place in this week’s Hadoop Happenings. Several product announcements were made this week including Cask, formerly Continuuity, going open source and Apache Storm graduating to top-level status. Cloudera and Pivotal formed new partnerships. 1. Real World Examples: Real-time Data From Internet of […]

 
Read More..

Hadoop Happenings: Product Announcements; State of Hadoop


Grab all of the latest news and commentary about Hadoop all in one place in this week’s Hadoop Happenings. Several distributions made product announcements this week, including the first Hadoop offering for China. SiliconAngle reviewed the state of Hadoop, including who is using it and why. 1. The New Normal: Business Understanding Recode.net- Bob Muglia, […]

 
Read More..

Qubole Selected for Big Data Innovation Award


Ventana Research has announced that Qubole will receive the big data innovation award at the Technology Innovation Awards Ceremony October 21, 2014. The Technology Innovation Awards are among the most prestigious awards offered annually and are meant to recognize technology innovations that improve productivity and outcomes for business and IT. Qubole was recognized alongside other […]

 
Read More..

Hadoop Happenings: Focus on the Internet of Things


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. This week, commentators explored the intersection of big data, Hadoop and the Internet of Things. Cisco and MapR released product announcements and several big data case studies were released. 1. Hadoop Basics Course SapHanaTutorial.com- This Hadoop Basics […]

 
Read More..

Big Data Dilemma: Quantity Vs. Quality


Advances in big data technologies and capabilities have increased adoption in the enterprise dramatically. But as organizations implement big data platforms to collect, manage, and analyze large data sets for competitive advantage, they are faced with a dilemma. Should they direct most of their resources on collecting massive volumes of data? Or should they focus […]

 
Read More..

Securely Sharing Data Across Organizations with Qubole

 

Customers love that Qubole enables collaboration via a shared workbench across multiple analysts in an organization. Increasingly though, we have started finding use cases where organizations want to share data across Qubole accounts. Departments in different geographies want to share selected data sets with each other. Also, organizations want to share data with their partner […]

 
Read More..

Hadoop Happenings: Stinger Initiative; Hadoop Challenges


Grab all of the latest news and commentary on Hadoop in one place in this week’s Hadoop Happenings. The Stinger Initiative was officially completed in April, but the Hadoop community is still learning all that it accomplished. Additional commentary was given on the challenges of using Hadoop. 1. Seven signs your hair is on fire: The […]

 
Read More..

Top 5 Big Data Myths Debunked


The era of big data has arrived. Today, companies both large and small are discovering the benefits of analyzing vast pools of unstructured data for new insights and competitive advantage. That being said, there are a number of lingering misconceptions about big data that companies looking to successfully implement a big data analytics platform need […]

 
Read More..

Hadoop Happenings: Spark, Storm and Kafka


Grab all of the latest news and commentary about Hadoop in one place with this week’s Hadoop Happenings. The Hadoop landscape is rapidly changing as new products from Spark to Storm gain traction. The Hadoop community also continues to educate businesses on the big data technology available. 1. The Future of Apache Storm: Secure, Highly […]

 
Read More..

Caching in Presto

 

Qubole’s Presto-as-a-Service is primarily targeted at Data Analysts who are tasked with translating ad-hoc business questions into SQL queries and getting results. Since the questions are often ad-hoc, there is some trial and error involved. Therefore, arriving at the final results may involve a series of SQL queries. By reducing the response time of these […]

 
Read More..

Qubole Releases Industry’s First Auto-Scaling Presto Clusters


Qubole was the first big data platform to offer a true auto-scaling Hadoop-as-a-Service solution. Now, Qubole is pleased to announce the industry’s first auto-scaling Presto-as-a-Service solution. Why Auto-Scaling Presto-as-a-Service Explorative analytics is one area that can get quite bursty. A single business question can easily require multiple short queries. For example, let’s say a data […]

 
Read More..

Hadoop Happenings: Strata + Hadoop Conference


Grab all of the latest news and commentary about Hadoop in one place in this week’s Hadoop Happenings. This week, coverage of the announcements and keynotes at the 2014 Strata + Hadoop conference was prevalent, as was commentary on the Hadoop data lake and other trends in big data analytics. 1. Swimming in a lake […]

 
Read More..

Hadoop Happenings: Getting Started with Data Analytics


Grab all of the latest news and commentary about Hadoop all in one place. This week’s focus went back to the basics with popular content covering what Hadoop is, what it can do and how to get started. IBM also presented use case videos for big data in the enterprise. 1. What’s Hadoop, can I […]

 
Read More..

Hadoop and the Data Warehouse: A Winning Combination for Your Business


Once the subject of speculation, big data analytics has emerged as a powerful tool businesses can use to manage, mine, and monetize vast stores of unstructured data for competitive advantage. As a result, the rate of adoption of Hadoop big data analytics platforms by companies has increased dramatically. In this rush to leverage big data, […]

 
Read More..

Hadoop Happenings: Curing Parkinson’s; Vendor Predictions


Grab all of the latest Hadoop news and commentary all in one place with this week’s Hadoop Happenings. An article from VentureBeat looked at how Intel plans to use big data for Parkinson’s research. A new research report suggested the future of Hadoop may lie in the cloud, and an analyst predicted MapR would be […]

 
Read More..

Hadoop vs. Traditional Database: Which Better Serves Your Big Data Business Needs?


Today’s ultra-connected world is generating massive volumes of data at ever-accelerating rates. As a result, big data analytics has become a powerful tool for businesses looking to leverage mountains of valuable data for profit and competitive advantage. In the midst of this big data rush, Hadoop, as an on-premise or cloud-based platform has been heavily […]

 
Read More..

Hadoop Happenings: Small Data and Data Governance


Grab the latest news and commentary on Hadoop all in one place. This week, Google announced a new data-warehousing system called Mesa. Cloudera made the argument that Hadoop is moving toward small data, and the importance of data governance was a hot topic. 1. Hadoop for Small Data Cloudera.com- This blog post discusses the growing […]

 
Read More..

Case Study: Big Data’s Impact on B2B Marketing


Insightera offers a learning B2B targeting and personalization platform that relies on a combination of big data, machine learning and predictive analytics. The platform displays content to targeted prospects in real-time in order to continuously boost ROI. The company first turned to Hadoop to manage its big data but found Hadoop to be much too […]

 
Read More..

Hadoop Happenings: More Partnerships; Hadoop Matures


Check out the latest news and insights about Hadoop. This past week Hortonworks announced a partnership with Pivotal. Gigaom released a research report on Hadoop 2.0, and a Forbes column discussed the impact of Hadoop on Supply Chain Management. 1. Hortonworks gains ground as its Hadoop management tool gets adoption at Pivotal Venturebeat.com- Hortonworks and […]

 
Read More..

Case Study: Pinterest’s Journey to Qubole


With 20 terabytes of new data logged each day, managing big data is not optional at Pinterest. In order to provide an optimal user experience with the most relevant and recent content, Pinterest turned to Hadoop to help process the data. Unfortunately, Hadoop in its raw form doesn’t act as a self-serve platform because […]

 
Read More..

Hadoop Happenings: Tez Graduates; HP Joins the Club


Check out the top news and analysis about Hadoop from the past week. This week, HP jumped into the Hadoop investment pool by partnering with Hortonworks, and Apache Tez graduated to top-level status at the Apache Software Foundation. 1. Why Marketers Love Big Data & Hadoop Socialmediatoday.com- Big data can play a big role in […]

 
Read More..

Big Data Tips From the Experts: A Data Science Mindset


Moving beyond storing large volumes of data to collecting business insights from that data requires more than having the right technology in place. A good business structure, a good team and insight into best practices are all critical to big data success. Check out the following tips from experienced big data users to learn some […]

 
Read More..

Hadoop Happenings: Back to the Basics and SQL-on-Hadoop Frenzy


Learn about the latest happenings with Hadoop in this week’s Hadoop roundup. Last week it was back to the basics with emphasis on what Hadoop is and what its business case is. This week there was a big media frenzy surrounding Oracle’s release of SQL-on-Hadoop software. Gartner released a new research report, and Gigaom is hosting […]

 
Read More..

Big Data Tips From the Experts: Evaluate and Adapt


Having the right technology is crucial to the success of a big data project, but having the right business process in place is equally critical. Check out these tips from seasoned big data experts to ensure that your next project is a success. 1. Measure Everything “Measure and record everything, and keep an eye on […]

 
Read More..

The Shift from CMO to CMT

 

A recent article from the Harvard Business Review highlighted the growing role of technology in marketing and how the role of the Chief Marketing Officer is evolving thanks to technology. The CMO is taking on the role of aligning technology with business goals, selecting technology providers and acting as a liaison to IT. This expanding […]

 
Read More..

Hadoop Happenings: The Past and Future of Hadoop


Check out this week’s most popular content about Hadoop. There was a lot of discussion this week about the history of Hadoop and how it will continue to be used as the technology transforms. CBInsights also offered a comparison of Hadoop vs. NoSQL vendor funding. 1. Cloudera: Impala’s it for interactive SQL on Hadoop; everything […]

 
Read More..

Big Data Tips From the Experts: Setup is Key


Are you struggling with a current big data project or looking for advice on how to make your first project successful? Check out the following tips from data scientists, engineers and experienced business users. This is the second blog post in a series of big data tips. To see more tips from big data experts, […]

 
Read More..

Qubole Partner XCentium Offers Tutorial on ODBC Driver


This is a tutorial from XCentium, a partner with Qubole which works to help increase productivity for businesses and their employees. XCentium does this by providing special technology services to businesses looking to connect with their clients. In this tutorial, you’ll find out how to set up the Qubole ODBC Driver through instructions on getting […]

 
Read More..

Hadoop Happenings: Google Abandons MapReduce, Splice Machine Goes Beta


Check out this week’s top content and stories about Hadoop. Google’s announcement that it was abandoning MapReduce for a new analytics system created waves in the Hadoop community with some predicting Hadoop’s demise and others praising continued progress. Splice Machine released a beta version of its real-time relational database on Hadoop, and Dell partnered with […]

 
Read More..

10 Best Practices for Apache Hive


Apache Hive is SQL-like software used with Hadoop that lets users run SQL-like queries in its own language, HiveQL, quickly and efficiently. It also gives users additional query and analytical abilities not available on traditional SQL structures. With Apache Hive, users can use HiveQL or traditional MapReduce systems, depending on […]

 
Read More..

How Public Data Sets Can Benefit Businesses


The government started to release large, public data sets through Data.gov in 2009, and since then forward-thinking companies have utilized such data to supplement their own internal business intelligence reports. Two industries in particular – healthcare and energy – now regularly use public data to enhance their business intelligence efforts, and gain competitive marketplace advantages. […]

 
Read More..

Hadoop Happenings: Company Case Studies, Reliability Issues


Check out this week’s top content on Hadoop. Several companies released testimonials on how they are benefiting from Hadoop. Forbes addressed some of Hadoop’s reliability issues, and Qubole released a Q&A from its founders. 1. 5+ Big Data Companies to Watch Gartner.com- This post offers several big data companies to watch along with the authors’ […]

 
Read More..

New Series: Big Data Tips from the Experts


Getting the most out of a big data project is an art as well as a science. That is why this month Qubole will be running a series of blog posts featuring big data tips from data scientists, industry experts, experienced big data users, and, of course, Qubole’s own experts. The first round of tips […]

 
Read More..

Hadoop Happenings: Hadoop Summit Sound off


Check out the most popular content on Hadoop from the last week. The Hadoop summit from last week (June 2-3) was a big source of discussion with observations from providers and journalists alike. There was also a large focus on the future of Hadoop as it has gone mainstream. 1. The Biggest Hadoop and Big Data […]

 
Read More..

Qubole CEO Highlights Big Data in the Cloud Advantages


Qubole’s co-founder and CEO, Ashish Thusoo, addressed attendees at Data Driven NYC #26 in April. He focused his remarks on the benefits of using big data in the cloud as opposed to an on-premise infrastructure. Thusoo brings a unique perspective to the cloud vs. on-premise debate having worked on both sides of the issue […]

 
Read More..

Hadoop Happenings: Real-time Analytics, Acquisitions and Hadoop Summit


Check out this week’s most popular content on Hadoop. Cloudera made its first major acquisition. Several benchmarks and how-to guides were also released. 1. Cloudera acquires big data encryption specialist Gazzang Gigaom.com- Following competitor Hortonworks’ lead, Cloudera acquired Gazzang, a startup specializing in encryption in Hadoop environments. Read More 2. AT&T Labs, Continuuity will open source […]

 
Read More..

Qubole Founders Open Up About the Transformation of Hadoop


Seven years ago, Joydeep Sen Sarma and Ashish Thusoo were first introduced to big data technology. Now, in 2014 they are the guiding force behind one of big data’s fastest growing and innovative companies — Qubole. By leveraging their expertise in big data technology along with the ever-expanding capabilities of the cloud, Sen Sarma and […]

 
Read More..

June 2014 Product Update

 

At Qubole, we’re continually improving our platform and making the features and functionality that matter most to our users a reality. This month, we’re proud to announce the launch of several vital new features on our platform. Multi-Cluster – We are pleased to announce a new feature in Qubole Data Services (QDS) – support for […]

 
Read More..

Hadoop Happenings: Focus on the Cloud, Summer Conferences


Check out this week’s and last week’s most popular content on Hadoop. Over the past two weeks there’s been a lot of attention on the benefits of Hadoop in the cloud. Invitations to summer conferences have also begun. Week Ending May 23 1. Yahoo Betting on Apache Hive, Tez and Yarn Yahoodevelopers.com – This comprehensive post dives […]

 
Read More..

Forbes: Qubole Data Service Road to Hadoop


On Monday, May 26, 2014, Qubole was featured on Forbes.com. Technology contributor Dan Woods wrote an article titled “Two Roads to Instant Big Data” that highlighted the important advances that Qubole is effecting in the world of big data. Woods highlighted that many of the enormous advances being made today in big data technology are […]

 
Read More..

The Evolution of Big Data


Big data is still an enigma to many people. It’s a relatively new term that was only coined during the latter part of the last decade. While it may still be ambiguous to many people, since its inception it’s become increasingly clear what big data is and why it’s important to so many different companies. […]

 
Read More..

Hadoop Happenings: XA Secure Acquisition, Tutorials and Use Cases


Hadoop Happenings compiles 10 of the most shared pieces of content on Hadoop from the past week. This week there was a lot of noise about Hortonworks’ acquisition of XA Secure. Resource lists and tutorials were also popular among professionals getting to know the technology. 1. Hortonworks Purchase Points to Need to Lock Down Hadoop WSJ.com – […]

 
Read More..

Hadoop Happenings: Elephants, Expert Insights and Emerging Technologies


Check out a list of the top content on Hadoop since the beginning of May. Spark was a major focus, as was the announcement of a partnership between Cloudera and MongoDB. Interviews with insights from Hadoop experts also abounded. This week’s content went back to the basics exploring big data technology, big data use […]

 
Read More..

Hadoop in the Cloud: Qubole shows 2x – 8x speedup in performance over Apache Hadoop


Qubole aims to provide the best platform for big data analysis in the cloud. In previous posts, we have already discussed our Hadoop/Hive optimizations for the cloud, performance analysis of Qubole versus Amazon EMR and our Presto offering. In this post, we will discuss how Qubole compares to Apache Hadoop/Hive in the cloud. Setup […]

 
Read More..

Big Data and the Customer: What C-Level Execs Want to Know


Not long ago, getting a feel for the customer was more a matter of art than science. Today, with the proliferation of technologies that inform, connect and empower people, companies have turned to Big Data analytics to get to know their customers better. Through data-driven marketing, vast pools of rich multi-structured data can yield new […]

 
Read More..

Hadoop Happenings: How-to Guides and Provider Overviews


This week, education was the focus from a look at Hadoop hardware leaders to in-depth data analytics guides. 1. Why Hortonworks’ Hadoop Pitch May Be Perfect CMSWire.com – This post evaluated Hortonworks’ choice to stick with the open source version of Hadoop without adding on proprietary software. Read More 2. Automated Install of HDP 2.1 […]

 
Read More..

Announcing General Availability of Presto-as-a-Service


Presto Ready! We announced our Presto-as-a-Service Alpha Program on Amazon Web Services back in January. Now, we’re offering general availability after our alpha tests gave us the green light. In case you’re not already familiar with Presto, a technology created by Facebook, it is an open source distributed SQL query engine for running fast interactive […]

 
Read More..

Job Scheduling in Hadoop – A 7 Year Perspective


In a recent presentation at Flipkart’s 2014 SlashN conference, I summarized seven years of progress in Hadoop and Big Data. In its beginning stages, Hadoop exhibited several weaknesses in its job scheduling. As a result, users who shared a Hadoop cluster would experience a slow cluster due to a bad job, or one user might take […]

 
Read More..

Hadoop Happenings: New Tools, Current State of Hadoop, Top Startups


The weekly Hadoop Happenings ending April 18 highlighted new tools announced by Google and Microsoft. The current state of Hadoop was a favorite topic, and a new roundup of top Hadoop startups was released by CIO. 1. Microsoft Announces New Tools Bringing the Cloud to The Internet of Things TechCrunch.com – Microsoft’s Azure Intelligent Systems Services is […]

 
Read More..

Weekly Roundup: Hadoop Happenings Ending 4/11


This week Teradata’s announcement of new features sparked a round of articles debating the impact Hadoop will have on legacy databases. There was also commentary on data quality and the release of Hadoop 2.4.0. 1. The Big Data Challenge to Legacy Data Management Companies This New York Times post discusses the impact Hadoop will have […]

 
Read More..

Qubole Shows Growing Demand for Hadoop in the Cloud

  • By Gil Allouche
  • April 11, 2014


Way back in August 2013, Gigaom wrote an article called “Is Qubole proving a demand for Hadoop in the cloud?” Based on Qubole’s July 2013 numbers showing that we had used more than 100,000 nodes to process more than a petabyte of data, Gigaom concluded that this “seems like a fair amount of activity […]

 
Read More..

Fastest Auto-Scaling Hadoop Service Now on Google Cloud

  • By Gil Allouche
  • April 7, 2014


Hadoop as a Service is reaching the point of critical mass adoption. Thanks to the recent price cuts by Amazon Web Services and Google Compute Engine, customers are starting to see real alternatives in terms of storing and analyzing Big Data. Qubole is proud to be part of this progress. Starting today, Big Data companies […]

 
Read More..

Weekly Roundup: Hadoop Happenings Ending 4/4


This week the battle between providers continued as Cloudera got new funding and Amazon Web Services dropped its pricing. Apache Tajo reached top level status, and case studies on the applications of Hadoop continued to roll in. 1. Cloudera takes $740M bag of money from Intel-much more than we expected Intel, which recently abandoned its […]

 
Read More..

Real-Time Data Query: The Next Competitive Advantage


Back when Big Data was just getting off the ground, early adopters of open source Hadoop achieved competitive advantage through analysis of vast stores of multi-structured data to gain actionable insights. Over the last few years, with the extensive adoption of Big Data analytics platforms by business, the playing field has begun to level off. […]

 
Read More..

Weekly Roundup: Hadoop Happenings ending 3/28


1. Intel Jettisons its Hadoop Distro and puts Millions behind Cloudera It was big news when Intel announced it was abandoning its Hadoop distribution and opting to support Cloudera’s distribution instead. Read More 2. Introduction to Apache Falcon: Data Governance for Hadoop Apache Falcon can be used to define and monitor data pipelines and trace […]

 
Read More..

The Challenges and Opportunities for E-commerce in a Big Data World


The highly competitive world of e-commerce is driven by price and advertising. Companies must find ways to successfully reach customers, who generally are not loyal, through successful ad campaigns and effective pricing techniques. A vital tool for success in e-commerce is Big Data. To succeed in e-commerce, companies rely on information that tells them how […]

 
Read More..

Weekly Roundup: Hadoop Happenings ending 3/21


This week, resources for Hadoop beginners were popular, from free ebooks to overviews of Hadoop companies to a guide to a successful Hadoop job interview. Allied Market Research projects the Hadoop market to reach $50.2 billion by 2020. 1. 15 Free eBooks on Hadoop! For those looking for some good resource material for learning about […]

 
Read More..

The C-Suite of the Future: How Big Data is Changing Things Up


The recent departure of Target’s Chief Information Officer (CIO) following Target Corp.’s massive data breach serves as a wake-up call to all upper-level execs that big data is starting to shake things up in the C-Suite. With the ability to capture, store and analyze huge amounts of customer data for competitive advantage comes the responsibility […]

 
Read More..

Weekly Roundup: Hadoop Happenings ending 3/14


1. 5 Things that Will Remake Big Data in the Next 5 Years Derrick Harris identifies 5 trends in big data and discusses how big data and Hadoop will evolve over the next five years. Read More 2. Big Data is Not Hadoop-Part 1 This first post in a series on big data discusses the […]

 
Read More..

The Impact of Big Data on the Digital Advertising Industry


The digital advertising industry is evolving like never before. The ability to capture and analyze massive amounts of structured and unstructured data is helping digital advertisers to discover new relationships, spot emerging trends and patterns, and gain actionable insights that lead to competitive advantage. As a result, traditional advertising is shifting rapidly into the realm […]

 
Read More..

Weekly Roundup: Hadoop Happenings ending March 7


1. 9 Myths about Hadoop Edureka’s blog post addresses some of the myths floating around about Hadoop and clarifies what Hadoop is and what it can be used for. Read More 2. The Defense Department’s Data Strategy: Huge, massive and distributed This article covers highlights from an interview with Ely Kahn, who has worked in […]

 
Read More..

Announcing Qubole’s Hadoop Happenings Weekly Roundup

  • By Moran Altarac
  • March 3, 2014


With so much great content and news about Hadoop coming out every week, we’ve decided to add a new feature to our blog, sharing the most talked-about content around the web from the past week. To kick things off, here is the most popular content from the past six months. 1. Facebook open sources […]

 
Read More..

New Feature: Top Hadoop Influencers to Follow on Twitter


Starting this week, Qubole will be highlighting some of the top Hadoop influencers on Twitter. The list, which will be updated weekly, is meant to be a resource for those who are researching Hadoop and looking for thought leaders to follow. The rankings on the list are based on data from Followerwonk. Followerwonk assigns social authority […]

 
Read More..

Accenture Technology Labs Hadoop Deployment Comparison Study


Background: The Accenture Technology Labs Hadoop Deployment Comparison study recently stated something that we at Qubole have known for a long time: an investment in Hadoop-as-a-Service has many advantages over implementing a bare-metal Hadoop cluster. The study used Accenture’s Data Platform Benchmark to assess the Total Cost of Ownership for both solutions. This method of […]

 
Read More..

Komli Media Improves Utilization with Premium Big Data Platform Qubole


Komli Media, Asia Pacific’s leading media technology company, depends on reaching targeted audiences efficiently and at scale in order to please their customers. Over time, the company has collected more than 100 TB of data with information such as consumer behavior, clicks, impressions and ads created. Users access this data in order to optimize campaigns […]

 
Read More..

Social Media Marketing Best Practices with Big Data


Social media and big data. You’ve heard the terms, but do you know how they relate and why it’s important? In my last article, I discussed the potential and eventual convergence of big data and social media. These two phenomena go hand-in-hand; social media is today’s vehicle for user opinions and conversations to be heard […]

 
Read More..

4 Big Reasons Big Data is Booming in the Gaming Industry


The video gaming industry has come a long way since those early days of coin-ops and cumbersome consoles. Bringing in $20 billion a year in the U.S. alone, the gaming industry is exploding with no signs of slowing down. In fact, with the proliferation of online gaming, coupled with advances in Big Data analytics, the gaming […]

 
Read More..

Marketing, Technology and Big Data: Bridging the Gap Between the CMO and the CIO


The rise of Big Data and digital marketing is dramatically changing the roles, responsibilities and relationships of Chief Marketing Officers (CMOs) and Chief Information Officers (CIOs). In fact, marketing and technology are now so inextricably linked that Gartner predicts that by 2017, CMOs will actually outspend CIOs on technology purchases. As CMOs strive to meet […]

 
Read More..

Cloud vs. On-Premise Hadoop Providers: Top 5 Questions From A Business Perspective


As topics of conversation go, the terms “Big Data” and “Hadoop functionality” seem more appropriate for IT and CIOs than for CEOs and CFOs. And yet, choosing the right Hadoop provider for your business is every bit as much a business decision as it is a tech decision. After all, the ultimate goal of Big […]

 
Read More..

Save Time Executing Hive Queries Using Command Templates


A common characteristic of many analytics queries is that they are mostly invariant in form and function. Over multiple invocations of the query or command, only a couple of inputs vary, while the major part of the query remains the same. Command templates […]

 
Read More..

Running Big Data Infrastructure: Five Areas That Need Your Attention


Many businesses are starting to understand the benefits of Big Data. And those who have fully grasped the potential of gaining valuable business insights from it are scrambling to leverage this potential before competitors do. Unfortunately, some of them are becoming overeager and have consequently overlooked the most important component when embarking on a massive […]

 
Read More..

The Status Dashboard


The entire team here at Qubole is dedicated not only to offering our customers the most reliable data-as-a-service solution, but also to providing continued support for long-term success. We are pleased to bring Qubole Data Service users a new feature: the Status Dashboard. The Status Dashboard includes a number of new features […]

 
Read More..

How Google Compute Engine and Qubole are a match made in heaven

  • By Sivaramakrishnan Narayanan
  • January 3, 2014


Last week, big data as a service took another step towards becoming the primary method of data analysis. In recent times, there has been a great amount of interest in cloud-based services and a shift from on-site technologies to cloud-based platforms. In keeping with this trend, the partnership between Qubole and Google Compute Engine […]

 
Read More..

Big Data 101: What Big Data Means For Social


It seems that every couple of months, the tech world is hit by another revolutionary product that will change our lives and make everything shiny and wonderful. But along with the hype comes a steady stream of jargon so technical that it can eliminate any […]

 
Read More..

Big Data and Social Media – The Best of Both Worlds


There are two phrases that you just can’t escape in the tech world today – “big data” and “social media.” It seems that everywhere we turn someone is preaching on the importance of big data or social media in marketing, but did you know […]

 
Read More..

5 Tips for efficient Hive queries

  • By [email protected]
  • October 18, 2013


Hive on Hadoop makes data processing so straightforward and scalable that we can easily forget to optimize our Hive queries. Well-designed tables and queries can greatly improve your query speed and reduce processing cost. This article includes five tips that are as valuable for ad-hoc queries, where they save time, as for regular ETL […]

 
Read More..

DataWeek: Hadoop Innovation Award for Qubole

  • By Gil Allouche
  • October 10, 2013


DataWeek 2013 was a huge success with industry leaders, including Qubole, attending the week-long event in San Francisco. The conference continues to grow significantly, catching up to the Hadoop Summit and other recognized data conferences. This is what we left with: At the conference, Qubole accepted the 1st place award of excellence in Innovation in […]

 
Read More..

Deploy Demotrends using the Scheduler


Introduction: In previous blog posts, we explained how to create a data pipeline to process the raw data, generate a list of trending topics, and export it to the web app. In this post, we explain how to deploy the data pipeline using the Qubole scheduler. The data pipeline shown in the […]

 
Read More..

Top 10 Industry Examples of HDFS


Not everyone comes to us with a clear strategy for harnessing the potential of Hadoop. There are even those who, for instance, are still unsure whether the benefits of using an HDFS cluster apply to their organization at all. Actually, practically any organization that wants to draw insightful or […]

 
Read More..

Big data v small data – ERB


Big Data vs. Small Data… Why not both? Big data, small data… do we really have to choose sides? In a world where data is essential to business, and big data is all the rage, one must ask if all data has to be “big”? Can businesses benefit from “small” data? The answer is sure, […]

 
Read More..