Data Engineering
-
The Key to Building Data Pipelines for Machine Learning: Support for Multiple Engines
Which engines are most effective for each stage of the data engineering cycle
-
Streamlining Operations of Machine Learning Models
Guest authors: Jerry Xu, Co-founder and CEO Datatron; Lekhni Randive, Product Manager, Datatron Qubole author: Jorge Villamariona, Sr. Product Marketing Manager, Qubole In today’s world,… The post...
-
Apache Sqoop 1.4.7 – 9 reasons why you need it
The sixth release of Apache Sqoop i.e. 1.4.7 is out! This is one of the most significant updates to the Sqoop platform. We give you… The post Apache Sqoop 1.4.7 – 9 reasons why you need it...
-
Predicting, Detecting, and Eliminating Online Threats: Malwarebytes
The cybersecurity company yields greater data-processing at lower costs, and realizes more powerful insights with Qubole.
-
Neustar Boosts Automation, Efficiency, and Savings with Qubole
The flexibility, APIs, and financial governance offered by Qubole enables Neustar to automate its solutions.
-
Workload-Aware: Auto-Scaling A new paradigm for Big Data Workloads
Learn more about Workload-Aware-Auto-Scaling-- an alternative architectural approach to Auto-Scaling that is better suited for the Cloud and applications like Hadoop, Spark and Presto.
-
Mastering Data Governance on Cloud Data Lakes with Multiple Engines
Qubole data privacy and integrity experts cover how to maintain data integrity and privacy of data residing in data lakes using various open-source engines.
-
AgilOne: Machine Learning at Enterprise Scale
AgilOne runs a variety of workloads for querying data, running ML models, orchestrating ML workflows, and more on Qubole
-
Data Engineering Pitfalls and How to Avoid Them
Simple, practical solutions for common challenges faced by data engineering teams
-
How To Build Scalable Data Pipelines for Machine Learning
Common challenges faced by data engineers when building pipelines for ML and how to address them
-
What’s New with Airflow on Qubole? DAG Explorer and More
Use Apache Airflow to author workflows as directed acyclic graphs (DAGs) of tasks
-
ETL Processes with AWS Data Pipeline And Qubole
How to facilitate event-based processing of long running ETL processes with AWS Data Pipeline and Qubole
-
Airflow on Anaconda: A Match Made in Heaven, Perfected by Qubole
How Airflow on Anaconda makes running machine learning pipelines and data science tasks seamless
-
Big Data Activation Report
The data on big data -- what engines are used most, for what, and which are the rising stars.
-
G2 Crowd Grid Report for Big Data Processing and Distribution | Fall 2019
Which vendors rank highest in customer satisfaction for big data processing
-
Leveraging Streaming and Batch Data Sets for ML Applications
Learn how to use Qubole to acquire and transform data sets for data science and analytics, make data sets available to different users, and fully leverage your data lake.
-
Nauto Improves its Data Scientist Productivity, Accelerates Product Development
Nauto Improves its Data Scientist Prodcutivity, Accelerates Product Development
-
Ibotta Builds a Self-Service Data Lake to Enable Business Growth
Ibotta cut costs thanks to Qubole’s autoscaling and downscaling capabilities, and the ability to isolate workloads to separate clusters.
-
Poshmark Experiences Hyper Growth and Uses Qubole to Create Value for the Poshmark Community
Qubole saved Poshmark up to one year to start transforming big data into creating value for its community
-
Loading More...