Historically, Hadoop has been a platform for big data that you either deploy on-premises with your own hardware or in the cloud and managed by a hosting vendor. Deploying on-premises affords you specific benefits, like control and flexibility over your deployment. But the cloud provides other benefits like elastic scale, fast time to value, and automatic redundancy, amongst others.
With the recent announcement of the Hortonworks Data Platform 2.2 being made generally available, Microsoft and Hortonworks are partnered to deliver Hadoop on Hybrid infrastructure in both on-premises and cloud. This will give customers the best of both worlds with control & flexibility of on-premises deployments and the elasticity & redundancy of the cloud.
What are some of the top scenarios or use cases for Hybrid Hadoop? And what are the benefits of taking advantage of a hybrid model?
- Elasticity: Easily scale out during peak demand times by quickly spinning up more Hadoop nodes (with HDInsight)
- Reliability: Use the cloud as an automated disaster recovery solution that automatically geo-replicates your data. Or
- Breadth of Analytics Offerings: If you’re already working with on-prem Hortonworks offerings, you now have access to a suite of turn-key data analytics and management services in Azure, like HDInsight, Machine Learning, Data Factory, and Stream Analytics.
To get started, customers need Hortonworks Data Platform 2.2 with Apache Falcon configured to move data from on-premises into Azure. Detailed instructions can be found here.
We are excited to be working with Hortonworks to give Hadoop users Hadoop/Big Data on a hybrid cloud. For more resources:
- Step-by-step instructions on configuring Hortonworks HDP 2.2 to move data into Azure
- Hortonworks Data Platform 2.2
- Azure HDInsight Service Page
- Free 30-day Trial for Azure (including HDInsight)
- Learning Map – HDInsight documentation