In this tutorial, you will learn how to clone your Elastic MapReduce (EMR) clusters into an Elastigroup. AWS EMR provides a managed Big Data framework that lets you easily add or remove cluster capacity to match your application's workloads. EMR supports Hadoop, Apache Spark, and other popular distributed frameworks. Running your EMR clusters on Elastigroup gives you the significant discounts that Spot instances offer while maintaining 100% availability.
This tutorial focuses on cloning an existing EMR cluster into Elastigroup. Elastigroup also enables you to wrap your existing cluster with Spot instance Task nodes. Head to our tutorial on Wrapping EMR Clusters to learn more.
Prerequisites:
Log in to the Elastigroup Console (console.spotinst.com) and navigate to the Creation Wizard by clicking the Create button in the Elastigroups tab.
In the Creation Wizard, select EMR.
Set the name and region of the Elastigroup. Click Next.
{Warning: Decreasing the root volume size is not recommended and might affect the proper launch of the instance group or the cluster.}
{Caution: This adds any steps configured in the original cluster to the clone}
The Creation Wizard prepares a JSON template to launch an Elastigroup with the EMR configuration. All that’s left to do is click Create!
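Before clicking Create, it can help to check the template against the warning and caution above. The following is a minimal sketch of such a review, not the wizard's actual output: the template shape and the keys `name`, `region`, `originClusterId`, `rootVolumeSizeGb`, and `includeSteps` are hypothetical names invented for illustration, and the real JSON generated by the Creation Wizard may use different keys and nesting.

```python
import json


def build_clone_template(name, region, origin_cluster_id, root_volume_gb, include_steps):
    """Build an illustrative clone template.

    All keys here are hypothetical; they only mirror the settings
    mentioned in this tutorial (name, region, root volume, steps).
    """
    return {
        "name": name,
        "region": region,
        "originClusterId": origin_cluster_id,
        "rootVolumeSizeGb": root_volume_gb,
        "includeSteps": include_steps,
    }


def review_template(template, original_root_volume_gb):
    """Return a list of warnings to read before clicking Create."""
    warnings = []
    # Decreasing the root volume size is not recommended.
    if template["rootVolumeSizeGb"] < original_root_volume_gb:
        warnings.append("root volume is smaller than in the original cluster")
    # Steps configured in the original cluster are added to the clone.
    if template["includeSteps"]:
        warnings.append("steps from the original cluster will be added to the clone")
    return warnings


# Example values are placeholders, not real cluster identifiers.
template = build_clone_template("emr-clone", "us-east-1", "j-EXAMPLE", 8, True)
print(json.dumps(template, indent=2))
for warning in review_template(template, original_root_volume_gb=10):
    print("WARNING:", warning)
```

Keeping the review separate from the template builder means the same checks could be pointed at whatever JSON the wizard actually produces, once the hypothetical keys are replaced with the real ones.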
You’ve now created an EMR cluster on Elastigroup and are in the Elastigroup Manager view, where you can review, manage, and monitor your running Elastigroup.
You have now learned how to create an EMR cluster on Spot instances with Spot by NetApp.