The MapR Sahara plugin allows to provision MapR clusters on OpenStack in an easy way and do it, quickly, conveniently and simply.
The MapR Plugin performs the following four primary functions during cluster creation:
The Sahara MapR plugin can make use of either minimal (operating system only) images or pre-populated MapR images. The base requirement for both is that the image is cloud-init enabled and contains a supported operating system (see http://doc.mapr.com/display/MapR/OS+Support+Matrix).
The advantage of a pre-populated image is that provisioning time is reduced, as packages do not need to be downloaded which make up the majority of the time spent in the provisioning cycle. In addition, provisioning large clusters will put a burden on the network as packages for all nodes need to be downloaded from the package repository.
For more information about MapR images, refer to https://github.com/openstack/sahara-image-elements.
There are eight VM images provided for use with the MapR Plugin, that can also be built using the tools available in sahara-image-elements:
MapR plugin needs an image to be tagged in Sahara Image Registry with two tags: ‘mapr’ and ‘<MapR version>’ (e.g. ‘4.0.1.mrv2’).
Note that Spark should be run on plain or 4.0.1 images.
The default username specified for these images is different for each distribution:
OS | username |
---|---|
Ubuntu 14.04 | ubuntu |
CentOS 6.5 | cloud-user |
The MapR plugin currently supports Hadoop 0.20.2 (3.1.1, 4.0.1.mrv1, 4.0.2.mrv1), Hadoop 2.4.1 (4.0.2.mrv2) and Hadoop 2.5.1 (4.0.2.mrv2).
When the user creates or scales a Hadoop cluster using a mapr plugin, the cluster topology requested by the user is verified for consistency.
Every MapR cluster must contain:
Every Hadoop cluster must contain exactly 1 Oozie process
Every MapReduce v1 cluster must contain:
Every MapReduce v2 cluster must contain:
Every Spark cluster must contain:
HBase service is considered valid if:
Hive service is considered valid if:
Hue service is considered valid if:
HttpFS service is considered valid if cluster has exactly 1 HttpFS process
Sqoop service is considered valid if cluster has exactly 1 Sqoop2-Server process
For more information, please contact MapR.