Deploying Hadoop Clusters using Ansible & Vagrant

This project uses vagrant and ansible to launch hadoop on Ubuntu, either as a single node (with or without HDFS) or as a cluster. Avro is also installed.

To use, install vagrant & ansible, then just run vagrant up.

By default a single node will be built with HDFS. To disable HDFS, set install_hdfs to False in the file group_vars/all.

To build a cluster, set build_cluster to True in group_vars/all, and also change the the variable dfs_replication to '3' in the same file. Then rename Vagrantfile.cluster to Vagrantfile and run vagrant up.

Once built, the web UIs are available at:

JobTracker web interface: http://192.168.50.4:50030/
NameNode web interface: http://192.168.50.4:50070/ (only when HDFS is enabled)

Note: Hadoop will use a lot of disk space for the slaves when building a cluster.

For more information, see https://github.com/ansible/ansible-examples/tree/master/hadoop which contains the original playbook, and this post which contained the Vagrantfile http://leonidmirsky.com/ansible/hadoop/devops/2013/11/19/creating-hadoop-test-environment-with-ansible-and-vagrant.html.

Note: The original examples provided a HA build. Getting that to work on Ubuntu is left as an exercise for the reader ;-)

Both needed modifying to work.

This comes with absolutely no warranty, etc. Make sure to comply with relevant licences.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
group_vars		group_vars
playbooks		playbooks
roles		roles
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
Vagrantfile		Vagrantfile
Vagrantfile.cluster		Vagrantfile.cluster
hosts-cluster.yml		hosts-cluster.yml
hosts-singlenode.yml		hosts-singlenode.yml
site.yml		site.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deploying Hadoop Clusters using Ansible & Vagrant

About

Releases

Packages

License

boosh/ansible-vagrant-hadoop-cluster

Folders and files

Latest commit

History

Repository files navigation

Deploying Hadoop Clusters using Ansible & Vagrant

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages