What are the advantages disadvantages of Hadoop Dockerization

0 votes
I am working on a Hadoop cluster which is using Hue, Flume & Cassandra. I have heard about Docker & have an idea about how it works. Before actually deploying the cluster in a real time environment, I want to consider the advantages & disadvantages of using docker container for Hadoop?

I guess the portability is one of the major benefit of using docker, but I am interested in knowing and comparing more. Can anyone help me out on this?
Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,589 views

1 answer to this question.

0 votes
As you are already having a Hadoop cluster, you can understand it is difficult to reproduce this environment. Then next important thing is, docker helps you to isolate the environment which will not conflict any dependencies with other applications present in your host machine.

Talking from the perspective of Hadoop, to easily setup a multi node cluster. You can setup one docker Hadoop container. Then replicate the container and the change the setting. So, it will be very easy to setup a multi node cluster.

But again I would like to add, if you have no pain in setting a multi node cluster or you have no issue with the dependencies, So, you do not have to use the docker container because it is a hot topic. It only depends on your need and your ease.
answered Apr 18, 2018 by coldcode
• 2,090 points

Related Questions In Big Data Hadoop

0 votes
1 answer
0 votes
1 answer

What are the different ways of Installing Hadoop into our local machine?

Hadoop runs on Unix and on Windows. ...READ MORE

answered Aug 4, 2018 in Big Data Hadoop by Neha
• 6,300 points
5,261 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
1,882 views
0 votes
13 answers

What is the difference between Hadoop/HDFS & HBase?

HDFS is a distributed file system whereas ...READ MORE

answered Apr 26, 2019 in Big Data Hadoop by Arihar
• 160 points
33,784 views
0 votes
1 answer

What is the use of sequence file in Hadoop?

Sequence files are binary files containing serialized ...READ MORE

answered Apr 6, 2018 in Big Data Hadoop by Ashish
• 2,650 points
9,587 views
0 votes
1 answer

What are the hardware requirements for installing Hadoop on my Laptop?

You can either install Apache Hadoop on ...READ MORE

answered Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,070 views
0 votes
12 answers

What is Zookeeper? What is the purpose of Zookeeper in Hadoop Ecosystem?

Hey, Apache Zookeeper says that it is a ...READ MORE

answered Apr 29, 2019 in Big Data Hadoop by Gitika
• 65,770 points
29,906 views
0 votes
1 answer

Best way of starting & stopping the Hadoop daemons with command line

First way is to use start-all.sh & ...READ MORE

answered Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
10,069 views
0 votes
1 answer

What are some of the famous visualization tools which can be integrated with Hadoop & Hive?

I have personally used two visualization tools ...READ MORE

answered May 1, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,991 views
0 votes
1 answer

What are the different ways to load data from Hadoop to Azure Data Lake?

I would recommend you to go through ...READ MORE

answered Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,056 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP